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METHODS OF DIAGNOSIS OF CANCER, COMPOSITIONS AND METHODS OF 
SCREENING FOR MODULATORS OF CANCER 

5 CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims priority to USSN 60/340,376, filed December 14, 2001 ; 
Attorney Docket Number 018501-006400US, filed February 8, 2002; USSN 60/347,21 1, filed 
January 8, 2002; USSN 60/334,393, filed November 29, 2001; USSN 60/335,394, filed 
November 15, 2001; USSN 60/347,349, filed January 10, 2002; USSN 60/368,809, filed March 
10 29, 2002; USSN 60/409,450, filed September 9, 2002; USSN 60/359,077, filed February 20, 
2002; USSN 60/386,614, filed June 5, 2002; USSN 60/356,714, filed February 13, 2002; USSN 
60/397,775 filed July 22, 2002; USSN 60/332,464, filed November 21, 2001; USSN 
60/397,845, filed July 22, 2002; USSN 60/370,110, filed April 4, 2002; USSN 60/396,839, filed 
July 16, 2002; USSN 60/350,666, filed November 13, 2001; and USSN 60/372,246, filed April 
15 12, 2002; each of which is incorporated herein by reference for all pmposes. The application 
also incorporates by reference PCT/US02/29560; PCT/US02/02242; and PCT/US02/17594. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein expression profiles 
and nucleic acids, products, and antibodies thereto that are involved in cancer and other 
20 diseases; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis, and therapy of these conditions. The invention fiuther relates to methods for 
identifying and using agents and/or targets that modulate these conditions. 

BACKGROUND OF THE INVENTION 
Cancer is a major cause of morbidity in the United States. For example, in 1996, the 
25 American Cancer Society estimated that 1,359,1 50 people were diagnosed with a malignant 
neoplasm and 554,740 died &om one of these diseases. Cancer is responsible for 23.9 percmt 
of all American deaths and is exceeded only by heart disease as a cause of mortality (33 
percent). Unfortunately, cancer mortality is increasing and sometime early in this century, 
cancer is expected to become the leading cause of mortality in the United States as it already is 
30 in Japan, 

Cancers share the charactaristic of disordered control over normal cell division, growth, 
and differentiation. Their initial clinical manifestations are extremely heterogeneous, with over 

1 
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70 types of cancer arising in virtually every organ and tissue of the body. Moreover, some of 
those similarly classified cancer types may represent multiple different molecular diseases. 
Unfortunately, some cancers may be virtually asymptomatic until late in the disease course, 
when treatment is more difGcult, and prognosis grim. 

Treatment for cancer typically includes surgery, chemotherapy, and/or radiation therapy. 
Although nearly 50 percent of cancer patients can be effectively treated using these methods, 
the current therapies all induce serious side effects which diminish quality of life. The 
identification of novel therapeutic targets and diagnostic markers will be important for 
improving the diagnosis, prognosis, and treatment of cancer patients. 

Recent advances in molecular medicine have increased the interest in tumor-specific 
antigens that could serve as targets for various immunotherapeutic or small molecule strategies. 
Antigens suitable for immunotherapeutic strategies should be highly expressed in cancer tissues, 
preferably accessible fi-om the vasculature and at the cell surface, and ideally not expressed in 
normal aduU tissues. Expression in tissues that are dispensable for life, however, may be 
tolerated, e.g., reproductive organs, especially those absent in one sex. Examples of antigens 
that are currently available for the detection and treatment of certain cancers include Her2/neu 
and the B-cell antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer. See 
Ross and Fletcher (1998) Stem Cells 16:413-428. Similarly, anti-CD20 monoclonal antibodies 
(Rituxin®/rituximab) are used to efBsctively treat non-Hodgkin's lymphoma. See Maloney, et 
al. (1997) Blood 90:2188-2195; Leget and Czuczman (1998) Curr. Ooin. Oncol. 10:548-551. 

The elucidation of a role for novel proteins and compounds in disease states for 
identification of thersqjeutic targets and diagnostic markers is valuable for improving tiie current 
treatment of cancer patients. Accordingly, provided herein are molecular targets for therapeutic 
intervention in various defined cancers. Additionally, provided herein are methods that can be 
used in diagnosis and prognosis of cancer. Further provided are methods that can be used to 
screen candidate bioactive agents for the ability to modulate cancer. 

SUMMARY OF THE INVENTION 

The present invention provides methods for determining the presence or absence of a 
pathological cell in a patient, the method comprising detecting a nucleic acid comprising a 
sequence at least 80% identical to a sequence as described in Tables 2A-80 iri a biological 
sample firom the patient, thereby determining the presence or absence of the pathological cell. 
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In certain embodiments of the method, the pathology is described in Table 1 , including a 
cancer, the biological sample comprises isolated nucleic acids; the nucleic acids are mRNA; the 
biological sample is tissue from an organ which is affected by the pathology of Table 1, 
including a cancer; a further step is used of amplifying nucleic acids before the step of detecting 
S the nucleic acid; the detecting is of a protein encoded by the nucleic acid; the nucleic acid 
comprises a sequence as described in Tables 2A-80; the detecting step is carried out by using a 
labeled nucleic acid probe, utilizing a biochip comprising at sequence at least 80% identical to a 
sequence as described in Tables 2A-80, or detecting a polypeptide encoded by the nucleic acid; 
or the patient is undergoing a therapeutic regimen to treat the pathology of Table 1, or is 
10 suspected of having the pathology or cancer. 

Compositions are also provided, e.g., an isolated nucleic acid molecule comprising a 
sequence as described in Tables 2A-80, including, e.g., those which are labeled; an expression 
vector comprising such nucleic acid; a host cell comprising such expression vector; an isolated 
polypeptide which is encoded by such a nucleic acid molecule comprising a sequence as 
15 described in Tables 2A-80; or an antibody that specifically binds the polypeptide. In particular 
embodiments, the antibody is: conjugated to an effector component, is conjugated to a 
detectable label (includmg, e.g., a fluorescent label, a radioisotope, or a cytotoxic chemical), an 
antibody fragment, or is a humanized antibody. 

Additional methods are provided, including methods for specifically targeting a 
20 compound to a pathological cell in a patient, the method comprising administering to the patient 
an antibody, as described, thereby providing the targettmg. Others include, e.g., methods for 
determining the presence or absence of a pathological cell in a patioit, the methods comprising 
contacting a biological sample with an antibody, as described hi more particular methods, the 
antibody is: conjugated to an effector component, or to a fiuorescmt label; or the biological 
25 sample is a blood, serum, urine, or stool sample. 

Further mefliods include those for identifying a compoimd that modulates a pathology- 
associated polypeptide, the method comprising steps of: contacting the compound with a 
pathology-associated polypeptide, the polypeptide encoded by a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as described in Tables 2A-80; and 
30 determining the functional effect of the compound upon the polypeptide. Another drug 

screening assay method comprises steps of: administering a test compound to a mammal having 
a pathology of Table 1 or a cell isolated therefirom; and comparing the level of gene expression 
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of a polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as described in Tables 2A-80 in a treated cell or manunal Avith the level of gens expression of 
the polynucleotide in a control cell or mammal, wherein a test compound that modulates the 
level of expression of the polynucleotide is a candidate for the treatment of the pathology. 

5 DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the objects outiined above, the present invention provides novel 
methods for diagnosis and prognosis evaluation for various disorders, e.g., angiogenesis, 
jBbrosis, and various defined forms of cancer, including metastatic cancer, as well as methods 
for screening for compositions which modulate such conditions. Also provided are methods for 

10 treating such disorders or cancers. See, e.g., American Society of Clinical Oncology (ed. 2001) 
ASCO Curriculum: Symptom Management Kendall/Hunt. ISBN: 0787277851; Bonadonna, et 
al. (2001) Textbook of Breast Cancer (2d ed.) Dunitz Martin, ISBN: 1853178241; Devita and 
Hellman (eds. 2001) Cancer Principles and Practice of Oncology (2 vols.), Lippincott Williams, 
ISBN: 0781723876; Howell, et al. (2001) Breast Cancer Isis Medical Media, ISBN: 

15 1901865584; Kaye and Laws (2001) Brain Tumours: An Encyclopedic Approach (2d ed.) 

Churchill Livingstone, ISBN: 0443064261; Mihm, et al. (2001) The Melanocvtic Proliferation: 
A Comprehensive Textbook of Pigmented Lesions Wiley-Liss, ISBN: 0471252719; 
Montgomery and Aaron (2001) Clinical Pattioloev of Soft-Tissue Tumors Marcel Dekker, 
ISBN: 0824702905; Petrovich, et al. (eds. 2001) Combined ModaHtv of Central Nervous 

20 System Tumors (Medical Radiology) Springer Verlag, ISBN: 3540660534; Rosen (2001) 
Rosen's Breast Pathology Lippincott Williams and Wilkms, ISBN: 0781723795; Shah, et al, 
(2001) Oral Cancer Isis Medical Media, ISBN: 189906687X; Weiss and Goldblum (2001) 
Enzineer and Weiss's Soft Tissue Tumors (4th ed.) Mosby, ISBN: 0323012000; Abeloff, et al 
(eds. 2000) ainical Oncology (2d ed.) C3iurchill Livingstone, ISBN: 044307545X; American 

25 Society of Clinical Oncology (ed. 2000) Cancer Genetics and Cancer Predisposition Testing 
Kendall/Hunt, ISBN: 0787276154; Fletcher (2000) Diagnostic Histopathology of Tumors (2 
vols. 2d ed.) Churchill Livingstone, ISBN: 0443079927; Vogelzang (ed, 2000) Comprehensive 
Textbook of Genitourinary Oncologv (2d ed.) Lippincott Williams and Wilkins, ISBN: 
0683306456; HoUand, et al. (eds. 2000) Holland>>Frei Cancer Medicine (Book with CD-ROM 

30 5th ed.) Decker, ISBN: 1550091131; Turrisi, et al. (2000) Lung Cancer Isis Medical Media, 
ISBN: 1901865428; Bartolozzi and Lencioni (eds. 1999) Liver Malignancies: Diagnostic and 
Interventional Radiology (Medical Radiology) Springer Verlag, ISBN: 3540647562; Gasparini 
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(ed. 1999) Prognostic Variables in Node-Negative and Node-Positive Breast Cancer Kluwer, 
ISBN: 0792384474; Hansen (ed. 1999) The LASLC Textbook of Lung Cancer: International 
Association for the Study of Lung Cancer Dunitz Martin, ISBN: 1853177083; Raghavan, et al. 
(eds. 1999) Textbook of Uncommon Cancer (2nd ed.) Wiley, ISBN: 0471929212; Thawley, et 

5 al. (eds. 1999) Comprehensive Management of Head and Neck Tumors (2 vols.) Saunders, 
ISBN: 0721655823; Whittaker and Holmes (eds. 1999) Leukemia and Related Disorders (3d 
ed.) Blackwell Science, ISBN: 0865426074; Aapro (ed. 1998) OncoMedia: Medical Oncolopv 
(CD-ROM) Elsevier Science, ISBN: 0080427480; Abeloff (1998) Clmical Oncology (Library 
Version 2 CD-ROM Individual Version 2.0 Windows and Macintosh) Harcourt Brace, ISBN: 

10 0443075557; Benson (ed. 1998) Gastrointestinal Oncologv (Cancer Treatment and Research, 
CTAR 98) Kluwer, ISBN: 0792382056; Brambilla and Brambilla (eds. 1998) Lung Tumors: 
Fundamental Biology and Clinical Manaeement (Vol 124) Marcel Dekker, ISBN: 0824701607; 
Canellos, et al. (eds. 1998) The Lvmphomas Saunders, ISBN: 0721650309; Greenspan and 
Remagen (1998) Differential Diagnosis of Txmiors and Tumor-Like Lesions of Bones and Joints 

15 Lippincott WilUams and WiUdns Publishers^ ISBN: 0397517106; Hiddemann (ed. 1998) Acute 
Leukemias VII: Experimental Approaches and Novel Therapies (Haematologie Und 
Bluttransfiision, Vol 39), Springer Verlag, ISBN: 3540635041; Husband and Reznek (1998) 
Imaging in Oncologv (2 vols.) Mosby, ISBN: 1899066489; Leibel and Phillips (eds. 1998) 
Texfcook of Radiation Oncologv Saunders, ISBN: 0721653367; Maloney and Miller (eds. 

20 1 998) Cutaneous Oncologv: Pathophvsiologv, Diagnosis, and Management Blackwell Science, 
ISBN: 0865425175; Mittal, et al. (eds. 1998) Advances in Radiation Therapv Kluwe. ISBN: 
0792399811; Oldham (ed. 1998) Princinies of Cancer Biotheraov (3d ed.) Kluwer, ISBN: 
0792335074; Ozols (ed. 1998) Gvnecologic Oncology Kluwer, ISBN: 0792380703; Parkin, et 
al. (eds. 1998) Cancer Incidence m Five Continents (laic Scientific Publications, No 143) 

25 Oxford University Press, ISBN: 9283221435; Perez and Brady (eds. 1998) Principles and 

Practice of Radiation Oncology Lippincott Williams and WiUdns, ISBN: 0397584164; Black, et 
al. (eds. 1997^ Cancer of the Nervous System Blackwell Science, ISBN: 0865423849; 
Bonadonna, et al. (1997) Textbook of Breast Cancer: A Clinical Guide to Therapy Blackwell 
Science, ISBN: 1853173487; PoUock(ed. 1997) Surgical Oncology Kluwer. ISBN: 

30 0792399005; Sheaves, et al. (eds. 1997) Clinical Endocrine Oncology Blackwell Science, 

ISBN: 086542862X; Vahrson (1997) Radiation Oncology of Gynecological Cancers Springer 
Verlag, ISBN: 0387567682; Walterhouse and Cohn (eds. 1997 ^ Diagnostic and Therapeutic 
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Advances in Pediatric Oncology Kluwer, ISBN: 0792399781; Aisner (ed. 1996) Comprehensive 
Textbook of Thoracic Oncology Lippincott, WilUams and Wilkins, ISBN: 0683000624; 
Bertino, et al. (eds. 1996) Encyclopedia of Cancer (3 vols.) Academic, ISBN: 012093230X; 
Cavalli, et al. (1996) Textbook of Medical Oncology Dunitz Martin, ISBN: 1853172901; 

5 Peckham, et al. (eds. 1995) Oxford Textbook of Oncology (2- Vols.) Oxford University Press, 
ISBN: 0192616854; and Freireich and Kantarjian (eds. 1996) Molecular Genetics and Therapy 
of Leukemia (Cancer Treatment and Research, V. 84) Kluwer, ISBN: 0792339126. 

In particular, identiiBcation of markers selectively expressed on defined cancers allows 
for use of that expression in diagnostic, prognostic, or therapeutic methods. As such, the 

10 invention defines various compositions, e.g., nucleic acids, polypeptides, antibodies, and small 
molecule agonists/antagonists, which will be useful to selectively identify those markers. For 
example, therapeutic methods may take the form of protein therapeutics which use the marker 
expression for selective localization or modulation of function (for those markers which have a 
causative disease effect), for vaccines, identification of binding partners, or antagonism, e.g., 

15 using antisense or RNAi. The markers may be useful for molecular characterization of subsets 
of the diseases, e.g., as provided in Table 1, which subsets may actually require very different 
treatments. Moreover, the markers may also be important in related diseases to the specific 
disorders and cancers, e.g., which affect similar tissues in non-malignant diseases, or have 
similar mechanisms of induction/maintenance. Metastatic processes or characteristics may also 

20 be targeted. Diagnostic and prognostic uses are made available, e.g., to subset related but 

distinct diseases, or to determine treatment strategy. The detection methods may be based upon 
nucleic acid, e.g., PCR or hybridization techniques, or protein, e.g., ELISA, imaging, IHC, etc. 
The diagnosis may be qualitative or quantitative, and may detect increases or decreases in 
expression levels. 

25 Tables 2B-76B provide unigene clust^ identification numbers for the nucleotide 

sequence of genes that exhibit increased or decreased expression in diseased samples (see 
Tables 1-3), particularly sequences involved in angiogenesis, arthritis, prostate cancer, breast 
cancer, colorectal cancer, cervical cancer, bladder cancer, head and neck cancer, esophageal 
cancer, limg cancer, ovarian cancer, pancreatic cancer, renal cancer, stomach cancer, skin 

30 cancer, testicular cancer, uterine cancer, gUoblastoma, Ewing sarcoma, soft tissue sarcoma, and 
lung fibrosis. Tables 2A-80 also provide an exemplar accession number that provides a 
nucleotide sequence that is part of the unigene cluster. 

6 
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Definitions 

The term "cancer protein" or "cancer polynucleotide" or "cancer-associated transcript" 
refers to nucleic acid and polypeptide polymorphic variants^ alleles, mutants, and interspecies 
homologues that: (1) have a nucleotide sequence that has greater than about 60% nucleotide 

5 sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably about 92%, 94%, 96%, 97%, 
98%, or 99% or greater nucleotide sequence identity, preferably over a region of over a region 
of at least about 25, 50, 100, 200, 500, 1000, or more nucleotides, to a nucleotide sequence of or 
associated with a gene of Tables 1-80; (2) bind to antibodies, e.g., polyclonal antibodies, raised 
against an immunogen comprising an amino acid sequence encoded by a nucleotide sequence of 

10 or associated with a gene of Tables 1-80, and conservatively modified variants thereof; (3) 

specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or the 
complement thereof of Tables 1-80 and conservatively modified variants thereof; or (4) have an 
amino acid sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 
75%, 80%, 85%, preferably 90%, 91%, 93%, 95%, 97%, 98%, or 99% or greater amino 

15 sequence identity, preferably over a region of over a region of at least about 25, 50, 100, 200, 
500, 1000, or more amino acids, to an amino acid sequence encoded by a nucleotide sequence 
of or associated with a gene of Tables 1-80. A polynucleotide or polypeptide sequence is 
typically fix>m a mammal including, but not limited to, primate, e.g., human; rodent, e.g., rat, 
mouse, hamster; cow, pig, horse, sheep, or other manunal. A "cancer polypeptide" and a 

20 "cancer polynucleotide," include both naturally occurring or recombinant forms. 

A "fiiU length" cancer protein or nucleic acid refers to a cancer polypeptide or 
polynucleotide sequence, or a variant thereof that contains elements normally contained In one 
or more naturally occurring, wild type cancer polynucleotide or polypeptide sequences. The 
"fiill length" may be prior to, or aftCT, various stages of post-translational processing or spHcing, 

25 including alternative splicing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that contains 
nucleic acids or polypeptides, e.g., of a cancer protein, polynucleotide, or transcript. Such 
samples include, but are not limited to, tissue isolated firom primates, e.g., humans, or rodents, 
e.g., nodce, and rats. Biological samples may also include sections of tissues such as biopsy and 

30 autopsy samples, firozen sections taken for histologic purposes, archival samples, blood, plasma, 
serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also include explants and 
primary and/or transformed cell cultures derived firom patient tissues. A biological sample is 
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typically obtained fix)m a enkaryotic organism, most preferably a mammal such as a primate, 
e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or a 
bird; reptile; or fish. Livestock and domestic animals are of interest. 

"Providing a biological sample" means to obtain a biological sample for use in methods 
5 described in this invention. Most often, this will be done by removing a sample of cells from an 
animal, but can also be accomplished by using previously isolated cells (e.g., isolated by 
another person, at anottier time, and/or for another purpose), or by performing the methods of 
the invention in vivo. Archival tissues or materials, having treatment or outcome history, will 
be particularly useful. 

10 The terms "identical" or percent "identity," in the context of two or more nucleic acids or 

polypeptide sequences, refer to two or more sequences or subsequences that are the same or 
have a specified percentage of amino acid residues or nucleotides that are the same (e.g., about 
70% identity, preferably 75%, 80%, 85%, 90%, 91%, 93%, 95%, 97%, 98%, 99%, or higher 
identity over a specified region, when compared and aligned for maximum correspondence over 

15 a comparison window or designated region) as measured using, e.g., a BLAST or BLAST 2.0 
sequence comparison algorithms with default parameters described below, or by manual 
aligmnent and visual inspection (see, e.g., NCBI web site http://www.ncbi.nhn.nih.gov/BLAST/ 
ortheUke). Such sequences are then said to be "substantially identical." This definition also 
refers to, or may be applied to, the complement of a test sequence. The definition also includes 

20 sequences that have deletions and/or insertions, substitutions, and naturally occurring, e.g., 
polymorphic or allelic variants, and man-made variants. As described below, the preferred 
algorithms can account for gaps and the like. Preferably, identity exists ov&r a region that is at 
least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
about 50-100 amino acids or nucleotides in length. 

25 For sequence comparison, typically one sequence acts as a reference sequence, to which 

test sequences are compared. When using a sequence comparison algorithm, test and reference 
sequences are entered into a computer, subsequence coordinates are designated, if necessary, 
and sequence algorithm program parameters are designated. Preferably, default program 
parameters can be used, or alternative parameters can be designated. The sequence comparison 

30 algorithm then calculates the percent sequence identities for the test sequences relative to the 
reference sequence, based on the program parameters. 
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A "comparison window", as used herein, includes reference to a segment of contiguous 
positions selected from the group consisting typically of from about 20 to 600, usually about 50 
to 200, more usually about 100 to 150, in which a sequence may be compared to a reference 
sequence of the same number of contiguous positions after the two sequences are optimally 
5 aligned. Methodsof alignment of sequences for comparison are well-known. Optimal 

aligranent of sequences for comparison can be conducted, e.g., by the local homology algorithm 
of Smith and Waterman (1981) Adv. ApdL Math. 2:482-489, by the homology alignment 
algorithm of Needleman and Wunsch (1970) J. MoL Biol. 48:443-453, by the search for 
similarity method of Pearson and Lipman (1988) Proc. Natl> Acad. Sci, USA 85:2444-2448, by 

10 computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in 
the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr,, 
Madison, WI), or by manual alignment and visual inspection (see, e.g., Ausubel, et al. (eds. 
1995 and supplements) Current Protocols in Molecular Biology Wiley), 

Preferred examples of algorithms that are suitable for determining percent sequence 

1 5 identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are 

described in Altschul, et al. (1977) Nuc. Acids Res. 25:3389-3402 and Altschul, et al. (1990) J. 
Mol. Biol. 215:403-410. BLAST and BLAST 2.0 are used, with the parameters described 
herein, to determine percent sequence identity for the nucleic acids and proteins of the 
invention. Software for performing BLAST analyses is publicly available through the National 

20 Center for Biotechnology Information (http://www.ncbi.nhn.nih.gov/). This algorithm involves 
first identifying high scoring sequence pairs (HSPs) by identifying short words of length W m 
the query sequence, which either match or satisfy some positive-valued threshold score T when 
aligned with a word of the same length in a database sequence. T is referred to as the 
neighborhood word score threshold (Altschul, et al., supra). These initial neighborhood word 

25 hits act as seeds for initiating searches to find longer HSPs containing (hem. The word hits are 
extended in both directions along each sequence for as far as the cumulative alignment score 
can be increased. Cumulative scores are calculated using, e.g., for nucleotide sequences, tiie 
parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score 
for mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to 

30 calculate the cumulative score. Extension of the word hits in each direction are halted when: the 
cumulative aligmnent score falls off by the quantity X from its maximum achieved value; the 
• cumulative score goes to zero or below, due to the accumulation of one or more negative- 
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scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN 
program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1 , an expectation (E) 
of 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the BLASTP 

5 program uses as defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 
scoring matrix (see HenikofFand Henikoff (1992) Proc. Natl Acad. Sci. USA 89:10915-919) 
alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity between two 
sequences. See, e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5787. 

10 One measure of similarity provided by the BLAST algorithm is the smallest sum probability 
(P(N)), which provides an indication of the probability by which a match between two 
nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is 
considered similar to a reference sequence if the smallest sum probability in a comparison of the 
test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than 

15 about 0.01, and most preferably less than about 0.001 . Log values may be negative large 
numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc. 

An indication that two nucleic acid sequences are substantially identical is that the 
polypeptide encoded by the first nucleic acid is immunologically cross reactive with the 
antibodies raised against the polypeptide encoded by the second nucleic acid. Thus, a 

20 polypeptide is typically substantially identical to a second polypeptide, e.g., where the two 
peptides diff^ only by conservative substitutions. Another indication that two nucleic acid 
sequences are substantially identical is that the two molecules or their complements hybridize to 
each other under stringent conditions. Yet another indication that two nucleic acid sequences 
are substantially identical is that the same primers can be used to amplify the sequences. 

25 A "host ceir is a naturally occurring cell or a transformed cell that contains an 

expression vector and supports the replication or expression of the expression vector. Host cells 
msLy be cultured cells, explants, cells in vivo, and the like. Host cells may be prokaryotic cells 
such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or mammalian cells such as 
CHO, HeLa, and the like (see, e.g., the American Type Culture Collection catalog or web site, 

30 www.atcc.org). 

The terms "isolated," "purified," or 'T)iologically pure" refer to material that is 
substantially or essentially fi-ee fi:om components that nomially accompany it as found in its 

10 
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native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance Uquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 
preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
S some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified** in some embodiments denotes that a nucleic acid or 
protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means that the 
nucleic acid or protein is at least about 85% pure, more preferably at least 95% pure, and most 
preferably at least 99% pure. "Purify*' or "purification" in other embodiments means removing 
10 at least one contaminant or component from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogeneous, e.g., 100% pure. 

The terms "polypeptide," "peptide," and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which 
one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally 
IS occurring amino acid, as well as to naturally occurring amino acid polymers, those containing 
modified residues, and non-naturally occuning amino acid polymers. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, as well as 
anuno acid analogs and amino acid mimetics that fimction similarly to the naturally occurring 
amino acids. Naturally occurring amino acids are those encoded by the genetic code, as well as 
20 those amino acids that are later modified, e.g., hydroxyproline, 7-carboxyglutamate, and O- 
phosphoserine. Amino acid analogs refers to compounds that have the same basic chemical 
structure as a naturally occurring amino acid, e.g., an a carbon that is bound to a hydrogen, a 
caiboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine 
siilfoxide, methionine methyl sulfonium. Such analogs may have modified R groups (e.g., 
25 norleucine) or modified peptide backbones, but retain somebasic chemical structure as a 

naturally occurring amino acid. Amino acid mimetic refers to a chemical compound that has a 
structure that is different from the general chemical structure of an amino acid, but that 
fimctions similarly to another anuno acid. 

Amino acids may be referred to herein by either their commonly known three letter 
30 symbols or by the one-letter symbols recommended by the lUPAC-IUB Biochemical 

Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

U 
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"Conservatively modified variant" applies to both amino acid and nucleic acid 
sequences. With respect to particular nucleic acid sequences, conservatively modified variants 
refCTs to those nucleic adds which encode identical or essentially idraitical amino acid 
sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially 
5 identical or associated, e.g., naturally contiguous, sequences. Because of the degeneracy of the 
genetic code, a large number of functionally identical nucleic acids encode most proteins. For 
instance, ti»e codons GCA, C3CC, GCG, and GCU each encode the amino acid alanine. Thus, at 
each position where an alanine is specified by a codon, the codon can be altered to another of 
flie corresponding codons described wifliout altering the encoded polypeptide. Such nucleic 

10 acid variations are "silent variations," which are one species of conservatively modified 
variations. Every nucleic acid sequence herein which encodes a polypeptide also describes 
silait variations of the nucleic acid. In certain contexts each codon in a nucleic acid (except 
AUG, which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only 
codon for tiyptophan) can be modified to yield a fimctionally similar molecule. Accordingly, a 

1 5 silent variation of a nucleic acid which encodes a polypeptide is implicit in a described 

sequence with respect to the expression product, but not necessarily with respect to actual probe 
sequences. 

As to amino acid sequences, one of skill will recognize tiiat individual substitutions, 
deletions, or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, 

20 adds, or deletes a single amino acid or a small percentage of amino acids in the encoded 

sequence is a "conservatively modified variant" where the alteration results in tiie substitution of 
an amino acid witii a chemicaUy similar amino acid. Conservative substihition tables providing 
fimctionally similar amino acids are well known. Such conservatively modified variants are in 
addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the 

25 invaition. TypicaUy conservative substitutions include for one anotiier 1) Alanine (A), Glycine 
(G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine 
(R), Lysine (K); 5) Isoleucme (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine 
(F), Tyrosine 00, Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteme (C), 
Mefliionine (M) (see, e.g., Creighton (1984) Proteins: Structure and Molecular Propertiess 

30 Freonan). 

Macromolecular stractures such as polypeptide stinctures can be described in terms of 
various levels of organization. For a general discussion of this organization, see, e.g., Alberts, 

12 
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et al. (eds. 2001) Molecular Biology of the Cell (4th ed.) Garland; and Cantor and Schunmel 
(1980) Biophysical Chemistry Part I: The Conformation of Biological Macromolecules 
Freeman. "Primary structure** refers to the amino acid sequence of a particular peptide. 
''Secondary structure" refers to locally ordered, three dimensional structures within a 
S polypq)tide. These structures are commonly known as domains. Domains are portions of a 
polypeptide that often form a compact unit of the polypq)tide and are typically 25 to 
approximately SOO sunino acids long. Typical domains are made up of sections of lesser 
organization such as stretches of P-sheet and a-helices. "Tertiary structure" refers to the 
complete three dimensional structure of a polypeptide monomer. "Quaternary structiure** refers 
10 to the three dimensional structure formed, usually by the noncovalent association of 
independent tertiary tmits. Anisotropic temis are also known as energy terms. 

"Nucleic acid" or "oligonucleotide" or ''polynucleotide" or grammatical equivalents used 
herein means at least two nucleotides covalently linked together. Oligonucleotides are typically 
from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50, or more nucleotides in length, up to about 
15 100 nucleotides in length. Nucleic acids and polynucleotides are a polymers of any length, 
including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, etc. A 
nucleic acid of the present invention will generally contain phosphodiester bonds, although in 
some cases, nucleic acid analogs are included that may have at least one different linkahge, e.g., 
phosphoramidate, phosphorothioate, phosphorodithioate, or 0-methylphophoroamidite linkages 
20 (see Eckstein (1992) Oligonucleotides and Analogues: A Practical Approach Oxford Univ. 
Press); and peptide nucleic acid backbones and linkages. Other analog nucleic acids include 
those with positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7 of Sanghvi and 
Cook (eds. 1994) Carbohydrate Modifications in Antisense Research ACS Symposium Series 
25 580. Nucleic acids containing one or more caifoocyclic sugars are also included within one 
definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done for a 
variety of reasons, e.g., to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and 
analogs can be made; alternatively, mixtures of dififerent nucleic acid analogs, and mixtures of 
30 natiurally occurring nucleic acids and analogs may be made. 

A variety of references disclose such nucleic acid analogs, including, e.g., 
phosphoramidate (Beaucage, et al. (1993) Tetrahedron 49:1925-1963 and references therein; 

13 
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Letsinger (1970) J. Ore. Chem. 35:3800-3803; Sprinzl. et al. (1977) Eur. J. Biochem. 81:579- 
589; Letsinger, et al. (1986) Nucl. Acids Res. 14:3487-499; Sawai, et al. (1984) Chem. Lett. 
805, Letsinger, et al. (1988) J. Am. Chem. Soc. 110:4470-4471; and Pauwels, et al. (1986) 
ChemicaScrinta 26:141-149), phosphorothioate (Mag, et al. (1991) Nucleic Acids Res. 

5 19:1437-441; and U.S. Patent No. 5,644,048), phosphorodithioate (Brill, et al. (1989) J. Am. 
Chem. Soc. 11 1:2321-2322), O-methylphophoroamidite linkages (see Eckstein (1992) 
Oligonucleotides and Analogues: A Practical Approach. Oxford Univ. Press), and peptide 
nucleic acid backbones and linkages (see Eghotai (1 992) L Am. Chem. Soc. 114:1 895-1 897; 
Meier, et al. (1992) Chem. Int. Ed. Engl. 31:1008-1010: Nielsen (1993) Nature 365:566-568; 

10 Carlsson, et al. (1996) Nature 380:207, all of which are incorporated by reference). Other 
analog nucleic acids include those with positive backbones (Denpcy, et al. (1995) Proc. Natl. 
Acad. Sci. USA 92:6097-101; non-ionic backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 
5,602,240, 5,216,141, and 4,469,863; Kiedrowski, et al. (1991) Angew. Chem. Intl. Ed English 
30:423-426; Letsinger, et al. (1988) J. Am. Chem. Soc. 1 10:4470-4471; Letsinger, et al. (1994) 

15 Nucleoside and Nucleotide 13:1597; Chapters 2 and 3 in Sanghvi and Cook (eds. 1994) 

Carbohydrate ModiiBcations in Antisense Research ACS Symposium Series 580; Mesmaeker, et 
al. (1994) Bioorganic and Medicinal Chem. Lett. 4:395-398; Jeffs, et al. (1994) J. Biomolecular 
NMR 34:17; Horn, et al. (1996) Tetrahedron Lett. 37:743) and non-ribose backbones, including 
those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7 in Sanghvi 

20 and Cook (eds. 1994) Carbohydrate Modifications in Antisense Research ACS Symposium 
Series 580. Nucleic acids containing one or more carbocyclic sugars are also included within 
one dejBnition of nucleic acids (see Jenkins, et al. (1995) Chem. Soc. Rev, pp 169-176), Several 
nucleic acid analogs are described in Rawls (page 35, June 2, 1997) C&ENews . 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide nucleic 

25 acid analogs. These 'backbones are substantially non-ionic under neutral conditions, in contrast 
to the highly charged phosphodiester backbone of naturally occurring nucleic acids. This 
results in at least two advantages. The PNA backbone exhibits improved hybridization kinetics. 
PNAs have larger changes in the melting temperature (Tm) for mismatched versus perfectly 
matched basepairs. DNA and RNA typically exhibit a 2-4° C drop in T^ for an internal 

30 mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9"^ C. Similarly, due to 
their non-ionic nature, hybridization of the bases attached to these backbones is relatively 
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insensitive to salt concentration. In addition, PNAs are not degraded by cellular enzymes, and 
thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded or single stranded sequence. The depiction of a single strand 

S also defines the sequence of the complementary strand; thus the sequences described herein also 
provide the complement of the sequence. The nucleic acid may be DNA, both genomic and 
cDNA, RNA, or a hybrid, where the nucleic acid may contain combinations of deoxyribo- and 
ribo-nucleotides, and combinations of bases, including uracil, adenine, tiiymine, cytosine, 
guanine, inosine, xanthine hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically 

10 refers to a naturally occurring RNA, e.g., a pre-mKNA, hnRNA, or mRNA. As used herein, the 
term "nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 
naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic acid, 
each containing a base, are referred to herein as a nucleoside. 

15 A "label" or a "detectable moiety" is a composition detectable by spectroscopic, 

photochemical, biochemical, immunochemical, physiological, chemical, or other physical 
means. In general, labels fall into three classes: a) isotopic labels, which may be radioactive or 
heavy isotopes; b) immune labels, which may be antibodies, antigens, or epitope tags; and c) 
colored or fluorescent dyes. The labels may be incorporated into the cancer nucleic acids, 

20 proteins, and antibodies. For example, the label should be cs^able of producing, either directly 
or indirectly, a detectable signal. The detectable moiety may be a radioisotope, such as ^H, 
l^C, 32p^ 35s^ or l^Sj^ electron-dense reagents, a fluorescent or chemiluminescent compound, 
such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme (e.g., as conunonly 
used in an ELIS A), biotin, digoxigenin, or haptens and proteins or other entities which can be 

25 made detectable such as alkaline phosphatase, beta-galactosidase, or horseradish peroxidase.. 
Methods are known for conjugating the antibody to the label. See, e.g., Hunter, et al. (1962) 
Nature 144:945; David, et al. (1974) Biochemistry 13:1014-1021; Pain, et al. (1981) L 
Immunol. Meth. 40:219-230; and Nygren (1982) J. Histochem. and Cvtochem. 30:407-412. 

An "effector" or "effector moiety" or "efifector component" is a molecule that is bound 

30 (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 

noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "efifector" can be a variety of molecules including, e.g., detection moieties including 

15 
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radioactive compounds, fluorescent compounds, enzymes or substrates, tags such as epitope 
tags, toxins; activatable moieties, chemotherapeutic agents; lipases; antibiotics; chemoattracting 
moieties, immune modulators (micA/B), or radioisotopes, e.g., emitting "hard" beta, radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is Jbound, e.g,, covalently, 
through a linker or a chemical bond, or noncovalently, through ionic, van der Waals, 
electrostatic, or hydrogen bonds to a label such that the presence of the probe may be detected 
by detecting the presence of the label bound to the probe. Alternatively, methods using high 
affinity interactions may achieve the same restilts where one of a pair of binding partners binds 
to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is a nucleic acid capable of 
binding to a target nucleic acid of complementary sequence through one or more types of 
chemical bonds, usually through complementary base pairing, e.g., through hydrogen bond 
formation. As used herein, a probe may include natural (e.g., A, G, C, or T) or modified bases 
(7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a linkage 
other than a phosphodiester bond, preferably one that does not functionally interfere with 
hybridization. Thus, e.g., probes may be peptide nucleic acids in which the constituent bases 
are joined by peptide bonds rather than phosphodiester linkages. Probes may bind target 
sequences lacking complete complementarity with the probe sequence deprading upon the 
stringency of the hybridization conditions. The probes are preferably directly labeled, e.g., with 
isotopes, chromophores, lumiphores, chromogens, or indirectly labeled, e.g., with biotin to 
which a streptavidin complex may later bind. By assaying for the presence or absence of the 
probe, one can detect the presence or absence of the select sequence or subsequence. Diagnosis 
or prognosis may be based at the genomic level, or at the level of RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein, or vector, has been modified by 
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
acid or protem, or tiiat the cell is derived from a cell so modified. Thus, e.g., recombinant cells 
express genes that are not found within ttie native (non-recombinant) form of the cell or express 
native genes that are otherwise abnormally expressed, under expressed, or not expressed at all. 
By the term "recombinant nucleic acid" herein is meant nucleic acid, originally formed in vitro, 
in general, by the manipulation of nucleic acid, e.g., using polymerases and endonucleases, in a 
form not normally found in nature. In this maimer, operably linkage of different sequences is 
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achieved. Thus an isolated nucleic acid, in a linear fonn, or an expression vector fonned in 
vitro by ligating DNA molecules that are not normally joined, are both considered recombinant 
for the purposes of this invention. It is understood that once a recombinant nucleic acid is made 
and reintroduced into a host cell or organism, it will replicate non-recombinantly, e.g., using the 
in vivo cellular machinery of the host cell rather than in vitro manipulations; however, such 
nucleic acids, once produced recombinantly, although subsequently replicated non- 
recombinantly, are still considered recombinant for the purposes of the invention. 

Similarly, a "recombinant protein" is a protein made using recombinant techniques, e.g., 
through the expression of a recombinant nucleic acid as depicted above. . A recombinant protein 
is distinguished from naturally occurring protein by at least one or more characteristics. The 
protein may be isolated or purified away from some or most of the proteins and compounds 
with which it is normally associated in its wild type host, and thus may be substantially pure. 
An isolated protein is xmaccompanied by at least some of the material with which it is normally 
associated in its natural state, preferably constituting at least about 0.5%, more preferably at 
least about 5% by weight of the total protein in a given sample. A substantially pure protein 
comprises at least about 75% by weight of the total protein, with at least about 80% being 
prefenred, and at least about 90% being particularly preferred. The definition includes the 
production of a cancer protein from one organism in a different organism or host cell. 
Alternatively, the protein may be made at a significantly higher concentration than is normally 
seen, through the iise of an inducible promoter or high expression promoter, such that the 
protein is made at increased concentration levels. Alternatively, flie protein may be in a form 
not normally found in nature, as in the addition of an epitope tag or amino acid substitutions, 
insertions and deletions, as discussed below. 

The tCTn "heterologous" when used with reference to portions of a nucleic acid indicates 
that the nucleic acid comprises two or more subsequences that are not normally found in the 
same relationship to each oth^ in nature. For instance, the nucleic acid is typically 
recombinantly produced, having two or more sequences, e.g., fix)m unrelated genes arranged to 
make a new functional nucleic acid, e.g., a promoter from one source and a coding region from 
another source. Similarly, a heterologous protein will often refer to two or more subsequences 
that are not found in the same relationship to each other in nature (e.g., a fusion protein). 

A "promoter" is typically an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 
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sequences near the start site of transcription, such as, in the case of a polymerase 11 type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thoiisand base pairs from the start site of 
transcription. A "constitutive" promote is a promoter that is active under most environmental 
and developmental conditions. An "inducible" promoter is active under environmental or 
developmental regulation. The term "operably linked" refers to a functional linkage between a 
nucleic acid expression control sequence (such as a promoter, or array of transcription factor 
binding sites) and a second nucleic acid sequence, e.g., wherein the expression control sequence 
directs transcription of the nucleic acid corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 
synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be transcribed 
in operable linkage to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, duplexing, 
or hybridizing of a molecule selectively to a particular nucleotide sequence under stringent 
hybridization conditions when that sequence is present in a complex nodxture (e.g., total cellular 
or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under which a probe 
will hybridize to its target subsequence, typically in a complex mixture of nucleic acids, but to 
no oth^ sequmces. Stringent conditions are sequence-dependent and will be different in 
different circumstances. Longer sequences hybridize specifically at higher temperaturesf An 
extensive guide to the hybridization of nucleic acids is found in "Overview of principles of 
hybridization and the strategy of nucleic acid assays" in Tijssen (1993) Hvbridization with 
Nucleic Probes (Laboratorv Techniques in Biochemistrv and Molecular Bioloev't (vol. 24) 
Elsevier. Generally, stringent conditions are selected to be about 5-10® C lower than the 
thermal meltmg point (T^) for the specific sequence at a defined ionic strength pH. The Tjn is 
the temperature (imder defined ionic strength, pH,. and nucleic concentration) at which 50% of 
the probes complementary to the target hybridize to the target sequence at equilibrium (as the 
target sequences are present in excess, at Tjn, 50% of the probes are occupied at equilibrium). 
Stringent conditions will be those in which the salt concentration is less than about 1 .0 M 
sodium ion, typically about 0.01-1.0 M sodium ion concentration (or other salts) at pH 7.0 to 
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8.3 and the temperature is at least about 30** C for short probes (e.g„ about 10-50 nucleotides) 
and at least about 60° C for long probes (e.g., greater than about 50 nucleotides). Stringent 
conditions may also be achieved with the addition of destabilizing agents such as formamide. 
For selective or specific hybridization, a positive signal is typicaUy at least two times 
background, preferably 10 times background hybridization. Exemplary stringent hybridization 
conditions can be as following: 50% fonnamide, 5x SSC, and 1% SDS, incubating at AT C, or, 
5x SSC, 1% SDS, incubating at 65° C, with wash m 0.2x SSC, and 0.1% SDS at 65° C. For 
PCR, a temperature of about 36° C is typical for low stringency amplification, although 
annealing temperatures may vary between about 32°-48° C depending on primer length. For 
high stringency PCR amplification, a temperature of about 62° C is typical, although high 
stringency annealing temperatures can range from about 50-65° C, depending on the primer 
length and specificity. Typical cycle conditions for both high and low stringency amplifications 
include a denaturation phase of 90-95° C for 30-120 sec, an annealing phase lasting 30-120 sec, 
and an extension phase of about 72° C for 1-2 min. Protocols and guidelines for low and high 
stringency amplification reactions are provided, e.g., in Innis, et al. (1990) PCR Protocols: A 
Guide to Methods and Applications Academic Press, NY. 

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy 
permitted by the genetic code. In such cases, the nucleic acids typically hybridize under 
moderately stringent hybridization conditions. Exemplary "moderately stringent hybridization 
conditions" include a hybridization in a buffer of 40% fonnamide, 1 M NaCl, 1% SDS at 37° C, 
and a wash in IX SSC at 45° C. A positive hybridization is typically at least twice background. 
Alternative hybridization and wash conditions can be utilized to provide conditions of similar 
stringency. Additional guidelines for determining hybridization parameters are provided in 
numerous references, e.g., Ausubel, et al. (eds. 1991 and supplements) Current Protocols in 
Molecular Biology Wiley. 

The phrase "fimctional effects" in the context of assays for testing compounds that 
modulate activity of a cancer protein includes the determination of a parameter that is indirectly 
or directly under the influence of the cancer protein or nucleic acid, e.g., a physiological, 
fimctional, physical, or chemical effect, such as the ability to decrease cancer. It includes ligand 
binding activity; cell viability; cell growth on soft agar; anchorage dependence; contact 
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inhibition and density limitation of growth; cellular proliferation; cellular transformation; 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel; 
tumor growth and metastasis in vivo; mRNA and protein ^pression in cells undergoing 
metastasis; and other characteristics of cancer cells. "Functional effects" include in vitro, in 

5 vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that increases 
or decreases a parameter that is indirectly or directly under the influence of a cancer protein 
sequence, e.g., physiological, functional, enzymatic, physical, or chemical effects. Such 
fimctional effects can be measured, e.g., changes in spectroscopic characteristics (e.g., 

10 fluorescence, absoibance, refractive index), hydrodynamic (e.g., shape), chromatographic, or 
solubility properties for the protein, measuring inducible markers or transcriptional activation of 
the cancer protein, measuring binding activity or blading assays, e.g., binding to antibodies or 
other ligands, and measuring growth, cellular proUferation, cell viability, cellular 
transformation, growth factor or serum dependence, tumor specific marker levels, invasiveness 

15 into Matrigel, tumor growth and metastasis in vivo, ndlNA and protein expression, and other 
characteristics of cancer cells. The functional effects can be evaluated by many means, e.g., 
microscopy for quantitative or qualitative measures of alterations in morphological features, 
measurement of changes in RNA or protein levels for cancer-associated sequences, 
measurement of RNA stability, identification of downstream or reporter gene expression (CAT, 

20 luciferase, jS-gal, GFP, and the like), e.g., via chemiluminescence, fluorescence, colorimetric 
reactions, antibody binding, inducible markers, and ligand binding assays. 

' "Inhibitors", "activators," and "modulators" of cancer polynucleotide and polypeptide 

sequences are used to refer to activating, inhibitory, or modulating molecules or compounds 
identified using in vitro and in vivo assays of cancer polynucleotide and polypeptide sequences. 

25 Inhibitors are compounds fiiat, e.g., bind to, partially or totally block activity, decrease, prevent, 
delay activation, inactivate, desensitize, or down regulate the activity or expression of cancer 
proteins, e.g., antagonists. Antisense or inhibitory nucleic acids may seem to inhibit expression 
and subsequent function of the protein. "Activators" are compounds that increase, open, 
activate, facilitate, enhance activation, sensitize, agonize, or up regulate cancer protein activity. 

30 Inhibitors, activators, or modulators also include genetically modified versions of cancer 

proteins, e.g., versions with altered activity, as well as naturally occurring and synthetic ligands, 
antagonists, agonists, antibodies, small chemical molecules, and the like. Such assays for 
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inhibitors and activators include, e.g., expressing the cancer protein in vitro, in ceils, or cell 
membranes, applying putative modulator compounds, and then determining the functional 
effects on activity, as described above. Activators and inhibitors of cancer can also be identified 
by incubating cancer cells with the test compound and determining increases or decreases in the 

5 expression of 1 or more cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, or more 
cancer proteins, such as cancer proteins encoded by the sequences set out in Tables 2A-80. 

Samples or assays comprising cancer proteins that are treated with a potential activator, 
inhibitor, or modulator are compared to control samples without the inhibitor, activator, or 
modulator to examine the extent of inhibition. Control samples (untreated with inhibitors) are 

10 assigned a relative protein activity value of 100%- Inhibition of a polypeptide is achieved when 
the activity value relative to the control is about 80%, preferably 50%, more preferably 25-0%. 
Activation of a cancer polypeptide is achieved when the activity value relative to the control 
(untreated with activators) is about 110%, more preferably 150%, more preferably 200-500% 
(e.g., two to five fold higher relative to the control), more preferably 1000-3000% higher. 

15 The phrase "changes in cell growth" refers to any change in cell growth and proliferation 

characteristics in vitro or in vivo, such as cell viability, formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 
limitation of growth, loss of growth factor or serum requirements, changes in cell morphology, 
gaining or losing immortalization, gaming or losmg tumor specific markers, ability to form or 

20 suppress tumors when injected into suitable animal hosts, and/or inunortalization of the cell. 

See, e,g., pp. 23 1-241 in Freshney (1994) Culture of Animal Cells a Manual of Basic Technioue 
(2ded.)Wiley-Liss. 

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," "transformed" cells or "transformation" m tissue culture, refers to 

25 spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise fix)m infection with a transforming virus 
and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 
spontaneously or following exposure to a carcinogen, tiiereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 

30 aberrant growth control, noimiorphological changes, and/or malignancy. See, Freshney (2000) 
Culture of Animal Cells: A Manual of Basic Technique (4th ed.) Wiley-Liss. 
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"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized inmiunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon, 
and mu constant region genes, as well as the myriad inmiunoglobulin variable region genes. 
Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, 
mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, 
IgD, and IgE, respectively. Typically, the antigen-binding region of an antibody or its 
fimctional equivalent will be most critical in specificity and affinity of binding. See Paul (ed 
1999) Fundamental Immunology (4th ed) Raven. 

An exemplary immunoglobulin (antibody) structural imit comprises a tetramer. Each 
tetramer is composed of two identical pairs of polypeptide chains, each pair having one "light" 
(about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each chain defines a 
variable region of about 100 to 1 10 or more amino acids primarily responsible for antigen 
recognition. The terms variable light chain (Vl) and variable heavy chain (Vu) refer to these 

light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized 
fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an antibody 
below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab which itself 
is a light chain joined to Vh-Ch1 by a disulfide bond. The F(aby2 may be reduced under mild 
conditions to break tiie disulfide linkage in the hinge region, thereby converting the F(ab)'2 
dimer into an Fab* monomer. The Fab' monomer is essentially Fab with part of the hinge, region 
(see Paul (ed. 1999) Fundamental Immunologv (4tii ed.) Raven. While various antibody 
fi:agments are defined in terms of the digestion of an intact antibody, one of skill will appreciate 
that such fragments may be synthesized de novo either chemically or by using recombinant 
DNA methodology. Thus, the term antibody, as used herein, also includes antibody fragments 
eithCT produced by the modification of whole antibodies, or those synthesized de novo using 
recombinant DNA methodologies (e.g., single chain Fv) or those identified using phage display 
Hbraries (see, e.g., McCafferty, et al. (1990) Nature 348:552-554). 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal antibodies, 
many techniques known. See, e.g., Kohler and Milstein (1975) Nature 256:495-497; Kozbor, et 
al. (1983) Immunology Today 4:72; Cole, et al. (1985) pp. 77-96 in Reisfeld and Sell (1985) 

Monoclonal Antibodies and Cancer Therapy Liss: Coligan (19911 Current Protocols in 
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Immunology Lippincott; Harlow and Lane (1988) Antibodies: A Laboratory Manual CSH 
Press; and Coding (1986) Monoclonal Antibodies: Principles and Practice (2d ed.) Academic 
Press. Techniques for the production of single chain antibodies (Q.S. Patent 4,946,778) can be 
adapted to produce antibodies to polypeptides of this iny^tion. Also, transgenic mice, or other 
organisms such as other manmials, may be used to express humanized antibodies. 
Altematively, phage display technology can be used to identify antibodies and heteromeric Fab 
fragments that specifically bind to selected antigens. See, e.g., McCafferty, et al. (1990) Nature 
348:552-554; Marks, et al. (1992) Biotechnology 10:779-783. 

A "chimeric antibody" is an antibody molecule in which (a) the constant region, or a 
portion thereof, is altered, replaced, or exchanged so that the antigen binding site (variable 
region) is linked to a constant region of a different or altered class, and/or species, or an entirely 
different molecule which confers new properties to the chimeric antibody, e.g., an enzyme, 
toxin, hormone, growth factor, drug, effector function, chemoattractant, immune modulator, 
etc.; or (b) the variable region, or a portion thereof, is altered, replaced, or exchanged with a 
variable region having a different or altered antigen specificity. 
Identification of cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different patient samples 
for which diagnosis information is desired, to provide expression profiles. An expression 
profile of a particular sample is essentially a "fingerprint" of the state of the sample; while two 
states may have any particular gene similarly expressed, the evaluation of a number of genes 
simultaneously allows the generation of a gene expression profile that is characteristic of the 
state of the cell. That is, normal tissue may be distinguished fiom cancerous or metastatic 
cancerous tissue, or cancer tissue or metastatic cancerous tissue can be compared with tissue 
fix)m surviving cancer patients. By comparing expression profiles of tissue in known different 
cancer states, information regarding which genes are important (including both up-and down- 
regulation of genes) in each of these states is obtained. Molecular profiling may distinguish 
subtypes of a currently collective disease designation, e.g., different forms of a cancer. 

The identification of sequences that are differentially expressed in cancer versus non- 
cancer tissue allows the use of this information in a nxmiber of ways. For example, a particular 
treatment regime may be evaluated: does a chemotherapeutic drug act to down-regulate cancer, 
and thus tumor growth or recurrence, in a particular patient. Altematively, a treatment step may 
induce other markers which may be used as targets to destroy tumor cells. Similarly, diagnosis 
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and treatment outcomes may be done or confirmed by comparing patient samples with the 
known expression profiles, Maliganant disease may be compared to non-malignant conditions. 
Metastatic tissue can also be analyzed to determine the stage of cancer in the tissue, or origin of 
primary tumor, e.g., metastasis firom a remote primary site. Furthermore, these gene expression 
5 profiles (or individual genes) allow screening of drug candidates with an eye to mimicking or 
altering a particular expression profile; e.g., screening can be done for drugs that suppress the 
cancer expression profile. This may be done by making biochips comprising sets of the 
important cancer genes, which can then be used in these screens. These methods can also be 
done on the protein basis; that is, protein expression levels of the cancer proteins can be 

10 evaluated for diagnostic purposes or to screen candidate agents. In addition, the cancer nucleic 
acid sequences can be administered for gene therapy purposes, including the administration of 
antisense nucleic acids, or the cancer proteins (including antibodies and other modulators 
thereof) administered as therapeutic drugs. 

Thus the present invention provides nucleic acid and protein sequences that are 

15 differentially expressed in cancer relative to normal tissues and/or non-malignant disease, or in 
different types of related diseases, herein termed "cancer sequences." As outlined below, cancer 
sequences include those that are up-regulated (e.g., expressed at a higher level) in cancer, as 
well as those that are down-regulated (e.g., expressed at a lower level). In a preferred 
embodiment, the cancer sequences are firom humans; however, cancer sequences firom other 

20 organisms may be useful in animal models of disease and drug evaluation; thus, other cancer 
sequences are provided, fiom vertebrates, including maimnals, including rodents (rats, mice, 
hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, Eorses, 
etc.) and pets (e.g., dogs, cats, etc.). Cancer sequences from other organisms may be obtained 
using the techniques outlined below. 

25 Cancer sequences can include both nucleic acid and amino acid sequences. In a 

preferred embodiment, the skin cancer sequences are recombinant nucleic acids. These nucleic 
acid sequences are useful in a variety of applications, including diagnostic applications, which 
will detect naturally occurring nucleic acids, as well as screening applications; e.g., biochips 
comprising nucleic acid probes or PCR microtiter plates with selected probes to the cancer 

30 sequences. 

A cancer sequence can be initially identified by substantial nucleic acid and/or amino 
acid sequence homology to the cancer sequences outlined herein. Such homology can be based 
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upon the overall nucleic acid or amino acid sequence, and is generally detennined as outlined 
below, e.g., using homology programs or hybridization conditions. 

For identifying cancer-associated sequences, the cancer screen typically mcludes 
comparing genes identified in different tissues, e.g., normal and cancerous tissues, cancer and 
S non-malignant conditions, non-malignant conditions and normal tissues, or tumor tissue 
samples from patients who have metastatic disease vs. non metastatic tissue. Other suitable 
tissue comparisons include comparing cancer samples with metastatic cancer samples from 
other cancers, such as lung, stomach, gastrointestinal cancers, etc. Samples of different stages 
of cancer, e.g., survivor tissue, drug resistant states, and tissue undergoing metastasis, are 
10 applied to biochips comprising nucleic acid probes. The samples are first microdissected, if 

applicable, and treated for preparation of mRNA. Suitable biochips are commercially available, 
e.g., bom Affymetrix, Santa Clara, CA. Gene expression profiles as described herein are 
generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between normal and 
15 disease states are compared to genes expressed in other normal tissues, including, and not 
limited to lung, heart, brain, liver, stomach, kidney, muscle, colon, small intestine, large 
intestine, spleen, bone, and/or placenta. In a preferred embodiment, those genes identified 
during the cancer screen that are expressed in a significant amount in other tissues (e.g., 
essential organs) are removed fcom the profile, although in some embodiments, this is not 
20 necessary (e.g., where organs may be dispensible, e.g., female or male specific). That is, when 
screening for dmgs, it is usually preferable that the target expression be disease specific, to 
minimize possible side effects on other organs were there expression. 

In a preferred embodiment, cancer sequences are those that are up-regulated in cancer, 
that is, the expression of these genes is higher in the cancer tissue as compared to non-cancer or 
25 non-malignant tissue. "Up-regulation" as used herein often means at least about a two-fold 
change, preferably at least about a three fold change, with at least about five-fold or higher 
being preferred. Another embodiment is directed to sequences up-regulated in non-malignant 
conditions relative to normal. Uniformity among relevant samples is also preferred. 

Unigene cluster identification numbers and accession numbers herein are for the 
30 GenBank sequence database and the sequences of the accession numbers are hereby expressly 
incorporated by reference. GenBank is available, see, e.g., Benson, et al. (1998) Nuc. Acids 
Res. 26:1-7; and http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, 
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e.g., European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
In some situations, the sequences may be derived from assembly of available sequences or be 
predicted from genomic DNA using exon prediction algoritbms, such as FGENESH. See 
Salamov and Solovyev (2000) Genome Res. 10:516-522. In other situations, sequences have 

5 been derived from cloning and sequencing of isolated nucleic acids. 

In another preferred embodiment, cancer sequences are those that are down-regulated in 
the cancer, that is, the expression of these genes is lower in cancer tissue as compared to non- 
cancerous tissue. ''Down-regulation" as used herein often means at least about a two-fold 
change, preferably at least about a three fold change, with at least about five-fold or higher 

10 bemg preferred. 
Informatics 

The ability to identify genes that are over or under expressed in cancer can additionally 
provide high-resolution, high-sensitivity datasets which can be used in the areas of diagnostics, 
therapeutics, drug development, pharmacogenetics, protein structure, biosensor development, 

15 and other related areas. For example, the expression profiles can be used in diagnostic or 

prognostic evaluation of patients with cancer or related diseases. See Tables 1-3. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation (see Anderson (June 1 1-12, 1998) Pharmaceutical Proteomics: Targets, 
Mechani55 m, and Function, paper presented at the IBC Proteomics conference, Coronado, CA). 

20 Subcellular toxicological information can also be utilized in a biological sensor device to 
predict the likely toxicological efTect of chemical exposures and likely tolerable exposure 
thresholds (see U.S. Patent No. 5,81 1,231). Similar advantages accrue from datasets relevant to 
other biomolecules and bioactive agents (e.g., nucleic acids, saccharides, lipids, drugs, and the 
like). 

25 Thus, in another embodiment, the present invention provides a database that includes at 

least one set of assay data. The data contained in the database is acquired, e.g., using array 
analysis either singly or in a library format The database can be in a fonn in which data can be 
maintained and transmitted, but is preferably an electronic database. The electronic database of 
the invention can be maintained on any electronic device allowing for the storage of and access 

30 to the database, such as a personal computer, but is preferably distributed on a wide area 
network, such as the World Wide Web. 
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The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. Similar databases can be assembled for assay data acquired using an 
assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or 
absolute abundance of a variety of molecular and macromolecular species from a biological 
sample representing cancer, e.g., the identification of cancer-associated sequences described 
herein, provide an abundance of information which can be correlated with pathological 
conditions, predisposition to disease, drug testing, therapeutic monitoring, gene-disease causal 
linkages, identification of correlates of immunity and physiological status, among others. 
Although the data generated Scorn the assays of the invention is suited for manual review and 
analysis, in a preferred embodiment, data processing using high-speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is available. 
For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database system for 
storing biomolecular sequence information in a manner that allows sequences to be catalogued 
and searched according to one or more protem function hierarchies. U.S. Patent 5,953,727 
discloses a relational database having sequence records containing information in a format that 
allows a collection of partial-length DNA sequences to be catalogued and searched according to 
association with one or more sequencing projects for obtaining full-length sequences firom the 
collection of partial length sequences. U.S. Patent 5,706,498 discloses a gene database retrieval 
system for making a retrieval of a gene sequence similar to a sequence data item in a gene 
database based on the degree of similarity between a key sequence and a target sequence. U.S. 
Patent 5,538,897 discloses a method using mass spectroscopy fiiagmentation patterns of "* 
peptides to identify amino acid sequences in compute databases by comparison of predicted 
mass spectra with experimentally-derived mass spectra using a closeness-of-fit measure. U.S. 
Patent 5,926,818 discloses a multi-dunensional database comprising a functionality for multi- 
dimensional data analysis described as on-line analytical processing (OLAP), which entails the 
consolidation of projected and actual data according to more than one consoUdation path or 
dimension. U.S. Patent 5,295,261 reports a hybrid database structure in which the fields of each 
database record are divided into two classes, navigational and informational data, with 
navigational fields stored in a hierarchical topological map which can be viewed as a tree 
structure or as the merger of two or more such tree structures. See also Baxevanis, et al. (2001) 
Bioinformatics: A Practical Guuide to the Analvsis of Genes and Proteins Wiley; Mount (2001) 
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BioinfoTmatics: Sequence and Genome Analysis CSH Press, NY; Duibin, et al. (eds. 1999) 
Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids Cambridge 
University Press; Baxevanis and OeuUette (eds. 1998) Bioinfonnatics: A Practical Guide to the 
Analysis of Genes and Proteins (2d. ed.) Wiley-Liss; Rashidi and Buehler (1999) 
Bioinfonnatics: Basic Applications in Biological Science and Medicine CRC Press; Setubal, et 
al. (eds. 1997) Introduction to Computational Molecular Bioloev Brooks/Cole; Misener and 
Krawetz (eds. 2000) Bioinfonnatics: Methods and Protocols Humana Press; Higgins and Taylor 
(eds, 2000) Bioinformatics: Sequence. Structure, and Databanks: A Practical Approach Oxford 
University Press; Brown (2001) Bioinfonnatics: A Biologist's Guide to Biocomputing and the 
Internet Eaton Pub.; Han and Kamber (2000) Data Mining: Concepts and Techniques ' 
Kaufinann Pub.; and Waterman (1995) Introduction to Computational Biology: Maps, 
Sequences, and Genomes Chap and Hall. 

The present invention provides a computer database comprising a computer and 
software for storing in computer-retrievable form assay data records cross-tabulated, e.g., with 
data specifying the source of the target-containing sample from which each sequence specificity 
record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample is 
from a control tissue sample known to be free of pathological disorders. In a variation, at least 
one of the sources is a known patiiological tissue specimen, e-g., a neoplastic lesion or another 
tissue specimen to be analyzed for cancer. In another variation, the assay records cross-tabulate 
one or more of the following parameters for each target species in a sample: (1) a imique 
identification code, which can include, e.g., a target molecular structure and/or characteristic 
separation coordinate (e.g., electrophoretic coordinates); (2) sample source; and (3) absolute 
and/or relative quantity of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of target data in 
a computer data storage apparatus, which can include magnetic disks, optical disks, magneto- 
optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic bubble 
memory devices, and other data storage devices, including CPU registers and on-CPU data 
storage arrays. Typically, the target data records are stored as a bit pattern in an array of 
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate 
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor and 
a charge storage area, which may be on the transistor). In one embodiment, the invention 
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provides such storage devices, and computer systems built therewith, comprising a bit pattern 
encoding a protein expression fingerprint record comprising unique identifiers for at least 10 
target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a method 
S for identifying related peptide or nucleic acid sequences, comprising performing a computerized 
comparison between a peptide or nucleic acid sequence assay record stored in or retrieved firom 
a computer storage device or database and at least one other sequence. The comparison can 
include a sequence analysis or comparison algorithm or computer program embodiment th^eof 
(e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may be of the relative amount 
10 of a peptide or nucleic acid sequence in a pool of sequences determined firom a polypq)tide or 
nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux, 
SunOS, Solaris, ADC, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed, 
15 Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention in 
a file format suitable for retrieval and processing in a computerized sequence analysis, 
comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices 
linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN Une, 
20 wireless network, optical fiber, or other suitable signal transmission medium, whereby at least 
one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic domains 
(e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells) composing aT)it 
pattem encoding data acquired from an assay of the invmtion. 

The invention also provides a method for transmitting assay data that includes 
25 g^erating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal ad^ter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattem encoding data from an assay or a database 
comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for comparing a 
30 query target to a database containing an array of data stmctures, such as an assay result obtained 
by the method of the invention, and ranking database targets based on the degree of identity and 
gap weight to ihc target data, A central processor is preferably initialized to load and execute 
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the computer program for alignment and/or comparison of the assay results. Data for a query 
target is entered mto tiie central processor via an I/O device. Execution of the computer 
program results in the central processor retrieving the assay data fix>m the data file, which 
comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary 
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 
SDRAM). Targets are ranked according to the degree of correspondence between a selected 
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of 
the query target and results are output via an I/O device. For example, a central processor can 
be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC, MIPS 
4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain molecular 
biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a data file can 
be an optical or magnetic disk, a data server, a memory device (e.g., DRAM, SRAM, SGRAM, 
SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can be a terminal 
comprising a video display and a keyboard, a modem, an ISDN terminal adapter, an Ethernet 
port, a punched card reader, a magnetic strip reader, or other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as that 
described above, which comprises: (1) a computer; (2) a stored bit pattem encoding a collection 
of peptide sequence specificity records obtained by the methods of the invention, which may be 
stored in the computer; (3) a comparison target, such as a query target; and (4) a program for 
aligmnent and comparison, typically with rank-ordering of comparison results on the basis of 
computed similarity values. See, e.g., Ewens and Grant (2001) Statistical Methods in 
Bioinformatics: An Introduction Springer-Verlag. Mathematical approaches can also be used to 
conclude whether similarities or differaices in the gene expression exhibited by dififermt 
samples are significant. See, e.g., Golub, et al, (1999) Science 286:531-537; Duda, et al. (2001) 
Pattem Classification Wiley, and Hastie, et al. (2001) The Elements of Statistic al Learning: 
Data Mining, Inference, and Prediction Springer-Verlag. One approach to determine whether a 
sample is more similar to or has maximum similarity with a given condition between the sample 
and one or more pools representing different conditions for comparison; the pool with the 
smallest vector angle is then chosen as the most similar to the biological sample among the 
pools compared. 



30 



wo 03/042661 



PCT/US02/36810 



Characteristics of cancer-associated proteins 

Cancer proteins of the present invention may be classified as secreted proteins, 
transmembrane proteins, or intracellular proteins. In one embodiment, the cancer protein is an 
intracellular protein. Intracellular proteins may be found in the cytoplasm and/or in the nucleus. 
Intracellular proteins are involved in all aspects of cellular function and replication (including, 
e.g., signaling pathways); aberrant expression of such proteins often results in unregulated or 
disregulated cellular processes (see, e.g., Alberts, et al. (eds. 1994) Molecular Biology of the 
Cell (3d ed.) Garland). For example, many intracellular proteins have enzymatic activity such 
as protein kinase activity, protein phosphatase activity, protease activity, nucleotide cyclase 
activity, polymerase activity, and the like. Intracellular proteins also serve as docking proteins 
that are involved in organizing complexes of proteins, or targeting proteins to various 
subcellular locaUzations, and are involved in maintaining the structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the 
proteins of one or more structural motifs for which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated targets 
in a sequence dependent manner. PTB domains, which are distinct fix>m SH2 domains, also 
bind tyrosme phosphorylated targets. SH3 domains bind to proline-rich targets. In addition, PH 
domains, tetratricopeptide repeats and WD domains to name only a few, have been shown to 
mediate protein-protein interactions. Some of these may also be mvolved in binding to 
phospholipids or other second messengers. These moti& can be identified on the basis oT amino 
acid sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of multiple sequrace 
alignments and hidden Markov models covering many conmion protein domaiiis. Versions are 
available via the internet fcom Washington University in St. Louis, the Sanger Center in 
England, and the Karolmska Institute in Sweden. See, e.g., Bateman, et al. (2000) Nuc. Acids 
Res. 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420 ; Bateman, et al. (1999) Nuc. 
Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320-322. 

In another embodiment, the cancer sequences are transmembrane proteins. 
Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may 
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have an intracellular domain, an extracellular domain, or both. The intracellular domains of 
such proteins may have a number of functions including those already described for 
intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular domain 
of transmembrane proteins serves both roles. For example certain receptor tyrosine kinases 
have both protein kinase activity and SH2 domains. In addition, autophosphorylation of 
tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. For 
example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases and 
receptor serine/threonine protein kinases contain a single transmembrane domain. However, 
various other proteins including channels and adenylyl cyclases contain numerous 
transmembrane domains. Many important cell surface receptors such as G protein coupled 
receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they contain 7 
membrane spannmg regions. Characteristics of transmembrane domains include approximately 
17 consecutive hydrophobic amino acids that may be followed by charged amino acids. 
Therefore, upon analysis of the amino acid sequence of a particular protein, the localization and 
number of transmembrane domains within the protein may be predicted (see, e.g., PSORT web 
site ht^://psort.nibb.ac.jp/). Important transmembrane protein receptors include, but are not 
limited to the insulin receptor, insulin-like growth factor receptor, human growth hormone 
receptor, glucose transporters, transferrin receptor, epidermal growth factor receptor, low 
density lipoprotein receptor, epidermal growth factor receptor, leptin receptor, and interleukin 
recq)tors, e.g., IL-1 receptor, IL-2 receptor, etc. 

The extracellular domains of transmembrane proteins are diverse; however, conserved 
motifs are found repeatedly among various extracellular domains. Conserved structure and/or 
functions have been ascribed to different extracellular motifs. Many extracellular domains are 
involved in binding to ottier molecules. In one aspect, extracellular domains are found on 
receptors. Factors that bind the receptor domain include circulatiuag ligands, which may be 
peptides, proteins, or small molecules such as adenosine and the like. For example, growth 
factors such as EOF, FGF, and PDGF are circulating growth factors that bind to their cognate 
receptors to initiate a variety of cellular responses. Other factors include cytokines, mitogenic 
factors, neurotrophic factors, and the like. Extracellular domains also bind to cell-associated 

32 



wo 03/042661 



PCT/US02/36810 



molecules. In this respect, they may mediate cell-cell interactions- Cell-associated ligands can 
be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) anchor, or may themselves 
be transmembrane proteins. Extracellular domains may also associate with the extracellular 
matrix and contribute to ttie maintenance of the cell structure. 

Cancer proteins that are transmembrane are particularly preferred in the present 
invention as they are readily accessible targets for immunotherapeutics, as are described herein. 
In addition, as outlined below, transmembrane proteins can be also useful in imaging 
modalities. Antibodies may be used to label such readily accessible proteins in situ. 
Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeablized to provide access to intracellular proteins. In addition, some membrane 
proteins can be processed to release a soluble protein, or to expose a residual fragment. 
Released soluble proteins may be useful diagnostic markers, processed residual protein 
fragments may be useful lung markers of disease. 

It will also be appreciated that a transmembrane protein can be made soluble by 
removing transmembrane sequences, e.g., through recombinant methods. Furthermore, 
transmembrane proteins that have been made soluble can be made to be secreted through 
recombinant means by adding an appropriate signal sequence. 

In another embodiment, the cancer proteins are secreted proteins; the secretion of which 
can be either constitutive or regulated. These proteins may have a signal peptide or signal 
sequence that targets the molecule to the secretory pathway. Secreted proteins are involved in 
numerous physiological events; e.g., if circulating, they often serve to transmit signals to 
various other cell types. The secreted protein may frmction in an autocrine manner (acting on 
the cell that secreted the factor), a paracrine manner (actmg on cells in close proximity to the 
cell that secreted the factor), an endocrine manner (acting on cells at a distance, e.g, secretion 
into the blood stream), or exocrine (secretion, e.g., through a duct or to adjacent epithelial 
surface as sweat glands, sebaceous gilands, pancreatic ducts, lacrimal glands, mammary glands, 
wax producing glands of the ear, etc.). Thus secreted molecules often find use in modulating or 
altering numerous aspects of physiology. Cancer proteins that are secreted proteins are 
particularly preferred in the present invention as they serve as good targets for diagnostic 
markers, e.g., for blood, plasma, serum, or stool tests. Those which are enzymes may be 
antibody or small molecule targets. Others may be usefiil as vaccine targets, e.g., via CTL 
mechanisms. 
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Use of cancer nucleic acids 

As described above, cancer sequence is initially identified by substantial nucleic acid 
and/or amino acid sequence homology or linkage to the cancer sequences outlined herein. Such 
homology can be based upon the overall nucleic acid or amino acid sequence, and is generally 
determined as outlined below, using either homology programs or hybridization conditions. 
Typically, linked sequences on a mRNA are found on the same molecule. 

As detailed elsewhere, percent identity can be determined using an algorithm such as 
BLAST. A preferred method utilizes the BLASTN module of WU-BLAST-2 set to the default 
parameters, with overlap span and overiap fraction set to 1 and 0.125, respectively. Alignment 
may include the introduction of gaps in the sequences to be aligned. In addition, for sequences 
which contain either more or fewer nucleotides than those of the nucleic acids described, the 
percentage of homology may be determined based on the number of homologous nucleosides in 
relation to the total number of nucleosides. Thus, e.g., homology of sequences shorter than 
those of the sequences identified will be detennined using the number of nucleosides in the 
shorter sequence. 

In one embodiment, the nucleic acid homology is determined through hybridization 
studies. Thus, e.g., nucleic acids which hybridize under high stringency to a described nucleic 
acid, or its complement, or is also found on naturally occurring mRN As is considered a cancer 
sequence. In another embodiment, less stringent hybridization conditions are used; e.g., 
moderate or low stringency conditions may be used; see Ausubel, supra, and Tijssen, supra. 

The cancer nucleic acid sequences of the invention, e.g., the sequences in Tables 1-80, 
can be fragments of larger genes, e.g., they are nucleic acid segmrats. "Genes" in this context 
includes coding regions, non-coding regions, and mixtures of coding and non-coding regions. 
Accordingly, using flie sequences provided herein, extended sequences, in either direction, of 
the cancer genes can be obtained, using techniques well known for cloning eith^ longer 
sequences or the ML lengfli sequences; see Ausubel, et al., supra. Much can be done by 
mformatics and many sequences can be clustered to include multiple sequences corresponding 
to a smgle gene, e.g., systems such as UniGene (see, http://www.ncbi.nhn.nih.govAJniGene/). 

Once a cancer nucleic acid is identified, it can be cloned and, if necessary, its constituent 
parts recombined to form the entire cancer nucleic acid coding regions or the entire mRNA 
sequence. Once isolated from its natural source, e.g., contained within a plasmid or other vector 
or excised therefrom as a linear nucleic acid segment, the recombinant cancer nucleic acid can 
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be jfiirther used as a probe to identify and isolate other cancer nucleic acids, e.g., extended 
coding regions. It can also be used as a "precursor" nucleic acid to make modified or variant 
cancer nucleic acids and proteins. 

The cancer nucleic acids of the present invention are used in several ways. In one 
5 embodiment, nucleic acid probes to the cancer nucleic acids are made and attached to biochips 
to be used in screening and diagnostic methods, as outlined below, or for administration, e,g., 
for gene therapy, vaccine, RNAi, and/or antisense applications. Alternatively, cancer nucleic 
acids that include coding regions of cancer proteins can be put into expression vectors for the 
expression of cancer proteins, again for screening purposes or for administration to a patient. 
10 In a preferred embodiment, nucleic acid probes to cancer nucleic acids (both the nucleic 

acid sequences outlined in the figures and/or the complements thereof) are made. The nucleic 
acid probes attached to the biochip are designed to be substantially complementary to the cancer 
nucleic acids, e.g., the target sequence (either the target sequence of the sample or to other 
probe sequences, e.g., in sandwich assays), such that hybridization of the target sequence and 
15 the probes of the present invention occurs. As outlined below, this complementarity need not 
be perfect; there may be any number of base pair mismatches which will interfere with 
hybridization between the target sequence and the single stranded nucleic acids of the present 
invention. However, if the number of mutations is so great that no hybridization can occur 
under even the least stringent of hybridization conditions, the sequence is not a complementary 
20 target sequence. Thus, by "substantially complementary" herein is meant tiiat the probes are 
sufficiently complementary to the target sequences to hybridize under normal reaction 
conditions, particularly high stringency conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single and partially 
double stranded. The strandedness of the probe is dictated by the structure, composition, and 
25 properties of the target sequence. In general, the nucleic acid probes range firom about 8-100 
bases long, with fi-om about 10-80 bases being preferred, and fix)m about 30-50 bases being 
particularly preferred. That is, generally whole genes are not used In some embodiments, 
much longer nucleic acids can be used, up to hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with either 
30 overlapping probes or probes to different sections of the target being used. That is, two, three, 
four or more probes, with three being preferred, are used to build in a redimdancy for a 
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particular target. The probes can be overlapping (e.g., have some sequence in coxiunon), or 
separate. In some cases, PGR primers may be used to amplify signal for higher sensitivity. 

Nucleic acids can be attached or immobilized to a solid support in a wide variety of 
ways. By "inunobilized" and grammatical equivalents herein is meant the association or 

5 binding between the nucleic acid probe and the solid support is sufficient to be stable under the 
conditions of binding, washing, analysis, and removal as outlined. The binding can typically be 
covalent or non-covalent. By "non-covalent binding" and grammatical equivalents herein is 
meant one or more of electrostatic, hydrophilic, and hydrophobic interactions. Included in non- 
covalent binding is the covalent attachment of a molecule, e.g., streptavidin to the support and 

10 the non-covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" 
and grammatical equivalents herein is meant that the two moieties, the solid support and the 
probe, are attached by at least one bond, including sigma bonds, pi bonds, and coordination 
bonds, Covalent bonds can be formed directly between the probe and the solid support or can 
be formed by a cross linker or by inclusion of a specific reactive group on either the solid 

15 support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to the biochip in a wide variety of ways. As 
described herein, the nucleic acids can either be synthesized first, with subsequent attachment to 
the biochip, or can be directly synthesized on the biochip. 

20 The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or 

other grammatical equivalents herein is meant a material that can be modified for the 
attachment or association of the nucleic acid probes and is amenable to at least one detection 
method. Often, the substrate may contain discrete individual sites appropriate for individual 
partitioning and identification. The number of possible substrates is very large, and include, but 

25 are not limited to, glass and modified or functionalized glass, plastics (including acrylics, 
polystyrene and copolymers of styrene and other materials, polypropylene, polyethylene, 
polybutylene, polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, 
silica or silica-based materials including silicon and modified silicon, carbon, metals, inorganic 
glasses, plastics, etc. In general, the substrates allow optical detection and do not £^preciably 

30 fluoresce. See WO 0055627. 

Generally the substrate is planar, although other configurations of substrates may be 
used as welL For example, the probes may be placed on the inside surface of a tube for flow- 
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through sample analysis to minimize sample volmne. Similarly, the substrate may be flexible, 
such as a flexible foam, including closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be derivatized 
with chemical functional groups for subsequent attachment of the two. Thus, e.g., the biochip is 

5 derivatized with a chemical functional group including, but not limited to, amino groups, 
caiboxy groups, oxo groups, and thiol groups, with amino groups being particularly preferred. 
Using these functional groups, the probes can be attached using functional groups on the probes. 
For example, nucleic acids containing amino groups can be attached to surfaces comprising 
amino groups, e.g., using linkers; e.g., homo-or hetero-bifimctional linkers as are well known 

10 (see 1994 Pierce Chemical Company catalog, technical section on cross-linkers, pages 155- 
200). In addition, in some cases, additional linkers, such as alkyl groups (including substituted 
and heteroalkyl groups) may be used. 

Jn this embodiment, oligonucleotides are synthesized, and then attached to the surface of 
the solid support. Either the 5' or 3* terminus may be attached to the solid support, or 

15 attachment may be via linkage to an internal nucleoside. In another embodiment, the 
immobilization to the solid support may be very strong, yet non-covalent. For example, 
biotinylated oligonucleotides can be made, which bind to surfaces covalently coated with 
streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface. For example, 

20 photoactivation techniques utilizing photopolymerization compounds and techniques are used. 
In a preferred embodiment, the nucleic acids can be synthesized in situ, using known 
photolithographic techniques, such as those described in WO 95/25 1 16; WO 95/35505; U.S. 
Patent Nos, 5,700,637 and 5,445,934; and references cited within, all of which are expressly 
incorporated by reference; these methods of attachment form the basis of the Affymetrix 

25 GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression level of 
cancer-associated sequences. These assays are typically p^ormed in conjunction with reverse 
transcription. In such assays, a cancer-associated nucleic acid sequence acts as a template in an 
amplification reaction (e.g., Polymerase Chain Reaction, or PCR). In a quantitative 

30 amplification, the amount of amplification product will be proportional to the amount of 

template in the original sample. Comparison to appropriate controls provides a measure of the 
amount of cancw-associated RNA. Methods of quantitative amplification are well known. 
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Detailed protocols for quantitative PGR are provided, e.g., in Innis, et al. (1990) PGR Protocols: 
A Guide to Methods and Applications Academic Press. 

In some embodiments, a TaqMan based assay is used to measure expression. TaqMan 
based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent dye and a 3' 
5 quenchmg agent. The probe hybridizes to a PGR product, but cannot itself be extended due to a 
blocking agent at the 3* end. When the PGR product is amplified in subsequent cycles, the 5' 
nuclease activity of the polymerase, e.g., AmpliTaq, results in the cleavage of the TaqMan 
probe. This cleavage separates the 5' fluorescent dye and the 3' quenching agent, thereby 
resultmg in an increase in fluorescence as a function of amplification (see, e.g., literature 

1 0 provided by Perkin-Elmer, e.g., www2.perkin-elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase chain 
reaction (LGR) (see Wu and Wallace (1989) Genomics 4:560-569, Landegren, et al. (1988) 
Science 241:1077-1080, and Baninger, et al (1990) Gene 89:117-122), transcription 
amplification (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1 173-1 177), self-sustained 

15 sequence replication (Guatelli, et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), dot 
PGR, linker adapter PGR, etc. 
Expression of cancer proteins firom nucleic acids 

In a preferred embodiment, cancer nucleic acids, e.g., encoding cancer proteins, are used 
to make a variety of expression vectors to express cancer proteins which can then be used in 

20 screening assays, as described below. Expression vectors and recombinant DNA technology are 
well known (see, e.g., Ausubel, supra, and Fernandez and Hoeffler (eds. 1999) Gene Expression 
Systems Academic Press) to express protems. The expression vectors may be either self^ 
replicatmg extrachiomosomal vectors or vectors which integrate into a host genome. Generally, 
these expression vectors include transcriptional and translational reguhitory nucleic acid 

25 operably linked to the nucleic acid encoding the cancer protein. The texm "control sequences" 
refers to DNA sequences used for the expression of an operably linked coding sequence in a 
particular host organismu Control sequences that are suitable for prokaryotes, e.g., include a 
promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

30 Nucleic acid is "operably linked" when it is placed into a functional relationship with 

another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 
operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in 
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the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence 
if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a 
coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" 
means that the DNA sequences being linked are contiguous, and, in the case of a secretory 

5 leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. 
Linking is typically accomplished by ligation at convenient restriction sites. If such sites do not 
exist, synthetic oligonucleotide adaptors or linkers are used in accordance with conventional 
practice. Transcriptional and translational regulatory nucleic acid will generally be appropriate 
to the host cell used to express the cancer protein. Numerous types of appropriate expression 

10 vectors and suitable regulatory sequences are known for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, but are 
not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 
sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start and 

15 stop sequences. 

Promoter sequences may be either constitutive or inducible promoters. The promoters 
may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which 
combine elements of more than one promoter, are also known, and are useful in the present 
invention. 

20 An expression vector may comprise additional elem^ts. For example, the expression 

vector may have two replication systems, thus allowing it to be maintained in two organisms, 
e.g., in mammalian or insect cells for expression and in a prokaryotic host for cloning ancl 
amplification. Furthermore, for integrating expression vectors, the expression vector often 
contains at least one sequrace homologous to ttie host cell genome, and preferably two 

25 homologous sequences which flank the expression construct. The integrating vector may be 
directed to a specific locus in the host cell by selecting the appropriate homologous sequence 
for inclusion in the vector. Constructs for integrating vectors are available. See, e.g., 
Fernandez and HoefQer, supra; and Kitamura, et al. (1995) Proc. Natl Acad. Sci. USA 92:9146- 
9150. 

30 In addition, in a preferred embodiment, the expression vector contains a selectable 

marker gene to allow the selection of transformed host cells. Selection genes are well known 
and will vary with the host cell used. 
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The cancer proteins of fhe present invention are usually produced by culturing a host cell 
transformed with an expression vector containing nucleic acid encoding a cancer protein, under 
the appropriate conditions to induce or cause expression of ttie cancer protein. Conditions 
appropriate for cancer protein expression will vary with the choice of the expression vector and 

5 the host ceil, and will be easily ascertained through routine experimentation or optimization. 
For example, the use of constitutive promoters in the expression vector will require optimizing 
the growth and proliferation of the host cell, while the use of an inducible promoter requires the 
appropriate growth conditions for induction. In addition, in some embodiments, the timing of 
the harvest is important. For example, the baculoviral systems used in insect cell expression are 

10 lytic viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and 
animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae 
and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, CI 29 cells, 293 cells, Neurospora, BHK, 
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THPl cells (a 

15 macrophage cell line), and various other human cells and cell lines. 

Li a preferred embodiment, the cancer proteins are expressed in mammaUan cells. 
Mammalian expression systems may be used, and include retroviral and adenoviral systems. 
One expression vector system is a retroviral vector system such as is generally described in 
PCTAJS97/01019 and PCT/US97/01048. Of particular use as mammalian promoters are ttie 

20 promoters from mammalian viral gmes, since fhe viral genes are often highly expressed and 
have.a broad host range. Examples include the SV40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus mq'or late promoter, herpes simplex virus promoter, and the 
CMV promoter (see, e.g., Fernandez and HoefQ^, supra). Typically, transcription termination 
and polyadenylation sequences recognized by mammalian cells are regulatory regions located 3' 

25 to the translation stop codon and thus, together wifii the promoter elemoits, flank the coding 
sequence. Examples of transcription terminator and polyadenlyation signals include those 
derived from S V40. 

Methods of introducing exogenous nucleic acid into mammalian hosts, as well as other 
hosts, are available, and will vary with the host cell used. Techniques include dextran-mediated 
30 transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast 

fusion, electroporation, viral infection, encapsulation of the polynucleotide(s) in Uposomes, and 
direct microinjection of the DNA into nuclei. 
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In a preferred embodiment, cancer proteins are expressed in bacterial systems. 
Promoters from bacteriophage may also be used. In addition, synthetic promoters and hybrid 
promoters are also useful; e.g., the tac promoter is a hybrid of the tip and lac promoter 
sequences. Furthermore, a bacterial promoter can include naturally occurring promoters of non- 
5 bacterial origin that have the ability to bind bacterial RNA polymerase and initiate transcription. 
In addition to a functioning promoter sequence, an efficient ribosome binding site is desirable. 
The expression vector may also include a signal peptide sequence that provides for secretion of 
the cancer protein in bacteria. The protein is either secreted into the growth media (gram- 
positive bacteria) or into the periplasmic space, located between the inner and outer membrane 

10 of the cell (gram-negative bacteria). The bacterial expression vector may also include a 

selectable marker gene to allow for the selection of bacterial strains that have been transfomied.' 
Suitable selection genes include genes which render the bacteria resistant to drugs such as 
ampicillin, chloramphenicol, erythromycin, kanamycin, neomycin, and tetracycline. Selectable 
markers also include biosynthetic genes, such as those in the histidine, tryptophan, and leucine 

15 biosynthetic pathways. These components are assembled into expression vectors. Expression 
vectors for bacteria are well known, and include vectors for Bacillus subtilis, E. coli, 
Streptococcus cremoris, and Streptococcus lividans, among others (e.g., Fernandez and 
Hoeffler, supra). The bacterial expression vectors are transformed into bacterial host cells xising 
techniques such as calcium chloride treatment, electroporation, and others. 

20 In one embodiment, cancer proteins are produced in insect cells using, e.gv, expression 

vectors for the transformation of insect cells, and in particular, baculovirus-based expression 
vectors. 

In a prefenred embodiment, a cancer protein is produced in yeast cells. Yeast expression 
systems are well known, and include expression vectors for Saccharomyces cerevisiae, Candida 
25 albicans and C. maltosa, Hansenula polymorpha, Kluyveromyces fragilis and K. lactis, Pichia 
guillerimondii and P. pastoris, Schizosaccharomyces pombe, and Yarrowia lipolytica. 

The cancer protein may also be made as a fusion protein, using available techniques. 
Thus, e.g., for the creation of monoclonal antibodies, if the desired epitope is small, the cancer 
protein may be fused to a carrier protein to form an immunogen. Alternatively, the cancer 
30 protein may be made as a fusion protein to increase expression, or for other reasons. For 

example, when the cancer protein is a cancer peptide, the nucleic acid encoding the peptide may 
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be linked to other nucleic acid for expression purposes. Fusion with detection epitope tags can 
be made, e.g., with FLAG, His6, myc, HA, etc. 

In a preferred embodiment, the cancer protein is purified or isolated after expression. 
Cancer proteins may be isolated or purified in a variety of ways depending on what other 

5 components are present in the sample and the requirements for purified product, e.g., natural 
conformation or denatured Standard purification methods include ammonium sulfate 
precipitations, electrophoretic, molecular, immunological, and chromatographic techniques, 
including ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography, and 
chromatofocusing. For example, the cancer protein may be purified using a standard anti- 

10 cancer protein antibody column. Ultrafiltration and diafiltration techniques, in conjunction with 
protein concentration, are also useful. See, e.g., Walsh (2002) Proteins: Biochemistrv and 
Biotechnology Wiley, Hardin, et al. (eds. 2001) Cloning, Gene Expression and Protein 
Purification Oxford Univ. Press; Wilson, et al. (eds. 2000) Encyclopedia of Separation Science 
Academic Press; and Scopes (1993) Protein Purification Springer-Verlag. The degree of 

15 purification necessary will vary depending on the use of the cancer protein, hi some instances 
no purification will be necessary. 

Once expressed and purified if necessary, the cancer proteins and nucleic acids are 
useful in a number of applications. They may be used as immunoselection reagents, as vaccine 
reagents, as screening agents, therapeutic entities, for production of antibodies, as transcription 

20 or translation inhibitors, etc. 
Variants of cancer proteins 

Also included within one embodiment of cancer proteins are amino acid variants of the 
naturally occurring sequences, as determined herein. Preferably, the variants are preferably 
greater than about 75% homologous to the wild-type sequ^ce, more preferably greater than 

25 about 80%, even more preferably greater ttian about 85%, and most preferably greater than 
90%. In some embodiments the homology will be as high as about 93-95% or 98%. As for 
nucleic acids, homology in this context means sequence similarity or identity, with identity 
being preferred. This homology will be determined using standard techniques, as are outlined 
above for nucleic acid homologies. 

30 Cancer proteins of the present invention may be shorter or longer than the wild type 

amino acid sequences. Thus, in a preferred embodiment, included within the definition of 
cancer proteins are portions or firagments of the wild type sequences herein. In addition, as 
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outlined above, the cancer nucleic acids of tiie invention may be used to obtain additional 
coding regions, and thus additional protein sequence. 

In one embodiment, the cancer proteins are d^vative or variant cancer proteins as 
compared to the wild-type sequence. That is, as outlined more fully below, the derivative 
cancer peptide will often contain at least one amino acid substitution, deletion, or insertion, with 
amino acid substitutions being particularly preferred. The amino acid substitution, insertion, or 
deletion may occur at many residue positions within the cancer peptide. 

Also included within one embodiment of cancer proteins of the present invention are 
amino acid sequence variants. These variants typically fall into one or more of three classes: 
substitutional, insertional, or deletional variants. These variants ordinarily are prepared by site 
specific mutagenesis of nucleotides in the DNA encoding the cancer protein, using cassette or 
PGR mutagenesis or other techniques, to produce DNA encoding the variant, and thereafter 
expressing the DNA in recombinant cell culture as outlined above. However, variant cancer 
protein fragments having up to about 100-150 residues may be prepared by in vitro synthesis 
using established techniques. Amino acid sequence variants are characterized by the 
predetermined nature of the variation, a feature that sets th^ apart from naturally occurring 
allelic or interspecies variation of the cancer protein amino acid sequence. The variants 
typically exhibit a similar qualitative biological activity as a naturally occurring analogue, 
althou^ variants can also be selected which have modified characteristics. 

While the site or region for introducing an amino acid sequence variation is often 
predetennined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be conducted 
at the target codon or region and the expressed cancer variants screened for the optimal 
combination of desired activity. Techniques for making substitution mutations at predetCTnined 
sites in DNA having a known sequence are well known, e.g., M13 primer mutagenesis and PGR 
mutagenesis. Soreening of mutants is often done using assays of cancer protein activities. 

Amino acid substitutions are typically of single residues; insertions usually will be on 
the order of from about 1-20 amino acids, although considerably larger insertions may be 
tolerated. Deletions generally range Scorn about 1-20 residues, although in some cases deletions 
may be much larger. 

Substitutions, deletions, insertions, or combination thereof may be used to arrive at a 
final derivative. Generally these changes are done on a few amino acids to minimize the 
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alteration of the molecule. However, lai:ger changes may be tolerated in certain circumstances. 
When small alt^tions in the characteristics of the cancer protein are desired, substitutions are 
generally made in accordance with the amino acid substitution relationships described. 

The variants typically exhibit essentially the same qualitative biological activity and will 
elicit the same immune response as a naturally-occurring analog, although variants also are 
selected to modify the characteristics of cancer proteins as needed. Alternatively, the variant 
may be designed such that a biological activity of the cancer protein is altered. For example, 
glycosylation sites may be added, altered, or removed. 

Substantial changes in function or immunological identity are sometimes made by 
selecting substitutions that are less conservative than those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide 
backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; the 
charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
Substitutions which generally are expected to produce the greatest changes in the polypeptide's 
properties are those in which (a) a hydrophilic residue, e.g., serine or threone is substituted for 
(or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, valine, or alanine; (b) a 
cysteine or proline is substituted for (or by) another residue; (c) a residue having an 

i 

electropositive side chain, e.g., lysine, arginine, or histidine, is substituted for (or by) an 
electronegative residue, e.g., glutamic or aspartic acid; (d) a residue having a bulky side chain, 
e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine; or (e) a 
proline residue is incorporated or substituted, which changes the degree of rotational fireedom of 
the peptidyl bond. 

Variants typically exhibit a shnilar qualitative biological activity and will elicit tiie same 
immune response as the naturally-occurring analog, althougih variants also are selected to 
modify the charact^stics of the skin cancer proteins as needed. Alternatively, the variant may 
be designed such that the biological activity of the cancer protein is altered. For example, 
glycosylation sites may be altered or removed. 

Covalent modifications of cancer polypeptides are included within the scope of this 
invention. One type of covalent modification includes reacting targeted amino acid residues of 
a cancer polypeptide with an organic derivatizing agent that is capable of reacting with selected 
side chains or the N-or C-terminal residues of a cancer polypeptide. Derivatization with 
bifimctional agents is useful, for instance, for crosslinking cancer polypeptides to a water- 
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insoluble support matrix or sur&ce for use in a method for purifying anti-cancer polypeptide 
antibodies or screening assays, as is more fully described below. Commonly used crosslinking 
agents include, e.g., l,l-bis(diazoacetyl>2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, e.g., esters with4-azidosalicylic acid, homobifunctional 
5 imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidyIpropionate), 
bifimctional maleimides such as bis-N-maleimido-l,8-octane and agents such as methyl-3-((p- 
a2idophenyl)dithio)propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the 
corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, 

10 phosphorylation of hydroxyl groups of serinyl, threonyl, or tyrosyl residues, methylation of the 
amino groups of the lysine, arginine, and histidine side chains (e.g., pp. 79-86, Creighton (1992) 
Proteins: St ructure and Molecular Properties Freeman), acetylation of the N-teiminal amine, 
and amidation of a C-teiminal carboxyl group. 

Another type of covalent modification of the cancer polypeptide included within the 

15 scope of this invention comprises altering the native glycosylation pattern of the polypeptide. 
"Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one 
or more carbohydrate moieties found in native sequence cancer polypeptide, and/or adding one 
or more glycosylation sites that are not present in the native sequence cancer polypeptide. 
Glycosylation patterns can be altered in many ways. Different cell types to express cancer- 

20 associated sequences can result in different glycosylation patterns. 

Addition of glycosylation sites to cancer polypeptides may also be accomplished by 
altering the amino add sequence thereof. The alteration may be made, e.g., by the addition o^ 
or substitution by, one or more serine or threonine residues to the native sequence cancer 
polypeptide (for O-linked glycosylation sites). The cancer amino acid sequence may optionally 

25 be altered through changes at the DNA level, particularly by mutating the DNA encoding the 
cancer polypeptide at preselected bases such that codons are generated that will translate into 
the desired anuno acids. 

Another means of increasing the number of carbohydrate moieties on the cancer 
polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. See, e.g., 

30 WO 87/05330; pp. 259-306 in Aplin and Wriston (1981) CRC Crit. Rev. Biochem. 

Removal of carbohydrate moieties present on the cancer polypeptide inay be 
accomplished chemically or enzymatically or by mutational substitution of codons encoding for 
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amino acid residues that serve as targets for glycosylation. Chemical deglycosylation 
techniques are applicable. See, e,g., Sojar and Bahl (1987) Arch. Biochem, Biophvs, 259:52-57 
and Edge, et al. (1981) Anal. Biochem. 118:131-137. Enzymatic cleavage of carbohydrate 
moieties on polypeptides can be achieved by the use of a variety of endo-and exo-glycosidases. 

5 See, e.g., Thotakura, et al. (1987) Meth. Enzvmol. 138:350-359. 

Another type of covalent modification of cancer comprises linking the cancer 
polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol, 
polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192, or 4,179,337. 

10 Cancer polypq)tides of the present invention may also be modified in a way to form 

chimeric molecules comprising a cancer polypeptide fused to another heterologous polypeptide 
or amino acid sequence. In one embodiment, such a chimeric molecule comprises a fusion of a 
cancer polypeptide with a tag polypeptide which provides an epitope to which an anti-tag 
antibody can selectively bind. The epitope tag is generally placed at the amino-or carboxyl- 

1 5 terminus of the cancer polypeptide. The presence of such epitope-tagged forms of a cancer 
polypeptide can be detected using an antibody against the tag polypeptide. Also, provision of 
the epitope tag enables the cancer polypeptide to be readily purified by affinity purification 
using an anti-tag antibody or another type of affinity matrix that binds to the epitope tag. In an 
alt^ative embodiment, the chimeric molecule may comprise a fusion of a cancer polypeptide 

20 with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of 
the chimeric molecule, such a fiision could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are available. Examples'' 
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 and metal 
chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. (1988) Mol. 

25 Cell. Biol. 8:2159-2165); flie c-myc tag and the 8F9, 3C7, 6E10, G4, B7, and 9E10 antibodies 
thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616): and the Herpes 
Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. (1990) Protein 
Engineering 3(6):547-553). Other tag polypeptides include the Flag-peptide (Hopp, et al. 
(1988) BioTechnology 6:1204-1210); the KT3 epitope peptide (Martin, et al. (1992) Science 

30 255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. BioL Chem. 266:15163-15166); 
and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) Proc. Natl. Acad. Sci. 
USA 87:6393-6397). 
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Also included are other cancer proteins of the cancer family, and cancer proteins fiom 
other organisms, which are cloned and expressed as outlined below. Thus, probe or degenerate 
polymerase chain reaction (PGR) primer sequences may be used to find other related cancer 
proteins fix)m humans or other organisms. Particularly usefiil probe and/or PGR primer 

5 sequences include the unique areas of the cancer nucleic acid sequence. Preferred PGR primers 
are &om about 15-35 nucleotides in length, with fi-om about 20-30 being preferred, and may 
contain inosine as needed. The conditions for PGR reaction have been well described (e.g., 
Innis, PGR Protocols, supra). 

In addition, cancer proteins can be made that are longer than those encoded by the 

10 nucleic acids of the Tables, e.g., by the elucidation of extended sequences, the addition of 
epitope or purification tags, the addition of other fiision sequences, etc. 

Cancer proteins may also be identified as being encoded by cancer nucleic acids. Thus, 
cancer proteins are encoded by nucleic acids that will hybridize to the sequences of the 
sequence listings, or their complements, as outlined herein. 

1 5 Antibodies to cancer proteins 

In a preferred embodiment, when the cancer protein is to be used to generate antibodies, 
e.g., for immunotherapy or immunodiagnosis, the cancer protein should share at least one 
epitope or determinant with the full length protein. By "epitope" or "determinant" herein is 
typically meant a portion of a protein which will generate and/or bind an antibody or T-cell 

20 receptor in the context of MHC. Thus, in most instances, antibodies made to a smaller cancer 
protein will be able to bind to the full-length protein, particularly linear epitopes. In a preferred 
embodiment, the q)itope is unique; that is, antibodies generated to a unique epitope show little 
or no cross-reactivity. In a preferred embodiment, the epitope is selected fi:om a protein 
sequence set out in the tables. 

25 Methods of preparing polyclonal antibodies exist (e.g., Coligan, supra; and Harlow and 

Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or more injections 
of an immunizing agent and, if desired, an adjuvant Typically, the immunizing agent and/or 
adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. 
The immunizing agent may include a protein encoded by a nucleic acid of Tables 2A-80 or 

30 fi:agment fliereof or a fusion protein thereof It may be useful to conjugate the immunizing 
agent to a protein known to be inununogenic in the mammal being immunized. Examples of 
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum 
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albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which 
may be employed include Freund's complete adjuvant and MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). Various immunization 
protocols may be used. 

5 The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies 

may be prepared using hybridoma methods, such as those described by Kohler and Milstein 
(1975) Nature 256:495. In a hybridoma method, a mouse, hamster, or other appropriate host 
ammal, is typically immunized with an inmiunizing agent to elicit lymphocytes that produce or 
are capable of producing antibodies that will specifically bind to the inmiunizing agent. 

10 Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will 

typically include a polypeptide encoded by a nucleic acid of the tables or firagment thereof, or a 
fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are used if 
cells of human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell line 

15 using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (e.g., pp, 
59-103 in Goding (1986) Monoclonal Antibodies: Principles and Practice Academic Press). 
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of 
rodent, bovine, or human origin. Usually, rat or mouse myeloma cell lines are employed. The 
hybridoma cells may be cultured in a suitable culture medium that preferably contains one or 

20 more substances that inhibit the growth or survival of the unfused, immortalized cells. For 

example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase 
(HGPRT or HPRT), the culture medium for the hybridomas typically will include 
hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the 
growth of HGPRT-deficient cells. 

25 In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are 

monoclonal, preferably human or humanized, antibodies that have binding specificities for at 
least two different antigens or that have bmding specificities for two epitopes on the same 
antigen. In one embodiment, one of the binding specificities is for a protein encoded by a 
nucleic acid of the tables or a firagment thereof, the other one is for another antigen, and 

30 preferably for a cell-surface protein or receptor or receptor subunit, preferably one that is tumor 
specific. Altematively, tetramer-type technology may create multivalent reagents. 
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In a preferred embodiment, the antibodies to cancer protein are capable of reducing or 
eliminating a biological function of a cancer protein, in a naked form or conjugated to an 
effector moiety, as is described below. That is, the addition of anti-cancer protein antibodies 
(either polyclonal or preforably monoclonal) to cancer tissue (or cells containing cancer) may 

5 reduce or eliminate the cancCT, Generally, at least a 25% decrease in activity, growth, size, or 
the like is preferred, with at least about 50% being particularly preferred and rf)out a 95-100% 
decrease being especially preferred. 

In a preferred embodiment the antibodies to the cancer proteins are humanized 
antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein Design Labs, Inc.) 

10 Humanized forms of non-hxmian (e.g., murine) antibodies are chimeric molecules of 

immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 
or other antigen-binding subsequences of antibodies) which contain niiiumal sequence derived 
from non-human immunoglobulin. Humanized antibodies include human immunoglobulins 
(recipient antibody) in which residues from a complementary determining region (CDR) of the 

15 recipient are replaced by residues from a CDR of a non-human species (donor antibody) such as 
mouse, rat, or rabbit having the desired specificity, ajSinity, and capacity. In some instances, Fv 
framework residues of a human inununoglobulin are replaced by corresponding non-hiunan 
residues. Humanized antibodies may also comprise residues which are found neither in the 
recipient antibody nor in the imported CDR or framework sequences. In general, a humanized 

20 antibody will comprise substantially all of at least one, and typically two, variable domains, in 
which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will typically 
comprise at least a portion of an immunoglobulin constant region (Fc)y typically that of a human 

25 immunoglobulm (Jones, et al. (1986) Nature 321 :522-525; Riechmann, et al. (1988) Nature 
332:323-329; and Presta (1992) Cuir. On. Struct. Biol. 2:593-596). Humanization can be 
essentially performed following the method of Winter and co-workers (Jones, et al. (1986) 
Nature 321:522-525; Riechmann, et al. (1988) Nature 332:323-327; Verhoeyen, et al. (1988) 
Science 239:1534-1536), by substituting rodent CDRs or CDR sequences for corresponding 

30 sequences of a himian antibody. Accordingly, such humanized antibodies are chimeric 

antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact human variable 
domain has been substituted by corresponding sequence from a non-human species. 
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Human antibodies can also be produced using phage display libraries (Hoogenboom and 
Winter (1992) J. MoL BioL 227:381-388; Marks, et al. (1991) J. MoL Biol. 222:581-597) or 
human monoclonal antibodies (e.g., p. 77, Cole, et al. in Reisfeld and Sell (1985) Monoclonal 
Antibodies and Cancer Therapy Liss; and Boemer, et al. (1991) T TmniiiTinl 147:86-95). 

5 Similarly, human antibodies can be made by introducing human immunoglobulin loci into 
transgenic animals, e.g., mice in which the endogenous iromunoglobulin genes have been 
partially or completely inactivated. Upon challenge, human antibody production is observed, 
which closely resembles that seen in humans in nearly all respects, including gene 
rearrangement, assembly, and antibody repertoire. This approach is described, e.g., in U.S. 

10 Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the 

following scientific publications: Marks, et al, (1992) Bio/Technology 10:779-783; Lonberg, et 
al. (1994) Nature 368:856-859; Morrison (1994) Nature 368:812-13; Fishwild, et al. (1996) 
Nature Biotechnology 14:845-851; Neuberger (1996) Nature Biotechnology 14:826; and 
Lonberg andHuszar (1995) Intem. Rev. Immunol. 13:65-93. 

15 By immunotherapy is meant treatment of cancer with an antibody raised against cancer 

proteins. As used herein, immunotherapy can be passive or active. Passive immunothen^y as 
defined herein is the passive transfer of antibody to a recipient (patioit). Active immunization 
is the induction of antibody and/or T-cell responses in a recipient (patiait). hiduction of an 
immune response is the result of providing the recipient with an antigen to which antibodies are 

20 raised. The antigen may be provided by injecting a polypeptide against which antibodies are 
desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 
expressing the antigen and under conditions for expression of the antigen, leading to an immune 
response. 

In a preferred ^bodiment the cancer proteins against which antibodies are raised are 
25 secreted proteins as described above. Without being bound by theory, antibodies used for 
treatment may bind and prevent flie secreted protein bom bmding to its receptor, thereby 
inactivating the secreted cancer protein, e.g., in autocrine signaling. 

In another preferred embodiment, the cancer protein to which antibodies are raised is a 
transmembrane protein. Without being bound by theory, antibodies used for treatment may 
30 bind the extracellular domain of the cancer protein and prevent it fix)m binding to other proteins, 
such as circulating ligands or cell-associated molecules. The antibody may cause down- 
regulation of the transmembrane cancer protein. The antibody may be a competitive, non- 
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competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
cancer protein. The antibody may also be an antagonist of the cancer protein. Further, the 
antibody may prevent activation of the transmembrane cancer protein, or may induce or 
suppress a particular cellular pathway. In one aspect, when the antibody prevents the binding of 
5 other molecules to the cancer protein, the antibody prevents growth of the cell. The antibody 
may also be used to target or sensitize the cell to cytotoxic agents, including, but not limited to 
TNF-a, TNF-p, EL-l, INF-y, and IL-2, or chemotherapeutic agents including 5FU, vinblastine, 
actinomycin D, cisplatin, methotrexate, and the like. In some instances the antibody may 
belong to a sub-type that activates serum complement when complexed with the transmembrane 
10 protein thereby mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, 
cancer may be treated by administering to a patient antibodies directed against the 
transmembrane cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin 
payload, target a drug loaded liposome, or otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector moiety. The 
1 S effector moiety can be various molecules, including labeling moieties such as radioactive labels 
or fluorescent labels, or can be a therapeutic moiety. In one aspect the therapeutic moiety is a 
small molecule &at modulates the activity of a cancer protein. In another aspect the therapeutic 
moiety may modulate the activity of inolecules associated with or in close proximity to a cancer 
protein. The therapeutic moiety may inhibit enzymatic or signaling activity such as protease or 
20 coUagenase or protein kinase activity associated with cancer, or be an attractant of other cells, 
such as NK cells. See, e.g., USSN 09/544,494. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In this 
method, targeting the cytotoxic agent to cancer tissue or cells results in a reduction in the 
number of afQicted cells, thereby reducing symptoms associated with cancer. Cytotoxic agents 
25 are numerous and varied and include, but are not limited to, cytotoxic drugs or toxins or active 
fragments of such toxins. Suitable toxins and their corresponding fragments include diphttieria 
A chain, exotoxin A chain, ricin A chain, abrin A chain, curcin, crotin, phenomycin, enomycin, 
saporin, auristatin, and the like. Cytotoxic agents also include radiochemicals made by 
conjugating radioisotopes to antibodies raised against cancer proteins, or binding of a 
30 radionuclide to a chelating agent that has been covalently attached to the antibody. Targeting 
the therapeutic moiety to transmembrane cancer proteins not only serves to increase the local 
concentration of therapeutic moiety in the cancer afflicted area, but also serves to reduce 
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deleterious side effects that may be associated with the untargeted tfaenq)eutic moiety. 
Antibody fragments may be used to target toxin loaded liposomes. 

In another preferred embodiment, die cancer protein against which the antibodies are 
raised is an intracellular protein. In this case, the antibody may be conjugated to a protein 
5 which facilitates entry into the cell. In one case, the antibody enters the cell by endocytosis. In 
another embodiment, a nucleic acid encoding the antibody is administered to the individual or 
cell. Moreover, wherein the cancer protein can be targeted within a cell, e.g., the nucleus, an 
antibody thereto may contain a signal for that target localization, e.g., a nuclear localization 
signal. 

10 The cancer antibodies of the invention specifically bind to cancer proteins. By 

"specifically bind" herein is meant that the antibodies bind to the protein with a of at least 
about 0.1 mM, more usually at least about 1 /iM, preferably at least about 0.1 or better, and 
most preferably, 0.01 fjM or better. Selectivity of binding to the specific target and not to 
related sequences is often also important. 

1 5 Detection of cancer sequence for diagnostic and therapeutic applications 

In one aspect, the SNA expression levels of genes are determined for different cellular 
states in the cancer phenotype. Expression levels of genes in normal tissue (e.g., not 
undergoing cancer) and in cancer tissue (and in some cases, for varying severities of cancer that 
relate to prognosis, as outlined below), or in non-malignant disease are evaluated to provide 

20 expression profiles. A gene expression profile of a particular cell state or point of development 
is essentially a "fingerprint" of the state of the cell. While two states may have a particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the generation 
of a gene mpression profile that is reflective of the state of the cell. By comparing expression 
profiles of cells in different states, information regarding which genes are important (including 

25 both up- and down-regulation of genes) in each of these states is obtanied. Then, diagnosis may 
be performed or confirmed to determine whether a tissue sample has the gene expression profile 
of normal or cancerous tissue. This will provide for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to qualitative 
or quantitative differences in the temporal and/or cellular gene expression pattems within and 

30 among cells and tissue. Thus, a differentially expressed gene can qualitatively have its 

expression altered, including an activation or inactivation, in, e.g., normal versus cancer tissue. 
Genes may be turned on or turned off in a particular state, relative to another state thus 
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permitting comparison of two or more states. A qualitatively regulated gene will exhibit an 
expression pattern within a state or cell type which is detectable by standard techniques. Some 
genes will be expressed in one state or cell type, but not in both. Alternatively, the difference in 
expression may be quantitative, e.g., in that expression is increased or decreased; e.g., gene 

5 expression is either upregulated, resulting in an increased amount of transcript, or 

downregulated, resulting in a decreased amount of transcript. The degree to which expression 
differs need only be large enough to quantify via standard characterization techniques as 
outlined below, such as by use of Afiymetrix GeneChip™ expression arrays. See, Lockhart 
(1996) Nature Biotechnology 14:1675-1680. Other techniques include, but are not limited to, 

10 quantitative reverse transcriptase PGR, northern analysis, and RNase protection. As outlined 
above, preferably the change in expression (e.g., upregulation or downregulation) is at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 
• preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 
Evaluation may be at the gene transcript or the protein level. The amount of gene 

IS expression may be monitored using nucleic acid probes to the RNA or DNA equivalent of the 
gene transcript, and the quantification of gene expression levels, or, alternatively, the final gene 
product itself (protein) can be monitored, e.g., with antibodies to the cancer protein and standard 
immunoassays (ELIS As, etc.) or other techniques, including mass spectroscopy assays, 2D gel 
electrophoresis assays, etc. Proteins corresponding to cancer genes, e.g., those identified as 

20 being important in a cancer or disease phenotype, can be evaluated in a cancer diagnostic test. 
In a preferred embodiment, gene expression monitoring is performed simultaneously on a 
number of genes. Multiple protem expression monitoring can be performed as well. 

In this embodiment, the cancer nucleic acid probes are attached to biochips as outlined 
herein for the detection and quantification of cancer sequences in a particular cell. The assays 

25 are further described below in the example. PGR techniques can be used to provide greater 
sensitivity. 

In a pref^ied embodiment nucleic acids encoding the cancer protein are detected. 
Although DNA or KNA encoding the cancer protein may be detected, of particular interest are 
methods wherein an mRNA encoding a cancer protein is detected. Probes to detect mRNA can 
30 be a nucleotide/deoxynucleotide probe that is complementary to and hybridizes with the mRNA 
and includes, but is not limited to, oligonucleotides, cDNA, or RNA. Probes also should 
contain a detectable label, as defined herein. In one method the mRNA is detected after 
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immobilizing the nucleic acid to be examined on a solid support such as nylon membranes and 
hybridizing the probe with tiie sample. Following washing to remove the non-specifically 
bound probe, the label is detected. In another method, detection of the mRNA is performed in 
situ. In this method permeabilized cells or tissue samples are contacted with a detectably 

5 labeled nucleic acid probe for sufficient time to allow the probe to hybridize with the target 
mRNA. Following washing to remove the non-specifically boxmd probe, the label is detected. 
For example a digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA 
encoding a cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3-indoyl 

10 phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins as 
described herein (secreted, transmembrane, or intracellular proteins) are used in diagnostic 
assays. The cancer proteins, antibodies, nucleic acids, modified proteins, and cells containing 
cancer sequences are used in diagnostic assays. This can be performed on an individual gene or 

1 5 corresponding polypeptide level. In a preferred embodiment, the expression profiles are used, 
preferably in conjunction with high throughput screening techniques to allow monitoring for 
expression profile genes and/or corresponding polypeptides. 

As described and defined herein, cancer proteins, including intracellular, 
transmembrane, or secreted proteins, find use as markors of cancer, e.g., for prognostic or 

20 diagnostic purposes. Detection of these proteins in putative cancer tissue allows for detection, 
prognosis, or diagnosis of cancer or similar disease, and for selection of therq)eutic strategy. In 
one CTibodiment, antibodies are used to detect cancer proteins. A preferred method separates 
proteias from a sample by electrophoresis on a gel (typically a denaturing and reducing protein 
gel, but may be another type of gel, including isoelectric focusing gels and the like). Following 

25 separation of proteins, the canco: protein is detected, e.g., by immunoblotting with antibodies 
raised against the cancer protein. 

In another preferred method, antibodies to the cancer protein find use in in situ imaging 
techniques, e.g., in histology. See, e.g., Asai, et al. (eds. 1993) Methods in Cell Biology: 
Antibodies in Cell Bioloev (vol. 37) Academic Press. In this method, cells are contacted with 

30 firom one to many antibodies to the cancer protein(s). Following washing to remove non- 
specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
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detectable label. In another method the primary antibody to the canc^ protein(s) contains a 
detectable label, e.g., an enzyme marker that can act on a substrate. In another preferred 
embodiment each one of multiple primary antibodies contains a distinct and detectable label. 
This method finds particular use in simultaneous screening for a plurality of cancer proteins. 
5 Many other histological imaging techniques are also provided by the invention. 

In a preferred embodiment ttie label is detected in a fluorometer which has the ability to 
detect and distinguish emissions of different wavelengths. In addition, a fluorescence activated 
cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing cancer from blood, 

10 serum, plasma, stool, and other samples. Such samples, therefore, are useful as samples to be 
probed or tested for the presence of cancer proteins. Antibodies can be used to detect a cancer 
protein by previously described immunoassay techniques including ELIS A, immunoblotting 
(western blotting), immunoprecipitation, BIACORE technology and the like. Conversely, the 
presence of antibodies may indicate an immune response against an endogenous cancer protein. 

15 In a preferred embodiment, in situ hybridization of labeled cancer nucleic acid probes to 

tissue arrays is done. For example, arrays of tissue samples, including cancer tissue and/or 
normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) is then performed. 
When comparing the fingerprints between an individual and a standard, a diagnosis, a 
prognosis, or a prediction may be based on the findings. It is further understood that the genes 

20 which indicate the diagnosis may differ &om those which indicate the prognosis and molecular 
profiling of the condition of the cells may lead to distinctions between responsive or reflectory 
conditions or may be predictive of outcomes. 

In a preferred embodiment, the cancer proteins, antibodies, nucleic acids, modified 
proteins, and cells containing cancer sequences are used in prognosis assays. As above, gene 

25 expression profiles can be generated ttiat conelate to cancer, clinical, pathological, or other 
information, in terms of long tenn prognosis. Again, fliis may be done on either a protein or 
gene level, with flie use of genes being preferred. Single or multiple genes may be useful in 
various combinations. As above, cancer probes may be attached to biochips for the detection 
and quantification of cancer sequences in a tissue or patient. The assays proceed as outlined 

30 above for diagnosis. PGR mefliod may provide more sensitive and accurate quantification. 
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Assays for therapeutic compounds 

In a preferred embodiment, the proteins, nucleic acids, and antibodies as described 
herein are used in drug screening assays. The cancer proteins, antibodies, nucleic acids, 
modified proteins, and cells containing cancer sequences are used in drug screening assays or 
by evaluating the effect of drug candidates on a "gene expression profile" or expression profile 
of polypeptides. In a preferred embodiment, the expression profiles are used, preferably in 
conjunction with high througtq)Ut screening techniques, to allow monitoring for expression 
profile genes after treatment with a candidate agent (e.g., Zlokamik, et al. (1998) Science 
279:84-88; Held (1996) Genome Res. 6:986-994. 

In a preferred embodiment, the cancer proteins, antibodies, nucleic acids, modified 
proteins and cells containing the native or modified cancer proteins are used in screening 
assays. That is, the present invention provides novel methods for screening for compositions 
which modulate the cancer phenotype or an identified physiological fimction of a cancer 
protein. As above, this can be done on an individual gene level or by evaluating the effect of 
drug candidates on a "gene expression profile". In a preferred embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening techniques, to allow 
monitoring for expression profile genes aft^ treatment with a candidate agent, see Zlokamik, 
supra. 

Having identified the differentially expressed genes herein, a variety of assays may be 
performed. In a preferred embodiment, assays may be run on an individual gene or protein 
level. That is, having identified a particular gene as up regulated in cancer, test compounds can 
be screened for the ability to modulate gene expression or for binding to the cancer protein. 
"Modulation" thus includes both an increase and a decrease in gene expression. The preferred 
amount of modulation will depend on the original change of the gene expression in normal 
versus tissue undergoing cancer, with changes of at least 10%, preferably 50%, more preferably 
100-300%, and in some embodiments 300-1000% or greater. Thus, if a gene exhibits a 4-fold 
increase in cancer tissue compared to normal tissue, a decrease of about four-fold is often 
desired; similarly, a 10-fold decrease in cancer tissue compared to normal tissue often provides 
a target value of a 10-fold increase in expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes and the 
quantification of gene expression levels, or, altematively, the gene product itself can be 
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monitored, e.g., through the use of antibodies to the cancer protein and standard immunoassays. 
Proteomics and separation techniques may also allow quantification of expression. 

In a preferred embodiment, gene expression or protein monitoring of a number of 
entities, e.g., an expression profile, is monitored simultaneously. Such profiles will typically 
5 involve a plurality of those entities described herein. 

In this embodiment, the cancer nucleic acid probes are attached to biochips as outlined 
herein for the detection and quantification of cancer sequences in a particular cell. 
Alternatively, PGR may be used. Thus, a series, e.g., of microtiter plate, may be used with 
dispensed primers in desired wells. A PGR reaction can then be performed and analyzed for 
10 each well. 



Modulators of cancer 

Expression monitoring can be performed to identify compounds that modify the 
expression of one or more cancer-associated sequences, e.g., a polynucleotide sequence set out 

15 in the tables. Generally, in a preferred embodiment, a test modulator is added to the cells prior 
to analysis. Moreover, screens are also provided to identify agents that modulate cancer, 
modulate cancer proteins, bind to a cancer protdn, or mterfere with the binding of a cancer 
protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 

20 equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the cancer phenotype or the expression of a cancer sequence, e.g., a nucleic acid 
or protein sequence. Tn preferred embodimmts, modulators alter expression profiles, or 
expression profile nucleic acids or proteins provided herein. In one embodiment, the modulator 

25 si^presses a cancer phenotype, e.g., to a normal or non-malignant tissue fingerprint. In another 
embodiment, a modulator induced a cancer phenotype. Generally, a plurality of assay mixtures 
are run in parallel with different agent concentrations to obtain a differential response to the 
various concentrations. Typically, one of fliese concentrations serves as a negative control, e.g., 
at zero concentration or below the level of detection. 

30 Drug candidates encompass numerous chemical classes, though typically they are 

organic molecules, preferably small organic compounds having a molecular weight of more 
than 100 and less than about 2,500 daltons. Preferred small molecules are less than 2000, or 
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less than 1500, or less than 1000, or less than SOO D. Candidate agents comprise functional 
groups necessary for structural interaction with proteins, particularly hydrogen bonding, and 
typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two 
of the functional chemical groups. The candidate agents often comprise cyclical carbon or 

5 heterocyclic structures and/or aromatic or polyaromatic structures substituted with one or more 
of the above functional groups. Candidate agents are also found among biomolecules including 
peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs, 
or combinations thereof. Particularly preferred are peptides. 

In one aspect, a modulator will neutralize the effect of a cancer proteiiL By "neutralize" 

10 is meant that activity of a protein is inhibited or blocked and the consequent effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be screened 
for an ability to bind to a cancer polypeptide or to modulate activity. Conventionally, new 
chemical entities with useful properties are generated by identifying a chemical compound 
(called a "lead compound") with some desirable property or activity, e.g., inhibiting activity, 

IS creating variants of the lead compound, and evaluating the property and activity of those variant 
compounds. Often, higih throughput screening (HTS) methods are employed for such an 
analysis. See, e.g., Janzen (2002) High Throughput Screening: Methods and Protocols 
Humana; Devlin (ed. 1997) High Throughout Screening: The Disco verv of Bioactive 
Substances Dekker, and Mei and Czamik (eds. 2002) Integrated Drug Discoverv Techniques 

20 Dekker. 

In one preferred embodiment, high tibroughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate compounds). 
Such "combinatorial chemical libraries** are then screened in one or more assays to identify 
those library members (particular chraiical species or subclasses) that display a desired 
25 characteristic activity. The compounds thus identified can serve as conventional "lead 
compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical 
30 library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compoiind length (e.g., the 
number of amino acids in a polypeptide compound). Millions of chemical compounds can be 
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syntiiesized thioug|i such combinatorial mixing of chemical building blocks (Gallop, et al. 
(1994 ) J. Med. Chem. 37:1233-1251). 

Prq>aration and screening of combinatorial chemical libraries is well known. Such 
combinatorial chemical libraries include, but are not limited to, peptide libraries (see, e.g., U.S. 
5 Patent No. 5,010,175, Furica (1991) Pent Prot. Res. 37:487-493, Houghton, et al. (1991) Nature 
354:84-88), peptoids (PCX Publication No WO 91/19735), encoded peptides (PCT Publication 
WO 93/20242), random bio-oligomers (PCT Publication WO 92/00091), benzodiazepines (U.S. 
Pat. No. 5,288,514), diversomws such as hydantoins, benzodiazepines and dipeptides (Hobbs, et 
al- (1993) Proc. Natl. Acad. Sci. USA 90:6909-6913, vinylogous polypeptides (Hagihara, et al. 

10 (1992) J. Amer. Chem. Soc. 1 14:6568-570), nonpeptidal peptidomimetics with a Beta-D- 
Glucose scaffolding (Hirschmann, et al. (1992) J. Amer. Chem. Soc. 1 14:9217-9218), 
analogous organic syntheses of small compound libraries (Chen, et al. (1994) J. Amer. Chem. 
Sgc, 116:2661-662), oligocarbamates (Cho, et al. (1993) Science 261:1303-1305), and/or 
peptidyl phosphonates (Campbell, et al. (1994) J. Org. Chem. 59:658). See, generaUy, Gordon, 

15 et al. (1994) J. Med. Chem. 37:1385-1401, nucleic acid libraries (see, e.g., Stratagene, Corp.), 
peptide nucleic acid Ubraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., 
Vaughn, et al. (1996) Nature Biotechnolopv 14(3):309-314, and PCT/US96/10287), 
carbohydrate libraries (see, e.g., Liang, et al. (1996) Science 274:1520-1522, and U.S. Patent 
No. 5,593,853), and small organic molecule libraries (see, e.g., benzodiazepines, page 33 Baum 

20 (Jan 18, 1993) C&EN: isoprenoids, U.S. Patent No. 5,569,588; thiazoUdinones and 

metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 
5,519,134; moipholino compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent 
No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are conunerdally available (see, 

25 e.g., 357 MPS, 390 MPS, Advanced Chem Tech, LouisvUle KY, Symphony, Rainin, Wobum, 
MA, 433 A Applied Biosystems, Foster Qty, CA, 9050 Plus, Millipore, Bedford, MA). 

A number of well known robotic systems have also beai developed for solution phase 
chemistries. These systems include automated workstations like the automated synthesis 
^paratus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic 

30 systems utilizing robotic arms (Zymate n, Zymark Corporation, Hopkinton, Mass.; Orca, 

Hewlett-Packard, Palo Alto, Calif.), which mimic manual synthetic operations performed by a 
chemist The above devices are suitable for use with the present inventioru The nature and 
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implementation of modifications to these devices (if any) so that they can opmte as discussed 
herein will be apparent. In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex. Moscow, Ru, Tripos, 
Inc., St. Louis, MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek 
5 Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are tenable to high throughput screening. Preferred 
assays thus detect enhancement or inhibition of cancer gene transcription, inhibition, or 
enhancement of polypeptide expression, and inhibition or enhancement of polypeptide activity. 
High throughput assays for the presence, absence, quantification, or other properties of 

10 particular nucleic acids or protein products are well known. Similarly, binding assays and 

reporter gene assays are similarly well known. Thus, e.g., U.S. Patent No. 5,559,410 discloses 
high throughput screening methods for proteins, U.S. Patent No. 5,585,639 discloses high 
throughput screening methods for nucleic acid binding (e.g., in arrays), while U.S. Patent Nos. 
5,576,220 and 5,541,061 disclose high throughput methods of screening for ligand/antibody 

15 binding. 

In addition, high througjhput screening systems are commercially available (see, e.g,, 
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman Instruments, 
Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems typically 
automate entire procedures, including sample and reagent pipettmg, liquid dispensing, timed 

20 incubations, and final readings of the microplate in detector(s) appropriate for the assay. These 
configurable systems provide higji throughput and rapid start up as well as a hi^ degree of 
flexibility and customization. The manufacturers of such systems provide detailed protocols for 
various high throu^put sj^ms. Thus, e.g., Zymark Corp. provides technical bulletins 
describing screening systems for detecting the modulation of grae transcription, ligand binding, 

25 and the like. 

In one embodimrat, modulators are proteins, often naturally occurring proteins or 
fiagments of naturally occuning protems. Thus, e.g., cellular extracts containing proteins, or 
random or directed digests of proteinaceous cellular extracts, may be used. In this way libraries 
of proteins may be made for screening in the methods of the invention. Particularly preferred in 
30 this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins, with the latter 
being preferred, and hxunan proteins being especially preferred. Particularly usefiil test 
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compound will be directed to the class of proteins to which the target belongs, e.g., substrates 
for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about S-30 amino acids, 
with fix)m about 5-20 amino acids being preferred, and from about 7-15 being particularly 
5 preferred. The peptides may be digests of naturally occurring proteins, random peptides, or 
"biased" random peptides. By "randomized" or grammatical equivalents herein is meant that 
each nucleic acid and peptide consists of essentially random nucleotides and mono acids, 
respectively. Since generally these random peptides (or nucleic acids, discussed below) are 
chemically synthesized, they may incorporate a nucleotide or amino acid at any position. The 

10 syntiietic process can be designed to generate randomized proteins or nucleic acids, to allow the 
formation of all or most of the possible combinations over the length of the sequence, thus 
forming a library of randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or 
constants at any position. In a preferred embodiment, the library is biased. That is, some 

1 5 positions within the sequence are either held constant, or are selected from a limited number of 
possibilities. For example, in a preferred embodiment, the nucleotides or amino acid residues 
are randomized within a defined class, e.g., of hydrophobic amino acids, hydiophilic residues, 
sterically biased (either small or large) residues, towards the creation of nucleic acid binding 
domains, the creation of cysteines, for cross-linking, prolines for SH-3 domains, serines, 

20 threonines, tyrosines, or histidines for phosphorylation sites, etc., or to purines, etc. 
Modulators of cancer can also be nucleic acids, as defined above. 
As described above generally for proteins, nucleic acid modulating agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. For 
example, digests of prokaryotic or eukaryotic genomes maybe used as is outlined above for 

25 proteins. 

In a preferred onbodiment, the candidate compounds are organic chemical moieties, a 
wide variety of which are available in the literature. 

After the candidate agent has been added and the cells allowed to incubate for some 
period of time, the sample containing a target sequence to be analyzed is added to the biochip. 
30 If required, the target sequence is prepared using known techniques. For example, the sample 
may be treated to lyse the cells, using known lysis buffers, electroporation, etc., with 
purification and/or amplification such as PGR performed as appropriate. For example, an in 
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vitro transcription with labels covalentiy attached to the nucleotides is performed. Generally, 
the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the target 

5 sequence's specific binding to a probe. The label also can be an enzyme, such as, alkaline 
phosphatase or horseradish peroxidase, which when provided with an appropriate substrate 
produces a product that can be detected. Alternatively, the label can be a labeled compound or 
small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or altered by the 
enzyme. The label also can be a moiety or compound, such as, an epitope tag or biotin which 

10 specifically binds to streptavidin. For the example of biotin, the streptavidin is labeled as 

described above, thereby, providing a detectable signal for the bound target sequence. Unbound 
labeled streptavidin is typically removed prior to analysis. 

These assays can be direct hybridization assays or can comprise "sandwich assays", 
which include the use of multiple probes, as is generally outiined in U.S. Patent Nos. 5,681,702, 

15 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 

5,624,802, 5,635,352, 5,594,1 18, 5,359,100, 5,124,246, and 5,681,697, all of which are hereby 
incorporated by reference. Li this embodiment, in general, the target nucleic acid is prepared as 
outiined above, and then added to the biochip comprising a plurality of nucleic acid probes, 
under conditions that allow the formation of a hybridization complex. 

20 A variety of hybridization conditions may be used in the present invention, including 

higih, moderate, and low stringency conditions as outlined above. The assays are generally run 
under stringency conditions which allows formation of the label probe hybridization coinplex 
only in the presence of target Stringency can be controlled by altering a step parameter that is a 
thermodynamic variable, including, but not lunited to, temperature, formamide concentration, 

25 salt concentration, chaotropic salt concentration, pH, organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 
outlmed in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at 
higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. Components 

30 of the reaction may be added simultaneously, or sequentially, in different orders, with preferred 
embodiments outlined below. In addition, the reaction may include a variety' of other reagents. 
These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc. which may be used 
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to facilitate optimal hybridization and detection, and/or reduce non-specific or background 
interactions. Reagents that otherwise improve the efficiency of the assay, such as protease 
inhibitors, nuclease inhibitors, anti-xnicrobial agents, etc., may also be used as appropriate, 
depending on the sample preparation methods and purity of the target. 

5 The assay data are analyzed to det^mine the expression levels, and changes in 

expression levels as between states of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the cancer phenotype. hi one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
particular expression profile, thus preferably generating the associated phenotype. In another 

10 embodiment, e.g., for diagnostic applications, having identified differentially expressed genes 
important in a particular state, screens can be performed to identify modulators that alter 
expression of individual genes. In an another embodiment, screening is performed to identify 
modulators that alter a biological function of the expression product of a differentially 
expressed gene. Again, having identified the importance of a gene in a particular state, screens 

15 are performed to identify agents that bind and/or modulate the biological activity of the gene 
product. 

In addition, screens can be done for genes that are induced in response to a candidate 
agent or treatment process. After identifying a modulator based upon its ability to suppress a 
cancer expression pattern leading to a normal expression pattem (or its converse), or to 

20 modulate a single cancer gene expression profile so as to mimic the expression of the gene fijom 
normal tissue, a screw as described above can be p^ormed to identify genes that are 
specifically modulated in response to the agent. Comparing egression profiles betweenliormal 
tissue and agent treated cancer tissue reveals genes that are not expressed in normal tissue or 
cancer tissue, but are expressed in agent treated tissue. These agent-specific sequmces can be 

25 identified and used by methods described herein for cancer genes or proteins. In particular, 
these sequences and the proteins they encode find use in marking or identifying agent treated 
cells. In addition, antibodies can be raised against the agent induced proteins and used to target 
novel therapeutics, e.g., toxin loaded liposomes, to the treated cancer tissue sample. 

Thus, in one embodiment, a test compoimd is administered to a population of cancer 

30 cells that have an associated cancer expression profile. By "administration" or "contacting" 
herein is meant that the candidate agent is added to the cells in such a manner as to allow the 
agent to act upon the cell, whether by uptake and intracellular action, or by action at the cell 
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surface. In some embodiments, nucleic acid encoding a proteinaceous candidate agent (e.g., a 
peptide) may be put into a viral construct such as an adenoviral or retroviral construct, and 
added to the cell, such that expression of the peptide agent is accomplished, e.g., PCX 
US97/01019. Regulatable gene therapy systems can also be used, 
5 Once a test compound has been administered to the cells, the cells can be washed if 

desired and are allowed to incubate under preferably physiological conditions for some period 
of time. The cells are then harvested and a new gene expression profile is generated, as outiined 
herein. 

Thus, e.g., cancer or non-malignant tissue may be screened for agents that modulate, 
10 e.g., induce or suppress a cancer phenotype. A change m at least one gene, preferably many, of 
the expression profile indicates that the agent has an effect on cancer activity. By defining such 
a signature for the cancer phenotype, screens for new drugs that alter the phenotype can be 
devised. With this approach, the drug target need not be known and need not be represented in 
the original expression screening platform, nor does the level of transcript for the target protein 
15 need to change. 

In a preferred embodiment, as outlined above, screens may be done on individual genes 
and gene products (proteins). That is, having identified a particular differentially expressed 
gene as important in a particular state, screening of modulators of either the expression of the 
gene or the gene product itself can be done. The gene products of differentially expressed genes 

20 are sometimes referred to herein as "cancer proteins" or a "cancer modulatory protein". The 
cancer modulatory protein may be a fiiagment, or alternatively, be the fiill length protem to the 
fiagment encoded by the nucleic acids of the Tables. Preferably, the cancer modulatory protein 
is a fi^gment. In a preferred embodiment, the cancer amino acid sequence which is used to 
determine sequence identity or similarity is encoded by a nucleic acid of the Tables. In anotiier 

25 embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a 
nucleic acid of the Tables. In anoth^ embodiment, the sequences are sequence variants as 
fiirther described herein. 

Preferably, the cancer modulatory protein is a firagment of about 14-24 amino acids 
long. More preferably tiie fragment is a soluble firagment. Preferably, the firagment includes a 

30 non-transmembrane region, hi a preferred embodiment, the firagment has an N-terminal Cys to 
aid in solubility. In one embodiment, the C-tOTninus of the fragment is kept as a free acid and 
the N-teiminus is a free amine to aid in coupling, e.g., to cysteine. 
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In one embodiment the cancer proteins are conjugated to an immunogenic agent as 
discussed herein. In one embodim^t the cancer protein is conjugated to BSA. 

Measurements of cancer polypeptide activity, or of cancer or the cancer phenotype can 
be performed using a variety of assays. For example, the effects of the test compoimds upon the 

5 function of the cancer polypeptides can be measured by examining parameters described above. 
A suitable physiological change that affects activity can be used to assess the influence of a test 
compound on the polypeptides of this invention. When the functional consequences are 
detennined using intact cells or animals, one can also measure a variety of effects such as, in the 
case of cancer associated with tumors, tumor growth, tumor metastasis, neovascularization, 

1 0 hormone release, transcriptional changes to both known and uncharacterized genetic markers 
(e.g., northern blots), changes in cell metabolism such as cell growth or pH changes, and 
changes in intracellular second messengers such as cGMP. In the assays of the invention, 
mammalian cancer polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in vitro. For 

15 example, a cancer polypeptide is first contacted with a potential modulator and incubated for a 
suitable amount of time, e.g., from 0.5-48 hours. In one embodiment, the cancer polypeptide 
. levels are determined in vitro by measuring the level of protein or mRNA. The level of protein 
is typically measured using unmunoassays such as western blotting, ELIS A, and the like with 
an antibody that selectively binds to the cancer polypeptide or a fragment thereof For 

20 measurement of mRNA, amplification, e.g., usmg PGR, LCR, or hybridization assays, e.g., 
northern hybridization, RNAse protection, dot blottmg, are preferred. The level of protein or 
mRNA is typically detected using directly or indirectly labeled detection agents, e.g., 
fluorescently or radioactively labeled nucleic acids, radioactively or enzymatically labeled 
antibodies, and the like, as described herein. 

25 Alternatively, a reporter geae system can be devised using a cancer protein promoter 

operably linked to a reporter gene such as luciferase, green fluorescent protein, CAT, or jS-gal. 
The reporter construct is typically transfected into a cell. After treatment with a potential 
modulator, the amount of reporter gene transcription, translation, or activity is measured 
according to standard techniques. 

30 In a preferred embodunent, as outlined above, screens may be done on individual genes 

and gene products (proteins). That is, having identified a particular differentially expressed 
gene as important in a particular state, screening of modulators of the expression of the gene or 
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the gene product itself can be done. The gene products of differentially expressed genes are 
sometimes referred to herein as "cancer proteins." The cancer protein may be a fragment, or 
altematively, the full length protein to a fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes is 
p^ormed. Typically, the expression ofonly one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially expressed 
proteins. These compounds are then evaluated for the ability to modulate differentially 
expressed activity. Moreover, once initial candidate compounds are identified, variants can be 
further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated 
gene product is used; that is, the gene products of one or more differentially expressed nucleic 
acids are made. For example, antibodies are generated to the protein gene products, and 
standard immunoassays are run to determine the amount of protein present. Alternatively, cells 
comprising the cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a cancer protein and 
a candidate compoimd, and detemiining the binding of the compound to the cancer protein. 
Preferred embodiments utilize the human cancer protem, althougih other mammalian protems 
may also be used, e.g., for the development of animal models of human disease. In some 
embodiments, as outlined herein, variant or d^vative cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the cancer protein or the 
candidate agent is non-difiiisably bound to an insoluble support, prefo^ly having isolated 
sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble supports may be 
made of a composition to which the compositions can be bound, is readily separated 6om 
soluble material, and is otherwise compatible with the overall method of screening. The surface 
of such supports may be solid or porous and of a convenient shape. Examples of suitable 
insoluble supports include microtiter plates, arrays, membranes, and beads. These are typically 
made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, teflon*'^, etc. 
Microtiter plates and arrays are especially convenient because a large number of assays can be 
carried out simultaneously, using small amounts of reagents and samples. The particular 
manner of binding of the composition is typically not crucial so long as it is compatible with the 
reagents and overall methods of the invention, maintains the activity of the composition, and is 
nondifiiisable. Preferred methods of binding include the use of antibodies (which do not 
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stericaUy block either the ligand binding site or activation sequence when the protein is bound 
to the support), direct binding to "sticky" or ionic supports, chemical crosslhJdng, the synthesis 
of the protein or agent on the surface, etc. Following binding of the protein or agent, excess 
unbound material is removed by washing. The sample receiving areas may then be blocked 
5 through incubation with bovine serum albumin (BSA), casein, or other innocuous protein or 
other moiety. 

In a preferred embodiment, the canc^ protein is boimd to the support, and a test 
compoxmd is added to the assay. Alternatively, the candidate agent is bound to the support and 
the cancer protein is added. Novel binding agents include specific antibodies, non-natural 

10 binding agents identified in screens of chemical libraries, peptide analogs, etc. Of particular 
interest are screening assays for agents that have a low toxicity for human cells. A wide variety 
of assays may be used for this purpose, including labeled in vitro protein-protein binding assays, 
electrophoretic mobility shift assays, inmumoassays for protein binding, fimctional assays 
(phosphorylation assays, etc.), and the like. 

15 The detemiination of the binding of the test modulating compound to the cancer protein 

may be done in a number of ways. In a preferred embodiment, the compoimd is labeled, and 
binding determined directly, e.g., by attaching all or a portion of the cancer protein to a solid 
support, adding a labeled candidate agent (e.g., a fluorescent label), washing ofT excess reagent, 
and determining whether the label is present on the soUd support. Various blocking and 

20 washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the proteins (or 
proteinaceous candidate compounds) can be labeled. Alternatively, more than one component 
can be labeled with different labels, e.g., 125i foj- the proteins and a fluorophor for ttie 
compound. Proximity reagents, e.g., quenching or energy transfer reagents are also usefiil. 

25 In one embodiment, the binding of the test compoxmd is determined by competitive 

binding assay. The competitor may be a binding moiety known to bind to the target molecule 
(e.g., a cancer protein), such as an antibody, peptide, binding partner, Ugand, etc. Under certain 
circumstances, there may be competitive binding between the compound and the binding 
moiety, with the binding moiety displacing the compound. In one embodiment, the test 

30 compound is labeled. Either the compound, or the competitor, or both, is added first to the 
protein for a time sufficient to allow binding, if present. Incubations may be performed at a 
temperature which facilitates optimal activity, typically between about 4-40** C. Incubation 
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periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically 
between 0.1-1 hour will be sufficient Excess reagent is generally removed or washed away. 
The second component is then added, and the presence or absence of the labeled component is 
followed, to indicate binding. 

5 In a preferred embodiment, the competitor is added first, followed by a test compound. 

Displacement of the competitor is an indication that the test compound is binding to the cancer 
protein and thus is capable of binding to, and potentially modulating, the activity of the cancer 
protein. In this embodiment, either component can be labeled. Thus, e.g., if the competitor is 
labeled, the presence of label in the wash solution indicates displacement by the agent. 

10 Alternatively, if the test compound is labeled, the presence of the label on the support indicates 
displacement. 

In an alternative embodiment, the test compoimd is added first, with incubation and 
washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the test compound is bound to the cancer protein with a higher affinity. Thus, if the test 

15 compound is labeled, the presence of the label on the support, coupled with a lack of competitor 
binding, may indicate that the test compound is capable of binding to the cancer protein. 

In a preferred embodiment, the methods comprise difierential screening to identity 
agents that are capable of modulating the activity of the cancer proteins. In one embodunent, 
the methods comprise combining a cancer protein and a competitor in a first sample. A second 

20 sample comprises a test compound, a cancer protein, and a competitor. The binding of the 
competitor is determined for both samples, and a change, or difference m binding between the 
two samples indicates the presence of an agent capable of binding to the cancer protem and 
potentially modulating its activity. That is, if the bindmg of the competitor is different in the 
second sample relative to the first sample, the agent is capable of binding to the cancer protein. 

25 Altematively, differential screening is used to identify drug candidates that bind to the 

native cancer protein, but cannot bind to modified cancer proteins. The structure of the cancer 
protein may be modeled, and used in rational drug design to synthesize agents that interact with 
that site. Drug candidates that affect the activity of a cancer protein are also identified by 
screening drugs for the ability to either enhance or reduce the activity of the protein. 

30 Positive controls and negative controls may be used in the assays. Preferably control 

and test samples are performed in at least triplicate to obtain statistically significant results. 
Incubation of all samples is for a time sufficirat for the binding of the agent to the protein. 
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Following incubation, samples are washed free of non-specifically bound material and the 
amount of bound, generally labeled agent determined. For example, where a radiolabel is 
employed, the samples may be counted in a scintillation counter to detemiine the amount of 
bound compound. 

5 A variety of other reagents may be included in the screening assays. These include 

reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which may be used to 
fecilitate optimal protein-protein binding and/or reduce non-specific or background interactions. 
Also reagents that otherwise improve the efficiency of the assay, such as protease inhibitors, 
nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture of components may 

10 be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a cancer protein. The methods comprise 
adding a test compound, as defined above, to a cell comprising cancer proteins. Preferred cell 
types include almost any cell. The cells contain a recombinant nucleic acid that encodes a 

1 5 cancer protein. In a preferred embodiment, a library of candidate agents are tested on a plurality 
of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous or 
subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, antigens, 
cytokines, growth &ctors, action potentials, pharmacological agents including 
20 chemother^eutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). Iq another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate cancer agents are identified. Compounds With 
pharmacological activity are able to enhance or interfere with the activity of flie cancer protein. 
Once idratified, similar stmctures are evaluated to identify critical structural feature of the 
25 compound. 

In one embodunent, a method of inhibiting cancer cell division is provided. The method 
comprises administration of a cancer inhibitor. In another embodiment, a method of inhibitmg 
cancer is provided. Themethodmaycompriseadministrationof a cancer inhibitor. Inafiulher 
embodimmt, methods of treating cells or individuals with cancer are provided, e.g., comprising 
30 administration of a cancer inhibitor. 

In one embodiment, a cancer inhibitor is an antibody as discussed above. In another 
embodiment, the cancer inhibitor is an antisense molecule. 
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A variety of cell growth, proliferation, viability, and metastasis assays are available, as 
described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
5 transfoimed, they lose this phenotype and grow detached horn the substrate. For example, 
transfonned cells can grow in stirred suspension culture or suspended in semi-solid media, such 
as semi-soUd or soft agar. The transformed ceUs, when transfectcd with tumor suppressor 
genes, regenerate normal phenotype and require a solid substrate to attach and grow. Soft agar 
growfli or colony formation in suspension assays can be used to identify modulators of cancer 

10 sequaices, which when expressed in host cells, inhibit abnormal cellular proliferation aod 
transfonnation. A therapeutic compound would reduce or eliminate the host cells' ability to 
grow in stirred suspension culture or suspended in semi-solid media, such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are described, 
e.g., in Freshney (1998) Culture of An imal Cells: A Manual of Basic Techniq ue (3d ed.) Wiley- 

15 Liss; Freshney (2000) Culture of Animal Cells: A Manual of Basic Techniq ue (Ath eH ) Wil^y. 
Liss; and Garkavtsev, et al. (1996) Nature Genet. 14:415-20. 
Contact inhibition and density limitation of growth 

Normal cells typicaUy grow in a flat and organized pattern in a petri dish until they 
touch other cells. When the ceUs touch one another, they are contact inhibited and stop 

20 growing. When cells are transformed, however, the cells are not contact inhibited and continue 
to grow to high densities in disorganized foci. Thus, the transformed cells grow to a hi^er 
saturation density than normal cells. This can be detected morphologically by the formation of 
a disoriented monolayer of cells or rounded cells in foci within the regular pattern of normal 
surrounding ceUs. Alternatively, labeling index with (3H)-fhymidine at saturation density can 

25 be used to measure density limitation of growth. See Freshney (2000), supra. The transformed 
cells, when transfected with tumor suppressor genes, regenerate a normal phenotype and 
become contact inhibited and would grow to a lower density. 

In this assay, labeling index with (3H)-thymidine at saturation density is a preferred 
method of measuring density Umitation of growth. Transformed host cells are transfected with 

30 a cancer-associated sequence and are grown for 24 hours at saturation density in non-limiting 
medium conditions. The percentage of cells labeling with (3H)-thyniidine is determined 
autoradiographically. See, Freshney (1998), supra. 
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Growth factor or serum dependence 

Transformed cells typically have a lower serum dependence than their normal 

counterparts (see, e.g., Temin (1966) J.Natl. Cancer Insti. 37:167-175; Eagle, et al.(1970) L 

Exp. Med. 131 :836-879); Freshney, supra. This is in part due to release of various growth 
5 factors by the transformed cells. Growth factor or serum dependence of transformed host cells 

can be compared with that of control. 

Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor specific 

markers") than their normal counterparts. For example, plasminogen activator (PA) is released 
10 firom human glioma at a higher level than fix)m normal brain cells (see, e,g., Gullino 

"Angiogenesis, tumor vascularization, and potential interference with tumor growth" pp. 178- 

184 in Mihich (ed. 1985) Biological Responses in Cancer Plenum. Similarly, tumor 

angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 

counterparts. See, e.g., Folkman (1992) Sem. Cancer Biol. 3:89-96. 
1 5 Various techniques which measure the release of these factors are described in Freshney 

(1998), supra. Also, see, Unkeless, et al. (1974) J. BioL Chem. 249:4295-4305; Strickland and 

Beers (1976) J. Biol. Chem. 251:5694-5702; Whur, et al. (1980) Br. J. Cancer 42:305-312: 

Gullino "Angiogenesis, tumor vascularization, and potoitial interference with tumor growth" 

pp. 178-184 in Mihich (ed. 1985) Biological Responses in Cancer Plenum; Freshney (1985) 
20 Anticancer Res. 5:111-130. 

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix constituent 

can be used as an assay to identify compounds that modulate cancer-associated sequences. 

Tumor cells exhibit a good correlation between malignancy and invasivraess of cells into 
25 Matrigel or some other extracellular matrix constituent. In this assay, tumorigenic cells are 

typicaUy used as host cells. Expression of a tumor suppressor gene in these host cells would 

decrease invasiveness of flie host cells. 

Techniques described in Freshney (1994), supra, can be used. Briefly, the level of 

invasion of host cells can be measured by using filters coated with Matrigel or some other 
30 extracellular matrix constituent. Penetration into the gel, or through to the distal side of the 

filter, is rated as invasiveness, and rated histologically by number of cells and distance moved, 
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or by prelabeling the cells with 125i and counting the radioactivity on the distal side of the filter 
or bottom of the dish. See, e.g., Freshney (1984), supra. 
Tumor growth in vivo 

EjBFects of cancer-associated sequences on cell growth can be tested in transgenic or 
immune-suppressed mice. Knock-out transgenic mice can be made, in which the cancer gene is 
disrupted or in which a cancer gene is inserted. Knock-out transgenic mice can be made by 
insertion of a marker gene or other heterologous gene into the endogenous cancer gene site in 
the mouse genome via homologous recombination. Such mice can also be made by substituting 
the endogenous cancer gene with a mutated version of the cancer gene, or by mutating the 
endogenous cancer gene, e.g., by exposure to carcinogens. 

A DNA construct is mtroduced into the nuclei of embryonic stem cells. Cells containing 
the newly engineered genetic lesion are injected into a host mouse embryo, which is re- 
implanted into a recipient female. Some of these embryos develop into chimeric mice that 
possess germ cells partially derived firoih the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic lesion 
(see, e.g., Capecchi, et al. (1989) Science 244:1288-1292). Chimeric targeted mice can be 
derived according to Hogan, et al, (1988) Manipulatmg tibie Mouse Embrvo: A Laboratorv 
Manual CSH Press; and Robertson (ed, 1987) Teratocarcinomas and Embrvonic Stem Cells: A 
Practical Approach IRL Press, Washington, D.C. 

Alternatively, various immune-suppressed or immune-deficient host animals can be 
used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) J. 
Natl. Cancer Inst. 52:921-930), a SCID mouse, a thymectomized mouse, or an irradiated mouse 
(see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263-272; Selby, et al. (1980) Br. J. Cancer 
41:52-61) can be used as a host. Transplantable tumor cells (typically about 10^ cells) injected 
into isogenic hosts will produce invasive tumors in a high proportions of cases, while normal 
cells of similar origin will not. In hosts which developed invasive tumors, cells expressing a 
cancer-associated sequences are injected subcutaneously. After a suitable length of time, 
preferably about 4-8 weeks, tumor growth is measured (e.g., by volume or by its two largest 
dimensions) and compared to the control. Tumors that have statistically significant reduction 
(using, e.g.. Student's T test) are said to have inhibited growth. 



72 



wo 03/042661 



PCTAJS02/36810 



Polynucleotide modulators of cancer 
Antisense and RNAi Polynucleotides 

In certain embodiments, the activity of a cancer-associated protein is down-regulated, or 
entirely inhibited, by the use of an inhibitory or antisense polynucleotide, e.g., a nucleic acid 
5 complementary to, and which can preferably hybridi2» specifically to, a coding mRNA nucleic 
acid sequence, e.g., a cancer protein mRNA, or a subsequence thereof. Bmdiiig of the antisense 
polynucleotide to the mRNA reduces the translation and/or stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise naturally- 
occurring nucleotides, or synthetic species formed from naturally-occurring subunits or their 
10 close homologs. Antisense polynucleotides may also have altered sugar moieties or inter-sugar 
linkages. Exemplary among these are the phosphorothioate and other sulfur containing species. 
Analogs are comprehended by this invention so long as they function effectively to hybridize 
with the cancer protein mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., 
Natick, MA. 

15 Such antisense polynucleotides can readily be synthesized using recombinant means, or 

can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, including 
Applied Biosystems. The preparation of other oligonucleotides such as phosphorothioates and 
alkylated derivatives is also well known. 

Antisense molecules as used herem include antisense or sense oligonucleotides. Sense 

20 oligonucleotides can, e.g., be employed to block transcription by binding to flie anti-sense 
strand. The antisense and sense oligonucleotide comprise a single-stranded nucleic acid 
sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA (antisense) 
sequences for cancer molecules. A preferred antisense molecule is for a cancer sequences in the 
Tables, or for a ligand or activator thereof Antisense or sense oligonucleotides, according to 

25 tiie present invention, comprise a fragment generally at least about 14 nucleotides, preferably 
from about 14-30 nucleotides. The ability to derive an antisense or a sense oligonucleotide, 
based upon a cDNA sequence encoding a given protein is described in, e.g.. Stein and Cohen 
(1988) Cancer Res. 48:2659-2668; and van der Krol, et al. (1988) BioTechniQues 6:958-976. 
RNA interference is a mechanism to suppress gene expression in a sequence specific 

30 manner. See, e.g., Brumelkamp, et al. (2002) Sciencexnress (21March2002); Sharp (1999) 
Genes Dev. 13:139-141; andCathew (2001) Curr. On. Cell BioL 13!244-24S: InmammaUan 
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to be 
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effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 41 1 :494-498. 
The mechanism may be used to downregulate expression levels of idratified genes, e.g., 
treatment of or validation of relevance to disease. 
Ribozymes 

5 In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 

transcription of cancer-associated nucleotide sequences. A ribozyme is an RNA molecule that 
catalytically cleaves other RNA molecules. Different kinds of ribozymes have been described, 
including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, RNase P, and axhead 
ribozymes (see, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 25: 289-317 for a general 
10 review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 
5,254,678. Methods of preparation are described in, e.g., WO 94/26877; Ojwang, et al. (1993) 
Proc. Nati. Acad. Sci. USA 90:6340-6344; Yamada, et al. (1994) Human Gene Therapy 1 :39- 
15 45; Leavitt, et al.(1995) Proc. Nati. Acad. Sci. USA 92:699-703; Leavitt, et al. (1994) Human 
GeneTheraov 5:1 151-120; and Yamada, et al. (1994) Virology 205: 121-126. 

Polynucleotide modulators of cancer may be introduced into a cell containing the target 
nucleotide sequence by formation of a conjugate with a ligand binding molecule, as described in 
WO 91/04753. Suitable ligand binding molecules include, but are not limited to, cell surface 
20 receptors, growth factors, other cytokines, or other ligahds that bind to cell surface receptors. 
Preferably, conjugation of the ligand binding molecule does not substantially interfere with the 
ability of the ligand binding molecule to bind to its corresponding molecule or recqptor, dr 
block entry of the sease or antisense oligonucleotide or its conjugated version into tiie cell. 
Alternatively, a polynucleotide modulator of cancer may be introduced into a cell containing the 
25 target nucleic acid sequence, e.g., by formation of an polynucleotide-lipid complex, as described 
in WO 90/10448. It is understood that the use of antisense molecules or knock out and knock in 
models may also be used in screening assays as discussed above, in addition to metiiods of 
treatment. 

Thus, in one embodiment, methods of modulating cancer in cells or organisms are 
30 provided. In one embodiment, the methods comprise administering to a cell an anti-cancer 
antibody that reduces or eliminates the biological activity of an endogenous cancer proteia 
Alternatively, the methods comprise administering to a cell or organism a recombinant nucleic 
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acid encoding a cancer protein. This may be accomplished in any number of ways. In a 
preferred embodiment, e.g., when the cancer sequence is down-regulated in cancer, such state 
may be reversed by increasing the amount of cancer gene product in the cell. This can be 
accomplished, e.g., by overexpressing die endogenous cancer gene or administering a gene 

5 encoding the cancer sequence, using known gene-therapy techniques. In a preferred 

embodiment, the gene therapy techniques include the incorporation of the exogenous gene using 
enhanced homologous recombination (EHR), e.g., as described in PCT/US93/0386. 
Alternatively, e.g., when the cancer sequence is up-regulated in cancer, the activity of the 
endogenous cancer gene is decreased, e.g., by the administration of a cancer antisense or other 

10 inhibitor, e.g., RNAi. 

In one embodiment, the cancer proteins of the present invention may be used to generate 
polyclonal and monoclonal antibodies to cancer proteins. Similarly, the cancer proteins can be 
coupled, using standard technology, to affinity chromatography columns. These columns may 
then be used to purify cancer antibodies useful for production, diagnostic, or therapeutic 

15 purposes. In a preferred embodiment, the antibodies are generated to epitopes unique to a 
cancer protein; that is, the antibodies show little or no cross-reactivity to other proteins. The 
cancer antibodies may be coupled to standard afiSnity chromatography columns and used to 
purify cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the cancer protein. 

20 Methods of identifying variant cancer-associated sequences 

Without being bound by theory, expression of various cancer sequences is correlated 
with cancer. Accordingly, disorders based on mutant or variant cancer genes may be 
determined. In one embodiment, tiie invention provides mediods for identifying cells 
containing variant cancer genes, e.g., determining all or part of the sequence of at least one 

25 endogenous cancer gene in a cell. In a preferred embodiment, the invention provides methods 
of identifying the cancer genotype of an individual, e.g., det^inining all or part of the sequence 
of at least one cancer gene of the individual. This is generally done in at least one tissue of the 
individual, and may include the evaluation of a number of tissues or different samples of the 
same tissue. The method may include comparing the sequence of the sequenced cancer gene to 

30 a known cancer gene, e.g., a wild-type gene. 

The sequence of all or part of the cancer gene can then be compared to the sequence of a 
known cancer gene to determine if any differences exist. This can be done using known 
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homology programs, such as Bestfit, etc. In a preferred embodiment, the presence of a 
difference in the sequence between the cancer gene of the patient and the known cancer gene 
correlates with a disease state or a propensity for a disease state, as outlined herein. 

In a preferred embodiment, the cancer genes are used as probes to detennine the number 

5 of copies of the cancer gene in the genome. 

In another preferred embodiment, ttie cancer genes are used as probes to determine the 
chromosomal localization of the cancer genes. Information such as chromosomal localization 
finds use in providing a diagnosis or prognosis in particular when chromosomal abnormalities 
such as translocations, and the like are identified in the cancer gene locus. 

1 0 Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a cancer protein or modulator 
thereof, is administered to a patient. By "therapeutically effective dose" herein is meant a dose 
that produces effects for which it is administered. The exact dose will dqjend on the purpose of 
the treatment, and will be ascertainable using known techniques. See, e.g., Ansel, et al. (1999) 

15 Pharmaceutical Dosage Forms and Drug Deliverv Lippincott; Liebennan (1 992) Pharmaceutical 
Dosage Forms (vols. 1-3) Dekker, ISBN 0824770846, 082476918X, 0824712692, 0824716981; 
Lloyd (1999) The Art Science and Technology of Pharmaceutical Compounding Amer. 
Phannaceut. Assn.; and Pickar (1998) Dosage Calculations Thomson. Adjustments for cancer 
degradation, systemic versiis localized delivery, and rate of new protease synthesis, as well as 

20 the age, body weight, graeral health, sex, diet, time of adnunistration, drag interaction, and the 
severity of the condition may be necessary. U.S. Patent Application No. 09/687,576, further 
discloses the use of compositions and methods of diagnosis and treatment in cancer. 

A "patient" for the purposes of the present invention includes both humans and o&er 
animals, particularly manoanals. Thus the methods are applicable to both human thorapy and 

25 veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 
primate, and in the most preferred raibodiment the patient is human. 

The administration of the cancer proteins and modulators thereof of the present 
invention can be done in a variety of ways, including, but not limited to, orally, subcutaneously, 
intravenously, intranasally, transdennally, intraperitoneally, intramuscularly, intrapulmonary, 

30 vaginally, rectally, or intraocularly. In some instances, e.g., in the treatment of wounds and 

inflammation, the cancer proteins and modulators may be directly applied as a solution or spray. 
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The pharmaceutical compositions of the present invention comprise a cancer protein in a 
form suitable for administration to a patient Li the preferred embodiment, the pharmaceutical 
compositions are in a water soluble form, such as being present as pharmaceutically acceptable 
salts, which is meant to include both acid and base addition salts. Tharmaceutically acceptable 
S acid addition salt" refers to those salts that retain the biological effectiveness of the free bases 
and that are not biologically or otherwise undesirable, formed with inorganic acids such as 
hydrochloric acid, hydrobromic acid, sulfuric acid, nitric acid, phosphoric acid, and the like, and 
organic acids such as acetic acid, propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic 
acid, malonic acid, succinic acid, fiimaric acid, tartaric acid, citric acid, benzoic acid, cinnamic 
10 acid, mandelic acid, methanesulfonic acid, ethanesulfonie acid, p-toluenesulfonic acid, salicylic 
acid, and the like. "Pharmaceutically acceptable base addition salts" include those derived from 
inorganic bases such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, 
zinc, copper, manganese, aluminum salts, and the like. Particularly preferred are the 
ammonium, potassium, sodium, calcium, and magnesium salts. Salts derived from 
15 pharmaceutically acceptable organic non-toxic bases include salts of primary, secondary, and 
tertiary amines, substituted amines mcluding naturally occurring substituted amines, cyclic 
amines and basic ion exchange resms, such as isopropylamine, trimethylamine, diethylaxnine, 
triethylamine, tripropylamine, and etfaanolamine. 

The pharmaceutical compositions may also include one or more of the following: carri^ 
20 proteins such as serum albumin; bufiTers; fillers such as microcrystalline cellulose, lactose, com 
and other starches; bmding agents; sweeteners and other flavoring agents; coloring agents; and 
polye&ylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage forms 
dqpending upon the method of administratioit For example, unit dosage forms suitable for oral 

25 administration include, but are not limited to, powder, tablets, pills, capsules and lozenges. It is 
recognized that cancer protein modulators (e.g., antibodies, antisense constmcts, ribozymes, 
small organic molecules, etc.) when administered orally, should be protected from digestion. 
This is typically accomplished either by complexing the molecule(s) with a composition to 
render it resistant to acidic and enzymatic hydrolysis, or by packaging the molecule(s) in an 

30 2^propriately resistant carrier, such as a liposome or a protection barrier. Means of protecting 
agents from digestion are available. 
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The compositions for administration will commonly comprise a cancer protein 
modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous carrier. A 
variety of aqueous carriers can be used, e.g., buffered saline and the like. These solutions are 
sterile and gaierally free of undesirable matter. These con^sitions may be stCTilized by 
5 conventional, weU known sterilization techniques. The compositions may contain 

pharmaceutically acceptable auxiliary substances as required to approximate physiological 
conditions such as pH adjusting and buffering agents, toxicity adjusting agents, and the like, 
e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate, and 
the like. The concentration of active agwit in these formulations can vary widely, and will be 
10 selected primarily based on fluid volumes, viscosities, body weight, and the like in accordance 
with the particular mode of administration selected and the patient's needs (e.g., (1980) 
Remington's Pharmaceutical Sr.iRnr.ft (I8th ed.) Mack, and Hardman and Limbird (eds. 2001) 
Goodman and Oilman: The Pharmacolo gical Basis of Therapeutics (10th ed.) McGraw-Hill. 
Thus, a typical pharmaceutical composition for intravenous administration would be 
15 about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per patient per 
day may be used, particularly when the drug is administered to a secluded site and not into the 
blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher 
dosages are possible in topical administration. Actual methods for preparing parenterally 
administrable compositions will be known or apparent. 
20 The compositions containing modulators of cancer proteins can be administered for 

ther^eutic or prophylactic treatments. In therapeutic plications, conq)ositions are 
administered to a patient suffering from a disease (e.g., a cancer) in an amount sufficientTo cure 
or at least partially arrest the disease and its compUcations. An amount adequate to accomplish 
this is defined as a "therapeuticaUy effective dose." Amounts effective for this use will depend 
25 upon the severity of the disease and the general state of flie patient's health. Single or multiple 
administrations of the compositions maybe administered depending on the dosage and 
frequency as required and tolerated by the patient. In any event, the composition should 
provide a sufBcient quantity of the agents of this invention to effectively treat the patient. An 
amount of modulator that is capable of preventing or slowing the development of cancer in a 
mammal is referred to as a "prophylactically effective dose." The particular dose required for a 
prophylactic toeatment will depend upon the medical condition and history of the mammal, the 
particular cancer being prevented, as weU as other factors such as age, weight, gender, 
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administration route, efficiency, etc. Such prophylactic treatments may be used, e.g., in a 
mammal who has previously had cancer to prevent a recurrence of the cancer, or in a mammal 
who is suspected of having a significant likelihood of developing cancer based, at least in part, 
upon gene expression profiles. Vaccine strategies may be used, in either a DNA vaccine form, 
or protein vaccine. 

It will be appreciated that the present cancer protein-modulating compounds can be 
administered alone or in combination with additional cancer modulating compounds or with 
other therapeutic agent, e.g., other anti-cancer agents or treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides comprising 
nucleic acid sequences set forth in the Tables, such as RNAi, antisense polynucleotides or 
ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of cancer-associated polypeptides 
and nucleic acids using in vitro (cell-fi^ee), ex vivo or in vivo (cell or organism-based) 
recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for introducing 
foreign nucleotide sequences into host cells may be used. These include the use of calcium 
phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, plasma 
vectors, viral vectors, and other well known methods for introducing cloned genomic DNA, 
cDNA, synflietic DNA, or other foreign genetic material into a host cell (see, e.g., Berger and 
Kimmel (1987) Guide to Molecular aonine Techniques from Methods in Enzvmologv (vol. 
152) Academic Press; Ausubel, et al. (eds. 1999 and supplements) Current Protocols lippincott; 
and Sambrook, et al. (2001) Molecular Cloning: A Laboratory Manual (3d ed., Vol. 1-3) CSH 
Press. 

In a preferred embodiment, cancer proteins and modulators are administered as 
therapeutic agents, and can be formulated as outlined above. Similarly, cancer genes (including 
both the full-length sequence, partial sequences, or regulatory sequences of the cancer coding 
re^ons) can be administered in a gene Iherapy applicatioiL These cancer genes can include 
inhibitory applications, e,g., as inhibitory RNA, gene therapy (e.g., for incorporation into the 
genome), or antis^ose compositions. 

Cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses. Such vaccine compositions can 
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include, e.g., lipidated peptides (see, e.g.,Vitiello, et al. (1995) J. Clin. Invest 95:341-349), 
peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres 
(see, e.g., Eldridge, et al. (1991) Molec. Immunol. 28:287-294,; Alonso, et al. (1994) Vaccine 
12:299-306; Jones, et al, (1995) Vaccine 13:675-681), peptide compositions contained in 

5 immune stimulating complexes ^COMS) (see, e.g., Takahashi, et al. (1990) Nature 344:873- 
875; Hu, et al. (1998) riin Ryp TTnmunol, 113:235-243), multiple antigen pq)tide systems 
(MAPs) (see, e.g., Tam (1988) Proc. Natl Acad. Sci. USA 85:5409-5413; Tam (1996) L 
Immunol. Methods 196:17-32), peptides formulated as multivalrat peptides; peptides for use in 
ballistic delivery systems, typically crystallized peptides, viral delivery vectors (Perkus, et al., p. 

10 379, in Kaujftnann (ed. 1996) Concepts in Vaccine Development de Gruyter; Chakrabarti, et al. 
(1986) Nature 320:535-537; Hu, et al. (1986) Nature 320:537-540; Kieny, et al. (1986) 
Bio/Technologv 4:790-795; Top, et al. (1971) J. Infect. Pis. 124:148-154; Chanda, et al. (1990) 
Virologv 175:535-547), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) J. 
hnmunol. Methods 192:25-35; Eldridge, et al. (1993) Sem. Hematol. 30:16-24; Falo, et al. 

15 (1995) Nature Med. 1 :649-653), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 4:369- 
388; Gupta, et al. (1993) Vaccine 11:293-306), liposomes (Reddy, et al. (1992) J. hnmunol. 
148:1585-1589; Rock (1996) Immunol. Todav 17:131-137), or, naked or particle absorbed 
cDNA (Ubner, et al. (1993) Science 259:1745-1749; Robinson, et al. (1993) Vaccme 11:957- 
960; Shiver, et al., p 423, in Kaufinann (ed. 1996) Concepts in Vaccme Development de 

20 Gruyter, Cease and Berzofsky (1994) Annu. Rev. Immunol. 12:923-989; and Eldridge, et al. 
(1993) Sem. Hematol. 30:16-24). Toxin-targeted delivery technologies, also known as receptor 
mediated targeting, such as tibiose of Avant Immunotherapeutics, Inc. (Needham, Massachusetts) 
may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
25 designed to protect the antigen fiom rs^id catabolism, such as aluminum hydroxide or mineral 
oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis, or . 
Mycobacterium tuberculosis derived protdns. Certain adjuvants are commercially available as, 
e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); 
Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, 
30 Philadelphia, PA); aluminum salts such as almninum hydroxide gel (alum) or aluminum 

phosphate; salts of calcium, iron, or zinc; an insoluble suspension of acylated tyrosine; acylated 
sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; 
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biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as GM-CSF, 
interleukin-2, -7, -12, and other like growth factors, may also be used as adjuvants. 

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 
encoding one or more of the polypeptides, or a fragment thereof is administered to a patient. 

5 This approach is described, for instance, in Wolff et al. (1990) Science 247:1465-1468, as well 
as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,1 18; 5,736,524; 5,679,647; WO 
98/04720; and in more detail below. Examples of DNA-based delivery technologies mclude 
"naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, cationic lipid 
complexes, and particle-mediated ("gene gun") or pressure-mediated delivery (see, e.g., U.S. 

10 Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention can 
be expressed by viral or bacterial vectors. Examples of expression vectors include attenuated 
viral hosts, such as vaccinia or fowlpox. This approach involves the use of vaccinia virus, e.g., 
as a vector to express nucleotide sequences that encode cancer polypeptides or polypeptide 

15 fragments. Upon introduction into a host, the recombinant vaccinia virus expresses the 

immunogenic peptide, and thereby elicits an immune response. Vaccinia vectors and methods 
useful in immunization protocols are described in, e.g., U.S. Patent No. 4,722,848. Another 
vector is BCG (Bacilie Calmette Guerm). BCG vectors are described in Stover, et al. (1991) 
Nature 35 1 :456-460. A wide variety of olh«: vectors are availablel for therapeutic 

20 administration or immunization, e.g., adeno and adeno-associated virus vectors, retroviral 
vectors, Sahnonella typhi vectors, detoxified anthrax toxin vectors, and the like. See, e.g., 
Shata, et al. (2000) MolMedTodav 6:66-71; Shedlock, et al, (2000) J. Leukoc. Biol. 68:793- 
806; Hipp, et al, (2000) In Vivo 14:571-85. 

Methods for the use of genes as DNA vaccines are well known, and include placing a 

25 cancer gene or portion of a cancer gene under the control of a regulatable promoto: or a tissue- 
specific promoter for rapression in a cancer patient. The cancer gene used for DNA vaccines 
can encode fiiU-lenglh cancer proteins, but more preferably encodes portions of the canc^ 
proteins including peptides derived fiom the cancer protein. In one embodun^t, a patient is 
immunized with a DNA vaccine comprismg a plurality of nucleotide sequences derived firom a 

30 cancer gene. For example, cancer-associated genes or sequrace encoding subfiragments of a 
cancer protein are introduced into expression vectors and tested for their immunogenicity in the 
context of Class I MHC and an ability to generate cytotoxic T cell responses. This procedure 
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provides for production of cytotoxic T cell responses against cells which present antigen, 
including intracellular epitopes. 

In a preferred embodiment, DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase the 
5 immunogenic response to the cancer polypeptide encoded by the DNA vaccine. Additional or 
alternative adjuvants are available. 

In another prefored embodiment, cancer genes find use in generating animal models of 
cancer. When the cancer gene identified is repressed or dinodnished in cancer tissue, gene 
therapy technology, e.g., whereui inhibitory or antisense RNA directed to the cancer gene will 

10 also diminish or repress expression of the gene. Animal models of cancer find use in screening 
for modulators of a cancer-associated sequence or modulators of cancer. Similarly, transgenic 
animal technology, including gene knockout technology, e.g., as a result of homologous 
recombmation with an appropriate gene targeting vector, will result in flie absence or increased 
expression of the cancer protein. When desired, tissue-specific expression or knockout of the 

15 cancer protein may be necessary. 

It is also possible that the cancer protein is overexpressed in cancer. As such, transgenic 
animals can be generated that overexpress the cancer protein. Depending on the desired 
expression level, promoters of various strengths can be employed to express the transgene. 
Also, the number of copies of the integrated transgene can be determined and compared for a 

20 determination of the expression level of the transgene. Animals generated by such methods will 
find use as animal models of cancer and are additionally usefiil in screening for modulators to 
treat cancer. 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, kits are 

25 also provided by the invention. In diagnostic and research appUcations, such kits may include 
at^least one of the following: assay reagents, buffers, cancer-specific nucleic acids or antibodies, 
hybridization probes and/or primers, antisense polynucleotides, ribozymes, dominant negative 
cancer polypeptides or polynucleotides, small molecule inhibitors of cancer-associated 
sequences etc. A therapeutic product may include sterile saline or another pharmaceutically 

30 acceptable emulsion and suspmsion base. 

In addition, the kits may include instructional materials containing instructions (e.g., 
protocols) for the practice of the methods of this invention. While the instructional materials 
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typically comprise written or printed materials, they are not limited to such. A medium capable 
of storing such instructions and communicating them to an end user is contemplated by this 
invention. Such media include, but are not limited to, electronic storage media (e.g., magnetic 
discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such media may 
include addresses to intemet sites that provide such instructional materials. 

The present invention also provides for kits for screening for modulators of cancer- 
associated sequences. Such kits can be prepared from readily available materials and reagents. 
For example, such kits can comprise one or more of the following materials: a cancer-associated 
polypeptide or polynucleotide, reaction tubes, and instructions for testing cancer-associated 
activity. Optionally, the kit contains biologioaliy active cancer protein. A wide variety of kits 
and components can be prepared according to the present invention, depending upon the 
intended user of the kit and the particular needs of the user. Diagnosis would typically involve 
evaluation of a plurality of genes or products. The genes will typically be selected based on 
correlations with important parameters in disease which may be identified in historical or 
outcome data. 

EXAMPLES 

Example 1: Gene Chip Analysis 

Molecular profiles of various nonnal and cancerous tissues were determined and 
analyzed using gene chips. KNA was isolated and gene chip analysis was performed as 
described (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al, (2000) Genes Dev. 14:981- 
993). 
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€ondiBons<!fu*idin9 various slages^ 

Jim^glfis. auto^^ RA. 8l£.iuven]bclm)nfcarthrffis. piginenM ete.). icM newSulalzaDon 

^Tjjw^g.. diabeSc retlhopaUiy. macular deoanerBGon. pesumad ocular Mstoptesmosls syndreme. ate.). sctoiOsfoonJuncflvflis. hypertrophic scam (keldid). bblh cootral. 

in 5!f^^*^3idnorralnsitu.papIlIaiycanSnori^ 

^. breast ductal caidnonia{ns&), lobular caidnoma In s&u 

1^ cervbc cancer of the oervbc vagina, or vuhra 

colonfteclumcp^^ disease (a^, neopiasCc polyps (adenomas), fanfial adenomatous polyposis, ulceratrve cofifis), colon cancer, aa. epiSielal tumor (tcL. 

^eno^^noma. mucinous adenocarctea. slgneWng ceH adenocarcinoma, squamous cefl carcinoma, adenosquamous candnoma, undiMated caidn^ un^ssified 
^n(^ rarinoid tumor {^S-'^e^^^^^ composite), non^plMal broor {e.9. lelmyosaicoma. others), inltemmalory bowd disease (e.9!Sa^ 

L\} carcinoma, paplaiy villous carcinoma, mutinous adenocarcinoma, melanoma^ w w. BwroBaonqma, 

esoph^: prcmallgnmtorpredlgxs^^ gastrodloesliw carcinomas 

(efl.. cancers of the stomacti, colon, or rectum) 

fibrosis: lung fibirebpop^lc pulmon^ fibrosb, hypeisensltivlly pneumonitis,lnterslitial pneumonlOs, nonspecific kfiopafhte pneumonitis), chronic obstructive pulmonary disease 
9^ <!i«::i"P'^>^*=*«™'c^ro^ oo5irucuvepu!monafyflu»ase 

^^^.^^.t^Tt^' n^opharynx oral cavity, oral phaiynx, lip, laiyro(. hypophaiynx, sallvanr glands, paragangliomas, esophagus 

Iddney: dea^cell(m>np^may) ca«^ papiB^rcantona. chrcmophobe renal carcinoma, hypemephr^ 

i«*«Jf^^S?'JKS'^?l^^ P"*^' uieleralcarcinoma. fibroma^ papIBafy adenoma, anglonvollpoma, oncocytoma 

leufcocytes^ute lymph^lasllc te«tem^^ dirwte lymphocyllc leukemia^ follicular lymphoma, laige freed lymphoma. Burtdtt lymphoma, plasma cell neoplasms, mantle cell 
30 tti^S^jS 'W^^ a^""tT<ell leukemiartymphoma, Hodgkln disease, acute myetogenoileukemia, chronfe m;elogenous 

O V leutemia. thymic hypeiptesia. hairy cen leukemia, malignant transtormation, inappropriate adhratton or abnormafities of leukocytes {e.a, tmrnature. precuisor B (orieior 

K^J W '««trcphils. eosinophils. basophls.dendrifioGeIls. lymphoblasts). arthittb. \S^S^^p^^^ 

iymphangilis. bacteremia. chronfcnonspedflclynvhadenilis,psoriasb.woun^ . '««*7>w«. irnpnwwnios, 

''S^rid^ w Mfdid^* ^' ^''"^"^ ^ ^^"^^ "^^"^ mesenchymal himofs. tumors of the 

lung: lung cancer^an cel^^^ 

^^^r mudno w tumora. endometrtoU tumors), gemi cell (e.9. teiBlomas. choridcaiclnonias. potyembiyomas. embryomd 

40 ^J^^^^^^J^^"**""™^-*^ 

pancreas: adeno^n^toma. d^ adenocarcinoma, mucfewus cyst adenocarcinoma acinar ceH carcinoma, unclassified large ceQ cardnoma. smaD ceD carcinoma, 
'^^■n!?^! 1"^^^ Ojfl-^atowra^^ and basal ceO carcinoma), carcinosarcoma, non^pithetlal 

''^'^'^^llflllirJf^^ lentigo (rommon benign tocafized hyperplasia of melanocytes), nevocellular nevi (congenital or acquired neoplasm of melanocytes), aclinic keratosb 
(overgnowtfi off outer layera of sWn). basal cell carcinoma, Mertel ceH carcinoma benign fibrous histiocytoma (dermal neoplasms of fibroblasts and hbttocytes) 
3lf '^S^J^"^ fibrosareomaofltoskbi), xanthomas (tumoi^ooltectl^ 

50 Ji;;^;'^;^^ 

soAllssuKSOltfissiietuiion (&g., fibrosarcoma. Dposarcoma. leiomyosarcoma, histtocytoma fibroWstiocytic sarcoma) smooth muscle tumors (e.g., rhabdomyoma 

^^^^^^^J^ lymph vessels (e.g. angiosarcoma lymj^angtosarcoma KaposTs sarcoma), perivascular tumors (e.g.. gtomus tumors, 

S ^^S*^^^' "»solhelk)ma), neural tumors (e^g.. neurofibroma neurofibrosarcoma maHgnanl periphery nerve sheath tumors, granular oeB 

55 SSSSSSt ^^'^"f^b'astoma neuroepitheltoma extrasketelal Ewlng'ssarcoma ach^onnomat neuroma gangloneuroma). paraganglioma e)tekeletal 

chondrosarcoma osteosarcoma), plunpotentlal mesenchymal hjmors, epHhefod sarcomas, rhabdoM tumors, desmoplastic small oefl 

^^^ii^m^^^^^^^!^ adenoacanthoma carcinoid, leton^arcoma gastritis (chronic atrophic. R pylori associated), hyperplastic polyps, lipoma 

60 ^1L2!?ll?i^lIIlf i^^^ carcliwmas. teratomas, choriocarcinomas, yolk sac tumors), sex chord stromal lumois (Including Leydlg cell tumors. 

OU S^toli cell himore. and Granulosa c^ hjmors). gemi ceQ and gonadal stroma) elemenis (e.g.. gonadoUastomas). adnexal and paratosQcutar himors (afl7m^^Z^ soft 

fisaiesaroomas, and adnexal olft^ P«««»««.aruimorsiao..mesouieiioma5.soii 

utaros: e^lhelial himors (e.9>.ergometriold. papS^ endometrioid, papiilary serous, clear cell, mucinous), mesenchymal tumors (ag,. endometrial stromal sarcoma 
ieiomyosarooma, nonspedSc sarcomas), mfaced (umore (ag., m^gnant mixed mullerian tumors, adenosarcoma) 

65 

70 ForeachprobesetlsBsledgenedustermimber 
tromwhfchdlgOTucteoMwwemdesfe Gene dusters were compOed using sequences derived liomGenbartkESTs and mRN^ These sequences were chislered based on 
?SnS!^^ Gsnbartkaccasstennumbersffbrsequences^^^ 

_ TaW«2C^72(X76C,anri79CislgenoiiteposltloningforPkey^ Fbreadi oredicted 

/D ewn Is listed genomfc sequence source used for piedk^on. NudeotldetocaBora «^«n« /a«,ie8pwav«y. rorewnpregcco 

Table 2A. Disease lndk:ations and Prefened Utilities for Selecled Genes 

Tabto2Aprovldsspretisirod disease lndfca5ons and preferred These genes were Wentilied using EosWfymetrfxGenedtipanays^ 



Pkey: Uidque Cos probeset Identifier number 

ExAocn: Exemplar Accession number 
UnigeneiDi Unlgene ID number 
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UniQdneTilla: Unlgena gene (ifla 

Disease: ivEfened diseases NIcatad for seJectod gene esdescri^ blad (bladder diseases). angfoOibod vessel diseases), EWS 

fjaom diseases). gSo (brein diseases), breast (breast diseases), cerv (cenrical diseases), colon (cotorectal diseases), esoph (esophageal diseases), 6bro (fibiDsis 
diseases), headnk (head & neck diseases), leio (leiomyoma diseases), teuk (leukocyte diseases). hepC (Over diseases), lung (lung diseases), ovar (ovarian 
diseases), endo (ovarian endometrioU diseases), omuc (ovarian nwdnous diseases), pane (panciesiic diseases), pros (pn»tats d^eases), lenal (renal diseases), 
sac (soft tissue and bone diseases), ntela (sMn diseases), stooi (stomach diseases), (est (testicular diseases), uter (uterine diseases) 

USiity: preferred utilities for selected gane as described in the text and ebfaievialed as fo^^ (HI (DNAvaodiiatBrgeO.(nag[dlaonoslic or progiBstic target}. mAb 

(monockmai antibody target), sm (smaQ molecuie targeQ 

Pkey; ExAccn; UnigenelD; Unigene TiQe; Disease; UQIily 

102892; BE440042; Ks.83326: niatrtx metalloprotelnase 3 (st; headnk; mAb^ag^sm 

104865: T79340; Ks.22S75: B<:ell OlTlymphoma 6, member ; anglo; CTL 

104978; Alig9268; H$.19322: Homo sapiens, Similar to RIKEN; colon, lung, pros, blad, stotn; CTL 

109424; NM.00532% Hs.8S962: hyahtronan synthase 3; blad. king; mAb^sm 

110765: AK000322: Hs.18457; hypolheOcal protein FU20315: colon, pros, stom. uter; mAb«dlag 

110906: AA03521 1; Hs.17404: S0X7 SRY (sex d^rrrtnlng regl; angb. Wad; CTL 

115522; BE614387: Hs.333893: o^ target JP01: colon, lung. blad. pane; OTL 

116176; AA311152; Hs.288708; hypothetical protein FU21 562; coton; CTL 

116695; AK000465;Hs.50061; KIAA1 199 prot^ colon, king; diag 

123049: BE047680; Hs.21 1669; dckkopf (Xenopus laevis) homo; EWS; mAlHdlag 

131486; F06972: Hs^2; endotheOa) tyrosine kinase (E; anglo; CTL^m 

133370; AF245505; Hs.72157; Adlican; breast, king, pane; dlag 

310016; AW449612: Hs.152475: ESTs; cdon; CTL 

322303; Ai357412; H5.157601; ESTs; cotan, pros, fibro, breast; CTL^diag 

400289; X07820: Hs.2256; matrix metaOoproteinase 10 (s; angio. Uad. kmg. oetv, ovar, headnk, esoph; mAb^dlag^jn. 
400297; AI127076; Ks.288381; hypotheScal protein DKFZp5640; breast, Uad, cok»i. pros; mAb 
400303; AA242758; Hs.79136: UV-I protein, estrogen reguia; breast, ovar. pros, stem, uter. blad, lung, headnk; mAb 
400843; ; ; NtUL003105*:Honio sapiens sortil: Uad; sm 

402075; ; ; ENSP00000251056*:Rasina membra; blad, lung, headnk, oenr. mela, esoph; mAi^diag 

402901; : ; NM_02S206*:Homo sapiens hypoOt; blad ; CTL 

404287; ; ; FGENESH predated novel CU&do; pane, hmg. coton. uter, esoph; mAlHsm 

404682; ; ; orlholog of mouse polydomain p; pane; dlag 

404875; ; ; NM_022819*:HorTO sapiens phosph; blad; CTL^m 

404977; ; ; Insulln4ike growth factor 2 (; blad, ovar. sarc; mAb^ag 

405033; ; ; C1002652':gii544327)sp|(»47991; Uatfc ftjn. 

408400; ; ; kalEkreinS (neuropsmAwasIn; ovar, uter; diag 

406964; M21 305; ; FGENES predicted novel secrete; angto, blad, Qko. sarc; diag 

407603; AW955705: Hs.62604; Homo sapiens, ctone IMAGE:4299; gib. blad; CTL 

407792; AI07771S; Hs.39384: putative secreted Hgand homol; ovar, uler, cerv, pane; mAb«diag 

407811; AW190902; H8.40098^ cystek» knot superfamlly 1, B; blad. pane, stom, uter, kmg, esoph; diag 

407836: T79340: Hs,22575: B^fl CLUIyniphoma 6. member ; angio; CTL 

407975; X69426; Ks.41716; endotheOai ceO^pedfic mole; angio, renal; dteg 

408243; Y00787; H3.624: Interiaukin 8; blad, stom, headnk. cen^. lung, anglo, esoph, pane; dtag 

408367; AK001178; K3.44424; Homo sapiens orphan neurotrans; mete; mAb^m. 

408369; R3843B; Hs.118747; SLC15A2 Sokrte carrier family ; pros, hmg. librD, uter, gHo, cerv, ovar; mAb 

408380: AF123050: Ks.44532; diublqulQn; king, blad, headnk. pane, stom. fibro. esoph. mela; CTL 

408482; NMJX)0876; Hs.45743; adenosine fi2b recepk)r; kjng. esoph. headnk. colon; mAlH«ja 

408562; AI436323; Hs.31141: roundabout (axon guWance rece; uter, fibn). sam; mAb*sm 

408790: AW580227; Hs.47860; neurotrophic tyrosine ktoase, ; kmg; mAlH6.m. 

408908; BE296227; Hs.260822; serinefthreonine kinase 1 5; blad, lung, headnk, stom,eoIon; sm 

409041; AB033025; Hs.50081; HypoSretical protein, XP.05186; uter, ovar. lung. cobn. skm, headnk. breast, pane; CTL-nfiag 

409079; W87707: H5^2065; interieukin 6 signal transduce; breast pros; inAb«sm 

409103; AF251237; Hs.1 12208; XAG&I protein; lung; CTL 

409178; BE39394e; Hs.50915: kalllkreki 5; ovar, breast, mela; diag 

409220; BE243323; H$.51 233: necrosb factor receptor; angk). renal, colon, stom; mAlHfi.m. 

409420; Z15008; Hs.54451; lamlnin, gamn^ 2 (nk»in (100k; king, headnk. pane, stom. cerv. esoph. blad; diag 

409632; W74001: Hsi5279: serine (or cysteine) proteinas; king, Uad, headnk; diag 

409663; AI7437S0: Hs.98306; KiAA1862 protein; renal; CTL 

409757; NM.001898; Ks.123114: cystaGn SN; pane. stom. kmg, blad; di^ 

409889; AW630041; Hs.56937; suppresston of tumorigenidly ; colon, ovar. pros; mAb^em 

409893; AW247090; Hs.57101; mbschromosome m^tenance del; king, cerv. blad, test, esoph; CTL-»sja 

409956: AW103364: Hs.727: bihibfai. belaA(acMn A, ac; brrast, pane, ovar, ookxi, headnk. lung, Uad, esoph; diag 

410001; AB041036: Hs.57771; kdGkreIn 11; ovar, pros, uter. ceiv. hmg ; (Hag 

410055; AJ250839; Hs.58241; gene for eerine/Ihreonlne prot renal; sja 

410153; 6E311926; Hs.15830; hypotheHeal protein FU12691; rena].blad; CTL 

410274; AA381807; Hs^$402; hypoxiaMudbte protein 2; king, renal; CTL 

41030% B£()43077; Hs.278153; 8lpha.2.84la)yltransfbrase II; pane; sm 

410407; )C68839; Hs.63287; cariionlc anhydrase IX; renal, lung, ooton, stom, ovar. uter, Uad, sarc; mAtnem 

410418; 03138^ Ks.63325; transmembrane protease, serine: cokm. Uad. lung, ovar, panc^ headnk; mAlndiag^em 

41 1274; NM.002776; Hs.69423; kaOIkreln 10; cobn, ovar, uter, cerv. headk, pane; diag 

41 141 1; AA345241: K$.S5gS(): ESTs, Weakly similar to KIAA13; renal; mAb«^ 

411773; NM_0067S9; Hs.72026; protease, serine. 21 (testlsin; over; diag 

41 1975; A19160S8; H1144583; 3'UTR of: dead ringer (Orosoph; test, colon; CTL 

412078; X69699: Hs.73149: paired box gene 8; ova; CTL 

412140; AA219691; Hs.73625; RAB6 interactbg. kinestivlike; lung. Uad. headnk. breast ovar. pane, angb. test mela; s.m. 

412314; AA82S247: H$.356084; G prote!n<oupted reeepbr 27 ; ovar, uter. test; mAt>«sm 

412609; Z48804; Hs.74124: ocular albinism 1 (Netileshl|>-; mela; sjn. 

412628: A1972402; Hs.306051; hypolhetbal preteb MGC2848; pros; diag 

412709; AL022327; H174S18; KIAA0027 protein: gtb, saic; mAtH&m. 

412719; AW016610; Hs.81&; ESTs; lung, headnk, Uad, QQo, cerv. sare; sjn. 

4129S9; 087458; Hs.7509flt iOAA0262 protdn; gOo; CTL^m 

41304% M93221; Hs.75182; mannose receptor, Clype 1; fibro^ pane; mAb 
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413063; AU»5737; Ks.75184; chlQnase ZBb 1 (caililage ; gBo. ovar. Mad. lung : dlag 

413278; BE563085; Hs.633; inteiferoivstatated proteiru pane. hing. Uad. tveasloeiv.ovar, hsadnk, esoph, mda; Ca-»8Jiv 

413324; V00S71; Hs .75294; corticotroptn reteasii^ homnn; blad; diag 

41338S; M34455; H3.840; InddeaminBiiyrrolB 2^ diaxyg; btad. lung, meta, fibiOb titer, saic; 8.m. 

413554; AA319146; K5.75426; secretogrsnin n (chro t nogf anl n; pane, gDo; d!ag 

41 3719; BE439580; Hs.75498; small inductble cytokine subfa; teuk, pane, tung. hoadnk, C8fV, oolon, uter, stem, eso|A; diag 

414555; N98569; HsJ6422: phosphc^pase A2, group IIA (p; pros; sm 

414577; AI056548; HS.37893B; hypothetical protein FU20992 ; angio; CTl-Kflag 

414774; X02419; Hs.77274; ptesmlnogen acfivator, woWna; lung, blad, headnk, pane, stom, ovar, esoph; (flag 

41481 2; X72755; H3.77367; monokine induced by gamma Ints; breast biad. tung, iibro, pane, cdon, headnk. cerv. stom, rendl ovar, test, meta, esoph; diag 

41 4883; AA926960; Hs.348669: COC28 protein kinase 1 ; tung. ovar, slom^ colon, can. headrtk. lest; sm 

414907; Xg072S; Hs.77597; polo (DrosophlaHikB kinase; blad, lung, ovar, test; sm 

414991; C17898; ; Homo saptens Ufnegulated by B; Iibro. lung; mAtHdlag 

4151 38; C183S6; Hs.295944; tissue factor pathway InhibitD; an^, pane, stnn. king, uter; CTL-nnag 

415539; At733881: Hs.72472: NAME OMITTED ... receptor kinase: breast; mAb*8.m. 

41 5668: AW9576B4: Ks.30681 4; Homo sapiens ^ (»idase4Dc mefac diag 

415G69; NM.Q05025; Hs.78589; serine (or cysteine) prolelnas; lung; mAb^-^sjn. 

415817; UBa967; Hs.78867; prolebi tyrosine phosphatase. ; king, glk), headnk, cenf, mela, esoph, fibre; mAb^^ 

415929: AA724373: Hs.304950; Homo saddens mucoG{^rv3 (MCOU meia; mAb 

416091; AF295370; Hs.283082t defensin, beta 3; headnk. esoph, meia; CTL^dlag 

416209; AA236776; HS.7907B; MAD2 (mitotic arrest deficient hing, headnk, colon, uler, stom: CR-tsm 

416250; AA581386; Hs.73452: Kremen 2; esoph. lung, ceiv. ovar; mAI>*e.m. 

416530; U62801; Hs.79381: kallikiein 6 (neurosin, zyme); ovar, uter, diag 

41 6636; N3253S: Hs.42645; sotuts carrier faniy 16 (mono; breast, pane. uter. meia; mAb^sm 

41665B; U03272; Hs.79432; fibriliin 2 (congenital contra; lung, ovar, uter, biad, angb. test saro; diag 

416836; D54745; Hs.80247; chotecystoUnin: pros, EWS. gBo ; diag 

416857; AA188775; Hs.292453; FGENESH predh:tBd TM containin; gHo; mAb«6m 

416965: N26223; Hs.160436; MDACI; fibro. ovar. uter; mAb 

417034; NM.006183; Hs.80962; neurotensin; lung, headnk. cerv; diag 

417079: U65590; H8.81 134; biterleukin 1 receptor antagon; blad, lung, headnk, ceiv. esoph; diag 

417166; AA431323; Hs.42146: Paired box protehi Pax^; meia, sarc; CTL 

417389: BE260964: Hs.62045; mldkine (neurite grawth^nul; ovar, lung. Uad. uter. cerv. pane, siom. meia, test colon, sarc; mAb^iliag 

417433; BE270266; Hs.82128; ST 4 oncofetal trophoUast glyc; pane, breast bl^, lung, headnk. cen^. uter. ovar, stom, renal; mAb 

417771; AA804898; Hs.82547; reSnoto add receptor respond: blad, ceiv, pane, pros, ovar, mAb 

417866: AW067903: Hs.62772: collagen, type XI, alpha 1: lung. pane, breast ovar, headnk, stom, saic; CTL 

417931; W95842: Hs.82961; trefoil factor 3 (htes6nai): ovar, pane, stom, colon, uter. pros; diag 

417933; X02308: Ks.B2962: thymidylate syrrihetase; blad. hing. angio^ colon, pane, esoph; s.m. 

418007; M13509; Hs.83169; matrfat metaBoprotelnase 1 (in; lung. Uad. fibro, headnk, pane, stom, colon, ovar, esoph. meia; mAb^diag^sjn. 

418030; 6E207573; Hs.83321; neuromedin B; gHo, pane; diag 

418064; BE3672B7; Ks.83384; S1G0 caldum^^nding protein, ; meia; diag 

41B281; U09550: Hs.1154; oviductal glycoprotein 1. 120k: uter, ovar; CTL-nliag 

418478; U36945; Ks.1174; cydiiHiep^ent kinase inhibi; king, blad, ovar, headnk. pane, cerv, meta. saro; 8.m. 

418506; AA0B4248; H5^726^1; Unknown prateki for MGC:29643 ; angio. ovar, gDo, uter, bng. blad, pane, mela^ sarc; mAlndiag 

418526; BE019020; Hs.8SB38;60hjtB carrier famBy 16 (mono; lung, blad, renat pane, stom. colon, ovar, mAtHS.m. 

418558: AW082266; H3.86131; Fas CrNFRSF6>^soelated via d; esoph, headnk; sm 

418678; Nli^001327; Hs.87225; cancer/lestis antigen (NY-ESO-; hmg, blad, stom, ovar, pane, esoph, cerv, saro; CTL 

418738; AW388633: Hs.6682; solute carrier family 7, (cati; angto, hing. ovar, blad, colon, stom, pane, uter, leuk; mAlHs.m. 

418830; 8E513731: Hs.348874: hypothettcal protein M6C4816; kmg; CTL 

418867; D31771; Hs.89404: msh (Drosophila) homeo box hom; Uad; s.m. 

418870; AF147204; Hs.89414; chemoklne (C-X-C moGI), reeep; leuk. ovar, breast blad, renal; mAb^sm 

419080; AVinS0835; Hs.1887a; hypotheScal protein FU21620: renal, lower uter, lung; CTL 

419121; AA374372; Hs.89626; parathyrOU honrane^ homw; king, esoph, Nadnk. blad; diag 

419171; NM.002846; Hs.89655; protein tyni^ pho sphatas e. ; hing; mAb^sm 

419172; AW338825: Hs.22026; ESTs; sfanOartoTRANSMEMBRAN; angto, renal; mAb«sm 

419183; U60669; K3.e9663; cytodirome P450, subMy XXI; btad, lung, headnk, pane; CTt*sm 

419216; AU076718: HS.1&4021; small Indudbie cytokine subfa; pane. tung. stom. cav. pros, boadOi, esoph; diag 

419235; AW470411; Hs.288433; neurctrtmln; pane, Gbro, headnk, hing; mAI>«dtag 

41 9452; U33635; H3.90572; FVQ protdn tyrosine kinase 7; ovar. pros, lung, breast uter, test pane. stom. sarc; mAlH&m. 

419508; AW99793d: Ks.90786: ATP-tnnding cassette, sub-fami; gHo, omuc, stom, hmg, pane, coton, renat uler; mAb«s.nL 

419556; U29615; Hs.91093: chiSnase 1 (chitotrk^stdase); lung, fibro, test; mAb^g 

419704: AA429104; Hs.45057: ESTs; gflo; CrL-»s.m. 

419723; AL120ig3; Hs.33S810; tongevity assurance (LA61, S. ; gflo; mAlndiag 

419741; NM.007019; Hs.93002; ubiqultin carrfer protein E2-C; blad, lung, colon, ovar, test esoph. meia, saro; CTLtsjn. 

419833; AA251131; Ks.220697; Homo sapiens tryptophanyt-tRNA; Iibro, stem, Uad, esoph. uter, dag 

4201 59; A1572490; Hs.99785: Honn sapiens cONA: FU21245 fl; blad, Stom; mAb 

420162; BE378432: Hs.95577; cycfiivdependent kinase 4; lung, mete, saro; sm 

420370; Y13645; Hs.97234: uroplakin 2; blad; mAb 

420440; NhL002407; H&97644: mamrriaglobk) 2; ovar, uter, cerv; diag 

420602: AF060877; Hs.99236; regulator of &f»otein signalh, headnk. gib, ceiv. m^ CTL'»s.m. 

420610; A!663183; Ks.99348: dlstaMess homeo box 5; uler, endo, lung; CTL 

420737; 108098; Hs.99899; CD70 ; \mot necrosis factor ; renal; mAlHS.m. 

420876; AA918425: Hs.177744: FGENES predicted novel protein: pane, blad; 84n. 

421066; AU076725; Hs.101408: branched chain aminotransferas; blad, hjng; CTL^sm 

4211 IQt AJ250717; Ks.1355; calhepstn E; Uad. pane, stom. hing, fibro. ovar. esoph; snndiag 

421340; F07783; Hs.1369; decay acceleraSng factor for ; angto, pane, stom; diag 

421379; Y1 5221; Hs.103982; smaS indudbie cytokine subfa; breast pane, headnk, hmg, stomi Uad, cerv. coton, leuk, fibro. test meia esoph; diag 
421471; U90545; Hs.327179; soluta cantor family 17 (sodc renal; mAtnem 
421474; U76362: Hs.l04637; solute canter family 1 (gluta; lung; mAtnem 
421524; AA312082: Ks.105445; GONF family receptor alpha 1; breast; mAlHsm 

421SS2;AR)26692;Ks.10S700; secreted fritzted^elatodptot; breast ovar, pia^ < 

421563; NM.006433: Hs.105806; granulysin: fibn; diag 

421574; AI000152: H8.105924; defensin. beta Z headnk, bng; Ca-^ 

421582; AI910275; Hs.350470; trefbQ factor 1 (breast cance; breast pane, hmg, omuc; diag 
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421659: NM-014459: Hs.l06511: protocatlheiin 17; Omx mAb 

421763; BE314828; Hs.107911; ATP-Nnding cassette, sub-faml; Im mAbtsJn. 

421817; AF146074; Hs.108660; ATP-bindhg cassettd, suthfami; Iun9. cerv. headnk, Uad; mAb^sin. 

421829; AB018330; Ks.10a708; caJdunMrnoduIirHiependent p; pros; sm 

422048; NH.012445; Hs.288126; spondin 2, extracellular matri; pane, pros, saic; dfag 

422083; NM_001141: Ks.111256; arachhkmatfi IS-Cpoxygenase, ; pros; 

422109; S73265; H3.1473; gastrin-releasing pepQde; pane, tung, colon, fibro; diag 

422158; L10343; Hs.1 12341; protease b^Ibitor 3, sklrt-der; lieadnK iitad, lung, cerv. stom, esoph; diag 

422192: AA305159; Hs.113019; Qs465; meta; &.m. 

422260; AA315993; H3.105484; regeneraOng gene type IV; cobn. omw, sionv pane; mAb^ag 
422282; AF01d225: Hs.1 14309; apoOpoprotain L; btad. hmg. headnk. tend!; diag 
422283; AW411307; Hs.11431 1; CX45 (ceO division cyde 45,; lung, blad. test oeiv. headitk, esoph; sjn. 
422330; D307B3: Hs.1 15263; eplregulin: pane, colon, blad; mAiMiag 

422397; AJ223386; Ks.116051; MYEOV Myeloma overexpressed ge; pane, stoni, colon, esoph, rena), blad; CTL^s-m. 

422424; AI186431; Hs.296638; prostate differenQaBon facto; Uad, pane, pros, angio, colon, stom, lung, mela; diag 

422578; AF239666; H9.1545: caudal (ype horneo box tnnscri; ookin; 

422627; BE336857: Hs.118787; l iditbfu im lf i y growth factor, be; odon, renal, sarc; mAb^dtag 

422765; AW409701; Ks.1578; bacutovlral lAP repeatcontain; hmg. btad; sm 

422809; AK001379; Hs.121028; ttypothelicai protdn FLJ10549; bN, cer, hjng. uler, angio, 8ton\ test; 8.m. 

422867; L32137; Hs.1584; carQage digomeric matrfoc pr; breast, ovar, pros, pane, lung, colon, uter, sarc; diag 

422956; BE545072; HS.122S79; ECT2 protein (EpftheOal oeH ; ovar, litad. pane, lung, headrrit. oolon.stom; CTL^em 

423161; AL049227: Hs.124776; downstream of dadherfn 6 (by 3; renal, ovar, Uad; mAb«e.in. 

423184; NM.004428; Hs.1624; ephrtn-AI; pros, pane, renaicdon; mAb«fi.m. 

423242; AL039402; Hs.12S783; DEME-6 protdn; bieast, renal, ovar, pros, colon; CTL 

423508; AW604297; Hs.12971 1; hepatitis A virus ceOutar rec; renal, cobn; mAb 

423583; AL12205S; Hs.129836; K1AA1028 protein; pros; sjn. 

423634; AVV959908; H3.1690; hepar1n4)!nding growth factor ; lung, trfad, headnk. pane; diag 

423673; BE003054; Hs.1695; matrix metaOoprotelnase 12 (m; Uad. lung, headnk, ovar, pane, colon, stom, uter, cerv. esoph, test; mAb^dlag^sm 
423936; U77629; Hs.135639; achaete-scute complex (Drosoph; colon, stom. ovar, CTL 

423961; D13666; Hs.138346; periostin (OSF-2os); txeast, colon, blad, tung. fibro. pane, headnk, ovar, mela, sarc; mAtn^ag 

424008; R02740; Hs.1375SS; putative chamoklne receptor; G; blad. headnk, stom, cerv, esoph; mAbtsm 

424046; AF027866; Hs.138202; serine (or cysteine) protelnas; headnk, lung, cerv; diag 

424321; W74048; Hs.1765; lyn^hocyte^pecific protdn ty; msla. fibro; 8.m. 

424381; AA2B5249; Hs.146329; pnstein kinase Chk2 {CmXm lung, cobn, test; sm 

424411; NM.005209; Hs.146549: crystallin. beta A2; pane, sare; sm 

424502; AF242338: Hs.149565: lengsin; lui«; 8.m. 

424503; NM_002205; Hs.1 49609; Integrin. alpha 5 (fibronectln; pane, pn)3.angio, Uad, hmg; mAb^.m. 
424620; AA101043; Hs.151254; kaikieln 7 (chymotjyplic, st; ovar, <fiag 

424687; J05070; Hs.151738; matrix metafloproteinase 9 (ge; headnk, pane, hrng, Uad, uteft cerv, cobn, stom, test, mela, sarc: dag 

424735; U31875; Hs.272499; shorVchaIn alcohol dehydrogen; Uad. breast GTU-^ja 

424825; AF207059: Ks.153357; proooitagen-lysine. 2-oxoghit% mela; CTL-»em 

424905; NA4.0O2497; Hs.153704; MIMA (never In mitosis gene a); ovar, Uad, tung, headnk, pane, stom; s.m. 

424917; At636208; Hs.96901 ; hypothelbat protein FU23049; fibro, uler, ovan CTL 

424943; AU07726Q; H5.153924; death^odated protein Unas; fitxo; 8.m. 

425009; X58288; Hs.1541S1; protein tyrosine phosphatase, ; rend, fibro; mAb«&nt 

425071; NM_013989; Hs.154424; dekxlinase, iodoihyionine, typ: pros. cobn. atom, uter, oerv, headnk, esop^ pane; diag 
425115; R44654; H8.123956: downstream of: 6 protein-coup; gtio; mAb^jn. 

425247; NIA.00S940; HS.15S324; maUx metaNoprdelnase 1 1 (s; breast, over, lung, cobn, pane, headnk. stonib uter, cerv, Uad. e5op^ saic; mAb^diag^sjn. 
425263; NII4.001ig7: Hs.155419; B(X2-lnteracting killer (apopt; pros: am 
425322; U63630; H&155637; protein Idnase, DMA-actrvated,; lung, headnk; 84n. 
425535; AB007937; Hs.158287; syndecan 3; mela, gib; mAbtsm 

425650; NM.001944; Hs.1925; desmogldn 3 (pemphigus vulgar, hmg, headnk, cerv. esoph, Uad; mAb 
425721: AC0Q2115; HS.1S9309; uroplakin 1A; Uad; mAb 

425723; NM.014420; Hs.1 59311; dtebcopf pCenopus laevls) homo; endo. uter, ooton; Cn-Hfiag 

425734; AF056209; Hs.159396; peptidylgtycine atpha^mklatin; lung; sjn. 

425776; U25128; Hs.159499; parathyrdd hormone receptor 2; ovar, uter, tung; mAb«dlag 

425842; AI587490; H5.159623: NK-2 (Drosophila) homdog B; pane, gib; sm 

425852; AK001504; Hs.159651; death receptor 6, TNF superf^ tiad, lung, headnk; mAb««m 

425883; AL137708; Hs.161031; Homo sapiens mRNA; cONA DKFZp4; Uad. pane; mAb 

425998; AU076629; HS.16S950; fibroblast growth factor recep; rensi; mAb^m 

426028; Nlti.001 110; Hs.172028; a didntegrin and metaltoprols; Uad; mAb«<fiag 

426215; AW963419; Hs.155223; stanniocaldn Z breast hmg. renat, cdon, ovar. uter; mAb«dIag 

426227; U67058; Hs.154299; Human protdnase activaled rec; pane, lung, cokjn, esoph, stom; mAb4«m 

426322; J05068; Hs.2012; transcobaMn 1 (vitamin B12 ; pane, blad, stom; diag 

426344; H41821; Ks.322469; transcripfiond acthrabrof t gib; CTL^m 

426427; M86699; H5.1 69840; TTK protein kinase; ovar, lung, headnk, cerv, cobn, uter, store, lest; GTt*sm 
426451; A!908165; Hs.169946; GATA-Unding protein 3 (T-ceO; Uad. breast; sm 
426514; BE616633; Ks.170195; bone morphogenetic protdn 7 (; ovar, cobn, Uad. bng, cerv; mAb«diag 
426600; NKL003378; Hs.171014; MQf nem growth fector indud; mda, 8aic; diag 

426761 ; A1015709; Hs.172089; POWN Prooncosb receptor i; lung, esoph. pros, uler, pane, cobn, ovar, headnk; mAb««jn. 

426812; AF105365; Hs.172613; sdute canier tamily 12 (pcta; renal; mAb«sm 

426890; AA393167; Hs.41294; ESTs; rend, cobn, ovar, uter. stom; CTL 

427239; BE270447; Hs.356512: uUqulOn carrier protdn; hmg. Uad, test, mda. sarc; CTL-^sm 

427335; AA448542; Hs.278444; G an6gen 78; lung, headnk, Uad. mela, esoph. saic; CTL 

427343; AI880O44; Hs.176977; protdn kinase C Unding prote; gib; Ca«sm 

427722; AK000123; Hs.180479; hypotheScd protdn FLJ20116; cobn. stonv panq CTL 

427747; AW41 1425; Hs.180655; seiineAhreonine kinase 1 2; bbd. tung, ovar. stom. tesl esoph. sarc; sm 

427923; AW274357: Hs.301406; F<£NESH predbted 11 TM prdd; mdac mAb 

427969; NM.001963; Hs.2230; epidennd growth factor (beta>; pane; mAb^diag 

428093; AW594506; Ks.104830: ESTs; ow, pane; CTL 

428179; A1127772; H3.380877: serum^uoocodboM regulated; breast; am 

428167; A16S7303; Hs.285529; G piotdiKOupled receptor 49; ovar, uter, ooton, stom; mAI>«sm 

428242; H5S70% HS.225Q; teukeiTtelnhibaDfy factor (ch; ovar, pane, leiik, tong;dbg 
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428298: NH.003(^ Ks.18357% solute carrier fsnitf 22 (orga; renal; mAtHsm 

426330; L22524; K5.225G; matiix metaOoprotetnase 7 (ma; uter, ovsr, 6bro, pros, pane tung. blad, haadnk. esoph, mela; mAlHdi3B«&m. 

428368; BE440042; Hs.83326; matrix melaOoproteinaso 3 (st; headnk, stom. esofdi, colon; diag 

428392; H10233; Hs^GS; seoetory granuta, neuroendocr; pane; (Sag 

428450; NM.014791; Ks.184339; K1AA0175 Qene product over, ceiv, pane, hmg, blad, mala; s.m. 

428479; Y00272; Hs.33456% cell division cydb 2, 61 to S; lung, blad, cdoni uter, ovan sja 

428484; AF104032; Hs.l 84601 ; 8o!uts canisr fair% 7 (calb; lung, blad, headnk, C8fv, esoph, glio. uter, stoffl, odon, mela: inAb«s.tn. 

428486; AW583497; Hs.184604; pancreaOc polypeptide; pane; diag 

428505; ALJ035461; Hs.22B1; chro m o g r anin B (secrelogranin ; pane, lung; diag 

428513; BE220806; Ks.184697; plaxin CI; meia, pane, breast $tom. headnk; mAb 

428579; NM.0O5756! Hs.1 84942; G pratsin^oupled receptor 64; ovar. EWS, uten inAb«s.m. 

428664; AK001666; Hs.189095; slmilarto SALL1 (sal (Orasoph; Uad. ovar. pn». tung, storn. test; CTL^m 

428698; M8$2773; Hs.334838; KiAA1886 protein; breast, colon, tung, pane, stom. headnk. ovar, EWS; mAb 

428748; AW593206; Ks.98785; Ksp37 protein; lung, sarc; (fiag 

428758; AA433988; Hs.98502: CA125 anfigen; mucin 16; ovar. oerv, lung, pane, stom, renal; diag 

428778; AK000530; Hs.193326; (ibrobtasl growth factor lecep; ovar. mAb«sjn. 

428953; AA306610; Hs.346183; tumor necrosis factor receptor; cerv, pane, colon, stonv headitk. reit^. mAb«diag 

428969; AF120274; Hs.194689; artemin; lung, ceiv; diag 

428970; BE276891 ; Hs.194691 ; relinok; acid induced 3 (RAI6; stom. pane, colon, ovar. tibro; mAb««.m. 
429149; AW193360; Hs.197962; Homolcg of mouse AOP-ribosylat; glio; mAb^em 

429211; AF0S2693; Hs.198249; gap function protein, beta 5 (; hJng, blad. headnk, ceiv* esoph. stom, mela; mAbtsm 

429263; AA019004: Hs.198396; ATP-binding cassette, sub-fami; lung; mAb«6m 

429276; AF056085; H8.1 98612; G protein-coupled receptor 51; angio, blad, glio; mAb«s.m. 

429353; All 17406; Hs.335891; ATP-binding cassette transport breast, pros : mAb^jn. 

429547; AW009166; Hs.99376; F6HNESH predicted novel secret; pane, headnk. hjng, ova; diag 

429610; AB024937; IHs^11092; LUNX protein; PLUNC (palate lu; lung, fibro; mAb^diag 

429903; AL134197; Hs.93597; cycfirvdependenl kinase 5. reg; lung, mela; sm 

429910; Nlil.000857; Hs.2507; 5^droxytryplamine (serotonin; leio; mAb*6.m. 

430147; 1^704; H3.234434; hai^fenhancer^of-spGl relate; glk); sm 

430178; AW44961i Hs.152475: 3TJTR of: achaetfrscute comple; colon, stom, ovar; CTL 

430377; NM-001922; Hs.301865; dopachrome taulomerase (dopach; mela; CJTL 

430413; AW842182; 83.241392; small inducible cytoldne A5 (R; fibro, esoph, mela; diag 

430486; BE062109; Hs.241551; chbdde channel, calcium acti; lung, blad, headnk, cerv, esoph; mAtHsm 

430822; AJ005371; Hs.248017; glycera!dehyde-3-phosphale deh; meia. sarc; sm 

431130; NM.006103; Hs.2719; HE4; epidldymi^pediic whey; ovar, uter; dag 

431462; AW583872; Hs.25631 1; granin-iike neuroendocrine pep; pane, hmg, test; diag 

431515; NM.012152; H$.258583; EDG-7 (endothe&al difterenO; ovar, pros, lung, btad; fnAb«s.m. 

431620; AA126109; Hs,264981; Z-y-oBgoatlenyiata synlhstas; esoph. car, CTl-^m. 

431629; AU07702S; Hs.265B27; interferon, alpha^nductbte pr, pane, uter, cerv, stom, esoph, mela; mAb^diag 

431630; NM-002204; H5u!65829; integiin, alpha 3 (antigen CD4; ovar, pane, blad, headnk. mela, ren^ mAb«8.m. 

431745; AW972446; Hs.163425; Novel FGENESH predicted cadher; fibro, ovar. uter, mAb 

431840; AA534908; Hs.2860; POU domdn. class 5^ transcrip; test, renal, blad; CTL 

431846; BE019924; Hs^lSSO; uroplaMn IB; hmg. Uad, headnk. uter. cerv. stem. ovar. mAb«dteg 

431956; X63629: Hs.2877: cadherin 3, type 1 , P-cadherin; hjng, Uad, oenr, headnk. ovar. oblon, pras. pane, breast, esoplu test, mela; mAb«dlag 

432101; AI916950; Hs.123642; EphA3; pros, pane, EWS sarc; 8.m. 

432179; X75208; Hs.2913; EphB3; ovar. coton; mAb*s.m. 

432196; AW300688: H8.273230; hypothetical protein FU10830; renal; CTL 

432201; AI538613; Ks,298241; Transmembrane protease, serine; breast colon, ovar. stom, pane, uter, cenr, kmg ; mAtndiag^sjn. 
432579: AF043244: Hs.278439; nucleolar protein 3 (apoptosis; renal; CTL 
432596; AJ224741; Hs.278461; matntin 3; pane, breast, eaic; diag 

432606; NhC002104; Hs.3066; granzyme K (serine prolease. g: renal breast, hmg, stom. hepC, fibro, leuk; CTL 
432829; W60377; Hs57772; ESTs; btad; Cll*6.m, 

432874; W94322; Hsi79651; melanoma InhlbRory activity; pane, stom. mela sarc; diag 

432990; AL036071; Hs.279899; tumor necrosis factor receptor; pros, renal; mAb«8.m. 

433001; AF217513; K5.279g0S; ckwa HQ0310 PR00310p1; coton. breast, lung. Uad. oeiv. uter. test, mela: sm 

433447; U29195; Hs.3281; neuronal pentraxln 11; mela. esoph, colon, renal; diag 

433848; AH)95719; Hs.93764; caitcxypeptklase M; headnk. esoph. King; sm 

433857; AK000596; H5.361B; hippocitfcli>^lke 1; renal; CTL 

434206; AW136973; Hs.362915; ESTs. Weakly similar to S69890; colon, lung, stom; CTL^m. 
434276; AF1 23659; Ks.93605: leucine zipper, putative hjmor; mela; sm 
434293; NM>004445; H3.3796; EphBB; blad, pros; sm 

435013; H91923; Hs.110O24; NK/L020142:Homo sapiens NAOH:ub; renal, lung, saic; CTL 

435472; AW972330; Hs.283022; triggering receptor expressed ; gSo; mAb 

435505; AF200492: Hs.211238; interteuktn-1 homotog 1; lung, headnk; diag 

436456; AW292677; Hs^48122: metat^n^ncenlraOng hormone ; me^ glto; mAb^m 

436460; AJ271643; Hs.87469; putafive add^enslng ton chan; gtto; mAb^sm 

436481; AA379597; Hs.6199; HSPC150 protein similar to ubi; lung, blad, coton. ovar. uter. headnk, test sm 

438576; Ai458213; Hs.77542; ESTs; renal, pane, headnk, king; mAt^em 

436608; AA628980; Hs.192371; down syndrome critical rec^n ; blad. king, sarc; CTL-^sm 

438895: AF0a7335; Hs^8: carbonic anhydrssaXil; breast, renfll, ovar.i^; mAb*6m 

436981; AW375974; Ks.156704: ESTs; hmg, pane, renal, uter. coton ; CTL 

438982; AB018305( Ks.5378; spondin 1, (^«pondb} extraee; ovar, fibro; diag 

437016; AU076916; Ks.53gd; guanine monphosphaie synthetas; lung. Uad. cerv, escph, coton, headnk; sm 

437044; AL035864; Hs.69517; dlRerenOally expressed b Fa; headnk. cerv, hmg. Uad. breast, pros. ovar. stom, esoph; CTL 

437789; M581344; Hs.127812; ESTe, W^sinto toT17330: lung; CTL 

437652: BE001836; Hs.256897; putaOve GPCR: Uad. hmg: mAlHsm 

438380: T06430; Hs.6194; chondrmlln sulfate proteoglyc; gito, mete; diag 

438549; BE386601; H5.21858: titoucteQQde repeal oontatnin; mela, ear, CTL-Kflag 

439018; AW300887; Hs.26638; membran&^panr^ 4<]onn^ s; uter, stom, pros, fibro; mAb 

439223; AW238299; HS.2S081 8; ai8 Unding protein 2; tong, headnk. cerv. esoph, teul^ Uad, oUon; mAb 

439477; WG981% H3.58042: ESTs, Moderately sbniiar to GF; hmg; mAb«sm 

43^ AW602166; Hs 777399; CEGP1 proton; breast pros, Uad; diag 

439606; W79123; Ks.58561; 6 protebKoupted receptor 87; tong, bted, headnk. cenr. esoph; mAtKami 
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435738; BE246502: HsJ598; senra domain, bnmunogtebulin do; bJad. hmg, cav, renal; mAb*sjn. 
439979; AVVSOOai; Hs.6823: hypothetol protein FU10430; 

440006; AK000517; H8.6844; NAIP2 protein; PYRIN-Oontelnln; Uad. ovar, lung, headnk. test; sjn. 
440065; W03476; K3^68331 ; Homo sapiens Fc receptor homol; melac diag 
440304; BE159984; H5.125395; hepaliQs A vinn cellular rec; renal, colon, bled: mAlHSia 
440516; S42303; Ks.161; cadherin 2. type 1. N^adherin; gfio. over, uter, renal, hepQ mAt^*dlag 
440672: AFQ83B1 1: Hs.7345; MAD1 (mRoQc arrest deifclent; mala; 8.m. 

441362; BE614410; Hs.23044; RAD51 (5. oerevislae) homolog ; lung. blad. headnk, test, mela, esoph; s.m. 

442117; AW664964; Hs.128899: ESTa; hypothetical protein for; breast, lung, Nad. pane, headnk, slom. ovar. pros, sarc; mAb^.ra. 

442133: AW674138; Hs.129017: ESTs; type la transmentbrane p; ov9. uter; mAb 

442275; AW449467; H5.54795: Homo sajiens secrelogtobfiv fa; fibro; iaq 

442652; AIQ05t63; Hs.20137a; Homo sapiens cDNA FU40427 lis; tlbro, ovar, uter. CTl 

443105: X98753; Hs.9004; chondroifln sdfate pioteoglyc; mela; mAlndi^ 

443247; BE61 4387; Hs^93; c-Myc target JP01 : colon, tung. Uad, pane; CTL 

443324: R44013:Hs.164225; ESTs; fibro; mAb^ 

443426; ARJ98158; Hs.9329: chromosome 20 open reading fira; colon, lung. Uad, stem, tssl meb. saic; CU 
443595; AF169312: Hs.9813; PPARfeamma) angfapotefin relet; renal; dag 
443846; AI085198; Hs.164226; Thrombospondin 1; angb, pane, uten dlag 
443859; NRt013409; Hg.9914; fblBstatln: lung, cerv, headnk. Uad, esoph; diag 
443887; AW163123; Hs.10071: seven transmembrane protefn TM; renal; mAlw.m. 

444006; BE395085; Hs.334762; type I transmembrane protein F; pane, coton. lung, over, renal, esoph, mela. Uad, stom. ceiv; mAb 
444090; S69115; Hs,1O306; natural kfllerceO group 7 se; fibro, renal, mela; diag 
444371; BE540274; Hs.239; fofkhead box Ml; lung, headnk, blad, gito, test, mela; SJn. 

1^1' ^f^I^^ W^^ hypolheScal protein BC014245: breast. cokMi. Wad. lung, pane, headnk. war. stom. uter, renal, sn^ test mela. esoph. saro; CR-Kliag 

444488;AW192879:Hs,3S6660;andentconserveddomaInprote;renal:mAb^.m. 

444527; Nlit005408; Hs.1 1 383; sm^ inducibie cytokine subfa; fibro. esoph: diag 

444781; NM^014400: Hs.1 1950; GPI-anchored metastasis-assod; lung. Uad. headnk. oerv; mAb«diag 

444783; AK001468; Hs.62180: anfllin (Orosophlla Scraps horn; over, lung, Uad, headnk, pane, cerv. stom, uter, coton, esoph; CTL-»s.m. 

445417; AK001058; Hs.12680; a dlslntegrin-lke and melaBo; pane, headnk, stom, lung, esoph, sarc. colon; diag 

445537; AJ245671; Hs.12844; EGF-KkeKJomaln. multiple 6; ovar, blad. uter, breast, lung, headnk, renal, fibro, pane, cerv, sanr, mAb*dlag 

445891; AW391342; Hs.199460; DPCR1 prOldn: stom. pane, esoph, omuc; 

445895; D29954; Hs.13421; KIAA0056 proton; pros; CTL 

446051; BE048061; Hs.37054; ephi1n-A3; colon, breast; mAb^dlag 

446163; AA026880; Hs.25252; prolactin receptor; breast cerv, uter, mAb+sm 

446341; AL040763; Hs.3t0735; FGENESH predictiofl similar to ; mela; mAb^em 

446619; AU076643: Hs313: secreted phosphoprotein 1 (est; ovar, fibro^ pane, headnk, lung. oUon. Uad» mela^ esoph, uter. saro: diag 

44665Q:AB016625:Hs.15613:8olutecanter(amily22(oiga;renal:mAb^.m. 

446921; AB012113: Hs.16530; smaB inducible cytokine subfa; breast pane, headnk. Iui«. fibre, meta; disg 

447033; AI357412; Hs.157601; Predicted gene; Bos ctoned; se; colon, pros, fibro, br^ over, king, pane, sarc; CTL-^diag 

447072; 061594; Hs.17279; tyrosylprotein sulfotransferas; gfto. pane; CTL-»sjn. 

447131; NM.0045B5: Ks.17466; retinolc acid receptor respond; renal, breast stom, lung. mela. ovar. mAtHsm 

447208; BE31S291; Hs.237971: hypotheScai protein MGCS627; esoph. stom. colon; CTl^diag 

447269; NM_004861; Hs.17958; cerebroside {S'nrfrasphoadenyly; renafc CTL 

447342; AI199268; Hs, 19322; Homo saptens, Similar to RIKEN; coton. blad, pros, king, stom, ovar, CTL 

447400; AKM0322; Hs.18457; hypothetical protein FU20315; colon, pros. stom. uten mAb^lag 

447674; BE27064(k Hs.19192; cyclin<lependent kinase 2; mela; sm 

448243; AW3e9ni; Hs^688; bitegrtn. beta 8; ovar, uter. king, stom, headnk, gIto, pane ; mAb^m 

448610; NM-005157; H8.21602; nel (chfcken)-liko 1; mela, sarc; diag 

448733; NM_005629: H5,187958; solute carrier family 6 (neuro; tong. renal; mAb^m 

448844; AI581519: H5.177164; FGENESH predicted novel cell s; pane, lung, stom. omuc; mAb^m 

449032; AA045573; Hs,22900; nuclear factor (eiythrold^ert; colon, test stom; CTL^ 

449048; 245051 ; Hs^920; similar to S68401 (caHle) gto; pane, ovar, uter, gfio, headnk, lung, sarc; mAb 

449444; AW818436; Hs.351306; soHite carrier fanfly 16 (mono; renat pane; mAb+S-m. 

449523; NR4_000579; Hs.54443: chemiAine (C-C motiO receptor; lung. pane, renal stom, hepC» fibio, leUk. mela; mAI>^.m. 

449720; AA311162; Hs.288708; hypolhetkal protein FU21 562; coton; CTL 

449722; BE280074; H5.23960; cydin B1; headnk. Uad, lung. pane, angle, test mete^ esoph: sm 

450001; N^L001044; H5.408: solute carrfer family 6 (neuro; renal; mAb«sm 

450375; AA009647; Hs^2537; a dislntegiln end metaltoprota; breast ovar, headiiki pane, hmg. esoph, coton, sarc: mAtndiao^ejn. 
450531; AW301032; Hs.203800: (BC017500) Similar to hypolhef coton; CTL 

450701; K39960; Hs.286467; hypothelical protein XP.098151: tung, headnk, pane, breast stom, ovar. esoph, cotoiL sere; mAb^ 

450726; AW204600; H5.355462: HUMPSPBA Human pulmonary surfa? fibro, king; sm 

450931; N25156; Hs.25648; himor necrosis factor receptor, king, renal; mAb«s.m. 

450983; AA305384: H5.2S740: ER01 (& oeievlSlaeHike; Uad. lung, ovar, pane; diag 

451310; AW250651 ; Ks.26213; Human DMA sequence from done ; coton, pane; CTL 

451527; AF022813: Hs.26518; transme m b r ane 4 superfismay me; renal; mAb 

451537; R56631; Hs.26550: retinoki X receptor, gamma; mela: CTL-wjn. 

451668; 243948; Hs^28444; carfilage actoto protein 1; Uad. ovar. king; mAb«dlag 

451939: U80456; H8.Z731 1 ; sbigtMnbuted (Ofosophila} hom; pros; CTL 

451979; F0fi972: Hs.27372; endothdal tyrosine Unase (E; angle; CTL^jn. 

451988; AF263928; Hs^410: papDtomavtros regulatory fact; renal; Ca 

452017; AF109302: H3J7495; prostate cancer associated pro; pros; 8.m. 

452097; AB002364: Ks.27916; a disintegrln^Ste and metaOo; ovar, mAb«SJn.^ag 

452190; H26735; Hs.91668; Homo sapiens ckme PP1 498 unkn; breast stom. pane; mAb 

452194; AI694413: Hs.373599: olfactory receptor, famfly 2. ; stom, pane, renal, coton. mela, fibro; mAb+sjn. 

452203; X57522: Hs^52018: transporter 1. ATP^rfnding cas; cerv. esoph. Uad, stom. mela, renal; mAb4s.m. 

452281: T93500; Hs^8792; Homo sapiens cONA FU1 1041 fis; Ueast headnk, pane, stom. lung, esoph, fibro; dl^ 

452401; NM.0071 15; H3.29352; tumor necrosis factor, atpha^; Uad. breast pane, headnk,stom, tong. leuk. renal, esooh: dloi 

452431; U88879;H8.29499;toiHiko receptor 3; renal. hepC; mAb 

452747; BE153855; Hs.61460: Ig supeifamiiy receptor LNIR; breast Uad. tong. headnk. ovar. stom, uter. pane; mAb 

452838; U65011: Hs.30743; preferentially expressed antig; king, ovar, breast mela. test esoph, renet sarc; CTL 

452862; AW378055: Ks.8687: A0A1^S2 (a disintegrMce a; headnk, breast cotoa teuk. hng, Uad, esooh. stom. sarc mAb«dtaa 

453195; BE24187e;Hs.32352;hypothetfcal protein OKF2p434K:ren* CTL 

453496; AA442103: HSJ3084; sotote canter fanfly 2 (tacik renat pies; mAlHS^ 
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453837; AL138387: Ks.2S6126; baculovirat lAP repeat-contain: rend, nieta sarc; 8.m. 
4S3966; AAB47843; H$.6271t; Hlgti mobQity ffXKip (nonhtston; tung, uter. blad. (est: ca-»6Jiv 
456546; AI690321: Hs.203845: KCNK1 5 potassium channel. suM over ; mAt>«s.m. 
456662: NM.00244d; Hs. 1 494; msh (DrosophQa) homeo tXR horn; utsr, over ; CTL 
457133; M54968; Hs.351221; v4<t^2 Kbslen ret sarcoma ; pm sm 
457489: AI693815: Hs.127179; crypfic gene; pane, pros. Imig; diag 
457819; AA057484; Hs.35406; FU20522 Hypothetical protein ; tung. cerv. headnk; mAtHdiag 
458079; AI796870; Hs.381 220; Hono sapiens slrnQar to RIKEN ; mala. Qao, safe; mAb 
456627; AW088642: Hs.97S84: SRT (sex determining region Y); ow. uter. test; CR 

TABLE 2B 

Pkey: Unique Eos protjeset identifier number 
CAT number qenedusternumbar 
Accession: Genfaenli accession numbers 

Pley CAT Number Accession 

414991 1785136.1 1778831 C17898 D78863 

TABLE 20 

Pley: Unique number corresponding to an Eos prebeset 

RbI: Sequence somoe. The? d^t mvrbersln ftris cdtumn are Genbank IdenlHier (GO nuntera. Dunham L et d.* refers to the pubDcaSon entilled The DNA 

sequence of human chfonusonie 22.' Dunham L et at.. Nature (1999) 402:469495. 

Strand: Indicales DNA strand ftom which axons were predicted. 

NLposiGon: Indicates nudeoBdeposlSons of predicted exons. 



Pkey 


Ref 


Strand 


NLPoslSon 


400843 


91^605 


Plus 


5863^70.7653-7784.8892-90233673-9807. 


402075 


8117407 


Plus 


121907-122035.122804-122921,124019.12416 


402901 


6894222 


Minus 


175426-175667 


404287 


2326514 


Plus 


53134-53281 


404662 


9797231 


Mnus 


40977-41150 


404675 


9801324 


Phis 


96588-96732.97722-97831 


404977 


3738341 


Minus 


43081-43229 


405033 


7107731 


Minus 


' 142358-142546 


406400 


9256298 


Phis 


1S53-171Z1878-2140.42S2-4365.5922-6077 


TaUs3A. C 


)iseaso IntScations and Pm 


iferred Utif 6es for Selected Genes 



TdUeSA provides prefisned disease indteaBors and pr8fiBrredtiti6^ These genes were IdenllM using BsMflymetrixGenechlp arrays. 

Ptey: Unique Eos probesetldenQlier number 

ExAocn: Exemplar Accession number 
UnlgenelD: Unigene ID number 
Unigene Title: Unigene gene fitia 

Disease: prefened diseases Indicated (or selected gene as described in table 1 and abbreviated as follows: biad (bladder diseases), angio (blood vessel diseases). EWS 
(bone diseases), glio (brain diseases), breast (breast diseases), ceiv (cervica! diseases), colon (cdorectat diseases), esoph (esophageal diseases), fibro (fibrosis 
diseases), headnit (head & neck diseases), lelo (leiomyoma diseases), leuk (leukocyte diseases). hepC (liver diseases), lung (lung (fiseases), ovar (ovarian 
diseases], endo (ovaiian endometrioid diseases), omuc (ovarian mucinous diseases), pane (paricreaSc diseases), pros (prostate diseases), renal (renal diseases), 
sarc (soft tissue and bone diseases), mela (stdn diseases), stom (stomach diseases), test (testicular diseases), uter (uterine diseases) 

Utility: preferred ulffiliee for selected gene as described in the texl and afabrevial^ GTL (DNA vacdne target), (teg (di^nosSc or piognosticta^^ 

(moROctonal anfibody targeQ, sjh (small moieode targeQ 

Picey; ExAocn; UnlgenelD: Undone TiHe; Disease; Uli&ty « 

100125; R02740; Hs.137555; putative chemotdne receptor. G; blad; mAb«$ja 

100131; 012485: Ns.11951; ectonudeotlde pyrophosphatase; breast; mAb 

100147; 013666; Ks.136348; perlosGn (OSF-2os); breast, colon, blad, lung, fibro, pane; mAb«diag 

100241; BE273S4a; hts.32S&3: cadherin 6, type 2. K-cadhedn; blad; mAb 

100299: 049493; Hs.2171; growth differenBaOon factor ; EWS; dr^ 

100335; AW247529; Hs.6793; ptatel^^ctivatlng factor ace; breast, lung, blad; sm 

100365; AI878927; Hs.79284; mesoderm epedfic transcripl {; colon, pros; dl^ 

100372; NM_014791: Hs.184339; KIAA0175 gene product; ovar. lung. cen^. pane; sm 

100405; AW291S87; Hs.82733; nidogen Z angio; ^ 

100420; D86963; Hs.1 18693; Meianoma associated gene; breast, pros. lung, ooion. angio; teulq dieg 

100448; AF234867; Hs.57652; cadherin, EGF LAG 8eveiH»ss G; breast; mAb*flm 

100452; D67742; Hs.241552: K1AA0268 proteb; pros; drag 

100559; NR/L000094; Ks.164Q; cdl^en. type \AI, alpha 1 (e; lung; CTI^m 

100654; A0375Q; Hs.18441 1 ; NM.00O47r:liomo sapteis albumk pros; diag 

100655: A03758; Hs.184411; En^ilficSily sdacted from AFFX; pros; diag 

100666: 105424; Hs.169610: 0044 antigen (homing function ; lung, breast; mAb 

100824; AI393237; Hs.1 93989; runkeiated transcription tec; ovar; CTL-^s.m. 

100930: J04129; Hs.82269; progestagen-assodated endomet; hmg; diag 

101063; D54745; HS.8Q247; diolecystokMn; pros. EWS; diag 

101097; BE245301: Hs.89414: chemoldne (C-X-C mofiO. recep; leuk, ovar. breast, btad; mAb*8m 
101104; AW862258: Hs.169286; neuropepSde Y receptor Y1; breast. EWS; mAb 
10119% BE247295; Hs.78452; sohite carrier bmily 20 (phos; angio; mAb^em 
101 193; U086t; Hs.15221 3; wingtess^pe MAfTV integraflon; Uad. hmg; diag 
101249; U8964; Ks.1904; protein Idnase C. tota;ov8r.8Jn. 
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101261; D30857; K5.823S3; protein C receptor, endotheOa; angb; mAtHsm 

101389; AW951430; Hs.7B888: dtszepam bbuftig inhiUtor (GA; pros; mAtHsm 

101431; 8E1B5289; Hs.1076; smaD proSno^ protein IB ; lung, btad; dlag 

101447; M21305; ; gb:Human alpha satellite and s; an^. btad; dlag 

101461; N98569; Hs.76422: phosphotipase A2. gnnip IIA (p; pros; dlag 

101485; AA296520; Ks.89546; seteclin E (endotbeOa) adhesi; pros, ovan mAb 

101506; J02931; Hs.62192; coagulaQon tactor 111 (Dvomb; pros; mAb 

101526; NM_002197; Hs^052d: aconltase 1. soluble; tung. colon, toadry^. pane; mAb 

101543; M31 168; Hs.2Q50; pentaxin-ielated gene, rapidly; angle, ovar; dlag 

101545; BE246154; Hs.154210: endothelial difrermliation, s; anglo; mAb«s.m. 

101560; AW958272; Hs.347326: intercellular adhesion molecui; angio; mAb 

101626; M57399; Hs.44; plelotrophln (heparin binding ; lung; diag 

101649; AW959908; Hs.1690: heparin^nding growth {actor ; lung, Mad; diag 

101714; M68874; Hs.21 1SS7; phosphoBpase A2, group IVA (c; angio: 8.m. 

101724; L1 1690; Hs.ig8689: bullous perr^Mgold anfigen 1 0 breast, pros, btad, hing; mAlnCTL 

101741; NM.003199; Hs.326198; transcription fector 4; angio; CTL-»sm 

101746; NM.001944: Hs.1925; desmogtein 3 (pemphigus vulg^ lung. btad. headnl(. cerv; mAb 

101759: M80244; Hs.184601; solutB canter family 7 (catio; hmg, gHo. blad. headnk; mAb«6.m. 

101791; M83622: HS.623S4; oeB division cyde 4#e: pros; em 

101604; M86699; Hs.169840; TTK protein kinase; ovar, lung, Uad. oerv; CTL-*6jn. 

101606; AA566a94: Hs.112408; SlOOcddunvfainding protein A; lung, breast. Uad; diiQ 

101809; M86649; H&323733; gap junction protein, beta 2. ; cobn. Uad. hmg, pane, headnk; mAb 

101839; AA446644; Hs.692; GA733-2 antigen; epitheTtal gl; ovar, pros; mAb 

101845; U88967; Hs.78867; protein tyrosine phosphatase. ; tung, glo, headnk, cerv; mAt>*s.m. 

101651; BE260964; Hs.82045; midkine (neurfte growth-promot; tung, blad, ovar, breast, pane; mAb^diag 

102009; 6E245149; Ks.B2643; protein tyrosine kinase 9; ovar; 8.m. 

102012; 6E259035; Hs.118400; singed (Drosophila)-tke (sea ; angto; diag 

102024; AA301867: Hs.76224; EGF-containlng fibuMXe ex; angio; dlag 

102048; U07225; H8.339; purlnergk: receptor P2Y. G-pro; blad; mAb 

102076; BE299197; Hs.179665; cydhHlependent kinase inhibi; pros; CTL-«sjn. 

102125; NM_006456; Hs.288215; siatyilransferase; breast, lung, ovar; s.ia 

102136; AA300576; HS.857G9: ackllc 62 kDa protein mRNA; ovan diag 

102151; T27013; H3.3132; sterokJogenic acute regulatory; ovan dlag 

102154; U17760; Hs.75517; taminin. beta 3 (nlceln (125kO; tung. btad, headnk; dag 

102176; AW178761; Hs.227948; serine (or cysteine} proteinas: blad; mAb^g 

102187; U20325; Ks.1707; cocaine- and amphetamir^'regu}; breast diag 

102193; A1036335; Hs.313; secreted phosphopmteln 1 (ost; ovar, tung, lifaro; diag 

102200; AA232362; h(&317432; branched chain aminotransferas; ovan s.m. 

102208; U22961; Hs.1 8441 1; gb:Human mlW clone wHh simii; pros; diag 

102211; 8E314524; Hs76776; putative transmembrane protein; breast, Uad; mAb 

102283; AW1615S2; Hs.83381: gttanlne nudeotldo binding pro; angle; CTL4s.m. 

102297; NM.001504; Hs.198252; 6 protelrvcoupted receptor 9; breast; mAb 

102304; AF015224; Hs.46452; mammagtobin 1; breast; dag 

102305; Ali)43202; Ks.90073; chromosome segregalton 1 (yeas; ovar, kmg, blad ; diag 

102348; U37519; Hs.e7539; aldehyde dehydrogenase 3 tamO; lung, blad; 9jm, 

1023B0; U40434; HS.15S981; mesothelin; ovar; dag 

102394; NM.003816; Ns.2442: adlsintegrln and metaltoprote; pane; 8.m. 

102455; U4870S; Hs.75562; discoidin domah receptor farr^; breast; mAb 

102457; NM.001394; Hs^59; dual spedSdty phosphatase 4; breast; s.m. 

102522; 6E2S0944; Hs.163556; solute canlerfamlly 1 (neutn pros; mAb 

102581; AU077228; Hs.77256; enhancer of zesto (Drosophita); btad. EWS, leuk; ClVs.m. 

102610; U65011: Hs.30743; preferentially expressed anSg; king, ovar; CTL 

102623; AW2492B5; H3.371 10; melanoma enttgen, family A, 9; lung, blad; mAb^CTl 

102869: U71207; Hs.29279; eyes absent (Drosopl^ homo); lung, pros; CTl*6Ja 

102696; BE540Z74; H3^39: Ibrkhead box Ml; lung, btad; sm 

102726; AB026187; Ks.374280; protocadherin 11; EWS; mAb 

102742; U79293; Hs.1 59264; Human done 23948 mRNA sequenc; breast ovar ; diag 

102745; AW753865; Hs.74376; olfactomedin reialed ER tocsfl; EWS; dlag 

102803; H48299; Hs.26126; ctaudin 10; ovan mAb 

102829; NAL006183; 1^.60962; neurotensin; hmg. ovar. headnk; dlag 

102836; U94320; Hs.158330; neuropepfkie Y receptor Y5; EWS; mAb 

102852; V00571; Hs.75294; corticotropin releasing hormon; Wad; diag 

102898; NM_0022O5; Hs. 149609; Inlegrln, alpha 5 (Gbrortedin; otglo. blad, tung, pros; mAb«s.m. 

102915; X07820; Hs.2258; matrix metaltoprotelnase 10 (s; angk), blad, lung, ovan mAb^dlag^SJn. 

102917; AI016712; Hs.287797; integiln. beta 1 (fibronectin ; angio; mAb 

102927; BE512730; Hs.65114; keraOn 18; ovan (9% 

102968; AU076611; Hs.154672; methylene tetrahydrofolate deh; ovan Mi 

102994; X51730; Ks^905; progesterone recepton blad; mAb^sjn. 

103003; AI910275; Hs.350470; trelbil factor 1 (breast cance; breast pane; diag 

103021; BE001596; H3.65266; integrfai, beta 4; lung btad; mAb 

103038; M13509: Hs.83169; noatrlx metaltoproteinase 1 (In; angb, coton. btad. lung, buk, ovar, headnk. fibro, pane, stom; mAlndiae^sjn. 
103037; BE018302; K3.2894; placentai growth factor, vasoc engio; diag 

103060; NM_005940; Ks.155324; matrix metaltoprotdinase 11 (s; breast hmg. ovar, pane; mAb4diag«8.m. 

103080; AU077231; Ks.62932; cycOn D1 (PRA01: parathyroid ; breast EWS; diag 

103095; NM.005424; H3.78824; tyrosine kinase with Immunogb; en^ mAb 

103111; NM.006103; 1^.2719; epklidymi&^pecific whey^; ovar. uten diag 

1031 19; X63629; Hs.2877: cadherin 3* type 1, P-cadherln; hmg. btad, ovar, colon, pros, pane, breast; mAI>*d]ag 

103206; X727S5; Hs.77367; monoidne Indueed by gamma bite; breast kmg; diag 

103210; X72925; Ks,69752; desmocOQIn 1; pros; mAb 

103280; U84722; Hs.76206: cadherin 5. type 2, VE-cadhert angb. libro; mAtnem 

103299; NM_005756; Hs.184942; G proteirvcoupled receptor 64; msr, mAlHeja 

103312; Y12642; Hs.31B5; lysosoma!; tong, blad; mAb 

103365; Xg0906: Hs.74126; fet^ add Unding protein 6, ; blad; (Seg 
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103408; NH.C01504: Hs.198252: G pn^ebKOupted receptor ft breast ; mAb 

103478; BE514982: Ks.38991 ; S100 cateiunvUndlng protsin A; tung. btad, headnk; dlag 

103587; 6E270266; Hs.82128; 5T4 oncofetal troptoblssi giyc; breast. Uad. lung; mAb 

103594; A136868(k Hs.816: SRY (sex detennMng legkm Y); lung. gSo; sun. 

103692; AW137912; Hs.227583: Homo sapiens chromosome X map ; angio: mAb^sm 

t03739; AA1 15173; ; gb2n30d02^1 Stralagene netuo; pros; sm 

103767; BE244667; Hs^8996; CO-lOO protein; angk); dteg 

103389; AA315993; Hs.105484; regeneraBng gene type IV; colon, omuc; mANdlag 

104052; NM.0024Dr; Hs.97644; mamm^loWn 2; ovar (flag 

104115;AF183810;Hs.261O2; opposite strandof: Irichort)); breast: mAb 

104252; AF002246; H5.210863; ceO adhesion molecute with t»; ovar. dag 

104301; AA768491; Hs,6783; hypoihefical protein FU22724; ovan dteg 

104308; N25117: Hs.355957; n*bosoma) protein 828; pros; dlag 

104394; AA1 29551; Ks.172129; Homo sapiens cONA: FU21409 6; coton; diag 

104542; R29657; ; gb:F1.1179D 22 weeitoW human ; pros; dlag 

104608; AF143867; Hs^588; ESTs. Moderately similar to S6; Wad; mAb 

104659; AW969769; Hs.100343; ESTs; EWS; di^ 

104660; BE^65; Hs.14846; Homo sapiens mRNA; cONA OKFZpS: liter, coion, pros; mAb 

104667; AI239923; H5.63931: dactishund (Drosophlla) homolog; bresBl, pros, colon; ^ 

1046B9: AA4204S0; Hs.380088; Plakophifin; hmg; dlag 

104691; WSSSOi Vb^44; Homo sapiens beta-1 adrenergic; pros. EWS; mAb«sjn. 

104755; T49951: Hs.9029; DKFZP434G032 protein: breast, colon: dlag 

104764; AI039243; Ks^78585; ESTs; aigio; dlag 

104786: AA027167; Hs.380438; KIAA0955 protein; an^; CTl^m 

104877; AI138635: Hs.22968; hitron of VEGFR; renal: diag 

104888; AW939591; Hs.5940; mucin 13. eidlheilal transn»mb; colon, s^ 

104919; AA028880; Hs^25% Homo sapiens cDNA FU13603 (is; breast cenr. uter: mAb^sm 

104943; AF072873; Hs.114218; fHzzled (Orosophila) homolog ; ovar, mAb^em 

104954; AW250651; Ks.26213; Human DNA sequence from done ; colon; diag 

104971; BE311926; Hs.15830; hypolhetical protein FU12891; Uad; CTl 

10S012; AF0981S8; Hs.9329; chromosome 20 open reading fra; colon, hmg. Uad; CTL 

105038; AW503733; Hs.9414; KtAA1488 protein; breast angio; CTL^m 

105039; AA907305; Hs.36475; ESTs; breast: dlag 

105093; AL1 37566; Hs.32405; Homo sapiens mRNA; cONADKFZp5; btad; diag 

105149; BE089288: Hs.8958; Homo sapiens cDNA FU1 2024 tis; pros; diag 

105175: AA305384: Hs^40: ER01 (8. cerevisiaBHiice; colon, hing; mAb 

105263; AW388633; Hs.6682: solute canrfer family 7. (caO; angio, hmg. ovar, blad, pane; mAb«s.m. 

105298; BE387790; Hs.26369; hypothetical protein FU20287: ovar, lung; dlag 

105301; AW352357; H8.7457: MAGE1 protein; EWS; diag 

105316; AI671245; Hs,24835; hypotheOcal protein FU14594: EWS; mAb 

105329; AA234561; Hs.22862; ESTs; breast pros; CTL^.m. 

105330; AW338625;Hs.22026: ESTs; similar toTRANSMEMBRAH; angle; mAb*sjn. 

105370; API 79274; Hs.22791 ; transmembrane protein with EGF; pros ; mAtnsjn. 

105500; AW602166; Hs.222399; CE6P1 protein; breast pros; dlag 

105503; AW963624; Hs.31707: ESTs. Weakly shiBlar to YEW4.Y; pros, breast colon: CTL^jn. 

105507; BE268348; K3.380963; CCR4-N0T transcription complex; colon; diag 

105516: AK001269; Hs.30736; hypotheScal protein FU10407: ovar; diag 

10S564: eE616694: Hs.288042: hypolheficai protein FU14299; breast; disg 

105645; AW294631: HS.35127Q; ESTs; pros; diag 

105715; BE621600; H5.29444; putative small membrane protei; colon; (Sag 

105743; BE248502; Hs.9598: soma domain. Immunoglobulin do; breast lung; mAb4e.ni. 

105746; AW151952; Ks.46679; tiypothelical protein FU20739: breast; CTL^m. 

105777; R42765; HS.2309&; ESTs; breast; d^ 

10578% H09748; Hs.57987: &«eil ClUlymphoma 11B (zinc ; EWS; CTL^sm 

105626; AA478756; H&194477; E3 ubiqultin iigase SmVFT; ai^io; s.mL 

105990; AI690586; H&29403; hypotheScal protein FU2206Q; breast; dlag 

106000; AW194426; Hs.20726; ESTs; breast; diag 

106012; AI240665; Hs.352537; ESTs; breast hmg; diag 

106014; AF123094; Hs.180566; mucosa associated lymphoid tis; leuk; diag 

106063; BE260415: Hs.34d198; hypothetical protein FIJ20262; pros; dlag 

105066; AW274357; H8.301406; hypoOietlcal protein PP3501; mela; Ca-*sm 

106111; AW875398; Hs.6451; PRO0659 protein; EWS; CTL^s-m. 

108124; H93366: HsJ567; branched chain amlnotransferas; angio; sm 

108155; AA425414; Hs.33287; nudear factor l/B; breast, pros, angb; diag 

105373; AW503807; H5.21907; histone acetyflransferase; breast: am 

106400; BE397649; Ks.27g807: Homo sapiens cDNA FU13634 lis; colon: (Sag 

108414; BE588205; HsuS827: mltogerv^tivated protein Mna; brest; sm 

106448; 242061; H3.27004; ESTs; pros; dlag 

106533; AU34708; HS.14599B; ESTs; EWS; diag 

106574; BE044325; H5.227260; U6 snRNArassodated Snv^ pr, colon; dlag 

106579; AA456135: H3.23023: ESTs; pros: diag 

106632; NM.014400; Ks.1 1950; GPI^chored metastasis^ssod; hmg. Uad, headnk; mAb^ 

106738; AW1 49266; H3.251 30; Homo sapiens d)NA FU14923 fis; ovan dag 

106793; H94g97; Hs.16450; ESTs; angio; diag 

106844; AA48S055; Hs.156213: spenn associated antigen 6; breast; mAb«CTL 

106906; AA861271: K3.222024: transcription factor BMAU hmg, Uad; diag 

106990; AA2B0722; Hs.24758; ESTs. Weaidy sImQar to 138022; breast; diag 

10703G; AI973016; Hs.15725; hypothetical protein SBB148: pros; dlag 

107102; AB037765: H3.30652; KIAA1344 protein; pros, breast; dlag 

107105; AW963419; Hs.155223; stanniocaidn 2; breast; disg 

107138; AV661958: Hs.8207; GK001 protein; breast colon; di^ 

107151; AW378065; Hs.6887; AQAMTS2 (a dislntegilivliitt a; breast ooton. ieuk, hmg. btad; mAb«dIag 
107216: D51069; Hs.211579; rnebnonra cell adhesion molecul; angio; dag 
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107248: AW^124; Hs.350547; nuclear receptor awepressort breast, ooton, pmsi mNb*Mi 

107284; NM.005629; Hs.187958: sduta carrier famay 6 (n^no; tung; mAI>^ 

107385; NM_005397; Hs.16426: podocalyxin-Cte; anglo; (Seg 

107901; L42612; Hs.335952; keraiin 6B; breast, Uad, hing; dlag 

107922; BE153855: Hs.61460: Ig superfamlly leceptor LNIR; breast Wad, hmg; mAb 

107932; AW392555; Hs.18878; hypoiheticaJ proteh aJ21620: lung; CTl 

108055; AJ404672: Hs.334483; hypothefical protein FU23571; breasl ovar. dlsg 

108059; S69002: Hs^34773: Homo sapiens cONA: FU222B1 ftovar. CTL'»6jt». 

108153; AW519204; Ks.40808: ESTs; prro; dlag 

1 081 86; AWD68579; Hs.7780; Horn sapiens mRNA; cONA OKFZpS; pros; dsg 

108242; AA062746; HsJ55244; gb3m03gl2^1 Stratagena coma; pms; diag 

108282; AA065142: ; gbamSOhll j1 Stiatagene Sbro; pros; dlag 

108505; AA083376; ; gb2n09g08£l Stratagene hf^ n; pros; diag 

108679; AA115963; Hs.323423; ESTs. Moderately similar toB ; pros; diag 

108895; AB029000: Ks.70823: KIAA1077 protein; breast cdon. lung; dlag 

108732; AA25B888; Hs,107476; ATP synthase. transporting,; pros; s.m. 

108778; AF133123; Hs.90847; general transcription factor 1; ovar; diag 

108828; AK001693: Hs.273344; DKFZP564O0483 protein; breast; di^ 

108860; AA133334; Hs.816; ESTs; lung; sm 

109001; AH)56548; Hs.72116; hypothetical protein FU20992 ; anglo; ai-Kfiag 
109032; AI219207; Hs.72222; hypothetical protein FU13459; blad; CTL 
109077; AI732617; Hs.182382; ESTs; Wad; diag 

109112: AW419196; Hs^924; hypothetical protein FU13782; breast pros. Wad; dlsg 

109141; AF174600t Hs.6978; ESTs. Highly similar to AF1746; colon: CTL+am 

109166; AA219691; Hs.73625; RAB6 interacting, Idnesin-Uce; hing, blad. breast colon, ovar. headnlt. EWS; 8.m. 

109220; AW958181; Hs.189998: ESTs; pros; di£g 

109273; AA375752; Hs.348140; Homo sapiens mRNA; cDNA DKFZpS; breast; diag 

109292; AW975746; Hs.188662; K1AA1702 protein; breast; diag 

109454; AA23225S; Hs.295232; ESTs, litoderately similar to A4: oven diSQ 

109456; AW956580: Hs.42699; ESTs; angio. pane; diag 

109514; AA234087; Hs.262346; ESTs, Weakly sbnllar to 872482; breast ; diag 

109530; AA908645: Hs.19597; K1AA1694 protem; pros; CTL-«S.m. 

109648: H17800; Hs.7154: ESTs; ovar; diag 

109680; AB037734; H3.4993: K1AA1313 protein; ovar; (Bag 

110009; BE075297; Hs.6614: ESTs. Wealdy similar to A43932: breast colon; diag 

110151; H18835; Hs.31608: hypothetical protein FU20041; pros. EWS; diag 

110156; AA581322; Hs.4213; hypolhetica! protein MGC16207; lung, blad; diag 

110240;A1668594:Hs.l76588;ESTs,WeaMystmitartDCP4Y H; breast; di^ 

110278; AF061573: Hs.19492; protocadherin 8; EWS; mAlH6.m. 

1 10675; H89355; Hs.249159; adrenergic, 8lpha-2A-. recepto; pros; mAb^am 

110728; AA737106; HS.322S0; ESTs, Moderately similar to 17; EWS; sm 

1 10844; AI740792; H5,167531: methytcrotonoyl-Coenzyme Acan pros, pros: sjn. 

1 10915; BE092285: Hs.29724: hypothetical protein FU13187: breast, pros; dlag ' 

110971; AI760098: Hs.21411; ESTs; pros; dlag 

111157; AL109729: Hs.99364; putativa transmembrane protein; pros; mAb«6.m. 

111179: AK00O136; HS.1076Q: espoiin (U^ class 1): breast colon; CTL^sm 

111185; AJ245671; H8.12844: EGRtike-domahi. muHIpts 6; ovar. blad; mAb^diag 

111223; AA852773; Hs,334838: KIAA1866 protein; breast cdon, lung. EWS; mAb 

111299; AB033091: Hs.355925: KIAA1265 protein; ovar diag 

111357; BE314949: Hs.87128; hypothetical prot^ FU23309; breast dlag 

111384; N94606: H$.288969: HSCARG protein; breast; dl^ 

111900; AF131784: Hs^5318: Homo sapiens done 25194 mRNA; breast; (Sag 

1 1 1929: AFOZ7208: Hs.l 12360: prominin (mouseHIke 1 ; colon, breast fibro; mAb 

112134; R41823;Hs.7413: ESTs: cdIsyntenln-2; breast EWS; diag 

112244; A6Q29000: Hs.70823; iaAA1077 protein; breast colon, blad. hmg: diag 

11228a AA863360; Hs.26040; ESTs. Weakly sbtliiar to fatty ; breast; sm 

112283; L14561: Hs.20952; ATPase. Ca^ transporting, pla: ovai; mAb 

112287; AB033064; Hs.236463: KIAA1238 protein; breast; diag 

112971; Z42387: Hs.83883: transmembrane, prostate androg; colom pros, pns; mAtnem 

113003; AW292315; Ks.7215; ESTs; EWS; diag 

113021; AL122055; Hs.129836; KIAA1028 protein: pros; sm 

113047; AI571940; Hs7549: ESTs; breast cohxi; dlag 

113073: N39342: Hs.103042: mlcrotubute-assodated protein; pros; CR+am 

113168; AW002393; Hs.337629: gbnwu61d05Ji1 NCI.CGAP.GC6 Horn; ovar, dlag 

113195; H83265: Ks.8881; ESTs. Weakly simflar to 841044; snglo, hing; diag 

1 13230: T61430; ; gbYc06a034l Slrat^ene hir« ; blad; diag 

113361; T79589: Ks.63325: trensmembrane protease, serine; ooton. Uad. lung, ovar, pane, headnk; mAb^diag-»Gjn. 

113374; T79925; H5.269165; ESTs, Weakly similarto AUUl Jt teUk; dag 

113443; AW083920:Hs.16098:cl3udin 2; colon, pane; mAb ' 

113471; AI765890; H3.16341: MAWD binding protein; pros; dlsg 

113490; BE17B110; Hs.173374: Homo sapiens cONA FU10500 tls: colon: diag 

1 13950; AI267652: Ks.246107; Homo sapiens mRNA; cONA 0KFZp4; breast pros ; diag 

1 13970; W27249: Hs.810g; hypoOieiical protein FU21080: breast lur«, stont titer, diag 

114124; W57554; HS.12S019; lymphoid mictear protein tJLf^: breast dlag 

114251; H15281; Hs.21948; ESTs; breast; dlag 

114292; A1815395; Hs.184641: fatty add desahirase 2; breast; sm 

1 14334: AB0377B4: Hs^l; K1AA1383 protmn; ovar; di^ 

1 14407; BE539976; Hs.103305: Homo sapiens mRNA; cONA DKFZp4; breast colon, lung; di^ 
114452; Ai36927S; HS.243010: Homo saptens cDNA FU14445 fls; oigto; dlag 
1 14460; BE066778; Hs.151678: UDP-N-acetyMphaOgSbctosac breast; sm 

1 14531; AA053033; Hs.63325: transmembrane protease, serine; oolan, Uad. lung, ovar, pane; headnk; mAlHdtemra 

114540: AI904232: Hs.75323; prohibiHn; breast; diag 

114542; AW97012B; Hs.91011; anterior gradient 2 (Xenepus 1; breast pms; diag 
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114587; AF086009; H5.296398: gb:Horoo saptens ftiB tenglh In; colon; dlsfi 

114724: R64730.comp: Hs.155986; DEAD« (As|>GiWU»AspA&) be ow. CTL-^SJn. 

114768: AF212848; Hs.182339: ets honwiogousfecton pros, 
- 114798; AA159181: K5.54»0; seroteglcafly defined colon ca; pros; CTL*S4a 
0 114908tAA454985;Hs.54973;cadheitfva9prot9{nN^ pros:(Sao 

114916; BE165762: Hs^518; hypoMca) pntdn from 8CRA; pros; diag 

11496S; A)733881; Hs.72472: NAME OMITTED ... receptor kinase; bre^ mAb 

DSS? flap lundton prot^ beta 5 1 hing, Wad. headnk; mAb^wn. 

^ 115221:AW365434:Hs.79741:hypotheticaIproteInFU10116;ovar:<liag 
1 U 115239: BE251328: H3.73291; hypolhelteal protein aJ10881: colon; diag 

115291; BE54S072; Hs.122579: hypothetical protein FU10461: ovar. tuna CTL-»s.in. 

115412; AW131 168; Hs.372382; ESTs, WeaWy similar b 138022; pros; diag 

1 Scr^ ovar, bng. Wad. headnk. pane. ceiv. storn. uter. ^ Ol^ 

^ 115674; AW992356; Hs.380760; Homo sapiens pyruvate dehydrog; ova; sm 

ID 115B75;VV87707:te.82055;inlBrteukin6signaltrBnsduc8;brBast,pfDs;mAl>tiu^ 

115683; AF255910; Hs.546S0: Junctional adhesion nclecute 2; angio, gfio; mAb 

•Sl?ilS^.?*'^° P^^' pane, headnJq mAlHdiag4s.nL 

115819: AA486620: Hs.41 135; endon)ucln-2; angto; diag 

115827; AA428000: Hs^72; ac6n related protein 2/3 comp; ando; dl» 

115844; AI373082: H5.332838: hypothetical frotein MGC5370; ptos; dlsg 

^V;^^ Hs.184942: G pfotelwxnipted receptor 64; ovar. EWS. uter. irtAtHsm 

115892; AA291377: Hs.50831; ESTs; ovar, Wad. hmg; dag 
^ 115909;AW872527;Hs.59761;EST8,Weakly5lmilartoDAP1 H; ovar. lUna: diao 
25 115947; R47479;Hs.94761;KIAAl691prolBh; colon: diag ""^ 

\ ISS' «S ^ differentially expms; lung. btad. biaast. pros. ovar. headnk; CTL 

116003; BE276469: Hs.66493; Down syndrome critical region ; colon: mAb 

1 1601 1; AU59053: Hs.576B4: Homo sapiens mRNA fuO length ; breast: dfeg 

116028; H59799;Hs.42644;lhiofodoxin4ike; ovar. king: diag 

116107;AL133916; Hs.47860; hypothetfcal protein FU20093; lung, breastdiag 

116202: BE159395; Hs^092: ESTa; prosidiag 

116238; AV660717: Hs.47144; DKFZP586N0819 protein: ovan diss 

116301; AW969706: Hs.293332; ESTs; EWS; diag 
^ ^ 116334; AL038450; Hs.48948; ESTs; pros; dsg 
JD 116335; AK001100: Hs.41690: desmoconin 3; hing. blad. headnk; diag 

116393; AI972402: Hs.305051; hypothetica) protein M6C2648: pros; din 

116399; AA889120: Hs.110637; homeo box A10; pros; CTL^sjn. 

116401; AW893940: Hs.59898: ESTa; ovar; diag 

116416; AW753676;Hs.39982; ESTs: ovar; diag 
W 116470; AI272141; H8.351928; SRY (sex determining region Y); coion. breast, angio. btad; diag 

116483; AI346201; Hs.761 18; ubiquHin carboxyt-tamlnaJ es; angio. lung; s.m. ' 

116610: D80449; Hs.184841; ESTs; pros; diag 

1 16732; AW152225; Hs.165909; ESTs, Weekly sNIar to 138022; coton; diag 

liSfSfiSS^^v"^^^' HomosapienscDI^ FU14415II8: pros, breast, cokui, proa; mAb 
**J 116962; H79877;;gb.7u76g10,sl Scares fetal Ih^ pros; diag 

117027; AW085208: Hs.l30093; ESTs; breast ; di£ 

117280; M18217; Hs.172129; Homo saptens cDNA: FLJ21409 fi; breast, coton, pros; diag 
117284: AK00170l;Hs.183779: Homo sajrfenscDNAFUlOSgO lis: pros; diM 
<;n ]2S^^SS5S?= Hs.211092: LUNX protein; aUrC{palaJehj; kino;mAtHdiag 
117367; Ai041793:H5.42502; ESTs; breast :dl^ 

1 1741 2; N32536; Hs.42646; solute carter famDy 16 (mono; breast, ovar ; mAlnsjn. 

117425; AK000028; Hs.356100; ribosoroal protein S24; pros; diag 

1 1 7563; AF055634; Hs.44553; unc5 (Celegans homolog) c; leuk ; diag 

]]5S^ N35020:Hs.44685; C3HC4^ zinc finger protein; EWS; CTLn^sm 
00 117921: AA021459; Hs.306480: Homo sapiens mRNA; cDNA DKFZp7; pros : diag 

117984; AF189723: Hs.106778; ATPase. transporting, type pros, breast o3on: mAb 

118049: N53145: ;flb.7v55f09.8l Soarasfela!Bv:proSd59 

118314: N4e580: H5.46692: ESTs; Wad. lung; drag 
A(\ ""-^^^^J *^°21: breast, ovar, blad, pros; CTL-^sm 

OU 1 18368: N64339: H3.48956; gap Junctkin prot^, beta 6 (; king, blad; mAb 

1 18417; Af 080229; ; gb:Human endogenous letrovinn; pros; 8.ia 

118472; AL157545; Hs.173179: bromodomrfn and PHD linger con; breast; diag 

118511; N75620:H3.43157: ESTs; angio; diffl 
^ - 118901; AW292577: Hs.94445: ESTs; breast; diag 
0^ 118905; AW973708; Hs.201925: Homo sapiens cONA RJ13446 lis; breast; dla 

1 19018; AABS1 143; Hs.278695; Honro sapiens proslein mRNA. co; pros, pros; 

119036; R95B72;H&.117572;chemoklne binding protein 2; breast ovar; mAb 

erythroWaslosfa vinis E; angto, pros; Cn,^ 
„^ 119082; AF252297; Ks.91546; cytochrome P450 retinoid metab; EWS: dlan 
/U 119126: R45175:Hs.117183; ESTs; pros, breast coton: diag 

119279; N57668;Hs.4802as EST; breast; diag 

119307; BE046061; Hs.37054; ephrl^A3; cobn, breast; mAtwfiag 

119478:AI624342:Hs.179082; ESTs: breast; diag 
„ 119617; AA516531: Jte.55999; NK homeobox (Diosophlla). faml; pros; dia 
/D 119743: AA947552; Hs.68086; branched chdn aminotransferas: ovar; sm 

119771; AI905687; H8.348419: AI905687:IL^T095-190199^19 8; breast diag 

]TS^ 3-^^ Hs.191381: hypotheScal protein; over, hmg: CTt^sm 

119789: BE393948; Hs,50915: kanikrein 5; ovar; diag 
Rn ^ proteln^pled receptor 87; lung. blad. headnk, cerv; mAlnsm 

oU 119940:AL050097;Hs.272531:DKFZP586B0319prolein:pro5:dIag 

I'SS^t^^^ H5.180479: hypotheficai protein FU20116;coton. hing:dlag 

120132; W57554; Hs.125019; lymphokl mjdear protein (LAF-: pros, breast to 

120147; AI917116: Hs^l; hanogtoUn. beli EWS; diag ' *^ 
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120206; K26735; Ks.91668: Homo sapiens done PP1438 unlav breast : 

120242; AWSG9587; H5.86366; ESTs; biad; 6^ 

120328; AA323278: HS.230SD5; ESTs, WeaMy sbnBar to protaa; pros; sjm. 

120438: AW01S242: Hs.99488: ESTs, WeaMy slirilv to YK54jr; ovan (fiao 

120471: AA2S1944: Hs.104056; CGI-29 prat^; ooton; dbg 

120485; AW368377: Hs.137569; tumor protebi 63 kDawUh stro: tung. biad. headnk; dlag 

120588; AA703226; Hs.161d3; Homo sapiens mRNA; cDNA DKFZpS; pros; diag 

120624; AW407987; Hs.173518; Mi)h8S6 phosphoprotein ttomolog; breast; sm 

120B30E A1568170: HS.968B5: ESTs; EWS; dbg 

120977; AA39815$; Hs.97600; ESTs; breast, ovan diag 

121027; A15724g0; Hs.99785: Homo sapiens cDNA: nJ21245 fi; blad; mAb 

121231; AAB14948; Hs.96343; ESTs, Weakly similar to AUJC_H; EWS; diag 

12133S; AA404418; ; gbzw37e0Z8l Soares_totaLfet; angb; diag 

121362; AF05D147; Hs.97932:chondiomodutin I precursor; EWS; mAb 

121457; W07404; Ks.102558: hypothef!c&) protein FU22055; cdon; diag 

121619; AAS28339; Hs.178062: ESTs. Weakly similar to pfiospli; EWS; sjn. 

121710; AF163474; Hs.96744; prostate androgen^lated ir. pros; diag 

121721; AL047051 ; Hs.199961; ESTs, Weakly similar to ALU7_H; pros; diag 

121723; AA243499; Hs.104800: hypolheOcai pnM FU10134; breast; diag 

121748: BE536911; Hs.234545; hypotheika! protein NUF2R; breast diag 

121779; AW513143; Hs.98367; SRY (sex detenMg region Y); over, diag 

121791; AA815378: Hs.293317; ESTs. Weakly simBar to GGC1 H; blad, headnk, luRg,ovaf; mAtnCTL 

121792: AW969726; Ms.98381: ESTs. We^ similar to serine; EWS; diag 

121913; AI249368; Ks.98558; ESTs; protease inhibitor 15 (: breast, pros; s.m. 

121920; AA428300: ; gbzwieb02^1 Soares ovary turn; ovar, uter, cerv; dlag 

122041; AA677577; Hs.380213; Homo sapiens Chromosome 16 BAG; pros; dlag 

122520; AW951324; H5.173609; pregnancy spedlte beta-1-glyc; colon; diag 

122797; AJ251027; Hs.99526; odorant-binding protein 2B (08; bieast; diag 

122802; AI687303; Hs^5529; G proteirvcoupled receptor 49; ovar, uter; mAlHsm 

122969; AW8212S2: Hs.104336: hypothetteal protein: ovar; drag 

123005; AW369771; Hs.367688; Integito. beta 6; ovar, hmg, headnk, glioc fnAb4s.ni. 

123044; AK0O1O35; Hs.130881; Been CtiAymphoma 11A (zinc : lung; dlag 

123137; AI073913; Hs.1 00686; ESTs, Weakly similar to JE0350; breast, colon, ovar, uter, hing. etom; dlag 

123158: AF161426; K&218329; hypothetk:al protein; breast; diag 

123160: AA488887; Ks.284235; ESTs, Weakly similar to 138022; lung; diag 

12316% A19S0087; Hs.369628; obwqOSc02J(1 NQjCGAPJ<kJ12 H; ovar. diag 

123209; AWg68543; Hs.203270: ESTs. WeaMy sfanSar to ALU1_H: pros; diag 

123308; 014187; Hs.157208; ESTs; EWS; dlag 

123339; AW1 88464; Hs.101515; ESTs; ovar; dlag 

123475; BE439553; H3.12329: Homo sapiens, dtons IMAGE:4098; pros; diag 

123494; AW1 79019; Hs.112110: mitochondrial rOnsomal protei; ovan diag 

123520; AA6085S0; ; gb:ae53d1Z8l Stratagene tung ; pros; sm 

123S33; AA6087S1; ; gb:8e56h07^1 Stratagene hmg ; coksn; (Dag 

123619; AA6029&4; Hs.366318; gb:no97c02^1 Na_CGAPJ>r2 Horn; breast; CTL-«sm 

123689; AA399323; Hs.285130; Homo sapiens pinch>2 protein m; ovar, diag 

123709; AA706910; Hs.112742; ESTs; breast; diag 

123829; AF251237; Hs,112208: XAGE-1 protein; lung. blad. test; CTL 

123972; T46848; Hs.70337; immunogtobulin supeifamlly. me; ovar; diag 

124006; AI147155; Hs.279727; ESTs; homologue of PEM^ (aon; breast, angk), lung, ovar. EWS; diag 

124059; BE387335; Hs^13; ESTs. Weakly similar to S64054; breast, ooton, blad, lung: CTL-»diag 

124153; AU077333; Hs.160463; erythrocyte membrane protein b; pros; mAb 

124352; AA640891; Hs.102406; ESTs; breast, pros, ovar, lung; diag 

124526; N62096: Hs.293185; ESTs. Weakly similar to JC7328; pros; mAb^ 

124579: A1693aiS: Hs.127179; cryptk: gene; pane; diag 

124777; R41933; Hs.140237: ESTs. Weakly sfanBar to AtaJ1_H; pros, breast; dlag 

125103; AA570056; Hs.1 22730; ESTs, Moderately simBar to Kl; colon; mAb 

1251 54; W38419: ; 0b2c78a07^1 Pancreatic islel; ovan diag 

125250; W26S24; HS.3S6686; protein phosphatase 4 regulato; ovar; COtjSja 

125266; W90022: Hs.186B09: ESTs, Highly sbiiitar to LCT^H; angto; dlag 

125453; BE385523: Hs.18048: melanoma antigen, family A, 10; blad; mAlnCTL 

125666; AL390172; Hs^17432; Homo sapiens cDNA: FU21270 fi; ovar; diag 

125770; AA143045: Hs.61665; v-Ml Haidy-Zuckerman 4 feline: EWS; diag 

125976; AA436760; Hs^2; gbczy67d11.r1 Soares_tot^fel; pros; dbg 

1 26399; AA088767; Hs.83B83; transmembrane, prostate androg; pane; mAb«&in. 

126645; AA316161: Hs.61635: sh transmembrane e^MsA a; pros, breast, lung, pane, headnk, EWS; mAb«CTL 

126756; AI559444: Hs.104679; ESTs; pros, breast; mAb 

126799; AW753865; Hs.74376: oltedomedin related ER hjcaTi; EWS; dbg 

126872; AW4S0979: ; gb:U)>H4l3^a«1 2-04JU1 N; bted; dbg 

126892; AF121856; Hs.284291; sorting nexin ^ ovan diag 

126960; AL390172; Hs.317432; branched chain ambiotransferas; ovar; sun. 

126966; R38438; Hs.1 18747; solute canter family 15(H-»/p: pros; mAb 

127003; AW816515; Hs. 173540; ATPase, Ctass V, type 10D; pros; mAb 

127221; BE052109; Hs.241S51; chloride channel, cakhim acB; hjng, bbd, headnk, cerv; mAb«sjn. 

127240; AJ005663: Ks.86998; nuclear factor of acfivated Ts pros: CTL-^ja 

127425; AF183810: Hs^6102: trtohortdnophabngeal syndrome: breast; mAb 

127479; D31 152: Hs.179729: collagen, type X. alpha 1 (Sch; breast hmg, headnk. pane; dbg 

127537; AI926047; Hs.162859; ESTs; pros; dlag 

127664; AA8061&4: H$.116502; ESTs; EWS; diag 

128046; AA873265; Hs.357313; gbx)h68h05.8l Na.CGAPJQdS Ho; pros, breast, coton; diag 

128305; AI954988; H8^65706; matrix Qa protein; breast; diag 

128478; AA708205; Hs.100343: ESTs: EWS; CTL-^m 

128515: BE395085; Hs.334762; type I transmembrane protein F; pane; mAb 

128595; U31875; Hs.272499; short^:haIn abohd dehydrogen; btad. breast; CTL-«8jn. 
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I2B610; 1448373; Hs. 10247; acthtited bucoqfle ceQ adhes; bmasl pres. lung, ovar, diag 

1 28734; AB0O83S0; Hs.104570; kaOikrein 8 (neuropsin/ovasin; ovan diag 

128790; AF025692; Hs.105700; secretad filzztetketated pret breast odon. pros, ovar.uter, panq (Sag 

128797; NIil.002g75; Hs.105927: stem cell growth factor lymph; EWS. (euk; diag 

128854; BE159181; Hs.168232; hypdheflcal prot^ FU13855; breast; diag 

12892S: RB7419; Hs.21851; Homo sapiens cONA RJ12900 lis; breast; diag 

128949^ AA009647; Hs.352537; a dislnlegRn and mataOopreta; breasts ovar, headnk. pane; mAb^diag^m 

128969: Z42047; Hs.107479; Homo saptens PROZ751 mRNA. conx pros; dag 

129041; BE382756; Hs.169902: sdule carrier (anvly 2 (fadl; hing, btad; mAlHsm 

129097; BE243933: H$.108842; zinc finger protein 22 (KOX 15: ovan CTl^sjn. 

129099: AF146074; Hs.1 08680; ATP-binding cassette. siibM lung, Mad. tieadnk ; mU>«&m. 

129184; AW161450; Hs.109201; CG)-86 protein; pros; mAb 

129260; AF077200; Hs.279813; hypoOietica! protein; colon; diag 

129284; AA318224; Hs.296141; ESTs; colon; di^ 

129362; U30246; Hs.110736; sohtte canler MIy 12 (sodi; colon, breast, pros; mAb 
129366; BE220806; Ks.184697; Homo sapiens done 23785 mRNA ; breast; diag 
129389; NM_012445: Hs.288126: spondin 2. extracenular matri; colon, pros; diag 
129404; A)267700; Ks.351201; ESTs; pros, ovar, lung, blad, headnk, pane; diag 
129466; L42583; Hs.334309; IteraQn 6A: tung. Uad; diag 
129482; AA188165; Hs.289043; spindHn; breast; diag 
129534; AK002126: Hs.11260; hypothetica) protein FU11264; pros; di^ 
129571; X51630t Hs.1145; VWIms tumor 1; ovar; 

129505; AF061812; Hs.115947; Keratin 16 (focal non^pidemio; hmg, Uad, headnk; diag 

129620; 079338; Ks.239720; CCR4-N0T transcripSon complex; breast anglo; diag 

129628; U36945: Hs.1 174; cyciin-dependent kinase inhibi; tung, Uad, ovar. headnl^ sm 

129650; AF10929d; Hs.1 18258; prostate cancer associated pro; pros, EWS; diag 

129689; AVyr748482; Hs.77873; 67 homok)g 3; breast; diag 

12970% BE38866S; H8.179999; Homo sapiens, ctons IMAGE:3457: EWS. leuk; diag 

129720; AA156214; Hs.12152: APMCF1 protein; breast; diag 

129750; AF056085; Hs.1 98612; G protelrvooupled receptor 51; angta blad; mAb«sjn. 

129869; AI222069; Hs.13015; hypothetkal protein similar t; breast diag 

129912; AF155096; Hs.107213; hypothetica] protein FU20585; ovar; Ca^.m. 

129936; AJ^17; Hs.1355; cattepsln E; blad; smnfiag 

129953; AA41219S; Hs.13740; ESTs; breast; diag 

129977; NM»000399; Hs.1395; eaily growth response 2 (Krox-; EWS; CTL-*sjn. 

130010; AA301116; Hs.142838; nucleolar phosphoprot^ Nopp3; ovar; diag 

130057; AF027153; H8.324787; solute earner family 5 (inosi; breast; mAb 

130095; AK001635: Ks.14838; hypotheScai prot^ RJ10773: breast diag 

130155; AA101043; Hs.151254; kallikrein 7 (chymotryptk:, st ovar. (fiag 

130181; AF052119; KS.15160B; Homo sapiens done 23822 mRNA ; pros; diag 

130184; H58306; Hs.15165; retlnoic add induced 14; anglo; (Sag 

130262; D63216; Hs.153684; bizzled-retated protein; pane, EWS. stem, renal; diag 

130343; AB040914; Hs.278628; KIAA1481 protein; breast diag 

130376; R40673: Hs.1 55174; C0C5 (ceO dMsfon cyde 5. S; ovv; CTL^m 

130385; AW067800; Hs.155223; stannbcatein 2; breast lung: mAt>iifiag 

130455; D90041; Hs.155956; N-acetyltransferase 1 (aryfaml; breast 8.m. 

130511: 132137; Hs.1$64; cartHage oQgomerIc matrix pr; breast ovar, diag 

130558; BE564937; Hs.15984: pp21 homolog; pros; CTL^»6in. 

130577; M69241; Hs.162; insuiln-llke growth factor bin; ovar; diag 

130604; AA383256: Hs.1657; estrogen receptor 1; breast mAb^jn. 

130827; BE003054; Ks.1695; matrix metdltoproteinase 12 (m; lung, cdon, blad. headnk. ovar, pane; mAb*dIag^m 

130637; AA356764; Hs. 17109; Integra! membrane protdn 2A; EWS; mAb*s.m. 

130648; AI458165: H3.17296; hypothetica] protein MGC2376: colon; diag 

130867; 6E246961; Hs.17639; Homo sapiens ubiqi^n protein; breast sm 

130690; A6006625; Hs.139033; paternal^ expressed 3; ovan diag 

130714; A134d274; K8.18212; DNA segment on chremosoroa X (u; breast; diag 

130760; AW379130; Hs.18953; phosphodiesterase OA; pros; CTL-^s.nu 

130800; AI187292: Hs.ig574: hypothetical protein MGC5469; cokin. lung; (Sag 

130839; AB011169; Hs.380875; sImQar to S. cerevisiaa SSM4; anglo; diag 

130844; U7624Q; H8.20191: seven bi absentia (Drosophlla}; breast diag 

13089% AL120637; Hs.20993; h^h-ghicose-reguiated protein: breast CTL^m. 

130941; NM_000669; Ks.2142; 5-hydiQxytryptaniine (serotonin; ovar. mAb 

130967; AA393071; HS.182S79: leudne amlnopepSdase; over, sjn. 

^mrz mm Ks.374468; Homo sapiens mRNA; cONA DKFZpS: angto; diag 

130987; BE613269; Hs.21893; hypothellca] protein OKFZp761N; ookm; diag 

131046; AA321649; Hs.2248; smaO Indudble cytokine subfa; breast tung, blad, ovar, fibre; diag 

131080; NM_001955; Hs.2271; endotheiln 1; angb; diag 

131083; Y09763; Hs.22785; gamma^nobutydc add (GABA); pros; mAb 

131 148; AW9S3575; Hs.303125; p53^duced protein RGPC1; breast colon, angio; diag 

131216; A1815486; Hs.243901; Homo sapiens cDNA FU20738 fls; cdon. breast diag 

131 228; AW207469; Ks.24485; chondroHin sidfate proteoglyc; ovar. diag 

131244; A1638429: Hs^4763; RAN binding protein 1; tung, blad. headnk; CTl-tsjn. 

131288; AA278482; Hs.2S328; ESTs, Moderately sln^ to AL; pros; diag 

131 289; AA296696: Hs.333418; FXYD domdrKontalning ion tra; colon; diag 

131307; NM-000025: Hs.2549; adrenergic, bet^. recepton EWS; mAb 

131313; R96290: HS.75B74; ribosomal protein L44: EWS; diag 

131492; Ai452601; Hs.268889; nuclear receptor subtamBy 2. ; pros; mAb^sjn. 

131544; AU55715; Hs.28555; programmed ceQ death 9 (P0C09; breast diag 

131559; AL078599; Hs.10764: hypothetical protein aJ20O37; breast diag 

131564; T93S0O; Hs.28792; Homo saj^ cONA aJ11041 Gs; breast dtog 

131603; X81334; Hs.2936; matrbc metafloprotetnase 13 (c; blad; sm 

13164$ AW410601: Hs.30026: HSPC162 protein; breast diao 

13173% AF017986; K8.3138G; secreted frtzzled^elated pret breast mAlHS jn. 
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131817: U20536; Hs.3280: caspase 6, epoptosls-ielatfid c; 6biQ, beast cerv. hmg. Uad. {oik; gOoc 

131885; BE502341; HS.34Q2; ESTs; breast diag 

131919; T1 S803; Hs^72458; piot^ phosphatase 3 {fonrwl; pros, breast; S-nu 

131925; AF151048; H5.ie3180; anaphase promoOng complex sub; breast; diao 

131965; W792B3; HS.35S62; ESTs; hmg. ovar; diag 

131985; AAS03020; H$.36563; hypotheticel protein aJ2241B; breast ovar, (flag 

132050; At267815; Hs.38022; ESTs; angb; diag 

132173; XS9426; H5.41716; endotheOal oelKspecffic mole; angb; iSag 

132180; NM.004460; Hs,418; fibroblast acthraHon protein,; colon, pane, esoph; mAb 

132191; AA507576; Hs.268361; Homo sapiens cONA: FU2269S tJ; ovar, diag 

132349; AW975654; Hs. 181286; serine protease inhibitor. Kaz; pros, btad; 8.m. 

132354; BE1BS289; Hs.1076; small protine^ch protein IB ; tung; diag 

132358; NliU)03542; Hs.48423; H4 htstone famHy. member G; pros; CTL^ 

132371; AA235448t Hs.222088; PRO2000 protein; breast; diag 

132454; BE2S6227; Hs.250822; serine/threonine kinase 15; biad. breast; 8.m. 

132490; NM_001290; Hs.4980; UM dom^n binding 2; an^ diag 

132520; AA257992; H&50651; Janus Idnase 1 (a protein EWS; sm 

132528: T78738; Hs,50758; 8MC4 (sbucbirel maintenance o; ovar, CTL^sm 

132543; BE568452: Hs.344037; proteb regutaior of cytoldnes; colon, tung: dtag 

132572; AI929859; Hs^37825; signal recognition paitido 72; ovar; diag 

132592; AW603564; Hs.288850; Homo sapiens cDNA: FU22528 fi; colon: diag 

132624; AA326108; Hs.33829; bHLH protein DEC2; ovar; diag 

132632; AU076gi6; Hs^398; guanine monphosphate synthetas; ovar. lung; sm 

132669; W38586; 83^0933; guanine nucleotide binding pro; colon; diag 

132710; W74001 ; Hs^279; serine (or cystine) proteinas; lung, btad, colon, headnli; diag 

132725; NM>006276; H5.1B4167; spTnang tactor. argininefeeri;ovar; CTUsjn. 

132767; BE1B2592; Hs.11261; smail proGne>rich protein 2A; hjng; diag 

132791; AB029S51; H3.7910; R1NG1 and YY1 binding protein; pros; CTL^.m. 

132837; AA370362: H3.57g58; EGF-TM7-latrophiiin-related pr. angio; diag 

132856; NKL001448; Ks.58367; glyplcan 4; breast colon, pros; mAb 

132888; NM_005476; H8.5920; UDP-N-acetylgiucosamlne-2-eplm; pros; 8.m. 

132902; A1936442; Hs.59838; hypothetical protein aJ10808; colon; diag 

132939; A6009284; Hs.61152: exostoses (mul(iple)-Sl« 2; ovar, dteg 

132964; A1362575; Hs,303171; ESTs; pros; diag 

132967; AA3t6181; Hs.61635; six transmembrane epithelial a; pros, pros; mAlnCa 

132990; X77343; Hs.334334; transcription factor AP-2 aiph; breast hJng: CTL-»«ja 

132994; AA112748; Hs.279905; clone HQ0310 PRO0310p1; colon, breast; sm 

133006; AW978435; Hs.62515; K1AA0494 gene product; colon: diag 

133015; AJ002744; Hs.246315; UDP-N^etyMpha-l>galactosa; breast colon, pros; sm 

13301G: AI439688; Hs.6289: hypothetic^ protein FU20886: breast; diag 

133061; Ai186431; Hs.298638; prostate dHIerenliation (ado; angio. pros, btad; diag 

133063; A1654133; Hs.356247; thyroid receptor int^acQng p; pros: mAb^sm 

133070; U92649: Hs.380136; a disintegrin and metaHoprote; leulq diag 

133179; U81S99; Hs.66731; homeo box 613; pros; CTLtsja 

13319% AF23igai: HS.2S0175; homolog oT yeast long chdn po; breast angb; CTL^.m. 

1 33260; AA403045: Hs.6906; Homo sapiens cDNA: FU23197 fi; angio; diag 

133272; NM.002776; HS.69423; tcaOilcrein 10; colon, ovan diag 

133314; AA102670: Hs.70725; gamma-amlnobutyric add (GABA); breast pane; mAb 

133321; T79526; Hs.179516; Integral type I protein; breast; diag 

133391: AW103364; Hs J27; inhibin, beta A (acthrin A, ac; breast blad. lung; diag 

133415; X6g899; Hs.73149: paired box gene 8: ovar, GTL 

133477; AW502935; Hs.740; PTK2 prot^ tyrosine Idnase 2; breast; sm 

133579; X75346; Hs.75074; mitogen-activated protein Idna; pros; diag 

133626; AW836130; Hs.75277; hypotheticel protein FU13910; pros; diag 

133736; 049958; Ks.75B19; glycoprotein ftA6A; pros; mAb 

133829; AW630088; Hs,76550; Homo sapiens mRNA; cDNA DKFZp5; ovar. diag 

133860; S78296; Hs.76888; hypothetical protein MGC12702: blad; diag 

133944; AW068579; Hs.7780; Homo sapiens mRNA; cDNA DKFZpS; pros; diag 

133975; C18356; Hs^5944; tissue factor pathvray bhibHo; angio. pane; CTL^ag 

133976; A1908165; Ks.169946; GATA^^nding protein 3 (T-celt breast btad; mAb«m 

134100; AA460085; Hs.171075; repIicaOon factor C (acSvato; pros; diag 

134110: U41060; Hs.79136; UV-1 protein, estrogen regula; breast Had, ovar, pros; mAb 

134159; AI690916: Hs.178137; transducer of ERBB2. 1; breast; CTL+SJn. 

134219: NM.000402; Hs.B0206; gtuco3&6-phosphate dehydrogen; breast; 8.m. 

134319; BE304999; Hs.285754; fumarate hydratase; colon; sm 

134326; AW903838; Hs.81800: chondroitin sulfate protsoglyc; ovar. breast pane, tung; diag 

134348; AW291946; Hs.e2065; Interieuidn 6 signal transduce; breast; mAb*8.mL 

134374; N22687; Hs.8236; ESTs; pros; diag 

134390; R35528; Hs.8258; DKFZP43401 335 protein; pros; CTL^m 

134401; AI916662; Hs^11577; Wnecfin 1 (Idnesin receptor); pros, breast mAb+sjn. 

134405; AW067903; Hs.82772; collagen, type XI. alpha 1; breast lung. ovar. headnlc CTL 

134470; X54942: Hs.83758; COC28 protelnldnase 2: lung. blad. headnk: sjn. 

134520: BE091005: Ks.34^G; acfhreled RNA polymerase 11 tr, ovar, sm 

134529; AW411479; Hs.848; FK505*in<fing protein 4 (59kD); breast diag 

134570, U66615; Hs.172280; SWI/SNF related, matrix assod; EWS; CTl+sm 

134654; AK001741; Hs.8739; hypothetical protein FU10879; breast ; dag 

134666; 6E39192% Hsi7S2: transmeirtbrane pnM 4; breast; mAb^sm 

134691; AW382997; Hs.88474: prostagtandiiKemloperoxlde syn: ovar.sjn. 

134727; X80507; Hs.84520; yes-assodated protein 65 kOa; blad; diag 

134731; D89377; Hs.89404; msh (Drosophlle} homeo box hone blad; sm 

134786; T29618; Hs.89640; TEK tyrosine kinase, endotheH; angio; sm 

134B24; 878723; Ks.298623; 5-hydrtnytryptamIne (serotonbi; Had; mAb 

134856: BE281128; Hs.9030; TONDU: Wadt CTL^ 
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134868; ABQ20889; Hs.90419( K1AA0882 protein; breast: (flag 

134924; BE294029; Hs.2799t^ Ras hcmotog eniiched b brain ; breast m/U) 

134972; AL033527; Hs,169252; VHTiyc avian myelocytomatosts v; ovar; CTL4Sjn, 

134975: R50333; H3.921 86; Leman coileckni) protein: breast; dlag 

134989; AW968058; Hs.92381 ; nudbt (nudeoslde (fipho^hate ; cokm; (Rag 

135073; W55956; Hs.94030; Homo sapiens mRNA; cONA DKFZpS; anglo; <fiaQ 

1351 17; W52493: Hs.94694; Homo sapiens cONA FU10561 lis; bmast diag 

135166; AA135867; Hs.280858; ESTs. Highly similar to A35661; pros; dlag 

135235; AW298244; Hs^ISS; ESTs; angtoc (Hag 

135242: AI583187; Hs.9700; cydln El; ovar, CJl^ 

135243; BE463721; Hs.97101: putaGva G proteiiv«oup)ed rec; colon: mAlHS.fa 

135255; Y13645; HsJ7234; uropl^ 2; Uad; mAb 

135309; AI564123: Hs^55689; A0P<ffbosyla8on factor-Ska 5; pros; dlag 

135315; K81 136; Hs.334604; Homo sapiens mRNA for K1AA1870; pros: diag 

135389; U05237: Hs.99872; fetal Alzheimer antigen; pros, breast, coloiv CTL-»sjn. 

1354(N^ X7BS92; HsMIS; androgen receptor (dittydrotest; pros; mAlHSjn. 

300254; AW1 83618; H5.55610; solute canter faniily 30 (zinc; pros, breast; mAb^m 

300256; AW591433; H5.298241: Transmembrane protease, serine; breast, colon, lung, ovar; inAb«diag«sja 

300318; AW444502: Hs.256982: ESTs. Wealcly similar to NEL1J1: anglo; CTL«<neQ 

300605; AI218847; Hs.152670; ESTs; pros; 

300921; AF146747; H5.232165; pdyi^emia rubra vera 1; eel; pros; mAb^em 

300923; AW136372; Hs.185?, add phosphatase, prostate; pros; sm 

301042; A]659131; Hs.366053: hypothetical protein MGC2849; pros; mAb 

301043; Ai160316; Hs.149155; voltag&<ie pendent anion channe; pros;mAb^.m. 

301050; AW136973; Hs.36291S; ESTs. Weakly similar to S69890; coton. tung: CTL^nv 

301341; AA887801; Hs.208229; G protebvcoupled receptor; breast, lung; niAb*sja 

302001; AB020711; Hs.374965; KIAA0904 protein: breast ; CTl^m 

302005; BE2S2922; Hs.1231 1 9; MAD (mothers against decapenta; pros; dtag 

302067; BE542706: Hs.222399: CEGP1 protein: breast diag 

302167; NM_006227: Hs^OO?; phosphoTipid transfer protein: pros; mAb 

302225; NM-007231; Hs.16221 1; solute earner family 6 (neuro; paic; mAb+s.m. 

302276; AW057736; Hs.323910; HER2 receptor tyrosine kinase ; breast mAb«Sjn. 

302290; AA179949; Hs.17S563; Homo safto mRNA; cDNA DKFZpS; pros, breast diag 

302372; All 17406; Hs^5891; ATP-bindIng cassette transport breast, pros ; mAlHsm 

302384; Ai678059; Hs^02676; synaptonemal comptex protein 2; breast, oeiv; diag 

302410; Nh/L004917; Hs.218366; kaOikrein 4 (prostasa. enamet pros; dlag 

302468; AL133561; Hs^155; DKFZP434B061 protein; pros; diag 

302562; BE14976^ Hs.48956; gap Junction protein, beta 6 (; lung, blad; mAb 

302881; AA508353; Hs.105314; relaxin 1 (HI); pros; diag 

303295; AA205625: Hs.208067: ESTs; blad; (Sag 

303380; AW962764: H5.303171; oHactory receptor, family 51.; pros; mAb 

303506; AA340605: Hs.105887; ESTs. Weakly similar to Homoto; pros, breast, coton ; dlag 

303699; BE143707; Ks.19525; hypotheOcal protein FU22794; pros; diag 

303753; AW503733: Hs.9414; KIAA1488 protein; pros, breast, coton; GTL^ 

305503; AA759177; H5^8148; ESTs. Weakly similar to KIAA05: pros; diag 

306273; AA936290: ; gb:on70aOU1 Soares_NFL.T_G8C: pros; diag 

306676; AI00S603; ; gbnvlSclO^I NQ J:GAPJ3C3 Horn; lung; di^ 

306840; At077477; Hs.307912: ESTs; an^; diag 

309177; AI951 118; Hs.326738; Homo sapiens breast cancer ant breast pros; mAb«CTL 

309583; AW1 70035; Hs.326736; Homo sapiens breast cancer ant breast mAb«Ca 

309931; AW341683; Ks.343663: 0b:hd13d01.x1 Soares^NFUT.GBC; hing; mAb 

310382; AI734009; Hs.127699; K1AA1603 proteto; pros; diag 

310431; AI420227; Hs.366053; ESTs, WeaMy similar to A46010; proa; dtog 

310573; AW292180; Hs.156142; ESTs; pros; diag 

310636; AI814373; Hs.184175; ESTs; lung; (fag 

310781; AI380797; Hs.158992: ESTs; breast ; diag 

310955; Ai476732; Hs.263912: ESTs; breast angto; di^ 

311034; BE567130: Hs,311389: ESTs, Highly similar to NKGO.H; lung; mAt^^m 

311166; AI821005: Hs.1 18599; intron of. BFF9 (GDNFRa).; breast ; diag 

311251; AI65566^ Hs.197698; ESTs; pros; dlag 

311557; AF200492; Hs.211238: lnterteukln-1 homotog 1; lung; d\SQ 

31 1596; AI662086: Hs.79375; stngto-rdnded (Drosophiia) hom; pros; CTL 

31 1630; AI915444: Hs.372037; ESTs; coton. Wad. lung, ovar, pane, headnk; mAb^ag*sja 

311877; AA084248; Hs.37265l; G protein^oupled receptor 39; ovar, anglo. gilo; mAb^.m. 

31 1911; R19175; Ks.169793; ribosomal protein 132; pros; dlag 

311928; T62216; Hs378028: ESTs; pros; (Sag 

312182; T94344; Hs.326263: ESTs; pros; (Sag 

312252; AI128388; Hs.143655; ESTs; blad; dlag 

312319; AA906997; Hs.180780; TERA protdn; cdon; CTL^ 

312521; AI263307: Hs.356g01; H2B hlstone family, maniber U ptoa. breast tung; diag 

312544; AA516420; Hs.35234Q; ESTs, Weakly sbnllar to 138022; breast dlag 

312742; A1550363; Hs.116462: ESTs; coton; diag 

312795; AW975014; H3.25; ferrochelatase (protoporphyria; pros; sjn. 

312857; BE083866: Hs.126914; KIAA1430 protein; coton. pros; CTL^sm 

312922; AA329256; H$.378739; ESTs, Moilerataly sbnOarto at pros;dtog 

313328; AW449211: Hs.105445; GDNF family receptor alpha 1; breast; mAtHSjn. 

313513; AW288800; Ks.64313: ESTs. Weakly similar to S59501: angto; mAI>^. 

313556; AA628517; Ks.118502; ESTs; an^ dlag 

313665; AW751201: Hs.120932; ESTs; angto; dl^ 

313774; AI916D58; Hs.144583; ESTs; coton; CR 

313915; C18883;Hs.163443; Intron of pertosUn (OSF^); breast (Sag 

313978^ AI870175; Hs.13957; ESTs; angto; (Sag 

314078; AW129357: HS.32970Q; ESTa; breast dtog 
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314097: AA648744: K5.289493: ECTs; breast: (fia 

314121: AI732083: Ks.187619; ESTs; pros, breast :dln 

314171: AIS21B9S: K3.193481; ESTs; pios; di^ 
- 31 4506: AA833655: Hs.206868: Homo sapiens cONA FUt405S 6s; breast: dte 
J 314547: AA399272: Hs.144341: ESTs: breast ; din 

314558: AI873274: Hs.370280: ESTs; breast, pros; diag 

314589; AK001432: Hs.153408: Homo sapiens cONA FU1057D lis; lung. Uad:dlao 

314691: AW207206; Hs.356962; ESTs: breast pros; (flag 
. ^ 314765: A)536226; Hs.32976; guanine nucleotide binding pto; cokm. pros; diag 
lU 314907: AW971082:Hs.222888: ESTs. WealdyslmDartoTRHY Hipios;^ 

?1SS? "'•298241; Transmembrane protease, serine; breast ooloa hmg, ovan mAtHdlag4«.ia 

315033: AI493046: Hs,146133: ESTs; colon; diag — « 

315051; AW292425; Hs.163484; ESTs; breast pros. Uad; di^ 

315052:AA876910:Hs.134427;ESTs;pios.bre85(;diag 
1 0 315196; AI3B7347; Hs.44898; Homo sapiens done TCXXTA00151: breast; diag 

315368; AB037745: Hs.104696: KIAA1324 protein: pros; dim 

315408; AW273261: Hs.216292: ESTs; pros; diag 

315634; AA837085; Hs,372254; ESTs; breast pros; diag 
. ^ 315720; AA292996; H5.163900: ESTs; biad; dleq 
ZU 316177; AI904982; Hs.293102; downstream of breast cancer an; breast: mAb+CTL 

316442; AA760894; Hs.1253a}; ESTs; pros; diag 

316580; AA938198; Hs.146123; poly(A) pdymerase gamma; breast angiocsm 

316886; AA836331: Hs,170261; ESTs; breast ; diag 

316943; AW014875: Hs.137007: ESTs; Wad; (fiag 
ZD 317079; 8E159g84; Hs.125395; ESTs; biad; mAb*s.m. 

317140; AA885430: Hs.201925: Homo sapiens cONA FU13446 fis; breast; diag 

31 7224; X73608; Hs.93029; sparcA)steonectin, cwcv and te; pras, angio: diao 

317548: BE568568: Hs.159066; ESTs; pros; CTL-^sm 

317803; AW664964; Hs.128899; ESTs; breast lung. ovar. pros; mAt>^.m 
JU 317881: Ai827248: Hs.224398: Homo sapiens cDNA RJ11459 fis; breast lung ; diag 

318240; A1085377;H3.143610; ESTs: lung; diag 

318524; AKOOIOSO; Hs.159066: tiypotiietlcal protein aJ10188; pros, cokm; CTL^jn. 

318532; AW139377: Hs.127179: crypUc gene; pane: dim 
^ - 318744; A1793124; Hs.144479: ESTs; breast; diag 
So 318754; W21423:Hs.44222;CGl-90 protein; pros; diag 

319080; AW967646; Hs.23023; ESTs; pros; diag 

319795; AB037821: Hs.146858; protxadfierin 10; pros, glioi mAtnsm 

320066; BE305242; Hs,16098; daudin 2; colon, pane; diag 

320167; AA984373; Hs.9D790; Homo sapiens cDNA: FU22930 fi; breast pros; diag 
4U 320203; AL049227; Hs.124776; downstream of cadlierin 6 (by 3; renat ovan mAb^ 

32021 1; AL039402; Hs.125763; 0EIM&6 protein: breast pros; CTL 

320322; AF077374; Hs.139322: small prolln»^ protein 3; lung; diag 

320324; AF071202; H5.139336; ATP^inding cassette, sub-femi; pros: mAb 

320561; AF085808; Hs.159330; uioplakin 3; pros, biad; diag 

320590; U67058; Hs.154299; Human proteinase acfivated rec; pros; mAtHsm 

320635; N50617; H8.80506: small nudear ribonucleoprotei; angto lung; diag 

320736; AA31S361; HS.170195; bone morphogenetic protelt) 7 (; ovar; mAb^m 

320796; AK001541; Hs.31218; secretory carrier membrane pro; pros, colon : diag 

320896; BE019924; Hs.271580: uroptakin IB; tung. biad. ovar. Iteadnk; mAlnSi 
5U 321023; AW294316:Hs.l25608; ESTs; colon; diag 

321107; AI732643: H8.144151; downstream of breast cancer an; breast; mAb^TTL 

321412: AI6743&3; Hs.22891: solute canlerfamily 7 (caQo; pros; mAb4s,m. 

321441; AF107493: Ks.201 675; Homo sapiens LUCA-15 pralein im pros, breast : diag 

321644; AW975944:HsJ37396: ESTs; breast pros; diag 
JD 321717; AW956580; Hs.42699: ESTs; angio; diag 

321906; AW270608; H5.170195: bone morphogenetic protein 7 (; ovar; mAb^ 

321911;AF026944;Hs.293797;ESTs;Bngio>i,Mad;dlag 

322035; At137517; Ks.306201; hypolheBcal protein DKI7p5640; breast b)a± mAb 
^ . 32^21; AF147347; ; gb:Homo sapiens fuU length in; breast : diag 
OU 322706: AA016899: Hs.127179: cryptic gene; pane; diag 

322762; AA056060: Hs.202577; Homo sapiens cDNA FU 12166 fis; pros; diag 

322818; AW043782; Hs.293616; ESTs; pros, breast, angio, gHo ; diag 

322882; AW246508; H8.279727; Homo sapiens cONA FU14035 lis: breast lung, ovar. ando. Uad; di» 

322975;C16391;;intronofbreastcanoeranfiga;breast:m«HCrL 
00 323168:AL120662;H$.124165;programmedcelldealli9(PDCO9;breast:di8B 

323226: AF055019: Hs.355279: Homo sapiens done 24870 mlW ; pros; diag 

323262; AL133990: Hs.190842; CEGP1 protein; breast pros, biad; diag 

323287; AA639902; Hs.104215: ESTs. Moderatdy dmDark) SP; pros; diag 

323332; AI829520: ; gbwl1fic06j(1 NCI.C6AP_U(1 Horn; breast ; diag 
/U 323335; AI655499:Hs.161712; ESTs; pros, bmast mAb 

323587; AI299709: Hs.131886: Homo sapiens cDNA: FU22113 fi; colon; diag 

323817; AA410943: ; NAME 0IM1TTED ... receptor Idnase; breast ; mAb 

324261; BE069341; ; gb:QV34T0381.2701 0007^8 B; breast: diag 
- 324295: AA434579; H8.143691: ESTs; pros; diag 
/ J 324338: AA927668; Hs.145078; regulator of differentiation C colon; dl^ 

324430; AA464018; Hs.335798: Homo sapiens cONA: FU23241 fi; pros, cdnv diag 

324432; AA464510; Hs.152812: ESTs; breast lung, pane dl^ 

324603; AW993S22; Hs^B67; ESTs; pros, breast di^ 
- 324617: AA508552; H8.222874; ESTs, Weaidy simflar to 138022; pros; diag 
OU 324658; A1694767; Hs.129179; Homo sapiens cDNA FU13581 fb; pros ; dia 

324718; AI557019: Hs.1 16467; smafl nudear protdn PRAC; cdon. pros; diag 

324866; AI541 214; H5.46320; Smafi pro&ne-ridi protein SPR; lung, biad; diag 

324871; AI690347; Hs^1923; Homo sapiens cOHA: FU22;85 fi; colon; diag 
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324987; AI375572: Hs.172634; ESTs; breastdlag 
32537% : ; Phase 2& 3 Exons; breast; cn.nM 
325544; ; ; Phase 2 & 3 Exons; breast angio; &g 
3Z7036; ; ; Phase 2 & 3 Exons; lung, angio; dag 
D 3Z7075; ; ; Phase 2 & 3 Exons; breast. Ibng; iSag 
327414; ; ; Phase 2 & 3 Exons; angb; dbg 
328700; ; ; Phase 2 & 3 Exons; breast, angio; diag 
330211; ; ; Phase 2 & 3 Exons; pres; CTL^sm 

330488; L10343: Hs.112341; protease mikf 3; sktn^ (ung. cobn. blarf; diag 
lU 330493: M27826;Hs.334372; endogenous retroviral protease; lung, colon; am 

330830: NM.00290% H3.79088: reficukx;atb{n 2; EF-hand calc; pros; diag 

330762: AW40733% H3.13014; ADP^ftosylaSon factor GTPase; pros; at^ro. 

330790; AI660243; Hs.318545; KuOl Chip Redos; pros, blad; dag 
, . 330814;AI955040;Hs.265398;PAR-6beta (parfitiontngdef; breast; diag 
15 330827; A!961486;Hs.249196;ESTs; lung, uten diag 

330892; AF109298; Hs.1 18258; prostate cancer sssodsted pro; pits; disQ 

331014; AW770994; Hs.30340; hypothetical protein KiM1 165; colon: diag 

331151; R82331; Hs.121602; ESTs; pros, breast; diag 

331183; T40769; Hs.B469: ESTs; coton; di^ 
ZU 331237; WB7874; H5JZ5277; Homo sapiens cDNA FU10717 fis; angio; diag 

331490; AF216751; Hs.26813; C0A14; pros; dag 

331578; A!246482; Hs.243010; ESTs; angio: cRag 

331614; N92293; Hs.205832; EST; Moderately sinriOar to ALU; breast; dag 

33181 1; AW885727; Hs.9914; HuOl Chip Redos; lung; diag 
/D 331889; AA677577;Hs.380213: Homo sapiens Chromosome 16 8AC; pros; dag 

331969: AA526gi 1; Hs.82772; coOagen; type XI; alpha 1; breast. lung; CTL 

332180; AF134160; Hs.7327; claudn 1; tung; mAb 

332247; AA669Q97; ; ESTs; pros, breast ; d^ 

332396; AW579842; Hs.380730: hypoiheOcal protein FU10697; pros; dag 
3U 332453; L42583; K5.334309; HuOl Chip Redos; lung; diag 

332466; AB018259; Hs.1 1814Q;-KIAA0718 gene product; angio; d^ 
. 332530; M31669; H&1735; inhibbi; beta B (adivln AB be; ovar, pros; dag 

332535; AF167706; Hs.19260; cysteine^ repeatcontainin; angio; dag 

332640; BE568452; H3.344037; protein regulator of cytoldnes; blad. headnk; diag 
3 5 332645; AA284371; Hs.118064; ESTs; breast, colon; dag 

332686; X69699S Hs.73149; paired box gene 6; ovar; CTL-»sm 

332697; X51405; HS.7S360; carboxypeptMase E; pros; di^ 

332740; BE409869; Hs.286241; Homo sapens cONA: nJ22698 fl; pros; dag 

332798: ; ; C22000007:gi|1231419S|emb|CAB9; pros, breast ; dag 
4U 333769; ; ; NM.005940'iionno sapiens matrix; breast colon, lung; mAMag^. 

333904; ; ; Chromosome 22; pros; diag 

334223; ; ; NMj005060*:Homo sapiens X-box ; pros, breast dag 

334447; ; ; NM-012429*Jiomo sapiens SEC14 ; pros; dag 
, 3351 15; ; ; NM.006498*itomo sapiens lecbn; pros; Cn^m. 
HO 335809; ; ; NM_014509*:Homo sapiens laaten; breast CTL^ 

335624; ; ; ENSP00000249072^IDJ222E13.1 (N; breast pros; CTL^s-m. 

335825; ; ; ENSP00000249072MXJ22ffi13.1 {N; breast dag 

335936; ; ; Chromosome 22; lung; diag 
- - 336034; ; ; NM.0071 72:Homo sapiens nucleop; breast, angto; CTL'»8.m. 
3U 336152; ; ; NM.014246:Homo sapiens cadieri; breast ; mAb 

336636; ; ; C22000024*:gi|10645308^IAAG2; lung, breast CTL-^sjn. 

338008; ; ; NM.005940*:Homo sapiens matrix; lung, breast colon; mAb^dag-^&m. 

338033; ; ; Chromosome 22; lung, angio; dag 

336158; ; : NM-012399*:Homo sapiens phosph; lung, dag 
J D 338255; ; ; NM.014323*iiomo sapiens zinc f; pros, breast, colon ; CTL^jn. 

400195; ; Hs.42650: Nh/L007057*:Homo sajte ZW10 1; lung; CTL-tsjn. 

400269; ; Hs.253495: Eos ContrOt fibre; dag 

400285; ; ; Eos Control; lung; dag 

400287; S39329; Hs.181350: KaHikrefn 2. prostatic; pros; dag 
OO 400288; X06256; Hs.149609: integrin. alpha 5 (fibionectin; pane, pros.anglo, bbd. hing; mAb^ra 

400290; H18836; Hs.31608; hypoJhefical protein FU20041; pros, cdon. EWS; mAb 

400294; N95796; Hs^8695; Homo sapiens prostain mRNA, co; pros, pros; mAb 

400295; W72838; Hs^19: AI90S687dL^09&'19019M19 B; breast dsg 
^ - 400298; AA032279; H5.61 635; dx transmembrane epithelial a; pane, tung, headnk. stem. EWS, ovan mAb*CTL 
OD 400328; XB7344; ; transporter 2, ATP-binding cas; lung; mAlHua 

400409; AF153341; ; Homo sapiens winged hsibcM; Uad; CTL-«9Jn. 

400419; AF08454S; ; Target lung, sarc; dag 

400440; XB3957; Hs33870; nebulin; saro; diag 
_ 400494; ;;ENSP00000238970*:aG30(Frasme; angio; mAb 
/ U 400517; ; ; lengsin; stom, oerv. uter. lung, pros, colon, hc^ diag 

400651; ; ; ENSP0000022803r:COPPER CHAFER; saro; sjn. 

400865; ; ; NM.002425:Homo sapiens matrix ; lung; mAb«di^jn. 

400773; ; ; NM>00310S*:Homo sapiens scitit blad; mAb 
_ - 400844; ; ; NM.003105*:Homo sapiens sortll; Uad; s.m. 
/D 400846; ; ; sortiraHelated receptor, UD; blad;(nAb«6.m. 

400881; ; ; NM.025080:Homo sapiens hypothe; ovan dag 

401093; ; ; C12000586':gi|63301671dbpAA8; blad, lung; CTL^sm 

401234; ; ; mitogen^cUvated protein klna; angio; dag 

401424; ;;NM.001172:Homo6apiensarglnas; pros; sm 
OU 401486; ; ; C400064r:gl)4756508trafINP.OO; headnk; mAb 

401704; ; ; NA4.021195*:Honu> sapiens daud; (est mAb 

401732: : ; NM.001176*:Homo sapiens RhoGD; pane; dag 

401747; ; : Homo sapiens VanSn 17 (KRT17: Uad. bing. headnk. mel^ diQ 
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401760: ; : Target Exon; btad, hmg. headnk, esoph; diag 

401760; ; ; NM.005557^Homo sapiens teiaS; lung, btad, headnk. esoph. mela; diag 

401781; ; ; Target Exon; lung, blad. headnk, esoph, cenr; diag 

401765; ; ; NM.002275':Homo sapiens keiaS; tung; (Sag 

401797; ; ; Target Exon; sarc; diag 

401994; ; ; Target Exon; lung; diag 

402145; ; ; Target Exon; test CTUi6.m. 

402199; ; ; Target Exon; lest CTL^wn. 

402230; ; ; Fgenesh predicted: CYTOCHROME ; btad; diag 

402239; ; ; Target Exon; btad; diag 

402260; ; ; NM_001436^Homo s^ens fibril: btad; CTL*sm 

402265; ; ; Target Exon; lung; diag 

402305: : ; C190a073S*:g!)4508027treIINP.O: blad; CTL-^ 

402420; ; ; C1000823*:gil1043240QIenib)CAC1; tung; diag 

402424; ; ; NM_024901:Homo s^ns hypothe; Wad; CTL-^m 

402447; ; ; 01 000201 :gi|204416(gblAAA02627: esoph; mAb 

402474; ; ; NN1.0D4079:Homo sapiens catheps; tung. coton. stom. tibro; (Sag 

4025S0: ; : Target Exon; fibre; diag 

402604; ;; Target Exon; gHo; diag 

402605; ; ; Target Exon; gBo; diag 

402606; ; ; Nh4.024626:Homo sapiens liypothe; orar, breast: mAb 

402680; ;; Target Exon; test mAb 

402777: : : C100265rtgi)5443Z71sptQ04799): blad; diag 

402860; ; ; ENSP00000239210AI50O24.4 (nov; meia; CTL^sJiv 

402888; ;; Target Exon; sarc; diag 

402992; ; ; Target Exon; sarc; diag 

402994; ; ; NM.002463*:Homo sapiens myxovi; esoph; diag 

403046; ; ; NM_005656*:Homo sapiens transm; pros; mAb 

403047; ; ; NM.005656*:Homo salens transm; pros, blad, colon; mAb 

403071: ; ; NM.003319*:Homo sapiens IKin ; sarc; diag 

403088; ; ; NM.003319*:Homo sapiens titin ; sarc; diag 

403171; ; ; C2001472*:gi|5809678)gbIAAB418; test <flag 

403326; ; ; Target Exon; mela; diag 

403329; ; ; unnamed protein product [Homo ; lung; diag 

403381; ; ; ENSP00000231844*:EcotropIc vb; blad; CTL^.nt 

403409; ; ; NM„005929:Homo sapiens anOgen; mela; mAb 

403433; ; ; NM_001622:Homo sapiens alpha-2: hepC; diag 

403478; ; ; Nlil.02234ZHon>o sapiens Mnestn; lung; Ca-««jn. 

403715; ; ; Target Exon; lung; diag 

403740; ; ; NMLOOIOTS'itomo sapiens UDP gt pros. hepC; sjn. 

403776: ! : ENSP0000022654r:SmaI] indudb; pane; (fiag 

403903; ; ; C5001632':gqi0645308|gb|AAG21; Wad; CTL-»s.m. 

404029; ; ; NM.016936*:Homo sapiens protoc; gOo; mAb 

404049; ; ; NM.018937*:Homo sapiens protoc; gBo; mAb 

40421 0: : ; NM.005936M)mo sapiens myelotd; pane, utsr, oerv. lung, ovar, pros, cdton, stom; diag 

404240; ; ; NM.018950:Homo sapiens major h; fibre: mAb 

404253; ; ; Nh^021058*:Homo sapiens H2B ht lung; CTL-»s.m. 

404286; ; ; G8001909:gi|704441]dbpAA1890; pane; diag 

404298; ; ; C6001238*:gi|121715isplP26697|; lung; sm 

404403; ; : Targ^ Exon; Wad; diao 

404440; ; ; NM.021048:Homo sapiens mfilanomi fang, Uad; mAb*€TL 

404866; ; ; ENSP000002511 12*:Sod{um/po(8ss; pane; sm 

404877; ; ; NM-005365:Komo sapiens metanom; king, Uad; CTL-tejn 

404927; ; ; Target Exon; lung, headnk; diag 

404996; ; ; Target Exon; kmg, headnk, esoph; diag 

40S001; ; ; intadeukin enhancer binding f, saro; diag 

405025; ; ; Homo sapiens bone morphogenefl; angio; diag 

405121 ;; : mltogen^alsd protein tdna; anglo, renat 8 m 

405238; ; : Target Exon; gito; diag 

405239; ; ; oxMaSve 3 alpha hydroxystero; gOo; sjn. 

405451; ; : Homo sapiens ghjtaminyii)epGd; n^la; s.m. 

405545; ; ; Target Exon; cerv; mAb 

405546; ; ; NM.016833*:Homo sapiens transp; cenr. mAb 

405547; ; ; NMj018833*:Homo s^ns transp; cenr, mela; mAb 

405646; ; ; C12000200:g]K557225irenNP_00; lung; diag 

405704; ; ; NI^001844*:Komo sapiens collag; saic; di^ 

405770; ; ; NM_002362:Homo saj^ns melanom; lung, esoph; mAb^CTl 

405849; ; ; Target Exon; pane; diag 

405932; ; ; C15000305:^l38061221gb|AAO691; blad. hmg, headnk. cerv; CTl^ 

40^81 ; ; ; Target Exon; btad; di^ 

406137; ; ; NM_000179":Homo sapiens mutS (; lung; CTL'»s.m. 

406173; ;; ENSP0000Q250148*:6rowth hormon; pane; CTL^sm 

406348; ; ; Target Exon; breast CTL*s.m. 

406360; ; ; Target Exon; hmg, headnk; diag 

406399; ; ; NI^003122*:Homo sapiens serine; btad; diag 

406434; ; ; NM.030579*:Homo aliens cytoch; blad; diag 

406467; ; ; Target Exon; bng, headnk. Uad; diag 

406506: ;; Target Exon; angio; diag 

406547; ; ; Target Exon; test diag 

406627; T64904; H3,163780; ESTs; angio; CTL-»sjn. 

406671; AAI29S47; Hs.2B5754; met prokH)ncooene (hepatocyte; pane; mAb 

406672; M28041: Hs.198253: malorUstDCOiiipaEbllftyGompt fibre: 

406685; M18728: ; gbituinan nonspedGc cressreae; pane, cohm, blad, haadnk, sto^ 
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408687; M31 126; HS.3S20S4; matrix metaOoprottoe 1 1 (s; breast, tung, ovar, oerv. uter, patiG, esoph. meb. sac; mAlndiaB^Am 
406690; M2S540; HS.220S29; cardnoembiyonte anfigeiHelat: tung. headnk. pane stom, Uad, colon, oenr; mAb*Ca 
406706; X03740; Ks^lSBI; myosin, heavy polypeptide 1, s; sarc: diag 
406850: A1624^0; K5.172928; coDagen, type I. alpha 1; sarc; GTL^m. 
5 406906; Z25424; ; gtjiRsaplens proteliKerineAh; fatad, lung: sjtv 
408937; U14622^ ; gt^Human transketolas&fka pn test; am 
406967; M24349: ; gt>:Human parathyroid tu3fTTnn&4; tung: CTL^ ja 
406974; M57293; ; glxHuman paFBtltyroid lionnone-r; lung, blad; diag 
407013; U35637; Hs.83870; gtcHuman nelwSn mRNA. paHal; sarc; diag 
lU 407034; U84540;:gt>:Humandysliol)fevfnisofonTi;glto; diag 

407103; M424881; Hs^6301; tiypothetica) protein MGC13170; pros; diag 
4071 18; AA156790: H5i62036; ESTs. Weekly simHar lo Z223.H; pros; di^ 
407122; H20Z76; Hs.31742; ESTs; pros; diag 

407137; T97307; ; gt).7e53M)6^1 Soares fetal Dv; tung. blad, ovar. pros, pane, lieadnk; diag 
15 407168; R45175; Hs.117163; ESTs: pros, breast colon; diag 

4071 78: AA195651; (^.352312; AP'2 beta transcription factor, breast ; CTL^ 
407202; N58172; Hs.109370; ESTs; pros; diag 
407216; N91773; Hs^8385;ty^oxidasa: pane; diag 

407242; M18728;;gb:Human nonspecific crossreac; pane, colon, blad, lieadnKstorn,^^^ 
20 407244; M10014; ; fibrinogen, gamma polypepSde; lung; diag • 

407245; X9056e; Hs.172D04; liiln; sarc; dl^ 

407251; U6761 1; Mm.29182: Iransaldolase 1; pros; sm 

407252; AA659037: Hs.163780; ESTs; angio; CTL-ism 
^ _ 407276; AI95111B: Hs.326736: Homo sapiens breast cancer ant; breast; mAb«CTL 
25 407289; AA135159: Hs.203349: Homo sapiens cDNA FU12149 lis; kmg; diag 

407366; AF026942; Hs.17518; gb.Homo sapiens clg33 mRNA, pa; ovar. hepC s1or\, mala, esoph; diag 

407581; R48402; HS.173S08; P3ECSL; blad; CTL^.m. 

407601; AC002300; lis^129; sodium channel, nonvottage^t; blad; mAb 
^ _ 407619; AUD50341; Ks.37165; collagen, type IX, alpha 2; sarc; diag 
30 407634; AW016569; Hs.136414: UDP-GcNAcbetaGa] beta-1,3-N-; hing, headnk; am 

407710; AW022727; Hs.23616; ESTs; test; diag 

407720; AB037776; Hs.38002; KIAA1355 protein; hing; mAb 

407746; AK001962: Hs.38114; hypothetical protein FU11100: hjng; diag 

407756; AA116021; Hs.38260; ut^uitin specific protease 18; pane. lung, esoph, fibro, mala; CTL^sm 
35 407758; D50915; Hs.38365; K1AA012S gene product lung; diag 

407777; AA161071; Hs.71465; squalene epoiddase; pane, esoph; sm 

407782; AA6089S6; Hs.1 12619; ESTs. Moderately similar to PU; hing; diag 

407786; AA687538; Hs.38972; tetraspan 1; pros, colon, uter, stem, ovar, cerv; mAb 
. ^ 407788; BE514982; Hs.38991; S100 caldum-blnding protein A; headnk. pane. blad. lung, fibro; diag 
40 407818; Ali)21938; Hs.40154; Jumoi^l (mouse) homolog; test; CTl'^Sjn. 

407824; AA147884; Ks.9812; Homo sapiens cDNA FU14388 fis; saic; diag 

407839; AA045144; H5.161566; ESTs: Mad, headnk; mAb 

407846; AA426202; Hs.40403; Cbp/p30{Hnteracling transacO; mela; diag 
. - 407653; AA336797; Hs.40499; dlckkopf (Xenopus laevis) homo; colon, atom, renal, breast ovar, uter, ceir. diag 
45 407656; AA045281; Ks.266175; phosphoprot^ assocbted with; mela; diag 

407872; AB039723; Hs.40735; frizzled (Drosophila) homolog ; ovan mAb 

407881; AW072003; Hs.40968; heparan sulfate (glucosamine} ; pane; sjn. 

407910; AA650274; H5.41296; fibronecfin leucine rich trans; fibro; mAb 

407944; R34006:Hs.239727;desmocolIln 2; hmg, headnk, esoph; mAb 
50 407949; W21874; H8J47057: ESTa. Vlteakly similar to 210926; fibro, Uad; diag 

407962; AI133530; Hs.62930: ESTs. Weakly slmRar to S59501; angio; mAlHsm 

408000; 111690; Hs,198889; bUBous pemphigohJ anOgen 1 0 breast pos, blad, hing. headnk, ceiv. esoph; mAt>*CTL 

408015; AW136771; Ks.244349; epidemial differentiation oomp; mela. sarc; dbQ 

408045; AW1 38959; Hs.245123; ESTs; breast ; diag 
55 408056; AA312329; Hs.42331; ephrin^ ovar. diag 

408063; BE086548; Ks.381047; calcinauiin^ding protein ca; pros, lung; diag 

408081; AW451597; Hs.167409: Intran of bask^hefix-loop^el; ovar. gtio; diag 

408101; AW988504; H3.278346; CDC2-{elated protein kinase 7; breast lung, stom; 8.m. 

408122; AI432652; Hs.42824; hypotheticalprotein 1^10718; lung; diag 
OO 408209; NML0044S4; H3.43697; ets variant gene 5 (ets-retate; mela; ca^m 

408296; AL117452; H5.44155: OKI7P586G1S17 protein; angio; diag 

408308; AL033377: Hs.44197; hypothetical protein OKFZp564D; pane, renal, cohm; mAb 

408353; BE439838; Ks.44298: mitochondrial ribosomat protei; hing; diag 
. . 40843a. S79876: Ks.44926; dipepfidytpeptidase IV (CD26, ; pros; mAb 
05 408522; AI541214; Hs.46320: SmaO proQne^ protein SPR; hmg. Uad. headnk, eosph. cenr. diag 

408561; AI308037: KS.B4120; hypothefieal protein MGa3016; mela: CTL^em 

408570; AL046406; Hs.103483; KIAA1798 protein; angto; CTL^sm 

408572; AA05561 1; KSJZ26568; ESTs, Moderately simSar to AU hmg; dteg 
_^ 408591; AF015224; K3.46452: mammagtotnn 1 ; breast cerr, diag 
70 408611; NM.004367; Ks.46468; chemoUne (OC motif) receptor, mela; mAb 

408633; AW963372: K3.222088: PRO2000 protein; blad, lung, headnk, pros; diag 

408660; AAS25775; Hs.89040; ESTs, Moderately simBar to PC; ovar, pane, pros, esoph. sarc; diag 

408728; AL137379; Hs.47125; hypothe{h:al proton FU13912; test; diag 
_ _ 408758; NM_003ra6; Hs.47504; exonuctease 1: mela; CTL-^sjn. 
75 408770; AW270608; Hs.170195: bona morphogeneOc protein 7 (; ovar; mAt^diag 

408771; AW732573; H3.47584: potasshim vottege-gated channe; lung; mAb 

408780: D31797; Hs.652; tumor necrosis factor (Ogand); teuk; diag 

408795; AW749126; Hs.170345; hypoth^ protein FU 13710; ovar. diag 

408826; AF21 6077; Ks.48376; Homo sa^^ens done HB-2 mRNA s; pane, pros; diag 
80 408833; AW612232;Hs.254835; ESTs; pros; diag 

406877; AA479033: Hs.130315; ESTs, Weddy slnflar to A475a2; breast; di^ 

408S15; NKit0166S1; HS.489S0: heptacethdar carcinoma novel ; pane, sarc; diag 

408930c AA146721; Hs.334686; hypotheScal pretebi aJ21S88; blad; Ol^m, 
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408982: BE385436: Hs.44317: SRY (sex detenmWng region Y); mefa; (Dag 

40899^ AA059325; Hs.301 1 4; guarine nudeolido binding pfo; lung; dlag 

408996; AI979168; Hs.82226; filycoproteln (transmembrane) n; mela; mAtnsja 

40901^ AL1 17435; Hs.49725; DKFZP434I216 protein; sarc; CTL-^sjil 

409038; T97490: Ks^WZ; small inducible cytokbte sub^ mela; diag 

409051; AA080912; ; gb3n04d0ar1 Strat^ena hNT m pros; sm 

409077; AA063037; Hs.65803; ESTs; fong; diag 

409093( BE243834; Hs.50441; 0GM)4 pmtein; lung; dlag 

409123; AA063403; ; gb3m04d12^1 Stratagene come; pros; sja 

409142; AL136877; Hs.50758; SMC4 (sbudural maintenance o; over, lung, mela; disg 

409153; W03754; Hs.5081 3; hypolhe&al pnM RJ20Q22: fibio; dteg 

409200; AUM2914: Hs.51039: KIAA007S gene product; sarc: CR^&m. 

409203; AA760473; Hs.687; cytochmme P450. subfamily IVB; fibm, blad, over, diag 

409226; R16811; Hs.22010: ESTs. WeaMy slmii? to 210926; (ung; mAb 

409231; AA446644; Hs.692; GA733-2 an^en; epItteHal gt; pros. over, breast, utar. pane, colon, stem; mAb 

409243; AB037761; Hs.51743; KIAA1 340 protein; test; cfiag 

409262; AK000631: H5.52256; hypofhetical protein FU20624; pros; Cn.-*6.m. 

409264; KH.014937: Hs.52463; KIAA0966 protein; mela; CTL^ 

409269; AA576953; Hs.22972; steroid 5 alpha^uctase 2-6; breast over. lung. pane, uter mAb 

409327; L41 162; KS.S3563; co&agen, type IX. alpha 3; colon, pane, sarc; CTlnsR. 

409940; BE174629; Hs.32113Qe hypoOietical pnM MGC2771: mela; CTl-^sm 

409342: AU0770S8; Hs.54089; BRCAI assodated RING domain 1; test; C1I-»sm 

409348; AI401535: H5,146090; ESTs; renal, glio; diag 

409361; NM.005982; Hs.54416; sine ocuDs homeobox (Drosophi; blad. lung, pros; CTL^.m. 

409389; AB007979: Hs.301281; Homo sapiens mRNA, chnunosome ; gfio; dag 

40939S; U4S745; K5.336678; dystrobrevtiv a!p)u( gHo; dlag 

409402; AF208234; Hs.695; cystafin B (stefin B); blad; diag 

409415; AAS79258; 1^.6083; Homo sapiens cDfiA: FU21028 fi; mela; di^ 

409421; AA199883; H3.67624; ESTs; test; dlag 

409430; R21 945; Hs.346735; spHdng factor, aginineteeri; mela: diSQ 

409432; 049372; Hs.54460; BmaQ inducible cytokine sufaCa; storo. esoph; diag 

409433: AA074382; Hs.135255: ESTs; gHo. sarc; diag 

409509; AL036923; Hs.322710; ESTs; at^to; diag 

409512; AW979187: Hs^3591; melanoma dHferentiafion assoc; mda, esoph; CTUsja 
40954?. AA503020; Hs.36563: hypotheOcsl protein FU22418; breast, ovai; dicg 
409582; R2743Q: Hs.271555; ESTs; lung; diag 

409801; AF237621; H3.80828: iceratin 1 (epidermolySc hypen beadnk. mela, sarc; CTL-»5jn. 

409633; AW449822; Hs.65200; ESTs: sarc; diag 

409637; AA323948: Ks.55407; Homo sapiens mRNA; cONA DKFZp4; renal; diag 

409838; AW450420; Hs.21335; ESTs; glto; diag 

409670; AI368109; Hs.381163; KIAA1856 protein; test; CTL^sm 

409703; NM-006187; Hs.56009; 2'-5'-oIig(»denytete synthetas; pane, esopb, meta; sm 

40970S; M37762; Ks.56023; brtiin-derlved neurotrophic fac; hmg; diag 

409719; AI769160; Hs.108681; Homo sapiens brain tumor assoc; hn^ dlag 

409731; AA125985; Hs.56145; thymosin, beta, UenfiSed In ; pros, sarc; CTl^ 

409745; AA077391; ; gb:7B14E12 Chromosome 7 Fetal ; ovar, CFL^sm 

40993S; AW511413; Hs.187393; ESTs; lung; diag 

409958; NR4_001523; H3.57697; hyahnonan synthase 1; panc^ ovar. mAb 

409988; N27687; Hs.334334; transnipQbn bctor AP-2 s^; mela; diag 

410006; AW732308; Hs.57783; eukaiyotte translation iniOat; lest; dl^ 

410037; AB020725; H3.58009; KIAA0918 protein; pros; di^ 

410044; BE566742; Hs.58169: highly expressed in cancer, rt blad; diag 

410048; W76467; Hs.343874; proline oxidase homolog; test; 8.m. • 

410076; T05387; Hs.7991; ESTs; lung, pros; dlag 

410079; U94362; Hs^7S7; glycogenin 2; mela; diag 

410082; AA081594; Hs.158311; MusashI (DrosopMla) homolog 1; pros; diag 

410102; AW248508: Hs.2797Z7: ESTs; homologue of P&^Z [Oom ovar. breast. Uad. lung, angle, saro; diag 

410174; AA306007; Hs.59461: DKFZP434C245 protein; mela; ^ 

410240; AL157424; Hs.61289; synaptojanin 2; anglo; dlag 

410247; AF181721; Hs,61345; RU2S; ovar. CTL-»ojn. 

410266; AA316181; Hs.61635; 8b( transnrambrane epHhetial a; pane, pros. EWS; mAb^CTL 

410290; AA402307; H3.322844; hypolheflcd proteh DKFZp564A; mela; dlag 

410310; J02931; Hs.62192; coagulaSon factor ill (Unomb; pros, pane; mAb 

410361; BE391804; Hs.62661; guanytate binding protein 1, Y, mela. esoph, hepCkibro. uten diag 

410438; AW748012; Hs.45207; hypotheScal protein KIAA1335; lung; CTL^jn. 

410467; AF1(S546; Hs.63931; dachshund (DrosophOa) homolog; breast, coton. liter, stom; dlag 

410480: R97457: Hs.63984; cadherin 13. Hcadheifn (heart an^o; mAb 

410491; AA465131 ; H5.64001; Homo sapiens clone 25218 mRNA ; mela. esoph; OdQ 

410530; M25809; Hs.64173; ATPaso. H transporting, lysoso; ovar. mAb 

410553; AW016824; Hs^2068; hypoOteOcal protein MGC14128; blad. lung; dlag 

410555; U92649; KS.38013G; a dislntegrtn and metaltoprots; hmg; mAb 

410561 ; BE540255: Hs.6994; Homo sapiens cONA: FU22044 fi; lung; diag 

410566; AA373210; Hs.43047; Homo sa;^ cONA FU13565 6s; pane; dlag 

410500; AW575742; Hs.351 676; ESTs, Moderately similar to S6; mela; mAtHm 

410621; AA194329; Hs.172004: BOn; sarc; dlag 

410681; AW248890; Hs.65425; calblndln 1. (28kO); lung; diag 

410687; U24389; Hs.65436: lysyl oxidase^ 1; pane; diag 

410733; D84284; Hs.66052: C038 antigen (p45); pros; mM>*CTl 

410763; AF279145; Hs.8966; hypotheticsi protein FU21776: pane; mAb 

410855; X97795; Ks.66716; RA054 (SxerovlsiasHikB; test; 

410867; X63556; Ks.750; fibtiliin 1 (Marfan syndrome); pane; dlag 

410870: U81599; H3.66731; homeo box B1% pros; CTL^sjil 

410883: D43767; HS.G6742; Gai7 chemoldne (TARQ (SCYAI; leUk; dbg 
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410889; X91662: Hs.68744; twist (Drosophlla) homolog (ac; sarc; CTt«&nL 

410929: H47233: Hs.30643; ESTs; ovar, test diag 

41 1078; A)222020: Hs.182364: CocoaOrtsp; pros, qBo, breasl; diag 

411089: M456454: Hs^14291;c(^(l)vbloncyde 2^8 1 C'ltm^ 

41 124^ AB039866: Hs.69319: CAII; esoph; dieg 

41 1248; AA551538; Hs.69321; Homo sapiens cDNA aJ1440d fis; blad; dl^ 

411257: AA62B367; Hs.115274; Indan hedgehog protein (IKH) ; ovar; diag 

41 1263; BE297802; Hs.69360: IcinesifvGka 6 {mitoQc centro; lung, biad, hradnk; CTLiSJTk 

411296: BE2073D7: his.101 14; growth suppressor 1; san; dlag 

41 1356; R47479: Hs.94761; K1AA1691 protein: meta.renal. sarc; mAb 

41 1388; X72925: Ks.69752: desmocdlin 1: headnlt. mela; nnAb 

411393; AW797437; Hs.69771; &Mr, properdin (COMPUEMEN; ovar, dlag 

41 1558; AA102670; Hs.70725; gamnuhamlnotKityric add (GABA); pane, pras. stom. breast, uter, cerv, ovan 

41 1573; ABQ29D00; Ks.70823: K1AA1077 protein: pane, headnk. hmg, stom; diag 

41 1 579; AO0O5258; Ks.70830; U6 snRNA-assodated Sirvtice pr; lung; dl^ 

41 1732; U47924: Hs J1642; guanine nucleotide t}inding pre; lung; diag 

411768; NM-013371: K5.71979: interteuMn 19; ovar, uter, cerv; diag 

41 1789; AF245505; Hs.72157; AdDcan; fareast, pane. lung, storn, headnK ovar. uter. esoph. sarc; diag 

41 1825; AK000334; Ks.35241 5; solute canier family 39 (zinc; oolon, ovar; mAb 

41 1628; AW161449: Hs.722g0: wingiess-type MMTV integialion; ovan (flag 

41 1869; W20027: Hs^3439: ESTs; angio; diag 

411874; AA096106: Hs.20403: ESTs; blad; diag 

41 1880: AWB72477; ; ^m30f03j(1 Na_CGAP_Thy4 Ho; blad; diag . 

411945: AL0335Z7: H3.92137: L^^yc-2 protein (MYCU): blad, ovar; CTL^m 

412006: AW451618; Hs.380683; ESTs; sarc; diag 

412026; AA383618; i^73073: testis^pedfic anicyrin motif ; test; diag 

412045; AA09980% Hs.83883; transmembrane, prostate androg; pros; mAb^m 

412099; U64198: Ks.73165: intsrieukin 12 receptor, beta ; io^ meia; mAb 

412104; AW205197; Hs.240951; Homo sapiens. Similar to I^KEN: pane, fibro; diag 

412115; AK001783; Hs.73239: hypothelical protein FU10901: lung, blad; CTL"»64n. 

412116; AW402166; Hs.784; Epstein-Baft vims induced gen; pane; mAb 

412133; U83460; Hs.380728; solute canter family 31 (copp; pros; mAb 

412228; AW503785; Hs.73792; complement component (3d/Epste: meia: mAb 

412247; AF022375; Hs.73793: vascular endothelial growth fa; renal, gib, blad, colon: <fiag 

412265; AA101325; H3.66154: hypothetical protein aJ12457; test CTL-»sm 

412326; R07566: Hs.73817: small indudbla cytokine A3 (h; pros, ieuk; diag 

412351; AL1 35960; 1^.73826; T-oeH acute lymphocytic leuicis; angio; CTL^&m. 

412420; ALJ035668; Hs.73853: bone morphogenetlc protein 2; blad, gBo, bng, stom, angloi diag 

412448; 112964; Ks.73895; tumor necrosis factor rec^tor, Ieuk; mAb 

412471; M63193; Hs.73946; endothelial caB gimvSi facton cerv. mela, esoph; dla« 

412490; AW803564: Hs.288850; Homo sapiens cDNA: FU22528fi: dlag 

412519; AA186241: Hs.73980; tiepodn T1, skeletal, slow; sanr. (Sag 

412530; AA766268: Hs.266273; hypothetical protein FU13346: blad, lung; dlag 

41 2564: X83703; Hs.355934; cardiac snkydn repeat protein; angio; diag 

412580; AA113262; Hs.17g01; similar to CABLES IHemo sapien: mala; diag 

412610: X90908: Hs.74126; fatty add binding protein 6. ; Uad; dlag 

412661; N32860: Hs.24611; ESTs, Weakly similar to 154374: Uad; CTLism 

412715; NM.000947; Ks.74519: primase, polypeptide 2A (58kD); pros; sjn. 

412723; AA648459; Hs.33S951: hypothetical protein AF301222: lung. blad. headnk, colon, stom, uter; diag 

412755; BE144306: Hs.179891: ESTs. Weakly slmnar to P4HA^H: angto; am 

412811; H08382: Hs.349705: ESTs; lung; dl^ 

412817; A1037159; Hs.74619; proteasome (prosome, macropain: hmg; sm 

412843; AF007555: Hs.74624; protein tyrosine phosphatase. ; pros; mAb 

412656: 6E386745: Ks74631; basigin (OK blood group): mela; mAb 

412926; AI879076: Hs.75081; macrophage myrislqyiated aiani; mela; CTL-^m 

412939; AW411491; Ks.75069; euk^otic Iranslalbn elongat; mela, rsnal: diag 

412970; AB026435; Hs.177534; dual specKidty phosphatase 1; breast, melas sja 

412986; X81120: Hs.751 10; cannablnoid receptor 1 (brain): gHo; mAb 

413004; T35901: Hs75117; interieuMn enhancer binding f; (ung; diag 

413011; AW068115: Hs.821; blgfycan; hjng; CTL-*sjn. 

413049; N^^002151: Hs.623: hepsin (transmembrane protease; pros; mAb 

413095; AA494359: Ks.30715: pdasshim voltago^ated dianne; pane. stom. rend, coton; mAb«sm 

413125; BE244589: HS.752D7; giyoxalase I; pros; sm 

413126; AW419203; Hs.174174: ESTs; angfo; diag 

413129; AF292100: H3.104613; RP42 homdog: hing; dlag 

413132: NMJ)06823; Hs.75209: protein kinase (cAMP-dependent; angb: CTL-»sjn. 

413142; M81740; Hs.75212; ornithine decaifomcyiase 1; hmg: sjn. 

413163; Y00815; Hs J5216: protein tyrosine phosphatase. ; pros: mAb 

413171; AA31832S: H3.75219: tyroslnase^ated protein 1; mela; mAb 

413190; AA151802; Hs.40368; adaptor-related protein compte; mela: diag 

413219; AA878200: Hs.1 18727; Homo sapiens cONA RJ13692 lis: esoph, cerv; dlag 

413223; AI732182; Hs.191886: ESTs; hmg; diag 

413268; AL03S079; Hs.75256: regulator of &protein slgnaO; headnk; CTL-»s.m. 

413281: AA861271; Hs.222024; tianscrtptton fador BMAL2; lung, blad. headnk, pane, angio: <ftag 

413313; NM.002047; Hs.3388S: gtycyf-tRi^ synthetase; test; sm 

413328; Y15723; Hs.75295: guanylato cydase 1, sohibls. : pros; SJn. 

413335; A1613318; Hs.48442; ESTs; ovar; diag 

413354: BES36218: Hs.137516: fktgetli>aa 1; lung; dlag 

413372; H55532; Hs^9695: tobuDn. alpha 2; test diag 

413435; X51405: Hs.75360: caifaaxypeptkfase E; pros, glio. pane, sarc; diag 

413436: AF238083; Hs.68061: sphlngoslna kinase 1: saro; sja 

413472: BE242B70: Hs.75379: soluto eanfer lami^ 1 (glial: tfto: mAb 

413566; AW604451; Hs.381 153; sprouly (Dresophfla) homotog 4; sarc; CTL^ 
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413573; AI7338S9: Hs.1490e9: ESTs; lung; diag 

41358% AW295647: K3.71331 : hypotheUca) protBin MGC5350; lung; diag 

413597; AW302885: Hs.117183; ESTs; pros; (Sag 

41 3823; AA825721; Ks.246973; InUon of BicautSal D homolog 1; ovar, pns: diag 

413691; AB023173; Hs.75478; ATPase. Qass VI. type 1 1 B; lung; raAb 

41371 1; AW291765; K5.75486; heat shock transciiption facto; renal; diag 

413753; U17760: K8.75517; lamlnln, beta 3 (nicein (125kD; hmg, Uad. headnK pane. cew. esoph, colon; diag 

413762; AW41 1479; Hs.848: FK506Ming protein 4 (59kD); test: (Sag 

41 3778; AA090235; Hs.75535; myosin, f ght polypeptide 2, r; saic; diag 

413794; AF234S32; Hs.61638; myodnX mela; diag 

413804; T64682; ; gtJ7c48t)02n Stratagene liven t}lad; diag 

41 3808; J00287; Ks.350038; Homo sapians mRNA for caldesmo; esoph; diag 

413813; M96956; Hs.75561; teratocardnoni^lenved growth; oolon; diag 

413833; Zt5005; H5.75573; centiumers protein E (312icO): lung; CTl«6.m. 

413842; M29363: Hs.856; interferon, gamma; leuk; diag 

413889; NM_OO0d78; Hs.75596; Inteileukin 2 receptor. t)eta; filiro, rena!; mAb 

413880; Ai660842: Hs.110915: Intedeukin 22 receptor; pane, coton; mAb4S.m. 

413924; AL11S964; H175B16; Gdadin-1; pros, bieast. ovan diag 

413943; AW294416: Hs.144687; Homo sapiens cONA FLJ12981 fis; Uad. lung; CTL^m. 

413985; At01866G; Hs.75667; synaptophysin; glio. sarc; mAb 

414004; AA737033; Hs.7155; ESTs, Moderately similar to 21; pane, mela; diag 

414020; NM_002984: Hs.75703; Ca4 Chemokine ((>C moCi) Dg; pros; diag 

414034; U89277: Hs.305S85: eariy devek)pment reguIatDr 1 ; test; CTL^ia 

414035; Y0063D; Hs.75716: serine (or cysteine) pnrteinas; lung, cerv. tieadrA, Uad; sm 

414053; BE391635; Hs.75725; transgsUn 2; Mad; (fiag 

414061; NM_000899: Hs.300280; amytase, afpha 2A; pancreatic: ovan diag 

414065; AW515373: Hs.27124g: Homo sapiens cONA aJ13580 (is; pros; diag 

414085; AA1 14016; Hb.75748: aldehyde dehydrogenase 1 famO; pros, pane, sarc; s.m. 

414142; AW368397; H8.3344e5: hemicentin (fibulin 6); fibio, pane, sarc; diag 

414161; AA136106; H9.184852; K1AA1553 protein; test; diag 

414166; AW888941; Hs.75789: N-n^ downstream reguiated; pros, rend; diag 

414217; A1309298;Hs^79898; Homo sapiens cONA: FU23165fi;glk):diag 

414219: W20010; Hs.75823: ALU-fused gene from chromosom; sarr. diag 

414221; AW450979; ; ob:UI-H-fil34la-a-12-04Jl.s1 N; blad; diag 

414251; A1J042306: Hs.97689;VASA protein; test; CTL^m 

414259; W44633; Ks.301296; biiegiln, beta^ 1 (with EG; pane; diag 

414334; AA824298; Hs.21331; hypolheScal pmtein FU 10036; test; diag 

414359; M62194; Hs.75929; cadhertn 11. type 2, OB-cadhen breast, over, (iter, pros, colon, pane, sarc; mAb 
414368; W70171; Hs.75939; uridine monophosphate kinase; lung; sjn. 
414386; X00442; Hs.75990; haptoglobin; ovar. diag 

41441 6; AW4099B5; H3.76084: hypotheScal protein MGC2721; Uad, lung; CTL-»8.m. 

414420; AA043424; Hs.76095; fai\medlate early response 3; pane, colon; diag 

414430; AI346201; Hs.761 18; ubiquIOn cartx»^termlna] es; lung; sm 

414443; AU077268; Hs.76144; pi^t-derived growth factor sarc, pane, headnk; mAb 

414476; AA301867; Hs.76224; EGF-containIng fibuDn^Ike ex; angio; diag 

414477; 1141635; H5.76228; ampfified in osteosarcoma; sere; diag 

414509; AW161311; Hs.76294; C063 antigen (meianoma 1 antig; mela; mAb 

414521; D28124; Hs.76307; neuroblastoma, suppression of; breast, ovar, uter, pros. Uad, pane. ookm. stem, fibr; diag 

414565; AA502972: Ks.183390: hypotheHcal protein FU13590: pros; diag 

414569; AF109298: HS.1182SQ; prostate cancer associated pro; pros. EWS; diag 

414575; H112S7; H&375743: Homo s^ens done IMA6E:45193; lenal; diag 

414595; AA641726; Ks.289015; hypothefleal protein M6C4171; Mad; diag 

414602; AW630088; Hs.76550; Homo sapiens mRNA; cDNA DKFZpS; pros; mAb 

414683; S7B298; Hs76888; hypotheiteal protein MGC12702: Uad. lung, test; diag 

414732; AW410976: Hs.77152; ndnichromosome maintenanca def. test. Uad; diag 

414761; AU07722B; Hs.77256; enhancer of zeste (DrosophDa); lung. Uad, test; Ca-^m 

414776; AA155598; Hs^2839: hypothefoa! prot^ FU14195;; an^: diag 

414786; AI246482; K3.243010: Homo sapiens cDNA FU14372 fis; angto; diag 

414789; AI752416; Hs.77326; insuOn^Ike growth factor Un; renal; diag 

414806; D14G94: Hs.77329: phosphaOdylserfne synthase 1; hmg; mAb 

414807; AI738616; Hs.77348; hydroxypiostaglandln dehydroge; blad; sja 

414609; A]434699;Hs.77356:translerTln receptor (p90, 007; lung; mAb^ein. 

414825; X06370; Hs.77432: epidemnal growth factor recept glio, lung, renal, esoph. pane, headtik. leidc ; mAtn&m. 

414915; NR4.002462; Ks.76391; myxovkus (influenza) resistan; esoph; diag 

414918; AI219207; Hs.72222; hypothetical protein FU13459; blad; CTL 

414921; BE390551; Hs.77628; steroidogenic acute regulatory; breast; diag 

414945; BE076358; H5.77667: lymphocyte antigen 6 con^ ; mela; mAb 

414987; AA524394; Hs.294022; hypotheScal protein FiJ14950: Uad, pane, esoph; diag 

41 4998; NM-002543; Hs.77729; oxidised tow density fipoprote; fibio, ovar, pane, coton; mAb 

415003; M11437; H3.77741; kininogen; pane; diag 

415025; AW207091; Hs.72307; ESTs; Wad; diag 

415091; fimmz Hs.77910: 34)ydrox^3^thy1glularyU>>e; hng. headidc Bxn. 

415178; 080503: Hs.46692: ESTs; blad; diag 

415214; AI445236; Hs.125124; EphB2; colon, stem; mAb 

415314; N88802: Hs.5422: glycoprotein M66; mela; mAb 

415457; AW081710: H8.7359; Homo sapiens testes specfflc A; tibro, ovar, uter; CTl-^in. 

415511; A1732617; Hs.182362; ESTs; blad, ovar, renal; diag 

415542; R13474: H3.290263; ESTs, Weakly similar to 138022; blad; diag 

415724; NM_003580: Hs.78687: neutral 8p«ngoTnyelinase(ri-SM; lest; CTL^m 

415752; BE314524; H3.78776; putative transmembrane protein; endo, uter, breast stom, Uad. mela; mAb 

415766; AW419196; Hs.257924; hypothetical protein FU13782; breast pros, Uad; diag 

415787; H01463; Hs^534; ESTs; pros; diag 

415819: AU077330; HS.36Q791; transcr^ptton elongaOonfapto; test; CTL^sjn. 
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415829; AW450198; Hs.163742; ESTs: test; diag 

415857; AA866115; H8.127797: Homo sapiens cONA faJ11381 Qs; lung, test; dl^ 

415910; U20350: Hs.78913: chemoMne (C-X3^ receptor 1; gOo; mAb 

415947; U04045; Ks.7B934; mutS (E coQ) homoiog 2 (ooto; tssl; (flag 

415989; AI267700; Hs.351^1; ESTs; pms.ov8r, tdad, tung, headidt, pane, cotoa sarc; diag 

415992; C05837; Hs.145807; hypothetical protein aJ13593; pros, Gbro; mAb 

415999; AA172179; H5.294029; ESTs; pros, tder ; diag 

41 6018; AW138239; Hs.78977; pioprote&i convertase subtiDs; coton, pane, hmg; diag 

416030; H15261; Hs.21948; ESTs; breast fibro; diag 

416065; BE267931; Hs.789S6; prc^feraitng ceS nudear ant; blad, lung, headnk, mdta( CTL<*sja 

416111; AA033813; HS.7901B; chromatin assembly factor 1, s; fung, stem; CTL'^m 

41618B; BE157260: Hs.79070; v-myc avian myetocytomalosis v; pros; diag 

416201; AA4677S2: Hs.195161: ESTs; test; dag 

41 6208: AW291 1 68: Ks.41 295; ESTs, Vfeaidy dmter to MUC^Jt tung; diag 

416224; NM.0O2gO2: Hs.79088: reticdocaibin 2, EP-hand caic; ovar. diag 

416225; AA577730; Hs.188884; ESTs. Weakly simitar to PC4259; pros, Uad; dicg 

416350; AF168625; H5.189S07: phosphoiipasa A2, group liD; test meia fibre; diag 

416370^ N90470t Hs.203897; C038 anOgen tp4Q: pros. gSo; mAlnCTL 

416373; AA1SS845; K8.73680; ESTs, WeaMy sMtar to S12658; saro; diag 

416402; NM.000715; h(s.1012; complement componeni 4-b!ndlng; fibro; (Sag 

416448; L13210; Hs.79339; lectin, gaiactoslde^ng, s; ovar, coion. stom; diag 

416438; U33632^ K5.79351; potassium channet subfamily K; pane, stom, breast ovar, colon; mAb 

416539; Y07909: Ks.79368; epHhellal membrane protein 1; pros, lieadnk; (Sag 

41 6602: Nl^006159; H3.367895; Protein Unase OUnding prote; brest; diag 

416640; BE262478; Hs.13406; neuion^pecHic protein; meia; diag 

416661 ; AA634543; H3.79440: IGF-ll mRNA^Inding protein 3; btad, lung, headnk. ceiv. pane, angto; diag 

416773; AK000340; Hs.79828; hypothefical protein FU20333; lest; CTL^m 

416615; IM1514; H$.8012O; UDP-N-acatyi^fr(Vgal8Ct03a:angio;84n. 

416819; U77735; Hs.80205; pim-2 oncogene; lung, test; diag 

4168B1; N32520; Hs.141358; ESTs; meta; diag 

416929: N20535; Hs.43265; melastaSn 1; mela; diag 

416975; NM-004131; Hs.1051; granzyme B {granzyme 2, cytoto; mela; sjn. 

417003; AU)38170; Hs^756; betains-homocysteine meVvlba; btad; sm 

41707O; Z19077; Hs.172004; titin; sarc; diag 

417105; X60992: Hs.B1226; COS antigen; fibro; mAb 

417115; AW952792; Hs.334612: small nuclear rIbonucteoprDtel; test; CTL^sm 

417124; BE122762; Hs.25338; ESTs; angio; diag 

417148; AA359896; Hs.374554; hypothetical protein aJ14902; pane; diag 

417151; AA194055; H3.293858: ESTs; Uad; diag 

417153; X57010; KsJ1343; coiiagen. type D, alpha 1 (pn pros, saro; diag 

417218; AA005247; Hs.285754; met prot(H)ncogend (hef^iof^ lung; mAb«s.m. 

417237; K66385; Hs.81737; palmitoyH)roteIn thloesterase; mela; sm 

417259; AW903838: Hs.81800; chondroitin sulfate proteoglyc; pane, breast; diag 

417275; X63578; Hs.295449; pan^bumbi; blad; diag 

417295; AW993524; H8.43148; epitheliafl mentoie piot^ 1; pros; diag 

417308; H6O720! Hs.81892: KiAAOIOI gene product hing. headnk, Uad. oenr, angio, mela^ sarc; diag 

417312; AW888411; K8.2S0811; Isukemid-assodated phosphopro; Uad; CTt^m 

417333; AL157545; Hs.173179; bromodoman and PHD finger con; breast diag 

417355; D13168; H5.82002; endotheSn receptor type B; gfio, meia; mAb 

4173S5; D50683t HsJ2028; transforming growth factor, be; fibro, angio; mAb 

417366; BE18S289; Ks.1076; smalt proDne-iich protein IB ; hmg. Uad. headnk, pane, esoph. mela; diag 

417370; T28651; Hs.374466; tryptophanyi4RNA synthetase; fibro, mela; diag 

417391; AW291946; Ks.82085; Interleukin 6£lgna] transduce; breast mAb^ 

41 7400; AA663486; K3.123072: E^B, member RAS oncogene ^ pros; diag 

417407; AA923276; Ks^0g05; ESTs. WeaUy sin^ar to protea; test pros; SJii 

417409; BE272506; Hs.82109; syndecan 1; blad; diag 

417412; X16896; HsJ2112; interleukin 1 receptor, type I; fibro. pros, pane; mAb 

417426; NH.002291; Hs.82124;lam]nin. beta 1; angio; diag 

41 7437; U52682: H3.82132; interferon regdaloiy factor 4; mela; CTL+sm 

41751 2; X76534; Hs.82226; glycoprotein (transmembrane) n; hing, meia, headnk. pane, breast mAb 

41 751 5; U4203; Hs.82237; ataxlatelangieclasia group [>; hing, headnk. biad; diag 

417542; J04129; Hs.62269; progestagen-assodaled endomet lung, meia; diag 

417592; AA204664; Hs.162437; ESTs. Weakly similar to 154383; test diag 

417599; AA204688; Hs.62954; ESTs; Wad, esoph; diag 

417621; AV654694; Hs.82316; interferon-induced, hepaOIis ; esoph; diag 

417696; BE241624; Hs.82401; C069 antigen (p60. eaify T-cet pros; mAb 

417705; AW134952; Hs.175220; hypolhetk:a] protein FU14S41; test mAb 

417715; AWg69587; Hs.86386; ESTs; btad. lung, headnk; diag 

417720; AA205625: Hs.208067; ESTs; Uad, tung. esoph. headnk; diag 

417750; Ai267720; Hs.260S23; synovial sarcoma, banslocated; sarc; diag 

417777; AI823763; Hs.7055; ESTs, Weakly similar to I7B885; test 8.m. 

417791; AW965339; Hs.44269; ESTs; ovar, Uad, hing. headnk; CTL-»8m 

417801; AA417383; H3.82582; integral, beta^ke 1 (with EG; pane, fibro; diag 

417805; U38545; Hs.82587; phosphoilpase D1, phophalUytc; angio; sm 

417831; H16423; Hs.82685; C047 antigen (f^h^related antig; ovan mAb 

417843; W07381; Ks.22545; Homo sapiens cONA Fyi2935 lis; pros; diag 

417847; AI521558; Hs.7331; hypcfhetica] pMn FU22316; ovan diag 

417849: AW291S87; Ks.82733; nkjogen 2; engk). headidc diag 

417874; BE616160; Hs.82829; protein tyrosine phosphatase, ; pane; mAb*sjn. 

417880; 6E241595; Ks.82848; selectin L (lymphocyte adhesto; meta; mAb 

417686; AA214564; ; ESTs; test ovar. di^ 

417900; BE250127; HS.B2906: CDC20 (oelldiv!sbncycle20,; hmg, stoni, test Uad. headnk, cerv. esoph; GTUsm 
41791 1; AA333387: Ks.82916: chaperonin oontaJhino TCP1. su; test (Sag 
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417944; AU()77196: 88.82985; coQagan. type V. alpha 2: sarc; diag 

417975; AA64183G; Ks.30085; bypolheOcal protebi FU2318Ge colon, skua tung; mAb 

417976: BE565892; Hs.83077; inteiteuUn 18 (interferen-gam; coton. stonv fitnos; diag 

416004: U37519; Hs,87539: aldehydo dehydrogenase 3 (am9; tu% headrtk, osoph; 8.ni. 

416036; Z37976; Hs.83337: latent transfbrming growth fac; anglo; diag 

416054; NhL002318: K5.83354: lysyl (Bddas&ace 2;.luno. angio. test, saic; dteg 

418067: AI127958: Hs.83393; cystatin EM; haadnk. pain, htad; db« 

41 80^ AW971 155; Hs.293902; ESTs. Weakly aimOar lo ISHUSS; tilad; sm 

41 81 13: AI272141; Hs.B3484; SRY (sex delennining legton Y); blad, breast liter, colon, lung, ovar. gOo^ lest saiu; CTL^m. 
418134: AA397769; Hs.86617: ESTs; test: di^ 

41814(k BE613836: Hs.83551; microQirillar-assoclated prot: lung, headnk, esoph, ovar, sarc; dlag 
418156; W170S6: Hs.83623: nudear leoeptor subfamily 1, ; {3)ro: mAb^sJn. 
418203: X54942; Hs.83758; CX)C28 protein kinase 2; \mg, blad, test mela. sbnc SJn. 
418216; AA66224D: H3,283039; AF1 5q14 proteli^ headnk, lung, Wad; dlag 
418245; AA088767; Hs.83883; tjansniemt)fane. prostate androg; pane; mAtnem 

418283; S79895: Ks.83942: cathepsfn K (pycnodysostosk); breast cefv. ovar, uter. pros, headnk. fung. pano, colon, stonn; dlag 

418318; U47732: Hs.84072; transmembrane 4 superfamfly me: pane, pros, colon, stem, omuc; mAb 

418322; AA284166; Hs.84113; cydn^iependent kinase inhU; headnk. tung, blad; sm 

418338; NM_.0O2522; Hs.84154; neuronal pentraxin 1; sarc; diag 

418339; AA639902: Hs.104215: ESTs, Moderately similar to SP; pros; dlag 

416345; AJ001696; Hs.241407; serine (or cysteine) proteinas; cefv. lung; s-nv 

416371; M13560; Ks.84298: C074 anBgen (hwartant pdype; renat; mAb 

418379; AA21 8940; Hs.137516; fidgeting 1; lung; diag 

418394; AF13281 8; Ks.84728; Kruppel-Rke factor 5 ftntesti; pane; CTL-»6.m. 

418396; AI765805; Hs.26691; SLC2A12 Solute carrier famDy ; pros; mAb 

418397; NMJ)01269; Hs.84746; chromosome condensation 1; lung: dlag 

418399; AF131781; Ks.84753; hypothetteal protein FU12442: test blad, saro; diag 

416406; X73501 ; K5^905; cylokerafin 20: blad, coton; diag 

416432; M14156; Ks.85112; insuSn^le growth bctor 1 (; pros, fibre; diag 

416460; M26315; Hs.65256; C08 anSgen. alpha po^peptkle; libro; mAb 

41846^. BE001596; Hs.85266; integrin. beta 4; blad, cerv, headnk, ovar. nM> 

418543; NM.005329; Hs.85962; hyaturonan synthase 3; biad. lung; mAb 

418576; AW968159; Hs.302740: Epithelial calcium channel I : pros; mAt>^m. 

418610; AW245993; Hs.32417; hypothetteai protein M6C2742; pros: diag 

418641; BE243136; Hs.66947; a dislntegfin and melalloprote; cerv, lung, pane, blad. headnk, slom; mAb 

418655: AA226354; Hs.11 1240; ESTs; pros; dlag ' 

418661; Nk\.001949; H8.1169: E2F transcriptkMi factor 3; ovar, lung, mela; CTL^m 

416663; AK001100; Hs.41690; desmocolSn 3; lung, blad, headnk, eeiv, esoph; mAb 

418683: U90908; Ks.67241; hypoMca! protein from cton; anglo; CTL'*sjn. 

416686: Z36830; H5.87268; annexin A8: blad, tung; dlag 

418893; AI750878; Hs.87409; thrombospondin 1; anglo, pane; diag 

416696: AW959433; Hs.326290; hypoOieOcal protein BJ12581: test: dlag 

418739; AA310964: Hs.8e012; SHP2 interacGng transmembrane; mela: dlag 

418756; AA252254: Hs.226949: ESTs; test : diag 

416825; AA228881; Hs.22394; hypofhefical protein RJIOdgS; anglo; diag 

416829; AA516531; Ks.55999: NK homeobox (DrosophDa), faml; pros; diag 

418882; NM.004996: Hs.89433: ATP-blnding cass^, sub-fami; ovar, pros, breast lung; diag 

418883; BE387036; Hs.1211: acid phosphatase 5, tartrate r, m>i sm 

416886; AU076801; Hs^9436; cadherin 17, U cadherin (live; colon, etom, ovar. uter. pane; mAb«sja 

416916; X07871; Hs.69476; CD2 anSgen (p50), sheep red b; mela, fibro; mAb 

418932; L34059; H8.89484: cadherin 4, type 1. flcadherin; glto; mAb 

416941: AA452970; K3.239527; ElB^SkOa-assodaled protein 5; angio. Uad; dlag 

418968; NM.000076; Hs.89538; cholesteiyl ester transfer pro; mela; diag 

416994; AA296S20; Hs.89546; selectin E (endothelial adhesi; pros, angto; mAb 

419038; AW134924; H3.58290; ESTs; pros; dlag 

41 9073; AW372170; Hs.183918; transmembrane receptor Uhc5H2 ; owr, renat Uad, hmg; mAb 

41 9076: M931 19: Hs.89584; Insulinoma-assodated 1; Uad, hing, pane, safe; dlag 

419079; AW014836; Hs.18844; ESTs; esoph. hmg. stom, colon: dl^ 

419086; NA^000216: Ks.e9591; Kallmann syndrome 1 sequence; fibro; diag 

419092; J05581: Hs.89603; mucin 1, (ransmembrane; breast pano, kmg, blad, fairo; mAb 

419222; AD001 528; Hs.89718; spernibie synthase; pros; sjn. 

419223; X60111; Hs.1244; C09 antigen (p24}; breast pros, ovan mAb 

419231; AIJ046294; H3,136245; ESTs, WedWy similar to T17227; laro; dlag 

419261; X07876; H5.89791; tivtngless4ype MMTV Integration; pane; dlag 

419264; AA8ni04; Hs^72; ESTs, Weakly similar to ALUB.K pros; di^ 

419290: AI128114; Hs.1 12865; spinal oordderived grouuth fiac; pane; dlag 

419356; A1656166: Hs.7331; hypothetkal protein FU22316: uter. ovan dieg 

419359; AL043202: H3^G073; chromosome segregation 1 (y^ lung, Mad. test; dbg 

419440; AB020689; Hs.90419; KIAA0882 protein; bieast dlag 

419485; AA489023; Hs.99807: ESTs, Weakly similar to unname: mela; dlag 

419490; NM_008144; H8.90708: granzyme A (granryme 1 , cytoto; libro: sm 

419519; AI198719; Hs.176376: ESTs; mela; diag 

419551; AW5822S6; Hs.9101 1; anterfor gradisnt 2 (Xenepus I; pane. pros, breast: dteg 
419559; Y07828; H5.91096; ring finger protein: Wad, colon, stom; ai-^ 
419568; AB0261 16; Hs.28307B: hQAT4; renal; mAb 
419569; AI971651; Ks.91143; jagged 1 (At3(^ syndrome); headnk. hmg; diag 
419628; H67546: Hs.49768; ESTs; mela. sarc; dlag 

419867; AU077005; Hs.92208; a dlslntegnn and melalloprote; breast cerv, angio; mAb 

419693; AA133749: Hs.30135Q; FXYO domdiHxintalning Ion tra; pros, breast ovar. pane, kmg; mAb 

419721; NM.001650: Hs.315369; aquaponn 4; gOo, hmg, fibro; mAb 

419743; AW408762: Hs.5957: Homo sapiens done 24416 mRNA ; blad. headnk; dl^ 

419749; X73608; Ks.93029; sparcteteonecfin, cwcv and ka; pros, pane, hmg; dteg 

419752: AA24957a; HS.1S2618; ESTs, Moderately simaar to ZN; kmg; dlag 
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419839; U24577: H&93304; phosphollpase A2. group VII (p: pras. lung; (Sag 

41 9870; AW40391 1 ; Hi.266175: phtsphoprotBin associated wHh; melac (Sm 

419875; AA853410: Hs.93557i proenkephalln; saic; dlag 
- 419948; AB041035: Hs^7; NMJ)16931:Homo s^tora NAOPH o; angto: mAb 
O 419956; AL137939: Hs.40096; cadherin 19, type 2; mela; mAb 

419958; X04430; Hs^13; intefleuMn 6 (inteiferon. t)8t: tung. pane, esoph; diag 

419981; AAB97581; Hs.128773; ESTs; angio; tfaa 

42000S; AW271 106; Hs.133294: ESTs; tung. test. Iilad, colon; diag 

420062;AW411096:H5.94785;TGF(betaHndueedtranscrtplio;test;CTL*s.ni 
i U 420067; T52431 ; Hs^795; Homo sapiens mRNA; cDNA DKFZpS; sarc; diag 

420137; AA308476; Hs.95327i C03D antigen, delta polypi; fibn; mAi> 

420154; AI093155: Ks.95420; G anflgen family C 1 protein (: pnjs. taia; CTL-»m 

420174; AI824144; Hs.199749; ESTs; angio; CTl^m. 

420208: BE276055; H3.95972; sDver (mouse homoiog) lika; mela. sac; mAb 
1 D 420209: AA2S6444: Hs.126485: liypothetica) protein faJ12604;: angio; diag 

42Q218; AW9S8037; HsJ81105; ribosomd protein U: mela, pros: dl^ 

420255; NM.00728g( Hs.1298; membfane metEflcMndopeptidase; pros; mAb 

420256; U84722: Hs.76206: cadherin 5. type Z VEnadheri; angio. fibre; mAb 

420287; N37030: Hs.173337; ESTs; mela, sarc; dlag 
ZU 420281; A1623693; H8.323494; Predteted cation efflux pun^ lung, t)}ad, ovar. pane; mAb 

420301; AA767526; Hs.22030; paired box gene 5 (&<eD line; mela; dlag 

420338; AAB25595; Hs.88269; Homo sapiens, done MGC:17339,; mela; mAb 

420340; NM.000734; Hs.97087; C032 anflgen. zela polypeptide; fibro; mAb 

420344; BE463721: Hs.97101; putative G proiein^oupled rec; colon, pros, biad. Iieadnk, pane, atom, mAb 
Z:> 420347; AL033539; Ks.97124; Human DNA sequence from done ; tesi; diag 

420360; U63171; Hs.97203; small Inducible cytoldne subl^ leulc 

420367; AA259090; Hs.2S7028: ESTs; test; diaQ 

420376; AL137471; Hs.97265; protocadherin 18;88ro; mAb*SJa 
^ ^ 420378; Nl\4-014143; Hs.97269; B7411 protdn; leuk; mAb 
iO 420380: AA640891; Hs.102406: ESTs; lung; dlag 

420424; AB033036: Hs.97594; KIAA1210 protein; pros; diag 

420462; AF050147; Hs.97932; chondromoduEn I precurson lung, EWS. sarc; mAb 

420474; 109753; Hs.1313; tumor necrosis factor (Rgand); leuk; mAb 

420544; AA677577; Hs.380213: Homo sapiens Chromosome 16 BAC; pros; diag 
3d 420576: AA297634: Hs.54925; KIAA1858 protein: sarc; diag 

420598; NM.002692: Hs.99185; potymerase PHA directed), eps; test; CTL4s.m. 

420833; NW.014581; Hs.274480; odoranW)indlng proteh 2B (06; breast, endo: diag 

420856; AA279098; Ks.187638: ESTs; fibro; diag 

420710; NM_007009:Hs.g9875: zona peHucida binding protein; test; diag 
40 420729; AW964897; Hs.290825; ESTs; pros; diag 

420757; X78S92: Hs^9915; androgen receptor (dihydrolest pros; mAb*sjn. 

420759; T1 1832; H8.127797; Homo sapiens cONA FU1 1381 fis; test; diag 

420783; At6S9838; H8,99923; lecQn. gaiactosld&bindlng. a: lung, blad. headnlc diag 
. mm, AI670057: H8.199882; ESTs; renal.; di^ 
4D 420859; AW468397; Hs.100000; S100 caidum-binding protein A; saic; dlag 

420908; AL049974; HS.10Q261 ; Homo sapiens mRNA; cDNA DWTpS; pane; diag 

420923; AF097021; H$.273321; difliBrenfiaiiy expressed In he; bled, colon; dlag 

420931; AF044197; H8.100431: smaH inducible cytokine B sub; breast lung, meisc dISQ 

420981: L40904; H8.100724; peroxisome proliferafive acUv; colon; mAb^sm 
DU 421016; AA504583; H8.101047; transcription factor 3 (E2A Im; test; CTLt«m 

421044; AF061871; Hs.101302; Human DNA sequence from ctorw ; pane; diag 

421059; AI654133; Hs.356247; thyroid receptor Inleracfing p; pros; mAb*«ja 

421064; AI245432; Hs.101382; tumor necrosis factor, atphfri; Uad. uten diag 

421070;AA283185;Hs.19327; ESTs; Wad; dlag 
00 421100; AW35ie39;Hs.124650; Homo sapiens cONA:nJ21763fl; blad; dlag 

421133: AA814971; Hs.26410: ESTs; lung; diag 

421154; AA284333: Hs.287631; Homo sapiens cONAajl4269 to; BPH; dlag 

421156; H87879; Hs.102267; lysyl oxidase; headnk, pane, rena). sarc: dlag 

421218; NW.000499; Hs.72912; cytochrome P45a subfamily I {; blad. angb; diag 
OU 421233; AA209534: Hs^243; tetra^ NET-6 protein; pros, breast, ovar; mAb 

421241; X91817; Hs.102866; transltetolas^ 1; test; sm 

421302; T34462; Hs.1G3291; neuritin; uter, endo ; dlag 

421305: BE397354: Hs,324830; diplheila toxin resistance pro; over; diag 

421307; 6E539976: H3.10330S; Homo sapiens mRNA; eONA 0KFZp4; breast lung, angio^ lest sarc; diag 
OD 421341;AJ243212;Hs^g611;deletedmmaIign8ntfarBinlum;pane,lung:(fl» 

421350; AW301608; Hs.278188: ESTs. Moderately similar to 15: test; ditt 

421373; AA808229; Hs^B8; ESTs; Wad; diag 

421433; AI829192; H8,22380; ESTs; pros; dlag 

421451; AA291377: Hs^831; ESTs; w», Uad. lung; diag 
/U 421458: NM.003654;Hs.104576;carfaoivdFBle(kefatan8uH^;$aiG;8Jit 

421478; A1683243; HS.9725B: ESTa, Itoderateiy slnnllarb $2; ovar, Wad. renat uter; diae 

421481; AW391972; Hs.104696; KIAA1324 proton; pros; dlag 

421502; AF111856; Hs.105039; solute canier tamiiy 34 (sodfc ovar, fibro; mAb 
- - 421506: BE302796; Hs.105097; thymidine kinase 1 , sduWe; hing, headnk. esoph; s.nv 
iO 421508; NM.004833i Hl105115; absent In melanoma 2; Wad. esoph. lung, melaE Ca-»8m 

421535; AB002359; Hs.105478; phosphoribosylformylglyrfnamld: test 8.m. 

421537; 6E383488; Hs.105547; neural prolifiaretion. differen; pros; dlag 

421566; NRC000399; Hs.1395: eaiiy growth response 2 (Krox-; pros; CTL-»fl.m. 

421579; NM02975; Hs.105927: stem ceO growth fadon tymph; sarc; mAb 
oU 42163% AF121860; HS.1Q6260; 8or6ng nexin IQc mela; dlag 

4216SQ; AA781795; Hs.343800; ESTs; mela: diao 

421666; AL0352S0: H3.1408( endotheSn 3; mdec diag 

421727; Y13153: Hs.1073186 l^urenine ^monooxygenase 0^ breast am 
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42177% W69233; Hs.1 12457; ESTs; mela. oso^ saic; <fi^ 
421777^ BE562088; Hs.108196: H$P0Q37 protebi: esopb, cerv, lung; dlag 
421779; AI8781S9: Hs.108219; \A^tes&4ype MMTV iniegraSon; cobn, ovar; (Sag 
. 421738; K74880; Hs^62; N^phlngosine amidohydrota; libra; ua, 
J 421614; L12350: Ks.108623; Ihrombospondin 2; pane; (flag 

421831; AA298838:Hs.22026:EST8; 8lnilartoT1WISMEMBIW);ai«to;mAb*sjn. 
421867; AW161450: Hs.109201; CGI-66 proton; pros; mAb 
421896; N&2293: H5.45107; ESTs; pros: <fiaQ 
, _ 421917; AB028343; Ks.109445; KIAA1020 protein; tast (Hag 
1 U 421920; BH551245; H3.1438; gamma-antiobutyrtc at^ (GABA); sarc; mAb 

421924; BE514514; Hs.109606; coronln, acto-blnding protein; fibro; dlag 

421 946; L42563; Hs.334309; keratin 6A; (ung, headnk, Uad. esoph, ceiv, mela; dlag 

421952; AA300900; KS.S8849; dyneln Dght chain 2B (DNIC2B); fibnn diag 

421991; NM.014918; H5.110468; KIAA0990 protein; pane; diag 
1 S 421996: AW583807; Hs.1460; giucagon; pane; dlag 

422002; X70070: Hs.1 10642; neurotensin receptor 1 (higti a; colon; mAb 

422027; AL043100; Hs^8828; fatty add amkte hydrolase: pros; 8.m. 

422033; AW24580S; Hs.110903; daudln 5 (transmefflbiane preta; gOo; mAb 
^ . 422087; X56968; Hs.1 11301; matrfx melalloproleinase 2 (ge; sarc; dlag 
20 422089; AAS23172; Hs.103135; ESTs. WeaHy simllaf to SFR4_H; pros; drag 

422094; AF129535; H3.272027; F-box only protein 5; blad, lung; CTL-»s.m. 

422095; At866872; Hs282B04; hypothetical protein FU22704; lung, pane, ovar, breast; CTL-»s.m. 

422099; AA156022; Hs.111518: hypotheficat protebi; angkr, GTL-tsja 

422100; AI096988; Hs.111554; ADP^toylatibn factor^ 7; hmg. esoph; CTL-sja 
25 4221 10; A1376736; Hs.121555; secioted pMn, addic, cyst; pane; diag 

422119; AI277629; Hs.1 11862; KlAA0S90gene product; blad; diag 

422134; AW17g019: H3.112110; mltochondiial ribosomal p^ote^. lung; diag 

422152; AA909249; H8.112282; solute carrier family 30 (zinc; blad; mAb^em 

422163; AF027208;H$.112360;prominIn(nK)use)-like1;cokm, breast. fibrD;mA^ 
30 422164; NM_014312; H8.112377; corfic ai thymocyte receptor (; btad; mAb«sm 

422168; AA5d6894; Hs.112408; SlOOcaiciunvbinding protein A; lung, blad, headnk, breast, meta^ esoph, sarc, cerv; CTL-im 

422170; AI791949; H9.112432: anti-Muilerlan hormone; uter. blad; dag 

422173; BE38S828: Hs^19; phorboSiHike protein MOS019 ; mela; diag 

422247; U18244; Hs.113602: sohite carrier family 1 (Ngh ; blad; mAb 
3 5 422278; AF072873; H5.1 1421 8; fttzzled (Dmsophlla} homokjg ; ovar, headnk, Uad. cerv, lung, pane, stem; mAb 

422309; U79745; Hs.1 14924; solute carrier family 16 (mono; mela; mAtm.m. 

42231 1; AF073515; Hs.114948; cytokine receptor-Cke factor ; lung, fibro; diag 

422355: AW403724: H3.300697; co^ulaUon foctor Vil (serum ; nbro; diag 

422363; T55979: Hs.1 1 5474; repllcatkin factor C(act]vato: mela. ooton; diag 
40 422398; AI476149; Hs.334489; hypothetteal protein aJ21992; tavo; CTL-»&m. 

422406; AF025441: Hs.l 16205; Opa4nterac(ing protein 5: Uad, hjng; diag 

422420; UD3398; Hs.1524; tumor necrosis factor (ligand); colon, pane, stom, leuk; mAb 

422423; AF283777: K5.1 16481; CD72 antigen; spleen, leuk; mAb 
. - 422440; NM-004812; Hs.116724: aidototo reductase family 1, ; lung, headnk; sm 
4S 422487; AJ010901; Hs.198267; mudn 4. tiachaobronchial; lung, headnk, pane, sarc; diag 

422532: AL008726; Hs.11B126; protecSve protein for beta^a; renal, mela; &m. 

422565; BE259035; H8.118400; singed (OrosophOaH'to (sea ; pane, test mela; diag 

422573; AW297985; Ks.295726; integrin. alpha V (vitronecG; pane; mAb*s.m. 

422596; AF063611; H8.118533: 2-S'<oIIgoadenylate synthetas; esoph, mela; sja 
SO 422603; BE242687; Hs.118651; hematopoletk^dqy expressed ho; angio; CTL'^m 

422833; X56832; Hs.118804; enolase 3, (bebi, imtsde): aaro; sjn. 

422658; AF231981; Hs.250175; homolog of yeast long chain po; pros; dlag 

422689; AW856665; Hs.299797; gb:RC3^0297-29010(M)13^03 C; test; dlag 
. 422726; U11690; Hs.1572; faciogenitat dysplasia (Aarsko; test; <Sag 
5 J 422728; AW937826; H3.103262; MAD (mothers agtotdecapienta; pros; dlag 

422789; AK001113; HS.120B42: hypothetka) protein nj102S1: test CrL-*fijn. 

422835; BE218705; Hs.121378: metano9ik}nein^ 5, tesQs; breast; di^ 

422871; A1J031228; Hs.12150g; coOagen, type XI. atphaZ: sarc; diag 

422887; AI751848: Hs.49215; ESTs; sare; CTL-em 
dO 422938; (01.001809; Nil 594; cefltromere protein A (17kO); lung, lest (771*900. 

422963; M79141; Hs.13234: ESTs; lung, pane; diag 

422997; 6E018212; Ks.122908; DNA repHcaSon factor; test; CTL-»s.m. 

423017; AW178761; Hs.227948; serine (or cysteine) proteinas; blad, headnk, mela; mAb*<Ilag 
^ - 4230S2; M28214; Ks.123072; RAB3B, member RAS oncogene fam; pros; dlag 
65 423189; M59371; Hs.171S96: EphA2;coh)n. ovar; mAb 

423196; AK001866: Hs.125139: hypothetical prot^ FU11004; 6bro; CTL^.m. 

423198; M8ig33; H8.1634; ceU division cycle 2SA; test; CTl-Hum. 

423201; NM.000163; H3.125180; grovtrth hormone receptor, pros; mAb 
_ - 423217; NM-000094: Hs.1640; collagen, type Vli. alpha 1 (e; lung, esoph; diag 
70 423271; W47225; Ks.126256; intarteukin 1. beta; blad. stom. esoph; diag 

423309; 6E006775; Hs.126782; sushi^opeat protein; iung, colon; diag 

423354; A601 1130; Hs.1 27436; caldum channel, voltage-depen; test, fibro; mAb 

423387; A)012074; H3.348500; vasoacQve hteslinal pep6de ; pros: mAb 

423397; NM_001838;Ks.1652;cherrtcklne((>C motif) receptor: biad,fnela; mAb 
75 423412; AF109300:H5.351615; prostata cancer assodatsd pro; pros; dlag 

423422; AC005175: H3.128425; NY-REN.24 anfigen; gSo; mAb4CTL 

423445: Nl\1.014324; Ks.128749: alphMnethytacyVCoA raoemasa; pros; sm 

423453: AW450737: Hs.128791: CGI^ protein: hmg; CTL-^m. 

423458; AI204212: Hs.351113: ESTs; test; CTL*s.m. 
oO 42351 1; AF036329; H5.1 29715; gonadotropln-releasing hormone; lung: dlag 

423515; AA327017: Hs.176594; ESTs; ovar, dlag 

423541; AA296922; H8.129778; serine protaasa InhOdtor, Kaz; coloa, pane; dl^ 

423575; (^8863; Hs.163443;intnn of periostin (OSF-2D8); headnk, breast, pane, hmg, Gbro, esoph; diag 
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423605; AF047826;Hs.129887;cadhedn 19. type metac inAb 

42384% AW452650: Hs.157148; hypolhetica) pratetn MGC13204; lung; <S3q 

423662: AKD01035; Hs.130881; B<eB CUyiymphoma 1 lA 

423685; BE3K)494; Hs,49753; uveal autoanflgen with (xM ; pane. uter. coton; CR^ja 
D 423725; AJ403108; Hs.132127; hypo(heScal pmlsSn LOC57822; lung, headnk, blad; (fiao 

423739; AA398155; H3.97600: ESTs; breast, war. pane; dlag 

423761; NM_006194; Hs.132576; pdred box gene 9; headnk; CTL+sjn. 

423765: R23858; Hs.143375; Homo sapiens, done IMAGE:3840: test diag 

423778; Y09267; Hs.l32821; Bavki containing nwKXwygenas; fibro; sm 
1 U 423779; AW071B37; Hs.67971; TRANSCRIPTION FACTOR HEW; gflo; diag 

423787; AJ295745: Hs^204; nuctear pora complex protein; test, esoph; diag 

423798; AF047033; H3.132904; sotule canter family 4, sodlu; angta; mAb 

423799; AW026300; H8,132908: 19A24 proteki; mela; mAb 

423849; AL157425; Hs.133315: Homo sapiens m5^ cDNA DKFZp7^ lung; diag 
1 5 423887; AL08O207; Hs.134585; DKFZP434G232 protein; headnk, lung; dlag 

423899; NM-001427; Hs,134989; engrailed homolog 2; mela; Ca<»sjn. 

423905; AW579960: Hs.135150; tung ^ ceO membranes; test; mAb 

423909; AJ223183; Hs.135194: Inmunogtobulln ai^ierfemOy, me; test dtag 

423934: U89995; Hs.159234; Ibrkhead box El (thyroid trans; tung; CTLi&m. 
/U 424001: W67883; fte. 137476; patemaDy expressed 10; breast ovan dteg 

424012; AW3683n; Hs.137569; tumor protein 63 kDa wKh stro; hing, Ijlad. headnk, esopti; dlag 

424036; AA770688; Hs.348495; H2A histone fantliy. member I; pane. ovar. CTL^m. 

424054; AA334511; Hs.26838: membran&spanning 4-4lonu!RS. s; pros, fibrat mAb 

424078; AB006625; Hs.13S033t paternally escpressed 3; ovar, oten Cn^.fn. 
25 424081; NM-006413; Ks.139120; fibonudease P (30kD): test; s.m. 

424098: AF077374; Ks.139322: smaD prollnMich protdn 3; lung, blad. headnk, esoph. ceiv; dteg 

424125: M31669; Hs.173S; inhiliin, beta B (adhrin AO be; ovv. pros;di^ 

424144; AA454033; Hs.41644: AKAP-«ssoclaled spemi protdn; fibio; dlag 
^ _ 424163; AA451737; Hs.141496; MAGE^ike 2; mela; CTL-»9.m. 
3U 424165: AW582904; Hs.142255; Islet amyloid polypepSde; pane; mAb 

424212; NNC0D5814: Hs.143131: glycoprotein A33 (transmembran; colon, stom, ovan mAb 

424218; AF031624; Hs.143212; cystatin F (leiikocystaSn); mela. (ibro; diag 

424244; AV647184; H8.143601; hypothetical protein hCU-lso; blad; diag 

424252; AK000520; Hs,14381 1; hypothetical protein FU20513: cdon. stom; dlag 
i D 424264; 080400; Ks^39388; Human DMA sequence from done ; blad; mAb 

424308; AW975531: H3.154443: mintchromosome maintenance de^ t)lad, hmg. test; diag 

424310; AA33B648; Ks.50334; testes devetopment-ratated NYO; fibro; disQ 

424332; AA33B919: Hs.101615; ESTs; pros; diag 

424339; BE257148: Hs.145416: endoglycan; pros, lung; diag 
4U 424343; AW956360;Hs.4748: adenylate cydaseac6vatiRgp:gno. ovar, uter; mAb 

424384; ^W383228: Hs.1&3834; ESTs, Weakly similar to G01763; hmg. i)lad. headnk. car. diag 

424399; AI905687; Hs.348419; AI905687:IL-BT095>1 9019M)19 B; breast, uter. headnk; diag 

424420; BE614743; Ks.146688; prostaglandin E synthase; lung. Uad; 8.m. 
. _ 424440; AA340743; Hs.133208; ESTs; saic; dlag 
45 424441: X14850: Hs.147097; H2A histone fenBy, member X; lung; diag 

424450; AL137526; Ks.147472; dynein tnlemiediate chain 2; fibro; diag 

42452% AL134847; K5.149957; ribosomal protein S6 kinase, 9; breast, cerv. ovar. uter, blad, colon, stom; 9,(n. 

424527; AW1 38558; Hs,334873; ESTs. Weakly similar to 154374; fibro; diag 

424578; AK001973; Ns.150890; hypothetksl protein; test; CTL^m 
5U 424581; M62062: Hs.150917; cslenln (cadhartn-asscdated p; glto^ ovar. uten mAtH&m. 

424586: NM_003401: Hs.150930: Xf^ repdr complementing defi pane; CTL«sjn. 

424629; M90856: H5.151393; gIutamatfr«y8tBlne Rgase. cat; lung; CTL^m 

424635; AA420687; Hs.115455: Homo sapiens cDNA FU142S9 fis; gllo; (fiag 

424676; Y08565;Ks.151678;UDP44^1yMpha.D^aIaBtosa; breast; am 
5 5 424704; AI263293; Hs.1 S209Gi eytochrome P450. subfamily tU; ren^ am 

42471 1; NMJ005795; Hs.152175; caldtonin receptor-like: angk); mAb 

424717; H03754; Hs.152213: winglessHype MMW integralton; Uad. lung. heaAik; diag 

424800; AU)35588; Hs.153203; MyoD <amily Inhibitor; test, pros; diag 

424806; AA382523: H&105689: MSTP031 protein; angto; mAb 
OU 424834; AK001432: Hs.153408; Homo sapiens cONA FU10570 lis; tung, Uad, ovar. headnk. esop^ cerv, ulen dien 

424846; AU077324: Hs.1632: neuropeptide Y; pros; diag 

424897; 063216; Hs.153684: frizzied^ated protehi; pane, EWS. stom. renal; diag 

424902: NM.003866; Hs.153687; hwsitol potyphosphate^hosp; pane. letA, meia; CTL-t8.m. 

424954; NMJX)0546; H3.1846; himor protein p53 (U^raumenl; mela, colon: CTL-»6ja 
05 424971; AA479005; Hs. 15403G; hmior suppressing subtransfera; pane, mda; CTt*sjn. 

424998: 1^6515: Hs.154138; chiSnase ^ 2; gib; dlag 

425023; AWS56889;H5.154210;ED&1 (endotheTial difierenti; angio; mAb 

425048; H05468; Hs.164502; ESTs; hing. Uad; diag 
_ _ 425057; AA826434: H3.1619; adtaetMcute eomplax (Drosoph; gBo, lung; CTL-«Sin. 
70 425088; AA663372: Hs.l69395: hypothetical protahi FU12015; gib. mela; dlag 

425154; NM.001851; Hs.1548S0: cdl^en, type tX, alpha 1; saic; d^ 

425159; NM_004341; 83,154888; carbamoylfhosphate synthetase; lung, test; am 

425200; BE255203.comp; Hs.155101; ATP synthase. H transporting, ; pane; 8.mi 

425206; NM_002153; H3.155109; hydroxysterdd (IT-beta) dehyd; Wad; mAb 
/ 5 425211; M18667; Hs.1867: progastiicsin {pepsinogen Q; fibro, esoph, pros; dlag 

425234; AW152225; Hs.165909; ESTs. Weakly similar to 138022; hmg. angb^ Uad, mels; diag 

425235; AA353113; Ks.105468; Homo sapiens cONA: FU22743 fi; angto; diag 

425237; U07695; Hs.155227; EphB4: test; mAb 
^ - 425245; AI751768; Hs.155314; KIAA0095 gene produd; hjng; diag 
SO 425259; AL049260; H3.145010; Homo sapiens mRNA; cONA DKFZp5; pros; dl^ 

425262; 087119; Hs.155418. GS3955 protein; mel^ rend; CTL-»«jn. 

425266; J00077: H9.155421: alphalBtopratein; hmg; diag 

42S274; BE281191; Ks.155462; minlchiomosame mafailenance daft lest diag 
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42S2B% AW13934% HS.1S5530; Interfenxv gamma4rahJ^ 

425308; M97639: Hs.155S8S; receptor tynslneldnasa-lilcs: pms, sarc; inAb 

425357; 6E271 188; H3.155975: protein tyrosine phosphatase. ; Gbro: diag 

425371; D49441; Ks.155981; mesoSierm; ovar. lung. Glvo; mAb 
5 425397; J04088; Hs.156346; topolsomerase PNA) II alpha C lung, Uad, pane, angio. test mela, esoph; CTL-»6jn. 

4^427: AI85266% Hs.317432; branched chain amlnotransferas; test: sjil 

425428;AL110281;Hs.157211:DKFZPS88B0821 protdn; para:; diag 

425465; L18964; Hs.t904; protein kinase C, lota; ovar, pros, coton; SJiv 
, _ 425525; AA3588a3: Hs.23871; ESTs: sarc; dlag 
1 0 425545; N98529; Hs.15a295; Homo sapiens, done M6C:12401.; sarc; d^ 

425555; AA359291; Hs.130767; Homo sapiens cONA: FU23553 fi; fibro; CTlisia 

425572; AB01 1076; Ks.158307; undiSerenSated entxyonic eel; test* CTl^sjn. 

425601; AW629485; Hs.140720; GSK-3 binding protein FRAT2: test; CTL-*&nL 

425606; U521 12; Hs.158331; renln-Unding protein; mela; diag 
1 5 425628; NM.004476; Hs.191 5; folate hydrolase (prostat&«pe: pros; s m 

425638; NM_012337; Hs.158450: nasopharyng^ epilheiium speo; fibro; Cn.<»&fn. 

425679; X05997; Hs.159177; lipase, gastric; esoph; sm 

425692; D90041; Hs.155956; M^tyltransferase 1 (arytami; breast; sm 

425695; NM.005401; Hs.15g238; protein tyrosine phosphatase, ; lung; mAb«6jn. 
20 425709;AA383076;K3.159274:outerden5efibreofspenntal;test;diag 

425710; AFO3O880: Ks.1 59275; sotuts carrier family, memtier ; pros; mAb 

425722; AI659076; Hs.97031; hypolhelica] protein MGC13047; mela; diag 

425726; AF085808; Hs.159330e uroplddn 3; pros, blad; diag 
^ _ 425769; U72S13: Hs.159486: Human RPL13-2 pseudogene mRNA.; test; diag 
25 425811; AL039104: H3.159557; karyopherin alpha 2 (RAG cohor. test; diag 

425849; AJ000512; Hs.296323; senim/glucocorficokl regulated; lung, headnk; sm 

425921; NM_007231; Hs.16221 1; sohits carrter famOy 6 (neuro; stem, pane; mAb 

425976; C75094; H5^14; NG22 proton; pros, ovan mAb 
^ - 426027; NM_002608; H5.1976; plateleMerived growth facton sarc; diag 
30 426050; AF017307; Hs.166096: E744ito factor 3 (ets domain ; ovar, Uad. stom; CTL-«sm 

426059; BE292842; Hs.1661 20; interferon regulatory factor 7; esoph, cenr; CTL«s.m. 

426067; AW664691; Hs.97053: ESTs; lung; diag 

426088; AF038007; Hs.166196; ATPase, Dass I, type 8B, memb; blad, hing; mAb 
. - 426094; AF034611; Hs.166206; cublfin (Intrinsic factor-eoba; renal; diag 
35 426116; AA868729; Ks.144694; ESTs; fibro; diag 

426125; X87241; Hs.166994; FAT tumor suppressor (prosoj^; colOn, stom, pane, pros, renal, fibro, ceiv; mAb 

426156; 6E244537; Hs. 167382; natriuretic peptide receptor A; ovar; mAb ■ 

426158; NM.001 982; Hs.199067; v-erb-b2 avian erythrofalasik! ; breast, ovar, uter, pros. Uad, pane, colon, fibro, mela; diag 

426172; AA371307; Hs.125056; ESTs; pros; dlag 
40 426174; AA547959; Hs.11 5838; Homo sapiens simitar to Echino; breast, pros, fibro; diag 

426212; S71824; Hs.167988; neural cell adheskin molecute ; gTio; mAb 

426271: AF026547; H3.169047; chondroHIn sulfate proteoglyc; gHo; dlag 

426274; D38122; Hs.2007; tumor necrosis factor (Ugand): fibro, mela; mAb 

426300; U1 5979; K5.169228; detta^itoe homotog (DrosophUa; ovar. sarc; mAb 
45 426310; Wii.000909; Hs.169266; neuropepSde Y receptor Y1; breast; mAb 

426312; AF026939: Hs.181874; interferon-lnduced protein wH; eso^, mela; diag 

426320; W47595: Hs.169300; faansfbrming gravrth factor, be; ovar, pros, Uad, pane; diag 

426350; NM_003245; Hs.2022; transglutaminase 3 (E polypept; cenr; sjn. 

- . 426363; M58524; Hs.2025; transfonning growth factor, be; pros; diag 
50 426370; R98288; Hs.281706: sortOln 1; sarc; dl^ 

42641 6; AW612744: Hs.169B24: IdQerceO lecfiiMiks recept; fibre; mAb 

426440; BE382756; Hs.169902; solute canter family 2 ffadSi hnQ. pane. ovar. Uad. headnk, esoph; mAb 
426462; U5911 1; Hs.169993; dermatan sulphate proteoglycan; sarc; dlag 

- - 40470; AA528794; Hs.128644: ESTs; mela; diag 

55 426471; 11^440: Hs.17000%tFBnsli9rm[riggiDwtti factor, ai; headnk, 

426491^ Nlil.001621; H3.170087; aiyl hydrocarbon reoepton pane; m^Hsm 

426501; AW0437K^ Hs.293616; ESTs; pros, breast, glio, hing, mela; mAb 

42650% Y07759; Hs.170157; myosin VA (heavy potypepOde 1 ; mela; dlag 

426534; U58098; Hs.2051: tesQs spedfic protein. Y-lin; test; ca^m 
OO 42653S; AU077012: Hs.170279; ESTs. WeaMy sbnSar to oUquI; angto; diag 

426555; NMJ)00372: Hs.2053; tyrosinase (ocutocutaneous db; meta, sarc; mAb 

426559; AB00191 4; Hs.170414; paired basic ammo add deavl; hepC breast over, rend; diag 

426575; M74826; Hs.170808; glutamate decarboxylase 2 (pan; pane; sm 
^ - 4266Z7; AF012359; Hs.195685; ESTs; test diag 
05 426635; BE395109: Ks.129327; hypothetical protdn MGC13057; ovan CTL-*5.m. 

426682; AV660038; Hs.2056; UDP glyoosyttransferase 1 fami; blad, tung; sja 

426691; N1A.006201; Hs.171834: PCTAIRE protein kinase 1; ovar; C\l^.nu 

426696; AW363332; Hs.171844; Homo salens cONA: FU22298 fi; angb; mAb 

- - 426721; AA383S88; Hs.288545; ESTs, Weakly similar to T2g012; fibro; diag 
70 426726; AA488915; HS.1719S5; trophinfai assodated protein (; test; diag 

426747; AA535210; Ks.171995; kaflikreln 3, (prostate spedf; pros; diag 

426752; X69490; Hs.172004; titin; sarc; diag 

426759; AI590401; H5^1213; ESTs; mela; dlag 
_ _ 426793; X89887; Hs.172350; HIR (hlstone ceO cyde regite; pros; CTL^em 
75 426828; NM.000020: H3.172670; acGvln A receptor type ll^ik; angto; mAb 

426866; U02330; Ks.172Bt6; neureguQn 1; esoph; C\l*^ 

426897; AW976570; Hs.97387; ESTs; lung; diag 

426900; AW163564; Hs.142375; ESTs; Uad, pros; mAb 
„ - 426935; NM-OOOOSB; Hs.172928; collagen, type I, alpha 1; test, saic; CTL^sm 
80 426986; AI493134; H3.349204; sderostin: hing; diag 

426968; U07616; H3.173034; amphiphysin (Stiff-Mann syndro; blad; mAbKITL 

426991: AK001536; Hsjn4410; Homo S8|ton cONA FU10674 fis; ovar. saro; di^ 

427080; AWD68287t Hs^l 175; ras-ralaled (3 botuflnum toxin; mela^ 
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427099; AB03295$ Hs.17356a c(M QzAoHQ homoloo 2 (Dr^ 

4271 22: AW057736: Hs^391 0; HER2 receptor lyroshe Unase ; breasl; mAb 

427244: AMOZm K5.t78045: ESTs; esoph: dlag 
- 427250: AA663848; ; gb:ae70b06^1 StrabSsne schic lung; diag 
J 427274; NM_00521 1; Hs.174142; colony sfinfmlaflng factor t re; p«)s» saw; mAb 

427298: AA40049$; : ESTs; test; diag 

427318^ AF18&081: Hs.175783; zinc transporten pros; mAb 

427333; AF067797; Hs.176658; aquaporin 8; pane, coJon; mAb 

427344; NM.000869; Hs.2142; S^iydroxytiyptamlne (serotonin; ovan mAb 
1 U 427356; AW023482: Hs.97849; ESTs; over, breast pns, blad. lung; diag 

427398: AW390020: Hs.2a41S: chromosome 21 open reading fra; pros; diag 

427427: AF077345; Hs.177936; lectin, superfamBy member 1 (; breast; (Sag 

427441; AA412605: H5.293266; SPANX family, member C; hni, esoph; CU^wn. 

4274S1; AA531527; Hs.332040: hypotheOcal pnjteln MGC13010; pros; mAb 
1 D 427474; U13192; Hs.2159: aggrecan 1 (chondroitin sulfat; sare; iSag 

427466; AA974433; Ks.362432; (ibroblasl grewth factor 4 (tie; test; diag 

427510; Z47542; Hs.179312: smaQ nuclear RKA activaOng c; lung; GTL^ 

427515; T79S26; Hs.179516; Integral type I protein; pros: dlag 

427521; AW973352:: ESTs: lest diag 
Z\J 427528; AU077143; Hs.1 79565: minlctuomosome maintenance def; mela; CTl'»s.m. 

427535; R29543; Hs.2164: pro-p!aleIet basic protein (In; fibre; di^ 

427546: AA40S280; Hs.36793; hypothefJcal protein f^J23188; lung; di^ 

427550: BE242818; Ks.31 1609; nuclear RNA heScase. DECO van mela; CTL^ 

427557; NK^002659; Hs.179657; plasminogen activator, urekina: pane, cohun atom, ovar, cerv, blad. lung, headrtk, esoph: mAb 
25 427578; A1591305; Hs.169084: ESTs, Highly slmila- to TUU_H; lest; di^ 
427583; M82962: Hs.179704; meprin A. alpha (PABA peptide ; coion; mAb 
427584: BE410293; Hs.179718; v-myb avian myeiobtastosis vii; test; ca->$ja 

427585; 031 152; H8.179729: coOagen, type X. alpha 1 (Sch; breast, lung, headnk. panc^ colon, ovar. oeiv. sare; diag 
_ ^ 427615; BE410107: Hs.179817; CGI^ proldn. PSDRI; pros; di&g 
3U 427634: A1399745; Hs.18449; hypothetical protein MGC10820; mela, sarc; diag 

427647; W19744: Hs.180059; Homo sapiens cDNA FU20653 fis; sarc: dlag 

427666; At791495; Hs.180142; calmodulin^tlce sidn protein (; breast, cerv, blad. hing. headnl^. esoph; diag 

427667; AK001279: Hs.180171; Homo sapiens cONA FU10417 fls; test; diag 

427668; AA298760; Hs.180191; hypothetical protein FU14904; hng. test; dlag 
J J 427681; ABQ18263; H3.284232; himor necrosis factor receptor; ovan mAb^sjn. 

427698; AW972594; Hs.335499; ESTs; fibre? CTL^m 

427701; AA411101; Hs.243886; nuclear autoantlgenic sperm pn lung; mAb^ 

427715; BE245Z74; Ha.180428; KIAA1161 protein; pros; di^ 

42771ft AI393122: Hs.134726: ESTs; test Wad: diag 
4U 427730; AW250549; Hs.180S77; granutin; mela; diag 

427786; BE407863: H3.256871; ESTs; esoph, blad; dlag 

427809; M26380; Hs.180878; Gpoprotein Upase; ovar; mAb 

427811; M81057; Hs.180884; carfaoxypepSdase B1 ((issue); breast; sm 

427897; NM.017413; Hs,303084; apelin; peptide llgand for APJ; angto, renal, pros; diag 

4 J 427912; AL022310; Hs.181097: tumor necrosis factor (Hgand); anglo! mAb 

427958; AA416000: Hs.376771: potassium Intemiediatefemali c; pros, gtio; mAb 

427961; AW293165: Hs,143134; ESTs; lung, sarc; dlag 

426001; H97428; Hs^19907; ESTs. Moderately similar b Tr; mela; dlag 

428004: AA449563; Hs.151393; glutamate<ystelne ligase, cat; lung; sm 
D\) 428023; AL038843; Hs.374530: Homo sapiens cDNA: FU23602 H; lung; dlag 

428046; AW812795; H8.337634; ESTs. Moderately similar to 13; hmg, colon ; dian 

428062; AA420683: m.98321; hypothetical protein FU14103; angle; dlag 

428087; AA100673; Hs.182421; troponin C2, fast; sarc: CTL^.m. 
_ - 428141; D50402: Hs.182611; solute canierfamny 11 (prot; glio; mAb 

428153; AW513143;Hs,98367;SRY (sex determining region Y); ovar; diag 

428169; AI928984; Hs.182793; gotgi phosphoprotein 2; pros; diag 

428182; BE386042; Hs.293317; ESTs, Weekly similar to 6GC1_H; Wad, headnk, hma ovar, sare; CTL^sjn. 

428183; AW969726; Hs,98381; ESTa, Weakly similar to serine; EWS; diag 

428206; AB020643; Hs.183006: KIAA0836 protein; angio; mAb 
OU 428221; U96781; Hs.183075; ATPase. Ca transpoffing, cardt; sanr, 8ja 

428227; AA321 649; Hs.2248: small bidudWe cytokine subfa; breast, lung, blad, ovar, headnk, fibro, coton. store, cerv, buk. renal, test, mela, esoph. hepC; diag 

428248; All 26772; Hs.40479; ESTs; sarc; diag r- r 

428293; BE250944; H5.183556; solute canierfanfly 1 (neuir; pros; mAb 
- 428305; AA446628; Hs.2799; cartilage linking protein 1; sarc; dag 

05 42832% AA42G091 ; Hs.98453: ESTs, Moderately similar to R2; test; diag 
428336; AAS03115; Ks.18375% ndcroseminoprotein. beta-; pros; dl^ 
428355; BE2S6452; Ks.2257; vitnonectln (senim spreading t ooton; Sas 
428398; Ai249368; Hs.98558; ESTe; pros, breast ; diag 

„ 428405; Y00762;H5.2266:chQl]nergk: receptor, ntooHni; esoph, saro; mAb 
7U 428423; AU076517; Ks.184276: solute carrier f^ 9 (sodhi; ovar; CTL4Sjn. 

428434; AW383590; Hs.65551 : Homo sapiens. StmOar to OKA s; hing, to; diag 

428467; AK002121; Hs.184465; hypothetical protein RJ 11 259; to; mAb 

428471; X57348; Hs.184510; stratifin; lung, headnk, colon, pane; diag 

428645; AA431400; Hs.98729; ESTs, WeaWy similar to 201720; lung; sjn. 
fj 428651: AFig6478; Ks.188401: annexbi A10; Wad, stem, pane; dlag 

428667; AI375550; Hs.346868: nucleolar protein p40: homotog; to, utei; dlag 

428722; U76456: H5.190787; tissue InNWtor of metallopro; glk); diag 

428728; NM_016625; Ks.191381; hypolheticsl protebi; ovar, lung. BPH; CTL^sm 
. . 428771; AB028992; Hs.193143; taAA1069 protein; tong; CTL^ 
oU 428784; Y128S1; Hs.193470; purtnerglc receptor P2X, Igan; gHa mela; mAb 

42880ft M57527: Hs.193717: InterieuMn 1ft fibro; diag 

428801; AW277121: Ks^BBI; ESTs; pros; dl^ 

428804; AK000713; Hs.193738: hypothetical pralebi HJ20706; mel^ (flag 
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428810: AF068236; Hs. 193788; ni!rto oxidB synthase 2A (indu: tung; sm 
426819; A1.135623; H3.193914: KIMOSTSgene pioducl; pros; CTL-»&m. 
426824; W23624: Hs.173059; ESTs; pane; (Hag 
. 428832:AA578229;Ks.324239:EST8,ftotfer^siinlartoZri;panc,utBnda^ 
5 428841; A)41843Q; Hs.104935; ESTs; renal; diag 

428848: NM.000230; Hs.194236; tepSn (murine obesity homolog; saic; (Sag 

428862: NhL000346; Hs.2316: SRY (sex detenninlns region Y): pros, saic; GTL^em 

428927; AM41 837: HSw902S(): Homo saptetis hypolheScd prat; 
, . 428928: BE4()9838;H3.194657:cadherbi1. type 1.E<adherin: pros, breast. stD^ 
1 0 428949; AA442153: Hs.104744; tiypothetea! protein DKFZp434J; saic; dlag 

428957: NM.003681: Hs.194679: Wt^l Inducible signaling pathw: cerv; dlag 

428959: AF100779; Hs.194880; WNTI bidudble sign^g pathw; saic; diag 

4269n; AK001404: Hs.194698: cycfo B2: test: CTL^m. 

428981: BE313077; Hs.93135; ESTs. WeaMy similar to ALU^.H: saic; diag 
1 5 429002; AW248439: Hs.2340: junction plakogtobin; Uad: CTl^m. 

429010; Y18198; Hs.194725; one cut domdn, famDy member ; pane; (Sag 

429038; AL023513; Hs.194766; seizure reiated gene 6 (mouse); lung; mAb 

429058; AF138863; Hs^54; hypothetical protein Flfi8421; esoph; diag 

429065; AI753247;Hs.29643; Homo sapiens cONAFUl 3103 fis;hing: diag 
20 429083; Y09397; Hs.227817; BCl2-feiated protein A1; mela; dlag 

429113; D28235; Hs.196364; prostagtandin^ndoperoxide syn; angb, biad, stom; sjn. 

429120; AK001673; Hs.196530; hypolheOcal protein FU 10811; lest; diag 

429150; AF120103; Hs.197366: smoothened (Orosophila) homob; over; mAb 

429163; AA884766; ; gb:am20a10.8l Soares^NFl^T^GBC; pros; diag 
25 429170; NML001 394; HS.23S9; dual specificity phosphatase 4; breast, pancstom, hn^ mda; 8.nt 

429201; X03178; Hs.198248; group^pedfic component (vita; pane; dfcag 

429220; AW207206; Hs.356962: ESTs; breast, pros. BPH; dlag 

429228; A1553633; Hs.356828; ESTs; hing. fibro. headnk, esoph; diag 

429259; AA420450: Ks.380088: Plaicophirm; hmg, headnlq diag 
30 42929Q( AF203Q32; Ks.198760; neurofilament, heavy polypepfi; pros; CTl-^sm 

42929% At620463; H$.347408; hypothetical protein MGC13102; pros, cerv; diag 

429329; AA456140; Hs.99235; Homo sapiens pannexin 3 (PANX3; saro; mAb 

429345; R11141; Hs.199695; hypothetical protein; Wad; dlag 

429359; W00462; Hs.2399; matrix metalloproteinse 14 (m; headnic, breast, oenr. ovar. Nad. lung, esoph. mela. saic; mAb 
35 429413; NiyL014058; 85.201877; DESC1 protein; lung, btad; diag 

429415; NM.002S93; Hs.202097: procOOagen C«idopeptidase en; sarc; dlag 

429423; At016712; Hs.380983; bitegrin. beta 1 (Cbronectin ; anglo; mAb 

429432; AI676059; H5.202676; synaptonemal complex protein 2; breast, cenr. dtag 

429441; AJ224172: Hs.204096; Itpophilin B (uteroglobtn fami; breast pros, ovan diag 
40 429466; M85835: Hs.12827; ESTs; glio. uter, CTL^ 

429469; M64590; Hs.380791: glycine dehydrogenase (decarbo: test: 8.ia 

429486; AF155827; Hs.203963; hypothefical protein FU10339; blad, hmg. headnit, test; (Sag 

429500; X78565: Hs^891 14; hexabrachion (tenasdn C, cyto; lung. gllo. headnlt. mela. sarc; diag 

429504; X99133; Hs^238: lipocaTm 2 (oncogene 24p3) (N; ovar. hmg, blad; diag 
4S 429505: AW620035; H8.276679; a dislntegrin and metalloprote; colon, leuk; mAb 

429538: BE182S92; H5.139322: smaH proOnMlch protein 2A: lung, esoph; diag 

429563; BE619413: Hs^437; euVaryotic transtaton IniQat; tung; diag 

429586; T73510; Hs^1S3; angiopoletin^ilffl 3; hepQ (JTl^SJn. 
_ _ 429597; NM»003816; Hs.2442; a dislntegrin and metaHoprote; pane, colon, stom. hjng; mAb 
SO 429609; AR)02246; H3.210863; oeO adhesbn molecule with ho; ovar. mata; diag 

429812: AF062649; H3.252567: (iintafy tumor^ransfbrrT^g 1; tung, blad. headnk; diag 

429655; U48959; Hs.21 1562; myosin. light polypeptide Idna; pros; sm 

429663; M68874; H&211587; phosphollpase A2. group IVA (c; angio. hntg; sm 
. _ 429664; 120433; Hs,21 1588; POU domain, class 4, transcrip; sarc; CTL^sm 
55 429736; AF125304; H&21268Q: tumor necrosis factor receptor, hmg; mAb 

429747; M87S07; H3.2490; caspase 1, apoptosis-ieialed c; colon, stom^ Gbro; sjn. 

429764; 6E245076: Hs.216958; KiAA0194 protein; pros; mAb 

429769; NH.004917: Hs.218386; imnin 4 (prostase. enamel; pros; sm 

429784; M89796; H3.30; membrane-spanning 4-domans, s; fibro; mAb 
60 429823; AA459443; Hs.181400; ESTs; saro; diag 

429859; NIyL007050; Ks.225952; protein tyrosine phosphatase, ; breast; mAb«8.m. 

429918: AW873986: Hs.119383; ESTs; pros. gOo; diag 

429921; AA526911: Hs.82772: ooOagen, type XI. alpha 1; headnk. pane, saic; CTL 
^. 429983; W92620:Hs260855; ESTs; blad; diag 

05 4299S6;AF092047:H3.227277; sine oculishomeobQx(Drosophl; tung; CTL*s,in. 

430014; H59354; Hs.374303; acOnin. alpha 4; renal; diag 

430016; NM.0(>4738; HS.2Z7656; xeriotropte and poVtrapfc leti: ovar; mA^ 

430044; AA464510; Hs.152812; ESTs; breast, luiq. pane, headiik. om, stom. esoph; diag 
- . 430056; X97548; Hs.22B059: KRAB-assodated protein 1; test; (Hl^ 
70 430129; BE301708; Hs.233955: hypothetical protein FU20401; angio; diag 

430130; AL13731 1 ; Hs.234074; Homo sapiens mRNA; cONA 0KFZp7; pros; mAb 

4301 44; AI732722; Hs.98927; ERa protein; ERaC5»^ PH pros; diag 

4301 52; AB00132S; H8.234642: aquaporin 3; Uad. fibro; mAb 
_ ^ 4301 54; AW583058; Ks^726; serine (or cysteine) proteinas; pros; dlag 
75 430157; BE348708; K3.278543: ESTs; blad; diag 

430168; AWS68343; Hs.145582; Di<FZP434l1735 protein; blad; diag 

430223; NMJ002514; Hs^935; nephroblastoma overexpiessed g; mela; dlag 

430226; BE245562; Ks.2551; adraiergic, betB>2-. receptor.; pros; mAb 
_ ^ 430228; AW9S0939; H$.6382; ESTs. Highly similar to T00391 ; glio; dlag 
80 430252: AI638774; H3.105326; testes development^elated NYD; test; CTL-^sjn. 

430253; AK001514; Hs.236844; hypotheOcal protein FU10652: test; CTL^ 

430255; AK000703; Ks,323822; Homo sapiens mRNA for K1AA1 551; test; CTL^sm 

430259: BE55018% H3.37514% RalGEF-Dke protein 3. mouse h; ovar; CTL^m 
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430280; AA381258; Hs^7868: interteiddn 7 receptor; mela, tung, pane, stom, esoph. h8adnl(,fibro; mAb«&m. 

430287; AW182459; Hs.125759: ESTs, We^ slrnBar to LEUSJt test dteg 

430294; At53S226: Hs.32976; guanine nudeoflde binding pio; pros; (0^ 

430337; M36707; HS.239S00; calmodulirvlikB 3; lung; dag 

430354; AA954B10; H3.239764; human homolog of Drosophlla Sc; ovar; dlag 

430378; Z29572; Ks.2556; tumor necrosis factor receptor. Iung.fibn>, t)re3st headnk, btad. breast cdon. stone (Sag 

430393; 8E18S030: Hs.241305; estrogeiHespQiislve B box proi; hing; dlag 

430398; 04974% Hs^41363; hyaluionan-bMing protsin 2; pane; diag 

430407: H23551: Ks.30974: ESTs; pane; diag 

430439; AL133561 ; Hs^155; DKFZP434B061 pmtein; (ung. test; diag 

430451; AA836472^ Hs.297939; calhepsin B; ovar, lung, headnk, pane, stom; dlag 

430454; AW4e9011: Hs.105635; ESTs; lung; diag 

430476; AA447465; HS.2S03; tachykinin, piecwsor 1 (subst; san; dbg 

430487; 087742; Hs^41552: KIAA0268 protein; pros; diag 

430491; AL109791 ; Hs.241559; Homo sapiens mRNA fuil length ; oven diag 

430498; X02910; Hsi4l570; tumor necrosis factor (TNF sup; leult; diag 

430508; AI015435; (^.104837; ESTs; lung; mAb^.m. 

430521; NM_016383; Hs.242183; HOM-TES^ tumor antigen; lest CTL^m 

430540; AW245422;Hs.106357; Homo sapiens cONA:FU221 05 fi; mela; mAb 

430563; AA481269; Hs.348628; ATP-bindlng cassette. sul>lami; lung; dlag 

430594; AK000790; H3^46885; hypolhettcai protein aJ20783: mela; diag 

430634; AI860851; Hs.26685; calcyphosine; ovar; dlag 

430637; BE160081; Hs.256290; S100 caldunvblnding protein A; mela; diag 

430647; ACC03682; H8.127988; ESTs, Weakly similar to Z21LH; test dlag 

430676; AF084866; Hs.372585; gbiHomo sapiens envelope prate; test diag 

430677; Z26317; Hs.359784; desmogl^ Z lung, colon; mAb 

430678; AI456174; H5.192855; ESTs; lung; diag 

430686; NiyL001942; Hs.2633; desmoglein 1; lung, headnk. mela; mAb 

430691; C14187; HS.1S7208; anstatess-^elated homeobox pn EWS, ovar, pane; diag 

430704; AW813091: H5.335799; ESTs; stom; diag 

430770: AA765694; Hs.123296: ESTs; mela; diag 

430832; AI073913; Ks.100686; ESTs. Weakly simflar to JE0350: breast colon, ovar, uter. lung. stom. fibio; diag 

430838; N46684; Hs.16g3g5; hypolheticai protein FU12015; meia; CTL^m. 

430890; X&4232; Hs^; glypican 1; glio, lung, cerv. blad. esoph; mAb 

430985: AA490232: Ks.27323; ESTs. WeaMy similar to 178885; lung; mAb 

431009; BE14g762; Ks.48956; gap Junction proteh, beta 6 C lung. Uad. headnk. esoph; mAb 

431053; S40369; Hs^49141; Glutamate receptor subunit glio; mAb 

431070; AW4081&4; Hs.249184; transcripQon factor 19 (SCI); Uad; diag 

431089; 8E041395; Hs.374629; ESTs, Weakly simitar to unknow; blad, lung, pn». angto. fibro; diag 

431099; Y13387; Hsi49235; phosphoinositide^kinase. da; pros; CTL^sja 

431103; M57399: Fk.44; pteiotrophtn (heparin binding ; sarc, mela; cfiag 

431124; AF284221; H8.59506; doubtesex and mat>^ related tr; lung; CTL^m 

431151; BE207083; H5,366053; gb:ba10d10.y1 NIH_MGCJ Homos; pros; mAb 

431164; AA493650; Hs.94367; thyroid Iranscriptton fader 1; fibro; CTL'»«.m. 

431163; NM.006855; Hs.250696; KOEl (Lys^p-Qu-Leu] endopla; mela. pros. pane, colon, stom; mAb 

431211; M86849; Hs^733; gap junction protein, beta 2. ; colon, blad, lung, pane, headnk, esoph; mAb 

431217; NM.013427; Hs.2»}830; Rho GTPase activating protein ; pros; CTl^sm 

431221; AA449015; Hs.286145; SRB7 (suppressor of RNA pclyme; tung; ca-»8.m. 

431222; X56777; Hs^73790; zona pelluckia glycoprotein 3A; pros; diag 

431250; BE264649: Hs.251377; taxd resistance associated ge; esoph; diag 

431322; AW970622; H3.376626; gb:EST382704 I^GE resequences.; blad. ovar, uter. dlag 

431347; AI133461; H3.251664; Insulin-Bke growth factor 2 (; blad; mAb«dlag 

431354; 6E046956; Hs.251673; DNA (cytosine-5-)-mefhyltransf. test CTL-«6m 

431360; NhL000427; Hs.251680: torfcrin; mela. sarc; dlag 

431362; AI874223; Hs.293560; ESTs; angle; diag 

431369; 6E184455: Hs.251754; secretory leukocyte protease i; ovar. blad; diag 

431384; BE158000; H5.334372; gb:MR24fr0377.1 50200-202^03 H; hmg; dlag 

431335; BE178536; H3.11090; membrane-spanning 4<domaIns. s; pane; dlag 

431441; UB1961; Ks.2794; socfium channel, nonvottage^at ovar, pros, blad; mAb 

431448; AL137517: Hs.3062O1; hypothetk^a! protein DKFZp5640; blad; mAb 

431457; NM12211; Hs.256297: Integrtn. alpha 11; headnk; mAb 

431474: AL133990: Hs.190642; CEGP1 protein: breast pros, btad; diag 

431494; AA991355: Ks^8312; hypothetfeal protein OKFZp434A; lung; diag 

431512; BE270734; Hs.2795; lactate dehydrogenase A; pane; sjn. 

431548; AI834273; H3.9711; novel protein; lung, angle, pros; diag 

431553; X78075; Hsi799; cartilage linking protein 1; saro; diag 

431578; AWg710B2: Hs222886: ESTs. WeSkfy rimOar toTRHYJt pnis; diag 

431616; AA508552: Hs.222874; ESTs, WaaMy similar to 138022; pros. pane, oohm ; mAb 

431674; AA098901; Hs.301642; G-protein coupled receptor; ovar, niAb«sjn. 

431723; AW058350;Hs.278966; Homo sapiens mRNA; cONADKFZpS; libra; diag 

431728; NM.007351; H8.268107: muiiimerin; angio; diag 

431608; 1M30703: Hs.270833; amphbagulin (schwannoma^edr.braast headnk, pane, colan; dlag 
431838; AF178532; H8.27141 1; bet»slto APP-deaving enzyme ; mela; mAb 
431870; AW449902; Hs.105500; ESTs; renal; di^ 

431690; X17033; Hs^ig86; tntegrin. alpha 2 (CD49B, aiph; blad, headnk. lung, pane, oenr. stom; mAb 

431938; AA938471; Ks.&4431 ; specific granule protein (28 k; pane; diag 

431939; AW008061; Hs^1994; ESTs; renal cotan; diag 

431941; AK000106; Hs^2227; Homo sapiens cONA FU20099 lis; cenr, gDo; diig 

431989; AW972870: Ks.291069; ESTs; ovar; diag 

431992; NM_002742; Hs.2891; protein kinase C. mu; pros, gito; sm 

432004; BE018302; H3.2894; placental growth factor, vascu; renat diag 

432015; AL157504: K$.1S91 15; Homo sapiens mRNA; cONA OKFZpS; Uadi diag 

432023; AW273128; Hs.300268; EST; tung; diag 
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432097; X5173Q; Hs^OS; pregesterons leceptor; Uadt mAb*6jiL 
432117; ALJ036195; Hs.2909; protamine 1; test; CTl^m 
432128: AA127221: Hs.66; ESTs; angto; (Bag 

432141; BE410964; Hs^2736; nudear receptor tiinding prole; test; mAlHSjn. 
J 432189; AA527941; ; gb:nb30c04.s1 Ka_CGAP_Pr3 Horn; pros; diag 
432199; A1693815; Hs.127179; crypOc gene; pane; dag 

432210; A1567421; Hs.273330; Komo sapiens, ckme IMAG&3544; over, tung, t)lad; dag 

43222:^ AI204995; : gb:an03c03jc1 Stmtegene sctiiz; ai^ blad. fibro; dag 

432231; AA339977; Hs.274127: CLST 1 1240 protein; fibro; dag 
1 U 432239; X81334; Hs.2936; mabix metalloprotetnase 13 (c; blad, lung, Iteadnk, esopti, saic; sjn. 

432240; A1894767; Hs.129179; Homo sapiens cOHA FU13581 fis; pies; dag 

432305; M62402; Hs^4313; Insulin-like growth factor bin; cenf; d^ 

432374: W68815; Hs.301885; Homo sapiens cONA FU11346 lis; cenr. lung, fibro, pros; dag 
^ - 432407; AA221036; ; gbzi03f1 2.r1 Stratagene m2 n; lung, testcolon; dag 
15 43241 5; T 1 6971 : ; ESTs, WeaMy sintiar to A43932; ovar. pros; dag 

432432; AA541323; Hs.115831; ESTs; liter, pros; dag 

432435; BE218688; Hs.282070; ESTs; pros. uter. colon, stom. fibro; diag 

432441; AW292425: Hs.163484: btfron of hepatocyte (KKlear blad, Sbra 
. ^ 432473; AI202703: H3.1S2414: ESTs; pros; diag 
20 432481; AW4S1645; Hs.151504: Intron of collagen, type XI, a; sarc; dag 

432512; NM_003284; H5.3017; translfion protein 1 (during h; test; CTL*sjn. 

432519; A122131 1; H$.130704; ESTs. WeaMy similar to BCHUiA; fibro, ovar, titer; Ca-*sm 

432S27; AW97S028; Hs.102754; ESTs; pros, titer, ovar. can ; dag 

432542; AW083920; Hs.16098;claudn 2:colon, pane; dag 
25 432583; AW023624; Hs.162282; potassium channel TASK-4; pota; lung; mAb 

432615; AA557191; Hs.55028; ESTs. Wealdy similar to 154374; pros; dag 

432821; Al2g8501; Hs.21192: ESTs. Weakly sImSar to T46428; pros; mAb 

432629; AW860548; Ks.280658; ESTs; ovar. d% 
^ _ 432653; N52096; Hs.293185; ESTs. Weakly simflar to JC7328; pros, hing ; fnAb^sm 
30 4326S6; AW204069; Hs.351118; ESTs. Weakly similar to unname; test; diag 

432706; NM.013230; Hs.2861 24; CD24 antigen (smaB oea lung ; coton. ovar. pros; mAb*€TL 

432730; A1066520; Hs.131358; ESTs; test; dag 

432731; R31 178; Hs.287820; fibronectin 1; pane, fibro; dag 

432788; AA521091; Hs.178499; Homo sapiens cONA: aJ231 17 B; lung, ovan CTL^. 
35 432800: BE391046; Hs.278362; AIM-1 protein; mela, pros; mAb 

432842; AW674093; Hs.334822; hypotheGcal protein MGC4485; blad. lung, headnk; GTL^&m.' 

432850; X67723; Hs.3110: angiotensin receptor 2 (AT2); tdo; mAb 

432855; AF017988; Hs.279565; secreted fdzzled-related prot; pane; dag 
. ^ 432867; AW016936; Hs.233364; ESTs; stom. colon; dsg 
40 432878; BE386490; H3.279663; Pirin; mela; CTL^sm 

432887; AI926047; Ks.162859; AK056805: Homo sapiens cDNA pros; dag 

432938; T27013; H8.3132; steroldogenk: acute regulatory; ovar. dag 

432966; AA650114; Hs.32519B; ESTs; pros; dag 

433012; NM.004045; Hs^9910; ATX1 (antioxidant protein 1. y; mela; dag 
45 433013; AI697890; Hs.1 27337; axin 2 (oonducSn, adi); ooton; CTl^m 

433043; W57554: HS.12S019: lymphoid nudear protein (iAF-; pros, breast; dag 

433068; NM.006456; Hs^215; siatyttiansferase; brsasl. ovar, mela; sm 

433078; AW015188; Hs.121575; Homo sapiens cDNA FU12231 lis; blad; dag 
- 433091; Y12642; Hs.318S; lymphocyte antigen 6 complex, ; blad. tung, headnk, cenr, mAb 
SO 433147iAF{}91434;Hs.4308(hplatetetdeffvedgiowlhl^ovar.pan^ 

433159; ABOSSSOB: Hs.150587; MneslrHikB protein 2; ovar. uter. coton, bbd; dag 

433170; AB03781 6; H8.8982; K1AA1395; angle; dag 

4331 83; AF231 338; Hs,222024; transcrfptton factor BfJiMZ lung; dag 

433228; F28212; Hs.14953; KIAA1491 protoin; test; GTL^ jn. 
5 5 433258; AI806626; mSSim ESTs. Weakly similar to ALUB_H: hmg; dag 
433285; AW975944; Ks.237395; EST^ breast pros ; diag 

433293; AF007835; Hs.32417; hypothetical protein M6C2742; fibro, pros, stem, pane; CTL-isjn. 

433323: AA80S132; Hs.159142; ESTs; pros; diag 
_ 433334; AI927208;Hs.2319S8; matrix metaltoproteinase 28; pane; 8.m. 
OO 433336; AF017968; H8.31365: secreted frtzzled-relaied prot; ovar. fibro, headnk, hing, pane, bladt dag 

433364; AI075407; Ks.298083; ESTs. Moderately similar to 15; mela; dag 

433365; AF026944; Hs.293797; ESTs; Uad; dag 

433376; A1249351; Hs.74122; caspase 4, apoptosis^iated c; angto; sm 
, 433388; AJ432672; Hs.288539; hypoSietteal protein FU22191 ; ovar; CTL-»6m 
65 433404; T32982; H&352670; Homo sapiens cDKA FU32054 fis; pros; dag 

433437; 1/20536; Hs.3280; caspase 6, apoptosls-related c; fibre, breast cerv, lung, Uad, pane, gfio. coton; sm 

433444; AW975324; Hs.129816; ESTs; pros; dae 

433466; AA508353; Hs.105314; reladn 1 (HI); pros; dag 
_ . 433485; AI493076; Hs.308(»8: aido^ reductase MIy 1. ; lung; sm 
70 433495; AW3737&4; Hs.71; a!pha-2-glyooprotan 1. zkic: breast pros; dag 

433576; BE0B0715; Hs.151091; ESTs: mela; dag 

433662; W0716^ Hs.1508%; RAB25 RAB25, men^ RAS oricoge; OOlon: dag 

433571; AW1387g7; Hs.132906; 19A24 protein; fibro; mAb 
_ - 433701; AW445023; Hs.15155; ESTs; test; dag 
75 433724; AI827749; Hs.144924; serine/Uireonine protein kinas; test; CTL-^jn. 

433764; AW753676; Hs.39982; zinc finger protein RINZF (NMJ pros. ovar. dag 

433800; AI034361; HS.13S1S0: lung type4 ceD membrsie-esso; gBo. lung, test; mAb 

433914; AF108138;Ks.1 12160; Homo sapiens DNAheOcase homo; test; sm 
„ - 43401 1: AW953437; Hs,5486; done FLB5214; pros; dag 
80 4341 05; AW952124; Hs.13094; preseniSns associated rbomboi; lung; dag 

434217; AW014795; Hs^9; ESTs; angto; d^ 

434262: AF121658; Hs.12169: sorting nexin 8; nielac CTL^ 

434274; AA628539; Hs.57783: ESTs, MaderateV similar to AL: test; dag 
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434334; AA912476: Hs.1 16750; Homo sapiens cONA FUI 3221 6s; test; (flag 

434340; Ml 93043; Ks.353146; ESTs, We^ Mat to T1722G; tung; dlag 

434360; AW015415; Hs.127780; ESTs; bmg; di^ 
- 434370; AF130988; Hs.58346; ectaJysplasIn 1. anWdnoOc re; colon, stom; dlag 
0 434377; AW137148; HS.3Q6S93: Intron of periostin (OSF.2o5); headnk; 

434398; AA121098E Hs.3838; senBivinduciblo kinase (SNIQ; ai^ breast CTL-«8m 

434411: AA6a2649; Hs,201372: ESTs; stom. leuk; dlag 

434414; AI798376: : flb*34b07j{1 NCI^CGAP,Ov23 Ho; Jung, test, colon; diag 
^ 434423; NM.006769; H8.3844; UM dom^ only 4; pane; dl^ 
1 U 434449; AW953484; Hs,3849; hypotheOcal protein FU22041 ; sarc; disg 

434487; AF143867; Hs.337588; ESTs, Moderately simflar to SB; Wad; mAlnajn. 

434598; T5953B; ; gb^beSgmi Stratagene ovary; angio; sm 

434608; AA805443; Hs.179909; hypolhefica! protein aJ22995; test CTL+sjiu 

434609;R76593:;gb7i60c1lJlSoaresplacenta;pros;dlag 
1 J 434636; AA083764; Hs.349208; hypothelicaJ protein MGC3178; anlo; dlM 

43464% AA738254: Hs.165390; ESTs. Highly similar to A40350: test; dlag 

434665; AA642125; Hs.74502; gb:ni60c01.s1 NCLCGAP.Lym3 Ho; pane; di^ 

434666; AF151103; Hs,112259; T cell receptor gamma locus; pros; mAb^sjn. 

434699; AA643587; Hs. 149425; Homo sapiens cONA FU 11980 fis; pane; d^ 
ZU 434826; AF155661: Hs.22265; pyruvate deliydrogenase phospha; mela; sm 

434846; AW295389: Hs.119768; ESTs; angle; diag 

434876; AF160477; Hs.61460; Ig supeifamOy receptor miR; lung, Uad; mAb 

434927; H46612: Ks.29361 5; Homo sapiens HSPC285 mRNA, par anafoc diaa 

434973; AW449285;Hs.313636; EST; pros; (fiag 
ZD 435045; BE297155; Hs. 143698; ESTs; test dlag 

435047; AA454985: Hs.54973; cadherin-Bko protein VR20; pros; mAb 

435066; BE261750; Hs.4747; dyskeratosis congenita 1, dysk; cdon; CTL^m 

435080; AI631760; HS.1S51 11; hypothelica! proton FU14428: renal; mAb 

435094; AI560129: HsiOOOOO; EST; over, cerv; dlag 

435099; AC004770; Hs.4756; flap structui«pecific endonu; Uad, lest, mela; CTL^m 

435140; AA668123; Hs.134170; ESTs; fibro; dlag 

435159; AA668879; Hs.116649: ESTs; tung; diag 

435206; AI432364; Hsw160S94; ESTs; test dlsg 
^ 435243; AW292886; Hs.348932; hypotlielica) protein dJ434014.;ceiv, headnk; diag 
35 435292; N20514;H5,172965; ESTs; mela; dlag 

435299; AI74545B; Hs.343026; ESTs, Weakly similar to T20593; libro; diag 

435479; AF197137; Hs.155101; ATP synthase. H transporting. ; pros; sm 

435496; AW840171: H5.26539B; PAR« bet^ breast pane, ovar; dlag 
. 435563; AF210317: H8.95497; solute canter famHy 2 (facfl; blad; mAl>*6.m. 
4U 435576; AF213457; H3.44234; triggertng receptor expressed ; fibro; mAb*8ja 

435602; AF217515: Hs.283532; uncharacteriied bone manow pn test diag 

435615; Y15065; Hs.4975; potassium vdlage^ated channo; gfio; mAb 

435652; N32388; HS.33437Q; uncharacteHzed hypothalamus p; pane; diag 
. 435793; AB037734;Hs.4993;WAA1313 protein; ovar, lung, uler; diag 
45 435849; BE305242; Hs.16098; daudin 2; coton, paic; dlag 

435876; AW612586; Hs.160271; G proteirKoupled receptor 48; pros; mAb 

435897; AF269223; H8.128322; l^xmiplex 1 1 (a murine tcp horn; test diag 

435904; AF261655; Hs.8910; 1,2-aiph^mannosWase IC; blad; sja 

435918; AF263538; Hs.86232; growth diflerentla8on factor ; test dlag 
DU 435974; U2969ft Hs^44; Homo sapiens beta-l adreneigk:; pros, EWS; roAb*sjn. 

436032; AA150797; Hs.109276; latexin protein; pane, anglo; diag 

436063; AK000028; Hs.356100; libosoma) proteb S24; pros; dlag 

436120; AI248193; Hs.119860; ESTs; fibro; dlag 
. . 436199; R38946; Ks.127951; hypothetical protein FU14S03; renal; diag 
55 436246; AW4509S3;Hs.119991; ESTs; Uad; diag 

436251; BE515065: H3.296585; nudeotar protein (KKBD repeat cdlon. test, Uadt Ca^&m. 

436278; BE396290; Hs.5097; synaptogyrfn 2; pros; mAb 

436291; BE568452; Hs.3i4037; protein regu!alorof cytokines: king. Had, headntc din 

436293; A1601188;Hs,308201; ESTs; Wad; dlag 
OU 436302: Al^55841;Hs^330;hypotheticaiprotdnFLJ23588;lu^ 

436315; BE390513; H9.27935; hypotheScal protein M6C4837; mela; diag 

436396; AI663487: Ks.152213; wingless^ MMTV integration: hm, headnlc. pn& pane diaa 

436420; AA443966:Hs.31595; ESTs; angto; mAb 
^ - 436476; AA326108; H5.33829: bH\M protdn 0EC2; pane; diag 
05 436511; AA721252; K5.291502; ESTs; tung:diao 

436553; AW407157; Hs.181 125; fmmunogtobulln lambda tocus; tung; dlag 

436569; BE439S39; Ks.301961; ghitathlone S-transferase M2 (; blad; sja 

436614; AW104388; Hs.149091; ESTs; mela; CTL*5.m. 

436700; AI69369(y, Ks.301406; hypothetteal protein PP3501; mel^ mAb 
/ U 436729; BE621807: Hs.351316; tansmembiane 4 superMy me; pane, colon, slon^ ovar. lung, Nad; mAb 

436772; AW975888; Hs.348918; metallolhlonein IE (functional; engio: diag 

436775; AA731111; Ks.372225; ESTs; uter. ovar; diag 

435839; AA767346: Hs.372277; ESTs; lung; diag 
- - 436856; AI46935S: Hs.127310; ESTs; melac diag 
/5 436954; AA740151; Hs.130425; ESTs; fibro. uter. ovar; diag 

435972; AA284679; Hs.25640; daudin 3; ovar. lung, pros; mAb 

437052; AAB61697; Hs.120591; ESTs; pros; diag 

437099; N77793; Hs.48659; ESTs. Highly similar to S14458; test dlag 
0^ 437100; AI761073; Hs.14535; Homo sapiens cDNA: FU22314 6; pane, renat dlag 
oU 437119: AI379921: Hs.177043: XP_171 387 similar to rftoteWn; fibre; dlag 

437145; AF007216; Hs.5462: sehite canfar family 4. sodiu; pane, pros, stem; mAb 

437156; AI916600: Hs.121194: Homo sapiens cONA: FU21569 % slom. lenaL coton: dlag 

437181; AI30661S: HS.12S343: ESTs. Weakly similar to KIAA07; Uad; mAb«fijn. 
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437204; AL11Q216: Hs.355981: ESTs. WBakt/stmOar to 155214: huig: CTUsm 

43721^ AI765021; Hs.210775i ESTs; renal uter. ovar, (flag 

437224: AL117628: Hs.97808: ESTs; lest (flag 

437259; AI377755; Hs.120695; ESTs; lung; diag 
0 437267; AW511443: Hs.258110: ESTs: BPH: diag 

437289; AA334384; H5.149420: ESTs; angio; disg 

437330; A1353944: Hs.S0115i Hbmo sapiens mRNA: cONA DKFZp7; S8rc;dlag 

437381; NMJ03584; Hs^Sl; MAP Wnase^nteracflng serine^ g«o; CTL*«ja 

437390; AI125859; Hs.l12607; ESTs; lung; diag 
1 U 437412; BE08928B; Hs.34744; Homo sapiens niRNA; cONA DKFZpS; lung; diag 

437435; AA249439: Hs^027: hypothefcal protein DKFZp762H; lung; diag 

437437; AA2268S9; Hs.35ia23; hypotheQcat protein OKFZp7e2L; test CTL^m. 

43747B; AL390172: Hs.317432; Iwanched chain aminotransfefas: angio; s.m. 
^ _ 437553; AIB29935; K5.130497; ESTs. Weakly strnOartoMATB_H; blad: mAb 
1 5 437571; AA760894; Hs.125350; ESTs; pros; diag 

437623; 063^0; Hs.5719: chromosome condensafion^lo; test diag 

437740; AA810265: Hs.122915; ESTs; meta; diag 

437802; A1475995; Ks.12291Q; ESTs; pane; diag 
- . 437862; AW976107; H3.5884; Homo sapiens mRNA; cONA DKFZpS; mela; CTL^m 
20 437908; A1082424; Hs.351043; ESTs; test diag 

43791 5; AI637993; Hs.20231 2; Homo sapiens ctone N1 1 NTera2D; lung, headnk. ovar, Uad, ulen diag 

437931; AI249468; Hs.124434; ESTs; blad; diag 

437935; AW93gS91; Hs.5940; mucin 13. epiliwHal transmemb; colon, slom. uter. pane; mAb«s.nL 
. _ 437938;AI950087; Hs.369628:gb»vq05c02j(1 NCLCGAP_Kid12H; renal. ovar, uter, cerv. blad; diag 
25 437939; AW298600: Hs.64313; ESTs, Weakly similar to S59501; angb; mAb^^m 

437960; AI669586; Hs.389312; ESTs; uter, ovar; diag 

438167; R28363; Hs.24286; chemokine binding protein 2 (C; ovar. breast uten mAb 

438199; AW016531; Hs.122147; hypotlteticai protein FLJ13189 ; breast diag 

438209; AL1206S9; Hs.61 11; aiyMiydrocarbon receptor nuct mete; mAlnsm 
30 438233: W52448;Hs,56147: ESTs; pros, cerv; diag 

438274; AI918906: Hs.55080; ESTs; iieadnk; di^ 

438403; AA806607; Hs.292206; ESTs; lung; mAb 

438438; AA2S7992; Hs.50651; Janus kinase 1 (a protein tyro; EWS; sm 
. . 438450; AI050866; Hs.65653; nodal, mouse, homobg; test diag 
35 438456; AA913381; Hs.279763; ESTs; test diag 

438552; AJ245820: Hs.6314; type I transmembrane receptor ; pros, ovan diag 

438670; A1275803: Hs.123428; ESTs; fibro; CTl.*s.m. 

438702; AI879064; Hs.7164; ESTs; lung; diag 

438707; 108239; Hs.5326; amino acid system Ntransporte; ovar; mAb 
40 438746; AI885815; Hs.184727; Human melanoma^ociated anO; pane, blad, mete, ovar; mAlHCTL 

438817; AI023799; Hs.163242; ESTs; ovar, uter, Uad. rend!; diag 

438859; AI55g626; Hs.93522; Homo sapiens mFtNA for K1AA1 647; renal; dag 

438666; U44385: Hs.6441; Sssue inhibitor of metaltopro; mela; diag 
, _ 438873; A1302471; Hs.124292; Homo sapiens cDNA: FU23123 fi; fibre; diag 
45 438898; A1819863; Hs,106243: ESTs; lung; diag 

438915; AA280174; Hs.355711; WnSams^uren syndrome chrom; lung. test, mete; (H^ 

438929; AW195515; Hs.253177; ESTs; renal; diag 

438956; W00847: Hs.135056; Human DNA sequence from ctone ; lung; diag 
. . 438966; AW979074; ; gb:EST391 184 l^GE resequences,; renal; diag 
50 438983; AF085884: Hs^0029; proacrosin binding protein sp3; test CTL-iSjm. 

438993; AA828995; ; gb:od77b08^1 Na^CGAP.0v2 Mom; ovan mAlnsm 

439053; BE244588; Hs.6456; chaperonin contelning TCP1. su; test diag 

439092; AA830149; ; gb:oc44f08^1 NCI_CGAP.GC81 Ho; pros; dteg 

439176; A)446444; Hs.190394; ESTs. Weakly similar to B28096; pros; iflag 
D 5 439180; At393742; Hs.199067; v^rt>42 svten eiylhrobteslto ; breast ovar. uter. pros, btod. pane, ooton. fibro, mete; mAb 

439221; AA737106; Hs.32250: ESTs. Moderately sindter to 17; EWS; s.m. 

439237; AW408158; Hs^18893; ESTs. Weakly sintilar to A47582; mete, fibro; diag 

439239; AI031540; Ks.235331; ESTs; bted; diag 

439285; A1133916; H5.47660: hypothetical protein FU20093; lung, breast diag 
OO 439310; AF086120; HS.1Q2793; ESTs; mete; diag 

439316; AW837046: Hs.6527; G proteinK»upled receptor 56; ooton, breast ovar. uter, cen^. pros, lung, headnk, blad. mete; mAb^sm 

439335; AA742697; Hs.62492; NM_052863:Homo sapiens secnato; fijfo. uter. dteg 

439366; AF100143: Hs.6540;fibrobtestgTow8i factor 13; pros; CTUs.m. 
, _ 439382; BE247684; Hs.103070; ESTs; angio; dlsg 
05 439394; AA149250; Hs^lOS; ESTs; king; dteg 

439410; AA632012: lte.186746; ESTs; angto; diag 

439453; BE264974; Ks.6566; thyroid hormone receptor Inter; kmg, esoph. ovar. mAlHS.m. 
439496; BE616501; K3.32343; Homo sapiens. SlmSar to RIKEN; mela, esoph; diag 
439659; AW970780; H3^9483; teudne^teh repeatconblnlng; ovar. stem, mete, coton; mAb 
/ 0 439668; AI091277; Hs.302634; frizzled (Drosophite) homolog ; ovar. uter; mAb 

439670; AF086O76; Hs.S9507; ESTs, WeaMy similar to AC0048; lung, headnk. cenr; diag 
439702; AW085525: Hs.55984; ESTa; mela; diag 

439706; AW872527: Hs.59761; ESTs, Weakly similar to 0AP1.lt ovar, teng, headnk; diag 
439735; A1635386: Hs.142846; hypothetical protein; pros; diag 
/5 439737: AI7S1438; Hs.41271: Homo sapiens mRNA lull tength ; pane; diag 

439750; AL3S9053; Hs.57664; Homo sapiens mRNA fufl tei^ ; pane, fibro. breast dteg 
439755; AW748482; Hs.77873; B7 homolog 3; saic; mAb 

439759; AL359055; Hs.67709; Homo sapiens mRNA lull tength ; coton. stom, pane. teuk. teng; dlsg 
Q 439778; AL109729; H3.99364; putaOve transmembrane protdn; pros; mAb^m. 
oU 439780; All 09688; ; gkHomo sapfens mRNA fuH leng: bted, esoph: dteg 

439820; AL360204; Hs.283853; Homo sapiens mRNA fufl length ; ovar. uter. oera. breast pros ; disg 

439864; A1720078; H6.291997: ESTs. Weakly simiter to A47582: test dteg 

439867: AAB4751Q; Hs.161292; ESTs; pane; dteg 
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439920; H05430; Hs.288433: neuroWmta; panc^ 

439926; AW014875: Hs.137007: ESTs; blad. esoph. lung, ceiv; diag 

439963; AW247529: Hs.6793; plateleMivaSng fector ace; breast, lung, blad; s m 

- 440042: AI(J73387:Hs.13389a; ESTs; ovar:CTL^ 

J 440086; NM.005402: Hs.6906: \wa) simian leukemia viral on; angto; diao 
440099; AU)800S8: Hs.6909; DKFZP564G202 protein; pane; diag 
4401 19; AA855455; Hs.125331; ESTs. Moderately similjr to un; test; diag 
440138; AB033023: Hs.318127; hypothefica! protein RJI0201: lung; CTL-^jn. 

- ^ 440151: AA86B167: ; gb:ak38e07 ^1 Soare3_teslMH: sarc; c&d 
10 440207; AI371978JHS.128326; ESTs; test; di^ 

440209; H0SO49; Hs^47837; neurexin 3; Cbro; di^ 

440210; AW674562; Hs.122128; ESTs; gHo; diag 

440225; BE295782; Hs.159; tumor necrosb txtor receptor, gflo; mAb 

- - 440238; AW451970: Hs.155644; paired box gene 2; ovar; dia 
1 J 440260: Ai972887; Hs.7130; copbie IV; pros; diag 

440273; AI805392: Fte.32533S; Homo sapiens cDNA: RJ23523 Q: lung, fibra; diao 
440274; R24595: H3.7122: scrapie responsive protein 1; sarc; diag 
44031 1; AI733079; Hs.125407; ESTs. Moderately similar to AL; rend; diag 
440325; NI\^C03812: Hs.7164; a dislntegrin and metaDoprote; lung; mAb 

ZU 440333; AI378424: Hs,288761; hypothetical protein aj21749; pros; Cn-»s.m. 

440449; AA865430: H8^1925: HOmo sapiens cONA RJ 13446 fis; breast; diag 
440452;AI925136;Hs.55150;EST8,We*lysimflartoCAYPH;fibro;dlag 
440457; BE387593; Hs.21321; Homo sapiens done F1B9213 PRO; mela; diag 
440484; BE328156;Hs.150356: ESTs: pane: diag 

ZD 440529; AW207640: Hs.l6478; Homo sapiens cONA: FU21718 fi; pros; diag 
440659; AF134160:Hs.73Z/:daudin1:liu«: mAb 
440704; M69241; Hs.162; bisutin-fito growth factor Kn; lung, glio, ovan dlag 
440773; AA352702; Hs.37747; Homo sapiens. Slmflar to RIKEN; test dla 
440801; AAg06356;Hs.370038: ESTs; pros; diag 

iU 440819; AI809444; Hs.202108; ESTs; pros; diag 

440901; AA909358; Hs.128812; ESTs; ovar. pros; dlag 

440943; AW082298; Hs.146161; hypothefical protein MGC2408; lung; diag 

440983; M20681; Hs.7594; solute carrier famfly 2 (facil; test; mAb 

^ 441020; W79283; Hs.35952: ESTs; lung, pane; diag 

J J 441031; AI110684: H8.7645; fibrinogen. B beta polypeptide; hmg. pane, colon: CTL-»fijR 

441085: AW136551: Hs.1 81245; Homo sapiens cDNA aJ12532 fis; pane, owr. stom, uter. luna: diaa 

441 134: W29092; Hs.346950; cellular rSic add^Mng; sarc; di^ ^ ^ 

441247; AW1 18681; Hs. 128051 ; Homo sapiens thymic stmmal ly; pras; dleg 

441321; H17182; Hs.7771; B<ell associalBd protehi; test diag 
W 441345; AW068579; Hs,7780; Homo sapiens mRN\- cDNA DKFZpS; pra8;'diag 

441350; AB020690: Hs.7782; paraneoplastic antigen MA2; pane; mAlnCTL 

441377; BE218239; Hs^02656; ESTs; uter. endo. lung; dlag 

441384; AA447849; Hs^8860; retinoic add Induced 3; ovan mAlHe.m. 
. 441392; AW451831;Hs.222119; ESTs. Wealdy similar to 33043% renal; dlag 
45 441457; AW996651;Hs.43838; ESTs; angio; dl^ 

441495; AW294603: Hs.127039: ESTs; blad; disg 

441525: AW241867; Hs.127728: ESTs; bng; disg 

441553; AA2B1219: HS.121296J ESTs; lung. test, ovar; CTL^«m 
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441801; AW242799; H3.86386; ESTs; Wad; diag 

441635; AB038432; Ks.184i advanced glycosylalion end pro; fibra; mAb 

441859: AW194364; Hs^0444: fntei1euMiv4 induced gene-l p; ovar, mela, libitK mAb 

441878: AI801869;Hs.127982: ESTs; test; dl^ .n««,iio™jm«ii 

DD 442006; AW975183; Hs.372210; ESTs. WeaWy similar to S72482: fibro. angle; CTL^ 
442082; R41823: Hs.7413; calsyntenIn-2; breast, pros, ovar; dleg 
442104; U0971; Hs.188; phosphodiesterase 4B, cAIWP-^ipe; angio; CTl^m 
442108; AW4S2849; Hs.343259; ESTs; lung; dlag 
442242; AV647908: Hs.90424; Homo sapiens cONA: FU232e5 fi; BPH; dlag 

60 442323; AW016669;Hs.29190; ESTs; breast; <fia 
442333; AI550877; H3.129302: ESTs; test; (Sag 

SJSSS t!®-2SlSl^*^'=' ^23468; (dad. lung. pane, esoph. mela: CTL^ 
442438; AA995998:Hs.370007;gb.-os26b03.s1 Na_CGAPjad5 Ho; uter, ovar, renal; dlag 

^- 442441; AI820662;Hs.129598; ESTs; breast; dlag t «-.o«y 

QD 442503; AF147078; Hs.375031: p53^Bsponslve gene 5; melsc diag 
442S06; BE556411: Hs.41726; ESTs; angio; dlag 

442573; H93366; Hs.7667; brandied chabi aminotransferas; ovar. panc^ nglo. lest sm 

442577; AA292998; Hs.163900; ESTs; Uad, pane, colon, slom, ovar; (Sao 
„ 442580; AI733682:Hs.130239; ESTs; bmasi; dlag 
/U 442609; AUtt0996; Hs.8518: sdenoproleln N; mela; diag 

442613; AI004002; Hs.130522; KvchanneHnteracCng protein; gOo; dim 

442622; NM_000435; Hs.8546; Notdi (OrDsophBa) homdog 3; ova; mAb 

44271 1; AF151073; Hs.8645; hypothetical protein; angio. mela, sarc; dlag 
_ 442739; NM_007274; Hs,8679; cytosoBc acyl coenzyme A thio; mda; sjn. 
/ 0 442757; AI739528; Hs.28346; ESTs; mela; (flag 

442816; AK001741; Hs.S739; hypothetical protein RJ10B79: breast; dlag 

442821; BE391929; Hs.6752: transmembrane protein 4; ovar; dten 

442832; AW206560; Ks.253569; ESTs; pros, fibro; dlag 

442896; R37725; Hs.283093; ESTs; pane; diag 
OU 442994; AI02671 8; Hs.16954; ESTs; blad, fibro; dlag 

443054; A1745165; Hs.84520; ye&assodated pnM 66 kOa; bladb dian 

443162; T49951: Hs.9029: DKFZP434G032 protein; blad, lung; Cfl*Sn, 

443171: BE2B1128; Hs.9030:TONDU; Wad. war: dl^T 
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443184; AI838728; Ks.1351Sg; ESTs; sac; di^ 

44321 1: A1128388; Hs.143655: ESTs; Uad. ovar. lung, headnk. slann odon; (flag 

44321 S; W80487; KsJ24521 ; hypothaticSf protefei OCSO: test ^ 

443257; AI334040: Hs.11614; H^PCOSS pmtein; 0X0! CTLw 

443400; R28424; Hs.250648; ESTs; lung: diag 

443523: AK00157S; Hs.9536; hypottttOcel protein HJ1Q713; test; Ca^.m. 

443537; D1 3305; Hs.203; Chofecystokinin B rec^ 

443848; AI085377; Hs-143810; ESTs; Jung, headnk; di^ 

443709; AI082692; Hs.134662; ESTs; fibre; <Sag 

443715; AI583187; Hs.9700; cyclin El; lung. stom. ovar. colon; CTL^ja 

443785; AW4499S3^ Hs.19012S[ ba5lotieflx.too|>^-PAS pro; gOoiiiter, ovan CTL^tSjn. 

443802: AW504924: Hs.9805: KIAA1291 protein; saic: diag 

443883; AA114212; Hs,9930; serine (or cysteine) protelnas; saic; sm 

443885; K91806; Hs.15284; ESTs; mela; dlag 

443892; AI889S72: H5.246875; ESTs; lung: dlag 

443950: NM_001425; Hs,9999; epilhellal membrane protein 3; melac niAb 

443988: AA287702: H3.10a31; KIAA09S5 protein; anglo; disg 

443983; H04482; Ks.163724; ESTs; meia: mAb 

443991; NM002250; Hs.10082; potassium intermedialefemall c; pros, colon, liter mAb 

444009; AI380792; Ks.135104; ESTs; angk); di^ 

444151; AW972917; Hs.128749; aJphfrmethylacyl-CoA racemase; pros; mAb 

444159; AF1 16846; Hs.10431: dead ringer (DrosophllaHike ; test; CTL'»«.m. 

444163; AI126098; ; FGEKESK predicted RNaseH domai; bted; sm 

444301; AK000136: Hs.10760; asporin (URRdass 1); pane; diao 

444325; AW152618; Hs.15757; ESTs; esoph; dlag 

444330; AI5976S5: Hs.49265; ESTs; anglo; dlag 

444342; NM_014398; Hs. 10887; slir^ to lysosome-associated; hepC lung, fibro. blad. esoph; diag 

444378; R41339; Hs.47860; neurotrophic tyrosine kinase, ; lung, glto; mAlHSjn. 

444409; AI792140; Hs.49255: ESTs; anglo; dlag 

444M4; AI149332: Hs.14855; ESTs; lilad; dlag 

444471; AB020684; Hs.11217: KIAA0877 protein; gHo, lung, coton ; mAb 

444476; AF020036; Hs.11223: isodtrate dehydrogenase 1 (NA; blad; s.m. 

444484; AK002126; Hs.11260; hypotlieticd prole&i FU11264; pros; diag 

444633; AF111713; Hs. 12284; jun^liona! adhesbn molecule 1; ovar, uter. breast, cerr. blad. headnk: mAb 

444649; AW207523; Hs.371001; ESTs; blad; dlag 

444670: H58373; Hs.332938; hypothetical protein MGC5370; sarc; (Sag 

444754; T8391 1; Ks.11 881; transmembrane 4 superfan^ ma; pane, omuc, slom, lung, colon; mAb^sm 

444809; eE207568; Hs.208219; oculospanin; mela; mAb 

444823: BE262989; Hs.12045: putative proteh; test; diag 

444863; AW384082: H3.104879; serine (or cysteine) protelnas; mela? sm 

444895; AI674383; Hs.22891; solute carrier family 7 (catio; ovar; mAt>*s.m. 

444995; AJ272265; Hs.12230: secreted phosphoprolein 2, 24k; hepC, pane; diag 

445019; Ar205540: Hs.281295; ESTs; headnk, hing. coton; dlag 

44507Q; NK^000677; Hs,258; adenosine A3 recepton gHo, renal: mAb 

445076; AI206888; Ks.154131; ESTs; test; diag 

445084; H38914; Hs.250848; hypotheticai protein FU14761: saic: mAb 

445093: AI207197; Hs.156905: ESTs; test; dl^ 

445109; AF039916: Hs.12330: ecbnucleosUe triphosphate di; pros; s.m. 

445119; AF035121; Hs.12337; kinase insert domain receptor ; aigto; mAb 

445160: AI299144: Ks.101937; sine ocufis homeobox (Drosotihi; saro; CTL-wn. 

445182; AW1897S7: Hs.361778: ESTs; blad; diag 

445247; AW274290: Hs.l53997; ESTs; mela; diag 

445279; R41900; Hs.22245: ESTs; anglo; di^ 

445383; NM-005993; Hs.12570: tubunihspedfte chaperone d; test; diag 

445413; AA1 51342; Hs.12677: C61-147 protein; pros, colon, uter, ovar, tung, pane ; diag 

445418; AW139377:Hs.127179;aypflc gene; pane; diag ^ r- 

445424; AB028945; Hs.12696: cortacUn SH3 domaln^inding p; proa : diag 

445443: AV553838;Hs.295131; ESTs; lung; dlag 

445654; X91247: Hs.13046; thtoredoxin reductase 1; lung; sm 

445684; AKroi696; Hs.13109; Ran binding protein 11; rngk); dlag 

445701; AF055581: Hs.13131; lymphoeyta adaptor protdn: angfo; CTL-»8.m. 

445784; AI253155: Hs.146065; ESTs; meb; CTL^m 

445^ AI734009; Hs.127e99; KIAA1603 protein; pros, fibro; dlag 

445900; AF070526; Hs.1 25036: Homo sapiens clone 24787 mRNA ; renal, teuk; mAb 

44591 1; AI985987: Hs.145645; ESTa, Moderately simRar to Al; blad; di^ 

445982; BE410233; Hs.13501; pescadiBo (zebrafish) homolog; mete; di« 

446057; Ai420227: Hs.366053;Trpf8 transient receptor pota; pros; mAb 

446082; AI274139: H3,156452; ESTs; blad; dlag 

446098; AW072215; Hs.20847Ds ESTs; angto; dlag 

446101% AW9671Q9: H8.13804; hypothetical protein dJ4620234 pros; diao 

446102: AW168067;Hs.317694; ESTs; hing;dl^ 

4461 13; AW957553: Hs.323518: Homo sapiens mRNA for FUO0O83; test; mAb 

446269; AW263155; Hs.14559; bypoUietical protein FU10540: kmg, headnk; CTL*»sja 

446291; BE397753: H8.14623; interferon, gamma^udble pr; mela; dl^ 

446292: AF08149^ Hs.279682: Rh type C ^tyooprotein; hng, cerv; mAb 

448293; AI420213; Ks.149722: UM domain banscripQon bcto; ow. test; dlag 

44632(fc AF1 26245; Hs.1 4791; acy^Coenzyme A dei^rogenase ; pros; 8.m. 

446332; AK001635: Hs.14838; hypothetical protein FU10773: breast; diag 

446342; BE29886S; Hs.l4846; sohite carrier fanrdy 7 (cafo; uter. colon, pros, meia; mAb 

446428; AW082270; Hs.12496; ESTs, Weakly slmOar to ALU4_H; fibro; dlag 

446528; AU076640; Hs.15243; nucleolar protein 1 (120kD); huig. test; diag 

446608; ri75217:Hs.175622: ESTs; uter, ttrot diag 

446626; AW292180; H8.158142: ESTa; pros; diag 
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446638; AO002S63: H3.15767; citron (rticMnteracflng, serin; hmg; CTl-^in. 

446844; NN1.003272; Hs.21065; transmembrane 7 supeifsmlly me; mela; mAb 

446673; NM.016381; H8.15671; LPAP for lysophospha&fic acid; Uad; di^ 
f 446727: AB011095; Hs. 16032; KlAA0523proteIii: sngio: Cn.*sjn 
J 446733: AA863360; K5.26040: ESTs. Weakly similar tD fatty ; breast; sm 

446755; AW4S1473; Hs.16134; serineflhreonine kinase lOc meia; CTLw 

446791; AI63227a; Hs.195922: ESTs; test: diag 

446839; BE091926; Hs.16244; ntitDQc spindle coUetkoB ra; test; d!» 
. ^ 446856; AI814373; Hs.164175; ESTs; lung; diag 
lU 446868; AV660737;H5.348297: ESTs; pane; di^ 

44687% X970SB; H8.1636% p y rin ti d i neigic receptof P2Y, G; tung; mAb 

446932; AAg81459; Hs.12S644: ESTs; tibro; diag 

446967; AI699629; Hs.156781; ESTs; fibio; diag 

446979; AI654443: Hs.197683: ESTs; lest; <fiag 
1 5 446984; AB020722; Hs.15714; Rtio guanine exchange factor (G; anglo; CTL+sjn. 

44698% AK001898; Hs.15740; hypothetical protein FU1 1036; lung, headntqdiag 

446996; N99013; K5^8g66; Homo sapiens mRHA; cDNA DKFZp5; pane, fibro; diag 

446999; AA151S20: Hs.351416; hypoUieOcai protein M6C448S; headnlq diag 
^ . 447004; AW296968; Hs.157539; FGENESH predicted seoBted pro; glto; diag 
20 447078; AW885727; Hs.9914; ESTs; hing; dteg 

447126: AW150632; H5.170307; Ral guanine nudeo6d8 exchang; angio; diag 

447164; AF026941; Hs.17518; viplrfn; similar to Inflammat; colon, lung, breast, stom. hepC, esoph, mela; diag 

447178; AW594641; Hs.192417; ESTs; mela; diag 

447188; H55423; Hs,17631; hypothetical protein DKFZp434E; test; diag 
Z3 447210; AF035269; Hs.17752; phosphafidytserine-specinc ph; pros, mela; sm 

447289; AW247017; Hs.38978; melanoma antigen, family A, 3; hmg. mela; mAbtCTL 

447334; AA515032; Hs.91 109; ESTs; Wad; diag 

447343; AA256641; Hs.236894; ESTs. Highly similar to $02392; tung, Uad. pane, headnk. melsc mAb«&m. 

4473S0: AI375572; Hs.172634; v«b^ avian erythroblasOc t breast, ovar. uten (Sag 
JU 447377; X77343; Hs.334334: transcripSon factor AP.2 alph; breast, lung, mela; CTL^m 

447395; A1418412; Hs.1 84793; Homo sapiens cDNA: aJ21880 fi; pane; diag 

447437; U07225; H3.339; purinergic receptor P2Y, G-pro; blad; mAb 

447499; AW26258D: Hs.147674; protocadhailn beta 1 6; pros. gOo^ ovar : fflAb*8m 
. . 447532; AK000614; Hs.18791; hypotheCcal protein FU20607; hmg, blad; CTL^m. 
3 5 447534; AW95393S! Hs.288655; ESTs; lung, test; diag 

447578; AA912347; Ks.136585: ESTs, WeaMy similar to JCS314; ovar; s.m. 

44759S; AW379130; HS.189S3; phosphodiesterase 9A; pros; CTL-isja 

447604: AW089933;Hs401342; hypothetical protdn MGC4342; meta; diag 
, ^ 447636: Y10043; H5.19114; WghKnoNHty group (nonhiston; hmg; CTL^wn. 
40 447733; AF157482; Hs,19400; MAD2 (mitotic arrest delkient; lest; diag 

447749; T53260; Hs^97; ESTs; renal; (Sag 

447761; AF061S73; Hs.19492; protocadherin 8; EWS, gQo; mAb 

447768: XB6400: Hs.19520: FXYD domaiiH»nta!nlng bn tra; renal; mAb 
. . 447618: W79940; Ns.355279: Homo sapiens done 24670 mRNA ; ranal; diag 
45 447835; AW591623;Hs.164129: ESTs. WeaMy similar to 138022; renal, ovar, Uten diag 

447881; BE620886; HS.35S279; GCN1 (general control of amino; renal; diag 

447937; AL1 09715; Hs.20034; Homo sapiens mRNA full length ; mela; mAb 

447993; AW139525; Hs.170362; ESTs; colon, blad, lung, ovar, pane, headnk; mAb+dlag^m 

448030: N30714; Hs.325960: membran^spannlng 4^omalns, s; pane, leuk, renal, stom lung; mAb 
DO 448045; AJ297436; Hs.20166; prostate stem ceD antigen; blad, pane, pros; mAb 

448105; AW591433; Hs.298241; Transmembrane protease, serine; breast, pane, cglon. lung. ovar. stom; cnAlHdiag'tflja 

448133; AA723157; H173769; folate receptor 1 (aduH); ovar, fibro; mAb 

448140; AF1 46761; Hs.20450; SCU%a membrane protein prec; fibro, mela. leuk; mAb 

- - 448153; Y10805; Hsi0521; HMT1 (hnRNP methyilransferase.; test; CTL^.m. 
JD 448181; AF272833;Hs^763; hypoffietical protein FU10504; test; diag 

448204; AI475124; Hs.170561; ESTs; saic; (fiag 
448231; AI701916; Hs.202509; ESTs; angle; diag 
448258; BE386983; Hs.343214; hypothetical protein FU20396; mela, ovan mAb 
448262; AW880830; Hs.186273; ESTs; Wad; diag 
60 448275; BES14434; Hs.20830; kinesln^ika 2; ovar, esoph, mel^ diag 
448278; W07369; Hs.11782; ESTs; lung; diag 
448290; AK002107; Hs.20843; Homo sapiens cDflA FU1 1245 lis; pros; diag 
448321; NM-005883; H5.20912; adenomatous polyposis coB Dk; ^; CTLism 

- - 448357; N20169: Hs,108923: RAB38. member RAS oncogene fam; lung, mela; diag 
03 448410: AK000227; Hs.21126; hypothalteai prateh FIJ20220; mete; dicg 

448437; AW470125; Hs220529; gb3(w60c04j(1 NCt.CGAPJ'anI Ho; paie, coton; diag 
448499; BE613280; Hs.77550; p53^ulatBd DDA3; glio: diag 

448569; BE382657; K3.21486; signal transducer and acOvato; pane, headnk, fibro. caiv, mela, renat CTL^am 
_^ 448588: AI970276; Hs.156905; KIAA1676: test; CTL^m 
70 448595; AB014544: Ks.21572; KIAA0644gene prodxt; breast gSo: mAb 

448664; AIB79317i Hs.334691; splicing factor 3a subonit 1 ,; mala; CTL-ism 

448874; W31178: Hs.154140: ovary^pedfie addie protein; angle; diag 

448692: AW013907; Hs.167531 ; methytcrotonoyl-ajenzymo A can pros, pros; sm 

448706; AW291095; Ks.21814; Interteukin 20 receptor, alpha; pros, uter, blad, colon: mAb 
/ 3 448719; AA033827; Hs.21856; trlnudeoSde repeat containin: mela saro; CTL-^dbg 

448775; AB025237; Hs.388; nudbc (nudeosUe d^phate ; test: diag 

448811; AI590371; Hs.199460: ESTs; esoph. pane; mAb 

448939; BE267795; Hs.22595: hypothefical protein FU10837: test; CTL-»«.m. 

448966; AW372914; Hs.86149; phospholnositol Jfhosphale-bi: mela; CTL-»sjn. 
SO 448981: AI968719; H&195387; ESTs; test; diag 

448988; Y0976* H522785: gamma«nInobutyrie acW (GABAJ; pros, cerv, coton, 

448993; AI471630; H1355952 KIAA0144geno product; lung. Uad; diag 

44899% AF179Z74: H122791; transmembrane protein wOh EGF; pros, 06o ; mAb 
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449003i X76342; Hs^ sIoqM deMrogenase 7 (dass; 1^ 

449019; AI949095; Ks.67776: ESTs. VfeaMy sbiiBar to T22341: Mad. hmg; di^ 

449027: AJ271216: Hs^SSO: dlpepSdytpepitdase Hi; blad. coton. ovai; SJn. 

449040; NM-012191; Hs^lS; putafive tumor suppressor lung: CR^iam 

449078: AK001256: Ks^S; K1AA1576 protein: mela; diag 

449101: AA205847: Ki.23016: G pioteliKOupled receptor; lung, headnlc mAb 

449109; AW270992: Hs.120949: ESTs, Weakly abnilar to ALU7.H: san; dlsg 

449156; AF103907: HS.1713S3: prostate cancer anfigen 3, non; pios; mAtKCTL 

449207; AU)44222: Hs2325S; nucteoporin ISSIcO; lung: dlag 

449228: AJ403107: Hs.14d590; proton relaled wilh psoriasis; lung: diag 

449230; BE61 3348: K{5.356392^ melanoma cell adhesion molecut; hmg, ceiv. headnk, tiiad. ovar. oolon; mAb 
449317: AW293413; H3,132906: 19A24 protein; mela; mAb 

449318: AW236021: Hs,78531; Homo sapiens. Similar to RIKEN; headnk, lui«, aitgto; CTL^m. 

449322: AI638616: K5.196556: ESTs; tesl:di^ 

449338; H73444: Hs.394; adrenomeduIHa* renal; (Sag 

449394: AA004388: Hs.18160: Homo sapiens cDNA RJIISSO fis; anglo; mAb 

449437; AI702038: Hs.100067; Homo sapiens cONA: aJ22902 fi; test; diag 

449448; D60730; H3.57471; ESTs; blad. lung, headnk. breast: diag 

449487; AW205006; Hs.197042; ESTs; lung; diag 

449494; AW237014: Hs.315369; aquaporin 4; fibro; diag 

449569; A1656634; Hs.195389; ESTs; test; dlag 

449592; AI655494; Hs.195718; ESTs; pane; diag 

449618; AI076459; Hs.15976: KIAA1272 protein: angto; dlag 

449525; NM_014253; Hs.349094; odz (odd OzAerwn. DTOSOphUa); pros; diag 

449650; AF055575: Hs.23836; calcium channel, voltag&depen; pros; mAb 

449680: Ai033821: H&12160; ESTs; renal; diag 

449981; AW265634: Hs.133100: ESTs: giio, esoph, lung, blad ; dlag 

449976; H06350; HS.1350S6: Human DNA sequence from done ; hing; diag 

450096; AI682088; Hs.79375; single-minded (Orosophita) horn; pros; CTL 

450098; W2724g; Hs.8109; hypoO^elical protein RJ21080; breast, king, atom, uter, dlag 

450149; AW969781: Hs,132863; Zic family member 2 (odd-paire; sate; CTL<«8m 

450152; AI138635; Hs.22g68; intron of VEGFR* renal; diag 

4503n; AB033091: Hs.35592S: KIAA1265 protein; ovar, colon; dlag 

450382; AA397658: HS.602S7: Homo sapiens GDNAFU13598tis: pros; dlag 

450400; AI694722: Hs.279744; ESTs; pane; diag 

450431; AW135797: Hs^041: ESTs; lest; dlag 

450451; AW591528; Hs^02072: ESTs; uter, endo; diag 

450508; NM_004460: Hs.41 8; fibroblast adivalion protein.: pane, esoph; diag 

450534; AI570189: Hs.25132; KIAA0470 gene product; angto; CTL^.m. 

450581; AF081513; 83.25195; TGF^ieta 4; uter. cerv, test; dlag 

450635; AW403954: Hs.25237; mesenchymal stem cell protein ; Uad; mAb 

450642; R39773: Hs.7130: copine iV; pros; (Sag 

450656; AA010539; Hs.18912^ unnamed protein product; fibro. uten CTUsm 

450663; H43540: Hs.25292; ribonuclease HI, large subunit; mela: 8.m. 

450676; A1147155: Hs.279727: ESTs:sarc: dlag 

450684; AA872805: Ks.25333; Interieukin 1 receptor, type i; blad, iung. headnk; mAb 
450690; AA296698; Hs^18; FXYD domain^ntaining ton Ha; mela; diag 
450693; AW450461: Hs.203965; ESTs; pros, uter; dlag 
450719; AI096837; Hs.21349; ESTs. Weakly similar to RBOBJi; test: diag 

450737; AW0071 52; Hs.63325: transmembrane protease, serine: colbn, blad, hmg. ovar, pane, headnk; mAb^diag^s jn. 
450785; AA85Z713: Hs.108885: Homosaptens, a!pha*1 {VQ ool;saro: CTL^ 
450832; AW970602; Hs.105421; ESTs; lung; diSQ 

451027: AW519204; Hs.40808: Homo sapiens, Simlar to RIKEN; pros, uter, ^ diag 

451035; AU076785: Hs.430: ptestin 1 (I isotorm); pane; diag 

451050; AW837420; Hs.351869; ESTs; mete; dag 

45109% R52795; Hs.25954; hterteuMn 13 receptor, alpha; glto^ fibra^ mela; mAb 

451108; BE382701; K5.2S980; N-MYC oncogene; test, ovar. CTUsm 

451110: AI955040; Hs^53g8; PAR^ beta (paitfttonlng def, breast, ovar, bng, cOton; cn*9,m. 

451181; AI7g8330: Hs^07461: ESTs; pane; dlag 

451253; H48299; Hs^126; daudin 10; lung, ovar, pane; mAb 

451291: R39288; H16702: ESTs; tung; diag 

451295; A1557212: Hs.17132: ESTs. Moderately similar to 15; pane; dlag 

451320; AW1 18072; Hs.350251: diacylglycerol kinase, zeta (1; lung; 8.m. 

451346; N^006338: Ks.26312; glioma ampCQed on chromosome; ovar; mAb 

451386; AB029006; Hs.26334; spasSc paraplegia 4 (autosoma; hmg: dlag 

451398: AI793124: Hs.144479; ESTs; breast, ovan diag 

451411; AA017492: H3.135S55: EST; pros; (Sag 

451497; H83294: Hs.284122: Whl Inhibitory factor-1: uter. fibro, pros, coton, saro; diag 

451541; BE279383: Hs^557: piakophiiln 3; tong. Uad. ovar; diag 

451592; AI80541 6; Hs.213897; ESTs; tong. headnk; dlag 

451635; AA018899; Hs.127179: crypfo gene; pane; diag 

451663; AI872360; Hs.209293: ESTs; pros: dlag 

451720;AW970985: Hs,290853; ESTs; pros; dlag 

451743; AW074266; Hs.336428: ESTs; lung; <SaQ 

451620; AW058357: H3.19924e: ESTs; pane; mAb 

451844; T61430; ; gb:yc06a03.8l Stratagene iung ; blad; diag 

451982; F13036; Hs^373: Homo sapiens mRNA; cOm DKFZp5: pros, Uad; mAb 

451999; AW176401; Hs.380623; DEAD/H (Asp^Slu-AlaAsp^) b; test; CTl^ 

452046; AB0ie345; Hs.27657; KIAA0802 protein; lung, uter, Cll^m 

452208; AA024792: H3.31695; hypothetical protein MGC4093; renai; diag 

452240; AI591 147; Hs.61232: ESTs: blad, tung, headnk, pane. cenr. diag 

452243t AL3557 1 5; Hs28555; prograRwned eel death 9 (PDC09; breast; diag 

452244; N3353Q: Hs.176674; ESTs; mete; diag 
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452291; AF01559%Hs.28853; GDC7 (c^ division cycto 7, S; tBSt CTL49Ja 
452295; BE3y9936;Hs.28866; p r g gf a n vned cell death lOt tuna: dlag 
452298; AI039243; Ks.276585; £STs; angb; (Sag 

452304; AA025386: Hs.6131 1; ESTs. Weakly similar to 810590; tung, pane. blad. stonv esopii. fibn^colofi; sm 

452316; AA298484; Ks.61 265; ESTs. litoderslefy similar to G7; Uad: diag 

452340; NM.002202; Kls.505; 1SL1 transoipiion (actor. UlJ^ pane, pros; CRi&m. 

452353; CI 8825; Hs.29191; epithelial memtiiane protein 2; pn». bieasi: mAb 

452355; N5492S; Hsi9202; G proteiivcQUpted receptor 34; gliOi Gbro. pane; mAb 

452367; U71 207; 14s^279; eyes absent (Drosophlla) honuA bng. pros, ovsr, uter, CTL-tem 

452416; AA026115; Hs.114777; ESTs; fibre; (flag 

452461; N78223; Hs.108106; transcription factor; blad. lung, headnic. over. gGo^ atom, odon. cerv; CTL^sm 
452571; W31518; K5^665; ESTs; slom, tung, pane, coton, fibnr. dlag 
452594; AU076405; Hs.29981; sobito carrier famOy 26 (suff; ovan mAb 
452613; AA461599; Hs.23459; ESTs; lung; fSsQ 

452679; Z42387; Ks.83883; transmembrane, prostate androg; pros,ooton, pane, pros; mAb 

452705; H49805; Hs.246005; ESTs; pane; dlag 

452717; AW160399; Hs.30376; hypotheOcal protein; pros; dlag 

452721; AJ269529; Hs.301871; sotute earner family 37 pros; mAb 

45273^ BE300078; Hs.80449: Homo saptens, done IMAGE:3535; biad; dlag 

452744; A1267652; Hs.246107; Homo sapiens mfWA; cONA Oi(FZp4; mela; dlag 

452792; AB037765; Hs.30652; KIAA1344 protein; pros, liter, breast ; diag 

452795; AW392555; H3.16878; hypothetical protein FU2162Q; renal, headnk. coton, lung, pane; CTL 

452796; AB01 11 00; H8.30656: KIAA0528 gene pnxluct; test; di^ 

452833; BE559681; Hs.30736; KIAA0124 protein; lung, mela; CTb^m 

452865; AI924046; Ks.1 19567; ESTs. Wealcly simiter to A47682; lung; dlag 

452699; M9S739; hls.30956; nescient helix loop heilx 1; sarc; Cn-»s.m. 

452924; AW580939; Hs.971 99; comptement component Clq recep; angle; dlag 

452933: AW391423; Hs.288555; Homo eaptens cDNA: aj22425 ti; angto; Ca^sm 

452934; AAS81322; Hs.4213; hypotheticat protein MGC16207; teng. Mad; dl^ 

452955; AW390282: Hs.31 1 30; transmembrane 7 superfamBy me; pms; mAb^jn. 

453006; AI382575; Hs.303171; ESTs; pros; diag 

453028; AB006532; Hs.3144^ RecQ protetn-iOce 4; blad. lung, test; CTL-»s.m. 

453085; AW954243; Hs.351573; KIAA0251 protein; angle; dteg 

453096; AW294631; Hs.351270; ESTs; pros; diag 

453102; NM.007197; Hs.31664; frizzted (DrosophBa) hofflotog ; teng. headnk. coton; mAb 

453107; NM-016113; Hs^9746; vanlltoid receptoMike protel; mela; mAb 

453134; AA032211; Hs.118493: ESTs; blad; tfiag 

453142; AA033848; Hs.7473; Homo sapiens gap junction prot fibro; mAb 

453160; AI263307; Hs.356301; H2B hbtone family, member U lung. pane, pros; diag 

453210; AL133161; H5.32380; hypotlietical protein FU10887; hmg; CTL^jn. 

453216; AL137566; Hs.32405; progesterone receptor (PR); btadt mAb^&m. 

453256; Ai565587; Hs.32556; KIAA0379 protein; mela; diag 

45331 0; X70697; Hs.553; solute carrier family 6 (neuro; fibro; mAb 

453321; AI984381; Hs.232521; ESTs; Wad; dlag 

453323; AF034102; Hs.32951; solute carrier family 29 (niicl; ovar; CTL^m. 

453331; AI240665; Ks.352537; ESTs; breast, lung, pane, esoph; mAb^dteg^m 

453344; BE349075; Hs.44571; ESTs; mela; dtag 

453348; BE272318: Hs.8595; hypothetical protein FU12438; test; CTL+sm 

453365; AA035211; Ks.17404: S0X7 8RY (sex determining tegl; angio. bted; CTL-^m 

453370; AI470523; K5.139336; ATP-tiinding cassette, eiMmi: pros; mAb 

453389; BE273648; Hs.32963; cadheito 6, type 2, KK»dhet1n; rnial. ovar, Uad; mAb^ 

453392; U2375^ Hs.32964; SFOT (sec determining region Y); ovar. lung. gOo, sare; CTL^em 

453459; BE047032; HS.2S7789; ESTs; ovar, een<. blad. uter, pane, angte^ lung; diag 

453464; AI884911; Hs.32989; receptor (calcitonin} acfivily: pros; mAb 

453633; AA357001: Hs.34045: hypolheticd protebi FU20764; hmg. esoph. test; diag 

453837; NiyL0Q2589; Hs.34073; BHiNotocadherto (bialn-hearQ; headnk; mAb 

453842; AI370936; Hs.34074; dlpepOdyfpepQdase VI; glto; mAb 

453779; N35187; Hs.43388; 26kO interteron responsive pro; mela: dieig 

453789; AA628517: Hs.118502; ESTs; angio; dlag 

453857; AU)80235; H13S861; Ras^nduoed senescence 1 (RIS1; glto. lung, uter. headnk, cerv. pane, pros, sarc; mAb 

453883; AI838516; Hs.347524; oofector required for Spl Iran; blad. tung; diag 

453884; AA3S592S; Ks.36232; KIAA0186 gene product; teng, ovar. test, esoph; dlag 

453912; AL121031; Hs.356843; 8W1/SNF related, matrix sssoci; mela; diag 

453922; AF053306; H3.36708; budding uninhibited bf bendmi; coton, stem, teng, lest; CTl'»6.m. 

453935; A1633770: Hs.42572; ESTs; pane; diag 

453941; U39817; K3.36820; Bloom syndrome; lung, eerv. headnk; CTL^em 

453354; At981486; Ks.249196; ESTs; teng; (Sag 

453966; BE148734; Hs.63325; transmembrane protease, serine; ooton, bted, teng, ovar. pane, headnk; mAb«diag«8m 
453385; N44545; H3.251865; ESTs; test; diag 

454034; NMJ)00691; HS.S7S; aMehyde dehydrogenase 3 tamil; lung, headhk; sm 

454042; H22S70; H3.47860; hypothetical protein FU20093; hmg; dteg 

454066; X00356: Hs.37058; caIc{ton}nA:akjtonin.reteted ; lung; diag 

454071; A1041793; Ks.42502; ESTs; breast ; dlag 

454077; AC005952; Hs37062: InsuCn^ike 3 (l^ydig ceil); test; dlag 

454098; W27953; Hs^17493: PlakophOin; lung; diag 

4541 17; BE410100; Hs.40368; adaptor-feteted protein compte; mete; GTL-»sm 

454360; L78207: Hs.54470; ATP-bintfing cassette. 8ub4aml; glto; mAb 

454429; 6E273437; Ks^1406; hypothetk»l proteto PP3501; mela; mAb 

454439; AW819152; H3.154320; DKFZP56601646 protein; lung; dlag 

454478; AW805749; Ks.372763; superoxide (fismutase 2. mltedi; meto; sjn. 

455601; A1368680; H3.8I6; SRY (sex detemnlning regton Y); lung. cerv. esoph; sjn. 

456034;AW450979E;gb:U-H«»«l8«-12-04nAl N;blad,tibrocdteo 

456062: A»66286; HS.719S2; ESTs, sbnOar to B36298: fibre, ovar, uter. dteg 
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456177; NK1.012391: Hs.79414; praslsto epttheQunKpedfic E; breast, pns; ciiag 

456286; L2S073; K5.198726; coU shock domain protdn A; pane; CR*s.RV 

456321; NM.001327; Hs.87225: cancei/testb antigen; tin^ CTL 

456553; AA721325; Hs.189058; ESTs, Highly sbnQar to Sintia; pane; diag 

456723; Z43902; Hs.4746; adenylate cyc^ advaSng p; gib; mAt>«6ia 

456736; AW248217: H8.1619( achaetd^e complex (Drosoph; tung; dlag 

456759; BE259150; Hs.1 27792; delta (Drosophila}^ 3; gOo. lung; mAb 

456847; A!3$0456; Hs.86088: ESTs; test; diag 

456938; X52509: Hs.161640; tyrosine anilnotransfBrase; tireast s.m. 

456977; AK000252: H5.16g758; hypotheScat protein FU20245; angio; (Sag 

457200; U33749: Hs.197764; thyroid transcr!p9on (actor 1; fbttn CTL-»&m. 

457211; AW972565; Hs.32399; ESTs. WeaWy similar to S51797; meta, pros; CTL^ 

457292; A)921270; Hs.281462; hypothetica] protein FU14251; blad; mAb 

457313; AF047002; Ks.241520; transaipt}onal coacSvator; test CTUsm 

457411; AVIW961; Hs.13(K}93: iroquoi^Bss l)omeobox pfotei; 

457465; AW301344; Hs.122908: ONA lepTtcation factor; test melac diag 

457498; AI732230; Hs.191737; ESTs; pros; diag 

457561; AA331S17: Hs.286055; chimerin (cMmaefh) % gib: mAb 

45759Q; A1612809; NS.53TB; hypothetjcal proldn MGC10724; over; diag 

457869; AU077188: Ks.108885; Homo sapiens. alplia-1 (Vl) col; sac; CTL'^s.m. 

458092; W67353; H3,350558: K1AA0251 protein; lung; diag 

458124; AW005548; Hs.124590; ESTs; fibro; disQ 

458435: AI410718: Hs.144121: ESTs. Wealdy similar to 746916; gllo; diag 

458471; AV648609; Hs.194240; ESTs; renal, pane hepC; di^ 

458933; AI638429: Hs^4763; RAN binding protein 1; tung. test; diag 

459373; 6E408266; H3.301 408; liypothG6cal protein PP3S01 ; mela: mAb 

459578; AW612538; Hs.304491 ; EST; mela; diag 

459702; Ai204995; ; gb:an03c03J(1 Stratagene scNz; btad, fibro; diag 

459705; BE082764; Hs.2702S2; ESTs, Weakly similar to androg; fibro; mAb^.m. 

TABLE 3B 

Pkey: Unique Eos probeset Identifier number 
GAT number Gene cluster number 
Accession: Genbanli accession numbers 



Plcey 



CAT Number Accession 



103739 
108282 
113230 
118417 



121335 
126872 
322521 
322975 
323332 
323817 
324261 
406665 
409051 

409123 
409745 



411880 
413804 
414221 
417886 
427260 
427298 
427521 

429163 
432189 
432222 
432407 



49403.2 AA1 1 5173 AA07S709 AA076354 AA083101 AA076396 AA085391 AA070684 AA0833&8 AA07S779 AA075221 AA07639S AA6S0486 AA083500 
108971 1 AA085143AA065142 
2327174.1 AI820546AJ821336 T61430 

LI AF080229 AF080232 U67593 U87592 U87591 U87590 AI636743 AI633818 AW206802 AI58371 8 AF080231 AF080234 AF080233 AL535594 
A1818326 AF080230 S46404 AI970376 AA463992 AW565466 BF512210 U87595 U87589 BE550633 Ai672574 BE467547 AI680833 AW6149S1 
N29986 N25695 H69001 U87586 BE673974 A1797496 AI701526 AA703396 AW139734 H92278 N66048 BE219539 BE671665 A1624817BE466611 
A1206344 AA574397 BF593413 BG231271 6F773517 U87S94 BF062180 BE466420 AI887798 BF674385 AA20473S AW49680B AA204833 
AA207155 BI004756 AA206262 AI365204 H77608 AW590511 
AA404418AI217248 

AA136653 AA136656 AW45Q979 AA9B43S8 AA80S054 AW238038 AA492073 BE168945 
AF147347 T65503T55426 
a6391C16413 

A»29520AI791632AI791823AA229315AA22B414AA229211 
AA410943 BF366S82 AA334202 AA332882 BF371899 AW9469S3 
AL044891 Aig08240 AA393060 AW748403 BE069341 6F330573 
M18728 

AA075419AA082953AA080912 AA062835AA071252AA084926 AA078992AA113913AA081881 AA(rr0343AA083821 AA082836AA1 13892 
AA07S316 AA076594 AA078900AA134801 AA063293 AA083403 AW974306 
AA070050 AA070823 AA083403 

BI030997AA921874AW1B8822BI027662AI347618A1361453Ai088754AW207491 AA077391 6G01 2775 BG997382AA286833AA1 50722 
61007625 BI027B64 Bt009100 61006275 BI006270 61031000 B1029864 61006277 61007627 61006266 BI006991 61006990 61007763 61007762 
BG897377 AA150760 61033518 BI027816 66015789 61033807 AA341445 
BE0881O1TOS9g0AW872477 
eE1682566E168190 T64682 

AA1 38653 AA1 36656 AW450979 AA984358 AA80g054 AW238038 AA492073 BE168945 
AA210987 057294 AA214584AA207006 DS6572 
AAf401424AA400100AA663848 
AAS33717 6F061897 AW628327 AA641788 AA400495 

AW973352 BF222929 AW016853 BF059130 AI651829 eES51767 AA558414 AI339359 6F0SS801 AI961162 AI341422 AI208248 A1206165 
AA548736 AA768578 AI539081 AW025957 AA736837 N79575 AV»94357 AA480892 
AW974271 AAS92975 AA447312 AAB84766 
AA527941 AAB35266 AI810608 At620190 

BG207209 BE166299 A1204995 BG199355 AW969908 AA5287S6 AW440776 61044354 

BG036675 BF77a»5 BF771866 BG960386 BG960381 NM-00S712AF1 10315 BE074534 BE182776 BE158000 BE157999 BE714315 AW818104 
AW847519AA099426AVV817981AW856396 66961122 AA224498AA308542AW8218336Fg02155AI732411 BG778634 BG283641 BE748279 
BE74B870BG319S40BE748864 BF739224 BG986155AK057283BI861466 AA683341 AA457S91 BG949294AW392888AA071122AA227B49 
AA58491 8 BG959S70 6F773486 AL041698 6FS59013 RB7170 C16859 6F77041 1 6F7712S8 A1075321 113823 AA216700 BF771864 AWB61 859 
BE537068 C18935 AA155719 8F771 172 BF769107 BF804964 AW818172 AW818143 AW392930 AW817057 AW858044 BI746211 AA17992B 
AWB61687 AW821826 B1055726 BF242643 AA207189 BF770412 BF771 157 BG430030 AA(K5592 
432415 11371J BG1^382 AW161088 N42363 BE935013 B£934^ BG291451 AV700520 BG152773 Ai2249% AI079635 AW0S4706 AA843979 AI744193 FO4060 
T234S7 F04044 AA723859 AA977643 AA283764 AI123609 N21561 BFD55052 eE856661 AI804220 AA843394 AI472045 A17404g0 AAS78830 
H09495 AI283334 AA609495 A!122773 AW162643 AW161798 BF940077 AI80B825 AI360866 An23189 R40236 R20726 AW975899 BE764052 
N31709 N31708 AI031947 AW194138 

3S978J AF134164 BF809407 AA218S67 BF842863 AI267168 6F876178 BG9992S3 AW661851 AW858362 A1817548 BF771300 AA1 13928 AA223422 
AA065556 6F773400 6Fg98869 68)81333 6E073424 6E142245 H59571 H59570 6F871556 BF871064 6E001 1 32 BF826B31 AW754298 



.1 

665586J 

14637.1 

1784158J 

245301.1 

887879.1 

1Q26976J 

OJ) 

107934J 

108378.1 
MH1944^ 



1139083.1 

1556651.1 

685588.1 

1031334.1 

11272J0 

115241.1 

513212.1 

1238297.1 
112710_1 
639529.1 
|yiH1429J2 



434414 
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434598 
434609 
438366 
438993 
439092 
439780 
440151 
444163 
451844 
456034 
459702 



14701J 
14739_1 
1242593 1 
2580163.1 
919640J 
49082_1 
1B79911J 
682245.1 
2327174J 
J 
ij 



AA223267 BG997895 BG9S7897 AW991957 M534354 BG319501 8F738309 At694285 AA045564 BG950256 A1829309 B6S87BS0 8E093175 
BF854337 

AF147374 T59538 T59589 T59598 T59542 

AF147390 R76593 R75594 

AW979074 AA834841 AA828650 

AI926361AA834879AA828995 

AW978407 AA830149 M85983 AW503637 BF352096 

A110^R23665 R26576 

AA868167 F21558 F31418 F35624 

BG403169AI148521 All 84746 All 26098 R05933 6(057330 

AI820546AI821336 T61430 

AA136653AA136656AW4S0g79 AA384356 AA809054 AW238038AA492073 BE168945 
BG207209B£166299A1204995 B6ig9355AW969908AA528756AW440776Bi044354 



TABLE 3C 



Pksy: Unique number oorrespCHidli^ to an Eos probesot 

Refi Sequence SOUT08. The7diBilnuinbeninthlsoatumnareGenbanklden3i6r(G0numbefs. 'DunhamI.etal/ietotoQieinibiicallonentitM 

sequence of human chromosome 22 * Dunham L el til. Nature (1999) 402:489495, 
Strand: (ndlcetes DMA etrend from which exons were pfetSctod. 
Ntjxisifion: Indicates nudeoSde portions of predcted emns. 



Pkey 


RBf 


Strand 


NLposiSon 


325372 


5866920 


Afinus 


1117061-1117304 


325544 


6682452 


RU3 


171228-171288 


327036 


6531965 


Rus 


319951-320040 


327075 


6531965 


nus 


40413184041431 


327414 


5887750 


Pius 


102461-102586 - 


328700 


5868264 


Pius 


764089-764203 


330211 


6013592 


Pius 


59158-59215 


332798 


Dunham, i. eta). 


Minus 232147o231974 


333769 


Dunham, 1. elal. 


Rus 7696625-7696707 


333904 


Dunham, 1. elal. 


Minus 8217374^217261 


334223 


Dunham. 1. elal. 


Minus 12734365-12734269 


334447 


' Dunham, 1. elat. 


Plus 14308764-14308824 


335115 


Dunham,!, eta). 


Minus 21386250-21388146 


335809 


Dunham,!, elal. 


Plus 26310772-26310909 


335824 


Dunham, I et.al. 


Phis 26376860-26376942 


335825 


Dunham, l.et.al. 


Plus 26378175-26378268 


335936 


Dunham, 1. etaL 


Minus 27360474-27380400 


336034 


Dunham, L elal 


Rus 29014404-29014590 


336152 


Dunham, 1. elal. 


Minus 30156053-30155870 


336636 


Dunham, L el^ 


Plus 988418-989185 


338008 


Dunham, L etaL 


Rus 76970687697236 


338033 


Dunhan^ I elal. 


Ptus 809212M092271 


338158 


Dunham, I elal. 


Minus 11794465-11794343 


338255 


Dunham,!, ela). 


Minus 16242294-15242231 


400494 


9714719 


Phis 


169845.170272 


400517 


9796888 


Minus 


49998-60346 


400651 


8117978 


Minus 


81488<«1646 


400665 


8118496 


Pius 


16879-17023 


400773 


8131629 


Minus 


4411644238.4620&48321 


400844 


9188605 


Phis 


24746-24872,2503&-25204 


400846 


9186605 


Plus 


39310^74 


400881 


2842777 


Minus 


91446-91603.92123.92255 


401093 


8516137 


Minus 


22335-23166 


401234 


9929642 


Pius 


120173-120337 


401424 


8176894 


Pius 


24223-24428 


401486 


7341763 


Phis 


32S8&W55,36281'36540,40791-40939,4401 


401704 


3097841 


Plus 


24712-25374 


401732 


1200312 


Phis 


19346-19525,19625-19708,19897-19973,2006 


401747 


9789672 


Minus 


118596-118816,119119-119244.119609-11976 


401760 


9929699 


Plus 


83126^S03S3»^-8554a,94719'95287 


401780 


7249190 


Minus 


28397-28617,28920-29045^135-29296,2941 


401781 


7249190 
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83215^35,83531-83656,8374063901,8423 
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7249190 


Minus 


165776-165986,166189-166314,166408-16656 


401797 


6730720 
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6973-7118 


401994 


4153858 
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29782-29932 


402239 


7690131 
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402260 


3399665 
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402265 
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7547175 


Minus 


5352&-5362B,5575&6592a57S3&OT57 
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7652009 
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8041^60673 


402604 


9909420 
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402605 


9909420 
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4768047973 
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403071 
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9438598 
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40371S 


72396G9 


Plus 
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403740 
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403776 
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403903 


7710671 


Minus 


101165-102597 


404029 


7671252 
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4AA^4A 4 444 4 A 
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404049 


3688074 


Mnis 
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404210 


5005246 


Plus 


4^AWtf 4YA4A4 

169926-170121 


404240 


5002624 


Minus 


4 4^4^\ 4 4ffJA"l 4 4fi0Pfk 4 4^tfWW% 

1 16132-1 16407,1 16653*1 16922 


404253 


9367202 


Minus 


55675-56055 


404286 


2326514 


Plus 


^4 AAA g4ttA4 
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404298 


9944263 


Minus 


73591-73723 


404403 
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Minus 


72053-72238 


404440 . 


7528051 


Plus 
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804^)-61561 


404866 


9366919 


Minus 


4 4^ 4 A 4 4 

11743-11929 . 


404877 


1519284 


Hus 


1095-2107 


404927 


7342002 


Plus 


68690^553 


404996 


6007690 


Phis 


37999-381 45,38652-38998,39727-39872,4055 


405001 


i>A4 r JAP 

6015406 


Minus 


104646-104819 


405025 


7107727 


Plus 


105267-105343,106184-106294,106387-10653 


405121 


8102330 


Minus 


3581 6-36004,36587-36634 


405238 


7249119 


Minus 


51728-51838 


405239 


7249119 


Plus 


144345-144464,144690-144836,151750-15188 


405451 


7622517 


Minus 


145949-146227 


405545 


1054740 


Plus 


1 18677-1 18807,1 19091-1 19296,121626-12182 


405546 


1054740 


Plus 


124010-124183 


405547 


1054740 


Plus 


124361-124520.124914-125050 


405646 


4914350 


Plus 


741-969 


405704 


4204244 


Phis 


138842-139051 


405770 


2735037 


Plus 


61057-62075 


405849 


7651817 


Minus 


17705-18287 


405932 


7767812 


Minus 


123525-123713 


406081 


9123861 


Minus 


38115-38691 


406137 


9166422 


Minus 


30487-31056 


406173 


7230224 


Rus 


12925-13213 


406348 


9255985 


Minus 


71764-71944 


406360 


9256107 


Minus 


7513-7673 


406399 


92S62B8 


Minus 


63448-63554 


406434 


9256651 


Minus 


17803-17931 


406467 


9795551 


Plus 


182212-182958 


406506 


7711374 


Minus 


6843W7 


406547 


7711513 


Minus 


172780^174358 



Table 4A Dsts about 425 genes up^egidated In breast cancer compared to nonnal adult tissues and to nofHnaiignant breast tissues. These genes were selected from a starting 
coilec8on of ^t 59680 pnbesets on Uie Aflymetit(/Eo34tu03 GenaCMp® array. The 'average' breast cancer level was set to the 90th percenSle vahie amongst breast cancer 
speclnwns. The 'average' nonndaduft tissue levdl was eel to fheTSlhperisnf^ Certain predicted protein domains are noted. 

TaUe 6A lists about 231 genes ujKegulated In breast cancer compared to nonnal adult Qssues and to noiHnatignant breast tissues fital arsiacely to encode proteins amenable to 
modulation by small molecuies, peptides, or antibodies. These were selected similarly as for Table 1 . The 'average' breast cancer level was set to the 90tii percentile value amongst 
breast cancer specimens. The 'average* nonnal adult Ossue level was set to the SOlh percenGIe value amongst norHndBgnant adult tissues. The prot^ products of these genes often 
contain one or more domains indicative of have oncogenic (unction or of transducing intraceHutar signals, or of betog modulatable by amdll molecula, peptides, or aniibodles (e.g. 
pidnase, death-domann, 7tm, phosphatase, or toiLtransporter). Certain predicted protein domains are noted. 

TABLE 4A: ABOt/T 425 GENES UP-REGUlATEDtN BREAST CANCER COMPARED TO NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset Identifier number 

ExAccm Exemplar accesston number, GenBank accession number 

UnlGenelD: Untgene number 

Pred.ProLOomains: Cert^n predicted proleto domains. AbbreviaSons used: TU transmembrane domain: SS,6lgnd 

Dkeiy to contain: <=N, lass nkaly to conlato.Aa other protein dom^n abbrevialionsarBlmPFAM(Nud^ 
UrilGene Tllto: Urugene gene tills 

R1: RaOo of 90th percenSIe of tumor to 75lhperDentito of nonnal body Ossue 

Pkay ExAccn UnlGenelD PtQdmDomslns UnlGeneTlte Ri 
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419551 
426174 
409340 
428471 
417931 
447966 
406387 
421814 
406667 
426104 
421481 
422511 
426539 
41S693 
419329 
418344 
407116 
417389 
419452 
421552 
409453 
409632 
417515 
411573 
418751 
422087 
421143 
408491 
417944 
409082 
422281 
425308 
408349 
449019 
435561 
410687 
429455 
414407 
419390 
453902 
411069 
450172 
449717 
451529 
435370 
411761 



408796 
407230 
422830 
447528 
430168 
423225 
414822 
452683 
444784 
453857 
413859 
440369 
416140 
441384 
424464 
423582 
432562 
433320 
429165 
416984 
448913 
430154 
400496 
442599 
448520 
431309 
426485 
435858 
446051 
451982 
450334 
431890 
434449 



AW582256 Hs.91011 

AA547959 Hs.115838 

BE174629 Hs.321130 

X57348 Hs.164510 

W95642 Hs.82961 

AA340605 Hs.105887 



L12350 

AA157857 

Ai204418 

AW391972 

AU076442 

AB011155 

AA133749 

AY007220 

AA216387 

AA130986 

BE260964 

U3363S 



AI885516 

W74001 

L24203 

A6029000 

BE3B9014 



AB024536 

A1088063 

AU077196 

ALl5748fl 

M36803 

M97639 

BE546947 

AI949095 

AA351978 

U2438g 

AJ472111 

AA147026 

AI701162 

BE502341 

AA456454 

NKLO0S864 

AB040935 

AI917901 

AI984074 

At733648 

AJ243706 

NM.000088 

AA6B8292 

AA157857 

AC007954 

AI612027 

AW968343 

AA852604 

AA156S42 

Ai089575 

D12485 

AU}8023S 

AW992356 

AW176150 

BE613836 

AA447&49 

R68537 

BE000831 

BE531048 

D60647 

AW009885 

H38765 

AA194422 

AW58305B 

AF078037 

AB0O2367 

AW45t711 

KM.006207 

AF254260 

BE048061 

F13036 



X17033 

AW953484 

BE410590 



Hs.108623 
Hs.182265 
Hs.190080 
Hs.104696 
Hs.1 17938 
Hs.170290 
Hs.301350 
H8.2889g8 

Hs.271627 

Hs.82045 

Hs.90572 

HS.1057W 

Hs.95612 

Hs.55279 

Hs.82237 

Hs.70823 

Hs.372548 

Hs.1 11301 

Hs.102171 

Hs.7882 

Hs.82985 

Hs.50150 

Hs.346935 

Hs.155585 

Hs.44276 

Hs.67776 

Hs.4943 

Hs.65436 

Hs^8694 

Hs.76704 

Hs.331904 

Hs.3402 

Hs.355702 

Hs^4567 

Hs.23954 

Hs.208641 

HS.225B38 

Hs.71935 

Hs.143323 

Hs.172928 

H3.170345 

Hs.182265 

Hs,121371 

Hs.76277 

Hs.145582 

Hs.125359 

Hs.72127 

Hs.374574 

Hs.1 1951 

Hs.35861 

HS.83G4 

Hs.132449 

Hs,83551 

Hs.288660 

Hs.17g62 

Hs.23837 

H8^78422 

H3.250879 

Hs.116258 

Hs.80706 

Hs.22564 

Hs.234726 

Hs.324051 

Hs.21355 

Hs.313760 

Hs.170040 

Hs.283009 

Hs.37054 

Hs.27373 

Hs.24879 

Hs.271986 

Hs^9 

Hs.119^ 



aaLpenne8S6s,pyridox^de 
14^:TM=M:8S«N 
tra«DitTM=N:SS^ ' 
Jac8lln;TM«H;SS»M 

EGF,tsp 1,vwc,TSPN,tsp 3; 
filainenU>ZIP;TM41;SS=M 

CoIlagen,non6 

SH3kPD2.Guanyfate_ldn;TM= 

A7P1G1J>LM.MAT8;TM=Y;SS=M 

8.100:TM:4I4;SS«N 



PTH.MK;TM=M;SS=Y 

iO,p)dnasd:TM=Y:SS:=M 

Fz.NTR:TM=N;SS=M 

cadherln.cadherin.Cadherl 

S8rpin;TM=N:SS=N 

zW«bo)uKIBR1:TM=M:SS= 

Suiratase;TM^SS=N 

SH2,none 

fn2^0niopex!n,Pep1hJase.M 
fe.LRR,LRRNT.LRRCT;TM=M;S 

vwcColJaoen,CX)LFI:TM=N:S 

tiemopexin:TMsN:SS°M 
ioJ(flngtd,pMnase,Fz:TMs 
homeot)Ox;TM4il;SSsN 

MAGE,CysJmol,EGFJam!n!n 
lysyLoxidase;TM=N;SS='M 

PMP2^.ClaudinPMP2^Ctaud 



SH3.homione3:TM=M;SS=N 
Glyco.transL25:TM=N;SS=N 
acttn,noo8 

EGF,fn3,fibrinogen_C1oxl 
zW2H2:TM=M;SS=N 
PHD^VRlDjn^Cjrr^N^H 
vwcCd]agen,COLFI;TM=M;S 

flamenM)ZIP;TM=N:8S»M 

TB2_DP1^HVA22;TM=Y;SS=M 
efhand.efhand 

l9.RibosomaLS19;TIUNyi;8S» 

homeobox,HLH 

homeobox.none 

SoniatomsdIn.B.Endonucteas 

TM=Y:SS=M 

SAMJ'NT.none 

TM=M;SS=M 

7tnL3^na 

homeobox,none 

TGFbj)ropeplide.TGF-b6ta, 

zf-C2H2:TM=M;SSaN 



R3VO(toxlnJ^TM=M;88=N 

fnniZMlanBP,pkinase,6ST.- 

seiptniTMshtSSsM 

TM=Y;SS=N 

SH3.anIqTM=M;SS=N 

pklnase.DCX:TM»M:SS»N 

homeobox,non8 

i9:TM=N:8S4ll 

bZIP;TM=M;SS^ 

Ephi!nA.<leamln4srnu-al 

NA;NA 

PAP2:T»*»Y;SS=*I 
vwa,liTteofM.F&GAP:TM=» 
efhand,FKBP:TMsM:8S^ 
SH3.HS1jep;TM=M:SS=N 



aiterior grafient 2 (Xenepus bevb 
Homo sapiens sbnibr to Echlnoidin 
mejanophlBn (MLPHJ, mRNA 



lreliailbc(or3(intBsflnal) 

ESTs. WeaUy Mlar to Homotog of 

Target Exon 

Oirombospondln 2 

kefBtln19 

ESTs 

KIAA1324 protein 
ooQagen, type XVII, atpha 1 
discs, large (Orosophfla) homolog 5 
FXYD domsin^ntaining ion transpor 
S100>type catelum tending protein A 
8baic16b02.s1 NCLCGAPJ>r1 Homo sap 
ESTs 

midldne (neudte growlh-promoUng f 
PTK7 protein tyrosine kinase 7 
seoeted frizzled-retated protein 4 
ESTs 

seHne (or cysteine) proteinase hh 
ataxia-telangiectasia group [>«ssoc 
KIAA1077proteh 

phospiK}!nositide-3>)dnase, regulato 
matrix metafioproteinase 2 (gelatin 
immunogtobulin superMly oontanti 
ESTs 

ooHagen, type V, alpha 2 

Homo sapiens mRNA; cDNA DKFZp564B18 

hemopeidn 

leoeptor tyrosine Mnase4ile crpha 
homeotxHiCIO 

ESTs. Weakly similar to T22341 hypo 

hepatocellular carcinoma assodatod 

ly^oxidase-iikDl 

CD209anDgen 

ESTs 

hypothetical protein MGC11138 
ESTs 

cell division cycle2-Eke1(PITSL 
signal transduction protein (SH3 co 
ceretiral cell adhesion molecule 
ESTs 
ESTs 

putative zinc finger protein from E 
putathra ONAfchromatin binding moti 
collagen, type I. alpha 1 
ESTs 
I(era8n 19 

hypothetical protein Df<FZp434P0111 
Homo sapiens, done MGOSSSI. mRNA. 
0KFZP434I1735 protein 
Ttif-1 cell surface antigen 
ESTs 

progesterone membrane bbiding prote 
ectonudeotkle pyrophoephatase/phos 
Ras^nduced senescence 1 (RtSI) 
Homo sapiens pyrovate deMrogenase 
downstream of breast cancer anfigen 
microiibiinar-associated protein 2 
refinoicacMindxedS 
ESTs 

Homo sapiens cDNA FU11812fis, do 

0KFZP586G1122prot^ 

ESTs, Highly simBar to CTXN RAT CO 

prostate cancer associated protein 

diaphomse (NAOH/NAOPH) (cytochrome 

myosin \A 

serins (or cysteine) protehiase tnh 
ENSP00000224716*:GTP.bindlng protei 
RelA-assodated inhlt^itor 
doublecoftin and CaM kinase-lllce 1 
ESTs. Weakly simBar to 138022 hypo 
piateiet^enved growth tactor reco 
tufleOn 1 
ephiin-A3 

Homo sapiens mRNA; cDNA 0KFZp564O17 
phosphalidk: add phosphatase type 
integrin. alphe 2 (CD49B. dpha 2 s 
hypothetical proldn FU22041 simll 
emsl soQuenco (mammary tumor and sq 
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Hnmn RanJnm rDNA Pi 1199^1 fk rJn 


J.9 


nl lost 


M57609 


Hs.72916 


■«I.P9U>TLb=M*QC9M 


uofMuppei iwnuy ni^TiOcr uuo (VsrB 


g.9 




C75094 


Hs.334514 


imNtvm n r*TIUbV>fiSsU 
VOnBQBJLfLUi 1 NFT ,S>0-M 




0.9 


■ilOI 1 


At272141 


Hs.63484 


MMfS twnr liAmanhmrTMsiihfifis 


OrVT ISoX QoIBlliUning leylwn TJ-DOX 


<3.9 


418753 


BE217818 


Hs.67016 




hvnniholb^nl nmlotn PI I99Q^R 


J.O 


452679 


Z42387 


Hs.83883 


IKi— T,OO^IVl 


trsnsniGnibrsnQ, pnjslsto sndroQsn In 


1 R 
0.0 


421030 


AW161357 


Hs.101174 


h ihi illruhf nH Inn'TIUbM- <>5UM 


ImIU UUwUlo*e5aOCicllDa piDlclli loU 


IR 
0.0 


•tOIODf 


N51357 


Hs^60855 




nuiiio sapiens cuixn. rij< 14 lu ns, ci 


J.O 




M58524 


Hs^2S 


1 ur-OolSl, 1 V3rD,jJlupspuQB, 


tfsnsfonning growth fsctor, b6t3 3 


d.O 


447151 


AI022813 


Hs.92679 




nouiu sapiens aonB M/nDrUui4 inivw\ 8 


0.0 


•WOl 1 1 


R67419 


Hs^1B51 


KLH,honi6obox,riOfio 


Honno sspisns cONA FLI12900 fis. do 


0.0 


425867 


D60385 


Hs.12079 


riv{honn>TMsY'<>^M 
CaOnolUli 1 IVH T |0O~'IV| 


i^ftf c UT\tartlf%_0 


1 R 
0.0 




NM_012429 HSJ277728 




CPr*1 A fQ <>AmulelaaYJnm 9 
OCU 1 4 (0> CBlBnSIBgJnlKs i 




mxifm, 


AA383828 


Hs.181131 




cols 


0.0 




AA700553 


Ks.388614 


8n/as,R8cFvnond 


CO 18 


3^8 


42)005 


AW271106 


Hs.133294 




ESTs 


3.8 


■tlUOOf 


X63556 


Hs.760 


Pl^ TR umt PR Til •TLb=KI'(^ 


filjrilfin 1 (Msifsn syndroinc) 


0.0 




AB037745 


Hs.104696 






lA 


449029 


N28989 


Hs.22891 


aa narmo9eBtt>TMsV'fifi=M 


sotuto Confer fsmi^ 7 (csfioiA? 8 


IR 
O.0 


424806 


AA382523 


Hs.105689 


TMsY«jifi&M 


MO 1 r\M 1 piUlBUl 


IR 
0.0 




AI091631 


Hs.203845 


hfi hwM'TMsV'fiSsll 
KHl_liBIR»i 1 IVPT |O0"M 


two poiB potsssium chsnnci KT3.3 


J.0 




AA813887 


Hs.188173 




UnntA eanlana i«nMA CI I191fl7 lie <*U 

nomo ssptcns cuws rui i«io/ ns, cio 


1 ft 
0.0 


wHmD 


AA77068B 


HS.34B495 




U9A McfniiJi fatnllu matMha* 1 

ncn niuona lanniy, nanuier l 


o.r 




AW970980 


Hs.2g3821 


P6p_M12B_propepiRepnd!ysi 


CCTa 

to IS 


3.7 




AA070801 


Ks.51615 


hQTrnon8,j6C|Zf''C4 


PQT« WmMii ebntlarfri AMI? Ml IMAM 


0.r 




AI570830 


Hs.174870 




CO IS 






AW022192 


K3^00197 


nonieobox,none 


ESTs 


3.7 


AiSSRi 


AI493206 


Hs.120785 




cole 


0.7 


•iU9lfO 


BE393948 


HS.S0915 


fnmeWTU^UI'QSsV 




17 

o.r 




AL037672 


Hs.81071 




extracellutar matrix prot^ 1 


3.7 




D43704 


H3^0712 


C3jchaniveUB,R6p6j)rotfiin 


caldum channel, voltag&^ependent. 


3.7 




AI924533 


H3.105607 


nuujjcouansp; I nft;oo=n 


UcsrbonatB b^poiter rdated pro 


3.7 




W33191 


Hs.28907 




nypomeDcai protBiii r U202oo 


3.7 


427378 


BE515037 


Hs.177556 


MAGE:TMsM.'SSsN 




3.7 


424012 


AW368377 


Hs.137569 


SAM.P53;TM=M:SS=N 


•tumor protein ^ kOa with strong ho 


3.7 


416840 


AI821614 


HS.1B5831 




ESTs 


3.7 


433573 


AF234887 


Hs.57652 


7tfaJ2,EGF,cadhefln.tainini 


cadherln, EGF lAG seven-pass &type 


3.7 


433430 


A1863735 


Hs.389982 


&iyn)elobuIln_1JGFBP;Ef- 


ESTs 


3.7 


422491 


AA338548 


Hs.1 17546 


TM=M;SS=Y 


nsuronaOn 


ZJ 


435114 


AA775483 


Hs.28e936 


OOCJ\Z.Ribosom4.L9JI:TM:> 


mllDchondria] ribosomal protein L9 


3.7 


416899 


BE262645 


Hs.80420 


IL8;TM=M;SS=M 


smaO Indudbla cytokhe subfami^ 


3.7 


422110 


AI376736 


KS.1215S5 


kazd.none 


secreted prot^ addle, cysteine- 


17 


448560 


BE613183 


H3.23213 


zM^BP.MDM2.Ndr 


ESTs 


16 


414945 


BE076358 


Hs.77657 


UPAILLY6;TM=M:SS«M 


lymjdiocytB anUgen 6 complex, tocos 


16 


422119 


AI277829 


Hs.111662 


WD4Q;TM»M:SS»N 


KIAA0590genepR)duct 


16 


447335 


BE617695 


Hs.286192 




bypotheOcal protein FU20940 


16 
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450683 


H43540 


Hs.25292 


RNasQLHICTM=N;S^M 


libonudesse Kl, targe stdnndl 


<ac 
M 


417387 


AVV021102 


Hs^509 


zKSH2,Ron8 


bSlS 


4fi 
M 


450825 


Au005354 


KS.2S5Z7 




tight juncQon pnMA 3 (zona ood 


AO 


439755 


Aw748482 


Hs, 77873 




Oy hnmntnrt 4 

D/IURnOIQga 


JM 


439873 


BE159253 


Hs.300o3o 


CCT» 

bolS 


ao 




A!656707 


Ks.48713 


pnna80,non6 


CCTa 


<t ft 


419235 


AW470411 


Hs.288433 


neurotrtmin 


oLO 


445033 


AV652402 


Hs.72901 


8nK;TM=N;a5)=N 


cydirvdapondent Idnaso inhibitor 2 


AO 


404394 


AF332975 


Ks.307004 


EGF,m1,vvvc,vwd>MAM,KerBt 


Zonadhesin 


3.6 


452222 


AW806287 


Hs^1432 


j» noi ^fM 

SeinajlG,rSl,GDi 


SEX gene 


1 c 
J.D 


422961 


Y13620 


Hs.122607 


TA/pM;SS<bN 


D 1 ». o 

D-ceu CUJiyniphana 9 


AO 


420988 


AW006352 


KS.159&43 




ESTs, Weakly similar to T32S54 hypo 


3.6 


439660 


AW245741 


Hs^B461 


zh«ZH2,THiS,KRAB;TNifM;o 


ESTs, Weaidy slnular to A35o59 knie 


3.6 


426615 


D59505 


HS.3S1344 


i9,SET,PHD^CXXC^ap_c 


ESTs, Weaidy similar to K1 Q.HUMAN 


3.6 


437446 


AA788946 


HS.1013Q2 


6i3,vwa,Conaoen.TSPN;TM= 


ESTs, Modeiatety simBar to CA1C RA 


3.6 


421690 


AW162667 


H5.106857 


efnand;7M=M;S^N 


caunndin 2, (29kO, caretinin) 


3.5 


453939 


AM18160 


Hs.86043 




Homo sapiens cONA FU13558 fis, do 


3.5 


426158 


NM.001982 Hs.199067 




i^«iD-o2 anan enflnroobsfe bwe 


3.S 


439246 


At498072 


Hs.351474 


ank,pkiRase,UPF007^i;T^N^^, 


memfarane-assodated ^frosino* snd t 


3l5 


4108S3 


BE383768 


Ks.65238 


zf>C3HC4^P3fnk4sM;SSsN 


95 IcDa retinoblastoma protein bind! 


3.5 


412703 


AW984744 






gb:RC1-HN001 5-040400-01 1-d03 HN0015 


3.5 


427871 


AW992405 


Hs.352406 


TM=M;SS=N 


Homo sapiens, done tMAGE:3507281, 
HbnunoduHn 


15 


444273 


At903474 


Hs^ 


LRR,LRRNT;TMsM;SSsM 


3.5 


434936 


AI285970 


Hs.183817 


UCH-2 


ESTs 


3.5 


457869 


AU077185 


HS.10888S 


vw3,Conagen;TM=M;SSsM 


Homo sapiens, dlpha-1 (VI) collagen 


3.5 


422575 


AK000546 


Hs.1 18552 


PTR2;TMaY;SS=M 


nypotneocal protein FLJ20539 


3.5 


428343 


AL043021 


Hs.12705 


WD40;TM=N;SS3M 


ESTs 


3.5 


426716 


NM-00637g Hs.171921 


l9,SeinairSi;TM=N;SS=M 


sama domain, immunoglotnilln domain 


3.5 


42377B 


Y09267 


Hs.132821 


FM<Wike,pyrjedox;TM=Y;S 


flavin contaNng monooxygenase 2 


3.5 


451558 


NM_001089 Hs.26630 


ABCJran,SKP54;TMaY;SS=M 


ATP-bindIng cassette, sub-fairaly A 


3.5 


407926 


AW956382 


Hs,59771 


TYA;TM=N;SS=M 


ESTs 


3.5 


447041 


AL135480 


Hs.250705 




' Homo sapiens cONA FlJi16B5 lis, do 


3.5 


419073 


AW372170 


Hs.163918 


<tealh,ZU5;TM=N:SS=M 


Homo sapiens ci>NA FU1Z797 ns^ do 


3.5 


446945 


AI193115 


Hs.16611 


TM=M;SS=N 


tumor protein DS2-li)«e 1 


AO 


416322 


BE019494 


Hs.79217 


P5CR,NAO_G}y3PJh,Octopin 


f^noIin&^'Carboxylate reductase 1 


3.S 


447347 


AA570056 


Hs.122730 


NA;NA 


^Oir^ * ■ -* 1--*- L/IA A414 

ESTs, Moderately sonaar to KIAA121 


15 


448984 


AW751955 


Hs^753 


1W=M;SS=N 


hypotneOcal protein FU2Z31o 


3.5 


421778 


AA428000 


Hs.283a72 


NAjNA 


actin related pmteln 20 Gomplex, 


15 


423363 


BE544348 


Hs.127562 


homeobO)qTM=M;SS=N 


homeoboxCII 


15 


432S45 


X52486 


H5^041 


cycQnjioae 


uracO'E^ glyoosylase 2 


15 


408495 


W68796 


Hs^37731 




ESTs 


4 e 
3.5 


406851 


AA609784 


HS.3S2392 


lg,MHCJLbeta;TM=M;SS«Y 


major histocompalibinty complex, c 


3.5 


418736 


T18979 


HS.8790B 


heIlcase_C,ATju)ok,SNF2„N 


Sn^>reialed CBP activator prolan 


3,4 


410197 


NVL005518 Ks.59889 


HMG.Co^Jsynt;TM=N;SS=N 


34iydroxy-3-methylglutaryl-Coenzyme 


3.4 


453597 


BE2ai130 


Hs.33713 


KH-(lonidn,Ribo8oni8L83.Ct 


myo^nodld 1-(d)osphate synthase A 


3.4 


417259 


AW903B38 


Hs.81800 


EGF,ioJectliLC,susM,Xli 


dwndrolfln sulfBte proteoglycan 2 


3.4 


453985 


N44545 


HS.2S1865 


PH,noro 


ESTs 


14 


412634 


U55984 


Hs.366531 




heat shock 90kD protein 1 , alpha 


3.4 


407204 


R41933 


Hs.140237 


Mstonojiistone 


ESTs. Weakly similar to ALU1..HUMAN 


14 


444371 


BE540Z74 


Hs.239 


FbrtUiead:TM=^SS4J 


foinieadpcKMI 


14 


447334 


AA515032 


H&91109 




ESTs 


14 


426530 


U24578 


K3J78625 


A2M,NTRANAT0^RUI,prBiV 


complemeni cornponent 4A 


14 


419749 


X73608 


Hs.93029 


kaza),thyToo!obun!\.1 ;TMs 


spaic/ostsonecBi^ cwcv and kazaM 


14 


423595 


R82826 


Hs.220702 


tiomeoboxjione 


ESTs 


14 


406673 


M34996 


Hs.198253 


{0,MHCJLalpha;TM=M;8S=M 


msjor MstooompaGtiill^oontpla)^ c 


14 


434241 


AF119913 




TMsN;SS=M 


HofT» sapiens PRO3077 mRNA, oompteta 


14 


412490 


AWB03564 


Ks.268850 




Homo sapens cuNA: rUZ2S2o ns, d 


3.4 


452277 


AU)49013 


HS.2B783 


ank;TM=M;SS=N 


KlAA1223prot^ 


3.4 


431457 


NM-012211 


1 Ks.256297 


F6-GAP,vwatTM='Y;SS=M 


integrtn, aif^all 


14 


421777 


6E562088 


H3.108196 




HSPO037 protein 


14 


453082 


H16835 


Hs.31608 


Ion_tr8ns;TM=Y;SS=M 


hypolnetica protein FU20041 


3.4 


414085 


AA114016 


Hs.75746 


al(fedh;TM=N;SSsM 


aldehyde dehydrogenase 1 family, ma 


14 


440300 


N39760 


H3.8859 


TM=M;8S=N 


Homo sapiens, Similar to RIKEN cDNA 


14 


400290 


H18836 


Hs.31608 


Cysjcnot 


hypotheOcal protein FU20041 


14 


433339 


AR)19226 


Hs.8036 




glioblastonm overexpressed 


14 


419301 


AA236166 


H3.132957 


TW=Y;SS=M 


tenomoduSn protein 


14 


414792 


6E314949 


Hs.87128 


TM=Y;SS=M 


hypothetical protein FLJ23309 


14 


451428 


AW083384 


H3.11067 




ESTs. Highly simSar to T46395 hypo 


14 


432210 


AI567421 


Ks.273330 


EGFikazalJaminln^GFJani 


Homo ssqpiens, done 1MAGE:3544662, 


14 


452242 


R50956 


Hs.159993 




gycosyOransfnasa 


14 


450676 


AI147155 


Hs^79727 




ESTs 


4 A 
0.H 


413014 


AW250533 


HSJ5139 


TM=ftSS=M 


partner of RAC1 (aifaptln 2) 


14 


427919 


AA173942 


Hs.326416 


CTFJJn.noiie 


Homo mRNA; cONA OKFZp564H19 


3.4 


424005 


A6033041 


Hs,137507 


T»yi=Y;SS=N 


vang (vangogh, DrasopMlaHiloB 2 


14 


422072 


AB018255 


Hs.111138 


RhoGAP:TM4^SS»N 


KiAA0712 gene product 


14 


440995 


T57773 


H3.10263 




ESTs 


14 


426150 


NM.003658 Hs.167218 


honieobox:TM=N;8S=M 


B8rH4ikehomeobQx2 


13 


416877 


BE386266 


Ks.85658 


zKSH2:TM»M;SS»N 


hypothetk:al (ffotetn FU23436 * 


13 


452191 


AU076408 


K3.2B3Q9 


UDPG_MGDP.dh,UOPGJAGDP_dh 


UDP-gtuoose dehydrogenase 


13 


450273 


AW296454 


1 Hs^4743 


rnrvnons 


hypothetical protein FU20171 


13 


456177 


NM.012391 H8.79414 


as,SAMJ'MT;TM=M;SS«N 


prostata epttheOunvspedSc Els tr 


13 


423062 


NM.003655 Hs^ 


chronio:TM=N;SS»M 


ESTs 


13 


421848 


X15880 


Hs.108885 


vw%OoRa9en;TM3M;;SSBM 


odlagen,^ VI. alpha 1 


13 
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433577 


AfVDOTuoO 


nS.2o4i92 




ESTs 


3.3 








A9„trans;TNFY;SS=N 


Bidno add transporter systom A1 


3.3 


404730 


AA4oo704 


Hs.33287 


CTF_NFI,non© 


Nuclear factor t/B 


3.3 


422940 


DC077490 




Sec7J'H^rjec8ptor^ 


gb:RC1-6T06084)90S0O4n>bQ4 BTOGOo 


3.3 


410001 


AB041036 


Hs.57771 


liypsm;TM=M;SS=M 


kalUkF^II 


3.3 




AA531527 


Hs.332040 


Tm=Y;55=M 


nypoUieocal protem MGCISOIO 


3l3 


453468 


W00712 


Hs.32930 




DKFZP566F084 protein 


3l3 


443807 


Vl^2930 


Hs.9822 


HAT;Tm=N;SSbm 


HCNP protein; XPA-blncBng protein 2 


3.3 




AW450979 


Hs.356377 




gDnJHH^I3'8la-a>12^MJU1 NCL06 


3.3 




AW293399 




ftudear receptor cO'CepiBssor 1 


3.3 




BE144308 


Hs.179891 




ESTs, WeaUy sntiflar to P4H\.nuMAN 


3,3 


423690 


AW956329 


Hs.23721 


sugar.tr,Ribosomal.S2S 


ESTs 


3.3 


423472 


AR)41260 


Hs.129057 


TM=M;SS=N 


brsast csrdnoma smpltfied sequence 


3.3 


424118 


BE269041 


Hs.140452 


penIlpin;TM=N;S5=M 


cargo selection protein (mannose 6 


3^3 


437275 


AW976035 


Hs.292396 


FrizzlRl,Fz 


ESTs, Wealdy sinnilar Id A47582 &ce 


3.3 


437454 


AA323298 


Hs.97837 




Komo sapiens mRhiA; cONA DKFZp547J04 


3.3 


433592 


NM.004642 Hs;3436 


TM=M;SS«N 


deleted in oral cancer (mouse, homo 


a3 


434931 


AW968941 


Hs.166254 




hypothetica) protein OiiFZp566l133 


3.3 


451691 


AI809278 


H5^152 


C2 


ESTs 


3.3 


430433 


AA478883 


Hs.273766 


WW,nonQ 


ESTs 


3.3 


429343 


AK000785 


Hs.199480 


VHS,ElirH,UlM;TM=N;8S=W 


Homo sapiens, Similar to epsln 3, c . 


3.3 


450835 


BE262773 


HS.255B4 


ArfGap(TM=N;SS«M 


hypothetical protein FU10767 


3.3 


414591 


AI8B8490 


Hs.248107 


• 


ESTs, Weaidy stmOar to AUI8 J1UMAN 


3.3 


45S79 


AA1316S7 


K5.23830 


CNJiydtdase 


ESTs 


3.3 


409980 


BE26ig44 


Hs.355264 


hexoMnasel 


3.3 


406850 


AI624300 


Hs.172928 


vw,CoIlagen,OOLFI;TM=M;S 


collagen, type 1, alpha 1 


3.3 


453874 


AW591783 


Hs.36131 




coOagen, type XIV, alpha 1 (unduil 


12 


425964 


AW88g928 


Hs.9071 


hofnoobQxjnoRd 


progesterone membrane binding prole 


12 


428412 


AA428240 


Hs.126083 


■ 


ESTs 


3.2 


430316 


NM.000875 Hs.239176 


ft>3,Furin-lik6,pldnase,R6 


Insulin-tilce growth factor 1 recapt 


3.2 


440087 


W28969 


Hs.7718 


KOW,Rioosom8L54e,S4,ftm; 


hypothetical protein FU22678 


3.2 


449933 


AW157098 


Hs.324104 


DUF176.efhand;TM^iSS5N * 


Human DNA sequence from done FlPI-6 


3.2 


441128 


AA5702S6 


Hs.348504 


TA a_\/.(tp_At 

TW=Y;5S=M 


ESTs, Weakly similar to T23273 hypo 


3,2 


434182 


W20309 


Hs.8107 


G^aTnfn^TM=M;55=N 


G-protein gamma-12 subunit 


3.2 


422737 


M26939 


Hs.119571 


Cwagen,COLFI:iM=N;SSsM 


ooUagen, type III, alpha 1 (Ehlers 


3.2 


408202 


AA227710 


Hs.43658 


OLF:TM=f^SS=N 


DKFZP586L151 protein 


3.2 


424971 


AA479005 


Hs.154036 


PH;TM=M;SS=N 


tumor suf^ressbg subtransferaUe c 


3.2 


407869 


AI827976 


Hs.24391 


emara;TM=M;SS=^ 


hypothetic^ protein FLJ13612 


3.2 


444734 


NM_001360 Hs.11805 


ERG4„ERG24;TM=Y;SS=M 


7-dehydrocholesterol reductase 


3.2 


426991 


AKD01536 


Hs.214410 




Homo sapiens cONA RJ10S74 fis, do 


3.2 


414081 


AWg69976 


Hs.365706 


gIa;TM=N;SS=Y 


matrix Gla proton 


3.2 


408795 


AW749126 


Hs.170345 


homionejBcrMM 


hypothetical protein F1J13710 


3.2 


452345 


AA293279 


Hs.29173 


DSPc;TM=M:SS=N 


hypothetical pn^in FU2051 5 


3.2 


437879 


BE282082 


Hs.5894 


TM=N;SS=N 


hypotheScai protein FU10305 


3.2 


407872 


AB039723 


Hs.40735 


Fz,Frinled,7lm_2,DUF81 ;T 


frizzled (Drosopt^) Itoinolog 3 


3.2 


427289 


A1097346 


Hs.323878 


8nninotrBiu5,SDF,none 


phosplnserfne arrinotransferase 


12 


432375 


BE536069 


Hs^62 


efhand,S_100;T^N;SS=M 


S100 catcium4)Inding protain P 


12 


429415 


NM.002593 Hs.202097 


CUB,NTR;TM=N;SS«M 


procollagen (rendopep^Uase enhance 


12 


412774 


AA120865 


Hs.380149 


hormone jeczf-C4 


ESTs 


12 


445942 


TB0334 


Hs.13479 




hypothetical protein FLJ20847 


12 


439456 


AI752409 


Hs.109314 


zF-C2H2:TM3N;SSbM 


hypothetical protein FU20980 
plasminogen acOvator, urokinase 


12 


414774 


X02419 


Hs.77274 


Mnste,tTypsin,ptanLtM 


12 


433336 


AF017986 


Hs^1386 


F2,NTR;TM=N;SS=M 


secreted fHzzled-related prolein 2 


12 


439905 


AW799755 


Ks.110g53 


HLH;TM=M;SS=N 


retinolc acid induced 1 


12 


420251 


AW374968 


Hs^79829 




Human DMA sequence from done RP5-1 


12 


413004 


T35901 


HS.7S117 


TM=M;SS«N 


biterieuKIn enhancer landing factor 


12 


418686 


Z38830 


Ks.87268 


annodtiiTNNMiSSsN 




12 


410279 


BE271977 


Hs.61809 


ras;TM^8S4l 


hypothetical protein FLJ141 17 


3.2 


424391 


BE550112 


Hs.158549 




ESTs, Weakly stmHar to T2D3_HUMAN 


3.2 


440409 


AW294316 


H8.12S608 


manA 


ESTs 


3.2 


452689 


F33868 


Hs.284176 


v8n8i8mn,rai-ooin8in,inn 


transferrin 


3.2 


418154 


6E165866 


Hs.352403 


horTnono_rec^C4,none 


nuclear receptor subfanniy It group 


12 


434384 


AA631910 


Hs.370133 




ESTs 


12 


413436 


AF238083 


Ks.68061 


DAGKc;TM=M;SS^N 


sphtngoslne kinase 1 


12 


431663 


NM.016589 Hs.267182 


T-b(sqTM=M;SS^ 


liBX3-iso protein 


12 


432874 


Wg4322 


Hs^9651 


SH3;TM=M:SS=Y 


melanoma inhibitory activity 


12 


436252 


AI539519 


H3.142827 




Homo sapiens cDNA FU11S62 lis. do 


12 


421044 


AR)61871 


Hs.101302 


6i3.vwa^GoQaoen,TSPM;'TM= 


Human DMA sequence from done RP1-2 


12 


419102 


AA2340g8 


Hs.42424 




ESTs, Weakly similar to 2004399Ach 


12 


419359 


AU043202 


Hs.90073 


CAS.CSE1;TM=M;SS=N 


chromosome segregatwn 1 (yeast hom 


12 




AW194384 


Hs.9877 


An^nojcudddSOi FWJUnding 


irTteiteuktn-4 oKiuced gens-l protd 


3.1 


426418 


Aa>464 


Hs.169825 


CollaoeivC4,VPR:Tiy»=N;SS=4Ul 


collagen, type IV, alpha 5 (Alport 


11 


413076 


U10564 


Hs,75188 


pkina3e;B«»M:SS»N 


weel (S. pombe) homc^ 


11 


407874 


AI766311 


Hs^B9047 


C0Q7 


Homo sapiens cDNA FU14059 tis. ck) 


11 


448019 


AW947164 


Hs.195641 




ESTs, Moderately similar to 138022 


11 


427024 


AA397572 


Hs.348902 




chromosome 11 open reading frame 14 


11 


410281 


AF076612 


Hs.166188 


vwcW2^.MlF4G 


Homo sapiens done 23928 mRNAseque 


11 


447205 


BE617015 


H3.11006 


LEM»i1Ii|)In:TM»M;SS=N 


ESTs, Moderately similar to T17372 


11 


434433 


AW629759 






gb.-hh70e05.y1 NO JCGAP.GUI Komo s^ 


11 


439737 


AI751438 


Hs.41271 


C1q.0ollaoeanond 


Homo sapiens mRNA fkdilength inser 


11 


450157 


AW961576 


Hs.60178 


PH,Banl.41.RhoGEF.none 


ESTs 


11 


445989 


H97754 


Hs.11108 




ESTs 


11 


442213 


N38110 


Ks.305971 


8ugarJr:TM»Y;SSBM 


solute carter feffllly 2 (fadlltata 


11 
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iMMR 








TBfQStExon 


3.1 








DrVT'Til^iCC— Al 

nib, 1 npN|0«fm 


duomosoine 19 open reading frsnio 3 


3.1 






Lu eoAQO 




Nw^Uozooj: nOTTtO 88pI8nS SCCTBtOQiOPI 


3.1 




BE262621 


nS.f orao 


MIF,sugarJr,nono 


macrofdtage migraSon InhOAoiy fee 


11 






nS.OUao2 




dsimatoponltn 


3.1 


^157 






u8nsfn6nlDfBno4inon8 


CCT* 
CbIS 


3.1 


452753 


AA028049 


HSi277728 


^O&l TDI/^ t»«wM 


aC^AA 10 J^Jl-imiftjili Lj l\ ttVn ^ 

aCwl4 (a» <XreViS188/-ilKS i. 


3.1 


414420 


AA043424 




IIVFfVl,OOHM 


Iininedlals carty response 3 


3.1 


44^29 


nil »Hl30'* 


Lie AAAAti 


TM=M:SS=N 


WAAi ouy protBin 


3.1 


453143 






ta«si n •TKl-IJ-CC-l 1 

sefpin, 1 ivfncoo-m 


pmtdn ^froslne phosphatase, recep 


3.1 


411441 








WO repeal domain 10 


3.1 








AAAJionnofio_fec,zf'C4 


rtomo sapcens cdna: rUZjZoO ns, d 


3.1 


AAAAAi 


HWOI0041 




IRK:TM=Y;SS=N 


nypolneucai pnxein M6C3101 


3.1 


•MOl £9 


RPR91Rn7 




Tkl-V*CC-ll 
1 WFi,oi>-M 


transmembrane 4 superfamfy inernber 


3.1 




AMOOfOI 


Ue vncn 


Rr\.DNAjDinding,iionB 


ESTs 


3.1 


WKWO 


nVOOIDOU 






ESTs 


3.1 


427876 


A)494291 


Hs^69171 


CGTb 

col 8 


3.1 




AA193338 


Hs.12321 




socStim catcbim exchanger 


3.1 




T79526 


Hs.179516 




uuegra type 1 (voutn 


3.1 




AI2072S6 


Hs.13765 




Honb sapiens nANA far FU00074 prot 


3l1 


AAVJOO 
*¥*£c£c 


AI061301 


Hs.164773 


iryp5njcnng»,UrAK.LYo 


ESTs 


3.1 




AA461599 


Hs.23459 


ESTs 


3.1 


447191 


NM014521 


H8.17667 


5H3;TM=m;5S=t»J 


SH3-domain faindng protein 4 


3b1 




T85247 


Hs,351875 


fV^VG^Tftl^-Al-CC-Al 


cytochrome c oxhtase subuidt Vic 


3.1 




BE244231 


Hs.84038 


AAA AID Ao^nAAn r\Aniki.LiA. 

AAA»NBnARC,PAAD_[]APlN;NA; 


COHloprcxetn 


3.1 


44UwO 


AK000517 


H$.6844 


NALP2 protein; PYRIN-ContalnIng APA 


3.1 




AI5B9941 


Hs.8254 




Homo sapiens, Similar to tumor €\fi 


3.1 




AA297634 


HS.S4925 




KiAAiooD protein 


3.1 


424927 


NM-002447 Hs.2942 


nUnaea Cams DCt kA C 


macrophage simulating 1 receptor ( 


3.1 


AW973686 


Hs.153850 




nypoineucaj proiein uo^iU2.4 


3.1 


440100 


BE382665 


H5.15d549 




cols, weaKiy stmiiar u) uUo_nUMAN 


3.1 




AA306477 


Ks.29379 


IIVFAH,00=ri 


nypotJttucai proiein rU i0do7 


3.1 




BE614410 


Hs^044 


TtJ|sM.CC-M 
IIW-«,0O-N 


KAUdi (0. cerevistae) nomong fcco 


3^1 




AI902899 


HS.B5155 


ZKAA/ti, 1 A/Ryi;%KFri 


butyrate response foctor 1 (EGF-res 


3.1 


4£O404 


NM.016240 Hs.128856 


Cou^en;TM=Y;SSBN 


CSR1 protein 


3.1 




AW865388 


Hs,151076 


TM=M;55=N 


KIAA1243 protein 


3.1 




AF161486 


Hs.94769 


ras,none 


RAB23, member RAS oncogene famBy 


3.1 


453271 


AA903424 


Hs.6788 


IJm;TM=M;SS=N 


ESTs 


11 


411274 


NM.002776 Hs.69423 


trypsln;TM=M;SS=N 


kainiueinlO 


11 




AA011117 


Hs.3745 




mnk \A gloDUle>EGF factor 8 pnne 


11 








nn l|H>Ttl— 11 • CC>-A1 


NM-03141 9^:Homo sapiens mofecule po 


11 


413244 


AW955951 


H5.159265 


BTB,P0p.M1 l&jpm^p^^ 


kruppeMela^ zinc finger protein 


11 




BE294350 


Hs.71891 


nUnma CC CD Ium HtTltM 

pi»nase»p&j-oJqfpe_C;TNpY 


discddin domain receptor Miy. m 


11 


409007 


AL122107 


H&495g9 




Homo sapiens mRNA; cDNA UKrZp434G08 


11 




AA335295 


Hs.74120 


1 CA*T1A-U>ce— M 

lca;tk/fm;ss^ 


adipose specffio 2 


11 




M62194 


Hs.75929 


cad}iei1n,Cadh8ririjCjteini; 


w 1- 44 » M AB w ■ - 1^ 

caonenn 11, ^ype 2. O&cadnenn (0 


11 




BE218049 


Hs.121820 




ESTs 


11 




D50920 


Hs.23106 


Ttl-tl>CC— il 

lM=M;b!>=N 


KIAA0130 gene product 


3.1 


AIA 1 7ft 


A!026984 


Hs.293662 


Murs^n8l,lanun!n.B,lainin 


ESTs 


10 


AifnAti. 


AI732742 


Hs.87440 




ESTs 


10 




AA635305 


Hs.375591 




ESTs 


10 




BE616694 


Hs.28e042 




hypothetica) protein FU14299 


10 




AA352723 


Hs.241471 


U/U<l 'TI 1— A1>CC— Al 


RN66 


3.0 


AMQSn 
40<oOr 


At702223 


Hs.107253 


M)ox;TM=N;S&>M 


hypothetical protein DKFZp761F241 


10 


401DrO 


AA374181 


Ks.26799 




DKFZP564D0764 protan 


10 




AF168793 


H3.I2743 


jmultjJiJLjiftTll-H-Ce-.AI 

usnuBcyiuansfiT nfmiss^n 


camlfine 0-octanoyllranski3S8 


10 




AA04392S 


Hs.339352 




Hnno sapiens brother of CDO (BOC) m 


10 




AU0n025 


Hs.265827 


T&l— tl'CC— V 

1 M=M,oo-Y 


nterteron, alpn»nduc9)le protein 


10 




AA345857 


H8.274307 




ISIAA1442 protan 


10 


AA9UQ 


AI751601 


HsJ37S 


UATU ^TDAC •>! /^LtPltTtl- 


1 Nr recepROT'flSSoctaieo laciiDr 4 


10 




AI472068 


HS.37S604 


eirdjeu-zo,w2; 1 iufm:ss=n 


KIAA18S5 protein 


10 




AKD00322 


Hs.18457 


ZKanv4, 1 NFT^oo^ 


hypomeccal protein rU20315 


10 


411734 


AW374954 


H8.71779 




Homo salens WA firomchroniosome 19 


10 


443547 


AW271273 


Hs.356487 


fh3,non8 


hypothetical protein FIJ12686 


10 


AiTnnri 

^1 fUUU 


BEZ77919 


Hs^eoio 


TAl— V-CC— tl 


ESTs, Weany similar to ALU7 JiUMAN 


10 


AICQOV 
4l09Of 


D86957 


Hs.80712 


GTr„CDC;TM=N;SS=M 


KIAA0202 protein 


10 


A'iAAQA 


U78575 


Hs.149255 


moci/'Tki— AkcCikti 

nr5K;TM=N;SS*M 


phosphaG(^inositol-4-pho6phata 5- 


10 




W738S3 


H5^5S424 


pKinase,F5_P8JypQ_C,aalL 


ESTs 


10 




A1569936 
BE392921 


Hs.296178 


OoaudHi;TM^SS=N 


hypotheScal protab FLJ22637 


10 




Ks.3797 


ra3,aJtTM=M;SS=N 


RA626, member RAS oncogene fam9y 


10 


401038 


AA088767 


Hs.83883 




c 1 iuuu42o:gtl4aD774 iiret|Nr.0033 10. 


10 


418245 


TM=Y;SS=M 


transmen&rane, prostate androgen in ' 


10 


407688 


W25317 


Hs.37616 




KumanD9spBcevartantBmRNiA.com 


10 


456908 


AF117646 


HS.1S6637 


zM3HC4.Cbl_N,CfaLN2.CU 


Cas^-M (murine) ectropfc letrovir 


10 


424744 


AW175781 


H8.1S2720 




M-phase phosptK^UDtsia 6 


3.0 


452195 


AA994712 


Hs.116876 




ESTs 


10 


415988 


BE407713 


Hs.78343 


PepLC1-nke;TM=N:SS=M 


bteomydn hydrolase 


10 


418399 


AF131781 


Hs,84753 


TW=N:SS='N 


hypothetical protein FU12442 


10 


420568 


F09247 


H8.247735 


cadherin.l(poc^;TMsM;S 


protocadhedn alpha 10 


10 


404661 






TW=M:SS=N 


C90Q0306*3gi)12737280^efp(P_006682 


10 


414152 


NH4.003248 Hs.75774 


EGF.TSPN.bp.3;TM=M;SS=M 


Qi^wnibospomlin 4 


10 


421307 


Bb599976 
BE560471 


H8.103306 


chnxno 


Homo sapiens mRNA: cONA 0KFZp434B04 
hypotfisUcSl prot^ 


10 


444868 


H3.12101 


TM»N;SS«M 


10 
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450214 

452664 

422105 

422278 

434067 

412876 

426801 

421983 

429299 

408912 < 

438746 



BE439763 



AI929700 

AF072873 

H18913 

NM.000165 

AA486846 

AI2S2640 

AI620453 

AB011084 

AI885815 



H1227571 

Hs.18397 

Hs.111680 

Hs.114218 

Hs.124023 

I&74471 

HS571795 

Hs.110384 

Hs^7408 

Hs.48924 

H8.184727 



RGS;TM»M;SS:44 

Fz^rizzled.7bnJi:TMsY;SS 

ooim»la.Conns)dii43;TM»Y; 

pfDjsofn6fsso,nono 
TW'YiSSsN 

Annadillo.8eg;TM=f^SSsM 
trBnstiBnin,Guany1ateLUR 



fcgiia^of &(irotBin stgnaffinQ 4 
hypoOieOca} pnM FL^3221 
endosulfine slpha 
ftizzted (Drosophlla) homotog 6 
Homo salens cDNA FIJ14216 fe, do 
gap juncSon protein, alpha 1, 43kO 
ESTs. Weakly sbnOar to 138022 hypo 
pepIidylprDlyl isomerase C (cycbph 
hypoihsGcal protein MGC13102 
KIAA0512 gene product; ALEX2 
Human irotenoma-associ&tad antigen p 



TABLE 4B 



Pkey: Unb^ue Eos probeset identifier numtier 
CAT number Gene duster number 
Accesston: Genbank accession numbers 



Pkey 

418344 
412703 
434241 
422940 
456034 
434433 

TABLE 4C 



CAT Number Accession 

245371J AA216387 T63548AA228676 

1243670.1 AW984759AW984744 

63414J AF119913AI207698R57074 

58443.1 BC012771 6G397153BF386196 AA337277 AA319285AW843252 

685586.1 AA136653 AA136656AW4S0979 AA984358AA609054AW23B038AA492073 BE168945 

111338.1 AA633408AW749955AW629759A1651005 



10 

ao 

3.0 
3.0 
3.0 

ao 

3.0 
10 
3.0 
3.0 
3.0 



PIcey: Unique number conBSponding to an Eos probeset 

Refi Sequence souroe. The 7 digit numbefs In tMs cdumn are Genbank Identifier (GO numbers. Dunham I. el al.' refers to the publicalibn entitled The DNA 

sequence of human chromosome 22.* Dunham I. et tiL. Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which axons were predicted. 
Ntj)osiGon: Indicates nudeotide positions of predicted axons. 



Pkey 


Ref 


Strand 


NLpcsiQon 


406387 


9256180 


Phis 


116229-116371.117512-117651 


400496 


9743564 


Plus 


4151541695 


402496 


9797769 


Minus 


8815-9103 


403439 


9719679 


Plus 


91463-91532 


401038 


7232177 


Minus 


4277-4469 


404661 


9797073 


Pkis 


33374-33675.33769^008 



TABLE SA: ABOUT 231 GENES UP-REGULATED IN BREAST CANCER COMPARED TO NORMAL ADULT TISSUES THAT MAY ENCODE EITHER ENZYMES OR 
PROTEINS AMENABLE TO MODULATION BY SMALL MOl£CULES. PEPTIDES. OR ANTIBODIES 

Pkey: Unique Eos probeset identilier number 

ExAccn: Exemplar aooessiofl number. GenBank accesston number 

UnlGenelD: UnlGene number 

RedPmLDomsins: Certeiln predicted protein domab6.AbbrevlaBon8 used: TM.tiansn^ 

iiiteiy to contain; «N, less Cicely to contain. AH other protein domain abbreviations are from PFAM (Nudeie Adds Researeh. 2002. 30:276-280). 

UniGene TUe: UnlGene gene fitia 

R1: Rafk) of 90th peroenfile tumor to SOthpercenSte of nonnai body tissue 



Pkey ■ 


&iAocn 


UhlGenelO 


fVed.PraLDom8lns 


409340 


BE174629 


Hs,321130 


a^jiemneaseStpyddoxaljle 


421481 


AW391972 


Hs.104698 


TM4^8S=M 


419693 


AA133749 


H8.301350 


ATPlG1.PLMJWATe;TM=Y;S8=M 


417389 


6E260964 


Hs.82045 


PTNJ«<;TM=M;SSaY 


414521 


D28124 


Hs.76307 


DAN;TM==M;SSbM 


438091 


AW373062 


H5^1546 


homionejrec,zf-C4,none 


413815 


AU)46341 


H5.75562 


pklnase,F5J8Jype.C;TM=Y 


439180 


A1333742 


Hs.1990$7 


Furavfil(e,p)dnase.Recep. 


431441 


U81961 


Hs.2794 


ASC;TM=Y;SS=N 


452547 


AA335295 


Hs.74120 


LEA:TM=M;SS^N 


452239 


AW379378 


Ks.356289 




441384 


AA447849 


Hs.288660 


7bTu3.non8 


419223 


X60111 


Hs.1244 


transmembr8ne4;TM3Y;8S"M 


413859 


AW992356 


Hs.8364 


8AM-PNT,non8 


410587 


U24389 


K3.65436 


LysyLoxldase;TM=N;SS=M 


422599 


BE410590 


H3.119257 


SH3.HS1jBp;TM=M;SSsN 


419452 


U33635 


Hs.90572 


]g.pkinas8:TM»Y:SS=M 


427378 


BE515037 


HS.1775S6 


MAGE:TM=M;SS=N 


4447B4 


D12485 


Hs.11951 


SomatDmedin_B.Endonucteas 


436972 


AA284679 


Hs.25640 


PMP22_Claudin;TM=Y;8S=M 


412926 


Ai879076 


Hs.75061 


MARCKS:TM=N;SS=M 


425280 


U31519 


Hs.1872 


PEPCK;TM=M;SS=N 


432636 


AA340864 


HS.278S62 


PMP22.aaj(fin'TM=Y:SS=M 


423778 


Y09267 


H3.132821 


FMO^,pyrjedox;TM=Y;S 


424206 


NM.003734 


Hs.198241 


Cu.andn6undd,OiuEBnIne.ox 


444797 


AB016333 


Hs.12002 


SH3,8AK«TM»M:SSbN 



UniGeneTiOa 


R1 


melanophilin{MlPH),mRNA 


20.5 


KtAA1324 protein 


16.3 


FXYDdontBin-cantalnlng Ion transpor 


117 


nAlkine (neurite grouvth-promoting ( 


117 


neuroblastoma, suppression of tumor (DAN) 


117 


nuclear reoeptorsubfamiiy 1, group 


114 


(Bsooldin domain receptor famiiy, m 


13.2 


v-erb-b2 avian ^ythroblasOc tsuke 


13.2 


sodium channel, nonvdtage^ated 1 


123 


adipose specific 2 


12.4 


protein tyrosine phosphatase, recep 


1Z1 


reSnolc add induced 3 


113 


CD9 anSgen (p2^ 


11.7 


Homo sapiens pynivate dehydrogenase 


11.5 


lysyloxidas&aQel 


lU 


emsl sequence (mammary tumor and sq 


10.1 


PTK7 prcteln tyrosine kinase 7 


9.9 


melanoma ant^en, fanily D, 1 


9.9 


ectonudeoQde pyrophosphatase/phos 


9.9 


daudln3 


9.7 


macrophage myiistoylated atadne-d 


9.5 


phosphoendpynjvate cartxKyldnase 1 


9.5 


daudln7 


9.4 


flavin containing monooxygenase 2 


9.4 


amhie oxidase, copper oontdnfaig 3 


a4 


taAA0790 protein 


ao 
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WaSS 
44^)992 


ARM3329 


Hs.173717 


PAP2iTM=Y:SS=M 


PPAP2B PhospnatWIc add phosphaUype 2B 
tsnsk) 


9.0 


AW888Z22 


Ks.9973 


SH2,VvW^D,n(»)e 


6b9 


4£l i4j 


AB024538 


Hs.102171 


(g,lW?,LRRNT,LRRCT;'TM=MtS 


immunoglobtinn supetfamOy cont^ 


&8 




NM_C04642 


Hs.3436 


TM=M;SS=N 


defeted h oral cancer (mouse, homo 


8.7 


4iUooo 


DCJ79/94 


Hs. 159651 


aaam,TNFK^6;TM=Y;SS=M 


hypotheScsl proton 


8.7 


433862 


W07162 


Hs. 150826 


ras^C_tran,arf;TM=M:SS= 


RAB25RABK, member RAS oncogene la 


a6 


421 853 


/^1 17472 


Hs. 108924 


Sn3,6(xb;TM=M;SSsN 


SH3-domaln protein 5 (portsin) 


8.6 


425335 


BE394327 


Hs.298267 


efhand,kazaUrf^,7lnL 


fbllista6nto1 


8.5 




nioo3o 


Urn 14 CAD 


Cysjowt 


nypotneOca protan FU20041 


8.5 


438089 


WD5391 


Hs,351546 


hoimon8jeczW>4,nonD 


nuclear receptor subfamily 1 , group 


8.4 


426158 


4111 AA4 rum 

NM-001982 


Hs. 199067 


FuTin4Ike,pkln3se^R8cep^ 


v-eH>4)2 avian erythroblastic leuite 


8.3 


447191 


nM_014521 


Ks.17667 


8H3:TM3^^sss:^t 


SH3^main binding protein 4 


8.2 


439941 


AI392640 


Hs.18272 


AqJtrens;TM^;SS«N 


amino add transport^' system A1 


8.2 


439318 


AWB37046 


Hs.6527 


71inJ^CytC_asin.GP^TM»Y;S 


6 |vo(dn-ooupted receptor 56 


8.1 


442213 


N36110 


Hs.305971 


sugar_tr;TMaY;SS=M 


soiute carrier brrfly 2 (teiOlate 


&1 


412649 


NM.002206 


Hs74369 


lntegiin^FG-GAP;TM=^SS 


integrin, dpita 7 


&1 


448913 


AA194422 


Hs.22564 


inn,zl-RanBP,pklnas8.GST. 


myosin VI 


ai 


420166 


AW732276 


H3.95583 


tiansmembF8ne4;TM'Y;SS=M 


transmemt)rane 4 superfamHy membw 


ao 


407102 


AA007629 


Hs.348601 


transpartj)rotSWiB,ASC 


g!yoen)l<3-phosphate dehydrogenase 


7.9 


452516 


AA058630 


Hs.29759 


TM=N;SS=M 


RNA POLYMERASE I AND TRANSCRIPT REL 


7.9 


413076 


U10564 


Hs.75188 


pWnaso;TM=M;SS=N 


weel (S. pombe) homolog 


7.9 


443604 


C03577 


Hs.9615 


elhand;TM=M;SS=N 


myosin regulatory Rghtcliain 2, am 


7.7 


429002 


AW248439 


Hs.2340 


Anna(i9IOLSeo!TM"M;SS^ 


juncSon pm(0^t)ln 


7.6 


432562 


BE531048 


Hs.278422 


zf-C2H2;TMBM;SS4il 


OKFZP586G1 122 protein 


7.6 


426359 


AA376409 


Hs.10862 


adeny)ate1dnas8,none 


Homo sapiens cONA: FLJ23313 lis. d 


7.6 


417733 


AL048678 


Ks^2503 


NAi'NA 


Hsapiens mRNA for TUTR of unknown 


7.5 


451541 


BE279383 


Hs.26557 


Annad3loj5eg:TM=M;SS=N 


plai(0philin 3 


7.4 


443951 


F13272 


Hs.356835 


PUP2!LCIaudA,]ion9 


ferritin, light polypeptide 


7.4 


409960 


6E261944 


H&355264 




hexoktnase 1 


7.3 


423184 


NM-.004428 


Hs.1624 


Ephrin;TM=M:SS=M 


ephrin^l 


7.3 


405121 


X0438S 


Hs.110802 


QfsJ?not,TGF-bela,vwa.vwc 


von Witlebrand f xtor (VWF), mRNA 


7.1 


438974 


AF089816 


Hs.6454 


PDZ;7Mf=N:SS=M 


chromosome 19 open reading frame 3 


7.1 


417771 


AAa)4698 


Hs.82547 




retirtolc add receptor respondsr (t 


7.0 


424118 


BE269041 


H8.140452 


peri!lpin;TM=N;SS=M 


cargo selection protein (mannose 6 


7.0 


402705 


X57951 


Hs.180909 


AhpOTSA;TM;3M;SSsM 


peroxIrBdoxin 1 (PRDX1) 


7.0 


417115 


AW952792 


Hs.334€12 


Sin,pkinas6;TM=N;S8=N 


small nuclear rlbonudeoproteln pd 


7.0 


442572 


At001922 


Hs.135121 


HSP70 


hypothelicai protein FU22415 


6.9 


447215 


R75812 


H$.169248 


cytochrom8jc;NA:NA 


p75NTR>8SS0ctated cell death execut 


&9 


422276 


AF072B73 


HS.11421B 


Fz,Frizzled,7tin.2;TMsY;SS 


frizzled (Drusophila) homdog 6 


6.9 


414657 


AA424074 


Hs.7o780 


TM=M;SS=N 


protein phosphatase 1. reguteloty ( 
Homo sapiens, done MGC:9381, mRNA. 


6.9 


447528 


AI612027 


H8.76277 


TBJLDP1-HVA22:TM=Y;S8=M 


6.9 


436729 


BE621807 


Hs.351316 


TM=Y;SSsM 


transmsmt)rane 4 superfarr^ memt)er 


6.9 


428013 


AF151020 


Hs.181444 


TM=Y;SS=M 


hypothetical protein 


6.9 


444143 


AW747996 


Hs.160999 


Bd-2,none 


ESTs, Moderately similar to A56194 


6.8 


414443 


AU07726B 


H8.76144 


igipk!nas8;Tiyt»Y;SS^ 


plateiet-derfved growth (actor reos 


a7 


418751 


BE389014 


Hs.372548 


SH2.none 


phosphoino8ltIde-34(h)ase, regulato 


6J 


448479 


H96115 


Hs^1293 


UDPGP;TM=M;SS=N 


UDP-N-adeylgluccsamine pyrophospho 


&6 


410552 


X66945 


Hs,748 


lg,pk]na$e.SH2,SH3,C2PH, 


fibroblast growth fador receptor 1 


&6 


414683 


AA926960 


Hs^669 


CKS;TM=N:SS=N 


CDC28 protein kinase 1 


6.6 


417426 


N(iL002291 


Hs32124 


iam!nIn_E(^,iaminin_Ntenn 


landdivbeta 1 


6.6 


428179 


AI127772 


Hs^9G98 


pkinas6.PX,pkinase_C;TMBN 


serum^Iucocorfico^ regulated Idna 


&6 


443195 


BE148235 


Ks.193063 


Aa_tra:^one 


Homo sapiens cONA FU 14201 lis. do 


6.5 


424512 


X53002 


Hs.149846 


lntegriiuB,EGF;TM=Y;SS'M 


integrin, beta 5 


&5 


421733 


AL1 19871 


Hs.1420 


ig.pWnase;7M=Y;8S«M 


fibmt^ast growth factor receptor 3 


&5 


428950 


BE311879 


Hs.194673 


DED;TM=M;SS=N 


phospho{nx>tein enriched in astrocyt 


6.5 


450172 


NM.005864 


Hs.24587 


SH3.horTrone3;Th^M;SS=N 


signal transduction protein (SH3 co 


6.5 


416078 


AIJ034349 


Hs.79005 




protein tyrosine phosphatase, recep 


6.5 


408912 


A6011084 


Hs,48924 


Armadilb_$e9;TM='M;SS«M 


KIAA0512 gene product; ALEX2 


6.4 


428373 


AI751658 


Hs.1 83986 


lg;TM=Y;SS=M 


poOovims receptor-fetated 2 (herp 


6.4 


449029 


N26989 


Hs.22891 


8gj)enTieases;TM=Y;SS=M 


sdute carrier family 7 {calionic a 


6.4 


406621 


X57809 


Hs.181125 


Hl,HSP70,Ppx-<jppA;TW=M;SS 


ImmunogkMn lambda locus 


6.4 


431629 


AU077025 


Hs^658Z7 


TM=M:SS=Y 


interferon, alphfrindudble protein 


6.4 


428169 


AI928984 


Hs.1 82793 


photoRCUPFOI 18;TMaY;8S»N 


gdgi phosphopmtein 2 


6.4 


443337 


Y07604 


Hs.9235 


NDK;TM=N;SS=N 


non-metastatlc ceDs 4, protein exp 


6.4 


451292 


AB037716 


Hs.26204 


SH3;TM=M:SS=N 


KiAA1295 protein 


6.4 


425976 


C75094 


H$.334514 


vdlage_CLC;TM=Y:SS=M 


NG22 protein 


6.4 


426539 


AB011155 


Hs.170290 


SH3,PDZ,Guanyl8tBjdn;TM» 


discs, large (Drosophiia} homolog 5 


6.3 


417208 


S67773 


HS.B166S 


i04iUn8se:TM='Y;8S=M 


v-kit Hardy-Zuckerman 4 feilne sarc 


6.3 


438278 


BE409248 


Hs57988 


TF1IS.RNA^POLX.15KO,UPFO 


hypothetical protein FU22357 simS 


6.3 


429455 


AJ472111 


Hs^8694 


lecCfLC 


CD209 antigen 


6J 




AVV29o135 




/Hi f\A/^ DC Kmu4 DU Dluk/*CC 


vav 3 oncogene 


6.3 


445033 


AV652402 


Hs.72901 


8rriqTM=N:SS=N 


cydin-dependent kinase tnhlUlor 2 


&3 


411756 


BE294350 


Hs.71891 


pMra5e,F5je.ln)e_C;TM=Y 


disooldin domain receptor fandly, m 


6.3 


453902 


BE502341 


Hs.3402 




ESTs 


6.3 


418005 


AJ186220 


Hs.83164 


Colag8n.TSPN;TlUNM:SS=M 


odiagen. type XV, a^a 1 


&2 


449924 


W30681 


Hs.146233 


SH3tnone 


Homo sapiens cDNA: FU22130 fis. d 


&2 


426520 


BE54S684 


HS.343S66 


a^jJonnBSseSipyricloxaljds 


KIAA0251 protdn 


&2 


453064 


R40334 


Hs.89463 




potasdum large oondudanco cslkiium 


6.2 


448520 


AB002367 


Hs.21355 


pkina5e.DCX:TM=M;SS^ 


doublecorlin and CaMklnasfr^ike 1 


6.2 


452683 


A1089575 


Hs,374574 


hom80box,nona 


progesterone membrane binding prote 


6.2 


402575 


AF043329 


H8.173717 


PAP2:TM=Y;SSsM 


PPAP2B Phosphatidic add phosphat type 2B 


6.2 


444672 


Z95636 


Hs.11669 


lamIniii^,landnliuG,EGF 


tsnMn, alpha 5 


6.2 


450440 


AB024334 


HS.2S001 


14.»;TW=M;SS^ 


^frodne 3-4nonooxygenase/lryptophan 


6.2 
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432314 AA533447 Hs.285in XlinMona ESTs &1 

438564 AA381553 Hs.198253 IgJMHCJLalphaMtane m^HstooompaMycomi^ 6.1 

444252 R21135 H$.54985 ESTs &1 

. 4251B4 BE278288 Ks.155048 lg:TM:>Y;SS=M Lutheran bbod group (AubetQer ban 6.1 

5 4318S0 X17033 HS.271S88 vwaJntegrinJ^F&GAP;TM5 intdgrin. afpha 2 (C049B. alpha 2 8 &1 

449475 At348027 Hs.129826 transmenibrane4;TM3Y;SSBM hypoDielicalpratsinPPIOS? 6.1 

449538 AI559444 H3.104679 TMs»A;SS=M ESTs 6.0 

414496 W73853 Hs355424 pkinase.F5.Fa.typ8.Qadn. ESTs 6.0 

414217 AI309298 Hs.279898 NA;NA Homo sapiens cONA: FU231 65 fis.d aO 

10 445333 BES37641 H&44278 r33,aif.TK:TMNN:SS=M hypotlratica) protein FU 1 2538 simil 6.9 

431183 NM.008655 Hsm96 EfLtumerueceptTK^k^SS^M KDa(Lys-Asp^iKleu) endoplasmic 5.9 

409645 AI142265 HS.S5498 poiyprenyL8ynt:TM=M;SS»N geranytgemny) diphosphate synthase 5.9 

412276 BE262621 Hs.73798 MIF,sugarJlr,none macrophage migration iniiibilofy fx 5.9 

^. 416137 BE279513 H5.27^7 ptdnase.UBA.ThiFrrM=M;$Ss utdquiltn activating enzyme El -iiie 5.9 

15 412969 AI373162 Hs.75103 14.3^TK^N:SS<:M tyrosine 3-monoQxygen8S8/liyptophan 5.9 

414504 AW069181 Hs.115175 pkhia5e.SAM:TM^M;SS=N sterile^pha motif and leucine zip 5.9 

433573 AF234887 HS.576S2 71m.2,EGF.cadhei1n.iamtnl cadherfn, E6F LAG seven^ass G4ype 5.9 

436415 BE265254 Hs.343258 PeptidaseJA24,Furt(v4iln. proliferBilon^ssociated 2G4. 38X0 5.9 

413900 AW409747 Hs.75612 TPR.PDZ.WW,GuanyfalQjdn: stressMuced-phosphoproleln 1 (Hs 5.9 

20 441455 AJ271671 Hs.7854 Zip;T^^Y;SS=M zinc/iron regulated transpcrter-filt 5.9 

444006 BE395085 H3.334762 ldljGcepLa^.MHCJ:TM type I transmembrane protein Fn14 &8 

408269 AW888219 Hs.44077 CH;TM=M;$S=N pan^. alpha &8 

411372 AI147861 Hs.213289 Glyco_transt11.EGF,tdLr low density lipoprotein receptor (f 5.8 

^_ 450825 ACOOSgSA Hs.25527 PDZ,Guanylate k]n:TM=N;SS tight Junction protein 3 (zona ocd 5.8 

25 456534 X91ig5 Hs.10G523 UM,POZ,pktnase;TMsN;SSsM phosphoHpase C, beta 3, neighbor p 5.7 

451558 N^L001089 Ks.26630 ABCLtran,SRP54tTM»Y;SS»M ATP-bindlng cassette, sub-fantily A 5.7 

446812 AL042279 Hs.16206 pklnase uncharacterized hypo&iaiamus protei 5.7 

424307 AW293399 Hs.356377 nudear receptor anepiBSSorl 5.7 

405484 )a4_093451 TM»N:SS»M C3002124*:gl|12737280|renXP_008682 5.7 

30 425367 6E271188 KS.15597S TM=I\4;SS=Y protein tyrosine phosphatase, recep 5.7 

444607 AW405635 Hs.293687 PI-PLC-X.PH,P».PLC.Y ESTs 5.7 

421456 AW579842 Hs.104557 zf-C2H2.DUF18.efhand.C2,P hypothetical protein FU 10697 5.6 

412810 M21S74 Hs.74615 lQ,pldnas8^11;TiyNM;SS= plalele(<feffved growth factor lece 5.6 

450334 AF035959 Hs.24879 PAP2:TMNY:SS»M phosphalidic add phosphatase type 5.6 

35 453880 A1803166 Hs.135121 HSP70,none ESTs. WeaMy similar to 138022 hypo &6 

439578 AW263124 Hs^0547 WD40:TM=M;SS=N nudear receptor c&feprBssar/HDAC3 5.6 

450954 AI904740 Hs.25691 TM»Y;SSsM receptor (catdtonln) activity modi 5.6 

414555 N98569 Hs.76422 phoslipJft«'M;8&>Y phosphoHpaseAZ. group llA(piatBl &6 

409963 AA133590 H&377830 MBOAT^wne csildumfcdhiodufin-dBpendentprotei 5.6 

40 450463 AW952016 Hs.201398 C1q,Coflagen;TM»M;SS»Y G protein coupled receptor Interact 5.6 

425177 AF127577 Hs.155017 TM=N:SS=M midear receptor interacting prolei 6.6 

445496 AB007860 Hs.12802 SH3.ank.PH;^Gap;TM=M;SS development and diffefenfiaBon enh 5.5 

428981 BE313077 Hs.93135 imi EST8,W6aidy8nnlartDAUKLHUMAN 5.5 

424441 X14850 Hs.l47097 hislone.C8FD_NFYB_HMF:TI\*= H2A hlstone family, member X 5l5 

45 415662 AW972481 Hs.170810 pldnase.none ESTs, Highly slndlar to G01887IMEK 5.5 

422105 A192g700 Hs.111680 TM=M:SS=N endosulflne alpha 5.5 

429556 AW139399 Hs.314807 TM=IW:SS=N ESTs 6.5 

408056 AA312329 Hs.42331 Ephrin:TM=M;SS=M ephiitvA4 5.5 

425205 NM_005854 Hs.155106 TM=Y;SS=N receptor (cadtonin) activity modi 6.5 

50 444633 AF111713 Hs.12284 ig;T1^Y:SS=fi^ Juncfonal adhesion molecule 1 5.5 

431565 AF161470 Hs.260622 TM^YiSS^J butyrate^nduced transcript 1 5.5 

429655 ti48959 Hs.211582 pkinase,th3,lg,none myosin, light polypeptide kinase 5.5 

431686 L77964 Hs.271980 pkin3se;TM=M;SS=N mitogen-activated protein kinase 6 5.5 

453143 AA382234 Hs.356289 8erptn;TMsN;SS=M protein tyrosine phosphatase, recep 5.4 

55 451883 AL120634 Ks.331803 cpn60.TCP1.E1-EajVTPase.C ATPase,Catranspor(inB. plasma mem 6.4 

422293 X94453 Hs.114366 aldedh.aakin3se;TM4i;SS<:N pym^Qne-S^arboxylatesynflietase 5.4 

432179 X75208 Hs.2913 EPHJbd.fn3.pkinasa,SAMtT EphB3 5.4 

408048 NM.007203 Ks.42322 Pardemmin;TM=M;SS»f4 AkIn3se(PRKA) andtorprot^ 2 5.4 

446153 Y10805 HsJZ0521 NusG;TlyNN:SS»M HMT1 (hnRNP methyltrensferase, S. c 5.4 

60 421251 Z28913 H8.102948 lJM.POZ:TM»N;SSsM enigma (UM domain proldn) 5.4 

439039 A1656707 Hs.46713 pkhase,none ESTs 5.4 

409882 AJ243191 Hs.56a74 HSP20:nA=N;SSsM heat shock Z7KD protein family, mem 5.4 

451295 Ai557212 Hs.17132 pk{nase.DA6J>&blnd.pkIna ESTs. Moderately similar to 154374 &4 

^. 442549 AI751601 Hs.8375 MATH.z^TRAF,z»:3HC4;TM' TNFreceptor-assodatodfador4 S.4 
65 445330 AFO55O09 Hs.13456 DA6KcDAGKaank.WD404)ZIP Homo safriens done 24747 mRKAeeque 5.4 
453082 H16635 K5.31608 ioiutrans:TM»Y:SS=M hypoQtetical protein FU20041 6.4 

426432 AF001601 Hs.169857 Arylesterase;TM=ft«SS=N p3raoxonase2 5.4 

415753 US2819 Hs.78781 PDGF;TM=M:8S^ vascular endothelial growth factor S.4 

450776 U81375 HS.2&450 NudeosidQ.tran;TM=Y;SS»M solute carrier family 29 (nudaosid 5.4 
70 414739 U83867 Hs.77198 efhand.8H3,8pedrin:TM=N; spedrin. alpha. norMiythiDcyOcI 5.3 

421233 AA209534 Hs.284243 transmembrane4;TM=Y:SS=M tetraspan NET-6 protein 6.3 
414774 X02419 Hs.77274 ki1ngle.trypski,ptant.ihi plasminogen acSvator, uroMnase 5.3 

414368 W70171 H5.75939 PRK.CoaE;TM=N;SS=N uridine monophosphate kinase 5.3 

^- 446051 6E046061 Hs.37054 EphftnA.de3min.dsnn,z-al ephrln^ 5.3 

75 423619 T48691 HS.2491S9 7liTLl.7tmJ;TM=Y;SS»M adrenergh; slpha-2A^ receptor 5.3 

440188 AKD01812 Hs.7036 ROK;TM=M;SS=N fMcetytgtocosamlne Unase 5.3 

414135 NH.004419 Hs.2128 Rhod8nese,OSPc.YjihQSphal dual &pedik:ity phosphatase 5 5.3 
444838 AV651680 Hs.208558 lntegrtnJ^F&GAP,none ESTs 5.3 

_^ 447918 AI129320 HS.11S175 pktnase.SAM.none ESTs. Highly similar to JC5618gamm &3 

80 405517 AF000974 Hs.1 19498 UM;TM=ft^SS=N thyroid honnone receptor tnteractor 6 5.3 

413588 AA971014 Hs.75432 IMPDH.C,C8$.IMP0H_N;TKNll: IMP (inosine monophosphate) dehydro &2 
411089 AA456454 Hs.355702 celldivlstoncyde24ike1(PfT8L &2 

416157 NM.003243 Hs.342874 anaLpeDuddacTlllsY;S8°M (ransfomifog growth fador. beta re 5.2 
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AB020629 


HSk38095 


ABCjranjT<K;TM=T;55Ryi 


ATP^tirafing cassette, sub4Biidy A 


C 9 


446108 


AUMOOsO 


Ks.42322 




A LSmm-m /DDI/At nnnhnr NMUiiJn O 

A nnaso (rKKA) ancnor pfoiBin < 


C9 








l:lS;iM=M,bo-N 


Els2 repfossor factor 




JUTAQD 
4I/U90 


/\oU1r«KO 


n3.i73oo9 




inzziea (urosopfuia} nomoiog r 


R 9 
0*c 


430526 


AC4IHOCO 


nS^42407 


7in\„3;TNpy;55=M 


G proteiivcoupled recaptsr, famOy 


C9 


4i41f D 


DCI4U0JO 


Hs,75794 


7nT\_i ,Q\Co; iM=T ;o5FN 


Cnr* 0 f„...1..1hj.tl^l ■ITttamnnnlLfi.il. 


C 9 






nS.34o9D 




nyp0ln8uCai piDiBtn rU/JdSM 


C 9 


417896 


AA37a//U 


Hs.B2o90 


DAD;TM=Y;SS=M 


oafenoef against can daatn 1 


5.2 




AVAJOCCDC1 


Ub ICOOCC 


BTB.PbpJ^I 2B j)ropep,R6pro 


knippekelalod zinc finger protein 


C 9 


421637 


Ar1351oo 


nS.1DooQ2 


AAA - -> U _JaJO 1 MD AD 

AAA,Oic4aLN,odC4o_2,No-/v\ 


N-etnyiniaiairriiOa-sensiuve factor 


C9 


429379 




nS.2u0598 




t/1 A AAC^T _<^>.A. rl« «ii 

KiAAuW/ gend proQUCi 


C9 


429619 


AL1 20751 


Hs^11568 




eUkaryoDc transbSon IniQation ( 


5.2 


437275 


AW976035 


Hsu(92398 


Fnzzledirz 


ESTs, WeaUy sndar to A475B2 B-oa 


S.1 


421071 


Au11238 


H5,1 04476 


TMsY:8S4A 


ESTs, Waany siRttlar b> CGHU1E ooD 


5.1 


448581 


NM-002709 


Hs.21537 




protein phosphatffie 1, catalytic su 


8.1 


452568 


AA805634 


Hs.300870 


n3_Pl4Jan8se:TM=M:SS=M 


Komo sa{»ens iro<NA; cuNA DKrZp547M07 


5.1 


452069 


A8028949 


H$.183994 


ft - ■ 1_A A» Al 

Metallopno5;TM^SS=N 


KIAA1026 protem 


5.1 


437175 


AW968076 


HsJ7773 


p)dn83Q,pldn8S8jCynone 


pfoirai Rnasa^ cAiwr-oepefMeni cat 


5.1 


437056 


AI147061 




5pecWn,SH3,PH.CH 


glxok33a11,s1 SoaresLNSF^j9W.OT_P 


5.1 


450998 


6E387614 


Hs^5797 


inn;TM=M;SS=N 


spScing factor 3b, subuntl 4« 49!(0 


5.1 


444441 


AW613841 


Hs.301394 


IRK;TM=Y;SS=N 


tiypotheBcai protein M6C3t01 


5.1 


448528 


BE613248 


Hs.172084 


PHO;TMs|yl;SS=N 


KOmo sapiens, done IMA6E:3627860, 


5.1 


452345 


AA293279 


H&29173 




*- ■ ja. - jf ■ A_*_ pa l1Af4e 

hypometical (xoton FU20515 




443412 


W84893 


H3,9305 




angiotensin receptor-Kcs 1 


5.1 


412853 


M34175 


Hs.74626 


AciaptiiLN^naL8oaptinC2 


adaptor-related protein complex 2, 


5.1 


439866 


AA280717 


Hs,6727 


lTm,NTF2:TM=M;SS=N 


Ras^GTPase acovating protein Sri3o 


5.1 






bte.6817 


noil 1 1 p_PMus, 1 wi~pn,wo~i'i 


fawslnn IrinhnsnluMsKA fntirlPfKlflA 


5.1 


435S23 


T62B49 


H&11090 


TM=Y:SS=M 


(nambrand-^tafudtig 4<k]inains, sutiteni 


i\ 


433423 


BE407127 


Hs.8997 


HSP70JgPpx-GppA;BNyi:SS 


heat sliock 70kD protein 1A 


5.1 


412841 


M16660 


Hs.74335 


HSP9aHATPa5ejc;TM=ll4;SS=N 


l)eat&hocl(90](D protein 1,t)eta 


5.1 


431236 


AV656840 


Hs.285115 


fh3:TM»Y:SSsM 


inteHeuMn 13 receptor, alptia 1 


5.1 


438552 


AJ245820 


Hs.6314 




type I transmemtjrane receptor (seiz 


5.0 


422765 


AW409701 


Hs.1578 


BIR:TM=M:SS=N 


bacutovlral lAP repeat-containing 5 


&0 


427502 


At811865 


Hs.7133* 


TM=M;SS=N 


Homo sapiens, clone IMAG£:3161564, 


&0 


414166 


AW888941 


Hs.75789 


DEAO.heScase C.rnn,Ndr,C 


14-myc downstream regulated 


5.0 


424954 


NM^000546 


Hs,1846 


P53.WD40.1RK;TM=M;SS=N 


tumor protein p53 (U-Fraumeni synd 


5.0 


422089 


AA523172 


Hs.103135 


REJ.PLAT,PKD,WSCLRR 


ESTs, Weaidy simOar to SFR4_HUMAN 


5.0 


426638 


BE242634 


K3^55 


ThiF,UBACT;TM=M;SS=N 


Ut)!quitin'activating enzyme El (A1S 


&0 


410793 


AV\^1906 


Hs.66392 


SH3.efhand.C2,PH.RhoGEF,M 


intersectin 1 (SH3 domain protein) 


5.0 



TABLE 5B 



Plcey: Unique Eos probeset identifier numtier 
CAT numt)er Gene duster ntimt)er 

Aooessloni Genbank accession nuintwis 

PIcey CAT Number Accession 

437056 428504.3 AW976398AI147061 AA765223AA743380A1803927 
TABLE SO 

Pl«y: Unique number corresponding to an Eos probeset 

Refi Sequence source. Tlie 7 digit numbers )n tliis column are Genbanlt Identifier (Gl) numbers. Duniiam i. et al.* refers to ttie pubUcafion entitled "Ttie DMA 

sequence of human chromosome 22.* Dunham I. et aL, Nature (199^ 402:4SM95. 

Strand: Indicates DMA strand from whteheocons were predicted. 

NCposifion: indicates nudeoOde positions of predidedexons. 

PIcey Ref Strand Ntj)os{{Ion 

405464 5922025 Plus 199214-199579.199872-199920,200262.20049 



TABLE6A: 777 GENES UP-REGUUTED IN CaON CANCER COMPARED TO NOfWL ADULT TISSUES 

TaUe 6A Gsts777 genes qnegutated in colon cancer compared to nomid adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Htj03 GeneCMp array such 
ftai the ratio of 'evergge' colon cancer to 'average* nonnal adult tissues was greater than or equal to 3.0. The 'average' colon cancer level was set to the gO" percenlUe amongst 95 
colon cancers. Th9 'average' normal adult tissue level was set to the 90* percentile amongst 209 non-mangnant tissues. In order to rerrove gene-spedBc background levels of non- 
spedlic hybridizalion. the 15°* perceniaa value amongst the 209 non-maBqnant tissues was subtracted from boft the numerator and the denominator before the raBo wa s evahiated. 

Pkey: Unique Gos probeset Identitier number 

ExAocm Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1; RsSo of tumor to nomd body flssue 
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447033 

409041 

418007 

422330 

428187 

428368 

422956 

416209 

444783 

415989 

427585 

441031 

421470 

439759 

449032 

450531 

432887 

443211 

450221 

406984 

410355 

441377 

413318 

442409 

440304 

426427 

451561 

434032 

426365 

422420 

428664 

446232 

424252 

450149 

419752 

422011 

452461 

431808 

400534 

448706 

453888 

426890 

408950 

423020 

453922 

447505 

425761 

404567 

428536 

414972 

459504 

438018 

447663 

442353 

424905 

451917 

420900 

438839 

439521 

445676 

408489 

418738 

446311 

423349 

400195 

411765 

418895 

424653 

448776 

417218 

414559 

445436 

403778 

433447 

407168 

419335 

422505 

458242 

452943 

446155 



AI357412 

AB033025 

M13509 

D30783 

AI687303 

BE440042 

BE545072 

AA238776 

AK001468 

AI267700 

031152 

AI110684 

R27496 

AL359055 

AA045573 

AW301032 

AWD16936 

A)128388 

AA328102 

M21305 



BE218239 
AU076607 
BE208843 
BE159984 
M86G99 
NS2812 
AW009951 
AA295331 
U03398 
AKD01666 
AI281848 
AK0OO520 
AW969761 
AA249573 
U30246 
N78223 
M30703 

AW291095 
AW381270 
AA393167 
AA707814 
AA383092 
AF053306 
AL049266 
AW664214 

AI143139 

BE263782 
BE5141Z7 
AK001160 
ALD47611 
BE379594 
NM-002497 
AW391351 
AL04S633 
AI27036O 



AI247763 

Ai082437 

AW368833 

AW007294 

AFD10258 

NA 

H43346 

AA894638 

AW977534 

6E302464 

AA005247 

AV656184 

A1224105 

U29195 

R45175 

AW980146 

AL120862 

6E2S9588 

BE247449 

A1553895 

T5112S 



Hs,157601 
Hs^l 
Hs.83169 
HS.11S263 
Hs.285529 
Hs.83326 
H3.122579 
HsJ907e 
Hs.62160 
Hs.317584 
Hs,l 79729 
Hs.7645 
Hs.1378 
Hs.67709 
Hs.22900 
Hs.203800 
Hs.233364 
H8.143655 
HsJ4641 

Hs.153057 
Hs.202656 
Hs.75285 
Hs.129544 
Hs.125395 
Hs.169840 
Hs.177403 
Hs.206892 
Hs.183861 
H5.1524 
HS.1B9095 
H9.194691 
Hs.143811 
H5,132863 
Hs.152618 
Hs.110738 
H3.108106 
Hs.270833 

Hs^1814 
Hs.194110 
Hs.41294 
Hs.14945 
HS.16Q8 
Hs^6708 
H3.18724 
Hs.198729 

Hs^ 

Hs.77695 

Hs.5999 
H8.288885 
Ks.49136 
Hs.153704 
Hs.50820 
Ks.44269 
Hs.31409 
Hs.58248 
Hs.16928 
Hs^6690 
Hs.6682 
Hs.149795 
Hs.127428 



EST8 

K1AA1199 protein 

matrix melailoproteinase 1 (intBfsQtiai 



G prolebvcoupled receptor 49 
matrix melaOoprotidnasa 3 (stramely^ 
hypoOietlca] protein aJ10461 
MAD2 (mitoiicafTest deltdent, yeast, ti 
aniinn prasophila Scraps homOtog}. ad 
ESTs 

collagen, type X. alpha 1 (SdwnUmetaph 
fUogen. B beta polypepSda 

annextnAS 

Homo sapiens mRNA tull length insert cON 
nuctear tedor (eiyfhrokMerived 2Hik 
ESTs 
ESTs 
ESTs 

cytoskeleton assodated protein 2 
gbiHuman alpha satellite and sateOite 3 
spenn assodated antigen 1 
ESTs 

lntBr^phaColobuBn}lnhR)ltor. H2pol 
hypotheticdpn]leinA46C15438 
ESTs 

TTK protein kinase 
ESTs 
ESTs 

Homo sapiens cONA FIJ20042 fis, done CO 
tumor neciosb factor (ligand) supofam) 
shnQartoSAai (sal (Drascph!la).Gke 
reSnolc add Induced 3 
hypothetica! protein FLJ20513 
2c family member 2 (odd-paired DrosophI 
ESTs. Moderately similar to ZN91.HUMAN Z 
solute carrier ^Dy 12 (sodIuni/k)otassl 
transcription factor 

amphiregutin (scliwannomfrderivad growth 

InterleuUn 20 receptor, alpha 
hypotheOcd protebi PRO2730 

ESTs 

lorig fatty acyt-CoA synthetase 2 gene 
repHcation protein A3 (14kO) 
budding unbihibtted by benzimUazoles 1 
Homo sapiens mRNA; d)NA DKFZ^093 (fr 
ESTs 

vlslnln^l 
KIAA0008geneprodud 

gb:60131S874F1 NiH_MGC_8 Homo sapiens cD 
hypothetical protein FU10298 
Homo sapiens CONAFU14246 fis. done OV 
ESTs. Moderately 8iirilIartoALU7_HUMANA 
raMA (never h mitosis gene aHdaled k 
Homo sapiens unknown mlW 
ESTs 
ESTs 
'ESTs 
ESTs 
ESTs 

solute canler family 7, (catlonlc amino 
ESTs. Moderately simSar to AUJ1.HUMAN A 
homeoboxAO 
NA 

gb7p09a04 j1 Scares breast SNbHBst Homo 
H3.14600 ESTs 

Hs.1 51469 caldumfcabnodurm-dependent serine prot 
Hs.30057 MRS2 (S. ceiBvisIae)-Ske, magnesium horn 
Hs.285754 met prDto«noogene(hef»tocytB growth 
Hs.76452 Ofeadive protein, penfraxliHelaled 
f^.151408 ESTs 

Hs.3281 neuronal pentradn 11 

Hs.117183 ESTs 

Hs^137 hypotheScal protein FU12888 

Hs.124165 ESTs 

HS.2B465 Homo sapiens cONA:FU21869 fis. done H 

HSJ1082 hypotheticsl protein FU1052S 

Hs.159422 Homo sapiens CCINAFU13997 (is, done Y7 

Hs.8493 ESTs 



31^ 

29.00 

26.41 

24^8 

24.00 

23115 

22.70 

21.60 

21,15 

20J5 

19.35 

ia68 

18.05 

17.30 

17.15 

16.60 

16.35 

15.80 

15.10 

15.00 

14.70 

14.45 

14.35 

14.35 

14.25 

13.60 

12.60 

12.75 

1Z65 

12.55 

1Z40 

12.25 

1i18 

11.85 

11.80 

11,65 

11.42 

11.35 

11.00 

10.75 

10.75 

10.60 

10.55 

10.50 

10.50 

10.40 

10.25 

10.15 

10.10 

10.05 

9.95 

9.90 

9.85 

ass 

a78 

a73 
a68 

9.55 
9.55 

aso 
aso 

9,37 

aoo 
ag6 
a90 
a90 
a8s 
aso 
a75 
a65 
a64 
a50 
aso 
aso 
aai 
a30 
a2s 
a20 
ai5 
aio 
ao5 
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416561 AA634543 Hs.7g440 IGF-11 mRNMiinding protein 3 8.00 

443814 AV655366 Hs.7645 fibrinogen. B beta pdypeptide 7.93 

406360 NA NA 7.80 

443450 N66045 H3.133529 ESTs 7.75 

5 414422 AA147224 Hs^7232 ESTs 7.75 

442611 6E077155 Hs.177537 hypothetical pra(BlnDKFZp761B1 514 7.70 

438604 AA811896 Hs.44604 ESTs 7.60 

447254 NM.004153 Hs.17908 Ofl^faoognnionoomplex.8ubunit1 (y 7.55 

400250 NA NA , 7.53 

10 420759 T11832 Ks.127797 Homo sapiens cDNAFUII 381 Bs, clone HE 7.50 

404996 7^ 

450459 AI697193 Hs.299254 Homo sapiens cDNA: RJ23597 to, done L 7.45 

445019 A1205540 K8.281295 ESTs 7.30 

409269 AAS769S3 Hs.22g72 hypotheticai protein aJ13352 7.25 

IS 448816 AB033052 H5.22151 KIAA1226 protein 7.25 

4M361 W76027 Ks^3920 hypothetlcalprotehFUIIias 7.25 

449370 AK002114 Hs23495 hypothetica) protein FU1 1252 7.20 

433859 AW896758 Hs^73789 ESTs 7^ 

416143 AI955650 Hs.79033 glutamlnyH^pBdecyclotransferasefelu 7.20 

20 456120 AA535244 K5.78305 RAB^ member RAS oncogene famity 7.20 

419751 AW195581 Hs.93121 KIAA0761 protein 7.16 

417830 AW504786 Hs.122579 hypothetical protein FU10461 7.15 

456553 AA721325 Ks.189058 ESTs. Highly sbnOar to Similar to a Oe 7.15 

421373 AA808229 Hs.167771 ESTs 7.10 

25 418763 AK000219 H3.88367 hypothetical pn3teinFU2021 2 7.00 

423248 AA380177 Hs.125845 ribulo&&-6i)hosphatfr^ptmer8se 6.96 

444798 BE242144 Hs.12013 ATP-blndlng cassette, sub-family E(OABP a95 

429486 AF155827 K3.203963 hypothetical pTOtdnFU 10339 &95 

413573 A1733859 Hs.149089 ESTs &95 

30 442660 AW138174 Hs.l30651 ESTs 6.93 

427878 C05766 Ks.181022 CGl-07 protein 6.90 

438394 BE379623 Hs.27693 peptidytprotyi Isomersse (cydophiOnH &85 

422711 D60641 Hs^1739 Homo sapiens mRNA.'cONADKFZ|>586l1518(f 6.82 

457030 AI301740 Hs.173381 dihydropyrimidinase-nke 2 6.80 

35 417168 AL133117 Hs.61376 Homo88piensmRNA:cONAOKFZp586L1121 tf 6.80 

442973 BE567665 Hs.268550 Homo sapiens cONA:FU23156 lis, done L &80 

453102 NM-007197 Hs.31664 fiizzled (Drosophlla) homdog 10 6.75 

416018 AW138239 Hs.78977 proprotein converlase sMsin/lcexin t 6.75 

424051 AL110203 Hs.138411 Homo sapiens mRNA;cONAOKFZp586J1922(f 6.71 

40 401644 B.70 

450434 AA166950 Hs.19S870 hypothetical protein FU14991 a70 

428479 Y00Z72 Hs.184572 oeH division cyde 2, G1 .to S and G2 to &69 

406747 AI925153 Hs.217493 annexlnA2 &65 

445191 AF048666 Hs.12393 (ITDP-D^ucose 4,6^hydratase 6.60 

45 424298 A1631874 Hs.155140 casein kinase 2. alpha 1 pdypeptide 6.60 

428392 H10233 HS.226S secretory granule, neuroendocftne protd 6.55 

427072 H38046 Hs.303193 ESTs 6.55 

452588 AA889120 Hs.110637 homeoboxAlO 6.53 

439809 R41396 Hs.101774 hypothetical protdn nJ23045 6.50 

50 423123 NM 012247 Hs.124027 SELENOPHOSPHATE SYNTHETASE; Human selen ^ 

418454 AA315308 Hs.1 95870 hypothetical protein FU14991 6.50 

423685 8E3S0494 Hs.49753 uveaiautoanligenwiOiGdIedcoiidoinal 6.60 

447342 A1199268 Hs.19322 Homo sapiens. Stmilar to RIKBIcDNA 2010 &50 

410908 AA121686 H8.10592 ESTs 6.47 

55 406671 AA129547 Hs.285754 met protooncogenaChepatocyte growth fa 6.47 

450638 AK001826 Hs.25245 hypothetical protein FU11269 &45 

452838 U65011 Hs.30743 praferentmiy expressed anOgen in meta &42 

451389 N73222 H3.279009 matrix Glaprotdn a40 

438202 AW169287 Hs^588 ESTs 140 

60 452198 AI097560 Hs.61210 ESTs, Weakly similar to 138022 hypotheU &40 

425860 1^9339 Hs.1964 sdute carrier family 5 (sodlum/^tucose ^37 

435538 AB011540 Hs.4930 low density lipoprotein receptor-ietated asS 

436539 AI005457 Hs.275048 ESTs a35 

417404 mjmsO Hs.82101 ptedcslrin homology^Bce domdn. family a34 

65 430388 AA356923 H8.240770 nudearcap binding protein subunit 2. 2 a32 

425905 AB032959 Hs.31B584 novel C3HC4 type Zinc finger (ring finge a31 

407237 AA169872 Ks.6216 Homo sapiens hepatoceOUtarcarelnom^as a30 

413597 AW302B8S Hs.117183 ESTs aSO 

„ 429529 AA454190 H3.24283 ESTs, Moderately Mar to reduced expr a30 

70 409916 BE313625 Hs.57435 solute earner fanfly 1 1 (protorHXWpted 6.25 

407746 AK001962 Hs.38114 hypothetlcai protein FU 11 100 a20 

426921 AA037145 Hs.172865 deavagestimulatkmfador.a'pre-RNA. 6.20 

438050 BE262816 Hsi061 protdn kinase. AMP-adhrated, beta 1 no 6.20 

416857 AA188775 Hs.292453 ESTs a20 

75 409683 U33317 Hs.711 defensin.Blpha6,Panethce&«pedSc ai8 

452291 AF015592 Ks.28853 COC7(ce!l(fivlsioncyde7.S.cerevIsl ai5 

420096 AA775910 Hs.95011 syntrophin. beta 1 (dystropMn^sodate ai5 

448693 AW004854 H3.228320 hypothsBcal protein FU23537 6.15 

433393 AF038564 H8,98074 itchy (mouse homdog) E3 uKqdtln prote ail 

oO 424745 AA214618 H3.152759 acfivatordS phase Wnase aiO 

408771 AW732573 Hs.47584 potassium vdtage^ateddtannel, delayed aiO 

454438 AA224053 Hs.172405 celdlvistoncyds27 a08 

407771 AL138272 H3.62713 ESTs a08 
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AI9Z7382 


Hs.29857 


ESTs 


&05 




AA314907 


Hs.65950 


ESTs 


&W 




NM-.016117 Hs^7182 


ftfiosph(^pas8 A2'acSvaling pnisin 


&00 




AB00942S 


Hs.560 


opoiipopnxBin b rnrvxA ooiung mtjum, C8 


oso 




AI160873 


Hs.69233 


zinc finder protsln 


Sl9d 




BE018217 


Hs.183528 


nypo8i8ucai protein rUi^yuo 


Sk95 




AW407987 


Hs.173518 


M-phas6 phosphoftftd^ homoiog 


C AC 


•11 W*! 


AA134006 


Hs.79306 


euksvyoQc trsnsls&xi irdlisfion factof 


5i95 


<K)90W 


R22479 


Hs.167073 


rorno sapiens cuNA rLJ 1 r ns, oons NT 


5.90 




BE243842 


Hs.283077 


centrosomd P4.1-8ssodated piotein; uic 


Su90 


429201 


X03178 


Hs.198246 


9roui>5pedfic component (viMn 0 bind 


&87 




BE296227 


Hs.250822 


SGilneAhieonlne idnase 15 


Si86 


413585 


AI133452 


Hs.75431 


fibrinogen, ganunapc^ypepOde 


5.88 




AL118812 


Hs.274293 


Honn sapiens niKNA;cuNAuKFZp7DlG11 11 (f 


5.85 




AK001380 


Hs.145479 


Homo8a|»ens cDNA FlJ10518 fis, cJwie NT 


&85 


441645 


AI222279 


Hs^1555 


ESTs, Weakly similar to T23406 hypothefl 


5.65 


409187 


AF154830 


H5^0966 


cart)anioy1-phosphatB synQietasa 1, miloch 


5.85 


406687 


M31126 


Hs^2620 


preonanpy spocffic b8t9>1>g^fcopnDt^ 9 


&85 


452281 


T93S00 


Hs^2 


II- — - « 1 a_ -1 w 

Homo sapiens cDNA FU11041 fis, done FL 


5.82 


429345 


NA4.008729 Hs.226483 


diaphanous (DrosopMte, hocnoloo) 2 


S.60 


459309 


M040620 


Hs.5672 


hypotheuca) protein AF1 40225 


5.80 


410080 


NM.001448 Hs.58367 


giyptC8n4 


5.79 




AA331247 


Hs.86617 


cST8 


5.77 


454030 


AA3747S6 


Hs.93560 


Koino sapiens inRNA lor KIAA1771 proran, 


Si75 


430345 


AK000282 


Hs.239681 


hypothetical proton FU20275 


5.75 


411096 


U80034 


Hs.68583 


mitochondrid intermediate peptidase 


5.75 


429125 


M446854 


Hs.271004 


ESTs, Wealdy simBar to 138022 hypotheQ 


5.75 


442357 


AI949952 


H5.49397 


ESTs 


5.75 


426642 


AW068223 


Hs.171581 


ublquiUn C-termina nydrnase UCH37 


5.70 


426518 


Z43039 


H$.170198 


K)AA0009 gene product 


570 


441894 


AA134329 


Hs.24170 


Homo sapiens, clone IMA\3c:3oBo39q, ntnNA, 


5.70 


415385 


R17798 


Hs.7535 


COBW-UKe protein 


5.70 


409757 


NM_001898 Hs.123114 


cystaunoN 


5.69 




AA743991 




gD.ny3/gui.si Ni^UwrjrTio Homo sapiens 


5.00 


424492 


AI133482 




gb:HA2093 Human fetal Fiver cDNA librafy 


5,60 


452606 


N45202 


Hs.90012 


nypoirietica proton rLJZ3441 


5.60 


438777 


AA625487 


Hs.142179 


ESTs 


5.60 


417235 


AA810276 


HSJ24250 


ESTs 


5.60 


451177 


A1969716 


Hs.13034 


ESTs 


5.60 


415227 


AWB21113 


Hs.72402 


ESTs 


5.58 


438217 


T53925 


Hs.107 


Qonnogen^ke 1 


5.56 


452881 


AW135220 


H5.241921 


ESTs 


555 




AA136553 


Hs.1975 


nypotneOca protra FUZiOO? 


5.55 




AI631984 


Hs.34447 


ESTs 


5.55 




AW969626 


Hs^1704 


col 8, Weady sinwar (d KIAA0227 pisapi 


6.53 


AAA^A^ 

444743 


AA045648 


K8.301957 


ninfix (nucleoside diphosphate fttked moi 


5.52 


442980 


AAS57025 


Hs.6878 


Mnean-UKel 


5.50 


418882 


NM_0049g8 Hs.89433 


ATP-tnnaing cassette, sub-family C (GFTR 


5.50 


453884 


AA35592S 


Hs.36232 


KIAA0166 gene product 


5.47 


444478 


WO/318 


H5^40 


M-phase phosphoprotein 1 


5.47 


419502 


AUO76704 


H3.90765 


fibdnogen, A di^a potypepSde 


5.47 


420216 


AW958037 


Ks^86 


r9>osomaj protein L4 


&45 


421155 


H87879 


Hs.102267 


iysyi oxidase 


5.45 


441421 


AA356792 


Hs.334824 


hypothetical pret^ FLJ14825 


5.45 


^6435 


AI880384 


Hs.270747 


coTs, Wesdy snwar to AlAi<J1UMAN ALU S 


5.45 


428046 


AWB12795 


Hb.155381 


ESTs, Moderately similar to 138022 hypot 


6.44 


446372 


AB020644 


Hs.14945 


long fatty acyt-GoA synthetase 2 gens 


5.42 


421477 


AI904743 


Hs.104650 


hypothetteai protein FLJ10292 


1 5.41 


409554 


AA045857 


H8.54943 


fracture callus 1 ^sQ homoiog 


6.41 


453080 


AI423056 


Hs^921 


hypothetical protein DKRp547A023 


5.35 


430217 


N47863 


Hs^6901 


ft tx)somat protdn 824 


5.33 


417372 


T99755 


HS.33472B 


ESTs 


5.30 


415139 


AW975942 


Hs.48524 


ESTs 


5.30 


412140 


AA219691 


Hs.73625 


RAB6 Interacting, Idnesin^ (rabUnes 


5.29 


424086 


AI351010 


Hs.102267 


lysyl oxidase 


5.27 


409327 


U1162 


Hs.53563 


ooQagen. type IX, alpha 3 


5,27 


417576 


AA339449 


K8.82285 


phosphoribosylglydnamide tomnyltransfer 


5.26 


452131 


AI860677 


Hs.72325 


Human DNA sequence from done RP1-187J11 


5.2 


436016 


AA806465 


Hs.121536 


Human ONA sequence from done RP1 1-472E5 


525 


449347 


AV64g748 


Hs^901 


KIAA0493 protein 


5.25 




AI635444 


Hs.143917 


(il^A7iJl1 1 nmtpJn 
aj**Qf n 1 1 • 1 pivicin 


9.«9 


453921 


AI824009 


Hs.44577 


ESTs 


5.25 


413582 


AW295647 


HS71331 


hypothetical protein M6CS3S0 


5.25 


421076 


AW007968 


Hs.233299 


ESTs. Wbaldy simiiar to 138022 hypolhefi 


5.25 


407884 


BE075316 


Hs.95011 


syntropMn, beta 1 (dystrophn>«ssociate 


5.24 


433384 


AI021992 


Hs.124244 


ESTs 


5.23 


422026 


U80736 


Hs.110826 


tilnucteoffde repeat containing 9 


&21 


447020 


T27308 


Hs.16966 


hypothetical protebFU11046 


5.20 


441795 


N58115 


Hs^1137 


AD024 protein 


5.20 


449416 


A!651016 


Ks.246311 


ESTs 


&20 


416379 


AA216940 


Ks.137516 


fldgeD^likel 


5.20 


426753 


TB9832 


HS.17QZ76 


ESTs 


5.18 


422109 


87328S 


H8.1473 


gaslifiwQieasIng peptide 


5.17 
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424085 NiyL002914 HS.13S226 lepBcalton factor C (activator 1)2 (40 5L17 

416806 N»^0002B8 Hs.79993 peioxlsoma]t)iogenesistelor7 &17 

424717 H03754 Hs.152213 wlnotess4ypeMMf1Vlnlegra8on6nBfaiiil &15 

- 427728 AJ24560D Hs.180545 Homo saptensnfiNAtiv hypothetical piots &15 

S 447713 AI420733 Hs.2O7083 ESTs 5.15 

425739 T19016 Ks.1 59410 motytidapteiin synthase sidfuiytase &15 

420170 U43374 Hs.95631 Human riOTnai keratbiocytB mRNA &15 

407244 M10014 H575431 fibrinogen, gamma polypGptide &13 

441139 AW449009 Hs.126647 ESTs 5.13 

10 451121 AW973795 Hs.128927 Homo sapiens cONAFU13903fis. done TH 5.10 

435202 AQ71313 Hs.170204 K1AA0551 protein 5.10 

431699 NM.001173 Hs.267831 Rto GTPase activa6ng protein 5 5^10 

416384 AW149256 Hs.25130 Homo sapiens cDNAFU14923fis, done PL 5.09 

422805 AA436989 Hs.121017 H2AtilstDnefamHy, member A 5.07 

15 411750 BES62298 Ks.71827 KIAA01 12 protein; tiomolog of yeast nlns 5.06 

435496 AW840171 Hs^6S398 EST8.W^aldy8imaartotransfbmiaSon-r 5.06 

434539 AW748078 H3.214410 ESTs, VteaklyslmlartoMUC2_HUMANMUaN 5.05 

413293 A1047483 Hs^02498 GTP-binding protein tuxnotogous to Saccha S.05 

445238 AK001676 Ks.12457 hypoOiet!calprot^FU10814 5.05 

20 415091 AIJ044872 Hs.77910 34Iy^l^oxy-3^Tletllylg!utaryW>)enzym8 A sy 5.05 

441675 A1914329 Hs.5461 ESTs 5.00 

449802 AWg01804 Hs.23g84 hypotheOcfi) protein FU20147 5.00 

401480 HA NA 5.00 

408582 AM36323 H3.31141 Homo sapiens mf»IAfbrKIAAlS68 protein. 4.95 

25 426269 H1S302 Hs.168950 Homo sapiens mRNA:cDNAOKFZ0S66A1O469 4.92 

414718 H95348 Hs.107987 ESTs 4.91 

419139 A1123517 Hs.269940 ESTs 4.90 

430789 AA632S77 HS.31023S ESTs, Weakly similar to 178685 serineAh 4.90 

425420 BE536911 Hs.234545 hypotheOca) protein NUF2R 4.90 

30 408758 NM.0036B6 Hs.47504 exonudeasel 4.90 

439741 8E37g646 Hs.6904 Homo sapiens mRNA full length insert d)N 4.90 

418612 AB037788 Hs.224961 cleavage and polyadenyiationspedftc fa 4.90 

433927 AI557D19 Hs.116467 small nudearpfotebiPRAC 4.89 

450568 AL050078 Hs.25159 Homo sapiens cDNARJI 0784 lis. done NT 4.88 

35 455777 AA524285 Hs.154172 ESTs. Moderately slmiiar to BCGF.HUMANB 4.87 

426743 A1030060 Hs.301549 Homo sapiens mRNA; cONA DKFZp564H172 (fr 4.87 

421126 M74587 Hs.102122 insufhvlike growth factor binding prote 4.66 

411835 U29343 Ks.72550 hyaluronan^atedmofiiity receptor (R 4.85 

418396 AI765805 Hs.26691 ESTs 4.85 

40 430510 AW162916 Hsj!41576 hypothetical protein PR02577 4.84 

408430 S79876 H8.44926 dipeptldyipep8dase!V(C026. adenosine 4.60 

406414 4.75 

430178 AW449612 Hs.l52475 ESTs 4.71 

411901 AA166730 Hs.6966 Human DNA sequence fiom done RPM87J11 4.70 

45 404025 NA NA 4.70 

451807 W52854 Hs.27099 hypotheHcd protein HJ23293s{mnar to ^ 4.66 

436662 AI582393 Hs.126695 ESTs 4.68 

414140 AA281279 Hs.23317 hypothefical protein RJ14681 4.68 

410044 BE566742 Hs.58169 highly expressed in cancer, (teh in leuc 4.65 

SO 431041 AA490967 Hs.197955 KtAAO704prol^ 4^ 

417860 AW408557 Hs.235498 ' hypotheOcd protein FU14075 4.65 

410658 AW10S231 Hs.192035 ESTs 4.65 

425895 At2694d4 Hs.161427 zinc finger protein 215 4.65 

422892 AA988176 Hs.121553 hypotheCcal protein FU20&41 4.65 

55 438397 AA715013 Hs.169335 ESTs 4.60 

439225 AA192669 Hs.45032 ' ESTs 4.60 

423197 T91418 . Hs.125156 transcriptional adaptor 2 (ADA2. yeast. 4.60 

413374 NM.001034 Hs.75319 ribonucleotide reductase M2 polypeptide 4.60 

412723 AA648459 Hs^3S851 hypothefical protein AF301222 4.59 

60 42S745 IM4060 Ks.14427 Homo sapiens cONA:FU21800 lis, done H 469 

452795 AW392555 Hs.18878 hypothetical protein FIJ21 620 4.58 

430704 AW813091 Hs.335799 ESTs 4.56 

429682 NM.006306 Hs.211602 SMC1 (structural malntenanco of dunmoso 4.55 

433326 AI379488 Hs.1S9430 ESTs 4.55 

65 437958 BE139550 Hs.121668 ESTs. Moderately sbnOar to PC4259«eni 4.55 

410566 AA373210 Hs.43047 Homo sapiens cONAnJ13S85 lis. done PL 4.55 

423343 AA324643 Hs.246106 ESTs 4.55 

416467 H57585 Hs.37467 ESTs 4.65 

408867 AA437199 Hs.6S6 ceQdMs)oncyde25C 4.54 

70 419423 D2&488 H3.90315 KIAA0007 protdn 4.54 

414132 AI801235 Hs.48480 ESTs 4.53 

423948 AW392342 Hs.283077 centrosomd P4.1-a8SOdated proteta; unc 4.53 

425746 NM.001701 Hs.159440 bSe add Coenzyme A: amino add 4.50 

451009 AA013140 H5.115707 ESTs 4.50 

75 431084 A1903735 gb:MR-BT035-2001994)31 8T035 H0mo8apien 4.50 

432725 AL137496 K8.g001 ESTs 4.50 

400298 AA032279 Hs.61635 sb( transmembrane epifhellalanfigen of 4.50 

410486 AW235094 Hs.69233 zinc finger protein 4.50 

428532 AF157326 Ks.184786 TBP-lntsracting protein 4.50 

80 429782 NM_005754 H3.220689 Res-GTPBS&«dIvaBng protein SHMomaIn 4.50 

408380 AF123050 Hs.44532 diublquifin 4.49 

423936 U77629 Hs.135639 adiaet»«cutooomp(Bx(Dro5ophBa}homd 4.47 

434294 AJ271379 Hs.76194 ribosomd protein 85 4.47 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



442875 BE623003 Hs.23625 Homo sapiens done TOOGTA0O142mRNAsequ 

447102 BE167434 H3.98471 ESTs. W^6fani]artoT18712hypott«8 

410142 AA081924 Hs.124918 K1AA1795 pnstebi 

434894 AW977850 Hs^3856 hypothetical proteta MGCS297 

420092 AA814043 K3.88045 ESTs 

400115 NA NA 

430987 H16791 Hs.31445 ESTs 

438078 AI016377 H3.131693 ESTs 

412359 AW837985 9i>K3V3-LTOO48-14O20O4»3<05 LT0048 Homo 

429774 A1522215 Hs.50883 WAAI 804 protein 

426214 K59846 Hs.128355 EST8.MDderBle(yshnilartoALU7J1UMANA 

419790 U79250 Hs.93201 glyceio^hosphste dehydrogenase 2 (mi 

450480 X82125 Ks^5040 zinc finger protein 239 

421211 AA284966 H3.266308 mos^ serine proleasa 

419261. X07876 Hs.89791 wingless^ Mh/OV integraOon site M 

434414 AI7g8378 gb*34b07j(l Na.CGAP.Ov23 Homo sapiens 

448305 AA625207 Hs^15 Homo8apienscONAFU12908fis.clonehn' 

410568 AW162948 Hs.64542 cleavage and poiyadenylaBonspet^fe 

459574 AI741122 Hs.10ieiO Homo sapiens cONAFU14232fis, done NT 

448243 AW369771 HS.S2620 integrin. beta 6 

438069 N80701 Ks^O ESTTs 

446152 A1292036 Hs.150028 ESTs 

439580 AF086401 Hs.293847 ESTs. Moderately simflar to S65657 alpha 

436211 AKD01581 Hs.334828 hypothetical protein FU 1071 9: WAAI 794 

452401 NM.007115 H9^2 tumor necrosis factor, alpha-biduoed pro 

42283S BE218705 Hs.121378 metaOothtonefa^likeS.testl^spedlic 

418845 AA852g85 Hs.89232 chramobox homotog 5 (IDtosophila HP1 alph 

439619 AW975998 Hs.68595 ESTs. WfeaWy sinto to 138022 hypotheti 

458076 RB0061 HsJ64478 hypothetical protein FU21939stmilar to 

450192 AA2$3143 Hs^4S96 RA051-interac8ng protein 

443232 AF161S21 Hs.9081 phenylatanyMTOJA synthetase beta^uburd 

413881 100190 HS.75S99 serine (or cysteine) proteinase inhlbito 

434217 AW014795 Hs.23349 ESTs 

409723 AW885757 Hs.257862 ESTs 

417956 AA210704 Hs.190465 ESTs 

458433 AL135352 Hs.255883 ESTs. Weakly simiiar to 138022 hypolheS 

409928 AI.137163 Hs.57549 hypothetica! protein dJ473B4 

447400 AK000322 H3.18457 hypothetical protein aJ20315 

424583 AR}17445 Ks.150926 fucose-l -phosphate guanylyttransferase 

429436 AA452934 Hs.279813 hypolheUcal protein 

424625 AWg04466 Hs.321197 PDZ domain pnM(DrosophilainaIMike 

448912 D83781 Hs.22559 WAA0197 protein 

442671 AI005668 Hs.134779 EST 

411893 R82845 HsJJ73789 ESTs 

456281 AA284166 Hs.84113 cycBn-dependenlWnase inhibitor 3 (COK 

421108 AA877124 Hs.172844 ESTs 

451401 AI793163 gbMn52g03.y5Na.CGAP.Co8 Homo saptens 

404516 NA ^ ^ ^ 

414968 C16096 Hs^2826 (mpomodufln 3 (ubiquitous) 

421308 AA687322 Hs.192843 leudne Zipper protein FKSG1 4 

419229 AI827237 Hs.282884 ESTs 

412104 AW205197 HS.2409S1 Homo sapiens. StoiHar to RIKENcDNA 2210 

453911 AW503857 Hs.4007 SarcotemmakfisodalBd protein 

433159 AB035898 Hs.150587 kinesin^ protein 2 

419247 865791 Hs.89764 ftagileX mental retardaSbnl 

432491 AA662910 Hs.42635 hypothetical protein OKI=2p434K2435 

422093 AF151852 Hs.111449 CGK94 protein 

428692 AI372822 Hs.110103 RNA polymerase I transcripSon factor RR 

446999 AA151S20 Hs.334822 hypothetical protein MGC448S 

414538 AWB1222B Hs.107987 ESTs 

453931 AL121278 Hs.25144 ESTs 

427718 AI798680 Hs.25933 ESTs 

453863 )(02544 Hs.572 orosomucoldl 

440209 HD5049 Ks.22289 neurexln3 

435148 Aigi8049 Hs.124961 ESTs 

409732 Nf^016122 Hs.56148 NY-REN-58 anflgen 

448692 AW013907 Hs.167531 me(hytcrotonoyM>}enzymeAcart)oxytasa2 

428301 AW828666 Hs.98440 ESTs. Wea»y similar to 138022 hypotheO 

444188 AI393165 K3.699 peptidytproly)isomerasBB(cydophinn 

457059 BE561665 H$.177677 exosome component 

407162 N63B55 Hs.142634 zinc finger protein 

408117 

434370 AF130988 Hs.5B346 ectodysplasin l.arihidrolic receptor 

427651 AW405731 Hs.18498 Homo sapiens cDNAFU12277 (is, done MA 

448666 NM_014953 Hs.323348 KIAA1008 protein 

450375 AA009647 H$.6850 a disintegrin and metalloprotelnasa doma 

410467 AF102546 Ks.63931 dachshund (OrosophDa) homolog 

455030 AA136106 H5.184852 WAAI 553 protein 

434082 AI373481 Hs.131715 hypothetfcal protdn PR01777 

443646 AI085198 Hs.164226 ESTs 

434265 AA846811 Hs.130554 Homo sapiens cDNA:fU23089fis. done L 

418524 AA300576 Hs.85769 addle 82 kOa protein mRNA 

432619 AW291722 Hs^26 (Bl^tolheNtefflilnusonre 
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4.30 
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4.10 
4.09 
4.08 
4.07 
4.07 
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447207 AM42233 Hs.17731 hypoQieeca) protein FU12B92 4.05 

423873 BB)03054 H5.16% malrixine!sDoprotebase12(niac^^ 404 

414853 1131116 Hs.77S01 sarcogtycan, beta(43kOdystroph!iv«$sre 4.04 

424176 AL137273 Hs.142307 hypotheSca! protdn 4.04 

5 452259 AA317439 Hs.28707 signal sequence rooeptor. gamma (Iranslo 4.04 

418203 X54942 H&83758 CDC28pioteIn kinase 2 4.03 

435420 AI926513 H3.59203 ESTs 4.03 

408666 V00495 Hs.184411 albumin 4.02 

417555 AA780791 Hs.14014 hypotheBcd protein FU14813 4.00 

10 449448 060730 Hs.57471 ESTs 4.00 

421037 AI684806 Hs.197653 ESTs 4.00 

448310 AI480316 gb:tm26li09j(1 Soares.NFU.T.GBC_81 Homos 4.00 

408155 AB014528 H5.43133 KIAA0628 gene product 4.00 

413841 M34276 HS.7S576 plasniinogen 198 

15 400110 MA NA 3.98 

400289 X07820 KS.225B matrix metaOopreteinase 10 (stromelysin a97 

443715 A1583187 Hs.g700 cycfinEI 197 

408296 AL117452 Hs.44155 DKFZPS86G1517 protein 3.97 

450164 A1239923 H3.30099 ESTs 197 

20 451592 A1805416 Hs.213897 ESTs 3.95 

402373 AL135225 Hs.301865 dopachroroelautDinerase((k)pachrorne delta 3.95 

426199 AA371865 H5.97090 ESTs 195 

414148 BE084049 gb:PM0-BT0651 -2704004)03402 BT0651 Homo 195 

417008 AW673606 Hs,80758 aspartyMRNA synthetase 194 

25 449532 W746S3 Hs.271593 ESTs. Moderately simHar to A47582M 193 

' 434551 BE387162 Hs.280856 ESTs. Klgtdy similar to A3S661 ONAexds 193 

436291 BE568452 Hs.5101 protein regidalor of cytokinesis 1 192 

423337 NM-004655 Hs.127337 axin2(conduc8n.adq 191 

416185 AW975B61 Hs.47367 KIAA1785 protein 3.91 

30 443054 AI745185 Hs.8939 yes-assodaled protein 65 kDa 190 

432596 AJ224741 Hs.27B461 malrffin3 ' 190 

451229 AW967707 Hs.48473 ESTs 190 

413583 AL120806 HS.S888 ESTs 190 

432702 AW9739S3 Hs.293744 ESTs 190 

35 ' 437207 T27503 Hs.15929 tiypolhelfcal protein FU1 2910 3.90 

434699 AA643S87 Ks.149425 Homo sapiens cDNAFU11980fis. done HE 3.90 

423697 BE088697 Hs.131834 Homo sapiens mRNA; cONA DKFZp434B0328 (f 190 

428822 W28418 Ns.30715 pol85Stonv6ttag&«atBddianne).tsk-roi 3.90 

432289 AI860145 Hs.55118 ESTs 189 

40 413384 NML000401 HsJ5334 exostoses (muIBpte) 2 3.88 

414136 AA812434 Hs.119023 SMC2(strudura! maintenance of cluomoso 188 

433042 AW193534 Hs.281895 Homo sapiens cDNARJ11660f]s. dons HE 168 

410094 BE147897 Hs.58593 general transcilpQon factor IIF. polype 3.88 

441826 AW503603' Hs.129915 ptiosphotrfesterase related 187 

45 444059 f%9743 H5.1 16774 integih, alptia 1 3.66 

426262 AI792141 H3.196270 folate transpoileiA^rrier 3.85 

452641 AW952893 Hs.237825 signal recognition pa«de 72(0 185 

454403 BE065985 gb:RC3«T031 9-1 202004)1 4-809 BT031 9 Homo 185 

448315 AW290912 Hs.20797 ESTs. VteaWy similar to ALU1_HUf^ ALUS 185 

50 411343 U77949 Hs.69563 C0C6(cdl division cyde 6, &cerBvisi 165 

409734 BE161664 Hs.S6155 hypothetical protdn 3.85 

454014 AW016670 Hs.233275 ESTs 184 

453116 AI276680 Hs.146088 ESTs 183 

449508 AK001566 Hs.23618 hypolheScal protein FU10704 182 

55 428227 AA321649 Hs.2248 small indudbia cytokine subfamIlyB(Cy 182 

435040 Ai932350 Hs.152825 ESTs 181 

426249 F05422 Hs.168352 nudeoporlrv^ike protein 1 181 

451110 A!955040 His.265398 ESTs. Weakly similar to transformaSorK 181 

431716 089053 Hs.268012 taUy^CoenzymeAIigase.tong^M) 181 

60 437631 AA764749 Hs^67245 hypoQietical protein FU1 4803 3.80 

429118 H20669 Hs.35406 ESTs. Highly similar to unnamed protdn 180 

405769 3.80 

438295 AI394151 Hs.37932 ESTs 180 

^- 453628 AW243307 Hs.83937 hypothetical protein 3.80 

65 450096 AI682086 Hs.79375 holocarbCKylase synthetase (bfolln^ 180 

449318 AW236021 Hs.78531 Homo sapiens. SImBartDRIKENCDNA 5730 176 

423881 AK001720 Hs.134403 hypothetical protein FU10858 175 

408728 AL137379 H3.47125 hypothetfeal protein FU1391 2 175 

422219 AW978073 Hs.101O regulator of nQloQc spindle assembly 1 175 

70 416661 NiyL001949 Hs.1189 E2F transcription (ador 3 174 

420726 K02402 Hs.1330 coagdaOonfador IX (plasma 0uombo|ila 174 

418413 R95735 H5.1177S3 ESTs. Weakly similar to A48866odiprol 3.73 

443354 AW970672 Hs.9247 protein kinase. AMP-adhmled.dpha 1c * 173 

406667 M12523 Hs.184411 albumin 172 

75 436411 AW674352 glcba63d]7.y1 KIHJAGCL1 2 Homo sapiens cO 172 

417246 Ar760098 Hs.21411 ESTs 3.72 

410664 NM_006033 Hs.65370 lipase, endolhdid 171 

432686 BE223007 Hs.152460 Homo sapiens cOHAFUl 2909 fis. done NT 170 

442881 AI023175 Hs.167022 ESTs 170 

80 432356 AA831032 H8.111670 ESTs, Highly sbnBar to JC2257proly)c) 170 

450218 R02018 Hs.168840 ankylosis, progressive (mouse) homolog 170 

405460 NA NA 170 

452824 W27643 Hs.73965 6pSdngfedar.8tglnInefeeifniHkli2 170 
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445B73 AA2S0970 H&2S1946 pa^A}4indtaopnMi.cytDp^^ 3^70 

42S282 AW163518 Hs.155485 hunQngSnlnteracGng protein 2 169 

439857 AA847194 Hs.232002 ESTs 3169 

. 430403 AF039330 H$.2413a2 tumor necrosb factor (DgantOsupeitanl 3.69 

5 444471 AB0206B4 Hs.11217 KIAA0877 prot^ 169 

419559 YD7828 Hs.91096 ring fif«er protein 169 

437641 AAB11452 Hs.291911 ESTs 3.6B 

455364 H72176 H5.4273 tiypottteOca) protein FU131 59 167 

417791 AW965339 Hs.111471 ESTs 3.66 

10 414271 AK00027S Hs.75871 protein UnaseCtilnding protein 1 3.66 

432023 AW273128 H&330144 EST 3.66 

430294 AI538226 Hs.32976 ouanfaenucteoSdetAuDng protein 4 165 

450600 BE079478 Hs.24860 ESTs 165 

420595 AA278865 Hs.86523 ESTs 165 

15 404477 NA NA 165 

457003 S78234 Hs.172405 ceU dMsion cycte 27 165 

443055 AV653742 Hs.15536 hypothe&al protein DKFZp7&1J139 165 

452220 BE158005 Hs.212296 ESTs 165 

^_ 414463 T69078 H3.76177 a!pha-1-(nicrogtotwlinMunh precursor 165 

20 457465 AW301344 Hs.122908 DMA replcaSon factor 164 

436149 AI754308 Hs.l59452 ESTs 163 

433790 BE^215 Hs.288368 RAB22A. member RAS oncogene family 163 

427920 Z11502 Hs.181107 annexinA13 163 

424641 AB001106 Hs.151413 gOa maturaOon factor, beta 163 

25 426514 BE616833 Hs.170ig5 bone morpi)ogeneGc protein 7 (osteogenic 3.62 

411975 AI916058 Hs.144583 ESTs 161 

409239 AA740675 Hs.44307 ESTs. Moderately simHar to 138022 hypol 161 

429628 H09604 Hs.13268 ESTs 161 

449722 6E280074 Hs.23960 cydinBI 160 

30 426389 AW135714 Hs^83127 ESTs. Weakly similar to T1 9201 ttypoUiet} 160 

419945 AW29097S Ks.1 16923 ESTs 160 

410365 AI2B7518 Hs.62669 Homo sapiens inRt^cONADKFZp586D0923(f 160 

420685 AW505139 Hs.9460 Homo sapiens mRNA;cDtMDKFZp547C244(rr 160 

407809 AW082279 Hb.244108 ESTs 160 

35 457708 AA80S443 Hs.179909 tiypottieOcal protein FU22995 3.60 

427943 AW959075 Ks.238797 ESTs. Moderately similar to 138022 liypot 160 

428771 AB028892 Hs.193143 KIAA1069 protein 160 

446638 AL133063 Hs.15783 Homo sapiens mRNA:cONADKFZp434P11 15 (f 159 

418688 T85017 Hs.1192 KIAA0074prot^ 159 

40 436961 AW375974 Hs.156704 ESTs 158 

430514 AA318501 Hs.241S87 megalcaiyocytfr^niiancedgenetransciiptl 158 

415245 N59650 Hs.27252 ESTs 157 

452823 AB012124 Hs.30696 transcription f^-lil(6 5 (baste lieiix 155 

423508 AW604297 Ks.129711 iiepaiitis A viros cellular receptor 1 155 

45 401165 NA NA 155 

415382 AI743539 Ks72465 ESTs. Weakly similar to nonto beta ga 155 

433988 AL157518 Hs.90421 PR02463 protein 155 

421528 AB037837 Hs.105461 hypolhetical protein FU20357 155 

443325 BE398006 Hs.90462 Homo sapiens, clone IMAG&4132043.mRNA, 155 

50 444355 BE383686 Hs.191621 ESTs. Moderately similar to ALU6_HUMAN A 155 

450715 A1266484 Hs.31570 ESTs. Wealdy similar to KIAA1 324 protein 155 
427510 Z47542 Hs.179312 small nudear RNA acfivaOng complex, po 155 
455630 AV655701 Hs.75183 cytochrome P450. subMly tIE (elhanol- 155 
441085 AW136551 Hs.181245 Homo sapiens CDNAFU12532 lis. done NT 154 
55 434206 AW136973 H3.268516 ESTs. Weakly similar to S69890 mitogen i 154 
446619 AU076643 Hs.313 secreted phosphopretein 1 (osteopoidln. 153 
432542 AW083920 Hs.16098 daudin2 153 
433675 AW977653 Hs.75319 ribonudeoSde reductase M2 poiypeplida 152 
423441 R68&49 HS.2783S9 absent in melanoma 1 1Sce 151 
60 452940 AA029722 Hs^173 fuoosyltrans1erase4(alpha(1.3)fuco6y 3.51 
440400 AA994364 Hs.125594 ESTs. Weakly similar to T25472hypottie0 150 
453439 AIS72438 Hs^2976 guanine nudeotide binding protein 4 148 
447247 AW389351 Hs^87955 Homo sapiens cDNAFU13090fis. done NT 148 
449915 NM.004529 Hs.404 myelbidAymplioidormixed^ineagaieukem 147 
65 427975 AI536065 Hs.122460 ESTs 146 
400297 A1127ty78 Hs.334473 hypotheiicat protein DKFZp56401 278 145 
404253 145 
435567 AW504944 Hs.162990 Homo sapiens cDNAFU14193ft3. done NT 145 
432158 W33165 Hs.22983 UDP-giucoserglycoproteingiuoosyltransfe 145 
70 417315 At080O42 Hs.336901 ribosomaIprol^S24 145 
419140 A1982647 HS.21572S ESTs 144 
446901 AI347274 0b:tc05d02x1 NO.CGAP.Coie Homo sapiens 143 

451806 NVL003729 Hs.27076 RNA3'-tenninal phosptiatscydase 3.42 
411571 AA122393 Hs.70811 hypoUetKal proton FU20516 142 
75 442717 R88362 H8.180S91 ESTs. W^ similar to T23976liypotl«6 141 
443426 AF098158 HsJ329 chromosome 20 open reading frame 1 141 
419131 AA406293 Ks.41167 ESTs 141 
430264 AA4705ig gb3U71f1 0.81 NCI CX3APJ>r1 Homo sapiens 140 

450159 AI702416 H$.200771 ESTs. Moderately slmBar to A Chain A, T 140 
80 4S3S31 AA417940 Ks^140O ESTs.Vfeddy8iiniIvtoJC5795 C0EPprot 140 
444626 AI674482 Hs.148441 ESTs 140 
445354 AV653485 Hs.6390 Homo sapiens done FlB3344PRO0845mRNA. 140 
444078 BE24G919 Hs.10290 US snRNP-specHic 40 kOa protein (hPrpS- 140 



141 



wo 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65' 

70 

75 

80 



418939 


AW630803 


(^69497 


tdJKIII Dl 


a.4U 


416134 


AA397769 


Hs^17 


ESTs 


lAn 


430544 


AA481(^ 


Hs.105153 






427927 


AI879165 


H5^27 


f!nAATfpmhnnnPf Kiwlhwi nmfpin /H/PRPl 




410804 


U64820 


Hs.66521 


MachflrfftJnfiBPh dhtfiasA /ficlnwMnhflflar 


3.39 


410406 


AI969703 


H&1466 




3.38 


4084S4 


AA554714 


^187578 


Hfwnft cantons <*fWA CT l11A3Qfle i<lMiaUt! 


AOO 


452930 


AW195285 


Hs.194097 


ESTs. Vfealdv fi1ntn» to 138022 hraathefl 




446432 


AI377320 


Hs.150058 


ESTs 




456653 


AI807519 


Hs 104520 


KcvnoBantftnscrUiJAnjnRQifk rinnn PI 




42S322 


U63630 


Hs.t55637 


juotelfl kinssBi DNAfOcSvaUuIi cstsiyOc 




428261 






cots. vveBiay SDiiiBr V) tNnucK igc rC I6 


Mi 


421039 






ctdlin S 


AiN 


407819 


R42185 


Hs^4803 


ESTs 




424698 


AA164366 


Hs.151973 


hvnathnficnt nmfpln Fl lOVi^^ 


J.00 


4333S1 


AW46997a 


ri»iivwwi*f f 


fibosofvis) i^otsln L39 




435022 


AW1833fi5 


Hs^627 


PQTq WpflVhf RlmRw tn rmi-t Ml IMAM IfLPn 


J. JO 


447164 


nrvcwm 1 


ns.iroio 


Homo sspicns c^5 (nRNA, psriis) 8Q(|U8nco 


3.35 


454018 


AVVD16892 




ESTs 


0.00 


439871 


R88518 


H3,46738 




J.O0 


453941 


U39817 


Hs.36820 


BioofTi syndronw 


1 "iA 


411560 


AW851186 


Hs.179909 




^ ai 


435532 


AW291488 


Hs.1 17305 




J.JJ 


438008 


AA775025 


Hs.203802 


F<rrc 


0,00 


421246 


AW582g82 


H5.102897 


CGi^y Rfnff^n 


111 


451707 


AW051061 


Hs.60973 


CO IS 


0.O0 


457574 


H88717 


Hs.27774 


F<«Tii Hlnhhr elmtlAr In API ftl 1 MQPfYt 


J.01 


443613 


AI079356 




nfmT'^QhflQ si RrauiM MhUUPki fil Unmn eani 

yu4JM9uu9<9 1 ooEua9jnnniwni_.oi nwiiD sapi 


Ml 


415857 


AA866115 


Hs.127797 


Hnrm cnrwHK rflNA Fl 111 IRI fie rinna UP 


0*41 


438746 


A1885815 


Hs.184727 


ESTs 




408860 


AA525775 


Hs.292523 


CO 1 S, MOOcialciy wiIiuHr 10 rKAeXn TciTi 


aoU 


427738 


NM00318 Ks.180812 


peiuMsanidi ineinuiane imuibui o (09XU, lb 


g.oU 


458855 


AW381299 


Hs.107000 


hvDOttiBt^ DfDlBbi FLJ11294 


0.0U 


417221 


AW379029 


Ks.118338 


FSTft Wafricltf similar in iinnanwH nmbilii 


OtOU 


424770 


AA425562 


H5.11065 


Homo saoiens HDCME13P mRNA oatM ak 


3.30 


417720 


AA205625 


Hs.208067 


ESTs 




428571 


NM-006531 


Hs^291 


rrauo n 1 g f i3# yiviycysuc luunof QiScaSvi 




452862 


AW378065 


H&8687 


ESTs 


^ 9ft 


414343 


A1036168 


Hs.323378 


costcd vcsldo mcmbfsns protsln 


0.£D 


437222 


AL1 17588 


Hs.12778 


ESTs 


O.£0 


422665 


AJ011812 


Hs.1 19018 


transcrh)Kon fdcfor NRF 


3.28 


414706 


AW340125 


Hs.76g89 


KIAAfWOT nana flmHiif4 




446565 


D13757 


H&311 


phosphorflio^ l^fniphosiAstB sniidotransr 


0.27 


447829 


AI433029 


Ks.164104 


ESTs 


0>£f 


427576 


BE242611 


Ks.2173 


focosyltrsnsferaso 4 {^ha (1 ,3) ftjcosy 


3.27 




BE219771 


Hs.237146 


nypoinoucai pcoKin rUictiyc 


3.26 


WfOVv 


AA71S284 




QDaiViK)iUj.n NU.iA3ArjDra nonv) ssptens 


3.26 




NK^007019 Hs.93002 


ubk|uiGn csniBr protein E2-C 






M62062 


Hs.150917 


catsntn {cadhefhv<associated piot^}, o 


3.25 


445592 


AV654382 


Hs.17947 


ESTs. Weakly similar to T16534 hypotheO 


Q OK 
OuCO 




AW450240 


Hs.257274 


cols 


3.25 


451797 


AW663850 


Hs.333513 


email fawliM*tKlft tt^MUttA »tiUatttHu C hm 

sfnasi noucioio vyvonno suDianiuy Ct mo 


* OS 
OlZO 


413930 


M86153 


Hs.75618 


rv\oi In, HttRiiiiw tsno oiiouyoiio lanuy 




410659 


AI080175 


H&.68826 


CQTd 

bo 18 


3.25 


446202 


AI279706 


Hs.149474 


F<rrB 

CO la 




432193 


AA372264 


Hs.273193 


nypouiwiGai piuwiii rui ivr w 


o,£o 


439262 


AA832333 


Hs.333045 


CCTe 

CO IS 


3.25 




NA 




MA 
Tin 


3.25 


441264 


AA327170 


Ks.23290 


ESTs 


0,£0 


424081 


NM.006413 Hs.139120 




9 9A 
O.Z^ 


408321 


AW405682 


Hs.44205 


\MU9laUn 


9 Oil 

o»m 


447432 


AW958473 


Hs.301957 


iiiwiA ^nuMcuoRw iJipitwpnotG iuiNBo moi 


^ 9A 


404519 










AI800271 


Hs.129445 


nypouieaca proiinii rui i<490 


3.23 


422660 


AW297582 


Hs.103267 


InmAlhallral Mwlatii CI 199KAA etmMar Ia 

iiypouiBwaJ puinii rU4c9W miihkb id 




427961 


AW293165 


H3,143134 


CO IS 


499 
0.£4 




Ai741320 


Hs.114121 


nOmO sapiens CUN/^ rU<MZ^ lis, CnXIO w> 


3.22 


411643 


AI924519 


Hs.192570 


hwn/iltwiftrat nmfatn Ci lOOnOB 

nypouiouca] ptuiein rijciajzo 


0.21 


458652 


AW375610 


Hs.117102 


hypoOieOcal protein FU13046 similar to 


Act 




BE246138 


Hs.30^ 


cST8 


3l21 


410389 


AW954049 


HS3177 


ESTs, Wealdy slrrilar to RHUB6 salivary 


3.21 


439979 


AW600291 


HS.6B23 


hypolheScal protein RJ10430 


3.21 


422363 


T6S979 


Hs.115474 


replication factor C (activator 1)3(36 


3.20 


443162 


T49951 


Hs.9029 


DKFZP434G032 protein 


3.20 


431678 


AW072372 


Hs^44$ 


hypolhefical protein RJ111B4 


120 


430439 


AL133561 


Hs.241426 


DKFZP434B061 protein 


120 


407201 


N31998 


Hs.164256 


hypothetical protein FLJ206S7 


3.20 


437905 


AW363121 


Hs.1755^ 


ESTs. WeaMy sfnAar to T2e935 hypothoU 


120 


434160 


BE551198 


K8.1 14275 


ESTs 


120 


407995 


AI094748 


Hs.100134 


hypotiieflcal prateb) FU12787 


120 


412986 


BE243311 


Hs^24 


IK cytoMna, d owRweQulator of KlA Q 


119 


414386 


XD0442 


Hs.75990 


119 


444381 


BE38733S 


HsJ»3713 


ESTs. sfanHar to S64054 hypolhefl 


118 



142 



wo 03/042661 



PCT/US02/36810 



429597 Nllt003816 H3.2442 a dlslntegrfn and meteBoprote fn asa doma ai8 

433764 AW753676 H5.39382 ESTs ai7 

459370 AAB69982 Hs^1B28 ESTs. WeaUystmOar to 138022 hypolhetl 3.17 

_ 429616 AI982722 Hs.120845 ESTs 117 

5 41S083 AI632683 Hs^179 Homo sapiens CDNARJ12933 6s. cione NT 3.16 

424687 J05070 Hs.151738 matrix metaltopro^ase 9 (gelsOnasaB 116 

428839 AI767756 Ks.82302 Homo sapiens cONAFU14814fis. dona NT 3.16 

423629 AW021173 Ks.18612 Homo sa|^ cONA: aJ2ig09 fordone H 3.15 

443830 AI142095 Hs.143273 ESTs 115 

10 413516 BE145907 gb:MR041T0208-221 299.204^1 2 HT0208 Homo X15 

433527 AW235613 Hs.133020 ESTs 3.15 

427986 N45214 H$.282387 Homo sapiens cDHA:FU21637as. done H 3.15 

457453 Z70695 Hs.272240 Homo8apienscONAFU11086fis.dQn8PL 115 

427687 AW003867 Hs.1570 hlstannine receptor HI 115 

IS 455068 A1807894 Hs.47274 Homo sapiens mRNA;cDNADKFZp564B176tfr 115 

441720 A1346487 HsOT39 ESTs 115 

419569 AI971651 H5.91143 j^ed 1 {Alanine syndrome) 115 

445921 AW01S211 Hs.146181 ESTs 115 

429957 AW204S30 Hs.99500 ESTs 115 

20 403137 114 

425268 AI807883 HS.1800S9 Homo saptens cDNA FU20653 lis, done KA 114 

428645 AA431400 H5.98729 ESTs.WeaUysimiiarto2017205Adihydio 114 

439835 AA477288 H&g4891 tiypothellcal protein FU22729 114 

408808 AW847814 Hs.289005 Homo sapiens cDNA:FlJ21532 S3, done C 114 

25 439277 R80061 H5.164478 hypothe1tedlpR9teinFU219398iniIlarto 113 

443564 AI807036 Hs.267245 liypoihetical protein RJ14803 113 

406668 T62745 H5.184411 albumin 113 

452194 A1694413 Hs.332849 ollactDiyieceptor.MIy2.sut)f8mily 113 

437594 AA761431 Hs.74335 heat 8hodc90i(O protein 1. beta 113 

30 433759 AA680003 Hs.109363 Homo sapiens cDNA:FU23603 lis. done L 113 

420911 U77413 Hs.100293 O-Onlced N-acetylgtucosamina (GicNAc) tr 113 

414080 AA135257 H$.47783 B aggressive lymphoma Qena 111 

450209 AW0a8921 Hs.13138 Homo sapiens, done IMAGE:3446343,mlRNA. Ill 

441790 AW294909 Hs.132208 ESTs 111 

35 439352 6E614347 Hs.169615 hypotheGcat protein FU20989 110 

449664 R06212 Hs.127733 ESTs 110 

435979 W03698 Hs^3513 ESTs.Weai(ly8imilartoAlJU1J{UMANA1^8 110 

424602 AK002055 Hs.151046 hypotlrafical protein FU1 1193 110 

402963 110 

40 428367 AW978441 H&296100 ESTs 110 

455838 6E145808 gb:MRO-HT0208-101 299-103^1 1 HT0208 Homo 110 

407502 U52096 gb:Human zinc finger protein (lcr-znfl)m 110 

426853 U32974 Ha.172777 baculoviral lAP repeat^ntMg 4 110 

417845 AL117451 Hs.82719 Homo sapiens mRNA;cDNADKFZp586F1822(f 110 

45 421056 AI076890 Hs.146B47 TRAP family member-assodaledNFKBadIv 110 

420817 AKD01652 Hs.99423 ATP-dependent RNA heHcase 110 

421841 AA908197 Hs.108850 MAK^Iated Idnase 110 

429534 AW97eg87 Hs.163327 ESTs. Weal(tysimHarto2109260ABcell 110 

408353 BE439838 Hs.44298 mitodiondriai ribosomai protein S17 110 

50 433037 NAy)141S8 H8.279938 HSPC067 protein 109 

443183 R16258 Hs.6217 Homo sapiens CDNARJ12521 Us. done NT 109 

457726 AI217477 Hs.194591 ESTs 109 

415786 AW419196 Hs^57924 hypothetical protein HJ13782 108 

433013 AI697890 Hs.127337 axin 2 (oonducfin. a»1) 108 

55 417601 NIIU)14735 H&82292 KIAA021 5 gano product 108 

420276 AA290938 Hs.190561 ESTs, HighV similar to SORLHUMANSOI^n 107 

443323 BE560621 Hs.9222 estrogen receptor IMng site assodate 107 

446223 BE300091 Hs.119899 hypotheOca) protein aJ12989 107 

425851 NM.001490 Hs.159642 ghioosamlnyl (N-aoetyQ transferase l.c 107 

60 438203 6E384982 HS.S076 Homo sapiens cONA:FU22128 lis. done H 107 

416402 NM.000715 H8.1012 oomptementcomponenf44)ihdingproteb. 106 

436554 AI9B5810 H3.301173 ESTs 106 

413801 M62246 Ks.35408 ESTs. Highly similar to unnamed protein 106 

416248 H99169 Hs.23450 mitodiondriai rfbosomat protein S25 106 

65 445413 AA151342 Hs.12677 C6)-147 protein 106 

452909 NML015368 Hs.30985 pannexlnl 106 

447048 AW393080 Hs.228320 hypolhsBcal protein RJ23537 105 

425942 AU077195 Hs.164036 glucosamine (N.acetyt)««ilfat85e(S8nf 3.05 

406333 105 

70 428454 U55936 Hs.184376 synaptosomal-assodated protdn. 23t(D 105 

411864 AW948147 gb:RCO-fynr001128030(M}31^3 MT0013 Homo 3.05 

458632 AI744445 H8.167073 Homo sapiens cONAFU13047fis, done NT 3.05 

448292 BE281316 Hs,47334 hypothetical protein FU 14495 105 

459055 N23235 Hs.30567 ESTs. Weakly similar to B34087 hypoBieti 105 

75 402167 105 

433133 A802724g Hs.104741 PDZ-b{ndinQidn3S8:T-ceIloriginatBdpr 105 

437828 AW976806 Hs.73149 p^ box gene 8 105 

404232 105 

418164 AI76t820 Hs.41074 ESTs. Weakly sImDar to 139294 AteLeodsy 105 

80 412610 mm Hs.74126 ^ add binding protdn a fled (gas 105 

45Z787 AW294022 Ks.222707 KIAA1718 protein 105 

425782 U66468 Hs.159525 oeO growth regdatory with EF-hand doma 105 

410716 AI920783 Hs.191435 ESTs 104 
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AI798376 S46400 AW811617 AW811616 W00557 BE142245 AWe58232 AW861851 AW856362 AA232351 AA218567 AA055556 AW858231 
AW857S41 AW814172 K66214 AW814398 AF134164 AA243093 AA17334S AA199942 AA223384 AA227092 AA227080 T1 2379 AA092174 T61 139 
AA149776 AA699829 AW8791B8 AW813567 AW813538 AI267168 AA1 577 18 AAl 5771 9 AA100472 AA100774 AA130756 AA157705 AA157730 
AA157715AA053524 AW849581 AW854566 C05254AW882836 T92637AW812&21 AA206583AA209204 BE1 56909 AA226824AI829309AW991 957 
N66951 AA527374 H86215 AA045564 AI694265 H60808 AA149726 AW195520 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 

AI872932 AAS82306 BE220163 W86e95 T81 307 K91447 

AWB743S2 AA715374 Z2S205 

A1079356W23287 

AI347274AW844024 

AI480316AW847535 ' 

AI793163 AW675182 AW875178 AWB75176 



455838 1374605.1 
TABI£6C 



AW848047AW84B202AW848631 AW846142 AW848702AW848121 AW848632 AW848140 AW846571 AW848009 AW848067 AW648069 AW848905 
AW848214 

BE14S808BE145807BE181883 



Pltey: Unique number oonespondlng to an Eos probeset 

Ftef: Sequence source. The 7 digit numbers In this cotumn are Genbank lden6fier(G0numben. Dunham l.e!8L'refei8 to the pubBcaSonenfiUed The DMA 

sequence of human chromosome 2Z* Dunham L el eLNdure (199^ 402:489^. 
Strand: Indicates DMA strand from which axons were predidsd. 

tQjnsHhn: Indicates mjcteotUeposiSons of predicted exons. 
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5 
10 
15 
20 
25 
30 

35 



40 

45 
50 
55 
60 
65 
70 
75 



Pkey 


Ref 


Strand 




400534 


6981826 


Minus 




401165 


9438376 






401480 


7321503 


Ptus 


1fiE1?0-1B6347 1E&lS1.1fifi<K7 1RQR.i:i.1RQR^ 

8265&439S9 


401644 


8576138 


Plus 


401823 


2282095 


Minus 


42S7S-42697 4318943287 iSSBtUSBIA 


402167 


K71795 


Ptus 




402963 


5419853 


Minus 




403137 


9211494 


{Minus 




403776 


7770611 


Minus 


1414-1SI3Ll62417Sfi 


404025 


7341444 


Ptus 


131740.131 SOS 


404232 


8218045 


fi/Qnus 




404253 


9367202 


Minus 


55675*56055 


404477 


8080699 


Plus 


113390-113577 


404516 


8151967 


Ptus 


114153-114322 


404519 


8152000 


Plus 


12817-13000 


404567 


7249169 


Minus 


101320-101501 


404996 


6007890 


Ptus 


37999-38145.38652-389g83727-39872.40S57^4,42351-42450 


405394 


6624123 


Minus 


31900-32373 


405460 


7684569 


Minus 


52223-52389 


405769 


3046270 


Minus 


76844-77193 


406117 


9142932 


Ptus 


54304-54584 


406333 


9213235 


Plus 


64689^798 


408360 


9256107 


Minus 


7513-7673 


406414 


9256407 


Phis 


4959349850 



TABLE 7A: 516 GENES UP-REGULATEO IN COLON CANCER COMPARED TO NORMAL COLON 
Table 7A Osis 516 genes up^eguialBd in Gokin cancer compared to nomid 

BssuB 8sl as the nonnal sarootes In datermlninQ the denominator value and the rallo was eoual to or greater than &a 

Pkey: Unique Eos probesetidenlilier number 

ExAocn: Exemplar Accession number, Genbank anesslon number 

UnigenelD: Unlgene number 

Unigene Title: Untgenegeneb*8e 

R1: Raao of tumor to nomial colon 



Pkey 


ExAocn 


UnigendD 


Unl^neTHIe 


R1 


441031 


AI110684 


Hs.7645 


iibrinogen. B beta polypepSde 


57^2 


406667 


M12523 


Hs.184411 


atbunto 


49^ 


409041 


AB033025 


Hs.5fl081 


KIAA1ig9 protein 


49.18 


428330 


122524 


H3^56 


mabix metalloprotBinase 7 (matrOyBln, 


A222. 


421552 


AR)26692 


H3.1057C0 


secreted ftlzzled^elated protein 4 


34.64 


429201 


X03178 


H3.198246 


grou|>«peciftc component (vitamin D bind 


33^8 


452281 


T93500 


Hs.28792 


Komo sapiens cDNA FU1 1041 Us, ctone PL 


33.10 


447033 


A1357412 


Hs.157601 


ESTs 


3U4 


428839 


AI767756 


Hs.82302 


Homo saplNTS cDNA FU14814 lis, ctone NT 


26.84 


438461 


AW075485 


Hs^86049 


phosphoserine andnotransferase 
plasminogen 


25.40 


413841 


M34276 


Hs.75576 


24.68 


428187 


AE687303 


Hs.285529 


6 proti^iKooupled receptor 49 


24.00 


408806 


AW847814 


Hs.269005 


Komo sapiens CDNA: FLJ21532 lis, done C 


ai8 


452862 


AW378065 


Hs.8587 


ESTs 


21.34 


415989 


AI267700 


H3.317584 


ESTs 


20.92 


427585 


D31152 


H5.179729 


coHagen, lypa X. alpha 1 (Sclunid metaph 


19.22 


421470 


R27496 


Hs.1378 


anncDdn A3 


17.92 


424051 


AL110203 


Hs.138411 


Komo sapiens mRNA; cONA DKFZp586J1922 (f 


17.36 


439759 


AL359055 


Hs.67709 


Komo sapiens mRNA luQ length Insert cON 


17.28 


449032 


AA045573 


Hs^O 


nuclear factor (eiythroid^lerlved IQ-lik 


17.08 


421462 


AF016495 


Hs.104624 


equapoifnO 


17,02 


424252 


AKD00520 


Hs.143811 


hypothetic^ pfot^ FLJ20513 


16.98 


452823 


AB012124 


Hs.30696 


transcription factor-fike 5 (baste heBx 


16.70 


432340 


AA534222 




gb:iil21d024i1 Na.C6APJ^l Komo sapiens 


• 16.64 


433447 


U29195 


Hs.3281 


neuronal penbaxln n 


16.59 


414386 


X00442 


Hs.75990 


haptoglobin 


16.19 


425260 


L47726 


Ks.1870 


phenylalanine hydnixytase 


16.08 


444754 


TO911 


Hs.11881 


transmembrane 4 supeffamily member 4 


15.82 


43%18 


W76326 




8b2d60d04j1 Soares.fetaU)earLNbHH19W 


15.80 


443211 


AI128388 


Hs.143655 


ESTs 


15.78 


439608 


AW864696 


Hs.301732 


hypoOnOcai protein MGC5306 


15.52 


414559 


AV6S61B4 


Hs.76452 


C-reac8vo protein, pentraxtn-retaled 


15.42 


412719 


AWD16610 


H1I299II 


ESTs 


15.24 


439451 


AF088270 


Hs.278554 


heterechraniallivJlke proteh 1 


15.18 


448974 


AL049390 


H8.22689 


Homo sapiens mRNA; cDNA DKFZ0586O1318 (f 


15^ 
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416402 KM-000715 Hs.1012 ccmptein8ntanipon8m4«MlRg protein. 14.60 

453863 X02544 Hs.572 OTXOmxddl 14^ 

441243 AI767056 Hs-193002 ESTs 14^ 

^ 413316 AU076607 HS.7S2B5 inter^pha (gtobulin) IntMir. H2 pd 14.30 

5 451917 AW391351 Hs^820 Homo sapi^ unknown mRHA 13^1 

433213 AW66513D Hs.137190 ESTs 1180 

428261 AW006855 Hs.118392 ESTs. VfeaMysbnUar to LNHUERtgE Fere 13^2 

438578 AA611244 Hs.164168 ESTs 13.40 

^ . 429170 NM.001394 Hs.2359 dual spedficOy phosphatase 4 13^ 

10 417006 AW673606 Hs.80758 aspartyMRNAsynOietBse 1X00 

449199 AI990122 Hs.198988 ESTs U98 

436393 AW022213 Hs.143617 ESTs 1Z90 

413585 AI133452 Ks.75431 fibrinogen. ganvnapolypepOde 12.83 

451561 N52812 Hs.177403 ESTs 1Z72 

15 420734 AW972872 Hs.293736 ESTs 12.70 

422420 U03398 Hs.1524 tumor necn)5isfiaclor{|igand)8uperfBml 12.50 

441377 BE218239 HS.2D2656 ESTs 1Z45 

435981 H74319 Hs.188620 ESTs 12.38 

417296 L36198 Hs.B1884 suffotransferase family. cytosoIic2A. 12.38 

20 428664 AK001666 Hs.189095 slmBartoSALLI (8^(DrosophilaHn» 12.38 

459370 AAB89982 Hs^71826 ESTs. WeaMyslmOar to 136022 hypoUieti 12.34 

430290 AI734110 H5.136355 ESTs 12,30 

450221 AA328102 Hs.24&41 cytoskeleton associated protein 2 12.26 

450628 AW382884 H3.204715 ESTs 12.24 

25 4^232 A)281B48 Hs.194691 retinoic acid Indxed 3 12.16 

428223 AA424313 Hs.98402 ESTs 12.08 

432582 AI623817 Hs.168457 ESTs 1Z08 

407884 BE075316 Hs.95011 synlrophin. beta 1 (dyslropNnnassociate 12.02 

426493 AK001745 Hs.164628 hypothetical protein FU 10683 12.01 

30 407202 N58172 Hs.109370 ESTs 11.84 

422109 S73285 H8.1473 gastrin-ieleaslng peptide 11.68 

434699 AA643687 Hs.149425 Homo sapiens cDNAFU11980 lis. done HE 11.68 

447532 AKD00614 H3.18791 hypothetical protein FIJ20607 11.67 

443162 T49951 Hs.9029 DKFZP434a)32 protein 11.67 

35 450149 AWg69781 Hs.132663 Zicfiamilymemt)er2(odd.p8lredDro5ophi 11.62 

446372 AB020644 Hs.14945 long fatty apy^CoA synthetase 2 gene 11.46 

453909 AW004045 H&203365 ESTs 11.42 

416655 AW968613 Hs.79428 BCL2Menovlni8E1B19ia»nterBctin9pro 11.34 

452903 AI95342S Hs.246911 ESTs. Weddy similar to 138022 hypotheS 11J2 

40 433011 m%0 Hs.306044 C61-05 protein lUO 

423575 C18863 H3.1&3443 Homo sapiens cONAFUl 1576 fis. done HE 11.22 

455777 AA524265 H8.154172 ESTs. Moderately similar to BC6F.HUIiAANB 11.20 

425745 U44060 H8.14427 Homo 88|)ten8cONA:FU21600 fis, done H 11.08 

407168 R45175 Hs,117183 ESTs 10.91 

45 407633 NM.0O7069 Hs.37189 similar to rat HREV107 10.90 

400534 10.88 

450434 AA166950 Hs.195870 hypothe&al protein FU 14991 10.76 

445873 AA2S0970 Hs.251946 poly(A)-b]nding prateh. cytoplasniic 1-1 10.68 

440526 AI832243 Hs.211471 ESTs 10.63 

SO 427544 AI767152 Hs.181400 ESTs, W^aidy Mar to 178885 serinefih 10.62 

447974 R76886 gb7i64t)03.s1 Soaies placenta Nb2HPHomo 10.52 

431956 X63629 Hs.2877 cadheHn3,type1.P-cadherto(ptecente 10.52 

453922 AF053306 Hs.36708 budding uninhibited by benziinidazdesl 10.50 

457065 AI476318 Hs.192480 ESTs 10.40 

55 406950 AA707814 H3.14945 long fatty aiqrl^ synthetase 2 gene 1038 

419852 AV\^56 Hs.266184 hypothetical protein dl551D2.5 10.34 

414718 K95348 Hs.107987 ESTs 10.29 

447505 AL049266 H8.18724 HoniosaptensnifWA:cONADKFZpS64F093(fr 10.28 

404567 10.14 

60 428536 AI143139 Hs.2288 vfstniiKlile 1 10.06 

444381 BE387335 K8.283713 ESTs,W^6lro9artoS64054hypoiheti 10.05 

437267 AVW11443 Hs,25B110 ESTs 10.00 

420593 H77859 Hs.65450 re&uIon4 10.00 

^. 407244 M10014 Hs.75431 fibrinogen, gamma pdypepQde 9.99 

65 425071 NM.013989 Hs.154424 deio(finase,todatfvronIne.typeD 9.97 

449655 AI021987 HS.S9970 ESTs 9.91 

459504 8E514127 gb:601315974F1HIKJMGCJ Homo sapiens cD 9.90 

434609 R76593 gb7i60c11.r1 Scares placenta Nb2KP Homo 9.90 

412104 AW205197 Hs.240g51 Homo 8a|to. Similar to RIKENcONA 2210 9.68 

70 447863 AU)47611 H8.286885 Homo 88ptenscDNAFU14246 Us, done OV a84 

448106 A180O47O Hs.171941 ESTs 9.64 

439192 AW970536 Hs.105413 ESTs 9.64 

446619 AU076643 K8.313 secreted phosphoproteln 1 (osteoponfln« 9.61 

427535 R29543 Hs.2164 proiriatetet basic protein (indudespte &S0 

75 440591 AA431sg9 Hs.132799 liypoOieQcal protein FU23451 6.44 

440404 A1015881 Hs.324527 mitodiondrial ribosomd protein S5 9.40 

426283 NM.003937 H3.1 69139 l(ynureninase (l-^urentne hydrolase) 9.38 

401742 NA NA 9.30 

416393 N54037 H3.262869 plasmhiogen^Ite 9.28 

80 413339 A1818080 Hs.194290 ESTs 9.28 

437641 AA811452 Hs.29igi1 ESTs 9.28 

426743 AL080060 Hs.301549 Homo88ptensmRNA:cONADKFZp564H172(lr 9.22 

431319 AA8733S0 H8.302232 ESTs 9.21 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



434008 


AA740878 


K&112982 


ESTs 


9J0 


446311 


AVV007294 


H5.149795 


ESTs, Modarate}y sintilar to ALU1_HUMAN A 


9.16 


419968 


X04430 


Hs.93913 


IntBileuktn 6 (IntBTferon, beta 2^ 


9.16 


413597 


AW302885 


H3.1 17183 


ESTs 


9.15 


440527 


AV557117 


Ks.184164 


ESTs, Moderately sintfar lo S65657 alpha 


9i14 


409327 


L41162 


Ks.53563 


coQa^ai, typQ ^ alpha 3 


9i14 


444107 


T46839 


H5.10319 


UDP $iyco6y1tran$feras8 2 family, ^titffo 


9.05 


456653 


AI807519 


Hs.104520 


Homo sapiens cONA |rU13694 fte, done PL 


8.98 


450164 


AI239923 


Hs.3(M)98 


ESTs 


8.95 


432B67 


AW016g36 


Hs.233364 


ESTs 


a93 


4373S6 


BE140396 


Hs.21621 


tiypoUieScsl protein DKFZp7620D76 


&92 


421128 


M74587 


Hs.102122 




8.92 


452712 


AW838816 




gb:RC5-LT0054-14020(M)134301 LT00S4 Kofno 


8.90 


419131 


AA406293 


Hs.41167 


ESTs , 1 


8.86 


444783 


AK001468 


Hs.62180 


(Drosof^la Scraps homdog), act 


8.82 


418895 


AA894638 


Hs.14600 


ESTs 


8.82 


422665 


AJ011812 


Hs.1 19018 


traoscflpOon factor NI^F 


8.82 


409757 


NNL001898 


Hs.123114 


cystaiin SN 


8.78 


419752 


AA249573 


Hs.152618 


ESTs, Moderately sMIartoZN91JIUMANZ 


8.74 


412446 


AI76801S 


Hs.92127 


ESTs 


8.71 


433285 


AVVg7S944 


Ha. 237396 


ESTs 


8.68 


414538 


AVV612228 


HSt 107987 


ESTs 


8.64 


449318 


AW236021 


Hs.78531 


Homosaolens SiniQartoRIKENcDNA5730 


8,62 


430835 


A)240006 


Hs.192326 


ESTs 


8.60 


445467 


At239832 


Hs.15617 


EST&WaaklvshTiQartQALU4 HUMAN ALUS 


8.59 


445537 


AJ245871 


H5!l2844 


EGF-like*domEjn, fnulHple 6 


8.52 


450375 


AA009647 


H5.8850 


a dsinteQnn and roetallopn^slnase doma 


8.52 


428355 


BE256452 


Hs.2257 


\^tit)neclin (sorum spreading factofi s(Hn 


8.50 


444476 


W07318 


Hs.240 


M-phase phosphoprotdin 1 


8.47 


439398 


AA284267 


Hs.221504 


ESTs 


8.44 


428368 


BE440042 


Hs.63326 


malilx metdoprottinase 3 (stroin^dn 


8.43 


403776 






8.42 


418973 


AA233056 


Hs.191518 


ESTs 


a42 


445436 


A]224105 


Hs.1 51408 


ESTs 


8.38 


417958 


AA767382 


Hs.193417 


ESTs 


8.34 


452838 


U6S011 


Hs.30743 


prsfBiantially axpressed anGgon tn mda 


8.34 


425761 


AW664214 


Hs.1 96729 


ESTs 


8,33 


449419 


R34910 


Hs.119172 


ESTs 


8,29 


407007 


U22961 




ah'Hutnan mRNA dona wHh slmflaitlv tn 1 




420900 


AL045633 


Ks.44269 


ESTs 


8.25 


452503 


AB0O05O9 


Hs.29736 


TNF receptor*assod^sd factor 5 


8.23 


458242 


BE299S88 


H$.28465 


Homo sapiens cDNA: FLJ21869 fis, done H 


8.18 


420120 


AU)49610 


H8.95243 


IrfloscrfpBon etonoaDon factor A ^11)* 


8.16 


432363 


AA534489 




alnir76a11j8l NCI CGAP GoSHaimsaDlens 


8.16 


418738 


AVV388633 


Hs.6682 


fioluta canlpr famlltf 7 f caBofilc finriM 


8.12 


446155 


AI553695 


Hs 159422 


l^rtio^anli^r^idlMA FIJI 3997 lis i4nfiAV7 


8,10 


418379 


AA218940 


K&.137516 


fidgeBn-filQe 1 


8.07 


424560 


AA1S8727 






8.06 


453116 


AI276680 


Hs.146086 


ESTs 


8.04 


419929 


U90268 


Hs.93610 


oefebrd cavernous maifisnnations 1 


8.04 


409687 


T51125 


Hs,8493 


ESTs 


8.00 


407790 


AI027274 


Hs.288941 


Homo sapiens cDNA FLJ14866 fis, done PL 


8lOO 


416661 


AA634543 


H8.79440 


IGF^I iriRNAFbln<finfl onl^ 3 


8.00 


418038 


^7976 


Hs.83337 


Ifitent transfomilng Qrowtii factor beta b 


7QQ 


422762 


Ali31320 


Hs 11^6 


Uitmsn nNA fimnenM limni RlmiA RP1.9nM9 n 
rHiiiioii wi'in «ioi|iiviiwo iitnii muiiv rvr I'MIiv u 


7QA 
r>9V 


421633 


AF121860 


Hs.106260 


soitli^ nexin 10 


7.92 


432542 


AW083920 


Hs.16098 


dautfin2 


7.86 


414869 


AA157291 


Hs.21479 


ubfawdeln 1 


7.84 


419502 


AU076704 


Hs 90765 


fitflnflOMl. A nlrtfu) nnhrmnHrin 


7Rn 

r.ov 


406666 


VD0495 


Hs.184411 


^btimln 


7.78 


418007 


Mt3509 


Hs.83169 


niatfb( (netaltoprot^ase 1 (intBrstitial 


7.78 


439616 


BE018635 


Hs.58582 


HoflK) sapiens cONA FLJ12789 fis, clone NT 


7.77 


406360 


NA 




NA 


778 


406687 


M31126 


Hs.272620 




77B 


431510 


AA580082 


Hs!l 12264 


ESTs 


7.76 


414312 


AA1 55894 


Hs 191060 


ESTs 


7 71 


443450 


N66045 


Hs.1 33529 


ESTs 


770 


449870 


A)672487 


Hs. 16423 


hvDothetica] orot^ HDCiyCIMP 


7.64 


425681 


AB018297 


Hs 159183 






408897 


N50204 


Hs.283709 


npopolysBocharido spedlic response*? p 


7fi9 


443285 


A1301918 


H&334264 


ESTs 


7.60 


420807 


AA280827 


Hs.57846 


ESTs 


• 7.60 


424650 


AW576156 


Hs.250824 


Homo sapiens cONA:FU23435 lis. done H ^ 


7.60 


410718 


A)920783 


Hs.191435 


ESTs 


7.60 


430848 


AW021726 




Qbsdf27e02.y1 Morton Fetal CocMea Homo 


7.60 


434294 


AJ271379 


Hs.76194 


fftnsQinaJ protein S5 


7.60 


445808 


AV655234 


Hs.298083 


ESTs. Moderately dniBar to PC4259 lisni 


7.56 


438804 


AA811896 


Hs.44604 


ESTs 


7.54 


456997 


AW937420 


Hs.69662 


ESTs 


7.54 


400289 


XD7820 


Hs.2258 


matrix melalloprotdnase 10 (stremslydn 




409187 


AF154830 


Hs.50966 


cartiamoi^hosphate synthetase 1. nritoch 


7.62 


445640 


AW969626 


Hs.31704 


ESTs. WeaMy dmlar to KIAA0227 [Ksapi 


7.49 


404S96 






7.48 
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420759 T11832 Hs.1277g7 Homo 8aplen$cONAaJ11381fis, done HE 7.48 

433859 AW896758 H&273789 ESTs 7.44 

423952 AWSTHSr Hs.136102 KIAA08S3 prot^ 7.44 

_ 431193. AW749505 H3.296770 KIAA1719 protein 7.43 

5 413530 AA1301S8 H$.19977 ESTs. Moderately slmOar to ALU8.HUMAN A 7.40 

417479 AI057052 Hs.133554 ESTs, Weakly slmQar to Z195J1UMAN2INC 7.40 

424613 AU)79850 Hs.151236 highly charged protein 7.37 

4in20 AA205625 Hs.208067 ESTs 7.35 

449347 AV649748 Hs.295901 KIAA0493 protein 7.34 

10 449444 AW818436 Hs.23590 sohits carrier femOy 16 (monocaiboxylic 7.34 

447499 AW2625a0 H$.147674 pmtocadherinbetaie 7.32 

426890 AA393167 Hs.41294 ESTs 7.31 

450568 AL050078 ^^159 Komo sapiens cDNAFU10784fis,cton8 NT 7J2B 

445019 AI205540 Hs,281295 ESTs 7.28 

15 419474 AW968619 Hs.155849 ESTs 7.24 

417015 M83772 Hs.60876 flavin conta!i^ monooxygenase 3 7.24 

411765 H43346 gb:yp09a04/1 Soares tveast SNbHBst Homo 7.24 

44^16 AB033052 Hs.22151 K1AA122S protein 7.18 

416143 AI95565Q Hs.79033 glutantinyH)6ptide cydotransferasa (glu 7.18 

20 451815 AW974911 HS.1&4793 Homo safto cDNA: aJ21 880 Its. done H 7.18 

420777 AA28Q223 Hs.130865 ESTs 7.16 

414463 T69078 Hs.76177 dph8-1-mlcrogIot}u!inA)ilQjnin precursor 7.14 

409269 AA576953 Hs.22972 hypotheScal protein FU13352 7.14 

443614 AV555386 Hs.7645 fibrinogem B t)eta polypeptide 7.14 

25 448706 AW291095 Hs.21814 Interteuldn 20 receptor. Alpha 7.13 

441124 T97717 Hs.119563 ESTs 7.12 

414680 AA743331 Hs.272572 hemoglotjin. alpha 2 7.10 

446432 AI377320 HS.1S0058 ESTs 7.10 

439295 AW206091 HsJ53538 ESTs 7.08 

30 436902 AW247145 Hs.192729 ESTs 7.08 

428679 AA431765 gb2w80c03.s1 Soares.tesfis_NHT Homo sap 7.08 

414865 AA1571S5 Hs.274414 hypotheOca) protein FU14457 7.08 

421373 AA808229 Hs.167771 ESTs 7.06 

432435 BE218886 Ks.282070 ESTs 7.05 

35 427933 AW974643 Hs.190571 ESTs 7.04 

436330 NM.004413 Hs.109 dIpepUdase t (renal) 7.04 

433800 At034361 Hs.135150 tung type^ ceQ membrane-assodaied gly 7.01 

418763 AK000219 Hs.68367 hypotheflcd protein FU2021 2 6.98 

446322 N23033 Hs,155814 ESTs &98 

40 442577 AA29299B H8.163900 ESTs a96 

429486 AF155827 Hs.203983 hypothetical protein FU10339 a94 

412280 AW205116 Hs.272814 hypothetical protein DKFZp434E1 723 &92 

436679 AI127483 Hs.120451 ESTs. Weakly similar to unnamed protein 6.90 

429125 AA446854 Hs.271004 ESTs. Weakly similar to 138022 hypotheO 6.90 

45 453204 R10799 Hs.191990 ESTs &90 

431576 M7666S Hs.275215 tiydroxyste(old(11-bet8) dehydrogenase 1 a88 

427878 C05766 Hs.181022 CGI-07 protein 6.88 

454438 AA224053 Ks.172405 ceildhrlsIoncycleZr 6.86 

424402 M63108 Hs.1769 luteinizing hormonetehoitogonadotropln r 6.86 

50 438394 BE379623 Hsl7G93 pepDdytproiylisomerase(cyclophllnH ^§4 

414271 AKD0Q275 K^.75871 protein Mnase C bbidlno proteh 1 6.80 

417168 AL133117 Hs.81376 Homo sapiens inRNA;cONADKFZp586L1 121 (f 6.80 

419S29 AB020695 Hs.91662 K!AA088B protein 6.60 

451686 AA059246 Hs.110293 ESTs 6.80 

55 430829 AW451999 Hs.194024 ESTs ^78 

446601 AI302616 Hs.150819 ESTs 6.78 

442973 BES67665 Hs.288550 Homo sapiens cONA: FU231S6 lis. done L 6.78 

457030 AI301740 Hs.173381 dihydropyrimldlnase^ 2 6.76 

437773 U24186 Hs.283018 lepltoafionprotdn A complex 34 kdsubu 6.73 

60 416018 AW138239 Hs.7B977 proproteinoontferfasesuljillslnfteidnt &72 

425478 AB0079S3 Hs.288a40 ESTs &70 

411643 At924519 Hs.192570 hypothetical protein FLJ22028 6.70 

418555 AI417215 Hs.87159 hypothetical protein FU 12577 &67 

453102 NM-007197 Hs.31664 Mzzlad (Drosophla] homdog 10 6.62 

65 433615 AA732982 Hs.269607 ESTs. Weakly similar to ALU1_HUMAN ALUS 6.62 

450638 AK001826 Hs.25245 hypothe&al protein FU1 1269 6.60 

450480 XS2125 H3.25040 zinc finger protein 239 6.58 

445191 AF048686 Hs.12393 dTDP-CVghicose 4,6^ehydratase &56 

„ 414575 H11257 K8.22988 Homo sapiens done IMAGE:451 939, mRNAse a54 

70 432639 AW973785 gb:EST385886 MAGE resequences, MAGM Homo ^54 

410116 AW530671 Hs.58636 squamous ce&cardnoma antigen reoogniz 6.54 

449894 AK001578 Hs.24129 01117 prot^ 6.53 

442914 AW1B8551 H3.99519 hypothefica) protein FU14007 6.53 

^_ 424745 AA214618 Hs.152759 ac&vatof of S phase kinase 6.52 

75 441801 AW242799 Hs.86366 ESTs ^52 

435542 AA687376 Hs.269533 ESTs ^51 

427072 H38046 Hs.303193 ESTs asO 

418051 AW192535 Hs.19479 ESTs a46 

436217 T53925 Hs.107 fibrtnogeiv«ke 1 a46 

80 439809 R41396 Hs.101774 hypothe8cdproteinFU23045 a46 

430704 AW813091 Hs.3357g9 ESTs a44 

410227 AB009284 Hs£11S2 exostoses {mulilpleHike 2 &43 

417067 AJ001417 HS.810B8 solute canierlMy 22 (extraneuronal a41 
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428392 H10233 Hs.2265 secretory granule, neuroendocrtnaprotel 6.40 

452t98 AI097560 H3.6t210 EST8.W^stmHar to 138022 hypotheti 6.40 

438202 AW169287 Hs.22588 ESTs 6.38 

_ 458311 AF069478 gb:AF069478 Homo sapiens astrocytDiraSb 6.36 

5 451389 N73222 Ks779009 matrix Ga protein ^38 

427899 AA82928B H5.332053 senim aRiytoid A1 6.35 

448693 AW004854 H5.228320 hypothetical protein aJ23537 6.34 

425492 AL021918 Hs.158174 zinc finger piotefn 184 (KfUppeMike) &34 

435532 AW291468 Hs.117305 Hoim sapiens, done IMAGE:36B2908,inRNA 6.32 

10 433908 AW298141 Hs.157975 ESTa 6.32 

430114 AA847744 Ks.99640 ESTs &32 

434032 AW009SS1 Hs.206892 ESTs 6.31 

444656 A1277g24 Hs.145199 ESTs 6.30 

433607 AA602004 Hs.23260 ESTs 6.26 

15 440659 .AF134160 H5.7327 daudinl 6,25 

435663 A1023707 Hs,134273 ESTs a24 

409916 BE313625 Hs.57435 solute carrter fan% 1 1 C)reknv€Oupied 6.24 

436547 AJ297351 Hs.30824 leudne zipper bansoiption factor-like 6.24 

447500 AI381900 Ks.159212 ESTs 6.24 

20 407237 AA169872 Hs.6216 Homo sapiens hepatocellular caidnoma^ 6.22 

417715 AW969587 Hs.86366 ESTs 6.22 

423276 AC003034 Hs.12S261 Homo sapiens Chnxnosome 16 BAC done CIT 6.20 

438138 R98299 Hs.177502 ESTs 6.20 

424590 AW986399 Hs.46821 hypothetical protein RJ20086 6J20 

25 416857 AA188775 Hs.292453 ESTs 6.20 

427667 Ai<001279 Hs.180171 Homo sapiens tDNAFU10417fis, done NT 6.18 

429598 AA811257 Hs.269710 ESTs 6.18 

426921 AA037145 Hs.172865 deavage stimulation factor, 3* pre^ 6.18 

438940 AF075045 H8.271609 ESTs 6.18 

30 400195 MA NA 6.15 

430473 AW130890 Hs.59962 ESTs 6.12 

452291 AF015592 Hs.28853 COC7(ceildhiisionpycle7,S.cerevisi 6.10 

420098 AA775910 Hs.95011 syntropMn. Iteta 1 tdyslroi)Mn-assadat9 6.10 

427513 Ai476318 Hs.192480 ESTs 6.10 

35 448934 AI598134 Hs.225592 ESTs, Highly similar to T51 146 rtng4)0X 6.10 

410389 AW954049 Hs.8177 ESTs, Wealdy similar to PiHUB68divary 6.08 

430345 AK000282 Hs.239681 hypothetical protein FU20275 6.08 

424856 AA347746 Hs.9521 ESTs. WeaHy similar to ZN43JtUMANZlKC 6.08 

409048 H59990 Hs.37699 ESTs 6.08 

40 427674 mUXOm Ks.2178 H2Bhlston6 family. memlierQ 6.08 

452689 F33868 Hs.284176 traisterrin 6.06 

453804 AA300204 Hs.35276 KIAA0852 protein 6.08 

442875 BE623003 Hs.23625 Homo sapiens done TCCCTA00142mRNAsequ 6.05 

408243 Y00787 H5.624 InterieuldnS 6.04 

45 420721 AA927802 Hs.169471 ZAPS protein 6.04 

429393 AA383024 Hs.201603 Homo sapiens mRNA;d3NADKFZp434D0917(f 6.04 

435420 Ai928513 Hs.59203 ESTs 6.04 

428046 AW812795 Hs.155381 ESTs, Moderately simlar to 138022 hypot a04 

407746 AK001962 H8.38114 hypothetical protein FU1 1100 6.02 

50 442116 AI884570 Hs.128813 ESTs &00 

423568 NM.00S256 Hs.129818 growth anest^pedlic 2 6.00 

422011 U30246 Hs.110736 solute carrier fanrtQy 12 (sodiunypotassl 5.99 

441794 AW197794 Hs.253338 ESTs 5.99 

434739 AA804487 Hs.144130 ESTs 5.98 

55 449802 AWS01804 Hs.23984 tvpolhelical protein FU20147 5.96 

420218 AW958037 Hs.286 ribosomd protein L4 5.96 

425707 AF115402 Hs.11713 E74^1«fBdDr 5 {ete domain transcript 5.94 

445546 AW468821 Hs.156054 ESTs 5.94 

43909S AA830185 Hs.269680 ESTs 5.94 

60 452606 N45202 Hs.90012 hypotheficai protein FU23441 SlM 

427701 AA411101 Hs.243886 nudear autoanfigenic spemi protein (his 5.94 

417246 AI760098 Hs.21411 ESTs 5.94 

433190 M26g01 H3.3210 renin 5.92 

418744 A1887288 Hs.196379 ESTs. similar to putative pi 50 (H 5.92 

65 421477 A1904743 H5.104650 hypothetical protein FU10292 5.92 

434342 AI791138 Hs.116768 ESTs 5.92 

406668 T62745 Hs.184411 aitMsnin 5.92 

418668 AW407987 Hs.173518 Mfhase phosphoprotein hcmdog 6.90 

_^ 429984 AtJ050102 H3.227209 hypothetical protein FU21 61 7 5.90 

70 408867 AA437199 Hs.656 celi(ffvlsloncyde25C 6.90 

409913 BE243842 Hs.283077 centrosomal P4.1>assodated protein; unc &88 
450380 At863675 Hs.114017 ESTs ' 5.88 

413026 AA809485 Hs.124219 hypothetical protein FU 12934 5.88 

454653 AW812227 gb:RC2-ST0173-2010994)11^09ST0173Homo 5.87 

75 457876 AI821940 H5.2&4622 ESTs.Modaf8tely8imDartoAUj8JlUMANA 5.86 

437222 AL117588 Hs.12778 ESTs 5.86 

455530 AV655701 Hs.75183 cytodirome P450. sulifamily DE (tihand- 5.86 

426269 H15302 H8.16B950 Homo saptensmRNA:d)NADKFZp566A1 046 (f 5.64 

409045 AA63S062 H8.50094 HornO8aptensinRNA:cONADKFZp434O0615(f 5.84 

80 441645 AI222279 H&201555 ESTs. Wedtly similar to T23406l9potheti 5.84 

401352 5.84 

419088 A1538323 Hs32620 integi1n.beteB 5.84 

431379 AAS04264 H5.182937 pe(^pnlyllsomer8seA(cydoph!lin 6.83 
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439580 AF086401 Ks^3847 ESTs. Moderately s!niaaibS65657 alpha &B2 

424345 AK001380 H3.145479 Homo 8aptenscDNAFU10518&, done NT 5.82 

419220 AAB11938 Hs^l759 ESTs &82 

439303 W00605 Hs.102784 ESTs 5l80 

5 41S954 AA171850 Hs.42251 ESTs 5.80 

418849 AW474S47 Hs^65 Komosa^^nsPI&MrMAformannosyltian &80 

42^5 Nlkl.00&729 Hs.226483 diaphanous (Drosophila, homoiog) 2 &80 

439527 AW298119 Hs^538 ESTs 5.78 

435380 AA679001 Hs.l92221 ESTs 5.78 

10 424088 AI351010 Hs.102267 lysyi oxidase 5.76 

425898 AA365649 Hs^69478 EST6.WealdysIm0artoPC42S9ferri6n &76 

428460 AM28865 Hs38563 ESTs 5.74 

416623 N74925 Hs.38761 Homo sapiens cDNA:aJ21564fls. done C 5.74 

413982 BES03035 Hs279193 ESTs 5.74 

15 453240 Aig69S64 Hs.166254 hypothetical prDtdnDKFZp566l1 33 5.74 

410505 AW762139 H3.314323 ESTs 5.72 

447072 061594 Hs.17279 lyiosylprotein suUotrensferase 1 5.72 

433312 AI24133r Ks.131765 ESTs. Moderately similar b 138937 DNAff^ 5.72 

448968 AW372914 Hs.88149 phospholrujsitol 3-phosphate4)(nding prot 5.71 

20 433384 AI021992 Ks.124244 ESTs 5.70 

415385 R17798 Hs.7535 COBW-lflffl protein 5.70 

441894 AA134329 Hs.24170 Homo sapieRS. done tMAGE:3685396,mRNA, 5.70 

414696 AF002020 H$.76918 Niemann^ldt disease, type CI 5.68 

448666 NMJI149S3 Hs.323346 KIAA1008 prot^ 5.66 

25 412246 A1160873 Hs.69233 zinc finger protein 5.68 

426518 Z43039 H3.170198 K1AA0009 gene produd 5.66 

418269 AA806113 Hs.189025 ESTs 5.64 

443316 AI476463 Hs.18443 aldehyde dehydrogenase 8 family, memtier 5.64 

422805 AA436989 Hs.121017 HaAhlstDnofamfty, member A &62 

30 442252 A173339S Hs.129124 ESTs 5.60 

414372 AA143654 gb2o65a02/1 Stratagene pancreas (93720 &60 

435040 Ai932350 Hs.152825 ESTs 159 

438777 AA82S487 Hs.142l79 ESTs 6l68 

433849 BE455884 Hs.280728 ESTs 5.58 

35 438639 AI278360 Hs.31409 ESTs 5.58 

411274 NVL002776 Hs.69423 tt^UIOBlnlO 5.55 

435008 AF1S0262 K3.162898 ESTs &S5 

434194 AF119847 Hs283940 Homo sapiens PR01 550 mRNA, partial cds 5.54 

452881 AW135220 Hs.241921 ESTs 5.54 

40 426010 AA136663 HS.197S hypothefical proNn FU21007 5.54 

424492 AI133482 gb:HA2093 Human fetal IhrercDNAtibraiy 5.54 

418971 AA360392 Hs,87ll3 ESTs 5.52 

453716 AA037675 Hs.152675 ESTs 5.52 

406972 M32053 gb:Human HI 9 RNA gene, complete ods. 5.51 

45 417543 AA203620 HS.1101S3 ESTs 5.51 

419423 D26488 Hs.90315 KIAA0007 protein 5.51 

434674 AA831879 Hs,136985 ESTs 6.50 

442980 AA857025 Hs.8878 klnesin^l 5.50 

418882 NH.004996 Hs.89433 ATP-blndIng cassette. 8ut>4am]lyC(CFTR 5.50 

50 404227 NA MA 5.49 

412766 BE544475 Hs.54347 ESTs 5.49 

441708 AI469911 HS.2&498 hypotheticdp(OleinFU21657 5.49 

408432 AW195262 gb3(n67ti05Jc1 NCLC6AP.CML1 Homo sapiens 5.48 

437440 AA846804 H5.1 23694 ESTs 5.48 

55 410486 AW235094 Hs.69233 zinc finger protein &46 

456435 AIB80384 Hs.270747 ESTs. WteaMy similar to ALU^UMAN ALUS 5.44 

437378 AI198B23 Hs.160473 ESTs 5.44 

436907 AA737171 Hs.131809 ESTs 5.44 

407300 AA102616 gb:zn43e07.8l Str8ta9eneHeUce!ls393 5.44 

60 413582 AW295647 Hs.71331 hypothetical protein MGCS350 5.42 

441795 N58115 Hs.21137 AD024prot^ 5.42 

452449 AW068658 H3.20943 ESTs 5.42 

439699 AF086534 Hs.187561 ESTs. Moderately similar to ALUI.KUMAN A 5.40 

431941 AK000106 H8.272227 Homo sapiens cDNAaJ20099fis, done CO 5.40 

65 441217 AI922183 Hs,213246 ESTs 5.40 

432383 AK000144 Hs.274449 Homo sapiens cDNAFU20137fis, done CO 5.40 

453891 AB037751 Hs.36353 Homo sapiens mRNA full length Insert cDN 5.40 

408690 AW864542 gb:PM4^N001 &-1 20500003^2 SK001 6 Homo 5.40 

„ 433759 AA680003 Hs.109363 Homo sapiens cONA:HJ23603fis. done L 5.39 

70 446142 AI754693 Hs.145968 ESTs 6.38 

408562 AI436323 Hs.31141 Homo sapiens mRNA for K1AA1 568 protein. 5.36 

433929 AI375499 H3JZ7379 ESTs 5.36 

421155 H87879 Ks.102267 lysyl oxidase 6.34 

424653 BE549737 Hs.132967 Human EST done 122887 mailnertransposo 5.34 

75 453931 AL121278 H3.25144 ESTs 5.34 

409091 AW970386 Hs.269423 ESTs 5.33 

416057 AI9Z7362 Hs.29857 ESTs 5.33 

438647 AA813116 H5.163230 ESTs 5^ 

415091 AIJ044872 Hs.77910 S^iydroxy^thylglutafyl-Coenzyme A sy 5.32 

80 445038 A1635444 Hs.143917 dJ467N11.1 protdn 5.30 

408622 AA056060 Hs.202577 Homo sapiens CONAFU12166 (is. done MA 5.30 

403432 &29 

435820 AA7005B0 Ks.189000 ESTs 5.28 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 

70 



401714 
449S08 
413151 
414653 
417372 
443813 
412B10 
408943 
41S139 
447982 
430789 
453921 
409582 
420911 
422356 
418661 
446271 
435905 
434551 
415245 
436016 
431242 
439B18 
424281 
449138 
449416 
430092 
436574 
433377 
440987 
426116 
441928 
432557 
438011 
437257 
423926 
433393 
415757 
420170 
420493 
425739 
440652 
419706 
427728 
416113 
446223 
407624 
447197 
452465 
442833 
448952 
408170 
424238 
421072 
424717 
423654 



NA 

AK001566 

H47969 

U31116 

T99755 

AI0793S6 

X90908 

NiyL007070 

AW975942 

H22953 

AA632577 

A[824009 

R27430 

U77413 

BE54S072 

NM_001949 

D82484 

AW997484 

8E387162 

N59650 

AA805465 

AA987742 

AL360137 

AA766243 

AW294215 

A)651016 

AI821399 

AW293527 

AI752713 

AA911705 

AA865729 

AI3701B8 

AA831815 

BE466173 

A!28308S 

X03833 

AF038564 

AA830854 

U43374 

AI635113 

T19016 

AI216751 

C04649 

AJ245600 

AA173S25 

6E300091 

AW157431 

R36075 

AA610211 

AA3281S3 



436554 
433264 
452387 
412666 
430287 
413^ 
418217 
401480 
456179 

TABLETS 



AW204516 
AA337401 
AI21S069 
H03764 
AI674253 
AI821940 
AI985810 
D8S782 
AI680772 
AIJ080116 
AW182459 
AL047483 
AI910647 
NA 

H75490 



Ha23618 
Hs.141971 
Hs.77501 
Hs^728 

Hs.74126 

Hs.49105 

HS.46S24 

Hs.137551 

H3.310235 

Hs.44577 

Hs^71565 

Hs.100293 

Hs.122579 

Hs.1169 

Ks.330994 

Hs.5003 

Hs.260856 

Hs.27252 

Hs.121536 

Hs^51278 

H3.19934 

Hs.195631 

Hs^48311 

H5.16514 

HS.12G465 

Hs.43845 

Hs.130229 

H3.144694 

Hs^11454 

H3^70940 

Hs.145696 

Hs^90931 

Hs,1722 

Hs^8074 

HS.187B10 

H3.95631 

Hs270386 

H8.159410 

Hs.143977 

Hs.77899 

HS.1B0545 

HS.1187S8 

Hs.119699 

Hs^48941 

Hs^244 

Hs^201 

Ks^8038 

Hs^1835 

Hs.137635 

Hs.89113 

Hs.152213 

Hs^828 

Hs^64622 

Hs^01173 

H$^229 

Hs^6094 

Hs.74420 

Hs.125759 

Hs^498 

Hs.13442 

Hs.271930 



NA 


&28 


hypolhsttea] protein RJ10704 


&28 


ESTs, Weakly similar b AUMJfUMAN ALU S 


&28 


san9giyc8n,beta(43kDdystmph!}ves50c 


S.28 


ESTs 


Si8 


gbsz39b09j1 SQares_NhHMR)_S1 H&mosap} 


5.28 


fatty add binding protein 6. Deal (gas 


&27 


FKBP-assodated protein 


5.26 


ESTs 


5.26 


ESTs 


5.26 


ESTs, WbaMy simBar to 178885 seiineAh 


&24 


ESTs 


5J24 


ESTs 


5.24 


CMinkad N-acetytgluoosamine (QcNAc) tr 


5^ 


hypotheOcd protein FU10461 


573 


E2F transcription factor 3 


5^ 


ESTs 


5.22 


KIAA0456 protein 


5.21 


ESTs. HigiiVsimBartoA35661 DNAexds 


5.21 


ESTs 


5.20 


Human DMA sequence from done RP11-472E5 


&20 


KIAA1201 protein 


5.20 


Homo sapiens niRNA fiiD length insert cON 


5.20 


Gb:oa13b11^1 NCLGGAP.GCBI Homo sapiens 


5.20 


ESTs 


5.20 


ESTs 


5.20 


ESTs 


5.20 


ESTs 


5.18 


ESTs 


5.18 


ESTs 


5.18 


ESTs 


5.18 


ESTs 


5.17 


ESTs. Weakly similar to 178865 seiine/lh 


5.17 


splidng factor (CC1.3} 


5.16 


ESTs, Weakly similar to YFJ7.YEAST HYPOT 


5.16 


InteriBukinl, alpha 


5.16 


itchy (mouse homolog) E3 ubk^uHin prote 


5.15 


ESTs 


5.14 


Human normal kerallnocyte mf^ 


5.14 


ESTs, Weakly similar to 178865 sei&ie/th 


5.12 


mdybdoptsrin synthase sutfuiybse 


5.12 


ESTs 


5.12 


tropomyosin 1 (alpha) 


5.12 


Homo sapiens mRNA for hypothetical prote 


5.12 


ESTs. Weakly similar to RLF [H^apiens) 


5.12 


ftypoUietkal protein nj12969 


5.11 


ESTs 


5.11 


gb7h88t)01.8l Soares placenta Nh2HP Homo 


5.11 


ESTs 


5.10 


ESTs. Weakly slndiartoAChain A. Oryst 


5.10 


ESTs 


5.10 


ESTs 


5.08 


ESTs 


5.07 


ESTs 


5.06 


wingless^ MMTV integrafton site fami 


5.06 


ESTs 


5.06 


ESTs. Moderately simHartoALUSJIUMAN A 


5.06 


ESTs 


5.05 


cysteine dloxygenase. type 1 


5.04 


trimmleottde repeat containing 12 


5.04 


origin recognition complex, subunit 3 (y 


5.03 


ESTs, Weakly siniar to LEU5J4UMAN LEUXE 


5.03 


GTP-Unding pre^ homologous to Saodis 


5.00 


ESTs 


5.00 


NA 


5.00 


ESTs 


5.ra 



Pkey: UnhpjeEospnjtjesetUenfiflernimdier 

CATnumben Gene duster number 
Accession: Genbank accesston numbers 



75 
80 



Pkey CATnumber Aooesston 

408432 1058667J AW195262 R27868AW811262 

408690 107490J AW864542 AA056557 AW882724 

411765 125700_1 H43346AA248302AA095182 

414372 143909J AA143654AW753140AA213770AW970865AA568075AA492132 

424281 23774^.1 AA766243AA3382S2AA338213 
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424432 240008J AI133482AI207619AA341B26 

428679 294(M9J AA431765AA432015 

430848 324621.1 AW021726AA487752AA488085 

432340 345248L1 AAS34222AA632632T81234 

432383 345469.1 AAS34469AW970240AW970323 

432639 351744_1 AW973785 H80163AA55760B 

434509 38950J R76593 AF147390 R76594 

439518 47334.1 W76326 AF08S341 W72300 

443613 575391J A1079356 W23287 

447197 711623.1 R36075AI366546R36167 

447974 745643^1 R76885 AI453674 R77049 

452712 928309.1 Am36816AW838660BE144343AI914520AW888910BE184854BE184784 

454653 1228081 J AWB12227AW812294AWB12092 

456311 543550.1 AF069478AR)69479AFa69480 



TABLE 7C 



Ptey: Unique number conespondli^ to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this oohimn are Genbank MenGter (GJ) number, ^ham L et si* refers to the puUicafion entiDed TYie ONA 

sequence of human chromosome 22.* Dunham i et al., Nature (1999) 402:489^95. 
Strand: Indicates DNA strand from which axons were predictod. 

NLposISon: Indicates nucleotide positions of predicted axons. 



Play 


Ref 


Strand 


NtjMKlBon 


400534 


6981826 


Minus 


278637-279292 


401352 


9931258 


Minus 


26064-26208 


401460 


7321503 


Phis 


166120-166347^66451-166557.169651-169832 


401714 


6715702 


Phis 


96484.96661 


401742 


2911728 


Plus 


6400^^147 


403432 


9719611 


Minus 


68204^2 


403778 


7770611 


Minus 


1414-1513,1624-1756 


404227 


7838233 


Mnus 


93110-93259 


404567 


7249169 


Minus 


101320-101501 


404996 


6007690 


Rus 


37999<38145.38852-38998,39727-mr2,40557-40674.4235M2450 


406360 


9256107 


Minus 


7513-7673 



Tabte 8A shows 538 genes sIgnKicantiy down-regulated in colon cancer compared to nomial 00^ 
0.33. 

TABtJ 8A: 538 GENES StGNIFICANTlY DOWf<-REGUlATED m COION CANCER COMPARED TO NORMAL COLON 

Pley: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

Un^enelD: Unigene number 

LMgeneTIIto: Unigene gene title 

R1: Rattoofhunortonomnalcoton 



Pley ExAccn UnigenetO Unigene Title R1 

421996 AW ag807 Hs.1460 glucagon 0.0233 

429970 AKD00072 KS.2270S9 chloride channd.C8lchimac6v8ted.fiam 0.0307 

457407 AA505035 Hs.l95651 ESTs 0.0416 

423690 AA329846 H8.23804 ESTs. Weakly similar to PN0099 8003 prot 0,0664 

426651 AU076646 Hs.171683 nuctearieceptorsubfamityl. group Km 0.0567 

425920 AIJ049977 Hs.162209 daudinB 0.0601 

431436 AA50503S Hs.195651 ESTs 0.0607 

433084 M18079 Hs.282265 tatty add binding protein 2, intestinal 0.0629 

442009 AI733281 H8.128320 ESTs 0.0634 

416869 AW2S0318 Hs.60395 mal. T-ceU (SRerentiaSon protein . 0.0707 

429050 XB1333 Hs.194777 mepr1nA,beta 0.0714 

427019 AAD01732 Hs.173233 hypothettea! protein FU10970 0.0735 

446947 AF146747 Hs.232165 polycythemia mbra vera 1; cell surface 0.0739 

429657 D13626 Hs.2465 KIAA0001 gene product; putative G-protel 0.0769 

443506 H10661 Hs.192124 ESTs, Weakly similar to 138022 tnpothetl 00838 

415314 N88802 Hs.5422 gtyooprotein MOB 0.0853 

451161 Ar7g6330 Hs.207461 ESTs 0.0673 

429001 AF098951 Hs.194720 ATP4)lndlhg cassette, aut^fanByGWIWr 0.08B8 

426635 BE395109 iis.129327 hypothetica!prateliiMGC13057 aOSOO 

429350 A)754634 H8.131987 ESTs a0927 

432251 AWg72983 HsJZ32165 potycytharnianibraveraliceasuifeoe a0931 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
•70 
75 
80 



411S29 


AA430348 


Hs^17596 


nwltti sapians BUNA t%Ji£SCl Va, CKXiB N 1 




441066 


AW20S427 


HS.19Q726 


COloi VtoBNf Sinnn Of fUJl/l^UIVInri nUI O 




431252 


NM_005478 H5^1380 


insunn-<ikd5 


U.U9oO 


437099 


N77793 


Hs.48659 


PQTe Mrnhkf rimllof In Ql^Cfilaminln B 

C9io» rugiiij siTuiai ni oi'Kt30 lainunn b 






AI075877 


Hs.125461 


hvnnfhf»nnnl rvnfpfn Ft 11 


n inn? 




D63175 




MifUl HkiCniOnOD P1nn4<M>K tiiijuLim ■jMMattt'i mm 

giKnuMMiiDwo LiOfuficn nunon pidcenia po 


n iim 


409921 


AW600239 




1 UVUU9 puCtVr 1 HDldJ/ nuiliu oc^cnS 


A ins? 




X63597 


Ks.2996 


A iin7 


4^66 


NML004S73 Hs.241519 


cnyiDpOjeuliHUvo 1 


U. 1 1 l*t 


*iZ/lO( 


A)239607 


H5.99196 


nypOuioucaj pnnstn Mbu i iJ24 


n 11VI7 
0.1 14/ 


^1919 


AW242447 


Hs.14o182 


cytosoBc bol&tf ucosldflsfl 


n 11R7 

U.I IDf 




AF047826 


HS.12S887 


csdhsiln 19, typo 2 


A 11QA 


Aii'iM 
91 100 1 


AW841662 


Hs.306831 


Momo sfiptons cuna. riJctyvi ns, ciono n 


0.1211 




AW9612B4 


Ks.296235 


CCTe 


A 10^ 




Z7029S 


Hs.32966 


Qudnylstd cyd3S6 sc^stor 26 (uroQUdny 


A 19/A 
U. iZ4U 








A lo^a 


■1^1910 


AI934365 


K5.IO9439 


ostBogiycin (ostGotnaucuvs factor, nuind 


A 197^ 




AW85G093 


Hs.1 83817 


CdlS 


ft 1977 




BE277929 


Hs,11081 


UBa oofnaiKontaning 2 


0.1277 




AA370141 


Hs.2281 


chrofnoQTflnin B (sociBtogrsnin 1) 


A 19Q1 




AA285136 


Hs301914 


nsuronal spedSc transcftpOon factor D 


A lono 




AW205369 


HsJ12830 


ESTs 


0.1315 


j| 017X1 


AK001679 ' 


Hs.107527 


nypouisticai protQin rUiiiUi/ 


A ItOC 
0.1 j£9 




AWB20852 




9I>.KU£S}Iww1*IAIAaHI1M1< OlVvVI nOiliD 


A 199R 
U.I J4o 




T77609 


Hs.1 17970 


snkyrin 2, neuronal 


ft 


4^9049 


AJ000512 


H8.298323 


SQiunVQlucocor6co!d regulated ktnsso 


A ITTO 


ill ififin 
411O0U 


AW872477 




90.ninounMj(i NUjUuAr.i 1^4 nomo sapiens 


A 190^ 


iiifiCfM: 
41tK)oO 


X54162 


Hs.79386 


BKUnOQin 1 [GmOOtll nXlSCiO) 


A IMC 




AF2S5910 


Hs.54650 


liin#4tnnal aHhaelnti mrJnriitn 9 

juncuona BonesiDii muiccuie £ 


U.143D 




A8020629 


Hs.38095 


t\ 1 r-UinQing CoSScUOf 5UO>TdJiUiy f\ \/\o\* 1 


U. 1*1 09 


•HWf Of 








A liRft 




AJ235664 




QDinOfno sapiens mKNA tor imnuinoQioouBn 


A 1Afi9 
U.I4D£ 


427359 


AVV020782 


Hs.79881 


tl«\mn eantaiw <*fl&lA* CI I9WM fSe f<lnnA 1 

rwiliu sapions cmw rucowo us, uono l 




452768 


AW069459 


Hs£1539 


Cvi9 


0.1466 




AK000268 


Hs^7383 


nypoineDCol pnueui 


A 1/71 
U.l4f 1 




M31158 


Hs.77439 


pioteln tdnase, cAMP-depondent, te^ato 


A 1/71 
U.I4f 1 


407551 . 


. Y10516 




QO ji.S8pten5 IDKNA lOT uuoo 1 i proiein. 


0.1486 


4U2lJrD 






A lJltt7 

UL140f 




A1478427 


Hs.43125 


esopnageal cancer related gena 4 protein 


A ICAA 

0.1500 




AW444761 


H3.44565 


ESTs 


A ICAA 


■MlfUO 


At816086 


Hs,296341 


adenyty) cydase-assodated protein 2 


ft i<;i^ 


A'MttAll 

4tyi>4i> 


AI824164 


Hs.77667 


lymphocyte antigen 6 compiexi locus E 


0.1523 


4i0oon 


AF133820 


Hs^5 


Clin Immunoglobuiin domain protein (myo 


0.1529 




AW023337 


Hs.5422 


glycoprotein M6B 


A 1R19 


4^t>UJ4 


AI276989 


Hs.56123 


Honu) sapleiu cDNA ru 19443 nSt ^ono PL 


0.1538 


429609 


AF002246 


Hs.210863 


cell adhesion molecula with honulogy to 


0.1542 


40B221 


AA912163 


Hs,47447 


ESTs 


0.1552 


425220 


AW975317 


Hs.162987 


ESTs 


0.1558 


445200 


AA084460 


Hs.12409 


somatostatin 


0.1558 


AA^VM 

44a2JO 


T78886 


Hs.284450 


ESTs 


0.1563 


•K>t>U04 


AA256213 


Hs.72010 


ESTs 


A 4 COO 

0.1002 


428133 


AW167727 


Hs.11873 


ESTs 


0.1605 


AAT*iCi 

447zo1 


NM_006691 


Hs.17917 


extracellular Urai domaiv-Gontaiung 1 


0.1615 


•Ml r04 


AA693951 


Hs.180284 


CCTe 
CO 18 


A IRTT 
U.lDJf 




A1S68801 


Hs.71721 


col 8 


U.1000 


il1Q09C 


T28499 


Hs.89485 


carbonic anhydrasa IV 


A icce 
0.1 boo 


*l 1 


AI365585 


Hs.146246 


ESTs 


ft IRRfl 


AA'iAtiti 

442495 


R55073 


Hs.124130 


ESTs 


0.1676 


4gU09<J 


AW450461 


Hs.203985 


CCTe 

colS 


A iftoa 
0.1 Dso 


420736 


AI263022 


K8.82204 


CCT« 
COlS 


A 171B 
Vllf 10 










A 17^6 










ft I7i;i 


4£/«}«M 


AF087797 


Ks.176658 


aQuaportno 


A 17C7 
0.1 ?07 








v.lrDJ 


il497nc 


BE044593 


Hs.1 12704 


ESTs 


(K1767 


^IWCfi 

41^000 


T28160 


H5.778 


guanylate (^asa ac6vator IB (rethis) 


(X1769 


*mJD9oU 


S69265 




\n\Jnc) 


A 17Q1 

0.1 rol 


441 OOD 


ALJ03525O 


Hs.1408 


enooineiin i 


0l1784 


452854 


AA437Q81 


Hs.14060 


proklnelicin 1 precursor 


0.1795 


400514 






U.1B05 


4941 OD 


BE141030 




nk»MDn-Urnna7 munnn fwv» U44 UTIMCT Unmn 

go.MnU>n 1 wOf'£0iV9lHMlrm I n 1 uuor nOmO 


n iftftft 
U.I0UO 


410765 


A1694972 


HS.661B0 


nudeosome assemt)!/ protein l-like 2 


0.1812 


413724 


AA131466 


Hs^7 


hypothetical protein FU12665 


0.1812 


412474 


AI791451 




gb:ni50c09.y5 NCLCGAP.0v2 Homo sapiens 


0.1812 


436008 


A»79428 


Hsi8785 


ESTs 


ai620 


42M24 


AF150241 


H8.128433 


piDStagtandln 172 synthase, hematopoietic 


0.1828 


431728 


NiyL007351 Hsi68107 


nuillimerin 


0.1832 


419746 


AW867943 


Hs.127216 


hypothetical protebi FU13455 


ai835 


410877 


NhA.003278 Ks^24 


tetranecQn (plasmino^ivUndlng protein 


ai838 


415672 


N53097 


H3.193579 


ESTs 


0.1838 


419050 


NM.000036 H8.89570 


adenodne monophosphate (teanAiasa 1 (|so 


0.1838 


417342 


W40277 


Ks.81994 


glycophoffa) C (Geibich Uood group) 


0.1842 


413714 


AIS60944 


HS.7142B 


ESTs 


ai845 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



427081 
4052B2 
400163 
412295 
447414 
407891 
437140 
431544 



447184 
453898 
423743 
428412 
420056 
434683 
421885 
452786 
407601 
419758 
437740 
405610 
437145 
416961 
401465 
437425 
416231 
401753 
433430 
432150 
411644 
403957 
435900 
424208 
445500 
419956 
418026 



AB032971 Hs.173392 KIAA114Sp(0teln 



Ks.1 17178 poly(AHiIndbig protein, nudaar 1 

Hs.18551 neuroUastoma (nen^e tissue) protein 

Hs.41135 endomudn-2 

Hs.283689 acQvstorof CREMtntesSs 

Hs.299329 Homo sapiens cDNA FU20763 lis, done CO 

Hs.144464 ESTs 

Hs.17518 Homo S8ptenscig5mRNA,par&i sequence 

Hs.42746 ESTs 

Hs.173373 KlAA0931pfOl^ 

H5.126083 ESTs 

Hs.94694 sapienscONA RJ10561 tis^done NT 

Ks.202639 ESTs 

Hs.109012 MAX dlmerfzafion pratdn 

Hs.106642 ESTs. Weakly simBar to TD9052 hypotheO 

Hs.37129 sodium channel, nonvottaoe^ated 1, beta 

Hs.93173 phosphodiasterBsa 6C| cGMP-specOc, eon 

Hs,122915 ESTs 



D82343 

AA486620 

AA312799 

AK000770 

AI217900 

AF026941 

AA037615 

AB023148 

AA428240 

AK001423 

AW298724 

AA609911 

R61362 

AC002300 

U31973 

AA810265 



401361 
426452 
433476 
42340S 
442826 
427060 
437354 
447734 
424585 
458016 
423893 
401521 
430130 
401024 
414602 
441083 
417355 
422440 
442930 
431069 
444567 
405654 
415471 
449243 
436088 
434098 
427552 
416439 
459390 
459395 
439039 
43357S 
416035 
428415 
441899 
437191 
434839 
435731 
400865 
446294 
414193 
411514 
453098 
430378 
432374 
443482 



AF007216 Hs.5462 

BE391476 Hs.80617 

AW183714 Hs,20981 

H30333 HS.165C82 

A1863735 Hs.186755 

AK000224 HS.27Z789 

K92064 H$.278626 

AI243036 Hs.16094 

AW583123 Hs.143113 

AW451938 Hs.257512 

AL137939 Hs.40096 

BE379727 Hs^213 

AA72242S H$.182765 



solute carrier famSy 4, sodium Ucarbon 
ritxjsoma) prot^ S16 

hypothetic^ protein DKRp547M236 
ESTs 

ESTs 

hypoUieOca] protein FU20217 
Arg/AbMntefBcting protein Arg8P2 

ESTs 

pancreaStc l^tase-felaied proteli 2 

ESTs 

ESTs 

fatty add binding protein 4, adipocyte 
ESTs, Moderately Similar to 1 207289A lev 



AW614271 Hs.121647 ESTs. Highly similar to AC006014 8 simO 

AA594394 Hs.152616 ESTs 

NML0141S1 Hs.128155 HSPC053 protein 

A1018777 HS.131241 ESTs • ■ 

AW378993 Hs.90286 ESTs 

AA749215 Ks.291B86 ESTs 

AI421412 Hs.163659 ESTs 

AA464840 Hs.131987 ESTs 

AW188099 Hs.131813 ESTs 

AIJ031709 Ks.134846 Human ONAEequance from done 316G12 on 

AL137311 Ks^34074 ^mo sapiens mRNA;cDNADKFZp761 6021 21 ( 

AI793107 Hs.27018 
BES62811 

013168 Hs.82002 

NM.004812 Hs.116724 

AW881975 Hs.213923 

BE04139S H3.2B3876 

AV654020 Hs.184261 



F09747 Hs.288707 
AW295031 Hs.198671 
AA704687 Hs.191294 



Hs.179608 
Hs.118769 

HsJ81935 
H3.48713 
Hs.39720 

Ks.184222 

Hs.8022 

Hs^lSSS 

Hs.134738 

Hs.166811 



Nlil.005771 

AA180363 

BE385725 

Z30300 

A1656707 

AA600175 

H42314 

AA337211 

A1372588 

NhL008846 

AI743069 

AA699581 



A)284935 
BE260069 
AW850178 

Z25935 Hs,86379 

Z29572 Hs^ 

W68815 HSJ01885 

AW186093 K1250385 



0b:6O1336446F1 N1K-MGC^44 Homo sapiens c 
endotheiin receptor type B 
aldo-keto reductase fiamily 1, member 610 
ESTs 

ESTs, Weakly similar to unknown protein 
ESTs, SimBar to T26686 hypotheQ 

ESTs 
ESTs 
ESTs 

0b»f69QO8j1 SoaesJIhHMPiLSI Homosapl 

letlnd dehydrogenase homdog 

ESTs 

gb;601276347F1 NIHJAGC^ Homo sapiens c 

ESTs 

ESTs 

ESTs 

gb:yo09e02^1 Soares adult brdn N2b5HB5 
Down syndrome crtOcal regkxi gena 1 
Tim protein 

senna prataasa Inhibitor, Kazal fype, 5 

ESTs 

ESTs 

gb:qk55g09jc1 NCLCGAP.CoS Homo sapiens 
gb£011S0964F1 NiH.M6CJ9 Homo sarins c 
gbdl3^219-27109&<>22-H12 a0219 Homo 
ESTs 

himor necrosis lactor receptor Euj^ffami 
Homo sapiens cONA FU11346 8s. done PL 
ESTs 



ai847 
ai848 
0.1855 
0.1863 
0.1876 
ai895 
0.1901 
0.1904 
0.1905 
0.1926 
0.1928 
0.1941 
0.1942 
0.1944 
0.1957 
ai957 
0.1969 
0.1981 
0.2004 
0.2016 
a2017 
02025 
0.2041 
a2045 
a2049 
02049 
0.2050 
0.2051 
0.2052 
0.2059 
012083 
OL2070 
02075 
O2075 
O2090 
O2093 
02118 
021 20 
02122 
02125 
0.2130 
02132 
02137 
02137 
02144 
0.2146 
02151 
0.2151 
0.2157 
0.2165 
0.2171 
02179 
0.2185 
02186 
02188 
0.2193 
0.2209 
02212 
02217 
02222 
02229 
02232 
02242 
02243 
02244 
0.2254 
02257 
02268 
02268 
0.2273 
0.2283 
02283 
O2290 
02294 
02299 
0.2304 
0.2305 
O2309 
02315 
02315 
0.2319 
O2320 
02328 
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432134 A1816782 Hs.122583 hypothetical protein RJ21934 0.2329 

421539 AA292747 Hs.97296 ESTs a2330 

448520 AB002357 Ks.21355 doutdeoortln and CsM kin8se4il(e 1 a2344 

_ 408001 AAD46458 Hs,95298 ESTs 0.2347 

5 409331 M38634 Hs.53973 vasoacQvetntsstinai peptide 0.2351 

431094 AWg72Z76 Hs.116195 ESTs a2354 

429575 AA706003 Hs.99387 ESTs a2358 

404958 0.2361 

439731 AI953135 H3.45140 hypolhelica) protein RJ 14084 0.2364 

10 4S2742 AV\^89945 Hs.97876 hypothetical protein OKFZp564KD322 Ol2380 

417511 AU)49176 Ks.82223 chocdin^ a2381 

404927 a2387 

430297 AW243166 Hs.129806 ESTs 0^412 

447482 AB033059 Hs.18705 KIAA1233 protfita 0,2415 

15 418332 R34976 Hs.78293 ESTs 0.2416 

454145 AA046872 Hs.62798 ESTs a2421 

422472 R59096 Hs.279939 mKochondrisd canter homolog 1 a2424 

404070 0.2427 

421232 AA989220 HsJS2100 ESTs 0.2427 

20 414539 BE379046 gb:601236646F1 NIH_MGC.44 Homo sapiens c 0.2432 

412622 AW664706 Hs.171959 ESTs a2433 

454430 AI082777 Hsj61384 semadoniabi, seven thnmbospondin repeat 0.2437 

416694 AW161284 Hs.79554 neuronal PAS domain protein 1 0.2443 

426724 AA383623 Hs.293816 ESTs 0.2444 

25 405073 0.2445 

401238 H24185 Hs.92916 hypothetical protein 0.2445 

414203 BE262170 Hs.76629 ATPase. Na4./K^ transporting, beta 1 poty 0.2451 

401776 0.2452 

404696 0.2462 

30 426666 AW500131 Hs.171763 C022 antigen 0.2471 

427078 A1676C82 Hs.1 11902 ESTs 0.2474 

424682 AW604804 Hs.151717 KIAA0437 protein 0.2478 

440383 AA6&4208 Hs.30484 ESTs 0.2481 

^_ 419118 AA234223 Hs.139204 ESTs 0.2494 

35 443515 AV657547 Hs.266014 Homo sapiens, done liyiAGE:3867243. mRNA 0.2495 

424648 AA344576 gb:EST50478 Gail bladder I Homo sa(riens 02499 

404605 0.2500 

446066 AI343931 H8.149383 ESTs 0.2505 

408345 R936S1 Hs.63063 ESTs 0.2506 

40 418358 102840 Hs.84244 potassium voltag&^ed channel, Shab^ 0.2508 

416950 AL049798 Hs.80552 demiatoponGn 0.2510 

423555 AW958201 Hs.17e589 hepatocellular carcinoma antigen gene 52 0.2513 

449833 R82252 K8.106106 protein kinase (cAMP-dependent. caialyfi 0.2515 

459275 A1808913 HS.3393S2 Homo sapiens brother of COO (BOC) mRNA, 0.2519 

45 405897 M57417 gb:Homo sapiens mucin (mucin) mRNA, part 0.2523 

422743 BE304678 Ks.119598 ribosomai prot^ L3 0.2526 

459668 U72671 Hs.151250 Intercellular adhesion molecule 5. telen 0.2532 

450880 AK002183 H1285685 Homo sapiens CDNAFU11321 lis. done PL 0.2536 

457330 AB013818 Hs.247220 peroxisomdbiooeneslsfoctorlO 0.2536 

50 451979 F06972 Hs.27372 BMXnoiHeceptoriymslne kinase 0.2549 

440274 R24595 Hs.7122 scrapie responshn protein 1 0.2553 

430097 AI523245 Hs.127638 ESTs 02558 

410626 8E407727 gb.'601299771F1 NliiMGCJI Homo sapiens c 0.2564 

402695 0^565 

55 453992 AW01499$ Hs.201080 ESTs 0.2569 

453888 AW450670 Hs;S2819 ESTs 0.2569 

401371 0.2574 

456145 BE299427 Hs.21446 KiAA1716 protein 0.2579 

^. 408134 AK000184 Hs.42945 acid sphlngomyennase-Gke phosphodieste 0.2580 

OO 422591 11)7648 Ks.118630 MAX^teiacSng protein 1 0.2562 

452359 6E167229 H3.29206 hypothetical protein MGC1 4376 0.2584 

447569 AI393202 Hs.147554 hypolheGcai prot^ FU23392 0.2586 

405860 0.2568 

420321 D78761 Hs56657 hyothetical protein 0.2595 

65 454415 AK000846 Hs.5867g solute carrier family 7, (catkmie amino 0.2602 

437032 AW887372 Hs.302063 ImmunogtobidEnh^ constant mu 02604 

448025 BE502965 Hs.170426 ESTs O2605 

444304 AW628433 Hs.271296 ESTs, Weakly similar to 154374 gene NF2 O2605 

424885 AI333771 Hs.82204 ESTs O2608 

70 425381 084371 HS.18SB paraoxonasel 02611 

457413 AA743462 Hs.165337 ESTs 02618 

452078 AA022620 Hs52170 ESTs 02624 

450785 AA852713 Hs^9 Homo sapiens, alpha-l (VI) odlagen 0.2628 

401974 02639 

75 411319 6E537094 gt»:601083333F1 MHJ^JO Homo sapiens c 0.2646 

417761 R13727 Hs.21435 ESTs 0.2648 

426132 AA370501 gb:EST82261 Prostate gland I Homo sapien 02653 

455771 BE084820 Hs.188711 hypotheOcsl protein FU20070 0.2653 

414349 BE512968 gb:601172296F1 NiHJAGCLl5 Homo sapiens c 0.2660 

SO 402182 • O2660 

402610 0.2661 

444814 BE010749 Hs.2550g7 ESTs 0i663 

450017 W56434 H5^1608 ESTs 02683 
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408684 
444209 
415022 
416184 
422909 
412047 
426356 
442238 
402425 
450545 
417118 
419850 
428020 
441493 
413541 
428470 
455597 
447809 
444195 
415160 
421823 
434464 
414376 
430073 
432018 
422954 
416397 
442420 
410950 
427114 
448466 
434445 
457115 
459511 
421321 
433633 
440236 
405691 
405334 
403047 
412506 
441042 



444453 
457736 
454012 
427448 
419564 
435021 
413344 
447787 
457290 
458244 
437483 
407938 
417332 
428212 
424433 
425195 
404769 
411620 
428746 
431822 
441290 
422033 
421935 
447955 
405364 
422165 
431087 
450610 
445627 
436144 
445152 
430304 
455614 
419683 
411886 
430770 
444459 
444918 
410845 



R61377 Hs.12727 hypotbefcal protein FU21610 

AI753134 Hs.146494 ESTs 

X59960 Hs.77813 spWngomyeDn phasphocflesteraM 1, add 

R48481 Hs^rn EST8.Vteak!y^n]ilartDALU6_HUMANALUS 

AA533356 filCTi67f10^1 NO.CGAPJMO Homo sapiens 

AA934589 Hs,49696 ESTs 

BE536836 Ks.98682 hypcOiefeal protein FKS632 

AW135374 Ks^0949 ESTs, Moderately sini!arbF419fflliypol 

AW135582 Hs^1767 ESTs 

U38654 Hs.50477 RAB27A.fnennberRASonoooeRe^ 

F06844 gtKHSCIMEOginomializedlftbntbralncDN 

119058 HS.181S81 glutamate receptor, ionotropic, katnate 

AW070446 Hs.127037 ESTs 

BE147036 gb:QV44n'0222-091 1994}24«10 HT0222 Homo 

AO002301 HS.184S07 Homo sapiens Chromosome 16 SAC done OT 

BB)08545 Hs.1561 10 immunogtolnilin Icappa constant 

AW207605 H8.164230 ESTs, Highiy similar to JC7256 3',5'-cyc 

AB002351 H$,105B7 KIAA0353 protein 

T82802 gb7d38a04.r1 Soares fetal Over spleen 

N406S0 HS.2862S ESTs 

6E063921 Hs^5971 ESTs 

BE393856 Hs.66915 ESTs. WBaldy8indarlo16.7Kdpmt8}n[ 

U86136 Hs^70 tefomerase-assodated protein 1 

AA524447 Hs.152377 ESTs 

AW996605 gb:PMO^N0065-100300^1-b10 6N0065 Homo 

H53035 Hs.337620 Homo sapiens AFG3L1 isofbrm 1 mi^ part 

AI024834 Hs.131729 ESTs 

AWB11&33 gb:RC2.ST01584}910994)11^ST0158 Homo 

A)219896 H3.97592 ESTs 

AI522109 Hs.171068 ESTs 

AI349306 Hs,11782 ESTs 

AA420712 9b:nc63dJ7^1 NCL0GAPJY1 Homosapiens 

A!14237g gb:qg64c01.r1 Soares^teslisLNHr Hdmo sap 

NM_005309 Hs.103502 gtutamlopynivate transaminase (alanine 

AI880516 Hs.84630 ESTs.Wealdy8imiiarto200439gAdvomos 

AW996722 Hs.125297 ESTs 



AW957159 gb:EST369229 MAGE resequences. MAGD Homo 

AA077736 gb:7B48A07 Cttromosoma 7 Felai Brain cDNA 

AA764768 Hs.121158 tiypotiteQcai protein OKFZp434J01 13 

AW379394 Hs.145126 ESTs 

AK000390 HS.420S irypotheScai protein FU20124 

M76424 Hs J7014 carbonic anhydrase VII 

BE246449 Hs^157 WislcottAtdricI) syndrome (eczema^hiombo 

U08889 Hs^1139 sdlutBC8rTferMiy1(neuraiUlfepIilie 

AA922192 Hs54709 ESTs 

U46024 Hs.75302 myctubular rnyopaOty 1 

BE620108 gb:601483015F1 NIH_MGC.69 Homo sapiens c 

AA465293 Hs.105069 ESTs 

AI929453 H3.122489 Homo sapiens cONARJ13289 (is, done OV 

AL390174 gb:Homo sapiens mRNA; cDNA DKFZp547J164 

AA905097 Hs^50 phospholamban 

AW972717 Hs^88462 liypothetical protein FU21 511 

AW444451 Hs.134812 ESTs 

H04607 Hs.9218 ESTs 

AA352026 H5.94319 VPS10 domain receptor protein 

AW854536 gb:RC3-CTD255.200 100^4^ CT0255 Homo 

AW503820 Hs.192861 Spi-B transcription factor (Spl-1/PU.1 r 

AA516049 gb3io65d01.s1 Na_CGAP„Up2 Homo sapiens 

W27501 Ks.89605 ctwUnerglc receptor, nicotinic, aiptia p 

AW245605 Hs.1 10903 daudin 5 (transmembrane protein deleted 

AA131632 Hs.1 09572 CMP^euAC:(beta}^ac8tyIoalaclos8mbilde 

BE544271 Hs.288390 tiypottiefical protein FU22795 

AL041199 Hs.1481 histidlne decarboxylase 

H12723 Hs.290791 ESTs 

AA010370 Hs.60386 nudevRNA export factor 3 

AWB1&475 Hs.7363 ESTs 

AW8ai250 Hs.148367 ESTs 

At214667 H3.283597 ESTs 

AL122071 Hs^8927 Homo sapiens mRNA; cDNAOKFZp434H1235(f 

AI69336g Hs.202274 ESTs 

AA248897 Ks.4d784 ESTs 

AL046810 Hs.20021 veside^ssodated membrane protein 1 (s 

AA765694 Hs.123296 ESTs 

AI680624 Hs.148676 ESTs 

AI2022B2 Hs.2a3382 ' ESTs 

AW807182 gb:MR4^T0062-18Q20(Mni-b04 8T0062 Homo 

AA689470 Hs.163026 ESTs 



02667 



0^77 
02681 
02661 
02693 
02703 
O2709 
O2710 
O2710 
02725 
02727 
O2730 
02733 
02733 
02734 
O2740 
O2740 
02743 
02747 
02755 
02755 
02756 
02762 
02763 
02768 
0.2775 
02775 
02778 
02778 
0.2778 
0.2784 
02785 
0.2786 
02794 
02799 
02799 
02804 
O2804 
O2809 
O2809 
02815 
02816 
02817 
O2B20 
02621 
02822 
02B27 
02828 
02837 
O2840 
02841 
02841 
02842 
02845 
02846 
028S3 
0.2857 
02857 
02863 
02868 
O2870 
02872 
02874 
02877 
02878 
O2880 
02881 
02882 
02882 
02882 
0.2883 
0.2886 
02891 
02891 
02899 
0.2900 
O2904 
02913 
02913 
02915 
02915 
02921 
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413056 


6£083031 




gb:MR(^TO265-23119&O02-«09 BT0265 Homo 


a2922 


443998 


Ai820661 


Hs^6276 


ESTs 


a2924 


424412 


H15512 


Hs.10043 


hypolheCcal proteh FU13074 


0^25 


421204 


AW081587 


Ks.165051 


ESTs 


0.29^ 


420931 


AR)44197 


Hs.100431 


smal bidudbie qrtoUno B subfamily (Cy 


0.2938 


440507 


K06994 




gb:y!81b07.r1 Sosfcs Infant brsin 1NIB H 


0.2943 


445555 


AW974013 


Hsi60B09 


ESTs 


0.2945 


438570 


AW888554 


Hs.84298 


G07A sntiQBn (invsitsnt polypsptkls of m 


0.2948 


447195 


T73745 


Hs^79870 


ESTs, Weakly similar to A46010 X-Onted 


o!2950 


423267 


AL137416 


Hs,126177 


Homo saitiens mRNA; cONA DXFZp4340192 (fr 


0.2956 


421920 


BES51245 


Hs.1438 


QflfUfnftflfnhiobu^fffc sdd (GAB/^ rocoptoTi 


0.2958 


412177 


223091 


Ks.73734 


f ilu>¥Mii niain \t f f^tafntafl 
^yUrpJIIMOIII V ^|HQIPiQ^ 


0.2959 


428042 


AA419529 


Hs.76391 


myxoviius (influsitts) resbtenop 1| homo 


0.2959 


433745 


AF075320 


K5.28980 


hypolhelicai protein FU14540 


0.2969 


417935 


R53897 


H5.170044 


ESTs 


0.2970 


420674 


NMOOOOSS KS.13Z7 


bu^fiybhoBnostofiKO 


0^2973 


413537 


ttPMSStftR 
OClWDOO 




iib*(]V4-HT0222-21 1099414>1DB HT0!222 Homo 


0.2973 


445194 


AkciODDf 




ESTs 


0.2974 


454135 




HttMRTM 
nSaC'lOfOiJ 


ESTs 


0.2976 


403418 








0.2988 


457605 


A\IRC7TTft 
nVvvt f to 


Hs^14 


solenopfotebi P, plasire, 1 


012989 


408896 






nibflfi protein 


0.2993 


448542 


DCC90I fu 


Ue 970719 


ouksiyotic translation initiaGon fisctof 


Ql2994 


417945 


R90fl79 




gb:FM01 D 22 week old human fetal Rver 


0.2994 


412518 


RPAd7fv{7 




hypothetical protein RJ 10297 


0.2996 


424566 


MlDWl 


Us 17Qn 
n8.l(i}U 


nustsar receptor subfamily 3, group m 


0.2997 


430778 


D90337 




natrtuTBOc nefittde ofecursor C 


0.3000 


451531 






ESTs 


0.3003 


444926 






ESTs. Wealdy sbnSar b CLD4_Himj CLAUD 


0.3003 


407366 






gbil^omo sapiens c^33 mf^NA, partly soqu 


0.3012 


459456 




Up 1Q019>< 


ESTs 


0.3012 


417111 


AWQIojZi 




destrin (acfin dspolymsrizing factor) 


0^3012 


452975 






Homo saptens, dono IMAGEi361171% mRNAt 


0.3012 


451959 


AAUoOZUJ 


Ue 97T17 


hypothetical proteSn FLJ20623 


0.3012 


410482 


AU/7794n7 


rlS.1 91099 


ESTs 


0.3013 


417700 






POU domain^ ddss 2i Iranscnpbon fado 


0.3018 


404414 






0.3019 


432247 




nS.lUs>ovX} 


ESTs 


0.3023 


453471 


Al ntTQQ? 




ESTs 


0.3028 


417481 


A hOMfiM 




ESTs 


0.3029 


432306 


VlftOft? 
T lOZUr 




protein phosphatase 1 , regulatoty (inhQ> 


0.3032 


448744 


A) 1^c^94 


Ue QilRQ 


ntflckfitrin honrudfiov dofnsdn-contfliiilna f 


0.3033 


429223 


dc2d4i02 


n&2Zl994 


ESTs 


a3034 


404501 


AWZ4rZ9Z 


Hs.75514 




0L3037 


408629 




U» Oil 900 

nS .04/30 


CD74 antigfin (invariant potypepfido of m 


0.3039 


438839 


A\Af9Q70v(C 


nS.iZ049U 


ESTs 


0.3039 


431848 


nW/OOOf 


Ue 19ir7Cft 


ESTs Hiflhhf simiiar to AF17S283 1 zine 

cwtO| myiHj ouiuwi iuf>r ifw<ow i cunt 


0.3042 


456373 


DE2477W 


Hs39751 


(iiBfiftfanO'^panidnQ 44onialnS) suUsnd^ A 


0^3045 


458789 


Al 4CrACO 

AL1o74oo 


Hs.o2So25 


Homo sapiens cDNA FLI20848 Ss, done AD 


0.3048 


443294 


M733625 


Ks.133053 


ESTs 


0.3050 


447023 


AA35o7o4 


LL> 474nA 

nS.17109 


tnteoral ntsmhrflna onsteln 2A 


0,3052 


458583 


A1479o46 


Hs.1o70o1 


hvDdhefkal oreteln lyl6C4170 


0,3056 


414567 


dc2d1097 


KS.1B4519 


hvmttiafical nmtalfi FLI12fi49 


0.3057 


445123 


^62911 


Hs.145369 


ESTs 


0.3064 


412682 






Ob:RC^0002-060400^12-h09 HN0002 Homo 


0I3O65 


434361 


AF129755 


H9.117772 


ESTs 


0.3071 


414028 


BE241713 




Qb:TCA/U'1E0472 Pediatric scute n^ogeno 


o!3072 


432149 


AW614326 


Hs.157022 


ESTs, Weakly ^m^arlo T34549 probable 


0.3073 


408350 


AW183350 


Hs^0127 


ESTs 


0.3074 


401042 








0*3077 


422586 


AA312704 


HSJ9457 


hvDothetical oroteln nJ22127 


0.3077 


438692 


AB007950 


Hs.63»} 


i^]AA0481 gene product 


o!3077 


447452 


BE618256 


K5.102480 


Homo sapiens, done lMAGEi3B69590, mRNA, 


a3083 


444414 


AW293214 


Hs,8752 


tran^smfvnhfRTiA nmlpbi A 


0.3085 


422373 


AK001843 


Hs.115700 




0.3088 


430410 


AR)99144 


Hs.250700 


Infnf 9ca hota 1 
UjllldSo UbUJ I 


0.3090 


419299 


AI311085 


Hs.62406 


h\fix)thfiitcnl nfniBin PLJ29S73 


0.3091 


400672 






0.3094 


444010 


AW976457 


K3^82887 


ESTs 


0.3096 




AL11B571 


H8.121S57 


coTe WAflUw dmihv fn DPI HI (MAN POI YPO 


113008 


432471 


6E244667 


H8^6155 




0.3105 


405277 








0.3106 


456765 


A1497900 


Hs^7 


ESTs 


0.3106 


452090 


AA022684 


Hs.124673 


Homo sapiens cONA RJ11477 lis. done HE 


0.3106 


426497 


AA379913 




gb£ST92B07 SUn tumor 1 Homosaptens cD 


0.3106 


406592 








0.3108 


423621 


6E002904 




gb:QV4«N0090^0400-16}«07 6N0090 Homo 


0.3107 


417919 


AI928203 


H&B6379 


ESTs 


0.3110 


414484 


BE314385 




gb.*601194649F1 NIHJ4GCJ9 Homo sapiens c 


0.3110 


457439 


AW410408 


H!k271167 




03116 


426449 


AL134009 


Hs.1699% 


Homo sapiens mRNA; cONA DKFZp568N1918 a 


03116 


419942 


U2S138 


H9.93841 


potasshm laige condudsnco cddunvacQ 


03119 


458544 


Ai831036 


^^196843 


ESTs 


03119 
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447778 
449097 
429338 
451385 
446404 
446616 
409404 
417318 
443980 
459138 
414807 
434704 
414214 
446378 



428193 

426515 

426597 

459729 

405071 

407457 

409922 

438219 

412944 

431103 

426662 

444391 

432168 

411084 

425367 

448812 

411268 

422864 

405535 

458663 

455353 

414540 

428568 

426106 

411856 

445682 

437568 

448943 

431999 

419279 

405913 

425383 

424729 

440020 

429082 

433663 

400641 

406140 

415280 

447635 

401887 

400767 

457713 

448758 

444750 

411466 

432749 

408112 

433234 

422831 

403215 

451868 

446901 

430553 

445848 

441143 

405138 

412688 

409662 

425438 

416426 

423512 

436777 

431651 

454117 

426048 

451096 

426942 



Hs.71190 
Hs.95110 
Hs.13967 

Hs^6216 

Hs^41569 

Hs.129056 

K5.12891 

Hs.29g087 

Hs.77348 
Hs.192311 
Hs.75819 
H1239760 



NM.004235 Hs.7934 
BE394222 Hs^1444 
AA382250 K&145601 
ALa3728S Ks.289848 



BE620592 

BE271708 

AW170591 

AA017656 

AA019981 

R65964 

BE220053 

AW953337 

Ai459140 

AI903291 

AI738616 

AW135276 

D49958 



AJ242724 

AW505582 

AI916151 

AA384110 

M57399 

AA879474 

AL137597 

AK000563 

T1B987 

BE2711B8 

H30775 

AWB35511 

AW860975 



Hs.130732 
H5^57194 
Hs.197143 
Hs.44 

Hs.122710 

Hs.11114 

Hs^2B05 

Hs.125472 

Hs.155975 

Hs.22140 

Hs.13256 



AV658444 


Hs.280776 


W26786 




BE379050 


H3.306969 


AC004755 


Hs.164922 


BE620016 


HS.1B2470 


H67899 


Hs.4190 


AW378397 




A1954795 


H8.156135 


A1608810 


Hs.133288 


AL133573 


Hs^2312 


AA235900 


Hs.87500 


D83407 


Hs.156007 


AF063012 


Hs.152531 


A)480204 


Hs.177131 


AL135682 


Hs^452 


AF083131 


Hs.229535 


R56473 


Hs.268715 


AI669669 


Hs.195362 


H47495 


Hs.13810 


AB016311 


Hs^1917 


AW242684 


Hs^43823 


AW847669 




NM_014438 K5.27B909 


AW451982 


Hs^46613 


AB04092B 


Hs.65366 


R02504 


H$^2943 


R85962 


Hs^1926 


AI347274 




AW392821 




AA774824 


H8.13377 


AI027604 


Ks,159850 


M86151 




AW452320 


Hs.279728 


T62216 


Hs.270840 


AA1B0256 


Hs.210473 


AW844694 


H3.306762 


AA731199 


Ks.293130 


6E250915 


Hs.266914 


BE410100 


Hs.40368 


A}768853 


Hs.134478 


BE383234 


Hs.25925 


AA393551 


Hs.97450 



EST8, Wealdy GlrnSar to'S16506 bypoM 
ESTs, Weakly sImBar to A55943 1-phospha 
ESTs. slmBar to PSM_HUMAN PROSTA 
gb:ze39h01/1 Soares reOna N2b4HR Homo 
LOCS0627 

ESTs. VMdy sbnlter to ALU8 jnJMAN ALU S 

ESTs 

ESTs 

ESTs 

gb:ROBn)2&<)8019»047 BT029 Homo saf^en 
hydroxyprostaglandoi dehydrogenaso IS^N 
ESTs 

glyooproteln M8A 
cRratB synthase 

gb:MR0^0015-160799^2-b06 CT0015 Homo 

Kftippet^factor4(gut} 

Homo sapiens. Siinilar to hypothaScel fir 

ESTs 

EST. Wealdy simflar to ALIMJIUMAN ALU SU 

gb:Homo sapfens mRHA for parQal putativ 

KiAA1575pret^ 

ESTs 

ESTs 

ptebtrophbi (heparin binding growth fac 
ESTs 

hypothetical protein dJ1181N3.1 
hypolheticai protein FU20556 
ESTs. Moderalety similar to KIAA0877 pro 
protein tyrosine phosphatase, receptor t 
BM88 antigen 

gb:QVO-LT0015.180200.127^J02 LT0015 Homo 
ESTs 

tar*yrase, TRFI-inleracOng ankyrin^eia 
gb:15d7 Human retina cOHA randomly prime 
Hbmosapiens, done MGC:10782. mRNA, com 
Homo sapiens chromosome 19, fosmid 37502 
PTD010 protein 

Homo sapiens cONA: FU2326g fis, done C 
gb:RC34fT02204)3129941 2^06 HT0220 Homo 
ESTs 
ESTs 

Homo sapiens mRHA; d3NA DKFZk)434J2235 (f 
ESTs 

Down syndrome critical region gene 1-Iik 
heart and neural crest derivaSves expre 
ESTs 

HomosapiensRiRNA1briaAA1737protBin. * 
CA1X*16 protein 



ESTs 
ESTs 



hypothetical protein II4GC15504 

KIAA0768 protein 

ESTs 

gb:iL3.CT0213-260100<)56^10 CT0213 Homo 

interleukin 1, ete 

ESTs 

KIAA1495 protein 
ESTs 

ESTs. Weakly stmBar to 138022 hypdheti 
gb:lc05d02j(1 NCt.CGAP.Oo16 Homo saptens 
gb£M4«T027542129»)53^ 8T0275 Homo 
Homo saptens done 23649 and 23765 unkno 
ESTs 

gb:EST02679 HIppocamptis. Strategene (cat 

ESTs 

ESTs 

Homo sapiens cONA FU14B72 fis. dbno PL 
Homo sapiens cONA: FU21391 fis. done C 
ESTs 

hypolheScal protein FLJ10355 
adaptor-felated protein complex 1, sigma 
ESTs 

Homo saptens, done MGC15393, mRNA. com 
ESTs 



a3121 

0.31S 

0.3125 

a312S 

a3130 

03132 

0.3135 

03139 

03140 

03142 

0.3143 

03143 

03145 

0^145 

0.3146 

0.3148 

0.3150 

03153 

0.3157 

O3160 

03162 

0.3172 

03173 

0.3175 

0.3178 

0.3178 

0.3179 

0.3181 

03183 

03185 

0.3188 

03189 

O3190 

03195 

03195 

0.3195 

0.3195 

03195 

0.3196 

O3202 

O3205 

O3205 

O3205 

O3207 

0.3208 

0.3209 

0.3212 

03212 

0.3213 

03215 

0.3215 

03216 

03216 

03217 

03217 

03217 

03221 

03221 

0.3222 

0.3223 

03226 

03231 

0^ 

0.3231 

0.3234 

03236 

03236 

03242 

03254 

03257 

0.3257 

0.3262 

03262 

03262 

03263 

03253 

03264 

0.3267 

03287 

03268 

0.3269 

O3270 

03271 
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454947 


AWB46590 




413814 


BE1698S2 




422818 


AA4O4290 


HS37848 


423634 


AW959908 


Hs.1690 


414002 


NM.006732 Ks.75678 


452164 


AI883171 




458477 


NVL000314 H8.10712 


433197 


AB040B89 


Hs^81022 


405701 






437782 


AI370876 


Hs.79090 


459001 


AI761313 


Hs.204605 


422783 


AA598956 


Hs.120439 


417036 


AF039918 


HS.B0975 


456041 


BE27079S 


Hs.268864 


423310 


AA325225 


Hs.124023 


427530 


AA405093 


Hs.126519 


420172 


AAG01122 


Hs.95655 


445610 


AI83164d 


Hs.143993 


411328 


AW837083 





gb:QVD^18(M}1109&O2&<»]7Cr0180HQmo 0.3275 

gb:PM1-HT0527.2902004X)&^HT0527KOfno a3275 

ESTs a3277 

heparfivtindbtg grow^ factor Unding pr 0.3278 

FBJnurine osteosarcoma vird oncogene h 0.3278 

gb:tz44b02jt1 Na.CGAP^S2 Homosapien a327g 

phosphatase and tsnsln homolog (mutated a3279 

KIAA1456prDtek) 0.3280 

0.3282 

expOTfti 1 (CRM1, yeast homolog} 0.3284 

ESTs 0L3288 

ethanoiandna kinase 0.3289 

ectonudeosicte triphosphate diphosphohyd a3290 

ESTs 0.3295 

Homo sapiens cONA FLJ14218 fis, done NT 0.3296 

ESTs 0.3296 

secietsd and transmamtvana 1 0.3297 

ESTs a3297 

gtxQVUT0037.1S0200^08LT0037Homo a3300 



Pkey: Uidqua Eos probeset Idemiiier numt)er 

CATnunter: Gene duster nunto 
Accession: GenlianitBocessIonnumbere 



Pley CATnumtnr Accession 



409921 
410826 
410845 
410950 
411288 
411319 
411328 
411466 
411514 
411620 
411880 
412474 
412506 
412682 
412868 

413056 
413537 
413541 
413814 
414026 
414193 
414349 
414484 
414539 
415154 
415160 
416035 
417945 
419850 
422909 
422954 
423621 
424648 
426132 
426497 
430553 
431822 
434098 
437483 
440507 
441042 
441083 
445682 
446294 
446901 
447787 
451385 
452164 
454186 



1159516. 

1212621_- 

1223881J 

1227728J 

1237709 

1238595 

1238987. 

1246771J 

t248638J 

1252014. 

12631 10J 

129869J 

1301336J 

1321572.1 

1334784. 

1347545. 

137544L 

1375499J 

1391574 

1411109. 

1424706 

1437515 

1452830. 

1460320 

1525577J 

1525766 

1557254. 

1711126. 

188485.1 

222858.1 

223239J 

230314.1 

241947.1 

261431.1 

268121.1 

319868.1 

338082.1 

380006.1 

43756.1 

495677.1 

50823.1 

50904.1 

647580.1 

670076.1 

697809.1 

73719J 

86787.1 

902091.1 

1049791.1 



AW6OO239AW60Q255AW505332 
BE407727 

AVrai1W3AVTO11^ 

AW835511 AW835517AW835513 

BE537094AW836542 

AW837063AW935882AW935957 

AW847689 AW847667 BE145799 

AW850178AW85Q233AW8S0445AW850446 

AW854538AW854417AW854495AW854355 

AW872477BE088101T05990 

AI791451 AI791288BE019234 BE296601 AA111939 

AW957159 H09937T75143 

AW983772 AW983730 AW983769 AW983836 AW983835 AW983837 

M86151 BE061884 BE061883 BE061898 BE061B82 BE061887 BE061891 6E061690 BE061696 BE061893BE061895 BE061894 BE0618B5 BE007474 
BE007481 BE007653 

BE063031 BE063002 BE063008 BE063024 6E063040 6E063006 BE063072 
BE146866 BE146865 6E146867 

BE147036 BE146951 BE146958 6E146966 BE146976 6E146g55 

BE169692BE169421 

BE241713BE241912 

6E260069 

BE512g68 

6E314385 

BE379046BE395459 
D63175 D78984 D63269 
T82B02D78670 R08505 
H42314 H43080 H45217 H15384 

R29072 R29717 R29699 R29709 R29751 R29609 R29060 R29718 R29057 1^91 R29683 R29575 R28913 R28910 
F06844F06845 Z45488 AW748501 AW748591 AW752D21 AW748545AW853362AW653363AW853427AA2512S3 
AA533356 AW468427 R67736 AA778031 AA614088 AI823404AA318991 AA72(S88 
AW998605 AW993131 BE514709AA319445 
BE002904 H64680AA328679 
AA344576 AA732430 AA344601 
AA370501 AW962784AA370727 
AA379913 AA379981 AW963S23 

AW392821 AW392809AW84325BAW843049AW503156BE165656AWB21728 

AA516049AW004922 

AA625499 AA62S289 AA625184 

AL390174AW898817 

H06994BE147898 

AA077738 AA07B505 BES62497 Z17859 
BE562611 AA43S054 

AW378397AW378390AW378358AI247957 
AI2B4935 AW409822 BE408182 
AI347274AW844024 

6E620108 BE312062 AW896316 BE262548 
AA01 7656 AA01 7374 AA019761 
AI863171 BE047919 

5S«?!! F^^^^ BEI41753 BE141024 BE141761 AW177583 AW177579 AW177S82 AW177585AW1 77587 AWB07582 AW177581 

52«i!I SlIIH^^*^^ BE141492 BE141028 BE141775 BE141489 BE141751 AW177699 BE141750 AW177697 BE141512 BE141460 
OC141749 AW1 77598 
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454790 1234752.1 

454947 1245953 1 

455353 12842B9 1 

457115 286601.1 

459138 918860J 

459233 944881.1 



AW820852 AW820773 AW821088 

AW846590 AW84661 5 AW8465B4 AW846592 AW846621 AW84e810 
W26786 AW998S12 AW902Z72 
AA420712 AA469165 AA420737 

AI903291Ara03455AI903367AB03403AI903447AI903405AB03364AI903229Al90^ 
AIS39966 AI939988AI939951 AI939981 AI939976AI9399S9 



T^ac 



Pkoy: 
Ref: 

Stran± 
NLposHbn: 



Unique number conesponding to an Eos probeset 
Sequence 80UTO-Th87dlgftnumbeiB in Ihlscota 

sequence of human chranosoms 22.* Dunham Letal. Nature (1999) 402:489495. «««w«wpio««ronenoneg ineUNA 

Indicates DNAstFBnd finom which exons were predl^ 
Indicates nucleoQde positions or predicted exons. 



PIcey 


Rsf 


Strand 


400514 


9796594 


Minus 


400641 


8117693 


Rus 


400672 


8118724 


{ySnus 


400767 


8131627 


(hfinus 


400865 


1945037 


bfinus 


401024 


8117489 


Plus 


401042 


8117611 


Plus 


401371 


9650602 


Rus 


401381 


8570226 


fyfinus 


401465 


6682292 


Rus 


401521 


7705251 


Plus 


401753 


9838183 


Minus 


401776 


9966323 


Plus 


4018B7 


7229981 


Plus 


401974 


3126777 


Pius 


402076 


8117410 


Pbis 


402162 


8575917 


Minus 


402425 


9796347 


Mini^ 


402610 


9926549 


Minus 


402695 


8569871 


Minus 


403047 


3540153 


Minus 


403215 


7630945 


Minus 


403418 


6862692 


Minus 


403548 


8081591 


Minus 


403957 


8076835 


Minus 


404070 


2996642 


Rus 


404246 


7406725 


Rus 


404414 


7382165 


Rus 


404605 


9212S66 


Rus 


404638 


9795751 


Minus 


404696 


9800109 


Minus 


404767 


7882827 


Minus 


404769 


6099713 


Minus 


404927 


7342002 


Rus 


404958 


7407941 


Minus 


405071 


7708797 


Minus 


405073 


7769921 


Plus 


405138 


8576241 


Rus 


405277 


3980473 


Ptus 


405282 


3810573 


Minus 


405334 


3135285 


Rus 


405364 


2261075 


Minus 


405385 


6552772 


Rus 


405535 


9795658 


Rus 


405610 


5757653 


Minus 


405654 


4895155 


Minus 


405691 


4508112 


Phis 


405701 


4263751 


Plus 


405880 


6758747 


Minus 


405913 


7712139 


Minus 


406140 


9168231 


Minus 


406592 


4567182 


Rus 



NLposiOon 

78844-79025.8085^80991.89754^941,93750-93891 

47864992 

148067-148503 

80531-80629.82169^278 

4448^45526 

60551-60802 

151364-151606 

80901-81283 

118629-119146.119392-119657 

25676-2S80D 

9127-9234 

155287-155529,159719.159997 

115535-115743,117746-117839^120290.120455 

93973-94120 

85330^5683 

128316.128627 



50224-50395 

22955-23124 

159927-160055 

5979^^68 

177270-177971 

176202-176395 

38760-39352 

81649^1754 

7210-7414.10043-10195 

82477-82628,B2721^17,82910-83071,8314»«3387 

143127-143398 

125032-125291 

99433-99528,100035-100161 

60037-60144.62675^81 

23244-23759 

176801-176823 



2731-4531 

11115-11552 

3141»^1774 

90303.90516 

23471-23572 

10482-10689 

139386.139856 

4832&48491.4913M9252 

4833248454 

63384-63545 

71907-72080 

63624-53759 

171350-171739 

93243-93364 

5567a^287 

7484.7678 

49887-50219 



IS^^^fn «I>Jfgidated hcojon compared to nom»aI aduil tissues and to no^mangnant colon tissues that are likely to encode protdns amenable to 

S?^)^ sman molaades, peptides, or anObodles. These were selected ftom the starting collection of 59880 protiesets on (he Affwnetrfx/E^^SD® Wtows* 

Sl^^^iSi lissues.L-averi9' cJon 

stgnai. or 01 oetng moduiaiaue by smaO molecules. pepOdes, or antiwdies (e.o. pMnase. deatlHtomaln. Vm, phosphatase. ortorLbansporter}. In older to remove gena-specaic 
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tacksroundtevebd^ 
ino raw was flvaniabd. 

TABU 1^ genes up-dated In colon cancer ocmipared to non^ 

Ptey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number. Genbank accession mrniber 

Unigeneia Unigene number 

Unigene TUIe: Unlgene gene one 

R1: 90^ percentile of tumor samdes divided bv the 70* oeffifiniilairf 





numerator and denominator 


PltBy 


ExAocn 


UnlgenelD i 


436749 


AA584890 


Hs.5302 1 


4O66S0 


M29540 


H3^29 1 


406667 


M12523 




414386 


XD0442 


Hs.75990 1 


428934 


AF039401 


Ks.194659 1 


416768 


AA363733 


H5.1032 1 


446787 


U67167 


H3.315 ] 


431912 


AI660552 


1^.356183 1 


437935 


AW939591 


Ks.5940 1 


407242 


Ml 8728 




423541 


AA296922 


Hs.1 29778 


441031 


All 10684 


Hs.7645 1 


406685 


M1B728 




422578 


AF239666 


Hs.1545 i 


432542 


AW083920 


H3.1609B 1 


421341 


AJ243212 


Hs.374281 


453863 


XD2544 


H5.572 


421582 


Aigi0275 


Hs.350470 


436217 


T53925 


Hs.107 


422260 


AA315993 


Hs.105484 


416888 


AU076801 


Hs.89436 


407243 


AA058357 


H8.74466 


424212 


NMJ005814 


H&143131 


414463 


T69078 


HS76177 


407007 


U22961 


Hs.184411 


413719 


BE439580 


Hs.75498 


450685 


L15533 


Hs,423 


418007 


M13509 


HS.B3169 


423673 


BE003054 


Hs.1695 


423371 


AU076819 


Hs.1650 


421964 


X73079 


iis.288579 


447400 


AK000322 


Hs.18457 


421100 


AW351839 


Hs.124660 


406741 


AA058357 


Hs.74466 1 


427583 


M82962 


H8.179704 


422281 


M36803 


H8,346935 


406687 


M31126 


Hs.352054 


409153 


W03754 


HS.S0813 


424687 


J05070 


Hs.151738 i 


422664 


AA315933 


Hs.120879 1 


452304 


AA02S386 


Hs.61311 


430569 


AF2412S4 


H3.178098 


413881 


U0190 


Hs.75599 1 


408399 






422424 


AI186431 


Ks^6638 i 


428470 


AOQ02301 


Hs.184507 


417931 


W95642 


Ks.82961 


435538 


AB011540 


Ks.4930 


430272 


X04898 


Hs^7658 


451917 


AW391351 


H8^082D 


421907 


BE018556 


Hs.109358 


452316 


AA298484 


Hs.61265 


452594 


AU076405 


Hs^981 


424326 


NIU)14479 


Hs.145298 


443426 


AF0981S8 


Hs.9329 


452194 


AI694413 


Hs^73599 


411975 


AI916058 


Hs.144583 


408243 


Y00787 


H5.624 


422310 


AA316622 


Ks.98370 


431330 


X69532 


H5^777 


420344 


BE463721 


Hs^7101 


422330 


D30783 


H3.115263 


412104 


AW205197 


Hs^40951 


451035 


AU076785 


Hs.430 


428753 


AW933252 


Hs.192927 


430677 


Z26317 


Hs^9784 


422487 


AJ010901 


Hs.198267 


444381 


BE387335 


Hs^83713 


409632 


W74001 


Hs^279 



Unigene TlSe 

NM_008149:Honio sapiens tecBn, galactosi 
(locuslin)(}Ntit004383:Homo sapiens carcin 

NM.0051 43:Homo sapiens liaptoglobin (HP). 
NM.00128S:Komo sapiens chloride channel, 
NM.006507:Honio sapiens negeneraUng isle 
NM.0024S7:Homo sapiens mudn 2^ intestin 
Hs358183:ESTs. Weakly similar to S3B4.H 
NM.033049:Komo sapiens mucin 13, epllhel 
(locusflnk)NM_0024B3:Homo sapiens carcin 
NM.014471:Komo sapiens serine protease 1 
NM.005141:Komo sapiens fibrinogen, B bet 
(tocus0nk}Nhl.002483:Homo sapiens carcin 
NM.001804:Komo sapiens caudal type homeo 
NM.020384:Komo sapiens daudin 2 <CIDN2) 
NM.007329:Homo sapiens deleted In m^tgn 
Hs.572:orosomucoid 1 
NM.003225:Homo sapiens trefoil tactor 1 
NM»004467:Homo sapiens fibrinogen4Ike 1 
NM32044:Homo sapiens regenerating gene 
NM_004063:Homo sapiens cadherin 17. U c 
()ocusiink)NN1.006890:Homo sapiens carcin 
NM_005814:HQmo sapiens glycoprotein A33 
NM.001633:Homo sapiens alpha-l-nteglob 
NM_000477:Homo sapiens albumin (At^, mR 
NM-004591:Homo sapiens anaQ inducible c 
NM-138938:IHomo sapiens pancreallti5<8SS0 
NM-002421:Homo sapiens matrix metaHopro 
NM.002426:Homo sapiens matrix meteflopro 
NM_0001 1 1:Komo sapiens sohtte eanfer fa 
NM.002644:Komo sapiens polymeric imnvnog 
NM1 7763:Komo sapiens hypothetical prot 
Hs.124660£STs, Moderately simOar to 21 
(locusllnlONMJ)06880:Homo sapiens caicin 
NM.005568:Honn sapiens meprin A, alpha ( 
NM«000613:Hon» sapiens hemopexin (HPJQ, 
H5.352054:pregnancy specific beta-1-giyc 
NMJ)17625:Homo sapiens intelectin (ITIN) 
NM.004994iiomD sapiens matrix m^aDopro 
Ks.120879:Honio sapiens, done MGC:32871 
Hs.6131 1£STs. WeaJdy sln^ to S10S90 
NM.021804:Honio sapiens angiotensin I con 
(locuslInl<)NMj000488:Homo sapiens serine 



Hs.184507:Komo sapiens, similar to Homd 
H3.82981:Homo sapiens, clone MGC:22588 1 
Hs.4930:low density lipoprotein receptor 
HsuQ7658:8polipoprolein A-ll 
Hs.S082Q:hypofhetical canfiac/skeletal m 
Ks.1093S8:ATPase. Qass V, type 10B 
NM-138805;Homo sapiens fiamity with seque 
Hs^981xotute canfer family 26 (sulfa 
NM-014479i{omo sapiens ADAM#e. decyd 
(Iocustink)NM_012112:Homo sapiens chromo 
Hs^3599£ST 

Hs.144583:Homo sapiens, done iMAGE:3462 
NKL000584:Homo sapiens interfeukin 8 (II 
(locusnnk)N»^030622:Homo sapiens cytoch 
NM.002215:Homo sapiens inter-alpha (gtob 
NM.014373:Homo s^ns putaOve G pn^ 
NM-001432:Homo sapiens epireguOn (EREG) 
Oocuslink)NM_.033120:Homo8dptens naked 
NM-002670:Homo sapiens plasGn 1 (I isof 
KMJM772&:Homo sapiens protein phosphata 
NM.001943:Honio sapiens desmogteln 2 (0S6 
NM-018406:Homo sapiens mudn 4, tradwob 
NM.13845SMimo sapbns coll^ triple h 
NM.0O2639M)mo sapiens serine (or cystd 



R1 

29.34 

25.56 

20.28 

18.84 

17.38 

16.99 

16.61 

16.42 

15.92 

15.84 

15.59 

15.02 

14.54 

13.68 

13.23 

13.21 

13.06 

1Z35 

12.11 

11.99 

11.87 

1U1 

11.27 

11.18 

10.82 

10.73 

10.57 

10.39 

10.10 

9.91 

9.68 

9.44 

9.38 

9.34 

9.18 

9.06 

9.02 

8.89 

8.53 

&23 

aio 

8.05 
7.96 
7.73 
7.71 
7>I3 
7.40 
7.29 
7.25 
7.21 
7.19 
7.18 
7.03 
7.00 
6.92 
&88 
&76 
6.59 
&S5 
6.53 
6.49 
6.33 

aai 

&30 
&29 
&28 
&27 
6.26 
&23 
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417491 AW376842 Hs.1085 KM.004963:Homo sapiens guanytala cyclase &23 

413936 AF113S76 Hs^TBBI NM-000295:Homo sapiens serine (or cysts) 6.23 

422627 BE336857 Hs.1187a7 H&118787.-transfoni«ig growth factor, be 6.19 

^ 411B25 AKD00334 KsJ52415 NM-017767:Homo sapiens soWe canter fa 6.17 

5 44G921 AB012113 Hs.16530 NAL0()2388:Hoino sapiens small Induclbiec 6.17 

415214 AI445236 • Hs.125124 NML004442:HomO8apiensEphB2{EPHB2).lr 6.17 

414816 Y13709 Hs.77399 NIil.001265:Homo sapiens caudd) type homeo 6.16 

422106 084239 Hs,111732 NM.003890:Hofno sapiens IgGFctMngpr &14 

423803 NM.005709 Hs.132945 (tacusDnl()NM_0(K709:Homo sapiens P02.73 6l13 

10 452281 T93500 Hs.28792 Hs^792:Homo sapiens cONA aJ11041 fis. 6.09 

447342 AI199268 Hs.19322 Hs.19322:Hon»&aplettS. Similar to RIKEN 6.02 

443957 AAS21049 Hs.353013 Hs.353013xhromosoim 20 open reading lia 5.95 

403220 5.90 

408908 BE296227 Hs.250822 (tocusEnk}NM.003158:Homo sapiens serine &88 

15 449722 BE280074 Hs.23960 Hs.23960KycIin 81 R87 

425976 C75094 Hs.334514 NM-025257:l^on» sapiens chnmosoma 6 open 5.79 

414617 AI339S20 Hs.288817 (locusnn](}N1^025130:Komo sapiens hypoth &79 

408983 NM.000492 Ks.663 NM.000492:KofTK> sapiens cystic fibrosb I 5.77 

423445 NM_014324 Hs.128749 NM_01 4324:Homo sapiens alplwnelhylacyl- 5.77 

20 421379 Y15221 ^103982 NM.005409:Ho(no sapiens small inducible c 5.76 

431301 AA502384 Hs.l51529 Hs.l51529:EST8 &71 

418318 U47732 Hs.84072 NM.004616:HofTK> sapiens transmembrane 4 8 5.71 

415992 C05837 HsJ45807 Hs.145807:hypcthe8cai protein FU 13593 5.68 

436972 AA284679 H3.25640 Hs.25640xlaudln 3 5.66 

25 414987 AA524394 Hs.294022 NML032865:Homo sapiens hypolheilcalprot 5.61 

431657 AI345227 Hs.105448 Hs.105448:prote!n kinase, lysine defide 5.57 

424273 W40460 Ks.144442 NM.003561:Homo sapiens phosplwl!p3seA2. 5.55 

413916 N49813 Hs.75615 N^^0004B3:Ho^w sapiens apoilpoprotein C- 5.54 

409757 NW>001898 Hs.123114 NIUL001 898:Homo sapiens cyslafin SN(CST1 5.53 

30 430204 AA618335 Hs.356664 Hs.356664:hypotheticai protein FU32334 5.50 

425227 U67058 Hs.1 54299 {iocuslinlQNM_00524iHomo sapiens coagu! 5.38 

420542 NM.000505 H5.1321 NM.000505:Honu> sapiens coagulation facto &33 

414809 AI434699 H&77356 Hs.77356dr8nsferrin receptor (pgo, C071 &32 

414639 X67055 Hs.76716 NM-002217:HoiTU) sapiens pre^{g1obul 5.32 

35 410418 D31382 Hs.63325 NM.019894:Homo sapiens transmembrane pro 528 

414590 NM-000506 Hs.76530 NM-000S0S:Homo sapiens coagulation facto &28 

444151 AW972917 Hs.128749 (locusQni()NM_014324:Homo sapiens fiipha- 5i7 

438746 AI885815 Hs.184727 H8.184727£ST8.Weald|y8lniilartoT45738 5.26 

406704 AA056635 Hs.5366 NR1.13g053:Homo sapiens eptdermal growth 5.25 

40 414798 Ai286323 Hs.97411 Hs.97411;hypothe8cai protein MGC12335 6.25 

436251 BE515065 Hs.296S85 (!ocusnnk)NM-00&392:Homo sapiens nucteo S.2S 

414753 AF158255 Hs.77225 NML006437;Homo sapiens ADP-rfbosytbansf 5.23 

428970 BE276891 Hs.194691 NM.003979->lofno8ai^reQm)1cacldlnd &22 

443991 NM.0022S0 Hs.10082 NH.002H):Homo8a{fens potash intam» 5.20 

45 432978 AF126743 Hs.279884 NM.013238:Honio sapiens DNAJdomaifrConta 5.18 

425834 NW.001639 H8.1957 Hs.1957:amylold P component, serum 5,13 

432179 X75208 Hs.2913 NVL004443:Homo sapiens EphB3(EPHB3),mR 5.12 

408482 NM00876 Hs.45743 NM.000676:Komo sapiens adenosine A2biec 6.11 

430135 Nf^000035 Hs.234234 NM,000035:Homo sapiens aldolase B,lhicl 5.08 

50 426174 AA547959 Hs.115838 Hs,115838:EST8 5.07 

403218 5.07 

411142 NM14256 Hs.69009 NM.014256:Homo sapiens UDP-GlcNAKbelaGa aOO 

4490Z7 AJ271216 Hs.22880 N)l^005700;Homo saddens dIpepUdylpepOda 4.98 

433083 AL042759 Hs.191762 Hs.1 91762-Jvpothetical protein MGC20258 496 

55 431779 AW971178 H5.268571 (locu8llnk)NM_001 64S:Homo sapiens apollp 492 

421408 AI688223 Hs.91096 NM.052ai6:Homo sapiens trtparflte motif- 4.91 

430603 AA14ai64 Hs.247280 H3.247280xhromosome 20 open reading fra 4.91 

422867 L32137 H3.1584 Hs.1584:cartiiageollgomerlc matrix prot 4.90 

422539 AJ009936 Hs.118138 NM-033013:Homo sapiens nuctear receptor 4.89 

60 424010 AtJ080188 Hs.137556 NM_033100:Komo8aptens MT-protocadheiin 4.68 

428953 AA308610 Hs.348183 NI^003823:Homo6aiitens tumor necrosis fa 4.86 

457001 J03258 Hs.2062 Hs.20B2Mt8mlnD(1,25.(fihydiDKyvitami 4.83 

425983 AK000226 Ks.l65619 NML031265iiomo sapiens mucin and cadheri 4.81 

428289 M26301 Hs.2253 Hs.2253xomplemenl component 2 4.79 

65 418322 AA2B4166 H8.84113 NM_005192:Homo sapiens cydlrHtependenl 4.78 

409889 AVU830041 Hs.56S37 NM.021978:Homo sapiens suppresdon of bi 4.77 

447472 AW207347 Hs.211101 Hs211101£ST8 4.74 

423164 AK000232 Hs.124835 NM.019062:Komo sapiens hypotheOcal prot 4.72 

429345 R11141 Ks.196695 Ks.199695:hypoth8tica] protein MAC30 4.72 

70 430680 AW138724 Hs.168974 Ks.168974:EST8 4.69 

414998 NIUL0Q2543 Hs.77729 NM.Q02543:Homo sapiens oiddlsed low dens 4.69 

417165 RB0137 Hs.302738 Hs.302738:Homo sapiens cONA:FU21425 6 4.67 

403221 4*65 

415000 AW02S529 Hs.239812 Hs.239812:8eroiogicaI]y defined breast c 4.65 

75 433437 U20636 H3.3280 NhL001226:Honio sapiens caspase 6, apopto 464 

414052 AW578849 Hs.283552 Hs.283552:hypotheSc8l protein 80016153 464 

406673 M34998 Hs.198253 Hs.198^:m£^rhistocompaiibil]tycampi 464 

418203 X54942 Hs.83758 NI\1.001827:Homo8aptensCOC28 protein tdn 460 

422714 AB016335 Hs.119387 NM_014698:Homo sapiens KIAA0702 gene pro 4.60 

80 410639 BE269047 Hs.65234 (locusOnk)NM_01789S:Homo sapiens OEADfH 460 

411393 AW797437 Ks.69771 NMJI)0171 OrHomo sapiens ^factor, properd 459 

431193 AW749S05 Hs.2g6770 H8.2g6770daAA171 9 protein 457 

428450 NM_014791 Hs.184339 rflyL014791:Hoino sapiens matBind embryo^ 4.56 
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42SB73 


NM_013390 HS.1B0417 


Hs.1 504 17:trBnsiiicnibr8n6 protein 2 


4.56 


422765 


AW409701 


Hs.1578 


NM.001 1 68:Homo safdens bacutovirsi lAP r 


455 


445109 


AF03^16 


Hs.12330 


NM.001247:Homo seplens ectonudeosMo tr 


4.55 


422S35 


AA311d14 


Hs.154578 


H5.154578:HofflO sapiens (nRNAfbrFU0025S 


4.S5 


422609 


Z4G023 


Hs.118721 


NML000434itomo sapiens sidlktee 1 (lyso 


4.54 


431548 


A1834273 


Hs.9711 


NM_D1751 5:Homo sapiens novel protein (HS 


4.53 


429271 


AF039850 


H5.198515 


NM_005224:Homo sapiens dead clngeMike 


4.53 


432269 


NML002447 Hs^2 


Hs.29423nacrophage sQmulating 1 recepto 


4.53 


408194 


AA601038 


Hs.191797 


H3.191797;ESTs 


4.52 


439580 


AFDB6401 


Hs.293847 


H3.293847:EST5 


4.60 


443464 


BE548446 


H3.321579 


NM_021095:Hcmo sapiens sohite canter fa 


4.49 


420981 


L40S04 


Hs.100724 


NM_005037:Homo sapiens pensdsome pTOlif 


4.48 


432378 


A)493046 


Hs.146133 


Hs.146133:EST5 


4.48 


431958 


X63829 


Hs^877 


NM-001793:Homo sapiens cadherin 3, trpe 


4.47 


415099 


A1492170 


Hs.77917 


NML006002:Homo sapiens ubiqiifin cartoxy 


4.47 


437009 


AF1Z7026 


Hs.5394 


NM.005379:Homo sapiens myosin lA (MY01 A) 


4.47 


422511 


AU076442 


Hs.117938 


NM.000494:Homo sapiens coHagen. type XV 


4.46 


436469 


AK0D1455 


Hs^198 


Hs.5198:Down syndrome criSca) region ge 


4.46 


445417 


AK001058 


Hs.12680 , 


Hs.1 268Q:Homo saddens cONA aJ10196 lis. 


4.44 


428024 


Z29067 


Hs2238 


Ks.2238:NIMA (never In mitosis gene a}-r 


4.44 


41S474 


NM.0142S2 Hs.78457 


NMJ014252HQino sapiens solute canter fa 


4.43 


441384 


AA447849 


H8.268660 


Hs.288660'i{omo saptens cDNA* FU22182 fl 


4.43 


428479 


Y00272 


Hs.334562 


NM_00l786:Homo sapiens cell division eye 


4.40 


407944 


R34008 


Hs.239727 


NM.024422:Homo sapiens desmocollin 2 (DS 


4.38 


414108 


Ai267592 


K8.75761 


NM-003137:Homo saptens SFRS pnsteln idna 


4.37 


447320 


A1675419 


Hs.164464 


H&164464:Homo saptens, done iyiGC:236S6 


4.36 


410850 


AW382867 


Hs.302738 


Hs.302738:Homo saddens cONA: FU21425 fi 


4.36 


439453 


BE264974 


HS.6S66 


Hs.6566:tt)yroid honnone receptor biterao * 


4.35 


409231 


AA446644 


Hs.692 


N^002354:Hamo sapiens tumor-assodated 


4.33 


432575 


AA553722 


Hs.194346 


Hs.194346:Splr-2 protein 


4.33 


427747 


AW411425 


Hs.ie0655 


(tocusilnl()NI004217:Homo saptens serine 


4.33 


439963 


AW247529 


Ks.6793 


Hs£793:platetet-ac6va6ng fiactor acely 


4.32 


418245 


AA088767 


KS.B3883 


NM.02D182i1omo sapiens (rensmembrane. pr 


4.32 


428407 


NM.003963 Hs.184194 


NM.003963itoo sapiens transmembrane 4 s 


4.30 


424825 


AF207G69 


Hs.153357 


NM.001064:Homo sapiens proooltagen^ln 


4.30 


429833 


NM_012079 Hs^88627 


NM-012079:Homo sapiens diacylglycerol 0- 


4.30 


411257 


AA628967 


Hs.1 15274 


Hs.1 1S274Man hedgehog homolog (Oroso 


4.30 


413219 


AA878200 


Hs.1 18727 


H3.1 18727i{omo saptens cONA FLI33803 fis 


4.29 


408847 


AW290997 


Hs,190153 


Ks.ig0153:Homo saptens cONA FU33988fis 


4.29 


422163 


AF027208 


Hs.1 12360 


Hs.112360n}romintn4iKe 1 (mouse) 


4.28 


425206 


NM^002153 Hs.155109 


NM_002153:Homo sapiens hydraj^ysten^ (1 


4.28 


441085 


AW136551 


Ks.181245 


Hs.181245*ibmo sapiens cONA FU12532 fis 


4.27 


439975 


AW328081 


Hs.6817 


NM-033453:Homo sapiens inoslne ^hosph 


4.27 


414361 


AI086138 


Hs.204044 


Hs.204044:ESTs 


4.26 


417115 


AW952792 


Hs.334612 


NM.003094:Homo saptens small nuctoar rib 


4.26 


415927 


AL120168 


H3.78919 


NM-021083:Hamo sapiens KeH blood group 


4.24 


409012 


AL117435 


Ks.49725 


Hs.49725:DKFZP434l216 protein 


4.22 


420039 


m^004605 Hs^76147 


Hs.3761 47:Homa saptens cDNA FLI39099 fis 


4.20 


446051 


6E048061 


Hs.37054 


Hs.37054»phriniA3 


4.19 


421506 


BE302796 


Hs.105097 


Ks.105097:thymldine idnase 1, soluble 


4.18 


407811 


AW190902 


Hs.40098 


Hs.40098xystBlne knot superMy 1, BM 


4.18 


418054 


NM.002318 Hs.83354 


NM.002318;Honio saptens lysyt oxldase^k 


4.18 


409142 


AL136877 


H&5075B 


H8^758:SM04 stnidural matetenance of 


4.18 


419508 


AW997938 


Hs.90786 


NMJ0O3786:Homo saptens ATMilndlng casse 


4.17 


426761 


A1015709 


K5.172089 


Hs.172089^)r(Hmoosis receptor indudng 


4.17 


408113 


TB2427 


Ks.194101 


H$.194101:Honio saptens cDNA: FU20869 fi 


4.16 


425743 


BE396495 


HS.1S9428 


NM-138761:Komo sapiens BCU-assocteted X 


4.15 


4537S1 


R36762 


Hs.101282 


Hs.101282:Homo sai^ns mRNA; cDNA DKFZp4 


4.15 


435327 


6ES01871 


Hs.4867 


Hs.4B67»nannosyl (aIpha>1,3-}^lyooprote 


4.14 


403219 








4.14 


407103 


AM24881 


Hs^56301 


Hs.2S6301iiypo8ietteaI protein MGC13170 


4.14 


411126 


NM.001202 Hs.68879 


(tocusink)NMj001202'.Homo sapiens bone m 


4.14 


424837* 


BE276113 


Hs.333034 


NM.003491:Homo sapiens AR01 homolog, N-a 


4.13 


409956 


AW103364 


H5.727 


NM.0021 92:Homo sapiens InhiUn. beta A ( 


4.12 


439659 


AW970780 


Hs.69483 


Ks.59483:teuclne4teh repealKXintainlng 


4.12 


425397 


J04088 


Hs,156346 


NM.001067:Homo saptens topdsamerase (DN 


4.12 


413753 


U17760 


H3.75517 


NM.000228:Komo saptens laminin, beta 3 ( 


4.11 


428698 


AA852773 


Hs.334838 


H3.334838:KIAA1666 protein 


4.10 


427557 


NM.002659 H3,179857 


NM.0026S9:Homo saptens plasminogen acQv 


4.09 


441623 


AA315805 


Hs.348710 


Hs.348710:Homo saptens, done l(iMGE:4242 


4.07 


417866 


AW067903 


Hs.82772 


Hs.82772coaagen, type XI. alpha 1 


4.07 


442013 


AA506476 


Hs.375009 


H5.375009:Homo sapiens mRNA;cDNADKFZp6 - 


4.06 


425247 


NM-005940 Hs.155324 


Hs.155324:malit( metalloproteinase 11 (s 


4.06 


428385 


AF112213 


Hs.184062 


Hs.184062:chTOmosome 20 open reading fra 


4.05 


412612 


Nfi/L000047 Hs.74131 


NM-000047:Homo saptens arytsultetase E ( 


4.04 


425280 


U31519 


Hs.1872 


NM.002591:Homo sapiens phosphoenolpyniva 


4.03 


411263 


BE297802 


Hs.69380 


NM-006845:Homo sapiens Unesin^ke 6 (m 


4.03 


452017 


AF1093a2 


H5.27495 


Hs.27495:prostate canc^ assodated prot 


4.03 


452721 


AJ^29 


Hs.301871 


Hs.301871»^ canter family 37 (gtyc 


4.02 


450737 


AW007152 


Ks.63325 


H3.63^5:(ransmemt}rBne protease, serine 


4.01 


456906 


AF117646 


H8.1S6637 


NM.0121 16:Homo sapiens Cas-Br-M (murine) 


4.01 


412974 


R1697B 


Hs.75105 


NM_006579:Homo s{q)lens emopamO binding 


4.01 


426108 


AA622037 


Hs.166468 


NM.004708:Homo s^itens pnipanmed oeO d 


4.01 
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416065 BE2S7931 Hs.78996 NM.002592:Homo sapiens proBferaQngce! 3.99 

403739 3.99 

417576 AA339449 K5^285 NM_000819:Honio88pi8nsphosphoift)Osy1gty 198 

^ 405484 3.98 

0 409162 K25530 Hs.50888 Hs.50668so!ut8 carrier family 22 (organ 3.88 

430514 AA318501 Hs^41587 NM-021246:Haro8a;to lymphocyte oitlge 3.97 

414595 B£439915 Ks.76913 Hs.76913:proteasome(pfosom8,niacn}p^) 3.97 

411165 NM.000169 Hs.69089 NM.000169:Homo sapiens oalactosidaso,8l 197 

421975 AWg61017 Ks.6459 (locuslink}NM.024531:Homo sapiens hypolh 3.96 

10 431836 AF178532 Hs.271411 NM.138992:Honi08aplensbetfr6lteAPP^ 196 

412133 U83480 Hs.104557 MM.001859:Hoino sapiens soUla canter fa 196 

41287D N22788 Hs.82407 NM.022059:Homo8apfen8ChemoMn8(C-X<; 195 

413278 BES6308S Hs.833 Hs.833dnterfenxK(!mu)ated protein. 15 185 

400130 Hs.155560 NM.001746:Homo sapiens ca!nexin(CAN)Q. 193 

IS 430896 AA531276 Ks.59509 Hs.S9509£ST8» Weakly similar to similar 193 

443802 AV\e04924 Hs.9805 Hs.9806:expoitin 5 193 

407777 AA161071 Hs.71465 Hs.71465:squateneepoxk]ase 192 

456629 AW891965 K5.387942 Hs.387942:Homo sapiens, done IMAGE:4701 192 

421943 BE616520 Hs.343912 NNL033504:HomosapiensCAC-1 (CAC-1),mR 191 

20 422293 X94453 Hs.114366 h{s.114366:pytT0iln&5<ait)axyiatesynlhe 190 

419488 AA316241 Hs.90691 NML006991Homo sapiens nudeophosminAnjc 189 

425123 AW205274 Hs.154695 NM.000303:Honio sapiens phosphomannomutas 189 

447343 AA256641 Ks.236894 Hs.236894:ESTs.^h!y similar to 802392 188 

413254 U40272 Hs.75253 NM.004135:Hofno sapiens isocitratodehydr 188 

25 413950 AA249096 Hs.32793 Ks.32793:Homo sapiens CDNAFU31 108 fis. 188 

409453 AI885516 Hs.95612 H3.95612:EST8 187 

452888 AW955454 Hs.30942 NM.004093:HomosaptensephrM2(EFNB2) 186 

421910 NM-014588 Hs.109437 NM.014586:Homo sapiens hoHnonallyupresu 186 

434263 N34895 Hs.79187 Hs.79187:cox5aclde virus and adenovinjs 185 

30 444700 NM_003645 Hs.11729 NWL003645:Homo sapiens fatty-add-Coenzy 185 

433862 W07162 Hs.150826 NM_020387:Homo sapiens RAB25. member RAS 184 

418526 BE019020 Hs.85838 NM_004207:Homo sapiens soluto carrier ta 184 

425996 AU076629 Hs.165950 NIUt002011:Homo sapiens fibfOblastgrovHth 164 

422616 6E300330 Hs.1 18725 N»^012248:Komo sapiens selenophosplietes 183 

3S 408056 AA312329 Hs.42331 Ks.42331:ephrlnVV4 183 

407233 X16354 Hs.50964 .(tocusI!nl(}NIUL001 71 2:Homo sapiens carcin 183 

426514 BE616633 Hs.170195 Hs.170195:bone moiphogenetic protein 7 ( 182 

451541 BE279383 HsJ26S57 NMJX)7183:HQmos8ptenspia)a3pliilIn3(PK 182 

424441 X14850 Hs.147097 Hs.147097:H2AhIstonef8mIly. member X 181 

40 426378 U80082 Ks.169600 H5.169600:KIAA0826 protein 161 

409638 AA305729 Hs.18272 (lociJsnn1()NM.O30674:itomo sapiens solute 181 

407786 AA687538 Hs.38972 NVL005727.1tomosaptenstetraspan 1 (TSPA 180 

431945 AWQ00827 Hs.11962 NM.030766:H(»T»8apteflsapoptesisreoutet 179 

414561 AI064813 Hs.1951S5 H8.195155:sohjtecanferfamily38kmeinb 179 

45 405556 179 

453082 H18835 Hs.31608 (locusnnk)NM_01763GM)fnO88ptenstiansl 179 

400529 179 

412889 AA290712 Hs^407 Hs.82407xhemddne (CX^ molH) figand 178 

427239 BE270447 HS.356S12 Hs.3S6512:ESTs.Wea)dyste^teUBGM 178 

50 429638 Ai916662 Hs.211577 (tocU5fink}NI^004986:HQmo8aptenskinect 177 

445462 AA378776 Hs.288649 (locuslinl()NM.024051:Homos^)tenshypoth 3.77 

415003 M11437 H8.77741 Hs.77741:ldninogen 177 

443639 BE269042 Hs.9661 H3.9661:proteasom8(prosom8.macropain) 176 

400290 H18836 Hs.31603 (locusnnk)NI^017636:Homo sapiens transi 176 

55 431350 AI192528 Hs.164537 H5.164537;ESTs 176 

430154 AW583058 Hs.234726 NM.001085:Homo8aplens serine (or cystel 176 

435099 Aa]04770 HS.47S6 N^1.004111:Homo sapiens flap stnictur&^ 175 

418216 AA662240 Ks.283099 Hs.283099:AF15q14 protein 3.74 

414907 X90725 Ks.77597 NM.000998:Homo8aptensribosoma] protein 174 

60 413063 AL035737 HS.7S1&4 H3.75164xhil]nBse34ke1 (cafiageg 173 

420665 AW469240 Hs.371581 Hs.371581£ST8 173 

452299 AW206330 Ks.355663 K5.355663:ESTs 172 

444664 N26362 Ks.11615 NKL01 6088:Homo sapiens map idnasephosph 3.72 

450334 Ai^)35959 K5.24879 Hs.24879:phosphalidicaddpho$phateS8t 172 

65 444006 6E395085 Hs.334762 (locusM()Nh^032S32:Komo8a;^rtypoth 172 

449437 AI702038 Hs.100057 Hs.100057'xerine/lhreonine kinase 35 171 

412939 AW411491 Hs.75069 Hs.7S069xerinsiiydn»ymethyltr8nsterase 171 

427490 Z95152 Ks.178695 NM_002754:Komos8ptensmitogei>-ad]vated 3.71 

427333 AF067797 Hs.176658 Nft^001169:Homo8aptensaquaporin8(AQP8 3.70 

70 434203 BE262677 HS.2B355B NiyU)18509M»no8aptenshypoMcaIpfOt 3.70 

414808 D14894 H8.77329 (locustink)NM.0147$4:Homo8apten8phosph 3.70 

456362 AWg73003 Hs.179909 (IocusIin)c)NM.024831:Hdmo sapiens nuctea 169 

409093 6E24383« iti.50441 NM-01S936:HomosaptensCGM}4 protein (I 3.69 

„^ 437016 AU076916 Hs.5398 Hs.539B:ouaninemonphosphate8ynthetese 3.69 

75 430387 AW372B84 HsJ!40770 H8.2407703Wd8arcapb(ndlnopratdnsd 169 

428023 AL038843 Hs.374530 Hs.374530:HQmo8aptenscD»«A:FU23«)2fi 3.69 

432593 AW301003 Hs.51483 Hs.51 483:Homo sapiens, Slmtter to RIKEN 168 

413813 M96956 Hs.75561 NM.003212:Homo sapiens teratocardnoma^ 168 

428376 AF119665 Hs.184011 Hs.184011.TyrQphosphate3e(inoio8nic) 3.67 

80 431890 X17033 H&271986 NML002201KQmo6aptonsIntegiin,aIpha2 167 

446696 AF279265 Ks.298476 NiyL022911itomo sapiens sohite carrier fe 167 

419378 R24922 H8.g0078 Hs.90078:nudeo8dMuoartranspoilBrsl 167 

448140 AF146761 Hs.20450 NM.020125:Komo8apten8Blymptiocyteacfl 167 
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452679 


Z42387 


HS.638B3 


432636 


AA340884 


Hs.278562 


433020 


AI375726 


HS2Z7152 


425003 


AF119048 


H3.154149 


413095 


AA4943S9 


Hs.30715 


417386 


AL037228 


Ks.301957 


409152 


AA1765B5 


Hs.194346 


404828 






453111 


AB014598 


Hs.31720 


409964 


AW368226 


Ks.67928 


446342 


BE298665 


Hs.14846 


452098 


AI856183 




428072 


BE258602 


Hs.182366 


439223 


AW238299 


H&250618 


408137 


AI594131 


K3.29002 


421959 


AW751497 


Ks.98370 


436856 


AI48935S 


Hs.127310 


449667 


AB023227 


Ks.23860 


406684 


X163S4 


Ks.50964 


424534 


087682 


H5.150275 


426031 


AA295251 


Hs.166066 


417526 


AA568906 


H&82240 


443044 


N28522 


H&8935 


424154 


AF026004 


Ks.141660 


432886 


6E159028 


Hs.279704 


413880 


AI860842 


Hs.1 10915 


421357 


AK000609 


Hs.103808 


436827 


H72187 


HS.3S6668 


416084 


L16991 


HS.790C6 


420162 


8E378432 


H8.95577 


413476 


U25849 


H&7S393 


412115 


AKD01763 


H8,73239 


413588 


AA971014 


Hs.75432 


431512 


BE270734 


Hs.2795 


435777 


AW419202 


Hs.286192 


431211 


M86849 


Hs.323733 


453258 


AW293134 


Hs.32597 


414812 


X72755 


Hs.77367 


423058 


M25629 


Hs,123107 


443180 


R15875 


Hs.258576 


425047 


U34038 


Hs.154299 


449057 


AB037784 


Hs.22941 


444184 


T87841 


Hs.282990 


412641 


M16660 


Hs74335 


413781 


J05272 


Hs.850 


409213 


U61412 


H3.S1133 


447495 


AW401864 


H5.18720 


447200 


BE543146 


H&281434 


408683 


R56665 


Hs.46847 


431B42 


NM-005764 Hs,271473 


457284 


AF102850 


Hs^7933 


411678 


AI907114 


H3.71465 


437704 


AA766142 


Hs.131810 


419693 


AA133749 


Hs.301350 


407971 


AI469117 


Ks.62918 


424865 


AF011333 


Hs.153563 


432211 


BE274530 


Hs.273333 


436014 


AF281134 


Hs.283741 


436278 


BE386290 


H&5097 


440334 


BE276112 


H5.716S 


428788 


AF082283 


Hs.193516 


424909 


S78187 


Hs.153752 


407722 


8E252241 


H&38041 


417129 


A1381800 


Hs.300684 


409463 


A1458165 


Hs,17296 


407137 


T97307 




454390 


AB020713 


Hs.56966 


438485 


WS7578 


Hs.378718 


423750 


AF165883 


Hs.298229 


446946 


AI878932 


Hs.317 


413380 


AI904232 


Hs.75323 


430237 


AI272144 


H8.236522 


436127 


W94824 


Hs.11555 


407770 


AW607831 


H3.38738 


411950 


T28407 


Hs.81564 


434845 


BEa67057 


Hs.325321 


420319 


. AW406289 


H5.96593 


425209 


AIJ049761 


Hs.155140 


410174 


AA308C07 


Ks.59461 


429023 


NM_000312 Hs.2351 


4264^ 


AF151812 


H5.169992 


437867 


BE277414 


H&S947 


428093 


AW594506 


H8.104830 



(loct]sIlnk)NM-020182:Komo se|to trsnsm 
N^^.001307;Homo saptens claudln 7 {CLDM7) 
N^016391:Homo sapiens hypothetical pmt 
NM.014481:Homo sapiens APEX nudaasa (ap 
Hs^lSipotassium vdtag&galed channel 
^ayL018144:HQmo sapiens Sec61 alpha fonn 
Hs.194346:S|rir.2prot8ln 

NM_01 4799:HonK> sapiens hephaesSn (HEPH) 
HS.6792&EST8 

Hs.l4846:HonK} sapiens mRNA: cONA 0KFZp56 
BF755039:QV^CT0583.18100(M28-f07 CT058 
NM-016292:Honvo sa^ heal shock prate) 
NM„02S217:Hofno sapiens UU6 Undb^ pnrt 
Hs.29002:KIAA1706 protein 
NM^030622Miino sapiens cytochrame P4S0, 
(locusrin)()NM.144624;Homo saj^ Idnase 
Hs.23860rKlAA1010 protein 
(locusfink}NM.001 71 2:Homo sapiens carcin 
Hs.l50275:KIAA0241 protein 
(1ocuslin}i)NI^006G97:Homo sapiens dspla 
Hs.62240syntaxtn3A 

NM.014298:Homo sapiens quinoHnate phosp 
NM.004366:HQmo sapiens chloride channel 
H5.2797D4xhioniatin aocessibOily oompla 
N»l.Q21258:Homo sapiens interletildn 22re 
NM.017896:Homo sapiens chromosome 20 ope 
(tocusI)nl()NM_005274:ita> sapiens guanin 
NM.012145:Homo sapiens deoxythymldyiate 
NM_052S84:Homo s^ns cydin^ependent 
NMJ04300:Homo sapiens add phosphatase 
Hs.73239:hypoth6lica! protein FU10901 
NM.000B84:Homo sapiens IMP (inosine mono 
H8.2795jactate dehydrogenase A 
NM-032192:Homo sapiens protein phosphata 
Hs.323733:gap juncHon protein, t}eta 2. 
NM.005977:Homo sapiens ring finger prote 
NM.002416:Homo sapiens monoldne induced 
NM.002257:Homo sapiens ttaOiiuein 1. ren 
NM_012129:Homo sapiens daudin 12 (aONI 
NM_005242:Homo sapiens coaguiatlbn facto 
Hs.22941:KIAA1363pn>tein 
{k)ct]sliiik}NM.033550:Homo sapiens chrome 
Hs.74335:heat shodc 90ld> protein 1 . beta 
Cocustin)()NM.000883:Homo sapiens IMP (i 
NM_005975:Homo sapiens PTK6 protein tyro 
NM.004208:Homo sapiens programmed ceil d 
H8.281434:Homo sapiens tXm FLJ31 373 lis 
NM_01 661 4:Homo sapfens TRAP and TNF leoe 
Hs.271473:epitheR8l protein up-reguiate 
NM.013338:Homo sapiens A]g5. S. cerevisl 
NM.003129iiomo sapiens sqidena epoxUas 
Hs.131810M}mo sapiens cONA FU35976 lis 
H&3013SO:FXYO domdbKContainlno ton tra 
H3.6291B:CDC91 ceil dhrtsioncycte 91-11 
NMJI02349ilomo saptens lymphocyte antige 
H&273333Jtypo1het!cal protein RJ10986 
NM^0201 58:Komo sapiens exosone component 
H&5097xynaptooyrin 2 
NM.003g04i{omo sapiens zinc finger prote 
NM.003921:Homo saj^ B^ CLLAympho 
(locuslin)()NM.004353:Komo sapiens eel d 
NM-003681*ilomo sapiens pyrtdoxal (pyrido 
Hs.300684»alcQonin gene^Iated peptid 
NA^023930:Homo sapiens hypotheOca) prot 

(Iocusl!nk)NM.024923:Homo sapiens hypoth 
)Hs.378718:HQmo sapiens cDNA FU33433 fis 
NM.012394:Homo sapiens prefoldin 2 (PFDN 
NM.003268:Homo sapiens topoisomerase (DN 
Hs.75323:prohibi8n 
Hs.238522J)KF2P434P106 protein 
NM.080748:Homo sapiens chrwnosome 20 ope 
NM_014343:Homo sapiens daudin 15 (CL0N1 
NM-002819:Homo sapiens ptetetetfiactor4 
Hs.325321:WD repeat domain 18 
NM_019034:Homo sapiens ras homdog gene 
NM.001895:Homo sapiens casdn Unase 2, 
Ks.59461:DKFZP434C245 pmtein 
N14.000312:Homo salens protein C {teadl 
NM.015966:Homo sapfens serologlcdly def 
NM.005370:Homo saptens md trsnstbnrfng 
Hs.104830£ST8 
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BE621310 


Hs^23 








413186 


AUOHMI 


Hs^4548 




NM.001197 Hs.155419 




AL080235 


Hs^861 




AB023182 


Hs.184523 




AI538871 


H3.378965 




AB020643 


Hs.183006 




NM_004460 Hs.418 




N99692 


Hs.75227 


>UAft7C 
*WUOfO 


NM 004987 Hs.112378 








401000 


AW296135 


Hs.267659 


410199 


AW377424 


Hs.205126 


**i£QiSa 


Ai796390 


Hs.210667 


429344 


R94038 


Hs.374664 




W21223 


Hs.151734 


456950 


AF111170 


Hs.306165 




BE244231 


H8.84D38 


459454 


AW052006 


Ks.374973 


40044B 






424142 


AI678727 


Hs.378970 


430720 


Ud5768 


Hs.247838 


416412 


NM.014742 Hs.79305 




AA296363 


HSJ21520 


412946 


BE243313 


Hs.334851 


4oil£9 


BE072881 




4/30 


U63630 


Hs.155637 


440491 


BE3977S3 


Hs.14623 


4alMi 


BE266322 


Hs.211374 


4£Jl90 


M81933 


Hs.1634 


440U9s) 


AW977382 


Hs.15898 


414U40 


NM_M2951 


Hs.75722 


4Cll9V 


U95031 


Hs.102462 


•ilsOUf 


R52557 


Hs.91579 


4J09/D 


AL118990 


Hs.373554 


410410 


U11700 


Hs.84999 


4iJi30fU 


AI580053 


Hs.109007 


441 UO 


AA570256 


Hs.348504 


40ZJZU 


AW411066 


Hs.274351 


444Ul» 


BE173977 


Hs.10098 


4j^0 


T47364 


Hs.278613 


410219 


T98226 


HS.1719S2 


410663 


AA194952 


Hs.36093 


402829 






445921 


AW015211 


Hs.153799 


414198 


AW505308 


Hs.75812 


443425 


AI056776 


Hs.133397 


436485 


X59135 


Ks.156110 


410268 


AA316181 


Ks.61635 


425159 


NiyL004341 HS.15486B 


420614 


AL110291 


H$.99364 


421814 


L12350 


Hs.108623 


432215 


AU076609 


H3^34 


409402 


AF208234 


Hs.695 


421038 


AU}80192 


Hs.101282 


4244UO 


At754613 


H9.146428 


448775 


AB025237 


Hs^8B 


442821 


BE391929 


Hs£752 


459306 


AW578452 




400846 






422256 


M64673 


Ks.1499 


jnonsn 


H59799 


Hs.42644 


432078 


BE314877 


Hs.24553 


435575 


AF213457 


Hs.44234 


456534 


X91195 


Hs.100623 


44 wo 


BE617695 


HSJS86192 


414368 


W70171 


H5.75939 


422599 


BE387202 


Hs.1 18638 


437897 


AA770561 


Hs.146170 


431183 


N^t006855 Ks.250696 




AV660976 


Ks.3569 


432391 


AI732374 


Hs.339827 


417540 


D30857 


Hs.82353 


440086 


NM 005402 Hs.6908 


401179 






411125 


AA151647 


Hs.688n 


453323 


AFD34102 


Hs.32951 


407238 


W79485 


Hs.173980 


447250 


Ai878909 


Hs.17883 


452875 


6E275760 


H5.30928 


428390 


At640377 


H3.350077 


425811 


AL039104 


HS.1S9557 



H3.923slnflte«tran(Jed binding jrote 

Hs.374548»riut8 csrrter famOy 1 6 {mono 
NM.001 197:Homo sapiens BCU-lnleracfing 
Hs,35851:RBs^uc8d senescence 1 
Hs.184523serine/(hreon!n8 kinase 38 Gk 
R5,378965:Homo sapiens cDNA RJ37658 fis 
Ks.183006£kely homolog of mouse hepari 
(tocusCnk)NM.004460:Homo8^iiens fibrob 
Nll4.005002:Honto sapiens NADH dehydrogenas 
(tocusEnk)NM-004987:Homo salens UIW an 

NK^006113:Homo sapiens vav 3 oncogene (V 
HsJ205126:Homo sapiens cDNA: FU22667 li 
Hs^10667£Srs 

NM_005538:Homo sapiens tnhlbh. beta C ( 
Hs.1 51734siuctear Iransport lactor 2 
Hs.306165:EST6, Highly simiiar to unknow 
Nt4_015937:Homo sapiens CGM)6 protein (L 
Nkl.0046g7:Homo sapiens PRP4 pre^RNA pro 

Hs^78970:Homo sapiens cDNA FLJ35102 fis 
NM.002991itomo sapiens small inducible c 
Hs.7930S:iaAA0255 gene product 
Hs.121520:Homo sapiens cONA FU35792 fis 
Hs.334851:UM and SH3 protein 1 
BE072881.'RC2-8T0548-20030^012-e09 BT054 
NM.008904:Homo sapiens protein Unase. D 
Hs.14623:lnterferDn. gammaMudbls pro 
Oocusilnk)NM_145051:Homo sapiens t^tti 
Hs.1634:cendivi5toncycle2SA 
Hs.1 5898:2,4-dienoyt CoA reductase 2, pe 
NM.002951:Honio s^tens ribophorin II (RP 
Hs.102482:mucin 5. subtype B. tracheobro 
NNt033416:Homo sapiens similar to HYPOTH 
(locusnnk)NhL130786:Komo saptens alpha- 
Nhl.000053:Homo sapiens ATPase, Cu<h> tran 
Hs.109007:Homo sapiens. Simiiar to L0C15 
Hs.3485D4:bypotiiet}cal protein BC014072 
NML016032:Homo sapiens zinc finger, DHHC 
NAiL019082M)ino sapiens putative nudeota 
(k)cuslink)NK1.005532M}mo sapiens tntarf 
Hs.1719S2»ccludln 

Ks.36093:Homo salens cDNA FU12885fls. 

H8.1S3799:Homo sapiens cDNA aJ38333 fis 
Nh^004563:Honio sapiens pliosphoenolpyniva 
Hs.133397:ESTs 

HS.1S61 lOnmmunogtobuBn kappa constant 
NM_012449:Hon[io sapiens sU transmembrane 
NiyLQ04341:Homo sapiens carbamoyliphosplia 
Hs.99364:abhydrolase domain containing 1 
NM_003247:Homo sapiens thrambospondin 2 
NH.601033:Homo sapiens nbonudeoSde re 
Hs.695:cy8taiinB(8tefInB) 
Hs.101282:Homo sapiens mRNA; cONA DKFZip4 
Hs.146428:ooIlagen. type V. alpha 1 
NML002452:Homo sapiens mtdix (nudeoskJe 
Hs.8752:transmembr3ne protein 4 
AW8452:RC1^T0252^3010O^bO7 CT025 

NM.005526:Homo sapiens heat shock transc 
K3.42844Mrodoxii>4n(e2 
(locusBnk]N^L022369:Homo sapiens hypolh 
KM_018965:Homo sapiens tiiggering leoept 
NM_138689:Homo sapiens proton phosphata 
N^4.032192:Homo sapiens piotefn phosphata 
Nh4_012474:Homo sapiens uridine monophosp 
Hs.1 IBOdS-jtorwnetastalic ceils 1 . protel 
Hs.146170:tiypotheQcaI protein RJ22969 
N^L006855:Homo sapiens KOEL (Lys-Asp^u 
Hs.3569:chromosome 20 open leading liame 
Hs.339B27:ESTs. Weakly simitar to protea 
NM.006404:Homo sapiens protein C reoepto 
NM.006402:Homo sapiens ^wa! simian leuk 

Hs.68S77xytochrome b-245, alpha polypep 
NM.001532:Homo sapiens solute carrier fa 
Hs.173980atticlear matrix protein NMP200 
NM_002707J4omo sapiens protein pho^ata 
NM.0061 14:Homo sapiens transkicase of ou 
NM.000982:Komo ss^ns dbosoma) protein 
NMJKK2266:KDmo sapiens loyopherin ajpha 
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418558 
444706 
444734 
424482 
438203 
409686 
418681 
419705 
420186 
413835 
448153 
425274 
435472 
451932 
420085 
412006 
424954 
437741 
414602 
400263 
434457 
452203 
413431 
437379 
408716 
433627 
430393 
417268 
434224 
428028 
445580 
420531 
417389 
412276 
452700 
433604 
414883 
421743 
447698 
424089 
414788 
442315 
449944 
426244 
426484 
418703 
436415 
447151 
418862 
410636 
423599 
435886 
458778 
420190 
428371 
450690 
413900 
406698 
407797 
447321 
426841 
418650 
420676 
438444 
439778 
412326 
447656 
428109 
421779 
400262 
418803 
449230 
421532 
432026 
433179 
452264 
418841 
421612 
427349 
457670 



AI816736 

BE002798 

NM.004182 

AW082266 

AK000398 

NM.001360 

BE268621 

BE540090 

AKD00002 

AA287786 

AW368634 

NM.015925 

AI272727 

Y10805 

BE281191 

AW972330 

AA360954 

AI741909 

AW451618 

raiL000546 

BE561610 

AW630088 

AF141332 

X57522 

AW246428 

AL359575 

AI56783g 

AF078866 

GE185030 

AA122237 

AA380731 

U52112 

AF167572 

AI652069 

BE260964 

eE262621 



NM_013W2 

AA926960 

T35958 

A1420156 

AL036662 

X78342 

AA173992 

AF290512 

AK002127 

AF104032 

NM.014448 

BE265254 

AI022813 

BE550964 

AA088177 

AI805664 

BE265839 

AW451034 

A1816209 

AB012193 

AA296656 

AW409747 

X03068 

AK000S24 

AW271217 

AI052356 

BE386750 

AI434780 

A1C64707 

AL109729 



NIUL003726 
AW732918 
AI879159 

U50079 

BE613348 

AW138207 

AAS24545 

AW3&2945 

AU077013 

BE243138 

AF161254 

AA360154 

AF119666 



te.14896 

Ks^7850 

Hs.172791 

H$.B6131 

Hs.11747 

Ks.11806 

Hs.149155 

H3.7345 

Ks.55879 

Ks^3449 

Hs.154331 

Hs.95697 

Hs.249163 

H8.20521 

K5.155462 

K8^83022 

Hs.27268 

Hs.44680 

Hs^16 

HS.1B46 

H!L5609 

H5.76550 

Hs,75309 

Hs.2a0333 

Hs.352018 

Hs.75355 

Hs,23765 

Hs.151714 

Hs^296 

Hs^41305 

H&81874 

Hs^ 

Hs.182018 

Hs.12912 

H&98814 

HSJ2045 

Hs.737Sd 

Hs^88940 

Hs.3439 

H&348669 

HS.107&14 

Hs^26733 

Hs.144949 

Hs.77313 

Hs-7956 

H&S8215 

Hs.155313 

Ks.184601 

Hs.87435 

Hs.343258 

H8.92679 

Hs.89399 

H3.172870 

Hs.31731 

K3.12126 

Ks.326525 

Hs.95887 

Hs.183874 

Hs.333418 

Hs.75612 

Hs.73931 

HS.398S0 

Hs^1434 

H8.131741 

Hs.86978 

Ks.424d 

Ks.301226 

K8.99364 

H5.73817 

Hs.19126 

Hs.182490 

H5.108219 

HS.7S309 



^148962{nc finger, OHHC domain conti 
NM.002219:Homo saftem bitegral menibrane 
NM.004182:Homo sapiens uUquiiaus^-expr 
Hs^lSlrFasfTNFRSFQsBssociatedvfade 
(locuslh)()NM_017798:HDmo sapiens dinxno 
NM.001360bHomo sapiens 7-dehydrect)oleste 
(iocusSnX}NM_003374:Homo sapiens voliag 
Hs.7345:MA01 ndtoGc anest deficfent^ 
(locusflnh}Nlil.03345Q:Homo sapiens multid 
Hs.23449jnsu!in receptor tyrosine Idnas 
Hs.154331:ESTs 

H3.95697:Iiver'Bpecific bHLMp banscr 
NM_024306:Homo sapiens fatty acid tiydrox 
NM.001636:Homo sapiens HNfTI tinRNP methyl 
H3.1S5462:MCM6 minicttfomosome maintenanc 
NhC018643:Homo sapiens triggering recept 
Ks77268:itomo sapiens cDNA: FU21933 fis 
Hs.44680:hypofhel)cai protein aJ20979 
Hs.290216:ESTs 

NM_000546:Komo sapiens tumor proteio p53 
NM-020470:Homo sapiens putative transmem 
NI^0S2886:Hoim sapiens mal, T-cell ditfe 
NM.001961:Homo sapiens euttaryotx: liansl 
NM_018690:Homo sapiens apolipopfotein B4 
NM_000593:HomQ sapiens transporter 1, AT 
NM.003348:Homo sapiens ubiquitin^Juga 
Hs.23765:membr8na metaOt^endopepSdase- 
(locus&nk)NM.033405:Homo sapiens penud 
NH.033161:Homo sapiens surfeit 4 {SURF4) 
(!ocusEnk)NIA.006470:Homo sapiens ti^ar 
NM.002413:Homo sapiens microsomal glutat 
NM_000206:Homo sapiens hterteukin 2 rec 
Hs.182018:lnterteuidn-1 receptor-assoda 
NM.006109:Homo sapiens SKB1 inmolog (S. 
NM_004587:HQmo sapiens ribosome binding 
H8.82045:midl(ine (neurite growOvf mmotl 
H8.73798:macrophage migration inhibitory 
NM.Q21259:Homo sapiens transmembrane pro 
Hs.3439'^tomalin (EP672}-&1(e 2 
Hs.348669:COC28 protein kinase 1 
Hs.l07614:DKFZP564l1171 protein 
NMj052858:Homo sapiens simHar to RIKEN 
Hs.144949:ESTs 

Hs.77313:cycrnMlependentMn8se (C0C2-I 
Hs.7956:EST6 

(iocusnnk)NM_033046:Homo sapiens rhotek 
NM.022105:Komo sapiens deatti assodatsd 
(1ocu5t!nl(}NIVL003486:Homo sapiens solute 
HsJ7435:Rho guanine exchange factor (GE 
l^006191:Homo sapiens proiiferatlon^ 
(iocustInk)NVt145754:Homo sapiens Mnesi 
NM.005176:Homo sapiens ATP synthase, 
Ks.172870:iaAA1913 protein 
(!ocus!Ink}NM.012094:Homo sapiens peroxi 
NM„018487iiomo sapiens hepatocenuiarca 
NM.001669-iiomo sapiens arylsutfialase D ( 
(iocusIin)(}NA^024112:Kcmo sapiens chromo 
NM.0035B9M}mo sapiens culln 4A (CUL4A) 
(loctt8nni<)NM.014164:Homo sapiens FXYD d 
NM-008819:Homo sapiens stressMucedi)h 
hts.73931uncjor histocompaQbi&ty oomple 
Hs^850:uridineldnas»#e1 
Hs^81434:Komo sapiens cONAaJ31373 fis 
Hs.131741:EST8 



H$.356392 
Hs.146170 
Hs^4630 
Hs.162459 
Hs^57 
Hs^7 
Hs.106196 
Hs.177415 
Ks23449 
1 



Hs.4248:Homo sapiens PP3781 mRNA, oompJe 
Hs^1226:Homo sapiens, done iI^GE:3456 
Hs.S9364»bhydrolase domain containing 1 
NM_002983:Homo saptens snoB indudUe c 
NRi.003726iiomo sapiens src ferrfly assod 
Hs.182490:leudneHlch PPI^molif contain 
NM_004626.'Homo sapiens wingless-^pe MMT 
N^LC01g61:Homo sapiens eukaiyolic trans) 
Nh^004g64:Homo sapiens hlstone deacetyta 
H3.356392:ESTs. H^hly sfenilar to S-phas 
NM.022842Komo sapiens hypothetical prot 
Hs^4630:Honio sapiens cDNA FU33318 Bs 
Hs.162459:ESTs 

Hs.28757.1ransmembrane 9 superfamily mem 
NM.001 109:HomD sapiens a didntegrln end 
OocuslWONli|,016579:HomD sapiens 808 an 
OocusiinlONM.001997:Homo sapiens Rntel 
NM.0t8842:Homo sapiens insulin receptor 



3.24 
3.24 
3.23 
3.23 
3.23 
3.23 
3.23 
3.23 
3.23 
a23 

m 

3.22 
3.22 
a22 
3.21 
3.21 
121 

a2i 

3.21 
3.21 
3^1 
120 
3.20 
3.20 
3.20 
3.19 
3.19 
3.19 
3.19 
3.19 
ai8 
3.18 
ai8 
3.18 
3.18 
3.18 
3.18 
3.18 

ai8 
ai8 
ai8 
ai7 

3.17 

ai7 

3.17 
117 
116 
116 
116 
115 
115 
115 
115 
115 
115 
114 
114 
114 
114 
113 
113 
113 
113 
113 
112 
112 
112 
112 
112 
112 
111 
111 
111 
111 
111 
111 
111 
111 
111 
110 
110 
110 
110 
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43B407 At457122 Hs.129S73 Hs.129873»ukBiyoticlranslaSonifaBal 3l10 

428293 BE2S0944 H5.1835S6 H3.183556xotu1a canter family t(neulr aiO 

436823 AW749865 Hs.117077 Ks.1170772lnc finger protein 284 110 

- 418181 U37012 HsJ3727 NiyL013291:Komo sapiens daavage and poly 3.10 

5 434826 'AF155661 H&22265 Hs.2226S:pynJvate dehydrogenase phosphat 3.10 

428734 BE303044 Hs.192023 NAij)03757:Hoino sapiens eukanioltetran^ 3.09 

423022 AA32052S Hs.201076 Hs.201076:EST8 109 

427648 A1376722 Hs.1 60062 Hs.180062:protea5ome(prosom8,maaop8ln 3.08 

404240 3.08 

10 408989 AW361666 Hs.4950O Ks.49500:KlAA0746 prot^ 108 

452835 AK001259 Hs.30738 NM.01 80B7:Homo sapiens hypothetical prol 108 

446506 AI123118 Ks.15159 (locusQnk)NM.016326:Hofno sapiens chemok 108 

402260 108 

429671 BE379335 Hs.211594 Hs.211S94:pn>teasome (prosonie.macropaln 108 

IS 409267 NM.012453 Ks.S2515 NM-012453:Homo sapiens transdudn (beta) 108 

445937 AI452943 Hs.321231 (tocusank}NM.003779: Homo sapiens UDP^ 107 

431243 U46455 Hs.252189 N(uL002999:Homo sapiens syndecan 4 (amphi 107 

422611 AAt58177 Hs.118722 (tocusIiRl4NhL00448aHo(no8qplef8fiJ0OSy 107 

427647 W19744 Hs.180059 Ks.180059:Hoino sapiens CDNAFU31380 lis 107 

20 449644 AWgG0707 Hs.148324 Hs.148324:ESTs 107 

448719 AA033627 K3.21858 Hs.21856serine (or cysteine) proteinase 107 

417457 AA378907 Hs.349326 Hs.349326:Homo sapiens cDNAFLJ306770s 107 

454128 AL0312S9 Ks.367900 Hs^900:progrBnnnedcendBalh2 107 

424927 AW973866 Hs.153850 Hs.1 53850:hypothettca)pn)tdnC321 02.4 107 

25 428144 BE269243 Hs.18262S Hs.182625:VAMP(veslde-associatedni8mbr 107 

426440 BE382756 Hs.169902 NM-006S16>iiomo sapiens solute center (a 106 

426410 BE298446 Hs.305890 NM.138578:Honi08apiensBCL24lke1 (B(X2 108 

426812 AF105365 Hs.172813 NlliL006S98:Honio sapiens sdutecsnier fa 3.06 

435750 ABQ2g012 H3.4990 Hs.4990:KIAA1089 protein 106 

30 421602 BE261458 Hs.108408 (locuslink]NM.016022:Homo salens CGI-78 106 

421905 A1660247 Ks.3269g Hs.32699:Homosaptens. Similar to RIKEN 106 

430542 AI557486 Hs.119122 Hs.1191223ibo$onial protein L13a 106 

427268 X76S20 H&.174139 NML001829:HofflO sapiens ddoride channel 106 

412525 AA581439 Hs.152328 Hs.152328£8T8 106 

35 422613 AV656571 Hs.121068 (locustinl()NlUL003270:Homo sapiens transm 105 

441406 Z45957 Hs.7837 Hs.7837:phosphopnitein regulated t}y mito 3.05 

406806 AW847614 HS.7S608 H3.75601ii9hl {unction protein 2 (zona 105 

435730 AB020635 Ks.4984 Hs.49B4:KIAA0828 protein 105 

432871 NM-016142 Hs.279617 Hs.279817±ydroxysterold {17-beta) dehyd 3.05 

40 447783 AF054178 Hs.19561 NiyL005001:Homo sapiens NADH dehydrogenas 105 

426266 AF083420 Hs.168913 Nlk^003576:Honio sapiens serlneAhreonine 105 

450447 AF212223 I4S.2S010 NML018698:HoniosaptenshypothetiC8iprDt 3.05 

427337 Z46223 Hs.176663 Nli4.000569:Hono sapiens Fc fragment of Ig 3.05 

406363 105 

45 439841 AF038961 H5.6710 NM.004870:Homo sapiens mannose-P-4loncho 3.05 

431738 AW237726 Hs.288549 NML032828:Homo sapiens (diiquiGn UBF-fl 3.04 

447966 AA340605 Ks.105887 (locuslInk)NM.1452S2:Homo8aptenssIinila 104 

439246 AI498072 Hs.351474 H3.351474:Homo sapiens cONARJ30002fis 104 

419493 AR»1212 Hs.90744 Hs.90744:pioteasoma (prosome, macropain) 3.04 

50 427597 015049 Ha.179770 NAC002&42:Hon»8aptens protein (yioslne 104 

430281 AI876842 H8.237924 NIUL016016:ltoosaptensC6l^ protein (L 104 

446620 AA12e808 Hs.179902 (locusIinl()NM.022109:Komo sapiens CDw92 104 

452865 AI924046 Ks.119567 Ks.119567:EST8.WeaIdy8tmitertoALU1_H 104 

422164 NH4.014312 Hs.112377 H3.112377.'cortical thymocyte receptor {X 104 

55 444301 AKD00138 Hs.10760 NM.01768ftHomo sapiens aspoite(LRRdas 104 

451455 AI937227 Hs.8821 . NfyL02117S:Homo sapiens hepddinanfiniicr 103 

417777 AI823763 HsJOSS H3.7055:Homo sapiens cDNAFU33420 (is, 103 

417144 AA382104 H8.81337 Ks.81337:tec6n,ga!actosideWng,80 3.03 

450825 AaX)5954 tis.25527 (locusnnk)NM_014428:Homo sapiens tight 103 

60 414774 X02419 H8.77274 NM.002858:HomosaptensptesfflInO9enacflv 103 

409430 R21945 KS.34673S Hs.346735:Homosaptens.donelMAGE:3881 3.03 

440659 AF134160 Hs.7327 NM.021101:HomosapiensClaud]n1 (CLDN1) 103 

432268 BE3118S6 H5.274230 Hs.274230'.3'-phosphoadenosineS^-phospho 103 

438930 AW843633 Hs.343261 Hs,3432611iIstocompatitdRy(mbwr) 13 102 

65 412599 AU076782 Hsj;48267 (1ocusliny)NM-021126:Komo8aptensnneicap 3.02 

426788 U66615 Ks.172280 N»A.003074:Homo saptens SWI/SNF reteted, * 102 

425966 NM_001761 Hs.1973 NM-001 761 itano sapiens cydlnF(CX5NF), 102 

448847 A1587180 Hs.11090S Hs.110906'iiypothetical protein BC004501 102 

431236 AV656840 Hs.285115 NM_001560itonosaptenslnterteuttn 13ro 102 

70 414702 122005 Hs.76932 NR/L004359:Homo sapiens ceO division eye 102 

430024 At808780 Hs.227730 NKt000210:Honio sapiens Integrtn, alpha 6 102 

424394 BE277024 Hs.146381 Ks.146381iVMt)lndIng motif protein. Xc 101 

442993 BE018682 H3.166196 Hs.166196;ATP8sa,Cla$sl,type8B.men)b 3.01 

437712 X04588 Hs,85844 Hs.65844:neim)trophic tyrosine Idnase, r 3.01 

75 410293 mmi K3.6I95O NM.018992:Homo sapiens hypothetical prot 101 

454358 AW792876 Hs.288938 NI^03142O.Homo sapiens rr^tochondrtaliib 101 

411531 AB014511 Ks.70804 Ks.70604-i^TPasd. Class It. type 9A 100 

447032 AK000310 Ks.17138 (Iocuslfaik}NM_017755:Homo sapiens hypoth 100 

414249 AI797994 H3.279929 (locusnnk)NM-017510:Homo5aptensgp25l^ 100 

80 448440 AA173467 Ks.62402 i^62402:p2iyCdc42/Rx1*8Ct]vatedldn3se 3.00 

423184 N1^004428 Hs.1624 NM.004428:Homo saptens ephilnVVI (EFNAI) 3.00 

419452 U33635 Hs.90572 Hs.90572m7prot^^raslne kinase 7 100 

417878 U90916 Hs.82845 Hs.8284ScHomo88ptensCDHA:IU21930fls 100 
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431884 


AA521246 


Hs^0792 


Hs.210792:Honn sapiens cONA FU36691 lis 


3,00 


425261 


8E385099 


Hs.355814 


Hs355814:Homo sapiens done iMAG&29333 


3.00 


445229 


BE276013 


Hs^28 


Hs.343828:Homo sapiens mRNA; cDNA 0KFZp7 


3.00 


412146 


M92444 


Hs.73722 


Hs.73722:APEX nudeaso (rradSuncSonal 


2.99 


437763 


AA469369 


Hs.5831 


Hs.5831:6ssw bihlbRor of nvtaDoprote 


2l99 


406885 


AI025931 


Hs.181357 


H5.18t357ctanMn loceptDr 1 {67ldX ribo 


2,99 


425725 


NM.012243 H8.159322 


(locuslInilONMj012243:HonK) sapions sdute 


2.99 


417259 




lilt oionn 


H$.810)OKhondroQin sulfate proteogtyca 


2.98 


453302 






NA^^139177*Httno sapiens chrofnosome 17 ope 


2.99 


432398 




nS.11WU 


Qocusrin)()Nli4J)32527iHoino sapiens hypoth 


2.98 


411358 




tlfl CU7B1 


HS.94761MAA1691 protebi 


2.98 


453518 




Hs.27268 


HfiJ7268:Hnino sanlens cDNAr fU91933 fis 


2.98 


448913 






NM>004999iHomo sapiens myosin VI (MY06), 


2,98 


440943 


AWAR99Qft 


n5,l«#01Dl 


NM_032331 :Honio sapiens hypothecs! prot 


2,98 


414013 


AL7tttttilVi 
AAfDOOUO 




NM 024fi42!HDmD salens hvDofhfificflll nml 


2.98 


452124 


AM3422D 


Urn C44m 


Hs.61170:EST8 


2.98 


416391 


A1B7R007 


Ue TOTO>l 


NM.0(K402:Honv> sapiens mesodenn specific 


2.98 


419092 




riS.o9oUo 


NM.00245&HonK) sapiens mucin 1, (ransmetn 


2,97 


426375 


AK000597 


Hs.169549 


NM..017893iHonio sapiens seno rtomaln, fanmu 


2.97 


437296 


AA350994 


H3^281 


Hs.20281:MAPK phosphata5e-7 


2.97 


407736 


N41744 


Hs.349326 


Ks.349326:Honio sapiens cONA FU30677 fis 


2.97 


443303 


U67319 


Hs,9216 


NM^03334O:Honio sapiens caspase 7, apopto 


2.97 


424758 


AW504657 


H5.152931 


nocusfink) N M 0D2?3fi' Homo ssnls ns 1 anrrin 


2.97 


430354 


AA954810 


Hs.239784 


Hs.239784:scribbi6 


2.97 


417079 


U65590 


Ks.81134 


(locusnnk}NM_000577:Konio sapiens bited 


2.97 


425221 


AV649864 


Hs.155168 


NM.005642:Hoino sapiens JfiF7 RNA polym^ 


2.97 


414188 


U33448 


Hs.75799 


Hs.75799^pfote3S6| serine, 8 (pfostasin) 


2SI 


432065 


AA401039 


K3.2903 


Hs.2903^protein phosphatase 4 (fonneriy 


2.98 


452012 


AA307703 


Hs.279766 




2.98 


412429 


AV650262 


Hs.75765 


NM_00^9'i1onK> sapiens GR02 oncogene (GR 


2,98 


452714 


AW770994 


K3.30340 


Ks.30340:hypothetical protein KIAA1165 


2.96 


429922 


297630 


Hs.226117 


NM 00531 8*Homo saolens HI histona famllv 


2,96 


'434931 


AW968941 


Hs.166254 


Hs.166254:]B{ahr Qfttitrfoa of ral vttuole 


2.96 


416293 


BE244454 


Hs.79162 


Hs»79162!StfuctuiQ specific lecoQnitioi) 


2.96 


428781 


AF164799 


Hs.193384 


Hs.193384:putala8vo 28 kDa protein 


2.96 


445350 


AF052112 


Hs.12540 


Nh^008330tHonv> sapiens lysophosph(^ipase 


2.96 


422396 


W21872 


Hs.7907 


(locus!ink)NK^145059!:Honu) sapiens L-fiioo 


2.98 


429597 


MA.003816 Hs2442 


Hs.24423 dMrtflflfln and metaltoproteln 


2.95 


421179 


U72684 


Hs.148495 


N|y^00281O!Honio sapiens proteastvns (prose 


2.95 


417691 


AU076610 


Hs.82399 


NI^007357:Hotno sapiens coniponenl (rf olig 


2.95 


427715 


BE245274 


Hs.180428 


Hs.180428:IOAA11B1 protein 


2.95 


430589 


AJ002744 


H8.246315 


NM_017423:Honio sapiens UDP-N-ace^^ph 


2.95 


409220 


BE243323 


H&51233 


flocusOnldNM 003842:Homo saolens tumor 


2.95 


443883 


AA114212, 


H8.9930 


Hs.9930!86rine (or cystine) potelnase 


2.95 


445720 


ALJ04O482 


Hs.286173 






429583 


NH.006412 Ks.209119 


NN^006412iHonK} sapiens l-acytglycerol-^ 


2.95 


427581 


NM.014788 Hs.179703 


NIVL014766^Honio saf^ens tHp^fiis motif- 


2.94 


419193 


D29643 


Hs.34789 


Nli^005216;Homo sapiens doSchyMiphosph 


2.94 


419152 


L12711 


Ks.89643 


(iocus6nk}NM^001064:Homo sapiens transk 


2,94 


444824 


AAB4357S 


H3.12056 


NM flfl1fi7i'Mnfrn sarong fl«ilfllfvilvMnmfpl 


2.94 


431629 


AU077025 


Hs.265827 


MU n99A7^'Unmn KflnipflQ Inlnrfprrvi ntnhn 
m«^u*M/f w>nwiitu oo^Mciid iiiiuitciuiit oi^ia 


2.94 


425118 


AU076611 


Ks.154572 


Hfi.154fi72*mfilhvIfiRa tRtrahvrinQfobda dsh 

1 lO* I^^Wf ^IllMlllJIWllO IDUCWtjUIWlUIOU7 UQtl 


2.94 


422010 


AA302049 


Hs.31181 


Hs 31181'HQmo saolens cDMA! FLJ23230 fis 


2.93 


436075 


BE090176 


Hs.179902 


NM D80546'Homo saolens C0w928nfiQenrCD 

IllV^wU^^rvUvfWIilV DOylwitJ wUiTwA OliUlJCII \Wv 


2.93 


412338 


AA151527 


Hs.69485 


((ocus!inlc)NMj02466l!Homo sapfens hypoth 


293 


413073 


AL0^165 


Hs.75187 


NM_01 4765:Hoino salens Iranslocaso of oo 


2.93 


412088 


AI689498 


Hs.106g32 




2,93 


447140 


AF070537 


H3.17481 


Nli/L.138391iHonio sapiens hypothetical prot 


2.92 


426746 


J03626 


Hs^057 


NNt,000373:Hotno sapiens ursine monc^osp 


2.92 


424291 


AL120051 


Hs.144700 


NM-004429:Homo sapiens ephrin-61 (EFNB1) 


2.92 


417944 


AU077198 


H3.82985 


NI\4„000393^Horno sapiens oollagen, type V, 


2.92 


42B343 


AL043021 


Hs.12705 


(!ocuslin)()N(V^145294:HorTK) s^ens sin)fla 


2.92 


435640 


AF220053 


Hs.54960 


NM 01fi4fi8'HoiTio fiaotans urtcharft^lBrfTefi h 


2.92 


451608 


AA384525 


Hs^45 


NM 016499'Honn saolens KSPC244 fMGDiaa? 


2.92 


434608 


AA805443 


Hs.179909 


NM 024fl31*Hain>fiafriflnfifiiielAarmam(or 

lllV^Vfi»IVV i*rivlliw OOflivllw IIUMwOI t^0GUvM 


2,92 


437186 


AA338305 


Hs.377816 


R« tTTSIfrHnmn «af^« riVdA PI l%PflR As 
nSitff rO lOai HMiM SapKniS yUdn r mIJOWD 115 




429574 


BE268321 


H3.208912 


Hs.2QS912:hv[]0theticd ms(^ MfiCSfil 


Z91 


438549 


BE386801 


Hs.21858 


91ftM*CArino /nr mMcfalbiAl nmtatnaeA 
ns.A io»}OAoiino yjt cfotBunj proiBiiiaSo 




440246 


W52010 


Hs.191379 


Ha KHQTQ'PCTfl 


9 01 


426924 


BE222542 


Hs.128782 


Ha.19lt7ft9*I^Mn itanlnns rTttSL. PI VII ti19 fk 
nsticflfocti iwiMSopwnsiujnnriJOioic h9 


901 


444193 


Y17B01 


Hs.10574 


Hs.10574solutB carrier family 2, (tadl 


2.91 


422030 


X51416 


Hs.110849 


(locusibik)NM.004451:Komo sapiens estrog 


2.91 


415938 


BE383S07 


Hs.78921 


NM.003488:Homo sapiens A khase (PRKA) a 


Z91 


450167 


AA446404 


Hs^4563 


NM-01 3248;Homo sapiens NTF2-lika export 


291 


408815 


AW957974 


HS25485 


(locuslink)NK4-024599:Komo sapiens hypo&i 


2.91 


414820 


AA371931 


Hs.77422 


Hs.77422:pfot80(ipid protein 2 (cotonic 


2.91 


410013 


AP087173 


Hs^904 


Hs^g04:mago^ash] homoloo. praGfM 


291 


444823 


BE282989 


Hs,12045 


Hs.12045:C2f protein 


291 


422197 


AW974265 


Hs.111632 


H5.111632:L6ni3 protein 


290 


432710 


AAfiogsss 


Hs276872 


NM.005698:Homo sqilens membrane oomponen 


290 


40S203 








290 


432465 


D56165 


H8^5163 


NM.0Q2512:Honio sapiens norHielastaic oe 


290 


412926 


AI878076 


H8.75Q61 


Hs.75061anacraphao8 (nyrfstoytated alanh 


290 
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45S987 

402104 

414814 

442739 

456157 

429505 

430587 

437822 

438543 

426158 

4414S5 

420166 

415674 

409591 

418062 

436540 

426675- 

417018 

421684 

429404 

411030 

413822 

438085 

409132 

440490 

431498 

423570 



L12535 Hs.75651 



014697 

NM.007274 

AVW79153 

AW820035 

N\L003028 

AW450485 

AA810141 

N\L001982 

AJ271671 

AW732276 

BE394784 

AAS32963 

AW630656 

6E397032 

AW084791 

M16038 

BE281591 

NM.005738 

eE387193 



451711 
442643 
447887 
421178 
443329 
416448 
453145 
427775 
424732 
426125 
450273 
407082 
450036 
-457274 
417831 
417824 
426969 
434916 
412664 
414172 
409504 



418462 
442199 
406710 
433435 
415402 
448730 
433027 
449090 
439737 
403912 
423225 
458376 
429211 
452518 
418127 
448469 
426194 
422129 
437651 
415173 
408201 
444758 
423323 
439720 
435550 
425907 
426234 
427640 
433233 
415697 
441321 
430040 
449954 



R52518 

AJ2245^ 

AW513884 

AK001777 

AW838306 

BE382657 

AK000461 

U82756 

AA114O50 

BE%7934 

BE262943 

L13210 

R63438 

R26944 

080001 

X87241 

AW296454 

Z47055 

AA005159 

AW674193 

H16423 

AA0847g8 

AI815206 

AF161383 

AA421404 

AW954324 

AA304g81 

H05430 

BE001596 

BE277633 

AI70B347 

BE545277 

AA164687 

A6032983 

AF191018 

AK001735 

AI751438 

AA852604 

AB023179 

AF052693 

AA2B0722 

6E243982 

AI523875 

T50872 

AU076635 

BE560672 

AW501735 

AK000558 

AU)44878 

A!951628 

AI935202 

A1224456 

AA3657S2 

BE314534 

AF058293 

A6040927 

AI36S603 

H17ie2 

AW503115 

AA641636 



Hs.77393 

HS.B679 

Ks.336881 

Ks.278679 

Hs.244542 

Hs.4437 

Hs.192182 

Hs.199067 

Hs.7854 

HS.S5583 

Hs.78595 

Hs.9100 

H5.83383 

Hs.14468 

Hs.133122 

Hs.80887 

Hs.106768 

Hs.10706 

H8.678g6 

Hs.272044 

Hs,7967 

Hs^732 

Hs.7218 

Hs.258551 

Hs.129819 

Hs.21486 

Hs.26890 

Hs.374973 

Hs.211610 

Hs.102419 

Hs,g234 

H3.79339 

Hs.183454 

Hs.180777 

H&152629 

H8.16e994 

K5.24743 

H3.188489 

Hs;a7152 

H&.82685 

H5.82646 

Hs.100293 

Hs.284207 

Hs.346868 

H8.75790 

Hs.699 

Hs.288433 

Hs.85266 

Hs.372542 

Hs.184014 

H&340959 

H3.177576 

Ks^1894 

H3^9923 

Hs.22983 

Hs.41271 

H5.12S359 

HS.90S9 

Hs.198249 

H3.24758 

Ks.83532 

H3,2001 

Hs,1478 

Hs.13543 

HS.1800S9 

Hs.43654 

Hs.11899 

Hs.127007 

Hs.31181 

Hs.324507 

KS.1S5965 

Hs.168159 

Hs.180015 

H3.301804 

Hs.279698 

Hs.7ni 

Hs227B23 

HS37477 



(loaisfink)NM.01242S:Komo saddens Ras su 

(lKusI!nk}NM.O02004:Honn sapiens fames 
(locusink}NM.007274:Hoino sapiens cyteo 
Hs.336881:EST8 

Nh4-033274:Kanio sapiens a dtebitegrin and 
Ks.244542:Homo sapiens cONA HJ38908 lis 
NM_000991 ton sapiens ribosomal protebi 
Hs.1»182:EST8 

NM.001 982;Honu> sapiens v-ert>^2 eiyttirob 
NM.014437i1omo sapiens solutB carrier fa 
NIULQ1 2339:Homo sapiens transmenriRune 4 s 
NM.002797itonio sapiens protsasome (proso 
Ks.91 00:hypo9ietica) gene supported A 
NM_008406:Homo sapiens peranrodcvdn 4 ( 
NMJt2Q230:Homo sapiens peter pan homolog 
Hs.133122:hypolheilca) protein FU14524 
Hs.80887>^1 Yamagudii sarcoma viral 
NM_0181 20:Homo sapbns hypothetical prot 
NM.005738:Homo sapiens ADP-rtbosyfalion 
0ocusnnk)NM_007346:Homo sapiens opioid 
Hs.272044:ESTs, Weakly similar to hypolh 
Hs.7967:ESTs. Weakly similar to extensin 
NM.005399:HonK) sapiens protein kinase. A 
Hs,7218:acety1-Coenzyme A synthetase 2 ( 
NM_0121 00:Homo sapiens aspartyl andnopep 
NM_018344:Homo sapiens bypothelteai prot 
Hs^1486:signa] transducer and acth/ator 
NM.017829:Homo sapiens cat eye syndroms 
(k)CUsIlnk)NM.004697:Homo sapiens PRP4 p 
NM_001228:Homo sapiens caspasa 8, apopto 
Hs.102419:zinc finger protein 
NM-032635:Homo sapiens seven transmento 
NM_005567:Honio sapiens lectin. galactoSi 
Hs.183454:Homo sapiens cONA FU148B3 lis 
Hs.1 80777:Hcmo sapiens mfVM; cDNA DKRpS 
Ks.152629:iaAA0179 protein 
HS.166994JFAT tumor suppressor horoOtog 1 
Hs.24743:hypothefcal protein RJ20171 

Hs.1B8489:ESTs 

NM.016391:Honio sapiens hypothetteal prot 
H8,82685.'C047 anSgen (RlHalated antlge 
NM.008145:HanK> sapiens DnaJ (Hsp40) homo 
Hs.10D293:Omnked N-acetyiglucosamlne ( 
Hs.284207;hypolhetical protein 6C003515 
NM.006824i1omo sapiens EBNA1 l^inding pro 
(locusfink)NM_002642:Homo sapiens phosph 
Hs.699:peptidytprolyt Isonierase B (cycto 
NM_016522Homo sapiens neurotrinitn (H^(T) 
Hs.85266Jntagrin.beta4 
NM_004879:Homo sapiens etoposlde-lnduced 
H3.1&4014:ril)osomal protein L31 
NM.005726:Konio sapiens Ts lianstaOon d 
Hs,1775763nannosy1 (alpha-l^-glycopro 
H3^894:KIAA1157 protein 
Oocu5tlnk}NMJ}14366:Homo sapiens putati 
NM-020121:Konio sapiens UOP-gluoose ceram 
Ks.41271:Homo sapiens mi^ full length 1 

Nh^006288:Homo sapiens Hiy^l oeH suifae 
Hs.9059:KlAA0962 protein 
NMJ)05268:Homo sapiens gap luncQon prot 
Hs^4758:Homo sapiens cDNA 1^32068 fis. 
(locusnnk)NM.00238ftHamo sapiens mamfara 
R457B2:Ha61» Adutt heart Ctontech Horn 
NM_Q01061:Homo sapbns ttvomtioxane A syn 
NM.000185:Homo sapiens serine (or cyslel 
(k»uslInk]NM.145214:Hbmo sapiens tripar 
Hs.160059:Homo sapiens cONA FU31360 fis 
NM.017B82:Hon)o sapiens cercAMpofusdn 
NM_000859:Homo sapiens S4Tydroxy4fliethy 
NM.003740:Honn sapiens potassium channel 
Ks.31 181 :Homo sapiens cDNA: aJ23230 fis 
Hs^4507diypolheQc8l protein inJ20986 
Hs.15S965:EST6 

Hs.168159:t}ifunctIonal apoptosls regutal 
NM.001 355:Honx} s^^ CMJopadnone taut 
Hs.301804:iCIAA1494 protein 
Hs CT69BJ )KFZW66l1024 protebi 
NM07273M)nn eplens repressor of estr 
NM-014287:Homo sapiens pM5 protein (PM^ 
H3.37477£STs,W^ similar to T46220 
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427022 

410047 

400845 

419501 

418140 

422032 

419768 

436873 

421140 

450126 

439018 

433061 

400278 

407338 

410240 

423880 

422098 

426680 

437672 

456602 

457329 

426437 

412627 

452695 

409531 

448988 

447627 

419846 

412969 

424867 

410600 

451452 

407844 

448752 

419235 

407754 

422282 

414181 

418869 

419444 

430250 

412760 

423013 

449703 

447402 

417896 

422)51 

450507 

419757 

409932 

408044 

430014 

451690 

446950 

444207 

417089 

445985 

425978 

422753 

449051 

450701 

412890 

415752 

427609 

450770 

419594 

450876 

417767 

439968 

426520 

441028 

445033 

418478 

428157 

416178 

435025 

421917 

406621 

408196 

412600 

452806 

451356 

421643 



AW245839 H3.173255 
A11S7810 HSJ79753 



AW843822 

BE613836 

AA476966 

T72104 

AF201931 

AA298741 

BH)18138 

AW300887 

AW068033 

AA773213 

AL157424 

SE278111 

H03117 

AA320160 

AW748265 

AA411607 

AI634860 

6E076537 

BE391959 

AW780199 

BE3a4319 

Y09763 

AF090922 

NM.015977 

AI373162 

At024860 

AW575742 

BE560065 

AW073716 

AA593867 

AW470411 

AA527348 

AFD19225 

AK000476 

AW516565 

NM-002496 

NM_016929 

AW379030 

AWB75443 

H61001 

H&4520 

AA379770 

AW327546 

AL050373 

AA773820 

AJ376750 

BE^6939 

H59354 

AW451469 

AA305800 

AI565004 

H52280 

BE621800 

BE2S3927 

AI928995 

AW961400 



T85247 

BE314524 

AK000436 

AA019924 

AA013051 

AFie9062 

BE242241 

AA224760 

BE545684 

AI333660 

AV652402 

U38945 

A1738719 

AI808527 

T08990 

AB028943 

X57809 

AL034548 

L28824 

AW014549 

AA748418 

BE281170 



H5.199961 

Hs.83551 

Hs.1 10857 

Hs.93194 

Hs.5268 

Hs.102135 

H3^4447 

Ks.26638 

Hs^6422 

Hs.2280 

Hs.91202 

Hs.61289 

Hs.134200 

Hs.111497 

Hs.171811 

Hs.5741 

Ks.118964 

Hs^9682 

Ks.16g895 

Hs,74276 

Hs.30327 

Hs.54702 

HS2278S 

Hs.152738 

Hs.285681 

Hs.75103 

Hs.153591 

HS.3S1676 

Hs.26433 

Hs.8037 

Hs.300842 

Hs.288433 

Hs^88967 

Hs,1 14309 

Hs.7679fl 

Hs.90443 

Hs^83021 

H3,41324 

Hs.22209 

Hs,l71802 

Hs.351327 

Hs.82890 

Hs.111024 

Hs^213 

Hs.63970 

Hs^600 

Hs,42287 

Hs^74303 

Ks.2099gO 

Hs5672 

Hs^74415 

H8.18812 

Hs.29444 

HS24983 

Jte.1575 

Hs^26 

Hs^8467 

Hs.351875 

H3.78778 

H3.179791 

Hs^8803 

Hs^1417 

Hs^5976 

Hs^2542 

Hs.153 

Hs.343566 

Hs.17558 

H8.72901 

Hs,1174 

Hs.198427 

Hs.192822 

H5.4742 

Ks.109445 

K3.181125 

H5.43827 

Hs.74101 

Hsi8373 

H$.33368 

Hs.106357 



NM.004596itono sapleiis smaD oudsar rib 
KsJ79753:Komo sapiens cONA FIJ33176 & 

Hs.199961 fSTs, Weakly slmRar to hypoth 
(locu5lliilONM.002403:Komo saptens microf 
NM_01631Qibmo sapiens potymerasa (RNA) 
Hs33194:apo!ipoprotein A-l 
H3.52683]nc finger, DKHC domain oontatn 
NI\^006280:Homo sapiens signsl sequencer 
(locusiiid()NM_005866:Homo sapiens type I 
NM.031457:Homo sapiens menibfane^spanning 
Oocu8lIn)ONM_025233:Homo sapiens nucteo 
NM.002950:Homo sapiens ribophorin I (RPN 
Hs.91202:Homo sapiens cONA RJ2S946 fis. 
Hs.61289xynaptpianin2 
Hs.134200:DKFZPS64C186 pmtein 
Hs.111497nieuronaI protein 17.3 
Ni\^001625.-Homo sapiens adenylate Idnase 
NhL016230:Komo sapiens flsvohsmoproteln 
NM_017660:Komo sapiens hypothelicai pro! 
(iocusiink)NM_016442:Honto sapiens type 1 
Hs.169695.-ut)iqu{tin-conjuga]ino enzyme E 
Hs.74276:cliloitte intracellular channel 
NM-003668:Homo sapiens mitogen-actlvated 
(^ocuslinl{)N^^007255:Hom{) sapiens xytosy 
NRC021987Jtomo sapiens 9amn»«ninobutyri 
NM.016050:Homo sapiens mHochondrlai rib 
NI\4.032951:Homo.8^ns Williams Beuren s 
NNL003406:Homo sapiens tyrosine S-monoox 
NM_00S787iiomo sapiens Nol56 p. melanog 
Ks.351676:Honx) sapiens cDNA FU25921 lis 
NM-001382:H(xno sapiens dOlichy)-phospl)at 
(locusrink}NM_005723:Homo sapiens (stras 
NM.024d20:Homo sapiens KIAA1608 protein 
NM_016522:Homo sapiens neurotrimin (HNT) 
Hs.288967:Honio sapiens, similar to RIKEN 
(iocusBnk)NM-003661:Homo sapiens apoHp 
NM_016470:Homo sapiens chiomosnro 20 ope 
AA229762:nc49l01/1 NCLOGAP_Pr3Hdmo8a 
NM_002496:Momo sapiens NADH deiiydroeenas 
NM.016929:Homo sapiens ditoride Intraoel 
Hs.41324:ESTs 

Hs.22209Becreted modular cddum^iindln 
Hs.1718Q2:Honio sapiens, done IMAGE:3956 
(IocusIinl()N[lA.017828:Homo sapiens iiypotli 
Hs.82S90-^etender against cell death 1 
(locusIInk)NM.006g84:Komo sapiens solUto 
NAy)1S677:Homo sapiens hypothetical prot 
Hs.63970cESTs 

NN^001283:Komo sapiens adaptor-felated p 
NM_001952:Komo sapiens E2F transcription 
(iocuslink)NA|.14469t:Homo sapiens hypoth 
i^^09990:EST8 

(locuslin)4NK1.030799:Homo sapiens noioi 
Hs^74415:ESTs 

Kls.18612:Homo sapiens cONA: FU21909 lis 
Hs.29444.-putafivo small membrane protein 
Hsi4983:hypolhetical protein kom EUROI 
Hs.1575xmall nuclear ribonudeoproleb 
NM_032339:Homo sapiens hypolheticsl prot 
H3^68467:Homo sapiens cDNA FU12280 fis 
NM_004374:Homo sapiens cytochrome c oxid 
NM.012342:Homo sapiens putative iransmem 
Nk^017817:Homo sapiens RAB20, member RAS 
Hs28603:ESTs 

(locusIink}NI^007027:h{omo sapiens topois 
(iocusIink)l^013384:Homo sapiens UG1 1 
NM»001637:Homo sapiens acyto^aqyl hydro 
NM_000971:Kon» sapiens ribosomal pnit^ 
Hs.343566:KIAA0251pratsln 
Hs.17556:Homo sapiens, done IMAGE-40704 
NM_07B487:Homo sapiens cydln^ependent 
NM«000077:HomD sapiens cydln-dependent 
NM-000189:Homo sapiens hexoklnase 2 {HK2 
NM.030949:Hon» sapfens protein phosphyate 
Hs.4742:GPAA1 P anchor attechment protein 
H5.109445:hypennethyialed inc8ncer2 
Hs.181125:bnmunoglobu]In lambda locus 
NM.006943:Homo sapiens SRY (sex detemdn 
Hs.74101»pteen tyresina Idnase 
Hs.58373:ESTs 

Hs.33368Jiypoth80c8l protein IU11175 
NMj007126:Homo saptens valosirMontabiin 
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423527 


AI206965 


Hs.105881 
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NM.003365 H5.119251 


NM_003365:Homo sajdens ubfqtdnokytix^ 
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9T4 

Z71 


410109 


W42913 


Hs.78089 


MM mA'yi^ 'Untnn Banian b ATDoea fr<MMtn 

Nivi_w4zji .rionio sapiens a i rase, M'*' uansp 


9 T4 

^r1 


4UUZr r 




Hs^ 


Niv^uuz90U.fio{no sapiens noofKigiin i \r\rN 


9 74 
£.71 




AI908774 


Hs.259785 


Hs.25978Sicani!8ne palmitoytlransferase 


9 71 
£.71 


445515 


BE388665 


H5.179999 


ns.i /999i):nonio saptens, cions iMAbc.o40r 


9 74 

Z71 


ill cone 

4109/0 


BE243985 


Hs.80680 


ns.ouooiMn^or vauii pioiein 


9T4 
£.71 


441238 


Ai372555 


Hs.322456 


NM_032uo9:HonK) sapiens nypotneucal prot 


Z71 




ArOS2692 


K&98485 


NM.024009iHonu) sapiens g^P Junction prot 


9 74 


4Z49D0 


AW956282 


Ks.144609 


kiti nOnCCO>LlMmn (•«irwan» ftmSl^rla DItAM 

Niv^UoUo94inonio saptens simJiar 10 kiincn 


O 74 
£.71 


4£l0Uo 


AK0C0157 


HS.108S02 


MM M TCfiQ'Llnma Bonlana Ktmn4haK#>al nmf 

Niv^ui rDoanonK} sapiens nypouteucai pnx 


0 74 
£.71 


ill 0(174 


L3736B 


Hs.75104 


LI* TCmil'DMA Mn>nna nmtain C4 <<a«{na 

ns.7oiU4:KNA DuKiuig prDiein 01, senns- 


Z70 


4iunj 


AW996564 


Ks^50824 


nS.^0UoZ4.n0ni0 Sapisns CUNA. rLiZo4oo II 


O 7/1 

£,7U 


4io092 


AA126856 


Hs.118665 


nS.11BD00.EoT8 


OTA 

Z70 


44 1 USD 


BE539199 


Hs.62112 


/iMMMJtnlrlliM ArMitCTtUnmA B«mtan» vbtM 9 

(locusnniqNiuLviM^oftnonio sapiens znc t 


9 TA 

£.70 


450493 


M93718 


Hs.166373 


ns.l66373:nitnc oxide synuiase 3 (endot 


Z70 


410 f 40 


AW247252 


Hs.76514 


kIM nnn07fVLfnnu* o<«nVirM> m><JaA(>t#4a nkfM>nK 

NM_00U£7(KrU)nK> sapiens nucwosuo pnospn 


Z70 


450747 


AI064821 


Hs.129953 


Hs.1 2%53:Ewlng sarooma biaakpolnt legio 


Z70 


436042 


AF284422 


Hs.119178 


(locusunk)NMJ02024aKonio ssiKens caoon 


Z70 


432981 


NR^002733 H8J136 


ns^oioKprotein KinasQ, AMr-acDvaoo, g 


Z70 


431341 


AA307211 


K3^51531 


N^002789:Hoino saj^ns protsasonie (proso 


Z70 




AA454501 


H3.43666 


N^/L007079tKo^to sapiens protdn fyrosine 


Z70 


416770 


AW163570 


Hs.79768 


NbA.014740:nomo saptens KIAA0111 gens pro 


Z70 


44TC07 


ucocoe 

IW9U9D 


Hs.18747 


NMLWooorinonto sapiens rUr/ (precassmg 


Z7U 


424500 


AR)40704 


Hs.149443 


Oocus!tnJgN^1.007(l22:^o^H} salens putao 


Z69 


414£or 


BE536554 


H8.278270 


Ks.278270.'unacffvs pfogestsrana receptor 


9 CO 

£.69 


4UUZo1 




H9.169476 


kill nntn/iCtiir^wmm^ •>o»t<«Mn Mk><><«MUMk>M4A o 

NM„002iMo:nonK) sap)^ grycefanenyaeo- 


Z69 


4tt1«U9 


NM.001533 K5.2730 


H8.273(^ tietefog enaous nudaar ribont^eo 


9CO 


444llo . 


AA458542 


Hs.10a26 


NK4JD07263!Hoino sapiens coatomer protein 


9 ea 

£.09 


AOMtnU 
4Z4QU0 


XB0695 


H$.161134 


ns.i9iio4X)xnas6 (cyiocnromacj sssenuN 


9 CO 
£.09 


410040 


AA224827 




A A00i(B07»H«Mnnil b4 M^ /V*AD D«0 Umiam bb 

AAZ44o2#:ncj2gu4.s 1 NU_UaAr jrfZ nomo sa 


O CO 
£.09 


440002 


AW769844 


Hs.1 11222 


ns.1 11222:nypouieoc8] protein rU22o75 


2.69 


449957 


D31365 


Hs.24220 


(locusunx)NMj016479:nomo sapiens sooon 


Z69 




U37689 


^te.3128 


nil nAC090iLf«»w> Jin^tnnn i.j Jljiuu L .j uij L fDIIAt 

NM.(AjDZo2Honio sap^ns poiynierase (KNA) 


Z69 


400306 


AL080080 


Hs.24765 


nil nnmCCijj»-... II i tnnn 1''- - »- -»- -» « 

NM_03u7oo:n(xno sapiens Uitoradoxln domai 


Z69 


429544 


BE299343 


Hs^430 


NVL00S997iteo sa|]iens tiBisciipQon fac 


Z68 


428582 


BE336699 


Hs.185055 


Hs.185055:GENE protein 


Z68 


445139 


AB037848 


Hs.12385 


Hs.12365:8ynaptotagmin Xili 


Z68 


453905 


NM.002314 KsJ6566 


NM_016735:HQmo sapiens UM domain kinase 


Z68 


41^83 


6E387036 


H8.1211 


MM nAlfil 1*UAmn eontone nhnenhofoeo 

niv^w 10 1 unwinu sopions ouu pnudprmuAv 


£.00 


420957 


X96743 


Hs.100555 


Hs.10Q555:0EAO/H (Asp^Bhi-Aia^Aspmis) b 


Z68 


418187 


NM.004604 Ks.83734 


NMj0046O4:HonK> sapiens syntaxln 4A (plac 


Z68 


409533 


AW969543 


Hs.144609 


NM-080652:Honio sapiens simflar to RIKEN 


Z68 


433184 


AA147979 


Ks^BSOOS 


NM.020243:Homo sapiens transtocase of ou 


Z68 


455303 


AW892049 




BE066891:PM34T0338-211299«02-e12 BT033 


Z68 


452600 


A1910842 


Hs.103381 


Hs.103381'£ST8, Weakly siniar to hypoth 


Z68 


415410 


AF037332 


Hs.278569 


NM-014748:Hon» sapiens KIAA0064 gene pro 


Z67 


426432 


AF001601 


H5.159857 


NM„000305:Hoino sapiens paraoxonase 2 (PO 


Z67 


435049 


AL122067 


Hs.4746 


NI\|.021941:Komo saiaens liypdhetlcal prot 


Z67 


450528 


NM.014072 Hs.25063 


NM-031268:Komo saptens PRO0461 protein ( 


Z67 


433339 


AF019226 


Ks.8038 


Nli4-004283itonw sapiens RAB3D. nember RAS 


Z67 


408783 


AF192522 


H9.47701 


NM.013389Msno sapieRS NPC1 (Niemann^nc 


Z67 
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451738 
42ni6 
436319 
415116 
425B38 
41B708 
410165 
410134 



BE237567 

L3fi951 

H90727 

AA160383 

NM.014071 

U73524 

BE56Q22B 

U88140 



H8^047 

H3.160446 

H8.5123 



Hs^047:hypott»0csl proton nJ20392 
K3.160448:fcaiyopheitn ^mpoifin) beta 1 
Hs.S123:hypoQietic8l protein BC008246 



42S910 
427954 
439971 
436449 
435906 
433387 
447191 
444099 
417821 
416S29 
426025 
420187 
408150 
444395 
431222 
408790 
440708 
416526 
413995 
424908 
442110 
452882 
406862 
451295 
448428 
426611 
426216 
407223 
427725 
420157 
426471 
451544 
413245 
415020 
437193 
418684 
410868 
436183 
441226 
432788 
432746 
450377 
434633 
424707 
427600 
446522 
436906 
410701 
410182 
408716 
430308 
431074 
412867 
440524 



AAB30797 

J03060 

W32474 

AK001333 

A1688379 

L76526 

NM-014521 

087432 

BE245149 

AW005695 

AW138330 

AKD01714 

BE620274 

N66148 

X56777 

AA293303 

AF038962 

H61082 

BE048146 

AW513963 

AF113008 

AWg72990 

AWISQ807 

AI557212 

AF282874 

BE178050 

N77630 



U66839 

AAfi57991 

X5734B 

AK000429 

BE244334 

6E249915 

BE259190 

U82987 

BE379794 

AI146327 

BE563042 

AA521091 

AA564512 

AB033091 

A1189587 

BE061914 

AW630918 

NM.003876 



422672 
447528 
408774 



435311 



410678 
414839 
443217 
448749 
450009 
407687 
442232 
419625 
416114 
439437 
408452 
443142 
426152 



AF1986a) 

NM.001983 

AW148546 

BE540B65 

BE072772 

AU076861 

R71264 

AW161481 

X12784 

A1612027 

AW518383 

AW748482 

W86610 

AW578252 

BE540516 

X63692 

NML001545 

AW859679 

AI399947 

AKDOaOll 

AI357813 

U91616 

A]e95549 

AI207768 

AA054683 

Af696513 



H5.159613 

Hs^465 

H3.71669 

Hs^927 

HS.23782S 

Ks.184760 

Hs^47551 

HU01746 

Hs.6216 

Hs.110798 

Hs3260 

Hs.17667 

Hs.10315 

Hs.82643 

Hs.250897 

Hs.17558 

Hs,95744 

Hs.43112 

Hs,11125 

Hs^73790 

H5^56342 

Hs.7381 

Hs.14743 

Hs,75671 

Hs.39143 

Hs^102 

H8.1S6270 

H8^6262 

HS.17132 

Hs^1201 

Hs.171271 

Hs.13895 

Hs.180533 

H3.12310S 

Hs.184510 

Hs^O 

H8.75249 

Hs.293533 

Hs^8g721 

Hs.87246 

Hs.159651 

Hs.334802 

Hs.118820 

Hs.178499 

Hs.372775 

H3.355925 

Hs.120915 

Ks.10844 

Hs,179774 

Hs.15ig8 

Hs.181244 

Hs.10283 

Hs^9544 

H&169476 

Hs.238990 

Hs,8997 

Hs.74637 

Hs.16798 

H8.111577 

Hs.119129 

Hs.76277 

Hs.177592 

Hs.77873 

Ks.185738 

Hs.190161 

Hs.378825 

Hs.77462 

Hs.9078 

Hs.21902 

Hs.166468 

Hs.37558 

Hs^7460 

Hs.ie2B85 

Hs.183888 

Ks.343628 

Hs.222728 

Hs.10a705 

Hs.t67246 



NM.014071:Honio sapiens nudear reoeplpr 
NM.005831:Komo sapiens ATP/GTP-binding p 
NM.013258:Homo sapiens apoptosis^associa 
(lKusBnk}NM.002533:Homo sapiens nudea 
Hs.23782Sx]gn8l reoograCon partide 72 
NM_005760:Homo sapiens CCAAT-ixjx-Wndlng 
N^1002455:Homo sapiens metaxin 1 (MTX1), 
Hs.301746:Honx> sapiens cONA FU37267 fe 
Ks.6216:DnaJ (Hsp40} homolGg, subMy 
(Iocusltnk)NM_0201S0:Homo sapiens SARI p 
NM.000021:Homo sap»ns preseniEn 1 (Alz 
()ocusfink)NH.014521:Komo sapiens SKMo 
NM.003983:Hoino sapiens solute canler b 
NM.002B22:Homo sapiens protein tyrosine 
Hs.250897:TW^fused8eno 
Hs.17658:Homo sapiens, done I1IV^GE:40704 
NI\^01902&Honio sapiens hypoHi^cal prol 
Hs.43 1 1 2:Homo sai^ mRNA: cDNA OKFZp43 
(tocu5nnk)NM.014041:Hofno sapiens stgna) 
NM.007155:HonK) sa;^ zona peibdda gl 
Hs.356342:EST8, Highly simSar to 21 1320 
H3.7381nrailaoMtependeiit anion channel 
Hs.14743:EST8 

NM.004603:Homo sapiens synlaxin 1A (brai 
H3^143:hypolhefic8l prolan hrtGC13125 
NM.001023:Honu> saf^ns ribosomal protein 
NM.030760:Homo sapiens fdato tran8|)orte 
H^3S6262£STs. HigtOy similar to A31233 
Hs.17132:ESTs 

NM_015480:Homo sapiens nedin 3(DKFZP56 
NM.001904:Komo sapiens catenin (cadheiin 
Hs.1389S:Hon» sapiens cDNA aJ11654 0s. 
H96850riv03b12^1 Scares melanocyte 2NbH 
NM_002756:Komo sapiens mHogetvacQvatsd 
Ks.123106:ESTs 
Hs.184510:stratifln 

NM.017814:Homo sapiens iiypotheQcal prot 
Hs.75249-ADP-ribosyiation factor^ilce 6 
Hs^93533:Honio sapiens cDNA FU37093 lis 
Hs^89721 :growth arrest^pedfc 5 
NM_014417:Homo sapiens BCL2 binding comp 
NM«014452J1omo sapiens tumor necrosis fa 
(locuslin)()NM.024718:Homo sapiens hypotii 
Hs.118820*JiypotheticaI protein BC007882 
H5.178499:HSPC063 protein 
H5^72775:Homo sapiens, done tMAGE:3946 
Ks;35S925:KlAA1265 protein 
Hs.120915:EST8 

Hs.10844:l8Udnfrifdi alpha-Z«lyooprole 
Ks.179774:proteasome (prosome, macropdn 
NM.003876:Homo sapiens putaOve receptor 
Hs.1812443n£jorhlstocompai)bUltycon:vi 
NhL00S105:Homo sapiens RNA binding mofif 
NMJX)196%HQmo sapiens eoxision repair c 
Ks.169476:giyoer8ldehyde-3i)lKsphate deh 
NM_004064:Homo sapiens cydlivdependent 
Hs.8997:Sad1 un&64 dom^ protein 1 
H5.74637:testls enhanced gene traiscript 
Ks.16798:(fomQ sapiens mRNA; cDNA DKFZp56 
(locusnnk)NM.O30926:Homo sapiens int^r 
NK.001845:HonK> sapiens coSagen, type IV 
NK^138393:Homo sapiens hypc^Dcd pro) 
Hs.177592uibosoma) protein, large, PI 
H5.77673:B7honiolog3 
Hs.185736:ESTs 
Hs.190161:LR8 protein 
H3.376825;Homo sapiens cONA FU37650 lis 
(locuslink)NKL001379:Homo sapiens DNA {c 
Hs.907&irfimature colon carcinoma transcr 
Hs.21902:Homo sapiens ctono 25237 mRNA s 
Hs.166486:Homo sapiens cONA FU1 1432 lis 
NM-0ie339:HomD sapiens hypoOieOcd prrt 
Hs.337460f STs. Highly sWIar to HYEP_H 
NM.004556:Homo sapiens nuclear foctor of 
Hs.183868:gtucuronidasd, beta 
Hs.343628xlalyttr8nsf5rase 4B (beta^al 
Hs^28:Homo sapiens cDNA FU39004 fis 
Hs.108705:pro(eln phosphatase 2 (famterl 
HS.167246P4S0 (cytodnxne) oiIdQreduda 
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Z67 
Z67 
Z67 
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Z66 
2L66 
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2.66 
Z66 
Z66 
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265 
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264 
264 
264 
264 
264 
264 
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419667 


AU077005 




MM OQ3S1S!Hamosao{ens adlsintefrti avi 


2.61 


4)5072 


BE2S3S87 




H!L77fl7&hvno{hfifleal oena MGC19S9S 

■ Kkf f Of W>IIJ|WUn«UwCU IfCIW WVJW 


Z61 


406670 


W79632 


Hs.256301 


Hs.256301 ihypoOtsOcd proteh) MGC13170 


2^1 


403339 








2.61 


419579 


W49529 


Hs«296200 


NM.02394ftHonio ssidsns hypotheficsl prol 


2.61 


437202 


AA326110 


Hs^4481 


HsJ74481 £STs, WeaUy sbiflar b T34549 


261 


414020 


MM 009984 


Hs 7fi703 


NM flOSfiftdrHttno Bafdens sfiifl]! tnAxihla e 


261 


421295 


AW081061 


Hs.103180 


H3.103180:DC2 protein 


2,61 


446468 


AB037782 


Ks.15119 


Hs.15119dCIAA1361 protein 


2.61 


442504 


BE503373 


HsM4335 


NM 099dAil'Hflnio finolnns hvnattrpfleal nml 


2,60 


448204 


AM75124 


Hs.170561 


Hs.170561 :ESTs 


2.60 


449175 


AJ005892 


Hs.23170 


^0cusSn)QN1^012280iH(Mnos8pl6ns RsJ h 


2,60 


411201 


T74588 


Hs.8509 


Hs-BSOQ-ESTs WaaMvs1mHartDC3HUcom 


2.60 


424805 


AF230904 


Hs.153260 


NM_031892iHonK> GSftens 8H3-{Jonidn klrwsft 


2,60 


425421 


L1 1669 


Hs 157145 




2.60 


422739 


H20106 


Hs.1195ai 


flnffiwBnlr^^ 0(M069*Hamo saotens sdaidD 


2,60 


450858 


CI 8458 


Hs.25597 


Hfi.25597'fitnnoalion flf vpfv (nnn fdisin f 


2,59 


443195 


BE14fi23S 


Hs1d3063 

rw« i9ww\M 


Hs 1 930fi3:Hfimo saDians cDNA FL114201 fts 


2.59 


430504 


H52761 


Hs.44095 


Hs44095xvcOn M3 


2.59 


439576 


AW263124 


Hs.350547 


NM 094fifi5'HDmo <wV^ns nudear m»ntnr 


2,59 


416041 


AA345547 




flnntcIlnkMylM n9dR47*Hnrnn cantAm niwJnn 

\IOWi5<llllV/WlV^K^*>0*< 1 .rMlIU OOfwl l5 UtMW , 


2.59 


451920 


AA224483 


Hs.27239 


Hs 2793Ch7fnr Gnoer DHHCdom^ mnlai 


2.59 


414163 




Hs 7«r7fl9 


KIU 001(^91 'Mnmn eanlomi nonarsi tranerrin 


2.59 


422140 


BE295S1B 


Hs.112193 


flfKusIlnV^NM 09S9S9*HDma <uinlprK ehmma 
iiy inv^_vfc«/(t.wv« t iw 1 w «M>^icii« viuvittv 


2.59 


452817 


AA322859 


Hs 284275 


Hs^84275-r>21 fCDKNIAV-asiivated tdn^ 


2.59 


413353 


AWM3S42 




Hft 7'>3nQ'AtiknfvnKr francl^nn tAnnns/l\ 


2.59 


421700 






MM nififtdi'Mnmn csnlorra monihfnno tntprarf 

IXIV^V lUUt ■•TtUMIU 90^46119 IlidlHMaiiO BUBIOW 


Z59 


410801 


BE275469 


Hs 66493 


NM 01fi<l3Q!Hamo fiantftns Dovun fivnrirQiTi& eri 


259 


440511 


AF1 32959 


Hs.7236 


NM_015953iHonK) ssplsns 6N0S intcrscfing 


259 


407887 


AAST^fiS 

/VWf 9vwD 


Hs.41072 


(locusrink}NNL004568:Homo sapiens serine 


259 


425356 






NM nn'v'^ZI'Hornn ennionft tnA<tit<il mlvntme 

IXn^UU^M*? KnUIIUI OapiOllfl UlUdlUII |WIJ^II\I9 


259 


408102 


U46351 


H3.621 


Hs BSIHefiBn oalactositilvhlndinn sohi 


259 


417952 


AI19283B 




Hs 17ai3&riud.fin«jfk^ fvimlnAJYL 


259 


433053 


BE3019Q8 


Hs^79952 


NM 015917:HmiosaDlBfisahitaQiloftB84ra 


259 


45093S 


BfS1d743 

DC*J Itltw 


Hq ^7q(l')Q 


KIM nn'tft'ti'Vlnffin cnnlfHio himnr minnmQcnr 


259 


417891 


W79410 


Hs.82887 


(tocusrink)NM.021 959iHoino Sdjnens pfotei 


2.59 


438364 


fvWvUOw 




NM n9ftAA1<Unnin snnfeno mmnin ivHtn^l 
niv^v«irrf itrKniiu e^wno vvn/nuii acuirtii 


9<n 


430976 


AAS05112 


1^82990 


NM 033SSfr Homo Banfana chmtnoaonifl 20 aim 


258 


444838 


AV6518B0 


H&2Q855B 




258 


416435 


A)431301 




Hs^4897'Hanm saoieis cDNA FLI38B74 fis 


258 


415444 


BE247295 


K3.78452 


Ks.784525oIutB carrier fondly 20 (phosp 


258 






Us 




9<n 


400541 








444309 


U83236 


Hs 10803 


Hs.inflfi9*Rflkjuni nivl Inlpnrin hifwUnd 1 


258 


416116 


H51847 


Hs 99858 


Hs Q^ftffl'rihASflmfll nm!(»tn L7a 


258 


418629 




ns.oooo9 




258 


432996 


fir iuiW£%/ 


He 97QQni 




257 


426781 




Ha 179907 


flnfiKlinb^NM Rfl73R3*Hftntn aariAiis non-PO 


257 


452636 




Hq 14(>97Q 


H« 1^'i97Q'RPT Irnnslrv*nf1nn rmvplnlH Ifiti 


257 


406851 


AAfi097fi4 


H«; •iS7i99 


He ^^93Q9'in9)fir htfitooomnafihiritv nvrml 


257 


447674 




H5.19192 




257 


445647 


AV654627 


He 971flflfl 


Us 9718flfrH(mn satins cHNA FUSSOIfl fis 


257 


444736 


AA533491 


Hs.23317 


NM tQTtOA'MniMi saHhwis hvnolliPlieal nmt 


257 


402661 








257 


450069 






Hs^0M93:EST8 


257 


414029 




H<t7'i7nQ 

n9<f 9#U9 


NM nn9^'>*UnfTin snnhim mannmRUUnhnsiditi 


257 


427700 


AA9R99Qd 




NM fVII Q^R'Unfnn ennkms ritifll snnrHirifv 
riiv^w i9Ho.nuiiiu tM^Jiciio uuai opwiHviiy 


257 


449961 




Hfi 133100 


H<j mi00"P<?T«; 


256 


449378 


AW6S4096 


Hs 59899 


Hs S9892!ESTs. Ufeakiv slmfla' to aioha fi 


256 




APflfTlUn? 
nTUflMMf 




|iocusiinK}iw|„uui}ooj.nDnio sapions rainra 




448833 


AA311428 


Hs.21635 


MM nniHTn'Unnm aonlAno ttihiltfi waitwwg 1 

nw^uu iUf Mnuino sopions uiijuiiiii ganwiid i 


256 


416078 


AL034349 


Hs.79005 


NM nft9IU^*Hnfnn RflntPfis nmti*iii h/mdnn 


256 


428044 


AA093322 


Ks.301404 


NM nnfiT^'^UniTmeantAns RNA hhufinn mnf!f 


256 


451564 


AU076698 


Hs.132760 


\HJvU9iiiii^jwF^uv itof <mjiiiu ooi^ivm ywww 


256 


457601 


AF041429 


Hs^265 




256 




AA313607 


Ks.58633 


UsfUVvl't-Uninneflrtfmwi'nMA^n 1991 tffk 




419587 


S62907 


Hs.91343 


"*»v-W WOU f JVM i (0 oSptcna goMiillcrtiiiiinOwljn 


9(i<; 

£.99 




BE250564 


Hs^83655 


H« 9ft.%(K'tv«ftnhAenhr^m9«A 11 


9 

««9a 




A1917771 


Hs.61790 


(lucusuni^tHiVLV^'TOuo.rHMiiu oapiena uiipvn 


9 IK 
4.99 


423720 


AUM4191 


Haui3388 


nNLMwwBi r Jwiinj sapiens nypouicucaj pioi 




400237 




Ks.83347 


NMJ001087;Hoino sapiens anQhyassodated, 


255 


420856 


6E513294 


Hs.205736 


Hs.205736:KIAA197e protein 


255 


421541 


Nh/L003942 Hs.105584 


H5.1055643ibosom3t protdn S6 Idnase, 0 


255 


434848 


BE2S6304 


Hs.32148 


NM.018445:Hon]o sapiens AI>01 5 protein (L 


255 


424468 


AK000413 


H8.148227 


(loaisDnlQNM.017806:HQmo sapiens hypoth 


255 


449089 


078850 


Hs.250465 


Hs.2S046S:Homo saptens mRNA; cONA DKFZp4 


255 


430053 


AF052155 


Hs.227949 


NM-030673->1amo sapiens SEC134ike 1 (S. . 


255 


437469 


AW753112 


H3.15514 


Hs.15514:hypothetic8j protein MGC3260 


265 


407755 


A1151353 


Hs.29742 


Hs^742Hoino sapiens cDNA nJ32147 fis. 


255 


446673 


NM-016361 HS.1S871 


NM.016361iiomo sapiens LPAP for tysophos 


255 


411766 


AA39g671 


Hs.71969 


Hs.71669:H0mo sapiens mRNA; cONA OKFZp66 


255 


415198 


AW00948Q 


K8.943 


Hs.943aiaIuT8l URsr oeD transcript 4 


255 


436495 


BE2S8948 


Hs.290874 


Hs^0874:Ho(nD sapiens, done M6C:31984 


255 
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417765 X^12 Hs£2S68 NM.000784:Ho(TO sapiens cytrehiO(n8P4S0. Z55 

443358 H55417 Hs.17757 (toaisHnk)NM_021622:Homosap{8nsptecks 255 

452349 AB028944 H3,29189 Hs^imiPase. Class VI. type 11A 2^ 

- 427721 AI582843 Hs.180455 NM.00S053:Homo sapiens RA023hQmoloo A ( 2^ 

5 407559 AA313352 Hs.260858 H3.260858:Homo sapiens CONAFU32370 6s 254 

413426 U88837 Hs.75354 Hs.75354K;CN1 general amtrol of andntx 254 

425465 L18S64 Ks.1904 Hs.1904:prot8]nldnasa(;{ota 254 

444152 AI125594 Hs.149305 Ks.149305:hypo!hsScdprats!nMGC2603 254 

451620 AW0SS357 Hs.199248 NM.000958:Homo sapiens pfOstaglandlnEr 254 

10 441356 BE384361 Ks.182885 (locusrin)ONM.004556:Homo sapiens nudea 254 

444410 BE387360 Hs^718 Hs^719:Koniosapbns.sfn)Iiar(otfataa 254 

415200 AL040328 Hs.78202 NM.003072Hon» sapiens SWUSNF related. Z54 

403955 254 

430361 AI033965 Ks.23m Hs.239926xtero}-C4^thyl oxidase^ 254 

IS 432401 NIUL013330 Hsi74479 KML013330:HoniosaptensNM£7(NME7).mRNA 2.54 

446719 W39500 Hs501872 Hs501872dvpolhetical protein M6C4840 Z54 

439941 A1392640 Hs.18272 »a4.030674:Homo sapiens sdute carrier fa 254 

436685 W28651 Hs5288 Hs.5288:HDmo sapiens mRNAicDNADKFZp434 Z54 

424522 AL134847 Hs.149957 Hs.149g57:f{l)osomal protein 86 Unase. 9 254 

20 442904 AW575008 Hs.11355 Hs.11355.1hyntopoie6n 154 

422605 H16646 Hs.1ie666 Hs.118686:hn»1h8licalprotdnPP591 254 

442069 AW664144 H8J297007 Hs.297007:Homo sapiens C0NAFU32174 lis 254 

447362 AW176120 H&S061 NM.024099:Homo sapiens ttypoUieGcalprQl 253 

_^ 416305 AU075628 Hs.79187 NM_001338:Homo6a;rfenscoxsacH©vinisa 253 

25 422624 BE616678 Hs.76152 NI\L006854:Honv> sapiens KDEl(Ly»^p<3Iu 253 

447298 BE617527 Hs.239618 NM.006219:Honio sapiens phosphoinoSitida- 2.53 

412833 AW960547 H5.298262 Hs.298262:rit)osomal protein 819 253 

404854 2.53 

415761 AA132666 Hs.78802 ()ocusIink)N^^002093:HQmo sapiens glycog Z53 

30 431104 AW970859 Hs.313503 Hs.313503:EST8 253 

439180 AI393742 Hs,199067 Hs.1990B7:v-€rl>*2eiythrob!asilcteul<em 2.53 

424250 AF073310 Hs.143648 NM.003749:Homa sapiens insulin receptor 153 

452876 AWQ81128 Hs.246374 Hs.246374:Hon» sapiens cDNAFU31 250 fis Z53 

415742 8E410243 Hs.78769 NM-003249:Homo sapiens IMmetoBgopepfi 2.53 

35 404140 253 

407255 AA012992 HsuI56301 Hs.256301:hypoth8Qcal protein MGC13170 253 

422509 AA258513 Hs.117865 H8.117865:8otut0 carrier family 17 (anio 253 

434866 AW002565 HS.3S5460 Hs555460:Komo sapiens cDNA:FU21 76311 253 

429743 AA804398 Hs.288995 {lqcusl!nk}NM_017961:Homo sapiens hypoth 253 

40 433047 IU186135 Hs.279946 NM-004990:Honio sapiens methiontne-tRNA 8 153 

418945 BE246762 Hs.89499 Hs.69499:arac»donate5^poxygenase 252 

445926 AF054284 Hs.334826 NM.012433:Honu) sapiens splicing factor 3 252 

411353 BE383533 Hs^9784 Hs.279784:pn)lacfinfesutatoiy element b 2.52 

446252 BE622791 Hs.12199 NM.030577:Homo sapiens hypoOieGcalprol 252 

45 447365 BE383876 Hs334 (locustink)NI^005435:Ho(no sapiens Rhogu 252 

414844 AA298874 Hs.77494 NM.08a916:Homo sapiens deoxyguanosine Id 252 

444025 AA578364 Hs.34g093 NM.015945:Homo sapiens ovarian cancer ov 252 

416149 AA311965 H8.79058 NM.Q03168ilomo sapiens suppressor of Ty 2.52 

^_ 418741 H83265 Hs.8881 Hs5881:Homo8apiensGONAFU32163fis, 252 

50 437952 D63209 Hs5944 NI^014585:Homo sapiens solute carrier fa 252 

445625 BE246743 Hs.353iei (locusrink)NtyL025092:Homo sapiens hypoih 2.52 

431555 AF161470 Ks.260622 Ks.260622±utyr8ld-induced transcript 1 252 

410179 W27723 H559498 {locusHnk)NM.003718:Homo sapiens ceBd 2.52 

431476 BE612705 K$.256697 (locusnnk}NM.005340:Homo sapiens lOsOd 252 

55 406672 M26041 H3.198253 (bcus6nV)NM_002122:Homo sapiens major 252 

418160 BE618087 Ks53724 H5.83724:hypothetlca! protein MGC5466 Z52 

428248 Ai126772 Hs^0479 H3.40479:Homo sapiens cDNA FU25802 Ss, 162 

419935 AB020980 Hs.93832 Hs.93832:putativa membrane protein 2.52 

446143 BE245342 Hs.308079 NhL013336:Homosa{den3 protein transport 252 

60 426691 NM_006201 H3.171834 (iocusIInl()NM.006201:Homo sapiens PCTAIR 151 

406124 U89337 Hs.42853 NM.004381:Homo sapiens cAMP responsive e 151 

456266 1^73 Hs.198726 NM.003651:Homo sapiens cold shock dom^ 151 

428921 Z43809 Hs.194638 Ks.194638:polymer3se(RNA}ll(DNAdtrec 161 

^_ 414721 X90392 Hs.77091 NfL006730:Homo sapiens deoxyribonudeaso 151 

65 422607 Z45471 Hs.l 18684 NM_006923:HomoGai}lens stromal ceiMJeri 151 

421846 AA017707 H3.1432 NM-002743:Komo sapiens protein kinase C 151 

414874 D26351 K5.77515 N]\t002224:Homo sapiens inositol 1.4,54r 151 

432956 AL037695 Hs.279861 NM-015959:Homo sapiens CGI^I protebi(L 151 

438393 AA351815 Ks50740 Ks.50740:H(mv> sapiens mRNA;cDNADKFZp76 151 

70 418360 AW296974 Hs54264 N^006401:Homo sapiens addle (leucines 150 

401061 ISO 

426559 AB001914 Hs.170414 Hs.170414:pa!rod bask: amino add deavl 150 

412204 AI125507 H3^4g37 Ks^4937:transfi)ni^-2 alpha (htfa-2 alp 150 

448950 AF2B8687 Hs.9275 NM.020410i{omo sapiens CG1-152 protein ( 150 

75 409936 AK001691 Hs57655 (k)cusDnk)NiyL018234:Homo sapiens duduD ISO 

414675 R7901S Hs.288968 Hs.28896B:RAS22\ member RAS oncogene fia 150 

409983 050922 Hs57729 (tocusank)NlyL012289:Homo sapiens Kek^h- 150 

450914 AI743761 H3.142528 Hs,142528;ESTs ISO 

444630 Ai753230 H5.323562 (locus8nk)NM.032121:Honu) sapiens hypoth 150 

80 401353 150 

441680 AW444598 Hs.7940 (locuslM4NM.021 1S9:Horm sapiens RAP1. 160 

406860 AA876469 AA87646e»e48b04j1 Na.0GAPJ>l[2SHomo8 ISO 

449163 AW161356 Hs23119 WM,003492i to mo sapien s Chromosome X open 150 
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432975 AA331517 Hs.286055 Hsi8605SxHmeHn (chbnaerin) 2 Z50 
430600 AW950987 Hs^4348 NllC004639:Hoinos^iensKlA«assxlatsd 2.50 
407584 WK945 Ks.8173 Hs^173:hypotheiical protein FU10803 ^50 

TABLE 9B 

Pksy: Uniqus Eos probesst {denSfisr mnriber 
CAT number Gem duster number 
AccesdoK Genbank accession mvnbeis 

Pkey CAT Number Accession 

406685 0.0 M18728 

452098 161393 1 BG028348BF772844K83066AW817969 H30985BF7550a9AI856183 

451129 1495511 J 6E072881 A)762181 BED72946 

459306 223120,-4 AW578452 

448489 2189115J R45782 R45781 

418869 12789J4 AA229762 AA230035 

418546 242838J TS9708AA224827T598436E1S6903 

455303 1152492.1 BE066891 BE066895AW892049BE068897BE903884 

406860 0.0 AA876469 

TABLE 9C 

Pkey: Unique number corresponding to an Eos piobesel 

Ref: Sequence source. The 7 digit numfaeis in this column are Genbank (denHfier (GQ numbefs. Dunham I. et a!.* refers to the pubTicaikm entitled The ONA 

sequence of human chromosome 2Z' Dunham L el aL. Nature (1999) 402:489^95. 
Strand: IndcatesDNA strand from which exons were predicted. 
NLpcsiSon: Indtoates nucleotide positions of predated exons. 



Pkey 


Ref 


Strand 


Ntj>ostt!on 


408399 


9256288 


Minus 


63448.63564 


403220 


7630969 


Plus 


6433&«4517 


403218 


7630969 


Plus 


S8039-58149 


403221 


7630969 


Plus 


68294-66438.6693&67124 


403219 


7630989 


Plus 


61858-61995 


403739 


7630882 


Plus 


4456344766.4820&48483^2255-52495 


405484 


5922025 


Pkjs 


199214-19957%ig9672-19992a200262-20049 


405556 


1552511 


Plus 


163497.163623.164715-16496B.1653G9-165S0 


400529 


9796988 


Plus 


138232-138423 


404826 


6572184 


Pius 


47726-48046 


400750 


8119067 


Rus 


19899M99168,199316-199548 


400847 


9188606 


Plus 


4464344835 


400448 


9887687 


Minus 


177372-177674 


402829 


8918414 


Pius 


101532-101852.102008-102263 


400846 


9188605 


Plus 


3931(W9474 


401179 


9438647 


Plus 


113477-113893 


404240 


5002624 


Minus 


116132-116407,116653-116922 


402260 


3399665 


Minus 


113765-113910,115653.115765,116808-11694 


406383 


9256114 


Pius 


14403.14602.17000.17147,17241-17368 


405203 


7230116 


Plus 


125295-125463 


402104 


8119072 


Plus 


1224O9.122600 


403912 


7710730 


Mbius 


72000-72290.72431-72700.7292S-73199 


400845 


9188605 


Phis 


34428-34612 


403217 


7830969 


Pius 


54089-54163.55427.55623 


403483 


9966188 


Minus 


144546-144854 


400509 


9796539 


Minus 


157909-158430 


403399 


6684178 


Plus 


61841-62145,62357-62756 


400541 


7574902 


Plus 


126235-126380,126478-126597 


402861 


2814366 


Minus 


14933-15231,15387-15627 


403955 


7770475 


Minus 


54527-54740 


4048M 


7143420 


Pkis 


14260.14537 


404140 


9843520 


Plus 


3n61.38147 


401061 


3242744 


Minus 


99468-99549.100707-100848,100918-101 107. 


401353 


9931296 


Minus 


50831-51352 



Table 10A Usts about 1.103 genes up-regulated In oCton cancer compared to nomrat adult tissues exduding notHnangnanl coiOn Gssues Owbole colon and coton epiihefium). These 
were selectBd from the star£ig coDectton d59680 pnbesets on Ih^ 

was greater than or eqtnl to 2.5. the 'average* colon level was set to the 90th peroenblevahie amongst oolon p ri maiy cancer specimens and ookxi Over derived metatoses. the 
'average' normal adutt tissue level was set to the 85lh percentile vahje amongst non-maSgnant tissues mbnis the eolortte derived samples, the "averager oOton value was greater than 
or equal to 50 units. In order to remove gene^pedfic background levels of non-8pe(^hybrtdiz8lion. the 1^ 
subtracted from both the numerator and the denominator before the raOo was evaluated. 

TABLE 1QA: 1.103 genes up^latod In odon cancer compared to normal adidt tees excluding non^nalignant colon tissues (whole colon andcokm epHhethni) 

Pkey: Unique Eos probesetUenGfier number 

ExAccn: Exemplar AcccKSSlon number, Genbank areesston number 

UnigottlD: Unigene number 

Umiene TUe: Unigene gene 6fte 

R1: RaSooftumortononnaladunttssues 
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5 

10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



Pkey 


ExAccn 


UnigenelD 


UnigenaTHlo 


R1 


436749 


AA5848gO 


Hs.5302 


NM-006149:Honio sapiens lectin, galactosi 


15.54 


428934 


AFD39401 


Hs.194659 


NM_001285:Homo sapiens chbride channal, 


14.52 


446787 


U67167 


Hs.315 


NM.002457:Hqnio safdens mudn 2, IntesGn 


14.04 


431912 


At660552 


Hs^183 


Hs^183:ESTs, WeaMy simlar to S384J1 


14.(X2 


423541 


AA298922 


Hs.129778 


NM_014471 :Homo sapiens serine protease 1 


13.72 


40^0 


M29540 


Hs.220529 


(tocusIink}NM-004383cHomo sapiens caidn 


13.44 


416406 


X73501 


HS.8490S 


ttm n itrtfiri m it II If in 1 iitii on 
nSilMsUaXyTOKBIaiUI cU 


12.70 


406667 


M12523 




12.42 


416768 


AA383733 


Hs.1032 


N^^006507iHoino sspiens fCQencfaftiQ Isio 


11 S8 


437935 


AW939591 


Hs.5940 


NIU[j033049tHomo sapiens nwdh 13i gpWifl 


11.58 


^£^2576 


AF23g666 


Hs.1545 


NM_001804:Hofno sapiens caudal type Itomeo 


11.50 


418888 


AU076801 


Ks.89438 


NM.004063:Homo sapiens cadhertn 17, U c 


10.16 


418007 


M13509 


Hs.63169 


NM.002421 :Hdmo sajdens matrix metaBbprD 


10.11 


422260 


AA315993 


HS.10S484 


MM fl32044!Hfinio sfloiens reraneiBlha oena 


10.01 


421582 


AI910275 


Hs^TO 


NK4JD032^Hotno s^ifens tr^il factor t 


9.77 


441031 


AI110684 


Hs.7645 • 


NM.005141:HonK) sapiens fit}rinooent B bet 


9i69 


424212 


NM.005814 Hs.143131 


NM.00S814:HoTno sapiens glycofvotain A33 


9.48 


453863 


X02544 


Hs^2 


Ms 1 $7?jOro5onffucoltl 1 


9.20 


407243 


AA0583S7 


HS.744G6 


Ooousnnk}NMj006690:Komo sapiens csrcin 


9!l8 


436217 


T53925 


HS.107 


NM-004457:Honio sapiens (ituinogen-IBcB 1 


9.04 


423371 


AU076819 


Hs.1650 


NM_0001 1 1 iHomo sapiens solute canter fa 


8.69 


430178 


AW449612 


Hs.152475 


K5.152475:EST5 


8.51 


423673 


BE003054 


Hs.1695 


NM.002426:Honx> sapiens matrix metaltopro 


a43 


447400 


AK000322 


Hs.18457 


NlyL017763:Homo sapiens tiypotheQcal prot 


ai7 


409683 


U33317 


Hs.711 


NM-001926:Komosa;^ d^nsin, alpha 6 


ai2 


431777 


AA570296 


Hs.307047 


hAiL032579;HonK) sapiens colon and small 1 


ao8 


450685 


L15533 


Hs.423 


NM.136938:Hamo sapiens pancreafiSs-asso 


8.06 


427583 


M82g62 


Hs.179704 


NM-005588:Homo sapiens meprin A, alpha ( 


7.95 


436624 


T64297 


Hs.351719 


NM.001443:H(xno ss^ns fatty add btndln 


7i74 


410407 


X66839 


Hs.83287 


NM_001216J4omo sapiens cart)or& anh^ras 


7.48 


413719 


BE439580 


Hs.75498 


■ NM_004591i^omo sapiens small Indudi^c 


7!41 


407007 


U22961 


Hs.184411 


NM_000477:Homo sapiens albumin (ALB), mR 


7.40 


412374 


X01388 


Hs,73849 


NM_000040:Honio sapiens apoiipoprotdn C> 


7.34 


407244 


M10014 






7.31 


419741 


Nftl.007019 Hs.93002 


Hs.93002:ut)iquItin'ConJU98Qno enzyme E2 


7.31 


406741 


AA058357 


H5.74466 


9reu5liiil()NMjD06890tHomo s^i^ns caicin 


7.26 


414385 


X00442 


H5.75990 


NM.0(Klt43:Homo saj^ns haptogioldn (HP)^ 


7i21 


404519 






7.18 


413585 


A]133452 


Hs.75431 ' 


NM.000509:Homo saiHens fit}rinogen, gamma 


7.13 


422281 


M36803 


Hs.346935 


NMj000613:Homo sapiens tiemopexin (HP)Q, 


7.10 


414463 


T68078 


Hs.76177 


NMJ001633:Homo Syrians ^pi)a>1-ndGfOQlob 


7.02 


430828 


AI783257 


H8.B6327 


Hs.86327:homeo box 69 


6.83 


433927 


AI557019 


H5.1 16467 


NM_032391.'Komo sapiens sm^ nuclear pro 


6.81 


406687 


M31126 


Hs,352054 


Hs.352054^pregnancy ^ledtic tjeta-l^glyc 


6.78 


423S38 


AW603823 


Hs.146268 


H8.14626B:EST8, Weakly simitar to C71400 


&53 


434206 


AW13e973 


Hs^15 


lfc_3fi9Q1S'Hoiiio winhuis cDMA FlJ3A87fi fh 


8,37 


409041 


AB033025 


Hs.50081 


KsiO081:KIAA1199 protein 


6.33 


432542 


AW083920 


Hs.16098 




6.19 


422664 


AA315933 


Hs.120879 


tts.120879'Homo saoians. dona MGC!32S71 


6.19 


436330 


NlhlL004413 H8.109 


NM 00441 S'Homo satins dloeafldasfl 1 in 


6.01 


421964 


)C73079 


K8.288579 


NI4,_002644;Homo saptens po^nwrte bnmunog 


6.00 


430272 


XD4898 


Hs.237658 


H&237fiS8'aDoIbianfr^n AJI 


5.97 


413881 


100190 


Hs.75599 


(tocustInk)Ni^(KX}486:Homo saplerts serine 


5^94 


42(fi23 


AF097021 


Hs.273321 


NiyL00641 8;Homo sapiens (K'ferentially ex 


5.94 


428470 


AO002301 


Ks.184507 


Ks.184507:Honio sapiens, slndtar to Homo) 


5.90 


420802 


U22376 


H8.1334 


NMj005375tHomo saptens v-myb cnyelolilasto 


5.89 


452304 


AA025388 


Hs.61311 


Hs.61 31 1:E8Ts, Weakly slmBar to S1059O 


5.89 


431727 


AW293464 


Hs.162031 


Hs.1 62031 :EST8 


&85 


421341 


AJ243212 


Hs,374281 


NM-0Q7329:Homo ssfriens deleted h maRgn 


5.64 


432023 


AW273128 


HsJ00268 


Hs.300268:EST 


5.75 


447033 


AI357412 


Hs.157601 


Hs.157601:EST8 


5.69 


411734 


AW374954 


H5.71779 


H3.71779:EST8, Weddy similar to S24C 


5.69 


406685 


M18728 




Oocusfink}NM„0024d3:Homo saddens cardn 


5.55 


428753 


AW939252 


Hs.192927 


NM_017728iKomo saf^erc protein phosphata 


5.48 


443247 


BE614387 


Hs.333893 


Hs.333893xell division cycle associated 


5.45 


409153 


W03754 


Ks^813 


NMj017625iHomo sapiens Intelectki (ITLN} 


5.44 


449388 


K53191 


Hs,35723 


Hs 36723'EST8 Weakfv sfaniar to O05G5.5 


5!38 


428046 


AW812795 


Hs.337534 


Hs^fi34'HfirM saoiens cDNA FL.12S241 fis 


5.38 


433013 


AI697890 


Hs.127337 


(locusllnk}NM_004655:Homo8apien8 axbi 2 


5,38 


419079 


AW014836 


Hs.18844 


Ks.18844:ESTs 


5.37 


428355 


BE256452 


Hs.2257 


NkC000638:Homo sapiens vitronectin (seni 


5.35 


422956 


BE545072 


Hs.122579 


(locu$linl4NM-018098:Homo sapiens ejpiOie 


S.34 


424326 


NM.014479 H8.145296 


NiyL014479:Homo saptens AIMMike, decysi 


5.30 


450543 


AI394037 


Hs.170296 


Hs.170296:Komo sapiens cONA: FU22090 fi 


5.30 


428187 


At687303 


Hs;285529 


HsJZ85529:6 proteliKOupted reoe^ 49 


5.30 


435538 


AB011540 


Ks.4930 


H$.4930dow density Epoprotein receptor 


5.28 


411825 


AK000334 


Hs.352415 


NMjOl 7767i1aino sapiens solute carrier fa 


6.28 


427722 


AK000123 


Hs.180479 


NML017671:Homo sapiens chromosoRn 20 opo 


6.26 


430569 


AF241254 


Hs.176098 


NM.021804:Komo sapiens angiobnsin 1 oon 


5.26 


414816 


Y13709 


Hs.77399 


NM.00126S:Hofflo sapiens caudal type homeo 


5.21 


430677 


Z26317 


Hs.359784 


NM.00ig43:Honio sapiens desmo^n 2 (086 


5.16 
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447208 


BE31S»1 


HsOT971 


430207 


AW07K59 


Hs.152256 


417491 


AW376842 


Hs.1085 


421379 


Y15221 


Hs.103982 


447342 


AI199268 


Hs.19322 


452194 


A1894413 


HS.373S99 


421307 


BE018S56 


Hs.109358 


406399 






403220 






408380 


AF123050 


Hs.44532 


415214 


A1445236 


Hs.125124 


431330 


X69532 


Hs.2777 


420344 


BE463721 


Hs.97101 


452594 


AU076405 


Hs^l 


441377 


BE218239 


Hs.202656 


444666 


BE293347 


Hs.11638 


415701 


NM>003878 


Hs.78619 


419354 


M62839 


Hs.1252 


455630 


AV655701 


Hs.75183 


422310 


AA316622 


Hs.98370 


423337 


NM_004655 


Hs,127337 


422330 


D30783 


Hs.115263 


408308 


BE296227 


HsJiS0822 


423936 


U77629 


Hs.135639 


404661 






406704 


AA056635 


Hs.5366 


420005 


AW271106 


Hs.133294 


451035 


AU076785 


Hs.430 


427506 


AK000134 


Hs.179100 


423445 


NM.014324 


Hs.128749 


453684 


AA35S925 


Hs.36232 


431301 


AA502384 


Hs.151529 


408983 


NM_O0M92 


Hs.663 


449032 


AA045573 


Hs^O 


434540 


NM.016045 


Hs^945 


407242 


M18728 




458748 


A1381530 


Hs^1132 


408298 


AI745325 


Hs.271923 


424273 


W40460 


Hs.144442 


411975 


AI916058 


Hs.144583 


425371 


D49441 


Hs.155981 


451917 


AW391351 


Hs.50820 


432867 


AW016936 


Hs.233364 


419559 


Y07828 


Hs.91096 


430294 


AI538226 


Hs.32976 


411248 


AA551538 


Hs.69321 


402496 






430937 


X534$3 


Hs^04 


434414 


AI798376 




443426 


AF098158 


H8.9329 


422539 


AJ009338 


H3.116138 


436972 


AA284679 


Ks.25640 


450531 


AW301032 


Hs.203800 


403055 






414809 


AM34699 


Hs.77356 


400965 






430204 


AA618335 


Hs.356664 


432978 


AF126743 


HS.279B84 


417931 


W95642 


Hs^l 


430832 


AI073913 


Hs.100686 


408482 


NM,000676 


Hs.45743 


422487 


AJ010901 


Hs.198267 


414617 


AI339520 


H1288817 


452940 


AA029722 


Hs2m 


439211 


AI890347 


Hs.271923 


459299 


6E094291 


H3.155651 


449720 


AA311152 


Hs.288708 


411142 


NM.014256 


K8.69009 


421777 


BE562088 


K9.108198 


419395 


BE268326 


KSJ0260 


443211 


Ah28388 


HS.1436S5 


403218 






430603 


AA148164 


Hs.247280 


409757 


NM.001898 


H3.123114 


426227 


U67058 


Hs.154299 


421408 


A1688223 


Hs^1096 


421100 


AW351839 


Hs.124660 


440869 


NM.014297 


Hs.7488 


414075 


U11862 


Hs.75741 


444151 


AW972917 


Hs.128749 


440409 


AW294316 


Hs.125608 


445564 


AB028957 


H3.12896 


424687 


J05070 


Hs.151738 



NM_024096:Homo sapiens hypoIheGcal prot 
Ks.152258:EST8 

NM_004963:Homo sapiens guanylato cyclase 
NM_005409:Honio sapiens sneD MudUa c 
Ks.19322:Homo sapiens. SimBarlo RIKEN 
Hl373599:EST 

H5.109358:ATP85e. Qass V. ^ 10B 



NM.006398:Konio sapiens uUquIGn D (UBD) 
NR.004442itofRO sapiens EphB2 (EPHB2). tr 
Nh\.002215:Ka(no sapiens Inter-alpha (stob 
NKL014373:Ko(no sapiens putaDve G protel 
Hs^81sotut8 canfer faniBy 26 [tUta 
Hs.202656:EST8 

(locusnn)c)NN1.016234:Homo sapiens fetty 
(locusIInl()NM_O03e78:Homo sapiens 
NM-000042:Homo sapiens apoBpoprotBin H 
NK|.000773:Hwno sapiens cytodirome P450. 
(iocusfink}Nlil.030622:Homo sapiens cytoch 
NM.004655:Honto sapiens axln 2 (oomhicOn 
NM.001432i{oino sapiens epinsgulin (EREG) 
(lDcusIInh}Nlil.003158:Homo sapiens seffine 
NM.005170:H{]ino sapiens adiaete-scute com 

NM.139053:Hafno sapiens epidennal grtMth 
Hs.133294£ST8 

MM>002670:Homo sapiens plasOn 1 (I isof 
NM.017678:Homo sapiens hypothetica! prol 
NMJ)14324:Honio sapiens alpha^neiliytacyi- 
NM.021067Mxno sapiens KIAA0186 gene pro 
Hs.1S1S29£STs 

NMJ)00492Honio sapiens cysQc fibiiosis t 
NM.004289:Homo sapiens nuclear factor (e 
NM.016045:Homo sapiens cltramosonie 20 ope 
(locusIinigNM.002483:Homo sapiens candn 
Ks.371132;EST8 

HsJ;71923:Homo sapiens cONA: FU22785 fi 
NM_003561 :Honio sapiens plu)sphoni}ase A2. 
Hs.144583:Homo sapiens, done IMAGE:3462 
NM-005823;Homo sapiens mesolheTin (MSLN) 
Hs.S0820:liypoU»6calcardiac/skeleia!m ' 
Hs.233364:ESTs 

NM.007028:Homo sapiens tripartita mo6{- 
(locustini()NIil.004485:Homo sapiens guanin 
Hs^321:KIAA13S9 protein 

Nftl.(K)2083:Hoino sapbns gtutalhione peiox 
AF134163:Homo sapiens Hiinian endogenous r 
(iocuslinlONM_012112:Homo sapi^ duomo 
NH.033013:Homo sapiens nudear receptor 
Hs.2S640xlaudln3 
Hsi03800:ESTs 

Hs.77358iranslierTb receptor (p9D. CD71 

Hs.356664:hypoihdiC8! protein FU32334 
NMj01323B:Hoinosaplens DNAJ domaliMonta 
Hs.82961 :Homo sapiens, ctone MGC:22588 1 
Hs.100686:an(ertor gradient protein 3 
Nhl.000676:Homo sapiens adenosine A2b lec 
NMJ018406:Homo sapiens mudn 4. traclieob 
(locusllRk)NMJD25130:Homo sapiens hypoUi 
NAl,002033:Hdmo sapiens fuonyRransferas 
Hs.271923:Homo sapiens cONA' FU22785 fl 
NM_021784:Homo sapte hepatocyte nwlea 
(locuslInk)NM_025l13:Homos^ns iiypotli 
NM.014256:Komo sapiens UDP-GlcNAc±etaGa 
NM.01609S:Homo sapiens HSPC037 protein ( 
HsM280:5«rnlnoimidazote4<8iix)xariiMe 
Ks.143855:EST8 

Hs.247280:ciiFoniosome 20 open readir^ bd 
NM.001698:Homo sapiens cystaSn SN (CST1 
(Iocusiin)(]NkL0GS242:Homo sapiens co^ut 
NM_062816:Homo sapiens tripartite motif- 
Hs.124660:ESTs, Moderately slmBarto 21 
NM.014297:Homo sapiens protein expressed 
NM.001091:Homo sapiens amBoride binding 
{tocusSnl()NM.0t4324:Homo ssqiiens dpha- 
Hs.1^608:EST8 
Hs.12896d<IAA1034 protein 
NM-004994:Homo sapiens matrix metaflopro 



5.14 
&12 
5.12 
&07 
SOS 
5.01 
499 
4.98 
4.94 
4.92 
4.92 
4.85 
4.84 
481 
481 
480 
480 
4.80 
478 
478 
475 
473 
470 
470 
4.68 
467 
466 
461 
460 
459 
4.55 
454 
4.54 
4.54 
454 
4.53 
453 
451 
450 
449 
449 
446 
444 
444 
442 
439 
4.38 
437 
438 
4.35 
433 
433 
433 
431 
431 
430 
429 
4.29 
428 
428 
428 
427 
427 
4.27 
427 
4.2S 
424 
4.24 
4.21 
420 
420 
4.20 
419 
419 
419 
418 
418 
417 
417 
416 
416 
416 
412 
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401866 411 

431611 U58766 H5^28 Hs.254428.-0ssue specific transplantaflo 4.10 

430187 AI799909 Hs.158389 Hs.158989:Homo sapiens cD^iAFU37936 lis 4.10 

414590 NM_000506 Hs.76530 NM_000506:Homo sapiens coaguteilon facto 4.08 

449281 AI808699 Hs.162717 NM032766;Homo sapiens hypottistfeai pfol 4.08 

449722 BE280074 fe.23960 Hs23960CcydInB1 4.08 

418318 U47732 Hs.84072 NKL004616:Homo sapiens Iransmembraie 48 4.05 

435066 BE281750 Hs.4747 NiyL001383:Hon»> sapiens dysteratosiscong 4.05 

414753 AF158255 Hs.77225 NiyL006437:Hbnn sapiens ADP-ribosynransf 4.04 

403221 4.04 

409B89 AW630041 HS.S6937 NM.02ig7a:KomosaplBre suppression Of hi 4.03 

449027 AJ271216 Ks.22880 NM.005700:Konx)8a(^d!pep^ 4.03 

431657 AI345227 Hs.105448 H3.10S448:proteln kinase, lysine deficfe 4.01 

439759 AL359055 Hs.67709 Hs.67709tow> sapiens mRNA full length I 4.00 

441362 BE614410 Hs.23044 NMJ)80668:Homo sapiens simflar to RIKEN 199 

417900 6E250127 Hs.82906 Hs.82906:CDC20ceS division cyde 20 ho 3l99 

428987 NM_004751 Hs.194710 NML004751«omo sapiens glucosamlnyt(N^ 199 

456977 AK000252 HS.16975B NM_017723:Homo8apfenshypolhetIcdprot 3.99 

445919 T53519 Hs.334692 Hs.334592-Jiypo!hefical protein MGC1 41 41 3.98 

423164 AK00O232 Ks.124835 NM.019062:HoroD sapiens hypothetical prat 198 

422627 BE336857 K5.118787 Hs.118787:tran5fonii!ng growth factor, be 3.97 

422363 T55979 Hs.115474 NM_002915:Honio sapiens replication facto 3.97 

433437 U20536 Hs.3280 NM„001226:Homo sapfens caspase 6, epopto 197 

415992 C05837 Hs.145807 Hs,145807dTypothetlca) protein FU 13593 196 

425834 NM.001639 Hs.ig57 Hs.1957:aniyto!dP component, serum 186 

41SD0O AWQ25529 Hs.239812 Hs.238812xerologlca))y defined breast c 195 

432407 AA221036 AF134164:Homo8aplens Human endogenous r 195 

408243 Y00787 Hs.624 NM>000584:Homo sapiens Inlerleukin 8 (IL 194 

408494 AA554714 Hs.187578 Hs.187578:Homo sapiens cDNAFUl 1639 (Is 194 

412610 X90908 Ks.74126 NM.001445:Homo salens fatty add bindin 194 

433323 AA805132 Hs.159142 Hs.1 591 42jufiaf]c fringe homoiog(Drasop 194 

422515 AW5O0470 Hs.117950 Hs.llTSSO.-phosphoribosylarrtnolmldarotec 192 

•436543 NM_002212 Hs.5215 Hs.5215:lntegiin beta 4 binding protein 191 

418113 AI272141 Hs.B3484 Hs.83484:SRY (sex determinii^ region Y). 191 

433083 AL042759 Hs.l91762 Hs.191762:hypoihBacal protein MGC20258 190 

431939 AW00B061 Hs.231994 Hs.231994:HomD sapiens, clone IMAGE:4341 190 

453439 AI572438 Hs.32976 NNL004485:Komo sapiens guanine nucleotid 3.89 

441888 A1733306 Hs.128071 NML022901:Komo sapiens hypotheticatprot 3.89 

432150 AK000224 Hs,272789 NM_01 771 6:Homo sapiens membrane<5paining 189 

425234 AW15222S Hs.165909 Hs.l65909:EST8. Weakly sftnHar to hypoth 3.89 

423803 Nl\4_005709 H3,132945 (tocuslInk)NM-005709:Homo sapiens PDZ.73 187 

410418 D31382 Ks.63325 NMJ)19894:Homo sapiens transmembrane pro 185 

436251 BE515065 HS.2S6585 0ocuslInk)NM.OO6392M)mo sapiens nudeo 185 

422424 AI166431 Hs.296638 K8.296638:prostdBd]f1isfentlaflonfeeto 184 

432269 NM.002447 Hs.2942 Hs^2anacrophage sQmuIaCng 1 recepto 184 

424905 NM_002497 Hs.153704 NM.002497:Homo sapiens NIMA (never in ml 184 

416209 AA236776 Hs.79078 NI\/L002358;Homo sapiens MAD2 mitotic ane 3.83 

430660 AW138724 Hs.168974 H8.168974£Sr6 183 

434370 AF130988- Hs.58346 NM.022336:Homo sapiens ectodysplaslnl. 162 

436481 AA379597 Hs.5199 NM014176:Homosaj^nsHSPC150prolebi8 182 

453700 AB009426 HS.S60 NM.001644:Homo sapiens apdlpQproleinB 181 

410619 BE512730 H3.65114 Hs.65114d(eiatin 18 181 

40d420 Z15008 Ks.54451 NA1.006S62:Homo sapiens tamlnln. gamma 2 179 

422535 AA311914 Ks.154576 Hs.154578:Homo sapiens mRMA for FU002S6 179 

432179 X75208 Hs^13 NM_004W3:Homo sapiens EphB3 (EPHB3), mR 178 

453967 AW009077 Hs.232947 Hs.232947£ST8 178 

428106 AI678765 Hs.21812 Ks.21812£ST8 178 

434170 AA626509 Hs.159642 (tocuslInk)NM.0D1490:Homo sapiens glucos 178 

418322 AA284166 H3.84113 NM.005192:Homo sapiens cydln^Jependent 178 

444381 BE387335 Hs.283713 NM.138465:Homo sapiens collagen triple h 174 

419229 AI827237 Hs.362919 H8.362919:ESTs 174 

437156 AI916600 H3.121194 Hs.l21194:Homo sapiens cDNA: FU21 569 fi 174 

452833 BE559681 Hs.30736 (locusynk)NM-015201:Komo sapiens block 173 

426831 BE2g6216 Hs.l72673 NM_000687:Homo sapiens S^enosylhomocys 173 

428970 BE276891 Hs,194691 NHL003979-itomo sapiens r^nolc add Ind 172 

443957 AA521049 Hs.353013 Hs.353013:dTOmosome 20 open reading !ra 171 

408832 AW085690 Hs.63428 Hs.63428:Homo sapiens CONAFU3W57 lis. 171 

440300 N39760 Hs,8859 NM-138791Homosaifcns8pyr8se(SHAPY), 171 

425976 a5094 Hs.334514 NM.025257:Homo sapiens dvomosome 6 open 171 

432575 AA553722 Hs.194346 Hs.194346:Splr-2 protein 171 

412104 AW205197 K&.240951 (tocusIlnk)NM.03312aHomo sapiens naked 171 

417001 AU076648 Hs.80741 NM.000282:Homo sapiens proptonylCoenzym . 169 

421225 AA463798 Hs.102698 Hs.102635:MCT-1 protein 169 

445109 AF039916 Ks.12330 NM.001247:KomosapiensedDnudeos{detr 169 

428330 1^524 HS.22S6 NiyU»2423:H(8no sapiens matrix mstattopro 187 

447472 AW207347 Hs.211101 Hs.211101£STs 167 

423349 AF010258 Hs.127428 NK^002142:Homo sapiens homeo box A9(H0X 167 

422026 U60736 H3.110826 Ks.l10826:tnro2deo0de repeat contM 166 

419574 AK0Oig89 H5.91165 Ks.91165:tiypolhetical protein FU1 1127 166 

417720 AA205625 Ks.208067 Ks.208067£ST8 168 

411257 AA628S67 Hs.115274 Hs.115274dndian hedgehog homotog(0ro60 166 

421515 Y11339 Hs.10S352 (tocuslink]NKL01B414Mnu) sapiens GaMAc 165 

433875 AW977653 Hs.75319 Hs.7S319a1bomideo8de reductase M2pQl 165 

179 
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mm 

445417 
412140 
420542 
439453 
414798 
428662 
414052 
412056 
401519 
428011 
450505 
421903 
413936 
424544 
431563 
435602 
434359 
439963 
447334 
422150 
45(^63 
424825 
422765 
422106 
422532 
425860 
442053 
437386 
415927 
446372 
432378 
434171 
428479 
415099 
414918 



418384 
418203 
429833 
409231 
431567 
453883 
442700 
410237 
428407 
436213 
44323 
431548 
452316 
448993 
447320 
414108 
420996 
439580 
422158 
418256 
400157 



453751 
421526 
415164 
405451 
414361 
422237 
417576 
457001 



421828 



417348 
423554 
451310 
425873 
429271 
437575 
432677 
439955 
443991 
435745 
403532 
413916 
42S247 



AF127026 

AK001058 

AA219691 

NM.000505 

BE264974 

A)286323 

NM.000346 

AWS78849 

T28160 

BE387514 

NM.004572 

AW079940 

AF113676 

M88700 

AI027643 

AF217515 

AI650363 

AW247529 

AA515032 

AI867118 

H43540 

AF207069 

AW409701 

D84239 

AL008726 

L29339 

R35343 

W52452 

AL120168 

ABQ20644 

AI493046 

BE247688 

Y00272 

AI492170 

AI219207 

AW895503 

AW149266 

X54942 

NiyL012079 

AA446644 

N51357 

AI63B516 

AA377618 

AI750589 

N^t003963 

AA325512 

AW248322 

AI834273 

AA298484 

AI471630 

AI675419 

AI267592 

AK001927 

AF086401 

L10343 

AW845318 

A)355761 
R36762 
A1060121 
AW084352 

AI08613B 

M13149 

AA339449 

J0325a 

AA305384 

AW89ig85 

BE387040 

AI940507 

M90516 

AW250651 

NbL013390 

AF039850 

AW954355 

NM_C04482 

AW203959 

NIVL002250 

AW967059 



Hs^94 

Hs.12680 

Ks.736^ 

H9.1321 

Hs.6568 

HsJ7411 

H3^316 

Hs.283552 

Hs.776 

Hs.181416 

Hs^5051 

K5.15951 

Hs^681 

Hs.150403 

Hs.120912 

Hs.283532 

Hs.116462 

Hs.6793 

Hs.91109 

Hs.279607 

H5.25292 

H5.153357 

Hs.1578 

Hs.111732 

H5.118126 

Hs.1964 

Hs.24968 

Hs.356766 

Hs.78919 

H8.14945 

Hs.146133 

H5.347349 

K5.334562 

Hs.77917 

H8.72222 

HS.125Z76 

H5.25130 

Hs.83758 

Hs.288627 

H5.692 

Hs,260855 

Hs.347524 

Hs.103834 

Ks.61258 

Hs.184194 

Hs.71472 

H5.95835 

Hs.9711 

Hs.61265 

Hs.355952 

K8.164464 

HS75761 

Hs.100895 

Hs.293847 

Hs.112341 

Hs.12271 

Hs356473 

Hs.242463 

Hs.101282 

Hs.105460 

Hs.157123 

Hs^04044 

HS.149B 

Hs.82285 

Hs^062 

Hs^740 

Hs.367942 

Hs.182476 

Hs.318526 

Hs.1674 

Hs.26213 

H5.160417 

Hs.198515 

Hs^29 

HSJ278811 

Hs.149532 

Hs.10082 

Hs.374342 



NM.005379cHomo sapiens myosin lA (MY01A) 
Hs.12680:Hbmo sapiens cONA FU10196 lis. 
NM.005733:Homo sapiens RAB6 interacSng, 
NM.00060S:Homo sapiens ooaQutaOon facto 
Hs.6566tthynAI honnona fecaptor fnterac 
H3.9741 1 Jiypolheticar protein MGC12335 
Hs.2316:SRY (sax detennining region Y>t 
Hs^B3552iiypothe8cal protein 60016153 
H8.77B:Buanylato cyclase acfivalDr 2A(g 

NM.014730:Komo sapiens KIAA0152 gene pro 
NM.004572:Komo sapiens ptakophlTin 2 (PK 
(locusnnI()NM.145202:Homo sapiens prdin 
NM-00Q295:Homo sapiens serine (orcystei 
NM_000790i{omo saf^ns dopa decarboxyias 
Hs.120912:EST8 

NVL018455:Homo sapiens unchaiaclerfzed b 
Hs.116462£ST8 

Hs.6793:piatelet^0vaQng factor acety 
Hs.91 109:ESTs. Wealdy slmflar to pulaiiv 
Hs.27g807:Homo sapiens GONAFU34399 lis 
(Hs^292:n1}onuclease H2, large sUbtma 
Nk(L001084:HonK> sapiens proarflagen^n 
NMJX}1168:Homo sapiens baculoviral lAP r 
Nhl.003690:Homo sapiens IgG Fc fabidino pr 
(locusl]nlO)iliL000308itQmo8a{]ienspiote 
14000343.-8^ sapiens solute canier la 
Hs^4968:hypothetlcaI protein 6C016683 
H3^58766:Homo sapiens mRNA; cDNA DKF2p7 
NhL021083:Hcmo sapiens KeQ blood gro^p 
Hs.14945:^-aci(K>)enzymeAl^.l 
Hs.146133:ESTs 

(Iocusi(n)()NM.004749:Homo sapiens ceil c 
NM_001786:Homo sai^ns ceii division eye 
NM_006002:Homo sapiens ubtquifln carfooxy 
Hs.72222rfer-1-li((e 4 (C elegans) 
Hs.125276:Homo sapiens cDNA FU25833 fis 
Hs.25130:Homo sapiens cONA RJ14923 fis. 
Nfc/L001B27:Homo saj^ens CDC28 protein Idn 
NR/L01 2079:Homo sapiens diacytgiyceroi 0- 
NM.002354:Homo sapiens tumor-assodated 
(IocusnniqNM_145175:Homo sapiens NSE1 ( 
Hs^7524:Homo sapiens, clone MGC24665 
NhC024056:Komo salens hypoUMUcal pro! 
Hs.61258:arginInosuccinate lyase 
NM.003g63:Hofno sapiens transmembrane 4 s 
NM_024662:Komo sapiens hypotheficsl prot 

NliL017515:Komo sapiens novel protein (HS 
NM.138805:Komo sapiens famOy with seque 
Hs^5952:ESTs. Weakly similar to 090320 
H8.164464:Honio sapiens, done MGC;23656 
NM.003137:Homo sapiens SFRS |m^n kina 
(tocusllnk)NKlL018099:Homo sapiens bypoU) 
HsJ93847:EST8 

NM_002638:Homo sapiens protease Inhibito 
(Iocuslink)NM.012162:Homo sapiens F-box 
NML006713itomo8apfen8 acGvated RNA pd 
Hs^42463:teraen8 

Hs.101262:Homo sapiens mRNA; cDNA DKFZp4 
NlVL015393:Homo sapiens DKFZP564O0823 pro 
Hs.157123£ST8 

Hs.204044:ESTs 

NM.000412:Homo sapiens his6din&^li gl 
NM-000819:Homo s^ns phosphoribosytgly 
Hs^082Mtam!n D (1,25- dihydroxyvilami 
NM.014584:Homo sapiens EROI-Uite (S. cer 
Hs^67942:Homo sapiens, done tMAGE:4701 
NM.031295:Homo sapiens Willlann Beuren s 
NM.025138:Homo s^ns hypotheScaf prot 
NM-002056:Homo s^ns g!utamlne4ruct08 
NM.052951 :Homo ss^ns chromosome 20 ope 
Hs.160417dransmembraie protein 2 
NM05224:Homo sapiens dead rlr^er-Gita 
NM.024320:Homo sapiens hypothetical prot 
NH.004482:Komo sapiens UOP-N-acetyMph 
Hs.149532:EST8 

NM^002250:Homo sapfens potassium Interme 
Hs.374342:Homo sapiens done 2471 1 mRNA 



N49813 Hs.75615 
Nlil.00S940 Hs.155324 



NM.000483:Honio sapiens apd^piotoln C- 
H8.1SS3243natrix metaOoprotelnase 11 (s 



a65 
a64 
3.64 
163 
3.63 
a62 
a62 
3.62 
3.61 
3.60 
3.60 
160 
158 
158 
158 
157 
157 
157 
156 
156 
156 
156 
156 
155 
155 
155 
155 
155 
154 
153 
153 
153 
152 
152 
151 
3.61 
151 
151 
151 
151 
160 
150 
160 
150 
150 



148 
148 
148 
148 
147 
147 
146 
146 
146 
146 
3.46 
146 
3.45 
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144 
144 
144 
144 
143 
143 
142 
142 
142 
142 
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141 
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140 
140 
140 
140 
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424996 


AF006005 


Hs.154104 


NM»(X)2657:HoiTio saplsns ptstomofpNc sdon 


3.38 


402344 








137 


417165 


R80137 


Hs.302738 


HsJ02738:Hdino sapiens cONA: RJ21425 fi 


3.37 


427526 


AU077143 


H5.17g565 


NM.002388:Hoino saptens MCM3 minlchromoso 


3.37 


426711 


AA383471 


Hs.343800 


(|ocusIink)NM.033^KgnK) sapiens epiOie 


3.37 


439186 


AI697274 


Hs,105«5 


Ks.10S435.-GDPwn3nnose4.Mehydrat8S8 


a36 


444783 


AK001468 


H3.62180 


NM.018685:Komo sapiens anllltn, acOn U 


3.36 


426174 


AA547959 


Ks.115838 


Hs.11S836:ESTs 


3.36 


421585 


1)95626 


KS302043 


Nk^003965:Homo sapiens chemoMne (C-C mo 


3.35 


421605 


8E44O108 


Ks.106127 


NMJD15972:HonK) sapiens RNA poiymersse ) 


138 


446921 


AB012113 


Hs.16530 


NMJ)029B8:Homo sapiens smali inducible c 


3.36 


438746 


AI885815 


Hs.184727 


Hs.184727£STs.W^8iin!iartoT45736 


3.36 


403219 








3.36 


420981 


L40904 


HS.1C0724 


NM»005037:Komo sapiens peroadsome praGf 


3.34 


456946 


T29678 


H5.166068 


Hs.166088:vHEn1 


3.33 


425580 


111144 


Hs.1907 


Hs.1907:ga!anln 


3.33 


412605 


AW410734 


Hs,74111 


Hs.7411 1:RNA binding protein (autoanOge 


3.33 


4413B4 


AA447849 


Hs288660 


H3.288660:Homo sapiens cONA: FU22162 fi 


3.33 


416782 


L35035 


88.79888 


GxusRnK)NM-144563:Komo sapiens ribose 


3.33 


426761 


A1015709 


Hs.172089 


Hs.172089:pr(H)ncosis receplor Inducing 


133 


441633 


AW958544 


Hs.112242 


N^1.03241 3:Homo sapiens nomtal mucosa of 


3.32 


416975 


NM_004131 


Hs.1051 


NM.004131:Homo sapiens gianzyme B (granz 


131 


428874 


W32133 


H5.194366 


Hs.1 94366:transthyr8tin (prealbumin, amy 


131 


431192 


At670056 


Hs.137274 


Hs,1 37274:EST8. WeaMy similar to hypoth 


130 


431836 


AF178S32 


Hs^1411 


NM..1 3B992:Homo sapiens beta^ APP-cta 


130 


413219 


AA878200 


Hs.118727 


- H5.118727:HomosapienscONAFlJ33803fis 


130 


410639 


BE269047 


Hs.65234 


(locusnnk)NML017895:Homo sapiens U&m 


129 


450737 


AW007152 


HS.6332S 


Hs.6332&transm8mbrane protease, serine 


129 


410850 


AW352867 


Ks^02738 


Ks.302738:Homo sapiens cONA: FU21425 fi 


129 


428450 


N1\1.014791 


K&184339 


NM-014791:HonK> sapiens maternal embiyonl 


129 


437B10 


BE246399 


Hs.387646 


NM.016617:Homo sapiens hypoDiefical prat 


3.29 


403381 








126 


434031 


BE384165 


Hs.23723 


(locuslink)NM.025215:Hbmo sapiens pseudo 


3.28 


421975 


AW961017 


Hs.6459 


(locusIinl()NM.024531:Homo sapiens hypoth 


3.28 


452299 


AW206330 


Hs.355663 


Hs.355663:ESTs 


128 


428024 


Z29067 


H5.2236 


Hs.2236:NIMA (never in mitosis gene a)-r 


3.28 


412994 


D322S7 


HS75113 


H8.751 1 3:gensral transcription factor 11 


126 


443162 


T49951 


Hs.9029 


(Iocustink]NM.015515:Hon» saptens type 1 


128 


'435327 


BE301871 


Hs,4867 


Ks.4867;mann06yl (alpha-1,3>)^)ycoprote 


128 


424010 


AL080188 


Hs.137556 


NM-0331 00:Homo safte MT>protocadtterin 


128 


428953 


AA306610 


Hs.348183 


NM.003823:Homo sapiens tumor necrosis fa 


127 


419359 


AL043202 


Hs.90073 


Hs^0073:CSE1 chromosome segregation 1-i 


127 


414695 


BE439915 


Hs.76913 


Hs.76913:proteasome (pnisome, macropain) 


127 


444371 


BE540274 


Hs,239 


Hs^39:fotkhe3dboxM1 


127 


450221 


AA328102 


Hs,24841 


NM.018204:Homo sap^ns cytoskeleton asso 


127 


449207 


AL044222 


Hs.23255 


NM>004298:Momo8aptens nudeoporin ISSkO 


127 


422609 


Z46023 


Hs.1 18721 


NM-000434:Homo sapiens siatidase 1 (iyso 


127 


403485 








127 


441623 


AA315605 


Hs.346710 


Hs^8710:Homo sapiens, dona tMAGE:4242 


3.26 


421943 


BE6165:H) 


H5.343912 


NKL033504:Homo sapiens CAC-1 (CAC-1). mR 


126 


405484 








3.28 


435849 


BE305242 


Hs.16098 


HsJ6098:ciaudin2 


3.26 


449139 


BE268315 


Hs.23111 


NM.0O4461:Homo s^iiens |dienylalanin&4RN 


3.28 


404664 






125 


447188 


H65423 


H5.17631 


NMj030804:Homo salens hypotheOca) pro! 


125 


423228 


AA323414 


Ks.146109 


Ks.146109:EST8. We^ similar to T28937 


124 


413254 


U40272 


Hs.75253 


NM-004135:Homo sapiens isodtrata dehydr 


124 


424243 


AI949359 


Hs.143800 


H8.143600:gol^ ptiosplK^ratein 4 


3.24 


435014 


6ES60898 


Hs.10026 


NM.022061:Homo sapiens ribosoma! protein 


3.24 


452261 


T93500 


H!a8792 


Hs.28792:Homo sapiens cONA FU1 1041 lis, 


124 


416065 


BE287931 


Hs.78996 


NM_002592:Homo sapiens proTifsrafing eel 


123 


427333 


AF087797 


Hs.176658 


NM.001 1 69*ilomo sapiens aquaporin 8 (AQP8 


123 


443464 


BE548446 


Hs^1579 


NM.021095M)mo sapiens solute canter fa 


123 


432035 


AA524725 


Hs.162108 


Hs.162108:EST8 


3.23 


408868 


AW292286 


K9^5058 


Hs.255058:EST8 


123 


429504 


X99133 


Hs.204238 


H3.204238:lipocaiin 2 (oncogene 24p3) 


122 


441085 


AW136551 


Hs.181245 


H$.181245:Homo sapiens cONA aJ125326s 


122 


426991 


AK001536 


Ks.214410 


Hs^14410:Homo sapiens cONA FU31S73 fis 


122 


408901 


AK001330 


Ks.48855 


(locustlnk]NNt018101:Homo sapiens hypoth 


122 


439979 


AW600291 


HS.6B23 


NMJ)18092:Honio sapiens neuropilin (NRP) 


122 


453968 


AA847843 


Hs.62711 


Hs^271 liHomo sapiens, done IMAGE:3351 2 


122 


457465 


AW301344 


Hs.122908 


NM-030928:Homo sapiens ONA repScaOon f 


122 


426317 


AA312350 


Hs.169294 


NMJD03202:Homo sapiens transcriptton fee 


121 


414639 


X67055 


Hs.76716 


NIiC002217:Komo sapiens pre-alpha [globu] 


121 


439975 


AW326081 


Hs.6817 


NM-033453:Komo sapiens inoslne triphospli 


120 


4M261 


AA298958 


Hs.10724 


Hs.10724:m!tochondrial ribosomal protein 


120 


454033 


AF107457 


Hs.37035 


NM_005515:ltomo sapiens homeo box KB9 (HL 


120 


424837 


BE276113 


Hs.333034 


NM.003491:Komo sapiens Af^l homoiog. N-a 


120 


427747 


AW411425 


Hs.180655 


(iocusnnk)NM.004217:Klomo sapiens serine 


120 


438469 


AK001455 


Ks.5198 


H8.51 98:Down syndrome ofllcal region ge 


119 


400130 




Hs.155560 


NM.001746:Komosa|toc8lnexin (CANX), 


119 


422293 


X94453 


Hs.114366 


Hs.1 1436&9ynotine-&€artaa(ylatB synihe 


119 


400290 


H16836 


Hs^1608 


(kjcusfnk)NM.01763eMiRio sapiens band 


110 
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419239 


AM68183 


Hs^98 


(tacusHnk)NM.033103:HotTio sapiens rtiopN 


3.18 


426215 


AW963419 


Hs.155223 


l^003714i{omo sapiens stanniocatdn 2 ( 


3.18 


425743 


BE39&495 


Hs.159428 


N^^l38761i1omo sapiens BCL2-8ssodated X 


3.17 


413313 


Nti.002047 Hssaom 


NM.002047:Hamo sapiens glycyMRNA synlh 


3.17 


422714 


AB018335 


Hs 119387 


NM.014698:Homo sai^ns KIAA0792 gene pro 


3.17 


408353 


BE439838 


Hs.44298 


N^4.015969:KQmo sapiens mitochondrial rib 


ai7 


400203 




H3.1390 


NM.002794:Homo sapiens proteasome (proso 


3,16 


412870 


N22788 


H3.B2407 


NM_022059:Komo sajiiens chemoUne (C-X-C 


3.16 


426088 


AF038007 


Hs. 166196 


NM_005603:Homo sapiens ATPase, Class 1, 


3.16 


416984 


H38765 


Hs.60706 


NM_000903:Komo sapiens NAO{P)H deiiydroge 


3.16 


450635 


AW403954 


Hs^237 


NM-016647:Homo sapiens mesenchymal stem 


3.16 


408708 


AI2fi27S9 




At282759:qt84a01j(1 NCLCGAP.Col4 Homos 


3.16 


452838 


AWg55454 


Ks^942 


NM.004093:Homo sapiens ephiin-B2 (EFNB2) 


3.16 


430127 


AA219498 




H5.233952:protBasome (prosome, maoopain 


3.16 


417308 


H60720 


H&B1892 


NM.014736:Honio sapiens KiAAOlOl gene pro 


ai5 


408116 


AA251393 


Hs.289052 


NM.OS2842:Homo sapiens BCL24ite 12 (pro 


3L15 


402474 








ai5 


419488 


AA316241 


Hs.90691 


NM.006993:Homo sapiens nudeophosmlnMuc 


ai5 


446595 


T57448 


Hs.15467 


NM_017943:Homo sapiens hypolhelicSi pot 


3.15 


444954 


AW247076 


Ks.12163 


NM.003908:Homo sapiens eukaryoiic trsnsl 


3.15 


434263 


N34895 


H8.79187 


Hs.79187»QX8ac)de vims and adenovins 


ai5 


411165 


NNL000169 


Hs.69089 


NM.000169'itomo sapiens galactosidase, at 


3.15 


430696 


AA531276 


KS.S9509 


Hs.S9S09:EST8. Weakly similar to sin^ar 


ai5 


436391 


AJ227692 


Hs.146274 


Hs,146274:ESTs 


ai5 


409142 


AL136877 


Hs.50758 


ite.50758:SMC4 structural maintenance of 


ai4 


445873 


AA2S0970 


Hs.251946 


Ks^51946:Homo sapiens cONARJ11840fis 


3.14 


432370 


AA308334 


Hs.274424 


NM_016946:Hoaio sapiens N^etyineuraminl 


ai4 


430514 


AA318501 


Hs.241587 


NM_021246:Homo sapiens lymphocyte aniige 


ai4 


417791 


AWg65339 


Hs.44269 


Hs.44269:Homo sapiens cDNA FU37972 fis, 


ai4 


417115 


AW952792 


Hs.334612 


NM.003094:Homo sapiens smaO nudear rib 


ai3 


411126 


NI^001202 Hs.68a79 


(locusIinlONM.OOl 202:Homo sapiens bone m 


ai3 


456906 


AF117646 


Hs.156637 


NM.012116:Homo sapiens Cas-Br>M (murine) 


ai3 


425123 


AW205274 


Hs.154695 


NM-000303:Koino sapiens phosphomannomutas 


ai3 


408056 


AA312329 


Hs.42331 


Hs.42331:ephiirvA4 


ai3 


446366 


AI032108 


Hs^24 


Hs^24:hepatocyte nudear factor 4, al 


ai2 


407804 


AF228803 


H3.39957 


NM.016445:}tomo sapiens pleclcstrfn 2 (mou 


ai2 


412723. 


AA648459 


Hs.335951 


Hs.335951:hypolhetical protein AF301222 


ai2 


407233 


X16354 


Hs.50964 


(locuslnlONR^001712:Homo sapiens cardn 


ail 


407168 


R45175 


Hs.117183 


Hs.117183:HQmo sapiens mRI^ cONA OKFZpS 


an 


452721 


AJ269529 


Hs.301871 


Hs.301871 solute carrier tamBy 37 (gtyc 


an 


426427 


M86699 


Ks.16g840 


Hs.169840:nK protein kinase 


an 


409162 


H25530 


Hs.50868 


Hs.5086B:soiute canier family 22 (orpn 


aio 


412612 


Nli^000047 Hs.74131 


NM.000047:Homo sapiens arylsulfalase £ ( 


aio 


434881 


AA206153 


Hs.4209 


NiyL016491:Homo sapiens iritochandriii rib 


aio 


407944 


R34008 


Hs.239727 


N^1.024422J^omo sapiens desmocolBn 2 (OS 


aio 


414806 


D14694 


Hs.77329 


(tocusnnlONM_014754:Homo sapiens pttosph 


ao9 


454464 


AWB11606 


Hs.271819 


H3.271819:mudn 17 


ao9 


415474 


NMt0142S2 Hs.78457 


NM-014252:Homo sapiens solute carrier fa 


ao9 


422616 


BE300330 


H5.118725 


NM.01224&Homo sapiens selenophosph^ s 


ao9 


421470 


R27496 


Hs.1378 


NM-0051 39:Homo sapiens annexin A3 (ANXA3 


ao9 


419551 


AW5622S6 


Hs.91011 


NM.006408:Hon:v> sapiens anterior gradient 


ao9 


418691 


AW752389 


Hs.87296 


Hs.67296:K(Hno sapiens cONA FU20269 fls. 


3.08 


422163 


AFQ27208 


Hs,112360 


Hs.ll2360:proniinln>like 1 (mouse) 


ao8 


447760 


AI431328 


Hs.346605 


NliU)52963:Homo sapiens mitochondiia! top 


ao8 


405506 








ao8 


429957 


AW204530 


Hs.99500 


Hs.99500:EST8 


ao8 


410166 


AK001376 


H5.59346 


NM.018122:Homo sapiens hypothetic^ prot 


ao8 


422680 


AF228704 


Hs.193974 


Hs. 1 93974:glutatWone reductase 


aod 


442013 


AA506476 


Hs.375009 


Hs.375009:Honx> sapiens mRNA: cDNA DKFZp6 


ao8 


431722 


AF16152B 


Hs.268049 


(locusIink}NhL016101:Komo sapiens hypoth 


3.08 


421508 


BE3027g8 


Ks.105097 


Hs.105097%midin6 kinase 1, sohibte 


3.08 


433659 


AK001301 


Hs.3487 


NA4^018093:Homo sapiens hypothetical prot 


3.07 


439492 


AF086310 


Hs.103159 


Hs.103159:ESTs. Weakly similar to T06291 


ao7 


417866 


AW067903 


Hs.82772 


Hs.8277Z'coIIagen, type XI, alpha 1 


3.07 


412530 


AA766268 


Hs.266273 


(k)cu8link)NMJ)24918:Homo sapiens chromo 


3.07 


412869 


AA2g0712 


H5.82407 


Hs.82407:ch8moldne (OX-C motif} ligand 


ao6 


453132 


AW951952 


Hs^93724 


Hs.293724:Homo sapiens cDNA FU12683fis 


ao6 


424971 


AA479005 


Hs.154036 


NM.003311:Homo sapiens tumor suppressing 


ao6 


427557 


NM.002659 Hs.179657 


NM_002659:Homo sapiens plasminogen ectlv 


ao5 


439273 


AW1390g9 


Hs.367692 


Ks.367692MmiD sapiens cDNA FU25668fi8 


ao5 


431945 


AW000827 


Hs.1ig62 


NM_03076e:Homo89ptens ctpoplosls regulat 


ao5 


435703 


AW630133 


Hs^13 


(locus!ink)NM.020192:HoniO88piens GK003 


ao5 


407289 


AA1351S9 


Hs.203349 


Hs.203349'itono sapiens cONA FU12149 fis 


ao4 


403739 






ao4 


444664 


N26362 


Ks.11615 


NMJ)16086:Homo sapiens map kinase phosph 


ao4 


409152 


AA17668S 


Hs.194346 


Hs.194346:Spir.2 protein 


ao4 


409093 


6E243834 


HS50441 


NM.Ol5936:Homo sapiens CGM)4 protein (L 


ao4 


406545 






3.03 


450653 


AWBS0613 


Hs.8715 


Ks.B715%pothetkal protein MGC3232 


ao3 


416867 


031771 


Hs.69404 


NK1.00244g:Ho(no sapiens msh homeo box horn 


ao3 


422976 


AU076657 


Hs.1600 


Hs.160ftchap8ionln containing TCP1, subu 


3.03 


434523 


AA7Q3709 


Hs23410 


(|ocuslink)NH.016S39:Homo sapiens sktui 


ao3 


440088 


BESS9877 


Hs.183232 


NM.02483ftHanio sapiens hypothetical prol 


ao2 



182 



wo 03/042661 



PCT/US02/36810 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



414907 




Hs.77597 


4Un(M 


#vl4«40oi 


nS^8301 


4<942UJ 


6E2S2677 


Hs.283558 


4ZZ/o3 


AXMAAittYT 


Hs.l 14311 


412133 


Uoo4dU 


Hs. 104557 


450334 


AF035S59 


Hs.24879 


417975 


AA541B36 


Hs.30085 


420162 


Bt 376432 


Hs.95577 


436561 


BE560135 


Hs.5232 


426031 


AA235251 


H&166066 


417678 


X06560 


H5.82396 


417386 


AL037228 


Hs.301957 


429983 


W92620 


Hs.260855 


417526 


AA568906 


Hs.62240 


414732 


AW410976 


Hs.77152 


409614 


BE297412 


H3.55189 


439053 


BE244588 


Hs.6456 


411096 


U80034 


Hs.66583 


433312 


AI241331 


Hs.131766 


420767 


AF072711 


Hs.99916 


429523 


AK000788 


Ks.205280 


423242 


AL039402 


Hs.l 25783 


420552 


AK000492 


Hs.98806 


413380 


A1904232 


H3.75323 


421533 


N71826 


Hs.105465 


439352 


Bfc614347 


Hs.169615 


426023 


AL038843 


Ks.374530 


431193 


AW749505 


Hs^770 


457211 


AW972565 


K5.32399 


410467 


AF102548 


Hs.63931 


422066 


AW249275 


H3.343521 


418526 


BE019020 


Hs.85838 


453012 


TB5804 


Hs.31334 


412939 


AW4 11491 


HsJ5069 


413813 


M96956 


Ks.75561 


418362 


AL031714 


Hs.84285 


431350 


AI192S28 


H3.1 64537 


417911 


AA333387 


Hs.82916 


413597 


AW302885 


Hs.1 17183 


409956 


AIAMA<%<90« 

AW1 03364 


Hs.727 


445462 


AA378775 


Hs.288649 


418245 


AA088767 


Hs.63883 


408194 


AA601038 


Hs.1 91797 


421959 


AW751497 


HSJ8370 


459306 


AVV578452 




425209 


AL049761 


H$.155140 


439659 


AW97076Q 


H859483 


408683 


R58665 


Hs.46847 


432843 


BE250865 


Hs.279529 


406684 


X16354 


Hs.50984 


410006 


AW732308 


H3J7783 


442S77 


AA292998 


Hs.163900 


425003 


AF1 19046 


Hs.1 54149 


449437 


A)702038 


Hs.1 00057 


427779 


AA906997 


Hs.180760 


446698 


Ar279265 


HSu^76 


414549 


BE393089 


Hs.183506 


410817 


A!262789 


H5.93659 


419378 


R24922 


Hs,90078 


428376 


AFl 19565 


Hs,184011 


414416 


AW409985 


Hs,76084 


434094 


AA305599 


Hs^38205 


409012 


AL1 17435 


HS.4972S 


429048 


AI372949 


Hs.44241 


426378 


U80082 


Hs.169600 


422397 


AJ223368 


H3.116051 


426715 


AB037855 


Hs.171917 


429539 


AK001839 


Hs.206501 


443715 


AI583187 


K3.9700 


453082 


H1883S 


Ks^1608 


433888 


BE176078 


Hs.30819 


A0AK7A 

4^4aJ4 


[)87662 


Hs.150275 


422556 


NliL006420 Hs.118249 


407777 


AA161071 


Hs.71465 


429626 


U367B7 


Hs.211571 


413374 


NM001034 Hs.75319 


442159 


AW163390 


Hs.278554 


400133 




Hs.184693 


419381 


A6023420 


Hs.90093 


436827 


H721B7 


Hs.356668 


426108 


AA622037 


Hs.166468 


436278 


BE398290 


Hs.5097 


425397 


;04088 


Hs.156346 



NM.000998:Komo sapiens nbosoma) proteto 
Hs.256301:hypoth8tical piotsin M6C13170 
NM.018509:Homo sapiens hypoMcsl piot 
NM»003504:Homo saptefls COC45 ceD divW 
NMJ)018S9:Honio saito sotuts earner fa 
Hs.24879:phosphati(fic add pho^atasa t 
NM_02461 6:Homo sapiens hypolheficai pro( 
NM_052984:Homo sapiens cycdtHlapendent 
NM.014165:HGnio sapiens HSPC125 fsotein ( 
()ocusQnk)NhU)06697:Hc»nosaffensdspla 
(locusIink)NM.002534:Homo sapiens 7^- 
NM.016144:Komo sapiens Sec61 ^ha fonn 
(locusnnk)NM.145175:Hoino sapiens NSE1 ( 
Hs.82240!8yntaxfn3A 

Hs.771S2:MCM7 ninichmnosome maintenance 
Mhl.01 6489:Homo sapiens S*-nudeo6das8, 
Hs.6456»hapen)nin containing TCP1. subu 
NM.005932:Homo sapiens mitoctiondrial int 
Hs.131765*£STs, Moderately simHar to 13 



Ks.205280:Homo sapiens cONA FU20781 lis 
Hs.125783:ctiromosome 1 open reading tram 
Hs.98806:hypotheScal protein FLJ20485 
Hs.75323:pn3h]bitin 

Nkl.003095:Homo sapiens smali nuclear rfb 
NM_023080:Homo sapiens hypotheGcal prol 
H5.374530:Homo saptens cONA: FU23602 II 
Hs.296770:KIAA1719 protein 
(locuslink}NM^145240:Homo sapiens simlta 
NM.080759:Homo sapiens dactishund homotog 
Hs.343521 imaiate dehydrogenase 2. NAD (m 
NM_004207:Homo sapiens solute carrier fa 
NKAjOl 2469:Homa sapiens chnHnosome 20 ope 
Hs75069:sefine hydioxymetiiyltransferase 
NM_003212Homo sapiens terslDcandnonia<l 
NM.003345:Homo sapiens utlquiSrKoniuga 
Hs.164537:ESTs 

Hs.82916:chapc»onin containing TCPl , sub 
Hs.1 17183:Homo sai^ns mflNA; cDNA DKFZpS 
NAL002192:Homo sapiens intilt)ln, betaA( 
(locuslinl()N«t024051:HDmo sapiens hypoth 
NM_020162:Homo sapiens (ransmemlxsnei pr 
Hs.191797:EST8 

NM_030622:Homo saptens cytodirome P450. 
AW578452:RCm0252^3010Omb07 CTD25 
NM.001895:Homo saptens casein kinase % 
Hs.S9483:leucinMldi repedkontahilng 
Ni\1.016514:Homo sapiens TIW and TNF rece 
NM-013237:Homo sapiens px194il« protein 
{iocusfinl()NM.001712:Homo sapiens carcln 
NM.003751:Homo sapiens eulcaiyottc transi 
Hs.163900:ESTs. Hlgfily similar to winged 
NM.014481:Homo sapiens APEX nuclease (ap 
Hs.l 00057»erine/lhreonine Idnase 35 
NM.021238:Homo sapiens TERA protein (TER 
NM_02291 1:Homo sapiens sotuts canter fa 
NM.024641 :Homo sapiens tiypoQieOcai prot 
(locuslink)I^M.00491 1:Homo sapiens pnitei 
Hs.90076:nucteoQd&«ugar transporter si 
Hs.164011:pyroptiospi»tase (inorganic) 
(locusnny)NM.032737:Homo sapiens hypoih 
Hs.238205;tiypottieticat protein PRO2013 
Hs.49725:OXFZP4341216 protein 
Hs.44241 iHomo sapiens cDNA: FU21447 lis 
Hs.169600:iaAA0828 protein 
OocusIlnl(]NM.l 38768:Homo sainens myelom 
Hs.171917:t]ypotheScal protein FU11085 
OoGusIink}NI020467ilonio saptens hypoOi 
NM.001238:Homo8aphn8CydinE1 (CCNEI) 
(locusQn)()NM.017636:KQmo sapiens tiansl 
Hs.30819:hypotheSc8i protein C40 
Hs.1S0275:KIAA0241 protein 
Hs.1 18249'^-«il)osylaQon feetor guanin 
Hs.71465siiu^Iene epoxidase 
NI^005333:Homo saptens holocylochrome c 
NM^001034:HQmo sapiens ritionudeoQde re 
NM.007276:Homo saptens cluomobox homotog 
NM.005648:Homo saptens transcr^pBon ek> 
Hs.90093:heat sliocit TOId) protein 4 
Oocusnnk}NML005274:Komo sq^tens guanin 
NNl.004708:Homo saptens programmed ceil d 
Hl5097:8ynaptogyiin 2 
ray)01087:Homo SQptens topo lso mer as e pN 
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ao2 

3,02 
3.02 
3L02 
3.02 

ao2 

3.02 
3L01 
101 
3.01 

aoi 
aoo 

100 
100 
100 
3.00 
100 
Z99 
Z99 
299 
299 
299 
299 
299 
299 
298 
298 
298 
298 
297 
297 
297 
2.97 
297 
297 
297 
296 
296 
296 
296 
296 
296 
296 
296 
296 
296 
295 
295 
295 
295 
294 
294 
294 
294 
294 
294 
293 
293 
293 
293 
293 
293 
293 
293 
292 
292 
292 
292 
292 
292 
292 
292 
292 
292 
292 
292 
292 
291 
291 
291 
291 
291 
291 
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433487* 
41G933 
430287 
434026 
447698 
411263 
432754 
437016 
446228 
420421 
428385 
414420 
457284 
401405 
453111 
400247 
421910 
413610 
413588 
418661 
427490 
417534 
435099 
42S811 
446649 
412974 
413179 
407770 
408847 
448250 
428810 
427505 
418443 
447343 
409252 
443323 
450378 
411761 
415691 
417715 
452099 
436138 
432856 
434457 
444237 
456382 
411393 
424270 
414396 
426120 
448663 
443802 
445863 



440334 
449057 
432680 
446421 
427239 
425649 
429638 
435777 
424441 
407833 
415083 
421462 
443572 
443180 
413753 
453028 
425047 
432593 
410197 
413095 
417677 
425263 
437430 



407137 
400750 
428788 
432633 
432816 



U31814 

BE561850 

AW182459 

R15466 

A)420156 

BE297802 

BE241691 

AU076916 

NVL016046 

AF281133 

AF112213 

AA043424 

AF102850 

AS014598 

NiyL014588 

A1117554 

AA971014 

NM.001949 

Z95152 

W27202 

AC004770 

AL039104 

AU076617 

R18978 

N99692 

AW607831 

AW290997 

NM_016034 

AF068236 

AA361562 

NVL005239 

AA256641 

AK000631 

BE560821 

AW249181 

AI733848 

AW963979 

AW969567 

BE612992 

H53323 

6E616609 

AF141332 

AA336878 

AVtf973003 

AW797437 

AK001818 

BE548266 

AA32S243 

BE614599 

AW504924 

R12234 

AF155108 

BE276112 

AB037784 

T47364 

BE297434 

BE270447 

U30930 

AI916662 

AW419202 

X14850 

AW955632 

A1632683 

AR)16495 

AA025610 

R1587S 

U17760 

AB006532 

U34038 

AW301003 

NKL005518 

AA494359 

NM.016055 

NM.001197 

W44671 

M28301 

T97307 



Hs.3352 

H180506 

H3.125759 

Hs.285218 

Ks^26733 

H5.69360 

Hs.3100 

Hs^9B 

Hs.14415 

Hs.343589 

Hs.184062 

Hs,76095 

Hs^7933 

HS51720 

HsJ56473 

H3,109437 

K5.119908 

Hs.75432 

Ks.1189 

Hs.178695 

Hs.82327 

H3.4755 

Hs.159557 

Hs.16251 

H5.75105 

Hs.75227 

Hs.38738 

Hs.190153 

Hs^776 

Hs.193788 

Hs.178761 

H5.85146 

Ks^94 

HS.522S6 

H$.9222 

H&154796 

Hs.71935 

H5:i4723 

H5.86366 

HS27931 

Hs.25717 

Hs^91 

Hs^00333 

HS.9&42 

Hs.179g09 

Hs.69771 

Hs.144407 

Hs.76057 

H3.166887 

KsJ56501 

H&9805 

Hs.13398 

Hs^56150 

Hs.7165 

Hs^l 

Hs^6613 

Hs.15071 

Hs.356512 

Hs.158540 

Ha211577 

Hs^86192 

H6.147097 

Hs.66666 

Hs^179 

Hs.104624 

Hs.9605 

Hs.258576 

Hs.75517 

H3^1442 

Hs.154299 

Ks^1483 

Hsi9889 

Hs.30715 

Hs.82389 

Hs.155419 

Hs.124 

Hs^ 



MMj0O15Z7:Ho(no sapions hlstone deaoe)yta 
NM„003090:Homo safHem small nuclear rib 
Hs.125759£ce}y Oflhdog oT mouse RING 
Oocus!!nk}NM.02121d:Homo saplans phospb 
NM.052858: Homo sapiens slnflar to RIKEN 
NM_006845:Homo sapiens kfnesiivEks 6 (m 
Hs^lOOilysyMRNA synthetase 
K5.5398:guanine monphosphata synOietase 
NMJ)16048:Homo sapiens axosomal core pro 
Hs.343589»(osome component Rrp41 
Hs.184062xfYon)osome 20 open reading Ira 
NM.052815:Homo sapiens immediate earty r 
NM.013338:Homo sapiens Aig5, S. cerevisi 

NM.014799:Homo sapiens hephaasfin tHEPH) 
Ni;^006713:Komo sapiens activated RNA pd 
NM.01458&Homo sapiens homionfitiy upregu 
NMJ)15934:Homo sapiens nudeoiar protein 
NM,000884:Homo sajfens IMP (Inoslne mono 
NM„001949:Homo sapiens E2F transcrfplion 
NI\4_002754:Homo sapiens mltogen-acOvated 
NM.000178:Horno sa^^ns glutathione synth 
Nh4,0041 1 l:Homo sapiens flap strudure-sp 
NM.002266:Honio sapiens karyophertn alpha 
(tocusnnl()NkL016207:Homo sapiens deava 
NM.006579:Homo sapiens emopamU binding 
NI\4.005002:Homo sapiens NAOH dehydragenas 
NM-014343:Homo sapiens daudln 15 {aONl 
Hs.1901S3:Homo sapiens cDI^ FU33988 (is 
(locusGnl(}NM.016034:Homo sapiens mitoch 
NM.000625:Homo sapiens idtiic aOito aynt 
H&17876VJt6S proteasomMsodded padi 
Hs.85146>ets erythrobtastosis vims E2 
Hs.236894:ESTs. Highly sbnflar lo 802392 
Hs.52256:hypothetica] protein FU20524 
Hs.9222»5lrogen receptor Unding sRe a 
Hs.1 54796:Homo sapiens cDNA Fli37976 lis 
NAi.021 220;Homo saplera zkic finger prote 
Hs.24723£ST8 
Hs.86366£STs 

Hs.27931diypothetical protein FU106O7 s 
Hs^5717:Homo sapiens cONA: FU23454 fls 
Hs.278591 :Homo sapiens. Similar to RNA p 
N1^018690:Homo sapiens apoHpoprol^ 84 
Hs.9842:ESTs 

()ocu8}ihk)Ni^024831:Homo sai^ens nudea 
NM„001710:Homo sapiens 8-faclor, properd 
NM.018283;Homo sapiens hypothetical prot 
(tocusllnk}NM_000403:Homo saptofts gaiad 
Hs.166887xoplnel 

(tociisllnk)NM.032335:Homo sapiens hypoOi 
Hs.980S:expoitin5 

H$.13336:Homo sapiens done 25026 mRNA s 
Hs.256150:NY^N-41 anOgen 
NM^003904:Homo sapiens zinc finger prota 
Hs^VJ<lAA1383 protein 
Oocusiink)NMj005S32:Homo sapiens Interf 
Hl1S071xh8ip6ronfn containing TCP1. sub 
Hs.356512:ESTs, similar to UBGAJ^ 
(locusIInl()NM.003360:Homo sapiens UOP gl 
Oocusfink)NM04986:Homo sapiens Wnect 
NA4.032192Homo sapiens protein phosphate 
H3.147Q97:H2A histona family, member X 
H5.66666£hromosome 7 open reading frame 
Hs^179:Homo sapiens cDNA fli12933 fis, 
NMJ)20980:Homo sapiens aquaporin 9 (AQP9 
Hs.9605.xie3vage and polyadenytation spe 
NM.012129-Jtomo sapiens daudln 12(aON1 
NI^000228:Komo sapiens lamlnin, beta 3 { 
NM_004260:Homo sapiens RecQ prDteln-Ilke 
NM,00S242:Homo sapiens coagulalton facto 
Hs^1483:Homo sapiens, SlmDar to RIKEN 
aocu5link)NM.0055t8:H(jmo sapiens 
Hs.30716:potasshffn voltag&flaled diannel 
N&^016055:Honx} sapiens mltDchondita) rfb 
NM.001197:Homo sapiens BCU-lnteracting 
NM.014628:Homo saplans gene predicted fr 
Hs2253»omptenient component 2 



AF082283 Hs,193516 
AI796390 Hs.210667 
N38913 Hsi21575 



NM„003921:Homo sapbns Mi OUympho 

H5.210667:EST8 

Hs^1575:ESrs 



2.91 
2.90 
Z90 
190 
Z60 
Z90 
290 
2.90 
Z90 
Z89 
ZB9 
2.89 
Z89 
Z89 
Z89 
Z89 
Z69 
Z89 
Z89 
Z88 
Z88 
Z68 
Z88 
Z88 
Z68 
Z88 
2.88 
Z68 
Z87 
Z67 
Z87 
Z87 
Z87 
Z87 
Z87 
Z87 
Z86 
Z86 
Z88 
Z86 
Z88 
Z6S 
Z86 
Z86 
Z85 
Z85 
Z85 
2.65 
Z85 
Z85 
Z85 
2.85 
Z65 
Z85 
Z85 
Z8S 

ze4 

Z84 
Z84 
Z84 
Z&4 
Z84 
Z84 
2.64 
Z83 
Z83 
Z83 
Z83 
Z83 
Z83 
Z83 
Z83 
Z63 
Z83 
Z83 
Z82 
Z82 
Z82 
Z82 
Z62 
Z82 

ze2 

2.62 
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410045 


AAo06930 


Ks.58169 


Hl58189:euicayo8c trsnslaSon tnlQaO 


Z9Z 


454144 


BkZB047B 


H3.182695 


NMj024O26m» saptens mtochondrial rib 


2.82 


430387 


AVV3728B4 


Hs.240770 


Hs^40770*juidear cap binding protein 8U 


2.61 


434583 


AA095761 


Hs.349092 


Hs^9092:ESTs, Weaidy similar to A42442 


ZB1 


431512 


BE270734 


Hs.2795 


Hs.2795:iactale dehydrogenase A 


i81 


428093 


AW594506 


Ks.104830 


Ks.104830:EST8 


2.81 


416047 


BE439894 


Hs.78991 


NM.012080:Homo sapiens DMA segment nuns 


Z8t 


447495 


AW401B84 


Hs.16720 


NMJI04208:Homo sa^ilens pregraniined oeQ d 


2.81 


452199 


BE255643 


Ks.110695 


Ks.110695Aypo0w8c8l protein MGC3133 


Z81 


425998 


AU076629 


Ks.165950 


NA4.00201 1:Homo salens fibroblast growth 


ZBQ 


445921 


AW015211 


Hs.153799 


Hs.1 53799:Homo sapiens cONA FU38333 (is 


2.80 


422809 


AK001379 


HS.12102B 


NM_018136:Homo sapiens hypothetical prot 


180 


417869 


BE0762S4 


Hs^793 


Hs.82793:proteasome (prosome, maoopain) 


Z80 


436127 


W94824 


Hs.11565 


NM.080748:Homo sapiens chramosome 20 ope 


2.80 


418731 


AI264688 


Hs.1197 


I^M.002157:Honto sajdens heat shock lOid) p 


2.80 


432840 


AK001403 


Hs^9521 


HSv279521:hypoMcai protein FU20530 


2.80 


428028 


U52112 


Hs.182018 


H3.182018:interieukin-1 receptor-assoda 


180 


441181 


AA416925 


Hs^74811 


Hs.37481 1 :Homo sapiens, similar to 4-1BB 


zeo 


409463 


AI45816S 


Hs.17296 


NM.023930:Komo sapiens hypotheScal prot 


Z79 


450010 


AW293801 


H&2550S2 


Hs^50S2:EST8 


Z79 


418960 


NM.004494 Hs^9525 


(tociisiiriI()NM.004494:Hoino sapiens hepalo 


Z79 


401179 








Z79 


419252 


AW138434 


Hs,129805 


Hs.129805:EST8 


Z79 


434750 


BE019254 


Hs.4112 


Hs.4112:VcompIex 1 


Z79 


412948 


BE243313 


H3.334851 


Hs.334651:UM and SH3 proteh 1 


Z79 


400529 






2.79 


436414 


BE264633 


H8.143638 


NM.033561 :Homo sapiens WD repeal domain 


2.79 


436291 


BE568452 


Hs.344037 


(locusiinI()N^4_003981:Homo sapiens piot^ 


2.79 


427963 


AK)42582 


Hs,181271 


NM.016057:Homo sapiens CGi-120 protein ( 


Z79 


426459 


AF151812 


Hs,169992 


NM-015866:Homo sapiens seroiogfeally def 


Z79 


413880 


AI660842 


Hs.110915 


NMJQ21258:Homo sapiens Interleukin 22 re 


Z79 


442468 


N77737 


Hs.8349 


NM.138933:Homo sapiens apobeo-l compteme 


Z79 


413476 


U25849 


Hs.75393 


NM_004300:Homo sapiens acid phosphatase 


Z79 


413278 


BES63085 


Hs.833 


Hs.833:interferon<timulated protein, 15 


Z79 


430120 


AW675296 


Hs.233694 


(lacus6nl()NhL018396:Homo sapiens putaB 


2.79 


452875 


BE275760 


H9^8 


NM.0061 14:Homo sapiens transtocase of ou 


tn 


417164 


AA338283 


Hs.81381 


Hs.81361:heterogeneous nuclear ribonude 


Z79 


407811 


AW190902 


HS.40D98 


Ks.40098:cysteine VstcH superfamfly 1, BM 


Z79 


409636 


AA305729 


Hs.18272 


(locusiink}NM_030874:Homo sapiens solute 


Z78 


447619 


AI174800 


H&190S4 


(|ooiislin)0NM_01BS30:Komo sapiens hypottt 


Z78 


434645 


BE267057 


Hs.325321 


H3.325321:WD lepeat domain 18 


Z78 


414862 


BE621310 


Hs.923 


Hs.923:6lngl8-stFanded DNA binding prote 


Z78 


443639 


BE269042 


Hs.9661 


Ks.9661:proteasome (prosome, macropain) 


Z78 


414045 


NM.002951 


H5.75722 


Nh4_002951:Komo sapiens rtbophortn li (RP 


Z78 


430612 


AF182294 


H&241578 


NMJ01620aHomo sapiens U6 snRNA^ssodat 


Z78 


432636 


AA340864 


H&278S62 


N^1.001307:Hamo sapiens daudin 7 (CLDN7) 


Z78 


414697 


BE266134 


H3.76927 


K5.76927iranslocase of outer mtochondr 


2.78 


420665 


AW469240 


Hs.371581 


Hs.371581 :ESTs 


Z78 


428474 


AB023182 


Hs.184523 


Hs.1 84523:ser1ne/thFeonine Idnase 38 \Jk 


Z78 


448093 


AW977382 


Hs.15898 


Hs.1 5898; 2,4<jienoyi CoA reductase 2. pe 


Z77 


443343 


BE409809 


KS.3010OS 


H8.301005:histone H2A.FS variant 


2.77 


418313 


BE244231 


Hs.84038 


NM-0l5937:Homo sapiens C(aM)6 protein (L 


Z77 


424154 


AF026004 


Hs.141660 


NMJ)04366:Homo sapiens chlorida channel 


Z77 


456950 


AF111170 


Hs.306165 


Hs.3061 65:EST8, Highiy simiiar to unknow 


Z77 


432543 


AAS52690 


Hs.152423 


K5.152423:Komo 8^)ien3 cONA: FU21274 fi 


Z77 


423271 


W47225 


H&126256 


NM.000576:Homo sapiens interteuUn 1, be 


Z77 


410595 


AW629223 


Hs.64794 


NM.008978:Homo sapiens zhic linger piote 


Z77 


448140 


AF146761 


Hs.20450 


NM.020125:Homo sapiens 6 lymphocyte acU 


Z77 


457757 


AA434109 


K5.12271 


NM_012162:Homo sapiens F-box and leucine 


Z77 


420186 


NM.01592S H3.95697 


Hs.95697:(hrer-6pecilic bHiiVZIp transcr 


Z77 


410094 


BE147897 


HS.S8S93 


NhiL004128:Hoino sapiens general transcrip 


Z77 


403817 






Z77 


459125 


AA811383 


Ks^9464 


Hs.29464:Homo sapiens cONA: FLJ23460 lis 


Z77 


432705 


A1879473 


Hs.157123 


Hs.1 571 23:EST8 


Z77 


446658 


AI440137 


H3.164989 


NM>13849?.Homo sapiens hypotheGcS! prot 


Z76 


419485 


AA489023 


Ks^9d07 


Hs.99807:Homo sapiens mRNA; cONA 0KFZp31 


Z76 


432886 


BE159028 


HS.27S704 


Hs.279704:diroinalin aooessibiGty compte 


Z76 


428438 


NM.001955 Hs^l 


NMj00igS5:HoniD sapiens endoOi^ 1 (EDN 


2.76 


414767 


BE541381 


Hs.178705 


NM.03351 5:Homo sapiens MacGAP protein (M 


Z76 


406830 


Ai829848 


K3.342389 


Hs.342389:peptidyiprotyl komarase A (cy 


Z76 


432320 


AW411066 


H8.274351 


NM.016032Honio saj^ens znic finger. DHHC 


Z76 


430450 


R235S3 


H8.241489 


Oocu5l]nk)NM.015913:HQmo82|plen8 hypoth 


Z76 


433808 


NM.014062 KS.3S68 


H3.3566:ART-4 protein 


Z75 


431890 


X17033 


Hs.271986 


NM.002203:Homo sapiens Integrin, a!pha 2 


Z75 


433369 


Z49254 


Hs.3254 


NM.021 134:Homo sapiens mitDchorulrtai rib 


275 


446946 


AI878932 


Hs^17 


NM^003286:Homo sapiens topoisomerBse [UH 


Z75 


432204 


AI916132 


KS.121S93 


H3.121693:Homo sapiens cDNA FU13123 fis 


Z75 


424438 


AA340724 


H5.271912 


Hs.271912ilomo sajto cDNA FU38690 fis 


Z75 


433862 


086960 


Ks^10 


NMJ)14873:Homo sapiens KtAA0206 gene pro 


Z7S 


417080 


8E392846 


Hs.1063 


Hs.1063xniaO nuclear ribonucleoprotein 


Z75 


428242 


H55709 


Ks.2250 


Hs.2250:leukemia Inhibitory (actor (chol 


Z7S 


416188 


8E157260 


K5.7907O 


NM_002467:Komo sapiens v-myc myelocytoma 


2.75 


436014 


AF281134 


H$.283741 


N^L0201 58:Homo sapiens exosome component 


Z75 
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419489 AW411280 Hs.90S93 NACm340OM)nnapiensrep0iat)onlni0 Z75 

407971 AI469117 Hs.62918 Hs.62918:CDC91cdl division cyde 91-6 Z75 

432403 AA550815 Hs.124840 (locusIWc)NM-138456:Honio sapiens hypolh 2.75 

410775 AB014460 Hs.66196 NfyL002528-itoTno sapiens nth endonudeas© Z75 

444197 BE266947 Hs.10590 N!yL018883:Homo sapiens zinc finger protD ZIS 

447250 AI878909 Hs.17883 NI^002707:Homo sapiens protein phosphata Z7B 

406808 AW088535 Hs.350108 H3,3501083lbosomalpnoteIalai9e,PO Z75 

4115S0 A1080088 Hs,70877 Niyi015421:Homo sapiens DKFZP564K2062 pro Z75 

433662 W07162 H3.150826 NlifL020387:Homo sapiens RAB25, member RAS Z74 

426235 AI631964 Hs.34447 H8.34447:Homo sapiens cONAnJ38512 lis, Z74 

413186 AU077141 Hs.374548 Hs.374548sohitacanief family 16 (mono Z74 

419713 AWS68058 Hs.92381 NM-019094:Homo sapiens nudb((nudeosida Z74 

410174 AA306007 Hs.59461 Ks.69461:DKFZP434C245 protein 274 

430720 U85768 Hs^47638 NM_002991:H(mio sapiens sniatltndudbiec Z74 

429345 R11141 Hs.193B95 Hs.199895JiypolhelIcal protein MAC30 Z74 

452767 AW014195 Hs.61472 Hs.61472:Homo sapiens, done 1MAGE:51B41 Z7A 

414561 AI084813 Hs.195155 Hs.195155:8olutBcanierfanAy38,n«nb Z73 

423198 MB1933 Hs.1634 Hs.1634xeDdMslon cycle 2SA 2.73 

421038 A10B0192 Hs.101282 Hs.101282:Hcnio sapiens fnRNA;cONADKFZp4 Z73 

444708 AKD00398 . H3.11747 (locusllnk)NM.017798:Homo sapiens chromo Z73 

423908 AJ008422 Hs.135183 NI^006869:Horoo sapiens centaurln. alpha Z73 

433412 AV653729 Hs.8185 N(i4.021199:Homo sapiens sulfldedehydroge Z73 

438485 W57578 Hs378718 H8^8718:Honio sapiens cONAFU33433 lis 2.73 

452461 N78223 Hs.108106 Hs.108106:ub{qidQn-iike. conlaining PHD Z73 

407699 AA825974 Hs.32646 NH.02462ZHon» sapiens hypothetical prol Z73 

412258 AA376768 Hs.96125 (tocuslinl()NM_025151:Homo sapiens Rabco Z73 

450256 AA286887 Hs.24724 H8.24724:Homo sapiens cONAFU391 85 lis, Z72 

443905 AI215948 H5.143989 Hs.1439e9:ESTs Z72 

413274 Nftt004893 Hs.75258 NM_004893:Horoo sapiens H2Ahlstonefamfl Z72 

408885 C02741 Hs.48712 NM.017948:Homo sapiens hypolheOcdprol Z72 

424685 W21223 Hs.151734 Hs.151734:nuclearlr8nsport factor 2 Z72 

424692 AA429834 Hs.151791 NM_014679:Homo sapfens KIAA0092 gene pro Z72 

413762 AW411479 Hs.848 Nlit002014;Homo sapiens FK506 binding pro Z72 

418054 NMJ)02318 Hs.83354 NM_002318:Homo sapfens lysyl oxidase^ Z72 

450164 AI239923 Hs.63931 NM.080759:Komo sapiens dachshund homolog Z71 

412115 AK001763 H3.73239 Hs.73239:hypothetica! pfotdn aj10901 Z71 

450897 W16741 Hs.351629 NM_014017:Homo S2(fcnsHSPC003 protein ( Z71 

447349 A1375546 BE743847«01577765F1 NiHJ^GC_9Ho(nosapl Z71 

445413 AA151342 Hs.12677 (locuslink)NfA,016077:Homo salens Cai4 Z71 

448826 A1S80252 Hs.255565 Hs.255565:Homo sapiens cONAFU33892fb Z71 

406671 AA129547 Hs^85754 NM_00024Sitomo sapiens met protCHnwogen Z71 

433804 NM13442 Hs.3439 Hs.34393tomatin(EPB72Hike2 Z71 

441020 W79283 Hs.35982 H3.3596ZHomo sapiens mRNA:cONADKFZp68 Z70 

458933 AI638429 H3.24763 NM.002882:Hon)O ssfiiens RAN binding prote Z70 

423787 AJ295745 Hs.238204 Hs.236204:raidearpQro complex protein Z70 

430462 A1584155 Hs.105640 Hs.10S64O-JiypothsScal protein 8C007772 Z70 

439656 AW138241 Hs.160602 Ks.160602:Homo sapiens cDNAFU36008fis Z70 

425236 AW057800 Hs.155223 NI\4-003714:Homo sapiens stanniocatein 2 ( Z70 

420085 AI74ig09 Hs.44680 Hs.44680:hypothstical protein FU20979 Z70 

448296 BE62Z756 Hs.10949 Ks.10949:Hoii»saptettScDNAFU141621!s. Z70 

430200 BE613337 Hs.234896 Hs.234898.'genilnln Z70 

424308 AW975531 Hs.154443 Hs.154443:MCM4minlchromo$omem^ntenanc Z70 

423453 AW450737 Hs.128791 NI^015939:Homo sapiens 00-09 proteln(L Z70 

421344 AVI/831030 Hs.103885 (IociJSllnk)N(yL015873:Homo6aptensvlIlin Z70 

446607 At691065 Hs.155780 Hs.l55780:ESTs Z70 

418558 AW082266 Hs.86131 Hs.86131:Fas(TNFRSF6)-a5Sociatedvlade 2.70 

443835 AF016371 H3.g880 NM.006347:Homo sapiens peptidyl prolyl I Z70 

413794 AF234532 H8.61638 NM_012334:Homo sapiens myosin X(MYO10). Z70 

451481 AA300228 Hs.295666 (locuslinl(}NMj030974: Homo sapiens hypoth Z70 

456820 B&552161 Hs.108118 Hs.10811BiiypolheflcaI protein FU22474 Z70 

425905 AB032959 Hs.318584 NM.032173:Homo sapiens hypoQieQcai prat Z69 

408089 K59799 Hs.42644 Hs.42644Uhloredoxin^ 2 Z6g 

431201 AA678405 Hs.8854 Hs.8854:Pvt1 oncogene homotag, MYC acllv Ze9 

437897 AA770561 Hs.146170 Hs.1 461 70:hypoeietical protein FU22969 . Z69 

441703 AW390054 Hs.192843 NM.022145:Homo sapiens leucine zipper pr Z69 

433916 AW732839 - Hs.3631 NK^001551:Homo sapiens Immunoglobulin (C Z69 

422516 eE258662 Hs.117950 KM.006452:Horoo sapiens pho^horibosylaini ZG9 

416084 L16991 Hs.79006 r4M.012145:Honio sapiens deoxylhymldytalB ZG9 

427464 BE262958 Hs.178292 Hs.178292:proteln O-ftjcosyltransferase 1 Z69 

453876 AW021748 Hs.110406 Hs,110406:ESTs Z69 

424373 AJ133798 Hs.146219 NM.014427*^omo sapfens coplneVn(CPNE7 Z69 

411619 A1418609 Hs.71040 NM.017818i{omo6ap{8nshypothelicdprot Z69 

413004 T35901 Hs.75117 Hs.75117interteuWn enhancer landing fa ZG9 

420062 AW411096 Hs.94785 (locusIlnk)NM.021809:Homo saddens TGFW Ze9 

446077 8E251048 Hs.22579 Hs.22579:Homo sapiens ctoneCOABP0036mR Z68 

446269 AW263155 Hs.14559 NM.018t31itofno sapiens hypothetical prot Z68 

428728 NM.01662S Hs.191381 Hs.191381:hypotheliC8i protebi LOC51319 Z68 

400263 HSJ5309 NM.001S61:Homo sapiens eukaiyoticlmnsl Z68 

421933 R98881 Hs.109655 NM.006746:Homo8^ns8excombonmklle Z68 

417750 Al^7720 Hs.260523 Hs.260S23m8uroblastomaRASvlrd(VHr8S Z68 

429671 BE379335 Hs.211594 Hs.211594:prot»eBome(prosom8.maciopain Z68 

421720 AF155096 Hs.107213 Hs.107213:liD(niin binding protsbi 3 Z68 

425801 AW82948S Hs.140720 NM.012083:Homo sapiens ftequenOyroana Z68 
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425274 BE281191 Ks.155462 H5.1S5462:MCM6 m h Och romo so ire mahitenanc 2.68 

433159 AB035898 Ks.150587 NM-020242:HofflO sapiens Unesinte 7 (K Z68 

406673 M34996 K5.198253 Hs.1982533na}orhlstooonipaMtycompl 2.68 

428206 AB020643 Hs.163006 Hs.163008dikelyhomo!Dg of mouse hepaii Z68 

429344 R34038 Hs^74664 NM.00553&Homosap!8nsinhibin.betaC( 168 

427719 A!393122 Hs.134726 9(X»slink)royL14506a.Homos8^ 168 

408113 T82427 Hs.194101 Hs.194101M)mos^nscONA:FU20869fi 2.6B 

457313 AF047002 Hs.241520 NM.005782:Homo sapiens Iraisaiptlondc 167 

413142 M81740 Hs.75212 (locuslInk]NM.002539:Homo88ptensomIth 167 

414998 NM_002543 HS.77729 NM.002543itomo sapiens oxidised bw dens 167 

432391 AI732374 Ks.33g^ Hs.339aZ7£STs,WeaMy similar to protea 167 

446342 BE298665 Hs.t4B46 Hs.14846:Hoino8apte(6mRNA;€OI^Di<FZp56 2.67 

447913 AW438602 Hs.191179 Hs.191179£STs Z67 

418738 AW388833 Hs.6682 Hs.6681soIutecaiiierfBmDy7.{csllon 167 

439586 AA922936 Hs.110039 K5.110039:EST8 167 

427477 AW973119 Hs.178391 NM.021029:Homo sapiens fibosomal protein 167 

421839 BE2S8778 Hs.108809 NM-006429:Homo sapiens chaperor^ contai 167 

400448 2.57 

418416 U11700 Hs,84999 NM-O00053:Homo sapiens ATPase.Cu**tran 167 

445304 BE613206 Hs.279607 Hs.279607:Homo sapiens cONAFU34399tis 167 

417601 NM-01473S Hs.82292 NI^014735:Homo6apiens KIAA0215genepfO 166 

444700 NM.003645 Hs.11729 NM.003545:Homo sapiens fatty-a(A^Coenzy 166 

431021 A1869664 Hs.351863 {k)cusiink)NH.00331 1 Homo sapiens (hiosu 166 

453157 AR)77036 Hs.31989 NIA.015449:Homo sapiens NICE^ protein IH 168 

453454 AWD52006 Hs.374973 NM-004697:Homo sapiens PRP4 prennRNA pro 166 

422599 BE387202 Hs-1 18836 Hs.1ie63B3ton-met85taticce]l3l, protei 186 

456248 AL035786 Hs.82425 NM.005717:Homo sapiens actin related pra 166 

427691 AW194426 Hs.20726 H3.20726:ESTs, Moderately sImH» to liyp 166 

419705 AW368634 Hs.154331 Hs.1 54331 :ESTs 2.68 

421254 AK001724 Hs.1029S0 NM_016128:Homo sapiens coat proton garran 166 

422719 BE017985 Hs.102558 Hs.102558:Homo sapiens cONAFLJ40369fis 166 

446356 Aiai6736 Hs.14e96 Hs.148962inc finger, OHHC domain oontai 166 

432435 BE218866 Hs.282070 Hs.282070'.^8 166 

433020 AI375726 Hs.227152 NlUL016391:Homo sapiens hypoUieticalprot 185 

436106 At050715 Hs.2331 Hs.2331:E2F transcription tetorS.p130 165 

431127 U66618 Hs.250581 Hs.250581:SWt/SNFrdaled.ntililx85Sod 165 

425568 AW963118 Hs.161784 Hs.161784£STs Z65 

430508 AI015435 Ks.104637 Hs.104637sohiteC8nlerMly1 (glute 165 

414761 AU077226 Hs.77256 mtO(M456:Homo sapiens enlnncer of zeste 165 

412738 K34731 Hs.74562 NM.078480:Homo sapiens fuse^ndingprol 165 

409893 AW247090 Hs.57101 Ks.57101:MCM2mirichromosomem^tenance 165 

421743 T35958 Hs.107614 Hs.107614:DKF2P564I1 171 protein 164 

428072 BE2S86Q2 Hs.182366 Nlli.016292:Homosa|^nshBatshocltpiotel 164 

417957 H53497 Hs.83006 fttl.016071:Homo sapiens mitochondrial rib 164 

409119 AA531133 Hs,4253 Hs.4253:hypothelical protein MGC2574 164 

447200 BE543146 Hs.281434 Hs.281434:Homo sapiens CONARJ31373 lis 164 

409214 AW405967 Hs.333388 Hs.3333a8:8imnar to CG371 4 gene product 164 

414883 AA926960 Hs.348669 Hs.348669:COC28 protein Wnasel 184 

433570 AI580053 Hs.109007 K3.109007:Homo sapiens. Sim!lartoLOC16 164 

408633 AW963372 Hs.222088 N1UL01 41 09:Homo salens PRO2000 protein ( 164 

447769 AW873704 Hs.320831 Ks.320831xhromosonia 20 open reading fira 164 

432964 AF118395 Hs.279865 1^.01 431 7:Homosa;rfenstran3-prenyttrBns 163 

444855 BE409261 Hs.12084 Ks.12(m:Tu iranslaiion elongation facto 163 

428144 BE269243 H8.182625 Hs.162625:VAMP (vesid&ttsociated memtv 163 

408137 AI694131 Hs.2g002 HS.29001KIAA1706 proWn 163 

418703 NM.014448 Hs.8743S HsJ7435:Rho guanine exchange f^(GE 163 

446051 BEO48061 Hs.37054 Hs.37054:ephrinV\3 163 

430024 At808780 Hs.227730 l4iyL000210'Jiomo sapiens Integiln. alpha 6 163 

406122 2.63 

420988 AW006352 H5.159643 Hs.l59643:EST8. Weakly similar to putatf 163 

436433 AW631437 Hs.5184 (Iocusnnk)NM.016397:Homo sapiens TH1-B 163 

417129 AI381800 K8.300684 Hs.300684:calcitOf*i genewdaled pepiid 163 

410397 AF217517 Hs.63042 NM_01B457:Homo8aptens DKFZp564J157piol 163 

419420 AA355435 Ks.30724 (iocusIink)N]yL001516:Komo sapiens genera 163 

400298 AA032279 Ks.61635 H8.616353ix transmembrane epHheBal an 163 

412599 AU076782 Hs.248287 (locu5llhI0NM.02112G:HbmoB^nsmercap 163 

436199 R38946 Hs.127951 Hs.127951:Homo sapiens cONAFU14503fis 163 

425081 X74794 Hs.154443 Hs.154443:MCM4 minichromosome maintenanc 163 

442025 AW887434 Hs.11810 NM.032026:Homo sapiens COAIIpiQteIn (CD 162 

437379 AL3S957S Hs.23765 H8J237653nembrBn8metallo«idopep0da8e- 162 

409703 NM.006187 HS.S6009 Hsj6009:Z^Hol^adenyt8tB synthetase 182 

419170 BE002798 Hs.287850 NM_002219-itemosapiens Integral membrane 162 

418216 AA662240 Hs^83099 Hs.283099->\F15q14 protein 162 

451926 AW134519 HsJ6125 (iocusilnl()t4M.025151:KomosaptensRa5oo 162 

413781 J06272 Hs.850 (locusIinl()NM.000883ifamo8epteR8lMP(i 162 

407236 W79485 Hs.173980 Hs.l73980»uelaar matrix protein NMP200 162 

421405 AA251944 H8.104058 NM_015957:Homo sapiens 00-29 protein (L 162 

429491 NM_012111 Hs.204041 ^^^C012111:Homo sapiens chromosome 14 ope 162 

453335 AW857376 Hs.169238 l^000149:Homo sapiens fuajsyOransferas 162 

441126 NRL000429 Ks^lS (Iocus!ink)NM_000429:Homo8^)iensmethto 162 

417404 NK1.007350 Hs.82101 (locustink}NM.007350:Homo8S|]|en8ptecia 162 

432211 BE274S30 Hs,273333 H3J£73333:hypothe&al protein FU10988 162 

446768 AR)83208 Hs.16178 NiyL012138Momo8affensapoptosisBntflQon 182 

187 



wo 03/042661 



PCTAJS02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



437033 


AW248364 


Hs.5409 


412123 


BE251328 


Hs.73291 


454128 


AL031259 


Hs^7900 


433037 


NKLOKISS Hs^79938 


414438 


AI879277 


Hs.76136 


416221 


BE513171 


Hs.79086 


443896 


AW804^ 


Ks.9950 


410007 


AW950887 


Ks^813 


412715 


NII^„000947 Hs.74519 


449864 


BE276386 


Hs.111429 


448625 


AW970786 


Hs.178470 


452835 


AK001269 


H3.30738 


410886 


AI733735 


Hs.114905 


411400 


AA3t1919 


Hs.69851 


429770 


AI766047 


Hs.99738 


425983 


AK000226 


Hs.165619 


430237 


AI272144 


Hs.238522 


419607 


R5S57 


H3^1579 


419508 


AW997938 


Hs.90786 


453258 


AW293134 


Ks.32597 


457234 


AW988360 


Hs.14355 


420911 


U77413 


Hs.100293 


418478 


U38945 


Hs.1174 


438533 


AI440286 


Hs.170673 


421699 


AL161994 


Hs.107003 


452220 


BE158006 


Hs^2296 


439148 


AA372280 


Hs.176576 


453949 


AU077146 


Hs.36927 


451110 


AI9S5040 


Hs^5398 


446291 


BE397753 


Hs.14623 


411125 


AA151647 


H5.6fi877 


426858 


NM.0O4182 Hs.172791 


442990 


AA197226 


H5.19347 


424197 


AF096834 


HS.1429B9 


445580 


AF167572 


H5.12912 


410219 


T98226 


H5.171952 


436415 


6E265254 


Ks.343258 


441153 


BE562826 




410570 


A)1330g6 


Hs.64593 


430594 


AK000790 


Hs^46885 


410315 


AI638871 


Hs^78965 


443303 


U67319 


Hs.9216 


425725 


NM.012243 Hs.159322 


449019 


AI949095 


Hs.67776 


410442 


X73424 


Hs.63788 


456629 


AW891965 


Hs.367942 


454417 


Ar244459 


H9.110826 


416330 


AU077101 


Hs.79222 


437712 


X0458& 


Hs.65844 


423750 


AF165863 


Hs.298229 


412641 


M16660 


H8.74335 


406180 






416297 


AA157634 


Hs,79172 


418803 


U50079 


Hs^556 


447532 


AM000614 


Hs.18791 


420309 


AWD43637 


Hs.21766 


447418 


AA063074 


Hs.18552 


424142 


AI678727 


Hs.378970 


428342 


AI739168 


Hs.349283 


427254 


AL121523 


H3.97774 


458778 


AW451034 


Hs.326525 


425669 


W16460 


K5.24283 


452700 


AI859390 


Hs^8940 


427678 


BE267756 


Ks.180312 


444656 


AI277a24 


H&145199 


425206 


NM.002153 Ks.155109 


416412 


NM.014742 H$.79305 


427648 


AI375722 


Ks.180062 


419193 


029643 


HsJ4789 


409984 


AW368228 


Ks.67928 


431910 


AK000142 


Hs.101774 


413010 


AA393273 


HS.75133 


452264 


AU077013 


HS28757 


419423 


D26488 


Hs.90315 


425221 


AV649864 


HS.15S1B8 


437623 


063880 


H$.5719 


444184 


T87841 


Hs.282990 


418650 


BE38675D 


Hs^O 


425368 


AB014595 


Hs.155976 


420614 


AL1 10291 


Hs.93364 


427878 


A1494291 


Hs.389171 


418862 


BES50964 


Hs.89399 


416432 


BE391767 


Hs.79322 



0ocusIin)0NM^OO4875:Honx) sapiens RNApo 162 

NM.01 8K6:Homo 8^)!ens WO repeat domain 2.62 

Hs.367900:programmedCGlldeaUi2 2£1 

NM_014158:Homo sapiens HSPC067prete!n( 2.61 

0ocusKn)gNM.003329:Homo sapiens Uto Z61 

(Iocuslln}ONM.00720&HonK>8^nsiTtioch Z61 

NM.014302Honio sapiens Sec61 gamma (SE06 Z61 

NM.014S96:Hamosapi8iis zinc ribbon domai Z61 

NM„000947itomo sapiens primasdi pdlypept Z61 

NM_032486:Homo sapiens dynactm 4 (MGC32 Z61 

NM_024829:Homo safu'ens hypothetical prot Z61 

NM.01 6087:Homo sapiens hypotheOca! prot 161 

NM.033266:HQmo sapiens ER to nucleus sH) Z60 

NM.016983:Homo sapiens nucfeolar protein Z60 

HS.99736:hypoth8Qcai protein MGC393S0 Z60 

NM.031265:Komo safdens mucin and cadheil Z60 

HsJ236522:OKFZP434P106 protein Z60 

NMJD3341G:iionio sapiens fibnnar to HYPCrTH 2.60 

NMj003786:Homo sapiens ATP-bindingcasse 2.60 

NH.005977:HomD saj^ns ring finger prote 2.60 

Hs.14355:Hcmo s^ns cDNA aJ13207 fls, Z60 

Hs.100293:CHnl(ed N-acetytglucosamino ( 2.60 

NM.000077:Komo sapiens cydbvdependenl Z60 

NM.138989:Homo sapiens retinal short cha Z60 

NMJ021 178:Komo sapiens enhancer of Invas Z60 

Hs.212296:ESTs Z60 

Oocuslink)NM.030877:Homo sapiens cateni ZOO 

(Iocusl}n!()NM.006644:Homo sapiens heats Z59 

Hs.265398:ESTs, Moderate^ similar to hy 2.59 

Hs.14623:interferon.gamnia^iica}tepro Z59 

Ks.66677:cytochrDnie b-245. afpha poiypep Z59 

NM_004182:Homo sapiens ubiqidtous^^expr Z59 

NM.032351:Homo sapiens mitochondna) rib Z59 

NM-01 5982:Homo sapiens gerni cell spedii 2.59 

NM.006109:Homo sapiens SKB1 homolog (S. Z59 

Hs.171952Mcchjdln Z59 

N^^006191:Homo sapiens profiterab'on^s Z59 

8E962826:601336534F1 NIH.MGC.44 Homo sap 2.59 

NM_0Q5356:Homo sapiens ATP synthase, ZS8 

NK.017958:Hofno8aplens hypothetical pro! 2.58 

Hs.378965:Homos9p!enscDNAFU37658fis Z58 

N&4.033340:Homo sapiens caspase 7, apopto Z58 

(iocusnnk)NM^012243;Homo sapiens solute 158 

Ks.67776:Homo sapiens, done IMAGE:S4556 158 

HSw63768:propioity1 Coenzyme A cartwxytBS 158 

HsJ67942:Homo sapiens, done I1MAGE:4701 158 

Hs.1 10826:tr1nucleot)da repeat contalnin 158 

Hs.79222:9alac(DsIdase.beta1 158 

Hsi5844:neurotrophic^rDsine kinase, r 158 

NiyL012394M)mDsapiensprelbldin2(PFDN 158 

Hs.74335:heal6hod(90kD protein 1. beta 158 

158 

Ks.79171soiut8 canter family 25 (mitoc 158 

NM.004964:i1omo8apienshbtonedeacetyla 158 

NVL01 7699:Homo sapiens hypothetical prot 157 

Hs.21766:EST8. V\teaKJy simBar to hypothe 157 

H3.18551E2)62 protein 157 

Hs.378970:HQmo s^itens cDNA FU35102 fis 157 

Ks.349283:Homo sapiens cONA FU31763 lis 167 

H3.97774:E8Ts 157 

NM.001669:Homo sapiens aiytsulfatase 0 ( 157 

Hs.24283:Homo sapiens cONA FU2SgS2 6s, 167 

NM..021259:Homo sapiens transmembrane pro 157 

NM.016065:Homo s^iiens mitochondria) rib 157 

H3.145199:E8Ts.Wea)dys)miIartohypoth 157 

NA^002153:Homo sapiens hydroxysteroidO 157 

H3.79305:KIAA0255 gene product 156 

H3.180062:prDteasome (prosome. macropaln 156 

NM.005216:Homo sapiens doDchyl-diphosph 156 

Ks.67928:£STs 156 

Hs.101774;hypotheQcal protein FU23045 156 

NM.003201:Homo sqpiens transcrfpDon tao 166 

Hsu!8757A a ns n iembrane 9 superfamily roem 156 

K3.90315:KIAA0007 proton 156 
N1^005541Homo sapiens TAR RNA polymera 156 

NiyL014865:Honio sapiens chromosome conden 156 

(h)Cus8nlONMJ)33550:Homo sqptens chromo 156 
Hl86978:prolytendope^ldasa 156 

aocust2nJ()NM.003588:Hon»88plenscu(0n 156 

Hs.99364:abhyilroiase domdn oontairdng 1 156 
H3.389171:EST8 156 
NM.005176Mino8apiensATP8ynlhase.H^ 158 
(locusIin]0Nlil.005051:Homo sapiens ghitam 155 
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456814 
446153 
454003 
411950 
457400 
426410 
402829 
425643 
400995 
452945 
417144 
422192 
424755 
410012 
431236 
449042 
420281 
418661 
406629 
421612 
446715 
431183 
407722 
427368 
426266 
409632 
442875 
456031 
442432 
437741 
448775 
414368 
432876 
431731 
425994 
445982 
444232 



AM98957 Hs^1937 

Y10805 HS.20S21 

AA058944 Hs.116602 

T2B407 Hs.61564 

AF032906 Hs.252549 

BE298446 Hs.30S890 



Hs^1937:Ttoomal prolBin. brge P2 
NVL001S36:Hoino sapiens HMTI hnRNP methyl 
H3.116602:tiypofhe{]cal protein BO009115 
m/L002619'itomo sapiens platelet factor 4 
NM.001336:Honio sapiens calhepsin Z (CTSZ 
KhL138578:Homosaftens BOMs 1 (BCU 



BE313280 Hs.159627 NM.004632:Honio sapiens death assodsted 



417686 
417933 
426812 
452313 
438317 



423599 
412525 
424291 
427581 
414987 
434274 
400282 
425322 
453344 
449915 
417691 
439012 
434931 
411678 
442315 
447140 
422385 
433517 
450230 



433001 
440773 
440587 
422813 
424259 
438075 
453204 
453665 
432353 
433271 
431770 
444019 



404826 



AW978187 

AA3821D4 

AA3051S9 

AB033094 

AW015832 

AV656840 

AW294985 

AI623693 

AA287766 

AW277078 

AF161254 

AI337735 

NM_006855 

BE252241 

BE041451 

AF083420 

W74001 

BE623003 

AA335996 

BE093569 

8E561610 

AB025237 

W70171 

AW248272 

BE266322 

AK0OO207 

BE410233 

W56010 

AW10S663 

AAre91S5 

X02308 

AF10S36S 

Y00486 

AA826401 

AA045650 

AI805664 

AA581439 

AL120051 

NM.01478B 

AA524394 

AA628539 

U63630 

6E34g075 

NM^004529 

AU076610 

BE383814 

AW966941 

AI907114 

AA173992 

AF070537 

6E549407 

AWQ22133 

AW016607 

BE395875 

AF217513 

AA352702 

AL138461 

AV556571 

AK001776 

BE090176 

R10799 

AA626250 

N^^016558 

BE621697 

BE221860 

BE173977 

AI767756 



Ks.79103 
Hs.61337 
Hs.113019 

Hs.152925 

Hs.57898 

Hs^511S 

K&30715 

Ks.323494 

Ks.23449 

Ks.181165 

H5.106196 

Hs.173919 

Hs.250696 

Hs.38041 

H5.177507 

Hs.168913 

Hs.55279 

Hs^625 

Hs^5907 

H5^178 

Hs.5609 

Ks.388 

H8.75939 

Hs.279652 

H3.211374 

Hs.165803 

Hs.13501 

Hs.347297 

H8.6947 

Ks.235498 

Hs.82962 

Hs,172613 

H$.28914 

Hs.122393 

HS.S3125 

Hs.31731 

Hs.1 52328 

Hs.144700 

H3.179703 

Hs.294022 

Hs^7783 

K5.289101 

HS.15S637 

Hs.44571 

Hs.404 

fte.82399 

K3.6455 

Hs.166254 

Hs.71465 

Hs.7956 

Hs.17481 

Hs.1 15823 

Hs.18983d 

Ks.201582 

HS.Z79609 

Hs.279905 

H8.37747 

Ks.323084 

Ks.121068 

H3.143954 

Hs.179902 

Hs.191990 

Hs.326184 

Hs^74411 

Hs.14317 



Ks.10098 
Hs.82302 



434474 
424482 
450422 
440214 



BE185499 Hs^471 

AL042938 Hs^11571 

BE268821 Hs.149155 

AA74352S Hs.60300 

AA247116 Hs.7049 



NM-030S79:HGmo sapiens cytochrome b5 out 
H&81337deetin, galactoside4)inding. so 
N1^01S931:Homo safdens f!s485 (LOC51065) 
Hs.15292S:KIAA1268 protein 
{locusfink)NM_017819:Homo seplens hypolh 
NM.00156ftHomo sapiens Inteiteukin 13 re 
Ks.3071S:potass}um voitage-oated channel 
(locusSnk)NM.017964:Homo sapiens hypoth 
Ks.23449:lnsu!in receptor tyrosine tdnas 
Hs.1 81 165:euk8iyot]c transtaSon elongat 
(tocusHnIONM-016579'.Homo sapiens 8D6 an 
Hs.173919:ESTs. Weakly sbrnlar to neuron 
NM.006855ilonio sapiens KDEL (LysV^fKGtu 
Nt^003681 :Homo sapiens pyridoxal (pyrido 
Hs.177507:hypolheQcal protein HSPC155 
NM_003576:Homo sapiens serine/threonine 
NM»002639:Homo sapiens seifne (or cystel 
H9.23825:Homo sapiens clone TCCCTA00142 
Hs.355907£ST6» Weakly simaar to protd 
NK|.024629:Homo sapiens hypotheflbal prol 
NM-020470:Homo sapiens putaOve Iransmem 
NM.002452:Hon)O sapiens nudix (nudeosida 
NM„012474:Homo sapiens tsidine monophosp 
NM.015956:Homo sapiens mttochondrial rib 
(focusl!nk)NM.14S051:Homo sapiens hypoth 
NM^017708;Honio sapiens hypothetk^l prat 
(locuslink)NM.014303:Homo sapiens pescad 
(locuslink}NM-013397:Homo sapiens ovef-« 
(tocusnnk)NM.014159:Homo sapiens HunBn 
Hs^98:hypo(hetical pmtein FU14075 
NK1.001071->lomo sapiens thymldylate synth 
NM.006598:Homo sapiens solute carrier fa 
Hs2 8914|a denlne phosphoribosyltransfeia 
Hs.122393:EST8 

N^1.004597:HonlO sapiens smaB nudear rib 
{focusIink}NM_012094:Homo sapiens peroxl 
H3.152328:EST8 

NM>004429;Homo sapiens ephrin^l (EFNB1) 
NM-014788:Honto sapiens tripaitita motit^ 
NM.032865.*Homo sapiens hypothotlcal prot 
Hs.57783:eukaiyotic translafion Initiafi 
NM-005313:Homo sapiens glucose regulated 
NM-006904:Homo sapiens protein kinase, D 
Hs.44571:EST8 

NM-004529:Homo sapiens myelokl/lymphold 
NKy)07357:HomQ sapiens component of olig 
NM.006666:Homo sapiens RuvB^e 2 (E c 
Hs.1662&4dikely orUwIog of rat vacuole 
NM.003129:Homo sapiens squabna eposddas 
HS.796&ESTS 

NM-136391:Komo sapiens hypotheOca! prot 
(locusIInk}NM.006638:Homo sapiens ribonu 
Ks.189838:ESTs 
Hs.201582:EST8 

Nhl.014342:Konio sapiens mitochondrial car 
NH.016359:Homo sapiens nucleolar protein 
NMJD22767:Homo sapiens hyp(Ai8fic&l prot 
PocusIink)N»L031 209:Homo sai^ tRKA^ 
(locuslink)NM.003270:Komo sapiens transm 
(locusiInk)NM.018270:Homo sapiens chiomo 
NMJD80546:Homo sapiens CDw92 anfigen (CO 
H8.1919g0:E8Ts 

Hs.326184:Honio sapiens nuclear protein p 
NM.016558iHomo sapiens SCAN dom^ conta 
NM-018648dtoo sapiens nucleolar protein 
NM.01Z^Homo sapiens 5"^ exoribonud 
NM.019082Homo sapiens putative nudeola 
(locusfink)NM„147174:Homo sapiens hepara 

NM_014878:Honio sapiens K1AA0020 gene pro 
(locusthk}NNL005333:Homo sapiens hotocy 
(locusank}NMJ)03374:Homo sapiens voitag 
NiyL033414:Homo saptans bypotheOcal prot 
(locuslink)NMJD18386:Konio sapiens hypoth 



^55 
2.55 
ISS 
Z55 
Z55 
2.55 
155 
2.55 
Z55 
255 
155 
155 
155 
155 
155 
155 
155 
155 
155 
154 
154 
154 
154 
154 
154 
154 
154 
154 
154 
154 
154 
154 
153 
153 
153 
153 
153 
1^ 
153 
153 
153 
153 
153 
153 
163 
153 
153 
153 
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163 
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151 
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151 
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421168 AF182277 Ks^OTBO Hs.330780xytochrom6P450. subfamily DB ^51 

43S750 AB029012 Hs.4930 Hs.499Q:K)AA1089 protein Z51 

452101 T602S8 H3.10844 NM.052972:Homo sapiens tsudn&fichalph 2^1 

436043 AWS63838 HS.16883D Hs.166830:KGnio sapiens cONAnjI 2136 fis Z51 

424909 S76187 HS.1S37S2 (lOcuslblk)NM.0043S8:HomoSBp)ensceQd 2^1 

435677 AA694142 Ks£685 Hs.6685:8tyRiUhOfnm receptor bito Z51 

406363 Z51 

452018 AW102941 Hs^11265 H3,21126S:ESTs 2^1 

409591 AA532963 Ks.gi00 Hs.9100:hypothd]ca] gene supported by A ^S^ 

421937 AI878a57 Hs.109706 NML016185:Homos4)iens hematological and Z51 

416293 BE244454 H3.79162 Hs.791623tnicturespecfScfecogniQon 2.51 

421532 AW138207 Hs.14617Q NM_Q22842:HonK) sapiens hypoiiieOcal prot Z50 

434584 OS7341 Hs.188361 H&1 88361 :Homo sapiens cONAFUl 2807 lis 2^ 

426109 AW732918 Hs.162490 Hs.1824g0deucine^PPR4notir contain Z50 

426053 U68105 Hs.172182 NM.002568:Homo sapiens poly(A} binding p Z50 

432642 BE297635 Ks.3069 NM.004134:Homo sapiens heat stiock 70(0 p 2.50 

452390 AI864142 Hs.2928B (Iocuslink)NM.022759:Homo sapiens endo^ Z50 

429023 N1^000312 Hs^l Nli4.000312M)mo sapiens pfoleinC{lnacQ Z50 

400076 ' 250 

420596 NM.002692 Hs.99185 NM.002692:Homo sapiens poIytneresePNAd Z50 

422244 Y08890 Hs.113503 NlUL002271:Homo sapiens l(aryoprtei1n(impo 2.50 

410723 AA100883 Hs.372108 Hs.372108:EST8 2.50 

435496 AW640171 K&265398 Hs.265398£ST&,iyioderateiyslntilarbhy 250 

425159 NlyL004341 HS.1S4888 Nl^004341:HomD sapiens caibamqy1i)hospha 250 

433626 AFU78859 Hs^7 NML013341:Hoino sapiens hypothetical prot 250 

448391 H71025 Hs^1075 NhL016328:Honio sapiens 6TF2I repeat doma 250 

TABl£10B 



Rcey: Unique Eos probesel identiSer number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Ptey 



CAT Number Accession 



406685 0 0 
434414 35976.1 



432407 MH1429_12 



405708 OJO 

459306 223120.-4 

447349 1083443.1 

441153 26448QL3 

TABUE10C 



M18728 

AF134164 BF809407 AA218567 BF642863 A1267168 BF876178 BG999253 AW861851 AW85B362 AI817548 BF771300AA1 13928 AA223422 
AA055556 BF773400 BP9988S9 BE081333 BE073424 BE142245 H59571 H59570 BFB71558 BF871064 BE001132 BF826831 AW754298 
AA223267 BG997895 B6997897 AW991957 AA534354 B6319501 BF736309 At694265 AA045564 BG950256 AI829309 B6987850 BE093176 
BF854337 

BG036675 BF77200S BF771866 6G860386 BG98a381 NM.005712 AF110315 BE074S34 BE182776 BE1580W BE1 57999 BE71431S AW818104 
AW847519 AA099426 AW8179B1 AW856396 BG961122 AA224498 AA308542 AW821833 BF902155 AI732411 B677B834 BG283641 BE748279 
6E748870 BG319540 BE748864 BF739224BG986155 AK057283 Bt861466 AA663341 AA457591 66949294 AW392686 AA071122 AA227849 
AA584918 BG959570 BF773486 AL041698 BFg59013 R87170 016859 BF770411 6F771298 AI075321 LI 3823 AA21 6700 6F771864 AW861659 
BE537068 C18935 AA1557ig BF771 172 BF769107 BFB04964 AW818172 AWB18143 AW392930 AW817057 AW858044 8F746211 AA179928 
AW8616e7 AW821826 BI055726 BF242643 AA207189 BF770412 BF771 157 BG430030 AA0S5592 
AI2827S9 
AW578452 

8E743847 AW809803 BltM69626 A1375546 
BES62828 BE378727 



Pkey: Unk|ue number ootresponding to an Eos piobeset 

Ref. Sequence source. The 7 digit nuntben in this column are Genbank Idento 'DunhamLetaL'reftotothepubOcaOonenMTheDNA ' 

sequence of human chromosome 22' Dunham I. et al., Nature (1999) 402489495. 
Stiand: Indicates DMA sfrand from which exons were predicted. 
NtjnslOofl: bxHcatssnucteolide portions of predlctBd axons. 



Pkey 



Ref 



Slrend NLposiBon 



404519 


8152000 


Ptus 


12817-13000 


405399 


9256288 


Minus 


63448-63554 


403220 


7630989 


Phis 


64338^17 


404661 


9797073 


Plus 


33374^675,33769-34008 


402496 


9797769 


Minus 


8615-9103 


403055 


8748904 


Mnus 


109532-110225 


400965 


7770576 


Moius 


173043-173564 


403218 


7630969 


Ptus 


56039-58149 


401866 


8018106 


Rus 


73126-73623 


403221 


7630969 


Plus 


6629446438.6693&«7124 


401519 


6649315 


Ptus 


157315-157950 


405451 


7622517 


Minus 


145949-146227 


403532 


8076842 


Minus 


8175081901 


402944 


9368423 


Ptus 


110411-110716.111173-111640 


403219 


7630969 


Rus 


61858^1895 


403381 


9438267 


Minus 


26009-26178 


403485 


9966528 


Rus 


288&^1.3198-353236554117 


405464 


5922025 


Rus 


199214-199579.199672-199920200262-20049 


404684 


9797403 


Minus 


110881-111020 


402474 


7547175 


Minus 


53526-53628,55755-55920^530^57 


405506 


6466469 


Rus 


80014«}401,80S33-8112S 


403739 


7630882 


Rus 


4456^6G,4820948483LS225&«2495 
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406545 


7711S10 


Plus 


145662-145781,147854-147984.148098-14824 


401405 


n68128 


Minus 


69276^52.6954^69958 


400750 


8119067 


Rus 


198991-199168,199316-199548 


401179 


9438647 


Plus 


113477-113893 


400529 


9796988 


Plus 


138232-136423 


403817 


8962065 


Phis 


110297-111052 


400448 


9867687 


Minus 


177372-177674 


406122 


9144087 


Minus 


3094(V31386 


406160 


7283201 


Minus 


3892M9107 


402829 


8918414 


Plus 


101532-101852.102006-102263 


400995 


6099094 


Plus 


141188-141601 


404826 


6572184 


Plus 


47726^046 


406363 


9256114 


Plus 


14403-14602.17000-17147.17241-17368 



I 



Tabte 1 1 A Dsb about 958 genes up^ulated jn ooton cancer comp^ 

likely to encode proteins amenable to modufafion Iqr small mdecdes. pepSdes. or antibodies. These were selected from Ibe starting collectjon of 69580 probesets on the 
Aflymetrix/Eo5-Hu03 GenoChlp® anay as (bitows: (he lafflo of "average" colon to "average' nomial adult (issues was greater (ban or equal to 3,0. the "average' coton level was set to 
the 90th percentile value amongst coton primary cancer specimens and colon liver derived melalases, (he "average" nomial adult tissue level was set to the 70th percentHa value 
amongst non-malignant fissues minus the cotonlc derived samples, the "average" cdon value was greater than or equal to 50 units, and the predWod protein contained a structural 
domain that is indicafive of have an oncogenic function or of transducing an btlracenular signal or of being moduiataUe by small molecules, peptides, or antibodies (e^). pkinase 
deallHfomain. 7tra. phosphatase. orioLtransporter). h order to remove gene«ped6c background bvels of noiKspecic hybridization, the 15lh peroeniae vahn amonostthe not*. 
maGgrtant tissues was subtracted fipom both the numerator end (he dsnon^ 

TA8LE11A: 958 genes ujKegulated In colon cancer compared to nonnal adult tissues ex^^ 

Ptey: Unique Eos piobeset identifier number 
ExAocn: Exemplar Accession nuntber, Genbank accession number 



lAnlgene Title Rl 

NM.006149-itomo sapiens lectin, galactosi 37.18 

{locusllnk)NM_004363:Homo6apenscarcto 31.24 

(locusfink)NM-002483:ii}mo saj^ns caroin 24.81 

(locusnnigNM.002483:Homo sapiens cardn 20.54 

Hs.356183£ST8.We8k)y8lmlartoS3B4Ji 20.36 

NM.001285.'HomosapienschlorUechannel. 20.13 

19.89 

NM.033049:Komo sapiens mucin 13, epithel 1 9.68 

NMJ002457:Komo sapiens mudn 2. intesSn 19.55 

NM.014471:Homo sapiens seitne protease i 18.33 

NM_007329:Homo sapiens deleted In mal^n 17.47 

NM_005143:Homo saj^ens haptogtoWn (HP), 17.37 

NM-006507:Homo sapiens regenerafing iste 1 6.99 

NM.001804:Honu> sapiens caudal type homeo 15.15 

NM.005141itonM)6apien8librinogen, 6 bet 15.02 

NM.003225.'Hano8apiensb^factor1 14.23 

(k)Cus1ink)NM_006890:Homo sapiens caidn 14.12 

Nhl.032044:Homo sapiens fegenersSng gene 13.64 

NM-020384:Homo88p!ensdaudIn2(CLON^ 13.48 

NM-005814:Homo s a piens glyccproteln A33 13.43 

NA^004063:HoniO8aptenscadherin17,Uc 13.20 

Hs.572:orosomucoid 1 13.06 

NM.004591i{omo6arMens small indudbtec 12.58 

NMJ004467:Komo sapiens librinogen^lkBl 1234 

Hs.124660:EST8.Moderate|yslmil8rto21 11.72 

NM.017625:Homo sapiens IntetecHnflTUI) 11.72 

NM.138805:Homo sapiens family with seque 1 1.49 

11.25 

NM.001633:i{omo sapiens aipha-l-micreglob 11.16 

NM.002644:Homo sapiens polymeric imniunog 1 1.12 

NM_000477*tomo8ap»ns albumin (ALB). mR 11.01 

NM-002426:Homo8apten8 matrix mkaSopro 10.70 

NM.017763itomo8aptens hypothetical prot 10.69 

NM-138938:Homo saptens pancreaSSs-asso 10.57 

Nhl.005588:Homo saj^ns meprin A. alpha ( 10.48 

NM.002421i{omo saf^ matrix metallopro 10.39 

OocusBnK)NM.006830:Homo saptens cardn 10.20 

H5.2S6538:prost3to (fifferenHalion facto 10.19 

N^L000111:Homo saptens sohJto carrier fa 9.91 

Hs.61311:£STs, Wteakly similar to S10590 9.72 

NM.003890:Homo saptens IgG binding pr 9.70 

NM_021804:Homo saptens angtetensin 1 con 9.65 

ite.352054:pfegnancy specific beta-1-glyc a52 

NM.000638:Homo sapiens vitronectin (senj 9.47 

NM_000613«omo saptens hemopaxln(HW(), 9.41 

NM.000509itomo saptens fibrinogen, gamma 9.39 

Hs.120879:Hoino sapiens, done M6C:32871 9.31 

Hsi2961:Homo6apters,Glon8MGC22588l 9.30 



UnigenelO: linigene number 


Unigene T18e:Unigene gene Gtte 


Rl: 


Rafio of tumor to normal 


Pkey 


ExAocn 


UniaenelD 


436749 


AAS84890 


Hs.6302 


406690 


M29540 


HS.22DS29 


407242 


M18728 




408685 


M18728 




431912 


A1660552 


Hs.356183 


428934 


AF039401 


Hs.194659 


406687 


M12523 




437935 


AW939591 


Hs.5940 


446787 


U67167 


Hs.315 


423541 


AA295922 


Hs,129778 


421341 


AJ243212 


Hs.374281 


414386 


X00442 


Hs.759g0 


416768 


AA363733 


Hs,1032 


422578 


AF239666 


HS.154S 


441031 


All 10684 


Hs.7645 


421582 


AI910275 


Hs.350470 


407243 


AA058357 


Hs.74466 


422260 


AA315993 


Ks.105484 


432542 


AW083920 


K5.16098 


«4212 


NM-005814 


Hs.143131 


418888 


AU076801 


Hs.89438 


453663 


X02544 


Hs.572 


413719 


BE4395B0 


Hs.75498 


436217 


T53925 


Hs.107 


421100 


AW351839 


H8.124660 


409153 


W03754 


HS.S0813 


452316 


AA298484 


K16126S 


406399 






414463 


T69078 


H3.76177 


421964 


X73079 


Hs.286579 


407007 


U22961 


Hs.164411 


423673 


BE003054 


Hs.1695 


447400 


AK000322 


Hs.18457 


450685 


L15533 


Ks.423 


427563 


MB2962 


Hs.179704 


418007 


M13509 


H3^3169 


406741 


AA0583S7 


Hs.74466 


422424 


A11S6431 


Hs.296638 


423371 


AU076819 


Hs.1650 


452304 


AAQ25386 


Hs.61311 


422106 


D84239 


Hs.1 11732 


430569 


Al^41254 


Hs.178098 


406687 


M31126 


Hs.352054 


428355 


BE256452 


Hs.2257 


422281 


M38803 


Hs.346935 


413585 


AI133452 


Hs.75431 


422664 


AA31S833 


Hs.120879 


417931 


W95642 


H8.82961 
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422487 
413936 
424687 
420344 
44G921 
428470 
452S94 
422310 
421907 
424326 
435538 
413881 
443426 
436972 
430677 
409832 
423803 
430272 
451035 
422330 
425976 
411625 
451917 
410416 
418318 
414617 
447342 
452194 
414616 
417491 
423445 
403220 
415992 
449722 
414798 
415214 
411976 
422511 



AJ010901 

AF113576 

J05070 

BE463721 

AB012113 

AO002301 

AU076405 

AA316622 



431301 
426970 
424273 
431657 
431330 
425983 
408243 
428187 
408704 
428753 
426227 
419354 
414987 
422627 
407786 
414809 
425280 
432179 
443957 
426174 
430135 
420542 
409453 
408482 
444151 
444381 
421408 
430204 
443991 
428874 
408908 
411142 
452281 
412115 
421379 
433083 
403218 
412104 
449027 
429345 
414753 
415000 
425208 
432978 



NNL014479 

AB011540 

LiX)190 

AF(»8158 

AA284579 

Z26317 

W74001 

NM.005709 

X04698 

AU076785 

D30783 

C75094 

AK000334 

AW391351 

D31382 

U47732 

AI339520 

AI199268 

AI694413 

Y1370g 

AW376842 

NhL014324 

C05837 

BE280074 

AI266323 

At445236 

AI916058 

AU076442 

NWL000492 

AA502384 

BE276891 

W40460 

A)345227 

X69532 

AK000226 

¥00787 

At687303 

AA05663S 

AWS39252 

U67058 

M62839 

AA524394 



AA687538 

At434699 

U31519 

X75208 

AA521049 

AA5479S9 

NM_000035 

NM_000505 

AI885516 

NM.000676 

AW972917 

BE387335 

AI688223 

AA618335 

NM_002250 

W32133 

BE296227 

NM_014256 

T93500 

AK001763 

Y15221 

ALD42759 



Ks.198267 

Hs^681 

Ks.151736 

HSS7101 

Ks.16530 

Hs.184507 

Hs.29981 

Hs.98370 

Hs.109358 

Hs.145296 

Hs>1930 

H5.75599 

Hs.6329 

H3^5640 

Hs.359784 

Hs^5279 

Hs.132945 

Hs^37658 

Hs.430 

Hs.1 15263 

Hs.334514 

Hs.352415 

Hs.50820 

Hs.63325 

HSJ4072 

H8^17 

Hs.19322 

Hs.373599 

Hs.77399 

Hs.1085 

Hs.128749 

Hs.145807 

Hs.23960 

Hs.97411 

Hs.125124 

Hs.144583 

Hs.1 17938 

HS.G63 

Hs.151529 

Hs.194691 

Hs.144442 

Hs.105448 

Hs.2777 

Hs.165619 

Hs.624 

Hs.285529 

Hs.5366 

Hs.192927 

Hs.154299 

Hs,1252 

H$.294022 

Hs.118787 

Hs.38972 

Hs.77356 

Hs.1872 

Hs.2913 

H8.3S3013 

H3.1 15838 

Hs,234234 

Hs.1321 

Hs.95612 

Hs.45743 

Hs.128749 

Hs.283713 

Hs.91096 

K5.356664 

Hs.10082 

Hs.194366 

K8.250822 

HS.69D09 

Hs.28792 

Hs.73239 

Ks.103982 

Hs.191762 



AW205197 Hs.240951 

AJ271216 Hs.22880 

R11141 H8.199695 

AF1582S5 Hs.77225 

AW025529 H8.239812 

NlyL002153 Hs.155109 

AF126743 Hs.27g864 



NM.01 8408:Homo sapiens mudn 4. tracheob &01 

NM.OOQ295:Honio sapiens serine (or cystsi a99 

NM.004994:Homo8afdensnBtrixmet^lopro aso 

NM.014373:Homo sapiens putaOvaGprotei 8.71 

NM.002988:Homo small btdudbte c a67 

Hs.l84507:Homo sapiens, simBar to Homol a47 

H5.29981so!ute carrier farraly 26 (sulfa a47 

(locusIin)()NM.030622:Honro sapiens cytodi a43 

Hs.l0935B:ATPase.aassV,type10B a34 

NH.014479:Homo8apiensAOAM^,decysi ai2 

HS.4930-JOW density OpoprDlein receptor a09 

(tocusllnl4NI\1.000488M)ino8a|toserhe 7.96 

(tocusiInJ0NIA_012112:Hoiiiosaplensd!romo 7.92 

Hs,25640slaudin3 7.89 

NM.001943:Honio8apiens desmoglein 2(D5G 7.87 

NM.002639:Homo sapiens sertoe (orcystoi 7.71 

(tocusIlnl()NI\/L005709:HDmo sapiens PDZ-73 7.58 

Hs.23765BapoOpopTotelnA4l 7.48 

NM_002670:Homt) sapiens plasQn 1 0 isof 7.31 

I^W1432:Konio sapiens epireguIin(EREG} 7.31 

NM.025257:Kofno sapiens chromosome 6 open 7.29 

I4M.01 7767:Homo sapiens sotute canier fa 7.23 

H3.50820:hypoMcal cardlac/^lteietal m 7.21 

NM.01 9894:Homo sapiens transmembrane pro 7.12 

NM.004616:Homo sapiens transmembrane 4 s 7.12 

0ocuslIrd()NK^O2513O:Homo sapiens liypoth 7.10 

Hs.19322:Homo sapiens. Smflar to f^KEN 7.08 

Hs.373599:EST 7.02 

NH.001265:Homo sapiens caudal type ixsmeo a97 

NM.004963:Homo sapiens guanytato cyclase ags 

NM.014324:Hon» sapiens alptta-mdhyiacyt- a96 

a95 

H3,145807AypoihetJcal proleln FU13593 a87 

Hs.23960:cycnn B1 a87 

Hs.97411:lvpolhefical protein MGC12335 6.80 

NlWL004442:Homo sapiens EphB2 {EPHB2), Ir 6.78 

Hs.144583:Homo sapiens, done {MAGE:3462 &76 

NM.000494:Homo sapiens coHagen. type XV a66 

NM.000492:Homo sapiens cysdc fibrosis I a65 

Hs,15l529:ESTs a62 

NM.003979:Homo sapiens refoioic acid Ind a59 

NM.003561:Homo sapiens phosphoDpase A2, a56 

Hs.105448:pratBinidnase, lysine deflcle a54 

NK1.00221 &Homo sapiens inter-alpha (glob asS 

NM.031265:Homo sapiens mudn end cadheri a50 

NI\4_0005B4:Homo sapiens interteuldn B (IL a47 

Hs.285529:Gproteirvcoupted receptor 49 6.46 

NM.130053:Homoeapiens epidemial growth 6.45 

NM.017726:Homo sapiens protein phinphata 6.41 

(Iocuslink]NM.005242:Homo sapiens coagul a41 

NN1.000042:Homo sapiens apoDpoprotein H 6.27 

NM_032865:Homo sapiens hypdheGcat pro! 6.20 

Hs.1 18787;transfomilng growth factor, be ai9 

NM_005727:Homo sapiens tetraspan 1 (TSPA ai9 

H5.77356lF8nsfenin receptor (p90, €071 ai8 

NM.002591:Komo sapiens phosf^oeno^iyniva ai6 

NM_004443:Homo sapiens EphB3 (EPHB3), mR at6 

Hs.353013:citromo8om820openiBadinafra aiS 

Hs.115836:HST6 aiO 

NM.000035:Homos8piensaldolase8.1ruct a07 

NM.000505:Homo sapiens ooegulatton facto a08 

H3.95612:ESTs a06 

NM.00067&Homo sapiens adenosine A2b rec a03 

(locusiinIc)NM.014324:Homo sapiens alpha- ag9 

NM.138455:Homo sapiens cdtagen (ripte h a97 

NM.05281&Homo sapiens tripartite motif- a95 

H3.356664:hypothe9cal protein FU32334 ' a92 

Nh^002250:Homo saf^ens potassium Inierma a90 

Hs.194365:transthyn9tin (prealbumin, amy a68 

CocustinI()NIA.0031S8Mno8apienssertae a88 

NVL014256:Homo8apl6nsUDP.GIcNAcbe1aGa a86 

HsJffl792:Homo sapiens cONA RJ11041 fis, a82 

H3,73239-Jiypolheiical protein aJ10901 a77 

NM.005409:Homo sapiens small todudbie c a76 

Hs.191762iiypothe8c8l protein M6C20256 a75 

a74 

{locuslink)NM.033120:Homo sapiens naiced a72 

NM_005700i{omo sapiens (fipeptldylpeptida a72 

Hs.199695:hypotheticaiprDtoinMAC30 a72 

NMj006437:Ho(no sapiens AOP^fbosyOransf a72 

Hs.239812»eraloolcaiiy defined brsasl c a71 

NM.002153M)n)O8apienshydrajqrst8iQld(1 a70 

NM.013238:Homo sapiens DNAJdomain-conb a70 
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4(»889 
414052 
413916 
418322 
433437 
424010 
43874S 
414590 
457001 
423164 
409757 
425397 
403221 
420981 
417165 
419508 
41C8S0 
436251 
430803 
450505 
422535 
441384 
425834 
430680 
432378 
419893 
422163 
447320 
415927 
416203 
407944 



409231 
446051 
432269 
427557 
414639 
422765 
428479 
425873 
432575 
409142 
427747 
422609 
414361 
452940 
435849 
411257 
416065 
406673 
451541 
429833 
411393 
445109 
431548 
419574 
426450 
431211 
437009 
439453 
430696 
431779 
436469 
414108 
422539 
428024 
428407 
407811 
409162 
434370 
413763 
405484 
410639 
428953 
447343 
421462 
415474 
415099 
417115 
426761 
453751 
452721 
424905 



AW630041 K5^6937 NM.021978:Komo sapiens suppiesslon of tu 5.67 

AVI576849 H5^83552 H3.2B3552iiypo0}et}cal proteta BC016153 167 

N49813 Ks.75615 NluL000483:Harm sapiens apoOpoproteinC- 5.63 

AA284156 H5-84113 r^005192:Hon» sapiens cydbvdependent 5.62 

U20536 K5^280 NM-001226:Komo sapiens caspasa 6. apopto 5.60 

AL0801B8 Hs.137556 NNL03310ftKomo sapiens MTiiratocadltt^ 5.S9 

AI885815 Hs.184727 Hs.184727:ESTs.Wb8(tdysinflarbT45738 &S6 

NM_C00506 K5.76530 NM-00050a'Hom08aptensooaoul3ilon facto 5.56 

J0325B Hs.2062 Hs.2062;vitamlnO(1,25.dihy*wyv^ 5.56 

AK000232 H5.124635 h(M.019062:Homo sapiens hypolheOcalprot 5.54 

NM_001898 Hs.123114 NM.001898:Homosaitenscys1a6nSN(CST1 5.53 

JD4088 Ks.156346 NM_001067i1omos8piens(opdSQmeTB5e(ON 5.53 

5.52 

L40904 Ks.100724 NM-005037:Homo sapiens pensdsoma proGf 5.52 

RB0137 Hs.302738 Hs.302738:Homo sapiens cDNA: FU21425 fi 5.48 

AW997938 Hs.90786 NM-003786i1omo sapiens ATP-tindlngcasse 5.44 

AW362867 Hs^02738 Hs.302738:Homo saptens cDNA: FU21425 fi 5.42 

BE515065 Hs.296585 (locu5iirdONfyL006392:Homo sapiens nudeo 5.41 

AA148164 Hs^472ro Hs.247280.x}irDmosom8 20openreadiiQfira 5.38 

NM-004572 Hs.25051 NiyL00457XHonio sapiens plalwphinn 2 (PK 5.34 

AA311914- Hs.154578 H5.154578:Homo8ajfensml«IAforRJTO256 5.33 

AA447849 Hs.288660 Hs.2B8660:Homo sapiens cONA: FU22162li 5.32 

NM.001639 Hs.1957 Hs.1957:8mytohiPoQnuionent.8enfln 5.31 

AW138724 Hs.168974 H5.168974:E8T8 5.S 

Ai493046 Hs.146133 Hs.146133:EST8 5.25 

AA133749 Hs,301350 Hs.3013S0:FXYDdomajH»nlaininfltonlra 5.24 

AF027208 Hs.112360 Hs.112360:prominiivflc8l (mouse) 6.21 

AI675419 Hs.164464 KS.164464:Homo sapiens, done MGC:23656 5.21 

AL120168 Ks.78919 NM.021083:HoR»8^)iensKell blood ^oup 5.21 

X54g42 Hs.83758 NM_001627:HomosapiensCOC28pfoteinldn 5.20 

R34008 Hs.239727 NM_024422i^ sapiens desmxollin 2 (DS 5.20 

M26301 Hs.2253 Hs.2253:coinplement component 2 5.19 

Ks.692 NM.002354:HonK) sapiens tumoF'^SQCiated 5.19 

Hs.37054 Hs.370S4:ephrin^ 5.15 

Hs.2942 Ks.2942:ma«ophao8stimutaIing1(Qcepto 5.13 

Hs.179657 NM.002659:Homo sapiens plasmhogenactiv 5.11 

H5.76716 NM.002217:Homo sapiens pro-alpha (giobul 5.09 

Hs.1578 NM_001 168:Homo s^ns bacuioviral lAP r S.08 

H&334562 NM.001786:Homo sapiens ceS division eye 5.08 

Ks.160417 H3.160417:transm8ml)rane iHotein 2 5.07 

Hs.194348 Hs.194346:Spir.2 protein 6.07 

HS.S07S8 Hs.50758:SMC4stnJCturd maintenance of 5.07 

HS.1806S5 (iocusi]nk)NM.004217:Homo sapiens serine 5l06 

Hs.118721 N^^000434:Homo sapiens siaTidaselOyso 5.06 

Hs.204044 Hs.204044:ESTs &04 

Hs.2173 N^002033:Homo sapiens Ikioosyltransferas 5.03 

Hs.16098 Hs.16098:claudin2 5.03 

Hs.1 15274 Hs.115274:lndian tiedgeliog tiomoiog (Droso 5.01 

Hs.78996 NM_002592:Homo sapiens proliferaling eel 5.00 

Ks.ig8253 Hs.198263:ma]orhistocon4StIb]]nyoompl 4.99 

Hs.26557 NM_007183:l^mo sapiens pialcopiiinn 3 (PK 4.99 

Hs.288627 NM_012079:Homq sapiens diacyfglycefolO- 4.98 

H3.69771 NMJ)0171 0:Homo sapiens Wactor. pfopefd 4.98 

Hs.1 2330 NM.001247:Komo sapiens ectonucleoside tr 4.98 

Hs.9711 Ntl/t017515:Homo8apiensnovelprateln(HS 4.97 

Hs.91165 Hs.911&S:liypot}ie6calpretalnFU11127 4.97 

Hs.184339 NM.014791:Homo sapiens maternal embryoni 4.95 

Hs.323733 Hs.323733:gap juncGon protein, beta 2. 4.95 

Hs.5394 NM.005379:Homo sapiens myosin iA(MY01 A) 4.93 

Hs.6566 Hs£566:ftyrold hormone reoqrtor interac 4.93 

Hs.59509 H8.59509:EST8.Wbakly6imlIarto6lmirar 4.93 

Hs.268571 GocusIink]Nhi.001645:HOmosaplensapo)]p 4.92 

Hs.51 98 H8.5198:Down syndrome critlca) region ge 4.91 

H8.75761 NM.003137:Homo sapiens SFRS protein Una 4.91 

Hs.118138 Nli4.033013:Homo8apiensnucIearrec8ptor 4.89 

Hs.2236 H8.2236:NIMA (never in mitosis gene aK 4.89 

Hs.184194 NM.003963:Homo sapiens transmembrane 4 8 4.89 

Hs.40098 Hs.40098:cysteine tcnot superMy 1, BM 4.88 

HS.SQ86& Hs.50888:8Olute carrier fatniiy 22 (organ 4.88 

Ha.S8346 NM.022336:Homos8piensectodysp(3sin1, 4.87 

Hs.75517 NM.000228:Homo sapiens laminln, beta 3 ( 4.67 

4.87 

Hs.65234 (bcusfink)NiyL017695:Homo sapiens DEAD/H 4.87 

Hs.3481 83 NM.003623:Homo sapiens tun»r necrosis fa 4.88 

Hs.236894 H3.238894:ESTs.Kl9tiiy8imnartoS02392 4.83 

Hs.104624 KKt020980:Komo saptens aquapoiln 9 (AQP9 4.81 

HS.764S7 NM.0142S2:Homo sapiens soiute canter fa 4.79 

Hs.77917 NM_006002:Homo sapiens ubiqulHncarboxy 4.79 

Hs.334612 NAiL003094:Homo sapiens smaD nudear rib 4.79 

Hs.172089 H3.172089:pnH}noosis receptor inductng 4.78 

Hs.101282 Ks.101282:Homo sapiens mRNA; cDNA 0KFZp4 4.77 

Hs.301871 Hs.30ie71soliitec8rrfertBmiiy37(glyc 4.76 

NM.O02497 Hs.153704 NWL002497tHon¥)saplensNI»ltA (never to iri 4.76 

193 



6&048061 

NM.002447 

NM_002659 

X67055 

AW409701 

Y00272 

NM-013390 

AA553722 

AL136877 

AW411425 

246023 

AI086138 

AA029722 

BE305242 

AA628967 

6E267g31 

11^96 

BE279383 

NM_012079 

AW797437 

AF039916 

Atd34273 

AK001969 

NM-014791 

M86849 

AF127026 

BE284974 

AA531276 

AW971178 

AK001455 

AI267592 

AJ009936 

Z29087 

NM.003963 

AW190902 

H25530 

AF130988 

U17760 

8E269047 

AA308610 

AA2S6641 

Af=016495 

NM.014252 

AI492170 

AW952792 

AI015709 

f^38762 
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421343 
400529 
407233 
447472 
447966 
439963 
407103 
405556 
421506 
444700 
443464 
441623 
423068 
422714 
403739 
427490 



BE616520 Hs.343912 KM_033504*fomo sapiens CAC-I (CAC-lXmR 



X16354 Hs.50964 

AW207347 Hs.211101 

AA340605 Hs.105687 

AW247529 Hs.6793 

AM24861 H5.2S8301 

BE302796 Hs.105O97 

NML003645 Hs,11729 

BE546446 Hs.321579 

AA31S605 H5^8710 

M25629 H8.123107 

AB018335 Hs.119387 



412723 
424825 
447335 
424441 
432160 
414695 
450737 
435327 
411263 
408056 
409964 
417576 
432407 
439975 
409213 
403219 
412974 
406194 
422237 
4S6906 
425123 
425743 
406684 
439560 
411126 



434263 
431945 
422616 
452299 
414998 
452888 
442013 
450334 
445417 
433662 
419559 
425880 
408847 
431836 
435777 
422867 
431350 
432593 
421975 
412870 
412133 
422293 
425998 
453082 
453111 
432677 
429271 
426108 
412612 
424865 
427239 
413254 



417526 
413186 
436391 
413219 
452017 
408113 



Z95152 

AF038007 

AA648459 

AF207069 

BE617695 

X14850 

AK000224 

BE439915 

AW007152 

BE301671 

BE297802 

AA312329 

AW368226 

AA339449 

AA221038 

AW328061 

U61412 

R18978 

AA601038 

M13149 

AF117646 

AW20S274 

BE396495 

X16354 

AFD86401 

NMJ)01202 

AF112213 

N3489S 

AW000827 

6E300330 

AVII206330 

NM.0Q2S43 

AW955454 

AA506476 

AF035959 

AK001058 

W07162 

Y07828 

L29339 

AW290997 

AF178532 

AW419202 

L32137 

AI192528 

AW301003 

AW961017 

N22788 

U83460 

X94453 

AU076629 

H18835 

AB014598 

NM.004482 

AF039850 

AA622037 

NM.000047 

AF011333 

BE270447 

U40Z72 

AW970780 

AA568906 

AU077141 

AJ2Z7892 

AA878200 

AF109302 

T82427 

Ai916662 



Hs.178695 

Hs.166196 

Hs.335951 

Hs.153357 

Hs.286192 

Hs.147097 

Hs^789 

Hs.76913 

Hs.53325 

Hs.4867 

Ks.69360 

Ha.42331 

Hs.67928 

Hs.82286 

Hs.6817 
H5.51133 

Hs.75105 

Hs.191797 

Hs.1498 

Hs.156637 

Hs.154695 

Ks.159428 

Ks.50984 

K5.293847 

Hs.68679 

Ks.184062 

Hs.79187 

Hs.11962 

Hs,1 18725 

HsJ5S663 

Hs.77729 

Hs.30942 

Hs.375009 

Ks^4S79 

Hs.12680 

Hs.150826 

Ks.gi096 

HS.19&4 

Ks.190153 

H8.271411 

Hs^86192 

Hs.1584 

Hs.164537 

Hs.514e3 

Hs.6469 

Hs.82407 

Hs.104557 

Hs.1 14366 

Hs.165950 

Hs.31608 

Hs.31720 

Hs.278611 

Hs.198515 

Hs.166468 

Hs.74131 

Hs.153563 

HS356512 

Hs.75253 

KS.S9483 

H3.62240 

Hs.374548 

Hs.146274 

H$.1 18727 

KS.Z7495 

Hs.194101 

H8511577 



(loajsnnk)KM.001712:Homo sapiens cadn 
Hs.211101:EST8 

(locus6nk)NM.1 4522H(Hno sapiens sinila 
H&.6793tptaielel-acQvaSng factor acely 
H5.256301:hypothe0cal protein MGC13170 

Hs.105D97:thymIdine kinase 1, soluble 
NM.003645:Komo sapfens fatty-add^Coenzy 
NM.021095;Honn sapiens solule carrier fa 
Hs.348710:Homo saptens. done tMAGE:4242 
NM.Q02257:Honno sapiens kallDoetn 1, ten 
NM.014698:Ho(no sapiens K1AA0792 gens po 

NM.002754:Hoino sapbns mnogen-acMed 
NK^005603i{omo sapiens ATPase, Class I. 
Hs.33S951:hypolhelfe^ protehi AF301222 
NKL001084:Homo saf^ procoltagen^bi 
NNL032192:Homo sapiens protein phosphate 
Hs.147097:H2A histona fantily, memlw X 
NMJ)1 771 6:Homo sapiens membrane-spanning 
Hs.7691 3:proteasome (prosome. cracropain) 
Hs.63325.iransniemt)r3na proteasOt serins 
Hs.48673nanno3y1 (alpha-l^HltyooprDls 
NM.006845:Homo sapiens tdnesin^s 6 (m 
Hs.42331:ephrfivA4 
Hs.67928:EST8 

NK4_000819:Homo sapiens phosphoribosylgly 
AF134164:Hocno sapiens Human endogenous r 
NM_033453:Homo sapiens inoslne triphosph 
NM_005975'.Homo sai^ns PTK& protein tyro 

NM.006579*i1oino sapiens emopamil binding 
Hs.l91797:ESTs 

NM.000412:Hcnio sapiens hisQdlne-dch gl 
NM_012116:Honto sapiens Cas-8r-M (murine) 
NM_000303dioino sapiens phosptnnennomutas 
NKt138761:Homos8p)ens BCL2-assoc{ated X 
(locus8nk)NM.001712:Homo sapiens cardn 
Hs.293847:ESTs 

(iocus6n)()NlyL001202:Hofno saptens bone m 
Hs.184062xtiroinosome 20 open laading fra 
Ks.79187:cox8ac1devinis and adenovlms 
NM_030766:Honio sapiens apoptosis regulat 
NM-012248:Homo sapiens setsnophosphats s 
Hs.355663:ESTs 

NM-002543:Homo sapiens oxidised low dens 
NM.004093:Homo sapiens ephrin^2 (EFNB2) 
Hs.375009:Konio sapiens mRNA; cONA DKF^ 
Hs.24879:phospha9dIc acid phosphatase t 
Ks.12680:Homo ^Sfesm cONA aJ10196 fis. 
NM.020387:Homo sapiens RAB2S, member RAS 
NM-007028:Klomo sapiens tripartite motif- 
NM.000343:Homo sapiens soiuts carrier fa 
H3.190153:Homo sapiens cDNA FU33988 fis 
NKA.136992:Homo sapiens beta«ite APP-de 
NM-032192:Homo sapiens protein phosphate 
Hs.1584:car^e ofigomeric matrix prot 
Hs.164537:ESTs 

Hs^1483:Homo sapiens, SbnHarto RtKEN 
(iocusllni()Nh^024531:Komo sapiens hypolh 
NM.022059:Komo sapiens chemoldne (C-X^ 
NM.001859:Homo sapiens sOlute cairiar fa 
Hs.1 14386:pynioi]ne^^«art)oxylals synthe 
NM.00201 lifomo sapiens filnioblast growth 
(locusfint4NM.017636:Homo sapiens tiansi 
NM.014799'i{omo sapiens hephaesUn (HEPH) 
NM.004482Homo sapiens UOP-H-aoe^ph 
NM-i»S224:Hofflosapiens dead ringer-{ii» 
NM_004708:Homo sapiens programmed cell d 
NM.000047:Homo sapiens arytsuf^lase E ( 
NM.002349:Homo sapiens lymphocyte ant^e 
HS.35651Z-ESTS. Wealdy 6im9ar to U6CAJV 
NM.004135:Homo sapiais isodtrats dehydr 
Hs.59483:leudnfrrich repeaUxmtalning 
Hs.82240:8yntaxtn3A 
Hs.374548.-so(uts canter tMy 16 (mono 
Hs.l46274:EST8 

Hs.11B727M)mo sapiens cDNA FU33803fis 
Hs27495:ptostatecanoer assodaled pnst 
Hs.194101 itomo sapiens CDNA: FU20869fi 
(tocusllM4NM_004S88ilQmofiaq)iens tt^ 
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4.75 
4.75 
4.75 
474 
4.72 
4.72 
4.70 
4.70 
4.70 
4.70 
4.70 
4.70 
4^ 
4.64 
4.61 
4.61 
4.61 
4.60 
4.60 
4.59 
4.59 
4.59 
AJSB 
4.58 
4.57 
4.57 
4.55 
4.54 
4.54 
4.54 
4.53 
4.53 
4.53 
4.52 
4.52 
4.51 
451 
4.51 
4.50 
4.50 
4.50 
4.49 
4.48 
4.47 
4.47 
4.46 
4.46 
4.46 
4.46 
4.46 
4.45 
4.44 
4.44 
4.44 
4.43 
4.43 
4.43 
4.42 
4.41 
439 
4.39 
4.39 
4.38 
4.38 
4.38 
4.38 
4.37 
4.36 
4.36 
4^ 
434 
434 
4J4 
433 
4.32 
432 
432 
431 
4.30 
4.29 
429 
429 
429 
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432636 
412869 
443639 
418245 
409636 



426023 
410199 
431685 
409956 
413278 
431193 
436856 
456629 
426682 
416054 
418526 
437897 
435099 
419378 
421585 
420039 
426427 
425263 
426031 
441085 
412939 
430514 
431842 
430387 
404826 
414196 
434203 
426378 
433020 
420319 
446696 
400130 
431890 
425003 
417386 
424837 
424534 
445462 
428471 
409012 
424154 
400290 
409152 
427333 
413835 
444664 
421959 
407777 
414806 
421190 



419468 
414907 
425247 
443802 
411165 
434808 
428376 
418216 
436278 
421910 
417866 
449057 
414561 
414812 
456382 
431958 
418661 
419092 
414013 
409093 
445873 
436485 
422164 
437016 
449437 
446946 



AA340864 Hs^8562 NM«001307:Homo sapiens daudln 7 (ODNT) 429 

AA290712 Hs.82407 Hs.82407xheinoWn9(C-X-CniotiOBgand 4.29 

BE269042 Hs.9661 Hs^i:proteasofre(p«>som8, macmpah) 4^ 

AA088767 Hs.B3883 NK1.020182:Homo sapiens transmembraw, pf 4,27 

AA305729 Hs.18272 {locuslink}NM.030674:Homo sapiens sotuto 4.27 

AW361666 Hs.49500 Hs.49500:KIAA0746pfot^ 427 

AU)38843 Hs.374530 Hs374530:Komos^enscONA:aJ23602fi 427 

AW377424 Hs^126 Hs.2G5126*tomosapfenscONA:RJ22667 e 4.24 

AW296135 Hs.^659 NM.006113:Homo sapiens vav 3 oncogene (V 424 

AW103354 Hs.727 NM-002192;Homo sapiens InWWn. beta A ( 424 

BE563085 Hs.833 Hs^ftiterferoiMfinwIatedpn^ 423 

AW749S05 Hs.296770 Hs296770daAA1719 protein 423 

A1469355 Hs.127310 (tocuslInk)NM-144624:Homo sapiens kinase 423 

AW891965 Hs.367942 Hs,367942*iomo sapiens, done IMAGE:4701 423 

AV660038 Hs.2056 H520S6:UOPg)ycosyitrmsfefase1 My 423 

NM.0Q2318 HsJ3354 NKLQ02318itono sapiens lysytoxidas&jk 422 

BE0ig020 HS.8583B NM.0042O7:Homo sapiens solute canier (a 422 

AA770561 Hs.146170 Hs.146170:hypolhetlcal protein FU22969 421 

AC004770 Hs.4756 NM_004111:Hon» sapiens Hap slnicture^p 421 

R24922 Hs.90078 Hs.90078:nucteotid&«ugar transporter si 421 

U95628 Hs.302043 NM_003965:Homo sapiens dremokine (C-C mo 4.20 

NM»004605 Hs.376147 Hs.376147ilomo sapiens cDNAFU39099 6s 4.20 

M86699 Hs.169840 Hs.169840:TTK protein kinase 419 

NKL001197 te.155419 NM-001197:Homo sapiens BCI2-interaceng 419 

AA295251 Hs.166066 " (locusIink)NIVL006e97:Homo sapiens dspla 419 

AW136551 Hs.181245 H3.181245:Homo sapiens cDNAFU12532fls 4.19 

AW411491 Hs.75069 Hs.75069:serinelTydroxymeliiyItransierase 418 

AA318501 HS241587 NM»021246:Homosa;^nslynipliocyteanlig8 417 

NM_005764 Hs271473 Hs.271473:ep]theiial protein ujMegulate 417 

AW372884 Hs.240770 Hs240770:nuclearcap binding protein su 417 

4.17 

AW505308 Hs.75612 NI\^004563ik)mo sapiens phosplioenolpyruva 417 

BE262677 Hs283556 NM.018509:Homo sapiens hypothettedprot 417 

U80082 Hs.169600 Hs.169600:KIAA0826proleln 416 

AI375726 Hs.227152 NI^016391:Hon)O sapiens tiypolheticalprol 416 

AW406289 Hs.96593 NM-019034:Homo sapiens rashomolog gene 415 

AF279265 Hs.298476 NK4_022911:Homo sapiens solute carrier fa 4.15 

Hs.155560 flll^00174G:HoniO88pienscalnexln(CANX). 4.14 

X17033 HS271986 NK|.002203:Homo salens integrimalpiia 2 414 

AF119046 Hs.154149 NW.014481:Homo sajfens APEX nuclease (ap 413 

A1JD37228 Hs.301957 NM-018144:Homo sapiens Sec61 aipiiafbrai 413 

BE276113 Hs.333034 NW.003491:Honio sapiens ARD1 homolo9,N-a 413 

087662 K8.150275 H5.15a275:KIAA0241 protein 413 

AA378776 Hs.28B649 {k)cus8nk)NM-024051:Hoino sapiens iiypoth 412 

X57348 Hs,184610 Hs,184510xtratjfin 412 

AL117435 Hs.49725 Hs.49725:DKFZP434l216 protein 411 

AF026004 Hs.141660 NI\1.004366:Homo sapiens cDtoridadianne! 410 

H18838 H5.31608 (k}CUSlnk}NM.017636:Homo sapiens transi 410 

AA176585 Hs.194346 Hs.194346:Spir.2 protein 410 

AF067797 Ks.176658 Nhl.001169-i^o sapiens aquaporin 8 (AQP8 410 

Ai272727 Hs249163 NM_024306:Homo sapiens fatty add hydrox 409 

N26362 Hs.11615 NIA.016086:Homo sapiens map kinase phosph 409 

AW751497 H8.98370 NM_030622:Homo sapiens cytochrome P450, 4.09 

AA161071 Hs.71465 Hs.71465.'8qualene epoxidase 409 

D14694 Hs.77329 {locus8nk)NM-014754:Homo sapiens phosph 408 

U9S031 Hs.102482 Hs.102482:roudn 5. subtype B. tracbeobro 408 

R58665 Hs.45847 i^016614:HomD sapiens TRAF and TNF race 408 

AA316241 Hs.90691 NM_006993itemD sapiens nudeoptwsminftwc 406 

X90725 Hs,77597 KW-000998-itomo sapiens libosomal protein 408 

NM_005940 Hs.155324 Hs.155324:m2trb(meta!Ioprotetnase 11 (s 406 

AW504924 H3.9805 H3.9805:exportln 5 404 

NM.000169 HS.69Q89 NM_000169:Hofno sapiens galactosidase, at 404 

AF155108 HS256150 Hs256150:NY.REN^1 antigen 404 

AF119665 Hs.184011 Hs,18«)11:pyrophospl«tase(infljgante)' 404 

AA662240 Hs.283099 Hs.283099-.AF1Sq14 protti 402 

BE396290 Hs.5097 H5.5097xynaplogyrtn2 4m 

NM.0145a6 Hs.109437 Ni^014586:Homo sapiens hormonallyupreeu 4.02 

AW067903 Hs.82772 Hs.82772KoIIaoen, type XI, ajpha 1 402 

AB037784 Hs.22941 Ks22941:KIAA1363 protein 401 

AI064813 HS.1951S5 Ks.195155»ilutB canter fantlly 38. merrib 400 

X72755 Hs.77367 NMJ0024ie:Hon» sapiens monokine Induced 400 

AW973003 HsJ79909 Oocuslnk)NNL024831:HcnK> sapiens nudea 3.99 

X63629 Hs.2877 NI^001793:Homo sapiens cadhefin 3, type 3.98 

NM.001949 Hs.1189 NM.001949:Homo sapiens E2Ftranscflpik)n 3.98 

J05581 Hs.89603 NM.002456M)roo sapiens mudn1.transmem 198 

AA766605 Hs.47099 N«UQ4642:Homo sapiens hypothetkslprot 3.98 

BE243834 Hs.50441 NfyL015936:HomosaftonsC6M)4protebi(L 3^7 

AA250970 Hs251946 Hs.2S1946:Homo sapiens cONAFUl 1840 lis 3.96 

X59135 Hs.156110 Hs.156110:inimunogk)burin kappa constant 3196 

NM.014312 Hs.1 12377 K8.112377xor0caltiiymoc^ receptor (X 3l95 

AU076916 Hs.5398 Ks.53S8:guan{nemonphosph8ta synthetase 3L94 

Ai702038 Hs.100057 Hs-IOtX^serine/Bireonine kinase 35 3.94 

AI878932 Hb.317 WMJ03286tono sapiens topolsomerase {OH 3194 

195 
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420162 GE378432 Hs.95577 NM.052S84i1omo sapiens cycMependent 3.94 

443180 R15875 Hs.258576 NM.01 21 29:Homo sapiens daudin 12 (CL0N1 3.93 

418736 AW388633 H5.6682 Hs.6682xQ(uts canter family 7. (cafion 3.93 

^ 409463 AI458165 Hs,17296 NM_023930:Homo sapiens hypothe&al prol 3.92 

5 447495 AW401884 Hs.1B720 NM.004208;Hamo sapiens piogrammad cell d 3.92 

448093 AW977382 Hs.15898 Ks.15898:2.4^noylCQA reductase 2. po 3.91 

426898 AA852773 Hs.334838 K5.334638:KIAA1866 protein ^90 

438465 W57576 Hs.378718 H3^8718:Homo sapiens cONARJ33433fis 189 

436827 H72187 H8.356668 (locusiinK)NM.005274:Homo sapiens guanin 3.89 

10 407971 A1469117 Hs.62gi8 Ks.62918:CX)C91 cell division cycle 91-8 3.89 

400750 3.89 

448140 AF146761 iis.20450 NM.020125:Homo sapiens Btymphocytaacli 3.89 

413880 At6S0842 Hs.110915 NK1.021258:Homo8aftointedeu]un22Fe 3.89 

453258 AW293134 K5.32597 NN^005977a^omo sapiens ring finger prota 3.89 

15 428788 AF082283 Hs.193516 NiM.003921:Homo sapiens BK^eUaUlympho 3.86 

443044 N28522 H5.6935 Nhl.014298:HonK) sapiens quinonnatephosp 3.88 

413095 AA494359 H5.30715 Ks.3071S:potassiumvoltag&^8led channel 3.88 

417129 A1381800 Hs.300684 Hs.300684xslc«Dninoena^atedpeplid 3.87 

410268 AA316181 Hs.61635 NM.012449:Homo sapiens sUtransnttntbrane 3.87 

20 425047 U34038 Hs.154299 NM.005242Homo sapiens coEQulatlonfecto 167 

416084 L16991 Hs.79008 N(4.01 21 45:Homo sapiens deoxythymUylate 3.8S 

449867 AB023227 HS.2386Q K5.23660:K1AA1010 protein 3.88 

400298 AA032279 Hs.61635 Hs.61635:^ transmentatseptlheliat an 3.85 

407770 AW607831 Hs.3B738 NA1.014343:Hon» sapiens daudin 15 (aONI 3.65 

25 418313 BE244231 Hs.64038 NM^015937:Homo sapiens CGI4)8 protein (L 3.85 

413380 AI904232 Hs.75323 Hs.75323:piohil}ilIn 3.65 

452220 BE1S8006 Hs.212296 Hs.212298:ESTs 3.65 

413568 AA971Q14 H8.75432 NML00G884:Homo sapiens IMP (inosine mono 3.85 

433858 L03678 Hs.156110 Hs.156110amnTunog)obuIin kappa constant 3.84 

30 428474 AB023182 H$.184523 Ks.184523:serine/8treonine kinase 38 Sk 3.64 

430237 At272144 Hs.236522 Hs236522:DKFZP434P1 06 protein 184 

414862 BE621310 Hs.923 Hs.923:8ingl&«tranded DNA binding prate 3.84 

437967 BE277414 H5.5947 NM_005370:Honio sapiens mel transfbnning 3.64 

427318 AF188061 Hs.175783 N1^014579:Homo sapiens solute carrier fa 183 

35 459306 AW5784S2 AW578452:RC1-CT0252-03010a023^7CF025 183 

446342 BE298665 Ks.14846 Hs.14846:Homo sapiens niRNA;cDNAD}<FZpS6 163 

432886 BE159028 Hs.279704 Hs.279704:chromatin aixiessibiiity comple 162 

434845 6E267057 1^325321 Hs.325321:WD repeat domain 18 162 

426514 6E616633 Hs.170195 Hs.170195:bonemoipiiogen6tic protein 7 ( 162 

40 410315 AI638S71 Ks.378965 Hs.378965:Homo sapiens cDNAai37656 lis 182 

421905 A1660247 Hs.32699 Ks.32699:Homo sapiens, Similar to RIKEN 181 

421481 AW391972 Hs.104696 Hs.104698:KlAA1324 protein 181 

445921 AW015211 Hs.153799 H8.153799:Homo sapiens cDNAFU38333fc 180 

414368 W70171 Hs.75939 NM.012474:Homo sapiens uridine monophosp 180 

45 457284 AF102650 Hs.227933 NM.013338:Homo sapiens A!g5,S.GerBvfsi 160 

413813 M96956 HS.7S561 NM-003212:Horno sapiens teratecardnoma^l 180 

414602 AW630088 Hs.76550 NM.052886:Homo sapiens malj^difle 180 

410219 T98226 Hs.1719$2 H8.171952»(Xludin 180 
407137 T97307 178 

50 430462 A[584156 Hs.105640 Hs.105640:hypolheticd protein BC007772 176 

432680 T47364 Hs.278613 (locuslInk)Nli4_005532:Homo sapiens Interf 176 

460010 AW293801 HS.25S052 Hs.255052:ESTs 178 

440334 BE276112 Hs.7165 NM-003904:Homo8a(te zinc finger prote 178 

440676 NM_004987 Hs.112376 Oocus]lnk)NM.004987:Homo saptens UM an 177 

55 428072 BE258602 Hs.1 62366 NMJ)16292:Honno sapiens heat sluxkprotei 177 

407722 BE252241 Hs.36041 NMJ}03661:Homo8ai^nspyridoxai(pyrido 177 
426459 AF151812 Hs.169992 . NM.01S966:Homo saptens serologicaltydef 177 

443323 BE560621 Hs.9222 Hs.9222:es!rogen receptor binding site a 176 

406621 X57609 Hs.18112S Hs.181125:lmmunogtebulin lambda locus 3.76 

60 423198 M81933 Hs.1634 Hs.1634xelldivistencyde25A 176 

428206 AB020643 Hs.163006 Ks.183006:likeiyhomotog of mouse hepari 175 

447200 BE543146 Hs.281434 Hs.281434i1omo saptens cDNAaJ31 373 fis 174 

425209 AL04g761 Hs.155140 NM_001895:Homo saptens casein kinase 2. 174 

^_ 411950 T28407 HsX1564 NM»Q02619:Homo sapiens piatetel factor 4 174 

05 418681 AA287786 Hs.23449 H323449:tn3ufin receptor tyrosine kinas 174 

421532 AW138207 Hs.146170 NI4.022842:Homo sapiens hypolheScal prot 174 

446291 6E397753 Hs.14623 Hs.14623dntenBron,ganvnaMudbtepio 174 

435888 BE265839 Hs.12126 NM_018487i^ saptens hepatoceSuterca 173 
417286 AA122237 Hs.81874 NM.002413:Homo saptens iTtesomatglutat 3.73 
70 421743 T35958 Hs.107614 Hs.107614^)KFZP564i1 171 protein 173 

400419 AFH»4545 AF084545:Komo sapiens verstean Vint Isof 173 
421357 AK000609 Hs.103806 NM_017896:Homo saptens chromosome 20 ope ' 3.73 
420665 AW469240 ite.371581 Hs.371581:EST8 3.73 
418703 NM.014448 Hs.67435 Hs.87435:Rho guanine exchange factor (GE 3.73 
75 452679 Z42387 Hs.B3883 (locusQnk}N»1.020182:Homo saptens transm 3.72 
419743 AW408762 Hs.S957 Hs.5957:Honio saptens done 24416 mRNAse 172 
435730 AB020635 Hs.4984 Hs.4984:KIAA0828 preteln 172 
431512 BE270734 Hs.2795 Hs.2795dactate dehydrogenase A 172 
444006 BE395085 Ks.334762 (locusfink)NM.032632:Homo saptens hypolh 172 
80 442875 6E623003 HS.2362S Hs.23825:Komo saptens done TCCCTA00142 171 
413431 AW246428 K3.75355 Nhl.003348:Homo sapiens ubiriuitliKOi^a 171 
413950 AA249068 Hs.32793 Hs.32793:Ho(no saptens cONARJ31 108 fte. 171 
411125 AA151647 Hs.68877 K3.68877:qf1odtrom8tH245» alpha pdypep 171 
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410099 


AW992356 


H5.8364 
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HS.11S65 
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4oZfUU 


AI859390 


H&.288940 


41oo03 


U50079 
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453323 


AF034102 


Hs^2K1 


42^13 


AV656571 


Hs.121066 


452488 


N74921 


H3.184389 




BE3118S6 


Hs.274230 


425811 


ALJ039104 


Hs.159557 


437741 


6E561610 


H3.5809 


421802 


BE26145B 


Hs.108408 


428582 


BE336699 


Hs.185055 


446147 


AL133064 


H5.14051 


408716 


AI567839 


Hs.151714 


450825 


AC0059S4 


H3^5527 


442007 


AA301116 


Hs.142838 


453454 


AW052008 


Hs.374973 


421612 


AF161S4 


Hs.106196 


428371 


AB012193 


Hs.183874 


421340 


P07783 


H$.1369 



Hs.293441:Homo saptefs SNC73 protein (SN 
NM»000053*ton» sapiens ATPaso, Cu-M-tran 
Hs.101282:Homo sapiens mRNA; cDNA DKFZp4 
KNl.001853:HQnio sapiens coDagen, type DC 
NM_004300:Koino sainens acid phosphatase 

H8.77741«lnlnogen 

Hs.29002:KIAA1708 protein 



NM_005002Homo sapiens NAOH dehydrogenas 
NM.004632itomo sapiens death associated 
NM.001033:Homo sapiens ribonudeoSde re ' 
(iocuslIn)()NlyL000883:HDmo sapiens IMPO 
NMJ)05538:Homo sapiens inhibin. beta C( 
Hs.795e:ESTs 

NMJD06114:Homo septens transtocasa of ou 

Hs.59461:DKFZP434C245 protein 
Hs.86131:Fas (TNFRSF6)-assodated via de 
KM_005402:Homo saftens v-rat simian teuic 
Ks.695xystatinB(5tetinB) 
NM.020158:Homo sapiens exosome component 
Ks.210667£STs 

(iocusnn)()Nh^021126:Homo sapiens meicap 
Hs.35e81:RasHinduc8d senescence 1 
Hs.273333:hypo&telica! ptotein RJ10986 
NM.002991:Homo sapiens small indudi)ie c 
NhC016032:Honio sajsiens line finger. DHHC 
Ks.9S697:liver-spedfic bHUi-Zlp transcr 
Hs.348504d7pothetica] protein BC014072 
(locusrink)NM_033550:Homo sapiens chromo 
NM.003129:Homo sapiens squatene epoxides 
N^.012394:Homo sapiens prsKokfin 2 (PFDN 
Hs.3348S1:LtM and SH3 protein 1 
BF755039K3VO-CT0583.18100a428-f07 CT058 
NM-00021 OiHomo sapiens Integrtn, a^ha 6 
Hs.79305:KIAA025Sgene product 
Hs.65844murDtiopldc tyrosine Idnase. r 

NMJ000245:Homo sapiens met piotiHincogen 
Ks.74335±eat shodt 90(0 protein 1 . beta 



Hs.352537:Homo sapiens cONA FU31066 fis 
H5.7837:phosphoprciiein regulated tff mito 
Hs.8204S3nididne (neurite groutrth-promoe 
hat033416:Homo sapiens sfmliar to HYPOTH 
NM.002707:Homo sapiens protein phosphata 
Hs.148962inc finger, DHHC domain oontai 
NIUOOIseOiHomo sapiens lnNeiAbi13fe 
NII^Q02S3BMxno sapiens oodudln (OCXN), 
NM.004587:Homo sapiens ribosome binding 
NM^003090:Homo sapiens small nudear rib 
NM.052858:Honio sapiens similar to RIKEN 
NM.0188SOEHomo sapiens apdilpoprotein 84 
Hs.6364:pyruvate dehydrogenase Idnase. f 
Hs.28757:transmembr8ne 9 superfamBy mem 
Hs.99364:abhydn)las6 domain containing 1 
Nlii.000206M)mo sapiens Inteiteuidn 2 rec 
NMJX16904i1amo sapiens protein kinase, D 
H8.129673»u](aryolic translaOon InlOat 
Hs.8364:pyruvate dehydrogenase Idnase, i 
N^^X1829:h{omo sapiens chloride channel 
NM.080748:Hon» sapiens chromosome 20 ope 
(locus6nk]NM.017988:Homo sapiens hypoth 
NM.021 2S9:Homo s^iriers transmembrane pro 
N^004964:Homo s^ens hlstona deaoetyta 
NtA.001532:Homo sapiens solute canter fa 
(locus0nk)N^U}03270:Homo sapiens tiansm 
Ks.184389£STs. Moderately similar to 81 
Hs.274230'^-pho6phoadeno8lne ?*phospho 
NMj002286:Homo sapiens Icaryopherin alpha 
NM_020470:Homo sapiens putative transmem 
(locuslinl(}NM.016022:Homo sapiens CG)-78 
K5.185055:BENEpmtein 
(locusltnlONM.145696:Homo sapiens endoze 
{IocU5Bnk)NM.033405:Homo sapiens peroxl 
(locU3Gnl()NK^014426:HomQ sapiens Gght 
NM.032390:Homo sapiens MK167 (FHA domain 
NKy)04697:Homo sapiens PRP4 pre-mRNA pro 
(locuslin)QNM.016579:Homo sapiens 8i}6 an 
^iMJD03S89:Hon»8apienscuain4A(CUL4A} 
NM.000574:Homo sapiens decay aooelerafin 
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H5.62402 
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417457 


AA378907 


Hs^9326 
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AU)44878 


Hs.11899 
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8E00S775 


Hs.126782 
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XB7241 


Ks.166994 


452835 


AK001269 


Hs.30738 


419493 


AF001212 
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457670 


AF119666 


Ks.23449 
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Ks.125078 


429404 


NM-005738 
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410293 


AK000047 


H5.61960 


434826 


AF155661 


Hs.22255 


423599 


AI805664 


H831731 


427715 
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Hs.18Ma8 



Nlil^DOOSieMmsapiens protein CpnacB 
H5.8887£ST8 

Hs.166196'ATPa58. Class 1. type 8B. memb 

({oaisnnfc)NM-0(M35aHonio sapfens ce9 d 
NM-006412:Homo sapiens 1-acytgIycen)K3- 
(locusl}nk)NM-003779:Homo sapiens UOP-Ga 
NMJ)00546:Homo sapiens tumor prot^ p53 
Hs^78970:Hanx) sapiens cONA RJ35102 fis 
Hs.182018:inlerfsu)(in-1 receptor-assoda 
NM-138689:Homo sapiens protein phospliata 
K5.1 51734:nuclear transport factor 2 
^A1.00221 9:Homo sapteis int^ membrane 
NM_004870:Homo sapiens mannoseMilichD 
NKL000982:Homo sapiens ribosomsl protein 
NM.017423:Homo sapiens UDP-N-aoety1^ 
NM.0068S5:Homo sapiens KDEL (l^s^sp^ 
Hs-IIOeSSiRon-fnetastatto cells 1. pretet 
Hs.3569xhreinosonno 20 open ibsiSi^ l^ame 
Hs.154331:EST8 

1^ocu5nnlONM.024923:Homo saptens hypoOi 

H5.6097d;Homo safnens cONA fU40829 tis. 
Hs.3439stom8!in iJEPBJT^ 2 
H5.44680:hypothsfical protein RJ20979 
Hs.131810:Komo sapiens cDNA RJ35976 fis 
NM.025217:Homo sapiens UL16 blntftig pn)t 
NM.000593:Homo saptens transporter 1, AT 
Hs^4743:tiypotheSc8l protein aJ20171 
Hs.42644:thioredoxin4iXe2 
(locus(]nk)NM.030799:Honio saptens go^ 
Hs.23765:nnembrane metedlo^dopeptidase- 
Nhy)14788;iHomo sapiens tripartite motif- 
NM.0331 61:Homo sapiens surfeit 4 (SUi^4} 
NM_001961:Homo sapiens etdcaryofictransl 
Hs.109007:Homo sapiens, Similar lo LOCIO 
Hs.172870:KiAA1913 protein 
Hs^06165:ESTs. Higiily simiiar to imknow 
Hs^9827:EST8, Weakly similar to pmtsa 
H5.182625:VAMP (veslde-assoctated meiribr 
H3.81337:iec6n, gaiactosida-bindins. so 
NM_001669;Homo sapiens arylsutfatase D ( 
Hs.155462:MCM6 mlnicliTDmosome maintenanc 
NM.004999:Komo sapiens myosin VI (MY06). 
Hs.110006diypothetical protein BCO04501 
NMJ)12339:Kofflo sapiens transmembrane 48 
NMJ001200:Homo sapiens bone morpltogsnefl 
(iocus6nk)NM.003374:Komo sapiens voitag 
Hs J5799:protBase, serine. 8 (pn^tasn) 
Qocu5!inl4NM.145069:Homo sapiens Uuco 
Hs.104830:EST8 

Hs.18355630lutB canfer My 1 (neutr 
NM.006109:Homo 8a(^s SKB1 liomolQg(S. 
Hs.8752:tFansmembrane protein 4 
Nlhij002842M)mo sapiens protein tyrosine 
H8.180062:proteasome (prxome, macropdn 
NM-003757:Homo sapiens eufcayotic tnansi 
NN1.139177:Homo sapiens chromosome 17 ope 
H3,3031^:p534nduced protein PIGPC1 
Nli4_002951:Komo sapiens r&ophorfn ti (iV 
NM„003740:Homo safilens potassium channel 
NM-033340cHomo sapiens caspasa 7> apopto 
NM_003576:Homo 6aj)iens serine/thieonhe 
NM-005526:Homo sspiens tieat shock transc 
BE072881 J?C2^T0548-20030^012^9 BT054 
NM^006265:Homo sapiens RA021 homoiog (8. 
Hs.162490:leucind^ch PPR-moGf cont^n 
Hs.62402:p21/Cdc42/i^1>ac8vated kinase 
NM-G04182:Komo sapiens ubiquitously^ 
K5.3493aS:Homos8ptocDNAFU30677fis 

NM.000859:Horno sapiens Utydraxyud^nethy 
NhiL014467:Homo sapiens sustikepeat prot 
Hs.166994:FAT tumor suppressor homoiog 1 
NM_018087:Homo sapiens hypotheScal prot 
Hs.90744:proteasome (prosome. macropain) 
Nh4.018B42:Homo saptens Insulin receptor 
(IocusEnk)NM.004152HQmo sapiens ormth 
NM.005738:Homo sapiens AOP-nbosytation 
Nik^OI 8992:Homo sapiens hypotheSca! prot 
Ks.22265:pynnrat0 dehydrogenase phosphat 
PoaBSnk)Nht012094:Homo sapiens peroxi 
H&180428JaAA1iaiproie{n 
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H3,82399 

85.75061 
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AK000002 Hs.55879 

NM_012453 Hs.52515 

AP054178 HS.19S61 

AF1(ffi365 Hs.172613 



Hs.173919tEST8. Weakty slmlar to neinon 136 

NM.003074:HDmo sapiens SWUSNF related. 3^ 

Hs^490xaspas6 1. apoptosixdated cys 3^36 

H3.73931:ma|orhishx»npa!ibil}tycomp)9 3:36 

HsJ9364:^hydro!as8 domain containing 1 136 

mUK)5176:Homo sapiens ATP synthase. 3.36 

NM-008280:Komo sapiens signal sequence r 3.36 

Hs77268:Homo sapiens ctm RJ21933 lis 3^ 

Ks.3071S.'potass{um vo{(age^afed channel 3.36 

NNt000373:Homo sapiens uridine monophosp 3.36 

(!ocusljn)ONM.01224a-Hon)osapleR$sohits 3.35 

NM.001360:Homo sapiens 7<dehydrocholeste ' 135 

NML006191:Homo sapiens praBerallon^ 135 

(locusGnl()NMj016442:Hon» sapiens type 1 135 

NMJ002765:Homo ^iens phosphorit)osyl py 135 

Hs.l52328:EST8 3.35 

NM.002402:Homo sapiens mesodenn specific 135 

Nlil.005216:HomosapIensdbllchyNf(phosph 135 

Hs.2g03:pratein phosphatase 4 (tomteriy 134 

NM-001961 :Homo sapiens eukaryotic transt 134 

NM_O20400:Homo sapiens G protein-coupled 134 

NM.004429:Homo sapiens ephrin^l (EFNB1) 134 

(locusfinl()KM_004460:^ sapiens fibiob 134 

Hs.20281'i^phosphat8se>7 134 

(tocuslln)c)NM-145051:Homo sapiens hypoth 134 

NM.004341 :Homo sapiens carfaamoyl-phospha 134 

(locuslink)NM.001997:Homo sapiens Finkel 134 

Hs.351474:Homosa;^enscONARJ30002fis 134 

Nh^002452:Homo sapiens nudix (nxleoside 134 

Hs.173980:nuclear matrix protein NMP200 134 

NM.006330:Homo sapiens lysophosphoiipase 134 

(tasrinl£)NM_017798:Homo sapiens duomo 134 

Ks208912:hypothelical protein liAGC861 133 

Hs.l80059:Homo sapiens cONA FU31360 lis 133 

NM.003488:Komo sapiens A Idnase (PRKA) a 133 

OocuslinigNM.012408:Homo sapiens protei 133 

Ks.146381:RNA binding motif protein, Xc 133 

Hs.377B16:Homosapen8cDl^aJ36808fi3 132 

Hs.1 19122:rtoomal protein L13a 132 

NM.019082:Homo sapiens putative nudedla 132 

Hs.166254:Dkely orthotog til rat vacuole 132 

NM-001982:Homos8piensv«b-t)2erythfDb 132 

(|ocu8lnk)NM_008470cHomo sapiens tripar 132 

132 

(locusSnk)NM.022103:Homo8eplensCDw92 132 

Hs.133397:ESTs 132 

Hs.348669:CDC28 protein kinase 1 131 

Hs.75184:chitlnase ma 1 [caWage g 131 

(|ocuslink)NM.001467:KQmo sapiens glucos 131 

NM.000991:Homo sapiens ribosomd pratein 131 

Hs.21201:nedin3 131 

Hs.343261 :Wstooompatil)irity (minor) 13 131 

I^M_01693Vitomo sapiens fls485 (LOC51066) 131 

(|ocu5link)NM.016326:Homo sapiens chemok 131 

NM.000433;Honw sqplens neubQphil cytoso 131 

H8.148324:ESTs 131 

0ocusM)NM_OO448O:Homo sapiens fucosy 131 

NM.008404:Homo sapiens protein C recepto 130 

NMJD00189:Homo sai^ens hexokinase 2 (HK2 130 

Hs.281434:Honio sapiens cONAFU31373fe 130 

NMj018698:Homo sapiens hypothefteal prot 130 

NM_00336&Homo sapiens ui^u}no)-<^hr 130 

NM_017893:Homo sapiens sema domain, immu 130 

H8.356392f STs, Highly slmnar to S-phas 129 

(iocus6nk}l^002296:Komo sapiens iamin 129 

Hs.5957:Homo sapiens done 24416 mRNA se 129 

NM-016016:Homo sapiens CGl-69 protein (L 129 

NlA-001538:Homo sapiens HMT1 hnRNP methyl 129 

NM-001756:Homo sapiens serine (or cystel 129 

NM_007357:Homo sapiens component of otig 129 

Ks.75061'jnacrophage myitslpytaled alanln 128 

HS.174905MAA0033 protein 128 

Ks.224630:Homo8aplenscDNAFU333161is 128 

Ks.196988£ST8 128 

(tocusnnk]NM.007274:Homo sai^ens cytoso 128 

(locustin)(}NM_005984:Homo sapiens sdufa 128 

Hs.30340JiypothelicalprolebiKlAA1165 128 

Hs.210792Mxnosapiens€ONAFU366911is 128 

3 28 

OocusOnk3NlVL033450:Homo sapiens muttW 128 

NM.012451Hbnio sapiens transducin(l)eta} 128 

Nhl.005001:Homo sapiens NAOHdehydrogenaa 127 

NM-006588:Hoinoaapl8nssolUtecairi8ffa 127 
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415402 AA184687 Hs.177576 Hs.177576anannosyl{alpha-1.3-)^ptD 3.17 

451032 VTO3692 Hs^23079 Hs^79:Hoto sapiens mRNA: cONA OKFZpS 3.17 

412146 M92444 H3.73722 HS.73722APEX nuclease (mulIifuTOfional ai7 

, 417018 M16038 Hs.80887 HsJ0887>ye5.1 YamagucH sarcoma vlrai 3.16 

5 425244 AK002127 Hs.155313 NM-022105:Komo sapiens death sssodsled 3.16 

417878 U90916 Hs.82845 Hsi2845itomo6a{BenscONA:FU21930fis 116 

426675 AW084791 Hs.133122 Hs.l33122±ypofh8ficd) protein FU14524 3.16 

432728 NM_006979 Hs.278721 NM-00697e:Honio sapiens KLAdassUfegi 3.16 

442843 U82756 Hs.374973 (iocusllnkJNM-004597:Homo saptens PRP4 p 3.16 

10 418462 BE001596 Hs.85266 Ks.85266:integrin, beta 4 3.16 

429922 Z97630 Hs.228117 NK/L005318:Honio sapiens HI hbtone family 3.16 

429556 AW1393S9 K5.314807 Hs.314807:hypothetical protein MQC28S5 3.15 

418127 BeZ43982 Hs.83532 (lQCUslnk^002389:Homo sapiens membra 3.15 

^_ 416293 BE244454 Hs.79162 H8.79162xbucturB8pe^reooght6on 115 

IS 435968 AW161481 Ks.111577 (locusl!nlONM.030926:Homo sapiens integr 115 

414702 1.22005 Hs.76932 NI\1.004359:Homo sapiens cefldhrfston eye 115 

437672 AW748265 Hs.S741 NIC016230:Homo8S«))ensflavohemoprQteIn 115 

435750 AB029012 Hs.4990 Hs.4990:KIAA1089 protein 114 

^. 432710 AA609685 H5.278872 NMJ005898:Komo saptens membrane oomponen 114 

20 407961 AW672939 H5.41694 K$.41694:orfgin recognition complex, sub 114 

438203 BE540090 Hs.7345 Hs.7345:MAD1 mlloDc arrest deiidBnt^i 114 

426410 BE298446 Hs.305890 NM.138578:Homo saptens 602^ 1 (BCU 114 

422282 AF019225 H5.114309 0DCUsrink}NM.OO3661:Homo saptens 8po6p 114 

450295 AI766732 Hs.210628 Hs.210628:EST8 114 

25 413900 AW409747 Hs.75612 NM_006819:HoTno sapiens stress^iducediih 113 

452695 AW780199 Ks.30327 KM_003668:Komo sapiens mitogen-adivaled 113 

407797 AK000524 Hs.39850 Hs.39B50:uridine tdnase^lce 1 113 

412006 AW451618 Hs.290216 Hs^0216:EST8 113 

424247 mom Hs.234734 NM_000239:Homo saptens lysozyme (renal a 113 

30 '447627 AF090922 Hs.152738 NML016050:Homo saptens miiochondrial rib 113 

422753 A1928995 H$.1575 Hs.1575xmall nuclear ribonucieoprotein 3.12 

441321 H17182 Hs.7771 NM_007273:Konio sapiens repressor of estr 112 

411358 R47479 Ks.94761 Hs.94761:KIAA1 691 protein 112 

438444 AI064707 Hs.301226 Hs.301226:HDmo sapiens, ctone IMAGE:3456 112 

35 424727 AW590378 Hs.378965 Hs.378g65:Homo saptens cONA FU37658 Us 112 

435975 AL118990 Hs.373554 (locusitnl()NM.130786:Honio saptens alpha- 112 

426680 AA320160 Hs.171811 NIC001625:Momo saptens adenylate kinase 112 

412326 R07566 H3.73817 NM.002983:Homo saptens smaH inductbte c 112 

409220 BE243323 Hs.61233 (tocusHnk)NI^003842:Homo saptens tumor 112 

40 428699 AW5782S2 Hs.190161 Hs.1 901 61 :U%8 protein 112 

422675 BE018517 Hs.119140 NR4-00197aHomo saptens eukaiyotlctransi 112 

444301 AK000136 Ks.10760 NM1768Q:Homo saptens asporin(LKR das 112 

409932 A1376750 H3.57600 NM.001283:Homo sapiens adaptor-feteted p 112 

419152 L12711 H5.89643 (locustink)Nti4.001064:Homo saptens Iransk 112 

45 410240 AL157424 Hs.61289 K8.61289:8ynaptojanln2 112 

413073 AL038165 Hs.75187 NM.014765:Homo saptens transtocaseoTou 111 

406865 At025931 Hs.181357 Hs.181357:tenilnin receptor 1 (67kO. ribo 111 
405203 111 

441028 AI333660 H8.17558 Hs.17558:Homo saptens. done tliMGE:40704 111 

50 417211 T97617 Hs.269092 Hs.269092:EST8 111 

421684 BE281591 Hs.10e768 NM.018120:Homo saptens hypotheticdprot 111 

429824 AA296363 Hs.121520 Hs.l 21 520:Homo sapiens cDNAFU35792fis 111 

426234 BE314534 Hs.168159 Hs.l 681 59:bjfuncttenal8poptossregulat 111 

408805 H69912 K&48269 NM.003384:Homo salens vaodnteieteted 110 

55 417821 BE245149 Hs.82643 NM.0Q2822:Homo saptens protein tyrosine 110 

416976 6E243985 Hs.80880 Hs.80680an^ vault protein 110 

410013 AF067173 H5.57904 Hs.57904:mag&flaslil homotog, proRteraS 110 
429597 NM.003816 H3.2442 Hs^442;ad1sintegTinandmstanopR}tein 110 

453518 AW503205 Hs.27268 Hsi7^:Homo8aidanscDNA:FlJ21933fis 110 

OO 429238 NH.002849 Ks.198288 NM.002849:Homo saptens protein tyrosine 110 

433409 AJ278802 Hs,25661 Hs.25661£ST8.Moderateiy8imitertohyp 110 

426440 BE382756 Hs.169902 NM.006516:Ho(no saptens solute canierfe 110 

435472 AW972330 Hs.283022 NKU}18643:Honw saptens triggering lecept 110 

427609 AK000436 Hs.179791 NM.017817:Komo sapiens RAB20, member RAS 110 

65 .408201 AK000S68 Hs.43654 NKL01 7882:HomD sapiens ceioid^ipofusdn 110 

'418181 U37012 Hs.83727 NM.013291:Homo saptens deavage and poly 110 

445176 A1878907 Hs.12379 NM.001419:Homo saptens ELAV (embryonic I 110 

438908 K95990 Hs.181244 Hs.181244:m^histeoompaGbaityoompl 110 

427337 246223 Hs.176653 NM-.000569:Homo8a^Fc fragment of tg 110 
70 433435 BE54527L-HS.340959 NM_005726:Homo sapiens Ts transtelton el 109 
408150 8E620274 H5.43112 Ks.43112:Honrto saptens mRNA:d)NADKFZp43 109 
431738 AW237726 Hs.288549 NM-032828:Komo saptens ubIqui&iUBF-fl 109 
449703 H61001 Hs.171802 Hs.1 71802:Homo saptens, done IMAGE:3956 109 
423184 NVL004428 Hs.1624 NM»004428:Honio saptens q)hrin.A1 (EFNA1) 109 
75 400278 Hs.2280 NM.002950:Homo saptens ribophorin I (RPN 109 

449051 AW961400 Hs.333526 NH.032339:Homo saptens hypotheOcaiprot 109 
428297 AA236291 H9.1B3583 NM.030666:Hamo saptens serine (or cystei 109 
430066 AI92g659 Hs.237825 Ks.23782S»lgn8treoognlltonpai^72 109 
428044 AA093322 Hs.301404 NZyL006741Kofno saptens RNAbtedingmolIf 109 
SO 426989 AI81520B Hs.100293 H3.100293:(MInked N-ecetytghicosdmlne ( 108 
447887 AA11405O Hs.211610 NM.001228:Honio saptens caspase 8. apopto 108 
422010 AA302049 Hs.31161 Kl31181:HDino8aplaacONA:RJ23230fis 106 
444823 BE2B2989 Hs.12045 Hs.12045w'C2f protein 108 
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10 
15 
20 
25 
30 
35 
40 
45 
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AiUOOB 


6E379794 


Hs,159651 




nWouirjd 


nS.1BUQS9 


419/5/ 


AA//Jo2D 


Hs.d3970 


427725 


U66839 


Hs.180533 


44408 


AI754o13 


Hs.l 46428 


433914 




Ks.574 


42010/ 


A)w01714 


HS.95744 


431498 


AKUU 1777 


HS.25B551 


41K2J 


AXAtttACHOy 

AWo4o3o7 


nS.D6B04 


449954 


AAo41d3o 


Hs.37477 


423671 


AWooOldO 


H3.234101 


448719 


AA(woo27 


Hs^185B 


^1455 


AIW7ZZ7 


Ll* OQ04 

Hs.8c21 


450876 


AF1 89052 


Hs^85976 


432465 


D56165 


K5.275163 


421808 


A1/AAA4CY 

Aw)00157 


Ks.1 08502 


430014 


H59354 


Li_ O^JOAO 

Ks.3743u3 


436553 


AW4071S7 


Ks.181i25 


458814 


AI498957 


HS.3S1937 


450247 


AF123303 


Hs^4713 


416052 


AW630o56 




407223 


H9o650 




418841 


BE243136 


Hs.86947 


450690 


AA296696 


Ks.333418 


408124 


Uo93j7 


Ks.42853 


435550 


A loo J J C0 

A!22445o 


11— OOJf AT 

H3,324507 


421779 


Ala791S9 


K8.108219 


440246 


V\^010 


H3.I91379 


44a770 


AV6o09u9 


Hs.1549oo 


440708 


Ar03o9o2 


Ks.7381 


425118 


AUD7001 1 


Hs,1 54672 




AA534296 


H8.20953 


412867 


Ai imeoet 
AUD76WI 


Hs.74637 


422032 


AA47D900 


ns.1 10857 


441238 


AI372555 


Hs^22456 


408524 


D87942 


H3.4o32o 


408102 


U46351 


Hs.621 


430508 


Ai0i5435 


re.104637 


432281 




ns.2742o3 


443883 


AA114212 


Hs.9930 


423570 


AVv838306 


11. 4 inn 4 o 

Hs.129819 


443653 


AA1 37043 


Hs.9663 


451711 


AKD0O461 


Ks^6890 


444736 


A A POO An4 

AA533491 


Hs.23317 


407687 


AK002011 


Hs^SSB 


423022 


AA32052S 


Hs.201076 


453450 


AW797627 


Hs.347459 


412708 


R26830 


Hs.106137 


448569 


8E382657 


H&21486 


443329 


BE262943 


Hs.9234 


439018 


AW300887 


Hs>26638 


412627 


BE391959 


Hs.74276 


444309 


U83236 


Hs.10803 


412969 


AI373162 


Hs.75103 


430354 


AA9S4ai0 


Hs.239784 


414774 


X02419 


H5,77274 


450607 


AIJ050373 


Hs.25213 


421179 


U72664 


Hs.148495 


451796 


BE297567 


Hs.27047 


428428 


AL037544 


Hs.184298 


447656 


NM.003726 Hs.19126 


436823 


AW749865 


Hs.1 17077 


417898 


AA379770 


Ks.82890 


444193 


Y17801 


Hs.10574 


418741 


H83265 


Hs.8881 


414421 


AI521130 


H3.355126 


424887 


AI024880 


H5.1S3591 


442504 


BE503373 


K3.334335 


437651 


BE560672 


Ks.13543 


436540 


GE397032 


Hs.14468 


438000 


AI825680 


Ks.5985 


415697 


A1365603 


Hs^79696 


437469 


AW753112 


H3.15S14 


436319 


H90727 


Hs^123 


450126 


6E018138 


Hs^4447 


445985 


BE621800 


Ks.29444 


406868 


AAS0S445 


HS.300G97 


424439 


AA579635 


Hs.1770 


408452 


AA054683 


Hs.22272a 


417831 


H16423 


HsJ2685 


TABLE 11B 





NM_014452Hoino sapiens tunornecnids fa 3i08 

Hs.180059:KQnio88ptenscONARJ313606s m 

H3.63970:EST8 3L08 

NM.(X)2756:Homo sapiens mitogon-acMed a08 

^146428xollaoen. type V, alpha 1 3.08 

NM.000507:Nomo s^ns fructoso-1.&l}tsp ao& 

NM.019028:Hoino ssfoeits hypdh8&:al prat 108 

NM.012100:Homo sapiens sspartylaminopep a07 

(locusl!nk)NM_139248:Homo sapiens membra 3L07 

KsJ7477:EST8. Weakly similar to T46220 107 

Ks^34101:Homo sapiens. stmBar to chdi 3^7 

H5.21B58xerine (or cysteine) proteinase 107 

NM.021175ita)saplenshc^nanfiniicr 107 

(locu5lln)()NM.013384:Homo sapiens UG1 1 107 

NM.002S1 2:Homo sapens fKNHnelastaSc oe 107 

NM_017668:Komo sapiens hypottieiicai prot 107 

(locusrmk)NM_144691:Horno8apiuahypoth 107 

Hs.18112Staunag}obuiIn lambda locus 108 

H8^519379lbosomslprateln. large P2 106 

NM_013386i1omo sapiens hypolheScal prot 106 

NM.006406:Homo sapiens peroxiredoxin 4 ( 106 

K968507w03b12.8l Scares melanocyte 2NbH 106 

NM.001109:HQmo sapiens a disinlegrin and 108 

(locus6nk)NM„014164:Homo sapiens FXYDd 106 

NM_004381:Honu> sapiens cAMP responsive e 106 

Hs^24507:liypofhe9cal protein F1J209B6 106 

NR.004626:Homo sapiens wingisss-type MMT 105 

Ks.191379:ESTs 105 

H8.154986:EST8, Wealdy simOar to PaP_H 105 

Hs.7381nn)ttago-d8pendent anion ctiannel 105 

Ks.1 54672:methytene tetrahydrafolata deii 105 

Ks.20953:liypothetlca) protein BC010003 105 

Hs.74637:testisenti3nced gene transcript 105 

NM.016310:Homo sapiens polymerase (RNA) 105 

l4M_032039:Homo sapiens hypotiretjcai prot 105 

Hs.46328iuc03yltransterase 2 {secretor 105 

Hs.621:lectin,ga!actoside4)indlng,8o!u 105 

Hs. 1 04637:solute carrier famOy 1 (gtuta 104 

Hs.274263±ypotl)ellca! protein FU10377 104 

Hs.9930:serine (or cysts^) proteinase 104 

NI^018344:Homo sapiens hypothetical prot 104 

NM_013374:Homo sapiens programmed ceD d 104 

Nh4.017829:Homo sapiens cat eye syndrome 104 

NM.032824:Homo sapiens hypothetlca! prot 104 

NM.018339M)n¥> sapiens hypothetical prd 104 

H8.201076:EST8 104 

Hs.347459:Homo sapiens cONA FU13900 lis 104 

Hs.106137'i1omo sapiens mRNA for OK/SW<U. 104 

Hs21486»ignal transducer and acfivator 104 

N^032635:Homo sapiens seven transmembra 103 

NK4J}31457:Komo sapiens membran&spanning 103 

Hs.74276xhloi1de tntraceilular channel 103 

Hs.1Q803xa)dum and integrin binding 1 103 

NM.003406:Homo sapiens tyrosine l^nonoox . 103 

Hs.239784»aibbie 103 

NMJ)02658:Homo sapiens plasminogen aclhr 103 

NM.01 5677:Komo sapiens hypothetical prot 103 

NMJ)02810:Komo sapiens proteasome (proso 3.02 

Hs.27047:hypothetica] protein FU20392 102 

NMJ)0l799:Komosafdenscyc&rHlependent 102 

NM_003726:Homo sapiens sro family assod 102 

Hs.1 170772!nc finger prot^ 264 102 

K&82890:ds(snder against ceH death 1 102 

Hs.10574:solutB canter family 2, (fiacO 102 

K5^81:Homo sapiens cDNA FLJ32163 lis. 101 

(locu5linl()NM.144686:Homo sapiens hypoth 101 

NMJ)05787:Homo sapiens Noi56 (D. melanog 101 

NMJ)22484:Homo sapiens hypothetical prot 101 

(locusiink)NM.145214:Homos^en8trIp8r 101 

NM.020230itoiio sapiens peter pan homolog 100 

Ks^9B5:non^n3seCdo42 effector protei 100 

Ks.279696:OKFZP566]1024 protein 100 

Hs.15514:hypothelical protein M6C32eO 100 

Hs^123:hypolhe{IcaIprotdnBC008246 100 

(locusIink}NM.005866:Homo sapiens type i 100 

H&29444:putativesmaB membrane protein 100 

Hs^0697immunoglobulln heavy constant 100 

K5.1770:figase 1. DMA, ATPntependent 100 

Ks^28:Homo8aplenscONAilJ3g0048s 100 

Hs£2685:C047 antigen (Rh^eiatedanQge 100 
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Pkey: LNque Eos probesetidenSfler number 
CAT number Gene clister number 
Accession; Genbank' accession numbere 



Pkey 

406685 
432407 



459308 
452098 
451129 

TABLE lie 



CAT Number Aooessfan 
0.0 M18728 

MH1429_12 BG0366r5 BF772005 BFTZm BG960386 BG960381 NM.005712 AF110315 BE074534 BE182776 BE158000 BE157999 BE714315 AW818104 
AW847519AA099426 AW817981 AW856396 BG961122AA224498 AA308542 AW821833 BP9t)2155 A!732411 BG778834 BG283641 BE748279 
BE748870 BG319540 BE748864 BF739224 BG986155 AK057283 61861466 AA663341 AA457591 BG949294 AW392886 AA071122 AA227849 
AAS84918BG959570BF773486AL041698BF959013R87170 016889 BF770411 BF771298AI075321 L13823AA216700BF771864AWB61859 
BE537068 C1B935 AA155719 BF771 172 BF769107 BF804984 AW8iei72 AW8iai43 AW392930 AW817057 AW858044 BF746211 AA179928 
AVV861687AW821826BI055726BF242643AA207189BF770412BF771157BG430030AA055592 "VMfw^o 

223120.-4 AV\^8452 

161393.1 BG028348 BF772844 H83066 AW817g69 H90985 6F755039 AI858183 
1495511.1 Be072881AI762181BE072948 



Pkey. Unktue number corresponding to an Eos probese! 

Ref: Sequence source. The 7<Qgll numben In iMs cobm are Genbank Identifier (GQ numbers. 'Dunham L et eL' refeislo ttie 

sequence oC human chromosome 22.' Dunham t. et al.. Nature (1999) 402489-495. 
Strand: IndleatssDNA strand from which axons were predkited. 
NtpoSifion: Imficates nucleotide posntonsoTpredtetedexons. 



"TheDNA 



Pkey 


Ref 


Strand 


Ntjx)sifion 


406399 


9256288 


Minus 


63448-63554 


403220 


7630969 


Rus 


64338^517 


403218 


7630969 


Plus 


58039-56149 


403221 


7830969 


Rus 


66294^6438,6693&67124 


405484 


5922025 


Rus 


199214.199579.199672-199920.200262-20049 


400529 


9796988 


Rus 


138232-138423 


405556 


1552511 


Plus 


163497-163623,164715-164968,165369-16550 


403739 


7630882 


Phis 


44563-44766.4820^46483.52255^495 


403219 


7630969 


Rus 


61858-61995 


404826 


6572184 


Rus 


47726-48046 


400750 


8119067 


Rus 


198991-199168,199316-199548 


400646 


9188605 


Rus 


39310^4 


401179 


9438647 


Rus 


113477-113893 


400847 


9188605 


Rus 


44643-44835 


404854 


7143420 


Plus 


14260-14537 


400448 


9687667 


Minus 


177372-177674 


404240 


5002624 


Minus 


116132.116407.116653-116922 


402829 


8918414 


Rus 


101532-101852,102006-102263 


406363 


9256114 


Plus 


14403-14602,17000-17147,17241-17368 


402104 


8119072 


Phjs 


122409-122600 


402260 


3399665 


Mbus 


113765-113910.115653-115765.116808-11694 


403912 


7710730 


Mhus 


72000-7229a72431-72700,72929-73199 


400836 


8954179 


Rus 


677-1188 


400845 


9188605 


Rus 


34426-34612 


403217 


7630969 


Plus 


5408^163,55427-55623 


405387 


6587915 


Mbnis 


376&^5708-5895 


405203 


7230116 


Rus 


125295-125463 



Table 12A Dsts about 1006 genes up-regutated in cenrfcal cancer compared to nonna) adutt tissues. These were selected from 59680 piobesets on the Aflymetrixa»-Hu03 
toeChip aray such that the ratk> of 'average' cenrical cancer b 'average' nornial adult tissues was greater than or equal k> Z5. The "average" ceivlcal cancer level was set Id the 
1^.?";^^?.®^^*"^*'^^ The'average-nomHlBduIltlssgBtevelwasseltDthegs^percenlHovalueamongstnon^ 

spedfc background levels of ooiMpecHic hybrfcfeaBon. the 15^ percentile vdue amongst non-malignant tissues was sufalracted from both (he numerator and Die denominator before 
the rafio was evaluated. 

TABI^ 1 2A: 1006 GENES UP-REGULATED IN CERVICAL CANCER COMPARED TO NORMAL ADULT TISSUES 

Ray: Unk)ue Eos probeset tientifier number 
ExAoca* Exemplar Accession number, Genbank accession number 
UnlgenelD: Unigene number 
Urigene TlQe: Unigene gene fa'tle 



R1: 

Rtey 

402076 
425650 
418007 
435094 
439606 
452240 
444783 
417094 
424046 



Rafio of tumor to nonnal body 6ssu8 
ExAccn UnlgenelD Unigene Title 



U19557 

NM.001944 

M13509 

AI560129 

W79123 

AIS91147 

AK001468 

NhL006163 

AR127866 

BES45072 



Hs.123035 

Hs.1925 

Hs.63169 

Hs.329062 

Hs.58561 

Hs.61232 

Hs.62180 

Hs.80962 

Hs.136202 

Hs.122579 



squamous eel carcinoma antigen 2 (S0CA2) 
desmogtein 3 (pemphigus vutgads antigen) 
matibt metanopmteinase 1 finteistifiai ool 
EST 

G pmtebvcoupled receptor 87 
ESTs 

anann (Drasophia Scraps homolog). adin bl 



serine (or cysteine) proteinase inhibttor. d 
hypotheflcal protein aj10461 



R1 

81.1 
43.6 
38.9 
30.3 
28.8 
27.0 
26.0 
25.5 
24.5 
23.7 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



435505 


AF20O492 


Hs^11238 


lntateuldn-1 homotog 1 


21^ 


400289 


X0782O 


Hs.2258 


matrix metafloprotehase 10 (stromelystn 2} 


20.5 


418345 


AJ001596 


Hs^41407 


serins proletnasa inhibilor 13 (Pt13; serpin 


20.1 


452461 


N78223 


Hs.108106 


trenscHpfon tector 


19.8 


423017 


AW178761 


Hs.227948 


serine (or cysteine] proteinase biKbHor, d 


19.2 


428432 


AI678059 


HsJ202676 


synaptonemal oom^ protein 2 


17.5 


415817 


U88967 


H&.76867 


protein tyrosine phosphatase, reoeptor-type, 


16.4 


435243 


AW292886 


Hs.281373 


hypotheteal protein dJ434014.3 


16.3 


428864 


AK001666 


Hs.189095 


simOar to SALil (sal (DrosophflaHDe 


16.2 


428227 


AA321649 


H3.2248 


smaH inducible cytokine subtentlly B (Cys-X-C 


15.9 


421373 


AA808229 


Hs.1 57771 


ESTs 


15.7 


416209 


AA236776 


Hs.79078 


MAD2 (miiDtic arosl deiident yeast, hoirak) 


15.4 


441459 


AI919142 


HsJM4233 


ESTSk Wiealdy GbnOar to ALU1_HUMAN ALU SUBFAM 


14.6 


447164 


AFQ26941 


H8.17518 


Homo sapiens c^S mRN^ parfial sequence 


13.8 


412719 


AW016610 


H3.129911 


ESTs 


13.4 


417386 


BE185289 


Hs.1076 


smaO pidine^ protein IB (oomilin) 


13.3 


431753 


X76029 


Hs^l 


neifloniedin U 


13.2 


438817 


AI023799 


Hs.163242 


ESTs 


13.1 


404996 






TaigelExDn 


13.1 


443211 


A)128388 


Hs.143655 


ESTs 


12.9 


414764 


AW013887 


Hs.72047 


ESTs^ 


12.9 


^ 428618 


AA885360 


Hs.160199 


Target CAT 


12.7 


416661 


AA634543 


Hs.79440 


\GFA\ mI^NA4)tnding protein 3 


12.7 


421478 


A1683243 


Ks^7258 


ESTs, Moderately Mat to S29539 ritiosoma) 


12.6 


4294B6 


AF155827 


Hs^03963 


hypothetical protein nj10339 


12.6 


432666 


AW204059 


H$.312716 


ESTs, Weakly simitar to unnanied protein piodu 


12.6 


42758S 


031152 


Hs,179729 


collagen, type X alpha 1 (Schmid metaphyseal 


12.6 


419183 


U60669 


Hs.89863 


cytochrome P450. sutrfamlly XXIV (vitamin 0 24 


123 


450221 


AA328102 


Hs,24641 


cytoskeieton assodated protein 2 


12.2 


422168 


AA586894 


Hs.1 12408 


S100 calcium-t)inding protein A7 (psoriastn 1) 


11.6 


415989 


A1267700 


Hs.317584 


ESTs 


11.5 


452838 


U55011 


Hs.30743 


preferenOalty expressed antigen in melanoma 


11.4 


447048 


AW393080 


Hs.228320 


hypothec protein FU23537 


11.3 


453922 


AF053306 


Hs.36708 


tMidcfing uninhitnied t)y t^enzimidazotes 1 (yeas 


11.0 


449260 


AA741180 


HSJ29879 


ESTs 


11.0 


423049 


XS9373 


HS.188D23 


ESTs, Moderately Mar to HXOA.HUMAN H0ME06 


10.7 


406467 






Target Exon 


10.5 


439926 


AW014875 


Hs.137007 


ESTs 


10.2 


428388 


BE440042 


Hs.83326 


matrix metaBoprotemase 3 (stromely^ 1, pr 


10.2 


420759 


T11832 


Hs.t27797 


Homo sapfens cONA FLJ11381 Us, done HEMBA10 


10.2 


414422 


AA147224 


Hs^7232 


HQmeobaxA13 


10.2 


442660 


AW138174 


Hs.130651 


ESTs 


10.1 


449003 


,X76342 


H3.38g 


ekx)hoi dehydrogenase 7 (dass IV), mu or sig 


10.0 


408572 


AA055811 


HS.226S68 


ESTs, toderateiy similar to ALU4.HUMAN ALU SU 


9.9 


439B20 


AL380204 


Hs.283853 


Homo sapiens mRNA full lengfl) insert cDNA do 


as 


400195 


NM_007057 


NI^007057*:Homo sapiens ZW10 interactor (ZWll^ 


9.8 


422426 


W79117 


Hs.58559 


ESTs 


9.7 


447700 


AI420183 


Hs.171077 


ESTs« Weakly similar to T21259 hypolhetk:al p 


9.7 


458194 


AW383818 


Hs.265459 


ESTs, Moderately 8im9ar to ALU^IHUMAN ALU SU 


9.4 


455601 


AI388680 


Ks.816 


SIW (sex detemilning regton Y)-box 2 


9.A 


418882 


NM_004996 Hs.B9433 


ATP-bin(fing cassette. sut>-family C (CFTR/MRP) 


9.4 


437789 


AIS81344 


Hs.127812 


ESTs, Weakly shnilar to T17330 hypolhedcal p 


9.4 


419247 


S65791 


H3^g764 


fragile X mentsd retardation 1 


9,1 


419750 


AU)79741 


Hs.183114 


Homo saptens cONA RJ14236 lis, dona Kr2RP40 


9.1 


420602 


AR)60877 


Ks.99238 


regulator of rotein signalling 20 


9.0 


428845 


AL157579 


Hs.153610 


K1AA0751 gene produd 


9.0 


425427 


M86699 


Hs.169840 


TTK protein ktoase 


9.0 


-■^429538 


BE1825g2 


H3.11261 


sman proDne-fich protein 2A 


9.0 


446232 


AI28184B 


Hs.194691 


retindc acU induced 3 


as 


430520 


NWL016190 Hs.242057 


chromosome 1 open reading frane 10 


8.9 


439772 


AL365406 


Hs.10268 


Homo sapiens mRI^ full length insert cONA cto 


8.9 


424905 


NM-002497 Hs.153704 


NIMA (never in mitosis gene a}-felated kinase 


8.9 


423020 


AA383092 


Hs.1608 


replicadion protebi A3 (14kD) 


8.9 


430486 


BE082109 


Hs^41551 


chioiide channel, caidum activated. fanHiy m 


8.7 


452291 


AR)15592 


H3.28853 


C0C7 (ceO division cyde 7. S. cerevislae, h 


a7 


432193 


AA372264 


Hs^73193 


hypotheHcaJ protein FU10708 


&7 


407642 


AW17B963 




glKMRO^TD032-2008994X)1-l>11 8T0032 Homo sapi 


a7 


422420 


U03398 


Hs.1524 


tumor neciQsfs factor Qigand) supeftenily, m 


a7 


458027 


L49054 


Hs35195 


myelddtoukeinia factor 1 


8.4 


1 424086 


AI351010 


Hs.102267 


lysyl oxidase 


&3 


420092 


AAB14043 


Hs.88045 


ESTs 


8.3 


449034 


A1624049 




^:ts41a09j(1 Na_CX^_Ut1 Homo saptens cDNA 


&3 


408522 


AI541214 


Hs.46320 


SniBsO proline^ protein SPRK [human, odont 


a2 


418478 


U38945 


Hs.1 174 


cycCiHtependent kinase Inhibitor 2A (melanom 


a2 


436279 


AW900372 


Hs.180793 


ESTs. Weakly dirflar to $65657 alpha-IOadren 


a2 


440834 


AA907027 


Hs.128606 


ESTs 


a2 


452724 


R84810 


HS30464 


cydInE2 


ai 


423849 


AL157425 


H5.133315 


Homo saptens mRNA; d)NA DKFZp761J1324 (fiomc 


8.1 


410044 


BES66742 


Hs.58169 


highly expressed in cancer, rich In leucine h 


ao 


! 429228 


AI553633 


Hs^7139 


ESTs 


7.9 


438915 


AA280174 


Hs^85681 


WiSams-Beuren syndrome chfomosomaregton 14 


7.9 


425710 


AF030880 


HS.1S9275 


sdute carrier family, memt)er 4 


7.8 


421308 


AA687322 


Hs.192843 


tetidne zipper protein FKSG14 


7.8 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 

80 



435159 
439232 
437616 
406554 
433133 
424098 
418134 
446364 
447254 
414148 
429648 
423725 
450192 
450149 
431941 
427821 
436211 
425761 
450026 
409719 
444342 
413573 
422330 
45498B 
447342 
403471 
409041 
407839 
415652 
420900 
444271 
410153 
448593 
431622 
457405 
424345 
414812 
42S734 
446435 
421948 
419335 
429259 
406747 
453884 
423735 
421773 
457435 
450025 
427043 
409723 
459462 
423244 
427217 
441620 
4379SB 
430791 
416734 
413385 
434828 
414299 



AA668879 H5.1 16649 
N46590 Hs.45693 
AI797163 Hs.207954 



413753 
413625 
416049 
415064 



451381 
415900 
444478 
449811 
420837 
438639 
414972 
438394 
428365 
407746 
452401 
432239 
451307 
441631 
418663 
410659 
432869 



AB027249 

AF077374 

AA397769 

AB006624 

NM_004153 

BE084049 

AW138872 

AJ403108 

AA263143 

AW969781 

AK000106 

AA470158 

AK001581 

AW664214 

AI912012 



Nlil.014398 

Ar733859 

D30783 

AW850140 

At199268 

AB033025 

AA045144 

T79213 

AL045633 

AW452569 

BE311926 

AW004854 

AW979271 

AA504860 

AK001380 

X72755 

AF056209 

AW206737 

L42583 

AW960146 

AA420450 

At925153 

AA35S925 

AA330259 

W69233 

AW972024 

AK001875 

AA397679 

AWQ85757 

AA481396 

AIJD39379 

AA399272 

AA969119 

BE139550 

AA486293 

H81213 

M34455 

090070 

AA142989 

AA090421 

U17760 

AW451103 

AI970536 

AA159804 

NhL005401 

BE241831 

Z43758 

W07318 

A1970394 

AW976153 

A]278360 

BE263782 

6E379623 

AA295331 

AK001962 

NM.007115 

X81334 

AW293207 

AW291239 

AK001100 

At080175 

AW974094 



Hs.104741 

Hs.139322 

Ks.86617 

Hs.14912 

Hs,17908 

Hs.135268 

H8.1321Z7 

Hs^4598 

Hs,132e63 

H5.272227 

H8.98202 

Hs.334828 

H3.ig6729 

Hs.200737 

Hs.108681 

H8.10887 

Hs.149089 

Hs.115263 

Hs.19322 

Hs,50081 

Hs.161566 

Hs.272073 

Hs.44269 

Hs.149804 

Hs.15830 

K&228320 

K&293184 

H5.145479 

H8.77367 

H1159396 

H&253582 

Hs.334309 

Hs.284137 

Hs.292911 

H8^17493 

Hs.36232 

Hs.112457 

Hs.142653 

Hs.24321 

H8.3g91 

KS.2S7862 

Hs.105167 

Hs.209602 

Hs.144341 

Hs.143502 

Ks.121668 

Ks.272068 

Hs.14825 

Hs.840 

Hs56 

Hs.71730 

Hs.5555 

H3.75517 

Hs.71371 

H&16603 

Hs.149305 

Hs.159238 

Hs.172330 

Hs^7 

K&240 

Hs.197075 

Hs.31409 

Hs.77695 

Hs.27693 

Ks.183881 

Hs.38114 

Hs^2 

Hs.2936 

H8.211S16 

HS271111 

K8.41690 

Hs.68826 



ESTs 7J 

EST8 7.7 

ESTs 7.6 

Tafflet Exon 7.4 

PDZ-lilnding kinase: T-ceO odginated piotein 7.4 

smaOprelind^ protein 3 7J 

ESTs 7.2 

K1AA0288 protein 7.2 

origin rscogniOon complex, subunit 1 (yeast 7.1 

glRPM^BT0651.27O400mR)2 BT0651 Homo sapi 7.0 

ESTs 7.0 

hypotheticai protein LOC57822 7.0 

iW)51-{nteracOng protein 7.0 

2c£3miiyfnembCT2(odd-p^DrosophBaiio 6.9 

Homo sapiens cDI^RJ20099iis, done COL0454 6.9 

ESTs 6.9 

hypdheeca]pro(einHJ10719;K!AA1794prote &g 

ESTs &9 

ESTs &8 

Homo saptensbiain tumor assodated protein N &8 

similar to lysosome-assodsted mainbrane glyoo 6.8 

ESTs 6.6 

eptregulin &8 

gb:IL3H:nO219-261099^23-D1 1 CT0219 Homo sap( 6.8 

Homo sapiens, Similar to RtKEN cONA 2010317E2 6.8 

Target Exon 6.7 

KiAA1199protein 6.7 

ESTs &6 

ESTs a6 

ESTs 6.6 

ESTs 6.6 

Irypothetical protein FU12691 6.5 

hypotiwtlcai protein FU23537 6.5 

ESTs 6.5 

gb'.ab03a10.s1 Stratagene fetei retina 937202 6.4 

Homo softens cDNAFU10518fis, done NT2RP20 6.4 

monoldne induced by gamma intertiBTon 6.4 

pepttdylglydne a^tia-amldaBng monooxygenase 6.3 

ESTs 6.3 

keratin 6A 6.3 

hypoUietic^ protein RJ12B8d 6.3 

ESTs, H^hly similar to S60712 band-6frotein 6.2 

annexin A2 6.2 

KiAAOl 86 gene product 6.2 

gb:EST33963 Embryo. 12 week II Homo saptens c 6.2 

ESTs 6.2 

ret finger protein &1 

Homo sapiens cDNAFU1202Bfi8. done HEMBB10 6.1 

ESTs 6w1 

ESTs &1 

ESTs R1 

ESTs. WeaUystmllar to ubiquitous TPR motif, &0 

ESTs ao 

ESTs. WeaMy similar to envelope protein [Rs 6.0 

ESTs. Moderately slmnar to PC425g fem'lin a aO 

ESTs, Weakly similar to AUJ3 JiUMAN ALU SUBFAM B.9 

ESTs, Weakly similar to KIAA1S03 protein [H.S &9 

Inddeamine-pyrrole 2,3 dioxygenase &9 

phoibo)-12*myrist3te-13«cetet»inducedprote a9 

ESTs as 

tiypotttetkai protein MGCS347 a6 

laminin, bete 3 (nicein (125kO}. kalinin (140 aS 

ESTs as 

hypothefical protein FIJ13163 aS 

liypotheOcd protein MGC2603 a7 

protdn tyrosbie phosptiataso, nuHeceptor ty a7 

hypothetica) prot^ MG(2705 a7 

ESTs a7 

Mi)ti8se phosphoprotein 1 a7 

ESTs a7 
0b:E8T388282 K4AGE resequences. MAGN Homo sapl a7 
ESTs a7 
KIAA0008 gene product a7 
peptidyipro(ylisomerase(cydopliin}4'ke1 aO 
HomosaptenscONARJ20042ti$.don8COlJ0042 a6 
hypothetfcal protein FU11100 a6 
tumornecrosis factor, alphfrinduoed protein aS 
matrb( metaltoproteinase 13 (odtegenasa 3) a5 

ESTs as 
ESTs as 

desinoooliin 3 aS 
ESTs as 

gb£ST386197MAGEresequenc8S,MAGMHomo8q)I aS 
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431255 AA497043 HS.11S685 ESTs 5.5 

407386 AR)2G942 gb'itomo sapiens dg33mRNA,p3r^6equenco. 5.5 

433091 Y12642 Hs31B5 lynvhocytesnfigen 6 complex. k)cusD 5.4 

_ 418502 R99288 Hs.35152 ESTs 5.4 

5 440320 AA879294 gb3iw8Se09.s1NCLCGAPJ>r1 2 Homo sapiens cONA 5.4 

439579 AR)86400 gtxKonio sapiens fuH length (nsertcONA dona 5.4 

4207B3 AI659838 HS.9S923 tecatv gaiactostde^inding. soiuUe. 7 (gala &4 

408538 AW381532 HS.13518B ESTs 5.4 

^. 408758 NM.003586 Hs.47504 , exonudsasel 5.4 

10 451411 AA017492 Hs.135655 EST 5.4 

424834 AK001432 Ks.15340d Konu) sapiens cDNAaJ10570 lis. done NT2RP20 5.3 

407853 AA33S797 H5.40499 djckkDpr(Xenopustaevi5)homolog 1 5.3 

412140 AA219691 Hs.73625 RAB6 Inlerading, kinesln4iks (latddn^ 5.2 

420026 AI831190 HS.16G676 ESTs 5.2 

15 427356 AW023482 Hs.97649 ESTs 5.2 

420440 NM.002407 Hs.97644 mammagloliln 2 5.2 

430082 AW514083 Hs.190135 ESTs 5.2 

445259 AI798994 Ks.152923 ESTs S2 

457345 AI699933 Hs.192l75 ESTs 5.2 

20 453161 AA628G08 Hs.61656 ESTs 5.2 

445019 Ai20S540 HS.28129S ESTs 5.2 

425420 BE536911 Hs.234545 hypolhetical protein NUF2R 5.2 

439706 AW872527 HS.S9761 EST8.WeakiyttmllartDDAP1J<UMANDEAnVASS0 5.2 

431494 AA991355 Hs.298312 hypolheScd pnM DKFZp434A1315 &2 

25 443179 AI926402 Ks.6933 hypolhellcal prolsin FU12684 5.2 

432226 AW182766 H3.273558 phosphate cytklyiyUrensfefBset &2 

441020 W79283 Hs.35962 ESTs &1 

437044 ALJ035B64 Ks.69517 cONA for dHTerentlaDy expressed C01 6 gene &1 

419520 AB009303 H5.90800 matrix matalloproteinase 16 (membrane^nserte 5.1 

30 430563 AA481269 Ks.108660 ATP4Aiding cassette. 8Ub'MlyC(CFrR/MRP) 5.1 

405547 NM.018833*:HomosapienstranspQrter2,ATIM]i 5.1 

435206 AI432364 Hs.160594 ESTs 5.1 

409269 AA576953 Hs.22972 hypothetical protein aJ13352 5.1 

439223 AW236299 Hs.250618 U116 tilnding protein 2 5.1 

35 413251 Ai932903 H&211535 ESTs 5.1 

426320 W47595 HS.1G9300 transforming giowth factor. t)eta 2 5.1 

456829 A1557388 gb:PT2.1X603/tuinor2 Homo sapiens cDNA 3*, 5.0 

416208 AW291168 Hs.41295 EST8.Weakly6lniltartoMUCUfUMANMUaN2PR 5.0 

432473 AI202703 Hs.152414 ESTs 5.0 

40 416738 AW388633 Hs.6682 solute carrier famBy 7, (caMc amino add &0 

423634 AW95g908 Hs.1690 heparln-blnding growth factor tiinding protein 5.0 

428350 NM-003245 H5.2022 Iransgtutaminase 3 (E poiypeptids. protebi^ 5.0 

432867 AW016936 Hs.233364 ESTs SlO 

449448 D60730 H3.57471 ESTs 5.0 

45 409744 AW675258 Hs.56265 Homo sapiens mRNA:cDNAOKFZp586P2321 (frame 4.9 

405857 C700024&gtI72477|pir|lDVHY1Cmultldrugre8!s 4.9 

429682 NM.006306 Hs.211602 SMC1 (structural maintenanoe of chromosomes 1 4.9 

446704 AI337228 Hs.197083 ESTs 4.9 

434376 AA631492 Hs.23921 hypothaficd pioleb DKFZp547A023 4.9 

50 407378 AA299264 Ks£7776 ESTs, litoderetely similar to 138022 hypothetic 4.9 

421155 H87879 Hs.102267 lysyl oxidase 4.9 

443335 T89697 Hs.16645 ESTs 4.9 

444461 R53734 H5.25978 ESTs, Vfeaidy similar to 2109260AB cell growt 48 

410361 BE391804 Hs.62661 guanytate binding protein l.interferoivinduc 4.8 

55 423673 BE003054 HS.1G95 matrix metaUopioleinase 12 (macrophage elast 4.8 

441720 AI346487 Hs.28739 ESTs 4.6 

442980 AA857025 H5.8878 Idnesbv4ike1 4.8 

450375 AA009647 Hs.8650 a distntegrtn and metafloprotelnase domain 12 48 

417592 AA204664 Ks.182437 ESTs. WeaMy similar to 154383 chiomosomeseg 48 

60 429846 AB023021 H8.228945 (iioo6yltransfBfase9(a]pha(1.3)1uoesylban 48 

418939 AW630803 Hs.89497 Iambi 81 47 

417235 AA810278 Hs.24250 ESTs 47 
411958 AA099020 gb2n45h01. Si SIratageneHeLa ceil 83 937216 47 

^. 433858 N69243 Ks.192974 hypothetical protein FU12735 47 

65 ' 450434 AA166950 Ks.195670 hypothaScal protein inJ14991 47 

418379 AA218940 H3.137516 fidgeting 1 47 
401747 Homo sa^nens Iceratln 17 (KRT17) 47 

439759 AL359055 Ks.67709 Homo salens mRNA Ml length Insert cDNA do 47 

_^ 441421 AA356792 Hs.334824 hypolhetical protein FU14826 4.7 
70 457465 AW301344 KS.12290B DMA replication fador 4.6 
433159 AB035898 Hs.150587 Mnesii>4ike protein 2 4.6 
412333 AW937485 gbK}V3-DT0044-221 299^)454)09 DT0044 Homosapi 46 

401137 TagetExon 46 

401575 Target Exon 46 

75 423448 AKOOOne Hs.128753 Homo sapiens cONAFU20769fis. done (X}U)867 46 
421978 AJ243862 Hs.110196 NIC&-1 protein 46 
408728 AL137379 HS.4712S hypothetical protein FU13912 4.6 
431956 AK002032 Hs.272245 Homo sapiens CDNAFU11 170 fis. done PLACE10 4.6 
450510 AA010056 Hs.242998 ESTs 46 
80 436291 BE568452 Hs.5101 protein regulator of cyto)(tob 1 46 
424902 Nlk\.003866 Hs.153887 InosUol poiyphosphate^phosphatase. type 11 46 
428484 mwm Hs.184601 sotute carrier family 7 (cattorA^ambio add 4.5 
449416 AI651016 Hs.246311 ESTs 45 
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416168 H23687 glxynTZjIZrl So8resaluttbrdnN2b5HB$SYHo 4^ 

447D33 AI357412 K8.157601 ESTs 4.5 

446353 AI290919 Hs.153861 ESTs 4.5 

- 443715 Ai583187 H5.97D0 cycOnEI 4.5 

5 454707 AW814989 gb:MR1-STD208>17040(M)24i)05ST02Q6 Homo sap! 4.5 

435435 T89473 HS.19232B ESTs 4.5 

412099 U6419a Hs.73165 intateiiUn 12 receptor, beta 2 4.4 

409731 AA12S985 Hs.56145 thymosin, beta Uenli&ed In neurotlastornac 4.4 

422809 AK001379 Hs.121028 hypothetical protein FU10549 4.4 

10 430919 AA489041 H5.295448 ESTs 4.4 

435313 AI76940G Hs.189729 ESTs 4.4 

425071 NM_013989 Hs.154424 delodnase. lodoihyronlne. type D 4.4 

433322 H50621 Hs.134156 ESTs. WeaMy similar to 138022 hypothetical p 4.4 

416111 AA033813 Hs.79018 chromaSn assembly factor l.subunit A (pi 50) 4.4 

IS 408908 BE296227 Ks.250B22 setineAhreonbie idnase 15 4.4 

444781 NML01440O Hs.11950 GR^ored metastasis-associated pretein ho 4.4 

429170 NML001394 H5.2359 dual spectfictty phosphatase 4 4.4 

414035 Y00530 Hs.75716 serine (or cysteine) piDtdnaselnhibBor.cl 4.4 

41B216 AA562240 Hs.283099 AF15q14proldn 4.4 

20 446252 AI283125 Hs.150009 ESTs 4.4 

447519 U46258 Hs.339665 ESTs 4.4 

425916 NM.005786 Hs.162200 un}tensln2 43 

409420 Z1S008 H554451 lainlfan.gamma2(niceln(100kP).lQ&iln{10 4.3 

416320 H47857 Hs.34024 ESTs 4.3 

25 431808 M30703 Hs.270833 8mphIregtdin(sctiwannom»<ierivedgn»vIh facto 4.3 

441582 BE550200 Hs.127197 ESTs 4.3 

414132 A1801235 Hs.48480 ESTs 4.3 

424012 AW366377 Hs.137569 tumor protein 63 kOa with strong homology to 4.3 

411835 U29343 HS.72S50 hyafuronan-mediated motirity receptor (RHAMM) 4.3 

30 433330 AW207084 Ks.132816 hypothetical protein MGC14801 4.3 

428613 AB037749 Hs.18692B KiAAl 328 protein 4.3 

425921 NhiL007231 Hs.162211 solute earner family 6 (neurotiansmittertTa 4.3 

447078 AW885727 H8.301S70 ESTs 4.3 

434699 AA6436S7 Hs.149425 Homo sapiens cDNARJ1 1980 lis. ctoneHEMBSIO 4.3 

35 428756 AA433988 Hs.98502 hypothetical protein FU14303 4.3 

405708 Target Exon 4.3 

433405 AW157566 Hs.156892 ESTs 4.3 

456443 AWg67500 HS.133S43 ESTs 4.3 

428479 Y00272 Hs.184572 cell division cycle 2. 61 to S and G2 to M 4.2 

40 448621 A1097144 Hs.5250 ESTs. Weakly similar to AUI1.HUMAN ALU SUBFAM 4.2 

412608 AA247995 Hs.44698 Homosaplens done TCCCTA001 51 mRNA sequence 4.2 

406687 M31126 Hs.272620 pregnancy specific beta-1-^lyooprotein 9 4.2 

455365 AW948343 sb:RC0*MT0015-13040OO31<01 MT0015Homo6apt 4.2 

452930 AW195285 Hs.194097 ESTs, Weakly similar to 138022 hypotheScalp 4.2 

45 430134 BE380149 Hs.105223 ESTs. Weakly similar to T33188hypolhe&!aIp 4.2 

423035 AW449679 Hs.156739 H.sap)ensXG mRNA (done PEP11) 42 

427666 AI791495 Hs.180142 calmodiMke skin protein 42 

444602 AI174456 Hs.271925 ESTs. Moderatolysinnlar to 138022 hypotheBc 4.2 

417791 AW965339 Ks.111471 ESTs 42 

SO 444266 AI424984 Hs.125465 ESTa 42 

439394 AA1492S0 Hs.56105 ESTs 42 

457336 AW969657 Hs.291029 ESTs 42 

429125 AA446854 Hs.271004 ESTs. Weakly sbnBar to 138022 hypothetical p 4.2 

404440 NM-021048:Homo sapiens melanoma antigen, fami 42 

55 449228 AJ403107 Hs.148590 protein itiialed with psoriasis A2 

437144 AU)49466 HS.78S9 ESTs 4.2 

448599 AW880912 gb:QVO-CT0387.17O20O>12lH:05CT0387 Homo sap) 42 

431810 X67155 Hs.270845 ktnesb)^ 5 (mitotic Mnesin^Iks protein 42 

419991 AJ000098 H5.94210 eyes absent (OrosophHa)homotog1 42 

60 444361 W76027 Hs.23320 liypotheltodi protein FU11105 42 

458116 AW977549 Hs.47367 KIAA1785 protein 41 

444105 AW189097 Hs.166597 ESTs 41 

426010 AA136563 Hs.1975 hypothetical prot^FU21 007 41 

409632 W74001 Hs.55279 serine (or cysteine) proteinase Inhibitor, d 41 

65 408096 BE2S0162 H&83765 dliydrofbye reductase 41 

457820 AA602711 Hs.336753 EST 41 

402048 Target Exon 41 

427025 AA397589 Hs.97523 ESTs 41 

423515 AA327017 H3.162204 ESTs 41 

70 423891 AK002042 Hs.134795 Homosaplens cDNAFUIIISOfis, done PUCE10 4.1 

455310 AW893961 gbJ^C4^N0027-060400^11-d11 NN0027Homosapl 41 

418696 AW959433 Hs.326290 hypotheOcd protein FU12581 41 

426642 AW088223 Hs.171581 ubiqidUn C^tenninal hydrolase UCH37 4.1 

423738 AB002134 Hs.l32195 airway trypsirvlike protease 41 

75 448243 AW389771 HS.S2620 inlegrin. beta 8 41 
411559 BE144081 gb:MR&-HT016&<)30200^7406HT0165Homosapl 41 

423553 AA405635 Hs.968S4 ESTs, Weakly sin^ar to 0YLXJ1UMAN CYTOPLASMI 41 
457030 AI301740 Hs.173381 dihydropyrtmldlnase^ 2 41 
424115 AA33S497 Hs.293g65 ESTs. Weakly similar to 138022 hypothetkalp 41 
80 432374 W68815 Hs.301885 Homo sapiens cDNAFU 11 346 fis, dona PUCE10 41 
424745 AA214618 Hs.152759 edhrator of S phase kinase 40 
433384 AI021992 Hs.124244 ESTs 40 
448995 AI613276 Hs.5662 guartlRanudeotkleblndino proteta(G protein 4.0 
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448504 
432009 
432441 
424794 
432184 
408321 
417801 
445537 
447724 
446155 



420252 
412811 
415857 
438390 
428949 
429900 
421270 
430733 
454241 
424131 
457059 
424717 
404959 
411571 
426536 



432757 
418686 
437845 
413801 
419312 
429441 
410553 
444665 
438014 
442163 
438656 



407395 
404132 
409601 
439238 
433289 
436149 
446292 
444078 
40S545 
442875 
409687 
407291 
415532 
402047 
415317 
438777 
422938 
423217 
405943 
430885 
458242 
425733 
418582 
453900 
433849 
449592 
453028 
435612 
417742 
418735 
444707 
421508 
450813 
418969 
432837 
442353 
423441 
449978 
410784 
409582 
428182 
458446 
433040 
452183 



AIB58128 Hs.171136 ESTs 

AL137424 H$.306458 Homo sa|teisniRNA:cONADKF2p761G2123 (tome 

AW292425 Hs.163484 ESTs 

M85646 Hs^0695 ESTs 

AW971 125 gb:EST383212 MAGE resequences, MAGL Homo sapi 

AW405882 Hs.44205 cortistaQn 

AA417383 H$^2582 integiin.beta4ike1 (MiQiEGF-like repeal d 

AJ245671 Hs.12844 EGF4ik&<!omain. midOple 6 

AW298375 Hs.24477 ESTs 

AI553895 Hs.159422 Homo sapiens CDNARJ13997 lis. done Y79AA10 

AK001898 Hs.16740 hypofhelicalpratelnFUIIOSS 

AW270404 Hs.193161 ESTs 

H06382 Hsi1400 ESTs 

AA866115 Hs.127797 Homo saf^cDNARJ1 1381 fis, done HEMBA10 

A)422017 gb.i{45f12j(1 NCLCGAP_Bm23 Homo sapiens cDN 

AA4421S3 Hs.104744 hypotfieOcal protein DKFZp434J0617 

AM60421 Hs^0875 ESTs 

H56037 Ks.108146 ESTs 

AW97S920 Hsl83361 ESTs 

BE144666 gb.*CM24n'017&O4109»O17^2 HT0176 Homosapl 

AA335714 Hs.199665 ESTs 

BE561665 Hs.1 77677 exosome component Rn>40 

H03754 Hs.152213 wingless^ MMTTV integraibn sits famfly. m 

NM.02S001*:Homo sapiens trypotheOcal pfotrin 

AA122393 Hs.70811 liypotheticai protein FU2051 6 

Ai143139 Hs^ visinliv4i]»1 

AA385751 Hs.196379 ESTs. Weaidy similar to putative pi SO (H.sapi 

AF113013 Hs.278919 PRO0B08 protein 

Z35830 Hs.87268 annexInAS 

AA769578 Hs.90488 ESTs 

M62246 H$.35406 ESTs, Highly similar to unnamed protetn produ 

AA831850 Hs.58149 tiypotltetical pmlein MGC14136 

AJ224172 Hs.204096 lipophrinB(uteroglot)inlami]yinemtier).pro 

AW016824 Hs.255527 hypoUieScal protein MGC14128 

BE613126 Hs.47783 B aggressive lymplioma gene 

N71183 Hs.121806 Homo s^ilenscDNAFUl 1971 tis, done KEMBB10 

Ai791749 H3.128896 ESTs 

H85310 Hs.209456 ESTs. Weahfy similar to NG22|H^ns) 
ENSP0000001G943-.'CDNA 

AF005082 gbiHomo sapiens skin-spedfic protein (xp33) 
Target Exon 

keratin 1 (epklermolytlc hyperlceretosis) 
ESTs 

glKHomo sapiens lamlnin alpha 3b chain mRNA, 
ESTs 

Rh type C glycoprotein 

U5 snRNP-spedSc 40 kOa protein (hPrp&^indl 
(MDFVrAP)(TAP2) 

6E623003 Hs^5 Homo sapiens done TCCXn^A00142 mRNA sequence 

T51125 Hs^93 ESTs 

AA001464 gb3e45b0lJl Soaes retina N2b4HR Homo sai^ 

R14760 Hs.12826 ESTs 

AKD01921 Hs.169575 hypolh^ protein MGC2550 

Z43388 gb:KSC1AF121 nonnalized infant brain cONA Horn 

AA825487 Hs.142179 ESTs 

NM.001809 Hs.1594 centromere protein A (17kD) 

NNL000094 Hs.1640 collagen, type VII, alpha 1 (epktenndysisbu 
Target Exon 

NM.001942 Hs^ desmogldnl 

BE299588 H3^8465 Homo sapiens cONA:FU21869tis» done HEP024 

F13287 Hs.159388 Homo sapiens done 23578 mRNA sequence 

BE244318 Hs^13194 hypothetical protdn M6C10895 

AV\I003582 Hs^14 ESTs. Weakly similar to AUJ8J1UMAN ALU SUBFAM 

BE465884 HS.2B0728 ESTs 

A1655494 Hs.195718 ESTs 

AB006532 HsJ1442 RecQproleln4ik84 

AA693537 HsJ21411 ESTs 

R64719 gb£ST22d11WATM1Homo8aplenscOHAd0ne22d 

N48769 Hs.44609 ESTs 

AI188813 Hs41690 desmoodIln3 

NM.Q04833 Hs.105115 absent In melanoma 2 

AI702055 gb.1q20g10j(1 NCLCGAP.UtI Homo sepiens cONA 

W33191 Hs.28907 hypothsflcal protein FU20258 

AA310693 H8.67329 HSPC072protek) 

BE379594 Hs.49136 ESTs. Moderately slmOar to ALU7JHUMAN ALU SU 

R6B649 Hs778359 absent In melanoma 1 Eke 

AI806335 HiMm ESTs. V^teaklyslmbr to T30171 nindn- mouse 

AW803201 9bdL24JIVl0077-0705004)80-EOB UM0077 Homo sapl 

R27430 Hs^1565 ESTs 

BE386042 Hs.293317 ESTs, Wfeakfydmilar to GGCIJttJMANG ANTIGEN 

AW394104 Hs.43744 ESTs^ Moderately sbnBar to 154374 gene NF2 p 

H70423 Hs^ll ESTs 

AASSTSSI Hs.184993 ESTs 
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AF237621 Hs.80828 

N47305 Hs.46668 
AF00S258 

AI764308 Hs.159452 

AF081497 H3^9682 

BE246919 Hs.10290 



4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
3.9 
19 
3.9 
3.9 
3.9 
3.9 
3.9 
3.9 
3.9 

as 

3.9 
3.9 
3.9 
3.9 
3.9 
19 
3.9 
19 
19 
19 
19 
18 
18 
18 
18 
16 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
• 17 
3.7 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
3.7 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
16 
16 
16 
16 
16 
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408771 


AWr32573 


Hs.47584 


potssslum vo!taD8-aatBl diannel deiavfid-fGct 


3w6 


440046 


AW402306 


Hs,6877 


hypotheljcd pro^ FU10483 


3l6 


459575 


BE080825 




gb:QV1'6T0631>1802Q(V07ftcQB BfT0831 Homosaoi 


16 


400441 


Ml 5530 


Hs.99879 


B-oeQ 9tKyth factor 1 (12kO) 


3.6 


440138 


AB033023 


Hs.318127 




16 


417830 


AWS04768 


Hs.122579 


hnoUieQcd orateih HJ104fi1 


16 


427131 


AA448460 


Hs.1 12017 




16 


413278 


BE563085 


HsJ33 


{nIfiffeTOivfiiimidatef! rvnlptn 1*% bn^ 


16 


420373 


AW968228 




g(b£ST380198 MAGE resequences, MAGJ Homofiapi 


16 


414136 


AA812434 


Hs.1 19023 


SMC2 fstnjcttiral mrfntanatKMi nf chmmnsfunas 2 


16 


424296 


AI631874 


Hs.155140 


cflscin kbi8S8 2i fil^fis 1 polypepfi^B 


16 


424639 


AI917494 


Ks.9812 


Hbfnofiai)!fins cDNA PLSMSSR fie dom HPMRAIQ 


W.V 


419198 


AF110908 


Hs.297660 


TNF recflntQr-a<KnelntMl ffvrfnr ^ 

1111 f wvQ|/U/I^S90UWc3llCv luvWl w 




434321 


AA629368 




gb:uj78a1 13I Scares_tesfis_NHT Homo sopiens 


16 


426514 


BE616633 


Hs.1 70195 


bone (mphogenetfc protstn 7 (osteoQsnic pfot 


16 


427335 


AA448542 


H9.251677 


GanQgen7B 


16 


413472 


BE242S70 


Hs.75379 


golute C8n1er family 1 (gTial hiQh sfSnlty 


16 


408000 


LI 1690 


Hs.620 


bullous pemphipokl aniD^Befl 1 (23QI240kO) 


16 


426749 


A162371B 




ESTs 




443899 


AW&422B3 


Hs 79933 


E^s HkiMv similar to &21424 nesfin IRsaol 


AD 


440664 


AI253123 


Hs, 127356 




16 


424927 


AW973666 


Hs 153RHQ 






408591 


AR)15224 


Hs.46452 


fn3JTim3QioUn 1 




408867 


AA437199 


Hs.656 


cell (fivfeion cycle 250 


15 


428508 


BE2S2383 


Hs.184666 


SBBP1 protein 


15 


43112D 


AA492S88 




gbu^99c08.s1 NCLCGAP.Thyl Homo sapiens cDNA 


15 


401760 






HtA 0fl5SS7*'Hoim finnipns knrafin Ifi ffrv^ imn 
i»i»(_vw«*»*o# >riuiiiu o(i|M«iis ivCHaiiii lo ^tUfOi iivm 


15 


409695 


AA296961 








456671 


AB011142 


Hs.1 14293 


IGAA0S7D oprtR nforiiirl 


15 


412155 


R38167 


Hs.1 2449 


ftantnnQ IninstnnmiifsirM nmMn HTMPin MT 




435244 


N77221 


Hs.187e24 


ESTs 


3.5 


436246 


AW450963 


Hs.1 19991 


ESTs 


15 


431917 


D16181 


Hs.2868 


per^rtteid myelin protein 2 


15 


443113 


AI040666 


Hs.132908 


ESTs 


15 


443341 


AW631480 


H&8688 


ESTs 


15 


407756 


AA116021 


Hs.38260 


ubif^uiOn specific protease 1 8 


15 


418347 


AA21G419 




gb:nc16e03^1 NCLCGAPJ'rl Homo sapiens cONA 


15 


423841 


AW753967 




ab:RC2-CT0304^08010(M>1 1-b1 2 CT0304 Homo saoi 


15 


408633 


AWS63372 


Hs,46677 


PRO20Q0Dmlein 


15 


427878 


C05766 


Hs.181022 




AO 


419945 


AW290975 


Hs.1 18923 


ESTs 


AO 


448372 


AW445166 


Hs.170802 


ESTs 


15 


407137 


T97307 




dKye53h05.s1 SoaresfetdlhmrsDtBen 1NFLS 


IS 


411274 


Nlil.Q02776 HS.G9423 


kalDHpeln 10 


15 


400666 






niv^uu£H^9.nu(iiu ocfHens iimuw niDwu^HuiBnia 


0.0 


426920 


AA3933S1 


Hs.132121 


ESTs 


AO 


402639 






Taiget Exon 


14 


454891 


AW837349 




flh'C(\/2-LT0038.2703Q0.1Qa-d1? 1 TQn3fl kkuno sari 


14 


412471 


M53193 


Hs.73946 


endofheG&l i^It nrauuih facter 1 fnlafatsUrf a 


14 


456296 


AA829976 


Hs.239114 


rmnnnsIflflSA trinha rinsA 1A nvni^fO 
iiioiinuaiuoaCi cupiiai inossi ti\ iiraiwd £ 


14 


450650 


T65617 


Hs.101257 


hvDothfilleal nmtein MGC339S 


14 


429274 


AI379772 


Hs.99206 


ESTs 


14 


430704 


AW813091 


Hs.335799 


ESTs 


14 


419807 


R77402 




nb'vi75f1 1^1 Soares nlamnfa Nh9HP Hnmo sanl 


14 


451778 


AI826131 


Hs.71243 


ESTs, Weddy sindlar to zinc finQer protein ( 


3.4 


430397 


AI924533 


Hs.105607 




14 


432113 


AAg35065 


Hs.152385 


ESTs 


3.4 


449722 


BE280074 


Ks.23960 


cvefln B1 


34 


455092 


BE152428 




ab'CMO-ifr0323'151299-126-b04 HT0323 Homo sad 


3.4 


418203 


X54942 


Hs.63758 


(^nn9n ntnfnln khuMtt 9 

\j%J\jt.%t piWlDUl NIMSttl c 


A4 


453775 


Nh4_002916 Hs,35120 


leoIicaBon Cador C (adfvslor 1) 4 /37kD1 


14 


417009 


AA191719 


Hs.314714 


ESTs 


14 


444743 


AA045648 


Hs.301957 


nudhc fnuefBosida dlohasohata tlnke^l maialv X 


14 


434206 


AW138973 


Hs.288516 


ESTfi Vtteaklv similar h)S6S850ffl{k»enindud 


A^ 


413219 


AA876200 


Hs.118727 


iKflmi otn^ioiia cunn ru uikk uB| mwio r i/tuccu 


A*f 


422487 


AJOIOgOl 


Hs.ig8267 


mucin 4k tracheobronchial 


A^ 


438993 
403274 


AA826995 




oh*nd77hflA Nf^ f^T^P Ov9 Hnmn RantpiK «f1MA 


^A 
Ah 








44 


435360 


AF105366 


Hs.4876 


fiflhiln mrnRf fninilv 19 fnniA<iKhimA!hlnriHfi 

SUIUW vCUIIOI ItUIUIJ M yWUKMUMIIfWHUIIUO 


Al 


426572 


AB037783 


Hs.170623 


By[WmPliyol |MV(Bill rLJ 1 1 IvJ 


OA 


432615 


AA557191 


Hs.55028 


ESTs. Wed(^ sindlar to 154374 gene NF2 prole 


14 


443462 


AK)84690 


Hs.171176 


ESTs 


13 


444910 


AI201849 




gb:qs76o04j(1 NCLCGAPJPr28 Homo sapiens cONA 


13 


426269 


H15302 


Hs.168950 


Homo sapiens mRNA; dm OKI^566A1046 (fiom c 


13 


449101 


AA205B47 


Hs.23016 


G pretdn-ooupled reoeptor 


13 


418994 


AA296520 


Hs.89546 


seiectin E (endolhelia! adhesion molecuto 1) 


13 


427660 


Ar741320 


Hs.114121 


Homo sapiens cONA: aJ23228 63. done CAE066 


13 


419751 


AW195561 


KSJ3121 


KiAA0761 protein 


13 


404782 






C7001692-:9]|67240g6Igb|AAF26844.1i (AF195Q21 


13 


415613 


R20233 




gb;yg18h1 1 j1 Soares infant brabi INIB Homo s 


3.3 


452198 


AI09756O 


Hs.61210 


ESTs. Weakly stiT^ to 138022 hypothetical p 


13 


406599 






TafQst ExDii 


13 


433001 


AF217513 


HS.27990S 


donsHQOSIOPROOdlOpI 


13 
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412879 BE092219 Ob:(L2*8T0734-24MO(MI71-804 610734 Homo sapl 

421107 AA283822 H&S5606 EST8.Wealdysbnnar to S47072 finger pratdn 

436985 M740946 Hs.150895 ESTs 

443303 AI220547 Hs.135223 ESTs 

432015 AL157504 Hs.159115 Homo sapiens (nI^cONADKFZpS86O0724 (frame 

447153 AAB0$2O2 Hs.315562 ESTs 

450769 AA057418 H&33G54 ESTs 

421379 Y15221 Hs.103982 small Inducible cytokine subfamily B(Cys-X^ 

402481 NK1.001821^Homosaptenschon]ideremia^(R 

459394 BE403894 Hs.126522 Homo sapiens, done MGC:16722.mRNA, complete 

424364 AW383226 Hs.201189 ESTs. Vfeddy similar to 001763 sbopMn-1 (H. 

417708 N74392 H&S0495 ESTs 

414869 AA157291 Hs.21479 ublnudeinl 

441690 1^1733 Ks.33106 ESTs 

414774 X02419 Hs.77274 plasminogen acGvatar, inoldnase 

412246 AI160873 Hs.69233 zinc finger protein 

412303 BE007967 Hs.155796 ESTs 

431958 X63629 Hs.2877 cadherin3,type 1,P-cadherin(placentaO 

451177 AI969716 H3.13034 ESTs 

409990 AA07g337 gb:zm95b09j1 Stratagene colon HT23 (937221) 

418462 BEOOISge Hs.85266 Integrin, beta 4 

424687 J05070 Ks.151738 matrix metaUoprotBinase 9 (gelailnase B, 92k 

405196 C2000662*:oli7512792|pirpi2482hypothet]cat 

459267 AJ003631 gb AJ003S31 Selected chromosome 21 cDNA Bbra 

432917 NM.014125 HS.279B12 PRO0327 protein 

448251 BE280486 Hs.84045 tiypothefcai protein RJ20288 

415025 AW207091 Hs.72307 ESTs 

420218 AW958037 Hs.286 ribosomal protein L4 

429594 AK001126 H$.210297 Homo sapiens cDNAnJ10268 lis. done HEMBBIO 

447762 AI939461 H8.161370 ESTs 

414147 BE091634 gb:IL2-BT073 1.240400069^ BT0731 Homo sapl 

445038 AI635444 Hs.143917 dJ467N11.1 piDtBin 

448866 Nlit014953 H1323346 KIAA1008 protein 

402800 Target Exon 

411263 BE297802 Hs.69360 kinesin^ike 6 (mitotic centromero-assodated 

424308 AW975531 Hs.154443 mtnteiiromoson\e maintenance d8iident(S. cere 

455203 AW865450 gb:PI\M^N002a010400mb09SN0020Homosapi 

459666 W27362 gb:30g7 Human retina cONA randomly primed sub 

401458 Target Exon 

432361 A1378562 HS.1S958S ESTs 

444008 BE544855 H1220756 ESTs. W^aldy similar to SFR4.HUMANSPUaNGF 

405336 Target Exon 

446563 BE326588 Hs.141454 ESTs 

449276 AW241510 Hs.252713 ESTs 

455838 BE14580B gb:MR041T0208-101299-103^11HT0208 Homo sapl 

420591 X53655 H8.99171 neurotrophln 3 

401486 C400064r:gi|4758508Iref|NP_00425ai| einvay 

432979 AA573263 Hs.120860 ESTs 

413833 Z15005 Hs.75573 centromere protein E (31 2kD) 

438325 AA804258 H5.123229 ESTs 

421751 AW813731 Hs.159153 ESTs, Moderately similar to S655578lplB.1<>a 

431938 AA938471 Hs.54431 spedTic granule protein (28 kDa};cystetn&f 

421777 BE562088 Hs.1061S6 HSPG037 protein 

4081 13 T62427 H5.194101 Homo sapiens cDi^ HJ20869 lis. done ADKA02 

427359 AW020782 Hs.79881 Homo 6aidenscONA:FU23008fis. done LN6004 

402337 Target Exon 

420930 AW8886S0 gb:CM4-NT0007-130500.551-fl)6»(T0007Homo8api 

436168 AK000883 Hs^1645 Homo sapiens cDNA FU10021 fis. done HEMBA10 

443426 AF098158 Hs.9329 diromosome 20 open reading frame 1 

437641 AAB11452 Ks^1911 ESTs 

414761 AU077228 Hs.77256 enhancer of zeste (Drosopliiia)homotog 2 

434208 T92641 Hs.127648 hypothetica] protein PR021 76 

433222 AW514472 Hs.238415 ESTs. Moderately similar to ALU8JHUMAN ALU SU 

452934 AA581d22 Hs.4213 hypothetical protein MGC16207 

458923 Y12812 H8.24422 regulatoiy (actor Xosodated prolan 

448988 Y09763 Hs.22785 gamma«nbiobutyite add (GABA) A receptor, ep 

439750 A1359053 Hs.57664 Homo sapiens mRNAtuDtengOi insert cONAdO 

422283 AW411307 Hs.114311 CDC45(cen division cyde 45, Sxerevlslae. 

445885 Ai734009 Hs.127699 KiAA1603 protein 

441962 AW972542 Hs.289008 Homo8a(denscDHA:FU21814 lis, done HEP010 

424653 AW977534 Hs.151469 cabMlcdmodufln^ependent serine protein k 

431322 AW970622 gb:EST382704 MAGE resequences. MA6K Homo sapl 

423934 1189995 Hs.159234 tbrkhead box El (thyrdd transcription factor 

455987 BE178323 gb:RC3-HT0600-240400423fl05 HT0600 Homo sapl 

421426 AA291101 Hs33020 Homo sapiens, done llMAGE:3939163.mRNA. part 

423887 AtJ080207 Hs.134585 Di07P434G232 pnteln 

408296 All 17452 Hs.4415S OKFZP586G1517proldn 

447815 AI432199 H3^47D84 ESTs 

441874 AI6B3782 Hs.128245 ESTs 

446474 AI301227 HS.1S0186 hypoOiefical protein DKFZp566K1946 

452166 AI948607 Hs.264680 ESTe 

451659 BE379761 H3.14248 ESTe 

439979 AW600291 Hs.6823 hypothe1fc9lpiotetoFU10430 
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447701 B^9626 H5.255527 hypofheltcal prelfiin MGC14128 3.1 

443848 A1085377 Hs.143610 ESTs 3^1 

414696 AF002020 Hs.76918 Nlemann-PickdisaaseiiypeCI 3.1 

449441 A15S6040 Hs,196532 ESTs. 3.1 

5 458145 AI239457 HS.1307W ESTs 3.1 

444588 A1221321 H5.167559 ESTs 3.1 

450832 AW970602 Hs.105421 ESTs 11 

449425 AW103433 HS.19S884 ESTs 3.1 

440953 AI883036 Hs.124135 HomosapIenscONAaJ13051 fis.clonet(T2RP30 11 

10 422156 110343 Hs.112341 protease inMbilor 3, sMiwleitved (SKALP) 11 

416120 H46739 gb7o14h02.s1 Soares adull brabt N2b5KB55Y Ho 11 

425081 X74794 Hs.154443 mIn]chn)nios(mi8 mabitoiarvie deficient (S. cere 11 

404107 Target Exon 11 

419553 AA526235 Hs.193162 Homo sapiens cONAOJ 11 983 fi3.dbneHEMBB10 11 

15 417168 A1133117 Hs.81376 Homo sapiens mRNA;cONADKFZp568L1 121 (from c 11 

435604 AA625279 H5.26892 uncharaclBiized bone manow protein 6M040 11 

442824 8E176065 Hs.144081 ESTs 11 

43S061 AI651474 Hs.163344 ESTs 11 

420589 AA419360 Hs.192708 ESTs 10 

20 434569 AI311295 Hs.8294 KIAA01 96 gene product 10 

420039 NM.004605 Hs.94581 sulfotransferase family, cytosoltc 2B, membe 10 

453883 AI638516 Hs.22630 cofactor required for Spl transcilpiiona) act 10 

402892 TaigetExon 10 

40S087 Target Exon 10 

25 411770 NM.014278 Ks.71992 heat shock protein (hspl 10 Canfly) 10 

45^87 AA031956 ob:zk15e04.s1 Scares jvegnanLutenisjNbHPU Ho 10 

435990 AI015862 Hs.131793 ESTs 10 

442577 AA292998 Hs.163g00 ESTs 10 

448733 NM-00S629 Hs.187958 sohite canter family 6 {neurotransmitlBrtre 10 

30 458154 AW816379 Hs.33S018 ESTs 3.0 

416809 N67253 Hs.271691 ESTs 10 

444946 AW13920S Hs.156457 hypotheScal protein RJ22406 10 

437938 A1950087 0t>Mt|OSoO2j(1 NCLCGAPJ<id12 Homo sapiens cON 10 

413281 AA881271 Hs222024 tiansct^ factor BMAL2 10 

35 434808 AF155108 H3.256150 Homo saptens. Similar to RIKENcONA 261002701 10 

445505 A19711S6 H8.148891 ESTs 10 

425005 AI565851 gb:tnO7g03j(1 Na_CGAP_Bm2S Homo sapiens cON 10 

435673 AF202961 H3.284200 Homo sapiens uncharacterized gastric protein 10 

432189 AA527941 gb:nh30c04.s1 NCLCGAPJ>r3 Homo sapiens cONA 10 

40 453509 AL040021 Hs.252674 ESTs, Wealdy similar to altemafively spliced 10 

455750 BE075114 gb:PM1-BTOS8S11020aOO3«11 BTOSSSHomosapl 10 

407777 AA161071 H&.71465 squalene epoiddase 10 

424441 X14850 Hs.147097 H2Ah{stone family. meniberX 10 

417734 Z42667 Hs.6724 ESTs 10 

45 449676 AW380579 Hs.203657 ESTs 10 

445425 AI223511 Hs.300722 ESTs 3.0 

427061 AB032971 Hs.173392 K1AA1145 prot^ 10 

433584 AW295399 gb:UM^*6l2-ahv4v034MIU1 Na.C6AP_Sub4 Ho 3.0 

444477 AI150548 H8.23155 ESTs 10 

SO 446255 AI283257 Hs.257090 ESTs 10 

400612 C10001034:gil7513113|(dil|T13076KIAA0992pro 10 

450841 AI741466 H5.270515 ESTs 3.0 

410561 BE540255 H9.6994 Homo sapiens cDNA:RJ22044 lis. done HEP091 10 

433871 W02410 Hs,20S555 ESTs 10 

55 401994 TargetExon 3.0 

449272 AW137656 Hs.197645 ESTs 10 

409703 KM.006187 Hs.56009 2'-7-on9oadenylate synthetase 3 (100 kO) 10 

400250 Eos Control 10 

40801S AW138771 H5.244349 epUemtatdifferentiatiOfl complex protein flk 10 

60 436414 BE264633 Hs.143838 WO repeat domain 4 10 

432220 AI571306 Hs.232224 ESTs 3.0 

420631 AA280824 Hs.190035 ESTs 10 

433644 AW342028 gb:hb75d01x1 NCLC6AP.Ut2 Homo sapiens cDNA 3.0 

447313 U92931 Hs.18081 Homo sapiens done DT1P1B6mRNA,CAG repeat r 10 

65 428262 A1651324 Ks.72g8 Upheny) hydrolase^ (serine hydrolase; br 10 

452778 R71330 Hs.5921 Homo 8apbnscDNA:FU21592fis, done COL07O 3.0 

417308 H60720 Hs.81892 K1AA0101 gene pioduci 10 

436281 AW411194 Hs.e5195 myetoid leukemia factor 1 10 

„ 433485 A1493076 Hs.201987 aldo^ceto reductase family 1. member C2(dihy 10 

70 423556 R72694 Hs.7720 dynein. cytoplasntic heavy polypepUe 1 10 

426890 AA393167 Ks.41294 ESTs 2.9 

436333 AA709270 Hs.136672 EST 2.9 

440006 AK000517 Ks.6844 hypothetica] protein FU20510 2.9 

402556 C1001383*:8l|538695|pifIIA61183 hypothetical Z9 

75 411098 AW817238 gb:Qy/lWTO247.090200-105-b07ST0247Homosapl 2.9 

435399 AA679463 gb'.ac50c03.8l Stratagene hNT neuron (937233) Z9 

431070 AW408164 KS.2491&4 transcription factor 19 (8C1) 2.9 

427988 N45214 Hs.282387 Homo sapiens cONA:FU218370s. done HEP01 6 Z9 

410658 AW105231 Hs.19a)35 ESTs 2.9 

80 434539 AW748078 H3,214410 ESTs,Weak^ysim^artoMUC^J^UMANMUaN2PR 25 

412279 BE245511 gb:TC8AP103235 Pediatric pi&B cell acute lym Z9 

405277 ENSP00000211621'-J<BrBBn,typeUcytoskelela 2Jd 

423733 AA330281 gb:EST33985Embiyo^ 12 week II Homo sapiens c 19 
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455319 


AW895387 




407638 


AJ404672 


Hs.334483 


413306 


AW303544 


Ks.1 18654 


432215 


AU076609 


Hs.2934 


434423 


NM.008769 Hs.3844 


446269 


AW2631S5 


Hs.14559 


412387 


AW94S964 




436148 


BE005252 


Hs.321583 


421190 


U9S031 


H8.102482 


404981 








AA147629 


Hs.301431 


452732 


BE300078 


HS.B0449 


401760 






443859 


Nk4_013409 Ks.9914 


404253 






432491 


AA662910 


Hs.42635 




AA954229 


Hs.114052 


429035 


BE549781 


HS.27047S 


446733 


AA863360 


HSZ6040 


446417 


AI299050 




HOIOOl 


AJ003029 


H5.65792 




BE393822 


Hs.29645 




BE093589 


Hs.38178 




W31489 


H5.95044 


439430 


AF1242S0 


Hs.6564 


434876 


AF150477 


Hs.61460 


436268 


AA782163 


HS.2935Q2 


401781 






439625 


AR)86453 


Hs.56611 


425234 


AW1 52225 


Hs.165909 


410743 


AA069474 


Hs.272153 


414915 


N^.002462 Ks.76391 




AI668594 


H8.176588 


443479 


AF027219 


Hs.9443 


442601 


A1684g69 


H5.46772 


405932 






405454 










H8.1200 






Hs.31141 


4nRR9Q 


NH.008a42 H5.48384 


414581 


AA2S6213 


Hs.72010 


411268 


AKD00512 


Hs.69388 


450024 


AA005129 






AI127076 


Hs.334473 


•wow 1 


AA379597 


H3.5199 


400631 


AF173937 


Hs.109494 


n£Sl to 


H20669 


Hs.35406 


444381 


BE387335 


Hs.283713 


jnera 

HOOI t 1 


AF226667 


H3.5B553 




AI538323 


Hs.52620 


A'iiR'Xi 


AU077025 


H&265827 




At620463 


Hs.293d84 




AW665452 


Hs.246503 


•liMIDc 


AA584062 


Hs,27279B 


■fvOZOI 






*MOIOI 


BE08902B 


Hs.20158 


409103 


AF2S1237 


Hs.112208 




AW386745 


Hs.214140 




BE281191 


Hs.155462 




AG0O477O 


Hs.4756 


•WOOfO 


AA2S0970 


H8.251946 


40c/ JU 


AAQ27952 


H3.165216 




Be064528 






AA742577 


Hs.303781 


•M0114 


AI467906 


H&8882 




BE587665 


H8.286550 


44U994 


A1160011 


Hs.193341 


•142290 


AI827246 


Hs.22439a 


422689 


AW856665 




41D910 


J0S931 


Hs.62192 


474inR 


AI754813 


Hs.146428 


433788 


At610534 


H3.1 61275 


403806 






437162 


Aismm 




453955 


AW579207 


Hs.304666 


420795 


AA323037 


Hs.128645 


452696 


AI826645 


Hs.211534 


432656 


NKL000246 Hs.3076 


438052 


AA776564 


Hs.41891 


441755 


AW450826 


Hs.127786 


427961 


AW293165 


H8.143134 


449765 


At225235 


Hs.288300 


450451 


AWS91528 


HsZ02072 



gb:QV4IIN0038-300300-157-c1D mm Homo sapi 

bypothBQcalpralein FU23571 

EST8 

ribonudeoOda reductase Ml polypepQde 
UM(iom^onIy4 
hypolhellcai RJ10540 
gb:OF\/O.ET0001-O5050O'228^ ET0001 Homo sapi 
Homo sapiens cONA aJ20779 fis, done C3OL0507 
mucin 5. subtype B, tracheobronchial 
ENSP00000252242*:Keralin, lype II cyloskeleta 
emtotheOa) zinc finger pfoteln Induced by tu 
Homo sapiens, done IMAG£:3535294. mRNA. part 
Target Exon 
follistatin 

NM.021058*:Hamo sapiens H2B tdstone fairaly, m 

hypoUtttlcat protein DKFZp434K2435 

ESTs 

ESTs 

ESTs, WfeaWy similar to (aHy add omega^iydr 
gbx|n14d12j<1 Na.CGAP_ljj5 Homo sapiens cDNA 
syntrophin, gamma 2 

Homo sapi^ mRNA; cONA OKFZp761 C029 (from d 

hypothetical protein FIJ23488 

ESTs, Wealc^ slmOar to 136022 hypotheOcal p 

cervical cancer antatrogen resistance 3 

Ig superfamDy receptor LNIR 

ESTs 

Target Exon 
ESTs 

ESTs. Weakly similar (o 138022 hypothetical p 
ESTs 

myxDvirus Cmfluenza) resistance 1 , homobg o 
EST8.Weal(fysimi]artoCP4YJHmMN CYTOCHi^OME 
zinc finger protein 202 
ESTs 

C1S000305:gi|380612%b|AAC69198.1| (Al^97887 

C1200054 1:gii57298841refjNP,006539.1| IGRI 

arachldonste 12-lip0]qfgena6e 

Homo sapiens mRNA for KIAA1568 protein, parti 

heparan n^ate (gtuoosamine) 3-(>5uiibtra(isf 

ESTs 

hypothetical protdn FU20S05 
fi^2h90h08/1 Soares_fetaLriver.Epleen.1NFLS 
hypothetica] protdn 0KFZp564O1278 
HSPC1 50 protein similar to ubiquitin^njugat 
secreted protein of unknown function 
ESTs, Hlglity similar to unnamed protein produ 
ESTs. Weakly simHar Id 864054 hypothelicail p 
CTP synthase 11 



interferon, elpha-indudbte protein (done IF 
hypotheOca! protein MGC1 3102 
ESTs 

hypotheticai protein FU20413 
NMJ)02864:Homo sapiens piegnancy-zone protein 
ESTs, WeaMy similar to S34159 transcription 
XAGE-1 protein 

ESTs, Weaidy similar to AlULHUI^ ALU SUBFAiyi 
minichromosome maintenance deficient (mls5, S 
flap structure-specific endonuclease 1 
poiy(A)-binding protein, cytopiasmk: l-Blce 
ESTs 

gb:RC4~BT031 1-250200^144)08 BT(»11 Homo sapi 

EST 

ESTs 

Homo sapiens cDNA: FU231S6 fis. done 1>1G096 
ESTs 

Homo sapiens cONA FU1 1469 fis. done HEMBA10 
gb:RC3^0297-29010041M03 CTQ297 Homo sapi 
coagulation factor 111 (thromboplastin, Qssu 
cdlagen, type V, alpha 1 
ESTs 

Target Exon 

gbiiomo sapiens mRNA; cONA OKFZp564C1072 (fro 
ESTs, Weakly sin^ to 178885 seitne/lhreoni 
sorting nexin 16 
ESTs 

MHC class 11 trensacSvator 
zinc finger 1111 
ESTs 
ESTs 

hypothetxa! protein faJ23231 
ESTs 
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406831 


N73448 


HS.5Q272 


411558 


AA102670 


Hs.70725 


439453 


BE264974 


Hs.6568 


413582 


AW295647 


H5.71331 




AI611378 


H5.192610 




AVra2S980 


Ks.138965 


410664 


NM.006033 Hs.65370 


444550 


BE250716 


H5.87614 


422109 


S73265 


Hs.1473 


4/17700 


BE514982 


Ks.38991 


445941 


AI267371 


Hs.172836 


'KfOI 19 


AW749511 


H$^1554 


413943 


AW294416 


Hs.144687 




AA477509 


Hs.89563 


iooqni 


AW628666 


Hs.98440 




BE247275 


Hs.151787 


4l^Zf «) 


U75679 


Hs,75257 




T639S9 


tb.228320 




NM_003816 Hs.2442 




AB026438 


Hs.177534 


40 dub 1 


BE079752 






AW579475 






W31518 


Hs.34665 




BE065813 




4^DJ4t> 


Nl^014642 Hs.169387 




U31116 


Hs,77501 




AL136877 


Hs.50758 




X99226 


Hs^153 




AA446869 


Hs.119316 




AW405731 


Hs.18498 




AI681545 


H8.152982 


*HWf DO 






444172 


6E147740 


HS.104S5B 




AA505665 


Hs.217493 




AA381209 






AAB53410 


Hs.93557 




AW084176 


Hs.223236 




AF052S73 


Hs^41517 


400101 


L36463 


Hs.1030 




H62953 






AW974175 


Hs.188751 


40904/ 


AW014873 


Hs.1 16963 


401003 


AW971949 


Hs.291252 


41 foao 


N78541 


Hs.177366 


4U00oU 


AF123050 


Hs.44532 


A^779A 


AW444828 


Hs.184323 


4U000U 


AK000093 


Hs.46821 


4o4zU£ 


AW17B363 




441 0D« 


6E614410 


Hs.23044 


Awnti 
**£cifO 


AF072B73 


Hs.114218 








44/f4o 


AI422023 


Hs.161338 


440*00 


AI079496 


K5.134169 


400/40 


AF075312 


Hs^GO 


440009 


BE091926 


Hs.16244 


4U0O/a 






4<40£d 


AW904466 


Hs.321197 


41040U 


AA5813B6 


Hs.73452 


440dM 


N69546 


Hs.44563 


4Ufco9 


AA135159 


H5^3349 




AI678812 




440*/0 


6E514434 


Hs.20830 


4i<£y// 


AA125910 


Hs.191461 


401 f£} 


AB0329g6 


Hs.268044 


4ifoor 


AF260257 


Hs.131917 


4iHB90 


AA032279 


H&.61635 


4104C»4 


D86980 


H3.79170 


4dC£cul 


Ai989961 


Hs^77 


4204oo 


AF231022 


H9.1S8159 


401230 






422056 


AA862231 


HSJ34443 


452747 


6E1S3855 


Hs.61460 


430152 


AB001325 


Hs.234642 


444008 


6E395085 


Hs.10086 


443500 


AV646388 


HS33981 


416030 


BE207573 


K5.83321 


445640 


AW969826 


H3J1704 


418869 


AW516565 




431686 


AA513906 




427579 


AA366143 


Hs.179669 


423175 


W27895 


Hs.18653 


414737 


AI160386 


HS.12S087 


451813 


NM.016117 HS27182 



EST8. WBsMy sindar b RSIAJIUMAN 40S RIBOSO 
gaiTtfna«idnobutyric acid (GABA) A receptor, pi 
Ihynoid honnono receptor InteradDr 13 
hypothetical piQl^MGCSSSO 
ESTs 

ESTs,VVee!ity8im!l8rtol38022hn)othet!^ 



EST8 

gastrin-releaslng peptide 
S100 catdunvbintfing protein A2 
ESTs 

ESTs, We^ similar to Al=133298 1 cytDchreme 
Homo sapiens cDNA RJ12981 fis. done NT2RP20 
nuctear cap tMng prot^ subunit 1, 80kD 
ESTs. WeaMy sin^ to 138022 hypotlieticd p 
U5 snlW-spedfic prolrin, 115 Id) 
stem-loop (tiistone} binding protein 
hypothetica] protein FLJ23537 
a disintegrin and metaDoproteinase domain 9 
dual spediid^ ptwsphgtose 10 
gb:RC&6T0627-140200-011>A04 BT0627 Homo sapi 
gb:RaH)T0076-1 10100031^110 DT0076 Homo sapi 
ESTs 

gb:RC2«T0318-110100^12-a08 DTD318 Homosapi 

KIAA0036 gene product 

sarcoglycan, beta {43icD dystrophin^odated 

SMC4 (structural maintenance of chromosomes 4 

Fanoon) anemia^ complementaScn group A 

ESTs 

Homo sapiens cDNA RJ12277 fis. done MAMKMIO 
hypothetical protein nj13117 
N}uL0010S9*:Honio sapiens tachykinin receptor 3 
ESTs, Moderately sbniter to 136022 hypothetic 
annexinA2 

gb:EST942S7 Activated Tk»!Is 1 Homo sapiens 
proenl(ephalin 

ESTs. Weakly similar to 138022 hypotheOcal p 
polymerase (DMA directed). Iheta 
rasinhOftDr 

gb:yr47f06j1 Soares fetal Gver spleen 1NFLS 

ESTs. Weakly shrilar to MAPBJ1UMAN MCRQTUBUL 

ESTs 

ESTs. Weakly slnnilar to ZN91 J1UMAM ZINC RNGE 

ESTs 

diubiqdtin 

ESTs 

hypothetteal protein FU20086 

gb:RC3.HT010W)109994H)24^06 HT0105 Homosapi 

RAD51 (S. cerevlsiae} homotog (E cdi RecA ho 

(rlzzted (Drosophila) homolog 6 

Target Exon 

ESTs 

ESTs 

Homo sapiens done HQ0262 
rolloBcsidndle cofledcoB rdaled protdn 
Target Exon 

POZ domain protein (DrosophUa InaO<like) 
hypothetk:al protein M6C10791 
hy|X)theticaI prolebi 

Homo sapiens cDNA FU12149 lis, done MAMMA10 
gb:hi59d08j(1 NCL0GAP_6as4 Homo sapiens cDNA 

kinesin-like 2 
ESTs 

KIAA117p protein 

letinlfls i^gmentosa GTPase regulator interac 
sb( transmembrane epitheOai an^en of the p 
K1AA0227 protein 

ESTs, Moderately slmDar to A Chain A, Cydop 
FAT tumor suppressor (Drosophila) homolog 2 
NM.014191*:Homo sapiens sodhim dtannd, vdta 
ESTs 

Ig superfamOy receptor IMIR 
aqu8porln3 

type I transmembrane protein Fn14 

Homo sapiens mRNA; cONA Ot^FZp667O095 (from d 

neuromedin B 

ESTs. Weakly simQar to KIAA0227 (H^apiens) 
gb3cq01d05jc1 Soares_NHCeC_cervicaLtumor Horn 
gb:ng67d)aji1 NCLCGAP_Up2 Homo sapiens cOKA 
hypothetical prot^ FU20637 
hypotheOcal protein FU14627 
ESTs 

phosphoOpase A2*acOvatEng protein 
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446659 AI335361 H5.226376 ESTs 2.7 

413833 AA251131 Ks^97 ESTs 2.7 

411819 AW947884 obJ>M14«T001&2003(XM)01-g08MT0010Hoino88pi Z6 

445592 AV654382 Hs.17947 ESTs,VVteakhr8lninarbT16534hypolheac8lp 2.6 

5 446102 AW168067 H5^52956 ESTs 2.6 

441408 AI733249 Hs.126897 ESTs U 

438027 AI864053 Ks.39972 ESTs, Weakly sbnQar to 138588 reverse transc 2.6 

416283 NM.005429 Hs.79141 vascular endotheSaigmwlh factor C 16 

421470 R27496 Hs.1378 annextaA3 2.6 

10 416658 U03272 Ks.79432 fibriIIhi2(cono8i^t^(antFactuialar^ 2.6 

428082 AM20683 Hs.98321 hypothefical protein FU14103 2.6 

431832 AW276866 Hs.192715 ESTs 2.6 

426698 AA394104 ^b.97469 ESTs 2.6 

433288 AI368873 H8^1257 ESTs. WeflMy similar to 138022 hypothetical p 2.6 

IS 4102S0 AA402307 H3.322844 hypothefical protein DKFZp564A1 76 ZS 

432865 A]753709 Hs.152484 ESTs. Weakly similar to 138022 hypotlieQcalp 2.6 

408690 AW864542 ob:PM4^N001&-120500mh02SN0016Komosapi Z6 

440048 AA897461 Hl32a737 ESTs, W^ similar to envelope protein |Ha 2.6 

401260 01001(01 W305041)ref|NP.038B76.1|efylh(o 16 

20 435136 R27299 H3.10172 ESTs 2.6 

412108 AA1002g3 Hs.185043 ESTs 2.6 

434442 AA737415 Hs.152626 ESTs 2.6 

443204 AW205878 Hs.2g643 Homo sapiens GONAFU13103 lis. ctoRsNT2RP30 Z6 

419667 AU077005 HS.922Q8 adtsintogiinandmetaltoprotdnasedomainlS 16 

25 422892 AA988176 H5.121553 hypothetica) protein aJ20841 16 

409594 AA076118 gbizml 8806^1 Stratagene pancreas (937208) Ho 2.6 

410008 AA079552 gb2m20h12.s1 Stratagene pancreas (93720^ Ho 16 

423038 D26628 Hs.123056 DEAD/H (Asp^Ala-Asp/His) tnx polypepBds 16 

405671 AA129547 Hs.285754 met proto«icogene(hepatocytB growth factor 16 

30 450983 AA305384 Hs.25740 ER01 (S. cerevisiaeKtke 16 

434444 AI76S276 Hs.101257 hypothetical protein MGC3295 16 

421817 AF146074 Hs,108660 ATP-IAicfing cassette, 5Ul>-f3milyC(CFTR/MRP) 16 

411465 AW847663 gb:llJ-CT0213-28010a<}56-F02CTQ213Hotiios8pt 16 

408625 AW243323 Hs.2667B5 ESTs 16 

35 412530 AA766268 H3.266273 hypothetical protein FU13346 16 

439245 NM-013381 Ks.6510 thyrotropin^eieasing hormone degraiSngectoe 16 

431890 X17033 Hs.271986 tntegrin.8!pha2(C0498, eIpha2subunnof 16 

414747 U30B72 Hs.77204 centromere protein F (350/400kO, mitosin) 16 

401050 NM»014155*:Homo8aplensHSPC063 protein (HSPC 2.6 

40 405897 Target Exon 16 

451153 BE092900 gb:CM2-BT0742<100400-147-h04 BT0742 Homo sapl 16 

407327 AA487162 Hs.269414 ESTs. Weakly similar to Z1 95 J1UMAN ZINC RNGE 16 

440159 AI637S99 H8.126127 ESTs 16 

404184 NM.030903*:HonK} sapiens olfactory receptor, f 16 

45 428552 AW274560 Hs.129520 ESTs 16 

401387 Target Exon 16 

428450 NHL014791 Hs.1B4339 KIAA01 75 gene product 16 

425698 NM.016112 Hs.159241 polycystic kidney disease 2-Gke 1 16 

424783 AA913909 Hs,153088 TATA box binding protein (TBP}«sociatedfac 16 

SO 449432 AW451361 Hs.196529 ESTs 16 

425082 809748 Hs.57987 a«eliaUlymphoma11B(zincfeigerpratoln) 16 

452281 T93500 Hs.28792 Homo sapiens cDNAFU11041fis. done PLACE10 16 

450705 U90304 Hs.25351 Iroquois homeobox protein 5 16 

421506 BE3027g6 Hs.105097 thymidine Idnase 1. soluble 16 

55 408391 AW&59276 ob:lltR1-CT03S2-240200.105<ft)2 CT0352 Homo sapi 16 

427099 AB0329S3 (te.173560 oddOz/teiHnhQmolDg2(DrosopMia,mouse) 16 

431750 AA514988 Hs.283705 ESTs 16 

451807 W52854 Ks.27099 hypotheScal protein HJ23293 Similar (oARl^ 16 

453331 A1240665 Hs.8895 ESTs 16 

60 447175 AI36S208 Hs.293608 ESTs 16 

451878 AI821027 Hs.8429 ESTs 16 

433790 BE2g821S Hs.2B8966 RAB22A, member f^S oncogene family 2.6 

418282 AA215535 Ks.98133 ESTs 16 

434557 AW855466 Hs.271886 ESTs. Weakty Similar to ALUIJiUMANAl^SUBFAM 16 

65 425704 U79293 Hs.159264 Human done 23948 mRNA sequence 16 

420894 AA744597 H5.8e854 ESTs 16 

435663 AI023707 Hs.134273 ESTs 16 

448037 AW195634 Hs.170401 ESTs 16 

418087 A11279S8 KsJ3393 cystatlnEm 16 

70 439524 BE542950 Hs.155548 ESTs 16 

402298 Target Exon 16 

424081 Ntl.006413 Hs.139120 ribonudeaseP(30kD) 16 

407471 055644 gb:Human spleen PABL(p6eudoautosoma] bounder 16 

430994 AA490346 Hs.40530 Homo sapiens, done MGC:1 7624. mR)^ complete 16 

75 419983 W55956 Hs.94030 Komo sapiens mRNA;cONADKFZp586E1624 (from c 16 

419699 AA246998 HS.173Q44 ESTs, Weakly stmSar to 138022 hypotheScal p 16 

423537 AL137279 Hs.130167 Homo sapiens mR14A;cONA01<FZp434O1214 (from c 16 

425415 M13903 Hs.157091 invoiucrin 16 

444826 AI674482 Hs.148441 ESTs 16 

80 413331 BE083950 gb:PMO«T0551>26020(M)01-b11 BT0651 Homosapl 16 

414987 AA524394 Hs.294022 hypothsQcat protein FU14950 16 

40S041 C3001708*30i)1345652tspiP1S9891CA38^CHiCKOOL 16 

413864 BE175562 stKf^C&4fn}580-100SQ(Ma2^1 HT0680Hom089pi 16 
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438746 AI885815 H5.184727 ESTs Z5 

438966 AW979074 gb:EST391 184 MAGE rese(|itences,MAGP Homo sapi 2S 

433365 'AF026944 Hs.293797 ESTs 2^ 

412723 AA64e459 Hs.335951 hypolh86cdpn>tebAF301222 15 

5 422656 AI870435 Hs.l569 UM homeobox protein 2 Z5 

411171 AWS20260 gb:OV2^T0296-15020CM)4a«10ST0298Honiosapl 7JS 

447197 R36075 gb.7h88bOU1 Soares placenta Nb2HP Homo sapl 2.S 

459688 072671 Hs.151250 bte]iceIhjlaradhe^moIecu)8S,tBlenceph8 15 

414883 AA926960 Hs.334d83 COC28 protein Kinase 1 2^ 

10 419741 NM.007019 Hs.93002 ufaiquiiin carrier pmtein E2-C Z5 

437412 6E069288 Hs.34744 Homo sapiens ntRNA:cDNADKFZp547a36 Oram d IS 

427368 AA885108 Hs.223806 TATA box I^Kling protein (TBP)-85S0ciat8dfiac 2^ 

455549 AW994222 gb:RC3-BN0036'2502(XM)12-«02BN0038KontO8api ZS 

409676 AA077118 Hs.197298 NS1-t)!ndjng protein 2^ 

IS 448305 AA62S207 Hs^915 Homo s^nscDNA ajl 2906 fis. done NT2RP20 2^ 

429413 NI\4.01405B Hs^1B77 DESC1 protein Z5 

424420 BE614743 Hs.146688 prostagiamfin E synthase 2^ 

427510 Z47542 Hs.179312 smaD nuclear RNAacOva1ingooinp1ex,pol]fpep 2^ 

452834 Ai638627 H5.105685 KIAA1 688 protein 2^ 

20 424354 NA4.014314 Hs.145612 RNAheGcase 2.5 

455095 AW85559B gb:CM1*Cr0278<)31199W-e08CT0278Homosapi ZS 

431241 AA496799 H5.36958 ESTs Z5 

424073 U03493 / Hs.138959 gap fundlon protein, alpha 7, 45)(D(connexln 25 

427239 BE270447 H3,174070 ul)lqui(in carrier protein 25 

25 407103 AA424881 Hs.256301 hypothetic^ protein MGC1 31 70 25 

458175 AW296024 Hs.150434 ESTs 25 

431130 NI^006103 Hs.2719 epididymis-spedficwhey-acidic protein type 25 

453379 AA035261 Hs.61753 ESTs 25 

438533 A1440266 Hs.170673 ESTs, Weakly sirraiar to T24832hypotheticaip 25 

30 412313 AW936832 gb:PM2-DT002a^04O04034i03OT0023HoniO8api 25 

421733 AL119671 Hs.1420 lit}roblast growth teior receptor 3 (adwndro 25 

443757 H05479 Hs.62314 ESTs 25 

449300 A)656959 Hs.222165 ESTs 25 

434913 AW872860 Hs.110S6 RAUP1 protein 25 

35 448946 AI652855 Hs^3363 liypothatical protein FU10963 25 

437327 AL353942 Hs.306504 Homo sapiens mRNA:cONADKFZp761l231 21 (liom 25 

450262 AW409872 Hs.184846 Homo sapiens, Simiiar to zInc finger protein 25 

453204 R107B9 Hs.191990 ESTs 25 

420170 U43374 Ks.95631 Human normdloFatkiocytBmRNA 25 

40 449344 AI640355 Hs.312691 ESTs 25 

439436 BE140845 Hs.57B68 ESTs 25 

449667 AI672379 Hs.122970 hypothetical protein FU21579 25 

452220 BE158006 Hs.212296 ESTs 25 

433675 AV\/977653 Hs.75319 fibomideoOde reductase M2 polypeptide 25 

4S 429163 AA884766 gb'^20a10.s1 Soares.NFU.T.GBC.Sl Komosapien 25 

441703 AW390054 Hs.192843 lexine zipper protein FKSG1 4 25 

415030 031118 Hs.191735 hypotheSca) protein MGC1 0520 25 

417975 AA641836 Hs^BS hypothelicdi protein FU23186 25 

451105 AI761324 gfa:wi60b11j[1 Na_CGAP.Co16 Homo sapiens cONA 25 

SO 400301 X03635 Hs.1657 estrogen receptor 1 25 

429386 AK001795 Hs^llTS hypothetical protein HJ10933 25 

423949 AI014546 Hs.130gi2 ESTs 25 

411768 NM.013371 Hs.71979 interteuldn 19 25 

436961 AW375974 Hs,156704 ESTs 26 

SS 431124 AF284221 Ks.5950S douUesex and matvS related Iranscriptionlac 25 

410878 AW809201 Hs.314248 EST8,Weaidysimi!artoAtaMJ1UMANALJUSUBFAM 25 

418717 A1334430 Hs.86984 ESTs 25 

443270 NM_004272 Hs.337737 Homer, neuronal Immediataearty gene, IB 25 

448454 NM_005879 Hs^1254 TRAP Interacting protein 2.5 

60 430072 X13294 H8.300S92 v-myb avian myetobtastosis viral oncogene horn 25 

442966 AI394036 Hs.132237 ESTs. Weakly simHar to DUS8.HUIi^ DUAL SPED 25 

451494 AI799444 Hs.247095 ESTs. Moderately similar to ALU7J{UMAN ALU SU 25 

418327 U70370 Hs.B4136 paired^ homeodomaintranscrfpfonlador 25 

440381 AA917808 Hs.190495 ESTs 26 

65 403983 Target Exon 25 

451340 AW336273 gbK}VO-DT0020-090200-107i)07Dr0020Hemosapi 25 

447888 BE620911 Hs.126889 ESTs 25 

441794 AW197794 Hs^53388 ESTs 25 

424153 AA451737 Hs.141496 MAGE-like2 25 

70 453633 AA357001 H5.34045 hypo9»licalpfotelnRJ20764 25 

435647 AI653240 Hs.49823 ESTs 25 

428780 At478578 Hs.50636 ESTs 25 

439108 AW163034 H3.6467 8ynaptogyrtn3 25 

422565 BE259035 Hs.118400 singed (Oiosophlla)-Gke (sea urdiin fiasdn h 25 

75 428054 AI94B688 HS.2G6619 ESTs 25 

418811 AKD01407 Hs.88883 hypothetical protein FU10545 25 

409893 AW247090 Hs.57101 tninichroniosome mainlenanca dafidBnt(8. cere 25 

443362 A1053464 Hs.166505 ESTs 25 

433183 AF231338 Hs.222024 transcription foctor Bh^ 25 

80 438214 H06076 Hs.26320 TRABtO protein 25 

446745 AW118189 Hs.156400 ESTs 25 

414416 AW40g98S HS.760B4 hypothetical protein MGCZ721 25 

426333 AW26g088 Hs.118183 tiypolhetical proton FU2Z833 25 
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TABLE 12B 



Pk^ UniquoEos prabssfitUenGEternumbor 
CATnumben Genocftistsr number 
Acoessbrc Genbank accession numbers 



Ptey CAT number 
407642 1007175J 



408391 
408690 
409594 
409695 
409990 
410008 
410049 
410784 
411098 
411171 
411465 
411559 
411619 
411958 
412279 
412313 



1055687J 

107490J 

114249J 

114876J 

116644J 

11681:L1 

1172307 J 

1221005.1 

1232093.1 

1234393.1 

1246768.1 

1249417.1 

1259748.1 

126717J 

1287332^1 

1288355.1 



412333 
412367 

412879 

413083 

413331 

413864 

414147 

414148 

415317 

415613 

415747 

416120 

416168 

416546 

417742 

418347 

418869 

419807 

420373 

420637 

420930 

422889 

423733 

423735 

423841 

425005 

429163 

431120 

431322 

431688 

432184 

432189 

432869 

433289 

433584 

433644 

434321 

435399 

437182 

437938 



1289037.1 
1291505.1 

1334272.1 

1348639.1 

1381726J 

1395788.1 

142127.-1 

142133L1 

1533847.1 

1540602-1 

155189.1 

1571266.1 

1574545.1 

1600181.1 

1696282-1 

174149.1 

179863.1 

18825^1 

193194.1 

195241.1 

107736J 

21989^1 

231476.1 

231498.1 

232507.1 

245908.1 

300543.1 

328264.1 

331543.1 

336609.1 

342677J 

342819.1 

355475.1 

36202.-1 

370400.1 

371919.1 

383473.1 

405576.1 

43421.1 

44573J2 



438390 45662.1 
438966 467436L1 
438993 467651.1 



Accessbns 

AW178963AVV845721BE141041AW806977AW845690AW845691AVV84S724AW17^ 

AW845693 AW845585 AWB45707 BE141053 BE141046 AW84557D AW845575 BE141(M3 BE141039 BE141070 AW062443 AW806980 AW845643 

AW806969 AW845686 BE141054 BE141040 BE141044 BE141042 AW84SS71 AW845604 6E141047 BE141071 AW082442 AWB45633 AW178968 

AW859276 AW859274 AW190959 T91463 

AW864542 AA056567 AW882724 

AA0761 18 AA975618 AA076220 

AA298961 AA296889 AA076945 AA077528 AA077497 

AA079337 AA079338 AW272100 

AA079552BE142625BE142527 

AW579475 AW939654 AW939655 

AW803201 6E079700 BE082940 

AW817238AW993g85AW993998 

AW820260 AW820332 R94408 

AW847663 AW847861 AW861080 

BE144081 BE144190AWB51155 

AW947884AW947918AW947888AW947883AW947897AW947910AW947905 AW864751 AW947878 

AA099020AW751275AW751276AW751289 

6E24S511 BE24$133AW935247 

AW936832AW936609AVV936657AW936611AVV936738AVW36734AVV936779AW93^ 

AW936740 AW936833 AW936777 AW936830 AW936834 AW936B29 AW936772 AW936638 AVyi936658 AW936636 AW936774 AW936778 AW936766 
AW936776 AW936831 AW936760 AW936ai9 
AW937485 AW937589 AWg37658 AW937654 AW937492 

AW945964AW946020AW946034 AW946027 AW946041 AW946044 AW94G033 AW946024 AW946021 AW946029AW946015 AWg46016AW346039 

AW946045 AW946028 AW946036 

BE092219BE092361 6E006789 

BE064528 BE064589 8E064S61 

BE083950 BE084017 BE084016 

BE175582 BE175514 6E175505 BE175591 6E175530 

BE091634 

BE084049 AW29^7 AA135984 
Z43388 F05453 R19673 R20275 H06917 
R20233F12901 T74740 
AA381209 AA381245 AA167683 
K45739 H51513H19779 
H23687 H46460H40239 
H62953 N76608 N72413 
R64719 Z44680R12451 
AA216419F03238AA229517 
AW516565AA229762AA230035 
R77402 AA262462 AA2S0g88 R06794 
AW958228AA259146W01465 
AW976163 AA278945 AA747691 

AW886650AW888651 BE1 49946 BE1 49948 BE1499S1 BE149947AW888649AA281840AA281822AW888652 

AWB56666 AA31 5006 AW954733 

AA330281 AA330232 AW962521 

AA3302S9 AA661805 AA502431 AW974633AA649496 

AW753967 AA370795 AA331 630 AW962550 

AI565851AA349556 R24798 

AA884768 AW974271 AA592975AA447312 

AA492588AA4g2498AA492571 

AW970622 AAS03009 AA502998 AA502989 AAS02805 T92168 

AA513906AA847734AI357044 

AW971125AA527731 N52655A)821508AA532420 

AA527941 AIB10608AI620ig0AA635266 

AW974094AA56g074AA602574 

AF005258 

AW2g5399 AW207772 AW300641 AW070290BE348854AW170383 AA600968AA778832 
AW342028 AA641080 AA603282 
AAB29368 AW849574 AW849673 
AA679463 AWB13779 AW813709 
AL080098 AL037472 AA432051 

A1950087 N70208 R97040 N36809 A!308119AW967677 N35320 AI251473 H59397 AW971573 R97278W01059AW987671 AA908598 AA251875 
AI820501 AI820532 W87891 T85904 U71458T82391 BE328571 n5102 R34725 AA884922 BE328517 AI219788 AA884444 N92578 F13493 AA927794 
AI560251AW874088AL134043AVV235363AAfi63345AVra08282AA488964AA2B3144AI89^ 
AIB72193AI763273AW173588AW150329AI853832AI762ra8AA988777AA488892AI356394AW103813AI539542AA^ 
AI961530 AW629970 BE61 2881 AW276997 AW513601 AW512843 AA044209 AW855538 AA180009 AA337499 AW981 101 AA251689 AA251874 
AI819225 AW205862 A1683338 AI858509 AW276905 A1633006 AA972584 AA908741 AW072629 AW513995 AA293273 AA969759 N75628 N22388 
H84729 K60052 T92487 A10220S8 AA780419 AA551005 WB0701 AW613458 AI373032 AIS64269 F00531 H83488 W371 81 W78802 R66056 AI002839 
R67840 AA300207 AW9S9S81 T63226 FO4005 

AM22017AI422945A1383249AI423113AI92S592A1420785AI208187A1423279AI42364SA1424090A13S983^ 
AW979074 AA834841 AA828SS0 
AA826995 AA834879 AI926361 
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439579 47404 1 AR)B6400 W79232 W73990 

440320 491930J AAB79»4 N67538 AI474541 

444910 624951 1 A]201&4g BE069007 AW946544 

446417 6763B4ll A1299O50 BE2S6910 

447197 711623 1 R3a)75 AI366546 R38157 

448599 n0766J AW860912A1540866 

449034 794617 1 AI624049AW1 17770 AI658360 

450024 62298 1 AA005129AA679084AA694399 

450G13 840016.1 AI702055 R89204 R86260 

451105 859083 1 AI761324AW880941 AW880937 

451153 85054 ? BE0929C0 AA01 5877 AA01 8521 

451340 66840~1 AW936273AW34035QAA017208 

454202 10SOS07 1 AW178383AW846011AW84S964AW845988AW64S977AW646002 

454241 1087B07J BE144668BE184942AW238414BE164S46 

454707 1230250 1 AW81 4989 AW81 4852 AW81 4608 

454891 1239217J AW837349AW837355AW882717 

454988 1248607 1 AW850140AW850195AW850192 

455091 1252939 1 BE07g752BE079688BE146989AW855532BE148818BE148815BE146796 

455092 1252971*1 BE152428AW855572AW85S607 

455095 12S2987"l AW855598AW855608BE148763BE148764AVV855645AW855615AW855596AW855610AW855601^ 

455203 1259973"l AWB65450AVV865119AVV865452AW865461 AVV885325AVra65114AW865116AW86^ 

455310 1278158~1 AW893g81 AW893998AW694034AW894019 

455319 1279172L1 AVV895387AW895547AW895564AW895323AVV895405AW695539AVV895538 

455355 1284681 1 AW948343AW948341 AW9028S5AW984737 

455549 1324698 1 AW994222AW994377 

455666 1349545 1 BE065813BEO65788BEO65889BE065832 

455750 1355998 1 BE075114BE075283BE075118 

455838 1374605 1 BE145808BE145807BE181883 

455987 1397735_1 BE1 78323 BE1 77978 

457405 333127 1 AA504860AA504911 

458829 773443_1 AI557388 BE158936 

459267 966605J AJ003631 M003850AJ003651 

TABLE 12C 

PkBv: Unique nuirtber corresponding to an Eos probesel ...j^^rvMA t 

Ref: S^uence source, Th97digiliuimbef5lnlhteoolunmafeGenbankldentifi^ TDunhamLetal/rrferstolhepuMcattonenliltedTTwDNAsequ^ 

human chromosome 22." Dunham 1. el sU Nature (1999) 402:489^95. 
Strand: Indicates DNA strand fiom which exons were predicted. 
Nt^posHion: Indicates nucleotide posHions of predlcledexons. 



Pltey 


Ref 


Strand 


NLposlfion 


400612 


9929646 


Minus 


151513-151662 


400866 


8118496 


Plus 


17982-18115.20297^56 


401050 


8117628 


Mnus 


7844^79425 


401137 


2547238 


Mnus 


598-1009 


401230 


9929527 


Minus 


33B3&-34006.3453&^2.36461-3874S.4e92S^9098,S2604-52758 


401260 


8076883 


Minus 


86008-86355 


401367 


9796198 


Minus 


145356-145807 


401458 


9187885 


Rus 


76485-77597 


401486 


7341763 


Phis 


32585^766.38281-36540.40791-40933.44018-44179 


401575 


7229804 


Minus 


76253-76364 


401747 


9789872 


Minus 


118596-118816.119119-119244.119609-119761,120422-120991130161-130381,130468-1305^^^ 






131932,132451-132575,133580-134011 


401760 


9929699 


Pius 


83126-83250,85320-85540.94719-9S287 


401780 


7249190 


Minus 


28397.28617.28920.29045.29135-29296J29411-29567,29705-297B7,3022«0573 


401781 


7249190 


Mnus 


8321S63435,83531-836563374O«3901,84237-84393.84955.85037.8629OW14 


401994 


4153858 


Minus 


42904-43124.43211-43336/44607-44763.4519945281,46337-46732 


402048 


8072512 


Plus 


4393&44078 


402298 


6598824 


Plus 


36758^7953 


402337 


6957691 


Phis 


4116-4286.16811-16973.17107-17256,19715-20040.22029^2205 


402481 


9797406 


Plus 


87891-68991 


402556 


9863723 


Ptus 


13579-14026 


402639 


9958129 


Minus 


20167-22383 


402800 


6010175 


Rus 


43921-44049.46161-46273 


402892 


8086844 


Minus 


194384-194645 


403274 


8072441 


Minus 


104069-104179,105683-105859 


403471 


9930659 


Minus 


85867-85983 


403763 


7229888 


Minus 


4357543887 


403806 


8140491 


Plus 


146390-146678 


403983 


8576059 


Mhuis 


82441-82701 


404107 


8099028 


Minus 


201699-202363 


404132 


6981900 


Rus 


11307-12434 


404164 


4581418 


Minus 


12652-13548 


404253 


9367202 


Minus 


55675-56055 


404440 


7528051 


Plus 


80430^1581 


404782 


9910094 


Minus 


15455-15589 


404959 


7407864 


Rus 


4524345388 


404981 
404996 


4432779 


Minus 


20626-20770,22513-22721 


6007690 


Plus 


' 37999.38145,38652-38998,39727-39872.40557-40574,42351-42450 


405041 


7547195 


Rus 


121230-121714 


405196 


7230083 


Minus 


135716-135651 



217 



wo 03/042661 



PCTAJS02/36810 



40S277 


3980473 


Plus 


23471-23572 


4052B1 


6133075 


Minus 


•Sii4n4 <>i4C4 4C4AJ 9eQM ^cii4<i JCJITC jtmi^CQCfl AnOtt-^ACt AORAtLAORKSi iflTOn AOUiA C|yM1J!A349 (!imj%JDRR7 <«dimSA7?Q 

34ZDZ-34351 ,351 94-35338,4541 M5475,4573i«4d3^ i-o«r» 


405336 


6094635 


Plus 


33267-33563 


405454 


7656675 


Plus 


133807-134053 


405545 


1054740 


Plus 


118677-118807.119091-119296,121626-121823 


405547 


1054740 


Plus 


1 24^1-1 Z4o20, 1 Z43 i4-129UaO 


405657 


4827303 


Mnius 


104132-104293 


405675 


4557087 


Plus 


70304-70530 


405708 


4156162 


Plus 


55030.55604 


405897 


6758795 


Plus 


59828^)535 


405932 


7757812 


Minus 


123525-123713 


405943 


6756796 


PkJS 


20605-20812 


406087 


9123919 


Minus 


7234-7626 


406092 


9123919 


Pius 


251370-251797^52168-252882 


406467 


9795551 


Plus 


182212-182958 


406554 


7711566 


Plus 


106956-107121 


406560 


7711569 


Minus 


35162-35292 


406599 


6248616 


Plus 


10933-11086 



TABLE 13A: 465 GENES UP-REGUUTEO IN CERVICAL CANCER COMPARED 70 NORMAL ADULT TISSUES. UKELY TO ENCODE EXFRACatUlAR OR CELL-SURFACE 
PROTEINS 

Tabts 13A Rsts about 465 genes up^ulaled In cen^ cancer compared to nomial adult tissues that are My to encode extracellular or cell-surface proteins. These were selected 
as for Tdble 12A, except that the ratio was greater than or eqpiai to 1 ,7. and the pereentHe value amongst cetvtca) cancels was greater than or equal 100 units, and the predicled 
protein contained a stnicbjrai dom^ that is imScaiva of extiaoelutar locallzaOon (e^g. Ig. fh3. egf. 7tm domains, signal sequences, Iransmentoie domains). The piedicted prot^ 
domains are noted. 

Pkey: Unique Eos proi)eset idenfilier numl)er 

ExAocn: Exemplar Accession mpb&, Genfaank accession number 

UnlgenelD: Unigene number 

PPOomalns; PredictBd Protein Domains 

Unigene TitletUnlgene gene Qlle 

R1: RaOooftumorto normal adult fissues 



42S650 
418007 



452240 
424046 
400289 
418345 
423017 
428227 
.447164 
414764 
416661 
427585 
406467 



QcAocn IMgenelD PPDomabis 

N]yL001944 Ha.1925 TM.cadheiin,Cadherin.CJenn 

M13509 Hs.e3169 SS4iemapexin,PepfidasQj410 

W79123 Hs.58561 TM.7lm_1 

AI591147 Hs.61232 TM 

AF027866 Hs.138202 SS.TM,serpin 

X07820 HS.22S8 hemopexin,Pep6daseJA10 

AJOOiegS Hs.241407 SS.TM,serpin 

AW178761 Hs.227948 SS^eipln 

AA321649 Hs^48 S$,TM.ILB 

AF026941 Hs.17518 TM»iBR 

AWD13687 Hs.72047 TM 

AA634543 Hs.79440 TM 

D31152 Hs.179729 SS,C1q.Cotldgen 
TM,efhand 

BE440042 Hs.83326 SS.PepQdaseJA10J}emop6xin 

418882 NM.004996 ti3.8S433 TM,ABGLP)embrBne^CLIran 

419247 865791 Hs.89764 TM,KHH}omain 

446232 AI281&48 Ks.194691 TM,7tni.3RibosofflaLL13 

424905 NM^002497 1^153704 TM.pkinase 

422420 U03398 Hs.1524 TM,tubuIIn.TNF 

427821 AA470158 Hs.98202 TM,7lnLl 

436211 AK001581 Hs.334828 AmmoniunLlransp 
444342 NM_014393 H3.10887 lamp 
422330 D30783 H3.1 15263 SS.TUEGF 
447342 AI199268 Hs.19322 SSJipocaDn 
407839 AA045144 Ks.161566 TM,cadherfn.Cadherin.Cjemi 
410153 6E311928 Ks.15830 Glycos_lmnsf.2 
414812 X72755 Hs.77367 SS,TM,IL8 
421773 W69233 Hs,112457 SS. 
413385 M34455 Hs.840 TM,IOO 

413753 U17760 H3,75517 SS.lamlnirLEGFJaminiruNtemi.adhjhort 

432239 X81334 Hs^936 SS.Peptidase.M10Jiemopex!n 

418663 AKD01100 H5.41690 TM,C8dt)eitn 

407366 AF026942 TM.tBR 

433091 Y12642 H5.3185 SS,TM,UPAILLY6 

408536 AW381532 Hs.135188 8S.TM.E1-E2J^TPase.C8BoiLATPase_C.N 

420440 NM-002407 Hs.97644 SRCRUteiogiobin 

437044 AL035B84 Hs.69517 TM 

405547 8S.TMMCjnemfarane,ABCJiani)g 

439223 AW238299 Hs:?50618 SS 

426320 W47595 Hs,1693W SS,TM,TGF-beta.T6Fbj)ropeptlde 
423834 AW959908 Hs.1690 TM 

426350 NM.003245 Hs.2022 TM.Tra(efilulanfti_aTransglulamiiv>l,TfB^ 
409744 AW67S258 Hs.56265 TM,meteIOiioJ(elch 
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UMgeneTIOa 

desmoglein 3 (pemphigus vulgans an^an) 
matrix matatloprotelnase 1 (interstitial 
G protein-coupled receptor 87 
ESTs 

serine (or cysteine) proteinase inhibitor 
matrix metalioprotelnase 10 (stromelysin 
serine proteinase inhibitor 13(Pi13; se 
serine (or cysteine) proteinase inhibitor 
small indudble cytoidne subfamily B (Cys 
Homo sapiens c^5 mRNA, partial sequence 
ESTs 

IGNImRNA-bimfmg protein 3 
collagen, type X, alpha 1 (Schmid met^hy 
Target Exon 

matrix met^oprotdnase 3 (stromdysi n 1 

ATP-bMing csssefte. subtly C (CFTR/ 

fragile X mental fetardaOon 1 

lednoic add induced 3 

NIMA (never in mitosis gene a)-re)ated Id 

tumor necrosis factor (l^and) supeifamU 

ESTs 

hypothelicai protein FU10719; KIAA1794 p 
sImDar to lysosomd-assoctated membrane g 



Homo sapiens. Similar to RIKEN cDNA 201 03 
ESTs 

hypolheSeai protein FU1 2691 
monbMne induced t^ gamma tnterferon 
ESTs 

lndoleamkiei>ynole 2,3 dioxygenase 
Mnin, beta 3 (nioein (12SlcD). kaTmin 
mabbc metaltoprotelnase 13 (coOagenase 
desmocoIHn 3 

gb:Homo sapiens dg33 mRNA, partial seque 
lymphocyte antigen 6 complex, locus D 
ESTs 

maiTKn^lobln 2 

cDNA for differenOaBy expressed C016 ga 
NM_016833*:Homo sapiens transporler 2, AT 
UL16 binding protein 2 
transfom^ growth (actor, beta 2 
heparirMiinding growth factor binding pro 
bansghitamlnase 3 (E polypeptide, protei 
Homo sapiens mRNA: CDNA DKFZp586P2321 ( 



R1 

43.6 
38.9 
28.8 
27.0 
24.5 
20,5 
20.1 
19.2 
15.9 
13.8 
1Z9 
1Z7 
12.6 
10.5 
10.2 
-9.4 
9.1 
8.9 
&9 
8.7 
6.9 
6.9 
6.8 
6.8 
6.8 
6.6 
6.5 
6.4 
6.2 
6.9 
5.8 
5.5 
5.5 
5.5 
5.4 
5.4 
5.2 
5.1 
5.1 
5.1 
5.1 
5.0 
5.0 
4.9 
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444461 R53734 Hs25978 TM 

410381 BE391604 KS.62G61 SS.TM.6eP 

423673 BE0O3O54 H3.1695 SSJM.P^a5e.M10.ltemopexin 

450375 AA009647 HsiSSO TM.dlsInte9riivPiBiLM12BLprope|^Rep^ 

401575 KA TM 

428484 AF104032 Hs.184601 TM 

425071 NM-013969 H8.154424 SS,TM.T4.dtiodinase 

431808 M30703 Ks.270833 SS,TM.EGF 

434699 AA643687 Hs.1494^ TM,Nudeo^e.tra2 

406687 M31126 Hs^72620 SS.Pept!dase_M10,heinope]dn 

404440 TM,MAG£ 

449228 AJ403107 Hs.148590 TM,PAF^p450 

444105 AW189097 Hs.166597 TUcadhertn 

409632 W74001 Hs^Z79 SS.sefp!n 

423515 AA327017 Hs.162204 SS.TM,UPAIU.Y6 

423738 AB002134 Ks.132195 SS,TM.tiypsIn.8EA 

423553 AA405635 HS.S6854 TM 

445537 AJ245671 K3.12844 TM,ras 



446889 


AK001898 


Hs.16740 


TM 


426536 


AI143139 


Hs.2288 


TM.efhand.Syndecan 


413801 


M62246 


Hs.35406 


TM 


429441 


AJ224172 


Hs.204096 


Uteroglobin 


409601 


AF237621 


Hs.80828 


TMjBairenfQtainefll,C2 


439238 


N47305 


Hs.46668 


TM 


446292 


AF081497 


H$.279682 


Antmonium^transp 


405545 






SSJM4)roteasQfmil9,ABCjnemb,tranABC.tfaa 


422938 


NM.001809 Hs.1594 


TM,thlolase 


423217 


NM.000094 Hs.1640 


SS.TM,1n3kVwa.ConageruKunil;cBPTI 


430686 


NiyL001942 Hs.2633 


SS,TM,C8dhefin,C8dheria.CJenn 


444707 


AI188613 


Hs.41690 


TMfCadhertn 


409582 


R27430 


Hs^1565 


TM 


408771 


AW732573 


HS.47S84 


TM, K_t8tra,lofX-lrans 


400441 


M15530 


Hs.99879 


TM.G-aIpha 


413278 


BE563085 


HS.B33 


TM,ublquiUnlandnbuG,lam'niiLfGFJ(azal 


426514 


BE616633 


Hs.170195 


SSJGFbjHopeptldeJGF-bete 


424927 


AW973668 


H3.153850 


SS 


408591 


AF015224 


Hs.46452 


SSiTHUterogtobb) 


407756 


AA116021 


H8.38260 


SS.UCH-1,UCH-2 


407137 


T97307 




TM,GDA1.C039 


411274 


NWL002776 Hs.69423 


trypsin 


400666 






SS,heinopexin,PepIid3seJA10 


412471 


M63193 


Hs.73946 


SS,TM,Glyoo$.transtJiCamjBcy1transf 


450550 


T65617 


Hs.101257 


TM 


451778 


A)826131 


Hs.71243 


ig 


430397 


AI924533 


Ks.105607 


SS.TM 


449722 


BE280074 


Hs^3g60 


TM.cycHn 


422487 


AJ010901 


Hs.198267 


TM,vwl 


449101 


AA205e47 


Hs.23016 


SSJM,7tnLl 


416994 


AA296520 


H5.89546 


SS,TM.IecQnjc.sushi,EGF 


.421379 


Y15221 


H8.103882 


SSJMJLS 


414774 


X02419 


Hs.77274 


SS4(i1ngl6.tfypsIn 


431958 


X63629 


Hs,2877 


SSJM,CadheriiLC.term,cadh8rm 


418452 


BE001596 


Hs.85266 


SS,TM,integrln3«fn3 


424687 


J05070 


Hs.151738 


SS,PeptldaseJA10,fn2«hemopexin 


401488 


NA 




SSiTMibypsIn 


408113 


T82427 


Hs.194101 


TM,7bn.3RibosomaU.13 


427359 


AW020782 


Hs.79881 


TM,7lm_1 


452934 


AA581322 


Hs.4213 


SS,TM,I9 


448988 


Y09763 


K&22785 


SS,TM 


439750 


AL358053 


H5^664 


TMilntegrin^,RlcInJBJecllnrnn 


414696 


AF002O2O 


Hs.76918 


SS,TM,PaIched 


435604 


AA625279 


Hs:{6892 


TM 


453883 


AI638516 


HS22630 


TM.Ets,SAM-PNT 


448733 


NM.005629 HS.1879S8 


SS.TM.SNF.ABC_tranMlh.p)(Inase.DSPc.RibosomaL 


444946 


AW139205 


Hs.156457 


SSJMiabhydrelase 


437938 


AI950087 




TM.hlstone.ig.MHCJ 


424441 


X14850 


H8,147097 


TM,htstone 


427051 


AB032971 


Hs.173392 


TM 


409703 


NM.006187 Hs.56009 


SS 


447313 


U92981 


Hs.18081 


TGF^ 


431070 


AW408164 


Hs^49184 


ABC tran 


446269 


AW263155 


H3.14559 


TM 


421190 


U95031 


Hs.102482 


TM,yvwJ 


452732 


BE30007e 


Hs.80449 


TM 


443859 


NNL01340g Hs.9914 


SS,ksal 


446733 


AA863360 


Hs^6040 


TM,p450 


449746 


AI668594 


Hs.176588 


SSj)450 


418844 


M62982 


Hs.1200 


SS.TMJlpoxygenase.PLAT 


414581 


AA256213 


Hs.72010 


TM.Cam.^c^tasf.ChoIIn8jun3se,SC01-SenC 


431629 


AU077025 


Hs^5827 


SS.(RNA^8nOSH2.SH3,pUnase 


445873 


AA250970 


Hs^1946 


SS.nm.PABPp)dnase.14^,[nn 


438113 


AI487908 


HS.B882 


TM,7lm-1 


410310 


JQ2931 


H8^192 


SS.TM,T1ssueL^ 
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ESTs>VVtoMyslmnarb210926a\BceDg 4.8 

guanylale binding proteiR 1. inteffiBnMV-i 4^ 

matrix melfiltoproleinasa 12 (maoopiiage e 4.8 

Q d^tegrin and metalloprotidnase domal 4.6 

Target Exon 4.6 

sohitD canter famQy 7 (caiionlc amino a 4.5 

deiodinase.iodothyron{ne.typell 4.4 

amptilregulln (sc^wannoma•d8rtved growth f 4.3 

Honrto sapiens cONA aJ11980 fis. done HEM 4.3 

pregnancy epecHic beta-1-0|yoopn}tein 9 4.2 

NMl021048:Homo sapiens melanofflaanflgen, 4.2 

protein lelatfid with psoriaste 42 

ESTs - 4.1 

serine (or cysteine) proteinase bihlbttor 4.1 

^STs 4.1 

airway trypsln-like protease 4.1 

ESTs. Wealdy similar to DYULHUMAN CYTOPL 4.1 

EGF-iike<!omaii^ miriUpIe 6 40 

hypothetical protein FU11036 4.0 

visinin-Gke 1 3.9 

ESTs. Highly similar to unnamed protein p 3.9 

iipophilin B (uleroglot^ln famfly member), 3.9 

iteralin 1 (ep!dermoty& hypertteretosls) 3.6 

ESTs 3.8 

Rh type C glycoprotein 3.8 

(MDfVrAP)(TAP2) 3.8 

centromere protein A (1 7kD} 3.7 

collagen, ^pe VU. alpha 1 (epidenndlysi 3w7 

desmogldn 1 3.7 

desmocollln 3 3.7 

ESTs 3.6 

potassium voltage^ated channel, delayed- 3.6 

B^l growth factor 1(12i(D) 16 

interferon«{imulated protein. 15 IcOa 3.6 

bone morphogenetic protein 7 (osteogenic 3.6 

hypolhetica) protein C321D2.4 3.6 

mammaglotnn 1 3.5 

utsiquilin specific protease 18 3.5 

gb:yeS3h05.s1 Scares feldl Ihrer spleen 1 3.5 

ItailiiaeinIO 3.5 

NM.00242S:Homo sapiens matrix metaitoprot 3.5 

endothella] cell giowtti factor 1 (platele 3.4 

hypothetical protein MGC3295 a4 

ESTs, Weakly similar to zinc linger prote 3.4 

bicarbonate transporter related protein 1 3.4 

cydin B1 3.4 

mucin 4. (racheobronchlal 3.4 

G protein-coupled receptor 3.3 

seledln E (endothelial adhesion motecute 3.3 

small inducible cytokine subfamily 6 (Cys 3.3 

plasminogen activator, urokinase 3.3 

cadherin 3, type 1 , Rcadherin (placental 3.3 

integrin.beta4 3.3 

matrix metditoprotelnase 9 (geiaHnase B, 3.3 

C4OOO647*:0i|47585O8(reqNP.OO4253.1|8ir -'3.2 

Homo sapiens cDNA: FU20869 fis, done AO 3.2 

Homo sapiens cDNA: FU23006 Us. done LN 3.2 

hypothetical protein MGC16207 3.1 

ganvna-amlnobutyric add (GABA) A receptor 3.1 

Homo sapiens mfU4AfuiI length insert cDNA 3.1 

Niemann-Pick disease, type CI 3.1 

uncharacterized bone marrow protein BM040 3.1 

oofactor required for Spl transcriptional 3.0 

sohite canler family 6 (neurotransmitter 3.0 

hypothetica] protein FU22408 3.0 

gbnivq0Sd}2.x1 NQ_CGAP.KU12 Homo sapiens 3.0 

H2A histone family, member X 3.0 

K1AA1145 protein 3.0 

2'-5'-oilgo3daiyiate synthetase 3 (100 kO 3.0 

Homo sa]^ns done 0T1 P1B6 mRNA, CAQ repe 3.0 

tFanscriptionfadDr19(SCl) Z9 

hypolheticSI protein FU10540 2.9 

nnudn 5, sidriype B, tracheobronchial 2.0 

Homo sapiens, done 1MAG£:3535294. mRNA. 2.9 

fbllistatin Z9 

ESTs, WeaUy similar to fatty add omega- 2.9 

ESTs. similar to CP4yjfUMANCVT0CH Z9 

arachldonate 124ipQxygenase • 2.9 

ESTs Z9 

Interferon, alpha-lndudbie protein (don 2.8 

poiy(A)-blAdIng protein, cytoplasmic l-Q 2.8 

ESTs i8 

coa g i rt a C on factor 111 (thromboplastin, t .* 2.8 
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AA10SB70 


H170725 




U75679 


Hs.75257 




NM.014642 HS.1B9387 


433345 


AI581545 


Hs.152982 




AW084176 


Hs.223296 




U8463 


H5.1030 


SUIMOU 


AF123060 


Hs.44532 


422278 


AF072873 


Hs.114218 




BH)919^ 


Hs.16244 


4TDcdU 


AA581366 


H5.73452 


407287 


A1678812 






AA12S9f0 


Ks.191461 


400298 


AA032279 


Ks.61635 


4294aJ 


AF231022 


Ks.158159 


430152 


AB001325 


Hs.234642 


444000 
410009 


BE395085 
AV\616S65 


Hs.10086 


4lD00o 


U03272 


Hs.79432 


41lu9U 


AA402307 


Hs.322a44 


4l900f 


AU077005 


Hs.92208 


4U0D/I 


AA129547 


Hs.285754 


A'^AAAA 
404444 


AI765276 


Hs.101257 


4iel0lr 


AF146074 


Hs.108660 


40l09U 


X17033 


Hs^1986 


4!>tZ0l 


T93500 


Hsi8792 


4^I0UD 


BE302796 


Hs,105097 


400001 


AI240665 


Hs.8895 


J471Q7 


R36075 




409000 


U72671 


HS.1S1250 


40/4u 


BE06928d 


Hs^744 


429413 


NM_0140S8 Hs.201877 


4&44/U 


BE614743 


Hs.146688 


4Zr2oa 


BE270447 


Hs.174070 


4Q7i04 


AA424881 


Hs^56301 


4oi (OU 


NM_006103 Hs^719 


40oJr9 


AA035261 


Ks,61753 


4z1/0o 


AL1 19671 


Hs.1420 




BE158006 


Hs^12298 


41 /y/s 


AA641836 


Hs.30085 


44U00I 


AA917808 


Ks,190495 


441 fS4 


AW197794 


Hs^53338 


4091 vO 


AW163034 


H3.6467 


AM AM 

401100 


NA 




40D00O 


AW269920 


Hs.2621 


4oU1/9 


6E301708 


Hs.233955 


415621 


AI648602 


Hs.55468 


402745 






407758 


O5091S 


Hs^8365 


457570 


AAS79426 




429574 


BE268321 


Hs.208912 


431211 


M66849 


Hs.323733 


452665 


AI924W6 


Hs.119567 


420511 


AF052692 


Hs.98485 


437897 


AA770561 


Hs.146170 


437B46 


AA773866 


HS244569 


418432 


M14t56 


Hs.dS112 


4ooiUo 


AI471795 


H5.287776 


4oo4uO 


AI192987 


Hs.61784 


435542 


AA687376 


Hs.269533 


4j4oi/ 


AA635690 


Hs.337251 


431630 


NM.002204 Hs.265829 


4Z2oiO 


AA316622 


Hs.98370 


441954 


AI744935 


H5.8047 


416091 


AF295370 


^te.283082 


429359 


W00482 


Hs.2399 


409402 


AF208234 


Hs.695 


432284 


AAS32807 


Hs.105822 


408243 


Y00787 


Hs.624 


423229 


AC003965 


H3.125532 


408713 


NM.001248 Hs.47042 




AI824113 


Hs.78281 


429929 


AB014583 


Hs.226275 


439963 


AW247529 


Hs.6793 


428953 


AA306610 


Hs.194676 


439398 


AA284267 


Hs.221604 


440371 


BE268550 


Hs.80449 


452203 


X57522 


Hs.158164 


407811 


AW190902 


Hs.40098 


432078 


BE314877 


Hs^4553 


429113 


D28235 


Hs.195384 


452755 


AW138937 


Hs.213436 


428434 


AW363590 


Hs.65551 


429922 


Z9763D 


Hs.226117 



SS.TM 

Tlt^ig.pkinase 

TM.SCAhmni.1 

SS 

TM 

TMMVPS9 

TM,ubiquitin7tm_3ANFjeceptor^usW 

TM.Frf2zIed.Fz 

TM 

TM^,PlATmWS(XLRRCr.GPSPMP22.Claud5n 

TM/ascadherin 

T6F4)ete 

TM 

E6F,cadherin,lambi{iLG 

SS.TM,M1P 

SS.TM 

TM,RasGAP.IQ,WW 
SaTHEGF.TB 

SS,TM.SemaiT16Plaxbuepeat 
SSJM,dishtegriaRepra^Pep.M12Bj)^ 
TM.pkina5e,PlBxIruepeatS«na,TIG^lM 
TM 

TM.AB(XJtoi,ABCjn8mfarBne,Rhombaid 

vwa.F&GAPjntsgriiLA 

TGF-beta,TGFbj)rope|)lId8 

TM.TK 

TM,dtsintegrinpep.M12Bj)ropep,RepxQlysin 

TM.SOF 

SS,TM.Ig 

TM.ABC_lran^jnembran6.Rhoinbdd 

trypsin 

TM.MAPEG 

TM.UCLcon 

TM.cNMP_bindlngt[ypsin 

SS,TM.wap 

PAN,krin9te,tryp5in 

SSJM^,pkirase 

TM.lntegfinJ^FaGAP 

SS,b7psin 

TM 

7W 

SS,TM 

TM,vwd 

TM,cy8ta6n 

TM,GlyoGLlransL11 

TMjiistDn8,Secl8ugar.tr 

SS,THEGFJdUecepUttViogIobuBfLl 

8S.TM 

TM 

SS,TM 

SS,TM,connexin 

SS,TM.PMP2^Claudln 

SS^THconnexin 

SSipioJsonieraso 

TM 



TM 

pWfiase,FurirMikB,Recep_^_domalh 

pkinase.RhoGEF,lg,PH,SH3 

TM 

SS,TM,FG^.lntegrirLA 

SS.THfh3,l94}ldndse,Ra)0$oflfiaU^.p45^ 

TM.8andJ^\A.C3dc48JI 

SS.TMlOefens{nJ)eta 

8S,TM,Peptid«eJ410.h8mopodn 

TM,cystaOn 

TM^ase 

SS,TM.IU 

SS,trypsin 

GDA1^C039 

R6S,GoLjocoJ^O 

TM 

TM.p45PEts 

SSJM.TNFRjc8,arf,Stathmin.DEAD 

SS 

TM 

SS,TM.ABCjbsivABCjn8inbrane 

SS 

TM 

SS.TM.EGF 



gamma-aminobutyric acid (GABA) A receptor 

stsnvloop (histone) Ending protein 

K1AA0036 gene product 

hypothetic^ protein FU13117 

ESTs. Weakly simitar to 138022 hypothefic 

rB3bhit)(lQr 

diubiquinn 

fttzzfed (DrosopMla) homolog 6 
ndtolic 8{»indteooiIeikoD related preta 
hypotheOcal protein MGC10791 
gbAi59d08j(1 NCLCGAP.Gas4 Homo sapiens 
ESTs 

Six transmGmt)rane epithena! anGgen of t 
FAT tumor suppressor (Drosophila) homolog 
aquaporinS 

type I transmemtirane protein Fn14 
gb3cq01d05Ji1 So9res_NHCeC_ceMcaLtunior 
fibrilltn 2 (congenital contractura! arac 
hypothetical protebi DKFZp584A176 
a disintegdn and metallopnotelnase ddmaf 
met proto-oncogena (hepatocyte growth fac 
hypothetical protein MGC3295 
ATP.l)lndlng cassette, sul>-famHy C (CRR/ 
Integrin, alpha 2 (C049a alpha 2 sulxinit 
Homo sapiens cDNA FU11041 lis, clone PLA 
thymidine kinase 1,80hJt}te 
ESTs 

gb.7h88t)01.s1 Soares placenta Nb2HP Homo 

Intercellular adhesbn molecule 5, leienc 

Homo sapiens mRNA; cDNA DKFZp547C136 (fro 

DESC1 protein 

prostaglandin E synthase 

ubiquttin canter protein 

hypotheGcal protein MGC13170 

epIdidymis«pecHic whey-acldic protain 

ESTs 

fibroblast growth fxtor receptor 3 (acho 
ESTs 

hypotheticat protein FU23166 

ESTs 

ESTs 

synaptogyrtn 3 

C12001233.-gl|730536l(rBqNP_0386521{ oto 

cystaSnA(8t8tinA) 

nypoth8lic8fprolelnFU20401 

ESTs 

NM-002508:Homo sapiens nidogen [enx)&r\) 
KIAA0125 gene product 

flb:nf37c09.8l NCLCGAPJW Homo sapiens c 
hypotheScal protein M6C861 
gap junc&on protein, beta 2, 261(D (oornie 
ESTs, Weakly similar to A47582 &<eti gro 
gap luncfion protein, beta 3, 31lcD (oonne 
hypotheScal protein FU22989 
esophagus cancer-related gene^ 
Insutin-On growth factor 1 (somatomedin 
vaniDoid receptor-cBlated ostmacaHy ao 
hypctheliesi protein FU14451 
ESTs * 

hypolhe&af protein M6C2487 
integrin, ^ha 3 (antigen CI349C elplia 3 
cytochrome P450, subfamily IIS. po^pU 
Fanconi anemia, comptementadon group 0 
dsfensln,beta3 

matrix metalloproteinase 14 (membran&to 

cys(atInB(steSnB) 

ESTs 

imerieuidno 
protease, serine, 26 



TM,linkBU»slone7biLl 



regulator of G^Hotein signalling 12 
KtAA0683 gens product 
pl^telet-acGvating factor acelylhydralas 
tumor necrosis factor receptor supetfamil 
ESTs 

Homo sapiens, done IMAGE:3535294, mRNA. 
transpofter 1, ATP-bbufing cassette, sub- 
cystE^ lowt superfamOy 1, BMP anlagoni 
hypothetical protein FU12541 similar to 
prostaglandirvendoperoxide synthase 2 (pr 
ESTs, Weakly similar to A34087 hypothetic 
Homo s apte ns. Similar to DNA sepnenl. Chr 
Hf histone family, member 0 
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TM 
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SS 
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TM 

TM 

TM 

TM,WD40,pro_lsomerase 
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TM,7tm_1 

SS.TM,DSUEGF 
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TM 
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TM 
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TM,GiycoJqfdR>_2 
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TM.GNS1_SUR4 
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8S.TM.DSLEGF.NUDtX 
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TM 
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lyin(dK))D]dnbet8ffiCBptor(TNFR6up6rf3iTii 2.2 

HCR(a4ieIUccRe(kQ0rodh«nQb9U8) 2.2 

bnmortafizaSoTHipregulated protein 2JZ 

pttosphoribosytglycbiaml^foniiyttranslera 2.2 

acetyl LDL receptor. SREC 2.2 

5r4 onoobtal tioptKriilsst gtycopiQtsn 2.2 

soluts canfer family 3 (activators of di Z2 

ESTs, WeaMy slmHar to 138022 hypothetic 12 

ESTs Z2 

minidvomosome maintenance d8lictent(S. 2.2 

HMT1 (hnRNP methyitransferase, 8. ceievls 2.2 

artemin 2.2 

TONDU Z2 

hypotheUca] protein DKFZp564O0462 Z2 

mltogeivactivated protein Idnase Idnase k Z2 

hypotiteCcd protein FU20561 " Z 1 

glycoprotein (transmembrane) nmb Z 1 

COC20(oeiI division cycie 20. S. ceievls Z1 

serine protease bitilt)itori Kazal ^fpe; 5 2.1 

Homo sapiens cDNA:FU23006fis. done LN Z1 

delta (DrosophilaHOcel z\ 

cydlrvdependent kinase 2 2.1 

FOS-Dkeanflgen-l 2.1 

hypothetical prOtdn DKFZp434l1930 2.1 

procollagen &endopepfkiase enliancer 2.1 

mutated In ootoredai cancers 2.I 

protein tyrosine phosphatase, receptor ty 2.1 

hypothetical protnnFU23399 Z^ 

transmembrane, pros^ androgen induced 2.1 

Inhlbln. beta A (activin A. activin AB al 2.1 

ESTs zt 

NM.007196:Homo sapiens kaflikrein 8 (neur 2.1 

Homosaplens, Similar to RIKENcDNA 57305 2.1 

guaribie nucieoQde binding protein (G pro 2.1 

stratllin z^ 

metallocafboxypepfldase CPX-1 2.1 

potassium ciiannel, subfamily K. member 1 Z1 

CGI-09 protein 2.1 

collagen, type V, alpha 2 2.1 

genn ceil specific Y-box binding protein 2. 1 

prdactin receptor 2.1 

SHC (Src honnology 2 domain^ntaining) tr Z1 

smaO Inducible cytokine A5 (RANTES) Zl 

ATPase, Ca transporting, type 2C, member Z1 

gbntv35f03.n NCI.CGAP^BrS Homo s^ens c Z^ 

putative secreted iigand homologous to Q ZQ 

hypothefical protein M6C2991 ZO 

sema domain, knmunogkibulin domain (Ig), ZO 

ESTg ZO 

EST& . 2.0 

pyrimkflnergic receptor P2Y, (Hnotein co 2.0 

bone morphogenetic protein 1 2.0 

squamous cell carcinoma antigen recognize 2.0 

K1AA0450 gene product 2.0 
offiactory receptor, MIy 2; subfamily I " ZO 

ndciofibrillaF-associatBd protein 2 2.0 

kaHlkreln 12 2.0 

ESTs 2.0 

tumor necrosis factor (Hgand) superfamil ZO 

ESTs 2.0 

cartilage bitenmedlate layer protein, nuc 2.0 

Homo sapiens mRr^cONADKFZ^i64O1016(tr 2.0 

ESTs 2.0 

RAD1 (S. pombe) homolog ZO 

Homo sapiens. Similar to RlKENcDNA 11 100 2.0 

gb:MR0-Hn)06&O81tg9^4)08 HT006S Homo ZO 

cadherin, EGF LAGseveniJass G-type recep 2.0 

gapJuncSon protein, beta 5 (connexin 31 2.0 

tumor necrosis factor (iigand) superMi 2.0 

gb:RC1-NN0063.100500^H:08 NN0063 Homo 2.0 

survhralofmotorneuronl.teiomerfc ZO 

homoiog of yeast k}ng chain potyunsaMl ZO 

DKFZP564i1922 protein ZO 

zinc/Iron regulated transporter-ilke ZO 

jagged 2 ZO 

hypotheOcal protein 60m EUROIMAGE 17593 ZO 

mitochondrial rtbosomal protein L17 1.9 

ESTs, Highly slmliar to unnamed protein p 1.9 

collagen, type III, eipha 1 (Ehlers-Danlo 1 .9 

ESTs 1.9 

Homo sapiens, ckme IMAGE:3544662. mRNA, 1.9 

ESTs 1.9 
smaOWUdblecytoklnesubtamiiyBCCys • 1.9 
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432169 YQ0971 Hs.2910 TM,Pii)OsynFBn phosphoiibosyl pyropho splMte s ywOirtase 2 1.9 

441128 AA570256 Hs^828 Th^ras EST8.WfeaMys!nillartoT23273hypothefic 1.9 

447160 AA330310 Hs,24181 TM ESTe 1.9 

419138 U48508 HS.89B31 TM,RYDR.rTPR,RyR.SPRY ryanodine receptor 1 (skeJetal) 1.9 

5 457817 AA247751 H5.79572 TM.heinopexin,PepaJase.M10 caltepsln D (lysosomal esparty! protease) 1.9 

431009 BE149762 Hs.48956 8S.TM^nnGxin gap juncBon proton, beta 6 (connexin 30 1.9 

428957 NM-003881 Hs.194679 SS.TM,wrc,IGFBP.lsp_1 VVNT1 Inducibte sigriaHrig pathway protein 1.9 

416546 AA224827 TM,vwa^G-GAPM3itnJV gb3ic32g04.s1 NCI_CX3AP_Pr2 Homo sapiens c 1.9 

400749 SS,TMMjecepLAlh3^jecepLb NM_003105*iiomosajtienssofiaMelated 1.9 

10 408369 R38438 H5.182575 f=-protein solute earner (amily 15 (H?77transport6 1.9 

422765 AW409701 Hs.1578 TM.BIR bactl!oviidtAPrepeat«m»alnlno5(8urv 1.9 

417409 BE272508 Hs.82109 TM.Syndecan syndecani 1.9 

407720 AB037775 Hs.38002 TM,calponln,CH KIAA1355 protein • 1.9 

41M30 BE513731 Hs.88959 TM,COP^ILP_transf.MCM tiypolhettea! protein MGC481 6 1.9 

15 434759 AA648884 Hs.134278 TM,COP-OHJP_transfMCM Homosaplens cDNA RJ 1 2676 fis. done NT2 1,9 

421593 NM.017436 Hs,105956 SS.TM gtobotH80sylcer8mide/C077 synthase; Gb3/ 1,9 

426064 BE387014 H9.166146 TM.WH1 Homer, neuronal Imniedistaearty gene. 3 1.9 

404604 NA TM Target Exon 1.9 

422753 AI928995 Hs.157S SS,TM.Sm smaB nuclear ribonucteoprot^D3 polype 1.9 

20 422739 H20106 Hs.119591 SS.Ct^ledaptorjs adaptor-related protein con^ 2. sigma 1.9 

433066 NM_00&456 Hs.288215 SS,Pi1bosyltran siaiyttransferase 1.9 

419594 AA013051 H3.91417 TM topoisomerase (DNA) II binding protein 1.9 

428168 M98447 Hs.22 THTiansg)utaiiiln.(;TranssUtambUiTr8nsg^ transglutaninaseUK polypeptide epidenn 1.9 

428343 AL043021 HS.1270S TM.RhQmboid.HMGLbo)tTPR ESTs 1.9 

25 429592 AB029041 Hs.209646 Troponin KIAA11 18 protein 1.9 

431620 AA126109 Hs.264981 C2,PH,RasGAP,KrPJtransU Z-5'-oligoatenylate synthetase 2 {69-71 1.9 

424670 W81215 Hs.116651 ig epithelial V-Uke antigen 1 1.9 

428373 Anr51656 H$.183986 SS,TMJq poliQvinisfeceptor-feiatBd2(herpesviru 1.9 

453449 W16752 Hs.32981 SS.tg.Sema sema domain, fanmunogtobuilndom^flg]. 1.9 

30 432304 AA932186 Hs.69297 TM.7tm^1 ESTs 1.9 

432673 AB028859 Hs.278605 TM.DnaJ,DnaJ_COnaJ DnaJ (Hsp40) homolog. subfamily B, member 1.9 

416207 NM_014745 Hs.336433 SS.TM.zH)HHC Homo sapiens, done MGC:2908,mRNA,compl 1.9 

408988 AL119844 Hs.49476 TM,P1exiiuepeatSema,tspJ Homo sapiens dona TUA8C>hdiKh8tregio 1.9 

417426 NM.002291 H8.82124 SS,laminin.EGF.Iamin!n.Ntenn laminin. beta 1 1.9 

35 443883 AA114212 Hs.9930 SS.TfyUerp]n.Mar^ serine (or cysteine) proteinase bihibAor 1.9 

433326 AW2981S9 Hs.23644 SS.TM ESTs. Weakly similar to S65824 reverse tr 1.9 

419981 AA897581 H3.128773 TRSkLSno ESTs 1,8 

420931 AF044197 Ks.100431 SS.TM,IL8 8manilKludMecytoHrwB8uMBinlly(Cy8 1.8 

415023 AA932146 Hs.133494 TM,Ra)osomaLS17l^lbosomalJL13 Homo sapiens done TCCCIA00164mRNAseque 1.8 

40 413644 BE154910 Hs^78793 TM.GIyco.hydroJ ESTs, Weflkly similar to Z195_HUMAN ZINC F 1.8 

449987 AW079749 Hs,184719 TM>VBC.tran,ABCjnentbra ne ldcUrans ESTs, WfeakiyslmBarlDALULHUMAN ALU SU 18 

421340 F07783 H3.1369 SS.sushI decay acceterafing factor for complenient 1.8 

417866 AW067903 Hs.82772 SS,TM,Concoen,COLR.TSPN cotlagen. type XI. alpha 1 1.6 

430259 BE550182 Hs.127826 TM,lransmembrane4RasGEFJ^ RalGEF-iilffl protein 3. mouse homolog 1.8 

45 432998 AA835948 Hs.153307 TM.SDF ESTs 1,8 

431671 NM-016937 Hs,267289 TM,NA polymerase (DNA directed), aM»a 1.8 

411773 NM_006799 Ks.72026 trypsin protease, serine, 21 (tesfisln) 1.8 

425247 NM_005940 Hs.155324 SS,TM,PeptIdase.M10,hemopejdn matiixmetalloprotBlnasell (stiomelysin 1.8 

422976 AU076657 Hs.1600 TM.cpn60_TCP1,5ema diaperonin containing TCPl.subunit 5 (ep 1.8 

50 4^159 NMJ0O4341 Hs.154868 SS,TM,GATase.<n€ace,CP$a5e.ljDhahi.OilvdiQoro caibamoyHihosphate synthetase 2. asparta 1.8 

447776 AIS2S62S Hs.130181 RdnJJadin UDP-H.ecetyl^ha4>«a!actosamine:pdype 1.8 

426908 AW815163 Hs.172851 SS,TM,fusion^JMy06irLteaadhLShort ar^e,typetl 1.6 

408116 AA251393 Hs.269052 TM,Na.CaJExCamjBcyttransf Homo sapiens. Similar to RIKENcDNA 54304 1.8 

417847 AI521558 Hs,7331 Utarogiabin hypothetical protein FU22316 1.8 

55 415791 H09366 Hs.78853 SS.TM.UNG uiadH)NAg»ycosyla8e -1,8 

41^903 A1287341 Hs.154029 TM.ubkpiitlhJaniMiLGJam!i^^ bHlH factor Hes4 1.8 

422511 AU076442 Hs.117938 TM.p450 collagen, type }CVII.8lpha1 1.8 

414117 W88559 Hs.1787 TM.ion.trans»IC,tetra proteoQpld protdn 1 (Peiizaei»MefZbach 1.8 

426841 AI052358 Hs.193726 TM.asp ESTs 1.8 

60 415272 AA164215 Ks.203188 TM.TPR.pMn85eJb,B56 ESTs 1.8 

428440 BE3827S6 Hs.169902 TM.su8ar.lr,R)fOead sdute canter family 2 (fadUd^ 1.8 

419488 AA316241 Hs.90691 FGF nudeophosnMudeoplasmln 3 1.8 

418462 BE379749 H835201 SS,TM,lectin«c 04ype (calrium dependent, carbohydrates 1.8 

431363 M86528 Hs.266902 SS.NGF neurotrophIn5(neurotrophfai4/5) 1.8 

65 440975 AW499914 H8.7579 8S.TM hypotheOcfil protein FU104O2 1.8 

438962 BE046594 TGF.bela,bZIP ' gb:hn41c1lJ(1 Na.C6AP_PDF2HonK> sapiens 1.8 

414602 AW630088 Hs.76550 5S Homo sapiens niRNA;cONADKFZp564B1264(lir 1.8 

418054 NH.002318 Hs.83354 TM,initojcarr.Lysytaxldasa lysytQ}adase4ike2 1.8 

440501 AA887391 Hs.202229 TM,GalactosyLT ESTs 1.8 

70 449309 AW589823 Hs.224189 TM ESTs 1.8 

421461 AW291023 HS.972S5 TM,LysyLoxidase,SCP2.6and.7 ESTs, Weakly similar to A46010X-rtnkedr 1.8 

412584 X54870 Hs.74085 TM,lecUn.c DNA segment on chromosome 12 (urrfque) 248 1.8 

441565 AW953575 Hs.303125 TM p53-induced protein PIGPC1 1.8 

431837 T79326 Hs.326553 TM,7tm-3>WFjeoeptDr,susW olfadory receptor, tamily 2, suWantily I 1.8 

75 438251 BE515065 Hs.2g8585 SS,Yj)hosphalaseTlG nudeolar protein (KKE/D repeat) 1.8 

448633 AA311426 HS.2163S TM.EGF,l3mIni(LG.fibrtnogeiLC,F5_F8_type.(;tubuOn tubulin, gamma 1 1.8 

424291 AL120051 Hs.144700 TM.Ephrin.Hlstdeacetyl ephrin-BI 1.8 

415388 APOieOBI Hs.78409 SS.TM.TSPN.Co(]a8en cdlagen, type XVill. alpha 1 • 1.6 

435550 A1224458 Hs.4934 TM.IJW.LRRCT Ksaidens pdyA site DNA 1.8 

80 448568 AA149121 Hs.71947 TM,IJ«CT ESTs 1.8 

439246 AI498072 Hs.77783 SS.THREI.PUT,PKO,W8C.lRRGT,GPSPMP22„Qa membrane^ssodated tyrosine- and threoni 1.8 

410001 AB041038 Hs.57771 SS,TM.trypsin kdOkrelnll 1.8 

417312 AW888411 HS.B191S SS,StathiTlln leukemifressodatedphosptiapntebp18(s - 1.8 
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4441S2 AI125694 Hs,149305 TM hypoOieJical protoh MGC2e03 1.8 

453454 AWO520O6 Hs.8551 TM PRP4/STWWD spOdng factor 1.8 

449320 AB030835 Hs,23476 SS.adenytateWnase Opl-tateracfing rinc finger piot^ 1.8 

428329 AA426091 Hs.98453 T^4,G8H)indJecen ESTs, Moderately similar ta R27328 2 IH.8 1.8 

5 452875 BE275760 K5.30928 TM»Apo5poprotBWg ONA segment on chnomosome 19 (unique) 117 1.8 

444031 BBZ71513 Hs.25303 TM^SdaseJ410.hemopexbt hypoOieQcal protein FU1 3154 18 

443534 An}76123 TM gb»y92e04jc1 Soares.fetaLQver.spteetuI 1.8 

413313 NVL002047 Hs.75280 TM,WHEP-TRS,7bnJZ glycyMRNAsynftelase 1.8 

452874 AK001061 Hs.30925 SS hypothetical prtteln FU10199 1.8 

10 453140 AA032238 Hs.170531 TM ESTs 1.8 

418641 BE243138 Hs.86947 SS.TM^isfntegfln^JM12Bj)ropep^Repfe)ysin adisintegiinandrnetalloprDlBlnasedomai 1.8 

432925 AA878324 Hs.192734 SS ESTs 1.8 

453857 AL080235 Hs^61 TM DKFZP586E1 621 protein • 1.8 

457663 AW371946 Hs.337459 TM ESTs 1.8 

15 452873 AK001247 fe.30922 TM hypolheticel protein FU10385 1,8 

436396 AI683487 Hs.152213 SS.wnl wlngtess-typeMMlVlntegraBonsilefamB 1.8 

452835 AK001269 Hs.30738 TM hypothetical protein FU10407 1,7. 

459847 R34107 Hs.198287 ig pregnancyspeclficbeta-l.fl!ycoprDlein11 1.7 

418245 AA08B767 H5.83883 TM.PEPCK trensmentae, prostate androgen Induced 1.7 

20 448484 BE613340 K3.334725 TM Homo sapiens, Similar to RIKENcONA 94300 1J 

431369 BE184455 Hs.251754 SS.w^ secretory leukocyte protease inhUtor (a 1.7 

434877 AW974792 Hs.292171 TM ESTs 1.7 

428923 BE047698 H&18878S TM ESTs 1.7 

402915 NA TM.HC03 cotransp 0(SPOOOOO2O2S8r£Icartnnate transporter- 1.7 

25 420185 AUM4056 Hs.158047 TM ESTs 1.7 

445739 AW136354 Hs.145303 TM ESTs 1.7 

409435 AI810721 Hs.95424 TM.p450 ESTs 1.7 

408688 AI634522 Hs.152925 TM iaAA1268 proleh 1.7 

420085 AI741909 Hs.44680 TM hypothetical prolain FU20979 1.7 

30 433933 AI754389 Hs.133494 TM.Ril)OSOmal.S17Ribosoma!.L13 Homo saptens done TCCCIA00164mRNA8eque 1.7 

430965 AA489732 Hs.154918 hormonejecProgjeceptor^ ESTs 1.7 

414703 BE243877 Hs.76941 SSJM.Na^KATPaseE2F_TDP ATPase, Na? transporting, beta 3 polypepl 1.7 

423464 NM_016240 Hs.128856 TM,Collagen CSR1 protein 1.7 

416737 AF154335 Hs.79691 SS,TM.UM,PDZsugar.tr.POZ.UM UM domain protein 1.7 

35 409012 AL117435 Hs.49725 TM.RhoGEFzf4)HH(;adluBhort DKFZP434t216 protein 1.7 

423804 AW403448 Hs.1706 TM,IRF Interferon^thnuiated transcription (iado 1.7 

410418 D31382 Hs.63325 SS,TM.lrypsin,IdUeoepU transmembrane protease, serine 4 1.7 

440028 AW473675 Hs.125843 TM E8T8.V\teddy*nilar to T17227 hypothetic 1.7 

457646 AA725650 Hs.112948 TM,SPRY ESTs 1.7 

40 445439 BE243084 Hs.12719 SS.TGIMwta regulator of nonsense transcripts 1 1.7 

420426 AA26204S Hs.36567 TM,GalactosyLTJATP-5ynLC Homo sapiens cDNAaJ14227 6s. clone NT2 1.7 

431341 AA307211 H5.251531 TM,prote8Some proteasome (prosome, macropain) subunil 1.7 

412338 AA1515Z7 Hs.69485 TM,SBma.RexlnjepeatT1G,Ple)drUBpeat hypothetical protein FU1 2436 1.7 

414799 AI752416 Hs.77326 SS,lhyroglobuDn.VGFBP lnsufin4ike growth factor binding prolel 1.7 

45 452700 AI859390 Hs.288940 TM.DIX.RGS.ttiioied five-span transmembrane protein M83 1.7 

430877 NM-005269 Hs.2693 6ST_C;iRNA«ynLl.WHEP-TRS.TGF-beta gHoma^sodated oncogene homoJog (zinc 1.7 

428624 At125222 Hs.98712 TM,fiiiored,YjihosphatBSB,MAM^,fh3MSP.doma!n hypothec protein DKFZp434H0311 17 

444065 AW449415 Hs.10260 TMjon.trais Homo sapiens cONAFUl 1341 fis. done PLA 1.7 

416319 AI815601 Hs.79197 SS.TMJg CD83 antigen (activated B lymphocytes, Im 1.7 

50 429367 AB007867 Hs278311 Sema,Pte]d(uepeai.T1G ptexlnBI 1.7 

430425 AAS314ffl Hs.241412 TM apoHpoprotBlnL,2 1.7 

441668 AI611973 Hs.l27S25 THAmmonlunLbansp ESTs 1.7 

418469 U34879 Hs.85279 SS,TM,adh.^ hydroxysteroid (174)ela) dehydrogenase 1 1.7 

450835 BE262773 Hs.25584 TKArfiSap hypolheticai protein FU10757 1.7 

55 418859 AA229558 TM flbnclSdlOsI NCLCGAPJV1 Homo sapiens c "'1.7 

425304 AA463844 Hs31339 TM,ig,ITAM flbroWastgioiBlh factor 11 1.7 

423835 X65019 Ks.130181 TM.Rida3Jecfin UDP41-acetyt4!phfr(VgSiacto88n[4ns:polypo 1.7 

414620 AA371931 Hs.77422 TM.iQn^trans.UM.Synaplophysin proteolipid protein 2 (colonic epIMum 1.7 

440654 AW014242 Hs.159998 TM.connexIn ESTs 1.7 

60 412276 8E282621 Hb.73798 SS,M]F macrophage mlgreSon inhibitory factor (g 1.7 

422087 X58968 Hs.111301 SSJ>ept]d3se_M10.fh2.heniopexin mabfa(metaitoprote!nase2(ge!aQnasaA, 1.7 

407151 H25836 Hs.301527 SS.TNF ESTs, Moderately similar to unknown pisa 1.7 

410726 AI623859 Hs.15936 TM.PX ESTs 1,7 

452012 AA307703 Hs.279766 TM,kinesin kinasin family member 4A 1.7 

65 433627 AF076866 Hs.2842g6 TM,SURF4,SURF1,DEAOIipOCalin Homo sapiens cONA:FU22993 lis. done KA 1.7 

409220 BE243323 H8.51233 TM.death,TNFRj« bsnor necrosis factor receptor superfamO 1.7 

427082 AB03785B H3.173484 TM.mito_carr hypothetkal protein FU 10337 1.7 

426410 BE2g$446 Ks.305890 TM,6d-2.BH4 6a2-0ke1 1.7 

433598 AI762836 Hs.271433 TM,CytWytyItransf,SIR27tm»2 ESTs, Moderately similar to ALUZ^HUMANAL 1.7 

70 438495 6E25e948 Ks.2g0874 TMMradillojseg ESTs. WeaMy similar to ALUSJiUMAN ALU 8U 1.7 

422032 AA476966 Hs.1 10857 TM,TFI]S,RNA.POU4_15KDseipin.hormone_rec,zKM polymerase (RNA) ill (DMA directed) pdyp 1.7 

429736 AF1 25304 K3.2t2680 SS,TNFrLc6 tumor necfosts factor feceptcrsuperfamil 1.7 

427600 AW6309ie Hs.179774 Tiyi.Transgiutamin_CTrans0hjtanntnJN,TrBnsSluLcore proteasome (prosome. macropain) adrvator 1.7 

„^ 431981 AA664069 Hs.115779 lamlnln_BJaminin_E6F ESTs 1.7 

75 407738 N41744 Hs.19978 TM.Sulfalaso CGWO protein 1-7 

420187 AK001714 Hs.95744 TM hypotheficai protein similar to ankyiln r 1.7 

424620 AA101043 Hs.151254 SS.TM.trypsin kaHikrein 7 (chymobyptk; stratum come 1.7 

430488 D19589 Hs.13453 TM hypotheficai protein FU 14753 • 1.7 

„^ 423393 R37772 H$^1420 TM.Ihtonsdpkinase p21-aclhrBtod protein kinase 6 1.7 
80 444051 N48373 Hs.10247 SSJg 
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TABLE 138 

Rcey: Unique Eos probesstUenOSsfntinber 
CAT numbsn Gsno dustsr number 
Accession: Genbank accession numbers 



Pkey 

408344 
418S46 
418859 
418869 
437938 



CAT Number Accessions 



438962 
443534 
447197 
454197 



455333 
457570 



105240.1 
1766nj 
179717J 
179863J 
44573J 



4873S0_1 
572957J 
711623L1 
1050392J 



12810MJ 
357443J 



AA053843 BE162213 
AA224827 T59708T59843 BE156903 
AA229558 AA345492 AA229582 
AW51 6565 AA229762 AA23003S 

AI950087 N70208 R97D40 N36809 AI308119 AW967G77 N35320 At251473 H59397 AWg71573 107278 W(n059 AW967671 AA908S98 AA251675 

A1820501 AI820532 W87691 TBS904 U71456 T82391 BE328571 T751Q2 R34725 AA884g22 6E32B517 A1219788 AA884444 N92578 F13433 

AA927794 A)5602S1 AW874068 A1134043 AW235363 AA663345 AWD08282 AA468964 AA2831 44 AI8903S7 AI950344 Ai741346 AI689062 

AA282915 AW102898 AI872193 A1763273 AW173586 AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AWIOSBIS AI539642 

AA642769 AA856975 AW505512 AI961530 AW829970 BE612881 AW276997 AW513601 AW512843 AA044209 AW8S6S38 AA180009 AA337499 

AVV961101AA251669AA251874AI81922SAW205862At683338AIB58509AVV276905AI633006AA972^ 

AA293273AA989759 N75628 N22388 H84729H80052T92487AI02205BAA780419AAS51005VV60701AW61^ 

K63468 W37181 W76802 R66058 AI002839 R67840 AA30O2O7 AW959561 T63226 F04005 

BE046594 BE046667 AA828585 A1207343 

AI076123Ai244834AI69S239 

R36075AI366546 R36167 

BE140966 BE140981 BE140987 BE141Q06 BE140985 BE14(»70 6E141669 BE141G53 BE141664 BE141655 BE141661 BE141660 BE140969 
6E141673 BE141650 BE141674 6E141550 6E141688 AW178241 BE140994 6Et41666 8E140996 BE141 008 BE140988 BE141011 BE140975 
8E141667 BE141675 BE141657 BE141681 BE141656 BE141672 BE141 680 AW1 78237 BE141012 BE14O990 BE141658 BE141648 BE14101d 
BE141668 BE140973 BE141004 BE140953 BEU0984 BE141009 AW178232 BE141007 BE141649 AW178293 BE140993 AW176233 6E141646 
BE141005 BE141691 6E141000 BE141652 BE140965 BE141562 6E140960 6E140962 6E141001 BE140978 AW178229AW1 78239 BE141671 
AW178230 BE141547 AW17823S BE14t663 BE141549 BE140998 BE141003 AW178236 BE141002 BE141556 
AW897851AW897852 
AA579426AA579436AA573736 



TABLE 130 

Pkey: Unique number conesponding to an Eos probeset 

Raft Sequence source. The 7 dlgtt numbers In this cdumn are Genbank tdeniifier (Gl) numbers, tkinham I. et ai.' refers to the pubMon enStled The DMA 

sequence of human chromosome 22.* Dunham 1. et a)., Nature (1999) 402:489495. 
Strand IndicatBsDNA strand from whtehexons were fHedicted. 
Ntj^iton: bidieatBS nucleotide positions of predicted emms. 



Pkey 


Rd 


Strand 


Nt_posi6on 


400666 


8118496 


Ptus 


17982.18115.20297-20456 


400749 


7331445 


Minus 


9162-9293 


401103 


8568122 


Minus 


98330-98449 


4014B6 


7341763 


Rus 


32585^756.36281-36540.40791-40933.4401644179 


401576 


7229804 




76253-76364 


402745 


9212200 


Minus 


76516-76690 


402915 


7406502 


Minus 


140-276 


404440 


7528051 


Plus 


60430^1581 


404604 


9212537 


Minus 


72019-72509 


405545 


1054740 


Pius 


118877-118807,119091-119296,121626-121823 


405547 


1054740 


Ptus 


124361-124520,124914-125050 


406400 


9256298 


Rus 


1553-17^1878-2140,4252-4385,5922-6077 


408467 


8785551 


PkB 


182212-1829SB 



TABIE 14A: 209 GENES UP-REGULATED IN CERVICAL CANCER COMPARED TO NORMAL ADULT TISSUES THAT ARE UKELY TO ENCODE EFTHER ENZYMES OR 
PROTEINS AMENABLE TO MODUUTION BY SMALL MOLECULES 

Table 14A lists about 209 genes up-regulated In cervical cancer compared to nonnat adult tissues thai are IQtely to encode either enzymes or proteins amenabie to nwduialton by 
small molecules. These were selected as tor Table 12A, except that the raito was greater than or equal to 2.0, and the 96^ percentile value amongst cervical cancers was greater Qian 
or equal 40 units, and fiie predicted protein contained a stnictural domain that Is indicative of enzymatic functionor of bdng modulatabte by smaQ moiecutes (e.g. pkinase, peptkJase. 
phosphatase. ATPase, or knUransportar domains). The predtoied protein domabis are noted. 

Pkey: Unique Eos probeset kJenGiier number 

ExAocn: Exemplar Accession number. Genbank aocesston number 

UnigenelD: Unlgene number 

PPDomains: Predicted Protebi Domains 

Unlgene Title: Unlgene gene title 

R1: RaSo of tumor to normal adult Gssues 



Pkey ExAccn UnigenelD PPDomans Unigene Title R1 

418007 M13509 Hs.83169 SSMnopexIn,Peptidase.M10 matrbt metaDoprotelnase 1 (inters! 38.9 

439608 W79123 Hs.58561 TM.7tnu1 GprolelnH»upled receptor 87 283 

400269 X07620 Hs.2258 h8fflOpexin.Peptktese_M10 mabixmet^loproteinaselOtstrome 20i 

415817 U88967 Hs.78867 SS,TM.Y_phosph8l8se,carb.anhyd protein tyrosine phosphatase, recep 16.4 

416209 AA236776 Hs.79078 TM,HORMA MAD2(inIlDtfc arrest defidenl yea 15.4 

404996 NM_001333 Hs.87417 PepOdase_C1 crSL2 Calhepsin L2 13.1 
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428618 AA885360 Hs.16019g pUnase Target CAT 1Z7 

429486 AF155827 Hs.203983 heDcasoLdSNR^DielicasflLC hypoSieQca! protein FU 10339 12.6 

419183 U60669 Hs.69663 p4S0 cytochrome P450.subf3mity XXIV (vl 12J 

. 428368 BE440042 H5.63326 S8J>epQdase_M1(Ui6mopexfn matrix metaDoproteinase 3 (stnofnel 10^ 

5 420769 T11632 Hs.127797 heilcaseLC Homo sapiens cONAFUl 1381 fis^cb 1(U 

456194 AW383618 Hs^65459 p450 ESTs. Moderately similar to AUJ?_HU 9.4 

448232 AI281848 Hs.194691 TM.7tnL3Rteomd.L13 reSnolc add Induced 3 &9 

424905 NM_002497 Hs.153704 TM^pUnase NIMA (never in mllosis gene aHela &9 

452291 AF015592 K3.28853 TM^ldnase C0C7(cell division cycle?, 8. cer 8.7 

10 424086 AI351010 Hs>1022&7 LysyLoxidase lysyl oxidase a3 

425710 AF030880 Hs.159275 TM.Sulfate_transp.STAS solute carrier family, memt)er 4 7.8 

433133 AB027249 H5.104741 THCol]agen,pianase PDZ^dlng kinase; T<ell original 7.4 

447254 NM.004153 Hs.17908 SS.AAA,BAH origin recogtd&m complex. subunU 7.1- 

431941 AK000106 Hs.272227 pkin8se.Furin#e,RBcep.Ldom Homo sapiens cONAFU20099 lis. do 6.9 

IS 427821 AA470158 Hs.98202 TM.7tm.1 ESTs 6.9 

438211 AK001581 H3.334828 Ammoniunutransp Irypoiheticai protein FU10719; KIAA 6.9 

403471 NA SS,TM,trypsin Target Exon 6.7 

410153 BE311926 Hs.15830 Qycos_transL2 hypothetica! protein FU12691 6.5 

457405 AA5048S0 TM.7tm.2 glKabOSalOxl Stratagene fetal leO 6.4 

20 421948 U2583 HsJ34309 fnam8nl.HC03jcotranspfiIament ker^6A 6.3 

439292 AA090421 Ks.5555 TM^^Ferricjedud hypolhefical protein M6C5347 5.8 

413625 AW451103 Hs.71371 TM.E1-E?>TPase,Hydrolase ESTs 5.8 

425695 NM-005401 Hs.159238 TM^andJI.Yjtophdtase piOtein tyrosine phosphatase, non^ 5.7 

438394 BE379623 H5.27693 SS,proJsofnerase peptldylprolytlsoinerase(cyciophil 5.6 

25 432239 X81334 Hs.2936 SS,Fepfidas6_M10,hemope)dn matrix metaDoproteinase 13 (coRag 5.5 

408536 AW381532 Hs.135188 SS,TM.E1-EZ^TPase,CaIkXLATPa ESTs 5.4 

432226 AW182766 Hs.27355B Cytidytyitransf phosphate cytidyiyltransferase l.c 5.2 

419520 AB00g303 Hs.90800 TM.hemopexin,Peptid3sejyi10 matrix metalloproteinasel 6 (memiira 5.1 

426350 NM-003245 Hs.2022 TM,Transgiutamin.C,TrBnsg!utam transglutaminase 3 (E polypeptide, 5.0 

30 421155 H87879 Hs.102267 SS,LysyLoxidase,Aldose_e;An.E lysyl oxidase 4.9 

423673 BE003054 Hs.1695 SS.TM.Pepiidase.M10.hemopexin matrix metailoproteinasa 12 (macrop 4.8 

450375 AA009847 Hs.8850 TM.disintegrin.PepJ412Bj)npep adisinlegrinandmetalloproteinase 4.8 

418379 AA218940 Hs.137516 AAA ttdgefin^ike 1 4.7 

_^ 457465 AW301344 Hs.122908 Prilxisyltran.Suiratase DNA replication factor 46 

35 412333 AW937485 TM.7lm_1 gb.-QV3-DT0044-22129944&*09DT0044 4.6 

450510 AA010056 Hs.242998 DNA.topoisoll,ONAJop6isoMGF ESTs 4.6 

436291 BE566452 Hs.5101 dihydrolase protein regulator of cytokinesis 1 4.6 

446353 AI290919 Hs.153661 KECTpkinase ESTs 4.5 

435435 T89473 Hs.19232B ripase.PLAT ESTs 4.5 

40 425071 NM_013989 Hs.154424 SS,TM.T4.d8iodInase debdinase. lodolhyrontne, type II 4.4 

433322 H50621 Hs.134156 TMJon_lr8nsN&ARCCARDjllltojB ESTs. Weakly similar to 138022 hypo 4.4 

408908 BE296227 Hs.250822 TMjiMnase serine/threonme kinase 15 4.4 

444781 NM.014400 Hs.119S0 PHlactamasO GPI-andwred metastasis-associated 4.4 

426479 Y00272 Hs.1B4572 pklnase ceD division cyde^G1 to Sand 4.2 

45 406687 M31126 Hs.272620 SS.PepSdKOll10Jhemopexin pregnancy epedficbeta-l-glycopret A2 

423035 AW449679 HS.1S6739 TM.GtycoJi8n8fJ H.S8pien3X6mRNA(donePEP11) A2 

449228 AJ403107 Hs.14B590 TM.PAF-AH,p450 protein related with psoriasis 4.2 

423738 A6002134 Hs.132195 S$,Tl^tiyps{n.SEA airvay Irypsiiv^ protease 4.1 

457030 AI301740 Hs.173381 TM,Dihydroorotase dihydropyrimidinase^ke 2 41 

50 448995 AI613276 HS.S662 adenylalekinaso guanine nucteoBde binding pre(^ 40 

415857 AA866115 K8.127797 helto8se_C Homo sapiens cDNAFUII 381 lis, do 40 

438390 AI422017 TM,OSl.7tm-17Im-1 gb:tf45f12j<1 KCI^CGAPJBm23Honi08 40 

429900 AA460421 Hs.30875 pktnase ESTs 40 

446292 AF081497 Hs.279682 Ammonium-transp RhtypeCgtycoproteIn 3J 

55 422938 Nlil.001809 Hs.1594 TM.thtotase cenlromere protein A (17kD) a7 

408771 AW732573 Hs.47584 TM,KJtetraMJians potassium vdtage^ated channel, de 3.6 

424296 AI631874 Hs.155140 pUnase casein kinase 2, 1 polypef^ 3.6 

436246 AW450963 Hs.119991 connexinhonnonejee^ ESTs 3.5 

411274 NM.002776 Hs.6g423 trypsin kdlikielnIO 3.5 

60 400666 SSJiem0P6xh,Pep«asejyi10 NM-002425MNno sapiens matrix m^ 15 

428920 AA393351 Hs.132121 PDEese ESTs 3.5 

412471 M63193 Hs.73946 SS.TM.GtycosJransfJyCarn^ endothelial cell growth factor 1 (p 3.4 
430704 AW813091 Hs.335799 Epbnerase ESTs 3.4 

455092 BE152428 Sutfalase gb:CM0-HTO323-l51299-12&M4HT0323 3.4 

05 453775 NNL002916 Hs.35120 AAA.Pi3J>l4J(lnasQ.PI3KaPI3!e fepiicattonfadorC(activalorl) 3.4 
438993 AA828995 lntegrin.B gb:od77b08.s1NQ_CGAP.Ov2 Homo sap 3.4 

426572 AB037783 Hs.170623 hormor»jec4'C4 hypothetical protein FU1 1183 a4 

449101 AA205847 Ha.23016 SS,TM,7tnu1 G proteiiKOupled receptor 3J 

__ 427660 AI741320 Hs.1 14121 hormonejeczl^ Homo sapiens d3NA:FU23228fis.d 33 

70 402481 TM,GOI.7tnLl Nli^001821':Homo sapiens chorotdeiem 3.3 

414774 X02419 K3.77274 SS,kringte.tryp3in piasmhogen acMor. urokinase 3.3 

412246 AI160873 Hs.6g233 SuifotransferACOX zinc finger prot^ 3L3 

418482 BE001596 Hs.85266 SS.TM.integrin_B,f[t3 intsgrin, l)eta 4 3J 

424667 J05070 H3.151738 8S.Peptidase_M10,fn2.hemopexin matrix metaitofKOtelnase 9 [gelatin 3.3 
75 401486 NA SS,TM,tjypshi C400064r:g{}4758508|refINP.004253. 3.2 

408113 T824Z7 Hs.194101 TM.7tm_3aibosomal.L13 Homo8a^n3cONA:FU20869e3.d U 

427359 AW020762 H&79881 TM.7tnu1 Homo sapiens cONA:FU23006 (is, d 3.2 

402337 . SS.P450 Target Exon U 

420930 AW888&50 ritx)ntKlease.T2 gb:CM4•^a0007-13O5OO^1•fO6KrO0O7 3w2 

80 443426 AF098158 Hs.9329 pklnase chromosome 20 open reading frame 1 il 

439750 AL359053 Hs.57664 TMintegrin_B,RidiLB_(ecSraT Homo sapiens mRNA fun length Inser ai 

420039 NM.004605 HsS4S61 CAIV.SullbtrBnsferDAGi<c sultalransferase family. cytosoQc. ao 

448733 NM.005629 Hs.187956 SS,TM.SNF/BCLtran.lsodh,pktna sdute canter Mly 6 (neurotrans aO 
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5 

10 

15 

20 

25 

30 
35 
40 
45 
50 
55 
60 

. 65 
70 
75 
80 



444946 AW139205 
450841 AI741486 
428262 AI651324 
435399 AA579463 
446733 AAB63360 
449746 AI66B594 
418844 M62982 
414581 AA256213 
431629 AU077025 
445873 AA250970 
438113 AI467908 
422689 AW856665 
439453 BE264974 
413582 AW295847 
410664 NM.006033 
456456 AA477609 
413273 U75679 
426343 NiyL014M2 
403763 

408380 AF123050 
401230 

416030 BE207573 
445540 AWg69626 
432885 AI753709 
419667 AII077005 
406571 AA129547 
412530 AA766268 
431890 X17033 
404184 NA 
428450 NM_014791 
425698 NM_016112 
453331 A!240665 
444826 A1674482 
414987 AA524394 
438746 AI685815 
429413 NM.014058 
407103 AA424881 
453379 AA035261 
421733 AL119671 
452220 BE158006 
417975 AA641636 
400301 X03635 
408938 AA05g013 
411643 A1924519 
446638 AL133063 
430129 BE301708 
417655 AA780791 
448005 AW207437 
423973 AF038461 
437897 AA770561 
425397 J04088 
432777 AA564991 
421247 BE391727 
425465 L18964 
419281 Hg8452 
434205 AF1 19861 
453406 AI192g87 
435542 AA687376 
443151 AI827193 
431630 Nhi.002204 
422310 AA316622 
441954 At744935 
414907 X90725 
439810 AL109710 
429359 WD0462 
432284 AA532807 
452947 AW130413 
423229 AG003865 
453941 U39817 
439963 AW247529 
424439 AA579635 
452755 AW138337 
429922 Z97630 
434149 Z43829 
417576 AA339449 
409994 086864 
416763 AI908127 
414733 BE5145K 
443171 BE281128 
430637 BE160081 
452367 U71207 
408308 ALJ033377 
417900 BE2S01Z7 



Ks.156457 
Hs^0515 
Hs.7238 

Hs3040 

Hs.176588 

Hs.1200 

Hs-72010 

Hs.26^ 

H&251946 

Hs.8882 

Hs^ 

H5.71331 

Hs.65370 

Hs.89563 

Hs.75257 

Hs.169387 

Hs.44532 

Hs.83321 

Hs.31704 

Hs.152484 

Hs.92208 

HS585754 

H8^66273 

Hs^71986 

Hs.184339 

Hs.1 59241 

Hs.8895 

Hs.148441 

Hs.294022 

H8.1B4727 

Hs^1877 

Hs.256301 

Hs.61753 

Hs.1420 

H8212296 

Hs^5 

Hs.1657 

Hs:22607 

Hs.192570 

Hs.15783 

H$.233955 

Hs.14014 

Hs.170378 

H3.136574 

Hs.146170 

Hs.156346 

Hsi!69477 

Hs.102910 

Hs.1904 

Ks.42189 

H3^63032 

Hs.61784 

Hs.269533 

Hs.132714 

Ks^5829 

Hs.98370 

Hs^047 

Hs.77597 

Hs.85568 

H&2399 

Hs.105822 

Hs.125532 

Hs.36820 

Hs.6793 

Hs,1770 

HS^13436 

Hs^6117 

H3.19574 

Hs.62285 

Hs.57735 

H8.7g748 

Hs.77171 

Ms.9030 

Hs.256290 

Hs.29279 

Ks^197 

Hs.82906 



SS.TM.^iQlase 
pro JscKTiefssG 
deslh,pl(in35e 
pkbiase 
TM.P450 
SS^50 

SS,TM,8poxy9eiiasB,PlAT 
TM.Cam_acyUransf,Cholm8jdn 
&S.lRNA.antiSH2.SK3^8se 
SS^PABPpkinass,14^^ 
TM,7lmJ 

tiencase.CSNF2JAi8l)c8se.C 
SSAAA 
csfbjonhydrase 
TM,iipas8,PLAT 
FBPase 

TM,8GAN7lni.1 
TM,7!ni_1 

TM,ublqu)fln7tm_3^F_receptor 
SS,TMJon.(r8ns,IQ 
SS,TM,Pep{idase.S26.Bombesin 
TMalpha-amylasd 

T^A.ion_lransNB^,CARD,WD40^ 
SS,TM.disbTtegr1n.Repralysiii,P 
Tli^p)dnase,PleKliu8pe3l.Se^ 
abhydralase 
vwa.FG<3AP.intogTM 
SS.TM.7tm.1 
p)dnase.KA1 
TM.plunase,kjnJrans 
TM4is!i\t8giin,Pep_M12B_propep 
pkInase.SAM 

connexinhormon0jec,zMM,conn 
Ribosonnal_S2,transfenin 
trypsin 

TM.cNW>_Wndlngtryp3in 
PAfU(ringle,byp6in 
SS»TM.Ig.pltee 
TMMitn>.F&GAP 
8S.tiyp^ 

TM.OesUecep,2K)4,honnonejB 
Yj>hosphatas8 
DEAD,heiicase.C 
TMjiidnasa 
TM.Glyoo.transLt1 
PepU(l3se_M41>\AApV]nase 



TM4ipQixygen3se.PLAT 
SS,pio.isom8rBse 
DNA^topoisoII.DNA.1opoIsoiVI6F 
atf^a-amylasa 

TM.lRMA«ynLl,SPRYF5.F8_lrt)e« 

TM.pHnase.DAG„PE-Wnd,OPR,pW 

7M^1-E2>TP8S8.HMA.Hydnrfase 

SH3.ethand.C2,PH.RhoGEFWPG 

p)dnase,FutM(e,Recep_U.dom 

pkin8se.RhoGEF3g.PaSH3 

DNAjnisj8palr,HATPasejcAcylph 

SS.TM.F043APJntagiin^ 

SS.TM,fh3,lg.p!dn3se.RibQS0(nal 

TM.Band.7;kAA,cdc48J« 

SSJM.pkln8se.FOLO.box 

aconUase^nHase_C 

SS.TM.Pep6dasaJA10Lhenu)pB)dn 

TM,pk]n3se 

dpha-amyfase 

SS.tjypsin 

DEAO.HRDC^elicasflLC 

TM4}450Ets 

ONAJgase 

GlycoLlbnsfJ9 

TMMerJiistoneTtnLl 

TM,EPHJbdMpMnasaiSAM 

TM.AIRS,tbmiyUransf,GARS 

IPJr8nsSH2.SH3 

TM,ali)h»flnvlasa7tnLl 

TM.MCMHenie.oxygenase 

SS,TM,7tin_1.rnn 

8_100PepMas8jifl18 

SS,Hydro)ase 

m7lm_2 

TM,WD40.prt)Jsoroerase 



bypoMcal protein FU22408 3-0 

EST8 3.0 

biphenythydiOtase-likB (serine hyd 3.0 

gbac50c0l8l Strategene hffT neuron Z9 

ESTs^Weddy similar tDfetty acid 19 

ESTs.Wea)dysin«8rtoCP4YJ^UMAN 19 

arachhtonata 124pwygenase 2.9 

ESTs 2.9 

inteiferon,alpha4Rdudble protein 2.8 

poiytA}^dIng protein^ cytoplastri 2.8 

ESTs 2.8 

gb:RC3-CT0297-28010OO13Kl03 CT0297 18 

fiiyioldhomionereceplorlntefaclDr 2.8* 

hypolheticsl protein MGCS350 2.8 

lipase, endothella) 2.8 

nudear cap tiding protein subunil 2.8 

stem-loop Oiistone) binding protein 2.8 

K1AA0036 gene product 2.8 

NMLOOIOSd^MnnoeapienstachyKlnin 2.7 

diu^uitin 2.7 

NM.01419rM}mo sapiens sodluni Chan 2.7 

neurome(&i B 2.7 

ESTB,Wealdy6lmitartoKIAA0227[H 17 

E8T8,Vfe^8in\ilartol38022hypo 16 

a disinlegrin and metalloproteinase 16 

met protW)ncogene (hepatocyte grow 16 

tiypothelicat protein FU13346 16 

IntBglt^e)ptla2(CD495.8lpha2s 2.6 

NiyL030903*i1omo sapiens oiractoiyr 16 

KIAA0175 gene product 16 

po!ycyst]Giddney(Ss8ase 2-11(61 16 

ESTs 18 

ESTs 16 

liypothetica) protein FU14950 16 

ESTs 16 

0ESC1 protein IS 

iiypothefcal protein M6C13170 2.5 

ESTs 15 

fibroblast growth (actor receptor 3 2.5 

ESTs 16 

hypothetical protein FLJ23186 15 

estrogen receptor 1 IS 

ESTs 14 

hypothetic^ protein RJ22028 14 

Homo sapiens mRNA:cONADKFZp434P11 14 

hypothetical protein FU204O1 14 

hypolhetica! protein FLJ14d13 14 

ESTs 14 

arachldonate 12-)ipoxygenase, 12R 1 14 

hypQthetlca]proi8lnFU22969 14 

topoi$omerase(DNA) II alpha (ITOId) 14 

ESTs 14 

genera) transcription factor UK P 14 

protein kinase iota 14 

ESTs 14 

hypothetical protein PRO2015 14 

hypothetical protein FU14451 13 

ESTs 13 

ESTe 13 

integrin. alpha 3 (anflgen CD49C a 13 

cytochrome P450.subfamnyilS.pd 13 

Fenconi anemia. comp!efflenta8on gro 13 

pQlo(Droso|ME94itoidnase 13 

EST 13 

matrix mel^Iloprotalnase 14 (mentbra 13 

ESTs 13 

gb:xf50f04j(1 Na.CGAP_G3s4 Homosa 13 

protease, serine, 26 13 

Bloom syndrome 13 

platelet-activaQng factor acetythy 13 

iigase I. DKA. ATP-depemtent 12 

ESTs. Weakly similar to A34087 hypo 12 

HI histonefsn^i member 0 12 

hypothetical protisin MGCS469 12 

[j)06phonl)osylg1ycinamidefonnytlra 12 

acetyl LOl receptor; SREC 12 

sohjte earner famiy 3 (acSvaton 12 

minichromosoma maintenance deSdsn 12 

TONDU 12 

S100 caldum-tMng protein A1 1 (c 12 

eyes absent (Oro80ph{ta)homolog 2 12 

hypolhe&al protein OKFZpS64D0462 12 

CDC20(cendMSloncycle20.ac 11 
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424490 


AJ2/0V1D 




iMjwn3se,anK 


onxynn reposi aoniani 9 


91 


41cOi94 




ns.79881 


TM,7tin^1 


Umim ^fifilAMe tf*OMA* CI l90fVtf« fie jit 

nomD oQpions cunn, ru^juuD uSi n 


91 


•tiuooo 


VDTTOC 


nS.DD7iD 






9 1 










yD^n£ 1 itn/HJo 1 rtm_vAdnr.uvx> 1 nunn so 


91 

All 


44lOf«» 


dU/Uo4U 


tin IQIQO 




cycOiwlBpondsnt Uiusb 2 


9 1 


450663 


H43540 


nS.25292 


OP Tti . itH 

SS,TM,RN3S8.HIl 


ribonudsssQ HI, isrgG subunit 




W00U9 








vaocbila relatBd Idnsss 1 


9 1 

All 


44CW10 




KS.2U2097 


ss.cajb, ^^^l^MA^A,Tl^TUB,vwo«cr 


procollagfin C>6ndop6pQd8SO snhsncB 


9 1 
£.1 




U73727 


nS.i97iO 


00. T M,T j)nospnatase,injbig«MAM 


piotBin tyiQ^nfi pho^Jhataso* reocp 


9 1 






nS.3u32ii 


Gyco6_trBnsf.2 


CO IS 


9 4 
£.1 










IdU n^^yOAA^^ U«i 1 ftOiilBfia tt_fMa aulttw 

nIVLWtW* .TwTnO 8a|NaB V*iiJS BViail 


9 1 


403133 






pKlna5e.lvt8ira,Bana_4l,Rho(Ar 


Target Exon 


9 1 
C.I 


440249 


M24o590 


Ks.337279 


VHL.TaiD_DNase 


ESTs 


2.1 


438580 


AABIIZoZ 


Hs.299202 


TM«pkinasesugarJr 


CCTb 


9 4 
A.1 


406400 






SS,TM,bypsln 


NMJ)07196:Konio sapiens kalfikiBin o 


2.1 


427375 


AL035460 


Hs.1 77535 


SSZnjB8rbOpep(,hormon85Reprol 


mstaOocarboxypepOiteso C(%1 


9 4 
4.1 


423453 


AW450737 


H8.12B791 


SS.GrBnln,CDP-0H_P.ir8nsi 


CGHJ9 protein 


*l 4 
2.1 


433716 


AAouSBOB 


Hs.225118 


AifylphasphaiBso 


ESTs 


2.1 


420757 


X78592 


Hs.99915 


TM|tuKnioML(^iAndnyAAjBC6p, 


androgen leceptor (dihydnitestoslN 


9 4 

2.1 


425018 


BE245277 


Hs.1 541 95 


DNas8j,K.tetra 


E4r iranscnpuon tacior i 


9 4 

2.1 


421685 


AF189723 


Hs.1 06778 


TM,E1 -E<^TP3se»nyara8SeE1'E2 


ATPase, Ca transporting, type 2C, m 


2.1 


457288 


AA52145B 


Hs.192738 


proL.|soin6r3SB 


ESTs 


2.1 


407305 


AA715284 




TMipUnase^SeRiaPleiAuepsat, 


go JivjaiUo.n NujLAaAr JSR) nORio sap 


9 4 
2.1 


456327 


H68741 


Hs.36774 


TM»G^ooJu3nstJ9 


ESTs 


2.0 


422429 


AA310527 






gb:EST181333 Jurkat T-ceDs V Homo 


2.0 


402974 


NM.001501 


Hs,1 29715 


GnRHhormone5ihonnon84 


gonadotR^itaHelBasIng honnone 2 


10 


458016 


AW188099 


Hs.131813 




ESTs 


10 


452194 


At694413 


Hs.332849 


TM,7tmJ>\NFjxep(Dr^ushi 


olfadary receptor. famBy 2. subfa 


10 


428028 


U52112 


Hs.182016 


TM.pUnas8,MB0 


tnterieuMn-l receptor-associated k 


10 


427747 


AW411425 


Ks.180655 


pklnase.Rpoxygenase.PlAT1ipox 


serine/threorfne Idnase 12 


10 


452841 


T17431 


Hs.65412 


TM.DEAO^case_C 


DEAD^ (Asp^luAla^p/HIs) tnx po 


10 


449539 


W80363 


Hs.58446 


pkin3se,FiiilfvlIke,Recep.L<lom 


ESTs^ 


10 


418140 


BE613836 


H5.83551 


TM.E1-ELATP8se 


ntoofitviBar-assodated prot^ 2 


10 


430076 


AA465115 


Hs.318773 


AAA.BAH 


KIAA1836 protein 


10 


425749 


AW328587 


H3.159448 


Rnx^om8LL7A8XRIURRCr.pUna 


surfeit 2 


10 


425855 


AF135025 


Hs.159679 


SS,trypsin 


kainkre!n12 


10 


400135 


L40027 


Hs.118890 


pUnase 


glycogen synthase Mnase 3 alpha 


10 



TABLE 14B 

Pkey: Unique Eos prat^set Identifier number 
CAT number Gene cluster number 

Accession: Genbank aooession numbers 

Pkey CAT number Accessions 
* 

412333 1289037_1 AW937485AW937589 AW937658AW937654AW337492 

418804 179138.1 AA809632AI917245A1701732AA228406 

420930 197736L1 AVV888650AW888651BE1499466E149948BE149951BE149947AW888649AA281840AA281822AVV8886^ 

422429 216469J AA310527AW962295Z44865H06641 

422689 21989U AW856665AA315006AW954733 

435399 405576_1 AA679463AW81 3779 AW81 3709 

438390 45662.1 AI422017A1422945AI363249Ai423113AI925S92AI420795AI208187AI423279AU23645A)424090^^^ 

438993 46765U AA828995AA834879Ai926361 

452947 939810.1 AW130413Ai932362 

455092 1252971.1 BE152428AW8S5572AW855607 

457405 333127.1 AA504860AA504911 



TABLE 14C 

Pkey: Unkfue number corresponding to an Eos pnibeset 

Ref: Sequence source. The 7 d^itrnjinbereb this column are Genbank kfentiiief{GI)nmiber^ "Dunham Letal.' refers to (he pubDcaBonenGtled The DNA 

sequence of human chromosome 22.' Dunham I. et al., Nahro (1999) 401489495. 
Strand: Indicates ONA strand Cromwhh^hexons were predictBd. 
Ntj)os{l!on: Indicates nudeotida positions of predicted exons. 



Pkey 


Ref 


Strand 


NtjxjsHion 


400666 


8118498 


Plus 


17982-181 15.20297-20456 


401230 


9929527 


Minus 


33835^006,3453»459136461-3674S.4892S49098.52604^758 


401486 


7341763 


Rus 


32585^2756,36281-36&40.4079140933,4401B44179 


402337 


6957691 


Plus 


411M288,16811-16973,17107-172S6.19715-20040.22029-22205 


402481 


9797406 


Plus 


8/891-88991 


403133 


7331427 


Plus 


38314-36634 


403471 


9930659 


Minus 


85867-65983 


403763 


7229888 


Minus 


4357W3887 


404184 


4581418 


Minus 


12652-13546 


404274 


9885189 


Plus 


104127-104318 


406400 


9256298 


Plus 


1553-1712,1878-2140.4252-438S.5922W7 
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TABLE15A: 752 GENES UP-HEOnATED IN CEWICAL CANCER COMPATCO TO 
TaWe ISA Dsts about 752 genes up^egulaled to ceivkal cancer com^^ 

equal to 7.0, the denominator was the median value for three noiHnignanloervica) specimens, and (he 96^ peicentile value amongst covicai cancels was grealer than or equal 8 
units. 

Pkey: Unique Eos probesetUenCfier mimber 

ExAccK Exemplar Accession number, Genbank accession nurnber 

UntgeneiD: Unigenenun^ 

Unigene Title: Unlgene gene tiUe 

R1: RaSo of ceivicsi cancer to nonnatoenHx 



Pkey ExAocn UnlgenelO Unlgene TlSe 



414915 NM.002462 HsJ6391 
411248 AA551538 Ks.334605 
428227 AA321649 Hs.2248 
421508 NK1004833 Hs.105115 
443639 BE263042 Hs.9661 
454390 AB020713 Hs.56966 
416065 BE267931 Hs.7899S 
433226 AV\^733 Hs.9414 
413503 BE410228 H3.75410 
411669 BE612676 Hs.303116 
414132 AI801235 Hs.48480 
422809 AK001379 Hs.121028 
448569 BE382657 Hs.21486 
432906 BE265489 Hs.3123 
416963 BE304571 Ks.8952g 
439953 AW247529 Hs.6793 
449722 6E280074 Hs^3960 
414812 X72755 Hs.77367 
406405 AK001332 Hs.44672 
432917 NhL014125 H$.279812 
412530 AA766268 Hs.266273 
457465 AW301344 Hs.122908 
408806 AW847814 Hs.289005 
429083 Y09397 Hs.227817 
401405 

426272 AW450671 HS.189284 
424878 H57111 Hs.221132 
412140 AA219691 Hs.73625 
444371 BE54Q274 Hs.239 
418030 BE2(r7573 Hs.83321 
452291 AF015592 Hs.28853 
400196 

416795 AI497778 
424865 AF011333 Hs.153563 
438011 BE466173 H8.145S96 
428368 BE440042 Ks.83326 
436923 AW293704 Hs.122658 
415791 H09366 Hs.78853 
448775 AB025237 K&.388 
435647 AI653240 Hs.49823 
431049 AA848576 Ks.103267 
429486 AF155827 Hs.203963 
428433 AA521410 Hs.41371 
416322 AA284166 Hs.84113 
417308 H50720 
429574 BE26B321 
407204 R41933 
408901 AK001330 Hs.48855 
438899 AF085833 Hs.135624 
456362 AW973003 Hs.179909 
438598 AI805943 Hs.326067 
408908 BE2962Z7 
427488 M91401 
400195 

414747 U30872 
410324 AW292539 Hs.30177 
453028 AB006532 Hs^1442 
410608 AI538438 H8.159087 
432503 AA551196 Hs.188952 
430512 AF182294 Hs.241578 
430709 R34356 
449962 AA004879 Hs.187820 
425408 AB00237S Hs.156814 
440774 AI42Q611 Ks.127832 
408201 AKD00568 Hs.43654 
438110 AA704699 K5.291651 
426897 AW976570 H5.97387 
447700 A1420183 H3.171077 



Hs.20509 



Hs.81892 

H5.208912 

Hs.140237 



Hs.250822 
HS.1786S8 

Hs.77204 



myxovinjs Onfluenza) resistance 1. homolog of murtne 
Homo sapiens cONA RJ14408 lis. done KEMBA1004341 
small Indudbte cytoMne subMly 8 (Cys-X-Cys). mem 
absent bi melanoma 2 

pToteasoms (prosome, macropato) subuiiil, beb type, 1 
KIAA0906 protein 
protilerating cell nuclear antigen 
KIAA1488 protein 

heal shock 7QkD protein 5 (gtucose^uialed protein. 

stromal celi-dertved factor 2-13(6 1 

ESTs 

hypothetical protein FIJ10549 

signal transducer and aclivator of transcription 1, 9 

lethal giant larvae (DrosophBa} homolog 2 

akb-keto reductase famDy 1, member A1 (ddehyde red 

platetet-acfivatino factor aceiylhydrolase^ isoform i 

cydinBI 

monokine induced by gamma interferon 
hypothetical protein FU10470 
PRO0327 protein 
hypothetical protein FU13346 
DNA repQcalnn factor 

Homo sapiens cONA: aJ21532 fis. done CCL06049 
BCL2-related protein A1 
Target Exon 
E8T8 

ESTs 

RAB6 intoracSng. Mnesln^ (rabMnesin 6} 

fbrkheadboxMI 

neuromedin 8 

0DC7 (ceU division cyde 7, S. cerevisiae, homotog)- 
Eos Control 

HBVpXassodated protein^ 
lymphocyte anSgen 75 
spSdnQ factor (CGI .3) 

matrix metaOoprotelnase 3 (sbomelysin 1, progdIaSn 
ESTs 

uraciMDNA glycosytase 

nudb( (nudeosUe diphosphate linked moie^ X)-type m 
ESTs 

hypotheficat protein FU22548 similar to gene trap PA 

hypothe6cai protein FU10339 

ESTs 

cydinKlependent kinase inhibitor 3 (C0K2-a$sodated 
KIAA0101 gene product 
hypothetical protein KtGC861 

ESTs. Weakly sImllartoALUIJIUMAN ALU SiAFAMILY J SE 

hypothetical prot^aJ10488 

ESTs 

hypothefical protein FU22995 

hypotheOcd protein MG(S178 

serine/lhieonine kinase 15 

RA023 (S. cerevisiae) homotog B 

NIA.007057*:Homo sapiens ZW10 interactor (ZWINT). Iran 

centromero protein F (SSQMOOkO, mitoslh) 

ESTs 

RecQproteirvaeA 

ESTs 

ESTs 

U6 snRNA^odated Snvllke protein t^mS 
gb:yh85dOU1 Scares placenta Nb2HP Homo sapiens cONA 
ESTs 

K1AA0377 gene product 

ESTs 

hypotheOcal protebi FU^)561 

ESTs. Weakly similar to 138022 hypothetical protein ( 

ESTs 

ESTs. Weakly simBar to T21259 hypotheOcal protein F 
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R1 

58^ 

35.6 
33.6 
3Z0 
30.5 
30.4 
30.0 
29.4 
28.9 
28.3 
28.1 
27.4 
27.2 
26.9 
26.5 
26.2 
25.3 
25.1 
24.6 
23£ 

ai 

225 
225 
22.8 
22.7 
22.6 
22.5 
22.2 

m 

21.6 
2U 
21.2 

m 

20.7 
20.3 
20.2 
20.0 
19.6 
19.6 
19^ 
19.5 
19.4 
193 

m 

19.2 
19.0 
19.0 
19.0 
185 
18.8 
185 
18.6 
18.5 
18>l 

ia3 

18.1 
18.1 
175 
17.7 
17.6 
17.3 
17.1 
165 
16.7 
16.7 
165 
16.5 
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433159 AB035898 Hs.150587 Idnesin^ike protein 2 16^ 

' 430466 AF05257a Hs^41517 polymerase (DNA directed). Iheta ' 16J 

453883 AI638516 Ks^630 cotactor lequiied for Sp1 traitscriptbna) acflv^ 16^ 

_ 453941 U39817 Hs.36820 Bloom syndrome 16.1 

5 438461 AW075485 Ks.286049 phosphosenne aminotransferaso 16.0 

407999 AI126271 Hs.49433 EST5,VVteMysiJTin» to YZ28_HUMAN HYPOTHETICAL PROTE 16j0 

413943 AW294416 Hs.144687 Homo 8^)ien3cONAFU12981fis, done Nr2RP2006454 1i7 

407720 AB037776 \^m02 KIAA1355prot^ 1M 

425316 AA354977 Hs,191565 ESTs, Moderately 6lmaartoT14342NSD1 protein- mou 1i6 

10 423673 BE003054 Ks.1695 matrix meteUoprotelriase 12 (macTophageete 

419777 D60134 Hs.270975 ESTs 1&3 

453886 R66282 H8.20247 ESTs, Weakly slmQar to 865657 atpha-IOadrenerglc re 1Sl2 

443715 AI5B3187 Hs.9700 cydmEl 1S2 

407786 AA687538 Hs^972 telraspani 1&2 

IS 431910 AK000142 Ks.101774 hypotheflcal protein FU23045 15.2 

417634 W27202 Hs.82327 glutaOilond synthetase 15.1 

43^2 AW974944 Hs.200577 ESTs 15-1 

446921 A8012113 Hs.16530 snoBlntiiGMecytDldnesutjM^ 1&1 

427999 AI435128 Hs.181369 ut)b|ijllinfitsiondegradallon l-lte 1&0 

20 413869 NH.000878 Hs.75596 fnterieuUn 2 receptor, beta 15.0 

431629 AU077025 Hs.265827 lnteTfsron.alpl»lnduc&)te protein (clone ^-6-16) 14.9 

435354 AA678267 Hs.11711S ESTs 14.8 

406836 AW514501 Hs.156110 ImmunoglolxiBn kappa constant 14.B 

416109 AI420311 Hs.126550 suppressor of K transport defect 1 14.8 

25 417933 X02308 Hs.62962 Oiymidytate synthetase 14.7 

436970 AA837782 H3.321058 ESTs 14.7 

409680 W31092 Ks.55847 mitochondifal rtt)0S0mal protein 64 14.7 

432401 N!yL013330 Hs.274479 NME7 14.6 

425397 J04088 H5.1S6346 topoisomerase (DMA) II alpha (17DkD} 14.5 

30 420734 AW972872 Hs.293736 ESTs t4.5 

434256 AI378817 H5.191B47 ESTs 14.5 

418269 AA806113 Hs,189025 ESTs 14.3 

427372 AW960673 Hs.177530 ATP synthase. Htransporftig.mitochondfaFI compla 14.3 

427081 A1474533 Hs.170528 ESTs, Moderately sbnHar to ALUCHUMANilll ALU CUSS 14.2 

35 420309 AW043837 Hs.21766 ESTs, Weakly similar to ALU5JIUMAN ALU SUBFAMILY SOS 14.2 

429966 BE081342 Hs.283037 HSPC039 protein 14.2 

410361 BE391804 Hs.62661 guanylato binding protein 1, Interferon-inducible. 67 14.1 

443957 AA521049 Hs.34487 hypothe1k»l protdn FU23412 14.0 

418803 U50079 Hs.885S6 histonedeacetylase 1 14.0 

40 434094 AA30S599 HS.23820S hypothe9cal protein PRO2013 14.0 

420139 N1l^005357 Hs.95351 lipase, hormone^ensiSve 13.9 

444783 AK001468 Hs.62180 anlHIn (Drosophila Scraps homolog), acSn binding pr 13.9 

433255 Ai274270 Hs.95840 KIAA1527 protein 13.9 

431838 AI097229 Hs.2174d4 ESTs. Weakly similar to ALU1J1UMAN ALU SUBFAMILY J SE 13.8 

45 449801 AM773S5 Hs.288300 hypothetied protein FU23231 13.8 

447076 AW885727 H9.301570 ESTs 13.8 

441240 AA923749 Hs.132442 ESTs 13.8 

439398 AA284267 Hs.221504 ESTs 13.6 

404630 Target Exon 13.8 

50 408321 AW405882 HS.4420S cortistalin 13.6 

426427 M66699 Hs.169840 TTK protein Mnase 13.5 

413278 BES63085 Ks.833 hterferon-stimulated protein. 15 kOa 13.5 

403055 C2002219*:9i|12737280{reflXP_00668Z21 keialni 18 |Ho 13J 

456614 AV653110 Hs.106650 hypothelical protein FU28533 13.5 

55 42S281 8E38S099 H&334727 hypoOieOcal proldn MGC3017 13.3 

439926 AW014875 Hs.137007 ESTs 13.3 

411263 BE297802 Hs.69360 kinesin-Ito 6 (mitotic centromere^octatodldnedin 13.2 

451141 AW772713 Hs.247186 ESTs 13.2 

447390 X95384 Hs.18426 translalional inhibitor protein p14.5 13.2 

60 419828 T81422 Hs.14922 ESTs 13.2 

428147 AW62g965 Hs.234983 ESTs. WeaMy similar to 210926QABcen growth factor 13.2 

410068 AI633888 Hs.58435 FYN^nding protein (FYB-120n30) 13.1 

407595 6E350012 Hs.248365 ESTs 13.1 

^_ 432721 AL121476 Hs.180$32 glucoss phosphate isomerase 111 

65 416975 NM_004131 H8.1051 gTBnzymeB(granzynieZcytotoxteT-lymphocytfr«sod 13.0 

413314 BE081585 gb:Cf\^-BT0635'210400-15&t07BT0635 Homo sapiens cONA 13j0 

430929 AA489166 Hs.156933 ESTs 12.9 

449571 AW016812 H3.200266 ESTs 12.9 

„ 400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 0» prostate 12J 

70 417105 X60992 Hs.81226 COeenflgen 1Z6 

434263 N34695 Hs.44648 ESTs 12-6 

412059 AA317962 H6.24g721 ESTs, Moderately sbr^ to PC4259fenifinessodate 12.6 

407756 AA1 16021 Hs.38260 ubhiullin speciiic protease 18 12.5 

437056 AI147061 gb:ok33a11.s1 SoaresJISF.F8_9W,OTJ»A.P-S1 Homosapten 12.5 

75 438766 AI307416 Hs.164675 ESTs. Moderately shnOar to AIU7J1UMAN ALU SUBFAMILY 115 

444478 W07318 Hs.240 Mfhase phosphoprotein 1 125 

450738 AA010907 H8.184456 hypoUiet&al protein 12.4 

416205 L21715 Hs.83760 troponin I. sketetaLfest 12.4 

442994 AI026718 Hs.16954 ESTs 12.4 

80 433301 AW29B2B0 Hs.152016 Homo sapiens cDNA:FU221 40 fe.ctoneHEP20977 114 

437457 AA7$7900 HsJZ70823 ESTs. Weakly simDar to S65657 dpha-IC^nerglc re 12.4 

435327 BE301871 Hs.4867 mannosyl (alpha-13-K!ycoproteln bete>1.4^acetylg 12J 

422785 AW409701 Hs.1578 bacuiovirsltAPrepeat4onteiitlng5(sunMn) 113 
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15 
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410245 C17908 Hs.194125 
424927 AW973666 K3.153a50 
418941 AA452970 HS.15S218 
432325 AW973209 Hs.2617B2 
414761 AU077228 Hs.77258 
416618 U66097 
449296 A1.137257 
415657 AAB66115 
427295 AW291212 Hs.^3943 
415443 T07353 Hs,7948 
429770 AI756047 
428955 AA579297 
435244 N77221 
432810 AAB83400 
434423 NM_006769 Ks.3844 
443378 AW392550 Hs.9280 
459273 AW608906 Hs.334767 
419945 AW290975 Hs.118923 
442159 AW163390 Hs^78554 
438169 AA888311 
407804 AF228603 
401557 

434408 A1031771 
408747 A)925153 
409231 AA446644 
429412 NM^006235 Hs^407 
445655 AA873830 Hs.167746 
419138 U48508 
427527 AI609057 
432287 AK001D57 
408548 AA05S449 
409703 NM.006187 H$.560O9 
447082 T85314 Hs.42644 
409931 eE293233 
426172 AA371307 
424723 BE409813 
456880 AW015644 Hs.155005 
433849 BE465884 Hs^0728 
430519 AF12g534 
434442 AA73741S 
457205 AI905780 
422713 AA902780 
443491 AW499665 Hs.9456 
424339 BE257148 Hs.145416 
419741 NM_007019 Hs.930O2 
450208 Ai686945 H3.272062 
446849 AU076617 Hs.16251 
424965 AW956282 H5.144609 
442737 AB002319 
409113 AA074897 
415782 AA169345 
417958 AA767382 
402539 AW502761 
413677 AW503116 Hs.301819 
414706 AW340125 Hs.76989 
421632 AA825426 
438995 AI277986 
400289 X0782O 
432363 AA534489 
451655 H85689 
429237 AA448417 
427719 AI393122 
444665 BB613126 
410093 AW589659 Ks^120 
' 400080 

424517 AI539443 



Hs.86724 
Hs^5B 
Hs.127797 



Hs.99736 
Hs^6937 
Ks.187824 
HS23054 



H8.17602 
Hs.39957 

Hs.132586 
Hs^17493 
Ks.692 



Hs.89631 
Hs.302063 
Hs.274268 
Hs.63187 



Hs.129771 
Hs.125056 
H5.1 52337 



Hs.49210 

Hs.162826 

Hs.198272 

Hs.1 19325 



H&8663 

Hs.123177 
Hs.193417 
HS.30S09 



Hs.238832 
Hs.164875 
HS.22S8 

Hs^5560 
Hs.104990 
Hs.134726 
HS47783 



EST8 

hypolhefica) protein C321DZ4 
ElB-SSkDa-assodated protein 5 
ESTs 

enhancer of zesie (Dfosophlla} homotog 2 

GTP cyclohydiolase 1 (dopa^ponsive dystonia) 

Homo sapiens cONA: RJ23015 lis, done LN600818 

Homo sapiens cDNA aJ113Bm done HEMBA1 000501 

hypothetical protein MGC11266 

ESTs 

ESTs 

bnSn and nasoptiaryngeal caicinoma8UScepta1)!!ity pro 

ESTs 

ESTs 

UM domain only 4 

proteasome (prosome. macmpaln) sutHmil. beta type. 9 

hypothetical protein MGCS629 

ESTs 

tieterodiromaOn-IOGB protein 1 

Homo sapiens cDNA FU12381 lis. done MAMMA1002566 

ptedcstrin 2 (mouse) homotog 

Target Exon 

ESTs 

annexInA2 

GA733-2 anOgen; epHheKal glycoprotein (EGP) (KSA) 

POU dom^n, dass 2, assocMng factor 1 

B cell llnlcer protein • 

ryanodine receptor 1 (siteietai) 

lmmunoglot)uiln heavy constant mu 

Homo sapiens cDNA FU10195 fis, clone HEMBA1004771 

ESTs, WeaWy similar to ALUC^HUMAN ID! ALU OASS C W 

?-5'-oligoadenyiatB synthetase 3 (100 kD) 

(hioredoxMiiffi 

ESTs 

ESTs 

pmtein arglnlne hknethyttransferase 3(hnRNP methyitr 
TEA domain family member 1 (SV40 transcriptional enha 
ESTs 

F*t)0xoniyprot^4 
ESTs 

Target CAT 

KunSngSn-interacSng protdn A 

SWUSNF related, matrix assodaled, adin dependent r 



401539 



H8.137447 

T93096 Ks.17126 

451066 At758660 Hs:206132 

409235 AA188827 Hs.7988 

451730 AF095687 Hs:Z6937 

428054 AI948688 Hs^6619 

441838 AAG81848 Hs.7921 

438654 AI005270 H$.123543 

424321 W74048 Hs.1765 

449035 AI815728 Hs.19980 

436137 AI066769 Hs.133512 

417863 AB000450 Hs.82771 

439975 AW328081 Hs.6817 

434206 AW136973 Ks:S8516 

454355 AW812535 

435542 AA667376 Hs^9533 

431386 AAS04359 Hs.110067 



ulAiuitin carder protein E2-C 
ESTs 

deavage and polyaderiytalion spedfio fador 3k 73kD 
Homo sapiens. Similar to RIKe<cONA5730S76N08 gene. 
iaAA0321 protein 

gb2m85a05.r1 Stralagene ovarian cancer (937219) Homo 

ESTs 

ESTs 

K1AA0430 gene product 
zinc finger protein 146 
K1AA0097 gen© product 

ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SE 
ESTs 

matrix melalkiprotfiinase 10 (stromelysin 2) 

gb:nf76g1U1 Na.CGAP.Go3 Homo sapiens cONA clone 3* 

ESTs 

ESTs 

ESTs 

B eggressive lymphoma gene 

ESTs, Weakly sinntar to K1AA0970 protein [H^iens] 

Zos Control 

Homo sapiens cDNA FU12169 tis. done imMA1000643 
NIUL00267S:Hofflo sapiens pnmyelocyliQ leukemia (PMa^, 
hypothefk»l protein M6C15912 
ESTs 

ESTs. Weakly similar to 138022 hypotheBcal protein [ 
br^ and nasophaiyngeal carcinoma susceiriibiriiy pro 
ESTs 

Homo sapiens mRNA; dSNA OKFZp566E1 83 (from done WFZ 
ESTs 

tymptiocyte-spediic protein tyrosine kinase 
DNA polymerase epsllon p12 subunit 
ESTs 

vaodnia related Mnase 2 

inostne triphosphatase (nucleoside triphcsphate pyrop 
ESTs, Weakly sbniiar to S69890 mitogen indudbia gene 
gb:CM4-ST01 82-051 099-021-c09 8T0182 Homo sapiens cDNA 
ESTs 

ESTs. WeaMysimBar to ALUSJiUMAN ALU SUBFAMILY 8CS 
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123 

\U 

1Z2 

12^ 
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12.1 
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12.1 
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11.0 

11.0 

11.0 

10.9 

10.9 

10.9 

10.9 

10.9 

10.8 

10.8 

10.8 

10.8 

10.8 

10.7 

10.7 

10.7 

10,7 

10.6 

10.6 

10.6 

10.6 

10.6 

10.6 

10.6 

10.6 

10.6 

10.6 

ia6 
las 



wo 03/042661 



PCTAJS02/36810 



416S64 AW795793 Ks.179827 Homo sapiens cONARJ12257fi5.clon9MAMMA1001601.h 105 

447769 AW873704 Hs.320831 HonH) sapiens cDNA FU14S97 lis, done ^^^2roM002330 105 

408329 AF155510 Hs.44227 heparanase 10.5 

410146 AW592655 gb:hf45ri2j(1 Soares_NFU.T_GBC.S1 Homo sapiens cDNAc 105 

5 427600 AW530918 Hs.179774 proteasome (prosoms, macropain) acBvator siA^^ 104 

416007 M13509 Hs.83169 matrix metaitoproteinase 1 (inteisGQalcoDagenase) 10.4 

407241 M34516 gbMiman omega GgMch^ protein 14.1 (Ig lambda ch 104 

43S061 AI651474 Hs.163944 ESTs 10.4 

409653 AW451693 Hs.220825 ESTs 10.4 

10 428294 AA425468 gb2w46d02.n Soares_tDtaLfetus_Nb2HF8.9w Homo sapie 104 

433160 AW207002 Hs.134342 TASP for teslis-specHic adriaiTTydn sensiSvity prote 104 

408809 AWZ74673 Hs.279706 ESTs. VUbaUy similar to A47582BH»IigrowSi factor p 104 

410174 AA308007 Hs.59461 DKFZP434C245prDt9ln 104 

424792 U92538 Hs.153138 ori()In recognition oompiex,8ubun(t 5 (yeast Iramolog) 103 

IS 422406 AF025441 Hs.116208 Opa4nterac6ng protein 5 10^ 

424687 J05070 Hs.1 51738 matrix metalloprotelnase 9 (gelaSnase B. 92(0 geiati 10^ 

413809 L25851 Hs.B51 Integrin. dpha E (anQgen CD103. human mucosd lymph 10.3 

413507 BE145360 Hs.190084 ESTs, Vltealdy similar to 138022 hypothetical protein ( 10.3 

448119 H385e7 Hs^2295 detficalorofcyto^dneslsl 10.2 

20 457288 AA521458 H5.192738 ESTs 102 

402025 Nh/L021 624:Homo sapiens histamine H4 receptor (HRH4), 10.2 

440572 AWie3778 Hs.249584 ESTs, WeaWyslmBartoMYSA^HUMAN MYOSIN HEAVY CHAIN 10.2 

453323 AF034102 Hs.32951 solule carrier family 29 (mKieosldetmnsportera}.m 10.2 

443780 Nh/LO12068 Hs.9754 acSvaSnglranscripSon factor 5 10.1 

25 422429 AA310527 gb:EST181333Juri(atT-ce!lsV Homo sapiens cDNAS* en 10.1 

444314 AI140497 gbw76b09.s1 SoafBS_fetaUlver_spieeii_1NFLS.81 Homo 10.1 

426125 X87241 Hs.166994 FAT tumor suppressor (Drosophila}homolog 101 

430848 AW021726 gb:dQ7e02.y1 Morton Fetal Cochlea Homo sapiens cDNA 101 

422470 AB017919 Hs.1 17232 pepBdy!arginlnedelmlnase,typeV 101 

30 449501 A1652924 Hs.231942 ESTs 101 

420731 AIJ042052 Hs.104432 ESTs 101 

404345 AA730407 Hs.159156 piotocadherinll 101 

400438 AF185611 Target 101 

438170 AI9166B5 Hs.194501 ESTs 101 

35 432193 AA372264 Hs.273193 hypotheScd protein FU10708 101 

458715 AK000973 Hs.16725 hypothefcdpratdnFUIOIII 10.1 

427766 AA412258 Hs.188817 ESTs 101 

443426 AF098158 H8.9329 chromosome 20 open reacfing frame 1 10.0 

403038 Target Exon 10.0 

40 434674 AAB31879 H3.136985 ESTs 1O0 

439685 AW956781 Hs.293937 ESTs. VV^6iniilartoFXD2JHUMANFORKHEAD BOX PROTE 1O0 

439428 AA835825 Hs.19O490 ESTs 1O0 

403310 Target Exon 09 

408392 U28831 Hs.44566 KIAA1641 protein 09 

45 421849 AW410872 Hs.106894 hypothefIcsiprotoinFlJ20411 09 

433384 AIQ21992 Hs.124244 ESTs 09 

443343 BE4O9809 Hs.301005 purin&4ich element binding protehB 09 

437267 AW511443 Hs.258110 ESTs 9.9 

455878 AI310151 Hs.173624 ESTs 09 

SO 435851 AA700946 H8.191933 ESTs 8.9 

452243 AIJ55715 Hs.28555 programmed cell death 9 09 

441703 AW390054 Hs.192843 leucine zipper protein FKSG1 4 09 

414001 A1610347 Hs.l03812 ESTs. Moderately similar to AUJ1_HUMAN ALU SUBFAMILY 9.9 

438669 AA535975 Hs.174308 Homo sapiens, done IMAGE:3453347,mRNA, partial cds 08 

55 421502 AF111856 Hs.105039 solute carrier family 34 (sodium phosphate), member 2 9.8 

417087 AA193193 Hs.188325 Homo sapiens cONAFU 11 484 lis, done HEMBA1001 835 08 

455855 BE147440 gb:RC1-HT022W18010OO15^HT0229Homo8airfenscDNA 08 

410390 AA876905 Hs.125286 ESTs 08 

416526 BE019020 Hs.85838 solute carrier My 16 (monocarboxyllc add transpo 9.8 

60 442660 AW138174 Hs.130651 ESTs 08 

436186 BE390717 Hs.5074 slmDartoS-pombediml 9.8 

426773 NIA.015S56 H8.172180 KIAA0440 prot^ 9.8 

413476 U25849 Hs.7S393 add phosphatase 1, soluble 9.7 

418347 AA216419 gbnw:16e03.8l NCLCGAPJ^I Homo sapiens cDNA done sl 9.7 

•65 448752 AA593867 H3.300842 KIAA1608 protein 07 

440349 AA884195 Hs.31476 Homo sapiens cDNAFU13872fis, done THYRO1001322 9.7 

431363 M8652B H8.266902 neurotrophin5(neurDlrophin4^ 07 

430752 AA485330 Hs.303278 ESTs 07 

^. 436523 6E612990 Hs.5212 slngle^trandseledivemonofMonalurBdlDNAgly 9.7 

70 415740 N80486 Hs.39911 Homo sapiens mRNA for FU00089pfolBin,parflaiods 9.7 

411930 F06485 Hs.7740 oxystarol binding prottir^ 1 07 

430832 AI073913 Hs.100688 ESTs* Weakly similar to JE03S0 Anterior oraSenU(H 07 

452234 AW084176 Hs.223296 ESTs, Weatdyshtllar to 138022 hypothetical protein ( 06 

409997 AI908055 Hs.57749 synaptic nudelexpressedgene 2; KIAAIOII protein 9.6 

75 434957 AF283775 Hs.35380 x 001 protein 9.6 

407292 AAS76638 gbm45e06.8l NCI.CGAPJ>r12 Homo sapiens cONA done s 9.6 

459109 AW292447 Hs.140321 ESTs 9.6 

457892 AA744389 gbarySlelOsI NCLCGAP.PrlSHomosapbnscONAdones 06 

432074 AA525248 Hs.149723 ESTs 9.6 

80 440463 AI733087 Hs.129994 ESTs 9.6 

420651 AA281062 Hs.29493 hypothetic^ protein FU20142 9.6 

445328 AI220072 HS.16S893 ESTs 9.6 

434K3 BE049102 Hs.121573 ESTs.VV^dniflartoTRHY.HUMANTRICHOHYAU|Hjsap 9.6 
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420351 N92054 Hs,194718 

415853 K06016 Hs.100855 

429599 AAB06106 Hs.123664 

417037 BE083938 Hs.80976 

449317 AW293413 Hs.132906 

436588 AA759233 HsJ285Q6 

409261 BE315042 H$.19210 
401069 

414065 AW515373 Ks.271249 

409902 AI337658 Hs.156351 

432258 AW973078 Hs^3039 

438581 AW977766 H5.292133 
405536 

418216 AA662240 Hs.263099 

434573 AW372340 Hs.159717 

439354 AF0a6174 

455410 AW93667B 
400736 

419474 AW968619 Hs.155B49 
406464 
407881 



AW072003 Hs.40968 
Hs.39421 



427258 AA4O0O91 
404680 

433340 AA129782 Hs.3576 

423642 AW452650 Hs.157148 

457008 AA410446 Hs.112011 

435099 AC004770 Hs.4756 

451846 T65840 Hs.11762 

419988 W39388 Ks.55338 
402967 

455601 AI368880 Hs^16 

441075 AA91S991 Hs.179214 

451107 AA23510B Hb.17639 
404649 

420897 AW139261 Hs.232280 

418867 D31771 Hs.69404 

420298 A)ig9510 Hs^67912 

449893 T97999 Hs.1B214 

420101 AWS00529 H8.95180 

428166 AA423849 Hs.79530 

420022 AA2$62S3 Hs.120817 

444020 R92962 Hs.35052 
454765 AW819629 

41S021 R54409 Hs^01693 

418508 AA084248 Hs.85339 

415009 C7S2S3 H8^20950 

428845 AL157579 Hs.153610 

433348 AA8779g6 HS.12S376 
417881 AIB79117 
446354 AW449550 Hs.202249 
427018 AA397538 Hs.136280 
434410 AA632644 
448072 A]4S9306 Hs^4908 
457322 AI815486 Ks^43901 
424317 AI865032 H3^6017 
4330D1 AF217513 Hsj279905 
. 404112 BE302729 Hs.173162 
433334 A1927208 Hs.231958 
434960 AW374941 Hs.72545 
431658 BE409917 K5.266935 
439158 R60323 Hs.193888 
443081 H86658 Ks.132909 
429432 A1678059 Hs.202676 
452706 AW449390 Hs.257150 
' 437044 AlJ035e&4 H3.69517 
430780 N9S102 Hs.33485B 
426793 X89e87 HS.1723S0 
418379 AA218940 Hs.137516 
431405 AM70895 Hs.252574 
405454 

438362 AAS05678 Ks.12326 
401940 

424834 AKD01432 K5.1S3408 
442432 BE0935Bg Hs.38178 
459086 AA021163 H5.22287 
418653 AI7340&4 Hs.138212 
444152 AI125694 Ks.149305 
437534 AAB14471 Hs^lSOO 
435074 Ar760944 Hs,116937 
405722 H27498 H3.293441 
442B» AW263123 Hs.127554 
431675 AAS99965 Hs^237S 
447164 AP026941 H3.17518 



zinc biQsr protein 265 

ESTs 

ESTs 

snOgen idenSQcd by monodoosl snQbody IMtl 

19A24protsin 

ESTs 

hypothetical protein MGC1 1308 
C11000374*:gi|107647781gb|AAG22817.1|AF3Q2150.1 (AF30 
Homo sapiens cDNA FU13560 fis, dona PIACE1006851 
ESTs 
ESTs 

ESTs. Moderately similar to 178885 sertne/threonlne^s 
NM_005805:Kofno sapiens 26S proteasome^ssodated padi 
AF15q14 proton 
ESTs 

gbiHomo sapiens fuO (ength insert cONA done ZB94A08 
gb:PI\A2-OT002»)S030a004'a04 DT0023 Homo sapiens cDNA 
Target Exon 
ESTs 

C170001689i|7294725|gb|AAF50062.1| (AE003544) CG7547 
heparan stdfate (gHuoosainhe) 3-0«utfoliansferase 1 
ESTs 

TarQet Exon 

Homo sapiffl^s mRNA fuii iength insert cONA done EUROi 

hypottietic^ protein MGC13204 

ESTs. Weaidy similar to unlcnown [H sapiens] 

flap structure^pedlic endonudease 1 

ESTs 

Homo sapiens, done 1i4GC:17421. mRNA. comi^ete cds 
Target Exon 

SRY (sex detennining region Y)-box 2 
ets variant gene 3 

Homo sapiens ubiqutfln protein llgase (UBE3B) mRNA. p 

Target Exon 

ESTs 

msh (prosophUa) homeo box homotog 2 

ESTs. Weakly shnOar to AiaJ7.HUMAN ALU SUBFAMILY SQ S 

ESTs. WediOy similar b B34087 Ivpoiheticd protdn 1 

KIAA0767 protein 

M5>14 protein 

ESTs 

ESTs 

Qb:RCS^T0293-140200-014-H05 ST0293 Homo sapiens cDNA 
Homo sapiens, done IMAGE:3638994, mRNA, partid cds 
G proldiMOtipled receptor 39 
ESTs 

KIAA0751 gene product 

ESTs, Weaidy similar to JC5314 CDC28tolc2-tae kinase 
0ba^4g09.y1 Schneider fetal brain 00004 Homo sapien 
ESTs 

Homo sapiens cDNA: RJ22286 fis, dona HRO04157 
gb:np87b07^1 NCLCGAP.Thyl Homo sapiens gDNA done s 
ESTs 

Homo sapiens cDNA RJ20738 lis, done HEP08257 
ESTs 

doneHQ0310PRO0310p1 
ne^hboror COX4 
nnairU m^allopnrtBhase 28 
ESTs 

tRNA selenocysldhe assodated protein 

ESTs 

ESTs 

synaptonemal complex protein 2 

ESTs, Moderately sln«v to SUR1.HUMAN SURFEIT LOCUS 

cDNA for (fifferentiany expressed C016 gene 

hypothetical protein M6C122S0 

HIR (Nstone cell cyde regulaQon defedM* S. oere 

ftdgelin^ike 1 

dbosomsl protdn LlOa 

C12000541:giI5729884|renNP.006539.1| IGF4 mRNMiin 
ESTs 

Target Exon 

Homo sapiens cONA FU10570 fis. done NT2RP20031 17 

hypothefical protdn FU23468 

ESTs 

ESTs 

hypolheBcal protein MGC2603 

ESTs 

ESTs 

Homo sapiens SKC73 prot^ (SNC7^ mRNA, completB cds 

ESTs 

ESTs 

Homo sapiens dgS mRNA. pailisi sequence 
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420183 Vtf92885 
421133 AA814971 
407605 W03512 
441370 AI242433 
437986 AWB91130 
4263^ AW290981 
448111 AA053486 
408021 AW137133 
429228 AI553633 
433914 AF108138 
431184 AW970116 
42S219 AF2078ai 
439774 AL380257 
432573 AA553612 
450881 W80462 
437835 A1146771 
453204 R10799 
412719 AW016610 
408805 H69912 
428281 AA194554 
422583 AA410506 
448148 NM.016578 
453005 AW0SS30B 
454132 AW131759 
425263 NWL001197 
416209 AA236776 
441525 AW241867 
459539 A1279166 
443148 AI034357 
424255 A1192657 
459435 AA320038 
443117 AI248826 
457434 AW628192 
442S05 AW003775 
430901 AA488833 
439223 AW238299 



417739 Z4399S 

415961 H10983 

424042 Y10601 

451035 AU076765 

447155 AA100605 

412668 AA456195 

458042 AW058464 

456530 A1J049437 

433345 AI6B1545 

445008 W91903 

453922 AF053306 

455161 BE145900 

424308 AW975531 

430413 AW842182 

423494 AW504365 

415018 U49395 
404534 

438451 AI081972 

435176 AA744875 

443245 ARMOSSS 

443162 T49951 

457478 T15803 



434932 
420991 
457854 



402798 
423426 
429568 
404110 
424441 
433155 
414839 
408867 
418Z78 



BE613162 
AW504814 
8E547674 
8E179190 



Hs.143408 

Hs^S7634 

Hs.5479 

Hs.270085 

Hs.38173 

Hs.2112g6 

Hs.20315 

H5^45867 

Hs.337139 

H3.1 12160 

Hs310616 

Hs.155185 

Hs.213493 

Hs.324696 

Hs.270521 

Hs.158008 

Hs.191990 

Hs.129911 

Hs.48269 

Hs.183434 

Hs;i7973 

HS.20S09 

HSJ1803 

Hs.248286 

K3.155419 

Hs.79078 

Hs.12772e 

Hs.211194 
Hs.143897 

Hs.42029 

HS.188S1 

Hs.198248 

Hs.126711 

Hs.250618 

Hs.155919 
H3,137674 
Hs.430 
Hs.121557 
Hs.10056 
Hs.6430 
HS.10Q292 
Hs.152982 
K8.124814 
Hs.36708 

HS.1S4443 
Hs.241392 
H8.24143 

Hs.77807 

Hs^0261 
H3.189413 
K8.151973 
H8^029 
Hs^SB 

Hs^135 
Hs.121004 
Hs^ie9 



AW389579 
AI088691 

X14850 
AL037035 
X63592 
AA157857 



456248 
403152 
407649 
448387 
433671 
425891 
447347 
439079 
458115 
428144 



Hs.128434 
Hs^08414 

Hs.147097 

Ks.100426 

Hs.77462 

HS.18226S 

HS.B3937 

Hs.199890 

Hs^42S 

Hs^7078 

Hs^7427 

Hs.170810 

Hs.132906 

Hs,132141 

Hs.122730 

Hs.38348 



BE269243 Hs.182625 



AW375333 

AL035786 

AA576664 

BE066724 

AIB74402 

AW138797 

AI041717 

AA570056 

AF085937 



ESTs 8.9 

ESTs 8-9 

hypotttt1icdprotelhM6C13272 6>9 

EST8,WeddystntotoAUJ1JHUMANAlUSUBFAMILYJSE &9 

ESTs 8B 

ESTs. Weakly similar to 2109260A B ceO Qrowth factor 8.8 

intetferoivtnduced protein with telratncopeptkle rep 8.8 

ESTs 8.8 

ESTs 8.8 

Homo sapiens DNA heficase homoioo (RFI) mRNA, paiHa 8.8 

ESTs 8.8 

cytosoQc ovarian carcinoma anC^n 1 8.8 

Komosspto mRNA fun tanglh Insert cONA done EUROl a8 

K1AA1594 protein 8.8 

ESTs.HIghly slmOartoALUUIUMAN ALU SUBFAMILY SB S &8 

ESTs 8.8 

ESTs 8.7 

ESTs 8.7 

vaocbda related Mnasel 6.7 

ATPase, H transpoiling. tysosomat (vacuolar proton pu 8.7 

KIAA0874 protein 8.7 

HBVpX associated protein^ 8.7 

E$T8.W^siirilartDrmASP[H.88plens] &7 

ESTs 8.7 

BCL2-^iTteiaclingUlsr(apopbsifrMuclng) &7 

MA02(mitolicaRestd6ficlentyeasllU)RnlogH 8.7 

ESTs 8.7 

gl)»pi24a04j(irK:i.OGAP.LuSHon»88plenscONAck^ &7 

ESTs, Weakly sim09tDALU8JiUMAN ALU SUBFAMILY SXS &7 

dysteriln, limb ginfle muscidar dystrophy 2B {autosom a7 

gb:EST22383 Adipose fissue. whits II Homo sapiens cDN &7 

ESTs 8.5 

hypothetical protein FU10875 a6 

UDP-GakbetaGlcNAc t)e1a 1 .4- gatactosyttransferase, p 8.6 

ESTs, We^ simitar to 138588 reverse transcriptase 8.6 

UL16bindingprotstn2 8.6 

gb:HSC1QB121 nomiailzed infant br^n cONA Homo sapien &6 

ESTs 8.6 

ankyrin-Ske with (ransmembiane domains 1 8.6 

plastin 1 (I isofanii) 8.6 

ESTs. WeaWy similar to DP1_KUMAN POLYPOSIS LOCUS PRO 8.6 

hypothetica!protdnFU14621 a6 

protein with polyglulamine repeat calchim (caQ home 8.6 

Homosaplens mRNA; cDNA DKFZp5e6E1120 (from done DKF a6 

hypothefieal protein FU13117 a6 

ESTs ' 8.6 

budding unlnhBiHed by benzhrddazotes 1 (yeast homdo as 

gb:MKO-HT0208-2212g9>2044)12 HT0208 Homo sapiens cONA 8.5 

nnnichromosonie maintenance deficient (8. cerevislae} as 

8indlbiducib)eoytokineA5(RANTES) as 

Wistott<Aldrlch syndrome pnjtBfai tntBracthg protein 8.5 

purineiigicfeoeptorP2X.iioand^atedtenchanneL6 6.5 

C110O1758*:gil12621132NINP.O7S243.1) MEGF1 (Raitus as 

ESTs a5 

ESTs 8.5 

hypolheBcdl protein FU23511 as 

DKFZP434G032 protein aS 

protein phosphatase3(lbniiei1y2B),catelyt)c8Ubunl as 

Target Exon 8.5 

hypotheGcal prolan M6C3038 aS 

Homosaptens(nRKAtorFU00111proteln«paitiatcds a4 

ESTs. Weaidy similar to 865657 dpha-ICedrenerglc n tA 

gb:RC04fr0613-2103004»2407 HTQ613 Hon» saptens cDNA a4 

Target Exon 8.4 

Homo sapiens EUSC-1 mRNA, pai^ ods a4 

Homo sapiens mRNA; cONA DKFZpS64D0472 (from ctone DKF a4 

NMj020245*'i(omo sapiens hibby super-femlly prot^ (T a4 

H2Ahistone family, member X a4 

Homo sapiens cervlcai cancer melastasis-suppressor1(B 8.4 

DNA (<^osln&6-HnelhynrBRsferase 1 a4 

keratin 19 8.4 

hypotheOca) protein ' 8.4* 

ESTs 8.4 

acCn related protein 2/3 compteXtSUbunit 5 (16 1(0} a4 

v^ciK avian sanx)mavinjsCT10 oncogene hcfnoIog-Ii)ce 8.4 

erythrocyte membrane protein band 4.1 (dnptocytosis a4 

ESTs 8.4 

19A24proldn a4 

ESTs a4 

EST8.ModsratelysbnDarbKIAA1215protein(H.88(& a4 

ESTs a4 

gbjL2-6T0731-24040(M)69^ BT0731 Homo sapiete cOKA 6.4 

VAMP (vesidMSodated membrane proteln)-assodatod 8.4 
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424259 AK001776 Hs.143954 hypQ(he6c8lpnteinFU10914 6^ 

443056 AI457996 Hs.132578 £STs &3 

410391 H17881 Hs.15043 Homo sapiens ch)neFlJB5227PR01357mRNA, complete cds &3 

407989 AW135208 Hs.256092 ESTs 13 

5 410536 N39533 gb7v27d04£l Scares fsta) fiver spleen INFLS Homo sa &3 

452273 AI8706BS Hs.231022 ESTs M 

454297 A)223335 Hsi0651 Janus Kinase 1 (a protein tyrosine kinase) a3 

453718 AL119317 HS.1203GO phospho&paseA2. group VI (eytD50llc,ca(cfainfrMepe 13 

401654 NM.007242:Homo sapiens OEAD«(AsiKGhhAIaAspMs)b 13 

10 432891 AF161483 Hs.279761 HSPC1 34 protein 13 

419923 AWQ8145S Hs.120219 ESTs 12 

433627 ARI78866 Hsi84296 Homo sapiens cDNA:RJ22993fls, done KAT1 191 4 12 

435452 AAB310O4 Hs.124674 ESTs 12 

418583 U90908 H5^41 hypothe8ca!protebifipom clones 23549 end 23762 12 

15 440065 W03476 Hs:!66331 hypotheOcd protein M6C4595 12 

439762 T78968 Hs.i4411 ESTs 12 

447983 AW812726 Hsi82113 ESTs, WeaMyMtar to 138022 hypothetical protein ( 12 

441966 AAS66689 Hs.16131 hypothetical protein ai12876 12 

406182 AA047854 gb2f49g04.r1 Soaresf8tinaN2l)4HRHomosapl^cDNA 12 

20 432180 Y18418 Hs^72822 RuvB (EcoOhomoiogVlikel 12 

436005 BE551650 Hs.158126 Homo sapiens cONAFU13350fis. done OVARC1002143 12 

414962 AF273304 Hs.235376 XPMC2 protein 12 

444476 AR)20038 Hs.11223 isodtrate dehydrogenase 1 (NADP). soiufale 12 

408175 W29089 Hs.19066 hypothetical protein DKFZp66702416 12 

25 413940 Ai633205 Hs.159914 ESTs, Weekly similar to 178885 sefineAhreonine^ped 12 

437277 AA748016 Hs.1 23370 ESTs 12 

431445 AA505135 Hs,44037 ESTs 11 

416927 BE349635 Hs.190284 ESTs 11 

452446 AA086123 H8.297856 ESTs 11 

30 445380 A1222019 Hs.144838 ESTs 11 

421174 AW969058 Hs.291974 ESTs. l\toderateiy8hniiartoA46010X-4ihl(edietinopal 11 

444374 AA009841 Hs.11039 hypothetical protein li4GC2722 11 

417247 N58024 gb:yv63c01.8l Soares fetal Bver spleen INaS Homo sa 11 

438335 Ai498421 Hs.243168 ESTs 11 

35 445235 AI564022 H3.138207 ESTs 11 

422585 Nli4_016186 Hs.1 18620 protein Z-<lependent protease inhitdtor precursor 11 

442522 AI087038 Hs.146592 ESTs, Vfealdy similar toALU7JHUMAN ALU SUBFAMILY SQS 11 

430684 Ai808979 H3^193 ESTs 11 

448442 BE221533 Hs^57858 ESTs 11 

40 441410 AA932669 Hsji33304 ESTs. W^siiniiar to 138022 hypotheOcal protein I 10 

419485 AA489023 Hs.99807 ESTs, Weakly stmilar to Unnamed protolnprodudlH^a 8.0 

449539 W80363 Hs.56446 ESTs 10 

408663 U24683 K3.302083 Immunogtotniiin heavy constant mu 10 

423767 H18263 Hs.132753 F-box only protein 2 8.0 

45 450937 M9131 Ks.26267 ATP-dependant Interferon response protdnl 8.0 

430977 AA490069 Hs.306676 Homo sapiens cDNAFU14302fis. done PLACE2000003 10 

455677 BE056051 K8A867 cystelne^lch, anglooenlc Inducer, 61 10 

436706 AA725808 H5.194809 ESTs 10 

459407 N92114 8b2a22h11.r1 Scares fetal Bver spleen INFLS Homo sa 10 

50 444132 AK000452 Hs.10340 hypotheScsl prolan FU20445 10 

437149 AI686651 K3.202234 ESTs, Weakly similar to ALU4.KUMAN ALU SUBFAMILY SB2 10 

418499 AI627392 Hs.302023 hypothetical protein FKSG25 10 

411298 AW835858 gb:PM04.T0017^)3129WJ01-h07LT0017 Homo sapiens cOhiA 10 

432571 AF151054 Hs.278429 hepatocellular cardnoma-assodated antigen 59 10 

55 416295 AI0S4824 KS.19338S ESTs 10 

427485 AF039652 Hs.178655 ribonudeaseHI 10 

409857 AW501908 gb:UI-HF-BR0H;M>-12-WJI.r1 NIH_MGC.62 Homo sapien 7.9 

433854 AA610649 Hs.333239 ESTs 7.9 

458080 BE142728 gb:MI«-HT0157^299Wd08HT0157 Homo sapiens cDNA 7.9 

60 423573 AA328504 gb:EST31993Emt)iyo,12weekl HomosapienscONAS'en 7.9 

404495 C8001441':gil8923061^f]NP„0601 14.11 hypothetical pr 

443135 Ai376331 Hs.156103 ESTs 75 

448939 BE267795 Hs.22595 hypothefical protein FU1 0637 7.9 

413283 R78669 Hs.23756 hyjwthclical protein similar to swine acytneuraminale 7.9 

65 443987 AW163123 Hs.10071 seven transmembrane protein TM7SF3 7.9 

434197 AAB27223 gb:nq63b043l NCLCGAP_0v6 Homo sapiens cONA done si 7.9 

438882 AW016722 Hs.194976 8H2domainKX)nt£aning phosphatase anchor protein 1 7.9 

434502 AW974915 Hs.116550 ESTs 7.9 

„ 435507 A)143579 Hs.26510 vacuolar protein sof6ng33B (yeast homdog) 7.9 

70 444895 A1201480 Hs.144856 ESTs 7.9 

419320 K98666 Hs.6137 ESTs 7.9 

446269 AW263155 Hs.14559 liypOthetical pfOldn RJ10540 7.9 

425569 AA359597 Ks.301701 Homo sapiens cOnAFU12073fis, done HEMB61002387 7.9 

445209 AW294230 Hs.80988 collagen, type VI, alpha 3 7.9 

75 449193 A1637997 H5.195653 ESTs 7.9 

447397 BE247676 Hs.18442 E-1 enzyme 7.9 

455037 BE144549 gb:MR0>HTO167-O8119M01-e02HT0167 Homo sapiens cDNA 7.9 

453367 AW732847 Ks.70573 PKai-rdated HIT protein 7.8 

439317 AF086127 HS.S0600 ESTs, Wftaldy similar to T471 56 hypotheOcdprotetaD 7.8 

80 424006 AF054815 Hs.137548 C084 anOgen (leulcocyte antigen) 7.8 

406562 NM.004520^itomo sapiens Unesln heavy chain member 2 7.8 

435192 AKD00739 Hs.4835 eu)(aiyD(IctrBnslaltoninIll8lionfactor3,6ubunB8 7.0 

413500 BE144914 gb£lyi34fr0183-1810g94)2»)05HT0163 Homo sapiens cDNA 7.8 
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Hs.190209 



Hs.101850 
Hs.172089 



Hs^7330 
Hs.113999 



43S216 AA3808B7 Hsi085 

418623 AW1947S7 Hs.266804 

447197 R38075 

430146 AW815330 

441841 AA971819 Hs.176083 

457677 AA528890 Hs.158701 

421090 BE301870 Ks.101813 

436481 AA379597 H5^199 

434407 AW815333 

406410 

453579 AI204463 Hs£1857 
427584 BE410^ Hs.179718 
452139 AA0999e9 Hs.16331 
405510 

440777 AAS94020 Hs.126553 
446424 AW134529 Hs544647 
448004 AW451477 Hs^574S6 
430610 AIB21465 Hs.168810 
4Z7080 AWD68287 Hs.173466 
451693 BE220445 H3^79835 
417558 AF045229 H5.82280 
420344 BE463721 Hs.97101 
427735 AA916785 Hs.180610 
425423 NH.005897 Hs.157180 
450663 H43540 Hs.25292 
4325BS AA705591 
402682 
400247 

421116 T19132 
426761 AI015709 
405514 

412406 AW948172 
440226 AA873387 
435625 H50854 
418529 AW005695 Ks.250897 
407758 050916 Hs.38365 
447276 AL049795 Hs.17987 
449938 AW970612 Hs.172635 
422893 X98411 Hs.121555 
451593 AF151879 H5^6706 
424148 BE242274 Hs.1741 
447519 U46258 Hs.339565 
409361 NM_005962 Hs.54416 
436279 AW900372 H8.180793 
426523 SG8616 Hs.170222 
456926 AB018284 
416294 D86980 
409206 AW364844 
417086 AA194446 
418181 U37D12 
436910 AA926944 
401008 

413245 BE244334 
446820 AW295037 Hs.254986 
439279 A1039473 H8.130636 
426116 AA868729 
410098 6E326839 
422326 A111487S 
435513 AW40407S Hs.42785 
421629 N80121 Hd.4963 
434663 AA641972 
452461 N78223 
418811 AK001407 
405417 

414076 AA467738 
435014 BE560898 Hs.10026 
449810 A1242042 Hs.14044 
403397 

436873 N23&74 HS.S0477 
451388 AB02S006 HsJ6334 
404914 

419839 U24577 Hs33304 
432820 Ai554057 Hs.152477 
418978 TB5295 Hs.268606 
446636 AC002563 Hs.15767 
454639 AW811633 
434522 AF189259 H5.283081 
458236 AW297043 Hs.255604 
441043 AA913422 H$.182104 
422838 AA524065 Hs.93670 
455095 AW855718 
442307 AWDZ7690 H8J0037 
425453 AW3742B4 Hs^lS 



Hs.158688 
Hs.79170 

Hs.73451 
Hs^27 



Hs.75249 



H8.144694 
H3.17433 
H5.78592 



H5.13005B 
Hs.108106 
Hs.88663 



doQchylfhosptiateniamtosyllraRsferesepQl^^ 7-8 

ESTs 7-8 

fib:yh88fa01^1S<mplac8ntaNb2HPHorno88ptocONA 7.8 

gbK3VO^02t»)8O10O4)8^ST0215 Homo sapiens cONA 7.8 

ESTs 7.8 

ESTs 7.8 

solute earlier famOy 9 (sodhim/hydrogen exchangsr), 7.8 

KSPC150 protein simSar to ubiqidtiiH»n^aIingenzy 7.8 

gb:QV(WT02154)601004)83^1 ST021 5 Homo sapiens cDNA 7.8 

CS000010*:9i|10440464|dl^BAB15765.1| (AK024475) FUO 7.8 

ESTs 7.8 

^Hnyb avian myetoblastosisvN oncogens homdo^iK 7.7 

Homo sapiens cONA: FU21482 fis. cbna COL05135 7.7 

ENSP00000233779*i1n»lheBc8l68jOkDa protein. 7.7 

ESTs 7.7 

ESTs 7.7 

ESTs 7.7 

EST8.VVBa]dyslfrii8rloAUJ6LHUMANAlU8UBFAMLYSP8 7.7 

r8S-rcAated^t»tuDnurn(oxinsutsti8lB2(rt»famil 7.7 

ESTs 7.7 

regulator of &proteIn signalling 10 7.7 

putative GprotelrHjoupted receptor 7.7 

8pBcingbctorproIiitei^utanfiifldfk:ti(po()fpyfliTMln 7.7 

Intradslemd A parfldOiXQinriBd ^titj/ps^lSdi 7.7 

ritxmudease Hi, large sulninit 7.7 

ESTs 7.7 

Target Exon 7 J 

Eos Controi 7.7 

rs&i(4*bind]ng protein 1, oellutar 7.7 

Homo sapiens mRNA; cONA OKFZp5B6i2022 (from done DKF 7.7 

ENSP00000241075:TRRAP PROTEIN. 7.7 

gb:RC0-MT0013-28O30O^21-faO6 Mr0013 Homo sapiens cDNA 7.7 

ESTs 7.7 

ESTs 7.7 
TRK-fusedgene 7.6 
KlAA0125gena product 7.6 
tiypotheScdl protein M6C1203 7.6 
Konv) sapiens cimaJ21367 fis. done COL03051 7;6 
myosin IF 7.6 
CXBI-121 protein 7.6 
integTin.beta7 7.6 
ESTs 7.6 
s{neocurtsttomeot}ox(Drosoph!1a)iiOfTiolog1 7.6 
ESTs. Wealdy similar to S65657 alpiia-l&adrenerglc re 7.6 
solute earner fan% 9 (sodium/liydrogen exchanger), 7.6 
K1AA0741 gens product 7.6 
KIAA0227 protein 7.6 
gb:QV^T0044-221299O45€03 DT0044 Homo sapiens cONA 7.6 
ESTs, Wedidy similar to S55024 nebulin, s)(eletd muse 7.6 
deavage and potyadenylation specific factor 1,160kD 7.5 
gb:oni68g01.8l NCLCGAPJGC4 Homo sapiens cONA done 3* 7i 
Target Exon 7.5 
AOP-rOxisylaiion factor-Oa 6 interaeling protsin 7.5 
ESTs 7.5 
ESTs 7^ 
ESTs 7.5 
hypoQ»&8t protein FU20967 7.5 
euitaryoactranslaSonlniSalion factor 2B,8UtHinll 7.5 
DC11 protein 7.5 
ESTs 7.4 
ESTs 7.4 
transcrlpfion factor 7.4 
tiypotheticai protein FU10545 7.4 
CXD01 144*:gIf72429731dljIBAA92547.1| (AB037730) WAAI 7.4 
gb:nc74e05.8l Na.CGAP.Pr2 Homo sapiens cDNA done, m 7.4 
mitochondrial Tit)06omal protein L17 7.4 
ESTs 7.4 
Target Exon 7.4 
RA827A.meml)erRAS oncogene family 7.4 
spastic paraplegia 4 (autosomal dorr^ant; spasQn) 7.4 
NH.004046':Komo sapiens ATP synOiase, H^ transpor&ig 7.4 
phosphoHpase A2, group VH (plalaleMivating ladf 7.4 
ESTs 7.4 
ESTs 7.4 
dtron (rt«>4nteractlng, serine/threonine kinase 21) 7.4 
gb:RC2-ST015M)9109&«11-d05 ST0158 Homo sapiens cONA 7.4 
gammaamlnobu^ add (GABA) reoeplDr, theta 7.4 
ESTs. similar to A47234homeoi»xpiDtB!nH6(H 7.4 
ESTs 7.4 
Homo sapiens cDNA:FU22664 fis. dons HS108202 7.3 
gb:RC1-Cr027»<m)10&O21^ CT0279 Homo sapiens cONA 7.3 

ESTs 7.3 

Homosapnens chfomosome i% ca&iuU R26894 7.3 
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Hs,3402 
Hs.176992 
H3.41639 
HS.1731G2 

Hs.207080 



HS.105G97 
Hs^7128 
Hs^44746 
Hs^D1441 
Hs.188633 
Hs.175936 
HS.1709B0 
Hs.133494 



Hs.304447 
HS.7B185 
Hs.334473 
Hs.14912 



HS.8S279 
H5,77597 
Hs.29643 



455327 AW896238 HS.3348Q6 
420982 AW576160 Hs.100729 
424563 AA446932 H5.1S1428 
417125 AW181998 Hs.B1248 
453902 6E502341 
448842 A1343510 
454128 AL031259 
427011 BE302729 
450872 AI742594 
451512 AI800236 
405708 A1262759 
432576 AW157424 Hs.165954 
459304 AWD05809 K5.281076 
401375 

413258 BE075114 
408016 

421508 BE302798 
422742 AA316117 
440031 BE045970 
429389 AA454779 
449656 AA002008 
444310 A1140432 
459274 AA382S90 
425404 BE048060 
431150 T63857 
443217 NM.001545 Hs.9078 
413405 AW022253 Hs.215976 
447553 BE327277 Hs.161145 
414704 NM.014757 Hs.76986 
424048 AF0Z7866 Hs.138202 
409188 AW363284 Hs^2553 
453493 AU)39478 
456111 AK000150 
400297 AI127076 
446384 AB006624 
432216 AU076609 H&.2934 
436943 AA773838 H9.5353 
445336 AW816038 Hs.161251 
418469 U34879 
414907 X90725 
429065 AI753247 
424568 AF005418 Hs.150595 
416450 AA180467 
449714 AB033015 
455447 AW947507 
437154 AI023133 
423059 AW378445 Hs.123080 
419092 J05561 Hs.89603 
426736 AA431615 
417748 Z43011 
434748 A)862604 
436929 AW195515 H3^53177 
452061 A!074259 Hs.469 
446416 AV658299 
415023 AA932146 
434766 AA742222 
432566 AW439330 Hs^56889 
420252 AW270404 Ks.193161 
435403 AA779987 Hs.269658 
430151 AW968203 
427908 AA417272 Hs^4122 
417758 U27699 Hs^2S35 



AW975090 

AI472213 

AW958751 

AI825204 

N24231 

BE275607 

AW407504 

BE2809S1 

NKt005171 

A1037915 

AL109729 

R07478 

AI883770 

RB4810 

AA311426 

AA57e526 

AI651558 

H91200 

BED46609 

AA649242 



Hs.23941 
Hs.10739 



Hs.130722 
Hs^1169 
Hs^11884 



Hs.163959 
Hs.133494 
H8.120634 



412647 
437234 
453366 



447383 
423864 
450799 



453945 
425196 
439778 
417662 
438087 
452724 
448833 
433154 
440094 
409253 
431270 
407629 



Homo sapiens cONAFU14604fis, dona KT2RP1000363^m 7.3 

KlAAD6g2 protein 7.3 

ret finger prateln 2 7.3 

CUG triplet repeal RNA^ng protein 1 7.3 

ESTs 7.3 

ESTs 7.3 

progrannmed cell death 2 7.3 

n^liborofCOX4 7.3 

gbwg55hOSj(1 SoarBS.NSFJ^_9W_0fT.PA^P_S1 Homosa|rien 7.3 

ESTs 7.3 

gb:qt84a01 jcl NCLCGAP„Co14 Homo sapiens cONA done I 7.3 

ESTs. We^sbnBar to 138022 hypotheOcd protein I 7.3 

EST8. Wbakty similar to CHD4J1UMAN CHROMOOOMAIN HEUC 7.3 

NMU)209g9*:Homo8apiens neufQgenin3(NEUR063).fnRNA. 7.3 

Qb:PM1^TO585-11020OO0^11 BT05B5 Homo sapiens cDNA 7.3 

Target Exon 7.3 

thymidine kinase 1,solul))e 7.3 

ESTs 7.3 

ESTs 7.3 

KomosapienscDNA RJ11079 fls. done PIACE1005111 7.3 

ESTs 7.3 

ESTs 7.3 

K1AA0948 protein 7.3 

Homo sapiens done TCCCtA00164niRNA sequence 7.3 

gb7c16e01.6l Stralagene hing (937210) Homo sapiens c 7.3 

iminatuiB colon cardroma transcript 1 7i2 

ESTs 7.2 

ESTs 7.2 

mastermind (DrosophBa), homolog of 7.2 

serine (or cystatne) proteinase \rhMor, dade B (o 7.2 

ESTs 7.2 

ESTs, Weddy Mar to ALULHUI^^N ALU SUBFAMILY J SE 7.2 

MAX-4ikebHLHZIPprotdn 7.2 

hypotheiicai protein DKFZp56401 276 72 

tOAAI^pratdn 7.2 

ifbcnudeotlde reductase Ml polypepOde 7.2 

caspase 10. apoptosls-celatad cysteine protease 7.2 

ESTs 7.2 

hydroxystBr^d{17-t}eta) dehydrogenase 1 7.2 

polo (DrosophlaHite Idnase 7.2 

Homo sapiens cDNA aJ13103 fis, done Nr2RP3002304 7.2 

cytodvome P450, sutrfanilly XXVIA. pdypeplide 1 7.2 

gt):zp14g0d.s1 Stralagene fetal retina 937202 Homo sap 7.2 

KIAA1189 protein 7.2 

gb:RO(Myrr0002-14030»)1l4l2MT0002HQmosaplenscDNA 7.2 

ESTs 7.2 

Homo sapiens unknown protdnmRNA, partial ods 7.2 

mudn 1, tiansnieinbrane 7.2 

ESTs 72 

ESTs 7.2 

ESTs, Weakly similar to ALULHUMAM AUU SUBFAMILY J SE 7.2 

ESTs 7,2 

succinate dehydrogenase complex, sutwnil A. Ilavopiot 7.1 

ESTs 7.1 

Homo sapiens done TGCCtA00164mRNA8equenoe 7.1 

ESTs 7.1 

ESTs. Weakly dmHar to 2109260A B odi growth fedor 7.1 

ESTs 7.1 

ESTs 7.1 

gb:EST380398 MAGE resequences, MAGJ Homo sapiens cDNA 7.1 

ESTs 7,1 

solutB earner family 6 (neurotransn^tter transporter 7.1 

Eos Control 7.1 

gb£ST387196 MAGE resequences, MAGN Homo sapiens cONA 7,1 

Hs.247711 hypothetical protein FLJ20557 7.1 

Hs.28921 zinc finger protein 7.1 

H5.211408 ESTs 7.1 

gb:yx22a11/1SoarosmeIanocyte2NbHMHonio8apienscD 7.1 

Hs.1708 diaperonin containing TCP1, sUbudt 3 (gamma) 7.1 

gb:UI-HF^M0^k-g-12-OAn Jl NIHjyiGCJB Homo sapiens 7.1 

Hs.55058 Elfdomain containing 4 7.1 

HS.3G908 acGvaOng transcription factor 1 7.1 

Hs.155097 carbonic anhydrasen 7.0 

Hs.99364 putaSve transmembrane proldn 7.0 

Hs.268845 ESTs 7.0 

Hs.190422 ESTs 7.0 

Hs.30464 cydInE2 7.0 

Hs.21635 tubuOn. gamma 1 7.0 

H5.160994 ESTs 7.0 

Hs,270372 ESTs. WeaWy similar to ALU1_HUMAN ALU SUBFAMILY J a 7.0 

Hs.52002 C05 antigen^ (scavenger receptor cysteine rich fa 7.0 

^:hn41e1U1 NCLCGAPJ«)F2 Homo sapiens d)NA done 3 7.0 

Hs.62632 ESTs 7.0 
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40829S 


AL117452 


Hs,44155 


DKFZP585G1517 protein 


445439 


BE243084 


Hs.12719 


regutalor of nonsense transcripts 1 


4Z7t08 


AA398193 


Hs.97584 


ESTs 


408623 


AW811978 


Ks.254037 


ESTs 


426561 


AA381437 




gb:EST94514 AcQvated T-ceOs 1 Homo sapiens cDNA 5* 


m\S2 


AA555217 


Hs.183684 


eulcaiyoQc trsnslaSon iriSaSon f3ctQr 4 satitms^ 2 


428894 


AA437088 


Hs.271736 


ESTs 


419102 


AA234096 


H3.42424 


COlS« WBoiay SimUar id tUWOSoM WHVUWSWiiiai (HWisUI 


429067 


AA446019 


Hs.104967 


ESTs 


422684 


BE561617 


Hs.1 19192 


H2A histone (amOy. member Z 


424701 


NM.005923 H3.151988 


m8D98i>>8ciivaled piotein kinase kinase Idnase 5 


412513 


AA322599 


Hs^t63 


ESTs, Weakly sbnDar to AF151840 1 CGM2 proNn (H. 


443599 


A1079559 


KS.13412S 


ESTs 


400715 






ENSPOOOOQ23708r:KIAA1217 PROTEIN (FRAGMENT). 


446514 


AW449233 


Hs.150847 


ESTs 


413992 


W26276 


Hs.136075 


RNA.U2 small nudear 


402442 






Target Exon 


419497 


NML006410 H&90753 


TaMnteractlng protdn (3QM)) 


439575 


W79259 




Ob2d75c06/1 8oare5.(BtaLheaLNbHH1SW Homo sapiens 


407027 


U83312 




gbMjman cosmU ai2NC01-242E1, ETV6 gene, exons IB fi 



7.0 
7,0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 
7.0 



Td)le1SB 

Pfcey: Unique Eos probeset Mentifier number 
CAT number. Gene duster number 
Accession: Genbank aocesskm numbers 



Pkey 

408182 
409113 



409206 
409857 
410146 
410536 
411298 



CAT Number Accessions 



412647 
413256 
413314 
413500 
414076 
416450 
417247 
417739 
417881 
418347 
422429 
423573 
426561 
428294 
430148 
430151 
430709 
430848 
431150 
431270 
432363 
434197 

434407 
434410 
436910 
437056 
439354 
439575 
444314 
447197 
447363 
450799 
450872 
454355 
454639 
454765 



104479J 
110079.2 



1108151J 
1156298.1 
1 178974J 
1207322^1 
1237955.1 



412406 1293055.1 



1317604J 

1355998.1 

1360034.1 

1373933.1 

14149QL1 

159551.1 

1660859.1 

1696198.1 

170544.1 

174149.1 

216469.1 

229714.1 

269158.1 

289365.1 

313552.1 

313868.1 

322338.1 

324621.1 

328626l1 

330676.1 

345469J 

381655.1 

385744.1 

385798.1 

42918^1 

43226U 

47146J 

47400 1 

600667 1 

711623L1 

71990.1 

847242.1 

849959.1 

1130264.1 

122n28J 

1233905.1 



AA047854 AA057506 AA053B41 

AA074897 AA1 13914 AA064871 AA079329 AA071309 AA084710 AA129030 AA075042 AA074794 AA071453 AA07B803 AA148628 AA122204 

AA074159 AA126185 AA079117 AA127089AA070912AA079280AA131372 AA078833 AA071087 AA076131 AA071047 AA079401 AA083070 

AA102076 AA1 15163 AA074198 AA134725 AA1 13889 AA121 103 AA075041 AA065148 AA071310 AA101144 AA079659 AA078931 AA079209 

AA070928 AA068994 AA059817 AA0761 87 AA069053 AA131489 AA071308 AA063317 AA0701 56 AA071430 AA076056 AA075684 AA070053 

AA126283 AA126078 AA075895 AA079208 AA074583 AA071086 AA079623 AA070627 AA078802 AA076622 AA065051 AA079143 AAD71 1 10 

AA079434 AA148748 AA079230 AA0851 88 AA074485 AA070680 AA076151 AA083166 AA0851 18 AA079450 AA085044 AA120938 AA079200 

AA100188AA081472AA122355AA12g031AA085362AA069220AA070940AA07S868AA074563AA084027AA11^ 

AW384844 AW364847 AW937534 AW937593 AW937659 

AW50190BAWS02959AWS02540 

AW592655 R05927R06916 

N39533 AW753094 AW753093 

AW835858AW835836AW835823AW835834 AW835831 AW835832 AWB35843AW835B16AW835833AW835815AW835649AW835835 

AWB35848 AW835851 AW835852AW835862 AW835855AW83S825AW83S847 AW835838 

AVV94B172AVV948178AVV948ie9AVV948176AVV948191AVV946192AW948186A^^ 

AW948177 AW948171 AW948183AW948173 

AW975090 N44182 

BE075114 6E075283 BE075118 

6E081585 BE081717 BE081863 BE081794 BE081659 

BE144914BE394989 

AA467736AA135210AW968166AA467804 
AA180467 AA449184 AA464831 AAS05048 
N5B024 T58194 T1 1693 N64222T05848 
Z43895R12357 R34740 

AI879117 AW161351 Z45755 8E003661 AA206949 AM76541 

AA216419F03238AA229517 

AA31 0527 AWg62295 Z44865 H06641 

AA328504 AA327783 AW962370 

AA381437 AA628833 AW407275 

AA425488AA496895 F23221 

AW815330 AW968170 Ar732687 AI732725 AA468343 AA467817 AW063961 

AW968203Ar7327S7AA470353AA46802SAA468479A1734151 

R34356AW969880AA484613 

AW021726 AA487752 AA488085 

T63857 AW971220 AA493469 T63899 

BE046609 BE046118 AAS01S04 

AAS34489AW970240AW970323 

AA6Z7223 AA643443 AA650619 AA643463 AA643463 AA643439 AA643438 AVV802964 AVV821S95^^^ 
AW827513 

AW81 5333 AWB1 5409 AA632563 

AA632644AA635376AA664188 

AA926944AA767974AA737237 

AI147061 AA743380 AA76S223 AW976398 AI8039Z7 

AF088174 W31798 W04694 

W79259AF086396 W73927 

A1140497 AW749625 AW749626 AW749644 

R38075AI366548R36167 

N24231BE617964N36313 

AW407504 W31274 AI738877 

AI742594 At761397 R31 198 Ai819332 R31257 

AW812535AW812538AW390307 

AW81 1633 AWB11652 AW811898 

AW819829AWB54320 
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455161 
^ 455410 

455447 
455855 
455994 
457892 
10 458080 
458115 
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1249783J BE144549AW851677AW851643AWB51711AW851719 
1253078.1 AW855718AW855740AW355748 

1256167.1 BE145800 AW859^8 BE145895 BE145831 BE145914 BE145820 6E145817 BE14S690 6E145908 
1288380.1 AVV936578AVV336$37AW938682AVV936685AVV936817AVV936811AVV936762AVV9366 

AW936823 AW936821 AW936732AW936730 AW938781 
1292444 J AW347S07 AW947509 AW947791 BE008335 
137S634.1 BE147440 BE147708 BE147563 BE147456 
1398737J 6E179190 BE179206 BE179182 BE179185 BE179188 BE179194 
432926.1 AA74438gAA744270AA744284AA744299AA745380AA744337AA846905AA847688 
471050.1 BE142728AA834047AW937124 

477Q5J9 BE091587 BE0917308E091577 BE091655 BE091729 6E091640 BE091578 6E091727 6E091803 BE091660 6E091721 BE0882S5 BB)765B2 

AWg92312 BE008791 BE0B2385 BE0B3S04 BB)83466 AW997B67 AW997991 BE166595 AW843686 AWB44334 BE079091 AW603391 8E081427 
BEOTSSM BE184580 flE009g62 BE008722 AW579912 AW860561 AWB901M AW795276 AWBS0410 AW86041 1 AW610330 AWB60564 AW860578 
AW862S19 BE073924 BE0086B7 BE073857 BE073921 AW2741Q6 BE01 1050 AW268120 AI335067 AW793748 AW997736 BE080117 AW867987 
AI547161 AW844767 AW393596 AW5794M BE083334 AIS4715B AW799863 AA585179 AW992792 AW88221 5 BE01191 3 AW997894 AI647159 ' 
AW992772 AW58117B AA092247 AW843916 BE079190 AW87847B BE083648 BE066454 AI469937 AW393594 AW579899 AW939276 BE173265 
AW876631 AW878638 AW992602 BE079913 AA633638AW389008BE076590AW843456 AW992791 BE173247 AW843921 AW843333AW878334 
BE090235 6E076240 BE056325 AW603276 BE169310 AWB17299 BE091641 BE000160 AW898164 AW994624 AWg99391 AW862797 AyV899436 
AW750667 AW939685 AW940017 AI200402 BE167391 AW939318 AW799837 AW939953 AW939681 BE078188 AA449059 BE076S95 AW297451 
6E076544 AW603372 6E081223 AW939237 6E084239 All 74202 BE077804 6E078028 AA512912 At124808 AI147524 At084882 AWg39962 
AW939254 BE171687 AW998400 AW99B348 AW998353 AW998288 AW998303 AW998302 AW998299 AA502748 BE077882 AW998295 BE090238 
AW504665 AI423051 BE093093 AW579913 AW508384 BE008143 BE170415 AW998054 BE084608 AW992779 BE08B1 11 AW610555 AW844153 
AW939423 BE0B5404 AW579905 BE080994 AW468482 AW876865 6E091581 BE080940 AI811 189 AW66B088 AW833127 BE080Q64 BE1 84254 
AW998350 AW884228 AWg92315 AWg92384 6E091S69 AW750880 BE066386 AA578227 BEQ9173S AW93BB30 BB076710 BE087253 BE084182 
AW800859AW801017AW5B1371 BE0a8300AW995341 BE090233AWB637B6BE091739BE080113AW578162AW799799AW992366AW994673 
BE185170 AW792778 AW663225 BE075590BE080111 AA582934 BE090227 AI475441 BE085684 BE090223 AW581366 BE010705 AW898740 
BE08817DAW992375 6E077833 BE083557 6E010688 AW998450AW803434 BE083280 AW892655 AA506866 BE088288 6E005859 BE1738S6 
BE001319 AA610814 BE01ig85 BE005855 6E0OS869 AAg73929 BE1B5729 AW8842S8 6E185743 BE001342 BE005876 AI002988 AW799056 
6E065411 AWB41264 AW603110 BE006134 BE006139 BE006148 6E006147 6E006155 AA578273 BE008706 8E185440 AW946428 AA501940 
AW6031 14 BE085757 AI460195 AA491 145 AA772914 AA632730 AA508388 BE080196 BE1B5442 BE093446 AW946433 BE0801 19 BE001352 
AW839003 BE006145 BE085405 BE0088W BE081428 AW581373 AW607246 BE094328 BE001335 AW868170 BE074119 AW884149 BE091734 
BE008744 At540867 BE185808 BE080193BE18585B AA476398 BE081040 BE074724 BE085426BE074725 AWg98297 AW867606 BE185798 
AW898734 BE076369 BE081672 BE088178 AA610284 BE0881 16 AA2B4217 AW57B085 BE074S1B BE001359 BE001328 AW820227 AWB68196 
AWB68190AW904548BE008526BE012037BE079061 BE005870AWB67804AW878433BE008751 68)05875 6E008748 BE093440 6E1830S0 
AA5066766E001329BE008803BE080123BE008041 AW99468BAW994675AW994760AW994691 AW994681 BE080189BE080112AW868173 
Af768000 AW8a3094 AW868179 BE080201 AW665449BE067473 BE008746 BE184053 BE076437 BE076376 BE076402 AA480395 flE082436 
BE184134 BE185224 BEQ85428 6E00B682 AW858181 AW998358 AW866102 BE083507 BE077974 BE008835 8E0g3439 BE076108 AI416987 
BE008768 BE069909 BE093441 BE16S502 8E1 83053 AW7S0669 6E011812 6E008872 BE0ai684 BE093445 AW866184 6E081839 BE008797 
AW842067 BE008678 BE008670 BE551820 AWB38974 BE081637 BE0466g6 BE006673 BE010328 BE0832SO BE089614 BE082052 BE081424 
BE001678 AW581368 AA503194 AW883721 AW883522 BE0B5564 AW868717 BE171078 BE078249 BE078194 AA565255 BE083486 AW842081 
AW842080 AW868204 BE008717 AA484369 AI831719 AW997365 BE079327 AA503956 BE091999 AW793852 BE080251 BE078086 BE092515 
BE170384 AW866193 AW08795d BE008042 BE008761 6E0816B1 BE081671 AW867400 BE082003 BE082253 8E081439 BE081466 BE081692 
AW606020AAS01778 AW996417 BE045756 BE088394 AA491068 AW893099AW578$95 BE1504406E006150 6E084684 AW992795 BE086891 
AW866792 AW753605 BE082045 BE081106 BE008373 6E0753g9 AW998628 AW578707 BE084309 AW753604 BE185916 AW842220 BE185222 
6E006152 BE008795 AV1CT706 BE080256 6E183984 AI934532 AA449648 AW5768g9 6E150S14 AW863580 AA493S68 6EQ85748 AW753601 
BE150562 AVy882677 BE091797 AW899123 BE081679 BE080121 AW806787 AW603410 6B)01317 AWg0S799 BE150513 BE092206 AW996343 
BE086922 88)08806 AW844759 AW6Q6009 6E150487 AW750728 BE150491 BE150515 AW606010 BE150508 BEG08718 AW578702 BE150509 
AA436751 AW883918 BE1838B3 AW753807 BE008669 BE150446 AA533458 BE079219 AW838884 AW063806 AW063837 AW87B479 6E078815 
6E008802 AW992789 BE007925 AW802204 BEOl 1825 BE092130 BE184059 BE079087 BE1S0568 BE185497 BE078808 AWB83761 AWB42295 
BE161S23 AA484796 AA480390 AW994667 6E073205AW607316 BE083201 AW802265AW57B700BE078715AW860403AW897455AW9g6558 
AW890602 AW860413 AM2S412 BE0083&4 BE150438 AWe02606 A143S236 AA574285 AI823745 AAS01773 A1002987 AW832749BE185491 
AW9g6489 BE001442 AW94842S BE001588 A)524864 6E085556 AW887549 AW604038 BE079832 AI752160 AW999398 AW883904 AW882376 
AW467098 BE0801 16 AW883984 AW883995 AA424095 BE074091 AW996348 AW860625 AW860633 AW946513 BE083485 AW860412 AW602207 
6E075407AVV838972AVV807023 AW602201 AW799772AVV882452AW862451 BE50S041 6E16^ 

AW860&32 AW862457 AWg98019 AW860405 BE092062 AW603921 BE183386 AW868194 BE07S664 BED78184 A1S41202 AI2D4g49-eB)92451 
AVV603111AM84587AA484402AVV998675AVV896064BE069923AVV867965BE089919BE092089AA^ 

AA506738 BE080080 AW749523 AA493134 AW370137 AA491Ba AA504425 AW605473 BE092456 6E010682 AW502608 BE079093 AA48491 1 
BEOl 0942 AI205087 AW794933 BE081848 BEOl 1792 AW799897 BE174618 AW838848 AW821741 AW842724 BE008764 BE1839B2 AA501765 
BE092513BE183342AW799806AA442935BE092268AA580022AyV843219BE09330d6E092275BE087l11BE1833^^ 
BE183895 BE185278 BE082343 AW946219 BE079199 8E092272 AA588687 AW948109 AW946175 AW946184 6E008385 BE078172 BE085673 
BE076240 8E083194 6E010804 BE079198 AW878638 AW799803 6E001348 6&077883 6E081635 AW992309 6E081012 BG078106 AW881899 
BE008407 BE008410 AW842670 AW803738 BE088661 AA484571 AI799184 BE174545 BE001405 AA436967 AW995884 AW995785 AW896598 
AW883999 6E076967 AA503938 BE092281 BE092279 BE087460 86087669 BE081542 AI375386 AW843886 BE080115 BE171517 BE079898 
AW882382 BE001450 BE076430 AW579377 BE008412 BE008790 BE182296 BE182297 BE078805 AW899132 6E078810 BE185867 BE087790 
AA484928 AW57898S BE00B400 ^074080 AW605101 BE076110 AW79g904 AI2K094 BE008370 BE182345 BE182373 BE008401 AA984441 
6E182362 BE1B2372 BE008414 BE078186 BE009165BE010266 BE009162 BE009167 BE011D08 BE073335 BE1 82370 AW750556 BE182347 
BEOl 1000 AA484676 BE092982 BE183897 BE09S73 AA573037 AW882317 BE081832 AA478471 AA551613 BE182356 AW83B888 AW026827 
BED08413 AW896605 AA503558 AA776622 BE084825 AA502971 8E081842 8E010628 AW80221d AI688924 AW8S7986 AW881 775 BE079220 
At241060 AW802041 AW802005 BEOl 1244 BE087051 AA984758 AA452997 AW992768 AW797500 6E077829 BE008402 AW881760 6E093516 
AW802084 AW369007 6E185123 BE087775 AW801018 6E093443 AW867978 AW643271 BEt73850 AWg97859 BE0l062a AW992516 AW843908 
BE083200 BE164675 BE074340 AW880289 BEG75433 6E008456 AW946438 BE066S70 BE093547 AA508107 AW867992 BE076239 BE1 83881 
AW817422 BE087717 AW899147 BE01060B AW992295 AA438737 6E075412 BB)9301 1 AW581656 BE089529 BE008756 BE150494 AW903020 
AW883102 BE076370 BE000625 BE165095 AW8S7979 BE182159 AW577601 AW577488 AW577491 BE010637 BE06g910 BE093295 BE005243 
AI6207B3 AW992650 AW890590 AW577498 A\fle77504 AW842725 AW842666 AW864691 AW997722 AWB42662 BE009233 8E087809 BE083196 
AA287768 AW939B91 AW815631 BE001453AW841903BE077613AW577500BE081479AW992558 BE01106S AWB431B7 AW867990AW898296 
BE074339 AA501697 AW749997 BE075249 AW867991 BE085718 AW994607 BE010878 BE075438 AW386825 AA484467 BE081144 AW577492 
AW997932 AW899089 AW842706 AW890727 AW843175 BE075428 AW843155 AW842679 AW842708 BE069915 AW842721 AW438792 AI2S1478 
BE069911 BE067054 BE079889 BE075453 BE069927 AA49192D BE170606 BE1B2305 BE080052 AW8434Q6 6E01107S BE075969 AA525261 
AW391518 88)79202 AA658195 6E076138 AW799901 AA493859 AW992510 BE011810 AA508724 BB)75488 BE075661 AA2S8982 BE078726 
BE069914AA213698BE075376AA484600AW580999BE077872AA503571AVV8B4724AW880124AVV^ 

AW992S45 BQ075806 AW994608 BE350368 AA557838 BE077682 AW844660 AW883431 BE085872 AW838887 AW843890 AW888404 AA578417 
6E074115AVU842fi80AW277193AVV890728AVIffi05111BE093940AW890710BBD85560AVV6^ 

AVlf842G88 AW868310 BE011Q71 66075429 AW8431$2 AW905848 BE07S397 AW842/62 BE07S402 BE0779S0 AW837B10 BE079988 BE183965 
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BE07S431 AW81S917 AWg98359 AW799883 AWS03782 AA5S74B0 AWB41444 BE07591S AA548034 AW843393 AW391559 BE083265 AW939721 
AW800857AW07«09AVV3M90tAI435993AA985526AW799848BEl82453AAmil1AVV^ 

BE182443 BH)10296 AW577808 BE008415 BE184036 BE076597 AI817413 AW7B5053 AW896751 AW841433 BE182458 AW603796 AWB42676 
BE085455 AW884879 BE075414 AW838838 AW8782ra AW998088 AW799778 AWB99125 BE082247 AA774B70 BE001 401 BE001485 AWB17297 
AW796S70 AW3940S3 BE001336 AW394070 AW603797 BE182447 AW582483 AWB43283 AW749520 AWBe7449 AW899274 AW578232 
AWG03765 AW843919 AW578235 BE184139 AW997742 BE183923 BE084210 AW802033 AW748724 AW33901 8 AW9974S9 AW842742 AA213697 
6E182308 BE01107B AW607702 AWB82623 BE080016 AW580994 BE076531 AA443462 AWG07407 AW883382 AW939399 AW605627 AW844615 
AW939724 AWB1 5931 AW883765 AA287421 BE075628 AW946171 AW841445 AW797994 AW81 5957 AA583300 AW369004 BE075368 BE081560 
AW605626 AW339398AA507280AA5()6317AVV841230 AW992519 AA465332 AA425246 BE090234 BE090236AM83259 AA451^1 AA535566 
AAS06406 AAB88571 AASOSSBS AA507130 AAS32944 AAS0t872 BE168634 AA492022 AAS07662 AWS42286 AM94226 AA77603d AA442419 
AW579900 BE171816 AAS63065 AM91916 AA4474gO AA461423 AA434543 AA243279 AW997466 AW603740 BE000295 AAfi56S71 



TABLE 15C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 (fieitnumbeiBtathUcohimn am Genbanklden&fier(GQ numbers. 1)unhamL el aL'refera to the putilfcdion entitled HIieONA 

sequence of human chromosome 22.' Dunham I. etaL, Nature (1999) 402:489495. 
Strand: Indicates DNA strand from vvhtohexons were pred^ 
NLposiBon: Indicates nucleotide positions Of predicted exons. 



Pkey 


Ref 


Strand 


NtposiOon 


400715 


8118885 


Minus 


80 151-80297 


400738 




Plus 




401008 


8117391 


Minus 


81421-d1S51.82364-82512.82862-82938 


401069 


33Z7852 


Mhttis 


4S6B2-45831 


401375 


7417809 


Minus 


6121-6766 


401405 


7768126 


Mnus 


69276-69452,69548-69958 


401539 


8072433 


Minus 


6202W2608 


401557 


8099866 


Minus 


112785-112924 


401654 


9097132 


Minus 


6469&«4797 


401940 


3738108 


Plus 


153460-153592 


402025 


7547159 


Rus 


173835-173998 


402442 


9796503 


Phis 


141714-141842.142010-142122 


402682 


8138477 


Minus 


147522-147795 


402798 


3646083 


Minus 


61284265.641&6689 


402967 


5360987 


Minus 


33518-34546 


403038 


7717439 


Mnus 


290021-290284 


403055 


8748904 


Minus 


109532-110225 


403310 


8139938 


Minus 


183883-184026 


403397 


9438368 


Minus 


64481-84655 


403839 


4176355 


Pkis 


21201-22223 


404110 


9212839 


Minus 


18344-18510 


404495 


8151634 


Minus 


59449^0477 


404534 


8247909 


Minus 


147853-148086 


404630 


9796565 


Plus 


74495-74715 


404649 


9798926 


Minus 


100027-100399 


404660 


9797204 


Minus 


15981O-15g979.160213-ie0321J61023-161304.162B62-16314OJ64490-164644,166404-166530.16693& 


404914 


7341760 


Plus 


92603-92827 


405417 


4753290 


Minus 


50704^1499 


405454 


7656675 


PllB 


133807-134053 


405510 


7630909 


Minus 


101028-101174 


4(S514 


9454624 


Phis 


35953^51 


406538 


9795661 


Plus 


1&4091-164162,164397*164516.16672Q-166790,16778S-167935 


406016 


8272661 


Phjs 


41341-41940 


406410 


9256394 


Minus 


115806-116104 


406464 


9789674 


Pius 


72161-72562 


406562 


7711584 


Phis 


3731&37426 



TABLE 16A: 200 GENES DOWM-REGUIATED IN CERVICAL CANCER COMPARED TO NORMAL ADULT CERViX 

Tdile 1 6A shows 200 genes dowwegulated In cenrfcal cancer compared 
were swHehed. the median value amongst oormst cervices was 
normal cervix). 

Pkey: Unique Eos probesetUentitler number 

ExAccn: Exemplar Accession number, Genbank accession nundier 

IWgeneiD: Unlgene number 

Unigene Tltie: Unigene gene tifle 

R1; Rafio of cen/ical cancer to nomdcenrbc 



Pkey ExAccn UnigenelD Unigene TiOe R1 

453596 AA441638 Ks.62905 hypotheGcal pnddn FU14834 18.1 

443912 R37257 Hs.184780 ESTs 16^ 

420923 AF097021 Hs.273321 diflefentiaBy expressed hhematopoieOcDneages 13.6 

414422 • AA147224 Ks.337232 HomeoboxA13 13-1 

420058 AK001423 Hs34694 Homo sapiens cDNA FU10561 fia, dona NT2RP2002672 12.9 

412639 AW961284 Hs.296235 ESTs 12.4 
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41B994 AA296520 H3.89546 sdectin E (endoOieOBl adhesion niolec^ ^^4 

407938 AA905097 Hs.85050 phosphobmban 11.3 

410544 AI446543 Hs.95511 ESTs 11.3 

413802 AW9S4490 Hs.32241 ESTs. Wed(lysfaii8artoS65657aIpha-1&adien8ro!crece 11.1 

5 42^90 AA32g648 Hs.23804 ESTs. Wea]dy8iniilarbPN0099soii3 protein [Kssplens] 11.0 

420674 NM.000055 Hs.1327 butyrytcholResterase 10^ 

453060 AW294092 Hs.21594 hypothetical prolein M6C1S754 10.6 

424765 AA428211 Hs.284256 hypottet]calpfOle)nHJ140338im9artot9poxial^ 10.5 

. 452106 AI141031 Hs.21342 ESTs 9.5 

0 428780 A1478578 KS.S0636 ESTs 9.5 

431706 AI816086 Hs.296341 adenytyt cydase^sodated protein 2 9.2 

419589 AW973708 Hs.201925 Homo sapiens cDNAFU13446tis. done PLACE1002988 9.0 

430468 N1I1.004673 HS.241S19 aiglopoietliKfike 1 9.0 

443790 NMJ)03500 Hs.9795 acyi-Coenzyma A oxidase 2, branded diain a7 

O 448944 AB014605 H&22599 atiDphln-Hnteractbig protein 1;activin receptor Into 8.6 

401486 HA C400064r:(^7S8508]ref|NP.004253.1|Binvay trypsin^ &4 

417511 AL049176 Hs.82223 diordMIke a3 

429900 AA480421 Hs.30875 ESTs a2 

411908 IJ7943 Hs.72924 cyOdlne deaminase ao 

10 408134 AK000184 Hs.42945 add sphingomyelinase^ phosphodleslerase 6.0 

446543 AWe97741 Hs.21380 Homo sapiens n^c0hIADl07p586P1 124 (from done Di(FZp ao 

437846 AA773866 Hs.244569 esophagus cancsr-felated gene-2 aO 

421666 Aii)35250 Hs.1408 endolhelin3 7.9 

^_ 450164 Ai239923 H3.30098 ESTs 7.9 

15 412642 BE244S98 Hs.809 hepatocytograwQifador(hepapdelin A; scatter fador 7.7 

425608 AA360486 Hs.92448 ESTs 7.6 

442748 AI016713 H3.135787 ESTs 7.3 

415672 N53097 Hs.193579 ESTs 7.2 

414175 A130B876 Hs.103849 hypothetical protein DKFZp761D1 12 7.2 

>0 409601 AF237621 Ks.80828 keratin 1 (epldennolytic hyperkeratosis) 7.0 

424634 Nlyt003613 H5.151407 carlilsQe intermediate layer protdn,nudeotUepyroph 6.7 

414214 D49958 Hs.75819 glycoprotein M6A aS 

436637 AI783629 Hs.26766 ESTs a5 

406621 A)970672 H$.46638 chromosome 11 open reading frame 8 6.5 

)5 432101 AI918950 Hs.123642 EphA3 6.3 

456440 AI095468 Hs.135254 Homo sapiens done llhromtiospondinmRNA. complete ods a3 

424153 AA451737 Hs.141496 MAGE^ke2 a3 

420228 R2S023 H5.123G9 ESTs a2 

418390 AF133820 Hs.84665 6tM immunoglobulin domain protein (myoQIIn) ai 

(0 444931 AV652066 Hs.75113 general transcription (adortllA ai 

449394 AA004368 K3.iei60 Homo sapiens cONAFUl 1550 fis. done HEMBA1002970 ai 

425849 AJ000512 Hs.296323 semm/blucocorfcoki regulated Idnase ai 

410425 8E278367 Hs.63510 KiAA0141 geneprodud 6.0 

410765 AI694972 K5.66180 mideosome assembly protein l-Oke 2 aO 

45 424973 X92S21 Hs.154057 matrix metaOoprotelnase 19 6.0 

436547 AJ297351 Hs.30824 ieudne zipper (ranscriptbnfactor-Hkel 5.9 

429414 AI783656 Hs.202095 empty spirades (OrosopMla) homotog 2 aO 

440594 AW445167 H5.126036 ESTs ag 

452768 AW069459 H&61539 ESTs 5.9 

50 427669 AW451832 HS.2S5938 ESTs. Moderate^ dn^to KIAA1 200 protein (Ksapiens 6.9 

448533 AL119710 Hs.21365 nudeosomaassemfa(yprotdn1-ike3 5.9 

425010 T16837 Hs/241 ESTs 5.9 

426342 AF093419 H8.169378 multiple PDZdbmdn protein 5.8 

437980 R50393 Hs.278436 KIAA1474 protein 5.8 

55 425292 NlyL005824 Hs.155545 37 kOaieudn&fid) repeat (UW)proteln ' a7 

404097 MA CS000242*:gii9369379|gb)AAFB712ai|AC008434J24(AC00643 a7 

422546 AB007969 Hs.301478 KIAA0500 prdein a7 

445872 AI681573 Hs.288671 Homo sapiens cOHAFU1ig97 fis. done HEMB81001458 a7 

429999 AI761902 Hs.99597 ESTs a6 

60 453354 W55946 Hs.234663 Homo sapiens cDNARJ120B2 lis. done HEMBB1002492 5.6 

442082 R41623 Hs.7413 ESTs a5 

452073 AA625150 Ks.82098 ESTs a4 

430032 AW936136 Hs.99610 ESTs 5.4 

^_ 408767 AA057279 Hs.211928 ESTs 5.4 

55 433234 AB040928 H5.65366 KtAA1495 protein a3 

431708 Ai698136 Hs.108873 ESTs 5.3 

421200 AA284811 Hs.264433 ESTs 5.2 

435133 AJ010482 Hs.31412 Homo sapiettScONAFUl 1422 fis. done HEMBA1Q01008 a2 

409643 AW450866 Hs.257359 ESTs ai 

70 416676 AW392022 Hs.79507 KIAA0562 proteh ai 

420357 U94333 Hs.97199 comptement component Clq receptor aO 

417355 D13168 Hs.82002 endotheHn receptor type B aO 

423448 AK000776 Hs.128753 Homo sapiens cOHARJ20769 fis. done 00108674 aO 

^- 430965 AA489732 Hs.154918 ESTs 4.9 

75 419968 )(D4430 Hs33913 lnteiteuidn6Cnterferon.beta2) 4.9 

447471 AF039843 H5.18876 8prouty(Dro5ophna)homolog2 4.8 

404485 KA Target Exon 4.8 

429594 AK001128 Hs.210297 Homo sapiens cDtMFU10266 lis, done HEMBB1001 024 4.8 

417692 R09338 Ks.50724 Homo sapiens cOI^FU10934 fis. done OVARC1000&40 4.8 

80 432304 AA932186 Hs.69297 ESTs 4.7 

430895 tJ66581 H8.248121 6 proteln^iupied receptor 22 4.7 

448851 AI582207 H5.177166 ESTs 4.7 

405523 C8001409^441226lpir|iS31212odb«endphaipaV) 4.7 
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450655 AA0t0539 Hs.16912 ESTs 46 

422942 AF0S4S39 Hs.122540 tdraspan2 4.6 

401479 T49304 Ks.110950 Rag C protein 4.6 

444192 AW469413 Hs.151145 ESTs 4.6 

5 439648 AW7B0192 Hs.267596 ESTs 4.5 

410378 R23324 Hs.41693 DnaJ(K5p40}honnQbg,suMariiily8,niember4 45 

444702 A1220122 Hs.326560 hypoMca) protein MGC2780 4.5 

410909 AW898161 Ks.53112 ESTs. Moderately similar b)ALU8_HUMANAUI SUBFAMILY SX 4.5 

452249 BE394412 H3.202095 empty sp^acles (Orosophfla] homOtog 2 4.5 

10 430376 AW292053 Hs.12532 dvomosomel open reading frame 21 4.5 

411037 BE145915 Hs.99472 ESTs 4.4 

442B03 AI675298 Hs.199917 ESTs 4.4 

414831 M3115B Hs.77439 protein Unase. cAA^-dependenl, regulatory, type II. be 4.4 

400628 NA C1000187V:gi|1705533fspIP32018|CA1t»CHlCK COUAGEN ALP 4.3 

IS 414629 AA345824 HS.7668B cart»xy1esterase 1 (inonocyteAiQcrophage se^ 4.3 

437110 AL049240 ^te.144^95 ESTs 4.2 

410646 W79408 Hs.50745 ESTs 4.2 

456304 AI820973 gb3«21c02.y5NCLCGAPJ>r1 Hoino sapiens cONA(:lone^rnRN 4.2 

401270 Target Bam 4.2 

20 419447 BE092696 Hs.75928 ESTs 4.2 

414807 Ai738616 Ks.77348 hydroxyprostaglemfin dehydrogenase 1S(NAP) 4.2 

427019 AA001732 K3.173233 hypothettoal protein RJ10970 4.2 

434469 AA634806 glKal)28o02.r1 Stratagene lung (937210) Homo sapiens cDN 4.1 

444618 AV653785 Hs.173334 EUmATEDRNA POLYMERASE II. ELONGATION FACTOR 4.1 

25 418947 W52990 Hs.22880 ESTs 4.1 

416434 AW163045 Hs,79334 nuclear factor, InteHeuidn 3 regulated 4.0 

454736 8E184348 gb:CM04{T067&<l10500-35&«11HT0676 K0nio sapiens cONA, 4.0 

407945 X692QB Hs.606 ATPB58,CutrBn8poiflng.GllphapolypepOde (Menkes synd 4.0 

447499 AW262580 K5.147674 protocadharin beta 16 4.0 

30 430686 NKl.001942 Hs.2633 desmogtelnl 4.0 

409882 AJ243t91 Hs.56874 tieal shook 27kOpro^^fly, member 7 (caidiovascula 3.9 

419047 AW952771 Hs.90043 ESTs 3.9 

414272 AI651603 Hs.46988 ESTs a9 

443808 AW377736 Hs.12420 ESTs 19 

35 426883 H215») Ks.35088 ESTs 3.9 

410659 AI080175 Hs.68826 ESTs 3.9 

431292 AA370141 Ks.2281 dtiomogranin B (secretogranln 1) 3.9 

432181 AA527650 Hs.1S6037 ESTs 3.9 

422890 Z437B4 Hs.75693 anl(ynn3.noded Ranvier(anl(yrinG) 3.8 

40 453298 AA034413 Hs.62560 ESTs 3.8 

400878 NA TatgetExon 3.8 

401103 NA C12001233:g]|7305361|ref)NP.03d652.1iotogeIinlMusnius 3.8 

436670 AI690021 Hs.201536 ESTs 3.7 

432251 AW972383 H5.232165 polycythemia rubra vera 1; cell surface receptor 3.7 

45 408793 BE258371 Hs.254660 ESTs 3.7 

419093 A1804054 Hs.1 12885 spinal coid-derfved growth 17 

434844 AF157116 Hs.22350 hypolhafical protein LOC56757 ZJ 

450776 NM.007250 H5.320861 KruppeMilce factor 8 3.7 

437140 AA312799 Hs.283889 acthrator of CREM in testis 3.6 

50 416421 R58620 Hs£50S0 phospholamban 3.6 

443476 AW068594 Hs.133876 ESTs. WeaUy similar to YCDIJftJMAN HYPOTHETICAL PROTEIN 3.6 

417194 N53793 gb7z07a01.r1 Soaresjnulfa'plejscIeiosisJNbHMSPHomosa 3.6 

443567 A1077540 H8.134090 ESTs 3.6 

... 451079 AI821030 ^7b52ni.ySStrBtagene ovary (937217) Homo sapiens cO 3.6 

55 421013 M62397 H8.1345 mutaM In cotorectdi cancers 3.5 

451896 AF198304 Hs.27197 SUMO-l-spedSc protease 3.5 

413237 A1468574 Hs.171965 ESTs 3,5 

424636 AA453734 H8.10198 ESTs 3.5 

432660 A1288430 Hs.64004 ESTs 3.5 

60 414681 ALJD79440 Ks.74002 nuclear receptor ooacSvalorl 3.5 

400802 NA Target Exon 3.5 

430015 AW768399 Hs.112157 ESTs 15 

451978 AW813747 H3.27371 Homo sapiens mRNA:cOHAI)KFZp566J123(fnm dona DKFZpS 3.5 

449088 AI6S4048 H3.ig8556 ESTs 15 

65 425113 AI936992 Hs.154656 pleckstrtn and Sec7 domain potdh 15 

458459 A1124553 Hs.48g65 Homo sapiens cDNA:FU21693 lis, done COL09609 IS 

420249 BE262895 H3.276916 nudear receptor subfamily 1, group D. member 1 15 

401159 NA Target Exon 3.5 

„ 442789 AWg04381 Hs.131191 ESTs. Weakly sln^ to ALU7J1UMAN ALU SUBFAMILY SQSEQ 15 

70 426083 AW962712 Ks.126712 ESTs, Weakly similar to AF191020 1 EZIGSIKsapiens] 14 

407118 AA156790 Hs.262036 ESTs, Weakly sImRartoZ223.HUMAN ZINC FINGER PROTEIN 14 

423587 AA328074 Hs.284256 hypolheHcal protein FU14033 similar to hypoxia indud 14 ' 

443178 AI631241 Hs.47312 ESTs 14 

_-. 430694 AAB10624 Hs.30936 ESTs. Weakly similar to K2BK.HUMANHISTONEH2BH{K8ap 14 

75 423073 BE252922 Hs.123119 MAD (mothers against decapentqplegicDrosQphBa)tnmd 14 

437950 U79244 Hs.112642 ESTs 13 

419366 A1753518 Hs.209464 K1AA1604 protein 13 

447335 BE617695 Hs.286192 hypdhaUcal protein FU20940 13 

451398 AI793124 H5.144479 ESTs 13 

80 452814 AI092790 Hs.334703 hypothetical protein FU14S29 13 

407570 Z19002 Hs.37096 zinc linger protein 145 (KnippeMike. expressed bi pro 13 

412295 AW088826 Hs.117176 pdy(AH)lndIng protein, ntslear 1 13 

447261 NKL006691 Hs.17917 extracdhilarllnkdomdln^xmtdnlnol 13 
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444216 
418ni 
433036 
4045B4 
404195 
428819 
425198 
420833 
413156 
413607 
443960 
428790 
434520 
432247 
429303 
439734 
433546 
430317 
425130 
444195 
409007 
453773 
442974 



4540B6 
420271 
435545 
445175 

TABLE 168 



025303 

AA807881 

AA574091 



AL135623 

AA3520^ 

R47948 

AA127133 

T64741 

AK93577 

AF023456 

AA205273 

AA531287 

AW137635 

ACX)05013 

AI075877 

AB020645 

AA448208 

AB002351 

AL122107 

AL133761 

AI025670 

H10207 



H5222 

Hs.25329 

H$.10S964 



Hs.193914 
Hs.128003 
Hs.188732 



Hs^16 

Hs.193558 

Hs,177011 

H5.105605 

Ks.44230 

Ks.149 

Ks.125461 

K5^189 

Hs.99163 

H5.10587 



AI954365 
AA6S7415 
AV652851 



Hs.109308 
Hs.47314 
Ks.6975 



K3^107 
H8.20255 



Integrini8tpha9 

ESTs 

ESTs 

TanietExon 

NM.01 5718*:Komo sapiens NAOPH oxidase 3 (N0X3), mRNA. V 

K1AA0575 gene product 

hypoUieOc&lpn)tsinFU21213 

ESTs 

gb2)87e03.r1 Stratagene coton (937204) Homo sapiens cD 

gb:yo48ri 1.r1 Stratagene fiver (937224) Homo sapiens cO 

hypotteOcal protein aJ21966 

protein phosphatase, EF liand caldunMitndtng domain 2 

hypothsticd protein 

ESTs 

ESTs. Weakly similar to S65857 dlpha-IC^renergic rece 

CAMP response etemenMAiding protein CRE^Pa 

hypolhstica] protein FU11539 

gltitamtnase 

ESTs 

KIAA0353 protein 

Homo sapiens mRNA: cONA DKFZp434G0827 (from done DKFZp 
gb:OKFZp761C1413^ 761 (synonym: hamy2) Homo sapiens c 
ESTs. Weakly stmOar to leudn»fich g6oin»iiac6vated 
ESTs 

PRO1073 protein 
ESTs 
ESTs 
ESTs 



Pkey: Unkfue Eos prot)esel Identifier numtier 
CAT numt)en Gene duster nunfcer 
Accession: Gen1»nk aocession numbers 



•Pkey 

413156 
413607 
417194 
434469 
451879 
453773 
454738 
456304 

TABIH16C 



CAT Numlier Accessions 

135116J AA127133AA384396AW958912T72119 

1379911.1 T64741 BE158393BE152805 

1657323.1 N53793 N53716N53739 

387447.1 AA534806 C18732 AA729161 AA729860 

888642.1 AI821030 T47126 AI821318 

980699J AL133761AU33767 

1232235J BE184348AW817453Be011068 

176820J AI820973AI734077AI820984AA225796AA22S060AA225101 



3.3 

as 

12 
3.2 
3.2 
U 

ai 

3.1 

ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ao 
ao 
ao 
ao 
ao 
ao 
ao 
ao 
ao 
ao 
ao 
ao 



Pkey; Unkjue nuntber conesponding to an Eos probeset 

Reft Sequence source. The7dJgttraimberBlnthlscolimmareGenbankWenli!ler(60ramiberB. Thmhaml.etd.'ieteistDlbepubRcaitonenfitied'nteDNA 

8equenceofhumancliiomosome22.' Dunham Let at, Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
Ntjiosiflon: Indicates nucleotide positions of predicted axons. 



Pkey 


Ref 


Strand 


Ntjjosition 


400628 


3818355 


Pius 


4185H1984 


400802 


8567867 


IMnus 


174571-174856 


400878 


9864757 


Pius 


31493-32842 


401103 


6568122 


Minus 


98330-98449 


401159 


6067118 


Minus 


3180*3953 


401270 


9797168 


Minus 


141659-141813 


401486 


7341763 


Plus 


3258&^56,38281-36540.4079140833,440ia44179 


404097 


7770701 


Plus 


55512-55781 


404195 


^917 


Minus 


39186-39332 


404485 


8096921 


Plus 


75166-75264.124036-124232 


4045B4 


9857511 


Pius 


13865M39153 


405523 


9454643 


Plus 


114550.114688,117265.117407,119490.119599.123237-123395.131140-131217 



TABt£ 17A: 605 genes upregulated in tesficular cancer reiafive to nomial body Bssues 

If^nfi^!^^. ^r. 'V!^'^^Jf^f K"T^ body fissues that are Ckeiy to encode proteins amenable to moduiaOon by smafl molecules, pepfides. 

"TJf^^ fam 69681^ probeseb on UieEn^^finetib Hu03 GenecMp array. Gene expression data for each probeset obtained from IMs analysis 

nSS^J^riTS^^iSfSl^^ peptides, or antibodies (ftg. pJdnase.dBalhKlomah. 7tm, 

phosphatase, or loiLtranspQrtBr). Certain predicted protebfdonelns are nM r™~. 

Pkey: Unique Eos probeset idenSfier nuntber 

ExAocn: Exemplar accession number, (3enBank accession nuntber 

INGeneiD: UniGene number 

RodProtDomalns: Certain predicted protein domains. Abbrevtafions used: TMtb 
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QXefy to contain; olhsr protein domatn afatvevtafions are from PFAM (Nudeic Adds ResearcK 2002. 30:276-280). 
UnlGeneTldB: Ur^GenegenetUts 

95th pefcentae of ceivlcat cancer AlsdhfUed by BieSOthpercenSlednonnal tissue Als. where (he 10th percentileof all nonnal tissueAb was sutitracted from 
tnth the numerator end denon^nator 

PKey: ExAccn: UnfoenelO; Unlgene Title; Pred.PnsLDonnains; R1 

4085Z2; AI541214; Hs.46320; Smafl proline^ pmtsin SPRK (human. ; nQne.Comi!in; 33.942 

422168; AA588894; Hs.112408; S100 cdciunv^ng protein A7 (psortas; ethand.S 100;TM»M:SSbN; 33^05 

424098; AF077374; Hs.139322: smafl pro&wich protein 3; ComIfm;TM=M;SS=N; 31856 

422158; L10343; H3.112341; protease inhibitor 3. skin-derived (SKAU wap;Tiy4=I^SS=Y; 29.604 

433091; Y12642; Hs.3185; lymphocyte antigen 6 complex, bicus D; UPAR.LY6.tox!n>WMnjecp;TM=M;SS=Y; 27.95064945 

421948; U2583; Hs.33430% Iteratin 6A; fii3mentRhoGAP,DUF286,bZiPJropomyosin.tubu]in,DUF164,TBC^GoIte9^ 26.778 • 

446292; AF081497; Hs.279682; Rh type C glycoprotein; Amnionium-transp,FecCO:TM=Y:SS=M; 26.1133829 

407242; M18728: 1 gb:Human nonspecific crossreacOng anfig; lg;TM=M;SS8M; 23.382 

424687; J05070; Hs.151738; matilx metalloproleinase 9(geiaBnas8 B; fh24vemope)dnP^)Qda5e_M10;; 22^22 

412719; AWD16610: Hs.816; ESTs; none,none; 21.198 

406690; M29540; Hs.220S^; cardnoemtiryonlc ant^erwelated ceH ad; ig;TM'M;SSsM; 20.028 
402075; ; ; ENSP000002510S6^Plasma membrane caldum; noner, 19.038 

431958; X63629: H8.2877; cadherin 3. type 1, PKjadherfn (placenta; C8dheiln.CadheTtfLC,temi;TM=Y:SS=M; 17.92061281 
412471; M63193; Hs.73946! endothelial ooO growth (actor 1 (plat^ ©ycos.transf J»aycosLtrans„3N:TM=M;SS=M; 17.8978979 
417308; mm. 88.81892; KIAAD101 gene product none:TM=M;SS^ 17.08333333 
429259; AA4204S0; K5.3800S8; PtatophOln; none^none; 17.08235294 
417079; U85590; K5.81 134; biterieukin 1 receptor antagonist; ILIn 1&91S88628 
439926; AW014875; Hs.137007; ESTs; nonoA^ne; 16.69 

419693; AA133749; H5.301350; FXYO domdn^xjntalning Ion transport reg; ATP1G1J»li/LMAT8;TM=Y;SS=M; 18.365 
413753; U1776Q; Hs.76517: lamliiin, beta 3 (nloeln (125kD), kaiinh; laminlnJEGF.Iaminin_Nterm;; 15.75294118 
413278; BES63085; Hs.833; Interferon-stimulaled protein, 15 kOa; ubk)uilinn 15.48600509 
401781; ; ; Target Exon; filament:TM=M;SS=N; 15.43658831 
420440; NM-002407; H5.97644; mamm£Qk)bin Z UterogloUn;: 15.394 

441633; AW9S8544; Hs.112242; nomial mucosa of esophagus specHio 1; none;TMNit;SS«M; 15.12284151 
452240; Ai591147; Hs.61232; ESTs; none,none; 14^ 

428957; NM_003881: Hs,194679; WNTl inducible signaling paOiway protein; lsp^1,vwc.lGFBP;TM=M;SS=M; 14.49772727 

414987; AA524394; Hs.294022; hypolhefical protein FU14950; SH2;TM=M;SS=N: 14.4389313 

432374; W68d1S; Hs.301885; Homo sapiens cONA FU1 1346 fis. clone PL; none.none; 14.00909091 

400289; X07820; Hs.2258; matrbc metaltoprotelnase 10 (strometysin; hemopextn,Pep8daseJI110>\stacin;: 13.824 

414812; X72755; Hs.77367; monokine induced by gamma Interteron; iLB;THA=M;SS=Y; 117754386 

421552; AF026692; Hs.105700; secreted te!ed-re!ated protein 4; F2,NTR;; 13.74595843 

400284; ; ; NM_000125':Homo sapiens estrogen recepto; hormone,/eczJ-C4,Oestjecep;TM=M:SS=*^ 13.31578947 

428227; AA321649; Hs.2248; smaO Inducible cytokine subfamily B (Cy; IL8;TM=M;SS=Y; 13.052941 16 

411274; Nl^002776; Hs.69423; kalllltrein 10; trypsln;TI«=M;SS=N; 13.038 

406687; M31126: Hs.352054; maiTixmetaltoproteinase 11 (siromelysin; hemopexIaPeptid3se.M10;; U00311527 

427666; At791495; Hs.180142; calmoduBn-fkB skin protein (CISP); efhand:TM=M;SS4l; 12.79 

400301: X03635; Hs.1657; estrogen receptor 1; F^hommnojec^,OesUecep,adh.zlncMoacykynl4)p^ 

synLC^E7,RFX_DNAJblnding;TM=M:5S=N; 12.472 
410001; AB041036; Hs.57771; kaHlkreln 11; trypsln;TM=M;SS=M; 1Z47 

422310; AA316622; 83.98370; cytochrome P450. subfamily US. pOtypept; none.pkinse,fh3,lg; 12.28597122 
430630: AW269920; Hs.2621;cy8ta8n A(steiin A):cystafln;ThNyt;SS»N; 1Z13379205 
437044; Aii)35864; Hs.69517: differentlafly expressed in FanconTb an; none;Ti\^M;SS»M; 1Z04945055 
418462; BE001596: Hs.85266; integrin, beta 4; fh3Mgna.B.Caix-beta,EGF;TiyNM;SS=M; 1 1.95538462 
443859; NML013409: Hs.9914: fbtHstatin; kazai;; 11.95467422 

426350; tiM.003245; Hs.2022; trsnsfllutaminase 3 (E polypeplide, prote; Tr3nsghitamirU<T(8nsglutaminjCTransgluLcore;TM^ri;SS<^; 11,61 
408243; Y00787; Hs.624; intefteukin 8: HLHPASJLB:Tliit=M;S&4« 11.564 

444761; NM.014400; H3.11950; GPknchored metastaSI&^ocialed prote; UPAa.LY6.lactaniase.B; 11.55285714 
428484; AF104032; Hs.184501; solute canier family 7 (cafionte amino ; 

aaj)emieases.pyrtdoxaLdQ(;bromodcmain,PHD,MBDATJiook,DOT,PI3J'i4Jdnase.FAT^ 
418663; AK001100; H5.41690; desmocdlin 3; cadherin.Cadheri^.C_tenn,none; 1 1.456 
433001; AF217513; Hs.279905: done KQ0310 PRO0310p1: none;; 1 1 .453521 13 

423217; NM-000094; Hs.1640: collagen, type Vli, a^ 1 (epidennolys; i<unil2^BPTl,fn3,vwa,Coaagen.beta4actama3o;TMaM;SS=M: 11.32234432 
428970; BE276891; H5.194691; retinoic add induced 3 (RAIG1): metabo; 7tmJ;Tiyi='Y;SSaM: 11^8686327 
424834; AK001432; Hs.153408; Homo sapiens cONA FU10570 fis, done NT; none.none; 11.076 
451541; 6E279383: Hs.26557; piakophiDn 3; Amiadino.8eg;Tft/NM;SSeN: 11.0381579 
418478; U38945; H8.1174; cydin^lependent kinase inhibitor 2A (me; ank;; 11 

423673; BE003054; H3.1695; noatrbc met^protelnase 12 (macrophage ; hemope}dn,Peptidase_M10;ThNM;S&=M; 11 

425071; NM.013989; Hs.154424; d^nase. kxtolhyronine, type II; T4^dek)d1nase;Tf^Ii^SS°Y: 10.93859649 

437938; A1950087: HS.35962B; gbwq05c02jc1 Na_CGAPJ<M12 Homosaplen; none,none; 10.78064516 

425367; BE271188; H5.155975; protein tyrosine phosphalase. receptor t none;TM=M;SS«Y: 10.74825176 

439706; AW872S27: Hs.59761: ESTs. Weakly strnfiar to DAP1J1UMAN DEATH; none.none; 10.542 

437897; AA770561; Ks.146170; hypotheOcat protein FU22g69; zH^HHCnone; 10.49538462 

431629; AU077025; HS.2658Z7: interferon. alphaMtdble protein (dd; nofte;Tliil»iyi;SS»Y; 10.48210736 

41 1558; AA102670; Hs.70725; gamma-aminobutyric add (GABA) A recepto; Neur_ch8iulBD.Neur_chanjnembcTMaY;SSsM: 10.26714286 
409142; AL136877; Hs.50758; SMC4 (strocturat maintenance of chromoso; ABC.tran,M,SMC N,SMC COUF164.none: 10.142 
421508; NM.004833; Hs.105115: absent In melanoma 2; PAA0^DAP1N,HIN;TM=M;SS«N; iai 

418641; 6E243136; H3.86947; adislntegrfn and metaRoprateinase doma; disintegiin,Repiotysin.Pep_M12B_propep.EGF;TM«'Y;SSsiU|; 10.072 

425397; J040a8; HS.1S6346; topdscnierasa (P»4A) II alpha (ITOkO); DNMyr8SeB.DNAJopoisolV,HATPasd.c;; ft996383636 

414035; Y00630; Ks.75716; aerfne (or cysteine) proteinase inhibito; serpin;; 9.896825397 

421506; BE302798; Hs.l05097; thymidine kinase 1, sduble; TK;TM=M;SS=N; 9.888888889 

407786; AA687538; Hs.38972; tetraspan 1; transmembrane4;TM=Y;SS=M; 9876058338 

424441; X14850! Hj.147097: H2A hislono famBy, member X hi5tone,C8FO_NFYBJ1MF;; 9.851635514 

438091: AW37306% Hs.351546; nudear receptor subfamily 1. group 1, m; tonnonej8czf^,none: Mmsom 

413859: AW992356; Hs.8364: Homo sapiens pyruvate dehydrogenase Una; SAMJ>Nr.none; 9.823170732 

408000; L11890: Hs.198889: bulkws pemphfgdd antigen 1 (23QQ40kD); elhand,8pedilaGAS%SH3.Piectin.RA.Xylb5eL)som,FiD^ 

lZm.<;aiAIP3;TRMyi;SS=M: 9.812 ^ 
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409893: AW24709Q; Hs^lOI; ndrddiramosome maintenance defictent (S; MCM.aUoJ(eUed:TM=M;SS^ 9.787878788 

442599: AF078037; Hs.324051; ReUV^ssodated InhlUtDr. SH3.ank;TM»M:SS41: 9.637037037 

425650; NM_001944; Hs.1925: desmogteln 3 (pennpWgus virtgaris enOgen; cadh8rfn;TM=M:SS=^4: 9.598 

417900; BE250127; Hs.82906; COC20 (oell dhfiston cycle 2a S. cerevl; WWO:TM=M;SS=N; 9.558 

444946: AW139205; Hs. 156457; hypothetical pict^ aj22408; ebhy*oIas8,abhydrolase^TM3Y;SS=M; ft5S 

421481; AW391972: Hs.104696; KIAA1324 protein; none;TM=M;SS=H a529085B73 

408591; AF015224: Hs.46452: mammagiobin 1; Utero^b!n:TM4A;SS^ 9.S06 

444381; BE387335; H3.283713: liypolhetJcal protein BC014245; Conagen;TM=M;SS=M: M77961433 

444006; BE395085: Hs.334762; type I transmemtjrane protein Fn14: ldUBcepLa.PKD.MHCJ:TMNM:S8=Y; ft415151515 

413719; BE439580; Hs.75498; smaH inducible cytokine eubfandly A (Cy: iL8;; 9.40B 

424364: AW383226: H5.163834; ESTe, WeaMy similar to G01763 alrophln-: ras;TM=M;SS=N{ 9^ 

429002; AW248439; Hs^0;junc8on plakoglobin;AnnadHto_seg;TM=M;SS=N; 031 5693431 

421379: Y15221; Hs.103982: small Inducible cytokine suWamily B (Cy; IL8:TM=M;SS=Y; 9.31 

41 8004; U3751 9; Hs.87539; aldeliyde detiydrogenase 3 family, member ; aWedh;TM=M;SS=M: 9.29 

454034; NM_000691: Hs.575: aldehyde dehydrogenase 3 famKy, member ; aldedh:; 9.264 

417389; BE260964: Hs.82045; mkJWne (neurite growth^ranofina factor, PTHJ^Th^f^SS^Y; 9.241561 181 

445033; AV652402; Hs.72901 ; cydiiKlependent Wnase Inhibitor 2B (pi; ank:; 9.207272727 

443426: AF09815B; Hs.0329: chromosome 20 open reading frame 1; none:TM=M;SS=N: 9.195167286 

439223: AV\/238299; Hs.250618; UL16 binding protein 2; ldljecept^a,PKD,MHCJ;TI*=M:SS«Y: ai08 

428758; AA433988: Hs.98502; CA125 antigen; mudn 16; SEA;TM=Y;SS=N; 9.028 

421777; BE5520B8; Hs.108196; HSP(X)37 protein; none;TM=M;SS=N; 9.004 

448988; Y09753: H5.22785: gamraa^obutyrte ackJ (GABA) A recepto; Meurjchan_LBD,Neur^chan_memb;TM=Y:SS=M; 9.001096491 

418989; W33191: Hs^7; hypothetical prat^ aJ2Q258t SK3:TM4tS&:N; 8.942 

455601; AI368680: Hs.816: SRY (sex determining regton YHxac 2; KMGJnx;: 8.87 

429211; AF052693; Hs.198249: gap junction protein, betaS (connejdn 3; connexln;TM=Y;SS=M; 8.77131783 

455906; AF1 17646; H3.156637; C8S-Br4wI (murtne) edropic retroviral Ir, zfC3HC4,(nlN.Cbl.N2.CbLN3;TM=^*SS=N; 8.738 

430397; Ai924533; Hs.105607; bicarbonate transporter relaled protein ; H(X>3.cotransp:TM»Y;SS=t); 8.736 

417034; N^006183; K5.80982; neurotensin; none;; 6.592 

429083; Y09397; Hs.227817: BCl2-iB!aled protein A1; Bcl-2:TM=M;SS=N; 8536 

408113; T82427: Hs.194101; Homo sapiens cDNA: FLJ20869fis, done A; 7tm.3,none; 8.49 

439285; AL133916; H5.47860; hypolhelicai protein FLJ20093; ig.pklnase,LRI^U<RNT,UWCT,nofte; 8.460655738 

409420; Z15008; Hs.54451; laminin, gamma 2 (nioein (lOOkO). kallnl; laminIn_BJaminin_EGF; 8.414 

438746; AI885815; Hs.1 84727; IHuman melanoma-assodaled antigen p97 (m; tran&(enin,Gu8nylatBjdn.PDZ.8H3: &376205788 

439606; W7gi23; Hs.58561; G protein-coupled receptor 87; 7tm_1;TMsY:S8sM; 6.37 

430486; BE062109; Hs.241551; chtoride chanriel, cak^m actlvaled, fam; none;TM=Y;S8=M; 8.384 

429170; NM.001394: Hs.2359; dual specificity phosphatase 4; Rhodanese.OSPcYj)hosphalase.i^bosomdLS3LN;TM=M:8SsN; a266 

417771; AA804698; Hs.82547; retinotc add receptor responder (lazaro; none,nQne; a248314607 

431620; AA126109; Hs.264981: 2'-S'-origoadenyiaIe synthetase 2 (69-71; NTP_transL2:TM=M;SS=N; 8.166 

412270; AC»05262; Hs.73797: guanine nucleofide binding protein (G pr; &a!pha,arf;TM=M:SS=N; &142857143 

448733; NM.005629; Hs,187958; sotirte carrier family 6 (neurolransntitte; SNF;TM=Y;SS=N; ai37558809 

427557; NM-002659; Hs.179657; plasminogen activator, urokinase recepto; UPAK.LY6.ETnAiJnh:; 8.043478261 

424439; AA579635: Hs.1770; Rgase I. DMA, ATP^Jependent; ONAJ^ase;; a0381 94444 

418322; AA284166; Hs.841 13; cydinKJependent kinase InhiKtor 3 {GOK Yj)hosphatase.DSPc;TM=l«;SSsN; 8.(C4752475 

453857: ALC80235: Hs.35861; R8&4nduced senescence 1 (WSI); none;TM=Y;SS=M: 6 

424046; AF027888: Hs.138202: serine (or cysteine) proteinase InhlbltD; 8erpin:TM=M;SS=N: 7.982 

418528: BE019020; Hs.85838; soWe carrier family 15 (monocarboxyiic; none;TM=Y;SS=M; 7.97368421 1 

413219; AA878200; H3.118727; Homo sapiens cDNA RJ13692 fis, done Pl^ HLKdea9i.TNFR,c6>^cyM>>A^hydro; 7,892 

422809; AK001379; Hs.121028; hypolheticd protein FU10549; IQ;TM=M:SS=N; 7.824 

452203; XS7522; Ha.352018; transporter 1. ATP4)indIng cassette, sub; ABC_tran,ABCjnembrane,SRP54,ThymkJylalB_Wn;TM=Y;SS=M; 7.823874755 

431630; NM-002204; H8.265829; talegrtn. aJphaSCanBgen CD49C, alpha ; FG.GAP.I^abd_fllycopJntegriaATM=Y;SS=M; 7.758985201 

432874; Wg4322; Hs.27S651; melanoma bihibltDiy acSvIly; 8H3:TM=M:SS=Y; 7.75887574 

439453: BE264974; H5.6586; Ihyrdd hormone receptor Interador 13; AAA,AB(Lban,CoaE;TM=l*SS=N; 7.757751938 

452747; 6E153855; Hs.61460; Ig superfantDy receptor LNIR; ig.rai8bd_glycop:TM^Y;8S=M; 7.624 

438089; W05391: Ks.351546; nudear receptor subfamSy 1, group I. m; honnonajeczf^.none: 7.605660377 

427747: AW411425: HS.18065S; serine/threonine kinase 12; pklnase:TMsM;SS»N: 7.578 

430280; AA361258: Hs.237868; InterietikIn 7 receptor; fh3,none; 7.476 

429299: AI620463; Hs,347408; hypothetical protehi MGC13102; none;TI*=Y;SS=N; 7.442S28736 

441384; AA447849; Hs.288660; reHnob add induced 3; 7tnL3.none; 7.442495127 

446163; AA026880; Hs.25252; prolacSn receptor, none:NA:NA; 7.436781609 

414774; X02419; H3.77274; plasminogen acthrator, urokinase; lafngta,tjypsin,pl8nUhk»insn 7.435897436 

439237; AW4081 58; Hs.31 8893; ESTs, Weakly slmTar to A47682 B«D gr; Furin^ike,pkinase,ltece(Ujlomaln,YIJ»,noiie; 7.398360656 

432636; AA340884; Hs.278562: daudin 7; PMP22ja3udin:TW=Y;SS=M: 7.394039735 

431890; X17033: Hs.271986; integrin. alpha 2 ((»49B. alpha 2 subunh vwaJntegnaAF6^TM=Y;SS=M; 7.383419689 
416084; L16991; Hs.79006; deoxyfhymldylate kinase (ihymUytale kin; none,none; 7.382 
436972; AA284679; Hs.25640; dautfin 3; PMP22.(aaudIn:TM=Y;SS»M; 7.327160494 
409799; 01 1928; Hs.76845; phosphoseitne phosphatas&4ike: Hydrolase;TM=M;SS=N; 7.316 

448569; BE382657; Hs.21488: signal transducer and adivator of trans; 8H2,STAT,STAT_blnd.STATj)rot;TM=M:8S»N; 7.315412186 
428450; NM-014791; Hs.184339: K1AA0175 gene produd; KA1.pkinase:TM=M;SS=N; 7.2984375 
422283; AW41 1307; H5.1 1431 1; CXXA5 (cell dhrtsion cyde 45, acerevis; COC45;TM=M;SS=N; 7.28 
461253; H48299; Ks.26126; daudin 10; PMPZLaaudln.PeplidaseJA1 JUBtr8;TM»Y;SS=M; 7156802721 
416819; U77735: Hs.80205; plnv2 oncogene: pWnase;: 7.234455959 

421817; AF146074; K5.IO866O: ATP-Wnding cassette, sub-family C ((KR; FasddlnABC_tran,ABCjnenibrBne,GTP.EFTU:TMsM;SS44: 7.162534435 

451035; AU076785; Hs.430: plastin 1 (I isoJwm); e!hand.O^AJaptin_N;: 7.145454546 

424008; 1^2740; H5.137555; putalhfechemoklne receptor; GTP-biiwJlng; 7tnu1;TM=Y:SS=M: 7.126 

414482; S57498; Hs.76252: endotheBn receptor type A; 7tm_1;TM=Y:SS=M; 7.122413793 

425003; AF119046: Hs.154149; apudnfctepyrimkJWc endomjdease(APEX ; Troponin.Exo.endoj)hos,lQ:TM=M;SS=N; 7.106719368 

430890: X54232;H3.2699;glyplcan1:G!ypfcan;TM=M;SS=M; 7.088937093 " 

407792; AI077715; Hs.39384; putaOve secreted Bgand horootogous to f; nono;TM=M;8S=Y; 7.052 

426514; BE616633; Hs.170195; bono morphogenette protein 7 (osteogente; T6F.beta,T^_propeptide;: 7.042 

431241; AA495799; Hs.36958: ESTs; SH2.fteGEF/»one: 7.03 

437139: W73685; H3.1 18513; ESTs. WeaMy similar to RTA RAT PROBABl^; 7tm.1;TM=Y:SS=M: 7.03 
42031 1; AW44S044: Hs.38207: Human DNA sequence torn done RP4^15; none,ftone; 7X26 
439979; AW600291; Hs.6623; lypothetical pioteh FU1043Q: nonecTM4^N; 7.008 
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422846^ BE51 3934; Hs. 1583; neutrophO GytosoQc factor 1 (ITU), dr. SH3mTM=M:SS^ a99162S794 
416250; AAS813B8; Ks.73452: Kremen % kringl8.CUB.WSCr. 6^72 
430770; AA765894; Hs.123296: ESTs; none,non8; 6.95 

418869; AW5t6565; ; gb^cqOldOSjd Soaes.NHCeC.oervtcd.tumo; none.RasGAP.WW.I(^ 6.948 

428953; AA306610; Hs.348183; tumor necrosis factor receptor superfami; 60s.fibosomai,Ribosomal^LiaTNFfLc6,DEACte &914 

418283; S79895; Hs,83942; cslhepsin K (pycnodysostosis); Peptidase.Clr, 6.876190476 

419667; AU077005; Hs.92208; a dislntfigrin and metaflopr^elnase doma; dlsIntegrfn,Reproly^Pep,M12Bj)iopep;TM=M;SS=M; 6.862970711 

421143; AB024536; Hs.102171; tmmmoglobuSn superfamBy containing ts; ig.U«UJ^.LRRCT;TM=M;SS»M: 6.849056604 

456181; U8463: Hs.1030; ras Inhil^ RA.SH2.VPS9;TM=M;SS=>J; ^762 

436856; AI469355; Hs.127310; ESTs; pklnas8.rnT5TM=M;SS=N; a721428571 

41 1125: AA151647; Hs.68877; cytochrome b-lAS. alpha polypepBde; non8;TM=Y;SS=M; 6.720348837 

439750; AL359053; Hs.57554; Homo sapiens mRNA fun length btsert cON; lMPDH_CIMP0K.N.C8S.InlegTin_B.Rlcii\_BJeclin; 6.717307692 

450334; AF035959; Hs.24879; phosphatidte acid phosphatase type 2C: PAP2;TP^Y;SS=M; 6,715240642 

426437; BE076537; Hs.169895; uWquiOrvconjugailng enzyme E2L 6; Amiadnio,seg.UQ.con.none; 6.688194444 

439738; BE246502; Hs.9598; sema domain, Immunoglobulin domain (Ig).; Sema.PSl,lntegrin^B;TM=Y;SS=N: 6.670553936 

428385; AF1 12213; Hs.184062; putaBve RabWnteracflng protein; SH2.SH3:; 6.662921348 

456534; X91 195; Hs.100623; phosphofipase C. beta 3, neighbor pseudo; UM,PDZ,piclnaser. a653713299 

425289: AW139342; Hs.155S30; interim, oamma^ndudble prM 16; PAAD.OAPIN.HiN;: &652671756 

426500; NM.014638; H5.170156: KlAA0450flene produd; C2PM»LC"YiTM=M;8SBN; 6.639855172 

438113; AI467908; Hs,B882; ESTs; 7bn_1.non8: a6 

444783; AK001468; Hs.62180; anlllln (OrosopWIa Scraps homolog), act; PH.none: 8,6 

408462; NM.000676; Hs.45743; adenosine A2b receptor. 7lnLl:TM=:Y;SS=M; 6.548148148 

410290; AA402307; Hs.322844; hypotheOcal protein DKFZp564A176; Sema.PSI.TIG^inl89rlrLB;TM»Y;S8=M; 6.532763533 

414809; At434699; K9.77356: trensfeirln receptor (p9a CD71); PA:TM»Y;SSsN; 6.526961673 

426440; BE382756; Hs.169902; sohjte carrier family 2 (faci&talBd glu; 8Ugar.tnTiM»Y;S&4ll: 6.512704174 

420039; NM_00460S; Hs.376147; sulfotransferase family, cytosoBo. 2^ : SullblFanster;; 6.498 

423031; AI278995; Hs.374579; ESTs; none.none; 6.447658402 

421445; AAgi3059; Hs.104433; Homo sapiens, done 1MAGE:4064868. mRNA; lonJrans.K.tstfa,esp: &426666667 
433933; AI7S4389; H5.355397; Homo salens done TCCCiA00164 mRNA sequ; none;NA;NA; ^4 
435094; AI560129; Hs.289008; EST; none.none; 6.312 

432105; N58323; H8.269098; ESTs. Weaidy similar to RETROVIRUS-RELAT; SH3,P02,GuanylatB.Wn,none; ^276556777 
427640; AF058293; Hs. 18001 5; D^iopachrome tautomerase; COXB.SKMT,MIF,GST_C,EF1G_domaln,GST.N,S1J^z.Fri2zted,calreticuIinJtm^2.inaP 
6.272727273 

435232; NSt001262; Hs.4854; cydl^dependenlldnase Inhibitor 20 (pi; ank;TM=M;SS=N; 6.269720102 

418203; X54942: Hs,83758: COC28 protein kinase Z CKS;; 6219081272 

41 1263; BE297802: H3.69360; IdnesltvUko 6 (mHoUc centromere^soc; ldnesln;TM=M;SS=N; 6.19 

409512; AW979187; Hs.293591; melanoma differenfiafion assodaled prot; OEAD.heltease_CCARD;TM=M;SS=N; ai88886689 

449230; BE613348; Hs.356392: melanoma cell adhesion mdecde; lg,isodh.Rlbosomal^L6,F-box;TW=Y:SS=M; 6.188046647 

440006; AK000517; Hs.6844; NALP2 protein; PYRIN-Contairtng APAFI^; AAA.NB^CPAAD.DAPIN;NA;NA; 6.15503876 

450581; AF081513; Hs.25195; TGF^ta 4; TGF-bela.TGFbjxopepildB;; 6.152 

432314; AA533447; Hs.2B5173; ESTs; Xnnk.none; 6.1230407S2 

418844; M62982; Hs.1200; aradiidooate 12-^poxygenase; lipoxygenase,PlAT;TM=M;SS=N; ai2 
421733; AL119671; H5.1420; fibroblast growfli fador receptor 3 (ad); Ig,pkIn8se;TM=Y:SS=M; 6.095758355 
422051; AW327546; Hs.1 1 1024; solute earner family 25 (mitodtondrid ; mltDjcafT;TM=M;SS=N; 6.0891 64786 
452683; AI069576; Hs.374574; progesterone membrane binding protein: homeobox.none; a06284153 
445537; AJ245671: Hs.12844: EGF4II»-<lomaln. multiple 6; EGF.MAMn 6.05513308 
444309; U83236; H3.10803; caldum and Integrin binding protein [DN; elhand;; 6.04015544 

414166; AW888941; Hs.75789; N-myc downstream regdated; DEAD,heIicase.C^Ndr,Cy8jtfiot,Tlt^wa,wrc,wri,l(>,Rlla,aW 

beta.DUF139,TPR.DSPc.tsp.1.RibosonidLS21.fvp;TM=M:SS=N; 6.009562842 
438108; AI471796; HS.2S7776; vanQlQld feceptor-ralatfid osmoBcally a; arikJoiUran8;TM"Y;SS4l; 6.004 
413869; NhiL000S7e; Hs.75596; Inlerieuldn 2 leoeptor. beta; itone:TM»Y',SS«M; 5.964536083 
405484; ; ; C3002124*:ffl127372801refjXP_006682.2| k; nonen 6.978964401 

414907; X90725; Hs.77597; pdo (DrosophlaHke Wnase; RIbosomaLU7a9.pklnase,PaOJ)OX,tRNA^1b.dynamin.dynamii^ 6.978431373 
419216; AU076718; Hs.164021: smaH indudtile cytokine subMy B (Cy; ILB;; 5.976 

414135; Nlil.004419; Hs.2128; dud spedSd^ ptwsphdaseSt Rhodanesa|)SPt,Yjiltosphatase;TM4t88BN; 5.96938n55 
411756; BE294350; Hs.71891; disoddbi domdn rocepbrMly. member; pkkiase.F5J^type.C;TM»Y;SS«H 5S5184136 
424291; AL120051; Hs.144700: ephrin^l; Ephiin;TM=Y;SS»=Mt 6.951550388 
453459; BE047032; H8.257789; ESTs; nono.none; 5.95 

456373; BE247706: HS.B6693; membrane-spanning 4<kimains, subfamily A; none;TM==Y;SS=N; 5.938 

429359; W00482; Hs.2399; matrix metalteprotdnase 14 (membrane^n; hemopexln,Pep8dase_H«l(kTM=M:SS=M; 5517857143 

414703; BE243877; H8.380063; ATPase, Na? transporting, beta 3 pdypep; NaJ(-ATP8se;TM=Y;SSsM; 5.910456487 

448775; AB025237; Hs.388; nudU [nudeosMe diphosphate linked mol; NUO]X:B^M;SS=M; 5.901886793 

452239; AW379378; Hs.356289; protdn tyrosine pho^ihatsse, receptor t; none.non8; 5.868362632 

418345; AJ001696; Hs.241407; serine (or cysteine) proteinase Inhlbito; 8efphi;TM=Y;SS=ftA; 5.642 

452875; BE27S760; Ks.30928; DNA segment on chromosome 19 (unique) 11; Eulu»rtn;TM»M;SS4«; 5.816363636 

439625; AF086453; Hs.58511; ESTs; ForOe3d.glycolyfc_enzy.Na.sulph.symp; 5.811694203 

447343; AA2S6641 ; H3.236894; ESTs. Highly dmiiar to S02392 a)pha.2-m; nona.none; 5.81 

422765; AW4(H701; Hs.1578; baculovlral lAP repeakontdning 5 (sun BIR;TM=M;SS=N; 5.808 

415198; AW009480; Hs.943; naturd kfUerceQ transcript 4; nond;TM=M;SS=N; 6.804137931 

431941; AK000106; Hs.272227; Homo sapiens cDNA FU20099 fls. done CO; pklnase.Furii^llke,Recep_U.domdn.none; 5.8 

457001; J03258; Hs.2062: vltanDin D (1.25. dlhydroxyvllamin D3) re; hormonejecrf.C4.Metaltolhk).6;TM=M;SS=N; 6.794 

439335; AA742697; Hs.62492; NMJ052863:Homo sapiens secretogtodn. fa; nonen 5778588808 

439246; AI498072; Hs.351474; membrane-assodated tyrosine- and threon; aiik.pkkiase.UPF0073n 5.763492084 

452461; N78223; Hs.108106; transcription fador, zW^C4,uKqd(in,PHD.YDG.SRA;TliA=M;SS=N; 5.728 

414883; AA926960; Hs.348659; C0C28 protein idnase 1; CKS;; 5.714634146 

424517; A1539443; Hs.137447; Homo sapiens cDNA FU12169 fis, done MA; SH2.STAT,STAT.bind.STATjirolnone; 5.701666667 

419056; M89957; Ks.89S75; CD79B snGgen Connunoglobt&vessodated; ig,ITAM;TM=Y;SS4^ 5.692 

432269; Nf^002447; Hs.2942: macrophage sSmuIaQng 1 receptor (o^t; pkln8se.Sema.P8I.T1GA4_EXrRA:TMs»«SS«M; 5i686 

452696; AI826645; Hs,211534; ESTs; AffG8p,PH,ank,Guar^Jdn.P0Z.SH3; 5.683873469 

411030; BE3B7193; Hs.67896; 7-60 protein; none;TM=M;SS=N; 5676767677 

447131; NM.004585; Hs.17468; retlndc add receptor responder (tazaro; none;TM^Y:8S=N: 5.672977625 

426227; U67058; H9.154299: Hunnan protdnase acfhrated (eceptor-2 mR; 7tnLl:TM»Y;S^ 5.666 

40772% BE252241: H8.38041: pyridoxd (pyridoidne. vOamln 6Q Idnas; plkB;TM4«SS«N: 5.655616943 
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4274S0; Z95152: Hs.178695: mitogeivacGvated prnteb kinass 13; pkInase:TM=AtSS=N; 5.6485623 

418010; NM_004203; Hs.777e3: membranfr^sodaled ^ro^ end Onson; Bnk.pkinase.UPF0073;: 5.648 

452690; A1536070; Hs.15085; ESTs; |Kni.homeobO]U^chan^Fjecepton 5.646 

424321; W74048; Hs.1755; tynvhocytwpecific protein tyrosfna kin; SH2.SH3,pWnas8;TM=M;SS=N; 5.642405063 
5 418703; NM_014448; Hs.87435; RhoQuanine exchangB factor (GEF) 16; SH3.PH,RhoGEF,BinJ8LVP3;Tfc^M;SS=N; 5,636 

426108: AA622037; Ks.166468; programmed cen death 5; DUF122:TM=M;SS"N; &635087719 

424430; AJ278016; Hs.55565; aidcyrin repeat dom^ 3; 8nk.pkinase;TM=M;SS=N: &620930233 

432065; AA401039; Hs.2303; protein phosphatase 4 (formeily X). cata; M8tatIophos;TM=M:SSBN: 5.608352145 
. . 417018; M16038; Hs.80887; v-yes-1 Yamaguchi sarcoma vlra! related ; SH2,SH3,p)dnase:TMsM;8Ss4l; 5.596052632 
10 430696; AA531276: Hs,59509; ESTs; panasB.PP2Cfwne; 5.575112108 

435017; AA336522: Hs.12854; angiotensin 11, type I receptor-assodat; none;TM=»Y;SS=M; 5.556910559 

439963: AW247S29; Hs.6793: plataietactivaSng f actor acet/lhydrula; PAF-AH_<b,Up35e.G0SUTM=M;SStrN: 5.556195965 

415012; NIML004383: Hs.777g3; &«fctyiostne kinase; SH2^4it(inaseJM=M;SS4n 5.555421687 

424909: S78187; Hs.153752; ceS dtvlston cyde 2SB; Rhodanesen 5.549751244 
1 5 413989; X14034; Hs.76648; phosphoHpase C. gamma 2 ftdiosphafidylin; SH2.SH3.C2,PH.PI-PLCY.H^C-X.POGFs 5541366907 

406621; X57809; Hs.181 125; ImmunogtobuDn lantbda tocus; ig,HSP70,Ppx.6ppA;TM=M;SS=N; 5.54076087 

417700; M36542: Hs.1 101 ; POU domain, dass 2» banscripBon facto; homeobQx.pou;TM=M;S&=N: 5.536 

456362; AW973003; Hs.179909: hypoOietlcd protein FU22995; none;tMsM:SS4l; S.S2661897 

436576;AI458213; Hs.77542; ESTs:7tnu1,DnaJ;5,52638191 
20 425465; L18954: H8.1904; protein kinase C tola; pWnase,DAG_PE^nd.pklnase_COPI^TM=M;SS=N; 5.519672131 

412276; BE262621; Hs.73798; macrophage migreBon inhiKtoiy factor (; MlF^ugarJr.none; 5.516453382 

417433; BE270266; Hs.82128; 5T4 oncofetal Iropiioblast glycoprotein; UW,UWWT,LRRCT;TM=Y;SS=M; 5.514964789 

447827; U73727; Hs.19718; protein tyrosine phosphatase, receptor t; fti3,lg,Yj)hosphalaso,MAM;TM3Y;SS=M; 5.494202899 

419508; AW997938; Ks.90786; ATP.|)lndlng cassette, sulvfamfiy C (CFTI^ ABC,tran^Cjneinl)r8n8:TM=Y;SS=M; 5.471947195 
25 410608; AI538438; H5.159087; ESTs; ub!quiiin.lntegrin_0,UBA.none; 5.465384615 

448633; AA311426; H3.21635; tubulin, gamma 1; tubuEn;TK^M;SS=N; 5.460076046 

408716: AI567839: H8,151714: Homo sapiens mlVIAfor KiAA1769 protein, ; lJvTO.heilcase.RNB,Runt;TM=M;SS=N; 5.450413223 

426410; BE2g8446; Hs.305890; BCU-tike 1; Bc^2,BH4,none; 6.444805195 

457819; AA057484; Hs.35406: FU20522 Hypothetical protein FU20522: none.none; 6.444281625 
30 422597; BE245909; Hs.11B634; ATP^lng cassette, sut)-famay B (MOW; ABC„tran,ABCjnembrane,PIW;TM=Y;SS=N; &437931035 

429191; AF06521 5: Hs.198161; phosphollpase A2, group iVB (cytbsollc); C2J>lA2_BJm|C;TM«M;SS=N; 5.4375 

449961; AW265634; Hs.133100; ESTs; pkinase,Forin-nke,ftecepXdoman^one; 6.435211268 

409012; AL1 17435; Hs.49725; OKFZP434I216 protein; PH.RhoGEF;TM=M;SS=M; 5.433333333 

443466; BE243123; Hs.321045; IKK-relaled kinase epsilon; Inducible Ik; pkinase.RI01;TM=M;SS=N; 5.429657795 
3 5 434826; AF155661; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 5.423322684 

423189; M59371; Hs.171596; EphA2; fn3,pkIn3S8,SAiyi,EPHJbd;TM=Y:SS»M; 5.421621622 

452291; AF015592: Hs.28853; C0C7 (ceil dMsion cycle 7, S. cerevisi; pMnase;TM=M;SS=N: 5.412 

432527; AW975028; Hs.102754; ESTs; none,none; 5.40625 

422278; AF072873; Hs.114218; frizzled (OrosofJliiia) homoiog 6; F2,Frizzled.7lm_2;TM=Y;SS=|y|; 5.405504587 
40 410024; AW191024; H8.5S016; Ivpolhelkd protein FU21935: SH3;TM=M;SS»N; 5.396 

434467: BE552368: Hs.231853; Homo sapiens cONA FU13445 fis, done PI4 7im_1^one: 5.391472868 
438974; AF08981 6; Ks.6454: chromosome 19 open reading frame 3; PDZ." 5.389250814 
439670; AF088076; Hs.59507; ESTs, Weakly similar to AC004858 3 U1 sm; nona,none: 5.382 

437016; AU076916; Hs.5398: guanine monphosphate synlhetase; PHD.SET,zf. 

45 O0(aEGFM.i»Wi.VVW.FCH,GATase,GMP^ynLa0cdudin.YEAT^^ 
424848; AI263231; Hs.327090; EST; SH3JPDZ.6uanyfateJdn,none; &36 

452721; AJ269529t Hs.301871; solute canier fanily 37 felyceroJ^ho; MORM,sugarJr.TM=Y;SS=M; 5.35971223 

405932; ; ; C15000305:gl|3805122[gblAAC69198.11 ^AFO; r8s:TM=M;SS=N; ^349226804 
^ 416714; AF283770; Hs.79630; CD79A antigen (immunogtobuiirv^odated; lg,ITAM,Zh_clus;TM=Y;SS«M; 5.346153846 
SO 453143: AA382234: Hs.356289; |m)teln tyrosine phosphatase, receptor t:seipin:: 5.3336673^ 

423973; AF038461;Hl136574;arad^Monate1^^ipoxygenase. 12R^ripoxygenaso,PI.AT?rM=M;SS=N;5J3 

408308: AL033377; Hs.44197: hypothetical protein DKFZp564D0462; none,none: 5.328 

41 4821; M63835; Hs.77424; Fc fragment of IgG, high affinity la, re; lg;TM3Y;SS=M; 5.316 

427337; Z46223; Hs.176683; Fc fragment of I9G, tow affinity lllb. n lg;TM=Y:SS=M; 5.309638554 
55 410165; BE560228: H5.71869; apoplcsls«sodated speck-Bee protein ; PAAD.DAPIN,CARO;TM=M;SS=N; 5.293560606 

415817: U88967; Hs.78B87; protein tyrosine phosphatase, receptor-4; fn3,YjJho8ph8tase,carbjBnhydr8se:TW=Y;88»M; 6.28 

427316; AA179949; Hs.175563; Homo sapiens mRNA; cDNA DKFZp564N0763 (f; none,6pectrin»SH3,PH.CH; 5,278947368 

431441; U81961; Hs.2794; sodium channel, nonvdlage-flated 1 alpha: ASC;TM=Y,'SS=N; 5.274746193 

416207; NM-01 4745; Hs.79077: Homo sapiens, done MGC:2908, mim, comp; none;TM=Y;SS=M; 5.272222222 
60 4151 17; AF120499; Hs.78016; polynudeoUde kinase ?-phosphatasej VlraLhelfaase1;TM=M;88=N; 6.27 

435905; AW997484: Hs.5003; KIAA0456 protein; SH3,RhoGAPJ=CH;T(*'M;SS=N; 5JS1865672 

409430; R21945; Hs.346735; splldng factor, argidne/^edne^di 5; OSPo.Rhodanese.none; 5.24B 

431846; BE019924: Hs.271560; uroptekin 1B;transmembrane4:TM=Y;SS=M; 5.232 

422017; KM_003877: Hs.110776: STAT Induced STAT lnhibitor.2; SH2;; 5.212418301 
65 438469; AK001455: Hs.5198; Down syndrome aiCcd region gene 2; none;; 5.209259259 

421502; AF111856; Hs.105039; solute canter family 34 (sodium phospha; RibosomaLL20,Na„PLcotrans;TM=Y;SS=N; SiQ2 

425356; BE244879: Hs.155939; Inositol polyphosphatB-5-phosphatese, 14; Exo_endoj)hos.SH2:TM=M;SS=N; 5.19979716 

437412; BE069288; Hs.34744; Homo saptens mRNA; cOi^A DKFZp647C136 (fr; ABC.tran.GTP^EFTU,ABCjnembr8r»,none; 5.199074074 
„ . 416602; NM.006159; Hs.367895; Protein kinase C-Wnding protein NEU2: EGF,vw,TSPNr, 6.198224852 
70 429556: AW139399; Hs.314807; ESTs; none;TM=M;SS=N; 6.192433863 

427857; AL133017; Hs.288679; hypothetfcal protein FU22865; myosliUie3d,IQ,2tMYNI>:TM=M;SS=M; 5.190251572 

400517: ; ; tengsin; ncne;Ti\*=M;SS=N; 5.18 

413436; AF238083: Hs.68061; sphlngoslne kinase 1; DAQte;TM=H*SS=N; 5.172881356 

423527; AI206965; Hs.105861; hypolhetfcal protein FU13824: none;TM=M;SS=N; 5,165060241 
75 419138; U48508; Hs.89631; ryanodine receptor 1 (sketetal); ton_trans.8PRY,RYDRJTPR,RyR.HAiR;TM=Y:SS=N; 5.156976744 

437809; AL137723; Hs.5855; Homo saptens mRNA; cDNA DKFZp434O0818 (f; none,none: 5.154676259 

452089: AB028949: Hs.183994: KiAA1026 protein; MeiaUophos;TM»M:S&:N; 5.152360515 

409340; BE174629; Hs.321130: hypothetk»l protein MGC2771; 

aaj)enTieases.pyridoxaLdeabromodom^.PHD>fflDATJiook.OOT^ 5.144859813 
80 442875; BE623003; Hs.23625; Homo saptens doneTCX;CTA00142 mRNA sequ; K-tetrB.DUF51.none: &142 

434883; AW381538; Hs.19807; hypothetteal protdn MGC12959: SH3,PH.WW.RhoGAP;; 5.141534392 

434808; AF155108; HS.2S6150: NY-I^N^I anfigen; none;TM>M;SS>=N: 5.14 

431341: AA307211: HS.2S1531; |»oteasome (Irosome, inacropaiiO subuiiiU 
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431685; AW298135; Hs^659: vw 3 oncogene; CH.OAa.P&Wnd.PH.RhoG£F.SH2.SH3,OC1;TM=M:SS=N; 5.129476584 
411190; AA306342; Hs.69171; protein kinase C^Bca 2; pktnase.panasfiLC,HR1;TM=M;SS=N; 5.121527778 

433573; AF234887; HS.576S2: cadheHn, EGF UG seveiHass reoe; 7!nL2£GF^herin,lamln!iLEGF.IamInbLG,Ti^^ 5.107438017 
433662:VVD7162:Hs.1S0826;IVVB2SRAB2S.fnentberRASonc^ 

419493; AF001212; Hs.90744; preteasome (presane, macropaby) 26S subu; COK5_BcBv8tor,Pa,non8; 5.095194065 

41 1027; AF072099; Hs.67846; (eutocyta Immunogtobull^fika receptor, ; lnosltoLP,lo;TM=M;SS=N: 5.092 

435243; AW292886; H3.348932; hypottieQcal protdn dl434014.3; IRFjwno; 5,092 

434417: AL1 10157; Hs.3843: Homo sapiens mRNA; cDNA DKFZp585F2224 (f; DSPcnone; 5l091922006 

415323; BE2693S2; Ks.949: neutrophil cytosoCc factor 2 (65kO. chr; SK3.TPR;TM=:M;SS4lt 5.088932806 

4186^ BE2475S0; Hs.86859: gnwUi (actor receptor-bound pro^ 7; SH2.PKRA;: 5.082640237 

420030: BE503994: Hs.146233; KIAA0418 gene product; SH^,nm 5.080645161 

444065; AW449415; Hs.10260; Konio sapiens cONA FU11341 lis, dona PL; SH3n &083953488 

421677; H64092; Hs.38282: ESTs; AlppArmadiBo seg.188; 5.058 

411165; NM.000169; H5.69089; galactosUasa. eiptia; Melibiase;: 5.054133858 

428479; Y00272; Hs.334562; cell division cycle 2. G1 to S and G2 to; pkbiase,ICtj)10,ICE_p20;TM=M;SS=M; a054 

423883; AF250238; Ks.134514:ATP^ndlng cassette, 8Utvtam9y A (ABC1:ABC_trBn.pholoR(;SRP54.(X 4a;TKNY:SSsft/t 5.051724138 

421917; AB028943: H3.109445; KIAA1020 protein; BTB,zM:2H2PI3^PI4JdnasB.PI3Ka;TM=M;SS=N: 5.051282051 

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor t; klnesln,fh3,Y_pho5phata5e:TM=M:SS<:N; 5.047311828 

410026; AI912061: Hs.55016; hypothefcal protein aJ21935; none,none; 5.04674221 

426395; BE151985; Hs.355669; hypothetical protein FU23316; pWn3se,none: 5.040298508 

418054; NH4.002318; Hs.833S4; lysyl oxWase^ke 2; SRCRJ.ysyLoxldase;TM=M;SS=M; 5.039039039 

444895; AI674383; Hs.22891; solute canter family 7 (cationlc amino ; ASC,dealh.TNFK.c6; &037151703 

413472; BE242870: Hs.75379; solute carrier family 1 (giial high afii; SDF:TM=Y;SScM; 5.034 

446272; BE268912: Hs.14601: hematopolaticceiVspeclfic Lyn suljstral; SH3,HS1 jBp;TM=M;SS=N; 5.03030303 

410772; 8E275297: Hs.194685; Homo sapiens done 24875 mRNA sequence: TopolsomJ)ac,Toprim;: 5.027985075 

418613; AA744529; K5.86575; mitogen-aclivated protein kinase kinase : pk!nasQ.GNH:TM=>lilkSS>=N; 5.014652015 

415166; NM.003652; Hs.76068; ca;t)oxypepOdaseZ:Ziucart)Opepl.I}ioxygenase.Fzn &012269939 

416498; Ua3632; Hs.79351; potassium channel, sutfamly K. menter 1; lon_trans:TM=Y;SS=M; 5.001811594 

410687; U24389; Hs,65436; lysyl oxMase^Ike 1; LysyLoxidase;; 4.997983871 

430024; AI808780: H5.227730: Integtin, alpha 6; integriaAFG^TM=Y;SS=M; 4.994871795 

409220; BE243323: Hs.51 233; tumor necrosis factor receptor superM; TNFK_cMat^UpoprOtein.5,TIL;T^NY;SS»M; 4987135506 

423804; AW40344B: HS.1706; Intarterorvstimulated transctfplton tad; IRF,zK:3HC4JBR,2lRanBP:TM»M;SS=N: 4.985185185 

423011; NM.00Q863; Hs.123022; adrenergic. alpha-20. recepton 7lirLl;TM=^;SS>4UI: 4.984 

419577; 1^531; Hs.91296: Integrin, dtpha 8; lntegrinJVF&GAP;T»sY;SS=N; 4.968 

402328; ; ; Target Exon; pkinase:T1i^M:SS^ 4.96728972 

421242; AW161386; Hs.13561: hypothetic^ protein MGC4692; none;NA:NA: 4.986334165 

435523; T62849; Hs.1 1090; membrsn&^pani^g 4^omalns. subfamily A; none;TM=Y:$S^M: 4.964491363 

414203; BE262170; Hs.78629; ATPase, Na? transporting, beta 1 polypep: none,none; 4.961956522 

409582; R27430; Hs.271565; ESTs; nons.Neurjchan.tJBD,Neur.diarLmemb; 4.946 

451292; AB037716: Hs.26204; KIAA1295 protein; SH3;TI^SS»N: 4.943161818 

453449; W16752; Hs.32981; senna domain, bmminogtobuBn dondn (Ig).; ig.Sema.PSI;: 4.930508475 

425233: Z17861; Hs.155218: ElB-SSkOa-assodatBd protein 5; SPRY,SAP,pkinase,fh3.lg; 4.926347222 

425247: N»^005940; Hs. 155324; matrix melaltoprotelnase 11 (stromelysln; hemopexin.P6ptkJasejyi10K4.92 

422282: AF019225: Hs.114309: apoBpoprotein U MotA^B;TM='Y;SS=M; 4.912181303 

442572; AI001922; Hs.135121; hypotheOca! protein FU22415; none,HSP70: 4.910224439 

425743; BE396495; Hs.159428; BCL2-assodated X protein; BcK2;TM=Y:SS=N; 4,909972299 

449523; NM_000579; Hs.54443; chemoWne (&C motif) receptor 5; 7tnLl;TM=Y;SS=M; 4.904 

409213; U61412; Hs.51133; PTK6 protein tyrosine kinase 6; SH2.SH3.pWnase;TftM4:SS=N; 4.897338403 

411770; NM_014278; Hs.71992: heat shock protein (hspl 10 family): HSP70;TM=M:SS«N: 4.894 

446872: X97058: Hs,16362: pyrimldinergic receptor P2Y. G^)roteIn c; 7tnLl :TM=Y;SS=M; 4.888 

423198: M81933: Hs.1634: ceB diviston cycte 25A; Rhodane8e.none: 4.884 

445462; AA378776; Hs.288649; hypothetical protein MGC3077: none;; 4.876379891 

448163; Y10805: H5.20521: HMT1 (hnRNP methyitransfwase, & cerevi: NusG:: 4,876117497 

427792; M63928; Hs.1B0841: tumor necrosis factor receptor soperfami; SRP14.TNFR.c6;: 4.873684211 

449027; AJ271216; Hs.22880: dipeptidytpepSdase 111; PeptklaseJM9.EGF.ig.NeureguiIn:TMsM;SS=N; 41872641509 

421541; NM.003942: Hs.105584; ribosomai protein S6 kinase. gOkO, polyp; pkInase,pkinasa.C;Ti^l\/tSSsN; 4869318182 

429819: AL120751; Hs.21 1568; eukaryofic translation IntSatton factor; none.none; 4868073879 

458873; AW150717: H5.345728: STAT induced STAT toWbHor 3; none^one; 4861538462 

437669; At358105: Hs.123164: ESTs, Weddy sindtor to mateh to ESTs AA; Rone.pUnase.pk!nase Q 4.854651163 

405545; ; ; TaiBel Exon; ABC.bBn,8RP54,ABCjnembfane:TM»Y:SS=M: 485 

424779: A1046851: Hs.153053; CD37 antigen; transmembrane4;Tl*=Y;SS=M; 4848387097 

424263; M77640; H3.1757: LI ceH adbeston molecule (hydrocephalus: fh3,ig.lRK:TM»Y;SS=M: 4.846153846 

425421; L11669; Hs.l57145; (atracydine trsnspoiter-liks protein; 8ug8r.ti;TM=iY:SS=M: 4843694494 

421267; BE314724: H$.103081; ribosomai protein S6 kinase. 70kD. polyp; pkfnase.pldnase C;TM=^^SS°N: 4842532468 

418736; T18979; Hs.87908: Snf2-related CBP acflvator protein; heIfcase_CATJwok,SNFi N:TM=M;SS4I; 4.842 

444143; AW747996: Hs.160999: ESTs, Moderately similar to A56194 throm; Bcl-2^ne: 4.841071429 

417331; AW411297: Hs.81972; SHC(Src homology 2 domai^containlng) t; SH2,PiD^fC2H2.SCAN>W^)lndlng,KRAB:TM=M;SS=N; 4839464883 

422010^ AA302049; Hs.31 181; Homo sapiens cDNA: FU23230 lis, done C; none.SDF,sugar Jn 4.837837838 

434521; NM_002267; Hs.3886: karyopherin aSpha 3 (Impoilin alpha 4); ArmadiDo seg.l6B:Th^M;SS=N; 4633333333 

450447; AF212223; Hs.25010; hypothetical protein P15-2: NTF2:TM=ft/tSS=N; 4.821666667 

453082; H18835: Hs.31608: hypothetical protdn FIJ20041: ton..tran3;TMaY:SS=M; 4.820936639 

417949: AL049795: Hs.83004; interteuMn 14; none>\rmadiik^&eg.l66.WO40: 4.81443299 ' 

439569; AW602166: Hs.222399: CE6P1 protein; EGF JNFRj:6,granufin,CUB,KeraflnJ2JIUTW=M:SS=iyt 4.81 

432581; AU076465: Hs.278441; KIAA0015 gene product: PP2C:TM=M;SS=N; 4.805063»1 

432194; AU)40601: 85.273219; breast cancer anS-estrogen resistance 1; SH3:: 4803191489 

431472; AK001023; Ks.256549: nudeoBde binding protein 2 (ExoO Mn; fer4JjlfHParA>^PSJ(InaseAsAuATPase:TliNy(SSsN; 4800990099 
450690; AA298696; Hs.333418: FXYD domaln^t^ ion (ranspot rag; ATP1G1 JTiU/tAT8:TM=Y:SS«^\ft 4.795480881 
448850; AF288687; Ks.9275: CGI-152 protein: E1-E2jaPase.Hydrotase:TM:»Y:SS=N: 4.776923077 

427681; AB018263; Hs,284232; tumor liecrosis factor receptor superfemi; death.TNFR.c6,PH.XIInkJWioGEF,MetaltotWo.5:TM=M:SSsM: 4772196252 

432827; Z68128; HsJ109: Rho GTPase acBvating proteta 4; FaiRhoGAP,SH3;TM=M:SS=N; 4.7601 15607 

433376; AI249361: Hs.74122; caspase 4 apoptDsiwBlated cysteine pr; CARD.IC^j)10,ICE_p2D;; 4.751162791 

419981; AA897581: Hs.128773; ESTs; pklnase.DAGLP6*WjWnas6L<;OPR.non8; 4748 

431657; AI345227: Hs.105446; ESTs» VteaWy sImDsrto B34087hypolhBtfcpWna8ePA28jIpha,PA2BJ^ 
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412958; 6E391579t Hs.7S087; Fs&«cBvatBd seMQueoflbie Unase; nonen 4738781609 

41444% AU0772G8; Hs.76144: plateleMeilved gnwQi fector receptor.: i9.pKin8se(TM»Y;S&44: 4.733 

419250; AW770185; m.356066; U5 snRNP-epedfic protaln, 116 kO; 7t^L13AH.zM:XXX;DN^Jnell1yl3se: 4725454546 

417903; NM.002342;Ks.1116; rymphotoxinbetarec6p1or(TNFRsupe}fanxTNFR.c6:TM=^ 

414368; WTOUI; K3.75939; uridine monophosphate kinase; PRK,CoaEn 4.718838565 

426059; BE292842: Hs.166120: interferon re^tdalory factor 7; iRF;; 4.718543046 

414788; X78342; Hs.77313; cyc&HJependent kinase (C0C2mke] 10; pkinase;TM=M;SS>=N; 4.708 

407601; AC002300; Ks.37129; socfium channel, nonvoltageisated 1, t)etatASC;TM»Y;SS='M; 4.707920792 

448520; AB002367; Ks.2135S; dout}!eco(tin and CaM Unas&Uke 1; pkin8se.DCX:T|yNlA.'SSBN; 4.707671958 

407143; C14076; Hs.332329; EST; none;TM»Y:SS=M; 4,68^75815 

42858% BE336699; Hs.185055; BENE protein; none;TM=Y;SSsM; 4.681818182 

408806; AWB47814; lte.75608; Homo sajflens cONA: FU21532 fb. ctone C; SH3.PDZ,6uanylaleJdn,none; 4.680440771 

448133; AA723157; Hs.73769; fotale receptor 1 (adult); Fo!atB.fecMlP:TM=M;SS=M; 4.679841897 

418836; A!655499: Hs.161712; ESTs; pkin3se.Actjvauecp^DZ;:U5,de3lh; 4.679180887 

425308; M97639; H3.155585; receptor tyrosine kinase^ike orphan rec; ig,ki^l8.pkinase.Fz:TM=Y;SS^ 4.675342466 

414665; AA160873; H3.3563(J7: serum amykAJ A1; zl.C2H2.BTB,ICtelrajione; 4.67447496 

449843; RS5337; Ks.24030: sotutacanier famQy 31 (copper transpo; none:TM=Y;SS^ 4.673701299 

428245; AF151048; H1183180; anaphase proniotingcoiDplexsutRinlt 11 (y; none;: 4.656756757 

417088; M5491S: HS.B117Q; pinvl oncogene; pkinas8;TM::M:SS4J; 4.656190476 

420340; NM.000734: Hs.97087: C03Z antigen, zeta polypepQde (7173 conv rTAM:TVNM;SS»M; 4.66 

425966; NWL001761; Hs,1973; cyclin F; cyclh.F-l)Ox/;ycIin_C;TM=M;SS=N; 4.644 

417929; R27219: Hs.74647; Human T-ceH leceplor active atpha«hain: ig^tihydrolase; 4.640384615 

430603; AA148164: Hi247280; HBV assodaled factn; zK:3HC4zH%anBP.pMnasa; 4.630853266 

419273; BE271180; Hs.293490; ECTs. similar to 138022 hypottiefi; none.non8; 4.628 

453880; AI803166; Hs.135121: ESTs. WeflWy slrnRar to 138022 hypotheti; HSP70,none: 4.619047619 

459399; BE407712; Hs.153998; creatine kinase, mitochondriai 1 (ul^I; nono,none; 4.618577075 

412970; AB026438; Hs.177534; dual specifictty phosphatase 10; Rhodanese.OSPc;; 4.616 

433577; AW007080; Hs.284192; ESTs; none.none; 4.614 

*444838; AV651680; Ks.208558; ESTs; integrtnJ^F&GAP^none; 4.612149533 

408369; R3B438; Hs.118747; SLC15A2 Solute carrier family 15 (H-»^p; PTR2;TM=Y;SS*N; 4.602 

450825; AOQ05954: Hs.25527; tight junction protein 3 (zonaocctudens; PDZ,Guanyiate_kin;; 4.596875 

443951; F13272; H8.3S6835; fBrriHn. light polypeptide; PMP22_Qaudln.none; 4.587931035 

433083; AL042759; Hs.191762; ESTs; SH3,PX;TM=M;SS=N; 4.586 

407239; AA076350; Hs.67846; leukocyte ImmunogtobuBn^ receptor. ; lg;TM=Y;SS=M; 4.58557047 

412926; AI879076; Hs.75061; macrophage myifetoyteted atanlne-rlch C ; MARCKS;; 4.579087049 

422009; AI742845; Hs.1 10713; DEK oncogena (DMA binding); SAPn 4.576347305 

412584; X54670: Hs.74085; DNA segment on chramosoms 12 (unique) 24; nonejectin^c; 4.57312253 

414561; AI064813; Hs.195155; Homo sapiens amino add transport system; Aa.trBns;TM=Y;SS=N; 4.573015873 

422627; BE336857; Hs.1 18787; transforming growth factor, beta-induced; Fasciclin.ABC.tran>^C.membrane,GTP_EFTU;TMsM;SS=M; 4.570525316 
459053; Ai807052; Hs.97792; ESTs; none.7tnUGPS; 4.569230769 

424247; X14008; Hs.234734; lysozyme (renal anvtoWosis); ly3jg,FAD.Synth,ldh.Mh_C,pklnase;; 4.566195373 

439975; AW328081; Hs.6817: inosine triphosphatase (nucleoside triph; Ham1pJike;TM=M;SS«N: 4.56056338 

416178; AI808527: Hs.192822; serotogteally defined breast cancer anti; none;TM»M:SS44; 4.658 

408051: A1623351; H5.172148: ESTs; PH.RhoGAP,nonB; 4.552307692 

42184G; AA017707; H8.1432; protAbi Unase C substrate 80K'H; ethandjdl jeoepLac 4.547761194 

439859; AW970780; Hs,S9483: teudoMich repeal^ontalning 6 protein; 7tm.1,LRR;TM»Y;SS=N; 4.54716981 1 

426201; AW182614; Hs.128499; ESTs; SH3,none; 4.641666667 

42490S; N^4«002497; Hs.153704; NIMA (never In mllDsls gene a)^elated k; pkinase;TM=M;SS=N; 4.636 
445229; BE276013: Hs.343828; Homo sapiens mRNA for FU00086 protein. ; G-a!pha;TM=M;SS=N; 4.5305882% 
413109; AW3Bg84S; HS.11085S; ESTs; PH04.none; 4.529761905 

426125; )(87241; Hs.166994; FAT hJmor suppressor (Diosophlla) homi*); EGF,cadheiin.lamlnbL6:TM»Y;SS=« 4^29710145 
402330; ; ; Target Exon; pkInase.none; 4.526070175 

439238; IW7305; Hs.302161; EDM (endofliefial differenSaOon. sph; 7trn_1:TM=Y;SS=M; 4.524 
433437; U20536; K5.3280; caspase 6. apoptosis-felated cysteine pr. iC^10JCEji20;: 4.523715415 
413781; J05Z72: Hs^; IIMP (inos&ia monophosphate) dehydrogenas; C8SJMPDH.(;iMPDKJ«.NPI>;TM»M:8S>^ 4522900763 
431429; AF072813;Hs.252831; leOcuIon 3; Reticulon.FzJgMigte.pkinase;TM=Y;SS4l; 4.612 
. 424078; AB006625; Hs.139033; patemdiiy expressed 3; zM:2H2,KRAB,none; 4.512 
420602; AF060877; Hs.99236; regulator or Giirotein signalling 20; RGS;TM=M;SS=N: 4.51 
449101; AA205847; Hs.23016: 6 protein-coupled receptor, 7tm.1;TM=Y;SS=ti^ 4.506 
408157; AA047685; Hs.62946; ESTs; none,pkinase; 4.504 

446291; BE397753; Hs.14623; interferon, gamm&indudble protein 30; GiLT:TM=M;SS=Y; 4.50215208 

431326; AW970S80; Hs.198689; K1AA0728 proti^ none.none; 4.501 

428072; BE258602: Hs.182366; heat shock protein 75; KATPase.c.KSP90;TM»K«S8=N: 448828125 

415149; X12451; Hs.78056; cathepsin U Pept}dase.C1;; 4.484375 

421959; AW751497; 83.98370; cytochrome P450, subfamily IIS. polypept; p450:TM=Y;SSs=M; 4.48 

445143; U29171; H3.378918; casein kinase 1, delta; zf<3H(M.RIanii.2f*.box,NHL.pkinase.zf-MIZ;TM==M;SS=N; 4.478092784 

421071; A131 1 238; Hs.104476: ESTs, WeaWy slntiter to CGHU1 E collagen ; none;TNNY;SS8M; 4.4773371 11 

410590; BE615216; Ks.64746; chiortde hitrace!lular channel 3; none;TM=M;SS^; 4476 

438774; AA431620; Hs.379034; hypothetteal protein MGC2746; none.none; 4474874372 

410726; Ai6238S9; Hs.1S936; ESTs; pkinase.proJsomeiase.none; 447 

429903; AL134197; H3.93597; cycEnntependent kinase 5. regulatory su; CDK5.ac6vator.none: 4488 

426485; NM.006207; Hs.170040; plateteMeiived growth factor receptors ig;; 4464944649 ' 

433846; AA603319; Hs.1551 95; ESTs: pou.homeoboxJg.chan.ANFjeceplor, 4.458 

410293; AK00D047; Hs.61960; hypothetical protein; K.tetra;TM=M;SS=N: 4,453020134 

453464; AI8B491 1; Hs.32989; receptor (cateitonin) acSvlty modl^g; none;TM3Y;SSBt4: 4448198198 

410583; AW770280; Ks.362S8; ESTs. Moderately similar to JC5238 galac; SH3.PDZ.&janylateJdn,none; 4.446927374 

441455; AJ271671: Hs.7854: zincfiron reoulatBd transporter-Ske; Zip;TM=Y;SS=M; 4.445010183 

453084; R40334; H3.89463; potassium large conductance caktea:8; nons,none; 4436480187 

443303; 087319; Hs.9218; caspase 7, apoptosis-reiated cysteine pr; pkinase.lCE_p10.ICEj)20;TM^SS=A^ 4433411215 

411825; AK000334; Hs.3524 15; solute canter family 39 (zinc transport; SNF.2p;TM3Y^S^; 4.432765152 

428376; AF119865; Hs.18401 1; pyrophosphatase Cmoiganic): Pyrophosphat3se;TM=M;SSsN; 4428571429 

429S92: AB029041: Hs.209646: KIAA11 18 protdn; TrapQnln.E»jBndoj)h08.IQ;Ttt:>M;SSsN; 4.428 

419344 U9490S: H8.277445; dacylg^oerol Unase. zeta (104kP); 8nk.DAGKa,0A6Kc;DAGJ>EM^TM=M;SS»N; 4.426229508 
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427138; N77624: Ks.173717: phospha&fic add phosphatase type 2B; PAPZnona; 4.4234375 

414496; W73853; Hs.3S$424: ESTs; pkinase«F5J8_type_C^_8hort.none; 4.42114094 

429432: A1678059: Hs^676; synaptonefnd complex protein 2; none;TM=M;SS=N; 4.42 

429922: 297630; Hs.2261 17; HI histone family, member 0; Ilnker.hlstone;TM=M:SS4J: 4.419207317 

446700; AW208S7; Hs.156326: Human DNAsequence from done RP1M45L2; none;TM=M;SS=N; 4.418181818 

43541 1; AW444619; Hs.138211: ESTs: nonejikinase; 4.414 

414581: AA256213: Hs.72010: ESTs; none.Canuacvttrans'.ChonneJdnBse.SCOI-SenaGhrcos.t^ 4.41 

418558: AW082265; H5.88131; Fas crNFRSF6)-8ssoci3ted via death dom^ deaUvDEDn 4.406523909 

442259; AI690269: Hs.201345: ESTs; Acetyltransf,RhoGAP,FCH.SH3.Kelch,fn3: 4.406 

415660; D56051; Hs.78888; diazepam Wmflng InWWtor (GABA recepto; ACBP:TM=M;SS=N; 4.404678363 

434419; AIJ040606: 85.296938; dual spedficily phosphatase 7; DSPc;TM=M;SS»N: 4.404 

404440; ; ; NM.021048:Homo sapiens melanoma anOgea; MAGE;TM=M:SS"N; 4^ 

435542; AA6B7376; Hs,351226; ESTs; SH3,lg.pklnase.Paspectrin.RhoGEF^ne; 4.394 

41 3367; NM^006517; Hs.75317; solute carrier fanHy 16 (monocaitoxyfic; 8ijgar_tr;TMaY;SS=N; 4.39028777 

435732; AF229176; Hs.1 231 36; leucine ilch repeat and death domain con; none,nonB: 4.38490566 

427359; AW020782: Hs.79881: Homo saj^ens cONA: FU23006 fis. done U 7biLl*none: 4.382129278 

425749; AW328587; Hs.159448; surfeit 2; nonen 4.382 

417874; BEei616Q; 88.82829; protein tyrosine phosphatase, non-recept; Y_phosphatase:TM=Y;SS=N: 4.381422925 
414806; D14694; Hs.77329; phosphafldytseifne synthase 1; PSS:TM=Y:SS=M; 43)681818 
431837; T79326; Hs.331967; dfacloiy receptor, family Z subfamily ; none,7tnL3.sushlANF_receptor; 4.376 
417115; AW952792; K3.334612: small nudear rtbonudeoprotein polypept; Snvpidnsse;: 4.370247934 

434876; AF160477; Hs.61460; Ig superfamily receptor UIIR; lg.Rhabd jlycop:TM=Y;SS=M; 4.37 

430379; AF134149: Hs.240395; potassium channel subfamay K, member 6; ioiLtrans;TM=Y:SS=M: 4.367777778 
403912; ; ; C5000394*:9li12737280(refpCP_006682.2| K none:TM=iyi;SS4l; 4.367684478 
426268; AF083420; H5.168913; sertneAhreontne kinase 24 (St02a yeast; iricinase;; 4.386348449 
434263; N34895: Hs.79187; ESTs: Ig.none: 4.358527132 
404760; ; ; Target Exon; cadhefin;TM»M;SS=M; 4.356 

413076; U10564; Hs.75188; wel (S. pombe) homofeg; pHnas©;TW=l*SS=N; 4 J5472973 

420757; X78592: Hs.99915; androgen receptor {dihydrotestostarone r; homra»jBcrWM/ndrogenjeoep;TM=M:SS=N; 4.354 

426812; AF106365: Hs.172613; sdute carrier family 12 (polasshimfchio; none:TM=Y:SS=N: 4.353244838 

431674: AA098901: Hs.301642; G-protein coupled receptor none,GCV.H; 4.35 

431886; L77954; Hs.271980: mitoge^activated protein kinase 6; pldnase;TM=M;SS=N; 4.347893916 

447719; BE387402; Hs.19333; hypottieflcal protein FU10349: adenylateklnase,ATP-Wnd:TM=M;SS=N; 4.346007605 

424837; BE2761 13; Hs.333034; N-acctyltransferase. homolog of S. cerev; AcetyllransftTM=M:SSBN: 4.344 

449437; A1702038; Hs.100057: Homo sapiens d)NA: FU22902 fIs. done K; none.none: 4.334722222 

411768; NM_013371: Hs.71979: Interieukin 19; ILIO;; 4.322 

445350; AF052112; Hs.12540; lysophosphortpase !; abhydrDlaseJ?;TM=M;SS=N; 4.320359281 

425964; AW889928; Hs J071; progesterone membrane binding protein; homeobojcnone; 4.318867925 

450998; BE387614; Hs.25797; splldng factor 3b. subunit4, 49kD; mm:TM=M;SS=H: 4.316573557 

408908; BE296227; Hs.250822; serlnefthreonlne tdnase 15; pklnase;; 4.316 

400290; H18836; Hs.31608: hypothetical protein FU20041; none,CysJnol: 4.314728682 

438899; AF085833; Hs.135824: ESTs; none.PI3,P14Jdnase.P13Ka.PI3K.C2,Pl3K_rbd.PI3Kj)85B; 4.314084507 

418883; BE387036; Hs.1211; add phosphatase 5. tartrate resistant; MetaBophos:TM=M;SS=M; 4,312121212 

419607; R52557; Hs,91579; Homo sapiens dona 23783 mRNA sequerwe; IMP4;TM=M:SS=N: 4.304407714 

421532: AW138207; H8.146170I hypothefical protein FU22969; Am™adiiloj5eg.HEAT;TM=M;SS=N: 4.304 

430017: AA263172; Hs.35; protein tyrosine phosphatase, non^ecept: Yj}hosphatase;TM=M;SS-M; 4.302 

447224; BE617125; Hs.142076; gb:60144l664F1 NIH J^.65 Honw sapiens c; none;NA:NA: 4.302 

425424; NlyL004954; Hs.157199: ELKL moffl Wnase; pWnase.UBA.KA1;TM=M;SS=N; 4.301639344 

454042: H22570; Hs.47860; hypothetical protein FU20093; ig.pklnase,U^R.U^4J^CT,none: 4.30141844 

446143; BE24S342: Hs.306079; 8ec61 homotog; NtJDIX,8ecY»E1jdehydrog.transketj)ynTM»Y;SS^4.300872093 

42B981: BE313077: Hs.93135: ESTs, Weakly sbnilar to ALU^JIUMAN ALU 8; none,mn; 4.292620865 

432562; BE531048: Hs.278422: DKFZP58661122 protein; 2^C2H2:TM=M;SS=ht 4.290258449 

432945; AL043683; Hs.8173; hypothetical protekt FU10803: none;TM=M:SS=M; 4.288405797 

421921; H83363: Hs.355993; transtocese of inner mitochondrial membr. zf-TIm10.DDP.6fhand,Ca8pectrln,se(|dn;TM=M;8841; 4.284 
448564; Atj044962; H8.21453: inositol 1.4,54risphOGphate 3-klnase C; IPK.*; 4.28067554 
453941; U39817; Hs.36820; Bloom syndrome; DEADJteltease.C|HRDC:TM44;SS4l; 4.28 

437712; X04588; Hs.e5844: neurotrophic tyrosine kinase, receptor, ; TroponTyosln,pklnase,IJ^LRRCT,HydantDlnaseJ>tyrfantolnase^TM=*l;SS^ 4.277477478 

417426; NM_002291; Hs.82124; laminln. beta 1; temInin.EGF,laminln„Ntem».integrin.B;; 4.276162791 

450296; AL041949; Hs.24756: hepatocyte gnwvth factor-regulated tyros; none,none; 4.273927393 

438012; AA393254: Hs.43619; ESTs; Annadilto^.none; 4.273134328 

409619; AK001015; Hs.55220: BCL2-assodaled athanogene 2; BAG;TM»M;SS:'H; 4.273109244 

418529: AW005695: Hs.250897; TRK-fused gene; Band.41.ERM.pklnase,LRR,LRRCT,MAM.Nudeoplasn*iTropornyosln,<»'Kf^^ 
4.272123894 

415214;AI445236; Hs.125124: EphB2;fn34Jkinase.SAM,EPHJbd;TM=Y:SS=M; 4.268 ' 

438233: W52448; Hsi6147; ESTs; Neur.dtanJJ0.Neur„chanjnemb,MA6E; 4.26284585 

429019; AA443280: Hs.279907: myosin IIIA; myDSlnJwad,pWnase.PRK,IQ;TNNM;SS=N; 4.262 

424959; NM_005781; Hs.153g37; activated p21cdc42K8 kinase; kfii.idK.C.SH3,pkinase.UBA:TMsM;SSBN; 4.258695652 

453655; AW960427; Ks.342874: translbrming growth factor, beta recepto; zonaj)eIlucida.none; 4.257208766 

417414: AA434589; Hs.357676; dUTP pyrophosphatase; dlKPasB,KRAB: 4.251785714 

453905; NM_002314; Hs.36566: UM domain kinase 1; pklnase,UM.PDZ,zf-PARP;TM=M(8S"N; 4.249116608 

424232; AB015982: Hs.143460: prot^ kinase C nu; pklr^e,DAG_PE.bInd,PH;TM=M;SS=N; 4,247692308 

404883; ;; ENSP00000216009:Sodiun>glucose cotranspo; SSF;TM=Y;SS=M; 4.242424242 

412507; L36645; Hs,73964: EphA4; fh3,pkinase.SAM.EPH_B)d:TM=Y;6S=M: 4,239285714 

411089; AA45M54: Hs.355702; celi division cyde 2-IIke 1 (PITSLREpr,none,nonB: 4.237313433 

436957; AA902488; Hs.122952: ESTs; none.DAGKcDAGKa.RA,OAGJ^-bInd; 4.236 

452568; AA805634; Hs.300870: Homo sapiens mRNA; cDNA DKFZp547M072 (fn PI3_PI4Jdnase;TM=M;SS=M; 4.23537415 

433535; AF1 11106; Hs.3382; protein phosphatase 4. regulatory subunl; HEAT:TMnM;SS=N; 4.234793187 

432728; NiO06979; Hs.278721; HLAdass H lagbn expressed gene KE4; Zipjis.dian:TM»Y;SS4« 4.234545455 

416350; AF188625; Hs.189507: phosphoDpase A2. group m, phosGp;TM=M;&S=Y; 4.234 

409533; AW969543; Hs.144609; mltogen^ated protein kinase kinase ; PepQdase_C48.none; 4.230666667 

427127; AW802282; H3.22265; pyruvate dehydrogenase phosphatase; PP2C,none; 4.228009259 

403362; ; ; NM-OOI 61 S':Homo sapiens adin. gamma 2, ; adinn 4.22688478 

417866; AW067903; HS.B2772; ooDagen. type XI. alpha 1; CdIagen,GOUa.TSPN JamMiLaOorA;: 4.226368889 
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42889n AJ245719: Hs.19438Si hypotheOcd protein FU20234; SH2:TiyNyl;SS«N: 4^4731 163 

425771; 8E56177G: H3.159494; Bruton agammagtobUlinsiria (ynsIneMnas; 8H2,8H3jMn35e,PH.BTK;TMsM;SS^ 4.223684211 
416566: C21220: Hs.321717; hypotheticd protein FU1(W75;2f^H2.BTB.ietBtia.7tin_1; 4.222807018 
454098; W27953; Hs.217493; RakophiHn; none^ne; AJ22 

424381; AA285249; K5.146329: protein Mnase aik2 (CKEK2); pldn3se.FHA,Dnal;TM=M;SS=N: 4.21875 

419223; X60111: Hs.1244: C09 antigen (p24); transmenibrane4;TM=Y;S&4y( 4.217130215 

436756: Z1 8384: Hs.198298; avian sarooma (SchmUt^pIn A-2); none,none; 4.216 

450167; AA446404; Hs.24563; MTF2.ielaled export protein 1; NTF2:TM=M:SS=N; 4.215163934 

416224; NfA_002902; H3,79088: neticulocafltin 2, EF-hand caldum Wndln; cfhand;; 4.212041885 

432539; AL138169; Hs,278378; kaiyopherin t)eta 21), Iransportln; none,DS,UPF0139,ayoO-hydro.38; 4.207407407 

416651; AA634543; Hs.79440; IGF4 mRNA-binding protdn 3; KH^tanain.inn;TM=M;SS»N; 4.206 

432284; AA532807: Hs.287740; ESTs; pWnase,none; 4.205454545 

418758; AW95931 1; Hs. 17201 2; hypothetical protatn DKFZp434J037; pWnasQ.Ri01;TM=M;SS=N; 4.204142012 

450056; BE047394; Hs.502; ESTs, WeaWy sWlar to S71512 liypclheti; ABC^tran>\BCjnerol)rano.ig.MHCXl)eta,Si^54.proteasoni8^Cjnen*^^ 4.202572347 

412817; AL037159; Hs.74519; proleasome (prosoms, macropain) 25S subu; PCjep;TM=M;SS=N; 4.202061856 

425334; AA356730; Hs.323949; kangal 1 (suppress of tumorigenfcOy ; transmemt3rane4,non9; 4.195014663 

449335; AW150717; Hs.345728; STAT induced STAT Inhibitor 3; SH2:TM=M;SS=N; 4.192248062 

41502% AA932146; H5.35S397; Homo sapiens done TCCCtA00164 mRNAseqM: non6;NA:NA: 4.192 

443907; AU07M84; Hs.9963: TYRO prolan tyrosine kinase binding pro; nonB;TM=M;SS«Y: 4.191878981 

445330; R52658; Hs.21691: ESTs; 7tnLl,none; 4.189922481 

430016; NWL004736; Hs.227656; xenotropic and polytropic relrovinis rec; SPX,EXS;TI*=Y;SS=N; 4.188333333 
434633; AI189587; Hs.120915; ESTs; 8H3,PH.Ri!oGAP,none; 4,187106918 

452908; AB001451: H5.30985; neuronal Stic adaptor homolog; 8H2.PI0,Zn.c8ibOpept:TM=l\^SS=N: 4.1 66885246 
439318; AW83704e; Hs,65Z7; G prolein««ipied receptor 66; 7tnL2,CytC.asm.6PS;TM=Y;SS«M; 3.930957684 
432201: A1538613; Hs.298241: Transmembrane protease, sertne 3; Idljecepta,trypsin;TM=Y:SS=Mt 3.893103448 
428969; AF120274; Hs.194689; artemln; TGF-beta;; 3.884030418 
444633; AF111713; Hs.1 2284; junctional adliesion molecule 1; lg;TW'Y;SS=M: 3.831669044 

432305; M62402; Hs.274313; insulin-ni® growtli factor binding prote; tlvro*>buIiiLl JGFBP.A2KLN:TM=M:SS=N; 1742996346 

405547; ; ; NM-01B83y:Homo sapiens transporter 2, A; ABC_lran.SRP54»ABCjnembrane;TM=Y;SS»Mt 3.876 

407853; AA336797; Ks.40499; diddtopf (Xenopus laevls) homotog 1; nonB;TM=M;SS=Y; 3.634 

426427; M866g9; Hs.1 69840; TTK protein kinase; pkinase;; 3.562 

427585; 031152; H5.179729: collagen, type X, alpha 1 (Schmid metaph; C1q,Collagen;; 3.49 

405546: ; ; NIWL018833»:Homo sapiens transporter 2. A; ABC_tran.SRPS4.ABCLjnenibfBne;TMaY;SS=M: 1422661871 

439820; AL360204; Hs.283853; Homo sapiens miWA fuH length Insert c!»l; none^ons: 1402 

404210: ; ; NiyL005936:Hoino sapiens myetoW/lymphoId ; FHA.POZ,RA.DlL;TM=M;SS=N: 1368807339 

424522; AL134B47; Hs.1499S7: ribosomai protein S6 kinase. 90kD, polyp; pWnase.pklnase.C"; 1213402062 

418678: NM.001327; HS.B7225; canceiftesBs anlioen (NY^ SO-1); none;TMsM;SS=N; 3.084 

451106; BE382701; H5.2S960; N-MYC oncogene; HlHMycJLlBnnsTM»M;88=N: 1.65 

TABLE 17B 



Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 

Genbank accession nunrtbeis 



Pkey CAT Number Accession 

4188G9 12789.14 AA229762AA230035 

TABLE 170 

Pker. Unique numberoonesponding to an Bos probeset . ^ 

Ref: Sequence souree. The 7 digit numbere In this column are Genbank IdentSer (Gl) numbers. Dunham I, el tH: nfyts to the pubikatkm entitled 'The DMA 

sequenceof humandiromosome22.' OunhamLetai., Nature (1999) 402:48W95. 

Strand: Indicates DNA strand from whtehexDnswaropredlctel 

NLpositkm: Indicates nudeotMe portions of predicted exons. 



Pkey 


Ref 


Sbarat 


NLposaton 


402075 


8117407 


Phis 


121907-1220%.122804-122921.124019-12416 


401781 


7249190 


Mnus 


8321$-83435.83531-836S6,8374a839013423 


405484 


5922025 


Plus 


199214.199579,199672-199920.200282-20049 


405932 


7767812 


Minus 


123525-123713 


400517 


9796686 


Minus 


4999&«0346 ' 


402328 


4464283 


Minus 


13758.13922.14558-14752 


405545 


1054740 


Plus 


118677-118807.119091.1192g6,121626-12182 


402330 


4464263 


Minus 


15325.15380,15484-15568,15842-15915 


404440 


7528051 


Pkis 


8043(^61581 


403912 


7710730 


Mbius 


72000-72290.72431-72700.72929.73199 


404760 


7767724 


Plus 


223266-223352.224472-224585 


404883 


5101762 


Minus 


94626-94730.96998-97069 


403362 


8571772 


Plus 


64O9&^260 


405547 


1054740 


Ptus 


124361.124520.124914-125050 


405546 


1054740 


Plus 


124010.124183 


404210 


5006248 


Ptus 


169926.170121 



Table 16A: 194 Up^gulated Genes ki Uterine Cancer Versus Nonnai AdUlt Tissues 

Tabto 18A iists diout 194 genes u^-regulated to uterine cancer compared to nonnal ^^^^ 

enty such that the ratio of *ever^' uterine cancer to 'average' nomtal aduil (issues was greater Iha^ 

annngst uterine cancers. The 'average* nomel adUU tissue level MS set to Ihe 9^ 
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Isvals of non-specific hytNlffizaSoiK the 1S>^ 



Rcey: Unique Eos probeset identiSer number 
S ExAccn: Exemplar Accession number, Genbank accession number 
UhigeneiD: Unlgene number 
Unigene 'nile:Unigen6 gene tale 
R1: RaOo of tumor vs. normal tissue 

10 Pkey ExAccn UnigenelD UnigeneTlOe 



15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



449034 
435094 
438817 
421478 



450451 
428153 
428187 



435775 
430491 
441377 
40Q289 
400292 



A)624049 

At560129 

Ai023799 

At663243 

U65D11 

AW591528 

AW513143 

AI687303 

AA628995 

AA731111 

AL109791 

BE21M39 

X07B20 

AA250737 



442438 
447350 
453964 

443830 

459325 

415245 

446608 

426635 

433426 

437960 

4410B1 

440048 

447835 

440870 

412925 

408562 

429272 

453197 

437938 

420610 

448672 

452461 

413335 

449611 

449260 

412140 

443715 

432113 

424834 

410858 

426465 

446704 

419503 

444342 

436078 

406687 

445256 

440901 

434636 

429334 

418852 

459583 

438787 

400301 

428771 

444929 

453922 



AI375S72 
AI961486 

AI142095 
AW088369 



410102 
433283 
443270 
410247 
422589 
452771 
407275 
420440 
451105 



N75217 

BE395109 

H69125 

Ai669586 

AI584019 

AA897461 

AW591623 

A1687284 

AI089319 

AI436323 

W25140 

A1916269 

AI950087 

AI683183 

AI955511 

N78223 

AI613318 

Aig70394 

AA741180 

AA21S691 

A1583187 

AA935085 

AK0Q1432 

AW10S231 

Ar758948 

AI337228 

AA243642 

rayL0i4398 

AI193277 

M31126 

Ai635931 

AA909358 

AAD83764 

D63078 

BE537037 

AI907673 

AAg08554 

X03635 

AB026992 

AI685841 

AF053305 

AW248508 
BE041135 



Hs.277523 

Hs.163242 

HS.972S8 

Hs.30743 

Hs.202072 

Hs.98367 

HS285529 

Hs^2620 

Hs^91891 

H5.241559 

HS.2026S6 

HS.22SB 

H8.72472 



Hs.1939 

Hs.12744 

Hs.143273 

H5^82164 

Hs.27252 

Hs^7846 

Hs.129327 

Hs.133525 

Hs.222194 

Hs.169006 

Hs.158469 

Hs.164129 

Hs,150539 

Hs.179243 

Hs^1141 

Hs.1 10657 

Hs.109057 

H8.99348 

Hs.225106 

Hs.108106 

Hs.48442 

H8.197Q75 

H9^S79 

Hs.73625 

Ks.9700 

H8.152385 

K8.1534Q8 

Hs.192035 

Hs,197083 

Hs.137422 

H8.10887 

Hs.120954 

Hs572620 

Hs.147613 

Hs.128612 

Hs.241334 

Hs.186180 

Hs.273294 

Hs.192756 
H8.1657 . 
Hs.193143 
Hs.161354 
Hs.36708 

Hs^79727 
H3.175622 



NVL004272 Hs.9192 

AF1B1721 Hs.61345 

AA312735 Hs.179725 
T05477 
AI364186 

NH.002407 Hs.97644 
A1761324 



gb:ts41flD9a1 Na.CGAP_Ut1 Homo salens cONA 

EST 

ESTs 

ESTs 

PmferenOaDy expressed anOgen In melanoma 
ESTs 

hypothetica! piotefn RJ22252 

G pfoteiiKDupled receptor 49 (GPR49) (HG3Q 

integrln; beta 8 

ESTs 

Homo sapiens mRNA fiill lengtb Insert cOf^ do 
ESTs 

Matrix Metalloproteinase 1 0 (Stromolysin 2) 
BMPR-4b;bone morphogenetic protein receptor lb 
predicted exon 

Obu}s26b03^1 NCi.CGAP_Kid5 Homo sapiens cONA 

HER4(i>erb-B4) 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Moderately sImHar to plakophiOn 2b [H 
ESTs, VfeaMy similar to envebpe protefai (Rs 
ESTs 

HonK> sapiens cONA FU13793 lis, done THYRO10 
ESTs 

Roundabout homolog 2 transmembrane receptor (n)bo2) 
ESTs 

ESTs. Weakly similar to ALU5.HUMAN ALU 8UBFAM 
ESTs; WeaUy similar to Gag^ pdyprotdn 
distaMess Itomeo box 5 
ESTs 

IranscripOon factor 

ESTs 

ESTs 

ESTs 

RAB6 bitsrading, Unesin^iice (r^nesln6) 

cyc^EI 

ESTs 

Homo sapiens cO^^A FU10570 fis, clone NT2RP20 
ESTs 

gbiy16l07 j(1 NCLCGAP.U0 Homo sapiens cDNA 
ESTs 

ESTs ^ 

stmllar to lysosome-essociatBd membrane glyco 

ESTs 

pregnancy specifto beta-1-giycoprotoin 9 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA: FIJ23038 Us, done LNG020 
I^theScal protein FU20069 
gb:IL'BT152-4)8039»)04 BT1S2 Homo sapiens cON 
ESTs 

Estrogen receptor 1 
K1AA1069 protein 
ESTs 

budding uninhibited by benzimldazdtes 1 

predicted exon 

ESTs; 

ESTs 

Homer, neunmal Immedlato earfy gene, IB 

RU2S 

ESTs 

gb:EST03366 Fetal brain, Stratagene (cat93820 
gb:q«i34li07 Jrt Na.CGAP.Ut4 Homo sapiens cOMA 
mammaglobbi 2 

gbMl60b1lJ(1 NCLCGAPjOoie HomosapienscONA 
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R1 

55.7 

45J 

42.6 

35i 

27.3 

26.0 

24.8 

24.2 

16.7 

14J 

13.5 

13,5 

123 

10.7 

10.1 

10.0 

9.8 

9.7 

9.1 

9.0 

8.9 

a9 

8.8 

8.7 

8.5 

8.3 

7J 

7.2 

7.1 

7.0 

7.0 

6.9 

&3 

6.2 

6.2 

6.1 

&1 

6.1 

6.0 

6.0 

6.0 

6.0 

S.9 

&7 

5.7 

5.7 

5Ji 

fi.5 

5.4 

5.4 

5.3 
5.3 
5.3 
5.2 
5.2 
5.2 
6l2 
6.1 
6.1 
5.0 
&0 
5.0 
&0 
4.8 
4.8 
4.7 
4.7 
4.7 
4.7 
4.6 
4.6 
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5 

10 
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65 
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75 
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453616 NKt0034&2 
424115 AA335497 
414245 BE148072 
423244 AL039379 
441031 All 10684 
447048 AW393080 
456861 A1530223 
428758 AA433988 
420149 AA2S5920 
433479 AW511459 
449416 AI651016 
457551 AW821319 
450109 AIS39295 
438954 AA740151 
415511 A1732617 
410153 BE311926 
406411 

416209 AA236776 
416456 HS70S2 
454692 AW813350 
452249 BE394412 
435211 AK001581 
449765 N92293 
434988 At418055 
423515 AA327017 
435407 AI149774 
440886 AW511032 
444783 AK001468 
452039 A1922988 
407300 AA102616 
425176 AW015644 
449433 A1672096 
419335 AW960146 
422711 060641 
453096 AW294631 
441962 AW972542 
445034 AW293376 
418677 S83308 
422219 AW978073 
440304 6E159984 
421863 At952677 
431322 AW970622 
400250 

428227 AA321649 
420092 AA814043 
415138 C18356 
437212 AI765021 
409867 AW502161 
421477 AI904743 
427119 AW880662 
4581S4 AW816379 
434539 AW748078 
424717 H03754 
412078 X69899 
447342 Ai199268 
413472 BE242870 



H3^6 

Ks^65 

HS.7S850 

H3.209602 

Hs.7645 

Hs.228320 

HS.98S02 

Hs.88095 

Hs.249972 

Hs^46311 

Hs.288928 

H5.17967 

H3.130425 

Hs.182362 

H&1S830 

Hs.79078 
Hs.176626 

H8.61252 

Hs.80981 

Hs.206832 

Hs.161160 

Hs.162204 

HS.11717B 

H3.190516 

Hs.62160 

H5.172510 

Hs.120769 

Hs.301430 

Hs.9012 

Hs.284137 

H3.21739 

Hs.11325 

Hs^9008 

HS.160323 

Hsi7224 

Hs.12S3g5 
Hs.108972 



Hs.2248 
Hs^5 
Hs.78045 
H5210775 

Hs.104650 
H8.114574 

Hs.214410 
Hs.152213 
K5.73149 
Hs.19322 
Hs.75379 
Hs^3 
K5.36353 



dynein, axixwnd, flght Menn6(fiato pblyp8pl 
ESTs 

WAS protein family. Tnember 1 

ESTs. similar to ubiquitous TPR moGf. 

fibrinogen. B beta polypepQde 

Homo sapiens cONA: RJ23537 fis, done liJG076 

PHD finger ONA binding protein isofbm) 1 (tnt 

Homo sapiens cONA FU14303 fis, done PLACE20 

ESTs 

ESTs 

ESTs 

Homo sapiens cOI^ faJ232g6 (is. done KHP106 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FU12691 fis, dons NT2RM40 
predicled exon 

MA02 (mltoQc anesl deOdant. yeast, homolo 
hypoMcal protein EOAG-1 
gb:ti/IR3^T0192.100100^24fl07 
ESTs 

polymerase (DNA directed), gamma 

EST.Moderately similar to ALUSJftfMAN ALU SUB 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo saito CDNA aJ20463 fis. done KAT0614 
ESTs. Moderate sImBar to TEFIJiUMANTRANSC 
ESTs 

Homo sapiens cDNA FU12888 fis. done KT2RP20 
Homo sapiens mRNA;cONA DKFZp586l1518 
ESTs 

Homo sapiens cDNA: FU21814 lis. done HEP010 

ESTs 

SRY (sex detennlnlng region YHwx 5 
gb:£ST390182 MAGE resequences 
ESTfi 

Homo sapiens mRNA; cDNA DKFZp434P228 
gb:EST382704 MAGE resequences. MAGK 
predict^ exon 

INTERFERON-GAMMA imSO^D PROTEIN 
ESTs 

tissue fador paOivray fnNbitor 2 TFPI2 
ESTs 

gb:U-HF.flR0;Mi]r^12-O^l.r1 NIHJ46C_62 

hypothetbal protein FLJ10292 

ESTa 

gbK3V4.ST0234-181ig9.03$^1 ST0234 
ESTs 

wingless^ MMTV integration site fian% 



446619 AU076643 

453891 AB037751 

443613 AI079356 

441285 NM_002374 

409731 AA125985 

417847 AI521558 

441484 AA935481 

415802 AA169515 

448112 AW245919 

428330 122524 
402606 

407905 AW103655 

424917 AI636208 

436982 AB018305 

451842 Aia20S39 
455666 BEQ65813 
431731 BE266322 
443695 AW2D4099 
410358 AW97S168 
406030 

409745 AA077391 
430481 AA479678 
437641 AAB11452 
415211 R64730.comp Hs.155986 
443450 N66045 Hs.133529 
457438 NMJD14053 Hsl70594 
451254 A1571016 Hs.172967 



Hs.167 

Hs.56145 

Hs^12 

Hs.58972 

Hs.6008 

Hs.301018 

Hs.2256 

Hsi52905 
Hs.96901 
Hs.5378 
HS267087 

H8^1374 
KS.1127S9 
H8.13337 



Hs.203269 
Hs^1911 



ESTs; Weddy simnar to lUI ALU SUBFAMILY J 

solute canier family 1 (glial high affinity 

secrsted phosphoprotein 1 (osteoponfln) 

Homo sapiens mRNA full length Insert cONA do 

gb:az39b09^1 SoaresJIhHMPu_S1 Homo sapiens c 

mtcrotubUte-assodated protdn 2 

thymosin, beta, identilied in neuraUastoma c 

Homo &aptensd}NA: FLI22316 fis. done HRC052 

ESTs 

ESTs 

ESTs 

matrix metaAoprotelnase 7 (malrilysln} 

predicted exon 

ESTs 

Homo sapiens cONA: FU23049 fis. done LNG025 
spondin 1 , (f^pondln] extraceDutar matrix p 
ESTs, Moderately sinflar to ALU4.HUMAN ALU SU 
gb:RC2-BT0318-1 101(NM)12>a08 BT0318 Homo sapi 
ESTs. Weakly similar to SP49JiUMAN SPUCEOSOM 
ESTs, WeaMy similar to AF126780 1 reSnal sh 
ESTs. Weakly simiiar to unnamed protaln produ 

pfctfictftd G9Q0n 

gb:7B14E12 Ghnmosome 7 Fetfll Brain cDNA Ubr 
ESTs, ModeratdysimllartoALUBJftJMAN ALU SU 
ESTs 

ESTs; Highly sfaifOar to SPERM SURFACE PROTEIN 
ESTs 

aVCRpiotdn 
ESTs 
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4.6 
4.6 

4.6 

4.5 

4.5 

4.4 

4.4 

4.3 

4.3 

4,3 

4.3 

4,3 

4.3 

4.3 

4.3 

4.3 

4.2 

4.2 

4.2 

4.1 

4.1 

4.1 

4.1 

4.1 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

as 
as 

3.9 
3.9 
3.9 

as 

3.8 

as 
as 
as 
as 
as 
a8 

3.8 

as 

3.8 

a7 
a7 

3.7 

a7 
a7 
a7 

17 

a7 
a7 

3.7 
3.7 
3.6 

ae 
a6 
ae 
a6 

3.6 ) 
3.6 

a6 
ae 
ae 

3.6 

ae 
ae 
ae 
ae 
ae 

3.6 

as 
ae 
as 
ae 
a4 
a4 
a4 
a4 
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419563 

427778 

435031 

4073K 

417411 

431548 

432415 

423126 

433420 

435174 

444743 

452588 

427304 

419917 

417728 

419356 

458627 

435185 

416623 

405174 

403776 

426274 

431255 

442353 

456662 

416530 

454392 

406400 



AA526235 

AA412323 

AI632091 

AR)26942 

AW500008 

AI834273 

T16971 

AA322245 

AI674093 

AA687378 

AA045648 

AA889120 

AA761526 

AA320088 

AW138437 

AI656166 

AW088642 



N74S25 



D38122 

AA497043 

BE379594 

NM_002448 

U62801 



430704 
401517 
417830 
435267 
426384 
422797 
428832 
449722 
418478 
422689 



AW979197 
AW8130gi 

AW504786 

N23797 

AI472078 

AB033064 

AA578229 

BE2BQ074 

038945 

AW856665 



Hs.193162 
Hs.105323 
Hs.116877 

Ks.6986 

Hs.9711 

Ks.289014 

Hs.290165 

Hs^61 

Hs.194624 

Ks.11817 

Hs.110637 

K3.163853 

H5.93701 

K5^4790 

Hs,7331 

Hs^7984 

Hs^lOS 

H5.38761 



Hs.2007 

Hs.115665 

Ks.49136 

Hs.1494 

Hs.79361 



Hs.292073 



Hs.132808 
H8.110114 

H3.120908 

Hs.23960 
Hs.1174 



HDR»8apIenscONAFU11983lis.clDn8HEMBB10 a4 

ESTs 3>3 

ESTs 3.3 

gbiiomo sapiens dgSSmRNA. partial sequsfloe. 3.3 

Human DNA sequence from done RPM87J1 1 on c 3.3 

HomosapienscONAFU13018ti3.clonellT2RP30 3.2 

ESTs 3.2 

ESTs 3.2 

ESTs 3J 

ESTs 3.2 

nudix (nudeoside diphosphate linked motely X 12 

HomeoboxAlO 3.2 

ESTs 3.2 

Homo sapiens mRNA;cDNADKFZp434E232(rmmd ai 

KIAA1573 protein 3.1 

ESTs 3.1 

ESTs; WeaMy sNIar to WASP-famOy protein I 3. 1 

dimethyiafgininddimBlhylambiotv^^ 3.1 

Homo sapiens tDNA: FU21S64 (is. done COLQ64 3.1 

predicted exon 3.1 

predicted exon 3.1 

tumor necrosis fadOf{!^|and)superfamny,m 3.1 

ESTs 3.1 

ESTs 3.1 

msh (Drosophila) homeo box homolog 1 (fonner) 3.1 

ka]IIKrein6(neurosIn,zyme) 3-1 

gtKG011506nFl NiH_MGC_19 Homo sapiens cONAc 3.1 

kaHikiein 8 (neuropsbiAivasIn} 3.0 
ESTs . 3,0 
0b:RC3^T0186'240400-1l1-d07ST0186Homo8api 3.0 
predicted exon 3.0 
epiihelisl ceQ transfoniAig sequence 2 oncoQ 3.0 
ESTs . 3.0 
ESTs 3.0 
KlAA1238protdn 3.0 
Qb:nl22b12j{1 NCI.CGAPJ1SC1 Homo sapiens CONA ao 
cydinBl 3.0 
cydin^ependentUnasa inhibitor 2A 3.0 
gb:RC3-CT0297*2901004}13403 CTOZg? Homo Gapi 10 



TABLE 18B 

Hcey: Unique Eos probeset idenlifler number 

CAT number. Gene duster number 

AcoBssion: Genbank accession numbers 



Pkey CAT number Accession 

409745 115237 1 AA077391 A1347618AI361453A1088754AW207491 AW960912AA921674AA286833AA150722BE1 52353 AWI^ 

409867 1156530J AW502161 AWSQ2S87 AW502345 

422219 213547.1 AW978073AWS78072AA8075SOAA306567 

422689 219896.1 AW8S6665AA315008AW954733 

426384 266211J AI472O78AA377209AA855807 

426465 267664.1 A1758948AA379527AA379948AA379262AW963933 

428832 296144.1 AA578229AA438432AA481375AA481363 

430704 322217.1 AWB13091 AW206655AA484440 

431322 331543 1 AW970622 AA503009 AA502998 AA502989 AA502805 T92188 

437938 44573 J AI950D37 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 WO1059 AW967671 AA908598 

AA251875 A1820501 AI820532 W87891 T85904 U714S6 T82391 BE328S71 T75102 R34725 AA684922 BE328517 A1219788 AA884444 N92578 F13493 
AAS27794AI560251AVVS74068AL134043AVV23S363AA663345AVV008282AA488984AA283144A18g0387M^ 
AW102898AI872193AI763273AW173568AW1S0329AI853832AI762688AAS88777AMB8692AI3S6394AW103^^ 
AW505512A1981530AW62g970 BE612881 AW276997AW513601 AW512843AA044209 AW856538AA1 80009 AA33749gAW961 101 AA251669 
AA251874 AI819225 AW2058S2 A!583338 AJ858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 AA969759 N75628 
N22388 H84729 H60052T92487A1022Q58AA780419AA55100SW80701 AW613456AI373032AI564269 F00531 H83488 W37181 W78802 R66056 
AI002839 R67840 AA300207 AW959S81 T63226 F04005 

438993 487651.1 AA828995AA834879AI928361 

442438 542469 1 AA995998 AI916584 R61781 T77332 F07756 F08149 F07647 

443613 575391.1 AI079356 W23287 

449034 794817.1 Ai624049AW1 17770 AI858360 

451 105 859083.1 AI761324 AW880941 AW880937 

452771 93C983J T05477 T07855AI917711 

454392 11588^1 BE260893AA078319R85057AW803Q24H85811 AA078293 
454692 1229118 1 AW81 3350 AW816082AW81 3476 AW81 3383 
455666 134g545''l BE(^13BE0657886E065889BE065832 
458154 491760.1 AW816379AA888282AA879046AA879195 
458861 798085 1 AI630223 AI630470 



TABLE 18C 

Pkey: 
RbI: 



Unique number conespondino to an Eos probeset 

Sequence souico. The 7 digit numbers In ihbooiumn are Genbank ldentH!er(Gl)mm)bef8.1)un^ 
humai chromosome 2Z' Dunham L et aL, Nature (1999) 40248^95. 
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Strand: Indicates ONA strand frem which axons were predicts 
NLposKion: Indicates nudeotkJe positions of ptatficted axons. 



Ptey 


Ref 


Strand 




401517 


7677912 


Rus 


2927&-29770 


402606 


9909429 


Minus 


81747-82094 


403776 


7770611 


Minus 


1414-1513,1624-1756 


403899 


7381715 


Minus 


9144-9350 


405174 


710B030 


liAnus 


102814-103063 


4OS609 


5757553 


Minus 


42814-43010.4358^783.44d6345033,4642&^(6554,4781S46018.49961'S01S3.51624^ 




529ia55469-S5601.57111.57307,5816»^296.60215^332.61482-6l727 


406030 


B312328 


Minus 


96123-95547 


406400 


9256298 


nus 


1553-1712,1678-2140,4252^5922-6077 


40S411 


9256407 


Plus 


7400-7527 



Tatde 19A: 225 Up-Ragutatad Genes Encoding Extrac^ular/Cen Sinfaoe Protahs, UTERINE Cancer Veisus Nomial Adult Tissues 

Table 19A lists about 225 genes ufKegutated in uterine cancer compared to normal aduil tissues that are Kkety to encode extracellutar or ceiKsuilace proteins. These were selected 
as for Table 18A. except that the ratio was greater than or equal to 2.0, and the predicted protein contained a structural domain that is indicativB of extracelhilar localization (e.g. tg. 
(h3, egt 7tm domains, ^nd setiuances. IranstnenibiBne do^ 

Pkey: Unique Eos probeset UenfHiar mnnber 

ExAccn: Exemplar Accesston number. Genbank accession number 

UnigenelO: Unigene number 

Unigene TiDeitJnigene gene tiOe 

PSDomain: Protein Structural Domain 

R1: RBiiooftumarvs.nonnal6ssue 

UnigenelO Unigene Title 

Hs.30743 Preferentially expressed antigen in melanoma 
Hs.52620 {nlegfin;beta8 
Hs.2258 MatrixK 
Hs.257846 ESTs 
H&.133525 ESTs 

Hs.150539 Homo sapiens CONAFU13793 lis, do 
Hs.31 141 Roundabout homolog 2 transmembrane 
Hs.99348 dlstaltohomeobox5 
Hs.73625 RAB8 Interacting, Unesin-tike(r8b 
Hs.9700 cycRnEI 
H5.152385 ESTs 
H8.137422 ESTs 

Hs.10887 sinAartolysosome-assodatedmemb 
Hs.120954 ESTs 

Hs.272620 pregnancy specific b^l-glyooprot 
H$.273294 hypothetical protein FU20069 

8bdL.BT1S24)B039»)04 0T152 Homo 8 
Hs.1657 Estrogen receptor 1 

predicted exon 
Hs.36708 budding urdnhlbHed fay benzlRddazol 
Hs.279727 ESTs; 
Hs.176622 ESTs 

H8.9192 Homer, neuronal Immediate earty gen 
Hs.61345 RU2S 
H5.179725 ESTs 

fib:qw34h07jrt Na_CGAP.Ut4 Homo sap 
Hs.97644 mammaglobln 2 
Hs.33846 dynein. axonemd, light Intemiedislt 
Hs.293965 ESTs 

H8.75850 WAS protein famity, member 1 

PHD finger DNA binding protein isof 
Hs.246311 ESTs 
Hs.88095 ESTs 
H8.249972 ESTs 

Hs.288928 Homo sapiens cDNA:FU23296fl8,cl 

predicted exon 
Hs.176628 hypothetlcalprotdnEDAG-l 

gb:MR3^0192-10010(M)24^ ST0192 
Hs.80981 polymerase (DNAdirecled). gamma 
Hs.161160 ESTs 
Hs.62180 ESTs 
Hs,190516 ESTs 

Hs,301430 ESTs. Moderately similar to TEF1_HU 
H5.160323 ESTs 

Hs.87224 SRY(sexdetennin!ngieg|onYHKix 

predicted exon 
Hs^40 Interferon-gamma induced protein 
Hs.78045 Qssue factor pathway inhibitor 2 T 

gb:QV4ST0234-1811994»&g01 8T0234 
K5.104650 hypothetical prot^FlJ10292 



Pkey 


ExAccn 


452838 


U65011 


438993 


AA828995 


400289 


X07620 


446508 


N75217 


433426 


H69125 


440870 


AI687284 


408562 


AI436323 


420610 


AI683183 


412140 


AA219591 


443715 


AI583187 


432113 


AA935065 


419503 


AA243642 


444342 


NM.014398 


436076 


AI193277 


406687 


M31126 


418852 


BE537037 


459583 


A1907673 


400301 


X03635 


405609 




453922 


AF053306 


410102 


AW246508 


433283 


BE041135 


443270 


NM.004272 


410247 


AF181721 


422589 


AA312735 


407275 


AI3&4186 


420440 


NM_002407 


453616 


Nli/L003462 


424115 


AA335497 


414245 


BE148072 


458861 


AI630223 


449416 


AI651016 


420149 


AA255920 


433479 


AW511459 


457551 


AW821319 


406411 




416456 


H57052 


454692 


AWB13350 


436211 


AKD01561 


434988 


AI418055 


444783 


AK001468 


440888 


AW511032 


425176 


AW015644 


445034 


AW293376 


418877 


S83308 


400250 




428227 


AA321&49 


415138 


C18356 


456154 


AW816379 


421477 


AI904743 



PSOomain 


m 


TM 


27.3 


SS,TM,integrin3 


16,7 


SSJiemops)dn 


12,3 


TM 


8.9 


TM 


B.7 


TM,PAX 


7,1 


SS.TM,lg,fin3 


7.0 


TM,homeol)OK 


6.2 


TMJdnesin 


6.0 


TM^clin 


6.0 


TM 


5.9 


TM 


5.5 


TM,L£mp 


5.4 


TM 


5.4 


TM,hemopexln 


5.3 


TM 


6.2 


TM 


5.2 


TMMmonejec,zf<;4 


5.1 


TM.Myosl(Ltal,myosinJnad 


6.0 


TM 


5.0 


SS.TM, 


5.0 


TM 


4.6 


TM 


4J 


TM 


4.7 


TM 


4.7 


TM 


4.7 


TMiUteioglo&in 


4.6 


TM,Ribosomal.SZ7e 


4.6 


TM 


4.6 


TM,WH2 


4.6 


TM.PHO 


4.4 


8S.TII*, 


4.3 


TM 


4^ 


TM 


4.3 


TM 


4^ 


TM.vwa,FGW 


4.2 


TM 


4J2 


TM 


4.1 


TM 


4.1 


TM » 


4.1 


TM.PH 


4.0 


TM.FG-GAP 


4.0 


TM,Glyco.trBnsf.29.TEA 


4.0 


TM 


3.8 


TM.HMG_box 


as 


TM.HisLdeacetyl 


3.8 


TM.tL8 


18 


TM.Kunitz^BPTI,G^8mma 


as 


THWW 


3.7 


TM 


X7 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



413472 BE24287D 
447342 Alig926B 
446619 AU076643 
453891 AB037751 
441285 NM.002374 
409731 AA125985 
441484 AAS35481 
428330 L22524 
407905 AW103555 
435982 AB0183a5 
402606 

443695 AW204099 
437641 AAB11452 
415211 R64730.comp 
443450 N66045 
457438 NM.014053 
435031 A)632091 
417411 AW50G008 
435174 AA687378 
444743 AA045648 
433420 AIS74093 
419917 AA320068 
417728 AW138437 
403776 

426274 038122 
416623 N74925 
405174 

431255 AA497043 
456662 NM.002448 
416530 U62801 
454392 6E260893 
406400 
401517 

417830 AW504786 
435287 N23797 
449722 BE280074 
418478 U38945 
422689 AW856665 
441794 AW197794 
416658 U03272 
431130 NM-006103 
418113 AI272141 
402373 AL13S225 
431989 AW972870 
400284 

433578 AAB11244 
423513 AF035960 
448966 AW372914 
431870 AW449902 
409457 AW818081 
438777 AA825487 
451807 W52654 
433328 A(379486 
448221 BE622615 
448141 AI4718B8 
458311 AA225632 
405454 

459287 AL079369 
438935 H40665 
421312 AA824627 
418882 NM_004996 
424345 AK001380 
417956 AA210704 
445537 AJ245671 
448089 AI467945 
446643 AA194417 
456671 AB011142 
457256 AA459443 



435313 
417351 
412198 
413278 
421502 
418092 
41X08 
420362 
431974 
438209 
447578 
414812 
421515 
416402 



solutB cantor family 1 ttDal tdgh 
ESTs: Vfeaidy ma b U» ALU SU 
secreted phosphoprotein 1 {osteopon 
Homo sapiens mRNA fufi (ength inser 
ntootutRile-assodstBd pfotein 2 
ttqfinosln, bela^ UenSfiad in nemo 
ESTs 

cnatrta metaDopnilsinase 7 (matrOy 
ESTs 

spondin 1 , (f-«pondln) extracelluta 
predicted exon 

ESTs, WeaWy simBar to AF126780 1 
ESTs 

ESTs; HlgMy sImiiartoSPERMSURFA 

ESTs 

avCR protein 
ESTs 

Human DMA sequence from done RP1-1 
ESTs 

nudbc (nucleoside diphosphate linke 
ESTs 

Homo sapiens mRNA; cONA DKFZp434E23 
KIAA1573 protein 



Hs.75379 

Hs.19322 

Ks.313 

Hs.36353 

Hs,167 

H3^145 

Hs^8972 

Hs^56 

Hs.252905 

Hs.5378 

Hs.112759 

Hs.291911 

Hs.155986 

HS.133S29 

Hs.270594 

Hs.116877 

Hs<6966 

Hs.194624 

Hs.11817 

Hs^3961 

Hs.93701 

HS24790 



Hs.2007 tumor necrosis (actor {llgand)8Upe 
Hs.38761 Homo sapiens cDNA: FLJ21564 lis, d 

predicted exon 
HS.11S685 ESTs 

msh (DrosophBa) homeo bam homolog 
kaffikreln 6 (neuosia zyme) 
gb:601150677F1 N1H_MGC„19 Homo sapl 
icalliivBin 6 (newopsln/ovasin) 
predicted exon 
Hs.132808 epittiOOaloentranslbmiingsequen 
Hs.110114 ESTs 

cydln B1 

cyclbHtopendent Idnase inhU}itor 2 
gtyJ^C3-CT0297-2g010&«1M03 CT0297 
Hs.253338 ESTs 

fibrilBn 2 (congciAal conlrectura 
ep(didymi»«peGffic; whey^ddic pr 
ESTs 



Hs.1494 
Hs.79361 



Hs^960 
Hs.1174 



AI7694Q0 

T90278 

AA937111 

BE563085 

AF111856 

R45154 

AA0795S2 

U79734 

AW97268g 

AL1206S9 

AA912347 

X72755 

Y11339 

NiyL000715 



Hs,79432 
Hs:!719 
Hs.83484 

Hs.301865 dopad)Tometautom8ra5e(( 
Hs:»1069 ESTs 

Estrogen receptor 1 
Hs.164168 ESTs 
Hs.129719 transghjIamlnaseS 
Hs.287462 Homo sapiens cONARJ1 1875 fis.cto 
Hs.105500 ESTs 

gl>:CM4^T0276-101299.0594i09 ST0276 
Hs.142179 ESTs.VteaWy6lmilartoORF2|M4nus 
H&27099 DKFZP564J0863 protein 
Hs.1 59430 ESTs 

gb:601440775ri N1H.MGC.72 Homo sapl 
HS.197S31 ESTs 
H&190016 ESTs 

predicted exon 

gb£IKI7p564G2378j1 564 (synonym: ti 
Hs.31564 ESTs 
Hs^1670 ESTs 

Hs.89433 ATTMiindIng cassette, 8iib4amRyC 
Hs.145479 Homo sapiens cDNA FU1051B lis, do 
Hs.190465 ESTs 

Hs.12844 EGRi1<B^tomdn;miitllpto6 
H5.173696 ESTs 
HS2620S0 ESTs 
Ks.1 14293 ICAA0570 gene product 
Hs^1816 ESTs 
Hs.269307 ESTs 
Hs.18g729 ESTs 
ESTs 
ESTs 

IntBTferon-s&milated pmtdn, 15 1( 
Hs.105039 sduto carrier family 34 (sodium ph 
Hs.106604 ESTs 

gb'.zm20h12^1 Stratagene pancreas ( 
hunOngfin InteracQng protein 1 
Hs^34 ESTs 
Ks.6111 KIAA0307gen8pmdud 
Hs.136585 ESTs 

Hs.77387 monoldno Induced t>y gamma hterfero 
Hs.105352 GalNAc8lpha>2.&<i8iytliansterase 
Hs.1012 oomptementoomponem 4-biffi8hg prot 



Hs.1 5049 
HS.691B5 
K3.833 



Ks.97206 



TI^SDF 
Tl^ 

T^A,Osteopo^lin 
TM 

TM,tubulin-bindtng 

TM,Thymosin 

TIM.fh3.ig.Yj)hospliatese 

SS,Pep8d8se_M10 

SS,TM.Eplifin 

SS,TM. 

m 

TM 
TM 

TM,iQ.Rna 

TM 

TM 

TM,RhoGEF,PH 
TM 

TM.SPRY 

TM,mulT 

TM 

TM 

TM 

SS.TM,ltJ 
TM,TNF 
TM,Ets 
TM 



a7 
a7 

17 
3.7 
3.6 
3.6 
3.6 
3.6 

3.6 

a6 

3.6 
15 
14 
14 
14 
13 
13 
3.2 
12 
12 
11 
11 
11 
11 
11 
11 



TM 


3.1 


TM.homeotxjx 


1 1 

0.1 


TM,bypsin,proJsomefase 


i3>l 


TM 


"i. 1 
J.I 


TIA,trypsln 


in 


TM,HMG14«17 


3.0 


TM 




TM 




TM,cydin 




TM,anl( 


10 


TII/I,SNF2_N 


1 fk 


IHantc 


Z9 


TM,EGF,TB 


t Q 


SS.W3P 


2.9 


TM,HMG.box 


Z9 


TMjTEA 


2.9 




9Q 


TM.hormon8jec;tf-C4 


Z9 


TM.fonnyLtransf,AIRS.GARS 


2.9 


TM JransglutanALN 


2,8 


TM 


2.8 


TMpMHCWg 


Z8 


TM 


2.8 


TM 


2.8 


TM 


Z8 


TM 


2.8 


TM 


2.8 


TM.t»ZIP 


2.8 


TM,Sec7 


Z8 


TM 


2.8 


TM 


2.8 


TM 


Z7 


TMlGiald) 


Z7 


TM,ABCjnembfaRe;^C.tran 


Zf 


TM 


n 


SS.sushi 


2.7 


SS^GF 


17 


SS.TM. 


Z6 


TM.aaLadaptOfjB 


16 


TM 


16 


SS 


16 


TM.Spln-Ssty 


15 




15 


TM,CH 


IS 


TM 


15 


TM,ubiquitin 


15 


TM,NaJ>Lcolr8ns 


15 


TM.pklnase 


15 


TM.FG^ 


15 


TM,ENTKl.LWEQ 


15 


TM.bZlP 


15 


TM,HUiPAS 


15 


TM 


15 


SS,lt^ 


15 


TM,Gfyoo.lrBnslJ29 


14 


TMiGUShi 


14 
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439699 AR)88S34 Hs.187581 EST8.MO(terBiBly6lildlarloAtU1J1U TM 14 

428242 H55709 Ks^ 1sulcemteInliib»oryfaclgr(cho^ SS.L1F.0SM 24 

417693 AW959741 Hs.40368 adafriOMeteled protein conqriu TM,0aLailaptor.8 Z4 

- 426679 AA431765 gbzw80c03j1 Soares.tBsSsLNHTHom TM.KECT 2.4 

5 438311 M70B958 H8.168732 ESTs TM 2.4 

426920 AA393351 K5.132121 ESTs TM Z4 

426698 AA394104 HsJ7489 ESTs TM 2.4 

443426 AFD98158 Hs.9329 Homo sapensmRNA for fls353,oompl TM Z4 

^. 406815 AA833930 H3.288038 tRNA isopentsnyipyrophosphate trans THIPPT Z4 

10 434808 AF155108 Hs^150 ESTs. Highly simHar to NY-REN-41 a TM tZ 

432441 AW29242S Hs.163484 EST TM.ForiUi8ad 2.3 

435615 Y1S06S Ks.4975 potassium voOage^atBd channel. KQ TM Z3 

4022S8 predicted exon TM.zK:2H2.KRAB Z3 

435542 AA687378 H3i69533 ESTs TM 2.3 

15 442952 AI743261 Hs.131860 ESTs TM 2.3 

416203 )C54942 Hs.83758 CDC2B protein kinase 2 TM.CKS 23 

429228 A1553633 Hs.104985 ESTs TM 23 

418969 W33191 Hs28907 hypothetical protein RJ20258 TM.SH3 23 

447570 AI868315 Hs.996e9 ESTs TM^D 23 

20 405032 predicted axon TM.FMO^ 23 

•416566 NM_003914 Hs.7937B cycIinAl TM/^din 23 

420900 AL045633 Hs.44269 ESTs TM.FAD.lAwfing.5 23 

430563 AA481269 H5.178381 ESTs TM>VBCjnembiane.p450 23 

417372 T99755 H5.290814 ESTs TM 23 

25 449083 AI948808 Hs.191144 ESTs TM 23 

410361 8E391804 Hs.62661 guanytatebindoig protein l.interf TM,GBP 23 

434131 AI858275 Hs.143859 ESTs TM 23 

431648 BE018924 Hs.271580 UnplaldnIB TM.transmembraRe4 23 

425638 NML012337 Hs.158450 nasopharyngeal epitheGum specific TM 23 

30 440006 AK000517 Hs.6844 hypotheOcai protein FU20510 TM 23 

445870 AW410053 Hs.13408 syntaxInlS TM 23 

430639 AW02S427 Hs.2335S2 ESTs TM^pkinase 23 

439018 AW300887 Hs.26638 membran&^pannlng ^domains, subfam SS.TM, 23 

422095 AI868872 H$.288966 cerotoplasn^n (ferroxMase) SS.TM.CiK)xidase 22 

35 411558 AA102670 Hs.70725 'Human GABAA receptor pi subunftm TM^neurjAian 22 

408380 AF123050 Hs.44532 diubktuilin TMJtnL^ANFjeoeptor 22 

403721 predictedexon TM 22 

440711 AA904389 Hs.143511 ESTs TM.rnn 22 

457285 Ai038858 Hs.228780 ESTs. Highly ^iar to AF1995971 TM.e1hand 22 

40 422956 BE545072 Hs.122579 ESTs TM 22 

433482 AI953499 H5.152617 ESTs TM 22 

431980 AA523896 Hs.222695 Homo sapiens cONA:FU20986 Us. d TM 22 

420777 AA280223 Hs.130865 ESTs TM 22^ 

446659 AI335381 Hs.226376 ESTs TM 22 

45 410227 AB009284 Ks.61152 exostoses (mulfiple)^ 2 TM 22 

422282 AF019225 Hs.114309 apotipoprotein L TM 22 

431701 AW93S490 Hs.t46S8 ESTs TM,Oocludin 22 

426910 AA470023 Hs.190O89 ESTs TM,MMfLHSR1 22 • 

405638 Hs.153595 predictedexon SS,TM.EGFJdljiecepU 22 

50 401933 predictedexon TMjonJrans 21 

436679 AI127483 Hs.120451 ESTs, Weakly similar to unnamed pro TM 21 

451061 AW291487 Hs213659 ESTs. Weakly Mlar to KIAA1 357 pr TM 21 

410664 NM.006033 Ks.65370 Rpase. endothflUI SS.TM.RibosorndLL22|pase 21 

449378 AW664026 Hs.59892 ESTs TM 21 

55 433345 AI681545 Hs.152982 EST duster (not in UnlGene) TM 21 

4258S1 NM.001490 Hs.159G42 gluoosamlnyl (NmlyO transferase SS.TM.Brandi 21 

431832 AW276865 Hs.192715 ESTs TM.Ets.SAMJ'NT 21 

448275 BE514434 Hs.20830 synaptic Ras6TP8seac1hra8ngprot TM.kinestn .abhydralasej 21 

423049 XS9373 H5.18B023 ESTs TM,homeol)ox 21 

60 427510 Z47542 Hs.179312 small nudearRNAacdvaBngoompte TM 21 

418076 RB1388 H5.6724 ESTs TM 21 

413670 AB00011S Hs.75470 hypolhefica! protein, expressed in TM 21 

429183 AB014604 Hs.ig795S KiAA0704 protein TM 21 

439031 AF075079 gb:Homo sapiens fiill lenglh insert TM 21 

65 431060 AF039307 H8.249171 homeoboxAII TMJwmeobox 21 

451494 AI799444 Hs.247095 ESTs.ModerdelysIinBartoALU7J1U TM 21 

419978 NM.001454 Hs.93974 forkheadboxJI THForKJ»ad 21 

404535 Z25884 Hs.121483 dilor^channell .skeletal musd 88 21 

445181 AW338972 Hs.147471 ESTs TM 21 

70 452367 U71207 H8.29279 eyes absent posopMb)homotoB 2 TM.Hydrol3se 21 

443591 AI078281 H8.179240 ESTs TM 21 

448105 AW591433 H5.170675 ESTs, Weakly slnrtDar to TMS^JIUMAN TM.tiyps]n 21 

424310 AA338648 Ks.50334 ESTs TM 20 

450193 AI916071 H3.224623 ESTs TM,pklnase 20 

75 436009 H57130 Hs.120925 ESTs SS.TM.Ephrin 20 

453313 BE005771 H3.153746 Homo sapiens cDNA:FU22490 lis. d TM 20 

41M33 AA2S1131 Hs^97 ESTs TM,WHEP-TR8 20 

437555 AA759263 Hs.14041 ESTs TM.NrBmp 20 ■ 

411828 AW181449 Hs.72290 wingless^ MMTVintegraftm site TM.wnt 20 

80 440052 AI633744 Hs.195648 ESTs TM.PAC 20 

410718 AI920783 Hs.191435 ESTs TM.SQSJ»SY 20 

404767 predidBdexsn TM 20 

447462 AW337214 HS.1S8973 ESTs TM 20 
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4422S5 


Ar701857 


Hi.202388 


ESTs 


TM 


2.0 


410292 


AAS43087 


Hs.t24194 


ESTs 


TM 


2.0 


442748 


At015713 


Hs.135787 


ESTs 


TM 


2.0 


458760 


A!498631 


Hs.111334 


ferTi]ln.lighipalype))l!de 


TM.HC03_ootransp 


2.0 


409799 


011928 


Hs.76845 


phosphoserlne phosphatase^ike 


TM.Hydralase 


2.0 


401324 






predated exon 


TM,niyQ^ilJhe3d 


10 


432140 


AKD00404 


Hs.272888 


hypoUtsGcai pmtein FIJ20397 


SS 


2.0 


447541 


AH0OQ288 


Hs.18800 


hyjntha&al protdn FLJ20281 




to 


421379 


Y15221 


Hs.1039a2 


sRid indudbte cytoMno subbirfly 


SS.TIUUIB 


ZO 


TABLE 198 











Ptoy: Unique Eos pnibeset identiiiar number 
CAT number Gene chistar number 
Accession: Genbank accession numbets 

Pks/ CAT number Accession 



409457 


1132521 1 


AW816081 AW392887 AW514700AW392881 


410008 


116812_1 


AA079552 eH142525 BE142S27 


422889 


219896l1 


AW856665 AA3 1 5006 AW954733 


428879 


294049_1 


AA431765AA432015 


438993 


467651 1 


AA828995AA834879A1926361 


439031 


46798^1 


AR)75079 H48801H467g6 


448221 


75534^-1 


BE622615 


454392 


115882_.1 


BE260693 AA078319 R850S7 AW803024 H85811 AA078293 


454692 


1229118J 


AW813350AW816082AW813476AW813383 


458154 


491768J 


AW81 6379 AA888282 AAS79046 AA879195 


458861 


798085J 


AI630223A)630470 


459287 


977129.1 


AU)79369 D81804 



TABLE 19C 



Pkey: UnkiuanunlberconespondlngtoanEosptobeset 

Ref: Sequence somce. The 7 digit nnrba^ In this column are Genbank tdenlffier (Gl) numbers. Thmham I. at al.* refera to the pubBcaSon enOOed The DNA sequence of 

human chromosome 22.' Dunham I et ai.. Nature (1999) 402:489495. 
8tren± Indteates DNA strand from which axons were predicted. 
Nt_po5ition: tndcates nudeoikle positions of predtoted exons. 

Pitey Ref Strand NLposiGon 



401324 


9863791 


Pius 


234057-234174 


401517 


7677912 


Plus 


29278-29770 


401933 


3810668 


Minus 


4872&490S7.51864-51955,52424-52588 


402298 


6598824 


Ptus 


36758^953 


402606 


9909429 


Minus 


81747-82094 


403721 


7528046 


Minus 


156647-157368 


403776 


7770611 


Minus 


1414-1513.1624-1756 


404767 


7882827 


Minus 


23244-23759 


405032 


7107731 


Minus 


131945-132224 


405174 


7108030 


Minus 


102814-103063 


405454 


7656675 


Plus 


133807-134053 


405609 


6757653 


Minus 


4281443010,4358^43783,4486345033.4642^46554.4781548018.49961-50153,51624- 








51727,51823-51959,52702-52918^9^601,57111-57307^169-58296,60215. 








60332.61482-61727 


405636 


5123990 


Plus 


66384^7 


406400 


9256298 


Phis 


1553-171%1878-214a42S24385.5922-8077 


408411 


9256407 


Plus 


rmiszr 



Table 20A: 56 Up-Regul^ Genes Encoding Extracellu!ar/Cell Surface Proteins, Uterine Cancer Versus Nomiat Adult Tissues 

Tdble 2QA lists about 56 genes up^ulatad in uterkie cancer compared to normd adult tissues that are 
molecutes. These were selected as Ibr TM 18A. except thai the ralk) was gre^ 

anzymaBclundionorofbdnomoddated by small molacules (e.9. pMnase, peplUase, tsomerese, transpoitais). The predicted protein domains are noted. 

Pkey: Unique Eos probeset identifier number 

ExAocn: Exemplar Accession number, Genbank accession number 

UidgenelDc Unpens number 

Unlgene TUarUnigene gene titiB 

PSDofliain: Piroli^ Stiuctural Dondn 



R1: 


Rafio of tumor vs. normal tl 


Issue 






Pkey 


ExAccn 


UnlgonelO 


Unlgene TUte 


PSDomaIn 


R1 


428187 


A!687303 


K5.285529 


G protein^pled receptor 49 (GPR49) 


7tnu1 


24^ 


400289 


X07820 


Hs.2258 


Matrfo( Metaltopnjtelnase 10 (Stromoly 


hemopextn 


12.3 


447350 


AI375572 


Hs.172634 


HER4(&efra4) 


kinase 


9.8 


420810 


AI883163 


Hs.99348 


distaHess homeo box 5 


homeolxw 


6.2 


405609 








Myosin_taH,myo5lnJhead 


6.0 


458861 


NM.007358 


Hs.31016 


PHD engar DNA binding pntein 


PHD 


4.4 


410153 


BQ11926 


HS.1&830 


Itxno sapiens cONA FU12691 fis. done 


NA 


4.3 
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438211 


AK001581 


KsA1961 


polyniersso (DMA dhoclBd)t QSRvns 


NA 


4,1 


444783 


AND01468 


Hs.62180 


ESTs 


PH 


4.0 


418677 


S83308 


Ks.87224 


SKY (sex aetannniRg fegion T}>Dax 5 


HMG.DOX 


Q B 

ao 


413472 


6E242870 


Hs.75379 


sdute carrier family 1 


one 




443613 


AI079355 


Hs^1B07 


gbx)z35o09.8l Soares_NnHMf\LS1 Homo s 


zf-C2H2 


3k6 


426330 


L22524 


KsZ2S6 


« . __lJLl.LnjL.---l.tjJjkOIlJ^ 7 i III! Ill lTK.lJltw.\ 

niatnx metsuopfoietnasG r (inBuiiysini 


rep&dasejMiD 


AO 


443895 


AW204099 


nS.1 12759 


cois, weaxiy similar lOAri Arw i re 


MA 

NA 


4£ 
J.0 


435031 


AI632091 


Hs.1 16877 


ESTs 


RhoGEFJ*H 


3.3 


417411 


AW500008 


KS.69K 


Kmnan ONA sequence ftom done RP1'-187 


NA 


3.3 


435185 


AA669490 


Hs.289109 


dimsttiytarginlne dlmeOiylandnohydrola 


NA 


3l1 


416530 


U62801 


Hs.79361 


kalHkreIn 6 (neurosin, zyme) 


trypsbiprojsoinerase 


3.1 


408400 


AA343629 


H&.10457O 


kaQikrdn 8 {neuropsInAovasin} 


trypsin 


3.0 


418478 


U38945 


Hs.1 174 


cydin-dependent kinase inf^Mlor 2A 


ank 


3i0 


441794 


AW197794 


Hs.253338 


ESTs 


ank 


2.9 


402373 


All 35225 


Hs301855 


dopachicina tautDmeraso (dopachroira do 


TEA 


2.9 


423513 


AF035960 


Hs.129719 


trEQisgtutamlnase 5 


TransgltitamlnL^ 


2.8 


448141 


AJ471598 


HS.197S31 


ESTs 


tmp 


2.8 


41S076 


NM.000857 


Hs.77890 


guanylalB cydase 1, sotuUe, beta 3 


guanylatBjcyc 


Z7 


418882 


NNL004996 


H9J9433 


ATP-Unding cassetlB» sub-teiiilly C 


ABCjnemtvane^CJIran 


2.7 


448435 


BE293439 


Hs.182278 


ca!nK)du8n2 


NA 


2.6 


417351 


T90278 


Ks.15049 


ESTs 


CH 


2.5 


430372 


AI206173 


H&211375 


ESTs 


SH3,efhand»C2,PH 


Z5 


431974 


AW972689 


Hs.200934 


ESTs 


bZIP 


Z5 


428046 


AW812795 


Hs.155381 


ESTs. Moderately simuar b 138022 ny 


ank 


Z4 


421515 


Y11339 


HS.10S352 


GalNAc 8iphd-2, &sialyUransferase 1 


Glyco„transL29 


Z4 


403095 






predicted exon 


nonieotx}x,PAX 


Z4 


408815 


AA833930 


Hs.288036 


IRNA isopentenytpyrophosf^ate (ransfe 


IPPT 


Z4 


43S615 


Y1S065 


Hs.4975 


potasdum voftagoigated chsnnd 


kMucharael 


Z3 


402298 






predicted exon 


ZM}2H2,KRAB 


Z3 


418203 


X54942 


H&83758 


CDC28 protwi kinase 2 


CKS 


Z3 


430563 


AA481259 


HS.1783B1 


ESTs 


ABC_membrane4)450 


Z3 


447570 


AI868315 


Hs.99669 


ESTs 


PHD 


Z3 


439018 


AW300887 


Hs.26638 


niendirand-spanring 4Kloindn3, subfamfl 


NA 


Z3 


415539 


Ar733881 


Hs.72472 


BMPRrIb; 


bone n%)rphogenelic protein NA 


Z2 


422095 


Ato6DB72 


Hs.28896d 


cenitoplasnwi fbntuddase) 


CiK>xMase 


Z2 


408380 


AF1Z3050 


1 1_ J J coo 

Hs.44532 


(flublqultin 


AklC Mini, ulna nimtit Ttm 4 

ANr_rBoepnr,$usm,7viLi 


£.2 




AA904369 


Hs.1 43511 


ESTs 




22 


457285 


A)038858 


Hs!228780 


ESTs, Highly similar to AF1 99597 1 ^ 


efhand 


22 


418506 


AA064248 


Hs.85339 


G protelrHXHipIed receptor 39 


NA 


22 


410664 


NM_006033 


Ks.65370 


npase, endothelial 


Ribosonfal_l^,llpase,PlAT 


Z1 


425851 


NM_001490 


Hs,159642 


fihioosamliTyl (N-acetyl) transferase 1 


Branch 


Z1 


448275 


BES14434 


Hs.20830 


synaptic Ras GTPase activating protein 


kinesin^HD,at)hydro)ase.2 


Z1 


429782 


NM-005754 


Hs.220689 


Ras-GTPase-activating protein SH3^ 


nm,NTF2 


Z1 


404535 


Z2S884 


Hs.121483 


chloride chsnnd 1 , sketet^ muscle 


NA 


Z1 


448105 


AW591433 


Hs.170676 


EST8,W^s!miartoTMS2 


trypsin 


Z1 


446342 


DCtdODDO 


Hs.14846 


Catlonlc amino add transporter (ecto 


NA 


ZO 


456760 


AH98631 


H8.111334 


ferritin, light pdypepUde 


HC03j»tran8p^C3HC4 


ZO 


409799 


D11928 


K5.76845 


phosphoserine phosphalase4Qe 


Hydrolase 


20 


401324 






predidedexon 


myoslnJiead 


ZO 



TABI£206 



Pkey: Ihilque Eos probeset iden0er number 
CAT number Gene duster number 

Aocesdon. Genbank aocesdon mnnbeis "' 

Plcey CAT number Acoesston 

443613 57S391J AI079356 W23287 
458881 798085J AI630223AI630470 

TABl£20C 

Pkey. Unique number corresponding to en Eos probeset 

RbK Sequence source. Tim 7 digit numbers In this column are GenbailkUenliRer(GQniinbei$^ 

human chromosome 22.' Dunham I. et aL. Nature (1999) 402:48&495l 
Strand: bidteatesONA strand from whkshexons were pretficted. 
Ntj)osl6on: IndkidBS nudeoti d e postBons of prectetBd owns. 



Pkey 


Rsf 


Slrarxl 


t^j)osUlon 


401324 


9863791 


Phjs 


234057-234174 


402298 


6598824 


Hus 


36758^953 


403095 


8954339 


Ptus 


150025-150240,151564-151690 


405609 


57S75S3 


hEnus 


42814-43010,43583^783.4486345033,4642946554,47815-48018,49981-50153,51624- 








51727^182^1959.52702-52918,55469«5601,57111-57307.5816»-58296,60215- 








60332,61462-61727 


406400 


9256298 


PhJS 


15S3-171Z187a'214a42524385,S9226077 
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TaWa 21A: 270 Up-Regulated Genes. Uterine Cancer Vereus Nomtil Uterus 

Tat)te 21A lists about 270 genes up-reguiated In utertne cancer oompaied to normal uterus. These were 
5A and the denominator vyas the median value ftv sU non-maliBnant utari^ 
Ptey: Unique Eos prot)esetldentifiarnumt)er 
ExAccn: Exemplar Accession numl)er.Genl)ankaooesstan number 
UnigenelD: Unlgene numt)er 
Unlgene Title: Unigene gene lille 
R1: Ratio of tumor vs. nonnalOssue 



selected as for TfiUe ISA, except that the rath) was greater than or equal to 



Pkey ExAccn UtIgenelD Unigene Title ri 

AI624049 gb:ts41a09j(1 Na_CGAP_Ut1 Homo sapiens 55.7 

AI560129 Hs.277523 EST 45^ 

AW07548S Hs.288049 phosphoserlne anMansfeiBse 19.6 

AF153815 Hs.50151 potassium iRwaRfl^fect^ng channel 15.6 

AW958544 Hs.112242 ESTa 15^ 

AB014604 Hs.197955 KIAA0704 protein 146 

AA731111 Hs.291891 ESTs 14,3 

AI110684 Hs.7645 fibrinogen. B t}eta polypeptide 14.0 

AB012113 Hs.16530 CCchemokineSCYA18(MIP-4)(PARO 13.0 

U17760 Hs.301103 lamlnin,beta3(nlceIn(125kD).kalinin 12.9 

Y11339 Hs.105352 GalNAc alpha-2. fr«laiyltransferase f. I 12.2 

AA353776 Hs.901 C048 antigen (^celi membrane protein) IZO 

A8037751 Hs.35353 Homo sapiens mRNA full length Insert cDN 11,7 

AU)37915 HS.15S097 carbonic anhydraseH 11.4 

AW384062 H$.301323 ESTs 113 

AI225235 Hs.268300 Homo sapiens cONA: RJ23231 fis.ctoneC 111 

BE091926 Hs.16244 mitotic spindle coBed^ related prot 10.9 

AA477355 Hs.288300 Homo sapiens cONA:RJ23231fis, clone C 10.3 

NM_0067g9 Hs.72026 .protease, serine. 21 (tes6sin) IQ^ 

X72755 Hs.77367 monokine induced by gamma interferon 10.2 

BE391804 Hs.62661 guanylale binding protein 1, interferon- 10.1 

AI216532 Hs.147487 ESTs 10.1 

AA995998 gb:os26b03.s1 NCLCGAPJddS Homo sapiens 10.0 

AW419196 Hs^57924 ESTs 10 0 

AA813426 Hs.192034 ESTs. WeaWyslmIlartoKIAA0705 protein 1o!o 

AI8312g7 Hs.123310 ESTs fto 

AI813318 Hs.48442 ESTs 97 

AI628272 Hs.88323 ESTs g'e 

AI948829 H3.2137e6 ESTs g'e 

BE543469 Hs.266263 Homo sapiens cDNAFU14115fls, done MA 9.4 

Y00787 Hs.624 inierieuklnS 93 

AA351847 Hs.2642 eukaryoHctranslalionelongalidn^ 93 

BE245374 Hs.27842 hypothetical protein FU11210 9.2 

NM_014338 Hs.10887 simHariolysosome^sociated membrane 9.2 

Ai142095 Hs.143273 ESTs gi 

AA306997 Hs.268362 ESTs, WeaMy similar to hypothetical pro 9,0 

AF121860 Hs.106260 sorfingnexin 10 9.O 

0 a9 

BE395109 Hs.129327 ESTs aS 

AA972165 Hs.150308 ESTs 8.7 

BES68452 Hs.5101 ESTs; Highly sbnaar to protBbireguIaS 8.7 

AW969610 Hs.151179 ESTa 8.7 

AW300887 Hs.26638 membranwpanning 4KfomaIns, subfamify A 8.7 

AU077312 Hs.1 53985 sotutecanierfamiiy7(c3lionlcan^no a6 

AA358454 Hs.78026 ESTs. Weakly similar to similar to ankyr u 

NM.000346 Hs.2316 SRY(sexHtetBnnInin8iegtonYHKix9 8.5 

ARJ85888 Hs.269307 ESTs ^ ^ 

AL138411 gbi«CFZp434A1229j1 434 (synonym: Mes3) 8^4 

AI584019 Hs.169008 ESTs. Moderately similar to piakopNHn 8.3 

C05837 H8.146807 Homo saptens cDNA RJI 3593 fis, done PL a2 

MB6849 HS.S566 Homo sapiens connexin 26 (GJB2) mRNA, co 8.2 

AW502208 gb:UWF-BR0p#J-©O9^l,r1 NIH_MGC 6 ao 

AI627292 Hs.190877 ESTa " aO 

AI243276 Hs.149017 ESTs 79 

AW292425 Hs,163484 EST u 

BE514434 Hs.20B30 synaptic RasGTPaseaciivaling proton 1 7.6 

A1912498 Hs.25895 ESTs. WeaWysfmiiff to PW kinase IH.8 7.8 

AI199268 Hs.19322 ESTs; WeaWy similar to IIU ALU SUBFAMI 7.7 

R38438 Hs,182575 solute carrier family 15 (HVpeptWetra 7.7 

AF262992 Hs.123159 sperm assodated anSgen 4 76 

BE3143B5 gb:6011S4649F1 NIHJ(IGCJ9 Homo sapiens c 7.6 

AH)44t97 H8.100431 smaUindudble cytokine 8 8UbtemiIy(Cy 7.6 

A1903396 gb:RC-BT029-120199-219J BT029 Homosapl 7.6 

BE086142 gb:CM4BT0320.221199WH|108TO320Homo 7.5 

AI127483 H3.120451 ESTs. WeaMysbhilar to unnamed protein 7.5 

AI150029 Hs.61438 ESTs 75 

BE281128 Hs.9030 TONDU 7;4 

W07808 gbzbOSalZrlSoaresjBtaUungJIbHLtgw 7.4 

AASO3083 Hs.79742 ESTs 7.4 

A1680624 Hs.148676 ESTs 7.4 

259 



449034 
435094 
438461 
434779 
441633 
429183 
436775 
441031 
446921 
413753 
421515 
414646 
453891 
425196 
444883 
449785 
446839 
449801 
411773 
414812 
410361 
423645 
442438 
415766 
456017 
435525 
413335 
420297 
452799 
434311 
408243 
430713 
452092 
444342 
443830 
442647 
421633 
403381 
426635 
440500 
436291 
431668 
439018 
424986 
425495 
428862 
438986 
422731 
441081 
415992 
431211 
409865 
448158 
401519 
441730 
432441 
448275 
438424 
447342 
408369 
423081 
414464 
420931 
459142 
411094 
436679 
452607 
443171 
459081 
431195 
444459 
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429334 
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409219 
400491 
403485 
408350 
445873 
400995 
406086 
403378 
426227 
422038 
431842 
429732 
427494 
429272 
427258 
449309 
4X104 
416402 
404767 



AW409701 

AI219207 

063078 

R35027 

AA393383 

H25530 

AW183350 
AA250970 



U87058 

R390g8 

NM_005764 

U20158 

AI628365 

W25140 

AA400091 

AW58g823 



439750 
403127 
418203 
425858 
421712 
456903 
414564 
457942 
410442 
424598 
445537 



413472 
410664 
428575 
406400 
426317 
441460 
412570 
424349 
448881 
445258 
456032 
404727 
422810 
440044 
416498 
426600 
422170 
449611 
402539 
456983 
407910 
457887 
431765 
420344 
443494 
456844 
416623 
4139B2 
458091 
402104 
428771 
435313 



416111 
427308 
423089 
416655 
417079 
449409 
400855 
454692 
414869 
439682 
445181 
437129 
4401^ 
443715 



M29540 

AL3S9053 

AI904493 

X54942 

AA364923 

AKD00140 

D49441 

AA164803 

AW665665 

X73424 

AB020839 

AJ245671 

NML003816 

BE242870 

NM.006033 

M19684 

AA312350 
A1962478 
AA033517 
AF141289 
AL109781 
At635931 
AWg57446 

AA317400 

AW665167 

U33832 

NM.003378 

At791949 

A1970394 

AW502761 

AI081687 

AA650274 

AI240007 

AF124249 

BE463721 

T99719 

Ai264155 

N74925 

BE503035 

AF150286 

AB028992 

AI769400 

A1188346 

AA033813 

D26067 

W15613 

AW968813 

U85590 

AI650935 

AW813350 

AA157291 

H97552 

AW338972 

AIJD49327 

AA982623 

AI5B31S7 



H3.1578 

Hs.72222 

Ks.186180 

Ks.133331 
Ks.50866 

Hs.250127 
Hs.251946 



Hs.168102 

Hs.192028 

Hs.271473 

Hs.2488 

Hs.130412 

Hs.1 10867 

Ks.39421 

Hs.224189 



NM.000715 Hs.1012 



Hs^0529 
Hs.57684 
Hs.99890 
Hs.83758 

Hs.107139 

Hs.155981 

Hs.71994 

Hs.153034 

Hs.63788 

H8.151017 

Hs.12844 

Hsi442 

Hs.75379 

Hs.65370 

Hs.184929 

Hs,169294 

Hs^6804 

Hs.74047 

Hs.145550 

Hs^754 

Hs.147613 

H5.301711 



Hs.259563 

Hs.79351 

H5.171014 

Hs.112492 

Hs.197075 

Hs^909 

Hs.170225 

Hs.41296 

H8.148812 

Hs.266541 

Hs.97101 

Hs^0404 

Hs.152981 

Hs^76t 

Hs^9193 



H5.193143 

Hs.189729 

Hs^1776 

Hs.79018 

Hs.174905 

Hs.1613 

Hs.79428 

Hs.81134 

Ks.301694 



Hs72163 
Rs^69060 
Hs.147471 

Hs.189144 
Hs.9700 



bactdovtral lAP rep8al-oonta!ring 5 (sur 

HypoOieQcd proteh FIJ13459 

Homo sapiens cDNA: FU23038 fis, done L 

glKra60s02.r1 Soarn bifont brain 1I4IB H 

EST8 

solute carrieffamBy 22 (ocgan i cca fi on 
0 

EST8 

Hnno sapiens cOHA: FU23107 lis, done L 

0 

0 

0 

Human proteinase acBvated receplor-2 mR 
ESTs 

epithelial protein up^utated in caid 

lymphocyte cytosoQc protein 2 (^H2 doma 

ESTs, Weakly stntBar to 8r&2 pXdegans 

ESTs 

ESTs 

ESTs 

0 

ooinpbRenl component 4-faindino protein, 
0 

CEA (carcinoembryohb anfigen-relalsd ce 
ESTs 

pdymerase (DNAdiroctaO, deHa 1, cata 

COC28 protein Idnase 2 

0t}:EST75602 Pineal gland n Homo sapiens 

hypdhelicd protdn 

mesothelin 

ESTs 

ESTs 

propionyt Coenzyme A cartwxytasa, beta p 
estrogeiwelsted receptor gamma 
EGPtodomaln; muIOpIe 6 
a disinlegrin and metailoprotelnase doma 
solute carrier family 1 (glial high afS 
Dpase, endothelial 

serine {or cysteine) proteinase Inhiblto 
Icallikrein 8 (neuropsln/ovasin) 
transcription factor 7 (T-cell spedfic, 
ESTs, Moderately similar to ALUC.HUIMAN ) 
etecbon-transfer-flavoprotdn, beta pd 
sdute carrier family 7 (cafionic amino 
Homo sapiens mRNA full length insert dM 
ESTs 
ESTs 
0 

gb:ESn9374 Retina ti Homo sapiens cONA 
EST 

potassium channel, subfamity K, member 1 
VGF nerve growth fiador indudbis 



ESTs 

KIAA0430gen8produd 
thymopoieSn 

fibronedin teudne rich transmembrane p 
ESTs 

novd St^-cont^ng protein 1 
PutaSve G protelrHXKipied receplor CPCR 
Homo sapiens cONA: FIJ22389 fis, done H 
CDP-dlacytglyceroi synthase (phosphalida 
Homo sapiens cDNA: aj21564fIs,doneC 
ESTs 

gb:AF150288 Human mfV^fpom cd344- stem c 
0 

K1AA1069 protein 

ESTs 

ESTs 

chromaOn assembly factor 1, subunit A ( 

KIAA0033 protein 

adenosine A2a receptor 

Ba2/adenovirus Eie 19t(I>interading pro 

inlerieukin 1 receptor antagonist 

ESTs 

0 

gb:MR^T0192.10010»124g07 ST0192 Homo 

ESTs 

ESTs 

ESTs 

gbiHomo sapiens mRNA: cONA DKFZp664E016 
ESTs, WesMy dmilarto NPTmuiMAN i^ENAL 
qidbEl 

260 



7.3 
7.3 
7,3 
7^ 
7.3 
7.2 
7.2 
7.2 
7.1 
7.1 
7.1 
7.0 
7.0 
7.0 
&9 
6.9 
6.9 
6.9 
6.9 
&9 
&9 
&8 
&8 
8.8 

as 

6.8 
6.6 
6.8 
6.7 
6.7 
6.7 

a7 

6.7 
6.7 

a7 

6.6 
&6 
&6 
6.6 
6.6 
6.5 
6.5 
&5 
6.5 
6.4 
6.4 
&4 
6.4 
6.4 
6.4 
&4 
&4 
&4 
6.4 
6.3 
6.3 
6.3 
&3 
6.3 
6.2 
6.2 
&2 

a2 
a2 

B.2 

a2 
ai 
ai 
&i 
ai 
&i 
ai 
ai 
ai 
ai 
ai 
ao 

6.0 

ao 
ao 
ao 
ao 
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422355 AW403724 Hs.140 trnmunogtobuOnhesvy constant gamma 3 &9 

405291 0 

432113 AAS35065 Hs.152385 EST$ 5.9 

441236 AAS23489 Hs.130432 ESTs 5.9 

S 42441B BE503432 H&fi6170 HSKMW protein 5.9 

453028 AB005532 Hs.3t442 RecQproteIivlikB4 5.8 

407137 T97307 Hs,199067 EST 5.8 

443462 AI064690 Hs.171176 ESTs 5J 

454392 BE260B93 gb£01150677F1 N»iMGCJ9 Homo sapiens c Sl8 

10 456311 AA225632 Hs.190016 ESTs 5.6 

446501 A)302616 Hs.150819 ESTs 5.6 

433921 AA618174 gb3nq14f01.s1 NQ.CGAP.Thyl Komo sapiens 5.8 

409615 AW444881 gb:UMfBlMz«04^l.s1 Na.CGAP^Su 5.8 

459360 8E384526 gtK601277gi3F1NIHJAGGL20HomD sapiens c S3 

IS 403824 0 &B 

428187 A1687303 HsiB5529 Gprotdivcoupled receptor 49 (GPR49) &8 

412140 AA219691 H5.73825 RAB6 interacting. IdnesMleCralildnes &7 

410658 AW105231 Hs.19203S ESTs S.7 

426465 AI758948 gb:ty1 6107 Jil NO.CGAP.UO Homo sapiens 5.7 

20 443695 AW204099 Hs.112759 EST8.W^aMy8imaartoAF1267801r8fin &7 

437372 AA323968 Hsi83631 liypotheQcal prolein DKFZp547G183 .5.7 

405392 0 6.7 

437100 Anr61073 Hs.14535 Homo 88f^cDNA:FU22314fis. done H 5.7 

449796 AA004321 Hs.194397 ESTs 5.7 

25 409361 NM.00S982 Hs.54416 sineoculisliomeobox(DrosophBa}homolo &7 

428987 NML0047S1 Hs.194710 glucos8mInyl(l4-ac8lyl) transferase 3 5.7 

404220 0 5.6 

420973 AA743415 Hs.291368 ESTs 5.6 

^- 430491 A1.109791 H3^41559 Homo sapiens mHNAfuQ length insert cDN 5.6 

30 442549 AI751601 Hs.8375 TNFreoeplor-associated factor 4 5.6 

409867 AWS02161 gb:U14{F-BR0p^^12-O4JlJ1 NIHJMGC.S 5.6 

451110 AI95S040 Ks.301584 ESTs 5.6 

418216 AA562240 Hs.263099 AF1Sq14 protein 5.6 

^_ 411897 AW875066 gb:RC6^0001-18010(M)21-F04 PT0001 Homo 5,6 

35 456161 BE264645 Hs^2093 Homo sapiens cONA:FU21918 lis. done H 5.6 

406536 0 5.6 

43S40 AI821517 Hs.105666 ESTs 5.6 

446315 NM-016293 Hs.14770 bridging integrator 2 S.6 

443270 NM.004272 Hs.9192 Homer, neuronal immedl^eariy gene, IB 5.6 

40 451035 AU076785 Hs.430 plastin 1 (I isofbrni) 5.6 

406685 M18728 gb:Human nonspedfic crossreacQng antlg 5.5 

454590 AW809762 Hs.222056 Konio sapiens cDI^FU11572fis, done HE 5.5 

402430 0 5.5 

446704 AI33722B H3.197083 ESTs 5.5 

45 435282 AA677428 Hs.189731 ESTs 5.5 

426062 N57014 Hs.44013 ESTs 5.5 

415451 H1941S H3.268720 ESTs, Moderateiy similar to ALULHUIMAN A 5.5 

456002 AI828729 Hs.191450 ESTs, Wbal^yslnflar to type II membrane 5.5 

409813 AW444816 Hs.171537 Homo sapiens cONA:FLI21596 lis. done C 5.5 

50 430259 BE550182 H5.127826 RalGEF^ protein 3. mouse ftomOlag 5.5 

434609 R76593 gb.'^60c11.n Soares placenta Nb2HP Homo &5 

430250 Ni^016929 Hs.283021 chtoiide intraoeliular diannel 5 5.6 

416327 U70370 H8.64136 paireiMilnhomeodom^transcrfpQonfa 5.4 

400379 NlyL018432 Hs.283076 Homo sapiens ovarian cancer related prat 5.4 

55 436076 A1193277 Hs.120954 ESTs 5.4 

432119 TB0289 gb:yd03ti04j1 Scares Infant brsliil NIB H &4 

417175 R44558 Hs.94O02 ESTs &4 

445774 AI254165 H8.145S04 ESTs Su4 

455604 BB011183 gb:Pf&BN0218*10Q500^IO;M09BN0218Homo 5A 

60 411426 BE141714 gb:QVO-HT0101-061099O32-c04 HT0101 Homo 6.4 

445262 AW205650 Hs.253503 ESTs 5.4 

412517 BE271584 gb:601141065F1 raHJAGC.O Homo sapiens cD 5.4 

434756 AA827850 Hs.259307 ESTa 5.3 

454417 Ai244459 Hs.110826 trintideoBde repeat containing 9 5.3 

65 439949 AW979197 Hs.292073 ESTs &3 

414995 C18200 gbK:i82Q0 Human placenta cDNACTFu lwara 5.3 

428071 AF212848 Hs.182339 transcription factor ESC-3B 5.3 

412323 AW937143 gb:PM1-DT0041^1299-001^1 DT0041 Homo 5.3 

434283 AW235341 Hs.58715 mouse thiamin pyrophospholdnasehomdog 5.3 

70 447793 AI42S049 Hs.1ig629 ESTs, Moderately similar to ALULHUIMAN A 5.3 

401723 0 5.3 

406270 0 5.3 

452194 At694413 Hs.298262 ESTs. Weakly similar to dt88Jai [H.8api 5.3 

415757 AA830854 Hs.187810 ESTs 5.3 

75 430051 AA4e4611 H3.52S15 transdudn (betaHDo 2 6.2 

435615 Y15065 Hs.4975 potassium voUag^ated channel. KQT-lik 5.2 

459583 AI907673 gb:IL-BT152-080399WBT152Homosaplen 5.2 

449009 BE044755 Hs.224812 ESTs 5.2 

424001 W67883 Hs.137476 KIAA1051 protein 5.2 

oO 409479 BE163800 Ks.138912 ESTs 5.2 

437852 BE001838 HS.2S6897 E8Ts.Weaidyshilartodl36501Z1 [Ksa 5.2 

43S928 K64345 Hs.183981 ESTs 5.2 

447397 BE247676 Hs.18442 E-t enzyme 5.2 
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MM PA 
•nsiCM 


AW445022 


nS>l9vSKKI 


MmneMilMitteniJA* Fl I911%fk^ i^naC 

IMIIU BOjncnS GUIW^ rUcI 103 BUllO w 


&2 










S2 








ESTs 


&2 


410153 




Hs, 15830 




5.1 


439509 


ARffi6332 


K5.58314 


ESTs 


5.1 


MKBa 




Ub \*OAjn 
ns>i lowc 


cycbiC 


&1 


4SI654 






db£ST381912MAGEreseatieMes fyiAGKHomo 


S.1 








nh<lV/4.TTrtfinftjni1Q(U19<L(>fM TTnOflR Hrwn 


S.1 






Ue 11CA19 


Mnrnn umfenv rOMA Fl Jl^l fra rinnA TH 
noniD sopicns cuivM ruiiijwi us, cions in 


5.1 


425188 




rKh lOvvf 1 


hvfwlhRfted funtdn FLI1 
njIwuioUMii iHVtcui ru 1 1 isu 


sil 


117174 


IIR1VI7 






sii 


433279 






wfc-CQTWWM MAf2C n>«aomwnf»oc UAftl HnmA 


0.1 


40'tilc 




ns.«3Uiouo 


PCTe 


^ 1 








nh*M.QfMPAmi HMmanntH (nbnt hraln rHM 
QD.ndVi 1 c/\uj I nwiiiaiKoo miaju wsui vun 


9.I 








hvnnfhpfiral ivntptn PRHIOfK 


5.1 








EQTe WoaUw dndliv In <ifiQQll hiffwrfane 
coi6i vvocuuy ofliHia ID iiypBiwns 










WlinMt04f19 r1 CnorBO ialal IIubp imlafMi 
QDijUO 11 1 €Jf 1 O0dlO9 KISI inrei S|N80i1 


S.1 


A'iiAR') 






8C|U8porin 9 


O.I 


4Z401 f 






nomo sapMUis cuiv\ riJ iciiKf ns, cnno Rtf\ 


<! 1 
3il 








n 




430832 


A1073913 


Hs.100686 


EST&k Vlfoddy similar to secrstBd cernenl 


5.1 


435070 


AK000073 




gbiHomo sapiens cONA FU^66 fis. done 


5.0 


416969 


A1815443 


Hs.283404 


organic caOon transporter 


5.0 


444929 


Ai685841 


Hs.161354 


ESTs 


5.0 


453922 


AF053306 


Hs.36708 


budding uninhiUled by benzinsdazoies 1 


5.0 


439031 


AF075fl79 




gb:Homo sapiens fuQ length insert cONtA 


5.0 


414539 


BE379046 




gb:601236646F1 NIH.MGC_44 Hono sapiens c 


5.0 


425349 


AA425234 


H&79686 


ribose 5-phosphato isofneraso A (liboso 5 


5.0 


449986 


AW864502 




gb:PM4-SN0016-120400mb12SN0016 Homo 


&0 


418717 


A1334430 


Hs^5984 


ESTs 


5.0 


438769 


AA830884 


Hs.163428 


ESTs 


5.0 


441859 


AW194384 


H5.128022 


ESTs. Weakly similar to R61 MOtJSE FIG-1 


5.0 


446469 


BE094848 


Hs.15113 


homogofllisato li^-dioxyganase 


5.0 



TABl£21B 



Ptey: Unique Eos probeset UentiAer number 
CAT number Gene cluster nunlber 

Accession: Genbank accession mimbeis 

Pkey CAT number Accession 

409615 114342SJ AW444861 BE074994BG074966BE074992 

409865 1156S18J AW502208A^2366AW502148 

409867 1156530J AW502161 AW502587AW502345 

410146 1178974J AW592655 R05927 R08916 

411094 1231982.1 BE0661 42 AW8 17074 

411231 12363S6J AW833501 AVV833508AW833722AVV833332AVV833509AW833511 AVV833767AVV8333^ 

411426 1245515J BE141714AW845993AW845989 

411897 1264907J AW875056AW875079AW875075AW875052AW875061 AW875074 

412323 1288770 1 AVV937143AW937150AVV937141AVW37151 AVV937132AVV937160AW937191 AW937174AW9371^ 
AVV937159AW937139AW937171AVV937142AVV937145AVV937165AVV937163AW937164AW937137A 
AW937156 AW937140 AW937135 AW937170 

412517 130281.1 BE271 584 AAl 12511 

414484 1452830.-1 BE314385 

414539 1460320.1 BE379046 BE395459 

414995 1611738J C18200D78681 T82025 

416719 1611345.1 H79731H79732 

422731 220507.1 AL138411 A1138412 AA315860 

«2810 221630.1 AA317400AA434584 

42^1 226553.1 Z43509H09001 AA375202AW9S4383 

425858 257265.1 AA364923 AW963483 BE182774 C21461 

426465 267664 1 AI758948AA379527AA378948AA379262AW963933 

430664 321423.1 AW969a34AAS28493AA483165AW969842 

432119 34170J T80289AF052168 

433279 381800.1 AWS71745AAS813S9AA581358 

433921 377350.1 AA618174AI114549 R35464 R36465 

434609 38950J R76593AF1 47390 R76594 

436070 41426.1 AK000073AA380163AA360181 AW963533 

437129 43343J AUM9327AA847105 

439031 46798.1 AF075079 H48601 H48795 

442438 542469.1 AA^998AI916584R61781T77332F07756F08149FD7647 ■ 

448865 78535J R35027R1 2034 BE407 120 

449034 794817.1 At624049AWl 17770 A{858360 

449986 821463.1 AW864502AW864369AI678780 

454392 115882.1 BH260893 AA078319 R85057 AW803024 H8S811 AA078293 

454692 1229118J AW81 3350 AW816082AW81 3476 AW813383 

455604 1337197 1 BEOl1l838B)l1170BE011333BE011188BBni18lBE011324BE011161 BE011169 

458091 472385.1 AF1S0286AA835857 

459081 889426.1 W07808At822066 

459142 918906.1 AI903396AI803361 AI9O3360 

TABt£21C 
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Ptey: UiA)U8nunibercoiTespondIngto8nGospiobeset 

Ref: Seqiiexesouice. The 7 (SgQ number in this cohinn are Gen^ 'DunhamLetaL'refeistothepubDcalnnenljQed'TtttONA 

sequence of human chromosome 22.' Dunham I. el al. Mature (1999) 402489495. 
Strand: Indicates DNA strand from which exons were predicted. 

Ntjxsition: Inificates nucleotide positions of predictBd enxis. 



PIcey 


Rof 


Strend 




400855 


1931571 


Minus 


17601-18228 


400995 


8099094 


Phjs 


141186-141601 


401519 


6649315 


Pius 


157315-157950 


401723 


7656694 


Plus 


147273-147503 


402104 


8119072 


Pho 


122409-122600 


402430 


9796372 


Minus 


62382^2552 


403378 


9438244 




^4264^^43 


403381 


9438267 


Minus 


26009-26178 


403383 


9438267 


Minus 


119837-121197 


403485 


9966528 


Plus 


2888^001.3198^38554117 


403824 


9798468 


Plus 


47^7 


404220 


6706820 


Plus 


461D7-46439 


404727 


8081050 


Plus 


115534-115747 


404767 


7882827 


Minus 


23244-23759 


405291 


3845420 


Rus 


19999-20473.20672>21038.21147-21265^1 378-21667 


405392 


6624069 


Minus 


116167-116289.11B879-119030 


406086 


7107817 


Plus 


9418-9573 


408270 


7534217 


Plus 


13138-13591 


4064X 


9256298 


Plus 


1553-1712.1878-2140.42524385.99228077 


405536 


7711478 


PhB 


25655-25782 



TABLE 22/V: 430 SIGNiFICANTlY DOWN-REGULATED GENES, UTERINE CANCER VERSUS NORMAL UTERUS 

Tdble 22A lists atMut 430 genes significantly down-regulated In uterine cancer compared to normal uterus. These were selected as fior Table 21A, except that the numerator and 
denominator were switched, and the latio was greater than or equal to 14 (JLe. 144bld down-regulated In tumor vs. nomud uterus). 

Pk^ Unique Eos probeset identifier number 

ExAocn: Exemplar Accession raimber, Genlnnk access i on ntnntier 

UnigenelD: Unigene numtwr 

Unigene Titte: Unigene gene title 

R1 : RdOo of tumor vs. nomia) Gssue 



Pkey 


ExAocn 


UnigenelD 


Unpens Title 


R1 


414063 


H26904 


Hs.75736 


apoltpoprotetn D 


93.0 


447990 


BE048821 


Hs.20144 


small indudtde cytoldne sutufamiiy A, member 14 


75.7 


407815 


AW373B60 


K8.301716 


ESTs 


68.7 


452547 


AA335295 


Hs.74120 


adipose specific 2 


61.1 


415165 


AWB87604 


Hs.78065 


complement con^xxient 7 


55.1 


453655 


AW960427 


Ks.300878 


ESTs. Moderately 8tmil8rtoTGR3JtUMANTGF-eE 


64.0 


429350 


AI75463* 


Hs.131987 


ESTs 


52.6 


407228 


M25079 


Hs.155376 


hemoglobin, beta 


52.0 


425869 


AA524547 


Ks.160318 


FXYD dom^n-contdning ton transport regulato 


51.6 


416585 


X54162 


Hs.79386 


ieiomodin 1 (smoolh musde) 


61,4 


408614 


AL137698 


Ks.46531 


Homo sapiens mRNA; cDNA DKFZp434C1915 (6om c 


49.7 


417642 


J04129 


Hs.82269 


progestagen-assodated endometrial prolan (p 


49.3 


412295 


AW088626 


Hs.22971 


ESTs 


48.0 


421998 


R74441 


Hs.117176 


poly(AH)inding protein* nuclear 1 


47.0 


452093 


AA447453 


Hs^860 


Homo sapiens mRNA; cDNA |}KFZt)S86M0723 (!rom c 


46.7 


429707 


W76631 


Ks.2118ig 


matrix metalloproteinase 23B 


45.7 


416950 


AL049798 


Ks.60552 


demiatoponSn 


45.6 


408221 


AA912163 


Hs.47447 


ESTs 


44.6 


408791 


AI220684 


K3.272572 


hemoglobin, alpha 2 


43.0 


446500 


U78093 


Hs.15154 


8Ushi-fepeat<onta!ning protein, X chromosome 


426 


407938 


AA905097 


HS.8S050 


phosphoiamban 


41.1 


410677 


NM-003278 


KS.6&424 


tetrsnedin (plasminogen-binding proti^) 


41.0 


412524 


AA417813 


Hs.11177 


ESTs 


39.4 


452426 


At904823 


H3.31297 


Homo sapiens cONA: FU23001 fo.ctoneLNG002 


38.6 


414290 


At568801 


H5.71721 


ESTs 


38.2 


439627 


BE621702 


Hs.29076 


Homo sapiens cDNA: aJ21841 Us, done HEP018 


38.0 


400258 




Ks.79064 


deoxyhypusine synthase 


37.0 


414607 


AI73B616 


Hs.77348 


hydroxyprostaglandin dehydrogenase 1&<NAD) 


36.1 


410023 


AB017169 


Hs.57929 


snt(0rosophn8}homo{og3 


35.4 


407663 


NM.016429 


Hs.37482 


C0PZ2 for nondaihiin ooat protein zetaCOP 


34.3 


410288 


AI739159 


Hs.61898 


DKFZP586N2124 protein 


33.8 


418986 


AI123555 


Ks.81796 


ESTs 


33.1 


409060 


AI815687 


Ks.50130 


neccfin (mouse) homotog 


33,1 


436569 


BE43g539 


Hs.279837 


ghitalMone S-transferase M2 (musda) 


328 


420674 


NI4.0000S5 


Hs.1327 


butyiyldulinesterase 


32.6 


417967 


BE244373 


Hs.1119 


nuctMT receptor subfamBy 4, group A, member 


32.4 


450810 


. BE207588 


Hs.25511 


transforanino growth factor tieta 1 induced tra 


31.7 


438150 


AA037534 


H&300B78 


ESTs. Moderate^ Mar toTGR3JHUMANTGF-fiE 


31.6 
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430468 NM.004673 Hs.130699 ESTs 31.5 

453060 AW294092 HS.21S94 ESTs 3U 

424206 NM_003734 Hs.138241 cmiro oxidase, copper contdntng 3 (vascdar 30.8 

422128 AW973784 Ks.112028 MisshapeiWNIK-fetated kinase 30.5 

5 406082 S47833 H$.82S27 adenosine nwRoptosphaie deaminase 2 fisoform 30.3 

42163S NVL012082 Ks.297921 Homo saptensniRNAfuillengtt) insert CDNA do 30.3 

402S20 29.9 

418043 AW377752 Hs.83341 KsapiensmRNA (or tyrosine kinase receptor 29.7 

443906 AA348031 Ks,7913 ESTs 29.7 

10 450958 AL137669 Hs.25700 iiomosa|]lensml^cOKAD]<l=Zp434M0435(fiDmc 29.4 

418823 AF020774 HS.B8844 Homo sapiens hair and sIdnepidennaMypa 12- 29.4 

412B28 AL13339S Ks.74621 prion protebi(p27-30)(Cretitzfeld>)ala)bdise 29.4 

429507 N^4.003102 Hs.2420 superoxide disnvjtase 3. extraceOuiar 29.2 

400545 29.1 

15 425078 NM.002599 Hs.154437 phosphodiesterase 2A.c6IwlP.stlnuilated 29.1 

429942 A)336993 Hs.134535 ESTs 28.9 

438303 AB0289g8 Hs.6147 K1AA107S protein 28.7 

419971 AA400027 Ks.296234 ESTs, Highly stmilar to mitogeo^cQvated pro 28.7 

443060 D78874 Hs.8944 procollagen Oendopeptidase enhancer 2 28.7 

20 452877 A1250789 Hs.32478 ESTs 28.6 

412442 A)983730 H3.26530 serum depiivaQon response (phospha&dylsefin 28.6 

424378 W28020 H8.184367 GTPase activating protein^ilw 28.6 

421623 N40850 H8.2862$ ESTs 27.9 

447786 6^20810 Hs.39619 hypothelfea! protein LOC57333 27.5 

25 400023 AFFXcontJOl:18Sril)OSomaIRNA 27.5 

453874 AW591783 Hs,36131 co«£een. type XIV. alpha 1 (undulb\) 27.2 

414134 X60188 Hs.861 niitogen-activated piotan Idnase 3 27.1 

428451 AWS70451 Hs.g8570 ESTs 26.9 

435520 AA297990 Hs.9315 HNOEL-lso pmteb 26.6 

30 437179 AA393508 Hs.171409 serologically defined colon cancer andgen 8 26.4 

441481 AA935303 Hs.270553 ESTs 26.0 

450227 BE388192 Hs.78S21 Homosapiens cDNA: RJ21 193 Us. clone COL001 25.6 

403731 25.5 

452814 A1092790 Hs.65016 hypoQietical protein RJ21935 25.5 

35 410036 RS7171 Hs.57975 calsequestrin 2. caidlac muscle 25.5 

416854 H40164 Hs.80296 Puildnie ceB protein 4 25.4 

418421 RS8620 Hs.85050 phospholamban 25.4 

407000 U12139 gb:Humandipha1(Xl)ooIlagen(COL11A1)gene. 25.3 

421803 NM.012205 Hs.108441 34iydroxyanthranflale3,4<Boxygenase 25.3 

40 445613 BE550889 Hs.158491 ESTs 25.1 

432302 AA345857 H3.274307 KiAA1442 protein 24.8 

420796 134355 Hs.99931 sarcoglycan. alpha (SOkDdystrophii^associata 24.8 

423720 AL044igi Ks.23388 Homo sapiens cONA:FU21 310 lis. c(oneCOL021 24.7 

417302 BE245812 Hs.8941 ESTs 24.6 

45 421913 AI93438S Hs.109439 osteogiydn (osteoinducthre fecbir, irAnocan) 24.6 

440130 AI083899 Hs.157527 ESTs 24.5 

431967 AJ243653 Hs.283404 organic cation transporter 24.5 

424580 AA446539 Hs.35092 ESTs 24.4 

406907 Z25427 gb:lisapiensprotein^serine/lhreonine kinase 24.2 

50 443746 AB03g670 Hs.9728 Al^l protein 24.1 

429101 AW452174 H3.173780 ESTs 23.5 

410691 AW239226 Hs.654$0 r9b'ciilon4 23.4 

408853 AW291484 Hs.254957 ESTs 23.3 

407979 AA046306 Hs.62927 ESTs 23.1 

55 448619 A1867182 HS.2022S6 ESTs 22.8 

424585 AA464840 gbz](43h11.r1 Soai8sJol8Lfeius.Nb2HF8.9wHo 22.7 

407891 AA486620 H8.41135 Endomucin2 22.6 

407198 D11747 Hs,177415 Rnkel^lskis-Reiiiy murine sarcoma virus (FB 2Z5 

426990 AL044315 Hs.173094 Homo sapiens mRNA;cDNAI)i(FZp564H142 (from d 2Z6 

60 450493 M93718 Hs.166373 nitric oxide synthase 3 (endothelial ceii} 2^1 

420120 AIJ049610 Hs.95243 transcription eiongaOontactor A (SiiHke 2^0 

423690 AA329648 Hs.23804 ESTs 2Z0 

402865 21.9 

,^ 417387 AW021102 Hs.21509 ESTs 21J 

65 456698 NM.001928 HS.155S97 D component of complement (adipsln) 21.9 

459722 Homos8piensd3NA:aJ23449fis.doneHSt0S6 21.8 

42^ AW247388 H8.301423 caldum tmding protein 1 (cdbr^) 21.8 

402195 21.7 

418213 AW978753 Hs.l27327 ESTs 21.6 

70 440274 R24595 Hs.7122 screpte responsh« protein 1 21.6 

455818 Ai733747 gb2n86d04.y5Str^agenehmgcaicinonia 93721 21.4 

420861 AI039044 Hs.88827 Homo sapiens mRNA^FU00033 protein, parii 21.4 

405228 21.3 

441292 AF131218 Hs,7765 chromosome 16 open reading frame 6 21.3 

75 432553 AA553334 Hs,211095 ESTs 21.3 

417098 AB017365 Hs.173859 frizzled (Drosophila) homotog 7 21.2 

453642 A1370936 Hs.34074 dipepSdytpeptkJaseVi 21.2 

405313 21,1 

410243 D83402 H5J289006 ESTs. Weakly similar to altema&velyspCced 21.1 

80 413186 AU077141 Hs.75231 sohits carrier family 16 (monocarixuvRcacU 21.1 

425954 AK000633 H3.164476 hypo8»8ca) protein FU20826 21.0 

421770 AA374ia2 - Hs.108124 ribosomal pnM L41 21i) 

43S265 AA779956 H&18S932 ESTs 20.8 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



430036 
430233 
436130 
434843 
429303 
442422 
4103S9 
435869 
447384 
440610 
445806 
433657 
436467 
440191 
417511 
406976 
443547 
417998 
419313 
408322 
448422 
403121 
424198 
459060 
457829 
445029 
424362 
417067 
413972 
435891 
447551 
400637 
409882 
430310 
402741 
401703 
409229 
453856 
430342 
404033 
411939 
431227 
452669 



AL050284 

AW367902 

AA341497 

R43707 

AW137635 

Ai344415 

BE068889 

AF255910 

A1377221 

Ai733098 

AL137516 

AI244368 

AW450276 

AI990417 

AL049176 

IV[50299 

AW271273 

AW957420 

AA843387 

AW181985 

BE263813 

AB029010 

H89244 

AI742291 

AF196481 

AL137646 

AJ0O1417 

BE279548 

AW249394 



416253 
418556 
408877 
415994 
417054 
404654 
420174 
400625 
406150 
457635 
420105 
404619 
423282 
42m 
448543 
427605 
406535 
418947 
414323 
457111 
418373 
424461 
451565 
407751 
432031 
404608 
451962 
424100 
451509 
453512 
429924 
423780 
427030 
439872 
407836 
451427 
424462 
451533 
422319 



AJ243191 
U60115 



K60333 

AA804789 

Nl^005938 

AI365585 

X63755 

AA2183$3 

AW779654 

BE250659 

T02850 

AA47S033 

NM.002923 

AF017060 

Aia24144 



AW015571 

BE514535 

AL137563 

M13981 

AW897741 

NIA.000997 

W52g90 

NM.014759 

AA4820Z7 

AW760770 

D83542 

NM_000897 

BE2760g6 

AF039196 

H5e589 

AWD78832 

A1793080 

A!g69529 

AL040160 

W39693 

AA352013 

AA397600 

T81058 

T79340 

Ai091441 

AU076666 

NH.004857 

AW403342 



H$.227782 OKFZP586M1019pretBln 

Hs.236443 Hocm sapiens aiRHA:cONADIQ7p564N1063{lh)m 

HS.3140B ESTs 

Hs.133159 ESTs.WBa)dystni!lartoPIHUSOs^pcdr 

Hs.44238 ESTs 

Hs.156082 ESTs 

Hs.63236 synudeln, gamma (breast cancer-spedOc prol 

HS.546S0 ESTs, Weakly similar to (defline notavailabi 

Hs.40528 ESTs 

Hs.130800 ESTs 

Hs.13323 hypotlielical protein aJ22059 

Hs.8124 PHdonnatncontabtiiig protein bireSnal 

Hs.91661 ESTs 

Hs,116107 Horno sapiens genorlfcDN^ch^ol!10SO^to21q, sec 

Hs.82223 diordln-iao 

gbiHuman dtpha-1 ocnagan type II gene, exons 

Hs^767 ESTs 

gb:EST379495 MAGE resequences. MAGJ Homo safd 

Hs^7279 ESTs 

Hs.249986 ESTs 

gb:601194177F1 NiHJAGC.? Homo sapiens cDNAd 

H5.143028 K1AA10S7 protein 

Hs.79625 heteiQosneousnudearribonucieoproteinD(AU 

Hs.210843 ESTs, WbaMy similar to dJ1039K5.2 [Helens 

H8.12256 midline 2 

Hs.146001 Homo sapiens mRNA; i:ONA DKFZp586F0824 (from c 

Hs^1086 solute canier family 22 (extraneuionalmonoa 

Hs.162717 ESTs, Weakly similar to HPPD_HUMAN4-HYDR0XYP 

Hs.5002 copper chaperene ffx superoxide (Ssmutase 

HsS29 fliyosin, hea^f p6lypep6de 7, cardiec musde, 

Hs^4 lieatstrack 27kO protein family, member 7 {car 

Hs^9069 fourandaltalfUMdoniainsI 



Hs.251928 nuclear pore complex Interading protein 

Hs.19447 Homo sapiens mRNA for FU00106 protein, pvti 

H&23g663 myelold/lymphoidormlxed-lineageieukemia(t 

Hs.146246 ESTs 

Hs^43 iterafin, cuSde. ultral^h $u)p})ur 1 

H5^62958 EST8.Weai(|yMartoaItemat]vely'8pOoed 

Hs^5876 ESTs 

Hs.15463 ESTs 

gb:i^12A9 Fetel brdn. Stratagene Homosapien 

Hs.130315 ESTs 

Hs.78944 regulator of &piot^ signaBtng 2. 241(0 

Hs.174151 aldeiiyde oxidase 1 



Hs^2 ESTs 



Hs.192375 ESTs. Highly 8im{lartodJ127B20.3(Ksap{ens 

Hs.32244 ESTs 

H8.77171 minichromosome maintenance defldent(S. cere 

Hs.126378 putaSve ABC transporter 

Hs.1734 inhibln.dpha 

Hs^1380 Homo saptens dma cONA DKFZp586P1 124 (from c 

Hs,179779 ribosomal protein L37 

H8.22860 ESTs 

HSJZ39500 IOAAa2739eneproducl 

H5.142569 ESTs 

Hs.84344 CGI.135 protein 

H8.148090 cadherin 15, M^aflierfn (myotubute) 

Hs.456 teukotrtene C4 synthase 

Hs.38205 taHst^cydifHtependent kinase 2 IntefacB 

Hsj;841 26 hairtess (mouse) homolog 

Hs^156 Homo saptens cONA FU11027 lis, done PLACE10 

Hs^6806 ESTs 

Hs.1 23525 ESTs, Weakly similar to NGAL RAT NEUTROPHIL 6 

Hs.171637 Homo sapiens d)NA: FU21937 fis, done HEP044 

Hs.209542 ESTs. Weakly sintiiar to Been linker protein 

Hs^6138 Komo8Sf)fensnRNA:cDf4AOXF2p566H24»(1iomc 

gb:EST59935 Infant brain Homo saptens cONA S 

Hs.97531 ESTs 

gb7d26c08.r1 Soares fetal Over spteen 1NaS 

Hs.22575 Homo sapiens dDUfic FU21042 tls. done CAE1 12 

KS.264D1 bmnornecnjsls factor (|igand)Guperfandy.m 

Ks.148101 serum consGluent protein 

Hs.26CT0_ aerun deprfvaSon response (phosphaBdyfeerbi 

KS.11S232 spQdng factor 3a sirtNndlZ66kO 
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20.1 
20.1 
20.1 
20.1 
20.0 
20.0 
19.9 
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19.5 
19.4 
19.4 
19.4 
19.3 
19.3 
19.3 
19.3 
19,2 
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18.9 
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18.9 
18.9 
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18.8 
18.8 
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1&6 
18.6 
18.5 
18.5 
18.5 
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m 
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18.1 
18.1 
18.1 
18.1 
18.1 
18.0 
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400489 
454421 
449282 
420495 
429790 
42279S 
427980 
409543 
440206 
455904 
427707 
437140 
417637 
419171 
417808 
426232 
440747 
415307 
407049 
454054 
41108S 
443104 
4241G6 
446716 
448677 
434819 
401171 
456804 
453621 
413419 
426515 
428937 
424562 
444655 
447424 
425439 
446707 
405324 
434340 
422942 
421820 
420037 
428818 
426485 
404847 
412677 
401551 
408053 
^016 
418179 
416994 
457514 
426275 
457924 
430712 
455144 
407524 
426712 
429954 
446208 
442792 
420485 
426767 
436950 
415198 
442197 
433457 
402316 
409736 
407964 
433677 
425507 
413724 
408922 
413055 
435977 
442208 
402426 
412399 
413200 
404597 
453143 



BE409759 Hs,59563 Homo sa|tensmRNA for FU00007pfDton, parti 

AL048056 H8.23437 Homo sapiens cONAFU13555te. done PLACEIO 

AI338247 Ks.98314 Homosa|densmRNA;cDNADKFZp58ai)120(fiomc 

AK001352 Ks^1737 hypofheOcal protein FU1 0490 

AW897265 gb<WNN0057.1504aM3S«04 NN0057 Homo sapi 

AM18305 ob3v96g05^1 SoaresJlhHMPu SI Homosaptensc 

AW410200 gb.-fli05b12jc1 N!K>MGC_17 Homo sapiens cONAd 

AI762232 Hs.46794 ESTs 

BE156173 gb:QVD4fr0367-201299«79^02 HT0367 Homo sapi 

NM_005578 Ks.180398 UMdofndnHXffOalnlivpfefemdtransloc^ 

AA312799 K5.283689 acfivdoroTCREM in testis 

AA204969 Hs.234863 Homo sapiens cONAFU12082fis, done HBdSBIO 

NM-002846 H5.89555 protein tyrosine phosphatase, receptor type, 

AF177g09 Hs.12828 tuieety (Drosophila) homotog 1 

Z70024 Hs.168157 nodeartranscitpaon factor Y, gamma 

AW297226 Hs.137840 ESTs. Moderately sim9ar to SIX1.HUMAN HOMEOB 

F05232 Hs^7495 prostate cancer assodated protein 7 

X72632 (NONE) 

AI336329 H5^01S19 Homo sapiens cONARJ12S36fi8, done NT2RliM0 

AR)22991 H5.68398 period (Drosophila) homolog 1 

AA088470 Hs.83135 p53Hesponsh«gene6 

AA412442 Hs,98132 ESTs 

AA436575 Hs.16602 ESTs 

AiS60769 H5^27051 ESTs 

A1821740 Hs.116531 ESTs 

AA360954 Hs^8 Homo sapiens mRNA;d>lADKFZp564N198 (from d 

A1421645 H$.139851 caveorin2 

AW7499B3 gb:QV3-BT0537-280100-070«04 BT0537 Homo sapi 

6E093888 ' Ks.48938 Homo sapiens cONA: RJ21802 fis. done HEP007 

BE384222 Hs.231444 ESTs 

T82221 Hs.56729 lymphocyte^pedfic protein 1 

Ai420859 HS.15Q557 t}8sic transcription eioment binding protdnl 

AFOSSaae Hs.11590 calhepsInF 

A1681105 Hs.181641 ESTs 

D38024 Hs.157425 dout)le homeobOR, 2 

AI591214 Hs.156336 ESTs 

A1193043 Hs.128685 ESTs 

AF054839 Hs.122540 tetraspan2 

AW662990 Hs.108675 h8m&4}|nding protein 

BE299598 Hs.135569 ESTs. Weaidy simiiar to NEUROO |H.sapiens) 

AI131291 Hs.98866 ^8 

N^006207 Hs.170040 plalfilelKterived growth factor receptor^ 

AW029608 Hs.17384 ESTs 

AW139474 Hs.246882 ESTs 

AA376049 Ks.154162 AOP-ribosyialionM)r-fiice2 

X51630 Hs.1145 VWims tumor 1 

AA296520 Hs.Bg546 Seledin E (endottidia) adhesion motecule 1) 

AA775208 H3.136423 ESTs 

BE151551 gb:RC0.HT0297-201 19W)31-fl2 HT0297 Homo sapi 

AL390142 H8^8697 Homo sapiens cONA FU13661 fis, done THYRO10 

AW044647 Hs.196284 ESTs 

AW875942 ob:CM1-PTG013-131299-067-b10 PT0013 Homo sal 

X64985 gbiRsapiens mRNA HTPCRX11 forolfadoiy race 

AW173177 Hs.197755 ESTs 

Ai918130 Hs^1374 ESTs 

BE258323 H5J225795 ESTs,HighiysimBartoOTX1jnjMANH0ly!EOB0XP 

AI352340 Hs.131194 ESTs 

AF218586 HslB8835 oelldeath^dudngDFFA^IceeffedOfb 

AA384398 Ks.192491 ESTs 

L05779 Ks.113 epoxide tiydrolase 2. cytoplasmic 

AK000150 Hs.78185 MAX-liice bHLHZiP protein 

AW837912 flb:QV3.LT0048.26010(M6ft<02LT0048Homoiiapl 

AA830194 Hs.199417 Homo sapiens mRNA for nJ00027 protein, parti 

AA078828 ^7P07H07 Chromosome 7 RacentafcONAUbrar 

AW130334 Ks^in ESTs 

AI791912 Hs.190885 ESTs, Moderately simiiar to ALULHUMAI4 ALU SU 

A1684745 Hs.165983 hypothetical C2H2 zinc finger protein FU2250 

AA131466 Hs.23767 Homo sapiens cONAFU12666Bs, done NT2RM40 

RB7388 gb7Tn88g04.r1 Soares adult brain N2b4HB55Y Ho 

AV655701 Hs.75183 cylDdvomeP450,8uMBm{lyiE(eth8nQHnduc 

A1138079 Hs.5012 brain^pecificniembrane-andioied protein 

AW2g6984 Hs,255596 ESTs, Wealdy similar to PSF_HUMAMPTB^SOaA 

N53816 Hs.14394 hypotheOcai protein FU201 57 

AA127395 H8JZ22414 ESTs 

AA382234 Hs.170121 pntdn tyrosine phosphatesa. receptor type. 

BE177442 flbdlC14fT0595-200400412«1 HT0595 Homo sapi 
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416193 


T25400 




gb:PrHI059HTaiL1 Homo sapiens cDNA 5^0 Sim 


15.9 


407065 


Y10141 




gb:Ksap)ens 0AT1 gene, VNTR 


15.9 


441785 


AW138139 


Hs^44598 


ESTs 


15.9 


413784 


BE165819 




gb:a«)4fT048&-2203a)^1-<f12 HT0486 Homo sapl 


15.9 


429092 


AI190864 


Hs.178226 


ESTs. Weakly wrStat to ALUIJftJMAN ALU SUBFAM 


15.8 


408499 


AW205323 


Hs.253475 


ESTs 


15.8 


453754 


AW972580 


Hs.172753 


ESTs 


15.8 


450828 


U43030 


Hs^5537 


CanUOowpTuIi 1 


15.8 


428486 


AWS63497 


Hs.184604 


pancresfic polypepUdG 


15.7 


405895 






15.7 


409108 


AA339443 


Hs.48793 


ESTs 


15.7 


423334 


AK000906 


Hs.1 27273 


hypolheGce) protein FU10044 


15.6 


422948 


AVV810824 


Hs^351 


ESTs 


15.6 


447852 


AW504781 




gbaJI-HF-BN0-alrvc-04^Jl NtHJIiGCJOHomo 


15.6 


419084 


AA496539 


Hs.179902 


transportBf-Cka protein 


15.6 


456771 


AW016739 


HSJ32201 


ESTs 


1&6 


438564 


AA381553 


H5.198253 


msjv histprompflrtibtiny oompleXi dsss lit D 


15.6 


448705 


K05072 


K5.124984 


ESTs, Moderately ski^ to unnamed protein p 


15.6 


454460 


X66945 


Hs,748 


fibroblast growth factor receptor 1 (Irns-rda 


15.5 


458893 


BE161733 


H3.97283 


ESTs, We^ shnaar to ALU1 jiUMAN ALU SUBFAM 


15.5 


426759 


AI590401 


Hs.21213 


ESTs 


15.5 


453769 


R3S281 


Hs^4947 


ESTs 


15.4 


434179 


AI743448 


Hs.116177 


ESTs 


15.4 


404111 








15.4 


402056 








15.4 


458602 


AI262208 


Hs:E76489 


ESTs 


15.3 


427530 


AA405093 


Ks.1265ig 


ESTs 


15.3 


414716 


AF199598 


Hs^044 


Kv channel-interacting protein 2 


15.3 


400632 








15.3 


443918 


AA305475 


H3JI2660 


Homo sapiens cONA FU11658 fis. done HEMBA10 


15.3 


432037 


AW450592 


HsJ00459 


ESTs 


15.3 


412921 


BE009345 


Hs.128942 


ESTs 


15.3 


421905 


AI660247 


Hs^26g9 


ESTs. Weakly similar to LiV-1 proton (H^pl 


15.3 


441704 


AI458766 


H9.201988 


ESTs 


15.3 


414272 


Ai651603 


Hs.46988 


ESTs 


15.3 


448224 


R4d700 


H5^0733 


EH-doma!n containing 2 


15.2 


404611 


H56589 


Hs.35156 


Homo sapiens cONA FU1 1027 lis. done PIACEIO 


15.2 


448381 


D61580 


Hs.21036 


Homo sapiens ntRNA; cONA OKFZp434A1010 (from c 


15.2 


454719 


BE006547 




gb:RC2-BN013a040400411-b03 6ND130 Homo sapl 


15.2 


446973 


H95724 


Hs.4283 


ESTs 


15.2 


457760 


AA668123 


Hs.1 34170 


ESTs 


15.2 


440144 


AW082297 


Hs.88523 


ESTs 


^S2 


407387 


ABQ00895 




gkHomo sapiens mRNA far cadherln RBI, parii 


15.2 


427850 


AA416756 


Hs.161051 


ESTs. Modtfs^simltartoAIXimUMAN ALU SU 


15.2 


404244 








15.1 


402959 








15.1 


435487 


W07343 


H8.182538 


phosphol^ scramUase 4 


15.1 


414213 


BE297765 




^601176246F1 NIHjyiGC 17 Homo sapiens cOMAc 


15.0 


455916 


BE156710 




gb:GM>41TO368^1030O-18t<H)1 HT0368 Hbmosapi 


15.0 


448943 


AI608810 


K$.193288 


ESTs 


15.0 


418026 


BE379727 


Hs.63213 


fatty add binding protein 4. adtpocyte 


15.0 


454082 


AF283508 


Hs.63168 


cell death regulator aven 


14.9 


453308 


AW9S9731 


KsJ2538 


ESTs 


14^ 


458823 


AW207574 


H1179501 


ESTs 


14.9 


452532 


AI905811 


H3.1 10757 


DNA segment on diromosome 21 (unique) 2056 ex 


14.9 


416464 


Ra7580 




gb:ym89h07.r1 Soares adult br^ N2b4HB55Y Ho 


14.9 


409473 


AL137716 


Hs^6567 


Homo sapiens mRNA; cDNA DKFZp434D2030 (from c 


14.8 


449779 


AA0042S8 


Hs^218 


ESTs, Weakly similar to ALUB_HUMAN Ull ALU C 


14.8 


457546 


AAS68484 


Hs.153632 


ESTs 


14.8 


403368 








14.8 


432163 


AK000440 


Hs.272799 


hypothetic^ prdlein FU20433 


14.8 


421531 


AA713505 


Hs.291769 


ESTs 


14.8 


428283 


AI439096 


H8^5832 


Homo sapiens mRNA; cONA DKFZp564P1 16 (from d 


14.8 


443528 


AK001778 


Hs.9547 


hypothetical protein FU10916 


14.8 


402399 






14.8 


410545 


U32324 


H&&4310 


interteukin 11 receptor, alpha 


14.6 


450300 


ALJ041440 


Hs^lO 


ESTs 


14.6 


403552 








14.7 


406929 


U04690 




gbi^man olCactory receptor (OR17''210) gene. 


14.7 


436365 


AW44454d 


Hs.163118 


ESTs 


14.7 


402S50 








14.7 


441782 


AW140126 


Hs.132357 


ESTs 


14.7 


415672 


N53097 


Hs.193579 


ESTs 


14.7 


430582 


AI215509 


Hs.143964 


ESTs 


14,7 


425770 


NM.014363 


Hs.159492 


spastic alasda or Chaderabc-Saguensiy (sacsh 


14.7 


432683 


AW99S441 


Hs.10475 


ESTs 


14.7 


441871 


AI308150 


HS.15345D 


ESTs, Wbddy slntDar to 19091 23A Na glucose c 


14.6 


447481 


AF052151 


Hs.16686 


Mouse Mammary Turnvr Vims Receptor homolog 


14.6 


405114 






14.6 


401082 








14.6 


454316 


AW366144 




gbKI^HT0101-C610994)32-b12 HT0101 Homo sapi 


14.6 


421572 


AA531607 


Ks.125143 


ESTs, simlar to P012 MOUSE RETROVIRUS 


14.6 


424591 


R55704 


HS.150S68 


hypocreSn (orexbi) leceptor 1 


14.6 
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441503 




ns.iii«kAic 


416199 


R83537 




420360 


U83171 




425126 


N32759 


ns. 1 1 


417421 


ALl 38201 




405100 






454012 


M76424 


Ue X7MA 


402457 






454613 


AW810814 




429821 


l\LMSJQi'tS 




431073 


BE254470 


l1Si^4illOD 


421143 




Ue tn9171 


401223 






438827 


ntUOf JoO 


ns. 1 £0'* f o 


407124 


R08160 




437217 


AW77Q9^1 
nvn fVtH 1 




427627 


FB75S2 


Uc ITQQIi: 

ns.i f 9910 


410258 


X52638 


ns./ J9 


413237 


AMCfiC?/ 


nS.ir 19D5 


412975 




Ue 7(;inR 

rl5.f DIUO 


426488 


X03350 


ns.4 


416667 






405479 






418432 


M141<ifi 

nl 1 M 1 UU 


Ue flCii9 


426316 






412171 


AWRQ74<i9 




447241 


DCww40J0 


Ue 10^99 


402100 






438286 


AW139266 


nSii JWUf 


407947 




nS.lvKoor 


402275 






402358 






439624 


AA838771 




444455 


AI149879 


R« 17<V)9d 

rw. 1 f 9v^*l 


455314 


Y17114 


Ue 7T3Q9 


427872 




Hs.21111 


409826 




Ue UAttJ 


414002 




UeTCCTfl 


442682 


Ai01454S 




457033 






410480 


DQ74C7 




401007 






4S8274 


AP1iQ9q7 


Ue flnn7 


••□•HUD 


UIgOOf 




432928 




Ue 4QC4C7 

nS.1oo4o7 


425352 




Hs.1697 






U« 970C00 


434927 


H46612 


HS.293B15 


404282 




422561 


N^L016339 


Hs,118562 


424823 


N^^006226 


Hs,153322 


408107 


AA806754 


Hs,62835 


401577 






433883 


AI925686 


Hs.222312 


408104 


AW972927 


Hs^93968 


404642 






400675 






408059 






448386 


AB037750 


Hs.21061 


407287 


AI676812 


Hs^16S8 



ESTs 

gb:yq120OBj1 Scares felal Over spleen INas 
smaO induciUe cytokine subfamOy A (Cys-Cys 
chorionic gonadotropin, beta polypeptide 
nudeer receptor subfamDy 4. group A. member 

carbonic anhydrsseVn 

9b:MR2-ST0129-201099meO1 ST0129 Homo sapi 
Homo sapiens mRNA; cONA 0KFZp434G153 (fromd 
cone-rod homeobox 

Inntunoglobulin superfanity containing leucine 
ESTs 

ESTs. WeaWy simflar b ALUlJiUMAN ALU Sl^FAM 
ESTs 

guanine nudeoSde binding protein (G protein 
&|}iosphofructo-2-ldnase/lrudoso-2,6-b^^ 

dusteiin (comptemenllysls Inhibflor, SP-40, 
alcohol dehydrogenase 2 (dass 1). beta potyp 
hypotheOcai protein FU20519 

insuMe growth factor 1 (somatometfia C) 
Human DMA sequence from clone 437G10 on cftfom 
gb:a/IO-NNOQ5a'150400-337-b08 NN0058 Homo ssf^ 
ESTs 

Homo sapiens cDNA FU12057 fis, done HEMBB10 
ESTs. Wbaldy similar to hTd^ [H^eplens] 



ESTs 

Homo sapiens cDNA: FU23447 fis. done HSI033 

eyes absent (OrosophUa] homolog 4 

ESTs 

hypothetical protein FU23412 

FBJ murine osteosarcoma vird onoogene homolo 

EST 

KIAA1605proldn 
cadherin13.H«adherin(he8tQ 

NA&5 protein 
ESTs 

ESTs, Moderately simter to ALULHUiyiAN ALU SU 
proopiomdanooorUn (adraROoorliootrDpin/ bet 
ESTs 

ESTs. Weakly similar to PLMJ1UMAN PHOSPHOLEIWIM 

Link guariine nucleotide exchange factor tl 

phosphd^taseCepsHon 

ESTs 

ESTs, Weakly simSar to B24264 prdln^fidi p 
ESTs 



K1AA1329 protein 



14.6 
14.6 
14.6 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.5 
14.4 
14.4 
14.4 
14.4 
14.4 
14.4 
14.4 
14.4 
14.4 
14.4 
14.3 
14.3 
14.3 
14.3 
14.3 
14.2 
14.2 
14.2 
14.2 
14.2 
14.2 
14.2 
14.2 
14.2 
14.2 
14.2 
14.1 
14.1 
14.1 
14.1 
14.1 
14.1 
14.1 
14.1 
14.1 
14.1 
14.1 
14.0 
14.0 
14.0 
14.0 
14.0 
14.0 
14.0 
14.0 
14.0 
14.0 
14.0 



TABLE 22B 



Picey: Unique Eos piobeset identifier number 
CAT number. Gene duster number 
Accessionc Genbank accession numbers 



Pkey 

408922 
409543 
409736 
412171 
413784 
414213 
416193 
416199 
417998 
416464 
418556 
422798 



CAT number Accession 



109017J 
1138723 1 
115189J 
1260759.1 
1389150J 
1426375J 
1577102.1 
1577561J 
171375J 

1759038_-2 R87580 
1767866.-1 T028S0 
221S00J 



R87388 R84328AA058916 

AW410200 AW409705AW411433 BE296766 BE270309 
AA078628 R09051 AA07B197 AA077334 AW748808 AW748807 
AW897452 220302 055805 D52877 060432 
BE165819BE165853 W01386 
BE297765 BE262061 BE302686 T8391S 
T25400 H26834 H44554 R73193 
R83537 W80940 H27368 
AW967420 AA210915 AA236991 AA210916 



AW897265AW897274AL119504AW897275AW897270AW897312AW897318AWB97317AA317240AW961361 
T06241 AA326794AU38130AW407975AW999277 #*tw«v«owi 
423780 231952.1 AA352013AA330878AA339379AW966303 
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424S85 
426275 
4Z7S80 
439872 
442197 
447852 
448422 
453621 
454316 
454613 

454719 
455144 
455818 
455904 
455916 
455B84 

TABLE 22C 



241151.1 AA464840AA343628 

263712^1 BE1S1551 AA373783 BE182852 BE008826 BE0Q8827 BE008781 BE008699 

285225.1 AA418305A)2&4351 

47823J T81058AL357200T70270 

535550.1 AW83791 2 AW837934 AA984475 AW997490 

73973.1 AW504781 BE620394 

762770_1 8E263813 BE253504 A)500202 BE251 145 

974526_1 AW743983AIJM5823 

1109350.1 AW366144 AW385t 54 AW366142 AW3661 51 AW366140 AW356155 BE141715 8E141718 BE141698 

1226904^1 AW810814 AW810787 AW810854 AW810773 AW810735 AW81 0785 AW810660AW81 0834 AW81 0674 AW81 0723 

AWB10681 AW810791 AW810844 AW8106S9 AW810676 

1230646J BE006547 AW815576 AW815311 AWB56304 

1254914.1 AWB75942AW858234AW875938AW875941AW858235AW875958 

137219.1 AI733747AA129802 

1382290.1 BE156173 BE156305 BE156196 

1382748.1 BE156710 BE156726 BE155712 

1397288J BE177442 BE177439 BE177445 BE177440 BE177448 BE177444 BE177433 



Ptcey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column ere Genbank Identifier (Gl) numbers. •Dunham I el al." refers to the pubOcaOon entilled "The DMA seauence of 

human chR)mD$ofn82Z* Dunham 1. el d., Nature (1990) 402:489495. 
Strand: Indfcales DMA strand from wHch exons were predicled. 

NLposifion: Indicates nucleotide positions of predicted e 



Ptey 


Ref 


Strand 


NLposlGon 


400489 


8954013 


Plus 


131475.131652 


400545 


9800107 


Minus 


124618-124881 


400625 


7228177 


^us 


117266-117441 


400632 


3818355 


Hus 


72875-73447,75874-76425 
68901-69507 


400637 


8894326 


Plus 


400675 


8118750 


Rus 


11223-11816 


401007 


8117333 


Minus 


140821-141050 


401082 


3242744 


Rus 


22937.23494.27577-27966 


401223 


8099088 


Rus 


148940^150214 


401551 


8096896 


Minus 


189824-190728 


401577 


9280797 


Minus 


139377-139674.141195-141281.142217.142340 


401703 


4826475 


Rus 


135-1229 


402056 


6084234 


Rus 


207002-207288 


402100 


8117697 


Rus 


133549-133792 


402195 


7589778 


Minus 


147901-148884 


402275 


2935596 


Minus 


31065-31233,33680.33771,34345J4411,38B90-39125,39779-39943 


402316 


7527774 


Minus 


10751-10919.18617-19052.22131-22328 


402358 


8886976 


Minus 


131788-132729 


402399 


1905915 


Minus 


24502.24666.24986-25102 


402426 


9796361 


Minus 


73590-73824 


402457 


9796782 


Minus 


16513.16577.16638-16926 


402520 


7596899 


Minus 


171761-171996 


402550 


76S2009 


Minus 


80413.80673 


402741 


9212200 


Minus 


18603-18760,19719-19890 


402865 


9716300 


Rus 


3197-3429,3722-3914,S795•5987.680^.6981.8653^1S^929^9660 


402959 


9368493 


Rus 


36729-37084 


403121 


9180223 


Plus 


40594258 


403368 


4388738 


Rus 


70286-70429.75165.75256 


403552 


6862638 


Minus 


117504-117882 


403731 


7543752 


Minus 


144000-144616 


404033 


8122195 


Rus 


7976^156 


404111 


9408736 


Rus 


161506.161781 


404244 


5672609 


Minus 


98173-98517 


404282 


2276311 


Rus 


61503^2205 


404597 


9958262 


Minus 


114369-114599 


404642 


9796810 


Rus 


102999-103145 


404654 


9797010 


Rus 


^75-6527 


404947 


7382205 


Rus 


29740-30105,30176-30412 


405100 


8076846 


Rus 


144114-144234 


405114 


8096938 


Minus 


97013-97560 


405228 


7248990 


Rus 


92234-95905 


405313 


3638954 


Rus 


68924-69093 


405324 


3342751 


Minus 


5475-5677 


405479 


6453391 


Rus 


1668-1844 


405895 


7677903 


Minus 


66990-67484 


406059 


9103984 


Minus 


1385&-14004 


406150 


9866026 


Minus 


59331-59701 


406S35 


7711477 


Rus 


83135^3362 



TABI£ 23A: 626 genes upiBOulaiBd in uterine cancer lelatho to nomial body 

T^le^li$lsBbout626genesupregulatedlnuterinecanoerfelalh peptides 
oranGbodies. These geries were setectedlinom 59680 probesels on the WA^melrb(Hu03Genechip array. Gene expresstondato for each probnetolriatoed from this analysis ' 
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was expressed as average Intensity (Al). a normalized VBtuereflec^^ The protein prodijcts of these genes often contain one or more domains 

indicative of have oncogenic (unction or of transducing intraceOutar signals, or of being modulatatrie by small molecules, peptides, or anSbodies (e.g. pkinase, deailHiomain, 7tm, 
phosphatase, or ion^lransportBi). Cettain predicted pnM 

Pkey: Unique Eos probeset idenfifter number 

ExAccn: Exemplar accession number. GenBank accession number 

UnlGenelD: UniGene number 

Pred.Pn)LDomalns: Certain predicted protein domains. AbbrewlaSons teed: TM, transmembrane domain; SS. signal sequence: =Y, very likely to contain: 

likely to contain; other protein domain abbrevf afions are from f>FAM (Nucleic Acids Research, 2002, 30:276-280). 
UnlGena Tills: UniGene gene title 

R1 9Sth percentile of uterine cancer Als divMed by Die 50lh percentile of normal OssusAb, where the 10th percenlBeofaB nomial tissue Als was subtracted from 

both the numerator and denominator 

1^; ExAccn; UnigenelO; Un^ene Title; Pred.ProlDomabtt; R1 

428330; tJ»524; Hs.2256; matrix metaDoprotebase 7 (matnlysln. ; Peptidase 1^10;; 35.1 1 

420440; NM_002407; Hs.97644; mam[nagk>bin 2; Uterogtobtn;; 22.80 

439335; AA742697; Hs.62492; NM.052883:Komo sapiens secretegloUn. fa; none:; 21.66 

425723; NM-014420; Hs.1593t 1; (fickkopf (Xenopus laevis) homotog 4; none;; 21.1 1 

421481; AW391972; Hs.104696; KIAA1324 protein; none;TM=M;SS=M; 20.20 

437938; Ai950087; Hs.369628; gb:wq05c02.x1 NCI_GGAPJ<id12Homosapien; noncnone; 19.83 

406687; M3112S; Hs.352054; matrix melaitoproteinase 11 (slromeiysin; hemopexln,f>eptidase_M10;; 17.68 

446619; AU076643; Hs.313; secreted phosphoprotein 1 (osteoponSn, : Osteopontin:: 17.60 

41B281; U09550: Hs.1154: oviductai glyoopiotein 1, 120kO [mucin 9; GlycoJl^ro_18;TiyNM:SS=M: 17.48 

431130; NM.006103: Hs.2719; HE4; epMidyniis^pecifk:, whey-addte pr, wap:TM=IU(SS=Y; 16.59 

400301; X03635; H5.1657; estrogen receptor 1; F4)ox.hormone,rec.zf.C4,OesLrecep,adh jlnc,keloacylwtPP*ln*f^^ 

syntCMA.E7.RF)eDNA^binding;TM=M;SS=N; 15.11 
419356; AI656166; Hs.7331; hypothetical protein FU22316: Asparaginase^^none; 15.90 
433222; AW514472; Hs.238415: dickkopf (Xenopus laevis) homdog 4; none,PH04; 15.39 
417931; W95642; Hs.82961; trefoB factor 3 (Intestinal): trebfl;; 1&39 

400284; ; ; NM_000125*:Homo sapiens estrogen recepto; hoTmonejeczf^«OesLrecep:T)ul>M;SS=M: 15.23 
456662; NM_002448; Hs.1494; msh (DrosophUa) homeo box homdoo 1 (lb; homebbox^none: 15.04 
438817: AI023799: Hs.163242: ESTs; none.none: 13.72 

453657; ALD80235; K5.35861; Ras-induced senescence 1 (RISI); none;TM=Y;SS=M: 1 3.67 
424687: J06070: Hs.151738; matibc metdtopratdnase 9 (gelaSnase 8; fh2,hemopexin,Peptidasa M 10;; 13.51 
458827; AW088642: Hs.97984: SRY (sex determining regkm Y>-box 17 (S; HMG box;TM=M;SS=>N; 13.44 
410001; AB041036; Hs.57771; kallikrein 11; trypsin:TM=M;SS=M; 13.41 

421445; AA913059; Hs.1 04433; Homo sapiens, done tMAGE:4054668. mRNA; toa.trans,K.tetra.asp; 13.27 
449048; Z45051; Hs.22920; similar to S68401 (cattle) glucose indue; LBmp;Tli^M:SS=M: 12.76 
436972: AA284679: Hs.25640: ciaudin 3; PMP2^Qaudin:TM=Y;SS=M; 12.59 
450693; AW450461; Ks.203g85; ESTs; Sema.ig.none; 1Z52 

415457; AW081710; Ks.7369; Homo sapiens testes spedfn: A2 homdog ; MORN,sugar tnTMsY:SS-M: 12.46 

413719; BE439580; Hs.75498; small Indudble cytokine subfamily A (Cy; IL8;; 12.23 

431629; AU077025: Hs.265827; interferon, alph&^ndudble protein (do; none;TM=M;SS=Y; 12.09 

417389; BE260964; Hs.82045: mldWne (neurite growth-promoting fador; PTN.MK:TM=M;SS=Y: U08 

407786; AA687538; Hs.38972; tetraspan 1; transmembran64;TM=Y:SSsM; 11.91 

444381; BE387335; Hs.283713; hypothetical protdn BC014245; (Jdlagen;TIifl=M;SS=M; 11.86 

446608; N75217; Hs.175622; ESTs; Amiadilk)_seg,HEATJ>BS;Ti;^M;SS=M: 11.72 

447835; AW591623; Hs.164129; ESTs, Weakly simiter to 138022 hypotheU; none,Ua.Gon: 11.59 

420181; A1380089: Hs.158951; ESTs; none,lg.pkinase.LRR,UWCT; 11.49 

451253; H48299; Hs.26126; daudin 10; PI^2^.aaudin,Peptidase_M1,K.tetra;TM»Y;SS"M: 11.45 

453968; AA847843; Hs.62711; High mobBKy group (nonhistone chromoso; HM6J»x,none: 11.42 

448133; AA723157; Hs.73769, fdate receptor 1 (adult); Fd3lejecMIP;TM=l«;SS=M: 11.37 

421552; AF026592; H5.105700; secreted frtzzleckelated protein 4; Fz,^^■R;; 11.08 

452367; U71207; Hs.29279; eyes absent (DrosophHa) homdog 2; Hydtolase;; 11.01 

409745; AA077391: : gb:7614E12 Chromosome 7 Fetel Brain cONA; 7teu1.zf^(M,fh3.8PRy,KRAB,zf^2,n«,zl«Jbox:TM=Y 10.96 

415138; C16356: H8.295944; tissue fador pathway inhibitor 2; KunitLBPTl.none: 10.91 

416658; U03272; Hs.79432; fibriTin 2 (congenitei conlradural ara; EGF.TB,granuiin,PSt,EB.TII^TM=lUl;SS=M; 10.81 

41 1558; AA102670; Hs.70725; gamma-aminobutyric acid (GABA) A recepto; Neur_chan_t£D.N€ur_chan_memb!TM?»Y;SS=M: 10.72 

438091; AW373062: Hs.351546; nudear receptor subfamBy 1, group 1. m; honnonejec,zf^,none; 10.66 

425071: NM.013989: H8,154424; debdlnase. lodolhyronhe. type II; T4jdetodin8se;TM4A:S8nY: 10.66 

430832; AI073913; Hs.100686; ESTs, WtaMy similar to JE03SO Anterfor ; none,none: 10.52 

451497; H83294: Hs.284122:Wnl Inhibitory fador-l; EGF.WIF;: 10.50 

421478; AI663243; Ks.g7258; ESTs. Moderately similar to S29539 ribos; none.none; 10.50 

409231; AA446644; Hs.692: GA733-2 antigen; epilheTial glycoprotdn; Ihyroglobulin 1;TM=Y;SSs|^; 10.35 

443785; AW449952; Hs.ig0125: basto-heIbtloo|>^PAS protein; HLH.PAS;TM=M;SS^N; 10.34 

409142; AL136877; H$.507S8: SMOl (stnidural mdntenance of chromoso; ABG_tran,M,SMG_N.SICC,DUF1&4,none; 10.34 

431846; BE019924: Hs.271580: uroplaMn IB; transmentaie4;TM=Y;SS=M; 10.34 

415539; AI733881; Hs.72472; NAME OMIHEO ... receptor kinase: pkinase,AdhrinjecpPDZ,ZU5.de8(h; 10.31 

411274; NM.002776; Hs.69423; kainkreln 10; trypsin;TM44:SS»N; 10.24 

423873; BE003054; H3.1695; matrix metaltoprotelnase 12 (macrophage ; hemopexin.P6ptkIa5e.M10;TM-M:SS"IM; 10.24 
4413n; BE218239: Hs.202656; ESTs; none.none: 10.17 

400292; AA250737: Hs.72472; NAME OMITTED ... receptor kinase; pkin8se>Vctiviiuecp,PDZ,ZU5.death; iai7 

452594; AU076405: Hs.29981 ; solute carrier famfly 26 (sulfate transp; xan.urj]enneasB,Sulfale_trBnsp,STAS.HM6 box; 10.12 

429863; M58874; Hs.211587; phospholipase A2. group IVA (cytosolfc, ; C2,PU^B;TM=M;SS=N; 9.87 

413859: AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase kina; SAMJPNT,none; 9.87 

408562; AI436323; Hs.31141; roundabout (axon guWance receptor. Dros; ig.fn3;TM=M;SS=N; 9.86 

428970; BE276891; Hs. 194691; re6noic add induced 3 (RA1G1); metebo; 7tnL3;TM»Y;SS^ 9.72 

438089; W05391; Hs.351546; nudear receptor subfamily 1, group I. m; honTione.rec,zMM,nQne; 9.68 

411089; AA45&4M: Hs.355702; cell (fiviskmcyde 2-like 1 (PITSLAE pr; none,none; 9.53 

450451: AW591528; Hs.202072: ESTs: none.none: 9.53 

456062; A1866286; Hs.71962; ESTs. Weakly dmiter to B36298 protiner, none,none; 9.50 
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4181 13; A1272141: Hs.834B4; SRY (sex d^rmining re^ YVtox 4; KMGLbox.ho(Tieot»x;TM=M;SS»N: 9^ 

41 2791; A1131192: Hs.143199: ESTs, Weatdy similar to S72481 probable ; pkinase,PBO,non8; 9.36 

432435; BE216886; Hs.282070; ESTs; non8,non8: 9.35 

416530; UB2801: Hs.7g361; kaOikrein 6 (neuiosln. zyme); liypsin:TM=M;SS=M; 9.32 

439018; AW300887; Hs.26638; membraie^nning 4Kiom^ subfamily A; none;TM»Y;SS=Nt 9.23 

445537;'AJ245671; Hs.12844; EGF-fite^iomaln. muWpte 6; EGF.MAM;; 9.19 

410407; X66839; Hs.63287; carbonic anhydrase iX cart).anhydrase:TM=M;SS=M: 9.19 

417165; Ra0137; H5.302738; Homo sapiens cQNA: FU21425 fis, done C; Sulfat^tran5p.STAS.HMG_bCK: ai7 

453459: BEO47032; H5.a7789; ESTs; none,nonB; 9.14 

431674; AA098901; H3.301642: G-prote!n coupled receptor. none,GCV.H; 9.05 

418004; U37519; Hs.87S39; aldehyde dehydrogenase 3 famSy, member ; a!dedh;TM=M;SSsM: 9.00 

413278; BE563085; H$.833; interferorhstimuiated protein. IS kOa; ubiquitin;; 8.93 

436954: M740151: Hs.130425; ESTs; none.none; 8.91 

420344; BE463721; Ks.97101; putative 6 proteirnnupled receptor. Melhytbansf 5;TM=Y:SS=M; 8.89 

425397; J040B8; Hs.156346; lopoteomerase PNA) II alpha (170kD); DNAjyraseB.DNA^topoIsolV.HATPasflLc:: &85 

407792; A1077715; Hs.39384; putative secreted Bgand homologws to f; rKme;TM=M:SS=Y; 8.80 

451027; AW519204; Hs.40808; Homo sa^, Similar to RIKEN cDNA 2810. nor;e.none; 8.79 

422809; AK001379; HS.12102B; hypothetic^ prolsln FU10549: IQ;TM=lk/l;SS=N; 8.72 

413385; M34455; H5.&40: indolean^8i)yrroIe 2.3 dknygenase; IDO;TM=M;SS=N; a70 

444784; D124S5; Hs.11951; ectonucleotide pyrophosphatase/^ihosphodl; SomatomedlA_B,Endonudease.Phosphodl8Sl:TM=Y;SSs|yi; 8.69 

421502; AF1 1 1656; Ks.105039; soluts carrier family 34 (sodium phospha; Ribosoma) UZO.Na Pi cotr8n8;TM»Y;SSsN; &67 

437935; AW939591; Hs,5940: mucin 13. epithelial transmembrane; EGF,SEA;TM=Y;SS=M; 8.56 

408692; AL040127; Hs.34074; dipepSdylpeptUase Vi; DPPiVJiJeTm.Peptldase S9,none; 8.55 

414812; X72755: Hs.77387; monokine Induced by gamma Interferon; IL8;TM=M;SS=Y; 8.53 

428187; A1687303; Hs.265529; G protebvcoupled receptor 49; 7tn[Ll.rvone; a49 

448672; A195551 1; Hs.374290; ESTs; Rg_chan,ANF_receptor.SBP_bac_3;TM=Y;SS=M; 8.44 

425775; U25128; Hs.159499: parathyroid hormone receptor 2; 7tm^HRM;TM=Y;SS=M; 8.40 

443426; AF098158; Hs.9329; chromosome 20 open reading frame 1; none;TMsM;SS=N; 8.40 

452093; AA447453; Hs.27860; Homo sapiens mRNA; cONA DKF2p586M0723 (f; 7lm«1.none; 8.33 

407694; AJ278313; Ks.41 143; phospholnositUe-spedfic phosphoUpase ; C2.Pt-PLC-Y.PI>PL&X;T|yNM;SS=K: a23 

409799; 01 1928; Hs.76845; phosphoserfne phosphatase^ike; HydrDlase;TM=M:S8=N; &21 

419508; AW997938; Hs.90786; ATP^nding cassette, sub-family C (CFTR; ABC.tran>^Cjn8mbrane;TIW=Y;SS=M: a20 

424441; X14850; Hs.l 47097; H2A histone family, member X; histone,C8FD.NFYB HMF;; 8.20 

408243; Y00787; Hs.624; Interieukin 8; HLH.PAS,ilJ;TM=M;SS=N; aoo 

415752; 6E314S24; Hs.78776: putative transmembrane protdn; none;TM»Y:SS«N; 7.99 

422608; AW160644; Hs.118695; potassium vdlage^ated channel, subfami; k>n_trans.K.letr8;TM»Y;SSsN; 7.99 

433001; AF217513; Hs.279905; clone HQ0310 PRO0310p1; none;; 7,95 

409649; AA159216; Hs.55505; hypothetical protein FU20442; Yj)hosphatase,DSPc;TIIH4;SS»N; 7.95 

424078; AB006625; Hs.139033: paternally expressed 3; zf-C2H2.KRAB,none: 7.86 

432179: X75208: Hs.2913; EphB3; EPHJbd.fn3.pkinase.SAI^TM»Y;$S-iyi; 7.65 

424581; M62082; Hs.150917: catenh (cadherin-assoctated protein), a; Vincu1in.DNA^rigase ZBD;TM=M;SS=N; 7.84 

420610; AI683183; Hs.99348; distaWess homeo box 5; homeobox;TM=M;SS=N; 7.81 

436856; A1469355; Hs.127310; ESTs; pkinase,rrm;TM=M;SS=N; 7.81 

41 1125; AA151647: Hs.68877; cytochrorae b.245, alpha polypeptide; none;TM=Y:SS=M; 7.80 

407811; AWig0902: H8.40098; cysteine loiotauperfaniily 1, BMPantagon; TGF-beta,OAN:; 7.78 

424399: AI905687: Hs.348419: Aig05687:IL-BT095-1901994)19 6T095 Homo ; none;TM^M;SS=M; 7.65 

418836; AI655499; H5.161712; ESTs; pklnase,AcUvlnjEcp,PDZ.ZU5,dealh; 7^4 

435793; AB037734; Hs,4993; KIAA1313 protein; none;TM=M;S8=N: 7.61 

426201; AW182614; Hs.128499; ESTs; STO.none; 7.59 

447400; AK000322; Hs.18457; hypothetical protein FU20315; zMJ3HC4;TM=Y;SS=M; 7,55 

41 0850; AW362867; Hs.302738; Homo sapiens cDNA: FU21425 fis. done C; Sulfate_lransp,STAS,HMG box; 7.55 

453464; AI884911; Hs.32989; receptor (calcitonin) activity modifying; none;TM=Y;SS=N: 7.54 

417115; AW9S2792; Hs.334612: small nudear ribonudeoprotein polypept; Sm,pklnase;; 7.52 

437897; AA770561; Hs.146170; hypolhefical protein FU22969; d-OHHC.none; 7.38 

443991; NM.002250; Hs. 10082; potasshnn intennedlata^sm^ conductance; CaMBD,SK_channel,ton trans;TM=Y:SS=M; 7^ 

414617; AI339520; Hs.288817; ESTs. Moderately similar to U Chdn N, Mi hexoklnase.hexokinase2;TM=Y;SSs^^; 7.35 

422017; NM_003877; Hs.110776; STAT Induced STAT lnWKtor-2: SH2;; 7.33 

424834; AK001432; Hs.153408; Homo sapiens cONA FU10570 fis, done NT; none,none; 7.30 

409041; A8033025; Hs.50081; Hypothetteal protein. )(P„051860 (KIAA119; none;TNH/l;SS=M; 7.28 

417079; U65590; Hs.ei134; tnterleukin 1 receptor antagonist; ill;; 7.28 

429170; NM-001394: Hs.2359; dual spedfidty phosphatase 4; Rhodanese,DSPc.Y_phosphata$e.Ribosomal S3 N:TM::M;SSbN; 7.28 

418506; AA084248; Hs.372651; Unknown protein for M6C:29643 (tonneriy ; none,none;7.27 

448913; AA194422; Hs.22564; myosin VI; n7n.if^^P,pkina5e.GST_C,Els,SAhl.PW>kBC2jnembr8ne,myosinJ»a^ 

C2H2,PHD,BTB,TFIISAT hOGk,SAM;TM=M;SS=N; 7.26 
409340; BE174629; Hs.321130; hypothettoal protein MGC2771; 

aaj)enneases.pyridoxaljdeC,bronK>domain.PHD.MBOATJ»oKDDT,R3.PI4Jd^^^ 
424317; AI865032; Hs^U; ESTs; non8.pkinase: 7.21 

410361; BE39ie04: Hs.62661; guanytale binding proleto 1. Merfeton-; GBP.6BP_C;Tiyi»Y;SS»M: 7.21 
428450; NM.014791; Hs.184339; K1AA0175 gene produd; KA1.pkin8se;Tli4=M:SS»N; 7.14 
438707; L08239: Hs.5326; amino add system N transporter 2; porcu; ACAT,KABOAT;TfA=Y;SS=M; 7.05 
423011; NM.000683; Ks.123022; adienergic. a!pha-2&. receptor; 7tm_1:TM=Y:SS»M: 7.03 
435021; AA922192: H5.73962: ESTs; EPKJbd.pkfnase,fh3.SAIUlnon8; 7.02 
446163; AA026880; Hs.25252; prdacQn recepton none:NA;NA: 7.01 

447768; X66400; Hs.19520; FXYD domalrK»ntdning ton transport reg; ATP 1G1J>LhLKMTaTiyNM:SS"N: 7.00 

439453; BE264974; Hs.6566; thyroid hormone receptor interactor 13; AAA.ABC_tr3n,CoaE;TM=M:SS=N; a99 

451035: AU0767e5: Hs.430; plasOn 1 (I isofonn); ^8nd.CH.Adaptin_N;; a99 

450581: AF08151% HS.2S195; TGF-beta 4; TGF^ta.TGFbjirapeptkIe;; a95 

424054; AA3345t 1: Hs.26638; membrane^pannlng 4-domalns. subfamily A; none;TM=Y;SS^M: a93 

432519; AI221311: Hs.130704; ESTs. Weakly similar to BCHUIA S-100 pro; none.none; 6.93 

436481; AA379597: Hs.5199; HSPC1 50 protein simDar to ubiqinOrxxm; UQ.con;TM=(^SS=N; a92 

419693; AA133749: Hs.301350; FXYD domaln<onlairing ion transport reg; ATP1 GlJ>Uil.MAT8;TMaY;SS=M; a92 

437139: W73685: HS.11B513: ESTs. Wteakly similar to RTA RAT PROBABli; 7tmJ;THA=Y;SS=M: &87 

416054; NM.002316; Hs.83354; lysyl oxUasd4ik6 2; SRCR.LysyLoxtoase;TM=^4:SS4A; 6.87 
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418203; XM942; Hs.e3758: COC28 protein HnasB i CKSs 6.86 

410467; AF102546: Hs.63931: dachshund (DrosophHa) homolog; 8l(LSno;TM=A/tSS=M: 6.86 

425247; NA^005940; Hs.155324; matrix inetaOoproteinasd 11 {stiDmelysin; hemQpexin,P6p8das8 MtO;; &85 

453064; R40334; Hs.89463; potassium lar^e conductance caWunMcft irane,n(m8; 6.83 

452046; AB018345: Hs.27657; WAA0802 protein; none;TM=M;SS=N; 6.79 

417771; AAS04698; Hs.82547; retinofeedd receptor responder(tazafo; non0.non8; 179 

422293; X94453; Hs.1 14366; pyrroline-S^afaoxylalB synthetase {glut; aIdedhMJnase;TM=M;SS=N; 6.77 

431470; AA832417; Hs.139650; ESTs; non8,lg.pklnase,imiJWCT; 6.7B 

4 18738; AW388633; Hs.6682; solute carrier famBy 7. (callonlc amino; none.none; 6.75 

418751: BE389014: Hs.37254B; phosphoinositide^ldnase. regulalory su; SH2,non8; 6.74 

417886; AA214584; ; ESTs; 5PRY,7tnL3>VNF«fBC8ptor^e; 6.72 

412926; AI879076: Hs.7S061; macrophage myristoytated alai^neKich C ; MARCKS;; 6.70 

437960; At669586; Hs.369312; ESTs; non8.none: 168 

428953; M306610: Hs.348183; tumor necrosis factor receptor supeifanti; 60s rHxisomal.RibosomaLL10.TNFR.c6,OEAO:: 6.66 
444006; BE395085; Hs.334762; type I transmembrane protein Fn14: ldUiBCepLa.PKD,MHC l:TM=M;SS=y; 6.65 
41304(^AA193338; Hs.1 2321; sodHim calcium exchanger; N^CSLEx;TiyNY;8S^ 6.64 
449656: AA002008; Hs.1B8533; ESTs; RPSKnone; 6.64 

447495; AW401864; Hs.18720; progiammed cell death 8 (apoptosls^uc; pyrjedox;TliNytSS=l4- 6.62 

446083; A1720140; Hs.151079: ESTs; ISI^Channel.none; 6.61 

424762; AL119442; Hs.1836&4: eukaryotic Iranslalton initiation factor; none,none: 660 

421554; AW137676; Hs.97775; ESTs; none.none; 6,59 

418478; U38945: Hs,1174; cycBn^lependent itinase inhibitor 2A (me; anlg; 6.S5 

424905; NM-002497: Hs.153704; NIMA (never in mitosis gene e)-(elated k; pldnase;TM=M;SS»N: &54 

448730; AB032983: i^s.21894; i<IAA1157 protein; PP2C;TM=M;SS»N; 6.64 

433577; AW007080; Hs.284192; ESTs; none,none; 6.63 

422627; BE336857: Hs. 11 8787; transforming growth tactor, bet3*duced; Fascidin^VBC IranABC membrane.GTP EFTU:TM=I\/I:SS=M: 6.53 
442133; AW874138;Hs.129017; ESTs; type la transmembrane protein: IJ?I^LRRNT,LRRCT:TIWI=Y:5S=M; 6.52 "* 
430259; BE550182; H5.375142; Ral6EF-lil» protein 3, mouse homolog: 1h3,RA.RasGEF:T|yNl\4:SS»M; 6.52 
434263; N34895; Hs.7gi87; ESTs; lg.none: &49 

418322; AA264166; Hs.84113; cycflnHtopendent Idnase inhB}{tor 3 (CDK; Y phosphatase.DSPc;TM^IiA;SS=N; 6i48 
419942; U25138; Hs.93841; potasshim large conductance caldunvactl; CaKfi;T|yt»Y;SS»l\4; 6.47 
421064; AI245432: H5.101382; turror necrosis factor» alpha^iduced pro; none;TiWNM;SS=N: 6.47 
432636; AA340864; H8.278562: daudin 7; PMP2^C3audin:TI*=Y;SS=M: 6.45 

431685; AW296135; Hs.267659; vav 3 oncogene; CH,OAG_PE-blnd,PH,RhoGEF,SH2.SH3.0C1:TM=M;SS=N: 6.44 
428832; AA578229; Hs.324239; ESTs. Moderately simflar to ZN91_HUMAN Z; Osteopontin^e; 6.39 
438775; AA731 111; Hs.372225: ESTs; non8.none; 6.39 

424343; AW956360; Hs.4748; adenyi^ cyclase acthrating polypeptide; 7tm.2.HRIi<,none; 637 
421071;AI311238: Hs.104475; ESTs. Weddy similar to CGHU IE cdlagen ; none;Tiyi=Y;SS=M; 6.37 
438209; AL120659; Hs.61 11; aryt-hydrocaiton receptor nudear transl; HU<,PAS,IL8;TM=M;SS=N; 6.37 
438993; AA828995; ; gb:od77b08.s1 Na_(X3AP^Ov2 Homo sapiens ; EGF,metaithio,integrin B,PSI^ne: 6.27 
408400; ; ; Italtil<reln 8 (neuropsin/ovasin) (KLK8); lryp5in;TM=M;SS*M: 627 
429556; AW139399; Hs.314807; ESTs; none;TM=M;SS=N; 6.26 

409269: AA576953; Hs,22972; steroid 5 alpha^uctase 2-D«; H5AR g; Steroid dh:TM=Y;S&=M; 6.25 

435732; AF229178; Hs.123136; leudne rich repeat and death domain con; none.none; 6.24 

439668; AI091277; Hs.302634; frizzled (Drosophila) homdog 8; Fffezled,F2.7lmJ.toxln J:TM=Y:8S=M; 124 

41 2276; BE262621; Hs.73798; macrophage migration Inhibitory factor (; IUIIF,8ugarJrJione: 6.23 

436961; AW375974; Hs.156704; ESTs; none.none; 6.23 

434808; AF155108; Hs.256150; NY4^N-41 antigen; none;TH*=M;SS=N; 6.22 

440008; AK000517; Hs.6844; mP2 protein; PYRIN-ContalnIng APAFHI; AAA.N^AR(;PAAD OAPIN;NA:NA; 6i0 

416224; NWL002902; Hs.79088; reticulocalbln 2. EF-hand calcium bindin: eftiand:; 6.20 

449027; AJ271216; Hs.22880; (fipepfidytpeptidase III; PeptIdase,M49.EGFJg.Meureguiin:TM=M;SS=N: 6.19 

418318; U47732; Hs.84072; transmembrane 4 supeifamily member 3; transmembrane4;TM=Y;SSslUI: 6.19 

452551; L27071; Hs.29877; TXK tyrosine Idnase; B8adi.WD40.SH2.SH3.p!dnase:TM=li4;SSsN: 6.14 

418969; W33191; Hs.28907; hypothetical protein FU20258; SH3:TM=M;SSsN; 6.12 

445462; AA378776; Hs.288649; hypothetical protein MGC3077; none;; 111 

456534; X91195; Hs.100623; phosphdipaseC, bela3.ndghbor pseudo; UMPDZ.pWnase::6.11 

449700; L02867; Hs.78358; paraneoplastic antigen; non8;TM=M;SS=N; 110 

428479: Y00272: Hs.334562; cell division cyde 2. G1 to S and G2 to; pWnaso.lCEjl 0,ICE_p20;TM=M:SS=M: 110 

416209; AA23677e; H3.79078; MAD2 (mitollc arrest deiident, yeast, h; HORMA;TM=M:SS=N; 6.09 

433159; AB035898; K8.150S67; kinesin^ protein 2; bZIPJdnesIn:; 108 

432432; AA541323; H8.115B31; ESTs; lg,Sema.PSI.none: 6.07 

409619; AK001015; Hs.55220; BCU-assodated alhanogene 2; BAG;TM=M;SS=N; 104 

412723; AA648459; Hs.3359Sl; hypothetical protein AF301222; none:TM=M;SS=N; 103 

424273; W40460; Hs.144442; phosphdipase A2, group X; phosnprrM=IUI;SS=Y: 6.03 

409430; R21M5: Hs^6735: spHdng factor, erginlnefcerine-rldi 5; OSPcRhodanese.none; 6.00 

421143; AB024536; H8.102171; Immunoglobufin superfamily containing le; lg,UW,tJ^.LRRCT;TM=M:SS=M: 100 

428677; AI657119; Hs.351582; troponin I. cardiac; none;TM=M;SS=N; 100 

450096; W27249: HS.B109; hypothetical protein FU21080; SET;rf^ND:TM=M:SS=N; 100 

410422: AL042014; Hs.63348; Homo sapiens, done M6C:15203, mRNA. com; C1q.Cdlagen:: 5.99 

419972:AL041465;H8.182982;golgl^67:none.none;5.99 

421251; Z28913; Hs.102948; erdgma (UM domain protein); UhiPOZ;: 197 

403362; ; ; NM.001615*;Homo sapiens acOn, gamma 2, ; adln;; 5.95 

451541; BE279383; Hs.26557; ptalcophin 3; Armadillo.seg;TM=M;SS=N: 5.95 

420253; AI656055; Hs.96200; neighbor of A-klnase anchoring protein 9; nonB:NA;NA; 193 

421506; BE3Q2796; Hs.105097; thymidine kinase 1. sduWe; TK:TM=M;SS=N; 5.93 

450447: AF212223; Hs.25O10; hypothetfcal protein P15-2; NTF2;TM=M;SS=N: 192 

450747; AI064821; Hs.129953; ESTs. Highly similar to 181 8357A EWS gen; rmuf.RanBP.GAS2; 192 

415211: R64m<^Hs.155986; DEAO/H (Asp^u-Afe^pWIs) box pdypep; DEAOJielkase.CmNdr.CysJtnolTIUvwa.vwc,vwd,IQ,RIIa.8^ 

beta;DUF139.TPR.DSPc.tspJ.RIbosomaI_S21^;TM=*l;SS=N;5.91 w «r- 

429910; NM_000867; Hs.2507; Wiydroxytryptarrtne (serotonin) receptor; 7tm_1;TNt=Y;SS=N; 190 
447131; N^L004585; Hs,17466; retinofc add receptor responder (tazaro; none:TM=Y;SSsN: 5.89 
437952; 063209; Hs.5944; solule earner famBy 1 1 (protoihcoupled: none;TM»Y:SS»lt 169 
446291: BE3977S3; Hs.14623: Interferon. gamm&Mucaile protein 30; QLT:TMs|Ul;SS-Y: 169 
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453102: NM-007197: Hs.31664; frizzled (OrosophBa) homotog 10; Fz,Frizzted,7lm_2;TM=Y;SS=M; 5.87 

426761; AI01570d; Hs.172089; PORIMIN Prooncosis receptor inducing me: nona:TM»Y:SS^; 

425367; 6E2711B8: Hs. 155975; protein tyrosine phosphatase, recejitor t; none:TM4itSS»Y; 5.85 

426108; AA622037; Hs.166468; pfogrammed cell death 5; OUF122:TM=M:SSsN: 5.84 

450502; TQ80S5; Hs.1 18252; ESTs; ton„tr8nsJon.trans; 5.84 

442652; AIQ05163; Hs.201378; Homo sa^xens cDNA FU40427 fis; non8;TM=M;SS=N; 5.83 

424917; A!638208; Hs.9690t; hypothetica) protetrt FU23049: nona;TM=M;SS=N; 5.83 

448569: BE382657: Hs.21488: signal transducer and acQvator oflrans: SH2.STAT.STAT.bInd,STATj)rot;TM»IM:SS«N; 5^2 
422616: BE30O33O; Hs.1 18725; selenophosphata synthetase 2; AlRS^RS.ClTM^SS-N: 5.62 

445133: AW157646; Ks.198669; ESTs; eih3nd^pectrin.GAS^SH3.Plecl}n.RA,Xylose.lsonvFIi0.1iZIPJi^^ 5.79 

426215; AW983419; Hs.155223; stamtiocatdn 2; Stanniocaldn;; 5.78 

414482: S57498; H5.76252: endolheOn receptor type A; 7Im_1:TMNY:SS=M: &75 

414809; AI434899; Hs.77356: transferrin receptor (p9a C071}: PA;Tlk«'Y;8S>N; 5.74 

452683: AI08957S: Hs.374574; progesterone mentrane tAiding protebi; tiome^>ox.none: 5.72 

432201; AI538613; Hs.298241; Transmembrane protease, serine 3; ldLrecepLAtryp6in;TM'Y;SSsM; 5.72 

4^345; R11141: H9.199&95; hypotheiica) protein; )Ctetra,SAM: 5.72 

449458; AI805078; Hs.208261; ESTs; FrizzIed.Fz.none; 5.72 

418526; BE019020; Hs.85838; solute carrier family 16 (monocariXKyOc; none;TMsY;SS^ 5.71 
416848; AY820961; Ks.ig3465; ESTs; PDZ,pklnase,none; 5.70 

426227; US7058; Hs.1 54299; Human proteinase activated receptDr-2 mR; 7tm„1;TM=Y;SS=R^ 5.69 

411190; AA306342; Ks.69171; protein kinase O^Dce 2; pkin8se,pkinase_C,HR1;TlVNM;SS=N; 5.69 

411263; BE297802; Hs.69360; kinesin-Hke 6 (mttotic centromer&-assoc; ldnesin;TM=M;SS=N; 5.69 

445136; AI348014; Ks.143949: ESTs, Wealdy similar to Achaete-Scute ho; iorutrans,{oa.trans; 5.69 

409223; AA312572; Hs.362852 phospholnosltide^lunase. regulatory su; SH2.SH3.RhoGAP.none; 5.67 

430016; NM_004736; Hs.227656; xenotropic and polyiiopic retrovims rec; SPX,EXS;TM=Y;SS=N; 5.66 

429638;AI916662;Hs.211S77;kfnectin1 (kinesin receptor); bZiP.Tropomyosin.spectrin.LBP BPLCETP,B56.M;TMN>Y;SS"lill;5.65 

450334; AF035959; Hs.24879; phosphstidic acid phosphatase type 2C; PAP2:Tiyit»Y;SSsM: 5.64 

453950; AA156998; Ks.348037; eukaryotic translaBon Initiation factor none;; 5.64 

425889; M57414; Hs.161305; tachykinin receptor 2; 7tnLl;TM=Y;SS=IW; 6.64 

432527; AW975028; Hs.102754: ESTs; none,none; 5.64 

441384; AA447849: Hs.288660; retinoic acid induced 3; 7tnv.3.none; 5.63 

419080; AW150835; Hs.18878; hypothetical protein FU21620; 20G-Fell Oxy;TM=M;SS=N; 5.63 

447217; BE4657M; Hs.17778; neuropnin 2; CUB.MAM.F5.F8_type„C;TM=M;SS=M; 5.61 

440422; AW452696; Ks.130760; myosin phosphatase, target subgnit 2; BTB,Kelch,ank,none; 5.58 

431341; AA30721 1; Hs.251531; proteasoma (pnsome, macropain) subunit.; piot8a6ome:TM=M;SS=N; &68 

432805: X94630: Hs.3107: CD97 anOgen; 7tnL2,EGF.GPS.FecCO;TM>Y;SS»llA; S.S5 

449230; BE613348; Hs.356392; melanoma cell adhesion molectile; Ig,isodh.RibosomalJ.6,F-box:TM=Y:SS=M; 5.55 

441607; NM.005010: Hs.79t2; neuronal cell adheston molecule; WD40,fh3.]g;TM=lul:SS=N; 5.54 

400303; AA24275B; H3.79136; LIV-1 protein, estrogen regulated; none.none; 5.54 

434826; AF15S661: Hs.22265: pynivato dehydrogenase phosphatase; PP2C,none; 5.53 

404210; ; ; NM.005936:Hbmo sapiens myetoid^mphoid ; FHA^RADIUTM^KtSSsN; 5.53 

408051; AI623351; Ks.172148; ESTs; PH.RhoGAP.none: 5.53 

436726; AA324975; Hs.198689; ESTs. Weakly similar to T00079hypotheti: eihand,8pactrin.GAS2.SH3.PtoctIn.RMytoseJsom.RiD^PJiQpofny^ 

U,M.ldh^C.CH>UP3;TM=M;SS=N; 5.53 
416084; LIC^I; Hs.79006; deoxythymklylate kinase (Ihymidytate kin; none,none; 5.52 
428667; AI375550; Hs.346868; nucleolar protein p40; homolog of yeast ; none.none; 5.51 
433907; AW298107; Hs.152686: ESTs; Amiadil]o_8eg,none: 5.50 
442821; 6E391929; Hs.8752: transmembrane protein 4; none;; 5.50 
422282; AF019225; Hs.1 14309; apol^protdn U Mot^.ExbB;TVNY;SS^M; 5.49 
439820; A1360204; Hs.283853; Homo sapens mRNA hill length insert cDN; none.none; 5.49 
428771; AB028992; Hs.193143; KIAA1059 protein; C2.PI-PIjC.Y,PI-PLC-X;TM=M;SS=N; 5.48 
452256; AK000933; H3.28661; Homo sapiens cONA FU10071 lis, done HE: GDI.7tnLl.none: 5.48 
442013; AA506476; Hs.375009; Human DNA sequence from done RP11453C1; none.none: 5.48 
408056; AA312329; H3.42331; ephrin-A4; Ephrtn;Tli/t=M;S&4A; 5.47 
422765; AW409701; Hs.1578; bacdoviral lAP repeaUontaining 5 (sun 8IR;TM=M;SS^; 5.47 
420297; AI628272; Hs.1 28757; ESTs, Weakly similar to ALUI.HUMAN ALU S; pkln8se.TUDOR.none; 6.47 
428385; AF112213: Hs.164062; putative RabS-intoracting protein; SH2.SH3n 5.46 

424517; AI539443; Hs.137447; Homo sapiens cDNA FU12169 lia. done MA; SH2,STAT.STATJbind.STAT.proLnone; 5.45 

441550: F13386; H3.7888; v-erb^ avian erythroWaslic leukemia vl; pkInase.Recep_Ldomaln.FuilivJike,YlP,none; 5.44 

414883; AA926960; Hs.348669; COC28 protehi kinase 1; CKS;; 5.43 

450402; BE218027; Hs.89969; ESTs; SH3.none; 5.42 

428484; AF104032; Hs.184601; solute canier family 7 (cattonic amino ; 

aaj)enneases.pyridQxaLdeC.bromodom^.PHD,MBD.ATJiook.DDT.P13_Pi4JdnasB^AT.FAT 
430696: AA531276; Hs.59509; ESTs; pkinase,PP2C.none; 542 

412350; Ai6593Q6; Hs.73828: protein tyrosine phosphatase. noiKecept: Yj)h08phalase.6and.41,PDZ;TIUNM;SS^ 5.42 

444783; AK001468; Hs.62160: anflUn (Drosophila Scraps homdog}, act; PH,none; 5.41 

448379: AI097463: Hs.21035; KIAA1130 prot^ none.Zip: 5l41 

410082: AA081594; HS.15B311: MusasW (Dtosophila) homtitog I; frm(Tiyi=M:SS:=N: 5.41 

41 1817; BE302900; Hs.72241; n^togen-acOvated protein kinase kinase : pkIn8se;TM=M;SS=M: 5.40 

445413; AA151342; Hs.12677; CGI-147 protein; UPF0099;TM=M;SS=H^; 5.39 

451863; AL1 20634; Hs.331803; ATPase. Ga transporting, plasma membrane; cpn60_TCP1.E1-£^TPase,Caik)nJ^TPase CCatlonJ^TPase N,Hydrolase;: 5.38 

442875; BE623003: Ks.23625; Homo sapiens done TCCCTA00142 mRKA sequ; K_letra.DUF51.nGne; 5.38 

439963; AW247529; Hs.6793; platelet^ih^g factor acetylhydrota; PAF^Ib,Lipase.GDSL;TM=M:SS=N; 5.38 

450825; AG00S954; Hs.25527; tight junction protein 3 (zona ocdudens; PDZ,Guanylate.kin;; 5.34 

441031; All 10684; Hs.7645: fibrinogen. B beta polypeptide; fibrinogen C,&aipha,ari;TM=M;SS=M: 5.33 

406369; R38438; Hs.1 18747; SLC15A2 Sohrte carrier family 15 (HWp^; PTR2;TM=Y:SS=N; 5.33 

435391; AA704588; Hs.58934: ESTs; PIP5K.none: 5.33 

411779; AA292811; Ks.72050; non-metastatic cells 5, protein expresse; NDK;; 5.33 

422170; A]791949; Hs.112432; anU-fAjilerian hormone; TGF-beta;: 5.32 

447350; Ai375572; Hs.172634; v-erb^ avian erythroblasSc teukemia vi; pkinasa,Recep.L^domai(i.Furiivlike.YLP.none: 5.32 
449964; AW001741; Hs.24243; hypothetical protein FU10706: pkinase:TM=M;S&=N: 5.31 
426427; M86699; HS.169&40; TTK protein kinase; pldnase;; 5.30 
430407; H23551; Hs.30974; ESTs; pktoasePBD.none; 5.29 
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416847; L4382t;HsA)261: enhancer of filameniaBon 1 {cas^ka do; SH3;TM=M;SS=M; &27 

425308: M97639: Hs.155565; receptor tyrosine Mnas&ae orphan rec; igJoing)e,pkinffi8.F2:TM=Y;SSsM; 5.27 

424595; AB020639; Hs.151017; esfrogeiMelalBd receptor gamma; hormoneLfBC2M>«;TM=M;SS=N: 5.27 

428013; AF151020; Hs.181444; hypothetical protein; none;TM»Y;SS=M; &26 

447384; AI377221; Hs.40528; ESTs; SH3,Sorb.nqne; 5.26 

441824; AB007871; Hs.7977; KIM0411 gene product: SH3.RhoGAP;TM=M;SS=N; 5.26 

438493; AI130740; H5.6241; phosphdnosltUe-SJcinase, regulatoiy su; SH2.SH3.RhoGAP;TM=M;SS=N; 5.26 

428579; NM_005756; Hs.184942; G piDtehvcoupled receptor 64; 7tnL2.GPS:TM=Y;SS=M; 5.25 

414359; M62194; Hs.75929; cadherin 1 1. type 2. OBodherin {osteob; cadher1n.C8dhertn C term;TM=Y;SSsM; 5.25 

426440; BE382756; Hs.169902; solule carrier family 2 (fadiitated glu; sugar tr;TM=Y;SS=M;l24 

427157; U51166; Hs.173824: thymlne-DNAglycosylase; UDG;TM=M;SS=N; 5.24 

423685; BE350494; Hs.49753: uved autoanBgen with colled coQ domai; 8nk«l)ZIP,M.OUF164>UP3;; 5.23 

452721; AJ269529; Hs.301871; solute canier fairaly 37 (gtyceroi^pho; M0RN,8UgarJn*msY;SS»M; 5.23 

427747; AW411425; Hs.180655; serlnerthreonine kinase 12; pkinase;TM=M;SS=N; &23 

417821; BE245149; Hs.82643; protein tyrosine idnase 9; cofinojAOFr. 5.23 

412507; L36645; Hs.73964; EphA4; fn3.pWnase,SAM.EPHJ)d:TM=Y;SS=M; 5.23 

426770; AI948618; Hs.150178; ESTs; SuIfatB.transp,STAS;TM3Y;SS»N; 5.23 

422583; AA410506; Hs.27973; KiAA0874 protein; anl(.&dpha;T(M»|yi;SS>iN: S.22 

414366; W70171 ; H3.75939; uridine monophosphate Unase; PRK.CoaE;; 5.22 

448093; AWg77382; Ks.15e98; 2.4^enoyt CoA reductase 2, peroxisome; adhjBhortn 5.21 

443646; AI085198;Hs.1 64226; Thrombospondin1;EGF.lsp 1.vwcTSPN.tsp 3,none;S.18 

457916; BE08S271; Hs.6834; ring finger protein 3; pkinas8,none; 5.18 

433933; AI754389; Hs.355397: Homo sapiens done TCCCtA00164 mRNAsequ; none;NA;NA; 5.18 

436469; AK001455; Hs.5198; Down syndrome critical regton gene 2; none;; 5.17 

433662; W071 62; Hs.1 50826; RAB25 RAB25. member RAS oncogene famiy, r83,ABC tranMTM=IWI;SSsM: 5.17 

45051 1; R07423; Hs.85092; thyroid hormone receptor hiteractor 11; Myosin ld.E(y; 5.16 

409132; AJ224538; Hs.50732; protein kinase, Ali^P-acfivaled, beta 2 no; none;TM=M;SS=N; 5.15 

454438; AA224053; H8.17240S; cefl dh^ cyda 27; SPRY,7tm 3,ANFjecepton 5.14 

413869; NM.000878; H8.75596;lnterieuldn 2 receptor, beta; none;TM=Y;SS=l\4; 5.14 

422051; AW327546; Hs.11 1024; solute carrier family 25 (milochondrial ; mito,C3rr,TM=M;SS=N; 5.14 

415474; NM_014252; H3.78457; solute carrier family 25 (mitochondrial ; mito caiT;TM=IA-SS=N; 5.14 

41 1704; AI499220; Hs.71573; hypothelical protein FU10074; pWnase;TW=M:SS=N: 5.13 

422278; AF072873; Hs.1 14218; frizzled (Drosophila) homolog 6; Fz.Fri22ted.7tnL2;TM=Y;SS=M; 5.13 

454128; AL031259; Hs,36790O; programmed ceQ death 2; zf^MYND;TM=M;SS=N; 5.13 

431322; AW970622; H5.376626; gb:EST382704 MAGE resequences, MAGK Homo; none,none; 5.13 

444754; T83911; Hs.1 1881; transmembrane 4 superfamlly member 4; none;TM=Y;SS=M; 5.12 

422867; L32137; Ks.1564; cartilage oTigomeric matrix protein {pse; tsp 3,EGFh &12 

416498; 113363^ Hs.79351; potassium channel, subfamily K. member 1; Ion trans;TM=Y;SS=M; ai 1 

438494; AA720997; H5.128295; ESTs; none.CAP_GLY,HCX)3jK)transp.GI^JiydrD 63.PH; 5.11 

419833: AA251131; Hs.220597; Homo sapiens tryptophanyl-lRNA synthetas; WHEP-fRS,lRNA-8ynL1bJione; 5.10 

453387; AI990741: Hs.252809; ESTs; Na_Ca_Ex.none; 5.07 

413076: U10S64; Hs.75188: weel (S. pombe) homotog; pklnase;Tl\4=M;SS=N; 5.07 

456906; AF1 17646; Hs.1 56637; Cas-Br-M (murine) ectropic retiovira! tr. zf^HC4.CbLN,CbLN2,Cbl_N3;TM=M;SS=N; 5.07 
438746: AI8B5B15; K5.1 84727; Human melanoma-associated antigen p97 (m; lransfemn,Gu8nylalB kln,P0Z,SH3; 5.07 
448520; AB002367; Hs.21355; douWecortin and CaM Mnase^ke 1; pWn8se.DCX;TM=M;SS=N: 5.06" 
413745; AW247252; Hs.75514: nucleoside phosphorylase; MlapJ»NP;; 5.06 

407235; D20569; H5.169407; 8AC2 (suppressor of actin mutalions 2, y; none,Ribosonial S13.GalactosyLT.Zip.adh 8hort.zK;3HC4; 5.06 

421369; NM-005089: Hs.171909: U2smafl nuclear ribonudeoproteln auxil; rmul-(XCHIecfin„c.intBgrin_B;TM=M;SS«4^; 5.06 

412170; D16532; Hs.73729; very tow density lipoprotein receptor, ldljecepLa.ldljecepLb,E6F;TM=M:SS=M; 5.06 

442599; AF078037: Hs.324051: Ret/^essodatBd Inhibitor: SH3,ari)c;TM«iyi;SS»N: 5.05 

421109; L32832; Hs.101842; AT-hindlng transcrfpSon factor 1; HMG14Ll74iomeobo>c,zKaH2;TM=M;SS=M; 5.05 

453880; AI803166; Hs.135121; ESTs, Weakly similar to 138022 hypotheB; HSP7anone: 5.05 

431512; BE270734; Hs.2795; lactate dehydrogenase A; Wh.ldh^C;SH3.pklnaBe.UBA;TM=M;SS=N: 6.05 

435411; AW444619: Hs.138211; ESTs; none.pkInase; 5.04 

419088; AIS38323; Hs.367688: Integrbi. beta 8; IntegrfiuB^one: 5.04 

425003; AF1 19046; Hs.154149; apurinicfepyrinWinlc endonuclease(APEX; Troponhi.Exo endoj)ho3,IQ;TM=M;SS=N; 5.04 

428376; AF1 19665; Hs.l8401 1; pyrophosphatase (Inorgante); PyrDphosphatasB;TM=M;SS=N; 5.03 

413073; AL038165; Hs.75187; translocase of outer mitochondrial membn MAS20,2fA20,VPS9;TM=A<l;SS=M; 5.03 

436415; BE265254; HS.34325B; proTiferafion-associated 2G4. 38kD; PBptidaseJtt4,Furin-Gke.pWnase.RBcep l^domaIn,efhand; 5.01 

449674; AW444937; H8.233482; ESTs; C2.PI-PLC.YWC-X;ione; 5.01 

445333; BE537641; Hs.44278; hypothetical protein FU12538 sintiar to; ra3,arf.TK:; 5.01 

412133; U83460; Hs.104557; solute carrier family 31 (copper transpo; none;TM=Y;SS=N; 5.01 

446488; AB037782; Hs.15119; KIAA1361 protein; pkinase;; 5.00 

449474; AA019344; Hs.2055; ubiqultin-actlvating enzyme El (A1S9T an; ThiF,UBACT,pkinase,UCH-2.UCH.l.fTni,z|.(:2HZzH^BP.G4)ateh' &00 

416365; U15131; Hs.79265; suppresston of lumorigenicity 5; DENN,dDENN.uDENN;TM=iyi:SS=N; 5.00 ' 

421351; AU076667; Hs.1 03755; receptor-Interacting serine-threonine W; CARD,pkIn3se;TM=M;SS=N; 4.99 

413219; AA878200; Hs.1 18727; Homo sapiens cONA FU13692 fis, done PL; HIii,deaIh,TNFf^c6,Acyl-CoAJiydro; 4 98 

442007; AA301116; Hs.142838; nudedar phosphoprotein Nopp34; mn,IRK," 495 

441085; AW136551; Hs.l81245; Homo sapiens cDNAFU12532 lis. done NT; none,none; 4.95 

426310; NM_000909; Hs.169266; neuropeptide Y receptor Yl; 7tm_1;TM=Y;SS=M; 4,95 

457718; F18572; Hs.22978; ESTs, Weakly similar to ALU4 HUMAN ALUS; pldnase,pWnase; 4.94 

408805; H69912: Hs.48269: vacdnia related kinase 1; pkinase;TM=M;SS=N; 4.94 

427541 ; AI798983; Hs.375835; solute carrier family 35 (CM^slaSc ad; none,none; 4.94 

452792; AB037765; H3.30652: K1AA1344 protein; thtofed:TM=M;SS=M; 4.93 

430713; AA351647;Hs.2642;eukBfyottotransiafionetongallonfactonGTP.EFTU,6TP^EFTU D3,GTP EFTUJ)2:: 4.93 
444838; AV651680; Hs.20a558; ESTs; lntegrin_A,FG<3AP,none; 4.93 

440516; S42303; Hs.161; cadherin 2, type 1, Nodherin (neuronal: HNH,cadhefift,Cadherin.C.tenn;TMsM;SS=N; 4.92 
421302; T34462: Hs.l03291; neuritln: nono;TM=M;SS=Y; 4.91 

452291; AF015592; H$.28853: C0C7 (ceil dhrision cyde 7, S. cerevisi; pUnase:TM^M:S&:N: 4.91 
408857; AA782601; H3.378649; ESTs; B56.none; 4.91 

439750; AU59053;Hs.57664; Homo sapiens mRNAfuD length Insert cON;IMPOH.ClMPDH N,CBS,lntegrin B.Ridn.BJectin-4 91 
42146^ AF016495: Hs.104624; aquaporin 9; MIP:TM=Y;SS=IW; 4.90 • ^ ^ ^ ^ . • 

424503; NM^002205; Hs.149609; Inlegrfn, alpha 5 (fifaroneclln receptor,; lntegrtnAFGW;TM=Y;SS=N: 4.89 
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43K54: AA381553: Hs.198253; ma{or histocompatniQity oomplex, dass ; Ij^MHCJIjlpha.none; 4^ 

434521; NM»002267;Hs.3885;lcaryopherfn alpha 3 fm^ - . . ^^-w.s.oo 

414821; M63835; Hs.77424; Fc fragment of IgG, high affinity la. re; lg;TM=Y;SS=M; 4.88 

4241 18; BE269041; Hs.140452: caigo selection protein (mannose 6 phosp; penSpin;; 4.88 

410639; BE269047; H8.65234; hypoth^ protein FU20596; OEAD^efica5e.C.PRlCAlP3;TW=M;SS=N; 4.87 

417089; H52280: Hs.18612: Homo sapiens cDNA: FU21909 lis. done H; voltage aCCBS.none: 4.86 

42503; AW137635; Hs.44238; ESTs. WeaMy sbitilar to S65657 a^ha-IC-; RiosphodIest,Soma!omedifLB.ERdDntide8se,none: 4.86 

417666; A1345001; Hs.82380; menage a trote 1 (CAK assembly factor); zM3HC4;TM=M;SS=N; 4.86 

453B64; AW021407: Hs.21068; hypothetical protein; noo8.none; 4.85 

453082; H18835j Hs.31608; hypothetica! protein RJ20041; lon.trans;TM=Y:SS=M; 4.85 

413407; AI356293; Hs.75339; Inotitol polyphosphate phosphatase^ ; SHZSAM.Exo endojjhosr, 4.85 

417866; AW067903; Hs.82772; collagen, type XI. alpha 1; Collagen.COtJlJSPN,laminln G.OorA:; 4^ 

435652; N32388: Hs.334370; uncharacterized hypothalamus protein HBE; ncne;TM=M:SS=N; 4.84 

419355; AA428520; Hs.90061; progesterone Wniflng prot^; hemeJ;TM=Y;SS=N^ 4.83 

431441; U81961; Hs.2794; sodium channel, nonvollage^aled 1 alpha; ASC;TM=Y;SS=N; 4.83 

408983; NM_000492; H3.663; cysOc fibrosb Iransmemljrane oonductanc; ABC_lran>\BCjnembrane,PRK,Bac export_3;TMaY:SS=N- 483 

425465; 118964; Hs.1904; protein kinase C lota; pWnasB,DAGJ»E-bind,pkinase COPR;TM=M:SS»N; 4.82 " 

435232; NM.001262; H3.4854; cycOn^fependenl kinase Inhibitor 2C (p1: ank;Tft(SM;SS=M; 4.81 

424490; AJ278016: Hs.555€5; ankyrin repeal domain 3; ank,pklnase;TM=M;SS=N; 4.81 

425743; BE396495; Hs.159428; BCL2-associated X protein; Bct.2;TM=Y;SS=N: 4.81 

453354; W55946: Hs.234863; Homo sapiens cDNA FU1 2082 lis, done HE; lransmefflbf8ne4.none; 4.81 

450883; NWL001348; Hs.25619; dealh-assodaled protein kinase 3; pkinase;TlyNM;SS=N; 4.79 

429736; AF125304: Hs.212680; himor necrosis factor receptor superfant TNFIl.c6:TM=M;SS=M; 4.79 

431183; N^^006855; Hs.2S0696; KDEL (Lys^p^Leo) endopIasnA: relic; ERJumen receptTM=M;SS=M: 4.79 

409960; BE261944; Hs.355264; hexoWnase 1; none.none; 4.78 

422795; AB033109; Hs.375610; K1AA1283 protein: 7tnLl.te2al>\2M.A2lyLN:TIW=Y;SSaM: 4.78 
423519; T48691; Hs.249159; adrenergic. 8lpha.2A., roceplon 7tni.1,7li!U:TM=Y:SS«M; 4.78 
429305; AF095727; Hs.287832; myeTin protein zanMike 1; i9.lransmembrane4:T|yNY:SS=M: 4.78 
427700; AA262294; Hs. 180383; dual spedflclly phosphatase 6; Rhodanese,DSPc;T^4=M;SS=N; 4 77 
447343; AA256641; Hs.236a94; ESTs, Highly sintilar to S02392 alpha.2-ni; none,none; 4.76 

407722: BE252241:Hl38041:pyikloxal(pyrldoxln8, vitamin B6)kInas;ptkB^ r-™. n , 

438330; AW450572: Hs.257316: ESTs; pkinase,zfLC4,ERM.CNHnone: 475 

410687; U24389; Hs.65435; lysyl ox!dasB-like 1; LysyLoxidase;; 4.75 

411165; Nft^000169; Hs.69089; galactosWase, alpha; Mellbiase;; 4.75 

425548; AA890023: Hs.1906; prolactin recepton fn3:TM=Y;SS=M; 4.73 

434158; T86534: Hs.14372; ESTs; 8denyl8lekinase.nQne: 4.73 

459035; AW291 109; Hs.332S63; ESTs, WeaWy similar to T31 61 1 hypotheU; none,SH3,myoslnhead IQ- 4 73 
409012; AL117435; Hs.49725; OKFZP434I216 protein; PH.RhoGEF;TM=M:SS=M; 4.73 
434503; T96231; Hs.17762; ESTs; SH3.Sorb,none: 4.73 

446342: BE298665; Hs. 14846; solute carrier family 7 (cationic amino : none;TM=M;SS=N; 4.72 
427418; AA402587: Hs.356667; LAT1-3TM protdn; none,none; 4.71 

449433; AI672096; Hs.9012; ESTs. Weakly similar to S26650 DNA^ndl; Frlzz1ed,F2.Frizzled,Fj; 4.71 
41891 0; Z25821 ; Hs.89466; Homo sapiens, Similar to dodecenoyW^oen; ECH;; 4.70 

414907; X90725: Hs.77697; pdo (OrosophlaHike kinase; i^omal_l^7aB,pWnase,POLO_box.tRNA^Ub.dynamlivdynamin 2,GED.b2!K 4 70 

442199; BE277633: Hs.372542; eloposide^nduced mRNA; none;TM=Y;$S=M; 4.69 wnamin^ui:u.o£ir.M;. ^J^, 

418870; AF147204; Hs.89414: chemoWne (C-X-C moBf), rBceptor4 (fus; 7tjrLl,7tm^;TMaY;SS«M; 4.69 

453922: AF053306; Hs.35708; budding uninhibited by benzimidazotes 1 ; none;; 4.69 

418558; AW082266;Hs.88131; Fas (TNFRSF6>«sodated via death dom8l:de8l^DED;; 4.66 

434164; AW207019; Hs.148135; serine/threonine kinase 33: pMnase;TM=M;SS=N; 4.68 

443323; BE560621; Hs.9222; estrogen receptor binding site assodate; none:T1WNM;SS=M: 4.68 

400288; X06256; Hs.149609: Wegrin. alpha 5 (fibronectln rBceptor,; lntegittAFG-GAP;TM=Y;5S=N- 4 68 

418838: AW385224; Hs.35198; ectonudeotide pyrophosphatase/phosphodl; PhosphodiesfcTM=Y:SS=M; 4,67 

417425; NM.002291; Hs,82124; lamlnin. bota 1: laminln_^,lamlniiLNlemUntegTnLB;: 4.67 

425976; C75094; Hs.334514; NG22 protein: voRage_CLC:TM=Y;SSeM: 4.66 

407844; AW073716; H3.8037; ESTs; lransmembrBne4,none; 4.66 

450656; AA010539; Hs. 18912; unnamed protein piodiKf zf-C2H2:; 4.66 

42031 1; AW445044: Hs.38207; Human ONA sequence lirom done RP4-530I15: none,none; 4.65 

404287; ; ; FGENESH piedided novd CU&domain conta; none,non8; 4.64 

452747; BE153B55: Hs.61460; Ig superfamily receptor LNIR; lg,Rhabd jIycop:TM=Y;SS=M; 4.63 

426880; AA320160; Hs. 171811: adenylate kinase 2; adenylateklnase;TM=M;SS=N; 4.63 

430397; AI924533; Hs.105607; bicarbonate transporter related protein ; HC03 cotransp;TM=Y;SS=N: 4.63 

447655; NM.003726; Hs.19126: sn: kinase-assodated phosphoprotein of ; SH3yPH;TM=M;SS=N: 4.63 

414271; AK000275; HS.75B71; prolan kinase C binding protdn 1; binmodomalaPHD,PWWP.zf^«YW);TM44;SSs^^ 4.62 

429126;AW172356:H8.99083:EST8;7tmJ.none:4.61 

429150; AF120103; Hs.197366; smoothened (Drosophila) homotog; COX8,SHMT,MiF,GST,aB^1GJomain,GST.N.Sl.Fz,Frtzzled.cdrB8cuIln.^^ assoc;TM=Y;SS*M: 

4,60 ~ 
409220; BE243323; Hs.51233; himor necrosis fador receptor superM; TNFK.c6.death,Upoproteirv.S,TIL:TM;>Y:SSsM: 4 60 
421921; HB3363: Hs.355993: Uanstocasa of Inner mitochondrial membr. zf-Tim10_DOP.cfhand.CH.spedrin^n;TlyNMtSS=N; 4.60 
429083; Y09397:Hs.227817;BCU-rolated protein A1;Bd.2;TM=M;SS=N; 4.59 
400290; H18836; Hs.31608; hypotheScal protein FU20041; none.Cys_knot 4.59 
430379; AF134149; Hs.240395: potassium channel, subfamily K, menier 6; fon_tr8ns;TM=Y;SS=M; 4.59 
409645: AI142265: Hs.55498; geranylgeranyl diphwphate synthase 1: polyprenyLsynt;TM=M;SS=N: 4.59 
427373; AB00797% Hs.130760( myosin phosphatase, target subunit 2; adcTM^iytSSsN; 4.58 
437212; AI765021: Hs.210775; ESTs; UDPGT.none; 4.58 

430396; 049742; H8.241363; hyaluronarvbinding protein 2; ank.dealh.2U5.EGF>ringto,typsIn,Nehdln,Uiyt: 4^ 

452069; AB028949;Hs.183994;KIAA1026 protdn; Matal!ophos;TM=M:SS=N; 4.56 

416041; AA345547: Hs.53263: hypo&ieticai protein FU13287; WD40;; 4.55 

434511; R28982: Ks. 161 06; ESTs; pkinase.Glyco hydro 39; 4.55 

410174; AA306007; Hs.69461; DKFZP434C245 (iiteln; WoSPc; 4.55 

418758; AW95931 1: Hs.172012: hypothetical pntdn DXFZp434J037; pkin8se,R10l;TIMsMI:SS=N; 4.55 

451367; AA923729; Hs.28322: ceD cyde relatsd ktnasa; pkInase;rM»M;SS=lt 454 
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nlZ^ S-S ?? 5r4onoofetBl«rophoblastrtwn>tet^ 4.54 
41 1296; BE207307: Hs.10114: growth suppressor 1; 20Gfen_0xy;TM=M:SS=M; 4.53 

fi!|^28081; Hs.6817; inoslne triphosphatase {nucleoside tr^; Ham1p_aB:TM=*/tSS=N: 4.53 
^ 1^1!^ ^2^1002742: Hs.2891; protein Idnase C, mu: pWnasaDAG.PE-bind WM=M:SS=M; 4.53 

tSX? ISSISI^'?'^ ^' apoptosis-felaled cysteine pr pWnase jCEj)10.ICEj)20;TM=M;SS=M; 4,53 
428005; AW302245; Hs.181390: casein Mnasa 1, Qamma 2; pkinase;TM=M;SS=N; 4.52 
Ijni* Hs.38772; RAB38, ftmber RAS oncogene family; ras,ari;TM=M;SS=N; 4.52 

435523; T62849; Hs.11090: membrane-spanning 4-domain3, subfamily A; none;TMaY;SS«M; 4.5t 
1 A H^^^' AW271217; Hs,281434; Homo sapiens cONA RJ14028 lis. done HE; non8.none; 4,61 
I U 405484; ; ; C3002124':glll2737280|feflXP 006682.2j It; none;; 4.50 
443605; H06865: Hs.134131; ESTs; efhai^.loa.trans.nonB; 4.50 

t^S^H"'?!^!:'!^^^^^^ ^^^^^^ 7lnu1^HC4.fn3.SPRY.KRAB.rf^H2.iv8^B.box;TM=Y;SS=iW; 4.50 
422112; BE540240: Hs. 1 1 1783; \jun\ protein; Sm,BAG;: 4.49 

- 418869: AW516565: ; gb3(q01d05J<1 Soares.NHCeC.cewlcal^tumo; none.RasGAP,WW,IQ: 4.48 
ID 447898;AW969638:Hs.112318;a2kdprDtein;none.none:4.48 

450607; AU)50373; Hs.25213: hypolheDcal proteoi; SH3;TM=M;SS=N; 4.48 

418918; X07871; Hs.89476; C02 antigen (p50). sheep red titood cell ; lg:TM=Y;SS=M; 4.48 

'^^^006226; Hs.153322: phosphoiipase C. epsBon; C2,PH,P|.PLC.Yj>|.PLC-X:TM=M;SS=N; 4.48 

426812; AF105365; Hs.1 72613; sobte carrier family 12 (potassiumiichio; none;TM»Y;SS=N; 4.47 

^V^^' 6J"*artoA47582B<eII gn Fun^ikB.pkinase.Recep_Ldomaln.YIP.noi>B;4.47 

435615; Y15065; Hs.4975: potassium voltage^ated channel. KQT-lilq lon_tr8ns.KCNQ1.channet:TM=Y;SS=N; 4.47 

427557; NM.002659: Hs.179557: plasminogen acthfalor, urokinase reccpto; UPAK.LY6,ET.PLA2Jnh;; 4.47 

428727; AF078847; Hs.78452; general transcr^ tactor IIH. polype; PH04,UM;TH^iytSS=N: 4.46 

41276(^ AW379030: Hs.41324: ESTs; CbLN.CbLN2.ChLN3.UBA;EM:3HC4.none; 4.46 
ZD 409093; BE243834: Hs.50441; CGMM protein; rabosomaI_l37ae.pWnase.POLO,box.lIW>^ynUb,dynanin,(Mi^^ hZiP.M= 4 46 
E^cIS.^? ^i^' Jjhosphoadenoslne 5'-phosphosuK8te sy; APSJdnaseATP^uHurylase PRK.ThymWylale.kln;; 4.46 

447434; R16890: Hs.137135; ESTs; pkinase.lh3jg,pklnase.fn3; 4.45 

422010: AA302049; Hs.31 181; Homo sapiens cDNA: FU23230 fis. clone C; none.SDF,sugar Ir; 4.45 

414108; AI267592; Hs.75761: SFRS protein kinase 1; ank,PH.03(ysterol BP,pklnase:TM=l\4;SS=N; 4.44 
DU 457001; J03258; H5.2062; vitamin D (1,25- dihydroxyvltamin D3) re; homionejBc^f^.Metallothk) 5;TM=M;8S*N; 4.44 

409886; AK000002; Hs.55879; Homo sapiens mRNA; cDNA DKFZp434L0827 (f; ABC lran>\BC meml)rane;TiyNM:SS=M- 4 44 

408113; T82427; Hs.1941 01; Homo sapiens cDNA: nj20869 fis, ctone A; 7lnL3,none: 4.44 

436823; AW749865; Hs.1 17077; ESTs, Weakly similar to 138022 hypolheti; aa_penne3ses,zf. 
^ C»I^KR|^.pyi^^^ 4^ 

JD 450505: NJ^004572; Hs.25051; plakophilln 2; Armadnio seg:TM=M;SS=N; 4.43 

437915; AI637993; Hs.202312; Homo sapiens ctone N11 NTefa2D1 teratoca; nonB,nons; 4.43 

417412; X16896; Hs.82112; interieukin 1 receptor, type I; l9.TIR;TM=M;SS=M; 4,43 

445033: AV652402: Hs,72901; cydi^dependent Wnase inhibitor 2B (pi; ank;; 4,43 
. ^ 41 1027; AF072099; Hs.67846; letikocyta ImmunoglobullTHike receptor, ; inositol P,lg:TM=M;SS=N; 4.43 
4U 452124; AA454220:Hs.61170; ESTs; pkinascnone; 4.43 

422599; BE387202; Hs.1 1 8638; non-metastaOc ceHs 1, protdn (NM23A) ; NDICPKOxysterDLBP:: 4.42 

416202: AW984492; Hs.169624; ESTs; none;TM=IW;SS=N; 4.42 ^^^or. 

441518; AW161697; Hs.294150: ESTs; Yj)hosphatase.DSPcnone: 4.42 

- - 441680;AW444598:Hs.7940;RAP1, GTP^DP dissociation sBmulator1;Armadiik) seg;TM=M;SS=N- 4.42 
4D 431429; AF072B13; Hs.262831: reticulon 3; ReUcukjn.FzJgWngte.pklnase;TM=Y;S§=N; 4.42 

453870; AW385001; H8.8042: Homo sapiens cDNA; FU23173 fis. done L: FaGAP.lnlegrin>V.NlF: 4.41 
421242; AW161386; Hs.13561; hypothetical protein IWGC4692; none:NA;NA; 4.41 
456362; AW973003; Hs,179909: hypothetical protein FU22995; none;TM=M;SS=N; 4.41 

- . 419073; AW372170; HS.1B3918: Homo sapiens cDNA FU12797 fis. ctone NT; death,ZU5:; 4.41 
DU 432211; BE274530; Hs.273333: hypothetical protein FU10986; FGGY.C;TW=M;SS=N: 4.41 

413367; NM.006S17; H8.75317; solute carrier family 16 (monocarboxyDc; sugarJr:TM=Y;SS=N; 4.41 
458097; AW341135; Hs,68104; ESTs; none,SH3,PID; 4.40 

458248; BE407379; Hs.108082; ESTs. WeaWy sbrtlar to T31636 hypothetl; C1q.Co!lagen;TM=M;SS=Y; 4.40 

DD 44369ft AI344782;Hs.9683:DnaJ(Hsp40)horndog, subfamily C.membe;rTn^^ 

437162; AW005505: Hs.5464; Oiyroid hormone reoeptor ooacBvatlng pn bromodomain;TM=M;SS=N; 4.39 

453891; AB0377S1; H3.301242: Homo sapiens mRNA M length Insert cDN; none.none: 4.39 

442572; AI001922; Hs.13S121; hypotheUcal protein FU22415; nonB,HSP70; 4.39 

427337; Z46223: Hs,176663; Fc fiagment of IgG, taw affinity lllb. r; Ig:TM=Y;SS=iyi; 4.37 
OU 445817; NM-003642; HS.1334Q; Mstone acetyHransferase 1; none:TM=M;SSsN; 4.37 

444895; A1874383; Hs.22891; sdute carter family 7 (catkinic amino ; ASC.dealh.TNFR^c6: 4.37 

408912; AB011084; Hs.48924; K1AA0512 gene product; ALEX2; ArmadHto seg;TM=M;SS=M; 4.37 

432106; NS8323; Hs.269098; ESTs, WeaWy similar to RETROVIRUS-REIAT; SH3,P0Z,Guanylato kin.none: 4.37 
, 418283; S79895; Hs,83942; calhepsin K (pycnodysostosis); PepUdase Cin 4.37 

OD 445828; BE313754; Hs.13350; Homo sapiens mRNA; cDNA OKF2p585D0918 (f; ig.tsp 1.ZU5.Nucleoslde tran; 4J7 

417874; BE616160; Hs.82829; protein tyrosine phosphatase, norwecepl: Y_phosphatase;TM=Y;SS=N: 4.^ 

400257; ; Hs.76365: ENSP00000000452:BAD protein (Ba.2 blndi; none:TM=M;SS=N; 4.36 

431476; BE612705; Hs.255597: hislidlne triad nucleo8de^ng prote; HIT;; 4.36 

416178; AI808527; Hs.192822: seratoglcaBy defined breast cancer anS; none;TM=M;SSeN; 4.36 
/ U 456629; AW891965; Hs.367942: histone deacetylase 3; HSP90,HATPase c.zfCT<2.PHD.none; 4.36 
' 427716; tJ8951; Hs. 180446; karyopherin (Importin) beta 1; Annadilk) 8eg,HEAT;TiyN«;SS=N; 4.35 

?u?S »?'»59827; death assodaled protein 3; myb_ONA4)indlngmBAKbromodoinaln,PHD.SET;TM=IWJS$^ 4.35 

420261; AW206093; H3.748; fibroblast growth factor receptor 1 (fms; pklnase.lg4)klnasaJg,p450.SET.PWWP; 435 

445926; AF054284; Hs.334826; splidng factor 3b. subunit 1, ISSkD; none;TM=M;SSsN; 4.35 
/ J 410726; AI623859: Hs.15936; ESTs; pkinase,proJsomerase.none; 4.35 

433996; AW135357; H3,192374; ESTs; HSP90,HATPasejD.UDG: 4.34 

433592; NM.004642: Hs.3436: deleted in oral cancer (mouse, homolog) ; none;TfcMyi-SS»M: 4.34 

437103; AW139408: Hs.152940: ESTs; Chonne.Wnase.none; 4.34 
-.^ 410058; AI633888: Hs.58435; FY^W>lnding protein (FYB.12(V130); SH3;Tiy4=M;SS=N; 4.34 
OU 427349: AA3601 54; Hs.17741 5; Rnkei-Biskis-Reilly murine sarcoma viru; ubkiuilin;TM=M:SS=N; 4.33 

439807; AA376417; H5.374608; hypothetical protein MGC:5244.; abhydrolaseJ2;TM=M;SS=M; 4,33 

453308; AW959731; Hs.323099; ESTs; none.pklnase/ctlvbuacp; 4.33 

424893; AW2951 12; Hs.153648; Homo sapfans cONA FU13303 Os, done OV^ SAM;; 4.33 
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444664; N26352: Hs.1 1615; map kinase ph05phglas&«k8 protein MK^ OSPt:TIUNI:SS^ 433 
429655; 048959; Hs.21 1582; myosin, light polypepSde kinase: pMnase.th3.lg.noRe; 4.32 
409121; AA902256; Hs.78979: GcJgl apparatus pral^ 1: cysjWi^GFRnone; 4.32 
430280: AA361258: Ks.237868; Intarieukin 7 receptor. fh3.none; 4.32 

423798: AF047033; Hs.132904: sohite canter tamily 4. sodium ttotwn; HC03_cotransp;TM=Y;SS=M: 4.29 

425654; AB033022: Hs.158654; KIAA1196 prelein: zf^H%TlliNM;SS«N: 4.29 

457500: NM.0027S9: H5.2743B2: protebi kinase, interferon^ndticada dou; dsrm,pMnase:TM:=:M;SS"N: 4.29 

427127; AW802282; Hs.22265; pyrovate detiydrogenase phosphatase; mCjme^ 4.29 

447191; NM_014S21; H5.17667; SHWomain UnSt^ protein 4; SH3;TM=M;SS=N; 4.29 

40B331; NM.00724O; Hs.44229: dua} spedrdty phosphatase 12; DSPc;TM={ul;SS=N; 4.29 

441130; AI160734: Hs.267604: Homo sapiens PNAS.129 mRNA. comptete cds; BTB,Keteh,K.t8tra.0SPc;TM=M;S&=N: 4.28 

430057; AW450303: Hs.2534: bone morphogenette protein receptor, lyp; Aclivin_fecp.pkinase:TM=Y;SS=M; 4.28 

430250; NM_016929; Hs.283021: chtoiida IntraceDuIar channel 5; none;TM«W;SS=N; 4.28 

406774; AW5183B3; Hs, 177592; rftjosomal protein, large. PI; eOsjibosomal;; 4.28 

413809; L25851: H5.B51; Integrin. alpha E (antigen CD1G3. human ; vwajntegrtn J^FG-GAP;TM=M;SS=Y; 4.27 

44396ft AI093577; Hs.255416: hypothetical protein nJ21986:TTL;TM=M;SS=N; 4.27 

427378; BE515037; Hs.177556: melanoma antigen, fantily 0, 1; MAGE;TM=M;SS=N; 4.27 

412204; AI125507: Hs.24937: ESTs; lg,nm,none; 4.26 

439505; AI361238; Hs.41136; ESTs; MAM.pkinase.NucieopIasmln.none; 4.26 

451295;AI557212:Hs.17132;ESTs,Moderate}y6imi[artol54374gene:pkinase.DAG PE-blnd.pkinase COPRJione:4.26 
45248B; N74921; H3.184389; ESTs; none:TM=M;SS=N: 4.26 

450973; AF012072; Hs,25732; eukaryotic translation initiation factor; W2.MA3,MlF4G:TMsM;SS=N: 4.26 
452437; AA026237; Hs.181272; ESTs; elhand.lorLtrans.nonB; 4.26 
438204; AI589545; Hs.128690; ESTs; none,7lmJ: 4.25 

424756; AW504657; Hs.152931: lamin B recepton ERG4_ER624/KBP;TI*»Y;SS=N; 4^5 

430570; AI417881; Hs.292464; ESTs; 7tni.2.Fz.Fri2zIedjione;4.25 

445709; H02592; H5.74280: ESTs; PDZ^none; 4.25 

428134; AA421773; Hs.161008: ESTs; ArmadiaojBegme: 4^4 

434149: Z43829; Hs.244624; hypothefical protein M6C5469; none:TIUNM:SSsM; 4.24 

4251 18; AU07661 1: H3.154672; methylene tetrahydrofblate dehydrogenase; myb [>NA- 

t)indingJHF.DH6_CYH.THF_DHG_CYK_CCAP_GI.Y.AAA.lON.PepBd^e_C9.b2IP.IAxan urj)ennea8e,HC03 colransp:TM=M;SS=N: 4.24 
433376; AI249361; Hs.74122: caspase 4. apoptosis^elated cysteine pn CARO,1CEj)10,ICEj>20h 4.24 
447818; W79940; HS.35S279; Homo sapiens done 24670 mRNA sequence; none^pklnase; 4.24 
450684: AAB72805; Hs.25333; tnterleukin 1 receptor, type II; ig;TM=Y;SS=M; 4.23 
435542; AA687376; Hs.351226; ESTs; SK3,lg.pWnase.PH.spectrin,RhoGEF.none; 4.23 
426224; BE0B5860; Hs.374468; karyopherln pmportin) be\a 2; ArmadiDoj5eg,HEAT;TM=M;SS=N; 4.23 
413284; AU077055; Hs.289107; baculoviral lAP repeal-containing 2; zf^HC4.CARD.BlR,deatMg;TM=M;SSaN: 4.22 
421917; AB028943:Hs.109445:KIAA1020protdn:BTB.zf-C2H2.PI3 PI4 kinase,PI3Ka;TM=lW;SS=N; 4.22 
431239; AL039971; Hs.251216; hypothefical protein DKFZp434A196: SH2.ank,WH2;; 4.22 
419585; W76083; H5.13418S; ESTs; none;TM=M;SS=N; 4.22 

431530; NIW,002204; Hs,265829; Integrin, alpha 3 {antigen C049C alpha ; FG^.Rhabdj!ycop.lnlegiin>:TM=Y;SS=M; 4.22 
425177; AF127577; Hs.155017; nuclear receptor interacUng protein 1: none;; 4.21 
422219; AW978073; Hs.1010; regulator of mitotic spindle assemtily 1; pkinase.none; 4.21 
450746; D82673; Hs.278589; general transcription factor II, i; none.SH3,PX; 4.21 
428023; AL038843; Hs.374530; Homo sapiens cDNA: FU23602 fis. ctone U* 

aaj)enneases.pyTidoxaLdeC.bromodomata.PHD,MBO.ATJ»ok.DDT.PI3_PI4.kinase,FAT,FATC;Bo^ 
416907; W60909; ; gtKzd29g10,8l Scares JetaLheart^NbHH19Wi kwutransjjone; 4.21 
411768; NM.013371; Hs.71979; Inteileukin 19; ILIO;; 4.21 
425262; D87119: Hs.155418; GS3955 protein; pidnase;; 4.21 

430035; NM.003463; Hs.227777; protein tyrosine phosphatase ^ IVA. m; Yj)hosphatose,DSPc;TMaM:SS»N; 4.21 

41 1789; AF245505; Hs.72157: Adlican; lg.LRR.LRRNT,UWCT;TM=M;SS=M; 4.15 

416636; N32536; Hs.42645; solute earner Can^y 16 (monocarboxyric; none.none; 4.14 

419452; U33635: Hs.90572; PTK7 protein tyrosine Wnase 7; lg.pkinase:TM:»Y;SS=M; 4.13 

431745; AW972448: Hs.163425; Novel FGENESH predicted cadherin repeal ; none.none: 4.10 

416965; N26223: Hs.160438: MDAC1; O0ne:NA;NA; a94 

426890; AA393167; Hs.41294; ESTs; none.none: 3.88 

442438; AA995998;Hs.370C07;gbMs26b03.5l NO^CGAPJOdS Homo sapiens: i»one,ONAj)oLB.ONAj)oLB exo:3.86 

41 2314; AA825247: H3,356084; downstream of: G proteln^oupled recepto; 7tnLl;TM3Y;SS=M; 3^ 

448243; AW369771: Hs.367688; Integrin. beta 8; Integrin J,none; 164 

439318; AW837046: H5.6527: G piolelrHXMipted receptor 56; 7lnUCytC_asm.6PS:TM=Y,'SS=I\lt 3.81 

415999; AA172179; H5.294029; ESTs; none.none:3.60 

429466; M85835; Hs.12827; ESTs; none.nono; 3.45 

407853; AA336797: Hs.40499; dickkopf (Xenopus Ms) homolog 1; none;TiyNUI;SS=Y; 3.34 
400517; ; ; lengsin; none:TM=M;SS=N; ai7 

439180; AI393742:Hs.l99067;\^cifrW avian eiylhroblasBcleuI^ Ldomain.Peptldase M24:Z88 

426158; NM-001982;Hs.199067;v<rtM)2 avian eiythroblaslJc leukemia v:FuiWike^ l>.domain,Furifrlike,pkinase.Re(»p l^domaln.Pepfld«e M24'Z84 

414521; D2B124;Hs.76307; neuroblastoma, suppresskm of lumorigeni;OAN;TM=M:SS=M ~ r - . 

424522; AL134847; Hs.149957: ribosomai protein S6 kinase, 90kD, po^; |iidnase,pkinase C;; Z70 
438167; R26363; Hs.24288; chemoklne binding protein 2 (CC8P2). mRN; none:TM»Y;SS^ 2.66 
418886; AU076801; HS.8943&: cadhedn 17. U cadherin (Rver-intestin; cadh8rin:TM»Y:SSn|y|; Z17 

TABI£23B 

Pkey: Unique Eos probeset Uentifier number 
CAT tmber. Gene duster numtier 
Aooesston: Genbank accesston numbers 

Pkey CAT Number Accession 

409745 MH1944_5 BI030997AA921874AW188822BI027862A1347618AI361453AI088754 AW207491 AA077391 BGOIOT^ 

BI027884 BI009100 BI006275 BI006270 BI031000 BI029864 BI006277 BI007627 BH)06266 BI006991 81006990 BI007763 81007762 BG997377 AA150780 BI033516 
BIQ27818 BG015789 BI033607 AA341445 

417888 1031334.1 AA21 0987 D57294AA21 4584 AA207006DS6572 

438993 2580163.1 AI926361AAB34879AA828995 
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418869 
416907 

TABLE 23C 



12789.14 
1112245.1 



AA229762AA230035 

W60909 W61051 M78905 BG959483 



Pkey: 

Ret: 



Unique number correspojxfmg to an Eos probesel 

Sequence rource. The 7 dlgll numbers In Uiis cohnm are Genbank UenCSer (Gl) numbers. Dunham I. et aL' lebis b (he publicaSon entiOed The ONA 
sequence of human chmmosome 22.' Dunham 1. el d..rfalure (1999) 40i48M95. 
Strand: IncTK^ ONA strand from which enons were predicted. 
NLposItion: Indicates nucleotide posQiDns of predicted exon& 



Pkey 

406400 

403362 

404210 

404287 

405484 

400517 



Ref 

9256298 
8571772 
5006246 
2326514 



Strand ^fLpasHio^ 

Plus 1553-1712.1878-2140.4252^5922^ 

Plus 6409»«4260 

Phis 169926-170121 

Phis 53134^261 

PkB 199214-199579.199672-199920.200252-20049 

Minus 49998^346 



TABLE 24A: 571 GENES UP-REGUUTED IN HEAD AND NECK TUMORS COMPARED VVTTH NORMAL BODY TISSUES 

1^1^^^ ^ uiHoguteted in head and neck tumors compared with normal body Ossuas. These genes ware sdteded from 59680 probesets on the Eos/Affwietrix 

Pkey: Unique Eos probeset identiflar number 

ExAccn: Exemplar Accesskmnuidier. Genbank accession nunter ' 

UnlgenelD: Unigene number 

Unigens Title: Unigene gene tilie 

R1 : 70th peicengle of Al f or head and neck cancer samples vs. the 80th percenMe of the Al for nonnal body tissues 



Pkey 

421155 
452401 
434377 
438274 
401488 
446999 
423887 
419569 
428505 
420602 
445019 
452281 
449722 
423575 
424086 
447078 
429597 
409506 
426471 
413266 
419948 
451807 
442875 
452795 
420931 
416283 
450221 
449101 
442611 
438533 
414132 
447164 
402047 
414972 
452943 
416661 
427099 
449318 
418345 
415076 
414142 
432865 
431608 
411750 
418612 
438394 
452198 
423020 



ExAocn UnlgenelD Uhigene Title 

H87879 Hs.1 02267 lysyl oxidase 

NM.0071 15 Ks.29352 tumor necrosis factor, aipha^nduced pro 

AW137148 Hs.308593 Homo sapiens cDNAFU11382fis.ck)ne HE 

At918906 Hs.55080 ESTs 

AA151520 Hs.334822 hypotheOca) protein MGC4485 

AL080207 Hs.134585 DKFZP434G232 protein 

AI971651 Hs.91143 Jagged 1 (Alagiiie syndrome) 

AL035461 Hs.2281 chromogranln B (secretogranin 1) 

AF060877 Hs.99236 regulator of frprotein stanaTing 20 

AI205540 Hs,281295 ESTs 

T93S00 Hs.28792 Homo sajdens cDNA FU1 1 041 lis, done PL 

BE280074 Hs.23980 cycHnBI 

C18863 Hs.163443 Homo saddens cDNAFU11576fls,ck)nB HE 

AI351010 Hs.102267 lysyl oxidase 

AW885727 Hs.301570 ESTs 

NM_003616 H3.2442 a dislntegrin and metaOoprolelnase doma 

NM.006153 Hs.64589 NCK adaptor protein 1 

M22440 Hs.170009 Iransfomtfng growth factor, alpha 

AL039079 Hs.75256 regulator of Gprotein slgnaillng 1 

AB041035 Hs.93847 NADPH oxidase 4 

W52854 Hs.27099 hypotheficaJ protein FU23293 similar to 

BE623003 Hs.23625 Homo sapiens ck)naTCCCTA00142mRNAsequ 

AW392555 Hs.18878 hypothetical protein FLJ21620 

AF044197 Hs.100431 small inducible cytokinaBsubfemilyfCy 

NM.005429 Hs,79141 vascular endotheBsl growth factor 0 

AA328102 H3.24641 cytosketeton assodaled protein 2 

AA205847 Hs.23016 G proteln<oupled receptor 

BE077155 Hs.177537 hypolhetfcal protein DKFZp761B1514 

A1440266 Hs.170673 ESTs. Weakly similar to T24832hYpo(hetI 

AI801235 Hs,48480 ESTs 

AF026941 Hs.17518 Homo sapiens cig5mRNA.p8rti8] sequence 

AK001921 HS.1S9575 hypothetical protein M6C2550 

BE263782 Hs.77695 K1AA0008 gene product 

8E247449 HsJ1082 hypothetical protein FU10S25 

AA634543 Hs.79440 IGF4I mRNA^ndlng protein 3 

AB032953 H5.173560 odd Ozrtenmi homotog 2 (Drosophlla, mous 

AW235021 H8.78531 Homo sapiens, Similar to RIKENcDNA 5730 

AJ001696 Hs.241407 serine (or cysteine) proteinase Inhlbtio 

NM_000857 H3.77890 guanylato cyclase 1. soluble, bete 3 

AW368397 Hs.150042 Homo sapiens cONA FU14438 fc, ctone HE 

AI753709 HS.1S2484 ESTs, Weakly similar to 138022 hypotheO 

M30703 H5.270833 amphlregulin (schwannoma^erived growth 

BE562298 Hs,71827 K1AA01 12 protein; homolog of yeast ribos 

AB037788 Hs.224961 deavageandpolyadenytalkMispecHfcte 

BE379623 Hs.27693 pepfidylproiyl Isomerasa (cydophOinj-l 

AI097560 Hs.61210 ESTs, Weakly similar to 138022 hypotheO 

AA363092 Hs.160B repScalton protein A3 (14kQ) 
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422426 


W79117 


HS.S85S9 


406747 


AI9251S3 


Hs.217493 


445828 


F0S802 


H$.81907 


431806 


AF186I14 


Hs.270737 


452909 


NM.015368 


Hs.30985 


432226 


AW182766 


Hs.273558 


458027 


U9054 


Hs.65195 


443354 


AW97(K72 


Hs.9247 


416049 


AI970538 


Hs.16603 


431494 


AA991355 


Hs 298312 


433859 


AW8967S8 


Hs^3789 


426753 


T89832 


Hs 170278 


400792 


AA635062 


Hs.50094 


402D34 






424073 


U03493 


Hs 1389S9 


458424 


AI084049 


Hs.206761 


435159 


AA668879 


Hs 116649 


409269 


AA576953 


Hs^972 


444361 


W76027 


Hs.23920 


439128 


AI949371 


Hs 153089 


420795 


AA323037 


Hs. 128645 


422505 


AL1 20862 


Hs.124165 


434828 


D90070 


Hs.96 


41056i 


BE540255 


Hs6994 


423035 


AW449679 


Hs. 156739 


417655 


AA780791 


Hs.14014 


414869 


AA157291 


Hs.21479 


453049 


BE537217 


Hs.30343 


417801 


AA417383 


Hs.82582 


435243 


AW292886 


Hs.261373 


431211 


M86849 


Hs.323733 


407746 


AK001962 


H$.38114 


416498 


U33632 


Hs.79351 


414231 


A)468004 


H$.278956 


426227 


U67058 


Hs.168102 


439452 


AA918317 


Hs.57987 


439999 


AA115811 


Hs.6838 


417791 


AWg65339 


Hs.1 11471 


436466 


AA742221 


Hs.120633 


432731 


R31178 


Hs,287820 


429903 


AL134197 


Hs 93597 


435039 


AVV043921 


Hs.130526 


419743 


AW408762 


Hs.5957 


457001 


J03258 


Hs20fi2 


450684 


AAS72605 




422440 


KM 004812 


Hs 11S7?d 


458531 


AA36771B 


Hfi 1<)QnR!) 

no. 1 


416065 


BE287931 




411386 


X7292S 


H<; (i<i7V) 


419750 


AL079741 


Hfi 1fl')11d 


429370 


CI 9097 


Hs 89709 


429921 


AA526911 


Hs.82772 


449467 


AW205006 


Hs 1<)7tU7 

113. 1^1 VfC 


453102 


NIWL007197 




453637 


NK/L002S89 


Hs.34073 


400289 


X07820 


Hs.2258 


418007 


Ml 3509 


Hs.83169 


428368 


BE440042 


Hs.83326 


428227 


AA321649 


Hs.2248 


450375 


AA009647 


Hs.88S0 


427585 


D31152 


Hs 17Q79Q 
no. 1 ( 9 f £3 


424717 


H03754 


Hs.1 52213 


424735 


U31B75 


Hs.272499 


420159 


AI572490 




415511 


Ar732617 


He 1893^0 


406467 






422330 


D30783 


no. 1 1 o^QO 


452461 


N78223 


no. luoiuo 


415542 


R13474 


Ue 9Qft9fi^ 

no<£!ni4C0J 


413324 


VUVSf 1 


nS.r9494 


431571 




Us iflfKin 
ns.iouoiu 


443211 


AI128386 


Hs.143855 


451844 


T61430 




441877 


AW273802 


Hs.11340 


439926 


AW014875 


Hs.137007 


432015 


AL1S7504 


Ks.159115 


421103 


AI625835 


Hs^104 


448062 


AW295923 


Ks^SS472 


432222 


Al2049g5 




421577 


BE465451 


KS.10592S 


421187 


NM.014721 


Hs.102471 


408908 


BE296227 


Hs.250822 


437214 


BE092336 
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34.00 


ESTs 
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Co IS 


33.00 


nypouiBuca proiBm ru io«Xm 


32.00 




31.00 


CO IS 


29.20 


soning neMn ID 


26.00 


ESTs 


25.00 


pnOi DOr 1 £-nijnSV3l9- 1 i>aC8ISlB-inuuC60 


24.00 


noi(iu ;ie^Rino cuiv\. ruH£\j*n us, Gions n 


22.10 


M «nn}onc yf^ mQIklA Mnna DCD1 1\ 
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19.00 
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17.37 
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14.00 




13.00 


^pjuiiuKni |jiu^n, ijeid ZOKU (Conn 


10.80 


hvinf helical nmleln PtJ1 1 inn 


10.00 


|wu)99iuiii unanntn, suuianBij i\ nianuin 1 


9.60 


hvnalheUcBl nmlain PIJ19Q9Q 


9.00 


niHiiaii (JiwciiidSI? ovUVdiBQ reCOptOi*^ mK 


6.09 


D'^fQii wi>uijiiipiivma 1 ID ^nc unyoT pro 


6,07 
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las MuiiiuHjg geiio lamiiy, meniDer c 


8.07 


ESTs 
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ESTs 


7.23 


fjhnvinrlln 1 


7.00 


cydbvdependont Mnaso 5, iBgulatoiy su 


5.18 


P<sTe 
CO IS 


5.00 


MiwnA cdnione Hnna OAA^R mDMA c^nuuum 
mniKj aaptono uuno i q inririA ScvfUenCS 


4.25 


viuunin u ( it&9* auiyaroAyviianun uoj re 


4,24 


uiienBuiun i icCopiOi» (ypo u 


3.74 


flMf\.lratA Mf(tN>laea ImwIIu 1 ■ Imi D4 n 

ouoiwio icDucioso laraif i, nieniDBr b lu 


3,19 


ESTs 


3.00 


prDGfBfdUnQ cell nuclssf sntiQon 


2.53 


OcoUIOWUin 1 


1,00 


nomo sapiens cuw\ rtj ih^d nSi oons rn 


1.00 


nit il ann^f A-^f^wef a] n a 4^* a m A.4tl*Ar m tit 

yiuiamai&^ysiejne iigaso, mouitiersuDu 


1.00 


fWIswon Krna Yl ainha 1 
wu^Kff}, lypo Al| Bipna 1 
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ESTs 
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1.00 
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fTtfldhf nwlfllfnnmiainaea 1ft /glwmmliialo 
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517.00 


fTufriy molalhinmialnaca 1 flnlaiBlllt«t 

iitdUM ineidiiDpfv/icinaso i iinioiSuUw 
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iTiduix moiaiiupoicinsoQ J (suonwiysiii * 
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STTudl tnffivthln rvtnVina Ruhfamflw R /fV 
wiKMi iiiuiKiuiu cyuMuno suDiamJiy d \\if 
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Hfliagen, lype a, aipna i (ocnnw mstapn 


264.00 


wii^)ess«1ypG MMTV integraiton sfts fairi 


124.00 


short'Chain ^o(rfu]l dchydroQsnasB family 


1.00 


Unmn aonlane nOUA' CI 194 04 R fi» mIm>m f* 

nomo sspiens cuna. rij< iZ4o »s, done u 


1.00 


CsIB 


1,00 




141.00 


eplFBQuIin 


98.00 


uanscnpuon factor 


159.00 


c() IS, weaKiy simiar to uoozz nypotneu 


1.00 


ooiliootropinreiBasing hormone 


1.00 


soDcina facfnr rvrJifM/ahriaminn rirh f 
o|<iH«uiy m«iui ^wuKHjjiuwiniiic libit t 


1 en 
7.DU 


ESTs 


99.00 


gb:yc06a03^1 Stmtagene lung [93721Q)H 


1.00 


hypothetical protein aJ23047 


3.00 


ESTs 


Z79 


Homo sapiens mRNA; cONA OKFZp586O0724 (f 


94.00 


Homo sapiens mRNA: cONA DKFZp6670226 (fr 


1.22 


KIAA1843 protein 


1.00 


gb»n03c03j(1 Stratagena schizo brain SI 


1.27 


sbigle^nded (Drosophila) homotog 1 


1.00 


KIAA0680gena product 


5.00 


serlneAhieoRiRe kinase IS 


89.00 


8b:IL2-«T0734.240400^^12 BT0734 Homo 


too 
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449773 • 


R76294 




ESTs 


1.00 


443054 




Us flow 


yes-assoaated protein 03 RDa 


90.00 


432097 


X51730 


Hs.2905 


progesterone receptor 


1.00 


453216 


AL137566 


Hs.32405 


Homo sapiens mRNA; cONA OKFZp586G0321 (f 


(X38 


4301 84 


AB013802 


Hs.234790 


contadIn5 


1.00 


432239 


X81334 




matrix mebflbprotebiase 13 (odtagenase 


7.23 


415025 


AW207091 


Hs.72307 


ESTs 


1.00 


416575 


W02414 


m.38oo3 


ESTs 


1.00 


443171 


BE28112B 


Ks.90^ 


TONDU 


0:92 


424834 


/Uv0O1432 


Ks.1 53408 


Homo sapiens d)NA FU10570 US, done NT 


20.30 


400644 








0.60 


409402 


AF208234 


Ks.695 


cystalin B (stefin B) 


1.96 


412420 


AU)35668 


Hs.73853 


bono morptiogeneSc protein 2 


1.38 


435563 


AF210317 


Hs,95497 


solute carrier tonily 2 (f adfitated ghi 


Z60 


400751 








1.34 


436361 


AAS25814 


Hs. 149065 


ESTs 


0.92 


455612 


BE042896 


Hs.274648 


ESTs 


0.81 


447437 


U07225 


Hs.339 


puifner^ leceplor P2Y| Gfroteb) coupl 


1.55 


404148 








0.77 


431089 


6E041395 


Hs.283676 


ESTs, Weakly stmilar to unknown protein 


1.00 


446619 


AU076643 


Hs.313 


secreted pttosptraprotein 1 (osteoponlin. 


&47 


416658 


U03272 


Ks.79432 


M^n 2 (congenital contractural ara 


3.92 


442994 


AI026718 


Hs.169$4 


ESTs 


0.40 


415327 


H22769 




gt):ymS4cOZr1 Soares Infant t)rdn IKIB H 


0.47 


418624 


AI734080 


Hs.104211 


ESTs 


1.90 


452850 


H23230 


Hs22481 


ESTs, Moderately similar b A46010 


0.54 


401747 






Homo sapiens keratin 17 (KFtJIT), 


7.22 


442432 


BE093S89 


Hs^178 


hypotlieScal pnM FU23468 


5.10 


418259 


A AAA PAnA 

AA215404 


Hs.1 37289 


ESTs 


1.28 


432374 


W68815 


HS.301B85 


Homo sapiens cDNA FIJ1 1346 lis, ckme PL 


8.13 


403381 








21.00 


420923 


AF097021 


Hs^73321 


difierenlially expressed in hematopoleS 


0.00 


418216 


A A00#li'l AA 

AA6o2240 


H&283099 


AF15q14 protein 


11.29 


444649 


AW207523 


Hs.197628 


ESTs 


0.10 


407811 


AW1 90902 


Ks.4D09o 


cysteine knot superfamily 1 , BIMP antagon 


4.64 


402230 








1.64 


412530 


AA766268 


Hs.266273 


tiypotiietical protein FLJ13346 


2.97 


447334 


AA515032 


H5.9i109 


ESTs 


0.62 


432829 


W60377 


Hs,57772 


ESTs 


0.86 


418686 


Z36830 


Hs.87268 


ennexinAS 


8.44 


421508 


NM^004o33 


Hs.105115 


atisent in melanoma 2 


2.68 


410553 


AW016824 


Hs.255527 


hypol}wtk:ai protein M6Ct4128 


2.22 


419183 


U80669 


Hs.89663 


cytochrome P450. suti^amily XXIV (vitamin 


78.00 


425721 


AO002115 


tl^ 4 CA4AA 

Hs. 159309 


uroplakin 1A 


0.68 


420370 


Y13645 


1 1. A^VAAA 

Hs.97234 


uroplakln2 


0.67 


417720 


AA205625 


HS.2080S7 


ESTs 


5.83 


437852 


BE001836 


Hs.256897 


ESTs,Weakly8imilarbdJ36501^1 |H.sa 


1,07 


431753 


X76029 




neuromedin U 


7.00 


402075 








286.00 


423017 


AW178761 


Hs.227948 


serine (or cysteine) proteinase \Mto 


363.00 


408687 


M31126 


Hs^2620 


pregnancy spedlic beta-1-glycoproteki 9 


4.81 


405064 








1.00 


426864 


AK001866 


Hs.189095 


similar to SAUI (sal (Drosophilapke 


ZOO 


441233 


AA972965 


Hs.135568 


ESTs 


1.00 


456034 


AW450979 




gl):UI-H'BI3-ala4-12-0-UlAl NCI_CGAP.Su 


1.23 


414221 


AW450979 




g)i:UI-H-B134lfra-12-4}4JU1 NCLCGAP^Su 


a65 


412296 


AW936233 




gl>:QVO-DT002^090200-107-a06 DT0020 Homo 


1.00 


405494 








1.00 


407189 


AAS98927 




gb:ae37e03.s1 Gessler Wilms (umor Homo s 


1.00 


403085 






1.00 


408633 


AW963372 


Hs.46677 


PRO2000 protein 


Z46 


435257 


AA677026 


Hs.191217 


ESTs 


1.00 


410044 


BES66742 


Hs.58169 


liigtily expressed In cancer, rich In teuc 


1.00 


445162 


AW1B9787 


Hs.1 47474 


ESTs 


0.50 


417275 


X63578 


Hs.295449 


paraalbundn 


1.00 


418408 


X73501 


Hs.84905 


cytokeratin 20 


1.00 


421110 


AJ250717 


Hs.1355 


calhepsinE 


1.00 


408081 








Z13 


449448 


D60730 


Hs.57471 


ESTs 


i2aoo 


451668 


Z43948 ' 


Hs.326444 


caitilage acidic protein 1 


0.37 


408243 


Y00787 


HsX24 


interieukin 8 


3.35 


438246 


AW450963 


Hs.119991 


ESTs 


51i)0 


440304 


BE159984 


Hs.125395 


ESTs 


1.00 


402778 








1.00 


406117 








1.00 


406360 








71.00 


435347 


AW014873 


Hs.1 16983 


ESTs 


1.00 


445550 


AI242754 


Hs.137306 


ESTs 


1.00 


451359 


H85334 


Hs.336623 


ESTs 


1.00 


419559 


Y07828 


Hs.910g6 


ring finger protein 


1.00 


429488 


AF155827 


H1203963 


hypothetkal protein FU10339 


58.00 


42S420 


BE536911 


Hs.234545 


hypothetical protein NUF2R 


1.00 


402901 
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414918 AI219207 K5.72222 hypoiheOcd protein FU134S9 aS7 

417715 AW98^ Hs.86366 ESTs 5l12 

442S77 AA2929S8 Hs.183900 ESTs 2.19 

. 41S867 031771 H8.89404 nish(Drosoprii]a)homeobQXliomo]092 1.54 

5 42G088 AFD38007 Ks.166195 ATPase. Class 1. type 8B. member 1 1.11 

412610 X90908 Hs.74126 fatty acid bindrng protein 6. Baal (gas 1.27 

414683 S78296 Hs.76868 hypothetical protein MGC12702 a67 

431322 AW970622 gli£ST3K704 MAGE resequences. MAGK Homo 0.03 

. _ 403903 0.87 

10 405033 0.13 

422282 AF019225 Hs.114309 8pcQpopn>t8in L 2.13 

425652 AK001504 H&159651 death receptor 6. TNFsuperfam]iyinemt)er 1.05 

414987 AAS24394 Hs.294022 hypotheScal protein FU14960 ZS9 

430168 AW968343 Hs.24255 DKFZP434l1735protetn 1.69 

15 459702 AI204995 1.00 

446082 AI274139 Hs.1S6452 ESTs 0.60 

400843 0.76 

417409 BE272506 Hs.82109 syndecanl 1J8 

439738 BE246502 Hs.9Sg8 8emad«natn,lmmunogtot)uRn domain (Ig), 1.20 

20 437181 AI306615 Hs.125343 ESTs. WeaUysMlar to KIAA0758 protein 0.50 

404875 0.80 

436293 AI601188 Hs.120910 ESTs 1.40 

422809 AK001379 Hs.121028 hypothetical protein FU 10549 3.03 

425683 AL137708 Hs.161031 Homo sapiens mRNA;cDNADKFZp434K0322(f 0.94 

25 404977 Insu&n^ growth factor 2 (somatomedi 0.99 

431347 Ai133481 Hs.2S1664 tnsuiin^ike growth factor 2 (somatomafi 1.10 

413804 T64682 gb7c48b02.r1 Stratagene Over (937224) 0.85 

432842 AW674093 H5.334822 hypothetical prolelri MG04485 ' 1.20 

420876 AA918425 Hs.177744 ESTs 0.85 \ 

30 422119 At277829 Hs.111862 KIAA0590 gene product 0.71 

400846 sortHin-related receptor. L(DU^class) 0.75 

421100 AW351839 H8.1246G0 Homo sapiens cONA:FU21763 lis, done C 4.01 

430152 AB001325 Hs234642 8quaportn3 1.74 

402777 0.70 

35 417151 AA194055 HS.2938S8 ESTs a99 

411248 AA551538 Hs.334605 Homo sapiens cDNAFU14408rts, done HE 1.46 

405034 AL03S754 Hs.2474 toUtke receptor 1 1.00' 

406671 AA129547 .Hs.285754 melprot(H)ncogene(hepatocytegrQwgi(a 18.68 

431070 AW408164 Hs.249184 transcription factor 19 (SCI) 1.94 

40 453134 AA032211 Hs.118493 ESTs a70 

440006 AK000517 Hs.6844 hypoUietical protein FU20510 Z19 

418068 AW971155 H5.2939Q2 ESTs, Wealdy similar to ISHUSS protein d a25 

424364 AW383226 Hs.201169 ESTs. Weakly similar to G01763atrophin- Z74 

439780 AL109688 gb:Homo sapiens mRNA full length insert a07 

45 438315 R56795 Hs.82419 ESTs 0.65 

418937 T71508 H5.13661 ESTs. Weakly similar to T42383 probaUe 1.18 

444163 AI126098 gb:qc54g07j(1 SoaresLPlacent^6to9weete. 0.65 

444444 A1149332 Hs.14855 ESTs ' 0.59 

407581 R48402 Hs.173508 P3ECSL a82 

50 433078 AW015188 Hs.121575 Homo sapiens cDNAFU 12231 fis. done MA a92 

417003 AL038170 Ks.80756 betalne-homocysteinemethytlFansferase 0.62 

446024 AB040946 Hs.284227 KIAA1 51 3 protein a92 

427747 AW411425 H3.180655 sertneAhieonine kinase 12 1.42 

__. 419741 NM_007019 Hs.93002 ubkjultincanfer protein E2-C 1.68 

55 453883 A1638516 H$.22630 cofactor required for Spl transcrfpttona XJST 

426847 S78723 Hs.298623 5^xytryptamine (serotonin) leoeptor a08 

446009 AI969885 Hs.231926 ESTs 1.00 

457292 AI921270 H3.334882 hypofheticd protein PL) 14251 0.98 

415949 H10562 Hs.21691 ESTs a61 

60 420281 AI623693 Hs.191533 ESTs 7.01 

446673 NM_016361 Hs.15871 U>AP for iysophosphatidic add phosphata 0.72 

450983 AA305384 K5.25740 Ei^l (S. cerevlsiae)4ike 2.21 

414792 BE314949 H3.87128 hypothetical protein FU23309 0.99 

437553 AI829935 Hs.130497 ESTs. Weakly similar to MAT8.HUMANCHL0R 0.54 

65 421218 NKL000499 Hs.72912 cytochrome P450. subfamily I (aromaSec a08 

426900 AW163S64 Hs.142375 ESTs 0.48 

414595 AAS41726 Hs.289015 hypothsQcal protein MGC4171 aB3 

4(t2305 0.89 

_^ 453823 A1137967 gb:DKFZp761D2315,r1 761 (synonym: hamy2) a04 

70 445911 AI985987 Hs.145645 ESTs. IA)deratety similar to AtULHlWN A a49 

436608 AA628980 Hs.192371 down syndrome critical regton protein DS aSS 

423916 AW993496 Hs.17235 Homo sapiens done TCCOAOOITB mRNA sequ a63 

405932 1.76 

401760 Z61 

75 452240 AI591147 Hs.51232 ESTs 453.00 

421064 Ai245432 Hs.101362 tumor necrosis (ador. alpha-induced pro 1.04 

421373 AA808229 Hs.167771 ESTs 17.00 

427239 BE270447 H3.174070 ubtquiOn carrier protein 1.16 

435099 AC004770 Hs.4756 (lapstructuTMpediicendonudease 1 1.68 

SO 422406 AF02&441 Hs.116206 Op&interacting protein 5 3.19 

413278 BE563085 Hs.833 interferon^timulated protein, 15 kOa 1.73 

453389 BE273648 Hs.32963 cadherin6.lype2.K<adh8rin(ibtalM 1.00 

454789 BE156314 gb.-QVD-HT0367-1S0200.114<102 HT0367 Homo 1.00 

281 



wo 03/042661 



PCT/US02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



434487 


AF143867 


na-Mwi 


PRT* MMfornhilw dfidar In S(SfiS7 cMm 
COl8| MuuiNiuBlj suiMw W 009u9r O^HiB 


0.06 


424008 


R02740 




pUwIVe CnenwIunB recspaji, uir-Dinaing 


2.45 


418067 


Ail 27958 


Hs.83393 


Cjwaun C/M 


1.41 


441801 


AW94979Q 




CO IS 


14U.UI 


423536 


L22075 


Hs.1666 


guanine iwdeotida binding protebi (G pr 




41 01 S3 


BE31197fi 


HS.1SB3D 


hwivifhotinJ nmtotn PI 119RQ1 


cn nn 
90.UU 








Homo sa^Mens winged heiix/lorUtead trans 


1.17 


452316 






CO 1 s, Mooeraieiy soniiar to broQ^UMAN r 


0.68 


427587 






CO 1 s, weaxiy sonuar lO iroood sennanfi 


0l91 


451161 


M211329 


H5.26006 




1 nn 


453204 


R10799 




ESTs 




437240 






gDnixo5ciia£i Nu.CGAr.GCBI Homo sapiens 


1.00 


405531 






0l92 


440249 




He T»777i: 


CO IS 


1.32 


426763 


Z19084 


U« 17991 n 

ru. 1 f 1 U 


MIIPI nmtatn 
MUr f pTuloui 


1.17 


434192 


AVV397314 


Hs.34371 


CO IS 


1.D0 


407881 


AWD72QQ3 


ru.*tw900 


Kongrgii eitifeift fnhw^eomtnAt Q_^^eiillAl 
nopditai SUiioro (giuCuSaiiwiBj iM>5Uinn 


87.14 








37.00 




nr I tOU 10 




rKUiDcl pfOISSi 


1.(N) 


451592 


AI805416 


Ue 9l3ftq7 


CO IS 


10.00 


*tcctlV 




ns. 1 1 


an&>Mu!ledan hofmone 


0.67 


408947 






nomo sapiens mruvv, cuna UlsrZpdMNlDOZ (I 


1.00 


452732 






kbvnn eaniane' /4nna IMA/SC*')nRO(M mDKIA 

nofuo sapnnSi cionB iiw\uc.jM02ti4, hIKNA, 


0.99 


441940 


AW^IIS 


He 19Rm9 


ESTs 


0.88 




nustDO 


ns. iMotM 


CCTe 
CO IS 


0.33 


444008 




He 99070% 


CO IS, vvedNj suuiiar (D orw_nuiviAN orLiu 


1.01 


421307 


BE539976 


Hs 103305 




l.UO 


423853 


AB011537 




elH /ftfweftuhtta^ Kntnninn 4 

SHI lutusupiuw) iHNiiuiug i 


ft Ac 

11.40 


407846 


AA426202 


Hs.40403 


f!hfjtftflfl.l nlowriinft trcmeoHiualrv luit 
Miprp«iuv-mwla«lllig uonSaCOVaKni *nl 




410346 


r\VV lOrCUQJ 


He QiUftO 


PCTe 
Cols 


1.00 


419078 


M93t19 




irTeiiflnnma.necfv*(Qf(kH i 
uiduiun/iiia'tuiniuiausu i 


A t\A 


414907 


X90725 


Hs.77597 


ncSn /rVnennhiflUiIcA kinsien 


1 M. 
1.U4 


441795 


N58115 


H&21137 


ADfl94 nmlAin 

r%UVCrj JliUlBUI 


in AA 


418583 




He Rfi9i 1 


nypoiTieucai protein 


1.22 


444342 


MM ni43Qfl 


ns.iuooi 


similar to lysosome-associated membrane 


108.67 


413385 


M3445S 


Hs.840 


inouicaniniB^iiuiB ZnS aioiqrgariase 


2.30 


441495 


AW294603 


Hfi 197(L3Q 


ESTs 


A AA 

U.44 




AIMoVO 


M«A9QR9 


|}iym!dyl3lB syntlietaso 


Z46 


412661 


N3S60 


He 9iR1 1 


CO IS, vveaioy similar H> ia4df 4 gene nrc 


1.00 


411880 


AW872477 




nh'hm3nfll3 vl NPJ CliAQ IhvA Hrwnn eanlone 
gu.iuTiJuiuoJii iMU^i/ow_inj^ nmiHi Sapiens 


i AA 

l.w 


417771 




He R9>^7 


rslin(^ add rsceptor responder (tazsro 


1.44 


A3flA3i 


ADulQg 




zinc finger protein 41 


1.{X) 


4913711 


1 ItfMl 




small inauciDis cyionne suDiamtiy u (cy 


ZB7 


454417 


AI244459 




uinucieuuoB tvpsm Cvniauiing 9 


A CA 

0.54 


429257 


AW1R37Q4 


Me IQfnR^ 


2,3-bisphoGphoglycefat9 rnulase 


2.27 


417599 




He 11ft9A1 


PCTd 

CO is 


1.01 


438366 


AAfttV>7fin 


He VI7RR7 


PCTe 
Co IS 


1.00 


438746 


Aiftfl<»m 


Me 1{M777 


PCTe 
CO IS 


1.47 


409691 


T89983 


He 9if^9 


Homo sapiens, done M6&5437, mRNA« comp 


1.00 


408827 


AW57<»73n 
nvf £1 w< «}u 




CO 15 


1.00 




DCW0U1D 




ESTs 


1.00 


422278 




He 11/91R 

ns. 1 iHcio 


Hizzieo ^urosopnua/ nomoiog 0 


5.21 


412719 




He 19QQ11 


COIp 


494.00 


417034 


MM nnfi1R3 


He ftnQR9 


neurotensin 


1.00 


430486 




He OAifltli 


f>,Ulr\nAa Mkantuil MoWitm ontttttthtA fnm 

cniotue cnannei, caxsum acttvaieo, om 


23,36 
6.98 


407788 




He TRQQI 


01 uu caxwRVDuiuing fxoiein kc 


413753 


U17760 


He 71^17 


kuiunin, Doia 0 \\iMJSm \ 1 £9kUJ, Koiinin 


7.50 


424012 


AW3fift377 
nWOOM/ f 


nS.1J/9D9 


tumor (^ein 63 kDa with strong liomolog 


9.77 






Ue 1Q0C 


desmogtdn 3 (pemphigus vulgsvis anfigen 


445.00 


*rl«l*iv 




Ue TiOC 


KAUo inierainng, KUiestn-iiKe {raonnes 


13.93 


423673 


88)03054 


He IfiQC 


iiktuw nKiauDpiuieinase iz (maciopnage 


12.77 


452838 


LI65011 


He 3f¥7A3 

ns.ou/Hj 


preferentially expressed antigen in meta 


45.00 


418663 




He dlfiOn 


uBsmocuiun o 


10.89 


409632 


W74001 


He <V;97Q 


serine (or cyst^e) proteinase inliil)ito 


6.29 


429610 


/^QQ24937 


Hs91in09 


1 1 IMV iwrkltiW Dl 1 IM^ /n<^«ta hiiwi am4 >MMt 

LUNA proiein, rLuisu ipawie ning ana nss 


0.88 






Ue vuwia 


candnoembryonic antigen-related ceS ad 


38.31 


4910^ ft 


1 /OEM 




liefatin 6A 


36.81 


431846 


RFD1QQ7A 


He 971U)n 


uroplaldn 1 B 


1.37 








smati proltne-ifch protein 3 


8.85 


45J9D4 




H8.12744 


ESTs 


0.40 


446856 


AI814373 


Hs.164175 


ESTs 


1 1ft 

1. ID 


443646 


AI085377 


Hs.143610 


ESTs 


^^s 


408522 


AI541214 


Hs.46320 


Small proltn&^ protein SPRK [tiuman. 


4.39 


431384 


BE158000 




gb:MR2-HT0377.150200-202-e03 HT0377 Homo 


1.18 


422158 


L10343 


Hs.1 12341 


protease inhibitor 3, skin-derived (SKAL 


4.22 


435505 


AF2004g2 


Hs.211238 


interteukln-1 iiomolog 1 


164.00 


417386 


BE165289 


Hs.1076 


smati proiinMich pfdeln IB (comllln} 


9.85 


431958 


X63629 


H3.2877 


cadirarin 3, type 1, P-cadberin (placenta 


S.S8 


441020 


W79283 


Hs.35962 


ESTs 


5.76 


423217 


NM.000094 


Hs.1640 


collagen, type Vtl, alpha 1 (epidermolys 


1.97 


448733 


NM_005629 


H3,187958 


solute canter fan^ 6 (neurotransmitte 


1.09 


444371 


8E540274 


Hs.239 


torttheadboxlt^l 


ZM 


422168 


AAS86694 


Hs.112408 


S100 cddum-Undino protein A7 (psorias 


a39 



282 



wo 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



429259 


AA4204S0 


Hs.292911 


ESTs. Motdv sInAir In ^£0719 \^tAAa 


9C3 


426440 


BE382756 


Hs1G99Q2 


fiohilB carrier famllw 9 f fw^lHntRrf ohi 


1 ST 
l.0r 


437044 


A1035864 


Hs,69517 


cDNA for (fiifefenlially expressed C016 Q 


230 


423662 


AK001035 


Hs.130881 


Kfpil CV L /h/mnhnma 11 A liinr finnpr nra 


1 n4 


428484 


AF104032 


Ks.184601 


fiflhita earrtef (Jirniv 7 iRnttonh* nminfi 

41VNilw MOIIICI IGlllliJ f IvwUVIllv illlUIIU 


^11 


401781 






11 A7 

11. Uf 


401760 








9«9't 


429211 




Kk. 168949 
rR>« 1 oocta 


juiicugn piowlll, Vma 9 (uOnilBMn O 


1 tto 
l.OZ 


417389 


6E260964 


Hs 82045 




1 19 


423634 


AVVSSg908 


H5.1^90 


hpnarifvl)iftriinn nmu/lh fArlnr Krwlinn nr 
iicpdiiirijinuiny yiuwui lacvoi MiHjiny pi 


047 nn 


41751 S 


L2^03 


Hs>^2237 


ntnrin tnlninitiM'I'n i » nmtm fV9cc/w4ota/4 
olaMirlnaiiyicCIaSId yiuup LrSSSOOolcU 


970 

4.r9 


441362 




Hs^044 


iwjo I (o. veiwisiacj nofnoiog con r\o 


^ 1£ 


440704 


M69241 


Hs.162 


ulsuuiMiw yiuwui lacioi DuiOinQ prow 


1 na 


425322 


U63830 


Ms 1SSfi37 


praiDin Mnoool unn'ocuvaico, Caiaiyuc 




All 991 






SRB7 (suppcessi^ of RNA potymersse B« ye 


2.53 






ns^9 


alooho) dehydrogenasB 7 (dsss IV), mu o 


72.00 


431009 




rK}.*to90o 


gap junwoii pioiBui, ocia 0 (ConnexHi o 


10 OR 

la.90 


409103 




Hs 1 1 9908 


VAfip.l nmlrin 


vAt 


417542 


J04129 


H<:_A99RQ 


(ffOyBSlagoiraSSDCiaiBU OiluuiiitfUidi fiiuu 




426471 


X57348 




Suouiin 


J. 09 


416004 




r».Of 909 


uiODuyua 0 cnyui uy ciioSa o loinny, mciTioor 


1 M 


414761 




H«; 779SR 


onhont^or Af 70cfa fnnseftnhllaX knmnlnA 9 


9 91 
4.01 


451541 




Uc 9Ac;(r7 


ptdKupnuin 0 


1.82 


416203 




n3.09f 30 


r^O^Ofi nmlain Hfigga 0 

\aj\jM piutein Kinase z 


8.00 


447343 




Uc O^ROQA 
n5«£0()09't 


CCTe t^h(u ctmllap 4n CnO'SOO alt4«a.9-ni 

CO 1 0, niyniy simusr w oU£i)fc aipn8>c-fn 


2.7B 


437016 




rK>i9990 


gudniiis rnonpnospndto DynineiaSfl 


4.U1 


429812 




Hs9S9^7 


nittrflanf li ifimr-frtinelnrmlnn 1 

piiuiuHy iuinof"fian9iu' iiufiy i 


9 1A 
4.10 


449230 




ns.£l I3f9 


iiiBianonia cbu eoflesnin nwiecwie 


O CO 
4.aD 






Lie 1R7M 


nypouieuwBi pnwKi ru i iiuo 


4.00 


457819 


AAG57484 




FRTe Hlnhhf ehntbif in iinnpmoft iwnipln 


99it 


410555 


U92649 


Hs.64311 


B Hlctn(f>nrin anri fnalnlllnnfnlAlnaeA >4nma 

o uioutiBynn sna inBiaiiopniiuinBSO ouiiia 


11 M 


430877 


226317 


Ue Q^Cftn 
rK»>9^w0v 


HacriMVilain 9 

uosnivQicin * 


1 M 


424587 


J05070 


(te 1S17% 


msrfrftf mftlfllWwHtflifiBfta Q /nolstinneA R 


9nQ 
4.U9 


414430 


AI346201 


Ffe 76118 


vwifUHin liaiOUAjFriBinwMi twlBlaBB LI 


1 no 

I.v9 


422963 


M79141 


Hs 13234 


ESTs 


99R 
£.40 


418462 


BEQ01596 


Hs 85266 

m<U9£vU 


IntAflrin hnia 4 


1 4n 

I.W 


450832 




Hs 105491 

n9* 1 U9tC 1 


ESTs 


10.01 


410274 


AAafll&O? 


^fi17fi9 


hypoodB*Iiiducibl6 luolsln 2 


1 9R 

1.C9 






n5«*nc90 


niKoclionditel ribosomd protsln S17 


1 DO 
1.09 


458933 




Ms 74763 

n9<£*tf VM 


RAM MnHInn nmtarn 1 


1 1;4 


439394 




no,90 1 U9 


ESTs 


0.O9 


418478 


U38945 


Hs.1174 


rvpfifwlMwnriAni ktnjKA tnMhilnr 9A Imo 


1 77 


453633 




Hs.34045 


himntholW^I mrtlaTn CI )9n7Rjl 
nypOUtBUCal piUlBHi rLtlZUIO't 


1 n 
1.02 


446269 




Hs.14559 


hvrwifhAliriil nrrAtun PI Iin't4n 


0.11 


422765 




Hs 1578 


Ktt^ifntilrol IAD raf^aal-fv^nlglnfna C fciir 
UdUvIOVllal lAr 1 epcBl-wn UW I u \y 9 \SUr 


9 1A 


438481 




Ur (;iQQ 

nd-9 i 99 


HQPP1 (iH nmlatn eimtliv tn iihtmMlm-/*An 

norV/iou pioictn S11T1110 lu uoiquiiirvon 


1 OA 
1.W 


440325 




H3.7164 


a fttelftfaAf4n onH msiallMtmlataaea ^ama 

o Qisintcsnn cuia rncidiiopioicinaso ooiiki 


U.01 


439608 


W79123 




0 nmlaln-^minlaH ra^anfar A7 
0 puloUI-OOUpicQ iBCepttll 0/ 


dUoLW 


453884 






I^AAfMftR nana niwIiiH 

iwvuiloo gene piuouci 


in ec 
iU.09 


452934 




M«491^ 

n3.«ici9 


nypoineucsi proran muiwidaI/ 


1 40 

I.0O 


451743 




He 9^71 


Co IS 


9 on 
4.gU 


413129 




ns,iu<K] 10 


r\r*t* noiTioiog 


2.38 








gbiHum&n psrathyrdd hoTFnone-related pep 


1,00 


413281 


AA861271 


He 999094 


udJiscnpuon lacuM owmx 


5.92 


444781 




r\9,t 199V 


uri-tinciiuica iiMwaiasis>a5oociaiBa piuie 


9 in 


416819 


U77735 


ns«0v^9 




1 ni 

l.Ul 


451320 


AW11fl079 


Hs.89981 




A ft7 

U.Or 


416543 


KIM OfK^TQ 


Re 8RQf>9 


nyoiuiuiion oyniiiaso o 


i 10 

1.13 


454034 




119.919 


flMaKuiHa ffaKuffmaaaa^a ^ f^mHu mamKar 

axjonyijo uBnyQiOgBnase o lanuy, mBinoQr 


4.90 


425397 




Mr 1fiA34R 
nS.lDDO*rQ 


ffifwitentnomea irtMAI 11 abiha f ITAkfll 

uipaisoniefBse (UNnj ii aipna (i rUKUj 




413004 


T35901 




tntnrloiilflii atiKanivr Mfwfffw f«M»l<w 9 Jl 

misnouKin cnnancer DHiuiny iBCiDr C| *» 


1 (U 

1.04 


407634 


AWniR'Xq 

/>f iUI03D9 


He 1^fi414 


UUr'OlCrinC.OclaV^ 06l9'l,9-i>raCBiyi9lUC 


7.04 


415617 


1)88967 


Ue 7RPR7 
nS.f DODf 


protein tyrosine phxphstase, receptor*! 


Qfli nn 
209.00 


435013 


H91923 


H« iinn94 


UA^U*iiKlrtiitaAna avlilfwai^ w«la^a ILA Of\ eiiKii 
nnUriiUDRf UUIOnB DXIOOreoUCiaSB MLnU 6U0U 


1 M 
1.00 


430337 


M36707 




*«alnww4iiltnJ!Va 9 
WOHIMOUInHlHB Q 


1 M 
1.04 








paralhyidd hofmone-Q^e hormone 


81jOO 






Ue 0197 


VIAAA4 Jlit nana nMwIiinl 

i\iAAvi44 gene proauci 


1.03 






Ue 

n8.olour 


nypotneiicai proiein ruiu/Di 


28.00 








Ml T'OinainQ cassene, suD-iamiiy it (\jr i k 


1.24 




Al7f;i7Aft 




VIAAnnCK nana nfvwiiml 

isiAAuusK) gene proQuci 


1.40 


430393 


BE185030 


H8u241305 


estrosen-responsiTO 6 box proton 


1.55 


420462 


AF050147 


K3.97932 


chondromodulin 1 precursor 


1.00 


418876 


Nht001327 


Hs.167379 


cancerftestis sniigen 


0.82 


428182 


BE386042 


Hs^17 


ESTs. Weakly slmilv to GGC1.HUMAN G ANT 


1.00 


427335 


AA44B542 


HS251677 


G8nlioen76 


0.91 


409420 


Z15008 


Ha.544S1 


lamfann, gamma 2 (nic^ (lOOkO). kaDni 


&53 


438956 


WD0847 


H3.13S0S6 


Human DMA sequence ftom done RP5-fi50E9 


1.35 


421917 


A8028943 


H5.109445 


KIAA1020protdn 


0.94 


404440 






38.57 


409582 


R27430 


Hs.271565 


ESTs 


3.19 


415669 


KM.00502S 


KS.785&9 


serine (or cysteine) proteinase inhibtto 


ZA$ 


433091 


Y12842 


Hs^lBS 


lymphocyte snSgon 6 complex, locus D 


1.61 


408122 


AI432652 


Ns.42824 


hypoiheficd protein FU10716 


1.95 



283 



wo 03/042661 



PCTAJS02/36810 



408380 AF123050 Hs.44532 diubiquIQn 7.23 

437412 BE069288 Hs^744 Homo sapiens mRNA;cDNADKFZp547C136(fr 1.35 

449976 W6350 Hs.135056 Human DMA sequence from done RP5-850E9 0.81 

_ 446102 AW168067 Hs.252956 ESTs 1.03 

5 428479 Y00272 Hs.184572 ceO division cyde 2. G1 to Sand 62 to 137.00 

422487 AJ010901 Hs.198267 mucin 4. tracheobronchial 7.91 

423761 NM.006194 Hs.132S76 palmd box gene 9 38.00 
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^ 415380 F07g53 Hs.16085 putative G-piot^ coupled receptor 0.18 

20 423849 AL157425 Hs.133315 Homo sapiens mRN^cONADKFZp761J1324(f 1.00 

446292 AF081497 K5.2796e2 1^ type C glycoprotein 2.09 

429538 BE1B2592 Hs.11261 small proQne-rich protein 2A 6.14 

447269 AW247017 Hs^978 melanoma antigen, family A. 3 1.00 

426004 AA449583 Hs.151393 ghitamate^tetneiigase. catalytic sub 44.00 

25 415091 AL044872 Hs.77910 3-hydroxy-3^n8tllylgluta7l-Coe^zyme A sy 149.00 

416209 AA236776 Hs.79078 MAD2 (mitotic anestdefident. yeast h 127.00 

408572 AA055611 Hs.226568 ESTs. i\ik)derately similar to ALU4.HUly(AN A 20.00 

404996 147.00 

438915 AA280174 Hs.285681 WHilams-Beuren syndrome chromosome regl 1.00 

30 453922 AF053306 Hs.36708 budding uninhibited by benzlmidazoies 1 54.00 

424046 AF027866 Hs.138202 serine (or cysteine) proteinase inhibito 139.00 

430563 AA481269 Hs.108660 ATP-binding cassette, 8ul>familyC(CFTR 22.00 

438702 A1879064 Hs.54618 ESTs 1.00 

444378 R41339 Hs.12569 ESTs 1.00 

35 433485 AI493076 Hs.2019S7 aldo-keto redudase family 1,meniberC2 41.00 

407839 AA045144 Hs.161566 ESTs 7.50 

439223 AW238299 Hs.250818 UL1 6 binding protein 2 139 

409041 AB033025 HsM)81 KIAA1ig9pratein 245.00 

429228 AI553633 Hs,337139 ESTs 10.89 

40 409757 NM-001898 Hs.123114 cystaOnSN 3.19 

411089 AA456454 Hs.1B3418 cell division cyde 2-lilo 1 (PlTSLREpr 0.78 

436511 AA721252 Hs.291S02 ESTs a23 

449207 AL044222 Hs.23255 nucleopodnlSSkD 1.68 

453331 A1240665 H3.8B95 ESTs 5.21 

45 409935 AW511413 H&276025 ESTs 0.75 

428969 AF120274 Hs.194689 atlemin 1.17 

445443 AV653838 Hs.322971 ESTs 1.00 

407137 197307 gb^eSdhOSjl Soaresfetd liver spleen 11.42 

401785 2.76 

50 412723 AA64B459 Hs.335951 hypotheDcd protein AF301222 107.00 

450701 H39960 Hs.288467 Homo sapiens CONAFU12280 lis. done MA 2.17 
405770 2.42 

439453 BE264974 Hs.6566 thyroid homione receptor interactor 13 4.24 

420783 At659838 Hs.99923 ledin. gdadoside-bindlng. sdubts. 7 4.50 

55 414774 X02419 Hs.77274 |riasm!nogen adivator, uroidnase 1.95 

424629 M90656 HS.1S1393 glutamate^telne Dgase. catalytic sub 1.44 

437789 AI581344 Hs.127812 ESTs. WeaMysimaar to T1 7330 hypotheti 1.00 

454098 W27953 Hs.292911 ESTs. Highly similar to S6071 2 band'6i)r 1.33 

453968 AA847B43 Hs.62711 Homo sapiens, done lMAGE:3351295.mRNA 1.11 

60 427441 AA412605 Hs.176053 SPANXIamlly. member C 5.00 
403478 0.78 
400842 0.16 

441525 AW241867 Hs.127728 ESTs 0.79 

452865 Ai924046 Hs.119567 ESTs. WeaWy similar to A47582B-cengr 1.50 
d5 405646 1.05 

427260 AA663848 gb:ae70b0&8l StratageneschizobrdnSI 0.79 

431413 AA504777 Hs.106293 ESTs 1.00 
401994 3.25 
402420 0.05 
70 404298 a64 
404927 88.00 

434105 AW952124 H5.13094 preseniTtfis assodated rhomboid^e pro 0.96 

436981 AW375974 Hs.156704 ESTs 158 

429563 BE619413 Hs.2437 eui<aryotic translafion initiation fador a92 

75 426067 AW664691 Hs.97053 ESTs a97 

428810 AF088236 Hs.193788 nitiic ojdde synthase 2A (indudble, hep a66 

426897 AW976570 Hs.97387 ESTs 1.29 

443892 A1889572 Hs.134791 ESTs 1.00 

413223 AI732182 Hs.191866 ESTs 0.79 

80 413691 AB023173 Hs.75478 ATPase. QassVl. type 118 1.51 

423934 U89995 Hs.15g234 fbrkhead box El (thyroid transcription f 2.59 

425159 NM.004341 Hs.154868 carbamoyl-phosphate synthetase 2. aspait 1.39 

420758 AW297536 HsJ3053 ESTs 0.89 
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423816 


AU)31985 


Hs,133034 


hypothetical protein 


1.QD 


447534 


AW953935 


Hs.30837 


ESTs 


1.68 


451919 


WD5086 


Hs.114256 


ESTa. Weakly similar to 176885 seilneAh 


ail 


409226 


R16811 


Hs.22010 


ESTs. Wedkly similar to 210926QA B eel 


a92 


403715 








a89 


423645 


AA431400 


Hs.98729 


ESTs. Weakly similar Id 201 7205A diliytfro 


1.00 


425734 


ARJ56209 


Hs.1593g8 


pepQdyiglycina alpha-amidaBng raonooxyg 


37.00 


436839 


AA767348 


Hs^1614 


ESTs 


1.00 


413562 


AW295647 


Hs.71331 


hypothetical prolesiMGC53S0 


59.00 


413573 


A)733859 


KS.149D89 


ESTs 


78.00 


430685 


NH.001942 


H3^33 


desmoglein 1 


127.08 


438993 


AA82899S 




gb»d77b08j1 Na^CGAP.0¥2 Homo sapiens 


1.00 


448243 


AVV369771 


K&52620 


integrin, beta B 


133.00 


444783 


AK001468 


H5.62180 


anfifin (Drosophila Scraps homolog), ed 


232.00 


426427 


M86699 


Hs.169840 


TTK protein kinase 


66.00 


422956 


BE545072 


Hs,122579 


hypothetical protein aJ10461 


148.00 


445537 


AJ245671 


Hs.12844 


EGF-iiko^tomain, multiple 6 


40.75 


453392 


U23752 


Hs.32^ 


SRV (sex determinlno region YHmx 1 1 


13.x 


424905 


NM,002497 


Hs.153704 


NIMA (never in mitosis gene a}4e)ated k 


106.00 


400298 


AA032279 


Hs.61635 


six transmembrane epithelial antigen of 


159.00 


415989 


AI267700 


HS.3175B4 


ESTs 


19&00 


408000 


LI 1690 


Hs.620 


bullous pemphigoid anSgen 1 (230/240kD) 


32.44 


453160 


A!263307 


Hs.2398d4 


H2B histone family, member L 


7.00 


409361 


NH.005982 


Hs.54416 


sine oculis homeobox (Drosophila) homolo 


4.13 


416208 


AW291168 


Hs.412g5 


ESTs. W^aMy siroilar to MUC^UMAN MUaN 


45.00 



mem. 



Pkey: Unique Eos probeset idenlifier number 
CAT number Gene cluster number 
Accessions: Genbank accession numbers 



PReif 

411880 
412296 
413804 
414221 

415327 

427260 
431322 
431364 



432222 
437214 
437240 
438993 
439780 
444163 
451844 
453823 
454789 
456034 



CAT number Accessions' 



1263110.1 
1288043 1 
1390710J 
142695.1 

1534137J 
276598^1 
331543.1 
33264J 



343347.1 

434730.1 

435139.1 

467651.1 

47673.1 

593658.1 

888230.1 

982526.1 

1234742.1 

142696.1 



AW872477BE088101T05990 

AW936233AW936272 

T64682BE168190BE1682S6 

AW450979 AA136653 AA136658 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BEOl 1212 6E011359 BEOl 1367 
BE011368 BE011362 6^)11215 BEOl 1365 BE(»11363 
H22769 R35182 Z43S45 F05783 N92089 H71928 
AA663848 AA400100 AA401424 

AWg70522 AAS03009 AA502g98 AA502989 AA502805 T92188 

BE158000 BE157999 H75671 H70965 C18895 BE386512 BE385815 BE390298 AI341995 BE074534 AA055592 AA132265 AI733757 

AA134504BE145037AA055887BE070191 R66492 AW85B0ie AW856056AW817057 AW862031 AW661688 AWB62029AW858806AW858792 

AW862028 AW858017 AW819164 AW853698 A1522161 AW854789 AW817408 BE152005 AI73241 1 AA133064 

AI204995 AW827539 AW969908 AW440778 AA528756 

BE092336 6E092259 BE092497 BE092051 AA746882 Ai336378 

AA747537 BEQ89068 6E089070 

AA828995 AA834879 A1926361 

AL109688 R23665 R26578 

A1126098AI184746A1148521 

T61430AI820546AI821336 

AL137967BE064160BE0641B6 

BE156314 BE156316AW820750 

AW450g79 AA136653 AA136656 AW419381 AA984358 AA492073 8E168945 AA809054 AW238038 BEOl 1212 BE011359 BEOl 1367 
BE011368 BE011362 BE011215 BE011365 6E011363 



TABLE 24C 



Ptey: Unk)ue number oonesponding to an Eos piobeset 

Refi Sequence source. The 7 digit numbers In thb column are Genbank ldentiiiar(GI)numbei8. "Dunham L el aL' refers to the pUblicalion entitled The DNA sequence of 

human chromosome 2Z' Dunham I. etaL. Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLpositlon: hidlcatssnucleotkie positions of predkited axons. 



Pkey 


Ref 


Sbnd 


■ WLposhlon '■ ■ ■ 


400751 


7331445 


Minus 


35395-35533 


400842 


1927148 


PhB 


90462-90873 


400843 


9188605 


Rus 


5863^97a7653-7784,6892.9023,g673-9807.10634-10789,152S4.15403.23827-239S8 


400844 


9188605 


Plus 


24746-24872.25035-25204 


400846 


9188605 


Plus 


39310-39474 


401486 


7341763 


Plus 


32585-32756.36281-38540.4079140933,4401844179 


401747 


9789672 


Minus 


118598-118816,119119-119244.119609-119761.120422.120990.130161-130381.130468-130593.1^ 








131932.132451-132575.133580-134011 


401760 


9929699 


Rus 


8312&83250.85320^5540.94719-95287 


401780 


7249190 


Minus 


28397.28617.28920-29045,29135.29298.29411-29567,29705.29787.30224-30573 


401781 


7249190 


Minus 


83215-83435.83531.83656.83740«3901.&4237-84393.849S5-85037.86290-86814 


401785 


7249190 


Wism 


165776.165996.166189.166314,166408-166569.167112-167268,167387.167469.168634-166942 


401994 


4153858 


Minus 


4290443124.43211-43338.44607-44763.4519945281.48337-48732 


402001 


9501818 


Rus 


68052-68223 
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402034 
402075 
402230 
402305 
402420 
402777 
402778 
402901 
403085 
403381 
403478 
403715 
403903 
404148 
404296 
404440 
404875 
404927 
404977 
404996 
405033 
405064 
405494 
405531 
405646 
405770 
405932 
405081 
406117 
406360 
406467 



7684482 
8117407 
9966312 
7328724 
9796339 
9588235 



6834222 
8954241 
9438267 
9958258 
7239869 
7710671 
9863703 
9944263 
7528051 
9801324 
7342002 
3738341 
6007890 
7107731 
7658416 
8050952 
9665194 
4914350 
2735037 
7767812 
9123861 
9142932 
9256107 
9795551 



Minus 

Rus 

Minus 

Plus 

Rus 

Rus 

Rus 

Minus 

Rus 

Mmus 

Rus 

Plus 

Minus 

Phis 

Minus 

Rus 

Rus 

Rus 

Minus 

Plus 

Minus 

Rus 

Minus 

Rus 

Rus 

Rus 

Minus 

Minus 

Plus 

Minus 

Rus 



86227-88451 

121907.122035.122804-122921.124019-124161.124455.124610.125672-126076 

29782-29932 

40832-41362 

129750-129919 

126786-126948 

128560.128702 

175426-175667 

165035-165334,165420-165713 

26009-26178 

116458-116564 

85128^292 

10116S.102597 

78218-78418.79571-79709 

73591-73723 

80430-81581 

96588-96732.97722.97831 



43081-43229 

37999-38145.38652-38998.3972749872.40557-40674,42351.42450 

142358-142546 

81207-81416 

70284-70518 

35602-35803 

741-969 

61057-62075 

123525-123713 

38115-38691 

54304-54584 

7513-7673 

182212 182958 



TABLE 25A: 691 genss upregdated in head and neck cancer relaOve to nomel body tissues 

Table 2SA Esls about 691 genes upregulated In head and neck cancer relative to normal body tissues that are nicely to encode proteins amenable to modulation by small molecules, 
peptides, or antibodies. These genes were selected from 59680 probesels on the Eos/Affymetrix Hu03 Genechip an^, Gene expression data tor each probeset obtained from this 
analysis was expressed as average intensity (Al), a normalized value retlecSno the relaiive level of mRNA expression. The protein products of these genes often contain one or more 
domains Indicative of have oncogenic function or of transducing tntraoelhilaf slgnats, or of being modulatabie by sntiti mdlecules, peptides, or antibodies (e.g. pklnase. death-domain, 
7tm, phosphatase, or ion.transporter). Certain predicted protein domains are noted. 

Rey: Unique Eos probeset idenliiier number 

ExAccn: Exemplar accession number, GenBank accession number 

UniGenelD: UniGene number 

Red.PralDomains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domain; SS. signal sequence; =Y, very likely to contain; =M, 

likely to contain; other protein domain abbreviations are from PFAM (Nucleic Acids Research, 2002, 30:276-280). 
tJniGene Title: UniGene gene title 

R1 85th peroentile of head and nedc cancer Als divided by the 50th percentile of nonnal tissue Ats, where the 10th percentile of aO normal tissue Als was 

sut)tracted ftom bcXt\ ttie numerator and denominator 

Pkey; ExAocn; UnlgenelD; Unigene Title; Pred.ProtDomalns; R1 

422168; AA586894; Hs.1 1 2408; S100 calduiivbinding protein A7 (psorlas; e8jBnd,SJ00;TM=M;SS=N; 46.25 
408522; AI541214; Hs.46320; Small proDne-rlch protein SPRK [human, ; none,Comlfin: 40.37 
417366; BE185289; Hs.1076; small praHne^fch protein IB (comifin); Comifin;TM=M;SS=N; 38.94 
401781; ; ; Target Exon; filament:TM=M:SS=N: 29.74 

422158; L10343; H3.112341; protease Inhibitor 3. skln^Jerived (SKAL; wap:TM=M;SS=Y; 29.54 
401780; ; ; NM_005557*:Homo sa^ keratin 16 (tbca; filamentfiiament; 28.58 
424098; AF077374; Hs.139322; smafi proDne-rich protein 3; Comifin;TM=M;SS=N; 28.55 

421948; 142583; Hs.334309; keratin 6A; filament.RhoQAP,DUF286,bZlP,Tropomyosin,tubuIin,DUF164.TBCA,Conagen:TM=M;SS=N; 25.74 

428471; X57348; H8.184510; stratifin; 14-3-3;TK^M;SS"N: a65 

417079; U65590: Hs.81134; interleiddh 1 receptor antagonist: HI;: 21.02 

421574; AJ000152; Hs.105924; defensin. beta 2; E)efenslnJ)et^TM=M;SS=M: 20.83 

409601; AF237621: Hs.80828; keratin 1 (epidermolyfic hyperkeratosis); filament.bZlP,UvrD.heIiC8se,TBCA;TM=M;SS=N; 20.72 

433091; Y12642: Hs.3185: lymphocyte antigen 6 oomptex, k)cus 0; UPAfl.LY6.toxin/cthrinjecp;TM^M;SS=Y; 19.63 

446292: AF061497; Hs.279682: Rh fype C glyoopfoteln: Ammonlum.trBnsp,FecCO:TM"Y:SS"M: 19.53 

420783; A1659838: Hs.99923: lectin. g^actosUe-Unding. solubie, 7 ; Gat-bindJeclin;TM=M;S&=N; 19.12 

407788; BE514982; Hs.38991; S100 catdunvbinding protein A2: efhand,S J 00,8.1 OO.efhand: 17.93 

416091; AF295370: Hs.283082: defensin. beta 3; De(ens!n.beta;TM^SS»M; 17.63 

431211; M86849; Hs.323733; gap junction protein, beta 2. 26kD (conn;connexin;TM=Y;S&=M; 16.94 

429259; AA420450; Hs.380O88; RakophQin; none,none; 14.92 

417515; L24203; Hs.82237; alaxia^elangieclasia group D-assodated; zl^.box.zf-U8R1;TM=M;SS=N; 14.75 

423634; AW959908; Hs.1690; heparin^unding growth factor binding pr. none;TM=M;SS=M; 14.45 

418007; M13509; Ks.83169; matrix metaOoproteinase 1 (IntefsHeal; hemopexin,Peptid8$e.M10;\staclnP6„bind]ng.1;: 13.02 

409632; W74001; Hs.55279; serine (or cysteine) proteinase inhib'to; serpin;; 12.82 

406621; X57609: Hs.1B1125; immunoglobulin lambda kxnis; ig.HSP70,Ppx-GppA;TM=M;SS=N; 1Z81 

431958; X63629; Ks.28n; cadherin 3. type 1, P-cadherIn (placenta; cadherin.C8dherin.Cjerm;TMsY:SS»M; 1Z45 

446921; AB0121 13; H5.16530; smaO inducible cytokine subfanily A (Cy; IL8;; 11.71 

401760; : ; Target Exon; none,bromodomaln; 1 1.68 

407839; AA045144; Hs.161566; ESTs; cadherin.cadherin; 1 1,65 

454034; NM.000691; H5.575; aldehyde dehydrogenase 3 family, member ; aldedh- 1 1 .56 

444781; NM.014400: Hs.119S0; GPI-anchoied melaslasis-assodated prole; UPAfCLY6.1actBmase.B; 11.31 
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453857; AL080235; H5.35661; RasOnduced senescanoe 1 (RlSl); none:TM=Y;SS=M: 1 1.03 

424012; AW368377; Hs.137569; tumor pretein 63 kOa m&i strong homolog; SAM.P53;TM=M:SS=N: ia75 

430630; AW26S92Q: Hs.2621; cystatin A (stefin A): cy8tatin;TM=M;SS=N; 10^ 

419693: AA133749: HS.3013SO; HOTD domdiMniMIng ton transport reg; ATP1G1_P1M.MAT8:TM=>Y;SS=M: mSO 

411274; NM.002776: H5.69423; kallitoBln 10: tiypsin;TM=M;SS=N; 10.25 

441633; AW958544; H5.1 12242; normal mucosa of esophagus specUtc 1; none;TM=M:SS^M; 9.84 

446989: AK001898: Hs.16740; liypoiheBcsl protein FU1 1038: none:TM»Y:SSsN: 9.74 

402075; ; ; ENSP00000251058*:Pla5ma membrane calcium; nonen 9^ 

444381; BE387335; Hs.283713; hypothetical preletai BC014245; C6llagen:TM»M;SS»M: 9.S0 

431009; BE149762; Hs.48956: gap junction proton. Iieta 6 (connexin 3; connexin:TM=Y;SS=M: a48 

433310; AF086120; H5.102793; ESTs; casein_kappa,p)dnasejg.non8; 9.43 

414987: AAS24394: Hs.294022; hypothetical proteb FU14950: SH2;TM=M:SS=N: 9.33 

418004; U37519: Hs.87539; aktehyde dehydrogenase 3 family, member ; afdedhn'M=M;SS=M; 9.14 

408000; L1 1690: Hs.198689; buflous pemphigoid antigen 1 (23Q/240i(D): eih3nd.specirin.GAS2.SH3.Plect]n.RA.Xy!ose_iso(i\RO.bZ!PJn)pomyosin.Myc- 

lZM.Wh_C,CH>UP3;TM=M;SS=N; 9.12 
451541; BE279383; Hs.26557; platophiQn 3; ArmadHto 8eg;TM=M:SS^ 9.11 
425650; NM.001 944; Hs.1925: desmoglein 3 (pemphigus vulgaris antigen; cadhei1n;TMsM;SS«M: 8.66 
452240: Ai591147: Ms.61232: ESTs: none,none: 8.57 
429228; AI5S3633; Hs.356828: ESTs; none,none; &46 

400289; X07620: Hs.2258; matrix metaOoprotelnase 10 (stromelysin; hemopexIn,Peptidasa_M10.Astacin:; 8.44 

425071; NN1.013989: Hs.154424; delodinase. iodothyronine. type II; T4.deiodlnase;TM=M:SSsY; aiS 

407242; M18728; ; gbrHuman nonspecific crossreactlng anDg; lg;TM-M;SS=M; 8.05 

407944; R34008; Hs.239727; desmocolSn 2; cadhei{n.Cadherin_C_term.Hanta.G2;TMsY:SS=Mi 7.90 

413276; BE563G85; Hs.833; intertsron^timulated protein, ISiiDa; ubiquiiin;: 7.82 

428330; L22524; Hs.2256; matiU metatloproleinase 7 (mabilysin, ; Peptid3se_M10n 7.82 

417308; H60720: HS.B1892; KIAA0101 gene product none;TM=M;SS=N: 7.77 

413753; U17760: Hs.75517: iamlnln, beta 3 (nicein (125kD), Icallnln; lamlnln_EGF4amlnin„Nlerm:; 7.76 

423217; NVL0D0094; H5.1640: coilagen. type VII. alpha 1 (epidermotys; Kunitz^BPTl,fn3,\wa,Coitagen,bela4actama5e;TM^:SS«M; 7.71 
430685; NM.001942; Hs.2633; desmoglein 1; cadheTin,Cadherln„Cjerm;TM=Y;SS=M; 7.69 
412270; AC005262; Hs.73797; guanine nudeoDde Ending protein (G pr; G-8]pha.8r1;TM=M;SS=N; 7.54 
428484: AF104032; Ks.184601; solute carrier family 7 (cationic amino ; 

aajJermeases.pyiWoxaLdeC,bromodoma!n,PHD.MBD;^T^hook.t)OT.B3_R4Jdnase.FAT.FATaB^ 
418663; AK001100; Hs.41690; desmocoilin 3; cadherin.Cadlierin.C_term,none; 7.30 
452281; T93500; H5.28792: Homo sapiens cDNA FU11041 lis. done PL; TGFb.pfOpeptideJGF-b6ta.none; 7.28 
42921 1; AF0S2693: Hs.198249: gap Junction protein, beta 5 (connexin 3; oonnBxin:TM»Y;SS^ 7.26 
412719; AW016610: H5.8I6; ESTs; none.none: 7.17 

446619; AU076643; Hs.313: secreted phosphopntein 1 (osteoponOn. ; Osteopontfn;; 7.10 

423961: D13666; Hs.138348; periosOn (OSF.2os); Fascicfin;TM=M;SS=M; 7.09 \ 
427666; Al7914g5; Ks.180142; cdmoduMto skin protein (CLSP); efhand;TM=iUl:SS=N; 7.08 
431846; BE0ig924; Hs.271560; uroplakln IB; (ransmenibrane4;TM=Y;S&^M; 7.06 

423673: BE003054: Hs.1695; matrix metaDoprotelnase 12 (macrophage ; hemopexin.Peptidase_M10;TM^A:SSsM: 7.03 

401747; ; ; Homo sapiens Icerafin 17 (KRTIT); none,brocnodoma!n; 7.01 

413859; AW992356; Hs.6364; Homo sapiens pyruvate dehydrogenase kina; SAM.PNT.nQne; &98 

429002; AW248439; Hs.2340: )unctlon plakogtobln; ArmadtDo^eg:TM=M;SS=N: 6.96 

432239; XB1334; Hs.2936; matrix melalloproteinase 13 (ooHagenase; hefT»pexin.PepUasBjll1Q:; 6.87 

417715: AWg69587; Hs.86366; ESTs; none.none; 6.72 

422440; NIUL004812; Hs.1 16724; aUo-keto reductase family 1. member BIO; aIdo_keUed.ROK;TM=M;SS"N: aSO 

429359; W00482; Hs.2399; matrix metalioprotelnase 14 (membrane^n; hemopexin.PepSda5e.M10:TM=M;S8sM: 6.39 

418844; M62982; Hs.1200; arachidonata 12-Cpoxygenase; lipoxygenase,PLAT;TM=M;SS=N; 6.38 

420039; NM.004605; Hs.376147; sulfotransferase family, cytosdlc. 2B. ; Sulfbiransfer;; 6.38 

425397; J04088; Hs.156346; topoisomerase PNA) II dpha (170kO): DN/L8yrase6,DN/LlDpoisolV,HATPase.c~ 6.35 

442599; AF078037; Hs.324051; RelA-assodaled inhibitor. SH3.ank:TM=M;SS=N; 6.30 

409420; Z15008; Hs.54451: laminin. gamma 2 (nicein (lOOkO). kalini; laminln.B.lamlnin_EGF:; 6.28 

424364; AW383226; Hs.163834: ESTs. Weakly similar to G01763 atrophin-; ras;TM=M;SS==N: 6.27 

414812; X72755; Ks.77367; monokine induced by gamma Interferon; tL8;TMsM;SS»Y: 6.23 

424687; J05070; Hs.1S1738; matrix metattoprotelnase 9 (gstatinase B; fn2,hemopexjn.PepQdase.M10;: 6.22 

443426; AF09815B; Hs.9329; chromosome 20 open reading frame 1; none:TM«M:SS=N; 6.21 

428970; BE276891: Hs.194691; retinofc acid Induced 3 (RA1G1); metabp; 7tm_3:TM=Y;SS=f<; ai2 

423017; AW1 78761; Hs.227948; serine (or cysteine) proteinase inhibilo; serpin;; 6.08 

424834; AK001432: Ks.153408; Homo sapiens cDNA FU10570 fis. done NT; none,nona; 6.08 

426440; BE3827S6: Hs.169902: sdute carrier family 2 (faciiltated glu; &ugar.tr;TM^Y:SS=M; &04 

439335: AA742697: H&.62492: NKL052863.'Homo sapiens 8ecfelogk}bin. fa; none;: 5.81 

439223; AW238299: H8.250618: UL16 Unding protein 2; IdLrecepta.PKO.MHC.I:TM=M;SS»Y; 5.77 

418054; N^L002318: Hs.83354; lysyl oxidase 2; SRC(^LysyLoxidase;TM=M;SS=M; 5.72 

454098; W27953: Hs.217493: Rakophilin; none.none; S.71 

417900; BE250127: Hs.62906; COC20 (ceO dhrlston cyde 20. S. cerevi; WD40;TM=M:SS»N: 5.70 

435505; AF200492: Hs.211238: interteuMn-1 homotog 1; IL1;TM=M;SS^ BJBB 

406685; M18728; ; gbiHuman nonspecific crossreactlng ant^; lg;TM=M;SSsM: 5.67 

430280; AA36125B; Hs.237668; interleukin 7 receptor; fh3,none; 5.63 

430486; BE062109; Ks.241551: chtoride diannel, caldum activated, fam; none;TM>Y;SS»M: 5.61 

449722; BE280074: Hs.23960; cyclin B1; cydin.cydin_C:THA=M;SS=N: 5.61 

439806; W79123; Hs.S6561: G proteirvcoupled receptor 87; 7tnu1:TMsY;SS^ 5.60 

452662; AW378065: Hs.8687; A0AMTS2 (a disintegrin^ and metaito: Pep.M12BjKopep.tsp.1,f^eproiysin.none: 5.58 
433662; W07162; Hs.150826; RAB25 RAB25, member oncogene family; r3S.ABC,tranMTM=M:SS=M; 5.57 
411295; BE207307: Hs.101l4; growth suppressor 1; 20G-Fell_0xy;TM=M:SS=M; 5.55 
433848; AF095719: H3.93764; carhoxypeptkJase A4; Zn_carbOpeptPropepJ414;; 5.54 
41681^. U7773S; Hs.80205; pim-2 oncogene; pkinase;; 5.48 

426368; BE440042: Hs.63326: malrbc metaDoproteinase 3 (stromelysin ; hemopexin.Peplidase_M10.Astadn:; 5.47 

452747; BE153855; Ks.61460; Ig superfamily receptor LKIR; ig,Rhabd^lycop;TM3Y;SS=M; &46 

444946; AW139205: Hs.156457; hypothetical protein FU22408; abhydrotase.abhydrolase_2;TM»Y;SS»M; Sl42 

413719; BE4395e0; Hs.75498: smaO indudble cytoldne subfamDy A (Cy; IL8;; 5.35 

445033; AV652402; Hs.72g01; cycGrnjependent kinase inNbHor 2B (pi; ankn 5.28 

418462; BE001596; Hs.85266; integrin, beta 4; fn3.lntegHn.B.Cab(^ta,EGF;TM=M:SS=M; 5.26 



287 



wo 03/042661 



PCTAJS02/36810 



429554; NM.01227S: Hs.207224: Interlsuktn 1. delta; IL1:TM:=M;SS=N: 5.14 

421S0B; NM_004833; Ks.105115: absent in ni8iau>ina 2; PAAO.DAPIN.KIN:TM=M;SS=N; 5.13 

439979; AW600291; Hs.6823: hypolhe&al protein RJ10430; none;TM=M;SS»N; 5.11 

427099; AB032953; Hs.173560; odd Ozfi&m twnolog 2 {Drosophfla. mous; NHl4TM=M;SS^N; 5.11 

428227; AA321649; Hs.2248; smSD Mudbte cytokine sutifamHy B {Of, tL8;TM44;SS»Y: 5.08 

436398: AI683487; Hs.1522U wingless^ MMTVlntegratiQnsItB fami: wnt^: 107 

40669% M29540; Hs.220529: csrdnoeniljryonic anSgen^ed ceO ad; ig:TM=M:SS=M; 5.05 

453905; NM»002314: Hs.36566; LIM domdn Hnaso 1; pldnaso.LIM.PDZ^ARP,TM=M;SS=N; 5.04 

414035; Y00630; Hs.75716; serine {or cysielne) proteinase InhMo; sei|snr, 5.00 

413219; AA878200; Hs.1 18727; Homo sapiens cONA FU13692 fis, clone PU HlH,death.TNFf^c6>\cyl-CoAuliydfD; 4.96 
421508; BE302796; Hs. 106097; thynddine kinase 1. soluble; TK;TM=M;SS=N; 4.93 
412140; AA219691; H5.73625; RAB8 interacting, klnesitv^ike (rabMnes; kinesin.Tropomyosin;TM:=IUt;SS^ 4.92 
445537; AJ24S671; Hs.12B44: EGlMk&<Jomain. muifiple 6; EGF,MAM;; 4.91 

428953; AA306§1 0: Hs.348183; tumor necrosis (actor receptor superfaml; 60s jfl)OSomal^bosomaLL10.TNFKjB6.DEADn 4.90 

436553; AW4071S7; HS.18112S; Immunogiobufin lambda locus; ig.HSP70.Ppx-GppA;m=ll^;SS»N; 4.69 

447343; AA256641; Hs.236894; ESTs, Highly similar to S02392 alpha-2-m; none.none; 4.84 

430024; AI808780; Hs.227730: Integrin. alpte 6; lntegiin_A,F&GAP;TM=Y;SS=M; 4J1 

439706; AW872527; Hs.59761: ESTs. WeaWy similar to DAPl.HUMAN DEATH; none.none: 4.80 

444371; BE540274; Hs.239: foriOiead box Ml; FbrtLhead;TM=IVI;SS=N: 4.75 

428582; BE336699: Hs. 185055; BENE protein; none;TM=Y;SS=M: 4.74 

419596; BE379320: Hs.91448; MKP-l Hke protein tyrosine phosphatase; DSPcn 4.69 

431S30; I^M_002204; Hs.265829; Integrfn. alpha 3 {antigen C049C; alpha ; FGW.Rh8bd jlycop,integrinJ^TM=Y;SS=M; 4.69 
422310; AA316622: Hs.98370; cytochrome P450, subfamily IIS, po^pept; none.pkin3se,fn3.ig; 4.68 
418067; AI127958; H3.83393; cystatin m cyslaDn;; 4.66 

414774; X02419; Hs.77274: plasminogen activator, urokinase; kringte,liypsln,pianLlhionlns:; 4.64 
45B534; X91 195; H5.100623; phosphoSpasa C. beta 3. neighbor pseudo; UHPOZ,pkinase;: 4.62 
410418; 031382; Hs.63325; transmembrane protease, serine 4; ld]jecepLa.trypsin:TM=Y;SS-M; 4.60 
417866; AW067903; Hs.82772; cdlagen. type XI. alpha 1; C0liagen.COlJFI.TSPN.Iamlnln_G,CorAs 4.60 
4381 13; AI467908; Hs.8882: ESTs; 7tm-l^one; 4.60 

418140; BE613836; Hs.83551; microfibnitar-associated protein 2; none;TM=M;SS=l^ 4.57 
408380; AF123050; Hs.44532; diubk;uitin; ubiquilin;TM=M;SS=N; 4.55 

422627; BE335657; Ks.116787; transfoming growth factor, betfrinduced; Fascic6n.ABC.tran.A6C.membrsne,6TP_EFTU;TM^SS=^M; 4.50 
425247; NM.005940; Hs.155324: matrix inetalk}proteinase 11 (stromelysin; hemopexin.PeptidaseJA10;; 4.50 
418558; AW082266; Hs.86131: Fas (TNFRSF6}-associated via death domal; death,DEC);: 4.49 
408482; NII/L000676; Hs.45743; adenosine A2b receptor; 7tm_1:TI\*=Y;SS=M; 4.48 

414166; AW888941; Hs.75789; N-myc downstream regulated; DEAD,heIicase^CmNdr,CysJaiot,TlUvwa,m!C,vwd.lQ,Rlta.ablv^ 

beta,DUF139.TPR.DSPt:.tspJ.Ribosom8l_S21.rvp;T^Ny1;SS^N;4.47 
416178; AI608527; Hs.192822; seroiogk»By defined breast cancer anti; none;TM=M;SS^ 4.47 
41 1789: AF245505; Hs.72157; Adltean; lg.LRR,U^RNT,lPRCT;TM=M;SS=M; 4.47 
414561; A1064813; H5.195155; Homo sapfens amino acid transport system; Aa^trws;TM=Y;SS=N; 4.47 
422765; AW409701; Hs.1578; baculoviral lAP repeatcontaining 5 (sur; BII^TM=M;SS=N; 4.45 
427557; NM.002659; Hs.179657; plasminogen acOvator. tffokinase recepto; UPAfLLY6,ET,PLAajnh:; 4.43 
416322; AA284166; H5.84113; cyclin-dependeni kinase inttiUtor 3(C0K: Yj)hosph^,OSPC;TM»M;SS=N; 4.42 
409041; AB033025; Hs.50081; Hypothetical protein, XP_051880(KIAA119; none;TM«M;SS=M; 4.41 
406908; Z25437; ; gb:iisapien$ protein-tyrosine kinase gen; none,nona; 4.40 
450701; H39960; Hs.288467; hypolhelfcal protein XP.098151 (teudne.; none,U«CT,LRR; 4.40 
409213; U61412; Hs.51 133; PTK6 protein tyrosine kinase 6; SH2,SH3.pkinase;TM44;SS=N; 4.38 
429500; X78565; Hs.289114; hexabrachion {tenasdn C, cytotaclin); EGF,fn3,fibrinogen„C,toxln_2,Keratin_B2;TM=M;SS=Y; 4.38 
448569; BE382657; Hs.21486; signal transducer and activator of trans; SH2.STAT.STAT.bInd.STATjjrat;TM=M;SS=N; 4.32 
423725; AJ40310B; Hs,132127; hypothetical protein LOC57822; nofte;TM=M;SS=N: 4J2 
411573; AB029000; Hs.70823; KIAA1077 prolan; Sulfatase;TM=M;SS=N; 4.31 
408243; Y00787; Hs.624; Interieukin 8; HLH.PAS,ll.8;TM=M;SS=N; 4.31 
418738; AW38B633; Hs.6682; solute canier fan% 7, {catlonte amino; none.none: 4.30 
437897; AA770561: Hs.146170; hypotheScal protein FU22969; zfOHHQnone; 4.29 
424247; X14008; Hs.234734; lysozyme {renal amytoidosis}; lysJgJ'AO.SynlhJdh.UtLQpklnass;; 4.29 
414821; M63835; H5.77424; Fc fragment of IgG, high affinity la. re: |g;TM»Y:8SBM: 4.29 
404996; : ; Target Exon; Peptldase_C1;TM='M;SS=M; 4.29 

416539; Y07909; Hs.79368; epithelial membrane proton 1; PHff>22_Claudin.oxldored_q5_N;TWNY;SS=M; 4.28 

409142; A1.136877: Hs.50758; SMC4 (stnictural maintenance of duomoso:ABC.traaM,SMC.KSiyiC_C.OUF164.no^^ 4.25 

421532; AW138207; Hs.146170: hypothetical protein FU22969; Armad!DDj6eg.HEAT:TiyNM:8SsN: 4.25 

424503; NM_00220S: Hs.149609: tntegrfn. alpha 5 (Rbronedin receptor,: integrfnJ\.F&GAP:TlWNY;SS=l4: 4.24 

414809; AI434699; Hs.77356; transfenin receptor (p90, CD71); PA;TM=Y;SS=N; 4.24 

439720; AI935202: Hs.31 181; Homo sapierw cDNA: FU23230 fis. done C; none.SOF^sugar.tr; 4.23 

437044; AIJ035B64; Hs.69S17: diiferentiaiiy expressed in Fanconfs an; none:TM=M;SS=M; 4.23 

409956; AW103364: Hs.727; Inhibin, bete A (acSvIn A. adhfin AS a; T6F4)etaiTGFbjn)pepf]de.Tiib:; 4.20 

439453: BE264974; Hs.6S66; thyroid honnone receptor interador 13; AAA.ABCjr8n.(>»E:TM:>MtSS»N; 4.20 

417389; 8E260964: Hs.82045; mldkine {neuHte growth^iromoting factor. PTHJ4K:TM»M:8S»Y: 4.19 

407137; T97307; ; gb:yeS3h05.s1 Soares fetal Ever spleen ; 60A1.CD39.non8: 4.18 

419235; AW47041 1; HS.2B8433: neurotrimtn; none.none; 4.18 

410290: AA402307: Hs.322844: hypothetk^al protein DKFZp564A176; Sema,PSI.T1G^tegrin.6;TM3Y;SS=M; 4.18 

456906; AF1 17646; H8.156637; Cas^r^ (murine) edropte retroviral ti; zf^C4.CbLN.CbLN2.Cbl.N3;TM=M;SS=N; 4.17 

448775; A6025237; Hs.368; nudbc (nudeoskie dlphosph^ Unked moi; NUDIX;TM=M;SS=^4; 4.17 

400288; X06256; Hs.149609; totegrin. etpha 5 {fibrcnectin receptor.; lntegrin-A.FG<5AP;TMaY:SS=N: 4.14 

409799; 011926; Hs.76845; phosphoserine phosphatasfriike; Hydrotase;TM=M;S&44; 4.13 

445417; AK001058: H5.12680; a disintegrln-fiks and metaHoprotease w; tspJ.Reprdy$iivPep_M12Bj}ropep,none; 4.12 

433895; A!287912; Hs.3628; mitogen-adivated protein kinase kinase ; pkina$e.zf-C4,CNH.ERM:T(y|=M;SS=N; 4.12 

424490: AJ278018; Ks.55555; ankyrin repeal domain 3; ank,pklnase;TM=M;SS=N; 4.09 

4191 21; AA374372; Hs.69626; parathyraU hormone^ hormone; none.none; 4.08 

416602; NM..006159; H5.367895; Protein khiase Wandlng protein NEa2; EGF.wifcTSPN;; 4.07 

424008; R02740; Hs.l37555; putathrachemoWne receptor. GTP-blnding; 7tnLl;Tiyi=»Y;S$=M: 4.07 

427747; AW41142S; Hs.18065S; serineAhieonine kinase 12; pklnase;TM=M;S&:N; 4.06 

4Z7490C Z951 52; Hs.178695; mitogen^dhfated protein kinase 13; pkinase;Tiyi»M;SS»N; 4.03 

439738; BE2465Q2; Hs.9598;Gema domain, fanmunosltobtdindomairiOgK:Sema^,to^ 
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414683: AA926960: Ks.346669: COC28 protein Mnase 1; GKS.-; 4.02 

413186: AU0ni41: Ks.374548: sdutB earner temDy 16 (monocarboxyQc; su9ar.trJM=Y;SS^M: 4.01 

418203; X54342: Hs.83758; COC28 proldn Unass Z CKS:; 4i01 

406908: Z25424; ; pbcH-sapfere proteSn-sefineflhrconinB 1^ none,none; 3i98 

450375: AA009647: Hs.352537: a dishtegrin and metaOopruleinase doma: Repn)lysln,f^JA129..prapep/fisintegrin.ltepi^^ 3.98 

410887; U24389; Hs.65436; lysyl oxidaso^ke 1; LysyLoxkJaso;: 3.96 

410342: R3135Q; Hs.743: Fc fraoment of IgE. high affinity I. rec; ITAM:TM=Y;SS=M; 3.95 

425849; AJOOOSI^ Hs.298323i senimllBlucoQortond regulated kinase; pkinase4)Unase.C;TM»M;SSsM; 195 

417433: BE270266: Hs.8212a: 5T4 oncofetal trophoblast glycoprotdn; LRR.LRRNT,IHRCT:TM»Y;SS=M; 3.85 

427792: M63928: Hs.180841: tunwr necrosis factor receptor superfami; SRP14.TNFR.c6n 193 

407792; A1077715: Ks.39384; putative secfeted ligand homologous to f; none;TI\A=W:SSsY: 3,91 

424441; X14850: Hs.147097: H2A hlstone famDy, member X; hislone.C8FD_NFYB.HMF:: 3.91 

415989; AI267700; H3.351201; ESTs: none.none; 3.90 

423189; M59371; Hs.171596; EphA2; fa3.pWnasa.SAM.EPKJbd:TM=Y:SS=M; 190 

443859: NM_013409: H5.9914: folOslatin; kazal;; 189 

429612; AF062649; Hs.252587: pitiitary tumor-transforming 1; nonen 189 ^ 
419073; AW372170: Hs.183918: Homo sapiens cONA FU12797 lis, clone NT; dealh,ZU5;; 188 
450684: AA872605; Hs.2S333; interteukin 1 receptor, type II: ig;TM=Y;SS^M; 3.88 
428450; NM_014791; Ks.184339; KIAA0175 gene product; KA1,pldnase;TM=M;SS=N; 186 
413441; AI929374: Hs.75367; Sro-Dke-adapten SH2^H3:TM=M;SS=N; 184 

437763; AA469369; Hs.5a31 ; tissue inhit)itor of metaltoproteinase 1 ; TlMP.pkInase.DAG_PE-t>lnd,RBO; 183 
436291; BE568452; Hs.344037: protein regulator of cytokinesis 1; none;TM=^M;SS»N: 182 
417512: X76534: Hs.82226; glycoprotein (transmembrane) nmb; PKD:TM=Y;SS=M: 3.81 
427647; W19744: Hs.180059; Homo sapiens cDNA FU20653 lis. ctone KA; none.pkinase: 180 
431629; AU0770K: Hs26SB27; interferon, aipha^udtte protein (do; none;TM=M;S&>Y: 160 
434699: AA643687: Hs.149425; Homo sapiens cONA FU 1 1980 fis. ctone HE; NucleosIde.tia2.none; 3.80 
430378; Z2957^ Hs.2556; tumor necrosis factor receptor superfann; IL2:: 179 
428157; AI738719; Hs.1 98427; hexokinase 2; hexokinase,hexokinase2,none; 3.78 

409512; AW979187; Hs.2S3S91; melanoma dfTerentiation associated prot: OEAO,hericase_CCARD:TM=M;SS=N; 178 
417720; AA2}5625; Hs.208067; ESTs; none.nona; 177 

439237: AW408158: Ks.318893: ESTs, sinrdar to A47582 &ceil gn Furfn^.pkInase,Rec8p.U.doin8in.YlP.nQn&; 3.77 

449029; N28969; Hs.22891: solute canler family 7 (calionic amino : aaj)enneases;TM=:Y;S&:M: 3.76 

413436; AF238083; Hs.68061; sphingoslne kinase 1; DAGKc;TM=M;SS=N; 175 

416714; AF263770; Hs.79630: CD79A antigen Onvnunogbtxinivassociated; ig.lTAM,Zn.clus;TM»Y;8S»M; 174 

413281: AA861271: Hs.222024: traisafpOon factor 6MAL2: HLH.PAS;; 174 

436481; AA379597; Hs.5199; HSPC150 protein similar to libkiuilliKon: UQLPon;TM^SS»N; 174 

431890; X17033: Hs.271986; Integrin. alpha 2 (C049B. alpha 2 subuni; vwa.lntegriaAF&GAP:TM=Y;8S=M: 174 . 

424118; BE269041; Hs.140452: cargo selection protein (mannose6phosp; periTtpln:; 173 

426471; M22440; Hs.170009; transfomiing growth factor, dpha; EGF;TMsM;SS«M; 172 

422487; AJ010901: H3.ig8267; mucin 4. tracheobronchial; EGF,vwd.AMOP;: 172 

450125; AA005418; Hs,158188; ESTs; CIDE.N.7tnLl,none; 3.71 

427337; Z46223; Hs.l76663; Fc fragment of IgG. low affinity lllb, r; lg;TM=Y:SS=M: 3.70 

444006; BE395085: Hs.334762; type I transmembrane protein Fn14; ldljecepLa,PKD,MHCJ;TW=M;SS=Y; 3.70 

422010; AA302049; Hs.31 181; Homo sapiens cDNA: FU23230 Qs, done C; none,SOF,sugar_tr; 3.70 

418969; W3319t: Hs.28907; hypothetteal protein FU20258; SH3;TM=M;SS=N; 3.69 

415817; U88967; Hs.76B67; protein tyrosine phosphatase, receptor-t; fnlY.pho5phatasa,carb.anhydrase;TM=Y:SS=M: 168 

443759; BE390832; Hs.134729; FXYD dcmaln^ntalnlng Ion transport reg; ATP1G1_PLM.MAT8;TM=Y;SS=M; 168 

452344; AI264357: Hs.55405; hypothetical protein MGC16212; SuIfate_transp,STAS;: 168 

439625; AF085453; Hs.58611: ESTs; For1ehead,glycolyec_enry,Na„sulph_3ymp: 3^ 

426227; U67058; Hs.154299; Human proteinase acSvated receptor-2 mR; 7tnu1:TM=Y;SS=M; 166 

452363; At582743; Hs.94953: Homo sapiens. Similar to complement comp; C1q,Co!lagen:; 165 

447365; BE383676: Hs.334; Rho guanine nucteoftJe exchange factor 0 SH3.PH,RhoGEF;TM=M;SS=N; 3.64 

418883; 6E387038; H5.1211; ackl phosphatase 5, tartrate resistant; M8tanophos;TM=M;SS=M; 164 

438707; L08239; Hs.6326; amino add system N transporter 2; porcu; ACAT.MBOAT;TM=Y;SS=M; 3.64 

422596; AF06361 1; Hs.1 18633; Z.^-cllgoadenylale synthetase-like; uKquiCn;; 3.63 

449318; AW236Q21; Hs.78531; Homo sapiens, Simiiar to RIKEN cDNA 5730; none;TM=M:SS»N: 162 

436746; AI88581S; Hs.184727; Hunnan melanomfrassodated antigen p97 (m; (ransfenin,GuanylatoJ(in,P0Z,SH3; 162 

446291; BE397753: Hs.14623; interferon, gamm^^ndudbte protein 30; Gia;TMNM;SS=Y; 162 

452698; AI826645; Hs.211S34; ESTs; ArfGap,PH,ank.6uanyfateJdn.PDZ.SH3; 3.60 

407634; AW016569; Hs.136414; UDP-GlcNAcrbetaGal betd-1>N^tylgluc; Galadosyl J;TM=M:SS»Y: 159 

423575; C18863; Hs.163443; intron of periosttn (OSF-2os): F3sddin.none; 159 

421391; AW304350; Hs.191958; immunogtobiriin superfamHy receptor tan; ig.none: 158 

419452; U33635; Hs.90572; PTK7 protein tyroshie kinase 7; ig,pkinase:TM=Y;SSnM; 158 

419912; AF249745; Hs,6066; Rho guanine nudeolida exchange factor {; SH3,PH,raM>GEF:TM=M;SS=N; 158 

431457: NM.012211; Hs.256297; Integrin. alpha 11; FG<5AP.vwa;TM=Y;SS=M; 157 

430379; AF134149; Hs.240395; polassKim channel, subfamily K. member 6; k)n_tr8ns;Th^Y;SS=M; 155 

418526; 8E019020: Hs.85638; sotuto carrier family 1 6 (monocarboxyite; none;TMaY;SS=M; 155 

441362; BE614410; Hs.23044; RAD51 (S. ceravi^) homolog (E coD Re; nonen 3i3 

426500; NM_014638; Hs.170156; KIAA0450 gene product; C2.R-PLOY;TM=M;SSaN: 153 

429556; AW139399: Hs,314807: ESTs; none;TM=M;SS=N: 152 

449101; AA205847; Hs.23016; G proteirwoupled leceptor, 7tnu1;TM=Y;SS=M; 152 

432636; AA340864: Hs.278562: daudin 7; PMP22.aaudln;TM=Y;SS=M: 151 

433470; AW960564; H3.351315; transmembrane 4 superfamDy member 1; none;TM=Y;SS=M: 161 

452203; X57522; Hs.352018; transporter 1, ATP-bindbg cassette, sub; ABC_tr8n,ABCjnembfane.SRP54.ThyniMyt8leJin;TMaY;SSsM: 3.49 
425566; AW162943; Hs.250618; UL16 binding protein 2; ldLrecepLa,PKD.MHCJ;TM=M;SS=Y; 148 
402447; ; ; CI 000201 :gll204416IgblAAA02627.1| (10519; none;TM=Y;SS=M; 148 

431183; NM.006855; Hs.250696; KDEL (Lys^p^Iu-Leu) endoplasmk; retic; ERJumenjecept;TH/!=M:SS=M; 3.48 

448988; Y09763: Hs.22785: gamma^nobutyite add (GABA) A recepto; Neurjchan_IBD.Neur_chanjnemb:TM=Y:SS=M; 148 

415323; BE259352: Hs.949: neutrophD cytosoDc factor 2 (65kD, chr; SH3,TPR:TM»M;SS»N; 148 

425003: AF1 19046; Hs.154149; apurimc/apyrlmWldc endonudeaseJAPEX ; TioponIn,B(0.endoj)hos.lQ;TM=M;8S=N: 147 

424909; 878187; Hs.153752; ceU divisfon cyde 258; Rhodanesen 144 

446051; BE048061; Hs.37054; ephrinA3; Ephrtn^deamin,dsnn,i.alpha: 3.43 

418641; BE243136; Hs.86947: a disintegrin and metaltoproteinasodonw; disWBgrin.ReprolysiaPtep.M12Bj)ropep,E^^ 142 
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417426; NML002291: Hs.82124; laminln. beta 1; larWn EGF,Iairtnln.Ntem\htBgnn_B:: 3.42 
41 1263; BE297802; Hs.69360: kinesiivfio 6 (rnltoOc centromere^oc; Hnesln;TM=M;SS=N; 3,42 
430044; AA4645tO: Hs.152812; ESTs; nonage; 3.42 

4252B9; AW139342: Hs.155530: Intertercn. gamma^ndudbb protein 16; PAAD.OAPlN.HlNn 3.39 
425354; U62027; Hs. 155935; complement component 3a receptor 1; 7tnul:TM=Y:SS=M; 3.39 
428293: BE250944;Hs.l83556;soIutecarrierfemIIy1 (neutral airino a: eIF6,SDF;TM^ 
443648; AIOSSSH; Hs.143610; ESTs; ForiLhead,none; 3.39 

418869; AW51656S; : gbJtqOldO&xl SoaresJWCeCcervlc^lumo: nQne.Ra5GAP,WW,IQ; 3^ 
432179; X75208: Hs.2913: EphB3: EP»tDNt.(rt3.p»n8$e,SAM:TM=Y:SSsM: a38 
418918; X07871; Hs.89476; C02 antigen (pSO), sheep red bbod cell ; lg;TMnY;SS=M; 3.38 
414368; W70171; Hs.75939?ui1dine monophosphate Wnase; PRK.CoaE;; 3.37 

408716; AI567839; H5.151714; Homo sapiens mRNA for KiAA1769 protein, ; Uvf1XI«!iC8se.RNB.Runt;TM=M;SS=N; a37 
457001; J032S8: H$.2062; vitamin D (1.25- dihydroxyvitamin D3} re; homiDnejetUEK>I.Met^io_5:TM=M;SS=N; 3.37 
422283; AW411307; Hs.1 1431 1: COC45 (cell division cyds 45. Suxrevis: COC45;Tiyi=M;SS:sN: 137 

421817; AF146074; Hs.108660; ATP-Wnding cassette. 6ut)43r«ly C (CFTR; F8sddln.ABC_tran^Cjnanbrane,GTP.EFRJ:TM=M;SS«M; 3^6 

400298; AA032279; Hs.61635; sU tr8nsmemt)rane eptthelial antigen d ; none:TM^Y;SS=N; &35 

428385; AF1 12213; Hs.16406?. putative Rab5-interacting proton; SH2,SK3;; 3.33 

400261; ; Ks.1802; Eos Control; ig.MHCJLbeta;TM»Y;S&:K« 3.33 

410024; AW191024; Hs.55D16; hypolheteal protein FU21935; SH3;TM=M;SS^N; a32 

412584; X54870; Hs.74085; DNA segment on chromosome 12 (unique) 24; nDneMn.c; 3.32 

416065; BE267931; Hs.78996; pronferaling cell nudear antigen; PCNA.PCNA^C;TM=I\A;SS=N; 3.31 

426437; BE076537; Hs.169895; ubiquidivconjugatlng enzyme E2L 6; Arm3diHo_seg,Ua.con.none; 3.31 

426840; BE244217: KS.17269D; diacytglycerol kinase, alpha (801(0); 6fhand.DAG_PE-t)ind.DAGKa.DAGKcDC1;TM»M:SS^ 3.31 

434419; AU040606: Hs.296938; dual specifony phosphatase 7; DSPc;TM=M;SS=N; 3.31 

416758; AW959311; Hs.172012; hypothetica] protein DKFZp434J037; pldnase,RI01;TM=M:S&=N; 3.31 

424779; AU)46851; Hs.153053; CD37 antigen; trans(nembr8na4;TM=Y;SS«IUl; 3.31 

423973; AF038461; Hs.138574; aracWdonale 12-tlpoxygenase. 12R type; Iipoxygenase,PLAT;TM=Nt.^ 

421733; ALII^I; Hs,1420; tit)rot)last growth factor receptor 3 (ach; (g,p!cinase;TM::Y;SS=M; 3.30 

449027; AJ271216; H5.22B80; dipepHdytpeplidase III; Pep(klase_M49.EGFJg.NeureguGn;TW=M;SS=N; 3.28 

423778; Y09267; Hs.132821; flavin cofdainino monooxyganase Z FM&IIke.pyrjedox;TI^Y;SS=M; 3.28 

426457; AW894667: Hs.38013d: chlmertn (chlmasrin) 1; OAG.PE^nd.RhoGAP.SH2;TM»M;SS=N; 3.28 

435523; T62849; Hs.1 1090; menArane-spanning Romans, subfamiiy A; non8;TM=Y;SS=M; 3.27 

431886; L77964; Hs.271980; mitogen^Svated protein kinase 6; pkinase;TM=M;SS«N; 3.27 

430397; A1924533; Hs.105607; btearbonale transporter related protein ; HC03_cotFBnsp:TM»Y;SS=N; 3.27 

425322: U63630: Hs.15S637: proton kinase. DNA-acQvaled, catalytic; PIU>l4Jdnase.FAT.FATC;TM:=M:SS«N: 3.26 

446006; NM_0O44O3; H3.13530: deafness, autosomal domhanl 5; none;TM^SS"M; 3.26 

444783; AK001468; Hs.62180; anillln (Orosophila Scraps homotog), act PH,none: 3.25 

422278; AF072B73; Hs.1 14218; frteded (Drosophila) homotog 6; Fz,Frizzled.7lnu2;TM=Y;8S=M; 3.25 

405932; ; ; C15000305:gq3806122|gb|AAC69198.1) (AFO; ra3:TM=M:SS=N; 3.25 

400205; ; Hs.81848; NM.006265*MnK> sapiens RAD21 (S. pombe); DUF173:: 3.25 

432874; W94322: H3.279651 ; melanoma inhibitory activily; SH3;TM=M;SS=Y; 3.24 

412942; AL120344; Ks75074; mitogen-acfivated protein kinas&^tival; pklnase;TM=M;SS=N; 3.23 

435472; AW972330; Hs.283022; triggering receptor expressed on myetold; lg:TM=M;SS=M; 3.22 

439285; AL133916; Hs.47860; hypothetical protein FU20093; lg,pkinase.LRR.LRRNT.lJRRCT.none; 3.22 

410434; AF051 152; Hs,63668: tdl^ike receptor 2; LRflUWCT.T1R;TM=M;SS=M; 3.22 

427318; AF186081; Hs.1757B3: zinc transporten Zip;Tl^Y;SS=M; 3.22 

436075; BE090176; Hs.179902; transporter-like protein; none;TM=Y;SS=|yi; 3.22 

428698; AA652773: HS.334B38; KIAA1866 protein; none:NA;NA; 3.22 

4488B8; AWt96663; Hs.200242; caspase recruitment domain protein 6; CARD;TM=M;SS=N; 3.22 

415149; X12451; Hs.78D56; cathepsln 1^ Peptidase^CI;; 3.21 

423393; R37772; Hs.21420; p21-activated protein kinase 6; pWnase,PBO:TM=M;SS=N; 3.21 

424618; L29472; Hs.1802; major histocompatibliily complex, ddss ; }g.MHCJI.beta;TMsY;SS^ 3.20 

438564; AA381553; H5.198253; ma{or hlstocompatit^Ilty complex, class ; tg.MHCJLalphajione; 3.20 

456181; U6463; Hs. 1 030; ras inhibitor; RA,SH2.VPS9;TM=M:SS=N; 3.20 

418613; AA744529; Hs.66575; mitogen^ated pioteb kinase kinase ; pkinase.CNH;TM=M;SS=N; 3.19 

440682; AW362152; Hs.27181: nuclei receptor binding factor.2; cyc9n4}ZIP;TM=M;S&=N; ai8 

415010; NMJD04203; Hs.77763; membrane-assodated ^rosbie- and threon; ank.pkinase.UPF0073;; 3.16 

413216; AU076718; K$.164021; small Indudble cytokine subfamily B (Cy; ILBn 3.16 

450737; AW007152; Hs.63325; transmembrane protease, serine 4; trypsin,ULrecepLa.none; 3.16 

426395; BE1S1985; Hs.355669: hypoth6tk:al protein FU23316; pkinase,none; 115 

412939; AW411491; Hs.750e9; eukaryotk: translation etongafion facton none,none; 115 

433376: AI249361: Hs.74122: caspase 4. apoptosls^elated cysteine f. CARO,IC^10JCEji20'.; 115 

410668; BE379794; Hs.159651; hypothetical protein; deafh.TNFa.c$;TM=Y:SSsM; 115 

431441; U81961; Hs.2794; sodium channel, nonvoltage^ated 1 alpha; ASC;TiyNY;SS=N; 115 

432251 ; AW972983; Hs.2321 65; polycythemia rubra vera 1; cell surface ; rK>ne;TM»M;SSsM; 115 

407844; AW073716; Hs.8037: ESTs; transmembrane4,none; 114 

408834; AW407254; H3.356216; cslmoduiln 2 (phosphotylase kinase, dell; none.none; 114 

423061; AI290473; Hs.44807; ESTs; lntBgrin.B,Sema.PSI.TIGjtone; 3.14 

438974; AF089816; H5.6454; chromosome 19 open reading frame 3; PDZ;; 113 

431236; AV656840; Hs.285115; interteukin 13 receptor, alpha 1; fn3;TM=Y;S&=M; 113 

425394; AA356730; Hs.323949; kangal 1 (suppresston of tumofigenldty ; transmembrBne4jione; 113 

429336: AB005038; Hs.199270; cytochrome P450, subfamily XXVIB (25^; p450;; 113 

449230; BE613348; Hs.356392; melanoma cell adhesk)n molecule; lg.isodh.RibosomaLl6.F-box;TM:'Y;S&:f^ 3.13 

429305; AF095727; Hs.287832; myelin protein zen>.like 1; lg.tr8nsmembrane4;TM=»Y;SS=M; 112 

419034; Nl^002110; Hs.89555; hemopcicte cell kinase; SH2,SH3.pkInaso;TM=M;SS=N; 112 

417386; AL037228; H3.301957; D123 gene produd; NUDIX.secY.El.d8hydrog.transketj>yr;TM=Y;SS=M; 111 

419138; U48508; Hs.89631; ryanodloB receptor 1 (skeletai): b)rLtrans,SPRY,RYDRJTPR,Ryf?,MIR;TM=Y;SS=N; 111 

440008; AK000517: Hs.6844: NALP2 protein; PYWN^nt^lng APAFI-B; AAAW«C,PAADJ)APIN;NA;NA: 110 

406467; ; ; Target Exon; eO)and.AcyUransferase.non8; 110 

422956; BE545072; HS.12S79; Ea2 pret^ (Epithenal ceO transfonn}; 6RCT.RhoGEF;TM^M;SSsN; 110 
417ni; AA804698; Hs.82547; reOnofc add receptor respondcr (tazaro; none.none; 109 
437016; AU076916; Hs.5398; guanine monphosphste synlhstase; PHO,SET.zt. 

CXXC£GF.ank.notch.WW.FCH,GATase,GMP_synLC.Ocdudin.YEATS.metaUhto.E6.hefl^ 
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441384; AA447849: Hs.288660; r^ote add Induced 3; Tbrua^na; 3.09 

416636; N32536; K5.42645; soiuts earner famSy 16 (monocarfaoxyfic; non8,none; 3.09 

41 6498; U33632: H&79351; potassium channel. suUaniDy K. member 1; lon.(r8ns:TM»Y:SS«M: 3.09 

426108; AA622037: H5.166468; piogrsmmed cell death 5: DUF122;TM=M;SS»N; 3.08 

414606: D14694; Ks.77329; phosphalidytserine synthase 1; PSS:TM=Y:SS=M; 3.08 

402233; ; ; NM_030760*:Homo sa^tens endolheial diff; 7lm.1:TM=Y:SS=M: 3.07 

430066; AI929659; Hs.237825; signal recogmlion partUs 72kO; TPaAIKC.SAICAK.8ynt; 3.07 

425367; BE27118a; HS.15S97S: pnt^ tynstne phosjdiatasa, receptor % non8:TMsM;SS»Y: 3.06 

434263; N34895; Hs.79167; ESTs; l9.none: 3.06 

443907; AU076484; H3^63; TYRO protein tyrosine kinase liincfing pro; none;TM=^M;SS=Y; 3.05 

409378; U42387; Hs^26: pancreatic potypepOde receptor 1; 7Inv.1:TM=Y:SS=M; 3.05 

410165; BE560228; H5.71869; apoptosis-assoctated speckle protein ; PAA0.DAPiN.CAR0:Tli4=M;SS^; 3.05 

440270; NM.01S986: HsJ120; cytokine leceptor-Gke motecule 9; (h3;; 3.05 

449003; X76342: Ks.369; alcohol dehydrogenase 7 (class IV), rou o; adn^ncTM^tUtSS^N; 3.05 

420189; AW296380; K3.95821 ; osteoclast sSmtiiaOng factor 1; SK3.8nk;; 3.05 

429732. U20158; Hs.2468; lymphocyte cytosoilc protein 2 (SH2 doma; SH2;; 3.05 

421541; NM.003942: Hs.105584; ribosomal protein S6 kinase. 90kO, polyp; pkinase.pkinase.C;TM=M:SS=N; 3.04 
415444; BE247295; H5.78452; solute carrier famity 20 (phosphate tran; PH04.UM;TM=M;SS=N; 3.03 
425118; AU076611; Hs.154672; methylene telrahydrolblate dehydrogenase; myb.ONA- 

binding.TKF.DHG_CYH.THF^DHG.CYH.aCAP.aY>^L0N.Peptidase_C9.bZIP,fcU^ 
449046; Z45051: Hs.22920; similar to S68401 (catOe) glucose Indue; Lamp;TI^M;SS«M: a03 
413869; NM.00087B; Hs.75596; tnterleukin 2 receptor, beiK none;TMsY;SS^M; a02 
442875; BE623003: Hs.23625; Homo sapiens clone TCCCTA00142 mRNA sequ; K.tBtra.DUF51,none; 3.02 
436576; AI458213; Hs.77542; ESTs; 7tm_1.0naJ; 3.02 

446269; AW263155; H3.14559; hypothetical protein FU10540; none;TM=W;SS=N: 3.02 

418870; AF147204; Hs.89414: chemoklne (C-X-C motil), receptor 4 (fus; 7tnLl.7tmJJ;TM=Y;SS=M: aOI 

421379; Y15221; H3.103982; small Indudtjlo cytokine sulrfamBy B (Cy; IL8;TM=M;SS=Y; 3.00 

421267; BE314724; Hs.103081; rit)0SQmal protein S6 kinase. 70kO. polyp; pkinase.p)dnase.C;TMsM;SS#l; 3.00 

409705; M37762; Hs.56023; t)raln^Jerived neurotrophic factor; NGF;; Z99 

429903: ALl34ig7; Hs.93597; cyclin-dependent kinase 5. regulatory su; COK5_acQvator,none; 2.99 

430696; AA531276; K5.59509; ESTs; pkinase,PP2C,none; Z98 

418299; AA279530; Hs.83968; Integrfn. Ijela 2 (antigen C018 (p95K ly; lntegrin^B,EGF.PSI;TM=Y;S$=M; 2.97 

410026; Ai912061: Hs.55016: hypotheOcal protein Fy21935; none,none; Z97 

448733; NM.005629; HS.18795B; solute carrier family 6 (neurotransmitte; SNF;TM»Y:SS»N; 2.97 

43256% BE531048: Hs.278422; DKFZP586G1122 pnM; zK:2H2;TlUNy1;SS=N; Z97 

453035; AW581943; Hs.334; Rho guanine nucleotide exchange factor (; ftone.non6; 2.97 

427315; AA179949: Hs.175563; Homo sapiens mRNA; cDNA OKFZp564N0763 (f; none.spectfm,SH3.PH.CH; 2.97 

431941; AK0001 06; Hs.272227; Homo sapiens cDNA FLJ20099 lis, ctone CO; pkln3se.Furin-IB(e,Recep.l^domain.none: 2.96 

441389; AF134838; Hs.783S; endocytic receptor (macrophage mannose r. fn24ectin_c;TMsY:SSsM; 2.95 

412276; BE262621; Hs.73798; macrophage migrafion \m\Diy ^ t MIF.sugar.tr.none: Z94 

427359; AW020782; Hs.79881 : Homo sapiens cDNA: FU23008 lis, done U 7tnLl.n0Re: Z94 

402558; ; ; C1000201:9i|204416igb1AAA02627.1| (ti)519: none:TM«Y;SS»M: 194 

425852; AK001504; Hs.159851: death receptor 6. TNF sitperMly memben death.TNFR.c6;TMsY;SS»M; Z94 

442080; AW444761; Hs.72901; ESTs; ankr. i94 

450447; AF212223; H3.2501O: hypothetical protein P15-2; NTF2:TM=M;SS«N; 193 

444809: BE207568; Hs.208219; oculospanin; l i ai tt ineiitoane4;TM=Y;SS=N; 2.93 

449843: RB5337; Hs.24030: solute carrier family 31 (copper transpo; none:TM5Y;SS=M: 2.93 

4161 10; Z42262; Hs.322844; hypothetical protein OKFZp564A176: Sema.PSi.TJG.Integria_B;TM=Y;SS^M; Z93 

453768; BE382670: Hs.19851 1; Homo sapiens mRNA; cONA DKFZp761l177 (fr; arf,G-aIpha,none; 2.92 

414825; X06370; Hs.77432; epWermal growth factor receptor (avian ; Furin.Iike,pkhiase,Rficep_l^dom8ln:TM=M;SS=M; 2.92 

421429; NM_014922; Hs.1043D5; death effector fflament-fomiing Ced^B; imPAAD^DAPIN>W^CARD.N&V\RC;NA;NA; 2.92 

434826; AF1 55661; Hs.22265; pyruvate dehydrogenase phosphatase; PP2C.none; 2.91 

451292; AB037716: Hs.26204; KIAA1295 protein; SH3;TM=M;SS=N; 2.91 

422127; AW504286;Hs.112049: SET Ijinding factor 1;dDENN.0ENN.GRAI«,PH;; Z91 

41 9508: AW997938: Hs.90786; ATP-ljInding cassette, 8ul>.family C (CFTR; ABC.trBn.ABCjnemt)r8ne;TNNY;S$=M: Z90 

430451; AA836472; H8.297939; cathepsin B; Pepiidase.Cl.proJsomeraser. Z90 

424046; AF027886; Hs.138202; serine (or cysteine) proteinase InhitiHo; 8erpin;Tl\il=M;S&>N; 2.89 

414907; X90725; Hs.77597; polo (OrosophlaHIke kinase; RIt}osomaLL37ae.plunase.POLO_box.tRNA-6ynL1b.dynai)nin.dyn^^ 2.89 

429619; AL1 20751 ; H5.21 1566; eukaryotic translation Inftiaiion factor; none,none; Z89 

413879; AA132961; Hs.212533: Homo sapiens cDNA: FU22572 fe. clone H; none.nonB; Z89 

417018; M16038: Hs.B0887; v-yes-1 Yamaguchi sarcoma viral related ; SH2.SH3.pkinase;TM=M;SS=N; Z89 

422610; AF153820: Hs.1547: potassium Inwardly-rectifying channel, s; IRK;TM=Y:SS=N; Z89 

405556; ; ; homeodomaln-interactlng protein kinase 3; tjy(^n;TM=M:SS=N; 2.89 

423804; AW403448; Hs.1706; Interferon-sSmulated transcrtptkx) facit IRF,zf^HC4,IBR.zM^BP.TM^M;SSeN: 2.89 
425262; D87119: H8.155418; GS3955 protein; pklnase;; Z88 

422599; BE367202; Hs.1 18638; non^tastatc cells 1, proteto (NM23A) ; NDK,PH.OxysleroL6P;: 2.88 
452886: AW9S5454: Hs.30942: ephrin^ Ephrtn.fn2;TM:>Y;SS»M: Z88 

414703: 6E243877: Hs.380063; ATPase. Na? transporting, beta 3 pdlypep: NaJ<-ATPase:TM=Y:SSMiA: Z87 

444143; AW747996; Hs.160999; ESTs. Moderately similar to A56ig4 Ihrom; Bc{-2.none; Z86 

413472: 8E242870; Hs,75379; solute canter family 1 (glial high afii; SDF:TM=Y;SS»M; ZB6 

458039; AA635884; Hs.130685: leukobiene b4 receptor (chemokine recep; aDE.N.none; 2.86 

434417; AL1 10157; K8.3843: Homo sapiens mRNA; cDNA DKFZp686F2224 ((; DSPcnone; Z86 

425802; Y14638; ; chernddne-like receptor 1; 7tm_1.none; Z86 

403112; ; ; Target Exon; efhand.C2.PH,PI-PLC.Y,PI-PLC-X;; Z86 

435563; AF210317; Hs.95497: solute carrier fanffly 2 (facffilated ghi; sugar_tr;TM=Y;SS=N; Z85 

442117; AW664964; Hs.128899; ESTs; hypotheticai protein for IMAGE:447: none.none; 2.84 

457819: AA057484: Hs.35406: FU20522 Hypothetk^l protein FU20522; none.none: Z84 

456629; AW891965: Hs.367942: histone deacetylase 3; HSP90.HATPase^c.zf^2.PHO.none; Z83 

408873: AU)46017; Hs.356216; calmodulin 2 (phosphorylase kinase, delt; none.nona: Z83 

448947; AF146747; Hs.232165: polycythemia rubra vera 1; cen surface ; none:TM=iM;SS^ ZB3 

448386; AB037750: Hs.21061; KIAA1329 protein; PKO.BNR;TM=Y;SS=M; Z82 

427857; AL133017; Hs.288679: hypothetical protein FU22865: myosinJiead.lQ.zM4YNO;Tf4=M;SS=M: Z82 

407601; ACOO2300: Hs.37129; sodium channel, nonvoitage^ated 1. beta; ASC:TM»Y;SS=M; Z82 
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459707; AA631382; Hs,120866: gb:np86bOU1 NCLCGAP.Thyl Homo sapiens; Tftiu' Jwne; Z82 

422899; BE410S90; H5.119257; emsl sequence (mammary lumor and squamou; SH3.HS1 jep;TM=M:SS::N; Z82 

438108; A1471795: Hs.287776; vanU receptor-related osmotlcany a; ankM_tr8ns;TM3Y;SS'=N; Z82 

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor t; lQnesn^n3.Yj)tiosphatase;TM=(ytSSm: 2.82 

448595: AB014544; Hs^1572; K1AA0644 gene pnxhict: LAaLRRCT;TM=Y;SS=M; Z81 

423598; BE247500; Hs.377968; ESTs; 7tm.1;TM=Y;SS=M; 281 

412970; AB026436; Ks.177534; dual specifK^ity phosphatase 10; Rhodanes8.DSPc: Z81 

414198; AW505308; Hs.75812; phosphoenotpyiwalecatboxyldnase 2 (mO; PEPCK: 2^1 

438729; BE621807; Hs.35131S; transmembrane 4 stipeifandly member 1; non8;TM>Y;SS»M; Z81 

432314: AA533447: Hs.285173; ESTs; XBnk.none: 181 

416207; NM.014745; Hs.7g077; Homo sapiens, done M6C2908. mRNA, comp; none;TM»Y;SS»M; Z60 
446985; AL038704: Hs.156827; ESTs. Weakly similar to ALU1J1UMAN ALU S; SAM.SH3^1_iep; 2.80 
428023; A1038843: HS.374S30: Homo sapiens cONA: FU23602 lis. dona L; 

aaLpemieases.pyrfdcixa)JeabiiuTiodomd!n.PHD.MBD^TJ^ 
432886; BE15g028; Hs.279704; chromatin accessiUrity complex 1; none:TM=M;SS=N; 2.80 
426006; R49031: Hs.22627; ESTs; pkinase,TBC; 2.79 

414217; AI309298: Hs.279898: Homo sapiens cDNA: aJ23165 (is. clone t^ none;NA;NA: 2.79 
411165; NMJ0OO169; Hs.G90^ gdactosldase, alphat MeOliiase;: 2.79 

450056: BE047394: Hs.S02: ESTs. Wealdy simDar to S71512 hypolheQ; ABC.tran.ABCjnenibrane.ig.MHCJLbetaSRP54.pcDte8Some.W 2.78 

424291; AL120051; Hs.144700; ephiln-BI: Ephiin;TM=Y;SS=M: 2,78 

421448; AF033850: Hs.104519; phosphdlpase 02; PH.PLDc,PX;TM=M;SS=N: 2.78 

410226; AIB31958; Hs.61053; hypothetical protein; SH3.TPR;TM=M;SS=N; 2.78 

433535; AF111108; Hs.3382; protein phosphatase 4, regulatory subuni; HEAT;TM44;SS^ Z78 

442503; AF147078; Hs.375031: p53-rBsponsivd gene 5; )^tetra,ion_trans,none; Z77 

413900: AW409747; Hs.75612; stress-lnduced-phosphopfotein 1 (Hsp70/H; TPR,PDZ,WW,GuanyIate.Wn;TM=M:SS=N; Z77 

454294; ABQ00734; Hs.50640: JAK binding protein; SH2;TM=M;SS=N; 2.77 

440168; AK001612; Hs.7038: N-Acetylglucosamine kinase: ROK;TMstyl;SS»N: Z77 

449539; W80363; Hs.58446; ESTs; pidnase.Furin-like.Recep_Ldomain,none; 2.76 

422667; H25642; Hs.132821; ESTs; FMCme.FMO-Iike: Z76 

41501?. NM.004383; Hs.77793: o«c tyrosine Idnase; SH2»SH3.pkinase:TM=M;SS=N; 2.76 

402316; ; ; NIA.013447:Homo sapiens egf-fike module c; 7tnL2,GPS;TM=M;SS=M; 2,75 

425455; L18964: Hs.1904; protein kinase C tola; pklnase,DAG_PE.bind.pkinase.C.OPR;TM=M;SS=N; Z76 

447250; AI878909; Hs.17B83; protein phosphatase 1G (fomierty 2q. ma; PP2C;TM=M;SS=N; 2.75 

438829; AI167380; HsJ{57170; ESTs, We^ similar to T1251 5 hypothefi; TNFK.c6,none; Z75 

451144; AW955103: H3.61712; pyruvate dehydrogenase Wnase. isoenzyme: HATPase_Rnono: 2.74 

408543; N78098; Hs.44289; ESTs; none;TM^M;SS«N; 2.74 

429345; R11141; Hs.199595; hypothetical protein; K.tetra.SAM: Z74 

407722; BE252241: Hs.38041: pyridoxa) (pyrtdoxine, vitamin 66) kinas; pfkB;TM=M;S&=N: 2.73 

420602; AF060877; Hs.99236; regulator of G-protein signalling 20; RGS;TM=M;SS^N; Z73 

407217; AA477136; Hs.105584: ribosomal protein S6 kinase. 90KD. polyp; pkinase,pkinase_C;TM=M;SS»N; Z73 

414135; NM.004419: Hs.2128; dual specificity phosphatase 5; Rhod3nes6.0SPcYj}hosphat3se:TM:4l4;SS»N: Z73 

410590; BE615216; Hs.64746; diloride intraceHular channel 3; ncne;TM=M;SS^N; Z73 

411125; AA151647; Hs.6B877; cytochrome b-245, alpha polypeptide; none;TM=Y;SS=M; 2.73 

438022; AW517524; Hs.135201: N0D2 protein; LRR.CAR0.GTP.GDC;VinaUielIcase1;TM=M;SS=N; 2.72 

420929; At694143; Hs^2624B; programmed ceO dedlh 4; MA3;TMsM;SS»N: 2.72 

421 155; H87879: Hs.102267; lysyl oxidase: Ly5yljDxidase.AldosejepiraEpimefasen 2.72 

446554; AL044962; H5.21453; Inositol 1,4.5-tr1sphosphat9 3-klnase C; IPK:: 2.71 

449961; AW265634; Hs.133100; ESTs; pkinase,Furin^ike,Recep_U_dom^,none; 2.71 

444633; AF1 11713; Hs.12284; juncfional adhesion molecule 1; lg;TM=Y:SS=M; Z71 

412259; AI560292: Hs.279909; protein phosphatase 2 (taerly 2A), reg: WD40:T»Hll:SS*:N: Z71 

419569; AI971651;Hs.91143:}agged 1 (Alagiile syndrome); DSL.EGF.IarninifLEGF.vwo^thlo;TM^ 

452401; NM-007115; Ks.29352; tumor necrosis factor, alpha^nduced pro; }aM{.CUB*H' Z71 

458190; BE561793; H3.21446; K1AA1716 protsin; ASCGalactosyLT.none; Z70 

432128; AA865239; Hs,37196; ESTs; 7tm_1;TM=Y:SS=M; 2.70 

422616; BE300330; Hs.118725: seienophosphate synthetase 2; AIRSAIRS_C:TM=M:SS"N: 2.70 

424717; H03754; Hs.152213; wingless-type MMTV integration site faml; wnl,none; 2.70 

414108; Ara592; Hs.75761; SFRS protein kinase 1; ank.PH,0xysteTd_BP.pWnase;TM3M;SS=Nt Z70 

446272; BE268912; Hs.14601: hematopoietic celkpeciiicLynsubstrat; SH3.HS1.[ep;TM=M;SSsN; 2.70 

416084; LI 6991; H5.79006; deoxythymldylate kinase (Ihymidylate kin; non8,none: 2i9 

427157; U51166; Hs.173824; thymln^ONAglyoo8ylase;UDG;TNNMSSS=N;Z69 

404891; ; ; Target Exon; none,none; Z69 

432581: AU078465: Ks.278441; KIAA001S gene product; PP2C:TM»M;SSsN; Z69 

424321; W74048; Hs.1765: lymphocyt&^edBc protein tyrosine Idn; SH2,SH3^pklnase;TM=M;SS=N; Z68 

425308; M97639; Hs.155585; receptor tyrosine kinase^ orphan rec; tgJatngle,pklnase,Fz;TM=Y:SS=M: Z68 

414443; AU077268; Hs.76144: plateleWerived growth factor receptor^ lg,pklnase;TM=Y;SS=N: Z68 

427274; NM.00521 1 ; Hs.174142; colony stimulaHng factor 1 receptor, to; ig,pklnase;TM=Y;SS=M; 2.68 

436856; A1469355; Hs.127310; ESTs; pkin3se.rmi;TM=M;SS=W: Z68 

437429; H79981:Hs.5613; Homo sapiens mRNA; cONADKFZp584E2222(f: SH2,SK3.BTB: Z67 

450690; AA296696; Hs.33341 8: FXYD domain-containing ion transport reg; ATP1G1_PLM-MAT6:TM»Y;SSsM: Z67 

452069; AB028949; Hs.183g94; KIAA1026 protein; MetdIk)phos;TM^:SS»N: 2j67 

445330; R52655: Ks.21691; ESTs; 7tin.1.none: Z67 

452698; Nlil.0012%; Hs.301921: chemokine [OC motil) receptor 1: 7tm_1:TM=Y;SSsM: Z67 
419754; H52299; Hs.308467; Homo sapiens mRNA; cONA DKFZp586IOS23 (f; none:TM=M;SS=N; Z67 
434237; AF119908; Hs.235516; hypothetical protein PR02955; none;; Z67 

445826; BE313754; Hs.13350: Homo sapiens mRNA; cDNA OKFZp58600918 (f; ig.tsp_1.ZU5,Nudeoside.tran; 2.68 

446698; AF279265; Hs.2g8476: solute canter Mly 26, member 6; Su!fate.transp.STAS,xan.ur_pemiaase;TM»Y;SS=N; 2.66 

439750; AL359053; HS.57G64; Homo sapiens mRNA fuH length insert cON; lMPOH_C.IMPDH.N.CBSJntBgrln.B.RicbLB JecOn; 2.66 

413745; AW247252: Hs.7S514; mcleosMe phosphorytase; Mtap^PNPn Z66 

429083; Y09397; Ks.227817; BCL2-related protein A1; Bd-2;TM=M;SS=N: 2.66 

449523; NH.000579: H5.54443; chemoWne (C^ moCQ receptor 5; 7tm_1;Tf*=Y;SS=M; Z66 

449030; AI355582; Hs.57100: Konu) sapiens mRNA for FU00018 prot^ ; tran5membrana4,TM=Y:SS=M; 2.66 

434979; A19S3054; Hs.89643: transketolase (W^mkfco-Korsstolf ^ra; ASatr8nsketdase.lfanskeLpyr.lr8nskelolasa.C4ddnas8; Z66 

406137; ; ; NluL000179*M)mo sapiens mutS (E coB) h; MiitS.CPWWPMS_N;TM=M;SS=N: ZG6 
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41293S: 8E267045: K».75064; bibuIlMpedSc chapenno a none;; Z66 

40863% AW963372: H5.222088; PRO2000 protein; bnmodomdnJ^SigmaM.actival:: 2.66 

412B17; At0371S9; Ks.74619; proleasome (prosoma, macropain) 25S subu: PCjep;TM=M;SS=N: 2.65 

452682; AA456193; Hs.374574; progesterone membrBne binding protein; homeobox.none; 2.65 

401752; ; ; RAN binding protein 3; SH2.STAT,STAT_blnd.STATj)rot.loo trans^A(;PAS,Orejdn; Z65 

450747; A1064821: Hs.129953; ESTs. Hghty 8lnii!ar Id 1818357A EWS gen; rmi.z(-RanBP.GAS2; 2.65 

425776; U25128; Ks.159499: parathyroid hormone receptor 2; 7lra.2.HRM;TM»Y;SS^ 164 

452701: NK1.005110; Hs.30332; glutamine-hic(ose-6-pho5ph3l8 transaniin;GATase.2,SIS;TI^SS=N; Z64 

433933; AI7&4389; Hs.355397; Homo sapiens done TCCaAO0t64 mRNA sequ; none:NA.-HA: Z64 

421677; H54092; Hs.38282; ESTs; Alpp^adillo.seg .IBB; Z64 

436469; AK001455; Hs.5198; Down syndrome critical re^ gene 2; none;; Z64 

423198; M81 933; Ks.1634; cefl division cyde 25A; Rhodanese.none; 164 

435905; AW997484; Hs.5003; K1AA0456 protein; SH3,RhoGAP.FCH;TI*:M;SS=N; 2.64 

437712; X04588; H3.8S44: neurotropltic tyrosine kinase, receptor, ; Tropomyosln,pkInase4J^llLRRCT.Hydantoinase3.Hydank)inasO^TM^ Z63 

458946; AA009716; Hs.4231 1; ESTs; none.DSPcY_pliosph8tase; 163 

447217; BE465754: Hs.17778; neuropllin 2; CUB,MAM,F5^FB.type_C;TM=M;SS=iyi: 2.63 

445462; AA378776: Hs.288649; hypothetical protein MGC3077; none;; 2.63 

425075; AA506324; Hs.1852; acid phosphatase, prostate; ac!d_phosphat;TM=:Y:SSs|i/l: 163 

405588; ; ; NM.0X299*'i^omo sapiens piakophilln 1 {e; Annadnb.seg;TM=M;SSsN; 163 

438330; AW450572: Hs.257316; ESTs; pHnase^C4.ERM.CNH.none: 163 

448243; AW369771; Hs.387688; Integrin. t)eia 8; integrin B.nona; 163 

45201 2; AA307703; Hs.279766; Idnesin (amPy member 4A; klnes{n.DNA_tQpoi$olV,K-toqTM=M;SS=N; 163 

41 2182; AA205588; Hs.73737; Splicing factor, ogininefeerinMiGh, 4; rmi,honnQRejec.zf-C4,sugar.ti;; 2.63 

423887; AL080207: Hs.134585: DKFZP434G232 protein; ABC.tran:TM?Y;S8»N; 163 

417497; AW402482; Hs.82212; C053 antigen; trBnsmembrane4;TM=Y;SS=M; 162 

413407; Ai356293; Hs.75339; inositol pdyphosphata phosphatase^ ; SHlSAM,Exo_endo_phos:; 162 

414998; NM.002543; Hs.77729; oxidtsed low density lipoprotein (lectin; leclinjc;Tf4»Y;SS»l^ 162 

417880; BE241595; Hs.82848; selectin L (lymphocyte adhesion molecule; E6FJedin.csushI;TIUI»A4;SS=liA; 162 

429922; Z97630; H8.2261 17; HI histone Miy, member 0; linkerJi]stone;TM=M;SS=N; 162 

401812; ; ; sorting nexin 14; AAA,NB-ARC.APS_kinase.cdc48„N.cdc48_lnono; 161 

417886; AA214S84; ; ESTs; SPRY,7lnu3ANF„receptor.none: 161 

457670; AF1 19866; Hs.23449; insulin receptor tyrosine kinase substre; SH3;TM=M;SS=N; 161 

428512; Ai018187; Hb.375624; Human im sequence from ckme RP1 1'243J1; none:. 161 

426746; J03626: Hs.20S7; urkflne monophosphate synthetase (orotat; Prn)osyItran,OMPdecase;TM=M;SS=N: 2.61 

454042; H22570; Hs.47860; hypothetical protein FU20093; tg.pkinase,LRR,lRRNT,LRRCT.nQne; 161 

421077; AK000061; Hs.101590; hypolheticai protein; ank,pkin3se.dBath,SPRY,SAP,Ribosomal.L24e.SRP54,dDENN,OENN,uI)ENN;TM=l«^ 160 

420162; 6E378432; K5.95S77; cyclin-dependent kinase 4; pkInase;TM»M;SS«:N; 160 

416661; AA634543; Hs.79440; IGF-I) mRNA-binding protein 3; KH^h)main.nm;T1kNUI;SS=N; 160 

417821; BE245149; Hs.82643; protein tyrosine kinase 9; cofiiia/DF:; 160 

41 1 133; AW819204; ; gb:CM1'ST0283-071 299-061 -h03 ST0283 Homo; ANFjeoeptor.none; 160 

405602; ; ; Target Exon; pklnase;; 160 

400440; X83957; Hs.83870: nebulin; SH3.HebuGn;; 160 

424848; AI263231; Hs.327090; EST; SH3,PDZ.Guanylate kln,none; 159 

432268; BE31 1856; Hs.274230; 3^-phosphoadenosine 5'-phosphosuirate sy; APSJ(^eATP-8ulfurylase;TM^M;SS=N; 159 

452690; A1536070; Hs.15085; ESTs; pou,homeobQx.Rg.chan>VNFjecepton 159 

422753; AI928995; H5.1575; small nuclear ribonucleoprotein D3 polyp: Sm;; 2.59 

428028; U52112; Hs.182018; interieukin-l receptor^odated kinase; dedlh.pktnase;Tli^M;SS»N; 158 

433573; AF234887; Hs.57652: cadherin, EGF UG seven-pass G-type rece; 7tnx;Z.EGF.cadherin.laminin-EGFJaminln.G,Trypan Jlyoop,GPS^;TIW=Y;S8^ 2.58 

422785; AI8241 1 4; Hs.289088; heal shodc 90kD protein 1. alpha; zK:2Hlnone; 158 

418685; U76376; H3.87247; haraMrl, BCt2-{nterac6ng protein (oont; none;TiyNyi:SS4il; 158 

452329; N36626: Hs.29106: n^en-xthrated protein kinase phospha; DSPc:TM4A:SS=N; 158 

428405; Y00762; Hs.2266; chdinergic receptor. nicoQnte, alpha p; Neur_chan_l£aNeurjBhanjiiemb;TIUNY;SS4/l; 158 

421251; 228913: Hs.102948; erfgma (LIM domain protein); UHPD2;; 157 

407245; X90568: Hs.172004; Clin; fn3.ig.SGXXSG.pkinase;TI^M;SS«N; 157 

422309; 079745; Hs.114924; solute canteriamiiy 16 (monocarboxylic; 8ugar.tr;TM>Y;SS»M: 157 

401751; ; ; RAN binding protein 3; Orexin.SH2.STAT.STAT.btnd.STATj)roUon_trans,PAC,PAS,none; 157 

447887; AA1 14050; Hs.21 1610: caspase 8, apoptosis^ed cysteine pr; ICEj)10,tcLp20.OED:TM^M:SS='N; 157 

422282; AF019225; Hs.114309; apolipoprotein l; MotA.ExbB;TH*=Y;SS=M: 157 

439863; 6E547830; Hs.375208; paired lmmunoglobu1In-6ke receptor beta; lipoxygenase,PtAT,npoxygenase,PLAT: 157 

425743; BE396495; Hs.159428; BCU^sociated X pnOOa] Bd^TKA^Y.-SSsN; 157 

401218; ; ; eukanroQe translatton elongatnn factor; ion.ljans:TM=Y;SSsN; 157 

412773; H15785; Hs.74573: similar to vacdnia vtrus HIndill K4L OR; PLDc;TM=M;SS=N; 157 

444743; AA045648: Hs.3019S7; nudix (nudeosWe diphosphate (inked md; NUDiX,8ecY,E1_dehydrog,lransketj>ynTM=Y;SSsM; 158 

429782; NiyL005754; Hs.220889; Ras^TPase-acthraQng protein SH3<k>main: nm.rfrF2;TM=M;SS=N: 156 

442994; AI026718; Hs.16954: ESTs; ank.pkinase.death,Ribo8omai.S14; 2.56 

456802; AA411607; Hs.1 18964; ESTs. Weakly similar to KIAA1 ISO protein; none.pkinase: 156 

422846; BE513934; Hs.1583; neutrophil cytosdic factor 1 (47kD. dm SH3.PX;TM=M;S&:N; 156 

441699; AVW11126: Hs.127572; ESTs; noneM.trans; 156 

447912: AW576549: Hs.165728; ESTs. Weakly similar to 138022 hypotheb'; none,GSHPxABC_tran; 2.56 
442945: AI024849: H3.131853; ESTs; pkln85e,mne; 156 

453199; A1336266; Ks.32353; mitogen-activated protein kinase kinase ; pkinase;TM=M;SS=N: 156 
451477; AI798425; H3.42710; ESTs; SH3.none; 156 

41 5091; AL044872; Hs,77910; 3-hydroxy-3methyIgtutaryW)oenzymB A sy; HMG.CoA^; 155 
413529; U11874; Hs.M6; Interteukin 8 receptor, beta; 7tnLl;TM=Y;SS=N: 155 

425345; AU077297: Hs.l55894: protein tyroshe phosphatase, norwecept; Y_phosphatase4)SPc:TM=M;SS«M; 155 
401321; ; ; receptor tyrosine kinase^ike orphan rec; none;TMsM:SS-N; 2.55 
446999; AA151520: Hs.351416: hypothetkai protein MGC4485; none,nona; 155 
401057; ; ; eukaryofic translatnn elongation factor ion trBns.lQ;TM3Y;SSsN; 155 
414^. AW161311; Hs.76294: CD63 antigen (melanoma 1 antigen); transmembrane4;TM=Y;SS=l^ 155 
408204; AA454501; Hs.43666; protein tyrosine phosphatase type IVA. m; Yj)hosphatase;TM^M;SS>'N: 2.54 
424539; 10291 1; Hs.150402; Acfivin A receptor, type I (ACVRI) (AU<; pkinase,Act]vi[uecp;TM^SS=M; 154 
459060; H89244; Hs.303627: heterogeneous nudear ifbonudeoproleln : mn.pMnase;TM^SS^ 154 
450167; AA446404: Hs.24563: NTF2^ated export protein 1; NTF2;TiiH«;8S<4« 
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425966: NhL001761: Hs.1973: cycGn F; cydin.Fto.cycOn.C;TM=M:SS»N; 2M 

446566; K95741; Hs.17914: membrane-spanning ^domains, subfamOy A; none:TM3Y:SS»M: Z54 

412834; Rni23; Hs.79881; Homo sapiens cD^iA: FU23006 Ss, done L; 7tm_1^ne; 2.54 

457255: AL1 3301 1; Hs.253920; Homo sapiens mRNA; cONA DKFZp434P201 (fr. aone.none; Z54 

431341; AA30721 1; Hs.251531; proteasome (prosome, macropain) subunil; pmte3Som8;TM=M;SS=N; 2.53 

417331; AW411297; Hs.81972; SHC (Src homology 2 domaiHsntaning) t; SH2.PlD,zf-C2H2.SCAN,AMP-bIndtngJ<RAB;ThN ZS3 
414570: Y(X)285; Hs.76473; insuRn^ growih factor 2 receptor. fn2.aMR;TMsM;SS»M: 2.53 
444838; AV6516B0: H3.208558: ESTs; lntegrtnJ^FGW.none; Z53 

422609; Z46023; Hs.1 18721; siaTtdase 1 (tysosomal siaRdase); 8NR.SH2.SK3,p)0nasa:TM»Y;SSBM; 2.53 

450296; AL04 1949; Hs.24756: hepatocyte growih lactor-regutated tyros; none^e; 2.53 

400702; ; ; Target Exon; Bg_chan.SBP_bac_3ANF_recepto;TM=Y;SS=M; 153 

432336; NM.002759; H5.Z74382; protein kinase, interferon^nductbie dou; dsmi.p)dn3se:TM=M;SS=N: 253 

442643; UB2758; Hs.374973: PRP4/STKWD spDdng factor; W040:; 2.52 

452060; W26980; Ks.349089; ATP-binding cassette, subfamily F (GCN2; ABC_tran.lRK.SWIB: 2.52 

443951; F13272; Hs356835; ferritin, fight pdypepfide; PMP22^aaudln.ncne; 2.52 

428975; NM_004672; ^194694; mitogeivacSvated protein Idnase tdnase ; pktnase;; Z52 

407608: AI928218; Hs.360063; ATPase, NaT? transporting, t)eta 3 polypep; none.none; Z51 

414482; S57498; Hs.762S2; endothelin receptor type A; 7lnLl:TM=Y;SS«*fr, 2.51 

410293; AK000047: Hs.61960; hypothetical protein; ietetra;TM=M;8S=N; 2.51 

429663; M68874; Hs.211587: phospholipasa A2. group IVA (cytosolic ; C2.PLA2_B;TM=M:SS»N; 151 

425424; NM_004954; Hs,157199; EUa motif Idnase; pkinase,UBA.KA1;TM=M:SS=N; 2.51 

457013; AA037145; Hs.172865; deavage stimulation (actor, 3* pro-RNA,; WD40;TM=M;SS=N; 2.51 

439221: AA737108: Hs.322S0c EST6, Modaralely similar to 178885 seiin; adlLshort3iU8H4.none; Z51 

405429; ; ; Target Exon; Yjiha8phatase,none; !IS1 

443466; BE243123; Hs.321045; iKK-reiated kinase epsDon; tnducit)le Oq pkinase.RI01:TM=M;SSsN: 2.51 

418478; U38945; Hs.1174; cydin-dependent kinase Inhibitor 2A (me; ankr. 2.50 

408056; AA312329; Hs.42331: ephriivM; Ephfin:TI\4»M;SS=M; Z50 

414419: F06829; Hs.760g0; tumor necrosis foctor, alph»lnduced pro; K.letra(TM»M:S&^ Z50 

405389; : ; NII/L005569*:Homo sapiens UM domain Unas; pkma5e.UM,P0Z;; 2.50 

418216; AA562240; Hs.283099; AF15q14 protein; Hemagglutlnln.squash;TM=Y;SS=N; 2.50 

404321; ; ; C700174V:g!124996291splQ63932IMPK2^MOUS; none.none; 2.50 

430900: U91939: Hs.248123: G prolein-coupted receptor 25; 7tm_1;TM=Y:SS=M: 2.49 

440861; BE244115; lte.7482; KIAA0682 gene produd; rTm.GuanyiateJdn;TMsM:SS»N; 2.49 

415801; R24219: H3.278443; Fc fragment of IgG. tow affinity lib, re; ig;TM=Y;SS=N; 2.49 

418741; H83265; Hs.6881; ESTs, Weakly similar to S41044 dvomosom; pkinase^:lIvliLrecp,pldnase>^divbuecp; 2.49 
' 417034; NML006183;Hs.80962: neurotensin; none;; Z49 
400303; AA242758: Hs.79136; LIV-1 protein, estrogen regulated; none,none; 2.49 
40B805; H69912; Hs.4B269; vaccinia related kinase 1; pkina5e;TM»M;SS=N: Z49 
41 8255; AW135405; Hs.37251: ESTs; pkinase.none; Z49 

424905; N^L002497; Ks.153704: NIMA (never in mitosis gene a)-related k; pkinase;TM=M;SS»N; 2.44 

417791; AW965339: Hs.44269; ESTs; none.1erZFAD_binding_5>UdJCan JiuC.fer2_2>Ud J(aa.dh_C2,CX^^ Z44 

453941; U39817; Hs.38820; Bloom syndrome; DEAD.heIicdse_CHR0C;TM=M;SSsl4; 2.41 

417849; AW291587; Hs.82733; nidogen 2; EGF,t(fljeoepLb.lhyrogtobuIin„1:TMNM:SS«M: Z39 

408908; BE296227; Hs.250822; sertne/lhreonine kinase 15; pkinase;; 2.32 

428513; BE220806; Hs.184697; plexin CI; PSI.none; 2.31 

426761; AI01S709; Hs.172089; PORIMIN Pro^ncosls receptor hdudng me; none;TM=Y;SSeM; 231 

427585; 03115Z Hs.179729; collagen, type X. alpha 1 (SchmkJ metaph; C1q.Collagen;: 2.28 

412723; AA648459; Ks.335951; hypothetical protein AF301222; none;TM=M;SS»N; Z28 

452461: N78223; Hs.108106; transcription fadon zK)3HC4.ubk)uilin.PHD.YDG_SRA;TM=M;SS»N: 2.26 

429547; AW009166; Ks.99376: FGENESH predteted novd eecreted protein; none.none: 2.15 

429486: AF155827: Hs.203983: hypothflOcal protein FU10339: 8HF2 Jl,heDcase.C;TiyNM:S8eN: Z15 

401486; ; ; C4000647*:gI|4758508|reflNP_004253.1| tf; none;TM=Y;SS=M: 2.15 

416209; AA236776; Hs.79078; MAD2 (mitotic anest delident. yeasl. h; HORMA;TM=M;SS=N: 2.14 

424399; AI905687: Hs.348419: Ai905687:IL-6T095-190199^19 8T095 Homo ; none;TMsM:SSeM; 2.14 

423761; Nlit006194; Hs.132S76: pained box gene 9; PAX:TM»M;$S«N: Z13 

439670; AF088076:K3.59507; ESTs. Weakly simnar to AC0048S63Ul8m:none,non8; 2.13 

439316; AW837046; Hs.6527; G proteliKXMpled receptor 56; 7tm_%CytC.^GPS;TM=Y;SS=H 2.03 

445019; AI205540: Hs.281295; ESTs; none.none; ZOO 

443211; AI128388; Hs.143655; ESTs; none.none; 1.98 

449448; D60730: H8.67471: ESTs; nond.non0; 1.92 

435243: AW292886: Hs.348932; hypothetical protein dJ43(014.3; IRF^none; liS 

406360; ; ; Target Exon; W040:TM=M;SS=N; 1.84 

411388; X72925; HS.697S2: desmocdRn 1; cadher1n;TM»Y;SS^N; 1.84 

453102; Nl\^007197; H5.31664; tnzzied (Drosophila) homotog 10; Fz.FrizzIed,7tm.2;TM»Y;SS=M; 1.79 

419183; U60869; Hs.89863; cytodirome P450, subfamily XXiV (vitamin; p450;; 1.78 

420344; BE463721; Hs.97101; putative G pmteln^pled receptor. Methyltransf_5;TM»Y;SSsM; 1.77 

432842: AW674093; Hs.334822: hypothetical protein M6C4485: RibosomaLU;TM=iytS&=N; 1.76 

41974% AW408762; Hs.5957; Homo sapiens done 24416 mRNA sequence none,none: 1.73 

426427; M86699; Hs.169840: TTK protein kinase; pkinase.'; 1.62 

437915; AI637993: Hs.202312: Homo sapiens done Nil NTera2D1 tefatoca; none.none; 1.58 

433338; AF017986: Hs.31386; secreted frtzzled^lated protein 2 (str. FZ.KTR:: 1.50 

434377; AW137148; Ks.306593: intron of pertostin (OSF-20S); Fasddln.none; 1.47 

451592; AI805416; Hs.213897; ESTs; none.none; 1.47 

404927; ; : Target Exon; GaIactosyLT:TM=M;SS=Y; 1.28 

421552; AF02669^ Hs.105700; secreted frizzted^elaled protein 4; Fz,NTI^ 1.27 

427335: AA448542; Hs.278444: G anUgen 7B: none;; 1.25 

431808; M30703; Hs.270833; ampWreguHn (schwannomaderfved growth ; EGF:TM=Y;SS=M; 1.24 
447993; AW139525; Hs.170362; ESTs; none/ione; 1.21 

428182; 6E386042; Hs.293317; ESTs. Weakly simSar to GGC1.HUMAN G ANT; none;TM=M;SS=N; 1.18 
453637; NI\A.002589; Hs.34073; BH^rotocadherin (brain-heart); C8dhain:TM»Y;SS4llt 1.14 
438274; AI918906; Hs.55080: ESTs; PAKnone; 1.14 

453966; BE148734; H163325: transmembrane protease, serine 4; trypsin.klljecepUa,none; 1.10 
413268; AL039079; Hs.752S6: regulator of ^protein signalling 1; RGS;TM4i);SS»N; }m 
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42M21; AAS2691 1; Hs.82772: coBagen. type M. alpha 1; CoQagen.COtRJSPN,laminh>_G,CofA;; 1 .00 
452795; AW392555: Hs.18878; hypoihefical protein FU21S20: 20G.FelL0xy;TM=M;SS=N: 1.00 

TABLE 259 

Pkey: Unique Eos probeset tdenSfier lumber 
CAT number Gene cluster nanitei 
Accesskw: GenttaricaKesstonnumbeis 

Pkey CAT Hmbet Accession 

408685 0.0 M18728 

418869 12769.14 AA229762AA230035 

425802 8884.3 AA122298AA360788 

417885 1031334.1 AA210987DS7294AA214584AA207006 056572 

411133 1070995.1 AW819203AW819204AWB19197AW819202AW819211 BE158469AW819221 BE1 58473 AW81 9235 AW81 9207 AW819220AW81 9208 AW819238 
AW81919aAW819234 

TABLE 25C 

Pkey: Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digH numbers in this column are Genbank Mentifier (Gl) numbers. "Dunham L et al.* refers to the pubOcation entitled "The ONA 

sequence of human chromosome 22.* Dunham I. el aL, Nature (199^ 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLpcsHlon: Indicates nucleotide positions of predkrtod exons. 



Pkey 


Ref 


Strand 


NLpositton 


401781 


7249190 


Minus 


83215^5.B3531-83656.83740-83901,8423 


401780 


7249190 


Minus 


28397-28617.28920-29045.29135-29296.2941 


401760 


9929599 


Plus 


83126-83250,85320^0,94719-95287 


402075 


8117407 


Plus 


121907.122035,122804-122921,124019-12416 


401747 


9789872 


Minus 


11 8596-1 18816,1 191 19-1 19244.1 1 9609-1 1976 


404996 


6007890 


Plus 


37999.38145,38652-38998,39727-39872.4055 


402447 


9796640 


Plus 


4760fr47729^1&9&51821 ,5207052257,5330 


405932 


7767812 


Mbius 


123525-123713 


406467 


9795551 


Plus 


162212-162958 


402233 


7690102 


Plus 


90281-91477 


402558 


9863760 


Plus 


19047-19145.21133.21293.33968^89 


405556 


1552511 


Phjs 


183497-163623^164715-16496^,165369-16550 


403112 


8980973 


Minus 


113051-113195 


402316 


7527774 


Minus 


10751-10919.18817-19052.22131-22328 


404891 


7329392 


Plus 


84974-85125 


406137 


9156422 


Minus 


30487-31058 


401752 


9828651 


Phis 


144600-144794 


405588 


5002511 


Ptus 


4618046366 


401812 


7407975 


Mnus 


55084-55391 


405602 


4753260 


Pius 


4464744778 


401751 


9828651 


Ptus 


139165-139322 


401216 


9929301 


Minus 


4079^1031 


401321 


9863631 


Minus 


104278-104748 


401057 


8117645 


Plus 


158309-159238 


400702 


811B856 


Minus 


11457-11585.26311-26536.27902-26067,3204 


405429 


7321905 


Minus 


51577-51723 


405369 


2078459 


Minus 


34183-34357,35686-35751 


404321 


9665209 


Minus 


76594-77805 


401486 


7341763 


Pius 


3258&^2756,36281-38540.4079140933.4401 


406380 


9256107 


Minus 


7513-7673 


404927 


7342002 


Pkis 


68690-69563 



TABLE 26A: 834 GEhES UP-REGULATEO IN EWINGTS SARCOMA COMPARED TO NORMAL ADULT TISSUES 

Table 26A Hsts about 634 genes up-regulated in Swing's sarcoma compared to nonnai adult Gssues. These were selected from 35403 probesets on the Affymetrix/Eos Hu03 
GeneChip array such that the ratio of 'averase' kidney cancer to 'average' normal adult tissues was greater than or equal to 1 .5. The 'average' kidney cancer level was set to the 
75^ pereenQle amongst Ewfng sanxxnas. The 'everogoT normal adult tissue level was set to the 85^ percantiia amongst noiHn^nant tissues. In order to remove gene^pedfk; 
badtgiDund levels of nonspecific hybridizatkm. the 7.5^ peicenfile vahia amongst nofHTiSBgnantassues was subtracted from both the mnneiBtor and the denodnalor before the ralto 
was evaluated. 

Pkey: Unique Eos probeset MentHier number 

ExAccn: Exemplar Acc css k w i numt)ef, Gent)ank acoessfan numbef 

UnlgendD: Un^ne number 

Unlgena TlSe: Uidgene gene fHle 

f^: Ralio of Ewing sarcoma to nonnai fissuQ 



Pkey ' ExAocn UnKSenetO UniGene Title RT" 

101447 M21305 gb:Human alpha satellite and sateOite 3 38.4 

115861 NM.0057S6 Hs.184942 G protein-coupled receptor 64 34.2 

110278 AF061573 Hs.19492 piolocadherin 8 32.2 

121362 AF050147 Hs.97932 chondromoduOnl precursor 30J 

101104 AWB62258 Hs.1fi9266 neuiopepOdeY receptor Y1 26.3 
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121792 AW969726 Hs.98381 ESTs. WeaWy 5 Wlar to scrfne protease 24.4 

121619 AAS28339 H3.178082 ESTs, Weakly similar to phosphatitJylseri 23.4 

104659 AW369769 Ks.105201 ESTs 20.2 

- 106533 AL134708 Hs.145998 ESTs 16.9 

5 124006 A1147155 Hs.270016 ESTs 15.0 

110728 AA737106 HS.322S0 ESTs, Moderately slnfltf to 178885 serin 14.8 

105782 HD9748 Hs.57987 &ceQaUlyniphoma11B (zinc finger pfo 14.6 

102836 U94320 Hs.158330 neuropeptldaY receptor Y5 14.5 

104691 U2S690 Hs.37744 Homo sapiens beta*1 adrenergic receptor 13.7 

10 121231 AA814948 H3.96343 ESTs. Weakly simiiar to ALUCHUH^ilfl 12.3 

129526 S696B1 H5.177582 suifactant, pulmonary^ssoclatod protein 12.1 

119791 AA554907 HS.S6291 ESTs 11.7 

116301 AW969706 Hs.293332 ESTs 11.2 

123308 C14187 Hs.103538 ESTs 10.9 

15 127742 AW293496 Hs.180138 . ESTs 10.8 

131601 Nt^007115 Hs.29352 tumor necrosis f^.sdphshinducedpio 10.7 

127489 AA650250 Hs.272076 ESTs 10.6 

115909 AW872527 K3.59761 ESTs, W^slmBar to DAP1J1UMAN DEATH 10.6 

101083 054745 H5.80247 chotecystoUnin'' 10.6 

20 134570 U86615 H5.172280 SWI/SNF related, matrix associated, acO 10.S 

100299 D49493 Hs.2171 growth differenfiafion factor 10 10.1 

127987 AI022103 Hs.124511 ESTs 10.1 

131313 R96290 Hs.336629 ribosomal protein L44 a2 

126799 AW7S3865 Hs74376 oUactomedln related ERtocaBzedprotei BlS 

25 125847 AW161885 H5.249034 ESTs 7.0 

100360 D82343 Hs.18551 r\eurob1astonta (nerve tissue) protein 6.9 

114837 BE244930 Hs.166895 ESTs 6.6 

123049 BE047680 Hs.211869 dickkDpf(Xencpuslaavis)homolog 2 6.6 

^ 129977 NM^000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 6.5 

30 127695 AA714731 Hs^1457 ESTs. Weakly sNlar to heterogeneous r 6.5 

125166 AA61I»20 Hs.181244 m^histoconipatibnity complex, class 6.4 

116644 AA443241 Hs^6629 ribosomal protein L44 6.3 

119717 AA918317 Hs.57987 B<»n aUlymphoma 11B (zhw linger pro 6.3 

101879 AA176374 Hs.243886 nuclear autoantigenlc spenr) protein (his 6.1 

35 113003 AW292315 Hs.7215 ESTs &8 

126645 AA316161 Hs.61635 six transmembrane epilheiid anfigen Of 5.7 

101050 AU077324 Hs.1832 neuropeptide Y 5.7 

116790 AW161357 Hs.101174 mterotubute-assodated protdn tau 5.5 

119082 AF2S2297 Hs.91&46 cytochrome P450 refinold metabolizing pr 5.1 

40 132315 AF091086 H8.44563 hypothefical protein 5.0 

126098 M79088 gb:EST01 236 Subtracted Hippocampus, Sira 4.9 

126077 M78772 Hs.210836 ESTs 4.7 

126426 AA12S984 gbnm27h06j1 Strstagene neuroeptthdhm 4.6 

131307 NliC000025 Hs.2549 adreneiB'ic. beta4>. receptor 4.5 

45 123619 AA602964 gb:no97c02.8l NCLCGAP.Pi2 Homo sapiens 4.4 

128361 AW172570 Hs.130246 ESTs 4.3 

127003 AW816515 Hs.173540 ATPase.QassV.typelOD 4.3 

100020 4.2 

125556 AB033064 Hs.334808 KIAA1238prot^ 4.2 

50 105316 AI671245 H8.2483S hypothetical protein FU 14594 4.0 

112268 W39609 H$.22Q03 solute Garrier family 6 (neurotransndtte 4.0 

106516 AL137311 Hs^074 Homo sapiens mRNA; cONA OKFZp761G02121 ( 3.9 

128132 AA225632 gb:nc08a07.r1 NCLCGAP_Pn Homo sapiens 3.9 

129012 R81936 Hs.336629 ribosomal protein L44 3.9 

55 125447 Ai582222 Hs.128686 ESTs 3.8 

134676 W2B051 H5.87819 Homo sapiens, clone M6C:2492. mRNA, comp 3.6 

119040 R02394 Hs.269436 ESTs. Moderelely similar to PC4259ferri 3.6 

128391 AW1B8326 H5.170652 ESTs 3.5 

_^ 123829 AF251237 Hs.112208 XAGE-1 protein 3,4 

60 123949 AA621665 Hs.208957 EST 3.4 

126872 AW450979 gb:Ul-H.BMa*12^Ul.s1 NCLCGAP.Su 3.4 

101266 136645 Hs.73984 EphA4 3.3 

121309 AA293834 Hs.97312 ESTs 3.3 

130637 AA3S6764 Hs.17109 integral membrane protein 2A 3.2 

65 125464 N71807 gbvz29d09.r1 Soaresjnultiplejdarosls^ 3.2 

135175 IUI91463 Hs.9595e solute carrier famRy 2 (faaltatedgtu 3.2 

107599 AW664072 Hs.60136 ESTs a2 

1026B1 Y08890 Hs.1 13503 karyopheiin (importin) beta 3 3.2 

131688 AI935413 Hs.30692 p21 (CDKN1A>«ilvated Kinase 2 ai 

70 120147 A1917116 Hs.1S5376 hemogtoUn. beta ai 

110343 AW138703 Hs.17268 ESTs 11 

127664 AA806164 H8.116502 ESTs XO 

103076 NM.001034 Hs.75319 ribonudeoQde reductase M2 pdypepUda 3.0 

126127 N95428 gb3bB0d09^1 Soares.senescentJbroblas 3.0 

75 125558 R59305 gb:yh16c10Jl Soares infant brain 1 NIB H aO 

100335 AW247529 Hs.6793 platetet^dh^ng factor acetyihydrola 19 

133421 AF134160 Hs.7327 daudinl 18 

102581 AU077228 Hs.77256 enhancer of zeste(Dro5ophila)homolog 2 2.8 

113577 AI300699 Hs.278937 PRO0470 protein 2.8 

80 118397 BE139479 Hs.161492 ESTs 18 

115773 AW44S044 Hs.38207 Human DMA sequence from done RP4^I15 18 

128859 AW630087 Hs.103315 tdnudeoSde repeal conteinino 1 18 

127262 AA828125 gb»d71 809.81 Na_CGAP.0v2 Homo sapiens 18 
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106472 AI207162 Hs^lS st3lhnMkes)roteIn RB3 t7 

125032 T748d4 gb:yc5BdOZs1 Stralagene liver (937224) 2.7 

127315 AF116622 fibiHomo sapiens done 084217 mRNAseque 2.7 

126600 AA699949 Hs.191385 ESTs 17 

5 120325 AA195651 Hs.104106 ESTs Z7 

127256 At738610 H&267967 ESTs, Moderately sImDar to ALUB.KUMAN 17 

117357 N24B29 gb:yx98i)12j1 Soafesni8tanocyte2NbHMHo 2.7 

126735 M69113 Hs.226795 gluteiMoneS-lransferasepI 2.7 

102745 AW753865 Hs.74376 olfectomedin related ERtacdlzedpfOtn* 2.7 

10 128040 AW5004e6 H3.1 80610 splicing factor pronne/^lutamina rich ( 2.6 

129706 AA443241 Hs.336629 ribosomal protein L44 2.6 

107731 AA016086 Hs.272106 ESTs. WeaMyslmOar to 138022 tiypotheO 2.6 

128283 At07657O Hs.134053 ESTs 16 

125165 W45350 gb2c81h08.s1 Pancreatic Islet Homo sapi 16 

IS 111148 AB020690 Hs.7782 paraneoptasHc antigen MA2 Z6 

105577 AW852257 Hs.171391 Otenn&ral binding protrin 2 2.6 

128301 U90552 H5.284263 butyrophilln. subfamily 3. member A1 2.6 

130262 D63216 Hs.15%B4 frizzted^ated protein 16 

^ 132967 AA316181 Hs.61635 sbc transmembrane epIlheDal an&gen of 16 

20 102479 NiyL001991 Hs.194669 enhancer of zests (Drosophila)bomo)09l 2.6 

128531 H03721 Hs.2953 ribosoma) protein S15a 16 

126155 AI741Bt6 Hs.125897 ESTs 16 

126088 H75681 gb:yr77g01.r1 Scares fetal Over spleen 15 

118967 AI688670 Hs.216756 ESTs 2.5 

25 120830 AI568170 Hs,96886 ESTs 15 

127229 AA316iei Hs.61635 six transmembrane epIlheOal antigen of 15 

129428 AA255906 Hs.111364 ESTs. WeaMy similar to ubiquitous TPRm 15 

110151 H18835 Hs.31608 hypotiietica) protein FU20041 15 

131381 h492642 Hs.26208 collagen, type XVI, alpha 1 15 

30 133761 AF041430 Hs.75922 brain protein 13 15 

125590 R23858 H8.143375 Homo sapiens, done iMAGE:3840937.mRNA, 15 

126693 C05723 gb:C05723 Human pancreatic Islet Homo sa 15 

126021 AA775894 Hs.187516 ESTs 15 

125905 AI678638 Hs.64S6 chaperonincontdnlngTCP1.sia)unit2(b 15 

35 102507 U52154 Hs.193044 potassium inwaidty-fecti^ng channel, 8 15 

125743 H17151 gb:ym378Q5j1 Soares Intent brabilNIBH 15 

130580 N3238d Hs.334370 uncharactertzed hypothetemus protein HBE 15 

113119 T47910 ob:yb18b11.8lStratagene final spleen (9 14 

123110 AA486256 Hs.193510 EST 14 

40 113283 T66813 Hs.12947 EST 14 

lOnil W96141 Hs.220687 ESTs 14 

128992 H04150 Hs.107708 ESTs 14 

106111 AW875398 Hs,6451 PRO0659 prolrin 14 

129948 AI537162 Hs.263988 ESTs 14 

45 125728 AW954565 Hs.57987 B<eliaUlymphoma11B(zlnc finger pro 14 

116728 F13687 Hs.227976 EST 14 

103100 NM_O05574 Hs.184585 UM domain only 2 (fhombotin^l) 14 

124971 T23800 Hs.15l001 hypothetical protein FU 14728 14 

_ 131019 W28614 Hs.306155 chorionic somatomammotropin hormone 1 (p 14 

50 128671 A188S045 Hs.211586 phosphdnosHide^nase. regulatory s 14 

111795 A1435437 H3.24567 EST5.Weddy5WlartoKBF3J^UMAN NUa 14 

119127 AA70803S Hs.12248 ESTs 14 

117602 N35020 Hs.44685 C3HC44Ike zinc finger protein 14 

111898 R3B944 Hs.183475 Homo sapiens clone 25061 mRNA sequence 14 

55 131916 AA025976 Hs.34569 ESTs 14 

130850 AB040922 Hs.20237 DKFZPS66C134 pratein 14 

100571 L14561 Hs.7B546 ATPase.Ca^tr8nsporiing, plasma membra 14 

126722 N66148 Hs.11125 HSPCX)33 piottin 14 

123720 AA609734 Hs.1 12755 EST 14 

60 113609 T93263 Hs.16875 ESTs. VteaWy simitar to S23650rebomr 14 

131136 AB033099 Hs.23413 KIAAl 273 protein 14 

1^001 AA443323 Hs.107612 BP(K protein 14 

133529 W45623 Hs.74571 AOP-ribosytaOon factor 1 14 

107593 AI093688 Hs,60051 ESTs 2.4 

65 123910 AA621262 Hs.179923 ESTs. WeaMy similar to S65657alpha.1C- 2.4 

128817 BE395776 Hs.168640 ankylosis, progressive (mouse) homolog 14 

103080 AU077231 Hs.82932 cycOn D1 (PRAD1: parathynM edenonatos 14 

128367 AWB11791 Hs.150742 ESTs 14 

„ 123729 AlX)39779 Hs.278672 membrane component, Chromosome 11, suria 14 

70 112342 AW410273 Hs.92614 longevity assurance (IA61. S. oerevlsiae . 13 

114721 061939 Hs.103822 ESTs 13 

127768 AW085002 Hs.156187 ESTs 13 

127706 AI174238 Hs.186982 ESTs 13 

126029 AA704253 Hs.169359 ESTs 13 

75 124250 AA350256 Hs.323875. EST, Wealdysimaarto2109260ABcell 2.3 

117265 AA451966 Hs.43005 RABMe protein 13 

112501 AA972447 Hs.288833 Homo sapiens mRNA; cONAOKFZp434K087(fr 13 

129079 AK0001S7 Hs.108502 hypothetical protein FU20150 13 

127252 AI049545 Hs.94 DnaJ(Hsp40) homotog, subfamily A. membe 13 

80 129228 U40714 Hs.233307 lyrosyt-tRNA synthetase 13 

114092 H81213 Hs.14825 ESTs, WdaUy similar to KIAAl 503 protein 13 

109252 BE440t57 Hs.85944 ESTs 13 

127889 AI147408 H3.144941 ESTs 13 
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121292 AA401B07 gb3v65f1U1 Soafes.totaLtehis_Nb2HF8„ i3 

128797 NM„002975 Hs,105927 stem ceDgrowlhfactonlyn#ocyt8 seer 2.3 

132985 AL045579 Hs.62n3 KIAA0717 (Kotein 2L3 

125174 V\61835 Hs.231082 EST tZ 

5 125401 AI204637 Hs^7585 ESTs. Highly slmHar to KIAAD350 [HjapJ Z3 

135278 AA399542 H3.229671 EST. Moderately similar to PEPTIDYL-PROL 2.3 

119155 R61715 Hs.310598 ESTs. Moderately similar to ALUl.KUMAN 2.3 

123423 AAS98484 gb:a838l04^1 Gesster Wilms tumor Hoinos tZ 

123258 AA4g0929 Hs.105274 ESTs,We^8imB3rtoRMSl_HUMANREGUL 2.3 

10 128826 Z40313 Hs.106330 Homo sapiens clorteiMAGE:23371.mRNAseq IZ 

105014 AA121123 Hs.269287 ESTs, Weakly similar to AF161 361 1HSPC 13 

101086 AA382524 Ks.250959 hislaQnl 2.3 

110679 AA0D4798 H$.108311 ESrs.W^ similar to T00351hypo(he0 2.3 

126879 D90391 Hs.1265 brandied chain keto acid dehydrogenase E tZ 

15 132317 BE262438 Hs.44592 beta-1.4 mannosyltransferase 2.3 

124691 R05835 Hs.110153 ESTs Z3 

113474 R50752 Hs.23856 hypolhellcai protein MGC5297 IZ 

103175 X69089 H&79227 myomesin(M-prDlein) 2 (1651(0) 2.2 

129052 6E275031 Hs.158210 hypoa«QcaIprot^M6C2555 2.2 

20 129248 WD4606 Hs.171637 hypotheticai protein MGC2628 22 

100760 BE561958 Hs.302063 ■ immunogtobuiin heavy constant mu 2.2 

135416 BE281018 Hs.g9989 fusion, derived from t(1?, 16) malignant 2.2 

129928 A)338993 Hs.1 34535 ESTs 2.2 

103319 X83492 Hs.82359 tumor necrosis factor reoeptorsuperfami 22 

25 110256 H63947 H8,237955 RAB7, member RAS oncogene family 22 

120734 AA299948 . gb:EST1 2544 Uterus tumor I Homo sapiens 22 

111777 AK001100 Hs.41690 desmxoIIin3 22 

128963 J03890 H&1074 surfactant, pulmonary-associated protein 22 

^ 108451 AA079195 Qb2m92h12.s1 Stralagene ovarian cancer 22 

30 134964 A1803516 Hs.272891 hippocalcin^ilce protein 4 22 

127248 AA36419S gb:ESn501 5 Pineal gland H Homo sapiens 2.2 

12^61 1^51 gb7h99b03.r1 Scares placenta Nb2HP Homo 22 

101358 M10058 Hs.12056 aslalogiyroprotein receptor 1 22 

101613 M24283 Hs.1683B3 Intercellular adhesion motectite 1 (C054) 22 

35 107121 AB015427 Hs.250493 zinc finger protoln 219 2.2 

118751 N74210 H8.50454 ESTs 22 

128952 AL043463 Hs.6755 RaP2 interacting protein 8 U 

126581 W73306 H5.3Q6668 Homo sapiens cONAFU14089fis. done i^ 22 

127634 AA633469 H3.193283 ESTs, Weakly similar to unnamed protein 22 

40 130755 BE293520 Hs.18910 prostate cancer orarexpressed gens 1 22 

132867 Al^667 Hs.58553 CTPsynthaseti 22 

126323 N77584 Hs.68644 Homo sapiens microsoma) signal peptidase 22 

111790 AW7698B3 Hs.6734 ESTs. Weakly similar to S26&50DNA'b]ndi 22 

125549 R20215 gb7gt8b09.r1 Soares infant brain 1NIBH 22 

45 128059 AA972446 Hs.145096 ESTs 22 

132342 AW16275B Hs.45232 ESTs. Weakly similar to ALUS^HUMAN ALUS 22 

125722 H29796 Hs.269622 ESTs 22 

106383 AA447453 Hs.27860 Homo sapiens mRKA;cONADKFZp586M0723(f 22 

127644 N88858 Hs,155101 ATP synthase. H+transpoftir»g.mitochond 22 

50 128179 AW293689 H8.127116 ESTs 22 

133461 1^000762 Hs.334345 cytochrome P450, subfamily IIA(plienobar 22 

126952 R12014 Hs.20976 ESTs 22 

112369 AWg66243 Hs.4243 hypothalcal protein FU 12650 22 

133582 BE391579 Hs.75087 Fas-activated serine/threonine Mnase 22 

55 112276 R53442 Hs.26D38 ESTs. Weakly simitor to 138022 hypothat 22 

108743 AI580150 Hs.71074 ESTs 22 

133726 Ai803188 Hs.252716 oxysterDl4)inding proteirHelatad prolel 22 

131263 AU077002 H5.24950 regulator of GiNDteinslgnatiing 5 22 

,^ 109929 AA773187 Hs^027 ESTs 22 

60 129059 AW069534 Hs.279583 CGI-81 protein 22 

110724 AW016783 Hs.30799 Homo8aplen3cDNAFU13471 fis,ctonePL 22 

116962 H79677 gb:yu76g10.s1 Soares told liver sptoen 22 

119232 A1655226 Hs.117559 ESTs, Weakly similar to T46481 hypothefl 22 

106711 BE390125 H3.143187 hypothelfcal protein 22 

65 135191 X16866 Hs.301086 cytochrome P4S0. subfamily IID(debrisoq 22 

125822 H03162 Hs.26e766 ESTs 22 

130215 BE301883 H3.152707 glioblastoma ampHfied sequence 22 

133363 AI866286 Hs.71962 ESTs. Weakly simflar to B36298pn)nne-r 22 

126250 AL050391 Hs.321247 Homo sapiens mRNA:cDNADKF2p586A1 81 (fr 22 
70 103392 X94563 gb:asapiens dbi/acbp gene axon 1 & 2 22 

129794 AF161399 Hs.23259 hypothetical protein FU13433 22 

100253 038024 Hs.157425 doUUe homeobox. 2 22 

130743 AIJ049266 Hs.18724 Homo sapiens mRNA:cONAOKFZip564F093 (fir 22 

125466 miM Hs.180461 ESTs 22 

75 122682 AA984531 ris.159293 ESTs 22 

133347 BE2S7758 Hs.71475 add cluster protein 33 22 

104455 AL110261 Hs.157211. DKFZPS6660621 protein 22 

116332 AA491208 Hs.62620 chromosome 6 open reading frame 1 22 

131163 AA099524 Hs.23754 ESTs 22 

80 109592 AI198059 Hs.26370 ESTs 22 

128721 AW403911 Hs.266175 phosphoproteln assocteted with GEMs 21 

114046 BE016658 Hs.141003 Homo s8iptenscONA:FU21 691 fis. done C 21 

128434 A1190914 " H8.143880 ESTs 21 
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103163 AU077018 Hs^35 keraQn4 2.1 

112379 AK001713 K5.17860 hypoUieBca) protein FU10851 Z1 

127507 AA249573 Hs.152616 EST8,MoiMBlysimQartoZN91_HUMANZ 2.1 

- 133097 W03512 Ks.6479 hypotheOcd prot^ MGC13272 2.1 

5 126153 H85692 H$.40730 ESTs 2.1 

122110 AI123000 Hs.301240 mdanocoriin 1 receptor (alpha melanocyt 2.1 

100554 M95923 gb:Human 124ipoxygenasG mRNA. partial c Z1 

104799 AA029703 fib2e95hOd^1 Soares fetaUeaiLNbHH19W ^1 

132664 AI740461 Hs.54542 ESTs 2.1 

10 114620 AA642974 gb:ni«OhOU1 i4a.CGAP.Lym3 Homo saptens II 

115348 AA281562 Hs.292100 ESTs Z\ 

133231 AK000517 Hs.6844 hypolheticd protein FU20510 Z\ 

133160 N54968 Hs.66309 hypoOieOcal protein MGC1 1081 Z1 

124656 AW297702 Hs.102915 ESTs Z1 

15 133576 M19650 H5.150741 ?.3'K:yclicraic)eoQd83'phospiK)dieste t\ 

132676 N92589 Hs.261038 ESTs, Weakly Simitar to 136022 tvpothet Z1 

126505 AA282861 H$.190057 ESTs 21 

118865 AA736405 HsS(530 ESTs II 

134267 A1174S96 Hs.198209 RAE1 (FlNA export 1. apombe) homolog II 

20 134104 L35253 Hs.79107 initogetvactivated protein kinase 14 2.1 

133493 AW998046 Hs.194369 arglnlne^utamic add dipeptkle (RE) re 2.1 

112853 T02643 gb:FB11H5 Fetal brain. Stratagene Homos 2.1 

117457 ^29682 Hs.44071 ESTs. W^s'siiar to ALUS JftlMAN ALU 2.1 

112246 RS1321 Hs.25760 Homo8aptenscDNAFU12252fis.ck)neMA Z1 

25 134869 AL157518 Hs.90421 PR02463 protein Z1 

128869 AA766242 Hs.B0616 hypolhetical protein 2.1 

129179 AW969025 Hs,109154 ESTs 2.1 

104857 AI920902 Hs.ig068 ESTs. Moderately slniar to S65657 alpha 2.1 

101651 AL037111 H8.75641 gdlactose-l-phosphateurUylyltiansferas Z1 

30 129726 H15474 Hs.132898 fatty add desalurase 1 II 

117188 H98gd8 Hs.42612 ESTs, Weekly slmHar to ALU1JKUMAN ALUS II 

126271 A1250773 Hs.270012 ESTs II 

116925 H73110 Hs.260603 ESTs, Moderately sbnOar to A47582 &ce II 

128468 T23625 Hs.150580 putetive translation iniUalion factor II 

35 .116031 AA452239 Hs.103329 K1AA0970 protein II 

130724 AK001507 Hs.306084 HomosapiensdoneFLB6914PR01821 mKNA. II 

121897 AA427419 Hs.229162 EST, Weakly simBar to ZN91 .HUMAN ZINC II 

123808 AA620552 gb:ae58g11.s1 Stratagene hingcardnoma 2.1 

122333 AA625872 Hs.98977 ESTs, Moderately similar to T34561 hypot II 

40 127841 AW136558 Hs.125246 ESTs II 

100023 II 

113002 BE243513 Hs.7212 tiypothetlcd protein PP1044 II 

111567 F12628 H5.334786 hypothetical protein MGC16040 II 

113697 T97183 Hs.17992 Homo sapiens mRNA; cDNA DKF2p434J1726 (f II 

45 128033 AI248705 Hs.149321 ESTs II 

105225 AA211777 gbzn57dOIs1 Stratagene musde 937209 H II 

112370 AF052095 Hs.1 67344 Homo sapiens done 2391 1 mRMA sequence II 

132786 BE083422 Hs.56851 hypothetical protein MGC2668 II 

113226 AI821008 Hs.10697 ESTs II 

50 117997 N52090 H8.47420 EST II 

116998 HB3935 Hs.40535 ESTs II 

127002 AL353940 Hs.24979 hypothetical protein DKFZp761P1010 II 

122591 A1188219 Hs.99311 ESTs, Weakly similar to HSJ^HUI^DNAJ II 

_ _ 107279 S57296 Ks.323910 v-ert>4)2 avian erythroblastic leukemia II 

55 103898 AA248884 gb:k35l7xeq.F Human fetel heart, Lambda II 

110312 BE2S6986 HS.118S6 hypotheiicSl protein FU1 2089 II 
127447 AA386192 Hs.193482 Homo saptenscDNAFU11903 lis, done HE II 
128352 AW137413 Hs.169942 ESTs II 
^- 113649 N94768 Hs.16400 ESTs. Weakly simSar to K1AA1435 protein 10 
60 12B275 AI216235 H8.131240 ESTs 10 
12S976 AA436760 gb:zv67d11j1 SoaresJot3Uetus_Nb2HF6_ 2.0 

120820 AA347417 Hs.96869 EST 10 
134937 AI251449 Hs.171939 ESTs 10 
129602 AI282193 H8.198298 v-sro avian sarcoma (Schmidt'RuppinA-2) 10 
65 129535 AA397972 Hs,169965 dilmerin (diimaerin) 1 10 
106095 AF1 15402 H5.11713 E744ike factor 5 (ets domain transcript 10 
128538 R44214 Hs.101189 ESTs 10 
105593 AA279341 Ks.174151 aldehyde oxidase 1 10 
105788 AB009698 Hs.23965 solute carrier family 22 (organic anton 10 
70 128148 AA918175 Hs.126637 ESTs 10 
125982 R98091 gb:yr30e11.r1 Soarssfeteilh/er spleen 10 

125746 AL137506 Hs.274256 hypothetical protein FU 23563 10 
127835 AA748762 Hs.163113 ESTs, Weakly simOar to 138022 hypotheQ 10 
__ 100661 BE623001 Hs.132748 Homo sapiens ribosomal protein L39 mRNA. 10 
75 124282 AA01B408 Hs.1 10287 ESTs 10 
126926 AA179472 H3.832 ESTs, Highly similar to A41029Integrln 10 
100221 D28383 gb:Human mRNA for ATP synthase Bdtain. 10 

126053 H64450 gbYu62d01.r1 Wdzmannat^doryEpithd 10 

100944 L07518 Hs.159593 mudn6,gastrfc 10 
SO 125581 A1272846 Hs.75309 eukaiyotic translatton etongatton factor 10 
128604 AI879099 Hs.102397 GIOT-3forgonadDtr(ip{nlnducIbtetransc 10 
114612 AI124SS7 Hs.9S456 ESTs 10 
130453 U80735 Hs.173854 PAX banscffpBonadhraBon domain into 10 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



135060 AK001B87 
114419 AI248013 
126283 N403S9 
112003 AWg78731 
127391 AW380B93 
I2ni7 F12209 
126893 AJ2S2060 
1067S8 BE25Z749 
103760 AA642973 
118922 AW208193 
133195 AI434760 
133424 AA350994 
133765 M62194 
132347 BE2710t6 
125599 H13295 
114459 AW445217 
128478 AA7D8205 
127271 H96820 
111122 N63753 
130695 T97205 
133571 BE515037 
119244 AW407564 
127603 A1016798 
113626 T94318 
128115 A1435590 
117639 AA377165 
127033 AF169301 
112411 R43090 
114601 AA075566 
127573 AAS94ig6 
125500 AW952654 
119416 T97186 
115467 A1366784 
128902 AAD36637 
127684 AA668631 
126288 AW449560 
122059 AA431737 
125486 AI023895 
128895 AW467000 
105301 AW3523S7 
125536 F08266 
121387 AA405854 
134126 NW_003747 
126860 BE242814 
102907 BE409861 
127804 AA740634 
130566 R85474 
113782 AK001567 
124119 AA040123 
132490 NM_001290 
125494 AU077029 
100237 D30715 
127687 AW772383 
103136 AF087917 
125704 R55094 
126208 N22588 
131902 AA180145 
128660 AA011S97 
118049 N53145 
134624 AF035119 
127432 AW067708 
126414 A1363157 
120861 AA350394 
124669 AI571594 
126096 F08208 
103891 NM.007212 
128727 AI223335 
126831 AI929107 
125360 AWB98892 
124276 mm 
126524 Z45455 
126647 AK000283 
125957 H41694 
121782 AW452957 
124059 BE3e7335 
130945 U20562 
126348 T16243 
103556 BE516547 
126982 AA211419 
125613 AA76S957 
129601 AB032964 
126007 K51097 
123827 AA909619 



Hs.259842 protein kinase, AMP-acQvatsd. gamma 2 n 2.0 

Hs.106532 ESTs.W^ simitar to 138588 reverse! 2.0 

Hs.271896 ESTs 2.0 

Hs.301824 hypotheBcd protein PR01331 2.0 

Hs.1 1039 hypdheGcal protein MGC2722 2.0 

Hs,173380 CK2lnteracGng protein 1;HQ0024cprote 2.0 

Hs.26320 TRABIDprol^ 2.0 

Hs.20556 hypolheOcd pnslein FU20345 2.0 

Hs.1 83842 ubit^nB 2.0 

H5.91065 hypotheticai protein DKFZp761B2423 2.0 

Hs.279949 KIAA1007 piolein 2.0 

Hs.20281 KiAA1700 2.0 

H5.75929 caditerin11.type2,OB-cadhei{n(osteob 2.0 

HS.169B50 ESTs, Weakly similar to T21554hypo(h8g ^0 

HS.10S135 ESTs 2.0 

Hs.103362 ESTs 2.0 

Hs,100343 ESTs 2.0 

gb:yv99b03 j1 Soares melanocyte 2NbHM Kd ^0 

Hs.16492 DKFZP564G2022 protein 2.0 

Hs.17g98 ESTs. Wealdy similar to 2109260ABC6II 2.0 

Hs.177556 melanoma antigen, family 0, 1 2.0 

HS.27S865 ribosomal protein SI 8 2.0 

Hs.9925 tiypotheticai protein FU20772 2.0 

Hs.17359 ESTs. Moderately similar to l^44_HUMAN 6 2.0 

Hs.130168 ESTs 2.0 

Hs.44833 ESTs 2.0 

H8.S098 sulfate transporter 1 2.0 

Hs.271510 ESTs, Moderaleiy simflar to ALU1 J<UMAN A 2.0 

Gb2m86i06.8 1 Stratagene ovarian cancer 2.0 

Hs.269464 ESTs. Weal(}y similar to S65657alpha-1C- 2.0 

Hs.244624 ESTs 2.0 

gb7a50h09^1 Soares fetal Dver spteen 2.0 

Hs.48820 TATA box binding protein CrBP)-ffiSOciate 2.0 

Hs.107052 ESTs 2.0 

Hs.32556 KlAA0379prol^ 2.0 

Hs.89576 inner mitochondrial membrane pepOdase 2 2.0 

Hs.98749 EST,Moderately8iniitertoT4267ltiypoti) 2.0 

Hs.190587 ESTs 2.0 

Hs.106985 ESTs 2.0 

Hs,7457 MAGE1 protein ZO 

Hs.77g48 ESTs. WealdysimllartoALU1.HUMAN ALUS 2.0 

gb3u66g0d.s1 Soares.teslis.NHT Honno sap 2.0 

Hs.131814 tankyrase,TRF14nteractinganJ?yrifrfBia 2.0 

Hs.323494 ESTs. Wealdysimnar to T27544 zinc resi 2.0 

Hs.202833 lieme oxygenase (decydhig) 1 2.0 

Hs.292084 ESTs 2.0 

Hs.16073 ESTe 1-9 

H3,311002 Homos8plenscONAFIJ10705fis,don8l^ 1.9 

H5.248953 sohite carrier fanriily 27 (fatly acid tra 1.9 

Hs.4980 Ul^ domain binding 2 1.9 

Hs.177543 antigen Identified t^nranodonalantibod 1.9 

Hs.306333 Human PAP (pancreaSUs-associated pmt 1.9 

Hs.300635 ESTs 1.9 

Hs.247936 olfactory receptor, family 1, sidrfamily 1.9 

Hs.26239 Human DNA sequence from done RP1 1-438B2 1.9 

HS.288S48 Homo sapiens cDNAFUl 2368 fis, done MA 1.9 

HS34348 Hom9sefiensmRNA;cONADKFZp434P0235a 1-9 

H5.177398 ESTs 1-9 

gb7v55fD9.8l Soares fetal liver spleen 1.9 

H$.87ro deleted in livercancerl 1.9 

H$.170311 heterogeneous nuclear ribonudeopfotein 19 

Hs.24756 hepatocyte growth fador-feguteled tyros 1.9 

88.96952 ESTs 1-9 

Hs.102943 hypothetical protein MGC12916 1.9 

HS.2B3844 similar to rat tricarboxytatecanter-O 1.9 

Hs.124166 ring finger protein 2 1.9 

HS.S0651 Janus kinase 1(3 protein tymsinatdnas 1.9 

Hs.79933 cydini 1-9 

Hs.189741 ESTs 1.9 

gb7s91a1 Ul Soares reOna ht2b5HR Homo 1.9 

Hs.162447 heterogeneous nudearribonudeoproteln 1.9 

Hs.270502 hypothetical protein FU20276 1.9 

gb7o06b06.r1 Scares adult brain N2b5HB5 1.9 

Ks.334698 Homo sapiens, done MGC:15203.niRNA. com 1.9 

Ks.283713 ESTs.WeaidysimitertoS64054hypothell 1.9 

Hs.2149 acflnCto protein 1.9 

Hs.6473 Homo 8aptenscDNAFU13992 lis. done Y7 1.9 

Hs.2785 keratin 17 1.9 

gb2nS5g05.8l Strategene musde 937209 H 1.9 

Hs.21077 K1AA0532 protein 1.9 

Hs.115726 KIAA1 138 protein 1.9 

Hs.143261 ESTs 1.9 

Hs.112668 ESTs 1.9 
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111587 A1125887 H5^734 ESTs 1-9 

135231 BE613615 Hs.74280 hypothetical protein nJ22237 1.9 

128897 AW979134 Hs.10700 hypothelical protein 1.9 

109891 H04757 Hs^23176 ESTs 1.9 

5 1277W AA679509 gb:ag72c023l Gesster Wilms tumor Homos 1.9 

129340 H75334 Hs.11050 F4xn only protein 9 1.9 

126502 T10077 Hs.13453 hypothetical protein FU 14753 1.9 

129819 AA209534 H5.284243 tetmspanNET<6 protein 1.9 

127136 R36277 Hs.7773 Homo sapiens uhlquiBncon^igalingenzym 1.9 

10 110836 H72868 Hs.19110 ESTs 1.9 

12^2 BE250742 Hs.106673 eukayotic translation iniliaOon factor 1.9 

104669 AA420450 Hs792911 ESTs, Highly similar to S60712band^ 1.9 

130829 6E262530 Hs.2006 glutathione S-transferase M3 (Ivain) 1.9 

125768 AI557486 Hs.119122 itaomat protein L13a 1.9 

IS 123813 AA609158 Hs^1166 EST 1.9 

127506 T61039 Ks.252574 rilnsomal protein LlOa ' 1.9 

123546 AA608817 Hs.112597 EST 1.9 

126516 R95872 Hs.117572 chemoldne binding protein 2 1.9 

103973 AA305729 Hs.18272 amino acid transporter system A1 1.9 

20 127426 AA854756 Hs.124076 ESTs 1.9 

112339 R56570 Hs,50547 ESTs 1.9 

129101 NIA.013403 Hs.108665 zinedin 1.9 

109442 AW296134 H8J6999 ESTs. WeaMy similar teS656578lpha-1C- 1.9 

118103 AA401733 Hs.184134 ESTs 1.9 

25 125752 AW136622 Hs^6673 ESTs 1.9 

102926 W28363 HsJ39752 nudear receptor subfamily 2, group F,m 1.9 

133975 C18356 Hs.29S944 tissue factor pathway inhibitor 2 1.9 

134470 XS4942 HS.6375B COC28 protein Idnase 2 1.9 

127329 AW160551 Hs.124021 soggy-1 gene 1.B 

30 126659 T18245 gb:NIB1005RNom\aIized infant brain, Ben 1.8 

127297 AW629485 Hs.140720 GSK-3 binding protein FRAT2 1.8 

127640 AI557486 Hs.l 19122 ribosomal protdn L13a 1.8 

103409 NM_004454 Ks.43697 ets variant gene 5 (ets^slaled molecule 1.8 

127964 F06298 gb:HSC13F081 nomiaDzed infant brain cDN 1.8 

35 122365 AA813546 K3.99034 GTP-binding protein i%o7 18 

128193 AJ224442 Hs.155020 putative metiiyttransferasa 1.8 

115173 BE61294a Hs.88252 ESTs 1.8 

125532 Ai734146 Hs^71800 ESTs, We^ similar to alternatively sp 1.8 

126541 AJ271671 Hs.7854 zinc/Iron regulated transporter-lilce 1.8 

40 127309 AI66976S Hs.133184 ESTs 1.8 

129062 AA452970 Hs.155216 E1B-55ia)»associaled protein 5 1.8 

126770 AI292320 Hs.61361 heterogeneous nudear ribonudeoprotein 1.8 

127775 AA128808 Hs,179902 transporter-flke protein 1.8 

126994 AA455265 Hs.86686 ESTs, Moderately similar to 154374 gene 1.8 

45 130734 AW137091 Hs.18624 KIAA10S2 protein 1.8 

114461 AA531187 Hs.126705 ESTs 1.8 

100842 U05597 gbiHuman anion exchanger 3 cardiac Isofo 1.8 

127389 T65126 Hs.12743 camltbe O-octanoyllransferase 1.8 

125394 BE178502 Hs.173772 ESTs. Wealdy similar to 178885 serinefth 1.8 

SO 107735 AA016239 HS.6071S ESTs 18 

125669 R51308 Hs.333256 ESTs. VlfeaHy similar to ALU8JWMAN ALU 1.8 

100370 D79989 Hs.184884 KiAAOl 67 gene product 1.8 

113479 Ai023133 Hs.10739 ESTs 1.8 

105165 BE2807B7 Hs,16079 hypothetical protein FU1 0233 1.8 

55 120602 AA808018 Hs.109302 ESTs 1.8 

112399 R60920 Hs.296770 KIAA1719 protein 1.8 

123474 AA599209 gb:8934b1U1 Jia bone marrow stroma Hom IJ 

134212 AA654353 Hs,17719 EBP5WDZInteractorof64l(D 1.8 

104204 AK001691 Hs.57655 hypothelical prol^ FU10829 1.8 

60 127454 AW971875 Hs.292071 ESTs 1.8 

116716 All 17440 Hs.170263 tumor protein p53^lndin9 protein. 1 1.8 

115041 AA252457 Hs.86543 ESTs, Moderately similar to T00256hypot 1.8 

132380 AW373665 Hs.46B53 ESTs 1.8 

120087 AF186780 Hs.79219 RaiGDS-fflce gene; KIAA0959 protein 1.8 

65 116356 Ai371223 Hs.288671 Homo sapiens cDNAFU11997fis, done HE 1.8 

125499 H10543 gb7m04c05j1 Scares infant br^ 1 NIB H 1.8 

12B846 AA730767 Hs.285753 SCG10.tikeiir6tein 1.8 

123869 AA620924 Hs.112923 EST 1.8 

108889 AA135722 Hs.61481 ESTs 1.8 

70 126528 Z24895 gb:HS687F122STRATAGENE Human skdetelm 1.8 

127629 AA293279 Hs.29173 hypothetical prot^ FLJ2051 5 1.8 

130004 AA703664 Hs.245474 ESTs, Moderately similar to ALU5_KUMAN A 1.8 

130847 A1672483 Hs.20220 lipase protein 1.8 

111620 R14853 H5.30747B EST, Weakly similar to 139058 hypolheU 1.8 

75 131971 BE567100 Hs.1S4938 hypothetical protein Mi>S025 1.8 

121360 AA405635 Hs.96854 ESTs. Weakly similar to DYUeHUMANCYTOP 1.8 

127705 AJ003322 obAI0O3322 Selected chromosome 21 cDNA 1.8 

124687 AA833902 HsJ70745 ESTs 1.8 

126698 AI221147 Hs.145088 ESTs, Weakly similar to T1 5936 hypotheti 1.8 

80 126730 AA442429 gb2v70g02.r1 Soares_lolalJeUjs.Nb2HF8. 1,8 

127916 A1239950 Hs.294111 ESTs. Moderately simaar to B34087 hypo 1.8 

128408 AI183407 Hs.143704 EST 1.8 

128440 AWD90340 Hs.l4337 Homo 8aitocONARJ14407 lis. done HE 1.8 
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i^S ^^l!?JLll 9b:8ri9g05j»1 SoaresjDteUeluSulfl>2HF8. 

^^25?^ "^-^^^^ EST8.WeaklysimlIartoS55Q24nebu!int 

107242 AB020&72 Hs.175411 KIAA0865 pretek) 

13^ Ain5943 Hs.32e067 hypolheScd fwteh MGC5178 

125387 AJ243689 Hs.B127 KIAA0144 gene product 

121578 AA398791 Hs.178185 ESTs 

132944 T96641 Hs.5127 HonK)5a;fenscONA:FU23020fi8.cteneL 

126295 AI281459 Hs270114 ESTs 

I3333S BE251012 Hs.263812 nuclear distribuSon gene CCA.nldutat5) 

129879 AK001696 H5.13109 Ran binding protein 11 

25175 W52355 Hs.303030 EST 

26919 AA577730 Hs.1886B4 ESTs. Weakly similar loPC4259fenftin 

27773 AA725863 Hs.120508 ESTs 

26495 AB029021 Hs.137732 WMI 098 protein 

26948 AW968535 Hs.14328 hypothetical protein RJ20071 

26671 CX)310S Hs^65847 GGI-19protein 

15428 AA284112 Hs.94680 ESTs. WbaMysbrilar to 178885 seiiheA 

28232 AI830319 Hs.334641 hypothetical protein DKFZp434t1 916 

26082 H81188 Hs.269571 ESTs 

20467 AW292562 H5.187628 ESTs 

24041 AW590171 Hs.101413 ESTs 

105012 AF0g8158 Hs.9329 chfomosome 20 open reading frame 1 

23951 AB012922 Hs.173043 metaslasb^sodaled l-IDce 1 

26449 AF223944 Hs.325443 breast cell glutantinase 

124554 N85961 sb2a27d03^1 Scares fetalGver spleen 

133651 At301740 Hs.173361 dihydropyffniid!naseae2 

128780 R12421 Hs.5811 chromosome 21 open reading frame 59 

125651 AA491830 Hs.2S689 ESTs 

25888 H18298 9b7n48b09.r1 Scares aduH brain N2t)5HB5 

127245 AA323958 gb:EST28810 Cerebenum II Homo sapiens c 

11223 AAfi52773 Hs.334838 K1AA1866 protein 

15611 R44789 Hs.33191 Homo sapiens. Simflar to transmembrane r 

24846 R59977 Hs.158196 transcriptionai adaptor 3 (ADA3. yeast 

00397 D84424 Hs.57697 hyaiuronan synthase 1 

27180 T27097 H8.22790 ESTs 

102598 BE250742 Hs.106673 eultaryotJc translation Initiation factor 

134076 AF086215 gb:Homo sapiens fall length Insert cDNA 

15659 W99382 Hs.2B3709 Bpopolysacdiaride Specific nesponse.7p 

25555 R19382 Hs.117869 ESTs 

12 AI138886 Hs.143243 ESTs 

27710 AA682B67 Hs.191901 ESTs 

25445 A14S2722 Hs,7709 WWdomain binding protein 1 

29951 AL110282 Hs.268024 Homo sapiens, done tMAG&3873720.mRNA 

19898 R93325 Hs.58690 ESTs 

29703 BE388665 Hs.179999 Homo sapiens, done IMAGE:3457003,mRI^ 

33531 BE276738 Hs.74578 DEADftI (Asp-GhMla-Aspftfc) box polypep 

19726 AF086289 Hs.234766 sMn-speciiic protein 

25198 W69474 Hs.323140 ESTs 

21414 AW291477 Hs.188763 testis expressed sequence 13A 

12542 AI458867 Hs.24276 ESTs 

101358 H413058 Hs73952 prollnwich protein Haelll subfamiiy 2 

25820 AA730136 Hs75561 teralocarctnom&4erived grovrth factor 1 

29091 AA056483 Hs.301463 Human Chromosome 16 BAC done CfT987SK-A 

32609 U20165 Hs,53250 bone morphogenetic protein receptor, typ 

19447 W31714 Hs.1226S6 ESTs. Highly simOar to lbrmb)2'aepr 

13675 T81034 Hs.14841 ESTs 

13701 T97301 Hs.l8026 ESTs 

16180 AA4639Q2 Hs.13522 ESTs, Weai^ly similar to 138022 hypothet 

127133 AA280740 Hs.29a072 ESTs. Moderately slmflar to A45010X-(in 

13316 T70318 Hs^68581 ESTs 

23316 A129D561 Hs.155361 ESTs 

22638 AL137476 Hs.123609 Homo sapiens mRNA; cDNA DKFZp434l0623 (f 

105053 Ai884911 Hs.32989 receptor {caldtonin)actwitymodi^ 

03305 X82279 gbiRsapiens Fas, Apo-1 gene (promoter a 

10384 H45282 Hs.268798 ESTs 

15626 AW630870 Hs.86674 ESTs. WeaWy slmflar to hypothetical pro 

26905 AW504027 Hs.15301 Homo sapiens CDNAFU12596 lis, done NT 

130820 At353934 Hs.288798 hypothetical protein FU21 01 2 

12394 AKD00373 Hs.8358 hypothetical protein FU20386 

29589 AW504292 Hs.11517 ESTs 

26446 NM.015670 Hs.11B926 sentrin/SUMO^pedflc protease 3 

26547 U47732 Hs.84072 transmembrane 4 superfamilymemtwr 3 

20287 AF219946 Hs.102237 tut>by super-famlly protein 

129991 R2B386 Hs.179925 ESTs. Weakly similar to AHJ8,HUMAN ALU 

23912 AA621283 Hs.332855 EST 

102071 AL120051 Hs.144700 ephrin-BI 

21046 AB033083 Hs,97377 KIAA1 257 protein 

128403 AI908006 Hs.295362 Homo sapiens cONAFU 14459 lis. done HE 

104268 Ali)43864 Hs.70604 ATPase, Class II. type 9A 

11598 R11505 Hs.268912 ESTs 

28109 AW269421 Ks.12B093 ESTs 

25435 R08480 Hs.272138 ESTs. W^ similar to ALU1J1UMAN ALU 8 

33104 AI091195 HS.65Q29 growth aresUpedSc 1 
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126826 AA099764 gb2n61f12j1 StrBtagenemusda 937209 H 1J 

106483 NM.006S46 Hs.30299 (GF-U ntfVIA-binding prolem 2 1.7 

129765 M86933 Hs.1238 amologenin (Y duDmosome] 1.7 

115904 AI167560 Hs.61297 ESTs U 

J 125514 AB040912 Hs.191098 hypolheflcal protein aJ1159B 1.7 

125797 K03117 Hs.111497 similar to mouse neuionalprofein 15.6 17 

133179 U81599 Hs.66731 hmieoboxB13 1.7 

115167 AA749209 Hs.43728 hypotheGcal protein 1.7 

11803S AI471B82 Hs.196008 Homo sapiens cONAaJ11723b, done HE 1.7 

10 124540 N63232 gb7239a12.s1 Morton Fetal Codilea Homo 1.7 

126183 BE018708 H8.81972 SHC (Src homology 2 domalfvcontdning)t 1,7 

127897 AA773881 gt):af77b12.r1 Soares.NhHMPu^SI Homosapi 1.7 

126660 F07097 Hs. 133865 transmembrane 6 superfamfly member 1 1.7 

126972 NM_016255 Hs.95260 Autosomal HIgWy Conserved Protein 1.7 

IS 130805 BE514362 HS.30G024 FK50&b!nding protein 3 (2S)cD) 1.7 

127541 AA573449 Hs.171515 ESTs 1.7 

127392 AI816736 Hs.14896 DHHC1 protein 1J 

106879 AI19076S Hs.33020 Homo sapiens, done 1MAG£:3939 1 63. mRNA. 1.7 

12B303 AI096444 Hs.7187 hypotheflcd protein FU 10707 1.7 

20 126469 BE384361 Hs.182885 ESTs. VltoMy similar to J C5024 UOP-galac 1.7 

125766 BE174587 Hs.289721 growth arrest spedliclranscript 5 1.7 

132332 AW978906 Hs.45005 hypothetical protein FU12960 1.6 

127142 AW452942 Hs.130393 ESTs 16 

128416 F13165 Ks.12549 ESTs. Weaiay similar to 210926QABceO 16 

25 103790 AL122044 Hs.331633 hypotheficd protein DKFZp566N034 16 

134578 AL110193 Hs224137 hypothetical proldn 1.6 

110023 AW294701 Hs.31040 ESTs 16 

125511 AJ271379 Hs.76194 ribosomal protein 85 16 

111483 1^9 Hs.269534 ESTs 16 

30 127363 AF064104 Hs.22116 CDC14(ceDdivfsioncyde14.S.cerevi 1.6 

126231 AA991766 Hs.300793 ESTs 16 

106181 Ai803651 Hs.191608 ESTs 16 

114767 AI8S9865 Hs.154443 minidinmosome maintenance (le(ideitt(S 16 

119929 WB6464 Hs.304825 ESTs 16 

35 132542 AL1377S1 H5.263671 Homo sapiens mRNA,'cDNAOKFZp434l0812(f 16 

127155 AA2B4993 gb3l23e10.r1 Soares ovary tumor NbHOT H 16 

125956 AK00O2t4 Hs.129014 hypothetic^ protein FU 20207 16 

126854 AJ275968 Hs.71414 transcription factor (SH^IF gene] 16 

131330 013969 H5.184669 zinc finger protein 144 (Mel-18) 16 

40 129445 W52452 Hs^797 ribosomal proteb L10 16 

113427 T85105 Hs.15471 ESTs 16 

106124 H93366 H3.7567 Homo sapiens cDNA: FU21962 lis. done H 16 

128135 AA954381 Hs.269721 ESTs, Moderately Similar toAliJIJIUMAN 16 

111460 R02728 Hs.117331 ESTs 16 

45 125636 H12382 Hs.25119 ESTs, Weakly similar to YEX0.YEASTHYPOT 16 

134118 6E336680 Hs.182877 KiAAOII 6 protein 16 

111570 AF059203 Hs.20580 sterol O^tiansfeiase 2 16 

113511 T89578 Hs.189740 ESTs 16 

113286 AW449560 Hs.89576 inner mitochondrial membrane peptidase 2 1.6 

50 109875 H03260 Hs.30385 ESTs 16 

105930 AF016371 Hs.9880 peplidylprdylisomer8seH(cyclopliilin 16 

105564 BE616694 Hs.288042 hypothe5cdl protein FU14299 16 

128063 AI377750 H3.167177 ESTs 16 

109779 AB029396 Hs.3353 beta-l^ucuronyttransferase 1 (gtucur 16 

55 125334 T86569 Hs.182118 ESTs 16 

127208 AW816490 Hs.337508 ESTs 16 

108845 AW362901 Hs.68864 ESTs.WeaMydmHartophosidiafidylseri 16 

132520 AA257992 Hs.50651 Janus kinase 1 (a protein tyrosneldnas 16 

114062 AI560984 Hs.27283 ESTs 16 

60 122S50 AA451859 Hs.99253 ESTs 1.6 

113413 R08872 Hs.186512 ESTs 16 

127019 AI929355 Hs.286128 hypofheiicai protein FU23329 1.6 

106251 R12607 H3.35101 proGneM Gla (&carbaxyg!utamic edd 16 

112670 AL138012 H5.183840 ESTs, Moderately drdter to ALU7_HUMAN A 16 

65 114913 A143S199 Hs^8940 EST8.Vlteddy8{miIar to 138022 tiypotheQ 16 

126604 AI023299 Hs.269806 ESTs 16 

125324 R07765 gb:yf1Sc06.r1 Soares tela! liver spleen 16 

121438 AW445024 Hs.139389 ESTs 16 

127289 AI041014 Hs.220752 ESTs. We^slniar to unnamed protein 16 

70 128935 A]ig8535 H3.89463 potassium tergeoonductancecaldum-acfl 16 

132430 AW973652 Hs.283105 ESTs 18 

133541 H75334 H3.11050 F-box only protein 9 16 

102612 U65402 Hs.248124 G proteiiHxniptod lacepior 31 16 

120228 AI192528 Hs.164537 ESTs 1.6 

75 122652 AA454641 gb3x99d05.s1 Soares.NhHMPu.SI Homosapl 16 

103456 AA4g6425 Hs.9629 papBIaiy renal ceO carcinoma (transloc 16 

105355 AL031447 Hs.26938 Homo saptens, done IMA6E:4053044.mRNA, 16 

108043 AA042873 Hs.160412 ESTs 16 

128695 HM.003478 Hs.101299 cuffinS 16 

80 127984 AA846377 Hs.193706 ESTs. W^aMysimHar to ALUS JiUMAN ALUS 16 

124405 AA228137 Hs.25005 hypolheticai protein MGC3329 1.6 

103934 BE276111 Hs.134200 DKF2PS64C1 86 protein 16 

12419$ H83034 gb:yq48e07j1 Soaresfetelliversptoen 16 
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110938 N48982 Hs.38034 Homo salens cONAFU12924Gs, done NT 1.6 

102687 NM_007019 H$.930O2 ubiqutln carrier protein E2-C 1.6 

121228 AA364109 Hs.1779g0 ESTs 1.6 

120415 AA235810 gb2s41a03^1 Soare3.NhNMPu.S1 Homosapi 1.6 

5 123864 AA620882 gb:af95g01.s1 SoaBs.tesUsJIHT Homo sap 1.6 

125045 All 14630 Hs.208334 Homo sapiens cDNA:nJ21874 (is, clone H 1.6 

133425 AA4443gO Hs.15S482 hydroxyacyl glutBlhione hydrolase 1.6 

126578 AF151861 H5.107528 androgen induced protdn 1.6 

102406 U43177 (NONE) 1.6 

10 114126 BES66362 Hs.7063 Homo sapiens cDNA:aJ20913 redone A 1.6 

125233 W85713 Hs.110092 ESTs 1.6 

109635 1^296 Hs.169161 ESTs. Highly similar to MAONJUiMANNADP- 1.6 

125675 BE294972 HS.S6406 Homo sapiens CONAHJ13549 lis. done PL 1.6 

129707 AW572317 Hs.12082 Homo sapiens mRNA;d)NA[)KF20566L2O3(tr 1.6 

IS 127569 A1765107 Ks.274422 hypothetical pTOtefai FLJ20550 1.6 

113302 T66919 Hs.268575 ESTs 1.6 

119705 Ai984203 Hs.57874 ESTs 1.6 

127226 ALj036559 Ks3463 ribOGomd protdn 823 1.6 

123469 AA599708 gtKagI 1a1 0^1 Gessler Wilms tumor Homos 1.6 

20 107468 AA740979 Hs.91389 ESTs 1.6 

115916 AI052731 Hs,91910 ESTs 1.6 

127B15 AA743490 Hs.255015 ESTs 1.6 

100364 NM.004341 Hs.154888 cart»moyH)hosph8le synthetase 2, aspart 1.6 

125568 AW515396 Hs.105613 ESTs 1.6 

25 105260 N81201 Hs.31755 ESTs 1.6 

125659 TS7693 Hs.87929 Homo sapiens cDNAFU13707fis. done PL 1.6 

111275 N70970 Hs.35006 ESTs 1.6 

106542 AA339541 Hs.24956 hypotheOcal protein RJ22056 1.6 

133423 T84084 H5.196008 Homo sapiens cDNARJ1 1723 fls, done HE 16 

30 124770 AA9B4414 Hs.120429 ESTs 1.6 

117936 AI382904 Hs.47213 ESTs 1.6 

134385 M14660 Hs.169274 ESTs. Highly sImDar to IFT2_HUMAN INTER 1.6 

108357 AW410478 H5.104019 Iransfomiing, addle coiled^»il contain 1.6 

131143 NM_000312 Hs.2351 protein C(inadivalor of coagulation fa 1.6 

35 105441 N28522 Hs.893S quMnate phosphodbosyltransfefBse (n 1.6 

128215 AA973310 gb:op91e06AlSoarBS^NFl^T.6BC.S1Hoino5 1.5 

127344 AI003929 Hs.80624 hypothetical protein MGC2560 1.6 

126478 BE541249 Hs.109697 ESTs 1.6 

122053 AI637498 Hs.98745 ESTs 1.5 

40 111760 BE551929 Hs.268754 Homo sapiens cDNARJ1 1949 lis, done HE 1.5 

112401 R61279 Hs.237538 EST«.Vte8JdysimllarloAF151067l HSPC2 1.5 

103023 AW50t)470 Hs.117950 multifunctional polypepSde similar to S 1.5 

125575 H14983 gb:ym19h09.r1 Soares Infant brabi 1 NIB H 1.5 

128765 AF073310 Hs.143646 insuHn receptor substrate 2 1.5 

45 108935 AA147848 H5.67991 hypothefical protein OKFZp434G0522 1.5 

121221 AI140708 Hs.97461 ESTs 1.5 

120091 AW024672 H859558 EST 1.5 

107375 BE011845 Hs.251064 htgh-mot% group (nonhlslonedtfomoso 1.5 

125803 AW876115 Hs.29852 ESTs 1.5 

50 115132 AAB11762 Hs.71433 ESTs 1.5 

113346 AF143876 Hs.14318 Homo sapiens done tMA6E:113399mRNA8eq 1.5 

107357 U63973 Hs.103501 ihodopsln Wnase 1.5 

125443 BE251057 Hs.177592 ribosomai protdn, large, PI 1.5 
133803 M24461 Hs.76305 surfactant, pulmonary-assodaled protein 1.5 

55 113378 T80738 Hs.14757 ESTs 1.5 

105540 BE391690 Hs.9265 hypothelical protein FU20917 . 1.5 

127446 F13008 gb:HSC3H£011 nomtillzedlnfBnt brain d}N 1.5 

134075 NM.012201 H3.78979 Golgi apparatus protein 1 1.6 

127585 AA604144 H5.190632 ESTs 1.5 

60 125824 Z45258 KsJ286013 short coBed^protdn 1.5 

127606 AA621135 Hs.136552 ESTs 1.5 

125585 AW298113 Hs.92909 SON DNA binding protein 1.5 

107757 BE621721 HS.2B0792 hypothetical protein HJ1 2387 slndtar to 1.5 

109978 H09356 Hs.22S28 ESTs 1.5 

65 132297 BE272446 Hs.265317 hypothetical protein MGC2562 1.5 

115784 AW402151 Hs.54673 tumor necrosis factor {ligand)5upeiiaml 1.5 

127880 W39735 Hs.73818 ubiqulnokytochrome c reductase hinge p 1.5 

102305 ALD43202 Hs.90(nr3 chromosome segregation 1 (yeast homdog) 1.5 

„^ 102868 XD2419 Hs.77274 plasminogen actWator, uroldnase 1.5 

70 133457 J04948 Hs.333509 alkaBne phosphatase, placentaHlke 2 1.5 

130339 AA435746 gb2t79e03.s1 Soares.tesOs^NHT Homo sap 1.5 

125444 N26476 Hs.15916t RhoGDPdissod3tioninhibftor(GO0aIp 1.5 
123470 AW303285 Hs.303632 Human DNA sequence torn done RP1 1-1 10H4 1.5 
100025 1.6 

75 127063 A1276526 Hs.331564 Homo sapiens mRNA;cONADXFZp434H1 215 (f U 

127945 AA815031 HS.123S98 ESTs 1.5 

111557 R09510 Hs.20373 EST 15 

116009 AW137635 Hs.44238 ESTs. WfeaMy similar to S65657aIpha-1C- 1.5 

119858 W01370 Hs.46824 ESTs 1.5 

80 108509 A1042309 Hs,64552 hypothetical protein MGC1 5563 1.5 

124124 AW294404 Hs.14451S Homo sapiens cONAi=U 11 6728$. done HE 1.5 

126713 AW249181 Hs.19954 ESTs.Wte8kly8bidar(oT19873hypotheti 1.5 

128475 AW95g07S Hs.238797 ESTs. Moderately sinto to 138022 hypot 1.5 
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126851 R40611 H5.137565 ESTs 1^ 

104820 AW162768 Ks.22620 ESTs U 

127235 /U817309 HsJ22S583 ESTs, Weakly sbnilsr to 2004399Achromos 1.S 

_ 126552 AF168711 Hs.159397 x 010 prolan 1.5 

5 127523 AA517637 gb:np34h1Z5l NO.CGAP.Lul Homo sapiens 1.5 

131692 BE559681 Hs.30738 K1AA0124 protein 1.5 

112974 AI.353965 Hs.101174 microtubule-assoclatsd protein tau 1.5 

118921 N91914 Hs.54751 ESTs 1.5 

100676 X02761 Hs.287820 fibronecfinl 1.5 

10 127721 T59578 Hs,188440 EST8.Wea)dy5tmilartoALUFJHUMAN!lIl 1.5 

115254 AA279024 Hs.269316 ESTs. Weakly 8lniilark>S65657slph8-1C 1.5 

128173 AI457242 Hs.127024 ESTs 1.5 

126846 AA663527 H5.1 16910 ESTs 1.5 

125294 R40025 Hs.10S551 ESTs 1.5 

15 127494 AW978730 Hs.291956 ESTs. WeaMy similar to AIU8_HUMAN ALUS 1,5 

134191 W26532 Hs.7979 KIAA0736 gene product 1.5 

107394 AA884798 Hs.186180 Homo sapiens cDNA:RJ23038 lis, done L 1.5 

131562 NhL003512 Hs.28777 HZAhislDne family, member L 1.5 

127310 AW450671 Hs.189284 ESTs 1.5 

20 122359 AA523485 gb:n!67f1 1 .si Na.CGAP_Pr1 2 Homo sapiens 1.5 

100524 M80902 Hs.183704 ut}tquItInC 1.5 

128422 T77794 gb:yd20d09.r1 Soaresfetsiliver^leen 1.5 

129902 AA076Z76 Hs.132n hypothetical protdnFU22054 1.5 

126784 T81887 Hs.108854 HSPC183proWn 1.5 

25 123343 AI761902 Hs.99597 ESTs 1.5 

10545B AW954377 Hs.26412 ring finger protein 26 1.5 

112266 AI652534 Hs.25934 ESTs. WeaMy simitar to HSHU 11 hlsloneH 1.5 

127622 AA628222 Hs.97883 ESTs 1.5 

113659 R06545 Hs.189781 ESTs. Weakly slmUar to ALU INHUMAN ALU S 1.5 

30 116892 AI573283 Hs.38458 ESTs 1.5 

126995 NM 014351 Hs.189810 suKortranferase family 4A. member 1 1.5 

111657 R07364 Hs.268657 ESTs. Weakly slmilarloALULHUMAI^ ALU 1.5 

100243 AB028125 Hs.77854 regucaldn (senescence m3rf(erpfotdn-30 1.5 

116153 AF107203 Hs.57937 ataxin 2^inding protein 1 1.5 

35 108892 AK000002 Hs.55879 Homo sapiens mRNA; cDNA DKFZp434U)827 (f 1.5 

113294 AI037922 Hs.11000 Isptin receptor overlapping transcrtpM 1.5 

126691 WD3046 Hs.263664 aspartate bet&tiydnoxytase 1.5 

106979 AW015227 Hs.289053 hypolheticaJ protein FU 14733 1.5 

125546 H09950 gb7m01dl2.r1 Soares Infant brdn 1 NIB H 1.5 

40 113990 AI497945 Hs.83097 hypolheticai protein FU22955 1.5 

129295 U63127 Hs.110121 SECTiwmoiog 1.5 

125431 AW851639 Hs.75584 po)yntyosItis/^deraderma8Utoant]gan2( 1.5 

112558 AK001621 Hs.15921 hypothetical proton FU10759 1.5 

122046 A1560456 Hs.107319 ESTs 1.5 

45 122472 AA448509 Hs.128652 ESTs 1.5 

130753 AA205223 H$.189 phosphodiesterase 4C, cAMP-specific (dun 1.5 

131714 AA642831 Hs.31016 putalhra DMA binding protebi 1.5 

101233 AL135173 Hs.878 sorblto! dehydrogenase 1.5 

109501 AF047437 Hs.90436 spemi associated antigen 7 1.5 

50 128984 AA213820 Hs.256533 ESTs.W^8lmllartoS119981ingerpr 1.5 

125765 BE243877 Hs.76941 ATP8se,Na^4> transporting, beta 3 po^ 1.5 

127693 AA676727 gbJ!j68b11.s1 Soares.falaUhrefjBpleetu 1.5 

128453 X02761 Hs.267820 fibionecfinl 1.5 

119418 T97590 Hs.221711 ESTs. Weakly similar to ALU1.HUMAN ALU 1.5 

55 132669 W38586 Hs.293981 guanine nudeoSdetrinrfing protein (6 pr 1.5 

116708 F10528 Hs.70001 ESTs, Moderately similar to JC6169nud 1.5 

122420 AA446971 gb:wB5f11.s1 Soares_tDtalJetus_Nb2HFB_ 1.5 

100238 L24959 Hs.348 caldum/calmodulin-dependent protein Un 15 

109710 D20044 Hs.12929 hypotheticai protein FU20721 1.5 

60 105704 AI282341 Hs.75431 fibitnogen. gamma polypeptide 1.5 

112712 Rgi060 Hs.330761 ESTs 1.5 

100098 AF003743 gbiHomo sapiens delayed rectifier potass 1.5 

114122 R46128 Hs.12751 ESTs 1.5 

^_ 132397 AA021160 Hs.4750 hypotheGcal protein DKFZp564K0822 1.5 

65 107681 AI566350 Hs.61273 hypothetical protein MGC2650 1.5 

106302 AA39S859 Hs.1B397 hypolhetKat protein FLJ23221 1.5 

125898 AK00t823 Hs.92267 Homo sapiens mRNA; cONA DKFZpS64C2476 (f 1.5 

104957 AI359009 Hs.10026 ntitochondrial ribosomal protein L17 1.5 

102909 NM-00S269 Hs.2693 gltoma^odated oncogene homolog (zinc 1.5 

70 125559 BE297488 Hs.279877 ceH division protein FtsJ 1.5 

109834 H17053 H3.183846 ESTs 1.5 

116607 W05238 Hs.94316 ESTs. WeaWy similar to T3 161 3 hypotheti 1.5 

127175 R11937 gh7B4b08.r1 Scares Infant br^ 1 NIB H 1.5 

110617 W93231 Hs.285901 Homo sapiens, clone iMAGE'.3948563k mRNA. 1.5 

75 125988 W27648 gb.37e10 Human retina cDNA randomly pifm 1.5 

115093 AI241932 Hs.3542 hypolhe&al protein FU 11 273 1.5 

121207 AA705799 Hs.1B3714 ESTs 1.5 

112652 BE269699 Hs.235782 sdutecarrterfamlly 21 (organic anion 1.5 

125213 AB014554 Hs.l09299 protein tyrosine phosphatase, receptor I 1.5 

80 125912 AW867467 Hs.278712 eukaryotic translaOon iniOaibn factor 1.5 

133046 R96881 Hs.63609 Hpall tiny fragments locus 9C 1.5 

122791 AL122055 Hs.129836 KIAA1028 protein , 1.5 
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Tdme268 

Pkey: Unique Eos probesel idenfifier number 

CAT number Gene duster number 

Accession: Genbank accession monbeis 

P^f CAT number Accesston 

108451 13766 27 AA0791 95 AA084955AA1 26308 AA0649S6 

124195 2606i K83034 H52379 

123619 371681_1 AA602964 AA609200 

125165 1852047J W45350 W45406 

125324 1692163.1 R07785T65948 T86972 

126053 1601238.1 H644S0K64464 

126086 1606215_1 H75681H70975 

126098 1629789.1 M79088 N88221 

125464 168460.1 N71807 AA203399 

1^99 1562851.1 H10543R11878 

126127 1205826J N95428 W24040AW751388 H81987 

125546 356478.1 H09950R18413AA570553AW973425 

125549 1702179.1 R20215R1B767 

125558 1703083.1 R59305R19748 

125575 1566885.1 H14983 R21S54 

125743 5025.5 H17151 H11956 

125761 1744008.1 R68351 R68364 

126426 110687 1 AA125984AA127189AA065075AA070377AA100017 

AA079891 AA1 13255 AA075168 AA082764 AA083380 N84829 AA084752 AA076512 AA085119 AAD85208 AA085045 

127155 200358 1 AA284993AA476122AA477923 

127175 1595805.1 R11937Z45532 

126528 1276201J 224B95AW891336 R01294 

125957 1583542.1 H41694H45213 

125976 296453 1 AA436760 AW237453 BE327496 N47347 N56967 

1259B2 1766315 1 R98091W92B98 

125988 1365728 1 W27648 R99193 BE090398 

127245 226662^1 AA323958 AA370268 

127248 227560 1 AA364195AA325029AW962050 

127262 231725.1 AA828125AA834883AA330555 

126659 1541209 1 T16245R19694 F13S45H10299T66048 T65Z79H18006 



126693 


87363.1 


C0S723AA01B342 


127315 


37938.1 


AF116622AI114507AA640834AA377999 


126730 


297653.1 


AA442429T19477 


103898 


187213.-3 


AA248884 


127446 


16Q01_2 


F13008T75435 


126826 


127356.1 


AA099764AA1 12950 


126872 


142696L1 


AW450979AA136653AA136656AW419381 AA984358AA492073, BE 1 68945 AA809054AW238038BE011 212 BE01 1359 BE011 367 


BE01 1388 BE011362 BE011215 BE01 1385 BE01 1363 


128132 


177108.1 


AA225632 AI820970 AI820952AA226472 Ai732140 An320S9 AA226307 






AA225500 


127523 


351071.1 


AA517637AA554963 


126982 


171753.1 


AA211419AA211566 


128215 


530345.-1 


AAg73310 


127704 


405690.1 


AA679809AA694592 


127705 


9662S3.2 


AJ003322AJ003324 


128422 


1611283.1 


T77794T85681 


127697 


446527.1 


AA773681AA773857 


120734 


20888^.1 


AA299948AA299949 


100098 


25117.-13 


AF003743 


114620 


3206^8 


AAB42974AA084223 


122652 


26401.-30 AA454641 


100842 




U9r.HT4398 U05597 


123783 




genban)LAA610112 


12S032 




genbanlLT74884 


123808 




9enbanKjAA620552 


123864 




genbanlLAA620882 


118049 




genbanK.N53145 


102406 




entre2^U43177 


116962 




genbanlLH79677 


134076 


40321.1 


AF086215W02702AA284288 W25655 


125888 


266863.1 


H18298K46830 


127271 


321389.1 


K96820 H79463 


113119 




genbanleT47910 T47910 


104799 




genbanKjyA029703 AA029703 


127693 


790317.1 


AA676727AA704704 


120415 




g8nbanJLAA235810 AA235810 


127964 


135151.1 


F06298R18057 


122359 




681003 1 AA5234e6AW026780AI621660AA443898 


122420 




genbanKM446971 


124276 




genban)LH8346S 


101447 




entraOfi1305 


124540 




genbanKJ163232 


124554 




genbanK.NB5961 


117357 




genb8nlLN24829 


103305 




entrezJ(82Z79 
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103392 


entrez X94S63 


119416 


genbanleT97188 


105225 


9enbanjCM211777 


121292 


genb3nlLM401807 


1128S3 


8enbanKJQ2843 


121387 


genb8n)LAA4056S4 


114601 


g8nbanK.AA075566 


100221 


entrez.D28383 


130339 


genban]U\A435746 


100554 


0gr„HT2241 


123423 


genbanKuAA598484 


123474 


genban)LAAS99209 


123489 


genbanKJ^A599708 



TABLE 27A: ABOUT 895 GENES UP-REGULATED IN COMBINED LUNG FIBROSIS COMPARED TO NORMAL BODY 

Teble 27A lists about 695 genes that are upregulated bi lung fibrosb (cotleclion of IPF, HP. and NSIP) samples as compared wiOi the normal 'body map* samples. These were 
selected from about 59680 probesets on an AflymetrWEos Hu03 GeneChip array sxh that the ratio of 'average' fibrosis sample expression level b 'average* nomnal adult tissues 
was greater than or equal to about 2.0. The 'average' fibrosis sample expressbn level was set to the 90^ peicentite amongst fibrosis samples. The 'average' normal adult tissue 
level was set to the 95^ peicenfale amongst iv»Hnafignant tissues. In order to remove gene^pedfic background levels of non-spedfic hybridization, the 1 5^ percentHe value amongst 
non-mdignarrt Bssues was subtracted from both the numeiator and the denon^ 

Pkey: Unique Eos probeset idenGfier number 

ExAccn: Exemplar Accession nuniber, Genbank aocesskm number 

UnigenelD: Unigene number 

Unlgene Title: Unpens gene title 

R1: RaSo of fibrosis to nonnal body tissue 



Pkey 


ExAccn 


UnigenelD 


1 ■ -* Ttllo 04 

Ungene Title Ri 




431164 


AA493o50 


H3.943o7 


Homo sapiens cDNA: PU23494 ns, done L 


56.0 


424917 


AI636208 


Hs.96901 


Homo sapiens cDNA: FLJ23049 fis, done L 


26.S 


453310 


X70897 


Hs.553 


solute curler fan% 6 (neurotransmitte 


25.5 


457200 


U33749 


Hs.197764 


thyrdd transcription factor 1 


22.2 


414517 


M24461 


Hs.76305 


surfactant, puimonary-assodated protein 


21.1 


423272 


W25140 


Hs.1 10667 


ESTs 


19,4 


418007 


M13509 


Hs.83169 


matrix melattopiotetnase 1 (interstitial 


19.1 


442006 


AW975183 


HS.2926&3 


ESTs 


18.8 


445685 


A1734009 


Hs.1 27699 


KIAA1603 proton 


18.0 


440452 


AI925136 


Hs.55150 


ESTs. Weakly sintolo CAYP.HUMAN CAtCY 


17.8 


422426 


W79117 


Hs.5e559 


ESTs, Weakly similar to rholekln (M.musc 


17.4 


444929 


At685841 


Hs.1 61354 


ESTs 


16.5 


440807 


AW269421 


Hs.128093 


ESTs 


16.3 


408626 


AF216077 


Hs.48376 


Homo sapiens done HB-2 mRNA sequence 


14.2 


446967 


AI6S9S29 


Hs.1 56781 


ESTs 


13.3 


417801 


AA417383 


Hs.82582 


Integrin, beta^ 1 (with EGF-like rep 


12.3 


437119 


At379921 


Hs.177043 


ESTs 


12.3 


451103 


R52804 


Hs^5956 


DKFZP564O206 protein 


11.5 


443450 


N66045 


Hs.133529 


ESTs 


11.4 


411880 


AW872477 




gb:hm30n)3j(1 NCLCGAP Jhy4 Homo sapiens 


11.3 


432519 


AI221311 


Hs.130704 


ESTs 


11.3 


414142 


AW368397 


Hs.1 50042 


ESTs 


11.0 


433283 


BE041135 


Hs.175622 


ESTs 


10.1 


441082 


AW444804 


Hs.202655 


ESTs 


10.1 


452039 


At922988 


Hs.172510 


ESTs 


10.0 


417204 


N81037 


Hs.1074 


surfactant, pulmonary-assodated pnjteln 


ao 


4219S2 


AA300900 


Hs.98849 


ESTs. Moderately similar Id AF161S11 1 H 


9.8 


412372 


R65998 


Hs.118615 


ESTs 


9.8 


426274 


038122 


Hs.2007 


taimor necrosis factor (Dgand) superfand 


9.7 


431007 


AF039564 


HS.24B211 


retinoblastomskfalnding protdn 9 


. 9.4 


443709 


A1082692 


Hs.134662 


ESTs 


9.3 


446232 


AI281848 


Hs.165547 


ESTs 


a2 


448253 


H25899 


Hs^S91 


ESTs 


9.2 


432133 


AB033088 


Hs.272567 


KIAA1262protdn 


ai 


409236 


AL049g90 


Hs.51515 


Homo sapiens mRNA; cDNA DKFZp664G11 2 (fr 


ao 


431353 


AAB28032 


Hs.189076 


ESTs 


as 


450050 


A1681268 


Hs.257883 


ESTs 


as 


458194 


AW383618 


H&265459 


ESTs, Moderately simllartoALU?_HUMANA 


as 


414968 


C16096 


Hs.297777 


ESTs 


a7 


425664 


AJ006276 


H5.1 59003 


transient receptor potential channel 6 


a7 


408562 


A1436323 


Hs.31141 


Homo sapiens mRNA for KIAA1 568 protefai, 


as 


453672 


U73531 


Hs.34526 


G proteln<oou()led receptor 


as 


429420 


AK001679 


Hs.202289 


hypothettod protein FU10376 


as 


421476 


AI683243 


HS.972S8 


ESTs 


a4 


404916 








8.4 


444396 


T65213 


Hs.4257 


ESTs 


a3 


442275 


AW449467 


HS.547SS 


ESTs 


a3 


437479 


R61666 


lte.101277 


ESTs 


a2 


432203 


AA305746 


Hs.49 


macrophage scavenger leoeptor 1 


a2 


431433 


X65018 


HS.2S3495 


surfectent puknonsy-assodaled protein 


7.9 


408747 


AI925153 


Hs.217493 


annexing 


7.8 
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445537 AJ245671 
450025 AK001875 
421798 N74880 
421155 H87879 
446917 A1347883 
422798 R92347 
426630 AA385751 
437157 BE048860 
433231 AB040926 
451561 N52812 
430656 AA482900 
448206 BE622585 
420209 AA2S6444 
426803 AA362568 
427383 NM_005411 
409718 D86640 
443324 R44013 
431924 AK00(^ 
427356 AWD23482 
418735 N48769 
429945 NM.006729 
407510 U96191 
430099 AW19498a 
441635 AB036432 
428508 BE252383 
438202 AW169287 
441233 AA972965 
433384 A1021992 
427043 AA397679 
425921 N»L007231 
438909 AF085639 
433365 AF02S944 
456964 H59846 
445166 AW614544 
431337 N48107 
434819 AA650099 
458219 H22195 
434377 AW137148 
435933 AA605520 
438954 AA740151 
445424 AB028945 
449108 AI140583 
410334 AW979261 
447112 H17800 
447700 AI420183 
449208 AW263635 
445657 AW612141 
421554 AW137676 
435299 AI745458 
416769 AI339257 
433527 AW235613 
452771 T05477 
427585 031152 
411514 AW85017B 
424084 AI940575 
444527 NVL005408 
429710 AI337113 
432113 AA935066 
447997 H00656 
449328 A)962493 
416575 W02414 
432009 AL137424 
434088 AF116677 
444342 NM1439B 
414299 AA142g89 
431041 AA490g67 
448104 AI674B1B 
445279 R41900 
408978 AL133617 
415094 059513 
428244 At584123 
45Z784 BE463857 
455431 AW938484 
449416 A1651016 
421659 NM-014459 
407638 AJ404672 
446164 AW273539 
413048 M93221 
446608 N75217 
419807 R77402 
447154 AF026941 
442652 A1005163 
429496 AA453800 



EGF-IDc^doniain, mulliple 6 

Homo sapiens cONA FU12028 fis. done HE 

N^acylsphlngoslne snftlohydiolasa (add c 



Hs.12844 
Hs.24321 
Hs.264330 

Hs.102267 lysylondase 

Hs.156672 ESTs 

HS.34S74 ESTs 

H$.160392 ESTs 

Hs.1 20655 ESTs 

H8.143552 K1AA1433 protein 

H3.177403 ESTs 

Hs.162080 ESTs 

Hs.3731 ESTs 

Hs.32295 Homo sapiens cONAnJ12604 (is, done KT 

Hs.179747 ecoiropic virai InlegraB on site S 

Hs. 177582 surfactant, puhnonsfy-asso d a t od pretein 

Hs.56045 SIC homology Ihree (SH3) and cysteine rt 

Hs.154225 ESTs 

H5^72203 Homo sapiens cDNAFLt20843fis, done AO 

Hs.97849 ESTs 

H$.44G09 ESTs 

Hs.226483 diaphanous (Drosophila. homolog] 2 

gbiHuman trophoblast hypoxla^utated f 

Hs.20537 Homo sapiens cDNA FU13942 fis, dona Y7 

Hs.184 advanced gtycosylation end product-sped 

Hs.184668 SBBt31 protein 

Hs^588 ESTs 

Hs.135568 ESTs 

Hs.124244 ESTs 

Hs.298460 ESTs 

Hs.162211 fiotuld carrier ^mBy 6 (neuratransmltte 

gbiHomo sapfens fuO length insert cDNA 

Hs^797 ESTs 

HS.12B355 ESTs. Mod8raleiysimDartoALU7_HUMAN A 

Hs.123641 protein tyrosine phosphatase, receptor t 

Hs^93 ESTs 

Hs.291541 ESTs 
ESTs 

osteol)lasl spedfic fador 2 (fesddtn 



Hs^1B74 
H3.136348 
Hs.192075 ESTs 
Hs.130425 ESTs 
Hs.12696 
HS.9832B 
H3.291993 ESTs 
Hs.7154 ESTs 
Hs.171077 
H8.48543 



ooiladfn SH3 domaln-t)inding protein 
ESTs 



ESTs. Weddy similar to dmHar to serin 
ESTs 
Hs.279575 ESTs 

Hs.97775 ESTs, Weakly simliar to Tesfis-specliic 
Hs.122614 ESTs. Weaidy similar to apoptoiic protsa 
Hs.115436 ESTs 
Hs.133020 ESTs 

gb:EST03366 Feta) brain. Stratagene (cat 
Hs.179729 coDagen. type X, alpha 1 (Sdimld metaph 
Hs.18995 KIAA1304 protein 
Hs.20914 Homo sapiens cDNA: FU23056tis. done L 
Hs.11383 smaO Indudble cytddne subfamily A (Cy 
Hs.146025 HomosapienscONA:FU23594fi8,doneL 
Hs.152385 ESTs 
Hs^792 ESTs 
H3.197647 ESTs 
HS.383B3 ESTs 

' gb:Homo sapiens mRNA; cONA OKFZp761G2123 
Hs.249270 hypdhelical protein PR01 966 
Hs.1 0887 simliar b iysosome-associaled memtsane 
Hs.71730 ESTs 
Hs.105276 ESTs 
Hs.178391 ribosomd protein I>t4 
Hs^245 ESTs 

Hs,49421 Homo sapiens mRNA; cDNA DKF2p434M0728 (f 
gb:HUM042H10S Oontedi human fetal brdn 

Hs.42500 ADP^ibosylaiion factor^ 5 

Ks.151258 Homo sapiens cONA: FU21062 fis, done C 

Hs.6073e sialophorin (gpL115. leulcosidln. CD43) 

Hs^46311 ESTs 

Hs.108511 protocadherin 17 

Hs.288693 Homo sapiens cDNA FU 1 1 667 lis, done HE 

Hs.199329 Homosaptens cDNA: FU23S77 Qs, done L 

HS.7S182 mannose receptor. C type 1 

Hs^5784S ESTs 

gb:yrr5f1 U1 Scares placenta Nb2HP Homo 

Hs.17518 Homo sapiens dgS mRNA, parllal sequence 

Hs.201378 ESTs. simliar bKIAAD944protdn 

Hs.192793 ESTs 
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&2 
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6.1 

60 
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5.9 
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5.6 

5.6 

5.6 
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429aS9 NK.007050 
432824 AK001783 
425509 AF079363 
424717 H03754 
436061 AI248564 
444218 AF070641 
453382 AA7092B5 
447033 AJ357412 
417235 AA810278 
41B200 AW529751 
427652 AI673025 
431255 AM97043 
441143 A1027604 
452293 AI871B33 
443903 AI220547 
422352 AA766296 
424105 AI142338 
4397S9 AL3S9055 
428227 AA321649 
430510 AW162916 
425804 BESOISgS 
435347 AW014873 
446002 AI346468 
452883 X80031 
442176 AA9B3764 
443253 A1041212 
419556 U29515 
439920 H05430 
421502 AF111856 
434424 AI811202 
408625 AW243323 
449299 AA299919 
450656 AA010539 
433815 Aie96602 
416879 H98899 
432182 AW607789 
4453B6 A1422005 
450478 AW451709 
453080 A)423056 
435496 AW840171 
443257 AI334040 
453921 AI824009 
419721 Nh4.001650 
432316 AW973235 
435202 A1971313 
440320 AA879294 
438796 W57821 
400269 

447724 AW298375 
446509 AF169693 
451620 AW449B88 
451963 A!d25440 
456408 AI2B834B 
425895 AI269464 
447048 AW393080 
454024 AA993527 
415929 AA724373 
426625 T78300 
434334 AA912476 
437138 AI935622 
455024 AW851309 
436246 AW450963 
416030 H15261 
459267 AJ003631 
445122 AW241632 
414812 X72755 
421160 ALJ08O215 
425734 AF0S6»)9 
429208 AA447990 
442957 Ai949952 
444050 AW138295 
444078 BE246919 
451024 AA442176 
442832 AW206560 
423377 AU)49377 
451895 T93S73 
442353 BE379594 
421484 AA291553 
404043 

407055 X89211 
410008 AA079552 
410247 AF1B1721 
417461 R38403 



Hs.225952 prqteh tyrosine phosphatase, reoeplort 

HsJ79012 hypolheticd protein FU10921 

Hs.156213 sperm assodatsd snfigen 6 

Ks.152213 wingless^ MMTVbitegraiion site fami 

Hs.190745 Homo sapiens cONA:aJ21326fi5. done C 

Ks.10684 Homo sapiens done 24421 mRNA sequence 

Hs.5997 Homo sapiens cONA aJ13078 fis. done MT 

HS.1576W ESTs 

Hs.24250 ESTs 

Hs.205654 ESTs, VteaHy similar to aftemativetysp 

Hs.43874 ESTs 

Hs.115685 ESTs 

Hs.159650 ESTs 

gbnivmSlhOgjcl NCLGGAP.Ut2 Homo sapiens 

HS.13S223 ESTs 

Hs.99200 ESTs 

Hs,43977 ESTs 

Hs.67709 HoRiosapIensinRNAIuOlenglhinsertcDN 

Hs.2248 small IndudblecytoUnesublinillyBCCy ' 

Hs.241576 hypothetic^ protein PR02577 

Hs^S6189 ESTs 

Hs.1 16983 ESTs 

Hs.1457e9 ESTs 

Hs.150318 ESTs 

H5.126910 ESTs 

Hs.132117 ESTs 

Hs.91093 diilinase1(diitotitosidase} 

Hs.144455 ESTs 

Hs.105039 solute carrier family 34 (sodium phospha 

Hs.125365 Homo saj^ns cONA: FU23523 fis. done L 

Hs.266785 ESTs 

gb:EST1 2592 Utenis tumor I Homo sapiens 

Hs.18912 ESTs 

Hs.112757 ESTs 

Hs.42599 ESTs 

Hs,293119 ESTs. Vfe^ dmBar to ALU7_HUMAN AUU S 

Hs.160380 ESTs 
Hs.271200 ESTs 

Hs.23921 Homo sapiens cDNA I^J12482 fis, done NT 

HS.26539B ESTs.WeddyslmSartotrsnstbrmation^ 
Hs.11614 Homo sapiens cONA: FU235S5 fis. done L 
Hs.44577 ESTs 
Hs.288650 aquaporin 4 
Hs.293697 ESTs 
Hs.170204 KIAA05S1 protein 

gb3m86e09Al NQ.CGAP_Pr12 Homo sapiens 
Hs.109590 genelhonin 1 

Hs.24477 ESTs 

Hs.132892 pcotocadh8rin20 

Hs.257224 ESTs 

Hs.224952 ESTs 

Hs.23450 mRNA for FU00023 protein 

HS.1614Z7 zinc finger protein 215 

Hs.228320 Homo sapiens cDNA: FU23537 fis. done L 

HS.162B1 hypothetical protelnFU23403 

Hs^95306 ESTs, Highly similar to unnamed protein 

Hs.171409 ssmtoglcaily defined odon cancer antig 

Hs.1 16750 Homo sapiens cDNA FU13221 fis, clone NT 

Hs^1245 ESTs 

0b:ll.W:TO220.17020tM)67-C11 a0220Homo 

Hs.119991 ESTs 

Hs.21946 ESTs 

gbA)003831 Sdected dvomosome 21 cDHA 

Hs.147377 Homo sapiens cOHA: FU23598 fis. done L 

Hs.77367 monokine induced by gamma interferon 

Hs.102301 HomO8apIensmRNA;d}I^OKFZp5B6J0323(r 

Hs.159398 peptidylglydne dpha^midaling monooiqrB 
H3.190478 ESTs 
H8.49397 ESTs 
Hs.135024 ESTs 

Hs.10290 U5snRNP-6pedflc40kDaprotelft{hPrp6- 
gb2w63b0Bj1 Soafes.tolaUeIus_N&2HF8. 

H3.2S3569 ESTs 

gbiKomo sapiens mRHA: cONA OKFZpSB6H0718 

Hs.16970 ESTs 

Hs.49136 ESTs 

Hs.190086 ESTs 

gbrH^apIens DMA for endogenous retrovlr 
gb3in20h12^1 Stratagene pancreas {93720 

Hs.61345 RU2S 

Hs.13305 ESTs 

309 
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423809 AA328348 
440444 AA865221 
446254 BE179829 
447505 AL049266 
423244 AL039379 
446921 AB012113 
444271 AW452S69 
434217 AW014795 
452571 W31518 
423575 C18853 
408ni AW732573 
431322 AW970622 
445034 AW293376 
438842 AA827176 
424908 AI586086 
415025 AW207091 
420313 AB023230 
423448 AKD00776 
433492 AW80Se49 
434636 AA083764 
435747 AID79519 
458158 AW296778 
419261 X07B76 
410060 NI\L001448 
426116 AA866729 
409203 AA780473 
414259 W44633 
406671 AA129547 
431889 AA521277 
430414 AW365565 
433426 H69125 
421764 A1681535 
410785 AW803341 
455235 AW875951 
408399 NM.005426 
429784 M89796 
436982 AB018305 
432231 AA339977 
432837 AA310693 
452166 AI948607 
458154 AW816379 
420362 U79734 
424202 BE350295 
410658 AW105231 
415457 AWD81710 
419503 AA243642 
439479 AI734258 
448404 BE069973 
424268 AA397653 
420637 AWg76153 
450715 AI266484 
428927 AA441837 
422544 AB018259 
431207 AA49592S 
424508 AL080103 
441484 AA935481 
425916 NM_006786 
401793 

431169 AW971240 
438038 AI73262g 
439619 AWg7SS98 
446577 AB040933 
450445 AW974536 
459462 AA625339 
445495 BE622641 
428743 AL080060 
426320 W4759S 
432889 AW974094 
419235 AW470411 
429703 T93154 
413499 BE144884 
406182 

417307 N99S73 
430140 AW296771 
436111 A!8030a2 
449729 R72032 
457620 AA602711 
428434 AW363590 
406554 

451381 BE241831 
443113 A1040686 
421470 R27496 
446428 AWD82270 



Hs.218289 EST5 
KS.1S69B4 ESTs 

Hs.179852 Homo sapiens cONAFU12832fis, done NT 
Hs.18724 Homo sapiens mRNA; cdHA DKFZp564F093 (fr 
Hs^09602 ESTs.WftaMysirallartouttqultousTPRm 
Hs.16530 small tnducibto cytokine subfannOy A CC^ 
85.149804 ESTs 
Hs.23349 ESTs 
Hs.34665 ESTs 

Hs.163443 Homo sapiens cONA aj11576 fis. done HE 
HS.47S84 potassium vdtage^ated channe). d^ed 

gl):EST382704 MAGE resequences. MAGK Homo 
1^5.160323 ESTs 
H5.124316 ESTs 

H5.153716 Homo sapiens mRNA (iorHmol>33 protein. 3* 

Hs.72307 ESTs 

Hs.96427 KiAA1013 protein 

Hs.128753 Homo8api8nscONARJ207691is,doneCO 

gb:MRO-HTQ241.200100406^HT0241 Homo 
Hs.241334 ESTs 
Hs.134398 ESTs 

Hs.300357 ESTs. Highly simflar to dJ41 6F21 .2 iH.sa 
H5.89791 wingless^ MMTVintegraSon site fami 
Hs.58367 glyplcan4 
Hs.144694 ESTs 

Hs.687 cytodvome P450, sulrfamily IVB, pdypepl 
Hs.25044 Homo sapiens cDNA: aJ23131 tis. done L 
Hs.285754 met prot»onoogene(hepatocyte growth fa 
H3.124946 ESTs 
Hs.120388 ESTs 
Hs.133525 ESTs 

Hs.99342 ESTs, WeaWy Blmflar to KCC1_HUMAN CALG 
gb:lL2-UM0079^9030^050O03 UM0079 Homo 
gb:CM1-PT0013.131299^7-f09 PT0013 Homo 

Hs.44585 tumor protein p53-bin(fino protein. 2 

Hs.30 membrane-spanning 4K)omains, subfamily A 

Hs.5378 spondin 1 , (f-spondtn) extracellular mat 

Hs.274127 OST 11240 protein 

H&.279512 HSPC072 protein 

Hs.264680 ESTs 

^QV4-ST0234-181199-035^1 ST0234Homo 

Hs.97206 huntinglln Interacting protein 1 

H3.15032 ESTs, Weakly similar to RAN bhiding piot 

Hs.192035 ESTs 

HS.73G9 E8Ts.W^8imi!artoALU1.HUMANALUS 
Hs.137422 ESTs 

Hs.245367 ESTs, Weaidy similar to AlOJIJiUMAN ALU S 
gb:RC6-BT0709^10300^1-G07 BT0709 Homo 

Hs.144339 Human DNA sequence from done 495O10 on 

gb:EST388262 MAGE resequences, MAGN Homo 

Hs.31570 ESTs, Weakly simHar to K1AA1324 protein 

Hs^250 ESTs 

Hs.118140 KIAA0716 gene product 

H3.9394 ESTs 

Hs.149770 Homo sapiens cOHA aJ13858 lis. done PL 
Hs.58972 ESTs 
Hs.162200 urotensin2 



H5.194161 

Hs.58595 

H3.15420 

Hs.194563 

Hsi37052 

Hs.38489 

Hs.301549 

Hs.169300 

HsiB8433 
Hs.28705 



Hs.3565 
Hs.221999 
Hs.157212 
HS29235 

Hs.65551 



gb:EST383329 MAGE resequences. MAGL Homo 
ESTs, Weakly 8lmlIartDTA2R HUMAN, BETA 
ESTs 

KIAA1500 protein 
ESTs 

EST. WeaWy similar to ALUl^HUMAN ALU SU 
ESTs 

Homo sapiens mRNA; cDNA 0}<FZp564H172 (fr 
transfonnlng growth (actor, beta 2 
gb:EST388197 MAGE resequences. MAGM Homo 
neurotrlmin 
ESTs 

8b:CMO.KT0182-O410994)6&«11 HT0182 Homo 

ESTs. Weakly similar I0AF1267431 DNAJ 

ESTs 

ESTs 

ESTs 

gbnip03h08^1 NCLGGAPJ>r2 Homo sapiens 
ESTs. Weddy simQar toAF172993 1 PLUNC 



gb:TCAAP2E001 1 Pediatric acute myelogeno 
Hs.132908 ESTs 
Hs.1378 anne}dnA3 

Hs.210617 ESTs. Weakly similar to ALU4.HUMAN ALU S 
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435031 AI632091 

413138 BE066841 

429228 A1553633 

4202S2 AWZ7O404 

423629 AW021173 

444339 T96555 

434164 AW207019 
404599 



AA393351 

453736 AL1 18674 

408923 H73881 

430919 AA469041 

431622 AW979271 

433564 AW29S399 

437073 AI885608 

438394 BE379623 

446242 N66336 

452542 AW812256 

454009 AW015927 

449765 N92293 



H5.116877 ESTs 

gb:PMO-BT034^09129W)02-a11 BT0340Homo 

Hs.104985 ESTs 

Hs.193161 ESTs 

H5.16612 Komo sapiens cDNA: aJ21909 fis. dons H 

Hs31562 ESTs 

Hs.146135 ESTs 

Hs.132121 ESTs 

Hs34871 ICIAA056998ne product 

Hs.255436 ESTs 

Hs.295448 ESTs 

HS.2931B4 ESTs 

gb:Ul-H-8t2-ahv-M)34)U8l NCLCGAP.Su 
Hs.94122 ESTs 
Hs.27693 CGI>124piotdn 
Hs.7360 ESTs 

gb:RC0^174-191099^1-a07 ST0174 Homo 
Hs.233071 ESTs 

Hs.206832 EST. Atoderately similar to ALUBjnJMANAL 





415652 T79213 


Hs.272073 


ESTs 




453931 AL121278 


Hs.25144 


ESTs 




439382 BE247684 


Hs,103070 


ESTs 


25 


420077 AW512260 


Hs.87767 


ESTs 


430437 A1768801 


Hs.169943 


Homo sapiens cDNA FLI13569 fis. done PL 




446745 AW118169 


Hs.156400 


ESTs 




408308 AL033377 


Hs.44197 


hypothec protein DKFZp564D0452 




450320 AW291775 


Hs^13793 


ESTs 


30 


429597 NM_003816 


Hs^442 


a disintegrin and metafloproteinase doma 


449523 NM.000579 


Ks.54443 


chemoMne (C-C motiO receptor 5 




451110 AI9S5040 


Hs.301584 


ESTs 




431745 AW972448 


Hs.163425 


ESTs 




410781 AI37S672 


Hs.165026 


ESTs 


35 


419546 AA244199 




gb:nc06c05.8l NO^CGAP^Prl Homo sapiens 


444330 A1S97655 


Hs.49265 


ESTs 




408761 AA057264 


H&238936 


ESTs 




409026 AL137554 


Hs.49927 


Homo sapiens mRNA; cONA DKFZp434H1720 (f 




432055 AW972359 


Hs.293334 


ESTs 


40 


432441 AW292425 


Hs.163464 


ESTs 


408045 AW136959 


HS.24S123 


ESTs 




427191 BE221625 


HS.97G91 


ESTs 




416965 N26223 


Hs.160436 


ESTs 




441594 AL041080 


HS.2087&5 


ESTs 


45 


408992 S82472 




gb:beta -pol-ONA polymerase beta {exon a 


431941 AKD00106 


HS.2722Z7 


Homo sapiens cDNA FU20099 fis. done 00 




438323 AI985394 


H$.1233G9 


ESTs 




427698 AW972594 


H5.294140 


ESTs 




424296 AI631674 


Hs.169391 


ESTs 


50 


450522 AI698839 




gbwd31f02jc1 Soares_NFUr.GBC_S1 Homo s 


407942 AA378608 


Hs.5894 


hypothetical protein FLI10305 




417991 AA731452 


Hs.190008 


ESTs 




422589 AA312735 


H8.179725 


ESTs 




437583 AA761190 


Hs.244627 


ESTs 


55 


452019 AL157503 


H8.27552 


Homo sapiens mRNA; cONA DKFZp586N2424 (f 


449494 AW237014 


Hs.286650 


aquaporin4 




444188 AI393165 


Hs.19175 


ESTs 




400297 AI127075 


Hs^88381 


hypotheacal protein DI^FZp56401278 




410811 AW805687 


Hs^00648 


ESTs 


60 


450584 AA040403 


Hs.60371 


ESTs 


428043 T92248 


Hs^240 


uterogiobln 




436120 AI246193 


Hs.119860 


ESTs 




442324 R63578 


Hs.28426 


ESTs 




448693 AW004854 


Hs.228320 


Homo sapiens cDNA: FU23537 Rs. done L 


65 


425555 AA359291 


Hs.130767 


Homo sapiens cONA: FU23553 lis. done L 


431385 BE178538 


Hs.11090 


high affinity hnmunoglobulin epsllon lec 




408427 AW194270 


Hs.177236 


ESTs 




459567 AA031956 




Ob2k15e04^1 SoaresjuegnantjulenjsJIbH 




438128 AA904430 


Hs.122049 


ESTs, WlBflldy slmBar to U4/U6 sroail nud 


70 


408938 AA059013 


Hs.22607 


ESTs 


419276 BE165909 


Hs.134682 


Homo sapiens CI3NA: FU23161 lis, done L 




422022 AA302420 


Ks.200442 


ESTs 




426890 AA393167 


Hs.41294 


ESTs 




427374 A1150033 


Hs.143688 


ESTs 


75 


434208 T92641 


Hs.127648 


hypolhetical protein PR02176 


446466 H38026 


H5.30B 


arrestln 3. retina) (X«restin) 




451229 AW967707 


Ks.48473 


ESTs 




415511 AI732617 


Hs.182382 


ESTs 




408n6 AA057385 


KS.633S6 


ESTs 


80 


421110 AJ250717 


Hs.1355 


calhepsinE 


453638 R67837 


Hs.169872 


ESTs 




436578 A1091435 


Hs.134859 


ESTs 




426083 AWg62712 


Hs.12$712 


ESTs. Weakly 8lmlartoAF191020 1 E21G5 




419231 AlJ0462g4 


Hs.136245 


ESTs. Weakly siniEar to dJ202l21 .4 [Hja 
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408171 AA301228 Hs.43299 Homo sapiens cONAFUl 2890 lis. done t4T 2^ 

445189 AI9364S0 Hs.147482 ESTs 2^ 

419150 T29818 Hs.89640 TEK tyrosine kinase, endothelial (venous 2.8 

. 427457 AW779105 Hs.164682 ESTs. WeaMy similar to 0RF2 consensus 8 2.8 

5 435082 AA664273 Hs.186104 Homo sapieRScDNAFU13803te. done TH 2.8 

446932 AA961459 Hs.125644 ESTs 2.8 

439140 W85737 Hs.290830 ESTs 2.8 

40504V 2.8 

421306 AA806207 Hs.125889 ESTs IB 

10 427514 AA640773 Hs.209224 ESTs 18 

427939 T92459 Hs.16886 ESTs Z8 

429127 AA749382 Hs.107233 ESTs ZB 

429590 A1219490 Hs.44445 ESTs, Weakly similar to Keteh motif con) ZB 

433163 R40468 Hs.163582 ESTs 2.8 

IS 439635 AA477288 Hs.94891 Homo sapiens cONA: FU22729 lis. done H 28 

448015 A]456085 Hs.23196 ESTs ZB 

456761 D69899 Hs.127842 CGI.142 2.8 

457112 AW772449 H8.268081 ESTs. WeaMy similar to ALUIJIUMANAIUS 18 

449540 AA001713 gb:zh86e08.s1 Soares.fetaUiver.8pleeiL 2.8 

20 447020 T27308 Hs.16986 hypothetical protein FU11046 2.6 

412810 xg0908 Hs.74126 fatty add binding protein 6. Beal (gas 2.8 

433515 AA595800 Hs.190246 ESTs 2.8 

424450 AL137526 Hs.147472 dyneinlntennediatodiain2 ZB 

438122 AI620270 Hs.129837 ESTs ZB 

25 424086 A1351010 Hs.102267 tysyt oxidase 18 

438885 AI886558 Hs.184987 ESTs 16 

412903 BE007967 Hs.155795 ESTs 18 

454111 AW081681 Hs^9064 ESTs 18 

439398 AA284267 Hs.221504 ESTs 18 

30 449802 AW901604 Hs.23984 hypotheticai protein FU20147 18 

434812 AA649860 Hs.189496 ESTs 18 

432583 AW023624 Hs.162282 ESTs 18 

428104 AA421350 Ks.191604 ESTs 18 

408217 A1433201 H5.279860 hypolhelical protein FU2003O 18 

35 438016 Ai949638 Hs.109150 SK3^oma!n binding protein 5 (BTK-assod 18 

436396 AI683487 Hs.299112 HomosapienscONAFU11441 ris.doneHE 17 

430887 N66801 Hs.260287 ESTs. Wealdy similar to ALU7_HUMAN ALUS 17 

446311 AW007294 Hs.149795 ESTs. Weakly dmOar to AUIIJIUMAN ALUS 17 

416185 AW975881 Hs.291995 ESTs 17 

40 408813 AW242086 Hs.253987 ESTs 17 

442510 AF150179 Hs.249890 ESTs 17 

433293 AF007835 Hs.32417 ESTs 17 

413875 BE176778 gb:RC3.HT0586-11030OO11-g09HT0586Homo 17 

404488 17 

45 408936 AL13B043 Hs.293W9 ESTs 17 

431980 AA523696 Hs.222695 Homo sapiens cDNA: FLJ20986 fis. done C 17 

436738 AW102613 H$.152913 ESTs 17 

451797 AW663858 Hs.56120 ESTs 17 

452163 AI863140 gbnz43h11x1 Na_CGAP_Bm52Homosapien 17 

50 452778 R71338 Hs.5921 HomosapienscONA:FU21592 lis. done C 17 

459366 AA129703 gb2ng2b05.r1 Stratagenelungcaidnoma 17 

431448 AL137517 Hs.2883B1 hypotheOcd protein 0KFZp664O1 278 2.7 

430733 AW975920 Hs.283361 ESTs 17 

453652 AWD09640 Hs.28368 ESTs 17 

55 453816 NML003462 Hs.33846 ifynein. axonemal. lighl Intemnedlate pd 17 

411905 BE265087 gb:601ig3893F1 14IK.M6C.7 Homo sapiens cD 17 

408729 AA195764 Hs.72639 ESTs 17 

450726 AW204600 Hs.264330 N-acylsphingoslneBmldohydiolase(adde 17 

447720 AIJD38765 Hs.161304 ESTs 17 

60 451497 H83294 Hs.284122 WhtlnhlbItofyfector-1 17 

442074 C17511 Hs.128430 ESTs 17 

424115 AA335497 Hs.293965 ESTs 17 

417728 AW138437 HS.247S0 t<lAA1 573 protein 17 

^- 433803 AI823593 Hs.27688 ESTs 17 

65 419247 S65791 Hs.89764 (ragiieXmentdlretaTdaltonI 17 

424310 AA338648 Hs.50334 ESTs 16 

438504 AW665281 Hs.224625 ESTs 16 

426486 BE17&285 Hs.170056 Homo sapiens mRNA;cDNADKFZp5B6B0220a 16 

430417 AA461045 Hs.50701 ESTs 16 

70 438297 AW515196 Hs.258238 ESTs. Moderately sImDar to ALU1J1UMAN A 16 

422505 AL120862 Hs.1241G5 ESTs 16 

457285 AI038858 Hs.228780 ESTs, HIgMysimtlar to AF1 99597 lA-typ 16 

428667 /U375550 Hs.74407 nudedarprDtolnp40;homdog of yeast 16 

__ 431750 AA514986 H3.283705 ESTs 16 

75 435575 AF213457 Hs.44234 triggering receptor expressed on myeloid 18 

413385 M34455 Ks.840 inddeaminei)yrTole 13 dioxygenase 16 

403903 16 

407910 AA650274 Ks.41298 fibronectin leucine rich transmembrane p 15 

423424 AF150241 Hs.128433 prostaglandin 02 synthase, hematopdedc 16 

80 436043 AW963838 H3.168830 Homo sapiens CDNAFU12138 Us. done MA 16 

436645 AW023424 Hs.1S6520 ESTs 16 

408360 AF123050 Hs.44532 diubiquitin 16 

402829 16 
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406594 2.6 

415122 D60708 H8.22245 ESTs 2.6 

416747 AW876523 Hs.15929 Homo sapiens cONAnJ12910fis. done NT 16 

. 420159 A1572490 Hs.997B5 Homo sapiens cDNA: FU21 245 fis. done C 16 

5 444361 W76027 H5.23920 Homo sapiens cONA FU13124 fis, dona NT 16 

446609 BE395090 Hs.15535 Human gene fiDmPACB85K2.dvomosQme1 16 

449260 AA7411B0 Hs.29879 ESTs 2.6 

452311 AW304029 Hs.252744' ESTs 2.6 

413802 AW964490 Hs.32241 ESTs 16 

10 417318 AW953937 Hs.12891 ESTs 16 

440028 AW473675 Hs.125843 ESTs 16 

437960 A1659588 H5.222194 ESTs 16 

433687 AA743991 gb:ny57g01.5l NCI_CGAP_Pr18 Homo sapiens 16 

430573 AA744550 Hs.136345 ESTs 16 

15 439737 AI751438 Ks.41271 Homo sapiens mRNAfuD length insert cON 16 

453204 R10799 Hs.191990 ESTs 16 

436751 AA732217 Hs.294054 ESTs 16 

406165 AL137573 Hs.43143 Homo sapiens mRNA:cONADKFZp554A2463(f 16 

431120 AA492588 gb3ig99c085l Na.CGAP_Thy1 Homo sapiens 15 

20 446638 AL133063 Ks.157B3 Homo sapiens mRNA:cONADKFZp434P11 15 (f 15 

438458 AW975186 H5.162675 ESTs. VlteddyslrnRar to ALU1_HUMAN ALUS 15 

446063 AI720140 Hs.151079 ESTs 15 

430499 AW969408 H8.2319gi ESTs 15 

450496 AW4492S1 Hs.257131 ESTs 15 

25 441330 A1692984 H5.129354 ESTs 15 

424433 H04607 Hs.9218 ESTs IS 

434677 AW444575 Hs.130834 ESTs 15 

445779 A12S3104 H5.189267 ESTs 15 

444649 AW207523 Hs.197628 ESTs 15 

30 415451 H19415 Hs.268720 ESTs. Moderaldy similar to ALU1.HUMAN A 15 

432222 AI204995 gb:8n03c03j(1 Stralagene sdiizo brain SI IS 

404288 15 

408572 AA06S611 Hs226568 ESTs. Moderately similar loALU4_HUMAN A 15 

408727 AL137259 Hs.47ll5 hypothetlcai prolan DKFZp434D0513 15 

35 408728 AL137379 Hs.47125 hypothetical protein FU 1391 2 15 

410095 AW589S38 Hs.258947 ESTs 15 

410947 AK000305 Hs.670S5 hypolheticai protein FU20298 15 

418343 AA216372 Hs.159501 ESTs IS 

423401 NM.001992 Hs.128087 coagulation factor II {thrombin) recepto 15 

40 428537 AW979258 gb:EST391 378 MAGE resequences, MAGP Homo 15 

429846 AB023021 Hs.225945 fuoosyltransferase9(^ha(1.3}fucosy 15 

432507 BE391093 gb:601286042F1 NlH_MGCjl4 Homo sapiens c 15 

433858 N89243 H5.192974 Homo sapiens cDNAFU12735fi8, done NT 15 

438651 H64500 Hs.123646 ESTs 15 

45 443830 AI142095 Hs.143273 ESTs 15 

446800 AI341635 Hs.156466 ESTs 15 

450262 AW409872 Hs.271166 ESTs. Moderaleiy similar to AIU7_HUMAN A 15 

451343 AWg75057 Hs.293353 ESTs 15 

451539 AA059467 Hs.218933 ESTs 15 

50 452412 AA029608 Hs.61373 ESTs 15 

454288 BE222648 Hs.279468 ESTs. Highly similar to c380A1.1bIH.8ap 15 

445745 AB007g24 Hs.l3245 KIAA0455 gene product 15 

424943 AU077260 Hs.153924 death-assodated protdn Mnase 1 15 

440106 AA864968 Hs.127699 K1AA1603 protein 15 

55 458429 AV646559 Hs.12346 Homo sapiens cDNA: aJ21399 fis. done C 15 

41S261 T40928 H5.8346 ESTs 15 

420026 A1831190 H5.166676 ESTs 15 

431806 AF186114 H3.270737 tumor necrosis factor Oigand)$upeiM 15 

458722 AA741545 Hs.282B32 ESTs 15 

60 419449 H18417 Hs^483 Homo sapiens cDNAHJ 14294 lis, done PL IS 

436260 BE172762 Hs.292710 ESTs. W^aMy similar to ALUS J1UMAN ALUS 15 

433644 AW342026 Hs.256112 ESTs 15 

419172 AW338625 Hs.22120 ESTs 15 

437982 N93466 Hs.121764 ESTs, WeaMy similar to testicular tekG 15 

65 443348 AW873S96 H857572 ESTs 15 

417218 AA005247 Hs.285754 met proUH)noogene(hepatocyte growth fa 15 

419236 AA330447 Hs.13S159 Homo sapiens cONAFUl 1481 fis. done HE 15 

448030 N30714 Hs.20161 HDCME31P protein 15 

417203 AA406341 Hs^9908 Homo sapiens cONARJ1 1991 fis, done HE 15 

70 449275 AW450848 Hs.205457 KIAA1620 protein 14 

436198 AK001125 Hs.300922 Homo sapiens CONAFU10263 its. done HE 14 

452281 T93500 Hs.28792 Homo sapiens cDNAFU11041 lis. done PL 14 

442191 W95186 Hs.8136 endotheOa! PAS domain protein 1 , 14 

428571 NM_006531 Hs.2291 PtobehTg737 (polycystic kidney disease. 14 

75 453142 AA033648 HsJ473 ESTs 14 

425657 T89839 Hs.l 19471 ESTs 14 

452822 X85689 Hs.288617 Homo sapiens cDI^FU22621 fis. done H 14 

416778 M16505 Hs.79876 stsrdd suifatase (micn>somal), arylsilf 14 

458332 AI000341 Hs.22a491 ESTs 14 

oU 448140 AF146761 Hs.20450 BCKHOes membrane protein preewsor 14 

459644 2.4 

429125 AA446854 Hs.271004 ESTs 14 

448337 AW2084S3 Hs^B2 ESTs 14 

313 



wo 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



427778 AA412323 
425371 D49441 
448299 AA497044 
447610 AW2962B8 
409519 AA075368 
441006 AW605267 
440817 AI341423 
420020 BE295866 
435395 AA729235 
424144 AA454033 
405494 

458145 AI239457 
408547 AAS74291 
408941 AI4524S9 
409457 AW818081 
417137 U46265 
418950 n8517 
420756 AA411800 
428316 AI860775 
432896 NM_014097 
436148 BE005252 
436284 AA708016 
437327 AL353942 
442611 BE077155 
456082 AI866286 
433014 NM.014711 
401335 

428771 AB028992 
419140 AI982647 
454693 AW813428 
427785 X81053 
407339 AA777542 
408369 R38438 
427019 AA001732 
431089 BE041395 
452561 AI692181 
427878 C05766 
419752 AA249573 
430073 U88136 
452401 NM.007115 
430345 AK000282 
407905 AW103655 
427660 AI741320 
422355 AW403724 
453049 BE537217 
438568 R98865 
453445 ALJ036532 
424711 NM.005795 
446346 AI290205 
441974 A1683782 
444805 AB007899 
424027 AW337575 
419606 AW294795 
428613 AB037749 
434340 AI193043 
450297 AW901347 
432779 AW979241 
433650 AA603472 
419086 NM-000216 
428758 AA433988 
430153 AW966128 
416883 BE3B7035 
427669 AW451832 
400610 
402222 

407162 N63855 
415250 F02614 
421751 AW813731 
428552 AW274560 
432658 AW973769 
434742 AA648302 
436586 A1308682 
441675 A1914329 
442039 AW276240 
443160 A146791S 
448764 A1566607 
449579 AW207260 
439810 AL109710 
413714 AI560944 
400289 X07820 
413384 NM.000401 
438670 A127S803 
419991 AJ000098 



Hs.105323 ESTs 
H5.155981 mesothefin 



H8^7 
Hs^55534 

Hs,7627 



hypotheflcal protein FU10392 
ESTs 

gb2n«6h10Jl Stratagena ovsfan earner 
CGl^praleln 
Hs.270165 ESTs 
Hs.94382 adenosine kinase 
H8.117907 ESTs 

Hs.41844 Homo sapiens cONA:FU23003 6s. done L 

Hs.130794 ESTs 
Hs^837 ESTs 
Hs.165221 ESTs 



gb:CM4^T0276-101299«59^ STQ276 Homo 
hypotheScai pfolein 
ESTb 



Hs.81281 
Hs.13941 
H8.189900 ESTs 
H5.98508 ESTs 
Hs.279778 PR01693prate!n 

gb:CM1-8N011&03O4O0-17lHgO2 BN0116 Homo 
Hs.190389 ESTs 

gb:Honio sapiens mRNA; cONA DKFZp761L2312 
Hs.177537 ESTs 
Hs.71962 ESTs 
H5^9912 KtAA0419oenepioducl 



KIAA1069( 
ESTs 

gb:limSTD1924)10200-21&c05 ST0192 Homo 

0QDagen.lypelV,dtph84 

ESTs 

solute carrier family 15 (H-»/peptide tra 

hypothetical protein FU10970 

ESTs, WealOy similar to indoiown protein 

K1AA1634 protein 

C6M)7 protein 

ESTs 

telomerase-associated prot^ 1 
tumor neciDsis factor. e)pii»induoed pro 
hypothetical protein FU20275 
ESTs 

Homo sapiens cDNA: FU23228 lis, done C 
immunoglobunn heavy constant gamma 3 (G 
ESTs 

mafor hIstocompaSbifity complex, class 
ESTs 

c^citontn receptor-ili^e 

gb:qi79g06j(1 Soares_NhHMPu_S1 Homosapi 
ESTs 

KIAA0439 protein; homolog of yeast ubiqu 
ESTs 

ESTs. Weakly similar b similar to acy^ 

K1AA1328 protein 

ESTs 

Homo sapiens cDNA: aJ23342 lis. done H 
gb:EST391351 MAGE resequences, mOP Homo 
ESTs 

Kallmann syndrome 1 sequence 
Homo sapiens cONA FU14303 lis, done PL 
gb£Sn80338 MAGE resequences, MAGJ Homo 
add phosphatase 5. tartrate resistant 
ESTs, Moderately sirollar to K1AA1 200 pro 



Hs.142634 zinc finger protein 

Hs^319 ESTs 

HS.1S9153 ESTs 

Hs.129520 ESTs 

Hs.162319 ESTs 

Hs^91695 ESTs 

Hs.167028 ESTs 

Hs^l ESTs 

Hs.128352 ESTs, Weaidy similar to p80|Rjwrvegicu 

Hs.36053 ESTs 

Hs.162112 ESTs 

Hs.134014 prostate cancer assodated protein 6 

H8^5568 EST 

H3.71428 ESTs 

Hs.2258 matrix metalloprotelnase 1 0 (stromelysln 

Hs.75334 exostoses {multiple) 2 

HS.12342B ESTs 

H^.94210 eyes ebsenl prosophil^ homfllog 1 



Hs.193143 
Hs.215725 

Hs.180828 

Hs.132570 

Hs.182576 

Hs.173233 

Hs^83676 

Hs.49169 

HS.1B1022 

Hs.152618 

Hs.232070 

Hs^2 

Hs.23968f 

H8.252905 

Hs.114121 

Hs.140 

Hs^343 

Hs.11135 

Hs^1453 

Hs.152175 

Hs.128245 

Hs»12017 

Hs.201591 

Hs.198529 

Hs.18e928 

HS.12B685 

Hs.38592 

Hs.28456 
Hs^9591 
Hs.98502 

Hs.1211 
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459702 

414888 AL039185 
438474 AW865818 
453037 AA045175 
428467 AKQ02121 
413930 M86153 
422429 AA310527 
41SD83 A1632683 
417015 M83772 
405508 

448330 AL036449 
409719 AI769160 
423354 AB011130 
425188 Ata»2052 
427961 AW293165 
447357 A1375922 
412642 BE244598 
453716 AA037675 
437370 A1J359567 
407949 W21874 
427972 AA854870 
453313 BE005771 
426476 m 003296 
424238 AA337401 
452930 AW195265 
424527 AW138558 
453095 AW295660 
449161 N53431 
429586 T73510 
423782 A)472209 
458124 AW005548 
450109 AI539295 
421461 AW291023 
412222 AA528283 
418882 NM.004996 
441736 AW292n9 
401049 

440727 AI073991 
419751 AW195581 
445640 AW969626 
421379 Y15221 
422109 S73265 
410292 AA843087 
434265 AA846B11 
449695 AA164569 
429399 AA452244 
444042 NM_004915 
432343 NM.002980 
436772 AW975688 
428784 Y12851 
445268 A1218358 
402481 

412608 AA247995 
416521 H60929 
416624 H6g044 
419780 AA713522 
421211 AA284966 
427541 AI798983 
432013 /U7S8879 
436461 AW511956 
438002 A)560246 
440312 AW614597 
440479 AA886461 
441178 W907e9 
441235 A1884586 
443314 AW771701 
422165 AU)41199 
450898 AI854223 
432974 BE3487g3 
404200 

435990 AI0158&2 
421309 AI222086 
451558 KM-001089 
416642 T96118 
406672 M26041 
417819 A(253112 
417355 D13168 
459574 AI741122 
404274 

415086 A1597963 
418210 R54575 
419220 AA811938 
444314 AI140497 



Hs.77558 thyroid hoimone receptor interactor 7 

Hs.6232 KIAA0764 gene product 

Hs.177552 ESTs 

Hs.184465 hypothetical protein FU1 1259 

Hs.75618 RAB1 1 A. member RAS oncoger^e (amOy 

gb:EST181333 Jurtot T-cetIs V Homo sapie 

Hs^179 HQino8apien5cONARJ12933fis.cbnsflT 

Hs.80876 flav{ncQntatitingiiionoo)(ygena$s3 

Hs^07163 ESTs 

Hs.108681 ESTs 

Hs.127436 calcium channel, vottagfr^ndent, 8l|A 

Hs.155071 hypothetical protein FU11190 

Hs.143134 ESTs 

Hs.159367 ESTs 

Hs.809 hepatocyte growth factor (hepapoiatin A; 

HS.1S2675 ESTs 

Hs.161962 Homo sapiens mRNA: cDNA DKFZpS470023 ffr 

Hs^47057 ESTs 

Hs,181304 putaSve oer>e product 

Hs.153746 Homo sapiens cONA: FiJ22490 fis, clone H 

Hs.2042 testis spedficpfDteinI (probe H4-1p3 

Hs.137635 ESTs 

Hs.194097 ESTs 

Hs.267158 ESTs 

Hs.252756 ESTs 

H3.47647 ESTs. Wealdy slnularto KIAA0423 (H^api 

H5.209153 anglopotatin^ilcsd 

Hs,288369 ESTs 

Hs.124590 ESTs 

Hs.17967 ESTs 

Hs.97255 ESTs 

Hs 292737 ESTs 

H8189433 ATP-falnding cassette, subfamily C (CFTR 

Hs.169799 ESTs 

HS.13426B ESTs 

Hs.93121 KiAA0761 protein 

Hs.31704 ESTs. We^ similar to KIAA0227 (H^api 

Hs.103982 small Indudbie cytokine subfamily 6 (Cy 

Hs.1473 gastrin-releasingpepMe 

Hs.124194 ESTs 

Hs.130554 Homo sapiens cDNA: FU23089 lis, done L 

Hs.34550 ESTs 

Hs.16727 ESTs 

Hs.10237 ATP-blnding cassette, sub-family 6 (WHIT 

Hs.2961 S100 calctum-btnding protein A3 

Hs.250867 zona peilucida glycoprotein 3A (sperm re 

H5.19347D purineigic receptor P2X.ligand^atedb 

H5.17S048 ESTs 

Hs.44698 Homo sapiens done TCCCTA00151 mRNA sequ 

Hs.44197 hypothetical protein OKFZp564D0462 
gb7r77h05.s1 Goares fetalliver spleen 

Hs.87752' ESTs 

H8.266308 ESTs, Weakly8imllartoAF2163121 type 

Hs.97961 ESTs 

Hs.162102 ESTs 

Hs.293261 ESTs 

Hs.201648 ESTs,WeaIdysImilartoZN42J^UMANZlNC 

Hs.72475 ESTs 

Hs,208161 ESTs 

Hs,153976 ESTs 

HS.135S70 Homo sapiens cDNA: FU21268 lis. done C 

HS.S4646 ESTs 

Hs.1481 hlstidine decarboxylase 

Hs.16026 Homosapiens cDNA: aJ23191 fis, done L 

gb:ht70gO2j(1 NCLCGAP_lji24 Homosapiens 

Hs.131793 ESTs 

Hsl70449 ESTs. Moderaldy sinular to ALULHUMAN A 

Hs^630 ATP4)inding cassette, sub-family A (ABC1 

Hs.226313 ESTs. Wealdy simflar to ALU1.HUMAN ALU S 

Kls.1 98253 m^ histocompaia)iiity complex, dass 

Hs.133540 ESTs 

Hs.82002 endotheOnreoeptortypeB 

K5.101810 Homo sapiens cONAFlJl4232fis. done NT 

Hs.118726 ESTs 

Hs.13337 ESTs, Wealdy similar to unnamed protein 
Hs;S17S9 ESTs 

gbu)w76b09j1 SoaresJMaUver-ndeen. 
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451050 AW937420 
417412 X16836 
428414 AL049960 
412925 AI089319 
433192 A1859065 
410976 R36207 
406673 M34996 
449677 AA002071 
449321 AA001150 
418557 BE140602 
416320 H47867 
426384 AI472078 
414140 AA281Z79 
419520 AB009303 
446999 AA151520 
457447 X7B261 
451099 R52795 
407366 AF026942 
410048 W76467 
400880 

418092 R45154 
428780 A147B578 
431067 AW574823 
432803 AAS65398 
412104 AW205197 
422819 AL122084 
454359 N71277 
424806 AA382S23 
434445 Ai349306 
442994 AI0267ie 
410371 AA084482 
450232 BE30081S 
417924 AU077231 
430899 BE018217 
431814 BE256242 
417543 AA203820 
444542 Ai161293 
404593 

434803 AWg74640 
451623 H77818 
452466 N84635 
402046 

434927 K46612 
436192 W93847 
401987 

423119 AA322201 
427112 Z32B87 
414464 A!870175 
447829 Ai433029 
449679 Ai823951 
405472 

413S21 At808848 
432212 AW137742 
404289 

415362 F06735 
4Z7739 AW198765 
427772 AA412289 
430844 T94960 
434335 AA630107 
436052 AI021983 
442773 AB037722 
446799 AW978373 
450221 AA328102 
455673 BE065939 
456624 AI362790 
405095 

447207 AA442233 
433589 AA886530 
438398 AA808526 
447233 AW246333 
447197 R36075 
431087 H12723 
409064 AA062954 
427558 D49493 
426457 AW894667 
438118 AW753311 
427621 BE621182 
452114 N22687 
448782 AL050295 
403937 

416402 NH.Q00715 
452416 AA026115 
451609 AU)46019 



Ks.69662 ESTs 

H5.82112 intertmddn 1 receptor* type I 
Hs.184216 DKFZP564C152 protein 
Hs.179243 ESTs 

Hs.16808 ESTs. Weakly sMst to paraplegln-likB 
Hs.25092 ESTs 

Ks.198253 m^hlslDcumpalibiBty complex, dass 

gb2h85dOU1 SoarBS_feld|Jiver.spleeii_ 

Hs,t32937 ESTs 

Hs.246645 ESTs 

Hs.34024 ESTs 

9b:^j85h03Ji1 Soares_NSF,F8^9W_OT.PA^P_S 

Hs^ir ESTs 

Hs.297790 Human done 23734 mRNA sequence 
Hs,27^2S hypolheCcal protein PRO2605 
HS.Z72177 H.saplensmRNAforTRE175re3(lremIlyan 
Hs.25954 interieukin 13 receptor, alpha 2 

gbiloroo sapiens cig33 mRNA. partial sequ 
Hs274550 proKne oxidase homolog 

Hs.106604 ESTs 

Hs^836 ESTs 

K5^13 ESTs 

gb:nk41f01.s1 Na.CGAP_GC2 Homo sapiens 

H52409S1 ESTs 

Hs.121073 hypolhetical protein FU10466 

gb:za36e03£l Soares fetal liver spleen 

Hs.105689 ESTs 

Hs.11782 ESTs 

Hs.16954 ESTs 

Hs.1 15850 ESTs 

H5^1326 ESTs 

Hs.82932 cydin D1 (PRAD1: parathyroid adenomatos 

Hs.1 63528 ESTs. Weakly similar to Bein4&^lke prote 

Hs.270847 dellMubuIin 

Hs.1 10153 ESTs. Weakly slmllaf to BCGFJ^UMAN B^L 

Hs.146862 ESTs. Weakly similar to KIAA0525 protein 

gb:EST3B6744 MAGE resequences. MAGM Homo 
Hs268991 ESTs 

Hs^664 Human DMA sequence liom done 682J15 on 

Hs29381 5 Homo sapiens HSPC285 mRN^ pam cds 
Hs.24139 Homo sapiens cDNA: FLJ23137 lis. done L 

Hs.131976 EST 

Hs.290951 ESTs 

Hs.13957 ESTs 

Hs.164104 ESTs 

Hs^6668 Homo sapiens cDNARJ11846ti8, done HE 

Hs.184156 ESTs 
Hs:»3451 ESTs 

gb:K5C1JB091 norroalizod Infant brain cDN 
Hs.98105 ESTs 
Hs.98123 ESTs 

gb7e38d07.r1 Stratagene hmg (93721(9 H 
Hs.213220^ ESTs 
Hs^71432 ESTs 

Hs.8707 Homo8a{^ensmRNA;cONADKFZp434N1131 (f 
Hs.49221 ESTs. Weakly similar to zinc finger prot 
Hs^4641 cytosketeton assodated proton 2 

gb:RC3-BT0319-100100412-c11 BT0319 Homo 
Hs.181801 ESTs 

Hs.17731 hypothe6cal protein FU12892 
Hs.188912 ESTs 
Hs.130277 ESTs 

H8.17901 Homo sapiens cONA: FU21974 fis. done H 

gb7h88b01.s1 Soares placenta Nb2HP Homo 
Hs.290791 ESTs 
Hs.,141883 ESTs 

Hs,21 71 growth diffisrenliation factor 10 
Hs.1 69965 chlmerin (chlmaertn) 1 
Hs.259415 ESTs 

H8.179882 Homo sapiens cONAaj12437 its, done NT 
Hs.8238 ESTs 
KS.301S50 KIAA0758 protein 

Hs.1012 compiemeni component 4-binding protein, 
Hs.1 14777 ESTs 
HS.209Z76 ESTs 



Z2 
2.2 
Z2 
22 
22 
2.2 
2.2 
Z2 
2.2 
2.2 
22 
2.2 
12 
Z2 
2.2 
t2 
2.1 
11 
t\ 
ti 
2.1 
2.1 
Z1 
2.1 

^^ 

Z1 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 

21 
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43S334 Rt9382 Ks.1178G9 ESTs 2-1 

445158 A1992108 Hs.127206 ESTs 2.1 

407930 AA045647 K3.1B8381 Homo8apieRScONARJ128076s.clonaNT 21 

439335 AA742697 Hs,62492 ESTs.WteaWydmBartoSSSBSSodlagen 

5 443949 AW827419 Hs.235070 ESTs 2.1 

429716 R25685 Hs.211933 oollagen. type XI tl. alpha 1 Z1 

415817 U88967 Ks.78857 protein tyrosine phosphatase, receptor-t 2.1 

438576 AA813745 Hs.123446 ESTs 2.1 

405848 2.1 

10 416940 N75620 Hs.43157 ESTs 21 

442381 A1185136 Hs.48650 ESTs 21 

420036 R60338 Hs.52792 Homo sapiens mRNA; cONA DKFZp5861 1623 (( 21 

4362S2 AIS39519 Ha.120969 Homo sapiens cONAFU 11 562 fis. dona HE 21 

413450 Z99716 Hs.75372 N-acetylga)actosam!ntdase. alpha- 21 

15 426572 AB037783 Hs.170623 hypothetical protein FU1 1183 21 

439425 AF086244 H3.114659 ESTs 21 

421168 AF162277 Hs.1360 cytochrome P450.subfami)yllB(phenot»r 21 

449611 AI970394 Hs.197075 ESTs 21 

404548 21 

20 416734 K81213 Hs.14825 ESTs 21 

435855 AA883552 H3.16810 ESTs 21 

439072 AF085930 Hs.269123 ESTs 21 

447482 AB0330S9 Hs.l8705 KlAAl 233 protein 21 

457292 A1921270 Hs.214178 Homo sapiens cONAFU14251 (Is, done OV 21 

25 444974 AI203500 Hs.151612 ESTs 21 

456034 AW450979 gb:UI-H-BI^8-12-0-Ut.8l Na.OGAP.Su 21 

430634 AI860851 H5.2668S ESTs 21 

426782 R14614 Hs.191254 ESTs 20 

452943 BE247449 Hs.31082 hypothetical protein FU10525 2.0 

30 445326 AI220072 Hs,165B93 ESTs 20 

421247 BE391727 Hs.1O2910 general transcription factor IIH, pdype 20 

409994 D86864 Hs.57735 acetyl LDL receptor; SREC 20 

443268 A1800271 Hs.12944S hypothetical protein FU1 2496 20 

455226 AW902103 gb:QVO-NN1022*1205Q0.22&c07NN1022Homo 20 

35 417321 N68722 Hs.191368 ESTs 20 

423778 Y09267 Hs.1 32821 flavin containing monooxygenase 2 20 

404323 20 

448133 AA723157 Hs.73769 folate receptor 1 (aduiO 20 

421047 AW514772 Hs,104473 ESTs 20 

40 425497 AA524596 Hs.188844 ESTs 20 

444623 AI183829 Hs.202111 ESTs 20 

412303 AV\/936336 ob:QV4.DT0021-2812994)7a9l1 DT0021 Homo 20 

433563 AI732637 H8.277901 ESTs 20 

408485 2.0 

45 428330 122524 Hs2258 matrix metaOoproteinase 7 (matrflysin, 20 

455607 BE141140 gb:MR0-KT007&O21299-00&d07HT0075Homo 20 

425465 L18964 Hs.1904 protein IgnaseQiOtB 20 

449424 AW448937 Hs.197030 ESTs 20 

427940 AA417812 H5^8n5 ESTs 20 

50 411502 AW946605 H8.250154 Homo8aplenscONAFU12973fis.dbneNT 20 

411365 M76477 H&278242 tutwtin. atpha. ubiquilous 20 

412369 K80456 Hs.285243 Homo sapiens cDNA: FU22029 ds, done H 20 

452959 AI933418 H8.189674 ESTs 20 

416580 T61572 Hs.79385 Human done 23574 mRNA sequence 20 

55 42B775 AA434S79 Hs.1436»1 ESTs 20 

420000 AB036063 Ks.180726 Homo sapiens cDNAFU13543 lis. done PL 20 

408321 AW405882 H3.44205 cortislatin 20 

410011 AB020641 Hs.67656 PFTAIRE protein kinase 1 20 

411050 AW814902 gb:MR1-ST0206-12040a022-f08ST0208Hon» 20 

60 452453 AI902519 gbKaV-BTD09-10119&<»1 BT009HomO8aplen 20 

428978 AA442784 Hs.125445 ESTs 20 

458562 N34128 Hs.145268 ESTs 20 

425527 AL162032 Hs.1582S8 Homo sapiens mRNA;cONADKFZp434B1 272 (f 20 

403760 20 

65 424368 AB037766 Hs.146085 KIAA1 345 protein 20 

421229 AI056590 Hs.7086 Homo sapiens cDNA: FU23000 Us, done L 20 

435304 AA339622 Hs.108837 ESTs 20 

453498 BE181412 Hs.23245 Homo sapiens cONAFU11767fls. done HE 20 

439018 AW300887 Hs.26538 ESTs, WfeaWy similar to Unnamed protein 20 

70 453280 AL157476 HsJ2913 Homo sapiens inRNA;cDNAOKFZt)761 0082 (tr 20 

420193 AI46C080 HsJ02B59 ESTs 20 

444810 AI174783 gb:HA2S01 Human fetal Over cDNAIibraiy 20 

401575 2.0 

419092 J05581 H3.89603 rmidnl, transmembrane 20 

75 430129 BE3O1708 H3.233955 hypothetical protein FU20401 20 

410763 AF279145 Hs.8966 tumor endothelial marker 8 20 

414783 AW069569 Hs.75839 zifc finger protein 6 (CMPXl) 20 

411492 T46848 Hs.70337 Immunoglobulin superfamity, member 4 20 

405963 20 

80 418378 AW962081 gb:EST374154 MAGE resequences. »MGG Homo 20 

420831 AA280824 Hs.190035 ESTs 20 

424152 AL133591 Hs.a01405 Homo sapiens niRNA:cONADKFZp434N07g(tr 20 

424641 AB001106 Hs.1S1413 gQamahiraiionfBClDr.beta 20 
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10 



15 



427616 


AI698684 


Hs.98028 


ESTs 


435115 


A1821726 


Hs.t16603 


ESTs 


437638 


AA764761 


KS.291S44 


ESTs 


438235 


A!394151 


H5.37932 


ESTs 


439430 


AF124250 


Hs.6564 


bfaast cancer antabogen resislanoe 3 


44S388 


AI925280 


Hs.236842 


EST 


447101 


N72185 


Hs.44169 


ESTs 


448796 


AA147829 


H&33193 


ESTs. HigMy Mar to AC007228 3 BC372 


449623 


000719 


Hs.120440 


ESTs 


450159 


A1702416 


Hs.200771 


ESTs. VlteaUy similar to CAN^.HUMAN CALPA 


456813 


R19992 


Hs.106620 


Homo sapiens done 23950 mRNA sequence 


457233 


AI355009 


Hs.2216g8 


ESTs 


457384 


AAS01760 


Hs.18075 


chramosome 9 open reading frame 3 


457471 


AW971364 




gb:EST383453 MAGE lesequences. MAGI Homo 



2.0 
2.0 
ZO 
2.0 
ZO 
2.0 
2.0 
10 
2.0 
10 
10 
10 
10 
10 



TABl£27B 

20 Plcey: UnlqUQ Eos probeset idenlifief number 

CAT rnnter. Gene duster numt)er 

Accession: Genban)( accession numt)er5 

_ _ Pkey CAT numljer Accession 

25 409457 1132521J AW8 1 808 1 AW392B87AW51 4700 AW392881 

409519 113722„1 AA075368 AA075369 

410008 11681?_1 AA079552BE142525BE142527 

410785 1221055 1 AVV803341 AW803265AVV803403AW803466AVV803402AW803413AW803268AW8^ 

411050 1230330'l AW814902 BE156656BE1 56667 BE155590BE156441 BE156447 

30 411880 1263110J AW872477BE088101 T05990 

411905 1265181J BE265067 BE264978AWB75420 

412303 1286130J AW936336 AW936339 

413136 1350379 1 BE066941 BE066911 BE066979BE066929 BE066925 

413499 1373910J 6E1 44884 H97942 

35 413875 1396766J BE176n6 H85072 

415094 1522103.1 D59513 D59515 D801 74 059514 

415362 1534980 1 F06735 R55896R12110H08697 

416624 1604694J H69044 T47567 H75691 T50292 

418378 174656 1 AW962081 AA218925 AA354237 

40 419546 185766J AA244ig9 AA244272 H57440 

419807 188252.1 R77402AA262462AA250988RD6794 

420637 195241 1 AW976163 AA278945 AA747691 

422429 216469 1 AA310527 AW962295 Z44855 H06641 

423377 22769 T AL049377 AL079930 AL047223 AW885968 AA385235 

45 426384 266211.1 Al472078AAa77209AA865807 

428637 293660^1 AW979268AAB78419 AA431342AA431628 

430153 313709J AW968128AA468102AA468165 

430844 324570J T94960AA487679T95013 

431120 328264J AA492588 AA492498AM92571 

50 431169 328799 1 AW971240AA493843AA493723 

431322 331543"l AW970622AA503009AA502998AA5029e9AA502805 T92188 

432009 34025_T AL1 37424 BE007148T52277 

432222 343347 1 AI204995AW827539AW969908AW440776AA528756 

432507 348711.1 BE391093AA551334BE389643 

55 432779 354024 1 AW979241 AA565006AA847102 

432803 354257.1 AA555398 AW894072 H97930 

432869 355475 J AW974094 AA569074 AA602574 

432974 356950.1 BE348793AA573118N79366 

433492 367934^1 AWe05849AW262898N41060AA594852 

60 433584 370400 1 AW295399AW207772AW300641 AW070290BE348854AW170383AA600968AA778832 

433687 373061 1 AA743991 AA604852AW272737 

434803 393471 1 AW974640AA649S16N75626 

436146 41500.1 BE005252 AK000786 

437327 43610.1 AL353942AW994305 

65 438909 46664 1 AIH)85839 869137 AW188788 R69254 

440320 491930.1 AA879294 N67538 A)474541 

444314 600667 1 AI140497AW749625AW749626AW749844 

444610 612257.1 AJ174783 R83569 R12271 

446346 673545 1 AI290205AW235762AI651268 

70 447197 711623J R36075A1356546 R36167 

448404 761515.1 BE089973AI49B612AW805032 

449299 80436 1 AA299919 AW957012 AA001107 T83831 BE156389 

449540 8094512 AA001713H63836 

449877 81270 1 AA002071 AA002232 T99209 

75 450522 837264 1 A!688e39AI909260A1909259 

451024 85565 J AA442176AA259181 

451381 867770 1 BE241831 AVV249135 BE548847 AW2S0245 

452163 ' 902067.1 AI863140W80703R43474 

452293 909195.-1 AI871833 

80 452453 918300.1 AI902519A1902518AI9O2516 

452542 921410 1 AW812258AW812257AI906423AI906422 

452771 930983.1 T05477 T07855Ar917711 

454359 1130674J N71277 AW390764 
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454693 1229132.1 AW81342BAWB13444AW813367AW8133B8AW813429AW813424 

455024 1249196 1 AW851309AW850888 AW851419AW851412AW851299 

455226 1262534.1 AW902103AW889012AW869139 

455235 1265634 1 AW875951 AW875950AW875936AW875948AW875939AWB75857 

4SS673 1349656.1 BE065939 BE066079 BE065956 

455807 1370914 1 8E141140BE141139BE141105BE141143BE141127BE141202BE141108 

456034 ~ 142696 1 AW450979AA136653AA136556AW419381 AA98435BAA492073BE168945AAB09054AW238038BE01 1212 BE01 1359 B»^ 
BE01136BBE011362BE011215BE011365BE011363 

457471 340916 1 AW971364AA52S021 AA5707S9 

457620 371514 1 AA602711 6E078290 

458154 491768 1 AW816379AAB882B2AA879046AAS7919S 

459257 966605.1 AJ003631 AJ003650 AJ003651 



TABIE27C 

Pkey: Unique nunibof cofresponding to an Eos probeset 

Ref. Sequence source: 7 agllminibers in thbcdunm an Genbaiikldento 1)imhamL el gL' refers to the puUicalion entitled 

chromosome 22.' Dunham I. et a)., Nature (1999) 402:489^95» 
Strand: Indicates DMA strand from which exons were predicted. 
NtjKJSition: Indicates nucleotide positions ot predicted axons. 





Ref 


Strand 


NLposition 


400610 


9887671 


Minus 


1 17606-1 17928,1 24040-1 24147 


400880 


9931121 


Rus 


29235-29336.36363^580 


401049 


7232177 


Rus 


149157-150692 


401335 


9884881 


Plus 


15736-16352 


401575 


7229804 


Minus 


76253-75364 


401793 


7263888 


Minus 


102945-103083 


401987 


4406829 


Minus 


72893-73021,76938-77049 


402046 


8072415 


Plus 


166394-166556,168167-168395 


402222 


9958106 


Plus 


3261-3834.3939-4269 


402481 


9797406 


Rus 


87891-88991 


402629 


9931216 


Rus 


33641-33775,34182-34372,3600^-36084.40343-40612 


403760 


7712202 


Minus 


4591046260.47563^7824 


403903 


7710571 


Minus 


101165-102597 


403937 


7711761 


Minus 


12609-12773 


404043 


9556573 


Phis 


29042-29135,46597-46699 


404200 


6010176 


Minus 


7066.7210 


404274 


9885189 


Rus 


104127-104318 


404288 


2769644 


Rus 


3512-3691 


404269 


2769644 


Rus 


15049-15286,30267-30457 


404323 


9719753 


Minus 


31913^19 


404488 


8113286 


Minus 


6483544994 


404548 


8570305 


Minus 


8389&^162 


404593 


9944068 


Minus 


74922-75788 


404599 


8705107 


Rus 


110443-110733 


404916 


7341826 


Rus 


91057-91188 


405041 


7547195 


Rus 


121230-121714 


405095 


8072599 


Rub 


138877-139066 


405472 


8439781 


Rus 


108297-106447,108462-108596 


405494 


8050952 


Minus 


70284-70518 


405848 


7651809 


Minus 


2813&'28244 


405963 


8247788 


Rus 


40564699 


406182 


5923650 


Mnus 


28256-28935 


406485 


7711305 


Rus 


125036-125422 


406506 


7711374 


Minus 


684W077 


406554 


7711566 


Plus 


1069S6-107121 


406594 


8248611 


Minus 


35543-35845 



TABLE 28A: ABOUT 796 GENES OOWN-REGULATED IN LUNG RBROSIS COMPARED TO NORMAL BODY 

Tatile 28A iisls of atx)ut 798 genes that are downregulated in lung fitirDsis (collection of tPF, HP, and NSIP) samples as compared with nonnat "body map' samples. These were 
setected from at»ut 59680 profaesets on an Affymsbix/Eos Hu03 GeneChlp array such that the ratio of 'average' fibrosis sample expression levei to 'average* nonnal aduit tissues 
sample expression was toss than or equal to 0.1. The "avereger nonnal lung tissue level was set to the 75^ percentile amongst nomnal lung tissues. The 'average' fibrosis 
expression level was set to the 95* percentile amongst fibrosis samfdes. In onJer to remove gene-spedlic background levels of noivspedfic hybridization, the 1 50 percentile vahie 
amongst itorwnalignant tissues was subtracted from both the mjrneretor and (he deno^ 

Pkey: Unique Eos probeset UenGfier number 

ExAocn: Eramplar Accession number. Genbank accession nunAer 

UnigenelD: Un^ne number 

Unigene TUe: Ur^ne gene title 

R1 : Ratio of normal hjng to fibrosis 

Pkey ExAocn UnigenelD Unigene TiOe R1 

414002 NM-006732 Ks.75678 FBJ murine osteosarcoma vtral oncogene h 18,18 

421218 NM_000499 Hs.72912 cytochrome P450, subfamily I (aromatic c 9.39 

404518 AI81S601 Hs.79197 C083 antigen (acthratedBlymphocytos, I 8.30 

404795 5.58 
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403211 

417967 

400489 

425571 

406357 

407979 

452378 

408053 

421770 

4251 28 

402388 

402448 

448245 

413778 

419968 

447768 

405163 

437120 

409020 

431073 

433495 

403716 

424969 

404348 

407070 

412919 

402409 

456150 

427030 

426328 

429307 

400172 

431227 

433883 

446850 

405147 

406821 

402752 

401496 

421201 

402911 

425330 

438004 

448185 

433367 

416596 

400545 

418464 

426507 

403479 

406082 

401919 

449031 

400116 

401590 

401007 

404610 

408641 

407196 

410258 

433232 

457937 

406101 

407080 

419947 

421905 

454019 

428674 

402056 

425182 

425393 

433657 

402158 

404938 

403376 

418828 

402423 

416253 

435265 

42S65S 

428704 

425439 

445613 



BE244373 Hs.1119 nudear receptor subfernly 4, group A. m 

AJ007292 Hs.158306 epMiinAZ 

AA046306 Ha62927 ESTs 

AA025855 Hs.19597 ESTs 

AW139474 H&246862 ESTs 

AA374192 Hs.108124 ribosomal protdn U1 

N327S9 H8.172944 chorionic gonadobopln. beta polypeptide 



AI923551 Hs.170843 ESTs 

AA09Q235 Hs.75535 myosin, light polypeptide 2. regulatory, 

X04430 H&93913 interieukin6(lntertenfKt)ela2) 

X86400 H8.19520 FXYD doRiaiivoontdii^ ton transport reg 

A)356125 Hs.157767 ESTs, Weakly sbnDar to human H0XA2(as 

AA0&2549 Hs.21162 ESTs 

BE254470 Hs.249186 con»iod homeobox 

AW373784 Hs.71 8lpha*2<9lycoproteIn 1. zinc 

AW950928 H5.153998 creatine Unase, mltochondrlai 1 (ubiqid 

Y10209 etKReapiens mRNA for CO30L protein 

AI368680 Hs.616 SRY (sex determining region YHxM 2 

Z42308 gb:HSC0FB121 normafized infant broln cON 

AA397600 Hs.97531 ESTs 

AW6312S6 gb:hh83c0ay1 NCLCGAP.GU1 Homo sapiens 

AU076S92 HS.1989S1 jun B proto^noogene 

X63755 ' Hs.2743 l^eratin, cuticle, uitrahlgti sulphur 1 

AI925688 Hs.222312 ESTs, Wealdy sNIar to 624264 prolines 

R71245 Hs.174303 ESTs 

AAg77896 Hs.128873 ESTs. HigMy similar to ALFAJHUMANFRUCT 



AW241940 HS.102S00 hypothetical protein 11320481 

D25216 K5.155&50 KiAA0014 gene product 

AA774984 Rs.220649 ESTs.VfeaWyfiinillartoFCE2M0liSEL0WA 

AI633040 Hs.172730 ESTs 

AA584g30 K5^9451 ESTs, Weakly simOar to XAP-5^prote 

H67669 Ks.38564 ESTs 

R87580 gb^mSaiO? j1 Scares adult brain N2b4HB5 

AA380285 gb:EST93491 Supt ceBs Homo sapiens cONA 

S47833 Hs.82927 adenosine monophosphate deaminase 2 (Iso 

AI867S02 HS.Z71462 ESTs 



K58589 Hs.35158 HomO8apt8nscONAFU11027tis.ctonePL 

AW245207 Hs.55S5 Komosap)enscDNAFU13170fis,cloneNT 

D11747 Hs.177415 Finkel-Biskis.Reiily murine sarcoma viro 

X52638 Ks.739 6-ph06pholhicito-2-Mnaseftuctose-2,6-bi 

A1658621 Hs.127769 ESTs 

AW976930 Hs.128760 ESTs 

Z38133 Hs.1 13973 myosin, heavy polypepfide 8, skeletal mu 

AW298744 Hs.1 18894 ESTs 

A1660247 Hs.32699 ESTs. W^ similar to UV-1 protein [H 

D31846 Hs^025 a()uaporto2(coitoc6ngducq 

AA431734 Hs.104915 ESTs 

AF041259 Hs.155040 zhc finger protein 217 

NM.000218 H$.1561 15 potassium voltage^ated channel, KQT-tk 

AI244368 Ks.8124 PHdomtfnoontabilng protein in retinal 



AF020774 Hs.88844 Homo sapiens hair and skin epidemia^rp 



BE250659 
AA779958 
BE61455t 
AA432007 
D38024 



Hs.15463 ESTs 

Hs.165932 ESTs 

Hs.158675 ribosomal proton L14 

Hs.249484 ESTs 

Hs.157425 double homeobox. 2 

Hs.158491 ESTs 



5.46 

5.43 

5.19 

Sl19 

5.08 

5.08 

4.78 

4.62 

4.52 

4.49 

4.39 

4.37 

4.31 

4.29 

4.24 

4.21 

4.19 

4.19 

4.09 

4.07 

4.05 

3.99 

3.94 

3.90 

3.82 

3.61 

3.80 

3,79 

3.76 

3.74 

3.71 

3.70 

3.66 

3.68 

3.67 

3.64 

3.57 

3.55 

3.50 

3.50 

3.49 

3.49 

3.46 

3.46 

3.43 

3.41 

3.39 

3.37 

3.35 

a34 

3.34 

3.33 

3.33 

3.31 

3.29 

3.28 

3.25 

3.25 

3.23 

3.23 

3.23 

3.23 

3.18 

3.18 

ai6 

3.16 
3.16 
3.14 
3.06 
3.06 
3.08 
105 
3.03 
102 
101 
100 
Z99 
2.99 
199 
198 
198 
197 
197 
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402714 2.96 

403526 2.96 

403605 2.95 

. 441852 AB02e968 Hs.7989 KIAA1045 protoi 2.95 

5 417629 T76945 Hs.64211 ESTSiWeaMysirnnar to similar to acyt- Z94 

447744 AA313230 Hs.19413 SlOOcalcium^iindingpfOteta Z91 

419821 AWg87486 H$.189119 ESTs 2.90 

446993 AI570964 Hs.154257 ESTs i89 

414580 BE386918 ob:601275386F1 NIH„M6C_20 Homo sapiens e 188 

10 423379 A198S349 Hs.157492 Homo sapiens cONAFU 1407968, done HE 168 

440206 Af762232 Hs.46794 ESTs 188 

402212 AW502761 Hs.30909 KIAA0430 gene product 187 

406059 188 

423548 AR)07194 H8.129782 mudn 3A.inte5tina) 186 

15 402051 185 

41519S AK000150 Hs.78185 MAX-liXs t)HLHZlP protein 185 

455446 AW947749 gb:RCO-MT0005-130300-031-t}01 MTOOOSHomo 185 

442428 8E464988 Hs.298302 ESTs 164 

403247 183 

20 40482S 183 

459184 135001 Hs.95669 ESTs 183 

402968 182 

417575 R00362 Hs.191ig9 ESTs 182 

404668 181 

25 420619 AF130255 Hs.99430 tes9s zinc finger protein 161 

447241 BE382838 H3.19322 ESTs 180 

446793 AI864581 Hs.215477 ESTs 179 

453014 AI937242 H3.176590 ESTs 179 

446775 Af792836 Hs.232273 ESTs 178 

30 455075 AW654650 gb:QV2-Cr0261-20109&O11-h03CT0261 Homo 178 

406704 M21665 Ks.929 myosin, heavy pdypepiide 7, cannacmus 175 

457546 AA568484 Hs.153632 ESTs 175 

410197 NM.005518 HS.S9889 3^ydroxy-3^elfiytglutaryl-CoenzyfnaAsy 174 

433677 AI791912 Ks.190885 ESTs. Moderately siiroTar to ALU1.HUMAN A 174 

35 405703 173 

408640 AW277132 HsJ254880 ESTs 173 

413958 BE277913 Hs.172364 Homo saddens mRNA for aJ00086 proldn, 173 

454421 BE409759 Ks.59563 Homo sapiens mRNA for FU00007 protein, 173 

406702 Z20656 H3.278432 myosin, tieavy polypeptide 6, canSac mus 172 

40 408664 R56362 gb:yg93c07j1 Soares Infant brain 1 NIB H 2.72 

402457 171 

403612 171 

407049 X72632 (NONE) 171 

415423 AA164743 Hs.187617 Homo sapiens cDNA FU13941 fis, done Y7 170 

45 402862 169 

403540 169 

431465 AW293178 Hs.180086 ESTs 169 

406563 168 

417003 AL038170 Hs.80756 betalne-homocystelnemethyltrBnsfefBse 168 

50 426220 AI363475 Hs.171697 ESTs, Vteddy similar to invmrnoglobulins 168 

446707 At591214 HS.1S6338 ESTs 166 

447557 AW028809 HS.229S70 ESTs 168 

413529 U11874 Hs.846. interieukin 8 receptor, beta 2.67 

403997 166 

55 408704 AA056635 Hs.5368 Homo sapiens cDNA: FU21522 Us, done C 166 

407005 U20230 gb:HumanguanylcydaseCgene,partid 165 

405075 164 

430728 AW968522 ob:EST3805S8 MAGE resequences, MA6J Homo 164 

405327 163 

OO 409419 BE207219 Hs70474 EST8,HigMy8&n]IartoS17l12intorfe(0 163 

434300 AA740944 H5.116295 ESTs 163 

405895 162 

431929 AW294163 Hs.146127 ESTs 161 

405217 160 

65 437569 AA760849 Hs.294052 ESTs 160 

419822 AW986864 Hs.255780 ESTs 159 

445918 AW014139 Hs.145656 ESTs 159 

446149 6E242980 Hs.203161 ESTs 159 

457829 AI742291 Hs.210843 ESTs,WealdyslnfiaartodJ1039K5.2(asa 158 

70 404282 153 

409778 AW499705 gb;Ul-HF.BR0p^T<^05<»-Ul.r1 Nia.MGC.5 153 

445353 BE551465 Hs.175211 ESTs 153 

458764 BE619386 gb£01473204F1 NIH.M6C_68 Homo sapiens c 153 

402195 152 

75 404247 152 

427584 BE410293 H3.179718 v-myb avian mydobiastosis vira) oncogen . 152 

402588 150 

432301 U34249 Hs.t67075 ring finger prOtdn 9 150 

424958 AA984420 Hs,283659 ESTs 149 

80 442197 AW637912 gb:QV3.LT004$-26010(M)5^LT0048Homo 2.49 

415003 M11437 Hs.77741 idnlnogen 146 

420767 AF072711 HS.9S91S carboxyl ester Upasa (bile salMmjla 148 

42268S BE244068 H3.121544 biterleukln 12 receptor, beta 1 147 

321 



>V0 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



440424 

402153 

432152 

454414 

401603 

408493 

408513 

409826 

400672 

430713 

449748 

453756 

400624 

403125 

406118 

402165 

416982 

425515 

402951 

427886 

447173 

448703 

426344 

401840 

403731 

405378 

405555 

415559 

438216 

448427 

451588 

423011 

451172 

401015 

414705 

439894 

446305 

453512 

418556 

457197 

457275 

458766 

414075 

430210 

442614 

402538 

439891 

440056 

406150 

426880 

447129 

458893 

456778 

401728 

404139 

414095 

432037 

451965 

416768 

427586 

454108 

429749 

434507 

436652 

437433 

401688 

441748 

453072 

400635 

417176 

42785B 

454886 

458232 

408922 

423668 

440338 

403115 

409125 

426887 

413811 

442962 

403921 

413140 



AI991125 Hs.189109 Homo sapiens cONA:aJ214SB lis. done C 



Hs.272790 Homo salens cONA FU20238 fs, dona CO 
Hs.164675 ESTs 



AK000245 
R55574 

BE2)6854 
AW206468 
AVK01112 

AA351647 

H23S63 

AW139415 



J05401 
W26609 

AA417083 
AW449385 
BE613942 
H41821 



Hs.46039 phosphogiyceralB mutase 2 (musde) 

Hs.103118 ESTs 

Hs.34487 hypolhefic^ protein FU23412 

Hs.2642 eukaryoQc translaSon eiongaOon factor 

Hs^3 ESTs 

Hs.61906 ESTs 



Hs.80691 creaQne kinase, mitochondria) 2 (saitom 

0b:35f12 Human retina cONA randofrty prim 

Hs.104789 ESTs 

Hs.157294 ESTs 

Hs.170890 Homo sapiens cDNA: FU21 129 lis. done C 

Hs.169393 banscrtptional adivator of the c-fos p 



Y09306 

A1039195 

ZB3952 

BE395260 

AW072057 

NM.000683 

AW206465 

BE464157 

AA853077 

AW270149 

AL040160 

T02850 

AB016092 

AA463422 

AW183618 

U11862 

AL157426 

A1269030 

A1389940 
BE294828 

AA453482 
AWD14123 
BE161733 
Al45830g 



Hs^0146 trameodomain-interading pmtem kinase 3 

Hs.128060 ESTs. We^ simOarto cDrtA ESTyk48l9$ 

Hs^52815 ESTs 

gb:601311130F1 N1H.MGC.44 Homosaplensc 
9bn«858g05.x1 Na_CGAP„Bm25 Homo sapien 

Hs.299847 ESTs. Highly similar to A2A0.HUMAN ALPHA 

Hs^7423 ESTs 

Hs.261455 ESTs 

gb:NHT6Caa03a05f1 Normal Human Trabecula 
HsJS4515 ESTs. Moderately similar to AF248953 1 g 
Hs.209542 ESTs. Weakly simHar to BoeH linker pr 

gb:FB12A9 Fetal brain, Stratagene Homo s 
H8.1971 14 RNA bfnding protein; AT-ridi element bin 
Hs.209431 ESTs 

HS.18B417 ESTs, Weakly simflar to ZnT-3 [H sapiens 
Hs.75741 amBorlde binding protein 1 {arrtne oxida 
Hs^35390 Homo safsens mRNA; cDNA OKFZp7616101 (fr 
Ob»y73c1Zx1 NCLCGAPJKUa Homo sapiens 

Hs.109968 ESTs 

Hs.13323 hypothetical protein FU22Q69 

gb3x47a11 j1 Soares_tesli3jlHr Homo sap 
Hs.161402 ESTs 

HsJg72B3 ESTs,WeaklysimilartoALU1J1UMANALUS 
Hs.117406 ESTs 



BE293546 

AW450592 

AA021163 

AA363733 

AA609661 

AA161071 

AI685174 

AW511138 

AA724543 

R74016 

R14439 
BE2S1845 

AW974475 

Nl\4.001971 

AW837063 

6E217072 

RB7388 

Y1014B 

R62431 

R17268 
AI971975 
BE168828 
AI025315 

T06607 



gb:601186671F1 NIH_MGCJ5 Homo sapiens c 

Hs.300459 ESTs 

Hs.22287 ESTs 

Hs.1032 regenerating Isiel^lerived 1 alpha (pane 

HS.190S92 ESTs 

Hs.71465 squalene epoxMase 

Hs^3 ESTs 

Hs^65ai ESTs 

Hs.166824 ESTs 

H&121561 ESTs 



Hs^94 
Hs.221516 

Hs.143467 
Hs.21 

Hs.279537 

Hs.131138 
HS.127S8 

HS.3015G0 
Hsi12892 

Hs.131615 

Hs.6846 



ESTs 

ESTs. WaaMy similar to tetraspen TM4SF 

ESTs 

elastase 1. pancreatic 

gbK3V1-LT0037-150200-069^ LT0037 Homo 
ESTs 

gbvm88g04j1 Scares adullbrdn N2b4HB5 

neurotensin receptor 2 

ESTs 

ESTs 
ESTs 

gb:OV1.HT0517<020400'145-fD4 HT0517 Homo 
ESTs 



hypothetical protein FU13055 



2.47 

Z48 

2.46 

145 

2.44 

2.44 

Z43 

142 

Z41 

2.41 

141 

141 

140 

140 

139 

138 

138 

138 

137 

137 

137 

137 

136 

135 

134 

134 

134 

134 

134 

134 

134 

133 

133 

132 

132 

131 

131 

129 

128 

128 

128 

128 

127 

127 

2.27 

126 

126 

126 

125 

125 

125 

2.25 

124 

123 

123 

123 

2.23 

123 

122 

122 

122 

121 

121 

121 

121 

120 

119 

119 

118 

118 

118 

118 

118 

117 

117 

117 

116 

116 

116 

115 

115 

114 

114 
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421998 AW583807 Hs.t460 gfueaoon ^14 

436130 AA3414g7 Hs.31408 ESTs Z14 

407243 AA0S8357 Hs.74466 caidnoembryoriicantigefHeladfidce^ 2.13 

- 407708 AF0t9988 Hs.37936 suppressor of variegation 3-9 (DrosopNI 2.13 

5 442792 AI352340 Hs.131194 ESTs 2.12 

454408 AA213S05 HS.267B61 ESTs 112 

424648 AA34457& gb:ESTS0478GaDfaladd8r I Kofm sapiens 2.11 

433963 AI218808 Hs.187778 ESTs 111 

400736 2.10 

10 406343 2.10 

409702 A1752244 H8.285749 Human DNA from dvomosome lO-spedfic CO 2.10 

432092 AF135026 gbiHonv) sapiens kaDiHrein-Iike pmt^ 2.10 

441915 A1566116 Ks.207086 ESTs. We^ simOar to FOG [Mjnuscuhis] 2.10 

453147 AA733098 Hs.279909 CGMSpnJlein 2,10 

IS 415604 Z44177 Hs.170434 Homo sapiens cDNAFU 14242 lis, done OV 208 

422927 AW2473B8 Hs.301423 caldum binding protelnl (catbr^) 2.0S 

401211 Ai004832 Ks.5038 neutopalhy target esterase Z07 

413808 J00287 Hs.182163 cddesmonl Z07 

414433 8E407755 Hs.169100 Homo sapiens c[mFU12529fis. done NT 2.07 

20 421978 AJ243662 Hs.110196 NICE-I protein 2.07 

431204 F28a41 Hs.250760 cytodtromec oxidase sUbunit Via pdypep 2.07 

433605 A)378012 Hs.147g53 ESTs 2.06 

4493B3 AW444712 Hs.196573 ESTs Z08 

455652 BE064675 gb:RC1-BT0313^12994}12-h11 BT0313 Homo Z05 

25 402382 2.04 

407282 Ai345597 Hs.254727 ESTs 2.04 

457273 AI167145 Hs.165538 ESTs » 2.04 

459073 AW966616 Hs.296234 ESTs. HigMy similar to mHogen^Ovale 2.04 

402394 2.03 

30 428875 AW451624 Hs.178202 ESTs 2.03 

456634 AA609911 Hs.109012 ESTs 2.03 

434352 AF129505 Hs.86492 small musde protein. X-lini^ Z02 

439281 AA100768 Ks.48485 ESTs 202 

444153 AK001610 Hs.10414 hypolheiicai prolefn FU1074B Z02 

35 401122 2.01 

444340 AI143198 Hs.143561 ESTs 2.01 

455104 BE064863 gb:RC1>BT0313-110300-015.f06BT0313HonK> 101 

415011 AW963085 gb:EST3751S8lkMGE resequences. MAGHHonn 2.00 

440144 AW082297 Hs.88523 ESTs 200 

40 4031B3 1-99 

409802 AW500732 gb:UI-HF^N0<ai(nv4h074)-Ul.r1 NIHJA6C.50 1.98 

430144 AI732722 Hs.187694 ESTs 198 

4445BQ AI168365 Hs.268663 ESTs 1.98 

401704 1.97 

45 401810 1.97 

424473 AK001405 Hs.148584 Homo sapiens cDNAFU10543 its. done NT 1.97 

438573 AW135084 Hs.299865 ESTs 1.97 

412921 BE009345 Hs.128942 ESTs 1.96 

422233 AB002058 Hs.1 13275 purinergic receptor P2X-nke 1. orphan r 1.96 

50 425352 NiyLO0a939 Hs.18g7 piaopiomdanooa(fb){adienocoifiootropln 1.96 

410285 AA0B3609 gb2m63(l05j1Stra!agene1ibrobtast(937 1.95 

414323 NM_014759 K5.239500 KiAA0273 gene produd 1.94 

428119 AW298211 Hs.255737 ESTs 1.94 

424510 AK001841 Hs.14g797 hypothetical protein FU10979 1.92 

55 425280 U31519 H8.1B72 plwsplioenolpyruvatecaitaQxyldnasel (sd 1.92 

429785 H62114 Hs.301769 ESTs 1.92 

437344 R90921 Hs.6846 hypothetic^ proton FU 13055 1.92 

451819 A1819096 Hs.249260 ESTs 1.92 

^. 459060 H89244 Hs.79625 heterogeneous nudear ribonudaoproldn 1.92 

60 422664 AA315933 H5.120879 ESTs 1.91 

432247 AA5312B7 Hs.105805 ESTs 1.91 

453820 R77494 Hs.754l6 OAZ associated protein 2 1.91 

400675 1.90 

405556 Y09306 Hs.30148 homeodomaiiv^nterading protein Idnase 3 1.90 

65 407099 M948gi H$.278423 pregnancy spedRc beta-l-^protein 4 1.90 

440297 BE560553 Hs.205450 Honw sapiens cONA:FU22570 lis, done H 1.90 

443104 AA088470 Hs.B3t35 p534esponsl»egene6 1.90 

444329 W73753 Hs.58330 ESTs 1.90 

402690 1.89 

70 432354 AW137262 Hs.192713 ESTs 1J9 

427811 M81057 H5.180884 caitoxypepQdase 61 (tissue) 1.88 

443322 AI825B17 Hs.143272 ESTs 1.88 

458185 AI762757 Hs.129859 EST6,Wealdy8lniBartoAF1136851 PR009 1.88 

459072 A!B1597B Hs.160427 ESTs 188 

75 402534 187 

409889 AA078492 gb:7P04D11 Chromosome 7 Placental cONAL 1.87 

416931 D45371 Hs.804€5 adipose most ebundant gene transcript 1 1.87 

430176 AL161995 Hs.234775 neufturin 1.87 

430631 AJ003147 Hs,278464 oUadoiy receptor, family 1, subfamily 1.87 

oO 433114 AA121579 gb2n77f02r1 Stratagene NT2 neuronal pr 187 

439254 U57352 Hs.6517 amiloride-sensiSve cation channel l.ne 187 

446461 AW166358 Hs.124979 ESTs 187 

450675 AA010662 Hs.166839 ESTs 1.67 
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401767 1.88 

449891 N64867 Hs.37848 ESTs 1.85 

400527 1.84 

^ 428581 AA430570 Hs.104881 ESTs 1.84 

S 443647 A\/653846 Hs.126261 Homo sapiens Chromosome 16 BAG done OT 1.84 

444785 AV551441 Hs.2B2475 ESTs 1.84 

449566 AA001778 Hs.288156 Homo sapiens cONA: FlJ21819fi3. done H 1.84 

435752 AW298529 Hs,255774 ESTs 1.83 

^ 437405 AA338837 Hs.42547 Homo sapiens cDNAFU13975fis, done Y7 1.83 

10 449174 T66136 Hs.1288D ESTs 1.83 

449887 AW080843 Hs.200275 ESTs 1.83 

453261 AA034116 Hs.118494 ESTs 1.83 

454243 AW24ig01 H8.2S0683 ESTs 1.83 

459188 AA216382 Hs.30002 S»0«ontainIng protein SH3GLfi2 1.63 

IS 424334 AA393460 gb:zl71e05.r1 Soares_te5t!s_NKT Homo sap 1.82 

432150 AK000224 Hs.272789 hypothetical protein FU2021 7 1.82 

40B123 AW163377 Qb:au94e02.y1 Sdmoder fetal brain 0000 1.81 

428722 U78456 Hs.19Q7S7 tissu8lntiB)itorofmetaflopn}lelna584 1.80 

442196 AI902646 Hs^1844 Homo sapiens cONAFU12588fis. done NT 1.80 

20 421419 IM99587 H3.104134 homeob(n(H6(amOy)1 1.79 

405420 1.78 

405737 1.78 

414016 AA134S94 Hs.71S28 ESTs 1.76 

415744 AW9$4850 H5.279307 ESTs 1.78 

25 420375 AF182077 Hs.97244 gDcmatumorsuppressofcandidala region 1.78 

426322 J05068 Hs.2012 transcobaiamin I (vitamin B1 2 binding pr 1.78 

421592 AF009a01 Hs.105941 bagpipe homeobox (Drosophiia) homolog 1 1.77 

401743 1.75 

405187 1.75 

30 442763 AI017037 Hs.131121 ESTs 1.75 

451621 AI879148 Hs.26770 fatly add WncDng protein 7, bcdn 1.75 

413248 T6485a Hs,21433 ESTs 1.74 

423913 NM.016436 Hs.301055 hepatocellular cardnoma.associated anO 1.74 

439999 AA115811 Hs.6838 ras homolog gene tangly, member E 1.74 

35 440185 AW104546 Hs.270929 ESTs 1.74 

4504B2 A1697844 Hs.221720 ESTs 1.74 

413972 BE279548 H3.162717 ESTs,WeaWy8lmilaf ^oHPPDJ^UMAN4-HYD 1.73 

420476 AW575863 Hs.136232 ESTs 173 

428748 AW593208 Hs.98765 ESTs 1.73 

40 431148 AA5Q2653 H8^21 ESTs 1.73 

447205 BE617015 Hs.110O6 ESTs 1.73 

455994 BE179190 gb:RC0-HT0613-21030&O32'f07 HT0613 Homo 1.73 
401039 1.72 
403251 1-72 

45 409762 AW498B84 Hs.257970 ESTs 1.72 

440914 AAS09552 Hs.143884 ESTs 1.72 

448507 AL133109 Hs.21333 . Homo sapiens mRNA;cDNADKFZp566N1 047 p 1.72 

409805 AW444477 Hs.258507 ESTs 1.71 

441212 AW242447 Hs.146182 ESTs. Weakly similar to lactase phlorizi 1.71 

50 445624 AW140103 Hs.76880 OvB (bacterial acetoladatesynthaseH 1-71 

458619 AA872064 Hs.301218 ESTs, WeaUydnflar to Unknown gene pro 1.71 
401969 1.70 
403327 1.70 

407245 X90568 Hs.172004 tlBn 1.70 

55 417361 N1WL000275 Hs.82Q27 ocdocutaneous albinism il(pnlt^da 1.70 

436034 AF2B2693 H8.150185 Inflanmialion^ated G proteln^pied r 1.70 

442682 AI014545 Hs.231027 EST 170 

458494 AI380906 H9.158438 ESTs 1.70 
404682 1.69 

60 407402 AF035303 gb:Hofno sapiens done 23943 mi^soquenc 1.69 

409366 AA071059 gb2m66a10.r1 Stratagene neuroepithellum 1.69 

440362 AA883812 HS.125S08 ESTs 1.69 

448866 BE297743 Hs.284203 niyogenicfedor3 1.69 
402201 1.68 

65 426230 AA367D19 Hs^41395 protease, serine. 1 {trypsin 1) 1.68 
403186 1.67 

409543 AW410200 gb:fhOSb12j(1 NIKJ^.I? Homo sapiens cO 1.67 

443872 AA323382 Hs.9667 butyiobetaine (gamma). 2^ogiutarstedi 1.67 

_^ • 450391 AI694522 Hs.202280 ESTs 1.67 

70 408919 AW295352 Hs.251836 ESTs 1.66 

416136 H45027 Hs.181770 ESTs 1.66 

416865 H97863 H8.42456 ESTs 1.66 

419682 H13139 Hs.92282 paM4ike homeodomain transcffpOon fa 1.66 

437237 BE513073 gb:601 1 71 435F1 NIHJA6CJS Homo sapiens c 1.66 

75 429134 AA446953 Hs,99004 ESTs 1.65 

445041 T641B3 Hs.11398 ESTs 1.65 

453240 AI969564 Hs.284249 Homo sapiens d}NA:FU22334 lis, done H 1.65 

405243 • 1.64 

426039 BE265133 Hs.217493 annexInA2 1,64 

oO 430135 NH^000035 Hs^234 aldolase B. frudose4)isphosphaIe 1.64 

435942 R05285 Hs,191215 ESTs 1.64 

448106 A1800470 Hs.171941 ESTs 1.64 

408591 AF015224 KS.4&4S2 mammaglobfnl 1.63 
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410881 AW809157 gb:RO0^TO11B4M109»)31-c07J STOII8H0 1.63 

4in43 R14738 Hs.8312 ESTs.Wed(lysiniDartoAF1707231pnste 1.62 

430632 AO004597 Hs.248088 dfactoy receptor, famOy 10. sutMy 1.62 

448851 BE246440 KS.9372B pre-B^ leukenva transcripGon factor 1.62 

3 453718 A1119317 Hs.120360 phospholipaGe A2, giDup VI (cytosoTic. c 1.62 

459499 AW4026S3 Hs.28355 Homo sapiens cOHA:FU22402Gs, done H 1.62 

412374 X01388 Hs.73649 apoDpoptotein C-i» 1.61 

419113 AI446565 H5.21835 ESTs 1.61 

425795 AIB10474 Hs.19e945 ESTs 1.61 

10 426998 BE274360 9b:601121068F1 NIHJi(GC.20 Homo sapiens c 1.61 

428407 NMt003963 Hs.184194 transmembrane 4 superfamDyoieinber 5 1.61 

444475 C75571 gb:C75571 Human pancreaQc islet Homo sa 1.61 

453399 Z7029S Hs.32966 guanytete c/dase adivalor 2B (uroguany 1.61 

456275 AW976183 HsJ8414 ESTs.WB3Wysimflarto(U512E2.1 (H-sap 1.61 

IS 414060 BE246327 ob:TCBAP1E1967Pediatncpre-Bcelacut 1.60 

418004 U37519 Hs.87539 aldehyde dehydrogenase 8 1.60 

428651 AF196476 Hs.18B401 annexinAlO 1.60 

443853 AI0890&4 Hs.250644 ESTs 1.60 

407007 U22961 gb'.Human mRNA done with similarity bL 1.S9 

20 412067 N45697 gb:yy78d01.r1 SoaresjnulQpiejsdefOSis. 1.59 

419080 AW15Q835 Hs.18878 hypothetical protein FLJ21 620 1.S9 

448619 A1B67182 Hs.202255 ESTs 1.59 

403665 1-68 

407S24 X64g8S gb:H^8piens mRNA HTPCRX11 for olfactory 1.58 

25 424286 AA3382B5 Hs.90744 proteasome(pfOSomB,macropain] 28$subu 1.58 

412056 T28160 Hs.778 guanyiale cyclase acGvator IB (reOna) - 1.57 

430218 AW998B65 Hs.l86703 ESTs 1.57 

431882 NhL001426 Hs.271977 engrailed homdog 1 1.57 

450797 AI761930 Hs.205127 ESTs 1.57 

30 455366 AW947563 . gb:RCO-MTX04-140300^1-g11 MT0004Homo 1.57 

408421 AW193734 Hs.253067 ESTs 1.56 

421907 BE0185S6 Hs.109358 ATPase. Class V, type 108 156 

432742 AAS64453 Hs.162339 ESTs 1.56 

436624 T64297 H8.5241 fatty add Nnding protein 1, liver 1.56 

35 439543 W75935 Hs.146083 ESTs 1.56 

443317 AI051601 Hs.200191 ESTs 1.56 

449097 BE271708 Hs.95110 ESTs, Weakly similar to PIP6_KUMAN 1-PHO 156 

457127 AA1945&4 HS.1B3434 ATPase. transporting, lysosomal (vacu 1.56 

407387 AB000895 gb:Homo saddens mRNA for cadherfn RBI, 1.55 

40 418837 U48263 Hs.89040 prepronodcepttn 1.55 

436749 AA584890 Hs.5302 lectin, gdadoside^nding, sdubte, 4 1.55 

458475 A1650322 Hs.143249 ESTs 1.55 
402561 1.54 

411187 AW821291 gb:PM3.ST0307-241299402-403 ST0307 Homo 1.54 

45 419224 NI^0121B9 Hs.2S2716 fibroushealhln II 1.54 

414657 AA424074 HsJ6780 protein phosphatase 1, regulatory (jnhib 1.53 

415426 Z41991 Hs.23197 ESTs 1.53 

421428 1126726 Hs.1376 hydnixystero!d(11-beta) dehydrogenase 2 1.53 

-.^ 426300 U15979 Hs.16922B delta^te homolog prosophila) 1.53 

50 428489 A1807459 Hs.98582 ESTs 1.53 

437728 AA766719 Qb:oa39c09.s1 NCLCGAP.GCB1 Homo sapiens 1.63 

407124 R08160 H3.268857 ESTs, WeaWy similar to ALU1_HUMAN ALUS 1.52 

414932 C14577 Hs,194517 ESTs 1.52 

433500 AF064255 Hs.1 11401 very lon^^halnacyl-CoAsynlhetase homo 1.52 

55 439688 AW44S181 Hs.209637 Homo sapiens cDNA RJ12921 fis, done NT 1.52 

453391 AW600302 Hs.23265S ESTs 1.52 

424688 AA216287 Hs.1815 myosin. nghtpdypepOde 3. aik8B:ven 1.51 

436895 AF037335 H3.5338 carbonic anhydrase Ml 1.51 

443012 AI566813 Hs.132278 ESTs 1.51 

60 415624 D42039 Hs.78871 mesoderm development candidate 2 1.50 

445152 At214667 H5.283597 ESTs 1.50 

455941 6E160011 Hs.129999 Homo sapiens CCINARJ14267 Us. done PL 1.60 

457889 AL035864 H5,69517 ESTs,Wghiy sirrtfartodifferenfiallye 150 

458503 AL133933 Hs.64310 Interteukin 11 receptor, dpha 150 
65 400694 149 

420937 AW966719 Hs.1340 coHpase, pancreatic 1.49 

426752 X69490 Hs.172004 fiOn 149 

426784 U03749 Hs.172216 chromogrartn A (parathyrrfd secretory pr 149 

428874 W32133 Hs.194366 transthyretin (prealbumin, amytoldosis t 149 

70 444287 mam H5.IO755 dlhydracvitmidinase 149 

450684 AAB72605 Hs.25333 MerfeuUn 1 receptor, type 11 149 

425747 AI457620 H5.205360 ESTs 148 

432378 AI493046 Hs.146133 ESTs 148 

447999 AW138840 Hsi01778 ESTs 148 

75 453688 AW450670 HS.2S2819 ESTs 148 

406667 M12523 Hs.75442 dbundn 147 

418129 X52997 Hs.1144 glycoprotein iX (plateleO 147 

426309 A1912555 Hs.157195 peptide YY, 2 (seirdnalplasmin) 147 

426755 BE253469 gb:601108143F1 NIH.MGC_16 Homo sapiens c 147 

80 414258 AA203285 Hs.294141 ESTs. WOaldy similar to dJ733D15.1 (H.8a 146 

417421 AL138201 Hs.82120 midear receptor subfamiiy 4, group \ m 1.46 

420562 A1345569 H8.190046 ESTs 146 

425011 T51988 Hs.283108 hemoglobin, gamma G 148 
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443050 AI81278a Hs.132348 ESTs, VfeaMysfanflarb diaphanous 1 (K 1.46 

411074 X60435 Hs.68137 adenylala cydasa aclivaQng polypeptide 1.45 

434680 T11738 H3.127574 ESTs 1.45 

^ 454771 AW819339 Hs.273529 ESTs 1.45 

5 415672 N53097 Hs.193579 ESTs 1.44 

418141 AW845738 Hs.171118 Homo sapiens mRNA for RJ00026 protein, 1.44 

406706 X03740 Hs^SISSI myosin, heavy polypeptida 1. skelela) niu 1.43 

418197 AA214253 gb2n58g01r1 Stratagene musde 937209 H 1.43 

431821 AW452256 Hs:J71221 hypothetical protein FU20064 1.43 

10 455433 AW939463 gb:QVl.OT0072-31010W}5&flO2 DT0072Hon» 1.43 

407743 AW814118 gb:MR3^T0203.15l199^l1-d09ST0203Homo 1.42 

418888 AU076801 Hs.89436 cadhertn 17. U cadhann (Rver-faitestin 1.42 

434001 AW9S0905 Hs.3697 serine (or cyst^) proteinase Inhlbito 1.42 

441031 AI110684 Hs.7645 fibrinogen. B beta polypepSde 1.42 

15 452456 BE080763 gb:QV1-BT063M50200.071<ff)9 BT0631 Homo 1.42 

456535 AA305079 Hs.1342 cytochrome c oxidase subunftVb 1.42 

406349 BE546947 Hs.44276 homeoboxCIO 1.41 

420391 AM56891 Hs.79123 KIAA0084 protein 1.41 

421126 M74567 Hs.102122 lnsulln.lika growth fector binding prote 1.41 

20 449329 AW752783 gb:IL3^T0219-221199429-F03CT0219Homo 1.41 

453615 AA195712 Hs.l32696 ESTs 1.41 

417296 136196 Hs.81884 sulfotransferasefaniOy.cytosolic, 2A, 1.40 

420287 AA740307 Hs.88297 ESTs 1.40 

427583 MB2962 Hs.179704 meprin A. alpha (PABA peptide hydrolase) 1.40 

25 418787 AW296134 Hs.86999 ESTs 1.39 

422072 AB0182S5 Hs. 1 1 11 38 KIAA0712 gene product 1.39 

425988 BE045897 Hs.274454 ESTs 1.39 

428087 AA100573 Hs.182421 troponin C2. fast 1.39 

438136 NkL002390 Hs.6088 a disintegrin and melalloproteinase doma 1.39 

30 455579 BE011320 gb:PM3^02ia<»050(M)02^ BN0218 Homo 1.39 

402316 1.38 

417084 H08370 Hs.33067 ESTs 1.38 

423276 AC003034 H5.126261 Homo sapiens Chromoscme 16 BAG done OT 1.38 

433787 AI472951 Hs.173688 ESTs 1.38 

35 413830 6E263439 Hs.13144 HSPC1 60 protein 1.37 

423576 NM.000383 Hs.129829 au(dmnujneiegatolor{aulotnlmmunepdyen 1.37 

401686 1>36 

412688 AW583062 Hs.74502 diymotrypsinogen B1 1.36 

401238 1-34 

40 421511 AA4e8940 Hs.105216 hypothetical protein FU1 1125 1.34 

422440 NII/L004812 H8.116724 aido4ceto reductase fan%1, member B11 1.34 

• 425450 U14755 Hs.157449 UM homeobox protein 1 134 

427333 AF067797 Hs.176658 aquaporinS 1.34 

430937 X53463 H$.2704 ^utathione peroxidase 2 (gastrolntestin 1.34 

45 445204 AW135523 H5.245653 ESTs 1.34 

452030 AL137578 Hs.27607 Homo sapiens fflRNA;cDNADKFZpS64N2464(f 1.34 

456379 W22206 gb:83E10 Human retina cDHATspS091^to 1.34 

457416 BE142052 Bb:CM3-HT0137-1S0999^11-b05HT0137Horao 1.34 

415741 AI902761 HS.2720B7 ESTs 1.33 

50 422260 AA315893 Hs.105484 ESTs. Utealdy similar bUTBJIUMANUTHO 1,33 

429188 ABD11171 H3.198037 K1AA0599 protein 1.33 

442776 AW959498 Hs.8709 chymobypsln C (caldeoin) 1.33 

454748 AW862014 gb:RC3-CT0347-1602004)lW09CT0347Homo 1.33 

437744 AW290g05 Hs.300288 ESTs. Weaidy similar to CGHU2E collagen 1.32 

55 451997 AA021351 Ha.1SB4g7 KIAA0724 gene product 1.32 

452340 NM.002202 Hs.505 tSLI transcription factor, LIMAomeodorna 1.32 

411879 BE145354 Hs.273758 Homo sapiens cONA:FU231 12 lis. done L 1.31 

424304 NiyL001395 Hs.144879 dual spedfid^ phosphatase 9 1.31 

401442 1.30 

60 403942 1.30 

443687 F13040 Hs.182937 peptidylprotyl isomerase A (cydophlTn 1.30 

401624 1.29 

41188S AA452636 Hs.131057 ESTs, Moderately 8li(lIlartaCRGD.HUMAN 6 1.29 

418575 AA225313 Hs.222B85 ESTs 1.29 

65 419818 AI657122 Hs.301931 ESTs 1.29 

429845 AB020337 H3.225943 UOP-Gal:belaGtcNAcbeta1,S^tosyitr 1.29 

447566 A1061980 Hs.285829 solute carrier family 25 (mitochondrial 1.29 

407013 U35637 gb:HumannebuIinmRNA, partial cds * 1.28 

„ 428470 ACO02301 Hs.1&4507 Homo sapiens Chromosome 16 BAG done OT 1.28 

70 429780 AL137518 Hs.300388 ESTs 1.28 

453539 AVmi886 Hs.95196 ESTs, Weakly similar to T20B1Z3(C.deg 1.28 

400846 1.27 

420257 AA257035 Hs.190042 ESTs 1.27 

429184 AF095735 Hs.198003 sarcoslne dehydrogenase 1.27 

75 437389 AL359587 Hs.271586 hypothetical protein OKFZp762Ml 15 1.27 

444412 AI147652 Hs.216381 Homo sapiens done HH409 unknown mRNA 1.27 

451139 AW293316 HS.20S5S8 ESTs 1.27 

431284 AAS70148 Hs.126783 Homo sapiens cONA:FU22610fiS. done H 1.26 

431969 AA366217 Hs.2879 caiboxypeptidaseAl (pancreatic) 1.26 

80 406158 1.25 

419648 T73661 Hs.91877 ESTs. Highly similar to TH1H.HUMANTHYR0 1.25 

430681 AWg89675 Hs.291232 ESTs 1.25 

434680 U02388 Hs.101 cytDdirameP450.suUamiiylVF.pQlypept 1.25 
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436217 T53925 Hs.107 Gbdnogeivtke 1 1.25 

440089 AA864468 HS.13S646 ESTs 1-2S 

446787 U57167 Hs^lS mucin Z intestlnaVtrachsal 1.» 

448207 AI475490 HS.1705J7 ESTa 1-25 

5 454869 AWB36004 ob:PM01T0019-17020{M)01-d11 LT0019Homo 1.25 

413271 AA127873 Hs.114949 ESTs U4 

422619 AA313322 gb:EST185218Ck>)mcaicinoma(HCC)ceO U4 

422796 AW897265 obKM)-NN0O57.1SO40fr-33&«04NN00S7Hoino 1.24 

427530 AA405093 Hs.126519 ESTs 1.24 

10 437727 M765707 Hs.153039 ESTs 1.24 

426435 AI827946 Hs.189118 ESTs 1.23 

426429 X73114 Hs.169849 myosiiMiifNfing protein C, skw^ype 1.22 

407964 AW130334 Hs^81111 ESTs 1.21 

430828 Ar763257 Hs.66327 Homo sapiens cDNA:FlJ22431 lis. done H 1.21 

IS 432029 D31628 HS.2B99 4-hydrDxyphenylpyruvatedioxygenase 1.21 

457843 AW138211 Hs.128746 ESTs 1.21 

413242 BE074165 gb:PM3.BT0584-030300.002-«12 BT0564 Homo 1.20 

446057 AI420227 Hs.149358 ESTs 1.20 

447198 061523 Ks.283435 ESTs 1.20 

20 449513 A!6532a2 Hs.1950S9 EST 1.20 

41SS66 F12119 gb:HSC35H091 normalized InM brain cON 1.19 

423315 R54109 Hs.26096 ESTs 1.19 

455817 BE142384 gb£M2-HT0144-210999.01 1^104 HTD144 Homo 1.19 

459354 BE514778 gb:601317094Fl NIH_li46C_9 Homo salens cD 1.19 

25 408432 AW195262 gbaai67b05j(1 NQ.CGAP^CMLI Homo sapiens 1.18 

414275 AW970254 Hs,889 Charot-Leyden ciystal protein 1.18 

419251 NM.001486 Hs.89771 glucokinase(liexol(lnase 4} regulatory pr 1.18 

456702 A1584534 gb.-wa72f10a1 SoanBS.NFLT_6BC_S1 Homos 1.18 

458009 AI221409 Hs.144983 ESTs 1.18 

30 410193 AJ132592 Hs.59757 zinc finger protein 281 1.17 

417779 AA829526 Hs.124977 ESTs 1.17 

435101 AI743156 Hs.131064 ESTs 1.17 

445360 AI798776 H8.156029 ESTs 1.17 

414160 8E257021 gb:601117426F1 Nl H_MGC J 6 Homo sapiens c 1.15 

35 418078 AA521268 Hs.85508 ESTs 1.15 

425133 NM_002513 Hs.154729 3i)hosp»K)lnosIfide dependent protein Idn 1.15 

437935 AW93g591 Hs.5940 iiypotheKca) protein FU20063 1.15 

446377 AW014022 Hs.170953 ESTs 1.15 

420097 AA700127 Hs.190504 ESTs 1.13 

40 446591 H44186 Hs.15456 POZ dom^ containing 1 1.13 

451477 AI798425 Hs.42710 ESTs 113 

459197 BE244587 gb:TCBAP2E0851 Pediatric pro-B cell acut 1.13 

426934 AF039401 Hs.194659 ditortde channel, caldum activated, fam 1.12 

431191 AW972118 Hs.100002 HSPC162 protein 1.12 

45 424403 R)6183 Hs.1799 CD1D antigen, dpolypepOde 1.11 

433546 A1075877 Hs.125461 Homo sapiens cDNAFUl 1539 lis, done HE 1.11 

451179 W05469 Hs.31818 ESTs 1.11 

400302 N48056 Hs.1915 folate hydrdase (prostete^pedfic memb 1.10 

420774 AA280209 Hs.165270 ESTs 1.10 

50 428887 AA437009 H5.98984 ESTs 110 

430582 Ai215509 Hs.143984 ESTs 1.10 

453642 A1370936 H8.34074 dIpepOdyipeplIdase VI 1.10 

406690 IU129540 Hs.220529 carotnoembryonlc antlgen^related cell ad 1.09 

417998 AW967420 gb:EST379495 H/IAGE resequences. MAGJ Homo 1.09 

55 456387 VV28876 gb:52h7 Human retina cONArandofnly prime 1.09 

427865 000306 Hs.183864 elastaseSB 1.08 

447388 AW630534 Hs.76277 ESTs. Wealdy similar to TB2(H.6aptens) 1.08 

413841 M34276 Hs.75576 plasminogen 1.07 

429201 X03178 Hs,198246 group^pecific component (vitamin 0 bind 1.07 

60 433313 W20128 Hs.296039 ESTs 1.07 

439450 R51613 Hs.125304 ESTs 1.07 

458963 AI701393 Hs.278728 RadandGenHelated2(rathomdog} 1.07 

405161 1.06 

406741 AA058357 Hs.74466 cardnoembryonlc antigen^elated cell ad 1.06 

65 424294 BE299311 gb:601119256F1 NIHJ^GCJ7 Homosaptensc 1.06 

424544 Me8700 Hs.150403 dopa decarboxylase (aromaScL-amlnoeci 1.06 

444687 AW972109 Hs.135107 ESTs 1.06 

444754 TB3911 Hs.118B1 transmen(brane4superfamilymenlber4 1.06 

421243 AW873803 Hs.102B76 pancreatic lipase 1.05 

70 .444290 AA262496 Hs.29280 ESTs 1.05 

4079M AW134708 Hs.243569 ESTs 1.04 

439706 AW872S27 H&59761 ESTs 104 

402194 1 03 

427506 AK000134 Hs.179100 hypotlieScal protein FU201 27 1.03 

75 428819 AL135623 H8.193914 KIAA0575 gene product 1.03 

434590 T47232 gb:yb64ta08.sl Stratagene ovary (937217) 1.03 

416378 AW044467 Hs.73708 ESTs. Weddy similar to AS7291 cytokine 1.02 

431912 At660S52 Hs.1S4903 ESTs. similar to A561 54 Ablsubst 1.02 

443316 AI478483 Hs.18443 ESTs 1.02 

80 428585 AB007863 Ks.185140 KtAA0403 protein 1.01 

400440 X83957 Hs.83870 nebuDn 1-00 

404619 BES14S35 K5.77171 ntidchromosomemalntenancadefldent(& 1.00 

407168 R45175 gb:yg40fDl4l Soares Infant bidn 1^aBH 1.00 
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AmS2 AV^1117 Hs^3585 ESTs 1.00 

409187 AF154830 Hs^66 cafbdnioyH)hosphat8 synthetase 1, miloch 1.00 

403327 L41162 H&535&3 collagen, type IX alpha 3 1.00 

- 410234 NM_003837 Hs.61255 fhJctose>1.64)tsphxphat35e 2 1.00 

5 410319 R23413 H5.71935 putaQve zinc fmger protein bom EUROIM 1.00 

411000 N40449 Hs.201619 ESTs. W^aMysbidtar to 8EB4B (Helens 1.00 

412098 AI493054 HS.15BS68 ESTs 1.00 

412446 A1768015 Hs.92127 ESTs 1.00 

412637 AA115097 Hs.261313 ESTs 1.00 

10 413147 BE067271 gb:PM2-BT0349-161299^01-b05BT0349Homo 1.00 

413597 AW302885 HS.1171B3 ESTs 1.00 

414117 Wa85S9 Hs.1787 proteoflpld protein (Pelizaeus-Merzbache 1.00 

414523 AU076633 Hs.76353 seifne (or cysteine) proteinase inhibito 1.00 

417074 Z49878 Hs.81131 guanldinoae8latel*Nnethyltrans(erase 1.00 

IS 418390 AF133820 Hs.64665 Qlin tnununogUbuQn dom^ protein (myo 1.00 

419766 T72104 Hs.93194 apoI^ioproteinA^ 1.00 

420182 Z44245 Ks.22999 ESTs 1.00 

420923 AF097021 Hs.27332t diOerenBdiy expressed In hematopoteO 1.00 

421100 AW351839 Hs.124860 Homo 8apten8cDNA:FLJ21763 lis. done C 1.00 

20 421204 AW081587 Hs.165051 ESTs 1.00 

422169 AF252292 Hs.112933 Tax intefacOon protein 40 1.00 

422792 AI951548 Hs.135163 ESTs 1.00 

423371 AU076819 Hs.1650 solute carrier faniBy 26. meml)er 3 1.00 

424208 AW563123 H3.143113 pancreatic Rpase^ated protein 2 - 1.00 

25 424922 BE386547 Hs.217112 ESTs. Wealdyslmllaf to Similarity to Ye 1.00 

425206 NM.002153 IHs.1 55109 hydroxysteroid (17.beta) dehydrogenase 2 1.00 

425545 N98529 Hs.1 58295 Human mRNA for myosin l^ht chain 3 (MLC 1.00 

425983 AK000226 Hs.165619 mudn and cadherln^ke 1.00 

^ 426004 AW600300 H3.124123 ESTs. Weakly similar to syr«»IIh (Rnor 1.00 

30 427627 R87582 H5.179915 guanine nucleotide binding protein (G pr 1.00 

428848 Nl^_000230 Hs.1 94238 leptin (murine obesity horrdog) 1.00 

429027 AU)22314 H5.194750 Human DMA sequence from done 1170K4 on 1.00 

429231 AA813214 gb:aj32e09.s1 SoaresJestis_NHT Homo sap 1.00 

429441 AJ224172 Hs.204095 Bpophllin B (uterogtobln family member) 1.00 

35 429930 AI580809 Hs.99569 ESTs 1.00 

429970 AK000072 H5.2270S9 chloride channel, calcium acBvated, fam 1.00 

430418 R98852 Hs.36029 heart and neural crest derivatives expre ' 1.00 

431845 AA516469 Hs.270554 ESTs 1.00 

433084 M18079 Hs.262265 fotty acid binding protein 2. InlesOnfll 1.00 

40 433839 F3S430 Hs.146070 ESTs, Wtoklysimiter to ALU1J1UMAN ALUS 1.00 

4344S2 AA634333 Hs.1 16822 ESTs 1.00 

435499 R89344 Hs.14148 ESTs 1.00 

438433 AB01B274 Hs.6214 KiAA0731 proteh 1.00 

442403 AW207724 Hs.129516 ESTs 100 

45 442803 A1675298 Hs.199917 ESTs 100 

443266 A1277101 Hs.25890 ESTs. Weakly Mar to transdudn{H^ 1.00 

444656 AI277924 Hs.1451» ESTs 1.00 

445573 A1439646 Hs.157494 ESTs, Weakly dmilar to K1AA0676 protein 1.00 

446163 AA026880 H5,25252 Homo 5eplenscONAFU13603fe, clone PL 1.00 

50 447359 Nm.012093 Hs.18268 adenylate kinase 5 1.00 

447551 BE066634 Hs.929 myodn. heavy polypeptide 7, cardiac mus 1.00 

448657 BE147B57 Hs.293841 ESTs. Weakly similar to KIAA0672 protrin 1.00 

449238 AA42B229 Hs.85524 musda-spet^RlNG-finger protein homo 1.00 

450085 AW293791 H5.60162 Homo sapiens cONA:FU21528 lis, done C 1.00 

55 4S0390 N93227 Hs.98403 ESTs 1.00 

451681 Z28564 Hs.25SgS0 ESTs, Weakly simlter to AA64_HUMAN 64 KD 1.00 

452093 AA447453 Hs.27860 Homo sapfans mRNA; cONA DKF2p586M0723 (f 1.00 

452528 AA742457 Hs.291479 ESTs 1.00 

452624 AU076606 Hs.300S4 coagu{aSonfadorV(proa(xelerin.ld)i 1.00 

60 453754 AW972580 Hs.172753 ESTs 1.00 

453991 AW014915 Hs.273741 ESTs 1.00 

454517 AW803340 gb:tL2-UM00794)90300.060^UM0079Honio 1.00 

459367 BE148877 gb:CM4-KT0244-11119»)4(Mi12HT0244Homo 1.00 

408021 AW137133 Hs.245867 ESTs a99 

65 417435 NM.005181 Hs,82129 cartMMilc anhydrasellLmusdospedlto 0.99 

437206 AW975934 Hs.283382 ESTs, Weddy similar to Protein sequence 0.99 

422890 Z43784 Hs.76713 sdute carrier family 25 (mitodrandrid a98 

425878 AW964806 Hs.38085 ESTs. Weakly similar to put^glydne 0.98 

„^ 441888 AI733306 Hs.128071 hypothetical protein FU21 302 0.98 

70 423068 M25629 H5.123107 kallikrdn 1. renal/pancreasMdIvary 0.97 

453534 NM.014796 Hs.33187 K1AA0748 gene product 0.97 

457787 AA683268 gb:ae92b04.s1 SlratagenesddzobrdnSI 0.97 

421285 NM_000102 H3.1363 cytodtrome P450. subfamDy XVn (sterdd 0.96 

422069 AJ010083 Hs.111110 Gfinn^ (tetethodn) 0.96 

75 425260 L47726 Hs.1 870 phenylalanine hydroxytese 0.96 

418406 X73501 Hs.84905 cytoker&tin20 0.95 

425670 AW9a8538 H$.190146 ESTs 0.95 

416373 AAt95845 Hs.73880 ESTs. Weakly simRar to AF1984551epith a94 

452243 AL355715 Hs,28555 programmed eel death 9 a94 

80 411908 L27943 Hs.72924 cytidine deaminase a93 

415067 A1264969 Hs.929 myosin, heavy polypeptide 7. cardiac mu8 0.93 

437156 AI916600 Hs.121194 Homo sapiens cDNA:FU21569fis. done C 0.93 

450685 L15S33 H5.423 pancrealilis^ssodated piddn 0.32 
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0.78 


4U9U90 


AAi944i£ 


Ue cnccn 


safoomanc muscla prat^ 


n77 


432134 


A1816782 


HS.1225B3 


Homo sapiens cDNA: RJ21934 lis, done H 


0.76 


437066 


AA743S70 


H5.2OO935 


ESTs 


0.76 


427003 


U19487 


Hs.2090 


proslagtandin E feceptor 2 (subtype EP^ 


0.75 


423634 


AW959908 


. Hs.1690 


hepai1n4)lnding growth fador binding pr 


0.73 


413333 


M74028 


HS.7S297 


fibroblast growth factor 1 (acidic) 


0.71 


420567 


AKD00812 


H5.98S74 


similar ta praline-ffeh piotdn 48 


0.71 


447145 


AA761073 


Hs.192943 


ESTs 


0.71 


452103 


R42764 


H5J248 


mutS (E. coG) homolog 6 


0.71 


410929 


H47233 


Hs^0643 


ESTs 


0,70 


400301 


X03635 


Hs,1657 


estrogen receptor 1 


0.69 


415702 


F28877 




gb:HSP01B414 HM3 Homo sapiens cONA done 


0.67 


411396 


C04646 


Hs.85428 


ESTs 


0.65 


431706 


A1B16086 


HsJ»6341 


adenytyl cydase-assodalBd protein 2 


0.65 



TABLE 28B 



Pkey: Unique Eos probeset UenGlier number 
CAT number. Gene duster number 
Accession: Genbank accession nuntbers 



Pkey 


CAT number Accession 


407743 


1012151J 


AW8141 IB AW814257 AW072376 


408123 


1040435J 


AW163377AW160398 


408432 


1058667J 


AW195262 R27858AW811262 


406664 


1073340J 


RS6362 AW248096 R07152 R07285 


408922 


109017_1 


IW7388RB4328AA058916 


409368 


112377J 


AA071O59 AA08S201 AA0B5Q20 


409543 


1138723 J 


AW410200 AW40970S AW41 1433 BE296786 8E270309 


409689 


114833J 


AA07B492 AA078333 AA077450 AA077746 AAff76B96 


409778 


1154206J 


AW499705AWS02537AW503016 


409802 


1155179J 


AW500732AWS04061 


410285 


119128J 


AA083609AA083790AA112048 


410881 


1225882.1 


AW809157 AW812181 AW81217S AW812172 AW812161 AW81216S 


411187 


1235092.1 


AW821291 AW821264AW821287AW8212SOAW821285AW821280AW821259 


412067 


1275641.1 


N4S697 N45540AWB90595 


413147 


1350637J 


BE067271 BE087266 BE087286 BE067278 BE067299 BE087285 


413242 


13S5323J 


BE074165 8E075001 BE075009 


413811 


1391117J 


BE16882B BE168830 BE168823 BE168928 6E168820 BE168826 


414080 


1413697J 


BE246327 BE244704 


414095 


1416521J 


BE2935468E249848 


414160 


1422273J 


BE2S7021 6E258316BE257099 


414580 


1463848.1 


BE386918 BE408833 BE365437 


415011 


151328J 


AW963085 AA1S9005 AW963073 


415566 


1539861J 


F12119Z45475T64832 


415702 


1547874.1 


F28877 F35687 


417998 


171375.1 


AWS67420 AA210915 AA236991 AA210916 


418197 


172884.1 


AA214253 AA214259 Z28472 Z28881 Z17828 


418464 


1759038.-2 


R87S80 


418556 


1767868.-1 


T02850 
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422619 218670.1 AA313322BE408282AA4G5612BE073382 

422796 221500 1 AVV897265AW897274AL119504AWB97275AW897270AVITO7312AWB9731BAVW7317 

AW407975AW999277 

424294 237907 1 BE29931 1 AA338954 AA33846B AWg60907 

424334 238221.1 AA393460AA338940AW96S277AA419006 

424648 241947.1 AA344576AA732430AA344601 

425515 252721.1 W26609 W27360AA358818 

426328 264901 1 AW631296AA375484 

426507 26838^.1 AA380285 AW934727 AW934914 

426755 271382^1 BE253469BE176417BE176415AA384133 

426880 273277 2 AA453482 AfH)12388 

426998 274259_.1 BE274360 

429231 301463.1 AA813214AI936567AI743529M448279AA994476A1807452AI2181B0AA972858 

430728 322706.1 AW968522 AA4851 12 AA485162 AW968698 

432092 34124 J AF135026AA583908 

433114 35904J AA121579AB005217 

434590 38931J T47232AF1 47355 T47231 

437237 43506.-2 BE513073 

437728 441520.1 AA766719 AA767041 AW977440 

439894 478738.1 AAB53077 AA852175 

442197 535550 1 AW837912AW837934AA984475AW997490 

442614 547073.1 A1269030A12O4085AI004047 

444475 607874.1 C75571 AI15O469T1077B 

448427 762970 1 BE395260 AW291036 AI500420 

449329 80484 1 AW752783 H38266AA001166 

451588 87667.1 AW072057AI225096AA018702 

452456 918391 1 BE080763 T96699BE081135A1902630H49182AI904021 AI902697 

454517 1221063.1 AW803340AW803260AW803275AW803415AW603343AW803422 

454748 1233013.1 AW862014AVV858740AVV858735AWB18542AW856765AVV862027AW8S8775AW85B771 AW858763 

454869 1238137 1 AW836004AW836087AVV836163AW836162AVV836085AW836084AW636079AW836083AW 

AW836002 AW836078 AW836161 AW662135 AW836165 AWa36003 
454886 1238987.1 AW837063 AW935882 AW93Sg57 

455075 1252389.1 AW8548S0AW85484a 

455104 1253737 1 BE0646636E1S3698AVV856751 BE153e20BE064737BE153674BE064730BE065062BE153536AW8S6622BE155079BE0^ 
BEC64650BE064691 

455366 1284685 1 AW947563AW947543AW947553AW947549AW947717AW902859AW902927BE011032 

455433 1290311 1 AVI/939463 AW939484AW939480AW939459AVV939546AW93g593AW939548AVV939595AW939106 

455446 1291969ll AW947749AW947748AW947754AW946638AW946674 

455579 1333944 1 BE011320BE006381 BE006361 BE011160BE011328BE0113KBE011157BE006384BE006387BE006385BE011160BE011319BE011^ 

6E006370 BE006386 BE011 173 BE006389 6E006376 6E008375 BE006364 BE011321 BE0Q6379 
455652 1348736 1 BE064675BE064761 BE064809BEO&46736E064672BE084674 
455817 1371986.1 BE142384 BE142387 

455994 1398737.1 BE179190 BE179206BE179182BE179185BE179188BE179194 

456150 1574395J Z42308 H23514 

455379 1839113.2 W22206 W22498 W26922 

456387 1842730.1 W28B76W26158 

456702 219191.1 A1684534BE262411 AA3 14031 AW752724 

457416 334503 1 BE142052AW265586AA506741 

457787 407235.1 AA683268 BE002903 BE002672 

458764 73207.1 BE619386AA3006S7 

459197 924229.1 8E244587AVU938684AW176490A1940102AW844995AVV938670AI909850AI90988SAI940079M^ 



TABLE 2BC 

Ptey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. Tlie 7 digit numbers in (his column are Genbanit identifier (Gl) numbers. Dunham I. et aL' refers to the puUkafon entitled *The DNA sequence of 

human chromosome 2Z' Dunham I. et al., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which axons wterepredteled. 
NtjMisilion: Indfcales nucleotide posiQons (tf predicted eiccMis, 



Ptrey 


Ref 


Strand 


l^tjjositlon 


400489 


8954013 


Pius 


131475-131652 


400527 


9796866 


Plus 


160750-161007 


400545 


9800107 


kAinus 


124618-124881 


400624 


7228177 


Minus 


94097-94756 


400635 


8567750 


Minus 


10280O.102932.107482-107689 


400672 


8118724 


Minus 


148067-148503 


400675 


8116750 


Plus 


11223-11816 


400694 


8118802 


Phra 


94288-94442 


400738 


8118985 


Plus 


143447-143851 


400846 


8188605 


Phjs 


39310-39474 


401007 


8117333 


^s 


140821-141050 


401015 


8117441 


Rus 


72260-72369 


401039 


7232177 


Rus 


658&^ 


401122 


8570298 


Plus 


68256^8444 


401238 


9954455 


Minus 


49473^9644 


401442 


6346725 


Minus 


85574-85910 


401496 


7381769 


Minus 


82790-83002 


401590 


9966320 


Minus 


33547-33649 


401603 


7689963 


Minus 


116659-116780 


401624 


8575907 


Rus 


168318-168444.172964-173647 
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Dhte 
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OU042j4 
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9A7nn^ 9A700fi 


402153 


8247879 
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l2Z5oO-l£29a/ 


402158 


8516165 


Minus 


14o290-l4o429,lW3W>»W00f r 


402165 


fiCCQQ<3n 

O5o9o90 


Minus 


cc/icAJica7Q 
00w4^>d9/g 


402194 


8576113 


Plus 


7ntM7 71 


402195 


70897/0 


Minus 


14/yUV1400W 


402201 


8576119 


Rus 


655-951 


402316 


7527774 


Minus 


4n7Ct 1DQ17 10/169 991^1 99Q9ft 

1075 i*1U9l 9,1 bol /-19V«lc,2ciJ^^ 


402362 


9D9Do14 


Minus 


icn^i i>;i;c9R 
IdMOi-IMdAO 


402366 


9799769 


Phis 




402394 


99296% 


Plus 


3jo0&-m4o2 


402409 


9796255 


Minus 


OD/l-IOOOl 


Jinn An 
402423 


mom A A 
379W44 


kSnus 


R9^7 R9fiR^ 


402448 


9/90D4U 


rlUS 


1 1 <9*tt* 1 1 0UD9| 1 l*MUi^ I 1 


402457 


979o7oZ 


A/Snus 


ICCII-iAirT? 4RmiL1R(l9R 
lo31«>-100r /glDOdtr 1D«D 


402S34 


OfiMOAl 

UoOlUDi 


rlUS 


00909' 39*iu 1 


40^8 


9801137 


Minus 


90 J 1 '^9Q009 


402561 


9o040rD 


riUS 


70Qfi7 711 ft^ 
rZ9uf ->rO IDo 


4uZ5oB 




Minus 


00U4 / -00 1 00|09U0^09 1 90 






rlUS 


lOOOt^ 10990 


402714 




Minus 


innil IflRRA 1Qin<L1Q.^7fl 1Qit9'w1Q7fi4 
lOOl I-IOOOO, l91U0-l90&0t I90<«r I9(w4 


402762 


9230904 


Minus 






£930D0U 


Mnus 


1Rli1ft.1ftR.liA 
lOOIv'IOOOO 


402911 


7263904 


Plus 


4 >19Ca0.4 ^9070 
l<tZO09- 1 l^g/g 


402951 


94U0f If 


Plus 


T?9W 7^^Q7171A.7W777fi917.7(i9QQ7S1(l<L7R401 


402968 




Minus 


00000-009c'» 


403047 


9540193 


Minus 




403115 


74<94iinil 


Minus 


1A9(K9.14inQ4 14S474.14<iR'i3 14ft9fi<t14ft44iLl'i9B1&1S9Q9a 


403125 


OlOAAie 


Minus 


4Q7liAR_1Q7719 


403183 


0000079 


rlUS 


inQQ^iinn74 

IU990U-I \M\ifH 


403186 


OQOO0B7 


Minus 


1 Ual^l If 000 


403211 


TC^ndAA 

7K}0o41 


Minus 


1RQ911 lUViQ 


403247 


7656833 


Minus 


7fiR9ft.7714A 


403251 


7677983 


Plus 


4/vtQQi inncm 

1UU09i*1UU00£ 


403327 


04400^9 


Minus 


47A'311 A7AAK\ 17A'iR7 1747n.(i 17iK9^17SfiQ9 
1 fWll-1 /'IADl,l/900r-l/*»rU»^lfOO<*-l»039* 


403376 


9369545 


^nus 


4/tQcofl i/vinin 


403479 


7329292 


Minus 


14830^ 1 4OdJ0, 1 0UOf fhl9UoD9 


403526 


8017144 


Rus 


3000 r -00*100 


403540 


8077057 


Minus 


OD0liK)O40U 


403605 


6862654 


Plus 


04C4 A 0171Q 

yiDi4-yi/io 


403812 


8469060 


Minus 


94r«co-5#4ttoy 


403665 


7249278 


Rus 


O902r-C90/0 


403716 


7239669 


Plus 


Dconn on A fi 
00o99'87122 


403731 


7543752 


Minus 


144UU0-14401O 


403740 


7O30892 


PUS 


O00U4-0//2/ 


403921 


7711590 


Minus 


Q907 

oZsr-OOJO 


403942 


7711825 


Minus 


OOAn(LOa7C7 
990Ut>-99fOf 


403997 


77OOOI9 


Plus 


1044^/-I0405W 


404139 


9o9B119 


Ohio 

nus 


fOfU/'fD09l 


404247 


7406725 


Minus 


M(MO-AA94A RA719JUA1I; ftAAOQJURT? ftAflTRJtCIIA 
O0W?r0'J*I*KO4OI*-OWlO,0W95HWOf /,O'KI/0<O0 1 IS 


404262 


22/ Mil 


Dhte 

rRIS 


fi4Km.R99nQ 


404348 


7BOAflCB 
7lMU050 


Minus 


9OO0e.9QflA9 
Z0090*£9vO^ 


404668 


0707 OAjI 

9797204 


Minus 


44^9 44UR 49RIU-4974ft 


40^2 


9797231 


Minus 


iinaT7^4lCA 
4U9rr-411W 


404795 


4826439 


Plus 


4Jl7eA4 4ilTVM 

i47o01-l4f fOU 


404825 


6478944 


Rus 


21UooZ*Z1U494 


404938 


7381808 


Minus 


4CCOOO 4CCQCA 

165o3o-1 65950 


405075 


7770506 


Minus 


44JieeA 19C094 

124ooO-i 25321 


405147 


9438278 


Minus 


15899&>i 59557 


Ana ft4 




Plus 


1 c7Rn7.i ^mPR 


405163 


9986267 


Minus 


16117M61299 


405187 


7229825 


Plus 


117025-117170.118567-118736 


405217 


7239551 


Rus 


32646-33138 


405232 


7249042 


Plus 


125904-126063 


405243 


7249201 


Minus 


22312-23197 


405327 


6094661 


Minus 


120550.120750 


405378 


6491714 


Rus 


91139-91440 


405420 


7211837 


Minus 


13428-13582 


405703 


4240388 


Minus 


15850*16061 


405737 


9943984 


Minus 


104275-104508,104755-104877 


405835 


7677903 


Minus 


66990^484 


4060S9 


9103984 


Mnus 


13858-14004 
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406101 


9124019 


Rus 


125325.125831 


406118 


9143818 


Plus 


53997-54629 


406150 


S8B6026 


Minus 


59331-59701 


406158 


7144874 


Plus 


62393^16.65012-65578 


406343 


9255974 


Plus 


17284-17440.18489-l8546.18917-ig004.19384-19S38 


406357 


9256093 


Minus 


77181-77415 


406563 


7711604 


Plus 


34401-34538 



TABLE 29A: 22B6 GENES UP-REGUIATED IN IDIOPATHIC PULMONARY FIBROSIS (IPF) COMPARED TO NORMAL BODY 

Tabid 29A lists about 2286 genes that are up leguiated In idiopalMc pulmonary fibrosis samples as compared vriUi nomH "body map* samples. Tliese were selected from about 
59680 prebesets on an A^titc/Eos Hu03 GeneChlp array such that tl» rdBo of 'average' idiopathic pulmonary Gbresls expression tevd to 'averagO* normal aduit tissues sample 
expression was greater than or equal to about 2.0. The 'averege* librosb sample expression level was set to the 90** percentiia amongst Idlopalhic pulmonary fibrosis samples. The 
'average' rwmiai adult tissue level was set to the 95^ percentile amongst non^aDgnant (issues. In order to remove gene^pedlic bacltground levels of non-specHIc hybridizaiion. the 
1 percentile value amongst non-malignanl tissues was subtracted bom both the numerator and the denominator before the ratio was evaluated. 

Pkey. Uhique Eos probeset identifier mnrtber 

ExAocn: Exemplar Accession number, GenbaiA accesston number 

UnigenelO: Unigene number , 

Dnlgene Title: Unigene gene title 

R1: Ralio of IPF to normal body tissue 



Pitey 


ExAccn 


Unigene ID 


Unigene TtUe 


R1 


427383 


NM.005411 


Hs.177582 


surfactant, pulmonary-associated protein 


211.8 


442275 


AW449457 


Hs.54795 


ESTs 


189J 


431433 


X65018 


Hs.253495 


surfactant, putmonary-assodated protein 


134.1 


441835 


AB036432 


Hs.184 


advanced glycosylation end product-sped 


130.4 


417204 


N81037 


Hs.1074 


surfactant, pulmonary-assodated protein 


116.8 


421798 


N74e80 


Hs.264330 


N-acytsphingosine amidohydrolase (acid c 


92.1 


406964 


M21305 




gbiHuman a^ha sateDib and saielfile 3 


80.7 


443709 


AI082892 


Hs.134662 


ESTs 


67.1 


431164 


AA493650 


H8.94367 


Homo sapiens cDNA: FU23494 fis. done L 


61.4 


445537 


AJ245671 


Hs.12844 


EGF-BIcMiomain, multiple 6 


57.4 


418007 


M13509 


Hs.63169 


matrix melaHoprotelnase 1 Gnterstltial 


54.6 


457200 


U33749 


Hs.197764 


thyroid transcription factor 1 


44.9 


432519 


AI221311 


Hs.130704 


ESTs, Weakly sImSar to BCHUIA S-100 pro 


42.7 


443324 


R44013 


HS.16422S 


ESTs 


39.6 


414142 


AW368397 


Hs.150042 


Homo sapiens cDNA aJ14438 fis, done HE 


27.3 


442006 


AW975183 


Hs.292663 


ESTs, Weakly similar to S72482 hypotheti 


27.1 


444527 


NKL005408 


Hs.11383 


small indudbte cytokine subfamily A (Cy 


27.1 


453310 


X70697 


Hs.553 


sohito carrier famPy 6 (neurotransmitiB 


26.9 


424084 


A1940875 


Hs.20914 


hypoOteOcal protein FU230S6 


22.2 


421659 


NMJ14459 


Hs.106511 


protocadherin 17 


21.0 


450478 


AW451709 


Hs.271200 


ESTs 


20.2 


444342 


NM-014398 


Hs.10887 


simtlar to lysosome^sodated membrane 


19.7 


447033 


AI357412 


Hs.157601 


ESTs 


19.4 


445885 


AI734009 


Hs.127699 


KIAA1603 protein 


m 


411880 


AWB72477 




gb*m30f03jt1 Na,CGAP_Tby4 Homo sapiens 


175 


432437 


W07088 


Ks.293885 


ESTs 


17.8 


424105 


AI142336 


Hs.43977 


Human DNA sequence fipom done RPl 1-196N1 


17.3 


431941 


AK000106 


Hs.272227 


Homo sapiens cDNA FLJ20099 lis, done CO 


17.2 


440807 


AW269421 


Hs.128093 


ESTs 


16.7 


424917 


AI636208 


Hs.96901 


hypothefical protein FU23049 


16.4 


433365 


AF026944 


H3.293797 


ESTs 


16.4 


445279 


R41900 


HS.2224S 


ESTs 


16.4 


417801 


AA417383 


Hs.82582 


integrin, beto^ike 1 (with EGF-tike rep 


16.4 


405654 








16.1 


449328 


At962493 


Hs.197647 


ESTs 


18.1 


449494 


AW237014 


H5.315369 


Homo saf^ens cDNA: FLI23G7$ fis. done L 


15.7 


408826 


AF216077 


Hs.48376 


Homo sapiens done HB-2 mRNA sequence 


15.5 


417728 


AW138437 


•Hs.24790 


K]AA1573protdn 


15.0 


440452 


Ai925136 


H3.55150 


ESTs, Weakly simBar to CAYPJ1UMAN CALCY 


14.8 


452039 


AI922988 


Hs.172510 


ESTs 


14.4 


408771 


AW732573 


H8.47S64 


potasshim vdtage^ated channel delayed 


14.3 


421464 


AA291553 


Hs.igOD88 


ESTs 


14.1 


421554 


AW137676 


Hs.97775 


ESTs 


13.8 


431889 


AA521277 


Hs.124946 


ESTs, We^ similar to A46010 X-Onked 


13.2 


434424 


AI811202 


Hs.325335 


Homo sapiens cDNA: FU23523 fis, done L 


13.2 


431924 


AKD008SO 


H5.272203 


Homo sapiens cDNA FU20843 fis, done AO 


^9 


459702 








1Z7 


421110 


AJ250717 


Hs.1355 


cathepsinE 


1Z6 


407638 


AJ404672 


Hs.334483 


hypothetk:al protein FU23571 


12.6 


423575 


C18853 


Hs.163443 


Homo sapiens cONA FU1 1576 fis, done HE 


1ZS 


423244 


AU}39379 


Hs.209602 


ESTs, Weddy similar to ubk)uitous TPR m 


12.2 


427585 


031152 


Hs.17g729 


cottagen, type X, alpha 1 (Schnddmetaph 


12.1 


438982 


AB0ie305 


Hs.5378 


spordin 1, (f^spondin) extraceOular mat 


1Z1 


451561 


N52812 


H3.177403 


ESTs 


12.0 


424086 


AI3S1010 


Ks.102267 


lysyi oxidase 


12.0 


435299 


AI74S458 


Hs.122614 


ESTs. Weakly similar to T20593 hypotheO 


12.0 


429496 


AA453800 


Hs.192793 


ESTs 


11.9 
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432365 AK001106 Hs.274419 hypotfteUca]pratsInRJ10244 11.9 

403637 11^ 

436061 AI248584 H5.190745 Homo 8apiansGONA:aJ21326 lis, done C 11.2 

4313B5 BE178538 Hs.11(^ membrane^psrwing 44lQmains. subf^ 10.9 

5 421470 R27496 Hs.i37B snne3dnA3 10.8 

440209 HD5049 Hs.22269 neurexinS 10.8 

428927 AA441837 Hs.90250 ESTs 10.7 

448693 AW004854 Hs.228320 hypotheficd protein FU23537 10.S 

424717 H03754 Hs.152213 wingless^ype MMTV Integralion site faml 10.4 

10 416402 NM.000715 Hs.1012 comptemenl component ^binding protein. 10.4 

446998 N99013 Hs.16762 Homo sapiens mRNA; cDNA DKFZpS64B2062 (f 10.4 

442176 AA983764 Hs.128910 ESTs 10.4 

439606 W79123 Hs.58561 Gprote!a€Oup)edfeceplDr87 10.3 

452883 X80031 Hs.530 coliagen, type IV. alpha 3 (Goodpasture 10^ 

IS 417015 M83772 Hs.80876 flavin containing monooxyganase 3 10.3 

422022 AA302420 Hs.200442 ESTs 10.3 

447724 AW298375 Hs.24477 ESTs 10.2 

406671 AA129547 Hs285754 metpratomganeOwpatocytegravvthfe 10.0 

458194 AW383618 Hs.265459 ESTs, Moderately 8im9ar to ALU^JftJMAN A 9.9 

20 446232 AI281846 Hs.194691 refined add induced 3 9.9 

416208 AW2gi168 Ks.41295 ESTs, Weakly simitar to K/tUC2J^UMANMUQN 9.9 

453382 AA709285 Ks.5997 hypothetica! protein FU1 3078 9.8 

412372 R65g98 Hs.28S243 hypoUwflcal protein FU22029 9.8 

426830 AA3857S1 Hs.196379 ESTs,Weak^sin«ar to putative p1S0{H 9.6 

25 407568 AA740964 Hs.62699 ESTs 9.8 

414259 W44633 Hs.301296 Homo sapiens cDNA: FU231 31 fis. done L 9.6 

400289 X07820 Hs.2258 matrix metaHoproteinase 10 (stromelysin 9.5 

441484 AA9354B1 Hs.58972 ESTa 9.5 

422426 W79117 Hs.58559 ESTs 9.4 

30 406747 A1925153 HsJ217493 annexlnA2 9.4 

450050 Ai681268 Hs.257883 ESTs 9.4 

431337 N48107 Hs.292593 ESTs 9.3 

408427 AW194270 Hs.177236 ESTs 9.3 

447048 AW393080 Hs.228320 hypothetical pralein FU23537 9.3 

35 453636 R67837 Hs.169872 ESTs 9.3 

443450 N66045 Hs.133529 ESTs 9.2 

418735 N48769 Hs.44609 ESTs 9.2 

421160 AL080215 Hs.102301 Homo sapiens mRNA; cONADKFZp586J0323{f 9.1 

449802 AW901804 Hs.23984 hypothetical protein FU20147 9.1 

40 441233 AA972965 H5.13S568 ESTs 9.1 

459587 AA031956 gb:zk15e04.s1 SoaresjvegnanLutonisJIbH 9.0 

436246 AW450963 Hs.119991 ESTs 9.0 

445189 AI936450 Hs.147482 ESTs 6.9 

410781 AI375672 HS.16502B ESTs 8.9 

45 446868 AV6G0737 Hs.135100 ESTs aS 

415817 U88967 Hs.78867 prOtein tyfOSlM phosphatase, receptor4 8.8 

425664 AJ006276 Hs.15g003 transient leceplor potential channel 6 B.8 

414968 CI 6096 Hs.22826 tiopomodulin 3 (ubiquitous) 8.8 

410334 AW979261 Hs.291993 ESTs 8,8 

50 442S10 AF150179 Hs.249890 ESTs 8.6 

409238 AU)49990 Ks.51515 Homo sapiens mRNA; cONA()KFZpS64G11 2 (fr 8.7 

431089 BE041395 Hs.283676 ESTs. WeaMy similar to unknown pnsteln 8.7 

444929 AI685841 Ks.161354 ESTs 8.7 

413802 AW984490 Hs.32241 ESTs. Weakly simflar to S65657alpha.10 a6 

55 444218 AF07O641 Ks.10684 Homo saiHens done 24421 mRNA sequence &6 

412719 AW016610 Hs,129911 ESTs 8.6 

453445 AL036532 Hs.91453 ESTs 8.5 

419261 X07876 Hs.89791 wlngless^type MMTV integration site fami 8.5 

451110 Aig55040 ite.265398 ESTs. We^ simflar to Iransfonnafion^ aS 

60 433815 AI696602 Hs.112757 ESTs 13 

432203 AA305746 Hs.49 macrophage scavenger receptor 1 aS 

451103 R52804 Hs.25956 OKFZPS64D206pfOteln a3 

425921 NM_007231 Hs.162211 sohjtocanierfaniily6(neurotransniitto a3 

424989 AA985520 H3.23575 ESTs t2 

65 433231 AB040926 Hs.143552 KIAA1493 protein Z2 

408217 A1433201 Hs.279860 tumor protein. transl8Gona])y<contn)nQ ai 

431806 AF18&114 Hsj270737 tumor necrosis factor (Bgand)8uperfami ai 

436751 AA732217 Hs.294054 ESTs 8.0 

452891 N75582 Hs.212e75 ESTs. WteaWy similar to DYH9J<UMAN QUA ao 

70 413048 NA93221 Hs.75182 mannose receptor, C type 1 aO 

426803 AA362568 Hs.179747 ecotropic viral integration site 5 7.9 

409718 D86640 Hs.56045 src homdogy three (SH3) and cysteine ri 7.8 

423424 AF150241 Hs.128433 prostaglandin 02 synthase. hematopdeOc 7.8 

429597 NM.003816 Hs.2442 adisintegflnendmelalk)proteinasedoma 7.7 

75 421013 M62397 Ks.1345 mutated in cokxectal cancers 7.7 

437479 R61866 Hs.101277 ESTs 7.6 

416778 M16S05 Hs.79876 sleroM6uIf8tase(niiB(QSoni8l).anf(5ulf 7.6 

421478 AI683243 H5.97258 ESTs. Moderately similar to 329539 ribos 7.6 

444396 T65213 Hs.4257 ESTs 7.6 

80 423629 AW021173 tte.18612 Homo sapiens cDNA:FU21909fis. done H 7.6 

450715 AI266484 Ks.31570 ESTs. W^ similar to KIAA1 324 protdn 7.8 

445495 BE622641 HsJ8489 ESTs. WeaUy similar to 138022 hypotheH 7.6 

446466 K38026 Hs.308 arrssUn3.r8tlnd(X.anesQn) 7.6 
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449108 A1140683 Hs.98328 IvpotheOcd prel^ MGC13040 7.5 

422798 R92347 Hs.34574 EST5.WeaMyslmBartoALU1J^MANALUS 7.5 

415030 H15261 HS.2194B ESTs 7.5 

426488 fiE178285 H3.1700S6 Homo sapiens mRNA:cDNADKFZp586B0220(r 7.4 

5 424906 A)5660B6 Hs.153716 Homo sapiens mRNA for Hmob33 protein. 7 7.4 

448206 BE622S85 Hs^731 ESTs, Moderaietyeifflaar to 138022 hypol 7.3 

432133 AB033088 Hs.272567 KIAA1 262 protein 7.3 

447112 H17800 H5.7154 ESTs 7.3 

^ 446917 A1347883 Hs.1 56672 ESTs 7.3 

10 428227 AA321649 Hs.2248 smaO indudbis cytokine subfamily B (Cy 7.3 

431761 AW002846 Hs.105468 hypolhetica) protein FU226S0 7.3 

428743 A1080080 H5.301549 Homo sapiens mRNA:cONADKFZp564H1 72 (fr 7.2 

413499 BE144884 gb:CM0^01 82-04 1 0994)6S« 1 1 HT0182 Homo 7.2 

423909 AJ223163 Hs.135194 immunogtobuGn superfamSy, member 6 7.2 

15 438122 AI62O270 Hs.129837 ESTs, WeaWysirrtlar to Z263J1UMAN ZINC 7.2 

449611 AI970394 Hs.197075 ESTs 7.2 

453816 Nl^003462 Hs.33846 dyneln. axonemal. Ogtit intermediate pot 7.2 

410060 NM.001448 HS.5B367 g!ypican4 7.2 

442353 BE379594 Hs.49136 ESTs. Moderately simitar to ALU7_HUMAN A 7.2 

20 452571 W3151B Hs.34665 ESTs 7.2 

453736 AL118674 Hs.34871 zinc finger Iwnfiobox IB 7.2 

409203 AA780473 Hs.687 cytochrome P450. subfamDy IVB, poiypept 7.2 

405494 7.2 

442832 AW2065eO Hs.253569 ESTs 7.1 

25 420193 AM60080 Hs.202869 ESTs 7.1 

434217 AW014795 Hs.23349 ESTs 7.0 

427356 AW023482 Hs.97849 ESTs 7.0 

436396 A1683487 HS.1S2213 winoie8s4ypeMlkXTV integration site fam) 6.9 

408308 AL033377 Hs.44197 hypothetical protein DKFZp564D0462 6.9 

30 442377 AA993807 Hs.167367 ESTs 6.9 

441143 A10Z7604 Ks.l59550 ESTs 6.9 

445122 AW241632 Hs.147377 hypothetical protein [^123598 6.9 

431353 AA828032 H5.189076 ESTs 6.9 

407510 U96191 gb:Hiimantrophoblas1hypo)da^utetedf 6.8 

35 426753 T89832 Hs.170278 ESTs 6.8 

445166 AW614544 Hs.123841 protein tyrosine phosphatese, receptor t 6.8 

451963 AI825440 Hs.224952 ESTs 6.6 

400298 AA032279 Hs.61635 slxtrsnsmentbraneepithenaiantisenor 6.8 

433425 H69125 Hs.133525 ESTs 6.8 

40 434377 AW137148 Hs.306593 Homo sapfen8cDKARJ113821i8. done HE 6.8 

415236 R41400 gb:yf94b1 2.51 Soaies intent brain 1NIBH 6.8 

409031 AA376838 Hs.76728 ESTs 6.7 

427558 049493 Hs.2171 growth dlfferentiailon tectar 10 ( 6.7 

437259 A1377755 Hs.120695 ESTs 6.7 

45 421952 AA300g00 Hs.98649 ESTs, Moderately similar to AF 161 5111 H 6.7 

447081 Y13896 Hs.l7287 potesslum Inwardly-rectifying channel, 8 6,7 

430099 AW194988 Hs.20537 hypotheiice* protein RJ 13942 6.7 

422475 At^9938 Hs.1 17313 Meis (mouse) homotog 3 6.7 

413472 BE242870 Hs.75379 solute carrier famfly 1 (gnal high efii 6,7 

50 424750 029956 Hs.152818 ubiquitin s|>ecific protease 8 6.6 

403574 6.6 

439759 A1359055 H3.67709 Homo saptensmRNA full length insert cON 6.6 

415025 AW207091 Hs.72307 ESTs 6.5 

448104 AI674818 Hs.316433 Homo saptenscONAFU11375t]5, clone HE 6.5 

55 444271 AW45a69 H5.149804 ESTs 6.5 

437157 BE048860 Hs.120655 ESTs 6.5 

444050 AW138295 Hs.135024 ESTs 6.5 

414569 AF109298 Hs.1 18258 prostate cancer assoctated proteb 1 6.5 

447505 AL049266 Hs.18724 Homo sapiens mRNA:cONADKFZp564F093 (fir 6.5 

60 424433 H04607 Hs.9218 ESTs 6.4 

407378 AA299264 H5.57776 ESTe, Moderately slmflar to 138022 hypot 6.4 

445424 A602894S Ks.12696 cortactin SH3 domdn^ng protein 6.4 

422544 AB018259 Hs.118140 KIAA0716 gene product 6.4 

449765 N92293 Ks.208832 ESTs, Moderately similar to Aiii8_HUMAN A 6.3 

65 413930 M66153 Hs.75610 RAB11A. member RAS oncogene temHy 6.3 

417246 AI760098 Hs.21411 ESTs 6.3 

453652 AW009640 Hs.28368 ESTs, Moderately similar to S65657 alpha 6.3 

411514 AW850178 gb:llJ-CT0219-271099<»22+l12CT0219Homo 6.3 

„^ 438909 AF085839 gb:Homo sapiens tell length insert cDNA 6.3 

70 446002 AI346468 Hs.145789 ESTs 6.3 

419236 AA330447 Hs.135159 Homo sapiens cONAFUl 1481 fis. done HE 6.3 

419150 T29818 Hs.89640 TEK tyrosine kinase, endothelial (venous &3 

424202 B£3502g5 Hs.lS032 RAN binding protein 17 6.3 

431723 AW0583S0 Hs.16762 Homo sapiens mRNA; cONA OKFZp564B2062 (( 6.2 

75 415511 AI732617 Hs,182362 ESTs &2 

430510 AW162gi6 Hs.241576 hypothe&al protein PR02577 &2 

416879 H98899 Hs.42599 ESTs 6.2 

432803 AA5653g8 gb:nk4lf01.s1 Na.CGAP.GC2 Homo sapiens 6.2 

_ _ 442852 BE080429 Hs.15738 ESTs 6.2 

80 435974 U29690 H3.37744 Homo sapiens beta-1 adrener^ receptor 6.2 

441082 AW444804 Hs.202655 ESTs 6.2 

404599 6.1 

453931 AL121278 Hs.25144 ESTs 6.1 
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4202S2 AW270404 Hs.193161 ESTs 6l1 

431622 AW979271 H&293184 ESTs 6.1 

456984 H59846 Hs.128355 ESTs.Modera!dy8liiiS8ftoALU7.HUMANA ai 

415457 AW081710 Hs.7369 ESTs. WeaWyshillaf to ALU1J1UMAN ALUS aO 

5 424693 BEieSSIO Hs.47557 ESTs 6.0 

419172 AW338525 Hs^120 ESTs 6.0 

413384 NM.000401 Hs.75334 exostoses (muffiple) 2 6.0 

453037 AA045175 Hs.177552 ESTs 6.0 

^ 444042 NIUL004915 Hs.10237 ATP-Wnding cassette, sub-farrtly G {WHIT 6.0 

10 431159 AW971240 gb:EST383329 MAGE resequences, MAGL Homo 6.0 

422352 AA766296 Hs.99200 ESTs 5.9 

433527 AW235613 Hs.133020 ESTs 5.9 

420077 AW512260 Hs.87767 ESTs 5.9 

429703 T93154 Hs.28705 ESTs 5.9 

15 433098 AW190593 Hs.151143 ESTs 5.9 

451099 R52795 Hs.25954 inteileukin 13feoeptDr,alpha2 5.9 

449416 AI651016 Hs.246311 ESTs 5.9 

459023 AW988226 Hs.60798 ESTs 5.9 

450584 AA040403 Hs.60371 ESTs 5.9 

20 427660 A(741320 Hs.114121 Homo sapiens cONA: FU23228fJS, Clone C 5.9 

429125 AA446854 Hs.271004 ESTs, Weakly sinvtar to 138022 hypotheQ 5.9 

450025 AK001675 Hs.24321 Homo sapiens cONAFU1202a lis, dona HE 5.9 

433479 AW5114S9 Hs.249972 ESTs 5.B 

443113 AI040686 Hs.132908 ESTs 5.8 

25 430414 AW365665 Hs,120388 ESTs 5.8 

419752 AA249573 H5.152618 ESTs, Moderately similar to ZNOIJttJMANZ 5.8 

435420 A1928513 H5.59203 ESTs 5.8 

404916 S.8 

424310 AA338648 H8.50334 testes devetopmenkelated NYO^P22 S.8 

30 448253 H25899 Hs.201591 ESTs 5.8 

430899 6E018217 Hs.183528 hypotheScdl protein FU 14906 5.8 

446987 . AI699629 Hs.156781 ESTs 5.8 

435082 AA664273 Hs.186104 Homo sapiens CDNAFU13803 (is, done TH 5.7 

438842 AAa27176 Hs.124316 ESTs 5.7 

35 437260 AA747807 Hs.UgsW ESTs 5.7 

410934 AW811114 gb:MR2-ST0131.111199O1&a04ST0131 Homo 5.7 

42B043 T92248 Hs.2240 iiterog!ot)ln 5.7 

408045 AW138959 Hs.245123 ESTs 5.7 

450568 AL050078 Hs.25159 Homo sapiens cDNAFU 10784 83, done MT 5.7 

40 428508 6E252383 Hs.164668 SBBI31 protein 5.7 

453393 AW956392 Hs.110376 ESTs 5.6 

444805 AB007699 Hs.12017 homolog of yeast utjiquilln-protelnnQas 5.6 

446921 AB012113 Hs.16530 small indudble cytoldne subfamily A (Cy 5.6 

429784 MS9796 Hs.30 membrane-spanning 4^omains, subfamily A 5.6 

45 433225 AW81651S Hs.173540 ATPaso. Class V, type 10D 5.6 

416575 W02414 Hs.38383 ESTs 5.5 

404043 5.5 ^ 

415094 D59513 Hs.33077B ESTs 5.5 

453049 BE537217 Hs.30343 ESTs 5.5 

50 430153 AW968128 Hs.336679 ESTs 5.5 

410811 AW805687 Hs.300648 ESTs 5.5 

443903 AI220547 Hs,135223 ESTs 5.5 

429420 AK001679 Hs.202289 hypothelical protein DKF2p434P1735 5.5 

444471 AB020684 Hs.11217 KIAA0877 protein 5.5 

55 452542 AW8122S8 gbilC0^174-1910994)31-a07ST0174Homo 5.5 

434088 AF116677 Hs.249270 hypothefcd protein PR01 966 5.5 

432113 AA935065 Hs,152385 ESTs 5.5 

446608 N75217 Hs.257846 ESTs 5.5 

419945 AW290975 Hs.118923 ESTs 5.5 

60 454024 AA993527 Hs.293907 hypolhe6calprDtelnFIJ23403 5.4 

420209 AA2S6444 Hs.126485 hypothetic^ protein HJ 12604; KIAA1692 5.4 

439382 BE247684 Hs.103070 ESTs 5.4 

428895 AA437124 Hs.187247 ESTs 5.4 

^_ 446577 AB040933 Hs.15420 KIAA1 500 protein 5.4 

65 419247 S65791 H3.69764 fiagite X mental rotardaOonl 5.4 

427778 AA412323 H3.105323 ESTs 5.4 

437138 AI935622 Hs.271245 ESTs 5.4 

431322 AW970622 gb:EST382704 MAGE resequences. MAGK Homo 5.4 

430437 AI768B01 Hs.169943 Homo sapiens cDNAFU 13569 lis, done PL 5.4 

70 435202 AI971313 Hs.170204 KIAA0551 protein 5.4 

415076 NM.000857 Hs.77890 guanyiatecydase 1, soluble, bete 3 5.3 

434992 AA658501 Hs.283358 ESTs 5.3 

454039 AWD79064 Hs.245540 ESTs 5.3 

456408 AI288348 Hs.23450 mttochondrial nTwsomal protein S25 5.3 

75 406554 5.3 

426269 H15302 Hs.1689S0 Homo sapiens mRNA; cDNA DKFZp566A1048 (f 5.3 

416769 AI339257 Hs.1 15436 ESTs, Moderately sfanilar to 154374 gene 5.3 

414299 AA142989 Hs.71730 ESTs 5.3 

«^ 420362 U79734 H8.97206 hunHngtin bleractlng protein 1 5.3 

80 459664 5.3 

425509 AF079363 Ks.158213 spem) assodated anfigen 6 5.3 

401497 5.3 

440727 A1073991 Hs.134268 ESTs,WtoaUysimlterto2109260ABoeO 5.2 
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426434 AW383590 
408776 AA0S7385 
419991 AJ000098 
451050 AW937420 
400297 A)127076 
404957 

452771 T05477 
438885 AI886558 
428244 A1564123 
420481 U50525 
455047 AW852530 
408729 AA195764 
457100 AM17878 
426342 AF093419 
417154 AIS74701 
411869 W20027 
427043 AA397679 
445635 A1769774 
442973 BES67665 
422063 BE156476 
446299 AA497044 
408877 AI279892 
404097 

437636 AA764781 
452822 X85689 
410733 D84284 
439140 WBS737 
407366 AF026942 
405547 

423377 AL049377 
449168 NM.016208 
455431 AW93B484 
452281 T935W 
411149 N68715 
432441 AW292425 
419807 R77402 
440615 AI733055 
450109 A1539295 
449695 AA164569 
421764 AI681535 
404593 

423607 AA328329 
432009 AL137424 
419235 AW470411 
436304 AA339622 
434613 Ai821626 
421502 AF111856 
415245 N59650 
428780 AI478578 
406333 

445034 AW293376 
440202 AW516211 
424638 AI472106 
451497 H83294 
427652 AI673025 
458722 AA741545 
407327 AA487182 
411018 AW813428 
415261 T40928 
453543 AA48542S 
438014 N71183 
407829 AA045084 
441006 AW605267 
412222 AAS28283 
424115 AA335497 
453197 Ai916269 
439398 AA284267 
436397 AA715013 
427535 R29543 
410901 AW810001 
425916 NH.008786 
447020 T27308 
427457 AW779105 
451620 AW449888 
408938 AA059013 
420036 R60336 
424508 A1080103 
430345 AKD00282 
427669 AW451832 
417181 L10123 
435347 AW014873 
425458 H89317 
432869 AW974094 



Hs.65551 Homosapten9,Slm!lartoDNAseomenlCli 

HS.633S6 E5T8.W^^n«ar Id 138022 hypolheQ 

H3.9421 0 eyes absent (OrosophQa) homotog 1 

Hs.69662 ESTs 

Hs^73 hypothetical protein DKFZp56401278 

H5.333265 ESTs 

Hs.184987 ESTs 

Hs.42500 AOP-ribosyta!ion factor-fika 5 

Hs.98201 Human 6RCA2 region, mRNA sequence CG029 
gb:PM1.CTa2434)710S94)01-g06 CT0243 Homo 

Hs.72639 ESTs 

Hs.46401 ESTs, Moderately sin«lar(DALU8.HUMANA 

Hs.169378 mul&piePOZdomafai protein 

H3.21388 ESTs 

Hs.23439 ESTs 

H5.3991 ESTs 

Hs.209831 ESTs. Weakly similar to ALUIJ^UMAN At^ S 

Hs.288550 Homo sapiens cDNA:aJ231 56 fis. done L 

gb^»VOW036S^0100^-c05 HT0368 Homo 

Hs.20887 hypothetical protein FU10392 

Ks.46801 sorting nexin 14 

HS.291B44 ESTs 

Hs.288617 hypothetical protein aj22621 
Hs.66052 C038 antigen {p45) 
Hs.290830 ESTs 

gtKHomo sapiens dg33 mRNA. partial sequ 

gb:Homo sapiens mRNA; cONA DKFZpSe6H0718 
Hs.23142 colon carcinoma related prot^ 

gb:CMO-DT0057-290200^25W06 DT0057 Homo 
Hs.28792 Homo sapiens cOKA RJ11041 lis, done PL 
Hs.269128 ESTs 
HS.1634B4 ESTs 

gb:yi75f1U1 Scares placenta Nb2HP Homo 
Hs.1 30806 ESTs 
Hs.1 15740 KIAA0210 gene product 
Hs.34550 ESTs 

Hs.1 48135 senne/threon!neUnase33 
Hs.6591 ESTs 

Hs.306458 Homo sapiens mRNA; cDNADKFZp761G21 23 (f 
Hs^88433 neurotrhnin 
Hs.108887 ESTs 

gbms92bl0,x5 NCLCGAP_Pr3 Homo sapiens 
Hs.1 05039 solute carrier family 34 (socfium phospha 
Hs.27252 ESTs 
HS.S0636 ESTs 

HS.1436S9 ESTs 

Hs.125300 rtng linger protein 21. hiterferon^espo 

Hs.49303 Homo sapiens cONA iaJ11663 fis, done HE 

Hs.284122 Wntinhibitofyfactor-I 

Hs.43874 ESTs. Moderately sbritar to 154374 gene 

Hs.282832 ESTs.Weal<ly8linRartoT24961 hypolheS 

Hs.269414 ESTs.Wea!(lysimIlartoZ195JHUMANZINC 

gb:MR3^T0192-4)1O20O-21O«05 ST0192 Homo 

Hs.6346 ESTs 

H$.48919 Homo sapiens cONARJIISOB lis. done HE 

Hs.121806 Homo sapiens cDNAaJ11971lis. done HE 

H5.29725 hypothetical pra(^aJ13197 

Hs.7627 CGWO protein 

Hs.292737 ESTs 

H&293965 ESTs. Weakly sin^ar to 138022 hypoOiefl 

Hs.109057 ESTs. Weakly similar to ALU5.HUMANAljUS 

Hs.221504 ESTs 

Hs.169835 ESTs 

Hs.2164 pro-platetet basb pratdn (indudes pla 

gb:MR4.ST0124-270300^11 ST0124 Homo 

Hs.162200 urotensin2 

Hs.16986 hypothetical protein FU11046 

Hs.164682 ESTs 

H3.257224 ESTs 

H$.22607 ESTs 

Hs.52792 Homo sapiens mRNA: cDNADKFZp586l1823(f 

Hs.149770 Homo sapiens CONAFU13658 fis. done PL 

H3.239681 hypothetical protein FU2027S 

Hs.255938 ESTs. Moderately similar to KiAA1200 pro 

H3.1071 surfactant prol^ A Unding protein 

Hs.1 16363 ESTs 

Hs 182869 ESTs 

gty£8T388197 MAGE resequences. MAGM Homo 
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43G594 Al4ig982 
421237 U2S029 
432731 R31178 
419750- AL079741 
428320 W47595 
41S751 AW19558t 
433515 AA535800 
451381 BE241831 
4527S4 B£463a57 
438297 AW515196 
406992 882472 
431291 N2S521 
435933 AA80S520 
447997 K00856 
445657 AW612141 
426985 BE394849 
447700 A)420183 
423735 AA330259 
424144 AA454033 
416256 N45661 
410011 AB020641 
454359 N71277 
422977 AA631498 
433485 A1493076 
450192 AA253143 
432015 AL157504 
407266 AJ235664 
409041 AB03302S 
434265 AA846811 
452526 W38537 
403271 

450656 AA010539 
446096 AI276454 
454036 AA374756 
437960 Ai66gS86 
440862 H39048 
410615 AW772721 
413563 AL120806 
419449 H1B417 
442324 R63578 
453080 AI423056 
435747 A1079519 
446509 AF169693 
448030 N30714 
414998 N^^00^543 
448089 AI467945 
434387 AB020700 
434757 AI038997 
413453 AA129640 
454438 AA224053 
458154 AW816379 
430417 AA461045 
434819 AA650099 
438796 W67821 
415451 H19415 
420931 AR)44197 
414812 X72755 
451895 T93573 
435434 AA680387 
449623 C00719 
433563 AI732837 
444649 AW207523 
441594 AL041080 
443314 AW771701 
400292 AA250737 
427972 AA884870 
446932 AA961459 
445640 AWg69626 
452393 H87398 
443204 AW20S878 
400608 

411155 AW819939 
435772 AA700019 
439830 AAB46666 
455511 BE144762 
443257 AI334040 
436033 H75391 
420214 AI146375 
410519 AW612264 
401189 

416852 BE537037 
425733 F13287 
447863 AL047611 



Ks.156189 

Hs.102761 

Ksu!87820 

Hs,183114 

Hs.16g300 

Hs^121 

Hs.ig0246 

Hs.172330 

Hs.151258 

Hs.258238 

HS.2S275 

Hs.192075 

Hs.29792 

Hs^9575 

H5.131905 

Hs.171077 

H3.41644 
HS.90011 
Hs.57858 



Hs.201967 
H5.24596 
Hs.159115 

Hs.50081 

Hs.130554 

Hs.280740 



ESTs 

Human glucocortinid reoeiJtor alp^ 
libfonecOn 1 

Homo sapiens cDNA FU14235 fls. done MT 
transfomAig growth factor, beta 2 
KIAA0781 protein 
ESTs 

hypothafcai protein MGC2705 

hyiwtheQcal protein FU21 082 

ESTs, Moderately siinQar to ALU1.HUMAN A 

gb:beta -ptiNONA polymerase beta {exon a 

KnjppeMype zinc finger protein 

ESTs 

ESTs. Weakly sbnHar to 138022 hypotheti 
Homo s^ilens G-protein coupled receptor 
ESTs. Moderately similar to Z1 95.HUMAN Z 
ESTs, Weakly similar loT21259 hypotheti 
gb:EST33963 Embryo. 12 week D Homo sapi 
AKAP-associated sperm protein 
adenylosucdnale synthase 
PRAIRE protein kinase 1 
gb:za36e03.s1 Scares fetal liver s^een 
gb:np83h04.s1 NQ.CGAP^Thyl Homo sapiens 
aldo-keto reductase famliy 1, member C2 
RAD51 -Interacting protein 
Homo sapiens mRNA: cONA DKFZp566O0724 (f 
gbiHomo sapiens mRNA for immunoglobulin 
KIAA1199 protein 

Homo sapiens cDNA: FU23089 fts. ctone L 
hypolhettoal prol^ MGC3040 



Hs.18912 ESTs 

gb:q|7la12j(1 Soares JlhHMPu^SI Homo sapi 

H5.93560 Homo sapiens mRNA for KIAA1771 pretein. 

Hs.222194 ESTs 

Hs.127432 ESTs 

gb:hl95c01ji1 Na.CGAP^Thy8 Homo sapiens 

HS.S888 ESTs 

Hs.57483 Homo sapiens cONA FU14294 fis. done PL 

Hs^8426 ESTs 

Hs.23921 hypothetical protein DKFZp547A023 

Hs.134398 ESTs 

Hs.132892 prolxadheT{n20 

Hs.325960 membrane-spanning 4^omains, subfamHy A 

Hs.77729 oxidised low density lipoprotein (lectin 

H5.173698 ESTs 

Hs.3830 K1AA0893 protein 

H5.132921 ESTs 

H5.128065 ESTs 

Ks.172405 cell division cyde 27 

Hs^35018 ESTs 

Hs.50701 ESTs 

Hs.291541 ESTs, We^y similar to ALUBJIUMAN IHl 

Hs.1095g0 genethonin 1 

Hs.268720 ESTs. Moderately simBar to ALU1_HUMAN A 

Hs.100431 small Induclbte cytokine Bsub(amily(Cy 

Hs.77367 monokine Induced by gamma interferon 
Hs.16970 ESTs 
Hs.187850 ESTs 
Hs.120440 EST 
Hs^77901 ESTs 
Hs.197628 ESTs 

Hs.208765 ESTs. Moderately simiiar to ALU7.HUMAN A 

Hs.54646 ESTs 

Hs.72472 ESTs 

Hs.181304 putative gene produd 

Hs.125644 ESTs 

H3.31704 ESTs. Weakly similar b KIAA0227 (H japi 

Hs^858 ribosomal protein I7a 

H8^9643 Homo sapiens cDNA aJ13103 fis. done NT 

Hs.273629 ESTs 

Hs.132992 ATP-blnding cassette, sub-famlly G (WHIT 

Hs.151489 ESTs. WeaWy simflar to XE7_HUMAN PROTEl 

gb:CM0-HT018&041099'O654i04 HT0180 Homo 

Hs.11614 HSPC065 protein 

Hs^55748 ESTs 

Hs.288073 ESTs. Moderately similar to AliJ5.HUMAN A 

Hs.131705 ESTs 

H&.273294 hypotheik»l protein FU20069 

Hs.15g388 Homo sapiens done 23S78 mRNA sequence 

H&288885 Homo sapiens cONAfU14246 lis. done OV 
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422429 AA310527 Qb:EST1B1333 Jurist T-oiQsVHonnBaple 

434677 AW444575 Hs.130834 ESTs 

403310 

451830 Hia433 H3J1542 KtAA1035 protein 

422222 AJ639372 Hs.193247 hypoths&al protein DKFZp434A171 

435527 W88774 Hs.t18370 ESTs 

436461 AW511956 Hs.293261 ESTs 

452166 A1948607 Hs.264680 ESTs 

413998 AW103807 H3.243933 ESTs 

416642 T96118 Hs.226313 ESTs 

452081 AW958859 Hs.7514 Homo sapiens cDNAaJ12141fe. clone MA 

452930 AW195285 H3.194097 ESTs. Weakly simDar to 138022 hypotheti 

407910 AA6S0274 Hs.41296 fibrenecGn leucine rich transmemtiranep 

428042 AA419529 Hs.76391 niyxovinis(inlluenza)resistanoe l.homo 

424641 A6001106 H5.151413 gtia maturation factor, lieta 

417412 X16896 H5.62112 intsrleutdn 1 leoeptor.typsl 

409629 AW449589 Hs.279724 ESTs 

458771 AW295151 Hs.163612 ESTs 

41S929 AA724373 Hs.49344 hypothetical protean FLJ11006 

436645 AW023424 Hs.156520 ESTs 

426083 AW952712 Hs.126712 ESTs, Weakly slmDar to AF1910201E2IG5 

445268 AI218358 Hs.175048 ESTs 

429629 6E501732 Hs.30622 Homo sapiens cDI^FU13010fi$, done NT 

431917 016181 Hs.2868 peripheral myelin protein 2 

443348 AW873596 Hs-1 82276 calmodulin 2 (phosphoiylase kinase, dell 

443151 AI827193 Hs.132714 ESTs 

419255 AA23S672 Hs.87491 ESTs 

452291 AF015592 Hs.28853 CDC7(ceOd(visibACyde7,S.ceievl5l 

452561 AI692181 Hs.49169 KIAA1634 protein 

421106 AA877124 H8.172844 ESTs 

424268 AA397653 Hs.307438 Human DNA sequence from done 495010 on 

425211 1^18667 Hs.1867 progastHcsin {pepsinogen Q 

421515 Y11339 Hs.105352 GalNAc alpha-2, &«lalyltransferase I, I 

423045 AW967472 Hs.1 83302 PCTAIRE protein Idnase 2 

428771 AB028992 Hs.193143 KIAAI 069 protein 

445745 AB007924 Hs.13245 KIAA04S5 gene product 

417009 AA191719 Hs^14714 ESTs 

436517 BE080932 Hs.135225 ESTs 

425905 AB032959 Hs.3185B4 novel C3HC4 type Ttnc finger (ring finge 

414083 AL121282 H5.257766 ESTs 

452728 AI91SG76 Hs.239708 ESTs 

409920 BE169746 Hs.12504 likely orlholog of mouse Aricadia 

441802 AA966636 Hs.127877 ESTs 

431956 AK002032 Hs.272245 Homo sapiens cDNAFUl 1170 fis, dona PL 
413875 BE176776 9t):RO-HT058^11030a011-g09 HT0S86 Homo 

444009 A1360792 Hs.135104 ESTs 

410785 AW803341 gb:IL2-UM0079^30a-05^O03 UM0079 Homo 

418882 NM_004998 Hs.89433 ATP-bmding cassettB, sut)4amily C (CFTR 
438993 AA828995 gb:od77l)06^1 NCI_C6AP_Ov2 Homo sapiens 

435256 AF193766 Hs.13872 cytokine«e protein C1 7 
428104 AA421350 Hs.191604 ESTs 
439648 AW7e0192 Hs.267596 ESTs 

436194 AK001074 Hs.333435 Homo sapiens CDNAFU10212 lis. done HE 
446364 AB006624 Hs.l4912 KIAA0286 protein 
452744 AI267652 Hs,30504 Homo sapiens mRNA:cONAOKFZp434E082{fr 
439294 AW975326 Hs.6523 dvomosomel open reading frame 12 
R38438 Hs.162575 solute carrier family 1 5 (H-»/^ptidetr8 



404561 
401576 
419296 
432055 
439107 
450320 
447350 
441974 
453142 



410292 
415811 
420218 
426625 
417708 
451024 
411745 
422058 
439479 



432222 
444188 
453096 
450297 
421002 
445414 



AA236115 

AW972359 

AU)46134 

AW291775 

A1375572 

AI683782 

AA033648 

AL137163 

AA843087 

AA450181 

AW95B037. 

T78300 

N74392 

AA442176 

AW867826 

AA862231 

AI734258 

AK00072S 

AI204995 

AI393165 

AW294631 

AW901347 

AF116030 

AV653692 



H8.120785 

Hs.293334 

Hs.13944 

Hs.213793 

H5.172634 

Hs.128245 

Hs.7473 

Hs,57549 

Hs.1 24194 

H3.172963 

Hs.286 

Hs.300642 

HS.S0495 



Hs.334443 
Hs.245367 
Hs.50579 

Hs.699 

Hs.11325 

HS.38S92 

Hs.100932 

Hs.146105 



ESTs 
ESTs 

adrenergic beta, receptor kinase 2 

ESTs 

ESTs 

ESTs ■ 

ESTs 

hypothefcdi protein di473B4 
ESTs 

hypolhetfcal protein FU14624 
rit)osoma) protein U 
serologically defined colon cancer antig 
ESTs 

gb3w63t)08j1 SoaresJotal_fetus_Nb2HF6_ 
gb:MRO^N0039-300300^01<€02 SN0039 Homo 
ESTs 

ESTs. Weakly similar to ALU1J4UMAN ALUS 
hypothetkal protein FU2071 8 
^:8n03c03jc1 Stratagens schizo brain SI 
pepiidytproly) Isomerase B (cydophllin 
ESTs 

hypothetical protein FU23342 
transcriptton fBCtor 17 
ESTs 
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447207 AA442233 K5.17731 hypothetical prolebl aJ12892 4.0 

451353 N21043 Hs.42932 ESTs 4.0 

437075 AA743748 Hs.40758 ESTs a9 

. 410505 AW752139 HsJ14323 ESTs 3.9 

5 449746 AI868594 H8.176588 EST8,V^teaWy5imParlDCP4Y.HUMANCYT0C 3.9 

426116 AA668729 Hs.144694 ESTs 3.9 

415716 N59294 Hs.179662 nitdeosoms assembly pioteinl-IIkel 3.9 

436298 AW293496 Hs.180138 ESTs 3.9 

417718 T86540 Hs.19i3981 ESTs 3.9 

10 436772 AW97S688 Hs.74170 metaDo{MonQn1E(tundlonai} 3.9 

401045 3.9 

408767 AA057279 HS.21192B ESTs 3.9 

407303 AA016296 Ks.165200 ESTs, Weakly simflar to A561 94 (hromlXMa a9 

432563 AW023624 Hs.162282 potassium channel TASK-4: potassium Chan 3.9 

15 451623 K77618 Hs.26^1 ESTs 3.9 

450063 A1681509 Hs.277133 ESTs 19 

416734 H81213 Hs.14625 ESTs. Weakly similar to KIAA1503 protein 3.9 

419276 BE165909 Hs.306881 MSTP043 protein 3.9 

433132 AB026264 Hs.284245 hypothetted protein IMPACT 3.9 

20 436149 AI754308 Hs.159452 ESTs 19 

42^ H25642 Hs.133471 ESTs 3.9 

443486 NIA.003428 Hs.9450 zinc finger protein 84 {HPF2) 3.9 

458219 H22195 Hs.31874 ESTs 3.9 

443613 AI079356 gb»z39b09.s1 Soares_NhHMPu_S1 Homo sap! 19 

25 439810 AL109710 Hs.85568 EST 19 

436578 AI091435 Hs.134859 ESTs 19 

415598 A1433165 Hs.9856 ESTs 19 

425087 R62424 Hs.126059 ESTs 19 

454111 AW081581 Hs.269064 ESTs. Weakly simliar to T42689hypolhetI 3.9 

30 409719 A1769160 Hs.108681 Homo sapiens brain tumor associated pro! 19 

452466 N84635 Hs.29864 hypoMcaiprDleinDKFZp564B052 19 

424962 NM.012288 Hs.153954 TRAMe protein 19 

435823 R07856 H5.16355 ESTs 19 

440633 A114068S Hs.263320 ESTs 19 

35 429334 D63078 Hs.1861B0 Homo sapiens cDNA: nJ23038 fis, done t 3.9 

444743 AA045648 Hs.301957 ntxfix (nucleoside diphosphate linked moi 19 

430039 BE253012 Hs.153400 ESTs. Weakly similar to ALU 1 .HUMAN ALU S 19 

417461 R38403 Hs.13305 ESTs 3.9 

424051 AL110203 H5.138411 Homo sapiens mRNA; cONA DKF2p588J1922 (f 3.8 

40 419140 AI982647 H5.215725 ESTs 18 

415652 T79213 Hs.272073 ESTs 18 

430140 AW296771 Hs.221999 ESTs 18 

446896 T15767 Hs.22452 Homo sapiens mRNA for KIAA1737 protein. 3.6 

422165 AL041199 Hs.1481 hisBdine decarboxylase IB 

45 417706 T90797 Hs.268623 ESTs IB 

424296 AI631874 H5.15S140 casein kinase 2, alpha 1 potypepUde 18 

450522 A!698839 flb.'wd31f02j(1 Scares JIFI^T^GBC.SI Homo s 3.8 

446619 AU076643 Hs313 secreted phosphoproteln 1 (osleoponlin. 18 

449729 R72032 Hs.29235 ESTs 18 

50 414700 H53202 Hs,38163 ESTs 18 

440899 AW449445 Hs.120O21 OKF2P434I092 protein 18 

439335 AA7426g7 Hs.62492 ESTs. Weakly sinular to B39086 proilne-r 18 

408625 AW243323 Hs.266785 ESTs 18 

421987 AI133161 H5.286131 OGI-101 protein 18 

55 418915 AI47477B Hs,11B977 ESTs 18 

410224 M55513 Hs.150208 potassium voltag&^ated channel, shaker- 18 

429846 AB023021 Hs.225945 fuco6yltran5lerase9(a!pha(1,3)fuco3y 18 

442849 R10099 Hs.2G9805 ESTs 18 

^- 427191 BE221825 Hs.97691 ESTs 18 

OO 407942 AA37B808 HS.S894 hypothelica) protein FU10305 18 

437030 AA742577 Hs.303781 EST 18 

427940 AA417812 Hs.38775 ESTs 17 

443054 AI745185 Hs.8939 yss-assodaled protein 65 kOa 17 

^_ 449679 AI823951 Hs.129700 tollofaMtol 17 

55 425937 NI^O13240 Hs.163846 pulath^e N&ONA^nelhyllransforBse 17 

456663 AV658444 Hs.280776 tankyrase. TRFI-interading ankyitn^ela 17 

456443 AW967500 H5,133543 ESTs . 17 

439957 A1453184 Hs.66357 ESTs 17 

446999 AA151520 Hs.334822 hypothetical protein MGC4485 17 

70 428414 AL049980 Hs.184216 DKFZP564C1S2 protein 17 

455170 AWB60972 gb:QVO-CT03B7-180300-167^07 CT0387 Homo 17 

418379 AA216940 Hs.137516 fidgeting 1 17 

419720 AA249131 Hs.337778 hypolhetfcal protein FU 11 068 17 

__ 443584 AIB07038 Hs.267245 hypolhetfcal protein FU 14803 17 

75 416185 AW975861 Hs.47367 KIAA 1785 protein 17 

417235 AA810278 Hs.24250 ESTs 17 

441720 AI346467 Hs^8739 ESTs 17 

451421 W16522 Hs.237689 Homo sapiens cONAFU13539 fis. done PL 17 

417355 D13168 Hs.82002 endotheBn receptor type B 17 

OU 449321 AA001150 Hs.132937 ESTs 17 

424806 AA382523 H3.10S689 MSTP031 protein 17 

452338 AW608920 Hs.29159 zinc finger protein 75 (D8C6) 17 

409248 AB033035 Hs.51965 KIAA1209 protein 17 
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421037 AI684808 

427088 AA398085 

420637 AW976i53 

420026 AI831190 

429419 AB023226 

447410 A1470235 
404274 



Hs.197653 EST8 
Hs.142390 ESTs 

gb:EST388262 MAGE resoiusnces. MAGN Hoinp 
Hs.166676 ESTs 
Hs^2276 KIAA1009 protein 
HS.1726S8 EST 



416320 H47867 
412642 BE24459B 
431716 D89053 
446025 AW305075 
450456 AA00992& 
423099 NM_002837 
438257 AW474419 
440887 AI799488 
454693 AW613428 
432189 AA527941 
408687 AL110280 
407726 AA435679 
436026 AI349764 
448776 BE302464 
452293 AIB71833 
428330 122524 
443268 AI800271 
429208 AA447980 
458429 AV646S59 
404476 
405848 

438209 AL120659 
403937 

437918 AI761449 
432408 N39127 
437641 AA811452 
439635 AA477288 
446102 AW168(167 
418384 AW149266 
425403 AIJ0237S3 
432030 Ai908400 
446453 AV658469 
452055 AI377431 
440801 AA906366 
432779 AWg79241 
440886 AW511032 
401049 

449424 AW448937 
418076 R6138B 
423035 AW449679 
435463 AA682507 
438016 AI949638 
455201 AW947884 
433293 AF007835 
456536 AW135986 
428579 AA431765 
414400 X06948 
43S344 AA700326 
445056 AB014530 
449444 AW818436 
442652 AI005163 
423121 AWB64848 
449540 AA001713 
425734 AF056209 
428409 AW117207 
431087 H12723 
426920 AA393351 
427687 AW003867 
437583 AA751190 
421599 AA2936S5 
433887 AA743991 
421863 AI952677 
430499 AWg69408 
451531 AA018311 
457620 AA502711 
410658 AW105231 
427865 AM16931 
453390 AAB62496 
419983 W55956 
454600 AW810001 
427718 AI798680 
416548 H62953 
420381 D50640 
410908 AA121686 
442080 AW444761 
406685 Mie728 



Hs.34024 ESTs 

Hs,809 hepatoc/tegrwlh factor (hepapoteUn A; 
Hs.268012 talty^U-Coenzyme A Hgase, tong-cha'n 
Hs.160948 iaAA0729 protein 

9b3i07e05.r1 Scares>taljver_spleen, 
Ks.1 23641 protein tyrosine phosphatase, receptor I 
Hs.224794 ESTs 
Hs.135905 ESTs 

0b:iyiR3^T01924)1020O-21O<H}5 ST0192 Homo 
gbmh30c04^1 NCL0GAP_Pr3 Homo sapiens 
Hs^1152 Homo sapiens mRNA;cDNA0KFZp434F053{fr 
H5.88594 ESTs 

Hs.217081 ESTs ^ ^ 

Hs.30057 MRS2(S.oerewlslaB}4ike, magnesium hom 

Hs.304809 ESTs 

Hs.2256 matrix metalloproteinase 7 (mamlysin, 

HS.12944S hypotHelicd protein FU12496 

Hs.190478 ESTs ^ ^ 

Hs.12346 Homo sapiens cDNA: FU21399 fis, done C 



Hs.6111 ayWiydrocartonreceplornoclearfrans! 

Hs.121629 ESTs 

Hs,332557 ESTs.Wealdy5iniilarloA4B010X*iked 

Hs.291911 ESTs 

Hs.94891 hypothelicd) protein FU22729 

Hs.252956 ESTs , ^ 

Hs^5130 Homo sapiens cONA FU14923 lis. done PL 

Hs.156406 Human DMA sequence from done 1 198H6 on 

Hs.143789 ESTs 

HS.188S46 ESTs. Wealdy similar to ALU1_HUMAN ALU S 

Hs.141693 hypothetical protein MGC10858 

Hs.190535 ESTs .,.^nu. 

gb;EST391351 MAGE resequences, MAGP Homo 

HS.190S16 ESTs 

Hs.197030 ESTs 
Hs.6724 ESTs 

Hs.156739 Ksaptens XG mRNA (done PEP1 1) 

gb3j18108Al SoaresJeldJiver„spleen. 

Hs.336846 ^jj.p^,_,^^Q.20030^001-fl08MT0010Homo 
Hs.32417 hyjxitheticsl pMn MGC4309 
Hs.257859 ESTs 

gb3w80c03.s1 SoaresJtestis^NHT Homo sap 
Hs.897 Fc fragment of IgE, high affinity I, rec 
Hs.190599 ESTs 
Hs.1 2259 KIAA0830 protein 
Hs.23590 solute carrier family 16 (monocarboxyfic 
Hs.201378 ESTs.Weal(lyslmilartoT12545hypothea 

gh:PM2-SN0018-2S0300-003x09 SN0018 Homo 
gb2h88e08£l Soares_felaUiverjspleen_ 
pepSdylglydne atpha-amldaling monooxyg 
ESTs 
ESTs 
ESTs 

histamine receptor H1 
ESTs 
ESTs 

gb3ry57g01.8l Na_CGAPJ^18 Homo sapiens 
Homo sapiens mRNA; cONA DKFZp434P22B (fr 
ESTs 
ESTs 
EST 
ESTs 
ESTs 



Hs.159396 
Hs.98523 
Hs.290791 
Hs.132121 
HS.1S70 
Ks.244627 
H3.97293 

Hs.108972 
Hs.231991 
Hs.114762 
Hs.336753 
Hs.192035 
HS.12G065 
Hs^8482 
H8M)30 

Hs.25933 

Hs^16 
Hs.10592 
Hs.44565 



ESTs 

Homo sapiens mlUlA; cOf^ DKF2p586£1624 (f 
gb:MR4-ST0124-27030OO054>11 ST0124Homo 
ESTs 

gb.r47f0&r1 Scares fetal liver spleen 
phosphodiesterase 3a cGMIMnMbited 
ESTs 
ESTs 
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404200 

417976 BE565B92 
433285 AW975944 
432868 AW974093 
433492 AW505849 
410252 AW821182 
428804 AK000713 
428775 AA434579 
410004 AI298027 
422093 AF151B52 
441736 AW292779 
423017 AW178761 
405970 

431954 AK001974 
459482 AA625339 
410361 BE391804 
410804 U64820 
402230 

436120 A1246193 
405336 

434374 AA631439 
428911 Z43846 
437783 AI683150 
416057 AI927382 
435496 AW840171 
436088 AA704687 
408554 AA836381 
454076 AW204712 
431733 AW298410 
432974 BE348793 
412576 AA447718 
446142 A1754693 
447432 AW966473 
433384 AI021992 
413821 A1808648 
419546 AA24419g 
436111 AI803082 
421236 AI287622 
433917 AI809325 
403515 

429657 013626 
453375 AI990114 
44B1B6 AA262105 
412209 AW9014S6 
421065 AA329711 
409642 AW450809 
420092 AA814043 
453365 AA035211 
437007 AA741300 
408031 AA081395 
439024 R96696 
416432 M14156 
417991 AA731452 
403356 

433650 AA603472 
410318 AA084050 
427019 AA001732 
413714 A(560944 
430887 N66801 
413618 BE15407B 
420908 AL049974 
436168 AK000883 
405692 

432809 AA565509 
433605 AA7OS910 
436192 W93847 
43S4S1 AF195420 
411849 AW964970 
448404 6E089973 
410434 AF0S1152 
416421 AA134006 
438394 BE379623 
444301 AK000136 
428795 R45503 
458924 BE2421SB 
435934 R19382 
400269 

410555 U92649 
412903 6E007967 
400889 

449585 A1655321 
408806 AW847814 
416557 6E140602 



HS.B3G77 InteiteuMn 18 (}nteifenNvg8mma4ndudn 

Hs.237396 ESTs 

Hs.292775 ESTs 

Sb:MRD.HT0241.2001C^0OfrflO2 HT0241 H 

Hs.61418 ntoofitwQIar-associated pcol^ 1 

H5.193736 hypoDietlcdl protein RJ20706 

Hs.143691 ESTs 

Ks.5057 catxjxypeptHlaseD 

Hs.111449 C6I-94 protein 

Hs.169799 ESTs 

Ks.227948 seiine {or cysteine) protonase InMUto 

Hs.272242 tiypotheOcal protein FU11112 

Hs^37052 EST, WeaMy similar to 138022 hypothetic 

H5.62661 guanytdte tending protein 1 , tntafenxv- 

HS.66S21 Mac}l8dl>^lo5eph disease (spinooeiBbeDar 



Hs.119860 ESTs 

gb:np85d02.6l Na.CGAP_Thy1 Homo sapiens 

Ks.194478 Homo sapiens mRNA; cONA DKFZp43401572 (f 

Hs.201550 ESTs. Weakly similar to ALU1 JiUMAN ALU S 

Hs.29857 ESTs 

Hs^653S8 ESTs, Weakly similar to transformaUon-r 

Hs.191294 ESTs 

Hs.3151 1 1 nudear receptor (XHepressor/HDAC3 comp 

Hs.61957 ESTs 

Hs,21475 ESTs 

Hs.233331 ESTs 

Hs.107057 ESTs 

Hs.145968 ESTs 

Hs^1957 nudix (nucleoside diphosphate linked mol 

Hs.124244 ESTs 

Hs.184156 ESTs 

gbuu:06c05.s1 NCLCGAPJPrI Homo sapiens 

Hs.157212 ESTs 

HS.1S19S6 ESTs 

H8.122814 Human ONAsequence from done RPS-1028D1 

Ks.2465 K1AA0001 gene product; putative G-piotei 

Hs.240091 ESTs 

Hs.4094 Homo saptons CDMA aJ14208fis. clone NT 

Qb:RCO-NN1012-270300^1-c07 NN1012 Homo 
gb:EST33382 Entryo. 12 week II Homo sapi 

Hs^57347 ESTs 

Hs.88045 ESTs 

Hs.17404 ESTs 

HS.202S99 ESTs, Weakly similar to 138022 hypotheti 

H5.42173 Homo sapiens cONA FU 10366 fis. ctona NT 

Hs.35598* ESTs 

Hs.851 12 insulln-like grouvth factor 1 (somatomedi 

Hs.190006 ESTs 

Hs^6456 ESTs 

HS.2692S9 ESTs. Weakly simlar to S23650 retrovtnj 

Hs.173233 hypothetical protein FU10970 

Hs.71428 ESTs 

Hs.260287 KIAA1841 protssn 

gb:PM0-HT0339-20040{M)10-FO4 HT0339 Homo 

Hs.100261 Homo sapiens mRNA; cDl4A DKFZp564B222 (Ir 

H3.301645 Homo sapiens cONA FU10021 Us, clone HE 

Hs.131703 ESTs 

Hs.112742 ESTs 

Hs^4139 Homo sapiens cONA: FU23137 fis, done L 

Hs.303006 ESTs. Weakly slmBar to gamma-heregulin 

HS.188&1 ESTs, Moderately similar to KIAA1 276 pro 

gb:RC8-BT0709^1030(V021-G07 BT0709 Homo 

Hs.63868 tolHike receptor 2 

Hs!79306 eukarycfictianslaUonlnMon factor 

Hs.27693 peptiriylprotyl Isomerase (cydophinH 

Hs.10760 asporin [iSVR dass 1) 

Hs.97469 ESTs, Highly similar to A3976g N^tyO 

Hs^44Z7 DKFZP56601646protdn 

Hs.117869 ESTs 

Hs.6431 1 a disintegiln and metsdtoprotelnase dome 

Hs.155795 ESTs 

Ks.ig7693 ESTs 

Hs^OS Homo 8ai^cDNA:RJ21532 fis, done C 
Hs^46645 ESTs 
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453204 R10799 Hs.191990 ESTs 3.3 

4506SS AI654223 Hs.16026 hypoOttQcsl protein FU23191 3.3 

427374 AI150033 H3.143686 ESTs 3.3 

443367 AW071349 ESTs 3.3 

5 446645 AI336596 Hs,156294 ESTs 13 

449897 AW819642 Hs.24135 transmembrane protrin vezalin: hypotheQ 3.3 

428457 AK002121 Hs.184465 hypomelical protein FU 11 259 3.3 

408761 AA057264 Ks.238936 ESTs. W^y similar 10 (deflinenolava 3.3 

403895 3.2 

10 414899 AW975433 H3.36288 ESTs 3.2 

409044 AI12958S Hs.33033 hypolheScd protein FU 14623 3.2 

447233 AW246333 Hs.17901 Homo sapiens, done IMAGE:393701 5, mRNA, 3.2 

422219 AW976073 K5.IOIO regulator of mitoGc spindle assembly 1 3.2 

427119 AW880562 Hs.114574 ESTs 3.2 

IS 437073 AI885608 Hs.94122 ESTs 3.2 

443830 A1142095 Hs.143273 ESTs 3.2 

454962 AW847645 fib:nj.Cn)213-2B0100.05&A04 CT0213 Homo 3.2 

433544 AW342028 ob:hb75d03j(1 Na^CGAP.Ut2Homo88ften$ 3.2 

417561 AWg74345 gb£ST386449 MAGE resequences. MAGM Homo 3.2 

20 446063 Ai720140 Hs.1S1079 ESTs 3.2 

423809 AA32B34a Ks.218289 ESTs 3.2 

426004 AA449563 K3.151393 glirtamat&cysteineliga5e.catalylic8ub 12 

453370 AI470523 Hs.139336 ATP-binding cassette, subfamily C(CRR 3.2 

435808 AA702866 Hs.113150 ESTs 3.2 

25 424001 W67883 Hs.137476 patemaOyexpiessed 10 3.2 

415635 F13168 gbifSC3JF101 iKxmBlized infant brahcON 3.2 

416946 AI798841 Hs.l64526 ESTs 3.2 

431750 AA514986 Hs.283705 ESTs 3,2 

425168 AK002052 Hs.l55071 hypothetical protein FU1 1190 3.2 

30 428268 AA424957 Hs.294132 ESTs 3.2 

418876 W20090 Hs.6616 ESTs 3.2 

416565 AW000960 Hs.44970 endoplasmic reOculum resident piot^ 5 12 

454286 BE22264d HS.2794S8 ESTs. Higldy similar to c38QA1.1b|H.sap 12 

446428 AW082270 Hs.12496 ESTs, Weaidy similar to AUW^HUMANAIOJS 12 
35 404588 12 

413087 BE064655 gbi^C1-8T0311301299-012-€09BT0313Homo 12 

444910 Ai201849 gbx)s76g04.x1 Na.CGAP_Pr28 Homo sapiens 12 

407339 AA777542 Hs.132670 ESTs 3.2 

414093 BE544667 Hs.283077 cenbosomal P4.1-assoclated protein; unc 3.2 

40 438458 AW975186 gb:EST387294 MAGE resequences, MAGN Homo 12 

419340 AA236590 Hs.67530 ESTs 12 

423448 AK000776 Hs.128753 Homo sapiens cDNA FU20769 fls. clone CO 12 

457030 A1301740 Hs.173381 dOiydropyilinidinaso^ 2 12 

421187 NM.014721 Hs.102471 K1AA0680 gene product 12 

45 419929 U90268 Hs.93610 cerebral cavernous maifbrmaOons 1 12 

429276 AF056085 Hs.198612 G protein<oupled receptor 51 12 

423841 AW7S3967 gb:RC2-CT0304480100^11-h12CT0304Homo 12 

438839 AW297945 Hs.128490 ESTs 12 

410085 AA428462 Hs.58589 g(yoogenln2 12 

50 427961 AW293165 Hs.143134 ESTs 12 

429228 AI553633 Hs.337139 ESTs 12 

431548 Ai834273 Hs.9711 novel protein 11 

441839 AWg75512 Hs.29160 ESTs 11 

410389 AW954049 Hs.8177 ESTs, Weddy similar to f^HUBOsdivary 11 

55 441274 AW593781 Hs.131357 ESTs 11 

452401 NM.007115 Hs.29352 tumor necrosis factor, alpha^nducedpio 11 

436154 AA764950 Hs.119898 ESTs . 11 

406752 AI2BS598 gb:qii49R».x1 Na_CGAP_Lym6 Homo sapiens 11 

450689 A1369275 Hs.243010 Homo sapiens CDNAFU14445 lis, done HE 11 

60 434164 AW207019 Hs.148135 sertneAhreonlne Idnase 33 11 

436739 BE208022 Hs.127685 KIAA1 627 protein 11 

451674 AA019104 Ks.175463 Homo sapiens cONA: FU22016 fis. done H 11 

421166 AA305407 H3.102308 potassium inwanfly-ieclilying channel, s 11 

437872 AKD02015 Hs.5887 RNA binding molifprot^ 7 11 

65 440046 AW402306 H8.6877 hypoliieScal protein RJ10463 11 

452824 W27643 Hs.73965 spTidng factor. arginine/^erin&<fdt 2 11 

426457 AW894667 Hs.l69965 ditmerin (chlmaefin) 1 11 

424780 U39576 Hs.l53058 bulyroplillln. subfamily 1, member Al 11 

456551 AW975051 Hs.293156 ESTs. Wealdy similar to 178885 serineAh 11 

70 410763 AF279145 Hs,8966 hypotheHcai protein FU21776 11 

431814 8E256242 Hs.Z70847 della-tubutin 11 

440099 AU)80058 Hs.690g DKFZP564G202 protein 11 

436401 A1087958 Hs.29088 ESTs 11 

437439 H29796 Hs.269622 ESTs 11 
75 4032n 11 

408547 AA574291 Hs.57a37 ESTs 11 

424131 AA335714 Hs.199665 ESTs 11 

433222 AW514472 H5.238415 ESTs. Moderately similar to ALU8.HUMAN A 11 

434636 AA063764 Hs.6101 hypothetical protein MGC3178 11 

80 450519 AA010066 Hs.224849 Homo sapienscONAFU12583 lis, done NT 11 

415083 AIS32683 Hs.27179 Homo sai>tonscONAFU12933 fis, done NT 11 

407905 AW1036S5 HS.2S290S ESTs 11 

452311 AW304029 H8.2S2744 ESTs 11 
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AI269484 


nS.lQ14Zf 


«lnr flnn«r nmMn 91 R 

Zhic nnycr iNuiein & lo 




nnlXKKnls 


Us.4flai9 


Homo saoiens cDNA FLJ1451 1 lis done NT 


407340 


AAB10168 


Hs^2B9 


v{tillg&«ssocialed protein Vrr-1 


401862 








444325 


AW152618 


Hs.16757 


ESTs 


408171 


AA301228 


Hs.43299 


hypoOietical protein FU12890 


423949 


A1014546 


Hs.130912 


ESTs 


419519 


AI198719 


Hs.176376 


ESTs 


434683 


AW298724 


Hs.202639 


ESTs 


418454 


AA315308 


Hs.195870 


hypottwOcd protein mi4S91 


415086 


A1597963 


Ks.1 18726 


ESTs 


419220 


AA811938 


H5.291759 


ESTs 


418849 


AW474547 


Hs.53565 


Homo sapiens PIG-li^ mRNA for mannosyltran 


443634 


H73972 


Hs.134460 


ESTs 





429682 


NM.Q06306 


45 


405090 




432267 


AK000872 




443253 


A1041212 




444974 


AI203500 




445717 


AW664658 


50 


449347 


AV649748 


452778 


R71338 




'414888 


A1039185 




424406 


D54120 




410371 


AADB4482 


55 


426384 


A1472078 


418200 


AW629751 




427050 


AA397789 




449579 


AW207260 




411004 


AW813242 


60 


454032 


W31790 


455601 


AI368680 




447482 


AB033059 




439416 


W58294 




436635 


AW104325 


65 


419086 


NhL00D216 


412566 


AW962S74 




415452 


F09134 




427874 


AA732367 




447046 


AA326187 


70 


454193 


BE141183 


454678 


AW813089 




415122 


D60708 




444665 


BE613126 




400227 




75 


411905 


BE265067 


419503 


AA243642 




446563 


BE326S88 




457285 


A1038858 




434998 


AW975157 


80 


436203 


8E384g82 


424539 


L02911 




449856 


AA203155 




427698 


AW972594 




451494 


AI799444 



Hail 1602 SIMC1 (structural maMenance of chramoso 



Hs.274227 

H3.132117 

Hs.151612 

Hs.149332 

Hs.295901 

Hs.5921 

Hs.77558 

Hs.146409 

Hs.115850 

Hs.303662 

Hs.206654 

Hs.161803 

Hs.134014 

H3.194293 

Hs^ie 

HS.1870S 

Hs.56254 

Hs.272093 

HS.89S91 

H8.12839 
Hs.g8198 
H8.17170 



Hs,22245 
Hs.47783 



Homo sapiens cDNA FU10010 lis, clone HE 

ESTs 

ESTs 

ESTs 

K1AA0493 protein 

Homo sapiens cDNA: FU21592 fc. done C 
thyroid liomnne receptor interactor 7 
cell (fivisnn cyda 42 (GTP-btnding prot 
ESTs 
ESTs 

ESTs, VfeaMy similar to alternatively sp 
ESTs . 

ESTs, Weakly shrllar to T46425 hypotheti 
gb:MR3^T0l9l-0202OO-2a7-g1OSTO191 Homo 
ESTs. Weaidy similar to 154374 gene NF2 
SRY (sex determining region YHxsx 2 
KIAA1233 protein 
ESTs 

ESTs, Weakly stmHar to 178885 serlne/lh 
Kallmann syndrome 1 sequence 
gb£ST374647 MAGE resequences. MAG6 Homo 
ESTs 
ESTs 

G protein-ooupled receptor 4 
gb:MRO-HT0O7M91199^1'faO4HTOO71 Homo 
gb:RC»T0186-240400-111'b05STO186Homo 
ESTs 

B aggressive lymphoma gene 



gb:601193693F1 NIHJ4GC_7 Homo sapiens cO 

Hs.137422 ESTs 

Hs.141454 ESTs 

Hs.130522 Kv channeMntenacting protein 1 

Hs.26037 ESTs 

Hs.5076 Homo sapiens cONA: FU22128 fis. done H 

Hs.150402 adMn A receptor, lype I 

Hs.18200 ESTs 

Hs^140 ESTs 

Hs247095 ESTs, Moderately sinOlar to ALU7.HUMAN A 
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442994 AI026718 H5.16K4 ESTs 

408165 AL137573 Hs.43143 Hoino sapiens nMM;cONADXFZp564A2463(r 

421072 A1215Q&9 HsJ9113 ESTs 

456273 AF154846 Hs.1148 zinc finger poteln 
5 404548 

428201 AA424158 HS.20S461 ESTs 

441519 AA972740 H8.127092 ESTs 

445413 AA151342 Hs.12677 CGH47 protein 

418717 AI334430 Hs,86984 ESTs 

10 428839 AI767756 Hs.82302 Homo sapiens cIWAaJ14814 (is. done NT 

407758 050915 HsJ8365 K1AA0125 gene product 

431906 AW328038 H3.37486 ESTs 

424968 AA34g088 Hs^9746 ESTs,Weafcryslm!lartoA460lOX4inted 

431023 AI283133 Hs.297420 ESTs 

15 432596 AJ224741 Hs.278461 maWDnS 

452412 AA029608 Hs.61373 ESTs 

421309 AI222086 Hs.270449 ESTs, Modeiatety drnllar to ALU1.HUMAN A 

438128 AA904430 Hs.l22049 ESTs. WealdysimUar to T2D4J<UMAN TRANS 

408321 AW405882 Hs.44205 coffctalln 

20 439236 BE160952 Hs^47117 ESTs. Moderately sliriflar to ALUFJWMAN ! 
400B80 

417014 AA251720 Hs.104347 ESTs. Weakly sintilar to ALUC^HUMANlUi 

422278 AF072873 Hs.1 14218 frtzzled (OrosophlJa) homotog 6 
406603 

25 425573 AB006423 Hs,158308 serine (or cysteine) proteinase Inhibito 

427878 C05766 Hs,lB1022 CGW)7prot^ 

451700 AM70262 Hsi9553 ESTs 

451797 AW663856 Hs^13 smaQinduciUecytoMne subfamily £.n» 

423025 AA831267 Hs.12244 hypothetic^ protein FU20097 

30 422634 NM_016010 Hs.118821 CGW2 protein 

448966 AW372914 HsJ8149 phosphoinositol 3-phosphaie-blndlng prol 

408590 AW864542 gb:PM4-SN0016-120500-003-h02SN0016Homo 

408525 AW20e972 Hs.253595 ESTs 

412248 BE176480 gb:R<»W0585-1603004)22<02HT0585Homo 

35 432507 BE391093 Hs.324667 ESTs 

447290 AI476732 Hs.263912 ESTs 

424188 AW954552 Hs,142634 zinc finger protdn 

431448 AL137517 Hs^73 hypolhelical protein DKFZp56401278 

400325 M85292 Hsi47924 Honx) sapiens endogenous HlV-1 related se 

40 408408 AF070S71 M$.44690 Homo sapiens done 24739 mRNA sequence 

423119 AA322201 Hs,131976 ESTs 

423717 AA330036 Hs.152003 ESTs 

424152 AL133591 Hs.141480 Homo sapiens mRNA;cDNADKFZp434N079{fr 

431980 AA523696 Hs.324507 hypothetical Fotein FU209B8 

45 434980 AW770553 Hs.14553 sterql 0«y«ransfafBse (acyMkienzyme 

444339 T96555 Hs.31562 ESTs 

446745 AW118189 Hs.156400 ESTs _ 

459201 AW391177 gb:MRWT0203.22129M23^05ST0203HomD 

430573 AA744550 Hs,136345 ESTs 

50 451073 AI758905 H8^063 ESTs 

440575 AA8B9870 Hs.126006 ESTs 
402046 

426882 AA393108 Hs.97355 ESTs 

435736 AA699633 Hs.269543 ESTs, Weakly sbiilar to A561 94 thromboxa 

55 420656 AA279098 Hs.187636 ESTs 

438323 AI985394 Hs.123369 ESTs 

453123 AI95371B H5^849 ESTs 

418343 AA215372 Hs.159501 ESTs 

431595 AA508196 gb:nh60l07^1 Na_CGAP.Pr8 Homo sapiens 

60 435187 AKD00998 Hs.297221 HQmosapIenscONAFU10136fis.doneHE 

4S9440 BE048054 gb:li46c03.y1 Na_CGAP.Bm52 Homo sapten 

451957 AI796320 Hs.10299 Homo5aplenscDNAFU13545fis,donePL 

40M34 AW195317 Hs.107716 hypothetical protein FU 22344 

456034 AW450979 gb:UI-H.BI3-ala*12^UU1 Na_CGAP_Su 

65 442118 AA976718 \^22A2 ESTs 

420727 H75701 Hs.99886 com(itement component 4^ilndlng protein, 

433849 BE465884 Hs^8072B ESTs 

424235 NIyL003191 H5,143507 T brachyuiy (mouse) homdog 

429826 N93266 H8.40747 ESTs 

70 437913 AI140825 Hs.121523 ESTs 

441330 AI692984 Ks.129354 ESTs 

443458 R05385 Hs.143509 hypothetol protein FU21 924 

438873 AI302471 H3.1 24292 HomosapienscDNA:FU23123fis.doneL 
444561 NM.004469 Hs.l1392 o-fos Induced growth factor (vascular en 

75 444631 AW995395 HS.84S20 EST5,WeaKlyslmItertoTRHY.HUMANTRICH 

458166 AA904244 Hs.153205 ESTs 

436043 AW963838 Hs.166830 Homo 6aplenscONAFU12136 lis, done MA 

415757 AA830854 HS.187B10 ESTs 

449299 AA299919 Hs.84561 ESTs 

80 457003 S78234 Hs.172405 cell division cycle 27 

408875 NM-015434 Hs.48604 OKFZP434B 168 protein 

424602 AK002055 Hs.151046 liypothelical protein FU 11 193 

426174 AA5479S9 Hs.115838 ESTs 
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5 

10 

15 

20 

25 

30 

35 

40 

45 
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55 
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65 
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449318 AW23G021 
429950 AWD81&08 
412733 AA984472 
423637 AL137279 
442655 AW027457 
4205S6 AA278300 
430447 W17064 
416871 H38716 
439737 AI751438 
406815 AA833930 
401094 
401526 

414140 AA2B1279 
417320 AA195667 
418282 AA215S35 
442927 A1024347 
450006 A1241555 
419231 ALJ046294 
416623 N74925 
403329 

41469S AF002020 
419038 AW134924 
440106 AA864958 
448019 AW947164 
431745 AW972448 
421426 AA291101 
433014 NM_014711 
455100 6E1G0198 
441790 AW294909 
404443 

428129 A1244311 
435047 AM549B5 
423948 AW392342 
449327 AI638743 
400983 

415786 AW419196 
411213 AA676939 
420898 AW149342 
409994 D86864 
430388 AA355923 
419530 X98330 
455092 BE1S2428 
456118 AA3802B7 
440192 AAB72282 
448466 AI522109 
414869 AA157291 
440351 AF030933 
407594 AW057584 
439235 N45513 
417061 AI675944 
434812 AA649860 
409731 AA12S985 
455512 AW983808 
408380 AF1230SO 
435990 A1015662 
410872 AW794600 
432798 AA565309 
416288 H51299 
436886 AA827728 
4S15SB NM.001089 
416940 N7S620 
421750 AKD00768 
438398 AA806526 
435313 AI769400 
414605 BE390440 
436508 AW604361 
413195 AA127382 
413829 NM.001872 
401323 

408296 AL117452 
428532 AF157326 
423454 A1110456 
436027 AI864053 
406970 M29994 
428172 AA371307 
452114 N22687 
439750 AL359053 
423130 AW897586 
430660 R11884 
434138 AA625804 
427469 AA403084 
429881 T80112 
411492 T46B48 



Ks.78531 Homo sapiens. SlmSar to RIKENcONA 5730 

Hs.105053 ESTs 

Hs.74554 K1AA0080 protein 

Hs.1 301 87 Homo sapiens mRfJA; cONA DKFZp43401214 (f 

Hs.30323 EST8. WeaWy simter to B34087 liypolheti 

H5.1 24292 Homo sapiens cONA: FU231 23 6s. done L 

Ks^2B48 SVWSNFretated. matrix 8SS0datea.actj 

gl);yx13d08^1 Scares melanocyte 2NbHM Ho 

Hs.41271 Homo sapiens mRNA full tenflth insert cON 

Hs.288036 tRNA isopentenylpyrophosphate transferas 



H5.23317 tiypotheUcBl protein FU14681 

HS.B6022 ESTs 

Ks.98133 ESTs 

Hs.131519 ESTs 

Hs.60171 ESTs 

Ks.136245 ESTs. WeaUy similar b T172Z7 hypolheti 

Hs.38761 Homo sapiens cONA: HJ21564 lis. done C 

H$.76916 Niemann-Pfcic diseasei type CI 

Hs.190325 ESTs 

Hs.127699 K1AA1603 protein 

K5.ig5641 ESTs. Moderately simBar to 138022 liypol 

Hs.163425 ESTs 

Hs.33020 Homo sapiens, done IMAGE3939163, mRMA, 

Hs.279912 KIAA0419 gene produd 

gb:QV1-HT0413<>102004)59^03 HT0413 Homo 

Hs.132208 ESTs 

Hs.26912 ESTs 

Hs.54973 cadherin4to protein VR20 

Hs^83077 Genlro$omalP4.1-8ssodated protein; unc 

HsJ!24672 ESTs 

Hs.257924 hypoUieltcai protein FU13782 
Hs.69285 neurapllin 1 

Hs^4444 Hofn08apienscONA:FU22165<is,doneH 
Hs.57735 acetyl U)Lrace(rtor:SREC 
Hs^40770 nudaarcapt)}ndingpfoteInsubunit2.2 
Hs.90821 lyanodine receptor 2 (cardiac) 

gbK>IIO-HT0323.151299.1264)04 HT0323 Homo 
Hs.78277 OKFZP434P2021 protein 
Hs.190596 ESTs 
Hs.171066 ESTs 
Hs.21479 ubinudein 1 
Hs.7 1 79 RA01 (S. pombe) homOiog 
Hs.160681 ESTs 
H5.46608 ESTs 

Hs.188691 Homo sapiens cDNA FU12033 lis, done HE 
Hs,189496 ESTs 

Hs.56145 ttrymosin, beta, identified in neurobidst 

gb:RC3-HN0001-240400^2-c01 HN0001 Homo 

Hs.44532 diublquttin 

Hs.131793 ESTs 

gb:RC&4JM0014.170300^00S UM0014 Homo 

Hs.194015 ESTs 

gb7p07c06.8l Soares breast SNbHBst Homo 

H&1 28705 ESTs. Weakly simHar to AF149422 2 unkno 

Hs.26630 ATP-blnding cassette, sub-famiiy A (ABC1 

Hs.43157 ESTs 

H5.1 07872 hypothetical protein FIJ20761 
Hs.130277 ESTs 
Hs.189729 ESTs 

Qb:6012B3801F1 NIH.lkAGC„44 Homo sapiens c 
H8.121121 ESTs. VlfeaUy amOar to S0O755 pted(stri 
Hs^404 protease, serine, 12 (Reunibyps!n.moto 
Hs.75572 caiboxypep8da8eB2 (plasma) 

HS.441S5 OKFZPS86G1517 protein 
Hs.164768 TBP-tnteradingprot^ 
K5.469 sucdnale d^rogenase complex, subunit 
Hs.39972 ESTs, Wealdy similar to 138588 reverse t 
gb:Hum3n alphas spectrin gene, exon 12 
HS.1250S6 ESTs 
Hs.8236 ESTs 

Hs,57664 Homo sapiens mRMA hiOlength Insert cDN 
Hs.21213 ESTs 
Hs.100826 ESTs 

0b2u88h01.s1 Soares.tes8s_NHT Homo sap 
Hs^9347 ESTs. Weakly similar to 2109260A B cell 
Hs.192245 ESTs 

Hs.70337 Immunoglobufa superfenffly, membef 4 
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409435 AI810721 Hs.95424 ESTs 2.8 

442191 W95186 Hs^136 endotheOal PAS domain proteh 1 ZB 

407305 AA715284 gbuw35«)3j1 Na.CGAP.BrS Homo sapiens Z8 

444381 BE387335 Hs.283713 ESTs, Weakly simflar to S54054 hypotheU 2.8 

5 412189 R60982 Hs.22581 ESTs 2.8 

420976 A1924940 Hs.108082 ESTs. Weakly 8M]ar(oT3163Shypothet} ^8 

448330 AL03S449 Hs.207163 ESTs Z8 

418912 NM.00066S Hs.89472 angkjtensin receptor 1 2.8 

422S0S AL120862 Hs.l24165 ESTs 2.8 

10 427752 AM70687 Hs.104772 ESTs 2.8 

433513 AIS66356 Hs.171437 ESTs 2.8 

433703 AA21C863 Hs.3532 nemoae kinase 2.8 

448912 D83781 H3^2S59 KIAA0ig7 protein 2.8 

405621 2.8 

IS 430687 BE274217 H&249247 heterogeneous nuclear protein sbnOar to 2.8 

450400 AI694722 Hs.279744 ESTs 2.8 

456644 AI264155 Hs.152981 COP^idcylBtycerol synthase (phosptiallda 2.8 

418342 BE002723 Hs.226627 leptin receptor 2.8 

420756 AA411800 HS.1S9900 ESTs ^8 

20 423532 BE090S03 gb:RC6«T0717-110400^1 14^11 BT0717 Homo ZB 

440320 AA079294 gb:nw86e0as1 Na_CGAP_Pn2Honio sapiens 2.8 

457314 AA479597 Ks.19366g hypotheticdpiotefaiDKFZp588Jl1l9 Z8 

439831 AW136488 Hs.25545 ESTs 2.6 

425661 AL133627 Hs.l5B923 Homo sapiens mRNA;cONAOKFZp434KD722(f Z8 

25 407949 W21874 Hs.247057 ESTs, Weakly eimilar to 2109260AB cell 18 

418558 AW874263 Hs.32468 ESTs Z8 

409978 D31897 Ks.57714 double C2>Gke domains, alpha 2.8 

421340 F07783 Hs.1369 decay acceleratino fxtor for complement 2.8 

449071 NM^005872 Hs.22960 breast carcinoma amplified sequence 2 2.8 

30 409241 AF070602 Hs.51649 Homo sapiens clone 24504 mRNA sequence 2.8 

448219 AA226092 Hs.42656 KIAA1681 protein 2.8 

408936 AL138043 Hs.293549 ESTs 2.8 

410784 AW803201 ' gb:IL2-UM0O77-07050O^8OE06 UM0077 Homo 2.8 

426471 M22440 Hs.170009 transforming growth factor, alpha 2.8 

35 454455 AW752710 gb:IL3^T0219-281099^24.A03CT0219Horao Z8 

455310 AW893961 gb:RC4'NN00Z7-O6040O«11-d1 1 NN0027 Homo Z8 

401335 2.7 

435577 W84774 Hs.17643 ESTs 2.7 

409519 AA075368 gb2m86h10.r1 Stratageno ovarian cancer 2.7 

40 421003 T72080 Hs.95667 F^x protein 30 2.7 

429593 AK000332 Hs.209927 Homo88ptensc0NARJ20325fe.ck)neKE 11 

450434 AA166950 Hs.195870 hypothetical prot^ FU14991 2.7 

436007 AI247716 Hs.232168 ESTs 2.7 

408874 AW818091 Hs.252730 ESTs 2.7 

45 418036 Z37976 Hs.83337 latent transforming growth factor bete b Z7 

435625 H50654 Hs.113999 ESTs 2.7 

435766 R11673 Hs.186498 ESTs Z7 

410327 T33130 Hs.301746 RAP2A. member of RAS oncogene family t7 

416805 F13271 Hs.79981 Human clone 23560 mRNA sequence 2.7 

50 417177 NM_004458 Hs.81452 faUy-ackl-CoenzymeAligasa.k)ng4hain 2.7 

423020 AA383092 Hs.lGOS teplcalhMi protein A3 (14kD} 2.7 

427134 AA398409 Hs.173561 EST 2.7 

428137 AA421792 H5.170999 ESTs 2.7 

429710 Ai337113 Hs.146025 hypothetical protein FU23594 2.7 

55 430844 T94960 gb.-ye38d07j1 Stratagene lung (937210) H 2.7 

417576 AA339449 Hs.82285 'phosphorlbosylglycinamidefQmiyltiansfer 2.7 

441928 AI370188 Hs.211454 ESTs 21 

409721 AW887732 Hs.257861 ESTs tl 

427112 232887 H5.290951 ESTs Z7 

60 403775 2.7 

420159 A1572490 Hs.99785 Homo sapiens cOKA:FU21245 lis. done C 2.7 

427839 AA508823 Hs.98244 ESTs 2.7 

432837 AA310693 Hs.87329 HSPC072 protein 2.7 

438782 AA828380 Hs.126733 ESTs 2.7 

65 449396 BE169100 Hs,195029 ESTs 2.7 

458043 AW979009 H5.326108 ESTs 2.7 

438171 AW976507 Hs.293515 ESTs Z7 

452959 AI933416 Hs.189674 ESTs 2.7 

439556 AI623752- Hs.163603 ESTs Z7 

70 448152 AI292036 Hs.150028 ESTs Z7 

434803 AW974640 Ks.303413 ESTs tl 

407771 AL138272 Hs.62713 ESTs 2.7 

411069 AL133092 Hs.68055 hypothetkal protein DKFZp434l0429 2.7 

417543 AA203620 Hs.110153 ESTs Z7 

75 401517 2.7 

403677 2.7 

416337 H48713 gb:yq78d02j1 Scares (etd fiver spleen 17 

423401 Nlil.001992 Hs.128087 ooagtdafion factor II (thrombin) receplo 17 

446800 AI341635 Hs.1564e6 ESTs 17 

80 457906 AW975939 Hs.153290 Homo sapiens cDNAFUl4318 lis. ctone PL 17 

452277 AL049013 Hs28783 K1AA1 223 protein 17 

416913 AW934714 gb:RC1.DT0001^129W)11<11 DT0001 Homo 17 

416370 N90470 Hs.203697 ESTs, WbaMysbnilar to 138022 hypolheO 17 
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408715 AA768873 
410743 AA089474 
427138 N77624 
436280 BE1727S2 
427585 AI287280 
408092 

410008 AA079S52 
438504 AW655281 
414783 AW069569 
411479 AW848047 
418688 Z38830 
413795 AUM017B 
457528 AW973791 
444230 H95537 
403760 

416624 H69044 
428S04 A1312526 
446311 AW0072g4 
458638 N78553 
459267 Aj003631 
424834 AK001432 
433906 AI167816 
428966 AP059214 
446554 AA151730 
446035 NM_006558 
421585 1)95626 
445158 AI992108 
421175 AI879099 
401793 

410181 AI468210 
427038 NM_014633 
4S1343 AW975057 
455992 BE179015 
438475 W03856 
455571 BE003714 
426298 AW965058 
407930 AAD45847 
453891 AB037751 
451487 AA016072 
418269 AAB06113 
419196 AF110908 
459160 AI904723 
"441963 AI733307 
440273 A1805392 
426902 At125334 
414271 AK000275 
453313 BEOOSni 
445265 AI218295 
422988 AW673847 
428813 AB037749 
444619 BE538082 
457300 AW297436 
402800 

425071 NM.013g89 
414729 BE466928 
453716 AA037675 
452693 T79153 
439818 A1360137 
443305 AI050693 
416709 R99369 
419077 AA233885 
453878 AWg64440 
445660 Ai702868 
446817 AI700684 
442137 AA977235 
410406 AI969703 
442242 AV647908 
407830 NM.0010B6 
415138 C18356 
407055 X89211 
408812 BE397160 
440310 AA878939 
425659 AK000590 
418217 Ai910647 
428667 AI375550 
414573 A1821846 
420000 AB036063 
452821 AW471181 
440138 AB033023 
428483 AI90B539 
441350 ABQ20690 
405059 

425178 H16097 



Hs.1 12250 hypolheScal protein nj23518 

H5.272153 ESTs 

Hs.173717 phosphaOdtc acid (ihosphstaso type 28 

Hs^lO ESTs. WeaMysimilvtoALUS_HUMAN ALUS 

Hs.97933 EST8.WeaMysliniIartoT46370lvpotheti 

glmi20h12^1 Slialagene pancreas (93720 
HS.22462S ESTs 

Hs^78270 unacSveprogestefone receptor, 23 kD 

gb:n.3<:T0214-29129M52-Al2 CT0214 Homo 
Hs.87268' annexinAB 
Hs.142003 ESTs 
Hs^84 ESTs 
H5.146067 ESTs 

Qb7r77h05.s1 Soares fetal Bver sptaen 

Hs.46540 ESTs 

Hs.149795 ESTs. Moderately sknitar to ALUI.HUMAN A 

HS.2S2204 nudeoscmiai tending protein 1 

ObA1003631 Selected chromosome 21 cONA 

Hs.153408 Homo sapiens CDNAFU10570 lis. done NT 

Hs.43355 ESTs 

H5.194687 cho1estero)254iydioxytasa 

Hs.301789 nudix (nucteoslde diphosphate Fiiked moi 

Hs.1 3565 Sam&64ike phosphotyrosine protein. T-ST 

Hs.302043 cheniokine(C-CmotH)receptor-iike2 

Hs.127206 ESTs 

Hs.102397 GiOT-3 for gonadotropin inducible transc 

Hs.261285 plelolropic reguiator 1 (PRL1, Aratiidops 
Hs.173288 KiAAOISS gene product 
H8.293353 ESTs 

gb:RC3-HT0612^5OO^miO HT0612 Homo 
Hs.13168 ESTs. Highly slmilsr to Gene product wit 

gb:QV3'BN009&-200400-161-a01 BN0096 Homo 
Hs.1 1 1583 ESTs. Weakly similar to 138022 hypoiheO 
HS.18B361 Homo sapiens cDNA aJ12807 lis. done HT 
Hs^53 Homo sapiens mRNAfidlteiigth Insert d)N 

gbze51g02.rl Scares reOna N2b4HR Homo 
Hs.189025 ESTs 

Hs^7660 TNF receptor-associated factor 3 

gb:CM-BT066-120299-092 BT066 Homo saplen 
Hs.128002 ESTs 

Hs.325335 Homo sapiens cDNA: aj23523 lis. done L 
Hs.97408 ESTs 

H3.75e71 protein kfaiasa C binding protein 1 

H5.153746 hypotheticat protein FU22490 

Hs.144g42 ESTs 

Hs.97321 ESTs 

Hs.186928 K1AA1328 protein 

Hs.8172 ESTs, Moderately simHar to A46010 X-Tm 

Hs.158849 Homo sapiens cOI^ fU21663 fis, done C 

Hs.154424 deIo(finase. todolhyionine. type II 

Hs.281901 ESTs 

Hs.152675 ESTs 

HS.48S89 Zinc finger protein 228 

Hs.19934 Homo sapiens niRNA fun length insert d}N 

Hs.133318 ESTs 

Hs^83108 hemogtobin.gamma6 

Hs.164526 ESTs 

Hs.19025 OC32 

Hs^l^ ESTs 

Hs.134166 ESTs 

Hs.128830 ESTs, Weaidy slmSar to Z19^.HUMAN ZINC 

Hs.1466 glycerol idnase 

Hs.90424 Homo sapiens cONA: FU23285 Ss. done H 

Hsi67 arylacelamide deacetylase (esterase) 

Hs:295944 tissiiefador pathway Inhibitor 2 

gbHsaptens DNA for endogenous letrovir 

Hs2&4763 ESTs, Weakly siniilar to A42442 integifn 

Hs.125408 ESTs 

Hs.15e836 hypothetical protetoai20563 

Hs.13442 ESTs 

Hs.74407 nucl«^ protein p40;hoinotog of yeast 

Hs.71999 ESTs 

Hs.94262 pSd-indudbte ribor\ucleotida reductase s 

Hs.160874 ESTs 

Hs.318127 hypothet»aiprotdnFU10201 

HSJ21444 K1AA0344 gene product 

Hs.7782 paraneoplastic antigen MA2 



Hs.161027 ESTs 
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442952 AI743261 Hs.131860 
428692 AI372822 Hs.ltOKO 
456179 H75490 Hs.271930 
414136 AAB12434 Hs.1 19023 
5 459456 AA486036 Hs.190124 
425527 AL162032 Hs.156258 
424711 NNL005795 Ks.152175 
417956 AA210704 ' Hs.190465 
420621 AA278808 
10 425698 NM.016112 
438295 AI394151 
445SS0 AI242754 
450469 A1955049 
458S04 A115762S 
15 443657 R14973 
429250 H56585 
437906 AA771704 
426775 AA384564 
443372 AI792557 
20 453765 A1366236 
425465 L18964 
422746 NM^004484 
413450 Z99716 
424527 AW138558 
25 414180 AI853304 
411402 BE297855 
445264 AI218263 
458661 AI630223 
415227 AW821113 
30 435429 AW592035 
434445 AI349305 
446570 AI923944 
452381 H23329 
422879 AI241409 
35 409026 AL137554 
425717 X07282 
429127 AA749382 
438298 H23542 
442717 R88362 
40 443555 N71710 
444517 AI939339 
451813 NM.016117 
452453 A1902S19 
455870 AW452631 
45 437939 AW298600 
430719 AA488988 
452864 AA033714 
432095 AWD22273 
431086 A1829692 
50 407783 AW996872 
423952 AW877787 
453403 BE466639 
408172 W02488 
430933 AW863635 
55 420691 AA829433 
429761 A1276780 
437958 BE139550 
407494 U10072 
436464 At016176 
60 407137 T97307 
446223 BE300091 
438647 AA813118 
438192 AI8S9065 
417218 AA005247 
65 440460 H92571 
414612 BE274552 
428170 H0S530 
457343 NM.013936 
424020 R75946 
70 4SS226 AW902103 
411965 eE48733a 
432656 NM.000246 
455488 AA102322 
434340 .Ai193043 
75 404285 

416744 AI887288 
454714 AW815098 
429828 AB019494 
436387 AA7147eO 
80 446587 A1S39652 
432855 A17S3709 
440479 AA886461 
443160 AI46791S 



ESTs 

RNA potymerase 1 transcription factor RR 
ESTs 

SMC2 (structural m^nlenanoe of ctuomoso 

ESTs 

Homo sapiens mRKA; cONA DKFZp434B1272 (f 
catoMn receptar4DtB 
ESTs 

gb2s79c09.rl Na^CGAP^GCBI Homo sapiens 
polycystic kidney disease 2«likB 1 
ESTs 
ESTs 
ESTs 

gb:DKFZp761L2016^1 761 {synonym: hamy2) 
gb7f42f10.s1 Scares fetal liver spleen 
tryptophan rid) baste protein 
ESTs 
ESTs 
ESTs 

ESTs. Moderately similar to ALUI.HUMAN A 
protein kinase C lota 
gtyptcan 3 

N-acetylgalactosamlnldase, alpha- 
ESTs, Wealdy similar to 154374 gene NF2 
Homo sapiens cONA FU1 1448 fis. done HE 
NRAS-relaled gene 
EST 

gb:ad06g08.r1 Proliferating Erythroid Ce 
ESTs 

ESTs. \nfeakly similar to 18051958 RNA-btn 
ESTs 
ESTs 

ESTs. Weakly slmflar to ALU1J^UMAN ALU S 
ESTs 

protein kinase NYD-SP1 5 
relinotc acid receptor, beta 
ubiquiftvconjugding enzyme E2D 3 (homo 
ESTs 

ESTs, Weakly similar to T23976 hypolhefi 
ESTs, Moderately similar to A Chain A, H 
ESTs 

phosphoUpase A2-acilvating protein 
gb:QV-eT009-1 01 198^51 BT009 Homo saplen 
ESTs, hfighly similar to AF157833 1 noncl 
ESTs. WeaWy similar to S59501 biterfeiD 
ESTs 

hypothec protein FU14260 
ESTs 
ESTs 

a disinlegfin and metafloprotelnase dome 
KIAA0853 protein 

Homo sapiens cDNA FU13591 fis. done PL 
phosphoglyoerate mutase 2 (musde) 
gb:MR3^N0010-270300.103^i02 SN0010 Homo 
ESTs 
ESTs 

ESTs, Moderately similar to P(M259 ferii 
gb:Human forkhead famfly (AFX1) mRNA, pa 
ESTs. WBaWy similar to AlAJI JtUMAN ALU S 
gb7eS3h05.8l Soares fetal Over spleen 
hypothetical protein FLJ12969 
ESTs 

Homo sapiens AFG3L1 Isofomi 1 mRMA, part 
met protiHJncogene (hepatocyto growth fa 
Homo sapiens cOKA: FU22648fis, done H 
protein tnhitiitor of adivated STATS 

ESTs 

otfach^ receptor, family 12, sulifMy 
ESTs 

gb:QVO-NN1O22-12050O-22(k07 NN1022Homo 
ESTs 

MHCdasslltransactivator 
gb3l90fO3.r1 Stralagene colon (937204) 
Hs.128685 ESTs. Weakly similar to T17226 hypotheti 

H5.196379 ESTs. Weakly similar to putaSve pi 50 [H 

gb:QV4.ST0212^1199-023^0ST0212Homo 

Hs^25767 1DN3 protein 

Hs^40075 Homo sapiens cONA FU13234 fis, done OV 

HS.2B338 KIAA1546 protein 

HS.1S2484 ESTs. Weakly similar to 138022 hypotheS 

HS^161 ESTs 

Hs.36053 ESTs 



HS.1S9241 
Hs.37932 
H5,137306 
Hs.281326 



Hs.ig8308 
Hs.194626 

Hs.108829 
H8.133107 
Hs.283732 
Hs.1904 
Hs.1 19651 
H3J5372 
Hs.267158 
H$.1 20905 
Hs.69855 
Hs.323472 

H5.72402 
H3.254414 
Hs.11782 
H8.30913 
Hs.290880 
Hs,188092 
Hs.49927 
Hs.171495 
Hs.1 18797 
Hs.181788 
Hs.180591 
Hs.21398 
Hs.146883 
H3.27182 

Hs.313803 
Hs.141840 
Hs.293796 
Hs.287629 
H3.105769 
Hs.211561 
Hs.172028 
Hs.136102 
Hs.61779 
Hs.46039 

H5.275343 
H3.135173 
H8.121668 

Hs.269783 

Hs.119699 
Hs.163230 
Hs.337620 
H8.285754 
Hs.234478 
H8.76578 
Hs.12565 
Hs.247882 
Hs.39738 

K5.280115 
Hs.3076 
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42B978 
444570 
453459 
418122 
442B75 
414373 
458760 
434131 
441805 
457292 
417351 
409695 
432824 
438038 
454836 



422487 

408727 
427491 
435102 
409817 
455866 
432887 
407756 
401078 
410365 
425201 
457112 
455252 
444542 
419249 
428497 
457336 
427621 
423782 
430403 
429927 
408562 
417137 
435787 
440331 
429716 
417169 
453020 
455286 
450654 
400432 
415747 
446346 
450209 
453202 
425523 
433124 
408741 
425657 
401254 
426604 
449535 
433138 
425804 
429515 
437267 
454305 
455631 
401878 
450350 
436532 
457460 
427304 
419721 
431699 
446252 
421229 
434273 



AA442784 
H58373 
BE047032 
R4277B 
BE623003 
AW162907 
AH98831 
AI858275 
AA285138 
AI921270 
T90278 
AA2^61 
AK001783 
AI732629 
AW833711 
AW959912 
AJ010901 
AL137259 
R43279 
AW899053 
BE003760 
BE149024 
AI926047 
AA1ie021 



434353 
438198 
452232 
433764 
412050 
422342 
427510 
428336 



AI287518 
AA352111 
AW772449 
AW876627 
A1161293 
X14767 
6E010877 
AW969657 
BE621182 
A1472209 
AF039390 
NML001115 
AI436323 
U46265 
AAg08554 
A1J046412 
R25585 
R13550 
AL162039 
BE144384 
AJ245687 
AX015809 
AA381209 
AI290205 
AW0Q8921 
AW085781 
AB007948 
U51712 
M73720 
T89839 

H53354 
W15267 
AB0294g8 
BE501698 
AU)31228 
AW511443 
BE062633 
BE063031 

T97817 
AA721522 
AJ143312 
AA761526 
Nf/L001650 
NM-001173 
AI283125 
A1056590 
AA913143 
BE177320 
AA630863 
AK001125 
AW020603 
AW753676 
H9 6503 
AA309272 
Z47542 
AA503115 



Hs.125445 
Hs.332938 
Hs.257789 
Hs.22217 
Hs23625 
Hs.75g59 
Hs,1 11334 
Hs.143659 
Hs.301914 
Ks.334882 
Hs.15049 

Hs.279012 
Hs.194161 

Hs.7076 
H8.198267 

Hs.47115 
Hs.22574 
H8.76917 
Hs^5209 

H$.162859 
Hs^260 

Hs.62669 

Hs.268081 

Hs.280380 
Hs.89768 
Hs.98564 
Hs.291029 
Hs.179882 
Hs.323117 
Hs.2413a2 
H5.2522 
Hs,31141 
Hs.81281 
Hs.192756 
H3.202151 
H5,211933 
H8^46773 
Hs.31422 

Hs.25275 
H3.287767 

Hs;309940 
Hs,13138 
Hs^6270 
Hs.158244 
H8.13775 
H8.646 
Hs.1 19471 



ESTs 

hypolheticd protein MGC5370 
ESTs 

Homo sapiens done liyiAGE:32106. mRNA seq 
Homo sapiens done TOCCTAD0142 mRNA sequ 
pfoGne^ preletn vrilh nudear taigel 
ferritin, light poiypejiBde 
ESTs 

neuronal spedQc transcripOon fador D 
hypolheOca) prolan RJ14251 
ESTs 

gb:EST112514 Adrenal gland tumor Homo sa 
hypothetical protein FU10921 
ESTs, Weakly similar Id TA2R HUMAN, BETA 
flb:QV4-TT0008-251199We11 TTOOOSHomo 
KIAA1705 protein 
mudn 4. tradieobronchtal 
hypotheCcal protein DKFZp434D0513 
ESTs. Weakly similar to 138022 hypotheti 
F>l)Ox only protein 8 

Homo sapiens mRNA; cONA DKFZp434K0514 (f 
gb:CM0-HTQ249-2910994»44O4 HT0249 Homo 
ESTs 

ut)lqunin spedte protease IB 

Homo sapieis mRNA; d)NA DKFZp586D0923 (f 
gb:EST60061 Adivated T-cells XX Homo sa 
ESTs, We^y similar to ALU1.HUMAN ALU 8 
9b:RCm0028.120200^13<I11 PT0028Homo 



ganvna^nlnobutyito add (GABA) A recepto 

ESTs 

ESTs 

hypothetical protein RJ12443 

ESTs 

tumor necrosis factor (llgand) superfami 

adenylate cyclase 8 (brdn) 

Homo sapiens mRNA for KIAA1S68 protein, 

mitochondrfd ribosomd protein 821 

ESTs 

ESTs 

collagen, type XIII, alpha 1 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434M229 (fr 
gb:MRO-HT0l65-191199^KW-c11 HT0166 Homo 
Kruppehtype zinc finger protein 
Sequence 8 from Patent WO99S0285 
gb*£ST942S7 Actfvated T<cells I Homo sap 
ESTs 

Homo sapiens, done iMAG£:3448343, mRNA. 

hypothetical protein FU1 1588 

KIAA0479protan 

hypothetical protein SMAP31 

catboxypeptldase A3 (mast cell) 

ESTs 



Hs.97141 ESTs. WbaMy similar to hypothetlcai pio 

HsJ23672 low density ItpopiolBlnreceptor-f^lated 

H3.59729 8emaphor1nseiii2 

Hsi5ei89 ESTs 

Hs.204370 DNA segment on chromosome 6 (uhiqiie, pse 

Hs.258110 ESTs 

Hs.28338 iaAA1546 protein 

gb:MRO-BT0265-231199W-e09 BT0265 Homo 

H8.174880 ESTs 

gb:mf54h12.r1 NCLCGAP_Ew1 Homo sapiens 

Hs.129206 casein kinase 1, gamma 3 

Hs.163853 ESTs 

Hs.288650 aquaporift4 

H5.267831 RhoGTPeseacthraGngpnileinS 

Hs.150009 ESTs 

Ks.7086 hypothetical protdn MGC12435 

Hs.26303 ESTs 

Hs.156148 hypothetical protein FU13231 

Hs.1 31375 ESTs. Moderately similar to ALUB_HUMAN 1 

Hs^00922 Homo sapiens cONA FU10263 fis, done HE 

Hs.1 58423 radial spoke protein 3 

Hs.39982 ESTs 

Hs.109087 Homo sapiens cDNA: FU2284S fis, done K 

^:EST180209 Uver, hepatocethjiarcard 
Hs.179312 small nudear RNA adivaSng complex, po 
Hs.183752 n^craseminoprotein, bela- 
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408813 AIS60090 Hs.48295 RNAheficssefMy ^5 

414109 BE250744 gb:600943376F1 NIH^MGCLI? Homo sapiens c 2.5 

451678 AA374181 Hs.26799 DKFZP564O0764 protein 2.5 

- 419985 H66373 Hs.15973 ESTs. Highly similar to bA393J1 6.3 [H.5a 2.5 

5 417859 T26453 gb:AB214F6R Infant brrin.LLNL array of IS 

434334 AA912476 HS.11S750 Homo sapiens cDNA FU13221 Qs. done KT 15 

448015 AI458065 Hs.23196 ESTs ^5 

454190 AW177821 gb:IL3-HT0059-180699^-C05HT0059HomD Z5 

4458S5 AI262584 Hs.145575 ESTs ^5 

10 451800 AW977435 Hs.323867 ESTs 2.5 

456987 AI557290 Hs.173536 ESTs 2.5 

403568 Z5 

435209 AW027809 Hs.187698 Homo sapiens cytomegatovinis partial fus 2.5 

430371 087486 Hs.240112 iaAA0276 protein IB 

IS 416033 W68180 Hs.259855 elongation (actor-2 kinase 2.5 

412095 AI624707 Hs^ Honu) sapiens cDNA: FU21592 (is. done C 2.5 

453619 H87648 Hs^922 Homo sapiens, dona M6C:9084. mRNA.comp 2.5 

431071 AA491379 0baa65IO5.r1 NQ.CGAP^GCBI Homo sapiens ^5 

407939 WDS608 Hs.312679 £ST8,WBBklysimaartoA4goi9dyneint)e 2.5 

20 409045 AA635062 Hs.50094 Homo sapiens nANA;cONADKFZp434O051 5 (f 15 

444575 AI264847 Hs.22545 Homo sapiens CDNAFU12935 lis. done NT Z5 

408420 NNL006915 Hs.44766 retinitis pigmentosa 2 (X-tinked recess! 2.4 

417318 AW9S3937 Hs.12891 ESTs Z4 

413382 BE090589 gb:RC1-BT072&.2803004)11-flBBTO720Homo 2.4 

25 406748 AW339106 Hs.217493 annexinA2 Z4 

445898 AF07D623 Hs.13423 Homo salens done 24468 mRNA sequence ZA 

441817 AW969706 Hs.293332 ESTs 2.4 

450551 AJ010046 Hs.25155 neuroepithenal CdH transforming gene 1 2.4 

457940 AL360159 Hs.306517 Homo sapiens TRipartite motif protein ps 2.4 

30 446135 AW130288 Hs.170316 hypothetical protein FU 10147 2.4 

436907 AA737171 Hs.131809 ESTs 2.4 

429399 AA452244 Hs.16727 ESTs 2.4 

448782 AL050295 Hs.22039 K1AA0758 protein 2.4 

434404 AW44S034 Hs.256578 ESTs 2.4 

35 428571 NM„005531 Hs.2291 Prol)e hTg737 (polycystic kWney disease. Z4 

448164 R61680 Hs.2G904 ESTs. Moderately similar to Z195_HUMANZ 2.4 

442295 At827248 Ks.224398 Homo sapiens cDNAFUl 1469 (Is. dons HE Z4 

450705 U90304 H5.25351 Iroquois homeot)ox protein 5 2.4 

425506 Nl\^003866 Hs.15820S task; leucine zipper nudear factor 1 (J 2.4 

40 423961 013666 Hs.136348 osteoblast sped(« (actor 2 (fasddin ZA 

458988 AI802772 Hs.208655 ESTs 2.4 

443861 AW449462 Hs.134743 ESTs 2.4 

412879 BE092219 gb:IU2-BT0734-24040^071-B04 BT0734 Homo 2.4 

415250 F02614 Hs.27319 ESTs 2.4 

45 434627 AI221894 Hs.39311 ESTs 2.4 

443919 AI091284 Hb.135224 ESTs. Weakly simDar to A47582B^»llgr 2.4 

440400 AA994364 Hs.125594 ESTs. Weakly similar to T25472 hypotheti 2.4 

400385 N^1.020389 Ks.283104 putative capadtativecakmjm channel 2.4 

411322 AW887330 Hs.172405 ceD (fivlston cyde 27 2.4 

50 434638 H50758 gb7p86e06.r1 Soares fetal liver spleen 2.4 

435559 AF209198 Hs.42636 zinc linger protein 277 2.4 

447849 A153B147 Hs.154277 ESTs ZA 

448005 AW207437 Hs.17037B ESTs 2.4 

454201 AB023191 Hs.44131 KIAAQ974 protein ZA 

55 456869 BE467912 H5.154294 discs, large (OrosophSa) homdog 1 2.4 

449486 AI652715 Hs.270811 ESTs ZA 

421516 AI362418 Hs.105379 FTOOSproteht 2.4 

412167 AW897230 gb:CMO.NN0057-15040(W35.a11 NN0057 Homo 2.4 

426910 AA470023 Hs.190089 ESTs, Moderately simitar to ALU I.HUMAN A ZA 

60 429673 AA864407 Hs.211595 protein tyrosine phosphatase, non^ecept 2.4 

400641 2.4 

430576 AA76712S Ks.293S74 ESTs 2.4 

434423 NM-006769 Hs.3d44 domain only 4 14 

412104 AW205197 Hs.240951 Homo sapiens, Similar to RlKENcDNA 2210 2.4 

65 441499 AW298235 Hs.1016B9 ESTs 2.4 

418113 AI272141 Hs.83484 SRY{sexde1emiiningiegk)nYH»x4 ZA 

417819 A1253t12 Hs.133540 ESTs 2.4 

• 431728 NM.007351 H8.268107 mutfimeiin 2.4 

425025 AW953168 Hs.12407 ESTs 2.4 

70 421168 AF182277 Hs.330780 cytodvomeP450.5ubfamaynB(phenobar 2.4 

409432 D49372 Hs.54460 small Indudble cytokine subfandly A (Cy 2.4 

408867 AA437199 Hs.656 celt diviston cyde 25C 2.4 

439446 AI927629 Hs.57873 ESTs 2.4 

445038 AI635444 Hs.143917 dJ467N1 1.1 protein 2.4 

75 450682 Z42993 Hs.25320 Homo sapiens done 25142 mRNA sequence 2.4 

455107 BE154113 gb:PM1-HT034O.15129W)03-a08HT0340Homo 2.4 

458624 A13627gO Hs.278639 KiAA1684 protein; likely homdog of mous 2.'4 

449523 NiyL000579 Hs.54443 diemokine (OC mofiQ receptor 5 2.4 

428784 Y128S1 Hs.193470 purtneigic receptor P2X, HgancHiated lo 2.4 

80 453864 AW021407 Hs.21068 hypolhelteal protein 2.4 

426497 AA379913 gb:EST92807 Skin tumor 1 Homo sapiens cD 2.4 

418203 X54942 Hs.83758 ' COC28 protein kinase 2 ZA 

426603 AA382291 gb:ESTg5683 Testis I Homo sapiens cONA 5 2.4 
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447357 
452631 
405041 
405472 
409744 



AI375922 



Hs.159367 ESTs 
Hs.67496 ESTs 



437119 
455114 
431B13 
434936 
408918 
444106 
416580 
430299 
446659 
418636 
434995 
438005 
444755 
427131 
442039 
448595 
432949 
444314 
417420 
427551 
420057 
434950 
425497 
438214 
416100 
419637 
449432 
454403 
419179 
436391 
449511 
447499 
407244 
412877 
435985 
440674 
448476 
444100 
435731 
437105 
406091 
457024 
404249 
419556 
424943 
444229 
404860 
432223 
410467 
420843 
434927 
413642 
436998 
441235 
445748 
451018 
409073 
418782 
447870 
437370 
424765 
407385 
424049 
42S398 
430702 
409820 
441675 
430884 
445523 
416972 
443547 
417583 
422182 
404513 
410999 
448506 
434611 



AW675258 
AA612960 
AI379921 
AW857121 
AA018515 
A1285970 
BE218603 
A)123922 
T61572 
W2B673 
A)335361 
AW74S855 
AW974995 
BE151746 
AA431791 
AA448460 
AW276240 
AB014544 
AA570749 
A1140497 
T85150 
T96203 
AA806899 
AW974892 
AA524S96 
H06076 
H18700 
W27493 
AW451361 



AW275291 
AJ227892 
A1436187 
AW262S80 
M10O14 
BE011168 
AA703154 
BE561546 
AW294072 
AA3a3343 
AA699581 
AA744554 



Homo sapiens mRNA; cDNA OKFZp586P2321 |f 
ESTs 
ESTs 

gb:RC1-a0302-a40400^)17-e12 Cr0302 Homo 
Homo sapiens mRNA; cONA DKFZp761A041 1 (f 
ESTs 
ESTs 

Homo sdfktts cONA FU 11 400 lis, done HE 
Human dons 23574 mRNA sequence 
sefine carboxypepOdase 1 pfecwsor plot 
ESTs 

gb:QV4^0S34-2812g9^S3«05 BT0534 Homo 
gb:EST387100 MAGE resequences. MAGN Homo 
gb:PM1-HT030&4)61299-003^ HT0305 Homo 
OpX (casdnolytic protease X. E. cOfi) 
GE36g6n8 
ESTs 

KIAA0644 gene product 
ESTs 

gb:ow76b09Al Soares JeULPverj^Jleen. 
ESTs 

gb7e48b07.r1 Soares fetal livar spleen 
ESTs 

gb:EST386997 MAGE resequences. RAAGN Homo 
gb:nh34b0is1 Na_CGAP_Pr3 Homo sapiens 
Hs^6320 TRABtD protein 
Hs.268799 ESTs 

gb:31h10 Human retina cDNA randomly prim 
Hs.196529 ESTs 

gb:RC3-BT0319-120200-014.a09 810319 Homo 
Hs.113009 hypolhelicd protein FU22527 
H5.146274 ESTs 

Hs.296261 guanine nucleotide l^imfing protein (G pr 
Hs.147674 piotocadherin beta 16 
Hs.75431 fibrinogen, gamma polypepfida 

gb:PM3-BN02l8-100500md08BN0218 Homo 
Hs.191934 ESTs 

gb:601347208F1 NIH.MGC.8 Homo sapiens cO 
Hs.141376 ESTs 

Hs.221 16 C0C14 (ceil division cyda 14, S. cerevi 
Hs.186811 ESTs 
Hs.222127 ESTs 



HS.5S265 
Hs.337300 
Hs.177043 

Hs.264482 
Hs.183817 
Hs.279708 
Hs.138215 
Hs.79385 
Hs.106747 
Hs.226376 



Hs.113823 
H5.112017 
H8.128352 
Hs.21572 
Hs.298866 

H5.268814 

H5.184387 



AA397546 Hs.119151 ESTs 

U29815 H3.91093 chltinase 1 (dutotriosidase) 

AU077260 Hs.153924 daath-assoctated protein Unasel 

AV648613 Hs.282397 ESTs 

AA333283 Hs.1 21001 Homo sapiens, done IMAGE:34602B0, mRNA 

AF102546 Hs.63931 dachstiund (Drosophila) homolog 

H96982 Hs.42321 ESTs 

H46612 Hs^3815 Homo sapiens HSPC285 mRNA. partial cds 

BE154837 gb:PIUI1-HT0345.121199^1-c08 HT0345 Homo 

AA74562S Hs.291414 ESTs, Wealdy similar to AUUS^HUMAN ALU S 

AI884586 Hs. 135570 Homo S8|rfenscDNA:FU21268fis, done C 

U80766 Hs.1 3252 Human EST done 22453 maitner transposon 

AW965599 Hs.247324 mitochondrial ribosomal protein S14 

AA063458 gb:zf71a07.s1 Soares_^edUland_N3HPG 

AI792648 Hs.14665 ESTs 

BE139479 Hs.161492 ESTs 

AL359567 Hs.161962 Homo sapiens mRNA; cDNADKFZp547D023{fr 

AA428211 Hs.284256 hypothetical proteb RJ14033 similar to 

AA810150 Hs.272072 ESTs, WeaMy similar to 138022 hypotheli 

AB014524 Hs.138380 K1AA0624 protein 

AU049689 Hs.156369 hypotheflcal protein similar to tenasdn 

U56979 HS.2S0651 H factor 1 (completnenl) 

AA076278 Hs.13277 hypothetical protein FU22054 

A1914329 Hs.5461 ESTs 

AF053746 H5^48114 gQai cell derived neurotrophic factor 

Z301ia Hs.293788 ESTs, lAxlerately similar to unnamed prot 

BB)19670 gb±b28c0lJi1 NIH.MGa.5 Homo sapiens cON 

AW271273 Hs.23767 hypothetical protein FU 12666 

AA668782 Hs. 1 91 284 ESTs, WeaMy similar to ALUI^HUMAN ALU S 

AU)43892 Hs.180582 Homo sapiens cONA: FU21836 fis. done H 

AW813004 gb:RC3^0186-23030D^194i02ST0186HQmo 
AI524673 Hs^170 ESTs 
AW97120S Hs.1 14280 ESTs 



Z4 
ZA 
2.4 
lA 

ZA 

ZA 

2.4 

2.4 

2.4 

ZA 

2.4 

2.4 

2.4 

2.4 

2.4 

2.4 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

2.4 

2.4 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

2.4 

2.4 

ZA 

ZA 

ZA 

2.4 

2.4 

2.4 

ZA 

ZA 

ZA 

ZA 

ZA 

ZA 

2.4 

ZA 

2.3 

2.3 
2.3 
Z3 
2.3 
2.3 
2.3 
Z3 
2.3 
Z3 
2.3 
2.3 
Z3 
2.3 
2.3 
2.3 
2.3 
Z3 
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457065 
407945 
419865 
4235S6 
455607 
435667 
440196 
401213 
407291 
442490 
452943 
438133 
440283 
447039 
412777 
421424 
406673 
440555 
451516 
424690 
421046 
423604 
409029 
444206 
451836 
454784 
423673 
436671 
434988 
452B62 
439480 
410605 
426535 
432239 
430217 
417479 
421253 
438180 
439715 
441398 
443055 
413585 
448831 
412953 
430789 
422757 
423003 
428595 
437887 
447720 
452355 
408374 
440361 
425478 
432231 
431757 
417517 
452837 
417426 
423739 
416847 
425876 
457411 
413136 
420313 
421751 
424827 
436331 
439275 
449272 
454352 
428758 
407242 
445326 
423778 
452607 
423161 
418851 
456332 
432565 
437511 
430957 



A)476318 K5.192480 ESTs 

X69208 Hs.605 ATPase. Ok* UanspwUng, alpha polypep 

NWL007020 Hs.93502 U1-snRNP binding protein homolog (70kO) 

AA326195 H5.234101 ESTs. We^ similar to CTL1 protein |H. 

BE141140 gb:MR0.HT0075«M2994J0M07HT0075Honio 

AA954229 Hs.114052 ESTs 

N72847 HS.12S221 ESTs 



AA001464 
AW965078 
BE247449 
R98299 
AI732892 
AV651798 
AI335773 
AW452690 
M34996 
D31292 
A)800515 
BE538356 
AAB10B54 
AA48658S 
BE087807 
AW301017 
T63673 
AW820626 
6E003054 
AW137159 
A1418055 
AW378065 
AL038511 
AW418779 
AU077012 
X81334 
N47863 
At057052 
AI188102 
AA808189 
AA524504 
AA932398 
AV653742 
AI133452 
AL080123 
Z45794 
AA632577 
Ai909935 
AL120077 
AB037795 
AAB11524 
AL038765 
N54926 
AW025430 
AA917808 
AB007953 
AA339977 
AA196930 
AF001176 
AL121053 
NM-002291 
AA398155 
L43821 
AW005887 
AW085961 
BE066941 
AB023230 
AW813731 
AI0S7094 
AI239495 
AF086093 
AW137656 
AW389668 
AA433986 
M18728 
A1220072 
Y09267 
A1160029 
AU)49227 
AI417828 
A1000341 
AA553477 
AI807SOO 
Ai937072 
AA38S649 



gb:ze45b01.r1 Soares relina N2b4HR Homo 
Hs^212 Oiyroid receptor inleiading prot^ 15 
Hs.31082 hvpolheticai protein FU10525 
Hs,177502 ESTs 
Hs.190489 ESTs 
Hs.282915 ESTs 
Hs.270123 ESTs 
HS.2S877S ESTs 

Hs.198253 major histoconpalibiniy complex, class 
Hs.6853 hypolhetica) protein FU22167 
Hs.12024 ESTs 

Hs.1 51777 eultaryottc lrar\siation ir^tlalion factor 
Hs.89081 ESTs 
H5.258901 ESTs 

gb:QV1.BT0681-290400-181^2 BT0681 Homo 
Hs.148492 ESTs 
Hs.173220 ESTs 

gb:RCO^0299-ig0100412-e1O ST0299 Homo 
H8.1695 matrix metailoproteinase 12 (macrophage 
Hs.146151 ESTs 
Hs.161160 ESTs 
HS.86B7 ESTs 

Hs.1 2531 6 ESTs. Weakly dnillar to S33990 finger pr 
Hs.114889 ESTs 

Hs.288582 ESTs, Weakly similar to ubiquitous TPRm 

Hs.2936 malrKmelai]opro!einase13[coltogenase 

H$.336901 rtboson^al protein S24 

Hs.133554 ESTs. Weakly similar to Zigs_HUl«AN ZINC 

Hs.31028 ESTs 

Hs.272151 ESTs 

Hs.42612 ESTs. Weddy similar to ALU1J<UMAN ALU S 
Hs.292036 ESTs, Weakly similar to B34087 iiypolheti 
Hs.15536 liypothetical protein DKFZp761J139 
Hs.75431 fibrinogen, gamma polypeptide 
Hs.22162 zinc finger pratetn 23 (KOX 1 6) 
Hs.238809 ESTs 

Hs.310235 ESTs, Weakly similar to 176885 sertoe/lh 
Hs.65551 Homo sapiens, Similar to DMA segment. Ch 
Hs,122967 telch (DrosophllaHike 2 (Wlayven) 
Hs.186547 KIAA1374 prolan 
Hs.29263 hypoOietlcal protein FU11896 
Hs.161304 ESTs 

Hs.29202 Gprot^nKxnjpled receptor 34 

Hs.155591 (brkheadboxFI 

Hs.190495 ESTs 

Hs.268840 ESTs 

H5.274127 CLST 11240 protein 

Hs.268526 Homo sapiens chromosome 21q22.1 anonymoo 

H$.82238 P0P4 (processing of precursor . S. cerev 

H8i534 Hom08aptonscDNAFU12981fi8.cton8Kr 

Hs.82124 lamfaitn. betel 

Hs.97600 ESTs 

Hs.80261 enhancer of fibmenteGon 1 (cas^ do 
HS.23405B ESTs 
Hs.130093 ESTs 

gb:PMO.BT0340^1299^-«11 6T0340 Honx) 
K1AA1013 proton 

ESTs. Moderately similar to S65657 alpha 
Homo sapiens cDNA: aJ23155 fis. ctone L 
ESTs 
ESTs 



Hs.96427 
Hs,159153 
Hs.96667 
Hs.120189 
Hs.141566 
Hs.197645 

Hs.98502 

H3.165893 

HS.132B21 

H8.61438 

Hs.124776 

Hs.192435 

Hs.220491 

HS.1S2428 

Hs.125247 

HS55043 

H5^9478 



ESTs 

gb:RC2-ST0168^7129W)13^5 STD168Homo 
hypothetical protein FU14303 
gb:Hum8n nonspecific crossreaetlng an6g 
ESTs 

flavin containing monooxygenase 2 
ESTs 

Homo sapiens mRNA; cDNA DKFZpS64N1 116 (f 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FU1 3277 8s, ctone OV 
ESTs. W^ simaar to PC42S9 feniQn 
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13 

tz 
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Z3 
Z3 
2.3 
2.3 
Z3 
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Z3 
Z3 
Z3 
2.3 
ZZ 
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2.3 
2.3 
13 
13 
13 
13 
"13 
13 
13 
13 
13 
13 
13 
13 
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13 
13 
13 
13 
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428976 

458925 

440348 

436340 

444190 

438462 

411124 

442138 

412505 

416236 

423582 

453901 

418565 

433404 

409517 

439871 

4W641 

449276 

436547 

437770 

409064 

442607 

449869 

422108 

418251 

43200S 

413638 

415980 

449232 

430882 

454389 



400238 

404488 

407809 

412303 

420478 

422711 

424073 

426567 

435708 

441417 

445117 

447197 

434228 

445527 

445280 

420653 

419926 

447541 

424408 

411893 

426192 

435834 

437637 

438018 

446164 

450232 



AI476429 

BE31S170 

N5ni3 

AA248998 

AL037824 

R15891 

AW015802 

R42246 

AI878916 

Ar624122 

AW196937 

AA445973 

AA974491 

AW99400S 

BE000831 

BE065902 

AK001529 

T32982 

X90760 

R88516 

AI245987 

AW241510 

AJ297351 

AA767881 

AA062954 

AAS07576 

W57990 

AA297914 

AAB32123 

AA524190 

H71252 

R52414 

AW192780 

BE174240 

AW752571 

W05391 



K3.19238 

Hs.8087 

K3.260899 

Hs.173044 

Hs.194695 

Hs.281587 

Hs.47023 

Hs^1606 

H5.10526 

H&a9578 

Hs.53929 

Hs.13303 

Hs^1734 

Hs.337534 

H3.23837 

Hs.86149 

Hs.102720 

Hs.120036 

Hs.46736 

Hs.149442 

Hs.252713 

Ks.30824 

H5.122897 

Hs.141883 

H5^88361 

Hs,60059 

Hs.i11749 

Hs.177723 

Hs.120777 



Hs.196080 
Hs.79024 

H&83S23 



402745 
434008 
439492 
436853 
417648 
427690 
414217 
450229 
400756 
408447 
403388 
433643 
442078 
455685 
432242 
439920 
432436 
429493 
42S55S 
450101 



AW082279 

AW936336 

AA521259 

060641 

U03493 

AA381579 

A13S2949 

AI733297 

A1208754 

R36075 

Z42047 

W39694 

AW0S5063 

AI224532 

AW900992 

AK000288 

AI754813 

R82845 

AA424051 

T82384 

AJ003029 

AK001160 

AW273539 

BE30081S 

AR)86S34 

AA740878 
AR)86310 
BE328074 
R06552 
A1253134 
AI309298 
R18717 



EST8 

NAG^ protein 

ESTs. Moderately similar to ZN91_HUMAN Z 
ESTs. Weakly ^lar to 138022 hypoiheli 
ras homolog gene family, iromber I 
Human (done CT&A4) mRNA sequence 
ESTs 
ESTs 

cysteine end glycine^ protein 2 
general transalpSon factor ItK potype 
ESTs. Weakly similar to ALUS J<UMAN ini 
Homo sapiens cDNA: FU21784 fis. done H 
ESTs 
ESTs 

Homo sapiens cDNA nJ11812 fis, ctone HE 
gb:RC2-BT0318-150200-011-b09 BT0318 Homo 
ptwsphoinosttd d-phosphate^nding prot 
ESTs 

troponin li cardiac 
hypothetical protein FU23476 
ESTs 
ESTs 

teudne zipper transcription eactor#e 

ESTs 

ESTs 

Homo sapiens cDNA: RJ22696 fis. done H 
Homo sapiens cDNA FU11476 fis, done HE 
postmetoQc segregaBon Increased (S. ce 
ESTs 

ESTs. Weakly similar to ELIXHUMAN RNA P 
gb7Sl2h12.&1 Soares fetal Hver spleen 
gb7g80l)05.r1 Soares Infant brato INIB H 
ESTs 

heterogeneous nudear dbonudeoprotsin 
gb:IL3^T0213.1701004)55.F02 a0213 Homo 
nudear receptor subfamily 1. group I, m 



Hs.244106 ESTs 

gb:QV4-Dr0021-28129W)70fl11 DT0021 Homo 

Hs.193796 ESTs 

Hs^1739 Homos8pien5iiiRNA;d3NADKFZp56611518(f 

Ks.138959 gapjundion proleln. dpha7, 45kD (con 

Hs.182962 ESTs 

H8.75169 ESTs 

Hs.144474 ESTs 

Hs.147369 ESTs 

gb:yh88bOU1 Soares placenta Nb2HP Homo 

Hs.283978 Homo sapiens PR02751 mRNA. complete cds 

Hs.83288 ESTs. mOif slmOar to S14747 spMngomy 

Hs.306088 v-crk avian sarcoma vims CT10 oncogene 

Hs.88550 ESTs 

Hs.93796 DKFZP5B5D2223 protein 

Hs.16800 hypothetical protein FU202&1 

H5.146428 collagen, type V. alpha 1 

Hs.273789 ESTs 

H3.304742 ESTs 

gb:yc14fD5j1 Stratagene lung (937210) H 

Ks.65792 syntrophin. gamma 2 
Hs.5999 hypothsttoa! protein FLJ10298 
HS.2887S0 hyfioiheflcal protein FU23577 
Hs^1328 ESTs 

H8.187S61 ESTs. Moderately similar to ALU1.HUMAN A 



H5.112982 ESTs 

HS.1031S9 ESTs 

Hs.146661 ESTs 

gb:yf098l2j1 Soares fetal Rver spleen 

Hs.283410 ESTs 

Hs^79898 Homo sapiens cONA:FU23165 lis. done L 

Hs.8929 hypothetical protein FU1 1362 

AK002089 HS.4S080 Homo sapiens CDNAFU112Z7 fis. done PL 

A1821787 HS.1795B6 ESTs 

AW268583 Hs.262629 ESTs 

BE066976 ob:PMMT0340.21129M03<12BT0340Homd 

AW022715 Hs.162160 ESTs, WfeaMy similar to AUMJiUMANAUUS 

K05430 Ks.288433 neurotrimin 

AW300248 Hs.1 81693 ESTs 

AL134708 HS.1459S8 ESTs 

AA359291 Hs.130767 Homo sapiens cONA:FU23553 fis. done L 

AV649969 Hs.24385 Human hbc647 mRNA sequence 
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442757 W739S28 
430212 AA4591S3 
437146 M730977 
432101 AI918950 
459644 

453687 BE564037 
431170 AW971246 
428062 AA4206B3 
443682 AI383061 
400441 M15530 
453874 AWS91783 
425810 AI923627 
433037 NM.014158 
407162 N63855 
441826 AW503603 
446901 AI347274 
454766 AW866497 
414221 AW450979 
459608 AL119471 
400639 
406149 • 
424027 AW337575 
427531 AA405097 
448353 BE407125 
417669 T99B98 
449650 AF055575 
452335 AW188944 
419216 AU07671S 
447748 AI422023 
403534 

410594 AW770778 
438550 AW976002 
439626 N22415 
444540 A1693927 
450024 AA005129 
450221 AA328102 
439443 AF086261 
418824 AW751661 
451273 NM-014811 
430607 AWg73521 
432702 AW973953 
414195 BE263293 
425570 AA3595S8 
414935 C15871 



Hs.28345 ESTft 

Obw67f04^1 Na.CGAP.Pr1 Homo sapiens 
gtnmSSfDS^I NCLCGAP_Ew1 Homo sapiens 

Hs.123642 EpliA3 

Hs.36237 bypolheficalpralein 
Hs.291022 ESTs 

Hs.98321 hypoUiete^ proton aJ14103 

H5,47248 ESTs, Highly similar to similar to Cdc14 

HSS9879 &«en growth factor 1 (121(0} 

Hs^131 GoUagen, type XIV, 8llpha1(undufin) 

Hs^1903 ESTs 

Hs,279938 HSPC067 protein 

H$.142634 zinc finger protein 

Hs.129915 phosptiotriestsrase related 

gb:tc05d02jc1 Na_CGAP_Co16 Homo sapiens 
gb:QV4-SN0024-170400-17&e07 SN0024 Homo 
gb:U141-BI3^a-a-124)4J)^1 NCLCGAP.Su 
gb:DKFZp761IM141.r1 761 (synonync hamy2) 



Hs.201591 ESTs 

Hs.97957 ESTs 

Hs.231510 ESTs 

gb:ye68g01 j1 Soares fetal Over sfrieen 

Hs.23838 caAdum channel. vdltage^pendent,Lty 

Ks.61272 ESTs 

Hb.164021 small inducible cytokine subfamily B(Cy 

Hs.161338 ESTs 



453153 

430832 AI073913 

439867 AA847510 

419780 AA713522 

433420 A1674093 

434690 AI867679 

436572 AA723274 

447044 AR)30107 

431688 AA513906 
403133 

414885 AA157531 

432111 AW972777 

410073 AW408163 
AI792798 



429525 N92540 

446565 013757 

422385 AF105374 

406687 M31126 
403378 

431369 BE184455 

438560 AA811262 

409191 AW818390 

412282 BE160188 

411966 AA099113 

443915 Z40763 

427785 X81053 

446094 AK001760 

436486 AA742221 

411139 AW819461 

409070 AA063003 

432713 AL11B623 

419384 AA490866 

416858 AWg61605 

408444 AW661639 

432128 AA127221 

418027 AB037807 

446060 Z42061 

436198 AK001084 

411987 AA37S975 



Hs^81238 
Hs.256402 
Hs.189080 
Hs.265165 

Hs.24641 

Hs.127892 

Hs.53542 

Hs.26163 

Hs^47324 

Hs.293744 

Hs.89805 



Ks.24360 

Hs.100686 

Hs.161292 

Hs.87752 

Hs^93961 

H8.148410 

Hs.279596 

Hs.17165 



ESTs 
ESTs 
ESTs 
ESTs 

gb:zh90h0Bwr1 Soares_felaUiver.s|rieeiL 
cytoskeleton associated protein 2 
ESTs 

choreoacanlhocytosis gene; KIAA0986 prot 
K1AA0649 gene product 
mitochondrtai ribosoma) protein S14 
ESTs 

choBnergic receptor, nicotinic, alpha p 
gb:EST6d590 Fetal lung II Homo sapiens c 
gb:C15671 Qontech human aorta polyA-*^ mR 
ESTs 

ESTs. We3k)yslmnartoJ£0350 Anterior 

ESTs 

ESTs 

ESTs. Moderaiely similar to putaiWe DMA 

ESTs 

ESTs 

regulator of &proteIn signalfing 13 
gb:ng67c064l Na.CGAP.Up2 Homo sapiens 



H5269276 ESTs.Moderalely8briilartoS658578lpha 

gb:EST384871 MAGE resequences. MAGL Homo 

Ks.5848e cater^ (cadhedn-assodated protein), a 

Hs.1 2496 ESTs, Wealdy sirrto to ALU4_HUMAN ALU S 

Hs.205353 ectonucteoside triphosphate diphosphohyd 

Hs^l 1 phosphoribosyt pyrophosphate amtdotransf 

Hs.1 15830 heparan sutl^ (gtooosanwe) 3-0<ulfot 

Hs.272620 pregnancy specific beta-l-glycoprotein 9 

Hs.251754 secretory leukocyte protease inhlbilor ( 

Hs.299202 ESTs 

Hs.175613 homotog of Xenopus Ctaspin 

gb:QV1-HT0413^10200'059^ HT0413 Homo 

Hs.118609 ESTs 

Hs.135292 ESTs 

Hs.180828 collagen, type IV. alpha4 

Hs.13801 K1AA1685 protein 

Hs.120533 ESTs 

gb:RC&^0293^299-031-G03 ST0293 Homo 

Hs^4S60 ESTs 

Hsi9494 PR01912 protein 

H5:39429 ESTs 

Hs.21 145 hypothetical protein RG083M0S.2 

Hs^53204 ESTs 

Hs.1 17037 ESTs 

Ks.83293 hypotheficdl protein 

Hs.27004 ESTs 

Hs.333498 Homo sapiens cDNA 1^10222 lb. ctone HE 

H5.183380 EST8.Moder8tdysintilartoALU8.HUMANA 
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443401 


AI394067 


Hs.160159 


424665 


AW368576 


Hs.139851 


416143 


Ai9556S0 


Hs.79033 


426261 


AW242243 


Hs.168870 


414564 


AA164803 


Hs.71994 


427897 


NN^017413 


Hs.303084 


419160 


AA911342 


Hs.35524 


420111 


AA2SS6S2 




442879 


AF032922 


Ks^13 


430486 


BE062109 


Hs.241551 


45^23 


AL137967 




432074 


AA525248 


Hs. 149723 


431848 


AI3768S7 


Hs.126758 


447072 


D61594 


Hs.17279 


409723 


AW88S757 


Hs.257862 


425627 


AF019612 


Hs.297007 


435090 


BE217923 


Hs.149595 


449369 


AAa01256 


HS27280 


4255U 


AF112345 


Hs.158237 


455821 


BE143341 




427224 


ALt35554 


Hs,101937 


432284 


AA532807 


Hs.105822 


403467 






436032 


AA150797 


Hs.109276 


404356 






434205 


AF1ig861 


HsJ»3032 


405257 






402103 







456649 R74441 

432985 T92363 

417649 AW2392B5 

431277 AA501806 

454056 A)368B38 
401694 

423531 AW752782 

431364 AW971362 

445908 R13580 

448390 AL035414 

449939 T88420 

455678 BE086007 
404555 

418186 BE541042 

419981 AAB97581 

449581 A)989517 

419229 A1827237 
403691 

423728 AW891294 
443479 AF027219 
425329 AI961644 
453345 AA302862 
424335 AW021508 
451072 AA013451 
417845 All 17461 
411571 AA122393 
436035 AA938198 
432374 W68815 
400241 

408908 BE296227 
409564 AA045857 
411151 AW866497 
446009 AI989885 
Ara3552B 



416441 BE407197 

414891 R27124 

430172 AA468591 

422109 S73265 

422S85 AU076411 

448552 AWg73653 

422373 AK001843 

450726 AW204600 

438379 N23018 

412608 AA247995 

451270 AW341392 

412140 AA219691 

415992 C05837 

415533 T74009 

437132 AL049353 

415304 T66079 

450152 AI138635 

421147 AW592167 

421413 A1826128 

451750 AA2B0851 

417924 AU077Z31 



EST5 
C3veofin2 

glulaminyl-peptide cydobansferase (glu 
penudsoflial fam^yiated protein 
EST&. Weakly shnHar to 138022 hypothafl 
epeTm; peptide Qgand for APJ ceo^ 
KIAA1559 protein 

gbcz821hlU1 Na.CX^.GCBI Homosaptens 

syntajdn binding protein 3 

chloride chann^ calchjm activated, (am 

Qb:DKFZp761D2315j1 761 (synonym: hamy2) 

ESTs 

EST8, Highly st njtaf to A F1 75283 1 zinc 
^ftosyipnildn stdtotransferase 1 
ESTs 

memtnanfr'tiound franscf^fion (actor prol 

ESTs 

ESTs 

Inlegrin, alpha 10 

gb:MR0-HT0162-1910994XKNI04 HT0162 Homo 
sine ocuGs homeQbox(Diosophaa) homoto 
ESTs 



H3.1 17178 poly(A)'l»nding protein, nudear 1 

Hs.178703 ESTs 

Ks.82359 tumor necrosis factor receptor superfami 

Hs^49965 ESTs 

HS24808 ESTs. Weakly simBar to 138022 hypdheti 

HS.1297S0 hypothetical protein aJ10546 
Hs.294016 ESTs, Moderately sinular to B34087 hypot 
Hs.13436 Homo sapiens clone 24425 mRNA sequence 
Hs.21 068 hypothetical protdn 
Hs.272139 ESTs 

gb:RC3^0319-120200^14<i09 BT0319 Homo 

Hs.23240 Homo sapiens cDNA: FU21848 lis. done H 

Hs.128773 ESTs 

HS.18160S ESTs 

H&282884 ESTs 

K&l 321 38 solute canler family 4, sodium tiicarfaon 

Hs,9443 zinc finger protein 202 

Hs.145444 Homo sapiens cDNA aJ11494 lis, clone HE 

Hs.90063 neurocaldn delta 

Hs^B170 ESTs 

Hs.l 17929 ESTs 

Hs.82719 Homo sapiens mRNA; cDNA DKFZpS86F1822 (f 

H3.7081 1 hypoiheticdi protein FU20516 

Ks.146123 pdytA} polymerase gamma 

Hs.301885 Homo sapiens cDf<AFU11346fi8.ctona PL 

Hs.250822 serine/threonine kinase 15 

Hs54943 fradurecailus 1 (rat) homoiog 

gb:QV4-SN0024-l7040D>17&e07 SN0024 Homo 

Hs 231926 ESTs 

Hs!l53863 MAO (mothers against decapentapiegic, Or 

gb:601301552F1 NIH J^GCJI Homo sapiens c 

Hs.268754 Homo88|^enscDNAFU11949lis,ctoneH£ 

Ks.161889 ESTs 

Hs.1473 gastrin-releasing peptide 

Hs.1602 dthydropyrtmidine dehydrogenase 

Hs^l04 hypothetical protein FU00052 

Hs.l 15700 Homo sapiens cDNA:FU23515fis. done L 

Hs.250505 reOnoic add receptor, alpha 

Hs.171391 Otemiinal binding protein 2 

Hs.44898 Homo sapiens done TCCCTA00151 mRNAsequ 

H8.235795 ESTs 

Hs.73625 RAB6 interacting. klnesiMike(rabkInes 

Hs.145807 hypothetical protein FU 13533 

Hs.268738 ESTs. Weakly similar to ALU7 JiUMAN ALU S 

Hs.297268 ESTs 

Ks.12799 ESTs 

Hs.22^ Homo salens dons IMA6&451939. mRNA se 

Hs.293299 ESTs 

HS.S7637 ESTs, Weakly similar to A493G4 59 protd 

HS28930 ESTs 

Hs.82932 cyt6nD1{PRAD1: parathyroid e 
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406945 K01383 
452449 AW068658 
436009 H57130 
418637 T86737 
420495 A{338247 
449203 AI634578 
437751 AA767373 
428412 AA428240 
425707 AF115402 
441967 AA972742 
403317 U02687 
406018 

410566 AA373210 
416747 AW876523 
431229 AM96479 
435148 A1918049 
436349 AW45255 
446895 AA1 66655 
448562 AI538880 
442762 AF035119 
442738 AWQ02370 
451874 AW983137 
448076 AJ133123 
443484 A10914S8 
430686 NfL001942 
446100 AW967109 
407624 AW157431 
435079 AA664192 
428923 BE047698 
422496 AA311301 
413385 M34455 
408418 AW963897 
446733 AA863360 
427434 BE538374 
428822 W2841B 
459325 AyV088369 
416996 W91892 
425638 NM.012337 
408000 LI 1690 
418894 W73921 
424687 J05070 
437378 AI198823 
454100 AI693231 
452785 R61362 
437311 AA370041 
400631 AF173337 
440028 AW473675 
426490 NM-001621 
424103 NAA.00191B 
407995 Ai094748 
449911 AI262106 
449509 AA001615 
452762 AW501435 
422839 AI674784 
435040 AI932350 
401200 

416248 H99169 
442262 BE170651 
419754 H0082O 
453908 AW613920 
446965 BE242873 
412798 AW998657 
416085 H18072 
418378 AW962081 
455995 BE179408 
422411 AW749443 
410888 AW861207 
446893 Ai610818 
442992 AI914699 
407021 U52077 



Hs.173451 metaBolhIoRein1A(funcG^ 

Hs.20943 ESTs 

Hs.120925 ESTs 

Ks.193538 ESTs 

HsJ8314 Homo sapiens mRNA;cONADKFZp586lD120(f 

Hs.282121 ESTs 

Hs.35669 EST5.ModerateIyslnu!artoALU1JHUMANA 

Hs.12fi083 ESTs 

Hs.11713 E74^ factor 5 (eb domain transcripl 

H$^09786 ESTs 

Hs^ fms-ielatedtyfQSino kinase 3 



AW13g680 

433194 AB040883 

454790 AW820B52 

431130 NM.006103 

434739 AAB04487 
406468 

457023 AA001732 

416226 N5S342 

422306 BE044325 

432810 AA863400 

412894 R09778 

430602 D13752 

436981 AA740891 



Hs.43047 Hoino sapiens cDNA FU13565 (is. done PL 

HS.1S929 hypoUteticai protein FU12910 

Hs,259929 ESTs 

Hs.124961 ESTs 

Hs,115315 ESTs 

Hs.2a2B03 ESTs 

H5.94612 ESTs 

Hs.8700 deleted in liver cancer 1 

Hs.131055 ESTs, WeaWy similar to NPM_HUMANNUCLEO 

Hs.194233 ESTs. Moderately similar to ALU1_HUMAN A 

Hs.20196 adenylate cyclase 9 

Hs.134559 ESTs 

Hs^633 desmoglein 1 

Hs.13804 hypothetical protein dJ462023.2 

Hs.248941 ESTs 

gb:ac05t)03.8l Stratagene lung (93721Q) H 
1^.188785 ESTs 
Hs;i78827 ESTs 

Hs.840 indoleamlne^)yrro{e2.3diaxygenase 

Hs.44743 KIAA1435 protein 

Hs.26040 ESTs, WeaWy sintilar to fatty acW omega 

Hs.301732 liypothelica] protein MGCS306 

Hs^0715 potassium voUage-gated channel, lsi(^ 

Hs.282184 ESTs 

Hs.59609 ESTs 

HS.1S8450 nasopharyngeal epitheGum specific prote 
Hs.620 txinous pemphigoid antigen 1 (230/2401(0) 
Hs.50743 ESTs 

Hs.151738 matj1xmetalloprot^ase9(ge!alin8seB 
Hs.160473 ESTs 

Hs.1 26043 chromosome 21 open reading frame 51 
H3.106642 ESTs. Weakly similar to T09052hypolhe0 
Hs.9456 SWUSNF related, matrix associated, acti 
Hs.10g494 secreted proteb of unknown function 
HS.12S843 ESTs. Weakty similar to T17227 hypolheti 
Hs.170087 aiyl hydrocarbon receptor 
Hs.139410 dlhydroTipoamide txanched chain Iransacy 
Hs.100134 hypothetical protein aJ12787 
Hs.12653 ESTs 
H8.84561 KTs 

H5.27B582 v^akt murine thymoma Vbal oncogene homo 
Hs.298908 ESTs 
i{s.152B25 ESTs 

Hs,23450 mHochondrialribosomal protein S2S 

Hs.8700 deleted in Over cancer 1 

Hs.30977 ESTs, Weakly simQar to B34087 hypoM 

H8.2B2178 ESTs 

Hs.16677 WO repeat domain IS 

Hs.119120 E3ut)kiuiiinliga5eSNAiRF1 

HS.92S76 ESTs 

gb:EST374154 MAGE resequences, MAGG Homo 

Ob:ll^4fT06l&^00-125'B07 HT0618 Homo 
Hsi2511 ESTs 

gb:RC1-CT0302-1202004)13^04 CT0302 Hbmo 
Hs.7110 ESTs 
Hs.13297 ESTs 

gb:Human madnerl transposase gene, comp 
Hs.161393 ESTs 
HSJ3243 K1AA1450 protein 

gb:RC2-ST030M20200^1412 ST0301 Homo 
Hs^19 epUidymfs-specHicwhey-ackficpralBin 
Hs.144130 ESTs 



Hs.173233 hypothetical protein RJ10970 

Hs.34372 ESTs 

Hs.227280 U6 snRNA-associated Snvtike protein 

H8^054 ESTs 

Hs.ie6510 ESTs 

Hs.184927 cytochrome P4SD. subfamily MB (stenM 

Hs^3316 ESTs 
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452501 
449838 
447160 
422156 
440137 
423938 
436471 
414680 
426782 
430027 
411562 
413252 
427115 
444610 
400451 
435255 
450159 
418375 
442835 
400196 
415734 
415189 
438940 
425349 
448515 
410557 
442562 
413488 
419088 
447373 
457465 
413918 
402820 
424872 
428552 
435464 
449008 
420838 
428231 
434933 
444870 
425354 
429183 
439155 
442787 
423864 
438563 
437140 
421991 
446534 
407881 
444838 
402318 
410878 
414494 
421306 
427027 
429088 
429859 
428060 
419953 
443718 
444187 
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420195 
417404 
442833 
430335 
414618 
434029 
410945 
421247 
433374 
445644 
450271 
448084 
407884 
430998 
435542 
443544 
421103 
405759 
446242 



AB037791 
AB020653 
AA330310 
N34524 
AA866199 
H29138 
AA719813 
AA743331 
R14614 
AB023197 
AL050201 
BE074910 
AW972853 
AI174783 

W87434 
AI702416 
N1A.003061 
AI021989 

NM_014747 
L34657 
AF075045 
AA425234 
K68441 
AA085803 
BE379584 
BE144017 
AI538323 
AI38ig22 
AW301344 
AW015698 

AA347923 
AW274560 
BE548300 
AW578003 
AW1 18210 
U17989 
R91095 
AS200621 
U62027 
ABQ14604 
H81076 
W93048 
AA460039 
AA810665 
AA312799 
NM^014918 
AI307356 
AW072003 
AV651680 

AW809201 
AA768491 
AAB06207 
A)924294 
D61542 
NM>007050 
AM20616 
BE267154 
AI083580 
AW138466 
AA705745 
N44348 
N1UL007350 
AA328153 
080007 
AI204600 
AA621763 
AVV811552 
BE391727 
AJ82140g 
R77766 
AI693900 
AI467800 
AF069291 
AF128847 
AA687376 
AI076315 
AI625835 



Ks.29716 hypothetica} protein FLJ10980 
H$^4024 K1AA0846 protein 
Hs,24181 ESTs 

6b:yy56d10Al Soar8S.muIIIpla.sclerosb. 
Hs.171397 ESTs 

Hs.157113 coenzyme 0.7 (ret, yaasOhomotog 
H8.117662 ESTs 
Hs^2572 hamoglot)In.Btpha2 
Hs.191254 ESTs 
Ks.227743 K)AA0980 protein 
H5.70769 hypothetical protein DKFZp586E1923 

gb:RC5-BT0580-170300-021.F12 BT0580 Homo 
Hs.1 12237 ESTs 

9b:HA2S01 Human feUI Over cONA libraiy 



Hs,105015 ESTs. Moderately slmBar to ALU1_HUMAN A 

H5;200771 ESTs, Moderately similar to A Chain A. T 

Hs^389 8ynaplDsomai«sociated protein. 25liO 

Hs.131903 ESTs 



HS.7B748 KIAA0237 gens product 

Hs78146 ptatelet/endothetial ceii adhesion molec 

Hs.271609 ESTs 

Hs.79886 ritx)se5i)hosphateisonieiaseA(rfbose5 

Hs.13528 hypothetical protein RJ14064 

Hs.192997 ESTs, Moderately similar to 178885 serin 

Hs.34789 dortdTy^hosphoatigosaccharidei)rotei 

0b:MRO-HTO165-191199mdO9 HT0165 Homo 

Hs.52620 lntegitn.t)eta8 

Hs.158781 ESTs 

Hs.122g08 ONA replicaiion factor 

H8.71245 ESTs 

gb:EST54302 Fetal he^ II Homo sapiens 
Hs.129$20 ESTs 

Hs.192999 ESTs. Moderatety similar to KIAA0961 pro 
Hs.22826 tropomodulin 3 (ubiqullous} 
Hs.5244 ESTs 
Hs.183105 nuctearautoantioen 
H8.4276 KIAA1701 protein 
H8.14B504 ESTs 

Hs.155935 compiementcomponentSareceptorl 
Hs.197g55 KiAA0704 protein 
Hs.269001 ESTs 

H8.250723 hypothetical protein MGC2747 

Hs.286 libosoroal protein L4 

Hs.1 34746 ESTs. Wealdy simDar to A46010 X-Onked 

83.283689 activator of CREM In tes& 

Hs.110488 t<IAA0990 protein 

Hs.175225 ESTs 

Hs.40g68 heparan sulfate (glucosamine) 3-0-sulfot 
H5.208558 ESTs 

H8.314248 ESTs, Wealdy simnar to ALU4_HUMAN ALU S 
Hs.6783 hypolhe^caiprotainFU22724 
Hs.125889 ESTs 

Hs.173259 uncharacterized bone mamw protein BM03 

H5^27716 KIAA0934 protein 

Hs.225952 protein tyrosine phosphatase, reoeptort 

Hs,249483 ESTs 

Hs.125752 ESTs 

Hs.221373 ESTs 

Hs.151274 ESTs 

gb2f41b1U1 Soares_felfiLheartJJbHH19W 
Hs.26243 Homo sapiens cONA FU11177 Us, done PL 
Hs.82101 plecl<strinhomotogy-Iil(e domain, family 
H$^8201 EST8.V\fealdy8imiiartoAChainACiy8t 
H8^39499 KIAA0185 protein 
H3.98978 hypothetical protein MGC10764 
Hs.170434 Homo sapiens cDI^ FU14242 fis. done OV 

gb:QV2-ST0145^71299^17'h10 ST014S Homo 
Hs.102910 general transcr4>0on factor UN. polype 
Hs.332789 EST 

Hs^71593 ESTs. Moderalety similar to A47582 B<ei 
Hs.200920 ESTs 

Hs.271000 ESTs. Wealdy similar to 138022 hypothett 
Hs.40539 chromosome 8 open readtog frame 1 
Hs.204038 fndotethylamineN^thytliansferase 
Hs.269533 ESTs 
Hs.16359 ESTs 

Hs.27104 Homo sapiens mRNA; cONA DKF2p6670226 (Ir 



N66336 



Hs.7360 ESTs 
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457938 AI373538 Hs.133900 ESTs 21 

433017 Y15067 Hs^79914 zinc finger protein 232 2.1 

438729 BE621807 Hs.3337 traismemixene 4 superfamOy member 1 2.1 

. 432B39 AA57946S Hs.45207 hypotheGcd protein KIAA1 335 2.1 

5 439224 AW471088 Hs.145950 EST&, Highly sbiiBar to T08892hypolheti 2.1 

410976 R36207 H5.25092 hypothsGcsl protein MGC1 0744 2.1 

454574 AW809109 gb:MR4^T0117-070100D27.804ST0117Homo Z\ 

411020 NM_006770 Hs.67726 macro^Aage receptor wilh collagenous str 2.1 

452279 AA2B6844 Hs.61260 hypolheOcai protein FU 13164 2.1 

10 446891 AL036877 Hs.282878 ESTs Zl 

434294 AJ27137g Hs.76194 ribosomal protein S5 Zl 

449057 AB037784 Ks.22941 KiAA1363 pfOtebl Zl 

432769 AA520814 Hs.144959 ESTs Zl 

441224 AU076964 H3.7753 cAimertn Zl 

15 407891 AM86620 Hs.4113S endomucin-2 Zl 

429017 AA463605 Hs.66295 mutfi-POZ-domain<»nlaining pioteh Zl 

405817 A!936028 gb»vo47a09jc1 NCLCGAP.Gas4 Homo sapiens Zl 

430566 AA481282 Hs.190149 ESTs Zl 

449208 AW263535 Hs.48643 ESTs Zl 

20 451397 AA017432 H3.84S29 ESTs. Weakly similar lo Z20Z.HUiyiAN ZINC Zl 

452042 H38857 Hs.243901 Homo sapiens cDNAaJ20738 lis. done HE Zl 

444779 A1192105 H5.147170 ESTs ZO 

433672 BE281165 Hs.288038 TL&^odated serlne^ininB protein 1 ZO 

^_ 415954 AA171B50 Hs.42251 ESTs ZO 

25 420273 AI652864 Hs.197257 ESTs ZO 

411354 AW992424 Hs.288141 hypothetical protein M6C31 56 ZO 

422389 AF240635 Hs.115897 pioiocadherin 12 ZO 

446994 AV650435 Hs.16755 MBIPpiotdn ZO 

417793 AW405434 Hs.B2575 small nudear rttwnudeoprotein poiypept ZO 

30 422654 AA314316 Hs.16372S ESTs ZO 

42S999 AW513051 Hs.332981 ESTs. Wealdysln«Br to 138022 tiypotheH ZO 

405634 ZO 

451562 H04150 Hs.107708 ESTs ZO 

431958 X63629 Hs.2877 cadheiin3,type1,P^tiefinO>l3centa ZO 

35 422095 A1868872 Hs.282804 hypoUieltcal protein aJ22704 ZO 

442010 Ai032680 H5.132213 ESTs ZO 

442991 BE281238 Hs.8886 hypolhetics) protein aJ20424 ZO 

425312 AA354940 Hs.145958 ESTs ZO 

^- 415191 AA190381 H3.120810 ESTs ZO 

40 416406 D86961 Hs.79299 lipoma HMGtC fusion partner-Jike 2 ZO 

425316 AA354977 Hs,191565 ESTs. Moderalely similar to T1 4342 NS01 ZO 

413753 U17760 Hs.75517 taminin.l)eta3(nlC8in(125kO),kalinln ZO 

452241 AU)50204 Hs.28540 Homo sapiens mRNA:cDMADKFZp586F1223(f ZO 

433571 AA765256 Hs.135191 ESTs, similar to unnamed piotein ZO 

45 417094 NM_006895 H3.ei182 histamine N^ttiyftransferase ZO 

409190 AU076536 H3.50984 sarcoma ampGtied sequence ZO 

413783 AA314337 Hs.301547 ribosomal protein S7 ZO 

423867 AA331686 gb:ESnS757Embryo.8weel(IHomDsapien ZO 

^. 429418 AI381028 HS.118769 ESTs ZO 

50 445829 Ai452457 Hs.145526 ESTs ZO 

452366 AK0OO464 H3.29276 bypotheticat protein FU 20457 ZO 

425704 U79293 Hs.159264 Human done 23948 mRNA sequence ZO 

446593 W79572 Hs.13277 hypothetical protein FU220S4 ZO 

400462 ZO 

55 422003 AA361760 Hs.296326 ESTs ZO 

444585 AW170015 Hs,6594 ESTs ZO 

444898 AI201548 Hs.308338 ESTs ZO 

403525 ZO 

443031 AW134696 Hs.49418 ESTs ZO 

60 430816 AI311928 ob:qo89h04.x1 NCLCGAPJ0d5 Homo sapiens ZO 

423690 AA329648 Hs.23804 ESTs. Weakly similar to PN00998on3pfOt ZO 

440941 BE266352 Hs.7535 COBW-tike protein ZO 

409627 AW997628 Hs.313537 ESTs ZO 

433258 AI806626 H3.207300 ESTs, WeaWyslmflstoALUBJIUMANUIl ZO 

05 412863 AA121673 Hs.59757 zinc finger protein 281 ZO 

4^76 AA326108 Hs.3382g bHLH protein DEC2 ZO 

430259 BE5S0182 Hs.127826 RalGEF^e protein 3, mouse homotog ZO 

417280 AW173116 Hs.262206 ESTs ZO 

423528 AB011137 Hs.300938 KIAA0565 gene product ZO 

70 424800 ALJ035588 Hs.153203 MyoO family Inhibitor ZO 

446019 A1362520 Hs.279789 h'stone deacetylase 3 ZO 

435472 AW972330 Hs.283022 triggering receptor expressed on myetoM ZO 

424882 AI379461 Hs.153636 far upstream element (FUSE) binding prot ZO 

430473 AW130690 Hs.59962 ESTs ZO 

75 431363 M86528 KS.2669G2 neurotrophbi 5 (neurotrophin 4/5) ZO 

436118 AW753311 H5^9415 ESTs ZO 

400859 ZO 

405829 ZO 

^15258 AW752247 Hs^3853 ESTs ZO 

SO 420314 H81671 H8^0921 EST8,VfeaMyslmIlartoT226B8lvpoth66 ZO 

437733 Ai792574 KS.122B76 ESTs ZO 

452019 AL157503 HS.27S52 Homo sapiens mRKA;cONADKFZp566N2424(f ZO 

453118 AW195849 HS.25Z757 ESTs ZO 
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430705 
420568 
452759 
408496 
420674 
410358 
450088 
410853 
438607 
422232 
432801 
402490 
446551 
438315 
445261 
401986 
420335 



435413 
458175 
458433 
417494 
416045 
424181 
445873 
411578 
453116 
425692 
435608 
402102 
458308 
438177 
415205 
427244 
441384 
438979 
400285 
405966 
407407 
427739 
433584 
448956 
439474 
421755 
454566 
418836 
453108 
447101 
408873 
426226 
401157 
407721 
425646 
408964 
419839 
402855 
443644 
445672 
432343 
411941 
430664 
402812 
457434 
448133 
402048 
423347 
423957 
433347 
437373 
439217 
446609 



NVL003540 

F09247 

AW590773 

AI683802 

NW,000055 

AW975168 

AW016343 

H04S88 

AVyr080237 

043945 

NM.016260 

AI30B176 

R56795 

T79759 

AA258771 
AA164366 
AI267476 
AW296024 
AL1353S2 
A)369494 
M15990 
AL039482 
AA250970 
AW238524 
A)Z76680 
090041 
AW1 83971 



Affi28155 

BE32701S 

H71616 

AA402400 

AA447849 

AW976218 



Hs^4781 6 H4 histore family, member C 

Hs^47735 protocalheTin alpha 10 

Hs.258996 ESTs 

Hs.136182 ESTs 

HS.13Z7 butyfyfchofinestarase 

K5.13337 ESTs.Wealdysindtarlouimafnedprole!n 

Hs.233301 ESTs 

H5.30469 ESTs 

KS.2S28B4 ESTs 

Ks.113274 transcription factor EC 

K5^8963 zinc finger DNAblndino protein Helios 

H5.65838 ESTs 

Hs.82419 ESTs 

Hs.282g39 ESTs. Weakly similar to 138022 hypolheti 

Hs.43616 Homo sapiens mRNA for RJ00029 protein. 

Hs.151973 Ivpotheticdl protein RJ23511 

Hs.46669 ESTs 

HS.1S0434 ESTs 

Ks.255883 ESTs, Weaidy similar to 138022 hypotlwH 

Hs.222137 ESTs 

Hs.31403 ESTs 

Hs.142517 Homo sapiens mRNA; cONADKI=Zp434P0810(f 

Hs.251946 p6ly(AHiind'ing protein, cytO|dasmlc1<l 

gba(p27c05jt1 NCLCGAPJW10 Homo sapiens 

Hs.146086 ESTs 

Hs.155956 N-8cetyltrans(ei9se1(8iylamineN-aoeiy 

Ks.250896 ESTs 

Hs.211055 ESTs 

Hs.281391 ESTs 

H3.135233 ESTs 

• Hs.178045 ESTs 

Hs.268660 Homo sapiens cONA: FU22182 fis, done H 

Hs^2565 ESTs 



AF050198 

AW1967SS 

AW295399 

AKD01674 

A1824060 

AW169454 

AW807605 

AI655499 

A1311457 

N72185 

AL046017 

AA769045 

Y12735 
AW157044 
AF201468 
U24577 

At08049l 

AI907438 

N^L002960 

AW876909 

AWg69834 

AW628192 

AA723157 

A1660412 

AW978309 

AF023130 

H67505 

AF086041 

BE395090 



gb:Homo sapiens putative mitochondrial s 
Hs.98105 NYD-SP14 protein 

gb:UKH^I2-ahv-lv03-(UJl£l Na.CGAP.Su 
Hs^630 cofactor required for Sp1 transcrlptbna 
Hs.211501 ESTs 

Hs.207422 ESTs, Wealdy similar to S71949 melaiiopr 

gb:l\ilR4-ST0098-12010(M)01-b06 ST0098 Homo 
Hs.161712 ESTs 
Hs.99472 ESTs 
Hs.44189 ESTs 

Hs.182278 calmoduQn 2 (pi)osptu)rylase tdnase, dett 

gb:oa80ti074l Na_CGAP_GCB1 Homosaplens 

Hs^lO duaJ^pecificitytyroslneKYHjIujsphoryl 

Hs.1 58512 cydirHlependent kinase^Ilte 2 (C0C2-rel 

Hs.49349 beta^ APP-deaving enzyme 

Hs.93304 piiospttolipa5eA2.groupVit(piatels^ac 

Hs.93270 ESTs, Moderately similar to S65657 alpha 

H5.282862 ESTs 

Hs^961 S100 caicium-blnding protein A3 

Hs^5201 ESTs, Weakly similar to FAT DROME CADHER 

Hs.303303 ESTs 

HS.1B851 hypothetical protein FU10875 

Hs.73769 tolale receptor 1 (adult) 

Hs.234557 ESTs 

Hs.138235 Homo sapiens cDNA FU13542 fis. dons PL 
gb:Homo sapiens Ras-GRF2 mRNA, pariiat c 
Hs.191979 KIAA1733 protein 
Hs.42g75 ESTs 

Hs.15535 Homo sapiens done 24562 mRNA sequence 
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TABLE 38- List of accession numbers for primekeys laddng unlganelD's for Table 3A. Forsuch such probeset is listed a gene duster number from wtilch the digonudeotides were 
designed. Gene dusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were dustered based on sequence similarity using Oustenng 
and ADgnmenlToOls (DoubleTwist, Oakland CaTifomi^. Genbank aocessnn numbers for sequences comprising each cluster are listed in the 'Acoesston' column. 

Pker Unique Eos probeset MenbSer number 
CAT number Gene duster number 
Accession: Genbank accession numbeis 



PIcey CAT number Accession 
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408390 


107490.1 


409029 


1095422.1 


409073 


109851.1 


409519 


113722.1 


409695 


114876_1 


410008 


11681^1 


410615 


1212203J 


410672 


1214882.1 


410784 


1221005 1 


410785 


1221055J 


410888 


1225955.1 


410901 


1226077.1 


410934 


1227240.1 


410945 


1227561.1 


410999 


1228809 1 


411004 


1228975.1 


411018 


1229132.1 


411139 


1233781.1 


411151 


1234022.1 


411479 


1247077.1 


411514 


1248638 1 


411578 


125013.1 


411745 


1256160 1 


411880 


12631 10J 


411905 


1265181.1 


412167 


1280605.1 


412209 


1283610.1 


412248 


1285000J 


412282 


1287679.1 


412303 


1288130.1 


412566 


1306469.1 


412877 


1333953.1 


412879 


1334272^1 


413087 


1348720.1 


413136 


1350379J 


413252 


1355877.1 


413382 


1365954.1 


413468 


1373234.1 


413499 


1373910.1 


413618 


1380384.1 


413638 


1381238.1 


413642 


1381386.1 


413875 


1396766.1 


414109 


1417307.1 


414221 


142696J 


414605 


1465790_.1 


414935 


1509157.1 


415236 


1531080J 


415635 


1540853.1 


415747 


155189J 


415980 


1564218.1 


416288 


1585983.1 


416337 


1589299.1 


416441 


159480.1 


416548 


1600181.1 


416624 


1604694.1 


416871 


1626761.1 


416913 


163001.1 


416972 


163668_1 


417561 


168747_2 


' 417648 


1691070.1 


4WD09 




417859 


1703724.1 


418378 


174656.1 


416636 


177402.1 




4BC7CR 4 


419637 


186639.1 


419807 


188252.1 


420111 


190755J 


420621 


195111.1. 


420637 


195241.1 


421065 


198936.1 


422063 


210852.1 


422156 


212379J 


422342 


215498.1 


422429 


216469.1 


422977 


223410.1 


423121 


225175.1 



AW854542AA056567AW882724 . 
BE(B7807AW301579 
AA063458 AA06301 8 AI444822 
AA075368AA075359 

AA296961 AA296889AAO76945AAO77528AA077497 
AA079552 BE142525 BE142527 
AW772721AW873372H89212 
AW794600AW794730 
AW803201 8E079700 BE062940 

AW803341 AVV803265AW803403 AVV803466 AW803402AW803413 AW803268 AVV803396 AVV80333^ 
AW861207AW809509 

AVraiOOOl AW810092 AW81O170 AVV809884 AW809664 AW810353 AW810428 AVV810209 AVV8^^ 
AW810006AW609672AW809694 AW810552 AW810345 AW810432 AW809960 
AW811114 AWB11095 AW811087 AW811124 AW81 1054 AW81 1094 AW81 1 157 
AW81 1552 AW811470 AW811523 AW81 1482 AW81 1514 AW81 1456 AW81 1 525 
AW813004 AW812962 AW812961 

AWB13242 BE146089 AW813195 AW813173 AW813206 BE145953 BE146212 AW813196 AW854582 AW813241 BE061582 
AWB13428 AW813444 AW813367 AW813368 AW813429 AW813424 
AW819461 AW819478AW819479AW819559AW819S05 
AW866497 AW819775 AW819868 AW866602 AW866561 

AW848047 AW848202 AW848631 AW848142 AW848702AW848121 AW848632 AW84B140 AWB48571 AW848009AWB48067 AW848069 

AWB48g05AW848214 

AW850178AW850233AW850445AW850446 

AW238524 AA094059 RS9022 

AW867826AW859896 

AW872477 BE088101 T05990 

BE265067 BE264978 AW875420 

AW897230 AW897252AW897244 AW897231 AWB97263 

AWg01456 AW901450 AWg01441 ^ 
BE176480 AW903298 AW903313 

BE160188 AWg35785 BE160401 BE160319 BE160313 BE160395 

AW938336AW93633g 

AW952574BE073261 

BE01 1 168 BE006371 BE01 1 172 BE01 1313 BE01 1 166 
BE092219 BE092361 6E0(^789 
BE064655 BE1S3953 

BEQ66941 BE066911 BE066979 BE066929 BE066925 

BE074910 BE074913 BE07491 1 BE074903 BE074892 BE074935 

BE090689 BE090585 BE0906g7 8E090680 BE090&91 BE090696 BE090698 BE090686 

BE144017BE185527BE144023 

BE144884H97942 

BE154078BE154006BE153891 BE153523BE153983 
H71252BE154668BE154642 

BE154837 BE154879 BE154850 BE154877 BE154835 BE154849 BE154902 BE154g05 BE154867 BE154901 BE154904 BE154899 

BE176776H85072 

BE250744BE294770 

AW450979 AA136653 AA1366S6 AW4193B1 AA984358 AA492073 BE168945 AA809054 AW23803B BE011212 6E011359 BE01 1367 BE01 1368 

BE01 1362 6E011215 BE011365 BE011363 

6E3g0440 

C15671 C15333 C15S33 D81433 C14657 C14811 

R41400 H06659 Z38637 F02023 T16516 

F13168R21289T77628 

AA381209 AA381245 AA167683 

R52414R61681 H12009 n4429 

H51299 H44619 H46391 R86024 H51892T72744 

H48713 H83057 H48694H6S247 

BE407197 AA182474 AA180369 BE275628 6E276131 

H62953N76608N72413 

H69044 T47567 H75691 T50292 

H98716N90792N24283 

AW934714 BE161007 BE162500 AW749902 AW749864 BE1624g8 BE161005 AA190449 AWS13465 BE161006 BE162499 

BE019870 AA191284 AA191255 

AW974345 BE14S400 T97088 AA639076 

R06552T98985 R08656 

T99898 T92080 T91447 R07806 R07779 

T26453 Z44226R20425 

AW962081 AA218925 AA354237 

AW743855 AA225995 AW750208 AW750206 

AA244199AA244272H57440 

W27493 AA248063 AA249685 AA429679 

R77402 AA262462 AA250988 R06794 

AA255652 AA28091 1 AW967920 AA262684 

AA278808 BE082076 BE081812 BE081581 

AW9761 53 AA278945 AA747691 

AA329711 AA2a7436AA283148 

BE156476BE156473BE156474BE156475AA302839 

N34524 AA305071 AW354803 AA502335 AI433430 A1203597 AW026670 AW265323 AW8S07S7 AA317554 AW993843 AW835572 AW3B551 2 
Ai334966 W32951 K62656 H53902 R88904 AW835732 
AA309272 AA309312 AW961837 

AA310527AW962295 Z44865H)6641 . 
AA631498 A1017191 AA49121 1 AA761823 AA714S55 AA768099 AA808288 A1934069 AA570223 AA574389 AA582438 AI745348 AW964510 
AA31S842AW85375eHS6414 
AW884848 AA322213 AA322209 AW961624 
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423377 


22769^1 


423532 


229382.1 


423735 


231498J 


423841 


232507_1 


423867 


232732^1 


424872 


244505,1 


425201 


247933.1 


425497 


252499.1 


425570 


253248.1 


426226 


262918.1 


426497 


268121.1 


428603 


269825.1 


427551 


280163.1 


428048 


286394.1 


428679 


294049.1 


430212 


314437.1 


430818 


324239.1 


430844 


324570.1 


430933 


325757.1 


431071 


327550.1 


431169 


328799.1 


431322 


331543 1 


431595 


335512-1 


431688 


336609.1 


432111 


341618.1 


432189 


342819.1 


432222 


343347.1 


432779 


354024.1 


432803 


354267.1 


432869 


355475_1 


433347 


36388 1 


433492 


367934.1 


433584 


370400.1 


433644 


371919.1 


433687 


373061.1 


434138 


380672.1 


434374 


384869.1 


434613 


389550.1 


434636 


38990.1 


434950 


398081.1 


434995 


397210.1 


435079 


399783.1 


435463 


406582.1 


435634 


409239.1 


436532 


42180^.1 


437146 


43371.1 



438005 

438458 

438909 

438993 

440320 

440674 

443613 

443657 

444314 

444610 

444910 

446096 

446901 

447197 

448404 

449540 

450024 

450458 

450522 

451024 

451487 

452453 

452542 

453823 

453901 

454190 

454193 



454352 
454359 
454389 
454403 



447553.1 

457837.1 

46684.1 

467651.1 

491930.1 

49997.1 

575391.1 

576685.1 

600667.1 

612257.1 

624951.1 

661959.1 

697809.1 

711623.1 

761515.1 

80945J 

82296.1 

83566.1 

837264.1 

85S65J 

87131.1 

918300 1 

921410.1 

982526.1 

986414.1 

1049998.1 

1050256J 



1129567.1 
1130874.1 
1156B2_1 
1170435.1 



AL049377AL079930AL047223AW885968AA385235 ^ 

6E09(S03 BE090490 BE090463 BE090501 BE090500 6E090365 BE090505 BE090S09 BE090499 BE090502 6E0S0S04 BE090497 AA327285 

AA330259 AA661806 AA502431 AW974633 AA649496 

AW753967 AA370795 AA331630 AW962550 

AA331886 AW962659 AW9S2655 T89841 

AA347923AA347928AW95t769 . 

AA3521 1 1 AW962247 AA429695 

AA524596 AA3S8494 AA513507 

AA359556AA3S9819AW963122 

AA769045 AA372590 AW963633 

AA379913AA3799ai AW953523 

AA382291AAg94657 

T96203AM05343 796121 

AA70S745AA420850AA420433 

AA431765AA432015 

AA469153AI718503AA489225 

At31 1928 AAS36030 T51931 AA609816 AA487195 AA664207 

T94960AA487679 T9S013 

AW8536356E179901 BE179402AW863789AA489179BE010747BE010748 

AA491379 H86020 AW969148 

AW971240AM93843 AA493723 

AW970622 AAS03009 AA502998 AA502989 AA502805 T921B8 

AA508196 BE142920 AI280311 AI205616 D81709 

AA513906 AA847734 A)357044 

AW972777 AA526795 AA526305 

AA527941 A1810608 AI620ig0AA635266 

AI204995 AWB27539 AWg69908 AW440776 AA528756 

AWg79241 AA565006 AA847102 

AA565398AW894072 H97930 

AW974094 AA559074 AA502574 

AF023130 AF181 250 AA984703 AA694303 AA3S1792 

AW605849 AW%2898 N41060 AAS94852 

AW295399 AW207772 AW3O0641 AW070290 BE348854 AW170383 AA600968 AA778832 

AW342028 AA641080 AA603282 

AA743991 AA604852 AW272737 

AA825804 AW418787 AW074833 AI675642AI333368 

AA631439Ai086355AI082577 

AI821 826 AA640657 AA658836 AA65005S AI821079 

H50758AF147434H50666 

AW974892AA6S4375 

AW97499SAI821880Al821932Ai791196AA6S9617Ai821137AASS6925 
AA664192 H60250ni388 
AA682507AW851124 
T82384 R05307AA693714 

AA721522AW97S443T93070 _ 

AA730977 AI261584 AA334473 Z43283 AWB75861 AW938044 BE150701 AW936262 AA306862 BE565575 BE567380 AA728920 AA167612 

AI239729At251752AA485791 BES66425AV\I982956 

BE151746 BE336853 D63271 T94955 AAn4994 

AW975186 AAB07807 D29548 

AF065839 RG9137 AW188786 R69254 

AA82899SAA834879AI926361 

AA879294 N57538AW74541 

BE551546 Z25124 AI307139 Z28800 

AI079356 W23287 

R14973R14967AI081006 

AI140497 AW749625 AW749626 AW749644 

AI1747B3 R83569R12271 

A1201849 BE069007 AW948544 

A1276454A1633717AI275116 

A)347274AW844024 

R36075A!366546 R36167 

BE089973 AI498612 AW805032 

AA001713H63836 ^ 

AA005129 AA679084 AA694399 

AA009926 BE149301 

AI698839 AI909260 AI909259 

AA442176AA259181 

AA018072N4637OR84847 

AI902519A1902518At902516 

AV\»1 2256 AW812257 A)906423 AI906422 

AL137S67 BE064160 BE064186 

BE0658Q2 AW749032 AW003637 

AW177821 AW177B96 AW177867 

BE141183 AW178167 AW1781 62 AW178166 AW178172 AW845893 AW178159 AW178222 AW178213 AW17821 5 AW178090 AW178091 

AW178161 AW17e207 AW178210 AW178214 AW178212 BE140918 BE140917 AW178135 AW178205 AW178209 AW178223 AW178220 

AW178206 AW178203 AW178165 AW178168 AW178160 AW178136 AW845878 AW178131 AW178138 AW178105 AW845894 AW178129 

AWB45810AW845828AW178216AW178112 AW178211 AW17B224 BE140915 AW178221 AW178130AW178134 AW178096 AW178108 

AW7B133AW178164AW17821BAW178171AW178157AW178158AW178103BE141ie9AW178170AVV845816BEU 

AW178104 AW178163 AW178093 AW178208 AW17B137 AW178140 AW178219 BE141592 AW845901 BE1415eO AW178155 BE141598 BE140957 

AW389668 AW389657 AW609198 AW389649 

N71277AW390764 

AW752S71 AWB47602 AA07797g 

6E06S98S 6E065944 BE066008 6E066083 BE066093 
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454455 
454566 



454574 
454600 

454678 

454693 

454714 

454766 

454784 

454790 

454838 

454962 

455047 

455092 

455100 

455107 

455114 

455170 

455201 

455226 

455252 

455286 

455310 

455431 

455488 

455511 

455512 



455571 
455631 
455678 
455685 
455807 
455821 
455866 
455992 
455995 
456034 

458804 
458861 
459160 
459201 
459267 



1206965J 
1224432.1 



1225636.1 
1226077J 

1228915.1 

1229132.1 

1230493.1 

1234022.1 

1234630.1 

1234752.1 

1236509.1 

1246750.1 

1250536J 

1252971.1 

1253334.1 

1253874.1 

1254106.1 

1256906_1 

1259748.1 

1262534.1 

1266222L1 

1273576.1 

1278158J 

12B9854J 

12937?.-1 

1321229.1 

1321443.1 



1331885.1 
1347545.1 
1349716.1 
1350393.1 
1370914.1 
1372714.1 
1377119.1 
1398552^1 
1398903.1 
142696.1 

75803.1 
798085 1 
920051.1 
LI 
.1 



AWB07605 AW807690 AW807839 AW807752 AW807673 AW807667 AW807955 AW807760 AW807615 AW80789B AW807849 AW807821 
AVW07832 AW807842 AW807827 AW807822 AW807829 AW807830 AW807825 AW807603 AW807612 AW807908 AW807595 AW807617 
AW807S7B AW807687 AW807918 AW807921 AW807596 AW807602 AW807688 AW807509 AW807684 AW807770 AW807593 AW807754 
AW807679 AW807957 AW807683 AW807763 AW807902 AW807840 AW80781 9 AW807836 AWB07769 AW807685 AW807B47 AW807674 
AW807685 AW807670 AWWy791 7 AW807677 AW807680 AW807900 AWB07669 AWB07952 AW8O7907 AW807846 AWBOHSB 

AW807835 AW807608AW807753 AW807601 AW807956 

AW8O91O9AVV809112AWB09122AVV809126AWB09128AVV809133AVV809131AV^ 

AW810001 AW810092 AW810170 AW809B84 AW809664 AW810353 AW810428 AW810209 AW810429 AW810154 AW81016B AW809786 
AW810006 AW809672 AWB09694 AW810552 AW810345 AW810432 AW809960 
AW813089 W28102 

AW81342a AW813444 AW813367 AW813368 AW813429 AW813424 

AW815098 8E154843 BE154B31 

AW866497 AWB19775 AW819888 AW866602 AW866561 

AW820626 AW82(K21 AW820608 

AW820852 AW820773 AWe21088 

AW83371 1 AW833620 AWB33699 

AW847645 AWB47791 AW854083 AW853945 

AWB52530 AW852527 AW852526 

BE152428 AW855572 AW855607 

BE16019B AW935898 T11520 AW935930 AW856073 AW861034 
BE1541 13 AW856797 AW856847 AW861 128 AW856817 

AVW60972 AvEL AW862599 AW850988 AW860983 AW860B98 AWBe0925 AW860922 AW8609B6 AW8609B4 AW860989 

AW947884 AW94791 8 AW947888 AW947883 AW947897 AW94791 0 AW947905 AW864751 AW947B7B 

AW902103 AWB8901 2 AWB691 39 

AW876627 AW876630 AW876631 AW876625 

BE1443B4 AW887474 AWBa7403 BE144388 

AW893961 AW8939S8 AW894034 AW894019 

AW938484 BE001245 6E001190 

AA102322 

AW9M^ AvSm AW983610 AW983688 AW983601 AW983645 AW983607 AW983640 AW983625 AW983612 AW983642 AW983687 
AW983602 AW9B3624 AW983634 AW983637 AW983632 AW983817 AW983635 AW983630 AW983636 AW983639 AW983616 AW983689 
AW983641 AW983621 AW983603 AW983609AW983623AVV983644 AW9B3618 AW983615 AVV983611 AW983604 AW9836B6 AW983622 
AW983619AW983633AW983589AW983805 AW983626 AW983643 AW983631 AW983627 AW983813AW983614AW983685AW983593 
AW983590 AW983594 AW983620 AW983638 AW983592 AW98358B 
BE003714 BE003721 BE003720 BE003716 

6E063031 6E063002 BE063008 BE083024 6E063040 BE063006 BE063072 
BE066007 BE066017 BE066074 



BE141140BE141139BE141105BE141143BE141127BE141202BE141108 
BE143341 BE143344 BE143378 BE143368 

BE149024 BE149056 BE152826 BE149025 BE149057 BE152819 BE149030 BE149062 BE149023 BE149055 
BE179015 BE178965 BE179010 BE179002 BE178961 BE179005 BE178964 BE179012 BE179011 BE178963 BE178997 
BE17940B BE179798 BE179980 

AW4^79 AA136553 AA136656 AW419381 AA9M358 AA4W^ 
BE011362 BE011215 BE01 1 365 6E01 1363 
AL157625 N72696 BE622492 
A;530223AI630470 

Ai904723 Ai904725 AI904729 A1904722 Aig047S8 Aig04736 

AW391177 W45021 ' 
AJ003631AJ0036S0AJ003651 



TABLE 29C 



Reft Sequence source. The 7 <«gll numbers In IWscohjmn are Genbankl(tentjfler(GI)nun*^ T)unhamI.elaL"rBfefBtolhopubncallonenBBedTI»DNAseque^ 

human chromosome 7Z' Dunham J. et a)., Nature (1999) 402:48W95. 
Strand: Indicates DNAstrarid from wtrich axons wero predicted. 
NLposition: Indicates nudeoGdeposIOons of pieicledexon& 



Pkey 


Ref 


Strand 


NLposHion 


400451 


8113550 


Minus 


82189-82320 


400462 


9929659 


Mnus 


197610-197785 


400608 


9887666 


Minus 


96756-97558 


400639 


9887597 


Plus 


23150-23580 


400641 


8117693 


Plus 


47684992 


400756 


6119084 


Minus 


38734-3K57 


400859 


9757499 


Mnus 


91888-92018.98131-98294.99474^0 


400880 


9931121 


Plus 


29235-29336.38363.36580 


400889 


9958234 


Minus 


169782-170038 


400983 


B081198 


P!us 


107903-108832 


401045 


8117619 


Plus 


90044-90184.91111-91345 


401049 


7232177 


Plus 


149157>150692 


401078 


3687273 


Rus 


105052-105171 


401094 


9965511 


Plus 


137130-137302.139283-139506 


401103 


6568122 


^^us 


98330-98449 


401157 


9438269 


Minus 


114133-114247.114567*114645 


401169 


9690248 


Minus 


90815-90929 
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401200 9743387 Minus 

401213 9858408 Rus 

401254 9796309 Rus 

401323 9212516 Plus 

5 401335 M84881 Plus 

401497 7381770 Rus 





401517 


7677912 


Phis 




401528 


7770561 


Rus 


10 


401575 


7229804 


Mbttis 


401694 


3540172 


Minus 




401793 


7263888 


Minus 




401662 


7770606 


Minus 




401878 


8099802 


Minus 


15 


401986 


4406829 


Minus 


402046 


8072415 


Plus 




402048 


6072512 


Plus 




402102 


8117771 


Minus 




402103 


7249203 


Plus 


20 


402230 


9966312 


Minus 


402318 


7582559 


Minus 




402490 


9797648 


Rus 




402745 


9212200 


Minus 




402800 


6010175 


Rus 


25 


402812 


6010110 


Plus 


402820 


6455853 


Minus 




402855 


9662953 


Minus 




403133 


7331427 


Phis 




403271 


7230852 


Plus 


30 


, 403277 


8072597 


Minus 


403310 


8139936 


Minus 




403329 


8516120 


Plus 




403356 


8569930 


Rus 




403378 


9438244 


Minus 


35 


403388 


9438331 


Rus 


403467 


9929556 


Minus 




403515 


7656757 


Minus 




403525 


7960440 


Phjs 




403534 


8076917 


Minus 


40 


403568 


8101145 


Minus 


403574 


8101158 


Rus 




403637 


8871936 


Minus 




403677 


7331517 


Minus 




403691 


7387384 


Minus 


45 


403760 


7712202 


Minus 


403776 


7770611 


Minus 




403895 


7381715 


Minus 




403937 


7711761 


Minus 




404043 


9558573 


Pius 


50 


404097 


7770701 


Rus 


404200 


6010176 


Minus 




404249 


8655533 


Rus 




404274 


9885189 


Rus 




404285 


2325514 


Plus 


55 


404286 


2769644 


Plus 


404356 


7630858 


Minus 




404443 


7579073 


Minus 




404476 


8080699 


Phis 




404488 


8113266 


Minus 


60 


404513 


6151941 


Minus 


404548 


8570305 


Minus 




404555 


7243881 


Minus 




404561 


9795980 


Minus 




404588 


6456726 


K^nus 


65 


404593 


9944086 


Minus 


404599 


8705107 


Rus 




404860 


8979555 


Plus 




404916 


7341826 


Rus 




404957 


7407927 


Rus 


70 


405041 


7547195 


Rus 


405059 


7655683 


Phjs 




405090 


8072525 


^finus 




405257 


7329310 


Plus 




405336 


6094635 


Rus 


75 


405472 


8439781 


Rus 


405494 


8050952 


Minus 




405547 


1054740 


Rus 




405621 


5523811 


Rus 




405634 


5306288 


Plus 


80 


405654 


4895155 


Minus 


405692 


4314424 


Rus 




405759 


3286022 


Minus 




405829 


7109593 


k^us 




405848 


7651809 


Minus 



111586.111806.114791014916,115419-115583.116351.116446,116847-116907.122853.123057^24 

98243-98380.98489-98619 

152209-152383 

213509-214450 

15736-16352 

92607*92813 

29278-29770 

91570-93177 

76253-76384 

6405&64168 

102945-103083 

55839-55993.59145^293 

162268-162474,163089-163195 

31137-31293 

166394-166556.168167-16839S 

4393644078 

174566-174740 

14453-15414 

29762-29932 

12843-13403 

149982-150929 

76516-76690 

43921-44049.46181-46273 

25026-25091.25844-25920 

82274-82443 

59763-59909 

38314-38634 

134283-134485 

27494-27642 

183883-184026 

96450-96598 

92839-93035 

44264-44443 

112733-113001.114599-114735' 

73431-73602 

173358-179553 

152431-153243 

46652-47332 

85509^656 

5542-6176 

142647-142771.145531-14S762 

55008-55083.62860-63051 

68280-88463 

4591046260.4756347824 
1414-1513.1624-1756 
3502-400^40704308 
12609-12773 

29042-29135,4659746699 
55512-55781 
7066-7210 
64270^633 
104127-104318 
32282-32416 
3512-3691 
126433-126623 
87198-B7441 
101841-102043 
64835-64994 
112837-113339 
83896-84162 
63963-64157 
• 69039-70100 
4005940210 
74922-75788 
110443-110733 
65852-66081 
91057-91188 
147512-148011 
121230-121714 
349^ 
38552-39202 
73121-73273 
33267-33563 

106297-106447.108462-108596 
70284-70518 

124361-124520.124914-125050 
59362-59607 

17856^17957,18302-18412.18837-18927.22790-22989 

53624-53759 

61379-62562 

18263-18399 

15628-16127 

28135-28244 
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405988 


8247788 


Minus 


51762-51978 


40S970 


8247789 


Minus 


45795^6295 


406018 


6758904 


Minus 


37795.38168 


406091 


9123919 


Minus 


197370.197935 


406092 


9123919 


Plus 


251370-251797.252168-252882 


406149 


7144791 


Minus 


4446445164 


408195 


7289992 


Minus 


3629^^36827 


406333 


9213235 


Ptus 


64689^798 


406466 


9795553 


Plus 


4373^16.8870-9046.11366-11509.11625-11880 


406506 


7711374 


Minus 


6843-8077 


406554 


7711566 


Rus 


106956-107121 


405603 


8272659 


Minus 


3950&39694 



TABLE 30A: ABOUT 1840 GENES UP-RE6UUTE0 IN IDIOPATHIC PULMONARY RBROSIS (IPF) COMPARED TO HYPERSENSITIVITY PNEUMONITIS (HP) 

Table 30A lisis dnut 1840 genes flial ob up regulaled in Utopattdc puhnonay fibrosis (IPF) samples as compared virtth hypereensllivlty pneumonitis (HP) samples. These were 
selected from about 59680 probesels on an AffymeWxffios Hu03 Gene CWp array such ftal (he rafc of 'avefage* IdiopaOiic putmonanr fibrosis sample expression te^rel to •average' 
hypeisensiOvfty pneumonifis samrrte expresston was greater lhan or equal to about ^0. The ■merage" WlopatWc pulmonary fibrosis tevel was set to the 90* pereeniile amongst 
kfnpalhic pulmonaiy fibrosis samirtes. The "average" hypcrsensifivily pneumonitis tevd was set to Ihe 90* percenhTe amongst hypersensibvity pneumonitis samples. 

Pkey: Unique Eos probeset Identifier number 

ExAocn: Exemplar Accesston nun^. Genbank accession number 

UnigenelD: Unigene number 

Unigene TiUe: Unigene gene title 

R1: Ratio of IPF popathicputrnonary fibrosis) to HP (hypersensitivity pneumonitis^ 



Plcey ExAccn 

450478 AW451709 

432365 AK001106 
405654 

440209 H05049 

407811 AW190902 

439608 W79123 

425259 AL049260 

426230 AA367019 

416653 AA768553 

420481 U50S25 
403574 

415817 U88967 

419519 A1198719 

435256 AF193766 

423017 AW178761 

429629 BE501732 
40^3 

428766 AA477989 

441802 AA968636 

453649 Y07494 

447410 AI47023S 
442353 BE379594 
405494 



UnigenelD Unigene Title 

Hs.271200 ESTs 

Hs.274419 hypothetical protein FU10244 

Hs.22269 neurexin3 

Hs.40098 cysteine knot superfamfly 1, BMP antagon 

Hs.58561 G protein-coupled receptor 87 

Hs.1 55397 Homo sapiens mRNA; cDNA DKFZp564K143 (fr 

Hs.241395 protease, serine, 1 (trypsin 1} 

Hs.74170 metailothionein IE (functtonal) 

Hs.98201 Human BRCA2regton,mRNA8equenceCG029 

HsJ8687 protein tyrosine phosphatase, receptor-t 

H3.176376 ESTs 

Hs.13872 cytokine-like protein 017 

Hs.227948 serine (or cysteine) proteinase totdbito 

Hs.30622 HomosapienscONAFU13010fis,doneNT 

HS.9&800 ESTs 

Hs.127877 ESTs 

Hs^114 ATPase.Na*/K«transpor«)g.8lpha2(>) 

Hs.172698 EST 

Hs.49136 ESTs.ModeratelyslmilartoALU7_HUMANA 



442377 


AAg93807 


Hs.167367 


ESTs 


409928 


AL137163 


Hs.57549 


hypothettoal protein dJ473B4 


420407 


AA814732 


Hs.145010 


Dpopolysaocarfde^pecific response 5-Q 


415238 


R41400 




tfb:yl94b12^1 Soares Infant brain 1NIB H 


451562 


H04150 


Ks.107708 


ESTs 


403310 








445189 


AI936450 


Hs.147482 


ESTs 


409632 


W74001 


Hs^279 


serine (or cysteine) proteinase inMbOo 


439780 


AL10g668 




gb-.Homo sapiens mRNA full length Insert 


402076 








415025 


AW207091 


Hs.72307 


ESTs 


406690 


M29540 


Hs.220529 


can:}noembryonic antigen-related ceB ad 


438557 


AW364104 


Hs.143509 


hypothetical protein FU21924 


428042 


AA4ig529 


Hs.76391 


my»3vlros (Iiifluenza) resistance 1. homo 


418007 


M13509 


Hs.83169 


matrix meteltoprot^nase 1 (interstitial 


409545 


BE296162 


Hs.19002 


hypothetk»l protein MGC4675 


446619 


AU076643 


Hs.313 


secreted phospboprotein 1 (osteopontin, 


411966 


AA099113 


Hs.1 16609 


ESTs 


440274 


R24595 


Hs.7122 


scrapie responsive protein 1 


442879 


AF032922 


Hs.8813 


syntaxbitjindtng protein 3 


419236 


AA330447 


Hs.135159 


Homo sapiens cONA RJ11481 fis. done HE 


420185 


AIJ044056 


Hs,158047 


ESTs 


415672 


N53097 


Ks.193579 


ESTe 


45S488 


AA102322 




gb2l90f03j1 Strategene colon (937204) 


420026 


AI831190 


H&166676 


ESTs 


446868 


AV660737 


Hs.135100 


ESTs 


431622 


AW979271 


Hs293184 


ESTs 


407266 


AJ23S664 




gb;Homo saptens mRNA for immunoglobulin 


421300 


AW297398 


Hs.96617 


ESTs 


416045 


H1S990 


Hs.31403 


ESTs 
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414175 AI308876 
424693 BE169810 
436397 AA715013 
440504 AI948966 
409716 086640 
403625 

418988 A1123555 
416035 H42314 
400292 AA250737 
442849 R10099 
440887 AI799488 
427535 R29543 
410934 AW811114 
431374 BE258532 
444963 AI916973 
447530 AW192063 
444992 R37658 
416575 W02414 
431211 M86849 
451830 H18433 
446466 H38026 
404043 

423454 AL1104S6 
455540 BE080231 
434683 AW298724 
445898 AF070823 
422308 BE04432S 
428895 AA437124 
450018 AA421642 
419249 X14757 
455047 AW852530 
454039 AW079064 
403637 

414725 AA769791 
409073 AA063458 
403329 

434001 AW950905 

459664 

401497 

410797 AW857191 
411402 BE2g7855 
448844 A1581519 
435202 AI971313 
439418 A1282149 
443584 A1807036 
434352 AF129505 
430838 N46664 
430882 BE174240 
440129 AAB65818 
437636 AA7647B1 
455747 BEO74910 
455464 AW983901 
418771 AA807881 
434820 Ai821863 
440615 A)733055 
454482 BE147919 
400432 AX015809 
436508 AW604381 
423607 AA328329 
407415 AF07332B 
401878 

443162 T49951 
451325 AA021283 
440515 AJ131245 
406333 

409105 AW467539 
408000 L11690 
421482 AL135462 
442757 AI739528 
459717 

438637 AI783829 
412222 AA528283 
450101 AV649989 
410901 AW810001 
426217 AW131888 
441640 AI733345 
422977 AA631498 
425361 AA355933 
414955 C15506 
411965 6E467339 
403341 

411726 AW858612 
443271 BE568568 



Hs.103849 hypdheOcd protein DKFZp76tD112 

Ks.47557 ESTs 

Hs.169835 ESTs 

Hs.130017 ESTs, WteaWy similar to JN0908 H^-transp 

H5.56045 src homology Uvee(SK3} and cysteine d 

Hs.81796 ESTs 

gb:yo09e02^1 Scares adidl bran N2faSHB5 

H5.72472 ESTs 

Hs.269805 ESTs 

Hs.135905 ESTs 

Hs.2164 pro-plateiet basic protein (Indudes pta 

gb:MR^ST0131-111199^16^ Sr0131 Homo 

Hs.251871 OTP synthase 

H5^13603 ESTs 

Hs.248865 ESTs. Moderately stmRar to JC5238galac 

Hs^1375 ESTs 

Hs.38383 ESTs 

Hs.323733 gap Junction protein, beta 2. 26kD (oorni 

Hs.21542 KIAA1035 protein 

Hs^ anesUn 3. retinal pC-anestin) 

H8.46g sucdnalB dehydrogenase complex, subunil 

gb:RC4^0629-1202004)12-ni BT0629Homo 

Hs.202639 ESTs 

Hs.1 3423 Komo sapiens done 24466 mRNA sequence 

HS.22728Q U6 snRNA-assodated Snvfike protein 

Hs.187247 ESTs 

Hs^4309 hypothetical protein FU11106 

Hs.89768 gamma-amiiiobutyric add (GABA) A recepto 

gb:PllA1-CT024»)7l099^1-gO& CT0243 Homo 

Hs.245540 ESTs 

H5.1 25300 dng (inger protein 21, interferon-respo 

gb2f71a07^1 SoaresjinealjIand.NSHPG 

Hs^97 serine (or cysteine) proteinase Inhibito 



gb:RC2-CT03044)80100^11-b12 CT0304 HOmo 
Ks.6g855 NRAS^Iatedgene 
Hs.177164 ESTs 
Hs.170204 KIAAfl551 protein 
Hs.56213 ESTs, Highly similar to FXD3_HUMAN FORKH 
H3.267245 hypothetical protein FU14803 
Ks.66492 small muscle protein, X-4inked 
Hs.169395 ijypclhetical protein FU12015 
Hs.79024 heterogeneous nuclear ribonudeoprotain 
Hs.174936 ESTs. Wealdy simiiar to S71886 Sle20^iK 
Hs:>91844 ESTs 

gb:RC5-BT0680-170300^1-F12 BT0580 Homo 

gb:RC1.HN0003.220300^1 1-f10 HKD003 Homo 
H3.25329 ESTs 

gb:ns90f05.xS Na.CGAP_Pr3 Homo sapiens 
Hs.130806 ESTs 

gb:RC3-KT0230-16020O4)1&«08 HT0230 Homo 
H5.287767 Sequence 8 from Patent WO9950285 
K$.121 121 ESTs. Weakly simDar to S00755 pieckstrl 
Hs.6591 ESTs 

Ob:Homo sapiens tetracyHne Iransporter- 

Hs.9029 DKFZP434G032 protein 

Hs.59788 ESTs 

Hs.7239 SEC24 (S. cerevisiae) related gene famll 

Hs.255877 ESTs 

Hs.620 bullous pemphigoid an^en 1 (230/24QkD) 

Hs.104715 Inversin 

Hs^5 ESTs 

Hs.26766 ESTs 
Hs.292737 ESTs 

Hs.24385 Human hbc647 mRI^ sequence 

gb:MR4^T0124-270300^)05^11 ST0124 Homo 

Hs. 172792 ESTs, Wealdy similar to hypothetical pm 

Hs.144104 ESTs 

gb3ip83h04^1 NQ.CGAP.Thy1 Homo sapiens 

Hs.132221 hypothetical protein FU12401 

gb£15506 Oontech human aoita polyA> mR 

H5.280115 ESTs 

gb.-CM3XT0341-190400-152-li12CT0341 Homo 
Hs.195704 ESTs 
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417181 L10123 
426097 BE327369 
439199 R40373 
440728 AW086077 
434361' AA631634 
417428 N87579 
431291 N25521 
431242 AA987742 
426885 BE394849 
442360 AI374621 
452171 Aia63302 
440801 AA908366 
411738 AW859353 
431447 AAS05138 
433485 AM93076 
401365 

408281 BE141183 
411657 AW855563 
423065 Rg61S8 
428528 Ai004034 
454036 AA374756 
417252 AAigSOU 
417135 AA422087 
403089 

420691 AA829433 
412147 AW895984 
• 425578 U55652 
430403 AF039390 
454438 AA224053 
435434 AA660387 
420828 AA2B0778 
435586 AI279137 
452393 H87398 
416170 H42454 
408691 AW250525 
428912 AW103117 
455511 BE144762 
413849 BE173561 
401189 

425733 F13287 
447863 AL047611 
422654 AA314316 
435463 AA582507 
417919 A1928203 
405784 

431853 AA521034 
409629 AW4495B9 
403261 

427173 BE255017 
433717 AF063536 
406777 T23625 
410481 R34107 
419511 AA429750 
452291 AF015592 
449762 N93057 
421106 AA877124 
439382 BE247684 
404957 

436332 AL049679 
446393 AW014174 
452728 A1915676 
456386 W28481 
406268 AW06a311 
416972 BE019670 
427099 AB032953 
403344 

438993 AA828995 
444922 A1921750 
401596 AA172106 
418693 Ar750878 
414299 AA142989 
4S2744 AI267652 
4585S2 AW136139 
421065 AA329711 
439294 AW975328 
441201 AW118822 
434377 AW137148 
440472 AA886169 
418379 AA218940 
435878 R08330 
437263 AA747822 
444087 AV647899 
411745 AW887826 



Hs.1071 surlactanl protein A binding protein 
Hs.112238 EST5 
Hs.262g9 ESTs 

Hs.153272 Homo saptens cDNA: FU22715 fis. done H 
gb:np77h054i1 Na.CGAPJ>r2 Homo sapiens 
gb:LL2030F Human fetsi heart, Lambda ZAP 

H$^275 KnippeMype zbic Qnger protein 

Hs^1278 K1AA1201 prat^ 

Hs.131905 EST8. liitoderately simitar to Z195J1UMAN Z 

Hs^9055 ESTs 

Hs.211930 EST 

HS.190S35 ESTs 

gbtMR1-CT0353-150300-102-8l2 CT0353 Homo 

Hs.291341 ESTs 

H8.201967 aldo^eto reductase family 1. member C2 

gb:MWMfrW71-191199^1-b04 HT0071 Homo 
gb:Ciyi4-CT0278^1099^-l07 CT0278 Homo 
Hs^7130 Homo sapiens, done MGC:5406.mRNA.comp 

Hs.98638 ESTs 

Hs.93560 Homo sapiens mRNA lor KIAA1771 prolain. 
Hs.85971 ESTs 
Hs.50547 ESTs 

Hs,275343 ESTs 



Hs.158313 
Hs.241382 
Hs.172405 
Hs.187850 
Hs.186878 
Hs.151498 
Hs.99858 
Hs.220645 

Hs.98949 

Hs.15364 



Hs.163725 

Hs.86379 

Hs.70834 
Hs.279724 

Hs.97540 

Hs.150680 

Hs.321450 

Hs.75113 

Hs.2a853 

Hs.54888 

Hs.172844 

Hs.103070 

Hs^2302 

Hs.301956 

Hs.239708 

HS311054 

HS.173S60 



Hs.144871 
Hs.110950 
Hs.87409 
Hs.71730 
Hs.30604 
Hs.245856 

Hs.6523 

Hs.128757 

Hs.306593 

Hs.169071 

H5.137516 

Hs.20152 

HS.2B2375 



gb:QV44iN0039^0500-197<e08 NN0039 Homo 
chromosome 17 open reading frame lA 
lumor necrosis factor (tigand) supertami 
cell division cyde 27 
ESTs 
ESTs 
ESTs 

ribosom^ protein L7a 
ESTs 

gb:2B2162a5prime NIH_MGC_7 Homo sapiens 
ESTs, We^ similar b MEA6 [Hxaplens] 
gb:(mHT0180^1099^S4)04 HT0180 Homo 
API gamma subunil binding protein 1 

Homo sa^riens done 23578 mRNA sequence 
Homo sapiens cDNA FU14246 fis. done OV 
ESTs 

gb:z]18f0as1 Soare$JstalJver.^pleen. 
ESTs 



ESTs 
ESTs 

ESTs 

gb'AfH)63538 Homo saptens library (Yu Y) 
putative translation iniflation factor 
pregnancy specific beta-l-glycoproteb 2 
general transcription factor IIIA 
C0C7 (ceil division cyde 7. S. cerevi^ 
ESTs 
ESTs 
ESTs 

Homo sapiens cONA nJ14814 lis. done 1^ 

zinc finger protein 

ESTs 

gb:47e1 Human retina cONA randomly prime 
Homo saptens mRf^ fiili length Insert cDN 
gb:bb28c0lJ(1 NliiMGC.S Homo sapiens cDN 
odd Ozfyam homdog 2 (Dioeophila, mous 

gb:od77b08£l NCl.CGAP.0v2 Homo sapiens 

Homo saptens cONA FIJ13752 fis. done PL 

RagCprotein 

Ihrombospondin 1 

ESTs 

Homo sapiens mRNA; cONA DKFZp434E082 (fr 
ESTs 

gb:EST33382 Embryo. 12 weeic II Homo sapi 
chromosome 1 open reading frame 12 
ESTs 

Homo saptens cONA RJ 1 1382 lis. dona HE 
ESTs 

fidgelin^iice 1 
ESTs 

gb:nx97a04j1 NCI_CGAP.GC81 Homo sapiens 
ESTs 

^jyiRO^N0039-30030&OO1-cO2 8N0039 Homo 
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438660 U95740 
405521 

411597 AW852925 
415655 WD5433 
404822 

441107 AA917a75 
404834 

41Z76S AW996044 
428102 AA968441 
436511 AA721252 
441247 AW118881 
453098 Z25335 
410811 AW80S687 
425048 H05468 
431071 AA491379 
436298 AW2g3498 
440356 Ai9331B4 
452768 AWD694S9 
455241 AW876249 
409070 AA063003 
409044 A1129586 
419091 T85332 
422591 L07648 
403188 

416857 D10216 
413585 A)133452 
436149 AI754308 
443682 Ai383061 
437916 BE566249 
439618 AU60137 
438361 AAB05666 
451221 AI949701 
455475 AW948126 
433197 AB040889 
429881 T80112 
415598 A)433165 
431220 N52937 
433132 AB026264 
424029 AB014594 
404443 

407340 AA810168 
410318 AA084050 
412400 AW948066 
427167 A)239607 
438090 AA777534 
407936 AA905097 
440454 Ai733037 
417706 T90797 
426692 AI372B22 
407762 AW235638 
420727 H75701 
417508 BE163512 
413525 BE145699 
425798 AA364002 
459429 AA278779 
430205 AB025804 
437458 AL390131 
451073 AI758905 
452786 R61362 
429846 AB023021 
444414 AW293214 
402815 

410585 AW770523 
425168 R98366 
449729 R72032 
459359 N99545 
456443 AW967500 
439001 AFO7S068 
443657 R14973 
404193 

416379 N38857 
422511 AU076442 
426603 AA382291 
412589 R28660 
421037 AI684808 
427088 AA3980B5 
429927 NM.001115 
453375 AI990114 
435451 AF195420 
451882 AI821324 
419983 W55956 
40S001 US6196 
422182 ALD43892 



Hs^ Homo sapiens. Clone lMA(£3010566.rnRNA, 

gb:PMaCT0248-13109M01-flO CT024e Homo 
Hs.49890 ESTs 

Hs.190520 ESTs 

HS.28Z39 Human ONA seciuence 6om done RP11-438a2 

Hs.126866 ESTs 

Hs.291502 ESTs 

Hs.128051 Homo sapiens thymic sbomallymphopoieti 

Hs.86379 ESTs 

Hs.300648 ESTs 

Hs.164502 ESTs 

8b:^65f05.r1 ND^CGAP.GCBI Homo sapiens 

Hs.180138 ESTs 

Hs.127922 ESTs. Moderately slmflar to S65657 alpha 

Ks.61539 ESTs 

gb:Piyi4.PT0019-1312994}06^5 PT0019 itomo 

HS.224S60 ESTs 

Ks.33033 hypotlielteai protein FU14623 

Hs.178294 ESTs 

Hs.118630 llllAX4nlBracling protein 1 

Hs.89394 POU domain, dassl, transcription facto 

Hs.75431 fibrinogen, gamma pdypepfide 

Hs.159452 ESTs 

Hs.47248 ESTs, Highly similar to similar to Cdo14 

Hs.20999 liypotiielical protein FL123142 

Hs.19934 Homo sapiens mRNA full lengti) insert cDN 

Hs.1 46217 Homo sapiens cONA: FU23a77 lis. done I 

Hs.210589 ESTs 

gb:RC0-MT0013-260300^1^12k^0013Homo 

H5.281022 KIAA1456 protein 

Hs.192245 ESTs 

Hs.9858 ESTs 

Hs.102679 ESTs 

Hs.284245 iiypotheCcal proteb IMPACT 
Hs.137579 KIAA0694 gene product 

Hs.284289 vlfiligo-assodated protein ViT-1 

Hs.269259 ESTs, We^y similar to S23650 retrovini 

gb:RCO.MT0012-29030W)31-h10 MT0012 Homo 

Hs.99196 tiypotheBcal protein ltM3C11324 

Hs.191992 ESTs 

Hs.85050 phosphdamban 

Hs.1 29990 ESTs 

Hs.268623 ESTs 

Hs.1 10103 RNA polymerase i transcription factor RR 

Hs.29475 ESTs 

Hs.99886 complement component 4-bindlng protein, 

Hs.1808n H3 intone, family 38 (H3.36) 

gb:IWIRO4fTO208.221299-204-b10 HT0208 Homo 
gb:£ST74529 Pineal gland II Homo sapiens 

Hs.335696 EST 

Hs.235166 caibonic aniiydrase XIV 

Hs.1 28751 Homo sapiens cONA FU12235 fis, done MA 

Hs.206053 ESTs 

Hs.1 05642 ESTs. Wealdy sinflar to T09052 liypoliwfl 

Hs.225945 fucosyllransfefase9(alplta(1.3)fuco5y 

Hs.8752 transmembrane protein 4 

Hs.337501 ESTs 

gb7q37d04^1 Soares fetal Over spleen 
Hs.29235 ESTs 

gb:za40a05.r1 Soares fetal liver spleen 
Hs,133543 ESTs 

gbihiomo sapiens fiill length insert cDNA 

gb.-yf42f10.s1 Soares fetat Over spleen 

Hs.203933 ESTs 

Hs.1 17938 odlagen. type XVII. alpha 1 

gb:EST9S683 TesQs I Homo sapiens cDNA S 

Hs.24305 ESTs 

Hs.197653 ESTs 

Hs.142390 ESTs 

Hs.2522 8denylatBcydase8(brsln) 

Hs.24009t ESTs 

Hs.303006 ESTs, WeaUy similar to gamma^iereguiin 

Hs.100445 ESTs 

Hs.94030 Homo sapiens mRNA; cONA OKFZpS88£1624 (f 

Hs.296281 inlerieuMneiihancer binding fadorl 

Hs.180582 Homo sapiens cDNA:FlJ21836 lis. done H 
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4S1917 AW391351 Hs.50820 Homo sapisRS unknown niRNA 3.7 

432781 NM.014133 Hs.278940 PR006t8 protein 17 

443773 AV646452 Hs,30941 caWum channel. vollagwJependent bete 3.7 

- 40S984 M21305 ob:Human alpha sateflile and sateOits 3 3.7 

5 430682 AW971949 H$.291252 ESTs, Weakly similar b ZN91J1UMAN ZINC 3.7 

449804 AI53$663 Hs.39379 ESTs 3.7 

411505 AF15S659 Ks.70565 molyt>denumcorac(orsyn!he&2 X7 

430S03 AA533574 Hs.152274 ESTs 3.7 

443305 AI050593 Hs.1333ia ESTs 3.7 

10 415076 NM.000857 Hs.77690 guan^ cyclase l.sduble. beta 3 3.7 

452280 AI911410 Hs.167224 ESTs 3.6 

432189 AAS27941 gb.nh30cO4^1 NCLCGAPJ^3 Homo sapiens • 3.6 

406992 882472 gb:beta -poNONA polymerase beta {exon a 3.6 

441416 AI990139 Hs.148609 ESTs 16 

15 448776 BE302464 Hs.300S7 iyiRS2 (S. cefBvislaeHike,maoneskimhom 3.6 

413998 AW1038O7 Hs.243933 ESTs 16 

440385 AA884283 H5.192136 ESTs 16 

431873 AW971302 Hs.293233 ESTs 16 

401887 16 

20 404793 16 

422054 a!A322506 gb:EST25146 Cerebellum 11 Homo sapiens c 3.6 

432030 AI908400 Hs.143789 ESTs 16 

449645 AI961092 Hs.1961SS ESTs 16 

404476 3.6 

25 449336 AL119995 Ks.15260 ESTs, Highly sImRar to AC007228 2 BC372 16 

401200 16 

403937 16 

437918 AI761449 Hs.121629 ESTs 16 

443394 A105S865 Hs.13348S ESTs 16 

30 439107 AL046134 Hs.13944 adrenergic, beta, receptor kinase 2 16 

417229 AAg75096 Hs.19522 hypothetical protein PR02849 16 

425403 A1023753 Hs.156406 Human ONA sequence from done 11 g8H8 on 16 

436269 AA707472 Hs.ig0760 ESTs 16 

453823 AL137967 gb:(3KFZp761D2315j1 761 (synonym: hamy2) 16 

35 416394 H64111 gb7r57i03j1 Soaresfetalilver Spleen 16 

432779 AW979241 gb:EST391351 MAGE resequences. MAGP Homo 16 

439326 W07140 Hs.54721 ESTs 16 

423035 AW449679 Hs.156739 H.sapIensX6mRNA{donePEP11) 16 

435766 R11873 Hs.186498 ESTs 16 

40 448067 R68568 Hs.183373 stc homology 3 domain^ntainlng protein 16 

441605 AA984647 KS.12B801 ESTs 15 

414400 X08948 Hs.897 Fc fragment of IgE, high affinily I. roc 15 

416405 AI866282 H&11898 ESTs. Highly similar to K1AA1370 protein 15 

437642 AL079309 gb;Homo sapiens mRNAfulllength insert 15 

45 450350 T97817 Hs.174880 ESTs 15 

451704 AI755209 Hs.205616 ESTs. Weakly sintflar to ALU1_HUMAN ALUS 15 

459037 AW439497 Hs.290656 EST 15 

419247 S65791 Hs.89764 fragile X mental retardafkxi 1 15 

423121 AW864848 gb:PM2-SN0018-29030O«)3H«9 SN0018 Homo 15 

50 426724 AA383623 Hs.293616 ESTs 15 

434273 AA913143 Hs.26303 ESTs 15 

438042 AW296971 Hs^593 ESTs 15 

410500 R09442 gb:yt26c09/1 Soares tetal Over spleen 15 

416154 Z46122 gbi1SC0VB031 nonnallzed Infant brain cDN 3.5 

55 418432 M14156 Hs.65112 Insulln4kegrowlhfector1(somatomedi 15 

454447 6E163567 gb:QV3-HT0460.230200-101-b08 HTO460 Homo IS 

458087 AA393603 Hs.36752 protein kinase anchoring protein 6KAP42 15 

444338 AI937026 Hs.146842 ESTs IS 

427687 AW003867 Hs.1570 histamine receptor HI 15 

60 415929 AA724373 Hs.49344 hypothethat protein FU1 1006 15 

416009 Z43062 gb:HSC12E041 nomiaSzed Infant brafaicON IS 

421515 Y11339 Hs.105352 GaINAcaipha-2, S-slalyllransfBraseU 15 

403515 15 

435793 AB037734 Hs.4993 KIAA131 3 protein 15 

65 439953 AA918129 Hs.124638 ESTs 15 

457620 AA602711 Hs.336753 EST 15 

442006 AW975183 H3.292663 ESTs. Weakly simflar to S72482 hypotheli 15 

453931 AL121278 Hs.25144 ESTs 15 

453128 AW026516 Hs^1791 acylphosphatase 2. musde type 15 

70 413468 BE504766 gbJiz40g01j{1 Na_CGAP,GC6 Homo sapiens 15 

454600 AW810001 gb:MR4-ST0124-27030(M}05-b11 ST0124Homo 15 

451065 AW295132 Hs.222231 ESTs. Weakly similar Id granule ceO mar 15 

444493 R59410 Hs.282094 ESTs. Moderately sindlar to 138022 hypol 15 

426447 AV655843 Hs.169919 electron-transfer-flavopmtefai. alpha po 15 

75 410908 AA121688 Hs.10592 ESTs 15 

440364 AA9ia460 HS.12862& ESTs 15 

406190 15 

430762 AI3436S2 Ks.105667 ESTs IS 

451182 052552 Hs.295317 WAAI 789 protein 14 

80 432437 WO7088 Hs.293685 ESTs 14 

442137 AA977235 Hs.128830 ESTs. Weakly sinvlar to Z192_HUMAN ZINC 14 

405970 14 

407676 AW064111 Hs.279823 ESTs 14 
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413141 BE166323 gb:QV44{r0492-27010aO8&e12KT0492HQmo 3.4 

431418 X68242 H8.2S2722 Hbvl 3.4 

431954 AK001974 Hs.272242 hypolheQca! protein RJ1 11 12 3.4 

45S371 R20991 gb:yg06h01.r1 Soaros Infant brain 1 NIB H 14 

5 428062 AA420683 Hs.98321 hypotheScal protein FU1 4103 3.4 

423841 AW753967 gb:RC2-CT03044»01004)1U12CT0304Komo 14 

420430 AI703192 gl)nMd92h04j(1NCLCGAP.Ui24 Homo sapiens 14 

443921 AI091310 Hs,134848 ESTs 14 

444453 AW379394 Hs.145126 ESTs 14 

10 443475 AI086470 Hs.134482 ESTs 14 

414136 AA81Z434 Hs.119023 SMC2 (structural msfaUsnanoo of chrornoso 14 

453263 R91778 Hs.99369 ESTs 3.4 

410888 AW861207 gb:RC1-CT0302-120200«13Kl04CT0302Homo 14 

456303 AA224872 Hs.1 15088 ESTs 14 

15 431474 AL133990 Hs.190642 ESTs 14 

439702 AW085525 Hs,134182 ESTs 14 

458797 AW00183S Hs.13323 hypotheticd protein FU22059 14 

430140 AW2S6771 Hs.221999 ESTs 14 

423871 AA331906 9b:EST35805 Embryo, 8 week I Homo sapien 14 

20 459278 AW294659 H5^054 Honx) sapiens cDNA: FU22488tis. done H 14 

446672 T05514 gb:EST03403 Fetal brain, Stratagene (cat 14 

431548 Ai834273 Hs.9711 novel protein 14 

416182 NM.0043S4 Hs.79069 cyclin62 14 

422899 D16471 Hs.121571 Human mRNA,XqtennInaipo(6on 14 

25 417563 R07483 H8.180461 ESTs 13 

405455 13 

426235 AI631964 Hs.34447 ESTs 13 

439567 AI056618 Hs.134314 ESTs 13 

444848 AW451176 Hs.195954 ESTs 13 

30 451426 AW205003 Hs.208063 ESTs . 13 

408172 W0248& Hs.46039 prwsphoglyoerdemUtase2(musd8) 13 

401626 13 

4057B0 13 

417991 AA731452 Hs.190008 ESTs 13 

35 443212 AW269515 Hs,102500 hypotlwHcal protein FU20481 13 

403356 13 

404518 AI815501 Hs.79197 C083 antigen (activated B lymphocytes, i 13 

413581 BE150618 gb:RC3.HT0272-110100^13<!05HT0272Homo 13 

426701 Ai968103 Hs.209461 Homo sapiens cONAFU12636fis, done NT 13 

40 445510 AA946675 Hs.282824 ESTs 13 

418663 AK001100 Hs.41690 desmocolRnd 13 

447617 AI400762 lte.17667S ESTs 13 

448150 AI472167 Hs.302739 ESTs 13 

410140 AL134435 Hs.22269 nBurexln3 13 

45 443283 BE568610 gb:601342622F1 NIH.l^GC.53 Homo sapiens c 13 

454777 AW820027 gb:QVO-ST0294-240300-173fl04 ST0294 Homo 13 

410767 AJ001873 Hs.66185 Homo Sapiens mRNA, partial cDNA sequence 13 

433183 AF231338 Hs.222024 transcription factor BMALZ 13 

436168 AK000883 Hs.301645 Homo8aptenscONAFU10021fis. clone HE 13 

SO 438456 AA913381 Hs.190513 ESTs 13 

411186 AW821257 gb:PM:W0307.231299^U11 ST0307Homo 13 

411880 AW872477 gb:hni30H)lx1 NCLCGAP.TIiy4 Homo sapiens 13 

433567 AF073299 H3.103132 solute carrier famiiy 9 (sodium/hydrogen 13 

433805 AA706910 Hs.112742 ESTs 13 

55 409434 AF278761 Hs.131581 Homo sapiens tesSs transcript Y 7 (TIY? 13 

440164 AB002297 Hs.7022 dedicator of cytiMdnesis 3 13 

456555 AW592167 Hs.293299 ESTs 13 

419189 T95882 Hs.112318 6.2 kd protein 13 

428848 AF052728 Hs.188021 potassium vdtag&gated diannel, subfami 13 

60 407995 AI094748 Hs.100134 hypollietical protebi FU12787 13 

413200 AA127395 Hs.222414 ESTs 13 

416421 AA134006 Hs.79305 eukaiyotlctranslafionlnHialion (actor 13 

416737 AF154335 Hs.79691 LI M domain protein 13 

428356 AiJ)46991 Hs.10338 ESTs 13 

65 429216 AI369472 Hs.65407 ESTs 13 

432488 AA551010 Ks.216640 ESTs 13 

4333B6 AW360833 gb:P1^1-CT0243-201099^)04Ht08 CT0243 Homo 13 

400889 13 

416294 D86980 Hs,79170 K1AA0227 protein 13 

70 446190 Ai279299 Hs.256564 ESTs 13 

417801 AA4173e3 Hs.82582 Integrin, beta^Dce 1 (with EGF-OcB rep 13 

418122 R42778 Hs.22217 Homo sapiens done ilMAG&321 06, mRt^seq 3.3 

418375 NM-003081 Hs.84389 synaptosomal^sodaied pratdn. 25kD 3.3 

443367 AVWJ71349 Hs.215937 ESTs 13 

75 446645 AI336596 Hs.156294 ESTs 13 

434294 W271379 Hs.76194 ribosomal protdn S5 3.3 

452372 A1885742 H5.228474 ESTs 3.3 

414241 AA425085 Hs.4007 Sarcolemmal-assQdated ptclein 12 

436982 AB018305 H3.5378 spondin 1, ((«pondin)extracdIularmat 12 

80 430548 AW450575 Hs.163203 ESTs, WealdydmOarb 634087 hypoOieli 12 

427119 AW880562 Hs.114574 ESTs 12 

437073 AI885608 Hs.94122 ESTs 12 

437845 AA769576 Hs.90488 ESTs 12 
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454962 AWB47645 fibdLdCT0213-28010aOS&^CT0213HQmo 3.2 

414394 AI904738 H5.76053 OEAO/H (Asp^Sh^Ala-Asp/HIs) box polypep 3.2 

417175 R44556 Hs.94002 ESTs 12 

456536 AW135986 HS.2578S9 ESTs 3.2 

5 401132 3.2 

40n64 BE008347 gb:CM0-BN01544)80400^25-h04BN0154Hoino 3^2 

428004 AA449563 Hs.151393 glutamat&^ysteineDgase. catalytic sub 3L2 

450947 AI745400 H5.204662 ESTs 3.2 

456605 AI827786 H&.259044 ESTa 3.2 

10 452879 AW905328 Hs.180642 ribosomal protein L13 3.2 

454754 AW819191 gb:CM1-ST028^O712994)61-d08ST0263Hoino 3.2 

429479 AM53488 H5.99333 ESTs 3.2 

448090 AI808821 Hs.270289 ESTs 3.2 

401324 3.2 

15 404731 3.2 

419936 AI79276B gb:ol91d05.y5Na CGAP.KId5 Homo sapiens 3.2 

455571 BE003714 gb:QV3-BN0096-200400-161-a01 BN0096Homo 3.2 

433990 AA889328 Hs.112950 ESTs 3.2 

415239 R42608 Hs,139270 ESTs 3.2 

20 41B878 W20090 Ks.6616 ESTs 3.2 

438079 R09664 Hs.191223 ESTs 3.2 

422183 AA43169a Hs.112794 Human ONA sequence from done 1068E13 on 3.2 

457460 A1143312 Hs.129206 casein kinase 1. gamma 3 3.2 

454145 AA046872 Hs^2798 ESTs 3.2 

25 446S77 AB040933 K$.15420 KIAAISOOpfotein 3.2 

430664 AWg69834 Hs.303303 ESTs 3.2 

404588 3.2 

407834 AW08499t Hs.26100 ESTs 3.2 

413087 BE064655 gb:RC1-6T031»01299-01^c09 BT0313 Homo 3.2 

30 440790 AW593050 Hs.128580 ESTs 3.2 

452081 AW95a859 Hs.7514 Homo sapiens cDNA FU12141 fis. done MA 3.2 

421916 R34441 Hs.101007 Homo sapiens cDNA: aJ23546 fis, done L 3.2 

419261 X07876 Hs.89791 wingless^ IMMTVInlegration site fami 3.2 

419340 AA236S% Hs.87530 ESTs 3.2 

35 444771 AB023201 Hs.11912 KIAA0984 protein 3.2 

445233 AV653034 Hs.297559 ESTa 3.2 

457030 AI301740 H5.173381 dihydropyrimidinase^ 2 3.2 

408334 AW514652 Hs.321637 ESTs 12 

410085 AA428482 Hs.S8589 glycooenin2 3.2 

40 411018 AW813428 gb:MR3^T0192-010200.21&<H)5ST0192Homo 3.2 

403623 3.2 

432223 AA333283 Hs.121001 Homo sapiens, done IMAGE:3460280,mRNA 3.2 

444050 AW138295 Hs.135024 ESTs 3.2 

421036 AA810560 Hs.303577 ESTs 3.2 

45 401459 3.1 

404404 3,1 

450438 Ai696071 Hs.253800 ESTs 3.1 

414523 AU076633 Hs.76353 serine (or cysteine) proteinase Inhlbito 3.1 

419169 AW8519B0 Hs.262346 ESTs,VteaWy slmnartoS72482hypolheli 3.1 

50 441274 AW593781. Hs.131357 ESTs 3.1 

450785 AA852713 Hs.2S459 Homo sapiens, alpha-1 (VI) cdiagen 3.1 

452401 NM_007115 Hs.29352 tumor necrosis factor, elpha^uced pro 3.1 

400816 3.1 

410307 AF022913 Hs.62187 phosptiaSdyllnositd gtycan, dass K 3.1 

55 431906 AW328038 Hs.37486 ESTs 3.1 

440046 AW402306 Hs.6877 hypotheScal protein nJ10463 3.1 

450271 AI693900 Hs.200920 ESTs 3,1 

41S811 AA450191 Hs.172963 hypothetical protdn FU14624 3.1 

415273 Z39B40 Hs^29 ESTs 3.1 

60 450519 AAD100G6 Hs.224849 Homo sapiens cDNAaJ12583 fis. done NT 3.1 

451421 W16522 Hs.237689 Honno sapiens CDNAFU13539 fis. done PL 3.1 

446384 AB006624 Hs.14912 K1AA0286 protein 3.1 

436638 A!271945 Hs.134984 ESTs 3.1 

418079 R40058 Hs,6911 ESTs 3.1 

65 448466 AI522109 Hs.171066 ESTs 3.1 

448835 BE277929 Ks.11081 UBXdoniain«)nlainIng2 3.1 

415046 R40018 Hs.56400 ESTs 3.1 

448134 AI470790 Hs.34494 ESTs 3.1 

456027 BE327387 Hs.13913 K1AA1577 protein 3.1 

70 45B023 AW978161 HS.26B55S S'-7exoribonudease2 3.1 

417079 U65590 Hs.81134 tnterleultin 1 receptor antagonist 3Ll 

421308 AA687322 Hs.192843 leudne zipper protein IHCSGU 3.1 

414884 R54418 Hs.183745 liypoUieQcai protein aJ13456 3.1 

449138 AW294215 Hs.195631 ESTs 3.1 

75 455756 6E079307 gb:RC1-6T0623-1202D(M)11-g09ST0623HomD 3.1 

428170 K05530 Hs.12565 ESTs 3.1 

429878 AA460188 Hs.127263 ESTs 3.1 

455000 AW850283 Hs.324429 Homo sapiens cONAFU14015fis. done HE 3.1 

438369 n7886 Hs.83428 nudear factor of Icappalighlpdypeptld ai 

80 415840 R1S955 Hs.21758 ESTs 3.1 

444955 AW002844 Hs.148641 ESTs 3.1 

436020 AA778177 Hs.121724 ESTs ai 

453051 AWig6690 Hs.2242e9 ESTs 3.1 
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425178 
402145 
4108BS 
449238 
456737 
438214 
436250 
411622 
418454 
449357 
418950 
431508 



445409 

45Z778 

455577 

408235 

436194 

452073 

427050 

427244 

448405 

433767 

421376 

441519 

404367 

453502 

421948 

438165 

400608 

404042 

405229 

411411 

415452 

430371 

447046 

455851 

429014 

405605 

400227 

439037 

439693 

427533 

418355 

433536 

448446 

449623 

445568 

440448 

428201 

444148 

447972 

432S64 

440925 

428398 

415913 

418145 

413252 

400335 

426132 

436938 

437980 

455955 

414899 

403786 

430187 

451700 

455866 

445900 

457041 

415716 

422336 

451664 

407244 

455249 

428862 

406076 

405302 

400325 

408408 

423119 

424152 

431980 



H16097 Hs.161027 ESTs 

AM97117 Hs.129600 ESTs, Moderately sinAar to ALU^HUMAN A 

AA428229 Hs.331561 muscte>«pecliic RINGfinger protein 3 

BE247203 Hs.124831 CGI^protebi 

H08076 Hs.26320 TRABlDprot^ 

AY004667 Hs.B5844 neuTObopMc tyrosine kinase, receptor. 

Ai807894 Hs.47274 Homo sapiens mRNA;cONADKFZp564B1 76 ((r 

AA315308 H5.195870 hypolhetlcal protein RJ14991 

A)076363 Hs.288806 Honio sapiens cDNARJ1 1778 lis. clone HE 

n8517 Hs.13941 ESTs 

NM.012481 Hs.182979 rtt)OSomaipfotdnL12 

AI949081 Hs.147882 ESTs 

R71338 Hs^21 Homo sapiens cDNA:FU21592 (is, done C 

BE006341 gbftC2.BN0127-24030(M)114)05 6N01Z7 Homo 

AA053381 HS.759G9 praline-rich protelnivilh nuclear target 

AK001074 Hs.333435 Homo sapiens CDNAFU10212 lis, done HE 

AA625150 Hs.8209a ESTs 

AA397789 Hs.161803 ESTs 

AA402400 Hs.178045 ESTs 

AW207634 HsJ70649 ESTs 

AA609245 gb:af13a1U1 So8res.tes6s_NHT Homo sap 

AA287948 Hs.134110 ESTs 

AA972740 Hs.127092 ESTs 

Al^786 Hs^1273 transcription factor NYOspIO 

L425B3 Hs.334309 keratin 6A 

AA779344 Hs.138136 ESTs, Weakly similar to 1510254A LI rape 



AA345241 Hs.55950 ESTs, Weakly sinflar to KlAAl 330 protein 

F09134 Hs.12839 ESTs 

087466 Hs^40112 KIAA0276 protein 

AA3261 87 Hs.l7l70 G protein-coupled receptor 4 

BE146879 6bK3V4-HT0222.261 099^14^:11 KT0222 Homo 

A1600518 Hs.118150 ESTs 



AF075084 gb:Homo sapiens fuD ienglh insert cONA 

AI741816 Hs.125897 ESTs 

R36022 Hs.179566 iiypotheOcai protein aJ22624 

L42563 Hs.1165 ATPase. H*IK*- transporting, nongastric. 

AI732163 Hs.188909 ESTs. Weakly similar to altemativelysp 

A1521251 Hs.171030 ESTs 

C00719 Hs.120440 EST 

H00918 Hs^44 KIAA1796 prdein 

AA885428 Hs.125646 ESTs 

AA424156 Hs.206461 ESTs 

AW003204 Hs.151167 ESTs 

AL137275 Hs^0137 hypoUietk;al protein OKFZp434P01 16 

AA928829 Hs.47099 hypoUietical protein FU21212 

AW5110gO Hs.130419 ESTs 

Ai249368 Hs.98558 ESTs 

H70302 gb:yr95f07.r1 Soares fetal Ever spleen 

AF121260 H5^577 cysteine and glycine^ protein 3 (car 

BE074910 gb:RC5^TOS80-1703004)21-F12 BT0580 Homo 

Y13187 Hs^48067 Homo sapiens dmd gene. Intron 11 

AA370501 gb:EST82261ProstateglandlHoino88plen 

AW139680 H8.161393 ESTs 

R50393 Hs.278435 KIAA1 474 protein 

BE162394 gb:PM2-HT0451-170100ma08 HT0451 Homo 

AW975433 Hs^288 ESTs 

Ar799909 Hs.158989 ESTs 

AI470262 H5.29553 ESTs 

BE149024 gb:CMtWfT0249-291099^)84<04 HT0249 Homo 

AF070526 Hs.13429 Homo sapiens done 24787 mRNA sequence 

AA399016 Hs.250835 ESTs 

N59294 H$.179662 nucleosome assembly protein l-Oke 1 

AI761322 Hs.1 15285 dihydroiipoamWe S-acetyitransferase (E2 

AA8e9081 Hs.153952 5" nudeotidase (C073) 

M10014 Hs.75431 librinogen. gamma po^Sde 

AW876538 gb:RC3-PT0028-1901004)12-b08PT0028Komo 

NM_000346 Hs.2316 SRY (sex determining regton Y).bax 9 (ca 

AL390179 Hs.137011 Homo sapiens mRNA; d)NA 0KFZp547P134 (Tr 

M85292 H5.247924 Homo sapiens endogenous HIV-1 reiated so 

AF070571 Hs.44690 Homo sapiens done 24739 mRNA sequence 

AA322201 H1131976 ESTs 

AL133591 Hs.141480 Homo sapiens mRNA; cONADKFZp434N079((r 

AA523696 H8^4507 hypothetical protein FU20966 
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4257S3 AA383946 Hs.20969 ESTs 19 

401462 Z9 

456817 AI522129 Hs.173119 ESTs Z9 

422163 AFQ27208 Hs.1 12360 promiren (mouseHite 1 19 

5 419875 AA853410 H$.93557 proenkephaEn 19 

423047 NM.O0S323 Hs.123064 HI histonefamDy. member TOesSs-spec 19 

425349 AA425234 H$.79886 riboseS-phosphaleisomersseAfriboseS 19 

401358 19 

418531 R96760 Hs.183758 ESTs 19 

10 447290 AM76732 Hs.263912 ESTs 19 

4*1143 AI027604 Hs.159650 ESTs 19 

431292 AA370141 Hs.2281 chromogranin B (secretogranln 1) 19 

405783 19 

444459 A1680624 Hs.148676 ESTs 19 

IS 402112 R58824 Hs.2166 eukaryoGc transteHon elongation factor 19 

425745 U44060 Hs.14427 Homo sapiens cDHA:FIJ21B00 lis. done H 19 

444827 R09764 Hs.20416 ESTs 19 

451195 U10492 Hs.438 mesendiymo homeo box 1 19 

411417 AW845481 gb:MRKT005fr^20l199-OOaW4CT0056 Homo 19 

20 418343 AA216372 Hs.159501 ESTs 19 

431595 AA508196 gb:nh60f07.8l NQ^CCAP^PtS Homo sapiens 2.9 

436187 AK000998 Hs.297221 Homo sapiens cONAFU10136fis, done HE 19 

455699 BE068121 gb<»41-BT0368^61299ma02 BT0368 Homo 19 

459440 BE048054 gbiz46c03,y1 NCLCGAP_Bm52 Homo sapien 19 

25 428832 AA578229 Hs.324239 ESTs. Moderately similar to ZN91JHUMIAN Z 19 

423492 AF020761 Hs.129683 utiquitin^onjugating enzyme E2D 1 (tiomo 2.9 

424235 N^L003161 Hs.143507 T brachyufy (mouse) homolog 19 

437913 AI140a2S Hs.121623 ESTs 19 

443185 NVL006134 Hs.264142 dnomosome 21 open reading frame 4 19 

30 443458 1^5385 Hs.143509 liypothetical protein FU21924 19 

426803 AA362S68 Hs.17g747 ecotraptc viral intagraSon site 5 19 

437183 AI928184 Hs.122011 ESTs 19 

420879 N31165 Hs.236837 ESTs, Weakly 8)nutarbS43803RNAblndt 19 

442726 AW136066 Hs.19145 ESTs 19 

35 456189 H91010 Hs.44940 ESTs 19 

441115 R69910 Hs.29041 Homo sapiens cONAFUl4l77fiS. done KT 19 

435563 AF210317 Hs.95497 solute carrier family 2 (fadfitatedglu 2.9 

415628 F13080 gti:HSC3ID041 normalized In^tbr^ncDN 19 

423637 AL137279 Hs.130187 Homo sapiens mRNA;d)NADKFZp43401 21 4 (f 19 

40 443248 T75157 H5.337603 ESTs. Weakly similar to T08680hypotheU 19 

450877 A1799608 Hs.29178 ESTs 19 

439063 AF(ffi5922 Hs.1 13968 ESTs 19 

401526 19 

408751 N91553 Hs.258343 ESTs 19 

45 417320 AA195667 Hs.86022 ESTs 19 

442927 A1024347 Hs.131519 ESTs 19 

444125 AI124882 Hs.118121 ESTs 19 

452148 AF007143 H8^205 Homo sapiens done 23738 mRNA sequence 19 

453901 BE085902 gb:RC2^T0318-150200^11-W)9BT0318Homo 19 

50 452589 6E159915 Hs.61406 ESTs. Weakly simiiar to 2004399Adiit)mO8 18 

403011 18 

436154 AA764950 Hs.1 19898 ESTs 18 

408221 AA912183 Ks.47447 ESTs 18 

430345 AK000262 Hs.239681 hypothetteal protein FU20275 16 

55 415399 T26994 Hs.177198 ESTs 18 

441617 AW96g706 Hs.293332 ESTs 18 

443556 AA256769 Hs.94949 mefhy!maionyM3oAepimerase 16 

455092 BE152428 gb:CMO-HT0323-161299^126^HTO323Honn 18 

439703 AF086538 Hs.196245 ESTs 18 

OO 411024 BE082590 gbK2V1-BT0260>281099mfD5 BT0260 Homo 18 

414545 BE379492 gb:601236215F1 Nia.MGC.44Homo sapiens c 18 

434715 BE005346 Hs.116410 ESTs 18 

407594 AW057584 Hs.160681 ESTs 18 

^_ 439235 N45513 Hs.46608 ESTs 18 

65 453736 ALt18874 Hs^l zbic finger homeobox IB IB 
404957 IB 

437783 AI683150 Hs.2015S0 ESTs, We^sbnRv to AUUIJIUiylAN ALUS 18 

412887 BE007420 gb:PM3-BN0142-200300XI01-d)4BN0142Homo IB 

426942 AA393551 Hs.97450 ESTs 18 
70 403513 18 

419077 AA233885 Hs.164526 ESTs 18 

421823 N40850 Hs.28625 ESTs 18 

425664 AJ006276 Hs.1S9003 transient receptor potential diannel 6 18 

451007 H38108 Hs.32759 ESTs IB 

75 407803 AW081681 Hs^69064 ESTs, V\teaMy similar to T42689hypothetI IB 

409842 AW450809 H$.257347 ESTs 18 

439492 AFC86310 H3.103159 ESTs 18 

420814 AA721156 Hs.190440 ESTs 18 

on 449508 AK001566 Hs^3618 hypotheOca! protein FU10704 18 

oO 428350 AW873520 Hs.112017 GE36gene 18 
405456 18 

442459 A1264628 HS.12S42B ESTs 16 

415763 Z4228S Hs.5181 proTilisralton^ssodat&d 2G4. 38kD 18 
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428S32 AF157326 Hs.184788 TBP^terscQng protin Z8 

43S720 AW97S902 gb£ST3880t1 MA(^ cesequeoces, MA6N Homo Z8 

449539 W80363 Hs.5e446 ESTs Z8 

^ 415533 T74009 H5.268738 ESTs. WedMy similar toAUJTJtUMAN ALUS Z8 

0 408749 H65489 H5^50659 ESTs Z8 

404652 2.6 

423130 AW897586 Ks.21213 ESTs Z8 

424960 BE245380 Ks.153952 5* nudeoOdase {0073} Z8 

402131 ZB 

10 419530 X98330 Hs.g0821 fyanodlne receptor 2 (canfiac) Z8 

456118 AA380267 Hs.78277 DKI=ZP434F2021 protein Z8 

444217 AV648751 Hs.282395 ESTs 2.6 

449579 AW207260 Hs.134014 ESTs. WeaMysimDark)T46425hypothetl 16 

412323 AW937143 gb:PM14)T0041.281299^01.i01 0T0041 Homo 2.8 

IS 418912 NM_0006e5 Hs.89472 sngbtensin receptor 1 Z8 

433513 AIS663S6 Hs.171437 ESTs 2.6 

448912 083781 Hs.22559 KIAA0197 protein 2.8 

451496 AW503407 gb:Ul'HF^N0^)cvi«J.1V0-Ul.r1 NIH.MGC_50 2.8 

420273 AI652854 Hs.l97257 ESTs 18 

20 451949 U03884 Hs.463 potassiuin Inwanlly-rectifying cttannel. s Z8 

420756 AA411800 Ks.189900 ESTs Z8 

423532 BE090503 gb:RC6^0717.110400>011.F11 6T0717Honio Z8 

425012 T77668 H5.92414 Homo sapiens cONA: FU22030 fis, done H Z8 

441609 AA946764 Hs.133460 ESTs 2.8 

25 448870 BE1817B3 Hs.175358 ESTs. Weakly simHar to A475B2BK«Ugr 2.8 

451206 K86228 Hs.271780 ESTs, Weaidy similar to 138022 hypothefl 2.6 

457314 AA479597 Hs.193669 Irypoihetical protein DKFZp566J1 119 Z6 

416192 N^L005036 Hs.9S8 peroxisome proliferafive acfivated recep 2.8 

418888 AU076801 Hs.69436 cadherin 17. U cadherin (Iiver4ntasttn Z8 

30 455310 AW893961 gb:RC4^N0027>080400^11-d11 NN0027Homo Z8 

459450 AA426429 Hs.9a453 EST Z8 

424188 AW954552 Hs.142634 zinc finger protein 2.7 

423575 C18863 Hs.163443 Homo sapiens cDNAFU11576fi5. done HE Z7 

427443 AA402713 H5.97872 ESTs 2.7 

35 452092 BE245374 Hs.27842 iiypolheticaipraiein 1^11210 Z7 

413091 BE06S063 gb:RCl-BT031 3-1 105004)17^ BT0313 Homo Z7 

421003 T72080 Hs.95667 F-box protein 30 Z7 

429593 AKD00332 Hs.209927 Homo sapiens cDNAFU20325fb, done HE Z7 

445611 AW418497 Hs.145583 ESTs Z7 

40 412494 AL133900 Hs.792 ADP-ribosytatlon factor domain protein 1 Z7 

408243 Y00787 Hs.624 InterleiWnB Z7 

407308 H67394 H3.331325 ESTs, Weaidy simOar to 138022 ItypotheO Z7 

423728 AWB91294 Hs.132138 sdute carrier family 4. sodium bicarbon Z7 

404587 M99587 Hs.104134 homaobox(H6fandly)1 Z7 

45 410483 BE163567 gb:QV3W0460-230200-l01-«)8HT0460Ho(no 2.7 

416431 AW384459 Hs.172004 tilin 2.7 

416805 F13271 Hs.79981 Human done 23560 mRNA sequence Z7 

417177 rAl.004458 Hs.81452 «ly«;id-OoenzymeAligas8.lon&<^ Z7 

427134 AA398409 Hs.173561 EST 2.7 

50 428137 AA421792 Hs.170999 ESTs Z7 

430844 T94960 gb,7e38d07.r1 Slralagenelung (937210) H Z7 

441218 BE327561 H$.202345 ESTs Z7 

440911 AA909536 Hs.143S62 ESTs Z7 

411131 AW819212 ^:CM1-ST028347129»4}81'G07 610283 Homo Z7 

55 438602 AI167149 Hs.123374 ESTs, WbaMy similar b mariner (ranspos Z7 

441191 AI693930 Hs.14e816 ESTs Z7 
403776 Z7 

420158 AIS72490 Hs.99785 Homo sapiens cONA*FU21245 lis. done C Z7 

427839 AA608823 Hs.98244 ESTs Z7 

50 429905 AL080128 Hs.225998 OKFZP434C1 53 protein Z7 

449398 BE169100 Hs.195029 ESTs Z7 

450777 AA255646 Hs.60478 ESTs, l^rately similar Id S47073finoe Z7 

456043 AW97g009 H5.326108 ESTs Z7 
405523 2.7 

65 434849 AW292765 Hs.8053 ESTs Z7 

452755 AW138937 Hs.213436 ESTs. Weaidy similar to A34067liypotlieti Z7 

438055 AA776655 Hs.270942 ESTs Z7 

420906 AU)49974 Hs.100261 Homo sapiens mRNA; cONADKFZpS64B222(fr Z7 
_^ 405738 Z7 

70 417806 AI887277 Hs.183733 ESTs Z7 

430698 AA492071 gb:ne97b04.8l NCI_CGAP.Kid1 Homo sapiens Z7 

441969 Ai733386 Hs.129194 ESTs. Weaidy similar to ALU1_HUIWIAN ALU 8 Z7 

446092 N33522 H3.145894 ESTs 2.7 

456689 6E467912 Hs.154294 discs, large (Diosophila) twmotog 1 Z7 

75 413617 BE1S5373 Hs.279518 amytoid beta (A4)precuisor-fike protein Z7 

444931 AV652066 Hs.75113 general transcrtpiionfadorUIA Z7 

412236 AW902583 gbK}V3-NNl024-260400-17U10 NH1024 Homo Z7 

453264 AA034137 Hs.271955 ESTs 2.7 

438370 AAB43242 H8.48523 ESTs Z7 
80 406092 Z7 

454874 AW836407 Hs.270143 extracellular glyooprotelnEMIUN-2prec Z7 

455860 BE153208 ob:PMO-HT033S<»50400007-F10HT0335Komo Z7 

459275 AI606913 Hs.339352 Hbmo sapiens bfottier of COO (BOC) mRNA. Z7 
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411887 AA37S975 Hs.183380 ESTs. Moderately slndar to ALU8.HUMAN A 2.7 

441884 AW172630 Hs.144884 ESTs 2.7 

416211 R1462S gb:yg4Sc03.r1 Soareslniantbr^ 1N1B H Z7 

433128 AB021923 Hs,23357 EST-YD1 protein 2.7 

5 452259 AA317439 Hs^8707 signet sequence receptor, ganunaltransto Z7 

453896 AIS89482 Hs.146286 klne^ family fn6niber13A Z7 

456208 AW299698 Hs.334625 Homo sapiens cDNAaJ14890fis. done PL 2.7 

425876 AWD05887 HS.2340S8 ESTs 2.7 

450458 AA009928 Bb:zl07e05.r1 So3res.fBtBlJlver^pleen. 2.7 

10 406603 17 

410161 AI468210 Hs.261285 i^totropic regulator 1 (PRL1. Arabblops tl 

410871 D78387 Hs.65739 keraHn 12 (Meesmann cornea! dystrophy) 2.7 

412706 R97106 Hs.167545 ESTs 17 

422897 AA679784 Hs.4290 ESTs 2.7 

IS 436329 AI798750 Hs.163960 Homo sapiens heat shock transcripiion (a 17 

436679 AI127483 N9.1204S1 ESTs. Weakly sindlar to unnamed protein 17 

455992 BE179015 Qb:RC3-HT0612-OB050a0mi0KT0612Homo 17 

452S94 AU076405 Hs.29981 soluto cantor family 26 (sulfate transp 17 

419298 AA236115 Hs.l20785 ESTs 17 

20 454747 AW818535 gb:RC1 -ST0278-1 40300^ W05 ST0278 Homo 17 

455791 BE0906B9 gb:RC1^T0720-28030W)11-«)8 BT0720 Homo 17 

411409 AW844803 gb:RC3-CN0056-1 70300^1 5-f08CN00S6Hon» 2.7 

426662 AA879474 Hs.122710 ESTs 17 

400268 17 

25 438782 AA828380 Hs.126733 ESTs 17 

443764 F23283 gb:HSPD22980 HM3 Homo sapiens cDHA done 17 

412486 AF210650 Hs.150858 NAG19 protein 17 

411514 AW85D17B gb:IL3-CT0219-271099^22'Ht2CT0219Homo 17 

457900 AW976692 Hs.291665 ESTs 17 

30 417376 AA253314 H3.154103 UM protein (sNIar to rat proteto Una 17 

426682 AV660038 Hs.2056 UDPglycosyttransferasel family, polype 17 

435608 AW183971 Hs.250896 ESTs 17 

413627 BE162082 Hs.246973 ESTs 17 

432415 T16971 Hs.289014 ESTs. Weakly simSar to A43332mudn 2 p 17 

35 445660 AI702666 Hs.201955 ESTs 17 

441396 AW293677 Hs.186890 ESTs 16 

452046 AB018345 Hs.27657 K1AA0802 protein 16 

454936 AW846082 gb:MR3^017»)B1099^HK)1CT0176HQmo 16 

454434 AA083558 Hs.261286 ESTs 16 

40 436888 AI942357 H5.187B70 ESTs 16 

431613 AA018515 Hs.264482 Homo sapiens mRNA;cDNADKFZp761AD411 a 16 

408812 BE397160 Hs.254763 ESTs. Weakly similar to A42442into9rin 16 

416690 K84078 KS.1085S1 ESTs 16 

436471 AA719813 H3,117662 ESTs 16 

45 425659 AK000590 Hs.158836 hypolhetk:dl protein FU20583 16 

426237 AK001104 Hs.168241 hypothelteal protein FU 10242 16 

458257 U48351 Hs.201219 ESTs. Weakly simitar to S18946dtrahig 16 
455544 AW993880 gb:RC3-BN0034-240400-017-d09BN0034 Homo 16 

407494 U10072 gb:Human forkhead family (AFXI) mRNA. pa 16 

50 452821 AW471181 Hs.160874 ESTs 16 

434222 AF119886 Hs.283941 HomosaptensPR02591 lnRN^completoods 2.6 

429864 AA460039 Hs.288 ribosomal protein L4 16 

456273 AF154846 Hs.1148 zinc linger protein 16 
402603 16 
55 411162 AW819944 gb:QVO.ST0294-240300-172.e03ST0294Homo 16 

420621 AA278808 gb3879c09.n NCLCGAP.6C81 Homo sapiens 16 

435113 AA665469 Hs.117136 ESTs 16 

438188 AA779975 Hs.128859 ESTs 16 

438295 A1394151 Hs.37932 ESTs 16 

60 450181 H0S254 Hs.201198 ESTs 16 

433764 AW753676 Hs.39982 ESTs 16* 

433229 AB040925 Hs.gi625 KlAAl 492 protein 16 

443718 A!083580 H3^1373 ESTs 16 

418246 Ai472179 H5.121276 ESTs. Weakly similar to R5HU7A ribosomal 16 

65 453930 AA419468 KS.367Z7 hypothetted protein aJ10903 16 

400365 Y10259 Hs.274501 H.8aptensACTH receptor mRNA 3mR 16 

419318 AW969742 HS.29100S ESTs 16 

428527 A!902398 Hs.34492 Cyt19 protein 16 
404414 16 

70 446444 AI743737 Hs.24370 ESTs 16 

411354 AW992424 Hs.288141 hypothetical protein MGC3 156 16 

417918 AA209205 Hs.163754 hypothefical proton FU 12606 2.6 

418310 AA814100 Hs.86693 ESTs 16 

454481 AW794878 Hs.31423Q ESTs, Highly siraSar to dod((Ksapiens 16 

75 >W1216 BE299830 Hs.192908 ESTs 16 

438257 AW474419 Hs.224794 ESTs 16 

442264 AI278777 H5.263455 ESTs, Weakly similar to AUJ1,HUMAN ALU S 16 

419505 AA243660 Hs.143061 ESTs 16 

417596 R07343 Hs.226823 ESTs, Moderately simflar to 154374 gene 16 

80 443555 N71710 Hs.21398 ESTs, Moderately stoiOar to A Chain A. H 16 

444517 Ai939339 Hs.148883 ESTs 16 
454867 AW835924 gb:PMUT0018>25020l>O02-e09LT()01BHomo 16 

455870 AW45263t H5.313803 ESTs, Highly simitar to AF1576331nQnd 16 
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457630 A1680803 Hs.112627 ESTs 16 

424210 T71397 Hs.222707 KIAA1718 protein 16 

447748 AI422023 Hs.161338 ESTs 16 

_ 411970 AA099142 Hs.13804 hypothelica! protein dJ462023.2 16 

5 441233 AA972965 Hs.135568 ESTs 16 

400706 16 

436033 H75391 H$;!55748 ESTs 16 

440836 AW370882 Hs.222080 ESTs 16 

431QB6 AI829692 Hs^11561 ESTs 16 

10 455110 BE154505 gb:PM044T0343-28129»«03^HT0343Ho(no 16 

455678 BE066007 gb:RC^T0319-12020(M)144098T0319Han)o 16 

413088 BE064962 gb:RC1^T03 13>1 304004)1 &«02BT031 3 Kono 16 

436196 AK001084 Hs.33349B Homo sapiens cDNAFU10222fis. done HE 16 

4373S6 BE140398 Hs.21621 hypothetica) protein DKFZp762O076 16 

15 432374 W68815 Ks.301885 Homo sapiens cONAFU 11 346 fis. done PI 16 

442690 AI014727 Hs.160047 ESTs. Weakly sinniar b B28096 fino-1 pr 16 

441700 AA233556 Hs.126908 hypottietical protein FU12994 16 

410286 AI739159 Hs.61&98 DKFZPS86N21 24 protein 16 

^rv 2.6 

20 429761 AI276780 Hs.135173 ESTs 16 

437085 AA743935 Hs^02329 ESTs 16 

450822 AW771860 Hs.205130 ESTs 16 

457506 AF131757 Hs.274533 Homo saptens done 24926 tnRNA sequence 16 

416565 X54162 H&79386 leiomodin 1 (smooth miisde) 16 

25 430357 AW976769 Hs.165607 ESTs 16 

417249 N58198 Hs.1B2898 ESTs 16 

423554 M90S16 Hs.1674 glutamine-ftitctos&^phosphatetransamin 16 

440400 AA994364 Hs.125594 ESTs. similar to T25472hyptfthe6 16 

440460 H92571 Hs.234478 Homo sapiens cDNA:RJ22848 lis, done H 16 

30 446302 AI285848 Hs.149757 ESTs 16 

424012 AW368377 Hs.137569 tumor protein 63 l(Da with strong homoiog 16 

426944 AA780181 Hs.41182 Homo sapiens DC47mRNA, complete cds 16 

419547 AA34B947 Hs.91816 hypolheBcal protein 16 

455500 AW963582 gb:EST375655 MAGE resequences, MAGH Homo 16 

35 419435 AI200540 Hs.14877 ESTs. Weakly similar to (deiiine not ava 16 

452450 AW854891 H3.194720 ATP-Wnding cassette, 8ui>4amily G (WHIT 16 

434699 AA643687 Hs.149425 Homo sapiens cDNAFU11980fis. done HE 16 

436421 A1676031 Hs.122813 ESTs, Weakly simHar to ZN2^.HUMAN ZINC 16 

447505 AIJ049266 Hs.18724 Homo sapiens mRNA; cONA DKFZp564F093 (fr 16 

40 419758 U31973 Hs.93173 pliosphodiesterase 6C cGMP-spediic. con 16 

428698 AA394104 Hs.97489 ESTs 2.6 

446861 At696519 Ks.14427 Homo sapiens cONA:rU21800 lis, done H 16 

423025 AA831267 Hs.12244 hypothe&:al protein FU20097 15 

447624 AI640326 H3.62713 ESTs 15 

45 411736 AW859089 gb:MR1-CT0350-15020(M)02-d02CT0350Homo 15 

416334 H53139 Hs.36271 ESTs 15 

446818 AI342668 Hs.279765 ESTs 15 

454836 AW833711 gb:QV4-TT0008.2511994)43<11TT000BHomo 2.5 

442278 A!733477 Hs.156313 ESTs 15 

50 453393 AW956392 Hs.110376 ESTs 15 

420854 AW296927 gb:UKK6WD'9)&<>07<0>UL8l NQ.CGAP.8u 15 

408729 AA1957&4 Hs.72639 ESTs 15 

455675 BE085984 gb:RC3^T031 9-1 2020001 4^06 BT031 9 Homo 15 

411660 AW855718 gb:RC1-CT0279.070100-021-806 CT0279 Homo 15 

55 455252 AWS76627 gb:RC34>T0026-12020OOia<)11 PT0028HQmo 15 

409156 N76186 Hs.173518 Mphase phosphopntetn homolog 15 

423175 W27595 K8.18853 hypofheticd protein FU14627 15 

430291 AV660345 Hs.238126 CGI-49 protein 15 

401785 15 

60 402369 15 

439079 AF085937 Hs.38348 ESTs 15 

412566 AW962574 gb:EST374647 MAGE resequences, MAGG Homo 15 

411463 AW847645 gb:iL3<:T0213-280100^5&-A04 CT0213 Homo 15 

413758 BE162391 gb:PM241T0451-09010(M}02^ KT04S1 Homo 15 

65 404988 IS 

409446 AI561173 Hs.67688 ESTs 15 

412613 AA653507 Hs.285711 hypothetical protein nJ13089 15 
417909 R3S614 gbvg66e08j1 Soares infant brain 1NIBH 15 

454743 AWB18456 H5.79347 KIAA0211 genepradud 15 
70 406364 2.5 
404108 2.5 
411934 AW876538 gb:RC3-PT002B-1901(KM)1 2^)06 PT0028 Homo 15 

415747 AA381209 gb:EST94257AdIvatedT4eilslHomosap 15 

443526 AW792B04 H3.134002 ESTs 15 

75 415319 AA659823 Hs.34gS5 Homo sapiens cONAFU13485fIs, clone PL 15 
454854 AW835775 gb:QV4-LT0016-240200-1ia<i04 l.T0016Homo IS 

458771 AW295151 Hs.163612 ESTs 15 
414349 6E512988 gb:601172^F1 NIH_MGCJ5 Homo sapiens c 15 

426589 AW954460 gb:EST366530 MAGE resequences. MA6C Homo 15 

80 429515 ALi)31228 Hs.204370 ONA segment on chromosome 6 (unique, pse 15 

443814 AV655386 Ks.7645 fibrinogen, B beta pdypeplide 15 
411772 BE170301 gb:QV4HT063&O4050O>193«5HT0536Homo 15 

434784 AA649051 H$.1640a7 ESTs 15 
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429322 D86984 Hs.199243 KIAA0231 prolBin 2.5 

446252 AI2B312S Hs.150009 ESTs 2.5 

453361 AA035t97 Hs.107375 ESTs 2.5 

- 455275 AWg77806 sb:EST389810 MAGE resequences. MAGOHomo 2^ 

S 449410 AA001356 H8.18159 ESTs 2.5 

451403 AAB85569 Hs.40919 Homo sapiens COKARJ14511 lis. done NT 2.5 

458861 AI630223 gbredOBgOM PraSlMno Eiyttuoid Ce 2.5 

416944 N22809 gb.7w41e07£l Wtizmann OlfactOfy Epitliel U 

423010 W25436 Hs.90725 ESTs, Moderately similar to 138022 hypot 2.5 

10 412505 AA974491 Hs.21734 ESTs 2^ 

446399 AI298405 Hs.150080 ESTs ^5 

412139 BE044976 9b:hn25b10j(1 NO.CGAP.Thy? Homo sapiens 2.5 

403691 2^ 

424025 AI701852 Hs.301296 HQmo&apienscONA:nJ23131 fis.clone L Z5 

IS 420352 BE258835 ob:601117374F1 NIH_MGCJ6 Homo sapiens c 15 

422342 AA309272 gb:EST1 80209 Uver, hepatocellular cad Z5 

447343 AA256641 Ks.2366g4 ESTs, HlgMy similar to S02392 8lpha-2-m 15 

457770 BE065030 Hs.124179 ESTs 2.5 

427731 AA411750 H3.20943 ESTs 15 

20 426920 AA393351 H5,132121 ESTs 15 

427794 AA709186 Hs.282963 ESTs 15 

429903 AL134197 Hs.93597 cyclirHtependent kinase 5. regiilatorysu 15 

454190 AW177821 gb:lL^T0059-160899W-G05 HT0059 Homo 15 

414550 BE379808 gb:601159567ri NIK.MGC.S3 Homo sapiens c 15 

25 436391 AJ227892 Hs.l46274 ESTs 15 

401989 15 

423346 A1267677 K5.127416 synaptojanin 1 15 

444905 AW135863 Hs.209228 ESTs 15 

424539 L02911 Ks.150402 actMn A receptor, lypet 15 

30 400861 15 

458426 A1084514 Hs.249587 ESTs 15 

429520 AA160142 H5.205058 hypolhellcal protein FU20075 15 

403568 15 

430692 X80240 ob:H.sapIens endogenous letrovirus HERV- 15 

35 451078 A1927694 Hs.204470 ESTs 15 

424560 AA158727 Hs.150555 protein predicted by done 23733 15 

427888 AA417088 Hs.137598 ESTs 15 

425541 AA359119 gb:EST68172 Fetal hing II Homo sapiens c 15 . 

422840 U44059 H5.121481 thyrotrophic embryonic factor 2.5 

40 404708 15 

405008 15 

453772 BE281431 Hs.16323 Homo sapiens, Similar to G antigen 8. d 15 

411036 AA657218 Hs.297007 membran&teundtranscripfiontodorprol 15 

4M575 AI264847 H3.22545 Homo sapiens CONAFU12S35 lis, done NT 15 

45 449311 A1657014 gbil49a1 1x1 NCI_CGAP„GC6 Homo sapiens 2.5 

454Z77 AW295C69 H5.31743 ESTs. WeaHy similar to Z157_HUMAN ZINC 2.5 

454566 AW807605 gb:MR4^T0098-1 201 0OW1-b06 ST0098 Homo 15 

454597 AWB09548 gb:MR4^T01 24-261 099^01 5k» 1 ST01 24 Homo 15 

416208 AW291168 Hs.41295 ESTs. WeaWy similar to MUC2.HUMAN MUQN 14 

SO 407851 NM14496 Hs.40434 ribosomal pmtebS6 kinase. OOkO. polyp 14 

446554 AA151730 Hs.3Q1789 nudU (nudeoskto diphospiiats Gnked moi 14 

452850 H23230 Hs.22481 ESTs. Moderately sin^ar to A46010X-lin 14 
405468 14 

407300 AA102616 gb2n43e07j{1 StratageneHeLacen63 93 2.4 

55 408817 R61736 H5.124128 ESTs 14 

409627 AW99762B Hs.313637 ESTs 14 

416665 H72974 gb:yu28a10.s1 Soares fetal Over spleen 14 

417404 NM.007350 Hs.82101 pledstrinhomotogy-Cke domain, family 14 

418994 AA296S20 Hs.89546 seledbiE {endothelial adhesion motecut 14 

60 428709 BE268717 Hs.104916 hypothellcd protein FU21940 14 

429654 A1435046 H8.164318 ESTs 14 

432253 AW090822 Hs.274174 transcription elongatton factor (Sill) e 14 

439786 AV652707 Hs.33756 Homo sapiens mRNA full length Insert d)N 14 

445432 AV553771 gb:AV653771 GLC Homo sapiens cDNA done 2.4 

65 453052 R63050 Hs.223613 ESTs 14 

454137 AW500340 Hs.313876 ESTs, We^ similar to 138022 hypothett 14 

459608 AL119471 gb:DKF2p761M141_rl 761 (synonym: haniy2) 14 

452843 AI796769 H3.20B320 ESTs 14 

433222 AW514472 Hs.238415 ESTs. Moderately similar to ALUSJiUMAN A 2.4 

70 449695 AA164569 HS.34S50 ESTs 2.4 

431532 AI537617 Hs.270311 ESTs, We^ similar to ALU1.HUMAN ALUS 14 

425987 NM 007159 H$.40O7 SarcolemmaMssodated protein 14 
400541 14 

430982 R17432 Hs.22217 Homo sa{dens done IMAGE:321 06. mRNA seq 14 

75 432808 NM.015985 Hs.278973 8ng»poieSn-3 14 

410845 AW807182 Qb:MR4.ST0062-16020»X)1-b04ST0062Homo 14 

411561 H81154 Hs.285017 hypothetksl protein FU21 799 2.4 

421083 AA283628 Hs.298016 ESTs. WeaMy slmDar to 138022 tiypottieU 14 

423513 AF035960 Hs.129719 transglutaminaso 5 14 

80 434627 AI221894 Hs.39311 ESTs 14 

435663 AI023707 Hs.134273 ESTs 14 

455679 BE153275 Gb'PMO4n0335-18040&0084l1 HT0335Homo 14 

451797 AW663858 Hs.333513 small tadudUe cytokine subtemilyE me 14 
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409041 AB033025 Hs.50081 KIAAn99 protein 2.4 

423244 AL039379 HsmQ2 ESTs.W6aUy8ln«arloubiquiIou5TPRm Z4 

453874 AW591783 Hs.36131 col^en. type XIV. alpha 1 (undulin) tA 

4488B9 BE140902 gb:tL1-HT0028-240699«)l-C11 HT0O28Homo 14 

5 439481 AF086294 H5.12S644 ESTe 2.4 

412074 S74663 Hs.73139 AOP-iibosyttransferase 1 14 

403053 RSB624 Hs.2t86 eukayoilctranslafionelongtf on factor 2.4 

409298 AA070211 gtezm6Bc04.8l Slralagene neuroepi«um Z4 

411322 AW887330 Hs.172405 cefl dhrtsion cyde 27 2.4 

10 447640 AI4171B7 ab:ig75g11.x1 Scares JIhHMPu_S1 Homo sapl 2.4 

447849 AI538147 Hs.164277 ESTs 2.4 

458763 AI693417 Hs.293309 ESTs 2.4 

404638 2.4 

413986 Z43567 gb:HSC1FC021 nonnafizedlnfanl brain cDM Z4 

15 407721 Y12735 Hs.38018 duakpedfidty lyrosineKYHihosphoiyl 2.4 

422321 AA90&427 H5.181035 hypothetical protein MGC1 1298 U 

408238 W95901 g1)Zfi43d11.r1 SoaresfetinaN2b4NRHonio 2.4 

436747 AW977192 Hs.291343 ESTs 2.4 

437048 AA743240 Hs.91582 ESTs 2.4 

20 413143 BE067232 9b:PM3-BT0347.17020^001-l)05 BT0347 Homo 2.4 

404561 2.4 

444009 A1380792 Hs.135104 ESTs 14 

400250 2.4 

403891 2.4 

25 417002 T79613 Hs.14613 ESTs 2.4 

439445 AI927629 HS.57B73 ESTs 2.4 

441227 AW295407 Hs.1 28893 ESTs 2.4 

445038 AI635444 Hs.143917 dJ467N1 1.1 protein 2.4 

455107 BE154113 gb:PM1-HT0340-15129«)03-a08HT0340Hotno 2.4 

30 ' 458624 AI362790 Hs.278639 WAAI 684 protein; likely homolog of mous 2.4 

459344 AW499533 Hs.257976 ESTs 2.4 

452605 AW9585S7 Hs.90012 hypothetical protein FU23441 2.4 

457652 AF116656 Hs.273609 Homosapi8nsPR01167mRNA. complete ods ZA 

450068 AW207212 Hs.280925 ESTs 2.4 

35 444750 AW242684 Hs,243S23 ESTs 2.4 

414591 A188a4go HS.5S902 ESTs, Weakty simitar to AlUB.HUMAN ALUS 2.4 

407264 IJ4727 gb:Homo sapiens T-ceH receptor beta (TC 2.4 

443169 AK)38687 Hs.133338 ESTs 2.4 

426538 A1949749 Hs.44441 ESTs 2.4 

40 449752 A166B626 Hs.61773 Homo sapiens cONAHJl 1648 (is. done HE 2.4 

459592 AIJ037421 Hs.208746 ESTs. Moderately similar to poL ORF 1 1 2.4 

429504 X99133 Hs.204238 Dpocalin 2 (oncogene 24p3) 2.4 

429063 AW363e45 Hs.122142 ESTs.Weak^8imll8rtoA46010X-linked 2.4 

430484 D82880 Hs.241548 RASp21 protein activator 2 14 

45 423673 BE003054 Hs.1695 matrix metsdioproleinase 12 (macrophage 14 

447375 A!376660 Hs^57822 ESTs 2.4 

444230 H95537 Hs.146067 ESTs 14 

439911 AA8M024 Hs.189110 ESTs 14 

421296 NM-002666 Hs.103253 perilipln 14 

50 449385 A1650471 Hs.270370 ESTs 14 

430044 AA464510 Hs.152812 ESTs 14 

427131 AA448460 Hs.112017 GE36gene 14 

409103 AF251237 Hs.112208 XAGE-1 protein 14 

421354 AA7664B5 Hs.269664 ESTs 14 

55 423740 Y07701 Hs.293007 amlnopeptidasopurorayclnsensHlvo 14 

440048 AA897461 Hs.328737 ESTs, Weakly similar to envetope protein 14 

441358 AW173212 Hs.129041 ESTs 14 

453857 AL080235 Hs.35861 DKFZP585E1621 proteto 14 

414290 AI558801 H5.71721 ESTs 14 

60 427342 AL1101S0 Hs.176680 Homo sapiens mRNA;cDNADKFZpS86O0724(f 14 

459459 AA460445 gb3x66h11.r1 Soaros_totatJetus_Nb2HF8. 14 

434638 K50758 gb7p86e06/1 Scares fetal iiver spleen 14 

442717 1^88362 Hs.180591 ESTs, W^aWyslntilarloT23976hypothefl 14 

419537 W27493 gb:31h10 Human retina cDNA randomly prim 14 

65 431169 AW971240 gb£ST383329 MAGE resequences. MAGL Homo 14 

449432 AW451361 Hs.196529 ESTs 14 

458734 AI554946 Hs.158794 ESTs 14 

449529 AI990559 Hs.232033 ESTs 14 

426088 AF038007 Hs.166196 ATPase, Oass I, type 88, member 1 14 

70 420195 N44348 Hs.26243 Homo saptonscDHAFU 11177 lis. done PL 14 

418105 AW9374B8 Hs.178000 ESTs, Weakly sImBar to FV1 MOUSE IWENO 14 

430957 /«937072 Hs.55043 Homo sapiens cDNAFU 13277 lis. done OV 14 

418188 AW139413 Hs.151880 ESTs 14 

__ 424103 NM_001918 Hs.139410 dihydroQpoamkte branched chain Iransacy 14 

75 454324 AW608930 HS.S2184 hypolhellcd protein FU2061 8 14 

437369 AA76S230 Hs.121742 ESTs 14 

453211 WB4829 gba:h53f04x1 Scares jBlaUlverjBpleen. 14 

451099 R52795 Ks.25954 Inteiteukln 13 receptor. Alpha 2 14 
400462 2.4 

80 413697 AA131315 Hs.47144 OKFZP586N0819pioteJn 14 

421755 AW169454 Hs.207422 ESTs. Weakly simBar to S71949metallopr 14 

424195 U50536 Hs.142907 Human BRCA2regIan,mfU4A sequence CG011 14 

434183 AW974720 Hs.25206 group XII secreted ph05pho(ipa5eA2 14 
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435985 AA703154 Hs.191934 ESTs Z4 

449916 T60525 Hs.299221 pynivata dehydrogenase kinase. Isoenzyme 2.4 

458661 AI293789 Hs.166999 ESTs. Moderately similar to 138344 «in Z4 

459023 AW968226 Hs.60798 ESTs 2.4 

3 406005 2.4 

456561 AI868634 Hs.246358 ESTs. similar to T32250hypotheB Z4 

452161 R43077 Hs.221747 ESTs ^4 

436590 AI393115 Hs.127655 ESTs 2.4 

^_ 430151 AW968203 gb:EST380398 HMGE resequences. MAGJ Homo Z4 

10 445635 AI76g774 Hs.209831 ESTs. Weakly simHar to ALUI.HUIulAN ALUS 2.4 

433479 AW511459 Ks.249972 ESTs 2.4 

441676 BES64206 Hs.49889 ESTs Z4 

407965 W21483 Ks.41707 heat shock 27kDprOteln 3 2.4 

450682 Z42993 Hs.25320 Homo sapiens done 25142 mRNA sequence Z4 

15 452958 AA883929 Hs.40527 ESTs 2.4 

454032 W31790 Hs.194293 ESTs. Weakly simitar to 154374 gene NF2 tA 

405347 2.4 

440577 AAB89945 Hs.326381 EST i4 

455780 BE088828 gb:a42-8T0e93-230300.129^09BT0693Homo 2.4 

20 457024 AA397546 Hs.119151 ESTs Z4 

404249 2.4 

437511 A1807500 Hs.125247 ESTs Z4 

421338 AA287443 gb:z852c10.f1 NCLCGAP.GCB1 Homo sapiens 2.4 

425146 AW954627 gb:EST3666971kAAGEfesequences.lkMGCHomo 2.4 

25 428277 AA425220 Hs.179203 ESTs 2.4 

444870 AI200621 Ks.148504 ESTs 2.4 

402090 2.4 

458507 AI185703 Hs^6957 ESTs 2,4 

_^ 443054 Ar745165 Hs.8939 yes^sociated piotein 65 ki3a Z4 

30 446534 AI307356 Hs.175226 ESTs i4 

453111 AB014598 Hs.31720 hephaestin 2.4 

405230 ^4 

405935 2.4 

413642 BE154837 gb:PM1-HT0345-121199^01-c0BHT0345Homo 2.4 

35 420724 AA279694 Hs.191540 ESTs 2.4 

436998 AA745625 Hs.29i414 ESTs. We^simOar to AlU8„HUft4AN ALUS 2.4 

445748 U80766 Hs.13252 Human EST clone 22453 mariner transposon 2.4 

434283 AW235341 Hs.58715 thiamine pyn)phosphdHinase 2.3 

407404 AF040257 glKHomo sapiens TNF receptor homologmRN Z3 

40 440621 AW296024 Hs.150434 ESTs 2.3 

423417 AP000365 Hs.128342 potassium large conductance caSdum^ 2.3 

424131 AA335714 Hs.199665 ESTs 2.3 

450737 AW007152 Hs.203330 ESTs 2.3 

453687 T55674 Hs.283108 hemoglobin, gamma G 2.3 

45 442704 AI016463 Hs.130987 ESTs 2.3 

457756 AA126136 Hs.38125 Interfero^lnduced protein 75, 52kD 2.3 

412732 AW993300 gbJlC2-8N0033-1 80200-01 5^06 8140033 Homo Z3 

418998 F13215 Ks787849 ESTs.WeaMysimaartoT22074hypotheti 2.3 

419751 AW195581 Hs.93121 KiAA0761 protein Z3 

50 429485 AW197086 Ks.99336 ESTs 2.3 

433377 AI752713 Hs.43B45 ESTs 2.3 

434896 AW022054 Hs.136591 ESTs 2.3 

441675 Ai914329 Hs.5461 ESTs 2.3 

444711 AI188739 Hs.148488 ESTs 2.3 

55 445621 AI733818 H3.145549 ESTs 2.3 

449182 AW292381 Hs.224150 ESTs 2.3 

430987 Y08564 HsJ248190 UI}P-N-acetyMpha-[H3)3Ctosam)ne:poiyp Z3 

404068 2.3 

414366 BE549143 gb:601076456F1 NIH_MGCJ2 Homo sapiens c 2.3 

60 438315 R56795 Hs.82419 ESTs Z3 

423161 AL049227 Hs.124776 Homo sapiens mlW;cDNADKFZp564N11 16 (f 2.3 

447998 AI768289 Hs.304389 ESTs 2.3 

410160 AW382942 Hs.6774 ESTs 2,3 

432792 AA448114 Hs.278950 protocadhertn beta 1 . 2.3 

65 443363 AI792629 H5.133293 ESTs 2.3 

440729 AA904739 Hs.128204 ESTs 2.3 

411045 AW854891 Hs.1 15325 RA67. member RAS oncogene family-like 1 Z3 

459207 AW138410 Hs.45051 ESTs 2.3 

459124 AW301478 Hs.184592 protein kinase, lysine deftotont 1 Z3 

70 456664 BE2&1115 H8.988S5 hypoQiettoal protein RJ20909 Z3 

427962 AAS4S562 Hs.8700 deleted in Over cancer 1 Z3 

401899 23 

432116 AA902953 Hs.308538 ESTs 2.3 

404196 2.3 

75 410999 AW813004 gb:RC3-ST018&^230300^»h028T0186HonK} Z3 

413308 W28131 gb:42f7 Human reOnacDNArandondy prime 13 

430264 AA470519 gb:nc71f10.s1 Na.CGAP_Pr1 Homo sapiens tZ 

443482 AW188093 Hs.250385 ESTs 2.3 

453305 R39224 Hs.267997 EHiWCgene 2.3 

SO 451963 AI825440 Hs.224952 ESTs 2.3 

453043 AW136440 Hs.224277 ESTs 2.3 

435559 AF209198 Hs.42636 zinc finger proldn 277 2.3 

440727 AI073991 Hs.134268 ESTs, WesfUy similar to 2109260ABcen Z3 
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434120 AI436060 Hs,143937 ESTs 2.3 

429768 AA80S719 Hs.192154 ESTs 2.3 

425292 NML005824 Hs.t55545 37 kDa leudnwfch repeat (LRR) protein 2.3 

_ 4S5B41 BE145836 gb:MR04fr0208*1 01 299-202-608 HT0208 Homo 2.3 

5 411093 BE067650 ob:MR4-BT0358^004)03-e01 610358 Homo 2.3 

430706 NML003S40 Hs.247816 mMstDnafemay.meniberC 2.3 

428268 AA424957 Hs.294132 ESTs Z3 

458833 AW236702 Hs.171431 ESTs, We^ similar to A46010)Unked 2.3 

452215 AK002043 Hs.28472 (rypolhelical protein FU1 11 81 13 

10 444109 AI124553 Hs.48965 Homo sarins cONA: nJ21693 lis. clone C 2L3 

428411 AW291464 Hs.10338 ESTs Z3 

433098 AW190593 Hs.151143 ESTs 2.3 

424882 AI379461 Hs.153636 bru)slrBametemefa (FUSE) binding prot Z3 

453178 AA498088 HS.6164B ESTs 2.3 

15 404569 2.3 

413841 M34276 Hs.75576 piasntiogen 2.3 

424068 U50531 Hs.138751 Human BRCA2regim,mRNA sequence OG030 2.3 

433532 AW975367 gb:EST387475 MAGE resequences, MAGN Homo Z3 

^. 442710 AI015631 H5.23210 ESTs 2.3 

20 444206 AW301017 Hs.146492 ESTs 13 

451264 AI758235 9bw982fl08.x1 SoarBS_NSF.F8.9W_OTJ>A^P_S 13 

4547B4 AW820626 gb:RCO-ST0299-190100412-e10ST0299Homo 13 

429080 AA446228 H3.99057 ESTs 13 

404166 13 

25 416327 R99822 Hs.36172 ESTs 13 

400631 AF173937 Hs.109494 secreted protein of unknomfuncUon 13 

436504 AW665261 Hs.224625 ESTs 13 

435325 Ai038388 Hs.1 19309 ESTs 13 

421253 AI188102 Hs.31028 ESTs 13 

30 427046 BE246180 Hs.121385 ESTs 13 

432711 AA563785 Hs.152465 ESTs. Wealdy slnflar to 138022 hypothefi 13 

439715 AA524504 Hs.42612 ESTs. WealdysimSar to ALU1_HUMAN ALUS 13 

441398 AA932396 Hs.292036 ESTs.VteaMysiniil8rtoB34087tiypolhefi 13 

4*8458 AW614387 H5.171054 ESTs 13 

35 452542 AWei2256 ob:RCO-ST0174-191099^1-a07 ST0174 Homo 13 

417768 R24732 Hs.175139 ESTs 2.3 

427374 AI150033 Hs.143686 ESTs 13 

446847 T51454 Hs.82845 Homo sapiens cDNA:aJ21930 Us, done H 13 

423600 AI6335S9 Hs.310359 ESTs 13 

40 413006 W03857 Hs.34298 ESTs 13 

434698 BE044674 gb;hm46f01x1 Na_CGAPJV)F1 Homo sapiens 13 

407639 AW205369 Hs.312e30 ESTs 13 

455121 BE156459 gb:QVO-HTD368^010»)82408HT0368Homo 13 

448117 H49129 Hs,172982 ESTs 13 

45 443931 H23213 H3,22657 ESTs 13 

450795 AW173371 Hs.60435 ESTs 13 

418532 AW1 18745 Hs,9460 Homo sapiens mRNA: cDNA DKFZp547C244 (fr 13 

419441 AW023731 Hs.274368 MSTP032 protein 13 

455067 * AW854538 , gb:RC3-CT0255-200100-024-b02 CT0255 Homo 13 

50 418291 BE300369 Hs.289038 hypolheiical protein MGC41 26 13 

455964 BE166924 gb:CM4HT0501 -240300-51 d-iOl HT0501 Homo 13 

445944 K06338 Hs.13480 Homo sapiens done 24875 mRNA sequence 13 

424827 AI057094 Hs.98867 Homo sapiens cONA:FU23155fis, done L 13 

449272 AW1376S6 Hs.197645 ESTs 13 

55 445292 AV653264 Hs.139B2 Homo s3pienscDNAFU14666fis, done NT 13 

415131 D61119 gb:HUM158C11Baonlech human fetal brain 13 

444715 AV650947 Hs.282454 ESTs 13 

439560 BE565647 Hs.74899 hypothetical protein FU12820 13 

444140 AV648089 Hs.282383 ESTs 13 

60 423949 AI014546 Hs.130912 ESTs 13 

428434 AW363590 H3.65S51 Homo saptens. Similar to DMA segment. Ch 13 

445711 T79611 H3.193891 ESTs 13 

424565 AW102723 Hs.75295 guanytato cyclase 1, soluble, alpha 3 13 

455201 AW947884 gb:PM14ra)10-20030W)01-g08 MT0010 Homo 13 

65 429180 AA806287 Hs.58893 ESTs 13 

418849 AW474547 Hs.S3S65 Homo sapiens PI&MiARNA tor mannosyllran 13 

425523 ABX7948 Hs.158244 KiAA047g protein 13 

416509 N57713 Hs.260899 ESTs.i^toderately8lmBartoZN91_HUhUNZ 13 

419337 AW291112 HsJi0997B ESTs 13 

70 419699 AA248998 Hs.173M4 ESTs. Weakly similar to 138022 hypotheO 13 

428976 A1J037624 Hs.194695 rashomolog gene family, member I 13 

436294 AA708310 gb2g07b07.s1 So8res_pine8d^jl3nd_N3HP6 13 

458925 R15891 Ks.261587 Human (done CT&A4)mRNA sequence 13 

433939 AL133887 Hs.254122 hypotheScal protein 13 

75 450048 AI693269 Hs-202273 ESTs 13 

451640 AA195601 Hsl6771 Human DNA sequence from done 747K23 on 13 

416677 T63470 HS.334&40 ESTs, Moderately dmiiar to 178885 serin 13 
405920 13 
_^ 405747 13 

80 412105 H07971 Hs.94319 VPS10 domain receptor protebi 13 

420457 AA482280 Hs.191656 ESTs 13 

407726 AA435679 HS.88S94 ESTs 13 

423720 AL044191 Hs.23388 hypothetieal protein DKFZp434F0318 13 
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409517 X90760 Hs.120036 treporin I. cardiac Z3 

435352 At056599 Hs.120693 ESTs 2.3 

43SB71 R8S518 Hs.46736 hypolhaOca) protein FU23476 13 

444098 AV647969 Hs.109894 K1M1451 protein 22 

5 449276 AW241S10 Hs.252713 ESTs Z3 

417712 AA205S69 Hs.194193 ESTs, Moderately sttnilarbALUI.HUMAN A Z3 

449015 AIJ038958 Hs.22868 protein tyrosine phosphatase, noiwecept ZZ 

411377 AW841462 gb:RCS>CN001408030(M)12-€09CN0014Homo 13 

429276 AF056085 H5.198B12 G proteifrcoupled receptor 51 13 

10 411816 AW864509 ob:PMa^N0017-240300401-h03SN0017 Homo 13 

455280 AW886156 ob:RC50T0076-15030(M>21-E08 OT0078 Homo 13 

407809 AW082279 Hs.244106 ESTs 13 

420478 AA521259 Hs.193796 ESTs 13 

424073 U03493 Ks.138959 gap )unction protein, alkiha 7. 451d) (con 13 

15 445117 AI208754 Hs.147369 ESTs 13 

459390 BE385725 gb:601276347F1 NIHJM6C_20HQmo8apten$c 12 

420230 AL034344 H5.284168 forlchead box CI 12 

411517 AWB50267 gb:llX:TO219-161199<K31-A09CTQ219Honu> 12 

403678 12 

20 457003 S78234 Hs.172405 ceD division cyde 27 12 

404531 Z2S884 Hs.121483 chlorfdechann^ 1 .steletaimuscie(Th 12 

423045 AW987472 Hs.163302 PCTAIRE protein Mnase 2 12 

409427 AVtf369668 flb:RC2-ST0168471299-013WST0168Homo 12 

434745 AW974445 H5.185155 ESTs, Weaidysln«ar to T1 2482 hypolheB 12 

25 400696 12 

407259 L02256 gb:HuTnan Fab fragment binding syncytial 12 

411893 R82845 Hs.273789 ESTs 12 

426192 AA4240S1 Hs^42 ESTs 12 

435634 T82384 0b7c14fD5.n Stratagene lung (93721Q) H 12 

30 438018 AK001160 Hs.5999 hypolhetical protein FU1 0298 12 

458303 AI264628 Hs.125428 ESTs 12 

405692 12 

403572 12 

415380 F07953 Hs.16085 putaSve G-protdn coupled receptor 12 

35 433014 Nl^014711 Hs.279912 KIAAD419 gene product 12 

417859 T26453 gb:AB214F6R Infant brain. LLNL array of 12 

456472 AK001714 Hs.95744 hypolheUcaS protein similar to ankyrin 12 

444106 AI123922 Hs.138215 Homo sapiens cONAFU11400 Us, Clone HE 12 

428231 U17g89 Hs.183105 nuctear auteantigen 12 

40 454086 AW885909 Hs.6975 PRO1073 prolan 12 

425071 NM.013989 Hs.154424 deiodinase. iodofhyroiine, type II 12 

415348 H65887 Hs.272163 ESTs 12 

403780 12 

414262 AW975616 Hs^1469 ESTs 12 

45 419423 D26488 Hs.g0315 KIAA0007 protein 12 

442076 AW268583 Hs^62629 ESTs 12 

452975 M85521 Hs.244482 Homo sapiens, done IMAGE:361 171 9, mRNA. 12 

419216 AU076718 H8.164021 smaD inducible cytokine subfamily 8 (Cy 12 

^- 416568 H66558 gb7u16e04.r1 Soares fetal liver spleen 12 

50 425368 AB014595 Hs.15S976 culiin4e 12 

425686 M73531 Hs.1937 fednai degeneration, stow (roQnitis pi 12 

441638 AW293202 Hs.133451 ESTs 12 

446845 AI343645 Hs.156108 ESTs 12 

-,_ 422563 BE299342 H5.19348 hypolheCcd protein FU131 19 12 

55 436574 AW293527 H5.126465 ESTs 12 

424564 H10692 Hs.13310 ESTs 2.2 

456347 U00803 Hs.89426 fyn-fdated kinase 12 

446901 AI347274 gb±rf»d01x1 NQ^CGAP^Col 6 Homo sapiens 12 

459364 W69284 gb2d46d)3Jl SoaresJetaLheartNbHHISW 12 

60 430886 NI\1.001942 Hs^ desmogteini 12 

414831 M31150 Hs.77439 protein kinase. cAMP^ependent. regulate 12 

425707 AF115402 Hs.11713 E74^ factor 5 (els domain transcript 12 

403525 12 

453343 AA905353 Hs.121622 ESTs 12 

65 421574 AJ0001S2 Hs.105924 defen5in,beta2 12 

449327 AI638743 Hs.224672 ESTs 12 

454769 AW819848 gb:(3V(^m»4^0300-151-b04ST0294Komo 12 

420493 A1635113 Hs^0386 ESTs. Weddy similar to r7688S6eitneAh 12 

401614 12 

70 404767 12 

403534 12 

410694 AW770778 Hs^81238 ESTs 12 

436193 AA7060S9 Hs.2552e6 ESTs 12 

439626 N22415 Hs.189060 ESTs 12 

75 456481 AA258033 H5.IO8IIO 0KFZPS47E2110 pioteto 12 

441453 AW176106 Hs.285459 ESTs 12 

424946 M54572 Hs.153932 protein tyrosine phosphatase, non^ecapt 12 

437332 AA814943 gb»c07d06^1 NCLCGAP.GCSI Homosaptens 12 

454419 AA082211 Hs^936 myosin, fight polypeptide, (ogutabry. n 12 

80 416225 AA577730 Hs.188684 ESTs. Weakly siniilar to PC4259 ferritin 12 

450579 AW138774 H$.48614 ESTs 12 

400664 12 

447613 ALD41057 HSw33363 DKFZP434NQ93 protein 12 
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402669 AK001334 HS.1S470 putative ring zinc anger pnM NY-REN U 

430884 AF053748 Hs.24B114 gOalceB derived neurotrophic factor Z2 

432797 AAS65264 Hs.136443 ESTs 2.2 

405608 2.2 

5 42636S AA376667 Hs.t0283 RNAtjindingmoQr protein 8B 2.2 

40S634 2.2 

423646 K02364 sb:yJ35d06j1 Scares placenia Nb2Hi> Homo 2.2 

434690 AI887679 Hs.148410 ESTs 12 

436572 AA723274 Hs.279596 ESTs 2.2 

10 447044 AF030107 Hs.17165 regulator of GixoteinsignallinB 13 2.2 

448828 A1S802SS Hs.174782 ESTs. WeaUysimBaf to KIAA1437 protein 2.2 

457802 T76013 Hs.167279 FYVE4inger-containIngR8t»5 effector pro 2.2 

444585 AW170015 Hs.6594 ESTs 2.2 

433781 AA609379 Hs.192083 ESTs 2.2 

15 450587 A1828854 H5.2S8538 stiiafin. catmoduOn^lndino protein 2.2 

434077 AF116659 H5.321151 Homo sapiens PR014 12 mRNA.CQ(nptetecd& 12 

448756 Ar739241 H3.171480 ESTs 12 

430388 AA356923 Hs.240770 nuclear cap bindino protein subunU 2. 2 12 

454471 AWg02125 gb:QVO.NN1022-120500>220^12 NN1022 Homo 12 

20 419107 AW085152 Hs.292987 ESTs 12 

455114 AW857121 gb:RCl-CT0302.0404(XM)17^12 CT0302 Homo 12 

416548 H62953 gb:yr47f06^1 Scares feta) liver spleen 12 

454117 BE410100 Hs.40368 adaptoMelaled protein oonvlexl.slgma 12 

456056 AA463550 Hs.337532 ESTs, Weakly similar to A47582B<e!lor 12 

25 409998 M78345 Hs.98265 KIAA1 877 protein 12 

422352 AA766298 Hs.99200 ESTs 12 

409191 AW818390 Hs.175613 homdogofXenopus Qaspin 12 

433919 AA746311 gb:oa56d12jl NQ.CGAP.GCBl Homo sapiens 12 

455771 BE084820 H5.186711 hypothetical ptoleln FU20070 12 

30 431632 AK000992 Hs.333144 Homo sapiens cONAFUIOISO lis, clone HE 12 

454716 AW850584 gb:lL3<n-0219-16020OO53-D12 CT0219 Homo 12 

413752 BE161607 9b:MR3-HT044&^00300>20M1 HT0446 Homo 12 

458037 AF074982 Hs.226031 ESTs, Highly similar to ICIAA0535 protein 12 

434239 AF119910 H$.283047 hypolhetical protein PR02964 12 

35 435133 AJ010482 H8.31412 Homo sapiens cDNAFU11422 lis. clone HE 12 

442772 AWS03680 Hs.5957 Homo sapiens done 2441 6 mRNA sequence 12 

400697 12 

455685 BE066976 gb:Pkff)^0340-211299<)03«12BT0340Homo 12 

447039 AV681798 H5.282915 ESTs 12 

40 404593 12 

42Z728 AW937826 Hs.103262 ESTs. Weakly similar to ZN91_HUMAN ZINC 12 

421976 A1138443 Ks.23450 mitochondrial ribosomal protein S25 12 

401673 12 

425001 U55184 Hs,154145 hypolheticai protein FU1 1585 12 

45 447816 NM_007233 Hs.274329 TP53 target gene 1 12 

416143 Ai955650 Hs.79033 glutaminyt^eptldecyclotransfer8se(glu 12 

419118 AA234223 Hs.139204 ESTs 12 

426261 AW242243 Hs.168670 peroxisomal farrwsylaled protein 12 

449808 AA694220 Hs.15403 ESTs. Moderately similar to ALU7_HUMAN A 12 

50 454749 AW818849 gbJ«:i>STO278^004iaeO2 ST0278 Homo 12 

456933 AA383946 Hs.20969 ESTs 12 

402942 12 

437064 AI023264 gbx}v84h08.8l SoaresJbstisJ^HT Homo sap 12 

458623 AI30S223 Hs.148056 ESTs 12 

55 415257 F03016 Hs.27513 ESTs 12 

426269 H15302 H3.168950 Homo sapiens mRNA; cONAOKFZp566A1046(f 12 

44Z783 AI017586 Hs.131181 ESTs 12 

444313 A1140494 H5.197955 KIAA0704 protein 12 

453444 AL036531 gb:DKFZp564l1152j1 664 {synonym: hfbr2) 12 

60 422757 AI90993S H8.GS551 Homo sapiens. SlmOar to DNA segment. Ch 12 

430013 AA463833 Hs.151275 EST8,\AteaMy8lmtlartoTRHY^HUMANTRICH 12 

437138 AI935622 H8.271245 ESTs 12 

406298 12 

409723 AW885757 Hs.257e62 ESTs 12 

65 414481 AW4S19S6 Ks.8383 biomodomain adjacent to zinc finger done 2.2 

433266 A1863224 Hs.31476 Homo sapiens cDNAFU13872fis. done TH 12 

435090 BE217923 Hs.149595 ESTs 12 

457187 AA443927 Hs.144350 EST 12 

445061 AI253094 Hs,145227 ESTs 12 

70 442817 AW340093 Hs.130536 ESTs 12 

438298 H23542 Hs.1 81788 ESTs 12 

454916 BE067246 gb:PMl-8T0348-151299^1^04 BT0348 Homo 12 

428017 AA424983 Hs.98312 ESTs 12 

451149 AL047586 Hs.10283 RNAl)lnrilngmoW protein 8B 12 

75 418076 R61388 Hs.6724 ESTs 12 

403306 NM.006825 Hs.7436e transmembrane protein (63kD). endoplasmi 12 

441811 AI073546 Hs.164597 ESTs 12 

434763 AA548618 gb:ns07all.r1 NCl.CGAP_Ewl Homosaptens 12 

447453 AW608645 Hs.18800 hypothetical protein FU20281 12 

80 420931 AF044197 Hs.100431 small tedudtito cytokine 6 sub(amiiy(Cy 12 

415424 Z44766 gb:HSC28G081 nomidzed intent brain cDN 2.2 

408332 H91230 H5.234794 Homosaptens mRNA; cONADKFZpS64B083(fr 12 

421216 AV649282 Hs.102654 vesicto^ssociated monbrane protfldn 4 12 
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429609 AR)02246 
448700 BE614182 
457741 BE044740 
437927 A1039789 
401B94 

423531 AW752782 
424419 AKX1563 
436640 AA724411 
438290 AA843719 
445908 R13580 
455735 BE161124 
458455 AV648310 
430680 AW138724 
447147 AA910353 
424063 NM_002019 
441874 AA970389 
44604S AJ29743& 
433629 R13140 
415266 AA164199 
440633 AI140686 
442789 AW904361 
417563 AA203701 
407788 BE514g82 
401240 

408908 BE296227 
411151 AW866497 
414275 AW970254 
436992 AA741074 
439634 W79377 
411770 N^L014278 
400040 

458762 AW802754 
424736 AF230877 
419953 6E2671S4 
410648 AW792909 
423717 AA330035 
436683 AW991278 
445225 AI216555 
410991 AW8127g0 
412639 AW9612B4 
447777 AI424223 
451270 AW341392 
404526 A1912555 
452492 BE063096 
417154 AI674701 
428152 AA422030 
442312 A1820617 
456513 AA279143 
430712 AW044647 
441445 AI221959 
420288 AW071225 
412329 AW937445 
447033 AI3S7412 
436853 BE328074 
455188 AW864176 
430699 BE018217 
458356 AI024BS5 
457040 N77624 
424480 AA341442 
403317 U02687 
406018 

410566 AA373210 
413801 M62246 
415871 R55995 
416747 AW876523 
417725 R25257 
424856 AA347746 
439474 AI824060 
446895 AA16665S 
448582 AI538880 
452783 AA028167 
442430 R89164 
428908 AW303529 
427335 AA448542 
428336 AAS03115 
419290 A1128114 
416961 AA190926 
439950 AW937417 
458227 Z40670 
447179 AW015633 
454950 AW847460 
404453 

420844 AAS95522 



Hs.210863 ceBadhesbnffloleailewtthhoinolQgyto 
HS.1Z3075 ESTs 

gbdiin55g10ji1 Na.0GAP.RDF1 Homo sapiens 
HS.2S982 hypotri80calpraleihFU21031 

Hs.129750 hypotheOca) protein FU10546 

Hs.146589 hypQlheaca)pntelnFU10701 

Hs.155065 ESTs 

Hs,122341 ESTs 

H5.13436 Homo sapiens done 24425 mRNA sequence 

gb:PM(>4fT0425-14129^1-A06 HT0425 Homo 

Hs.213488 ESTs 

Hs.168974 ESTs. Highly similar to ALU7„HUMANAUJ S 

H5.292815 ESTs. Wealdy similar to T23482 hypolheti 

Hs.1 38671 fms^elated tyrosine Idnase 1 (vascular 

Hs.128055 ESTs 

Hs.20166 pioslate stem ceD antigen 

Hs.13359 ESTs 

Hs.270152 ESTs 

Hs^63320 ESTs 

Hs.1311 91 ESTs. Wealdy similaT to ALi;7_HUMAN ALU S 

gb3x52a10jl Soares_fet8l_liver_spleen. 

Hs.36991 S100 caldimv^xndino protein A2 

Hs.250822 serlne/lhteonine Idnase 15 

gb:QV4<$N0024-170400-17&«07 SN0024 Homo 
Hs.689 Charot-Leyden crystal protein 
Hs.120750 ESTs 

Hs.t67 (Ttoottdiule^oclated protein 2 
Hs.71992 heat shock protein (lispl 10 family) 

gb:IL2-UI^0076^0a061-H01 UM0076Homo 
H5.152701 ni)CR)(Ubule4nlerac6ng protein that ass 
Hs.125752 ESTs 

gb:CMO-UM0001-010300^2S8^UM0001 Homo 
Hs.152003 ESTs 
Hs.57787 ESTs 
H5.202398 ESTs 

gb:RC3^0186-141299^4^8 ST0186 Homo 
Hs.296235 ESTs 

gbte95a05jcl NCLOGAP.Pr28 Homo sapiens 
Hs^35795 ESTs 

Hs.157195 peptide YY, 2 (seminalptasmtn) 

gb:CM4aT025W)91 1994)39-802 BT0266 Homo 
Hs,21388 ESTs 

Qb:zv26h05.r1 SoarBS,NhHW^_S1 Homosapl 
Hs.1 29216 ESTs. Weddy similar to ALULHUMAN ALU S 
Hs.88561 ESTs 
Hs.196284 ESTs 
H8.1B7937 ESTa 
Hs^45556 ESTs 

gb:QV3-DT004»)9020a08O«09 DT0043 Homo 
Hs.157601 ESTs 
Hs.148661 ESTs 

gb:PM0^t4-26O400mb08 SN0014 Homo 
HS.1B3528 hypothetical protein FU14g06 
H3.131575 ESTs 

Hs.173717 phosphaBdIcacidphosphalaselypeTB 
HS.20S2S9 ESTs 

Hs.385 In&rolaled^rosine kinase 3 

Hs.43047 Homo sapiens cONA FU13585 Qs, done PL 

Hs.35406 ESTs, Highly similar to unnamed protein 

Hs.283309 ESTs. liltoderateiy simnar to ALULHUMAN A 

Hs.15929 hypoth6ttealprotdnFU12910 

Ks.21503 ESTs 

Hs.9521 ESTs. sMIar to ZN43 JIUMAN ZINC 

Hs.211501 ESTs 

HS.282B03 ESTs 

Hs.94812 ESTs 

Hs.61486 ESTs 

Hs.48320 double rlng^nger prol^ Dorfin 

H3.144955 ESTs 

Hs.251677 Ganfigen7B 

Hs.1 83752 mlcrosemlnoproleln, beta- 

Hs.1 12885 spina] cord-derfved growth factor-B 

Hs.ig0785 ESTs. Moderatety similar to S65657 alpha 

Hs.293561 ESTs 

H$.181340 ESTs 

Hs.157299 ESTs 

gb:RC3-CT0208-270999-O2lHBO4 CT020B Homo 



gbuih22o09x1 Na.CGAPJ>r1 Homosa)^ 
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426456 AASB0748 
428822 W28418 
430879 BE149423 
4445S4 A)168422 
4462S8 AAS8S662 
453853 AUMOGOO 
414083 AL121282 
401645 

436577 W84774 
427469 AA4O3084 
409168 N94037 
410276 At554545 
443372 AI792557 
422093 AF151852 
402333 

409374 RB7083 
412011 NM_00O406 
412798 AW998&57 
416085 H18072 
418378 AW962081 
437846 AA773866 
452374 AL037405 
450061 AI797034 
450180 AW449644 
405120 

407378 AA299264 
458890 AW865523 
435600 A1J047034 
440964 A)733106 
417455 AW007066 
436461 AW511956 
436777 AA731199 
427521 AW973352 
413646 BE155042 
413231 D87461 
423969 AI830571 
411518 AW850246 
443777 AV646510 
416148 H22453 
402528 

431215 AA496078 
436820 A1684535 
446209 At375025 
453362 H1498B 
417430 AA984546 
401069 

454078 AA501518 
410966 AW812088 
447124 AW976438 
449939 T86420 
411693 AW857271 
438005 BE151746 
443466 NM_003428 
407884 BE075316 



HS.1306S8 ESTs 

Hs^715 potesshmivoUag&fated channel, Isk-rel 

Hs.10554 hypolhefiCBl protein nJ12761 

Qb»k30e11jc1 SoarBS.NSF_F8.9W.OT^P^.P.S 

Hs.63131 Homo sapiens cONA RJ13 155 iis. done NT 

Hs.168083 ESTs 

Hs.257786 ESTs 



404694 



TB2745 

441092 T99289 

454643 BE006345 

426646 AA382787 

431605 AW972407 

414452 AA454038 
401991 

457176 AA436837 

436464 A1016176 

428208 AA442327 

445049 AV652718 

419116 AF292402 

427894 AL135709 

424296 AI631874 

424323 AA338791 
404582 

418631 AA225921 

424872 AA347923 

452539 AW105321 

454658 AW812330 

440310 AA878939 

433297 AV658581 

410900 AW810169 

419386 AA236867 
402451 

447842 AW160804 

453880 AI803166 

425189 H16622 



Hs.17643 ESTs 

Hs269347 ESTs, Weakly sinHa to 2109260A B cell 

HsJI 2938 ESTs. Wtoakly stmOai to AUJ1.HUMAN ALU S 

Hs.68301 ESTs 

Hs.133107 ESTs 

Hs.111449 CGl^pnM 

Hs.19081 ESTs 

H5.73064 gonadotropbHeleastnghomune receptor 
Hs.1 19120 E3 ubiquitin ligase SMURF1 
Hs.92576 ESTs 

gb:EST374154 MAGE resequences, MAGG Homo 
HsJ244569 esophagus canoenelated gene-2 
Hs.339639 ESTs 
Hs^inS ESTs 
Hs.257182 ESTs 

Hs.57776 ESTs, Modefalelyslmflar to 138022 hypot 

9b:PfcM-SM0O2O^104004X)9^ SN0020 Homo 
Hs.119747 ESTs 
Hs.130218 ESTs 

Hs.18949 ESTs. Weakly similar to CA2B.HUMAN COOA 
Hs.293261 ESTs 
Hs.293130 ESTs 
Hs.290585 ESTs 

gbPM0-KT0349-10129&O02-E04 HT0349 Homo 
Hs.75244 BCl2-Ske2 
Hs.331633 hypotheUcal protein DKFZpS66N034 

Qb:IIJ<;T0219-29109W)21^07 CT0219 Homo 
H8.411B5 Homo sapiens mRNA:cONADKFZpS6401262(f 
Hs.169187 ESTs 

Hs,121554 Human DMA sequence from done RP11-218C1 
H8.200811 ESTs 
HS.15336B ESTs 
Hs.107375 ESTs 

gb:8m88e08,s1 Stratagene schizo brain SI 

Ks.22209 secreted modular calcium4))nding protein 

gb:RC4-ST0173-191099^2^07 ST0173 Homo 

Hs.17428 RBP1-<B« protein 

Hs^72139 ESTs 

gb:CM0-CT0307.210100-15a^09 CT0307 Homo 
gb:PlVI1-HT030SO61299^03^ HT0305 Homo 

Hs.9450 zinc finger protein 84 (HPF2) 

Hs.95011 syntrophin. beta 1 (dystroptiln-assodate 

Hs.164411 albumin 
Hs.125556 EST 

gb:RC2^N0127-2403«M)11-d0$ BN0127 Homo 
Hs.122713 ESTs 

gb£ST384496 MAGE resequences, MAa Homo 
Hs.29032 ESTs 

gba:v57g07.s1 Soares_teslis„NHT Homo sep 
Hs.269783 ESTs. Weakly similar to ALU1_HUMAN ALU S 
Hs.104854 ESTs 

^AV652718 6LC Homo sapiens cDNActone 
H5^83093 neuromedin U receptor 2 
Hs.28921 zinc finger protein 
Hs.155140 casein kinase 2. alpha 1 pdypepUde 
Hs.177788 ESTs 



Hs.1 15105 ESTs 

gb:EST54302 Fetal heart II Homo sapiens 
ESTs 

DKFZP564G092 protein 



H3.4g367 
Ks.11123 
Hs.125408 ESTs 
Hs.282633 ESTs 



Hs.143868 



gb:MR4-ST0124«4050OO07-h07 ST0124 Homo 
ESTs. Weakly similar to 138022 hypotheO 



Hs.247302 twisted gastrulatlofl 
Hs.26462 ESTs, Weakly simiar to 138022 hypotheS 
gb7ni26dl7/1 Soares infant brain 1N1B H 
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45722S AWB2003S HS.Z78679 adisintegrinandmetaOoprotefnased^ 2.1 

400812 2.1 

402318 2.1 

410534 AW905138 gbK3VO-NN1O71-28O40O.207^07NN 1071 Homo Z1 

5 410878 AW809201 HsJ14248 ESTs.W6a]dy similar to ALU4.HUMAN ALUS Z1 

412029 AW88623B gb:RC50T0078-28030a022-F010TQ078Homo Z1 

414494 AA788491 Hs.6783 hypotheOcal protein RJ22724 2.1 

427027 A!924294 HS.1732S9 uncharacterized bone manw protein BM03 ^^ 

444438 AI151413 Hs,26330 ESTs 2.1 

10 435191 R15912 Hs.4817 Homo sapiens dona 24461 mRNA sequence Z1 

425324 1489470 HS.15S644 paired box gens 2 2.1 

430719 AA488988 Hs.293798 ESTs 2.1 

432577 BE208545 Hs^17590 hypothetical protein FU14640 2.1 

407593 AWD44083 Hs.237008 ESTs ^^ 

15 401098 2.1 

440299 AI871778 Hs.250112 ESTs 2.1 

414146 BEM9372 Hs.317596 Homo sapiens CONAFU12927 lis. done NT Z1 

428627 BE002993 Hs.187660 putative Rab5GDP/GTP exchange factor l» Z1 

451806 NM_003729 Hs.27076 RNA S'-termlnal ptiosphale cyciase Z1 

20 431912 At660552 Hs.154303 ESTs. Weaidysimi)8rtoA56154 AUsubst Z1 

439831 AW136488 Hs.25545 ESTs Z1 

451829 AW964081 Hs.247377 ESTs 2.1 

404595 2.1 

421498 AA292084 Hs.191575 ESTs. Moderateiy simitar to ALU^HUMAN A 2.1 

25 456083 U46922 Hs.77252 fragBetdsfidine triad gene 11 

440527 AV657117 Hs.184164 ESTs, IModerateiy similar to S6S657dplia ^^ 

406413 2.1 

439463 T69980 H$.SQ323 Homo sapiens cDKAFU 11 61 3 fis. done HE 2.1 

446242 N66^ Hs.7360 ESTs 2.1 

30 449625 NM-014253 Hs.23796 odz (odd Oztovm. Drosopliila) tiomotog 1 Z1 

457938 AI373638 Hs.133900 ESTs Z1 

413101 BE065215 gb:RC1-BT03l4^10300-01 5^1 BT0314 Homo -2.1 

408350 AW183350 H&250127 ESTs 2.1 

419812 Kil^000562 Hs.93210 complement component 8. alpha potypepUd 2.1 

35 430881 NM.000809 Hs.246112 gamma-aminobutyric add (GABA) A recepto 2.1 

429682 NM-006306 HsJ211602 SI^CI (structural maintenance of diromoso 2.1 

409955 U60665 Hs.57692 chromosome 6 open reading frame 10 2.1 

435579 AI332373 HS.1S6924 ESTs Z1 

436088 AA704687 Hs.191294 ESTs 2.1 

40 430223 NI^002514 Hs.235935 nephroblastoma overexpressed gene Zl 

416100 H18700 Hs.268799 ESTs Zl 

403218 A1134878 Hs.119500 rfbosomal protein, large P2 Zl 

409747 H60964 Hs.331250 ESTs Zl 

428764 W21550 gb3b52f1Zr1 Scares JetalJung_NbHL19W Zl 

45 425075 AA508324 Hs.1852 add phosphatase, prostate Zl 

409432 D49372 HsM460 small Indudble cytokine subfonBy A (Cy Zl 

428299 Ali)380O4 Hs^19 ESTs Zl 

406817 A]936028 gb:wo47a09j(1 Na.CGAP_Gas4 Homo sapiens Zl 

411940 AW876686 gb:CM4-PT0031.18020tW07^PT0031 Homo Zl 

50 412446 AI768015 Hs.92127 ESTs . Zl 

414012 AW452334 Hs.128148 ESTs Zl 

421966 AA904519 Hs.1 30710 ESTs Zl 

430566 AA481282 Hs.190149 ESTs Zl 

456606 AA292862 H3.275369 ESTs Zl 

55 451604 T65365 Hs.172851 arginase. type II ZO 

440926 AW196772 Hs.131323 ESTs ZO 

420687 AA279392 Hs.88605 Homo sapiens cDNAfnJ13427 lis. clone PL ZO 

459082 BE551721 Hs.282149 ESTs ZO 

413241 BE073771 Hs^2414 Homo sapiens done FLB8945 PR0241 1 mRI^ ZO 

60 426917 AA913814 Ks.172854 DKFZP586B0923 protein ZO 

447552 AI394125 Hs.160413 ESTs ZO 

420905 AA521307 H3.186651 ESTs ZO 

428052 AA420477 Hs.26993 ESTs ZO 

^_ 424308 AW975531 Hs.154443 minldvomosomemanten8ncedeiidenl(S. ZO 

65 432S27 AW975028 Hs.102754 ESTs ZO 

430202 T85775 gb^deOgOZrl Scares fetal liver spleen ZO 

446610 AV659433 Hs.282984 ESTs, Weakly similar to 138022 hypotheti ZO 

427961 AW293165 Hs.143134 ESTs ZO 

455290 U75810 gb:HSU75810 Human Homo sapiens cDNA cton ZO 

70 445564 ABQ26957 Ks.12698 KtAA1034 protein ZO 

412811 H08382 H3.21400 ESTs ZO 

413783 AA314337 Hs.301547 ribosomal protein S7 ZO 

423867 AA331886 gb:EST35757En)biyo.8weeklHoniosapien ZO 

429418 AI381028 Hs.118769 ESTs ZO 

75 431511 NM.012386 Hs.256581 Homo sapiens p95paxQDn-ldnase linker ZO 

445829 Ai452457 KS.145S26 ESTs ZO 

452366 AK000464 Hs.29276 hypothetical protein FU20457 ZO 

453123 A1953718 Hs.221849 ESTs ZO 

„^ 455401 AW336369 gbK3V4-DT0021^129W)71-d07 DT0021 Homo ZO 

80 406666 V0049S Hs.184411 albumin ZO 

445688 Al24e205 HS.1S3244 ESTs ZO 

446131 Nh4.000929 Hs.290 phosphoOpaseAZ group V ZO 

440388 Aie93520 Hs.223O00 ESTs ZO 
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457128 


Ai932995 


Hs.183475 


404416 






4441 B7 


AW1 38466 


Hs.151274 


431552 


A1815e63 


HS.2S9873 


455814 


BE141669 




454759 


AW819455 




426497 


AA3799t3 




404420 






408112 


AW451982 


Hs.248613 


432702 


AW973953 


H3.293744 


448587 


A1539652 


Hs.28330 


446854 


BE268103 


H5.208914 


410569 


AA76682S 


Ksi05675 


432596 


AJ224741 


H5.278461 


402341 






452919 


AW962167 




433632 


AA649921 


Hs.112553 


435079 


AA664192 




451927 


AL355587 


H3.27261 


432839 


AA579465 


Hs.45207 



450895 NG6727 
408459 H09701 
400842 
455797 
400859 
405829 



HS.109S7 
Hs.278386 



BE091833 



Homo sapiens done 25061 mRNAsaquence 
ESTs 

axonal transport of synapflcvesldes 
9b:CM1-Kr0092'22099»>1&U9 HT0092 Homo 
Ob:RCS^T029»)212994}31-A04 ST0293 Hgnio 
gb:EST92807 SMn tumor I Homo sapiens cO 

ESTs 
ESTs 

KIAA1546 protein 
hypothetical protein MGC1 0999 
ESTs 
malriilnS 

Qb:EST374240 MAGE resequences, MAGG Homo 
ESTs 

gb:8c05b03^1 Stratagene lung (937210) H 
Homo sapiervs mRNA full length insert cON 
hypothetical protein KIAA1335 
ESTs 

ESTs. Weakly similar lo 138022 hypoBieti 
gbdL24fn)731-2604004I764^BT0731 Homo 



10 
ZO 
ZO 
10 
2.0 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 



411863 


BE075244 


H$.12420 


ESTs 


10 


415258 


AW752247 


Hs.293853 


ESTs 


10 


416093 


R60685 


Hs.268698 


ESTs. Moderately simflar to AIUC.HUMAN 1 


10 


420314 


H81671 


Hs.320921 


ESTs. Wealdy similar to T22688 hypotheO 


10 


428002 


AA418703 




gb3v98c01s1 Soares.NhHMPu_S1 Homosapi 


10 


437733 


AI792574 


Hs.122876 


ESTs 


10 


453118 


AW195849 


Hs^2757 


ESTs 


10 


457039 


H29990 


Hs.101937 


sine oculls homeobdx (Drosophita) homolo 


10 


454578 


AW809178 




gb:MR4-ST01 18-261 099-01 2-c07 ST01 18 Homo 


10 


411565 


AW851728 




gb:MR2-CT0222-01 1 199^7-d0& CT0222 Homo 


10 


419986 


A1345455 


Hs.78915 


GA-binding protein transcription factor. 


10 


415173 


AW501735 


Hs.253015 


ESTs 


10 


449011 


At655376 


Hs.1 92693 


ESTs 


10 


410365 


AI287518 


Hs.62669 


Homo sapiens mRNA; cDNA DKFZp586D0923 (f 


10 


416057 


AI927382 


H3.29857 


ESTs 


10 


455688 


BE067238 




gb:PM1-BT0348-1S129^01-a12 6T0348 Homo 


10 


408531 


AW207553 


Hs^53639 


ESTs 


10 


434663 


AA641972 


HS.1300S8 


ESTs 


10 


428085 


AA421081 


H5,12388 


ESTs 


10 


425008 


R38685 


H3.332622 


ESTs 


10 


446139 


H77395 


Hs.39749 


ESTs 


10 


400049 








10 


428333 


AW972666 


Hs.293044 


ESTs 


10 


429456 


8E161832 


Hs.292689 


ESTs 


10 


425087 


R62424 


Hs.126059 


ESTs 


10 


457122 


AI026157 


Ks.33728 


ESTs, Wbakty similar to AUU1_HUMAN ALU S 


10 


400310 


X63966 


HS.13S631 


Haptens synthetic gene for platoleKde 


10 


451805 


AI968300 


Hs.208220 


ESTs 


10 


401986 








10 


415318 


T06544 




gb:EST04433 Fetal brdn. Stratagene (cat 


10 


417756 


Z43055 




db:HSC12B021 normalized infant brain cDN 


10 


418301 


AW976201 


Hs£3913 


hypothetical protein FU10252 


10 


424698 


AA164366 


Hs.151973 


hypotheScal protein TU23511 


10 


429110 


L29301 


Hs.2353 


opioid receptor, mu 1 


10 


433755 


AW085934 


Hs.120868 


ESTs 


10 


434118 


AF116715 


Hs.256256 


Homo sapiens PR02829 mRNA, complete cds 


10 


435413 


AI267476 


Hs.46669 


ESTs 


10 


443748 


AW206447 




' gb:UMt.Bll.afg^2.0.Ul.s1 Na_CGAP^Su 


10 


445205 


D83776 


Hs.12413 


KIAA0191 protein 


10 


458175 


AW296024 


Hs.150434 


ESTs 


10 


446419 


AW576760 


Hs.160726 


Homo sapiens cDHA fU116eO lis. dons HE 


10 


441627 


AA947552 


Hs.58086 


ESTs 


10 


457653 


AI820719 


Hs.154662 


DnaJ (Hsp40) homolog. subfamily A, membe 


10 


455614 


At693369 


Hs^02274 


ESTs 


10 


449899 


AI610700 


Hs.103280 


ESTs 


10 


420111 


AA25S652 




gb:z621h11.r1 Na_CGAP.GCB1 Homo sapiens 


10 


437354 


AA749215 


Ks.291886 


ESTs 


10 


412228 


AW503785 


Hs.73792 


complement component (3d/Epstetn Bair vi 


10 


419691 


W03298 


Hs.1 93521 


ESTs 


10 


439724 


AF086565 


Hs.60351 


EST 


10 


413362 


BE088812 




gb:CM2-BT0693-230300-129^ 8T0693 Homo 


10 


453652 


AW009640 


Hs.28368 


ESTs. Moderately similar to S65657 alpha 


10 


434361 


AF129755 


Hs.1 17772 


ESTs 


10 


442479 


AF069484 




gb-.AF069484 Homo sapiens astrocytoma Bb 


10 


413554 


AA3igi46 


Hs.75426 


secretogranln 11 (chromogranin C) 


10 


459323 


AW062490 




gb:MRO^T0085>100899^1-d01 CT0065Homo 


10 


44943B 


AASZ7317 


H5.554 


Sjogren syndrome antigen A2 (SOU), ribon 


10 
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nh'MR0.HT0157*191199-Q02-a12 KT0157 Honno 




\AttYiCA'} 


ns. 100 IU£ 


IS1/V\U0M |)tuwRl 


447153 


A AiH\e<yv) 

AAdu5202 




to 18 




Want 


Ue ifiAai 


Hnmn Ranbms <^mA DT1P1B6 iriRNA. CAG ISO 


TERROR 


BE067870 




nh*RmJtTfnfi9J)?1799.^1J)Qfi B[Tfl3&2 HfVllO 


455510 


AKOOIdSZ 


IJ» ODA'n 


A 1 rKtspendoni KCin noiiwoso 


449815 


Ald/iUOu 




cole 


42S3S8 






tiifivilhallmal rwrrfoin Cttmlsff tn tftnasgh 
nypuUlcuCB) pratBHl SMnniai VJ lonaawii 


4u023o 








451678 


AA374181 


Hs.26799 


DKFZPS64D0764pfQtein 


445073 


AW291389 


Hs.13056 


hypotheticai protein RJ13920 




AWOOr/DI 




nhbB)O^NDQ3S.29Q300J]01>a03 SN0038 Homo 


413174 


AA723564 


nS.i9lJ4J 


ESTs 


435810 






KBioon ir 


418887 






ESTs 


438563 


AA81(^5 


Hs.l 34745 


£STs,W^yBlmllartoA46010X4inl(ed 


431750 


A AP4 jnoe 

AA514980 


nS.£0Sf\JO 


ESTs 


453242 


T98327 




CO IS 


437074 


A12Bo2j5 


nS.1Ztt9QD 


KimAthA4>Ml fwntom PI ll'tOHA 
nypOuieuwai pfulcfli rUI 1 J^lrl 


459411 


N52920 




9Diyvp4nu9.5i ooares iBiai iiver spieen 


424834 


AK0D149Z 


Ue 4 cum 




409929 


R38772 


Urn 


fnyciin tfBnscf^Bon fdutor 1*ikB 


406378 
459208 


dc2d1314 




TQTe WaaMwsinrilpr In 138099 hvnotheli 
Lo 1 Si VYeaniy elnllmi lO liSilwu iijjiiniieii 


445260 


AI218133 


Ui> 4il7Ci7 


CCTe 


416248 


H99169 


Lis tMiien 

nS.234oO 


nuiDcnononai nDosoinai proiein 


445020 


Al^855 


U* lil^liM 

nS.1<l72Zi 


CCTa 


402048 






goirlW3-nNUUlV«UJUUnW*'w9n MOinO 


412695 


AW984439 




416408 


R94725 


Hs.35354 


ESTs 


423347 


At660412 


Hs^34557 


ESTs 


427836 


AA416642 


Hs.116176 


ESTs 


433347 


AF023130 




gb:Homo saptons RaS'GRF2 mRNA, parila) c 


436902 


AW247145 


Hs.192729 


ESTs 


440122 


AI733011 


H5.127678 


ESTs 


442901 


AI023654 


Hs.114191 


ESTs 


444097 


AW517412 


Hs.150757 


ESTs 


447278 


AI934935 


Hs.158669 


ESTs 


451361 


AA053854 




gb2l52f02Tl Scares retina N2b4HR Homo 


451813 


NM-016117 


H5.271B2 


phosphoRpase A2-acQvatino protein 


454423 


AW603985 




gb:RC4^0048.140100^11-a04 CN0048 Homo 


458801 


Ng8548 


Hs^76860 


ESTs. WeaiOy similar to C Chdbi C, Human 
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TABLE 308 

Pkey. Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 



Ptcey 

407593 
407594 
407639 
407676 
407721 

407726 

407762 
407764 
407768 



407803 
407809 
407611 



CAT number Accession 



1003161J 
1003220.1 
100S924J 
1008294.1 
10108J 

101126_1 

101439.1 

1014849J 

10163.1 



1017498.1 
101798^.1 
10180.1 



AW044083 AW044094 AW370634 
AW057584 AW057685 AW0441 53 R34370 
AW205369 AW058599 AW207608 
AW0641 1 1 AW064450 AW064429 

Y12735 NKL003582 AW238970 R38268 R41411 R41419 T16717 AA002193 HB2028 AI359545 AW1G5201 AW087158 AA699728 AI095264 
AA002065 H62029 AI289101 AA884804 AA9O4950 AA609672 A1139874 H77896 

AA435679 AA470655 H22526 AA044031 AA876426 W63767 AI421 140 A1418990 H42329 H88910 AIJ041066 H88909 W94610 AW352277 W94648 
W94167 AW952568 AI419S53 AA335501 AA393641 AA044353 H41626 H22525 RS8582 AW297645 C75230 AW368034 AW468904 AI272755 
AW235638 AA346882 AW666803 AA361281 AW963163 AA044373 AA136755 
86008347 BE008320 BE083307 BE083311 AW075968 

BE514982 BE814814 AW393078 AW238480 AA055637 N27644 AA641 158 M87068 AA161019 AA161003 AA587315 AA716746 AA593632 

AJ354870AW183492Y07755NM.005978AW872948AI608987 H64656AF086003AA643149A!819402 H64555AA 

BE122811 AA587026AVV857108 AI950679AAS96012AA654004AW238238AI971395AA459074AA45BB84AI6^ BE181995 8E1B1970 

AW195797 C00271BE182043 

AWD81681 N36967 N36959 

AWD82279AW082688 _ _ 

AW190902 A18027B8 AI041650 AI037867 AF1 1 01 37 NM 01 3372 H99469 N35377 AW1 51 676 Ai678451 AWa78795 AW087935 AI884505 

AW044B02 AB032372 AF045800 AI621183 AI750578 AA376403 A1910477 AA373348 AA373673 AI7521 24 AL359060 W4861 9 AA373298 AA373975 

AW804409AW604404AW368603AAS45734AW361415AA373557AVV956164AVV853798AI75059^ 

N27040 AA071138 Anr53050 AA599801 W48852 AA6G9811 AA112124A1940705 W52688 AA084X1 C01826 AI940729 AA373544 AW957491 
AW383224 AW383164 AW383192 AW383125 AW383194 AW842S07 AI940795 H80042 AW631119 W47324 W42884 AI750594 AI754387 AI753734 
AA372833 AW069006 AI750577 AW473621 AI888605 AA600082 A1041803 W51 909 W25447 AI521673 AI087351 AA670070 W47325 AA071 381 
W42791 AI090390 A1085102 A1302125 AA788723 AW069430 AW069485 AI754608 AA373014 BE140150 BE140166 BE140102 BE140143 
BE140157 BE140117 BE140098AW005943 BE140108 BE140176 BE140171 BE140144 BE140175 BE140160BE1 40152 BE140099 BE140177 
BE140167 BE140145 BE140109 BE140163 BE140172 BE140161 BE140179 BE140147 6E140107 BE140146 BE140155 BE140173 BE140148 
6E140174 BE1401S8 BE140149 BE140116 8E1401S6 BE14010S BE140103 BE140164 AW138508 BE140153 AW806557 BE140121 6E140162 
€01953 AW806500 BE140124 AA329219 AW95S642 AW0&9165 AI988107 N21 1 13 A)754594 AWD69264 AI754660 BE551937 AA543066 AA703927 
AW130486 AI614434 AI924946 AA789056 AW173867 AW069841 AI022286 AI753S23 AI7535S8 AI753482 AWD66940 AI753002 AA5e9866 
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407995 
408000 



408112 
408172 
408221 



408235 
408238 
408243 



103177 1 
1031J 



1039850.1 
1044105J 
104826.1 



104888.-1 
10487^1 
1048.1 



AI753593 AI753469 AI753506 AW008360 AI753255 AI9491 1 1 AI752123 W19275 AI679005 AI888455 AI677772 AI589279 AI988546 AW069588 
AI754028 N20040 An54354 Ar752B78AA838370AVVB31283 A1440410 AVroi684S AI801326 A)610424 AI453455 AI753087 

AA670062 AA5998S3 AI753621 AI802571 AI537325 AA669978AI921732 AI811571 A1309543H80043AI623845AI623852 A1075634 AI919521 
AA729459 C01795 AA577421 AA668620 AAfiOOOOS AA653400 AI445339 AW468974 AW1 30343 AA600104 AW970482 AW806615 BE140122 

407834 1020170 1 AW084991 RS4179 N64486 

407851 10212.1 NM.014496AF184985H82836 H82897 

407684 1023626 1 BE075316 AW090150 AW589417 H47391 D61911 BE173252 

407938 102759 1 AA905097 AI734242 BE044545 N86431 AA344041 C03245 AW963097 A1567324 AA045934 AW079233 R58274 AI744425 All 67427 R5ai76 

R58598 AI989945 AI921 684 AW01 3864 AI204559 AA909848 N841 1 5 AA897468 A1668637 BE221753 A1355307 AJ139542 N80934 
407965 10298 1 VV21483 U1 5590 NM.006308 N56314AA126678AA426507Y 17782 V«4740VWJ5082 W2tW^ 

AA429090 AA095643 AI224915AA443775 AI204315 AI333690AA586584 AI275037 A1193915 AA659647 T82B41 AA1 36048182643X82647 
F36041 N74099 A1572217 TB2650 T82642 T82640 R47360 R47361 R47357 N74672 R473S6 R47355 N93086 R47354 R47353 R47342 R47362 
R47358 N93441 N56384 N93S66 T82622 T82618 

AI094748 AW241797 AI759976T10509 AI814441 AM21977 AI423762 N73686T11392 AA045406 
L11690 NM.001723 1^9225 H99955AI750335 BEOT0199 M63618 N31521 M112876AW265395AA^ 

AI830854 AW361 101 AW379356 AA301 170 T29232 AW965998 H26216 H44230 H44784 AA808916 AA5147B5 H44575 AA705179 AA586735 
AW589403 H25843 AA654993 AI783626 8E087370 
AW451982 AW1571 17 AW161544 
W024e8 AW167479 AW361027 AW833243 

AA9121B3 AA541622 AI673341 AI266109 AI335896 AM21010 AA053327 AI982962 AI372854 AI674348 AAS31087 AI801016 A1372855 A1917816 
N52161 DS2928 AI401533 AI378549 A1378800 A1337972 D62536 AW972977 AA135826 BE222491 062195 BE326453 AA225293 AA236461 
R85586 AA625141 AA058641 AA135825 AA234644 H50784 R71863 
AA053381 

W95901 AA053387 AA12150t 

Y00787 M2B130 D14283 BE439795 BE439924 AA3621B7 BE439515 M17017 C05929 AA381897 AA346136 AA381472 C06304 AA381670 
AA381679 AA381965 T1 1274 AA381318 AA381601 AA381700 AA381952 AW950718 AA381992 D82198 AA381914 AV653399 AA362140 
AA381592 AA381967 AA381454 M26383 AA3621B8 AA382057 AA381781 AA352246 AA381659 AA3B1 395 AA381845 AA381848 AA381603 
W404K AW663238 AA381702 AA38135S AA381356 AA381653 AA362127 AA374516 AA381426 AA381767 AA362270 AA381459 AA362358 
AA362160 AA304441 T27482 083848 AA381877 AA381623 D83851 AA3621 33 AA381999 AA372837 AA362192 AA362196 AA381806 0821 32 
AA362107 AA363589 AA296403 T11316 082218 AA381456 W561 1 1 AA381435 W40153 T11379 W40283 082615 AA382121 W40401 AA381699 
T68485 D82618 W52968 W39356 T27467 083813 Z1 1686 T1091 5 AW387405 AW604306 AA363550 AA363514 BE004230 AA381218 W40279 
AA363545AW377023AA363453 M363559AA381833AVV850620AA381813AA381710M3B1584AA381577AA3^ 
AA30S271 AA363554 W52491 AW631242 AA363599 0822S4 AA363540 AA363596 AW951160 6E122704 AA380965 AW850702 AW630S01 
AA381229 AA381228 AA381S76 AA3ai795 AA381816 AI274317 AA360958 AA381517 AA381618 AA381891 AI310352 T11037 BE122702 
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AW302089 AW466960 AA810124 AW302684 AI272921 AI141003 AW589738 AA074714 AA621482 AI7g8501 AW024557 AA621074 T62627 
AA639206 A)913538 AA075135 R54613 AA352975 AW206892 Af868280 AW449243 AA9Q7317 AW1 34573 AI5904g2 AI610050 A1834309 A1375556 
AI284991 AW968038 

BE065030 AA670100 AA781546 AI022472 AA846803 AI497780 N68^6 AI382690 

T78013 AA699327 W87785 W8861 3 

AW976692AA806542AA745856 

A137383B AI073389 AI087143 AA764776 AA913318 

AW978161 AW978165 A)016938 AI539270 AW2949S8 AW51 1089 AA814649 

AF074982 827906 R31333 R31591 R27812 

AWg79009 AA828038 AAB28148 

AA393603 Z19481 AA252342 AI807614 AI913804 AA040176 AA971879 H53388 AF085972 Ai291424 H53349 AW015078 AA768307 AAl 27921 

AA723700 AA040B41 AA993954 AA213665 AA127972 AA913063 BE327712 AI017585 AA988186 AA628183 AI205930 AAB335S8 AA974107 

AI004390 H48931 AA724004 

AW296024 AA897109 A1015000 

Z40870 BE5O4110 BE219908 AW468668 AI002334 

U48351AA9e9182 8E327312AW138276B£467S67A1680815A)422668 

AI264628AF017648AI872732 

AI024855AI024877 

A1084514N46645AW1839&4 

AV648310 AV661871 AI928475 AV647819T55845 

A1185703AIB05813AW292764 

AW136139AI216724 

AI305223AI45eS77AI275589 

AI352790AI275996 N48887 

Ai299789BE551384 

BE281115T53860AV659439AV659421 BE539929 

A)554946AW362008AW362S53At683342AI376781 

AW802754BE619228 

AI693417 A1418256 AW627792 F3297g 

AW295151 Ai4250O4 AW470228 AI693738 H50554 R99t98 H50553 R99197 

AW001835 AW612725 AW136670 AI798956 BE467368 AA280216 Ai216754 BE622057 AL121193 AW853470 AW8534S0 AW369075 AW369108 
AW578479 AW369106 AW361242 AW361190 N79183 

N98648 AI458157 BE0416S2 BE218014 BE622355 AA369340 AA369S15 AW962780 AW962704 

AI522129 T56009RS384g 

AW236702A1566105Z40398 

AI630223A1630470 

AW885523 AWB65128 AW665467 AW865127 AW665466 

R15891 R61471 R61469 N69765AI014624 AA007214AW592075 H09780AA709038A133S898H11055AI559229F09750 n2573A^^ 
AA9886S4 AA626438 AI002431 AI299721 

AW988226 AI139249 AI701692 AA017303 AW46g622 AA2S9148 AA811690 AA807996 AA744260 AA824494 AA731710 T25332 AA258101 
AAg70687 

AW439497 A1826059 AA01&402 AA837392 
BE551721 H51678 A1823338 C014B8 AW813S62 
AW301478 AW301560 AI889207 
AW138410AI912712N40186 
BE261314AI243406AA027322 

A180891 3 AW028342 W8 1 290 A157 1379 AI382808 AA037071 W79588 R48751 N31808 AI870233 R48752 AW024895 A)333754 
AW294659 AI204928 AA351653 H51220 R86643 AA993182 U79298 R15294 F05089 Z42963 R17B16 T77498 AA332319 W56049 AA331586 
AW681873 AW881865AW881876 R52345 AI652070AA400044AA401512TQ8151 W05486 N6837B T33846A)190920 R43021 AI949980Z390B4 
T63413 W37269 F01343 R88669 AI6210S5 AW117593 AI19321 1 AW297932 A)50a709 AA400056 



Table30C 

Pkey: Unique number corresponding to an Eos probeset 

Rat Sequence source. The 7 digil numbers in Ihlscohmm ere GenbanktdenSlier (GQ numtxiis. "Dunham I. etal/refeis to the pubOcaUon entitled The DNA sequence of 
human chromosome 22.* Dunham I. el al.. Nature (1999} 40248^95. 

Strand: tndicatesDNA strand from which exons were predicted. 

NLposWon: Indicates nucleotide posHlons of predicted exons. 

Ptey Ref Strand Ntjnsieon 
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400462 


9929659 


Minus 


197610.197785 


400608 


9887666 


Minus 


96756-97558 


400612 


9929646 


Minus 


151513-151662 


400631 


6247025 


Minus 


S6203-56313.56424-56482.57073-S718S^513.57593.57747-57941 


400641 


8117693 


Phis 


4786^992 


400664 


8118496 


Plus 


13558-13721.13942-14090.14554-14679 


400G96 


8116812 


Mnus 


77737-77899 


400697 


8118812 




79073.79198 


400706 


7249204 


Minus 


78299-78^6 


400^16 


8569993 


Rus 


161221-162078 


400842 


1927148 


Pius 


90452-90673 


400859 


9757499 


Minus 


91888-92018.98131-98294.99474^0 


400861 


9757506 


Plus 


163855-1&4016 


400889 


9958234 


Minus 


169782-170036 


401069 


3927852 


Minus 


45682-45831 


401098 


9965518 


Minus 


85632-66174 


401132 


8705350 


Minus 


85679^795 


401189 


9690246 


Minus 


90815-90929 


401200 


9743387 


Minus 


111688.111806.114791-114916.115419-115583.116351.116446,116847-116907,122853.123067,^ 


401240 


3355450 


Phis 


77433.77636 


401324 


9863791 


Phis 


234057-234174 


401365 


9796180 


Minus 


119572-119672 


401368 


8670914 


Minus 


65508^662 


401459 


9212270 


Minus 


162001-183323 


401462 


6682291 


Phjs 


112763-112909 


401497 


7381770 


Phis 


92607-92813 


401526 


7770551 


Phis 


91570-93177 


401596 


3293210 


Rus 


63639.63890 


401614 


7839924 


Rus 


17350-17735 


401626 


6575943 


Minus 


238100-238432 


401645 


7657839 


Minus 


3498&^133 


401673 


7689903 


Minus 


122587-122705.122765-123047 


401694 


3540172 


Minus 


64056-64168 


401785 


7249190 


Minus 


165776-165996.166189.166314.166408.166569,167112.167268.167387-167469,168634-168942 


401878 


8099802 


Minus 


162268-162474.163089.163195 


401BS7 


7229981 


Rus 


93973-94120 


401899 


7230209 


Minus 


155620-155815 


401986 


4406829 


Minus 


31137-31293 


401989 


4309954 


Minus 


118611-118821 


401991 


4156128 


Plus 


2398-2513 


402048 


8072512 


Plus 


43936-44076 


402076 


6117410 


Rus 


128316-128627 


402090 


7249154 


Minus 


104805.104939.106898-107052 


402112 


8139750 


Rus 


10507-10713 


402131 


7704961 


Minus 


33114.33209.33496-33878 


402145 


6018280 


Phis 


113086-114600 


402318 


7582559 


Mnus 


12843-13403 


402333 


8844110 


Minus 


165693-165856 


402341 


7656696 


Rus 


22583-23699 


402369 


9558577 


Minus 


50417-50522 


402451 


9796677 


Minus 


48137-48343 


402528 


7630857 


Minus 


169609-169742 


402603 


9909396 


Mbius 


141663-141852 


402615 


9926801 


Phis 


131390-132157 


402689 


8348025 


Minus 


5885-6209 


402942 


936839B 


Rus 


102152-102386 


403011 


6693597 


Minus 


3468-3623 


403053 


8748888 


Rus 


146568-146659,147539-147811 


403089 


8954241 


Phis 


171964-172239 


403188 


9838289 


Mnus 


157618-157755 


403218 


7630969 


Phis 


58039-58149 


403271 


7230852 


Plus 


134283-134485 


403281 


6072630 


Minus 


7521-7728 


403306 


8099945 


Phis 


127100-127251 


403310 


6139936 


Minus 


163883-164026 


403317 


8318526 


Minus 


50823.50834 


403329 


8516120 


Phis 


96450-96598 


403341 


8569175 


Rus 


3089^30910 


403344 


8569726 


Rus 


70823-70990 


403356 


8569930 


Rus 


92839-93036 


403513 


7656757 


Minus 


155310-155436.158402-158535 


403515 


7556757 


Minus 


173358-179553 


403525 


7960440 


Rus 


152431-153243 


403S34 


8076917 


Minus 


46652-47332 


403568 


8101145 


Minus 


85509-85658 


403572 


8101156 


h^us 


1253-1675 


403574 


8101156 


Rus 


5542-6176 


403623 


8569879 


Minus 


3519-5426 


403825 


8569879 


Rus 


6551-7111 


403637 


6671938 


Minus 


142647-142771,145531-145762 


403878 


7331517 


Minus 


119573-120430 


403691 


7387384 


Minus 


88280W63 
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403776 


r/rUoii 


Mbnis 


1414.151X1624-1756 




8076989 


Plus 


93160-93409 


403766 


OUMOOO 


Minus 


73028-73217 


403891 




Minus 


ISISOS-1 93220 


403937 




Mmus 


12509-12773 


404042 


955SS73 


rlU5 


0 1 4U-O/Ua,Oo33-07o3 


404043 


saaoorj 


Plus 


OAA^O ^A4«1P A/^r'^¥9 

29042-29135,46597-46699 


4O406B 


JIOoOcI 


Minus 


18123-18766 


404108 




Minus 


63603-64942 








86147o86509 


404193 




Minus 


941B5'94322 


404196 


3805917 


Muius 


C7DOO coino 
d792o-6o109 




DooaaJo 


Plus 


64270-64633 






Minus 


114391-114628 


404404 


79799fi9 


Phis 


02112-82244 


AfXAAiA 


rJ04lo5 


Plus 


143127-143398 


WW 10 




Minus 


143042-143216.144704-144853,145800-146048 




/4U/952 


Minus 


129817-130566 


WM4J 


7579073 


Minus 


87198-87441 


WWW 


7657714 


Phis 


27768-29179 


404476 


0U0UD99 


Mile 

rlUS 


101841-102043 


<Iv*iOlO 


0101900 


Plus 


84494-84603 


iOAKTR 




Plus 


121918-122123,125198-125348 


W*1S0 1 




nus 


20152-20362 


404561 


QTQCQOn 

9r9a90U 


Minus 


canon ^A«iMt 

69039-70100 






Minus 


104257-1 04348^104822-104970 




O7QQ00A 


Plus 


53230-53424 




O09OO40 


Minus 


69781-70096 




Mod/ ZD 


Minus 


40059-40210 




9944U00 


Minus 


74922-75788 




QQCfiORO 
9900^02 


Minus 


16764-16900 


404638 


q7QR7m 

9( OOlOl 


Minus 


DAjtO^ nnC^O 4AAA'iff AAA^J^d 

99433-99528,100035-100161 




{I7CAOCO 


Minus 


108172-108296 




ar9»a7 


Minus 


128092-128227 




9800828 


Plus 


77522-77658 




TO^OOQQ 
fC0U£99 


Minus 


168609-168781,182951-163081 




/OO^O^ 


Minus 


23244-23759 




TOOO'JftC 


Minus 


61087-61590 


/A/ 000 


JoiUoM 


Plus 


7541-8132 


4/V/oo/ 


O9IIDUO 


Minus 


37948-38226 


404957 




nUS 


147512-148011 






Minus 


89944-90729 




4662677 


Minus 


72406-72600,72779-72856 


405001 


Dvia4U0 


Minus 


10464&-104819 


405008 


WOOV 19 


Minus 


D4U91-04A7 


405090 


ouf £9ca 


Minus 


dOaD<-J9202 






Plus 


140176-140340 






Plus 


51081-51701 


405230 


/4C**9Uo/ 


Minus 


97493-97582 


4U3oO^ 


2078453 


Minus 


121688-121840 


4UdJ4/ 


2979602 


Minus 


977-1116 




7408143 


Phis 


90716-90887.101420-101577 


AflKACC 


7656675 


Phis 


134112-134671 


4U3430 


7656676 


Phis 


150052-150208 




6050952 


Minus 


70284-70518 


Ai\KK!M 

•HJadci 


9454643 


Rus 


65096«247.77608-77637,81242-81364,8424$W95 




9454643 


Phis 


114550.114688,117265-117407,119490-119599.123237-123395,131140-131217 


AnHIMK 


O03oi9o 


Minus 


117070-117270 




5815499 


Minus 


66822-66925 


•)uaoo4 


aJUo268 


Phis 


17856.17957,18302-16412.1 8837-18927^790-22989 




4ftaeicc 
4099130 


Minus 


53624-53759 




4014424 


Phis 


61379-62562 


WOf JO 


9SW0990 


Phis 


44370^15410 


405747 


0<l09U09 


Minus 


153933-154060 




f24o<lM 


Minus 


48204-48371 




5796494 


Minus 


27238-27885 


Warm 


7417368 


Mirajs 


77798-78000 


405829 


7109593 


Minus 


15628-16127 




6758795 


Phis 


120621-120971 


wlACQOC 

4Ua!Ma 


6758795 


Minus 


163112-163652 


40S970 


K47789 


Minus 


45795-46295 


406005 


8247801 


Minus 


39912-40220 


406018 


6758904 


Minus 


37795-38168 


406076 


9123123 


Plus 


89972-90319 


406092 


9123919 


Plus 


251370-251797.252168-252882 


40619D 


7289992 


Minus 


22395-22901 


406268 


7549820 


Rus 


111718-112008 


406298 


5688278 


Minus 


30064-30770 


406333 


9213235 


PhiS 


64689^798 


406364 


9256114 


Minus 


5071S«)833 


406376 


9256142 


l^nus 


128408-126800 


406413 


9256407 


Plus 


4385844003,4699347136 


406468 


9795553 


Phs 


437»616,6S70.9046.11368.11509.11625-11880 
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TABLE 31A: ABOUT 1884 GENES UP-REOJIATED IN IPF COMPARED TO NSIP 

Tabte 31A lists about 1884 gensswtiose expression leveb ao up regulated In kfopathic pubnonajy fibrosis (IPF) samptss as compared wlh norvspedric intersUlial pneumonia (MSIP) 
samples. These were selected from about 59680 probesets on an Aflymetrix/Eos Hu03 GeneChip array such that the rafio of 'average' tdiopalhlc pulmonary fibrosis sample 
expression level to 'average' noiKpeci&c btfetsOOa] pneumonia sample expression was greater than or equal to about ZQ. The 'average* idiopathic pubmnary fibrosis level was set 
b the 80th percentile amongst idlopalhicpulrnonary fibrosis sample The 'average* rwrMpeCfficlnlersHlalpneuinonla level was set to the 
interstifia! pneumor^a samples. 

Pkey: Unique Eos probeset Identiliernumber 
ExAocn: Exemplar Arxesslon number, Genbank accession number 
UnigenelD: Unigene number 
Unfgene Title: Unigene gene tiSe 
R1: 



Ratio of IPF (UiopM pulmonary frbrosis) to NSIP [non^pecilie intaislilid pneumonia) 



Pkey 

450478 
405654 
432365 
403637 
431548 
407811 
439606 
403574 
416653 
441233 
415817 
409632 
432437 
407266 
423017 
403329 
429629 
441519 
453823 
406690 
416379 
428862 
407305 
434683 
441802 
431242 
442377 
420407 
428908 
445898 
457673 
458771 
426800 
44C504 
415025 
438557 
416128 
457242 
423629 
404793 
435563 
415672 
455488 
426230 
412282 
431622 
405523 
424693 
436397 
455476 
434784 
422977 
442849 
451519 
412474 
457081 
444827 
404822 
402430 
457900 
400292 
410934 
440172 



ExAocn 



AW451709 

AK001106 

AI834273 

AW190902 

W79123 



AA7685S3 

AA972965 

Ue8967 

W74001 

W07088 

AJ235664 

AW178761 

BE501732 

AA972740 

AL137967 

M29540 

N38857 

NM-000346 

AA715284 

AW298724 

AA968636 

AA987742 

AA993807 

AAB14732 

AW303529 

AF070623 

AA551569 

AW295151 

AA385085 

A1948966 

AW207091 

AW364104 

AA173632 

AA457011 

AW02t173 

AF210317 

N53097 

AA102322 

AA367019 

BE160188 

AW979271 

BE169810 

AA715013 

AA256753 

AA649051 

AA631498 

R10099 

A1800600 

A1791451 

AA916785 

R09764 



AW976692 
AA250737 
AW811114 
AA868584 



UnigertelD 


Unigene Title 


R1 


Hs.271200 


ESTs 


20.2 






16.1 


H8.274419 


hypotheScal protein FU10244 


11.9 






\U 


H5,9711 


novel protein 


m 


Hs.40098 


cysteine knot supeifanily 1» BMP antagon 


10.4 


Hs.58561 


G protein-coupled receptor 87 


103 






10.1 


Hs.74170 


metaDolHlonein 1E OimctldnaO 


9.3 


Hs. 135568 


ESTs 


9.1 


HS78867 


protein tyrosine phosphatase, receptor-t 


is 


Hs.55279 


serine (or cysteine) proteinase Inhibib) 


&4 


Hs.293885" 


ESTs 


as 




gb:Homo sapiens mRNA for Immunogtobulin 


&2 


Hs.227948 


serine (or cysteine) proteinase inhibito 


8.1 


Hs.30622 


Homo sapiens cONA FU13010 lis. done NT 


8.0 
8.0 


Hs.127092 


ESTs 


7.9 




gb:DKF2p761D2315j1 761 (synonym: hamy2) 


7.B 


Hs.220529 


cardnoembryonic antigen-related ceQ ad 


7.7 


Hs.203933 


ESTs 


7.7 


Hs.2316 


SRY (sex determining region Y)4)ox 9 (ca 


7.5 


Hs^2639 


gb:nv35fD3.r1 NCf.CGAP^6rS Homo s^ns 


7.2 


ESTs 


7.2 


HS.127B77 


ESTs 


6.9 


HS.IS1278 


KIAA1201 protein 


6.9 


lte.167387 


ESTs 


6.9 


Hs.145010 


t^potysaccarlde-spedltc response 5-li 


6.8 


Hs.144955 


ESTs 


6.8 


Hs.13423 


Homo sai^ens clone 24468 mRNA sequence 


6.7 


Hs^2034 


hypolheflcsl protein PR02822 


6.7 


Hs.163612 


ESTs 


6,6 




gb:EST98959 Thyroid Homo sapiens d3NA 5* 


6.6 


Hs.130017 


ESTs, WeaMy similar to JN0g08 HMransp 


6.6 


Hs.72307 


ESTs 


6.5 


HS.143S09 


hypothetical protein FU21924 


6.5 


Hs^lie 


CDC14 (cell dhrision cycte 14. S. cerevi 


6.4 


Hs.16612 


gb:ad90c1lJl Stratagene fetal reSna 93 


6.3 


Homo sapiens cDNA: nJ219091is, done H 


&3 


Hs.95497 


solute canter family 2 (fadfitated ghi 


6.2 
6.2 


Hs.193579 


ESTs 


6.2 


Hs.241395 


{^zl90f03.r1 Stratagene colon (937204) 


&2 


protease, serine, 1 (trypsin 1) 


6.1 


HS.2931B4 


gb:QV141T04134)1020(M)59i)05 HT0413 Homo 


ai 


ESTs 


6.1 






6.0 


H3.47557 


ESTs 


6.0 


H8.169835 


ESTs 


&0 




gb2s22b11r1 Na.C6AP.GC81 Homo sapiens 


5.9 


Hs.164007 


ESTs 


5.9 




gb3ip83h04^1 NQ.CGAPJTIiyl Homosa;^ 


5.9 


Hs.269805 


ESTs 


5.9 


Hs.209573 


ESTs 


5.8 




gb:i\i50c09.yS NCI.CGAP.0tf2 Homo sapiens 


5.8 


Hs.180610 


splk:ing factor prollne^tamlne rich ( 


5.8 


Hs.20416 


ESTs 


5.8 






5.7 






5.7 


Hs.291665 


ESTs 


6.7 


Hs.72472 


ESTs 


&7 




gb:MR2-ST0131-t 11 19^16^ ST0131 Homo 


5.7 


Hs.126154 


ESTs 


5.7 
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431374 


BE258532 


H5^51871 


409816 


AW500954 




447613 


AIJ041057 


Hs.33363 


417919 


AI92B203 


Hs.86379 


425259 


AL049260 


Hs,1 55397 


439063 


AR)85922 


Hs.1 13968 


406053 






431211 


M86849 


Hs.323733 


451830 


H18433 


Hs.21542 


416035 


K42314 




413849 


BE173561 


HS.1S384 


459458 


AW270957 


H5.2S4577 


416154 


Z46122 




404561 






426895 


AA437124 


Hs.187247 


419247 


S65791 


Hs.89764 


455601 


AI388680 


Hs.816 


440925 


AW5110S0 


H$.130419 


419249 


X14767 


HS.8978B 


448477 


BE612572 




454039 


AW078064 


Hs.245540 


459564 






401497 






408493 


BE206854 


Hs.46039 


444931 


AV652086 


Hs.75113 


456680 


AL137758 


Hs.1 16072 


452542 


AWB12256 




411402 


BE297855 


Hs.69855 


404957 






436445 


AA922213 


Hs.121735 


442617 


AW340093 


Hs.130538 


416045 


H15990 


Hs.31403 


425178 


H 16097 


Ks.161027 


441918 


Ar733373 


Hs.128119 


455464 


AW983901 




420929 


AI6g4143 


Hs.296251 


448844 


A)581519 


Hs.177164 


430686 


NM_Q01942 


Hs.2633 


405229 






417641 


AA205015 


H$.54617 


434167 


AA626334 


Hs.116153 


450438 


AI6g6071 


Hs.253800 


456394 


W28506 




455747 


BE074910 




417420 


T85150 


Hs.266814 


409545 


BE296182 


Hs.19002 


426750 


AA383950 




440815 


AI733055 


Hs.130806 


408959 


AW890878 


Hs.211610 


454482 


BE1479ig 




436508 


AW604381 


Hs.12112t 


452046 


AB01B345 


H&27657 


407415 


ARJ73326 




450090 


AW448940 


HS.2022S9 


406333 






434188 


AI765848 


Hs^1680 


403344 






446466 


H38026 


Hs.308 



405455 
411387 
426097 
427768 
411018 
415257 
441107 
419519 
410901 
426217 
424188 
456987 
405303 
414955 
451620 
421948 
424780 
443271 
417181 
402230 
422246 
431508 
415236 
413101 
444774 



AW842339 

BE327369 

T78402 

AW813428 

F03016 

AA917075 

AI198719 

AW810001 

AW131888 

AW954552 

AI557290 

C15506 

AW449888 

L42583 

U39576 

8E568568 

110123 

AA481032 

NtyL012481 

R4140O 

BB}65215 

AW052174 



CTP synthase 

gb:UI-HF^P0p-aff-h.12-04Jlr1 NIH.MGC.5 

DKFZP434N093 protein 

ESTs 

Homo sapiens mRNA; cDNA DKFZpS64K143 (ft 
ESTs 

gap Junction protein, beta 2, 26kO (conn 
K1AA1035 protein 

gb:yo09e02j1 Soares adult brain N2b5HB5 
API gamma subunit binding protein 1 
ESTs. Weakly similar to B34087 hypothetl 
gbi1SC0VB031 nonnalized Intent brain cDN 

ESTs 

fragile X mental retardation 1 

SRY (sex detennining region Y)>box 2 

ESTs 

gamm»aniinobutyiic acid (GABA) A recepto 
gb:601452090F1 NIH.MGC.66 Homo sapiens c 
ESTs 



phosphoglycerale mulase 2 (musde) 
general transcription factor IIIA 
Homo sapiens mRNA; cDI^JA OKFZp434H24S (fr 
gb:RC0^T0174-1910994)31-e07 ST0174 Homo 
NRAS-reiatedgene 

ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

gb:RC1-HNX03-2203004)11-f10HN0003Homo 
programmed cell deatti 4 
ESTs 

desmoglein 1 

bypotheilcal protein PLJ20060 
ESTs 
ESTs 

gb:48f 1 Human retina cDNA landomly prime 
gb:RC5-BT0580-170300^1-F12 610560 Homo 
ESTs 

hypothetical protein MGOI875 
gb£ST97403 Thymus tl Homo sapiens cDNA 
ESTs 

CUG triplet repeat, RNA-binding protein 
gb:RC3-HT023O.16O20aO1&^ HT0230 Homo 
ESTs. Wealdy similar b 800755 ptecksbf 
KIAAD802 protein 

gbiHomo sapiere t^racytine transporter* 
ESTs 

peroxisomal trans 2-enoyl GoA reductase; 

arresthi 3, retinal (X-emestln) 

H5.130815 hypothetical protein FU21870 
Hs.112238 ESTs 
Hs.174880 ESTs 

gb:MR3-STOig2.010200-210<05 ST0192 Homo 
Hs.27513 ESTs 
Hs.190520 ESTs 
Hs.176376 ESTs 

gb:MR4-ST0124.27030(M)054}11 ST0124 Homo 
Hs.172792 ESTs,We8kty8lmilartohypothelicalpro 
Hs.142634 zinc finger protein 
Hs.173535 ESTs 

gb:C15506 Oontech human aorta pdiyA^ mR 
Hs^57224 ESTs 
Hs.334309 terBSn6A 

HS.1530S8 butyrophiHn. subfantily 1. member A1 
Hs.195704 ESTs 

Hs.1071 surfactant protein A binding protein 

HS.S306 hypothetical protein DKFZp586F1122simn 

Hs.182979 dboGomal protein L12 

gb7f94b11s1 Soares infant brain 1NIB H 
gb:RC1^0314.3103004)15401 6T0314Homo 

Hs.ig6030 ESTs 



5.7 
5.6 
5.6 
&6 
5.6 
5.6 
5.5 
5.5 
5.5 
5.5 
5.5 
5.5 
5.5 
5.4 
5.4 
5.4 
5.4 
5.4 
5.4 
5.4 
5.3 
5.3 
5.3 
&3 
5.3 
5.3 
5.2 
5.2 
5.2 
5.2 
5.2 
5.2 
5.2 
5.2 
5.2 
5^ 
5.2 
5.2 
5.1 
5.1 
5.1 
5.1 
5.0 
5.0 
5.0 
SlO 
S.0 
&0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
48 
4.8 
4.8 
4.8 
4.8 
4.8 
4.8 
4.8 
4.8 
4.8 
4.8 
4.8 
4.8 
4.8 
4.8 
4.6 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.7 
4.6 
4.6 
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444414 AW293214 Ks.8752 transmembrane protein 4 4.6 

431291 N2S521 Hs^Z75 Kruppel-typezinc finger proteh 4.6 

436B53 6E326074 Hs.148861 ESTs 4.6 

445334 AI610081 Hs.9475 glucose transporter protein 10 4.6 

5 408172 W02488 . Hs.46039 (tfiosphoglycerate mulase 2 (muscle} 4.6 

426985 BE394B4S H3.131905 ESTs, Moderately sinrBlar to Z195.HUMANZ 4.6 

404638 4.6 

447617 AM00762 Hs.176675 ESTs 4.6 

422182 AU)43892 H5.180582 Homo sapiens cDNA: aJ21836ris. done H 4.6 

10 442360 A)374621 Hs.29055 ESTe 4.6 

411738 AW859353 gb:MR1-Cr0353-150300-102-a12 CT0353 Homo 4^ 

444157 AI125785 Hs.153351 ESTe 4.5 

401365 4.5 

459592 AU)37421 Hs.208746 ESTs. Moderately similar to poLORF I ( 4.5 

IS 436269 AA707472 Hs.190760 ESTs 4.5 

459446 AM16773 Hs^S012 EST 4.5 

452090 AA022684 . Ks.124673 Homo sapiens cDNAFUl 1477 (ls,ctone HE 4.5 

414899 AW975433 Hs.36288 ESTs 4.5 

443764 F232e3 gb:HSPD229B0HM3 Homo sapiens cDNA done 4.5 

20 444898 Ai201548 Hs.308338 ESTs 4.5 

417428 N87579 gb:LL2030F Human fetdl heart. Lambda ZAP 4.5 

428528 AI004034 Hs.98638 ESTs 4.5 

4D5605 4.5 

457982 AW856093 Hs.183617 ESTs 4.5 

25 427731 AA411750 Hs.20943 ESTs 4.4 

420691 AA829433 H8.275343 ESTs 4.4 

429927 NM.001115 Hs.2522 adenylate cyclase 8 (brain) 4.4 

453080 AU23056 Ks.23921 hypothetical protein DKFZp547A023 4.4 

412147 AW895984 gb:QV4-HN00394)40500-1S7-«08 NN0039 Homo 4.4 

30 435747 AI079519 Hs.134398 ESTs 4.4 

453824 AL138012 Hs.183840 ESTs. Moderately similar to ALU7J1UMAN A 4.4 

458865 . T05095 Hs.19597 KIAAI 694 protein 4.4 

459037 AW439497 Hs.290656 EST 4,4 

403310 4.4 

35 425578 U85652 H5.158313 chromosome 17 open reading frame 1 A 4.4 

427500 AW970017 Ksi93948 ESTs. Weakly similar to S6S657aIpha>10 4.4 

432020 AJ251509 HS.27234S phosphodiesterase 11A 4.4 

453043 AW136440 Hs.224277 ESTs 4.4 

456293 AW131715 Hs.311561 ESTs. Weakly similar to CYA7J<UMAN ADENY 4.4 

40 447879 BE503405 Hs.170437 ESTs, Weakly similar to PRP4_HUiy!ANSALIV 4.4 

426646 AA382787 H8.122713 ESTs 4.4 

454864 AW835775 gb:QV44.T0016-240200-110^04 LT0016 Homo 4.4 

404898 4.4 

435434 AA6803S7 Hs.187850 ESTs 4.4 

45 443314 AW771701 Hs.54646 ESTs 4.3 

408000 111690 Ks.620 bullous pemphigoid antigen 1 (230/240kP) 4.3 

441700 AA233556 Hs.126908 hypothetkal protein FU12994 4.3 

455000 AW8S0283 H5.324429 Homo sapiens CDNAFUl 401 5 lis. done HE O 

^_ 404767 4.3 

50 445189 AI936450 Hs.147482 ESTs 4.3 

452393 H87398 Ks.99858 cibosomal protein L7a ' 4.3 

428740 AA433838 gbawSSellrl Soares_totaLfetusJib2HF8. 4.3 

426830 AA385751 Hs.ig6379 ESTs. Wbakly similar to putaSve pi SO [H 4.3 

410615 AW772721 ^:hl9So01j(1 Na.CGAP.Thy8 Homo sapiens 4.3 

55 446619 AU076643 Hs.313 secreted phosphoptotein 1 (osteoponim, 4.3 

406073 4.3 

405692 4.3 

436033 H75391 H6.2S5748 ESTs 4.3 

410733 D84284 Hs.66052 G038 anSgen (p45) 4.3 

60 455587 BB)07829 8b:OV04N0147.280400-213^ BN0147 Homo 4.3 

459084 H01699 Hs.Z72B9 CX3I-125 ptoletn 4.3 

401189 4.3 

435451 AF195420 H5.303006 ESTs. Weakly similar to gamm»4ieregulin 4.3 

456407 AW968614 gb:EST380690 MAGE resequences. MAGJ Homo 4.3 

65 425733 F13287 Hs.159388 Homo sapiens done 23578 mRNA sequence 4.3 

447863 ALJ047611 HS.28B8B5 Homo sapiens CDNAFU14246 lis, done OV 4.3 

436659 AI217g00 Hs.144464 ESTs 4.3 

435463 AA682507 gb2j18f08.8l Scares Jetal.Gver.spleen. 4.3 

455675 BE065984 gb:RC3-8T0319^120200^14^6BT0319Homo 4.3 

70 439481 AF088294 H3.12S844 ESTs 4.3 

405287 4.3 

405784 4.3 

436461 AW511956 Hs.293261 ESTs 43 

437636 AA764781 Hs.291844 ESTs 4.2 

75 409629 AW449589 Hs.279724 ESTs 4.2 

412999 BE046255 gb:hn38g10j(2Na_CGAP.ROF2 Homo sapiens 4.2 

403281 4.2 

427531 AA405097 Hs.97957 ESTs 4.2 

OA 451862 AI821324 Hs.100445 ESTs 4.2 

oO 418656 AA362858 gb:EST72900 Ovary H Homo sapiens cONA 5 4.2 

405494 4.2 

456027 BE327387 Hs.13913 K1AA1S77 protein 4.2 

414539 BE379046 ob:601238646F1 N1H.MGC_44 Homo sapiens c 4.2 
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421106 
409076 
419563 
411688 
416614 
454434 
404526 
446393 
405302 



416972 
423841 
427099 
430484 
403895 
420457 
438993 
458421 
458722 
411382 
422373 
430749 
403625 
401887 
403667 
452744 
421085 
439294 
432792 
405443 
431169 
431822 
432328 
448324 
456536 
41S811 
411745 
438660 
449327 
425062 
433485 
434849 



AA877124 Hs.172844 ESTs 

N57559 Hs^73 hypothetical protein 

AA528235 Hs.193162 Homo sapiens cDNAFU 11 98363. done HE 

AW953440 gb:EST365510 MAGE iese(pi8nces.MAGB Homo 

183391 Hs.111849 ESTs 

AA083558 H5^1288 ESTs 

A1912555 Hs.157195 peptide YY, 2 {seirtnalplasinln) 

AW014174 Hs,301956 zinc finger pretain 

AL043482 Hs.267115 ESTs 

BE019670 gb:bb28c01 J(1 N)H_MGC_5 Homo sapiens cDN 

AW753967 obJ^C2-a03044I8010(M)1 1-h12 CT0304 Homo 

AB032953 Hs.173560 oddOz/len-mhomoIog 2(0rosophila,moiis 

D82880 Hs^41548 RASp21 protein activator 2 

AA482280 Hs.191656 ESTs 

AA826995 gb:od77b0a$1 Na.CGAP.0v2 Homo sapiens 

Af279978 Hs.22547 ESTs 

AA741545 H8.282832 ESTs. Weakty simHar to T24961 hypoUieti 

BE067246 ob:PM1-BT0348-151299^1-d04 BT0348 Homo 

AK001843 Hs.115700 Homo sapiens eDNA:FU23515 lis. done L 

AJ242956 Hs^960 avian mystocytomalosls viral relat 



422728 
445414 
408470 
429809 
453098 
402867 
431071 
436298 
440356 
419091 
422591 
426076 
443682 
444461 
430072 
413499 
439818 
443323 
424029 
455993 
420111 
403956 
410318 
426497 
430140 
457042 
450236 
417708 
428692 
413071 
437354 
403381 
425798 
459429 
426365 
430757 
430205 
433887 
444743 
404043 



A1267652 
AA329711 
AW975328 
AA448114 

AWg71240 
AA516049 

AI572739 

Ai571356 

AW135986 

AA450191 

AW867826 

U95740 

AI638743 

N57014 

AI493076 

AW292765 

AW937826 
AV653592 

AL162010 
Z25935 

AA491379 

AW293496 

At933184 

T85332 

L07648 

AW962714 

AI383061 

R53734 

X13294 

BE144884 

AL360137 

BE560621 

AB014594 

BE17908S 

AA255652 

W28077 

AA084050 

AA379913 

AW296771 

A}382130 

AW1 62998 

T90797 

A1372822 

BE064032 

AA749215 

AA364002 

AA278779 

AA376667 

A1458623 

AB025904 

AW204232 

AA045648 



Hs^504 Homo salens mRNA:cONADKFZp434E082(1r 

gb:EST33382 Embiyo^ 12 weeic li Homo sapl 
Hs^23 chromosome 1 open reading frame 12 
H8178950 pralocadherinbetal 

gb:EST383329 mC£ resequences. ^4AGL Honno 
gb:ng65d01.s1 Na.CGAP^Up2 Homo sapiens 
Hs.195471 &phosphofrudo-2-tdnase/tnictose-2,&b) 
Hs.34174 ESTs. Merately sifrilar to ALU8.HUliMN A 
Hs.257859 ESTs 

Hs.172963 Itrpotiwtical protein FU14624 

gb:li^RO^N0039^030(M)0lHi)2 SN0039 Homo 
Hs.6349 Homo sapiens, done lly)AGE:3010666. mRNA, 
Hs.224672 ESTs 

HS.75B74 ptegnancy-assodalsd plasma protein A 
Hs.201997 aldo-keto reductase famity 1, member C2 
Hs.8053 ESTs 

H5.103262 ESTs, Weakly simBar to ZN91_HUMAN ZINC 
Hs.146105 ESTs 

Hs^23603 Homo sapiens mRl^iA; cDNA DKFZp761D09121 ( 
Hs.88379 ESTs 



Hs.160138 
Hs.127922 
Hs.178294 
Hs.118630 

Hs.47248 
Hs.25978 
H8^00592 

Hs.19934 
Hs.9222 
Hs.137579 



Hs.79369 
Hs.269259 

Hs^21999 
Hs.97703 
Hs^4684 
H5^68823 
Hs.1 10103 

Hs.291886 



gb:aa65fD5.r1 NO CGAP.GC81 Homo sapiens 
ESTs 

ESTs. Moderately simitar lo S65657 alple 
ESTs 

MAX-InteFBcUng protein 1 
gb:EST374787 MAGE resequences. MAG6 Homo 
ESTs, H]Q)% slmBar to similar to Cdc14 
ESTs. WeaUy simliar to 2109260A B ceH 
Y^ttj/b avian myeloblastDds virat oncogen 
gbKM)-HT0182-041099-06&«11 HT0182 Homo 
Homo sapiens mRNA fuH length Insert cDN 
estrogen receptor binding site assoddte 
1^0694 gene product 

gbJlCO4IT0613-140300^1-d06 Hr0613 Homo 
gb2s21li11/1 NCt.CGAP.GCBI Homo sapiens 
nel (chidcenHiite 2 

ESTs, Weallly similar to S23850 retroviru 
Gb:EST92807 Sidn tumor I Homo sapiens d) 
ESTs 
ESTs 

KIAA1376 protein 
ESTs 

RNA polymerase i tiansctfptton (iaclor RR 
gb:QV3-8T029M10300>111-b08 BT0298 Homo 
ESTs 



gb:EST74529 Pined gland II Homo sapiens 
H$.335696 EST 

Hs.10283 RNA binding moQf protein 88 

gblk04g09j(1 NCt_CGAP»Lu24 Homo sapiens 
Hs.2351 68 carbonic anhydrase XIV 
Hs.279522 ESTs 

Hs.301957 niKfix (nucleoside diphosphate linked md 



4^ 

4.2 

4.2 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

3.9 

a9 

19 
3.9 
3.9 
19 
19 
3.9 
19 
19 
19 
19 
19 
19 
19 
3.9 
19 
19 
19 
3.9 
19 
19 
16 
16 
18 
18 
16 
18 
IB 
18 
18 
16 
16 
18 
18 
18 
18 
IB 
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AA/Uo4oo 


Hs.120127 




AITCOQfUC 


HS.20G063 




Dn7ilM 

nUf4o3 


ns.i8(Mo1 












LLa 44ieA J 


WUIWI 


AF1 73937 


Hs.109494 


425937 


WWLWIJZ40 


nS.io3B46 






Hs.1 45990 




AAUUyMf 


nsJogO 




HD2953 




4ooOZO 


AA778177 


Hs.1 21724 


4^4SHy 


AA9o5520 


Hs.23575 


4zo447 


AV655843 


Hs.1 69919 


441416 


A1990139 


Hs.148609 


456443 


AW967500 


Ks.133543 


402112 


R58824 


Hs^188 


404453 






451421 


W16522 


Hs237689 


421037 


A1684808 


Hs.197653 


4Z70o8 


AA398085 


Hs.1423g0 


453375 


A1990114 


Hs^40091 


453530 


Avv021633 




406964 


M21305 




432291 


AK001108 


Hs.274274 


449623 


C00719 


Hs. 120440 




W0329o 


Hs.1 93521 




Ai591222 


Hs.72325 


4(^271 








Ai953718 


Ks.221649 


4UVMW 






44yoU4 


AIo35563 


H3.39379 


44JoUO 


Ai050693 


Hs. 13331 8 


411100 


AVVo21257 




4Z4oo5 


AWi02723 


Hs.75295 


432189 


A A cntn J 4 

AA527941 










41 fSio 


AA2U92U5 


Hs.1 63754 


436026 


AI349764 


Hs,217081 


ATOUtA 
4<90Q4 


A A ACIWin 


Hs,286 


4l039£ 


X99226 


Hs.284153 


44^1 U 


Alices on 


Hs.1 1307 


446304 


AWl 04432 


Hs.1 49751 


441216 


BE299830 


Hs.192908 


421494 


Ai763322 


Hs.1 521 04 


404476 






416327 


R99822 


Hs.36172 


414145 


BE549372 


HS.317S96 


417401 


A A AA0A#10 

AA425026 


Hs.187615 


401200 






411560 


AW851166 


Hs.179909 


426306 


AA447310 


Hs.164059 


437918 


AI761449 


Hs.1 21629 


447917 


AL048037 


Hs.164568 


42i32o 


BE466506 


Hs.3981 


447290 


AI476732 


Hs.263912 


417229 


AA975096 


Hs.19522 


425403 


AL023753 


Hs, 156406 


403515 






419917 


AA320068 


H8.93701 


435554 


AF20B5u2 


H&18S708 


44U4HI 


U50525 


Hs.98201 


410500 


R09442 




439326 


W07140 


Hs.54721 


426296 


R14454 


Hs.5921 


411311 


AW036491 




418019 


R68911 


Hs. 176275 


417490 


A A lA^flP 

AA203335 




423035 


AW449679 


H5.1 56739 


416575 


W02414 


H3.38383 


414400 


)Q)6948 


Hs.897 


418405 


fXnJOOcOC 


Us 1 1fiOO 


450350 


T97817 


Hs.174880 


451704 


AI755209 


Hs.205616 


421013 


M62397 


Hs.1345 


407404 


AF040257 




423121 


AWB6484B 




430533 


AA480S95 


Hs.201552 


457141 


AA521410 


H8.41371 


411772 


BE170301 




440737 


AJ375167 


Hs.132221 


452728 


AI915576 


Hs.239708 


423266 


AA323875 


Hs.193574 



Homo sapiens cDNA: RJ227G9 lis, done K 

ESTs 

ESTs 

glKnf76g1lAl Na.CGAP.Co3 Homo sapiens 
ESTs 

secreted protein of unknown fundbn 
putative N6-DNA^thyilFsnsfer8Se 
ESTs. Weakly simSar to 138022 hypothefl 
a disintegiin and metaOoproteinase doma 
0l):yr47fl}6.r1 Soares fetal Over spleen 
ESTs 
ESTs 

electron4ransfar-fIavopntein, dpha po 

ESTs 

ESTs 

eukaiyoSc translatkm etongaOon factor 

Homo sapiens cDNA FU1 3539 lis. clone PL 

ESTs 

ESTs 

ESTs 

gb:d^26c0^y1 Morton Fetal Cochlea Homo 

gb:Human alpha satelBtsand sateOfte 3 

hypothetical protein FU10246 

EST 

ESTs 

Human ONA sequence from done RPM87J11 

ESTs 

ESTs 
ESTs 

gb:PM3^T0307-231299^1-b11 ST0307Homo 
guanytate cyclase 1, soluble, alpha 3 
gb:nh30c04.s1 NCLCGAP_Pr3Homos8()lens 

hypothetica) protein FU12606 
ESTs 

nlxisoma) protein L4 

FanconI anemia, complementallon group A 

ESTs. WsaUy8lmflartoT18326 hypothefl 

ESTs 

ESTs 

ESTs 

ESTs 

Homo salens cONA FU12927 lis. done NT 
ESTs 

hypothetical protein FU22995 

Homo sapiens cDNA FU13338 lis, dbne OV 

ESTs 

ESTs. Moderately sNlar to neuronal (hr 

ESTs 

ESTs 

hypotheOcai protein PR02849 

Human DNA sequence from clone 1198H6 on 

Homo sapiens mRNA; cONA OKFZp434E232 (fr 
early B^l factor 

Human BRCA2 regton, mRNA sequence CG029 
gb:yf26c09.r1 Soares fetal Over spleen 
ESTs 

Homo sapiens cDNA: FU21592 fis. done C 
gb:PM3-LT0032-281299O02-fl)2 LT0032 Homo 
ESTs 

gbx(S8g02/1 SoaresjBtaUiverjspleeiu 
H.8apiens XG mRNA (done PEP11) 
ESTs 

Fc fragment of IgE. h^ afiinlty I, Fee 
ESTs. Highly similar to KIAA1 370 protein 
ESTs 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

mutated in colorectal cancers 

gb:Hcmo sapiens TNF recefrtor homotog mRN 

gb:PM2-SN0O18-2gO30aO03<O9 SN0018 Homo 

ESTs. WbaMy sfmiar toTl7288 hypothefl 

ESTs 

gb:QV44nO53SO4OS00>19»0S KT0636 Homo 

hypotheScd protein FU12401 

ESTs 

ESTs 
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413543 AA130228 Hs.3246t1 ESTs, Modersldy simOar to ALU8.HUMAN A 15 

454447 BE163557 gb.-QV3-HT0460.230200.101^ HT0460 Homo 3.5 

458067 AA393603 Hs36752 pratebi kinase anchoring protein GKAP42 3.5 

_ 437608 AA76160S Hs.292308 EST$.Wtealdy similar to AlULHUMANAUUS 3l5 

5 415549 F11942 gls:HSC33F061 nonnanzedlnlanttiralncON 15 

420910 AL049437 H3.100292 Homo sapiens mRNA:cONADKFZp588E11 20 (( 3.5 

435793 AB037734 Hs.4993 K1AA1 31 3 protein 3.5 

453211 W84829 gbczh53l04^ Soaresj8taUh«L$pte8iL 3.5 

418717 A1334430 HsJ6984 ESts 3.5 

10 400641 3.5 

442973 BE58766S Hs.28&550 KOmo sapiens cDNA:FU231S6 lis. done L 3.5 

418007 M13S09 Hs.83169 matiixmetaHopioteinase 1 (intBfsGflal 3.5 

440364 AA910460 Hs.128626 ESTs 15 

458340 AK57102 Hs.6986 Human glucose transporter pseudogene 15 

13 412261 A1810054 Hs.14119 ESTs 15 

443204 AVa05878 Hs.29643 Ho(no8apienscONAFU131031is,ctoneNT 15 

416616 H68270 gbnfr81h09.r1 Soaresfetel liver spteen 3.5 

444338 AI937028 Hs,146642 ESTs 14 

436946 AW137748 Hs.125956 ESTs 14 

20 431632 AK000992 Hs.333144 Homo sapiens cONAFU10130fis. clone KE 14 

403306 NML006825 Hs.74368 transmembrane pratetn (6310)). endoplasmi 14 

422093 AF151852 Hs.111449 CGI-94 protein 14 

42B816 AA004986 Hs.193852 ATP4)IndIng cassette. sutManiilyC(CFTR 14 

442137 AA977235 HS.12883D ESTs. WeajdysimDar to Z19U1UMAN ZINC 14 

25 405970 14 

409434 AFZ78761 Hs.131S81 Homo sapiens testis transcript Y 7 (TTY7 14 

416100 H18700 Hs.26B799 ESTs 14 

431418 X68242 Hs.252722 Hb-I 14 

^_ 431954 AK001974 Hs.272242 hypothetical pmleinFUl 11 12 14 

30 440388 AI693520 Hs.223000 ESTs 14 

421072 A121S069 Hs^113 ESTs 14 

424578 AK001973 Hs.1S0890 hypothetical protein 14 

436331 AI239495 Hs.120189 ESTs 14 

444063 AI122614 Qb;q896l)05Ji1 S08tBS.feteLheait.NI)HH19W 14 

35 444453 AW379394 Hs.145126 ESTs 14 

404196 14 

421262 AA286746 Ks.9343 Homo sapiens cONARJ 14265 tis. done PL 14 

409555 AW410788 Hs.256185 ESTs 3.4 

417669 T99898 ob7e68g01 /I SoaresfelalDver spleen 14 

40 416057 AI927382 Hs.29857 ESTs 3.4 

425208 NIVI_002153 Ks.155109 hydn>xystsro)d(17-fael^ dehydrogenase 2 14 

447738 AI871000 Hs.161330 ESTs 14 

430664 AWg69834 Hs.303303 ESTs 14 

... 411377 AW841462 gb:RC6-CN0014^)80300.012-B09CN0014Homo 14 

45 415769 H94186 Hs^912 Rx)X only protein 7 14 

429382 Ai79124g Hs.278054 ESTs. Weaidy similar to 138022 hypotheB 14 

431474 AL133990 H8.190642 ESTs 14 

456908 AI953671 Hs.220994 hypothetical pntelnRJ14129 14 

442826 AI018777 H8.131241 ESTs 14 

50 400608 14 

436111 Ai803082 Hs.157212 ESTs 14 

452807 AA028933 Hs.162434 ESTs 14 

436577 Vmm Hs.17643 ESTs 14 

412209 AW901456 gb:RO0^1012-27O30(M)3lH^ NN1012 Homo 3.4 

55 417153 X57010 Hs^1343 collagen, type ii, alpha 1 (primary oste 3.4 

423871 AA331906 gb:EST35805 Embryo, 8 week I Homo saplen 14 

447516 W05355 Hs.102971 hypolheSca) protein FU 14751 14 

409623 AW4491B5 gbcUHiBI3^kg«054>4Jl4l Na.CGAP.8u 14 

416182 NM.Q04354 Hs.79069 cydinG2 14 

60 420854 AW296927 gb:UI-H-8VV(Mf&<>07-O-UU1 NCt(:GAP.8u 14 

422899 D16471 Hs.121S71 Human mFQU. Xq terminal porOon 14 

432404 AA535246 Hs.50852 ESTs 3.4 

458S95 AV660159 Hs.282284 ESTs. Weakiy dmHar to 136022 hypotheti 3.4 

440727 AI073991 Hs.134268 EST5,WeaWysimfl3rto2109260ABceD 13 

65 428766 AA477989 Hs.98800 ESTs 13 

439567 A1056618 Hs.134314 ESTs 3.3 

456231 H731B3 Hs.129885 ESTs. Weakly similar to 200439gAdtfomos 3.3 

454318 AW367764 fe.7857 erythrocyte membrane protein band 4.141 3.3 

„ 411966 AA099113 Hs.118609 ESTs 13 

70 443844 Ai08O491 Hs.93270 ESTs. Moderately Mlar to S65657eipha 3.3 

437037 T63804 gb7c21e09.r1 Strategenehing (937210) H 3.3 

407664 AW063476 Hs^9080 ESTs 3J 

405780 13 

426567 AA3ai579 Hs.182962 ESTs » 3.3 

75 400432 AX015809 Hs.287757 Sequence 8 from Patent WO9950285 13 

403356 13 

404516 Ai815601 Hs.79197 C083 antigen (activated B lymphocytes, i 13 

413581 BE150618 gb:RC3-HT0272.11010(W)13K:06HT0272Homo 13 

0. 429875 AI091815 gb:qa58b06.s1 Soares.NhHMPu.SI Homosapi 13 

OO 433785 BE044593 H$.112704 ESTs 13 

437876 AA7701S1 Hs.126424 ESTs 13 

444870 AI200621 Hs.l48504 ESTs 13 

453324 W26592 K5.232089 ESTs 13 
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437963 BE395279 gb:6013097&5F1 N!H.MGC_44 Homo sapiens c 3.3 

425361 AA355933 Hs.132221 hypothaScal prol^ RJ12401 3.3 

408813 AiS8009O Hs.48295 RNAheDcasefamSy 3.3 

^ 426692 AK001751 K5.171635 hypolheScal pfolBin FU10889 3.3 

5 407456 AJ237589 gL'Honn sapiens mRNA for T-boxtransoip 3.3 

433183 AF231338 Hs.222024 transcripSon factor BMAL2 3.3 

438168 AK000883 Hs.301645 Homo sapiens cONA PU10021 &. done KE 3.3 

438456 AA913381 H5.190513 ESTs 3.3 

453242 798327 H5.18343 ESTs 3.3 

10 415131 061119 gb:HUM1S8C1 IB Oontech human fetal brain 3.3 

412040 D86519 Hs.73086 neuropepQde Y receptor Y6 (pseudogene) 3.3 

435070 A1821270 Hs.285643 Homo sapiens cONAaJ14364fis, done HE a3 

444443 A)1492E6 Hs.S5099 rab6 GTPase acthrating protein (GAP and 3.3 

434001 AW9S0905 Hs.3697 seiine (or cysteine) proteinase Inllibito a3 

15 454145 AA046872 H3.627g8 ESTs 13 

405264 3.3 

411849 AW964970 Hs.18881 ESTs. Irately slmflar to KiAAl 276 pio 3.3 

416816 T71168 Hs.119567 ESTs, WeaWyshwIarto A47582BH»Dgr 3.3 

435325 AI038388 Hs.l 19309 ESTs 3.3 

20 440184 AB002297 Hs.7022 dedicator of cyto^dnesis 3 3.3 

428356 AL046991 Hs.10338 ESTs 3.3 

429216 AI369472 Hs.65407 ESTs 3.3 

429106 AA446&12 gb2w85g07.s1 Soares.tola}_fBtus.Nb2HF8. 3.3 

405720 3.3 

25 400889 3.3 

416294 D86980 Hs.79170 KIAA0227 protein 3.3 

422094 AF129535 Hs.272027 F-box only protein 5 3.3 

425374 AI904013 gb:MR'eT041.220199-104BT041 Homosapien 3.3 

418122 R42778 Hs.22217 Homo sapiens done IMAGE:321 06, mRNAseq 3.3 

30 427374 Ai150033 Hs.1 43666 ESTs 3.3 

443367 AW071349 Hs.215937 ESTs 13 

446645 AI336596 Hs.1S6294 ESTs 13 

457604 AI004397 Hs.334552 Homo sapiens cDNAFU14930fis, done PI 13 

452291 AF015592 Hs.28853 CDC7(ceIi(fivlstoncyde7, S.cerevisi 13 

35 410481 R34107 Hs.321450 pregnancy spedfic beta-1-giycoprotein 2 13 

458885 AA411303 H5.30022 ESTs, Vfeaidy similar to NAH6JiUMANS0DIU 13 

429608 U49250 HS.210B62 T^ brain. 1 12 

437454 AL390159 Hs.269812 Homo sapiens mRNA;cONADKFZp76m«>415(f 12 

430503 AA533574 Hs.152274 ESTs 12 

40 432839 AA579465 H$.45207 hypotleQcal protein KiAA1335 12 

421698 T69677 Hs.324323 ESTs 12 

412321 AW936913 gb:RC1-DT0029^)3020O4)12^8OT0029Homo 12 

422219 AW976073 H5.IOIO regulator of mitoflc spindle assembly 1 12 

454982 AW847645 gb:il^0213.28010(W56A04 CT0213 Homo 12 

45 441705 AI087052 Hs.55993 ESTs 12 

403619 12 

435608 AW183971 Hs.250896 ESTs 12 

426701 AI968103 Hs.209461 Homosapiens cDNAFUl 2836 fis. done KT 12 

401132 12 

50 407764 BE008347 gb:CM0-BN0154-080400^25^04BN0154Homo 12 

409425 U40462 H3.54452 zinc finger protein, subfamily 1 A. 1 (Ik 12 

428004 AA449563 Hs.151393 giulamat&cysteine ligase, catalytic sub 12 

443603 6E502601 H8.134289 ESTs, W^aldy similar to tOAAl 063 protein 12 

419935 AI792788 gb:ol91d05.y5Na.CGAP_Kld5 Homo sapiens 12 

55 455571 BE003714 gb:QV3^N0096-200400>161-a01 BN0096 Homo 12 

408592 12 

446530 AV658909 Hs.28^2 ESTs 12 

454466 AA984138 Hs.155101 ATP synthase^ H^transpOffing.fflitodwnd 3.2 

401449 12 

60 431196 AW974436 Hs.154929 ESTs 12 

422183 AA431698 Hs.112794 Human ONA sequence from done 1068E1 3 on 12 

459459 AA460445 gb2x66h1U1 So8res.totaUelus.Nb2HF8. 12 

421308 AA687322 Hs.192843 toudne zipper protoinFKSGU 12 

427335 AA448542 HS.2S1677 GBnllgen7B 12 

65 455236 AW875972 gb:CM3-PT0014^7129&OS1-b05Pr0014Homo 12 

425156 AA351364 flb:EST5g099 Infant brain Homo sapiens cO 12 

404588 12 

413087 BE0646S5 gb:RC1-6T0313^12994)12-c09BT0313Homo 3.2 

444910 AI201849 gb:qs76g04jc1 NCLCGAPJ'i^B Homo sapiens 3.2 

70 426660 NM_002719 Hs.171734 protein phosptiatase 2. regulatoiy subuni 12 

438315 R56795 H3.82419 ESTs 12 

425523 AB007948 Hs.158244 KIAA0479 protein 12 

419340 AA235590 Hs.87530 ESTs 3.2 

425636 AK001243 Hs.158370 liypdhetica] protein FU10381 12 

75 430553 AW392821 gb:C)M-ST027&021299<)534i09ST0275Homo 12 

457030 AI301740 Ks.173381 dlhydropyr1mldinase«e 2 12 

447375 A1376860 H3,257822 ESTs 12 

408334 AW514652 H3.321637 ESTs 12 

410085 AA428482 Hs.58589 glycogenin2 12 

oU 410536 1^9533 gtKyvZ7d04x1 Soaresfotol Over spleen 12 

448495 AW136516 Hs.208515 ESTs 12 

405634 12 

431098 AV^1465 Hs249230 ribonudease L (^.S'-oQgdsoadefiylate 12 
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15 
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421581 


U89331 


Hs.105932 


440633 


AI140686 


Hs.263320 


453264 


AA034137 


H9^1955 


411030 


Avvo5557d 




419169 


AWB51K0 


Hs.262348 


426591 


AA431127 


Hs.98665 


446S66 


001448 


Hs.300511 


452401 


NM.007115 


Hs.29352 


455170 


AWB60972 




4lo2Uo 


AVV291168 


Hs.41295 


423657 


Au>45128 


Hs.1691 


400816 






410307 


AR)22913 


Hs.62187 


440046 


AW402306 


Hs.6877 


452824 


W27643 


H5.73965 


400315 


U46120 


Hs.193392 


411965 


Bt467339 


Hs.280115 


416316 


H58721 


Hs.271628 


400613 






414819 


BE177320 


Hs.156148 


434833 


AF156548 


Hs.192969 


416693 


AI750878 


Hs.87409 


416258 
405033 


N4S661 


Hs.90011 


415273 


Z39840 


Hs.22229 


450519 


AA010066 


Hs^24849 


422654 


AA314316 


H5.163725 


414605 


BE390440 




400441 


Ml 5530 


Hs.99679 


402790 






438563 


A A04 A^ffP 

AA810665 


Hs. 134746 


447524 


D80449 


Hs.45177 


448835 


Bk277929 


H&11081 


415979 


HI 6427 


H8.271501 


434479 


All 38213 


Ks.1 62035 


426724 


AA3ojo23 


Ks.293616 


418105 


AV\/937488 


Hs.178000 


405608 






406506 






421216 


AV649262 


Hs. 102664 


452755 


AW1 38937 


Hs.213436 


404288 






429678 


AA460188 


Hs.127263 


439834 


Ar754576 


Hs.l 24523 


454564 


AVV807573 




450491 


BE045604 


Hs.202301 


409920 


BE169746 


H5.12504 


400579 






402953 






404285 






426890 


AA393167 


Hs41294 


457770 


BE065030 


Hs.124179 


435477 


BE218708 


Hs.1 17270 


436391 


AJ227692 


Hs.146274 


456083 


U46922 


H8.77252 


416421 


AA134006 


Hs.79306 




API 10002 


H$.233363 


449238 


AA428229 


Hs.331561 


452605 


AVV968557 


Hs.90012 


455323 


AW752389 


HsJ7296 


429628 


AB019494 


H1225767 


423454 


AL1 10456 


Hs.469 


452762 


AW501435 


Hs^8582 


401344 






455511 


d A^0A 

BE144762 




455^ 


AW8B6156 




433132 


AB026264 


H1Z84245 


423800 


AI633559 


HsJ10359 


427667 


AKD01279 


Hs.180171 


407257 


AB006834 




457041 


AA399018 


Hs^50835 




AL1 35462 


Hs.104715 


459082 


AA059246 


Ks.1 10293 


436475 


R56805 


Ks.86149 


411622 


AI807B94 


Hs.47274 


417489 


AW953341 


Hs^3 


418454 


AA315308 


Hs.195870 


409699 


BE154650 




438394 


BE379623 


Hs^7693 


443741 


AW451759 


Hs.145420 


405090 






432267 


AK000872 


Hs^4227 


445409 


AI949081 


Hs.147662 



short stature hofneotxn 

E8T8 

ESTs 

gb:CM4^T0278>221099WHJ01 CT0276 Homo 
ESTs. WeaUy sfrrilar to 872482 hypotheS 
ESTs 
ESTs 

tumor necrosis factor, otplia^uced pro 
gbK3V0.CT0387-18030O-167-hO7 CT0387 Homo 
ESTs. Weakly stmOar to MUC2_HUMAN MUQN 
glucan (1,4^lia-). Iwanching eniyme 1 . 

phospltaSdyllnosltol gt/can. dass K 

hypothelica)pnMaJ10463 

splicing factor, afglnine/torine^ 2 

Human expressed unknown mRNA 

ESTs 

ESTs 

hypolhelica] protein FU13231 

ESTs. Weakly stmOar to ATlAJftJMAN POTEN 

tlirombospondin 1 

adenytosuodnate synthase 

ESTs 

Homo sapiens cDNA FUI 2583 lis. done NT 
ESTs 

gb:601283601F1 NIH Jil6Cj44 Homo sapiens c 
B^xO growth fador1(12kO) 

ESTs. Weakly similar to A46010 X-Onked 
ESTs 

UBX domain-containing 2 

ESTs, Weakly similar to 154374 gene NF2 

d^ory receptor, famity 52. subfamiiy 

ESTs 

ESTs, WBakly similar to FV1 MOUSE FREND 



veskde-assodaled membrane protein 4 
ESTs, Weakly slmDar to A34087 hypotheU 

ESTs 
ESTs 

gb:MR1-ST0086^2129^04^01 ST0088 
ESTs 

iftely orthdog of mouse Arfcadia 



ESTs 
ESTs 

hypotheOca) protein FU14345 
ESTs 

fragile tvstkllne triad gene 

eukaiyotk: transiatton iniSaOoo fadof 

guanytate cyclase acSvator 1C 

musde^pecHlc I^N&finger protein 3 

hypothetical protein FU23441 

Homo sapiens cDNA FU20269 lis, done HE 

WH2 protein 

suodnate dehydrogenase complex, subunit 
>wW murine thymoma vtrd oncogene homo 

gb:CM0-HT018aO41099^b04 HT0180 Homo 
gb:RC5OT0078-15030(M)21-E08 OT007e Homo 
hypothetical protein IMPACT 
ESTs 

Homo sai^ cDNA FU10417 fis. done NT 

gb:Homo sapiens mRNA for HRV Fab N6-VK 

ESTs 

inversin 

ESTs 

phosphdnosltd ^phosphdto4dixlbig prol 
Homo sapiens mRNA; cDNA 0KFZp564B176 (Ir 
ESTs. WeaWy simliar to Aia)l_HUMAN AUJ S 
hypothetical protoinFU14991 
gb:PM341T03444)71299O03H:08 HT0344 Homo 
pepSdylprolyl teomerasa (cydophffinH 
ESTs 
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449347 AV649748 Hs.295901 KIAA0493 protein 3.0 

428301 AW828666 Hs.98440 ESTs.Wtaklysbi«ar Id 138022 hypottiati 3.0 

406384 10 

- 418409 AA219332 Hs.120869 ESTs,Wbak)ysfanQartoR107.HUMANm^ 3.0 

5 427050 AA397789 Hs.161803 ESTs 3.0 

431778 AL080276 Hs.268562 regulator of Gfrot^signaUbg 17 3.0 

448405 AW207634 Hs.170849 ESTs 3.0 

429846 AB023021 Hs,225945 fucosyllransfefase9(alpha(1^fucosy 3.0 

438165 AA779344 Hs,138136 ESTs, Weakly slmflar to 151 0254A LI repe 3.0 

10 418888 AU076801 Hs.89438 cadherin 17. U cadherin (fiver-intestin 3.0 

418432 M14156 Hs.85112 insulin^ growth factor 1 (somatomsdl 3.0 

426424 BE081745 Hs.272168 Homo sapiens cONAFU12090fis. done HE 3.0 

419505 AA243660 Hs.143061 ESTs 3.0 

403743 3.0 

15 415452 F09134 Hs.12839 ESTs 3.0 

428579 NM.005756 HS.1B4942 6 prote!iH»upled receptor 64 3.0 

447048 AA326197 Hs.17170 GpiQl8iiKOUpledieceptor4 10 

455851 BE146879 gb:QV4-HT0222-2610994}14<11 HTQ222Hoino 3L0 

400227 10 

20 438219 AK001695 Hs.146589 hypoOwIica) protein FU10701 10 

439037 AF075084 gb:Honio sapiens full lenOtti insert cDHA • 10 

439693 AI741816 Ks.125897 ESTs 10 

431292 AA370141 Hs^281 chromogrBnin 6 (secretogranin 1) 10 

403513 10 

25 425745 U44060 Hs.14427 Homo$ap{enscONA:aJ21800 lis. done H 10 

440122 A1733011 Hs.127678 ESTs 10 

446446 AI5212S1 Hs.17l030 ESTs 10 

422563 BE299342 Hs.19348 tiypotheUcal protein FU13119 10 

448130 AW271635 Hs.l70717 ESTs 3.0 

30 42028B AW071225 Hs.245556 ESTs 10 

428201 AM24158 Hs.206461 ESTs 3.0 

436643 AA757626 Hs.10941 ESTs. Weakly simitar to IPPl.HUMAN PROTE 10 

448966 AW372914 Hs.86149 phosphoinositol 3i}hosph8t&l)inding prot 10 

^_ 404513 10 

35 412074 S74683 Hs.73139 ADP-ritnsyitraRslerasa 1 10 

407762 AW235638 H5JE9475 ESTs 10 

403396 10 

436938 AW139680 Hs.161393 ESTs 10 

458090 AI282149 HsJ6213 ESTs. Higtily similar to FXD3JiUMANF0RKH 10 

40 400706 zg 

432779 AW979241 gb:EST391 351 IMGEfe5equences.il/IAGPH0m0 Z9 

444600 R41398 Hs.6996 ESTs Z9 

403786 Z9 

430187 AT799909 Hs.156989 ESTs 2.9 

45 451700 AI470262 Hs.29553 ESTs 2.9 

456649 R74441 Hs.117176 pc)y(AH)ind!ng pmtelrt, ruidear 1 Z9 

457503 AW970244 Hs.162tB8 ESTs 2.9 

446251 AW867156 Hs.28^ ESTs. WeaMysiniilv to 138022 tiypottiati Z9 

_^ 406327 Z9 

50 434671 R34758 gb:yg51g02.r1 Soares hfanl Ijraln INiB H Z9 

430175 AA46B724 gb:nB09a06.s1 Na.CGAP.Co3 Homo sapiar^s 2.9 

454186 BE141030 gb:MRO-HT0067.201099402-li11 HT0067Homo Z9 

449459 BE546846 Hs.19S048 ESTs 2.9 

435934 R19382 Hs.117869 ESTs 2.9 

55 400325 M85292 Hs247924 Homo sapiens endogenous HlV-1 related se 2.9 

408408 AF070571 Hs.44690 Homo sapiens done 24739 mRNA sequence Z9 

414514 BE327385 Hs.280187 ESTs 2.9 

423717 AA330036 Hs.152003 ESTs Z9 

^_ 424152 AL133591 Hs.141480 Homo sapiens mRNA; cONA DKFZp434N079 (If 2.9 

OO 430982 R17432 HsJQ217 Homo sapiens done IMAGE:321 06, mRNA seq 19 

424728 AK001007 Hs.138760 Homo6a;teRScONAFU10145fi5.clonaHE 2.9 

456166 W26642 gb:34b8 Human retina cONA randomly prime 2.9 

412222 AA52B283 Hs.292737 ESTs 2.9 

459201 AW391177 gb:MR3^T0203.2212994)23405 ST0203 Homo 2.9 

65 422511 AU076442 Hs.117938 cdlagen. type XVH. alpha 1 Z9 

435579 AI332373 Hs.156924 ESTs 2.9 

417027 AA192306 Hs.23926 triadin Z9 

415533 T74009 Hs.268738 ESTs. WeddysimBar to AiaJ7.HUMAN ALUS 2.9 

44S909 BE262656 Hs.32603 hypotl»6cal protein MGC3279 simiiar to 2.9 

70 418343 AA216372 Hs.159501 ESTs 2.9 
459440 BBM8054 gbiz46d}ly1 NCLOGAP^52Homosap{en 19 

403341 2.9 

445635 AI769774 Hs.209e31 ESTs. WeaMy similar to ALU1JHUIUIAN ALUS 2.9 

453830 AA534296 Hs.20953 ESTs Z9 

75 455886 BE149024 gbKaM)-KT0249'29109»)84«D4 Hr0249 Homo Z9 

407676 AW064111 Hs.279823 ESTs 2.9 

437913 A1140825 Hs.121623 ESTs 2.0 

443458 R05385 Hs.143509 hypothetica] protein FU2t924 Z9 

457049 AW631495 Hs.27135 Bhx» receptor-BSSodatad protein BAP29 Z9 

SO 400491 H2S530 Hs.50868 sdutecarrter family 22 (organic calion Z9 

456189 H91010 Hs.44940 ESTs Z9 

441874 AA970389 Hs.128055 ESTs Z9 

416483 H58311 HS.16S077 ESTs Z9 
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420879 N31165 K5^8837 ESTs, WeaUysiniilartDS43603RNAblnd) 2.9 

446447 AI300402 Hs^SO ESTs Z9 

4399S3 AA91S129 K5.124638 ESTs Z9 

400643 2.9 

5 436594 AI4199a2 Ks.156189 ESTs Z9 

43B402 D16902 Hs.42915 ARP2(acfiiHel8led protein 2, yeasi) ho 2.9 

451353 N21043 Hs.42932 ESTs ZB 

419791 AI579909 Hs.105104 ESTs 29 

41 5628 F13080 gb:HSC3ID041 normalized inlant brain cDN 2.9 

10 423637 AL137279 Hs.130187 Homo8apiensmRNA;cONADKFZp43401214(( Z9 

454747 AWBieSSS gb:RC1-ST027&-14030(H}16-(05 ST0278 Homo Z9 

452778 R71338 H$.5921 Homo sapiens cDNA: aJ21 592 fis. done C Z9 

457178 AL039101 Hs.1946» dynein. cytoplasmic. QgMintemnedlalB Z9 

401526 2.9 

15 408751 N91553 H52S8343 ESTs 2.9 

414140 AA281279 Hs.23317 liypoOieScal piotein FU14681 19 

417320 AA195667 Hs.86022 ESTs 2.9 

442927 A)024347 Hs.131519 ESTs 29 

444125 A)124fl82 HS.11B121 ESTs 2.9 

20 453901 BE065902 gb:RC2-BT0318-15020aO1 1-609 BT0316 Homo 2.9 

421847 NiyL014717 Hs.108884 KIAA0390 gens pnxJuct 18 

434424 AI8112Q2 H8.325335 Homo sapiens cDNA:FU23523 lis, done L 2.8 

422225 BE245652 Hs.118281 zinc finger prolein 266 2.8 

403011 2,8 

25 405170 2.8 

435878 R08330 Hs.20152 ESTs 2.8 

436194 AK001074 Hs.33343S Homo sapiens CDNAFU1021 2 fls. done HE 2 8 

400394 AF040257 Hs.283818 Homo sapiens TI4F receptor inmOt08mRN^ 28 

411244 AW833768 gb:QV4.TT000B.13010(W77-e08TT0008Homo 28 

30 441817 AW969708 Hs,293332 ESTs 28 

456118 AA380267 Hs.78277 DKFZP434F2021 protein 28 

417801 AA417383 Hs.82562 inlegrin. beta-Oka 1 {wiUi EGF-IOcs rep 28 

442717 RB8362 Hs.180591 ESTs. Weakly similar to T23976 hypotheU 28 

402131 28 

35 428959 AF100779 Hs.194680 WNT1 indudble signafing pathway protein 28 

438160 AA779332 Hs.122671 ESTs 28 

407594 AW057584 H&.1606ai ESTs 28 

417877 AI025829 Hs.86320 ESTs 2,8 

439235 N45513 Hs.4660B ESTs 28 

40 451257 AA016255 H5.31856 ESTs, Weakly similar to KIAA1453 protein 28 

437113 AA744633 gb:ny26c10.s1 NCLCGAP.GCBI Homo sapiens 28 

430882 BE174240 Hs.79024 heterogeneous nudear nlnnudeoprotein 28 

409978 D31897 Hs.57714 double CMike domains, alpha 28 

410672 AW794600 gb:RC8-Ulira014.17030&022-C05UM0014Homo 28 

45 412236 AW902S83 gb:Q\/3-NN1024-260400-171-f10 NN1024 Homo 28 

417827 T79366 Hs.108258 actin binding protein; macropltin (microf 28 

420206 M91463 Hs.95958 solute carrier family 2 (fadlitatedglu 28 

449676 AW380579 Hs.209657 ESTs 28 

454778 AW820199 gbXW2^0296-190100.029^7ST0298Homo 28 

50 451203 AWO70604 Hs.46517 ESTs 28 

450180 AW449644 Hs.257l82 ESTs 28 

409432 D49372 Hs.54460 small indudUe cytokine subfamily A (Cy 28 

442264 AI278777 Hs.263455 ESTs, Wedklyshniter to ALU 1J1UMAN ALUS 28 

450003 AA777809 Hs.191995 ESTs 28 

55 401602 28 

413986 Z43567 gb:HSC1FC021 normalized infant brain cDN 28 

436187 AK000998 Hs.297221 Homo sapiens cONAFU10136fis, done HE 28 

415652 T79213 Hs.272073 ESTs 28 

^_ 404076 28 

60 409416 AW3B8359 Hs.10667 ESTs 28 

420814 AA721156 Hs.190440 ESTs 28 

426960 AA393713 gbuE(71h04.r1 Soares.testis_NHT Homo sap 28 

419227 BE537383 Hs.8g739 dioilnergic receptor, niootinto. beta po 28 

^_ 448597 BE613250 Hs.98265 KIAA1 877 protein 28 

65 409928 AL137163 H$.57549 hypothetical protein dJ473B4 28 

433077 AA314262 Hs.301917 YDD19prol^ 26 

436720 AW975902 gb:EST388011lUlAGE resequences. liMGN Homo 28 

447410 A1470235 Hs.172698 EST 28 

414652 AI620599 Hs.72068 ESTs 28 

70 430454 AW4G9011 Ha.105S35 ESTs 28 

412417 AA102268 Hs.158622 ESTs 28 

423130 AW897586 H&21213 ESTs 28 

430660 R11884 Hs.100826 ESTs 28 

401098 28 

75 454036 AA374756 Hs.93560 Honwsaptens mRNA for KIAA1771 protein. 28 

403549 28 

414394 AI904738 HsJ6053 DEAiyH(Asp^hl-AI»Asp/Hls)bOxpolypep 28 

412323 AW937143 gb:PM1-OT0041-281299<101401 OT004t Homo 28 

433513 AI566356 Hs.171437 ESTs 28 

80 446677 A1800311 Hs.156291 ESTs 28 

457756 AA126136 HsJ8125 interferon4nduced protein 75. 52kO 28 

450895 N66727 Hs.10957 ESTs 28 

4343S2 AF12950S HsJ6492 small musde prot^ )Uinked 28 
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449358 AA001229 Hs.131438 ESTs 2.8 

422816 AA32358S Hs.93235 ESTs 2^ 

420756 AA41180O HS.1899X ESTs i8 

423532 BE090503 ob:RC6^T0717-1104(XM)11-F11 BT0717Homo 2^ 

5 448870 BE181783 Ks.175358 ESTs. Weakly similar to A47582B<e])6r 2.8 

451206 H66228 Ks.271780 ESTs. Wtokly shnnar to 138022 hypotheti 18 

457314 AA479597 Hs.1936e9 hypothetic^ protein DKFZp586Jt 119 2.8 

458023 AW976161 Hs.268555 S'-S* exoribonxlesse 2 23 

422260 AA31S993 Hs.105484 regenersQng geno type IV Z8 

10 429638 AI916662 Hs^11577 kinecQn 1 (kinestn receptoO 2.8 

408936 AL138043 Ks.293549 ESTs 2.8 

411762 AW850972 gb:QVO-CT0387-180300-157.h07 CT0387 Homo 2.8 

416192 NM.005035 Hs.998 peroxisome prolifeTalivd acCvated recep Z8 

455310 AW693961 gb:RC4>NN0027-06040(M}11-d11 NN0027HomQ Z8 

IS 406992 S82472 gb:bela iwh^DNA polymerase beta {exon a 2.7 

421003 T72080 Hs.95667 F-t)OX protein 30 2-7 

429593 AK000332 Hs.209927 Homo sapiens CDNAFU20325Q8. done HE 2.7 

445611 AW4184g7 Hs.14S583 ESTs 17 

445747 AI820863 H8.14532B ESTs, Weakly simSar to ALU1J1UMAN ALUS Z7 

20 445017 AI205493 Hs.176860 ESTs 2.7 

411726 AW858612 gb:CM3-CT0341-190400.152-h12CT034l Homo ZJ 

451917 AW391351 Hs.50820 Homo sapiens unknown mRNA 2.7 

416805 F13271 Hs.79981 Human done 23550 mRNA sequence Z7 

423020 AA383092 Hs.1608 repBcaiionpfolBiftASJMliD) 2.7 

25 427134 AA398409 Hs.173561 EST 17 

435689 AA694284 gb:zl35c02.s1 SoaresJetalJiverjBpleen. 2.7 

429282 N27596 Hs.21342 ESTs 2.7 

435731 AA699581 H3.I868II ESTs 2.7 

426682 AV660038 K5.2056 UDPglyoDsyttransferaselMly.pdype 17 

30 421274 BE160327 Hs.104572 ESTs 17 
403776 2.7 

409526 BE298751 Hs.55014 hypothetical protein FU 10206 17 

410201 AA126129 gb:2m78c07.r1 Strstagene neuroepithellum 17 

427839 AA608823 Hs.98244 ESTs 17 

35 447884 H29505 gb.7m60d10/1 Soares Infant brain 1NIBH 17 

449396 BE169100 H$.195029 ESTs 17 

422988 AW673847 Hs.97321 ESTs 17 

442772 AW503680 H8.5957 Homo sapiens done 24416 mRNA sequence 17 

434890 AF161345 Hs.283930 Homo sapiens HSPC082 mRNA. partial cds 17 

40 412400 AW948066 gb:RCa.MT0012-290300^1-h10MT0012Homo 2.7 

413998 AW103807 Hs.243933 ESTs 2.7 
403677 2.7 

423401 NhA_001992 Hs.12B087 coaoulaftonfoctorlt(lhrombln}recepto 2.7 

430698 AA492071 gb3ie97b044l Na.CGAP^K]d1 Homo sapiens 2.7 

45 432591 AA643238 H5,146144 ESTs 17 

446800 AI341635 H8.156488 ESTs 2.7 

454938 AW846134 gb:QVIM:T0179^1199^)4M02CT0179Homo 2.7 

456869 BE467912 Hs.154294 discs, large (Drosophila) homdog 1 17 

445233 AV653034 H3.297559 ESTs 17 

50 448755 AI739241 Hs.171480 ESTs 17 

416379 AA218940 Hs.137516 fidgeting 1 17 

435068 H16262 Hs.31415 ESTs 17 
406092 2.7 

422036 AA302647 Hs.271891 ESTs, WeaMy similar to ZN91J1UMAM ZINC 17^ 

55 441541 AA938663 Hs.199828 ESTs 17 

451395 A1082419 Hs.114761 ESTs 17 

455680 BE153208 gb:PM04fr0335O5040a007-F10 HT0335 Homo 17 

459275 AI808913 Hs.339352 Homo sapiens brother Of COO (BOC) mRNA, 17 

423949 AI014546 Hs.130912 ESTs 17 

60 435420 AI928513 Hs.59203 ESTs 17 

439418 A1282149 Hs.56213 ESTs. Highly simaar to FXD3.HUMANF0IW1 17 

454790 AW820852 gb:RC2-5T030M20200^1-f12ST0301 Homo 17 

447453 AWG08645 H$.1880O hypothetical proteb FU20281 17 

454767 BE069199 gb:QV3^T037W)10300-105ij03 BT0379 Homo 17 

65 413252 BEQ74910 gb:RC5^T058O-17030&O21-F12 6T0S80 Homo 17 
402429 17 
403760 17 

433128 AB021923 Hs.23367 EST-YD1 proton 17 

435448 H17132 Hs.27085 ESTs 17 

70 445706 AA30S520 H3.108812 hypotheScaipfOteinFlJ22004 17 

422171 US0529 Hs.1 12434 Novd human gene mapping to diomosome 13 17 

459023 AWg68226 Hs.60790 ESTs 17 

443246 T751S7 K8337603 ESTs, WeaUy similar to T1)6680hypothe6 17 
404569 2.7 

75 410181 AI468210 Hs.261285 plelobopfc regulator 1 {PRLI.ArBbidops 17 

422897 AA6797M Hs.4290 ESTs 17 

427038 NM.014633 Hs.173288 KIAAOISSgeneprodud 17 

449880 AI673006 Hs^1948 ESTs 17 

455992 6E17901S gb:RC3-HT0612-0805004134t10HT0612Homo 17 

80 415268 R53g35 Hs.287827 ESTs. Highly similar to MDR3J1UMANMULTI 17 

446554 AA151730 Hs.301789 nudix (nucleoside dlphosphato linked md 17 

452512 AW363488 Hs.337635 ESTs 17 

440728 AW086077 H8.153272 Homo sapiens cONA:nJ22715 lis, done H 17 
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419481 AI879195 Hs.90606 ISkOasetenoprotein Z7 

454352 AW389668 gb J^C2^016M71 299^1 ^STOI 68 Homo 2.7 

422831 R0K04 Hs.332943 ESTs 11 

413646 BE155042 s^PM04{T0349-10129&<)02-€04 HT0349 Homo 2.7 

5 426872 AA410446 Hs.112011 ESTs. VV^simBar to unknown (Hsapie 2.7 

459160 AI904723 gb:CM^066-1 20299^2 BT086 Homo sapien 2.7 

421338 AA287443 gbuESS2ciar1 Na.CGAP_GCB1 Homosa|)tens 2.7 

446002 A]34648a HS.14S789 ESTs 2.7 

454716 AWB50684 9b:IL3^T0219.160200O63.D12CT0219Homo Z7 

10 406664 134041 Hs.g739 gtyceR)l-3-phosphdta dehydrogenase 1 (so 2.7 

453128 AW026516 Hs.31791 acylphosphatase 2, muscle type Z7 

408591 AW250525 gb:2821626.5prime NIH_MGCJ Homo sairfens 2.7 

454754 AW819191 gb:CM1-STQ2834)712994181^108ST0263Komo 2.7 

439451 AF086270 Hs.278554 heterochroinatMke protein 1 17 

15 445225 A!216555 Hs.202338 ESTs Z7 

427175 K06924 Hs.23782 liypoihelicd protein RJ 12847 2.7 

411816 AW884609 gb:PM3^0017-24O30O^1-liO3 SN0017 Homo Z7 

438135 AI253025 Hs.190426 ESTs 2.7 

405981 Z7 

20 406005 2.7 

430762 AI343652 Hs.105667 ESTs Z7 

438361 AA805666 Hs.146217 Homo sapiens cDNA:aJ23077fis. done L Z7 

412105 H07971 Hs.g4319 VPS10 domain receptor protein 2.7 

434684 AA737282 Hs.190911 ESTs Z7 

25 445660 AI702668 Hs.201955 ESTs 2.7 

400844 2.6 

415725 BE219771 Hs.237146 liypothetical protein RJ12752 2.6 

420159 AI572490 Hs.99785 Homo sapiens cONA: FU21245fis.doneC 2.6 

408812 BE397160 Hs.254763 ESTs.WeaWy5lt«larto A42442lnlegiin Z6 

30 430052 AF102850 Hs.227933 Alg5. S. cerevlslae, homoiog of 2.6 

440310 AA878939 Hs.12S406 ESTs 2.6 

425859 Nmm Hs.158836 liypothetical protein nJ20583 Z6 

417252 AA195014 Hs.85971 ESTs ZB 

427167 AI239607 Hs.99196 tiypolfielica) protein MGC1 1324 2.6 

35 431613 AA018S15 H$.264482 Homo sapiens mRNA; cDNA OKFZp761A0411 (f 2.6 

414546 BE379492 gb:60123621SF1 NIH^M6C_44 Homo sapiens c 2.6 

407494 U10072 gb:Hum3nlb(1(headf3rroly(Ai^1)mRHA.pa 2.6 

429643 AM55889 Hs.167279 FYVG^er-COntdiningiW effector pro 2.6 

442240 AI791883 Hs.292719 ESTs 2.6 

40 452821 AW471181 Hs.160874 ESTs 2.6 

410238 N94320 Hs.144225 ESTs 2.6 

419236 AA330447 Hs.135159 Homo sapiens cONARJ1 1481 fis, done HE Z6 

440801 AA906366 Hs.ig053S ESTs 2.6 

440274 R24595 H8.7122 scrapie responsive protein 1 2.6 

45 411597 AW852925 gb:PM0-CT0248-1310994)01-fl0CT0248Homo 2.6 

417956 AA210704 Hs.ig0465 ESTs Z6 

420621 AA27880B gb3s79c09.r1 Na_CGAP_GC81 Homosapens 2,6 

425176 AW015644 Hs.155005 TEA domain famiiy member 1 (SV40transcr Z6 

_^ 425492 AL021918 Hs.158174 zinc finger protein 184 O^roppeWke) 2.6 

50 425698 NM16112 Hs.159241 polycystic kidney disease 2-til(e1 Z6 

426098 NM.014905 Hs.166351 KIAAl 072 protein 2.6 

435113 AA665469 Hs.117136 ESTs 2.6 

438188 AA77997S Hs.1288S9 ESTs 16 

445550 AI242754 Hs.137305 ESTs Z6 

55 458804 AL157625 gb:Dl^p761L2016„r1 761 (synonym: liamy2) 2.6 

448299 AA497044 Hs.20887 iiypotheScai protein FU1 0392 Z6 

436407 T88803 Hs.271507 ESTs, Wealdy similar to TllyLHUl\^ PROBAB 2.6 

425195 AA3S2D26 Hs.94319 VPS10 domain receptor protein Z6 

418282 AA215535 Hs.98133 ESTs U 

60 442757 Ar739528 Hs.2S345 ESTs 2.6 

413470 N20934 gb.7x54c11.8l Soarosmelanacyte2NbHMHo 2.6 

428527 AI902398 Ks.34492 Cyt19 protein 2.6 

441209 AA922939 Ks.135742 ESTs 16 

458679 AW975460 H8.143563 ESTs 2.6 

65 442279 AW667006 Hs.159970 ESTs 2.6 

407244 M10014 Hs.75431 fibrinogen, gamma poiypepQde 2.6 

411880 AW672477 gb:iiro30f03j(1 Na_CGAPjniy4 Homo sapiens 2.6 
404845 2.6 

„^ 411693 AW857271 gb:CM0<rr0307-210100-158^CT0307Homo 2.6 

70 438298 H23542 Hs.181788 ESTs 16 

444517 AI939339 Hs.146883 ESTs 2.6 

455870 AW452831 Hs.313803 ESTs. Highly similar to AF157833 1 nond 2.6 

457630 AI680803 Hs.112627 ESTs 2.6 

424015 N95698 Hs.166361 Homo sapiens mRNA; cDNADKFZp564F1 12 (fr Z6 

75 417563 AA203701 gb2x52a10.r1 SoaresJetOo-spteen. 2.6 

413174 AA723564 Hs.191343 ESTs Z6 

438875 AA827640 Ks.189059 ESTs 16 

440700 AW952281 Hs.296184 guanine nudeotide binding protein (6 pr 2.6 

423257 AW161039 H3.125878 synapsinlll 2.6 

80 431086 AI829692 K5.211561 ESTs 16 

409337 H71289 Hs.220535 ESTs 2.6 

442818 AK001741 Ks.8739 hypoUefical protein FU10879 16 

410004 AI298027 Hs.5057 carboxypeptidase 0 16 
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45S935 BE158887 gb:CM0-HT0395>28010O-169^KT039SHQmo 16 

403273 16 

445955 AA332209 H5.158196 transcriptionai adaptor 3 (A0A3. yeast h 16 

42S626 AIS37536 H5.173519 ESTs Z6 

S 451531 AA018311 Hs.114762 ESTs Z6 

428085 AA421D81 Ks.12388 ESTs 2.6 

420761 AI276780 Hs.135173 ESTs Z6 

4379SB BE1395S0 Hs.121668 ESTs. Moderately Similar to PC42S9feni Z6 

442666 W74633 Hs.303720 ESTs 2.6 

10 413088 BE064962 gb:RC1-ST0313-13040(V016<02BT0313HofflO Z6 

419107 AW085152 Hs.292987 ESTs Z6 

435766 R11673 Hs.186498 ESTs 2.6 

452879 AW90S328 HS.180&42 ribosomal protein LI 3 16 

440400 AA994364 Ks.125594 ESTs. WteaMy simitar to T25472 hypothec 2.6 

IS 440450 H92571 Hs.234478 Homo 8a(4ens cDNA: FU22648 lis. done H 2.6 

424146 AA705092 Hs,202368 ESTs 2.8 

433950 AW937417 Hs.293561 ESTs 16 

410366 AI267569 Hs.302689 itypothelical protdn 16 

417485 AA203304 H8.32826 CGI-130 protein 16 

20 412566 AW962574 gb:EST374647 MAGE resequences. MA6G Homo 16 

416498 U33632 Hs.79351 potassium channel, subfamily K. member 1 16 

440397 AA884448 Hs.157239 ESTs 16 

451236 Af767408 Ks.207026 ESTs, WeaUy similar to B56205ttansci1p 16 

411819 AW947884 gb:PM1-MTa)10.20030a001^08 1^0010 Homo 16 

25 430357 AW976789 Hs.165607 ESTs 16 

432869 AW974094 ob:EST3861 97 MAGE resequences. MAGM Homo 16 

427506 AK000134 Hs.179100 hypolhefical protein FU20127 16 

401614 16 

404531 Z25884 Ks.121463 chloilde channel 1 .sl(eletalfflusde(ni 16 

30 426698 AA394104 Hs.97489 ESTs 16 

440479 AA886461 H8.208161 ESTs 16 

443160 AI46791S Hs.38053 ESTs 16 

419323 A1092379 Hs.135275 ESTs 15 

442813 AI018435 Hs.270970 ESTs 15 

35 436196 AK001084 Hs.333498 Homo sapiens cDNAFU 102226$. done HE IS 

433561 BE540937 Hs.20104 hypothetical protein FU000S2 15 

434059 AA649162 Hs.2364S6 ESTs 15 

454836 AW833711 0b:QV4-TTOOO8-251 1 99^)43^11 TT0008 Homo 15 

456589 AV654623 Hs.288141 hypothetical protein MGC31 56 15 

40 459716 15 

436340 R42246 Hs.21606 ESTs 2.5 

428020 L19058 Hs.181581 glutamate receptor, ionotropic. kainate IS 

416951 AA190926 Hs.190785 ESTs. Moderately similar to S65557 elpha 15 

401078 15 

45 410644 AW902125 gb:QV0-NN1022-120500-22ah12NN 1022 Homo 15 

411660 AW855718 gb:RC1-CT027W)7010(W)21-806 CT0Z79 Homo 15 

425201 AA352111 gb:EST60051 Adlvaled T^elis XX Homo 83 15 

455252 AW876627 gb:RC3^0028-120200^13<)11 PT0Q28Homo 15 

439096 AA830185 Hs.269580 ESTs 15 

50 442627 AI027990 Hs.132303 ESTs 15 

457799 AF220188 Hs.236510 uncharacterbed hypothdamus protein HTM 15 

428799 AJ478619 Hs.104677 ESTs 15 

450402 BE218027 Hs.89969 ESTs 15 

411156 AW819939 Hs.273529 ESTs 15 

55 431673 AWg71302 Hs.293233 ESTs 15 

415706 BE182587 Hs.57485 ESTs 15 

412882 BE006919 Hs.134106 ESTs 15 

441300 R3S063 Hs.181536 ESTs 15 

413257 BE075035 gbd>M3-BT0S64'26030(M}02{05 GT0584 Homo 15 

60 434662 AA641957 gb3is18d08.r1 NCI.CGAP.GCB1 Homo sapiens 15 

455255 AWB77139 gb:QV2-PT0010.160400.133^1 PTOOIOHomo 15 

417137 U46265 Hs.81281 mitodwndrial ribosomal protein S21 15 

417909 R35614 0b:yo66eO8.n SoaresinbntbrdnlNIBH 15 

^. 458043 AW979009 Hs.326108 ESTs 15 

05 417008 AW673606 Hs.60758 aspartyHRNA synthetase 15 

442006 AW975183 H8.292663 ESTs. Weakly stmBar to S72482 hypothec 15 

455756 BE079307 . gb:RC1'BT0623-120200«11-g09BT0623Homo 15 

454032 W31790 Hs.194293 ESTs. Vteddysiiriiiar to 154374 gene NF2 15 

444963 A1916973 H5.213803 ESTs IS 

70 443526 AW792804 Hs.134002 ESTs 16 

454532 AA344685 Hs.58831 regulator of Fas4ndxedapoptosis 15 

428832 AA578229 Hs.324239 ESTs. Moderately similar to ZN9LHUMANZ 15 

442003 AW297497 Hs.201891 ESTs 15 

452768 AW069459 Hs.61S39 ESTs 15 

75 411355 AW838^79 Hs.22692 ESTs 15 

458890 AW865523 gbimSN00204)104004»94)05SN002DHomo 15 

400074 IS 

405241 15 

„^ 413095 BE065209 gb:RC1-HT0314^103004)15*12BT0314Homo 15 

80 414349 BE512968 gb:601172296F1 NIH^MGC J 5 Homo sapiens c IS 

422884 AW860g75 Hs.13256 ESTs 15 

429515 AL031228 Hs.204370 DMA segment on chromosome 6 (unique, pse 15 

431925 AKD00B90 gb:HomO6apienscONARJ10028fis.done 15 
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442653 BE269247 gb.-6011B5486F1 NIH.MGC J Homo sapiens cD t$ 

401B82 ZS 

458257 U48351 Hs.2012ig ESTs. W^aMy simSarto S 18946 ultra hig 2.5 

405336 2.5 

S 439492 AF0B6310 H3.103159 E$Ts Z5 

459390 BE385725 ob:6D1?76347F1 N]H.MGCJ!OHofnosapieiiSC 2.5 

436359 Z83806 gb:H^apl8r»mRNA(br8D(oneina)dyiielnhe 2.5 

429322 086984 Hs.199243 KIAA023t protein ^5 

431699 N»t001173 Hs.267831 RhoGTPaseactivatIno protein 5 2.5 

10 437107 AA74S598 KS.291B40 ESTs. Weakly Mbr to 178885 serineAh 2.5 

441953 H11695 Hs^229D1 disrupterof silendng 10 2.5 

442777 AW341541 Hs.271153 ESTs ZS 

453361 AA035197 Hs.107375 ESTs 2.5 

455275 AW977806 gb:ESr389810 MAGE resequences. MAGO Homo 2.5 

IS 457824 R84938 gb:ytS5f04 j1 Soares reOna N2b4HR Homo Z5 

428550 AW297880 Hs.98661 ESTs 25 

445900 AFQ70526 Hs.13429 Homo sapiens done 24787 mRHA sequence 2.5 

456359 A1967991 Hs.93574 hoffleoboxD3 Z5 

414386 BES4gi43 gb:601076456F1 NIH_MGC_12 Homo sapiens c 2.5 

20 452528 AA742457 H5.291479 ESTs 2.5 

408444 AW661B39 H3.253204 ESTs Z5 

440327 R12581 Hs,191146 ESTs tS 

410406 A)969703 Hs.1466 glycerol kinase Z5 

457021 AW968934 Hs.173108 Homo sapiens cDNA: FU21897fe, clone H 2.5 

25 41B948 AI217097 gb:qd43h07.x1 Soares_felaLh8aftNbHH19W Z5 

435427 AA682573 Hs.188982 ESTs, Weakly similar to organic anion tr 15 

427791 AA412446 Hs.g8138 ESTs 15 

403509 AF231919 HS.187S9 KIAA0539 gene product 2.5 

436590 AI393115 Hs.127655 ESTs Z5 

30 455556 AW995423 gb:QVO-BN0042-010400-183^08BN0042Homo Z5 

405889 Z5 

408274 R17315 gb:yg12g11.r1 Scares Infant bisin 1 NIB H Z5 

448015 AI458065 Hs.23196 ESTs 15 

454190 AW177821 gb:IL3.HT0059.18089^007-C05HT0059Honio Z5 

35 436154 AA764950 Hs.1 19898 ESTs 2.5 

406377 2.5 

437030 AA742577 Hs.303781 EST 2.5 

420815 AA280684 Hs.270584 ESTs 2.5 

418421 R5B820 Hs.B5050 piiosphdamban 2.5 

40 423638 AI003521 Hs.130310 Homo sapiens mRNA for cyciinB3iso{bmi 2.5 

415425 F08365 gb:HSCZSA1 21 normalized infant brdincDN ZS 

404577 15 
403568 15 

425967 N(A.007159 Hs.4007 Sarcotemmai^sociated protein ZS 

45 449899 AI610700 Hs.103280 ESTs Z5 

451078 AI927694 HsjKM470 ESTs Z5 

453343 AA905353 Hs.121622 ESTs Z5 

428728 NM_016625 Hs.191381 hypolhetkal protein Z5 

409642 AW450809 Hs.257347 ESTs Z5 

50 426235 At631964 Hs.34447 ESTs 2.5 

452043 H86231 gb:yt03fOZr1 Scares N2b5HR Homo 2.5 

401992 Z5 

419910 AA662913 Hs.190173 ESTs. Weakly slmSar to A46010X.Hnked Z5 

411036 AA857218 Hs.2970O7 membrane-bound transcription factor prol Z5 

55 444575 AI264847 Hs.22545 Homo sapiens cONA FU12935 fls, clone NT Z5 
449311 AI657014 gb:ll49a1Zx1 NCLCGAP„GC6 Homo sapiens Z6 

454566 AW807605 gb:MR4-ST0098-1 201 00^1-606 ST0098Honto 2.5 

454597 AW809648 gb:MR4^T01 24-261 099^1 5^1 ST01 24 Homo Z5 

413875 BE176776 gb:RC3-KT0586-110300^l1-g09HT0S85Homo Z4 

60 421583 AA293333 gb2l53cOar1 Scares ovary tumor NbHOT H Z4 

426237 AK001i04 H3.168241 tiypolhelica) protein FU 10242 Z4 

454437 AI248173 Hs.191460 hypotheScal protein MGa 2936 Z4 

419187 AA234852 Hs.44693 ESTs Z4 

^. 444493 R59410 Hs.2B2094 ESTs, Moderetely similar to 138022 hypot Z4 
65 405547 Z4 

454086 AW885909 Hs.6975 PR01 073 protein Z4 

417508 BE163512 H3.180877 H3 hislone, family 38 (H3.3B) Z4 

416277 W78765 Hs.180145 HSP0030 protein Z4 

420976 AI924940 Hs.108082 ESTs, Weakly slmBar to T31638hypothe8 Z4 
70 406468 Z4 

408617 R61736 Hs.124128 ESTs Z4 

418994 AA296S20 Hs.89546 setecGn E (endoOteiial adhesion molecul 14 
445432 AV653771 gbAV6S3771 GLC Homo sapiens cONActona 14 

_^ 454137 AW500340 Hs.313876 ESTs. Wbakly similar to 138022 hypotheO 14 
75 45532S AW896438 gb:PMl>NN0047-0404004)01-d09 NN0047 Homo 14 

409500 LI08098 Hs^576 sultotransferase. estrogen-preferring 14 
434138 AA625804 gb2u86h01.s1 Soares.testis.NHT Homo sap 14 

419511 AA4297S0 Hs.75t13 general IranscripSon factor UlA 14 

437980 R50393 HS.27843& KIAA1474 protein 14 

80 439999 AA115811 Hs.6838 rashomolog gene bmOy, member E 14 
403501 14 

446845 A1343645 Hs.156108 ESTs 14 
40in5 14 
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410845 AW8071B2 gtKMR4^T0062-18020(M)01-b04 8T0062 Homo 14 

411836 AW901879 Hs.314453 ESTs 2.4 

412879 BE092219 gb:Il2-BT0734-240400^71-B04 BT0734 Homo 2.4 

421083 AA283628 Hs.298016 ESTs. Weakly similar to (38022 hypoM 2.4 

5 423513 AF035g60 Hs.129719 transglutaminases ZA 

428882 AA436915 Hs.13174d ESTs, Moderately similar to ALU7.HUMAN A tA 

428945 AW192803 Hs.98974 ESTs, Similar toS65824 reverse t 14 

434627 AI221894 Hs.39311 ESTs 2.4 

435256 AF193766 Hs.13872 cytokine^ protein CI 7 14 

10 435079 AA664192 gb»05b01s1 StiBlagene lung (937210) H 14 

458239 BE439d77 Hs.283389 ESTs 14 

414093 BE544867 Hs.283077 centrosomal P4.1-assodated protein; unc 14 

441262 AI809130 HS.1769D6 ESTs 14 

402076 2.4 

IS 427962 AA946582 Hs.8700 deleted In Over cancer 1 14 

400587 14 

403053 1^24 Hs.2186 eii(aiyo6ctra»la8one)ongaIionU)r 14 

411203 AW872430 Hs.273743 ESTs 14 

447849 A1538147 Hs.164277 ESTs 2.4 

20 454201 ABD23191 K$.44131 KiAA0974 protein 14 

424131 AA335714 H5.199665 ESTs 14 

425921 Nh^007231 H5.1&2211 solute canter family 6 (neurotransmitle 14 

440385 AA8842B3 K$.192136 ESTs 14 

417976 BE565892 Hs.83077 (nletleuUn 18(lnlerfe(Dn-gamn»inducin 14 

25 447179 AW015633 Hs.157299 ESTs 2.4 

412977 AA125910 Hs.191461 ESTs 2.4 

436958 AA740322 Hs.293539 Hbmo sapiens mRNA for KIAA1758 protein, 14 

401361 2.4 

403891 2.4 

30 408419 AW2S0092 H5.305953 zincfmger protein 63 (HPF1) 14 

417002 T79613 Hs.14613 ESTs 14 

439446 AI927629 Hs.57873 ESTs 14 

458570 AW971698 Hs.12627 TJ6 protein 2.4 

458624 At362790 Hs.278839 KIAA1 684 protein; likely homoiogormous 14 

35 459344 AW499533 Hs.257976 ESTs 14 

413488 8E14401Z ob:MRO^HT0165.191199-004^HT0165Homo 14 

412114 AW893891 Hs.240833 ESTs. Weddy Mar to 138022 hypotheH 14 

423296 AW957193 Hs.3327 Homo sapiens cDNA: FU22219 lis. done H 14 

419983 W55956 Hs.9403O Homo sapiens mRNA;cDNAOKI^24)586E1 624 (f 14 

40 428268 AA424957 Hs.294132 ESTs 14 

450947 A1745400 Hs.204662 ESTs 14 

423073 6E252922 Hs.123119 MAD (mothers against decapentaplegicDr 14 

438142 T90309 Hs.269651 ESTs 14 

409239 AA740875 Hs.44307 ESTs. Moderately similar to 138022 hypot 14 

45 424235 NM_003181 Hs.143507 T kiraciiyury (mouse) homoiog 14 

429063 AW363845 Hs.122142 ESTs. Weakly simQar to A46010X-i]iiked 2.4 

433868 AA612960 Hs.337300 ESTs 14 
401645 2.4 

^_ 432149 AW614326 Hs.157012 ESTs. WbaHy similar to T34549pfot)at)le 14 

50 453393 AW956392 H5.110376 ESTs 2.4 

436054 AI076262 Hs.119813 ESTs 2.4 

425433 AA357471 gb:EST66274 LNCAP cells 1 Homo sapiens c 14 

417712 AA205569 Hs.194193 ESTs. Moderately similar to ALU1_HUMAN A 14 

420639 Ai683116 Hs.25328 ESTs, Moderately simiiar to ALU7.HUMAN A 14 

55 453369 BES5155D Hs.232630 ESTs 2.4 
405017 2.4 
405385 2.4 

435633 AI2481S2 Hs.270047 ESTs 2.4 

457126 AI932995 Ks.183475 Homo sapiens done 25061 mRNA sequence 14 

60 430535 AW968485 gb:EST380561 MAGE resequences, MAGJ Homo 14 

434544 C05875 Hs.91575 ESTs 14 

449432 AW451361 H5.196529 ESTs 2.4 

455219 AW879403 gtrPM0OT0019-1 50300^02^01 OT0019 Homo 14 

458734 AI554946 Hs.158794 ESTs i4 

65 442179 AA963842 Hs.333555 chromosome 2 open reading frame 2 14 

444313 A1140494 H8.197955 K1AA0704 proldn 14 

440448 AA885428 Hs.12S646 ESTs 2.4 

441498 AI379248 HS.5B742 ESTs Z4 

438205 AA780365 Ks.122161 ESTs 2.4 
70 402615 2.4 

425707 AFl 15402 H5.11713 E74^ke factor 5 (ets domain transcript 14 

422306 BE044325 Ks.227280 U68nRNAnassodatedSn>^ protein 14 

413697 AA131315 Hs.47144 01Ci^86)«)819 protein 14 

421755 AW169454 Hs.207422 ESTs, Weakly similar to S71949metaiR)pr 14 

75 449007 A1620433 Hs.193201 EST, WbaklysimflartoNIP2_HUMANBCt^A 14 

449916 T60525 H3.299221 pymvate dehydrogenase kinase, isoenzyme 14 

418857 D10216 Hs.89394 POU domain, dassl. transcription facto 14 

422466 BES14492 Hs.1 17487 gene near HD on 4p16.3 with homology to 14 

458914 BE327696 H3.280922 ESTs 14 

80 435061 AI651474 Hs.l63944 ESTs 14 

416458 AA160511 gb2p53f03.rl Stratagene NT2 neuronal pr 14 

453785 Ai368236 Hs.283732 ESTs, Moderately simSar to ALUI.HUMAN A 14 

421515 Y11339 Hs.105352 GalMAc d^l e^lalyltransferase 1. 1 14 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



403003 
405347 
406091 
428402 
438762 
455780 
457024 
404249 
443921 
407055 
417154 
419720 
405230 
405935 
436998 
445748 
419233 
414277 
452092 
453738 
410888 
434239 
434098 
414195 
445688 
451655 
423956 
413445 
436149 
405629 
432702 
433377 
444711 
445621 
456432 
449236 
459024 
441037 
431577 
436762 
412329 
410999 
429044 
431655 
439642 
441721 
443482 
403416 
416443 
419714 
415511 
412344 
449264 
451664 
441269 
402333 
453649 
430680 
404367 
403696 
441622 
411004 
411093 
428548 
404059 
446861 
413640 
423554 
435338 
442710 
444206 
451250 
454784 
458455 
458521 
407938 
439546 
441274 
454314 
409660 
428532 
411384 
453687 



AW237531 
AWB44412 
BE088828 
AA397546 

AI091310 
XB9211 
AI674701 
AA249131 



AA74562S 

U80766 

AA458673 

BE269910 

BE245374 

AL1 18674 

AW861207 

AF119910 

AA625499 

8E263293 

At248205 

BE327088 

W28203 

BE141022 

AI754308 

AW973953 

AI752713 

AI188739 

AI733818 

AWg8693l 

AJ403126 

AAQ20799 

AA913360 

T34523 

AAB26380 

AW937445 

AW813004 

A)261490 

AW971119 

W81441 

A12882S9 

AW188093 

AI744626 

N69469 

AA758751 

AI732617 

AW938384 

AI637649 

AA889081 

AW015206 

Y07494 
AW138724 



AW4S0957 
AW813242 
BE067650 
AA4300S6 

A1686519 

BEisaiis 

M90516 

AA678071 

A1015631 

AW301017 

AA491275 

AW820626 

AV646310 

AI651039 

AA905097 

AF088056 

AW593781 

AW364844 

AW452065 

AF1S7328 

AW842115 

T55674 



Hs J26876 Homo sapiens S0X6 mRMA, compteta cds 
Hs.65450 reticuton4 

Oi):CM2^T0693>230300-129^ BT0693 Homo 
Hs.1igi51 ESTs 

Hs.134848 ESTs 

gb:Rsapiens DMA lor endogenous letrovir 
Hs.21388 ESTs 

Hs.337776 liypolhetical protein FU1 1068 



Hs^l414 ESTs, WeaHy simflar to AIU8.HUMAN ALU S 
Hs.13252 Human EST done 22453 mariner transposon 
Hs.178306 ESTs 

gb:601186291F1 Nia.MGC.6 Homo sapiens cD 
Hs.27842 (^UieOcai protein ajl 1210 
Hs.34871 zinc finger homeobox 1 B 

gb:RC1^T0302-12020O^13^J04 CT0302 Homo 
Hs.283047 hypottielica) protein PR02964 

^:af69g08.r1 Soares_NhHMPu_S1 Homosapi 
Hs.89805 cholinergic receptor, nicotinic dphap 
Hs.153244 ESTs 
H$.212752 ESTs 

Hs.138169 Homo sapiens done 25215 mRNA sequence, 

gb:MRO-HT0067.201099-002-d10 HT0067 Homo 
Hs.159452 ESTs 

H5.293744 ESTs 

Hs.43845 ESTs 

Hs.148488 ESTs 

Hs.145549 ESTs 

Hs.179662 mideosome assembly protein 1-like 1 

Hs.26373 Homo sapiens cDNA: FU23449 lis. done H 

Hs.26^69 ptasminogen-IQn 

Hs.126468 ESTs 

H5^2040 Homo sapiens DMA sequence from PAC 43401 

Hs.126733 ESTs 

gb:QV3'DT0043-09020&O8O«09 OT0043 Homo 
gb:RC^T0186'230300^9^02 ST0186 Homo 

Hs.145527 ESTs 

gb:ESr383206 MAGE resequences. MAGL Homo 

H5.153967 ESTs 

Hs.127652 ESTs 

Hs^50385 ESTs 

Hs.151385 KIAA0564 protein 

Hs.194225 ESTs 

Hs.98216 ESTs 

Hs.182382 ESTs 

H5.2641 90 vacuolar protein sorting 35 (yeast liomd 

Hs.196105 ESTs 

Hs,153952 5'nudedldase(C073) 

Hs.178784 ESTs 

Hs.34114 ATPase. Na^i- transporllno. alpii82H 

Hs.168974 ESTs. Hig}dy8lmDartoALU7J{UIMAN ALUS 



Hs.224864 ESTs 

gb:MR3^T0191420200.207^10ST0191 Homo 
0b:MR44TD3SU)90300me01 BT03S8 Homo 

Hs^98649 EST 

HS.144Z7 Homo88plBnscONA:FU2ieoOfis.doneH 

gb:MR2.Hr0378.24Q200-2054K)9 HT037B Homo 
Hs.1674 giutamine-fmdose-OfhospliatetrBnsamin 
Hs.194300 EST3.Wealdy8linlarlol38022hypo&iefi 
Hs.23210 ESTs 
Hs.146492 ESTs 

Ks.236940 tiypotheOcal protein FU1 2542 

gb:RCaSTO2g9-igO10O412-«1O STQ299 Homo 
H3.2134B8 ESTs 
Hs.146559 ESTs 
Hs.85050 ptwspltolanfiban 

gbiKomo sapiens fuO (angth insert cOHA 
Hs.131357 ESTs 

gb:QV3-OT0044.2212994)45H«3 DT0044 Homo 
HsJ58905 ESTs 
Hs.184786 TBP-interacting protein 

gb:RCO^002&^mXM)31^11 CN0026 Homo 
Hs^lOB hemoQliobin, gamma G 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
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65 
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410140 
422443 
409071 
421253 
441398 
446458 
4S72a 
443718 
445568 
400582 
411262 
401145 
407440 
455121 
459077 
448117 
453331 
443751 
402038 
402176 
456605 
432479 
402527 
449272 
411024 
455608 
458618 
419875 
405521 
436517 
456801 
430444 
456208 
430388 
416509 
419337 
419699 
454456 
454633 
457028 
458925 
428336 
430850 
408622 
421227 
426902 
430789 
447475 
452148 
430712 
458103 
420959 
444096 
445641 
449276 
452294 
457653 
459497 
412852 
437539 
421813 
411994 
443476 
452463 
404936 
442833 
440836 
405120 
400238 
407809 
412303 
420478 
441417 
445117 
431162 
437036 
455849 
447624 
439780 
405706 
44n32 
440625 
404257 



AL134435 

NM_014707 

AW316932 

AI188102 

AA932398 

AW614367 

AW820035 

AI083580 

H00918 

AW834480 

AF227135 

BE156459 

N20370 

H49129 

AI240665 

AI285639 



AI827766 


Hs.259044 


AL042844 


Hs!275675 


AW137656 


Hs.197645 


BE082S90 




BE011437 




AI523857 


Hs^2257 


AA853410 


Hs.93557 


BE080932 


Hs.l 35225 


AWg61886 


Hs.136263 


AW296421 


Hs. 121 035 


AW299698 


Hs.334625 


AA356923 


Hs^40770 


N57713 


Hs^0899 


AW291112 


Hs^9978 


AA246998 


Hs.173044 


AW850984 




AW811380 




AW449838 


HS.97S62 


R15891 


Hs^1567 


AA503115 


Hs.183752 


BE144152 




AA056060 


H5J02577 


R78581 


Hs;!66308 


A1125334 


Hs^7408 


AA632577 


HSJ310235 


AI380797 


Hs.158992 


AFQ07143 


Hsi820S 


AW044647 


Hs.196284 


AW780192 


Hs,267596 


AA282119 


Hs.88975 


AV647969 


Hs.109694 


AI245987 


Hs.149442 


AW241510 


Hs.252713 


AI871925 


Hs.1 17895 


AI820719 


Hs.154662 


AA825742 


Hs.87517 


BE004117 


HSJ7415 


AA974673 


Hs.121419 


BE0482SS 




R57298 


H3.109087 


AW068594 


K5.133878 


R364S2 


Hs^17 


AA3281S3 


H3J8201 


AW370882 


Hsi22080 



AW082279 

AWa36338 

AA521259 

AI733297 

A)208754 

AW971180 

AI571514 

BE146666 

AI640326 

AL109688 

AI7S6398 



Hs.22269 neurexln3 

Hs.116753 hislonedeace1ylase7B 

Hs.18ig82 ESTs 

Hs.31028 ESTs 

Hs.292036 ESTs. Weakly ^ar to 834087 hypolhefl 

Hs.171054 ESTs 

Hs.278679 a disintsgHn and metalloproteinase doma 

Hs.221373 ESTs 

Hs^8744 KIAA1796 protein 

fib:MR2-TT0014.15119WHl-b07n0014 Homo 

gb:Homo sapiens candidate taste receptor 
fib:QV04fr03884)401004)82-106 HT0368 Homo 

Hs.235883 ESTs 

Hs.172g82 ESTs 

H8.8895 ESTs 

Hs.153324 EST 



ESTs 

telanin p80 (WD40<Ofil8lning) sutwnft B 
ESTs 

gb:QV1-BTO26O.281099^3-f05 BT0250 Homo 
Qb:CM4-BN0220-080500-17a-l03 BN0220 Homo 
ESTs 

pmenkeph^ 
ESTs 

Homo sapiens clone 24528 wRHA sequence 
ESTs 

Homo sapiens cONA F1J14890 fis. done PL 
nuclear cap bindino protein subunit 2, 2 
ESTs. Moderately similar to ZN91.HUMANZ 

ESTs 

ESTs. Weaidy similar to 138022 i^ypothefi 
9b:tl^CT0220>1S0200^6e-H08 CT0220 Homo 
9b:it^ST0143-290999^194305 ST0143 Homo 
ESTs 

Human (clone CT&A4) mRNA sequence 
microserninoprotsin, bela- 
gb:M{^HT01654}602004X)6«02 HT0165 Homo 
Homo sapiens cOI^ aj12166 lis. done MA 
mosaic serine protease 
ESTs 

ESTs. Weakly simiiar to 178885 serineWi 
ESTs 

Homo sapiens done 23738 mRNA sequence 

ESTs 

ESTs 

ESTs 

K2AA1451 protein 

ESTs 

ESTs 

ESTs. Moderately 8lmllartoA475B2&cei 
[>iaJ (Hsp4Q) homotog, subfamily A. metnbe 
ESTs 

ESTs. Weakly similar to AliJLHUMANAIiJ S 
ESTs 

gb:U49b05.y1 NCLCGAP.Bffl52Homo8apien 
Homo sapiens cDNA: FU22845 fis. dona K 
ESTs. Weakly similar to YC01J1UMAN HYPOT 
ESTs 

ESTs. WeaMy sImDarto A Chain^ Cryst 



Hs.244108 ESTs 

0b:QV4-DT0021.281299«7O«11 DT0021 Homo 
HS.19379S ESTs 
Hs.144474 ESTs 
Hs.147369 ESTs 

8b:EST383268 MAGE resequences. MA6L Homo 
Hs.133022 ESTs 

Bb:QV4-HT0222-21109W)14.(06 HT0222 Homo 
HSJ62713 ESTs 

gbtHomo sapiens mRNA ftiO length insert 

Hs.161318 ESTs 

Hs^452 Homo sapiens mRNA for KIAA1737 protein. 



13 
Z3 
23 
2.3 
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13 
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13 
13 
13 
13 
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437722 AW292947 Hs.122872 ESTs. WeaMyslmilartoJU0033liypo(h8Q Z2 

449133 Aie31$55 Hs.197919 ESTs Z2 

456555 AW592157 Hs.293299 ESTs 12 

408134 AK000184 Hs.42945 achisphinoamyeinase^phosphodlesle Z2 

5 428192 AA424051 Hs.304742 ESTs 2.2 

435834 T82384 gb7cl4f05.r1 Stratagene lung (937210) H Z2 

43601B AK001160 H55999 hypothetical proleinFU10298 Z2 

446096 AI276454 gt):ql71a1Zx1 Soares.NhHMPu.S1 Homosapi 2.2 

,^ 448108 AI800470 Hs.171941 ESTs Z2 

10 450232 BE300815 Hs.201326 ESTs Z2 

436134 AKD0061B Hs.123764 ESTs 2.2 

448468 AlS22t09 Hs.17106S ESTs Z2 

420678 AW593268 Hs.3530 U&assodated serine^inine protein 2 2.2 

430692 XB0240 gb:H^*ens endogenous retrovirus HERV- Z2 

15 446453 AV6S8469 Ks.188646 ESTs, Weakly sbnlar to ALUIJiUMAN ALUS 2.2 

423811 AB011163 Hs.12g308 KIAA0591 protein 2.2 

444050 AW136295 Ks.135024 ESTs 2.2 

431532 AIS37817 Hs.270311 EST8,Wtakly similar to ALU1J1UMAN ALUS 2.2 

422669 m2402 Hs.119122 ritnsomai pioteln L13a 2.2 

20 403388 2.2 

403780 2.2 

419423 D26488 H8.90315 K1AA0007 protein 2.2 

424719 H90452 gtiTK)1c03j1 Soaresfetdiiver spleen Z2 

431453 AW753917 gb:RC0.CT0299.291199^1.R)2CT0299Homo 2.2 

25 442078 AW268583 Hs.262629 ESTs Z2 

452975 M85521 Hs.244482 Komo sapiens, done iMAGE:361171 9, mRNA. Z2 

426197 AA004410 Hs.100009 acyi-Coenzyme A oxidase l.palmitoyi 2.2 

427119 AW880562 Hs.114574 ESTs 2.2 

400486 2.2 

30 448482 AW294078 Hs.171092 ESTs 2.2 

402621 2.2 

408363 NM_003369 Hs.44396 coronin, 8cUn4)inding pfotdn, 2A 2.2 

424584 H10692 H5.13310 ESTs 2.2 

445061 AI253094 Hs.145227 ESTs 2.2 

35 431065 AA491286 Hs.12B792 ESTs 2.2 

411908 L27943 Hs.72924 cyttdlne deaminase 12 

441826 AW503603 Hs.12991S phosphotriesterase related Z2 

446901 AI347274 gt>:tcO5d02j(1 Na_CGAP.Co16 Homo saf^ens 12 

422677 ALD46388 H5.208206 hypothetical protein RJ21 162 12 

40 455534 AW991925 gb:PM3-BN0011-130100^2-b07BN0011 Homo 12 

400163 12 

418882 NM.004996 Hs.89433 ATP^ilnding cassette. siit>.fanuiyC(CRR 12 

409206 AW364844 gb:QV3^T0044-221299«5^3DT0044Homo 2.2 

410556 R321S8 gb7h67a07Al Soares placenta Nb2HP Homo 2.2 

45 432584 AA928829 Hs.47099 hypothefical pratein FU21212 12 

439482 W70045 H5.58089 ESTs 12 

447877 AI435184 Hs.164252 ESTs 12 

418297 R912S4 gtKyp94e1l8l SoaresfetdBver spleen 12 

403534 12 

50 410594 AW770778 Hs.281238 ESTs 12 

414000 BE242814 Hs.323494 ESTs, WealOy similar to T27544 zinc res) 12 

432762 NM14099 Hs.278924 PR01768 protein 12 

437608 AA761594 Hs.122440 ESTs 12 

438550 AW976002 Hs.258402 ESTs 12 

55 439626 N22415 HS.1BS0B0 ESTs 12 

444540 At693927 H8.26516S ESTs 12 

450024 AA005129 gb3h90hOar1 Soares.fetaUver^Ieen. 12 

456481 AA258033 Hs.108110 DKfZP547E21 10 protein 12 

^. 435138 BE314734 gb:601152976F1 NIH_MGCJ9 Homo sapiens c 12 

60 412887 6E007420 gb:PM3^N0142-200300^X)1-c04 BN0142 Homo 12 

454204 AW816498 gb:QV(WT0236-171299^5-b02ST0236Homo 12 

408253 AW807476 Hs.2t051 Homo sapiens mRNA for FU00012 protein. 12 

432887 At926047 Ks.162859 ESTs 12 

^_ 448053 AI459108 Hs.159818 ESTs 12 

65 416171 H23898 Hs.125790 ieudne-rlch repeat^^ntMg 2 12 

433098 AW190593 Hs.151143 ESTs 12 

409781 AWB12266 Hs.15220 zinc Imger protein 106 12 

423441 R68649 Hs.278359 absentlnmeianomal ill® 12 

423646 K02364 gb:yi35d08j1 Scares placenta Nb2HP Homo 12 

70 436572 AA723274 Hs.279598 ESTs 12 

447044 AF030107 Hs.17165 tegutatDrof&protelnstgnatling 13 12 

448828 AI580296 Hs.174782 ESTs. Weddy similar to KIAA1 437 protein 12 

444585 AW170015 Hs.6594 ESTs 12 

437334 A1353947 Ks.283780 hypothelicai protein DKFZp761N1814 12 

75 431917 D16181 Hs.2868 peripheral myelin protein 2 12 

400843 12 

455688 BE067238 gb:PM1-eT0348-1512994)01-a12Gnr0348Honn 12 

449560 AA001767 Hs.17924 ESTs. Moderately similar to ALUlJiUMAN A 12 

408940 M58583 Hs.662 cerebeilin 1 precursor 12 

80 455201 AW947884 0bdnyi1-MTOO1O-2OO3OIM»1-gO8MTOO1OHomo 12 

413617 BE155373 Hs.27951B amyloid beta (M)precu(Sor<IIke protein 12 

4S9495 BE5441S6 gb:601O76707F1 NIHJM3CJ2 Homo sapiens c 12 

433225 AW816515 Hs.173S40 ATPase.Cl8SBV,typa10D 12 
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444547 
417156 
416761 
408867 
406748 
427443 
452B43 
427473 
433919 
431058 
428879 
415250 
440253 
434470 
416849 
432463 
400661 
407287 
414817 
415143 
449808 
412314 
442952 
425187 
408221 
411480 
459681 
414784 
442726 
450433 
437642 
406298 
409723 
433266 
435090 
457187 
446534 
403764 
442735 
455221 
405965 
408420 
441679 
432781 
448470 
419637 
443180 
422213 
423119 
450192 
428042 
400734 
430499 
451134 
401694 
423531 
424419 
431364 
436640 
436802 
443994 
445908 
446412 
448390 
449939 
412700 
453125 
422757 
452864 
452441 
402395 
459659 
428186 
438432 
409446 
408764 



414275 
436992 
439634 
444199 
446009 
435510 



AV650207 

N49476 

H85422 

AA437199 

AW339108 

AA402713 

AI796759 

AW274439 

AA746311 

AW968865 

AA431765 

F02614 

AI651329 

AA634818 

AW474S47 

AA548518 

AI678812 

AW902892 

AI955650 

AA694220 

AAB25247 

AI743261 

AW014486 

AA912183 

AW848022 

NM_000344 
AW136066 
AW444538 
AL079309 

AW885757 

AI663224 

BE217923 

AA443927 

AI307356 

R91949 
AW867751 

NM_006915 

BE502267 

NM-014133 

AW026226 

W27493 

R15875 

AA308385 

AA322201 

AA263143 

AA4ig529 

AWg69408 
AA31831S 

AW7527B2 

AK001563 

AW971382 

AA724411 

N34486 

AI094805 

R13S80 

AW135313 

AL035414 

T86420 

BE222433 

AW779544 

AI909935 

AA033714 

BE222078 



AW504300 
AW444990 
.AI561173 
BE087164 
BE296227 
AW970254 
AA741074 
W79377 
AI128931 
Ai989885 
BE143837 



Hs.282437 

Hs.166563 

Hs.108556 

HS.6S6 

Hs.217493 

Hs.97872 

Hs.208320 

HS.2S2709 



Hs.27319 

Hs.160289 

Hs.298138 

Hs.53565 

Hs.186733 



Hs,237B2 
Hs.79033 
KS.1S403 
Hs.250699 

Hs.131860 

H5.22509 

Hs.47447 



HS.2B8986 

Hs.19145 

Hs^31863 



Hs.257862 

Hs.31476 

Hs.149595 

Hs.144360 

H5.175225 



Hs.44756 
Hs.65g96 
Hs.278940 
Hs.309479 

Hs.258576 

Hs.133160 

Hs.131976 

Hs.24596 

Hs.76391 

Hsl31991 
Hs^999 

Ks.129750 

Hs.146589 

H8.294016 

Hs.156065 

H5.170504 

Hs.135522 

Hs.13436 

Hs.1500g8 

Hs^1068 

Hs^2139 

Hs.201262 

Hs.115497 

Hs.65551 

Hs^87629 

Hs.113069 



Hs.295605 

Hs.258800 

Ks.676e8 

Hs.302415 

H3.250822 

Hs.889 

Hs.1 20750 

Hs.167 

Hs.260681 

Hs.231926 



E$T5« simQar to 138022 hypoUieli 
repQcaOon EBCtorC(8c6va!Dr 1) 1 (14 
ESTs 

ceOdMstoncydeZSC 

8nns)dnA2 

ESTs 

ESTs 

ESTs 

glK0856dl2j1 Na^CGAP_GC81 Homo sapiens 

gb:EST380941 MAGE resequences. MAGJ Homo 

^I)3w80c03^1 Scares Jes&JIHT Homo $8p 

ESTs 

ESTs 

ESTs 

Homo sapiens PIG-M mRNA for mannosyttran 
ESTs 

gb:tu59d08jt1 Na,CX5AP^Gas4 Homo sapiens 

hypothetical protein aj 12847 

glutarainyl-peplide cydotransferase (gtu 

ESTs, Moderately sindar to ALU7.HUMAN A 

heal shock fectorUndng prat^ 1 

ESTs 

ESTs 

ESTs 

gtKlL3CT0214-2312g»)53^ CT0214 Homo 

sunnva) of motor neuron Ittetamerte 

ESTs 

ESTs 

gb:Homo sapiens mRNA full length tnserl 
ESTs 

Homo sapiens cDNA FU 13872 lis, clone TH 

ESTs 

EST 

ESTs 

gb:yqO6h06.s1 Scares fetal Over spleen 
gb:MI^N0038-29030a001^ SNQ038 Homo 

reSnifis pigmentosa 2 (X-Tinked recessl 
ESTs 

PRO0618 proton 
ESTs 

gb;31h10 Human retina cONA randomly prim 

cl3udin12 

ESTs 

ESTs 

RADSI-inleFBcSngprat^ 

myxovlrus (hfluensB) resislanoe 1, homo 



ESTs 

hypofheOcal prolebi F1J22195 

hypothetical protein FU10546 

tqr(»the(ical protein RJ10701 

ESTs, Moderately similar to B34087 Ivpot 

ESTs 

ESTs 

ESTs. Weakly simHar to S38038 hypofhetl 
Homo sapiens dona 24425 mRNA sequence 
ESTs 

hypothetical protein 
ESTs 

ESTs. Weakly similar to 138022 hypothec 

hypothetical protein FU22655 

Homo sapiens, Similar to DNA segment, Ch 

hypothetical protein FU14260 

ESTs 



mannosldase, alpha, dass 2A, member 2 
ESTs. Weakly similar to 138022 hypotheti 
ESTs 
ESTs 

serineAhreonine kinase IS 
Qiarot-ljeyden aystal pro^ 
ESTs 

nteotutwte'BSSOda t ed protflto 2 

ESTs. Moderately Mlar to AUIF.HUMAN I 

ESTs 

gb:MRO-HT0164-1S129»4)124>08 HT0164 Homo 



2.2 
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2.2 
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2.2 
2.2 
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Z2 
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22 
22 
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Z2 
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Z2 
Z2 
Z2 
Z2 
2.2 
Z2 
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Z2 
Z2 
Z2 
Z2 
Z2 
Z2 
Z2 
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Z2 
Z2 
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403891 




458333 


A1000792 


454560 


AW807281 


439343 


AF086161 


421498 


AA292084 


414428 


BE296906 


406941 


X58140 


445712 


A1458246 


451270 


AW341392 


4S14D3 


AA885569 


437073 


A]885608 


434789 


AW292515 


430884 


AF053748 


445944 


H06336 


405233 




446512 


H30351 


403188 




404443 




433S45 


AI621746 


414456 


H74314 


433479 


AW511459 


455482 


AW9483S3 


446384 


AB006624 


452004 


AI827815 


40S0S9 




425457 


AW964212 


403317 


U02687 


413801 


M62246 


415871 


R55995 


432774 


AA564946 


436349 


AI445255 


445532 


BE138944 


456313 


AA225741 


412818 


NM_003337 


450271 


AI693900 


401521 




422880 


AF228704 


448871 


BE616709 


449233 


BE048401 


408217 


AI433201 


457003 


S78234 


417448 


AA203135 


402103 




450579 


AW138774 


429597 


NM_003816 


456596 


AA291834 


415333 


H2441S 


457353 


X65633 


43498S 


AA658229 


414729 


BE466928 


400510 




420844 


AA59S522 


427434 


BE538374 


432188 


AI362952 


446296 


AA985662 


453853 


AiJ040600 


459108 


AW084176 


430118 


AI377255 


455964 


BE166924 


437981 


AA774445 


439957 


AI453184 


423734 


H02217 


450721 


AI732271 


429392 


AL109712 


429986 


AF092047 


432919 


ALD79800 


434791 


AA649235 


445273 


A1218441 


400514 




412798 


AW998657 


416085 


H18072 


437846 


AA773866 


439391 


AW975638 


428414 


AL049980 


429430 


A1381637 


449689 


AF228421 


430909 


AF034632 


453116 


AI276680 


416312 


W02640 


423019 


AI640185 


414007 


AI733895 


459535 


AV654907 


448776 


BE302484 



Hs.108209 ESTs 

gb:MR4^T0062-240300^}0^01 ST0082KDmo 

Ks.114611 hypolheticdprotBmFU11808 

H5.191575 ESTs. Moderately similar to ALU2jnjMAN A 

Hs.182625 VAMP (vesida-assodsted menifarane pratd 

(NONE) 

H3.167451 ESTs 

Hs.235795 ESTs 

H5.40919 Homo sapiens C0NAFU14511 (is. done NT 

Hs.94122 ESTs 

Hs.1 94317 ESTs. Weakly similar to T08&80 hypoihetl 

Hs.246114 glial ceQ derived neurotrophic factor 

Ks.13460 Homo sapiens done 24875 mRNA sequence 

Hs.207982 ESTs 



Hs.190258 ESTs. Moderately sin^ to AlU6_HUMAN A 

gb7U56e10.r1 Soares fslal liver spleen 
H$.249972 ESTs 

gb:ROO-MT0015-130400^1^7 MT0015 Homo 
KIAA0285 protein 
ESTs 



Hs.14912 
HS.2773S9 



MAGHHomo 



gb:EST376285MA6En 

Hs.38S lims-related tyrosine idnase 3 

Hs.35406 ESTs. Highly slmBar to unnamed protein 

Hs.283309 ESTs, Moderatdy similar to ALU1.HUMAN A 

Hs. 156280 ESTs 

Hs.115315 ESTs 

Hs.146200 ESTs 

gb:nc17b10Jl NCLCGAPJ'rl Homo sapiens 

Hs.811 ubiquI6n-con)ugaling enzyme E2B(RAD6h 

Hs.200920 ESTs 

Hs.121524 glutathione reductase 

Hs.1 59265 kruppel-related zinc finger protein hcKr 

Hs.196511 ESTs 

Ks.279860 tumor protein, transtaSonafly-oontroOe 

Hs.172405 cen division cyde 27 

Hs.130186 ESTs 

Hs.48614 ESTs 

Hs.2442 a dislnlegrin and metalloprotelnase doma 

Hs.78950 branched chain keto add dehydrogenase E 

Hs.13273 KIAA0592 protein 

Hs.248144 melanooortin 2 receptor (adrenooorfioolr 

Hs.291228 ESTs 

H8.281901 ESTs 

0b3th22cD9^1 NCLCGAP.Pr1 Homo sapiens 

H8.301732 hypotheflcd protein MGCS306 

Hs«2928 solute carrier family 7 (cationic amino 

Hs.63131 Homo sapiens cDNA FU13155 tls. ctone NT 

Hs.188083 ESTs 

Hs.223296 ESTs, Weakly similar to 138022 hypothec 

Hs 183287 ESTs 

gb:CM4-HTOS01.24030a«19-n}1 HT0S01 Homo 

Hs.192095 ESTs,WeaklyslmilartoKiAA1397proteln 

Hs.66357 ESTs 

gb:yi38d1 1.r1 Soares placenta Nb2HP Homo 
Hs.25587 ESTs 

Hs.296506 Homo sapiens mRNA full length Insert cON 
Hs.227277 sine oculis homeobox (Dmsoph!)^ homdo 

gb:DKFZp434O2330 j1 434 (synonym: htes3) 
Hs.1 16457 ESTs, Weakly similar to N1P3LHUMAN dCUI 
Hs.153846 ESTs 

Hs.119120 E3 Ubiquian Dgase SMURF1 
Hs.92576 ESTs 

Hs.244569 esophagus cancer-related gene-2 

Hs,293490 ESTs, Weakly simiiar to 138022 hypolheii 

Hs.164216 OKFZP564C152protebi 

Hs.1 55335 ESTs 

He.23889 DKFZPS64A032 protein 

Hs.2481 26 G protein^nupled receptor 38 

Hs.146086 ESTs 

Hs.16247 ESTs. WeaMysimOar to 2004399A chromes 
HS.28382B ESTs 
Hs.103813 ESTs 

gb:AV654g07 GLC Homo sapiens cONA done 
Hs.30057 MRS2 (S. cerevislae)-a(B, magneshim hom 
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421279 AW664e78 Hs.106645 ESTs ^1 

443167 A1202009 Hs.132087 ESTs 11 

459124 AW301478 Hs.184592 proldn kinase, lystne dafidenl 1 2.1 

448078 AI460117 H5.170464 ESTs. WgWy similar to A53933 myosin I 2.1 

5 438858 BE545498 gb:601070344F1 N1HJ4GCJ 2 Homo sapiens c Z1 

430521 NiyL016383 Hs.242163 M}M-TES« tumor antigen 2.1 

431089 BE041395 Hs.283S76 ESTs, VfeaMyslniDar to unknown protein Z1 

407401 AF029325 gb:HQmo sapiens Iaminlnbetfr4 chain pre 2.1 

426336 AA375802 gb£ST8813SHSC172ce!)s II Homo sapiens Z1 

10 451124 AI186203 Hs.31432 cardiac ankyrin repeat protein Z1 

425541 AA359119 glr£ST68172 Fetal hmgll Homo sapiemc 2.1 

406504 11 

410626 BE407727 gb:601299771F1 NIH.MGCJI Homo sapiens c 2.1 

415186 AA160945 >te.14479 Homo sapiens cDNAFU14199fe. clone NT Z\ 

IS 416175 H24230 Hs.2714S8 ESTs, Moderately sindar to ALU1.HUMAN A 2.1 

436820 At684535 Ks.200811 ESTs Z1 

442095 AI733162 Hs.1 28470 ESTs Z1 

451878 AI821027 Hs.8429 ESTs 11 

449178 A1633748 Hs.ig7597 ESTs 2.1 

20 427307 AF117947 Hs.174795 POZdomaiiKonlaJnlng guanine nudeotidB 2.1 

415857 AA866115 Hs.127797 Homo sapiens cONAFU11381fis. done HE 11 

425154 NM.001651 H5.154650 coDagen. type DC. alpha 1 11 

449746 AI668594 Hs.1765B8 EST6,W^sbTiilartoCP4Yj1UMANCYT0C 11 

441543 Ar733014 Hs.269715 ESTs 11 

25 403065 11 

428811 AA436052 Hs.99487 ESTs 11 

451803 BE541174 Hs.252058 ESTs, Ivkxteralely simflar to PC4259 ferri 11 

442906 AW298888 H9.170939 ESTs 11 

409171 R17126 gb:yg09c11.n Soaras infant brain 1NI3H 11 

30 414175 AI30B876 Hs.103849 hypotheticsj protein DKFZp761 01 12 11 

450785 AA852713 Hs.25459 Homo sapiens, atpha-1 (Vl) collagen 11 

412039 AW887384 gb:RCO<}T0089-13030O.021^07OT0089Homo 11 

4S30SS AW291436 Hs.31917 Homo sapiens, done M6C:96S8.mRNA,comp 11 

443268 At800271 Hs.129445 hypolhetica] protein FU12496 11 

35 455022 AWB50845 gb:lL:H:T022D-11119»)284)11 070220 Homo 11 

447972 AL137275 Hs.20137 hypothetical protein DKFZp434PQ1 16 11 

422942 AF054839 Hs.122540 tetraspan2 11 

400451 11 

406668 T62745 Hs.184411 albumin 11 

40 450159 AI702416 Hs.200771 ESTs. Moderately Similar to A Chain A. T 11 

404834 11 

448732 BE614063 Hs.334689 KIAAl 838 protein 11 

423453 AW450737 Hs.128791 CGMB protein 11 

421447 AB005216 Hs.104481 Nck. Ash and phosphoTipase C binding pro 11 

45 408774 AW270899 Hs.254569 ESTs. Weakly similar to B34087hypotheli 11 

419986 AI345455 H8.76915 GA4Mingprotobitranscrfp»on factor, 11 

405732 11 

417848 AA206581 Hs.39457 ESTs. Weakly similar to J C531 4 COC28Wc 11 

_^ 442875 BE623003 Hs.23625 Homo sapiens ctone TCCCTA00142 mRMA sequ 11 

50 420344 6E463721 Hs.97101 . putalhreGprotein^upted receptor 11 

455778 6E08S746 gb:GM2-BT0893-210300-123^9BT0693Homo 11 

426953 At769261 Hs.97439 ESTs 11 

440454 AI733037 H5,129990 ESTs 11 

433917 AI80932S Hs.122814 Humai DMA sequence from done RP5-1028O1 11 

55 424872 AA347923 gb£ST54302 Fetal heart » Homo sapiens 11 

454658 AWB12330 Hs.11123 DKFZP564G092 protein 11 

441963 AI733307 Hs.128002 ESTs 11 

439498 AA906731 H3.56297 008 protein 11 

456224 AW292905 Hs.128770 ESTs 11 

60 413525 BE145699 gb:MRO4fr02Q8-221299-204410HTQ206Homo 11 

444702 AI220122 Hs.326560 hypothetfcal protein MGC2780 11 

417787 R14948 Hs.23883 ESTs 11 

400612 11 

^_ 410878 AW809201 Ks.314248 ESTs, Weakly similar to ALU4_HUMAN ALUS 11 

65 414494 AA766491 Hs.6783 hypothetkal protein FU22724 11 

427027 AI924294 Hs.173259 uncharacterized bone marrow protein BM03 11 

451067 BE172185 gb:MI^HT0559-1 10300^5^1 1 HT0S59 Homo 11 

455032 Ai830B90 Hs.192422 ESTs 11 

„ 417945 R29072 gb:F1-101D 22 week old human fetel liver 11 

70 438268 AA782163 Hs^3502 ESTs 11 

424754 RD9692 gb^QSbllrl Soares fetal Iverspteen 11 

404599 11 

459655 11 

402455 11 

75 459278 AW294659 Hs.34054 Homo sapiens cONA:FU22488fls. done H 11 

421987 A!133161 Hs.286131 CQ-IOI protein 11 

400339 X57131 H5.248209 H2Ahlstone family, member F.pseudogsne 11 

438206 AA780385 Ks.187885 ESTs 11 

458451 AW297181 Hs.195922 ESTs 11 

80 447534 AW95393S HsJ0837 ESTs 11 

417687 AI828596 Hs.250691 ESTs 11 

412717 W00973 Hs^728 ESTs 11 

405759 11 
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406413 2.1 

442081 AA401863 Hs.22360 ESTs 2.1 

457938 AI373638 Hs.133900 ESTs 2.1 

420687 AA279392 Hs.88605 Homo sapiens cDNAaJl3427 05, done PL 2.1 

5 428822 W28418 Hs;30715 potassium voltage^ated channel Isknel ^^ 

415635 F13168 gb:HSC3JF101 nomtdized infant brain cON 2.1 

411421 BE272110 Hs.21177 ESTs Z1 

437825 AA769123 H5.291947 ESTs 2,1 

437083 AW082597 Hs.244862 ESTs 2.1 

10 409466 AA436207 Hs.226666 ESTs. lukiderately similar to 154374 gene ^1 

433523 H29882 Hs.162614 ESTs 2.1 

446868 AV660737 Hs.135100 ESTs 2.1 

445882 Ai948717 HS.22S155 ESTs, WeaMy similar to A46302PT6-assoc 2.1 

43B005 BE151746 Qb:PM1-HT030W)612994)03^HT0305Homo Z1 

IS 406617 Ai936028 gbnin)47809j(1 Na.CGAP.Gas4 Homo sapiens Z1 

410486 AW23S094 Hs.69233 ^ linger pretein 11 

411940 AW87666S gb:CM4*PT0031.180200^<eOS PT0031 Homo ^^ 

412446 AI768015 Hs.92127 ESTs 2.1 

457289 AW573204 Hs.137078 ESTs ^^ 

20 400335 Y13187 Hs.248067 Homo sapiens dmd gene, intron 11 ZO 

435959 AW296243 Hs.1 18375 ESTs ZO 

448188 AW001835 Hs.13323 hypolheGcal protein aJ22059 10 

418339 AA639902 Hs.l04215 ESTs. Moderately simiiar to SPCNJ1UMANS 2.0 

420430 AI703192 gb,-wd92h04jt1 Ma.CGAP.U24 Homo sapfens 10 

25 445717 AW664658 Hs.149332 ESTs 2.0 

451852 H09260 Hs.32333 ESTs 2.0 

459688 2.0 

441998 BE349537 Hs.38383 ESTs 2.0 

412194 AW900282 Hs.115412 hypothetical protein FU13881 10 

30 444229 AV64B613 Hs.282397 ESTs 2.0 

441635 Ai906538 Hs.13300Q ESTs, Weakly similar to S26689hypothet] 2.0 

421387 AR)59566 H5.103983 soluta carrier family 5 (sodium iodides 2.0 

414373 AW162907 Hs.75969 proline^ protein with nudear target 2.0 

428209 AA424197 H3.98947 ESTs. Weakly similar to S33496 trypsin! ZO 

35 443520 W90O22 Hs.186809 ESTs. Hig hly similar to LCT^HUMANUEUKO 10 

409248 AB033035 Hs.51965 K1AA1209 proteto 2.0 

444518 At160276 Hs.146884 ESTs 2.0 

422237 M13149 Hs.1498 histuiine-rich glycoprotein 10 

409316 U2B251 Hs^7 ESTs. Highly slmaar to Z169_HUMAN ZINC ZO 

40 402725 ZD 

413783 AA314337 Hs.301547 ribosoma) protein S7 ZO 

423867 AA331886 gb:EST35757 Embryo. 8 week t Homo sapien ZO 

425008 AWS7S764 Hs.174248 ESTs ZD 

427271 AW195922 Hs.188758 oonnexIn59 ZO 

45 444102 AV647953 Hs.83077 Intedeukin 18 (interfemivgammalndudn ZO 

445829 AI452457 Hs.l45526 ESTs ZO 

452366 AK000464 H5.29276 hypothetical protein FU20457 ZO 

457652 AF116656 Hs^73B09 Homo sapiens PR01167mRNA. complete cds ZO 

429540 M85776 gb:EST02297 Fetal brain. Slratagene (cat ZO 

50 459456 AA486036 Ks.190124 ESTs ZO 

409840 AW502122 gb:UI-HF-BR0Hr-o0&4MJI.r1 NIHJMGC.S ZO 

441025 AA913880 Hs.176379 ESTs ZO 

457802 T7&013 Ks.167279 FYVE-finger-contaMngRabS effector pro ZO 

^_ 445627 AW818475 Hs.7363 ESTs ZO 

55 440299 A1871778 Hs.250112 ESTs ZO 

401235 H24185 Hs.929ie hypothetical protein ZO 

429996 N90822 Hs.48969 ESTs ZO 
455135 AW857989 gb:PM2^0328-281299-003«04 GT0328 Homo ZO 

411537 BE0732S0 gb:MRO-BT0551-060300-102-e05 BT0S51 Homo ZO 

60 433449 AW772282 gb:hn71bOSj(1 NQ.CGAPJ<ld11 Homo sapien ZO 

454197 BE140966 gb:MR0.KT0065OB11994}02-b06HT0065Homo ZO 

445297 6E544163 Hs.87128 hypothetical protein FU23309 ZO 
403977 ZO 

458948 AI695359 Hs.280943 ESTs ZO 

65 418683 AK001100 H8.41690 desmoooOinS ZO 
411479 AW846047 oh:li3Cr0214-2912994)S2^12 CT0214 Homo ZO 

426536 AI949749 Hs.44441 ESTs ZO 

442765 6E567353 Hs.99480 ESTs ZO 
400859 ZO 
70 405829 ZO 

411863 BE07S244 Hs.12420 ESTs ZO 

415258 AW752247 Hs.293B53 ESTs ZO 

416093 R50585 Hs.268698 ESTs, Moderately similar to AlUC.HUMANt ZO 

416184 R48481 H5.269177 ESTs, Weakly similar to ALU6J4UMANAtiiS ZO 

75 437733 AI792574 Hs.122876 ESTs ZO 

453118 AW195849 Hs.252757 ESTs ZO 

457039 H29990 H3.101937 sineoairishomeobax(Drg6aphIl^homoto ZO 

444292 AI139794 Hs.l45569 ESTs ZO 
431360 NI^000427 H3.251680 toricrin ZO 

oU 407644 D16B15 Hs.37288 nxlearraceptor subfamily 1. group D,m ZO 
412029 AW686238 gb:RC5^TW78-28030(M)22-F01 OT0078 Homo ZO 

438522 AA809431 Hs.258886 ESTs ZO 
422634 NM.016010 Hs.118821 GGI^pnteln ZO 
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418790 H95693 gb.7t9S<nU1Soares_pinea)..sM.N3HPG ^0 

442950 AIS00417 Hs.46764 ESTs 2.0 

4S7040 N77624 Hs.173717 phosphaGdic add phosphatase lype 2B 2.0 

436464 AI016176 H5269783 ESTs,VteddyslmBarbALULHUMANAUJS ZO 

5 402674 2.0 

408733 AW264812 Hs.254290 ESTs 2.0 

408767 AA057279 Hs.21ig28 ESTs 2.0 

432801 NM.016260 Hs.278963 zfaic finger DNAtMigprotinHtiios 2.0 

418205 L21715 Hs^760 troponin I, skeletal, fad 2.0 

10 404604 2.0 

413827 BE182D82 Hs^4e973 ESTs 2.0 

402341 2.0 

436090 AA777534 Hs.191992 ESTs 2.0 

421303 T06464 gb:EST04353 Fete) brain. Stralagens (cat ZO 

15 411417 AW845461 gb:MR1-CT0056-201 199^)08^ CT0056 Homo ZO 

401988 2.0 

415318 T06544 gb:EST04433 Fetal brain. Stratagene (cat ZO 

417756 Z430S6 gb:HSC12B021 nonnafizBd infant brain cON ZO 

418301 AW976201 Hs.53913 hypothetical proleh FU10252 ZO 

20 433755 AW0B5934 H3,1 20868 ESTs ZO 

435413 AI267476 Hs.46669 ESTs ZO 

435648 H24347 Hs.27524 ESTs ZO 

447555 AI391662 H3.160963 Homo sapiens, clone MGC:12318.nrtRNA, com ZO 

458175 AW296024 Hs.150434 ESTs ZO 

25 458433 AL135352 Hs.255883 ESTs. Weakly similar to 138022 hypo W ZO 

446595 T57448 Hs.15467 hypotheUcal protein FU20725 ZO 

447678 BE385257 Hs.336457 Homo sapiens dopamine receptor inleracti ZO 

448150 AI472167 Hs.302739 ESTs ZO 

453445 AIJ036532 Hs.91453 ESTs ZO 

30 444420 A)148157 H8.146766 ESTs 2.0 

431956 AK002032 Hs.272245 Homo sapiens cONAFU 11 170 fis, done PL ZO 

413758 BE162391 gb:Pli/l2-HTO451'O9010aO02.4O4 HT0451 Homo ZO 

428231 U17989 Hs.183105 nudearautoanfigen ZO 

455873 BE152239 gb:QV4-HT031&4B1199^2B-f12HT0316Homo ZO 

35 430970 A1018210 Hs.144083 ESTs ZO 

412277 BE277592 Hs.73799 guanine nucleotide tiinding protein (6 pr ZO 

413025 AA805265 Hs.291646 ESTs ZO 

424083 AF055018 Hs.139137 Homo sapiens done 24442 mRNA sequence ZO 

427654 AA410183 Hs.137475 ESTs 2.0 

40 410483 BE163567 gb:QV3-HT0460-230200-1014)08 HT0480 Homo ZO 

423942 AF209704 Hs.135723 g(!ycoI^ld transfer protein ZO 

430340 AA476777 gbUDtf94g11.r1 Soares.tayjietusJtb2HF8. ZO 

425888 IM73531 Hs.ig37 retinal degeneration, slow (retinitis pi ZO 

425075 AA506324 Hs.1852 add phosphatase, prostate 2.0 

45 400265 2.0 

405985 ZO 

407407 AF050198 gb:Homo sapiens putative mitochondrial s ZO 

411459 BE142707 gb:MR0.HT0t57-1911994»2-9l2HT0157Homo ZO 

415105 D60166 gb:HUM089G1 IB Ctontech human fetal brain ZO 

50 434531 AA642007 Hs.1 16369 ESTs ZO 

447153 AA805202 Hs.315562 ESTs ZO 

447185 AW377092 HS.996D1 hypolhefical protein FU12553 ZO 

455696 6E067670 gb:RC>)-6T0362-021 299^1 -b06BT0362 Homo ZO 

456510 AK001652 Hs.99423 ATP^pendent Ri4A heticase ZO 

55 400817 AF1S1064 H&36G69 hypothetical protein ZO 

418647 AA226198 gb:nc26a07.8l NCLCGAPJ>r1 Homo sapiens ZO 

401785 ZO 

433417 AA587773 Hs.8859 Homo sapiens. SIndar to RlKENcONA 5830 ZO 

420777 AA280223 Hs.130865 ESTs ZO 

60 439509 AF086332 Hs.5B314 ESTs ZO 

430203 L36140 H3.235069 RecQprotein^ncepNAhelicaseQI'liks) ZO 

450382 AA397658 Hs.60257 Homo sapiens cONAFU13S98 lis. done PL ZO 

455540 BE080231 gb:RC4.BT0629-ia}200^2-f11 BT0629Homo ZO 

437620 AW976930 Hs.128760 ESTs ZO 

65 407528 X64990 gb:H.saplensRiRNAHTP(»X16tbrQKactDfy ZO 

402048 ZO 

403623 ZO 

411518 AW850246 gb:IL3Cr0219.2910994)21^7CT0219Homo ZO 

417531 NM.003157 Hs.1087 seilnefthreonine idnase 2 ZO 

70 422S00 6E143588 Hs.87 fetincblastsma«ke1 (p107) ZO 

423347 Ai660412 Hs.234557 ESTs 2.0 

424560 AA158727 Hs.150555 protein predicted by done 23733 ZO 

433153 • AA578512 gb:nh22e11.s1 N(X.CGAP_Pr1 Homo sapiens ZO 

433347 AF023130 gb:Homo sapiens Ra8-GRF2 mRNA. partial c ZO 

75 435373 AW865538 Hs.117689 ESTs 2.0 

442988 Ai026130 Hs.l31683 ESTs ZD 

447505 ALJ04g268 Hs.18724 Homo sapiens n)RNA;cONADKFZpS64F093 (fir ZO 

454423 AW603985 gb:RC4-CN0048-140100^11-a04CN0048Homo ZO 
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Table 31B 



Pkey: UMque Eos probeset identitier number 
CAT number. Gene duster mimber 
Accession: Genbank accession mmlbers 



Ptey 


CAT number 


Acoesdon 


407764 


1014849.1 


BE008347 BE008320 BE083307 BE0833t1 AW075968 


408274 


104999J 


R17315Z43364AA053547 


408691 


1074933J 


AW250525BE265117 


409171 


1104879J 


R17126R384S6H02771 


409208 


1108161J 


AW364844 AW364847 AW337534 AW937593 AW937659 


409623 


1144047J 


AW4491 85 AW449655 BE220971 


409699 


1149033.1 


BE154550 BE154785 AW466343 BE154816 8E154667 


409816 


1155396.1 


AW500954 AW50111 1 AW501394 


409840 


1156071.1 


AW5021 22 AW5021 25 AW501663 AW501720 


410201 


118365^1 


AA126129AA126033AA082561 


410483 


1204995.1 


BE1 63567 BE0736B9 BE073747 BE073780 6E073739 BE073748 BE1 63495 AW750176 BE163491 BE073763 BE073871 


410500 


1206323.1 


R09442 AW846115 AW846108 AW751967 AW846083 AW846087 AW846090 


410536 


1207322.1 


N39533 AW753094 AW753093 


410556 


1208157J 


R321 58 AW75405S AW754054 A\Y754053 AW754045 AW857320 


410615 


1212203 1 


AW772721AW873372 H89212 


410626 


1212621 -1 


BE407727 


410644 


1213795.1 


AW902125AW792853AW837703AW837710 


410672 


1214882L.1 


AW794&0OAW79473O 


410845 


1223881.1 


AW807182 AW807328 AW807083 AW807183 AW807192 AW807033 AW807061 AW807286 AW807097 AW80727O AW807372 AW807280 






AW807283 


410868 


1225955J 


AW881207AW80950d 


410901 


1226077.1 


AW810001 AW61Q092 AW810170 AW809884 AW809664 AW810353 AW810428 AW810209 AW810429 AW810154 AW810168 AW809766 






AW810006 AWB09672 AW80g694 AW810S52 AW810345 AW810432 AW809960 


410934 


12Z7240.1 


AW811114AW811095AW811087AW811124AW811054AW811094AWB11157 


410999 


1228809J 


AW813004 AWB12962 AW812961 


411004 


1228975J 


AW813242 BE146089 AW813195 AW813173 AW813206 BE145953 BE146212 AW813196 AW6545B2 AW813241 6E061582 


411018 


1229132.1 


AW813428AW813444AW813367AW81336BAW813429AW813424 


411024 


1229310.1 


BE0625g0 AW813565 BE147101 


411093 


1231970.1 


BE067650AW817053 


411186 


1235090.1 


AW821257 AW821267 AW821283 


411244 


1236407 2 


AW833768AW833631 AW83343S AWB33533 AWB33554 AW833543 AW833557 


411262 


1236998 1 


AW834460 AW834531 AW834637 AW834618 AW834653 AW834487 


411311 


1238530.1 


AW836491 AW836461 AW836673 


411377 


1242238 1 


AW841462 BE1S6657 BE156668 BE0g2475 


411382 


1242594.1 


BE067246 6E067241 BE067254BE067249AW841980 


411384 


1242702_1 


AW842115AWM2111 AW842103 


411417 


1245075J 


AW845481 AW845474 AW850849 


411459 


1246724 1 


BE142707 BE142852 BE142723 BE142905 AW847712 AW847792 AWB47858 AW647652 AW847654 AW847608 BE142961 


411479 


1247077.1 


AW848047AW848202AW848631 AW848142 AW848702AW848121 AW84d632AW84B140AWa48571 AW848009 AW848087 AW848069 






AWB48905AW848214 


411480 


1247089.1 


AW848022 AW84S704 AWB48168 AW848959 AW84&476 AW848699 AW848700 AW648761 


411518 


1248692_1 


AW850246AW850251 AW850302 


411537 


1248899.1 


BE073250 BE073378 BE073379 AW8S0S33 AW850529 


411597 


1250843.1 


AW852925AW852922 


411656 


1252973J 


AW855576 AW855650 AW855578 AW85S577 AW855642 AW855619 AW855824 AW855621 


411660 


1253078.1 


AW855718AW855740AWBSS748 


411688 


1254076J 


AW953440T08189 AW857085 


411693 


1254206 1 


AW857271 AWB57308 AW857296 AW857258 


411726 


1255048.1 


AW858612 AW881S64 AW858800 AW861945 AW856445 AW861901 AW858546 AW861921 AW858395 AW861988 AW856394 AW858401 






AW858404AW858399 


411738 


1255755.1 


AW859353AW85936SAW859382AW859356 AW859351 AW859354AW659355AW8S9381 AW859348 AW859364AW859372AW859352 






AW859349 AW859367 AWB59360 AW659374 AW859350 


411745 


1255160.1 


AW887826AW8598g6 


411762 


1256906 1 


AW860972 AW862598 AWB62599 AW860988 AW660983 AW860898 AW860925 AW860922 AW860986 AW860984 AW860989 


411772 


1257388.1 


6E170301 AW881539AW904851 BEl 54336 6E154090BE1 54275 


411816 


1259632.1 


AWB64609 AW884592 AW864594 


411819 


1259748 1 


AW947864 AW947918 AW947888 AW947883 AW947897 AW947910 AW947905 AW864751 AW947878 


411880 


1263110.1 


AW872477BE088101T05990 


411940 


1266262.1 


AW876686AW876717AW877215AW876691AW876722AW8772ieAW876694AW876725 


412029 


1272597.1 


AyV886238AW886229AW885244 AW887891 AW886269 AWB86146 


412039 


1273497.1 


AW887384 AW887372 AW887461 AW887315 


412147 


1279411 1 


AW895984 AW895982 AW895983 AW89591 8 AW8g5772 AW895843 AW895847 AW895653 AWB95852 AW895648 AW896054 M85583 


412209 


1283610.1 


AW901456 AW90t450 AW901441 


412236 


1284501.1 


AW902583 AW902585 AW902695 AW902568 AW902580 AWg02692 AW902684 AW902595 AW902573 AW902661 


412282 


1287679 1 


BE160168 AW935785 BE160401 BE160319 BE160313 BE160395 


412303 


1288130.1 


AWS36336AW936339 


412321 


1286608.1 


AW936913 AW93e959 AW936918 AW936909 


412323 


1288770.1 


AW937143AW937150AW937141AW937151AVV937132AVV937160AW937191AW937174AW937195 


412329 




AVV937171AW937142AVV937145AVV937165AVV937163AW937164AVV937137AW937179AW937156A 


1288943.1 


AW937445 AW937457 AW937364 AW937374 AW937395 


412400 


1292900.1 


AW948066 AW948108 AW948020 AW948047 AW94801 1 AW948033 AW948034 AW948041 AWg48028 AW948048 AW948046 AW948056 






AW948025 AW948053 AW948043 AW948032 AW948037 AW948050 AW948060 AW946059 AW948051 AW948027 AW948038 AW948052 






AW948036 AW948031 AW948030 AW948035 AW946039 AW948026 AW946044 AW948049 AW948042 AW948054 AW94805S AW948013 
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AWg48004 AW948000 AW948018 
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412474 


129889 1 


412566 


13064^.1 


412879 


1334272.1 


412887 


1334741 1 


412999 


1343220~1 


413071 


1348162"l 


413087 


1348720_1 


413088 


1348732ll 


413098 


1349078.1 


413101 


13491 54J 


413252 


1355877ll 


413257 


135S983~1 


413445 


1370833J 


413470 


1371600 1 


413488 


1373234 1 


413499 




413525 


1374fi1R 1 


413561 


13^n)33S 1 


413640 




41%46 


1381488 1 


413758 


1386900 .1 


413875 


1306766 1 


413986 


140720 1 


414277 


14321 25_1 


414349 


1437515^-1 


414366 


143863fi"l 


414456 


1447655 1 


414539 


14G0320~1 


414546 


1460704_1 


414605 


14BS7Q0~.1 


414955 


1509869 1 


415105 


1522674 1 


415131 


1523680 1 


415236 


1<«31080 1 


415318 


1533859 1 


415425 


1S36408'*1 


415549 


1539S99~1 


415628 


1540769 1 


415635 


154flaS3~1 


416035 


1567254 1 


416154 


13* 09IK]_I 


416458 


159583 1 


416548 


ifiOOinT 1 


416616 


1604113 1 


416972 


163668 T 


417426 




417490 




417563 


168749 1 


417669 




417756 


16971 U 1 

lOSf 1 V*T_ 1 


417809 


1707657 1 


417945 


1711126 1 


418297 


1736343'l 


418647 


177521 1 


418790 


17902B~1 


416856 


179649ll 


418948 


180808~1 


419637 


186639J 


419936 


189181_1 


420111 


190755 1 


420430 


193538_1 


420621 


1951 13~1 


420844 


198986 1 


420854 


197072_1 


421065 


198335 1 


421303 


9flin39''l 

CV 1 www. 1 


<t4LI<M0 


901378 1 




904990 1 


421813 


207654 1 


422977 


223410J 


423121 


225175.1 


423532 


22936SL1 


423646 


230597.1 


423734 


231479.1 


423841 


232507.1 


423867 


232732.1 


423871 


232749.1 


424719 


242889.1 


424754 


243305.1 


424872 


244505.1 



AI791451 Ai791288 BE019234 BE296601 AA111939 

AW962574 BE073261 

BE092219 BE092361 BE008789 

BE007420 6E007419 BE007421 BE007422 

BE0462558E046611 6E046716BE046732BE046273 

BE064032 Be064039 BE064027 BE064040 8E064045 BE063868 BE06389O 

BE064655BE153953 

BE064962 BE064979 6E064853 BE064857 BE0648S6 BE064977 BE0849S0 6E064880 BE064815 6E064957 BE064804 6E064816 BE064850 

BE064806 BE064795 BE064818 BE064975 BE064819 BE064810 BE064688 BE065059 

BE065209 BE065364 6E0651 10 BE0651 1 1 

BE065215 BE155544 BE1S5S41 BE155540 BE155542 BE155543 

BE074910 6E074913 BE07491 1 BE074903 BE074B92 BB)74935 

BE075035 6H074999 6E075008 6E075005 BE075032 BB075008 BE075037 

BE141022 BE141513 BE141532 BE141488 BE141481 BE141526 BE141516 BE141471 6E141478 BE141476 6E141475 BE141021 6E141S33 

BE141464 BE141490 BE141472 BE141480 

N2fl934BE141B75BE141877 

6E144017 BE185527 BE144023 

BE1448B4H97942 

BE145899 BE145648 BE145849 BE145853 6E14S927 BE14S925 

BE150618BE150616BE1S0826 

BE158118 BE158034 BE1S4709 BE154825 

BE155042 8E155040 6E154987 BE155012 

BE162391 

BEt76776 H85072 

Z43567 H24159AA134240 

BE269910 BE296S10 BE29592S BE269325 

BE512968 

BE54gi43 BE3906136E277344 
H74314BE299593 
BE3790466E395459 
BE379492 8E391967 
BE390440 

C1S506 D80541 080506 

D60166 080893 060101 

061119 081508 D81734 

R41400 H06659 Z38637 F02023 T16518 

T06544 Z44549 F05465 R61128 

F00365 Z43395 R54298 

F11942 778047 R13266 

F13080 R34751T75411 

F13168R21289T77628 

H42314 H43080 H45217 H15384 

Z46122 H23a03 RS6554 H23055 T80310 R13090 Z45524 

AA180S11 AAG49022AW974S66 

H62953 N76608N72413 

H68270 H68271R07781 

BE019670AA191284AA1912S5 

N87579 T93706 

AA203335R11393 

AA203701 R86895 

T99898 T92080 T91447 R07805 R07779 
Z43056R13398 T80474 
R3S614R24837R52440 

R29072 R29717 R29699 R29709 R29751 R29809 R29060 R29718 R2S057 R29591 R29683 R29575 R28913 R28910 

R91254T97156 R58711 

AA226ig3 AA226513 AA383773 

H95693 W02322 AA228140 K95692 

AA382858 AW863761 AA229428 

Ai21 7097 AW886090 W38035 W38792 AA232835 AW936043 

W27493 AA248063 AA249685 AA429679 

AI792788 BE142230 AA252019 

AA255652 AA28091 1 AW967920 AA262684 

A!703192 AW901 259 AA278523 AA262062 

AA27e608 BE082076 BE081812 BE081581 

AA595522 AA280981 6E464358 

AW296927 AI684514 AI263168 AA281079 

AA329711 AA287436 AA28314d 

T06454C17931 

AA287443 AA419385 BE084078 AI478347 

AA293333AI820910AA293403 

BE048255AA313083AA298419 

AA631498A)017191 AA491211 AA761823 AA714555AA766099AA808286At934069AA570223AA574389AAS82438 AI745346 AW964510 

AA319642 AW853758 H56414 

AW864848 AA322213 AA322209 AW961624 

BE09O5O3 BE090490 BE090483 BE090501 BE090500 BE0903G5 BE090505 6E090609 BE090499 BE09O5O2 BE0905O4 BE09D497 AA327285 

H02364AA32g085AW958111 AWg61436 

H02217AA330235AW955673 

AW753967 AA370795 AA331630 AW962550 

AA331886 AW982659 AW962655 T89B41 

AA331906AA3324B4 

K90452 AA345767 AW964302 H90399 

R09G92R09414AA346353 

AA347g23 AA34792B AW961769 
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425155 


247338_1 


AA351364H19138R12960 


42S201 


247933^1 


AA3521 1 1 AW962247 AA429695 


425374 


2S0946_1 


At904013 AI904015 AA356332 


42S433 


251775J 


AA357471 AA360564 AW962961 


425457 


252038_1 


AW964212 AA357832 AA357831 


425541 


2S2945J 


AA359119AW963014 D79884 


425798 


256586.1 


AA364002AI522307 


426076 


260504J 


AW962714 AA369277 AA369278 


426336 
426497 


26S003J 


AA375B02AA375642AW963815 


268121J 


AA379913 AA379981 AW963523 


426750 


27125^1 


AA383950 AA383958 AW96&I32 


426800 


272088.1 


AA385085AW966617 T877S8 


426980 


273843.1 


AA393713AA400726 


428679 


294049.1 


AA431765AA432015 


428740 


294655.1 


AA433838 AA450385 AA683244 


429106 


299885.1 


AA446612 AA456438 AA476655 


429540 


305828.1 


M85776 AA454535 AA456a)8 H901B9 


429875 


310034J 


AI09181 5 AA460162 AA460761 


430175 


314040_1 


AA468724 AA468772AA468643AW969231 


430340 


316229.1 


AA476777 T86049 


430535 


319643.1 


AW968485 AW968670 AA480922 BE35042S 


430553 


319868.1 


AW392821 AW392809AW843258AWB43049AW603156BE165656AW821728 


430692 


32200 1 


X80240AI064750AI133594 


430698 


32210O.1 


AA492071 AA484229AW969850 


430757 


322947.1 


Ai458623 AA639708 AA485409 R22065 AA485570 


430850 


324651.1 


BE144152 AA937952 AA487799 


431058 


327401.1 


AW968865AA491199C17148 


431071 


327550.1 


AA491379 H86020AW969148 


431152 


328726.1 


AW971180AA551515AA493610AW089533 


431169 


328799.1 


AW971240 AA493843 AA493723 


431453 


333457.1 


AW753917 BE1S2926 AA505333 BE155673 


431655 


336189.1 


AW971 119 AA574265 AA513268 


431822 


338082^1 


AA516049AW004g22 


431925 


33905.1 


AK000890 BE182413 AW890890AW84417g BE17B834 


432189 


342819.1 


AA527941 AI810608AI620190AA635266 


432363 


345469.1 


AA534489 AW970240 AW970323 


432779 


354024.1 


AW979241 AA565006 AA847102 


432889 


355475.1 


AW974094 AA569074 AA602574 


432919 


356290.1 


AL079800AA570294 L25459 


433153 


359938.1 


AA578512AA595535BE177533 


433347 


38389L1 


AF023130AF181250AA984703AA694303AA351792 


433449 


38653^1 


AW772282AA592974 


433919 


377243.1 


AA746311 AA927492AA617995 


434098 


380006.1 


AA625499 AA625269 AA625184 


434138 


380572.1 


AA625804AW418787AW074833A)875642AI393368 


434662 


390415.1 


AA641957 AW749897 AW749866 AW749887 AW749890 


434671 


390655.1 


R34758AA642317 


435079 


399783.1 


AA664192H60250T71388 


435138 


401159.1 


6E314734AA666393 


435463 


406582.1 


AA682507AW851124 


435510 


407286.1 


BE143837 AW749652 AA683327 


435634 


409239.1 


T82384R05307AA693714 


435669 


409755.1 


AA694284 H68267 H68264 


436359 


41B47.1 


Z83806AJ132091AJ132090 


436720 


425876.1 


AW975902 AA729344 AI557342 


436658 


428095.1 


6ES45498 AA830720 AI873015 AA732679 


437037 


431828.1 


T63804 T6376BAA742849 


437113 


433234.1 


AA744693AW7S00S9 


437642 


43997.1 


AL079309AA281819 


437963 


44635-1 


BE396279 


438005 


447553.1 


BE151746 6E336853 063271 T9495S AA774994 


438993 


467651.1 


AA8289g5 AA834879 A1926361 


439037 


46603 1 


AF075084 H53157 H53054 


439546 


47360.1 


AF088058 W76297 W72448 


439780 


47673 1 


AL109688 R23685 R2657e 


442653 


54812.1 


BE269247 BE270032 BE270800BES66B40 


442735 


550247.1 


R91949AI016237BE072329 


443764 


579650.1 


F23263AI084941F35774 


444063 


590989.1 


A1122614AW869134 


444910 


624951.1 


AI201849 BE069007 AW946544 


445432 


63943.1 


AV653771 BE089370 


446096 


661959.1 


A1276454AI633717AI275116 


446901 


697809.1 


At347274AW844024 


447884 


740749.1 


H2950S R18575 Z43580 T48738 AI435454 BE004683 


448477 


76475.1 


8E612572AU)40190F08514 


449311 


804513.1 


A165701 4 AW594035 AI657036 AI638390 


450024 


B2296.1 


AA005129 AA679084 AA694399 


451067 


B5759J 


BE172186 AA059279 AA020815 AA013437 


452043 


89532.1 


H86231 AA021632 H38271 


452542 


921410.1 


AW312256 AW812257 AI906423 AI906422 


453211 


95527.1 


W84829 AA033900 AW573557 


453530 


97021.1 


AW021 633 AA036730 A1886854 


4S3823 


982526.1 


AL137987 BE064160 BE064188 
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453901 
454165 



454130 
454197 



9B5414.1 
1049791.1 



1049996.1 
105039^1 



454204 1050597J 



454314 
454352 
454423 
454447 
454456 
454482 
454560 
454564 
454566 



454597 

454633 
454716 
454747 
454754 

454767 

454778 
4547B4 
454790 
454836 
454864 
454938 

454962 

455022 

455121 

455135 

455170 

455201 

455219 

455221 

455238 

455252 

455255 

455275 

455280 

455310 

455328 

455464 

455482 

455488 

455511 

455534 

455540 

455556 

455571 

455587 

455608 

455675 

455688 

455698 

455747 

455756 

455778 

455760 

455849 

455851 

455886 

455673 

455880 

455335 

455964 

455992 



110816U 
1129667J 
1183079.1 
1204995.1 
1207088.1 
1215087.1 
1223940.1 
1224407.1 
1224432_1 



1226059.1 

1227504.1 
1230503.1 
1233006.1 
1233580.1 

1234028.1 

1234343.1 
1234630.1 
123475^.1 
1238509.1 
1237929^1 
1245635J 

1246750.1 

1249160.1 

1254339.1 

1254729.1 

1256906J 

1259748.1 

1261640.1 

1261678.1 

1265662.1 

1266222.1 

1266482.1 

1272255.1 

1272607.1 

1278158.1 

1280063.1 

1292643.1 

1293183.1 

129372.-1 

1321229.1 

1322942.1 

1323701.1 

1325658.1 

1331885.1 

1335045.1 

1337389 1 

1349659.1 

1350606l1 

1351077.1 

1355B77J 

1358603 1 

1364506.1 

1364580.-1 

1375441.1 

1375451.1 

1377119.1 

1379498.1 

1380022^1 

1384144.1 

1389912.1 

1338552.1 



BE065902 AW749032 AW003637 

BE141030 BE141474 BE141467 BE141753 BE141024 BE141761 AW177583 AW177579 AW177582 AW177585 AW177587 AW807582 AW177581 
BE141477 BE141520 BE141456 BE141492BE141028 BE141775 BE1414a9 6E141751 AWin599 BE141750 AW177597 BE 141512 BE141460 
8E141749AW177598 
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BE141673 BE141650 BE141674 BE141550 BE 14 1688 AW1 78241 6E140994 BE141666 6E140998 BE141008 BE140988 BE141011 BE140975 

BE141667 8E141675 BE141657 BE141681 BE141656 BE141672 BE141680 AW176237 BE141012 B£1409g0 BE141658 BE141648 BE141013 

BE141668 BE140973 6E141004 BE140963 BE140g84 BE141009 AW1 76232 BE141007 8E14164g AWl782g3 BE140993 AW1 78233 BE141646 

BE141005 BE141691 BE141000 BE141652 6E140955 BE141562 BE140960 8E140g62 6E141001 BE140978 AW178229 AW1 78239 BE141671 

AW178230 BE141547 AW17823S BE141663 BE141549 BE1409g6 BE141003 AW176236 8E141002 BE141556 

AW816498 AW808791 AW808515 AW608379 AW808532 AW808605 AW808977 AW808B16 AW176876 AW17B486 AWB06514 AW178483 

AW178485 AW809007 AW808524 

AW384844 AW364847 AW937534 AW937593 AW937659 

AW389G68 AW369657 AW6091 98 AW3&9&49 

AW603985AW8543S0 

BE163567 BE073689BE073747BE073760BE073739BE0737486E163495AW750178BE163491 BE073763 BE073671 
AW850964 AW752836 M86124 
BE147919 AW794884 BE147847 

AW807281 AW807092 AW807425 AW807330 AW807174 AW807171 AW807274 AW807278 AW807367 
AW8Q7573AWS07566AW8Q7S72 

AW80760S AW807890 AW807839 AW807752 AW807673 AW807667 AW807955 AW807760 AW807615 AW807898 AW807849 AW807821 
AW807832 AWB07842 AW807827 AW807822 AWK)7B29 AWB07830 AW807825 AW807603 AW80761 2 AWB07908 AW807595 AW80761 7 
AW807678 AW807687 AW80791 8 AW807921 AW807595 AW807602 AW807688 AW807609 AW807684 AW807770 AW807593 AW807754 
AW807679 AW607957 AW807663 AW807763 AW807902 AW607840 AW807819 AW807836 AW807769 AW807685 AW807847 AW607674 
AW807686 AW807670 AWB0791 7 AWBD7677 AW807680 AW807900 AW807669 AW807952 AW807907 AW807846 AW807756 AWB07835 
AW807608 AW807753 AW807601 AWB07956 

AW80g&48 AW809704 AW809643 AW809653 AW80g709 AW609949 AW809939 AWB10O10 AWB09705 AW809950 AW809822 AW809867 

AW810093 AW810076 AW809673 AW810349 AW809895 
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AW850684AW850150 
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AW819265AW819268 AW819246 BE152602 AW81 9249 AW81 9251 AW819263 AW819194 
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AW820852 AW820773 AWa210&6 
AWB33711 AW833620AW833699 
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AWB46556 

AW847645AW847791 AW854083 AWBS3945 

AW850845 BE144010 AWB55164 

BE156459 6E1564e9 BE156466 AW657447 

AW857S89 AW858016 AW861677 AW861689AW881691 AW858056 

AW860972 AW862598 AW862599 AW860988 AVi/860983 AW860896 AW860925 AW660922 AWB609B6 AW86Q984 AW860989 
AW347884AW947918 AW947888AW947883 AW947897AW947910AW947905AW664751 AW947B7B 
AW879403AW867707 
AW867751 AW867n0AW887763 

AW875972 AW875983 AW875974 AW876000 AWB75966 AW8760S0 
AW876627 AWB76630 AW876631 AW876625 

AW877139 AW877135 AW87701 8 AW991835 AW877128 AW877108 AW87701 7 AWB77107 
AW977e06 AW887923 AW886321 

AW886156 AW687926 AW885324 AW888236 AW887906 AW886304 
'AW893981 AW6g3998 AW894034 AW694019 
AW8g&438 AW896534 AW696500 AW896540 AW896446 
AWg83901 AW984485 AW947715 

AW948353 AWg48351 AW948331 AW948303 AW948336AW946305AW948299 AW948346 AW9483S2 
AA102322 

BE144762AW979091 

AW991925AW991919 
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BE003714 BE003721 BE003720BE003716 
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BE067236 6E067235 BE0S7240 BE0872S6 BE067263 BB)67236 BE067260 BE0672S3 BE067248 BE0S7252 
BE067870 BE067866 BE165133 BE165334 BE165329 BE165332 
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BE079307 BE079309 

BE088748 BE088802 6E088755 BE088876 6E088947 BE088881 8E088952 



BE146868 6E14686S BE146667 
BE14^6£146914BE14691B 

BE149024 6E1490S6 BE152826 BE149025 BH149057 BE152819 BE149030 BE149062 BE149023 BE149055 
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455933 13S8665 1 BE17S085BE179084BE179086BE179264 

456186 1618618 1 W26642 H8d394 

456313 177240 1 AA225741 AI734056 A1820985 AI732153 AA259201 AA22S731 

456394 184327L-2 W28506 

456407 184988J AW968614AA243209AA281411 

456476 191761J AA2587S3AW628S80 

457242 307984.1 AA457011 A1978850 

457624 41515.6 > R64938AL047151 AA310309AW063200AI56952BAI307823 N49g75 

458804 75803J AL1575KN72696BE622492 

456890 812733.2 AW865523AW865126AW865467AW865127AW865466 

459160 920051.1 AI904723AI904725AI904729A1904722A1904758AI904736 

459201 92S883J AW391177 W45021 



TABLE 31C 

Pkef. Unique number corresponding to an Eos probesel 

ReK Sequence source. The 7 digit numbers In this colunm ere GenbankMenfo (GO mimbeis. t)unham I. e( si' refm to Die puUlcalion entitled ^1leD^tAse^ 

human chnjmosome 22.* Dunham I. et al., I^ture (1999) 402:489^95. 
Strand: Indicates DMA strand froiD which axons were predicted. 
Ntj>osiBon: Indicates nucleotide positions of predicled axons. 
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Table 32A lists about 969 genes upregutated In lung fibrosis relative to nonnai body Sssues. Types of pulmonaiy fibrosis samples tnduded in this analysis were idiopathic putmonanr 
fibrosis (IPF). hypersensitivity pneumoniSs (HP), and nonspecific bttefstiflal pneumonitis (NSIP). These genes were setected from 59660 probesets on the Eos/Af^trix Hu03 
Genechip array. Gene expression data fix each piobeset obtained from this analysis was expressed as average intensily (AI), a normafized vdue reflecting (he relative levels of 
mRNA expression. 

Tefale 33A lists about 800 genes upreguiated in lung fibrosis relative to normal lung. Types of pubnonary fibrosis samples included In this analysis were idiopathic pulmonary fibrosis 
(tPf^. hypersenstQvi> pneumonliis and noiKspedflc interBtltlai pneumoniUs (NSIP). These genes were selected from 59680 probesets on the Eos/Aflymetrix Hu03 GenecHp 
array, (^e expression data tor each probeset obtained from this analysb was expressed as average 
expression. 

Td)le 34A lists about 703 genes upreguiated In idiopathic pulmonary fibrosis (IPF) relative to hypersensitivity pneumoniSs (HP) or non-specific intersStiai pneumonitis (NSIP). These 
genes were selected from 59880 probeseb on the Eos/Af^trix Hu03 Genechip array. Gene expression data for each probeset obtained liom this analysis was expressed as 
average Inlereily (Al). a normalized value reflecting the relaOve levels of miVIAexpfe^n. 

Tebl835Alisb about 323 ganes upreguiated to hypersansifivtty pneumonitis (HP) relaS^ These 
genes were selected from 59680 probesets on the fos/Aflymetiix Hu03 (Senechlp array. Gene expression data for each probeset dbtahed fmm this analysis was expressed as 
average intensity (Al), a nonnallzed vdue refiecfing the relative levels of mf^ express 

Table 36A fists about 52 genes upreguiated in non-specific interstlGal pneumoniUs (NSIP) relative to hypersensifivity pneumonifis (HP) or idiopathic pulmonary fibrosis (IPR- These 
genes were selected fram 59680 pfii)esets on the Eo8/AffymetrixHu03Gene^ Gene expression data for each probeset obtaMlkom this analysis was expressed as 
average intensity (Al), a normalized vdlue reSecOng the retaOve leveb of mRNA expre^lon. 

Table 37A lists about 206 genes downregulatsd in lung fibrosis relafive b norma) lung. Types of pulmonary fibrosis samples included in this analysis were Idiopathic pulmonary 
fibrosis GR^. hypersen^ty pneumonitis (HP), and non-specific interstitial pneumoniSs (NSIP). These genes were selected from 59680 probesets on Oie Eos/Aflymetrix Hu03 
{Senechlpanay. (Seneexpfwslon dab far each probeset obtrfned liom this analysis was 
mRNA expression. 
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TaU838A lists about 207 genssupregutated in lung fibrosis Types d pulmonary fibrostesamplBslndudedtnU^ 

(IPH. hypersensKMiy pneumonitis (HP), and non-spectTc intersQQal pneumonitis (NSIP). These genes were selected fnxn 59680 probeseb on the Eos/Aflymetiix Hu03 
Genechip anay. Gene expression data for each pnobeset obtained from this analysis was expressed as average intensity (At), a nomialized vatue reflecting the relative levels of 
mRNA expression. 

TA61£ 32A: About 969 genes upregutaled In lung fibrosis relative to normal body tissues 

Ptey: Unique Eos probesel identifier number 

ExAccn: Exemplar Accession number. Genbanit accession number 

UnlgenelD: Unigene number 

Unigene Title: Unigene gene Cde 

Rl: 901h peroBntile of lung fibrosis Ab dhrfded t>y 90th percentile of normal tissue Als. where the minimum value for the numerator and denotrnnator was set to SO. 

R2: 90th percentile of lung fibrosis Als dhrided by 90Ih percenfile of normal Qssue Als. where (he ISlh percentite of nonna! fissua Als was subtracted from both the 

numerator and denominator. The nMmum value for the numerator and denominator was set to 50. 
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M24461 
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AW056350 

AW449467 

N81037 

NM-014398 

BE041395 
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U33749 
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AI082692 
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AJ245671 
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AA033648 

AI638208 

Al^053 

AA863400 

AA215404 

)a0697 

AA454033 

CI 8863 

AI37S550 



Hs.76305 

Hs.16762 
Hs.54795 
Hs.1074 
Hs.10887 

iis.1355 

Hs.197764 

Hs.1867 

Hs.134662 

H5.94367 

iis.12844 

Hs.130704 



Hs,16757 

Hs,1012 

Hs.75182 

Hs.176703 

Hs.164225 

Hs.315389 

Hs,31141 

Hs.54443 

Hs.g8849 

H5.177582 

Hs.687 

Hs,184 

Hs.12496 

H$.949 

Hs.201378 

H5.77387 

Ks.e3169 

Ks.105039 

H5.130425 

Hs.16762 

85.1369 

Hs.187636 

Hs.163464 

Hs.44532 

Ks.77729 

H5.16530 

Hs.2488 

Hs.253569 

Hs.247057 

H132417 

HS.S0334 

Hs.2240 

Ks.163425 

Hs.11^ 

K8.103982 

Hs.136245 

Hs.g0250 

Hs.16954 

Ks.21948 

Hs.124292 

Hs.7473 

Hs.96901 

Hs.57664 



Hs.553 
Hs.41644 
Hs.163443 
HS.34686B 



Unigene Title 


Rl 


R2 


surtactant, pulmonary-associated protein 


22.45 


2a63 


FCXNES predicted novel secreted protein 


16.10 


7.65 


Homo sapiens mRNA; cDNA DKFZp564B2062 (f 


15.83 


14.86 


ESTs 


15.74 


21.96 


surfactant, pulmonaiy-assodated protein 


13.83 


34i3 


simOar to iysosome-assodaled membrane 


12.40 


7.38 


ESTs, Weakly sbnllar b unknown protein 


12.38 


6.05 


cathepsin E 


11.66 


6.49 


thyroid transcription factor 1 


11.38 


a79 


progastitcsin (pepsinogen CA 


10.69 


15.94 


ESTs 


10.64 


a27 


Homo sapiens cDNA: FU23494 fis, done L 


10.06 


8.92 


EGF-tilcexlomaln. multiple 6 


9.96 


5.43 


ESTs, Weakly simHar to BCHUIA S-100 pro 


9.90 


7.87 


N-acytsphingosine amidohydrDlase (add c 


9.38 


a35 


Eos Control 


9.03 


6.48 


ESTs 


a3i 


6.76 


complement component 4-binding protein, 


ai4 


5.51 


mannose receptor, C type 1 


7.70 


4.09 


ESTs 


7.56 


7,83 


ESTs 


7.06 


4.47 


Honu} sapiens cONA: FU23075 Tts. done L 


6.90 


2.89 


roundabout (axon guidance receptor. Dros 


6.88 


4.00 


chemokina (C-C moGQ receptor 5 


6.56 


4.25 


dyneln fight chain 2B (DNLC2B) 


6.46 


4.47 


surfactant, pulmonary-assodaled protein 


6.30 


13.57 


cytochrome P450, subfamily tVB, pdypept 


6.28 


3.38 


advanced glycosylatlon end prodxt-sped 


5.99 


13.26 


ESTs, Weakly simBar to ALU4.KUMAN ALU S 


5.88 


4.10 


neutrophil cytosolic factor 2 (65kD. chr 


5.88 


3.35 


ESTs. Weakly similar to T1 2545 hypothefi 


&87 


5.69 


monokine induced by gamma inleribron 


a64 


3.34 


matrbc metatloprolfiinase 1 (intersfitial 


5.72 


5.90 


solute canier fsnlDy 34 (sodium phospha 


5.59 


6.89 


ESTs 


5.58 


4.72 


Homo sapiens mRNA; cONA OKFZp564B2082 (f 


S.48 


S.33 


decay acoderatino factor for oomptoment 


S.48 


Z69 


ESTs 


5.45 


3.99 


totPon of hepato^ nudear factor-3 al 


5.38 


3.65 


diubiquitin 


»5.37 


3.11 


QxUised low density lipoprotein (lectin 


5.30 


a9B 


smalt tndudde cytokine subfamDy A (Cy 


5.29 


4.00 


lymphocyte cytosolic protein 2 (SH2 doma 


5.28 


Z48 


ESTs 


5.20 


3.78 


ESTs. WeaMy similar to 2109260A B cefi 


5.11 


3.81 


hypothettoal protein MGC4309 


5.11 


Z88 


testes developmen^laied NY(>^22 


5.07 


3.46 


uterogtolMn 


5.06 


9.48 


ESTs 


5.04 


4.16 


smal! inducible cytokine subfamOy A (Cy 


5.04 


3.66 


small indudbto Qloklne subfamSy B {0/ 


5.02 


4.26 


ESTs, Weakly similar to T17227 hypotheU 


4.97 


3.35 


ESTs 


4.92 


3.15 


gban03c03jt1 Stratagene schbo brain 81 


4.79 


3.05 


ESTs 


4.75 


Z65 


ESTs 


4.76 


4.26 


Homo sapiens cONA: FtJ23123 fis. ctone I 


4,73 


3.24 


ESTs 


4.66 




hypothetical protein RJ23049 


4.64 


4.88 


Homo sapiens mRNA fiiO length insert cDN 


4.60 


2.60 


ESTs 


4.54 


142 


ESTs 


4.54 


154 


solute carrier family 6 (neurotransndtte 


4.48 


4.86 


A{<AP-associated spenn protein 


4.46 


3.62 


Intron of pertosfin(OSF>2os) 


4.44 


141 


nudedar protdn p4f^ homotog of yeast 


4.42 


141 
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429228 AI553833 ESTs 4.32 2.98 

432435 65218886 Hs.282070 ESTs 4.30 2.26 

446932 AA961459 Hs.125644 ESTs 4.30 Z61 

_ 408369 R38438 Hs.182575 SLC15A2 SohitB carrier family 15 (frH»p 4.30 139 

5 409435 AI810721 Hs.95424 ESTs 4.30 Z60 

419490 NM.006144 Ks.90708 gratizyineA(oranzyin8l.c9U)tadcT4ynv 4.29 2.48 

452561 AI692181 Hs.49169 KIAA1634 pnM 4.23 126 

427698 AW972594 Hs.335499 ESTs 4.22 3.49 

431433 X65018 Hs.253495 surfactant, puhnonary-associated protein 4.22 13.34 

10 446608 N75217 Hs.257846 ESTs 4.20 3.62 

428227 AA321649 Hs.2248 sfnaDlndicn)le cytokine sulrfarn!lyB(Cy 4.18 ai4 

459702 A)204995 gb:8n03c03j(1 SfralagenesdiizolvainSI 4.16 2.64 

445885 AI734009 Hs.127699 KIAA1 603 protein 4.16 3.99. 

430260 AA361258 Hs.237B&8 Interteukin 7 receptor 4.13 2.79 

IS 425259 ALJ049280 Hs.155397 Homo sapiens mRNA; cDNA DKFZp564K143 (fr 4.12 2.19 

427019 AA001732 Hs.173233 hypothetical protein FU10970 4.12 3.02 

420556 AA278300 Hs.l 24292 Homo sapiens cDNA: Fy23123fis. done L 4.08 3.13 

426467 AK00212t Hs.184465 hypotheIicaipn]teInaJ11259 4.08 ^48 

432731 R31178 Hs.287820 fit)ronecfin1 4.06 266 

20 439398 AA2&4267 Hs.221504 ESTs 4.08 Z86 

409153 W03754 Hs.50813 hypolheGca) protein FU20022 4.05 3.51 

412584 X54870 Hs.74085 DNA segment on chromosome 12 (unique) 24 4.04 2.44 

436120 AI248193 Hs.119860 ESTs 4.04 3.11 

407910 AA650274 Hs.41296 fibronecUnleudne rich transmembrane p 4.03 169 

25 421482 AF016495 Hs.104624 aquaporinO 4.00 2.51 

443257 AI334040 Hs.l 1814 HSPC055 protein 4.00 161 

421659 NM.014459 Hs.106511 prolocadherin 17 4.00 3.00 

424273 W40460 Hs.144442 phosphoitpaseA2. group X 3.98 130 

415457 AW081710 Hs.7389 ESTs, VteaWy similar to ALU1.HUI4AM ALUS 3.97 136 

30 450656 AA010539 Hs.18912 ESTs 3.96 4.37 

429784 M89796 H3.30 membrane-spannbig 44omaIns, sulrfamily A 3.94 144 

424527 AW138558 Hs.334873 ESTs. Weakly similar to 154374 gene NF2 3.93 3.08 

413385 M34455 Hs.840 indo1earains-pyfrole2.3dk»ygen8se 3.92 3.53 

452416 AA026115 Hs.1 14777 ESTs 192 2.90 

35 428434 AW363590 Hs.65551 Homo sapiens. Similar Id DMA segment. Ch 190 5.06 

452281 T93500 Hs.28792 Homo sajjienscDNAFUII 041 fis. done PL 190 100 

453204 R10799 Hs.191990 ESTs 3.90 122 

450698 AI654223 Hs.16026 hypothetical protdn FU23191 3.81 182 

422173 BE385828 Hs.250619 phofboIin^prot^MDS019(CEIvl15) 3.80 123 

40 425638 NM_012337 Hs.1 58450 nasopharyngeal epltheHum spedfc prole 178 186 

406672 M26041 Hs.1982S3 ms^or histocompafibaity complex, dass 3.78 170 

457411 AWOasgei Hs.130093 Iroquols^lasshomeotjox protein IRX2 176 156 

432606 NM_002104 Hs.3068 granzymeK (serine protease, granzyme 3; 176 176 

436260 BE172762 Hs.292710 ESTs. Weakly slrtBar to AUi5_HUMAN ALUS 3.74 183 

45 414821 M63835 Hs.77424 Fc firagment of IgG. high affinity la, re 172 155 

428820 AA436187 H3.172631 Integrin, alpha M (complement component 171 125 

458079 AI7g6870 Hs.54277 DNA segment on chromosome X (unique) 992 170 126 

419556 U29615 Hs.91093 chitinase 1 (chitotrtosUase) 169 7.71 

417412 X16896 Hs.82112 interteukin 1 receptor, type I 3.68 117 

50 426174 AA547959 Hs.115838 ESTs 165 193 

408727 AL1372S9 Hs.47115 hypotheika] protein DKFZp43400513 164 162 

435990 AI015862 Hs.131793 ESTs 3.62 127 

427621 BE621182 H5.179882 hypolhetfcal protein FU 12443 152 148 

425555 AA359291 H3.130767 Homo sapiens d)NA:FU23553fis. done L 161 118 

55 419086 N|yL000216 Hs.89591 Kallmann syndrome 1 sequence 160 105 

426116 AA86e729 Hs.144694 ESTs 160 180 

419235 AW470411 Hs.286433 neurotrimin 3.58 188 

424054 AA334511 Hs.26638 membrane^Rlng44omaIhs,8tAfantIlyA 156 158 

422667 H25642 ESTs 155 144 

60 406673 M34996 H5.198253 maforMstDCompaUy complex, dass 154 198 

414142 AW38B397 Hs.334485 hemk»ntin(fitnifln 6) 3.S4 130 

428330 L22524 Hs.2256 matrix metalloprolelnsse 7 (matrilysln. 154 111 

430832 AI073913 Hs.100685 ESTs. Weakly sfaidar to JE0350AnlBrfQr 153 138 

417318 AW953937 Hs.240845 ESTs 152 102 

65 456034 AW450979 gb:UI-H<BI3^t»fr1 2^1.81 NCLCGAP.Su ISO 121 

415992 005837 Hs.145807 hypotheScal protdn FU13593 3.48 135 

430709 R34356 gb^hSSdOUl SoaresplacentBNb2HPHQmD 3.48 113 

440273 AI805392 Hs.325335 Homo sapiens cONA: FU23523 lis, done L 3.47 193 

424711 NM-005795 Hs.152176 cddtonin receplor-4ika 147 169 

70 418832 X04011 Hs.66974 cytoduomeb-245.t>dapdypeptUe(dtro 146 131 

416847 L43821 Hs.80261 enhancer of filamentaikm 1(cas-like do 146 137 

448019 AW947164 Hs.195641 ESTs, Moderately similar to 138022 hypot 145 107 

447183 A1554733 Hs.173182 ESTs 142 101 

435299 AI745458 Hs.343026 ESTs.WeaMy5lnaartoT20593hypQtheU 140 149 

75 425922 AL157466 H5.162751 Homo sapiens mfVlA:cDNADKFZp761E2423(f 140 142 

413714 A1560944 Hs.71428 ESTs 138 152 

407361 AA744622 Hs.29264S ESTs. Weakly simSar to ALU5JtUMAN ALUS 136 113 

436043 AW963838 H5.168830 Homo sapiens cDMAFLJ12136fis, done MA 136 141 

^_ 450330 AW50a775 Hs.24817 hypotheScal protein FU20136 136 106 

80 407756 AA1 16021 Hs.38260 ut)iquitinspedlic protease 18 135 142 

410606 AW418779 H5.114889 ESTs 135 139 

450726 AW204600 letinde add receptor, dpha 134 &35 

430573 AA744550 Hs.13634S ESTs 133 1.94 
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421585 


U95626 


Hs.302043 


433658 


103678 


Hs.156110 


454076 


AW204712 


Hs.61957 


452039 


AI922988 


Hs.1 72510 


454024 


AA993527 


Hs.293907 


430414 


AW36566S 


Hs.120386 


417958 


AA757382 


Ks.193417 


423001 


AA320014 


Hs^8603 


443774 


AL1 17428 


Hs.9740 


424084 


A)940675 


Hs.20914 


424236 


AA337401 


Hs 137635 


429819 


AL13^11 


Lfe 225108 


448869 


A1792798 


Hs.1 2496 


426083 




Hs 126712 

no* 1 &w ( 1 & 






H3.2S954 


440452 


AI99S13S 






Of i9&U3 


Hs.1473 


431X378 


229572 




413802 


AW%ddQO 


Hs 32241 


408761 


AA057264 


• H5^38936 


438568 


RQ8R6S 


Hb 1113S 


451497 


H83294 


Ue9B4122 


444034 


Al 1619S7 

ALIO 1 W» 


Hs.10177 


446094 


fvWv 1 » WW 


Hs. 13801 


442048 


AA974603 




406665 


Ml 6728 




430253 


AK001514 


H5.236844 


424943 


AU0772S0 


Hs 153924 


436805 


AA731S33 


Hs.270751 


412610 




Hs.74126 




D11928 


Hs 76845 




API 46761 


H5.20450 


420729 




H8.29(nK 




024577 


Hs.93304 


416580 


T61572 


H8.79385 


451820 




Hs 1M948 




AWd7'K7S 






Maori 4 


Hs 3^960 








42K13 




Hs.184697 


43^7 




Hs 252884 


445034 


AW2fi337G 


Hs1436S9 

tt9t 11 WWW* 


458332 


AK)O(041 




415083 


AI632683 


Hs 27179 


407930 


/vw430*lf 




407192 


AARnoTfln 




452960 


AraK)1335 


Hs.31137 


425509 


AF079363 


Hs.158213 


431087 


H12723 


Hs.290791 


452235 


ALJ039743 


Hs.28514 


449328 


A)962493 




422900 


AA641201 


Hs.222051 


414888 


ALJ039185 


Hs.77558 


430250 


NM_016929 


H3;;83021 


437527 


A)241019 


H8.146644 


432340 


AA534222 




420495 


A1336247 


Hs.98314 


445495 


BE622641 


H8.38489 


411252 


AB018549 


H3.69328 



420663 
412095 
410434 
436396 
434194 
435800 
420000 
449057 
413195 
43619B 
411492 
444020 
427785 
432583 
457675 
414646 
429950 
420394 



419038 
449765 
416293 
400880 



AI348408 

AA830166 

A)624707 

AF051152 

AI683487 

AF119847 

AI248285 

AB036063 

AB037784 

AA127382 

AK001125 

146848 

R92962 

X81053 

AW023624 

AF119917 

AA353776 

AW081608 

AB023161 

X03056 

AW134924 

N92293 

AI224483 



inununoQlobuIin kflppB oonstanl 
ESTs 



OKFZP434A236 protein 
hypolheiical protein FU23056 



ESTs, Weakly slnriar b ALU4.HUMAN ALU S 

ESTs. Weakly similar to AF191020 1 E2tG$ 

interteukin 13 receptor, alpha 2 

ESTs. Weakly similar to CAYP.HUMAN CALCY 

gastrlivrelea^ng pepBde 

tumor necrosis (actor receptor superfaml 

ESTs. Weakly similar to 865657 alpha-1 C- 

ESTs, W^aMy similar to (defline not ava 

rnajor liistocompalibillty cotnplax. dass 

WntinMbitorytxtor-l 

pledcstrin homology domain interacting p 

KIAA1685 protein 

gb»p34K)5.s1 Soares^NFL^T.GBC^SI Homo s 
gb:Human nonspediic crossreacting aniig 



(atty add binding protein 6, ileal (gas 
phosphoserine phosphatase-iike 
BCM-Gke membrane protein precursor 
ESTs 

phosphotipase A2. group VIl (platdet-ac 
Human clone 23574 mRMA sequence 
ESTs 

ESTs, Weakly similar to T17227 hypotheti 
membrattB-spanning 4^omdnSi subtemily i 
metanothlonein IE (fiinctionaO 



ESTs 

Homo sapiens cONA FU12933 lis, dons NT 
Homo sapiens cDNA FU12807 lis, done NT 
gb'^12e02x1 SoarBS.testis.NHT Homo sap 
pralein tyrosine phosphatase, receptor 1 
spemi assoda ted anBgen 6 
ESTs 

testes devek)pmen(>fe!dted NYI>$P21 
ESTs 
ESTs 

thyroid hormone receptor biterador 7 
diioride Intraceiiular channel 5 
ESTs 

gb:r^1d0Z8l NQ.CGAPJVAI Homo sapiens 
Homo sapiens mRNA; cDNA DKFZp586UI120 (f 
ESTs. Wedkly similar to 136022 hypotheO 
MD-2 protein 

H5.124675 ESTs, Weakly similar to T14742 hypotheO 
Hs.271305 ESTs 

H8.5921 Homo sapiens cONA: RJ21592 fts. done C 

Hs.63668 toU-like receptor 2 

Hs.152213 wIngtess-typeMMTV integration sits fami 

Homo sapiens PRO1550 mRNA. paifial cds 
Ks.1 18348 ESTs 

Hs*94262 p53-lndudblertbonucieo6de reductase 8 

Hs^2941 KIAA1363protdn 

Hs.22404 protease, serine. 12 (neurotrypsin. moto 

Homo sapiens cDNA RJ10263 fis, done HE 
Hs.70337 hnmunogiobtdin superfamily, member 4 
H8.35052 ESTs 
Hs.180828 ooQaoen.typelV.8lpha4 
Hs.1 62282 potasshjm channel TASIU; potassium chan 
H8.306574 Homo sapiens PRO3096 mRNA. complete cds 
Hs.901 C048an6oen(BH»n membrane protein) 
Hs.105053 ESTs 
Hs.97403 K1AA0944 protein 
Hs.73931 m^ histocompalibiSty comptex. dass 
Hs.ig0325 ESTs 

Hs.206832 ESTs. Moderately similar to ALU8J1UMAN A 
Hs.16063 hypothetical pfOteinFU21B77 

NM.00061 1*:Homo sapiens C069 anfigen p18 



3.32 

131 

3.31 

3.30 

3.30 

3.30 

3u30 

3.29 

a28 

3.28 

128 

127 

126 

126 

122 

122 

120 

120 

118 

116 

118 

118 

117 

117 

117 

117 

116 

116 

116 

115 

114 

113 

112 

112 

112 

110 

110 

110 

110 

110 

110 

108 

108 

108 

108 

107 

107 

106 

106 

106 

3.06 

105 

105 

105 

104 

104 

102 

102 

102 

3.02 

3.01 

100 

100 

100 

100 

100 

100 

100 

199 

2.99 

2.99 

2.98 

2.98 

2.98 

Z96 

2.96 

Z98 

Z95 

2.95 

194 

194 

2.94 

2.94 



175 

122 

155 

195 

137 

148 

104 

162 

135 

105 

145 

163 

167. 

104 

136 

187 

179 

130 

142 

112 

3.66 

199 

102 

142 

127 

180 

195 

lis 

1.95 

163 

1.74 

135 

109 

106 

158 

126 

101 

132 

1.80 

111 

120 

181 

\Xl 

W 

1.94 

112 

116 

175 

141 

164 

2.78 

1.87 

1.99 

149 

117 

1,78 

143 

1.77 

1.95 

124 

114 

113 

160 

1.94 

1.81 

1.89 

108 

118 

148 

176 

116 

121 

108 

140 

103 

1.74 

140 

246 

4.13 

1.72 

193 

1.94 

1.74 
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430382 AM77908 Hs.282287 ESTs. Moderately simSar to 138022 hypot 194 2.12 

419034 NM_002110 Hs.89555 hen)Op(^ ceQ kinase 2.93 2.25 

439335 AA742697 Hs.62492 N}^OS2883:Konios8f]iensse(retog!obin,fa 2.93 3.72 

_ 429597 NM-0038t6 Ks.2442 adlsintegrinandmelalloprotelnasedoffla 192 1.91 

5 419381 AA897581 Hs.128773 ESTs 2.92 2.18 

400419 AF08454S Taigel 192 1.83 

435176 AA744875 Hs.189413 ESTs 191 115 

413283 R78669 Hs.23756 hypoDtedcal protein stmSar to swine ac 190 125 

444339 T96555 Hs.31562 ESTs 190 3.16 

10 429272 W25140 Hs.110667 ESTs 190 143 

435047 AM54985 Hs.54973 cadherin-Gke protein VR20 190 129 

435080 AI831760 Hs.155111 liypothetica) protein FU 14428 ISO 140 

402474 NM.004079:Homo sapiens cattiepstnS(CTSS 2.86 147 . 

421554 AW137676 Hs.97775 ESTs 188 3.37 

IS 422770 AL117544 H$.120O21 DKFZP434I092 protein 188 100 

434658 AI624438 Hs^10286 ESTs 168 106 

440248 AA876138 ESTs 188 124 

442008 AW975183 ESTs. WeddyMlar to 872482 hypolhell 186 4.32 

430515 M745503 Hs,2e3313 ESTs 186 196 

20 446063 AI720140 Hs.151079 ESTs 186 2.47 

438177 8E327015 ESTs 186 1.70 

429083 Y09397 Hs.227817 BCI^-fetated protein A1 185 105 

417105 X60992 Hs.81226 C06 antigen 185 3.00 

433230 AW136134 Hs.220277 ESTs 184 1.97 

25 436676 AA613745 Hs.123446 ESTs 164 162 

435575 AF2134S7 Hs.44234 tfiggedng receptor expressed on (nyeloid 182 4.33 

420252 AW270404 Hs.193161 ESTs 162 3.22 

415788 AW628686 Hs,78851 K1AA0217 protein 182 1.78 

428065 A1634046 I4s.157313 ESTs 2.61 147 

30 434340 A)193043 Hs.128685 ESTs. We^y similar to T17226liypotheU 181 167 

451558 Nhl.001089 Hs.26630 ATP-tMng cassette. 8ul)-Mly A (ABC1 178 3.39 

435517 AA928626 Hs.130177 ESTs 178 136 

439883 AL359652 Hs.171096 Homo sapiens EST ftom clone DKFZp434A041 178 1.82 

434158 T86534 Hs.14372 ESTs 178 1.96 

35 428923 BE047698 Hs.188785 ESTs 178 107 

413786 AW613780 Hs.13500 ESTs 176 1.97 

406387 Target Exon 177 4.22 

421168 AF182277 Hs.330780 cytochrome P4S0. 8ut)(3mi1y IIB (p!ienot)8r 176 3.24 

444561 NM-0D4469 Hs.11392 o-fos Induced growth factor (vascular en 176 111 

40 427484 N32859 Hs.37288 ntidear receptor subMy 1. group D. m 176 1.94 

417728 AW138437 Hs.24790 KIAA1573 protein 176 1,78 

435154 AA668764 ^ ESTs 176 110 

429490 A1971131 Hs.238B9 ESTs. Vteakly simitar to AliH'JIlMAN ALUS 176 121 

423387 AJ012074 vasoadhn Intestinal peptide receptor 1 176 136 

45 432060 AW971364 Hs.324775 ESTs 175 102 

434164 AW207019 Hs.148135 serine/threonine kinase 33 174 148 

423708 U95218 Hs.131924 G proteiiKOupled leceptor 65 174 1.93 

442703 ALjD44949 Hs.116298 ESTs 174 1.89 

450247 AF123303 Hs.24713 hypothetical protein 174 1.73 

50 430998 AF128847 Hs.204038 IndotethylamlneN^lhyltransEBrase 174 185 

426535 AU077012 Hs.288582 ESTs, WeaMysimHartoutiiqullousTPRm 174 1.88 

409196 NIUL001874 Ks.334873 carixscypeptldase M 173 1.86 

422389 AF240635 H8.11S897 ptotDcadherin 12 172 126 

444324 AI301330 H8.143838 ESTs 172 1.74 

55 417631 H16423 Hs.82685 C047 antigen (Rh^aled anfigen, Integr 172 140 

428769 AW207175 Hs.l06771 ESTs 172 119 

404277 NM.019111*:Homo8apiensm^rhIstocomp8 172 3.12 

409653 AW451693 Hs220826 ESTs 172 162 

437211 AA382207 Hs.5509 ecotropic viral integratnn site 2B 172 125 

60 430299 W28673 Hs.106747 serine cartnxypeptidase 1 precursor prot 172 109 

444381 BE387335 Hs.283713 hypothelicai protein BC0 14245 171 126 

443547 AW271273 hypothetical protein aJ12666 171 ' 1.74 

408741 M73720 Hs.646 cartmxypepUdase A3 (mast cell) 170 139 

402674 TaigetExon 170 1.95 

65 438068 AI927209 Hs.30621O Homo sapiens cONA:RJ23133fis, done L 170 123 

415075 L27479 Hs.77689 Friedreich ataxia region gene X123 169 111 

444314 Ai140497 gb:ow76t)09.s1 Scares Jetaijverjspleen. 169 128 

428656 AB0377B8 Hs.188790 K1AA1377 protein 168 1.91 

„ 418883 BE387036 Hs.1211 add phosphatases, tartrate resistant 2.68 3.95 

70 443951 F13272 ferrilin, light pdypepfide 168 166 

427581 NM-014788 H8.179703 KiAA0129geneprodud 168 1.74 

432639 AW973785 gb:EST3B5886 MAGE resequences. MAGM Homo 168 1.78 

446423 AW139855 H3.15012O ESTs 168 129 

407939 W05608 Hs.312679 ESTs, WeaMy Similar to A49019dyneln he 167 107 

75 431779 AW971178 Hs.266571 apoOpoprotsln C-l 157 100 

458124 AWD05548 Hs.124590 ESTs 167 3.78 

432882 NM-013257 Hs.279696 senim/ghjcocorticoid regulated kbase^l 166 1.64 

445745 ABX7924 H5.13245 KIAA0455 gene product 166 1.64 

425188 AK002052 Hs.15S071 hypothetical protein FU11 190 , 165 1.92 

80 432231 AA339977 Hs.274127 OST 11240 protein 164 4.23 

442200 AW590572 Hs.235768 ESTs 164 146 

426828 N1^00QO2O Hs.172670 acfivbi A receptor type IMike 1 164 100 

448569 8E382657 H3.21486 signd transducer and activBtor of trans 163 323 
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42595S 796509 Hs^48549 ESTs^lModeratety similar to S65657 alpha Z63 2.48 

411213 AA676939 K3.69285 neuropinnl Z62 1.73 

439737 Ar751438 HS.41Z71 Honu) sapiens mRNAfuO length insert cON 2.62 Z69 

446570 AV659177 Hs.127160 ESTs 161 Z44 

5 411020 NM.008770 Hs.67726 macrophage receptor wjlhcoUagenousstr 160 3.39 

434792 AA649253 Hs.132458 ESTs Z60 1.74 

42S782 R14614 Hs.33845 ESTs Z60 Z36 

425371 D49441 Hs.1 55981 mesoIheBn 2.60 6.97 

447720 AIJ038765 Hs.161304 ESTs 159 108 

10 444623 AI183829 Hs.202111 ESTs 159 177 

433376 AI249361 . Hs.74122 caspase 4. apoptosb^ated Cysteine pr 158 101 

444542 A1161293 Hs.280380 amInopepBdase 158 131 

439549 AW937885 Hs.137314 ESTs 158 137- 

431385 BE178536 Hs.11090 membfane^panning44omains. subfamily A 158 156 

15 417015 M83772 Hs.80876 flavin conlainiRg monooxygenasa 3 156 147 

433308 AA582718 Hs.291650 ESTs 156 101 

443885 H91806 Hs.15284 ESTs 155 1.71 

408170 AW204516 H5.31835 ESTs 155 1.59 

^ 456844 Ai264155 Ms.152S81 COP-diacyiglycerol synthase (phosphatlda 154 1.63 

20 412104 AW20S197 Hs.240951 Homo sapiens. Simitar to IVKENcONA 2210 154 198 

428791 AA43S661 Hs.2647SQ ESTs 153 129 

435472 AW972330 Hs.283022 triggefing receptor expressed on myeloid 153 3.91 

447357 A1375922 HS.1S9367 ESTs 2.52 183 

431393 AW971493 Hs.134269 ESTs. Hlghty simnar to cytokine recepto 152 1.90 

25 424105 AI142336 H5.43g77 Human DNA sequence Irom done RP11-196N1 152 3.45 

408308 AU)33377 Hs.44197 tiypottieScal protein OKFZp564D0462 1S2 1.93 

438898 AW297855 ESTs, Mar to 138022 hypothefl 152 1.98 

420991 AW504814 Hs.287379 Homo sapiens mRNA br FU0O11 1 proldn. 152 141 

424049 AB014524 Hs.138360 KIAA0624 prolaJn 151 119 

30 438543 AA810141 Hs.192182 ESTs 151 106 

414061 NM.000699 Hs335493 amylase, alpha 2A;pancredic 151 114 

424806 AA382523 Ks.105689 MSTP031 protdn 151 111 

438580 AA811262 Hs:299202 ESTs ISO 1.83 

434445 A1349306 Hs.11782 ESTs 150 3.13 

35 444001 A1095087 Hs.1522g9 ESTs. li4oderatelyslmilartoS65657 alpha ISO 1.76 

413638 H71252 gt>.7s12h1l8l Soares fetal Over spleen ISO 100 

421281 AI299139 H5.17517 ESTs 150 140 

441384 AA447849 Ks.286860 retlnoic add Induced 3 150 175 

436772 AW9756e8 metallolhionein IE (funcOond) 149 1.80 

40 433102 AI343966 Hs.158528 ESTs 149 125 

430129 BE301708 Hs.233955 hypothetica] protein FU20401 148 109 

445612 N94126 83.12969 hypotheOcal protein 148 128 

445261 T79759 Hs.250651 ESTs, Weakly simflar to 138022 hypolheti 148 1.87 

433854 AA610649 H5.333239 ESTs 148 109 

45 447997 H00656 Hs.29792 ESTs, Weakly similar to 138022 hypotheH 148 175 

411069 AL133092 Hs.68055 hypothetical protein OKFZp43410428 148 101 

440594 AW445167 Hs.126038 ESTs 2.48 1.57 

450295 AI766732 Hs.210628 ESTs 148 1.99 

431316 AAS02663 Hs.145037 ESTs 148 1.80 

50 438564 AA381553 H3.19d253 .m^htstocompalitNBty complex, dass 148 180 

439593 BE073697 Hs.124863 ESTs 148 1.89 

422355 AW403724 Hs.300697 coagutation factor VII (serum piothrombi 147 3.74 

453134 AA032211 Hs.116493 ESTs 146 172 

417169 R13550 Hs.21388 ESTs 146 188 

55 434411 AA63264g Hs.201372 ESTs 2.46 1.95 

440381 AA917808 H5.190495 ESTs 146 109 

448782 AL050295 K1AA0758 protein 146 169 

404240 NM.0189S0:Homo sapiens mslorhlstocompat 145 183 

450843 AI741483 Hs.20S383 ESTs 144 IK 

60 434137 AA907734 Hs.124895 ESTs 144 155 

438315 R567g5 Hs.82419 ESTs 144 1.94 

420802 U22376 Hs.1334 v-myb avian myelotjiastosisvira) oncogen 144 1.61 

439402 WQ2753 Hs.103002 ESTs 144 1.90 

445903 AI347467 Hs.132781 dass I cytoidne receptor 144 132 

65 437323 AA371145 Hs.194397 teptin receptor 144 1.70 

433923 AI823453 Hs.14652S ESTs 144 1.58 

442201 AW516704 Hs.208726 ESTs 143 1.68 

437982 N93466 Hs.121764 ESTs^Woddysbiillartotesficdartekfi 143 3.22 

452698 NM.001295 Hs.301921 chemoMne {OCmoliO receptor 1 143 121 

70 407904 W44735 Hs.9286 Homo sapiens cDNA:FU21278fis. done C 143 113 

406973 M3499& Hs.198253 ma|orhlstocompaiibinty complex, dass 143 168 

428055 AA420564 Hs.101760 ESTs 142 105 

428970 BE276891 Hs.194691 reOnotc add Induced 3 (RAI61);metait)0 142 179 

433138 AB029496 Hs.59729 semaphorin sem2 142 1.68 

75 415757 AA830854 H5.187810 ESTs 142 102 

438507 AA809052 ESTs 142 108 

450811 Ai739486 Hs.245497 ESTs 142 1.97 

424027 AW337575 Hs.201591 ESTs 142 176 

423778 Y09267 Ks.132821 fiavtn containing monooxygenasa 2 141 ai5 

80 435978 AF27269g Hs.135118 Homo sapiens PR<lomaIn zinc Snger prate 141 108 

426291 U5e913 Hs.1 691 91 small indudble cytokine subfandy A (Cy 140 1.76 

416370 N90470 Ks.203697 C038 ant^en b>45} 140 1.97 

415688 AA166863 Ob2088dOU1 Stratagene ovarian cancer 140 1.63 
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445633 

431300 

407690 

418113 

425235 

4S1406 

437479 

445784 

418300 

413753 

416945 

416140 

418262 

420943 

442762 

429747 

420460 

420137 

439018 

427250 

452194 

411027 

407242 

418875 

425023 

432608 

406048 

415189 

437442 

410577 

422099 

427337 

427541 

420899 

431848 

446354 

423354 

423961 

410798 

4S7250 

446291 

426839 

422746 

439920 

414942 

419092 

424878 

406687 

411605 

416965 

428713 

435106 

420380 

407137 

410361 

447160 

421114 

453686 

4521 14 

417355 

434927 

442262 

425216 

425354 

409190 

414221 

435272 

414991 

424623 



422426 
413829 
427535 
447197 
446142 
410503 
435523 
437629 
429688 
430413 
447033 
429496 
425516 



AI453386 

AA502346 

R47799 

A1272141 

AA3S3113 

AI694320 

R61866 

At253155 

AI433074 

U17760 

BE248762 

Ai918035 

Z38968 

AI718702 

AF035119 

MB7S07 

AA262331 

AA306478 

AW300887 

R35941 

A1694413 

AF072099 

M18728 

W19971 

AW956889 

AI492660 

NVL007203 

L34657 

T85104 

X91911 

AA156022 

Z46223 

A1796983 

NM.001629 

A137B857 

AW449650 

AB011130 

D13666 

6E178622 

AA811987 

BE397753 

M74782 

NM.0M484 

H05430 

C14898 

J05581 

H57111 

M31126 

AW006831 

N26223 

AA432067 

AA100847 

AA640891 

T97307 

BE391804 

AA330310 

AW975051 

AL1 10328 

N226e7 

013168 

H46612 

BE170851 

N77530 

U82027 

AU076536 

AW450979 

AA906415 

C17898 

AW983062 

AW368S76 

W79117 

NH.001872 

R29543 

R36075 

AI754693 

AW975746 

T62849 

AW574774 

BE245169 

AWB42182 

A1357412 

AA453800 

BE0O07O7 



H8.17287 ESTs, Weakly stmflar to S26689hypoQieO 

gb:n826b03.8l Na.CGAP_Co3 Homo sapiens 
H5.266957 hypdheSca) protein FU14281 
Hs.83484 SRY (sex detenrtning region YHjox 4 
H$.112497 Homo sa|>ienscONA:FU22743lis. done H 
H5.6295 ESTs.Wb^sima8rtoT17248hypol}eO 
H8.101277 ESTs 
Hs.146065 ESTs 

Homo sapiens cDNA: FU21578 lis, ckine C 
lamlnln. beta 3 (n!celn (I^SkO), ka&nin 
aracMdonate S-Gpoxygenase 
Hs.301 198 roundabout (axon guidance leceplor, Dros 
ESTs 

Hs.279930 n^titstocompalibaHycompleXi class 
deleted In liver cancer 1 
caspase 1, ^x^itosb^etated cyslAie pr 
Homo saf^ens dona HB-2 mRNA sequence 
C030 antigen, delta pdypepOde (TTTd go 
m8mt>rBn8-spanhlng 4-domabis, suhfsn&f A 
ESTs 

dfactoiy receptor, family 2. subfamHy 
leukocyte intmunogiobdin-IBce receptor. 
gb:Human nonspecifio crossreading antig 
H5.233459 ESTs 

Ks.154210 EDG-1 (endothelial differentiation, sph 
Hs.170935 ESTs 

A kinase (PRKA) anchor protein 2 
plateiet/endoltielial celt adtteston mdac 
H5.222779 ESTs, Moder^ety sfanOar to simBar to N 
Hs.64639 gltoma pathogenesls-related protein 
Hs.1 11518 liypothelical protein 
Hs,176663 Fc fragment of lgG.kM affinity lllb.r 
H5.82921 solute carrier family 35 (CMP-siallc ad 
Hs.10O194 aracbidonate 54lpoxygenase^fivatlng p 
Hs.271605 ESTs, Highly similar to AF175283 1 zinc 
ESTs 

Hs.1 27436 catehim channel vdtage-dependent, alph 
Hs.136348 peiiosfa'n(OSI^2Ds) 
Hs.16291 gb:PM34IT0605.27020&OO1-a02 HT0605 Homo 
Hs.1 25779 ESTs 

H3.14623 interferon, gamma-inducibie protein 30 

Hs.1 72689 interleddn 3 receptor, alpha (low afRn 

Hs.118651 glypkan3 

Hs.288433 neurotrimbi 

Hs.192986 ESTs 

Hs.89603 mucin 1 , transmembrane 

Hs^1132 ESTs 

matrix metalloprotebase 11 (stromelysln 

ESTs 

H3.160436 ESTs 

ESTs, Moderately similar to CYA4 RAT ADE 
ESTs. Highly similar to AF174600 1 F-box 
Hs.102406 ESTs 

gb7e53h05^1 Soares fetal Over spleen 
guan^ate binding protein 1, interferon- 
ESTs 

Hs.293156 ESTs, Weakly similar to 178885 sedneAh 
Hs.304679 ESTs, Moderately amilar to Z19S_HUMANZ 
ESTs 

endolhelln receptor type B 
Homo sapiens KSPC285 mfVIA, partid ods 
deleted In Ihrer cancer 1 
Homo sapiens cDNA FU1 1 654 fis, done HE 
Hs.1 55935 complement component 3a receptor 1 
HS.S0984 sarcoma amplified sequence 

gb:UI-H«l3-alM-12-04JIJ1 Na.OGAP.Su 
ESTs 

gb:C17698 Human placenta cDNA (TF#v8ra 
H3.270737 ESTs 

caweoftn2 
ESTs 

carfaoxypeplMase B2 (plasma) 
pro-platdet basic protdn Cmdudes pla 
gb:yh88b0U1 Scares placenta Nb2KP Homo 
Hs.145968 ESTs 
HS.18&662 KIAA1702 protein 
Hs.1 1090 mentbr8ne^annln94-dom8lns. subfamily A 
Hs.121692 ESTs 

Hs.21 1610 CU6 tilplst repeal, RNA-binding protdn 
Hs^41392 smdlbidudble cytokine AS (RANTES) 
Hs.157601 PiBdktad gene: Eos ctoned; secreted fiN 
Hs.192793 ESTs 
Hs^7 ESTs 



Ks.86682 
Hs.75517 
H8.69499 



Hs.8700 

Hs^49a 

Hs.48376 

Hs.95327 

Hs.26638 

Hs.25418 

Hs.67846 



Hs.42322 
Hs.78146 



Hs.5978 



Hs.62661 
Hs.24181 



Hs.8236 

Hs.82002 

Hs.293815 

Hs.8700 

H3.13895 



Hs.1 10041 



Hs.139851 
KS58559 
HS.7S572 
Hs^164 



239 

Z39 

Z39 

139 

2.38 

2.38 

138 

Z38 

Z38 

Z37 

137 

2.37 

2.37 

2.37 

237 

137 

136 

136 

136 

136 

136 

138 

135 

135 

135 

2.35 

135 

135 

135 

134 

134 

134 

133 

132 

132 

132 

132 

131 

131 

131 

131 

130 

130 

130 

130 

129 

129 

129 

129 

129 

129 

128 

128 

128 

128 

128 

127 

127 

127 

126 

126 

126 

126 

126 

126 

126 

125 

124 

124 

124 

122 

122 

122 

122 

122 

122 

122 

122 

121 

120 

120 

120 

120 



1.99 

1.79 

1.84 

121 

109 

1.78 

3.00 

1.61 

125 

1.55 

141 

1.61 

105. 

100 

105 

1.67 

1.88 

161 

164 

115 

3.41 

3.05 

134 

1.95 

1.85 

106 

1.91 

134 

113 

1.73 

1.80 

2.24 

162 

152 

150 

121 

4.34 

119 

134 
1.66 

196 

112 

116 

4.06 

102 

a08 

1.84 

176 

1.58 

4.71 

1.73 

1.90 

182 

1.52 

109 

1.71 

1.98 

1.91 

1.88 

1.63 

1.84 

1.86 

1.72 

1.70 

1.56 

112 

115 

3.58 

1.87 

115 

3.33 

139 

328 

1.59 

1.88 

1.56 

149 

1.70 

1.64 

173 

168 

197 

1.58 
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422404 AL133571 Hs^189 Homo sapiens mRNA;cONADKFZp434F1 135 (f 2.19 1.92 

423526 AB011086 H5.129739 KIAA0514 gene product 2.19 2.85 

436465 X59135 Hs.156110 bnmunoglobulin kappa constant 219 201 

_ 426251 M24283 Hs.168383 IntsrceBulv adhesion molecule 1 (C054) 219 2^ 

5 443441 AW291196 Hs.92195 ESTs 218 1.73 

418458 AA332941 Hs.85226 lipase A. lysosomal acM.chotesterofes 218 2.53 

408705 AA312135 H3.48967 HSPC034 protein 218 1.54 

419150 T29818 H3.89640 TEK tyrosine kinase, endothelial (venous 218 1.93 

430915 AA468953 gb:aa55e05^1 Na.CGAP.GCBI Homo sapiens 218 1.57 

10 418791 AA935633 Hs.194628 ESTs 217 205 

432620 AA777749 Hs.5978 UMdom^only7 217 1.75 

424321 W74048 H5.t765 lymphocyte-specifx: protein tyrosine kin 217 201 

406646 M33600 H5.308026 major tustoconipatitdlty complex, dass 217 3.12. 

424450 A1.137526 dyneinlnterme(Satechaln2 217 4.14 

IS 426410 BE298446 Ks.305eg0 BCL2-tike1 216 219 

416975 NhiL004131 H5.1051 QranzymeB(8ranzynie2,cytoloxieT-lyinp 216 1.70 

421077 AKD00061 HS.1015S0 hypothetical protein 216 1.60 

424563 AA446932 Hs.151428 ret flnger protein 2 216 1.63 

405102 C15001220*:gI|4469558|gblAAO21311.1)tAF 216 1.78 

20 452436 BE077546 Hs.31447 ESTs. Moderately similar to A46010X-lin 215 1i7 

416206 AW206248 Hs.111092 hypolhetfeal protein FU22332 215 1.65 

418067 AI127958 Hs.83393 cystafinEM 215 240 

436372 AW972301 Hs.310286 ESTs 215 235 

418728 AW970937 Hs.293843 ESTs 214 258 

25 450400 AI694722 Hs.279744 ESTs 214 206 

409031 AA376836 ESTs 214 214 

435143 R12375 Hs.194600 ESTs 214 1.69 

444805 A600789g Hs.12017 homotog of yeast ut)k)uItlrH)rotdnligas 214 2.03 

453927 AA082465 Hs.125031 choiine/dhanoiamlnephosphotransferase 2.14 1.57 

30 416304 AA215702 gb3r97g10.n HQ.CGAP.GCBI Homo sapiens 214 1.68 

418299 AA279530 H5.83968 integiin. t}eta 2 (antigen CDId (p95), ty 214 204 

408996 AI979168 Ks.3440g6 glycoprotein (transmembrane) nmb 213 1.72 

417018 M16038 Hs.80887 v-yes-IYamagucht sarcoma viral related 213 1.66 

418741 H83265 Hs.8881 ESTs, WeaWy similar to S41044chromosom 213 247 

35 452353 C18825 Hs.29191 epithelial membrane protein 2 212 231 

418918 X07871 Hs.89476 CD2 antigen (p50). sheep red blood cell 212 276 

424006 AF054815 Hs.137548 C084 anSgen (leukocyte antigen) 212 211 

437581 N59284 H3.288010 ESTs 212 285 

410976 R36207 Ms.25092 hypolhetkal protein MGa0744 212 2.04 

40 429716 R25685 Ks.211933 collagen, type Xlil, alpha 1 212 200 

423069 W15513 Hs.1613 adenosine A2a receptor 212 1.72 

432860 AW974077 Hs.283349 ESTs 212 1.75 

449509 AA001615 Hs.64561 ESTs 212 1.84 

456062 AI866285 Hs.71962 ESTs. We^y similar to B36298 proline-r 211 4.42 

45 419216 AU076718 Hs.164021 small inducible cytokine subfamily B (Cy 211 1.65 

459680 K96982 Hs.42321 ESTs 211 220 

449677 AA002071 gb3h85d01.s1 Soares_fBtalJver_spieei\^ 210 212 

443071 A1J080O21 Hs.8986 comptement component 1,q subcomponent 210 248 

443021 AA368546 Hs.8904 Ig superfamlly protoin 210 242 

50 437838 A1307229 ESTs 210 1.67 

429421 AL031658 Human DNA sequence Inxn done RPI^OOIS 210 1.91 

407202 N58172 Hs.109370 ESTs 210 1.66 

443669 AI140462 Hs.1345B7 ESTs 210 1.64 

411990 AW963624 Hs.31707 ESTs. Vfe*ly similar to YEW4jrEASTHYP0T 210 1.71 

55 408410 AA447438 Hs.44697 ATPase, Qass V, type IOC 210 205 

435293 At601188 Hs.120910 ESTs 210 201 

410730 AVtf368860 DnaJ (Hsp40} homolog. subfamily B. membe 210 1.66 

427876 AI494291 ESTs 210 248 

^. 456672 AK002016 Hs.1 14727 Homo sapiens, done MGC:16327. mfVJA.com 209 3.11 

60 434987 AW97S114 ESTs 209 1.69 

433735 AA608955 H5.109653 ESTs 209 1.78 

433226 AW503733 Hs.9414 KIAA1488 protein 209 1.62 

425787 AA363867 Hs.155029 ESTs 209 1.85 

452304 AA025386 Hs.61311 ESTs. WeaMysImBar to 810590 cysteine 208 3.41 

65 442369 AI565071 ESTs 208 1.60 

430478 NIA_014349 Hs.241535 aponpoprotein L. 3 208 239 

434421 AI915927 Hs.3477t ESTs 208 1.66 

415138 C183S6 HS.29S944 Qssuef^ pathway Inhibitor 2 208 1.72 

431728 NkiL007351 Hs.268107 mulSmeifn 208 1.51 

70 444929 A1685841 Hs.161354 ESTs 208 3.14 

408873 AL046017 catmodulin 2 (phosptiorytase kinase, delt 208 209 

437634 AW293045 Hs.255158 ESTs 208 1.66 

400277 EosGontjd 208 1.46 

443601 AI078554 Hs.42658 ESTs 208 1J7 

75 432212 AW137742 ESTs 208 284 

410763 AF279145 Hs.8966 hypothetical prot^ FU21776 207 1.46 

406122 Target Exon 206 275 

430665 BE350122 H5.157367 ESTs. Weakly simter to 178885 serineAh 206 1.66 

408788 AL134947 Hs.213956 Homo sapiens BAC done RP1 1-10205 from Y 206 1.70 

80 421057 T5B283 Homo saptens cONA: FU22063 lis. dons H 206 1.78 

413936 • AF113876 Hs.297681 serine (or cysteine) proteinase inhlbito 206 230 

431924 AKD00850 Hs.272203 Homo sapiens cDNAFU20843Qs. done AO 206 231 

449444 AW818436 sdute canter family 16 (monocarfaaxytfc 206 1.41 
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43361B AA602539 H5.345494 ESTs t.92 1.B4 

424517 AI539443 H5.137447 Homo sapfera cONA FU12169 (is, done MA 1.92 127 

418036 Z37976 Hs.83337 lateidbansfomiinggrawUi factor bete b 1.92 1.76 

. 4S0747 AI064821 H&318535 ESTs. Highly ^mOar to 1 81 6357AEWS gen 1.92 1.72 

5 409745 AA077391 gb:7B14E12Chioino80nie7FetalBr^cDNA 1.91 1.83 

426780 BE242284 Hs.172199 adenylalacydase? 1.91 1.67 

452366 R12499 HS.204S8 ESTs 1.91 2.64 

438570 A1275803 Hs.12342d ESTs 1.91 3.12 

^. 414359 M62194 Hs.75929 cadherin 11. typ8 2, 0&<»lher{n{osteob 1.91 1.62 

10 446566 H95741 Hs.17914 memb[and-6panni(«44omaiiis. subfamily A 1.90 Z06 

424528 AW073971 Hs.238954 ESTs. Weakly simiiar to KIAA1 204 protein 1.90 1.85 

444745 AF117754 H5.II86I thyroid ttomioneieoeptor-assoclatedprot 1.90 1.65 

428166 AA423849 Hs.79530 M5-14 protein 1.90 1.70. 

426721 AA3835e8 Hs.13iei6 ESTs. Weakly sintilar to T2901 2 tiypolhetl 1.69 3.22 

IS 449271 AW338067 H3.7669 Homo sapiens cDI^RJ 11 946 lis. cfane HE 1.88 2.07 

438576 AI458213 H5.77542 ESTs 1.88 2.25 

437751 AA767373 ESTs. Moderstety similar to ALU1.HUMAN A 1.88 2.41 

449618 AI076459 Hs.15978 KIAA1 272 protein 1.68 1.63 

430634 A1860651 Hs.26685 calcyphosine 1.88 3.01 

20 440663 AW452976 Hs.247112 tiypothetkst protein FU10902 1.88 1.65 

440099 AU)800S8 Hs.6909 DKFZPS64G202 protein 1.88 1.78 

414662 AL036058 Hs.76807 m^tiistooompamy complex, dass 1.88 Z37 

444051 N48373 Hs.10247 adivated leucocyte cell adhesion motecu 1.87 Z(R 

414464 AIB70175 Hs.13957 ESTs 1.87 2.68 

25 427792 M63928 Hs.180841 tumor necrosis fador receptor supeifami 1.87 Z2S 

415801 R24219 Hs.278443 FcfrSQroentorigG^ tow affinity II1», re 1.87 2.05 

430027 AB023ig7 H5.227743 KtAA0980 protein 1.87 1.70 

425771 BE561776 Hs.159494 Bruton aganvnaglobulinefnla tyrosine kinas 1.87 Z18 

412443 AW951103 Hs.130767 Homo sapiens cONA: FU23553fis. done L 1.86 ZV 

30 408771 AW732573 Hs.47584 potassium voIl3gfr«atedchannei. delayed 1.88 Z31 

420361 N92054 Hs.194718 zinc finger protdn 265 1.86 1.63 

413869 NM.000878 Hs.75S96 tnterieuMn 2 receptor, beta 1.66 Z13 

422241 Y00062 Hs.170121 protein tyrosine phosphatase, receptor t 1.85 1.77 

442434 AA995787 Hs,129583 ESTs 1.85 2.15 

35 422735 AA169665 Hs.119529 Niemann^tdt disease, type C2 gene 1.85 Z77 

444083 AI123195 gbwolTalOxl Soares_NSF.F8_9W.OT.P/LP_S 1.84 1.73 

449679 At823951 Hs.129700 tolloidtol 1.84 1.57 

418183 NH4_001772 Hs.B3731 C033 antigen {gp67) 1.84 Z02 

414776 AA155598 Hs.212839 hypothetical protein FU14195; KIAA1714 1.84 1.72 

40 414803 X03100 Hs.gi4 Human mRNA for SB ciassllhiskmnpatibi 1.84 2.47 

408669 A1493S91 Hs.76146 plateietfendolheiial ceR adhesion moiec 1.84 Z29 

455508 AWg76165 gb:EST388274 MAGE resequences. MAGN Homo 1.84 1.69 

410290 AA402307 H5.322844 hypothetical protdn DKFZp564A176 1.83 Z12 

426457 AW894667 HsJ22660 diimerin (chlmaeiin) 1 1.83 1.59 

45 459247 N46243 Hs.110373 ESTs. Highly similar to T42626 secreted 1.83 1.57 

417086 AA194446 ESTs, WbakJy simitar to S55024nebulin, 1.83 1.45 

425175 AFQ20202 Ks.155001 UNC13(C.et8gansHIke 1.83 Z18 

429952 AF080158 Hs.226573 inhibitor of kappa UghtpotypepfUe gen 1.83 1.75 

438596 AA829427 Hs.243081 ESTs 183 Z83 

50 436486 AA742221 Hs.120633 ESTs 1.82 Z14 

433365 AH)26944 Hs.293797 ESTs 1.62 ZSO 

449943 AF104266 Hs.24212 latrophiiln 1.82 ZOB 

426437 BE076537 H8.16989S ublqulflnHXX^ugating enzyme E2L 6 1.62 IZf 

421563 NK4.006433 Hs.105806 granulysin 1.82 2.48 

55 449161 NS3431 Hs.47647 ESTs, Weakly similar to T00057hypoth8tl 1.81 Z81 

453107 NKL016113 Hs.279746 vanHloid receptor-iike protein 1 1.81 Z66 

418371 M13560 Hs.64298 C074 antigen (Invariant pdypepikle of m • 1.81 Z50 

432946 U60899 HS.2798S4 fn8nnoskiase,a(phabda8s2B, member 1 1.81 Z05 

^. 432297 AW663832 Hs.285625 Homo sapiens mRNA:cOKADKFZp434A1 19 (fr 1.80 ai3 

60 428677 AI657119 Hs.120036 boponln I, cardiac 1.80 Z94 

40948S S80990 Hs.252136 ticonn (coiiagen/librlnogen domainnnnt 1.80 Z28 

423081 AF262992 Hs.123159 spenn assodated antigen 4 1.80 1.56 

425458 H89317 H3.182889 ESTs 1.80 Z21 

425390 AI092634 Hs.156114 protein tyrosine phosphatase, non^pt 1.80 1.41 

65 409208 Y00093 integiin.dphaX (antigen COIICfplSO). 1.80 Z20 

430570 Ai4178&1 Hs.292464 ESTs 1.80 1.62 

439425 AF086244 Hs.1 14659 ESTs 1.80 Z37 

408688 A1634522 Hs,152925 KIAA1 268 protein 1.80 Z13 

440675 AW005054 Hs.279788 ESTs, Weakly similar to KCCUtUMANCALQ 1.80 1.80 

70 423690 AA329648 H3.23804 ESTs, Weakly similar to PN00998on3prot 1.79 1.57 

406621 X57809 Ks.8997 immunogtobufin lambda tocus 1.79 Z18 

431958 X63829 Hs.2677 cadhertn 3. type I.P-cadherin (placenta 1.79 1.68 

403421 NM.016369-:Homosapiensd8udIn18(a0N 1.79 Z47 

430423 AI190548 Hs.143479 ESTs. Weakly similar to hypothetical pro 1.79 Z92 

75 416384 AU076903 Hs.79283 selecUn P iigand 1.79 1.87 

440638 Ai376551 gb:ts64e10j(1 SoaresJ^Fl^T.GSC.SI Homoa 1.78 1.69 

422003 AA36173) Hs.296326 ESTs 1.78 Z05 

412268 NM_003005 Hs.73800 selectInP(gramlte membrane protein 140 1.77 1.82 

432987 AI864771 Ks.27964 C086 antigen (C028 antigen ligand Z B7- 1.77 Z03 

80 441602 AI&55043 Hs.133456 ESTs 1.77 Z01 

458194 AW383818 ESTs. Moderately similar to AU ^.HUMAN A 1.76 Z35 

432565 AA553477 Hs.152428 ESTs 1.76 Z63 

421071 AI311238 Hs.1(m76 ESnr8.W^simBartoGGHU1Ecoaagen 1.75 Z59 
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408989 AW361666 H&49S0O KIAA0746pfQle!n 1.75 166 

414807 At738616 Hs.77348 hydroxypnst^Mln dehydrogenase I^N 1.75 154 

403903 C5001632*:oqi0545308|gblAAG21430.1}ACOO 1J5 3.20 

- 421461 AW291023 ESTs. Weakly similaf to A46010)WInked 1.74 267 

5 430850 BE144152 gb:MR0>HTD165OG0200^)06^2HT0165Homo 1.74 2.52 

424377 AF081675 Hs.146322 k91er ceil lecQivae receptor subfam] 1.74 Z1S 

443884 N20617 Hs.194397 leptin receptor 1.74 1.51 

423057 AWg61597 Hs.130816 ESTs. Moderately slmOar to 13B022hypot 1.74 1.63 

448262 AW880830 Hs.186273 ESTs 1.73 2Sr 

10 431890 X17033 Hs271988 integiln. alpha 2 (C0498. dpha 2 subunl 1.73 1.87 

431630 NM_002204 HS.26SB29 Integiin, dpha 3 (antigen C049C. alpha 1.73 221 

412896 AW804157 Hs.308026 major histocompaWy complex, dass 1.72 237 

407366 AF026942 H5.17516 gb:HonK) sapiens dg33 mRNA, partial sequ 1.72 216. 

419407 AW410377 Hs.41502 hypothelical protein FU21276 1.72 1.52 

15 442117 AW664964 Hs.1 28899 ESTs; hypolhefical protsln for IMAGE447 1.71 1.55 

438606 NM.014859 Hs.6336 KIAA0672 gene pnjducl 1.71 1.57 

434795 BE620794 H3,4147 translocating chaln-essociaQng rrembrane 1.71 221 

426490 NM_001621 H3.170087 aryl hydrocarbon receptor 1.71 1.46 

418307 U70867 Hs.83974 sotute carrier famfly 21 (prostaglandin 1.71 249 

20 421221 AW276914 Hs.326714 Homo sapiens done tMAGE:71 31 77. mRNA se 1.71 1.57 

423857 N48902 Hs.133481 Homo sapiens mRNA; cDNA DKFZp664O0862 (f 1.71 1.56 

408393 AW015318 Hs.23165 ESTs 1.70 1.43 

432409 AA806538 Hs.130732 WAAI 575 protein 1.70 1.54 

440817 AI341423 Hs,288433 neurotrinAi 1.70 217 

25 421445 AA913059 Hs.104433 Homo sapiens, done I MAGE:4054868. mRNA 1.69 254 

453691 H12235 Hs.226505 ESTs 1.69 207 

422278 AF072873 H8.114218 liizzled (Drosophna) homolog 6 1.68 1.54 

424687 JO5O70 Hs.151738 m8trb(metaaDpTotelnase9(gelaGnaseB 1.68 229 

434951 AF161442 H1191591 Homo sapiens HSPC324 mRNA. partial cds 1.68 224 

30 444301 AK000138 Hs.10760 asporIn(LRRd8S$ 1) 1.68 1.44 

407775 Nl^004914 H5.38772 RAB36. member RAS oncogene fannily 1.68 203 

437119 Ai379921 Hs.177043 ESTs 1.68 4.21 

426836 N41720 Hs.172684 veslde^ssodaied membrane protein 8 (e 1.68 228 

453498 BE181412 H8.23245 hypothefical protein FIJ1 1767 1.68 276 

35 428289 M26301 Hs.2253 complement component 2 1.67 240 

404854 Target Exon 1.67 1.76 

450954 A1904740 Hs.25691 receptor (caldtonin) activity modi^ 1.67 232 

410048 W76467 H8.343874 proline oxidase homolog 1.67 3.03 

407857 AI928445 Hs.92254 synaptolagmin.like 2 1.66 1.51 

40 447827 U73727 Hs.19718 protein tyrosine phosphatase, receptor t 1.66 201 

417193 AI922189 Hs.288390 hypothetical protein FU22795 1.66 205 

421237 U25029 Hs.102761 Human glucocorticoid receptor atpha mRNA 1.66 220 

433350 BES63152 Hs.103S2 Homo sapiens cONA:FU20944 lis, done A 1.66 211 

417451 AW007280 Hs.115537 putative dipepfidase 1.65 211 

45 443791 N64458 Hs.143345 ESTs 1.65 211 

440475 A1807671 Ks.24040 potassium channel, subfamily K. member 3 1.65 204 

431743 AW972642 Hs.293055 ESTs 1.64 264 

400328 X87344 transporter 2, ATP-bindtng cassette, sub 1.64 243 

451876 T63141 gb7b39a1 2s 1 Stratagene lung (937210) H 1.64 202 

50 417321 N68722 Hs.191368 ESTs 1.64 253 

439237 AW4081S8 Hs.318893 ESTs. Weakly similar to A47562&€eHgr 1.64 201 

418707 U97502 Hs.87497 bulyrophflin, subfamily 3, member A2 1.63 217 

432176 AW090386 Hs.112278 arrestin. beta 1 1.63 204 

450708 AA376654 eukaTotic faanslalton initiaUon fador 1.62 205 

55 429570 BE242256 Hs.2441 K]AA0022genepiodud 1.62 1.39 

448406 AW772298 Hs.21103 Homo sapiens mRNA; cONADKFZp564B076((r 1.62 1.57 

439971 W32474 lte.301746 RAP2A.nnember of RAS oncogene family 1.62 1.44 

452424 A1964028 Hs.48353 ESTs 1.52 253 

423161 AL049227 Hs.124776 downstream of cadherin 6 (by 3.3kb) 1.62 1.38 

60 416316 H58721 Hs.271628 ESTs 1.62 1.39 

431806 AF186114 Hs.270737 tumor necrosis factor (llgandlsuperfami 1.62 267 

452203 X57522 transporter 1, ATP-blnding cassette, sub 1.62 245 

427509 M62505 Hs.2161 complement component 5 receptor 1 (C5a I 1.62 1.51 

^- 438089 W05391 nuclear receptor subfamily 1. group I. m 1.61 1.45 

65 409038 T97490 Hs.60002 smdlindudble cytokine subfamily A (Qy 1.61 1.52 

433417 AA587773 Hs.88S9 Homo sapiens. Sbnilar to RIKENcDNA 5830 1.61 2.40 

444009 AI380792 H8.135104 ESTs 1.60 215 

436057 AJ004832 Ks.5038 neuropathy target esterase 1.60 260 

_^ 437352 AL353957 Hs.2841B1 hypothetical protein DKFZp434P0531 1.60 257 

70 433614 VW7475 Hs.277101 cytodiromec oxidase silbunlt IV isoform 1.60 3.30 

410494 M36564 Hs.64016 protein S (alpha) 1.59 1.42 

411125 AA151647 Hs.68877 cytodwome b.245. alpha pdypeptide 1.59 202 

446615 R65954 Hs.334873 ESTs, Weakly simflar to ALU8_HUMAN ALU S 1.59 252 

419918 X80700 Hs.93728 pre-6<eO leukemia Iranscilplion factor 1.59 204 

75 428141 050402 Hs.182611 sotute carrier family 11 (proton^pled 1.59 1.98 

434308 N51517 H3.47282 ESTs 1.5B 229 

447341 AF106941 Hs.16142 9TesOn.bela2 1.58 209 

454315 AW373564 Hs.2S1928 6ANP homdog. SMAR1 homotog 1.58 210 

423281 AJ271684 Hs.126355 (>type(caldum dependent carbohydrate- 1.57 1.75 

80 433671 AW138797 Ks.132906 19A24protdn 1.57 205 

412889 AA290712 H3.82407 CXCchemokIneligand16 1.57 271 

436906 H95990 Hs.181244 ma^orMstocompatibllily complex, dass 1.57 224 

417771 AA804698 Hs.62547 reOnob add receptor responder(lazaro 1.57 1.43 
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423329 

424909 

431921 

4374ra 

426274 

415078 

417929 

401854 

406850 

433615 

431130 

453870 

414763 

428281 

412870 

407601 

432894 

457941 

442743 

419542 

433124 

422487 

429610 

417433 

429109 

409361 

427872 

449853 

431369 

415149 

447217 

427565 

445672 

432210 

458208 

452518 

419577 

439620 

423804 

424658 

428494 

431573 

409524 

406787 

419452 

406422 

421341 

421195 

425998 

426125 

451220 

409238 

411880 

432133 

428833 

455797 

415765 

427732 

449746 

407568 

422573 

427138 

457918 

423696 

416700 

407244 

451109 

408654 

407603 

445417 



A1982529 

AAS05445 

AF054910 

S78187 

N46466 

AB011542 

D38122 

AA311223 

R27219 

AI624300 



HsM298 C074 oiSoen Cvwariant pdypepSde of m 

Hs.300697 immunostobuQn heavy coRslanl gamma 3 (G 

Hs.127111 tek!in2(lBSficu!a) 

Hs.153752 cen division cyde25B 

Hs.58879 ESTs 

Hs^99 EGIM!l(»<loniabi.mulSpieS 

Hs.2007 tumor neoosb taOtx (figand) supeifami 

Hs.283091 found tninflamm^ zone 3 

Hs.74647 Human T-ce!l receptor acUve aipha^afai 

Target Exon 

Hs.172928 oollagen,type(.dpha1 

Hs.112757 ESTs 



NM_006103 

AW385001 

U97276 

AA194554 

N22788 

AC002300 

AW167668 

AI004525 

AI801351 

AA366037 

U51712 

AJOIOgOl 

AB024937 

BE270266 

ALJ008637 

Nm.00S982 

AA835058 

AF006823 

BE184455 

X12451 

BE4657S4 

031152 

A1907438 

AI567421 

AI380016 

AA28Q722 

L36531 

AAB38727 

AW403448 

NM_002406 

AA233439 

AW971070 

AW402151 

AW090702 

U33635 

AJ243212 

BE464560 

AU076629 

XB7241 

AF124251 

A1049990 

AW8724n 

AB033088 

A1928355 

BE091833 

NIUt005424 

NM.002980 



427507 
446967 
436553 
456837 
422129 
417785 
414849 
438988 
410598 
424247 



AA740964 

AW297985 

N77624 

AL3S9590 

Z92546 

AW498958 

M10014 

F11876 

M90688 

AW955705 

AK001058 

AB016305 

AF240487 

AI699629 

AW407157 

AW161450 

AU075635 

X59812 

AW372721 

AA740983 

A1817130 

X14008 



Hs.2719 HE4;epldidymis-5peciQc,«rtiey-acidicpr 
Hs.8042 Homo8ap!enscDNA:RJ23173fis,doneL 

Hs.77266 qulasdnOe 

H3.1 83434 ATPase, H transporling, tysosomd (vacuo 

H5.82407 CXC chemoWne figand 16 

Hs^7129 sodhim channel, nonvoltage^ated 1 , t)eta 

Hs,279772 biain specific protein 

Hs.14587 ESTs, Weakly similar to AF151859 1 CGI-1 

Hs.3021 10 ESTs. Weakly simitar to MUC2_HUMAN MUQN 

Hs.9091 1 solute canier family 1 6 {monocariwxylic 

Hs.13775 hypothe&al protein SMAP31 

Hs.198267 mucin 4, tracheolmmchlal 

Hs.21 1092 LUNX protein; PLUNC (palate lung and nas 

Hs.621 28 5T4 oncofetal trophct)last glycoprotein 

Hs.196352 neutrophil cytosoTic factor 4 (40kO) 

H854416 Sine ocuQs homeotXM (Drosophila) homob 

Human DNA sequence from clone RP1-261G23 
Hs.24040 potassium channel, subfamily K, member 3 
Hs^1754 seoBtory leukocyte protease inhibitor ( 
Hs.78056 calbepsinL 
Hs.17778 neurojpilin 2 

H8.179729 collagen, type X, alpha 1(Schfnidmetaph 
Hs^2862 ESTs 

Hs.273330 Homosaplens, chme li^GE:3544662. mRNA, 
ESTs. Weakly simiiar to T4S4.HUMAN TRANS 

Hs;S4758 ESTs. Weakly similar Id 138022 hypotheli 

Hs.91296 bitegrin.alpha8 

Hs.124405 ESTs. Weakly similar to A46010X-Hnked 

Hs.1 6725 bterferon-siimulated transcription fact 

Hs.1 51513 mannosyl (aIpha-1 ,3-)-glycoprot^ beta- 

Hs.184634 hypotheScal protein FU20005 

Hs^l 160 ESTs. Weakly similar to ALUI^HUIiiAN ALU S 

Hs.54673 tumor necrosis factor (ligand]superfarnl 

Hs.240615 tubulin alpha 1 

Hs.90572 PTK7 protein tyrosine kinase 7 
Target Exon 

deleted in maTignant brain tumors 1 
Hs.133017 ESTs 

HS.1659S0 fibroblast growth factor receptor 4 
H3.166994 FAT tumor suppressor (OrosophSa) homok) 
H5.26054 novelSH2-contaInlng(Hotein3 
Hs^1515 Homo sapiens mRNA; cONA DKFZ|)564G1 12 (fr 

gb:hm30f03j(1 Na.CGAPJhy4 Homo sapiens 
Hs.272567 KIAA1262 protein 

ESTs 

Qb:IL2-BT0731.26040a07&f04BT0731 Homo 
HS.7&824 tyrosine kinase vvith Immunogtobulln and 
HSi21^ secreOn fBceptor 

Hs.176588 EST8.Weaklys!m!tartDCP4Y_HUMAN(rrTOC 
Hs.62699 ESTs 

Hs.2g5726 Integrin, alpha V(vitronectin receptor 
Hs.173717 phosphaOdic acid phosphatase tjrpe 2B 
Hs.162604 hypotheScal protein DKFZp762M188 
Hs.1 31819 Sushi domain (SCR repeat) oontaMng 
Hs.343475 cathepsin 0 (lysosomal aspartyl protease 

fibrinogen, gamma polypeptide 
HS.S534 Homo sapiens cDNAFU12981fis, dons NT 
Hs.73885 HLMa hhtocompafibOI^ antlgea dass 
Hs.62604 Homo sapiens, done IMA6E:4299322. mRNA, 
Hs.1 2680 a distntegrin-fike and metaIk)prDtease w 
Hs.5378 spondin 1 . (kpondin) extraceHutar mat 
Hs.179152 tdMike receptor 7 
Hs.156781 ESTs 

Hs.8997 knmunoglobulln lambda locus 
Hs.109201 CG)^ protein 
Ks.1478 serine (or cysteine) proteinase Inhibito 
Hs,82568 cytochrome P450, subfamily XXViiA (stero 
Hs^1623 ESTs. Weakly similar to unnamed piotdn 
H5^10792 ESTs, Weakly similar to ALU8_HUMAN ALU S 
Hs.9195 Homo sapiens cONA FU13898 Qs, done PL 
Ks.234734 fysoqrme (renal anqrtoldosls) 



1^ 

1.56 

1.56 

1.55 

1.54 

1.54 

1.54 

1.S3 

1.53 

153 

1.S2 

1.S2 

\JS2 

1.51 

1.50 

1.50 

1.50 

1.50 

1.50 

1,49 
. 1.49 

1.49 

1.49 

1.49 

1.48 

1.48 

1.48 

1.47 

1.47 

1.47 

1.47 

1.46 

1.46 

1.46 

1.46 

1.46 

1.46 

1.45 

1.45 

1.45 

1.45 

1.44 

1.44 

1.44 

1.43 

1.42 

1.42 

1.41 

1.41 

1.41 

1.41 

1.41 

1.40 

1.40 

1.40 

1.40 

1.40 

1.39 

1.39 

1.38 

1.38 

1.38 

1.38 

1.37 

1.36 

1.36 

1.36 

1.36 

1.35 

1.35 

1.34 

1.34 

1.34 

1.34 

1.34 

1.34 

1.33 

1.33 

1.32 

1.32 

1.32 

1.32 

1,31 



Z37 
1.61 
2.51 
ZOO 
3.04 

1.44 

3.04 

261 

218 

208 

1.52 

ZSI 

1.36. 

1.43 

207 

1.46 

ZB3 

204 

2.25 

222 

209 

240 

1.39 

1.39 

1.76 

1.41 

1.44 

1.31 

250 

2.21 

1.50 

1.84 

1.40 

216 

201 

210 

1.60 

1.40 

1.40 
1.57 

210 

200 

1.45 

1.40 

201 

1.86 

1.95 

202 

1.47 

242 

205 

1.34 

210 

3.64 

3.24 

278 

202 

1.55 

209 

244 

1.85 

3.13 

1,38 

1.12 

201 

254 

204 

1.29 

1.34 

247 

1.66 

1.92 

1.86 

211 

3.75 

218 

1.78 

1.95 

205 

208 

206 

208 

1.29 
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429S(N) 


X78565 


Hs.289114 


hexsbfochion (tsnascin Cs cytotectin) 


1^ 


413474 


T86312 


Hs.334485 


Homo salens cONA FLJ1443fi Gs. doi» HE 


1.30 


406659 


AA663965 


Hs.277477 




1.30 


451049 


AA013353 






l!30 


436494 


AA720997 


Hs. 128295 


ESTs 


1.29 


438374 


AA321886 


H3.6193 


hypoChelica} protein FU14590 


128 


417801 


AA417383 


Hs.82582 


int&Qrin, bdd-Cte 1 (with EGF-£k8 rep 


1.28 


425883 


AL1 37708 


Hs.1 61031 


Homo sapiens mRNA; cDNA OKFZp434K0322 (f 


l!28 


428458 


AA428820 


H3.251399 


n8un}9ranin (proMn Idnaso C substfSto, 


1.27 


443180 


R15875 


Hs.258576 


claudin 12 


1.28 


421764 


At581S35 


Hs.148135 


sednsAhreonbie Unass 33 


1.26 


414217 


AI309298 


Hsu{79898 


Homo safriens cONA: FIJ23165 fis, done L 


1.26 


433283 


BE041135 


Hs.1 75622 


ESTs 


124 


426759 


A]590401 


Hs,21213 


ESTs 


1.23 


436446 


AVV016809 


Hs.1 19021 


ESTs 


1.23 


421487 


AA291590 


Hs.97252 


ESTs 


1.22 


431353 


AA828032 




ESTs 


1.22 


427403 


AA402107 


Hs 257146 




1.22 


453037 


AA045175 


Hs.1 7914 


ESTs 


1.22 


437606 






F^Ts Wnaklv Kbrribr In Al 1 11 HI IMAN Al 1 > 


1 99 


439941 


AI392B40 


Hs.1 8272 


airtno add bansportsr systsm Al 


1 99 
1.44 


451385 


AA017656 




ah*2A39h01 r1 Saarais mltna M9h4MR Hnmn 
^u.MiM0tii/ I.I 1 oiMios t QUI la M£mr>i\ nuiiiu 


1.21 


400496 






ENSP00000224716**6TP*b{nd!na orotetn SAR 


1.20 


409432 


049372 


Hs.54460 


sm^l tndiicihlfl Rvtf^lnn RiiMnnrilv A f Hv 
oil (Oil muuwmw vjwivijiy ovvIcuihit n \wy 




407239 


AA076350 


Hs!67B46 


leukocytB intfnunogiobiAivTtkB recsptor, 


1.19 


426486 


BE17828S 


Hs.170056 


Homo sapjens mRNA; cONA DKFZp58560220 (f 


1.18 


445033 


AV6524Q2 


Hs.72901 


cvc&n-dfiOfindpnl ktiUKS inhthltnr 7R (n1 


1.17 


439866 


AA280717 


H3.6727 


Ras*GTPaso acSva^g protein SH3 domdn 


l!l4 


440555 


D31292 


Hs.6853 


hypothe&cai protein FU22167 


1.14 


446006 


NML004403 


Hs.13530 


daafness, autosomat dominant 5 


l!l3 


432203 


AA305746 


Hs.49 


maci^haQO scavanQsr fao^tor 1 


l!l2 


432798 


AA565309 


H$.194015 


ESTs 


1.10 


411274 


NM_002776 


Hs.69423 


kallikreln 10 


1.10 


438656 


N40027 


Hs.7473 


ESTs 


1.09 


421552 


AF026692 


Hs.105700 


secfeted frizzled-related protein 4 


1.09 


448253 


H25699 


Hs.201591 


ESTs 


106 


409718 


D86640 


H5.56045 


SfQ homotogy three (SH3) and ^teine i1 


1.08 


409798 


AA248587 


Hs.30237 


ESTs, Weaidy sinuiar to ALUB_HUMAN lUI 


106 


449321 


AA001150 


Hs.1 32937 


ESTs 


106 


418693 


Ar750878 


Hs.87409 


thnxnIxTspondIn 1 


106 


402333 






Ta^et Exon 


103 


421814 


LI 2350 


Hs.108623 


thrombospcndin 2 


102 


425664 


AJ006276 


Hs!l59003 


trandenl receptor potenfial diannsl 6 


lOO 


458158 


AW296778 


H5.144734 


Human DMA sfiflusncA frnn ekuM RP3.41fiF91 


100 


405517 






net fchk^kenUlkfi 9 


100 


442526 


AW277221 




ESTs 


100 


446164 


AW273539 




hvoQlhfiticai omlBin FU23577 




449122 


A1631310 


\i& 196955 


ESTs 


1 nn 


438038 


AI732629 




ESTs V\tea)dvsimDar Id TA2R HUMAN BETA 


100 


429420 


AK001679 


Hs.202289 


hypolheticd pn^ebi DKFZp434P1735 


100 


453672 


U73531 


Hs 34526 




1 nn 

I.UU 


436187 








1 rtn 


438909 


AF085639 




obtHofflO saabns fuD tenolh Insert cDNA 


1.00 


423609 


AA328348 


Hs.216289 


ESTs 


100 


419261 


X07876 




niiiyiC4a*ij|ff3 mmi w mwuiaWHi Bw Mini 


1(X) 


436284 


AA708016 


Hs.1 90389 


ESTs 


IX 


440932 


AI801509 


Hs.1 82080 


ESTs 


100 


403420 






Tamal Emn 


100 


431169 


AW971240 




gb;EST363329 MAGE resequences, MAGL Honw 


100 


425916 


NM_006786 


H3.1 62200 


urotensin 2 


1.00 


419721 


NVLOOt650 




aquapor1n4 


100 


421761 


A1120297 


Hs.108043 


Friend leukemia virus integration 1 


100 


42S7B1 


AF001622 


H&.1G9523 


dass-l MHOcBstrictedTcallassociatdd 


100 


415094 


D59513 


Hs.330778 


ESTs 


100 


434088 


AF116677 


Hs.249270 


tiypotlietica) protein PR01966 


100 


420727 


H75701 


H5.99886 


complement oomponent 4-faindbig protdn, 


100 


430049 


AW277085 


Hs.99619 


ESTs 


100 


446868 


AV660737 




ESTs 


1.00 


418788 


Ar796317 


Hs.203594 


Homo sapiens urtcharaderized gastric pro 


100 


436391 


AJ227892 


Hs.146274 


ESTs 


100 


41X59 


BE1S1498 




gb:RC0-HT0295-29119»)31-E11 HT0295 Homo 


100 


427739 


AW196755 


Hs^8105 


NY0^P14 protein 


1.00 


452768 


AW294571 


HS.136D40 


ESTs 


1.00 



125 
1.92 
Z22 
2.12 
2.30 
Z34 
Z39 
169 
ZOO 
125 
Z01 
1.21 

ao5 . 

120 
120 
154 
3.00 
1.91 
2.40 
2.26 
122 
149 
125 
144 
2.06 
2.02 
114 
116 
^19 
112 
Z43 
Z23 
109 
152 
107 
Z10 
Z08 
158 
ZOO 
102 
1.03 
102 
Z36 
Z73 
Z07 
Z21 
ZS2 
Z23 
Z04 
Z02 
Z57 
Z&4 
Z23 
Z19 
Z28 
Z22 
166 
166 
Z02 
Z11 
Z26 
186 
196 
Z32 
Z26 
184 
187 
179 
144 
• 130 
142 
Z41 
Z23 



TABLE 328: 

Pkey: Unique Eos probeset idenlifier numtier 
CAT number. Gene duster number 
Accession: Genbank accession ntinbeis 

Pkey CATNumber Accession 

431089 125941.2 BG940189AW063489AA71S980BFQ01091 BF880066AA6G6102AA621946AA491B26 
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421798 3042.4 
2726.1 



539529.1 
101919J 



432222 
432810 



444314 
443951 

432639 
438698 
413638 
436772 
448782 



415688 
431300 
418262 



446354 
406687 
411605 



428713 
414221 
414991 
447197 
430915 
424450 
409031 



418259 1338S3J 



429228 21S430J 



459702 
422667 

456034 
430709 
450726 
442048 
406665 
440028 
437866 



456332 
407192 
449328 
432340 
434194 
436198 
440248 
442006 
438177 
435154 
423387 



.1 

224778J 

685588.1 
1234627.1 
666520_1 
750422.1 
0.0 

598730.1 
34267J 



1139B85J 

2200202_1 

3030726.1 

1619980J 

62680 1 

28727.1 

261690B„1 

1239046.1 

9337.12 

126605.1 

2612.2 



443547 137089.2 



1027984.1 
MH1768.18 

1237887.1 

598828.1 

1525406.1 

1239464.1 

34980J 



43BS07 2729.4 



1235745.1 
1529181.1 
5824.2 



452194 90339.1 



831448 1 
00 

10026J 



2132^7 

685586.1 

1785136J 

2176805 1 

197844J 

1674.1 

9531.1 



BC017829 AW276646 AI984209 AA663933 AA634104 AA551528 AA634041 AA298036 BG483990 T89297 BF853958 H64685 T90329 T60644 
T57747 BF852694 T92529 BG482852 BF883054 BF8830S6 N74880 AA829796 N90716 

X65018 BC022318 NM.003019 6E465060 AI732255 6F446634 AiB20677 AI002217 A1924488 81821373 6)770406 B1823937 61820265 BI489632 
BG482911 AA617783 AI807Gg7 AW205576T94427 AA487101 T94513 BI819407 BI822450 61820618 61824619 BG542824 66537862 

BG207209 BEl 66299 AI204995 BG199355 AW969908 AA528756 AW440776 BI044354 

BG292389 C06094 A168e930 AW104534 AA310513 AA630127 AW1 34897 AA046953 AW965490 A1810530 BF092924 AA3341S1 AA334725 
D31302 R^23 AA263003 61824635 AI276287 AI684428 AI524234 A1335035 AW014704 A!911443 AA972102 AI367512 AI126670 AW016017 
AI286003 Al 147163 AA626033 A1539156 AAS65542 AI094253 AW512612 6EBa9628 AA7447S2 6E646306 AW471324 AA999975 AA863400 
H17550 Ai991439 R46187 6E929954 AA333976 D63102 BF744491 

6M310925 AA4261 10 BM310S29 Bf 434286 AW015091 BF475996AW1 18867 BE675186 AI688568 A1453594 AW5905B9 Ai652425 AI827969 
BF056946 A)802886 A]393380 AI476224 AW590639 AW136271 A)4582S2 AI524726 AA843768 AA782158 AI336058 Ai0g7532 AW451S63 
AA459408 AA459633 AA418444 W23807 BG940150 AI493445 AW054729 AI221929 AI868744 AA21540S AA76671 3 AA621546 eF92B317 
BE464132 At990909 AW271459 AI262061 AA215404 N74332 BG940151 6G952261 AA9721 15 W96315 AA689566 R690S7 eF766886 BE769254 
W05240 

BG6761S5BM009591 A1479075 AI025794 AI017967 AM482706E466812 AAB53422AI3926498G952034 AA513384 BF840124 BE714620 
AW969805AI553633 

86207209 8E166299 AI204995 86199355 AW969908 AAS28756 AW440776 6K)44354 

AI756223 AW469334 eF940841 AW080346 AI270363 AI055892 BE464168 BF431797 8E350144 8F44S739 At693409 BF432999 062848 AA398070 
A1383375AW611490 

AA136653 AA136656 AW450979 AA984358 AA80gO54 AW238038 AA492073 BE168945 

AW969880 AA484613 AA501874 R34356 

Ar732297AW204600 T95017 

AW340495 AI964319 AA974603 

M18728 

AW473875 AI190744 W69997 AW104913 AI221098 W69996 AA8B5487 AA861491 

U52054 AL581000 AA1568S0 AW293839 61335865 AA024963 BF149420 BE073977 AW602574 BE164012 BE163992 BEl 63974 AW402161 
BM194134 AW966609 W84374 BF916380 AA385173 WB4366 AA383743 BF903598 AA043776 W84421 AA778446 AW444904 BF446960 
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446868 15525.1 AK074473 BC017997 BI631060 BF971 101 AI886394 A1082824 AV708785 W86073 WD7772 AV660737 AI816793 R52250 BG163529 AA633473 
AI191256 R44763 R19947 BF571346 W862S7 

413059 1488711.1 BE063078BE151503BE151498 



438089 22448.4 



427672 



TABLE 32C: 



Pkey: Unique number conesponcSng to an Eos probeset 

Ref: Seqinnoe source. The 7 (figitnumberB In this column ere Genbank Identifier (GO num^ 'Dunham. etaL'refarB to the pUbOcdlhm entitled Hl^ 

human chremosome 22* Ounham, el si (1999) Nature 402:489495. 
Stiaiul: Indicates DMA strand from whtehexons were precfided. 
NLpositiofK Indicates nudeolide posiSons of predided exons. 



Pkey 


Ref 


Strand 


Ntjxnitkm 


400880 


9931121 


Plus 


29235-29336^36363-36580 


402474 


7547175 


Minus 


53526-53628.55755-55920.57530^57 


406387 


9256180 


Rus 


116229-116371.117512-117651 


404277 


1834458 


Minus 


91665-91946 


402674 


8077108 


hfinus 


39290-39502 


404240 


5002624 


Minus 


116132-116407,116653-116922 


405102 


8076681 


Minus 


• 120922-121296 


406122 


9144087 


Minus 


30940-31386 


400750 


8119067 


Phis 


198991-199168.199316-199548 


404394 


3135305 


^«nus 


37121-37205.37491-37762,410534114a4132 


403421 


9665041 


Minus 


126609-126773,139986-140205 


403903 


7710671 


Minus 


101165-102597 


404854 


7143420 


Rus 


14260-14537 
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5 



401854 


7770S38 


Rus 


151483.151637,151902-152008.152146-15231 


406422 


9256411 


Plus 


163003-163311 


400498 


9743584 


Phis 


4151541695 


402333 


8844110 


Minus 


165693-165856 


406517 


7711431 


Plus 


715V7402 


403420 


9864989 


Rus 


159835.159938 



10 TABLE 33A: About 600 genes upregulated In lung fibrosis relaihra to nonra) lung 

Ptcey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbanicaoces^ number 
UnigenelD: Untgene number 
1 5 Unlgena TUla' Unlgsne gene tHIe 

R1: 90th percentSe of lung fibrosis Alsdhnded by goih percentile d normal lung Als^ 



PIcey ExAccn UnigenelD UnigeneTlUe R1 

20 406964 M21305 FGENES predictad novel secreted protein 16.10 

431089 BE041395 ESTs^WtektysImHar to unknown protein 12^8 

421110 AJ250717 Hs.1355 cathepsinE 11.86 

428330 L22524 Hs.2256 matrbc metalioprotelnase 7 (matrilysin. 11.62 

431958 X63629 H3.2877 cadheiin 3. type 1, P-cadtierin (placenta 9.90 

25 444381 BE387335 Hs.2837l3 hypothetical protein BC014245 6.58 

406850 AI624300 Hs.172928 collagen, type t. dpha 1 8.26 

429500 X78565 Hs.289114 hexabracMonftenasdnCcytotaciin} 8.24 

422487 AJ010901 Hs.198267 mucin 4. tracheobroncMa) 7.72 

408380 AF123050 Hs.44532 diuKqulfin 7.24 

30 432306 Y18207 Hs.3O3090 protein phosphatase 1. regulatory (inhib 7.15 

456034 AW450979 gb:UI-H.BI3.ala«12^).Ui.s1 NCLCGAP.Su 7.12 

453355 AW295374 Hs.31412 myopodin 6.96 

408562 AI436323 Hs.31141 roundabout (axon guidance receptor, Dros 6.88 

421552 AF026692 Hs.105700 secreted fHzzled^elaled protein 4 6.83 

35 426125 X87241 Hs.166994 FAT tumor suppressor (Drosophila) homoto 6.72 

407192 AA609200 gb:af12e02.s1 Soares.lestisji^ Homo sap 6.72 

438089 W05391 nudear receptor subfamily 1, group I, m 6.62 

449523 NM.000579 Ks.54443 chemoldne (C-C motiO receptor 5 6.56 

421952 AA3OO90O Hs.98B49 dynein light chain 28 (DNLC28) 6.46 

40 417433 BE270266 H5.82128 5T4 oncofetaltrophoblast giyooprotein 6.32 

439195 H89360 gb:yw28d08^1 Morton Fetal Cochlea Homo &29 

444301 AK000136 H5.10760 asportn (LRR class 1} &28 

414061 NM_000699 Hs.335493 amylase, alpha 2A: pancieaSc ai3 

423057 AWS61597 Hs.130816 ESTs, Moderately similar to 138022 hypot 6.11 

45 430702 U56979 Hs.278558 H factor 1 (compiBmenq aiO 

424878 H57111 Hs.221132 ESTs 6.00 

417678 Ug0916 Hs.82845 Homo sapiens cDNA: FU 21930 Us, done N 6.00 

414217 AI309298 Hs.279898 Homo sapiens cONA:FU231 65 Us, done L 5.94 

406491 A1086063 Hs.7882 ESTs 5.94 

50 419407 AW410377 H$.41502 hypothetical protein FU21276 5.92 

432222 At204995 gb:an03c03j(1 SIratagene sdiizo brain 81 ' 5.92 

407857 AI928445 Hs.92254 synaptotagmln^ikB 2 5.90 

433230 AW136134 HsJZ20277 ESTs 5.86 

412719 AW016610 Hs.816 ESTs 5.86 

55 407788 8E514962 Hs.38991 S100 catdum^ding protein A2 5.82 

426759 A!590401 Hs.21213 ESTs 5.72 

416007 M13509 Hs.63169 malrb metalloprdBlnase 1 (inteistiUal 5.72 

421814 L12350 Hs.108623 thiombo8pondln2 5.71 

430887 N66801 Hs.260287 KIAA1 841 protein 5.70 

60 453870 AW385001 Ks.8042 Homo sapiens d)NA: FLJ23173 lis. dOTOl 5.62 

436954 AA740151 Hs.130425 ESTs 6.58 

411573 Afi02g000 Hs.70823 K1AA1077 protein 5.55 

432441 AW292425 Hs.163484 Mrontifhepatot^ nudear eactor-3 at 5.38 

410806 AW418779 Hs.114689 ESTs 5.38 

65 410800 BE280421 Hs.94499 ESTs 5.32 

413195 AA127382 Hs.22404 protease, serine, 1 2 (neun}trypstn,moto 5l28 

406687 M31126 matrix melafloproteinase 1 1 (stromelysin 5.26 

417733 AL048678 Hs.82503 H.saplensmRNAfar3ramorunI(nownprol &22 

412622 AW664708 Hs.171959 ESTs 5.22 

70 439941 AI392640 Hs.18272 amino add transporter system A 1 5.18 

440675 AW005054 Hs.279788 ESTs, Weakly similar to KCCI.HUMANCAia 5.15 

430299 W28673 Hs.106747 serine carfaoxypepSdase 1 precursor prot 5^13 

425177 AF127577 Hs.155017 nudear receptor InteracSng protein 1 5.12 

444314 Ai140497 gbxw76b09.8l Scares Jetal_iver_spteen_ 5.11 

75 444527 NM.005408 Hs.11383 smaQ indudbia cytokine subfamily A (Cy 5.04 

452239 AW379378 protdn tyrosine phosphatase, receptor t 4.97 

453874 AW591763 Hs.36131 collagen, type XIV. dpha 1 (unduRn) 4.96 

443884 N20617 Hs.194397 leptin receptor 4.94 

444040 AF204231 H5.182982 golgln-67 4.94 

oO 428281 AA194554 Hs.183434 ATPase, H transporting, lysosomal (vacuo 4.93 

440687 AL080222 Hs.7358 hypolheUca)prdebFU13110 4.92 

420000 AB036063 Hs.94262 p534nduca)lefibonudeoOde reductases 4.92 

432435 BE218886 Hs.282070 ESTs 4.92 
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422573 AW297985 Ks35726 inlsgrin. alpha V(vflronec(in receptor 4.90 

430eS5 BE350122 H5.157367 EST5.W8aHyslmflar to 178885 serineflh 4.90 

446006 NM.004403 Hs.13530 deafness, autosomal dominanl 5 4.90 

415992 C05837 Hs.t45607 hypaOnfical protein FU13593 4.82 

5 430027 AB023197 H$.227743 K1AAO980 protein 4.78 

408393 AW01S318 Hs.23165 ESTs 4.76 

449509 AA001615 Hs.84S61 ESTs 4.72 

416206 AW206248 Hs.1 11092 hypothelicai protein FU22332 4.72 

412828 AL133396 Hs.74621 piton protein (p27-30) (CreutzfiBhklakob 4.72 

10 433226 AWS03733 Hs.94t4 K1AA1488 protein 4.68 

444745 AF1 17754 Ms.11861 thyroid hormone receptor-associated prol 4.68 

442994 AI02671 8 Hs.1 6954 ESTs 4.66 

430580 AA806105 Hs.300697 immuno^bulin heavy constant Qamma 3 (6 4.66 

409361 NM-005982 Hs.54416 sine ocufis homeobox (DrosophUa) homoio 4.66 

IS 41B005 A]166220 Hs.83164 ooHagen, type XV. alpha 1 4.65 

433586 TB5301 Qb:yd7ed06.s1 Soares fetal ihrer spleen 4.64 

424917 A1636208 Hs.96901 hypothetical protein FU23049 4.64 

424408 AI754813 Ks.146428 collagen, type V. alpha 1 4.64 

418113 A1272141 , H3.e3484 SRY (sex determining region YHwx 4 4.62 

20 451109 F11675 Hs.5534 Homo sapiens cDNAFUl 296 1 fis.ctor^e NT 4.62 

450086 AW016343 Hs.233301 ESTs 4.61 

422163 AF027208 Hs.112360 prominin (mouseHilie 1 4.60 

442652 A1005163 Hs.20137B ESTs. Vfeaklysinnlar to T1 2545 hypotheti 4.^ 

410268 AA316161 Hs.61635 sbclransmembrane epItheN anOgen of 4.58 

25 418259 AA215404 ESTs 4.54 

426716 NiyL006379 Hs.171921 sema domain, immunoglobiilin domdn (Ig), 4.54 

432810 AAB63400 ESTs 4.54 

407112 AA070801 Hs.61615 ESTs, Weakly simOar to ALU7.HUMAN ALUS 4.53 

436100 AA704806 Hs.143842 ESTs.Wealdyslmiterto2004399Achromo8 4.52 

30 412652 Ai801777 ESTs 4.52 

436899 AR»5833 Hs.135624 ESTs 4.52 

416179 R19015 Hs.79067 MAD (mothers against decapentaptogicDr 4.52 

436252 AI539519 Hs.120969 Homo sapiens cDNAFU 11 562 fls. done HE 4.52 

443324 R44013 Hs.164225 ESTs 4.51 

35 407690 R47799 Hs.266957 hypothetical protein FU14281 4.51 

431393 AW971493 Hs.134269 ESTs, Highly simitar to cytoldne recepto 4.51 

452518 AA280722 Hs.24758 ESTs. Weakly simnar to 138022 hypoOieii 4.50 

431843 AA516420 ESTs. Weakly similar to P8022hypotheti 4.60 

436865 AW680358 Hs.33g808 hypothetical protein FU101 20 4.46 

40 452561 AI692181 Hs.49169 KIAA1634 protein 4.46 

440273 AI805392 Hs.325335 Homo sapiens cDNA'Fy23523 lis, ctone L 4.45 

442048 AA974603 gb»p34fOS.8l SoaresJ^FlJT.GBC.SI Homo 8 4.44 

436120 AI248193 Hs.1 19860 ESTs 4.44 

423575 C18883 Hs.163443 tnlronofperiostln(OSF-2Ds) 4.44 

45 429697 AW296451 Hs.24605 ESTs 4.44 

439866 AA260717 Hs.6727 Ras-GTPase acthfating protein SH3 domain 4.43 

429688 BE245169 Hs.211610 CU6 triplet repeal RNA-binding protein 4.43 

414462 BE622743 Hs.301064 arfapttol 4.42 

428698 AA852773 Hs.334838 KIAA1866 protein 4.42 

50 420838 AW118210 H8.42321 ESTs 4.41 

458584 AF217518 Hs.8360 PTO012 protein 4.40 

434340 AI193043 Hs.128685 ESTs. Weakly similar to T17226hypothell 4.40 

400076 Eos Control 4.38 

431049 AA846576 Hs.103267 hypothetical prot^ FU22548 sinto to 4.38 

55 445773 H73456 Hs.13299 Homo sapiens mRI^cDNAOKF2p751M01 11 (f 4.36 

420298 AI199510 Hs.267912 ESTs. Weakly similar to AtiJ7_HUMAN ALU 8 4.36 

433339 AF019226 Hs.8038 gliotJtastomaoverBxpressed 4.36 

412490 AW803564 Hs^BSSO HmsaptenscDNA: FU22S2Bfi5,doneH 4.34 

^_ 416391 AI87B927 H8.79284 mesod^spsciOclranscr^t (mouse) horn 4.34 

60 421221 AW276914 KsJ26714 Homo sapiens dona IMAG&713177,nnRNA6e 4.33 

409342 AU077058 Hs.54089 BRCA1 associated RING domain 1 4.33 

429228 A1553633 ESTs 4.32 

426456 D03032 Hs.169984 nudear protein 4.30 

408389 R38438 Hs.lB2575 SLC1SA2 Solute canter laniBy 15 (H«^p 4.30 

65 432476 T94344 Hs.326263 ESTs 4.29 

434963 AW974g57 Hs.288719 Homo sapiens cDNAFU12142fls. dona MA 4.28 

436446 AW016809 Hs.119021 ESTs 4.27 

439556 AI6237S2 Hs.163603 ESTs 4.26 

„^ 428179 AI127772 Hs.279696 senimi^lucocoitlcdd regulated kbiase-B 4.26 

70 428411 AW291464 Hs.10338 ESTs 4.26 

434936 A1285970 Hs.1B3617 ESTs 4.23 

413048 M93221 Hs.75182 mannose receptor. C type 1 4.23 

432606 NH.0021O4 Hs.3068 granzyme K (serine protease, granzymo 3; 4.22 

„^ 413859 AW992356 H5,8364 Homo sapiens pyruvate dehylfrogenaseMna 4.22 

75 409977 AW805510 Hs.97056 hypothetical protein FU21 634 4.22 

441297 AW403084 Hs.7766 uhiquitirvcoT^ugafing enzyme E2E 1 (homo 4.21 

421229 AI056590 Hs.7086 hypothetkai protein MGC1 2435 4.20 

456844 AI2641S5 Ks.152981 COP-diacytgtyoerol synthase (phosphaHda 4.20 

^- 423578 AW980454 Hs.222830 ESTs 4.20 

80 446608 N75217 Hs.257846 ESTs 4.20 

424238 AA337401 Ks.137635 ESTs 4.19 

450747 AI064e21 Hs.318535 ESTs. Highly slmHar to 181 8357AEWS gen 4.18 

420674 NM.000055 Hs.1327 tRilyr^dtolinesteraso 4.18 
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42B227 AA32t649 H&224a 8inaainducibbcytoldn8SuWaniayB(Cy 4.18 

439593 68073597 HS.124B63 ESTs 4.17 

442369 AI565071 ESTs 4.16 

- 445885 AI734009 H$.127699 K1AA1 803 protein 4.16 

5 459702 AI204995 9b:an03c03j(1 Stralagene schizo brain SI 4.16 

452960 AK001335 Hs.31137 protein tyrosine phosphatase, receptor t 4.16 

440703 AL137663 Hs.7378 Homo sapcsns mRNA; cONA 0KFZp434G227 (fr 4.15 

407347 AA829847 gb:od40d07.s1 Na^CGAP.GCBl Homo sapiens 4.14 

409153 W03754 Hs.50813 hypothetical protein FU20022 4.13 

10 430168 AW968343 Hs.145582 OKFZP434I1735 protein 4.12 

451164 T87943 banscripfion factor 7-like 2 (T-cellsp 4.12 

426174 AA547959 Hs.115838 ESTs 4.12 

431562 AI884334 Hs.11637 ESTs 4.12 

^. 417094 NhA-006895 Hs.61182 histamine fi^ytbansferase 4.12 

IS 4252S9 AL049280 H5.155397 Homo sap^ mRNA; cONA DKFZp564K143 (fr 4.12 

449437 AI702038 Hs.100057 Homo sapiens cDNA:FU22902ris, done K 4.12 

425053 AF046Q24 H5.154320 ubiquif!n-acihralingenzynnE1C{hOfndog 4.10 

444020 R92962 Hs.3S052 ESTs 4.10 

439424 AI478667 Hs.1161B3 hypoUieficai protein FU22833 4.10 

20 416987 086957 Hs.e0712 K1AA0202 protein 4.10 

457121 AI743770 H8.180513 ESTs. Wealdy similar to KIAA0822pret8in 4.09 

422737 M26939 H3.1 19571 coHagen, type HI. alpha 1 pieis-Oanl 4.09 

446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 4.08 

431193 AW749505 Hs.296770 KIAA1719 protein 4.08 

25 452144 AA032197 Hs.102558 Homo sapiens, done MGC:5352.mRNA,comp 4.08 

433308 AA582718 Hs.291650 ESTs 408 

445756 AA290690 Hs.300776 ESTs 4.08 

431745 AW972448 Hs.l63425 ESTs 4.08 

444610 A1174783 9b:HA2501 Human fetal Over cONAUbfary 4.07 

30 440099 A1080058 Hs.6909 DKFZP564G202 protein 4.06 

439398 AA2B4267 Hs.221504 ESTs 4.06 

432731 R31178 Ks.287820 fibronedini . 4.06 

415075 . L27479 Hs.7788g FHedreld) ataxia retfon gene X123 4.05 

433826 AF076859 Ks.B6347 hypothetical protein 4.05 

35 428055 AA420564 Hs.101760 ESTs 4.04 

412584 X54870 Ks.74085 DNA segment on Chromosome 12 (unique) 24 4.04 

413243 AA769266 Hs.193657 ESTs 4.02 

431214 AA294921 Hs.348024 v-ral simian teukemia viral oncogene horn 4.02 

453753 BE252983 Ks.35086 ubiquitin spediic protease 1 4.02 

40 414504 AW089181 Hs.115175 sterile-aiphamotifand leudne zipper c 4.02 

434404 AW445034 HsJiS657B ESTs 4.02 

407604 AW191962 cdiagen, type VIII. alpha 2 4.02 

429412 NIA.0a6235 Hs.2407 POU domain. dassZ associating factor 4.02 

436772 AW975688 metaitothiondn 1E{fundionai) 4.00 

45 443257 AI334040 Hs.11614 HSPC065 protein 4.00 

450187 AA7367B8 Hs.78521 K1AA1 71 7 protein 3.98 

433913 At694106 H5.72325 ESTs, Weakly similar to 138022 hypotheti 198 

415060 AJ223810 H8.43213 ESTs, WeaMy similar to lEFSJtUMAN TRANS 3.98 

434098 AW6629S8 Hs.75825 ptetomorphic adenoma gene4Bce1 3.98 

50 432374 W68815 Hs.301885 Homo sapiens cONAFU11346fis. done PL a98 

426816 AA554627 83.292996 DKFZp434A0131 protdn 3.98 

440118 AB040693 Ks.6968 K1AA1460 protdn 198 

413836 W920O3 Hs.70614 ESTs 197 

_ _ 442647 AL038436 Hs.31388 ESTs 3.96 

55 449188 AW072939 Hs.347187 myotubutarin related protein 1 186 

4506S6 AA010539 Hs.l6912 ESTs 196 

410817 AI262789 Hs.93659 protein disulfide isomerase related prot 194 

429784 M89796 Hs.30 memfarane-spanfling44omdn8.subramlyA 194 

408483 AA464836 H3.291079 EST8.Weddystan{lartoT27173hypotheQ 194 

60 407879 AA045464 Hs.6557 zinc finger protdn 161 193 

438146 Z36842 Hs.57548 ESTs 193 

433658 1X3678 Hs.156110 immunoglobulin kappa constant 193 

429355 AW973253 H&292689 ESTs 192 

^_ 437210 AA311443 Hs.293S83 Homo8apiensinRKA:cDNADKFZp586E2317 (f 192 

65 432467 T03667 Hs.239388 Human DMA sequence from done RP1-304ei 4 192 

452416 AA026115 Hs.114777 ESTs 192 

413873 AI310151 Hs.173524 ESTs 191 
400196 EosConbd 191 

„^ 437175 AW968078 Hs.87773 protein kinase. cAMP-dependent. catalyfi 3.90 

70 453204 R1079g 85.191990 ESTs 190 

454076 AW204712 Hs,61957 ESTs 190 

431183 NM.006855 Hs.250696 KOEL (Lys-Asp-GhM^) endopiasmto retk; 190 

4371 58 AWOgOl 98 K1AA1 1 50 protdn 190 

443970 AI280341 Hs.166571 ESTs 190 

75 441633 AW958544 H3.112242 normal mucosa of esophagus spedfici 190 

452281 T93500 8s.28792 Homo sapiens cDigAFUl 1041 (is. ctone PL 190 

414057 AA316896 Hs.257267 FYVE and cdled^Q domain contaHng 1 189 

411495 AP0006g3 Hs.703S9 KIAA01 36 protein 188 

438452 Ai220911 Hs.288959 hypothetical protdn FU20920 188 

80 410297 AA148710 lumkan 188 

427698 AW972594 Hs335499 ESTs 188 

436769 AA748675 ESTs 186 

417819 AI253112 Hs.133540 ESTs 186 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



445800 


AA126419 


HSJ2944 


inositol po)yphosphate-4-phosphalase, ty 


3.86 


425838 


NM_014071 


Hs.159813 


nudear receptor ooacUvator RAP250; per 


3.86 


422173 


BE385828 


HS^619 


phoftionivfike pfoldn MOS019 {CEM15) 


3.88 


42B147 


AW629965 


H&234983 


EST8. Weahly similar Id 210926QA B cd 


185 


445693 


AW800444 


Ks.76507 


LPS4nduoed TNF-aTplta fxtoi 


aes 


432706 


NM.013230 


H5J86124 


C024 antigen (small cell iung carcinoma 


185 


412636 


NM.004415 




desmopla)(in(DPi.DPli) 


184 


436169 


AA888311 


Hs.17602 


Homo sapiens cOI^ nJ12381 lis. done MA 


164 


418876 


AA740616 




6b:ny97f1U1 Na^CGAP.GCBI Homosapiens 


184 


436110 


AA704899 


Hs^iesi 


ESTs. Weakly simDar to 138022 hypoM 


184 


430317 


A6020645 


H3.239189 


glutamlnase 


184 


442806 


AW294522 


Ks.149991 


ESTs 


184 


414320 


U13616 


Hs.75893 


ankyrtn 3, node of Ranvier (ankyrin G) 


182 


430612 


AF182294 


Hs^41578 


U6 snFiNA-Bssodated Sm^ proton LSni8 


182 


427051 


BE178110 


Hs.173374 


Homo sapiens cONA FU 10500 lis. done 1^ 


182 


430573 


AA744550 


Hs.136345 


ESTs 


182 


453394 


AW950474 


Hs.402e9 


ESTs 


181 


431266 


AW149321 


Hs.105411 


ESTs 


180 


434987 


AW975114 




ESTs 


180 


452685 


AI634651 


Hs^250 


v-maf nmisculoaponeurotic fibrosarcoma (a 


179 


435175 


AA744875 


HS.1B9413 


ESTs 


176 


437134 


AA349944 


H5.42915 


ARP2 (adln-related protein 2. yeasQ ho 


177 


430709 


R34356 




0b:yh85d0U1 Soares placenta Nb2HP Homo 


177 


427157 


U51166 


Hs.173824 


thymtne-ONA glycosylase 


176 


441989 


AA306207 


Hs,286241 


pfoiein kinase, cAMP-dependent, legulato 


176 


417228 


AL134324 


HSJ312 


ESTs 


176 


418546 


AA224827 




gb:nc32904^1 Na.CGAPJ>r2 Homo sapiens 


176 


450779 


AW204145 


Hs.15e044 


ESTs 


3.75 


412408 


D51103 


Hs.73851 


ATP synthase, H transporting, mitochondr 


175 


443879 


Z28462 


Hs.9927 


Homo sapiens mRNA; cONA DKFZp5640156 (fr 


175 


414812 


X72755 


Hs.n387 


monokine induced by gamma interferon 


175 


429494 


AA7e9365 


H8.12G058 


ESTs 


175 


447118 


AB014599 


Hs.330988 


Homo sapiens, Similar to Bfeaudal D (Dro 


3.75 


408822 


AW500715 


Hs.57079 


Homo sapiens cDNA FU13^7 fis, dona OV 


174 


419591 


AF090900 


Hs.91393 


Homo sapiens cONA: FU21887 lis. done H 


174 


448121 


AL045714 


Hs.128653 


hypothetkial protein DKFZpS64F013 


174 


436260 


BE172762 


Hs^2710 


ESTs, VVeakiy6imilartoALU5_HUMAN ALU S 


174 


421485 


AA243499 


Hs.104800 


hypothetk:a) protein FU10134 


173 


414883 


AA926960 




COC28 protein kinase 1 


172 


416178 


A]806527 


Hs.192822 


serdogicaily defined breast cancer antt 


172 


452250 


BE618654 


Hsi»607 


hypothetical protein A-21 1C6.1 


172 


444099 


D87432 


Hs.10315 


solute canier femily 7 (caGonk; amino 


172 


438607 


AW080237 


Hs.252e84 


ESTs 


3.72 


408221 


AA912183 


Hs.47447 


ESTs 


3.72 


416699 


6E539639 


Hs.173030 


ESTs. Vfeakly similar to ALUS HUMAN ALUS 


3J0 


419900 


AI469960 


Hs.170698 


ESTs 


170 


446342 


BE298665 
AW967109 


H$.14846 


Homo sapiens mRNA; cDNA DKFZp5640016 (fr 


170 


446100 


H3.13804 


hypothetbal protein dJ462023.2 


170 


448019 


AW947164 


Hs.195641 


ESTs. Moderately similar to 136022 hypd 


170 


413283 


R78669 


Hs^756 


hypothetical protein similar to swine ac 


3.70 


447846 


AA324057 


Hs.77955 


Homo saptens cONA: FLJ23527 fis, clone L 


170 


436198 


AK001125 




Homo sapiens cONA FLJ10263 fis, clone HE 


170 


418300 


A1433074 


Hs^682 


Homo sapiens cDNA: FU21578 Us, done C 


169 


408495 


W68796 


Hs^37731 


ESTs 


169 


424452 


N41367 


Hs.173002 


ESTs, Weakly similar to 138022 hypolheQ 


168 


448479 


K96115 


Hs^1293 


UDP-N-adeyigjhicosamine pyrapliospho^ 


168 


431974 


AW972689 


Hs.200934 


ESTs 


168 


416354 


NM_000633 


Hs.79241 


B^»t]CLUymphoma2(BCL2) 


168 


417412 


X16896 


Hs^2112 


inlerieukin 1 receptor, type 1 


168 


413645 


AA130992 




gb:zo15e02^1 Stralagene cdon (937204) 


167 


416221 


BE513171 


H3.79086 


mitochondrial ribosoma! protein L3 


167 


419111 


AA234172 


H&137418 


ESTs 


167 


423979 


AF229181 


Hs.136644 


C8 box-oontainiftg WD protein 


166 


418875 


W19971 


Hs.233459 


ESTs 


166 


451690 


AW451469 


H$.20999O 


ESTs 


166 


423032 


AI684746 


Hs.119274 


RAS p21 protein activator (GTPase adiva 


165 


414888 
428347 


AL039185 


Hs.77558 


thyroid hormone receptor interactor 7 


166 


At264161 


Hs.183773 


gdgl autoantigen. golgbi subtemily a, 4 


166 


426779 


AA384577 


Hs.93714 


ESTs. Weakly sin^ to T00385 hypotheS 


166 


435335 


A)693150 


Hs.137928 


ESTs 


3.66 


410577 


X91911 


Hs.64639 


gttoma pathogenesis-relsted protein 


166 


452933 


AW391423 


H3.268555 


Homo saptens cONA: FU22425 fis, done H 


165 


429105 


087077 


Hs.196275 


K1AA0240 protein 


164 


407813 


AL120247 


Hs.40109 


K1AA0872 protein 


164 


425863 


U43604 


Hs.159901 


Human unMenCfied mRNA, p^^ sequenc 


164 


451678 


AA374181 


HSJ6799 


DKFZP564O0764 protein 


164 


452420 


8ES64871 


Hs.29463 


centiin. EF-hand protein, 3 (CDC31 yeast 


164 


452408 


AA308477 


H8.29379 


hypQlhe6c8lprot^FU10867 


3.64 


441466 


AW673081 


H$54828 


ESTs 


163 


414013 


AA766605 


Hs.47099 


hypothetical protdn FU21212 


162 


420056 


AW043684 


Hs.99804 


ESTs 


162 


424886 


H88584 
BE348813 


Hs.96900 


hypothetical protein; KIAA1830 protein 


162 


431774 


H8.268561 


hypothetical protein FU10726 


162 
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435990 


AI015862 


Hs.131793 


417821 


BE24S149 


Hs.82643 


414715 


AA587891 


Hs.904 


444484 


AK002126 


Hs.11260 


417008 


AA191708 


HSJ25825 


413823 


AI341417 


Hs^06 


435354 


AA678267 


Hs.117115 


427832 


AF038382 


Ks.180930 


427846 


AW499770 


Hs.180946 


426116 


AA868729 


Hs.144694 


457635 


AV660976 


Hs.3569 


443998 


AI620661 


Hs.295276 


417867 


AW952547 


Hs.194603 


418182 


AW016405 


Hs.16648 


434941 


AW073202 


Hs.334825 


424831 


HS1453 




448410 


AK000227 


Hs.21126 


421823 


N40850 


Hs.28625 


414781 


D50917 


Hs.77293 


427393 


AB029018 


Hs.177635 


415664 


NM_004939 


Hs78580 


425465 


L18964 


Hs.1904 


417124 


BE122762 


Hs.25338 


416502 


NML006159 


Hs.79389 


419490 


NM_006144 


Hs.90708 


421097 


A1280112 


Hs.125232 


410390 


AA876905 


Hs.125286 


442073 


AW973443 


Hs.8086 


435266 


AK001942 


Hs.4863 


441499 


AW298235 


Hs.101689 


453256 


A1S65587 


Hs.32556 


414142 


AW368397 


Hs.334485 


438023 


AF204e83 


Hs.6048 


412245 


A1952669 


Hs^2883 


424144 


AA454033 


Hs.41644 


446682 


AW205632 


Hs^11198 


431392 


A)371223 


Hs^88871 


433430 


AI883735 




420394 


AB023161 


Hs.97403 


425383 


D83407 


H5.156007 


443547 


AW271273 




420676 


AI434780 


H5.4248 


410690 


AA322979 


Hs.130266 


459645 


AA074346 




401403 






451166 


T98171 


Hs.185675 


418836 


AI655499 


Hs.161712 


421462 


AR)15495 


H$.104624 


414555 


N98569 


HSJ6422 


432401 


NM.013330 


H5.274479 


408392 


U28831 


HS.44S66 


425836 


AW955698 


Hs.90960 


452327 


AK000196 


H3.2g052 


418721 


NM_002731 


H5.87773 


433627 


AR}78866 


Hs.284296 


422960 


AW8g0487 




430570 


AI417881 


H$.292464 


406387 






416585 


X54152 


Hb.79386 


432340 


AA534222 




412240 


H72176 




450937 


R49131 


Hs.25267 


443634 


K73972 


Hs.134460 


422963 


M79141 


Hs.13234 


424954 


NM.000546 


Hs.1846 


433437 


U20538 


Hs.3280 


425100 


AF051850 


Hs.154567 


450660 


AF131784 


Hs.25318 


444250 


R40815 


Hs.12398 


428386 


R17298 


Hs.295923 


447764 


NM_003776 


Hs.19500 


411251 


R19774 


Hs.22835 


432648 


AAS57g52 




428708 


NKL014897 


Hs.190386 


437233 


D81448 


Hs.339352 


451743 


AW074266 


Hs.23071 


453258 


AW293134 


HS.32S97 


448705 


H05072 


Hs.124984 


414489 


AI620677 


Hs.73105 


429732 


U20158 


H8.2488 


435841 


R28522 


H3.186937 


424130 


AL05O136 


Hs.140945 


451198 


AWg84541 





ESTs 162 

protein tyrosine kinase 9 162 

amyk>-1.&gtucosidase.4^pha^tucanotr 162 

hypotheficfil protein FU11264 162 

Homo sapiens cONAnj20848 lis. done AO 162 

ESTs 161 

ESTs 160 

TBP-«sod8tedraclor172 160 

KIAA0729 protein 160 

ESTs 160 

hypottielics) protein 160 

ESTs 160 

ESTs, llAoderateiy simitar to 138022 hypol 158 

ESTs 158 

Homo sapiens cDNAiaj14752fis. dons NT 158 

ESTs 158 

liypotiieiicai protein FU20220 3 58 

ESTs 158 

KIAA01 27 gene product 157 

KIAA1095 protein 157 

DEAO/H(Asp^(hAla^pff%)t)oxpolypep 156 

protein kinase C, iola 156 

ESTs 156 

i^in kinase C-btndIng protein NELL2 156 

granzyme A {granzyme 1 , cytoloxte T-lymp 155 

Homo sapiens cDNA aj 13266 lis, done OV 155 

ESTs 154 

RNA (guanine-?'} melliyltransfBrase 154 

tiypolhetical protein DKFZp566A1524 154 

ESTs 154 

KIAA0379 protein 154 

iiemicenlln (fibulin 6) 154 

FEM-1 (Celegans) honvdog b 154 

ESTs. Weakiy similar to 138022 liypoDieti 154 

Al^-assodatedspenn protein 153 

ESTs 152 

Homo sapiens cONA FU11997 lis. done HE 152 

ESTs 152 

KIAA0944 protein 152 

Dom syndrome critical region gene 1-lik 152 

hypothetical protein FU12666 152 

vav 2 oncogene 151 

ESTs ■ 3.50 

ESTs 150 

Target Exon 150 

ESTs 150 

ESTs 150 

aquapoiin 9 3,50 

phosplidipase A2. group IIA (piatetets, 150 

NME7 149 
KIAA1641 protein ' 149 

ESTs 148 

hypothetical protein FLJ20189 148 

protein kinase, cAMP-dependentcatalyti 148 

Homo sapiens cDNA: FU22993 lis. done K 146 

cadherin 1 3. H^adherln (heaif) 148 

ESTs 148 

Target Exon 147 

teiomodin 1 , smooth musde (li^OOl) H'hy 146 

gb:n|21d0Zs1 Na^CGAPJ^I Homo sapiens 146 

hypotheibal protein RJ1 3159 146 

ATP<dependant interferon response protei 146 

ESTs 145 

ESTs 145 

tumor protein p53 (Li-FFawnenI syndrome) 146 

caspase6.apopt(»is-fBla!8dcysieinepr 145 

superviHln 145 

Homo sapiens clone 25194 mRNA sequence 145 

EST8,We8My8lmii8rto2004399Achroinos 144 

seven in ab5en6a{Oro60phila9hontdo9l 3.44 

nuclear kx:alizationsion8l deleted in V 144 

HHGP prt^ein 3 44 

gb»il17d)5^1 Na.CGAP.HSC1 Homosapiens 3!44 

KIAA0924 protein 144 

Homo salens brother of COO (BOC) mRNA, 143 
ESTs • 142 

ring finger protein (C3H2C3 type) 6 142 

ESTs. Moderately simflar to AiaJ7.HUMAN A 142 

ESTs 142 

lymphocyte cytosdic protein 2 (SH2 dome 141 

ESTs 141 

Homo sapiens mRNA; cDNA DKFZpS86L141 (fr 140 

hypothetical protein RJ21 127 140 
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AruoUloo 


Hs.226573 


436023 


TMOIfi 

IBlBlo 


HS.3022S1 


449000 


AAw2uuo 


Hs.1 88833 


4J/f J3 


AWofazio 


Hs^610 


4£9Dlr 




Hs.211563 


AAOAIA 


Al/!f2lli4 


Hs.13809 


490SnJ9 


AA5045% 




4Jsoor 


A ADit7Cin 

AAlMrolQ 


Hs.161292 




DtOcxlOf 






AL1 57466 


Hs. 162751 


AlK'iQQ 
400C99 


AI745458 


Hs.343028 


A^i'Mi'i 
4<T£DJ 


AoU2Uooo 


Hs.103u00 




AWWOSoO 


Hs.62119 


440028 


AW473o75 




AKAtYTfi 

4d4UfU 


N79110 


Hs^1276 


4J£i>i^ 


AlooOoW 


Ks.191202 


442426 


AJ373062 


Hs^2938 


428412 


AA426240 


Hs.1 26083 


448772 


AW390822 


Hs^1528 


417067 


AJ001417 


Hs.81086 


413714 


AI5d0944 


Hs.71428 


415663 


AVV296841 


Hs.3 13332 


407904 


W44735 


Ks.9288 


421114 


AW97S051 


Hs,293156 


AAtMA 
44UZ14 


AAZ47i1o 


Hs.7049 


AAnaoi\ 
44^90 


AL0420U5 


Hs.1117 




AiyioUbo 


LI.* 4 A Af 09 

nS.1 44583 


40v«MU 


AWoUUirO 


Ll» OAa4^ 

Hs^4817 


4l4rOJ 


AWU095o9 






AWaoOoOo 


Hs.1 66830 


AiARAtt 
414040 


AA05077d 


Hs.901 


41 ul«J 


AAo7d939 


Hs.69285 


4AJ01J 


Aio73o71 


Hs,7041 


41 rOM 


ttitt fwiiiaoo 
NNL00499B 


HS.K251 


401090 


AI492369 




4£j9l0 


AFuoaOlo 


Hs.135281 


441 W J 


AA315o05 




420729 


AW964897 


H5.290825 


440U1Q 


AA534930 


Hs.1 27236 


AAMRO 
440009 


AWZoo9o2 


Hs,111335 


4o^^U 


N46161 


Hs.35274 


453271 


AA903424 


Hs.6786 


440009 


A Vie 1 ice 

A!7o7756 


Hs.82302 


ill 0001 


X040t1 


Hs.88974 


440^91 


AA325633 


Hs.136102 


410fZU 


A 1104 COT 

Alo81b87 


H8.39526 


40ielUr 


A8u20o81 


Hs.27973 


439943 


AWu83789 


Hs.1 24620 


433282 


BE539101 




410044 


AW97B436 


Hs.82515 


417259 


AW909838 


Hs.81800 


421379 


Y15221 


H5.103982 


434210 


AAo65612 




431923 


AI741770 


Hs.292690 


453199 


AI3362o6 


Hs.32353 


419534 


AA443591 


Hs.90858 


44o9o9 


BE2o7795 


HS.2259S 


400014 


AI241331 


Hs.131765 


422(^2 


AB007683 


Hs.1 11373 


412262 


W26406 




420071 


NM_013989 


Hs.154424 


446094 


AKD01760 


Ks.13801 


44o48o 


AK0013o9 


Hs. 15144 


44U0O9 


AVV9o9259 


Hs.1 86647 


447735 


AA775268 


Hs.6127 


432331 


W37B62 


Hs.274368 


433697 


AA600357 


Hs.239489 


419231 


AL046294 


Hs.1 38245 


4309S0 


AA489525 




409758 


AW474960 


Hs.182258 


417958 


AA767382 


Hs.193417 


Aimin 

4IU/00 


Ar4/yi4D 


Hs.6966 


419543 


AA244170 




454024 


AA993527 


Hs.293907 


452039 


AI922988 


Hs. 172510 


443798 


R07B48 


Hs.188522 


449378 


AW664026 


Hs.59692 


455657 


BE065209 




420126 


NM.016255 


Hs.95260 


444291 


A1598022 


Hs.193989 


424084 


AI940675 


Hs.20914 


437330 


AU53944 


Hs.50115 


443774 


AL117428 


Hs.9740 



inhibitor of kappa Qahl poTypeptiiteoen 

ESTs 

ESTs 

ESTs, Modsrately similar to Ibdl [H.8apl 

&ceIlCLIJIymphoma7A 

hypoUieQcal prote&i FU10648 

ESTs 

ESTs 

ESTs, WeaUy simOar b P8022 hypotheO 

Homo sapiens mRNA; cONA DKFZp761E2423 (f 

ESTs. Weakly simflar to T20593 hypottiefl 

KIAA0831 protein 

hypoihelfcal protein RJ14800 

ESTs. VMdy similar to T17227 hypolheti 

ooBagen, t)rpe IV, a^a 3 (Goodpasture 

ESTs. WeaMy slmBar to ALU^^HUMAN lUI 

hypothetical protein MGC5370 

ESTs 

L-kynuienlne/alph&aninoadipate aminolra 
sotute carrier fanBy 22 (extreneuronal 
ESTs 
ESTs 

Komo sapiens cDNA: FIJ21278 lis, done C 
ESTs, Weakly slmDar to 178685 serineAh 
hypdhetica) protein FU11305 
Irlpeptklyt peptidase H 
ESTs 

hypotheOcal protein FU20136 
unacQve progesterone receptor^ 23 kO 
Homo sapiens cDNA FU12136 fis, done MA 
C048 antisen (B-ceO membrane protein) 
neuFop9m1 

ESTs. Weakly similar to A47582 &«eQ gr 

myosin IE 

ESTs 

atpha-acenIn-2-as50ciated UM protein 

desmogle!n2 

ESTs 

hypothetical protein FU12879 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FU14814 fis, done NT 
cytochrome t>*245, betapolypeptUe (chro 
KIAA0853 protein 
ESTs 

KIAA0874 protein 
ESTs 

hypothetical protein 

KIAA0494geneprodud 

chondroifin sulfate proteoglycan 2 (vers 

smaB Indudble cytokine subfamBy 8 (C^ 

ESTs 

ESTs. Weakly simlar to 138022 hypothec 
mitogen-actlvated protein kinase kinase 
Homo sapiens done 25023 mRNA sequence 
hypothetical protein FU10637 
ESTs, Moderately similar to 138937 DNA/R 
KIAA0423 protein 

seven In absentia (DrosophDa} homdog 1 
ddodlnase, iodothyronlne. type II 
KIAA1665 protein 

hypothetical protein DKFZp564O043 
ESTs 

Homo sapiens cDNA: FU23020 fis. done L 
MSTP032 protein 

TIA1 cytotoxk; granule-assodaled RNA-bl 
ESTs. WeaUy similar to T17227 hypotheU 

ESTs 

ESTs. Weakly similar to 178885 sertneAh 
ESTs 

hypofheticatprotdnFU21776 
gbuicOShOZsl NCl.CGAP_Pr1 Homo sapiens 
hypothetkal protdn FU23403 

ESTs 
ESTs 
ESTs 

gb:Ra-eT03143103004)15-b12 BT0314 Homo 

Autosomal Hlghiy Conserved Protein 

TAR DMA binding protein 

hypothetical protein FU23056 

Homo sapiens mRNA; cONA DKFZp761J1 1 12 (f 

DKFZP434A236 protein 
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425657 


T89839 


Hs,1 19471 


406672 


M26041 


Hs.198253 


419905 


AW248229 


Hs.93659 


425332 


AA633306 


Hs.127279 


418529 


AW{X)5695 


Ks.250897 


417944 


AU077196 


HS.829B5 


433618 


AA60253g 


Hs.345494 


408830 


AA748009 


H8.173328 


415914 


AA306033 


Hs.78915 


415102 


M31899 


Hs.77929 


432626 


AA471098 


Ks.278544 


429493 


AL134708 


Hs.145998 


445S60 


AA332145 


Hs.13392 


4502S6 


AA2S6887 


Hs^4724 


426083 


AW962712 


Hs.126712 


420059 


AF161486 


Hs.94769 


446412 


AI219083 


Hs.42532 


436758 


AW977167 


Hs.155272 


438011 


BE466173 


Hs.145696 


426354 


Nlil.004010 


Hs.169470 


426860 


U04953 


HS.172B01 


437830 


AB020658 


Hs^67 


453368 


V\I20296 


Hs.286178 


409939 


AA463437 


HS.115S6 


413715 


AW851121 


Hs.75497 


407939 


W05608 


Hi.312679 


418283 


S79895 


H183942 


414405 


AI362533 




445893 


A1810702 


Hs.202613 


434423 


NM«006769 


Hs.3844 


408951 


AW407227 


H5.227591 


408949 


AF189011 


Hs.49163 


410337 


M83822 


Hs.62354 


409010 


AI648675 




400419 


AF084545 




454078 


AA601518 


H552209 


422461 


NKL003417 


Hs.1 17077 


441604 


AI683049 


Hs.201282 


411960 


R77776 


Hs,18103 


414895 


AW894856 


H5.116278 


430522 


N7S750 


Hs^42271 


426490 


NM_001621 


Hs,170087 


405268 






429155 


AW009866 


Hs.1 18258 


427196 


AW967522 


Hs.191593 


439776 


AL360140 


Hs.176005 


417727 


AL133623 


H$.82501 


410853 


H04588 


Hs.30469 


411962 


AA099050 




451099 


R52795 


H5^5954 


440452 


AI925138 


Hs.55150 


427480 


BE148769 


H8.334477 


444623 


AI1B3829 


Hs.202111 


424006 


AR)54ai6 


Hs.137648 


435874 


AA868688 


Hs.93102 


443801 


AW206942 


H5.253594 


434962 


AW975084 




430929 


AA489166 


Hs.156933 


426316 


NM.002430 


Ks.268515 


430378 


Z29572 


Ks.2556 


422109 


S73265 


Hs.1473 


451119 


AA805417 


Hs.64753 


414893 


AA2152g5 


Hs.77578 


432676 


A)167386 




428820 


AA436187 


Hs.172631 


422040 


AA172106 


Hs.110950 


437838 


AI307229 




408761 


AA057264 


Hs.238936 


420789 


AI670057 


Hs.199882 


419135 


R61448 


Hs.106728 


446019 


AI362520 




430848 


AW021726 


Hs.345490 


425375 


AA631977 


HS.1S5995 


424075 


AI807320 


Hs.227630 


41^2 


AW964490 


Hs.32241 


453111 


AB014598 


Hs^1720 


454042 


H22570 




407756 


AA116021 


Hs;38280 


447183 


AI554733 


Hs.173182 


437446 


AA788946 


Hs.101302 


431631 


AW023204 


Hs.302743 


420664 


A1681270 


H$.99824 


451582 


AI963026 


H$.2899S8 



EST$ 

ws^ hlstocompafibOi^ complex, class 



ESTs 
TRK-hisedgene 
collagen. ty(>8V, alpha 2 
ESTs 
ESTs 

GAW ng piotein transcriplibn factor, 
exdskm lt!pdrcloss<o^^llelne^&lg rade 
aoetyl-Coenzynie A acelytbansfiBrase 2 (a 
ESTs 

tetheHng factor SEC34 

MFH^plHied sequences wlO) leudne^fc 

ESTs. WeaMy similar to AF1910201 E2IG5 

RAB23, memtier RAS oncogene family 

ESTs. Moderately simBar to ALU8.HUMAN A 

ESTs 

splldng factor (CC1 .3) 
(^trophln (muscular dystrophy, Duchenne 
isoleucine-tRNA synthetase 
KIAA0851 protein; suppressor of actin 1 
Komo sapiens cDNA FLJ 1 1 968 fis. clone HE 
Homo sapiens cDNA FU12566 fis, clone NT 
Honw sapiens cDNA: FU22139 lis. done H 
ESTs. Weakly simflar to A4g019 dynein ha 
cathepsin K (pycnodysostosls) 
KIAA0306 protein 

ESTs. Weakly simflar to TRHY.HUMAN TWCH 

UM domain only 4 

hypothetical protein FU11088 

putative ribonuctease (II 

ceI]dhrisloncyde4^ 

Homo sapiens, Similar to RIKEN cDNA 1700 

Taget 

secreted modular caldum-Unding protein 

zinc linger protein 264 

ESTs 

ESTs 

Homo sapiens cDNA FU13571 lis, clone PL 
KIAA0471 gene product 
aryt hydrocart}on receptor 
ENSP00000223174*MAA07B3 PROTEIN, 
prostate cancer associated protein 1 
ESTs 

Homo sapiens mRNA fu!i length Insert cON 
similar to mouse Xml / Dhm2 protein 
ESTs 

gb:zk85d12.r1 Soares_pregnanLutenJ8jlbH 

interteukin 13 receptor, dpha 2 

ESTs, Wfeakly similar to CAYP.HUMAN CALCY 

hypothetical protein FU11328 

ESTs 

C084 anligen (leutocyte antigen) 
ESTs 

intron o^ trtchorhinoph^bngeal syndro 
gb:EST387190 MAGE resequences. MAGN Homo 
ESTs 

menlngtoma (dsnjpled in balanced transl 
tumor necrosis factor receptor superfami 
gastrfn-releasing peptide 
ESTs 

ubiquiSn spedGc protease 9, X chramos 
gb:qf29c01j(1 SoaresJes&JIHT Homo sap 
integnn. alpha M (oomptement component 
Rag C protein 
ESTs 

ESTs, Weakly simDar to (dadina not ava 
ESTs 

ESTs. Weakly similar to KIAA1353 protein 



gb:df27eQ2.y1 Morton Fetal Cochlea Homo 

KIAA0643 protein 

REI-sOendng transcription factor 

ESTs, Weakly similar to S65657 elpha-IC- 

hephaestin 

hypolheQcai protein FU20093 
ubiquitin specific protease 18 
ESTs 

ESTs. Moderately simOar to CA1C RAT COL 
ESTs 

BCE-1 prot^ 

ESTs. Weakly dmBar to putalhre pi SO [H 



3.28 
3.28 
3.27 
3.Z7 
3.27 
3.27 
3.27 
3.26 
3.26 
3.26 
3.26 
3.26 
3.26 
3.26 
3.26 
3.28 
3.26 
3.28 
3.28 
3.25 
3.25 
3.25 
3.25 
125 
3.24 
3.24 
3.24 
3.24 
3.24 
3.24 
3.24 
124 
124 
124 
124 
124 
124 
124 
123 
123 
123 
123 
123 
122 
122 
122 
122 
122 
122 
122 
122 
122 
121 
121 
120 
120 
3.20 
120 
120 
120 
120 
120 
120 
119 
119 
118 
118 
118 
118 
118 
118 
118 
118 
118 
118 
118 
118 
118 
118 
118 
118 
118 
117 
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432954 








ntslAmu 


nS.27475 


427373 




HS.i3u7dO 






nS.D342o 


441889 




ns.^oojr1 


416959 


D28459 


nS.olK>l£ 


44S525 


DC 149000 


Us 

ns.14o91 


420623 


Dm4*MO0 


Ue QfM^7 

ns,994Jf 


451475 


T19093 






AAT791 AO 


lie 1CQ7Q 

HS. 10979 


429556 




Un OQQBQ 

ns.90900 


448514 


nDUcvDcD 


Ue VliQCC 




AHllOQn'9 
AIIOOoUO 


Hs.i 53944 










AW43U0Ud 


Hs.2534 


•iiion 




nS.o2223 






nS.220702 






Hs. 145544 


itioncQ 
41OvO0 


AVvy7M55 


Hs.293902 




AI\wloi4 


Hs.23w44 




A\AKlir7il/C 

AVVyo/44o 


nS.301711 




/\oU4U000 


r18.ojZ4o 


453915 


AAtuunoi 

M/\0OOi Zi 


nS./00Zl0 


407725 


DC<X}0U9<t 




437412 




nS.o4/44 


440937 


AC0n7T3j1 
nrZUAl £** 


Ue 704 


44^7 


ARM77RA 


Ua 


446126 






4072Q4 




Ue AAMVf 


419145 


IktQQCOO 
n99D00 




418757 




Ue 4f:D7')fl 


430000 


AUU9ftCQ^1 


Uo oncno 
nS.99b90 


437296 


AAQcnoQ^ 

/VV}0U994 


Ue OAOat 


441381 


14091 QC 


U^ <94ffTil 




AAKIIQin 


Ue 40eT7A 

nS.125779 


422900 


ttdA^OM 


Ue oomci 


442787 




Ue OC/VJ'W 


430569 




Ue Ovifn4i: 


419a<iS 


AAJoacon 
AAUOOcU 


Ue onnci 
nS.gUuo1 






Ue *vynAt\ 
r1Su}ZZ/1U 


417308 


uanron 


rfS.oio9^ 


409799 




Hs.76845 


429160 


AVVOOJUOJ 


Ue 4>l>lilCO 

MS. 144409 




£.4ZOOr 


IJ_ CSDO^l 

ns,83oo3 


451107 


AA9^f;ina 

nr\£JulUQ 


Ue 47a^Q 




Al 1R1DC7 
ALIO 193/ 


Hs.10177 






n3.1 74918 




AIIUJ047 


Hs.121001 




Ai^nnflOQQ 

A>UJUU£99 


ns.1 80952 


401000 


AWuOoooo 


Hs.193969 




AF134157 


Hs.169487 


AARQAti 


Aiiyoiio 


ns.1o611 




Aiio«>14o 


Hs.36602 


4no4» 


AA4oOj4o 


Hs.317596 


41/lUO 


A0U99Z 


Hs.61226 




AA01U043 


ns.33^39 


408089 




Hs,4K44 


453686 


At iin^fi 


Ue 911/670 


426167 


APIV?QfY91 


Ue 1C7>(0C 




AAOIM74'l 
AA35>4i 1^ 


Um 44C070 

Hs.noo7o 


416580 


1 vlOf^ 


Ua 7M0C 


419839 


M9AC77 


Ue omru 


424001 


W67883 


Ue 4^7>17ft 


434564 




Ue 4QQ^C4 

nS.1 00001 


433401 


Acmofioft 


Ue 9QA047 




AA.4ft4nT7 
AAODIVJ/ 






AlOOOOUl 


Hs.71721 






Hs.278695 


^9Qft1Q 
4Z90I9 


AL1 33011 


Hs.225108 


44BOfO 


NM.003677 


Hs^2393 


<r«04r1 


VB74ilO 

A9r34o 


Hs.184510 


436288 




Ue 4QO;tin 

rfS.19Z4)U 


433376 


AI249361 


Hs.74122 


416051 


AAe3S666 


Hs.25253 


453468 


W00712 


Hs.32g^ 


412340 


AA101609 


Hs.182685 


438716 


AA814903 


HS.15S113 


419440 


AB0205e9 


Hs.90419 


433017 


Y15067 


Hs.279914 


428513 


BE220806 


Hs.184697 


437666 


AA156781 




451027 


AW519204 


Hs.40806 


448030 


N30714 


H&32Sg60 



ESTs 

Homo sapiens cONA aJ12749 lis. done MT 
myosin phosphatase, largel subunit 2 
ESTs, V\teaWy sbnDar to Z1 95.HUMAN ZINC 
hypolhelical protein FU20274 
utHquituvconjugaling enzyme E2A (RA06 h 
Homo sapiens, Sin^ar to zinc finger pro 
Homo sa^ ntRNA; cONA OKFZp5e&G1924 If 
KIAA0725 protein 

ESTs, WesMy simflar to A43932 mudn 2 p 

ESTs 

KlAA0819proteiii 

ESTs 

ESTs 

bone morphogenetic protein receptor, typ 

chortfin-fike 

ESTs 

ESTs 

ESTs. Weakly slrrdlar to ISHUSS protein d 
hypothe&al protein aJ10652 
ESTs 

KIAA1450 protein 
fibosomal protein L44 
ESTs 

Homo sapiens mFWA; cDKA DKFZp547C136 ffr 
KIAA0810 protein ' 
KIAA1363 protein 

pleckstrin homology domain interacting p 

ESTs. WeaWysimllartoALUUHUMAN ALU S 

gb:za39g11.r1 Soaies fetal Hver spleen 

hypothetic^ protein FU13150 

hypothetical protein MGC5338 

KIAA1700 

ESTs 

ESTs 

ESTs 

hypothetical protein MGC2747 
UOP^^tyMpha-lHalactDsamine:potyp 
progesterone binding protein 
ESTs 

KlAAOId gene product 
phosphoserihe phosphatase-iike 
ESTs 

transmembrane, prostate androgen induced 
Homo sapiens ubiquitin protein r^ase (U 
piecfcstrin homology domain Interacting p 
ESTs 

Homo sapiens, done IMAGE:346028a miVlA 

dynactin4(p62) 

ESTs 

Kreister (mouse) maf-ielated leucine zip 
tumor protein D52-4ile1 
I factor (comptemenO 

Homo sapiens cDNA FU12927 lis. ctone NT 

C06 anOgen 

ESTs 

IhloredoiMke 

ESTs. lifcder^similartoZ195_HlAi(ANZ 

RANblndlf^protebS 

ESTs 

Human done 23574 mRNA sequefrcs 

phosphoGpase A2. group Vil (platelet-x 

paternally expressed 10 

Homo sapiens cONA RJ12807 tis, done NT 

serotogically defined cdon cancer antig 

IRNAIsopentenylpyrophosphate transferas 

ESTs 

Homo sapiens prostein mRNA. cotnpleto cds 
Homo sapiens mRNA; cONA DKFZp434P201 (fir 
densHy-ceguIated protein 

stratlQn 
ESTs 

caspase 4, apoptosis-fetated cysteine pr 

mannosidase. alpha, dass 1A, member 1 

DKFZP566F084 protein 

ESTs 

ESTs 

K1AA0882 protein 
zinc finger protein 232 
p)exinC1 

metaOothionein 1E (functk}nal) 

Homo sapiens, Similar to RIKEN cONA 2810 

membrane-spamiing 44lom8ins. subfan^ A 
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3.17 

ai7 

3.17 
3.16 
116 

ai6 

116 
116 
116 
116 
116 
116 

lie 

116 
116 
116 
116 
116 
116 
116 
115 
115 
114 
114 
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114 
114 
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406122 
435272 
410726 
413083 
407949 
417538 
434938 
434733 
434421 
407930 
424939 
458332 
445034 
446570 
429920 
459513 
419038 
451079 
417386 
453108 
449328 
428656 
425509 
447957 
417226 
452248 
426279 
433814 
453064 
431341 
441789 
456437 
438771 
448497 
416240 
409038 
424776 
408409 
429693 
425960 
431625 
451144 
432274 
408683 
427735 
440603 
415443 



M737345 
AK001214 
W30681 

AA906415 



406685 
446013 
433902 
412610 
432505 
440040 
433255 
419726 
417258 
435800 
444838 
456760 
40B360 
427982 
438398 
410434 
412095 
425955 
450247 
417865 
415457 
438543 
415503 
429138 
447881 
425481 
453315 
440638 
433208 
442495 
418858 
408170 



AL035737 

W21874 

AW050865 

AWa)0718 

AI334357 

A1915927 

AA045847 

AK00D059 

AI000341 

AW293376 

AV659177 

AW473208 

AI032946 

AW134924 

A1827988 

AL037228 

AI311457 

AI962493 

AB037798 

AF079363 

NM.014821 

AV1C050S4 

AA093668 

At646520 

AA609738 

R40334 

AA307211 

D52059 

AI924228 

NM.016289 

BE513269 

NH.001981 

T97490 

AI867931 

AW838181 

BE254962 

AW410646 

AW750627 

AW956103 

AKO00382 



AA916765 

AL121733 

T07353 

AI348408 

M18728 

AI360167 

AW292820 

X90908 

AW274526 

BE219431 

A1274270 

U50330 



AI248285 

AV651680 

AW961251 

AI806090 

NM.016156 

AI683487 

AF051152 

AI624707 

T96509 

AF123303 

AW086059 

AW081710 

AA810141 

U36601 

AB020657 

BE620888 

AW978162 

BE544203 

AJ376551 

AW002834 

AI184717 

AW961605 

AW204516 



H5.294041 
Hs.100914 
Ks.146233 

H5.1 10041 

Hs.15936 

Hs.75184 

Hs.247057 

Hs.275711 

H5.8115 

Hs.159337 

H$34771 

Hs.188361 

Hs.153881 

Hs.143659 
Hs.127160 
Hs.1 15572 

Hs.190325 
H5.240728 
Hs.82043 
H199472 

Hs.188790 

Hs.158213 

Hs.20126 

Ks.4283 

Hs.28578 

HS.169D84 

HS.16S25 

Ks.89463 

Hs.251531 

Hs.7972 

Hs.115185 

Hs,6406 

Hs^1893 

Hs.79095 

Ks.50002 

Hs.164595 

Hs.278337 

Hs.211612 

Hs.164649 

Hs.6634 

Hs.61712 

Hs.274251 

Hs.46847 

Hs.180610 

Hs.7299 

Hs,7948 

Hs.124675 

H5.152774 

Hs.144906 

Hs.74126 

Hs^77721 

H3^2031 

Ks.96840 

Hs.1274 

Hs.118348 

Hs.208558 

HS.127B28 

H3.44344 

H3.181326 

Hs.152213 

Hs.63666 

Hs.5921 

Hs.248549 

Hs^4713 

Hs.6529 

Hs.7369 

Hs,192182 

Hs.78473 

Hs.197298 



Hs.24831 

Hs.24095 

HS.2114S 
Hs^1835 



ESTs 

hypothetica) protein FIJ10352 

Homo sapiens cDNA: aJ221 30 as, done H 

Target Exon 

ESTs 

ESTs 

chlOnase 3^ 1 (cartilaBB glycoprote 

ESTs. VM(iy similar ta 210926QA B cea 

hypothetlcat proteb MGC2452 

Homo sapiens, done H4GC:16169. mRNA. com 

ESTs 

ESTs 

Homo sapiens cONA FU12807 lis. done NT 

Homo sapiens NY.R£N^ antigen mRNA. pv 

ESTs 

ESTs 

ESTs 

ESTs. WeaWy similar to 138022 tiypotheli 

flb:ox06g09^1 Soares.fBtaI_llver spleen 

ESTs ■ 

ESTs, Moderately similar to PC42S9 ferri 

0123 gene produd 

ESTs 

ESTs 

-KIAA1377 protein 
sperm assodatsd antigen 6 
KIAA0317 gene produd 
ESTs 

musclet)IInd (DrosophilaHiltt 
tubby like protein 3 
ESTs 

potassium large conductance caldunvaci] 
proteasome (prosome, macrapain) subunit. 
KIAA0871 protein 

ESTs. Moderately similar to PC4259 ferri 
M02S protein 

hypolheticat protein DKFZp761N0624 
efridermal growtl) fedor receptor patttway 
^ faidudble cytoldne subfemOy A (Cy 

Homo sapiens cDNA RJl 1 537 fis, done HE 
SEC24 {S. cerevlsiae) related gene fairti 
hypotheticat protein DKFZp434H247 
Homo sapiens cDNA: FU22547 fis. done H 
pyruvate dehydrogenase kinase, isoenzyme 
hypothetical protein aJ20375; KIAA1797 
TRAP and TNF receptor-assodated proldn 
spDcing factor proline^lutamine rich ( 
Novel human gene mapping to dwrnosome 1 
ESTs 

ESTs, Wealdy simitar to T14742 hypolheti 
gb:Human nonspecfficcrossreading anlig 
ESTs 
ESTs 

fatty add binding protein 6. Heal (gas 
K1AA0049 

zinc finger protein, subfamily 1A. 4 (Eo 
KIAA1527 protein 
bone morphogenetJc protein 1 
gb.7y60a09^1 f 
ESTs 
ESTs 

guanine nudeotJde binding protein (G pr 
hypolhellcBl protein FU20534 
KIAA1073 protein 

wingless-lype MMTV faitegFatlon site fad 
toIMite receptor 2 

Homo sapiens cONA: FU21592 fis, done C 
ESTs. Moderately similar to S65657 alpha 
hypothetical protein 

EST8,V\feakly similar 10178885 sedne/th 

ESTs. Vte&lys&nllar to ALU1_HUMAN ALU S 
ESTs 

N-deacetytase/N^fotransterase (hepars 
NSI-bindinoproteiii 

GCN 1 (general cootrd of aminos synt 

ESTs 

ESTs 

gb:te64o10jt1 Soares_NFU.T.GBC.S1 Hofws 

ESTs 

ESTa 

bypolheScd protein RG083M05.2 
ESTa 
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aio 

109 
3.09 
3.09 
3.09 
109 
3.08 
3.08 
3.08 
3.08 
3.08 
3.08 
3.08 
3.08 
3.08 
3.08 

aoe 
ao8 
ao6 

106 

3.06 

106 

106 

106 

106 

106 

108 

105 

105 

105 

105 

105 

104 

3.04 

104 

104 

104 

104 

104 

103 

103 

103 

103 

103 

102 

102 

102 

3.02 

102 

102 

102 

102 

3.02 

102 

102 

101 

101 

101 ' 

101 

100 

100 

100 

100 

100 

100 

100 

100 

100 

^98 
zg8 

Z98 
Z98 
198 
2.98 
2.98 
2.96 
Z96 
2.96 
195 
195 
194 
194 
194 
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430382 
449765 
407361 
407910 
43600S 
449458 
449317 
411118 
449494 
416311 
433068 
429272 
432519 
445467 
426782 
426216 
413882 
421554 
446488 
421391 
424527 
419284 
415788 
448461 
410491 
443441 
422725 
431926 
420406 
437678 
440115 
439883 
446428 
451273 
435154 
432451 
442703 
419341 
435861 
420137 
438441 
426158 
432882 
416239 
434792 
424852 
«S53B 
419551 
450571 
442435 
424148 
445784 
408072 
434779 
450295 
440381 
433923 
420802 
429870 
437908 
438676 
410361 
444969 
446423 
435517 
425354 
439180 
429073 
433834 
417365 
414521 
402550 

TABLE 33B; 



AA477908 
N92293 
M744622 
AA650274 
BE551650 
AI805078 
AW293413 
N27944 
AW237014 
D80529 
NM_006456 
W25140 
AI221311 
AQ39832 
R14614 
N77630 
AA132973 
AW137676 
AB037782 
AVyf304350 
AW138558 
AW820889 
AW628686 
W15284 
AA465131 
AW291196 
AA315703 
AW972724 
AA741024 
AA829860 
R41808 
AL359652 
AW082270 
NM-014811 
AA668764 

Mmwx 

AL04494S 
N71463 
AF254956 
AA306478 
AW664960 
NM.001S82 
NWL013257 
AL038450 
AA649253 
AI222779 
NWL012337 
AW582256 
AF158240 
AI986208 
BE242Z74 
AI253155 
BE00S566 
AFiS3815 
AJ766732 
AA917808 
AI823453 
U22376 
L01087 
AI082424 
AA813745 
BE391804 
A1203334 
AW139655 
AA928626 
U62027 
Ai393742 
AA446167 
AA620742 
DS0683 
D28124 



H5.282267 
Hs.206832 
K5.292645 
Hs.41296 
Hs.158126 
Hs.208261 
Hs.132906 
Ks.22147fi 
H5.31S369 

Ks.288215 

Hs.110667 

Hs.130704 

Hs.15617 

Hs.33846 

Hs.13895 

H5.1&4492 

Hs.97775 

Hs.15119 

Ks.igig58 

Hs.334873 

Ks.215658 

HS.7B851 

Hs.74832 

Hs.64001 

Hs^195 

K3>199993 

Hs.88378 

HS.122S34 

Hs.144924 

Hs.171096 

Hs.12496 

Hs.26163 

Ks^71 

Hs.116298 

H5.1 18888 

H5.16606 

Hs.95327 

Hs^19 

K$.t99067 

Hs,279686 

Hs.48948 

Ks.13245e 

Hs.144848 

H$.1584S0 

Hs.91011 

Hs.60397 

HS544760 

Hs,1741 

HS.U6065 

Hs.16773 

Ks^lSI 

Hs^10628 

Hs.190495 

HS.14662S 

Hs.1334 

Hs^11S93 

Hs.123446 

Hs.62561 

Hs.160628 

Hs,150120 

Hs.130177 

Hs.155935 

Hs.199067 

H&47385 

Hs.130788 

Hs.82028 

Hs.76307 



EST$, Moderate!/ sim!^ to 138022 hypot 
ESTs. Moderately sinfiar to ALUMUMAN A 
ESTs. Wbakfy simOar b ALU5.HUMAN ALU S 
GbroRecOn leucine rich transmembrano p 
Homo sapiens cONA FU13350 fls. done OV 
ESTs 

19A24 protein 

ESTs, Weakly sbrtlar to AF108460 1 ubinu 
Homo sapiens cDNA: aj23075 fis, done L 
gb:HUM081H05B Human felai brain (TFujiwa 



ESTs 

ESTs. Weakly sfanUar to BCHUIA 8-100 pro 
ESTs. We^ siniltar to AUJ4JHUMAN ALU S 
ESTs 

Hbmo sapiens cONA aJ11654 8s; dbna HE 

ESTs 

ESTs 

KIAA1361 protein 

tnvnunoglobulin superfamDy receptor tran 
ESTs. WeaUy similar (o (54374 gene NF2 
ESTs, Moderately similar to ZN91_HUMAN 2 
KIAA0217 protein 
ESTs 

Homo sapiens done 25218 mRNA sequence 
ESTs 

ESTs. W^sKnilartoALUBJIUMAN (tn 
0b:EST384816 MAGE resequences, MAa Hbmo 
ESTs 
ESTs 

ESTs. W^ simnar to B Chdn B, Sdut 
Homo sapiens EST from done DKFZp^^4A041 
ESTs. WeaMy slmilarto ALUCHUMAN ALU S 
KIAA0649 gene product 
ESTs 

ESTs. WtsaUy similar to ALU1JJUMAN ALU S 
ESTs 

ESTs. Weakly slirtlar to ALU1_HUMAN ALU S 
candtdato tumor suppressor protein 
CD3D antigen, delta polypeptide rnT3 co 
ESTs 

v-eib*2 avian erythroWasCc leukemia v 

serum/gtucocoiticdd regulated kinase^ 

ESTs 

ESTs 

ESTs 

nasopharynge^ epithelium specific ptote 
anterior gradient 2 pfenepus laevfe) hom 
ESTs 

ESTs. Highly simaar to B34087 hypotheti 

Integrin. beta 7 

ESTs 

Homo sapiens done TGCCIA00427 mR!^ sequ 

potassium fnwardjj^fecGiying channel, s 

ESTs 

ESTs 

ESTs 

avian mydoblastosb viral oncogen 
prot^ kinase Ctheta 
ESTs 
ESTs 

guanylate binding protein 1, biterferon* 

ESTs 

ESTs 

ESTs 

oomptemenl oomponent 3a receptor f 
v-er b-b2 avian erythrobiasfic leukemia v 
ESTs 
ESTs 

transfomning gnmth factor, beta reoepto 
neujoblastono, suppression of tumoi^ 
Target Exon 



Z94 
Z94 
2.94 
Z93 
^93 
2.93 

zn 

2.92 

i9i 

Z91 
190 
190 
2.90 
2.88 
2.88 
Z88 
2.88 
188 
2L84 
184 
182 
182 
182 
182 
180 
180 
180 
160 
179 
178 
178 
178 
177 
176 
176 
175 
174 
174 
172 
170 
170 
167 
166 
162 
160 
158 
167 
156 
156 
156 
156 
153 
152 
152 
148 
146 
144 
144 
144 
143 
137 
128 
2^6 
127 
127 
126 
100 
1.98 
t.72 
1.52 
1.30 
1.09 



Pkey: Unkiue Eos probeset Identifier number 
CAT number. Gene duster number 
Accession: Genbanh accesston numbers 

Pkey CAT Number Accesston 

431089 12594U BG940189AW063489 AA715980 0FOO1O91 BF880066 AA666102AA62t946 AA491826 
456034 685586.1 AA136653AA136656AW450979AA984358AA809054AW238038AA492073BE168945 
407192 220020^.1 AAfi02964AAS09200 
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439195 
432222 
405687 
4443t4 
452239 



433586 
418259 



431843 

442046 
429226 

442369 
4S9702 
407347 
4511B4 



444610 
407604 



.1 
0_0 

1027984 1 
10116J 



32908.1 
133853.1 



44S334_1 

75042?_1 
215430.1 

2691713J 
S39529J 
B10943L1 
153U 



214529^.1 
43771 1 



438089 22448.4 BM475665BE644917AW770789AW952971 K648S3BM2632S9AI224S4S Ml 84866 N69114AW518902AI440169AA809472AV654440 

AA281642 AU185230 AW337382 AI872923 AI537113 N73882 T83378 H63731 BF671764 AWB97624 AI811204 AA344646 BE009112 BQ89S664 
H91240R60548 N41701 
21979J AF086037 H89360 H89548 

BG2O7209 8E166299 Af204995 BG199355 AW969908 AA52875S AW440776 61044354 
M31126 

AW749625 AW749626 AW749544 AI140497 

BG034853 AW173315 AW303375 BG180225 BG939153 BF057308 AA600736 AI751258 AI090486 BE939504 AW631492 AI768270 AI862133 
AA417652 BE378218 AA599207 AW794702 AA024968 AA445024 Ai148235 All 91710 BI493797 AI272646 81493796 AA634323 AI754332 
AA25B414C05155 AI21B226AI039656AI350380A10846S8AJ754989AJ67354SAI432010A1751035AA375571 AA446297 8G216743 
BC011194 AW517087 AA601054 TB5512 

BM31 0925 AA426} 10 6M310629 8F4342d6 AW015091 BF475996 AW1 18867 6E675186 AI668558 At453594 AW590569 AI652425 AI827969 
BF055946 A1802866 AI393380 A1476224 AW590639 AW136271 AI458252 AI524726 AAB43758 AA782158 AI336058 A1097532 AW451563 
AA459408 AA4S9633 AA418444 W23607 BG940150 AI493445 AW054729 AI221929 AI868744 AA215405 AA766713 AA521546 BF928317 
BE464132 AI99(S09AW271459A1262061 AA215404 N74332BG940151 BG952261 AA972115W96315AA689586RB9057BF7668e6BE769254 
W05240 

432810 101919.1 BG292389 C06094 AI668930 AW104S34 AA310513 AA830127 AW134897 AAa48953 AW965490 AI81053O BF092924 AA334151 AA334725 
D31302 R20723 AA263003 6IB24635 AI2762B7 A)684428 At524234 AI335035 AW014704 AI911443 AA972102 AI367512 AI126670 AW016017 
A)286003 AI147163 AA626033 AI539156 AA565542 AI094253 AW512612 BE889626 AA744752 BE646306 AW471324 AA999975 AA863400 
H17550 AI991439 R46187 BE92g954 AA333976 D63102 BF744491 
412652 18856J A(601777B£677762AW008210AW009441 BE3S0994BE207949Al09l475AI802774A)B27533At075363AI659979AA687855At07B125A]09028S 
AA670058 AA60241 1 AA683472 AI436058 AA612826 AI036932 BG057726 AI167355 BF44g023 A)289476 AW074381 BF97291 2 A1991780 
AA889119 AI537472 239730 A!868953 A1192337 BE812g78 BE81 2939 AA1 15248 H99006 AI9157B4 mm T1674B D20468 AA509899 BF081234 
AA115336 BE812876 BE612972 BE812982 BE813006 BE813019 R43883 BE6129d1 

AW970134 AA516420 AA5430O7 BG057526 BIO0143O A1498371 D60181 061004 D60382 C15876 N91070 C14815 C15068 D80763 C14818 
C15161 D60184D60656 
AW34049SAI984319AA974603 

6G676155 BM009591 Al47g075 AI02S794 A(017967 AA446270 BE466812 AA853422 A)392849 BG952034 AA513384 BF640124 6E714620 
AW969805A1553633 
AA993566 AIS21958 AI565071 AI664217 

BG207209 66166299 AI204995 BG193355 AW969908 AA528756 AW440776 61044354 
T23514AI6557B5 

299386 AV716301 BE222333A;949687BF732426AI4M086AW21430AW577332 N68315eE672030AI084440AI250908 N50901 AA757364 
BF515264 AI1B6231 BE466036 AW631313 AA993514 N5441 1 BG05751 5 AW013895 N77963 AA708723 AI273295 N59093 AA522665 AIB71574 
AAS05521 AAB122S6 AA5S3641 AW467057 W68650 AI168772 AA988308 AA910057 At668258 AA918322 AI335647 AA621494 AA904390 
AI760007 AU146694 AV758440 BM142562 N66507 A1674563 AW628584 A)2e3902 BF223368 AI090490 AI452918 K71423 AA062767 T94332 
AA016003 BG681608 AA082426 AA029451 N71873 BF437B45 T541 54 AV741757 
A1174783R12271 R83559 

AK074129 AL138071 AI806793 AI453544 AI084616 BF062435 BE672932 AI828296 BF062464AI 803443 AI264326A1280956AI743547 AW295227 
AW139967 AI972S61 AI863570 AW062752 AW062756 AW062753 AW176904 AW062755 AW062750 AW062749 AW176886 AI751694 BG251391 
AA043821 W30872 BI524827 N94581 A10B4614 AA776976 A1377572 AI751695 A1016576 AW439994 AW194255 AW627763 AA668249 A1079870 
AW022524 AI968540 Ai248360 AA702490 AA043282 AAS31511 AW176883 AW751767 AW176BS7 AWQ62748 AW)62745 AW176862 AW176815 
AVV176814AW176846AW176841AW176871AW176874AW176819AI927437AW191962R36548AA853713AW196059^^ 
AW062754 AW062751 AW062746 AW1768B8 AW751772 AW176859 AW751765 AW751762 AW176833 AW176B73 AW751745 AAe94S13 
AA780815 AW176857 AW176834 AW176639 AW751771 AW176B78 AW176843 AW176842 AW7S1756 AW176847 AW1768^ 
AW176849 AW175844 AW176877 AW751757 AW176850 AW176851 AW751758 AW751746 AW176858 AW176B53 AW761766 AW175845 
AW176872AW176840AW176876AW176852AVV751754AW176836AW176656AVV751773AVV751760AW7517M 
AA8S3712 AA317570 R38469 AW291569 
AA156151 Z25109 C05177 AW975688 AA731063 N67084 

AL050068 AA160485 AW173544 AW29B506 AW439860 AI521563 AI702529 AI393606 AW138323 AA570109 H19504 BM021968 BF063327 
BFS93552 AAS30766 AI59771 7 AI807128 AA523012 AI356250 AW451 857 AA974203 AI762577 BF512552 AW007307 BE675286 AW450602 
AA962057 AW5ie069 AJ582546 BF221924 BF222543 AIB01808 AW468699 AW000736 AI866625 AW235356 BM021837 AA91 1956 AI680606 
W86516 T03370 AW61 1634 H41653 AI466349 H19S68 AW090198 AW043993 R39847 

BC013S39 81494690 8M91211 AI928393 AA843540 8G936644 AU165628 61495842 AW1732S5 6M052709 AI743999 A(690144 AI922209 AI740907 
AW340368 AA928759 AW1 18737 BF513970 AA707807 BF435295 AI339463 AI373842 AI433809 eE222392 AA602308 AA428251 AM60355 
AW662760 AI888087 AI342098 AA722418 W78151 N64382 BE221B48 AW025g01 AA4521 20 AI150479 AI015166 AA779515 AA661791 BM474307 
BI911169 BG575154 AW953303 T33604 D59141 AA385785 AA148648 Bh4461981 BQ681168 BI602483 BE889592 AW954311 BM052986 BI962893 
A1989299 Z42328 BF029504 T35666 B6402602 AU16S770 A1023271 AA147719 AI434079 AI569000 AI276488 AA992453 AA342821 AA648303 
AI349384 AI051008 AA926941 AA350894 AW071451 N22249AI784138AA083847 N22258AW440825 AA661S70 A^ 
R70463AI383586AA827189B1494872AW021094BI494871 AA905500AM60923BI492041 AW02B965A1624611 BG271780AI497723H88862 
059858 N89979 AA658425 N81 154 062341 AJ274437 N66697 H96993 AJ370663 AA728850 H05232 R59374 H12223 AI9357ra AI362553 D60006 
N29572 AI916833 N75273AA148710 B1597117 BG740471 AA332671 AA333874 AA643052 AW020176 R70550BG623469 AA452342 AW965441 
W1d723 R32966 BE8B3841 R61003 AI910374 At885262 R55325 AI468927 R34681 H9621 1 Z39807 BF954386 
1239572.1 AW975804AA748675AA731006 

1438J M77830 NM.004415 AF139065 BG681115 BG740377 BI712964 BG000656AA1 28470 BI438324 H27408 BE931630 BE187165 AW370827 

AW370813 J0521 1 BG698865 BG740734 BG68061 8 BG739778 Bi765807 BM353403 BM353248 AW1 77784 AW205789 AW951576 AW848592 
BE182164 BF149266 BE940187 B1060445 81060444 BF350983 BE720095 BE720069 BE715154 BE082584 BH)82576 BE004047 AA857316 
BI039774BE7138188E713548AW170253BEt60433BI039775AWB86475B^M62504BE931734BF149264AAW 
AU127260 AW364859 BF993352 BG223489 BE819009BF381184 BE715956 R58704 AA852212 AW366566 BI090358 BF087707 BE819046 
BE819005 AA377127 BE073467 BE819069 8E819048 8(036306 BG990973 BI040954 BF919911 AU140155 A1951756 AI434518 AW804674 
BR52969 BE837009BE925826 BF149265 AW995615 BE814264 81039782 AU1 40407 BE144243 BE709863 BF985642 BE001923 BF933510 
AW255328 BG435319 BE1B2156 AW365175 AW847688 BE818280 AW177933 BF873679 AW17B00O BE0B2526 BF476866 BF086994 BF592276 
BE082507 BE082514 BE082505 BF873693 AW068a40 AWB47678 8Fe04153 AW365157 BE813930 BE002030AW365153 BE184941 BF749421 
BE184920 BF839562 6E184933 BF842254 BE698470 BE931048 BF999889 BF368816 BE184924 BE159846 BE714632 BE184948 BG986645 
AA131128 AAO^I W39488 C04715 BF096124 BE86534t AW799304 AL603116 6E149760 6E705967 BE70S966 6E705968 AW848723 
AW376699 AW376817 AW376697 BG005097 BF751 115 BE698084 AW848371 AW376782 AW848789 AW849074 AW361413 BF927725 BF094211 
AW997139 BE865474 6E185187 BE156621 BE71OT89 BE713297 BE7132g8 BE179915 AW799309 BF872345 BRJ88676 BE705939 AW752599 
BG005197 BF350086 BE715198 BE715155 BF752396 BF093817 BF831 190 BF752409 BE008561 BG959922 BF094833 BF094748 BF094583 
AW377699 AW607238 BE082519 AW377700 BF349467 Al 190590 AI554403AI392926AU1 58477 81467252 AU159919 AI760816 BF082516 
AI439101 AA451923AI340326AI590975B1791553AI700963A1142B82AA039975AA946936AA644381 6M314884AA702424AI417612 
AWig0555 AI220573 A1304772 A1270345 A1627383 AA552300 A1911702 AW166807 A1346078 WgS070 AA14gi91 AA026864 AIB30049 AW780435 
A]078449 A)819984 A)858282 6M68588 A1860584 AI025932 AA02S047 AA703232 AA656154 AA515SO0 AW192085 AA918281 T77861 AI927207 
AI2052B3 BF082491 AW021347 AI568096 BEg39862AA088866 012062 AA0S6S27 AA782109 W192B7 W02156 AW1SQ038 AA022701 TB7181 
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436772 t239464J 
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418876 
434987 
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413645 
424831 
433430 
443547 

459645 
422960 

432340 
412240 
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60 
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121279.1 

121985J 

1234627.1 

242836J 

8371.2 



28727.1 

1234345^1 

1272834J 

2181751.1 

137089.2 

722255.1 
11862.2 

1619980.1 
8235.1 



20 



25 



30 



35 



432648 129028.1 
40 451198 18361.1 



45 456505 15472.2 
442113 43919.1 



50 440028 598730 1 
414783 262554 1 
431698 6468.7 



441623 3362.1 



433282 759J 



434210 54921.1 



412262 4362.1 



430950 594908.1 



H44405 AI910434 8F082513 AI494069 A1270027 A1635878 AA128330 B068142S BE706078 R20904 BG6B00S9 BG676647 BF764409 AA026654 
AV745530 B1762796 BG287391 AW7987B0 BE706045 BE925470 AW7991 18 BFD87995 BE002273 AW879451 A1571075 BE067788 AV721320 
A1022862 N29754 C03378 N84767 AA1d1077 H30146 BE714290 At686B69 A)568892 AI915596 AW105614 AI887258 Ai538577 BE926474 
83)67737 B6319488 AA247685 AW798883 AW103S21 BF989173 AW88087B BE939707 BE1B57S0 BE714064 BE713903 BE71386B BE713763 
BG950164 BE713810 AW365151 BG955489 BE0052726F915937 AW365148 AI905927 6F992780 AW853812 BG954443 B1770853BG679406 
6G740832 BG681087 BG698430 AA455100 T87267 BE696209 BE696210 BI069463 6E006273 BE872225 AW391912 8E92551S BG877012 
BG741970 AA026480 BE705999 BG677157 BE009090 BG681378 BE712291 B6961498 BG6789B4 B1040941 AA337270 AW384371 AW847442 
81058659 6E813665 W9504B W2545BAW177785 AA025851 BE931733BF1 54837 BG849393 BE714441 AW9982456E7}1801 AI284090 
BE064323 BE719390 BE940148 BQ991212 BF375714 BF349S22 6G998267 T487g3 B1013292 BE001925 AW365156 AW36S154 AW606653 
8F763109 BE931637 BE1 67181 BE713879 6F354008 BF67872S H90899 AW365145 W38382 AI498487 
AA740616 AA654S54 AA229923 
AW9751 14 AI09^ Ai767001 AA658364 AW517542 
AW989880AA4M613 AA501874 R34356 
T59708 AA224B27TS9843 BE156903 

AF274943 B6494894 AI719075 AA908783 AI9351 50 AM22691 AA910644 AA5831 87 BM272157 AI828996 AA527373 AW972459 AI831360 
AA7724 18 AI033892 AA100926 AU154749 AI4S9432 AM23513 AI094597 AA740817 A1991 988 AI090262 AI312t04 BI256707 AA459522 AA416871 
Aia75239 A(339396 AA701623 AI139549 AI336880 AA633648 Ai969380 At382835 AA399239 Ai146955 BFS14270 N92892 A1348243A1278887 
AA459292 Ai494230 BF507531 A)4g2600 AA962596 AW613002 AA293140 AA23S549 BF108854 AA954344 N49682 AI457100 AW589407 
AW300758 BE220715 BE220898 BE569091 BM009647 BF900351 AI537692 AI203723 AI857576AA584410 AW371667 BM172363 
AK001125 AU120581 AU146612AW301393 
AA130992 AWg69537 AA503835 
H61453 AA838765 A1693104 AA721 107 
AA961588 AI863735 AA588325 

AV645808AA701657AW271273AI796734 AI472316AI017531 AI061178 BF109096AA548964 M83805AA131648 AA156589BE708349 
AW952494 030877 AV684717 Z24837 F00167 BF576150 T63841 R78995N87474 
AW292g84 AA724304 AA074346 AA074368 

BF090249 AW954g47 AW890487 A1305236 D60845 060537 AA825429 W36294 AW890410 AW088235 BF740240 AA448709 A)3S0279 AA8791 1 9 
AA319510 BE702077 BE699015 BE702D46 AW901293 T99319 081708 BF475488 D60383 081751 BE699260 
AA534222AA632632 T81234 

BF963346 BI460763 BI599382 All 86089 AY055003 AW9S91 85 AA1 87681 AI692282 AU153608 BI493898 B!493899 A)797349 BE550679 AI701 1 37 
AA744561 AA082682BE218816AW511501 BF447881 AI859101 BF059295AI914038 At346564AI68S003Al676076 Ail 25336 AW469637T10225 
Ai831296 R53451 A1453440 AA983739 AW470873 AI348290 BE857670 055901 Z43908 T34429 FO7305 AK074340 AL5381 18 BG201484 AI334192 
T35535 AW176751 BI4961 32 B1496133 AA469961 H72176 BF326265 AI770016 AI693177 BF223634 BF963661 BF962265 61034894 AI475851 
N69071Ai910707BE219038A)918036BE670589BM1935l5AI33d497AA805525AA487625AA830336AA749368AW1D4323AJ6^ 
AI272788 A1536849 AW152414 AW161923 T23854 AA610763 AA912188 AW339028 Z39946 T10224 F03171 AJ205478 AI638791 F02580 F01551 
AW207551 AA421030 BE246012 81034937 BI035373 BF939581 H19984 BEB67247 H92677 N5S988 BF9S7332 H16615 R16442 BF956229 
BFg61886R12698AA10n86AA365932R53452BRI62714BP959364 H41634AW086187H199B5BF054881 P02581 H18616H41527 AW003446 
BE243443 H92276 AW954002 BI670132 
AA677593 AA618150 AA557952 

AK024780 NMJ)24549 AW440385 AW965502 BF477855 AI149742 BE669424 BE349941 AI37671 7 AA977493 AW592564 AA8841 1 1 AAB65453 

N51840 BF593777 AA044645 AW512183 BF059034 AI984320 AA733139 AIB84553 AI559673 AI421 262 AI052068 AI085B91 BI9631 00 AA479883 

AI271696AI539839AI370924AI147781AA535765BI966896AA877448AW152259AA938992AA996022AI8004t0AA6t3303^ 

AW571973 A1203770 BI494224 BI494225 D45504 AA705824 AI6251 83 AA4371 18 AA640325 AA235528 AA73141 1 AA626263 AI560519 AW071 1 14 

AW005691 AI055933 AA971813 AA334937 N58441 AA536020 AW874174 AL555960 AW607532 H44647 

BC017965 AW969075 AA279982 AA504511 AI219979 AA504595 AI245579 AA278181 BG485019 BI049312 

AK074156 AI401548 BE002083 BEC02085 BE002084 B61 13650 AW580909 AA632959 BG610861 BG436183 BF438017 A1568508 AI095503 
Ai204208 AW244158 BE622614 NS2349 BE042940 A)095856 AA9761 19 AA835838 AI620854 AA456105 AA971S69 A1246289 Ai335636 AA994082 
AI250060 AI095710 AA60512S AA291672 BE463947 N52870 AA287553 AI14g093 AA581B02 W31684 AA610159 BF247061 BG2S8954 BE176866 
BE177041 AA332106 BE967020 

AW473675 A1190744 W69997 AWt0491 3 AI221098 W69996 AA685487 AA861491 

BE972657 AW069569 A1392889 H85161 AI074962 AA423935 BE671254 AA912627 H39845 K69083 AA280259 N67746 AA155722 
BM423604 BE075913 AI492369 AI831729 AA807134 Ai057292 BF059751 AA918209 BEB58647 AW016707 AI676289 AI452916 BF593014 
A1480099 AI5222B3 At628681 A147O402 BF056360AW236B30 AA514206 8F065812 AI796201 BF0S5617 BF054827 AI288301 AA705763 
BC022413 8E395396 8F754175 AA50G621 BE706665 BE706678 AA723159 BE153169 BE706729 BE708SS8 BE153312 BE706708 AW371B53 
AW371849 BE153241 BC017410 AI337912 AI090244 AW090300 BE219837 AI623661 BE501576 BE501734 AI742232 AI023964 Af458424 
AA975373AI288904 AI984S83 AAB9032S N32562 A1358102AW241694 AI0384WAI872071 A1018389AA576391 AAg77874 AWl 89392 W37448 
AA612894 A1277548 H89551 A!899774 K89365 AA315805 AW579186 BC014584 BC014581 AW780125 A1672414 BE32814S AW600919 8F031306 
AW172758 BE70B322 AA345675 BE875779 H28241 H25318 BF540913 86179688 BF110202 AA528775 W37S73 BE041644 AW366504 6M1 29522 
BM129822 AI122760 BE71B200 AW887496 AA149420 BE7O8307 BE539395 BE74B765 A1373653 R75904 BF979185 BF691393 BG495595 
BI094458 BE706702 BG496559 BF248373 BG494800 

BC015150 BM471660 BM478941 B!759769 BI094332 BE005819 BE005834 BW023432 BE179013 BE675522 AI680275 AA515794 AA256503 
F01634 AF002990 BG288697 AI09161S AW002917 AI6310a5 AW182664 BE501232 A19221 88 BF836783 A1631078 BE464544 AI376986 AW503774 
AA42413t AWS1 1849 AAB72I69 BF084885 AM32024 Af491908 BF197906 AAS6059g A{274416 AW269225 AW0S828d AI61 1346 AA236200 
C00417 8E6e9699 A1468149 BM4G0137 AI685282 Aig027l4 B6990066 BF894391 T83393 164298 BF80g721 BE925494 AV755948 BF928512 
BF894494 BF808642 AA256710 BF894347 8E928709 H06072 AA199607 B1B59300 AI983233 A1521 122 8E168898 BE30284B X78770 BC000927 
NM.032632BG707103BG913857BG613824BG613257BM313077B1561132B1461426BE389811 BE391509BE164646AA746371 AV750611 
BE175813AA332040BG200997BG485576AL043706AW16994BF903156AW820369AU1328248F941136BE543211AA5^ 
BFB09623 AW814670 AU129338 AW880782 AA371070BG615427 81667976 BI598706 BI599341 BE869166 B6502432 NM 020144 AF218840 
BI461 108 81562235 81462594 61458753 BF195561 BFB32738 BE690146AI678629AL1 35238 AU129571 AW500045 AA382478 AW502738 
BI561031 BE260153 BF087226 BM480024 AW604395 BE707050 R72668 BF038881 8F895070 86036594 BE551609 BF326901 BG286526 
AW962888 BE544939 AW383399 

AK057015 AI025834 BE857936 AA149091 AI742972 AW439172 AI253168 AA255613 BF5131 75 AIW5006 T03406 AW338149 AA836442 
AA420530 R88566 AI61 1672 AA433916 AA442855 BF063008 AA812568 A)889706 AA71531 3 AA768539 AA767620 AA665471 AA404380 
AA665612 BF056442 AA706388 A1550676 AA627448 AI141769 H782Z7 AW901852 H78221 BE701982 8F689273 AA397464 N33072 R60218 
AW968247 H14833 AA768305 AA043348 R56470 BF739B32 R51827 AI474963 BG494574 AA149090 BF238154 A18Q2210 8E000129 8F734513 
R41964 H2t055 R85253 R177(K R40844 BF790218 BG388356 BF003037 AA703138 AA377348 W24822 

AK056051 AI971 258 AI681 134 AU148134 AI803300 AA917325 AA923683 BF895068 AJ304442 A! 167464 AI284188 AA054272 AA829262 A1351910 
T90930 AI886230 U70056 AU1 19916 6F446537 6E503207 BE502849 AI698102 AA258553 AV718529 AV719917 6F724133 BI438668 AI804000 
BE349103 A1912^ BE645117 AA227954 AA446520 AAB7gi47 AA28m0 AVV136872 AA807g07 A143S989AI33g626 A)383274 AM1851^ 
BE771804 BF894509 AA455093 AI379061 AI150855 BF769906 R17298 AU138740 BF808607 6E674633 AW00132 AA2277B9 AA253099 
AV^75199 AA935418 n431S F12666 AA022923 TB9028 AA258608 VV26406 BE838620 AW00706 AAIOIS^ 
R42764 R44804R41278 
A15258n AA469525 AWOBSIH 
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419543 
455657 
414405 



251903.1 
1490185 1 
112689 1 



409010 1033U 



411962 
434982 
432676 
437B38 
446019 



432954 
446126 



419145 
409245 



458332 
459513 
449328 
406686 
417258 
447861 



425481 
440638 
442495 
416311 
431926 
435154 
437908 



2307710J 

121871J 

3503J2 

2512601J 

65B727J 



454042 30254J 



2159612.1 
610.2 



248375.1 
3199Ji 



437866 34267J 



1139685 J 
417837J 
3030728.1 
0.0 

400835.1 
44623.1 



334120_2 

371165.1 

928718_1 

1280744.1 

1237041.1 

126605.1 

13268.11 



AI018087 AA244170 AA244355 
BE065209 BE065364BE065110BE065111 

AU)47596 AA393792 AI670731 AIQ37957 AW874384 A1038137 N82286 AI241379 BE501096 AW090696 At927369 AI669226 AI369437 AI371075 

AW612409Al688711AI183289AA47ni7Al076l22AA635190AA700S84AA7B1S08D8l02DBF575223AIK^ 

BG263489 BE090666 BE090664 BE090662 K26545 

AL575207 AL551714 BM0147B1 BG542863 BG771232 AA429722 A)377511 A]770155 AA716665 BG003427 AA810811 AA442760 AA128610 
AA059411 A)796263AM94075A1572127 AA420992 BF436083 Ai648675AAB78813 81488614 B6700886AA128609 AV702879 AA731146 
AI560338 A1373224 AA919169 AI75B175 AA976350 BG701414 BF057794 AW135598 AA0625B3 BI549631 A)18S077 AA933879 AW024454 
AA193289 AA045194 BG928396 BE856883 8F435859 AA188423 AW237471 R99289 D61992 BE856637 BF368270 AA194235 N51319 AA383499 
NS306S B6548812 BF027898 BG779448 
AA099050AA099526 T47733 
AW975084 T90204 AA658177 
AI187366 AA618478 AA558869 
Ai30a202 At307229 AA769348 

M352520 D25917 A1670784 AI742347 AW269769 A1270700 AW61 0641 AW793036 AW793035 AW610540 AW362220 AW362166 AW362214 
AW362225 AW362228 AL1 19827 

AJ420458 AI018523 AA7C8688 BF949633 AL1 19553 BF945960 A1081305 AA041432 AI921013 AI684910 AI654847 AW874199 AI206120 
AW24142B R43035 T66767 AW10371 5 W28478 BF953052 H45926 BF807568 AW903943 BE170143 BI040435 BF931989 0I6OOOOO AV722350 
W27787 H45331 BI549761 R53955 BI549855 BG991583 BI491075 AW020049 AW1 29293 H45263 AA41 0309 AA340613 R42410 AA707199 
A1431587 BE858679 AW292267 A)421676 AA041 195 6E466753 AI243913 At358894 AW13729B AI366468 N64350 AA779107 AW025969 R49056 
A^34701 1 R55722 AW771 106 F04969 Z38381 FOI 659 H17396 B1493714 A18801 03 AW771447 Aia>2561 AA788851 AI494438 BF8561 14 H22S70 
AI076345AI887648AA572691 

BF946219 BF946218 BF851494 AL536879 AA457150 AI590194 AI582529 AA464515 AA916242 AA337109 AA335509 N46908 AA336322 
AA336407 AA337222 AA319240 Bt026817 81027058 AL536880 AI693827 AA651730 A1701013 BM068789 AW339506 AA293021 8F891 108 
AI458885AW361203AW974652 AI761251 A1655763 AA628083 BE047125AW08S916A112g687 N52070AW1 72361 AA052951 AWOB5909 
AlOOOOOB AA962570 AI371 342 AI364207 AA464514 AI962506 AI824603 AW376300 AA058439 AW361 192 AV656660 N50282 BF820514 BF891008 
H40784 BF8911 12 BE708029 AW043567 AA056762 

6M456602AV706711 BF37g357K90994AA23443SAAS5B020BF351723AA328271 R94615 N99638BG223375AW973750 N59599 
AF030234 8C017465 BG008526 AW505S50BM460141 N47324 AA361037 AA321632 N45606 AV752798 AV6571 16 AA2g6632 AU1 37857 
AW467027 AI742080 A1624350 K56206 AA478516 AW439997 AW393556 AW393523 AIS59753 AI608732 R66S56 H01374 61257369 61259830 
AWg60845 BM466252 AW956813 BE76B647 AV6S8853 BM055248 BF372070 BF372055 BF372061 AA347852 AA905B63 BGS05078 AV654024 
BF093291 AW021929H22650AA4597156G496341 6E697763BI254209BG499543 H42946 6105978061086741 H87896 H87599 6F691752 
6E768511 86940948 W37195 BF372041 BE8B3796 BF372082 BF367329 BF909744 AW966003 AV714014 BI492868 BI495144 AA921845 
At693426 A(652147 AI435449 N47325 AI434429 AA573137 A1163429 AI629962 Ai332526 BF513d37 AIie9561 A!221962 Ai376034 AW1 18897 
AW665247 AW340077 N41605 AA478519 AA46387S At858260 AA463379 A)292305 BE045947 AA971089 AI125820 BG940947 A1080245 
AA8a4954 Ai12S702 AI3B2934 AA931635 AI356631 AW439905 AI027633 AI399648 AI014533 AA347851 AA738261 N67374 N69081 AI768667 
AA946472AI819214AA293133AI1B6725AA889214AI222635BM95143N29605 N48812AA769041 AI49^^ AA095911 eE222062 
056772AW372265 6M054985 012465 BG534562 AW003511 H87486 H42880AW1 90293 BF594697 6F3776t 1 H22043 BI255749 BI492848 
H1&217 H21980 H22651 H88179 H87354 K440S2 H25165 H44128 

U52054 ALSB1000 AA156850 AW293839 6)335865 AA024963 BF149420 6E073977 AW602574 BE1&4012 BE163992 BE163974 AW402161 

BM1941 34 AW986609 W84374 6F91 6380 AA385173 WB4366 AA383743 BF003598 AA043776 W84421 AA778446 AW444904 8F446960 

AA837481AW55539AW468444AW4680Q2AA811830AA581806AI8666a8AI572124AA687333ra^ 

AA156781 A1536733 BM144850 A1471883 AA040926 BF507639 AA043777 AW874142 BE832S23 6E163g72 61022546 61021204 

AI000341 AI765341 AW873274 

AV704062 8E162284 AI032946 BF360636 

AJ8847ai A)652308 AI651694 AI638744 A»62493 

Ml 8728 

BG1 16781 BI914326 B1030196 N58885 N63406 AV683374 N56892 BG1 10501 AA333708 AA359583 AWg83123 NgS562 N95696 N95587 

AK074291 AW293424 BE676135 AI832125 BE019146 BE465019 AI761124 AA617778 A1279232 AW575897 AI672039 F28618 BF924261 

AA7221B4 BF934174 BE004328 AV749301 BE880282 BJ019798 B10193B9 BF928776 AWB13409 AV726604 AAD77560 6E272975 BF949119 

AW814195 BE879126 AI697926 BF594155 BE205787 BF063513 N35828 A)94d557 AI433B39 A1379679 6G056182 AI589094 N23123 AA588805 

AW316581 AI080272 Ai421980 A)493318 BF194830 N87S90 AA495^3 N32996 AA699844 H96845 H96592 N28741 81035539 BF747723 

6F171066 W01350 H05495 A)243785 Z39622 AA687432 A1350659 R461Q2 

AL520496 AW978162 A1610475 A1668990AW470054AA609426A1167391 AA815231 AA368241 

BG00950D AI376551 AA897445T87714 

AI184717AW518883AF121173 

AA179446 AA357764 D81719 D80529 C14833 

AW972724 AA877998 AA522631 AU185388 

AW972063AA66S764 AA804491 AW665688 AA765069 

At740S86 AA771806 BE500996 AW204531 A1082424 AI033879 BF093176 AA771764 D38676 



TABLE 33C: 



Pkey: Unique number conesponding to en Eds pnAeset 

Refc Sequence eource. The 7 digit numbers in this column are Genbank Mentifiar (GO mimben. 'Dunham, et aL* lefefs to the pubUcaOon enSfled The DNAsequence of 

human chromosome 22* Dunham, et aL (1 999) Nahire 402:48M95. 
Strand: bdicatesONA strand from which eioons were predicted. 
NLpo^Son: Indtoales nudaoOde posKkms of predktted exons. 



Pkey 


Ref 


Strand 


NLposIOon 


401403 


7710966 


Phis 


1461B0-146294 


408387 


9256180 


Phis 


116229.118371.117512-117651 


405268 


4156151 


Minus 


24404-24521 


408122 


9144087 


Minus 


30940^1386 


402550 


7652009 


Mbws 


8041M0673 



TABLE 34A: Aboul 703 genes upregulated In idiopathic pubnonaiy fibrosis relaflve to hypersenslthrily pneumonitis or non-specific hiteistilial pneumonitis 

Pkey: Unk;ue Eos probeset idenQfier number 

ExAocn: Exemplar Aocesston number. Genbankaocessldn number 
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R2: 



UnigeneJO: Uitigene number 
Unigene Title: Unigene gene title 
R1: ^Jhjer^ofl^AlsdM^ 

SOth pen»ntite of IPF AIs (fivided by 90ft perre^ 

Unigene HQe 

matrix metaOQprotelnBSe 1 (interstilia) 
seciBted phosphopfotein 1 (osleopontln, 
coOagen. type X\/I1. alpha 1 
FGENES predicted novel secreted protein 
Homo sapiens mRNA; cDNA D)<F^564K143 (fr 
fibrinogen, gamma polypeptide 
GSTs 

proenkephaBn 

hypoUi8tica)pn}t^aJ22418 
ESTs 

ESTs. VVisaMy slh^ar to 8^462 tvpoM 
ESTs 

delodlnase, iodothyrtxrine, type tl 
serins (or cysteine) proteinase inhiUto 
ftypotlietical protein PR02834 
buOous pemphigoid anl^en 1 (230/240kO) 
Predicted gene: Eos cloned; secreted w/V 
ESTs 

Homo sapiens cDNA FU1 1177 fis, done PL 
UDP glycosyltfBnsferase 1 family^ polype 
ESTs 

interteuldn 1 receptor antagonist 
small indudble cytokine suUamily B (Cy 
prominln(mouse)-li)(Bl 

Homo sapiens mRNA; cONA DKFZp«4F1 135 (f 
KIAA0455 genepioduct 
phosphoiamban 
inlronofperioslin(OSF-2os) 
ESTs 

Down syndrome critical region gene Wilt 
ESTs 

ESTs, Weakly sinular to ALU8_HUMAN ALU S 
lebmodin 1, smooth muscle (LM0D1) (Thy 
E74*like ^ 5 (ets domain transcrtot 
ESTs 

liephaesttn 

lntarteu)(in 13 receptor, alpha 2 
ESTs 

integrm. beta^Jke 1 (with EGF-like rep 
ESTs 

hypothetical protein DKFZp434F0318 
ESTs 

Bpocalln 2 (oncogene 24p3) (NGAL) 
complemenl component (3d/E|»tein Ban vl 
nephroblastoma overexpressnl gene 
gb:hm30f03j(1 NCI_CGAP^TTiy4 Homo sapiens 
C16001440•:glI12330704|gb|AAG52890.1^AF3 
C16001416*:gl|127431121reflXP 010131.2| 
similar to S68401 (cattle) glucose indue 
ESTs 

collagen, fype XIV. alpha 1 (unduBn) 
RNA bindng motif protein 86 
mucin 5, subtype B, tracheobronchial 
caisequestrin 2 (canfiac musd^) 
ectonudeoside triphosphate d^hoaphohyd 
C4001445:gi(12G97g9S|dl«BAB218iail (AB 
ESTs 

desmocoOind 
ESTs 

tumor protein 63 kOawiOi strong hornotog 
ESTs 

NM.002275*:Homo sapiens keraSn 15 0<RT1 
ntotubule-assodated protein IB 
proiilatelel basic protein (includes pta 
ESTs 
ESTs 

Target Exon 

cell adhesion molecute with homotogy to 
secreted modular caldunvbinding protein 



Pkey 


ExAfxn 


UnigenelO 


405443 






418007 


M13509 


HS.B3169 


446619 


AU076643 


H5.313 


422511 


AU076442 


Hs.117938 


40S954 


M21305 




42S2S9 


AL049280 


H5.155397 


407244 


M10014 




421823 


N40850 


Hsm25 


419875 


AA853410 


Hs.93557 


409542 


AA503020 


Hs.36563 


416310 


AA814100 


Hs.86693 


442006 


AW975183 




43831S 


R56795 


Hs.82419 


425071 


NM»013989 


H3.154424 


409632 


W74001 


Hs.55279 


434233 


AF119903 


Hs.138453 


408000 


L11690 


Hs.198689 


447033 


AI357412 


Hs.157601 


420185 


AL044056 


Hs.251385 


420195 


N44348 




426682 


AV660038 


Hs.2056 


408221 


AAd12183 


Hs.47447 


417079 


085590 


Hs.81134 


419216 


AU076718 


Hs.164021 


422163 


AF027208 


Hs.112360 


422404 


AL133571 


HS.3361B9 


445745 


AB007924 


Hs.13245 


407938 


AA905097 


Hs.85050 


423575 


CI 8883 


Hs.163443 


446659 


AI335361 


Hs.226376 


425383 


{383407 


Hs.156007 


437620 


AW976930 




414591 


AI888490 


Hs.55902 


416585 


X54162 


H8.79386 


425707 


AP1 15402 


Hi.11713 


430712 


AW044647 




453111 


AB014598 


Hs.31720 


451099 


1^2795 


Hs.25954 


414290 


A(568801 


Hs.7172t 


417801 


AA417383 


HS.82S82 


412639 


AW961284 


Hs.203838 


423720 


AL044191 


Hs.23388 


429757 


AW452355 


Hs.256037 


429504 


X99133 


Hs.204238 


412228 


AWS03785 


Hs.73792 


430223 


NM.002514 


H$^3S935 


411880 


AWB72477 




401645 






401673 






449048 


Z45051 


Ks.22920 


416316 


H5B721 


Ks.271628 


453874 


AW5917B3 


Hs^6131 


451149 


AU)47586 




421190 


U95031 


Hs.102482 


410036 


R5717i 


Hs.57975 


429525 


N92&40 


HsJ05353 


405120 






432224 


AW169460 


H3.208358 


418663 


AK001100 


H5.41690 


412622 


AW664708 


Hs.171959 


424012 


AW368377 


Hs.137569 


442767 


AI017208 


Hs.131149 


401785 






411800 


N39342 


Hs.103042 


427535 


R29&43 


H5.21&4 


444009 


A1380792 


Hs.135104 


435143 


R12375 


Hs.194600 


402333 




429609 


AF002246 


Hs.210863 


454078 


AA6015t8 


Hs.22209 


452242 


R509S6 


Hs.1 59993 


418693 


AI7S0876 


Hs.67409 


428411 


AW291464 


Hs.10338 


459702 


AI2W995 




428839 


AI767756 


H5.62302 


427138 


N77624 


Hs,173717 



thfombospondin 1 
ESTs 

gb:an03c03j(1 Stratagene schizo brato S1 
Homo sapiens cONA FU14814 lis. done NT 
phosphaBdto acid phosphatase type 2B 



R1 


R2 


a66 


7.50 


7^ 


a66 


&63 


103 


4^ 


Z81 


4.73 


5.69 


4.34 


4J4 


4.14 


5.88 


4.12 


1.80 


3.90 


Z01 


a68 


Z90 


3.66 


Z84 


3.60 


113 


3.49 


170 


3.47 


1.38 


3.46 


146 


3.28 


Z51 


3.26 


126 


119 


Z12 


3.18 


101 


3.16 


116 


ao8 


118 


3.07 


1.98 


a02 


Z36 


Z98 


1.72 


Z87 


1.48 


Z80 


Z66 


Z78 


1.65 


2.78 


Z46 


Z78 


1.55 


Z74 


1.56 


Z74 


1.85 


Z72 


Z72 


Z67 


ZG5 


Z66 


1.47 


Z66 


2.66 


Z62 


Z62 


2.61 


1.72 


2.69 


1^7 


Z59 


1.23 


Z58 


ZS8 


Z56 


Z29 


2.57 


1.74 


Z57 


1.60 


2.57 


1.00 


2.56 


1,12 


Z56 


1.25 


Z54 


Z54 


Z53 


138 


2.47 


Z83 


Z46 


1.18 


Z42 


144 


Z40 


1.69 


Z40 


1.95 


Z40 


1.61 


Z40 


2!40 


Z39 


1.27 


Z38 


2.38 


Z38 


ZOO 


Z38 


Z38 


Z37 


1.63 


Z37 


1.52 


Z36 


1.22 


Z35 


1.79 


Z35 


1.18 


Z34 


Z34 


Z34 


Z07 


Z33 


1.68 


Z33 


115 


Z33 


1.10 


Z32 


1.30 


Z32 


1.45 


Z32 


Z32 


Z32 


1.54 


Z31 


1.98 


Z31 


Z49 


Z31 


1.32 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



4223S3 


T55979 


Hs.115474 


repficaOon factor C (acQvalor 1) 3 (38 


2.30 


130 


456536 


AW135986 


Hs.257859 


ESTs 


2J2R 


128 


428166 


AA423849 


Hs.79530 




IT! 


1.88 


456936 


M81349 


Hs.1955 


senim amyloid M. conslMe 


Z25 


116 


417728 


AW138437 


Hs.24790 


KlAA1573pttilan 


tR 


1.37 


453070 


AK001465 


HS.3157S 


SEC63. endoplasmfa: reticulum translocon 


2.24 


142 


409159 


AW673312 


H&50848 


hypotheffcal protein FU2033t 


2124 


124 


404942 


AI7391S9 




s^ng factor, arginlne/serine^ 9 


124 


164 


410286 


Hs.61896 


DKFZP58BN2124pn}leIn 


Z24 


146 


440516 


S42303 


Hs.161 


cadherin 2, type 1, N-cadher1n (neuronal 


2^4 


1.94 


421574 


AJ000152 


H5.105924 


defensin, beta 2 


123 


1.36 


416005 


AI1B6220 


Hs.83164 


ool!a9en.typeW»8l|)ha1 


Z2Z 


1.37 


421948 


L42583 


Hs.334309 


keratin 6A 


ZSQ 


120 


407788 


BE514982 


Hs^l 


S100 calchmhbinding protein A2 


119 


1.61 


417563 


AA203701 




gb2x52a10.r1 Soares_fetaLRvar_8p(e8n_ 


118 


140 


431089 


8E041395 




ESTs. WeaMy similar to unknown protein 


116 


146 


447333 


BE090580 


Hs.70704 


hypothetica! protein dJ61688.3 


116 


100 


455797 


BE091833 




gb:l!^.BT0731-260400.076-F04BT0731 Homo 


116 




414987 


AA524394 


Hs.294022 


hypothetica! protein FU14950 


116 


1.99 


403362 






NM»001615*:Homo sapiens actin, gamma 2. 


116 


1.61 


432374 


W68815 


Hs.301885 


Homo sapiens cDNA RJ1 1346 Us. done PL 


115 


111 


407137 


T97307 




gb;ye53li0S.s1 Soares fetal liver spleen 


115 


124 


402641 






C1002296:gil66778171refINP.033126.1j rep 


114 


114 


418236 


AW994005 


H$.337534 


ESTs 


114 


114 


413059 


BE151498 




gb:RamT0295.291199-031.E11 HT0295 Homo 


114 


114 


432437 


W07088 


Hs.293685 


ESTs 


114 


114 


428398 


AI249368 


Hs.98558 


ESTs 


114 


114 


428336 


AA503115 


Hs.183752 


microsemlnoprotBin, bela* 


112 


1.43 


421B53 


All 17472 


Hs.108924 


$H3-doman piolein 5 (ponsin) 


112 


1.93 


436391 


AJ227892 


Hs.146274 


ESTs 


112 


112 


417430 


AA984546 




gbamSSeOasI Stralagene schteo brain SI 


111 


117 


407443 


AF227138 




gb^Homo sapiens candidate taste receptor 


111 


136 


428434 


AW363590 


Hs.65551 


Ham sapiens, Similar to DMA segment, Ch 


110 


1.29 


409432 


049372 


Hs.64460 


smaO Indudbie cytokine subfamily A (Cf 


110 


110 


456814 


AV653110 


HS.1066S0 


hypotiietical protein FU20533 


110 


100 


440273 


AI805392 


Hs.325335 


Homo sapiens cONA:FU23523 lis. ctonaL 


110 


1.37 


450271 


AI693900 


H8.2Q0920 


ESTs 


109 


134 


432222 


A1204995 




gb:an03c03.x1 Stratagena schizo brain SI 


109 


1.40 


458208 


A)380016 




ESTs, Weakly simBar to T4S4_HUMAN TRANS 


108 


100 


405600 


A1B2S942 




C12001673:gi|9631264|re1|NP,048045.1|or 


107 


1.97 


434654 


H5.139366 


Homo sapiens ctone L5 polyadenylaled HER 


107 


152 


439261 


Ai126020 


Hs.145674 


basic transcription factor 3 


105 


1.45 


421515 


Y11339 


Hs.105352 


GalNAc alpt>a>2. 6-sia!yltran5fef ase 1. 1 


104 


104 


457741 


BE044740 




gb:!inf)55g10j<1 NCI„CGAP.RDF1 Homo sapiens 


104 


104 


423161 


AIJ049227 


Hs.124776 


downstream of cadtierin 6 (by 3.3kb) 


102 


1.33 


412505 


AA974491 


Hs.21734 


ESTs 


102 


102 


443180 


R15875 


Hs.258576 


daudin 12 


102 


102 


431605 


AW972407 


Hs.124370 


gb:EST384498 MAGE resequences. MAa Homo 


102 


102 


415938 


BE383507 


H5.78921 


A kinase (PRKA) anchor protein 1 


102 


117 


452S71 


W31518 


H3.34665 


ESTs 


102 


109 


405061 






Target Exon 


101 


152' 


439343 


AF086161 


H5.114611 


hypothetical protein FU11808 


101 


141 


402327 






Target Exon 


100 


144 


418786 


AI796317 


Hs.203594 


Homo sapiens unchvaderted gasldc pro 


100 


100 


447343 


AA256641 


Hs^6694 


* ESTs, Highly similar to S02392alpha-2^n 


100 


100 


438634 


AW340400 


Hs.126728 


ESTs 


1.99 


143 


416127 


N49843 


Hs.79022 


GTP-binding protein overexpressed in ske 


1.97 


1.41 


423961 


D13G66 


Ks.136348 


peno8iin(Uor^/os) 


1.98 


1.48 


430397 


AI924533 


H8.10S607 


tdcartxjnato transporter related protdn 


1.96 


1.31 


411010 


AW813339 




gb:MR3-ST0192-1012g9^13K:05 ST01S2 Homo 


1.96 


173 


439628 


W81007 


Hs.58628 


ESTs 


1.96 


1.28 


444301 


AK000136 


Hs.10760 


aspof1n(LilR class 1) 


1.98 


1.58 


431726 


NiyL015361 


Hs.268053 


KIAA0029 protein 


1.95 


1.72 


410418 


031382 


Hs.63325 


IransmemtNvie protease, serine 4 


1.95 


1.87 


452814 


AI092790 


Hs.334703 


hypothetical protunFU1452g 


1.95 


1.06 


417562 


AW888754 


Hs.134128 


crystaliin, gammas 


1.95 


114 


424460 


AA341442 


H&.205299 


ESTs 


1.94 


1.94 


404342 


AI051607 




C7002192*:gir7293207tgbtAAF54404.11 (AEO 


1.92 


1.32 


443320 


Hs.16335 


ESTs 


1.91 


118 


449780 


AA443241 




ribosomal protein L44 


1.90 


1.76 


423337 


NM.004655 


H3.127337 


axin 2 (cotiductin, axil) 


1.89 


116 


434416 


AAe05903 


H5.5g49B 


cell diviskm cyde 2-tike 5 (cholineste 


1.89 


104 


457505 


AL044659 


Hs.43791 


ESTs 


1.89 


134 


425912 


AL137629 


Hs.162189 


serine/threonine kinase with Obi- and pi 


1.88 


1.26 


413585 


AI133452 


Hs75431 


fibnnogen, gamma polypeptkle 


1.88 


1.88 


428231 


U17989 


Hs.183105 


nuclear autoanligen 


1.88 


1.88 


426832 


AA578229 


Hs.324239 


ESTs. Moderately sinnlar to ZN91_HUMAN Z 


1.88 


1.88 


404429 






Target Exon 


1.88 


118 


447644 


AW861622 


Hs.108846 


Homo sapiens cDl^ FU 14934 fis. done PL 


1.88 


ao4 


406641 


AJ235667 




gb:Homo sairfens mRNA for immunoglobulin 


1.86 


157 


417059 


AL037672 


Hs.81071 


extraceOutar matrix protein 1 


1.88 


1.48 


454565 


BE141231 




gl7:MRO-HT0075<)81 199-003^ HT0075 Homo 


1.66 


1.21 


415115 


AA21422B 


Hs.127751 


nypODieucai proton 


1.85 


1.23 
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432305 


Y1B207 


(te.303090 


pnitein phosphatase 1 , regulatofy Cmhib 


1^ 


1.45 


414085 


AA114016 


HS.7S746 


aldehyde dehydrogenase 1 fan^, member 


1.84 


1.44 


403344 






NM.000341:Komo sapiens sohAe carrier fa 


1^ 


1.84 


447245 


AK001713 


Hs.17860 


hypoUwQcal protein FU10B51 


1.84 


133 


446006 


Nh^004403 


Hs.13530 


deafness, sutKOmsl doirinsnt 5 


1.84 


2.02 


401593 






Target Exon 


183 


2.34 


434392 


AW983709 


Hs^24 


Homo sapiens cONA: FU23435 fis. ctone H 


m 


Z12 


406461 






hypothetica) prot^ done 24751 


1^ 


101 


455657 


BE06S209 




gb:RC1'8T0314-31030W)1 W)12 BT0314 Homo 


1.83 


1.2S 


400609 






C10001147;gll12698926|gb|AAK01739.1IAF33 


1.82 


108 


42^ 


AI868872 


Hs.282804 


hypothetica! protein FIJ22704 


1.81 


1.14 


422867 


L32137 


Hs.1564 


carlOage oHgomerlc matrix protein (pse 


1.81 


1.22 


417412 


X16895 


HS.B2112 


interteukin 1 receptor, type 1 


1.81 


2.10 


426521 


AF151445 


Hs.170219 


hypQthetfcai protein 


1.81 


2.08 


429610 


AB024937 


Hs.211092 


LUNX protein; PLUNC (palate lung and nas 


1.81 


1.64 


423915 


AF039018 


Hs.135281 


a)pha-actinin-2-8S50ci8ted UM protein 


1.80 


1.34 


439606 


W79123 


Ks.58561 


G protein-coupled receptor 87 


1.80 


1.80 


459189 


A1909090 




gb:lL-6T198>01049»07 BT198 Homo sapten 


1.60 


1.80 


412429 


AV6S0262 


Hs.75765 


GR02 oncogene 


1.80 


2.55 


402674 






Target Exon 


1.80 


141 


431130 


NM.006103 


Hs.2719 


KE4; epididymis-specltic whey-addic pr 


1.79 


1.57 


454624 


AW833646 




gb:QV^TT0008-1611994»3^09TT0008HQnio 


1.78 


1.94 


401677 






BAI1>associated protein 3 


1.78 


128 


426291 


U58913 


Hs.169191 


small inducible c^loldne subMly A (Cy 


1.78 


1.53 


430028 


BE564110 


Hs.227750 


Target CAT 


1.78 


1.59 


44S988 


BE007663 


Hs.13503 


inacthration escape 2 


1.78 


110 


452272 


AW292249 


H&2S2739 


hypothetical protein DKFZp434P03l6 


1.78 


108 


416205 


L21715 


H163760 


troponin 1, stetetat. Hast 


1.78 


170 


400425 


AY004252 


Hs^87385 


PR domain containing 12 


i.n 


102 


400419 


AF084545 




Target 


1.77 


167 


447169 


AI989603 


HS.1572B9 


E8T8 


1.77 


121 


452359 


6E167229 


Hs.29206 


hypolhe6cdl protein M6C14376 


1.77 


112 


431941 


AK000106 


Hs.272227 


Homo sapiens cDNA FU20099 fis, done CO 


1.76 


154 


432808 


NM-015985 


Hs.278973 


angtopoietin-3 


1.76 


1.76 


437400 


AB011542 


Hs.55g9 


EGF-SksKtomain, multipla 5 


1.75 


lOO 


422487 


AJ010901 


Hs.198267 


mudn 4, tracheobronchial 


1.75 


1.47 


444986 


AI204197 




ESTs 


1.75 


148 


451027 


AW519204 


Hs.40808 


Homo sapiens, Similar to RIKEN cDNA 2810 


1,74 


1.69 


413524 


BE145B37 




gb:MRO-HT0208*101299-202-c07 HT0208 Homo 


1.74 


1.74 


409099 


AKD00725 


Hs.50579 


hypolheticai protein FLJ20718 


1.74 


126 


405579 






C22000151:gi|8806921HNP.004165.1| so 


1.74 


112 


405797 






CX001015:gllt1322384fembfCAC16667.1( (AJ 


1.73 


1G6 


405159 






ENSP0000024333r;CONA FU13984 fis. don 


1.73 


101 


450569 


AW192334 


H5.3821B 


ESTs 


1.73 


108 


450912 


AW939251 


Hs.25647 


v-fos FBJ murine osteosarcoma viral onco 


1.73 


1.24 


445261 


T79759 


Hs.250651 


ESTs, Wealcly similar to 138022 hypotheti 


1.73 


152 


454231 


AW450669 


Hs.45068 


hypothetical protein DKFZp4341143 


i.ra 


1.64 


429500 


X78565 


H&289114 


hexabrachion (tenascin C. cytotactln) 


1.72 


1.37 


422259 


AA307584 




8b:EST176498 Ooton cancfnoma (HCQ cell 


1.72 


1.72 


456034 


AW4S0979 




gb:UI-H.BI3^a^12-(MA.8l Na.CGAP.Su 


1.72 


1.34 


451B62 


H09260 


H8.32333 


ESTs 


1.71 


116 


403520 






Target Exon 


1.71 


1.39 


456596 


AA291834 


HS.789S0 


trandied ch^ kelo add dehydregenese E 


1.71 


126 


426603 


AA382291 




gb:EST95683 TesSs 1 Homo sapiens cDNA 5 


1.70 


1.70 


418387 


R18085 


H5.22279 


gb7g16bUr1 Scares infant brain INIB H 


1.70 


1.70 


433417 


AA587773 


Hs^ 


Homo sapiens. Similar to RIKEN cONA 5630 


1.70 


1.87 


402538 






C1001634:gI|12621136|ref|NP-075245.1| Ba 


1.69 


1.57 


414844 


AA296874 


Hs.77494 


deaxyguanoeine Idhase 


1.69 


106 


418478 


U38945 


Hs.1174 


^n-dependenl kinase Inhifailor 2A (me 


1.69 


1.63 


446553 


AB021179 


H3.15299 


HMBA-inducIbie 


1.68 


102 


456235 


AA203637 




Qb2x5eb12.n SoaresJetalJiverjBpleen. 


1.68 


112 


424580 


AA446539 


Ks.339024 


EST8.Weakiy6imilartoA46010X-iinked 


1.68 


103 


433930 


AA620338 




ESTs 


1.68 


128 


404151 






Target Exon 


1.68 


1.80 


429392 


AL109712 


Hs.296506 


HoiTto sapiens mRNA fiiil length insert cDN 


1.67 


118 


430070 


AF197927 


Hs.231967 


A1X1 fused gene from 5q31 


1.66 


116 


400496 






ENSP00000224716*:GTP-binding protein SAR 


1.66 


113 


413464 


AL121500 




ESTs 


1.66 


103 


411188 


BE161168 




gb:Pltmfr0425-17010(M)02-a10 HT0425 Homo 


1.66 


112 


446281 


K69416 


H5.14606 


hypothetical protein FU20271 


1.65 


128 


443282 


T47764 


Hs.132917 


ESTs 


1.65 


104 


423217 


NM_000094 


Hs.1640 


collagen, type VII, alpha 1 (epldennolys 


1.65 


1.67 


453355 


Ara5374 


HS31412 


myopodin 


1.65 


1.66 


432375 


BE538069 


Hs^962 


S100 caldunvMing protdn P 


1.65 


1.54 


437929 


T09353 


Hs.106642 


ESTs. Weaidy sSmSar to T09052 hypotheti 


1.65 


104 


410295 


AA741357 




pidogen (enacfki) 


1.64 


2.30 


437767 


AAB30103 


Hs.293331 


ESTs 


1.64 


1.26 


416560 


T61572 


Hs.79385 


Human dona 23574 mRNA sequence 


1.64 


3.38 


450795 


AW173371 


Hs.60435 


ESTs 


1.64 


1.64 


421847 


NlyL014717 


Hs.108884 


KlAA0390gen8pfQdud 


1.64 


175 


403010 






C210001S2:gil6226483)5plQ521181YlKK)3.ERWS 


1.64 


103 


406387 






Target Exon 


1.64 


1.78 


440423 


AW2939dS 


Hs.192277 


ESTs 


1.63 


2.05 
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444381 BE387335 H5.283713 hypcrfheScal protrin BC014245 

442802 AL133035 Hs^28 hypolhefcd protein DKFZp434G171 

442424 AI342715 Hs.129569 ESTs. Moderately sini)arbB34087hypot 

402885 TagelExon 

408788 AA773187 H5.294027 ESTs 

448719 M033627 Hs^1858 trtnudeotide repeat contMig 3 

414684 AW630023 H5.76893 34iydfT»yt}utyrate dehydrogenase (heart, 

408838 AA827589 Hs.153 libosoma) protein L7 

441600 AA939347 Hs.127223 Homo sapiens cysteine knot protein (ZSIG 

420693 NM.001972 Hs.99863 elastase 2, neutrophil 

412649 NM-002206 Hs.74369 integrin, atpha7 

432331 W37B82 Hs.274368 MSTP032 protein 

454034 N^4_000691 H5.575 aldehyde dehydrogenase 3 lamOy, mento 

400279 NM^004581*M)mo sapiens Rab geranytgeran 

437865 AI472305 Hs.19565 ESTs 

429165 AW009B86 Hs.116258 prostata cancer assodaled protein 1 

442993 BE018882 Hs.1 66196 ATPase, Pass I. type 6B. member 1 

410684 AA088500 Hs.170298 ESTs 

433149 BE257672 Hs.42949 hypothetic^ protein HES6 

448429 D17408 Hs.21223 caiponin 1. tiasic. smooth musde 

426457 AW894867 Hs.22660 chimerin (chimaeftn) 1 

427654 AA410183 Hs.137475 ESTs 

411662 D60541 Hs.285519 Komo sapiens cONAFU11904fis. done HE 

440383 AA884208 Hs.30484 ESTs 

406690 M29540 H$.220S29 cardnoembryonic antigen^ed celi ad 

424687 J05070 Hs.151738 matrb(m6taiioproteinase9(geiatinaseB 

407857 AI928445 Hs.92254 synaptotagmlnte 2 

411573 AB029000 Hs70823 K1AA1077 protein 

433336 AF017g86 Hs.313&6 secreted frizzlerkelded protein 2 

428471 X57348 Hs.184S10 sUatilin 

429249 XB1479 Hs^375 egMil(e module oontaMng,mucin^ 

407966 AA29S052 Hs.38S16 Homo sapiens, done MGC:15B87. mRNA. com 

418026 BE379727 Hs.83213 fatty add binding protein 4, adipocyte 

430469 AW603667 Hs.288742 Homo sapiens cDNA:FU22712 lis, done H 
44551 1 AA846512 Homo sapiens cDNA FU14459 fis. done HE 

404501 nudeoside phosphoiylase 

429107 AI470451 Hs.99075 ESTs 



417259 


AW903838 


H5.81800 


chondroHin sdfate proteoglycan 2 (vers 


429597 


NM_003816 


Hs.2442 


a disintegrin and metaUoprolelnase doma 


458091 


AF150286 




gb:AF1 50286 Human mRNA (rom cd34 stem ce 


439280 


AI125435 


H$.123654 


ESTs 


428096 


AW291771 


Hs.42239 


Homo sapiens, done IMAGE:3868S89. mRNA, 


414221 


AW450979 




gb:U1.H-BI3^ta«12^Ul8l NCLCGAP.Su 


451712 


AA019290 


Hs.110489 


ESTs 


402487 






Target Exon 


415274 


AF001548 


KsJ8344 


myosin, heavy polypeptide 11. smooth mus 


452887 


AI702223 


HS.1072S3 


hypothetical protein OKF2:p761F241 


410253 


T51623 




ESTs 


408741 


M73720 


Hs.646 


carboxypeptidase A3 (mast cdl) 


432985 


T92363 


H5.178703 


ESTs 


422166 


W72424 


Hs.112405 


S100 ca1dum4)lnding protein A9 (csAgran 


429259 


AA420450 


H5.292911 


RakophiSn 


429289 


AI400746 


Hs.62187 


pbosphaOdyRnosttol glycan, class K 


441457 


AW996651 


Hs.43838 


ESTs 


433365 


AF026944 


Hs.293797 


ESTs 


425483 


AF231022 


Hs.158159 


FAT tumor suppressor (Orosophila) homdo 


424386 


BE146577 


Hs.285132 


ESTs 


429655 


U48959 


HS.211S82 


myosin, Rght polypeptUo kinase 


442391 


AW450544 


Hs.220751 


ESTs 


414341 


D80004 


Hs.75909 


K1AA0182 protein 


436222 


AI20B737 


Hs.122810 


Homo sapiens cDNA FU1 1489 fis. done HE 


442264 


AI27e777 


Hs.263455 


ESTs, Weakly similar to ALUUHUi^AAN ALU S 


443878 


AW292499 


Hs.139709 


hypothetical protein FU12572 


430152 


AB00132S 


Hs.234642 


aquaporin3 


447752 


M73700 


Hs.105938 


lacto transferrin 


404455 






opkiid receptor, kappa 1 


424106 


AA412442 


Hs.98132 


ESTs 


433095 


AK001092 


Hs.302480 


Homo sapiens cDNA FLJ1023O fis, done HE 


409361 


NVL0OS982 


Hs.54416 


sine ocub honrteobox (Drosophila) homdo 


456256 


AB0OO45O 


Hs^l 


vaodnia related kinase 2 


439310 


AF066120 


Hs.102793 


ESTs 


407102 


AA007629 




0!ycero)-3-phosphate dehydrogenase 1 (so 


437981 


AA774445 


Hs.145365 


ESTs, Weakly similar to K1AA1397 protdn 


421485 


AA243499 


Hs.104800 


hypothetfcd protein FU10134 


414799 


Ai752416 


Hs.77326 


(nsulin-like growth factor binding prote 


453864 


AW021407 


Hs.21068 


hypothetica) protein 


401067 






ENSP00000252105*:CONA FU12240 lis. don 


456054 


BE313241 




gb:601151545Fl N1H_MGC_19 Homo sapiens c 


402324 






C19001982:gi|3043838|db|lBAA25483.1| (AS 


417733 


A1JD48678 


HS.S2503 


H^piens mlU4A for 3'UTR of unknown plot 


457734 


BE394365 


Hs.38750 


hypothetical protebi FU11S26 


402013 






Target Exon 


429295 


AA6B2a77 


Hs^6 


ESTs. Moderately slndltf to AliiS^HUMAN A 
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430920 

409368 

431958 

427719 

433430 

423790 

444083 

433256 

420859 

456664 

438158 

409B83 

4S2316 

413048 

457462 

452679 

401116 

419618 

452281 

426125 

428330 

418742 

416335 

408404 

448175 

431646 

459557 

449925 

442321 

454843 

410281 

402998 

443709 

435259 

454407 

453359 

434126 

417944 

442316 

438330 

410935 

455885 

405550 

451385 

424925 

431022 

439781 

429379 

435310 

430702 

451331 

459198 

442344 

402917 

418211 

437158 

427373 

433911 

402504 

409465 

449426 

405491 

406685 

442410 

407701 

400818 

406475 

426935 

414171 

444195 

447918 

421314 

412992 

401025 

4S2862 

425308 

402308 

428415 

407242 

410741 



U96402 

AA071059 

X63629 

AI393122 

A1863735 

BE152393 

Ai123195 

AW604447 

AW468397 

AW963354 

A1796556 

AW452419 

AA29B484 

M93221 

AL133573 

Z42387 

AA528295 

T33500 

X87241 

L22524 

AW451197 

R63267 

AW192518 

BE296174 

68019924 

N58315 

AI342493 

AF207664 

AWB34S36 

AF076612 

AI082692 

AA152106 

AW578420 

AA448787 

AI138589 

AU077196 

Z75331 

AW450572 

BE067395 

BE153524 

AA017656 

NM-002432 

AA490815 

AA845538 

NhU)14840 

AA705075 

U56979 

AK002039 

AI086347 

AI022925 



H5.248132 

Hs.2877 
H5.134726 



Hs.339408 
H3.100000 
Hs.334409 
Hs.167884 
Hs.296098 
Hs.61265 
Hs.75182 
Hs.272312 
H8.83883 



H5^792 

Hs.166994 

HS.225S 

Hs.113418 

H3^8399 

HS.22S160 
Hs.271580 

HS24192 
Hs.8230 
Hs.258549 
Hs.166186 

Hs.134662 

Hs.4859 

HS.11B843 

Hs^4872 

Hs.1 18205 

Hs.82985 

Hs.6217 

Hs^7316 

Hs.66881 



HS.153B37 
Hs^51 

Hs^0598 
Hs.169536 
Hs.278568 

Hs.151138 
Hs.79368 



BE244746 Hs.247474 
AW090198 

AB007972 Hs.130760 

AI923092 Hs^ 

AW393810 H8.78054 

T92251 Hs.198862 

M18728 
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Hs.8198 


405536 






424736 


AF230877 


H3.152701 


423017 


AW178761 


Hs^2794d 


425371 


049441 


HS.15S961 


429925 


NM.000786 




406711 


N25514 


Hs.77385 


409407 


AWg67370 


Hs^2655 


406109 






418597 


AK001678 


H18G337 


403162 






411020 


NM.006770 


Hs.67726 


407225 


J04617 




416955 


AW889150 


K5.80595 


4S19B9 


AF169787 


Hs.27413 


410276 


AI554545 




446921 


AB012113 


Hs.16530 


406712 


M31212 


Hs.77385 


406773 


AA812424 


Hs.76067 


452082 


N51905 


H5.125133 


419150 


T29618 


H5.89640 


417204 


N81037 


Hs.1074 


408339 


R97502 


Hs^0443 


400247 






430030 


BE300094 


Hs.227751 


442275 


AW449467 


Hs.54795 


406786 


AW161678 


Hs.111334 


439403 


BE265745 




428043 


T92248 


Hs^40 


406722 


H27498 


Hs.293441 


432242 


AVW22715 


Hs.162160 


450724 


R55428 




424125 


M31669 


Hs.1735 


432077 


AL1 34685 




427687 


AW003887 


Hs.1570 


435256 


AF193766 


Hs.13872 


420026 


At831190 


HS.16G676 


455128 


AW681555 


H&314372 


410685 


AA497117 


Hs.56893 


401404 






449625 


NI\/L0142S3 




443458 


R05385 


H1143509 


452744 


A1267652 


Hs^46107 


418355 


U2563 


Hs.1165 


447947 


N33033 


Ks.270215 


41S236 


AA330447 


Hs.135159 


455047 


AW852530 




440400 


AA994364 


Hs.125594 


444963 


AI916973 


Hs.213603 


410934 


AW811114 




442649 


R10099 


Hs.269805 


420407 


AA814732 


HS.14S010 


454600 


AW810001 




418454 


AA315308 


H8.195670 


459045 


N69101 


Hi.40730 


455500 


AW963562 




411745 


AW967826 





KM.005569':Homo sapiens UM domain Unas 


1.12 


199 


ghjcosan^ (N'acety1>-&^alasQ (Sanf 


1.12 


108 


Ct5000305:gI)3806122|9blAAC69l98.1| (AFO 


1.11 


Z01 


Homo sapiens RtRNA expressed only In plac 


ill 


zoo 


ESTs 


1.11 


2.01 


hypotheOcal protein FU231S3 


1.11 


105 


ESTs, Weakly similar to BING1 [Ksapiens 


1.11 


103 


gbiH^ens mitKhomlrid 1 6S rRNA gene 


1.10 


112 


SH3^1omain tnntfing protein 4 


1.10 


104 


inyosin, figU polypepMe 6. alkaS. smo 


1.10 


107 


ESTs 


1.10 


130 


fibronecQn 1 


1.09 


Z12 


DMA fragmentation tactor. 45 kO. dplia p 


1.09 


110 


Homo sapiens mRNA: cONA OKFZp£B6J021 (fr 


1.09 


104 


NM.003213':Homo sapiens TEA dom^ famil 


1.09 


119 


ESTs 


109 


122 


potassium voltagd^ated channel, subfaihi 


109 


126 


bsdWiXe growth factor Ijinding prate 


109 


128 


gDoma pathoganesis-related pmtein 


108 


164 


PDZ and UM domain 1(ellln) 


107 


102 


ESTs 


107 


108 


ESTs 


107 


111 


Target Exon 


107 


111 


gelsoim (amytoidosis. Rni^ type) 


107 


100 


Homo sapiens cONA: aJ22667 fis. dOne H 


107 


113 


IVP0th8tfcalproi^FU13188 


107 


2.21 


zlhc finger piotein 204 


106 


104 


NM.005605:Homo sapiens 26S pmteasoma^ 


108 


120 


mfcrotubuJe-intBracting protein tttat ass 


1.06 


106 


serine (or cysteine) proteinase Intitbito 


106 


106 


mesotttelin 


108 


1.27 


cytochrome P450, 51 (tanosterol 14^p)u 


106 


137 


myosin, light polypeptide 6, alkali, Bino 


106 


105 


Homo sapiens cDNA RJ132Bg fis, done OV 


105 


100 


Target Exon 


1.05 


2.04 


stmilar to DNAKlIrected RNA polymerase ) 


105 


167 


C200023V:Oli9602031lgl)|AAF99597.11AF239 


105 


107 


macrophage receptor with coHagenous str 


105 


107 


eukaryotic translation elongaOon factor 


105 


102 


NhL004552*iiomo sapiens NADH dehydiogena 


105 


110 


adaptor protein containing pH domain, PT 


104 


112 


angiopdeb'n-2 


1.04 


1.04 


small WwMq cytokine subfaniy A (Cy 


104 


1.04 


myosin. Bght potypepUde 6. dkaB, smo 


103 


105 


heat shock 27kO protein 1 


103 


110 


hypotheOcal protein inj22501 


103 


101 


TEK tyrosine kinase, endotheHal (venous 


103 


124 


surfactant, pufanonary-assoclated pnteln 


102 


100 


senlrin/SUMO^pedic protease 


102 


119 


Eos Control 


102 


104 


lecHa g^actoside^ng. soluble; } 


101 


101 


ESTs 


101 


104 


ferritin. Bght polypepGde 


101 


1.06 


ESTs, Weakly storilar to ALUC_HUMAN llll 


101 


111 


uteroglobin 


100 


106 


Homo sapiens SNC73 protein (SNC73) mlViA, 


100 


102 


ESTs. Weakly similar to ALU4_HUMAN ALU S 


100 


116 


gb:yi79b05.r1 Soaies breast 2NbHBst Homo 


100 


0.99 


inhibin, beta B (activin AB beta poiypep 


1.00 


1X8 


gb:OKFZp547M126_r1 547 (synonym: hftrl) 


100 


105 


histamine receptor HI 


100 


100 


cytokine^ protein C17 


1.00 


100 


ESTs 


100 


100 


EST 


100 


100 


ESTs. Moderately stmilar to ALU1 JIUMAN A 


1.00 


1X0 


Target Exon 


100 


100 


odz (odd Oz/tennn. DrosophQa) bomok}g 1 


1.00 


100 


hypothelteal protein FU21924 


100 


lOO 


Homo sapieris mRNA; cDNA D}^FZp434E082 (fr 


100 


1.00 


ATPase, H7 transport^, nongastric, alp 


100 


100 


ESTs 


100 


100 


Homo sapiens cONA FU11481 lis. done HE 


100 


1.00 


gb:PM1.CT024W71099WI^06 CT0243 Homo 


100 


100 


ESTs. Weakly similar to T25472 bypolheti 


100 


100 


ESTs 


100 


100 


gb:MR2-STD131.11119M)15^ST013l Homo 


100 


100 


ESTs 


100 


100 


Gpopdysaccarid&cpecHk: response 5-8 


100 


1.00 


gb:MR4^T0124.27030W»5*ll ST0124 Homo 


100 


100 


hypothetical protein FU14991 


100 


100 


ESTs 


100 


1X0 


gb:EST375655 MAGE resequences. MA6H Homo 


100 


100 


gb:MR0^N003»>30030(MK)t^ SN0039 Homo 


100 


100 
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429932 


AirasOOS 


Hs^1588 


ESTs 


432365 


AK001106 


Ks274419 


hypoMcal protein FLJ10244 


415817 


U88967 


H3.78867 


protein tyrosine phos^rfiatase, receptor4 


431374 


BE2S8532 


Hs.251871 


CTP synthase 


443162 


T49951 


Hs.9029 


DKFZP434G032pn)tein 


432128 


AA127221 


Hs.296502 


ESTs 


451838 


AW005866 


Hs.1 93969 


ESTs 


438414 


AA806794 


Hs.131511 


ESTs 


435872 


M701357 


H5.1 92759 


ESTs 


425211 


M16667 


Hs.1667 


progastricsin (pepslnoQon C) 


424001 


W67683 


Hs.137476 


patsmally expressed 10 


416869 


AW516565 




gbscqOltfOSJcl So3res_NHCeCjcervic3l_tumo 


458659 


AW749895 


Hs,332520 


Homo sapiens mRNA; cOMA OKFZp434A1014 (f 


418458 


AA332941 


Hs.85226 


Gpasa A, lyscsomai acid, chdestero) es 


432728 


NM_006979 


Hs^78721 


KLA dass (1 region expressed gene KE4 


432093 


H28383 




gl):ylS2c03j1 Sooes breast 3NbHBst Homo 


452239 


AW379378 




protebi ^losine phosphaUse, receptor t 


403167 






Target Exon 


402209 






Target Exon 


453500 


AI47B427 


Hs.43125 


esophageal cancer related gene 4 protein 


424090 


X^599 


Hs.139262 


XiAP associated factor-l 


432816 


N38913 


Hs.221575 


ESTs 


451779 


AW96B616 


Hs.296234 


ESTs, Vfeakly sMiar to T31613 hypotheS 


406851 


AA609784 




megor histocompalUIity complex, dass 


427698 


AW972594 


HsJ3S499 


ESTs 


44000S 


AK000517 


Hs^44 


((ALP2 pretein; PYRIN-Containing APAF14 


427383 


NML00S411 


Hs.177582 


surfactant, pufimonaiy-as sedated protein 


426024 


Z43405 


Hs.75668 


Homo sapier^ Siniarto RIKEN cONA 1700 


400986 






NM_024085':Homo saplms hypoMcal pro 


430353 


AVV952337 




cttrats syntiiase 


404975 






uncharacterlzed hypoihdamus protein HTO 


406673 


M34998 


Hs.1 98253 


ms^ histocQmpaKbifity complex, class 


431323 


AW970623 




gb:EST382705 I^GE resequences, MAGK Homo 


404926 






Target Exon 


432297 


AW663632 


Hs^85625 


Homo sapiens mRNA; cDNA OKFZp434A1 19 (fr 


437601 


AA761545 


Hs.248344 


ESTs, Weakly slnto to ALULHUHWN ALU S 


421566 


NIWL000399 


Hs.1395 


early growth response 2 (Krox-20 (Drosop 


406646 


M33600 


Hs.308026 


major NstocompaianTity complex, class 


442195 


NlyL001430 




endothelial PAS domam protein 1 


415457 


AW081710 


Hs.7369 


ESTs, Weakly similar to ALU1JHUMAN AtOJ S 


413916 


ht498l3 


HSJ5615 


apolipoprolain C-11 


453716 


AA037675 


Hs.152675 


ESTs 


437802 


A147S995 


Hs.122910 


ESTs 


422282 


AF010225 


Hs.114309 


apoOpoproteln L 


410361 


BE391804 


Hs.62661 


guanytats binding proton 1, interferort- 


424917 


A)836208 




hypolhetical protein FU23049 



1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


0.99 


Z33 


0.98 


3.26 


a97 


3.61 


a97 


0.96 


0.97 


1.08 


0.96 


2.25 


0.96 


Z07 


0.94 


Z18 


0.94 


1,31 


0.94 


Z12 


0.94 


2.19 


0.94 


0.79 


0.94 


2.06 


0.92 


2.04 


0.92 


0.74 


a9i 


2.11 


0.91 


2.15 


0.91 


2.14 


0.69 


1.04 


0.69 


0.90 


0.88 


Z42 


0.87 


1.14 


a87 


Z04 


a87 


2.10 


0.86 


2.28 


0.88 


Z50 


0.86 


1.94 


0.80 


2.08 


0.79 


Z01 


0.77 


a86 


0.77 


Z10 


0.76 


2.66 


0.76 


1.09 


0.76 


2.00 


a76 


0.78 


0.73 


Z06 


0.73 


2.10 


0.70 


Z08 


0.66 


2.95 


0.54 


2.11 


0.53 


0.53 



TABLE 34B: 

Pkey. Unique Eos probeset Identifier number 
CAT number Gene cluster number 
Aooesston: Genbank accesskm numbers 



Pkey 

442006 
420195 



CAT Number Aocesston 



437620 9575.20 



1239046.1 AW975183 AA973583 AI365103 AI699495 AI301787 

28714 1 AK002039 AL117524 AV714494 AW954901 AIJ045243 BF955185 AU137860 AWB80615 AW880496 AA256290 BE767078 N44348 AI886676 
AA455877 N65571 AA999864AU1 57344 A1B17146 RS4821 BE223107 AA455880 AI3S5752 BF589210 N63487 A1924033 AI923020 AI306145 
AI919421AI5B4169AI250173AM40227AABe9698AW244040A1358104Al570333A!418315N94787R72348N94TO 
AW005707 N98831 BF001047 BF588891 AA318076 AW601474 AW883910 H10056 N53481 BE838574 BF909132 BI084973 BG257295 BG818471 
BE348449 AI420623 AW271213 BE048764 W44682 AI887849 AW903942 AA975919 AA312915 BF948057 R55120 H10110 BI045196 AW880645 
AW976930AW292808 AW451796 BF514112At806378AI658903AI769457 AW593455AI625525AI538551 A1650509 AA761825 AA973287 
AA861483 W73055 AI735361 W50499 W76653 BG959557 
430712 301999 1 AW044647 AI670953AI656180AA484715AI659205BF923472 

411880 1139083.1 BE088101 T05990AW872477 ^ 
451149 4941 2 AF231512AW300273BG779015AW510935AI989816AA137069AI74B876AW150861 AI862628AI805872AI575382BE855437AW044703 

AI677769 AA886718 AI753144 AA626B85 A1018092 AI263010 AW0261 73 BE221 1 38 AA25626B AW571932 AW276137 Al^^^ 
AA977716 AI302589 AA348340 AI720838 AI31 1733 AA015867 N73713 AL047586 AW840354 AA256196 AW840357 AW840504 T35664 Z36755 
AW954421 AA247424 AI056930 T31 380 BI910428 HB8489 BG675223 AA443427 BE879501 AA478530 R72977 AA2g8566 BF792417 AA356982 
BF922499 BE764808 BE56S636 BF9039B8 BF331881 N42207 BG623760 B6611090 BF735387 6E697757 BE697755 BE718853 N78560 AI984095 
AA137140 AA053711 N59865 AI078134 AA643796 T57803 AA01B642 N66799 AI004600 BG896323 BF895104 M73684 N73806 N73B1 1 AW900287 
AA01B641 

459702 539529 1 B6207209BE1 66299 A1204995BG199355AWg69908AA528756AW440776B1044354 
417563 2243443.1 AA203701 R86895 

431089 125941 2 80840189 AW063489AA71S980 6F001091 BF880066AA666102AA621946AA491 826 

455797 1511159.1 BE091833BE091874BE091871 

413059 1488711.1 BE063078 BE151503 BE151498 

417430 40161J2 AW872732AW827432AA139662AA610519R54983 

432222 539529 1 B6207209 BE166299 A1204995 BG199355 AW969908 AAS28756 AW440776 81044354 
458208 45807 4 AI990640AI38DOt6BM273298BM273080 

457741 12074? 1 BI01796BBBJ44740B1017768AW827360BF380597BI017970BF746974BF380S82BF380592BF908552BF907924B 

6F380634 BE1G6S81 BE161439 BF9086Q6 BI017961 6E044718 AW627623 BFg07758 BI017967 AW827621 AA653908 81017765 BI017955 
B1017960 61017798 
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5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



411010 
449760 



406641 
454565 

455657 
4591B9 
454824 
444986 
413524 
422259 
456034 
426603 
456235 
433930 
413464 
411168 
410295 



.458091 
414221 
410253 
407102 



456054 
4093G8 
433430 
423790 
444083 
419618 
408404 
459557 
455685 
451385 
439781 
451331 



406685 



432810 



438874 
400221 
437751 



1066474.1 
31099J 



0«0 

1061836J 

1490185.1 

MH1945J 

1073655.1 

704733.1 

1516859.1 

140437,1 

685586.1 

1299162.1 

1979764.1 

19851J5 

415532-1 

1072487J 

2817.1 



400279 2140.1 



445511 9560.8 



452694.1 
685586.1 
132134.1 
7177.2 



1263570J 

110612^1 

2181751.1 

886344.1 

10908J2 

252691.1 

658475.1 

859794.1 

1524553.1 

8502^.1 

2592493.1 

28714.1 



437158 Sg575.1 



0.0 

MH497J 



443144 t6112J3 



101919.1 



52147.1 
9287.3 
643238.1 



AW813381 AW816094 AW813357 AW814469 AVVB13293 AW816099 AW813295 AW813425 AW813331 AW81 3325 AW81 3351 AW813427 
AW813339 

BG721806 BG623574 AA387501 BG436403 BG619828 BG570704 BF086115 BF086118 R78932 BG620^ BG571920 BF997723 AA368244 
BG620631 BG6219S7 BG435818 BG620442 BG621518 H12650 BG573175 H61600 R67494 H01715 D78811 BG435953 BF107266 079043 R67255 
H01310 8G570941 66570693 R21776 AA327133 R32578 R30775 BGS70963 T86948 H61601 W86279 BF991 104 R21732 BFg90905 BG622861 
6E92g894 AI09Q2g0 6E929277 BE9292a4 AA3677B3 AA0825B1 076839 H76318 N91085 8E929344 083217 BE929334 H53S36 R80360 H54070 
C17064 AW962470 Ra0900 BG619698 66623948 H94918 BE929345 AA004267 6F957177 BG62068S BF086421 TB7029 C17D44 H60972 
BGS73514 AA131924 078838 BG003560 C16615 W86323 R09737 R02529 AA387502 
AJ235667 AJ235668 AJ235669 AJ235670 

BE141160 BE141231 BE141793 BE141791 6E141167 BE141807 BE1418Q6 6E141805 AW807S91 AW807590 AWB07585 AW8075d3 BE141803 

AVII8459iaBE141207BE141l68 

8B)6S209 BE065354 BE065110 BE06511 1 

AV683451 AK057494 B6718853 BM1 52866 6G390826 BE709644 AIB647Z7 BI045161 61459637 AI909102 Al909QgO BG722507 81023834 
AW833783AWB33646 AW833525AW833351 AW833526AW833825 
AW268472 AI204ig7 AW592537 
BE145894 BE145837 BM263472 

BFB21471 AW795791 BF844843 6F621371 AA307S84 AW79579D BF633724 6E154067 BE064709 

AA136653 AA136656 AW450979 AA98435B AA809064 AW236036 AA492073 BE16894S 

AA994657AA382291 

AA203837 AA832266 H67452 

AW873618AA620338 

AL527514 AI732432 AA133309 AI225224 AV700997 BF589351 AW291763 A1121 500 AA129708 
AW821260 BE162466 BE161168 

BG402352 BG545066 AA150252 AL036760 AA452480 AI033256 W66776 Wg3372 N31248 At0522l9 AI367635 W69374 N88610 R58194 Bi524854 
614971 1 1 BF940043 AI12g268 A1359798 AI056480 AA121421 A)042150 AW449003 AI418180 AMI 9420 A13S6058 BF632243 AI349330 AI359448 
W76647 6F477170 AA09gi63 BF994549 AW60B256 AA045416 H03770 AL574791 AW069455 BE302148 AW022281 AW960273 AA1 21268 
AI336371 A1989381 A)131425 A1147483 A1311537 AV\r338638 AI141649 AA709414 AI1871T7 AA780884 A)333805 AA04S312 AI823918 A)349421 
W63753 W70299 AA557276 AA299007 N98212 W74084 N24823 T54892 AA054724 W73059 A)869152 N93462 N71889 A)537432 R71628 
AA303089 A1498550 T60941 AV706417 AW067648 All 50677 AW3381 16 A1336313 AAB2625Q AI139518 AA662948 AA902723 A)970175 W68682 
AI069380 AI146372 H99951 AW183001 A1270317 AA532767 AA044727 AA931652 RB2469 AA150261 W67788 H67495 RB0715 AW149812 N76914 
AI862034 W61122AW023118W69375 T88917 T47984 N21531 R35546AA055544H15534AA688295AA090588AA044764BF994641 R79547 
N21313 BF674610 H02&74 AW97S323 R16904 AA328030 AA0S4e71 R79546 BF832310 A1249109 

Y08200 NM.004S81 BC0030S3 BE733834 BI763321 B6773890 BF091906 BI917S41 A1023762 AAS87230 BF43S086 AI264262 AI687392 A1810536 
AW589886AI244419AA749261 AA53S435AW205689 AI765770 AI765431 C02465AW305347 AI818456AA322111 AW381845 AW381829 
AV749407 AA811636 AU159893 AA603065 AA652542 AI488878 R49816 AW381863 6E38g867 BE182387 BF687771 AA527551 AA134051 
AA831504 AA134052 AI971759 AW089048 6)913532 AA367709 BG82815S 6F093014 
B»M71219BE093160BG171761Bl254009AJ905474AA453162AA82g7S9W086559AA776022AJ3n4468F5^^ 
AA443880AA476733AW970674A1393291 AAg88283AI905528AW3849S6 078656 
AF150266AV739062AA835857 

AA136653 AA136656 AW4S0979 AA984358 AA809054 AW238038 AA492073 BE168945 

AA774785 AA584875 AA57n05 AA68317B AA083204 AW352057 T92332 T51823 T02858 AA083375 T92381 

AW945170 BF930905 F33652 BG057618 A)368018 At421485 A)300352 A137dS25 A1264177 AI276281 Ai245302 M281080 A\1 90036 AW451438 

AW242903 AA910870 F22289 Fig&47 F2237S AW473816 BF445785 AA774S28 F33447 C01Q77 AW772227 F17759 H42812 R09701 AA349096 

R48772 H42892 H42537 R47698 N2B263 H25721 F32388 H43971 R48205 F21390 H4Sa09 AA007620 R47897 R83734 H45844 AW9836S3 H43970 

H42S36 H24495 R48875 H42961 H22079 R86018 

BF314481 BE313241 6)198333 BE383146 

AA0710Sg AA085201 AA085020 

AA961566 A1863735 AAS88325 

6E152393 AA330g84 BE073904 6F176271 

B1836699A)123195 

6G168298 AA247945 AAS28295 AW971284 

6F3QB&98 6E296629 BE2S87G5 AW192S18 BE299614 BE300025 BF307463 

6G623239 N58315A)524952 

6E153524 BE153576 BE153583 

AA019761 AA017656 AA017374 

AAB4S538AA890229 

AK002039 All 17624 AV714494 AW954901 AL04S243 6Fg55185 AU137860 AW860615 AW880496 AA25629Q BE767078 M44348 A13e6676 
AA455877 N66571 AA999664 AU157344 AI817146 R54821 BE223107 AA455880 A1355752 BF589210 N63487 AI924033 A1923020 A1306145 
A1919421 A)584169 A)250173 AI440227 AA669696 AW244040 AI358104 A1S70333 A)418315 N94787 R72348 H94780 BF944396 BF754698 
AW005707 N98831 BF001047 6F588691 AA318076 AW601474 AW8a3910 H10056 N63481 BE838574 BF909132 BI084973 BG257295 BG818471 
BE348449 AI420623 AW271213 8E048764 W44882 A1887849 AW903942 AA975919 AA312915 eF948057 R55120 H101 10 61045196 AW880645 
ALj050068 AA160485 AW173544 AW2S6506 AW439860 A1521563 A170252g At393606 AW1 36323 AA570109 Ht9S04 BM0219$8 6F063327 
BF593552 AA63a766 A)597717 A1807128 AA523012 AI356250 AW451 857 AA974203 AI762577 BF512552 AW007307 BE675286 AW450602 
AA982057 AW516069 AI582546 BF221924 eF222543 A1601808 AW468599 AW000738 A)866625 AW235356 6M021837 AA911956 A1880606 
W86516T03370AW611634 H41653 A1468349 H19588 AWOOOigS AW043993 R39847 
M18728 

NM.032013 AB044943 AF308609 B17692S4 A1819262 AAS45639 A1289073 6E670187 AA207127 T33098 AA633492 BE328324 AI083620 AI204945 
AA903471 AI372602 AI049836A)049737 BF000101 AI686167 6F436B96 AI659189 T32971 AA311302AW73314g A)435981 T03438 AA879208 
A1972177 T33083 AA613910 A1971019 Ai027140 AA853739 A1620528 AI637519 A)972307 BF054861 AK027665 BF0827S1 A)598127 T33663 
AA204743 6E086680 F12132 T26372 BF76S338 AA323106 AA322907 B1037062 BE315235 AV722868 BE254381 T66212 BES66142 T09034 
AA408202 BF843940 A1743855AA1 13358 BG990683BE0887296E535539 

AB058726 A1651414 BE245990 6E245765 6F439734 AA&48422 AA040639 AI340155 AA255928 AA278365 BE766296 AA28Q771 ALJ555562 
A!474638 AI863068 6)260946 AK027039 BG615852 A)698039 AA252016 AA2S8886 6E905205 AWS01 167 eF5141 17 6)857400 AW297001 
A)624923 AA12S900 AW272165 AA1 90967 AA280729 AA035532 AW129692 AA125699 6G528645 6E614599 BE464693 AI5601 28 AAS51 51 1 
A1351149AL555551 

BG292389 C06094 AI668930 AW104534 AA310S13 AA830127 AW1348g7 AA046953 AW965490 AI610530 8F092924 AA334151 AA33472S 
031302 R20723 AA263003 6)824635 AI276287 A1684428 A)524234 AI335035 AW014704 Ai91 1443 AA972102 A1367512 AI126670 AW016017 
AI286003 A)147163 AA626033 A1539156 AA565542 AI094253 AW512612 BEB89628 AA744762 BE646306 AW471324 AA999975 AA863400 
H17550 A)g91439 R46187 BE929954 AA333976 063102 6F744491 
AFa75017 R66779 R22463 K02780 • 

NM.002082 L16862 60828886 BE795217 6E904064 eE294526 6E297283 BE394617 6E935127 BE935106 F12351 BG823182 H16710 
AW978798 AA767373 AW173343 AA836163 N27563 AA90S32B R97032 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



453331 



446526 

455577 
4308S8 
400161 



409959 
421341 



434051 
410933 
406731 



400231 



432745 
412477 



456076 84S5J 



430202 
424017 
437272 



453542 
437565 
430763 
409031 



430108 
410276 
400247 



1063474^1 BF374577 AW809840 AW809996 AW809798 AW809695 AW809646 AW809738 BF374582 AVII809716 AW809826 AWro9802 AW809747 
AW810152 

16559J BG57t303 AA410586 AA035018 BG5721 17 BG620022 AA147247 BG005785 BG014448 R31981 H02668 H12498R36203 BF992089 R73999 
T49904 R757» BI057974 T53681 AA147933 N50695 R68588 R2S671 R31935 R25110 R35105 AKD5562B BE157<57 AW653674 AA190993 
H01642 BF51O304 AA626915 AA746952 AI161014 AA099554 BG572534 A1803329 AI809932 AI808765 AA41 1449 A)378760 AA976929 AI378620 
AA909684 R75632 AI360919 Ai350463 AW069127 AA41 1621 AA742532 H12451 BE208298 H03612 H12839 N58781 R75957 BF998484 AI240665 
BF989591 BI056088 BG001S90 BF107035 

1 1 131_1 7 AKD23419 BE464935 8E221668 AI14888S BI859909 AI3747B0 AA766e92 AI347967 AI5B2917 AA4771 17 AA229236 AA652637 AA636109 
AU134580 F21298 BF802607 BE769124 AV658891 H69816 AV659853 

1475566J BE006307BE0063416E006311 

73704J AFQ07ig0AW62070SBE16B488AF0Q7192BF753303BG976971 BG976568 

2656.2 U44839NMJD04651 BC000350BM58316AU117940BG759024BG749S94BE799505BG831S37AI816335AA325352A1J47005AW1CT^ 

A1859331 Ai816186 AU150786 AL043549 AW1 62880 AU159233 A)143169 T03478 BE727648 AA764725 6E206603 A1369814 AI984389 AW157545 
BE221486 H99016 AU159025 A)074496 A1494516 BE245950 AA704385 AA280862 A)479595 AI369776 BE671398 T0SS38 AA682249 61677303 
BE645335 AI359434 H92868 DS2599 053609 054715 T06015 BE222174 AI954706 DS3218 D53767 R69689 W86696 Af497670 R70771 SF309414 
BE620147 8G910S97 AW964968 6E836120 AL57971S HS6512 DS59S6 B1044097 AL5S5239 BF220278 AA061991 Ai819544 AW001573 AW131600 
AI658764 DS2367 W22034 BG818979 BG024561 BE702n9 61458663 B1910399 BG707755 BF3482B4 H10055 BI066315 BE620574 H410B8 
BG119517 W23257 W21941 AA328817 

2781 J AW513804 BE179199 BE179195 BE179198 BE179204 BE162686 

1407.1 NM-007329 AF159456 AJ243212 AJ297935 AA295769 NM_017579 AJ243224 AI492875 AI796676 AI749838 AA918144 A18 14590 AI923531 
BF513992 AI720725 AI150879 AI279072 AW512904 A1492104 A1284510 AI141 231 AA613554 AW66214B AW769047 AA56S985 AW51 2888 
AU100513 BG9555B5BG955588 AA295763 BE829414 8F760645 BG954398 AA295332 AA295795 8E932867 AW769S69 T89953 8E934311 

73505.1 AF116522 AI114S07 AA640834 BF111602 AA377g99 
1064624.1 C16024 C15974 AW811066 AW811052 AW811020 
0.0 A1S59131 

15525.1 AK074473BC017997BI831060BF971101Al886394AI082824A\m)8785VW6073W07772A\ffi60737W^^ 
AI191256 R44763 R19947 BF571345 W862S7 

MH494.5 BC013310AF261085BC004109 AY007133 BC009081 BCC01601 NfcL002O46M33197BCO20308J02642 M38164BE794233AV721080BE255459 
BG926429 BG389312 BG477333 A1031799 BI763443 BI260432 AA989106 AV72B576 BI091380 AA402499 A1200513 AI284734 AI223995 AI289749 
BG283291 BM013814AW438544BM450203 F35435F33262BE890952AA401181 BG939668 F35525B1088182F34674F33506BM471326 
F34677 AW276712 AA1 87508 F34866 AA1 14245 AA522581 N23935 A1D76923 AI018505 BE879774 BM465637 AI753078 BG2221 59 AA595947 
BF970917 BI094125 AA719841 BE893087 BG775178 BE793983 BE797071 BF339134 BE409272 6E266456 6E786770 8E745957 BG755835 
BE266758 BE25g342 BM450181 BG748174 6E299322 6M423587 BM467637 BM452667 6M479516 6M452420 BE273297 BM466364 BM450640 
BM478743 BM459094 BM455306 BM472001 BM478247 BM478771 BM480379 BM459071 BM450106 BliM67584 BM464546 BM465044 BM450176 
BF569359 BM462924 BM4S5329 BM471815 61862301 6G331736 H04903 AA374894 BE902964 

112543.1 AA658826AI821926 Ai791191 AA635129 AA564492 

8669.2 AI220117At857837 AI218371 BM091400At304964AI198508AI400738AW571549AW950042A!08g943AA4372BOAU 150878 6F197070A1267984 
BF594181 BF196688AI433152AI338921 AI820364AI280197AA652531 AI674938AI342447 AI620350AI281295AI148821 N54787AI338121 
AI2811S3N51899AI087072AA954788AVV069054AI346309BG529629AI340135BF083036A116^^ 

AI383720 8E0475e5 AW01 5498 AA937149 AA708346 AW771478 AW802508 H53334 AW389204 AW798230 AI553922 A1560688 AW950043 
AI961682 AV706506 R01853 AA126514 N62757 AI536893 AJ926052 AI41 8720 N99964 AI558933 A)915737 A1080691 AI185358 N48996 N68575 
HB2824 H60037 A1247247 T95664 BF583863 Ar749637 AV/088541 AA991 294 AA8874S2 AI073726 AA6331 32 AA629674 AA629649 AA629656 
AAS78595 A11687S8 AA804572 AI085786 AA9943g6 AA991209 AA94B563 AA92g054 AAg27952 T87001 AA928210 AA629296 AW802267 
AW384129 BF744400 AA194110 AI382839 AA184837 AA406264 AI250750 R37035 At525566 W01244 

BG210376 BG217800 BE925778 W39114 BG682395 N70644 BE709097 AW275615 BC001469 AL564888 AI088126 AW003852 BF792438 
AA161295 AW9701 31 AI127310 AW029307 AW192534 AA843144 AW606235 BE221641 AW0081 1 1 AA224203 AA604607 AW794761 AAl 34005 
AA126850 N99165AW769391AW818302A1269871 BE503027 AW401627 AA486231 AA486417AA191542AA0281 28 AAl 59991 AI498090 
AI241Q24BM145449 AA774661 A1626021 C182S1 BE185811 AA291517 N38896N59222AI24S611 AA169207A1298572 AAl 69585 All 31 139 
AAt57960 Af439963 AI208276 AA936(W1 W67305 AW337587 AI357055 WD4739 A121451 7 AA617789 AW241277 AI880213 A1582789 Al 143996 
BE814848 N49964 A1936222 AIB17819 F{a976 AA039349 AW805002 T35117 N94388 AI6B9530 AW384573 AW3M555 AW384539 AW384473 
AA129709 AW384466 BG194342 BG204579BG02753BA1578075 AA399553 AW794949 T88866AW511211 R26588 R36i11 BG170598BE937009 
BG678833 AA862899 H96612 H02273 AA768487 BF211173 N32570 AA088287 R68451 AA297563 H16847 R80665 AA421891 W68402 R2B379 
R54119 R70109 R77661 R67963BG701844 H68670 AA169864 AA114111 BE715243 R59317 BE715252 BE713804 BG336586 NM.001579 
BC01 1835 U51478 BM4631 1 7 AU119746 B1462090 BI227086 6G706303 BF059073 6G706532 BI544716 A1568735 BE858747 AU122881 
AU126210 A1166547 AU1347056E281323 AI147220 BE263820 AW973937 BG231863 BE658367 BEZ78941 AI262814 AI001194 AI391616 
A)200862 N32564 AJ161064 A}089818 8F514359 A1370916 AI341797 BE263168 AI218416 AI13109B AI285310 AI928^ 
AW028327 AI217792 A1161020 AI342854 AI221544 A1304700 AI343(X)5 BM148839 AAB31536 AW0742S8 BF940569 A1149876 AI914574 AA298442 
A)241 343 AA668985 AW2721 72 Al 160537 BG209220 AA02B152 AA02S989 
1233214.1 AW988345AA468998 T85775 
888651.1 AW956107AA334317AA333789 

176.2 AB075828 BG1 07783 AW0213 13 B1492345 AW385707 AW5608®)AW296117 6E072066 AVW80775 BF679498 AW580B28 8E0036G6 BE003872 

AW580851 AW580908 BI037146 AW5S0894AW580864 H17658 N50343 N54223F05S79F07386F05578 AA773248AA354359AA330257 C2068S 
BE548300 AWg68728 AW96B554 AA777644 AA7068S6 AA682517 AAB32267 BG165087 N51097 N51567 R95837 BI256603 W89042 R95836 
R97364 AW975957 AA747943 AA8 1 1 289 

885.22 BFS6B18&BF89974SAWB36724BE243668 

69698811 AW976857AI809001AA769369AW102732AA761235 

1400.7 AL578301 A1337389 AI871302 AA97B185 BF591738 AI765912 

9531J BF036O43 AW190446 BG194731 AW662036 A)445021 BE9375S0 AW818972 AW393132 AA834685 BF1 12058 AV721682 H16423 AI270187 

AI857345AA937302AW818444BE929780BG498578BF155010BI598271 61599811 BE161728 AW578737 AW753711 AW379707 AW381918 
B6506608 AW028637 AW994240 BF887392 BF790073 AW381624 AV727105 BF439618 AA443174 A1018009 N42B50 AW573242 AM172S8 
AA463483 AI6761 31 AI167170 AA838627 AA443828 AW592922 AA235129 AA73027B AW439062 AW474332 BI043239 AW474342 BG708553 
AW362423 BF090028 BE827256 R1 6550 R39478 R39479 R94368 BG540916 BM314745 AA251087 D54231 055274 BF085805 031589 AW986405 
AW99442S 081879 BE093545 AW90i 1 07 AA383529 BI021552 R56420 N39976 AA573281 H82595 AA234955 BE093539 AW367006 BF358697 
BF386318 AA663856 BE702099 BF035959 AI267384 AI207232 BE348320 AA621574 AA861212 BF083343 BF083341 AV745131 D53074 
AW95447eAW954472AA376836AV724531 053053 CI 4928 AA0932B7AA062538BG483558BE94OO50AA765954 T701 71 BE938775 BE940057 
D53502 AW373300 AL118798 BM1 28728 AAl 9341 1 AW4447C9 AW952455 A1887612 BF431948 61496876 AI264159 eM128481 A1624657 
AI689301 Aig69467 AA861685 AA251595 AA625761 AA872090 Ai826780 AA328366 68827416 R7S951 DS6918 R68122 eE827384 AL1187g7 
A1184164 AAt64411 6l4g53326E858113AI863860H006G0T189849AW780389C14667eE93499SBI018652R9^ AA1&4410H00752 
AW373305 AW373299 AW373302 

1233254.1 AA926810AW968393AA465294AA811301 

641443.1 AA083514 A15S4545 AW169852 AI363822 A1633826 A1&56026 AI765624 AA147545 AA147552 

2764.1 BG022339 BO00961O 6C010537 X79805 NML006713 U1 2979 BM487B14 8M4S0743 AU132951 AU137129 60493425 AV7S8819 BG708412 

BG705885 BG702217 AV716638 6G777009 B1545689 61552153 eM476712 60770858 6G5Z7656 BGS2B277 BG391388 AV716861 61602926 
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439403 
450724 



432077 
449625 

455047 
410934 
454600 



455500 
411745 
418689 
432093 
452239 



408B51 
430353 



4937.10 
40161J 



452948.1 
249224J 

1092329.1 
106465:L1 
1063489.1 



1228737.1 

43036.8 

12789J4 

1237011.1 

10116.4 



0_0 

102B8J 



431323 
442195 



1235269.1 
16007J 



BG290073 BI667399 BM451469 B1657173 BI602139 BG532171 BI669215 BI544727 B6721852 AV716503 AV701327 BM090738 B1492000 
Al30a85B BI544904 AL593813 AV715B29 AV716505 AV714587 AV717902 BF668072 AV716385 BW61927 BM090954 AV717828 BG503676 
AV647719 B6501392 BG428433 BE895629 BM313117 AW021050 BG435032 BM152910 AA313503 AAB72377 BG574714 AV712054 AV732695 
AA252476 AV712759 ALS99643 BE790872 BG654930 W73337 AW675377 AV760376 AV725138 AV716379 AABB7165 BE830003 AW023796 
A1599291 Ai902948 BG944042 F00781 AA352483 BG217897 N33888 AW5B1924 BG654730 031410 AA353088 D3128B AA295029 H95170 
BE935104 AU139980 BG772963 BG776470 BG532512 BG105449 B1545421 AV715456 AW386083 BG699714 A1535832 AL514940 BG190861 
BG210593 AW999254 H95138 AA353863 BE764809 N50375 BE091363 BG701255 BI860846 B18324B5 BG168150 BG028647 BE546301 
BG900321 B1909737 BG702363 BG514141 BG61 1137 BG700121 BF031492 N85802 AV715940 N51590 BG993478 BE172016 AW893622 
AK056692 AF086220A1375068AA284293 W32566AW797961 AA960897 AA504145 

BF060834 Bi019048 AWB62002AK056737 AA429538AW063311 AW440655AW231970AA428584 AA995028AWB54593 AA984131 AA552874 
AAS647Se AA528743 BG927275 fiE045117 AW975234 AA664937 6F749390 60928948 AA984546 6E0091S3 AV732467 BE078167 AW872716 
AW063245 AA578441 AI820704 AI732283 RS5428 BF999154 BF88048S 
AL1 34685 AW972760 AA525319 AA526644 AA525342 AA526632 AA525296 AV739423 AW972755 

BI918168 AW779760 N48674 A1375997 AA235370 BG699146 AI913631 AI498402 AI016320 AA323193 R49021 DS9344 BG9887S0 N45526 

BG986917 T61382 849391 R45432 AI203107 R35004 F07491 R25094 R35360 

AW852530AW852526AW852527AL597969 

AW811114AW811095AW811094AW811124AW811157AW811054AW811087 

AW810001 BF374533 AW809694 AW809672 BF374748 BF374S62 AW81 0209 AW81 0092 AW809786 AW810428 AW810429 AWB09884 BF374650 

BF374641 BF374764 BF374744 AWB10552 BF374678 BF374653 BF374579 AW810168 AWB1 0170 BF37461 1 BF374676 BF374667 AW81 0353 

BF374682 AW810154 BF374688 BF374565 BF374757 BF374637 BF374743 BF374719 AW809664 BF374643 BF374680 BF374714 BF374708 

BF374716 AW810432BF374691 6F358066AWB10006 AW810345AW809960 

AW9635e2 BE064192 BE0641696E152580 AW983587 

AV704306 BF368780 AW867826 AW859896 

AA229762 AA230035 

AW972670 AA52S808 H28359 H28383 

BG034&53 AW173315 AW303375 BGig022S BG939153 BF057308 AA600736 At75125B AI090486 BE93g504 AW631492 AI768270 AI862133 
AA41 7652 BE378218 AAS99207 AW794702 AA024968 AA446024 AI148235 AI191710 BI493797 AI272646 BI493796 AA634323 AI754332 
AA25e414 C05155AJ218228 AI039656AI350380AI084698 AI754989 AI673545 AI432010AI7S1035AA375571 AA446297 BG216743 
AA609784 R97304 

BC010106 AL560552 AU133296 AU133086 BE268567 BE268523 B1544879 BE398161 BG473088 BI544445 BE258021 BE296339 BE255040 
BE263020 BG706790 AL598627 AW952337 BG7581 13 AW512753 BE267666 BE253415 81225718 BE268350 BE258245 81224965 AW772605 
BG723903 BE397282 BI196341 BG702B80 AI878949 AL600437 AA416759 BE259917 AI031582 BF512142 AI088248 BE56032B AI802623 
AI28B613 AL597585 AW768553 AIB16352 BF732831 B1225687 AA833686 AA722593 AA807750 AW068064 AA405187 AI923236 N51 593 AL52ni 0 
BG282576 AL525927 AL525971 BI869547 AI064725 R91B56 H46814 H20112 W01682 AW848870 AW848585 AW376562 AW848985 AW848937 
AWM8862 AW848581 AW848512 AW848176 AW752623 AW752618 AW376822 AW376821 AW376684 AW376623 AW376622 BE706047 
AW752602 AW752691 AW752674 AW752652 AA379167 AW752610 AW7526B4 AW752613 AW752660 AW84e709 AW848576 AW8491S5 
AW848981 AWB48980 AVW8979AW848978 AVVB48973AVm8916 AW848713AW848708 AW848642AVV848641 AW84e639 AW848573 
AW848493 AW848492 AW848489 AW848488 AW848487 AW848353 AW848352 AW848220 AW752698 AW752697 AW752682 AW752681 
AW752680 AW752679 AW752664 AW752651 AW752638 AW752637 AW752636 AW752628 AW752626 AW762624 AW75261 9 AW752596 
AW752608 AL582019 BE875587 AL529175 AW965868 BG686208 AA259073 BE696973 AA459543 AA358314 W40564 BF928427 AW849000 
AW848718 AW84e515 AW848507 AW848444 AW848440 AW848232 AW848222 AW752657 AW376786 AW376781 AW376615 AW376614 
BI752581 AA534520 AI748906 AA047799 AI014753 ALS14460 AL581982 BG743146 W24171 H20102 H1 1227 AW752607 AW006596 AW130378 
BE716519 AW752661 AW848289 BF349557 AW752612 AW752632 AW848910 NM_004077 AF047042 AL560606 Bi765898 BI196B31 BI855656 
•BE906674 BG749937 BE53S486 BE019810 AA313713 AA992S42 AA332541 AA682985AA356125 6E140478 6G750945 BI45754d BG025661 
BF326302 AA325019 BG980676 AA337465 AA321974 B6949285 BG427585 R23979 BG611485 BE560678 W18977 N50379 BG824101 BG471750 
Bi463171 W04691 AU099350 B6471 590 BMOl 1999 BE2629456ES59801 BF756438BE881957 BE314546BG911831 6G1S0811 6G112017 
AA15751BT92368AW752620 
AW970823 AA502839 AA50281 9 

U81984 NM.001430 BE907085 B1333232 A1021986 AU138476 C18601 U51626 AU100517 BI054387 AU076970 BE786454 BG010080 AW377189 
BF998789 AA368139 R1 1396 T83613 BG006324 B1012404 BG001643 BF757957 AL549361 AL544016 BE002870 BE929314 BEOOOIBO AL046650 
BI0S3717 BE929315 81054967 BF980055 BFg2S432 RD5421 BFg22073 T70331 61004403 



TABLE 34C: 

Ptey: Unique number corresponding to an Eos probesel 

Reft Sequence source. The 7 digit numbers In IWs cdumn are Genbank Idenfiflw (Gl) numbers. "Dunham, et al." refers to the piAlicatlon entitted The DMA sequence of 

human chromosome 22* Dunham, el a). (1999) Nature 40248^95. 
Strand: Indicates DNA strand from wMch exons were predicted. 
Ntposition: Indicates nucteolide positions of predicted exons. 



Pkey 


Ref 


Strand 


NtjKtsition 


405443 


7408143 


Plus 


90716-90887,101420-101577 


401645 


7657839 


Mbi» 


34988^133 


401673 


7689903 


Minus 


122587-122705,122765.123047 


405120 


8099940 


Phis 


140176-140340 


401785 


7249190 


Minus 


165776-165996.166189-166314.166408-16656 


402333 


8844110 


Aifinus 


165893-165856 


404942 


7382153 


Pius 


92095-92252 


403362 


8571772 


Rus 


64099^260 


402641 


9958129 


Minus 


122596-125135 


405600 


5923640 


Bus 


28662-27225 


405061 


7656744 


Minus 


132492-132932 


402327 


7656695 


Minus 


108675-108770,109801*109910 - 


404342 


9838093 


Phjs 


115854-116033 


404429 


7407979 


Plus 


31352-31498 


403344 


8569726 


PhJS 


70823-70990 


401593 


7230957 


Pius 


10368.10572.11293-12356 


406461 


9756020 




158842-159136 


400609 


9887671 


Minus 


92037-92247 


402674 


8077108 


Minus 


39290-39S02 


401677 


9965537 


Minus 


628S&63086.6380343684 


405579 


6456174 


Hus 


10099&>101542 


405797 


1934909 


Minus 


5599.5681.5821-6104 
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405159 


9966252 


Plus 


79659-79804 


403520 


7684483 


Minus 


97621-98084 


402538 


9801137 


Minus 


96314*96539 


404151 


7534014 


MhnB 


G9038-69399 


400496 


9743564 


Plus 


4151S41695 


403010 


3132346 


Pius 


78385-79052 


406387 


9256180 


Plus 


11622&-116371.11751M17651 


402885 


9926751 


Rus 


71919-72049 


404501 


72^859 


Minus 


37270-37526 


402487 


9797538 


Plus 


75677-75843 


404455 


7677926 


Minus 


26927-27611 


401067 


5764724 


Minus 


153366.153509 


402324 


7630361 


Plus 


26052-26803 


402013 


7407997 


Pius 


174540-174634.1 75449-175568 


401116 


9966559 


Plus 


123579-124447 


402998 


2996643 


Mnius 


17175-17373 


405550 


1552494 


Rus 


91720-92115 


402917 


7406502 


Minus 


1034-1177.3143-3266 


402504 


9797871 


Rus 


12366-12614 


405491 


5801645 


Rus 


81857-82045 


400818 


8569994 


Plus 


172644-172765.173085-173200 


406475 


9797684 


Rus 


125417>1S563,1280S2-128180 


401025 


8117518 


Minus 


179287.179483.161044.181166^161844.18203 


402308 


7340295 


Minus 


92080-93638 


405213 


6692345 


Minus 


50267.51151 


400740 


7329267 


Minus 


79920-8051080576-80746 


402825 


6165330 


Minus 


78572-78807 


405973 


8247789 


Plus 


103859-104254 


405818 


4071056 


Plus 


2905S29196 


402621 


9930950 


Rus 


130806-131036 


401311 


9212516 


Minus 


180124-180754 


401899 


7230209 


Minus 


155620-155815 


403579 


8101179 


Minus 


36167-36365 


404600 


8705107 


Rus 


118354-118444.118849-118792 


405531 


9665194 


Plus 


35602-35803 


405542 


9857564 


Rus 


71331-72183 


405131 


8516051 


Minus 


136764-137594 


403026 


7670575 


Rus 


56521-56840 


405369 


2078469 


Minus 


34183-34357,35686-35751 


405932 


7767812 


Minus 


123525-123713 


405156 


9986228 


Rus 


146733-146860.147899-147961,153127-15325 


402174 


8575912 


Plus 


253499-253874 


405536 


9795661 


Rus 


164091-164162.164397-164516.166720-16679 


406109 


9127147 


Minus 


58328-58485 


403162 


9838085 


Rus 


82652-83613 


401404 


7710968 


Pius 


136474-136645 


403167 


9838127 


Rus 


162599-162935 


402209 


8576119 


Minus 


5331&^72 


400986 


8085497 


Minus 


63140^19 


404975 


3419864 


Minus 




404926 


7341919 


Minus 


150411-151484 



TABLE 3SA: About 323 genes iq)regutetedtnhypa8enstCv{typne^^ 

Pkey. Unique Eos probeset identifier number 

ExAocn: Exemplar Accession number, Genbank accession number 

UnlgemlD: Untgene number 

IMgene TlOe: Unlgene gene tiOe 

R1: 901h peicenSle of HP Als divided by 90lh peroentiie of IPF Als. where 1 50i pen^enOie of nomnal tissue AIs was subtracted from boUi the numerator and denominator. 

TM minimum value for ttw numerator 8nd denominator wg? get to 50. 
R2: 9001 peicenSle of HP AIs divided by the median of IPF Als, where the minimum vdue fbr the numeiator and denonitnalor was selto 50. 



Pkey 


ExAccn 


UnigenelD 


Unigene'raia 


R1 


R2 


402550 






TaigetExon 


4.03 


4.70 


421563 


NM_00€433 


Hs.105806 


granulysln 


3.37 


170 


424326 


NM_014479 


Hs.1452g6 


ADAIMe disintegrin protease, decysbi 


3.31 


142 


417967 


BE244373 


^te.1119 


nuclear receptor subfamily 4, group A, m 


3.09 


1.51 


411089 


AA456454 




cell division cycle 2-1ike 1 (PTTSU^E pr 


^99 


1.26 


415350 


AF188625 


H3.189507 


phospholipase A2. group ilD 


Z71 


1.43 


406654 


M90686 


Hs.73885 


HLAG hlstocompaflbili^ antigen, dass 


2.70 


1.53 


459705 


BB182764 


Hs.270252 


ESTs, Weakly slmiiar to androgen recepto 


. Z70 


1.14 


412610 


X90908 


Ks.74126 


fatly add binding protein 6, ileal (gas 


Z69 


199 


452194 


AI694413 




olfactory receptor, family 2, subfamOy 


2.63 


167 


447709 


U97145 


Hs.19317 


GDNF family receptor alpha 2 


Z63 


1.52 


410910 


AW810204 




gb:MR4^T012&^1199<}17-d08 ST0125 Homo 


2.59 


1.00 


454671 


AW61S29 


Hs.336908 


ESTs 


150 


134 


441859 


AW194364 


Hs.94814 


intedeukln-4 induced gene-l protein (R 


Z45 


1.90 


422398 


A1476149 


Hs.334489 


hypothetteal protein FU21992 


145 


1.36 


403244 






C2002870*:OiI82698|p{rt)JQ0985 hydraxypr 


140 


1.53 
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415462 


RS2692 


H112698 


447028 


AI97312B 


Hs.167257 


412394 


AW984150 




450165 


AWX)7235 


Hs.63931 


431093 


AB031038 


Hs^1704 


444090 


S69115 


Hs.10306 


4138B2 


6E156991 




441320 


AI768724 




456766 


R87310 


Hs.7740 


420340 


Nl\/L000734 


Hs.97087 


459721 


A1299050 


Hsll 43635 


405452 






458079 


AI796870 


Hs.54277 


401447 






423066 


Y18264 


Hs.1 23094 

i W» 1 4bWW 


441704 


A1458766 


Hs.192125 


405097 






408544 


AW29382S 




413454 


BE141162 




444404 


M31525 




418460 


M26315 


1 MtUvMfU 


436063 


AK0OOQ28 




429212 


NM.001504 


Hs. 198252 


400712 






417929 


R27219 


H5.74647 


403478 






418747 


AJ249977 


Hs.88049 


429712 


AW245825 


Hs.211914 


451666 


Z43948 


Hs.326444 


414020 


NMJ0O2984 


Hs.75703 


456057 


AA947457 


Ks.135560 


444346 


At142274 




418918 


X07871 


Hs.89476 


451318 


AA029888 


Hs,95071 


458935 


Y16521 


Hs.24812 


417105 


X60992 


Hs.61226 


408219 


6E061111 


H5^54211 


420137 


AA306478 


H8.95327 


443711 


N67861 


Hs 49390 


423234 


AA323534 


Hs 296162 


416975 


MM 004131 


Hs.1051 


425280 




Hs.1 872 


405827 






406909 


L20777 


Hs 73885 


437295 




Hs.88417 


424281 


AA766243 




430413 


AW8421fi2 




423901 


AA333006 




405075 






457423 


AK000642 


Hs.265018 


406267 






423365 


AA324992 




449970 


AI678058 




430733 


AW97592D 


Hs 121036 


445323 


AI288274 


Hs 345792 


402240 






451404 


AA460775 


Hs62% 


424463 


AW19S353 


Ks.1 19903 


400107 






404811 






40K89 






404088 
414991 


CI 7898 




429073 


AA448167 


Ks.47385 


426274 


D38122 


Hs.2007 


401897 






431094 


AVV972276 


Hs.116195 


424899 


AL1 19387 


Hs 119062 


419711 


C02621 


Hs 159282 


459019 


AA017156 


Hs.40719 


405453 






402516 






457365 


AA577297 


Hs.303249 


407928 


NM_002262 


Hs.41682 


436553 


AW407157 


Hs.8997 


406266 






419409 


AW297831 


Hs.143792 


435028 


AW1 93035 


Hs.187370 


404696 






403533 






411673 


BE064853 




424148 


BE242274 


Hs.1741 


419833 


AA251131 


Hs.220697 



ESTs 

br^lInkprotebKl 

^:PM24«I0008>17030(M)OU09 mm Homo 
ESTs 

eomesodemin (Xenopus lae^) homolog 
natural loQer cell group 7 sequence 
gb:RC3-HT0371*29010(M)13«02HT0371 Homo 

flbulinl 

oxystero) binding protein-tike 1 
CD3Z antigen, zela polypepfide (7113 com 
gb:qn14d12j(1 Na_CGAP_Lu5 Homo sapiens 
Target Exon 

DMA segment on chromosome X (unique) 992 

Target Exon 

sal (Orosoph3a}4ike 1 

ESTs 

ENSP00000175236-:A disintegiin and metal 
ESTs 

gb:MRO-HTC07W)21299^1-d03 HT0076 Homo 

msQor histocompatibility complex* dass 

COB antigen, a^ha polypeplide (p32) 

n^osomal protein S24 

G protein-coupled receptor 9 

Target Exon 

Human T-ceB receptor active a)ph»cha]n 
NM.022342:Homo sapiens Idnesbi protein 9 
protein Idnase. AMP-acthraled, gamma 3 n 
ENSP00000233627*:NAOHHil3]quinone oxidare 
cartilage addle protein 1 
small indudlile cytokine M (homologous 
ESTs. Weakly stndar to T43458 hypolhefi 
ESTs 

CD2 antigen (p50), sheep red bk}od cell 
ESTs 

COP-<fiacytgiycerol synthase (phosphaDda 
CD6 antigen 

gb:QVO-BT0041-011199^9402BT0041 Homo 
C03D antigen, delta polypeptide (TTTS oo 
ESTs 

AD037 protein 

granzyme B (granzyme 2, cytotoxic T-lymp 
phosphoenoipynjvate carbmyldnase 1 (sol 
TaigetExon 

gbrHuman MHC dass I HLA-6 gene (HLA-A33 
ESTs 

gb:oa13b11.s1 NCI.CGAP.GCBI Homosaplens 
smal l induc ible cytokine AS (RAIITES) 
gb£ST37064 Embryo, 8 week I Homo saplen 
Target Exon 

hypothelkal protein FIJ2063S 

Target Exon 

ESTs 

ESTs 

ESTs 

ESTs 

Target Exon 

ESTs. simtlar to T17248 hypotheS 
ESTs 

Eos Control 

NM_021096:Homo sapiens cah&Bn channel, 

Target Exon 

Targ^Exon 

gb:C17898 Human placenta cDNA (TFufhvara 
ESTs 

tumor necrosb factor (llgand) superfsmi 

C17001987:gII73033601gb|AAF58438.1| (AEO 

ESTs 

ESTs 

ESTs 

liypofiieficalprotdnKtAA1164 
NM.005748^:Homo safto YYI-assodated f 
Target Exon 
EST 

Mler ceO lectin-Ike receptor subM 
immunogtobufin lambda locus 
Target Exop 

hypotheOcai protein M6C2656 
ESTs 

NM_013443:Homo sapiens CMP-NeuAC:(beta)- 
Target Exon 

gb:RC1«BT0313-1 10300^5^ BT0313 Homo 

bitegrtabeta? 

ESTs 
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ZAO 
Z33 
Z32 
132 
Z30 
Z28 
2.27 
2.27 
2.27 
Z26 
2.25 
Z2S 
125 
125 
2.24 
Z24 
2^4 
2.22 
120 
120 
119 
119 
118 
118 
117 
117 
117 
116 
116 
115 

lis 

115 

114 

114 

113 

113 

113 

111 

110 

110 

109 

108 

108 

108 

107 

107 

107 

107 

107 

107 

107 

108 

106 

106 

106 

105 

105 

104 

104 

103 

103 

103 

103 

103 

102 

102 

102 

101 

101 

101 

101 

101 

101 

101 

2.00 

100 

100 

100 

100 

100 

100 

1.99 

1.09 



1.00 

1.64 

1.00 

1.32 

1.81 

1.69 

1.59 

1.87 

1.36 

1.98 

1.82 

1.29 

5.80 

1.55 

1.51 

1.00 

1.00 

1.95 

126 

1.37 

188 

1.42 

1.22 

1.00 

1.79 

1.80 

1.76 

1.44 

112 

188 

ISO 

138 

1.93 

1.16 

1.52 

161 

1.94 

166 
1.00 
1.52 

3.74 

1.84 

1.00 

129 

1.72 

1.00 

116 

1.50 

1.15 

167 

1.30 

1.70 

148 

1.00 

1.00 

1.94 

1.44 

1.32 

142 

118 

1.57 

1.00 

104 

3.10 

1.92 

1.55 

1.00 

141 

1.92 

1.76 

1.24 

1.00 

138 

2.62 

1.64 

146 

1.60 

1.55 

1.21 

1.17 

1.00 

3.66 

1.69 
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423196 

426416 

449317 

424321 

422109 

424218 

406303 

438676 

404240 

404056 

425508 

429819 

416941 

446998 

409153 

419490 

446608 

425357 

414812 

422994 

424517 

433671 

412116 

447656 

432468 

419231 

427527 

431574 

436485 

432606 

421379 

448569 

429670 

412584 

413869 

406672 

452203 

426451 

447131 

414512 

426752 

444793 

452334 

446227 

407830 

423799 

456332 

408380 

437644 

402736 

438866 

422846 

426202 

414646 

420440 

416967 

415823 

421924 

427307 

444929 

439237 

416196 

428227 

436568 

430308 

433934 

443559 

450000 

415349 

406656 

456974 

415401 

439372 

434666 

417695 

417427 

431903 

413472 

425762 

412472 

451406 

412568 

449835 



AK001866 

AW512744 

AW293413 

W74048 

S73265 

AF031824 



Hs,125139 

m.169824 

Hs.132906 

Hs.1765 

Hs.1473 

H$.143212 



AA813745 Hs.123446 



AA9915S1 

AL133011 

BE000150 

N99013 

W03754 

NM.006144 

N75217 

BE271188 

X72755 

AW891802 

A1539443 

AW138797 

AW402166 

NM.003725 

AW402155 

AL046294 

A1809057 

AW572659 

X59135 

NM-002104 

Y15221 

BE382657 

101087 

X54870 

NM_000878 

M26041 

X57522 

AI908165 

NO04585 

AL044336 

X69490 



D60471 

AI281459 

NK1.001086 

AW026300 

AI000341 

AF1230S0 

AA74857S 

U44385 

BE513934 

BE266484 

AA353776 

NW_002407 

BE616731 

R81864 

BE514514 

AF117947 

AI685841 

AW408158 

At745649 

AA321649 

R98865 

BE54086S 

AW273261 

AI076765 

A1952797 

AI765697 

M16714 

M12529 

N80139 

AF088033 

AF151103 

BE241624 

M90391 

AB029468 

BE242870 

BE244076 

AW975398 

A1694320 

AI87B826 

AVV979300 



H&97013 

Hs.225108 

Hs.48778 

Hs.16762 

Hs.50813 

Hs.90708 

HS.2S7646 

Hs,155975 

H5.77367 

Hs.296276 

Hs.137447 

Hs.132906 

Hs.784 

Hs.19126 

Hs.3003 

H5.136245 

Hs.153261 

Hs.251373 

Hs.156110 

Hs.3066 

Hs.103982 

Hs.21486 

Hs.211593 

Hs.74085 

Hs.755g8 

Hs.198253 

Hs,169946 
Hs.17466 
Hs.6831 
Hs.172004 
Hs.11958 
H5.133gO 
Hs.270114 
Hs.587 
Hs.132g06 

Hs.44532 
Hs.136748 

HS.32549S 
Hs.1583 
Hs.82916 
Hs.901 
Hs.97644 
Hs.80645 
Hs.205103 
H8.109506 
Hs.174795 
Hs.161354 
HS.31B893 
Hs^9 
Hs.2248 
Hd.11135 
Hs.238990 
Hs.216292. 
Hs.269899 
Hs.10888 
Hs.13231 
H3.89643 
Hs.16g401 
HS.26B916 
Hs.159225 
Hs,112259 
Hs.82401 
Hs.82127 
Hs.272100 
Hs.75379 
Hs.159578 
Ks.293836 
Hs.6295 
Hs.74034 
H8^13 



hypolheScaf protein aJ11004 
kSier oen tectin-Gka receptor subfaml 
19A24 prolan 

lymphocytd-spedfic protein tyioslnsld^ 
gastrln-releasinfi peptide 
cyststin F (leukxystalin) 
C16000922:gi|7499103lpirflT20903hypothe 
ESTs 

NM_018950:Homo sapiens fi^jorHslocompat 
TaiiQetExon 

Homo sapiens. Simitar to RtKEN cDHA 2310 
Homo sapiens mRNA; cONA DKFZp434P201 (fr 

niban protein 

Homo sapiens mRNA; cONA DKFZpS64B2062 (f 
tiypotiieScal protein FU20022 
oranzyinBA(granzyme 1. cytotoxic T-lyinp 
ESTs 

protein tyrosine pItQsphatase, reoeptort 
monokine induced ty gamma interferon 
ESTs 

Homo sapiens cDI^A FU12169 lis, done 
igA24 protein 

Epstdn-Barr vims induced gene 2 (lymph 
sm Idnase-assodated ptusptraproteln of 
CD3E antigen, epsilon polypeptide CTiTS 
ESTs, Weaidy similar to T17227 liypotheU 
immunoglobulin heavy constant mu 
hypothetical protein dJ434014.3 
immunogiolHjIln Vappa constant 
granzyme K (senne protease, granzyme 3; 
small inducit)(e cytoldnB subfamOy B (Cy 
signal transducer and acfivator of trans 
protein kinase Cbtheta 

DNA segment on chromosome 12 (unique) 24 
Inteileukln 2 receptor, beta 
rm^ histocompatibflily comptex, dass 
transporter 1, ATP-binding cassette, sub 
GATA-binding protein 3 (T-cell receptor 
retindc add receptor responder(tazaro 
golgl phosphoprotein 1 
Utin 

oxldallve 3 alpha hydioxysteroid dehydro 
gb:HUM1 1 1 0098 Ctontech human fetal brain 
ESTs 

arylacetamide deaoelylase (esterase) 

19A24 protein 

ESTs 



tec&vQka NK cell receptor 
NM.024d52:Homo sapiens hypothetical pret 
tissue inhibitor of metalloprotetnase 2 
neutrophil cytosoric factor 1 (47kO. dv 
chaperoiito oontaMng TCP1. atibuidt 6A ( 
C048 anOgen (B-cell membrane protein) 
mammaglotiin 2 
Interferon regulatory factor 1 
ESTs 

cnorAu acQn-bindirig proteb, 1A 

PDZ domaln^ntalning guanine nudeoSde 

ESTs 

ESTs, Weaidy similar to A475B2 &cell gr 
KIAA1708 protein 

amafl indudbie cytddne subfamily B (Cy 
major histocompatibil^ comptex, class 
cydin-dependent Mnase inhibitor IB (p2 
ESTs 

ESTs, Moderately similar to ALU8.HUMAN A 

hypothetical protein FU21709 

ESTs 

major histoconipaGbility comptex, dass 

apdipoprotato E 

ESTs 

ESTs 

T ceti receptor gamma locus 
C069 anOgen {p6a eaily T-ceO acthraO 
Interteutdn 16 (lymphocyte chemoattracte 
SMS3 protein 

solute carrter family 1 (glial high afH 
AT-hooic transcriptjon fador AKNA 
ESTs 

ESTs, Weaidy similar toT17248 hypotheS 
caveolin 1, caveotea protein, 22kO 
ESTs 



1.99 

1.98 

1.98 

1.97 

1.97 

1.96 

1.96 

1.95 

1.95 

1.94 

1.93 

1.93 

1.92 

1.92 

1.92 

1.90 

1.90 

1.69 

1.89 

1.88 

1.68 

1.88 

1.86 

1.86 

1.84 

1.63 

1.82 

1.82 

1.82 

1.81 

1.80 

1.79 

1.78 

1.78 

1.78 

176 

1.76 

1.73 

1.73 

1.73 

1.73 

1.72 

1.72 

1.72 

1.72 

1,71 

1.70 

1.70 

1,70 

1.69 

1.69 

1.68 

1.68 

1.68 

1.67 

1.67 

1.65 

1.65 

1.63 

1.63 

1.63 

1.62 

1.62 

1.62 

1.62 

1.62 

1.61 

1.61 

1.60 

1.60 

1.60 

1.59 

1.59 

1.59 

1.58 

1.56 

1.57 

1.57 

1.56 

1.56 

1.56 

1.55 

1.55 



1.84 
2.56 
2.44 
Z51 
3.32 
1.86 
Z16 
3.62 
Z06 
Z60 
3.24 
Z35 
Z24 
5.44 
6.08 
5.S2 
4.63 
1.50 
4.93 
3.30 
Z17 
1.83 

ai2 

1.88 

1.65 

Z46 

2.07 

2.63 

2.75 

4.56 

5.10 

1.89 

3.34 

3.55 

1.97 

Z12 

1.55 

Z04 

1.56 

ZOO 

Z62 

2.30 

Z12 

Z48 

Z72 

Z40 

3.71 

Z71 

Z58 

Z10 

1.39 

Z02 

Z14 

3.30 

Z42 

1.49 

Z16 

1.58 

1.94 

116 

1.74 

Z76 

4.42 

1.74 

1.85 

ZOO 

ZOO 

1.46 

ZOO 

1.47 

1.63 

1.68 

Z04 

4.08 

3.06 

Z37 

Z14 

Z28 

1.46 

Z26 

Z38 

1.67 

Z16 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



405545 






Tarpsl Exon 


1.55 


Z64 


435299 


AI745458 


Hs^3026 


ESTs, Weatdy similar to T20593 hypotheti 


1.55 


3.81 


422060 


R20893 


Hs.325623 


ESTs. Moderately similar to ALU5_HUMAN A 


1,54 


Z14 


424243 


Ai9493S9 


Hs. 143600 


ESTs. Highly simQar to ds Gotgi-local) 


1.53 


Z62 


457500 


NM_002759 


Hs.274382 


protein kbiase, interferon-indudtjie dou 


1.53 


2.04 


424541 


AW392551 


Hs.160559 


ESTs, Wealdy similar to A56194 ttinxnboKa 


1,53 


2.00 


439039 


AI656707 


HS.4B713 


ESTs 


1.53 


Z38 


449523 


NM.000579 


Hs.54443 


ctiemokbie (OC motif) receptor 5 


1.52 


Z65 


457718 


F18572 


Hs.22978 


ESTs. WeaUy stmSar b ALU4.HUMAN ALU S 


1.52 


Z06 


415198 


AWOt^SO 


Hs.943 


natural killer ceH transcript 4 


1.52 


1.40 


431594 


AI823999 




ESTs 


1.52 


2.12 


432656 


NM.W)0246 


Hs.3076 


MHC dass 11 trans activator 


1.52 


2.20 


422426 


W79117 


Ks.58559 


ESTs 


1.52 


Z22 


414372 


AA143654 




9lKZo6Sa02j1 Stratagene pancreas (93720 


1.51 


m 


427247 


AW504221 


Hs.174103 


inlegrtn. dpha L (antigen GD1 1 A (p18()). 


1.50 


1.67 


433043 


W57554 


Hs.125019 


lymphoid nuclear pfDtein (LAF-4) fARNA 


1,49 


3u12 


406621 


X57809 


H5.8997 


ImmunogloljuSn temtxla locus 


1.49 


1.78 


419166 


AA234638 


HS.2935B4 


ESTs 


1.49 


110 


418323 


NMJ)02118 


Hs.1162 


nufor HstocompaGbiliiy complex, dass 


1.49 


1.47 


435304 


HI 0709 


Hs.269524 


ESTs 


1.48 


Z96 


452834 


AI63B827 


Ks.1 05685 


K1AA1688 protein 


1.48 


Z14 


446516 


R65964 


Hs.334873 


ESTs, WeaMy similar to ALU6_HUMAN ALU S 


1.48 


1.38 


429272 


VV25140 


H5.1 10667 


ESTs 


1.48 


119 


428379 


X06026 


Hs^S9 


GD36 antigen, gamma polypeptide (nra CO 


1.48 


1.66 


433231 


AB040926 


H5.143552 


KIAA1493 protein 


1.47 


Z16 


408847 


AW290997 


H5.30348 


ESTs 


1.46 


Z08 


405441 






Target Exon 


1.46 


2.99 


443378 


AW392550 


Hs.9280 


proteasoms (prosome. macropaln) subunit« 


1.45 


1.56 


459644 


AW197203 




gb:xm38b01j(1 NQ^CGAP^GCe Homo sapiens 


1.45 


2.44 


431433 


X65018 


HSJ53495 


surfactant, pulmonary-assodatsd protein 


1.45 


1.70 


422934 


BE244189 


Hs.1 22492 


hypothetica) protein 


1.44 


1.27 


409799 


D1192B 


Hs.76845 


phosphoserine phosphatase^ 


1.44 


3.46 


406698 


X03068 


Hs.73931 


major NstocompaObrtty complex, dass 


1.44 


1.71 


421407 


T82331 


Hs.1 82278 


ESTs. Weakly simiiar to (^U6C collagen 


1,43 


1.56 


413420 


AW410235 


Hs.75348 


pfoteasome (prosome, macropain) activato 


1.43 


1.25 


400269 






Eos Control 


1.43 


Z02 


420973 


AA743415 


Hs^1368 


ESTs 


1.42 


Z08 


442104 


L20971 


Hs.1 68 


phosphodiesterase 4B. cAMP-specifK: (dun 


1.42 


Z20 


430015 


AW768399 




ESTs 


1.41 


Z06 


427648 


AI376722 


Hs.180062 


proleasome (prosome, macrop^) subunil. 


1.41 


1.31 


418870 


AF147204 


Hs.89414 


chemokine (C-XrC motiQ, receptor 4 (to 


1.40 


1.72 


437479 


R61866 


Hs,101277 


ESTs 


1.40 


Z52 


425345 


AU077297 


Hs,1 55894 


protein tyrosine phosphatase, non-recept 


1.40 


2.17 


416030 


H15261 


Hs.21948 


ESTs 


1.40 


Z62 


419886 


AA251562 


Hs.146168 


ESTs, Wealdy simQar to AP1 16023 1 SH3 d 


1.40 


1.66 


443951 


F13272 




ferritin, 6^ polypepfids 


1.40 


1.64 


414875 


H4267g 


Hs.77522 


m^ histocompaSlNlity complex, dass 


1.40 


1.42 


412471 


M63193 


Hs.73946 


endotheOai cell growth factor 1 (platel 


1.40 


1.34 


428782 


X12830 


H8.193400 


inlerieukin 6 receptor 


1.40 


Z30 


400680 






NM.014207:Homo sapiens COS antigen (p56- 


1.39 


1.93 


428289 


M26301 


HS.22S3 


complement component 2 


1.39 


1.39 


441410 


AA932689 


Hs.233304 


ESTs. Weakly similar to 138022 hypotheti 


1.39 


1.42 


406545 


M57466 


Hs.814 


m^or hlstocompatit>nity complex, dass 


1.39 


1.45 


441379 


AW1757B7 


Hs.334841 


selenium binding protein 1 


1.38 


1.32 


416636 


N32536 


HS.4264S 


sohite carrier (amily 1&(monocart)Oxytic 


1.38 


Z04 


418707 


U97S02 


HS.B7497 


bul^fiophllln, subtemfly 3i membor A2 


1.38 


1.35 


423526 


AB011086 


Hs.129739 


KIAA0514 gene product 


1,37 


1.41 


424168 


L29277 


Hs.321677 


signal transducer and activator of trans 


1.37 


1.33 


431723 


AW068350 


Hs.1 6752 


Homo sapiens mRNA; cONA DKFZp564B2062 (f 


1.37 


1.74 


426437 


BE076537 


HS.169B95 


ubiquiOiKonJugating enzyme E2L6 


1.35 


1.38 


446666 


H95741 


Hs.17914 


mentrane^pannlng 4-domains, subfamily A 


1.35 


1.64 


452353 


CI 8825 


Hs^191 


epith^al mernt)rane protdn 2 


1.34 


1.47 


448408 


AW772298 


Hs.21103 


Homo sapiens mRNA; cONA DKFZp564B076 {ft 


1.31 


1.77 


435106 


AA1 00847 


Hs.5978 


ESTs. Highly similar to AF174600 1 F-box 


1,31 


1.53 


444633 


AF111713 


Ks.286218 


juncliona] adhesion molecule 1 


1.30 


1.37 


430998 


AF128847 


Hs.204038 


indoiethylamine limethyltranslisrase 


1.29 


1.49 


419092 


J05581 


H3.89603 


mucin 1, transmembrane 


1.28 


1.36 


451864 


N20370 


Hs.69547 


ESTs 


1.28 


1.42 


421140 


AA298741 


Hs.102135 


8^ sequence receptor, delta (transto 


1.28 


1.31 


412790 


NM„0t4767 


Hs.74583 


KIAA0275geneprodud 


1.28 


1.63 


446272 


BE268912 


Hs.14601 


hematopoietic cetkspedfk: Lyn substrat 


1.28 


1.38 


422530 


AVV972300 


H3.II8IIO 


bone manow stromal ceD antigen 2 


1.28 


1.36 


435822 


T95594 


Hs.1874d5 


ESTs 


1.27 


1.82 


455863 


AA907305 


Hs.36475 


ESTs 


U7 


1.38 


404277 






NM_0191 1 1*:Homo sapiens major histocompa 


1.27 


1.52 


413497 


BE177661 




Bb:RC1-HT0598^203004)1 1-h02 HT0598 HOmo 


1.27 


1.54 


441835 


AB036432 


Hs.184 


advanced glyoosylation end product-sped 


1.27 


1.53 


418371 


M13560 


HsJ4298 


CD74 anOgen (Invariant potypepSde of m 


1.26 


1,27 


434747 


AA837085 




ESTs 


1.26 


1.60 


425320 


U29344 


H5.83190 


fatty add synthase 


1.25 


1.35 


452363 


AI582743 


Hs.94953 


Homo sapiens. SimOar to complement oomp 


1.25 


1.41 


434644 


H98071 


Hs.4055 


chromosome 21 open reading fmne 50 


1.25 


1.30 


404854 






Target Exon 


1J5 


1.57 
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406973 


M34996 


Lie iqfl9S3 


niijof hislocofwpBfibiH^y cojiiptoXi ctass 


1.25 


1.57 


421071 


AI311238 


Hs.104476 


ESTft WbtAfv dn^AT In f^f^HUlE cdBaoen 
CO i9i vraoNj SiiiBiar ui vAsnvi ic wwc^m 


1.24 


126 


431779 


AW371178 


7fi&S71 


fipotipopfulBin 


1.24 


139 


416047 


BE439894 


)-k 7ftQQ1 




1,23 


2.08 


406826 






•» finugen (invaiani poiypqitHo 01 in 


1.23 


120 


426836 


N41720 


H5.1 72684 




1.22 


124 


415661 


AF057307 


HS.78S75 


nrnsafwstn fvarvint f^imhiv disessfi and 


122 


116 


406824 


AV\^15961 


H5.84298 




1.22 


117 


420679 


X571S2 




fthrrtbtrin 


1.22 


130 


443071 


AL080021 


K3.8986 


MJlTfMCnreill WUIlipWcill 1 1 If Slf(MAlI)Wil6llli 


1.22 


158 


416090 




H$.83429 




1.21 


1.33 


430250 


NM 016929 


Hs 283021 


ehlnrirtfi fnlrsiRplliilsir channpf 'i 

MHUIIUO IllUauCUUIal WmHIIIBI w 


1.21 


1.60 


406825 




UefMOQO 


vUf ** Bni^en (invdiidiii poiyp^iKJc oi iii 


1.20 


120 


436906 


H9S990 




(nsior htstocomp3(3^ty complsx, ctsss 


1.19 


127 


422241 


TUUUDc 


ns. 1 /vi£i 


prolBin tyrosins phos(^3t8S6i rocoptor t 


1.19 


162 


408279 


Mr ^10909 


ns.'rwao 


nofuo so|A5ns, none mu w> i £.v i / ■ ii u\fv> vom 


1.18 


1.25 


411372 


Ml l*tf OO 1 


kk 7119RQ 


low dBfisity Gpopfotsin roooptor (fsniili 


1.17 


133 


AAA'iAt 




no. IUQO/ 


sln^Isr to lysosofno-assocfatsd incfnbfsnc 


1 1R 
1. ID 










9btH.s8pi6ns protsfavsGTuiGAhrconins ki 


1 1R 
1. ID 


1. 13 








A 1 r-OCnaing CSSSoUo, SUD'Tdrnuy n y/\l3l^1 


1. ID 




432805 


X94630 


Hs.3107 


CD97 antigen 


1.10 








ns. 1/ fooc 


surfsctsnt, pulinon3ry*3SSOci3t6d protBin 


1.16 


141 


4J0WD 






nuctGsr receptor sutjfaniily 1 , ^^roup 1, m 


1 1R 
I.ID 




443623 




Hs.9641 


Mil 1 ipicnKni wn 1 iponcTii i| ^ ouuMnii^Miciit* 


1,15 


127 


429832 




Ue 0RHAT7 


f=<rrR WpaVtw dmibir tn 1 IRPP HUMAN IIRiOll 


1.15 


172 


438183 


DCTC'MCO 


H3.6IOI 


hunnlhiiilrQl nmleln M/2P^17R 


1.15 


121 


432680 


T47364 


Ue OTRSI*) 


intftffnmn alnhii.{nf{itrahlA nrntowi 97 

iiiioiioniiii oipnfrHiuuuioio imuusui ci 


1.14 


121 


406782 


AA430373 




<rfi>»r2fV11 Jti Soaiafi iwav himy NhHOT H 

^UtCJVCUl 1 liS 1 9llcliU9 wVcS J UHIM fWriv 1 n 


114 


141 


414662 




rW' • vDVf 


nuijof histoooiiipsSbility complox, dsss 


1.12 


125 


452547 




Hs.74120 




i!ii 


139 


414803 


X03100 


Ks.914 


Hitman mRNA frv HassM hkhwvwnnnfiH 


1.11 


1.22 


430260 




Ue 037868 


tnlAripitkin 7 rPTAnfnr 


1.10 


1.73 






fTS.cDOuw 


leunoic aciu inoucea j 


1 no 


1 99 
1.4c 


424614 


X54486 


na. 1 9 1 


eorfnA /nr pwcfa!nn\ nmlotnaea InhihHn 
ovlun; yjf C/oIbhic/ (JloicinaSKS iniuoilU 


1.09 


1.14 


419200 






EST 


1.08 


164 


416511 






1 weftewmaLgeeorlflfaH mi ilftenanninn momhr 

LjsosoindraSMJCiaioo iiHiiuspianning fiicinin 


1.08 


118 


409428 




H$.54457 




1.07 


112 






R« 171DQ 


Intend (iKfnbfflno protdn 2A 


i!o7 


171 


421481 








1.07 


158 


406868 


AAS05445 




iiliiiiuiiuyutJUiiii iiaavj ifWioKiiii yaiiiiiia v 


1.07 


1.23 


412819 


T2S829 


HQ 9^n4ft 


Pl^fiflR Ktnrttnn nrntaln nrofM ir^nr 

rivouD ouiQing piowfli (xtn/iuotH 


1.06 


145 


mo£oo 






Unmn eDftfon« rfUIA PI IIIRflR Ik rinna HP 

nuiilO SBpions cmv\ ruii louo iiSi viono nc 


I.UD 


1 94 


A'iAOA'i 

HOI mo 




Ue ocTiRq 


synoecan *t ^onipniyjywoni fjwiocaiij 


I.VKJ 


117 


448133 


AAT^II**? 
rv^f £J 19/ 


Uc TTTRtl 


KNaio leCBpiOi 1 ^oOUll/ 


1.04 


121 


Ami i 0 

Will/. 


AAUrUOUl 


u« cifiie 
nS.310l9 


CO 1 s, weaxiy similar to ALU/_nuiwVM alu o 


1 FiA 


0 1^ 

a. 


418156 


W17056 


Hs.83623 


nuclear receptor subfamily 1, group 1, m 


1.03 


6.65 


438089 


W05391 




nuclear receptor sutifamlly 1, group 1, m 


1.03 


8.00 


429615 


AF258627 




Ml r^uinotng comcuoi ouiMaiiiuy n \t\D\^ i 


1.01 


1.25 


406722 


H27498 


Hs.293441 


Homo sapiens SNC73 proton (SNC73) mRNA, 


100 


118 


438091 


AW373062 




mnie^r receptor sutiMly 1, gwp 1, m 


0.99 


12.84 


407018 


U49869 




NhL018955:Hofno saf^ns ubiquitin B (UBB) 


0.99 


107 


412898 


AW804157 


HS.30B026 


major hlstocompaSbBl^ oomptax, dass 


0.98 


157 


435523 


T62849 


Hs.11090 


membrarv&^pannbg 4^omaln3, subfamily A 


0.97 


140 


426530 


U24578 


Hsi7862S 


oocnptemenl component 4A 


0.96 


128 


456898 


NM_001928 


HS.1S5S97 


D component of complement (adipsfn) 


0.95 


129 


407241 


M34516 




0b:Huntan omega light chan prcrtein 14.1 


0.94 


1.11 


425371 


049441 


HS.1SS981 


mesolhein 


0.92 


145 


431369 


BE184455 


HSJZ517S4 


secretoiy leukocytB protease inhibitor ( 


0.91 


1^ 


TABLE 35B: 











Pkey: Unique Cos probeset Identifier number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accesskm 



41 1089 5597.6 BI009308 61009893 6Fg22023 BF922909 6F922913 BF922096 BF957733 BE701791 AA456454 AA57g876 BF933710 AA091294 BI007291 

AW905577 AW975593 AA713730 AW836761 AA666384 AA551106 BF594606 AI082382 Ai955808 A1679895 AI679386 BF4355S5 AA586369 
AA5513S1 AAS95622AA5G5188BF808855AAS84921 N86077AA601031 AA63318a AA514764 AA454562 AAS51297 AA936109BI009389 
AW897806 BE815442 BR39374 BI009310 6F925422 BF933709 BF922034 BF925465 BI009680 

4521 94 90339.1 A1694413 AW994700 A)912946 N73548 AI082035 AW271652 W24189 W24182 Ar71971B AAQ24658 AWB10120 AW015394 T797S5 AA988043 
AI709339 

410910 1063929.1 AW810196AW810555AW810507AW810204AW810619AW810534 

412394 1174616.1 AW947794 AW947793 AW947802 AW947798 AW947792 AW984150 AW9841G6 AW984167 AW984168 AW984179 AW984134 AW984160 

AW984180 AW984194 AW984202 AW984190 
413682 1S27038J BEt56943 BE15737S BE156965BE156949BE156956 

441320 56978.5 AI346734 AI377971 BG193341 B6S48376 AA928353 AI768724 B6215700 AA449370 61462157 BI060283 BG677508 AA318802 BG719160 
408544 683260.1 AW293825AW235391 

413454 1515217.1 BE141291 eE141306BE1412B8BE141283BE141162BE1411686E141290BE141161 6E141165 

444404 16138.1 60013183 AW408658 NM_002119 M3152S M26039 6M455399 BF7323ei 8M152457 AW407685 BM193161 AW407778 81819141 AA7022S4 
6FB5S074 81761232 
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436083 

444346 
424281 
423S01 
414991 
411673 

452203 



5483^1 

47906.14 
892055 1 
887384.1 
1785138J 
1096744J 

2630J 



458332 
431594 
414372 
459644 
400269 

43X15 



1139685J 
1236764J 
48421 U 
662385.1 
2726_1 

713L2 



443951 MH1768J8 



413497 
434747 
408782 
419200 



1518C02J 
117643J 
0.0 
9531.1 



438089 22448.4 
438091 22448.1 



AK000028 AA494483 A1298674 AA720n3 AV761529 A1884670 AI936202 AW2942S 061652 BFB81184 AV711384 N27154 A192e970 AV734970 

N40094 N285g6 AA684747 AA512890 BG436S93 

AI147991 AI142274AI198553 

AA338252AA338213 

AW962691 AA333006 AA332289 

D78831C17898 078883 

BF330730 BF350539 8E153665 BE065082 BE064850 BE064663 BF330763 BE153620 BE064737 BE155079 BE064651 AW8S6751 AW856622 
BE064691 6E1S3674BE153698BE064730BE153536 

BC014081 NM.000593 )CS7522 L21208 L21207 L21206 L21205 121204 AL561404 A1546423 AL560492 AL556882 AL541576 ALS50654 BI823519 
B)770023 AL554969 BI489906 AI304693 AW295947 BM146642 X57521 BG820143 BE898390 F06770 F12630 BM423610 A1561518 BM009470 - 
B6742981 AA279685 AA847441 AA313737 BF172639 BFB97216 BF914190 BF903647 870277 A!SG9694 AWD732g6 AI381433 AA564644 
AA487429 BE658232 AA838610 AI539114 AI719375 AI829129 BG057675 AI423422 AU158860 BE300655 AW1707n AA586956 AL571889 > 
Al556850AL576404AL582800Bt256544 BF342301 BG875994AA054458AA353161 AI940434BE816S22AL57Z636A147g650AW1 50377 ' 
AU154395 AW951 271 AI032220 AI819778 AI345733 AW771 150 AW512525 AI249904 AA279809 AI352549 AW512517 BG056280 AA521222 
BE271141 AL581932AL541575BI819184AV660ig0ALSS6475 AI620020 AW069888AW079179 Z21518AA687601 F04651 A1783961 TS7198 
AI433367 n8652 A1^968 AA365648 AL582619 BE874601 BF804669 A1374458 BM145S02 AI2G5514 AI536823 A!4^^ 
AA487637 AA031844 AA535221 AW794256 AW381447BE768S05 A1682892AA830969 AAB62356AA6S3084BM009154 AA135727 H05927 
H23433 R42244 HTmt AW366665 AW366601 AA676742 AL556474 AA135770 BE774050 BF914200 H88457 AAfi27746 61560216 BI753586 
AI000341 AI766341 AWB73274 

AIB23999 AA970060 AAS08176 AW972585 AI873427 AW972369 
Bt093452AW970885BG118285AA569075AA492132AW753140AA213770AA143654 
W03900 AW197203 AW753300 

X65018 BC022318 NM-003019 BE465C60 AI732255 BF446634 AI820677 At002217 AI924488 BI821373 BI77O406 BI823937 BI820265 81489632 
BG48291 1 AA617783 AI807697 AW205576 T94427 AM87101 T94513 BI819407 B1B22450 81820618 BI824819 BG542824 BG537862 
BC017171 BC012195 NM.007126 AF1 00752 All 37377 Z70768 BM474865 BG754806AU 124376 BG757203 BG764420 BG775028 BG824418 
BM045810 AU120387 B6770238 B6686740 BG913323 BI759980 BG395998 BM048875 BE881070 BE313689 BE879144 BM309834 AW245847 
A1770171 BF196861 BE856897 AA463876 AI375927 AA648810 AA948193 AA490916 AI459893 AI458188 AI24G40B AI191843 AI131029 AW768399 
AI365196 AW337g84 AW0261S0 BE4665gi BE674599 AI818438 AA772197 A1651927 AW151 143 B!198825 BG819083 BM4S8764 BE903567 
BE732715 BM043200 BE900263 BE900706 6E731097 BE390023 6G87S384 BF9g8406 BFg88930 BM475542 AW24621S BE501897 BEg03610 
6E561530 6E560537 BE903782 BE732947 Bt227204 6G761305 BE262642 BE391848 BE382475 BG008256 81547991 B1459099 BE391391 
6E259420 BE298109 AW245422 Ai423847 A)914618 H80534 BE301004 AL531791 AI435581 BR93112 AL577303 AA373265 BE746965 BF743630 
BE879296 AI359493 Bik/)018598 AI689260 AW072450 F20201 AW151405 AW517572 AA773468 BG259694 BE391163 BG621529 A1421728 
BG767231 BM4629S3B6340524W52648AA113434BE785431 BI041981 B6832385 BG25316B BG7S9470 BF369329 8F981332 6E2S9418 
B065738 BI091658 N72S12 W5B732 WB56gO BG9569B9 AI205206 H19721 W17051 W77958 61262010 AA844319 W74143 W72214 N85194 
BE734033 BG164099 AA931089 F13645 R41394 AK025758 BG180977 BE349455 AA812018 AA740241 AI027722At1 50356 AA886395 AW977627 
BE220225 AAB84082 AW5181 14 AI243844 AA809493 AA481029 AA82571 8 AI347866 AI431 670 AA814436 AI251 109 R07704 AA765606 AA724593 
A1916399AI537550AA491103 AW008188 R07703AA9d9120AA746235 AW028983AA789102 AU185751 AW971465 AA48g681 AW971893 
AW612086 BE077936 BI860809 BE002760 BG746251 BE982912 BM454584 AL134894 BF104082 Ha0591 

AI334106 R63583 A1028079 AI140098 AI91 1625 AI890637 F34815 T65959 N40935 V^2768 AA854747 AA861945 AA876472 AA778270 W32249 
AA026081 W52662 W15352 W79670 W95384T94283 AA002155 R820S2 8E625493 BE825520 
BE177561 H062158E144709BE144829 
AW976537 AI033582AA837085AA745261 AA648395 
AA430373 AA968771 

6F036043 AW190446 B6194731 AW662036 AI445021 BE937550 AW818972 AW393132 AA834685 6F1 12058 AV721682 H16423 AI270167 
A1857345AA937302AW818444BE929780BG49867BBF155010B1598271 BI599811 8E16172BAW578737 AW753711 AW379707 AW381918 
BG506608 AW028637 AW994240 BF887392 BR90073 AW381624 AV727105 BF439618 AA443174 AI01B009 N42850 AW573242 AI417258 
AA463483 AI676131 AI167170 AAB36627 AA443828 AW592922 AA235129 AA730278 AW439082 AW474332 BI043239 AW474342 BG708553 
AW362423BF090028 BE827256 R16550 R39478 R39479 R94368 BGS40916 BM314745 AA251087 D54231 D55274 8F085805 031589 AW966405 
AW994425 081879 BE093545 AW901 1 07 AA383529 81021 552 R56420 N39976 AA573281 H82595 AA234955 BE093539 AW367006 8F358597 
BF366318 AA663856 BE702099 BF035969 AI267384 AI267232 6E34S320 AA621574 AA861212 6F083343 BF083341 AV745131 053074 
AW954476 AW954472 AA376836 AV724531 053063 C14926 AA0g3287 AA062638 BG483558 BEg40050 AA765954 T701 71 BE938775 BE940057 
053502 AW373300AL118798 BM128728 AA193411 AW444709 AW952455 A1887612 BF431948 81496876 AI264159 8M128481 A1624657 
A1689301 AI969467 AA8616B5 AA251595 AA625761 AA8720gO A)8267g0 AA328366 8E827416 R75951 D5691B R6B122 BE827384 AL118797 
AI184164 AA1&4411 81495332 BE858113 A1B63860 K00660 T69849 AW780389 C14667 BE934995 B[018652 R92801 AA164410 H00752 
AW373305 AW373299 AW373302 

BM475665 BE644917 AW770789 AW952971 N64883 BM263259 AI224545 All 84866 N69114 AW518902 A1440169 AA809472 AV654440 
AA281642 AU185230 AW337382 AI872g23 AI5371 1 3 N73882 T83378 H63731 BF671764 AW897824 AI81 1204 AA344646 BE0091 12 BG899664 
H91240R60548N41701 

AK054860 AV6521 98 AV652192 AV6521 38 AV652127 AV652194 BE935919 AV65201 7 AV651995 AV651548 AV646063 AV^^ 
AV646179 AW8B0409 AA345002 BF155169 BE068931 X56197 A1603014 AW953629 BM263546 8E5S0772 AA701084 AI681352 AA358889 
AW938841 BF438147 W05391 H753136P326185AV64633SAV651589AV646340AV651992AV646384AV646364AV687497BF155183 
AV646370 AW797876 A19C6821 X56196 BE833e35 AA628440 BE833808 8F224205 AA7091 26 8E673807 AI923886 AA947932 A1276125 AI185720 
AW510698 AA987230 BE467708 AW898628 AW898544 A1146S84 AW043642 A)286245 AI186932 A1635262 A)139455 At298739 AI813854 
AI024768 BE&99445 BE699444 A)707807 052654 AI214518 AI004723 At698085 AW087420 AI565133 AA845571 AW69B622 8F1 101 44 AW513280 
AI061 1 26 BF362770 AI268939 AI435816 6F47S318 Ai024767 8E174213 AA757S98 AA513019 AA902959 A1860794 Ai^ 
BM310S32 AW513771 AI951391 AI337671 BF095606 BF095601 BFD95468 AW890091 BF09S753AW243400 AW898607 AW898816 BF362762 
A)922204 AW898625 BE699468 BE174196 AW102923 052715 BE699456 052477 OS5017 BF95S933 BG623563 AV6462&4 AA463522 81003244 
AI299190 W40186 8E174210 8F939091 BF4341B0 AW579001 T55662 H01811 T52522 BF945037 BF955938 054679 053933 R67100 BG925552 
BF9g9056 RB3430Z29922T85791 W03942 H63289 A1091537 BF086583 AA345570H48870 H80720 T83523 61039626 61037700 R00353 
6F155164 N38343 N79072 H01812 T55581 



TABLE 35a 



Pkey. 
Ref: 

Strand: 
NLposItion: 

Pkey 
402550 
403244 
405452 



Ur^ue number corresponding to an Eos piobeset 

Sequence source. The 7 d^ numbers bi this column are GenbankhIen6fier(GQ numbers. *Dunham, eld.' refers to the pubTicafion entitled "The ONA sequence of 
human chromosofne 22" Dunham, el al f 1999) Nature 402:489-49S. 
Indlcales ONA strand Item which exons were predicted. 
Indicales nucleotide positions of predicted exons. 

Rsf 



Strand Ntjnsi&m 

Minus 8041U0873 

7637828 Plus 175792-176144 

Minus S387&^27S 
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wo 03/042661 



PCT/US02/36810 



401447 


8574299 


Mhnjs 




405097 


8072599 


Plus 


171191*171360 

If 1191* If iMtfV 


400712 


81 18874 


Plus 




403478 


99582K 


Plus 




405827 




Plus 


11)970.10977 


405075 


7770506 


MiniR 
nunus 




406267 


7528342 


Mnus 




402240 


7690131 




1(U.')A9.1 04^97 1flS13&1flfi379 


404811 




rlUS 


£D*l£<^&D39(^4C0O3v>£D9Of 


403589 


8101229 


Rus 


&>330 




9958257 


Phut 




401697 


8569218 




604-767 


405453 


7656675 






402516 


9798099 






406266 


7528342 


Minus 


2365-2518 


404696 


9800109 


Minus 


60037-60144.62675^081 


403533 


8076874 


Phis 


162922-163658 


406303 


8575858 


Rus 


173622-173788 


404240 


5002624. 


Minus 


116132-116407,116653-116922 


404056 


3548765 


Phis 


75843.76980.77146.78263 


402736 


9212044 


Minus 


66876-67010 


405545 


1054740 


Plus 


1186n-118807,119091'119296,121626-12182 


405441 


7408124 


Phis 


100952-101283 


400680 


8118752 


Rus 


118343-118684,120720-121013 


404Z77 


1834458 


Minus 


91665-91946 


404854 


7143420 


Plus 


14260-14537 



TABLE 36A: About 52 genes upregulated in noivspedfic bitersGllat pneumonitis felaiive to hypeisensitivlty pneumonilis or Idiopalhic putntonary fibrosis 

Pkey. Unique Eos probeselidentiiier number 

ExAocn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene TltlK Unigene gene tide 

R1 : 90th percenllie of NSIP Ate divided by 90tii peruenble of HP Ats, where the minimum value for the numerator and denominator was set to 50. 

R2 90th peioenliie of NStP Als divided 90th percentile of IPF Als. where the minimuin value for the numerator and denominator was set to 50. 



Pkey 


ExAocn 


UntgenetD 


Unigene Title 


R1 


R2 


435140 


AA658123 


Hs.134170 


ESTs 


Z76 


276 


429504 


X99133 


Hs.204238 


lipocaOn 2 (oncogene 24p^ (NGAL) 


157 


1.00 


435375 


AI733610 




ESTs 


Z55 


255 


420813 


X51501 


Hs.99949 


pn)iac8n4nduced protein 


Z55 


1.35 


425071 


NM.013989 Hs.154424 


delodinase, iodothyronme. type II 


252 


0.73 


421296 


NM_002666 Hs.103253 


periiipin 


Z50 


245 


419290 


AI128114 


Hs.112885 


spinal conj-derived growth factor-B 


243 


1.79 


408882 


H12084 


Hs.31110 


ESTs. Weakly simOar to MAGE-B4 IH^sapie 


242 


1.77 


437318 


AW362939 


Hs.120721 


ESTs 


236 


1.61 


421823 


N40850 


H3.28625 


ESTs 


229 


0.56 


412228 


AW503785 


Hs.73792 


oomplemeni component (3d/Epstein Barr vl 


228 


0.89 


430538 


AI809163 


Hs.990a 


nitrogen fixation duster-lile 


225 


280 


414009 


R57516 




ESTs 


219 


1.86 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteopontin, 


219 


0.33 


430699 


AW969847 


HS.29Z718 


ESTs. WedUy similar to RET^JIUMAN RETIN 


218 


1.24 


413722 


BE247354 


Hs.16400 


ESTs. WeeMy similar to KIAA1 435 proUn 


214 


1.95 


433874 


AW204429 


Hs.155033 


ESTe 


213 


1.72 


429609 


AF002246 


Hs.210863 


cell adhesion molecule with homology to 


212 


0.91 


414290 


AI568801 


Hs.71721 


ESTs 


211 


0.81 


451678 


AA3741&1 


Hs.26799 


DKFZPS64D0764 protein 


211 


1.01 


408785 


AA588061 




gb:nk10d03j1 Na_CXSAP_Co2 Homo sapiens 


210 


1.61 


449048 


245051 


Hs.22920 


similar to S68401 (cattle) glucose indue 


208 


0.85 


444179 


W35132 


Ks.267442 


ESTs 


208 


1.13 


430223 


NM.002S14 Hs.235935 


nefrfuobtasloma overexpressed gene 


205 


a80 


451099 


R52795 


Hs.25954 


bitorteuldn 13 receptor, dplia 2 


204 


0.79 


439134 


AAB30599 




ESTs 


204 


1.69 


418512 


AW498974 




diacytglycerd kinasa. zeta (104kD) 


202 


202 


457311 


A}497811 


Ks.172753 


Homo sai^ens chromosome 19, BAC 41195 (C 


200 


1.45 


402274 






q9D0O498*:9l)4567179)gblAAD23607.1|AC0O 


1.88 


224 


453222 


AA033929 


H5.19156 


ESTs 


1.77 


200 


447261 


NM.006691 Hs.17917 


extracdhdar link dom^vcont^nlng 1 


1.73 


202 


427297 


AW292593 


Hs.334907 


Homo sapiens, done MGC:17333, mRNA, com 


1.69 


1.69 


408714 


Ai2ig304 


Hs.266959 


temoglobin. gamma G 


1.62 


247 


418333 


W92113 




Ob2h48e01i1 SoaresJet3Lriver_8p)een. 


1.58 


2.04 


404090 






Target Exon 


1.48 


203 


444445 


AA342329 


HS.11S920 


Homo sapiens cONA: IU22816 fis, done K 


1.39 


206 


414386 


X00442 


Hs.75990 


haptoglobin 


1.09 


1.44 


439372 


AF088033 


Hs.159225 


ESTs 


1.05 


213 


432606 


Nlk^002104 Hs.3066 


granzyme K (serine protease, granzyme 3; 


1.01 


208 


412472 


AW975398 


Hs.293836 


ESTs 


too 


226 


432894 


AW167668 


H3.279772 


tvain specific protein 


a97 


1.19 


422060 


R20893 


Hs^5823 


ESTs. Moderately simiiar to AUJ5JtUMAN A 


0.96 


206 


416971 


R346S7 


HS.B0656 


uncoupOng proletn 2 (ndtoChondrial, pre 


ags 


1.08 
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424310 


AA338648 


Hs.50334 


testes development-related NYIVSP22 


a93 


1.47 


422109 


S73265 


Hs.1473 


gastriiHeleasing pepQde 


a92 


3.05 


420440 


NM_002407 


Hs.97644 


mammagiobin 2 


a9t 


111 


418198 


AI745649 


Hs.26549 


KiAA1708 protein 


0,90 


Z25 


418707 


U97502 


H5.87497 


tHJtyrophilin, subfamily 3, member A2 


a89 


1.11 


419231 


AL046294 


H5.138245 


ESTs. Weakly similar to T17227 hypotheO 


0.85 


1.74 


446608 


N75217 


H5.2S7846 


ESTs 


0.82 


Z10 


418916 


X07B71 


Hs^TB 


C02 antigen (p50}. sheep red blood cell 


a73 


1.32 


412610 


X90908 


Hs.74126 


^ acid bindbig protein 6. Seal teas 


a70 


1.76 



TAB1£36B: 



Pkey. Unk}ue Eos probeset identilier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

435375 130020J AI733810 A1049989 AA678769 
414009 438978.1 BE221268 R6751SAV730582 R67516 
406765 CLO AAS8B061 
439134 2561476 1 AA830599AA970659AA883802 

418512 12225.6" BM046773 AA224297 T33786 T08951 T09Z74 T08592 T3093B AA350905 

418333 73080.1 AF264624 AW668618 AV73144S R93353 AA5B4550 AV732726 BF602614 BF434359 AA077092 B1027317 AA199B12 AW629027 AA831618 AI124782 
AA765804 AA055698 AA677404 AA055366 AAB89402 AA765530 aE503126 BE467367 AW139964 WB1697 AI887646 W81696 AA447817 AA447667 
F13631 BVQSSSn AW268271 AW088477 BF677839Ali016S9 AW502118 AW502624 AA574189 B1020104 



TABLE 380: 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source, The 7 digit numbers in this column are Genbank Menffier (Gi) numbers. 'Dunham, et al.' fefers to the pubBcaGon entitled *The DNA sequence of 
human chromosome 22* Dunham, et aL (1 999) ftafajra 402:469^95. 
Strand: indicates DNA strand itm which ewns were pretScted. 
NtjX)SQion: indicales nudeofide positions of predated exons. 

Pkey Ref Strand Ntjxsilion 

402274 2935596 Ru8 5604-6527 
404090 9967460 Minus 100815-100966 



TABI£ 37A: About 206 genes downregulated In lung fibrosb relative to norma) htng 

Pkey: IMque Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnlgenelD: Unigene number 
Unigene Title: Unigene gene title 



R1: 


90lh percentile of nornial lung Als divided by the median of IPF A!s, where the minimum value kx 


Pkey 


ExAccn 


UnlgenelD 


Unigene Title 


Rl 


454229 


AW957744 


Hs.278469 


lacrimd prbOne rich protrin 


11.67 


432128 


AA127221 


Hs.295502 


ESTs 


9.86 


421218 


NM_000499 


Hs.72912 


cytochrome P450, subfamily 1 (aromatk: c 


7.69 


453310 


X7D697 


Hs.553 


solute canter family 6 (neurotransmitta 


7.32 


420958 


AA309431 


Hs.66 


interteukin 1 receptor-IB(e 1 


7.13 


402608 






Homo sapiens defensin. alpha 1, myetaid* 


6.67 


406714 


AI219304 


Hs.266959 


hemoglobin, gamma G 


&40 


406673 


M34gg6 


hls.198253 


histocompatibmty complex, class 


&22 


416539 


Y079D9 


Hs.79368 


epithelial membrane protein 1 


5.04 


418021 


M15881 


Hs.1137 


uromodulin (uromuocild, Tamnv41orsfall gty 


4.77 


409385 


AA071267 




0b:zm61g01.r1 Strategene fibroblast (937 


4.74 


450847 


NM„003155 


Hs.25590 


stannlocaldn 1 


4.46 


404518 






CD83 antigen (acGvated 6 lymphocyteSi i 


4.36 


413951 


AW061200 


Hs.75640 


natriuretic pepSde precuisor A 


4.32 


407570 


Z19002 


H8J7098 


zinc finger protein 145 (KruppeMSte, e 


4.25 


456525 


AW468397 


Hs.100000 


S100 callduni4)ind!ng protdn A8 (calgrBn 


423 


429509 


AW614420 


Hs.204354 


res homotog genefanily, member B 


4.14 


445769 


AI741471 


Hs.23666 


ESTs 


4.10 


414002 


NM.006732 


Hs.75678 


FBJ murine osteosarcoma iriral oncogene h 


4.06 


42S571 


AJ0072g2 


Hs.158306 


ephrfn^ 


3.92 


423168 


R34385 


Hs.124940 


GTP-blndIng protein 


3.80 


401234 






mltogen-activated protein kinase 8 Inter 


3.78 


402181 






Target Exon 


3.77 


403479 






NM.007064:Homo sapiens sertneAhreonlne 


3.68 


435424 


AW083883 


Hs.37896 


Homo sapiens cOHA FU13510 Us. ctonePL 


3.68 


402911 






NM.021158':Homo sapiens prolebi kinase d 


3.66 


442195 


Kli4.001430 




endothelial PAS domain protein 1 


X65 


400089 






Eos Control 


3.60 
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413948 C05145 Hs.75536 myosin OgMch^ 2a 3.S6 

438564 AA3815S3 Hs.19a2S3 i najof Ms beoinpataCHy complex, class 3.54 

413956 A1821351 H5.193133 EST8.WeaWy6lm!laf toALU7_HUMANALUS 3.54 

_ 431319 AAB73350 Hs.302232 ESTs 3.52 

5 434292 AF124368 H&306551 Homo sapiens IMAGE Consortium ID 839832. 3.46 

401S40 NM_002B75:Homo5api8nspn)myeIocyIioleu 3^46 

426477 AA379484 Hs.154073 eb:EST923B6Sldn tumor I Homo saptenscO 3.43 

402328 Target Exon 142 

401590 TBTBelExon 3.42 

10 403545 NM-.024513*:Homosap!ensFYVE and coiled. 3.37 

403376 Target Exon 3.36 

447966 AA340605 Hs.l05887 ESTs^WesMysimitartoHomotogofralZ 135 

417696 BE241624 Hs.82401 CO69anagen(p60.eai1yT<eUactivaa 3.28 

413719 BE439580 Hs.754d8 smaH inducible cytokine subfamily A (Cy 3.27 

IS 401126 NM_006856*:Homo sapiens activaiK^ trans 3.27 

408243 Y00787 Hs.624 intsrieuMnS 3.23 

412429 AV650262 Hs.75765 GR02oncogene 3.22 

426420 BE38380B Ks.322430 NORG family, member 4 3.21 

449333 H73444 Hs.394 adrenomeduDIn 3.19 

20 401904 Target Exon 116 

401919 NM_012448-:Homo sapiens signal transduce 3.14 

406443 ENSP00000236574*;HypothetIca!21.8kDap 3.14 

458232 BE217872 Hs.279537 ESTs 3.12 

406016 TaigelExon 3.12 

25 450912 AW939251 Hs.25647 v-fbsFBJ murine osteasarooma viral onoo 3.11 

451831 NM 001674 Hs.460 activating transcription factor 3 3.08 

450562 AW136468 Hs.202199 ESTs 3.07 

405938 Target Exon 3.04 

451029 AA852097 Hs.2S829 las^ielated protein 3.02 

30 421201 AW241940 Hs.102500 hypothetic^ protein RJ20481 ^98 

439839 AA689354 ESTs 2.98 

439891 AL389940 Hs.109968 ESTs 2.98 

418935 T28499 Hs.69485 cartionic anhydrase IV Z95 

418853 NM_005236 Hs.89296 excision repair crossKXxnplemenling rode 2.95 

35 429113 D28235 Hs.198384 prostaglandin^doperoxfale synthase 2 (p 194 

410326 A1368909 Hs.47650 ESTs 188 

407244 M10014 fibrinogen, gamma polypeptide Z8S 

459721 AI299060 Hs.143835 gbain14d12j{1 NCLCGAPJLuS Homo sapiens 2.84 

416212 R40290 Hs.124685 ESTs 184 

40 428686 AA431801 Hs.98764 ESTs. Weakly stmQar to A29861 acfin gam 183 

437508 A1204354 H3.121347 ESTs 182 

437990 At686579 Hs.121784 ESTs 182 

443709 AI0B2692 Hs.134662 ESTs 181 

423099 NM.002837 Hs.123641 protein tyrosine phosphatase, receptor! 180 

45 416188 BE157260 Hs.79070 v-myc avian myelocyiomatais viral onoog 179 

404231 Target Exon 178 

434305 A!01828O Hs.l30189 ESTs 177 

445493 AI915771 metallolhtonein IE (functionai) 176 

_^ 418056 AA52488S gb:nh34ID2.8l NCI_CGAP.Pr3 Homo sapiens 175 

50 404102 Target Exon 175 

440206 AI762232 Hs.46794 ESTs 175 

403031 cathepsin D Oysosomal aspartyl protease 175 

413164 BE058494 gb:MRl-0TO371-O5O5OOWa12BTO371 Homo 2.74 

459330 C16931 gb.'C16931 Clontech human aorta pdyAmlW 174 

55 456967 AW0040S6 Hs.168357 T-box2 174 

427602 AI375258 Hs.98005 ESTs 174 

431367 Z20964 K8.323817 DKFZPS47E1 010 protein 172 
406059 Target Exon 171 

420575 BE263301 Hs.99029 CCAAT/enhancer binding protein (C/EBP), 171 

60 457275 AA463422 Hs.209431 ESTs 171 

432559 AW452948 Hs.257631 ESTs 171 
402483 NM.020389:KorTH> sapiens putaI^e capadla 170 

416069 R37101 H3.20982 ESTs 170 

445445 Al=238870 Ks.275706 Homo sapiens clone GI^H4 similar to gQ 169 

65 43G232 AA707006 Hs.187863 ESTs 168 

416773 T39748 H8.325474 TargetCAT 167 

434038 AA622104 ESTs 167 

405448 Homo sapiens mRNA; cONA DKFZp586l2022 (f 166 

404439 ENSP00000067222*:MItDChondriat28Sribos 165 

70 43S724 N39308 H3.117898 ESTs 165 
404026 Target Exon 165 

400881 NM-025080:Homo sapiens bypotheOcal prat 164 

430314 AA369601 Hs.239138 pre^lcoiony-enhandng factor 162 
405429 Target Exon 162 

75 402642 C1002296:gi|66778171ref|NP.033126.1|rep 161 

436575 BE304709 Hs.146550 myosin, heavy polypeptide 9, noiHmisde 161 

449293 AA001088 Hs.29739 ESTs. Weakly slrrdlar to C34323GTP^ 161 

418157 NM.003243 Hs.342874 transforming growth factor, beta recepto 160 

446122 AI362790 Hs.278639 K1AA1684 protein; likely homolog of mous 159 

80 433291 AR)07191 gb:Homo sapiens SIB 276 Intestinal mucin 159 

426795 AI810474 Ks.196945 ESTs 158 

423503 M92843 Hs.343586 zinc finger protein homologous to Zf>^ 158 

430768 AB030207 H8.247888 guanine nucteoSde binding protein 13. g 158 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



Avmn 


AJ012074 








HS.13Q907 




R43409 


Hs.6829 








*Hi (W 


AA262294 


Hs.180383 


AtXRJA 


oc0o5941 


*Klf Ml 


AA70o337 


Hs.1 20802 




X75042 


Hs.44313 




AWAAACCA 


Hs.35380 




BE25i104 


Hs.113052 




AW5024o3 


HS.19G521 




AiooTZoO 


Hs.184270 








AJ243425 


Hs.32^ 


412851 


A!826502 


H3.97269 


419968 


X04430 


H5.93913 


409209 


AA460160 


Hs.73217 


•Wf 1/0 


AVv449385 


Hs,157294 




Ai908639 


Hs^467B1 


41 01 DO 


R85350 


Hs.101368 


417295 


AW993524 


Hs.43148 


406305 




AntlQR 
4Z/000 


AA417083 


Hs.104789 


436409 


AJ2a8982 


Hs.183656 


413861 


BE 175424 




<HIOOU9 






402594 






AMtttVi 








AA503115 


Hs.1 83752 


AKSlKCa 
43oODO 


AI769067 


Hs.1 27824 




AW57293B 


Hs.130580 


4U3J00 


AA071u59 




4U0I3O 






4401D£ 


ALu39531 


Hs.323363 


lUOOa • 














a51501 


AAAJtf\ 

Hs.99949 




Au07d728 


Hs.8867 










AAl&wOD 


^A<%AA 

ns.72308 




Y13187 , 




445878 


AI262974 


HS.145S87 


^A^Q7i; 






KUiZZO 


Hs. 185679 


<HJU01 J 






RQ2740 


Hs.1 37555 




R59141 




U26403 


H5.37142 




AVy^1504 




458637 


AV657446 




449125 


AI671439 


K5.196029 


418922 


AWg68580 


Hs.42699 


402404 




AOioa^ 


R22497 


Hs.1 10571 




nr*^A4A jf* 

BE243845 


Hs.75511 


19D000 


AP035S2O 


Hs.153883 


4Z0l9d 


fA^004235 




422166 


W72424 


Hs.1 12405 




BE2o4974 


Hs.6566 




A1925668 


Hs.222312 


406564 




403581 






403718 






*H#*»f 00 








W73158 


H5.170434 




AI664561 




400007 


AF253468 


Hs.3736 


4^0000 


AA530892 


Hs.171695 


402051 
409859 


AW501926 


417987 


BE244373 


Hs.1119 


405083 




405163 






402386 






406755 


N80129 


HsTI 99263 


40S811 


AW500896 




454034 


NM_000691 


Hs.575 


400489 






428704 


AA432007 


Hs.1920gO 


429307 


AU076592 


Hs.198951 


400116 






404795 






408053 


AW139474 


Hb.246862 



vasoactive inJestinal pepCde receptor 1 
ESTs 

Homo sapiens mRNA tor KIAA1644 protein. 
C19001662':gi|5753872lJBf|NP^034345,1| I 
dual speciliaty phosphatase 6 
9b:RC3.BT0319.100100^12^Jl2BT0319 Homo 
ESTs. Moderately simaar to A26641 Na?ex 
>Mel avian reOcutoeAdotheliosis vbal 
X 001 protein 
RNAcydasehomolog 
ESTs 

capptng protein (adin filament} muscle 

Target Exon 

early growth response 1 

ESTs 

inteileuldn 6 (inteiferon, beta 2) 

ESTs 

ESTs 

ESTs 

ESTs 

epitheHai menfit)rane protein 1 
transcriptional adaptor 3 (A0A3. yeast h 
ESTs 

VNN3pratain 

gb:RC44m)578-17030(W12^101 HT0578 Homo 

C3000142*:gil45030151rBfJNP 00390ai|co 

C1002603':gII9887091|gb|AAG01738.1|AF248 

NM_001397:Homo sapiens endothelin conver 

microseminoproteln, beta- 

EST 8, WeaMy slmflar to T28770 hypoflief! 

ESTs 

gbamSealO/l Stralagene neuroepltheHum 

NM_003213*:Homo sapiens TEA domain famil 

hypotheOcaf protein FU22169 

Target Exon 

Target Exon 

prolactin-induced protein 

cysteine^h, angiogenic inducer, 61 

C11001794^:giI10946612|r8nNP.067286.1| 

ESTs 

Homo sapiens dmd gene, Intron 1 1 
ESTs 

uncharacterized hypothatamus protein HTO 
ESTs 

Target Exon 

putative chemoMne receptor; GTP-faimfing 
gb:yg96d1 in Soares Infant brain 1N1B H 
ephriivA5 

gb:U|.HF-BP0p-ajd^fO4.WJIj1 NIH_MGC 5 
gb AV657446 6LC Homo sapiens cOI^ clone 
Homosaptons mRNA for KIAA1657 protein. 
ESTs 

NM^02496r:Homo sapiens hypothetical pro 
growth arrest and ONA^iamage-lndudbta, 
oonnectNe tissue growth factor 
MAO (mothers against decapentapleglc, Or 
Kmppel-Eke factor 4 (gut) 
S100calcium4)inding protein A9 (ealgran 
thyroid hormone receptor interactor 13 
ESTs 

msh (Drosophib) homeo box homolog 2 
Target Exon 
Target Exon 
Target Exon 

Homo sapiens cONA RJ14242 lis, done GV 
ESTs 

delt&4ike 4 homolog (I^rosophiia) 
dual spedftdty phosphatase 1 
Target Exon 

gb:U».HF^R0p4||p4.08^WJIj1 NIKJ«3CL5 
nudear receptor subfamily 4. group A. m 
Target Exon 

C500056r9iI7513700(pfrirri4l51 Invpro 
Target Exon 
metaltothtonein 1L 

gbrUkHF-BPOp^-aWWiljl N1H„M6C.5 
aldehyde dehydrogenase 3 tamily. membeT 
Target Exon 
ESTs 

jun B protiHmcogene 
Eos Control 
Target Exon 
ESTs 
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2.57 
Z57 
Z56 

2.56 

156 

ZS6 

155 

2.55 

Z54 

2.53 

Z52 

151 

150 

149 

149 

149 

148 

144 

143 

143 

142 

142 

142 

140 

137 

137 

137 

138 

138 

135 

133 

131 

131 

131 

130 

130 

128 

127 

125 

116 

118 

118 

117 

116 

115 

115 

112 

108 

107 

100 

1.96 

1.98 

1.98 

1.96 

1.96 

1.93 

1.92 

1.92 

191 

1.91 

1.90 

1.90 

190 

189 

1.64 

183 

182 

181 

178 

178 

178 

175 

1.73 

1.73 

1.70 

170 

170 

1.69 

167 

1.65 

1.65 

165 
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414580 BE388918 
428800 M576Z7 
451676 R84n0 
402394 
404818 
436364 
420369 
405530 
402448 
433495 
409020 
405443 



X060g6 
U96769 



AW373784 
AA062549 



gb:60127S386F1 NIH_MGC.aO Homo sapiens c 
Hs.193717 biterteukinlO 

HS.33S38 ESTs.WedldrslnnOartoQxy9aHegulated 
ToigetExon 
Target Exon 

9b:Huinan maorophaoe sAphal-anGtrypsin c 
Hs.97220 chondroadhefin 

CXO0149r:gI|45575431re!lNP.001384.1| ex 

Tsrgel Exon 
Ks.71 atpha-2-glyoopn)tein 1 , zinc 
Hs.21162 retbindin 

TergetExon 



TABLE 37B: 



Pkey: Unique Eos probesel identirier number 
CAT number Gene duster number 
Accesskn: Genbank accession numbers 



1.63 
1.63 
1.62 
1.61 
1.60 
1.55 
1.54 
1.54 
1.53 
1.51 
1.51 
1.12 



Pkey 

409385 
442195 



439839 
445493 
418056 
413164 



CAT Number Accession 



459330 
434038 
433291 
423387 



455674 
413861 
409368 
415405 
409837 
458637 
428193 



448793 
409859 
409811 
400116 



110758J 
15007J 



2594580_1 
423456.1 
286199.1 
1492512.1 



105725.1 
630988.1 
73706J 
261^.2 



1490762^1 
1561647^1 
110612.1 
1872126.1 
915621.1 
395206.1 
430.1 



916430.1 
58948.2 
5269J 



414580 
436364 

TABLE 37C: 



623093.1 
1414.37 



T65940 T64515 AA071267 AA071334 

U81984 NM.001430 BE907085 Bt333232 AI021986 AU138476 C18601 U51626 AU100517 BI054387 AU076970BE786454 BG010080AW377169 

BFggB789 AA368139 R1 1396 T83613 BG006324 BI012404 86001643 BF757957 AL549361 AL544018 BE002870 BE929314 BE090199 A104665O 

B1053717 BE929315 BI054967 BF960055 BF925432 R05421 BF922073 T70331 BI004403 

Ai023587 AA889354 AA&46791 

AV71 1317 A!809938 AI808768 Ai240S93 AI915771 

AW971347 AA524886 AA211537 BF903005 BF357120 

BE068758 BE066745 BE068689 BE068778 BE068529 6E088683 6E068445 6E068392 BE068719 BB)68473 BE068521 BE06B628 BE068422 
BE066618 6E068354 8E068390 BE068414 BE068433 BE068369 BE068384 BE068661 BE068324 BE068301 BE058436 BE068754 BE068329 
BE068672 BE088494 BE06B596 BE068332 BE068347 BE088588 BE068328 BE068493 BE068740 BE068885 BE068759 BE068307 6E06B429 
BE068303 BE068693 6E068374 BE068295 BE068625 BE068302 BE068663 BE068675 BE068579 BE068311 BE068674 BE068547 6E066602 
BE068605 BE068352 6E068306 BE068401 8E068537 BE068552 BE088450 BE068723 BE068393 BE068671 BE068748 BE068317 BB}68447 
6E068568 68)68632 BE068357 BE088330 BE068498 BE068831 BE068540 BE068410 BE068628 BEQ68591 BE068522 BE068676 BE068499 
BE068361 BE0685g8 BE068350 BE068299 6E068580 BE068567 BE068692 BE068321 BE066327 BE068739 BEQ68526 8E068536 BE06B765 
BE068340 BE06B733 BE0&8293 BE068565 BEQ68480 6E068476 BE068761 BE06871 2 BE068706 6E068549 88)68419 BE068383 BE068434 
BE06B418 BE068525 8E068543 6E06B752 BE066550 BE066623 BE068470 

BGS631S2 BF846777 BF849354 BF849359 BF846636 BF849201 BF6493S6 C16931 AA056717 AW864542 AW882724 AA056567 
AJ910738 AW139227 AA932891 AA622104 
AR)07191 AW820706 80978594 BF872238 

LI 3288 AA928785 Ai60891 2 AW872978 AAS65655 AI02291 5 A1304920 AI564366 Ai668793 A)094557 T60038 R72302 K45409 AA508805 R46356 
AA418798 BM129553 6M129126 BM129292 BM128865 A1808418 AI689932 Aie06573 BF43180B AW87298S AW166269 H73241 T16182 AI264547 
R73391 R72085 R72840 T83751 X75299 BF754348 R94105 AW449839 R73300 NM_004624 AI797007 BE045543 BF110021 BF754250TB3923 
AW884084 AAg03896 AA418362 L20235 R72351 H45098 AA961010 R73210 R46451 AW884085 BI022902 BI763932 BI910138 
BE065941 6E065997 BE066003 BE066070 BE066098 
BF352282 6E175424 BE175418 BE175383 
AA071059 AA085201 AA085020 

W18191 R59141 R54142R12130F11352Z42794R)S242 F0792SH21084 R54090R59142 

AW501 504 AW501656 AW503048 AW502449 AW50209d 

AV8577S2AV6S7446 

AF105036 U70663 Nlil.004235 AF022184 AU141767 AU141 1 10 AL040569 D44830 BIOl 1 351 AL575805 AI290876 At014784 AI393429 A126621 1 
AW074303 AA62071 1 BF197792 AW008766 D25944 Ai687397 AA621680 AV714408 BF44690S BM314505 BF514079 BM314197 AA845201 
AWB74084 BE720622 AI127241 AA238239 AI679709 AI6791 35 Al 572470 AAS73434 A)568487 BE049325 AA687950 BG92S989 AI338031 
A)36S073 AJ024576 AA298805 K04001 H45668 BG682146 AU52388 81462361 BG547513 BG896663 BI256661 
Ai936948AI864581AI570841 
AW501926 AW502566 AW501927 
AK057581 AW500982 AW5008g8AW50110S 

D42041 NM.014610 AJ000332 61758702 6G720850 AU141 1 29 AU13071 1 AUl 41 380 AU132402 BM048556 AU127S20 BE2599B4 AU1 28952 
BE614151 AL601516 BM146777 AU128103 BM194094 6E937951 BE696396 AK026997 AK026567 6F969293 6E798100 BI086881 BG166248 
BE877845 BG385414 AW886747 BF093789 AW390159 8F820311 AA421676 AW880845 AW404827 BF726465 BE161190 BE254102 AW406002 
BE161223 AI912055 8F930228 AW374357 AW794531 8E720524 BE933982 BE933858 BE933694 BE933978 6E9336&4 BE933S83 8E1 66557 
BE933874 8E933641 BE933859 BE933626 BE933865 BE933833 BE933864 BE933631 BE933867 BE933634 6E933857 BE933624 BE933883 
BE933650 BE720491 AA420426 BE720410 BE720458 BE720444 BE720411 AW368748 BF874616 BE933498 BE835979 BF9266a7 AW849921 
AVV850026AVV850022AVm9977AW849900BG250251W87689Ail92825A1692824AA426263A)09031^ 

AI53B868 AW276162 AI279916 AA6003ia AI168836 AW662284 At262619 AA293457 BF347442 AA421677 AA658063 AA565510 AA937060 
AI142684 AA788940 AAB27426 AU152614 AI342784 AU148738 AA219664 AA047835 H99450 AA018563 AI073634 BM475120 BG875251 
BG248778 U46372AA383858 AU140356 BG821B91 BF935049 81760656 B1054103 BF98230g BE872215BI257291 AU158469AU160599 
AU152469 AUl 52375 AU152059 AU148575 N32267 AU149554 AI627459 AI719840 AW77901 7 A1291493 AW304181 AW470055 AI086491 
Ai31 1387 AI634232 AI151241 AI288848 AW050588 AW5895B0 A)241353 AI880219 AA039309 AA026517 AA016239 AA01 3444 H86822 R87530 
AAOS8462 N27082Z39679BE544309W52619AA018076AI813668AWl89907A)418104AU15g878AU150087 R21754A^^ 
AU153097AI961344AA018208 W32429 R45344R77453BM470129AU130415BI227374BE298179AW844963^ 
BF850571 AA355641 86747156 ALJ547262AW367941 8E560004 BI116061 86899031 BE560318 BF174177 61051456 6E001967 6E385446 
BF969326 BFB08765 8F684480 86421617 AI940607 AW875483 BE789632 8F808711 81192691 AW904249 BI911430 BE265407 BE730343 
BE397808 81226516 

BG333973 BE385437 BE408833 6E387650 

XD6826 X060g6 86468890 AW951851 W23S62 T28392 H56742 H58030T69205 



Pkey: Unique number corresponding to an Eos probeset 

Ref. Sequence source. The 7 digit numbers In this column are Genbank Mentiiier (Gl) numbers. 'Dunham, et d.' refers to the pubOcaSon entiQed "The DMA sequence of 

human Ghromoaome 27 Dunham, et al (1999) Nahire 402:489495. 
Strand: bidlcates DIM strand ftomwrMchexons were predicted. 
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NLposftkm: Indicates ntxfeotWepodOons of predictoJexm 



Pkey 


Rel 


Slrand 


NU»si8on 


402w8 


99100% 


Plus 


3749S^669 


404518 


8151988 


Plus 


84494^603 , 


401234 


9929642 


Rus 


120173-120337 


402181 


8575812 


Rus 


44974&450040 


403479 


7329292 


Minus 


148389'148533,150678.150809 


402911 


7263904 


Rus 


142689-142979 


401540 


8072433 


Plus 


106838-107310 


402328 


4464283 


Minus 


13758-13922.14558-14752 


401590 


9966320 


Minus 


33547-33649 


403645 


8699714 


Minus 


443^2 


403376 


9369545 


Minus 


108698-108830 


401126 


8699701 


Minus 


68290-68487 


401904 


8671956 


Plus 


60959^1603,6267O«2890.63778^3838 


401919 


9502466 


Plus 


67S3&«7666 


406443 


9280765 


Rus 


8595147327 


406016 


8272661 


Plus 


4134H1940 


405938 


6758795 


Minus 


166671-167411 


404231 


8218035 


Minus 


61077-61322 


404102 


7229900 


Rus 


97685-98018 


403031 


7768597 


Minus 


1308-1416 


406059 


9103984 


Minus 


13856-14004 


402463 


7574980 


Minus 


65578-66119 


405448 


7582529 


Rus 


135347-136532 


404439 


7139680 


Rus 


55316-55585 


404026 


7582549 


Mnus 


79674-79968 


400881 


2842777 


Minus 


91446-91603,92123-92265 


405429 


7321905 


Minus 


51577-51723 


402642 


9958129 


Minus 


125599-125756 


402217 


9795981 


Minus 


21521-21757 


405857 


6758728 


Plus 


26564-26819 


406305 


8575669 


Pius 


108239.108386,112216-112378,115388-11557 


403605 


6862654 


Plus 


9161441718 


402594 


7705170 


Rus 


103082>103414 


402803 


3287156 


Minus 


55923-56033 


405156 


9966228 


Rus 


146733-146860,147899-147961,153127-15325 


403591 


8101229 


Plus 


4201-4833 


406193 


7269992 


Rus 


30183-30G62 


400703 


81 18859 


Rus 


63657-63857.64802-64905 

86096-86605 


404975 


3419864 


^£nus 


400513 


9796593 


Plus 


74613-74823 


402404 


3970932 


Phis 


53154-53280 


408564 


7711604 


Minus 


52788-53013 


403581 


0IU1 lOc 


rUS 


6794-7396 


403716 


7239669 


Pius 


86899-87122 


404758 


7708327 


Minus 


130204-130606 


402051 


8082020 


Minus 


19346-19480.20041-20119 


40SQ63 


7658414 


Minus 


111047-111666 


405163 


9966267 


Minus 


161171-161299 


402386 


9799769 


Rus 


22069-22303 


400489 


8954013 


Rus 


131475-131652 


404795 


4826439 


Rus 


147501-147780 


402394 


9929690 


Plus 


33308-33482 


404818 


2769655 


Rus 


33671-33839 


405590 


6960456 


Rus 


90492-90818 


402448 


9796640 


Rus 


112942-113069,114303-114521 


405443 


7408143 


Plus 


90716-90887.101420.101577 



TABLE 38A: About 207 genes upregulated In lung fSirosbretallve to nonndtiu 

Rcey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number. Genbank accession number 
UnlganelD: Unlgene number 
Unlgene Title: Unigene gene fiOe 
R1: 



Rcey 

437275 
407891 
414020 
410219 
434666 
425009 
406517 
420568 
425873 



90lh peicenOe of lung fibrosis Als dMded by 70th percenti^^ 



ExAocn UnigenelD 

AW976035 H5.292396 

AA486620 Hs.41135 

NM.002984 Hs.75703 

T9B226 Hs.171952 

AF151103 Hs.112259 

X58288 Hs.154151 

F09247 Hs.247735 

N»4.0133go Hs.160417 



Unigene T19e 


R1 


ESTs, WeaMy similar to A47582 gr 


4.28 


en(!omucin-2 


4.14 


small Inducible cytokine M (homologous 


4.07 


occtudin 


age 


T cdl receptor gamma locus 


3.88 


protein tyrosine phosphatase, receptor t 


187 


Taiget Exon 


176 


protocadherin aiphe 10 


170 


transmenibFBne protdn 2 


169 
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10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



438797 


C16161 


Hsj83040 


nypouicucai proffiin rrcuzyu 


3.66 


410315 


AI638871 


HS.152S19 


Homo saftens cDNA: FLJ2K24 ns» clone H 


3.65 


446714 


W73818 


Hs.n0028 


ESTs • 


3.64 


412326 


R07566 


Hs.73817 


smafl indudbie cytoMne A3 (homologous 


3.61 


430259 


6E550182 


HsJ 27826 


RaiGEF-Gke pioteln 3, mouse homoiog 


3.58 


412790 


NNL014767 H$.74583 


K1AA0275 gene product 


3.56 


424338 


W76816 


Hs.49943 


ESTs, We^ similar b S65657 alpha-IC- 


3.56 


412654 


AI093480 




hypotneoca protein RJi1896 


3.56 


414386 


X00442 


Hs.75990 


napioQiooH) 


3.54 


451035 


AU0767B5 


Hs.430 


plastin 1 (1 isofonn) 


3.52 


436473 


A1193122 


Hs,132275 


ESTs 


3.51 


406714 


At219304 


Hs.266959 


hemoglobin, gamine G 


3.46 


414586 


AA3Q6160 


Hs.16488 


lymphocyte cytosolic protein 1 (l-plasti 


3.45 


427274 


NW.005211 


Hs.174142 


colony slimutaiing factor 1 receptor, fo 


3.45 


427527 


AI809057 


H5.153261 


tmmunoglobulin heavy constant rmi 


3.39 


452813 


U54727 


Hs.191445 


ESTs 


3.36 


442831 


AI798959 


Hs.1316e6 


ESTs 


3.35 


427774 


AA278583 


Hs.180737 


Homo sapiens done 23664 and 23905 mRKA 


3,34 


445330 


R52856 


Hs.21691 


ESTs 


3.31 


435001 


AW903849 


Hs.173840 


HUEL (C4orf1)4iteracSng protein 


3.31 


431681 


AK000378 


Hs.267566 


hypothetica) protein FLJ20371 


3.29 


432314 


AA533447 


Hs.312989 


ESTs 


3.28 


435129 


AI381G59 


HS.26708S 


ESTs 


3.28 


407151 


H2S838 


HS.3D1527 


ESTs, Moderately similar b unknown (Rs 


3.24 


422607 


Z4S471 


Hs.1 18884 


stromal c«l-denved laclor 2 


0.21 


421205 


AL137540 


Hs.102541 


netrin4 


3.20 


428582 


BE336699 


H$.1850S5 


BENE protein 


3.20 


423582 


BE000831 


Hs.23837 


Homo sapi^ d}NA FU1i812 lis, oone HE 


3.19 


424880 


NM_000328 H5.153614 


retinitis pigmentosa GTPase regulator 


3.17 


421233 


AA209534 


Hs.284243 


telraspan NET-6 protein 


3.17 


429350 


AI754634 


Hs.131987 


ESTs 


3.16 


428727 


AF078847 


Hs.191356 


general transaction factor 11 H, polype 


3.16 


4348S0 


Z43161 


Hs^14 


301(Dap(Ot^ 


3.13 


414602 


AW830088 


Hs.76550 


Homo sapiens mRNA; cDNA DKFZp564B1264 (f 


3.12 


446506 


A)123118 


Hs.15159 


ctemokIne4ike factor, alternatively spl 


3.11 


416114 


AI695549 


H5.183868 


glucuronidase, beta 


3.10 


435669 


AF255910 


Hs.54650 


junctional adhesion molecuia 2 


3.09 


444212 


AW503976 


Hs.10649 


basement membrane-induced gene 


3.08 


422442 


AA324998 


Hs.147066 


signal bansducer and activator of trans 


3.08 


442870 


N45018 


Hs.8769 


hypothetical protein DKFZp761J17121 


3.08 


424456 


AA341017 


H8.25549 


hypolheOcai protein FU20898 


3.07 


429673 


AA884407 


Hs.211595 


protein tyrosine phosphatase, non-recept 


3.07 


445107 


AI208121 


Hs.147313 


ESTs, We^ similar to 138022 hypotheH 


3.06 


438828 


AL134275 


Hs.6434 


hypothetic^ protein DKFZp761F2014 


3.04 


428106 


BE620016 


Hs.182470 


PTD010 protein 


3.04 


428403 


AI393046 


HS.3261S9 


teucine rich repeat On FUl) interacQn 


3.04 


431830 


Y16645 


Hs.271387 


small inducible cytokine subfamSy A (Cy 


3.03 


417512 


X76534 


Hs.82226 


glycoprotein (trartsmembrane) nmb 


3.02 


423067 


AA321355 


Hs.285401 


colony stlnfUilating fxtor 2 receptor, be 


3.01 


437457 


AA757900 


Hs.270623 


ESTs, Weakly similar to S65657 aIpha-1C- 


2.96 


415000 


AW02S529 


Hs.239812 


Homo sapiens serologicany defined breas 


2.96 


437145 


AR)07216 


H5.5462 


sohite carrier family 4, sodium bicarbon 


2.98 


418838 


AW38S224 


Hs.35198 


ectonudeotUepyrophosphatase/phosphodi • 


2.96 


419660 


BE280337 


Hs.194693 


sohite carrier family 7 (cationic amino 


2.96 


409956 


AW103364 


Hs.727 


inniDln, bete A (acQvin A, acfmn AB a 


2.95 


414493 


AL133921 


Hs.76272 


leiinoblastonia-ljiniSng protein 2 


2.94 


416883 


AW140128 


H5.1849Q2 


ESTs 


2.92 


417675 


AI808607 


Hs.3781 


sintilar to murine leudne-rich repeat pr 


Z92 


418318 


U47732 


H3^072 


transmembrane 4 superfamily member 3 


Z92 


432841 


M9342S 


Hs.62 


protein tyrosine ptiosphatase, non-recept 


2.92 


429640 


U83508 


Hs.2463 


angiopoietin 1 


Z91 


449843 


R85337 


Hs.24030 


solute carrier family 31 (copper transpo 


2.91 


401956 






Target Exon 


ZOO 


416926 


K03109 


Hs.263395 


HT018 prolan 


Z90 


433691 


AA60S012 




ESTs 


Z88 


441892 


AB028981 


Hs.6021 


KIAA10S8 protein 


Z87 


439453 


6E264974 


Hs.6566 


thyroid hormone receptor interactor 13 


Z87 


417165 


R80137 


Hs.30273e 


Homo sapiens cDNA: FIJ21425 (is« ctona C 


Z87 


414291 


AI269619 


H8.13040 


G proteiivcoupled receptor 86 


Z87 


417696 


BE241624 


Hs.82401 


C069 anSgen (p60, eady T-cell activati 


ZB7 


435913 


W95006 


HS.2695S9 


ESTs, Wealdy similar to Soow aipha-io 


4 DC 
4C.00 


422050 


AA302741 


H3^5766 


ESTs. Moderately slntil3rtoJC5238gaIac 


Z8S 


451355 


AA748418 


Ks.164577 


ESTs 


Z85 


442085 


AAg75688 


H3.159955 


ESTs 


Z84 


427704 


AW971063 


Hs.292682 


ESTs 


Z83 


427247 


AW504221 


Ks.174103 


Intogrtn, alpha L (anfigen CD1 1 A (p180). 


Z83 


441965 


AA972712 


Hs.269737 


ESTs 


Z82. 


430268 


AK000737 


Ks.237480 


hypothetical protein FU20730 


Z82 


450056 


6E047394 


Hs.8208 


ESTs. Weakly similar to S71512 hypothec 


Z80 


407245 


X90568 


Hs.172004 


tifin 


Z60 


418941 


AA452970 


Hs.239527 


ElB^SkOa-assodated protein 5 


Z80 


446601 


At312783 


HS.1S5772 


Komo sairiens (hymic stromal co-transport 


Z60 


432195 


AJ2436G9 


H8.8127 


KIAA0144 gene product 


Z8D 
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449088 AI654048 HS.1965S6 ESTs 180 

416511 NM.006762 HS.793S6 lysosond-assodated mUtSspamtlne membr 180 

406648 AA563730 Hs.277477 nuyorhistocompaMy complex, class 179 

412116 AW402166 H5.784 EpsteirvBarr vims induced gene 2 Oymph 178 

D 433793 AW975959 Hs.107513 ESTs, Moderately Giirilar to K1AA1058 pro 178 

4402S5 AI932285 HS.160S89 ESTs 178 

410057 R66634 HS.2681Q7 mtilfimeTfn 177 

417497 AVW02482 Hs.82212 C053mt!g8n 177 

446733 AA863360 Ks^6040 ESTs. Weakly similar to fatty acid omega 176 

lU 431884 AA521246 H&210792 ESTs. Weakly similar to ALU8.HUMAN AUU S 175 

409969 AW514668 Hs.194258 ESTs. Moderately sfariar to A1.U5.HUMAN A 175 

436729 BE621807 (ransmenibrane 4 superfamily member 1 175 

431451 AA761378 Hs.192013 ESTs 174 

_ 413517 N76712 Hs.44a29 ESTs. Weakly similar to l38D22hypothetl 174 

15 447818 W79940 H5.2ig06 Homo sapiens clone 24670 mRNA sequence 174 

418818 AA228899 Hs.101307 Homo s^iiens HUT11 protein mRMA. partial 174 

424673 AA345051 Hs.294092 ESTs. Weakly similar to 136022 hypolheti 174 

443194 AI954968 matrix Qa protein 171 

^. 443804 AL135352 Ks.255883 ESTs. Weakly similar to 138022 hypolheti 171 

20 452870 AW502761 Hs.30909 KIAA0430 gene product 170 

430334 AI824719 Hs.143251 ESTs 170 

437187 Ali)80208 Hs.308325 Homo sapiens mRMA; cDNA DKFZp586C1523 (f 170 

432279 N95104 Hs.274260 ATP-blnding cassette. sub-famRy C (CFTR 170 

413950 AA249096 Hs.32793 ESTs 170 

25 430016 NM.004736 Hs.227656 xenotroplc and polytropk: retrovirus rec 170 

431710 AI735482 ESTs" 170 

448749 AW85967g HsJ21902 Homo sapiens ctone 25237 mRNA sequence 169 

451154 AA015879 Hs.33536 ESTs 169 

424541 AW392551 Hs.180559 ESTs.WeaWy similar to A561 94 Ihromboxa 169 

30 446899 NM.005397 Hs.15426 podocalyjdn^ 168 

418031 AA646744 Hs.269493 ESTs 168 

453902 BE502341 H5.3402 ESTs 168 

405121 mltooervacSvated protein kinase 8 Inter 168 

410163 AF151977 Hs.59260 Nn5 protein 167 

35 429632 AWig5336 Hs.148910 ESTs 167 

437191 NM_0Q6846 Hs.331S55 serine protease Inhibitor. Kazal type, 5 167 

455004 AW850303 gb:IL3-CT0219-191199-030^09CT0219Homo 167 

444933 N^016245 HS.121S0 relHiaishofVchalndehydrogenseAedud 167 

401113 s6tute canter family 22 (organic caSon 166 

40 419462 AF071076 Hs.1 12255 nudeoporin 98kO 168 

407635 AW370213 Hs.295232 ESTs. Moderately similar to A46010X^ 166 

419175 AW270037 KIAA0779 protein 166 

408988 AL119844 Hs.49476 Homo sapiens ctone TUASD^uK^halregi 166 

452721 AJ269529 Hs.301871 solute carrier family 37 (glycerol^ho 166 

45 430592 AJ224864 H$.9688 leukocyte membrane antigen (IRC1) 165 

446830 BE179030 Human ONA sequence fiom done RP5-1174N9 164 

433327 AI674779 Hs.126744 ESTs 154 

424868 AI566170 Hs.96886 ESTs 164 

429854 R55508 Hs.99472 ESTs 183 

50 427080 AW068287 Hs.301175 ras-reIatedC3bohlSnum1oxln substrate 163 

456711 AA033699 Hs.83938 ESTs. Moderately sbnlter to MA82LHUMANM 163 

419777 D60134 Hs.270975 ESTs 163 

414577 AI056548 Hs.72116 hypothetical protdnFU20992sinto to 162 

427596 AA449506 Hs.270143 extracellular glycoprotein EMIUN-2prec 162 

55 452445 A6002438 Hs.29596 Homo saptens mRNA from chromosome 5q21-2 162 

447482 AB0330S9 Ks.18705 K1AA1233 protein 162 

419110 AA234171 Hs.187626 ESTs 162 

450353 AI244661 Hs.103298 ESTs, Weakly sinflar to 865657 al)riia-10 162 

^_ 419828 TB1422 H3.14922 ESTs 162 

OO 427202 6E272922 H5.173936 interteukin 10 receptor, beta 162 

412491 W31589 Hs.739S7 RABSA, member RAS oncogene «amily 161 

436496 AA281959 K3.5210 glla maturation factor, gamma 161 

435053 AW629386 ESTs 161 

435029 AF167706 Hs.19280 cysteine-rteh motor neuron 1 161 

65 425876 C75094 Hs.334514 NG22 protein 160 

412561 NM.002286 Ks.74011 lymphocyte-acthraSongened 160 

430539 AK001489 ADP-ribosytalton factor-fika 1 160 

419825 AI7&4011 Hs.7326 ESTs 159 

_^ 412577 222968 Hs.74076 CD163antigBn 158 

70 425894 AW954011 Hs.180711 ESTs 158 

410883 D43767 H5.66742 Cai7chemokine(TARq(SCYA17) 158 

441028 A1333660 Hs.17558 Homo saptenscONAFU 14446 (is. done HE 158 

413949 AA316077 Hs.75639 Human TBI gene mRN\ 3* end 158 

^- 434943 AI929819 Hs.92909 chromosome 21 openreading frame 50 156 

75 443605 K06865 H5.134131 ESTs 157 

425017 All 19305 Hs.26409 ESTs 157 

440334 BE276112 Hs.7165 zinc finger protein 259 156 

426075 AW513891 Hs^70149 ESTs. Weakly similar to 21 09260ABceII 156 

425345 AU077297 Hs.155894 protein tyrosine phosphatese.nonmpt 156 

oU 407174 T79938 Hs.77062 teukocytebnmunodtobuDn^ receptor. 156 

443834 AI741510 Hs.173548 ESTs 155 

427557 Nhl.0D2659 Hs.179657 ptesminogen activator, urokinase recepto 155 

420539 AA262735 Ks.44004 A0031 protein 155 
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Awu/ozn 


HS.1Q2415 


noiTO sapiens rnKivv; cuNA uKrZpSowwiZi {■ 


C.94 






H$.5944 


1 iiliiln j>iniiiLj fruiilt.- 4 4 * m_ _ 

saQUBCBTneriamty 1 1 (prototKcoupied 


9 fi>l 

C.M 


422994 


AWB91802 


Hs.298276 


ESTs 


2.54 


XI 400*) 

4n992 


AW81o214 


Hs.143055 


ESTs 


2.54 


451160 


K61899 


Hs.171937 


steroid dehydrogenaso-Gks 


2.54 


415775 


H00747 


Hs.29792 


ESTs, Weaidy slnrilar to 138022 hypoM 


153 


429752 


H5234B 


H5.38636 


ESTs 


2.53 


414612 


8E274552 


Hs.76578 


proldn inhibitor of scQvsted STAT3 


2.53 


453329 


T97205 


H$.1 93400 


ESTs, Weakly simiiar to 2109260A B ceQ 


2.53 


436503 


AJ277750 


Hs.183924 


ubiqutlin associated and SK3 domain cont 


Z52 


445911 


AI985987 


Hs.145645 


ESTs. Moderately similar to ALULKUMAN A 


2.52 


433332 


AI3S7347 


Hs,44898 


Homo sapiens done TCCCTA00151 mRNA sequ 


2.52 


435943 


f^194 


Hs^1141 


Homo sapiens mRNA lor KIAA1 568 protein. 


2.52 


452253 


AA928891 


Hs^8608 


Homo sapiens cDNA: FIJ221 15 fis, done H 


2.52 


442506 


BE566411 


Hs.41726 


ESTs 


Z52 


419972 


AL041465 


Hs.182982 


0olgiiv67 


2.52 


431074 


BE072772 


Hs.8997 


ESTs, Moderaely similar to A46010 X*un 


2.52 


449129 


Alo31o02 


Hs.258949 


ESTs 


2.52 


440524 


R71264 


Ks.16798 


ESTs 


2.51 


419203 


AA488719 


Mfi 1%1S1 


ESTs 


Z51 


404370 






Target Exon 


ZS1 


432828 


AB042326 


Hs.287402 


dwndroitln 4«utfotransfsrasa 


251 


439219 


N33883 


Ks.41322 


ESTs 


2.51 


426044 


AA093322 


Ks.301404 


RNA tbiding mofir protein 3 


2.50 


433681 


AI004^7 


Hs.200360 


Homo sapiens cONA FLJ13Q27 fis, done NT 


2.50 


437644 


AA748575 


Hs.136748 


lectiTvlDQa NKceO laceptor 


2.50 


442566 


R37337 


Hs.12111 


ESTs 


ISO 


409317 


U20165 


H5^2S0 


1)006 morphogenaOc protein rooeplor, tjip 


ZSO 


450506 


NM.004460 


H5.41B 


filiroblast acfivation protein, alpha 


2.50 


447484 


AA464839 


Hs,292566 


hypotheficdl protein FU14697 


ZSO 


415165 


AW887604 


H5,78065 


complemenl component 7 


Z50 


435284 


AA879470 


Hs.96649 


Homo sapiens cONA RJ1 1492 lis. done HE 


ZSO 



TABLE 38B: 

Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Acoesdoni Genbanlc accession numbeis 



Pkey 



CAT Number Access 



412654 1350J BG743181 At830050BE695688 AA1265gi Ai903503 R26045 N62894 N63950AA131619 A)681480 N79626 AA461603 R78g79 AW606865 N66622 

fiF448838 AA779000 AA460314 AI09Z721 AI870182 AI436284 A14g4151 AI127704 AI127702 BE349350 AI093460 AA11S264 AA131567 R26840 
R78885 

433691 2203511.1 AI223854AI129852AA605012 

436729 6624J X75684 AL573167 AI445461 AI453743AI983&55 AI564644 AA977180 AI694111 AI591358AW071625 AI678712AI720939 AI9277696E439798 

At963432 AA292956 AW192S93 AI865838 AI696905 AI424384 A)16131 2 AI91 1921 AI597801 81494959 AI240988 AI492554 AW262737 BE044033 
AW008570 AW629505BI494958 AA088439 AA7Q8057 BF222820 BF593608 BE501957 AAS24S28 BE044134 AW572531 AW015724 68349166 
AA043217 BE219784 AI799814 AA129575 AI671727 AI470033 BE646195 AVm9725 AA903050 AA147228 AA404570 Al^^^^ 
AW673152AA723200C06123BF057147AAfiZ7686AA157944AI990245AA662517 T32487AI800106A133317^ 
AW275048 AA182640 AM76328 AI296935 AW0851 58 AW471421 AW103470 AW300456 AW191997 AI823466 AAg62397 AA1 36658 AI251 817 
AW339104 AA724739 AA411 100 AA191349 AA757735 AA037696 Ar769516 AW77Z283 AA010631 AI692846 A1061065 H80983 R79933 AI950693 
AI245632 AI349390 AA14a284 AI79BS02 AA487693 AI621 320 AW194272 C06365 AA953883 BE858936 AI918S23 AI872628 A1927217 AI453453 
AI189366 AW338878 A1261359 A1500576 8F477735 A1032S69 A1972899 AI985583 Z28771 AI383829 AI693030 AAG03586 6E773488 AW339301 
BE7734a9BE773462BE773495AI6S03386E773499AI745717BE811475BE8114708E811464BE611418BE811415BE811400B 
BE81 1388 BE8113S2 BE773S01 6E773494 BE773486 BE773474 BE773473 BE773470 BE773461 BE81 1350 BE81 1337 BF593847 BG055D71 
AWB75302 BF003068 AA719173 8E811348 AI562462 AI686240 BE773500 AI244845 AI565439 AtgiB453 AI472527 AI446740 AA035576 AA191414 
AW674145 C05782 AI589264 D57558 AI468237 AI432033 AA989662 R21752 BF002457 AA988297 AL574095 AL576200 AL571074 AL574525 
AL578810 BG498381 AI926364 BE879732 AA479634 AA479712 C17732 BM091258 BFB43901 AW820230 C1747S BE327120 AA129574 
AA136645 6F843S00 AW806193 AA502832 AA649494 Al^68520 AL547960 BE706937 BE811360 BE773498 BE811401 8E773484 BE811437 
BE811380 BE811399 BF997171 6F757734 BE926037AI377596 008111 AVV088g88 BE811404 BE61 1472 AI865» 
BE905927 BE81 1435 AA1 91387 AW772000 BE811453 BE614379 BF844522 B1044698 AI744233 AW984527 C17504 BF843883 AI248307 
BE773483 AI567995 W60075 BF941 183 Ar738844 BE81 1458 BE773481 AI262930 AA948565 BE708942 6E1 56360 T65026 AW242958 AW1 97954 
8E905184 AA722206 Ai344943 A)348877 AI334860 BE621657 BE1 56280 AA454099AA037722 8F843897 AW805183 AA043218 66482896 
AA182734 AA877242 AW372926 H272S2 R38114 BF851858 BE156214 AA190427T91762 AA035067 AA837326 T10930 BF906587 B175S0Z7 
66506731 BCOQ8442BC010166AL550134ALS53098AL548700AL550751 AI^7978AL545286 A1540643AU1 18627 AL601379BI259821 
B6741786 61888522 AU1 35866 61552770 Bi2S9210 61256520 61255569 66485098 BI258228 66498501 6M044512 AU133984 AL556586 
BE745111 61222633 AU133917 66288151 61260715 61550550 66500773 61551761 66707601 BI61 8593 6F69 1383 66721 129 8654 1576 
6E908666 66751098 BI224135 B6400746 BG478065 6E790436 AW080238 AU137549 66429896 BE392466 AW961686 B6721056 6E908365 
BE546656 66541235 AW583735 66528290 6)260895 AW651691 6M048974 6M043805 66142165 AA31 5188 AI446615 C06300 86497644 
AA088544 AI815987 66528631 6E619182AW239185 AW062910AW062902 AA347236 F11933AA488005 AA301631 AA376800 056120 
AA343532 AA308638 F00242 AA376086 AA31 6968 AA343799 &I870221 BE910282 BG538746 AW960564 AV732879 016654 AA1 92519 BF92214a 
AA216013 66624091 BE544387 86507008 AW176446 eF790033 6E088925 6E088854 AA921353 R2ie00 AA01 1222 T9752S 

443194 19335.1 61754027 BF696071 AI351939 66151298 AI91 9334 AI401520 61770165 W72057 T95158 T29478 AA181252 BG927793 AA714431 AA600749 

AA181247AA6147S6AA081092 H52207 66926934 BF222579 66899001 N64245AAg53040AI832406AA102441 66926081 AAS93445 AA916041 
AA987847 AAS83329AA737219 AA916443 AW128994 A1492560 AI761847 6C005272 NM.000900 X53331 M58549 B1758966 AL598829 B1754530 
BG699n0 6E439699 BE440148 AV704365 AV733652 66212015 66184149 66200180 66212690 BI761222 B6182079 AW338822 AI925631 
A1423041 AW071161 AI889835 AW1 29112 86925339 A)017633AA566964 BF725590 AI00421OAI8O97S9 BE0a3097 BG898220 AW997681 
8F668788 6E083134AW631281 8(7193052 66183095 BE440088 661 85728 A1499579AA188162AA864282BI493352AA155654AA836749 
AA836844 AA98547e AW082299 A1816747 AA450221 AA971 294 BE327509 Af719662 BGS76669 AI479382 6F824747 AI741800 BG982952 
AI088473 AA916151 AW473324 B6901 177 6E439998 AW023269 8E813871 AW999947 BE839108 AV707983 AA369722 AW796627 AW890808 
A1341771 AA302459 61493353 AA3G6332AA371104AA367277AL547972BG928011 AI678903At699886AI956165AA484893AA6439S3 
AW591063 66203275 6621 1093 AI334791 AA916589AW05B2fi8A1362370 Al 143352 AAS08721 AI928079 067214 BE045265 AA5417S5 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



1611592,1 
1089114 1 
LI 



431710 
45S0D4 
419175 



446830 41421.1 



435053 
430539 



124009.1 
31268.1 



BG219S10 B6201686 BG195572 AW019904 AW089242 AA953322 AI686698 F27562 AA614749 056645 F20774 F30660 F2564B AWlB3Sd? 
AA827300AA582214Al701289AA228293AI906950AA230156AA384572AW43898aM^^^ 
A\mM1588E439643AA910666AA155913AA923097AA975721AA985555BG927032AA9S3^^ 
AVV083733AA128053AI953789AI911993AM2179BBG429150AI915306 23013^ 

BG924321 AI039722AI954968AI372839AI401406AI538215 '«''vii«w«if«;«wuA««igijAA5535»AA916094 
AM22419AA514370 AI741678 AI735482 Ar735081 AI371436 
AW850587 AW8S0569 AW850316 AW850303 

BE008630AVV270037AA234765A1334004BF057179AI857450A!3411^^ 

2S^']il^'>Si^Z^^^^^^ N55645AA319869 R369im^AW^ 

?215???^fJ!!J?l BE17903O AW294203 BF849776 AA459064 AI917452 AW403072 W27419 BF914568 BF798468 AW370558 

A)332638AA663215AW629388 

^y^^S^^^^^^^^'®^^^°^3AI856679AI686163AA662158AA911580 031095 A^^ 

AW169600 D31161 AA905362 C21179 BE327258 03 W4 AW439053 031309 BF756901 K8^fl^S>Sl49562 ffiSl4^^^ 
AA4S2D28AW473972AW468490AA410271A1475944BF821859AA658188^03W^^^ 
BF375775AVV821784AW75085W16475 D31031BG698392AVV880676AVS^^^ 



TABLE 38C: 



Pkey: Unique number corresponding to an Eos pcobeset 

R8f: Sequence source. The 7 digit numbers In lhlsa)lumn are Genbankldenl^ 

human chromosome 22" Dunham, et al. (1999) Nature 4Q2:48»495. 

Strand: Indicates Dl^ strand tawhtehexorts were predicted. 

NLposiUon: Indicates nudeoQde positions of predicted exons. 

Pitey 



406617 
401958 
405121 
401113 
404370 



numbers. 'Dunham, et al.' refers to the publicaGon entitled 'Tlie DMA sequence of 



Ref 


Strand 


Ntj)osition 


8439858 


Plus 


36430-36552 


3256613 


Rus 


108411-106629 


8102330 


Minus 


35816.36004,38587-36684 


9966541 


Minus 


19419-19959 


7631003 


Plus 


127868-128244 



TABI£ 40A: 656 genes upregulaled In fibrosb relative to nonnal bo^ 

anilbodte. Thesegeneswere8elecledlh)m59680p(obesetsontheEos/A^metfhHuO^ Gew expression data for eaclurobSdoW^^ 

«^ as average intensjKtAi^^ 

Pkey: Unique Eos probeset identifisr number 

txAccn: Exemplar accession number, GenBank accession number 

UniGenelD: UniGene number 

Pred.Prot Domains: ^n predided protein domains. Abbreviattons used: TM. transmembrane domain; SS. signd sequence: »Y. veiy likelif to contain: =M. 

UniGeneTllte SSgX^Sb^ 

Pkw ExAccn; UnlgenelO! UnJgeneTItte; Pred.PtolDomdns; R1 

442275; AW449467; Hs.54795: Homo sapiens secretoglbbin, famfly 3A, m; UlefOglDbin;TM=4yi;SS°Y: 39.47 
428434: AW363590: Hs.65551; Homo sapiens. Similar to DMA segment. Ch; LBPJPLCETP GTliWytSS^Y: 32.35 
4^5;AA742697;Hs.62492:NM.052863:Homo8aplens5ecSlobln,fa:^ ««-m.!)»=t. 
406964: M21305: ; FGENES predicted novel secreted protein; none^one; 27.90 
425211; M18667; Hs.1867; progastricsln (pepsinogen C); asp;TM=M:SS=M; 27.90 

SIS?: M^Lf'^lS^'^*'? Pfoduckped; homaoboxA:yttransferase.noWGF.anlvAc^ 27.23 

445921;AB012113;Hs,16530;smailWuctT}lecytoWnesubfamlIyA(Cy;n^:24.97 ««».«^ 

428330: 122524; Hs.2256; matrix metailoproteinase 7 (mabilytin, ; Peptidase JfllO;; 24.38 

431723: AW058350; Hs.278966; Homo sapiens mRNA; cDNA DKFZp564B2052 (f; PMP2?.a3Udln.non8C 23.35 

409153: W03754; Hs.50813; hypotheticat protein FIJ20Q22; fibiinoga>.C:: 23.29 ^'^^^""'"^ 

431W9; flE041395: Hs.374629; ESTs.WeaJd^s&nHar to unknown pratebi ; none.none: 1^23 

425371; 049441; Hs.155981: mesolheTn; none:TM=M;SS4* 1&17 

448133; AA723157; Hs.73769; folate receptor 1 (adult); Fblate.recMIP;TM=M:SS=M: 17.64 

4Sl!SlilS^.acI^^!^ Ribosomal.L20.N^PLcotrans:TM=Y;SS=N: 17^ 

421798; N74880; H5.355462: N-acytsphinoosine amidohydrdase (ackJ c; SAPA.SurfactanL8.none: l&ai 

S ^ (cWtolrtoskteseir^jidro.Wl?^^ 

419092; J05581; Hs.89603: mudn 1, transmembrane; 8EA:Tiyi»Y:SSMyi; 16.08 

426174: AA547959; Hs,1 15838; Homo sajnens similar to EcNnoidin (LOCI; nQnd.none; 15.84 

IS A ^Tiv^^u WstoHxnpafiW^comptoc. dase ; i8.MHCJJIpha:TM=M:SS=M: 15.42 

421110; AJ250717; Hs.1355: calhepsin E; aspn 15.08 

444342; NM«014398;Hs.l0887:8lmaartolysosome«sacbtedrnembr8ne;|jnvTM=Y:S^^ 14.94 
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408621; XSTaOS^ H1181 125; bnmunoglobuOn tantbda locus; ie.HSP70,Ppx<;ppA:TM4tSS=N; 
443709: A1082692: Hs.134662; ESTr. SNFMmne; 14.05 

428970; BE276891: H5,194691: refinote odd Induced 3 (RAI61): metabo; 7lm_3;TM=Y;SS=M; 1188 
457200; U33749; Hs.197754; (hyidd transcription factor 1; hom80bo}cTM=M;SS=N; 1186 
D 432519; AI22131 1; Hs.13O704; ESTs. WeaMy similar to BCHUiA S-100 pro; none.non8; 13.82 
422355; AW403724; H1300897; ooaguiaQon factor Vll (serum prothrontbl; none,lg; 1 162 
430280: AA3812S8; Hs.237868; Inteileuldn 7 recepion fnlnone; 1147 

415457; AW081710; Hs.7369; Homo sapiens testes specffic A2 twmolog ; MORN^98r.tnTM=Y;SS=M: 1135 
431164:AA493550; Hs.94367:t}iynjld transcription factor 1:none.homeolK»; 13.32 

10 414998; NM.002543;Hs,77729; oxidised low density DpoprolBln(tecen;lW^ 1Z83 
400269; ; Hs.253495: Eos Contro!; lecOn.c.Co089en;a!nk;: 12.30 
424310; AA336648; Hs.S0334: testes developmenV-reiated NY&^P22: none;TM=M;SS=N; 11.81 
451558; NM.001089; H5.26630: ATP-binding cassette, sul>-fam!Iy A (ABC1; ABC.tran.SRP54;Tli^Y;SS-M: 11.79 

^ ^ 452304; AA025386;H$.61311; ESTs, Weakly sbnBar to S10530 cysteine ;none.Ron6;n^^ 

1 5 445537; AJ245671; Hs.12S44: EGF4ke-doma!n. muIUple 6: EGF.MAM;; 11.56 

423778; Y09267; Hs.132621; flavin containing monooxygenase Z FMOIike,pyr_redox:TM=Y;SSs|yk 11.41 

414812; X72755; Hs.77367; monoklno Induced l>y gamma Intefferon; IL8;TM=M;SS5Y; 11.31 

430832; At073913; Hs.100686; ESTs. WeaMy simSar to JE0350 Anterior ; none.none; 11.25 

407910; AA650274: Hs.41296; fitKonecGn leucine rich transmembrane p; fhlUVlLRRCT.LRRNTfrM»Y;S8»=M: 11.15 

20 451497; H83294: Hs.284122; Wnt intiibitory tactor-1; EGF.WIF;; 11.07 

430250; NM_016929; H5.283021; ctitoride intracellular channel 5; nona;TM=M;SS^N; 1 1.07 

41 1020; NM_006770: Hs.67726: macrophage receptor with coQagenous str; SRCR,COliagen;TM»Y;SS»M; 11.05 

446619: AU078643; H5.313: secreted phosphoprotein 1 (osteopontln. ; Osteoponim;: 11.01 

438091 ; AW373062; Hs.351546: nuclear receptor sutifamlly 1, group I, m; homrKme.fec,zf-C4,fmne: 10.97 

25 413048; M93221; Hs.75182; mannose receptor, C type 1; fri2.lectin.cRlcin.BJectln,Xlink;TNNY;SS4ll; 10.93 
432231: AA339977; Hs.274127; aST 11 240 protein; none;TM=M;SS=M; 10.81 
416402; NM.000715: Hs.1012; complement component 4-bindtng protein. ; sushl;TMNM:SS»M; 10.77 
418156; W170S6; Hs J3623; nuclear receptor sulifemily 1, group I. m; hQrnione.jeCiZMi4,none; 10.63 
438553; AW407157; KS.18112S: immunoglotwKnlan^a locus; ig,HSP7aPpx-GppA:TM=M;^ 1056 

30 421071; AI311238; Hs.104476; ESTs. Weakly similar to CGHU1E collagen ; none:TM»Y;SS^ 1057 

418007; M13509: Ks.83169: matrix metaOoproleinase 1 (interstitia): hemopexin.Peptidase_M10.Astecln.PG_b(nding_1;: 10.33 
419086; NMJ)00216; H5.89591; Kallmann syndrome 1 sequence; fhd.wap;; 10.30 
407786; AA687S38; H5.3897% telraspan 1; transmembraRe4;TllA»Y:SS«M; 1028 

. _ 441384; AA447849; Ks.288680; reSnoic add Induced 3; 7tnLlnone; 10.26 

35 453914; NM_000507; Hs,574; fructose-I.WMsphosphatase 1; FBPase;TM=M:SS=N; 10.22 

435523; T62849: Hs.11090; meml)rane^pannlng 4^Jomalns, subfamily A; none;TW=Y;SS=M; 10.09 
423354; AB01 1130; H5.127436; calcium channel, vdlage^ependent alph; vwaCache:TM=M;SS=N; 10.03 
415323; BE269352; Hs.949; neutrophil cytosotic factor 2 (65kO. chr; SH3,TPR;TM=M;SS=N; 10.02 
408562; AI436323: Hs.31141; roundabout (axon guidance receptor. Dros; lg,fn3;TM::M;SS=N: 10.02 

40 448782; AL050295; Hs.362806; KJAA0758 protein: 7tm_2,ig.GPS,SEA;TM=Y:SS=N; 9.88 
419235: AW470411: Hs.288433; neurotrimin; none.nono; 9.79 
415992; C05837; Hs.145807: hypothetic^ protein FU13593; none:TI\*=Y;S&=M; 9.74 
418883; BE387036; Hs.1211; add phosphatase 5. tartrate resistant; Metallophos;TM=M;SS^; 9.70 
439018; AW300887; Hs.26638; membran&^pannlng 4-domaIns. subfamPy A; none;TM=Y;SS=M; 9.69 

45 442652; AI005163; Hs.201378; Homo sapiens cDNA FU40427 fis; none:TM=M;SSsN; 9.68 

446291; BE397753; Hs.14623: inlerferon. gamma^ndudWe protein 30; aLT;TM5M;SS=Y; 9.64 
408380; AF123050: Hs.44532; diubiquilin; ubiquitln:TM=M;SS=N; 9.54 
436089; W05391: Hs.351546: nudear receptor subfamily 1, group I, m; hormonejec.zPC4.none; 9.52 
. _ 449494; AW237014: Hs.315369; aquaporin 4; MlP.none; 9,61 

50 456062; AI866286: Hs.7ig62; ESTs, Weakly similar to B36298 prolines, none.noneE 042 

446428: AW08227O: Hs.12496: ESTs. Weakly similar to ALU4l.HUMAN ALU S; none.none: 041 
421852; AASOOgOO; Hs.98849; dyn^n light chain 23 (DNLC2B): none.none; 019 

407949: W21874: Hs.247057; ESTs, Weakly similar to 210926QA B ceil ; RIbosomal_S14.ank4)klnas6,d8alh.none; 016 
_ _ 456034; AW450979: ; gbiUI-H-Bll^a-l 2^1.81 NCI_CGAP.Su; none.none: 015 
55 407768; BE514982: Hs.38991; SlOOc^nt-fainding protein A2; e&tand,SJ0O,SJ0O.eihand; 015 

416965; N26223; Hs.160438: MIMC1; none;NA;NA: 003 

443324; R44013: HS.16422S; ESTs; none.none; 9.03 

435575; AF213457; Hs.44234; triggering receptor expressed on myeidd; i9;TliA»Y:SS=M: 9.00 
^ . 440273; AI805392; Ks.325335; Homo sapiens cONA: FU23523 fis. done L; none.none: 8.99 
60 424527; AW138SS8: Hs.334873: ESTs, WeaMy sbnOar to 154374 gene NF2 ; Ziucart>Opept.none: 180 
409203; AA780473: Hs.687; cytochrome P450. subfamily IVB. pdypept; p450;TMsM;SS»Y: 6.76 
423387; AJ012074; H5.346500: vasoacUve IntesUnal peptide receptor 1; 7tm.2.HRM.CSI>.T«^Y;SS»M; 074 
443907; AU076484; H5.9963; TYRO protein tyrosine kinase binding pro; none;TM=A^SS»Y: 8.73 
^ _ 425367; BE271 188; Hs.155975; protein tyrosine phosphatase, receptor t ROne;TMsM;SS»Y: 8.68 
05 418918; X07871; Hs.8947G: CD2 antigen (p50}. sheep red Uood cell ; io:ThNY;SS»M; 056 
421563; NM.005433; Hs.105806; granulysb; none;; 055 

450726; AW204600: H5.355462; HUMPSP6A Human puknonaiy surfadant-asso; SAPA,8urfBdanLB.none: 051 
419893: AA133749; Hs.301350; FXYDdomalnKXjntining ton transport leg; ATP1G1J>IM-MAT8;TW=Y;SS=M; 061 

- . 424450; AL1 37526: Hs.374425; dynein intermedlato chain 2; WD40:; 8.42 

70 402474; ; ; NM-004079:Homo sapiens cathepsin S (asS; PeptMase.CI;; 8.41 

458079; AI796870; Hs.54277; Homo sapiens simiar to RIKEN cDfM 28100; none;TM=M;SS=N; O40 

424779: A1J045851: Hs.153053; C037 antigen; transmenibrane4:TMaY:SS^ 036 

453310; X70697; Hs.553: sdute canter family 6 (neurotransmitte; SNF.5HT_transporter:TM»Y;SS=N; 034 

. - 448140; AF14&761; Hs.204S0; BCM-Gke membrane protein precursor; ig;TI^Y:SS^N: 033 

75 404240; ; ; NM-018950:Homo sapiens ms^ histocompat; lg,MHC_l:TM=Y:SS=M: 028 
459702; AI204995: : gb3n03c03jc1 Strata^ene schizo brain SI; none,none: 017 
449523; NM.000579: Hs.54443; chemoHne (&C moliO receptor 5; 7tnu1:TM=Y:SS=M: 017 
442994; AI028718: Hs.16954: ESTs; 8nk.pkinase^ealh.RibosonBt_S14: 012 

o « 446998; N99013; Hs.278966; Homo sapiens mRNA; cDNA DKFZp564B2052 (f; PMP22.aaudin.none; O07 

oO 420137; AA306478; Hs.95327; C030 antigen, delta pdypepOde (TIT3 co; iTAM;TM=Y:SS=M: O01 
435472; AW972330; Ks.283022; triggering receptor expressed on myetoid; ig;TM=M;SSsM; 7.99 
432441; AW292425; Hs.163484: Intron of hepatocyto nudear factor-3 al; ForiLhead.nons: 7.99 
409208; Y00093: Hs.172631; Integrin. alpha X (antigen C011C (pISQ).: vwaJ^6X3AP.IntegritiJK,vwaJiitegrin_AF&GAP; 7.94 
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432606: NM.002104: Hs.3068: graoynn K (serine protease, granzyme 3;; trypsin:TM^Y;SS»M; 7.92 
442B32: AW208560: Hs.253569: ESTs; none^e; 7.90 

412104; AW20S197; H$.240951: Homo safuens. Similar to RIKEN cONA 2210; none;TM=M;S&=N: 7.89 

427337; Z46223; Hs. 176663; Fc fraginanl of IgG, iow afSniiy liib. n ig:TM=Y;SS=M; 7.66 

443951; F13272; Hs.356835; ferrilin, OgM polypeptide: PMP22.Claudin.none; 7^ 

41829% AA279S3Q; Hs.83968: Integrin. beta 2 (anfioen C018 (p95). ly; integrin.B.EGF;pSI;TMsY;SS>^ 7.79 

447131: NM.004S85: Hs.17466; reSnoic add leceplor responder (tazare; ftone;TM»Y;SS=N: 7.78 

423961; 013666; H8.136348; peilcslki (OSF.2o8); F8SClcOn:TM=M:SS=M: 7.73 

424917; AI636208; Hs.9e901; hypothetlca) protein FU23049: non8:TM=M;SS=N; 7.72 

436584; AA381SS3; Hs.1S8253: histocompal'bifity complex, dass ; iQ,MHCJI.alpha.none: 7.65 

456672; AK002016; Hs.114n7; Homo sapiens, done MGC:16327. mRNA. com; none.PK.PK.C.niyosin_head,RlioGAP; 7.64 

427792; M63928; Hs.180841; tumor necrosis rador receptor supertami; SRP14,TNFK.c6;; 7.63 

436954; AA740151; Hs.130425; ESTs; none^nona; 7.58 

429732; U201S8; Hs.2488; lymphocyte cytosollc protein 2 (SH2 doma; SH2;; 7.56 

407601; AC002300; Hs^7129; sodium channel, nonvollag&^ated 1, beta; ASC;T1^Y;$S=M; 7,55 

417105; X60992; Hs.81226: C06 antigen; SRCR;TM=Y;SS=fyt 7.51 

414821 ; M53835; Hs.77424; Fc fragmenl of IgG, high affinity la. re; ig;Tl^Y;SS=M; 7.48 

444381; BE387335; Hs.283713; hypothetical protein BC014245: Cdl^en:TM=M;SS»M; 7.40 

432222: AI204995; ; gb:an03c03j(1 Stratagene schlzo brain SI; none^none; 7.38 

422667; H25642; Hs.132821; ESTs; FMCWii«.FIWCme; 7.37 

444527; NM-005408; Hs.11383; sm^ indudUe cytokine subfamily A (Cy; ILB;; 7.38 

457411; AW085961; Hs.130093; Iroquois-ciass homeobox protein IRX2: none.none: 7.32 

439237; AW408158; H5.318893; ESTs, Weaidy similar to A47582 BH^eO gr. Furin4ke.pkin8se.Reoep.Ujdomain,YlP,none: 7.32 

419231; AL046294; Hs.135245; ESTs. Weakly similar to T17227 hypotheli; none,nonB; 7.30 

438873; AI302471; Hs.124292: Homo sapiens cONA: FU23123 lis. done L; non8,none; 7,27 

424027; AW337575; Hs.201591: ESTs; 7tnL2.HRM.none: 7.26 

428927; AA441837; H3.90250: Homo sapiens hypdhetical protein FU231; none.none: 7.24 

432435; BE218686; Hs.282070; ESTs; Rone^none: 7.22 

428467; AK002121; Hs.184465: hypothetica) protein FU112S9: none;TM=>Y:SSsM; 7.21 

416030; H15261; Hs.21948; ESTs: nono,none; 7.20 

433293; AF007835; Hs.32417; hypothetical protein MGG2742; none;TM=M;SS=N; 7.18 

416741; H&3265; Hs.8681; ESTs. Weakly similar to S41 044 chromosom; pkinase>MruBcp.pkInaseActiviajecp; 7.16 

420656; AA279098; Hs.187636; ESTs: none.non8:7.14 

427698; AW972S94; Hs.335499; ESTs; none,none: 7.1 1 

432268; BE311856; H5.274230; 3'-phosphoadsnoslne ff-phosphosuifato sy; APSJiin3seATP-sulfurytase(TM=M;SS»N; 7.06 

413859; AW992356; Hs.8364; Homo sapiens pynwate dehydrogenase Wna; SAW_PNT,none; 7.04 

430413: AW842182; Hs,24139?. smaD Inducible cytokine A5 (RANTES); 0;TM=M;SS=Y: 7.04 

45236% A1582743; Hs.94953; Homo saf^ens, Similar to complement comp; dqtOdtagenr. 7.03 

421461; AW39ig72: Hs.104696: K!AA1324 protein; none;TM»M:SS»M: 7.01 

418945; 6E246762; Hs.89499; arachidonate 5-ltpoxygenase; lipoxygenasePLAT;TM4^N; 6.97 

452281; T93S0O; Hs.2879?. Homo sapiens cONA FU11041 lis. done PU TGFb.propeplide,TGF4)eta,nQne: 6.96 

458124; AW005548; Hs.124590: ESTs; none^none; a94 

422846; 6E513934; Hs.1S83; neutrophB cytosoiic factor 1 (47kD, dir; SH3.PX;TM=M;SS^: &93 

41 1027; AF072099; Hs.67B46; leukocyte ImmunoglobiMke receptor. ; lnosItoLPJg:TM4«SS»N: 6.92 

428820: AA435187; Hs.172631: IntEgrin, alpha M (complemenl component ; vwa.lntegriaJV.FG-GAP;TM=Y;SS*M: 6.90 

423575; C18863; Hs.163443; Intron of periosfin (OSF.2os): Fasdc!in,none; 6.89 

419490; NM.006144: Hs.90708; granzyme A (granzyme 1, cytotoxic T-lymp; lrypsln:TM=M;SS=M: 6.89 

450954; AI904740; Hs.25691 ; receptor (cateitonln) activily modifying; none;TM»Y;SS=M: 6.87 

425976; C75094; Hs.334514; NG22 protein; voltage^CLC:TM=Y:SS=M; 6.84 

425555: AA359291: Hs.130767; Homo sapiens d)NA: FU235S3 fis. done U LRR;TM^M;SS»N: 6.81 

414991; C17898; : Homo sapiens qwegulated by BCG-GWS (LO; Zip.none; 6.80 

410342; R31350; Hs.743; Fc fr^menl of IgE. high affinity I, roc; ITAM;Tlk/NY;SS°M; 6.80 

422163; AF027208; Hs.1 12360; prominin (mouseHlke 1; none;TM=Y;SS=M; 6.79 

445885; AI73400g; Hs.127699; KtAA1603 proldn; none.none; 6.77 

436576; AI458213: Hs.77542: ESTs; 7liiu1,DnaJ: 6.77 

417079; U65590: H5.81134; fntefleuWn 1 receptor antagonist ILI;; 6.76 

42471 1; NM-005795: Hs.152175; catellonin reoeptor-like; 7tnu2,HRM:TM=Y;SS=M; ^75 

416847; L43821: Hs.80261: enhancer of (ilamentalion 1 (cas-lSce do; SH3;Tl^M;SS^; a73 

426251: M24283; Hs.168383: InteroeQular adhesion mdecule 1 (CD54); ig.lCAKLN:TM=M;SS=M: 6.71 

417929; R27219: Hs.74647; Human T-ceO receptor adive alphachain; ig.Bbhydrolase; 6.70 

412584; X54870; Hs.74085: DMA segment on dvomosome 12 (unique) 24; none.ledIn.c: 8.70 

428227; AA321649; Hs,2248: small indudbia cytokine subfamily B (Cy; IL8;Tl**t;SS=Y: 6.68 

421445: AA913059; Hs.104433; Homo sapiens, done IMAGE:4054868. mRNA; totLtr8ns.K.letra3sp; a65 

439750; Al35g053: Hs.57664: Homo sapiens mRNA y I tongth Insert cON; il^0H.ailW^H.N.CBS.integi1n_B.RidiLBJed]n: 6.62 

428582; BE336699; Hs.185055! BENE protein; none:TM=Y;SS=M; 6.60 

453142: AA033648: Hs.7473; Homo sapiens gap junction protein, alpha; connex{n;TM3Y.-SSsM; 6.60 

432374; W58815: Hs.301885; Homo sapiens cONA FU1 1346 fis, ctone PL; none,none: a56 

448569: BE382657: Hs,21485; signal transducer and adivator of trans; SH2.STAT,STAT_bind,STAT _prot:TM=M;SS»N: 6.54 
424321; W74048: H5.1765; lymphocytfr^pedfic protein tyrosine kin; SH2.SH3,pkInase;Tiyi=M;SS=N: 6.51 
446932; AA961459; HS.12S644; ESTs; none,lAR.UVlNT: 6.50 

427247: AW5W221: Hs.174103; Integrin. alpha L(antigen C011A (p180),; vwa,htegrinAF&<5AP;TM=Y:SS=M; 6.48 

425998; AU076629; Hs.165g50:fibroblastgrowthfactor receptor 4; ig.pkinase:TM=M;SS^M: 6.47 

447232; AW499834: Hs.327; Inlerieukin 10 receptor, alpha; none;TM=M;SS=M: 6.46 

431745; AW972448; Hs.163425; Novel FGENESH predk:tedcadherin repeal ; none.none: 6.43 

417370; T28651 : Hs.374466; tryptophanyMRNA synthetase; WHEP-TRS,tRNMynLlb;; 6.41 

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor t; kinesin,fn3.Y_phosphatase;TM=M;SS=N; 6.40 

429610; A6024937: Hs.21 1092; LUNX protein; PLUNC (palate lung and nas; none;: 6 J9 

409340; BE174629; Hs.321 130: hypotheOcal protein MGC2771; 

aaj)emieases4}yrUoxaLdeC;bromodomaIn.PHD.MBDATJu)oKDOT.Pt3J>l^ 
413385; A434455: Hs.840: tnddeamtnoijyrrde 2.3 dtoxygenase; 1DQ;TM»M:SS^: 6^ 
451820; AW058357; Hs.199248; ESTa; 7tnLl:TM=Y:SS=M; ^34 
408369: R38438: H5.118747: SLC1SA2 Sdute canter famQy 15 iJMm PTR2:TM»Y;S8"N: 6.32 
424247; X14008: H3.234734; ^ysozyme (rend amylddosis): iys jg.FAD_SynthJdhJdh.(;pUnass:: 6.32 

514 



wo 03/042661 



PCTAJS02/36810 



444090; S691 15; Hs.10306; natural kflter C6Q group 7 sequence; PMP2?„aaudJn;TM=Y;SS=M; ^31 

416819; Un735: Hs,80205; plnv2 oncogene; pWnase;; 6.30 

421659; NM.014459; H5.106511; protocadherin 17; cadherin;TM=M:SS=W 6^ 

415198; AW009480: Hs.943; natural kHIer eel) Irensoipl 4; none;TM=M;SS=N: 6v26 

424273: W40460: Hs.14444i phosphol^ A2, group X; phoslip;TM=M;SS=Y: 6.24 

429083; Y09397; H5.227817; BCL24elaled prot^ Al; Bd-2;TM=M;SS=N; 6.23 

452194; AI694413; Hs.373599; olfactofy receptor. tamBy 2. suMamily ; non8,non8: &22 

424144; AA454033; H5.41644; AKAP-assodated sperm protebt; Rlla;; 6,21 

414142; AW368397; ^te.334485; heniicenlin (fibulin 6); EGF4g,t^J.hornion84^uash.TlUAdenoJE3 CR1:TM=M;SS=I\* a21 

442006; AW975183; Hs,372210; ESTs. WeaWy sWIar to S72482 hypolheli; none^none; 6.20 

420256; U84722; lte.76206; cadherin 5, type 2. V&cadherin (vescula: cadlwln.Cadherin.C.tenn:TII^Y;SS=H ai9 

421379; Y15221 ; Hs.103982; smafi indudt)Ie cyUMne subfamily B (Cf, IL8:7M=iyfcSS«Y: 6.17 

440452: A1925138; Hs.55150; ESTs, WeaWy sfmDar to CAYP.HUMAN CALCY; none;fiA;NA; 6.17 

421462; AF016495; Hs.104624: aquaporin 9; MIP:TM=Y;SS=M; 6.16 

452960; AK001335; Hs.31137; protein tyrosine phosphatase, receptor t; Y_pho5phatase.none; 6.15 

410381: BE391604: Hs.62661; guanylato bindlho protein 1. Interferon-; GBP.6BP^C;TM=Y;SS^ 6.13 

415765; NM.005424; Hs.78824: tyrosine Idnase with trnmunogbbuiin and ; EGF,fh3jg,pkln8se.lamlnliLEGFJM=M;SS=Y: 6.12 

430478; NAt014349; Hs.241535; apoBpoproieln 1, 3; MotA_ExbB;Tf*=Y:SS=M; 6.12 

413869; NM.000878; Hs J5596; InlerieuWn 2 receptor, beta; none;TM=Y;SS=*/t 6.09 

446608; N75217: Hs.175622: ESTs; Armadi«o.seg.HEAT_PBS;TM=M;SS«M; 6.08 

430378^ Z29S72; HS.25S6: tumor necrosis factor raceptor superfaml; 11^; 6.08 

426116; AA868729: Hs.l44694: ESTs; none.none:6.06 

445033; AV652402; Hs.72901; cycHrHiependent kinase Inhibitor 2B (pi; anlC 6.05 

426721; AA383588; Hs.288545: ESTs, Weakly similar to T29012 hypotheii; zW2H2:TMNM;SS=N: &05 

429228; A15S3633; Hs.356828; ESTs; none.none: 6.05 

421757; Z20897: Hs.296259; paraoxonase 3; AiyleslBrase:; 6.04 

43766% A1358105; Hs.123164; ESTs, Weakly similar to match to ESTs AA; none.pkina5e,pkinase.C; 6.03 

419508; AW997938; Hs.90786; ATP-Wnding cassette, subfamily C (CFTR; ABC_tr8n.ABCjnemtoBne;TM=Y:SS»M; 6.02 

428667; AI375550; Hs.346688; nudeotar protein p40; hontdoo of yeast ; none.nQne; 6.01 

432731; R31178; Hs.287820; fibronectin 1; fnl.fn2.fc3.nonB; 5.95 

446566; H95741; Hs.17914; membran^panning 4<fom^ns. subfamily A; none;TM=Y;SS=iyi; 5.95 

450556; AA010539; Hs.18912; unnamed protein product; zf-C2H2;; 5.94 

418460; M2631 5; Hs.85258; CDS anflgen. alpha polypeptide (p32); lg;TM=Y:SS=M; 5.94 

424054; AA334511; Hs.26638; membrane-spanning 4^tomains, subfamily A; none;TM=Y;5S4rt; 5.94 

408048; NM.007203; H3.42322; A kinase (PRKA) anchor protein 2; Paralemmin:TM=M;SS=N; 6.94 

438670; AI275803; Hs.123428: ESTs; none:NA;NA; 5.91 

424238; AA337401: Hs.137635; ESTs; none;TM=M;SS=M; a90 

444143: AW747996: Hs.160999; ESTs, Moderately similar to A56194 Ihrom; Bd.2,none; 6,89 

423690; AA329548; Hs.23804; ESTs. Weakly similar to PN0099 son3 prot; lon_lrans.lQ,none; 5.88 

409799; D11928; Hs.76845: phosphoserine phosphatase-Rke; Hydrolase;TM=M:SS=N; 5.81 

407239; AA076350; Hs.67846; leukocyte fmmunoglobulin-like receptor, ; tg;TM=Y;SS=M; 5,81 

411125; AA151647; Hs.68877; cytodirome b-245, alpha polypeptide; none:TH*=Y;5S=K«; 5.80 

420340; NVL000734; Hs.97087; CD3Z antigen, zela polypeplkle (HTS com; ITAiyi;TiyNliA:S8=IM: 5.79 

431681; AK000378; Hs.267566; hypolheScd protein FU20371: 8ugarJnTM»Y:SSaM: 5.79 

413441; A1929374; Hs.75367; Src^ike-adapter; SH2,SH3;TM=M;SS=N; 5.79 

443257; A1334040; Hs.1 1614; HSPC065 protein; trypsin;TM=M;SS=N; 6.76 

415801; R24219; Hs.278443; Fc fragment of IgG. low affinity lib. re; lg;TM=Y;SS=N; 5.70 

435299; AI745458: Hs.343026: ESTs. Weakly similar to T20593 hypotheO; none;NA;NA; 5.69 

415995; NR1.004573; Hs.35588B; phosphofipase C, beta 2; C2,PI-PLC'Y,PI-PtC-X;TM=M;SS=N; 6.67 

436772; AW975688; H5.348918; metailothtonein IE (fundlonaJ); 7tnL2.HRM,none; 5,67 

431385; BE178536; Hs.1 1090; membrane^panning 4-domains, subfamHy A; none.none; 5.66 

419833; AA251 131; HsJ220697; Homo sapiens tryptophanyl-iRNA synthetas; WKEP-TRS,tRNA^Llb,none: 5.66 

421859; AA356620; Hs.108947; KIAA0050 gene product; ank,PH,ArfGap:: 5.64 

407756; AAl 16021; Hs.38260; ubiquifin specific protease 18; UCH-1.UCH-2;; 5.63 

425354; U62027; Hs.155935; complement component 3a receptor 1; 7tm.1;TM=Y;SS=fc«; 6.63 

423533; NM.014339; Hs,129751: interteukin 17 recepton none;TM=Y;SS4fl; 5.63 

419577; L36531; Hs.91296: Integrin, alpha 8; lntegrinAF&GAP;TM=Y;SS=N: 5.61 

452561; AI692181; Hs.49ie9: KIAA1634 protein; TPR,POZ.WW,GuanylateJdn;TM=i«:SS=N; 6.61 

428677; AI557119; Hs.351582; troponin I, canTtac; none:TM=fc^;SS=N: 6.60 

425509; AF079363; Hs.158213; spenn associated antigen 6; Armadnio_seg.HEAT PBS;TM^;SS=N; 5.58 

453852: AW96181B; Hs.211592; MUM2 protein; pkinase.DAGJ'E.blnd,C2.pkInase Cnone; 5.57 

421924; BE514514: Hs.109605; coronln. ac8i>4)lnding protein, 1A; WO40.ldh C:TM=M;SS=N; 5.57 

448030; N30714: Hs.325960; membranMpanning 4<tomalns. subfamily A; none;TI^Y;SS=M; 5.55 

431630; NM»002204; Hs.265829; Integrin. elpha 3 (antigen C049C alpha ; F&GAP.Rhabd jlycop.lntegrfnJV;TIi^Y;SS=IWI; 5.53 

410257; BE244044; H3.61469; hypolhellcal protein; none,none; 5.53 

441965; AA972712: Hs,269737; ESTs; pWnase,AdIvliuecp.TSPN.Cona9en; 5.52 

413934; U03056; Hs,75619; hyaluronoglucosaminMase 1; integrin B.GlyGoJhydro 56;; 5.52 

424517; AI539443: H5.137447; Homo sapiens cDNA FU12169 fb. done MA; SHi8TAT,8TAT.bind,STATj>rolnQn8; 5.50 

447357; A1375922; Hs.132821; ESTs; FMaiike,FM(Mike; 5.46 

422109; S73265; Hs.1473; gastrin-reteasing peplkle: Bombesin.Oefensinj«)pep;TM»M;$SBM; 5.46 

447033; AI357412: Hs.157601: Predicted gene: Eos doned; secreted wA/; none.none; 5.45 

417412; X16896; Hs.821 12; Interteukin 1 receptor, type I; ^.TlR:TlyNA-SS»M; 5.45 

438057; AJ004832: Hs.5038; neuropathy target esterase; cNMP_binding M.trans.Paiailn;TM=Y;SS=M: 5.41 

417497; AW402482: Hs.82212: C053 antigen; Uansmentoe4;TM=Y;SS=M; 5.41 

439285: AL133916; Hs.47860; hypotheOcal protein FU20093: lg.pkIn35Q.lRR,U«NT.LRRCT4ione: 5.40 

452698: NM^001295: Hs,301921: dtemddne (WJmotil) receptor 1; 7tm.1;TM=>Y:SS=H* 5.40 

443623; AA345519; Hs,9641; complement component 1, q subcomponent, ; C1q.Collagen;; 5,40 

446272; BE26B912: Hs.14601: hemalopdetic ce*«pedfic Lyn substrat; SH3.HS1 jep;TM=M;SS=N; 5.38 

437275; AW976035; Hs.292396: ESTs, WeaMy sWlar to A47582 &«ll gr; 00ne.Frizzled.F2: 5.37 

419660; BE280337: Hs.l94693; sohito carrier famfly 7 (caBonic amino : a9j»mBases;TM»Y:SS5iy|; &37 

449853; AF006823: Ks.24040; potassium channel, subfanfly K. meniber 3; ionJrans;Tiy|9Y;S5BM; S.36 

442434; AAg95787: Hs.129583: ESTs: IRK.none: 5.38 

428065; At634046: Hs.t57313: ESTs; ICEj)20.0EO.iC€_p1QJC^j)20.DE0: 5.36 
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445333: 8E537541: Hs.44278: hypolheOcal protein FU12538 sWlar to; rasMTK;: 5.33 

425638; NM_012337: Hs, 158450; nasopharyngeal epilheGum specOic prote; non8;TM=M;SS=N; 5.32 

419034; NM_0021 10; Hs.69555: hamopdslic ceO kinasa; SHZSH34ildnase;TMNUl;SS=N; 5.32 

452416; /WJ26115;Hs.114777:EST8:none.PDiphobn_deain: 5^ 

425205; NM.0058S4; Hs.155106; receptor (caSdtonin) activity modifying; none:TM3Y;SS=N; 5^ 

440475; A)807671; Hs^4040; potassium cliannel, subfamily K, mmbu 3; ioiLtr8ns,nQne; 518 

417355; 013168; Hs.82002; entJothelin receptor (ype B; 7trr\.1^«3HC4MSPRYW«^^C2H2wW-baqT^^ &28 
436120; AI248193: Hs.119860; ESTs; heme.l.none; 5.27 

41 8307; U70867; Hs.83974; solute canler tamily 21 (prostagtandln ; OATP.N,OATP C;TM=Y;SS=M; 5,27 

409745; AA077391; ; gb:7B14E1 2 Chromosome 7 Fetal Brain cONA; 7lnu1.zf-C3HC4.1h3.SPRY,KRAB^I-C2H2.rve,zl-B tK)x:TM=Y;SS=M; 5.25 

421554; AW137676; Hs.97775: ESTs; none,none; 5.23 

408308; AL033377; Hs.44197; hypothetical protein DKFZp564O0462; nona^none; 5.22 

410434; AF051152; Hs.63668; loll-iike receptor 2; LRR.tJy^CT,TlR;TM=M;SS=H* 5.21 

421585; U95626; Hs.302043; chemtAine (C-C motiO receptof-fte 2 (; 7lnLl;TMaY:SS=M: 5.19 

400261; ; Hs.1802; Eos Control; lg.MHCJI_beta;TM=Y;SS=M; 5.19 

436856; A1459355; Hs.127310; ESTs; pklnase/rTn;TM=MiSS=N; 5.18 

408761; M057264; Hs.238936; ESTs. WeaHy similar to (delQne not ava; 7Inul,nonB; 5.17 

425023; AW956889; H5.154210; EDG-1 (endothelial dHferenllallon. sph; 7!in.1;TM=Y:SS=M; 5.16 

452203; X57522; Hs.352018; transporter 1. ATP-torftng cassette, sub; ABC.tran^^C jnenU»ano,SRP54,ThyrnIdylatUdn;TM=Y;SS=M; 5.16 

451220; AF124251; Hs,26054; novel SH2-contaIning protein 3; SH2:TM=M:SS=N; 5.15 

417771; AA804698; Ks.62547; reSnoIc acid receptor responder (lazaro; none,none; &14 

424925: NM.002432: Hs.153637; myeloid cdl nuclear differenfiation ant PAAO.DARN.HIN;: 5.14 

451099; R52795; Hs.25954: Inlerleukin 13 receptor. aTpha 2; fn3;TM=Y:SS=M; 5.13 

427509; M62505; Hs.2181; complement component 5 receptor 1 (C5a I; 7tnx.1;TM=Y:SS=M; 5.12 

423196; AK001B66: Hs.125139: hypothetical protein FU11004; nOTe;TM=M;SS=N: 5.12 

433671: AW138797: Hs.132906: 19A24 protein; ig;Tfyt=M;SS=M: 5.11 

426457; AW894667; Hs.380138; chimerin (chlmaerin) 1; DAG_PE^nd.RhoGAP,SH2;TM=M:SS=N; 5.06 

431890; X17033; Hs.271986; integrin, alpha 2 (CD49B, alpha 2 subuni; vwa,inlegriaAFG-GAP;TM=Y;SS=iyt 5.05 

41 8185; AW958272; Hs.347326; intercdlUter adhesion moleculo 2 (ICAM; none;TM=Y;SS=M; 5,05 

437352; AIJ53957; Hs.284181: hypothetical protein DKFZp434P0531: DUF221:TM=Y;SS=M; 5.03 

457918: AL359590; Hs.162604: hypothetical protein DKFZp762M186; PtJ]c;TM=Mi4;SS=:N; 5.02 

452924; AW580939; Hs.97199; complement component Clq receptor EGF,lectin cTissue fac,Xnnk,TIL:TI\4=Y:SS=M: 5.02 

426535; AU077012: H5.288582; ESTs. Weakly similar to ubiquitous TPR m; Kuniti„BPTI,Kun!li.BPn,7tnL2,HRM; 4.99 

432805; X94630; Hs.3107; CD97 antigen; 7tm.2.EGF,GPS,Feca);TM=Y;SS=M; 4.95 

434883; AW381538: Hs.19807; hypothetical protein MGC12959: SH3,PaWW,l^hoGAP;; 4.95 

414291; AI2896ig: Hs.13040; G protein^oupled receptor 86; 7tnu1:TWsY;SSsM: 4.94 

428981; BE313077: Hs.93135; ESTs. Weakly sfanaar to ALU2.HUMAN ALU S: none^ 4,92 

451154: AA015879; Hs.33536; ESTs; TIIVlP,none; 4.92 

435730; AB020635; Hs.4984; KtAA0828 protein; AdoHcyas8.TrfcA-N.2-Hadd DH_C:TMsM:SS=N; 4.90 

413011; AW068115; Hs,821: biglycan; LIW,UWNT~ 4.90 

422732; AA577455; Hs.24937; trBnsfomier-2 alpha OilrB-2 alpha); rmvlg; 4.89 

417015; M83772; H5.80876; flavin containing monooxygenase 3; FMCWike,pyT_fedox;TM=Y;S5=M: 4.88 

412773; H157e5: Hs.74573; similar to vacdnla virus Hindill K4L OR; PLDc;TM=M;SS=N; 4^ 

439659; AW9707B0; Hs.69483; leucine^ich repeal-containing G protein; 7tnu1,LRR;TM"Y;SS»N; 4.87 

405102; ; ; C15001220':gl|4469558|gblAAO21311.1| (AF; DAG.PE-blnd,PH.RhoGEF,DC1:;4.85 

422795; AB033109; Hs.375610; K1AA1283 protein; 7tnfLl JcazaU2M.A2M-N;TMnY;SS=M: 4.84 

432581: AU076465; Hs.278441: KIAA0015 gene product; PP2C;TM=M:SSsN: 4.83 

41 4936; C14774; ; gb:C14774 Clontoch human aorta polyA mRN; ank^kinase.death,none: 4.82 

430162; AB001325; Hs.234642: aquaporfn 3: MIP;TM=Y;SSsM: 4J2 

444838; AV651680; Hs.208558: ESTs; integrtnJ^F&GAP.none; 4.81 

410423; AW402432: H3.63489; protein tyrosine phosphatase, norwecept; SH2.Y_phosphatase.DSPc:TM=M;SSaM; 4,81 

453107; NM_016113; Hs.279746; vaniltokJ receptor-Tike protein 1; ank.tonJrans;TM=Y;SS=N: 4.80 

433376; AI249361: H8.74122; caspase 4. apoptosis^ed cysteine pr; CARD,lCEj>10,ICEj>20:: 4.80 

422010: AA302049: H5.31 181; Homo sapiens cONA: FU23230!ls.cteneC; none.SOF.5ugar.tr: 4.78 

419542; AA366037; Hs.9091 1; solute carrier tamily 16 (monocarboxylic; none.nQne; 4.76 

438899; AF085833: Hs.13S624: ESTs; none,PI3^PI4Jdnase.P13Ka.PI3K.C2^3MHM»l3K-p85B; 4.75 

427418; AA4025B7; Hs.356667: LATI-STM protein: none.none; 4.75 

431924; AK000850; Hs.272203; Homo sapiens cONA FU20843 lis, done AO; 8H3.none; 4.73 

424218; AF031824: H5.143212; cystalln F (leukocystatin); cystetin;: 4.72 

414888; AL039185: Hs.77558: thyrwd hormone receptor Interaclor 7; HMG14 J7 we; 4.72 

416176; AI808527; K5.192822; serologically defined breast cancer anti; none;TM=M;SSsN; 4.71 

430037; BE409649; Hs.227789; mitogervactivated protein klnas^-acthrat; pkinase;TM=M;SS=N; 4.71 

451527; AF022813: H$.26518: transmembrane 4 superfamOy member 7; none,nons; 4.71 

453870; AW385001; Hs.8042; Homo sapiens cDNA: FU23173 lis. ctone 1^ FG-GAP.htegrinAKlF; 4.71 

408113; T82427; H5.194101; Homo sapiens cDNA: FU20889fis. done A; 7tnu3.none; 4.70 

438543; AA810141; Hs.192182; ESTs; SH2,pkinase.none; 4.70 

424943: AU077260; H3.153924: death-assodated protein kinase 1; ank.pkinase.death.SPRY,SAP.Ribosoma|J14o,SRP54.dOENN,OENN,uOENN:TM^ 4.70 
438113; AI457908; Hs.8882; ESTs; 7tm.1,r»one; 4.70 

422164: NM.014312; Hs.1 12377; cortic al thymocyte receptor {X laevis ; Ig.GeraInUnov;TM=Y;SS=M: 4.69 
414482; S57498: Hs.76252: endothelin receptor type A; 7tn>.l:Tli*=Y;SS=«; 4.69 
425069; AA687465; Hs.298184; potasstom vollag&flated channel shakeF-; aWo.kflUBd.none: 4.67 
432314; AA533447; Hs.285173; ESTs; XEnk.none: 4.66 

453518; AW503205; Hs.27268: gh:Ul-HF-BN0-rfcl-fl.03^lj1 N11LMGC 50; SH3.PH.RhoGEF;TM=M;SS=N: 4.66 

418613; AA744529; Hs.88575; ntitogen-acth^ated protein Hnase kinase ; pWnase,CNH;TM=M;SS=N; 4.66 

446053; AI720140: Hs.151079; ESTs; ISK-Channel,none; 4.65 

454034; NM-000691; H9.575; aUehyde dehydrogenase 3 My, member ; aUedh;; 4.65 

431441; U81961; Hs.2794; sodium diannel. nonvdtage-^ated 1 alpha; ASC;Th^Y;SS»N; 4.65 

443402; U77846: Hs.9295; elastin (supravalvular aortk: stenosis. ; none.P0Z,UM,pidn8S8; 4.85 

414809; AI434699; Hs.77356; transferrin receptor (p90, C071); PA:TM=>Y;S5=N: 4.64 

427535; R29543; Hs.2164; priH)iatelet basic protein (Indudes plac tLB:TiyNM;SS«M; 4.64 

4371 19; AI379921; K5.1 77043; XPJ71387 simile to rhotekin; none.nqiB; 4.63 

41 1779; AA292B1 1; Hs.72050; noiHnetastaOc ceOs 5. protein expressa; NDKn 4.63 

429784; M89796; Hs.30s membranKpanidng 4dQmains, suUamily A; none;TM3Y;SSsN; 4.62 
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415934: NM_000928: Hs.992: phosphoHpaseAZ, group IB (pancreas): phosDp-4.61 

408873: AL046017: Hs.356216: cabnoduBn 2 (phosphmytase Idnase. del); none,non8: 4.61 

426432: AF001601: Hs,169857; paraoxonase 2; AfytestBTase:TM=M:SS=N: 4.59 

444805; AB007899; Ks.12017; homdog of yeasi uKquISivprotefn Ggas; WW.HECT.RNAjolAnone: 4.59 

408000: L1 1690; Hs.198689: bullous pemphigoid antigen 1 (23Q/240kD): efii3nd.spectrin,GAS2.5H3.Rectin,R\Xyk)seJsom.FIiD.bZlP^^^ 

l2.MJdh_C.CHjyP3:TM=M:SS=N; 4.59 
431087; H12723: Hs.290791; ESTs; ton trans.non8; 4.58 " 

425485; L18964; Hs.1904; prot^ kinase C lota; pldn3se.DAG.PE-bInd.pkin3se COPR:TM=^M:SS=N; 4.56 

422427; AA310514:Hs.96692; ESTs: PaEts,CHspectm.Ca channel B,non8;4.57 

441527; W19504: Hs.7884: solute earner famay 21 (ojganto anion ; OATP.N,OATP.C:TM=Y;SS=N; 4^ 

416464; NM.000132; Ks.79345: coaguIaDon factor VIII. procoagidantco; CiKixidase.F5.FBJype_C;; 456 

421233; AA209534: Hs.284243; (etraspan NET-6 protein; transinembrBRe4;TM»Y;SS=M; 4.S6 

422311; AF073515; Hs.114948; cytokine receptor-lite factor 1; ff»3:TM=M;SS=N; 4.55 

444895; A1674383: Hs.22891 ; solute carrier family 7 {caSonte amino ; ASC.deatti.TNFKcS; 4.55 

428141; D50402; Hs.18261 1; soiuta carrier famly 1 1 (pmtoiKoupled; Nramp;TM=Y;SS=N; 4.55 

410290: M402307; HS.322&44; hypothelica! piotein OKFZ(>564A176; SeniaiPSt.TiaintegriiLB;TM=Y:S&:M; 4.54 

426437; BE076537: H&.169895: ublquiOiKOnjusaling enzyme E2L 6; AnnadiItojBeg,UQ_con,noae: 4.54 

450086; AW016343: Hs.233301; ESTs; ank.death^U5.NMU.noR8: 4.54 

438209; AL120659; Hs.6111; aryWjydrocarbon receptor nudear transi; HlHPAS,ILfl;TM=M;SS=N; 4.54 
414788; X7B342; Hs.77313; cydlrwJependent Wnase (CDC2-BkB) 10; pkinase;TM=M:SS=N: 4.53 
429109; AU)08637; Hs.196352: neutrophil cytosolb factor 4 (40kD): SH3,0PR.PX;TM^SS=N; 4.53 
427557; NM_002559; Hs.179657; plasminogen actlvalor, urokinase recepto; UPAFLLY6,ET,PtA2Jnfi:; 4.53 
411213; AA676939: Hs.69285; neuropflbi 1; MAM,F5^FB_type„CCUB.CUB,MAM.F5_F8_type_C: 4.53 
434158; T86534; Hs.1437^. ESTs; adenylatekInase.non8; 4.52 

431941; AK000106; Hs.272227; Homo sapiens cDNA FU20099 lis, done CO; pkinase.Furin^ke.Recep_Utomain,none; 4.52 

447341; AF106941; Hs.18142; errestin, beta 2; arresBn.arTestln_C,PX.PH.PU>c; 4.52 

447656; NM.003726: H5.19126; srckinase-sssodated phosphoprot^ of ; SH3,PH;TM=M;SS=N; 4.51 

417018; M16038; Hs.80887; v-yes-1 Yamaguchi sarxma viral related ; SH2.SH3,pidn8se;TMNiA:S$s44; 4.51 

422893; X98411; Hs.380O77; myosin IF; SH3.myos!rLhe8d.lQ;TM=M:SS=N; 4.51 

407202; N58172: Hs.109370; ESTs; F5.F8.lype.Cpkinase,Els,none; 4.51 

447079; AA2800S7; Hs.105280; ESTs. Weakly similar to dJ963K23.2 [H.8a; zf^C2H2.zf-C3HC4,UIM;TM=M;SSsN: 4.51 
450747; AI064821; Hs.129953; ESTs. Highly sinfjBar to 1818357A EWSgen; mii.2f-RanBP.GAS2; 4.50 
419452; U33635; Hs.90572; PTK7 protein tyrosine kinase 7; lg.pklnase;TM»Y:SS-M; 4.50 
453856; AA804789; Hs.379109; PDZ-UM protein mysthiue: UM,PDZ;TiyNM;SS»N: 4.49 
432744; AA988835; Hs.38664; ESTs; none,none; 4.49 

419032; W81330; Hs.99877; ESTs. Hghly similar to JAK3B (H.sapiens; pk]nase,SH2,lnsu1in,pkina5e.SH2: 4.48 
444009; AI380792; Hs,135104; ESTs; TNFfLc6,TIUnone; 4.48 

426416: AW612744; Hs.169824: killer cell tecSn^ke receptor subfamt; lediiuc;TM=Y;SS»M; 4.48 

412802; U41518; Hs.74602: aquaporin 1 (channel4omiing Integral pr, M1P;TM»Y;SS"M; 4.48 

447217; BE465754; H5.17778; neuropilin 2; CUB.MAM.F5^F8_type„C;TI\*=M;SS=M; 4.47 

408771; AW732573; Hs.47584; potassKim vottage-flated channel, delayed; efhand,ion_lrans.K.letra,none; 4.47 

435049; AL122067; H$.4746: hypottietlcal protoin FU21324; none;TMNM;SS=N; 4.46 

413278; BES63085; Hsi33; interferOfhsSmulflled protein. 15 kOa; ubkiuBIn;: 4.45 

423804; AW403448; Hs.1706: interferoiv&fimutalfid transcripfion fed; IRF;cK:3HC4.l8R.zf-RsnBP:TM=M;SSsN; 4.45 
434308; N51517: Hs.47282: ESTs; pWnase.pWnase.C.none; 4.45 

434448; W26667; Hs,184581: Homo sapiens cDNA FU14821 fis, done OV; pWnase.pkinase,C;: 4.45 

417426; NM_002291; Hs.82124; lamlnin. beta 1; laniinin^GF.Iaminin_NtemMntBgrin.6;; 4.44 

417389; BE260964; Hs.82045: mUkbie (neurfte growtlHMomoiing factor; PTtf.MK;TMsM;SS»Y; 4.44 

430259; BE550182; H5.375142; f^F-like protein 3. mouse homotog; fn3,RA,f^GEF;TM»M:SS=M: 4.44 

436001: AW903849; Hs.173840; HUEL(C4oif1Hnterading protein; ig;TM=M;SS=M; 4.44 

452355; N54926; Hs.29202; G protei^coupied receptor 34; 7lnu1.0ATP_C;TM=Y;SS=N: 4.43 

418751; BE389014; Hs.372548: phosphdnodIide-3«n3se. regulatory su; SH2.none; 4.43 

410068; AI833B88: Hs.58435; FYN-Unding protein {FYB-12Q/130); SH3;TM=MI;SS=N; 4.43 

449961; AW265634; Hs.133100; ESTs; pkinase,Fuiin^.Recep.L.domain,none; 4.42 

451734; NM.006176; Hs.25944; neurogranin (protein kinase C substrate.; IQ,7tm_1;TM=M:SSsN; 4.42 

410598; AI817130: Hs.9195; Homo sapiens cDNA FU13698 Ds, done PL; RasGEF.PRK: 4.42 

43941 1; AA044876; H3.6BM3; ESTs. Weakly similar to CYA2.HUMAN ADENY; guanylate_cyc;T^#=Y;SS=f4; 4.42 

433179; AW362945; Hs.162459: ESTs; ArmadiltojBeg.noRe; 4.42 

414849; AW372721; H3.291623; ESTs, Weakly siniar to unnamed protein ; pkinase.none; 4.42 

409512; AW979187; Hs.293591; melanoma dliferenflatton assodated prol: DEAO,heIicase_aCARD;TM=M;S&=N; 441 

445903; AI347487: Hs.132781: dass I cytokine receptor; fn3;TM=Y;SS=N; 4.41 

438507; AA809052; Hs.356627; ESTs; nonB.none; 4.41 

409524; AW4021 51; Hs.54673; tumor necrosis fador (Ugand) superfami; TNF;TM=Y:SS=M; 4.40 
453037; AA045175; Hs.17914; ESTs; none;TM=Y;SS=M; 440 

412228; AW503765; Hs.73792; complement component (3d/Epstoin Barr vt; sustii;TM»Y;SS=M; 440 
451035; AU076785; Hs.430; plastin 1 (i isoforni); elhand.CH.Adaptin_Nn 4.40 
415149: X12451: H5.7B056: cathepsin U Peptldase.CI;; 4.39 

408105; AW152207; H5.270977; ESTs. Weakly similar to 138022 hypotheB; Y_phosphatase.carb_anhydfase.DSPc»none; 4.39 

423099; NM-.002837; Hs.123641; protein tyrosine phosphatase, receptor t; fn3,Yj)hosphatasa,DSPc,COX6C;TM=fi^SS=M; 439 

438330; AW450572: H5.257316; ESTs; pWnase.2«>4,ERM.CNH.none; 4.39 

433437; U20536; Hs.3280; caspase 6. apoptosis^eiated cystine pr. i(£_p10.ICEj)20:; 4.39 

429747; M87507: H5.2490; caspase 1, apoptosls^ed cysteine pn CARDJCEj)10.IC^20:; 4.39 

426410; 6E2g8446; Ks.305890; Ba2-like 1; Bd-2.8H4.none; 4.38 

434511; R28982; Ks.18106: ESTs; pkinase.GiycoJ)ydn).39; 438 

448868; AW196663: Hs.20024?, caspase recfuHmonl domain protein 6; CAf^TM»M;SS«N: 437 

447827; U73727: Hs.19716: protein tyrosine (tophatese. receptor t; fn3.lg.Yj)hosphabse.MAM:TM»Y;SSsM: 436 

432583; AW023824; Hs.1 62282; potassium channel TASM; potasshim Chan; ion tr8ns.X;TM^Y;SS=M: 4.36 

413472; BE242870: Hs.75379: sdute carrier family 1 (glial high affi; SOF:TM=Y;S^M: 4.36 

426828; NKL000020; Hs.172670; adivin A receptor type IHike 1; pkinase.Activinjecp;TMsf«S&4UI; 436 

449444; AW816436; Hs.3S1306; sdute canter temily 16 (monocartxscyBc; nonQ(TiyNY;SS=l^ 4.36 

437145: AF007216; Hs^ sdute canter temHy 4 sodium btoarfaon; HC03.cotransp;TM»Y;SS»N; 4^ 

429670; 101087; HS.211S93; protein kinase Q thete; 0A6J>E4Mpklna5e4]kin3se.(;DNAjKH.vlral.KPHD,X1;TN^^ 435 

421 195: BE484560; Hs.133017; ESTs: none^ne; 4.35 
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4t5758: BE270465: Hs.78793; protein kinase C. zela; pklnase,DAG PE^bdjjWnase.COPR;; 4.35 

457001; J03258; Hs.2062; vitamin D 0,25- (Chydroxyvilaniln D3) re; hormone rec^H;4,MetaIlolWo_5;TM=M;SS=H 4.34 

419150: T29618; Hs.89640: TEK tyrosine kinase, endothelial (venous: EGF.fh3.pWnase,ig.lanilnbuEGF,DSL;TM»Y;SS=M: 4.34 

440875: AW005054: Hs.279788; ESTs. Weakly similar to KCCl HUMAN CALO; pkin3se,none; 4.34 

429657; 013626; Hs.2465: WAAOOOl gens product; pulalive Giwotel; 7tmJ;TM=Y;SS=M; 4,34 

414509; AW161311: Hs.76294; C063 anSgdn (melanoma 1 antigen); transmeml)raRe4:TM'Y:SS»M: 4.34 

425771; BE561776; Hs.159494: Bruton agammagtobuiinemia tyrosine Unas: SH2,SH3,pkInase.PH3TK:TM»M:SSsN; 4.34 

452124: AA454220; Hs.61 170: ESTs; pkinase,none; 4.33 

407775; N^^004914; Hs.3e772; RAB38. member RAS oncogene idrr&f, ras.aif:TM=M;SS=N; 4,33 
452688; AA721140; Hs.49930; ESTs, Weakly similar to putalive p150 (H; SH3.none; 4.33 
434164; AW207019; Hs.148135: serineAlveORlne kinase 33; |ddnase;TKH^SS=N; 4J2 
445330: R52656: Hs.2169t: ESTs; TlmJ^e; 4.32 
437527; AI241019: Hs.145644: ESTs; RP5K.noRe; 4.32 

437763; AA469369; Hs.5831; tissue MBor of metaitoprol^e 1 ; TlMP.pkinase.DAGuPE-bind.RBD; 4.31 
416714; AF283770; Hs.79630: CD79A anilgen (immunogtobuln-assodated; ig.lTAM^.dus:TM»Y;SS4« 4.31 
416269: AA177138: Hs.161671: ESTs; |ddnase.DAG_PE-bind,RBD.Rone; 4.30 
425458; H89317: Hs.182889; ESTs; bn.tran8.none; 4.30 

424206; NM.003734; Hs.198241; amine oxidase, copper containing 3 (vase; Cu_anr)inejDXld,Cu.antin8ji]tUN2,Cu„8mlne_oxU 4.29 

451676; T63141; ; gb:yb99a12.s1 Stratagene lung (937210) H; SH3.none; 4.29 

417601; AA417383; HS.82S82; tntegrin. beta^ke 1 (vrith EGF4ike rep; EGF;; 4.29 

43SZ40; AI025435; H$.117532; ESTs; GHMP.kinases.nQne; 4.27 

444051; N48373; Hs.10247; activaled leucocyte cell adhesion mdecu; none,none; 4.26 

423523; AW29982B; Hs.193580; ESTs; nQnd.nons; 4.26 

426274; D361 22; Hs.2007; tumor necrosis factor (ngand) superfami; TNF;TM-Y;SS-N; 4.26 

425356; BE244879: Hs.155939; inositol polyphosphate-SiJhosphatase. 14; Exo.endo.phos,SH2;TM-M:S$=N; 4.26 

448386; AB037750; Hs.21061: KIAA1329 protein; PKD,BNR;TM=Y:SS=M: 4.26 

418318; U47732: Hs.84072; transmembrane 4 superfamily member 3; tran$membrane4;TM=Y;SS=M; 4.26 

427274; NM.00521 1; Hs.174142; colony stimulating factor 1 receptor, fb; ig.pkinase;TM=Y;SS=M; 4.26 

416602; NM_006159; Ks.367895: Protein kinase Oblndtng protein NELL2; EGF,vwc.TSPN;: 4.25 

436729; BE621807; H5.351316; transmembrane 4 superfamily member 1; none;TM=Y;SS''M; 4.25 

436494; AA720997; Hs.12829S; ESTs; non8.CAP.GLY,HC03.colransp.Giyco hydro„63.PH: 4.24 

439568; AI091277; Hs.302634: frizzled (DrosophBa) homdog 8; Fiizzted,Fz.7tm_2,toxin_2;TM=Y;SS=M; 4.24 

416255; AW135405; Hs.37251; ESTs; pkinase.none; 4.24 

400328; X87344; ; transporter 2, ATP-binding cassette, sub; none;TM»Y;SS»N; 4.24 

405121: ; ; mitogen-adivaled protein kinase 8 inter, Cys.knot,TGF-beta,vwa,ywc.vwd.T1UDllF139;: 4.24 

425795; AJ000479; Hs.l59543; ED&6 (endothelial differentiation, G-p; 7tm.1;TM=Y;SS=M: 4.23 

406786: AW161678; Hs.111334; fsnUin. Bght polypeptida; ferrifIn;TMsM;SS=N; 4.23 

449843; R85337;Hs.24030: solute easier laniHy 31 (copper tr8nspo;nane;TM=Y:SS=bt 4.23 

445657; AW612141; Hs.279575; Homo sapiens ^protein coupled receptor ; 7lm.1;TM=:Y;$S»M; 4.23 

413795; ALD40178: Hs.142003; ESTs; none.pkinase.U^LRRCT; 4.22 

409142; AL136B77; Hs.50758: SMC4 (stnjctural maintenance of chromoso; ABC_tran.M,SMC_N.SMC_C.DUF164.none; 4.22 

447887: AA114050; Hs.21 1610; caspase 8. apoptosis-related cysteine pr, )CEj)10,IC^20.DED;TM=M;SS=N; 4.22 

417316; AW953937: Hs.240845; ESTs; SH3,PH.RhoGEF:: 4.21 

424291: AL1200S1: Hs.144700; ephmvBI; Ephrin;TM»Y;SS»M: 4.21 

408279; AF216955; Hs.44095; Homo sapiens, clone MGC:12617. mRNA. com; none,none; 4.20 

432636; AA340854; Hs.278562; daudin 7; PMP22_ClaudIn;TM=Y;SS=M; 4.20 

424618; 129472; Hs.1802; major hiskjcompaObiHly complex, class ; ig.MHCJLbela;TM=Y;SS=M; 4.19 

445633; AI453386; Hs. 17267; ESTs, Weakly similar to S26689 hypothetl; IRK.none; 4.19 

432882; Nh^013257; Hs.279696; serum/^lucocorHcok! regulated kinase^ pkinase,PX.pkinase_C;: 4.19 

425481; AW978162: Hs.37281 1; ESTs; none.OxysteroLBP; 4.19 

429081; Y14039; Hs.igS17S; CASP8 and FADD4ikQ apoptosis regulator; lCEj>20.DED;TMsM;SSsN; 4.18 

401083; ; ; Nf^016582*:Homo sapiens peptkJe Iranspon PTR2:TM=Y:SS=M; 4.18 

420676; AI434780; Ks.4248; vav 2oncogene; RhoGEF,PH.CH,SH2,SH3.DAGJ>E-bind.none; 4.18 

424377; AF081675; Hs.146322; HRer cell lecMke receptor subfami; lecfirLc;TM=Y;SS=M: 4.17 

424148; BE242274; H5.1741: Integrin. beta 7; lntegnruB.EGF,metdthto.PSI:TM=Y;SS=M: 4.17 

421391; AW304350; Hs.191958; Immunogtobulln superfamily receptor Iran; lg,none; 4.17 

452100; AI666666; H3.379032; Inositol pdyphosphata-Sphosphatase, 75; Exojendo4}hos,RhoGAP,non8; 4.17 

413989; X14034; Hs.75648: phosphofipase C. gamma 2 (phosphattdyfin; SH2.SH3,C2,PH,PI-PL&Y.Pi^C-XPDQF;: 4.17 

422310; AA316622; Hs.98370; cytochmme P450. subMiy US. polypept; none,pkinase,fn3,lg: 4.17 

444034; AL161957; H5.10177: pleckslrin homotogy domain Interacting p; E1- 

E2>TPase.CationJ^TPase.C.CaJkmJVTPaseJJ.Hydrolase.RiT)OSon«l_^ 
450056; BE047394: Hs.502; ESTs. Weakly similar to S71512 hypothefl; A6C^tran,ABCjnembr8ne,lg.MHCJUbeta,8RP54,proteasome,ABCjnembrane^C„^^ 4.16 
407245; X90568: Hs.172004; titin; fn3,lg.SGXXSG.pklnase;TM=M;8S=N; 4.16 
418962; AA714835; Hs.271883; ESTs; RhoGAP,SH2,pklnase.POLO_box.none: 4.15 
410590; BE615216: Hs.64746: chloride Intracellular channd 3; non6:TM»M:SS»N:4.15 
425743; BE3g6495: Hs.159428; BCL2-8ssodaied X protein; Bd-2;TM»Y;8SBN: 415 
445967; AI699629; Hs.156781; ESTs; none.none; 4.14 
432176; AW090386; Hs.1 12276; arrestin. beta 1; arrestin.arres8n.C,none; 4.14 
452571; W31518; Hs.34665; ESTs; none;TM=M;SS=N; 4.14 
425421; L11669; HS.15714S; Islracydlne transporter-like protein; sugar.tr;Tliil»Y:8S»ll« 4.14 
410687; U24389: Hs.65436; lysyl oxklase^ 1; LysyUooddasa;; 4.14 
417871; AA521368: H8.24252; ESTs; IBBAroadflto_seg,none: 4.13 
429819; AL133011; Hs.253920; Homo sajMons mRNA; cDNA DKFZp434P201 (fn none.none; 4.12 
424522; AU34647; Hs.149957; ffbosomal protein S6 kinase. 90kD. polyp; pkInase,pkinase_C;: 4.12 
429623; NRiLOOSSOS; Hs.21 1569; G proleirhcoupted receptor kinase 5; pMnaseJ^;TllA»M;SS»N; 4.12 
413019; BE281604; Hs.75140; tow density lipoproteln^lated protein-; none;TMsM;SS»Y: 4.12 
434071; AF1 16653: Hs.34192: Homo sapiens PRO0823 mRNA, oomptete cds; none:TM=M;SS»N; 4.11 
434779; AF153815; Hs.50151; potassium Inwardly-rectiiylng channel a; IRK;TM»Y;SSBNt 4.11 
449655: AA002008; Hs.188633; ESTs; PIP5K.none: 4.11 
40640% ; ; NI4.002162*:Homo sapiens inteiceOular ad; lg:TM»Y:SS»M: 4.10 
427732; NM.002980: H8.2199; secretin receptor; 7lrTL2.HRM:TM»M;SS»M; 4.10 
437608: AA76160S: Hs.292308: ESTs, WeaNy slmHar to AUJ1_HUMAN ALU S; pkbiase.RIOMone: 4.10 
432885: AAS9^7; Hs.368129: ESTs. Weakly similar to ALU1J1UMAN ALU S; pldnase.pkinase_C,none; 4.10 
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411190-, AA306342: HS.69U1: pnMMnase C>ak0% pUnasajikinase.CXHR^ 

418342: BE002723: H!l334330: lepBn leceptoi; ICEj)20.DEDJC^j>10.)C^20.DED; 4.10 

424909: S78187; Hs.153752: cell dMston cyde 258; Rhodanese;; 4.10 

435905; AW997484; Hs.5003; igAA0456 protein; SH3.RhoGAP»FCH;TM=M:SSsN: 4.10 

42227B: AF072873: K5.1 14218; frizzled (Drosophila) homdog 6; Fz.Frizzled.7tin_2;TM»Y:S&=M: 4.10 

437952; 063209; Hs.5944; sohile carrier famiiy 1 1 (proioivcoupled; none;TMsY:SS»% 4.10 

432827; Z68128; Hs J109; Rho GTPase acfivaSng protein 4; FCH.RhoGAP,SH3;TiyNM;SS^ 4.09 

435140: M668123; Hs.134170: ESTs; none^ione; 4.09 

422627; BE33$857: Hs. 118787; transformit^ growth factor, t)et»{nduced; Fasdclin^Cjrari^.fTiembrari8.GrP.EFrU;TMs|itSS4^ 4.08 
426483; A1906539; HS.184S92; KIAA0344 gene product; rwn8.none; 4.08 
446232: A1281848; Hs.ig4691; (Otinoic acid induced 3; 7lrTL3.none; 4.07 
431674; AA098901: Hs.30164Z. ^protein coupled receptor. none,GCV_H; 4.07 

409686; AK000002; H$.55879: Homo sapiens mRNA; cDNA OKFZp434L0827 (f; ABC.tran>^BCjnemfarane;TMsAit;SS"M; 4.07 

441518; AW16ie97; Hs,294150; ESTs; Y^hosphalase^OSPcnone; 4.07 

442599; AF078037; Hs.324051; RelA-assodaled inhibitor. SH3.8nk;TM=M;SS=»N; 4.06 

436982; AB018305; H$.5378; spondin 1, (f<spondin) extracellular mat tsp.l.Reeter;; 4.05 

420361; N92054; Hs.1947ie; zinc finger protein 265; z^RanBP,7tm_1; 4.05 

439549; AW937885; H5.137314; ESTs; SH2.none; 4.04 

419981; AA897581; Hs.128773; ESTs; pklnase.DA6J^-l)ind,p)dnase C,OPR,f>one; 4.04 
418836; AI655499; Hs.161712; ESTs; pklnase.ActivlAjecp,PDiZU5.dBalh; 4.04 

408806; AW847814; H5.75608; Homo sapiens cDNA: FU21532 fis. done C; SH3,PDZ,Guanylate.kin.none; 4.04 
432106; N58323; Hs.269098; ESTs, Waaldy similar to RETROViRUS-RELAT; SH3,PDZ«Guanytate.l(in.none; 4.03 
426086; T94907; H3.188572; ESTs; PliEts.Caspectrin.C8.cliBnnet.B.none: 4.03 
41 6203; X54942: 1^.83758; COC28 protein kinase 2; CKS;; 4.03 

412270; AC005262; Hs.73797; guanine nucleotide t}Inding protein (G pr; G-^h8,8rf;TM»M:SS3N; 4.03 
416350; AF1,88625; Hs.189S07; piiosphonpase A2. group ItD; phosfip;TM=M;SS=Y; 4.02 
434457; AF141332; Hs.200333; apdipoprotein 848 receptor. none;Tli^M:SSsN; 4.02 

414271; AK00027S: Hs.75871: protein Idnase C Ixnding protein 1; bii3nndoma'n.PHD,PVVVVP.zMmJD:TM=li^ 4.02 

425694: U51333; Hs.159237: Iiex6idnase 3 (witite ceil); liexDldnase,lwxoi(ina5e2:TiyNM;SS»N; 4.02 

449943; AF104266; Hs.24212; tetrophlBn; 7tnL2,GPS,GaLLeciln.OlJ,l^philin,HRM;TM=Y:8S»lt 4.01 

408938; AA059013; Hs.22607; ESTs; fn3.Yj)hosptiBtase.carfa^anhydr8se.none; 4.01 

426839; M74782; Hs.172669; interteuidn 3 receptor, alpha (low affin; none;TM=M;SS=M; 4.00 

422282; AF01922S; Hs.114309; apollpoprotein I; MotA.ExbB;TM=Y;SS»M[ 4.00 

410726; AI623859; K5.15936; ESTs; pkinase,projsomerase,none: 4.00 

428318; BE3001 10; Hs.183842; ubiquitin B; fipocarin.sSdedh,ublquitin.lRK;; 4.00 

440188; AK001812; ite.7036: N<Acetylglucos amine kinase; ROK;TM=M;SS=N; 3.99 

429952; AF080156; Hs.226573; IntiibitDr of kappa Oght polypeptkie gen; pkinase,ubiquitin.EnterDtoxin^,PH04,pkinase.ubiquilin; 3.99 
414700; K53202; H6.38163; ESTs; 7tmJ;Tll4=Y;SS=M; 3.99 

432269; NM.0O2447; H3.2942; macnipiiagestimuIaOng 1 receptor (c-met; pkin8se.8ema.PSI.TIGA4.EXTRA;TM=M;SS«M; 3.99 
456362; AW973003; H5.179909; hypolheficai protein FU2299S; none;TMslirt:SS»N; a9B 
427541 ; A)7g8983; Hs.375635: solute carrier 35 (CMP-daifc ad; none^none: 3.98 
440248; AA876136; Ks.363458; ESTs; SH2.none; 3.98 

437400; AB01 1542; Hs.5S99; EGF-Hke-domaln, multiple 5; TNFRj£.laminln.EGF^=Y;SS=N; 3.98 
425262; D87119; Hs.1554l8; GS3955 protein; pkinase;; 3.98 

420166; AW732276; Hs,95583; transmembrane 4 superfamSy member (tetr; lransmembrane4;TM=Y;SS=M; 3.98 

437151; AA745618; Hs.380121; BANP iK)molog, SMAR1 iiomoksg; none,none; 3.98 

443574; U83993; Hs.321709: purinergic receptor P2X. llgand^ated to; P2)Oeceptor;TM3Y;SSsM; 3.97 

449027; AJ271216; Hs.22880; dipeptidytpepiidsse III; Peptidase_M49.EGF,ig,NsiJregulln;TM=M;SS=N; ^97 

41 1574; 6E242842: Hs.6780; protein tyrosine Idnase 9^ (Ae^etat; LRR.LRRCT.Tiacolilin J^DFJM=M;SSnN; 3.97 

432639; AW973765; : gb:EST385886 MAGE resequences, MAGM itomo: none,tRK: 3w97 

457675; AF1 19917; Hs.306574; Homo sapiens PRO3098 mRNA. complete cds; none;; 3.97 

445701; AF0555B1: Hs.13131; lymplx}cyte adaptor protein; SH2,PH;TM=M;SS=N; 3.96 

437157; BE048860; Hs.17287; ESTs; lRK.none: 3.96 

453641; AA444140; Hs.90960; ESTs; CbLN.CbLN2.a)LN3.UBA.zf-C3HC4.none; 3.96 
446714: W73818: Hs.110028: ESTs; TtnLlJtnLl: 3.96 

427648; AI376722: Hs.180062; proteasome (prosome, macropain) subunit,; proteasomer. 3.96 

453686; AL1 10326; H5.304679: ESTs, Moderately similar to Z195_HUMAN Z; none,lectin.c.8g.ctian; 3.96 

457718; F1BS72; H5.2297B; ESTs, Wealdy simiiar to AtJU4.HUII/tAN ALU S; pkin8se,pkin33e; 3.95 

428727; AF078847: Ks.76452; general transcrlptkm factor IIH. polype: PH04.UM;Tiyl»M;SS:'N; 3.95 

435411: AW444619: Hs.138211: ESTs; none,pldnase; 3.94 

440209; H05049: Hs.247837: neurexin 3; laminln.G.EGF.none; 3.94 

416636; 1^2536; Hs.42645; sdute carrier family 16 (monocaAoxytic: none.none; 3.94 

435272; AA906415; Hs.1 10041; ESTs; none.pl(inase; 3.93 

402550; ; ; Target Exon; none.none; 3.93 

425233; Z17B61; Ks.155218; ElB^Oa-associated protein 5; SPRY,SAP.pkin3se.fn3.Ig; 3.93 
410073; AW408163; Hs.58488; catenin (cadtierin-assodaled protein), a; Statlmiin,VincuIin;; 3.92 
453548; AiJ)79983; Hs.1 15774; integrtn. alptia 1; none,vwa.F&GAPjn1egrinJ^ 3.92 
417226; AW505054: Hs.4283; ESTs; pkinase.RGS.PH.myosliuhead.H^in_tdl; 3.92 
446755; AW451473; H5.16134; serine/threonine kinase 10; pkinase.TYA;TM=M;SS=N; 3.92 
452344; AI264357; Hs.55405; hypotheUcal protein MGC16212; Sulfate_transp,STAS;; 3.92 

418516; NM.006218; Hs.85701; pho6phdnositide-3-klnase, catalytic, al; PI3_R4Jdnase,Pt3Ka,P13K^C2.P13iObd,PI3Kj)85B.none; 3.91 

423069; W15613; Hs,1613; adenosine A2a receptor; 7tnu1;TM=Y;SS=M: 3.91 

414443; AU077268; Hs.76144; platetetrderrved growtti factor receptor.; ig.pkinase;TM=Y;SS=N; 3.91 

434392; AW983709; Hs.250824; Homo sap^ cDNA: FU23435 fis, done H; pkinase.none; 3.91 

429615; AF25B627; Hs.211562; ATP-blndlng cassette. sub4amOy A (ABC1; ABC_tran;TM=Y;SS=M; 3.91 

414774; X02419; Hs.77274; plasminogen activator, urokinase; kringl8.trypsin.planLthMs;; 3.91 

442831; AJ798959; Hs.131686; ESTs; ABGJran,PR]CABC_tran; 3.91 

441657; BE314696; Hs.7936; BAII-assodated protein 2; SH3;T^t:^M;SS^; a91 

438698; AW2g7855; Hs.361 171; ESTs. WeaMy similar to 136022 hypothetl; iipoxygen3se.PLAT.n0ne; 3.90 

447560; AF065214; Hs.ie858; phospijoripase A2. group IVC (cytosoHc, : PLA2.B;TMWW:SS»Ni 3.90 

437897; AA770561; Hs.146170; tiypothetk^l protein FU22969: zf^HHCnone; 3.89 

429379; NM.01484Q; H8.200598: K1AA0537 gene produd; pkinase,RI01:TM=IM;SS=N: 3J9 

410179; W27723: KS.S9496: cell division cyde 24ike 5 (choOneste; pkinase;; 189 
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428713; AA432087; Hs,268551: ESTs. Modexalsly similar to CYA4 RAT ADE; pWnase;: 3.89 

456629: AW891965; m.367942: hlslone deacetytase 3; HSP90.HATPase.<uW:2H2,PHD,none; 3^9 

425190; AW028302; Hs.155079; protein phosphatase 2. regtdatofy sutmnh. B56;TM=M;8S^N; 3.89 

4267S2; X694g0: Hs.172004; fHin; fn3.ig.pi(in8Se.SGXXSG;TM»M;SS=N; 3.89 

417767; BE242241: H5.82542: acykayacyi hytbotese (neutropW!); Upase.G0SUTM4*SS^ 3.88 

414029; BE2g7731; Hs.75709; mannos&«i)hosph3te receptor (cation dep; Mar>^P.recep;TM=W;8Ss^ 3.88 

416140; AI918035; Hs.301198; rouiKlaboul (axon guidarcs receptor, Dros; [umnone; 188 

434224; AA380731: H5.84; tnterteukin 2 receptor. Qanvna (severe co: frt3;TM»Y:SS=M: 3.88 

410011; AB020641; Hs.57856; PFTAIRE protein kinase 1; pUnasa;TM=M;S&»N; a87 

406908; Z25437; ; gb:asapiens prolelivtyiosine Unase gen; nono.none: 3.87 

425289; AW139342; Hs. 155530; interferon, gammfrlnilucitJle protein 16; PAAD.DARN,HIN;: 3.87 

441859: AW194364; H5.9877; interteuMn^ induced gene-1 protein (R; Am{nojoxidase.FAD_l)indiRg_3,TBC:TIUNM;SSBN: 3.87 

439975; AW328081: Ks.6817; inosine triphosphatase (nucleoside triph; Ham1pJI(e;TM=M;SSsN; 3.87 

415392; Z44087; Hs.10957; ESTs; PIP5K,none; 185 

416033; NM12201: Hs.76979; Golgi apparatus protein 1; cy8jTch.F6FR:TM=Y;SS^ 3.66 
414649; A1672727; Hs.76753: endogOn (C0105 anOgen) (ENQ); none:TiyNY;SS»IM: 3.85 
425729; 122647; Hs.159360; prostaglandin E receptor 1 (sul^type EPI); 7tnu1;TM»Y;SS4/l; 3.85 
414496; W73853: Hs.355424; ESTs; pWnase,F5_F8jlype_C.adh^8hortnone: 3.84 
412204; A1125507: Hs.24937; ESTs; fg,rrni,none; 184 

434375; BE277910; Hs.3833; 3r-phosphoadenosine S'-phosphosdfate sr. APS_lunase,ATP-sulfuryla5e,PRK.ThymidylateJdn;; 3.84 

444981; AW855398: Hs.12210; hypothefoal protein FU13732 similar to; SH2;TM=^^SSsN; 3.84 

412309; M23892: Hs.73809; aracMdonste IS-lipoxygenase; lipoxygenase.PLAT;; 3.84 

405545; ; ; Target Exon; ABC_lran,SRP54ABCjneml)r8ne;Ti*»Y;SS=M; 184 

407143; C14076: Hs,332329: EST; nonB:TM=Y;SS=M: 184 

420593; AA280356: Ks.187634; ESTs; B56,none; 184 

413420; AW410235; H5.75348; proteasome (prosome, macropain) aclhrato; PA28_dpha^A28.bela.t)ioptetiiLH;; 3.83 
448253; H25899; Ks.201591; ESTs; 7tm_2.KRM,none; 163 

414042; NM.004915; Hs.10237; ATP-blrw)ing cassette, suWamily G (WHIT; ABC tran.PRK,GBP;TM=Y;SSaN: 183 

430397; AI924533; Hs.10S607; bicarbonate transporter related protein ; HC03 cotrBnsp;Tl^-Y;SS^; 3.83 

423067: AA321355; Hs,285401 ; colony stimulating (actor 2 receptor, be; fn3;TM=Y;SS41; 3.83 

458188; AW297226; Hs.137840; ESTs. Moderately similar to S1X4_HUM^ H; pkinase.WIMO; 182 

426486; BE178265; Hs.170056; Homo sapiens onRNA; cDNA DKFZp586B0220 (f; pldnase,none; 182 

428791; AA435661; Hs,264750; ESTs; zl-C3HC4.none; 3.82 

438068; AI927209; Hs.306210; Hcnno sapiens cONA: FU23133 tis. done I; NusG;; 182 

453370; AI470523; Hs.139336: ATP-bindIng cassette, sub-tamily C (CFTR; ABC tran,ABC membrane;TM=Y;SS=N; 182 

419250; AW770185; H8.356066; U5 snRNP-spedflc prc>teln, 116 kD; 7tm_1.BAH^f.CXXC.DNAjnethylase; 182 

410017; AW952426; H8.1 09438; Homo sapiens done 24775 mRNA sequence; none,none; 3.82 

420679; )C571S2: Hs.165843; Mann; CK.II beta.Fibn1lain.WD40;TM=M;SS=N; 182 

41 7916; NM_006416: Hs.82921 ; solute carrier family 35 (CMP-sIalic ad; 0UF6;TM»Y;SS»M; 181 

425923; NM-005026; Hs.162808; phosphdnosltid&^kinase. catalytic, de; none,none; 3.81 

417365; D50663; Ks.82028; transfofTning growth factor, beta recepto; pkin3se.WD40;TM=Y;SS::N; 164 

414521; 028124; H5.76307; neuroUastoma. suppresston of tumorlgenl: DAN;TM=(;A;SS=M; 152 

422398; A1476149; Hs.334489; hypothetical protein FU21992; SH2.SH3;; 151 

418432; »^14156; Hs.851 12: Insuiin^e growth fa^ 1 (somatomedi; Instdn;; ISO 

459705; BE082764; Hs.270252; ESTs, Weakly similar to androgen recepto; none.C2,WW.HECT: 3.48 

425009; X5828&; i^154151; proton tyroskw phosphatase, receptor t; rn3jg.Yj}hosphatase.MAM;TM=Y;SS=M; 3.38 

415617; U88967; H5.78867; proteto ^Ine phosphatase, receptor-t; fh3,Yj>hosphatase,caTb_enliydras6:TM^Y;SS=M; 137 

433336; AF017986; H&31386; secreted ftizzled-relatsd protein 2 (stn FZ.NTR;; 124 

426125; XB7241; H8.166994; FAT tumor suppressor (DrosophOa) homolo; EGF^cadherin,laminin G;TM=:Y;SS=M; 11 1 

419721; NM_001650; Ks.315369; aquaporin 4; MIP.none: Z99 

433147; AF091434; Ks.43080; platelet derived growth factor C; POGF.CUB;; Z91 

417976; BES65B92; Hs.63077; inteHeuidn 18 (interferon-flamm^lndudn; none;TM=M;SS=N; 2.89 

439180; AI393742; Hs.199067; M-extM avian erythroblastic teukemta v; Furtrv«ke.pldn8se,RecepXdoma!n,Fuiin-Ii1(a,pkinase,Recep.t^domain,PepN ^59 
426158; NM_001982; Ks.199067; v^b2 avian eiythroUasScteuittniiav; Furin4ike,pkinase,Recep_Ldomain.FuTin>Iite,pkinase,Recep.l^don^ M24: 2.23 

41 1089: AA4564S4; Hs^5702: cell dhrlslon cyde MHo 1 (PtTSLRE pr. none,Rone; 2.07 
428800; M57627; Hs.193717; Inteileukin 10; ILIOn 1.10 

TABLE 40B 



Pkey: Unique Eos probeset Identifier number 
CAT number Gei^ duster number 
Accession: Genbanlt accession numbers 

Pkey CAT Number Aocesston 



456034 
459702 
432222 
414991 
409745 



BB5586_1 
539529 1 
539529_1 
1785136.1 
MH1944.5 



414936 
451876 
432639 

TABUE40C 



1782849 1 
2328579.1 
1237887.1 



AA136653 AA136556 AW4S0979 AA984358 AA809054 AW23e038 AA492073 6E1 68945 
BG207209 BE1 66299 AI204995 BG199355 AW969908 AA528756 AW440776 BI044354 
BG207209 6E1 66299 AI204995 86199355 AW969908 AA5287S6 AW440776 81044354 
D78831C17698 D76883 

BI030997 AA921874 AW188822 BIQ27862 AI347618 A1361453 A1088754 AW207491 AA077391 BG012775 BG997382 AA286B33 AA150722 BI00762S 
BI027864 61009100 BI00627S 61006270 B10310OO BI029864 BI006277 81007627 81006266 BI006991 BI006990 81007763 81007762 8G997377 
AA150780 B1033518 BI027818 BG015789 BI033607 AA341445 
C14774C17911 079033 

T63141 A1821021 BF370092 BF370127 BF370060T62998 
AW973785 H60163 AA557608 



Pkey: Unique number corresponding to an Eos probeset 

Refc Sequence source. Tlie7d!gitminl)er8ln(hiscolunm8reGenbankUenGfier(G0ffiimbers. T)unhaml.elal.'r6ferstoIhepublicaSonen1itled'TheDNA 

sequence of human chromosome 22.' Dunham L el al.* Nature (1999) 402:48949S 
Strand: ImlicstesDNA strand liPom which exons ware predtoted. 
NLposiSon: tndkatesnudeoikteposi&insol predicted exons. 
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Pkey 


Ref 


Strand 


Ntposta 


402474 


7547175 


Minus 


53526-53628.5575555920.5753^57757 


404240 


5002624 


Minus 


116132-116407.116653-116922 


405102 


6076881 


Minus 


120922-121 296 


405121 


8102330 


Minus 


35616'36004.365S7.36684 


4010B3 


3242744 


Pfus 


33192-33360 


406403 


9256305 


Minus 


151426-151660 


40KS0 


7652009 


Maws 


80413-60673 


405545 


1054740 


Phis 


118677-118807,119091-11923^121626-12182 



TABLE 41 A: 556 GENES UP-REGUIATED IN PANCREATIC TUMORS OR PAMCREATTTIS RELATIVE TO NORMAL TISSUES 

Tabl8 41A fists abou! 556 genes up^BgutetBdhpmcrealfc These genes wreedected from 59680 pnteels on the Eos/Afl^ 

Hu03 Genechip array. 

Plcey: Unique Eos probeset identifier numlier 

ExAccn: Exempli Accession numtier.GenlanIt accession nufnl)er 

UnigeneiO: Unigene numt}er 



Rl: 


90lh peroenGle Of pancreatic 


cancetAnedian of normal pancreas 


PR^ 


ExAccn 


UnigcnsID 


Unlgene Title 


412228 


AWDUJ/OO 




comptenfwnt component (3d/Epstein Ban* vi 


431462 






grardn-ti]^ neuroeRdocrine peptide precu 


444995 


A m'yio.ti 


n5. i iLSM 


secreted phosphoprotein 2. 24I(D 


453863 




Hs.572 


orosofnuooKi \ 


441031 


Al 1 1 UOM 




fii}rinogen, 6 l)eta polypepSde 


421344 


AWdjIUjU 




vimn-tilce 


416018 




Ue 7nQT7 
n3.r09/ f 


proprotein convsrtase si4)tlllsin/kexin t 


438091 




nSiAMO^o 


nudear receptor subfamily 1, group 1. m 


418888 


At in7Rpn4 
AUUrboUi 


Ue RQAIA 
nS.Ds'tJO 


cadherin 17, U cadherin (liver-intesfin 


418969 




ns^osUr 


liypothetlcd protein Fy20258 


443162 


T49951 


HS.9u29 


DKFZP434G032 protein 


423096 


AA732do4 




progestin induced protein 


*ti Of 19 




nS.rP'ISiO 


smell Inducitiie cytokine subfamily A (Cy 




AW3o9771 


r18.02DZU 


Inlfiodn tsetaS 


421044 


AF0fi1871 

nr vu lUf • 


Hs 311736 


Human DNA sequence from done RP1-238D1 5 


407788 


BE514982 


Hs.38991 


8100 calchjfTVbirKllng protein A2 


422867 


L32137 


Hs.1584 


cartilage oQgomeric mM protein (COM 


432467 


T03667 


Hs^39388 


Human DNA sequence from done RP1-304B1 4 


457059 


eE561665 


Hs,177677 


exosome component Rrp40 


451945 


BE5040S5 


Hs.211420 


ESTs 


453354 


W55946 


H3.234863 


Homo sapiens cDNA FU12082 fis. done HE 


443247 


BE614387 


Hs.333893 


c-MyctafTgelJPOl 


410132 


NM.003480 
H3a765 


Hs.300946 


MIcrofibril-assodatBd gIycop(otsin-2 


416984 


HS.8070& 


diaphorase (NADH/NADPH) (cytodirome b-5 


413835 


AI272727 


Hs.249163 


fatty add hydroxylase 


433790 


GE298215 


Hs.288968 


RAG22A. member RAS oncogene fBrrBy 


414774 


X02419 


Hs.77274 


plasminogen activator, uroklnsse 


410639 


BE269047 


H5.65234 


bypothetica! protein FU20S96 


410541 


AA065003 


Hs.64179 


syntentn-2 protein 


427722 


AK000123 


Hs.180479 


liypothetlcal protein FU20116 


429612 


AF082649 


Hs^52567 


pituitaiy tumor-transfomiing 1 


407604 


AW191962 


Hs.249239 


coOagen, typeVlll.alpha2 


431193 


AW749505 


Hs.296770 


KIAA1719 protein 


442080 


AW444761 


Hs.44565 


ESTs 


427670 


BE612888 


H3.180224 


myosb) regulatory light chain 


448921 


AB012113 


Hs.16530 


small Indudbte cytoldne suMamity A (Cy 


419551 


AW582256 


H5.91011 


anIeHor gradient 2 p(enepus laevls) horn 


441633 


AW958544 


Hs.112242 


normal mucosa of esophagus spedtic 1 


407792 


A1077715 


Ks.39384 


putative secreted ligand homdogous to f 


419216 


AU076718 


Ks.164021 


small indudble cytoldne subfaniy B (Cy 


416913 


AW934714 




gb:RC1-OTO001-O31299411-a11 DT0001 Homo 


418384 


AW14g266 


Ks.25130 


Homo sapiens cDNA FU14923 fis, done PL 


452355 


N54926 


H3,29202 


G protein-coupled receptor 34 


419481 


AI879195 


Hs.90606 


15 kOa selenoprotein 


407230 


AA157857 


H5.182265 


ker8lin19 


418526 


BE019020 


Hs.85838 


solute canter family 16 (monocarboxyllc 


427585 


031152 


Hs.179729 


coOagen. type X, alpha 1 (SdmuJ metaph 


411498 


NM_014210 


Hs.70499 


ecotropic viral integration sits 2A 


445517 


AF208855 


Hs.12830 


hypothetic^ protein 


446619 


AU076643 


HsJ13 


secreted phosphoprotein 1 (osteoponlln. 


428385 


AF112213 


Hs.184062 


putative RabS-lnteracting protein 


448863 


BE614S99 


Hs.106823 


hypothetical protein MGa4797 


406867 


AA157857 


H8.182265 


keraOnlS 


417426 


Nll^002291 


Hs.82124 


lanrfdn, beta 1 



— 

7.25 

1.64 

3.58 

114.18 

922.40 

Z19 

61.10 

607.40 

228.20 

4.97 

38.01 



11.06 

116.90 

21.52 

8.74 

3,11 

307.70 

33.60 

7.31 

133.70 

349.10 

33aoo 

3.78 

3.53 

73.90 

3.39 

1.72 

10.29 

6.79 

4.62 

386.30 

6.99 

118.00 

2.73 

647.30 

738.90 

68.43 

103 

7.73 

227.30 

115.60 

19120 

119.90 

12.11 

6.63 

592.10 

120.40 

117.40 

4.25 

3.12 

135.20 

11.32 

408.20 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



406366 


APD?fiRQ9 


Ue ifvc70n 


401201 






420767 


AF0727tl 

Uf £,9 1 1 


UcQQQIR 


405556 


Y09306 




442776 






405555 






403207 






427656 




Hs.21 


426004 




ns. 1 £H 1 ^ 


401541 


NA 




429793 


AI417638 


no. 1 1 WIO 


423068 


M25629 




433110 


D56494 




425988 


6E04S897 




416768 




Lie 


412470 


M93^ 


Uq 70070 
ns.f 07Zt} 


431969 


AA366217 




419219 


AW583139 


u. 00747 


412686 


AWtft.'^nfi? 




427811 


nfto im/ 


nS.loUoo4 


420937 


AWQRR71Q 


Me i'iAn 


416068 


AUUQ71ICC 




410839 




Ue RRCQI 


437986 


AA774R7*; 




415934 


MM ryvtqoo 


Ue OQO 


427965 




Ue lAIOfiO 


406399 






426230 


AA3B701Q 

/vva/i via 




414081 




Hs.300280 


421243 




ns. 1 v&Of D 


419263 


AV\583874 


Ue Ramo 


424208 




Mc lA.'tllt 


408983 


NM 000499 


Ue RM 


436217 


T53925 


Ue 1A7 

ns.ivr 


435975 


AL11B990 


Hs.41997 


431330 


X69532 


He 9777 


414463 


T69078 


He 7R177 


415003 


M11437 


Ue 777^4 


422281 


M36603 




414910 


X12662 


ns.«0*nl9r 


417296 


L36196 


ManiftfiA 

r».oicx)*t 


400836 


lr\lY von 1 f 




452983 


L32140 


Ue C^l 


419768 


T72104 




413841 


M34276 


Ue 7CC7C 


400560 


1f\H 1 f 




419502 




Ue Qn7CC 


425746 


NM 001701 


Ue 1 KQAiin 


426205 


063521 


Ue 167077 


414590 






443614 


AVW3J00 


Ue 7ftilC 


429023 


MM nofni? 


Ue O^CI 


428311 


MM ray%<;i 


ns.l 000/1 


425260 




Hs.1870 


443316 




Ue iAAA'i 


413318 




Ue 7K9fii: 


413829 


MM nnifr79 


Hs.75572 


421126 


M74587 


Ue iMi'n 


407731 


MM onnof^ 


Ue 9Qnco 


413585 


All 33452 


Ma JKMi 

ns.r3<ui 


452624 


AIJ07gfi0fi 


Ue 9nnu 


416402 


NM 000711; 


nS.lUl4 


425573 


AROOfidW 




421905 


A1660247 


Me 49fiQO 


406672 


MTfiOdl 


n8.1 98253 


431369 


BE1 84455 


U* 9C17Ril 


421712 




Ue imiwi 
ns.iv/ 109 


417233 


VV25005 


Ue OAttili 


442896 


K^779i; 

rvJf l£D 


Un 1C14AO 

ns.2onuB 






Ks.43047 


428486 


AWWMQ? 


Urn tOJICIkd 


457489 


AI693815 


Hs.127179 


404886 


NA 




432874 


W94322 


Hs.279651 


445891 


AW391342 


Hs.199460 


404682 


NA 




429547 


AWD09166 


Hs.99376 


441085 


AW1365S1 


Hs.181245 


422397 


AJ223366 


Hs.116051 


446868 


AV660737 


Hs.135100 


404287 


NA 




443267 


AW450630 


Hs.133851 


451635 


AA018899 


Hs.127179 



secreted ftizzJed^elaled protein 4 a62 
Target Exon o.75 
carboxyl estsr Opase (bfle salktitnula a63 
haneodomam4nteracOng lutein kinase 3 0.75 
chymotiypstn C (caldecrin) a67 
liomeodoma'MnteracSng pratain kinase 3 0.83 
C2000960:glI131432|sp|P23132|UTH.BOViN 0.80 
e!astase 1, pancraaSc (X98 
ESTs, Moderately stmBar to SYNL RAT SYN a88 
Target Exon o^gi 
estrogen reguiatsd gene 1 a85 
katEkrein 1. tenal/iiancreas/^alivary 0l81 
rat regenerating islei-derfved^, hum 0.72 
ESTs, Weakly similar to 138022 hypolheS ags 
regenerating IsleWerlved 1 alpha (pane 0.87 
pancreatic lipase^ated protein 1 0.89 
cart)oxypep8dasBA1(pancfBatic) 0.97 
cartxjxypeptMase A2 (pancreatic) 0.95 

ctiymotiypstnogen B1 0.95 
catoypeptUase B1 (Sssue) t07 
coOpase, pancresfic 0 99 

ESTs, WeaWy similar to ISHUSS protein d l!o2 
protein disulfide tsomerase i,00 
testis expressed sequence 11 102 
phosphdipase A2, group IB (pancreas) 106 
eiastase 3, pancreatic (protease E) 1.22 
NM_0031 22*:Homo sapiens serine protease 1 .06 

protease, serine. 1 (trypsin 1) 1 !i 1 

amylase, alpha 2A; pancreatic 1.22 
pancreatic Dpase 1.13 
InsuTm \['\2 
pancreatic I^ase^lated protein 2 113 
cystk: Gtxosis transmembrane conductanc 1.32 
fitnlnogen-Qke 1 1.72 

alpha-1-8 gjycoprolein 1.50 

intsr-atpha (gbbulin) Inhibitor. HI pol ZQ2 

atpha-1-microgtobuilnA}IkuninpfecurBor 1.82 

kininogen 3,03 

hemopfiXin 2.14 

aiginase, liver 97,90 

sulibtransterase family. cytosolle.2A. 23a70 

Target Exon 2.47 
afamin 

apdipoproteinA-l 4,37 

plasminogen 374.00 

NM.030878':Homo sapiens cytochrome P450, 144!50 

fibilnogen, A alpha poiypepIUe 265.50 

bite add Coenzyme A: amino acid N-acylt 77.80 

leukocyte ceNerwed chemotaxln 2 1 69.80 

coagulatton factor il (thrombin) 3.6O 

fibrinogen, B bete polypepQde 400.40 

protein C (inacthfator of coagulaOon fa 4.72 

tryptophan 2,3dioxygenase 5.26 

phenylalanine hydrm^e 73.79 

aldetvde dehy±ogenase 8 family, nnember 1 82.20 

tnter4^a (gtobulin) inhibitor. H2 pol 335.00 

carboxypeplMase B2 (plasmd) 1 73.40 

Insutin-like growth factor binding proto 565.30 

complement component 8. bete polypeptkle 88.20 

fibrinogen, gemma polypeptlda 477.20 

coagulation factor V (proareelerin. labi 201 M 

complement component 4'binding protein, 426.10 

serine (or cysteine) proteinase Inhlblto 1 .10 

ESTs. Waakly similar to UV-1 protein (H 0.62 

major histooompatibinty complex, class 4.02 

secretory leukocyte protease Inhibitor ( 5.34 

hypothetical protein 5.62 

small inducible cytokine subfamiiy B (Cy a85 

ESTs 157.70 

Homo sapiens cDNA FU13S85 lis. ctone PL 137.70 

pancreatic poiypeptkte 2.59 

ciyptic gene 3,23 

ENSP000002S1112*:Sodlun^tesshjni-trBnsp Z64 

melanoma Inhibitory activity 2.43 

ESTs 70.38 

C9001 188«:gll12738842|rel|NPJ73725.1| p 1,38 

ESTs 6.85 

Homo saidenscONAaJ12532 lis. ctone NT &21 

Homos8pienseDNA:FU22495fis,doneH 1.74 

ESTs 102.10 

C6001909:gq704441|(n]j|BAA18809.1|(D298 242.70 

ESTs 98.90 
crypifcgene 
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417801 M417383 Hs.82582 Integrtn, bela^ 1 (with EGF-like rep 131.70 

414142 AW368397 Hs.150O42 Homo 8a;^enscDNAFU14438fis, done HE 128.70 

42S921 NM_007231 Hs.1 62211 solute carrier famHy 6 (neiirotmnsmiUa 92.90 

410309 BE043077 Hs.278153 ESTs lOaSO 

5 425842 A1587490 Hs.159623 NK.2 (Drosophiia) homolog B 170.10 

431938 AA938471 H5.54431 specific granule protein (28 kDa):cy5lB 75,70 

449592 A1555494 Hs.195718 ESTs 4.58 

414259 W44633 Hs,301296 Homo sapiens cDNA: FU23131 fis, cbne I 188.50 

^ 406685 M18728 gb:HumannonspeciiiccrossrBacling8n6g 1123.60 

10 411573 AB029000 Hs,70823 KIAA1077 protein 995.60 

429201 XD317B Hs.1 98246 groufKpecHiccofnpcnentlyitaiFteDbind 1U2 

416318 U47732 Hs.84072 transmembrane 4 su|)erfaiTiQy member 3 8.38 

428698 AA652773 Hs.334838 K1AA1866 protein 562.00 

444754 T83911 Hs.11881 transmembrane 4 superMlymeiiiber 4 4.00 

IS 432596 AJ224741 Hs.278461 matrilln3 283.50 

428824 W23624 Hs.173059 ESTs 4.55 

444008 BE395085 Hs.10086 type I transmembrane protein Fn14 3.01 

424971 AA479005 Hs.154036 tumor suppressing suUransferdble candid 4.21 

418394 AF132Bie H3.84728 KnjppeMike factor 5 (Inlesfind) 4.80 

20 448844 AI581519 Hs.177164 ESTs 362.80 

420908 ALJD49974 Hs.100281 Homo sapiens mRNA:cDNADKFZp564B222(fr 133.90 

423685 6E350494 Hs.49753 uveal airioantlgenw^ coiled CQfldomd 128.20 

428392 H10233 Hs.2265 secretoiy gianule. neuroendocrine piotei 13.83 

429597 Nll^003816 Hs.2442 adisfntegrinandmetalloproiBinasedoma 316.00 

25 452571 W31518 Hs.34665 ESTs 245.50 

443646 A1085198 Hs.164226 ESTs 189.40 

438032 AA150797 Hs.109276 lalexin protein 291.10 

448030 N30714 Hs.325960 membfane-8|>annIng4^Jom8ins. subfamily A 252.20 

422109 S73265 Hs.l473 gastrtfweleasing peptide 278.20 

30 430407 H23551 Hs.30974 ESTs 6.20 

419235 AW470411 Hs.268433 neuretrimin 423.50 

449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose indue 4.01 

444301 AK000138 Hs.10760 asporin(LRRctass 1) 499.90 

427333 AR)87797 Hs,1765S8 aquaporin8 1.05 

35 417931 W95642 Hs.82951 trefoil factor 3 (intestinal) 4.33 

407777 AA161071 Hs.71465 squalene epaddase 3.64 

435652 N32388 Hs.334370 uncharaderized tiypothalamus protein HBE 1.47 

421341 AJ243212 Hs.279611 deleted in malignant brain tumors 1 3.98 

453935 AI633770 Hs.42572 ESTs a08 

40 431629 AU077025 Hs.265827 interferon, alpha-inducible protein (cio ^84 

439737 AI751438 Hs.41271 Homo sapiens mlWAfuH length Insert cDN 14.21 

426227 U67058 Hs.168102 Human proteinase acQvated receptor-2 mR 315.70 

413554 AA31914& H8.75426 secretogranin II (chromogranln C) 8.53 

412104 AW205197 H8.240951 Homo sapiens, Similar to RIKENcDNA 2210 3.13 

45 410310 J02931 H5.62192 coagdaflon factor HI (thromboplasfin, 9.33 

440484 BE328156 Hs.150356 ESTs 1.03 

447395 AI418412 Hs.184793 Homo sapiens cDNA:aJ21880iis. done H 1.09 

440099 AL080058 Ks.6909 DKFZP5S4G202 protein 14.74 

_^ 434665 AA642125 gb:ni60c01. si NCLCGAPjLyro3 Homo sapiens 0.98 

50 452194 At694413 Hs.332649 oKadory receptor, family 2. subfamily ^23 

408915 NM_016651 Hs.48950 heptaceltuiar carcinoma novel oenfr3 pro 329.40 

424411 NA^005209 Hs.146549 ciystairin.bet8A2 1.71 

426575 M74826 HS.17080B glutamatedecart)oxytase2(pancrea9cis ZG9 

445417 AK001058 Hs.12680 Homo sapiens CONAFU10196 lis. done HE 1.70 

55 426322 J05068 Hs.2012 transcobalamin I (vitamin B12 binding pr 3.19 

429010 Y18198 Hs.1 94725 one cut domain, family manber 2 1.98 

414420 AA043424 Hs.76095 tmmedi^ earty response 3 2.54 

422565 BE259035 Hs.1 18400 singed (OrosophilaHito (sea ufriiln to aSO 

414004 AA737033 Hs.7155 ESTs. Moderately similar to 21 15357ATYK 312.80 

60 441350 AB020690 Hs.7782 paraneoplaslfc antigen MA2 177.80 

406173 #{NOC>VT) ENSP00000250148*:Growth hormone variant 1.46 

403776 #(NOCAT) ENSP00000226542*:SmaIlindudblecytddn 121.80 

403574 MA Target Exon 16,12 

428832 AA578229 Hs.324239 ESTs. Moderately simflar to 2N91.HUMANZ 3.94 

65 458449 Ha4482 Hs.29019 ESTs 71.60 

409958 NM_001523 Hs.57697 hyaluronan synthase 1 1.77 

437100 Ai761073 Ks.14535 Homo sapiens cONA: FU22314 lis. done H 3.13 

451181 AI796330 H3.207461 ESTs 68.00 

440508 BE267911 Hs.198970 ESTs 38.00 

70 429636 AA455692 Hs.163232 ESTs 30.70 

419570 W68738 gb2d37g06.s1 Soares_fBtal_hearmbHH19W 1.02 

431779 AW971178 Hs.268571 apd^proteinOI 3.36 

431723 AW058350 Hs.16762 Homo sapiens mRI^A; cDNA DKFZp564B2062 (f 10.20 

428330 L22524 Hs.2256 malrte metalioproteinase 7 (MMP7; uterln 3.94 

75 423961 013866 H5.136348 osteoblast spedScfador 2 (fesddln 1171.10 

414359 M62ig4 Hs.75929 cadherin 11.type2.0B<adherin(osteob 809.50 

440482 AA886658 Hs.50873 ESTs 9.95 

414602 AW630088 Hs.76550 Homo sapiens mRIM; cDNA 010^56461264 {f 30.70 

423401 NM-001992 Hs.128087 coagulation fador II (thrombin) receplo 82.90 

80 452239 AW379378 H3.170121 protein tyrosine phosphatase, receptor t 26.01 

433364 AI075407 Ks.296083 ESTs. Moderately sbnDar to 154374 gene 5.38 

409335 NhiL001502 Hs.53985 gjlycoprotein 2 (zymogen granule membrane 0.54 

420876 AA91842S Hs.177744 ESTs 0.89 
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430154 AW563058 Hs.234726 seiine (or cysteine) proteinase Intnbilo 034 

401732 «{N0CA1) NM.00 11 Te^-ilomo sapiens Rl» GOP dlssoda 1.13 

404142 NA TaigetExon 1.33 

424165 AW5B2904 H5.1422S5 IstelamylcridpQlypepSde 195 

S 413880 AI650842 Hs.1109t5 inteifeuldn 22 receptor 1.34 

407007 U22961 gbtHumanmRNAdonewiihsimnaritirtoL 1^ 

426300 U15979 Hs.169226 deiMce tiomoiog (Drosopi^) 1.48 

432855 AF017968 Hs.279565 secreted fiizzied-related protein 5 1.28 

424503 NM.002205 Hs.149609 integrin, alpha 5 (fibronedin receptor. U1 

10 445730 AI824342 H8.170042 ESTs Z14 

406666 V00495 Hs.184411 albumin 195 

435849 BE305242 H5.16098 claudln2 1.96 

426784 U03749 Hs.172216 dvomogranin A (paraUiyroid secretory pr 2.49 

430272 X04898 Hs.237658 apolipoprotein A-tl 3.29 

IS 412374 X01388 H5.73849 apolipoprotein C>llt 2.42 

419276 BE165909 Hs.306881 MSTP043 protein 83.40 

415448 T68645 Hs.952 sotule carrier famfly 10 (sodiumABeac 3.52 

423541 AA295922 Hs.129778 gastrolnlestlnal pepWe 116 

^_ 428355 BE255452 Hs.2257 vilroneciin (serum spreading factor, som R24 

20 425551 AA359252 HS.1264B5 tiypoliielicai protein FU 12604; WAAI 692 14.67 

465530 AV655701 H8.75183 cytociirome P450. subfamily HE (etlianoi- 4.30 

428786 Y16577 Hs.2314 mannose-Wnding lectin (protein C) 2, so 92.10 

420726 K02402 Hs.1330 coagulation factor IX (plasma tluombopla 203.30 

451253 H48299 Hs.26126 daudlnIO U7 

25 420923 AF097021 H3.273321 differentially exprossed in tumatopoieO 3.38 

413881 1.00190 Hs.75599 serine (or cystine) proteinase iniilbito 7.04 

431930 AB035301 Hsi!72211 caditerf n 7, type 2 5.84 

421379 Y15221 Hs.103982 smaH Inducible cytokine subfamily D(Qf Z65 

419354 M62839 Hs,1252 apolipoprotein H (beta-2-gIycoproleln I) 9.28 

30 422237 M13149 Hs.1498 hlstidine^icti glycoprotein 34.26 

437145 AR107218 Hs.5462 solute canter family 4, sodium Ueartxm 1.92 

414386 X00442 Hs.75990 haptogtobin 8.32 

425247 NVL005940 Hs.155324 matrix metailoprotelnase 11 (MMPIIisbo 1.74 

452589 F33888 Hs.284176 transferrin 6.51 

35 436624 T64297 Hs.5241 fatty add landing protein 1. tiver 35.08 

409187 AF154830 HS.S0965 cartaamoyl-pliosptiata synthetase l.mitoch 17a30 

428874 W32133 Hs.194388 transtliyretin (prealbumin, amytddosis t 134 

405849 #(NOCAT) Target Exon 103.10 

405281 #(NOCAT) NK4-002884:Homo sapiens pregnancy-zonepr 31.20 

40 419078 M93119 Hs.89584 Insullnoma-assodaled 1 6.28 

422095 A1668872 Hs^62804 hypothetical protein FU22704 189 

425834 NM_001639 Hs.1957 amyloid P component, serum 3.80 

452304 AA025386 Hs.61311 ESTs, Weakly slfflHar to S 10590 cysldne 1.82 

407244 M10014 Hs.75431 fibrinogen, gamma pdypepQde 13.15 

45 450400 AI694722 Hs.279744 ESTs 5.22 

413916 1449813 Hs.75615 apolipoprotein C-ll 8.60 

444632 A1184027 Hs.146986 ESTs. WeaWy sWlar to FATH_HUMAN CADHE 71.30 

415905 AI751357 Hs.286741 Homo salens cDNA: FiJ22256fis, done H 1.70 

410197 NIA,005518 Hs.59889 3^ydroxy.3^nethylglutaiy«::oenzyn» A sy 4.65 

50 417366 BE185289 Hs.1076 smalt proTineM protein IB (comitin) 3.01 

436961 AW375974 Hs.156704 ESTs 164.60 

446319 AW207590 Hs.l60711 ESTs 1.88 

427899 AA829286 Hs.332053 serum amytold A1 6.98 

419092 J05581 Hs.89603 mudn 1. transmembrane 2.12 

55 421515 Y11339 Hs.1053S2 GalNAc atpha-2. e^iaiyttransferase 1. 1 13120 

452340 UlAJXZm HS.SOS ISL1 transcription fador, UM/homeodoma 6.23 

406319 NA CX000780:gi|6679197|reflNP_032800.1|pd 51.50 

404286 NA C6001909:gi|704441)dbj|BAA1 8909.11(0298 1.75 

419183 U60669 Hs.89863 cytodiromeP450.8UbfMyX)CtV (vitamin 52.90 

60 406293 NA Target Exon 68.30 

431912 AI660552 Hs.154903 ESTs, Weakly simaar to A561 54 AN SUbst 102.43 

409327 L41162 Hs.53563 coDagen. type IX. alpha 3 206.30 

425200 BE255203 Hs.155101 ATP synthase. HtiBnsporflng.mitachondr 5.76 

418738 AW388633 Hs.6682 8dutecanierfamily7,(cafionlcairino 200.10 

65 416661 AA634543 Hs.79440 IGF-II mRNA4*iding protein 3 97.70 

434599 AA543687 H5.149425 Homo sapiens cDMAFU 11 980 fis. done HE 3.96 

429921 AA526911 HsJ2772 collagen, type XI. alpha 1 30.00 

428758 AA433g88 Hs.98502 hypOtheOcd protein FU14303 4.65 

446998 N99013 Hs.167e2 Homo8apten8mRNA;cOf^AOKF2p564B2062(f 193.80 

70 418478 U38945 Hs.1174 cycHnKfependent kinase inhibitor 2A (me 3.05 

420001 J0S064 Hs.1282 complement component 6 159.00 

449038 AL133084 H3.22908 Homo5ainensmIW;cONA0KFZp434J1027(( 39.10 

423184 NM-004428 Hs.1624 ephriiMI 139 

428505 AU)35461 H3.2281 chromogranln B (secrelogranin 1) 327.90 

75 444783 AK001468 H3.62180 anillin (Drosophila Scraps honidog), ad 90.50 

445593 AW203963 Hs.150896 ESTs 49.20 

450701 H39960 Hs.288467 Homo sapiens cDNAFU12280 Its, done MA 3.12 

424420 BE614743 Hs.146588 prostaglandin E synthase 1.93 

408660 AAS25775 Hs.29^ ESTs. Moderately similar to PC4259feni 39.90 

80 417940 1^8205 Hs.24230 ESTs 57.20 

434206 AW136973 Hs.288516 ESTs. Weakly sbnOar to S69890 mitogen I 151 

439920 H05430 Hs.288433 neurotrlmin 1.91 

432542 AW083920 Hs.16098 daudin? 3.47 
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410418 D31382 Hs.63325 

415989 AI267700 Hs.317584 

414987 AAS24394 Hs.294022 
400024 

5 418067 AI127956 Ks.83393 

424687 J05070 Hs.151738 
405909 NA 

448811 AI590371 Hs.174759 

430044 AA464510 Hs.152812 

10 406690 M29540 Hs.220529 

418245 AA088767 HS.B3B83 
423733 AA330281 

450154 R15B91 Hs.281587 

422487 AJ010901 Hs.198267 

15 424902 Nh4.003666 Hs.153687 

422330 D30783 H$.115263 

435749 AAS84890 Hs.5302 

423634 AW959908 Hs.1690 

430691 C14187 Hs.103538 
20 401682 NA 

422440 NM.004812 Hs.116724 

431441 U81961 Hs.2794 

442560 AA365042 Hs.228598 

414812 X72755 Hs.77387 

25 425211 M18667 Hs.1867 

421430 AW207555 Hs.97093 

419893 AA133749 Hs.301350 

409420 Z15O08 Hs.54451 
448437 AW470125 

30 406671 AA129547 Hs^85754 

411556 AA102670 Hs.70725 

424586 NM_003401 H5.150930 

428227 AA321649 Hs,2248 

438746 AI885815 Hs.184727 

35 456032 AW%7446 Hs.301711 

431808 M30703 Hs.270833 

423472 AF041260 Hs.129057 

439759 AL359055 Hs.67709 

442295 AIB27248 Hs^4398 

40 428928 BE409838 Hs.194657 

437157 BE048860 Hs.120655 
404285 NA 

424036 AA770688 H35B777 

422026 U80736 Hs.l10826 

45 428664 AKD01666 H8.189095 

437935 AW939591 Hs.5940 

423575 C18863 Hs.163443 

422956 BE545072 Hs.122579 

406722 H27498 Hs:293441 

SO 413278 BE563085 Ks.833 

439750 AL359053 Hs.57664 

434377 AW137148 Hs.306593 

425428 AL110261 Hs.157211 

421298 AW172431 Hs.13012 

55 422424 AI186431 H8.296638 

421582 AI910275 Hs.1406 
401480 NA 

409289 AA576953 Hs;i2972 
409757 NM»001898 H8.123114 

60 449722 BE280074 Hs^3g60 

452240 A)591147 Hs,61232 

415165 AW887604 Ks.78065 

423673 BE003054 Hs.1695 
428450 NM_014791 Hs.184339 

65 409041 A6033025 Hs.50081 

453331 A]240665 Hs.8895 

400288 X06256 Ks.149609 

453160 AI253307 Hs.23g884 

444015 A1472865 Hs.135534 

70 421308 AA687322 H5.192843 

448045 AJ297436 Hs.20168 

422426 W79117 Hs.58559 

450737 AW007152 Hs.203330 

429504 X99133 Hs.204238 

75 456553 AA721325 Hs.189056 

413281 AA86127I Hs.222024 
417866 AW067903 Hs.82772 
431630 NWL002204 Hs.265829 

400298 AA032279 Hs.61635 

80 431753 X76QS9 Hs^l 

428651 AF196478 K5.188401 
424905 NM.002497 Hs.153704 

433132 AB026264 Hs.284245 



tisnsmembrsne protease, serine 4 tSZ 

ESTs 182.50 

hypQlheQcalpnM)FU14950 2.84 

AFFXcontrol-HUMRG£M1009a_5 4.62 

cyslaBn E/M 4.19 

malnx metanoproteinase 9 (getaOnase B 2.92 

Target Exon 71.80 

ESTs 6.74 

ESTs «-91 

cardnoembryonlc antigen-related oeS ad 757.80 

transmembrane, prostate androgen induced 1.65 

gb£ST33985 Embryo, 1 2 week II Homo sapl 104.70 

Human (done CTGiA4) mRNA sequence 143.00 

mucin 4, tracheol)ronchlaI 311 JO 

inositol poIyphosphate-4-(^iosphatase, ly 63.00 

epiieguTm 141.70 

lectin. galactoside4)indin9.solut]le. 4 Z59 

heparuvbinding growth factor binding pr 181.90 

ESTs 95J0 

Target Exon 6.17 

aldatotoreductasetaniOyI, member 810 318.60 

sodium dtannel, nonvoltage^ated 1 alpha Z51 

ESTs, Weakly similar to 2004399A chromos 3.90 

monokine Induced by gamma interferon 434.60 

progastrksin (pepsinogen C) 6.58 

Homo sapiens cDNA: FU23004 lis, done L 35.10 

FXYDdomain^tainlng ton transport reg 2.45 

lamlnin. gamma 2 (nfeein (lOOkD). kalini 8.56 

gb:xw60c04 j{1 NCI_CGAP^Pan1 Homo sapiens 79.80 

met proto-oncogene (hepatocyte growth fa 147.30 

gamma-aminobutyric acid (GABA) A reoepto 3a07 

X-ray repair complementing defective lep 55.10 

smaU inducible cytokine subfamSy B (a 405.20 

ESTs 3.57 

ESTs 136,80 

amphiregu){n(schw8nnoma^erived growth 36.10 

breast carcinoma amplified sequence 1 9.93 

Homo sapiens mRNA full length Insert cDH 146.40 

Homo S^ens cDNA RJ1 1 469 lis, clone HE 20.60 

cadh8rin1,type1,E-cadherin(epithen 1.58 

ESTs 91.80 

C6001909:gi|704441|dt^|8AA18909.1|(D298 123.80 

H2A histone family, member L 5.26 

trinucleotide repeat containing 9 130.40 

Similar to SAUI (sd (Drosophila)-iike 48.80 

mucin 1 3, epithelial transmembrane 3. 1 5 

Homo sapienscDNA FU 11576 fis, ctone HE 253.20 

hypothetical protein FU10461 80.00 

Homo sapiens SNC73 protein (SNC73) mRI^ 3.05 

intsrferorv<stimutated protein, 15 kOa 1-66 

Homo sapiens mRNA fufi length insert cON 23.02 

Homo sapiens cDNA FU11382 fis. ctone HE 78.10 

OKFZPS8680621 pfotein 174 

ESTs 133.10 

prostate differentiatkm factor 2.65 

tiefon factor 1(pS9 5.17 

Target Exon 73.70 

hypothetical protehtFU13352 137.70 

cystatinSN 9.36 

cydinBI 162,70 

ESTs 151.90 

complement component 7 2.85 

matrix metaitoproteinase 12 (macrophage 290.30 

KIAA0175 gene product 6.89 

iaAA1199 protein 334.10 

ESTs 12.85 

integrln. alpha 5 (fibronectin receptor, 12.42 

H2B histone famity. member L 15&40 

ESTs 14.60 

leucine^ protein FK8614 87.20 

prostate stem cell antigen 526.20 

ESTs 58.30 

ESTs 281.00 

fipocaHn 2 (oncogene 24p3} 31 .25 

ESTs. Highly similar to SimOar to 8 Ce 78.00 

transcriptton factor 6iy^2 21Z10 

coOagen.typeXI,8ipha1 3.40 

integrta a^ha 3 (antigen CD49C, alpha 3-48 

six transmembrane epithelial antigen of 281.50 

neuromecfinU 6030 

annextnAlO 50a30 

NlMA(neverinndtosisgeneaHelatsdk 85.80 

hypotheOcal protein IMPACT 55.30 
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435039 AWW3921 Hs.130526 ESTs 64.00 

447033 A1357412 Hs.i57601 ESTs 123.20 

433578 BE336886 Hs.3416 adipose differenOaUoiHQlated proteb^ 9.22 

42S11 AU076442 Hs.1 17938 coOagen. type XVII. alpha 1 52170 

5 411274 NM.002776 H5.69423 kBlIito^10{KLK10){PRSSL1)(r»es1) 4438 

452705 H49805 H5,246005 ESTs 12ai0 

428479 Y00272 Hs.164572 ceQ dMston cyde 2, G1 to S and G2 to 92^ 

425397 J0408S Hs.156346 topoisomerase (DMA) II alpha (ITOkD) 29^ 

^ 422562 AI962060 Hs.U8397 AE-binding protein 1 3.84 

10 428579 NM_005756 Hs.184942 G proleln-coupled receptor 64 27^ 

428242 K5570g Ks.2250 teukerraa mhttjilOTy (actor (cholinergic 4.09 

440868 R79707 Hs.263339 ESTs. Moderately shrnlar to 138022 hypot 76.30 

421493 BE300341 Hs.104925 edodeimaMieural cortex (with BTWke 2.37 

410199 AW377424 Hs.205126 Homo sapiens cDNA: nJ22667 lis, done H 3.44 

15 426320 W47595 K5,16930Q Iransfarminfl growth factor, Iwla 2 138.10 

419290 AI128114 Hs.112885 spinal corMertved growth factor-B 3.45 

459309 AA040620 Ks.5672 hypothetical protein AF 140225 127.80 

415138 C1835& Hs.295944 tissue factor pathway inhibitof 2 381.20 

422553 A1697720 Hs.171455 ESTs, Weakly slmDar to T31613hypolhe8 136.60 

20 432375 BE536069 Hs.2962 S100 cateiUTn-t)lnding prot^ P 6.87 

400634 tt(NOCAT) C22000015:gi|1274l327|ief|XP.008833.2iz 89.00 

428970 BE276891 Hs.194691 reOnoicacidMuced3 4.78 

423739 AA398155 Hs.97600 ESTs 135.60 

450375 AA00g847 Hs.8850 adtshtegrtnandmetaltoproletnasedoma 148.50 

25 423554 M90516 Hs.l674 glutaintne-fhictose^phosphatetransacnin 87.70 

407001 U12471 Hs:i47954 Human thiorntospomlin-1 gene, partial ods 76.80 

419261 X07676 Hs.89791 wirtgless-typeMMW Integration sHef am! 110.60 

419948 AB041035 Hs.93847 NM.01B931:Homo sapiens NADPHoxWase 4 ( 234.60 

428471 X57348 Hs.184510 stratifin 3.72 

30 427051 BE1781tO H8.173374 HomosapienscDIMARJIOSOO lis, clone NT 437.90 

425234 AW1S2225 Hs.165909 ESTs, Weaklysimliar to 138022 hypothetl 219.20 

419842 AA76S489 Ks.104350 ESTs 3.80 

418007 N413509 Hs.83169 matrix melaltoproteinase 1 (Mf«'l;lnlflrs 606.80 

444207 A1565004 H5.79572 cathepsin D (lysosomal aspaityl protease 2.62 

35 442432 BE093S89 Hs.38178 hypothetical protein FU23468 258.70 

413753 U17760 Hs.75517 lamWn.l)eta3(niceln(12SkD),laIinln 304.80 

441384 AA447849 Hs.268660 Homo sapiens cDNA: FU221 62 lis. chifteH 8.78 

418327 U70370 Hs.84136 palred-likehofneodoinalntransatpflonfa 10.95 

452401 Nl\4_007115 Hs.29352 tumor necrosis factor. dpha*iduced pro 199.70 

40 438199 AW016531 Hs.122147 ESTs 67.70 

422420 U03398 Hs.1524 tumor necrosis factor (ligand) superfami 107.20 

431183 Nl^006855 Hs.250696 KDEL (Ly^AspOu-t^u) endoplasmic relic 3.59 

417389 BE260964 Hs.82045 mh]kine(neuritoO(mrthH)romo6iigfact^ 3.45 

421937 AI878857 Hs.109706 hemalotoglcd and neuiptoglcal expressed 3.17 

45 427961 AW293165 Hs.143134 ESTs 109.30 

422043 AL133649 H5.1 10953 retinolc add Induced 1 2.98 

426711 AA383471 Hs.180669 conserved gene amplified in osteosarcoma 276.50 

450983 AA305384 Hs.25740 ER01 [S. ceievisiaeH>ka 5.28 

410268 AA316181 Hs.61635 six Iransmemlvana epithelial antigen of 27.85 

50 433001 AF217513 Hs.279905 done HQ0310 PRO0310pl 342.30 

424088 A1351010 Hs.102267 lysyloxWase 21150 

432731 R31178 H3.287820 fibronectinl 185.10 

416209 AA236776 H8.7g07B MA02(fiiItoflc8nes(defldenlyeasl.h 106.10 

414085 AA114016 H8.75746 aldehyde dehydrogenase 1 family, member 227 

55 417308 H60720 Hs.81892 K1AA01 01 gene product 405.30 

438146 Z36642 H5.57548 ESTs 8.38 

424800 AU)35568 Hs.153203 MyoD family inhibitor 172.10 

416143 A1S566S0 Hs.79033 ghitamifiyH)ep0decy€)olrBnsferB5e(9lu 45.70 

408380 AF123050 Hs.44532 dfuNqulfin 11.18 

60 412140 AA219691 Hs.73625 RAB6lnterading,kinesln^(rabkines 149.10 

422953 M79141 Hs.13234 ESTs 33.60 

409956 AW103364 H3.727 inhib}n,betaA(acthdn^actMnABa 6.73 

407756 AA1 16021 H5.38260 ublquifinspedfk: protease 18 a85 

424897 D63216 Hs.153584 Hzzled^elated protein 31Z40 

65 421110 AJ250717 H3.1355 calhepslnE 790.80 

411789 AF245505 Hs.72157 DKFZP5641 1922 protein 3.17 

421485 AA243499 Hs.l04800 hypothetical protein FU10134 8.52 

409632 W74001 Ks.55279 serine (or cysteine) proteinase inhlbito SSaOO 

^ 406837 R70292 Hs.l56110 immunoglobulin kappa constant 4.36 

70 426440 BE382756 HS.169S02 solute canferfamUy 2 (fadOtatedghi 2.83 

421470 R27495 Hs.1378 annexloAS 24Z90 

407242 M18728 gb*J4umannonspedficcrossrBadlngantlg 36.91 

432101 AI918950 Hs.l23842 EphA3 221.60 

406687 11431126 H5.272620 matrbcmetaHoproteinasell (MMPll;8tro 5.34 

75 429170 NM.001394 Hs.2359 dual spedtfclty phosphatase 4 292,00 

408308 AL0333n Hs.44197 hypothefical protein DKFZp564O0462 184.90 

435202 AI971313 Hs.170204 i<lAA0551 protein 64.80 

407216 N91773 Hs.102267 tysyl oxidase 73.70 

409231 AA446644 Hs.692 GA733-2anligen; epithelial gtycoproteln 3.20 

80 407881 AW072003 Hs.40968 heparan sulfate (ghicosamine) 3^uifot 288.70 

407811 AW19Q802 Hs.40098 cysteine knot superfamily 1. BMP antagon 502.60 

420899 NM.001629 Hs.100194 arachidonate S^ipoxygenase^ctivallng p 6.13 

441020 W79283 H8.3S962 ESTs 178.90 
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453857 AL080235 Hs^l OKFZre86E1621 pmteh 504.30 

428966 AFa59214 H&194667 cholesterol a&hydfoxylase 242.10 

413435 X51405 Hs.75380 cartwxypepMase E 7.30 

436476 AA326t08 Hs.33829 bHLH protein 0EC2 247.20 

5 406747 AI925153 Hs.217493 8nne3QnA2 UaOO 

455800 R22479 Ks.167073 Homo 8a|»enscDNARJ13047fis. clone NT 112.10 

431211 M86849 Hs.323733 gap Junction protein, beta 2, 26kD (conn 583.90 

431890 X17033 Hs.271986 integrin. alpha2(CD498.3]ph3 2subuni 6.56 

431958 X63629 H5.2877 cadherin 3. l/pel.P-cadlienn (placenta 460.90 

10 444865 BE613126 H5.47783 B aggressive lymphoma gene 204.40 

437763 AA469369 HS.S831 OssuBlnhibOorofmetalloprotdnase 1 7.75 

41B870 AF147204 Hs.89414 chemoUns (OX-C moSO. receptor 4 (fus 14.61 

424560 AA158727 H5.150555 protein predicted »>y done 23733 99.60 

436396 Ai683487 Hs.152213 wingiess-type MMTV tntegraOon site fami 24^20 

15 407137 T97307 gb7e53h05.8l Scares fetal tiver spleen 17.68 

410668 BE379794 Hs.65403 hypolheecal protein 4.18 

427660 AI741320 H5.1 14121 Homo sapiens cDNA: nJ2322B lis, ctoneC 116.40 

408826 AF216077 Hs.48376 Homo sapiens done HB-2mRNA sequence 60.30 

442577 AA292998 Hs.163900 ESTs 4.18 

20 416498 U33632 K3.79351 potassium channel. subfamiiyK. member 1 334.20 

447343 AA256641 Hs.236894 ESTs. Highly similar Id S02392al|rtta-2Hn 8.16 

451277 AK001123 Hs.26176 hypothefical protein FU10261 375.30 

445133 AW157546 Hs.153506 ESTs 292,40 

414799 A1752416 H5.77326 InsuDn^ growth factor binding prole 4.38 

25 431805 AF186114 Hs.270737 tumor necrosis factor (ligand)6uperfeml 89.00 

437330 A1353944 K5.SOIIS Homo sapiens mRNA;cONADNFZp761J11 12 (T 322.10 

410687 U24389 Hs.65436 iysyl oxidase^ 1 9.10 

417409 BE272506 Hs^2109 syndecanl 4.05 

426471 M22440 Hs.l70009 transforming growth factor, alpha 138.60 

30 458809 AW972512 Hs,20985 slnS-assodated polypeptide. 30kD 250.50 

448625 AW9707B6 Hs.1 78470 hypothetical protein FU22662 4,89 

450506 NM_004460 Hs.418 fibroblast acOvaOon protein, alpha 11.76 

433336 AF017986 Hs.31386 secreted fHzzled-related protein 2 4.79 

408491 AI088083 Hs.7882 ESTs ^25 

35 437802 A1475995 Hs.122910 ESTs 4.54 

408296 AL117452 Hs.44155 OKFZP586G1 517 protein 175.10 

421155 H87879 Hs.102267 lysyl oxidase 170.10 

451310 AW250651 H5.26213 Human DNA sequence from done RP3447F3 Z91 

439867 AA847510 Hs,161292 ESTs 261.60 

40 417771 AA804698 Hs.82547 retinolc add receptor responder (tazaro 723.00 

410763 AF279145 Hs.8966 hypothetical protein FU21 776 251.70 

431385 BE17&536 Hs.11090 membrane^pennlng 4^omaIns. subfamily A 155.50 

457180 R26022 Hs.194G62 calponin 3. addle 68.00 

424408 AI754813 Hs.14642B cdlagen. type V, alpha 1 17.19 

45 452679 Z42387 Hs.63883 transmembrane, prostate androgen biduoed 5.32 

425139 AW630488 Hs.325820 protease, serine, 23 371.90 

432978 AF126743 Hs.279884 ONAJ domaifrcontaining 7.27 

406850 AI624300 Hs.172928 coOagen, type I, dpha 1 19.30 

421991 NM_014918 Hs.110488 KIAA0990 protein . 190.50 

50 421814 L12350 Hs.108623 Ihrombospondln 2 15.02 

409703 NHL006187 Hs^009 Z-S'-oHgoadenyiale synthetase 3 (100 k 28,57 

452281 T93500 Hs.28792 Homosai^enscONAFU1l04l fis,donePL 519.20 

413048 M93221 Hs.75182 mannose receptor, C type 1 24a60 

404210 #{NOCAT) NM-005936:Homo sapiens myeiold/iymphold 404.60 

55 452862 AW378065 Hs,8687 ESTs 364.20 

447072 D61594 Hs.17279 tyrosylproteln sulfotransferase 1 226.20 

426935 NM 000088 Hs.172928 collagen, type I, dipha 1 4.31 

427390 AI432163 Hs.268231 Homo saptens cDNA: FU231 11 fis. done L 10.41 

417259 AW903838 H3.8I8OO chondrdtln sulfate proteoglycan 2 (vera 22.46 

60 451295 AI557212 Hs.l7132 ESTs, Moderately similar to 154374 gene 23.74 

448569 6E3d2657 Hs.21466 signal transducer and acfivator of trans 5.68 

417148 AA359896 Hs.293885 hypothetical protein FU 14902 190.80 

432359 AA076049 Hs,274415 Homo sapiens cDNA FU10229 lis. done HE 230.50 

. 422278 AF072873 Hs.114218 litaled (Orosophila) homoiog 6 361.90 

65 422545 XD2761 Hs.287820 fibronecHnl a81 

442379 NM.004613 Hs.8265 transglutaminase 2 (C polypeptide. pTOte 7.30 

417412 X16e96 Hs.82112 interleuWn 1 receptor, type I 267.20 

422110 At37B736 Hs.1 11779 secreted protein, addle. cyslBln84fch 5.07 

'431512 BE270734 Hs.2795 lactate dehydrogenase A 270.10 

70 417433 8E270266 Hs.82128 5T4 oncofetal bophoblast glycoprotein 504.60 

426369 AF134157 Hs.1694e7 Kreistef (mouse) maf^elatedleudne zip 10.62 

437470 AL390147 Hs.134742 hypothetical protdn DKFZp547D065 Z80 

417944 AU077196 H3.82985 collagen, type V. dpha 2 14.01 

428797 AA496205 Hs.193700 Homo sapiens mRNA; cDNA OKFZp5e6l0324 (f 9.15 

75 434423 NM.006769 HsJ844 UM domain only 4 297.30 

426125 X87241 Hs.166994 FAT tumor suppressor (Orosophila) homdo 486.20 

422573 AW297985 Hs.295726 Integrin. alpha V (vitronectin receptor 9.73 

421552 AF026692 Hs.105700 secreted frtzzled^elated protein 4 762..90 

424730 N^4«003358 Hs.23703 ESTs, Moderately similar to CE(3T_HUMANC 7.81 

80 400133 NA Eos(>mtPd 357.00 

444361 BE387335 Hs.283713 ESTs. WeaHy similar to S640S4hypolh8S 1150.30 

422048 NM.012445 Hs.288126 spondin 2. extraceDuIarmatrbc protein 4.50 

446019 A1362520 Hs279789 Mstonedeacetylased 11:26 
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426490 NM.C01621 Hs.l70087 

422687 AW086823 Hs.119206 

432401 NM.013330 Hs.274479 

437223 C15105 Hs.330716 

Table 41B 



ary) hydrocsrtKNi rec^tof 
beulin^ graurth factor biralmg pcOte 
NME7 

Homo sapiens cONA FU14358 lis. done HE 



459.50 
Z68 
4.99 
7.65 



Pkey: Unique Eos probeset idenCfier number 
CAT number Gene duster number 
Accession: Genbank accessioo numbers 



m 

416913 
419570 
423733 
434565 
448437 

Taae41C 



CAT Number Accessions 

163001 1 AW934714 BE161007 BE162500 AW749902 AW49864 BE162498 BE161005 AA190449 AW513465 BE161006 BElte499 

1860604 1 W68738 W68831 

231476 1 AA330281 0AA330232AW982521 

380530 1 AA&42125AA654516 

763310'l AW470125 AI734872 AI749559 AW856504 A1583942 AW779036 AW843429 AW844876 A1520713 AW847236 



Pkey: IMique number corresponding loan Eos probeset 

Ref: Sequence source. The 7 digit numbefs in this column are Genbank Idenlilier (GI) numbers. "Dunham I. el si; refers to the pubikaSon enlilted The DMA sequence of 

Iruman chromosome 22.' Dunham I. et a)n Nature (1999) 402:489495. 
Strand: Indicales DMA strand from which exons were predicted. 
NtjKWltlon: Indicales nudeolide positions of predicted exons. 

Ntjwsition ~~ 

278637-279292 

94182-94323,97056-97243,101095-101236.102624-103005 
677-1188 

138534-138629.139234.139294,140121-140335.142033-142479 
168120-166347.166451-166557.169651-169832 
5001fr^158 
13022-13473 

19346-19525.19625-19708,19697-19973,20067-20130,20215-20414 
89914-90033,90729.90855,91131-91198 
5542-6176 

1414-1513,1624-1755 
B031&«0459 
169925-170121 
32282-32416 
51086-51301 
53134-53281 
40977-41150 
11743-11929 

34202.34351.35194.35336,4541245475.45731-45958,4729W7457.4954949658.4979049904.W^^ 
17705-18287 
869B5-87233 
12925-13213 
17546-17953 
8232&«2S61 
63448.63554 



Pltey 


■ tol 


SUand 


400534 


6981626 


Minus 


400560 


9843598 


Rus 


400836 


8954179 


Plus 


401201 


9743387 


Minus 


401480 


7321503 


Rus 


401541 


8072607 


Minus 


401682 


4755167 


Pius 


401732 


1200312 


Rus 


403207 


7630829 


Plus 


403574 


8101156 


Plus 


403776 


7770611 


Minus 


404142 


9856692 


Minus 


404210 


5006246 


Rus 


404285 


2326514 


Plus 


404286 


2326514 


Rus 


404287 


2326514 


Plus 


404682 


9797231 


Minus 


404866 


9366919 


Minus 


405281 


6139075 


Minus 


405849 


7651817 


Minus 


405909 


7705180 


Minus 


406173 


7230224 


Rus 


406293 


5686274 


Minus 


406319 


9211730 


Minus 


406399 


9256288 


Minus 



TABLE 42A: 574 genes upregulaled in pancreatte cancer relalhre to mmA body tissues 

Table 42A fets about 574 genes upregulated In pancreatte cancer relathre to normal body tissues that are BKely to encode proteins amenable to modulalton by small molecules, 
peptides, or anObodies. These genes were selected trom 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expresston data for each prabeset obtained Irom Ihis 
an^sb was expressed as average biten^ (At), a normalized valusreR^ The prot^pnduds of these genes often contain one or more 

domains indtoaOve of have oncogenic function or of transducing Intracelhjiar signals, or of being modulatable by smaB molecutes, peptkles, or antibodies (e.g. pkinas^ deatfMlomdn. 
71m. phosphatase, or lon^transpoiter). Certain predicted protein domains are noted. 

Pkey: Unique Eos probeset tdentifier number 

ExAccn: Exemplar accession number, GerlBank accession nunter 

UniGenelO: UniGene number ^ , , i.. 

Pred-ProlDomans: Certain predicted protein domains. Abbiwiallons used: TM. fransmembrane domdn; 88, signal sequence: «Y, very Bkely to contain; «=M, likely to contain: other 

protein domain abbreviaSons are from PFAM (Nudeie Adds inarch. 2002. 30*^280). 
UniGene Title: UnlGene gene title 

R1 goth percentile of pancreatte cancer AtedivMed by the 501h percentile of nomidfissueAts 

R2 901h percentile of pancreatic cancer Als divWed 1^ the 9Qth peicenOe of nomwl pancreas Als, where the 15lh percenSle of aB normal tissue Afs was suMraded 

from both the numerator and denorr^ator 

Pkey; ExAccn; UnigeneiD: Unigene Tilte; PrBd.ProlDomains; R1; R2 

428230; AA367019: Hs.241395: protease, serine. 1 (trypsin 1); trypsln.loxinJ;SS=M; 107.29; 1.07 

415934; NM_00092B; Hs.992; phospholipase A2. group IB (pancreas); phosllp;SS=M: 83.67; 1.06 

421996; AW5B3807; Hs.1460; glucagon; honnooe2;SS=M; 59.35; 1.61 

406399; : ; NM.003122*:Homo sapiens serine protease ; kazai:SS=M: 55.49; 1.08 

444381; BE387335: Hs.283713; hypolhetieal protein BC01424S: COIlagen;TM»M;SS»M; 53.65; 43.61 
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406685; M18728; ; QiKHuman nonspedfic crassteacfing snOo: lg;TM=M:SS=M: 52.73: 2ZB3 
428698: AA852773; Hs.33483a: K1AA1866 protein: ronB;NA;NA; 3Z44: 13.11 
437145; AF007216; Hs.5462; sohite carrier family 4. sodium Wcartwn; HC03_cotransp;TM=Y;; 29.80; 1.46 
428874; W32133; H3.194356; IranslhyreUn (prealbuniin. amytddosis t Translhyretin:SS=M; 29.42; 1^ 
444754; T8391 1 ; Hs.374341; transmembran© 4 superfanRy member 4; none;B*'Y;SS=M; 28.78: a 13 
418068; AW971 155: Hs.293902: ESTs, WaaWy similar to ISHUSS protein d; nora;TW=M;SS=M; 28.61; 058 
438091; AW373062: Hs.351546; nudear receptor subfamily 1. group I. m; homx)n8,rec^.none; 25.38; 3.83 
413719; BE439580; Hs.75498: smaD Wucibte cytoWno subfamily A (Cy; IL8;SS=M; 24.64; 7.21 
417771; AA804698; Hs.82547; rellnote add receptor responder (tazaro; ncne,none; 2177; 6.74 
414998; NM.002543; Hs.77729; oxidised low density Epopiotein Oaclin; leciii\.qTM=Y;SS=M: 22.96; 4.57 
418318; 114773% H5.84072; transmembrane 4 superfamily member 3; transmenibrane4;TM»Y;SS=M: 2Z31: 5.42 
425573; AB006423; Hs.15B30ai sertna (or cysteine) proteinase Inhiblto: serpin.GCV_H;TM=M;SS=M: 2151; 1.03 
433110; 056494; Hs.3191; rat regenerating IslsWeriveWke, hum; tectln„c;TM=M;SS=M; 21.90; 0.60 
426490; NM_001621; Hs.170087; aryl hydrocarbon receptor, PAC.PAS;TM=M;; 21.41; 19.89 
453863; X02544; H5.572; orosomuooid 1; Upocalin«8)dedh»ub(qultm.lRK;S&=M; 20.80; ai2 
421126; M74587; Hs.102122; Insulin^ growth bdor binding prote; thyroglobuliiLl.lGFBP:SS=Y; 20.60: 8.48 
451035; AUD767B5; H3.430: plasOn 1 (I tsotomn); efhand.CH^apftn_N;SS»lit 19.25; 3.53 
413859; AW992356; Hs.8364; Homo sapiens pynivate dehydrogenase kina; SAM.PNT.non0; 18.38; 2.53 
420332; NM.001756: Hs.1305; serine (or cysteine) proteinase Inhiljito: 5eTpln;TM=M;SS=M: 18.19; 2.29 
438089; W05391: Hs.351546; nudear receptor subfamily 1. group I, m; hofmonejec^C4,none; 17.67; 4.80 
417426; NM-002291; Hs.82124: famlnln, beta 1; Iamlnln.EGF.lanfilnln JJternUntegriruB;SS=M; 17.08; 6J7 
427509; M62505; Hs.2161; complemenl component 5 receptor 1 (C5a I; 7lmJ;TM=Y;SS=M; 16.89; 7.15 
441031; All 10684; Hs.7646; fibrinogen, B beta polypeptide; fibrlnogen_C,G-alpha,arf;TMs»H;SS=M; 16.59; 7.74 
445033; AV652402; Hs.72901; cydSKlepcndent kinase inhibitor 28 (pi; ank;; 16.28; 9.22 

431183; NM«006855; Hs.250696: KDEL (Lys^Gtu-Leu) endoplasmic retic; Efttumen_recept;TM=M;SS=M; 15.96; 138 

444784; D12485; Hs,11951; ectonudeolWe pyrophosphatase/phosphocfi; SomatoinedinJB,EndoriiidBase^hospho(fiesl;TM=Y:SS=M; 15.65; 1.33 

408243; Y00787; Hs.624: interleuHn 8; HUH,PAS.Il^TM=Mn 15.53; 4.34 

419355; AA428520: Hs.90061; progesterone iMndlng protein; hemaJ:TM=Y;SS=M: 15.45; 10.50 

425006; R49031; H5.22527; ESTs; pkinaso.TBC; 15.17; 0.58 

433376; AI249361: Hs.74122: caspase 4. apoptosis-rclated cysteine pn CARD,ICEj>10.lCE.p20:SS=M; 14.84; 3.04 
422260: AA315993; Hs.l05484: regenerating gene type IV; lectln.c;SS=M; 14.71; 2.89 
430280; AA361 258; Hs.237868; InterieuMn 7 receptor th3,none; 14.28; 1 1.47 

408983; NM_000492; H3.663; cystic fibrosis transmembrane conduclanc; ABC_tran^Cjnembr8ne,PRK,Bac^expO(L3;TM=Y;: 13.98: 1.18 
414812: X72755; Hs.77367; monokine Induced gamma Interferon; IL8:TM=M:SS=Y; 13.81; 7.69 

429170; NM_001394: Hs.2359; dual spedffclty phosphatase 4; Rhodanese.DSPc,Y_phosphatase.RibosomaLS3_N:TM=M:; 13.59; 2.24 

425988: BE045897; Hs.53985: ESTs. We^ similar to 138022 hypotheti; nonB,none; 13.54; 0.95 

409512; AW9791B7; Hs.293591; melanoma differentiation assodaled prol; DEAD,heliC8se_CCARt);TM=M;; 13.48; 3.59 

429556; AW139399; HS.9B988; ESTs; none;TM=Mn 13.20; 1.16 

417079; U65590: H8.81134; inteileukin 1 receptor antagonist; IL1;SS=1^ 12.83; 7.72 

427286; AW732B02: Hs.2132; epidemial growth factor receptor pathway; SH3.TonB_boxC;TM=M;; 12.72; 9.01 

431912; A1660552: Hs.356183; ESTs» Weakly similar to A56154 AW subst; none,Acyl-CoA^dhMl-CoA_dh_M,Acyi-CoAjJh.l* 12.72; 6.72 

4121 16; AW402166; Hs.784: Epstein-Barr virus Induced gene 2 (lymph; 7tnLl;TM=Y;SS=M; 12.71; 12.56 

425397; J04088: Hs.156346; topoisomarase (DNA) II alpha (170kD): DNAj|yr8seB.DNA.topolsolV,HATPasejc:SS=M; 1152; 4.92 

409142; AL136877; Hs.S0758; SMC4 (struchiral maintenance of duomoso; ABC_tFan«M.SMC_N,SMC.C,DUF164.non8; 1Z38; 7.59 

428157; AI738719: Hs.198427; hexokinase 2; hexokinase.hexoklnase2.none; 1Z34; 11.53 

424273; W40460: Hs.144442; phosphonpase A2. group X phosIlp;TM=M;SS=Y: 12.32; 3.02 

414821; M63835; Hs,77424; Fc fragment of IgG. high affinity la re; ig:TM=Y;SS=M; 1129; 2.21 

415323; 8E2S9352; Hs.949; neutrophil cytosoBc fador 2 (65kD, chn SH3.TPR;TM=Mr. 1116; 4.14 

427263; AL1 19796; Hs.174185; ectonudeotidepyrophosphatase/phosphodi; Sutf8t8se.Soniatomedin_B.Phosphodfest,EndonudeasetTM=M;SS"Y; 11.97: 5.93 
434779; AF153815; Hs.50151; potasshim inwardly-rectifying channel, s; IRK:TM=Y;; 11.76: 1.5B 
426227; U67058; Hs.154299: Human proteinase adhraled rBceptor-2 mR; 7lnLl;TM=Y;SS=M; 1 1.75; 3.56 
412228; AW503785; Hs.73792; complement component (3d/Epsteln Barr vl; 8ushi;TM=Y;SS=M; 11.67; 3.77 
450737; AW007152; Hs.63325; transmembrane protease, serine 4; trypsin.ldUBCepLa,none; 1 1 .65; 4.52 

439750; A1359053; Hs.67664; Homo sapiens mRNA full length Insert cDN; IMPDH-C.IMP0H..N.C8S,lntegrin J,Ridn.BJectln: 1 1.56; 5.46 
431512; BE270734; Hs.2795: bctalo dehydrogenase A; WhJdh_C.SH3.pWnas8.UBA;TM=Mn 11.55; 5.11 

429638; AI916662:Hs.211577; kinedin 1 O^in receptor); bZIP,Tropomyosin.spectrin.UP3PLCETP.B56,M;TMsY;SS=M: 11.47:4.65 

445133;AW157646; Hs,198689; ESTs; efhand.8pediin,GASlSH3,Ptectin,RA,Xylose«isom.RIO,bZIP,Tropomyo8ln.Myc^M^ 1162 

411352; NIUL002890; Hs.758; RAS p21 protein adhrator (GTPase adwa; SH2.SH3.C2,PH,RasGAP;TM=M;SS»M; 11.24; 9.95 

447343; AA256641; Hs.236894; ESTs, Highly similar to SQ2392alpha-2-m; none.none; 11.19; 4.16 

422616; 6E300330; Hs.11B725; saienophosphatB synthetase 2;AIRS^RS„C;TM=M;: 11.17; 5.98 

422241; Y00062: Hs.l70121: protein tyrosine phosphatase, receptor t; kinesin,lh3,Y_phosphatase:TM»Mn 11.14; 9.09 

440594; AW445167; Hs.126036; ESTs; nona,none; 11.05; 16.45 

425289; AW139342; Hs.155530; Irterferoo. gammaOndudble prot^ 16; PAAD.DARN,HIN;SS=M: 11.05; 10.38 
427700; AA262294; Hs.180383: dual spectfidty phosphatase 6; Rhod3nese.OSPc;TM=M;; 1 1.05; 4.70 
448811; AI590371; H5.199460; ESTs; none;TM=Yn 10.85; 9.69 

424321; W74048; Hs.1765; tymphocyto-spedffc protein tyrosine kin; SH2.8H3.pWnase;TM=M;; 10.72; a65 
429083; Y09397; Hs.227817; BCtJ^Iated protrin Al; Bd.2;TM=M;; 10.51; 1197 

424247; X14008; Hs.234734; lysozyme {renal amytoWosls); ly8,lg.FAD.SynthJdh,klh.C4)klnase;8S=M; 10.37; 6.35 

436856; AI4693S5; Hs.127310; ESTs; pkinase.rTm:TM»M;: 10.36; 174 

427337:246223: H3.176663;Fc fragment of IgQ. taw 8fBni^lItb.nio:TM«Y^ 10.34; 3.14 

417018; M16038; Hs.80887; v^l Yamaguchi sarcoma vN related ; SHlSH3,pkln8se;TM=M;: 10.34; 4.47 

418299: AA279530: Hs.83968; htegrln. beta 2 (antigen C018 (p95). Ir. integrin_B.EGF,PSl:TM=Y:SS=M; 10.21; 4.58 

451820; AW058357: Hs.199248: ESTs; 7bn.1;TW=Y;SS=M; 10.18; 167 

413048; M93221; Hs.75182; mannose receptor, C type 1; fnlledirLcRk*LBJedin;aink;TM=Y;SS=M; 10.17; 8.35 
429752: H5234B: H136636; ESTs; pMnase^pMnase; 10.13: 1135 

452721; AJ269529; Hs.301871; sohite canter family 37 (glycerd^pho; M0RM,s«9ar.tr,TM»Y:SS=M; 10.08; 8.74 
421462: AF016495; Hs.104624; aquaporin 9; MiP;TM=Y;SS=M: 10.05; 6.06 
452698; N^L001295: Hs.301921: diemoWne (CVC motiO receptor 1; 7tmJ;TR*>Y;SS=M; a96; 3w16 
416389; AA180072: Hs.149846: Inlegrin, beta 5; integrfn^none; 9.65; 9.59 

421044; AF051871: Hs.101302: Human DNA sequence from done RP1-23BD15: frt3.vwa.Cdla8en.TSPM:TfcA=M;SS=M; 9.78; 5.96 

446620; AA128808: Hs.179902; transporter-fike protein; nonfl:TM=iY;SS=M: a75; 164 

405102; ; ; C15001220*:gi)4469S58|gb)AAD21311.1| (AF; DA6J>E-fatRd.PliRhoGEF,DC1;SS»M; 9.74; 1.88 

418693; AI750878; Hs.B7409; thrombospondin 1; EGF.tsp.l,vwcTSPN,tsp_3:SS^ a72: 6.94 
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426535; AU077012: H5.288582: ESTs, Wbakly similar b ubiquBous TPR m: Kunil2^BPn,KiinlU„BPn.7lni.2.HRM; 9.68; laSB 

448105; AW591433: Hs.298241; Transmembrane protease, serine 3; tdLrecepLatrypsin:TM=Y;SS=M; 9.67; 4.06 

456266; L29073; H5.198726; coM stwdt domdn protein A; 7tm-2.HRM.CSO;TM=Y;SS=M; 9.62: Z36 

410240; ALi57424; Hs.61289: synaptpjarfn 2; ExoLendo jihos.SyM.nnvGrafrw«j»rins^^ 9.62; 3l77 

457001; J03258: Hs.2062; vilairfn D (1^ dOiydraxyvltafrin D3) re: honTnnejec,z«M.MelanoWo_5;TM=*t 9.6flt &0S 

456373; BE247706; Hs.89751; membranwpanmng 4Htomalns, suMamfly A; nai0:TMsY;; 9.57; 3.77 

416847; L43821: H3.8D261; enhancer d filamentafion 1 (cas^do; SH3:TM^M:: 9^; 10.50 

437158; AW090198: Hs.348709: KIAA1150 protein: none;NA;NA; 055; 8^7 

426108; AA622037; Hs.166468; programmed cen death 5; DUF122;TM=M;; 9.47; 5.67 

403344; ; ; NM_0ClC341:Homo sapiens solute carrier fa; aipta^3se;TM='Yn 9.47: 1.42 

449543; AF070632: Hs.237»; Homo sapiens done 24405 mRNA sequence; K_lBtra.lofLtrans^one: 9.46; 112 

433233; AB040927: Hs.301804; KIAA1494 protein; SH3,zraHC4;TM=M;; 9.42, 4.01 

444838: AV551680:H5.20855B; ESTs; WegriaAFG^,nonD: 9.42; 1.87 

439803; AA001021; Hs.6685; thyroid homwne receptor Inleractor 8; none.none; a41; 5.55 

428505: AL035461: Ks.2281: chromogranln 6 (secretogranin 1); Granin;SS=M; 9.40; 3.46 

411213: AA676939; Hs.69285! neuropflin t; MAM.F6J^8.lype„C.CUB,CUB.MAM.F5_FB„lype.C; 9.38; 6,32 

432810; AA883400: Hs.374489; ESTs; none,Skp1^ 9.38; 4.36 

427581; NM.014788; Hs.179703; KIAA0129 gene product; SPRY;cW_box;TM=M;; 9.34; 8.26 

413109; AW389845: Hs.110855; ESTs, similar to leukemia virus receplon PK04,none; 9.34; 4.67 

428450; NM_014791; Hs.184339: KIAAD175 gene product; KA1,pHnase;TM=M:: 9.31; 4.24 

4081 13; TB2427; H8.194101 ; Homo sapiens cDNA: FU20869 lis, done A; 7tnu3,non9: 9.24; 7.12 

448030; N30714; Hs.325960; mentrane-spanning 4^omalns, sulMy A: none;TM»Y;S5sM; 9.23; 6.03 

437672; AW748265; Hs.5741; flavolienxvrolein bST, heme_1,NAD«bindlng,lipoxygenase,FAD_blndIn9.6;TM=M;; 9.22; 10,72 

416498; U33632: Hs.79351; potassium channel, subfamily K. member 1; ion_trans;TM=:Y;SS=M; 9.20; 4.46 

409956; AW103384; Hs.727; Inhibin, beta A (activtn A, acUvIn AB a; TGF^eta,TGFbjKOpeptlde,Tub;SS=M; 9.19; 1^46 

413095; AA494359; Hs.30715; potassium voltag&^ted channel, tsk-rel; none.START; 9.15; 2.18 

418838; AW385224; Hs.3519B: ectonudeoiide pyrophosphatase/phosphodi; Pbosphodiesl;TM»Y;SS»(yt 9.14; 3.03 

452960; AK001335; Hs.31137; protein tyrosine phosphatase, receptor t; Y_phosphat3Se,none; 9.14; 11.75 

417821; BE245149; H5.82643: protein tyrosine kinase 9; cofirin^F;SS=M; 9.1 1; 4.29 

427157; U51 166; Hs.173824; thymine-DNA glycosylase; UOG;TM=M;; 9.06; 9.68 

431341; AA30721 1; Ks.251531; proteasome (prosome. maoopain) subunit,; proie3some;TM=M;; 9.05; 5.61 

413367; NM^006517; Hs.75317; solute carrier famfly 16 (monocartioxylic; suoar^tr;TM=Y;; 9.04; 5,79 

437296; AA350994; Hs.20281; KIAA1700; Rhodanese.DSPc;TM=M;; 9.02: 5.75 

418888; AU076801; Hs.89435; cadharin 17, U cadherin (iwer-lntestin: cadhcrin:TM=Y;SS=M; 8.94; 5.01 

446406; A1553681; H5.348490; Arg/AbHnteracting protem ArgBP2; Sorb.none; B.91; 1.77 

428820: AA436187; Hs.172631 ; Integrin, alpha M (complement component ; vwa.lntegrin_A,FG-GAP;TM=Y:SS=M; 8.85; 4.74 
434398; AA121098: Hs.3838: senim^nduciWe kinase (SNK); pWnase,POLO,box;TM=M;; 8.78; 4.54 
453902; BE502341; H3.3402: ESTs; none,none; 8.72; 3.71 

433334; Ai927208; K5.231958: matrix metalloprotelnase 28; Pepfidase_M10,none; 8.71; 4.28 

446488; AB037782: Hs.15119; K1AA1361 protein; pWnase;SS=M; 8.70; 171 

450247; AF1 23303; Hs.24713; hypothetical protein; em3nd,mito_carT;TM=Y:SS=M; 8,68; 3.40 

432101; AI918950; H5.123642; EphA3; fn3,pklnase,SAM.EPH_lbd:TM=Y;SS=M; 8.62; 5.62 

410763; AF279145: H8.8966; hypolhettoal protein FU21776; none,none; 8.61; 13.53 

422278; AF072B73; Hs.1 14218; frizzled (DrosophHa) homolog 6; Fz,Frizzled.7tm_2;TM=Y;SS=M; 8.55; 4.82 

425465; L18964; Hs.1904; protein kinase C. lota; pWnase,OAG_P£-blnd,pklnase_C,OPR;TM=M;; 8.60; 3.18 

419111; AA2341 72; Hs.1 3741 8; ESTs; none.IRK; 8.47; 7.51 

430024; AI808780; Hs.227730: integrin. alpha 6; lntBgrin^F&GAP;TM=Y;SS=M: 8.45; 3.46 

447574; AF162666: H5.18895; touslecMike Mnaso 1; p»nase;TI^; 8.45; 5.30 

447217; BE465754; Hs.17778; neuropflin 2; CUB.MAM,F5J=8.lyp6_C:TM»M;SS=M: 8.44; 610 

419034; NM_002110: Ks.89555; hemopoietic ceH kinase; SH2,SH3,pkina5e;TM=Mn 8.43; 4.87 

405555; ; ; homeodomain-lnteracfing protein kinase 3; tfypsin;TM^M;: 8.39; 0.68 

417412; X16896; Hs.821 12; Interieukln 1 receptor, type 1; i9.TIR;TM=HA;SS=IW; 8.35; 4.74 

405556; ; ; homeodom^n-interacting prolan kinase 3; tfypsln;TM=Mn 811; 0.87 , 

407687; AK00201 1; Hs.37558; hypothetteal protein FU1 1 149; lys.ig,FAD_SynOi.ldh,tdh_C,pkinase;SS4^ 8.28; 112 

408051; AI523351; Hs.172148; ESTs; PH,RhoGAP,none; 8.27; 5.65 

449523; N!yL000579; Ks,54443; diemoklne (C-C motiO receptor 5; 7lm>1;TM=Y;SS=M; 8.26; 5.49 

429732; U20158; Hs.24B8; lymphocyte cytosofic protein 2 (SH2 doma; SH2;SS=M: 8.24; 8.91 

405204; ; ; NM_002086*:HoTno sapiens growth factor re; SH2.SH3;TM=M;; 123; 6.43 

426806; T19228; Hs,172572; hypothetical protein FU20093: 8nk,pWnase,UPF0073;SS=M; 8.20; 6.1 1 

428428; AIX37544; Hs.184298; cydiiMlependent kinase 7 (homdog of Xe; pkln3se;TM=M;; 8.18; 137 

450375; AA009647:Hs.352537; a disintegrin and melaltoprolfilnase doma: Reprolysln,PepJ412Bj)ropep,djshtegrin,Reprolysln,Pep,M12BjHope|tf^ 12.24 

443303; U67319; H$.9216; caspase 7. apoptosls^lated cysteine pr pkln85e,ICE_p10.ICE_p20;TI^M;SS=M; 8.15; 4.61 

413132; NM_006823: Hs.75209; protein kinase (cAMPHJependent.catalyfi; PKI;SS=M; 115; 11.12 

428513; BE220806; Hs.184697; Homo sapiens done 23765 mRNA sequence; PSI,none: 113; 13.28 

429345; R11141; Hs.199695; hypothetical protein; K.tetra,SAM; 8.13; 1.15 

425838; NM„014071; Hs.l59613; nudear receptor coactivator RAP250; pen none;TM=M:: 112; 4.54 

425836; AW955696; Hs.90960; ESTs; Cbl^N.Cbi„N2.CblJllUBA,rM:3HC4,none; 111; 7.47 

406366; ; ; secreted frizzled-related protein 4; trypsin;SS=M; 8.05; 0.69 

429663; M68874; Hs.21 1587; phosphd^ase A2. group IVA (cytosdic. ; C2.PIA2^B;TM=M;; 8.04; 5.20 

458946^ AA009716! Hs.4231 1; ESTs; nDne,DSFt,Yj>hosphatase; 102; 1.93 

425280; U31519; Hs.1872; phosphoendpyruvate carboxyMnase 1 (sd; PEPCK;TM=M;: 7.97; 19.33 

445800; AA126419; H3.32944; inosltd pdyphosphale^-phosphatasa. ty; none,none; 7.90; 11.44 

449444; AW818436; Hs.351306; solute carrier family 16 (monocartx)xyfic; none;TM=Y;SS=M; 7.89; 7.00 

426167; AF039023; Hs.167496; RAN binding protein 6; AfmaifiltojBeg,HEAT_PBSr. 7.83; 11.16 

400408; S75765; : Homo sapiens delta CGK6 gene, parfid c; 7tnu1.none; 7.81: 178 

448362; AA641767; Hs.21015; hypothetical protein DKFZp564L0864 slmO; sugar.b;TM»Y;SS=M; 7.78; 7.02 

457670; AF1 19666; Hs.23449; Insulin receptor tyrosine kinase substra; 8H3;TW=M;; 7.75; Z63 

409799; D11928; H3.76845; phosphoserine phosphdase^ Hydrolase:TM=M;: 7.72: 2.68 

447887; AA1 14050; Hs.19949, caspase 1 apoptosls^lated cystdne pr; lCEj}10,ICaj)20.DEO;TM=M;: 7.68; 140 

421684; BE281S91: Hs.108768; hypolhellcal protdn FU10S11; AmiadiOojBeg;SS^ 7.43; 6.57 

434699: AA643687; Hs.149425; Homo sapiens cOTM FU11980 Os. dona HE; fhideosldej^^ 

417880; 6E241S95; K5.82848; sdeclin L (lymphocyte adhesion mdecde; EGFJedii\jB,BUshl;TI^M;SS=^ 7.38; 160 

436729; 6E621807: Hs.351316; len sm emb ia ne 4 superfemOy member 1; none;TM»Y;SS"M; 7.29; 178 
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428970: BE276891: H5.194691; retinoic add Induced 3; 7tnu3:TM=Y;SS^ 7.26; 11:00 

426761; AI015709: Ks.172089; PORIMIN Pro-onoQsIs receptor Indudng ma; rane:TMsY;SS»M: 7.25; 7.22 

413880; AI66084i Hs.1 10915; InterteuWn 22 receptof; rissuaJac:TM=Y;SS=M: 7.24; 0.38 

418945; BE246762; Hs.89499; arachidonate 54ipoxygenase; npoxygenase.PUT;TM=M:; 7.22: 6.45 

413441; AI929374; Hs.75367; Sfc4ike^apter, SH2,SH3;TM=M;; 7.20; 5.72 

426158; NM.001982; 85.199067; v-erb-b2 avian eiythroijlasfic leukemia v; Funn4ike4iMna5e.Rece|>.Ljfcxnain.l^^ 7.13; 3.97 

42B474; AB023182; tte.184523; KIAA0965 protein; pkinase;TM=M;; 7.13; 5.43 

421582; AI910275; Hs.350470; trefoil factor 1 (breast cancer, estioge; trefc^.Gastr]n;SS»M; 7.08; 21.61 

449843; RB5337; Hs.24030; solute canier family 31 (copper transpo; none;TM=Y;SS«M; 7,07; 5.18 

4521 10; T47667; Hs.28005; Homo sapiens cONA nj11309 fis, dona PL; pidnaseA^njecp.none; a94; 4.82 

451295; A1557212; Hs. 17132; ESTs. Moderately similar b 154374 gene : pklnase.DAG^PE-trind.pWnaso_COPR,none: 6.92; 15.34 

430680; AW1 38724; Hs.1 68974; ESTs. Highly similar taALU7^HUMAN ALU 8; Yjjhosphalas8;^aptjn,N.Yj)hosphatase; 6.88; 1.94 

44160C: AA939347: Hs.127223; Homo sapiens cysteaie knot protein (ZSIG; 7tnLl,ldljacepLa.lRR;SS=M^ 6.86; 0.97 

410887; U24369; Hs.65436: lysyl oxidase^ 1; LysyLoxUase:SS=:M; 6.63; 7.24 

425009; X58288: Hs.154151; protein tyrosine phosphatase, receptor t; fn3,ig.Y_pho5phatase,MAM;TM=:Y;S$=M; 6.83; 11.43 
400539; ; ; Target Exon; none:BN^;; 6.70; 1.19 

431113; AK000673; Hs.274337; hypothetical protein PU20666; pkinase;TM=M;: 6.65; Z21 

445280: AW05506d; Hs.343220: avian sarcoma vbus CT10 oncogene ; SH2.SH3.none; 6.61; 10.66 

425S34; NNL001639: Hs.1957; amytoU P component senim; penlaxln:TM::M:SS^ 6.57; 2.20 

435706; W31264; Hs.7045; GL004 protein; PDEase.GAF,none: 6.55; 11.44 

415906; AI751357; HS.2B8741; Homo sapiens cDMA: FU22255 fs, done H; Ephrin,nonB; 6.45; 5.25 

408308; AL033377; Hs.44197; hypothetical protein DKFZp564D0452; none,none; 6.42; 9.14 

432336; NM_002759; fte.274382; protein kinase, Interferofrlndudble dou; dsnn,pkin8S©:TM=M;; £42; 4.12 

417874; BE616160; Hs.82829; protein tyrosine phosphatase, non-recepi; Yj>hosphatase;TW»Y;: 6,42; 126 

446872; X97058; Hs.16362; pyrin&ilnergfc receptor P2Y, G-prote!n c; 7tm_1;TM=Y;SS=M; 6.41; 4.54 

444006; BE3950B5; Hs. 10086; type I transmemtjrane protein Fn14; ldLrBcepLa,PKD.MHCJ;TM=M;SS«^; 6.38; 3.55 

412970; AB026436; Hs.177534; dual spedfidty phosphatase 10; Rhodanes6,DSPc;SS^ 6.35; 4.95 

422583; AA410506; Hs.27973; KtAA0874 protein; ank.G«Ipha;TM=M;; 6.35; 3.56 

452355; N54926; H5.29202; G proleifrcoupted receptor 34; 7lrTi.1,0ATP.C;TM=Y;; 6.32; 11.02 

422282; AF019225; Hs,1 14309; apoBpoprotein L; MoWLExbB,TM=Y;SS»M; 6.32; 5.15 

407235; D20569; Hs.169407; SAC2 (suppressor of actln mutatons 2. y; none,(^somal^S13,GalaclosyLT^,adh_phort.2^C3HC4: 6.30; 8.35 
4264B6; AW583497; H$.184604; pancreatic polypeptide; hormone3;TM=M;SS=^Y; 6.29; 3.51 
408847; AW290997; Hs.30348; ESTs; pkinaseJg.none; 6.28; 3.63 

428179; A1127772; Hs.279696; serum/glucocOftfcoW regulated kinase^; pkinase,PX.pWnase«C;SS=M: 6.28; 3.50 

443614; AV655386; Ks.7645; fibrinogen. B beta polypeptide; none,none; 6.26; 7.48 

425354; U52027; Hs.155935; complement component 3a receptor 1; 7tnu1;TM=Y;SS=M; 6,25; 3.98 

448888; AW196663; Hs.200242: caspase recruitment doman protein 6; CARO;TM=M;; 6.21; 4.10 

428180; All 29767; Hs.1 82874; guanine nucleotide bMng protein (G pn G^ha.8rf.TM=M;; 6.18; 4.62 

409245: AA361037; Hs.356436: tRNAIsopentenylpyrophosphate transferas; AnnadiIto.seg;TM44;; 6.17; 11.15 

417952; A1192838: Hs.372643: du^kpedfid^ lyroslne^hosphoryl; pkIn3se,none; 6.17; 3.05 

445701; AF0S5581; Hs.13131; fymphocyte adaptor protein; SH2.PH;TM=U;; 6.16; 11.90 

425910; AA830797: Hs.184760; CCAAT-box-binding transcflp8on factor; none;TM=M;; 6.10; 196 

426797; AW936258; Hs.342849; ADP-ribosylation lactor-like 5; aif,Ca_channeLB,SH3; 6.03; 3,17 

408331; NIA.007240; Hs.44229; dual spedfusly phosphatase 12; 0$Pc;TM^M;; 5.99; 2.55 

441384; AA447849; Hs.268660; Homo sapiens cONA: FU22162 fis, done H; 7tm_3inon6; 5.97; iai2 

414217; AI309298; Hs.279898; Homo sapiens cDNA: FU231651is, dona U none;NA;NA; 5.92; &47 

418506; AA084248; Hs.85339; Unknowa protein for M6C:29643; none.none; 5.91; .1.94 

436345; AA87X08; Hs.121572; ESTs; CARD,BIR.zf^HC4.CARD.BIR.zf-C3HC4; 5.90; 1.40 

414087; W19712; ; gb'.zb36d03.r1 Soares_parathyrold_tumor_N; pkinase.none; 5.85; 0.90 

430396; D49742; Hs.241363; hyakflonan-binding protein 2; ank.dealh/U5,EGF.kringle.tjypain,NBbuBn,UM;SS»M: 5.77; 1.24 

431385; BE178S36; Hs.11090; membrane-spanning 4HJomaIn8, subfamily A; none,none; 5.71; 4.00 

427557; NM_002659; Hs.179657: plasnnnogen activator, uroWnase recepto; UPAH.LY6,ET,Pl^lnh;SS=M; 5.71; 3.83 

414171; AA360328; Hs.865; RAP1 A, member of RAS oncogene family; pklnase,DAG.PE-blnd,RBD,ras,DC1,GFP;TM=M;; 5.6^ 3.07 

41 8870; AF147204: Hs.89414; chemoWne (OX-C motif), receptor 4 (fus; 7lnu1,7tm_2;TM=Y;SS=M; 5.68; 1192 

425317; AW205118; Hs.210546; interteukin 21 receptor; none;TM=Y;SS=M; 5.60; 5.45 

417863; AB000450; Hs.82771; vaccinia related kinase 2; pkinase;TM=M;; 5.59; 4.19 

400151; ; ; Eos Gontrd; ATJiook,DNAjn!3jepdr.HATPase_cUCLcon;TM=M;; 6,53; ai3 

450139; AK001B38; Hs.355608; seium/^lucocorticoid regulated kinase; nQne,none; 6.52; 8.61 

418203; X54942; Hs.83758; CDC28 protrin kinase 2; CKSr. 5.52; 10.04 

433556; W56321; Hs.111460; catehimtealmodurHHlependent protein kin; pkinase,none: 5.51; 6.75 

424701; NI\4_005923; Hs.15l988; mitogen-acOvaled protdn kinase kinase ; pklnase;TM=M;; 5,47; 4.58 

415875; AA894876: Hs.5687; protein phosphatase IB (formerly 2C), ma; PP2C;TM=M;; &43; 6.30 

408761; AA057264; H5.238936: ESTs. Weakly similar to (defline not ava; 7tm_1.none; 5.42; 2.59 

41 5444; BE247295: Hs,78452; solute carrier family 20 (phosphate tran; PH04.UM;TM=M;; 5.37; 8.69 

444184; T87841: Hs.282990; Human DNA sequence from done RP1-28H20 ; pkin3se,RI01 >W.KOW:TM=M:: 5.36; 3.32 

410434; AF051152; Hs.63568; tolHike reoeplof 2; LRR.LRRCT,TIR;TM=M;SS=M; 5.36; 3.94 

429023; NM.Q00312: Hs.2351; protein C (Inacthrdtor of coagulatkm fa; EGF.trypsln,gta;SS^M; 5.31; 4.30 

421559; NM.O14720; Hs.105751: Ste2awlated serine/threonine kinase: pklnase,UVR;TM=M;; 5.31; 3.26 

429922; 297630; Hs.2261 17; HI histone family, member 0; linker.hlstone;TM=M;; 5.27; 3.12 

440682; AW362152: Hs.27181; nudear receptor binding fador-2; cydln,bZIP;ThAsM;; 5.26; 4.82 

411558; AA102670; Hs.70725: gamma-amlnobulyric add (GABA) A recepto; NeurjdianJ£D.Neur_chan_memb:TM=Y;SS=M; 5.2S; 11.26 
428234; U93553; Hs.183123: nudear reoeptor subfamiiy 5. group A, m; hamK)n8jec.zKM:SS»M; 5l20; 1.11 
408683; R58665; Hs.46847: TRAF and TNF raceptor-assodaled protein; ExD_endojihosJM=M:: 5.19; 6.25 
408657; AA782601; Hs.173328: ESTs; B56,none; 5.18; 5.47 

438746; AI88581S; Hs.184727; Human melanoma-associated antigen p97 (m; transfenln,Guanylats Jdn,PDZ.SH3; 5.17; 4.02 

438698; AW297855; H5.361 171; ESTs, Weakly similar to 138022 hypotheti; npoxygena5e.PLAT.none; 5.16; 2.91 

442200: AW590572: Hs.235768; ESTs; non8.none: 5.11; 4.22 

41 8738; AW388633; Hs.6682; sohito cam'er fandly 7. (caftonic amino; none.none; 5.08; 171 

419088: AI538323; Hs.367688; IntegHn. beta 8; tntegrin_B.none; 5.07; 3.53 

414555; N98569; Hs.76422; phosphdipase A2, group IIA (platelets, ; phoslip;TM=M;SS=Y; 5.05; 3.42 

408414; A1114688; Hs.193400; ESTs, Weakly siniar to 2109260A B ceQ ; fn3.i9;TNNY;SS«M: 5.05; 3.41 

430407: H23551; Hs.30974; ESTs; pkina5e.PBD.n0nQ; 5.03; 1.63 

427127; AW802282: Hs.22265; pynn/ate dehydrogenase phosphatase; PP2C.nQne: 5X0; 5.14 
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452194; At694413; Hi373S99: Ufalqi^fiivlike pnM FAT10777 - dhibiq; nona.non8; 4.98; ^e5 
410973; AW408183; Hs.58488: catenln (cadherfaMSSOdated protein), a; Sl8Donin.Vincunn;SS=M: 4.97; 1060 
409430; R21945; Hs.346735; spDdng factor. 8^gln^nete8rtn8^ich 5; DSPcRhodan8se.i>one; 4.96; 2^7 
432841; M93425; H5.62; proton tyroGine phosphatase, noiwecept: Yj)hosph8tasG;SS::M; 4.88; 21.69 
433470: AWg60564: H$.351316; tFansnnmtuane 4 superfiBmlly member 1; noR8;TM=Y;SS^ 4.68; 4.60 

418529^ AW005e95: Hs.250897; TWWiised gene; B8ndj»1.ERHpkinase.tJW.UWCT.MAM.Nudeoplasn^ 4,79; 
5.47 

421425; Al(001564; Hs.104222: hyprtheDcd protein FU10702: efhand,k8zal,aif.r8s.7tnLl;TM=M;; 4.75; 5.41 

414135; NM-004419; Hs.2128; dual spedflcHy phosphatase 5; Rhodan8S8.DSPcYj)hosphatase;TM=Ms 4.74; 9.76 

417640; D30857; Hs.82353; proton C receptor, endothelial (EPCf^; none;TM=M;SS=M; 4.73; 4.68 

430630; AW2e9920; Hs.2621; cyslafin A (stefin A); cystaIln;TM=Mn 4.65; 5.61 

452239; AW379378; Hs.356289; protein tyrosine phosphatase, receptor t none,none; 4.63; 6.62 

427333; AF06n97; Hs,l76658; aquaporin 8; M1P;TM=Y;SS=M; 4.63; 0.80 

431890; X17033; Hs.271986; Integrin. alpha 2 (C049B. alpha 2 sulwni; vwaIntegrin_A,F&GAP;TM=Y;SS=M; 4.58; 11.38 

428065; AI634046; Hs.157313; ESTs; ICEj)20,DEO,ICEj>10,lCEj>20,DED; 4.55; 4.51 

428582; BE336699; H3.1B5055: BENE protein; none;TM=Y;SS=M; 4.54; 8.76 

416224; NM.002902; Hs,79088; reliculocfilbin 2, EF-hand cateiimi bindin; efhand;SS=M; 4.54; 1957 

450056; BE047394; Hs.8208; ESTs, WeaWy sinralarto 871512 hypotheti; ABC,tran,ABCjnembfBneJg,MHCJ_belaSRP54.proteasome,ABCjn^^ 4.49; 10.47 

414987; AA524394; Hs.294022; hypothetical protein FU14950; SH2;TM=M;; 4.41; 7.27 
447232; AW499834; Hs.327; intarleuldn 10 receptor, alpha; none;TM»M;SS=M; 4.41; 10.25 
433208; AW002834; Hs.24095; ESTs; 8rf.Ca_channeLB»SH3; 4.39; 1Z14 
403208; ; ; Target Exon; tectin„c»none: 4.37; 0.76 

44048G: BE243513; Hs.7212; Ivpoltietlcal protein PP1044; LRR,PAAD.DAPIN,AAA,CARD.NB.ARC;MA;NA; 4.36; 1034 

414278; AA330116; Hs.355877; Human glucose transporter pseudogene; flone,none; 4.35; 7.95 

424633; NK/L003B94; Hs.153405; pertod ([}rosophi]a) homolog Z PAS;S&=M; 4.34; 6.23 

422573; AW297985: Hs.295726; Integrin, alpha V (vitronectin receptor; FG-GAP.lntegrloAnone; 4.32; 5.85 

418721; NM.002731; Hs.87773; protein Wnase, cAMP-dependenl. calalyti; pklnase,pl(lnase„C;SS=M; 4.31; 3.09 

412330: NM_005100; Hs.788; A kinase (PRKA) anchor protein (gravin) ; nonefrM^M;; 4.25; 1174 

421939; BE169531; Hs.109727; TAKI-Wnding protein 2; KIAA0733 protein; zf.RanBP,CUE;TM=M; 4.25; 12.54 

414774; X02419; Hs.77274; plasminogen activator, uroWnase; lcringte.tTypsln.plantthlonlns;SS=M; 4.24; 091 

418526; BE019020; Hs.85838; solute carrier family 16 (monocarboxyric; none;TM=Y;SS=M; 4.22; 5.27 

415801; R24219; Hs.278443; FcfragmenloflgG. low affinity lib, re; ig;TM=Yn 4.16; 7.22 

417866; AW067903; Hs.82772; collagen, type XI, alpha 1; CoIla9en,COU=l,T8PNJanMLG,CorA;SS=M; 4.16; 027 

445496; AB00786O; Hs.12802; devetopmeni and differentiation enhandn; SH3,anl(,PH,Ai16ap;TM»M:; 4.15; 23.43 

436075; BE090176; Hs.179902; transporter-like protein; none:TM=Y;SS=M; 4.14; a76 

414462; BE622743; Hs.301084: arfapfin 1; none.none; 4.08; 13.43 

435730; Ae020635; Hs.4984; KIAA0828 protein; AdoHcyase.TrkA-N.2-HacW J)H.C;TM=M;; 4.06; 9.12 

431681; AK000378; Hs.267566; hypothetfcjd protein FU20371; BUgar.tnTM=Y;; 4.04; 10.05 

429379; NM_014840; Hs.200598; KIAA0537 gene product pkInasB.RI01;TM=Mn 4.00; 6.35 

429061; Y14039; Hs.195175; CASP8 and FADlWiIra apoptosls regulator; IC^20.DED;TM=M;; 3.98; 5.66 

405203; ; ; NM 002086*:Homo sapiens growth factor re; SH2,SH3;TM=Mn 3.95; 17.71 

409335; NM.001502; Hs,53985; glycoprotein 2 (zymogen granule membrane; zona_pellucida;TM=M;SS=M; a94; 058 

446006; NM.004403; Hs.13530; deafness, autosomal dominant 5; none;TM=M;SSsM; 3.89; 7.59 

413899; AF083892; Hs.75608; tight junction protein 2 (zona ocdudens; SH3,PD2,Guanyl8teJdn;TM=M;; 3.84; 8.89 

438000; AI825880; Hs.5985; non-klnase Cdc42 effector protein SPEC2; none;TM=M;; 3.83; 4.22 

418054; NM_002318: Hs.83354; lysyl oxldase4D« 2; 8RCR.Ly5yLoxldase:TM=M;SS=M; 3.81; 6.45 

450285; AW383256; Hs.24752; spectrin SH3 domabi binding protein 1; SH3;TM=M;; 3.78; 8.49 

417141; U22662; Hs.347353; nuclear receptor subfamily 1. group H, m; honnonejec,2l-C4;SS=M; 3.77; 4.22 

455376: AA663904; Hs.89862; TNFRSF1 Associated via death domain; dealh;TM=M;; 3.68; 4.92 

438113; AI467908; Hs.8882: ESTs; 7tni_1.none; 3.59; 1Z12 

414883; AA926960; Hs.346669; COC28 protein kinase 1; CKS;; 3.58; 10.93 

429952; AF080158; Hs.226573; bihiWtor of kappa light polypeptide gen; pWnase.ubk|Ullln.Enteroto)dn^PH04,pkln8se.ubiquitin; 3.57; 5.10 
416088; AI077288; Hs.374374; serum/yucocortlcoid regulated kinase; none.none; 3.56; 4.60 
418478; U38945; Hs.l 174; cycIinKlependent kinase inhibitor 2A (me; ank;; 3.55; 4.52 

44856% BE382657; Hs2^m^ fiignaJ transducer and activator of trans; SH2,STAT.STAT^blnd.STAT_prot;TM=M;: 3.64; 019 

434608; AA805443; 85.179909; hypotheHcal protein FU22995; none;TM^; 3.52; 9.70 

400288; X06256; Hs.149609; fritegrin. alpha 5 (iibronectin receptor.; integri(LA.F&GAP;TM»Y;; 3.45; 044 

445350; AF052112; Hs.12540;lysophosphoiip3sel; abhydrolase_2;TM4lln 3.41; O03 

416255; AW135405; Hs.37251; ESTs; pMn8se,none; 3.41; 13.97 

40882% AW500715; Hs.57079: Homo sapiens cONA FU13267 lis. ckme PlPSICnone; 3.40; 6.97 > 

426432; AF001601; Hs.169B57; paraoxonasd 2; Ary!estBrBse;TM::M;: 3.39; 11.24 

431629; AU077025; Hs.265827; Interferon, alpha^ndudbfe protein (do; none;TM=M;SS=Y; 3.39; 5.10 

414291; AI289619; Hs.13040; G proteinH»upIed receptor 86; 7tm,1;TM=Y;SS=M; 3.38; 1025 

457329; At634860; Hs.247043; type 1 tumor neciosb factor receptor sh; Pep6dase_M1;SS=M; 3.38; 13.78 

411125; AA151647; Hs.68877; cytochrome b-24S, alpha polypep&te; none:TM»Y:SSsM: 3.36; 4.17 

443710; AI928136; Ks.9691; Homo sapiens cONA: FU23249&, done C; G^onone; 3.32: 20.33 

454294; AB000734; HS.50&40; JAK binding protein; SH2;TM»M;; 3.31; 094 

408912; AB011084: Hs.48924: K1AA0512 gene product; AI.EX2; Armadito.8eg;TM=M;SS=M; 3.29; 3.07 

426728; NM_0071 18; Hs.387689; triple functional doma&i (PTPRF Interact; SH3,tg.pklnase.PH,8pectrin.RhoGEF;TM=fct; 3.27; 14.90 

427202; BE272922; Hs.173936; intarieukin 10 receptor, beta; Tissue„fec;TM=Y;SS=M; 3.24; 4.49 

413076; U10564; Hs.75188; weel (S. porabe) homotog; pkInase;TM=M;; a24; 12.27 

425976; C75094; Hs.334514; NG22 protein; voltage.aC;TM=Y;SS=M; 313; 13.40 

417534; NM_00499B; Hs.82251; myosin IE; SH3jnyosln_head.lQ;TM»M;; 3.21; 1021 

458097; AW341 135; H3.58104; ESTs; none.SH3,PID; 3.21; 7.34 

437928; NM.005476; Hs.5920; UOP-N-acetylglucosamlne-2<plmerase/N.ac; hexoklnase,FGGY.ROK.Epim8ras8_2:SS«M; 3.20; 038 
425177; AF127577; H5.155017; nuclear receptor Interacting protdn 1; none;SS=M; 3.19; 5.09 
416094; AW995512; Hs.225977; nudear receptor coactivalor 3; none,none; 118; 4.17 

453489; AA300067; Hs,102000; hypothetical protein 0KF2p434N185; F5J8_lype_C,pkinase.Ets.F5.F8_type.C,pkinase.Ets: 3.17; 7.88 
414914; U49B44; Hs.77613; ataxia telangleclasia and Rad3 related; FAT,FATCPl3_PI4Jinase;TKNM;; 3.16; 4.71 

412767: AA233S08: H8.286241; protein kinase. cAMP-dependenl. ragtilatoc SH3*7tnL2jC8dhedn.GPSJanMcLaEGFJaminlruEGF.8uif^ 3.16; 
7.19 

415662; AW972481: Hs.170610; ESTs. Highly similar to 601887 MEKUnas; pMnase^none; 016; 7.21 
407788: AA687538; Hs.38972; tetraspan 1; tr8nsmenti8ne4;TM»Y;SS<iM: 015; 22.66 
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437175; AW968078; Hs.87773; protein kinase. cAMP-depentfenl cddyO; |Ainase,pkinasa_C»Ror»: 3.14; 11.72 

409270: BE090051: Hs.23120; PIST; fh3,pWnaso,P02.DUF139:TM=Y:S&44; 109; 7.81 

419591: AF090900; Hs.91393; Homo saptens cONA: RJ21887 fis, done H; P0Z,127;TW=M;; 3.06; 5.48 

447225; R62676: Hs. 17820; Rho^sodaled, oolted-coB containing p; PH,pWnase.HR1.nonQ; 3.04; 1105 

412692; AF0442B8; Hs.74515; wyl hydrocarbon receptor nuclear ttansi; HLH.PAS.PAC;TM=M:: 2.95; 12.28 

409274; NM_003930: H5.52644; SKAP55 homoiogue; SH3.PH;SS>M; 190; 14.62 

417707; AL0357B8; Hs.82425: aclln related protein 2/3 complex, sutwn; none:TM=Ms 2.90; 11.00 

427045; K86504; Hs.173328; protein phosphatase 2. regulatory sidxmi; BS6:TM:=M;; 2.69; 6.12 

431177; NNL003304; H5.250687; transient receptor potential channel 1; lon_trans.ank;TM=Y;: 2.89; 6.53 

443426; Ai=098158; Hs.9329; chnjmosome 20 open reading frame 1; none;TM=M;; Z87; 9.18 

418546; AA224827; ; Qt>n)c32g04.s1 Na_CGAP^Pr2 Homo sapiens ; vwa.lntegi!n^FG-GAP,none; 2.86; 9.94 

446658; W5B353; HS.2B5123; Homo sapiens mRNA full lenglh insert cDN; NDK.PH,Oxyslerol^BP:SS=M; 2,85; 14.25 

454080; AI19971 1; Hs.576; fucosidase, alpha-L- 1, fissue; A!ph^Lfucos;TM=M;SS=M: Z81; 28.84 

432874; W94322; Hs.279651; melanoma InWbllory activity; SH3;TM=M;SS=Y; Z80; 10.53 

433000; U26710; Hs.3144; Cas-Br-M (murine) ectroplc retroviral tn zM:3HC4.UBA,CbLN.Cbl_N2.Ct)i^N3;; 2.77; 10.93 

444488; AW1 92879; Hs.35566a. andenl conserved domain protein 4; none,none; 2.77; 1Z58 

417904; AI750762: Hs.82911; protein tyrosine phrephalase type IVA, m; Y_phosphatase,DSPc;TM=M;; Z76; 12.78 

425204; NiyL002436; H3.1861; memtyrane prot^ paindtoylated 1 (55kO); SH3.PDZ.Gusnyt8teJdn;SS=M; ^74; 5.71 

419282; AAB34664; Hs.29131; nuclear receptor coadivator 2; PAS.2f-C2H2.SET; Z73; 1i50 

410793; AW581906; Hs.65392; Interseclb) 1 (SH3 domain protein); SH3,e(hand.C2^H,RhoGEF,M;SS=IW; Z73; 9.84 

446081; AA972412; Hs,13755: f-box and WD40 dom^n protein 2; WD40,F-box,Rit>osomal„L14;TM=M-.; 2.71; 1Z29 

414443; AU077268; Hs.76144; plaleleWeiived growth fatorieceptor,; l9,pklnase;TM=Y;; 2.71; 10.53 

452683; AI089575; Hs.374574; progesterone marHxm binding protebi; honieobox.none; Z69; 1Z53 

423533; NIWI^014339; Hs.129751; InterteuWn 17 receptor; none;TM=Y;SS=M; Z67; 8.59 

422627; BE336857; Hs.118787; transforming growth factor, beta-Induced; FasciclinABC.tranABCjnembiane,GTP.EI=TU;TM=M;SS=M; Z67; 12,22 

453915; AA588721; Hs.12284; ribosomal protein 144; none.T-box; 2,65; 6.38 

416810: AF035606; Hs.80019; programmed ceQ death 6; elhand;TM^M;; 2.61; 13.89 

439858; AA332057; Ks.6639: hypothetical protein M6C15440; none;TM»M;SS"M; 2.58; 10.19 

449924; W30681; Hs.146233; Homo sapiens cONA: FU22130 fis. clone H; SH3.none: Z56; 19.04 

412926; AI87g076; Hs.75061: macrophage myrisloytated atanine-rich C ; MARCKS;SS=M; 2.55; 14.99 

439237; AW408158; Hs.318893; ESTs, Weakly similar to A47582 &«ell gn Furin-IIke,plclnase,RecepX*»nain,YLP,none; 152; 14.71 

409098; AA132672; Hs7984; pleckstrin homology. Sec7 and coliedito^. PH,Sec7;TM=M;; 2.51; 14.51 

413040; AA19333B; Hs.12321; sodium caldum exchanger Na_Ca.Ex:TM=Y;SS=M; 149; 9.28 

422070; AF149785; Hs.111126; pituitary lumor-transfomiing 1 inleracli;TCTP,TM=M;SSaY; 2.45; 12.49 

452289; BE568205; Hs.28827; mitogen-acthfaled protein kinase kinase ; pkinase;TM=M;; 144; 6.68 

427657; AV652249; Hs.180107; polymerase (DMA (firected). beta; none;TM=M;; 143; 7.97 

446287; BE247683: H3.1461 1; dud specificity phosphatase 11 (RNA/RNP; OSPt;SS=M; 141; 9.51 

410017; AW952426; Hs.l09438; Homo sapiens done 24775 mRNA sequence; none.none; 141; 14.01 

424756; AW504657; Ks.152931; lamin B receptor; ERG4_ERG24.FKBP:TM=Y;; 140; 5.98 

4475B0; AI953360; Hs.88201; ESTs; none.none; 136; 11.63 

426276; AW881411; H5.169078; hypothetteal protein FU23018; hormone_rec,rf-C4:TM=M;; 134; 13.34 
424441; X14850; Hs.147097; H2A histone fantlly, member X; histone,CBFD_NFYB_HMF;; 133; 1117 
429623; NM_005308: Hs.21 1 569; G protelrHXJupled receptor kinase 5; pkln3se,RGS;TM=IVl;: 2.32; 15.80 
439866; AA280717; Hs.6727; Ras-GTPase activating protein SH3 domain: rrm.ffrF2;TM=l^; 2.32; 12.48 
453648; W21493; Hs.28329; hypothelica! protein FU14005; none,nona; 131; 13.19 
443951; F13272: Hs.355835; ferritin. Fight polypepBde; PMP22„Ctaudln.none; 131; 8.51 
453327; AW500180; Hs.356109; IryptophanyMRNA synthetase; rmi.vwa.FG.GAP. 130; ia02 
439255; AA322302; Hs.lB3302; PCTAIRE protein kinase 2; none.none; 126; 10.36 

424467; AI929392; Hs.350026; Dna) {Hsp40) homolog. subfamily B, membe; DnaJ.pHnase.UBApklnase.C;SS=M; 126; 1 1.82 

426440: BE38Z7S6; Hs.169902; solute canter femHy 2 (fadiitated glu; 8ugar^tr.TM=Y;SS=M; 126; 1154 

456607; AI66019Q; H3.106070; cydiiKJependent kinase Inhibitor 1C (p5; COI;TM=M;; 125; 13.11 

423960; AA164516; Hs.136309; SHI^tainlng pmteln SH3GIB1; 8H3.non6; 120; 20.05 

424058; AL121516: Hs.138617; thyroid hormone receptor Interactor 12; HECT,WWE:TM=M;; 120; 13.38 

446644; Nft4-003272; Hs.15791; transmembrane 7 supeilamily member 1 (up: none:TVlsY;SS=M; 118; 15.68 

411218; H46440: Hs.180628; dynamin l-Uke; dynaniltL24ynaniin.GEDji0Re: 118; 13.83 

414721; X90392; H8.77091; ribosomal protein L10; ExojBndoj)hos.Ribo8om^U10eAcyflr8nsfer8se^.TMaM:8S4^ 114; 11.24 

421759; AA027968; Hs.107979: small membrane protein 1; none;TM3Y;SS=^t 114; 14.03 

416240: NM„0019B1; Hs.79095; epidermal growth factor receptor pathway; elhand,DUF164;TM=M;: 113: 1186 

435521:VV23814;Hs.6361;mItogen-acthratedprotelnWnaseklnase;none,non8;112;11.08 . 
409340: BE174629; Hs.321130: hypothelfcd protein MGC2771; 8aj)enneases.pyiWoxal.deC.brDmodomain,PHD.MBD^TJooKDDT.P^^^ 
112:14.05 

453064; R40334: Hs.89463: potassium large conductance cak:lum-acQ; none.none; 112; 8.98 

409223; AA31 2572; Hs.362852; phospholnosllide-3-kin3se, regulatory su; SHlSH3.RhoGAPjU)ne{ 108; 1 1.60 

414482; S57498; Hs.76252; endothein receptor type A; 7tmJ;TM=Y;SS=M; 106; 14.23 

414496; W73853; Hs.355424; ESTs; pWnasef 5J8JlypeLC3dhjhort,nond: 105; 13.45 

450455; AL1 17424; Hs.25035; chtoride intracelhjlar channel 4; non8.TNF: 105; 19JM 

449906; NM_005638; H5.24167: synaptobrevln-like 1; 5ynaptobrevln.NTF2;TM=Y,-; 104; 13,34 

422112; BE540240; Hs.111783; 1^1 prot^; Sm.BAG;SS=M: 103; 1160 

434935; BE551824; Hs.273369; uncharacterized hematopoietic stem/proga; none;TM=M:; 102; 10.52 

433427; AI818449; Hs.171889; choSnephosphotransferase 1; SH2,C0P-OH_P_tr8nsfJM=M;: 102; 16.87 

410850: AW362867; Hs.302738; Homo saptens cONA: FU21425SS, done C; Su!f8tB_lransp,STAS>IM6_box; 102; 9.37 

440481; AA182907; Hs.7200; Homo sapiens, done MGC:16714. mRNA. com; pklnase.RCC1 ;TM=M;; 102; 1131 

434545; AF255303: Hs,112227; membrane-assodated nudeic add binding; rWXXHgpdhAteno.ElB_55K.2K3HC4:TM4fc 100; ai5 

4101 13; AW995S64; Hs.250824; Homo sajxens cONA: FU23435 lis. done H; pklnase.none; 1.99; ia64 

414636; AL120259: Hs.76691; stennin: none;TM=M;SS«Y; 1.95; 7.72 

408176; AK001553; Hs.43436; adenytete kinase 38)pha Eke; adeny1atekinase.none; 1.95; 14.95 

422690; AU077275; Hs.119222; suppression of tomorigenidty 13 (colon ; TPR;TM=I*; 1.94; 10.91 

427881; BE538296; Hs.323834; cytochrome c oxWase subunil Va; ncne.6KAP; 1.93: 20,57 

433387; L76528; Hs.3260: preseniHn 1 (Alzheimer disease 3); Preseninn,7tm_3^xidored_q5„N;TM=Y;; 1.92; 1158 

453938; AF082569; Hs.36794; IMype cydWnteracfing protein 1; B56:TM=M;; 1.90: 1174 

433592; NM_004542; Hs.3436; deleted In oral cancer (moose, homotog) ; none;TM=M;; 1 .89; 23.27 

447791; BE2418S9: Hs.10S75: C6I-11 protein; V-ATP8seJiAm)adIilo^;TM44;; U8: 1182 

426359: AAa76409: Hs.1086^ Homo saptefo cONA: FU23313 fis, done H; adenytetekIna5e,nQne; 1.68; 14.95 
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432650; D0086Q; H&56; phosphoritxsyl pyrophosphatD synthetase ; none^none; 1.68; 1Z70 

4242501 AR)73310:Hs.143648:insUIinrewploro4strato^PHJRS;T^ ^-86; 19.50 

424482; 8E^21: Hs.149155; voltag&dependeiri ankn channel 1; EuIu)orin:SS=M: 1.85; 11.29 

425335; BE334327; Hs.296267; toflistBMcB 1; efhand,kaza!.aif/as.7tm_1:TM=M;; 1.85; 13.62 

42612% NM.006925; Hs.166975; spiidng factor, arginlnefeerine^ 5; rnn;SS=M: 1.83; 10.88 

451579: AW607731; Hs.26670: Human PACdone RPM15N1 from 22q11 J^j; Mng!B;TM=Y;SS»M: 1.83; 20^ 

428901: AI929568: Hs.146668: KIAA1253 protein; 7lnLZUPRW73.TMS.TDE:TM=Y:SS»M; 1.83; 19.00 

453963: AAO40311; H5,2B959; ESTs; pklnase,AdivliUBC|>,non8; 1.82; 15.25 

417414; AA434589; Hs.367675; dUTP pyrophosphatase; dUTPase.KRAB- 1.81; 14.20 

414521; D28124; Hs.76307; neuroblastoma, suppression of tumorigeni; DAN;TM^M:SS«M; 1.81; 2Z29 

425356; BE244879; Hs.155939; Inositol polyphosphate-SfihosphalBse. 14; Exo_eRdoj]ho5.SH2:TM»M;; 1.80; 1BJ0 

41 7733; AU)48678; Hs.82S03: Rsapiens mRNA for 7UTR of unknown piot: none;NA;NA; 1.80; 6.28 

424805; AF230904; Hs.153260; c^nleracting protein; SH3;TM=M;; 1.80; 11.99 

420747; BE294407; Hs.99910; phosphofnictoldnase, plalelet; PFK:TM=I^, 1.79; 25.25 

416819; U77735; Hs.80205; plm-2 oncogene; pklnase:SS=M; 1.78; 15l25 

437708; AB033020; Hs.5801; KIAA1194 protein; LRR,Exo_endoj)hos;TM4te 1.77; 11.11 

439877; H39685; ^.258730; tiyptase beta 1; pkln3se;SS=M: 1.77; 21.91 

440256; U23841; Hs.18851; hypothetical protein FU10875; nore.UBA.UBX: 1.76; 1^95 

425367; BE271188; Hs.155975; protein tyrosine phosphatase, receptor t; none;TM=M;SS=Y; 1.76; 21.01 

414703; BE243877; Hs.374355; ATPase, Na? transporfing. beta 3 poJypep; Na_K-ATPase;TM=Y;SS=M; 1.75; 20.03 

443593; AI344782; Hs,349261; Dna) (Hsp40) homolog, subfamily C. membe; rnn.DnaJ.TPR;TM=M;; 1.75; 13.29 

437412; BE069288; Hs.34744; Homo sapiens mRNA; cDNA DKF2p547C136 (fr; ABC«tr8n.GTP^EFTUABC.membrane,none; 1.75; 8.75 

413798; AW40B094; Hs.75545; InleileuWn 4 recepton fn3,granurin;TM=M;SSs^; 1.74; 14.73 

438438; AA257992; Hs.50651; Janus kinase 1 (a protein tyrosine lonas; pidnase,SH2.adenylatektn85e,none; 1.7% 24.10 

429655; U48959; Hs.211582; myosin, light polypeptide kinase; pkIn8se.fn3.t9,none; 1.73; 31.59 

421456; AW579842; Hs.l04557; hypotheiteal protein FU10697; zl-C2H2,0UFlB,eIhand,C2PI-PLiC-Y,PW»L(>X;TM4^: 1.73; 16.87 
444252; R21135; Hs.54985; ESTs; none,none; 1.71; 10.40 

442819; BE622721: Hs.284275; Homo sapiens PAK2 mRNA. complele cds; none,pWnase,PBD: 1.69; 14.02 
447918; AI129320; Hs,1 15175; ESTs, H^tly similar to JC5818 gamma^t; pklnaso,SAM.none; 1.69; 17.14 
429279; AB018271; Hs.198689; KiAA0728 protein; Myosin_t^.efh3nd,spectrin.GAS2.Myosin.t8il; 1.68; 14.21 
450440; AB024334; Hs.25001; tyrosine 3Hnonooxygenase/lryptophan 5^no; 14-3-3!TM=M;; 1.67; 24.67 
413423; AU076684; Hs.75350; vincufin; Vinculin.none; 1.65; 29.28 

420972; AW814816: Hs.31431; hypothetical protein FU12171; Frucl068mInJdn:SSsM; 1.65; 10.75 

416884; M60484; Hs.80350; protein phosphatase 2 (fbnneily 2A). cat; MetaOophos;SS=M; 1.63; 24.55 

436719; Y11192; HS.S299; aldehyde dehydrogenase 5 family, member ; lipocann.8tdedh.ubiquiUn,IRK;5SsM; 1.61; 11.20 

419223; X60111; Hs.l244; C09 antigen (p24); transmembrane4;TM=Y;SS=M; 1.61; 14.93 

414176; BE140638; Hs.75794; EDG.2 (endothelial differentiation, lys; 7tmJ.CRC8;TM=Y;; 1.61; a03 

431476; BE612705; Hs.256697: histidine triad nudeo6da4)inding prote; HrT;SS»M; 1.60; 24.37 

412347; AW970026; Hs.73818; ubk^uinol-cytochrDmeG reductase hinge p; UCR^hinge.G-aIpha.arf;TMsM;; 1.59; 18.09 

423804; AW403448; Hs.l706; InterfcroiMtlmulaled transcription fact; IRF.rf-C3HC4.IBR.zf-RanBP;TH4=M;; 1.59; 10.99 

426552; BE297660; Hs.170328; moesin; BandJl.ERM,pklnase.LJRR,LRRCT,MAM,Nucl8oplasmln,Tropomyo5in,OPR,iaament.b^^ 1.58; 25.97 

428216; M184B8; Hs.l83037; protein kinase, cAhAP-dependent, regulato; cNMP.bJndlng.RIIa;SS=M; 1.66; 10.58 

421251; Z28913; Hs.102946; enigma (UMdomafr^ protein); UM.PDZ;SS=M; 1.56; 13.51 

448581; NM_00270g; Hs.21537; protein phosphatase 1, catalytic subunll; none,none; 1.55; 1^33 

417098; AB017365; Hs.l738S9; frizzled (Drosophlla) homolog 7; Fri2zled.Fz.7tm_2.toxtn»2;TM=Y;SS=M; 1.55; 13.77 

437076; AA961260; Hs.5443; BCL2.«ssociated athanogene 5; BAG,Hanta_nucteocap;TM=M;; 1.54; 10.93 

426653; AA530892; Hs.171695; dual specifk:ity phosphatase 1; Rhodanese.DSPcYj)hosphalB5e;TM=M;; 1.54; 11.88 

421143; AB024538; Hs.l0217l: Immunoglobulin superfamny containing le; ig.LRR.LRRMT.LRRCT;TM=M;SS=M: 1.53; 2m 

414457; AW514320; Hs.76159; ATPase, H transporting, lysosomal (vacuo; pklnase.ATP^ynLCjwne; 1.53; 32.59 

414382; AW380339; Hs.8068; hematopoiet» PBX-lnteracting protdn; M;TKH4r. 1.52; a66 

450998; BE387614; Hs.25797; splicing factor 3b, subunlt4. 49kD: rrm;TM4ft; 1.52; 11.74 

402705: ; : activator of S phase kinase; AhpC-TSA;TM=M;SS=M; 1.51; 26.85 

426268; AF083420: Hs.168913; serineAhroonlne kinase 24 (Ste20. yeast; pklnase;SSoM; 1.50; 24.04 

414604; AU076649; Hs.76556; growth arrest and DNAdamage^uclble 3; none;TM=M;; 1.50; 14.35 

445584; AF217518; Hs.8360: PT001 2 protein; none;SS=M; 1.49; 12.00 

407232; X04526; ; gbiHuman liver mRNA for beta^ubunit sig; WD40;TM=M;: 1.49; 19.32 

424206; NM.003734: Hs.l9B241; amine oxidase, copper containing 3 (vase; Cu„amlne„oxld,(Xamlne.oxWN2,Cu_8mlne.oxldN3;TM=M:SS=M; 1.48; 13.21 
458761; AF090922; Hs.l52738; mitochondrial nbosomal protein 111; ERJumenjeceplRajosomaLL11,rebosomaLL11.N;TM=Y;SS=Mt 1.48; 12.50 
426340; Z97989; Hs.169370; FYN oncogene related to SRC FGR. YES; BNR,SH2.SH3.pkinase:TM=Y;SS=M; 1.48; 17.75 
414166^ AW888941: Hs.75789; Mfliyc downstream regutated; DEAO.helicase.C.mn.rklr,CysJmot.Tll^vwa,vwc.vwd,IQ,Rlla.dbhydrol3se,TGF 

bela.0UF139,TPR.OSPctsp_l.RibO5oma!.S21,wp;TM=M:; 1.48; 2a47 
452516; AA058630; Hs.29759; RNA POLYMERASE I AND TRANSCRIPT RELEASE ; none;SS=M; 1.46; 12.72 
414240; AL046742; Hs.75842; duakpedficity tyroslne^YHihosphoryl; pklnase;SS=M; 1.45; 14.38 
420532; AA248016; Hs.194110; hypothetica) protein PRO2730: pkinase,WD40;SS»M; 1.43; 13.92 
402575; ; ; Rho GTPase acOvatlno proleta 1; PAP2;TMaY;S8=M; 1.43; 13l71 
414765; X07854; Hs.77269; guanine nudeoQde binding protein (6 pr; &dpha,arf:TM^; 1.41; 24.62 
448423; BE390905; Hs.21 198; transkxase of outer n*ochondrialmembr,TPR;TM=M;8S=M; 1.41; 10.70 
422587; AI879352; Hs.l 18625; hexoklnase 1; hexokIna3e.hexokinase2;TM=Mr, 1.41; 19.31 
415995: NH/L004573: Hs.355888; phosphoTipase C, beta 2; C2,Pm.&Y,PI-PLC-X;TM=M,-; 1.40; 1 Ul 
416108; AL036595; Hs.42322; A kinase (PRKA) anchor protein 2; PBralemmln;TM=M;; 1.40; 13J8 
427721; AI582843; Hs.1 80455; RAD23(S. cerevblae) homolog A; ubtqul8n.UBA,inta8rin^;SS=Mj 1.39; 15j01 
417891; W79410; Hs.82887; proteb phosphatase 1, regulatory (inhlb; none;TMaMs 1.39; 15.97 
427373; AB007972: Hs.l 30760; phosphaiase. target subunit 2; 8nk:TM=Ms 1.39; 14.49 
446334; U52427; Hs.14839; polymerase (RNA) U (DNA directed) polyp; 

C0X8.SHMT.M1F.6ST C.EFIG domain,GST.N,Sl,Fz,Fri22led.C8lrefta!lin.7lm^2,rm>.PAP_BSsoc;TM=Y 
447042; AB035863; Hs.1822l7: succbiate-CoA Qgase, ADP4omung. beta ; ligase<>j\ATP-grasp,Zlp,(ySaseJjD2.GARSJ;TM=Y:SS«M; 1.37; 11.37 
427705; AI870421; Hs.l80394; signal recogniUon parlicte 14kD (homoto; SRP14,TNFR^c6;SS=M; 1.37; 2Z05 
425969; AW576265; Hs.301763; K1AA0554 protein; SH3,FCH,HR1;TM=M= 1.37; 13.68 
433572; AL046859; Hs.3407; protein kinase (cAMP^Jependent. catalytl; PKI;SS=M: 1.35; ia43 

410597; W16518; Hs.279518; amytoW beta (A4) pracuisor-like protein; Kunilz.BPTI.A4^EXTRA,Coprogen_oxldas;TW3Y;SS=M; 1.35; 22.54 
416424; Y13622; Hs.6SQ87; latent transforming growth factor beta b; EGF,TB.spidertoxln,granuGn.ANFjeceptor;SSsM; 1.34; 1109 
442603; AU)35719: Hs.303091; pleckstitn homology. Sec7 and coBedAni; PH,Sec7;TM4^ 1.34; 11.40 
418043; AVV3777S2:Hs.83341;AXLreceptorlyroslne kinase; tn3Jg.pk!nase:^ 1.31: ia79 
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439Z78; AFO77046: Hs.6518: ganglloslda expression (actor % MAP1J.C3.anribnlr8rL3:TM=M.*; 131; 15.89 
425875; AU077333: Hs,160483; eiylhrocyte menArane protein band 7.2 (s; PBP,Band_7:TM=M;: 1 .31; 17.93 
407744: AB020629: H5.38095: ATP^iindlng cassette, sub-fefrfly A (ABC1: ABC_lran,PRK;TM=Y:SS=M; 1.29; 10.95 
420679; X57152; Hs.99853; Obiinarin; CM-bela,Rbfinarin,WD40:TM=M;; 1.29; 18.69 
427397; AI929685; Hs.177656: calfnodulin 1 (phosphorylase (dnase. dett; efh8nd.RmaAD;SS=M; 1.29; 15.68 
424661: M29551: Hs.151531; protein phosphatase 3 (fonnedy 2B). cat; Metaflopha8;ThNM;: 1.28; 13.39 
428950; BE311879; Hs.194673; phosphoprotein enriched bi astrocytes 15; DE0;TM4fc 1.27: 11.15 
440820; AU)31845; Hs.356416; plakophin4; ruxra.none; 1.26: 10.65 
448153; Y10805; Hs.20521; HMT1 (hnRNP melhyllransferase, S. cerevi; NusG;SS=M: 1.25; 12.07 
447386; NiyL006289; Hs.375001: KIAA1Q27 protein: Band.41,LLWEQ^poprolein.(RS;SSsM: 1.22; 10.65 
433053; BE301909: Hs.279952; glutathione S^ransferasesutiunit 13 horn; HCC^Jsomer8se:TMsMn 15.78 
440708; AF038962; Hs.7381; vollageKJependent anion channel 3: Eidu»dn.&iterolo]diLA.PH04,none; 1.20; 14.06 
417069: AA442192; H5.374980; cytochnxne c oxidase sutwnll Vlll; a)X8,SHiy^,MIF,GST.C,EF1GLjdomaIn,GSTJ4,Sl,F2,Filzzled^ 
1.18; 16.91 

402559: ; : Rho GTPase acOvattng protein 1; PAP2:TM=Y;SS=M: 1.16; 15.49 ' 

426636; 6E242634: Hs.2055: ut}iquitin-adivaQng enzyme El (A1S9T an; 11iiF,U6ACT;Thft>M;; 1.14; 10.99 

428773; BE256238: Hs.193163: bridging (ntegralof 1; BAR,SH3;SS=M; 1.14; 11.38 

406906; Z25424; ; gb:H.S8plens protein-seHnefthreonine ki; none,ru)ne; 1.1% 12.97 

44393?. AW888222; Hs.9973; Isnsln; SH2,WW.RD.non8; 1.07; 15.41 

421996; AW583807; Hs.1460; glucagon; hormone2;SS=M; 59.35; 1.61 

414998; NM»002543; Hs.77729; oxidised low density Upoprotein (lectin; leclin_c;T]^Y;SS=M; 22.96; 4.57 

442573; H93366: Hs.7567; branct^ed chain aminotransferase 1, cytos; aminotran J.none; 21.41; 1.15 

451035; AU076785; Hs.430; plastin 1 (I Isoform); em8nd.CH^iap(in.N;SS=M; 19.25; 3.53 

408243: Y00787; Hs.624; InlerteuWn 8: HLH.PAS.IL8;TM=M;: 15.53; 4.34 

421340; F07783; Hs.1369; decay acceierafing factor for complement; sushl;SS=M; 14.84; 19.59 

422250; AA315993; Hs.105484; regenerating gene type IV; lectln_c;SS=M; 14.71; 2^ 

430280; AA361258; Hs.237868; interteukin 7 receptor. fn3.none; 14.28; 11.47 

412116; AW402166; Hs.784; Epstein-Barr virus Induced gene 2 (lymph; 7tm.1;TM=Y:SS=l\4; 12.71; 12.56 

451820; AW058357; Hs.l99248: ESTs; 7tm.1;TM=Y;SS=M; 10.18; 2.67 

418693; AI750878: Hs.87409; Ihromixjspondin 1; EGF.tsp_1,vwcTSPN,tsp_3;SS=M: 9.72; 6.94 

448105; AW591433; Hs.298241; Transmembrane protease, sertne 3; ldl_recepLa.tnfpsln;TM=Y:SS=M: 9.67; 4.06 

456266; 1^9073; Hs.198726; cold shock domrfn protein A; 7lnL2,HRM.CSD;TM=Y:SS=l\* 9.62; 2.36 

413095: AA494359; H5.30715: potassium voltaga^ated channel. IsM: none.START; &15; 2.18 

417933; XO2308; Hs.82962: Ihymidylate synthetase; lhyroidybL8ynlMR.Ml.E,MR.MLE^N;SS=M: 8.97: 5.01 

433334; AI927208: Hs.231958; matrix metailoprotdnase 28; Peptidase^M10.none; 8.71; 4.28 

418030; BE207573; H3.83321; neuromedin B; Bombesln;TM=M;SS=Y; a38; 1.55 

433437; U20536; Hs.3280: caspase 6. apoptosis-related cysteine pr ICEj>10.ICE_p20;SS»M; a31; 4i3 

449523; NIUL00057g: Hs.54443; chemoldne (OC motiO feceptor 5; 7(nLl;TM=Y:SS4^ &26: 5.49 

428513; 8E220B06; H5.184697; Homo sapiens dorw 23785 mRNA sequence; PSI.none; 8.13; 13.28 

449444; AW8ie438; Hs.351306; sdute carrier fantily 16 (monocarboxyiic; none;TM=Y;SS=M; 7.89; 7.00 

453459; BE047032; Hs.257789; ESTs; none^none; 7.40; 0.60 

436729; BE621807; Hs.351316; transmembrane 4 superfamity member 1; none;TM^Y;SS=l^ 7.29; 5.78 
426761; AI015709: H5.172089; PORIMIN Pnyoncosls receptor Inducing me; none;TM=Y;SS=M; 7.25; 7.22 

426158; NM 001982; Hs.199067; v-erb^2 avian erythroblastic Membv; Fur1n-liKe,pIdnase,Recep_l^domain,Furin-6ke.pkinase,RecepX<^n.Peptidase_M24; 7.13; 3.97 

419968; X04430; Hs.93913; interteuldn 6 (interferon, beta 2); IL6;SS3Y; 6.93; 3.43 

4571 33; M54968; Hs.351 221; v-W-rBs2 Wrsten rat sarcoma 2 viral on; ras,ldfi;SS=M; 6.90; Z85 

420344; BE463721: H8.97101; putative G proteiacoupled receptor; Methyltran5f_5:TM=Y;SS=M; 6.88; 3.10 

417874; BE616160; Hs.82829; protein tyrosine phosphatase, non^ecept; Y_phosphalase;TM=Y;; 6.42; 2,26 

427969; NM_001963; Hs.2230; epidermal growth factor (liela-urogastron; E6F,ldljeceptb.EB;TM=M;SS=H* 6.37; 1.07 

430396; D49742; Hs.241363; hyaIuronan4)inding protein 2; anlt.dealh.ZU5,EGF,kringle.trypsln.NebuIin,UM;SS=M; 5.77; 1.24 

427557; NM_002659: Hs.179657; plasminogen adhrator, urokinase reoepto; UPAR.lY6.ET,PLA?.lnh;SS=M: 5.71; 3.83 

418283; S79895; H3.83942: cathepsin K (pycnodysostosts); Peptidas8.C1;SS=M; 5.59; 38.68 

45M71; AV548509; Hs.194240; ESTs; none,none; 5.23; 1,05 

433470; AW950564; H5.351 31 6; transmembrane 4 supeifamily member 1; none;Tll4=Y;SS=M; 4.88; 4.60 
433293; AF007635: H8.32417; hypotheHcai protein MGC4309; nona;TM=M;; 4.56; 4.93 
410867; X63556; Hs.750: fibrillin 1 (Marfan syndrome): EGF,TB,wn1.EB.TIL;SS=M; 4.32: 26.87 
417512; X76534: Hs.82226; glycoprotein (transmembrane) nmb; PKD;TM=Y;SS=M; 4.26; 9.04 

414825; X06370; H3.77432; epidennal growth factor receptor (avian ; Furifrnke.pkinase,Recep_U.domain;TM=M:SS=M: 3.94; 1.16 

439180; AI393742; Hs,199067; v-crb*2 avian eryOiroblastte teui«mla v; Furin^.pkinase.Recep_l^domainJ^urin^ke.pWnase.Recep.U-doma5n,PeptkJase.M24; 3.76; 2.21 
419608; AW997938; Hs.90766; ATP^ilnding cassette, sub-fathRy C (CFTR; ABC_tran.ABCjrambfsn8;TM>Y:SS»M; 3.47: 2.24 
419749; X73608; Hs.93029: sparcteslBonecfln. cwcv and kazaMike d; ltazal.thyrogk*uIin_l;SS=M: 3.37; 7.10 
438576; AI458213; Hs.77542: ESTs; 7tm.l.0naJ; 3.15; 3.27 
428093; AW594506; H3.104830; ESTs; none.none; Z81; 3.40 

459683; AI674906: H8.199460: gb:wc73l02jc1 Ka.CGAPjP8nl Homo sapiens: nooe;TM=Y;; Z77: 1.36 
414443: AU077268; Hs.76144; plsteleMerived growQi factor receptor.; igjiMnaso;TM»Y;; Z71; 11X53 
430451; AA836472: Hs.297939: cathepsin 6; Pep8dase.C1,proj6omerase;SS^ Z28; 14.59 

428953; AA306610; Hs.348183: tumor necrosb factor receptor superfami: 603_n*bosomal,Ribosomal_L10,TNFR_c6,OEAO;; 2.21; 6.33 
435496; AW840171; Hs.265398; PAR-6 beta; nonB,none: 2.17; 2.00 

418841; 6E243136; Hs.86947; a disintegHn and metafloproteinase doma; disintegrin.Reprolysin.PepJ412Bj)n}pep,EGF:TM»Y:8S=M; 1.91; 13.06 

414521; D28124; Hs.76307: neuroblastoma, suppresston of himorigenl; DAN;TMsM;88eM: 1.81; 2129 

419452; U3363S; Hs.90572; PTK7 prot^ tyrosine kinase 7; lg.pkinase;BA=Y;SS=M; 1.52; a40 

452795: AW392555: Hs.l8878; hypolhellcal protein FU21620; 20&Fen_0xy;TM=M:: 1.49; 3.29 

432199; AI693815; Hs.127179; cryptic gene; none;TM=M;SS=M; 1.23; \m 

453986; BE148734: Hs.63325: transmembrane protease, serine 4; trypsin.kil_rec8ptiajione; 1.00; 3.92 

445418; AW139377; Hs.127179; cryptic gene; none.none: 1.00: 2.45 

451106; BE382701: H8.25960: N-MYC oncogene; HLH.Myc_NJenn:TM=M;: 1.00; 1.87 

447993; AW139S25: Hs.170362; ESTs; none.nQne; 1.00; 1.30 

TABLE 42B 

Pkey: Unique Eos probeset IdenSSer number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 
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Pkey CAT Number Accession 

405685 0_0 M18728 

414087 1632850J W19712BE247277 

400151 9575.21 BC006850 U07418 KM.000249 U07343 AL574783 BtO90482 BG6844B1 AA385302 86196167 B1091720 BG195132 AI680106 A1457552 AA402478 

BG249688 AA347119 6G755996 BG822S78 
418546 242838.1 TS9708AA224827T59843BE156903 

TABLE 42C 

Pkey: Unique number corresponding to an Eos pR)beset 

Ref: Sequence source. The 7 digit numbers in (his colunm are GenbarikhJen|]lier(GI) numbers. Dunhaml.eiai.'rereistottiepuUicaBQnenlitied'TheDI^ 

sequence of human chromosome 22.' Dunham I et a).. Nature (1999) 402:489495. 
Strand: indicates DMA strand from which exons were predicted. 
NLjxjsition: Indicates nucteoQde positions of predicted exons. 



Plcey 


Ref 


StrarKj 


l^ltjKjsltion 


408399 


9256288 


IMnus 


63448^3554 


405102 


8076881 


Minus 


120922421298 


403344 


8569726 


Plus 


70823-70990 


405555 


1552511 


Plus 


153405-153564.1&4623.154876.155Z72-15540 


405555 


1552511 


Plus 


163497'163623.164715-164g6B.165369-16S50 


405204 


7230116 


Plus 


126569-126754 


406366 


9256126 


Minus 


10639-10800.10890-11023.11113-11293 


400539 


7574902 


Plus 


8559-8721 


403208 


7630829 


Minus 


147706-147903.148667-148804 


405203 


723D116 


PhJs 


125295.125463 


402705 


8782736 


Rus 


89981-90114,90773-90895,91131-91261 


402575 


9884830 


Minus 


109742-109883 


402559 


9864273 


Plus 


33539^715 



TABLE 43A: 43 genes upregulaled in pancreatic cancer relative to norma! body tissues 

Table 43A lists Gibout 43 geies upregutated in pancreatic cancer relative to nonnal body tissues Dial are likely to encode proteins parflcutarly useful for diagnosfic or prognostic 
appfications. These genes were selected from 59680 probesets on the Eos/Affymetrbc Hu03 Genechip array. Gene expression data for each probes^ obtained from this analysis 
was expressed as average intensity (Al). a normalized value reflecting the relative level of mRNA expression. The protein products of these genes often contain one or more domains 
indicathre of have oncogenic function or of transducing intraceilular signals, or of being modulatable by small molecules. pepQdes, or antibodies {e.g. pkinase, deathHtomain, 7tm, 
phosphatasOi or lonJransportBf). Certain piedhded prot^ domsdns are noted. 

Pkey: Unique Eos probesel identilier number 

ExAccn: Exempiar accessbn number. GenBank accesskm number 

UniGenelD: UniGene number 

Pred.PrelDomains: Certain predicted protein domains. Abbreviations used: IH transmembrane domain; SB, signd sequence; ^Y. very Pkely to contdn; ^M, likely to contain; other 

protein domain abbreviaSons are from PFAM (Nucleic Ackis Research, 2002. 30:276-280). 
UnlGeneTWe: UniGene gene title 

R1 90th percentile of pancreatic cancer Alsdhrlded by the 50th percentile of nonnal tissue Als 

R2 90th percentile of pancreatic cancer Als divided by the gOth percentile of normal pancreas Als, where the ISlh percentile of flit normal tissue Als was subtracted 

from botti ttw numerator and denominator 

Pkey; ExAccn; UnfgenelD; Unlgene TWe; PredProtDomalns; R1; R2 

446619; AU076643; Hs.313; secreted phosphoprotein 1 (osteopontin. ; Osteopontin;SS=M; 44.95; 2.17 

421552; AF026692; Hs. 105700; secreted frizzled-reialed protein 4; Fz.NTR;SS"M: 35.40; 2ai3 

411274; NM_002776; Hs.69423; kaffikreln 10; trypsln;TM=Mn 30.10; 13.59 

446921; AB01 21 13; Hs.16530; small htdudbie cytokine subfamily A (Cy; IL8;SS=Y; 29.33; 16.08 

413719; BE439580; Hs.75498; small Inducible cytokine subfamily A (Cy; IL8;SS=M; 24.64; 7.21 

452281; T93500: Hs.28792; Homo sapens cDNA FU11041 fis, ctone PL; TGFbj>ropeptWe,TGF-bet8.none; 23.81; 10.74 

40781 1; AW1 90902; H3.40098; cysteine knot superfamity 1, BMP antagon; TGF-beta.DAN;SS"Y; 2Z33: 10.20 

404882; ; ; C9001188':gil12738842|reflNP_073725.1| p; none;TM=M;; 17.72; 1.40 

413554; AA319146; Hs.75426; secretogranln II (chromograntn C); Granin;TM=M:SS»Y: 17.36; 2.01 

428392; H10233; Hs.2265; secretory granule, neuroendocrine prot^ none;TM=M;SS44 16.82; 1.70 

408243; Y007B7; Hs.624; inlerteukin 8; HLH,PAS.1L8;TM=M;; 15.53; 4.34 

419216; AU076716; K5.164021; sm^ induc&le cytokine subfamily B (Cy; \[B;SS^IA 15.40; 3.70 

426242; H55709; Ks.2250; leukemia inhlbltoiy factor (cho&nergte ; UF_OSM;SS^M; 14.85; 6.58 

421340; F07783; Hs.1369; decay accelerating factor for complement; 8ushl;SS^M; 14.64; 19.59 

409757; NM-001898; Hs.123114; cystaOn SN; cystatin;SS=M; 14.61; 12.75 

425071; NM-013989: Hs.154424; deiodinase. todothyronine. ^e II; T4_detodlnase;TM=M:SS=Y: 14.35; 17.22 
414812; X72755; Hs.77367; monokine induced by gamma Interferon; IL8;TM=M;SS=Y; 13.81; 7.69 
409420; Z15008; H3.54451; lambiln, gamma 2 (ntoein (lOOkD). kalint; laminln.B.lamintn.EGF;SS=M; 13.05; 7.72 
432596: AJ224741; HS.27B461: maWfln 3; EGF.vwa;SS'M; 1180; 9.91 

422109; 873285; Hs.1473: gastrfn-releasing peptide; Bombesin.Deti3nsiiuvopep;TK^S&=t^^ 1Z79: 4.69 

421379; Y15221; Hs.103982; small inducible cytokine subfamily B (Cy: IL8;TM::M:SS»Y: 11.38; Z22 

429547; AW009166; H5.99376: FGENESH predicted novel secreted proteh; none,none; 10.25; 562 

422424; AI186431; Ks.296638; prostate difleienfiatton factor; TGF-beta;SS»M; 9.98; 1.88 

42850S; AL035461: Hs.2281; chramogranin B (secretogranln 1); 6ranln;8&«l; ^40; 3.46 

40995G; AW103364; Hs.727; \tAMi, beta A (acthfin A, ac8v1n AB a: T6F-bel8^T6Fb_propepIide.Tub;SSsM; 9.19; 16.46 

418030; BE207573; Hs.83321; neuromedn B; Bonteln;TM»M;SS»Y: a38; 1.55 

452401; NiyL007115; HS.293S2; tomor necrosb factor, aiph»lnduced pro; )(IWt,CUB;S8»M; 7.46; 4.96 
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421562: Aigi0275: Hs.350470; tretbO factor 1 (breast cancer, estioge; trefoO.Gastrin.S&'M: 7.08; 2U1 

423634; AW959308; Hs.1690; hepain-btndlng growth (actor t&nUt^ pr noneiTM^SS^M; 6.78; 12.19 

428486; AW563497; Hs.1B4604; pancrealte polypeptide; honTionB3:TMNM;SS<=Y: 6.29; 151 

443646; AI085198; Hs.164226; ESTs; EGF.tsp J,vwc,TSPN.tspJ.non8; 6.17; 4.25 

457489; A)633815; H5.127179: oypltc gene; Rone;T)yNM:SS=M; 5.19; 2.79 

450983; AA30S384; HS.2S740; ER01 (S. cerevtsteHite; i»na;SSsM: Si)1: 7.43 

422867; L32137; Hs.1584; cariilage ol^omeric matrix protein (pse; tsp.3.EGF;SSBf« 4.87; 9.40 

426322; J05068; Hs.2012; transcobalamin I (vitamin 612 binding pn CDbaIan4n.bind;SS4A; 4.71; 11.74 

414774; X02419; Ks.77274; ptasminogen activator, urokinase; l(ringle,tjyps{n,pianLthionins;SSBM; 4.24; 6.91 

426758; M433988; H$.98502; CA125 anligen; mucin 16; SEA;TM=Y;; 3.52; 8.43 

422046; NM.012445: Hs.288126; spondin 2, exbaoeOular matrix protein; tsp_1;TM=M;SS^M: 3.45; 7.69 

424687; J05070; Hs.151738; matrix metalloproteinase 9 (geiatlnase B; fri2,tiemopexln.P^0d85eJ410;SS=M; 3.43; 10.37 

417931; W95642; Hs.82961; trefoa factor 3 (intestinal); trefofl:SS=M; 198; 9.65 

445417; AK001058: Hs.12680; Homo ssfdens cONA FU10196 fis. done HE; tsp_1,Repro(ys{n.PepJil12Bj)ropep.noRe; 2.97: 5.74 . 
432874; W94322; HS.2796S1 ; melanoma Intiibltory activity; SH3;Th/NM;S8»Y; 2.60; 10.53 
431462; AW58367% Hs.25631 1; granin-Sle neuroendocrine peptide precu; none.none; 2.70; 1.99 

TABl£43C 

PItey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. Ti» 7 digit number? in ttits column are Genbsnlt identilier (Gl) numbers. "Duniiam I. et d.* refers to tlie pubRcation entitled Ttie DNA 

sequence of iiuman chromosome 21' Ounliam I. et al., Nature (1999) 402:489^95. 
Strand: Indicates ONA strand from wtilcfiexons were predicted. 
Nt4)OSiflon: Indicates nucleotide positions of predicted exons. 

PItey Strand NUxniBon 

404682 9797231 Minus 40977-41150 



TABLE 44A: 754 GENES UP-REGULATED IN RHEUMATOID ARTHRITiS COMPARED TO NORMAL BODY 

Table 44A Trsts about 754 genes up-regu!aled in rfieumatold artiiritis. These were seleclfid from 35403 probesets on the Affymetrix/Eos HuOl GeneChip. 

Pl(ey: Unique Eos probeset identilier number 

ExAocK Exemplar Accession number, Genbank accession number 

UnigenelD: Unig«ie number 

Unigene Title: Unigene gene title 



Pl(ey 


ExAocn 


UnigenelD 


UdgeneTHIe 


100042 


M10098 




AFI=X control - HUMRGEM10098.3 


101577 


M34353 


Hs.1041 


V405 avian UR2 sarcoma vims oncogene h 


103353 


X89399 


Hs.1 19274 


RAS p21 protein activator (GTPasa activa 


104743 


AA021157 


Hs.33619 


Homo sapiens cDNA FLJ20096 lis. done CO 


104996 


AA112307 


Hs.105694 


hypotiieticat protein aJ21919 


105437 


AA2S2191 


Hs.25ig9 


hypotiietical protein 


108258 


AA063269 




gb:zm02a09.s1 Stratagene corneal stroma 


109086 


AA166695 


Hs.27(J737 


tumor necrosis factor (Dgand) superM 


109279 


AA196625 


HS.86C80 


ESTs 


109779 


F10527 


Hs.3353 


bela-1,3^ucuionyitranslerase 1 (glucur 


111794 


R32647 


Hs.23545 


ESTs 


112531 


R69798 


Hs.29036 


ESTs 


112764 


R96306 


Hs.1912g0 


ESTs 


113293 


T67026 


Hs.187403 


ESTs 


115416 


AA283893 


Hs.337079 


ESTs 


116548 


D20433 




8b:HUMGS01407 Human promyetocyte Homo sa 


116565 


D45533 


Hs.129691 


tiypottietlcdi protein 1^)21603 


118104 


N5S332 


H5.39785 


ESTs 


119243 


T12603 




gb:CHR90123 Chromosome 9 exon II Homo sa 


119336 


T55340 


Hs.208238 


ESTs 


120101 


WS5414 


H5.554g7 


EST 


120715 


AA292700 




gb'2sS9a06.8l KCI_CGAP.GC81 Homo sapiens 


120872 


AA357993 


Ks.96g86 


ESTs 


121010 


AA398355 


Hs.97330 


ESTs 


121509 


AA412092 


Hs.97888 


ESTs 


121722 


AA419482 


H5.98874 


dmiiar to proOne-ridi profit 48 


122265 


AA436838 


Hs.98906 


EST 


123208 


AA4896B1 


Hs.102248 


Homo sapiens cONA: FU22105 fis, done H 


123490 


AA599723 




gb:ag1 1o07.8l Gesster Wibns tumor Homo s 


124198 


H53099 


Hs.198271 


NAOH dehydrogenase (ubiquinone) 1 alpha 


124294 


H90573 


Hs.102298 


EST 


125067 


T86429 


Hs.1 11725 


ESTs 


125153 


W38294 






125330 


AA401804 


Hs.1 14574 


ESTs 


125335 


T86620 


Hi.16230 


hypothetical protein FU20619 


125361 


T90348 


Hs.ie3404 


ESTs 


125439 


AA826305 




gb:PM(M.T0017-031299^1-c07 LT0017 Homo 


125535 


R17430 


Hs.22215 


secretogranin in 


125583 


R22272 


Hs.86022 


ESTs 


125590 


R23858 


H8.14337S 


Homo sapiens, dona IMAGE:3840937. mRNA. 


125742 


H81181 


Hs.261023 


hypotheOcal protein FLJ20958 



537 



wo 03/042661 



PCTAJS02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



125795 
125B58 
125665 
126039 
126143 
125177 
126219 
126221 
126262 
126277 
126292 
126293 
126353 
126556 
126559 
126609 
126616 
126628 
126638 
126861 
126990 
127017 
127049 
127209 
127331 
127357 
127368 
127374 
127429 
127490 
127502 
127647 
127650 
127676 
127746 
127812 
127824 
127933 
128006 
128011 
128038 
128058 
128199 
128308 
128369 
128410 
129199 
130998 
134409 
134578 
134644 
100262 
100576 
100704 
100787 
100873 
100943 
100996 
101046 
101371 
101461 
101697 
101909 
102199 
102275 
102295 
102319 
102383 
102470 
102544 
102649 
102798 
102804 
102851 
102852 
102B60 
103262 
103484 
103559 
103658 
103719 
103876 
103897 



T98190 

HI 1549 

H12876 

AA160575 

N29315 

H93164 

N36368 

AI248169 

CT5147 

N39132 

AA491328 

Z18870 

Ai243114 

AA491325 

R15866 

W87435 

AA348581 

A1357886 

AA001527 

AA742428 

AA215510 

AA740146 

AA23S966 

AA305023 

F20186 

AA452788 

AA434382 

AA448728 

M961490 

W52891 

AA614422 

AI087279 

AA873776 

D31237 

A)239495 

AA749094 

A)208365 

AA811102 

AA058693 

AI347067 

AA668782 

AI126617 

AI073548 

A)079496 

A)142639 

AA452788 

H90914 

C00810 

AA281600 

AA194724 

S83308 

D38500 

HG3044-HT3742 

HG3242.HT3419 

HG3872-HT4142 

HG4333-HT4603 

HG6aOKr880 

J03909 

KOI 160 

M13232 

M22430 

M64358 



U21128 
U30998 
U32581 
U34587 
U40622 
U49835 
U57721 
U88133 



U89942 

V00532 

V00571 

X00368 

X78565 

Y08374 

Z19585 

Z74615 

AA054109 

AA226865 

AA248870 



Hs.7756 

HsJ1066 

H5^3078 

Hs.181102 

Ks.266331 

Hs.129750 

HsJ?93483 

H1172965 

K1143764 

H5.15441 

Hs^48121 

Hs.94031 

Hs.112227 

H5.170263 

^fe.186802 

Hs.134605 

Hs.170994 

m.144432 
H5.191650 
Ks.251946 
Ks^tSII 
KS.B1964 



K5.193326 

Hs.312110 

Hs.293751 

Hs.7278 

Hs.1835a2 

Hs.148410 

Hs.261957 

H5.279938 

HS.1201B9 

Hs.291434 

Hs.127811 

Hs.303561 

85.129908 

Hs.124636 

H8.137024 

Hs.132449 

Hs.164597 

Hs.134169 

Hs.146662 

Hs.200332 

Hs.293981 

Hs.164915 

Hs.224137 

Hs,B7224 

Hs^e458 

Hs^7820 

Ks.166110 

KsJ020e3 

K5.17364 

K5.14623 

K&36S89 
Ks.76422 



H179914 

Hs.17752 

Hs.168052 

HS.66S78 

HS.1S0930 

Hs.154138 

Hs.169139 



Hs.83354 
H8.93907 
HSJ5294 

Hs.289114 

Hs.75184 

Hs.75774 

HS.17292B 

Hs.4273 

H$.8203 

Hs^sosa 



proteasome (prosomo, macropan) 26S subu 

ESTs 

hOAT4 

p30DBC protein 
hypoUietjcal pnM MGC4595 
hypoihetic9fpratinnJ10546 
ESTs, WeaMy shnilar b sfarOar to a el 
ESTs 

ESTs, Wealdy sMr b unknown (Rsapi 

Cnn (CrBmped OrosophilaHnte 

gb3a65d09^ Na.CGAP.GCB1 Homo sapiens 

6 proldnxoupled receptor 22 

ESTs 

menitjrane-associated nudelc add binding 
tumor protein p534)ind]ng protein. 1 
ESTs • 
ESTs 

tiypotlwtical protein MGC10946 
Qb:zf58g09.r1 Soares reSna N2i>41^ Homo 
ESTs 
ESTs 

PQly(A)-bindiii9 protebt, cytoplasmic 1-1 
ESTs 

SEC24 (S. cerevisiae) related gene famil 

gb:HSPO05873 HM3 Homo sapiens cDNAdone 

gbOTBOdlCrl Soares_t(rtalJetusjNb2HF8_ 

fibmblast growth foctor receptor-Gke 1 

ESTs, Wealdy similar to 138022 iiypotltet 

ESTs, Moderately similar to TPTE.HUMAN P 

cryptociirome 2 (photolyase-fiite} 

ESTs 

ESTs 

ESTs 

HSPO067 protein 
ESTs 
ESTs 
ESTs 

ESTs. Moderately 8imI]artoALU1.HUMAN A 

KIAA0591 protein 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

gb2v80d10.r1 SoarBs.totalJetusjNb2HF8 
liypoOwtical protein FU20651 
guanine nucleotide binding pnM (G pr 
small nuctear RNA activating comiiex. p 
hypothetical protein * 
SRY {sex detemiining region Y)-box 5 
postmeiotic segregation increased 2A^e 
fibronectin 1 

caldum channel, voltage^ependent. aiph 
immunogtobulin heavy constant mu 
zinc linger protein 79 (pT7) 
gb:PMO^N0019-28030a001^11 SNOOIOHomo 
interferon, gamm&^ndudble protein 30 

coagulation factor VII (semm pmthrombi 
phosphoHpase A2, group IIA (platelels. 
gb:Human rtiom^ gene, exoa 

lundcan 

phosphatidylserine^pedfic phosphdipas 
KIAA0421 protein 

corticotropin releasing hormone receptor 
X-ray repair complementing defe^ rep 
chltinase 3-like 2 

kynureninase (l-kynurentne hydrolase) 
gb:U68133 Human ceQ line Pa06A Homo s 
gb:Human endogenous retrovirus H proteas 
lysyi oxidase^ 2 
interferon, elpha 14 
corticotropin releasing hormone 
gb:Human prolactin gene 5" region, 
hexabrechion (tenasdn C. cytotactin) 
chitinase ^ike 1 (cartilage glycopnole 
thrombospondin 4 
ooilagen. type I, elpha 1 
hypolhetlcal protein FU13159 
endomembrane protein emp70 precursor ISO 
EH-domain containing 4 
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103908 


AA249437 


Hs.317403 


103985 


AA313880 


KS.99872 


104056 


AA397529 


H5.58297 


104209 


AB000221 


Ks.16530 


104386 


H41895 


Hs.144164 


104398 


HS3555 


HsJ6790 


104422 


H86856 


Hs. 132909 


104561 


R60100 


Hs.323817 


104593 


R81267 


Hs.98640 


104543 


AA0O4701 


Hs. 18978 


104673 


AA007633 


Hs^lO 


104681 


AA009832 


Hs.34500 


104711 


AA0172S4 


Hs.32794 


104812 


AA034111 


Hs.124187 


104877 


AA047437 


Hs.22968 


104886 


AA053348 


Hs.339699 


104924 


AA058532 


Hs^8774 


10S071 


AA136532 


Hs.29475 


105105 


AA151872 


HsJ7016 


105203 


AA195660 


Hs.7682 


105317 


AA233926 


Hs.52620 


105617 


AA280687 


Hs.4069 


105707 


AA291012 


Hs37617 


105754 


AA3026S7 


Hs.192028 


105770 


AA347964 


Hs.269873 


105882 


AA400292 


H5.81988 


105883 


M400490 


H9.334907 


105890 


AA400766 


HsJ0S12 


106080 


AA418046 


Hs.35124 


106090 


AMI 8909 


Hs. 169333 


106096 


AA419B09 


H8.170121 


106124 


AA423987 


Hs.7567 


108308 


AM36186 


Hs.30662 


106438 


AA449199 


Hs.21342 


106660 


AA460936 


Hs.27056 


105731 


AA465657 


Hsu^205 


106860 


AM8888g 


Hs.32425 


107055 


AA6001S2 


Hs.29419 


107151 


AA621169 


Hs,8667 


107183 


C20974 


Hs.12114 


107231 


059299 


Hs.34727 


107490 


W74158 


Hs.103189 


107572 


AA001903 


Hs.59962 


107620 


AA005039 


Hs.60171 


107801 


AA019433 


Hs.285803 


107817 


AA020781 


Hs.60847 


107823 


AA021057 


Hs.60836 


107857 


AA024687 


Hs.61208 


107882 


AA025630 


Hs.231987 


108005 


AA037769 


Hs.194293 


108092 


AA045981 


Hs.184029 


108115 


AA047291 


Hs.1 65216 


108214 


AA058661 


H$.60764 


108382 


AA074885 


Hs.67726 


108409 


AA075S78 




108436 


AA078801 




108625 


AA101983 


Hs.283022 


108631 


AA1 02553 


H5.334337 


108763 


AA127539 


H3281397 


108852 


AA133131 




108931 


AA147186 




108976 


AA151480 


Hs.91202 


109026 


AA157811 




109170 


AA1 80352 


Hs.191472 


109303 


AA206126 


Hs.269291 


109326 


AA210719 




109345 


AA213774 


H$^03396 


109404 


AA224594 


Hs.86941 


109473 


AA233151 


Hs.81796 


109725 


F10003 


Hs.79658 


109794 


F106B4 


Hs.23687 


109835 


H00615 


Hs.170044 


109896 


H04794 


Hs.30489 


109918 


H05641 


Hs.216701 


109950 


K08200 


Hs.268770 


110078 


HI 5054 


Hs.318773 


110182 


H20402 


Ks.31746 


110213 


H23216 


Hs.86905 


110310 


K38209 


Hs.32728 


110354 


H41280 


Hs.22588 


110413 


H48124 


Hs779454 


110422 


H48467 


Hs.36094 


110433 


H49425 


HS.301Q82 



hypotheQcdlprateinMGC2744 
tetal Alzheimer anSgen 
aa8prot^ 

small indudbie cytoMne subfamily A (Cy 
ESTs. Moderately sNIar (o ALU8_HUMAN A 
ESTs. WeaMy similar to pulaliva p150 (H 
ESTs 

DKFZP547E1010 protein 

hypothetical protein FU21069 

Homo sapiens cONA: F1J22822 fis, ctone K 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens done IMAGE:451 939, mRNA se 

growth differenQsOon factor 11 

ESTs. VfeaMy sinflar to 138022 hypoOwO 

ESTs 

hypotheScsl protein F1J22930 
ESTs 

intQgnn.beta8 

ghicocoriicoid modulatory element bindin 
ESTs, We^ sfanBar to A53833 myosin I 
ESTs 

Homo sapiens ctone IMAGE:297403, mRNA se 
disabled (Drosophila) homolog 2 (mitogen 
Homo sapiens, done MGC:17333. mRNA. com 
Homo sapiens mRNA for KIAA0556 protein. 
ESTs 

hypothetical protein DKFZp761E2110 



Homo sapiens cDNA: FU21962 fis. done H 

ESTs 

ESTs 

KIAA1264 protein 

alpha Integiin binding protein 63 

ESTs 

ESTs 

ESTs 

vanini 

ESTs. ModeiBtely stmSar to 138759 zinc 
ilpopoiysaodiadde spedlic response 
ESTs 
ESTs 

Homo sapiens cONA FU10674 fis, done NT 

ESTs 

ESTs 

ESTs 

A111 fused gene from 5q31 

ESTs. Weakly strnfiar to 154374 gene NF2 

hyp^riheHcal protein DKFZp761A052 

ESTs 

ESTs 

macrophage receptor with collagenous str 
(^2m88h03 Jl Stratagene ovaikn earner 
gb:ziTt94a09.s1 Strdageneoolon HT29 (937 
triggering receptor expressed on myeloid 
ESTs 

hypothetical protein AD034 
gb3m25d03^1 Stratagene pancreas (^3720 
gbzo38d01.s1 Stratagene endotheHal eel 
ESTs 

gbzo35d07.s1 Stratagene colon (937204) 
ESTs. WeaMy simnar to ALU1.HUMAN ALU 
ESTs 

gbx(88e04^1 NCI_CGAP_GC61 Homosapiere 

ESTs 

ESTs 

ESTs 

casein Mnase 1, epsilon 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cONA DKFZp564lOB16 (f 
ESTs. Weakly similar to 2004399A chromos 
KIAA1836 protein 

hypotheOcal protein DKFZp547F072 

ATPase. H^ transporting, lysosond (vacu 

EST 

ESTs 

ESTs 

ESI 

UDP«N-8cetyt^ha-l>gaiactosambi6:pdyp 
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110434 
110653 
110750 
110827 
110829 
110917 
111100 
111112 
111179 
1111B5 
111223 
111275 
111443 
111573 
111590 
111671 
111732 
111809 
111829 
111944 
112015 
112023 
112055 
112334 
112340 
112353 
112457 
112478 
112533 
112588 



H49446 
H58934 
N20522 
N30077 
N30198 
N46363 
N62S22 
Ne3281 
N67239 
N67551 
N68921 
N70970 
R01901 
R10305 
R11157 
R19368 
R25153 
R33616 
R36070 
R40606 
R42836 
R43020 
R43521 
R56239 
R56602 



R65706. 
R66067 



R77302 



Hs«26299 

Hs.124990 

Ks.30961 

Hs.14855 

Hs^62S 

Hs^lTO 

Hs^0450 

Hs.35452 

Hs.10750 

Hs.12844 

Hs.334838 

Hs.35006 

HS.1856B3 

Hs.75425 

Ks.22g084 

H5.163813 

Hs.24688 

Hs.21263 

Hs^3198 

H$.236223 

Hs.26139 

Hs.206469 

H8.8904 

Hs^13 

Hs.28664 



EST8 
EST8 
ESTs 
EST5 
ESTs 
ESTs 

BCM-like membrane protein precursor 
ESTs 

asporin (LRRdass 1) 
EGF-Cka^loman, multiple 6 
KIAA1866 protein 
ESTs 

gb:Homo 88(^ endogenous retrovlnB W 
ESTs 

ubiQititln Bssod^ed protein 

Homo sapiens cDNA FOJ1 1666 lis. done H 

ESTs 

EST 

gtriiomo sapiens fuQ lengih insert cDNA 

suppressor of potassium transport defect 

ESTs 

EST 

ESTs 

ESTs. WeaMy similar to ALU6.HUMAN ALU S 
Igsuperfamily protein 

Homo sapiens mRNA; cDNA DKFZp58SF1323 (f 
gb7i16g12.8l Soares placenta Nb2HP Homo 
ESTs 

gb:yi47f03.5l Soares placenta Nb2HP Homo 
Qb:ifl75h08.s1 Soares placenta Nb2HP Homo 



112595| 


R77783 


nS.ZZ4U4 


proteasOi serine^ 12 (neurobyp^i moto 


112676 


RB6976 


HS^OdO 


CO IS 


112744 


R93206 


|J_ <9Q4TeO 


cd 18, vvQBKiy simHai TO loowu nypoineu 


112777 


R95869 


Hs.35467 


EST 


112817 


R98491 


Hs.14584 


ESTs 


112902 


TC3262 


ris. 129190 


numan unik sequence nom none iv^iwdw i 


113009 


T23699 


Hs.7246 


ESTs 


113151 


T51620 


Hs.9326 


COT 

col 


113297 


T67161 


H8.13059 


ESTs 


113398 


T82280 


Hs.87016 


nypotneUcal protein FU22938 


113484 


T87795 


ns.iOr«><tg 


CO IS 


113769 


U55966 


Hs.22985 


a]pt)a2,8<lalyltransferase 


113794 


W37382 


K3.11090 


roembiBno^panning ^domains, subfamily A 


113971 


W86760 


Hs^59172 


ESTs 


114086 


Z38152 


Ks.26920 


ESTs 


114178 


Z390&3 


Hs.17930 


chromosome 6 ofien reading frame 11 


114206 


Z39294 


Hs.27339 


EST 


114371 


Z41835 


Hs.27810 


ESTs 


114428 


AA017130 


Hs.84790 


K1AA0225 protein 


114466 


AA026970 


Hs.135150 


lung type-( ceB membrane-assodated gly 


114625 


AA084362 




gb:zn05b10j1 Stratagene neuron (937 


114862 


AA235174 


Hs.106432 


Homo sapiens cONA FU13410 Us. done PL 


114908 


AA236545 


Hs.54973 


Gadhefli>4ikB protein VR20 


114973 


AA250845 


Hs.87762 


ESTs 


115009 


AA251561 


Hs.48689 


ESTs 


115055 


AA253005 


Hs.61753 


ESTs 


115098 


AA256161 


H3.161729 


ESTs 


115321 


AA280805 


Hs.191540 


ESTs 


115385 


AA2e2540 


88.109694 


KIAA1451 protein 


11&I66 


AA287008 


Hs.285655 


ESTs 


115479 


AA287S96 


HS.2781B8 


ESTs. Moderatety stmilar to 154374 gene 


115663 


AA405838 


Hs.40507 


ESTs 


115689 


AA410645 


H3.199014 


ESTs. Moderate^ s!n«ar to ALU7J1UMAN A 


115748 


AA418835 


H8.90286 


ESTs 


115810 


AA426026 


Hs.187815 


ESTs 


115827 


AA427890 


Hs.83583 


actlA rotated pnrtein 20comiri6x, sutxm 


115881 


AA435577 


Hs.184942 


GprotelrKOiipled receptor 64 


116146 


AA460708 


Hs.62905 


liypothe8calproldnFU14634 


116257 


AA481493 


Hs.88537 


ESTs 


116365 


AAS21080 


HS.4676S 


ESTs 


116941 


H77395 


Hs.39749 


ESTs 


116982 


H81933 


Hs.312582 


ESTs 


116995 


f^28 




(^7564ti03.8l Soares reDna N2b4HRHomo 


116997 


H84214 


H3.40594 


ESTs 


117016 


H87171 


H&52t70 


ESTs 


117097 


H93608 


H8.41919 


EST 


117101 


H94043 


Hs.24341 


transcriptional co«iIvalor with POZ<U 


117238 


N20815 


Hs,173337 


ESTs 


117303 


N22776 


Hs.264079 


ESTs 


117399 


N26480 


H8.43805 


iQioma HMGICfu^ partner-Qke 3 


117503 


N31963 


Hs.44288 


ESTs 


117544 


N33222 


Hs.44451 


ESTs 


117594 


N34929 


Hs.171984 


ESTs 



540 



wo 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



llftKf 


knell's 


u» iiii7no 
riS.44/09 


coiSiVVBaxiy wnwoF 1) sMUOr nypoinei 




N38970 


lie KMOIjI 


ESTs 


1 1 f 030 






Col 


lUObf 


N4Ug/b 




gD.jyoUOUO.Sl Ow«i ca_JliUUipic^SC<C4 ua*a_ 


1 ir/DO 


Mil7Df\7 


nS.40/0/ 


CCT 
col 


lifOUr 


N48701 




COT 

col 


147Q4fi 






0tryy77805.s1 So3iGS_iTHillip!B_8d6fOs)s. 


117B82 


N50101 




nypotnctjcsi proicin rriKiu 1 


11/90/ 


N51935 


nS,4/iJ/4 


nuino saptafis cuiv\ ruJ uooi us, cions rL 


118074 


N5418B 


HS.IJUJZJ 


rfOinO sapiens, CIOnB IMnVsc.09MJ4;i^ (nKlvV 


118114 


N56875 


nS.143212 


nurt^n C /111. Ajuujitiitliil 

cystaun r (leuKOqfSiSDni 




IJC007C 

Nooz/D 


Ue 0901 1Q 

nS.Z4Mll9 


CCT 

col 


118270 


N62868 


LI— Aoce^ 

Ks.48653 


ESTs 


118291 


N63076 


Hs.1 38746 


EST 


118358 


N64017 


tj_ 4 A ACM • 

H5.1 44633 


nypotnsucai protein UKr zp4j4r£>Zic 


118383 


N64529 


U- JQAA4 


COT 

c5T 


118412 


N64856 


Hs.97437 


centrosomal protein 1 


116433 


N00Z4o 




COT 

col 


118600 


N69222 


Hs.238936 


cSTs, WeaEiy slnvlar to (deiTine not av 


118641 


N70298 


Hs.49829 


ESTs 


118643 


N70324 


tj_ A no An 


ESTs 


118695 


N71781 




iwvvi 199 protein 


118915 


K91481 


Hs.54713 


ESTs 


119041 


R02S91 


Hs.284294 


^ * -■- - ■ - J ,. — I.. ^ ^Jii.f JiIj A*lrt J 

Breakpoini cnBier region prot^ uteri 


119069 


KZ7d19 


nS.231D4D 


EST 


119105 


R42357 


Lla n4 ACQ 

nS^1453 


ESTs 


119154 


Rfi1293 




gD:yn07a0a8l Scares infant bran INIB H 


119241 


T12559 




gb:CHR90079 Chromosoma 9 aocon 11 Homo sa 


119269 


T16367 


i»_ coot 


EST 


119310 


T40427 




gbiya01a08t.s2 Strateoone lung (937210) H 


119345 


T63474 




CCT 

col 


119353 


TCCQC7 


Lie AOtAM 

nS.l 07402 


CCT* 

colS 


11!m9U 




nS^49/ IZ 


CQTe UUaaWiv cimllw In Al 1 11 MIIMAN At 1 1 


119423 


799544 




CCTe Ma>«l>ti» ewmla* Ia At 1 M Ul lUAIJ Ai 1 1 

bo 1 s, weaxiy simuar lo ALUi_riuivi/wi /\lu 


119<IZ0 






CCT 

col 


119529 


W3o05j 






119795 


W73370 


HS.3397Z2 


col 8, nignly amaar to S03917 rit)rDnec 




WU7JI0C7 




CO 18 


119831 


WToOsO 


l_J_ caA4A 


DKfZPS86l2024 pruti^ 


119930 


W86471 


1 CICOjI 

nS. (510/4 


hypocrelin (oraxin) raceptor 2 


120039 


W92548 


nS.349o5 


ESTs 


120256 


Ml 69801 


Hs.9o710 


nypotheljca protein 


120284 






Qb:zp54o11.8l Stratagena NT2 neuronal pr 


120350 


AA21 1300 


Li_ 4noe4A 

ns.106o14 


KiAAOoz/ proiein; [jrosopniia muiupie as 


120379 


AA227849 




gD:DKrZp434B182<„r1 434 (synonymclUBSJ) 


120383 


AA228030 


H$.1 23122 


FSH pnmary response (LRPR1 , rat} nomob 


120420 


AA236031 


nS.112oo5 


spinal conJ^Jerived growth factor-B 


120437 


AA243427 


H3.104311 


novel piotan With MMA domain 


120461 


AA251301 


HsJ£93359 


ESTs 


120594 


AA2B2054 


Hs.5094 


ring fingor protein 10 


120611 


AA284178 


t 1. A A AA4l^ 

Hs.1 10637 


liomeo hem A10 


120626 


AA285064 


1 1_ A A A AAf 

Hs.1 04485 


EST 


1208S8 


AA291503 


HS,97249 


ESTs * 


120747 


AA302976 


n$.9oo72 


ESTs 


120749 


AA303235 




gD:BST14544 Tesus nimor Homo sapiens cu 


120752 


AA311972 


Hs.22895 


nypotneijca] protein RJ 23548 


120B51 


AA349dd2 


11- 4^A4AB 

Hs,1 74248 


ESTs 


120866 


AA3S0710 


1 1« AA 4 A*rA 

Hs.291272 


ESTs 


120949 


AA397830 


Hs.98347 


coTs» weaKiy sirniar to jcojUo tesos^p 


120996 


AA^^I 


HSy308114 


ESTs 


121038 


AA39B536 


H3J7365 


ESTs 




AAJaoooo 


n5.9/oW 


CCTa 
to IS 


121067 


AA398662 


Hs.97302 


ESTs 


121071 


AA39o678 


Ks.139355 


ESTs 


121082 


AA3^22 




gDTt75n07 jl Soape8,je8Ds_NHT homo sap 


121172 


A A JAAA J A 

AA400013 


Hs.97750 


EST, Weaoy smwar lo MPL3 RAT MCROTUB 


121191 


AA400205 


Hs.1 04447 


ESTs 


121354 


M405384 


Hs.193737 


ESTs 


121393 


AA405981 


Ks.262643 


ESTs 


121399 


AA406059 


Hs.332700 


EST 


121479 


AA411911 


IJi> 004 4 ft 

Hs.9ol10 


ESTs 


121498 


AA412033 


H3.178045 


ESTs 


121704 


AA418743 


Ks.98308 


KIAA1B62 protein 


121738 


AA421131 


Ks.148515 


Human dona 23564 mRNA sequence 


122198 


M435692 


Hs.97541 


ESTs 


122220 


AA436011 


Hs.98187 


ESTs 


122250 


AA436692 


Hs.988g2 


EST 


122279 


AA437209 


Ks.234016 


ESTs 


122266 


AA437259 


Hs.104944 


ESTs 


122330 


AA442870 


Ks.98628 


Homo sapiens, done IMAGE:4214491.mRNA, 


122338 


AA443311 


Hs^8998 


ESTs 


122355 


AA443789 


Hs.19978 


OQ^ protein 


122590 


AA453264 


Hs.99310 


ESTs 
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122746 


AA458791 




122805 


AA460702 


Hs.82772 


122841 


AA461536 


H3uI88908 


122899 


AM69960 


Ha.178420 


122933 


AA476728 


Hs.107537 


123005 


AA479726 


Hs.52620 


123142 


AA487504 


Hs.105718 


123153 


AM88349 


Hs.334808 


123168 


AA488881 


Hs.105218 


123188 


AA489092 


H8.1 77726 


123276 


AA491270 


Ks.187946 


123305 


AA496133 




123328 


AA496968 




123450 


AAS98913 


Hs.1 11207 


123464 


AAS99014 


HS.1S3321 


123850 


AA609332 


H5.180696 


123700 


AA609806 


Hs.191956 


12385B 


AA620821 


H5.112911 


123863 


AA620873 


Hs.1 12916 


124046 


F10243 




124059 


F13573 


H3.283713 


124196 


H52617 


Hs.144167 


124197 


H52921 




124229 


H62793 


Hs.268945 


124230 


H63111 


Hs.6655 


124241 


H65947 


Hs.1 65355 


124251 


H68286 


H$.107924 


124400 


N30597 


Hs.179152 


124416 


N34042 


Hs.271674 


124570 


N67117 


Hs.102808 


124575 


N68168 




124588 


NB9197 


Hs.191361 


124598 


N70294 


Hs.269137 


124655 


N93176 


Hs.102914 


124706 


R07499 


Hs.193812 


124848 


R60135 


Hs.203498 


124882 


R74041 


Hs.101539 


124898 


R82846 


Hs.2737e9 


125086 


T91161 


Hs.173880 


125145 


W38001 




125216 


W73409 


H5.103185 


125342 


AI055916 


Hs.133552 


125351 


T96520 


Hs.324746 


125419 


AI076822 


Hs.134544 


125424 


T99667 


HS.18S64 


125526 


R14487 


Hs.17110 


125539 


R17870 


Ks.248120 


125633 


AA908225 


H3.15463 


12S689 


R48940 


Ks.108043 


125707 


C14616 


Hs^122 


12S790 


AA86832S 


Hs.99982 


125878 


AA324967 ' 


Hs,7298 


125969 


R94247 


Hs.88414 


125970 


A1400964 


Hs.177616 


125975 


AA49S691 


Hs.152290 


125985 


H54857 


H8.35981 


126018 


K54866 


Hs.1 67583 


126032 


H59735 


Hs.269065 


126059 


K66582 


Hs.308488 


126107 


H791S5 


Hs.93361 


126154 


AK)04105 


Hs.190488 


126199 


A]000492 


H5.125829 


126207 


W77936 


H8.83583 


126227 


N27236 


Hs^9034 


126269 


AA830432 


Hs.44701 


128373 


F11606 


Hs,6079 


126378 


AA347642 




126383 


AAS85594 


H5.6298 


126403 


K73388 


Hs.125976 


126525 


AAB84833 


Hs.168432 


126527 


AA548559 


Hs.1 03853 


126566 


W6724S 


Hs.103142 


126583 


W92895 


Hs.279746 


126610 


AA460338 


Hs.191391 


126622 


AA699443 


Hs.193213 


126633 


AA206993 


H3J15367 


126727 


AA037230 


HS.13S084 


126762 


AA0&4671 




128775 


S86382 


Hs.957 


126783 


AA126047 




126882 


AA761143 


Hs.250581 


126945 


R51877 


Hs^S 


126988 


AI311457 


Hs.99472 



gtKda88c02.s1 Stmtagene Cati reSna 93 

coHagen. type XI. alpha 1 

WAS protein My, mernber 2 

ESTs. rtghly sImBar to WASP WeracUng 

chromosome 7 open reading frame 2 

Intsgnn, beta 8 

EST 

hypolheikal protein MGC4189 

EST 

ESTs 

ESTs 

gb:zv51e12.s1 Soares_tesOs_KHT Homo sap 
Qbaa42g03.s1 Soares_NtiHMPu.S1 Homosapi 
ESTs 

Homo sapiens cDNA FU10577 fis, done NT 

ESTs 

ESTs 

EST 

ESTs 

gb:HSC3CC122 nonn^ infant brdn cDN 
ESTs. Weakly sfmHar to S64054 hypothed 
ESTs 

gb:yq7Gc09ji1 Soares fetd tiver spleen 
ESTs 

Homo sapiens EST from clone 208499, fiill 
ESTs. Moderately similar to ZN91_HUMAN Z 
ESTs 

toH-fike receptor 7 

ESTs 

ESTs 

gb'j^l IcOl .si Soares fatal fiver spleen 
ESTs. WeaMysImiiar to 138022 hypolheQ 
ESTs. WeeUy similar to AS6194 Uuomboxa 
ESTs 

ESTs. Weakly similar to AUJ8_HUMAN ALU 

EST 

ESTs 

ESTs 

interleukin 1 receptor accessory protein 

ESTs 
ESTs 

alpha-2-H&glycoprotBin 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA 0KFZp434C2016 (f 

G protein-coupled receptor 21 

Homo sapiens, done iMAGE:2959994, mRNA 

Friend leukemia vlnjs integraiton 1 

WM inhibitory fador-l 

proteoglycan 2. bone marrow (naturd k!l 

blphenyl hydmiase-like (serine hydmias 

BTB and CNC homotogy 1. basic leucine zi 

tiigii density Bpoprotein binding protein 

ESTs. Highly similar to JC2463 vasoactiv 

ESTs 

ESTs 

ESTs. H^ly simiiar toKIAA0349 (Ksapl 

ESTs 

ESTs 

Homo sapiens, Simlar to nuclear tocaliz 
EST8 

actin related protein 2/3 complex, aubun 

ESTs 

ESTs 

6 cell RAG associated protein 
gb7y62a11.s1 Soares_mulfip!ej5clerDsi5„ 
KIAA11S1 protein 

ESTs, Weakly similar to S71949 metetlopr 
ESTs 

hypothetical protein FU20043 
ESTs 

var^iloid recaptor4ik8 prolc^ 1 

ESTs 

ESTs 

Homo sapiens, Similar to hypoOtetlcal pr 
cystaOn C (amyk^ angiopalhy and cereb 
gb2m13b04.s1 Stratagene pancreas (93720 
putative opioid receptor, neuromedin K ( 
gb2n09d10.s1 Sb^enehhfT neuron (937 
SWI/SNF relaled, matrix associated. ac8 
ESTs 
ESTs 
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127D70 


AAS41612 




CCTb 


127087 


AA3B0418 


U* DOflil 

HS.0QU12 


onr2 iniBracunQ iransniBmaranB BaaptuT 


127187 


AA297138 


HS.2U7422 


bo T &i woany stniudf 10 o r 1 i}<i9 ineisi^^ 


127215 


A1246377 




csTokMipGifuscinosis, neuronal 8 (epSd 


127229 


AA316181 


Hs.6id35 


^transmennbrsns epithalial antigen of 


127278 


AA342715 




qd:coT4oju9 revBi spieeii Homo sapiens cu 


127299 


AA3oG710 


HS.15B4oO 


CCTb 


127325 


AA333073 


nS.l2DQ99 


CCTe 

cols 


127347 


AA428350 


Hs.5B3o9 


hypolnewai protBtn mcc^uw 


127401 


AA921944 


Hs.1 27639 


ESTs 


127420 


AA599582 


Rs.82171 


Homo sapiens cione 191 B7 pla^taoxpras 


127438 


AI224421 


Hs.77100 


genera transcnpuon lacwr iic, poQ^ 


127441 


AA835684 


Hs.287601 


Honu> sapiens cDNA rUl w30 ns, done Tn 


127449 


A1421666 


H3.75722 


ribophorin H 


127493 


AAB08081 


Hs.291701 


ESTa 


127505 


AA594244 


Hs.292245 


ESTs, WeEHdy sbnuar to ALUIJHUMAN ALU S 


127620 


A10Z5699 


HS.1162W 


ESTs 


127623 


AA773234 


HS.271B77 


anglopolelin-fiks 2 


127633 


AI339609 


H8.268538 


potassium vdlage^ated channel, Isk-rel 


127701 


AA935466 




gb:zt84c06.s1 Soares_ptnealjlano„N3HPG 


127713 


AA668322 


Hs.1 50683 


ESTs 


127722 


AA700444 


Hs.189186 


ESTs, WeaUy sinular to ALUD_HUMAN HI! 


127733 


AA704680 


Hs.1 89005 


ESTs 


127816 


AA743646 


Hs.120604 


ESTs, Weakly siiniar to YAuZLHUMAN HYrUT 


127966 


AI493406 


Hs.292514 


CCTs 

boTS 


127973 


AI336794 


Hs.129117 


ESTs 


127989 


AA909267 


Hs.1 32413 


ESTs 


127997 


AI2B1S49 


HS.311D54 


rttxro safxens mRNA luii lOTgui insen CUN 


128016 


N92597 


Hs.82689 


tunw rejecDon antigen i9P9o) 1 


128037 


AAB68394 


4 n4 4 on 

HS.1B1129 


CCT* tAfanLIu stmilwln C100CP MunlBett 

cbTs, weaKiy sinuiar to siiistn cyniesii 


128053 


T65605 


Hs.o5377 


cSTs, Mooeraiety similar to KIAAI J99 pro 


126066 


A^4838 


IJ_ 4 DA4^4 

Hs.1o9171 


ESTs 


128071 


AA6o93So 


HS. 109241 


CCTe 
COlS 


128091 


AA904559 


Hs.1 29329 


ESTs 


128113 


AI341423 


LJ» 000404 

H5.26o433 


neurotrin^n 


128145 


Ai498467 


fcl^ 4^&&fA 

Hs. 166659 


solute carrier fani3y 4, sodium bicartton 


128167 


AA932961 




uncharacterized hematopoietic stem/progo 


128195 


AI143666 


Hs.1 27778 


ESTs 


128265 


T95851 


Hs.17691 


ESTs 


128283 


Ai07o570 


LI- 40400 


CCTb 

cSIS 


12B309 


A1457235 


Hs.l 00479 


CCTa 


128313 


AI051250 


Hs.1 67775 


CCTo 

bolS 


128346 


A!088907 


1 l_ 4 ^A4 AA 

Hs.160189 


ESTs 


128359 


A1096526 


Hs.270244 


ESTs, Weakly similar to loouZ2 nypoitietl 


128369 


F12681 


Hs.30445 


U^^^ \ .lAAlA 14 A^tt^ ft* -* AIT" 

Homo sapiens cuNA FU 14687 OS, Clone NT 


128371 


H12876 


Hs.2a307B 


hOAT4 


128421 


T77876 


Hs.268589 


ESTs 


128453 


X02761 


Ks.287620 


fibronecOn 1 


128498 


T63496 


Hs.32944 


biositM polypnospnate^pnosphaiase, ty 


128S14 


H84261 


Hs.301693 


Homo sapiens, done IMAGE:3638994, mRNA, 


128551 


K090S8 


HS.27B398 


NAA1 117 protein 


128683 


AA316862 


H8.9605 


cleavage and polyadenytation specific fa 


128731 


AF005271 


Hs.104555 


neuropeptide FF-amide peptide precursor 


128643 


AA234141 


HS.27S675 


Itatanin poO (WlMO^ontaining) subunlt B 


128988 


AM1 1040 


1 1_ AAA4 Jf\ 

Hs.294140 


ESTs 


129016 


W84524 


4aJ4nJ 

H3.1 84194 


fransmembrane 4 superfantily member 5 


129021 


AA426406 


Hs.173081 


1/1 A A Af <9 A ^m^^mX^ 

ICAA0530 protein 


129095 


L12350 


Hs.1 08623 


Uuombospondin 2 


129171 


AA234048 


Hs.7763 


catumenln 


129188 


M30257 


Hs.1 09225 


vascular cefl adhesion moiecule 1 


129410 


U25987 


Hs.272620 


pregnancy spedlic beta-1 -glycoprotein 9 


129467 


AA410311 


H3.44208 


tqrpotnetical protein FU23153 


129518 


AA369807 


Hs.1 12238 


ESTs 


129534 


R73640 


Hs.1 1260 


tiypcnnettcai protein puii2d4 


129632 


L27213 


Hs.1176 


solute carrier farnily 4, anion exchanger 


129691 


X06700 


HS.119S71 


collagen, type 111, alpna 1 (Ehleis-uani 


129881 


AA458952 


11. 4 A4 JAA 

Hs. 181408 


nypolneScal protam FU 22301 


129990 


N30316 




gbiyw75b05.s1 Soares_ptac8nteL6tD9iKeeks. 


130049 


V01515 


Hs.1460 


glucagon 


130171 


AA454177 


Hs.245257 


ESTs, Weakly similar to A4601O X-lnked 


130411 


AA3(J9009 


nS.lDsslU 


t^AAniTQ nana nrrw4ii/«( 

iWvw 1 1 0 gciiQ pioDUCi 


130479 


R44163 


Hs.12457 


tiypothetical protein FU10814 


130511 


U2137 


Hs.1584 


cartilage oUgomeftc matrix protein (pse 


130521 


U92971 


H5.194351 


coagulation factor 11 (thromlnn) recepto 


130645 


AA020942 


Hs.17200 


STAM^ilce protein containing SH3 and ITA 


130655 


N92934 


H3.17409 


cysteine-rich protein 1 (intesttnal) 


1306S6 


Z20481 


Hs.330988 


Homo sapiens, Similar to Bicaudal D (Dro 


130889 


057622 


Hs.20g85 


sin3-a5Sodated polypeptide. 30kD 


131064 


AA598441 


Hs.22583 


OKFZP434K2235 protein 


131070 


F13694 


Hs.22607 


ESTs 


131189 


116782 


Hs.240 


M^ase pbosj^ioprotein 1 


131316 


XS1699 


HS.25S8 


bone gammacsrtxucyglutamate (gla] prote! 


131506 


W4757g 


HS.5B01 


KIAA1194 protein 
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131551 

131SS3 

131830 

131879 

132017 

132D25 

132095 

132159 

132164 

132180 

132223 

132238 

132406 

132945 

133135 

133193 

133370 

133408 

133409 

133591 

133899 

134137 

134339 

134421 

134462 

134515 

134527 

134711 

134824 

134854 

134921 

135003 

135210 

135348 

100547 

100572 

100687 

100695 

101447 

102329 

102892 

103036 

103206 

103260 

1037S1 

104113 

104316 

104453 

104668 

104916 

106151 

108899 

107379 

107412 

107652 

107754 

107B97 

108238 

108497 

108710 

109012 

109043 

109560 

110572 

110687 

111418 

111507 

111644 

111919 

112102 

112229 

112309 

112368 

112397 

112532 

112858 

113170 

113321 

113404 

113420 

113813 

113663 

113790 











1 IT^Cil 




AAUl/ IDl 


He T17Q7 






U9091D 


He VtAR 








nS.41 IVi 








Mc AIR 


R77451 


TfSJtC'iO 


AA453446 




F09979 


t4e AT7A 


1440559 






U« CMC 
nS.DDoO 


CI 4015 




AA156897 


Hs.72157 


U22172 


Uf> 17QC07 

nS.l /sDs/ 


U65916 




T82292 


ris./0lll 


vnncoo 


Ue T7il^0 


F10045 


Ua 7Qtil7 

nS. r904r 


AA478971 




AAiZZjod 








C20737 




T40B35 




XD4011 


lie flCKlTA 


S78723 




J03454 


Ue 170C7^ 


u/cnisc 
WoUloo 








WQM99 


Hs33589 


AA442054 


Hs^177 


HG2149-HT2219 




HG2271-HT2367 


Hs.73995 


HG31154iT3291 




HG315-HT315 


Hs^2620 


M21305 




U3M07 


Hs.158084 


X05232 


Hs.83326 


X54925 


Hs.83169 


X72755 


Hs.77367 


X78416 


Hs.3155 


AA082824 





AA427510 

D61871 

Migi69 

AA007312 

AA056588 

AA424958 

AA490107 

U93868 

W28105 

AAD10195 

AA017462 

AA026240 

AA059473 

AA083070 

AA121960 

AA156576 

AA159605 

F01778 

H60523 

H93005 

R010B4 

R07728 

R16539 

R39926 

R44840 

R50938 

R55021 

R59371 

R60822 

R69824 

T02963 

T54342 

T70580 

T82323 

T83964 

T93337 

795909 

W33178 



Hs.iei202 
Hs.330821 
Hs.123114 

Hs.155489 

Hs.294132 

Hs.21753 

Hs.333861 

Hs.287797 

Hs^2 

Hs^9244 

Hs£6783 



Hs.5947 

HS.72S80 

Hs.131740 

Hs^7844 

Hs.177311 

H5.19081 

Hs^668 

Hs^23649 

H3.21031 

Hs^26475 

Hs^4949 

Hs.26653 

Hs.26805 

Hs.28313 

Hs.4454 

Hs,270373 

Hs.13759 

Hs.70337 

Hs.15400 

Hs.17167 

Hs^912 



Homo sapfens cDNA: FU22115 fe,ctone H 
Novel htonsn geno (na|)pbi9 to dioinosoiTiB 20 
G protelivcoupted leceptor Unaso 2 (Dtd 

ESTs 

vav3oncogone . 
Rdlk bt giobidB^ fBdor 8 protein 
Homo sapiens done 24877 mRNA sequence 
2jc family member 1 (odd-paired OrosopM 
procoflagen^ne, 2'Oxog!ut8rslo 
fibiotilast activation protein, alpha 
chtomosome 11 hypotheGcdp(Dt^ORF3 
ESTs 

Homo s^ns mRNA; cONA DKFZp761C1712 (f 
ATP-binding cassette, suixfamBy B (MDR/ 
hypothetical protein DKFZp56401664 
EST 

OKFZPS&4I1922 protein 

Human DNA damage repair and fecombinaOo 

dieted in azoospermia-lilce 

protease, serine, 11 (IGF binding) 

epid^ma) growth factor receptor (avian 

KIAA0211gene product 

disabled (^ophOa) homolog 2 (mitogen 

oolBgen.^V.a}pha2 

sel-1 (suppressor of Kn-12, Celegans)- 

ESTs 

EST 

cytochrome b-245, beta polypeptide (chro 
S-hydroxytryptandne (serotonin) receptor 
collagen, type I, alpha 2 

Homo sapiens mRNA; cDNA DKFZp434P153D (f 
trWJortiinophalangeal syndrome I 
liypothetical protein DKi^564B1162 
phosphoitpase C, gamma 1 (fbmieity suUy 
gb:Homo sapiens mucin (mucin) mRNA, pat\ 



gb:Human GoQi-mbp gene, exon 2. 
pregnancy specific bela-1-glycoprotdn 9 
gb:Human dpha satellite and satelDte 3 
peroxisome receptor 1 
matrix metaDoprotelnase 3 (stromelysin 
matrn metalloproteinase 1 (interstiiial 
monoldne induced by gamma Inleffaton 
casein, alpha 

gb2o08b08^1 Stratagene neuroepltheliim) 

hypotheBcal protein FIJ10038 

EST 

cystaOnSN 

gb:EST376458 MAGE resequences, MAGH Homo 

NSI-associated pratein 1 

ESTs 

JM5 protein 

polymerase (RNA) III (DNA directed) (32k 
integrin. beta 1 (fibronecthi receptor, 
ESTs. WeaUy simter to AliJF.HUMAN till 
ESTs 

gb3io77805^1 NO JCGAPJ^I Homo sapiens 
EST 

gb3m85a05i1 Stratagene ovarian cancer 
gb3m24g09j1 Stratagene pancreas (93720 
mel transfornting Mcogene (derived from 
ESTs 

Homo sapiens cONA: FU22562 lis. done H 

EST 

ESTs 

ESTs 



EST. Moderately similar to Cd-7 h 
ESTs. WeaMy similar to 178885 serineAh 
ESTs 
ESTs 

gb7]76d05^1 Soares breast 2NbHBst Homo 
ESTs 

ESTs, Wealdy slmiiar to putative p150 1 

ESTs 

ESTs 

ESTs. Wealdy similar to S65657 afpha-IC 
RAB3A intBracOng protein (rabin3)4lke 
immunoglobunn si^terfamily, member 4 
ESTs. Wealdy similar to S65824 reverse 
ESTs. Highly similar to LRR FU-I tntera 
gb:yo47g07j1 Soares fetal fiver spleen 
ESTs 

544 



wo 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



113889 

114016 

114251 

115187 

115722 

115775 

116380 

116551 

117009 

117329 

117523 

116387 

118456 

118741 

118771 

119075 

119217 

119306 

119347 

120006 

120441 

120651 

120811 

121186 

121599 

122146 

122261 

122352 

122433 

122489 

122554 

122857 

122889 

123399 

123662 

123762 

123792 

123900 

123981 

124126 

124404 

124557 

124703 

124867 

125092 

125111 

125331 

125349 

125426 

125436 

125465 

125515 

125626 

125656 

125743 

125757 

125760 

125804 

125967 

126068 

126081 

126150 

126171 

126198 

126224 

126289 

126343 

126406 

126419 

126479 

126500 

126520 

126701 

126718 

126739 

126745 

126846 

126872 

126952 

127036 

127039 

127067 

127083 



W72720 

W90671 

Z39898 

AA26180S 

AA417297 

AM24030 

AAS98455 

D20458 

H85422 

N23660 

N32626 

N64579 



N74042 

N74690 

R3&451 

R95778 

T26914 

T64349 

W90108 

AA243588 

AA287288 

AA346854 

AA400156 

AA416770 

AA435584 

AA436830 

AA443725 

AA447417 

AA448342 

AA451886 

AA46387g 

AM65704 

AA521274 

AA609385 

AA610013 

AA620333 

AA621223 

C20797 

H18517 

N31998 

N66025 

R07294 

R68g71 

T92544 

T98240 

A1422996 

T87B26 

R43963 

R64472 

A1375276 

R13353 

AI038854 

AA040118 

H17151 

AI274g06 

W03020 

R79519 

A)341206 

AI190171 

AI346024 

AA018427 

M704771 

A1469355 

A1097280 

AA194603 

AA626890 

AA034096 

AA451775 

n8141 

AA885308 

AA292S88 

AA515212 

AA322718 

AI160709 

AA057506 

AA663527 

AA1366S3 

M195575 

Ai468598 

AA233366 

F06732 

Z44079 



HS.11D87 

Hs.21948 

Hs.44021 

Hs^9609 

Hs.46627 

Hs.66817 

Hs,229071 

Hs.108556 

H8.93670 

Hs.145532 



Hs.50421 

Hs.50547 

Hs.287820 

Hs^37309 

Hs.132785 

Hs.10848 

Hs.190035 

Hs.99657 

HS.5Z788 

H5.339608 

Hs.98255 

Hs.250173 

H5.98902 

Ks.159677 

Hs^85491 

HS.17B551 

Hs.154654 

Hs.99606 

Hs^87687 

Hs.105516 

Hs.1 12703 

Hs.112857 

Hs.1 12953 

Hs.g5481 

Hs.164568 

Hs.164256 

H5.141604 

H5.300076 

Hs.168500 

Hs.137548 

Hs.178658 

Hs.161378 

Hs.164480 

Hs.184029 

Hs.16131 

Hs.156732 

Hs.180789 
Hs.78687 

Hs.166835 
Hs.40300 
*Hs.16899 
Hs.173770 
K8.144413 
Hs.227835 
Hs.64616 
Hs.191942 
Hs.127310 
Hs.44493 
Hs.73451 
Hs.158701 

Hs.129064 

Hs.12285 

Hs,184376 

Hs.72071 

Hs^9670 

Hs^9435 

Hs^9047 

Hs.1 16910 

H3.85962 

Hs.276916 

Hs.168103 

Hs.91608 



gb3d61c034l Soares^t^arLNbHHISW 
ESTs 

ESTs 

Homo sapiens mRNAfv FU00065 pcotein. 

ESTs 

ESTs 

ESTs 

EST 
ESTs 

Homo sapiens cDNA: FU22664 fis, done H 
ESTs. Weakly similar (o fVI MOUSE FRIEND 
gb:yzS1d1 1 .si Morton Felal Cochlea Honio 
gb7y69iOU1 Soaresjnultiplejsclafosis. 
KiAA0203 gene product 
ESTs 

fibfonecQnl 
EST 

EAP30 subuntt of ELL complex 
gb:yc10d08j1 StiBtagene lung (937210) H 
KIAA0187 gene product 
ESTs 

ESTs ^ 

ftsgile X mental retardation, autosome 
hypothetical protein FU10120 
EST 

hypothetical protein FU131S8 

ESTs 

ESTs 

ESTs 

rilx)5omal protein LB 

cytochrome P450. subfamBy I (dioxWnd 

EST, WeddysimBar to STK2_HUMAN SERIN 

Homo sapiens cDNA: FU21960 (is, done H 

EST 

ESTs. Moderatety8iirilartoAF171102 1 r 

gb:af1Bd04^1 8oaresJeslls_NHT Homo sap 

ESTs 

EST 

ESTs 

fibroblast growth factor 7 (keratinocyte 

hypothetical protein FU20657 

ESTs. Moderately similar b ALU1_HUMAN A 

86tute canter lamily 22 (organic caOon 

ESTs 

C084 antigen (leukocyte anfigen) 
RAD23 (5. cerevfslae) homolog B 
ESTs 

ESTs. Weakly similar to T50609 hypotheO 
hypothetical protein DKFZp761A052 
hypothetical protdn FU12876 
ESTs 

gb7f76c04.r1 Soares in^t brain 1NIB H 
S164 protein 

neutral sphingomyelinase (N-SMase) acfiv 

gbiym37a05x1 Soares Mant brain 1NIB H 

ESTs. Highly sfanilar to 1814460A p53^ 

calpadn3.(p94) 

ESTs 

ESTs 

ESTs 

KIAA1049 protein 

chromosome 12 c^ien reading frame 3 

ESTs 

ESTs 

Human ONA sequence liom clone 462023 on 
ESTs. Weakly similar to S55024 netniOn. 
ESTs 

gb9v41h02j1 Soares fetel Over spleen 
Homo sapiens chromosome 19. cosmid F2216 
ESTs. Weakly sirdlar to 155214 salivary 
synaptosomal-associatod protein. 23kO 
hypothetical protein FLJ20038 
ESTs. Weakly similar to AF 147790 1 trans 
ESTs, Weakly similar to KIAA0927 protein 
Homo sapiens cDNA FU14059 fis. done HE 
gb:zf49g04.r1 Soares refina N2b4HR Homo 
ESTs 

gb:U}-H^I3«la-a-12'04Jl.s1 ND.CGAP.Su 

hyahironansynlhasaS 

nudear receptor subfamily 1, group 0. m 

prp28. US snRNP 100 kd pmtoln 

gb:HSC1JA051 normalized inttet brain cON 
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127116 


AA278492 


Hs.288304 


127282 


AA347547 


Hs.185780 


127349 


AA412108 


HsJt693S0 


127352 


AA415577 


Hs.189105 


127482 


AI337294 


HS.1053S2 


127543 


A)364367 


Ks. 157392 


127553 


AA282433 




127556 


AA679831 


Hs.190228 


127859 


AA808837 


Hs.291559 


127993 


AA8478S6 


Hs.1 24565 


128277 


AI018275 


Hs.269791 


128285 


AA634569 


Hs.13351 


128317 


AI051960 


H&303754 


128334 


A1080130 


H5.134207 


128428 


AI1B5718 


Hs.143900 


128582 


U22963 


Hs.101840 


128592 


AA470056 


Hs,113994 


128751 


AA442274 


Hs.183176 


129105 


AA224351 


Hs.108681 


129161 


N27334 


Hs.181780 


129248 


N99174 


Hs.206063 


129361 


X64229 


Hs.1 10713 


129577 


AAi424952 


HS.B2906 


129600 


N78980 


Hs,271599 


129989 


AF005887 


Hs.247433 


130024 


U15197 


Hs.1 13271 


130292 


U70136 


H5^18791 


130589 


AA234308 


H5.16441 


130736 


T99385 




131238 


R82327 


Hs.24625 


131378 


AA453886 


Hs.203910 


131601 


M31165 


Hs.29352 


131605 


AA256220 


Hs.29383 


131676 


C20785 


H5.30514 


131861 


D11925 


Hs.184245 


131873 


H39997 


Hs.1 66852 


132023 


F01927 


Hs.3743 


132273 


AA489716 


Hs.43658 


132770 


AA42S647 


Hs.56406 


132859 


D20925 


Hs.69235 


133052 


R40166 


Hs.106826 


133373 


S72487 


Hs.73946 


133446 


M25322 


HS.73B00 


134693 


K70361 


Hs.8854 


134733 


U03644 


Hs^l 


134965 


J05460 


Hs.272458 


135327 


AA477989 


Hs.98800 


135377 


C21382 


Hs.99766 


135398 


AA19407S 


H$^7270 



Homo saptens cONA FU11S29 fis. done HE 

ESTs 

ESTs 

ESTs, Weakly sImQv to NBR13 (Rsapteiis 

GeiNAc 8)pha-2. e^transferase 1. 1 

Homo sai^ cONA FU20780 fis. done CO 

gb:d353g02/1 NQ_CX3AP.GC81 Homo sapiens 

ESTs 

ESTs 

ESTs 

ESTs 

LanC pacterial lantibio6c synthetase c 

ESTs 

ESTs 

ESTs 

major histocompa&flity oomptex, class 
Homo sapiens cDNA FLJ20796 (is, done CO 
ESTs 

Homo ^(dens brdn tumor assodsted prol 

hypolhsQcd protein FU20241 

ESTs 

DEK oncogene (DNAt)indlng} 

CDC20 {cell division cyde 20, S. cerevi 

hypothetic^ protein MGC10S00 

activating transcription factor 6 

ABO blood group (transferase A, alpha 1- 

proteoglycan 4, {megaJcaofOcyte stimulati 

DKFZP434H204 protein 

Qb»*?69g07x1 SoaresJetaLilver^spleen. 

ESTs 

small glutamine-rlch tetralricopep0de r 
tumor necrosis fador, alpha-induced pro 
Homo sapiens mRNA; cDI^ DKFZp434E2321 (f 
ESTs 

KIAA092g protein Msx2 interacting nudea 
KIAA1683 protein 

matrix melailoproteinase 24 (membrane-in 

DKFZP5861151 protein 

Homo ss|)iens cONA FU13549 lis, done PL 

transportin-SR 

K1AA1696 protein 

endotheSa) cell growth factor 1 (ptate) 
selectin P (granule membrane protein 140 
Human transcription unit PVT gene, exons 
CBF1 Interacting corepressor 
protein phosphatase 3 (foimeriy 2B), cat 
ESTs 

Homo sapiens mRNA; cDNA DKFZp56ll0323 (f 
ret protooncogene (muIQpte endocrine 



TABl£44B 

Picey: Unique Eos probeset identifier number 
CAT luinlben Gene duster nuntber 
Accession: Genbanic accession numbers 



108497 



107897 
130735 
108710 
100943 
124575 
125439 
117697 
125515 
118387 
126292 
102798 
126378 
125743 
128406 



CAT number 
110079.2 



91776J 

611414.1 

133560.1 

45976J 

1666649J 

485590.1 

499877.1 

18U 

65081.-5 

327512.1 

34524J 

244444.1 

S02S.5 

95703,1 



AA074897 AA1 13914 AA064871 AA079329 AA071309 AA084710 AA1 29030 AA075042 AA074794 AA071453 AA078803 AA14862B 

AA122204 AA074159 AA126185 AA0791 17 AA127089 AA07(»12 AA079280 AA131372 AA078833 AA071087 AA076131 AA071047 

AA079401 AA083070 AA102076AA115163 AA074198 AA134725 AA113889AA121103 AA075041 AA055148AA071310AA101144 

AA079659 AA078931 AA079209 AA070928 AA058994 AA069817 AA0761 87 AA069053 AA131489 AA071308 AA063317 AA070156 

AA071430 AA076056 AA0756B4 AA070053 AA126283 AA126078 AA075895 AA079208 AA074583 AA071086 AA07 

AA078802AA076622AA065051AA079143AA071110AA079434AA14874dAA079230AA085188AA074485AA0705ro 

AA083166 AA0851 18 AA079450 AA08S044 AA120938 AA079200 AA100168 AA061472 AA122355 AA12g031 AA085362 AA069220 

AA070940 AA075968 AA074563 AA084027 AA 1 1 59 29 

AA604872AA026240 

AI168326 T99385 

AA121959AA121960 

AW884944 U)7517 AW669606 

1468168 N691 68 N90450 

AW835829AAB26305 R017S9 

N40976AA902795 

R13353R13890H11359 

N64579 

AM91328M42312 

U88896 U68698 AA916056 T03285AI341S94AI359534A1634O31 U88897 

»^24AA347642 

H17151H119S6 

N76683AA034096AA034082 
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127067 
119243 
111443 



127278 
103751 
126636 
127331 
127357 
126745 
126762 
126783 
112309 
126872 



111829 
104668 
127553 
120379 



127701 

128410 

114625 

109026 

108409 

100687 

109326 

123762 

116548 

125145 

125153 

116995 

102649 

118456 

102860 

120715 

120749 

113663 

113889 

108258 

101048 

129990 

122746 

124046 

108436 

124197 

101447 

108852 

101697 

108931 

101909 

117816 

119154 

119241 

119310 

119347 

119529 

112467 

112533 

112588 

121082 

123305 

123328 

100547 

123490 



1534978J 
1774795 1 
31528.18 



240640 1 

118557J 

80804J 

379388 1 

288073.1 

104479.1 

110350.1 

113388 1 

1576900.1 

142696.1 

158963.1 

45636.1 

82752.1 

202308.2 

34624J 



405284^1 

288073.1 

111686_1 

150431.1 

113869.1 

tj9r.HT3291 

genbanlUVA210719 

genbanKjAA610013 

genbanlLD20433 

entfBJL.W38001 

entraz.W38294 

g8nb3nl^K83928 

genbanJLU6ai33 

genbanK.N66580 

entrezj(00368 

0enbanMA292700 

6enbanKuAA303235 

g8nbanKJ'95909 

g8nbanK.W72720 

genban)LAA063269 

entfez.KD1160 

genb»iLN30316 

gBnbanK^458791 

genbanK.F10243 

genbari)LAA0788O1 

g8nbanlLH52921 

entrez.M21305 

g8nbanlUVA133131 

entrezJA84358 

genbanK>A147186 

entrez^869265 

genbanlLN48872 

genb3nK.R61293 

genb3n)LT12559 

genbanlLT40427 

g8nbanlLT64349 

entrez^W38053 

genban)LR65706 



F06732Z43705 
T12603T12604 

AR)72503 AF208161 M613238 H12439 M75991 D78692 BE019603 AA776439 R37932 T93615 AF072508 R00744 R01948 R68585 

A1128495 AA865193 Ar797629 H13302 AF072506 KM.014590 AF072505 R00743 T9366t T39519 R68740 H13097 N58614 N77302 

H01372 msn H04136 AA426511 AW971553 AW900030 R7B136 T52094 AI598135 AA781423 R76086 R77278 AI393478 AAB37267 

A)570707 R01901 R27412 N53177 A1379210A)128S26AA250958 R79323 R27389 H01325N5S091 TG9704AA868777T47345R27591 

AA860368 AA729556 H04137 T87297 C17420 AA293243 AA419144 

AA342715AA367634 

AA131367AA082824 

AA057531 AA001527 

F20186AA522352 

AA424107AA452788 

AA047854AA057S06AA053841 

AA064613AA0&4671 

AA083531 AA126047 AA074915 AA148649 
R55021 H26613 

AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BEO1 1212 8E01 1359 BE01 1367 
BE011368 BE011362 BE011215 BE01136S BE011363 
AA179SS6 AA162626 AA182603 
AF074991 R38070 

AW964385AA007312 A!081711 AA3182S3AW891655TB9192 
AA505046 AW969109 AAS05047 

AL04272S BE063316 AW975610 AA4S7591 BE062092 A1655202 AA714296 A1267264 AI075321 AA223288 AA071122 AA227849 

AA216700 A)696002 AA101867 AA099426AA1 35997 AL041698 10261 5 T51824 AA207189 T59230 151868 AA663341 BE165757 

AW818104 AW392866 AA584918 AA099408 AW856396 AW881859 AA053045 

AA679064AA935466 

AA424107AA4S2788 

AAOB1507AA070071 AA070840AA084362 

AA157811AA836869 

AA075631 AA075578 

L18862 

AA210719 

AA510013 

D20433 

W38001 

W38294 

H83928 

U68133 

N66580 

X0036d 

AA292700 

AA303235 

T95909 

W72720 

AA063269 

KOI 160 

N30316 

AA4587gi 

F10243 

AA07a801 

K52921 

M21305 

AA133131 

M64358 

AA147186 



N46872 
R61293 
T12559 
T40427 
T64349 
W38053 
R65706 



8enbenlLR77302 

genbanK_AA398722 

genbanlLAA496133 

98nbanKuAA496968 

tigrjfr2219 

genban)LAA599723 



R77302 

AA39B722 

AA496133 

AA498968 

M57417 

AAS99723 



TABLE45A: 90 GENES DOWhWGUUTEDM RHEUMATOID ARTHRmS(X)MPARED TOW 

TaUe 4SA lists d}oul 90 genes dowiHegulalsd in (heumalM 

Pkey: Unique Eos probeset identiter numbo^ 

ExAccm Exemplar Accession number. Genbank accession nuniber 

UmgenelD; Unlgene nufrt>er 

Unigene Tttle: Unigene gene title 
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Pkey 

100137 

100240 

100289 

100858 

100763 

100779 

101091 

101155 

102223 

102282 

102378 

102386 

102389 

102480 

102568 

102605 

102693 

102710 

102920 

102929 

103166 

103263 

103463 

103835 

104796 

105714 

105927 

105945 

106001 

106027 

106227 

106295 

106417 

107391 

109107 

109685 

110021 

110738 

112745 

113059 

113822 

113859 

113909 

114693 

115399 

116606 

116533 

119254 

119493 

120108 

120886 

120953 

121303 

121547 

123495 

123608 

123749 

124763 

125386 

125657 

125670 

125882 

126541 

126715 

126817 

127112 

127273 

127615 

127635 

128528 

129398 

129821 

131037 

131328 

131631 

132079 

132455 

132582 

132610 

132755 

133192 



ExAccn 

013627 
031767 
D45248 

HG285S4IT2995 
HG3S9741T3800 
HG3731-HT4001 
L06132 
L13972 
U24685 
U31383 
U40369 
U40998 
U41371 
U50327 
U59752 
U64444 
U73824 
U77827 
X12451 
X13238 
X67951 
XB0199 
Y00281 
AA172215 
AA029368 
AA2gi429 
AA402966 
AA404512 
AA410986 
AA412119 
AA429262 
AA435664 
M446008 
W02877 
AA169180 
F0932S 
H11252 
H99370 
R93237 
T26925 
W47350 
W57225 
W78127 
AA122158 
AA283182 
080217 
F02702 
T15837 
W35384 
W95696 
AA36S566 
AA397911 
AA402441 
AM12448 
AAS998S0 
AA609144 
AA50S949 
R39610 
H60ig2 
AA481719 
A1432621 
H45538 
AA204913 
R70160 
AA478642 
A1143906 
AA335263 
AA718919 
AA766903 
R39234 
AA437374 
AA489459 
AA256171 
V01512 
M4B6668 
H67964 
T15774 
AA31B547 
AA443114 
AA609201 
AA393804 



UiiigenelD 

Hs.15071 
Hs.75416 
Hs.179774 
HS.754S2 

Hs.302063 
Hs.149155 
Hs.301698 

Hs.79126 

Hs^91 

Hs.81728 

Hs.75916 

Hs.1432 

Hs.303091 

Hs.181369 

Hs.183684 

Hs,113207 

Hs.78056 

Hs.74649 

Hs.160909 

Hs.83422 

Hs.2280 

Hs.93748 

Hs.33026 

Hs.12211 

Hs.332040 

Hs.14453 

Hs.8963 

Hs.234799 

Hs.19613 

Hs.8583 

Hs.261828 

Hs.284294 

H&269280 

Hs^102 

Hs.31037 

Hs.139648 

Hs.74170 

H5.172684 

Hs.17466 

Hs.13273 

Hs.9956 

Ks.300683 

Hs.92023 

HsJ?59B42 

Hs.268726 

Hs.279009 

Hs.50477 

Ks.16803 

H5.301342 

Hs.97499 

H8^197 

Hs.104777 

Hs.106747 

Hs.112651 

Hs.112790 

Hs.76288 

Hs.76853 

Hs.150540 

Ks.82685 

Hs.101448 

Hs.7854 

HS.2415S2 

Hs.291623 

H$.125103 

Hs.144950 

Hs.116346 

Hs.251699 

Hs.234573 

Hs.301005 

Hs.22391 

Hs^5647 

Hs.29802 



Hs.4892 
Ks.278712 
Hs.5326 • 
Hs.182635 
H8.67052 



UidBeneTlOe 

chaperonbi containing TCP1 , subunil 8 (t 
DAZ associated pnM 2 
pfDteasoma (prosome, macropdn) acfivato 
heat shock TOkD pnM 2 
gt):Hun)an major tiistooonipailbQny coRipte 
immuno^ilobulin heavy constant mu 
voltage^endent anion channel 1 
slatyttransferasa 4A (beta^alactosidase 
gbcHomosapleRS inunuRogiobuEn heavy cha 
guanine nudeoQde binding protein 10 
spermldine/^pennine Nl-acetyttransferase 
unci 19 (Celegans) homolog 
splicing factor 3b. subui^t 2, 1451(0 
protein kinase C substrate 80IC-H 
pieciistrtn homology. Sec7 and ooOediiBoi 
ut^itin fusion degradation l-Gke 
eukaryoUc transtaSon Inlfiation factor 
G protelivcoupled receptor 30 
caihepsinL 

cytochroma c oxidase subunit Vic 
peroxiredoxin 1 
MliJ51 protein 
ribophorini 

Homo sapiens cONA FIJ14676 fs, done NT 
hypothetical protein PP2447 
GOP-fiicose bansporter 1 
hypothetical protein M6C13010 
interferon consensus sequence binding pr 
Homo sapiens mRNA full length insert cON 
breattpdnt duster region 
ESTs 

slm»artoAP0BEC1 

6 piotein^upled receptor kinase 7 

Breakpoint ckister region protein, uteri 

ESTs 

ESTs 

ESTs 

kinesin family member 1C 

metaUolhlonein IE (functlonaO 

veside-assodated membrane protein 8 (e 

reOnoic add receptor (ssponder (lazaro 

KtAA0S92 protein 

hypothetica] protein FU20259 

Homo S8|^ cDNA RJ12825 Ss, clone NT 

core Nstone macroH2A2^ 

protein kinase, AMP-acDvated, gamma 2 n 

ESTs. Highly slmflar to ZN91_HUIWAN ZINC 

matrix Qa protein 

RAB27A, member RAS oncogene family 

LUC7 (S. cerevisiae)-flke 

hypolhetlcai protein MGC4342 

ESTs. Weakly similar to unknown [H.88ple 

Bk»II CUTIymphoma 7C 

ESTs 

serine carbcocypeptklase 1 precursor prot 

ESTs 

EST 

calpain 2, (m/lt) large subunit 

Homo sapiens mRNA; cONA 0}<FZp434N1726 (f 

Homo sapiens, done tl^GE:3954961. mRNA, 

C047 antigen (Rh-related antigen, Integr 

metastasis assodated 1 

zincfton regulated transporter-like 

KIAA0268 protein 

ESTs. Weakly similar to unnamed protein 

ESTs 

ESTs 

gbsv88a04ir1 SoaresJ4hHMPu.S1 Homosapl 
ESTs. Highly sinfiarto A46297 beta-1.6- 
ESTs. Weakly similar to 10N4^TR14 IH.s 
Homo sapiens mRNA for TL132 
purine^fch element Uiuling protBbi B 
duomosorra 20(q)en reading ftamo 3 
v4os FBJ murine osteosarcoma viral onco 
sUt (Drosophila) homdog 2 
ESTs 

Homo sapiens done 24841 mRNA sequence 
eukaryoSc translation ini&aflon factor 
amino add system N transporter Z, porcu 
ESTs 

vacuolar pfotdn sorfing K (yeast homd 
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10 



133437 


R57419 


Hs.7370 


133449 


AA094989 


H5.7381 


133649 


AA479139 


Hs.75393 


133814 


M33882 


Hs.76391 


134378 


Mmm 


Hi82425 


134419 


108044 


Hs^2961 


134548 


U41515 


H3^95 


134776 


J05SB2 


Hs.89603 


135032 


AA243497 


H$.173685 



phcsphoUdyfinosIlol transfer protein, b 
\n^ta9&^ependenl onton channel 3 
add phosphatase 1, soluble 
myxovinis (inOuenza) resistance 1, homo 
ac6n related protein 2/3 cocnplex. subun 
trefoQ factor 3 (Intestinal} 
Deleted In spGMiandfepliHool 1 fegio 
mudn 1, traRsmendvane 
hypothetical protein FU12619 



TAB1£458 



1 5 Pker Unique Eos probeset idenliiter number 
CAT number Gene cluster number 
Accession: Genbank accession numbeis 



20 



25 



30 



35 



40 



Pkey CAT number Accessions 

127615 380951_1 AA626215 AA718919 
100763 tigrHT3800 X12432 Y08693 

102223 221 265 AF01 3616 AA300945 X65907 AF062264 AF=062250 Z47228 Z75389 Z75374 AR)62152 AF062146 Z7539B X64153 AF062101 .AF062218 S59161 
Z75392 AF0621 96 AF062192 X65904 U24685 AF062181 Z47241 Z75376 AF062217 Z47234 X64152 AF062187 AF062173 AF062158 Z47229 
M74fl18 M74021 X54441 M84512 U9115 M84508 Z75384 AJ244983 AJ245240 AJ245030 AJ245042 M26998 LD3635 S64473 AJ244997 AJ245013 
AJ279535 U89768 AF174049 AF174085 AF174086 U97246 AJ245011 AJ245017 AJ245028 AJ245041 AJ245051 AJ245065 AJ245235 U22391 
Z49143 Z74665 AF087428 S66098 Z70650 AJ244929 AF00552B AF022004 AF021983 U00556 AJ245035 Z70517 Z70605 AJ245052 AJ245045 
AF087424 AF174054 S67110 U21257 U21267 U21268 Z35492 U7t 103 AfH)21991 L23518 Z70644 AJ24S036 Z49141 AF089001 Z74695 Z46304 
AF021957 AF021990 AF022005 AF052527 AF021947 Z70604 Z70610 AF062104 Z49135 X64235 Z46341 Z46305 Z46307 Z49136 AJ244996 
Z46342 AJ244931 AJ244935 AJ244937 AJ244938 112192 AJ244939 AJ244940 AJ244941 Z46308 AJ244962 AF062234 AJ244973 AJ244984 
AJ244985 AF1740e8 AJ279519 AJ279521 AJ279526 AJ245009 AJ279531 AJ245008 AJ244994 AJ244991 AJ244990 AJ244988 AJ244987 X87440 
AJ245238 Z7C525 Z70626 Z70641 Z70640 Z70643 AJ244975 Z70616 Z70637 AJ244982 AJ244967 AJ239377 AJ245057 AF021948 AF107239 
AJ245040 U4163 AF062231 Z70627 AF062113 AF00S527 AF174041 AJ279537 Z70642 U00497 Z70639 AJ245054 AJ244960 AJ279524 AJ244943 
AJ249631 AF035041 AJ245039 AJ245050 AF107233 AJ239362 AJ244969 Z46278 Z46290 Z46274 Z46281 AJ239351 L25293 AJ244944 AJ244951 
Z46280Z46270AJ245043Z46276AF107241 Z46271 Z46277AJ245034Z46273AJ244992 Z46282 Z70B38Z46275AJ244972 Z46272 Z45279 
Z46269 AF087422 M74469 X64159 AF103243 X64156 AJ244942 Z46316 AJ222547 Z45322 Z46324 Z46326 Z46327 AJ222556 Z45329 Z46330 
Z46302AJ222561 AJ222549AJ222568AJ222570AJ222571 Z49139 AJ222578 AJ222562AJ222577Z46323 AJ222576 AJ222566 Z46315 
AJ222557 AJ222564 AJ222559 AJ222573 AJ22257S Z46318 AJ222548 Z46319 AJ222552 AJ222550 AJ222567 AJ222558 AJ222563 Z48317 
)C87438AJ222S55 AJ240581 AF103161 AJ240580AJ240594 Y17929 AJ240553AJ240573 AJ240558 AJ240555 Y17927 Y17949 AJ240561 Y17948 
Y17933 Y17947 Y17944 Y17928 Y17931 Y17934 AJ240595 Y17943 Y17932 Y17930 AJ240890 AJ240560 Y17945 AJ240556 879918 AF103278 
AW3842S6 AF103299 AF103122 X75022 AF004937 Z30557 Z30677 Z30573 Z30576 Z30561 Z30674 Z30S62Z30675 AW403129 A1203192 
AW404253AW237246A1654630H61354 
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Table 46A lists about 714 genes upregulaled In esophageal cancer retative to nonra! body tissues. These genes were selected from 59680 probesets on the Eos/Affymetiix Hu03 
Genechip array. Gene expression data for each pnrfieset obtained ftm this an^ was expressed as average Intensity (At), a normaTized value reflecting the relattve level of 
inlVlA expression. 

Table 47AB5ts about 113 genes upregulatBd In esofdiageai tumoTB velative to normd esophagus. These genes were selected fnxn 59680 probesets on the Eos/Af)ymetrfxHu03 
Genechip anay. Gene expression datalbreaA probeset obtained from this analysis was expressed as average intensity (Al), a normdized value reflecting the retaOve level of 
mRNA expression. 



Table 48A Bsts about 162 genes downregulaled in esophageal tumois relative to nonnal esopi 
Genechlpanay. Gene expression data tor each probeset obtained fiom this analysis was 
niRNA expression. 

TABt.E46A; [ 



These genes were selecled from 59860 prdbesets on the Eos/Aflymetrix Hu03 
as average Intensity (AI). a normalized vto reflecting the relative level of 



Ptey: IXitque Eos probeset IdentHler number 
ExAccn: Exemplar Accession number, Genbank accession number 
Ur^enelD: Unigene number 
Undone Title: Unigene gene title 

Rt: 90th penxntila of esophageal htmor Als divided by the 70th percentile of noimai tissue Als. where the 1 5th percentile of the 

6ie numerator and denominator. 



normal tissue Als was subtracted from both 



Pi(ey ExAccn umgenelD Unigene Title 



413808 
411243 
422168 
401781 
424098 
425211 
417388 
401780 
421948 
400289 
429538 



;00287 

AB039886 

.AA566894 

AF077374 

M18G67 

8E165289 

L42583 
X07820 
BE182592 



418007 M13509 



Homo sapiens mRNA for cafatesmon. 3r UTR 
Hs.69319 CA11 

Hs.112408 SlOOcalciunvbindIng protein A7(p6a1as 

Target Exon 
Hs.139322 small pnllne^ch protein 3 
H5.1 667 pTDgastrtesin (pepsinogen C) 
Hs.1076 smaO proline^ prat^ IB (oomlfin) 

NM.00555r:Homo sapiens keratin 18 (foca 
Ks.334309 keratin 6A 

HS.225B matrix metaUoproteinase 10 (stromeiysln 
Hs.11 261 small prdlne^h protein 2A 

Nll4.002425:Homo saito matrbc metallopro 
Hs.83169 matrix metallaprotelnase 1 (Interstitial 



■RT 

31.57 
26.06 
25.65 
23.23 
21.35 
20.37 
20.33 
18.94 
18.13 
18.01 
17.31 
17.28 
16J6 
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430520 NM_0161SO H&242057 duomosome 1 open reading trame 10 16^5 

408522 A1S41214 Hs.46320 SinanpfQ&iMichpfOteinSPRKpiuman. 16^ 

413278 BE5630B5 Hs.833 inteiferoiv«Bmul8tedpretebv15l^ 15^ 

_ 421582 AI910275 trefbflfadgrl [beasl cancer, estroge 14.86 

J 425579 X05997 HS.1S9177 Gpase. gastric 14.53 

421773 W69233 Hs.1 12457 ESTs 14.26 

433091 Y12642 H&3165 lymphocyte antigen 6 complex, locus D 14.26 

422158 L10343 Hs.112341 protease InhitiUor 3, siMertved (SKAL 13.93 

444325 AW152618 Hs.16757 ESTs 13.24 

10 431723 AW058350 Hs.16762 Homo sapiens mRNA; cONA OKF2p564B2062 (f 13.19 

420783 A)659838 Hs.99923 lectin. galactosMe-bindlng, 8olut)la. 7 11.98 

423873 BE003054 Hs.1695 matrix metatloprotdnase 12 (macrophage 10.99 

426350 NM.003245 Hs.2022 transgtutamhiase 3 (E potypepfide. prate 10.77 

432239 XB1334 Hs.2936 matrix metalioproteinase 13 (cdlagenase 10^1 

IS 446292 AF081497 Hs.279682 RhIypeCglyoopratefn 9.69 

421978 AJ243662 Hs.1 10196 NICE-lpfoteb 9.68 

448811 AI590371 H5.199460 ESTs 9.38 

453331 AI240665 ESTs 9.37 

423634 AW9599C8 Hs.1690 heparin^ng growth factor twvfing pr a28 

20 413719 BE439580 Hs.75498 small indudlile cytokine sutrfamSy A (Cy ai8 

406667 M31126 matrix metaOoproteinase 11 (stromelysin 9.13 

454034 NM_00089t Hs.575 aldehyde dehydrogenase 3 family, memtier 9.04 

450701 H39960 Hs.28S467 hypothetical protein XP.0981S1 6.77 

418685 Z36830 Hs.87268 annexInAS 8.76 

25 421110 AJ2S0717 Ks.1355 cathepsinE 8.42 

407788 BE514982 Hs.38991 SlOOcatdunvbinding protein A2 8.42 

424012 AW368377 Hs.137569 tumor protein 63 kDa with strong homolog 8.38 

423217 NKt000094 Hs.1640 collagen, ^ VII, alpha 1 (epMermolys &16 

427666 AI791495 Hs.180142 cdmodulin-like sidn protein (aSP) &11 

30 450375 AA009647 adisintegrinandmetaltopTDlflinasedoma 8.03 

401785 NM_002275*:Homo sapiens keraOn 15 (KRTI 7.97 

445891 AW391342 Hs.199460 DPCR1 protein 7.95 

437053 AU077018 Hs.3235 kertfn4 7.93 

423771 W47225 Hs.1262S6 biterieuklnl.beta 7.80 

35 409757 NIVL001898 Hs.123114 cystatlnSN 7.74 

444342 NM^014398 H5.10887 similar to tysosome-assodated membrane 7.64 

452838 U65011 Hs.30743 prerereniiaDy expressed entigen in mela 7.58 

429211 AFD52693 Hs.198249 gapJuncSonpn3tein.i)eta5(connex]n3 7.55 

428330 L22524 Hs.22S6 matrix metaDoproteinase 7 (matriiysin. 7^ 

40 448045 AJ2g7436 Hs.20166 prostate stem cell anOgen 7.14 

408243 Y00787 Hs.624 InterieukinS 7.13 

429359 W0O482 Hs.2399 matrix metatloproteinase 14 (membrane-in 7.08 

437191 Nkd_006846 Hs.331555 serine protease inhibitor, Kazal type. 5 7.04 

407366 AF026942 Hs.17618 gbrHomo sapiens clg33mRNA. partial seqo 7.04 

45 424687 J05070 Hs.151738 matrix metdbproieinase 9 (gOlatinasa B 6.98 

421379 Y15221 Hs.103982 small inducible cytokine subfamayB(Cy 6.89 

414774 X02419 Hs.77274 plasminogen activator, urokinase 6.85 

439926 AW014875 Hs.137007 ESTs a84 

446921 AB012113 Hs.16530 smaU Inducible cytokine subfamily A (Cy 6.82 

50 429259 AA420450 Hs.292911 l^lakophllln 6.77 

419741 NM^OOTOig Hs.g3002 ublquHin canler protein E2-C 6.71 

424834 AKD01432. Hs.153408 Homo sapiens cDNARJI 0570 fi3,ck)neKT 6.67 

429228 AI553833 Hs.326447 ESTs &61 

426312 AF026939 H5.181874 Interferon^uced protein wHh tetratri &60 

55 431211 M86849 Hs.323733 gap junction protein, beta 2. 26kO (conn &5B 

441362 BE614410 Hs.23044 RA051 (S.cerGvistae}homolog(EcoOR0 6.55 

414987 AA524394 H8.294022 hypothatkal protein FU149S0 6.54 

448989 AK001898 Hs.16740 hypotheQcal protein RJ1 1038 6.53 

409632 W74001 Hs.S5279 serine (or cysteine) proteinase inhiblto 6.51 

60 422166 W72424 Hs.112405 SlOOcak^km-bimfing protein A9(caigran 6.49 

417515 L24203 Hs.62237 ataxia-telangiectasia group D^odated 6.48 

428471 XS7348 Hs.184510 strafifin 6l46 

422511 AU076442 Hs.1 17938 collagen, type XVII. ^ha 1 &45 

444381 BE38733S H$.283713 ESTs. Weakly similar to S640S4hypotheti &44 

65 401747 Homo sapiens keiBlIn 17 (KRT17) &42 

421508 NM.004833 Hs.105115 aisent in melanoma 2 a42 

416768 AA363733 Hs.1032 legeneraflng islet>derived 1 alpha (pane a42 

417079 U55590 Hs.81134 interieuKin 1 receptor antagonist a41 

432374 W68815 Hs.301885 Homo sapiens cDNAFU11346fis, clone PL 6.38 

70 422596 AF063611 Hs.118633 ?-5^H>Ilgoadeny1atesynthetase4!ke a38 

409601 AF237621 Hs.8082B keratin 1 (epidemralytic hyperkeratosis) 6.38 

444781 NM_014400 H$.11950 GPi-anchored metastasis-associated prote 6.35 

407811 AW190S02 Hs.40O98 cysteine knot superfannlly 1, BMP antagon 6.33 

425415 M13903 H3.157091 invducrin a32 

75 431958 X63629 Hs.2877 cadherin 3. type I.P-cadheiin (placenta aso 

415989 ' AI267700 ESTs 6.23 

406673 M34996 H3.198253 major hhtocompaiibinty complex, dass a21 

449228 AJ403107 Hs.1485g0 protein related with psoriasis a21 

436749 AA584890 Hs.5302 lectin, galactoskto4)inding. soluble. 4 ai8 

80 444527 KM.005408 Hs.11383 small indudUe cytokine subfarrily A (Cy a06 

418663 AK001100 Hs.41G90 desmoodOn 3 a04 

428368 BE440042 Ha.83326 matrix metaDoproteinase 3 (stromelysin 5.98 

414915 Nli^002462 Hs.76391 myxovinis(influenza)resistanc8l,hon» 5.98 
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452401 NiyL007115 Hs.29352 tumor necrasb (actor, dphaMuoed pro &93 

452304 AA025386 H$.61311 EST8.We^ similar to S10590 cysteine 5i92 

418004 U37519 Hs.87539 aidehyde ctehycbogenase 3 fernSy. member Sl92 

424620 AA101043 Hs.151254 kallikrein 7 (chymobypac, stratum corn 5.84 

5 425650 NM.001944 Hs.1925 desmoglein 3 (pemptdgus vulgaris antigen 5.76 

400665 NiyL002425:HQnio8apiensmatrixmetallopro &75 

427747 AW411425 Hs.160655 serineffhreonine kinase 12 5.72 

425247 NK1.005940 Hs.155324 n^m8taDoprot^nase11{strome)ysIn 5.72 

414004 AA737033 Hs.7155 ESTs. Moderately sintilar to 211 5357ATYK &71 

10 422765 AW409701 Hs.1578 bacdovirallAP repeat-containing 5 (sur 570 

439606 W79123 Hs.58561 G protein<oupIed receptor 87 5.70 

445417 AK001058 Hs.12680 Homo sapiens cDNAFU10196 83. done HE 5.68 

433447 U29195 Hs.3281 neuronal pentraxin II 5.67 

428227 AA321649 Hs.2248 small indudt){a cytokine sutifamDy B (Cy 5.64 

IS 408000 L11690 Ks.1986S9 tuOous pemphigoid antigen 1 (230/240kO) 5.62 

413219 AAB78200 Hs.116727 Homo sapiens cONAFU13692 lis, done PL 5.60 

428450 NM.0147gi H5.184339 KIAA0175 gene product 5.53 

424408 AI754813 Hs.146428 coOagen. type V. alptia 1 &50 

416250 AA581386 Hs.73452 hypothetical protein MGC1 0791 5.48 

20 447164 AF026941 HS.17S1B Homo sapiens dgSmRNA. partial sequence 5.47 

412326 R07568 Hs.73817 small inducible cytokine A3 (homologous 5.44 

439223 AW238299 Hs.250616 UL16 binding protein 2 5.44 

431629 AU077025 H&26S827 Interferon, atpha^ndudbie protein [do 5.42 

_ 402994 NM_002463':Homo sapiens myxovims(influ 5.40 

25 447333 BE090680 Hs.70704 hypothetica]protdndJ616B8.3 5.40 

426991 AK001536 Homo sapiens cDNA 1^10874 lis. done NT 5.36 

454241 BE144666 gb:CM2-HT017&O410994}17-€02HT0176Homo 5.33 

408716 Pmm Hs.151714 HomosaplensmRNA tor K1AA1 769 protein, 5.32 

449722 BE280074 Hs.23960 cydinBI 5.31 

30 428434 AW363590 Hs.65551 Homo sapiens. Simitar to DMA segment. Ch 5.30 

426283 NM.003937 H5.169139 Itynureninase (L-kynurenine hydrdase) 5.29 

418941 AA452970 H5.239527 E1B-5SkOa^odaled protein 5 5.29 

417720 AA20562S Hs.208067 ESTs 5.29 

424008 R02740 Hs.137555 putative chemoUne receptor GTP-binding 5.2B 

3S 419216 AU076718 Hs.164021 smafl indudUe cytoldne subfamDy B (Qy 5.27 

431620 AA126109 Hs^64g81 2-5'-oilgoadenylale synthetase 2 (^9-71 5.26 

430280 AA3612S8 Hs.237668 interteuldn 7 receptor 5.25 

422627 BE336857 Hs.118787 translomdng growth todor, betMnduoed &24 

402075 ENSP00000251fl56*:Plasma membrane caldum 5.24 

40 413753 U17760 Ks.75517 lam}nin.beta3(niodn(125i(D).k8linin 5.24 

445537 AJ245671 Hs.12844 EGF-lil«Miomain. multiple 6 5.22 

446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, &22 

418558 AWD82266 Hs.86131 Fas (TNFRSF6}-assodated via death dbmd SJ21 

422440 Nl^004812 Hs.116724 aldo-keto redudase family 1. member BIO 5.20 

45 428168 ^»8447 H5.22 transglutaminase 1 (Kpdypeplldeepider &20 

408663 U24683 Hs.293441 immunoglobdin heavy constant mu 5.19 

409178 BE393948 Hs.50915 kalli)(rein5 5.15 

443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 5.14 

410700 AA352335 Hs.65641 hypdhetical protein FU20073 5.10 

50 418054 N1IL002316 Hs.83354 lysyl oxiddse-iacB 2 5.09 

452281 T93500 Hs.28792 Homo sapiens cONA FU1 1041 Us. done PL 5.09 

412471 M63193 Hs.73946 endotheHal ceH growth (ador 1 (plate) 5.08 

411274 NM.002776 Hs.69423 kallikiBlnIO 5.07 

407756 AA116021 Hs.38260 ubiqiritin spedfic protease 18 5.03 

55 40S893 AW247090 Hs.57101 mInichromosomem^ntenancedefident(S. 5.03 

433800 AI034361 Hs.135150 lung typ&4 ceil membrane-assodated gty 5.02 

412755 BE144306 Hs.1 79891 ESTs. WeaWysimHartDP4HA.HUMANPR0LY 5.02 

416530 U62801 Hs.70361 kalDkrdn 6 (neurosln. zyme) 5.02 

429058 AF138863 Hs.35254 hypothetical protein FLB6421 5.00 

60 442117 AW664964 Hs.128899 ESTs; hypothelfea) protein (or tMAGE447 5.00 

426711 AA383471 Hs.343800 consenred gene amp&fied in osteosarcoma 4.97 
405770 NM-002362:Komo sapiens melanoma antigen, 4.96 

444763 AK001468 Hs.62180 anilBn (Drosophila Scraps homdog). ad 4.94 

420859 AW466397 HS.1000QO SlOOcaldum^ng protein A8(calgran 4.94 

65 426866 U02330 Hs.172816 neuregulinl 4.93 

423017 AW178761 Hs.227948 serine (or cysteine) proteinase inhibtto 4.92 

455601 A1368680 Hs.616 SRY (sex detenrtnlng region Y^box 2 4.91 

409956 AW103364 H3.7Z7 inhibln,betaA(adhrinA.adivinABa 4.90 

427786 BE407863 HS.256B71 ESTs 4.87 

70 409420 Z1S008 HS.544S1 lamihin. gamma 2 (idoeln (lOOkD). kaOni 486 

444371 6E540274 Hs.239 tbrtdiead box Ml 4.88 

431009 BE149762 Hs.48956 gap )unc6on protein, beta 6 (connexin 3 4.85 

434826 AF155661 Hs.22265 pyruvate dehydrogenase phosphatase 4.84 

406690 M29540 Hs.220529 carotnoembiyonic antigen^eialed ceil ad 4.83 

75 409402 AF208234 Hs.695 eyslalin B (stelin B) 4.81 

408202 AA227710 Hs.43858 DKFZP586L151 protein 4.79 
401994 Target Exon 4,77 

425292 NM.005824 Hs.l55545 37 kOaieudne^ repeal (LRR) protein 474 

^. 421574 AJ0001S2 Hs.105924 detensln. beta 2 4.69 

80 429299 At620463 H1347408 hypothettea) protdn M6a3102 4.69 

422109 S7326S Hs.1473 gastrfn^asing peptide 4.68 

439453 BE264974 Hs.6566 thyraU honnone receptor hterador 13 4.68 

407944 R34008 Hs.239727 desmoGotnn2 4.67 
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411296 BE207307 Hs.10114 giowUi suppressor 1 4.6S 

433364 AI07S4a7 Hs,296083 ESTs. Moderately sbnOsr to 1S4374 gene 4.65 

425234 AW152225 Hs.165909 ESTs, WeaMy similar to 138022 hypdhetl 4.61 

421335 X99977 Hs.103505 ARS component B 4.60 

5 422S15 AW500470 Ks.117950 multifundlonat polypeptide similar to S 4.59 

453779 N35187 H$.43388 28kO interferon responsive pralein 4.S9 

423575 C18663 Hs.163443 periosGn (OSF-Zos) 4.59 

417308 H60720 Hs.81892 KIAA0101 gene product 4.5B 

428651 AF196478 HS.18B401 annexinAlO 4.58 

10 424354 NM.014314 H5.145612 RNAheiicase 4.58 

404996 Target Exon 4.56 

404240 NM.01 89S0:Homo sapiens histooompat 4.56 

453095 AW295660 Ks.252756 ESTs 4.5S 

410407 X66839 Hs.63287 carbonic anhydrase DC 4.55 

15 418678 NM.001327 Hs.87225 cancerftestisan6gen(NY-ES0-1) 4.55 

450685 L15533 Hs.423 pancreatiOs-associated protein 454 

425483 AF231022 Hs.156159 FAT tumor suppressor (Orosopliila) homoto 4.53 

425397 J04088 Hs.156346 topoisomera5e(DI^}ltalp)ia(170kD) 4.52 

408380 AF123050 Hs.44532 diul)lquiUn 4.47 

20 443859 NhL013409 Hs.9914 foillstatln 4.46 

411773 NM_006799 Hs.72026 protease, serine, 21 (testisin) 4.44 

412140 AA219691 Hs.73625 RAB6 interacting. lOnesin^ (rabklnes 4.44 

4217n BE562088 Hs.106196 HSPC037 protein 4.44 

408908 BE296227 Hs250622 serine/Utreonlne](inase15 4.43 

25 408122 AI432652 Hs.42B24 liypotheticat protein FU10718 4.42 

422487 AJOIOgOl Hs.198267 mucin 4. tracheotxandiid 4.42 

400419 AF0&4545 Target 4.42 

452571 W31518 Hs.34665 ESTs 4.41 

430044 AA464510 Hs.152812 ESTs 4.41 

30 414732 AW410976 Hs.77152 minichromosome maintenance deficient (S. 4.39 

448111 AA053486 Hs.20315 interferon-induced protein with tetratri 4.39 

443347 AI052543 Hs.133244 melanoma^erlved leucine zipper, extras 4.39 

453884 AA355925 Hs.36232 K1AA0186 gene product 4.38 

436481 AA379S97 Hs.5199 HSPC1 50 protein similar to ubiquilin^ 4.37 

35 417900 BE250127 Hs.82906 CCX20 (ceil division cycle 20. S. cerevi 4.37 

424046 AF027866 Hs.13B202 serine (or cysteine) proteinase Intilblto 4.37 

427983 M17708 H$.2233 colony sOmutabng factor 3 (granulocyte 4.36 

448357 N20169 Hs.108923 RAB38, member RAS oncogene (amiiy 4.36 

409041 AB033025 Hs,50081 Hypothetical protein. XP^OSI 860 (KIAA1 19 4.34 

40 439999 AA115811 Hs.6838 ras iwmotog gene family, memt}er E 4.34 

410361 BE391804 145.62661 guanylate binding protein 1. Interferon- 4.34 

409703 NM_006187 Hs.56009 ?-5*-ol1goadenyta!e synthetase 3 (100 k 4.32 

402447 C1000201:gi|204416{gbtAAA02627.1|(L0519 4.31 

426514 BE616633 Hs.170195 bone morphogenefic protein 7 (osteogenic 4^ 

45 432731 R31178 Hs.287820 fibronectini 4.27 

422397 AJ223366 Hs,1160S1 Homo sapiens cONA: FU22495 fis, ctane H 4.27 

413670 AB000115 Hs.7&470 hypothetic^ protein, expressed in osteo 4.25 

425580 L11144 H3.1907 gaianin 4.25 

421506 BE302796 Hs.105097 thynMlneiunase l.soiufale 4.23 

50 409433 AA074382 Hs.135255 ESTs 4.23 

430630 AW269920 Hs.2621 cy8tatInA(stefin A) 4.22 

447343 AA256&41 Hs.236894 ESTs, Highly sinular to S02392 8!pha.2-m 4.21 

407047 X65965 gb:RsapiensS00-2 gene for manganese su 4.20 

432375 BES36069 Hs.2962 SlOOcalciunvbinding protein P 4.20 

55 434449 AW953464 Hs.3849 hypothetica) protein nJ22041 similar to 4.19 

417866 AW067903 Ks.82772 coD^en. type XI. alpha 1 4.19 

436291 BE568452 Hs.344037 protein regulator of cytoUnesis 1 4.18 

418140 BE613836 Ks.83551 micn3fibTinar.associated protebi 2 4.17 

410286 Ai7391S9 Hs.61898 ' DKFZP586N21 24 protein 4.16 

60 448844 AI561519 Hs.177164 ESTs 4.16 

432680 T47364 Hs.278613 tnlBrfBian. alph&^ndudbte protein 27 4.16 

417599 AA204688 H$.62954 ESTs 4.16 

402992 Target Exon 4.15 

429500 X78565 Hs.289114 hexabrachion (tenasdn C, cytotactin) 4.14 

65 422100 Ai096988 HS.1115S4 ADP-dbosyiaQon factor-tilo 7 4.13 

409512 AW979187 83.293591 melanoma differentiatlbnassacialedprot 4.12 

446985 AL0387O4 Hs.156827 ESTs. WeaMy slmRar to Al^OUMANAiaJS 4.11 

407137 T97307 gb^eSShOS-sl Soares fetal liver spteen 4.10 

411263 BE297802 Hs.69360 ldnesin^6(mitottccentrom6re-assoc 4.10 

70 439979 AW600291 H3.6823 hypotheSca) protein FU10430 4.10 

423905 AW579960 Hs.135150 lungtype^ceBmembrane^ssodatedgly 4.09 

427337 Z46223 Hs.176663 Fc fragment of IgG, low affinity lllb.r 4.08 

417933 X02308 H$.82982 thymldylate synthetase 4.08 

_^ 418589 A1360883 H&274448 hypothetical protein FU11029 4.06 

75 417678 X06560 Hs.82396 2',5'-oi]goadenytaie synthetase 1 (4046 4.08 

451541 BE279383 Hs.26557 pla)(ophnin3 4.06 

433848 AF095719 Hs.93764 carboxypepBdaseA4 4.06 

418113 A]272141 Hs.83484 SRY(8exdetemdnlng regbnYH»x4 4.04 

429599 AA806106 Hs.123664 ESTs 4.03 

80 450823 T81223 Hs.22011 complement^slq tumor necrosis factor-rel 4.02 

423787 AJ29S745 Hs.238204 nudear pore complex protein AXO 

431250 BE264649 H&251377 laxol resistance associated gene 3 4.00 

416091 AF295370 Hs.283082 defensin,beta3 3.97 
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427557 NM.002659 Hs.179657 ptasniinogen acOvatv. urokbiasa recepto 3.97 

427099 AB0329S3 Hs.1 73560 oddOz/(eiMn)vnnoIog2(Oiosophila.mous X97 

402408 NM-030920*:Homo sapiens hypoQiefical pro 197 

424927 AW973666 Hs.l53850 hypoUiellca! proleln C321D14 195 

5 414907 X90725 Hs.77597 poto(DrosophlaHIk8 kinase 195 

452888 AW955454 Hs.30942 ephriivB2 195 

456525 AW468397 Hs.100000 S100 caidunvUnding proton A8 (calgran 195 

450983 AA305384 Hs.25740 ER01 (S. cerevislaeHito 194 

419693 AA133749 K5.301350 FXYDdomaiKonlaining ion transport reg 194 

10 420596 NM.002692 Hs.99185 potyrorasePNA directed), epslkm 2 194 

422094 AF129535 Hs.272027 only protein 5 194 

414945 BE076358 Hs.77667 lymphocyta antigen 6 complex, locus E 193 

404286 C6001909:6i|704441|dtqI6AAl8909.1)(D298 193 

423951 D13666 Hs.138348- periostin (OSF-2os) 192 

15 426075 AW513691 Hs.270149 EST5.WeaJdy similar to 21 09260ABcen 191 

447377 X77343 Hs.334334 transcription factor AP-2 alpha 191 

435066 BE261750 Hs.4747 dyskeratosis congenita 1. dyskerin 191 

446998 N99013 Hs.16762 Homo sapiens inRNA:cONADKFZp564B2082(t 191 

421904 BE143533 Ks.109309 hypothetical protein FU20035 190 

20 441553 AA281219 Hs.121296 ESTs 190 

428093 AW594506 Hs.104830 ESTs 190 

441020 W79263 Hs.35962 ESTs 189 

447078 AWB85727 Hs^l4 ESTs 189 

437044 AIJ035864 Hs.69517 differentiany expressed {nFanoonf^ an 189 

25 417621 AV654694 Hs.82316 Interfenm-induced, hepatitis Oassodat 187 

424905 NKt002497 Hs. 153704 NIMA (never In mitosis gene aHelated k 187 

426059 BE292842 Hs.166120 interferon reguiatofy factor 7 188 

419833 AA251131 Hs.220e97 ESTs 185 

451807 W52854 Hs.27099 hypotheOcai protein FU23293 similar to 185 

30 422530 AW972300 HS.11B110 t)ona marrow stroma) ceD antigen 2 183 

452203 X57522 transporter I.ATP-Wnding cassette, sub 183 

414020 NM_0029e4 Hs75703 smaU inducible cytokine A4 (homologous 183 

414809 AK34699 H&77356 transferrin receptor (p90.CO71) 183 

442599 AF078037 Hs.324051 RetA-assoctated inlribltor 162 

35 405387 N^UJ22170':Hon»8apien8WilB8ms*eurwi 182 

449539 W80363 Hs.58446 ESTs 182 

419079 AW014836 Hs.18844 ESTs 181 

410434 AF051152 Hs.63668 toD-like receptor 2 181 

408660 AA525775 ESTs. Moderately similar to PC42S9 feni 180 

40 409142 AL136877 Hs.50758 SMC4 (structural maintenance of chiomoso 180 

435099 AC0O4770 Hs.4756 flapstnicture-specificendonucleasel 180 

444005 BE395085 Hs.10085 type I transmembrane protein Fn14 179 

426761 AI015709 Hs.172089 Homo sapiens mRNA;cONADKF2p588l2022(f 179 

444665 BE613126 Hs.47783 B aggressWe lymphoma gene 178 

45 427528 AU077143 Hs.179565 rnlnlchnfTiosonis maintenance defk:ieiil(S. 17B 

414561 AI064813 H&195155 Homo sapiens amino acM transport system 176 

411789 AF245505 H3J2157 Adlican 177 

428311 Nl^005651 Hs.183671 tryptophan 2.3Klk)0(yoenase 177 

449376 AW664026 Hs^9892 ESTs 176 

50 449961 AW265634 Hs.133100 ESTs 176 

443378 AW392550 Hs.9280 pioteasome(proson»,macfopa!n)subunlt. 3.75 

407242 M18728 gb:Humannonspecifk:crossreactlngantig 175 

414416 AW409985 Hs.76034 hypothetical protein MGC2721 175 

422283 AW411307 Hs.114311 CDC45 (cell divlsbn cyde 45, S.cerevis 3.75 

55 422675 BE01BS17 Hs.119140 eukafyoQctransIalkinbutotion factor 3.74 

422899 BE410590 Hs.119257 enrsl sequence (mammary tumor and squamou 3.74 

441954 AI744935 Hs.8047 F8nconlanenila.oomplementa9on group G 3.74 

410290 AA402307 H3.322844 hypothetical protein DKFZpS64Al76 173 

422648 086963 Hs.118893 Melanoma assodaied gene 3.72 

60 428953 AA308610 H&348183 tumor necrosis factor receptor supeifaml 3.71 

443883 AA114212 H9.9930 serine (or cysteine) proteinase inhiblto 3.71 

428728 NM_016625 Hs.191381 hypothetical prolan 171 

400245 Eos Control 171 

442432 BE093589 Hs.38178 hypothetical protein FU23468 170 

65 428484 AF104032 Hs,184601 sdute carrier family 7 (cafionic amino 170 

424840 D79987 HS.15M79 extra spindle poles. S.cerevlslae. homo 170 

440659 AF134160 Hs.7327 daudlnl 169 

414821 M63835 Hs.77424 Fc fragment ofIgG, high aJanltyta, re 167 

413053 AL035737 Hs.75184 chlflnase 3-like 1 (cartilage glyooprote 167 

70 425081 X74794 H3.154443 minfchroinosome maintenance deflcl8nl(S. 3.66 

409432 049372 Hs.54460 smaH Inducible cytokine subfamily A (Cy 165 

428291 AA534009 Hs.183467 interferon stimuUed gene (20kD) 3.63 

414883 AA926960 C0C2B prot^ Unase 1 163 

428398 AI249368 Hs.98558 ESTs 163 

75 428479 Y00272 Hs.334562 cefl divisfon cyde 2. G1 to S and 62 to 163 

408482 NM 000676 H5.45743 adenosine A2b receptor 3.63 

404287 C6001909:gII704441ldbjlBAA18909.1|(D298 163 

431941 AK000106 H3.272227 Homo sapiens cDNA FU20099 fis, done CO 3.61 

414110 BE251752 ^£01 1 1 2444F1 NIH_MGCJ6 Homo sapiens c 3.61 

80 427857 AU33017 Hs.2210 hypothetical protrin FU22865 161 

419968 X04430 Hs.93913 inte(teukin6CintBrferon.beta2) 161 

430413 AWB42182 Hs.241392 smaHMudbie cytokine A5(RANTES) 160 

416209 AA236776 Hs.79078 MAD2 (mitotic anesi deficient. yaasl,h 160 
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433001 AF217513 Hs.279905 ctonsHQ0310PRO0310p1 ^60 

430994 AA490346 Ks.40530 Homo saf^ns, done MGai7624,mRNA. com 3.60 

456534 X91195 Hs.100623 phosphoTipase C. beta 3. ndQhtor pseudo 3.59 

437340 AL3S3935 Hs,135917 hypothefeal prolan OKFZp761D1823 aS9 

5 435793 AB037734 Hs.4993 KIAA1313 protBin 3.59 

437016 AU076916 Ks.5398 guanine tnonphosphate synthetase 3J9 

420247 AA^30 HS.44S80 hypothetical protein FU20979 3^ 

424308 AW975531 H5.154443 mln i c hro mos o mG naintenance deficient (S. 3.57 

422282 AF01922S H5.114309 apdlpoprotein L 3.57 

10 424635 AA420687 Hs.1 15455 Homo sapiens cDNAFUl4259fis,cton8 PL 3.57 

421044 AF081871 Hs.101302 Kumai DMA sequence from done RP1-238015 3^57 

408015 AW136771 Hs.244349 epldeima) dlfferenUaflon oomptex protei 3.56 

422956 BE545072 H5.122579 ECH protein (EpiOieialoeU transform! 3.56 

449039 Aig62602 Ks.742B4 hypo8»Iical protein MGC2714 3.56 

15 446269 AW2631S5 Hs.14559 hypothetical protein FU 10540 3.56 

400297 AI127076 Hs.306201 hypothetical protein DKFZp56401 278 3.55 

428977 AK001404 Hs.19469e cydin82 3^ 

402995 Nfi^002463':Komo8apiensn)yxovitus(infiu 155 

416065 BE267931 Hs.78996 proSterating ceU nudear antreen 3i4 

20 432917 NM-014125 Hs541517 PRO0327 protein 3.54 

439750 AL359053 Hs.57664 Homo sapiens mRNA full length Insert cDN 153 

445411 A1137255 Hs.12646 hypotheScal protein FU22693 3.52 

438113 A1467908 Hs.8882 ESTs 3.52 

414420 AA043424 Hs.76095 Immedials early response 3 3.51 

25 419682 H13139 Hs.92282 paired-6ke homeodbmain transcription fa 3.50 

447208 BE315291 Hs.237971 hypothetical protein MGC5627 150 

432543 AA552690 Hs.152423 Homo sapiens cONA: FU2 1274 fis. done C 3.49 

442295 AI827248 Hs.224398 Homo sapiens cDKA FU1 1469 fis. done HE 3.49 

426440 BE382756 Hs.169902 sotutecanfef family 2 (fadntatedglu 149 

30 429249 X81479 Hs.2375 egf-lilce module containing. mud}v4il«, . 148 

413900 AW40g747 Hs.75612 stiess-induced-phosphoprotdn 1 (HspTO/H 148 

424242 AA337476 Hs,347408 hypothefical protein MGC1 3102 148 

414761 AU077228 Hs.77256 enhancer of zeste (Drosophila) honwlog 2 147 

446480 NM_014578 Hs.15t14 ras homdog gene family, memtw 146 

35 414825 X05370 ' Hs.77432 epidenral growth factor receptor (avian 146 

428855 BE544095 Hs.164950 BarH-Ike homeotwx 1 146 

449003 X76342 Hs.389 alcohd dehydrogenase 7 (dass IV), mu o 3.46 

450506 NM.004460 libfoblasi activation protein, alpha 3.46 

421307 BE539976 H5.103305 Homo sapiens mRNA; cDNA Di<FZp434B0425 (f 145 

40 422938 Nbl.001809 Hs.1594 centromere protein A (17kD) 145 

405545 Target Exon 145 

418322 AA284166 Hs.84113 cyclin^ndent kinase lnhU)itor 3 (COK 144 

441703 AW390054 Hs.192843 leudne zipper protein FKS614 144 

417944 AU077196 Hs.8^ coQagen, type V, alpha 2 144 

45 417924 AU077231 Hs.82932 cydin D1 (PRAD1: parathyroid adenomatos 144 

431228 AB006748 Hs.198282 phosphdipid scrantlase 1 144 

422363 T55979 Hs.1 15474 repncaiJon fador C (adivalor 1) 3 (38 143 

440502 Ai824113 Hs.7a281 reguiabrof &proteinsIgnaiitng 12 143 

448741 BE614567 Hs.19574 hypothetical protein MGC5469 143 

50 453922 AF053306 Hs.36708 tudtfinguninhllxtedlvterizlmidazolesl 143 

406646 M33600 Hs.308026 major MstoGompafibllity complex, dass 142 

413281 AA861271 Hs.222024 transcription factor BMAL2 142 

449101 AA205847 Hs.23016 G protein^upted receptor 142 

430890 X54232 Hs.2699 glyplcani 141 

55 422809 AND01379 H3.121028 hypolheiicd protein FU 10549 141 

412429 AV650262 Hs.75765 6R02oncogene 141 

443211 AI128388 Hs.143655 ESTs 141 

422209 AF005210 Hs.113222 diemddne (OC motif) receptor 8 3.40 

428303 AWg74476 Hs.183801 regulator of &pratein signalling 16 139 

60 421817 AF146074 Hs.108660 ATP-Unding cassette, 8ut>-My C (CFTR 139 

428664 AK001666 H5.169095 dn^toSAUl (sal(Drosophila)^ 139 

422101 AW404176 Hs.111611 rOwsomal protein U!7 139 

457670 AF1 19666 Hs.23449 Insulin receptor tyrosine kinase 8Ut»tia 3.38 

437033 AW248364 Hs.6409 RNA pdymerase I subunlt 137 

65 425322 U63630 H3.155637 protein Idnase. DNA^ctivated. catalytic 137 

417059 AL037672 Hs.81071 extracelhilar matrix protein 1 137 

400298 AA032279 Hs.61635 six transmemtirane epithelial antigen of 136 

414812 X72755 Hs.77367 monokine induced tiy gamma iillerferon 136 

436748 BE159107 Hs.159263 cdlagen. type VI. alpha 2 136 

70 401797 Target Exon 3.36 

426309 li^7815 Hs.183650 ceiluiar retinolc edd-tjinding protein 2 135 

421553 NM.006433 Hs.1058C6 granulysin 135 

402294 Target Exon 134 

414024 AA134712 Hs.22410 gb2m79g08.n StratageneneuroepitheBum 3.34 

75 401981 NM.021626:Homo sapiens serine carbOKypep 133 

418462 BE00159& Ks.85266 tntegrin, lieta 4 133 

418667 031771 Hs.89404 msh (Drosophiia) homeo box homotog 2 133 

424800 AL035588 H3.153203 MyoOfarnily inhibitor 133 

412420 AL035668 Hs.73853 bone morphogenetic protein 2 133 

80 404440 NM_021048:Homo sapiens melanoma antigen. 3.33 

432398 AA307808 Hs.2979 (refdl factor 2 (spasmolytic protein 1) 133 

421677 H64092 Ks.36282 ESTs 133 

407792 AI077715 Hs.39384 putative secreted Qgandhamdogous to f 132 
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449048 Z45051 Hs^920 stmHar lo S68401 (calOe) ghicose bduc 3.32 

417197 AW994561 Hs.151777 eukaryotic tr&nsla&m initiation factor 3^ 

429669 BE1B5499 Hs^471 K1AA0O20 gsne product 3^ 

409636 AA305729 Hs.18272 amino acid transporter system A1 3^ 

S 429415 NM.0025g3 Hs.202097 procollagen C^endopepSdase enhancer 132 

405386 Target Exon 132 

410274 AA381B07 Hs.6t762 hypoxia^ndudbie protein 2 3.31 

448275 BE514434 H$.20830 Idn8sin^l(e2 3.31 

. 418245 AA088767 Hs.838B3 Iransmemtirane. prostata androgen Induced 3.31 

10 452291 AF015592 Hs.28853 C0C7 (cell division cyde 7, S.cerevlsi 3.31 

431830 Y16645 Ks,Z71387 $man(nduc{btecyto)dne8ubraniByA(Cy 3.31 

422575 AK000546 Hs.118552 liypotheticai protein FU20539 3.31 

404171 NMtO00638*:Hoino sapiens superoxide dismu 131 

418464 R87580 H3.144531 Ob7m89h07.r1 Scares 8dullt)rainN2b4HB5 131 

IS 425566 AW162943 Hs^50618 UL16 binding protein 2 131 

410226 At831958 Hs.61053 hypoUieScsi protein 130 

432281 AK001239 Hs.274263 hypottieficat protein ajl 0377 130 

443247 BE614387 Hs.333893 oM/c target JP01 ISO 

449717 AB040935 Hs.23954 cerebral cetladlie^nioleade 130 

20 428336 AAS0311S Hs.183752 microsenvnoprotein. beta- 129 

416111 AA033813 Hs.7901B diromalin assembty factor l.subunit A ( 129 

420759 T11832 Hs.127797 Homo sapiens cDNARJ1 1381 fis, clone HE 128 

432336 N14.002759 Hs.274382 protein Idnasa. interferonHndudble dou 128 

405776 NM.005361:Homo sapiens melanama antigen, 128 

25 419486 AA316241 Hs.90891 nudeop}u)smin/nudeop)dsni!n 3 127 

421150 AI913562 Hs.189902 ESTs 127 

406400 kani)(r^8(neuropsin/ovasin)(iaK8) 127 

455813 BE141577 gb:QV2.HT0083-071 2994)1 8^11 ifTOOBS Homo 127 

426084 BE367014 Hs.1B6146 Homer, neuronal immediate eady gene, 3 127 

30 458814 Ai49B957 Hs.170861 ESTs. WeatdysinBTartoZISSjWMANZINC 127 

456791 BE615453 Hs.346509 dedicator of cyto-ldnesisl 127 

419551 AW562256 Ks.91011 anterior gradient 2 (Xenepusl8evls)iiom 126 

429002 AW248439 Hs.2340 iunctlon plaicoglobin 126 

450000 Ai9S2797 Hs.10888 tiypottietical protein FU21 709 125 

35 407777 AA161071 Hs.71465 squalene epoxidase 125 

419485 AA469023 Hs.99807 ESTs. Weakly sin^ to unnamed protein 125 

426437 BE07e537 Hs.169895 utnquitinK»njugafing enzyme E2L 6 124 

415701 NM.003878 Ks.78619 gamma^Iutan^ liydrolase (conjugase. fol 124 

412817 AL037159 Hs.74619 prote3Some(pn)some,macropain)26Ssubu 124 

40 447519 U46258 Hs.339665 ESTs 124 

412561 NM 002286 Hs.74011 lymphocy(e-8CtIv8aongene3 124 

446528 AU076640 Hs.15243 nucleolar protein 1 (120kD] 124 

423198 M81933 Hs.1634 cell division cycle 25A 123 

415091 A1JD44872 Hs.77910 3^ydroxy-34netliylgIutaryI-Coen2ymeAsy 123 

45 441065 AW136551 Hs.1B1245 Honio sapiens cONAFU12532 fis, done NT 122 

443071 AIJ080021 Hs.8986 oomplenient component l.qsubcomponenL 122 

406901 AK001330 Hs.48855 hypothefical protein FU10468 122 

425849 AJO00512 Hs.296323 serum/gtucocorticQid regulated Idnase 122 

410275 U85658 Hs.61796 transcription factor AP-2 gamma (activat 122 

50 415817 U8B967 Hs.78867 protein tyrosine pltosphatase.feoeptor4 121 

409197 N54706 Hs.303025 chromosome 11 open reading frame 24 121 

412641 M16660 Hs.74335 heat strode 90ld) protein 1. beta 121 

413436 AF238083 Hs.68081 sphingosine kinase 1 121 

408636 BE294925 Hs.46680 C6H2 protein 121 

55 412115 AK001763 Hs.73239 hypothetical protein RJ 10901 121 

413142 M81740 Hs.75212 ornithine decarboxylase 1 121 

411573 AB029000 Hs.70823 KIAA1077 protein 120 

428242 HS5709 H5.2250 jeukemia inhibitory Mir (choOnergic 119 

409361 NM.005982 Hs.54416 sineocuI{shomeobox(Drosophiia)homoto 119 

60 435014 BE560896 Hs.10028 mitochondrial ribosomat protein L17 118 

401176 TargelExon 118 

434551 BE387162 Hs.280858 ESTs. Highly similar to A35661 (^exds 117 

410310 J02931 Hs.62ig2 coagulation factor ill (thromboplastin. 3.16 

427584 6E410293 Hs.179718 v-myb avian myeloblastosis viral oncogen 116 
65 423725 AJ403108 Hs.132127 hypothetica] protdn LOC57822 116 

452012 AA307703 Hs.279766 Wnesin family member 4A 116 

407289 AA135159 Hs.203349 Homo sapiens cDNAaJ12149 fis, done MA - 115 

409461 AA382169 Hs.S4483 M-myc (and STAT) Interador 115 

433020 Ai375726 Hs.279918 hypotheticai protein 114 

70 437915 AI837993 H3.2D2312 Homosapiens done Nil NTera2D1 teratoca 3.14 

426997 BE620738 Hs.173125 pepOdyMf ^nterase F (cydophiOn 114 

420005 AW271106 Hs.133294 ESTs 114 

426935 NM.000088 Hs.172928 collagen, type 1. dpha 1 3.13 

412270 AC005262 Hs73797 guanine nudeoOde binding protein (G pr 113 

75 421975 AW961017 H3.6459 hypolhetica) protein FU 11856 113 

427585 D31152 H5.179729 coBagen. typo X. alpha 1 (Schmid meiaph 112 
448140 AF146761 H820450 BCNMto membrane protein precursor 111 
431722 AF16152B Hs.268049 hypothetical protein lit 
427239 BE270447 Hs.174070 uWquitin canier protein 111 

80 413385 M34455 H5.840 indoiesminei>yiToie 2.3 dioxygenase 110 

439780 AL109688 gb:Homo sapiens mRNA full length insert 110 

422885 BE244068 Hs.121544 IntedeuUn 12 receptor, beta 1 110 

418090 U57059 Hs.83429 tumor necrosbrador(ligan(Qsupe(fM 110 
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439755 AW7484B2 Hs.77873 B7hoinoloo3 aiO 

404170 NM.000636*:Homo sapiens stiparoxlda dismu 3.09 

417370 T28651 Hs.82030 byptOftayURNA synthetase 3.09 

410006 AW732308 H$.577B3 eukaryoiic translaSon hiOaOon factor 3.09 

5 446291 BE397753 K5.14623 interferon, gamma^udbis protein 30 3.09 

421155 H87B7g H5.102267 iysyloxldase 3.08 

441224 AU076964 Hs.7753 calunenin 3.08 

424326 NM-014479 K5.145296 disintegrin protease a08 

429413 NM.0140SB Hs^01877 DESCl protein 3.08 

10 438251 BE515065 Hs.296585 nudedar protein (KKE/D repeat) 3.08 

446510 K58306 Hs.15165 retindc add induced 14 108 

442620 C00138 Hs.8535 Homo sapiens mRNA (or KtAA166B protein. 3.07 

409637 AA323948 Hs.55407 Homo sapiens mRI^ cONA DKFZp434K0621 (f 3.07 

4266B2 AV660038 Hs.2056 UOP gtycosyltransferase 1 (an%, polype 107 

IS 448853 NM12204 Hs.22302 genera! transoiplion factor II tC. polyp 107 

453775 NVL002916 Hs.35120 replication factor C (activator 1) 4 (37 107 

408915 NM 016651 H5.48950 tieptaceltular carcinoma novel gen&^ pro 106 

435505 AF^92 Hs.211238 interteuMn-l liomoiog 1 106 

412577 Z22968 Hs.74076 C0163 antigen 106 

20 410575 BE207480 Hs.6934 Homo sapiens cONA aj22044 fis. Clone H 106 

416084 L16991 H3.79006 deoxythymidytate kinase (thymidylate kin 105 

430393 BE185030 Hs.241305 estrogen^esponsive B Ixuc protein 106 

447342 AI19926B Hs.19322 Homo sapiens, Sinritar to RIKENcONA 2010 104 

451578 NML016323 K5.26663 cydin^ Imding protein 1 104 

25 444726 NM_006147 Ks.84981 interferon regulatory factor 6 104 

447733 , AF157482 Hs.19400 MAD2 (ntitotic arrest defident. yeast ii 104 

437741 BE561610 Hs.5B09 putative transmemlxane protein; iiomdog 104 

442643 U827S6 Hs.3991 PRP4/STKWDspthitno factor 104 

429358 AB03782S Hs.20O317 KIAA1404 protein 103 

30 410068 AI633888 H8.58435 FYN^inding protein (FYB-12Qn 30) 103 

426746 J03626 Hs.2057 uridine monophosphate synthetase (oratal 103 

409154 U72882 Hs.50842 Inlerfeion^uced protein 35 102 

442173 N76101 HsJ127 KIAA0144 gene product 102 

447400 AK000322 Hs.18457 hypothetical protein aJ20315 101 

35 450962 BES35647 Hs.25723 SiogrenTs syndrome/^leroderma autoanlig 101 

407634 AW016569 Hs.136414 UDP-GicNAc:t)6taGalt)eta-1,3^-acelylgiuc 3.01 

411387 AW842339 Hs.130815 hypothetical protein FU21 870 101 

438662 AA223599 Hs.6351 cleavage and pdyadenytaticmspedfic fa 101 

459107 AA611881 Hs.28S05 ut)iquitin^»njugatins enzyme E2H (homdo 100 

40 430287 AW182459 Hs.125759 ESTs. Weakly similar to l£U5_HUMANt£UKE 100 

416110 Z42262 Hs.322844 hypothetical protein DKJ^64A1 76 100 

435523 T62849 Hs. 11090 memt]Tane-spanning4<iomalns,6ut>f3m9yA 3.00 

448569 BE382657 Hs.21486 signal transducer and acthmtor of trans IX 

410268 AA316181 H3.61635 six transmentrane epitheGal antigen of 100 

45 400200 NM.002788*:Homo sapiens proteasome (pros 3.00 

403330 Target Exon 2.99 

413833 Z15005 Hs.75573 centromere protein E (31 2kD) 199 

403416 At744626 KIAA0564 protein 2.97 

403438 NM_031419*:Homosapiensmoleculepossess 2.96 

50 447942 F12628 Hs.1S5470 hypolhettoal protein MGC1G040 2.96 

427722 AK000123 Hs.180479 hypotheScal protein FU201 16 235 

414806 014694 Hs.77329 phosphalidylserfne synthase 1 2.94 

440086 NM.005402 Hs.288757 v-ral simian leukemia vNoncqgenehom 294 

429547 AW009166 Hs.99376 ESTs 2.93 

55 419121 AA374372 H8.89626 parathyroid honnon&fikehomione 290 

431890 X17033 Hs.271986 inlegi{n,alpha2(C0498,dpha2sut)uni Z89 

417259 AW903838 Hs,81800 diondroitin sulfate proteoglycan 2 (vers 2.89 

418203 X54942 Ks.6375B COC28 protein kinase 2 2.86 

441633 AW958544 Hs.112242 normal mucosa of esophagus spedffcl 286 

60 423425 AA375756 Hs.14449 KIAA1609 protein 286 

412851 At826502 Hs.106149 ESTs 286 

400664 NM_002425:Homo sapiens matrix metsOopro 286 

454140 AB040888 H3.41793 hypothsOcal protein FU 10474 285 

435602 AF21751S Hs.283532 uncharacterlzed t>one manow protein BM03 285 

65 421116 T19132 Hs.101850 re^Mdlng protein 1. cellular 284 

432343 NM.002960 Hs.2961 S100cah:kjnvt)inding protein A3 283 

423767 H1B283 Hs.132753 F^onlyprotein2 282 

413476 U2584g Hs.75393 add phosphatase 1, sduble 282 

„^ 441801 AW24Z799 Hs.86368 ESTs 280 

70 441565 AW953575 H8.303125 p53^uced protein PIGPC1 280 

416539 ¥07909 Hs.79368 epithelial membrane protein 1 279 

428959 AF100779 Hs.194680 WNT1 indudble signaDng pathway prolflin 279 

422947 AA3067B2 Hs.122552 G-2 and S^ase expressed 1 275 

417849 AW291587 H3.82733 nWogen2 274 

75 450434 AA166950 Hs.195870 hypothetical protein FU1 4991 273 

430466 AF052573 Hs.241517 potymerase(ONA directed), theta 272 

431448 AL137517 Hs.306201 hypothetk^al protein DKF2:p56401 278 271 

424874 AA347g51 Hs.326413 Homo8a(rienscONAFU20812fis.doneAO 271 

453633 AA357001 Hs.34045 hypoihelical protein FU20764 271 

80 447854 AW138454 Hs.11594 ESTs 271 

427581 HIIL014768 Hs.179703 igAA0129genBprodud 270 

412636 NM.004415 desmopldkln (DR. OPtQ 269 

420576 AA2g7634 Hs.5492S KIAA1B58 protein 266 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



HS.88S8 bnxnodoni^BQacenl to zinc finger doma 

Hs.154424 deiodinase.iodothyroriine.(ypell 

Hs.64001 Homo sapiens done 25218 mRNA sequence 

Hs.334838 KIAA1 866 protein 

Hs.26176 hypothetical protein FU10261 

H8.122730 EST8,Moderdtely5lmilartoKIAA1215pR> 

Hs.276679 e disinlegrin end metaODpratelnase dome 

NML000179*:Homo sapiens mulS (E. coB) h 

H5.gi417 topotsomeiase (DMA) II binding protein 

Hs.8939 yes-assocjaled protein 65 kOa 

Hs.119286 ESTs 

KS.S8806 hypoCiellcal protein 

H5.100431 smaUinductbte cytokine B subfamily (Cy 

Hs.337251 hypolhelfcal protein MGC2487 

Hs.21 254 TKAF interacting protein 

Hs.1 59499 parattiyrdd honnone receptor 2 



Hs.159557 karyophertnaIpha2(RAGcQ)K}i11.impor 
Hs.198384 prostagtandin^ndoperoxide synthase 2 (p 
Hs.39957 plecfcstrin2(mQuse)hom(^ 
Hs.83883 transmembrane, prostate androgen Induced 
Hs.72924 cytkSne deaminase 
Hs.211579 melanoma cell adhesion molecule 
Hs.227730 integrin. alpha 6 

Hs.54277 DNA segment on chromosome X (unique) 992 
Hs.155694 protein ^resine phosphatase, non-rec^i 
Hs.134403 hypothetic^ protein aJ10858 
Hs.40499 diditopr pCenopus laevis) homolbg 1 
Hs.35403 ESTs, Highly similar to unnamed protein 
Hs.44155 DKFZP5B6G1517 protein 
Hs.75162 mannosB receptor, C type 1 

C5002154*:gii7299015igb|AAF54217.1| (AEO 
H&28980 hypothetical protein inJ14540 
Hs.1721 intedeuldnll 
Hs.1 80383 dual specificity phosphatase 6 
Hs.90077 T&lnteracting factor (TAt£ family homeo 
Hs.172665 rnelhytenetetiahyttRiMiate (dehydrogenase 
Hs.7137 clones 238S7 and 23775 zinc finger prote 
gb:Human parathyroid honnon&feiated pep 
ENSP00000246632*:CDNA FU20261 lis, don 
H5.1 0526 cy^e and glydne-rich protein 2 
Hs^73321 difterenSaRy expressed in hematopoietl 

down syndrome criScai region protein OS 
Hs.2161 complement component 5 receptor 1 {C5a I 
Hs.3838 sennn-lndudbie kinase 
Hs.90708 granzyme A (granzymel, cytotoxic T-lymp 
Hs.83321 neuromedin B 

Target Exon 
Hs^1856 trtnudeotkie repeal contabilng 3 
H&50002 Sffld Inducible cytokine subtomOy A (Cy 
Hs.69752 desmoconin 1 
Hs.13264 KlAAD856protetn 

NlyL00621 2*:Komo sapiens e-phospholructo- 
Hs*75716 senne(orcystebie)protdnaselnhIbtto 
Hs.50500 ESTs 
Hs.10760 asporin (UW class 1) 
Hs.152982 hypothetical protein FU131 17 
Hs.1 70009 transfomiing growth factor, alpha 
H&281295 ESTs 

NM_031891M>mo sapiens cadheiln 2a type 
, Hs.8025 8ng1opoIetin-tiks2 

H3.44313 v-ie) avian retteirioendaUiellosIs vbd 
HS.1875BS ESTs 
Hs^25S ESTs 

H3.44298 milochondilairibosorna] protein 817 
Hs.140489 ESTs, Weakly sln^lar to UNI.KUMAN LINE- 
Hs.20993 htgh^ucose^egutated protein 8 
H5.2488 lymphocyte cytosolk: protein 2 (SH2 doma 
Hs.172766 MAPftnicrotobutoaffinltiHegulaQngkina 
H3^51677 G8nBgen7B 
NM_001429 Hs.25272 EtAbJnding protein p30Q 

Hs.85256 COS antigen, alpha polypeptide {p32) 
NM-000579 H3.54443 dwnokine (OC mofif) receptor 5 

granzyme B (granzyme 2, cytotoxic T-lymp 
KIAA146d protein 
413129 AF292100 Hs,t04613 RP42bomokjg 
432606 NM_002104 Hs^066 granzyme K (serine protease, granzyme 3; 
AA128808 Hs.179902 transporter-like protein 

Hs.22626 tropixnodulln 3 (ubiquitous) 
Hs.134342 TASP for testls-spediteadriamydn sens 
Hs.1334 v-cnyb avian myeloblasto^ viral oncogen 
Hs.129228 galaetoldnase2 



442932 


AA457211 


425071 


NM.013989 


410491 


AA465131 


428698 


AA852773 


451277 


AK001123 


447347 


AA570056 


429505 


AWB20035 


406137 




419594 


AA013051 


443054 


AI745185 


452620 


AA436504 


420552 


AK000492 


42(»31 


AF044197 


434517 


AA635690 


448454 


NM_005879 


425776 


U2512B 


436238 


AK002163 


440676 


NMLQ04987 


425811 


AU)39104 


429113 


D28235 


407804 


AF228603 


452679 


Z42387 


411908 


L27943 


449230 


BE613346 


430024 


AI808780 


458079 


Ai796870 


425345 


AU077297 


423881 


AK001720 


407853 


AA336797 


457819 


AA057484 


408298 


AL1 17452 


413048 


M93221 


403851 




433745 


AF075320 


423903 


M57755 


427700 


AA262294 


419373 


NM„003244 


426827 


AW087805 


440282 


BE262386 


406974 


M57293 


401924 




444190 


A1878918 


420923 


AF097021 


436608 


AA6289B0 


427509 


M62505 


434398 


AA121098 


419490 


NM.006144 


418030 


BE207573 


404927 




438549 


BE386801 


409038 


197490 


411388 


X72925 


445757 


AW44g065 


405069 




414035 


Y00630 


443168 


AI038653 


444301 


AKDOOIjo 


433345 


AI681545 


426471 


M22440 


445019 


AI205S40 


402021 




431886 


NIA.012091 


454219 


X75042 


409571 


AA504249 


450831 


R37974 


408353 


BE439836 


445960 


AI268399 


448356 


AL120837 


429732 


U20158 



426850 8E247670 

427335 AA44S542 
450649 

418460 M26315 
449523 

416975 NM_0O4131 Hs.1051 

433226 AW503733 H5.9414 



446620 

449008 AW578003 

433160 AW207002 

420802 U22376 

423482 BE280172 



Z68 

Z68 

2.66 

Z&4 

2.64 

2.64 

2.63 

Z63 

2.62 

ZS9 

2.59 

2.59 

2i6 

2.S6 

LS5 

2.55 

Z54 

2M 

154 

253 

153 

Z51 

Z49 

Z48 

Z47 

Z46 

Z45 

Z45 

Z45 

Z44 

Z42 

Z40 

Z39 

Z37 

Z37 

Z36 

Z32 

Z31 

Z31 

Z31 

Z30 

Z29 

Z29 

Z28 

Z27 

Z27 

Z26 

Z25 

Z2S 

Z24 

Z23 

Z21 

Z18 

Z17 

Z16 

Z15 

Z13 

Z11 

Z10 

Z06 

Z07 

Z05 

Z04 

Z03 

1.99 

1.99 

1.98 

1.97 

191 

1.90 

1.90 

1.88 

1.88 

1.88 

1.88 

1.86 

1.85 

1.85 

1.81 

1.79 

1.78 

1.77 

1.77 
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434370 


AF130988 


HS.S8346 


ectodys{ri3^ 1» snMdroOc receptor 


419125 


AA642452 


Hs.130881 


B-ceii UUjiyrnpnofns ha (zinc ungiBT pro 


425545 


N98529 


Hs.156295 


Homo Sc^nens, done mGulZ40l« nwNA, oom 


405102 






C1500122D :gli44D9KK}BDlAAi/Ziotl.i| vAr 


433201 


A6040898 


Hs.21104 


K1AA1463 prcAein 


420798 


W93774 


Hs.9993o 


KersuD lU (epiuaiiiiuiTuc nypcTKeraiosB 


437660 


AA333063 


Hs.279898 


Homo saixens cONA: rU23i65tis.cionsL 


414961 


U27266 


Hs.927 


fnyoslrvoindtng prowin n 


428405 


Y00762 


Hs.2266 


dtoHnergic rece])tor» ntcoflnlc, e^>ha p 


422170 


AI791949 


Hs.1 12432 


ant}-Muilefisn honnone 


431846 


6E019924 


Hs^1580 


uropiakin IB 


404468 






Co0Du44Z^.9l{ 1 1 1 ZU0w|rei|NrJU0D9 10.1 1 C 


405779 






NMJ0O5367:Homo sapiens melanoma antigen, 


441129 


AA074904 


Hs^d420 


ESTs, WeaMy ^rralar id tIodsi nypotneD 


427244 


AA402400 


Hs.178045 


ESTs 


411411 


AA345241 


K&55950 


ESTs, Weacjy slmuar to KiAAISoO fmtein 


417777 


AI823763 


Hs.7055 


ESTs, Weany similar to 178885 sennenn 


418367 


AA326035 


Hs^236 


tiypotheUca protein DKFZp434u)718 


440340 


AVV89S503 


H5.125276 


ESTs 


437162 


AW005505 


Hs.5464 


tiiyiotd hormone receptor coacMing pr 


4247S0 


D29956 


Hs.1 52918 


ubiquifin specific pmlisase 8 


429469 


M84590 


Hs.27 


glydne dehydrogenase (decartioxyiating; 


408374 






C160u1364:qi|1 i0D7373|rBip«rJDo7DB9.1t C 


430806 


SE266026 


Hs.31476 


Homo sapiens cDNA FU 13872 lis, done in 








Target Exon 


401258 






NM_030932*:Homo sapiens diaphanous (Dros 


433323 


AA805132 


Hs.159142 


ESTs 


427441 


AA412605 


H8.343879 


SPANX(amBy.meml)erC 


4M707 


A)188613 


H5.4ie90 


desmocoHin 3 


409103 


AF251237 


Hs.112208 


XAGE-1 protein 


451106 


BE382701 


HS.25S60 


N-MYC oncogene 


434804 


AA649530 


Hs.348148 


gb:ns44l05.s1 Na^CGAPJVIvl Homo sapiens 


430686 


NM.001942 


Hs.2633 


desmoglein 1 


423325 


AW068739 


Hs.243770 


ESTs 


406703 


X13100 


Hs.173084 


myosin, heavy polypeptide 3, delete! m 


418827 


BE327311 


H5.4716a 


HT021 


404104 






a00137B*:gll1 171748|sp|P46530INOTC.BRAR 



1.76 

1.75 

1.74 

1.74 

1.73 

1.65 

1.62 

1.61 

1.61 

1.61 

1.58 

1.57 

1.55 

1,55 

1.52 

1.52 

1.51 

1.51 

1.46 

1.47 

1.46 

1.44 

1,43 

1.40 

1.39 

1J8 

1.36 

1.33 

1.31 

1.27 

1.27 

1.23 

1.21 

1.19 

1.03 

1.01 

1.00 



TABLE 4gB 



Ptey. Unique Eos probeset idenfifier number 
CAT number Gene duster number 
Accession: Genbanit accessbn numbers 



PiCBy CAT Numtwr Accession 

413808 2S0S 1 A1570199 At886812 AW867550 Ai921557 AW468096 AI925581 AI679988 AW473623 BE841640 BF061525 AI445703 AI925072 AW883188 

AW863076 BEB41731 AW863167 BE841390 BEM1365 BF374078 BE841760 BE841694 BE841769 AA335110 BE841692 BF374073 AA335204 
BF374079 BEB41713 AA335167 BE841584 AW868103 BE841645 BE841755 AI075336 AW867433 BF3n831 BE841758 AW86891 1 AW863155 
AWB68847 BE841651 AA335145 BE841670 BF374260 BF374088 BE841661 BE841728 BI335729 BE841739 BE841663 AW863104 AA335201 
AA335143 BF906965 AW867493 BE841505 BF374250 BE841766 BF373837 AW863191 BE841705 AW863154 AW868673 AW86731 1 AA335896 
BEe41763 AW863407 86937102 BF374252 BF374247 BF3742S5 BEB41785 AW029590 AW131278 AI801021 AW058240 AW058400 AW029230 
AVW29432AW130609AW029128AW130469AI570155AI620272AVV029259Ai801389A18^^ 

AW131174 AI581069 AI225028 AI446689 AI923321 AI439430 AI801502 AI679707 AW028944 A19336B4 A1801724 AI537779 AI354652 AI470250 
AI536872 AI891 161 AW868019 AW006034 AI702599 AA335192 AA335165 AA3351 89 A1933725 AW044393 A1888797 BEB41677 BE841681 
AA335141 AW008176AA335223AI868837AW868622Ai803901 AW005718Ai538062A)2622S8 AI580678AI445803A}445394AI86ai68 
AA335144 AI926349 AA33S21D AA334919 AA335163 AA335216 A1678342 BF374135 At932922 AA335214 AA335109 AI570325 AI452619 Ai926109 
AI45348B AI676e06 AW869289 AW869211 BE841580 AI679368 AI888882 AI926170 BF50B305 AW869315 AA334g26 BE841712 AW026584 
AA335200 BE841764 AV730339 AW474979 AI286344 Ai446430 Ai537612 AA3351 66 AW868051 AI679133 AI949520 BE841652 AI949532 
BE937113 BE841789 BE841643 AW130556 BEe41761 AW868716 AW868698 BE841669 BE937108 AA335158 AA335153 AA335159 AW887404 
AW868692 BE841742 AW86a711 AW867546 BE841699 AA335198 AA335146 AW868150 BE841660 T99129 BE841740 BE841714 AA335154 
AVV868815BF373812BE841657BE841780AI440394AA335215AA335202AA335162AA335160At8016S6AI6784996^^ 
Ai826057AI572459AI932773AA335197AI611752AA335224AI452592AA335147AA335149AA33492BAA335114AA335n 
AW029395 AI570325 BF373838 BE841691 BE841776 AW883485 BF374093 AW130376 BE841732 AI446393 AI446781 AW867547 AW029012 
AA335227 AW869307 AVV869350 AW868709 AW869407 AW005017 A1679252 AI925523 AW1 51553 AW863109 Ai44S917 AI799620 A1921607 
AW008153 A1520957 AI610620 AI679828 At868151 AI537839 AI679547 T26354 AI282567 AA335207 R83655 BFg06963 AW131160 AI925626 
AW029396 AW028445 AW008410 AW152586 AW008476 AI801040 AI4S3669 Ai621 200 AA33492S BF374069 BF374075 N53208 BF374246 
AW868723 BE937150 AAg55002 AW863338 BE841767 

421582 13358 1 X00474 NiyLC03225 X52003 1^12075 Bi765761 AW950155 AI571948 81760569 AA3084aO AA568312 B!761955 AA507595 AA614579 AA614409 
BF747598 BM142326 AA307578 AI925552 AA578674 AA582084 AW009769 AA514776 AAS88034 BG271505 AA858276 BM142503 AW05O70O 
A1307407 A1202532 AA524242 AI909772 A1970839 BG236516 AW750216 AA587613 AJ909749 AI909751 AI910083 AA614539 R55292 AA507418 

453331 16559 1 BG571303 AA410586AA035018BG5721 17 8G620022 AA147247 B6005785 BG014448 R31981 H02668 HI 2498 1^6203 BF992069^ 
T49904 R75732 81057974 T53681AA147933N50695I«8588 R2S671 R31935 R25110R36105AK055628BE157467AW663674AA190993 
H01642 BF510304 AA526915 AA746952 Ai161014 AA099554 BG572534 AI803329 A1809932 AI808765 AA41 1449 AI378760 AA976929 AI378620 
AA909684 R75632AI360919A13S0463AW069127AA411621AA742532H12451 BE208298 H03612H12839NS8781 R75957BFg96464AI240665 
6F989591 BI056086 BG001590 BF107035 

406687 0 0 ^t31125 

450375 16559,3 BG570705BG572749AW606284 H04021 AA151166AW954405AA131254BG056461 W46291 H01532H04384 H03231 AAB52875 H04410 
H59605 BE1 57601 AA1 13758 

415989 10194 1 6C0133896C017398Al023543AA191424AI267700AI46g633AWg56465AVV953397AA172056BE940298BF9^ 

BG285837 AI720344 BF541715 AA355085 AA172236 
^6991 29771 1 AIW1536AK05613SBM474813BE887303AK022914AW581 996 AVV81 2945 BE882302AA1 34266 BI043873AA01^ 

AU128438 BE364458 At^967 BI857117 BF688525 BI465223 BM460132 AU129877 B1222283 BG1715S2 BI043S44 BG4g6295 6G750710 

6126542 86108520 AU150719AW510354Ai554256AtJ53968AA1910gSBF132635 
454241 6858Q6J BE1849426E184946AWZ384146E144666 
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452203 2630J BG014081 NM.000593 X57522 121200 L21207 121206 L212Q5 L212D4 ALS61404 AL546423 AL560492 AL556882 AL541576 AL550854 BI823S19 

B177Q023 AL5549G9 Bt489S06 AI304693 AW295947 BM146642 X57521 BG820143 BE8S8390 F06770 F12630 BM423610 AL561516 BM009470 
BG7429B1 AA279685 AAB47441 AA313737 BF172639 BFe97216 BF914190 BF903647 S702n A1S69694 AW073296 AI361433 AA564644 
AA487429 BEB58232 AA838810AI539114AI719375 AI829129BG057675 AM23422AU158860 BE300555 AW1 70777 AA5B6356AL57 1889 
AL556B50AL576404A1.582B00Bi2S65446F342301 BGB7S994AA05445BAA353161 AI940434BE816522AL577638AI479650AW1 50377 
AU1S4395AVy^127lAl032220At819778Al348733AW77115DAVV512525AI249904AA27g809AI352S49AW 

6E271141 Ali81932 A1541575BI8191B4 AV660igOAL558475A1620020AVVQ898a8 AVra79179Z21518 F0465t AI783951 T57198 

AI433367 T788S2 AL5549S8 AA355648 AL582619 BE874601 BF804669 AL5744S8 BM145502 AI266514 AIS38823 A)475626 AA948210 AA8B4054 
AA487637 AA03tB44 AA535221 AW794255 AW361447 BE788505 A1682892 AAB30989 AA882356 AA653084 BM009154 AA135727 H05927 
H23433 R42244 N79997 AW3666B5 AW366601 AA678742 AL5S5474 AA135770 BE7740S) BF9U200 H88457 AA627748 BI560216 BI753586 
408660 105529_1 AW975281 AA664988 AA525775 AA056342 A1538978 

400245 12188 1 X79449BC017853 AL121035BF195384AW119044AI028023AW451110AI971911 AW015069AI079170AI3763^^ 

AA449S79AA740864 NM_001111 U18121 AL567297 BG773801 BF973874AV6871O4AA527579AA843525BE706355AIO745B9A1523475 
BE8g0249 AW406263 BE074258 AV729485 BF809610 BG058619 AA677244 BE179838 AA622264 A1460106 AA74041 1 AI499168 A1078223 
AI682923 BE696559 AW3753B5 AA788739 BG984978 Z40874 T17054 F09669 AW844043 U10439 BI711870 AW245957 AU158S67 AA679305 
AA679315 W72510 AJ346029 BG059762 AW251062 AA132373 AI925621 AIB60230 AI340172 AW192891 AI707980 A1094937 AIQ421 15 AI200901 
BE328452 AA64467B AA551209 BE351065 AA970761 N68509 AW002028 AA160826 A1422774 AW8731 14 AW073597 AW6644B3 AI218710 
AW020550 AW190607 AI984545 AI871921 AI333970 AI452887 AI8ie335 AA398665 AI554424 AI274187 BE465703 AW512940 AW241366 
Ai923954 AAS76649 AW168294 AAB13181 AA912168 A)049738 AW514073 AA548255 A)569630 BE710031 AA244182 At341697 AA563904 
AI537990 AW517908 AW172943 Z39498 AI750294 AWl 50414 AI253293 BEB25720 T31860 AW150775 D2031 0 AA^ 
, AL038957 BF910979 AA352297 BG988142 AW372175 BF229106 AW866705 BE093482 BG990395 A1499917 AA054452 H05484 AI828502 

BM467331 AU140570 AL135417 BF947202 AW391926 BE813418 BF99B473T92021 61021048 8M048783 AWS01366 AW501342 AW501549 
BE939021 8E707147 BE160974 BE305207 N49011 AA947119 AA678801 BE536876 AW697428 BG329648 8 6818540 BES42344 81919250 
BI253018 AW1309g6 6E074249 BE895428 81034862 BE083277 BF952166 

414883 8371 2 AF274943 BG494894 AI719075 AA908783 AI935150 AI422691 AA910644 AA583187 BM272167 A1B28996 AA527373 AW972459 AI831360 

AA772418 A1033892 AA100926 AU154749 AI459432 A1423513 AI094597 AA740817 A1991988 AI090262 AI312104 81256707 AA459522 AA416871 
A1075239 AI339996 AA701623 AI139549 AI336880 AA633648 A1989380 At36283S AA399239 A1146955 6F514270 N92892 A1348243 AI278887 
AM59292AI494230BFS07531AM92600AA962596AW613002AA293140AA235549BF108854AA954344N49682AM5^ 
AW300758BE220715BE220698BE569091 BM009647BF900351 A)537692AI203723AI857576AA584410AW371667BM172363 

414110 1634167.1 6E253764 BE2S0764 BE255757 BE251752 BE251925 

450506 1529 1 IK)9278 NM J04460 U76833 AF007822 AL550894 BG203919 AL575714 A1478772 AW022667 AW61 3820 AI435793 A1051768 AI200109 AA43561 1 

BG208151 A1446661 BG215551 BM449645AW630055BG620125AL550932AW471 133 AU1 36648 BE925603 8F828688 
BE141S77 8E141585 BE141587 
AL109688 R23665R26578 

BC005265 BG176720 AW006027 BM352064 AW026316 AI635822 AI880584 A1693769 A109221 1 B1492387 AJ400449 AW1 66297 BF939910 
AA2322S2 AW021432 Ai333893 AA494308 AA854899 AI436795 AW0692S6 AA682373 A!092748 AA993184 AI126077 A1081758 A)24(^ 
A)261 863 AI378423 AA465237 A)376096 AA035579 AI087306 AA448162 AA129977 AI090g03 A1080688 A)288939 N33004 Ai601240 AW021546 
A)370773 A)086064 AA669528AI 250053 A]870113 AAB53181 AA856014 BG055562 BG9395S9 AW08076S AA032283 AW467587 H40506 000762 
NWL002788 AA641134 AI58229S A1417525 AI563975 AI093566 AI707743 AI290741 AW073417 BE875418 BM2640768G876884 AI680535 
AW854219 BE774635 AW854212 BG952443 AW854221 AW854208 BE156348 8EB43056 AW858991 8E937569 BG878291 8G876450 AW819099 
AI908570 AA449871 AU135228 BM478404 BF126298 AA375499 AA248473 

M77830 NM_004415 AF139065 B66B1 1 15 BG740377 8171 2964 BG000656 AA128470 81438324 H27408 8E931630 8E16716S AW370827 
AW37081 3 J0521 1 BG698865 BG740734 BG680618 86739778 8I76S807 BM353403 8M353248 AW1777B4 AW205789 AW951 576 AW848592 
BE182164 BF149266 6E940187 81060445 81060444 BF350983 8E720095 8E720069 8E715154 8E082584 eE082576 BE004047 AA857316 
BI039774 BE713818 8E713548 AW170253 BE160433 81039775 AW886475 8M462504 BE931734 8F149264 AA340777 BF3811&3 8G621737 
AU1 27260 AW364859 BF993352 BG223489 BE819009 BF381184 BE71S9S6 R58704 AABS2212 AW366566 8)090358 BF087707 BE819046 
BE819005AA3771278E073487BE81906gBE819048 81036306 8G990973 81040954 8F919911AU140155A19S1766AI434518A^ 
BF752969 8E837009 BE925826 BF149265 AW9956t5 BE814264 81039782 AU140407 8E144243 BE709863 8F985642 BE001 923 BF933510 
AW265328 BG436319 BE182166 AW365175 AWB47668 BE818280 AW177933 8F873679 AW178000 8E082526 8F476866 BF086994 BF592Z76 
BE082507 BE082S14 BE082S05 BF873693 AW066840 AW847678 8F804153 AW365157 BE813930 BE002030 AW365153 8E184941 8F749421 
BE184920 BF839S62 BE184933 BF842254 BE698470 BE931048 8Fg99889 BF368816 BE184924 6E159846 8E714632 BE184948 6G986845 
AA131128 AA099891 W39488 C04715 6F096124 BE885341 AW799304 Ali03116 BE149760 BE705g67 BE7059^^ 
AW376699AVV376817AW3766978G005097BF7511158E696084AVV848371AW376782AVV848789AV^^ 

AW997139 BE865474 BE18S187 BE156621 BE715089 BE713297 BE7132g8 BE179915 AW799309 8F872345 BF08d676 6E705939 AW752599 
BG005197 BF3S0086 BE715196 BE71S155 BF762396 6F093B17 BF831 190 BF752409 BE006561 8G959922 BF094633 6F094748 BF094583 
AW377G99 AW607238 BE082519 AW377700 BF349467 Alig0590 A1S54403 A1392928 AU158477 81467252 AU159919 AI760816 BF082516 
AI439101 AA451923AI340326 AI590975BI791553 AI700983AI142882 AA039975AA946936 AA644381 BM314884AA702424 AI417612 
AW190555 A1220573 AI304772 AI270345 AI527383 AA552300 A1911702 AW1 66807 AI346078 W95070 AA149191 AA026854 AI830049 AW780435 
AI078449AI819984AI8S8282 81468588 AI860S84A102S932AA026047AA703232AA658154AA515500AW1920B5^^ T77861 AI927207 
AI20S263 BF082491 AW021347 AI568098 BE939862 AAg88866 D12062 AA0S6527 AA782109 W19287 W02156 AW19003B AAD22701 T87181 
H44405 AI910434 BF082513 A1494069AI270027 At635878 AA128330 BG681425 BE706078 R20904BG880059 6G676647 BF764409 AA026654 
AV745530 81762796 BG287391 AW7g8760 BE706045 8E926470 AW7991 18 BF08799B 6E002273 AW879451 AI571075 BE087786 AV721320 
AI022B62 N29754 C03378 N84767 AA131077 H30146 8E714290 A1686869 A1568892 AI915596 AW105614 At887258 AI538577 8E926474 
BE067737 BG319486 AA24768S AW788883 AW103521 BFg89173 AW860878 BE939707 BE185750 BE714064 BE713903 BE71366B BE713763 
BG95D164 BE713810 AW365151 BG955489 BE00S272 BF915937 AW365148 Aig05927 BF992780 AW853812 BG954443 B1770B53 6G679406 
BG740832 BG681087 BG698430 AA455100 T87287 BEe96209 BE696210 BI089483 BE006273 BE872225 AW391912 6E925515 BG677012 
B6741970 AA026480 6E705999 BG677157 BEOOg090 BG68137B 6E712291 8G961498 8G876984 81040941 AA337270 AW384371 AW847442 
81058659 BE813565 W9504BW25458AW177786AA025851 BE931 733 BF1 54837 BG949393 8E71 4441 AW996245BE711801 AI284090 
BE064323 BE719390 BE940148 86991212 8F375714 BF349522 6G996267 T48793 BI013292 BE001925 AW365156 AW365154 AW606653 
8F763109 BE931637 BE167161 8E713879 BF354008BF678728 H90B99 AW365145W38382 AI498487 
436608 32229J2 BC015981 AJ30161S AA628980AI126603BF184719 



455813 151S590J 
439780 4908Z.1 
400200 3806.1 



412636 1438J 



TABU46C 

Pkey: Unique number cooespondlng b an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are GenbanK IdentiTier (GO numbers. Dunham, et aL' refers to the pubflcafion entilied The DMA sequence of 

human chromosome 22* Dunham, et aL (1999) Nature 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
Ntposilion: Indicates nucleotide positions of predicted exons. 



Pke/ Ref Strand NLposltton 

401781 7249190 Minus 6321&83435^83531-83658^4(^1.B423 

559 



wo 03/042661 



PCTAJS02/36810 



401780 


7249190 


Minus 


28397-28617^20-2904531 35-29296^941 




400666 


8118496 


Plus 


17982-181 15.20297-20456 




401765 


7249190 


Minus 


165776-165996.166189-166314,166408>16656 




401747 


9789672 


h6mis 


1 1 659$>1 18816.1 191 19-1 19244.1 19609-1 1976 




400665 


8118496 


Ptus 


16879-17023 




402^ 


2996643 


Minus 


4727-4969 




402075 


8117407 


Phis 


121907-122035.122804-122921,124019>12416 




405770 


27^037 


Ptus 


61057-62075 




401994 


4153858 


l\/Iinus 


42904-43124.4321 1-43338,44607-44763.4519 




404996 


6007890 


Ptus 


37999^145[38652-38998,39727-39872,4055 




404240 


5002624 


Minus 


116132-116407 1166S3-116922 




402447 


9796640 


Rus 


47605-47729 51696-51 821 5207D^22S7 5330 




402992 


7767907 


Minus 


42137-42515 


* 


402408 


9798239 


l^nus 


110326-110491 




404286 


2326514 


Ptus 


5108&51301 




405387 


K87915 


Minus 






404287 


2328514 


Ptus 


53134-53281 




402995 


2998643 


Minus 


5962-6216 




405545 


1054740 


PbiS 


118677-118807 119091<1ig(29GLl216%>12182 




401797 


6730720 


Ptus 


6973-7118 




402294 


2282012 


Minus 


257S>3000 




401961 


4581193 


Munis 


124DS4-1242Q8 

i mr^w^r^ 1 4b^ft Vv 




404440 


7528051 


Plus 


80430-61581 




405386 


6579238 


^^us 


4095W1297 




404171 


9930793 


Plus 


173667.173783,176876-177055 




405778 


7280331 


Ptus 


18748-19757 




406400 


9256298 


Plus 


1553-1712.1878-2140.4252.4385.5922.6077 




401176 


9438469 


Mbuis 


2047&^20734 




404170 


9930793 


Ptus 


168838.169248 




TABLE 47A: 








Pkey. 


Unique Eos probeset identifier number 




ExAocn: 


Exemplar Accession number, Genbank accession number 




Unigene!!^ 


Unigene nuntex 






Unlgene 'n)Ie:Unlgene gene title 






R1: 


Maximum of esophageal tumor Als divided by the 381h percentile of (he norma! esophagus Ais 




Ptey 


ExAocn 


UnigenetD 


Unigene inte 


R1 


400289 


XD7820 


Hs:22S8 


matrix (nataDoprolelnase 10 (stpomelysln 


31.70 


411243 


AB039888 


Hs.69319 


CA11 


3o[l2 


418007 


M13509 


Hs.83169 


mainx metelloproteinase 1 (interstitial 


18.46 


444325 


AW1S2618 


Hs.16757 


ESTs 


18.22 


444381 


BE387335 


H$.283713 


ESTs. Weaidy similsr to S64054 liypotheli 


17.52 


421379 


Y15221 


HS.1039B2 


smaD indudUBcyUMnesiibfaniBy B (Cy 


16^8 


400666 






NM.002425:Hofno safdens matrix metanopro 


15.59 


425211 


M18667 


Hs.1867 


progastricsin (pepsinogen C) 


15.22 


425679 


X05997 


Hs.159177 


lipase, gastric 


14.60 


432239 


X81334 


Hs.2936 


matrix metaHoprotdnase 13 (collagenase 


13!l4 


431723 


AW058350 


Hs.16762 


Homo sapiens mRNA: cONA DKFZp564B2062 (f 


12.60 


446619 


AU076643 


Hs.313 


secreted phospltoprotetn 1 (osteopontln, 


12.00 


453331 


AI240665 




ESTs 


11.20 


431620 


AA126109 


Hs.264981 


2''5'-oi!ooadenylate Byntt»tase 2 (69-71 


10.77 


408380 


AF123050 


Hs,44532 


diubiquilin 


iOJ32 


423673 


BE003054 


Hs.1695 


matrix metatioproleinase 12 (macrophage 


10*32 


412326 


R07566 


Hs.73ai7 


small tndudbie cytokine A3 (homologous 


10^ 


419216 


AU076718 


Hs.164021 


smafl Inducible cytokine subfamily B (Cy 


10.18 


408243 


Y00787 


Hs.624 


intBrieukin8 


9.80 


414359 


M62194 


Hs.75929 


cadherin 11, type 2, OB^herin (osteob 


9.75 


450375 


AA009647 




a distrriegrin and metaUoprotetnase ctoma 


9.12 


407366 


AF026942 


Hs.17518 


gb:Komo sapiens clg33 mRNA. partial sequ 


&88 


433447 


U29195 


Hs.3281 


neuronal pentraxin U 


&64 


421508 


NM.004833 


Hs.105115 


absent in mdanoma 2 


6.46 


452862 


AW378065 


Hs.8687 


ESTs 


6.34 


432828 


AB042326 


Hs.287402 


chondroitin 4^]fblransferase 


7.92 


452281 


T93500 


Hs.28792 


Homo sapiens cDNA RJ11041 fis, ctone PL 


7.86 


409757 


NtiLOOisgs 


Hs.123114 




7.62 


452838 


U65011 


Hs.30743 


preferenlidDy expfossed antigen in meta 


7!60 


413670 


AB000115 


Hs.75470 


hypolhelicai protein, expressed in osleo 


7.58 


452410 


AL133619 




Homo saito mRNA; cONA DXFZp434E2321 (f 


7.48 


437330 


AL353944 


Hs^llS 


Homo sapiens mRNA; cONA i3KFZp761J11 12 (f 


7.44 


406687 


M31126 




matrix metaOoproteinase 11 (sliomelysin 


7.24 


430280 


AA3612S8 


H3.237868 


interieu)dn7raceptor 


7.18 


439343 


AF086161 


H3.114611 


hypothefcal protein PUIISOS 


7.13 


429228 


AI553633 


Hs326447 


ESTs 


7.04 


421110 


AJ2S0717 


Hs.1355 


calhepsinE 


6.98 


414004 


AA737033 


Hs.7155 


ESTs. Moderately similar to 2115357A TYK 


6.88 


424321 


W74048 


H1176S 


iymphocyte-specifk: protein tyiosina Un 


a88 


406673 


M34996 


Hs.ig8253 


m!|)or tdstocompatOrinty complex, dass 


&72 


421562 


AI910275 




IrefbO factor 1 (breast cancer, eslioge 


8.52 



wo 03/042661 



PCT/US02/36810 







nS.i73lo 


Konto sapiens cigS nTOlAi psrilal seiiuencs 


w.1v 


Atvutn 




Ue CAM 


nOmo Daptens cuna. rxj£Btn ns, cione n 






AMUMAtnC 

AWOlAofD 




bo IS 




41WDU 




LI- Qe*lCQ 

H5.o525o 


CD8 oiGgen, alpha polypepUo (p^ 


R 19 


41 




Ua mil/ 


Qfowth suppressor 1 


O.W 






nS.io1or4 


intBnerojvtrHJucea protein win vsnwn 


IS Mi 
OiOO 


413441 


A13Z9374 


Hs.75367 


Src^o^apter 


9.00 


427337 


Z48223 


Hs.1 76663 


Fc (TBsment of IgG^ low affii^ llib, r 


<; fli 

9.01 


417715 


AWbbaDO/ 


HS.OOODO 


ESTs 


9.rO 


413808 


J00287 




Hofno sapiens mRNA for caMesmon, 2t UTR 


9.DJ 


400685 






NiVLuu<44o.nOfno sspiens msnx fneteiiDpro 


9>0U 


42440B 


Al 79401 d 


n5.i4o4ZO 


coOagen, type V, sifi^ 1 


9.90 


418299 


AA//3bjU 


nS.boabo 


inieQrin, oeis 2 (anosen ijuio {pUb), ly 


K AA 


444527 


NM_0u540o 


Hs.l1383 


small induaoie cytoxine suDismiy A (uy 


9.4* 


42o3oB 


Dt:44QD44 




matrix melallopnAsinase 3 (stnomelystn 


K An 
0.4U 


416768 


AA363733 


Hs.l032 


fegenerating tslet^erlved 1 alpha (pane 


9.00 


430413 


AW842182 


Hs.241392 


small inauabie cytoxine A5 (RAnTES) 


5.08 


427509 


M6K05 


Hs.2161 


conipleinenl component 5 teceptor 1 (CSa 1 


c no 


422530 


AVV972300 


Hs.11d110 


bone manow stmnal cell enfigen 2 


9.U4 


413278 


8E563085 


HS.B33 


inteiferon-stimulated protein, 15 kOa 


4.92 


436856 


At l^ft^ee 

AM69355 


Hs.1 27310 


ESTs 


4.60 


426711 


AA383471 


Hs.343o00 


cons^ed gene ampGfied in osteosarcoma 


4.bU 


421362 


AKDOuOSO 


Lis 4n4<m 

nS.i 03853 


1. -M--M— t n r minim CI ImfU'k 

hypotnetical protein RJ20049 


A tn 
4.00 


452401 


NR4-007115 


Hs.29352 


tumor necrosis factor, alphaMitted pro 


A AH 
4.40 


404240 






N»LD iBSouinomo saptens major nistocompai 




435523 


T62849 


Hs.11090 


membrane^pannlng 4-domsdns, sutrfamOy A 


A 'iA 
4.04 


437763 


A A ACAtCA 


Hs.5831 


tissue biNlritor of metaUoprotetnase 1 


4.£a 


425139 


AWq3048B 


HSJ25338 


protease, serine, 23 


A OA 
4.4*1 


415989 


AI267700 




ESTs 


A on 


408202 


A AO^mn 
AAZZ/710 


nS.4,5o5D 


Ui\r^OODLl5i prOtBul 


All 


450701 




ns.^oo4D/ 


kMnA4hafw»ol nmtAJn YD 1101)1 CI 

nypoineucai proiein ArjusDioi 




423271 


W47225 


ns.i 2)5256 


t«^M4M>l#lM 4 Wain 

intefleuKin i, beta 


A M 


414774 


X02419 


Hs,77274 


piasmino9en ectivator, uroldnase 


0.%nl 


443907 


AUD7d484 


Hs.9963 


lYKU prenin tyrosine Kinase otnotng pro 


^ on 


424687 


J05070 


HS.1917M 


m^rbt metfilloprotdnase 9 {gelatinase 8 


O.O0 


444006 


Bc39^85 


Hs. 10086 


type 1 transmembrane protc^ Fn14 


■a OA 
O.OD 


414915 


NM-002462 


HS.7B391 


myxovirus (Influenza] resistance 1, homo 


O.iD 


408122 


A1432d52 


nS.42BZ4 


nypoineticai protein rui iD71o 




408049 


AW076098 


Hs.345588 


desmoplaKin (DPI, DRI) 


0.44 


431629 


AU077025 


H5.265S27 


Inteneron. alpha-tnductble protein (do 


0.0/ 


435370 


AI9d4074 


H3.225O0O 


bbis 


^ 90 
O.i^ 


443378 


AW392S50 


H5.9280 


proteasome (prosome» macropain) subunit 


9 4Q 
0. iS 


443071 


AL080021 


Hs.8986 


complement component 1 , q subcomponent. 


3.18 


409154 


U726B2 


Hs.50842 


lnlerferoi>4nduced prot^ 35 


0.10 


445417 


AK001058 


Hs.12680 


Homo sapiens cONA FU10196 lis, done HE 


3.12 


413142 


M81740 


Hs.75212 


omttMne decaiboxylase 1 


3.00 


406S46 


M33600 


Hs.308026 


m^ histocompafibi% complex, class 


2.76 


402992 






Target Exon 


2.57 


452304 


AA025386 


Hs.61311 


ESTs, WeaWy slmuar to S10590 cysteine 


2.54 


41S245 


AA088767 


^.83883 


transniend)rane, prostate androgen Induced 


Z52 


413945 


NM.000591 


H$.75627 


CD14 antigen 


Z51 


423225 


AA852604 


H3.1 25359 


Tlry-I cell surfece antigen 


Z50 


443883 


AA1 14212 


H3.9930 


serine (or cysteine) pr^^ase inNbSo 


2.48 


416149 


X12451 


Hs.78056 


caihepsin L 


*i m 
LAl 


42S247 


NML00S940 


H5.155324 


matrix mstanofvoieineso ii (sirameiysin 


^40 


410422 


AU)42014 


HS.6334B 


Homo sapiens, done MGu 1520% mRNA. com 


^49 


413936 


AF1 13676 


Hs.297681 


serine (or cysteine) proteinan inmbito 


Z45 


409202 


AA236881 


Hs.51043 


hexosanvnidase B (beta pdniepBde) 


9 9Q 


422562 


AI962060 


Hs.l 18397 


AF - A 

A&olnding protein 1 


^.49 






H8.9661 


ptotBssome (piosomB, mscropaln) subuid^ 


2^ 


444652 


BE513613 


Hs!ll538 


.acdn related protein 2/3 complex, subun 


2.19 


412471 


M63193 


Hs.73946 


endothelial cell growQi factor 1 (platel 


Z19 


449717 


AB040935 


K3.23954 


cerebTBl cell adhesion molecuie 


2.03 


417389 


BE260964 


H5.62045 


midldne (neurite growth-promoting factor 


2.03 


428981 


BE313077 


Hs.93135 


ESTs. WeaMy sImUar to ALU^HUMAN AUU 8 


1.83 


445109 


AF039916 


Hs.12330 


ectonudeoslde triphosphate dlphosphohyd 


1.79 


406778 


H06273 


Hs.101651 


Homo sapiens mRNA; cDNA DKFZp434C107 (fr 


1.70 


408716 


AI567839 


Hs.151714 


Homo sajdens mRNA for KIAA1769 protein. 


1.69 


412773 


H15785 


H3J4573 


similar to vaccinia virus Hinditi K4L OR 


1.66 


414024 


AA134712 


Hs.22410 


gb:zm79g08.r1 Siratagena neuroefAheSum 


1.65 


426530 


U24578 


Hs.278625 


complement component 4A 


1J8 


414945 


BE076358 


Hs.77687 


lymphocyte antigen 6 complex, lociis E 


1.52 



TABLE 47B ^ , 

Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Gent>ank accession nund)ers 

Pkey CAT Number Accession 

453331 16559 1 BG571303AA410586AA035018BGS72117BG620022AA147247BG005785 B6014448R31981 H02668H12498R3620^ 

T49904 R7S732 BI057974 T53681 AA147933 N50695 R68S68 R25671 R31935 R251 10 R38105 AKD5S62B BE157467 AW663674 AA190993 



wo 03/042661 



PCT/US02/36810 



H01642 BF51 0304 AA526915 AA746952 A!161 014 AA099554 B6572534 AI803329 AIB09932 AI808765 AA41 1449 AI3787G0 AA976929 AI378620 
AA909684 R75632AI360919AI350463AW069127AA411621 AA742532 H12451 BE208298H03612H12839r^1 R7S957BF9g6484Al240665 
eF989591 Bt056088BG001S90BF107035 

450375 16559.3 BG570708 BG572749 AW605284 H04021 AA151166 AW954405 AA1312S4 BG0664S1 W46291 H01532 H04384 H03231 AA852876 H04410 
H59605 6E157601 AA113758 

452410 59661.1 AL133619 A1435410AA622747 AWZ72464AI21S594AIS7375BAI476447AI60412BAI5B1345AI026826M AA255162At5S9724 
AI493388 AA614641 A)125754 AI214351 AI567080Ar200813 AI476629AI685732 AA602400AA730140A1565082AI269603AI807095 AA90^ 
AA505S09 AI204595 AIS82930 AI688077 AA757883 AA730154 AA664048 81831663 AI734138 AI734130 AI732734 AW043563 AI741241 AI732741 
BF111446BE677727AA437369 AA426284AA433997 AA425820 

408687 0^0 M31126 

421582 133S8.1 X00474 NM.003225XS2003 Ml 2075 81765761 AVI/SS0155 AI571 94881760569 AA3O840OAASS831 2 81761955 AA507595AA614579AA614409 
81747698 BM142326 AA307578 A)925552 AA57B874 AAS82084 AW009769 AA51 4776 AA588034 86271505 AA858276 BM142503 AW050700 
AI307407 AI202532 AAS24242 AI909772 A)970639 8G23651 6 AW75021 6 AA587613 AI909749 A)909751 A)910083 AA614539 R55292 AA5074 1 8 

413808 2905J AI570199AI888812 AW867550 A1921557 AW459096 AI925581 AI679988 AW473623 BE841640 BF061525 AI445703 AI925072 AWB63188 

AW863076 BE841731 AW863167 BE641390 BE841365 BF374076 BE841760 6E841694 BEB41769 AA335110 BH641692 BF374073 AA335204 
BF3740796E841713AA335167BE641564AW868103BE841645BE841765A!076336AW867433BF373831 BEB41758 AWB68911 AW863155 
AW888847 BE841651 AA33514S 8E841670 BF374260 6F374088 8E841661 8E841729 81335729 BE841739 8E841663 AW863104 AA335201 
AA33S143 BF906985 AW8674S3 BE841505 BF374250 BE841765 BF373837 AW863191 BE841 705 AW863154 AW888673 AW86731 1 AA335896 
BE841753 AW863407 BE9371026F3742S2 BF374247 BF374255 BH841785 AW02g590 AW131278 AI801021 AW058240 AW058400 AW029230 
AW029432 AW130609 AW029128 AW130469 AI5701 55 AI620272 AW0292S9 AI801 389 AI888662 Aig26902 AI601799 AI61 0344 A)452852 
AW131 174 A1581 069 AI225028 AI4466B9 AI923321 AI439430 AI801502 AI679707 AW028944 AI933684 AI801724 AI537779 AI354652 AI470250 
A)536872 AI891151 AW668019 AW006034 AI702599 AA33S192 AA335165 AA335189 AI933725 AW044393 AI888797 8E841677 8E841681 
AA335141 AWD08176AA335223AI888B37AW868622AI803901 AW005716AI538062AI282258AI580S76 A1445803AI445394AI868168 
AA335144 A1926349 AA335210 AA334919 AA335163 AA335216 AI678342 BF374135 Afg32922 AA335214 AA335109 AI570325 AI452619 AI92610g 
A1453488 AI678606 AW869289 AW8G9211 6E841580 AI679368 A1888B82 AI926170 8F508305 AW869315 AA334926 8E841712 AW026584 
AA335200 BE841764 AV730339 AW474979 AI286344 AI446430 A1537612 AA335166 AW868051 A1679133 AI949520 8E841552 AI949532 
BE9371 13 BE841789 BE841643 AW130556 BE841761 AW868716 AW866698 BE641669 BE937108 AA335158 AA335153 AA335159 AW667404 
AW868692 BE841742 AW866711 AW867546 6E841699 AA335198 AA335146 AW888150 BE841660 T99129 BE841740 BE841714 AA335154 
AW888815 BF373812 e£841657 6E841780 AI440394 AA335215 AA335202 AA335162 AA335160 A1801656 AI678499 BF374019 AW130236 
AI826057 A1572459 A1932773 AA335197 AI611752 AA335224 AI452592 AA335147 AA335149 AA334928 AA335114 AA335111 A1567048 
AW029395 A1570326 8F373838 8E841691 BE841776 AW863485 BF374093 AW130376 BE841732 A1446393 AM48781 AW887547 AW029D12 
AA335227 AW869307 AW869350 AW868709 AW859407 AW005017 AI679252 A!925523 AW151553 AWB53109 A1445917 AI799620 A)921607 
AW0081 53 AI520957 AI61 OS20 At679828 A)B68151 AIS37639 AI679547 T28354 AI282567 AA33S207 R63656 BF906963 AW131160 AI92S626 
AW029396 AW02844S AW008410AW1 52586 AW008476 A)801040 AI453669 AI621200 AA334925 BF374069 8F374075 N53208 6F374246 
AW868723 BE937150 AA955a02 AW863338 BE841767 

415989 10194J 6C013389BC017398AI023543AA191424AI267700AI469633AVV9S8465AVV953397AA172056BE940298BF90^ 
BG285837AI720344BF54171SAA355086AA172238 

TABLE 47C 

Pkey: Unique number corresponding to an Eos piobesel 

Refc Sequence source. Ttie 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham, et a!.* refsis Id the pubUeaHon entitled *The DNA sequence of 

liuman chromosome 22* Dunham, et aL (1999) Nature 402:469-495. 
Strand: Indicates DNA strand from which exons were predicted. 



Ntposifion: Indicates nudeofide posl 


itions of predicted exons. 


Pksy Ref 




■"nCpSaibn 


400666 8118496 


Plus 


17982-18115,20297-20456 


400665 8118496 


Plus 


16879-17023 


404240 5002624 


Minus 


116132-116407.116653-116922 


402992 7767907 


Minus 


42137-42515 


TABLE 48A: 







Pkey: Unique Eos probeset kfentifier number 
ExAocn: Exemplar Accession number, Genbank acoesston number 
UnigenetD: Unlgene number 
Undone TlttelHgene gene Gtle 



R1: 


90lh percentile of normal esophagus Als divided tiy the 90ih percentile of esophageal tumor Als 


Pkey 


ExAocn 


UnigenelO 


Unlgene Title 


R1 


407245 


Xg0568 


Hs.172004 


llfin 


37.43 


426752 


X69490 


Hs.172004 


titin 


30.23 


425545 


N98529 


K&.1582g5 


Homo sapiens, done MGC:12401, mRNA. com 


23.69 


407013 


U35637 




gb:Human netiuDn mRHA, parlid ods 


17.09 


400440 


X83957 


Hs.83870 


nebulin 


15i6 


408704 


M21665 


Hs.929 


myodn, heavy polypepSde 7, cardiac mus 


14.21 


428087 


AA100573 


Hs.162421 


troponin C2, fast 


13.03 


417070 


219077 


Hs.172004 


Utin 


13.02 


408707 


S73840 


H5.931 


nryosin. heavy poIypepQda 2. skeietai mu 


1Z61 


405001 


U58198 




interteukin enhancer binding factor 1 


1Z53 


418391 


NiyL003281 


Hs^3 


tn^xMiin 1, skeietai, stow 


12.46 


418205 


U1715 


Hs.83760 


tFO()onln 1, skeletal, fast 


1Z40 


422633 


XS6832 


Hs.1 18804 


enolase 3, (beta, muscle) 


1Z21 


400499 






C100O1858:gII6679124|r€qNP.032759.1| na 


11.99 


418390 


AF133a20 


Ks.84865 


tilin immunogtotjulin domain protein (myo 


10.53 


412519 


AA196241 


Hs.73980 


troponin T1, skeletal slow 


10.21 


417435 


NIIC00S181 


Hs.82129 


carbonic adiydrase til. musde specific 


10.14 


413778 


AA09Q235 


Hs.75535 


myosin, Dghl polypeptUe 2, regulatory, 


10.13 


408493 


BE206854 


Hs.46039 


phosphogiyoerate mutase 2 (musde) 


10.00 


416373 


AA195845 


H5.73580 


ESTs. WeaUy similar to S12658 cysteine- 


9.66 


41S672 


N53097 


HS.193S79 


ESTs 


9.57 


409096 


AA194412 


HS.S05S0 


saroomerte musde proldn 


9.48 



562 
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431360 NiyLQ00427 Hs^1680 loricrin 9.42 

41G982 J05401 Hs.80691 cieafitn kinase. iniUjchondrial 2 (sanom 9.20 

42G429 X73114 Ks.169849 myosin-binding protein C, slovMype 9.15 

_ 422069 AJ010083 H5J43603 tidiKap (telethonin) &96 

5 409028 AB014S13 Hs.49998 Z-band altemafivety spliced POZ-moSf a64 

437206 AW97S934 Hs.283382 ESTs^UteaMysindar to (38344 ifQn,ca 8.48 

421296 NM_002666 Hs.103253 peiilipin a47 

412129 M21334 Hs.73454 troponin T3. skeletal test &39 

^. 434352 AF1 29505 Hs.BS492 small rnusde protein. X-Cnked 028 

10 418026 BE379727 Hs.83213 fatly add tiinding protein 4, adipocyte 7.93 

408591 AFD15224 Hs.46452 mammaglobin 1 7^ 

435124 AA725362 Hs.120458 ESTs 7.76 

430681 AWg69675 Hs.291232 ESTs 7.70 

454229 AW957744 Hs.278469 lacrimal pn^ rich protein 7.68 

15 424734 Ai2l7685 Hs.96844 ESTs 7.59 

428221 U96781 H$.183075 ATPase. Ca transporting, cardiac muscle, 7.57 

431204 F28841 K$.250760 cytochrome coxMasesubunit Via poiypep 7.41 

443727 Z2S389 Hs.18459 ESTs 7.21 

408753 AI337192 Hs.47438 SH3 domain binding glutamic acid^fch pr 7.04 

20 413132 NM_006823 HS.7S209 protein kinase (cAMP-dependenlcatatya 6.08 

424485 AI68S069 Hs.272556 peptldylarginlne dalrnlnase type I 6.93 

403805 Target Exon &87 

429997 NNL006789 Hs.227457 apolipoproleinBmRNAediUng enzyme, ca &72 

418532 F00797 Hs.8S844 neurotrepldc tyrosine kinase, receptor. 8.70 

25 419711 C02621 Hs.159282 ESTs a70 

422640 M37984 Hs.1 18845 troponin C, stow &68 

433839 F3S430 Hs.146070 ESTs. We^ similar to ALU1.HUMAN ALUS 6.55 

406703 X13100 Hs.173084 myosin, heavy poiypeptkle 3. sketolalmu 6.34 

451621 A1S79148 Hs.26770 totty acto binding protein 7. brain 6.27 

30 446962 AI351421 Hs.279709 muscto spedtk: ring finger protein 1 6.20 

411102 AA401295 H3.23926 triadin 6.17 

411852 AA528140 Hs.107515 ESTs. WeaMy similar to T00329hypolheti 6.15 

454059 NM.0031S4 H8.37048 stafherto 5.95 

451957 AI796320 Hs.10299 HQmosaptenscDNAaJ13545fi8,ctonePL 5.85 

35 434360 AW015415 Hs.127780 ESTs 5.57 

420813 X51501 Hs.99949 prolactin^nduced protein 5.52 

417376 AA253314 Hs.154103 LiM protein (similar to rat protein kina 5.46 

424688 AA216287 HS.181S myosin, light polypepQde 3, dkaB;ven 5.42 

446523 N(A-003063 Hs.334629 sarcoBpin 5.41 

40 402270 Target Exon 5.25 

437846 AA773866 Hs.244569 esophagus cancer-felated gene-2 5.24 

424982 U94777 phosphorytase. glycogen; muscte(McArdle 5.17 

414657 AA424074 Hs.76780 protein phosphatase 1,rQgulatoiy{inhib 5.14 

410621 AA194329 Hs.172004 tIBn 5.10 

45 429134 AA446953 Hs.99004 ESTs 5.06 

436519 AJ278124 Hs.238756 myozenin 5.04 

447023 AA356764 his.17109 tntegrai membrane protein 2A 5.03 

427639 AW444530 Hs.105362 Homo sapiens, done MGC:18257, mRNA.com 5.02 

426451 AI908165 1^.169946 GATA^inding protein 3 (T-ceii receptor 5.ro 

50 433635 AI074S02 Hs.134292 hypothe»cal prot^ MGC12921 4.98 

429892 NM.003803 Hs.2504 myomesin 1 (sketomin) (ISSkD) 4.96 

411021 F00055 H5.172004 titin 4.95 

416349 X69089 Hs.7g227 myomesin(Mi)rotein}2(165kP) 4.93 

424697 D63216 Hs.153684 friazted-related protein 4.92 

55 406741 AA056357 Hs.74466 cardnoeihbryonfcantlgeiHetetedcellad 4.92 

428824 W23624 Hs.173059 ESTs 4.78 

416892 AK000268 Hs.87383 hypolhe6ca] protein 4.74 

448406 AW772298 Hs.21103 Homo sapiens mRNA;cDNADKFZp564B076(fr 4.73 

432306 Y18207 Hs.303090 protein phosphatase 1, regulatory (biMb 4.66 

60 424049 AB014524 Hs.138380 KIAA0624prQtoto 4.65 

439609 AWg71945 Ks.293236 ESTs 4.65 

433122 AB019391 Ks.58049 ESTs 4.62 

415447 Z97171 Hs.78454 myocllln.trabecuterineshwDi1(todudbte 4.59 

415655 W05433 ESTs 4.S9 

65 442376 W95S88 Hs.129982 Homo saplenscDNAEU12228fis, done MA 4.58 

452308 Ai167560 H5.612g7 ESTs 4.57 

418072 F35210 HS.86S07 Human DNA sequence from done RP3^C17 4.56 

429413 NhL014058 Hs.201B77 DESC1 protein 4.53 

„^ 423725 AJ403108 Jte.132127 hypothetical protein LOC57B22 4.53 

70 438704 AI435060 Hs.32825 ESTs 4.60 

413391 A1223328 Ks.75335 glydne amidlnotransferase {L-arginine:g 4.49 

430699 AW969847 Hs.292718 ESTs. Weakly dnvlar to RET^JIUMANRETtN 4.48 

419050 NM_000036 Hs.89570 adenosine monophosphate deaminase 1 (Iso 4.46 

422313 AF045941 Hs.115166 sdellln 4,43 

75 417045 FD1180 H3.332030 Homo sapfens 0IV1 4.41 

426158 NM.,001982 Hs.199067 wert>^2 avian erythroblastic teukemla v 4.39 

435101 AI743156 H3.131064 ESTs 4.37 

432408 N39127 ESTs. Wealdy similar to A46010X4inked 4.35 

439706 AW872527 Hs.59761 ESTs, Weakly simBar to OAPI^HUMAN DEATH 4,35 

80 429930 A]580809 Hs.99569 ESTs 4,30 

429624 AA45e648 Hs.99476 ESTs. Weakly similar to 131 31 84B8!pha1 4.26 

429454 ALJ039940 Hs.202949 K1AA1 102 protein 4.20 

411000 N40449 Hs.201619 ESTs.W^Ml8rtoS38383SEB4BpR) 4.11 
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429032 


nDUiW4a 




email [iMtiiriMft nrtnUno mhfnmlhf A fOv 
SiiKii inuuwHiiB cjiuMiR} Buinamij n |w]r 


3.99 






lie OflftAQ 




3.95 








ESTs 


3.84 




AW091091 




Uftn^ condone 1 mirnnitm mRMA 

nuiiK/ oopions wuuvwn nuMv\ 


3.84 






nS.1 IU4UD 


PQTa WpaVlu chnnor In 13^(199 KvmlhAH 


3.83 




AlrJOOlD 


n9.f f«40 


njuiuxypiusiagionDHi wnyuniyBiioao 


3.82 


•MUI 1 1 




ns.cJ'tf 00 


Murropawiic yHWom 


180 


422267 


r 10003 


ns. 1 i*M*io 


cvirrhnvna r nvfalnsA Rt^tnH Vila w^vofi 

VjUWUUillO b UMUOw OUWUIIH Vila JlViJpw 


3.75 




/U4/ 4109 


ns. i00*i3£ 


CO IS 


3.74 


449003 


X76342 


Hs.389 


flImhrJ rfphvrfmnpiusR 7 f class ISA mu o 


3.70 




nnl9*»O0U 




uupOiTiOQUnii H \\uusaaf 


3.68 




Alston/ 




Q.Vn Anmntn MrtMnn nhilaml^ aHfLrirh nr 
onO Ouiiidin DuwinQ ^uioniic ouwiiui |h 


3.68 




AA04D441 


Ue 911 QB7 


CCT 

cot 


3.67 


Ailvyn 




ns,ouruo 


/valeanitaeMn 1 fffie1_kiftf/«K elrolalal m 
CSjSc^UcSUui 1 (iBSHWllCn, SKclolw ITl 


3.63 


mwm 


/vv)70f 01 


Ue 'fOqftO^ 


PQTe MrwlpnilAlvfiimibirtn AID'S MUMANA 


3.62 








CO IS 


3.57 




AtAWnXXCQ 
nW00<HtO9 




uun 


3.52 




nri00U4<i 


Uc 1<vR9Qfi 

ns.iooc9v 


balliVrotn 1*1 

Kdmiucin 10 


3.49 




/v\21ofOl 


Ue 7Q9ft>; 


Mippicssion oi MAmngGiiiatj o 


3.48 


4213 U 


ADfV)7Q07 

AoW/920 




fnyOTtsgsTin 


3.41 


AA'iO'l'i 


AK^CPOC 
Ali>0909tl 


Uc Vi^MA 


CCTe 
CO IS 


3.37 


41ab4o 


1 rooDi 


U«QiRT7 
nS.9l0i 1 


uiynxu nonTwne responsivo orv/i m \io*f 


3.36 


418067 


All 27950 




cysiaun c/m 


0.0£ 


^8666 


Al Afi040n 




noino sapiens niriiv\ cuNn uiNrtp^oHniVt \» 


3.29 


AKiRM 


CCOOO<l 




P^Tt WpnIrhf Rimaar tn AAR4 HUMAN 64 KD 


3.26 




AW1Jy04» 


Ue ttHi'iA 


CQTc UUaaVlit elfmlv fn A KTOdi rtrfrdrinn 

Co 1 s, vvQaKiy suiutar \o noi £o i cyujiuiia 


3.23 




AAf;9^R^7 
AAa^434f 




C^O/^ flnrmtn_^Affifiilnlnn tmi IronenOft ran 

rATU QOiiKuiHmiiaining ion uanspun icg 


3.21 








nn^juuouo i<noiiiu oap'™'^ dP^mUG gicuuiiii 


3.21 




rUU99l 






3,17 




NiVLiUU0£*i9 


H$^22 




3.13 




ADfVtOQQq 
/\OU0a999 


Ue 97(»A 


IC1AA117^ nmtpin 


3.08 






U«fl94^9 


KlAAflflflQ nmlAin 

IWVWU09 |JIUlOUI 


2.99 






Ms4Q30 




2.97 


•lOUoUU 


Al flAIA^n 


UecffOin 
n5.oo^iu 


P^TTe MhtMw ehiiliir In ITH4 HINulAlllMTFR 
CO 1 Op ruyniy SMiniar \u 1 1 m^nuivwi in i cn 


Z97 


•K>IDI<* 




Ue iTznm 

ns. 10/ uiw 


ESTs 


2.83 


452380 




Hs.98539 


ESTs 


167 




AAQ'3flyl71 


Ue UA^I 


er^vMfW^ rtronttta n«vi4^n fOll Ln^V ^iiela 

spGcmc yitfiiuie piwinn \£,o woj, cjoio 


2,57 




AUUQ79M7 




ESTs 


2.67 




Wf JioO 


Ue 9ftQftT7 


hunn(hoHr*al wmfalw CI li9Q91 

nypouiouctu piowin rui icm i 


2.54 




WQ/ OZD 


Hs 55412 


F^Tq Wn;ikivfibirilarlDK1CJ HUMAN KERAT 


2.50 


4J*lal 


AAA^^IO 
AAVn JlsO 


Ue 97')1ftf> 

n5.£f 0100 


ht/r\AlhaRf>fl4 lunlflin f*lnfUi TakilhnnniB 

nypoincocs] pfoieuii vionB i cKuiun^iui 


2.33 


42dQdd 


ACDcnoc 

Ar I09U29 




ValliVmbi 19 


2.32 




79nQA9 




IV-niyC UuWriSUudliKcgUkllaO Qollo £ 


2.28 


41UDM 


KIM fVt797R 


Ue RHAOA 
115.00424 


lalroMAi^On fnloefnln^v^an-KtnHInn nfnioln 

iBirdnBcun ^iiosniinogQiroinuing proiciii 


2.25 


411388 


X72925 


H3.69752 


(tesrnocoiljn 1 


Z2S 


425721 


AC002115 


Hs.1 59309 


uroplaidn 1A 


112 


430520 


NM_016190 


H8.242057 


chromosome 1 open reading frame 10 


110 


429441 


AJ224172 


Hs.204096 


Rpophilin B (uteroglobin famOy membeO 


2.02 


417405 


W28657 


Hs^07 


ESTs 


101 


434560 


R13052 


K3.3964 


Homo sapiens done 24877 mRNA sequence 


1,95 


417074 


Z49878 


Hs.81131 


guanidinoacetals N^neUiyltransferasa 


1.79 


430513 


AJ01200B 


Hs^41586 


^prot^ 


1.68 


454478 


AW805749 


H3^18885 


superoxide dlsnnilsse 2, ntltDchondrld 


1.68 


416559 


A!039195 


Ks.128060 


ESTs 


1.66 


447205 


BE61701S 


Hs.11006 


ESTs, Moderately sbnBar to T17372 plasm 


1.64 


415780 


U75698 


^78846 


heal shock Z7M)|s0lein 2 


1.55 


409702 


AI752244 




eUkaiyoOc transition elongafion factor 


1.50 



TABLE 48B 

PIcejr: Unique Eos prot)esetidenG6er number 
CAT numtier. Gene chistsr nanibet 
Accession: Qanhark accession numt}ers 



Ptey 

407013 
424882 



415655 
432408 

409702 



CAT Numtier Accession 



2073.7 
2S36iL1 



15499J 
2061J8 



U35637 AA192323 AA194508 BG011583 F25712 AI^6820 BE185376 

AK057547 BG181248 AA883756 F25670 AA778128 F27657 F18914 F25171 AA178844 F21556 F25872 F20457 F27617 F38059 F34817 F26957 
F25922 F31278 F34666 F01176 F36333 F01226 F27406 F27130 F28742 F24126 F29891 AA1959S5 AA086351 W69291 F25680 F32791 F3131 1 
F32360F25216F19679F18656F29700F24954F32741 F30404 F35470F33989 F33141 F36382 F34118Fini4AA17634SF24700AA550940 
F18617 F16859 F15633 F34675 F16528 F17281 AA088388 F30859 F21852 C02644 F29425 F25286 C03553 F352S9 W80691 F16457 F24094 
F167B3 AA180319 F28443 F17763 F17448 F00542AA197179 AA193012 

AJ276240 N70563 F37502 F29200 F27903 F18577 F19683 F20857 Z28857 F30994 ni752 F17375 F15601 F17543 F1741 1 

AV724258 AA247153 BF738219 BF513744 AW058048 A1082691 AA865520 N39127 AV724549 F20776 AA249747 AW970392 AA535433 F36964 

F33894 

AK055951 AK026458 BM39120 BM021 108 F30243 BtyK)55214 BM054962 6M069667 F37401 AA563621 AI752243 AI720773 AI933014 F18964 
F35317 F35258 F27772 H39537 AW445222 F1940B H28557 F30608 F31797 F30960 BF837737 BFB37688 AL551046 B!758668 Br765038 Bi837440 
BE392882 BI438801 AI093511 AI752244 AI784111 BG490221 BF338840 BF338974 BG896472 AL576843 AW986769 F253B8 F37436 H2B558 
AI025548AA782333 F30929 F36002F21229AI720539AA719449 F21231 F18924AA626888 F30774F27704F31411 F31127F33381 F36153 
F31 793 F31 138 F31966 F33901 AA298244 BI757347 AI810201 AI692843 F29441 H51409 F21804 AW973249 Ft8440 F17572 F32499 AA327152 
AA534140 A1188088 F18B93 F23362 AA010888 Ft8143 Z28500 H27651 AI720790 F2242S H13178 H28S77 F21098 F37777 F21466 F16S98 
F23420 AL574723 R7S610 F34035 F17845 F18560 F25902 R79117 F35534 F15713 At812800 F16563 F15645 F33809 F29995 BG939623 F17385 
F17384 F18660 F17922 F15523 AI0932S3 F18359 F31452 F00232 A1S83430 6M0213S3 AA284108 H27650 H29935 BE708208 AA010737 H51451 

564 
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Z19399 AI678418 AI9S2S35 F17265 F17826 F37939 F35639 F17367 W75962 R70189 Z2875S R72106 AA335915 R75700 R79116 W72887 
AISei552 R71403 F23388 C03913 B17S6149 BI116109 BF790727 AL553994 R82966 W47487 AA456066 AW984G08 BE708220 BG490637 W47419 

TABLE 4BC 

Pkey: Unique number correspondtng to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in (his column are Genbank Identier (GQ numbers. Dunham, el a).' ceters to the pubilcaEan entitled 'The DMA sequencs of 

human ctvomosome 22" Dunham, et a). (1999) Nature 4Q2:4B&495. 
Strand: Indicates ONA strand from which exons were predicted. 
Ntj)osifion: Indicates nucleotide positions of predicted exons. 



Plwy 


Ret 


Strand 


Nljxtsttion 


405001 


6015405 


Minus 


104646-104819 


400499 


9795071 


Minus 


148495-148806 


403805 


8140491 


Minus 


51483-51742,53429-53511 


402270 


3106020 


Pius 


117656-117622 


404270 


9828129 


Mmus 


3849-3750.4161-4306^5962-604a6849«965 



Table 49A. 1562 genes uprsgulated tn lung cancer relative to nonnat body tissues 

Table 49A shows 1562 genes upregulated in hjng cancer relalive to normal body tissues that are lilwly to encode proteins amenable to modulaQon by small molecules, peptides, or 
anGbodiN. These genes were selected trom 59680 piobesets on the Eos/Afiymetibc Hu03 GenecWp array. Gene expression data for each probeset obtaned horn this analysis was 
eijr^ed as average mtensity (Al), a normaOzed v^ lefiecb'no the retative level of mRNA expression. The protein products of these genes cneti contain one or more domains 
IndicaOve of have oncogente function or ct transducing intracellular signals, or of bdng modulatable by small molecules, peptides, or anfibodies {ag. pkinase, deafMomain. 7tm. 
phosphatase, orton^transporter). Certain predtoied pfotehi domail^ 

Pkey: Unique Eos probeset Identifier number 

ExAocn: ExempJar accession number, GenBank accession number 

UniGenelD: UniGene number 

Pred-ProLDoraains: Certain predicted protein domains. Abbreviations used: TM, transmembrane domam; SS, signd sequence; »Y, veiy likely to conttfn: sM. likely to contain* other 

protein domain abbreviations are from PFAM (Nucleic Adds Research, 2002. 30:276-28(». 
UniGene inie: UniGene gene title 

90fi»P«cenliteofhingtunwfAlsdhAtedbylheS0thpercentite 

numerator and denominator. 

Pkey; ExAccn; UnlgenelD; Unigene Title; PredProLDomalns; R1 

421502; AF111856; Hs.105039; sotote canter family 34 (soctom phosphate), menter 2; Ribosomal.L20.NaJV^^ 24.06 

439335; AA742697: Hs.62492; ESTs. WeaMy similar to B39066 prolineifch protein 15- rat (Rnowegfeusl: none;SS=M: 21.70 

406621; X57809: Hs.lB1 125; ImmunoglobuGn lambda locus; lg.HSP70,Ppx.GppA;TM=M;; 19.36 

421341; AJ243212; Hs.279611; delated In mafignant brain tumore 1; zona4)elIudda,CUB.SRCR;SS=M; 16.99 

452304; AA025386; H5.61 31 1; ESTs, WeaWy similar to S10590 cysteine proteinase IH.sapiensI; none.none: 1^67 

429259; AA420450; Hs.292911 ; ESTs. Highly sfanilar to S60712 band-6-proteln (Hjapienst none,none; 16.50 

454034: NM_000891; H3.575; aldehyde dehydrogenase 3 family, member Al; aMedh;; 16.24 

408000; L11690: Hs.620: bullous pemphigoid antigen 1 (230/240kD); efhand.spectrin.GAS2.SH3.Plectin.RA,Xylose_teom.FIiD,bZIP.Tropofn^^ 
14.76 

421798; N74880; Hs.29877: N-acylspWngoslno amldohydrolase (acid ceramidaseHfte; SAPA.SurfactanLB,none; 14.18 

439706; AW872527; Hs.59761; ESTs. Weakiy slmiter to DAPIJIUMAN DEATIiASSOOATED PROTHN 1 [asapfensl; none.none: 13.94 

431846: BE019924:H5.271580;uroplakin IB; transmembrane4;TM=Y:SS=M; 13.54 

417079; U65590: Hs,81134; interfeukin 1 receptor antagonist IL1;SS=M; 12.97 

448133; AA723157; Hs.73769; folate receptor 1 (aduH); FolatejBcMIP;TM=M;SS=Mt 1Z50 

414809; AI43469* Hs,77356; transfenin receptor (p90. C071); PA;TM=Y;; 1Z12 

436553; AW407157; H3.181125; Immunoglobulin lambda tocus; lg,HSP70,Ppx-GppA;TW=M;; 12.00 

418738; AW388633; Hs.6682; solute cam'er family 7, (catlonic amino add transpoiter, y system) member 1 1; none,none; 1 1 99 

419693; AA133749; Hs.301350; FXYO domai^containlng Ion transport regulator 3; AP1G1 PLM_MATB;TM=Y;SS=M; 11.88 

417866; AW067903: Hs.82772: collagen, type XI, alpha 1; ColI^,COUI,TSPN,laminin G.CorA;SS=M: 1 1.38 

414998; NM-002543; Hs,77729; oxidised tow density lipoprotein (tecOn^ike) receptor 1; lectin c;TM=Y;SS=M; 1 1.21 

428970; BE276891: Hs.194691; retlnofe acid Induced 3; 7tm 3;TM=Y;SS=M; 11.08 

418004; U37519: Hs.87539; aldehyde dehydrogenase 3 famfty. member B2; aldedh;TM=M;SS=M: 1 1.01 

425397; J04088; Hs.156346; topolsomerase (DNA) II alpha (ITOkO); [3NAjgyr85eB.DNA^topoisoiV,HATPasejB;SS=M: ia69 

418478; U38945: Hs. 1 174; cydlrHJependent kinase inhibitor 2A (melanoma, pi 6, Inhiblls C0K4): ank;; 10.65 

439223; AW238299; Hs.250618; Ul.16 binding protein Z kHjecepUPKD.MHCJ;TM=M;SS=Y; 10.52 

®^ product^pedfic locepton homeobax,AeyttransftoQ.notohyEGF.ank,Acyltransfer8S^ 10.47 
451558:NNL001089;Hs.26630;ATP-bindIngcas8ette,8ub.familyA(ABC1).member3;ABC lran,SRP54;Tltf=Y;SSaryi; ia33 
443426; AF098158; Hs.9329; chromosome 20 open reading frame 1; none;TM=fy!;; 10.21 
452747; BE1538S5; Hs.61460; Ig superfamily receptor UJIR; lg.Rhabd_slycop;TM=Y;SS=H 10.14 
417389; BEa0964; Hs.82045; mldklne (neurite growtb-promofing factor 2); PTNJ«<;TM=M;SS«Y; 10.13 
433091; Y12642; Hs.3185; lymphocyte anfigen 6 comptex, tocus 0; UPARJ.Y6JoxtoA**ijBcp;TM=M:8S»Y; 10.12 
454098; W27953: Hs.292911; ESTs, Highly similar to S60712 band^protebi [Ksaplens]; none.none; 10.05 
414812; X72765; Hs.77367; monokine Induced by gamma htefferon; IL8;TM=M;SS=Y; 9.98 
430832; AI073913: Hs.100686; ESTs. WeaWy similar to JE0350 Antertor gradienl-2 [H^saptenst none,none; 9.79 
422310; AA316622; Hs.9837a cytochrome P450, subfamily HS, polypeptUe 1; none,pkinase.fh3,lg; 9.60 
414987; AA524394; Hs.294022: hypotheteai protein FU14950: SH2;TM=M;: 9J4 
439453; BE264974; Hs.6566: thyroid homwne receptor toteractor 13; AAA>«CLtrBn,CoaE:TM«Mn 9.52 
430280; AA381258; Hs.237868; Interteukln 7 recepton fh3,none; 9.48 
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423217: NM.Q00094; Hs.1640: coBagen. type VU. dpha 1 (epidemiolysis iMdlosa. dyslnphic dontinant and reoes^); KunltLBPTI,fri3,vwa.OollaQ8n.beta.ta^ 
9.44 

418882; NM_004996: Hs.89433; ATP^ncUng cassette, sub-family C (CFTWMRP). memlwr 1; ABCjn8mbrBne^C_tran:TM=Y;SS=M; 9.32 

435472; AW972330; Hs.283022: triggering receptor expressed on myeWd cells 1; lg;TK*=M;SS=M; 9.26 

447343; AA2S6641; Hs.23S894: ESTs. Htghiy sinto Id S02392 8iplia-2'fnacroS!obu!]n receptor precursor [H.sapiens): none,none; 9.18 

419508; AW997938; Hs.90786: ATP-binding cassette. suWairtly C (CFTR/MRP), memljer 3; ABC.tranABCjnenilHBne:TM=Y;SSsM: a06 

441384; AA447849; HS.28B660; Homo sapiens cONA: FU22182 lis, done HRC00953; 7tm.3.none; 8.98 

446292: AF081497; H3.279682: Rh type C glyooprotein; AmnionhJnutransp.FecCO;TM=Y;SS=M; 8.74 

436972; AA284679; Hs.25640; daudin 3; PMP2?_Claudin;TM»Y:SS=M; a71 

421817; AF146074; Hs.l086e0; ATP-l)lndlng cassette. sul>.femny C (CFTR/MRP), memt)er 5; F8Scfcnn.ABC_tran.ABCjTieinhfanB,GrPJEFTU;TM=M;SS=M; 8.71 

423354; ABOl 1 130; Hs.l 27436; calcium channel. vollage^Jependent alptia 2/aella sulnmt 2; wo,Cache;TMsMn 8.66 

439606; W79123; Hs.58561; G proleln-coupted receptor 87; 7tnLl;TM=Y;SS=M; a63 

438091; AW373062; ; nuclear receptor subiamily 1, group I. member 3; hormonejec.zf-C4.none; 8.60 

421506; BE302796; Hs.105097: thymidine kinase 1. soluble; TK;TM=M;; 8.57 

413278; B£563085: H5.833; interferon-stimulated protein, 15 kDa; ubiquitin;SS^ 8.56 

408908; BE296227; Hs.250822; setlne/lhreonine kinase 15; pkinase;&&:M; 8.52 

414774; X02419; Hs.77274; plasminogen activator, urokinase; krinole,trypsin.plaiiLlhk}nins;SS-M; 8.49 

430630; AW269920t Hs.2621; cystaOn A (stelin A); cyslatin;TM=M;: a42 

413011; AW06811S; Hs.821: tifglycan; LRR.LRRNT;SS»M: 8.40 

446291; BE397753; HS, 14523; interferon, gamma-indudbte protein 30; GILT;TM=M;SS=Y; 8.39 

411089; AA455454: ; cen division cycle 2* 1 (PITSLRE proteins); none.none; 8.37 

422765; AW409701: H5.1576; baculoviral lAP repeakontaining 5 (sun^); BIR;TM=M;; &34 

453922; AFOS3306; Hs.36708: budding uninNtjited by benzimidazoles 1 (yeast homolog), bete; none;SS»M: a2S 

449019; AI949095; Hs.67776; ESTs, Wbakly similar to T22341 hypothetteal protein F476Bi - Caenortiabdills elegans [Ceiegans]; none,none; 8.24 
409799; D11928; Hs .76845; pfiosphosetfne phosphalase^ike; HydrDlase;TM=M;; a22 
416819; U77735; H5.80205: plm-2 oncogene; pklnase;SS=M; 8.19 
451541; BE279383; Hs.26557; pl^phUin 3; Armadilk).seg;TM=rM;; 8.16 

409142; AL136877; Hs.50758; SMC4 (stnictural maintenance of chnxnosomes 4, yeasf)-like 1; ABC.tr8n.M,SMCJ4.SMC.CkDUF164,none: 8.16 
429002; AW248439; Hs.2340; juncfion plakogk)fafai; Armadillo^eg;TM=ti^, 8.14 
445033; AV652402; Hs72901: mucin 13, epiiherial transmembrane; ank;; ai4 
421757; Z20897: Hs.296259; paraoxonase 3; Aiylestefase;SS=Y; 8.10 

414821; M63835; Hs,77424; Fc fragment of IgG, high affinity la. receptor tor (CD64); lg;TM=Y;SS=M; 8.03 
439285; AL133916; ; hypothetical protein FU20093; lg.pklnase.UlR,tJ^RNT.UWCT.nonB; 7.97 

439738; BE246502; Hs.9598; sema domain, Immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic domain, (semaphorin) 48; 8efflaiPSI,lntegrin_B;TM=Y;: 
7.86 

424905; NM-002497; Hs.153704; NIMA (never in mitosis gene a)-related kinase 2; pkinase;TM4^; 7.85 
424779; AU)46851; Hs.153053; CD37 antigen; tiansmembran64;TM«Y:SS»M; 7JS 

409340; BE174629; Hi.321 130; hypothetical proteh MGC2771; aflj)erine8ses.pyildoxri jBC,biomodomaln,PHO.MBD.ATJook,DDT.ro 
7.84 

415323; BE269352: Hs.949: neutrophil cytosoTic factor 2 (65kD. chronic granutomatous disease, autosomal Zf, SH3,TPR:TW»M;; 7.73 

427337; Z46223; Hs.176663; Fc fragment of IgG. taw affinity lllb. receptor for (CD16); lg:TM=Y;SS=M; 7.72 

430378; Z29572; Hs.2556; tumor necrosis factor receptor superfamBy, member 17; ll^,SS=M; 7.71 

451253; H48299; Hs.26125; daudin 10; PMPZLaaudin.Peptidase_M1.K^tetra;TM='Y;SS=M; 7.70 

435575; AF2134S7: Hs.44234; triggering receptor expressed on myetoM ceils 2, ig;TM=Y;8S=M: 7.70 

427747; AW411425; Hs. 180655; serine/threonine kinase 12; pkinase;TM=M;; 7.70 

426251; M24283; Hs.168383: Intereetlular adhesion molecule 1 (C054), human rtiinovhus receptor; ig.lCAMJl;TM=M;SS*:M; 7.67 
422282; AF019225; Hs.114309; apofipoproteln U MolAuExbB;TM=Y:SS=M: 7.64 

413859; AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase kinase 4 mRNA, 3" untranslated region, partial sequence; SAMJ*NT/ione: 7.54 
424008; R02740: Hs.137555; putative chemokine receptor. GTP-Unding protein; 7tm J:TM»Y;88=M: 7.52 
418322;AA284166;Hs.84113;cyclin4ependentldnaselnhlbitQr3(COK2-«sodd^ 

421071; AI31 1238; Hs.104476; ESTa. Wtokly similar to CGHU1E cdBagen alpha 1 (Xl) chain precursor (H.S8pIen$]; none|TKA»Y;SS^ 7.40 

421481; AW391972: Hs.104695; KIAA1324 protein; none;TM=M;SS=M; 7.39 

438089; W05391; ; nuclear receptor subfamBy 1, group I, member 3; honnone_rec.if-C4.none; 7.38 

428484; AF104032; Hs.l 84601; solute carrier famBy 7 (cattonic amino acid transporter, y system), member 5; 

aaj)Bnneases.pyrtdoxal_deC,bromodomaIn,PHD,iyiBD.ATJhodi.ODT,R3J»l4.Wnas^^^^ 
448988; Y09763; Hs.22785; gamm&«minobutyric add (GABA) A receptor, epsiton; Neur j*anJBD,Neurjchanjnemb;TM=Y;SS=M; 7.38 
416178; AI808527; Hs.192822; serolo^ly defined breast cancer antigen NY-BRr81; none;TW=M;; 7.31 
418506; AA084248; Hs.85339: G proteliH»upled receptor 39; none.none; 7.25 
441553; AA281219; H3.121296: ESTs; none.F6<3AP,integrinJ^ 7,25 
422311; AF073515; Hs.l 14948; cytokine receptor-Iike factor 1; fn3;TM=M;; 7.21 

415817; U88987; Hs.78857; protein tyrosine phosphatase, receptor-type, Z pdypeptWe 1; fn3.Yj)hosphalase.carb_anhydrase;TM=»Y;SS=M; JJiO 

438746; AI885815; Hs.184727; Human metaioma-assodaled antigen p97 (melanotiansferrin} mRNA, 3r dank; tr8nsferrin,GuanylaleJdn,PDZ,8H3; 7.20 

412723; AA648459; Hs.335951; hypothetical protein AF301222; none;TI«=M.'; 7.14 

418203; )(54942; Hs.83758; CDC28 protein kinase 2; CKS;; 7.14 

428582; BE336699; Hs.185055; BENE protein; none;TM=Y;SS=M; 7.12 

418462; BE001596; Hs.85256; inlegrin. beta 4; fn3.lntegrtn_B,C8ax.beta,EGF;TM=M;SS=M; 7.08 

420344; BE463721; Hs.97101: putative G proteiiMXWpled receptor; Methyltransf.5;TM=Y;SS=M; 7.02 

428450; NM.014791; Hs.184339; KIAA0175 gene product; KA1,pkinase;TM=M;; 7.00 

4492305 BE61 3348; Hs.21 1579; melanoma ceQ adheston mdecule; ig4sodh.Ribosom8Lt6.F-^^ 

439237; AW408158; Hs.318893; ESTs. Weakly similar to A47582 B^ growth fador precursor IKsaptensJ; Fuiln^pMnase,Recep.Uomaln,YlP.none; 6.97 

421508; NM.004833; Hs,105l 15; absent in melanoma 2; PAAD_DAPIN,HIN;TM=M;; 6.96 

410342; R31350i Hs.743: Fc fragment of IgE, high affinity I, receptor for. gamma pdypeplide; ITAM;TMsY;SS=M; 6.93 

428479: Y00272; Hs.334562; cell dlviston cyde 2. G1 to S and G2 to M; pkinase,ICEj)1 0,lCEji20;TM=M;SS=M; a93 

421532; AW138207; H8.146170; hypothelfcal protein FU22969; Annadaio_seg.HEAT;TM=M,-; 6.91 

451035; AIXJ76785: H3.430; plastin 1 G boform); efhand.CH^dapfin^USS^ 6.86 

432407; AA221036; ; gbnoOSfl 2.rl Stratagene NT2 neuronal precursor 937230 Homo sapiens cONA done 5* slmaar to SW:POLBAEVM P10272 POL POIYPROTEIN ;. mRNA 

sequence; DEAO,hd]case.C.rTm.Ndr.Cy8jmotTtUvwa,vwc.vwdJQ.RIl8,abhyd^ 

beta.DUF139,TPR,DSPc,tspJ.Ribosomal_S21.rvp;TM=M;; 6.84 
442599; AF078037; Hs.324051 ; ReJA-assodated inhibltof; SH3»ank;TW=Mr. a77 
448243; AW369771; Hs.52520; Integrin. beta 8; lntegrtn_P.none; 6,76 

427557; NM-002659; Hs.l79657; plasminogen adivalor, urokinase recepton UPAR_LY6.ET,PIA^;SS=M; 6.75 

418054; NiyL002318; Hs.83354; (ysyl cmdasa^ 2: SRCR.LysyLoxid8Sa;TlyNM;$&*M; 6.74 

428440; BE3827S6: Hs.169902: sotute canter Miy 2 (fadlHated glucose transporter), meniber 1; sugarJtr:TM»Y;SSsM; 6.73 
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430397; A1924533; Hs.105607: bicabonsle transpoiler related protein 1; HC03_cotran5(i;TM»Y:: &71 
449523; NVLOOOSTS: Hs.54443: dienuAire (OC mota) receptor 5: 7tm.1;TM=Y;SS=M: 6.71 

431630; NM_002204: H3.265829: littegrin. alpha 3 (antigen C049C alpha 3 subunil of VLA-3 receptor); F&GAP.RhabdUoVcop,lnteg(M:TMaY:SS«M; 6.70 

410434; AF051152; Hs,63658; iMB receptor 2: LRR,LRRCT.TIR;TM=M;SS=M; 6.69 

424925; NM 002432; Hs.153837; myetod cell nudear differenfiaOon antfgen; PAAD.OAPIIiHJNs 6.69 

431890; X17033; Hs.271986; integnn, alpha 2 (C0496, alpha 2 sut)unlt of VIA2 receptor); vwa^ntegtinJ^FG-GA^ &65 

428157; AI738719; Hs.198427; hexokinase 2; hexrfdnase,hexo«nase2jujne; 6.64 

430770; AA765694; Hs.l23296; ESTs; none.none; 6.63 

412270; AC00S262; Hs J3797; guanine nucteolide prot^ (G protein), alpha 15 (Gq class); G^pha.arf;TMsM;; 6.59 

439750; AU59053; Hs.57664; Homo sapiens mRNA full length insert cDNA clone EUROIMA(S 2005735; IMPDK.(;iMPOH-N.CBS.lnlegrin_B,Rldn_B_lectin; 6.S9 

427700; AA262294; H3.180383; dud specificity phosphatase 6; Rhodanese.DSPc;TM=M;; 6.59 

413048; M93221; KS.7S182; mannose receptor. C type 1; fn2JeclhLC,RidiLB.IectiaX[ink:TM=Y;SS»M; 6.58 

42S345; R11141; Ks.199695: rtypotheOcal protein; ICtetra.SAI4 6.58 

4161 10; Z42262: Hs.322844: hypothetlca) protein DKFZp564Al76; Sema.PSI,TIG.Integrin.B;TM»Y;SS=^ 6.56 
416883; BE387036; Hs.1211; acid {Aosphalase 5. tartrate resistant; Metal]opho3;TM=M;SS=M; 6.57 

426746; J03626; Hs.2057; uridine monophosphate synthetase (orotate phosphoritjosyl transferase and oroti(fine>5'-decart»xylase): ftiliosyltran,OMPd8C8se:TMsM;; 6^ 
402260; ; ; NM_001436*:Homo sapiens fibrillarin (FBI), mRNA. transcripQ (F8A). mRNA.; p)dnase.FibriQar1n.none: &56 

456373: 6E247706; Hs.89751; memtxan^spannlng 4KlQmabis, sublam&y A, member 2 (Fc fragment d (gE. high afllnily I, receptor for. beta polypeptide): none:TM»Y:: 6.S3 
444006; BE395085: H5.10088; type I transmembrane protein Fn14; tdljecepLaPKD.MHCJ;TMsM;SS»Y; 6.63 

41 1027; AF072099; Hs.67846; leiitocyte Immunoglobutin-like receptor. subEamHy B (with TM and ITIM domdns), member 4; iiu)sltoLP,lg;TM-M;: 6.52 
435523; T62849; K5.11090; membran&^ panning 4-domains, subfamily A, member 7; non8;TM=Y;SS3M: 6.52 
432920; U37689; H$.3128; polymerase (RNA) 11 (DMA directed) polypepfide H; nons;T(ii=M;; &48 
412773; H15785; Hs.74573; similar to vaccinia vims Hindlll K4L ORF; PLOc;TM=M;; 6.48 

. 409208; Y00093: Hs.51077; Integrin. alpha X (antigen C01 1C (p150). alpha polypeptide); vwa.F6.GAPJntegrin_A.vwa.lntegrin^FGW: 6.43 
424441; X14850; Hs.147097; H2A histone family, member X; hlstone,CBFD.NFYB^HMF;; 6.43 
416918; X07871; Ks.B9476; CD2 ant^en (p50), sheep red blood cetl receptor; lg;TM=Y;SS=M: 6.41 

413219; AA878200: Hs.1 18727; Homo sapiens cDNA FU13692 6s, clone PLACE2000103: Hliideath,TNFR.c6A7J-Co\.hydro; a41 

429170: NM.001394; Hs.2359; dud specificity phosphatase 4; Rhodanese.DSPb.Yjiho8phalas8,Riboso(naLS3J^TM4il:: 6,39 

453914; NM^000507; Hs.574: ffUctose*1.6^isphosphatase 1; FBPase;TM4A;; 6^ 

424273; W40460; Hs.144442; phospholipase A2. group X; phosiip;TW=M;SS=Y; 6,37 

428385; AF1 12213; Hs.1B4062; putative RabS^nteractlng protein; SH2,SH3;SSsM: 6.36 

432636; AA340854: H5.278562; claudin 7; PMP2^_Claudin;TM=Y;SS=M: 6.34 

409430; R21945; Hs.346735; splicing factor, arginine/^ne^ 5; DSPc,Rhodanese,nor^; 6.34 

451734; NVL006176; Hs.26944; neurogranin (protein kinase C substrate, RC3); IQ.7tnu1;TM=M;: 6.34 

443907; AU076484; Hs.9963; TYRO protein tyrosine kinase binding protein; none;TM=M;SS=Y: 6.34 

401027; ; ; Target Exon; none.none: 6,26 

41 8299; AA279530; Hs.e3968; integrin, beta 2 (antigen C018 (p95). lymphocyte funclion^ociated antigen 1; macrophage antigen 1 (mao-1) beta subunit); 

lnte9rtn_B.EGF,PSI;TM=Y;SS=M; 6.22 
429732; U201S8; Hs.2488; lymphocyte cytosdic protein 2 (SH2 domain^ntaining leukocyte protein of 76kD): SH2;SS=M; 6.21 
4081 13; T82427; Hs.194101; Homo sapiens cDNA: FU20869 lis, ctene ADKA02377; 7tnL3.none; 6.20 

406771; AW732573; Hs.47584; potassium vollage^ated channel, delayed-iectiiier, subfamily S. member 3; efhand,k)nJransXJetra,non8: 6.19 
456534; X91195; Ks.100623; phosphoSpase C, beta 3. neighbor pseudogene; UM.PDZ.pklnase:SS=M: 6.18 
408482; NW-000676: Hs.45743; adenosine A2b receptor, 7biLl:TI^Y;SS»M! 6.17 
426427: M86699; Hs.169840: TTK protein kinase; pkinase;: 6.17 
445019; A1205540C H&281295: ESTs; none.none; 6.16 

438552; AJ245620; Hs.6314: type I transmembrane receptor (selzure^elated protein); non8,none; 6.16 

414907; X90725; Hs.77597; polo (Orosophla)^ kinase; RibosomaLU7aB.pkinase,Pa0^box,1RNA-synL1b,dynamln,dynan*uZGED^ 6,14 

425322; U63630; Hs.155637; protein kinase, DNA^ated. calaJyCc polypeptide; PI3 J>MJdnas8,FAT,FATC;TM=M;: 6.13 

417421; AL138201; Hs.82120; nuclear receptor subfamBy 4. group A. member 2; hormone^,zf-C4:SS=M: 6.13 

425776; U251 28; Hs.159499: parathyroid homran? receptor Z 7lmJ2,HRM:Th^Y;SS4«: &12 

422278; AF072873: H5.1 14218; frizzled (DrosophHa) homotog 6; Fz.Frfzzled.7lm»2:TMaY;8S»M; 6.12 

427490; Z95152; H3.178895; mitogen-adivated protein kinase 13; pklnase;TM=t^.; 6.12 

421445; AA913059; Hs.104433; Homo sapiens, done IMAGE:4054868. mRNA; ton_tran3,ietetia.asp; 6.1 1 

444143; AW747996; Hs.160999: ESTs. Moderately simitar bA561 94 thromboxane Ap2 receptor, endothelial (H.8aplens]: 6d-2.none; 6.10 

423887; AL080207; Hs.134585: DKFZP434G232 protein; ABC_tran;TM=Y;; 6.10 

409636; AA305729: Hs.18272; amino acU transporter system A1; AeLtrans;TM=Y;; 6.09 

41 1020; NM,006770; Hs.67726; macrophage receptor with collagenous structure; SRCR,CoIlagen;TM=Y;8S='M; 6.09 

425354; U62027; H5.155935; complement component 3a receptor 1; 7tm J;TM=Y;SS=M; 6.08 

439963; AW247529; Hs.6793; platelet-acGvaOng factor acetythydrolase, isofbmi lb, gamma subuntt (29kO); PAF-AH.Ib,Upa5e.G0Sl;TM»Mn 6.07 
421753: BE314828; Hs.10791 1 ; ATP^indlng cassette, sut>.f3mily B (MDR/TAP). member 6; ABC.!r8n^jnembfane:TI^Y;SS=«* 6.07 
406908; Z25437; ; gb:H.sa(^ns protebvtyrosine kinase gene, complete CDS.; none,nons; &07 
42S849; AJ000512; Hs.296323; semm/^hicocorfiooid regulated kinase; pkinasejMnaseLC;TM4/l:SS"M; 6.06 

452363; A1582743: Hs.94953: Homo sapiens. SImiter to comptement component 1. q subcomponent, c potypeptUe. done MGC:17279, mRNA. completB cds; C1q,OoOagen:SS4t. 
6.05 

414883; AA926960; ; COC28 protein kinase 1; a<S;; 6.05 

414166; AW888941; Hs.75789; N^nyc downstream regutated; OEAD.helicase^CmNdf,CysJattJt,TIUvvifa.vwc,VYi(d.lQ,RIIatabhydrolase,T^^ 

beta,DUF139,TPR,DSPctsp_1.rebosomaLS21,rvp;TM=M;; 6.03 
452888; AW955454; Hs.30942; ephrin-B2; Ephrin.fn2;TM=Y:SS=M; 6.03 
448782; AU)50295; H5.22039; KIAAD758 protein; 7tm_2.lg.GPS,SEA;TM=Yn 6.03 
449101; AA205847; Hs.23016; G protein-coupted receptor, 71nul;TM=Y;SS=M; 6.01 
445462; AA378776: Hs.268649; hypothetical protein MGC3077; none;; 6.00 
424381: AA285249: Hs.146329: protein Idnasa OtaO^ pMnase,FHA.DnaJ:TM=Mn 6.00 
420162; BE378432; HS.95S77: cydiiHtependent kinase 4; pkinase:TM:'M;; &99 
439310; Ai=086120; Hs.102793; ESTs; casein Jtappapkinase.ig,none; 5.97 
414972; BE263782; Hs.77695: KIAA0008 gene product; GKAP:TM=M;; 5.97 
425976: C75094: Hs.334514; NG22 protein: voUage_aC:TM»Y:SS»M: 5.94 

444946; AW139205; Hs.156457; hypothetical protein FU22408: 8bhydrolase,^ihydrol8se J:TW=>Y;SS4« 5.93 
41 1263; BE297802: Hs.69360: klrnsin^ 6 (mitotk: oerHromere^ssodated Unesin); MnesIn:TM=M:; 5.93 
421462; AF016495; Hs.104624; aquaporin 9; M1P:T^*=Y;SS=M; 552 

426761 ; A1015709; Hs.172089; Homo sajrfens mRNA; cDNA DKFZp586l2022 (from done DKFZp586l2022); non8;TM=Y;SS=M; 5.92 

407792; AI077715; Hs.39384: putative secreted Ggand homdogous to Qxl; none:TM»M:SS»Y; 5.91 

428ni; AB02699% Hs.193143; KIAA1089 protein: C2,PM»ljC.Y4WC-X:TM=fc*; 5.91 

438564; AA381553; Hs.198253; histocompatibility comptox. dass U. CX) alpha 1; tg,MHCJLalpha.none; 5.91 
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440006: AK000517; Hs.SS44; hypothetical protein FU20510: AAA.N&ARaPAAO.DAPtN;NA;NA: 5.90 

449027; AJ271216; Hs.22880; dipepai^pGdase III; Pep{jd3So_M49.EGF,ig.Neuregui]n:TMsM;: 5,90 

408790: AW580227; Ks.47660: neurobophic tyrosine Mnasa, raceptor, type Z l9J)Utias8.UVllJRRNT.UU^;TA^Y;S^ 5.89 

4131B6; AU077141; Hs.75231; soUite carter family 16 (monocsitexylb add transporters), menlber 1; 8ugar_lrJM=Y:SS=^ 5.B9 

4306SS: AA531276; Hs.59509; ESTs; pkinase.PP2CvnQn8; 5w88 

422609: Z46023; Hs.118721: slafidase 1 (lysosomal riaWase); BNRSH2,SH34»kirwse:TM=Y:SS«M; 5.88 

425367; BE271188; H5.155975; protein tyrosine phos;}hatase, (eceptor type. Oassoctated protein; non8:TM>M;SS»Y; 5.88 

429619; AL120751; Hs.211568: eulcaryotic transiatton inilialion bctor 46amma, 1;none.noRe; 5.86 

437429: H799d1; Ks.5613; Homo sapiens mRNA;cONA DKFZpS64E2222 (trom done DKFZp564E222?); SH2.SH33TB; 5.86 

436576: AI45621 3; Hs.77542; ESTs; 7litu1.DnaJ; 5.85 

429663; M68874; Hs.21 1587; phospholipase A2. group IVA (cytosolic caldunHlependenQ; C2«PLA2.B;TM»M;; 5.85 
419981; AA897581; Hs.12B773: ESTs; pldnase,DA(3„PE-l)ind.pWnase.C»OPR.none; 5.83 

426953; AA306610; Hs.3481 83; tumor necrosis tactor receptor superMly, member 6b, decoy; 60ej{bosomd,RibQsornal.L10.TNFK_c6,DEAD;: &83 
414606; D14594; Hs.77329: pbosphalidylserine synthase 1; PSS;TM=Y;SS=M: 5.82 
451320; AW118a72; ; diacylgiycefol kinase, zeta(104kD): Rone:TM=Mr. 5.82 
400991; ; ; Target Exon; Armadillo segjec&ucinone; 5.81 

456906; AF1 17646; Hs.156637: ck^4A (murine) ectro|& retroviral transforming sequence c; zK:3H01.Cbi_N.Ct)i_N2.Cbl J43;TI\*=M;; 5.81 
434263; N34B95; Hs.44648; ESTs; igjnone; 5.81 

428233; BE250944; Hs.183556; solute carrier 1 (neutral amino add transporter), member 5; elF6.SDFJMsK^ 5.78 
421959; AW751497; Hs.98370; cytochrome P450. subfamily IIS, polypeptide 1; p450;TM=Y;SS=M: 5.78 
449539; W80363; Hs.58446; ESTs; pWnase,Furin-IIke.Recep.Ldomain,nonB: 5.77 
409012; AL117435; Hs.49725; 0KFZP434I216 protein; PH.RhoGEF:TM=M:SS=M; 5.77 

41227G: BE262621; Hs.73798; macraphage mIoFalion inhibltoiy factor (glyoosylalMnHfaiUng factor): MtF,sugar.tr,none; Si75 
409533; AW969543; Ks.21 291; mKogen-adlvated protein Mnase Unase kinase 13; PeptUase.C48.none; 5.73 
457001; J03258; Hs.2062; vitamin D (1.25- diliydroxyvitamin D3) receptor, iMHmonejec,2f-C4,Mey toU^to„5;TM=M;; 5.73 
416084; U6991; Hs.79006: deoxythymidyiate kinase (Uiymidytate kinase): none.none: 5.72 

448569; BE382657: Hs.21486: signal transducer and adivator of transcription 1. 91kO; SH2.STAT.STAT_bind,STATj)n)t;TM=i^;; 5.72 
452295; BE379936; Hs.28666: programmed cell death 10; serpin^nons; 5.72 

448775; AB02S237; Hs.3B8; nudbc (nudeoslde diphosphate Bnked moiety X)4ype motif 1; NUDI)(;TM=^SS=M; 5.72 

448733; NM-005629; Hs.1 87958; solute carrier fanfly 6 (neurotransmitter transporter, creatine), menAer 8; SNF;TM=Y:; 5.71 

417015; M83772; Hs.80876; flavin containing monooxygenase 3; FMCWikB»pyr.redox:TM=Y;SS=M; 5.69 

453323: AF034102; Hs.32951; solute carrier family 29 (nudeosUe transporters), member 2; Nucieosk)e_tran;Th(NY;SS=M; 5.69 

410290; AA402307; Hs.322844: hypothefical protein OKFZp554A176; Sema.PSI.TiG.lntegrin.B:TM=Y:SS=M; 5.69 

412182; AA2055B8; Hs.155160; Spiidng factor, argininefeerine^ 46kD; rrm,hormonejec,zf-(M,sugar_tr,; 5.69 

418526; BE019020; Hs.85638: solute cairter family 16 (monocarboxyTic add transporters), member 3; none:Tli^Y;SS^ 5.66 

447250; Ai876909; H5.17883; protein phosphatase 16 (tormeily 2C), magneshmHlependent, gamma Isolbmi; PP2C;TM^A;; 5.65 

4381 13; AM67908; Hs,8682: ESTs; 7tnu1,none; 5.65 

421391: AW304350: Hs.1919S6; immunogtebutin superfamOy receptor Iranslocaibn associated 2; lg.non8; 5.64 
417115; AW952792; Hs.334612; srr^all nuclear ribonucteoproteln polypepfide E; Sm.pklnase;; 5.64 

406137; ; : Nlit000179*:Homo sapiens mutS (E. coB) homotog 6 (MSH6). mRNA. VERSION NM»00017ai Gl; MulS.C,PWWP.MtitS.K:TM;<M;: 5.63 

421917; AB028943; Hs.109445; KIAA1020 protein; BTB.zM)2H2.PI3_Pt4Janase,PI3Ka;TM='M;; 5.62 

445873; AA250970: Hs.251946; poly(A).binding protein, cytoplasmic l-Bke; PABP,mn,pldnase,14^; 5.62 

447365; 6E^76; Ks.334; Rho guanine nucteotide exchange factor (GEF) 5; SK3,PH,RhoGEF;TM=Mn 5.61 

446872; X97058: H5.16362; pyilmldlnergk: receptor P2Y, G-pmtetn coupled, 6; 7tnLl:TM=Y;SS=M; 5.59 

433662: W07162; H8.150826: (VVTX-6 proteirK ras/BCJranMTIiNyi;SS=M: 5.59 

449029; N28989: Ks.22891 : solute canter temOy 7 (cSHonic amino add transporter, y system), member 8; sa j)em«ases;TM°Y:88=M; 5.58 
431236; AV656840; Hs.285115; intsileukin 13 receptor, elpha 1; fn3:TM=Y;SS=M; 5.57 
430508; AI015435: Hs.104637: ESTs; SDF;TM=Y;SS=M; 5.66 

426227; U67058; Ks.154299; Human proteinase adhrated receptor-2 mRNA. 7UTR: 7tm_1:Tl4»Y:SS»M; 5.55 
421677; H84092: H$.38282; ESTs; Alpp^^iUojBegJBB: 5M 
429083: Y093g7; Ks.227817; BCL^-reteted protein A1; Bd-2:TM=M;: 5.54 

429563: BE619413: Hs.2437; eukaryoQc transtaikm inifialion factor 2B. subunit 5 (epsilon. 82kD); hexapep,W2.hoimone2,DUF2g(TMNM;: 5.52 
412817; AL037159: Hs.74619: proteasome (prosome, macropain) 263 subunit, non-ATPase. 2; PC.rBp;TM=M:; 6.51 
452291; AF015592; Hs.28853; CDC7 (ceil division cycle 7. S. cerevisiae, homOk)g)^ike 1: pkInase;TM»M:; 5.51 

437412; BE089288: Hs.34744; Homo sapiens mRNA; cDNA OKFZp547C136 (from done DKFZpS47C136); ABC.tran.GTPjEFTU,ABCjnembrane,none; 5.50 

423778; Y0g267; Hs.132821; Ravin cont^ning monooxygenase 2; FMO^ke.pyrjedox;TMsY;SSsM; 5.48 

422846; BE513934; Hs.1583: neutrophil cytosolic lador 1 (47kD, dironic granulomatous disease, autosomal 1); SH3J>X:TM=M;: 5.48 

434699; AA643687: H3.149425; Homo sapiens cONA FU1 1980 fis. done HEMBB10013O4; NudeosideJra2,none: 5.48 

426691; NM.006201; H8.171834: PCTAIRE preteln kinase 1; pkin8se;TM»M;; 5.48 

453905: NM.002314; Ks.35556: UM domain kinase 1; pkinase,UM,PDZ.zM>ARP.TM=M;: 5.48 

412939: AW411491; Hs.75069: eukaryotic translation etongation factor 1 gamma; nona,none; 5.44 

430466; BE062109; Hs.241551; chloride channd. catolum adhrated, family member 2; none;TM»Y;SS»M; 5.43 

430066; Ai92g859; Hs.237B25: signal lecognilnn partk^ 72kD; TPR,AlRC,SAICAR.syn1; 5.40 

422241; Y00062; Hs.170121; protein tyrosine phosphatase, receptor type, C; kinesInMYj)hosphat8se;TM:^: 5.40 

411825; AK000334: H8.72289; hypothetica! protein RJ20327: SNF.Zip;TM=Y;; 5.36 

400205: ; ; NM.006265":Homo sapiens RAD21 (S, pombe) homotog (RAD21), mRNA.(AP0.1/CD95 (Fas)^odaled phosphatase) (PTPN13}, mRNA.; DUF173;SS=M; 5,35 
410687; U24389; Hs.65436; lysyl oxklase^Ike 1; LysyLox!dase:SS=M; 5.34 
407786; AA687538; Hs.38972: tetraspan 1; transmembrane4:TM:'Y;SS=M; 5.34 

4251 18: AU07661 1 ; Hs.1 54672; methytene tetrahydrofblate dehydrogenase (NAO dependent), methenyltelrahydrofidate cyctohydrolase; myb.DNA- 

blnding,THF.DKG_CYH,THF.DHG^GYH.aCAP_6LY.AA\L0M.P»pfl^^ 
400210; ; ; Eos Control; Adapjcomp^sub,Clatjadaptor_8;TM=M;: 5.32 

414825; X0637Q: Hs.77432: epMennal growth factor receptor (avten erylhrobtastk: teukemia viral (v<ert>^) oncogene homotog): Fuiin4ike.pUnase,Recep_momah;TMsM;SS4ft 
5.31 

414035: Y00630; H3.75716: serine (or cysteine) proteinase inhOntor. dade B (ovalbumto). member Z serptn;SS»M; 5.30 

416000; R82342; H5.79856; ESTs, Weakly similar to S65657 alpha- l&adrenergic receptor spBce form 2 [Ksaptens): non6,8ugar.^, 5.30 

4143S8; W70171; Hs.7S939: uridtoe monophosphate kinase; PRi<,CoaE:; 5.29 

424321; W74048; Hs.1765; lymphocyto«pedfto pnteln ^Ine Mnase; 8H2,SH3,pkinas8:TM»M;: 5.29 

450296; AL041949; Hs.24756; hepaiocyte growth tector-regulated tyrosine kinase substrate; none,none; 5.29 

456672; AK002016; Hs.1 14727; Homo sapiens, done MGC:16327, mRNA. complete cds; none,PK.PieC,myosln.head,RhoGAP: 5.28 

410068; AI633888; H3.58435; FYimtding protein (FYB-12Q/13(Q: SH3:TM=M;: 5.28 

456629; AW891965; Hs.279789; hislone deacetybse 3; HSP90,HATPB5Q.c;Ef<:2H2,PHO.nQne; 5.27 

417218; AA005247; Hs.2B5754: met prot»onoogene (hepatoeyto growth fedor receptor); pkinase.Semai,PSI.TlGJntegrin_8:TM»Y:SSsl\t 5.26 
444051; N48373; Hs.10247; activated leucocyte cell adhesion mdecuto; none.none; SJZ6 
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404083t ; ; C60021S9^gil828027tpiit|AS3S93 prot^n-tyrosins-phosphataso (EC 3.1.3.48). nonreceptor ty; none;SS^ 5.26 
422051; AyV327546: Hs.111024: sotuto carrier famOy 25 (mBochondrial carrier; citrate transporter}, memtier 1; mito carr.TM=M;; 5.26 
419034; NNt002110; Ks.e9555: hemopocetic cell Idnase; SH2.SK3j}kin3se;TM=M;; 5.26 
427732; NM.002980: Hs.2199; secrefin receptor; 7tnL2,HRM;TM=M;SS=M; 5.25 

425921; NM.007231; Hs.16221 1; solute carrier fanrfiy 6 (neurotransmitter transporter), member 14; SNF:TiyNY;SS=M: 5.25 

446030; N30714: Hs.325960; memtyane-spanning 4-domans, subfamily A, member 4A; non8:TM»Y;SS=ht 5.24 

441607; NM.005010; Hs.7912; neuronal cell adhesion molecula; WD40,fh3^;TM»f^ 5.23 

446620; AA12880d; Hs.179902: transporter-tike protein; none;m=:Y;SS=M; 5.23 

422616; BE300330: Hs.118725; selenophosphata synthetase 2; AIRS^RS.C;TM4^; 5^3 

447131; N\L004585; Hs.17466; retinoic add receptor responder (tazamtene indxed) 3; none:TM='Yn 5.21 

446272; BE26691 2; H5.14601 : hematopoietic cell-specific Lyn substrate 1; SH3.HS1 jep;TM=M;; 5^ 

450447; AF212223; Hs.25010: hypothetical protein P15-2; KTF2;TM=M;; 5.19 

42S003; AF1 19046; Hs.154149; apurinic/apyilinidlnic endonuclease(APEX mideaseHlke 2 protein: TrDpoiiin.Exojendoj)hos.lQ;TM^; 5.19 
446836: AC002563: Hs.15767: dtron (rho^nteracang. serineAhreonine idnase 21); CNH.DAG_PE-bind.PH.1nvolxrin.M;TMstk«;: 5.19 
434826: AF155661; Hs.22265: pyruvate dehydrogenase phosphatase; PP2C.none; 5.19 
447081; Y13B96; Hs.17287; potassium inwardly-rectifying channel, subfamily J. member 15; IRK;TM=Y;; 5.19 

407949: W21874: K$.247057: ESTs. Wealdy simQar to 210926QA B ceil growth factor (Ksapiens); Ribo6omaLS14.8n)(4ddnase.deafh.none; 5.18 
442^)0; A1A^0572: H5.235768: ESTs; none,TOne; &18 

446566: H95741; Hs.17914: membiane^panning 4-domalns. suMamity A. member 6A: none:TMNY:SS^ 5.16 

452690: AI536C70; Hs.15085; ESTs; pou,homeobox.lla.chan>U^F_receptDr; 5.18 

419138: U48508; Hs.89631; ryanodine receptor 1 (stceletal); ton_trans,SPfW,fWDIUTPR.RyR.MlR:TM»Ys 5.17 

431441; U81961; Hs.2794; sodium channel, nonvoltage^ated 1 alpha; ASC;TM=Y;; 5.16 

418945; BE246762; H5.89499; arachMonats 54ipoxygena$e: lipoxygenase.PUT;Tr^M;; 5.16 

407601; AC002300; Hs.37129; sodium channel, nonvoitag&^aled 1. beta (Uddle syndrome); ASC;TM:'Y;SS44t 5.15 

429500; )C76565; Ks.2891 14; hexabrachton (tenasdn C, cytotacOn); EGF,fn3.fibrinogen_C.toxin.2.Kera0n.B2:TM=M;SS°Y; 5.15 

411984: NM05419: Ks.72g88; signal transducer and activator of transcription 2, 113i(D; SH2.STAT,STATJ]faid.STATu»ol;TM»liA:: 5i15 

433470; AW960554: : transmembrane 4 superfamBy member 1; none;TM='Y:SS"M; 5.14 

452817: AA322859: HS.28427S; Homo sapiens PAI^ mRNA. complete cds; pldnase.PBO:TM=M:: 5.14 

453102: NItfL007197: Hs.31664; frizzled (DrosophUa) homolog 10; Fz.Fri22led,7tmJ2;TM=Y:SS=l«; 5.14 

427792; M63928; Hs.180841: tumor necrosis (actor receptor superfamily. member 7; SRP14,TNFRjc6;8S»M; 6.14 

430563: AA481269; ; ATP-blnding cassette, sub-family C (CFTRrti^RP). member 5; ABC.tran,GTP.EFTU.ABCjnembi3ne.none: 5.13 

4316B1; AK000378; Hs.267566; hypotheMcal protein FU20371; 6ugar_tr;TI^Y;; 5.12 

431183; N14^00585S; Hs.250596; KDEL (Lys-Asp-GIu-Leu) endoplasmic reticulum protein retention receptor 3; Ell.fcimenjecepl:TM=M;SS=M: 6.12 

417771: AA804698; Hs.82547; reQnoic add receptor responder (tazarotene induced) 1; none.none; 5.11 

418613; AA744529; Hs.86575; nitogen-acQvated protein Idnase idnase Idnase Idnase 1; p)dnase.CNH:TM=M.': 5.1 1 

409524; AW402151 : Hs.54673; tumor necrosis factor (Bgand) superfamily, member 13; TNF:TM=Y:SS=M: 5.1 1 

436856; A1469355; Hs.127310: ESTs; pidna5e.rmi;TM=M;; 5.09 

411296; BE207307; Hs.10114; growth suppressor 1; 20G-FBil_0xy;TM=M;SS=M: 5.09 

410082; AA081S94; Hs.15831 1; Musashl (Drosophila) homoiog 1; rrm;T(^l^; 5.09 

404440; ; ; NM.021048:Homo sapiens melanoma antigen, family A, 10 (MAGEA10), mRNA. VERSION NM.021049.1 G); MAGE:TMsM;: 5.08 

424977; AA34g289; Hs.100057; Homo sapiens cDNA: F1J22902 fis. done KAT05581; none,non8: 5.08 

422100; A1096988; Hs.1 11554; AOP-rlbosylatlon factor-like 7; arf.rB3;TM=M;: 5.07 

452222; AW806287; Hs.21432: SEX gene; Sema,TIG.PSI.GDI; 5.07 

430300: U80805; H5.238648; oncostaQn M receptor; th3:TM=>Y:SS»M: 5.07 

408369; R38438: Hs.182575: sohite canter larrtiiy 15 (H??7 transporter), member 2; PTR2:Th^Y;: 5.07 

422112: BE540240; Hs.111783: Lsmi protein; Sm.BAG;SSsH 5.06 

449961; AW265634; Hs.133100; ESTs; pkinase,Furfn-like.Recep.l^domdn,none; 5.06 

430024; AI808780; Hs.227730; integrin. alpha 6; lntegrinAFG^TM=Y;SS=M; 5.06 

412641; M16660: H$.74335: heal shock 90kO protein 1, beta; HSP90.HATPase.c:TM»M:: 5.05 

437608; AA761605: H8.292308; ESTs. Wed^y similar to ALU1JHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.8aplen^ pkinase,Rt01,none: 
5.05 

400296; AA305627; Ks.139336; ATP^nding cassette, sutMamOy C (CRR/MRR), member 4; ABC_tr8n.ABCjiiembrBne;TMsY;; 5.04 
446232; A1281848; Hs.194691; retlnote add Induced 3; 7tm_3.none; 5.04 
425262; D87119; Hs.155418: 683955 protein; pkinase;SS«M: 5.04 

414703; BE243877; Hs.76941; ATPase. Na? transporting, beta 3 po^tkte; NU(^TPase:T1^Y;SS»k« 5.03 

434808; AF1 55108; Hs.256150; Homo sapiens. Similar b RIKEN cONA 2810027019 gene, clone M6C:14827, mRNA. complete cds; none;TM»Mn 5.03 
425852; AK001504: Hs.159651; death receptor 6, TNF superfantily member 21; de8ih.TNFK.c6:TM»Y;SS»M: 5.03 
449437: At702038; Hs.100057; Homo sapiens cDl^ FU22gi}2 fis. clone KAT05581: none.none: 5.03 

448913: AA194422; Hs.22564; myosin VI; ixrn.zf-RanBP.pkinase.GST.C.Ets.SAMJ'NT.ABC^.mentrane.myosinJead.lQ.MyQsin.N.ba 

C2H2.PHD.BTB,TRIS>^T_hook.SAM;TM=M:; 6.02 
413441; AI929374; Hs.75367; Src-like^pter, SH2,SH3;TM=M;: 5.02 

427618; NM_000760; Hs.217S; colony stimulaBng factor 3 receptor (gramiocyte); fn3;TM=M;SS=M; 5.02 

417666; A1345001: Hs.e2380; men^e a trots 1 (OAK assembly factor); zf<:3HC4:TM^M;: 5.02 

429903; AL1 34197; H5.93597: cycfiiMlependent kinase 5. regulatory suburtit 1 (p35); COIC5_adwalDr,none: 5.01 

445333; 6E537641; Hs.44Z78; hypothetical protein FU12538simnar to ras^elated protein RAB17; ra5.art,TK;SSsM; 5.01 

426285; U20620; Hs.343581; kaiyopheiin alpha 1 Cunportin alpha 5); AmiadiIto_seg.lBB:TM»M,-; 5.01 

421233; AA209534; Hs.284243; tetraspan NET^ protein: transmembran84;TM::Y;SS^M: 5.01 

424517; AI539443; Hs.137447: Homo sapiens cDNA FU12169 fis. done MAMMA1000643: SH2.STAT,STATJjind.STAT_prolnone; 5.00 

'425345; AU077297; Hs.155894: protein tyrosine phosphatase, nonreceptor type 1; Yj)hosphatase,DSPo;TM=t4:SS=M; 5.00 

446946; AtB78932: Hs.317; topoisomerese (DN/^ 1; TopoisomeraseJ.Topoisomer_LN.RrrwAD.Hanta-mideoc8p;TM=M;; 4.99 

413900; AW409747; Hs.75612; stress-lnduced-phosphoproleln 1 (Hsp70/Hsp90organt2ing protein); TPR,PDZ.WW.GuanylateJdn:TM=M;; 4.98 

4121 16; AW402166; H3.784; Epsteln-Barr virus induced gene 2 (lymphocyte<;pedlte G proteifrcoupled receptor); 7tnLl;TM=Y;SS=M: 4.98 

400792: AA635052: ; Homo sairiens mRNA; cONA DKFZp434O0515 (from done DKFZp434O0515); rf-C3HC4.CARD,BtR:TM=MK 4.98 

417018; M16036: Hs.80887; v-yes-1 YamaguchI sarcoma vbal related oncogene homoioo; SH2,SH3,pklnase;TM»M:: 4.98 

427247; AW504221; Hs.1741 03; Integrtn, alpha L (antigen CD11A (plBO). lymphocyta (unctloiKessodated antigen 1 ; alpha polypepfide); vw^inlBgrirLA.FGW;TM*Y;SS=^* 4.98 
442080: AW444761; Hs.44565: ESTs; ank;; 4.97 

454042: H22570; : hypotheltoal pmtein FU20093; Ig j)ldnase.LRR.LRRNT.LRRCTjione: 4.97 

452698; NM.001295; H3.301921: diemoktne [OC mofil) receptor 1; 7tm_1;TM=Y:SS=M; 4.96 

416276; U41060: Ks.79138: UV-I protein, esbogen regJated; PepIk!ase.C4,Osteopontin.Zip:TMsY:SS=M; 4.96 

408847; AW290997: H3.30348: ESTs; pkln3se.ig.none; 4.96 

419452: U33635: Hs.90572; PTK7 protein tyrosine ktoase 7; lgj)kina5e:TM=Y:SS=M: 4.95 

450737; AW0071S2: Hs.203330: ESTe; tiypsiigdLrBcepLa.nona; 4.95 

443354; AW970672: Hs.9247: protdn Unase, AMP^cUvatad, tfpha 1 c8tEly6e subunil; pkinas8.RI01;TM=M:; 4.94 
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41413S; NM.0044ig; Hs.2128: dual spedfidly phosphalase S; Rhodanese.DSPc.Yj)hQ6phdtase;TM=M;: 4.94 

424247: X14008: Hs^734; lysozyms (rend amyloidosk): lys.ig/AO.Synth.ldhJUh.apMnase.'SSsM: 4.94 

434206: AW13S973; HS.1B0479: ESTs, WeaKly simaar to S69890 milogen indudbte gene mfg-2 [H^apiens]: PH;TM=M;: 4.93 

41B870; AF147204; Hs.89414;chemokIne{C-X4:motiO,fecqjtor4(fusin): 7lm_1,7lm.2;TM=Y;SS=M;4.93 

4087t6; A)567839; Hs.151714: Homo sapiens ntf^ for KIAA1769 protein, partial ods; Uvr04iertcase,RNB,RuntTM=M;: 4.93 

426437; BE076537; Hs.16g895: ubiqtdfiivconiuga&ng enzyme E2L 6; Afniadillo^,UQjEon,none: 4.92 

424241; AW995948; H$.182339: Homo sapiens pynivate dehydrogenase Unase 4 mRNA, J untranslated region, partial sequence; Els.SAMjPNT;TMsM:: 4.92 
414570; Y(»)2B5; Hs .76473; insutuvtiKe growth factor 2 receptor. fn2.QMR;TM^;SS^ 4.92 

407239; AA076350: Hs.67846: leukocyte tmmunoglolnilWike receptor, sulrfamlly B (with TM and ITIM domains), memtier 4; l9;TMsY;SS=M; 4.92 
409512; AW979187; Hs.293591; melanoma differentlafion associated protein-S; DEAO.tielicase_CCARD;TM=M;: 4.91 
416714; AF283770; Hs.79630: C079A anBgen (immunogtotHiln-assodated alpha); lg.rrAM^-Clus;TM»Y;SS=M; 4,91 

404289; ; ; NM.002944*:Horoo sapiens \wds avian UR2 sarcoma vims oncogene homolog 1 (ROSl). mRNA.; (h3,pWnase,DUF139;TM=Y;SS=M; 4.90 
428141; 050402; Hs.18261 1; solute carrtar family 1 1 (protoivcoupled divalent metal Ion transporters), member 1; Nramp;TM»Y;; 4.90 
407853; AA336797; Hs.40499: dickkopf (Xenopus laevb) homolog 1; none;TM=M;SS=Y; 4.89 
432179; X75208; Hs.2913; EphB3; EPH_Ibd.fn3,pklnase,SAW;TM=Y;SS=M; 4.89 

401083; : ; NH.016582*:Homo sapiens peptide transporter 3 (LOC51295). mRNA. VERSION NM-016579.1 Gl; PTR2;TM=Y;SS=M; 4.89 
402211; AA811738; ; KIAA0430 gene product; lon_lrans.lCtetra;TM=Y;: 4.88 

421541; NM.003942: Hs.10S584; rilxjsomal protsin S8 kinase, 9QkO, polypeptide 4; pldn8se4)kinase_C:TW>M;: 4.87 

431810: X67155; Hs.270845; ktnesin4ike 5 (mitotic Mnesfaviito protein 1); kinesin:TM=Mn 4.86 

425295; AA431366; Hs.37251; ESTs: pk]n3se.none; 4.86 

424439; AA579635; H5.1770; Hgase l» ONA, ATP-dependent DNA^Qgase;; 4.86 

419168; AI336132: Hs.33718; Homo sapiens cDNA FU12641 Its. done NT2RM4001953; none.none; 4.86 

442875; BE623003: Hs.2362S: Homo sai^ done TCCCTA00142 mRI4A sequence; ietetra.0UF51^ne: 4.86 

425465; L18964: Hs.1904: piolein kinase C, lola; pkln8se.DAGJ%4)Ind.pNnase_aOPR;TM=M;: 4.66 

410293; AK000047; Hs.61960; hypothetical protein; K.telra;TM=H; 4.86 

443623; AA345519; Ks.9641; complement component 1. q subcomponent, alpha pdypepSde; C1q.Coilagen:SS=M; 4.85 

445903; AI3474a7: Hs.132781; dass I cytokine receptor. fn3;T^4«Y;; 4.85 

427509; mSOS- Hs.2161: complement component 5 receptor 1 (C5a l^arNi); 7tnLl:TU»Y:8S»M; 4.85 

428820; AA436187; Hs.172631; Integrin, alplia M (comfHement component receptor 3. alpha; dsolcnown as (3)1 lb (p17(q, macrophage antigen alpha polypepBde): 

vwajntegrin J^F&GAP,TM=Y;SS=M; 4.84 
445143; U29171; H3.75852; casein kinase 1. delta; rf.(:3HC4.Filamln.zf-B>x.NHU>Mnase;d^Z;TM=M;; 4.82 
427157; U61166; Hs.173824; thymtne-DNAglycosyiase: UD<9;TM=M;; 4.81 
427857: AL133017; Hs.2210; hypothettod pnrtdn FLJ22B65; myosln>ad.lQ.zWi«YND;TM=M;SS=M; 4.81 

422293; X94453; Hs.1 14366; pyrroIina-5<arbaxytate synthetase (ghitamate gamma-semialdehyde synthetase); aldedh.a8kinase:TMsM;; 4.81 
414280; BE410769; Hs.75873; ryxln; UM.Ig.pWnase;TM=M;SS=M; 4.81 
424570; AA343306; Hs.13351 1; ESTs; SH3,ank,none; 4.80 

451144; AW956103; Hs.61712; pyruvate dehydrogenase kinase, isoenzyme 1; HAP8se_c,none; 4.80 

402705; AA214818; ; activator of S phase kinase; AhpC>TS^TM^;SS=M: 4.80 

410024; AW191024; Hs.55016; hypothefical protein FU21935; SH3;TM=M;: 4.80 

419972; AL041465; Hs.182982: gol{^; none.none; 4.80 

427127; AW802282; Hs.22265; pyruvate dehydrogenase phosphalase; PP2C.none; 4.80 

413476; U25849; Hs.75393; add phosphatase 1, sdubte; LMWi^TM=M;SS=M; 4,80 

415801; R24219; Hs.278443; Fc fragment of IgG. tow affinity lib. receptor for (C032); tg;TM>Y;; 4.79 

402233; ; ; NM-030760*:Homo sapiens endolheliai dlfferenllallon, sphingolipld G-protein^xJUpled receptor, 8 (EDG8), mRNA.; 7bR.1;TM=Y;SS=M: 4.79 
448153: Y10805; Hs.20521; HMT1 (hnRNP methyltransferase, a cerevisiae)^'te Z NusG;SS=M; 4.79 
407722; BE2S2241; Hs.38041; pyridoxal (pyrWoxine, vitamin B6) kinase; pfkB;TM^; 4.79 

405370; ; ; NM_005569«:Homo sapiens domain kinase 2 (UMK2), transcript variant 2a. mRI^; pkinase,UM.PDZ:SS=M; 4.79 

416498; U33632; Hs.79351; potassium channel. subfanHy K. member 1 (TWIK-I); lon„trans;TM=Y:SS=M: 4.78 

429921; AAS26911; Hs.82772; collagen, type XI. alpha 1: Cdl8gen.C0LFl,TSPN.iam]nin_G.O>rA;SS=M; 4.78 

424415; NlyL001975: Hs.146560: endase 2, (gamne. neunmaO: endase;TM=M;; 4.78 

4331 33; AB0Z7249: Hs.104741; PDZ-Undlng Unase; T-cdi originated protein kinase; pkinase;TM=M;; 4.78 

431629; AU077025; Hs.265B27; interferon, alpha^ndudble protein (done IFI*16); none:TM=M;SS=Y; 4.78 

417929; R27219: Hs.74647; Human T-cell receptor acthre alph»hain mRNA from JM cell Rne. complete cds; lg,abhydrolase; 4.78 

450334; AF035959: Hs.24879; phosphatiifo add phosphatase ^a 2C: PAP2:Th^Y;SS:=M; 4.78 

447674; BE270640; Hs.19192; cydln^tependent kinase 2; pkin85e;8S>MI; 4.77 

409744; AW675258; Hs.66265; Homo sapiens mRNA; cONA DKFZp586P2321 (from done OKFZp586P2321); none;NA:NA; 4.77 

446196; AI744888; Hs.149470; ESTs; zf^H(M,SulfatB_transp,STAS; 4.77 

429305; AF095727; Hs.287832; myefin protein zer»4ike 1; ig,transmembrane4;TM=Y;SS=M; 4.77 

426612; AF105365; Hs.172613; sohite earner famly 12 (potassiumfchtorkle transporters), member 7; none;TM»Y;; 4.77 

425811; AU)39104; Hs.159557; karyophedn alpha 2 (RAG cohort 1, Imporfln alpha 1); ArmadiIkLj6e9.IBB,DEAD.hdkase.C,Sec63,DOT,PHD,bromodomdn:TM»M;; 4.77 

444664; N26362: Hs.11615: map kinase phosphatase^ike protdn MK-STYX; DSPt;TM=Mn 4.77 

452256; AK000933; Hs.28661; Homo sapiens cDNA FU10071 lis, done HEIUtBA1001702; GOI.7tnv.1.none: 4.76 

447207; AA442233; Hs.17731; hypotheQca) protein FU12892; none;TM=M;; 4.76 

400846; ; ; sortiiin^ated receptor. MDIR dass) A repeals-containing (SORLl); EGF.fn3.kn:jBcepLa,idLrecepLb.granuIin.BNR;TM3Y;SS=M; 4.76 
452355; N54926; Hs.29202; G proteirM»upled receptor 34; 7tm.1.0ATP^C;TVNY;; 4.75 

406809; AF000574; Hs.22405: leukocyte immunogtobullivilice receptor, subfamily 6 (with 7M and iTIM domains), member 2; ig.GBmiriijnov;TliNM;SS=M; 4.75 
- 427378; 6E515037; Hs.177556; melanoma antigen, family 0, 1; MAGE;TM=M;; 4.75 
444042; NIUL004915; Hs.10237; ATP4}lnding cassette, subfamily G (WHITE), member 1; ABCJran.PRK.GBP;Tli*5Y;; 4.74 
410406; A1969703; Hs.1466; giycerd kinase; FGGY,FGGY_C;TM=Ms 4.73 
41 1653; AF070578; Hs.71 168; Homo sapiens done 24674 mRNA sequence: none;NA:NA; 4.73 
437667; BE61641 2; Hs.2e6218; jundionai adhesion mdecule 1: none.HLH: 4.73 
417781; BE279380; Hs.82563; KIAA0153 protein; TTl.A:yLtransf; 4.73 
453986; BE148734; Hs.63325; transmembrane protease, serine 4; tryp8ln.WljecepLa.none: 4.73 
412228; AW503785; Hs.73792; ccmpiemenl component {3d/Epstein Barr virus) receptor 2; 5Ushi;TIWNY;SS=M; 4.73 
418255; AW135405; Hs.37251; ESTs; pWn3se.none: 4.73 

413472; BE242870; Hs.75379; solute carrier family 1 (glial high affinity glutamate transporter), member 3; SDF;TM=Y;SS=M; 4.73 
406906; Z25424; ; gb:H.sapiais protein-serlne/threonlne kinase gene, complete COS4 none,none; 4.73 
432065; AA401039: Hs.2903; protein phosphatase 4 (formerly X). cat^ subunll; Mela!lophos:TMsM;; 4.72 
424909; 3781 87; Hs.1 53752; cell diviskm cyda 25B; Rhodanese;SS=M; 4.72 

422599; BE387202; Hs.118638: noiHnetastalic ceBs 1. protdn (NM23A) expressed In; NDK,PH.OxystBrol_BP:SS=M; 4.71 
426136; AW957239; ; gb:EST369309 MAGE resequences, MAGO Homo sapiens cONA, mRNA sequence; PP2C,none: 4,71 
446203; Z47553; Hs.142B6; flavin conlatriing monooxygenase S; FiyiO#e.pyr.;edox;TMBY;SSnM; 4.71 

451295: AIS57212; Hs.17132; ESTs. Moderately simBar to IS4374 gene NF2 protdn lasapiens) pklnase.OAGLPE-ttid.pUnasa.(;OPR.none: 4.71 
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424099; AFO71202: Hs.139336: ATP^mfing casse&s. sub-family C (CFTR/MRP). nwnbef 4; ABC.Iran^jneinbTan8;TM»Ys 4.70 
424959; N|yL005781; Hs.l53937; activated p21cric42Hs Wnase; Wh.1dh^aSH3,pMnase.UBA:TM=Mn 4.70 
427206; NM»004586; Hs.173965; rfbosomal protein S6 Hnase. 90kD. pdypei^o 3; nonB,non8; 4.70 

421662; NM_014141; Ks.l0655^ cell recogrfflon molecute Caspr2; EGF,F5.F8_typ8.C»lainbiin_G.SuIfalB_transp,STAS Jtnu3^_in^^^ 4.70 

413431; AW246428; Hs.75355; uttqullirKon|ugaBng enzyme E2N (homologous to yeast UBC13); Ua.con:TM=M; 4.70 

405484; ; ; C3002124*:gl)12737280]renXP_0066Bi2| keraCn 18 [Homo 8apiensII16633; none;SSsM; 4.70 

401345: M83738; ; protein tyrosine phosphaiase, no^feceplor type 9; none;TMsM;; 4.70 

416602; NM_008159; Hs,79389; nel (chlckenHIke 2; EGF.w«;.TSPN;SS=Y: 4.69 

412507; L36645; Hs.73964; EphA4; fn3.pklnase.SAM.EPHJbd;T(vNY;SS=M: 4.69 

437897; AA770561; Hs.146170; hypothetical pnto RJ22969; zH)HHC.non^ 4.69 

432886; 86159028: Hs.279704; chromatin aocesstt){Gty complex 1; none;TM=M;; 4.69 

400843: ; : Nlit003105':Homo sapiens sortilbMBlated receptor, \jpUi dass) A repeats-contairfng (S0RL1). mRNA.; 

EGF.fnatdl_recepLa.ldLrecepLb,granulln,BNR;TM=Y:SS=M; 4.6B 
433409; AI278802: Hs,25661; ESTs; {^e.pktnase; 4.68 
413869; N1VL000878; Hs.75596; biterteukin 2 receptor, twta; none:TM=Y:SS=M; 4.68 
430259; BE550182; Ks.127826; RalGEWite protein 3, mouse homolog; ih3,R\R8S6EF;TM=M;SS=Nt 4.68 
42S761; AW664214; Hs.196729: ESTs: SH3,(^l)OSomai_S3Ae; 4.68 

431941; AK000106; HsJ!72227; Homo sapiens cONA FU20099 lis. done €0104544: pktnase Jiir1n^Recep.L.dbmahi,none: 4.68 
419493; AR)01212; H5.90744; preleasome (prosoms, maciopam) 26S 8ubun», noivATPase. 11: CDK5_acBvatorp0.none; 4.67 
425966; NM_001761: Hs.1973; cydin F; cycnn,F-box.cycan_C;TM=M,-; 4.67 
408056: AA312329; Hs.42331; ephriivA4; Ephrin;TM=M:SS=M: 4.67 

453476: AI640500; Ks.24633; SAM domain. SH3 domain and nudear localisation signals, 1; SH3.SAM:SS=M; 4.67 

412926; AI879076: H8.7S061; manophage inyristoylatBd alBnineM C kinase substrate: MARCKS;SSsM: 4.67 

424635: AA420667: Hs.115455: Homo sapiens cDNA FU142S9 lis, done PLACE1001076; pk!n8se.Fuitrvlke^p_L.<ksnaIn.none: 4.66 

446051: BE048061; Hs.37054; ephrin-A3; EphiinA.deamin.dsrm,z-8}pha; 4.66 

436729; BE621807: ; transmeml)fane 4superfamlly member 1; nonB;TM=Y;$S=M; 4.66 

408204; AA454501; Hs.43666; protein tyrosine phosphatese lype IVA. member 3; Y_phosphatase:TM=M:; 4.66 

435542; AA6e7376; : ESTs; SH3^.pidn3se.PH.8pectrin.RhoGEF,none; 4.66 

429682; NM-008306; Hs.21 1 602; SMd (structure maintenance of chfomosomes 1, yeastKiite 1; ABC.trBn.SMC.H,SMC_C.KlO;TMsM:; 4.66 

417497; AW402482; Hs.82212; CD53 antigen; transmemt)rane4;TM=Y;SS=M; 4.66 

418736; T18979; Hs.87908; Snf2-related CBP adivatof protein; heIicase_C,AT_hook,SNF^.N:TM=M;; 4.65 

415117; AF120499; Hs.78016; potynudeoMde Wnase 3'-phosphatase;ViralJjeHcaso1:TMsM:; 4.65 

418629; BE247550; Hs.86859; growth factor leceptor-bound protein 7; SH2.PH,RA;SS"M: 4.65 

426108; AA622037: Hs.166468; programmed ceD death 5; DUF122;TM=M;; 4.64 

429263; AA019004; Hs.l98396; ATP-blndlng cassette, sub-family A {ABC1), member 4; ABC.Iran.SRP54iTM=»Y;SS«M; 4.64 
431886: 177964; Hs.271980; mHogen-adivaled protein kinase 6; pkin8se:TM=M;; 4.63 
435049; AL122067: H$.4746: hypothetical protein FU21324; none;TM»M;; 4.63 

437763; AA469369; H3.5831: tissue inhaiitor of metalioproteinase 1 (eiythroid potonSaOng acflvRy, oOOagenase inhibitor); TIMP,pkinase,DAG.P&faind,RBD: 4.63 

413436; AF238083; Hs.68061: sphingosine kinase 1; DAGKc;TM=M;: 4.63 

421846; AA017707: Hs.1432: protein kinase C substrate 80K-H; efhand.kil_recepta;SS=M; 4.62 

442590; Ai002686; Hs.130313; ESTs; none,Y.phosphatese,6and_41,connexin; 4.62 

416224; N^^002902: Hs.79088; retlculocalbin 2. EF<l)and cak:lum binding doman; dhand;SS=M; 4.62 

423740. Y07701; Hs.293007; amtnopeptklase puiomyctn sensitive; Peptid3se.M1/m«Iino_seg; 4.61 

429300; AB011108; Hs.198891: serineflhreonineiirotein kinase PRP4 homdog; pkin3se;TM3M:: 4.60 

447232: AW4g9834; Hs.327; tnterteukin 10 receptor, alpha; none;TKNAA;SS^ 4.60 

412942; AL120344; Hs.75074; mltogen-activated protein klnase^vated protein kinase 2; pk]nas8:TM=iyt: 4.60 

419596; 6E379320; Hs.91446; MKP-1 like protein lyrosine phosphatase; DSPc;; 4.59 

417880; BE241595; Hs.82848; setectln L (lymphocyte adheston mdecule 1); E6F48dln_c,sushi;TM=M;SS*M; 4.59 

41 1 125: AA151547; Hs,68877: cytoduome b.245. alpha pdypeptide; none;TM=Y:SS«M: 4.59 

434883; AW381S38; H8.19807; hypotheflcdl protein M6C12S59: SH3,PH,WW.RhoGAP;8S^ 4.58 

447312; AI434345; HsJ6908; adivaling transo^n fador 1: rTn\zf-I^P.pktoase,GST.C,Els.8AK4.PNT.ABCZjneiitbraneinyw 

C2H2.PH0.BTB,TRISATJl00k,SAM:TM=M;; 4.58 
435254; AW194689; Hs.30778; ESTs; pkinase.BadertaLPQQ.none; 4.58 

426925; NM-001 196; HS.31S689; Homo sapiens cDNA: FU22373 lis, done HRO08741; EsterBse,enolase.Peptidase.89:TM=M:; 4.58 

421685; AF189723; Hs.106778: ATPase. Ca transporting, ^ 2Ci member 1; Ca8on>TPase.C;Calioa>TP8SBjl,E1-E2>TP85e,Hydrofasa,XPGLN:Th^ 4.58 

447627: U73727; Hs.19718: protein tyrosine phosphatese. receptor type. U; th3.lg.Yj)ho6phalase.MAM;TM=>Y;SS»M: 4.58 

427640: AF058293; Hs.180015; CMtopachrome tautomerase; COXa.SHMT,MlF.GST.QEF1G_domaln.GSTJI,S1,Fz,Fifczledjcalretfca*Bn,7tm»Z^ 4.57 
441085; AW136551; Hs.181245; Homo sapiens cONA FU12532 Its. done NT2RM400Q200; nonajrane; 4.57 
409581; U66243; Hs.5S039: mitogen^ated protein kinase 12; pMnase;SS=M; 4.57 
423184: NM_004428; Hs.l624; ephrin^l; Ephrto:TM=M.'SSBM; 4.56 

443920; AL037764; Hs.35304; Homo sapiens cDNA FU13855 (is. done Pt.ACE1011503: none.FM04ike; 4.S6 

422627; BE336857; Hs.118787; Iransfonnlng growth factor. beta-Induced. 68kD; F8sddinABC.>an/BCjnen4rane,GTPJEFTU;TM»M;SS=M; 4.58 

416869; AW516565; ;gb3(q01d05j(l Soares_NHCeCj:eivtoaLtumor Homo sapiens cDNA done 3* sbnOar to conteinsAlu repetitive etement;contdns etemenl MER11 repetifive 

element ;, mRNA sequence; none.R8sGAP.WW,ICI: 4.58 
430016; NH.004736; Hs.227656: xenotroptc and polytropfe retrovirus recepton SPXEXS;TM»Y;; 4.56 
437157; BE048860; Hs.120655; ESTs; IRK,none; 4.55 

422769; AA938905; Hs.120017: dfactay receptor, family 7. subfamily E, member 38 pseodogena; none,none; 4.55 

457918; At^9590; H3.162604; hypotheficd protein DKFZp762Ml86; PL[)c;TM=M;; 4.55 

434467; BES52368; Hs.231853: Homo saddens cONA FU13445 fis. done PLACE1002982: 71m J/wne; 4.55 

421 1 40; AA298741; Hs.l02135; signal sequence receptor, ddta {trsnsk)coiv8SSOdated protein ddte); none;TM»Y;SS»M: 4.55 

406364; : ; Target Exon; hex3pep;TM=M;; 4.55 

434682; AA827165; Hs.l919S8; immunogtobulin supeifamOy receptor transtocallon associated % ig.none; 4.54 

438939; H21012: Hs.287657; Homo sapiens cDNA: FU21291 lis. done GOL01963; F5 J^8Jype.C.pkinase,Et8,nooe; 4.54 

433435; BE545277; H3.340959; Ts translation etongafion factor, mitochondrial; EF_TS,UBA;: 4.54 

411165; NM_000169; Hs.69089; galactosidasa, alpha; MeIiblase;SS=M; 4.54 

408956; AK001868; Hs,49344; hypothetical protein FU11006: k)n_trans;TM=Yr, 4.54 

416847; 143821; Hs.B0261: enhancer of tilamentatton 1 (cas-Gka docking; Crk-assocfated substrate related); SH3:TM=M;: 4.53 

410226; AI831958; Hs.61053: hypothetteal protein; SH3.TPR;TMsM.*; 4.53 

422753; AI928995; Hs.1575; smaH nudear rtnnudeopfotein 03 polypeptide (18kI3); Sm:SS=M: 4.52 

418355; L42563; H3.1155; ATPase. H7 Iranspoifing. nongastrte, alpha pdypeplWe; El-EZ^TPase.CatioiLATPase.CiCalloiLATPaseJI.Hydtdase;TM3Y;; 4.52 

400261; : : Eos Control; ig.MKCJLbeta:TM»Y;SS=M: 4^ 

444633: AF1 11713; Ks.288216; jundtond adheston mdecuto 1; ig;TM»Y;SS«M: 4^ 

422940; BE077458: ; gb:RC1-BT06064n05004)1SM4 Bfni606 Homo sapiens cONA. mRNA sequence; Sec7.PKANF^feceptorJlg.d)an.WD40.IRI^ 4.52 
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40O3O3: AA2427S8: Hs.79136; UV-I pioteln. estrogen regulated; nonsjwne: 4^1 

412604: AW978324: Ks.1904: ptoton kinase C iota: (ddnase.DAG_PE^Ind.(ddnase.COPR:ThNAn 451 

448533; AA31 1426; Ks.21635; tut»jlb, gamma 1; tubu{in;TM=M:: 4.51 

457906; AW975939; Hs.153290; Homo sapiens cONA FU14318 fis, done PIACE300O402: none.pl(inaS6: 4^1 
456382; AW973003: Hs.179909; hypolheticeJ protein FLJ22995; none;TW=Mn 4.51 
429890; AW956329; Hs.23721; ESTs; nonewar.tr,rebcsomaLS25: 4.50 

424618; t^9472; Hs.1802; histocompaSbllity convlex, dass II. 00 t)ela; lg^HCJLbeta:TM^Y:SS=M: 4.50 
444823; BE262989: Hs.12045; putative protein; Mra1.MB0AT;TW=M;SS=Y; 4.50 

405490; ; ; NM_03141 4:Homo safaens serine/threonine Wnase 31 (STK31), transcript variant 1. mRNA.; p)dnase,TUDOR;TM=M;: 4.50 
424494; U7857S; Hs.149255: phosphafidyOnositoM-phosphate 5«nase, type I, alpha; PIPSK;SS=M; 4.50 
441031; All 10884; Hs.764S; fitjrlnogen. B beta poiypeptlde; Mu}gen.C&alpha,arf;TMsM;SS«Mt 4.50 
443951; F13272; Hs.1 1 1334; (errifin, ngtil polypeptide; PMP2:Laaudln.none; 4.50 

410423; AW402432; Hs.63489; pratein tyrosine ptiosphatase, norneceptor type 6; 8H2,Y_phosphallase.DSPc;TM=M:; 4.50 

429556; AW139399; Hs.98988; ESTs; none;TM=M;; 4.50 

458791; BE615453; H5.346509; dedicator of cyto-kinesis 1; none;TM:'Y;; 4.49 

425209; Ati)49761; Ks.155140; casein kinase t alptia 1 potypepOde; pkinase.ABC1;TM=^4;; 4.49 

425695; Nl\tO05401; Hs.159238; protein tyrosine phosphatase, nonreceptor type 14; Yj)hosphatase,Band_41,DSPc;TM=M;; 4.49 

424943; AU077260; Hs.1S3924; dealh-assodated protein kinase 1; ai1Kpkinase,death.SPRY.SAP.Ribosomat„U4e.SRP54.dOENN.DENN.uDENN;TM44;; 4.49 

412970; AB026436; H$.177534; dual specifidty phosphatase 10; Rhod8nese,OSPc;SS='M; 4.48 

400755; AA635062; ; Homo sapiens mRNA; cDNA 0KFZp434O0515 (from done DKFZp434O0515); zf^C4,CARD.BIR;TM=M;; 4.48 
425566; AW162943; Hs.250618; UL16 IMng protein 2; kll_recepLa.PKD,MHCJ;TM=M;SS=Y; 4.48 

410151; X15723; Hs.59242; paired amino ackJ deaving enzyme (furin. membrane assodated receptor protein); Peptk]ase.S8.P;TM»Y;SS=M; 4.48 
423538: L22075; Hs.1666; guaidne nudeOQde binding protein (G protein), alpha 13; UCfUitnge.G^lpha^TM=M;: 4.48 
42471 1; NII4.0057%; HS.15217S; catdtofibi receptor-tike: 7tnL2>IRM;TMsY;SS^ 4.48 
427878; C05766; Hs,181022; CGI-07 protein; none.iM2H2; 4.48 

443991; Nl^002250; Hs.10082; potassium btermediate/smatl conductance cddunvadivaled channel, subfamily N, member 4; CaMBO.SILdiannel,ion_tran8;TM»Y;SS=M; 4.48 
422605; H16646; Hs.1 18666; hypolh^ protein PP591; PAPS.redud.HtoCF.bksynlh,': 4.47 

410583; AW770280; Hs.38258; ESTs. Moderately similar to JCS238 g^tadosylceramida^ protein. GCP Hsapiens) SH3.PQZ.Guany1atB.kln.none; 4.47 
434419; AL040608; K5.296938; dual spedfidty phosphatase 7; DSPO;TM«Mn 4.47 

410O32; BE065985; ; gb:RC3-BT0319-12020^014-a09 BT0319 Homo sapiens cDi^ mRNA sequence; abhydroIasBL2.none: 4.46 

423078; M35198; Hs.123125; intern, beta 6; lntegrin.B.EGF.pf>*inding;TM=Y;SS=M: 4.46 

400263: ; : Eos Contrd; GTP.EFTU,EFGlC.GTP JFTU_D2,5erpln;TM=M;: 4.46 

441406: Z45957; Hs.7837; phosphoproteln regulated ty rnBogeiite pathways; pkinase:TM=M:; 4.45 

434551; BE387162; Hs.280858; ESTs, Highly similar to A35661 ONA exdsion repdrcrossrxHnpiemenQng protein ERCC3 [H.sapiens); none;TM>M;; 4.45 
413227; M79082; ; ESTs; none^one; 4.45 

441321; H171B2; Hs.7771: B-ce!I assodated protein; Band_7;TM=M;; 4.45 

457194; H20669; Hs.35406; ESTs. H^hiy similar to unnamed protein product [Ksapiensl; none.pkinase.P6D; 4.45 
414745; AA16051 1; Hs.5326; amino add system N transporter 2; porcupine; none,none; 4.45 

404276; ; ; NM_002944*:Homo sapiens v-ros avian UR2 sarcoma vims oncogene homolog 1 <R0S1), mRNA.; th3,pkinase.DUF139;TM=Y;SS=M; 4.45 
426966; AI493134; ; sderostin; DAN;TM=I^SS=M; 4.45 

408873; AIJD46017; Hs.1 82278; calmodulin 2 (phosphorylase kinase, delta); none.none; 4,44 

428486; BE178285; Hs.170056; Homo sapiens mRNA; cDNA DKFZp5B6B0220 (from done DKFZp586B0220); pkinase,none; 4.44 
432798; AA565309; Hs.194015; ESTs; lnlegrin_B.Sema.PSI,T1G,none; 4.44 

439668; A1091277: H5.302634; fHzzled (Drosophlla) homolog 8; Frizzled,Fz.7tnL2,toxin.2;TM=Y;SS=M; 4.44 
417886; AA214584; ; ESTs; SPRY,7tnL3>^NF„TOceptor,none: 4.43 

452098; AI858183;;gbnwl46a12j(1 Na_CGAP.Ut1 Homo sapiens cDNA done 3* sinilar to contains Ahj repetitive element;. mRfU SK3,none; 4.43 

426874; N67325: Hs.347487; ESTs; SH3,TonB^boxC4wnB; 4.43 

422714; AB018335; H3.119387; KlAA0792gene produd; OUF221;TM=Y;SS=M; 4.42 

410741; Z11695: K5.324473; nutogen-adivated protein kinase 1: pklnase.none; 4.42 

432193; AA372264; Hs.273193; hypothettedl protein FU10706; pkin8se;TM=M;; 4.41 

409506; r»4.005153: Hs.64589; NCK adaptor protein 1; SH2,8H3;TM»M;: 4.41 

429390; AB040942; Hs.201500; KtAA1509 protein; none:TM»M;; 4.41 

421859; AA356$20; Hs.108947; KIAA0050 gene product; ank.PHAfGap;SS=:M; 4.41 

451527; AF022813: Hs.26S18; transmembrane 4 superfan^ member 7; none.none; 4.41 

421748; NM.014718: K8.107809: KIAA0726 gene product; cadherin;TM>Y.'; 4.40 

410418; BE410072: Hs.63304; protein phosphatase methytesterase-l; none:TMN^ 4.40 

460457; AA367701; Hs.6639; KIAA1624 protein; none;TM=M;SS=M; 4/40 

433029; NM-014322; H3.279926; opsin 3 (encephatopsln): 7tm_1.IA)nooxygenase;TM=Y;SS=M; 4.40 

408805: H69912; Hs.48269; vaocsiia related kinase 1; pklnase;TM»M;; 4.40 

421585; U95626: Hs.302043; chennokine (&C mo8f) receplor-flkB 2; 7tnLl;TM:'Y;SSs»« 4.40 

440014; AW960782; Hs.6855; a5h2 (absent smaO. or homeolk; DrosophOa, homo)og)-likB; SPRY.BAG,UPF0001; 4.40 

451154; AA015879; Hs.33536; ESTs; TIMP.none; 4.40 

433895; A1287912; Hs.3628; mitogervadivated protein kinase kinase kinase kinase 4; pklnase^,CNH.ERM;TM=M;; 4.40 
422034; AC008488; H8.333069; Ets2 repressor fadon Ets:TM=M;; 4.39 
444009; AI380792; Hs.135104; ESTs; TNFR^c6,TIUnooe; 4.39 
420020; BE296866; Hs.94382; adenosine ktee; pfkB;SS<^ 4.39 

416207; NM_014745; Hs.79077; Hot© sapiens, done MGC:2908, mRNA, complete cds; none;ThA=Y;SS=Mt 4.39 

417655; AA780791; Hs.14014; hypothetteal protein FU14813; pkln3se.pkinase_C;TM=M;; 4.39 

402915; ; ; ENSP0000020258r:Bk:8Tbonate transporter-related protein BTR1.; HCX>3_cotransp;TM=Y;; 4.39 

453199; AI336266: Hs.32353; mitogen-adivaled protein kinase Mnase kinase 4; pk!nase;TMsM; 4.38 

416033; NM.012201; Hs,78979; C^i apparahis protein 1; cy8jfctLF6FR;TM3Y;SS=M; 4.38 

453672; U73531; Hs.34526: 6 proteiivcoupted receptor, 7tm_1;TM=Y;SS=M; 4.38 

437852; BE001836; Hs.256897; ESTs. Weakly similaf to dJ365012.1 [H.8aplenst GPS.7tmJ5;TM=Y;; 4.38 

420039; NM_004605; Hs.94581; suifotransferase family. cytDsoIk:. 2B, member 1; SuIfotransfenSS=M; 4.38 

412834; R77123; Hs.79881; Homo sapiens cONA: FU23006 lis. done LNG0O414: 7tm.l .none; 4.38 

452203; )C57522; ; transporter 1. ATP-binding cassette; suWamily 6 (MDR/TAP); ABC_tran,A6Cjnembrane,SRP54.Thymidylale_kln;TM=Y;SS=M; 4.37 
425317; AW205118: Hs.210546; InterieuMn 21 receptor none;TM=Y;SS=M; 4.37 
432945; AU)43683; Hs.8173; hypdheOcal prolebi FU10803; nono;TM=M;SS=M; 4.37 

424028; AFO55084; Hs.153692; Homo sapiens cONA FU14354 Gs. done Y79AA1001384. highly similar lo Homo sapiens very large Gi)roteh coupled receplor-1 (VLGR1) mRNA; 
nonejwne; 4.37 

434071 ; AF1 16653; Hs.341 92; Homo sapiens PRO0823 mRNA, compIetB cds; none;TM=M;; 4.37 

412S96: AA161219; Hs.799: diphtheiia toxin receptor QnparlrvbindlnQ epidermSl growth fiador-fike growth factor); EGF:TM»Y;SS=ht 4.38 
440270; NM.01S986; Hs.7120; cytokine receptor-like mdecule 9; 1h3:SS»»« 4.38 
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432987; A1864771; HS.Z7954: C086 anOgen (C028 anSgen Dgand 2, B7-2 enfigsn); none:TM»Y;SS=ht 4.38 

436943; AA773838; Hs.$353: caspase ia apoptosls-ielated cysteine protease; IC^1(MCE^20.0ED:TMsMn 4.36 

457897: AI356125: Hs.345168; ESTs. Wefikly sbnDar to HXA^.HUMAN HOMEOBOX PROTEIN H0X-A2 (Rsaptens); homeotx)x;NA:N^• 4.36 

406671; AA123547; fe.285754; met pfottH)nc09ene (hepatocyte gowth factor receptw); S8ma,pldnase,TIG.PSI,nonB; 4.36 

413969: X14034; Ks.75648; phospho&pase C, gamma 2 (phosphalklyflnosilol^fic); SH2.SH3.C^PHPi4>L(>Y.PI-PLC'X,P0Gr.'SS4^ 4.35 

408101: AWg68504: Hs.123073; CDC2-reIate() prnteb kinase 7; none,none: 4.35 

414029: BE297731: Ks.75709: mannose-^phosphate receptor (cation dependent); Man^jecep;TM=M;SS=M: 4.35 

425069; AA687465; H5.298164; potassium vdtage^ated channel, shaker-reialed sutrfamDy. beta member 2; akk)J(etjed.none; 4.35 

438937: AW952654; Ks.244824: ESTs; EPH Jbd,p)(in8se.fn3.SAM.none: 4.35 

412584; X54870; Hs.74035; DMA segment on chromosome 12 (unique) 2469 expressed sequence; noneJectiiLC; 4.35 
436540; BE397032; Hs.14468: hypothetical protrin MGC14226; nm.7tnu1.SNF;TM=Mn 4.34 
435267: N23797; Hs.1 101 14; ESTs; none.Syia.N.Exo_endojihos; 4.34 
405616; ; : Target Exon; none,SH3.BAR: 4.34 

432141; BE410964; Hs.272736; nudear receptor binding pral^ pldnase;Tlk«:iMn 433 
417927; R73095; Hs.24122; ESTs; none.pHnase; 4.33 

429849; U33053; Hs.2499; protein kinase Onke 1; pkinas8.pkin3se.C,HR1;TMsM;; 4.33 
425743; BE396495: H5.159426; Ba2-associated X protein; Bc|.2;TM=:Y;; 4.33 
453863: XO^ Hs.572: OlOSomucoM 1; lpocaIin.a)dedh,uUquitin.iRK;8S=M; 4.33 

400847; ; ; NM_003105*:HonK) sapiens sor^IbHelated receptor, UpiR class) A repeats-oontainlno (S0RL1), mRNA.; 

EGF,fn34dUecepLa,ldljecepLb,QranuHn,BNR;TM=Y;SS=M; 4.33 
414914; U49844; Hs.77613; ataxia telangiectasia and Rad3 related; FAT.FATCi>l3^Pt4Jdnase;TM=M;: 4.33 
413858; NM.001610; Hs.7S589; ackJ phosphatase 2. lysosomal: acidj}hosphat;TM»Y;SSsM; 4.33 

442539; AL119S06: HS.5822D; Homo sapiens cONA: FU23005 lis. clone LNGQ0396. highly siniilar to AF055023 Homo sa^ mRNA sequence; 

RasGAPiadenylatehinase; 4.33 
419607; R52557; Hs.91S79; Homo sapiens clone 23783 mRNA sequence; IMP4;TM4^ 4.32 
436703; AW880614; Hs.146381; RNA binding motif protein, X chromosome: rnn,SH3kPKCH,RhoGEF; 4.32 
414899; AW975433; Hs.36268; ESTs; pkinase.SH2.SH3.none; 4.32 

444895; AI674383; Hs.22891 ; solute carrier family 7 (cationic amino add transporter, y system), member 8; AS(;death,TNFR.c6; 4.31 
415135; AW673559; Hs.78040: KOEL (Lys-AspXalu-leu) endoplasmic reticulum protein retentton receptor 1; ErUumenj^eoept,none: 4.31 
444070; NM.015367; Hs.10267: MIL1 protein; Bcl-2;TM=Yr. 4.31 

42261 1; AA158177; Hs.118722. fucosyltransferase 8 (alpha (1.6) fucosyitmnsferase); SH3.K-box;TM=M;SS=Y; 4.31 
437162; AW005505; Hs.5464; thyroid honnone recepbor coacb'vafing prkein; bromodomain:TM=M;; 4.30 
440983; M20681; H5.7S94: solute carrier family 2 (faclHtaled glucose transporter), meniber 3; 8UgarJr;TM=Y;SSsM; 4.30 
414080; AA135257; Hs.47783; B aggressWe lymphoma gene: A1pp;TM=M;; 4.30 

415072; BE253887; Hs.77676; Homo sapiens, ck)ne IMAGE:3461982, mRNA. partial cds; MetaIIophos.AnnadiIk)_8eg;TM^; 4.30 
442994; At026716; Hs.16954; ESTs; ank,pkbi3se.death,Ribosomal S14; 4.30 

432328; A1572739; Hs.195471; 6-phosphofructO'2-kIna5e/liruclose-2.6-biphosphatase 3; PGAM,6PF2K;TM=M;; 4.29 
439490; AW249197; Hs.100043; ESTs. Weakly similar to A46302 PTB-assodated spitoing factor, kmg fonn (H-saplens]; none;TM=M;: 4.29 
422005; BE266556; Hs.1 10702; Homo sapiens mRNA; cDNA OKFZp761E212 (from done DKFZp761E212); none.Na..H.Exchanger. 4.29 
415214; AI445236; Hs.125124; EphB2: fn3.pkIn8Se,SAM.EPHJbd:TM=>Y;SS«M; 4.29 

430316; NM.000875; Hs.239176; insulin-tike growth factor 1 receptor. fh3,FurMce,pkinas6.Recep_Ldomain:TM=M;S8»M; 4.29 
429099; 6E439952; Hs.19&177; phosphorytase kinase, gamma 2 (testis); pkinasa,Bac.ONAJ)Inding;TM=M,'; 4.29 
425843: BE313280; Hs.159627; death assodated protein 3; niyb_ONA-bInd]ng.PAK6AH,bromodom8ln.PH0.8Er;TMNM;: 4i8 
437603; AW979259; Hs.293673; ESTs; death,nond: 4.28 

439975; AW328081; Hs.6817; Inosine triphosphatase (nudeosMe triphosphate pyrophosphatase): Ham1pJke:TliNMn 4.28 
424512; X53002: Hs.149846; Integrin, beta 5: integrin_B,EGF;TM:>Y;SS=M; 4.28 
442980; AA857025; Hs.8878; kinesin-like 1; k!nBsin,Ujteo_ORF3,DUF164;TM=M;: 4.28 

420166; AW732276; H8.g5583; transmembrane 4 superfamily member (tetraspan NET-7); transmembr8ne4;TM3Y;8S^ 4.27 
409582; R27430; Ks.271565; ESTs; none.Neur_chan.LBO,Neurj:hanjnemb; 4.27 
439096; AA83018S; ; ESTs; ras.none; 427 

414561; AI064813: Hs.195155: Homo sapiens amino add transport system N2 {SN2) mRNA. complete cds: A«.lrans;TM»Y; 427 

411835; U29343; Hs.72550; hyaturonannnediated motility receptor (RHAMM); bZIP;S$=M; 4.27 

428761; AF164799: Hs.ig3384; putatathre 28 kOa protein; pkinase.DAGJ^-bind.pkinase^C.OPR;SS^: 427 

430603; AA148164: Hs.247280; HBV assodated fador. zf.C3HC4.zM^BP.pkkiase: 427 

415149; X12451; H8.76056; calhepsin L; Pepad8se.C1;S&4i« 426 

44483B; AV651680; HS.20855B; ESTs; 1ntegr1aAF&GAP,none; 4i8 

402328; ; ; Target Exon; pldnase:TM»M;; 4.26 

416094; AW995512; Hs.225977; nudear receptor coadivalor 3^ none.none; 4.26 
420942; H03514; Hs.15589: ESTs; none.pkina5e: 426 
453902: BE502341; Hs.3402; ESTs; none.none; 4.26 

425505; AL036458: ; gb:OKFZp664D2082j1 564 (synonym: hfbr2) Homo sapiens cONAdone DKFZp5B4O2062 5*. mRNA sequence; arf.&alphainone: 426 

427344; NM_000869; Hs.2142: S^ydroxytryplamlne (serotonin) receptor 3A: Neur.d)an.LBD.Neurjchanjnemb;TM»Y;SS4« 426 

432269; NM_002447; Hs.2942; macrophage stimulating 1 receptor (c-met^lated tyrosine kinase); pkinaseLSema.PSl,TIGA4.fXTRA;TM=M;SS4A; 426 

417007; AF224741; Hs.80768; dJortde diannel 7; C8S,voItage.aC;TM=Y;; 4.26 

447960: AWg54377; Hs.26412: ring finger protein 26; zK:3HC4;TM»Y;8S=iyi; 426 

4*2300; AI765908; Hs.129166: ESTs; non8;SS=M; 425 

421656; NM_016447; Hs.108931: MAGUK protein p55T; Protein Assodated wISi Uns 2; SH3.PDZ.(3uanyt3teJdn.t^7;TM»M:; 425 
452110; T47667; Hs.28005: Homo sapiens cDNA FU11309 Us. done PUCE1010076; pklnase.ActrvinjBcp.none; 4.25 

422451; AA31 0753; Hs.42491 ; ESTs. Weakly similar to S65657 dpha-1 C-adienargte receptor spBce form 2 (lisapiens]; PDZ.SH2.STAT.STAT.bind.STATj)(Dt.none; 4.25 
453955; AW579207: Hs.304666; ESTs, Weakly similar to 178885 sertneAhreontne^pedflc protein Mnasa IH-sapiens]; fh3.lo.MAMAme: 425 
457670; AF119666; Hs.23449: insulin receptor tyrosine kinase substrata; SH3;TI\^M;: 4.25 

419133; U46116; Hs.89627; protein tyrosine phosphatase, receptor type, G; fn3.Yj)hosphatase^jBnhydf3Se.D$PC;TM3Y;SSs=M; 4.2S 
41 9660; BE280337; Hs. 194693; sdute carrier fanily 7 (cattonic amino add transporter, y system), menlbflr 7; 8q.peRn8ases;TM"Y;SSB|i1: 425 
415198; AW009480; H5.943: natural kBIer ceQ Iranscr^t 4; none:TM=M:; 4.24 

416440; AI823912; Hs.79335; Homo sapiens, Similar to SWl/SNF related, matrix assodated. acfln dependent regulator of chromafln. subfamily d, member 1. done M6C:15280. 

mRNA. complete cds; SWIB;TMsfUt;: 4.24 
419169; AWSSISSO; Ks.282346; ESTs. Weakly simQar to S72482 hypotheScal protein [Ksaf^s]; non8.spedrln«SH3,PH,(»; 424 
449444: AW818436; Hs.23590: solute carrier family 16 (monocarboxyilc add transporters), member 4; none;TM>Y;SSeM; 4.24 
433848; AF09571 9; Hs.93764; caibaxypeptidase A4: Za.cartOpept.Propep.M14;SS»ll« 4.24 
442213; N361 10; H3.305971: solute earner fanvly 2 (fadHtated glucose transporter), meniber 10; sugar.tnTiyNY;SS»M: 4.24 
412681; AW983655: Hs.1720O4; tiHn; fn3Jg.SGXXSG,pklnas8:TM=>M.-: 4.24 

424653; A W977S3 4; Hs.151469; cStoiumfcalmoduiin^tependent serine protein kinase (MAGUK famDy); nonQ.none: 424 
421068; AU076725; Hs.101408; branched chain amfaiobansferasa 2. mitochondrial; amlnotrBnjt;; 423 
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42B338; AF14776S: Ks.232093: ESTs; th2.aMR:TINyi;SS>^ 4.23 

443329: BE262943; H3.9234: hypolhotical protein MGC1936: iu>n8:TM»Y;SS=M: 4.23 

432314; AA533447: Hs.312989: ESTs; XnnMone; 4.23 

434606; AA805443; H5.179909; hypothetical protein FU22995; none;T^«=M;: 4.23 

454165; AW933355; Hs.285614: sprouly (Drosophila) homotog 4; SH2.SH3rrM=IW;SS=M; 4.23 

442536; A1457102: Hs.347970; Human glinose transporter pseudogene; nons,none; 4.23 

442549; AI751601; Hs,8375; TNF rBcaptor-assodalBd factor 4; MATH.zJ-TRAF^«3HC4:SS=M: 4.22 

424154; AF026004; Hs.141660; chtortde channel 2; vollage_aC.CBS.EPOJPO^jep: 4.22 

433419; AJ830342; Hs.211272; ESTs; transmembran94.none; 4.22 

421921; H83363; Hs.6820; transIocasB of Inner mitochondrial membrane 10 (yeast) homotog; zf.TimlO_DDP,ethand,CH^pectrin.8erpin;TM=M;;4.22 

445633; A)453386; Hs.17287: ESTs, Weakly similar to S26689 hypotheScd protein hc1 - mouse (MJnuscutusk IRICnone; 4.22 

424812; AF059252; Hs.153299; [X)M^ (C. elegans) homolog Z; none;TM=M;; 4.22 

410568,* BE379794: Ks.65403; hypothetical protein; death,TNFR.c6;TM=Y;SS=M; 4.22 

416636; N32536; Hs.42645; solute cenier family 1 6 (monocaiboxylic ackJ tiansporleis). meniber 6; none we; 4.22 

418969: W33191; Hs.2a907; hypothetical protein FU20256; SH3;TM=M:;4.21 

447200; BE543146: Ks.281434; Homo sapiens cONA FU14020fi8, clone HEMBA1Q03638; nona^none; 4.21 

400208: : : Eos Conlrol; FCH,RhoGAP,SH3:TM=M;: 4.21 

405369; ; ; NIA.M5569'*lomo sapiens m domain Unasa 2 (UMK2). transcript variant 2a. mRNA.; pklna5e,UM.P0Z;SS=M: 4.21 
445350; AF052112; H5.12540; lysophosphoSpase 1; Bbhydrolase_2;TM^M;: 4.21 

441208; A1339704: Hs.150401; ESTs. Wealdy similar b ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [KsaplensJ; 

lon^trans,RYORJTPR,MIR,nofte; 4.21 
427217; AA399272; Hs.144341; ESTs; ANP,GHMPJinases,none; 4.21 

400845;:; NhL003105*:Homos^xenssortil]iHelaled le^^ repea]&«onta!ifttg(SORL1XmRNA.; 

EGFMI<flJQC8pLa,ldLrec8pU),grBnuIln.BNR;TNI»Y;SS=M; 4.21 
422667; H25642; ; ESTs; FM0.1Ike.FM0^ike; 4.21 

450056: BE047394; H5.8208; ESTs. Weakly similar to S7151 2 hypothetical protein T2 < mouse [M.musculus]; 

ABC lran>^BCjnembrane.lg,MHCJLbeta,SRP54.pfoteasome^Cjnembrane.ABC.tran; 4.20 
448950; AF2B8687; Hs.9275: CGHS2 protein; E1-E2J\TP3se.Hydrolase;TM=Yn 4.20 
408634; AW407254; Hs.182278; calmodulin 2 (phosphorytase kinase, delta); none,none; 4.20 
422335; AA375957: Hs.6682; solute carrier family 7, (cationic amino acid transporter, y system) nmbet 1 1; none,none; 4.20 
426754; NM_014264; Hs.172052: serine/lhreonlne kinase 18; pkinase;TM=M;: 4.20 
435810; BE343853; Hs.2785; keralin 17; zf-Tlm10_DDP.SH2,SH3,pWnase,PH,BTK.Ribosomal„L44; 4.20 
446143; BE245342; Hs.306079; sec61 homolog; NUDlX,secY,E1_dehydrog,transketj)ynTM=Y;SS=M; 4.20 
426626; Al 1 24572; Hs.323879; Inhiljitor of kappa light polypeptkJe gene enhancer In ftcells, kinase gamma; zf.C2H2;TM=M;; 4.20 
403716; ; ; Target Exon; Ad8p.comp^ub.P0Z.0EP.DIX,Dishevelled.hexapep,W2,ABC»lr8n; 4.19 
415749; BE262529; Hs.78771; phosphoglycerate kinase 1; P6K.none: 4.19 
434599; AB002313; Hs,3989; pJeidn 82; PSI,Sema.TIG;NA;NA; 4.19 
412600; U8824; Hs.74101; spleen tyrosine khase; SH2.pkinase;; 4.19 
416738: N29218; Hs.4029O; ESTs: ABC_tranABC_membrane.nonB: 4.19 
410839; BE269047; Hs.65234; hypothetical protein FU20S96; DEAD,helicase_C,PRK^P3;TM=M;; 4.19 
431385: 8E178536; Hs.11090; membrane^panning 4-domaIns, subfamily A. member 7; none.none; 4.19 

407305; AA715284; ; gb:nv35f03.n NQ CGAP Br5 Homo sapiens cONA ck)ne similar to contains Alu repeiithre element. mRNA sequence; pktn3se.lntegrin.B,Sema,PSI.TIG.none; 
4.18 

452880; AA029332; KS.B7549; ESTs; none.lntegriiv.B; 4.18 

428245; AF151048; KS.1831B0; anaphase pnmolingoompiexsubunK 11 (yeast APC11 homotog); none:SS«ft« 4.18 
421964; X73079; H5.288579; polymeric immunogtobOlin receptor; lg,Cobalamin.bInd;TM=M:SS=M; 4,18 
409213; U61412; Hs.51133; PTK6 protein tyrosine kinase 6; 8H2.SH3,pkinase;TM=M;: 4.18 

421790: AW896201; Hs.22654; sodium channel, voltag^flaled. type I. alpha polypeptWe; lon_trans,lQ.PEP-utillzer8.Cni*»Y;: 4.18 
429668; AA626142; Hs.179991; ESTs. Weakly similar to S28942 protein kinase C [Ksaplensl; none,none; 4.18 
443068; AI188710; ; ESTs; Endonuclease,pMhaseA:Gviruecp.none; 4.18 
418827; 6E32731 1; Hs.4716G; HT021; none:TM=M;: 4.18 

447887; AA114050; Hs.19949; caspase 8. apoptosls-related cystehe protease: ICEj)iaiCEj)20.DED:TW=Mr,4,18 

429109: AL008637; Hs.196352: neulrophil cytosoOc factor 4 (40kD); SH3.0PR.PX;TM=M;; 4.18 

422083; Nh^OOl 141; Hs.1 1 1256; arachidonate 15-lipoxygenase, second type; none,none; 4.18 

438974; AF089816; Hs.6454; chromosome 19 open reading frame 3; PDZ;S$=M; 4.18 

413407; AI356293; Hs.75339; inositol polyphosphate phosphatase-like 1; SH2,SAM,Exo.endoj)hos;SS»M; 4.18 

424954; NM.000546; Hs.l846; tumor protdn p53 {U-Fraumenl syndrome); P53,WO40JRK:TM=Mn 4.17 

421836; AF109219: H5.108787; phosphatWyBnosllol glycan, class N; w)ne,none; 4.17 

431544; AK000770; Hs.299329; Homo sapiens cONA FU20763 fis, ctone Ca0991 1; none,none; 4.17 

413781; J05272; Hs.850; IMP (inosine monophosphate) dehydrogenase 1; CBS,IMPOH_aiMPDH»N^TM=Mr. 4.17 

452012; AA307703: Hs.279765; Wnesin family member 4A; kinesIn,DNA^topolsolV.KWM=M:: 4.17 

425606; U52112; Hs.158331; renin-binding protein; none;; 4.16 

416817; AA398045: H5.104679: ESTs: Furln4ke.pkinase,RecepXdomain,fn3,none: 4.18 

402447; ; ; C1000201:gil204416IgblAAA02627.1| (L05195) fructose transporter (Ratlus norvegteus] gi|44; none;TM=Y;SS=M; 4.16 
452875; BE276760; Hs.30928; DNA segment on chromosome 19 (unique) 1177 expressed sequence; EulLporin;TM=M:SS»M; 4.15 
426395; BE151985; Hs.5722: hypolhstica! proteb FU23316: pkinase.none; 4.15 

404140; ; ; Nftt.006510:HoiTO sapiens ret finger protein (RFP). transcript vaiianl alpha. mRNA.; zK3HC4,SPRY jrf^.box;SS=M; 4.1 5 

432268; BE311856; Hs.274230; 3'-phosphoadenoslne 5'i)hosphosulfale synftase 2; APSJ(lnaseATP-sulfuryl8se;TM=M;: 4.15 

405516; ; ; ENSP0000020045r:ThyroId receptor Hiteracfing protein 6 fTRiP8) (OPyUnteractingproleln 1) (Zyxin rotated protein 1) (ZRP-I).; URtTM=M;: 4.15 

448390; AL035414; Hs.21068; hypothetical protein; FGGY.C:TM=M;; 4,15 

435732; AF229178; Hs.123136; leudna rich repeat and death domain containing protein; none.none: 4.15 

414108; A1267592; Hs.75761; SFRS protein kinase 1; ank.PKOxysterol.BP,pkinase;TM»M;; 4.15 

41 1558: AA102670; Hs.70725; gamm»aminobutyric ackf (GABA) A receptor, pi; Neur.^harULBD.Neurjd)arunemb;TM»Y;SS'M: 4,14 
424339; BE257148: Hs.145416; endoglycan; none:TM=Y;SS=M: 4.14 

427274; NVL00521 1; Hs.174142; colony stimulaUng factor 1 receptor. Ibnnerty McDonough fieSne sarcoma vbal (v-fcns) oncogene homolog; 1g4*inase;TMaY;SS»M; 4.14 
440524; R71264; Hs.16798; ESTs; SH3,lg.pklnase.PH^pectiln,RhoGEF,none; 4.14 
436115; AW51203% H8.102004; ESTs; pklnasajione: 4.14 

447050; NH.016314: Hs.1720p: STAIMa protein containing 8H3 and ITAM domains 2; SH3.VHS.UIM:SS=M; 4.14 

418529; AW005695; Hs.2508Sl7; TRK-4used gene; Band_41,ER^^pkinase,LRRLRRCT,MAM.Nocteoplasm^n,TlOporayos(n,OPRJIIamentJ2lP,G^^^ 4.14 
420727; H75701; Hs.99&66; complement component 4-binding protein, beta: sushtSS^M; 4.14 
433075; NM.002959: ; sortOin 1; ExD.endoj>hos>VtrophIn.1.BNR.KeIcti;TM^M;; 4.14 
422783; AAS989S6; Hs.120439; ethandamlne kinase; CtiolinaJdnase:TM»Yn 4.14 
410726; At62385g; Hs.15936; ESTs; pkInase.proJsomerBse.none; 4.14 
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417903; NM.002342: Hs.1 116; lymptiotoxin beta receptor (TNFR supofamOy. member 3): TNFRjc8;T»M;S&=M: 4.14 

428307; W27393: Hs.t83$48; protein tyrosine phosphatase, receptor type, f polypeptide (PTPRF). interacting protein (Uprfn). dplia 1; SAM.SH3^S1 jep; 4.14 
442434; AA995787; 85.129583; ESTs; IRlOione; 4.13 

438361; AA805656; Hs.148217: Homo sai^ cONA: aJ23077 fis. done t>IG05840: pldn8Se4}klnase.C;none! 4.13 

445580; AF167572; Ks.12912; skbl {& pombe) tiomolog; none;S&4it 4.13 

421425; AK001564; Hs.104222; hypothetical protein FU10702: 6{hand,kazaLaif,ras.7tnLl;TM»M.'; 4.13 

400252; ; ; NIWL00465r:Homo sapiens uWquitin specific protease 11 (USP11). mRNA. sidstrate 1 (PTPNS1). mRNA^ UCH.1,UCH-2:TM=Mr, 4.13 

446641; AU)49229; HS.157B7; Homo sai^ens mRNA; cDNA DKFZp554O1016 (from done OKFZp564O1016): non8,pWnasB,PBD; 4.13 

400209; ; ; NM_001665:Hoino sapiens Rho GTP&se adhrating prot^ 4 {ARHGAP4), mRNA. VERSION NM.006083.2 Q; FCH,RhoGAP,SH3;TM=M;; 4.13 

429012; AW6295gS; Hs.194726; Ba2-assodated athanogene 4; Sm,8AG;SS=M; 4.13 

41 1826; AW947946; Hs.26706; CGH21 prol^ none.DSPt; 4.13 

423189; M59371; Hs.171596; EphA2: fn3.pWnase.SAM.EPHJbd;TM=Y:SS=WI; 4.12 

413934; U0305G; HS.7S619: hyaiuronogluGOsamihidase 1; integrinJB,G}ycoJiydro_56;SS::M; 4.12 

414874; D26351; Hs.77515; Inositol 1 A5-trfphosphate receptor, type 3; lon_trans.MIR.RYDK.ITPR;THA=Ys 4.12 

432047; NM_016247; Hs.272380; Inteiphotoreceptor matrix proteogiycan 200; EGF,SEA;TM=Y;SS=M; 4.12 

451820; AW058357; Hs.199248; ESTs; 71m_1;TM=Y;SS=M; 4.12 

4455T5; BE388665; Hs.179999; Homo sapiens, done iMAGE:34S7003, mRNA; zK:2K2.BTB.K.tetra.WD40,Syniaxin; 4.12 

42453% 102911: H5.150402: Adhfin A receptor, type I (ACVR1) (AUC2); pidnaseArtivinjecp JM=iyl;SSeM: 4.12 

405110; ; : C7000ig9:gqi2643960tsp|Q9Y6T7IKDGB_HUMAN OIACYLGLYCEROL IQNASE. BETApaYCERlDE KINASE none^none; 4.12 

441026; AW1790S8; Hs.99658; ribosomal protein L7a; pl(in8se,lRaLRRCT,RibosomdJ.7Ae^; 4.11 

443142; AI696513: Hs.108705; protein phosphatase 2 (formerty 2A). regulatory subunH A (PR 65), belalsofbrm: HEAT,VilBllogenln.N.HEAT^PBS;SS=M; 4.11 
450905: NM^004572; Hs.2S0S1; pialtophilin 2; AimadlIlojseg;TM=M;; 4.11 

459601; AL044470; Hs.270604; ESTs. Weddy simter to ALU7 JIUiyiAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION WARNING ENTRY IKsaplens]; 
none.SH3.PGAM.UBA: 4.10 

417300; A1765227; Hs.55610; solute canter family 30 (zinc transporter), member 1; Cat*on.efflux:TM=Y;SS=M; 4.10 

427315; AA179949; Hs,175563; Homo sapiens mRNA; cONA DKFZp564N0763 (from done DKFZp584N0763); ncwie.spedrln.SH3,PH,CH; 4.10 

416239; AL038450; Hs.48948; ESTs; E1-E2J^TPase.CatloiLATPase^CCa«on_ATPeseJi.Hydrolase.none; 4.10 

429311; AF080157; Hs.198998; conserved henx-loop4ieIix ubiquitous kinase; pkinase.none; 4.10 

412146; M92444; Hs.73722; APEX nuclease (mulGfiinctional DNA rep^r enzyme); Exo„endo_phos.Atroph{n-1.BNRJCelch:TM3Mr. 4.10 

418420; AW604405; Hs,324874; hypothetical protein MGC3079; Phosphodiesl;TM=Y;; 4.10 

434396; AA632270; Hs.162851; Homo sapiens cDNA FU 14317 (is, clone PLACE3000401: pWnaBe,nonB; 4.10 

454438; AA224053; Hs.172405; cefl division cycle 27; SPRY.7tnu3ANFjeceptor. 4.10 

439576; AW263124; Hs.315111; nuclear receptor co-repressorm[)AC3 complex subunit WD40:TM=M:: 4.10 

451995; Ai827431; Hs.224645; ESTs. WeaMy simBar tolF16_HUMAN GAMMA-INTERFERON-INDUOBLE PROTEIN IFH6 (H.sapi8n8); none,PAAD.DAPIN,HIN; 4.10 

420340; NM_000734: Hs.97087; C03Z antigen, zeta polypeptide (TIT3 complex); rrAM;TM«M;SS»Mj 4.10 

442942; AW167087; Hs,131562; ESTs; pWnase.none; 4,09 

428187; AI687303; Hs.285529; 6 protein-coupled receptor 49; 7tm_1,none; 4.09 

418838; AW365224; Hs.35198; edonudeotide pyrophosphatase/phosphodiesterase 5 (putaSve funcSon); Ph06phodiest;TMsY;SS"M: 4.09 
416445; AU)43004; H5.79337; K]AA0135 protein; pldnase,PAS;TM=M;; 4.08 

427001; NM.006482; Hs.173135: dual-specificity tyroslne-(Y)-phosphorylation regulated kinase 2; pkinase;TM»M:: 4.08 

403608; ; ; C3001 199:giI7494834b)cr||T 15308 hypotheticaJ protein B0285.2 - Caenorhabdifis eleganslHI; 7tnu1,7tm.-2,GPS,WIF;TMsY;SS»M: 4.08 

427177; AB006537: Hs.173880; Inteiteukin 1 receptor accessory protdn; lg.TIR;TM=Y;SS=M; 4.08 

401241; AB02B989; ; mitogen-activated protein kinase 8 Interacting protein 3; CysJcnot.TGF-t)ela.vwa,wrc,vwd,Tll,DUF139;SS=M; 4.07 

444805; AB007899; Hs.12017; homolog of yeast uWquiOn-protein ligase Rsp5; potential epithelial sodium channd regulator, WW,HECT,RNAj)oLA,none; 4.07 

448686; AW196663; HiZmAZ caspase leeniitnent dom^ protein 6; CARi>.TM=M;; 4.06 

426006; R49Q31; Hs.22IQ7; ESTs; pklnase,TB(!; 4.06 

434521; NM-.002267; Hs.3886; karyopherin alpha 3 (Importln alpha 4); Annadjito„seg.lBB;TM=M;; 4.06 
408761; AA057264; Hs.238936; ESTs. Weakly similar to (deHine not avaH^le 7496841) laelegansj; 7tnLl,none; 4.05 
425289; AW139342; Hs.155530; interferon, gamma^ndudtde protein 16; PAAO.DAPlHHiN:SS=M; 4.05 
413109; AW389845; HS.1106S5: ESTs: PH04,none; 4.05 

426457; AW894667; Hs.169985; diimerin (dilmaerin) 1; DA6J^-bInd,RhoGAP.SH2:TM=M:: 4.05 
435730; AB020635; Hs.4984; KIAA082B protdn; AdoHcyase,Trt^ArN.2-Hadd_OH_C;TM=M;: 4.04 

429747; M87507; Hs.24g0; caspase 1 . apoptosh^ated cysteine protease (hterleukin 1. tnta^ oonvertase); CARD.ICEj>10.iCejs20;SS^ 4.04 

444378; R41339; Hs.12569; ESTs; i9.pkinase,LRR,LRRNT,LRRCT.none; 4.04 

449843; RBS337; Hs.2403O; sdute carrier family 31 (copper transportere), member 2; none;TMsY;S8sM; 4.04 

427359; AW020782; Hs.79881; Horro sapiens dSNA: FU23006 lis. done LNG00414: 7lRLl.none: 4.04 

413095; AA49435% H8.3071fi; pdasshim vdtag»gated channd, IsMated family, member 3; none,STARr; 4.04 

418540; AIB21597: H8.90877: ESTs. Weakly similar to ALUUHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [Ksapiens); ank.CAP_GLY,7tnu1; 
4.03 

442007; AA301116; H5.142838; nudeolar phospho^KOtein Nopp34; mn.IRK;SS=M:4.02 

448659; AF191838; H3.21712; TANK^ndhig kinase 1; pkinase;TM=M;; 4.02 

412935: BE267045: Hs.76084; tubuDn-speclfic chaperone c; nonen 4.02 

414844; AA296a74: Ks.77494; deoxyguanosine kinase; dNK;; 4.02 

445817; NM,003642; Hs.13340; hlstone acetyltransferase 1; none;TM=Mn 4.02 

426728; NM-007118; Hs.171957; triple funcfionaldom^ (PTPRF Intfiracting); SH3,lg.pkInase,PH.spedrin,RhoGEF:TM=M:; 4.02 
420576; A1434780; Hs.4248; vav 2 oncogene; RhoGEF,PH,CH,SH2.SH3,OAG_PE-bind.none; 4.02 

405102; ; ; C1S001220':gi)4469558l9b)AAD21311.11 (AF126008) breast cancer nudearieceptor-blnding auxl: DAGJ>E-Und.PH.RhoGEF.DC1;SS4/l; 4.02 
439964; Ai732902; Hs.124652; Homo sapiens cDNA FUl 2376 ts, done MAMMA1002494; pkinase,nofte; 4.01 
429680; AIJ035754; H8.2474: toIHito receptor 1; LRR,LRRCT,TIR;TM=M;SS=M; 4.01 

453891; AB037751; Hs.38353: Homo sapiens mRNA full length insert cONA done EUROIMAGE 1035904; none.none; 4.01 

426535; AU077012; Hs.288582; ESTs. Weakly similar to ubk^uitous TPR moBI, Y Isofonn (H.88ptens]; Kunitz_BPn J<unrtz^BPn.7tnu2.HRM; 3.99 

424232: AB015982; Hs.143460; protein kinase C, nu; pkInase,DAGJ»E4*uJ.PH;TM=M;; 3.99 

408308; AL033377; Hs.44197; hypothetfeai protdn DKFZp564DD462; none.none; 3.98 

449517; AW500106; Hs.23643: serine/threonine protdn kinase MASK; pklnase;TM=M;; 3.98 

404185; ; ; Target Excn; sugar„tr;TM=Y;SSsM: 3.98 

441226- BES63042: Hs.118820; Homo sapiens. Similar to RIKEN cONA 061001 2G03 gene, done MGC:14132. mRN^ complete cds; none;TM»M:: 3.98 

429638; AI916662: Hs.211577; Wrwclin l(Wnesin receptor): bZIP.Tropomyosln.spM^ 3.97 

417386; AL037228: Hs J2D43; D123 oene produd: NUDDCsecY.E 1.ddiydrog,transketjjyr,TM=Y;SS=M; 3.97 

452721; AJ269529: Hs.301871; sdute carrier family 37 {olycerol-3-phosphate transporter), member 1; MORN.sugar_tr:TM=Y:SS=M: 3.96 

417183; R52089; Hs.172717; ESTs; pkinase,LRRCT,lg.lRR.LRRNT,none; 3.95 

439176; AI446444: Hs.190394: ESTs, Weddy similar to B28096 Bne-1 protdn 0RF2 [Ksapiensl; none;TM5M;; 3.94 

424490: AJ278016; HS.S5565: aikyrin repeat domdn 3: ank4*lnase;TM4it 3.94 

422610; AF153820E Hs.1547: potasstom Inwardly-recSiytno channd, subfanfly J. member Z IRtCTM^Y:: 3.94 
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450746; 082673; H5.Z785B9; generd transci^ 1^ 0, i; 

418516; NM_006218: HS.B5701; phosphob»si8de-a*lnase. catalyBc. dpha pdypapQds; PI3_PM.Idnase,PI3Ka.PI3K.C2,PI310W.n3Kj«5B.ru^ a94 

414217; AI309298; KS.27989S; Homo sapisfls cDNA: RJ23165 fis. done LNG09846: none;NA;NA; 3.93 

416537: T99(»6: Hs.144904: midear receptor OHepressof 1; myb_DNA-blnding,RNAj)oUU»n0; 3.93 

450747; AI064821; Hs.318S35; ESTs. Highly siinlar (o 1818357A EWS gena {H.sapi8ns); nin.zf-RanBP.GAS2; 3.93 

444825; AW1 67613; ; mitogen-activated protein kinase kinase idnase 8; pMnase:TMBM;: 3.93 

408354; A1382883; Hs.159235: ESTs; none.none; 193 

453945; NiVl 005171; Ks.369G8; acflvafing transcription factor 1; mn;tf-Ran6P.pkin3se.GST.C.Ets,SAMJ^.ABC;Lrnefflb^^ 

C2H2.PHD.BTB.TFilS;VT.hool<,SAM;TM=M;: 3.93 
428532; AF157326; Hs.184786; TSP-interactIng protein; Arniadillo_S8g,VHS,HEAT;TM=M;; 3.92 

413967; AW204431; Hs.1 17853; ESTs, Wealdy sirriiar to 138022 liypoUielIca) protein [H^sapians]; Annadi]k>_$ag.lBB.PHO,DOTjU)ne: 3.91 
415906; AI751357; Hs.288741; Honu sapiens cDNA: aJ222S6 tis. clone HRC02860; Eplufn^tone; 3.91 
450139; AKOOl 838; Hs.295323; serum/glucooorfiooU regulated kinase; none.none; 3.91 
440255; AI932285; Hs.160569; ESTs; none,pkinase; 3.90 

4210n; AK000061; H5.101590; hypolheecaJ protein; anKpWnase.deaIh,SPRY.SAP.Rn)OSomd.L24e,SRP54.dDENN.IDENN,uOENN;TM=M:; 3.90 

433211; H11B50; Hs.12808; MARK; pkin8se,UBA,KA1:SS={yl; 3.90 

433233; AB040927: Hs.3018O4; KIAA1494 protein; SH3^-C3HC4;TM=M;; 3.90 

419809; U4641Sc KsJ70379: gt):HSU4641S Human pancreatic cancer ceil line Patu 8988t Homo sapiens cDNA done xs476, mRNA sequence; PWWP,non8; 3.90 
433198; AA992841: HS.Z7263; KIAA1458 proldn; none.none; 189 

407721; Y12735; Ks.38016; dual-spedikaty tyrDstn8^)-phosphorytation regulated kinase 3: pkinas6;TM=M;: 3.89 
427657; AV652249; Hs.180107; polymerase (DMA directed), beta; none;TM=M;: 3.89 
453035; AW581943; K5.334; Rho guanine nudeotkle exctiange factor (GEF) 5; nonB,none; 3.89 

446329; NM.013272; HS.14B0S; solute carrier family 21 (organic arion transportar), member 11; kazd.0ATP_N.0ATP.C;TM9Y:SS«M; 3.89 

429922: Z97630: Hs.226117: HI histone famQy, member 0; llnker.hi&tDnetTM»M;; 3.68 

432074; AA52524B; Hs.149723; ESTs; Y_phosphatase,none; 3,88 

435143; R12375; Hs.194600; ESTs; SH3ig,pkinase.PH^pectr!n,RhoGEF.none; 3.87 

423198; M81933; H3.1634; cell division cyde 25A; Rhodanese.none; 3.87 

428474; AB0231B2; Hs.1 84523; KIAA0985 protein; pkinffie;T|yNM;; 3.87 

419073; AW372170; Hs.183918: Homo sapiens cDNA FU12797 lis, done NT2RP2002066. highly similar to Rattus noivegicus tr an s membrane receptor Unc5H2 mRNA; 
dealh^U5:SS=M;3.86 

415457; AW081710: Hs.7369; ESTs. Weakly sin^ to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY (lisapiensl; 

MORN.sugar_tr;TM=Y;SS=M; 3.86 
447061; D86964; Hs.17211; dedicator of cyto-kinesis 2; SH3;TM=M;; 3.86 
426490; NM.001621; Hs.170087; aiyl hydrocarbon receptOG PAC.PAS;TM=M.'; 3.86 

451961; NM.003800; Hs.27345; RNA guanyiyltransferase and ff-phosphdiase; mRNAj»p.enzyme.DSPic.[)NA_Sgase.mRNAj»p_C;TM=M;; 186 
417674; BE616160; Hs.82829; protein tyrosine phosphatase, non^ptor type 2; Yj)ho8phata5a;TM=Y;; 188 
446874; AW968304; Hs.56156; ESTs; none,RGS; 185 

418630; AI351311; Hs.251946; poly(AH){nding proteb. cytoplasmic 1-Ske; pkinase.none; 185 

416140; AI918035; Hs.301 198; roundabout (axon guidance receptor. DrosophUa) homotog 1; none.none; 185 

425474; 248054; Hs.158084; peroxisome receptor 1; TPR;TM=M;; 185 

413073; AL0381 65; Hs.751 67; translocase of outer mitochondrial membrane 20 (yeast) homdog; MAS20,zM0,VPS9;TMsM:SS»M; 185 
411770; NIyL014278; Hs.71992; head shock protdn (hspllO family); HSP70;TM=M;; 184 
428782; X12B30; Hs.193400; Interieukin 6 receptor. fh3.^;TM=Y;SS«M; 3.84 
450684; AAa72605; Hs.25333; kilerteuWn 1 receptor, type II; ig;TM=Y:SSsM; 184 

433376; AI249361; Hs.74122; caspase 4. apoplosls^aled cysteine protease; CARD,ICEj)10,ICej)20;SS=M; 183 
440332; AI218S17; Rs.l88051; ESTs; fn3,pklnase,SAM,EPHJbd.none; 183 
445803; AV655264; Hs.4283; ESTs; pkinase.RGS.PH.myosin_be3d,Myoslrut^; 183 
435905; AW997484; H3.5003; KIAA0456 protein; SH3,RhoGAP,FCH;TM=M;; 183 

414991; C17898; ; gb:C17898 Human placenta cONA (TFupwara) Homo sapiens cDNA done GEN-554E10 S, mRNA sequence; Zip.none: 183 

423067; A/^355: Hs.285401; colony stimulating factor 2 receptor, beta, km-Om (gramdocyte^nacrophage}; fn3;TM=Y;SS=M; 182 

419088; AIS38323; Ks^2620; integiin, beta 8; tntegrirLB,none; 182 

411704; AI499220; Hs.71573; hypothetical protein FU10074: pUn8sa:TM»l«: 182 

459346; AW510557; Hs.258016; EST; none;TM=M;; 3.82 

445330; R52656; Hs.21 691; ESTs; 7tm.1 .none; 182 

4514S2; BES60065; Hs.26433; ddldiyHihosphato (UDP^^ace^ucosamine} N-acelylglucosamlnephosphotransliBrase 1 (GteNAo-l-P transferase); Giycos.lransf.4;TM=Y;SS=>M: 
181 

405545; ; ; Target Exon; ABC,lran,SRP54.ABC_membrane;TM=Y;SSsM: 181 j 

448155; NM.005591; Hs.20555; melolic recombination (S. cerevlsiae) 11 homdog B; M8ta]tophoB.RibosoroaLL15e;SS=M; 181 
416305; AU076628; Hs.79187; coxsadde virus and adenovlnjs recepton {g;TM»Y;SS=M; 180 
415444; BE247295: Hs.78452; solute carrier family 20 (phosphate transporter), meniber 1; PH04,LIM;TM=M:; 180 
421684; BE281591: Hs.106788; hypothetical protein FU10511; ArmadillojBeg;SS=M; 180 

438581; AW977766; Hs.292133; ESTs. Moderately slmOar to 178885 serlne/threonine^pedite piotoin kinase [H.sapiens^ pkin8se.RI01.none: 179 
439199; R40373; Hs.26299; ESTs; ton_tr8ns,none; 3.78 

450931; N25156; H3.25648; tomor necrosis fador receptor superfantily, member 5; TNFR.c6;TM=Y;SS=M; 178 
417691 ; AU076610; Hsi2399; low density llpoptotehi receptor defed C complementing; none;SS«M; 3.78 

430355; NM_006219; Hs.239818; phosphdnostfide-S^lnase. catalytic, beta pdypeptide; PI3J'l4_kinase.PI3Ka.Pi3K_C2,PI3Kjbd,Pt3Kj)85B;TM=M:; 3.78 
448119: H38587; Hs.346509; dedicator of cyto^esis 1; non6;TM=:Yn 178 

442013; AAS06476: Hs.10600; Human DNA sequence from done RP1 1>353C18 on chromosome 20 Contains ESTs. STSs. GSSs and CpG islaruis. Contains the NIFS gene for 

cysteine desdh^ase, two genes for novd proteins and the gene for the spHdng fador CC1 .3 with a second isofbm) {CC\ .; none,none; 3.77 
425481; AW978162; H3.18571; ESTs; none.OxysteroLBP; 3.77 

41 141 1; AA345241 ; Hs.55950; ESTs, WeaWy slraflar to KIAA1330 protein [H.S8piens]; RNAj)ol^lg.MHC»eEF2^kin8se;8S=M; 177 

426866; U02330; Hs.l72816: neuregufin 1; Peptid3se_M49.EGF,ig.NeureguIin;TM=M;; 177 

430396; D49742; Hs.241383; hyalumnan-binding protein 2; ank.death.ZU5.EGF,kringte,tiypsln.NebuIin.LIM:SS»M: 177 

434398; AA121098; Hs.3838; senmvlndudbte kinase; pktn8se,POLO_box;TM^; 3.77 

415485; AW272990; Hs.18571; ESTs; non8,Oxysterd_BP; 176 

453226; AA641926; Hs.6171 2; pyruvate dehydrogenase kinase, Isoenzyme 1; HATPase_p.none; 176 
418758; AW959311; Hs.172012; hypolhetfcal protein DKFZp434J037; pklnase,RI01;TM=M:; 176 

424642; AA034127; Hs.153487: signal tmnsdudng adaptor molecule (SH3 domain and ITAM moSO t; SH3.VHS.UIM;TMsM:; 175 

426500; NM.014638; Hs.170156; KIAAO450 genepnxJud; C2,P^PtC-Y;TM=M; 175 

419952; AK000967; Hs.93872; K1AA1682 protein; non8;TM=M.-; 175 

42S424; NM.004954; Hs.157199: ELKL motif Unase; pkinase.UBA.KA1;TM^M:; 175 

431696; AA2S9068: Hs.267ai9; protdn phosphatase 1. regdatoiy (Miifaaof) subunit 2; none:SSBM; 175 
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444184:187841: H5.28299(); Human DMA sequence from done RPV 

pioteins. (ho S* end of a novel gene. ESTs. STSs. GSSs and thiee CpG Islands: p»nase^01^J<0WtTkNh«; 3.75 
405411; ; ; ENSP00000252213;SOOIUM BICARBONATE COTRANSPORTER.; none:TW=Y;SS=Mt 3.75 
405602: ; ; Target Exon; pklfiase;SS=M; 3.75 

429355; AW973K3; Hs.292689; ESTs; p}dn3se.t)ZIP,Annadiflo.seg,none; 3.75 

430153; AW968128;Hs.335679; ESTs; pkina5e,nono; 3.74 _ 

414180; A18B3304; Hs.120905; Homo sapiens cONA FU11448 lis. done HEMBA1001391: P13.PI4J0nase.roKaiPI3ieC2.PI3IOMJ'l3Kj)^ 3.74 
432235; AA531132;;Qb:r^7h0as1 Na_CGAP_Pr9 Horno sapiens cDNA clone, mRNA sequence; pWnase.f»ne: 174 

433390; AA585950; Hs.260180; Homosaptens mRNA; cDNA DKFZp761618121 (fromctone DKFZp761G18121): comptetecds; none.speclfln.SH3.PKa^ 3.74 

426485; NM_008207; Hs, 170040; plateJeWerived growth (actor receptof-nke; lg;SS=M; 174 

408414; All 14688; Hs.17998; ESTs, V\teaWy simOar to 2109260A B cefl growth factor (H^sajtensl: lh3Ai;TM=»Y;SS«M; 3.73 

409793; AI825463; Hs.147996; protein Wnase. X4lnKed; pJdnase,pkinaso.C;TM=M;; X73 

412456; T32689; Hs.7859; ESTs; BAG,none; 3.73 

407894; AJ278313; Hs.41143; phospholnositide^pecific phosphoBpaso Ctela t; C2,PI-PLOY.PI-PlC.X;TM=MS 3.73 
442229; AI885776; Hs.8164; Munbrey nanism; MATH.DENN.GRAM,zf-B.box,dOENN,uOENN;SScM; 173 
450151; AI0881 96; Hs.22968; Homo sapiens done IMAGE:451939. mRNA sequence: tg4)Unase.none: 3.72 
408331; NM.007240; Hs.44229: dual spedfdty phosphatase 12; DSPt:Th4=M;; 3.72 
417821; 8E24S149: Hs.82643; protein tyrosine kinase 9; coSIin^DP.SS^M; 3.72 

403391; : ; C3001164':gi|1730l96^|JlP50573|6AR3_RAT GAMMA-AMINOBUTYRIOACID RECEPTOR RHO^ SUBUNIT PRE; non8;TM=Y; 3.72 

417627; AA203524: ; glm56e10/1 Soares Jetal>er_spleen J NFLS^I Homo sapiens cDNA done 5*. mRNA sequence; SH3:SS«M; 3.71 

428428; AIJ037544; Hs.1 84298; cyditvdependent kinase 7 (homolog of Xenopus M015 cdlc-acCvaling kfriase); pkinase:TM=fte 3.71 

428180; AI129767; Hs.182874; guanine nudeolide binding protein (G protein) alpha 12; &dpha,8rfn'M-M;; 3.71 

422127: AW504286: Hs.11204% SET binding factor 1; dDENN^ENN.GRAM.PH:SS=Mt 3.70 

430570; Ai417881; Hs.292464; ESTs; 7lm_2,Fz,Frizzled.none; 3.70 

452561; AI692181; Hs.49169; KIAA1634 protein; TPR,PDZ,WW.GuanylalBjdn;TM=M;; 3.69 

432338; NIVL002759; Hs.274382; protein kinase. interferon-inducit)te dout)la stranded RNA dependent; dsrniipkInase;TM=M;: 169 

419945; AW290975; Hs.118923; ESTs; SH3.P0Z,GuanylataJdn.tr8nsfenin; 169 

426539; AB0111S5; H$.170290; discs, large (OrosophDa) homolog 5; SHlP0Z,GuariytateJdn:TM4A:: 168 

438534: AA721628; Hs.191958; Immunoglobunn superfamily receptor translocalion assodated 2: tg;TMsY:SSsM; 168 

407202; N58172:Hs.109370: ESTs; F5.F8.lype.CpWnase,Ets.none: 167 , . ... 

420297; AI62B272; Hs.88323; ESTs, Vfeddy similar to ALU1J<UMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY (H^apiens]; pWnase.TUDOR.none; 
3.67 

417863; AB000450; Hs.82771; vacdnia relaled kinase 2; pkin3se;TMsM;; 3.67 

425304; AA463844; Hs.31339; fibroWasI grovrth factor 11; FGF.Neur_chan_LBD,Neurj*anjnemb,none; 167 

418318; U47732; Hs.84072; transmentrane 4 superfamily memt)er 3; transmembrBne4;TM»Y:SS=M; 167 

419511; AA429750; Hs.751 13; general transcrlpfion factor IIIA; ©ypto.none; 166 

424315; AW614850; Hs.ig3384; putstafive 28 kOa protein; none.none; 166 

41 3076; U10564; Hs.75188; west (S. pombe) homolog; pkInase;TM=M;; 166 

425838; NM_014071; Hs.l59613; nudear receptor coacthralor RAP250; peroxisome proliferator-adhraied receptor btterading protein; thyroid hormone receptor binding protein; 
none;TM=M;; 165 

446983; AA157484; Hs.97199: complement component Clq recepton EGF,lectin^cTlssueJac,Xllnk.TIL:TM=Y:SS=M; 3.65 

434350; AL042940; Hs.93872: K1AA1662 protein; none,none; 165 

457317; AA683016; Hs,12210; hypothetical protein FLJ137328lfT*artotensin; SH2;TM=M;; 165 

434416; AA80S903; Hs.59498; oeli division cyde 2'iike 5 (cholinesterese^laled ceil division controller); pkina5e.none; 165 

410174; AA306007; Hs.59461: DKF2P434C245 protein; none.DSPc; 3.65 

423598; BE247600: Hs.155538; ESTs; 7tnu1;TM=Y;SS«:M; 165 

440B61: BE2441 15; Hs.74B2; KIAA0682 gena product; nm,GuanytateJ(in;TM^M;; 164 

454954: AW993013; Hs.49169; KIAA1S34 protein; TPR.PDZ,WW,Guanylate_kin;TM=M;; 164 

43Q2S0; NM.016929; Ks.283021; dilorida intracelluiar channel 5; nofle;TM=M;; 164 

450587; AI82B854; H8.258538: strialin. C8lmodulir»*lnding protein: pWnase.WD40;TM=Y;; 164 ^ , 

424693; AW295112: Hs.153648; Homo sapiens cDNA FLI13303 fis. done OVARC1001372, highly similar to Homo sapiens llpTm-aIpha4 mRNA; SAM;SS=M; 164 

425645; AA381027; ; gb:EST70242T^»U lymphoma Homo sapiens cDNA S end, mRNA sequence; HMG.box.DNAjnisjBpair.HATPase_c,non8; 164 

417426; NM_002291; Hs.82124; lanilnin, beta 1; laminin_EGF,lamJnin JRenn,lntegiin.B;SS=M; 163 

451292; AB037716; Hs.26204; KIAA1295 protein; SH3;TM=M:: 163 

412314; AA825247; H8.250899: heat shock fador binding protein 1; 7biLl:TM"Y;S&^ 163 

418303; AA215701: Hs.1 86541; ESTs. Weakly similar to 138022 hypotheOcd protdn (H^sapiens]: elF5.elF2B.W2,pk]!iase.UBA.KA1; 163 
452716: Ai914925; Hs.222240; ESTs; SH2,STAT,STAT_blnd,STATjjrot,none; 163 

403859; ; ; NM-004520*:Homo sapiens klnesin heavy chain member 2 (KIF2), mRNA. meiriber 3 (KCNQ3), mRMA.; kinesln:TM=M;; 163 

450377; AB033091; Hs.74313; KIAA1265 protein; Zip;TM=M;SS=M; 163 

417793: AW405434; Hs.82575; small nudear ribonudeoprotein potypepflde B"; cnn;TM=M:: 163 

404942; U30825; ; spiidng factor, aiglnlnefeerlne^h 9; C036;TM»Y:SS=M; 3.63 

429554; NM.01 2275; Hs.207224; interleUktn 1, delta; IL1;TM^M;; 3.63 

417871; AA521368; H5.24252; ESTs; IBB/nnadiIk).seg,none; 162 

437872; AW748265: Hs.5741; flavohemoproteln b57: heme^1,NADJjlnding.ripoxygenase.FAD.bindlng_6:TM=M;: 162 
438698; AW297855; H3.125815; ESTs. Weakly similar to 138022 hypotheOcal protein (H japlensj lipoxygenase,PLAT,none; 162 
447217; BE465754; Hs.17778; neuropiTm 2; CUB,MAM.F5J^8„lype_C;TM=^tSS*M; 151 

407961: AW672939; H8.41694; origin recogniton comptex, subunil 2 (yeast homologH*s; none.pWnase.pro_isomerase; 161 
428840; M15990; Hs.194148; v-yes-1 Yamaguchi sarcoma viral oncogene homdog 1; SH2.SH3.pWn8se:SS=M; 3.61 
455608; BE011437; ; gb:CM4^N02204)80500-170-f03 BN0220 Homo sapiens cDNA. mRNA sequence; none.COK5.activator, 161 
407748; AL079409; Hs.38176; KiAA0606 protein; SON Orcadian Osdllatory Protein (SCOP); PP2aiKR.PH;TM=M.-: 3.60 
421474; U76362; H3.104637; solute carrier farrily 1 (giutamate transporter), member 7; SDF;TM=Y;SS=M; 160 

449987; AW079749; Hs.184719; ESTs. Weddy similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.8aplens); 

ABC tran.ABCjnembf8ne,ton_trans; 160 
403142: : : NM_002706*:Homo sapiens protein phosphatase 1 B (formerly 2C). magnesium-dependent, beta isoSonn (PPM1 B). mRNA.; PP2C;TM=M*,; 3.60 
400844; ; ; NIWL003105*:Homo sapiens sortiflrHelaled receptor. L(DLR dass) A repeats^»ntaining (SORLl). mRNA^ 

EGF,fn3.WljBcepLaitflJEcepLb.granunn.BNR;TM=Y;SS=M; 159 
4S0152: A1138635: Hs.22968: Homo sapiens done IMAGE:451339. mRNA sequence; ig.pkinase.nona: 3.59 
429782; NM-005754: Hs.220689: Ras-GTPase^lhratlng protein SHl<lomafrblnding protaln: frm,NTF2;TM=M:: 359 
436469; AK001455: Hs.5198; Down syndrome critical regbn gene 2; none;SS=M; 159 
437400; AB01 1542; Hs.5599; EGFawJomaIn, mulliplo 5; TNFR,c6.laminin_EGF;TM=Yn 158 
426797; AW9362S8; Hs.342B49; AOP-ffbosyla!ton factof-Dke 5; arf.CajchanneLB.SH3; 156 
431170: AW971246: H3.291022: ESTs: LRR.CARD.none: 158 

434542; AA769310; Hs.61 260; hypolheQcd protein FU13164; PK0xysterdJBP:TM44;SS:4« 158 
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420181: A1380089: Hs.1589S1; ESTs: non8^pldnase4J<R.LRRCT; 3.57 

450572: AI7Q0863: Hs.202494; Homo safdens cDNA FU13245 6s. done OVARC1000681: NsLSuiph.synip.non8; 3^7 
433618; AAG02539: Ks.34S494; ESTs; &8!phaA.deani!nasQ: 3.57 

452695; AW780199; Hs.30327; mitogeiv-activated protdn kinas&s&vatsd protein Unase 5: pUn8se.noRe; 3^ 

418512; AW498974; ; diacytotycerol kinase, zela (104kO); r3S.non8; 3.57 

451752; AB032997; Hs.26966; K1AA1 171 protdn; ATP-syin.CTBC:TM=Y:SS=M; 3.57 

417129; AI381600; Hs.300684; calcitonin gene^elated pepSde-ieceptor component protein: nonejione; 3.57 

449474; AA019344; Hs.2055; ut)iqultin-aclivating enzyme El (A1S9T and BN75 temperature sensiOvity complemenDng); THF,U8ACT.pl(faiase,UCH-2.UCH>1^zM:2H2^- 
RanBP,G*pa)ch; 3.57 

412124; H43378; Hs.288550; Homo sapiens cDNA: RJ23156 fts.clona LN609609; none.none; 3.56 
435021; AA922192: Hs,54709; ESTs; EPaMpJdnase.<h3,SAM,none: 3.56 

431341; AA30721 1; Hs.251531; proteasome (prosome. macropain) subunit, alplia type. 4; prDteasome;TM=M;; 3.56 

437387; Ai198874; H5.28847; AiX)26 protein: oone.7tnL.1.WD40; 3^ 

422583; AA41050&; Hs.27973: KIAA0874 piotdn: ank.6-a)pha;TM^; 3.56 

452102; U04343; Ks.27954; C086 antigen (CD28 antigen tigand 2. B7-2 antigen); non8:TM=Y:SS=M; a56 

4201 12; mL005109; H5.9S220: oxidaSve^tress respond 1; pk{nase;Tli«::lkil:: 3.55 

437639; AA827712; Hs.291860; ESTs; SH3.none; 355 

45^00; mjXXnSSi Hsi74382: protein Unase, bileiferon4nducifale doutrie stranded RNA dependent dsnn.pldnase;T1iHil:: 3.55 

415660; AI909007; Hs.78563; ubiquIAvronjiigaOng enzyme E26 1 (homologous to a elegans UBC7): UGLcoivTMNM;: 3^ 

423393; R37772; Hs.21420; p21-activated protein tdnase 6; p)dnase.PBD;TM=l\4;; 3.55 

428727; AF078&47; Hs.191356: genera) transcripfon factor polypeptide 2 (441(0 sutHinit); PH04,UM;TMsM;; 3.55 

411190; AA306342; Hs.69171; protein Idnase 2; p)(in3se,|]ldnase.C,HR1;TM=M;; 3.55 

408683; RSB665; Hs.46&47; TRAP and TNF receptor-associated prot^; Exo_endoj)hos:TII^M;: 3.55 

412350; AI659305; Hs.73826; protein tyrosine phospliatase. nonreceptor type 4 (megatcaryocyte); Y_pho8phata8e,Band.41 .PDZ;TMsM;: 3.55 
446742; AA232119; Hs.1G085; putaBve G-protetn coupled receptor. none;TM=Y;SS«M; a55 

427283; ALII 9796; Hs.17418S; ectonudeotide pyrophosphatase4)tK}Sphod)estera5e 2 (autotaxin); Sirifatase,SomalomedinJB.PliosphodiesLEndonuclease;TM=M;SS=Y; 3.55 
414688; A1039185; Hs77558; thyroid hormone receptor Interaclor 7; HMG14.17.none; 3.55 
424848; A1263231; Hs.327090; EST; SH3.PDZ.Guanylate_ldn.none; 3.54 

402704; ; ; C1001099*:g!160058961reqNP_OO9101.1i testis-speditc protein idnase 2 [Homo sapiens] glH; none.none: 3^54 

444099; D87432; Hs.10315; solute carrier {amtly 7 (catbnic amino add transporter, y system). memt)er 6; aaj)erniease8;TM»Y:SS=:M; 3.54 

429687; AI675749; Hs.211608; nudeoporin 153kD; z».RanBP.inteBrin_B;TM=M;; 3.53 

413879; AA132951; Hs.212533; Homo sapiens cDNA: FU22572 fis, clone HS102313; none,none; 3.53 

431045; AW968560; Hs.301957; nudix (nudeoside diphosphate tinited moiety )()-type motif 5; NU{)IX.secY,E1.dehydrag.lr8nslal4iyr;TMsY;8SeM; asS 

423855; AA331761; Hs.254859; ESTs; none,pWnase,UQ_con.vwa,F&GAP.integrinJ^; 3.53 

440682; AW362152; Hs.27181; nudear receptor lading factor-2; cydin,l)ZIP:TlWi=M;; 3.52 

410686; A1733735; Hs.114905; 1RE1, S. cerevisiae. homolog of; pldna5e,Bacterial_PQQ:TM=M;SS=M; 3.52 

449610; ABQ08681; Hs.23994; actlvin A receptor, type IIB; p)unase>\ctivin_recp;Tli«=Y;SS-M: 3.52 

418755; Y14443; Hs.88219; zinc finger protein 200; z^C2H2.zf-BED:TM=M;; 3.52 

448804; AW512213; Hs.342849; ADP-^osytatton fador-Eke 5; arf.Ca.c)iannd_B.SH3; 3.52 

438507; AA809052: Hs.182018; ESTs; none,none: 3.52 

456559; AI336273: Hs.102&48; gtucocorticoid receptor DMA factor 1; none.PAS; asi 

41 0054; AL1200S0; Hs.58220; Homo sapiens cOt^ FU23005 fis. done LNG00396. highly simiiar to AF055O23 Homo sapiens done 24723 mRNA sequence; 

RasGAP.adenylateidnase; 3.51 
422321; AAg06427; Hs.181035; hypotheticai protein MGG11296; none;TM=Mn 3>51 
445701; AF055581; Hs.13131; lymphocyte adaptor protein: SH2,PH;TM=M;; 3.50 

407393; AB038237; ; gb:Homo sapiens mRNA for G proteins»upN receptor 051^ complete cds.; 7tm„1 ;T1tf^Y;SS=l\A: 3.50 

443303; U67319; Hs.9216: caspase 7, apoptosis-<elated cystdne protease; pWnase,ICEj)10.iCEj)20;TM=M;SS=M; 3.50 

420673; A60081 12; (^99847; peroxisome I}k)genesi8 factor 1; AAA.APS_l(inase;TM^;SS^M; 3.49 

424663; NM.002351; Hs.151544; SH2 domain protein 1A. Duncan's disease (lymphoproliferalhre syndrome); SH2;TM::M;; ^49 

429327: AA283991; Hs.199248: prosta0iand1n E receptor 4 (sulitypB EP4): 7lnLl;TM»Y;SS"M; 3.49 

400178; ; ; Eos Oontrol; nona,Somalomedln_B: 3.49 

439549; AW937885; Hs.137314; ESTs; SH2.none; 3.49 

436345; AA873008; Hs.121572; ESTs; CARD.BIR.zl^C3HC4,CARD.BlR^C3HC4; 3.49 
427658; H61387; Hs.30868; nogo receptor; UW,LRRNT,tJWCT;SS=M; 3.48 

402833; ; ; C1OO2508.-gii6691937|embiCA665797.1| (AL09677(q t»A15QA6.2 (novel 7 transmennbrane receplor, none,none; 3l48 

442363; AI337304; Hs.23120: PIST; fn3.pldnase.PDZ.DUF139;TM=>Y:SSeM; 3.48 

409132; AJ224538; Hs.50732; protein kinase, AMP-activated, beta 2 non-catalytic subunit; none:TM=M:; 3.47 

417971; Y08991; Hs.83050; phosphoinositidd-3-i(inase, regulatory subunit 4, pi 50; pldnBse,WD40,HEAT;SS»M: 3.47 

432169; Y00971; Hs.2910; phosphoribosyl pyrophosphate synthetase 2; Pribosytlran;; 3.47 

447425; AI963747; Hs.18573; acytphosphatase 1, erythrocyte (common) type; Acylphosphatase:SS»M: 3.47 

427231; AW851989; Hs.2B5814; sprouty (DrosopHIa) homdog 4; SH2,SH3;m=M;SS=M; 3.46 

401851; ; ; NKlL002401*:Homo sapiens mitogen-adh/ated protdn tdnase kinase kinase 3 (MAP3K3}. mRNA; pkinase;SS=M; 3.46 
407677; AW01681 1; Hs.234478; Homo sapiens cDNA: FU22&48 (is. done HSI07329; pkinase.pkinase_C.none; 3.45 
432279; N95104; Hs.274260; ATP-Wnding cassette, sub-famBy 0 (CFTR/MRP). member 6; ABC_tran/BCjnembrane,nona; 3.45 
437103; AW139408; Hs.152940; ESTs; Ch(dineJdnase.none; 3.45 

420338; AA825595; K3.88269; Homo sapiens, done M6C:17339. mRNA. complete cds; 7tm-1:TM=Y;SS=M; 3.44 
422209; AF005210; Hs. 113222; dtemoMne (C-C motif) receptor 8; 7tnv.1.7tnu2:TM=Y;SS=ftlfe 144 
410781; AI375672; Hs.165028; ESTs; pJdnase,landnln.Nlerm.laminln_EGF.cydin.F.baxflfdlrLa^^ 3.44 
437296; AA350994; Hs.20281; KIAA1700; Rhodanese,DSPc;TM=M;; 3.43 

419855; A1935182; Hs.144139; ESTs; zf^C4.UBA.CblJ^.aLN2.CbLN3.zf<:3HC4.UBA.CbLN,Ctt.N2,CttJM^ 3.43 
433336; AF017986; Hs,31386; secreted Hzzled-relaled protein 2; Fz,NTR;SS=M; 3.43 
428483; AI908539; Hs.184592: KiAA0344 gene product; none.none; 3u43 

445119; AF035121; Hs.12337; kinase bsert domain receptor (a type 111 receptor tyrosine kinase); lg,pWnasB;TM=Y;SS=M; 3.42 
454468; A1590319; H3.19122; eukaryoQc translation initialion fador 4E-nke 3; none.l^ur_chan_LBD.Neur_chaiunemb.lF4E: 142 
410386; W26187; Ks.3327; Homo sapiens cONA: FU22219 fis. done HRC01637; pkin85e,GuaiiylateJ(inPOZ.$H3.UZ7«none: 3.42 
422907; A1879263; Ks.77273; Human glucose transporter pseudogene; none.none; 3.42 
449816; AI701457; H5.38894; ESTs; SET.BAHJ^PI^C; 142 

440074; AA86304S; Hs.10669: ESTs, Weakly similar to T00050 hypotheBcal protein KIAA0400 tH.sa(fens]; SH3.8nk.tubuIin4MtgM3ap;^TM^SS>4A; 142 
425475; WS6339; Ks.107057; ESTs; pMnasejione; 142 

401242; AB028989; ; nntogen<activated protein kinase 8 Interading protein 3; CysJ(notTGF-betavwa,vwcvwd.TIL.DUF139;SS»M; 141 
429276; AF056085; Ks.198612; G protein^xipled receptor 51; 7tnL.3.ANF jeceptor.bZlP;TM=Y;; 3.41 
445800; AA126419: Hl32944: inasHoi pQ)yphosphate-4pho$phata5e. type I. lOTkO; none.non8: 3.41 
410908; AA121586: H8.10592; ESTs; GTP_EFTU.GTP.EFniJ»,GTPJEFTU.D2.none: 3.41 
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45298Q: AK001335: Hs.31 137; protebi tyrosine phosphatase, raceptar type; E: Yjitiosphatas8.none: 140 

447898: AW369638: Hs.1 1 2318; 6^ kd protein: none.none; 140 

450402; BE218027: Hs.89969; ESTs: SH3.non8; 3.40 

441466; AW673081: Hs.54828: ESTs; pMnase,zK:2H2.KRAB.none; 140 

408546; W49512: Hs,4634B; bradykbifn receptor 81; 7tm.1 ;TM=Y;SS=M; 3.40 

410927; T77635; ; 6t).7c91h06.r1 Scares btfani bran 1 NIB Homo sapiens cONA done S*. mRNA sequence; none,non8: 140 
409646; AW161391; Ks.709: deoxycy&rine kinase; dNK;SS=M: 139 

417165; R80137; Hs.302n8; Homo sapiens cDNA: FU214251is. done COL04162; Su}fale.transp.STAS.HM6J)0x: 3J9 
449343; AI151418; ; protein phosphstase 3 (formerly 2B), catalytic sutxinlt alpha isofomt (caldneuiln A alpha): none^one; 3.39 
450511; R07423: Hs.85092: thyroid hormone feceptofinteractor 11; Myosln_lail.EGF; 139 
414271; AK000275; Hs.75871; protein kinase C Ijinifing protein 1; bromodoman.PHD.PWWP;rf-MYND;TM=M;; 3.38 
418428; Y124gO; Hs.85092: thyrdd honnone receptor interactor 11; bZIP,kinesin.6TP.cydohydrol,M;TM»k/t: 337 
422369; AF005216; H8.1 15541; Janus kinase 2 (a protebi tyrosine kinase): SH2,pldn8se:TM=M:: 137 
456451; At761180; Hs.94211; rcdl (required for eel dIffeienOation. 8.pontbe) homdog 1; none:TM>M;; 3.37 
438543; AAB10141; Hs.192182: ESTs; SH^pkinase,none; 137 

401943; NM.012434; ; solute carrier family 17 (antonfeugar transporter), member 5; iione:TMsM;: 136 
415276; U88866: Hs.78353; SFf^ protein kinase 2; pkinase;TM^M;: 138 

447881; BE620886; Hs.75354; GCN1 (general oontnl of 8min&«dd synthesis 1. yeasO^Ike 1; pldnase.pklnase: 135 
434533; AA639257; HS.29K49; ESTs; 8H3PDZ.Gu8nylateJdn4)one; 3.35 

432639; AW973785; ; gb:EST385885 MAGE resequences, MAGM Homo sapiens cDNA. mRNA sequence; nGne.lRK; 135 
410678; BE540516; Hs.293732; hypothetk^al protein M6C3195; Arm3dll1o_$eg;TM=M;SS°Y: 135 
402807; ; ; ENSPC0000235229:SEMB.; tnlegTta.B,Sema,PSI;TM=Y;; 135 
420189: AW2g6380; Hs.95821; osleodast stImulaSng factor 1; SH3M:; 134 

437389; AU59587; Ks.271586; hypolhetica) protein DKFZp762M115; secY,E1.dehydrog.tran5keU^,none; 134 
453423; NH4_002647; Hs.32971; phospholnositide^kinase. dass 3; R3.R4.kinase,PI3Ka,PI3ieC2;TM=IW;; 3.34 
414270. 1.20852; Hs.347527; sdule carrier tamlly 20 (phosphate transpoiter). member 2; Enterotoxin_A,PH04:TM=Y:SS=M; 133 
417479: A1057052; ; ESTs. Weakly similar b Z195JiUfc4AN ZINC FINGER PROTEIN 195 (H.sapiensJ; iJW,CARO.none; 133 
424946; M64572; Hs.153932: protein tyrosine phosphatase, norneceptor type 3; Band_41,PDZ,Y_phosphatase.none: 133 
452681 : AF1 53330; Hs.30246; solute carrier famBy 19 (thiamine transporter), member 2; Fotate.carrienTM='Y;SS=M; 133 . 
426477; AA379464: ; gb;EST92386 Skin tumor i Homo sapiens cDNA 5* end. mRNA sequence; DUF6.MATH3TB; 133 
438283; A1458931; Hs.37282; ESTs; none.trBnsm8mbrane4; 3.33 
421327; AA837295; Hs.18B802: ESTs; none.lMP4,Y_phosphatase: 133 

432481; AW451645; Hs.151504; Homo sapiens cONA FU11973 fis, done HEMBB1001221: lam]nirLG.Odlagen«COLFlCQrA,TSPN.none: 3.33 

452662; AA456ig3; Hs.9071; progesterone membrane binding protein; homeobox,none: 3.32 

428997; AF065391; Hs.194718; zinc finger protein 265; zf.RanBP;TM=»M;; 132 

432211; BE274530; Hs.273333; hypothetical prolan FU 10986; FGGY_C;TII*:M:: 131 

443601; AI078554; Hs.15662; ESTs; ank.pkina5e,de3th.Ribosoma).S14; 131 

430597; AF062006: H5.285529; G pfotein<ouplad receptor 49; 7tm_1.UW,UTOT;TM=Y;SS=M; 3.31 

419912; AF249745; Hs.6066; Rho guanine mjdeofide exchange Jacior (GEF) 4; SH3.PH,RhoGEF;TM=M;; 131 

400380; NML018485: Hs.283079; G protefaMOupled receptor C512; 7lm.1:TM»Y;8S»M: 131 

41S983; AI436798; Hs.1 17078; Homo sapiens cDNA: FU23028 Us. done LNG01852, highly similar to HSU08023 Human cellular probonoogene ((xner) mRNA; 

fh3.ig,pklnase;TM=Y;SS=M: 131 
441054; AA913591; Hs,126480; ESTs; none,7lm_1: 3.31 
416342; BE002723; ; leptin receptor; IC£j)20,DEO.lC^10.ICEj>20.OED; 131 
446126; AW836779; Hs.1 13029; ilbosomal protdn 825; none.7tnLl; 131 

425086; AW957571; Hs.12319: Homo sapiens cDNA FU12774 fis, done NT2RP2001663, moderdlely simitar to ENOLASE (EC 4.2.1.1 1); none,GuanylateJdn.PDZ.SH3: 3.31 
425725; NlA.012243: Hs.159322; sdute center lamHy 35 (UDP^acetylglucosamlne (UDP-QcNAc) transporter), member 3; DUF6;TM=Y;S8=M; 130 
422808; AW160544; Hs.118695; potasshim voltag&flated channel, subfamily G, member 1; k>nJrans,K.tetra;TM=Y;: 130 
429081; Y14039; Hs.195175; CASP8 and FADWika apoptosls regulator; ICE4)20,DED:TM=fA: 130 

433856; AWg74941; Hs.292385: ESTs, WeaMy skniar to 178885 serineAhreonine-spedfic protein kinase (H^apians]; pkinaso,ABC1,none; 130 
413132; NM006823; Hs.75209; protein kinase (cAMP^pendent, catalytic) inhibitor alpha; PKI:SS=M; 130 
402603; ; ; ENSPO00002512O6*:KIAAO778 PROTEIN (FRAGMENT).; none;TM=Y;; 3.30 
418801; AA228366; Hs.115122: ESTs; IntegrinJU^G-GAP.none: 130 

400275; ; : Nl\^00651 r:Homo sapiens seryl-tRNA synthetase (SARS), mRNA. (SAM68). mRNA.; tRNA^ynL2b.8eryLtRNA-N:TM=Mn 129 

4402B6; U29589; Hs.7138; cholinergic receptor, muscarinic 3; 7tnu1:TM=Yn 3.29 

409101; NM_004297: Hs.50612; guanine nudeotide binding protein (G protein), alpha 14; G-aIpha,none; 129 

432738; AA788898; Hs,179902; transpofter-Bke protein; none;TM3Y;SS=M; 129 

408738: NiyL014785: Hs.47313: KIAA0258 gene product; none;TM=M;; 129 

443195; BE148235; H3.193063: Homo sapiens cDNA FU14201 Its. done NT2RP3002955; AaJrans,none: 129 

405328; ; ; NM.005391 :Homo sapiens pyruvate dehydrogenase kinase, Isoenzyme 3 (PDK3), mRNA^ HATPasejB;SS=M; 128 

418764; N30531; Hs.42215: protein phosphatase 1. regulatory subunit 6; nona,none; 3.28 

408756; AA524743; H3.44883: ESTs; ArTnadilIo^5eg,IBB,DEAD,helkase.C.Sec63.DDT,PHD.bron»domaIn;TM»M;: 128 

415474; NM.014252; Hs.78457; solute center family 25 (mitochonddal center, omlthlne transporter) member 16; mito«car;TM=M;; 128 

417805; U3B545; Hs.82587; phosphdipase D1, phophalidytdtdine^pedlic; PH,PLDc.PX:TM=M:; 128 

410254; BE004131: Hs.318510: Homo sapiens d}NA FU13682 fis, done PLACE20O0O1S. mak^ similar to EPIDERMAL GROWTH FACTOR RECEPTOR SUBSTRATE 

SUBSTRATE IS; eihand,none; 128 
443968; AA287702; Hs.10031: KIAA0955 protein: CARD;TM=M;SS=M: 128 
438899: AF085833; H8.135624; ESTs; m)ne,R3J>l4Jdnase.Pt31^Pi3K.C2,P13IObd,PI3KjJ85B: 3.27 
415663; AW296841: Hs.313332; ESTs; UO.con.Neur.dianJSO.Neur.d)anjnemb; 127 

414087; W19712: ; gb2b36d03j1 Soare3_p3rathyroW_lumor_NbHPA Homo sapiens cONA dona ff, mRNA sequence; pklnase.none; 127 

442833; AA328153; Hs.88201: ESTs. WeaMy similar to A Chain A. Crystal Stnictuie Of The Human Acyl Protein Thtoesterase 1 Al 1.5 A Resdutk^n [Ksapiensj; 

abhydrola5e_2;TiyNyi;; 127 
444754; T83911; Hs.11881; transmembrane 4 superfamily member 4; none;TM:sY;S&>M: 126 
432579; AF043244: Hs.278439; nudeoter protein 3 (apoptosb repressor wllh CARD domain); CARD;TM=M;; 3.26 

458943; AW249181; Hs.19954; ESTs. Weakly similar to T19873 hypothetkal protein C41C4.2 - Caenorhabdife eiegansJCdegansl; nona.pldn3sebR6S: 126 
411974; AW880414; Hs.84264; addto protein rfdiln leudnes; E1-E2^TPase.CaIk)rLATP8seLCCalk3n>TPase.N.Hydrolase.8sp; 3.26 
437145; AF007216; Hs.5462; sduto canier famOy 4, sodium btoarbonate cotransporter, member 4; HC03jcolransp;TMsY:; 128 
423387: AJ012074; ; vasoadhro hilesfinat peptMe receptor 1; Tbn J.HRM.CSD;TMsY;SS«M: 125 
442643; U82756; Hs.3991; PRP4/STK/WD splteing fador, W040;SS=M; 125 

417525; R93355; Hs.192991; ESTs. WeaWy similar to ALUBJ<UMAN UII ALU CLASS 8 WARNING ENTRY Hi [H.sa;fens|: SH3,lg.pWnase.PH.5pedrin,RhDGEF:SS*M; 125 
412283; BE069084; ; gb:QV3-BT0379-1 401 00^)58^1 2 BT0379 Homo sapiens cDNA, mRNA sequence; ton„trans,RyDRJTPR,MIR.none; 125 
411213: AA676939: Hs.69285: neuropilin 1; MAM.F5J«Jype.aCUB,CUBJMAM.F5J8.type.C; 125 
400352; AF227133; ; taste receptor, type 2. member 7; none:TM»Y;SS=M: 125 
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402974: ; : Target Exon; Yjilio5phal8S8,GnRH,honTione5>MTnone4; 3J25 

407644: 016815: Hs.37288; nudear teceptor subfamily 1. group D, menter 2; hoiinoR8jecj:K^Tfi^M;SS=M; 3.25 
421654; AW163267: Hs.106469; suppressor ol varl (S-cerevislae) 3^8 1: hencase.QSS^M; X2S 
438022: AW517524: Hs.135201; N002 protein: LRR.CARD.GTP_C0C;ViraLhellcase1;TW=MK 324 
449954: AW001741; H$.24243: hypolireaai protein nj10705: pWnase:TM=M:: 3,24 

428816: AA004986; Hs.193852: ATP-WndIng cassette. sub-femBy C (CFTR/MRP). member 2: EGF^W,liypsirwCUB>BC.Iran^.itwnbr8ne:SS=M; 3.24 
427319: AW531495: Hs,27135: &«l receptor-assod^ protein BAP29: fiIament;TM=Y:SS=M: 3.24 
421970: AF2271S6: Hs.110103: RNA pdymerase I transcripQon laclor RRN3; 

aa j>em«ases.pyridoxaUJeabit)iiiodomain^O^ 3.24 
41 1887: AW182924: Hs,128790; ESTr, pklnas8:TW=:M:: 3.24 
430180; AA331406: Hs.75456: A kmasa (PRKA) anchor protein 10; RGS:SS=M: 124 
410257; AW978005: Hs.12600: N^thylmaleimlde^ensitive factor attachmenl proteh. bete; none.NTF2: 3.23 
410240; AL157424: HS.612B9: synaptojarin Z Exo.enaoj)hos,SyjaLN.rnn.GranHfej)orins:7M=M:: 3.23 
434510: AF143885: HS.1B190: EST; SH3.FCH,rone: 3.22 

422592: BE081857: Hs.94211: rcdl {required for cefl differentiatton. S.pombe) homotog 1; none.Pi-PI^X,PH.PW»LOY.C2: 3.22 

439303; AA001021; Hs.6685: ttiyrdd honnone receptor interactor 8; non8,cH)ne: 122 

448520: AB002367; Ks.21355: doublecortin and CaM kinase^ka 1; (ddnase.0CX;TM9M:: 3.22 

409245; AA361 037; HsJZ86036; 1RNA Isopentenylpyiophosphate transferase: ArTnadiilo^.TM>M:: Z22 

456946; AA009716: Hs.4231 1; ESTs; none.DSP6.Yj}hosphatase; 3.22 

409048; HS9990: K5.37699; ESTs; Armadn)o.8e9,IBB.none: 3.22 

420357; U94333: Hs.97199; complement compcnenl Clq receptor; EGFJecOn_c,rBSue_facXIink.TlUTM=Y:SS=M; 3.22 
426230: AA357019: Hs.241395; protease, serine. 1 (trypsin 1): lrypsin,to)dru4:SS=M: 121 

41 1352: NM.002890; Hs.758: RAS p21 piDtdn acGvator (GTPase acQvdDng proteiii) 1; SH2^H3.C2.PH.R85GAP:TM»M:S8=M; 3l21 

438333; R39382; Hs.25283: cydirKdependent kinase 8; p}dn8se.none; 3.20 

414202: BE275653: Hs.270379; transmembrane 6 superfamily member 1; 7tiTL5,none: 3.20 

429651; D79246; Ks.279870; ESTs, Weakly similar to A46010 X-ibiked retinopathy protein IKsaptens]; MgtE,none: 3.20 

400987; ; ; C11000939:9l]11464993lreqNP.065260.1) gene for odorant receptor MOR83 [Mus muscuhis] giI6; non8;TM»Y;SS«M: 3.20 

413760; Z25101: Hs.25127; Homo sapiens mRNA for K1AA1725 protein, partial cds; nonoMAfGap; 3.20 

408466; AI909712; Hs.93837; phosphaUdytinosltol transfer protein, raembranusodated; PXPH,PU)cPH,PLDc.PX' 3.20 

409463; Ai458165: Hs.17296: hypothetical protein MGC2376; K,tetra;TM=Mn 3.20 

425910; AA830797; Hs.1B4760; CCAAT-box-binding transcription factor: none;TM=M;; 119 

423798; AF047033; Hs.132904; solute carrier family 4, sodium bicarbonate cotransporter, member 7; HC03„cotransp;TM=sY;SS=M; 119 

407753: AIJD45916: Hs.293419: ESTs; Ephrin.non8: 119 

419355; AA428520; Hs.90061; progesterone binding protein; hemeJ;Th^Y;S&=M: 119 

454128; AL031259: Hs.41639: programmed cefl dealh 2: zf-MYNDjTM^Mr. 119 

421202; AF193339; Hs.102506; eukaryoQc transtellon Initiation factor 2-alpha kinase 3; pklnase;TM=Y;SS=M; 119 

446360; N42553: Hs.267914: honiotog of mouse transient receptor potential-phosphoQpese OinteracQng kinase CHaK; hypothetical protein FIJ201 17; 

loiLtran3.WHCICEF2_kinas8:TM=M;: 118 
458882; R34993: Hs.226666: ESTs. Moderately slnflar to 154374 gene NF2 protein IH.saptenst CRALTRiO.PW; 3.18 
424124; AA335609; Hs.7569: ESTs. Weakly shr^ar to A46010 X-linked retinopathy protein [H.S8piens); pkinase.TBC: 3.18 
444745: AF1 17754; Hs.1 1861; thyroid honnone receptor-associated protein. 240 kOa subunit; none:TM=M;; 3.18 
425399; AA652588: Hs.301348: Homo sapiens cONA FU13271 fis. done OVARCIOOIOOO; SH1HS1 jep,nonB: 118 
425836; AW955696: Hs.90950; ESTs; CbiJJ,Cbl.N2.CbLN3,UBA.zf<»HC4,none; 118 

403335; ; ; NM.021815:Homo sapiens sctete carrier family 5 (cholinB transporter), member 7 (SLC5A7), mRNA.; SSF;TM=Y;SS=M: 117 
428788; AF082283; H8.193516; Bk»D CLUIymphoma 10; CARD;TMsM;: 117 

429558; AI391454: Hs.2072S1; nudedar autoanHgen (5SkD) similar to rat synaptonemal comptex protein: nor»;SS-M; 3.17 

440248: AA876136: H3.153136: ESTs; SH2,none; 3.17 

423706; U95218: Hs.131924; G pmteln^»upled receptor 65; 7tmL.1;TM»Y;8S»M; 117 

429752; H52348; Hs.36636; ESTs; pkinase.pklnase; 117 

446163; AA026880; Ksi52S2; Homo saiHens cDNA FU13603 lis. done PtACE101027Q( noneMNA; 117 

456773; AI038192: Hs.129764; EGF^e repeats and disooMIn Mike domabis 3; finv8H3,nvodiiJie8djaMyTH4.EGF»F5.FBJIype^C.M 117 
434392: AW983709; Hs.250824; Homo saptens cDNA: FU23435 fs. done HRC12631; pMnase^none; 116 
435972: W95088: Hs.114198: ESTs; pkinase,OPR,none; 116 
441401; AI824338; Hs.1268gi; ESTs; Tissue.fac;TM=M;SS::M; 116 

410497; AL157648; Hs.157078; Homo sapiens d)NAFU12793 fis, done NT2RP2002033; none.none: 116 

4011 13; H25530; ; sdute carrier fan% 22 (organto cafon transporter), member 1-like; none;SS4^ 116 

424833; NM.003894: Hs.153405; period (Drosophite) homotog 2; PAS;8S=>M: 115 

453860; AIB03166: Ks.28462: ESTs. We^ simitar to 138022 hypoOieScal protein {H.saplens]; HSP70,none; 115 

435391; AA704588: Hs.58934; ESTs; RP5K,nona; 115 

428065; AI634046: Hs.157313: ESTs; ICEj)20,DED.IC£j)10,ICEj*20.DED: 115 

452688; AA721 140: Hs.49930: ESTs. Weakly similar to putetive p150 [H-sapiens]; SHlnone; 115 

426839: M74782; Hs.172689; Interteukin 3 receptor, alpha (low affinity): none;TM=»M;8S=M; 115 

421247; BE391727: H3.102910: genera) transcripfton factor IlK pdypepQde 4 (52kD subuntf); none;TM=U;; 114 

440249; AI246590; Hs.249175: ESTs;TalD_DNase.pkinasa,daath.none; 114 

409519; AK001015; Hs,55220: BCL2-assoclated alhanogane 2; BAG.TM=Mr, 113 

446135; AW130288: Hs.170318; hypothetical protein FU10147; hoimonejBC.zf-C4;SS=M; 113 

400440; X83957; Hs.83870: nebuDn; SHlNebuDn.- 112 

409099; AK000725; Hs.50579: hypolhetJcai protein FU2071B; Armadilto„seg:TM=M:; 112 
434237; AF1 19908; H3.235516: hypothetical protein PR02955; none;SS=M: 3.12 

4281 79; All 27772: H3.279696: ESTs. Weakly similar to 138022 hypothetteal protein lasapiensj; pkInase,PXpWnase.C:SS»M: 112 
422824; NM_01 2108; Hs.121 128; BCR downstream signafing 1; SH2,PH:TM=M^ 111 

409745; AA077391: ; gb:7B14E12 Ghronu)Some 7 Fetal Brain cONA Library Homo saptens cONA done 7B14Et2, mRNA sequence; 7trTU1iZK:3HC4.lh3,SPRY.KRAB,zf- 

C2H2.rve.z«_box;TM=Y:SSsM; 111 
435411; AW444619: Hs.138211: ESTs; none.pkinase; 111 
424852; A1222779; Hs.144848: ESTs: adenylalekInase,SH2,pklnase.none: 111 
441970; AW959918; Hs.155160; ESTs; rTnu:f^:2K2: 11 1 

453370; AI470523: Hs.139336; ATP*lndbig cassette. suWamily C (CFTR/MRP). member 4; ABCJran.ABCjrambrane;TM=Y;; 111 
413285; BE078405: ; gb:QV2-BT0617-08030O^1^03 BT0517 Homo saptens cONA. mRNA sequence; GCV_T:SS»M: 110 
429458; BE161832: Hs.292689: ESTs; pkInase.bZIP,Armadnto.seg.none; 110 

401 1 85; : : NlyL021625;Homo saptens vanlSdd receptor-reteted osmdkaDy activated channel; 0TRPC4 praleln (0TRPC4), mRNA.; aitk.torUrans;TM»Y;: 110 

404537; Z25884: ; chtoride channel 1 , sketetal muscte (Thomson disease, autosomdl dominanQ; none;TM:>Y;; 110 
417089: H52280; Hs.18612; Homo saptens cONA: FU21909 fis, dona HEP03834; voaage.aa(»S.none; 3.09 
450792; AA400323: Hs.l83041; ESTs; none.ABC.tran; 108 
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420381: N92054; Hs.194718: zinc finger prol^ 265; zWanBP.TInLl; 109 

444040; AF204231; Hs.18ffl82: 90lglrv67: SH3.C2PKraioGEF^and:Th«=M:: 3.09 

415990: AF124145: Hs.80731; autocrtno moHily factor receptor. 2f-C3HC4.CUE;TM=Y;: 3.09 

442215; AI703172; Hs.129005; ESTs. WeaWy simaar to 210926QA B cell growth factor pisapiens]; norw.none; 3.09 

424187; AA33B561; H5.17287; ESTs. We*^ sfmOar to S26689 hypothelfcal protein hcl • mouse IM.musciilus): lRK,none: 3.09 

425623; AA382825: Hs.132793; ESTs; none:TM=Mr. 3.08 

419577; 1^1; Hs.91296: Wegrtrt. alpha 8; inl8grin_A.F&GAP:TM=Y;; 3.08 

426618; AL036456; Hs.171374: smg GOS^SOOATED PROTEIN; Anradnio_seg:TM=M:: 3,08 

445133; AW1 57646; Ks.153506; ESTs; elharKJ.5peclrin,GAS2.SH3.Pl8Ctln,RA.XytoseJsorivRiD.b^ 

423681; AB023215: Hs.13t525; Homo sapiens mRNA; cDNA OKFZp434E 199 (from dorte DKF2p434E199); partial cds; TTL:TM=M;: 3.08 
428730; AA625947; Hs.25750; ESTs; HECT^ne; 3.08 

427976; AW977808; Hs.80545: crttogen-adlvated protein kinase B Interacting protein 2; Ra)osomal_L37e,pWna$e; 3,08 
412448; U2964; Hs.73895; tumor necrosis factor receptor superfamfly, member 9; TNFR.c6;TM=Y;S$=M; a08 
416814; AW192307; Hs.80042; dolictiyl-P-Gk::Man9G)cNAc2-PP-<to!ichy1glucosyllr8nsferBBe; A!g6.Alg8,7lm^1;TM=>Y;SS=M; 3.08 
427395; AW298741; H3.97861 ; ESTs, Moderately similar to 138022 hypoiiwtical protein {H.sap!sn5); non8.8ldediY,aakln8se; 3.08 
435267; AW450938; HS.1B0115; ESTs; none,PFK; 3.07 

422309; U79745; Hs.1 14924; solute carrier fairily 16 (roonocarboxylic acid transporteis), member 6; 6ugaf_tr,TM=Y;SS=M; a07 

439238; N47305: Hl4666a{ ESTs; 7toLl;TM»Y;SS=M; 3.07 , «, 

458760; AI498631; HS.111334; fenifin, ligM pdypepUe; cy8latln,IWtIn.hlstone.HC»3_colransp,SH3.RiioGAPjtatutirj)e^ 3.07 

424236; AW058114; Hs.7837; phosphoprotein regulated mitogenic patlways; pWnase;TM=M;; 3.06 

427286; AW732802; Hs.2132; epidermal growth factor receptor palhway substrate 8; SH3.TonB_boxC;TM=M;; 3.06 

423878; AI907090; Hs.52891; hypothetical protein PR01853; cyBlayn,lerTitln.histone,H(X)3_cotransp.SH3.RhoGAP.xan^urj)erm^ 3.06 
419270; NM.0OS232; Hs.89839; EphAI; fh34d(lnase.SAM.EPHJbd:TM»M;SS»M; 3.08 

450407: NM.000810: Hs.24969: gamma-amlRobutyrlc add (GABA) A leeeptor, alpha 5; Neurj*8n.LBD^rjcharuneii*fTM=Yn a06 
456249; AI206144; Hs.82508; HRIl^FB2206 protein; none:SS=M; 3.06 

441550: F13385; Hs,7888; Homo sapens done 23736 mRNA sequence; pWnaseiRBcepX*«Bln,Fur!ivflke.YLP4»i»; 3.05 
446488; AB037782; Hs.15119; K1AA1361 protein; pkinasa;SS^M: 3.05 

447495; AW401 884; Hs.18720; pregranmed cell death B (apoptosls^udng tador); pyr jedox:TI«=M;; 3.05 
425390; A1092634; Hs.156114; prolehi tyrosine phosphatase. nofHeceptor type substrate 1; lg;TM=Y;SS=M: 3.04 
409705; M37762; Hs.56023; brain-derived neurotrophic facton NGF;SS=M; 3.04 
413962; AA331563; Hs.24678; spWngoslne-l-phosphalase; PAP2;TM=Y;; 3.04 

426578; R23027; ; gb:yh27e07.rl Scares placenta Nb2HP Homo sapiens cDNA done S*. mRNA sequence; pldnase,none; 3.04 
438005; BE151746: ; gb;PM1-HT030&O612994X)3^06 HT0305 Homo sapiens cDNA, iitRKA sequence; pWnase,UBA,KA1,nGne; 3.04 
438316; AA789249: Hs.80042; gb:aj27g08.8l Soares^testls^NHT Homo sapiens cONA done 1391582 J, mRNA sequence; none,none; 3.04 
452850; H23230; Hs.22481; ESTs. Moderately similar to A46O10 X-llnlied retinopathy protein (H.saplensl; CBS.voltage.aCnone; 3.03 
405266; ; ; Target Exon; aif ,G-alpha;SS=M; a03 

402815; ; ; C1003844':gl|6912550!refiNP_036483.1| olfactory receptor, family 10, subfamily J, member 1; none;TM=Y;SS=M; 3.03 
422803: W28669; Hs.139041: ESTs; traremembrane4.nonB; 3.02 

439325; AF086139; Hs.150423; cycli^dependent Idnase 9 {C0C2-relatBd kinase); pkhiase,Mur_llg3se,MurJigase_C; 3.02 

416389; AA180072; Hs.149846; Integrin, beta 5; hitegria.B,none; 3.02 

418836; A1655499; Hs.161712; ESTs; pkinaseA:thrinjecp,PDZZU5,death; 3,02 

438995; AW748336; Hs.110513; KIAA0421 protehi: none;TM=M;; 3.02 

422676; D28481; Hs.1570; htetamine receptor HI; 7tm_l;TM=Y:SS=M; 3.02 

450267; AW505538: Hs.243620; ESTs; pWnase.none; 3.01 

400566; ;; Taigei Exon: non8;TM»Yn 3.01 

407816: AW500857: Hs.40137: anaphaseirnKTOling complex 1; metotic chedtpoW regulaton PkPLC.X.C2,SH2.PH.SH3,PI-PLC-Y.PAN,none; 101 

429673; AA884407; Hs.21 1595; protein tyrosine phosphatase, nonreceptor type 13 (APO-1/CD95 (Fas)-assodaled phosphatase); Y j)hosphatasa.Band J1,P0Z;SS=M: 3.01 

417067; AJ001417; Hs.81086: solute canter famBy 22 (extraneuronal monoamine IransporteO, member 3; 5ugar_trJM=»Y;SS=M; 3.00 

403212: : : NM.019595:Homo sapiens Inleisectin 2 {ITSN2). mRNA. (CHRNA9), mRNA.; SH3.efhand.C2.PH.RhoGEFJM=M;; 3.00 

410141; R07775; Hs.287657; Homo sapiens cDNA: FU21291 fis. done COU)1963; F6J=e.Vl»-<XpWnase,EtB.none; 3.00 

421059; AI654133; Hs.30212: thyroid receptor {nterading protein 15; none.none; 3.00 

452335; AW188944; Hs.61272; ESTs; none.IRK; 2.99 

437644; AA748575; Hs.136748; lectin-lil« NK ceH receptor, lectln^c;TM=«Y;SS=M; 2.99 

435876: AW612586: Hs.160271: 6 piolelivcoupled receptor 48; 7tiTLl,LRR.U«NT;TM=Y;SS=M: 2.99 

429177: AA447627; Hs.207429; ESTs; 7liTu1,none: 2.99 

449289; BE466067; Hs.225660; ESTs; 3Betq_HSD.pkInasa: 2.99 

454701 ; AW854930: ; gb:PMW;T0263-201099-003^ CT0263 Homo sapiens cDNA, mRNA sequence; SH2^TAT,ST AT_bind.STAT jjcot^none; Z99 
409995; AW960597; Hs.129206; ESTs; pldnase,none; Z98 
446860; AV660685; Hs.282953; ESTs; none,PP2C; 2.98 

438684; AA830105; Hs.194976; SH2 domaln-oonlaining phosphatase andior pratdn 1; lg;TM»Y;SS>M; 198 
434164; AW207019; Hs.148135; serlneAhreonlne kinase 33; (ddn3se;TMsM;: 2.98 

403290; ; ; C1000101 l*gl|4758212|TBqNPJ044l l.li dual spedfidly phosphatase 8 iHomosaplensJgllBOl; none:TM=M;: 2.B7 
433556; W56321; Hs.1 1 1460; c^^dum/tolmoduIiiHiependent protein Idnase (CflM Idnase) H deKa; pldnase.none; 2.97 
421990; T31811; Hs.1 10480; DC12 protein; GKAP.DUF159;TM=M;; 2.97 

428315; AA688152;Hs.9B505; ESTs; pkinase.none;Z97 . ^ . « ^««, 

41 1 140; AW819463; ; gb:RC5-ST0293^1299-031-C07 ST0293 Homo sapiens cONA. mRNA sequence; CaiollneJdnase.Cam_8cyltransf,Sdfalase,CartUPcy!lransf: 2.97 
453998; H47802; Hs.7557; FK506^ndlng protein 5; none.none; ^97 
401342; ; ; Target Exon; none,none; Z97 

453020; AL162039; Hs.31422; Honw sapiens mRNA; cDNA DKFZp434M229 (from done 0KFZp434M229); dNK,none; 2.96 
410976; R35207; HS.2S092: hypotheHcal protein MGC10744; none:TM=M;SSsMi Z96 

431074; BE072772: Hs.153279; ESTs. Moderately similar to A46010 X-Tmked retinopathy protein (asaplens); none,8erpln; Z98 

443829; AI0879&4; Hs.23348; S-phase Idnase^ssodetod protein 2 (p45); F-box.none; 2J98 

400356: AF227137; ; taste receptor, type 2, member 13; none;TM=Y;SS=M; 195 

422559; AW247696; Hs.155839; hypothetical protein MGC12934; 8dh_?lncPGK.Semia!dhyde.dh;SS=M: Z95 

423482; 6E280172: Hs.l29228; gaiadoldnase 2; GHMPJdnases;TM=Mn 2.95 

438330; AW450572; Hs.257316; ESTs; pkinase.zf-C4.ERM.CNH,none: 2.95 

414581; AA256213; Hs,72010; ESTs; none.Carn_acytliansf,ChdinoJdnaso,SC01-SenC,Ghfcos.lran5f J.Gl^ 195 
453058; AW612293; Hs.2886e4: Homo sapiens cONA FU1 1750 lis, done HEMBA1005568: SH2.SH3.C^PKR8sGAP.nono; 185 
430556: AW967807; Hs.13797; ESTs; KECT.none; 194 
400471; ; ; Tanjel Exon; none;TM=M;; 194 

419459; AW291128; Hsi78422: OKFZP586G1 122 protein; MBto!tophos.7tnu1; 194 
40701 3; U35837; ; glxKuman nebulin mRNA. parUai ods; SH3,NebuQn;: 194 
421476: AW9S3805: Hs.21887: ESTs; Ph«I.PAlPi«il: 194 
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426B08; T19228; Hs.172572; hypottietlcal protein aJ20093: 8nk.ptt»se.UPF0073:SS=M; 2.94 

405588: ; ; NM_000299^Homo sajsens pfatophilln 1 (ectodennai dys|iaslatekin Iraollitjr synAwne) (PKPl), mRNA.; AnnadiIlo.8e9;TM=M^ 2.94 

443814: AV65538S: H5.7645: fibrinogen, B beta polypepflde; none,ftone: Z94 

416737: AF154335: HsJ9S91: UM domain protein: UM.P02:TM=M:: Z93 ' 
428522: R10184: Hs.191987: ESTs. WeaWy sbnflar to ALUI HUMAN AUU SUBFAMILY J SEQUENCE CONTAMlNA-nON WARNING ENTRY (Ksapiens): 

nonaMSapMTNFRj^ 2.33 
447818: W79940: Hs.21906: Homo sapiens done 24670 mRNA sequence: none.p!cina5e: 2.93 
432925: AAB78324: : ESTs: none.none: 193 

443670: AW178935: Hs.238707: ESTs: RmaAD,OENN,dDENN,uOENN;TM=M;; 2.93 

447555: AI391662: Hs.1 60953: Homo sapiens, done MGC:1231B, mRNA, complete cds: none:TM=M:: 193 

435092: AL137310: Hs.4749: Homo sapiens mRNA; cDNA DKFZp76tE13121 {from done OKFZp761E13121); pai&l ods; none;TM=M;; Z93 

417670: R077B5: ; gb:yf 1 5c06^1 Scares fetd Over spleen 1NFLS Homo sapiens cONA done 9 similar to contains AIu repeOfive etement:contalns MSR1 repetitive etemenl ;, mRNA 

sequence: XYPPX,ABCjnembrane,A8C_tran: 2.93 
424148: BE242274: Hs.1741; inlegrin, beta 7; {ntBgrtn.B,E6F,metalthio,PSi:TM=Y:SS=M: Z92 

439090: H65724: Hs.347158; gb.'yr76a11.r1 Scares fetal Bver spleen 1NFLS Homo sa|^ cDNA clone 5*, mRNA sequence: pl(lnase,none; 2.92 
408048; NhL007203: Hs.42322: A Idnase (PRKA) anctwr protein 2; ParaIemmIn;TM=Mr. Z92 

428796: AU076734; Hs.193665: solute carrier faroOy 28 (sodiunHsupied nudeoslde transporter), member 2: Nudeoside.tra2.8PO.tFansp.2;TM=Y; 2.92 

41S272: AA164215i HS.2031B6; ESTs; noiie.Exojendoj)hos,BNR,Atrophin-1,B55.pkin3sejg.TPR; Z92 

424775; AB014S40; Hs.153026; SWAP-TO prolwn; e1hand.PH,Neuregufin:TH4=M;: Z92 

439569; AW602166: Hs.222399; CEGP1 protein; EGF,TNFR„c6.0ranunn,CUB,Ker8tin_B2,TIL:TM=M;SS=M: 2.92 

441680: AW444598: Hs.7940: RAP1, GTP-GDP dissociation stimulatDr 1; Annadll)o^:TM=Mr. 2.91 

444784; D12485; Hs.1 1951; ectonudeotide pyrophosphatase/phosphodiesterase 1; SQmalomedlruB.Endonuclease.Phospliodiesl;TM»Y;SS»M; 191 
400398; AF137396; HS.283B79: Ubiquilin 3; 7tRu1.Abi:TM»Y;S&MM; 2.S1 
435592; AI830490: Hs.1466: gtyoerd kfaiase: FG6Y.FG6Y_C;TM=IMn Z90 
400539: : : Target Exon; none:TM=M;; 190 

403743: : : C1002604:gi)8393568|reflNP.0589B9.11 Wnase interarfing with leukemia-associated gene (st; none;TM=M^ 190 

418913; BE046745; Hs.91579: Homo sapiens done 23783 mRNA sequence; Yj»hosphatase.iMP4.none; 190 

428169; AI928984: Hs.182793: golgi phosplioprotein 2; pholoRC,UPF0118;TM=Ys 190 

403912; ; ; C5000394':gi|1 27372801rBtlXP„008681^ keraBn 18 [Homo sapfens)l|6633; m)ne;TM=M,-; 189 

431868; BE246400: Hs.285176: acetyl-Coenzyme A transporter, none:TM=Y;; 189 

421558; AB011 125; Hs.105749: KIAA0553 protein; none;TM=M;; 189 

444100; AA383343; Hs.221 16; C0C14 (cell division cyde 14, S. ceievlslae) homdog B; Yj>tW8phalase.DSPc;TM=M;; 189 

447437; U07225; Hs.339: puriner^c receptor P2Y, G-proleln coupled, 2; 7tm^1,SH2;TW»Y;SSeM: 189 

431512; BE270734: Hs.2795; lactate dehydrogenase A; ldh.ldlv.C.SH3.pkinase,UBA;TM=M;; 189 

446601; A1312783; Hs.155772; Homo sapiens thymic stromal co-transporter mRNA. complele cds; sugar_tr;TM=Yf; 189 

420747; BE294407: Hs.99910: phosphofrudoMnase, platdet; PFK:TM=M,-; 2.88 

449459; BE546846; Hs.195048: ESTs; ank,ras.PH,ArfGap.HC03„colransp; 188 

405099; ; : Target Exon; C1R.PLC.Y.PI-PLC-X;TM=M;: 188 

445890; AF055019: Ks.2ig06; Homo saptens done 24670 mRNA sequence; pkinase.pk{nase; 188 

401445; ; ; NM.021 16r:Homo sapiens potassium channel, sublarally K, member 10 (KCNK10), mRNA.; lon_lrans:TMaY;SS=M; 187 

405480; ; ; Target Exon; none.none; 187 

400189; ; ; Eos Contrd; LRR.PPTA;TM=M:; 187 

450125; AA005418; Hs.158186; ESTs; GIDE-N,7tm_l,none; 187 

432055; AB040973: Hs.272385; G protebwoupted receptor 72; 7lm_1;TM=Y;SS=M; 186 

423619; T4B691: Hs.249159; adrenergic, alpha-2A-, receptor, 7tmJ,7tmJ;TM=Y;SS=M; 186 

417381; AF1M142; Hs.82041 sdute canter family 23 (nudeobase transporters), member 1; xan_ur_permease,RA; 186 

420035; F26725; Hs.187908; ESTs, Weakly similar to A47582 B^l growth fador precursor IH.sapiens); HATPase^c,MOZ.SAS,zf.C2H2; 186 

425480; AB023198; Hs.158135; KiAA0981 protein; PIP5K;SS=M: 186 

446700; AW206257: Hs.156326: Human DNA sequence from done RP1 1-1461.22 on diromosome 6p21.32-2l2. Contains the gene for myefln/oliflodendrocyte glycoprotebi MOG, 

(pert oO the gene for a novel KRAB box containing C2H2 type zinc linger protein, ESTs. STSs, GSSs end a putaSve CpG; none:TM=M;; 2.86 
444595; AL121094; Hs.63572: hypolheticd protdn M6C14433: Y j)hosphatase,8HlYj)hosph8ta5e.8H2: 185 
41 1331 ; AW837178; ; gb:QVUT0037^70300-10(W1 1 LT0037 Homo sapiens dJNA, mRNA sequence; SH2,none; 185 
410763; AF279145: H3.B986: hypotheOcal protein FU21776; none,nooe; 2.85 
440617; AA894880; Hs.181161; ESTs; none,none; 185 
454071; AI041793; Hs.42502; ESTs; 7tnLl.non8: 185 

411040; AF007393; Hs.177574: proteln-klnase. MeifenMvindudble double stranded RNA dependent inhibitor, repressor of (PS8 repressor): Hli1;TM=M;; 185 

402183: ; ; NM-004491*:Homo sapiens glucocorticoid receptor DNA binding fador 1 (GRLF1), mRNA.; none;SS»Mi 2.85 

428753; AW939251 Hs.192927; hypothetical protein FU20251; none;TM=M;; 184 

417070; 219077; Hs.172004; titin; fh3.lfl.SGXXSG.pkinase:TM=Ms 184 

458456; AI122709: Hs.153609; ESTs; bZIP>^rmadiIIo.seg,mn.NTF2,none; 184 

421226; AUJ96748; Hs.102708; DKFZP434A043 protein; Armadillo.segJntegrin J,PSI.TlG;TM=M:SSsM; 184 

436733; BE327477; H$.166941; ESTs; 7tnL3.oxldored_q5_N,Presennin.PWl; 184 

427161; AI024595; Hs.97508: a disintegrin and metaHoprotelnase domain 6; lg:TM=Y;SS=M; 184 

419462; AF071076; Hs.1 12255; nudeoporin 98kD; DEAD,hencase_CNudeopoALFGihomeobox;8S=M; 183 

41 3658; AA055369; Hs.75456; A kinase (PRKA) andior proteto 1 0; none.nons; 283 

400749; ; ; NM.003105*:Homo sapiens soriHIn-relaled receptor, L(DIR dass) A repeat&<»ntalnlng (SORLl), mRNA.; 

EGF,fn3Jdl.rBC8pLa,WijecepLb.granulin.BNl^TM=Y;8S=M; 183 
447388; AW630534: Hs.76277; Homo sapiens, done MGC:9381. mRNA, ccmpblecds: TB^OP1.HVA22iTM=Y:SS=M: 183 
413243; AA769266: Hs.193657; ESTs; pklnase,zf-C4.ERM,CNKnone; 183 

423690; AA329648; Ks.23804; ESTs, Weakly similar to PN0099 80»3 protein [H.S8pIens]: toiutrans.IQ.none; 182 

447993; AW139525; Hs:i70362: ESTs; none,none; 182 

423061: AI290473: Hs.44807; ESTs; inlegrin_B,Sema.PSI,TI6,none; 182 

440619; AW408586; Hs.91052; ESTs. Moderately simflar to ALU5J1UI^N AliJ SUBFAMILY SO SEQUENCE CONTAMINATlOM WARNING ENTRY [H.6apiBns]; 

8bhydrolase_lnone; 182 
423497; U92642: Hs.129701: G protelnhcoupted receptor 45; 7tnLl:"rM=Y;SS=M; 181 
446126; AW085909; Hs.10177; pleckstrin homology domato Interacflng protein; none.none; 181 
452488; N74921; Ks.184389; ESTs; non8;TM^M;; 180 
449515; AI653378; Hs.302012: ESTs; ton_trans:TM=Y;SS=M; 179 
443881; R64512; H$.237146: hypothetical protein FU12752: none.none; 179 

449636; A1556608: HsJJ81328: ESTs, WeaHy similar toT00378 KIAA0641 protan [H.saplenst pkInaseMmone3;TW=Y;SS=M: 178 

424348; AB020523; Hs.266258; endonudease G^e 1; Endonudease;TM=M;SS^ 176 

418844; M62982: Hs.1200: araditdonale 12-lipoxygen85a; DpQxygenase.PLAT:TM=M;: 178 

442233; AW967149; Hs.28439; ESTs, WbaUy sbnOar to 138022 hypothstkal protein (H.8apiens]: MIF,siig8r.lr«none; 178 
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450010; AW293B01; Hs.255052: ESTs; ARI0.7tm-1; 2-78 
452813: U54727; Hs.191445; ESTs; pkin3S8.AcflvliUBcp.ron8: Z78 
416177; N44987; ; ESTs; (ddnasejione; 2.78 

408014; AA723782: Hs.41749; protein Wnase, cGMP^petwJettf, type 11; cNMP_blndlng.pklnase:S$=M; i77 
448362; AA641767; Hs^lOIS; hypothetical prol^ OKFZpS64l0864 simflar to WAT1; sugar_tr,TM=Y;SS=M: 2.77 
423994; X01057; Hs.1724; InterteuWn 2 receptor, alpha; 5Ushl:TM=Y:SS=M; 2.77 

427342: AL110150: H3.176680; Homo sai^ens mRNA; cDNA OKFZp5B6O0724 (ftwn dona DKFZp586D0724); none;NA;NA; 2.76 
447574: AF162666: Hs.16895; totisledas Mnasa 1; pkinase;TM4«;: Z76 

442681: AlB091d2; Hs.130907; ESTs; tran$ketol8se,E1_dehydn)9,trarBl(eU)yr.tiaisl<^^ 2.75 
433537; AW024214: Hs.102307: ESTs; Na.sulph_symp,aaj)ennea5es:TM»Y:SScM; 2.75 
458997: AW937420: Hs.69662: ESTs; SH3.RhoGAP,FCH:TM=Ms 175 
432284; AA532807; Hs.105822; ESTs; pWnase.none: 2.74 

406139;:: Target Exon;ig.Tul»:TM=Y;SS=M:i74 . . , ' , 

43951 8; W76326: ; 9b2d60d04 j1 SoaresJetaLheaJtNbHHlSW Homo sapiens cDNA done S" sImBar to contains Ahi repetitive dement mRNA sequence: ArmadiBo_8eB,non8; 
Z74 

428536; A1143139:H5.228e:visinifak0l;emand;SS=M; 2.73 ^« 
40021 1: ; ; NM_003899*:Homo sapiens PAK^teracflng exdiange factor beta (P85SPR), mRNA. VERSION NM-003897.1 Gl; SH3.PH,RhoGEFJerpene_synIh:TM=Mr. Z73 
402129; ; ;TaraetExon; SH2.Pepli(lase_C9:TM=M:: 2.73 
424238: AA337401: H3.137635: ESTs; none;TM=M;SS=M: 2.73 

433834; AA620742:Hs.130786: ESTs; SPX,EXS;TM=Y;:Z73 . . 

409339; AB020685: HS.S4037; ectonudeotlrfe pyrophosphatase/Jjhosphodtesterase 4 (pulathre function); SU!fataseJ'hospnoale8l;iM=M;5S=M; ZTZ 

408163; AW779842: Hs.258217; ESTs; 7tm.l,z^BJ»ox,zf.C3HC4.7lm_1.zW_box.z«3HC4; 2.73 

422358; AL133030: Hs,1 15429; Homo sapiens mRNA for K1AA1666 protein, partial cds; SH3;TM=Rfc ZJZ 

426409; AA594207; ; gb:nn29e01.8l Na^CGAP.GasI Homo sapiens cDNA clone 3*. mRNA sequence; pldnase,Fibrinarfn,none; 2.72 

400645; ; ; Target Exon; lig_chan,SBPJbi_3,ANF_receptor,none; 2.72 

443661; AA336609; Hs.10862; Homo sapiens cDNA: FU23313 fis, done HEP1 1919; 8denylatBlanase.none: 2.71 
442572: AI001922; Hs.l35121; hypothetical protdn FU22415; none.HSP70: Z71 

409317; U20165; Hs.53250; bone moiphogenetic protein receptor, type II (BMPR2K |*lna5e/ctlvlnjecp:TM=M;SS=M; 2.71 

403201;;; Target Exon; none;; 2.71 ^ ._ ^ «,„ 

459357; AW848421; ; ob;ll3^0214>1 50200^75-611 CT0214 Homo sapiens cONA, mRNA sequence; ABC_tran«ABCjmembrane.lon.tr8n$; Z70 

439935; S75105; Hs.8358; glutamate receptor, lonotroplc. Iiainaie 2; ANFjeeeptor.l)o.dian,none; Z70 

414924; C06267; Hs.44247; ESTs; non8.non8: 169 

421008: BE259378: Hs.l03147; hypothetical protein FIJ21347; DUF255:: 269 

449951; AA004982: Hs.120904: ESTs; DED.CaIsequ8Strin; 269 

411226; AW833022; ; ob;RC^TTOOO$-191099412-4)04TT0005 Homo sapiens cDNA. mRNA sequence; pkinase.none: 2.68 

417625; U59305; Hs.44708; Ser-Thr piotdn Mnase related lo the myotonic dystrophy protein Wnase; pIdnase,l>ZIP,frgamma,K^j*iRas8_C;SS=M; 268 

408051; AI623351; Hs,172148: ESTs; PH,RhoGAP.non9; 268 

412521; AW753481; Hs.294022: hypothetical protein FU14950: SH2:TM=M:: 268 

413922; Ai535895; Hs.221024; ESTs; Ion„transjWDR_rrPR.MlR.UOP6T: 268 

4321 88; AI382952; Hs.2928; sohite canter fantily 7 (cafionic an*» add transporter, y system), member 1 ; aaj>ermeases:TM=«Y:SS=M; 267 
41 5516: F1 141 1 ; : 9b:HSC2WF081 nonnalbed Infant brain cONA Homo sapiens cONA done c.2wl08. mRNA sequence; lon^trans.none: 267 
419749; X73608: Hs.93029; Bparc/osteonectln, cwcv and kazai^ domains proteoglycan (lesUcan); kazaI,lhyrogloburtn^1;SS=M; 268 
416095; AW014327; Hs.221951; ESTs, Wealdy similar to 138022 hypothetical protein (H-saplensl; lg.zW:3HC4.CbLN.CbLN2CbLN3.none: 266 
403609; ; ; C3001 199:gi|7494834|piliT15308 hypolhetica! protein B0286.2 - Caenorhabdilis etegans||41; 7lmJ,7tm_2GPS,WIF;TM=Y;SS'=M; 266 
458213: AU)47521; Hs.12210; hypothetical protein FU137325iniartDtBnsln: p)dnas8.none; 266 

426158; NM.001982: H8.199067; v^b2 avian erythroblastc leulfflmla vlrd oncogene homolog 3; Furin-ilw,pWnase.RBcep_U.domaln,Furin- 

Dke,pk}nasa.Recep_Ldomdn.Peptidas8.M24; 266 
435410; AL135057: Hs.117182: ESTs; non8,pWn8se.RBD.DAG_PE-bind; 266 

437838; AI307229: Hs.184304: ESTs; CARD,ICEj)20.iCEj)10.HrT.vdtage_a(;CBS,HCCAJsbmerase; 2.66 
430293; AI416988: Hs.238272: Inositd 1,4.5-tr^phale receptor, type 2; ion_1ran8.RYDRJTPR.MIR.none; 265 
433090; AI720050: ; immortallzationmpregulaled protein; none:SS=M; 265 

432103; T15803: Hs.272458: protein phosphatase 3 (formeity 28). catalytic subunlt. dpha boform (caldneurin A dpha); MBtanophos;TM»M;; 265 
435852; H72303; Hs.38011; ESTs; pkinase.none; 264 
433327; AI674779; Hs.126744; ESTs; none,7tnu1: 264 

438459; T49300: Hs.35304; Homo sapiens cONA FU 13655 lis. done PlACEIOl 1503; none,FM(Wike: 264 
432251; AW972983; Hs.232165; pdycythemia nibra vera 1; cell surface receptor; none;TM=M;SS=ft/l; 263 
446963; A1B62668; Hs.176333; ESTs: OMPdecase,Pr!bosyttran,pkinase,RhoGEF.PH; 263 
444821; AA053564; Hs.12040; STE20-like Wnase; pkinase:TM=M;: 263 
435206; AK001451; ; C02-assodated proteh; none.none; 263 

434370; AF1^8: Hs.58346: ectodysplasto 1. anhidroOc recepton deatM<unll2L6Pn:TM»Y;SSf:M; 263 

439039; Ai658707; Hs.48713; ESTs; pkinase,ncne: 263 

449656; AA002008; H8.168633; ESTs: PIP5K,none; 263 

429341: X73874; Hs.2393; phosphoiylase kinase, alpha 1 (musde); none:TM=M:; 262 

445174; AV652850: Hs.172004; Utin: (h3,lg,SGXXSG,none; 262 

424950; AA602917; Hs,156974: ESTs; none,CDP-OH_P.transt 262 

438141; AW946871; ; gb:RC2-ET0022-080500^12-d02 ET0022 Homo sapiens cDNA, mRNA sequence; SH2,STAT,STAT,ljind,STATj)rot,non8: 261 
434938; AW500718; Hs.81 15: Homo sapiens, done MGCi16169, mRNA, complete cds; pkinase.TBC,Rhodanese;TM=M;; 261 
409264: NK/L014937: Hs.52463; KIAA0966 protdn; Sy}q.N:TM^; 260 

458438: AI141520; Hs. 1 51454; ESTs, WeaMy similar to ALUCJ1UMAN llil ALU ClASS C WARNING ENTRY til {Ksaplensl; pWna5e,none; 260 
400719; ; ; NM_004055^Homo sajdens caipain 5 (CAPN5). mRNA. VERSION NMJM43352 Gl: C2.P6pWase_C2.CdN'Ull;TM=Ms 2.60 
427318; AF1B6081; H5.175783: zinc transporter; 25p,TM=Y:SS=M; 259 
426086; T94907: Hs.188572: ESTs; PH.Ets.CH,spedrln.Ca.di8nneLB^i»; 259 

430105; X70297; Hs.2540; dwOneiglc receptor. nIcoBnfc. alpha pdypepBde 7; Neurjch8n_lJB0,Neur.chanjnemb.pWnase;TM3Y;SS=M; 258 

411495; AP000893: Hs.70359; K!AA0136 protein; HATPasej:,baP:TM=M:; 258 

438167; R28363; Hs.24286: ESTs; none;TM=Y;SS=M; 258 

418749; N75147; Hs.22488: ESTs; non8jM2H2KRAB.pldna5e: 258 

454289; AL137554: Hs.49927; protein kinase NYIVSP15: dCMPjcyUleamtTM?M;; 258 

443805; H0686S; Hs.l34l31; ESTs; eihand.ion.tiBn$.none; 257 

429429; AA829725; H5.334437; hypotheticd prolan M6C4248: none,trBnsmembrane4; 257 

403088; ; ; Nh^003319*:Homo sapiens titin (TTN), mRNA. mRNA.; ta3,ig.S6XXSG:TM=M.-; 257 

409190; AU076536: Hs.50984; sarooma amplified sequence; transmembran84;TM='Y;SS=M; 257 

426696; AW363332 Hs.l71844; Homo sapiens cONA: FU22296fl8, done HRC0446B:lg;TM»Y;SS«M; 256 

403328; ; ; Tarset Exon; GlycoJ)ydro.35;TM»Mr, 256 
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426167; AR)39023; Hs.167496: RAN binding protein 6; AfmadiBo_5eg,HEATJ>BS;; 2^ 
42869S; AI355647; H5.189999; purinergic receptor (iamly A group 5); 7tm.1;TM=Y;S8BM; 2^ 
419285: 031687; H8.8386B; KIAA0062 protein: Zip;TM=Y;SS='M; 2.54 

415740; NB0486; Hs.3991 1; Homo sapiens mRNA for aJ00089 protein, parDa! cds; CBW-21:TM=M;; 2.53 

403305; NIVL008B25; ; transmemlirwe protein (53X0), endoplasmic reljcu!um/Gol0 IntennediatB compartment; pkinase;TW=Y:SS=M; i53 

443804; AL1 35352; Hs.255883; ESTs, WeaMy similar to 138022 hypothetical protein |H.saptensJ; Peplidasa^M18.Peptidase^18,Yj)hosphatBS8: Z53 

450425: H06607; Hs.6099; ESTs; E1-EZ>^TP8se,Caflon„ATI^^C,CatioTLATPase.N Wra»as^ 2.51 

401702; ; ; NM_00t 17r :Homo sapiens ATP-blnding cassette, sulhfarrily C (CRR/MRP), memlw 6 (ABCC6), mRNA.; ABC_tran,ABCjn8mbrane:TM=Y;SS=M: 2.50 
439463; W69304; ; gb3d46fQ1.r1 Soares_fetaLh8artNbHH19W Homo sapiens cDNA done mRNA 8e(|uence; fh3,Y j>hosph8l85ejioiie: 2.50 
425975; AB01 1082; Hs.165559; organic ^lionic trnnsporter^Bte 4; sugar_tr;TM=Y;; Z50 

443259; AW090601:Hs.69171;proteb kinase 2; picin3se,pWnase_W1,^^^ ^ « 

400777; ; ; NiyL007325':Homo sapiens glutamate receptor, lonotrophlc, AMPA 3 (GRiA3). transcript variant flip. mRNA.; fig jchan.SBP.bac.3^Fjeqeptor;TM=M;SS»Y; 
2.49 

426044; AAS02490; Hs.170290; ESTs; none.none; 2.48 

454564; AW807573; ; gtj:MR1'ST008W)21 299^)04^01 ST0088 Homo sapiens cONA, mRNA sequence; pI«nase.none; Z48 
415938; BE383507; Hs.78921; A kinase (PRKA) anchor protrin 1; KHHtomaln,TUDOR;TM=M:SS=M; Z47 

426481; AW963941; ; (^:EST376014 MAGE resequences. MAGH Homo sapiers cONA, mRNA sequence; Yj)hosphatase.Band,41,OSPc.rwne; 2.46 
426005; AA377499; ; gb:ESTB0341 Synovial sarcoma Homo sapiens cDNA ? end. mRNA sequence; tubulln,FKBP.COX6B.7liiLl.tubUtiiuC;SSBM; 146 
424879; AA348D13: Hs.273385: ESTs; arf.&a)pha.none; Z46 
415156; XB4908; Hs.78060; phosphorytase kinase, tieta; none;TM=M;; Z46 

416508; R39769; ; ESTs, Moderately sonHar to ALUS HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION WARNING ENTRY [Ksaidens]; 

SH3.PDZ.Gtt8nylatB3in.2U5.none; Z46 
408087; AW150645; ; gtKxg54l07J(1 NCLCGAP.Ut4 Homo sapiens cDNA ctone 3*. mRNA sequence; XYPPX^^Cjnembrane^Clran; Z46 
433434; AA58842g; ; gti:no22b03.8l NCI CGAP Pr22 Homo sapiens cDNA clone 3*, mRNA sequence; pklnase.DNAjnls jep8ir,HATPasejD: 2.45 
446768; AV660305; Hs.110286; ESTs; l6Lp20.D£D.lCej)10,ICEj)20.DED; 2.45 
437158; AW090198: ; KIAAl 150 protein; none:NA;NA; 2.45 
430177; AW969233; Hs.302746; MSTP028 protein; l^telranone; 2.46 

422270; AF114494: Hs.1 14052; protein tyrosine phosphalase4ike {proline Instead of catalyBc arglnine). member a; nonajTM^Y;; Z45 

430680: AW138724; Hs.16B974; ESTs. Highly simitar to ALU7.HUMAN ALU SUBFAMILY SO SEQUENCE CONTAMINATION WARNING ENTRY IH.sapiensI; 

Y_phosptiataseAlaplin_N,Yj>hosphatase;2.44 
446569; AW248031; Hs.155839; hypothetical protein MGC12934: adhjlncPGK.Semiddhy(Je_dh;SS=M; 2.44 
41 1902; AW875344; ; gb:RC1-PT0009-220300.013.{06 PT0009 Homo sapiens cON^ mRNA sequence; none,pWna5e,ank; Z43 
430057; AW450303; Hs.2534; bone morphogenetic protein receptor, type lA (BMPR1A) (ALM; Acllvin_recp4»Wnase;TM=Y;SS=M; Z43 
446338: Ai289121: Hs.206978; ESTs; none.SH3; Z42 
426221; AB0078B1; Hs.110613; KIAA0421 protein; nonB.RibosomaL88; Z42 
446796; AI652497: Hs.110103; RNA polymerase I transcriptton factor RRN3; none,none: Z41 
428360: H10291: Hs.30974; ESTs; pklndse.PBD.none; Z40 

428379; X06026; Hs.2259: CD3G anOgen, gamma pdypeptide (TfT3 con^); ITAM;TM=Y;SS=M: Z40 

432488; AA551010;Hs.216640: ESTs; Na_5ulph„symp.none;Z40 u . .o^ni o.a 

407235; 020569; Hs.169407; SAC2 (suppressor of aclin mutations Z yeast homotogHke; none,Ribosonfal_S13,GaIactosyLT,Zlp,adh_short.2f-C3HC4; Z40 
448595; AB014544: Hs.21572; KIAA0644 gene product; LRRLRRGT;TM=Y;SS=M; Z40 

428283; AI439096; H3.323079; Homo sapiens mRNA; cDNA DKFZp564P1 16 (from done OKFZp564P1 16); Y j)hosphatase,lh3,lg,none; Z39 

432460; K1291Z Hs.Z74691; adenylate kinase 3; adenytatekInase.none; Z38 

429549; AI333013:Hs.250505;retlnofc add receplDf. alpha; none.rf-C3HC4,BRCT,ng_d^^ 

42^ AW137635; Hs.4423B; ESTs, WeaWy similar to S55657 alpha-ICadrenergic receptor splice fomi 2 (H.8aplensl; Ptosphodiesl,Somalomedln_B,EndonuclBa5e.none; 2.36 

417473; M55268; Hs.82201; casein Idnase 2, alpha prime polypepflde; pWnase ABC1;TM=M;; Z35 

453186; AK001708; Hs.32271; hypothetical protein FU10846; TK.DUF300;TM=Y;SS=M; Z33 

447276; AL049795; Hs,17987; hypothefical protein MGC1203; none;TM=M;; 2.33 

445310; AI242490; Hs,153290; Honw sapiens cDNAFU14318 lis, done PLACE300040Z none.pkinase; 2.31 

432942; AF0839S5; Hs279852: G pntehKiOUpled lecepton 7tnu1.gtobln;TM=Y:SS^M; Z30 

434693; AW976001: Hs.337603; ESTs; none.none; Z26 

452034; F12234: Ks.75893; ankyrin 3, node of Ranvier (ankyrin G); ZU5.(Iealh,none; Z25 

423732; AF058056: Hs.132183; solute cenier family 16 (monocarboxyCc add transporters), member 7; sugar„tr,TM=Y;SS=M; Z25 

404956; ; ; C1003210':gi|6912582|rBflNP 036524.11 pefllnlHomo sapiens) flfl6009487|dbi!BAAB492Zl| (AB; none,PI3.PI4Jdnase,PI3ieCZPI3Kjbd,PX.Pl3Ka,C2; Z24 

452183; NM-006594; H5.28298; adaplDwelated protein complex 4. bete 1 subunit; AdaptiruN.Y jjhosphatase; Z23 _ ^ . 

420529; D25259; H3.3198M; ESTs. Moderately siniar to 154374 gene NF2 protein [H.sapiensJ: pk!nase.DAG_PE-bind.RBD.ras,DC1,GFP;TM=fcte Z21 

408808; BE074219; Hs.17230; hypothefical protein FLJ22087; Armadnto_8eg;TM=M;SS=M; Z21 

451932; AA360954; Hs.27268; Homo sapiens cDNA: RJ21933 fb. done HEP04337: SH3.PH,RhoGEF;TM=Mi; Z21 

432008; AW296791; Hs.193170: hypolheDcal protein FU21687; UM,Synaptophysln,h3n_trans,K0W; Z20 

455840; BE145897; ; gb:MR04<T0208.221299-204-b07 HT0208 HomosaptenscDNA, mRNA sequence; PI3JPI4_klnase.PI3Ka,Pl3^Pl4_Wnase.Pt3Ka; 2.19 
429238; NM„002849;Hs.19828B. protein tyrosine phosphatase, receptor fype,R:YjliP5phate8« . . 

430975; AA490055; ; gb:ab05b09.8l Stralagene fetel rdna 937202 Homo sapiens cONA done J, mRNA sequence; adeny!ateWnase.ThymWyteteJdn;TM=M;; Z17 
407174; T79938; HS.7706Z leukocyte kniTnmogtobuft>^ike receptor, suMart% B (with TM and FTIM domains), member 5; ig,none; Z16 
450921; AA098790; Hs, 146245; ESTs. Moderately similar to T17242 hypolheOcal protein DKFZp686B1417.1 [asaptens); none;NA;NA; Z15 
427209; H05509; Hs.92423; KlAAl 566 protein; pldnase;TM5M;; Z14 

401917; AL050149; ; RAN binding protein 3; Orexln,SHZSTAT,SrAT_bind,SrAT_prol,tonJransPACPAS.none; Z12 

426359:AA376409;Hs.10862;Homosa|tenscDNA:FU23313fis,cteneHEP11919;8denylatekInase.^ . . 

4395^ W78548; H1336621: ESTs, Moderately sImDar to ALU5J1UMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION WARNING ENTRY (H.saptens]; ton.trans.nQn6: 
Z06 

410439; R35943;Hs.63758; transferrin receptor 2; PA:TM=>Y;;Z05 . . 

448698; A1S64769; Hs.173070; EST, Weakly similar to ZN4Z.HUMAN ZINC FINGER PROTEIN 42 (MYELOID ZINC RNGER 1) (MZF-1) (H-saptensJ; none,zr.C2H2; 104 

449543; AF07063Z Hs.23729: Homo sapiens dona 24405 mRNA sequence; ICtetraJonJran8,none; Z04 

453496; AA442103; Hs.33084; solute canter family 2 (fadHtated gluoose/fructose transporter), menter 6; 8ugar.tf:TMsY;5S=M: Z02 

443952; AI149106; Hs.143530: ESTs; pWnase,non8; Z02 

437589; AA761322; HS.26986Z. ESTs; SHZSH3.CZPKRasGAP,none; Z02 

422837: AA399024: Hs.118836; myogtoWn; globln;TM=M;; Z01 

450253; AL1 33047; Hs ^^4715; Homo sapiens mRNA; cONA DKFZp434D0215 (fromdone DKFZp434D0215); partial cds; SH3tTM=M;; 1.97 
401984: ; ; C17000146*:gq2143629|pirIiA57l56 Ca2-»Aa!moduIifHlependOT^ prolrin kinase (EC Z7.1.12?) I; pUnase^ 1.96 
453464; AI88491 1; Hs.32989; receptor (calcitonin) adivlly modifying protein 1; none;TM=Y;: 1.95 
417733; AL048678; HS.B2503; H.sa(tois mRNA for 3mR of unknown protein; none;NA;NA; 1.94 

411450; H49519;Hs.127301; ESTs; pklnase,none: 1.82 ^ 

406303; ; ; C16000922:oq74991O3MT2O903 hypothetical protein F14F4.3b- Caenorhabdilis etogans gl; ABCjr8n,GTP_EFTU,PRK,ABCjnembrana;TM»Yn 1.80 
425009; XS8268; H8.154151; protein tyrosine phosphatase, receiHor type, M; fh3J9,Yj)ho5phatose.MAM;TM»Y;8S=M; 1.74 
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425280; U31519: Hs.1872: phosphoenolpyruvatecarboxsfklnase 1 (solutte); PEPCK;TW=M;: 1.65 ncfwni-t*. < » 

425958; AW1S3271; H3.301839; InlraceDular enOgen detected by monodonal anlftjody KH; Intracellular hyahironan-bcnding proteto; Yjhospliatas8iUBPc;TM=M;; 1.63 
432563; NM.013261; Hs.198468: pefoidsome pidSMm acSvated lecepbr, gamma. ooacQvator 1; rnn;TM=M;; U1 

TABLE 498 

Pkey: Unique Eos probesel idenliiier number 
CAT number. Gene duster mimbar 
Aooesskxc Genl)ank sxesslon numlKfs 

Pksy CAT Nuiriber Accession 

438091 22448 1 AK054860AV652198AW52192AV652138AV652127AV652194BE935919A\^2017AVB51995AV65154^ 

AV646179 AW880409 AA345002 BF155189 BE068331 X56197 AL603014 AW953629 BM263548 BE550772 AA701084 AI581352 AA35B689 
AW938841 BF438147 W05391 H75313BF326185AV646335AV651589AV646340AV651992AV646384AV646364AV687497BF1551B3 
AV646370 AW797876 AI906821 X55196 BEB33835 AAfi28440 BE833808 BF224205 AA709126 8E673807 AI923886 AA947932 AI276125 AI185720 
AW510698 AA387230 BEienOB AW8S8628 AWB98544 AI1469B4 AW043642 AI288245 All 86932 AI635262 A1139455 A1298739 AI813854 
AI024768 BE699445 BE699444 A!707807 D52654 AI214518 AI004723 AIG98085 AW087420 AI565133 AA845571 AW898622 BF110144 AW5132B0 
AI061 12S BF382770 A1268939 AI435818 BF475318 AI024767 BE174213 AA757598 AA51 3019 AA902959 A1860794 AI334784 BF10841 1 
BM310532 AW513771 A1951391 AI337671 BF095606 BF095601 BF095468 AW890091 BF095753 AW243400 AW898607 AW898818 BF362762 
AI922204 AW898625 BE699466 BE174196 AW102923 052715 BE699456 D52477 D55017 6F955933 6G623563 AV646254 AA463S22 61003244 
AI299190 W40186 BE174210 BF939091 BF4341B0 AW579001 T55662 H01611 TS2522 8F94S037 BF955938 D54679 D53933 R67100 BG925552 
BF99g056 RB3430 Z29922 T85791 W039^ H63289 AI091537 BF086583 AA345570 H48870 H80720 T83S23 BI039826 B1037700 R00353 
BF1 55164 N98343 N79072 H01812 T55581 

411089 5597 6 BI009308 B1009893 BF922023 BF922909 BF922913BF922096 BF957733 BE701791 AA456454 AA579B76 BF933710 AA091294 B1W7291 

AW905577 AW975593 AA713730 AW836781 AA666384 AA551106 BF594606 AI082382 A1955808 AI679895 AI679386 BF435555 AA5B6359 
AA551351 AA595822 AA565168 BF808855 AAS84921 N86077 AA601031 AA633188 AA514764 AA454562 AA5512g7 AA936109 B1009389 
AW897806 BE816442 BF739374 61009310 6F925422 BF933709 BF922034 BF9254S5 B1009880 

439285 22495 1 AF085101 AL133916AW955684AW950828AI346341 AI867454BM2B3376BF432231 AI421Z79AI85527^ 

N52351 AA626243 N59253 A1341407 AA456968 AA457077 AI358918 AA364013 N79113 N547B4 BE175639 M76721 AV727392 Z45529 Z44343 
F05908F05403 F053g8 

438089 22448 4 BM475665 BE644917 AW770789 AW952971 N54853BM263259AI224545AI18486BN69114AW518902AI440169AA809472AV654440 

AA281642 AU1 85230 AW337382 AI872923 AI537113 N73882 T83378 H63731 BF671764 AW897824 AI811204 AA344646 BE0091 12 66899664 
H91240R60548 N41701 

432407 MH1429 12 BG036675BF772005BF771866BG960386BG960381 NM.00571 2 AF1 1031 5 BE074534BE182776BE1 58000 8E1 57999 BE714315AW81 6104 
AW847519AA099426 AW817981 AW856396BG961122 AA224498 AA30B542 AWB21833 BF902155AI732411 BG77B834 BG283641 BE748279 
BE748870BG319540BE748864BF739224BG985155AK057283 61861468 AA663341 AA457591 60949294 AW392886AA0711 22 AA227849 
AA584918BG959670BF773486A1041698BF959013 R87170C16859BF770411 BF77129BA1075321 LI 3823 AA21 6700 BF771864AW861859 
BE537088 C1B935 AA155719 8FT71 172 BR69107 BF804964 AW81 8172 AW818143 AW392930 AW817057 AW858044 BF74621 1 AA179928 
AW861687 AWB21826 81055726 BF242643 AA207189 BF770412 6F771 157 BG430030 AA055592 
414883 8371 2 AF274943BG494894AI719076 AA908783 AI935150AI422691 AA910644AA583187 BM272167 A1828996AA527373AW972459 AI831380 

AA772418 AI033892 AA100926 AU154749 AI459432 A1423513 AI094597 AA740817 AI991988 AI090262 A)312104 BI256707 AA459522 AA416871 
A1075239 AI339996 AA701623 AI139549 AI336880 AA633648 AI989380 AI362B35 AA399239 AI146955 BF514270 N92892 AI348243 AI278887 
AA4S9292 A1494230 8F507531 AI492600 AA962S96 AW613002 AA293140 AA235549 6F108854 AA954344 N496B2 At4571 00 AW589407 
AW300758 BE220715 BE220898 BE569091 BM009647 6F900351 AI537692 A1203723 A18S7576 AA58441 0 AW371667 BM172363 
451320 12225 2 AK057826 A)631 982 AA224195 AI701458 AA890570 AW966562 AW071907 A1671 352 AI375892 T03517 AI124088 R88265 AI084316 BF223720 
A1354686 T33552 AW205836 eE931 1 15 AI72021 1 T03490 BF084055 AJ372837 T1 5415 BG054890 AA530384 F26326 AI14071 9 AA443303 T33230 
T33623 T3351 1 T33785 AWl 18072 W20198 AI657180T15734 AI419B06 N90552 AA224388 T15909 T03515 D55812 W27899 
400205 2538 1 NM 006265 D385S1X98294 BM477931 B|yi461566AU1 23557 AU1 33303 AU13464gAW»^ 

AW503840BM1524&4AVV505260AI81S984AVV504075AW500716AL597310BC001229BM47437 AA984202AU135205BE090841 AW1637S0 
6F747730BF898637AI206506AV660870AV692110AW386830AV656831 N84710 AW993470 BF086802 BR58454 BG960772 BF757769 
B1870853 BE018627 C75438 AW148744 BR57753 BG622067 BE909924 AA7082O8 6G530266 BF968015 AW992930 BFB88862 BG536628 
AA143164 AW74e953 66498922 8F885190 BF889005 eF754781 6F800003 eM476529 AI627668 AW028126 AL046011 BF590668 AI017447 
AAS79936Ai367597AA699622BE280597An24620AK)82648AW274985AA677870Ai056767BE$S1689AA28 
AW002374AW062651 AA360834 N68822AU135442AU125960 Z78334BE545813AI092115BF312771 BF2428S9 66533816 60533781 
66164745 66492433 8liM73183 AA172043 AA172069 AU157092 AU151353 AU155318 BE302211 AJ375022 AA085841 AU1 57923 H88858 
AA132730 AA115113 AAS09781 AJ475256 AA424206 AW572383 AW084296 All 84820 A14591 78 AA782432 H92184 AA340562 BF195818 
AA852821 AW576342 AA827107 AA173317AW190014AI91B514AA729372AA720718 AI055958 AA331424BE328601 AA515690BI018896 
AW628277 AA748368 AA626222 66492636 AW38062O eF80006B AW370956 AA290909 R25857 66952995 6F801437 AA172077 AU15S890 
AU149783 AI720904 AA902936 AA865727 AI470830 AV740577 AA142982 AA482485 AU 145485 AWS76399 AU156042 R63448 BF246427 
6E928472 D25910 BR56439 BFg68785 6E565238 AA355981 AI905607 B6291148 B6533096 B6532888 6F030886 B6613756 BE928471 
86574501 AA187596 AA361198 T95557 BG531446 B6527242 86527513 B6611 106 AA08S99S BFW7»2 BG024608 BE540261 BG531235 
AL579993 BG108733 BG483503 B6571032 BG492505 
400210 133 1 BC014030NM 004068 BC004996AKD57883 D63475BiyM68205 66386792 B6750447 66575842 B647m 

AU141281 66825395 66338276 eE396231 AU121493 AU131489 6F341132 66335659 61769251 AU142779 AU124483 86480828 AU121353 
66702326 AU137866 86759048 61460601 8E887290 66750415 BF127710 66757819 B1160266 AU122088 BG824046 66744180 eE884943 
BM006610 66702599 BI870749BF307033 81334771 B6480606 AU142599 AU141336 BM01 8563 81335565 AU138308BE391241 86774488 
6E27B025 66717959 86706920 BE312567 AU138300 BE303034 BE303016AU1 39252 8E335797 66470940 BE336735 BF30S197 AU134288 
AU139907 BE261245 BE208718 BG761912 BF212890 AW732692 6M011258 AW247629 AU136696 8F530078 BF204146 AU)48752 66328927 
6E388385 8E260122 AU138789 6E253465 BF733914 AW249415 AW239535 BE31 1791 BE256236 BF529742 B6770465 AW245777 AW245813 
BE296677 BE266852 BE1681 15 BE396596 BE280057 BE168229 AI750820 AU134137 BF792191 AI272215 AI907348 AW238875 BF805152 
BF568397 BE712727 eE081443 6E001805 BF724535 BF744705 AW247385 8E796369 AU133759 AV705142 BE794402 BM465821 BG281284 
AW384831 B^M50689 AU134125 6E31 1650 BF356318 BM462831 BG420555 BE749127 AU134590 BM019438 8C013798 BG761223 66122058 
BEe72076 BG748496 BG821374 BE619159 BG423244 AU1371 10 AU127210 BE2700B1 AA496860 AA351380 AA356303 AU099781 AA355912 
AA37141 1 AA325535 U36188 AL569574 AL577204 AL525543 AL567342 AL567334 AL567122 AL536527 AL567098 6G681585 86824951 
BI870652 BI225655 AW393878 66750632 61223803 eE877552 BE270473 BE389392 BE779021 6NU)17B45 811 17816 BE396513 6M049(n6 
AW393945 6E786941 BE267724 6E562981 8E314236 BF744102 BM019781 BF806063 BF828673 BF829181 81008788 BF829180 6F829175 
eF433802 6E909928 BF839784 AW578564 BF943095 AW328030 AA448596 AW882588 AA909845 BF924341 AW361460 AA425174 BE940557 
BF761585 6F931276 eF934886 W88597 8F933977 BI03590S BE836064 BF837576 BI018368 AA205908 H54612 R88902 8E812330 BE932300 
BF924562 AW805376 6E769859 R67381 AA371901 6E838855 AA326381 W56191 AA341231 AA464093 AL575977 AL518650 AL547393 
B6338327 Alfi17fi63 ALS36921 BG207096 86207589 66199290 66220159 BG744842 66104730 AA555035 AA618009 AA881062 AA610582 
AVV245418AI040983AAS21380AW245455AU147292AU155238BE620286AW250767BF888236BE620B19AU1 
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5 

10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



AW06949S AA701091 AU155225 AW2S0019 AA66623S AU147764 AA449739 BF907S98 AW043731 AU144390 A1924565 BE300831 AWa74021 
BE208088BI732773BF594057A1159873AU143930N24100AI052499At989370AI3661^ 

BF81 1 358 AU151352 AA477554 A!1312gO AIMSTST AU157349 AA620559 AA581 533 A038299 AA402755 A10S0079 AU1 57614 AI280691 
AA706590 BE206615 AA881056 AU153214 AU155712 AW069554 Ai081124 AU155846 AI081 123 AA719876 AI754976 A1027763 AA723095 
8151 8762 AW0S07D6 F37521 R79062 AAS65530 H39826 AA404670 AU156884 AI13S882 AA513633 AI283202 AA651656 AI623515 AA427413 
AW248474 BF641432 BM453246 A1302390 AA808269 A)249357 AA132775 AA716732 AA479932 AI198296 AI208674 A)088433 F38977 AA496899 
AW515762 AA602541 AA34116t D58785T15399 AI286077 W69t52F28418A12ni56AA336034 AAS684S5BE206121 AI080033AI69g880 
AA881255AI953465AI613240AL581773AL531577 BF528351 AA657809 AW152670BE621523 AW078705 AVV573719AA102613AA195604 
R72209 AA402208 AA404305 N24999 A)963535 D54741 BI461415 BM475959 8)260926 B)1 16213 BE280120 BE940258 BE280O44 AI905744 
BG001474 BE937718 BF799283 BI091621 66421006 BG289235 6F73682S 61253429 BG170064 R72512 T92815 BF931257 8M4551B3 BG033362 
BG574220 N31395 BF739185 AA371995 BE87901 1 61198754 BF378989 T12266 BM019421 BI194570 BF378992 AW996595 61858549 W94605 
W61345 AA122384 AA17ig23T92736 AA761 504 AI81 9039 AI439358 AL517777 AL565919 AA622001 AA642695 AA704144 AA081465 AA070621 
AA375562 AA7000t1 AA9268S3 W37310 A1566236 eF677B09 6G760021 AW381433 BE828605 BE268449 BF805977 BG292452 BFg81071 
BF217108 BF928G98 R33993 AW862841 AI857453 AW078733 A1433035 AI018103 R90927 AA80472D AA551 734 R3383S Hig741 R78754 
BF930494 B6109583 AA631926 6E834008 BG9%533 
433470 6624 1 X75684 A1^167A1445461 Ai453743A)983655AI564544AA977180AI694111 Ai591358AVV071625At67^^ 

AI963432AA292956AW192593AI865838AI696905 AI424384 AI161312 AI911921 A1597801 8)494959 AI240988AI492554AW262737 BE044033 
AW008570 AW629505 6)494958 AA088439 AA706O57 BF222820 BF593608 BE501957 AAS24526 BE044134 AW572531 AW015724 BE349186 
AA0432t7 BE219784 A)799814 AA129575 A)671727 A)470033 BE646195 AW779725 AAgOSOSO AA147228 AA404570 At075678 W38161 Aig72739 
AW573152 AA723200 C06123 6F057147 AA627686 AA157944 AI990245 AA66251 7 T32487 A1800106 A1333170 A)859160 W45410 AI990827 
AW275048 AA1 82640 AA478328 A1298935 AW085158 AW471421 AW103470 AW30O456 AW191997 A)823466 AA962397 AA136658 A1251817 
AW339104AA724739AA411100AA191349AA757735AA037696A)769516AVm2283AA010631AI692846A)W^ 
A1245632 A)349390 AA1482B4 A1798502 AA487893 A1621 320 AW194272 C0S365 AA9S3883 BE8S8936 A)91 8523 A!B72628 At927217 A)453453 
AI169366 AW338878 A)261359 A)500576 6F477735 A)032569 AI972899 A)98S583 Z28771 A1383829 AI893030 AA603586 BE773488 AW339301 
6E7734B9 6E773462 6E773495 A1650338 6E773499 At745717 6E811475 6E811470 8E811464 BE811418 BE811415 6E811400 BE811398 
BE81 1388 BE811352 BE773501 BE773494 BE773486 BE773474 BE773473 eE773470 6E773461 BE811350 BE811337 6F593847 BG055071 
AW675302 BF003068 AA7igi73 6E81134B A)562462 AI686240 6E773500 At24484S A)565439 AI918453 A1472527 A1446740 AA035576 AA191414 
AW674145 C05782 A)589264 D57558 A1468237 A)432033 AA98g662 R21752 BF0»2457 AA988297 AL574095 AL576200 A1571074 AL574525 
AL578810BG498381 A)928364 BE879732AA479834 AA479712C177326)I«»1258 eF843901 AW820230C17476 BE3271 20 AA1 29574 
AA136645 eF643900 AW806193 AA502832 AA649494 A156B520 A1547950 8E706937 6E81 1 360 BE773498 BE81 1 401 6E773484 BE811437 
BE811380 BE811399 BF997171 6F757734 BE926037 A1377596 C06111 AW088968 BE811404 BE811472 A1865912 AI925607 AI871950 AI0S3510 
eE90S927 eE811435 AA191387 AW772000 6E811453 BE614379 BF844522 61044896 AI744233 AW984527 C17504 eF843883 Ai248307 
BE773483 A1567995 W60075 6F9411B3 A)738844 6E8114S8 BE773481 A)262930 AA948565 BE706942 BE1 56360 T65026AW242958 AW197954 
BE905184 AA722206 AI344943 A)348877 AI334860 BE621857 BE156280 AA454099 AA037722 BF843897 AW808163 AA04321 6 66482696 
AA182734 AA877242 AW372925 H27252 R38114 BF851858 6E1 5621 4 AA1 90427 T91762 AA035057 AA837326 T10930 BF9065B7 61755027 
BG506731 BC008442 6CX)10166 AL550134AL553096AL548700A).550751 AL547978 AL545286 AL540643AU118627 AL601379BI259821 
BG741786 B!868522 AU135666 61552770 61259210 BI256520 BI2S5569 BG485098 61256228 6 6498501 6M044512 AU133984 AL556586 
eE745111 6)222633 AU133917 63288151 61260715815505506650077361551761 66707601 61818593 eF691383 66721129 BG541578 
BE906666 66751098 61224135 66400746 66478065 6E790436 AW080238 AU137549 66429898 8E392486 AW961688 66721056 6E908365 
BE546656 BG541235AW583735 BG528290 61260895 AW651691 BIM048974 6KdO43805 66142165 AA315186 A)446615 C06300 B6497644 
AA088544A)81S987 6G528631 6E6igi82AW239t85AW062910AW062902AA347236F11933AA488005AA301631 AA376800 D56120 
AA343532AA308636 F00242AA376086 AA31 6988 AA3437g9 61870221 eE9102828G538748AW980564AV732879 016854 AA192519BF922148 
AA216013 66624091 6E544387 66507008 AW176446 BF790033 6E088925 BE0B8854 AA821353 R21800 AA011222 T97525 
BC015334 AA605670 AW014686 N6661S 66029745 AA481 269 H91728 
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AA7046B8 AA652189 AA179463 AL535925 6E275744 6E277708 eE27S715 AW504259 AA354483 BE244197 BE246232 D17055 AW013876 
AW014877 T094&4T08407 AA830246 AW897881 BES01192 BES01195 AL044534 AA256853 81037915 AA448037 BM461769 81825965 BE763352 
AW167531 Z45588AV721881 AA527273AI573219AA457036AW439651 AW264418AA577618A1802954 AA902292AA468752 A1380374 
AA722690AlB67708AAgi6982AI291576AW190427At338089Ai653744A!306665AW513541AW44W^^ 
At671644AW193386A1261832AA775336BF436811AI582703AI278635eE440188AA617898AA648948 81491^ 
F37022AVV770819AA2S8808AI369564AVV503675AA777194eES010486F222087AA042973At868087AA9n 
AA447634 A1282427 F22456 T15901 AAB2S298 AW007436 BE934303 BG981939 AW373814 BM151638 AW956921 BM150080 8M153173 
BM147451 6F953992 AA916696 AW444935 M78398 AW581 147 AW608258 AA651910 AA1 321 52 AW80629S T30326 D20054 AA310837 T06543 
BM194508 6M193225 eM469346 AW964920 AA325930 BI833627 AW952193 AA738189 AA321051 66987199 BF953967 T08890 BE869543 
B67428578G888665AA4568806G0018428F8094S2AW892083eF944342T49551 W69981 BF764519T15869AA132030 

AW9&4027AA377709 

AW848421 771427 

T62567AW833022AW833054 

H20760 R15237 Z43915 BF372479 F11411 

AB038316 6C008888 BE905346 8E301941 AA705936 AW014954 BE376742 AI720050 6E395327 66951204 
AK001451AU151098AW515640AW439618A16715S5A\IV304983AA565885AI829434AWS90882^ 

AAB83540 AA169387 AW771S71 A1130803 BF43e773 AA088710 AI972691 Ai972638 Af762358 AI473907 A)92S905 AA502277 BG943806 

B6218468 AA194853 AU128875 AA306025 86986896 

AA778a49 AW946871 AWg46782AW946955 

AF086263 W69200 W69304 

AW807573AW807572 

AW963941 AA37982S AW963944 AA379S64 

NM-054014 X52220 BC00S147 81551326 A1393601 AW592611 AA608921 AA731598 W96331 AW590007 AI076813 A1022644 AA158365 AI6g9321 

A)146747AW296894 H85337AA017692AA354519AA018512D20081 R02704AA82S671 AA017651 AL135600 R02S85AA018849B6749616 

6F689840 R85326 AA677955 AA702354 AI076645 AI057359 H53178 W86484 HS3074 66988909 AVV96245^ 

R39769T53143H60012 

AW1S0645AW811024AW811148AW811068 

8F81 2525 AW504832 A19725S7 AA58M29 AI299694 

AL050088 AA160485 AW1 73544 AW296506 AW439860 AI521563 AI702529 AI393606 AW1 38323 AA570109 H19504 BM021988 BF0&3327 
BF593552 AA630766AI597717 AI807128AA523012 AI356250AW451857 AA974203 AI762577 8F512552 AW0O73O7 BE675286 AW450602 
AAg62057 AV\^16069 A15e2546 6F221924 6F222543 AI801808 AW468599 AW0X73& AI866625 AW235356 BM021837 AA91 1956 AI680606 
We6516 T03370 AW61 1634 H41653 AH68349 H19588 AW080198 AW043993 R39847 
AW875344 AW875287 AW875285 AW875286 BF361295 AW875402 AW875400 
BE145816 eE145897 6F349721 eE145865 

AKD57266BI767614BI828S86AW0693626IB29572 A1828091 81819382 AL04O4O2 
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Pkey: Unique number corresponding ban Eos jOTbesd ^«^-ri-nuA 
Ref: Sequence source. The 7 digit numbers Uiltocduiwi are 6enbartkWenli88r(GQ Dunham Letd. refers to the puhOcaliofienliOed The D^w 

wquencoofhumanchromosonieaZ" Dunham Letal, Nature (1999) 402:4BW95l 
Strand: Indicates ONA strand from which axons were predicted. 
NLposition: Indicates nudeolide posithins of pretficted exons. 



Ptey Ref Strand Ntposflion 

402260 3399665 Minus 113765-1 1391 0.11 5653-1 1576S.11 6608-11694 

401027 7230983 Minus 70407-70554.71060-71160 

400991 8096825 Plus 159197-159320 

406137 9166422 Minus 30487-31058 

404083 9944029 Mbws 16650-17082 

404440 7528051 Rus 80430-81581 

400792 7382433 Plus 134339-134593 

404289 2769644 Plus 15049-15288.30287-30457 

401083 3242744 Plus 33192-33360 

402211 7689783 Itfinus 67414^229 

402705 8782736 Phs 89981-901 14,90773-90895^1 131-91 261 

402233 7690102 Rus 90281-91477 

405370 2078469 Minus 38980-39111 

400846 9188605 Rus 39310-39474 

405484 5922025 Rus 199214-199579.199672-1999201200262-20049 

401345 9926424 Phis 148042-148392 

400843 9188605 Rus 586^5970.7653-7784.8892-9023,9673-9807, 

406364 9256114 Minus 50715-50833 

405490 7705240 Rus 20883-20850 

400755 81190S3 Mir)us 120084-120889 

404276 9885189 Rus 1Z7624-127856 

402915 7405502 Minus 140-276 

405616 5649378 Minus 2782-3308 

400847 9188605 Rus 4464344835 

402328 4454283 Minus 13758-13922.14558-14752 

405369 2078469 Minus 34183.34357.35686-35751 

400845 9188805 Plus 34428^12 

403716 7239669 Rus 86899^122 

402447 9796640 Rus 4750547729.5 1696-51 821 ,52070.52257.5330 

404140 9843520 Rus 37761-38147 

405516 9454624 Rus 112707-112876.113676-113854 

405110 8096888 ftfinus 118940-119100 

403608 8308266 Mbius 121321-121476 

401241 4B273W Minus 30503-30844,31056^1248 
405102 8076881 Minus 120922-121295 
404185 4572584 Minus 129171-129327 

405545 1054740 Rus 118677-118807.119091-119296,121626-12182 

405411 3451356 Minus 17503-17778,18021-18290 

405502 4753260 Rus 44647-44778 

403391 9438337 Phis 4241042544,83317-83540,8684^869223797 

403869 7280046 Minus 34379-34583 

404942 7382153 Phis 92095-92252 

403142 9444521 Rus 89286-90131 

400844 91BB605 Rus 24746-24872;?5035.25204 

402704 8782736 Rus 37368-37493 

402833 8918545 Plus 26987-27778 

401851 7770425 Minus 1 46443-146664,147794-1 47971.14835V14848 

401242 4827300 Mhnis 32616-32863 
401943 4914397 Rus 65925.66371 

402807 6456148 Minus 101542-101650.103476-103658 

402603 9309396 fJStna 141663-141852 

405328 3253114 Plus 21399-21583 

402974 9B63349 Rus 124035-124321 

400987 8086488 Minus 22052-22185 

403335 8568884 Rus 112307-112524.114074-114703 

401 1 1 3 9966541 Minus 1 9419-19969 

401185 9625304 Minus 177393-177691 

404537 8247909 Minus 188775-189573 

405266 4156171 Minus 63337-63552 

402615 9926801 Rus 131390-132157 

400566 9884730 Phis 64488-64714 

403212 7630897 Minus 156037-156210 

403290 8083176 Rus 19288-20076 

401342 9908882 Rus 30953242 

400471 9331670 Minus 105629-105760 

405588 5002511 Rus 4618046366 

400539 7574902 Rus 8559^721 

403743 7652003 Minus 136463-136646 

403912 7710730 Minus 72000-72290,72431-72700,72929-73199 

405099 8074292 Minus 114365-114514.128635-128831 

401445 8218584 Minus 93700-93885 

405480 2766593 Rus 33325-33659 

402183 7658390 Minus 100518-104298 

400749 7331445 Minus 9162-9293 

406139 9166768 Minus 72397-72602 
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402129 


7704953 


Minus 


lODlSD-lroSOO 






Minus 




403201 


9958297 


Minus 


109782-109934 


403609 


8308266 


Minus 


125974-126320 


400719 


8118911 


Minus 


4457944656.45294-45487.46449^6641 


403088 


8954241 


PhJs 


169894-170193.170504-170808 


403328 


8469086 


Minus 


12042S-120703 


403305 


6099945 


Plus 


114632-114805 


401702 


1871197 


Minus 


66182-66325 


400777 


8131663 


Plus 


70745-71121 


404956 


7387343 


Rus 


55883-56203 


401917 


9S02466 


Rus 


25054-25229 


401984 


4454511 


Plus 


103825-104024 


406303 


8575868 


Plus 


173822-173786 



Table SQA lists about 41 4 genes up-regulaled In non-seminomatous mixed genn ceil testicular cancer compared to nonnal adult tissues. These were selected from 58660 pfobesets 
on the Affymetrix/Eos Hu03 GeneChip airay such that the ratio of 'average' non-semlnomatous mixed germ cell testicular cancer to 'average* nonnal adult tissues was greater lhan 
or equal to Z The 'average' non-seminomatous mixed genn testicular cancer level was set to the B5th percentile amongst non-seminomatous mixed gemff cell testicular cancers. 
The 'average' normal adult tissue level was set to the 95th penxntile amongst norwnalignant tissues. In order to remove gene-specific background levels of non-specific 
hybridizallon, the 10th percentile valua amongst the non-malignant nonnal body tissues was subtracted from both the numerator and the denominator before the ratio was evaluated. 

Table 51A lists about 518 genes up-regulated in seiranomatous testicutar cancer compared to nomial adult tissues. These were selected from 59680 probesets on the Alfymetrbc/Eos 
Hu03 GeneChip array such that the ratb of 'average' seminomatous tesUcular cancer to 'average' nom^ adult Gssues was greater than or equal to Z The 'average' seminomatous 
testicular cancer level was set to the 851h percentile amongst seminomatous testicular cancers. The 'average' normal adult tissue level was set to the 95th percentile amongst non- 
maOgnanl tissues. In onler to remove gena-spedlie background levels of non^specRic hybrUlzatkm. the 10th percentile value amongst non-malignant normal body tissues was 
subfracted from bottt the numerator and the denoriilnalor before the lalto was 

Table 52A lists about 673 genes up-regulated in testicular cancer (non-seminomatous and seminomatous) compared to normal adult testicular tissues. These were selected from 
59680 pmbesets on the Aflymetibc/Eos Hu03 GeneChip an^y such that the ratb of 'average' testicular cancer to 'average' nonnal testnular adult tissues was greater than or equal to 
6. The 'average' testicular cancer level was set to the 75th percentile amongst testicutar cancers. The 'average' norma! adult testicular tissue level was s^ to the 95lh percenflte 
amongst non-malignant testicular tissues, tn order to remove gene-spedfb background levels of norv^pecific hybridizatton, the 1 0th percentile vahie amongst non-malignant nonnal 
body tissues was subfracted from both the numerator and the denominator before the ratb was evaluated 

Table 53A Osts about 735 genes up-regulated bi tesQcular cancer (non-seminomatous and seminomatous) compared to nonnal adult tissues. These were selected from 59680 
probesets on the Ailymetilx/Eos Hu03 GeneChip anay such that the ratio of 'average' testicular cancer to "average' nonnal adult tissues was greater than or equal to 3. The 
'average' teslkndar cancer level was set to the 9SIh percenple amongst tesGcular cancers. The 'average' nonnal adult tissue level was set to the 95th percentile amongst non- 
mdlgnant tissues. In order to remove gene^pedSe background tev^ of norvspecffic hybrldlzaUon, Oie 10th percenQle value amongst non-malignant nonnal body tissues was 
subtracted from both the numerator and the denominator befriie the ratk) was evduated. 

Table 54A Ssts about 476 testi-spedflc genes downregulaled in testicular cancer (non^emlnomatous and seminomatous). These were selected from 59680 probesets on the 
Affymetrlx/Eos Hu03 GeneChip array such that the ratio (R1 ) of nonnal test) to nomial adult tissues was greater than or equal to 3. R1 was calculated as the mean number of 
interquartile range values over the median nonnal adult body Qssue expression among nonnal testicular samples. The ratio (R2) of 'average' normal tesfi to 'average' testicular 
cancer among Ihese genes was greater than or equal to 1 The 'average' nonnal testi level was set to the 50th percentile amongst normal testis. The 'average' nomial testicular 
cancer level was set to the 95th percentile amongst tesQcular cancer samples. In order to remove gene-spediic background levels of nonspecific hybridization, the 10th percentiie 
v^ amongst non-m^ant nonnal body tissues was subfracted bom both the numerator and the denominator before the raSo was evaluatBd. 

Table 55A llste about 586 genes up-regulated in non-semlnonr^ous mixed genn eel tes&^ular cancer compared to normal adult testioMar tissues. These were selected from 59680 
probesets on the Affymefrix/Eos Hu03 GeneCNp array such that the ratio of 'average* non-seminomatous mbced germ cell testicular cancer to 'average' normal adidt testicular 
tissues was greater tfiai or equal to 4. The 'average' non-seminomatous nvxed genn cell testicular cancer level was set to (he 95th percenfile amongst non-seminomatous mixed 
genn cell testicular cancers. The "average' norma) adult testicular tissue level was set to the 95th peroentile amongst non-malignant testicutar tissues. In order to remove gene- 
specific backgn>und levels of non-spediic hybridizaOon, Oie lOth percentile value amongst non-malignant nonnal testicular Qssues was subfracted from both the numerator and the 
denominator beftxe the reSo was evaluated. 

Table 56A lists about 812 genes up-regulated to seminomatous testknitar cancer compared to nonnal adult testicular tissues. These w^ selected from 59680 probesets on the 
Affymefrb(/Eo6 Hu03 GeneChip array such that the ratio of "average' seminomatous testicular cancer to 'average' norms! adult testicular tissues was greater than or equal to 4. The 
'average' seminomatous testicular cancer level was set to the SOth percanOIe amongst seminomatous tesflcular cancers. The 'average' nonnal adult tissue level was set to the 95th 
percentile amongst nofMnsfignant testicular Hssues. In order to remove gene-specific background levels of rtort-spedSc hytiiMfizatton. the 1 0Bt percentile vsiue amongst non- 
malignant nonnal testicular tissues was subtracted from both the numeralor and the denominator before the rafio was evaluated. 

TABLH50A: 

Pkey: Unk^ue Eos probeset identifier number 
ExAccn! Exeniplar Aocesdon numbefi Genliank aocesston number 
UnlgenelD: Unlgene number 
IMgene Title: Unlgene gene titto 



R1: 


of nonseminomatous nAted germ ceD testicular cancer compared to nonnal adutt ti^ 


Pkey 


ExAccn 


UniGene 


Unlgene THte 


R1 


432666 


AW204069 




ESTs, weakly similar to unnamed protein 


74.60 


432730 


At086520 


Hs.131358 


ESTs 


50.55 


450581 


AF061513 


Hs.25195 


TGF-beto4 


47^5 


418696 


AW959433 


Hs.326290 


hypothetteal protein FIJ12S81 


44.05 


423458 


A1204212 




ESTs 


36.60 


428664 


AK001666 


Hs.189095 


simOv to SAUI (sal (DresophHaj-like 


30.60 


448981 


Ai968719 


Hs.195387 


ESTs 


26.40 


407710 


AW022727 


Hs.23616 


ESTs 


24.00 


429466 


AF155827 


HS.203S63 


hypoDieScal protein FU10339 


19.35 


451106 


BE382701 


HS.2S980 


N-MYC oncogene 


18.85 


417407 


AA923278 


Hs.290906 


ESTs. W^ddy stoto to protease Itisapl 


18.40 


420759 


T11B32 


Hs,127797 


Homosaptens cDNA FU11381 fis. ctone HE 


18.25 
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424578 AK001973 K5.150890 hypolheQcal protebi 17.88 

418758 AA2522S4 Hs.226949 ESTs 17.20 

404998 TeigelExDn 16.15 

_ 447534 AW953935 H5,288655 ESTs 15.80 

5 456847 AI360456 H5^7776 ESTs 15.00 

446879 AI654443 Hs.197683 ESTs 14i0 

438915 AA280174 Hs.28S681 Williang-Beuren syndrome c hr omosoms regl 14.75 

452838 US5011 Hs.30743 preferentlafly expressed anQgen in mela 14.70 

449322 AIS38616 HS.19&566 ESTs 1435 

10 418007 M135D9 Hs.a3169 matrix melaBoproteinase 1 (inteistHlal 14.20 

448776 BE302464 Hs.30057 MRS2(S.cefevisiaeHite.magneshimhom 1Z95 

433330 AW207084 Hs.132816 hypothetical protein M6C14801 12.70 

410102 AW248508 Hs.279727 ESTs;homologi»of PEIA^IOonasavlonyi 1155 

447188 H65423 H3.17631 hypothetica) protein OKFZp434E2135 1143 

IS 406547 Target Exon 12^ 

434549 AA738254 Hs,165390 ESTs, Highly sintitarbA40350transcrfp 12.10 

428227 AA321&49 Hs.224d smal) indudt)le cytokine sutrfamllyB(C^ 11.65 

408908 BE296227 HS.250B22 serineAhfeonine Idnase 15 11.55 

437099 N77793 Hs.48659 ESTs.Higli!ysirrtlartoS14458lamlnln a 11.05 

20 430676 AF084866 gb:Homo sapiens envelope protein RIC>3( 10.08 

426866 U02330 Hs.172816 neuregulini 10.05 

446791 AI632278 Hs.195922 ESTs 10.05 

433159 AB035698 Hs.150587 klnesin^ilce pnM 2 ^85 ' 

428479 Y00272 Hs.334562 ceO division cyde 2. G1 to Sand G2 to 8.95 

25 427521 AW973352 ESTs 8.92 

452291 AF015592 Hs.28853 C0C7 (ceH division cyde 7, a cerevisi aSO 

427486 AA974433 fitmblast growth tactor 4 (heparin secr 8.52 

425266 J00077 Hs.155421 alphEMetoprotein &S0 

408465 AW196940 Hs.253277 ESTs 8.47 

30 444971 AI651116 Hs.1488S9 ESTs 8.35 

413318 AU076607 Hs.75285 inter-alpha (globulin) InliQjitor, H2 pel 8.35 

425769 U72513 Hs.159486 Human RPL1 3*2 pseudogene mRNA, complete 8.00 

409731 AA125985 HS.S6145 thymosin, iieta, identified In neuroblast 7.95 

424905 NM_002497 Hs.153704 NiMA (never in mitosis gene a)-relatedk 7.75 

35 412265 AA101325 Hs.86164 hypothetical protein FU 12457 , 7.65 

407340 AA810168 H5.284289 vitiHgo-associaled protelnVIT*1 7.50 

453884 AA35S925 Hs.36232 KiAA0186g8neprodud 7.36 

422956 BE545072 Hs.122579 ECT2 protein (Epithelial ceil transform] 7i5 

432239 X81334 Hs.2936 matrix metatloproteinase 13 (cotlagenase 7.25 

40 440119 AA865455 Hs.125331 ESTs. Moderately simitar to unluiown {Rs 7.22 

431840 AAS34908 Hs.2860 POU domain, dass 5, trsnscriplbn facto 7.13 

435918 Al^3538 Hs.86232 growth differenOalion factor 3 7.13 

412537 AIJ031778 nudear transcription factor Y, alpha 7.08 

416658 U03272 K5.79432 fibriilin 2 (congenital contractura) ara 7.05 

45 428916 AF003001 Hs.194562 telomeric repeat binding factor (NIMA-ln 6.88 

424085 NM_002914 H5.139226 repQcalion factor C (activator 1) 2 {40 &75 

453392 U23752 Hs.32964 SRY (sex detemMv region Y)-box 11 &75 

437052 AA861697 Hs,120591 ESTs 6.75 

425427 A1652662 Hs.157205 branched chain aminotransferase 1. cytos 6.72 

50 443523 AKD01575 Hs.9536 hypolhefical protein FU1071 3 &71 

457465 AW301344 Hs.122908 DMA replc8lton fector 6.62 

442832 AW206560 Hs.253569 ESTs 6.54 

427711 M31659 Hs.1BO408 sotute carrier family 25 (mltodiondrial' 6.30 

453913 AW004683 Hs.78934 mutS (E. cdi) homdOg 2 (coton cancer. &30 

55 448588 AI970276 Hs.15690S KtAA1676 6.12 

436608 AA628980 Hs.192371 down syndrome crtDcal region pnTleinDS 6.09 

415857 AA866115 Hs.12n97 Honu) sapiens cDNARJIISSI lis. done HE 5.95 

412642 BE244698 Hs.809 bep3locytegrowthM}r(hepapoiBlinA: 5.85 

^. 443068 AI188710 ESTs 5.65 

60 438450 At050866 Hs.65853 noda^ mouse, homolog 5.81 

441287 AW293132 Hs.131373 ESTs 5.80 

425572 A6011076 Hs.158307 undifferentiated embryonic cell transcii 5.76 

416747 AW876523 Hs.15929 hypothetical protein FU12910 5.75 

^_ 436802 AW247145 Hs.192729 ESTs &70 

65 441627 AA947552 H5.56086 branched dialnaniinQiransferasel. cytos 5.60 

440304 BE159984 Hs.l 25395 ESTs 5.60 

432407 AA221036 Hs.13273 gb2ri)3f12/1 SIratagene NT2 neuronal pr S.S6 

436812 AW298067 gb:U)4^W0-^p^*0^Ui.s1 NCI.CGAP.Su 5.55 

423873 BE003054 Hs.l 695 matrix metaSoprotelnase 12 (macrophage 5.51 

70 431354 BE046956 Hs.2S1673 ONA(cytosine«-HneOiylfr8nsfBFBse3be 5.51 

430044 AA464510 Hs.152ai2 ESTs 5.47 

437036 A1571514 Hs.133022 ESTs 5.45 

435663 AI023707 H3.134273 ESTs 5.40 

427667 AK001279 H3.180171 HomosapienscONARJ10417fis,doneNT 5.40 

75 416111 AA033813 Hs79016 chromatin assembly fadorl.subunit A ( 5.21 

447254 NM.004153 Hs.17908 or^in recognition complex, subunit 1 (y 5.15 

434551 BE387162 Hs.280858 ESTs.Highly sin«artDA35661 DNAexds 5.15 

430272 X04898 H5.237658 apoRpoproteinA^I 512 

o_ 427961 AW293165 Hs.143134 ESTs 505 

80 424315 AW614850 H5.193^ pulalaSve 28 kOa protein 505 

409798 AA248S87 Hs.30237 ESTs. WeaMy similar to AlII6.HUA4ANUil 5.00 

418477 AW022g83 gb:df46h11y1 Morton Fetal Coddea Homo 5.00 

418378 AW962081 0b:E8T374154MA(£ resequences. MAGG Homo 4.95 

591 



wo 03/042661 



PCT/US02/36810 



4302S5 AK00O7O3 Hs.323822 Homo sapiemmRNA (or KIAA1551 protein. 4.94 

443537 D13305 H5.203 cholscystoktrdn B receptor 4.92 

431494 AA991355 Hs.298312 hypolhe&a! protein DKFZp434A1 315 4.90 

416861 AA634543 Hs.79440 IGF4I mRM/UmdIng prdein 3 4.85 

5 423642 AW452650 Hs.157148 hypothetical protein M6C13204 4.60 

449S92 AI655494 Hs.195718 ESTs 4.75 

407300 M102616 Hs.120769 gb3n43607.s1 StrdaoeRBHeLaoeas393 4.73 

420333 AJ001383 Hs.97084 lymphocyte antigen 94 (mouse) homolog {a 4.68 

446700 AW206257 Hs.156326 Human DNA sequence from done RP11-145t^ 4.61 

10 445537 AJ24S671 Hs.12844 EGF4k»dom8in. mullipte 6 4.60 

448966 AW372914 Hs.86149 phosphrrinosilal 3i)hosphdeMing piot 4.S0 

439570 T79925 Hs^9165 ESTs, WeaHlysfmRar to ALU1.HUMAN ALUS 4.50 

440006 AK000517 H5.6844 NA1P2 protein; PYRIhkConlalnlng APAFl-B 4.48 

402145 Target Exon 4.48 

15 408750 BE294069 Hs.93581 hypothetical pn)teinFU10S1 2 4.47 

453289 Ai188161 Hs.144627 ESTs 4.45 

430252 AI638774 Hs.105328 testes develbpmenkelated NYD^P20 4.40 

422689 AW856665 gb:RC3-CT0297-29010OO13^03 CT0297 Homo 4.32 

426427 M86699 Hs.169840 TTK ptotein kinase 4.30 

20 420047 AI478658 Hs.94631 brefeldin A^hibitsd guanine nucleotide 4.20 

430287 AW182459 Hs.125759 ESTs, Wealdy similaf to LEUS.HUMAN LEUKE 4.18 

419635 NM.00S033 Hs.91728 potymyositis/^efoderma autoantigen 1 ( 4.15 

416209 AA236776 Hs.79078 MAD2(mitoiic8nest deficient, yaasth 4.10 

438188 AA779975 H8.1288S9 ESTs 4.10 

25 435514 AW592804 ESTs 4.10 

442333 AI650877 Hs.129302 ESTs 4.05 

413627 BE182062 Hs.246973 IntranofBicauda) D homolog 1 4.00 

445140 AI650599 Ks.197913 ESTs. Weakly siniiiar to SCP3 MOUSE SYNAP 4.00 

448038 AVV016073 Hs.232026 ESTs, Wealdy slmBar to R052_HUMAN 52 KD 4.0O 

30 458814 A1498957 Hs.170861 ESTs. Weakly sinfllartaZ195_HUMAN ZINC a95 

419423 D26468 Hs.90315 KtAA0007 protein ^95 

440527 AV657117 H8.1&4164 ESTs, Moderately sinflar to S65657 alpha 3.95 

441553 AA2B1219 Hs.121296 ESTs 3.95 

432415 T1697r Hs.289014 ESTs, Wealdy similar to A43932 mucin 2 p 191 

35 409757 NM_001898 Hs.123114 cystatinSN 3.89 

432281 AK001239 Hs.274263 hypotheficalpideinFU 10377 3.88 

450351 BE547267 Hs.59791 hypothetical protein M6C1 31 83 3.85 

403780 C4001759:gi|1332S0|8ppi9474IRO5?.HUMAN 3.64 

421917 AB028943 H5.109445 KIAA1020 protein 3.84 

40 417153 X57010 Hs.81343 collagen, type II. alpha 1 (primary oste 3.84 

429120 AK0Q1673 Hs.196530 hypothetlcai protein FU 10811 3X2 

410193 AJ132S92 Hs.59757 zinc finger protein 281 3.80 

453922 AF053306 H5.36708 l)uddinguninhil}iladt)ybenzimidazoles 1 3.80 

415829 AW450198 Hs.163742 ESTs 3.78 

45 440953 AI683036 Hs.124135 Homo sapiens cDMAFUl 3051 fis, done WT 3.77 

439780 All 09688 gbiHomo sapiens mRNAhiBtengfii Insert 3.70 

422938 N(;L001809 Ks.1594 centromere protein A (171(D) 168 

415947 U04045 Hs.7Bg34 mutS(E.coli)homQlog 2 (colon cancer, 3.66 

423123 NM 012247 Hs.124027 SELENOPHOSPHATE SYNTHETASE ; rtiman sden 165 

50 420900 AL()45633 Hs.44269 ESTs 3.65 

426572 AB037783 Hs.170623 hypothefical protein RJ11'183 3.65 

426498 D31765 Hs.170114 KIAA0051 protein 3.60 

452461 N78223 Hs.108106 transcription factor 3.60 

418379 AA218940 Hs.137616 fidgetin-like 1 3.50 

55 442573 H93365 Hs.7567 brandieddiainamtnotransferasol.cytos 3.48 

428301 AW628666 H3.98440 ESTs, Weakly slilflar to 138022 hypOtheQ 3.45 

419384 AA490866 Hs.39429 ESTs 3.44 

453932 AW006303 H8.329295 ESTs. Weakly similar to (defline not ava 143 

446293 A)420213 Hs.l49722 UM domain transcription fator UM-1 (h 141 

60 422094 AF129535 Hs.272027 F-box only protein 5 140 

416661 NM_001949 Hs.l189 E2F transcription fador 3 140 

423198 MB1933 Hs.1634 ceIldhrisioncyc}e25A 139 

424153 AA451737 Hs.l41496 MAGE-(lke2 138 

417705 AW134952 H8.175220 hypothetical protein FU14541 3.37 

65 443715 AIS83187 HsS700 cydinEI 134 

420281 AI823693 Hs.323494 Predidedcalton efflux pump 134 

449571 AW016812 Hs.200266 ESTs 134 

424687 J05070 Hs.l51738 matrix metaJtoprolelnase 9 (gdaOnaseS 131 

452807 AA028933 Hs.162434 ESTs 131 

70 422756 AA441787 Hs.119689 tilyooprotelnhoiniones,^hapdypeplide 130 

421650 AA781795 Hs.122587 ESTs 130 

418355 L42563 Hs.1165 ATPase, H? transporting, nongastric alp 3.28 

438494 AAg08678 Hs.130183 ESTs 123 

„^ 424568 AF005418 Hs,150595 cytodirome P450, suljfamiiy XXViA. polype 3.22 

75 433764 AW753676 Hs.39g82 2!rc linger prot^RlKZF(NM.023929) 120 

427642 R40761 Hs.9834 ESTs 120 

414747 U30872 Hs.77204 oentrome(epRMF(3SV400I(O.mltosih 116 

403432 N1^001622:Homo sapiens 8lpha-2-H&glycop 118 

442618 R56222 Hs.26514 ESTs 117 

80 415799 AA653718 Hs.225841 DKFZP434D193 protein 117 

416000 R82342 Hs.79856 ESTs. Weakly similar to S656578lpha-1C- 115 

450431 AW136797 Hs.266041 ESTs 113 

433800 Aia34361 HS.1351S0 hrngtype^ceDmernbrane-assodatedgty 3.12 
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430835 A)240006 Hs.192326 ESTs 3.12 

419741 NM.007019 H5J93002 ubiquiSn carrier fnotsinES-C 3.10 

417791 AW965339 Hs.111471 ESTs 3.10 

434609 R76593 gb7i60c11.r1 Soares placenta Nb2KP Homo 3.05 

5 430253 AK001514 Hs.236844 hypothetical pidsinFU 10652 3.04 

411975 AI916058 Hs.1445a3 ESTs 3.01 

430491 AL109791 Hs.241559 Homo saftamRNAfuntengtli Insert cDN 3.00 

413943 AW294416 Hs.144687 Homo sapten&cONAFU129811ls. done KT Z99 

440207 AI37197B Hs.128326 ESTs ^98 

10 435726 BE535787 Hs.1 13170 ESTs 2.97 

432840 AK001403 Hs.279521 hypothetlcaJ protein FU20530 Z97 

450149 AW969781 Hs.132863 Zlc family member 2 (odiHiaired DrosophI ^95 

435373 AW865538 Hs.117689 ESTs ^ Z93 

452571 W31518 HsJ4665 ESTs Z93 

15 454679 AW813110 gbK)M«T0189O510994)21.f05 ST0189 Homo 2.91 

414972 BE263782 Hs.77695 K1AA0008 gene product Z90 

437496 AA452378 Hs.146668 Homo sapiens mRNA;cDNADKFZp547J125(fr Z90 

420092 AA814043 Hs.88045 ESTs 2.89 

438378 AW970529 Hs.86434 hypolhetiCBl protein RJ21 81 6 2.89 

20 434414 AI798376 gbir34l)07jc1 NQ.CGAP.Ov23 Homo sapiens 2.87 

422746 NM.004484 H3.1 19651 glyplcanS Z87 

446258 AI283476 Hs.26347B ESTs 2.86 

444371 BE540274 Hs^ fofkheadboocMI 2.86 

409517 X90780 troponin I, canfiac 2.85 

25 414034 U89277 Hs.30S98S early developnientfegidatorlihomologo Z84 

443169 A1038687 Hs.133338 ESTs 2.84 

447519 U46258 Hs.339665 ESTs 2.84 

453785 AI368238 H8.283732 EST8,Moderdely8ln^toALU1J1UMANA Z84 

406687 M31126 maMxmetaBoproteinasell (siromelysin Z83 

30 416201 AA467752 H8.195161 ESTs 2.83 

412140 AA219S91 Hs.73825 RAB6 interacting, kinesln-like (rabklnes Z83 

457191 A!376228 Friend leukemia vims integration 1 Z82 

410704 BB)76754 gl).<M1-eT0601-180200-121*b10BT0601 Homo 181 

413646 BE155042 95^^0349.1012994)02^ HT0349 Homo 180 

35 421307 6E539976 Hs.10330S Homo saf^ns mRNA; cDNA DKFZp4346042S (f 175 

427719 AI393122 Hs.134726 ESTs 175 

451684 AF216751 Hs.26813 C0A14 175 

414530 NM-000506 H$.76530 coagulaQon factor n(tluombin) 174 

442032 AW016786 ESTs 173 

40 437123 Ali)49285 Hs.302053 Homo sapiens mRNA; cDNA DKFZp564M193 (fr 172 

446528 AU076640 Hs.15243 nudeoJar protein 1 {120kO) 172 

442007 AA301116 Hs.142838 nucleolar phosphoproleln Nopp34 171 

438180 AAB08189 Hs.272151 ESTs 170 

453900 AW003582 Hs,226414 ESTs. Weakly simHar to ALUB.HUMAN ALUS 170 

45 423765 R23858 Hs.143375 Homo sapiens, clone IMAGH:3640937, mRNA. 169 

420949 AA934063 Hs.13836 ESTs, Weakly simBar to 138022 hypotheU 169 

413813 M96956 H$.75561 teratocardnom»ierived flrowtti factor 1 168 

433914 AF108138 Hs.112160 Homo sapiens DMA helica5ehomok}g(PIF1) 167 

445413 AA151342 Hs.12677 061-147 protein 166 

50 448769 N66037 Hs.38173 ESTs 166 

411022 AW936378 c|b:QV4^0021-30129M71-f05 DT0021 Homo 165 

423600 AI633559 Hs.310359 ESTs 165 

447175 AI385208 Hs.293608 ESTs 165 

414151 AW976468 Hs.257245 ESTs 165 

55 448877 AI583696 Hs.253313 ESTs 162 

427584 BE410293 HS.17971B v-myb avian myelobtastosis viral oncogen 161 

440591 AA431599 Hs.132799 hypothetical protein aJ23451 161 

449665 AI655391 Hs.143375 Homo sapiens, clone IMAG&3840937, mRNA, 161 

453775 NML002916 Hs.35120 • replication factor C (activator 1)4 (37 160 

60 429226 AIS53633 ESTs 160 

410929 H47233 Hs.30643 ESTs 159 

427528 AU077143 H8.179565 m!nfchromosomemalnlenancadelideiri(8. 158 

446142 AI754693 Hs.145968 ESTs 156 

445093 AI207197 ESTs 156 

65 413688 A1469213 Hs.71404 ESTs 155 

447733 AF157482 Hs.19400 MA02 (mitotic arrest defident. yeast, h 155 

420218 AW958037 nlxjsomal protein L4 155 

407275 AI364186 gb:qw34h07ji1 NQ.CGAP_Ut4 Homo sapiens 155 

414312 AA155694 Hs,191060 ESTs 155 

70 421535 AB002359 Hs.105478 phosphon-bosytfomiylglycinaniklinesyntha 155 

439979 AW600291 Hs,6823 hypothetical protein FU10430 152 

426075 AW513691 Hs.270149 ESTs. Weakly BimHar to 2109260AB cell 151 

435096 AA664977 gb3iu73b07.s1 NCI.CGAP^l Homo sapiens 150 

422468 AA355210 gb:EST63589 JurlcalT-cellsVHomosapien 150 

75 449576 AW014631 Hs.225068 ESTs ISO 

415684 D59356 sort^tol dehydrogenase ISO 

452226 AA0248g8 Hs.157103 ESTs 150 

421451 AA291377 Hs.50831 ESTs ISO 

424308 AW975531 Hs.154443 mintchromosome maintenance deticient(S, ISO 

80 418203 X54942 Hs.83758 COC28 protein kinase 2 149 

453941 U39817 Hs.36820 Bloom syndroms 149 

413762 AW411479 HsJ48 FKSO&binding protein 4 (59U>) 149 

449655 AI021987 Hs59970 ESTs 149 
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430521 NM.016383 H5.242183 HOM-TE&85 tumor anttgan 149 

447444 AKD00318 H$.18616 hypo(heiicslpiotB&tRJ20311 Z48 

414618 AI204600 Hs.9697B hypoUielical protein M6C10764 148 

. 445383 NM_005993 Hs.12570 tubuQiv^pacffic Chaperoned 147 

5 452404 AW450675 Hs512709 ESTs 146 

444823 BE262989 Hs.12045 putaGve pfOteai 146 

427675 AWISBISO Hs.180248 zinc finger protein 124 OHZF-1 6) 145 

444159 AF118846 Hs.10431 dead rhger (DrosophOa)^ 2 (bright 145 

436211 AK00158I HsJ34828 hypolheOcalprtrtelnFU10719: K1AA1794 145 

10 416734 H81213 Ha.14825 ESTs. WeaMy similar to K1AA1 503 protein 145 

433183 AF231338 Hs.222024 transcription f^BMAL2 145 

447350 AI375572 v-en>-a avian eryttirotilastic leukemia vi 145 

428728 NM.016625 Hl191381 hypotlistical protein 143 

407325 AA291180 Hs^28476 ESTs.Wealdysin^k>attema8v8lysp 143 

15 410276 A1554545 angiopoieOn.2 142 

444670 H58373 Hs.33293B hypothetica) protein MGC5370 142 

419029 AA233397 Hs.326290 liypothetlcal protein FU12S81 142 

437908 AI082424 ESTs 141 

414812 X72755 Hs.77367 monoWne Induced by gamma blerferon 141 

20 425202 AW962262 H3.152049 ESTs. Weakly slmHar b 138022 hypotheti 140 

425212 AW962253 Hs.171618 ESTs 139 

423787 Aj295745 Hs.236204 nuclear pore complex protein 138 

425601 AW62g485 Hs.l40720 GSK<3 binding protein FRAT2 138 

449576 AW380579 Hs.209857 ESTs 138 

25 429487 NM_004477 Hs.203772 F5H0 region gene 1 137 

453227 AW135862 Hs.243991 ESTs 137 

417833 AW003251 Ks.86264 hypottietical protein aJ14549 136 

451999 AW176401 Hs.27424 DEAD/H (Asp^u^a-Asp/His) box polypep 136 

407910 AA650274 H8.41296 fibroneclin leucine ricli transmembrane p 135 

30 418666 T65754 gb:yc11c07.s1 Stratagenelung(93721Q)H 135 

410060 NM_001448 Hs^367 glyplcan4 135 

449138 AW294215 Hs.195631 ESTs 135 

425159 NMJ004341 Hs.154868 carttamoyt^phale synthetase 2. aspart 135 

434808 AF155108 Hs.2561S0 Homo sapiens. Similar to RIKENcONA 2610 135 

35 436481 AA379597 Hs.5199 HSPC1 50 proldn similar to ubiqulQn-OOn 134 

410275 U85658 Ks.61796 transcription (actor AP.2ga]Ttfna(8Cthfat 134 

407818 AL021938 Hs.40154 jumonp (mouse) homdog 134 

417777 AI823763 Hs.7055 ESTs. Weakly simRar to 178685 serineAh 133 

401704 NM.021195*:Homo8apiensdaudIn6(CLON6 133 

40 449670 F07693 Hs.85603 Homo sapiens mRMA;cDNADKFZp434K2172(f 132 

424081 NM.006413 Hs.139120 ribonuclease P (30kD) 132 

422809 AK001379 H3.121028 hypothetical protein FU10S49 131 

429271 AF039850 Hs.198515 dead ringer (DrosophHaKil^ 1 130 

^_ 432865 AI753709 Hs.152484 ESTs. Weakly similar b 138022 hypatheS 130 

45 421379 Y15221 Hs.103982 smaH inducible cytoldne subfamily B (Cy 130 

410166 AK001376 Hs.59346 hypothetical protein FU 10514 130 

448755 AW503807 Hs^1907 histone acetyltransferase 130 

403433 NML001622:Homo sapiens 8^ha>2-HS^cop 129 

441031 AI110684 Hs.7645 fibrinogen, B beta polypeptkfe 129 

50 419594 AA013051 H5.91417 topoisomerase (DMA) II binding protein 128 

407289 AA135159 H3.203349 Homo sapiens cONAFU12'149fis. done MA 127 

425910 AA830797 Hs.184760 CCAAT-box-bindng transcription factor 12S 

401220 branched chain aoiinotranslafase 1. cylos 126 

453985 N44545 Hs.251865 ESTs 125 

55 414890 BE281095 Hs.77573 uridine phosphorytase 125 

409014 H83115 Hs.49760 origin recognitk)n complex, subunil 6 (y 125 

416140 BE613836 Hs.83551 nMbriltar^odatod protein 2 125 

424765 AA428211 hypothetica! protein FU140336indlar to 125 

419278 AU076799 H5.1247 apolipoproielnA^ 124 

60 412123 BE251328 Ks.73291 hypothetical protein FU10881 124 

438459 T49300 H3.35304 Homo sapiens cDNA FU13655 Us. done PL 2.23 

417273 AK002209 Hs.smi Homo sapiens cDNAFU 11 347 fis.ctone PL 123 

449722 BE280074 Hs^60 cydinBI 122 

^_ 443184 AI638728 Hs.131973 ESTs 122 

65 416391 A1878927 HS.792B4 mesodemispedfio transcript (^nouse) horn 121 

440983 M20681 Hs.7594 sohito carrier family 2 (fadfitatedglu 121 

435045 BE297155 H5.143e98 ESTs 121 

414883 AA926960 (X)C28 piotem kmasa 1 121 

446323 AI268274 H$.345792 ESTs 120 

70 410855 X97755 HsWIB RA054(Sxerevi8laeHil« 120 

448757 AI366764 Hs.48820 TATA box binding protein (TBPHBSOdato 120 

450254 NM.004885 Hs.99231 neuropepUde G proteiivcoupied receptor, 120 

418973 AA233056 H3.191518 ESTs 120 

413582 AW295647 Hs.71331 hypoUietlcal protein MGC5350 120 

75 434334 AA912476 Ks.1 16750 Homo6q)lenscONAFU13221fis,ck)neNT 120 

443748 AW208447 gb:UI-H^1^ar9^2-0>UL8l Na.OGAP.Su 120 

415989 A1287700 ESTs 120 

400195 NM_00705r:Hoino sapiens ZWIOinteractor 120 

428878 AA436884 Ks.48926 ESTs 120 

oO 431805 NM.014053 Hs^0594 avCR protein 119 

446839 6E091926 H5.16244 mitotic spindle coiletMlretetod prat * 119 

424381 AA285249 Hs.148329 protein kinase Chk2(CHEK2) 118 

417389 BE260964 Hs^S mkildne{neuritoorouvthiMomotingtelor 118 
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423905 AW5799G0 HS.13S150 lung type4 cell tnembrane-associatedgly 2.18 

413992 W26276 HS.104S57 RNA.U28maDnudsar ^1S 

412722 AI343300 Hs.15091 ESTs i18 

_ 409089 KM_014781 Hs.5M21 KIAA0203 gene podud Z\7 

5 430809 AI791150 Hs,262009 ESTs. Moderately similar to 138022 hypo! 217 

406542 C19000728*:gqi2585552|sptQgY2Q1|Z257.KU Z17 

420509 MB3554 Hs.1314 tumor necrasb factor receptor superM 2.17 

425234 AW152225 Hs.165909 ESTs. Weakly simllsr to 138022 (lypdhsa 2.17 

,^ 425580 L11144 Hs.lS07 gaMi Z16 

10 4393SB AA284267 Hs.221504 ESTs 2.16 

452833 BE5596B1 H5.30736 KIAAD1 24 protein 2.15 

421350 AW301608 Hs.276168 ESTs, Moderately similar to 154374 gene Z15 

444863 AW384082 Hs.104879 serine (or cysteine) proteinase inhibito Z15 

449410 AA001356 Hs.18159 ESTs 2.15 

15 442717 RB8362 Hs.180591 ESTs. Weakly similar to T23976tiypotheli 115 

427953 AA417944 Hs.44331 ESTs Z15 

422281 ^36603 Hs.346935 liemopexin Z15 

433875 AW977653 Hs.753ig ribonudeoQde reductase M2 polypeptide Z14 

444960 Ai611317 H5.341531 ESTs ZU 

20 415890 K0822S Hs.258712 ESTs 114 

402099 ENSP00000217725*:LBmlninfltpha-1 chain p 114 

427779 AA906g97 Hs.180780 TEI^ protein 2.14 

453005 AWD5S308 Hs.31803 ESTs, Weakly similar to H-WASP[H.8apisn Z14 

422170 Ar791949 Hs.112432 anIi-Munerian hormone 114 

25 414161 AA138108 Hs.184852 KIAA1 553 proton 114 

437623 083880 Hs.5719 chromosome condensaOon^dled SMOesso 113 

449810 AB008681 Hs.23994 actrvinAreceplor.lypellB 111 

450663 H43540 Hs^5292 ribonudsase HI, laige subuntt 111 

419525 T79257 Hs.1259 asldoglycoprotein receptor 2 111 

30 424727 AW590378 Hs.152519 hypothefical protein FU20674 110 

418592 X99226 Hs.284153 Fanconi anemia, comptementatkm group A 110 

425292 NM.005B24 Hs.155545 37 kOateudnMlch repeal (LRRJptotdn 110 

430821 AA487264 Hs.154974 Homo sapiens mRNA:cDNADKFZp667N064(fr 2.09 

418552 AFig82S4 Hs.86088 IGF-II mRNA^Aidlng proteto 1 2.09 

35 408291 AB023191 Hs.44131 KIAA0974 protein 109 

425474 Z48054 H3.158084 peroxisome receptor 1 109 

453028 AB006532 Hs.31442 RecQ protein^ 4 2.09 

447831 AI433293 Ks.164115 ESTs 108 

437162 AWD05505 Hs.54&4 thyroid hormone receptor ooadhraSngpr 108 

40 429168 AB033096 Hs.197668 KiAAl 270 protein 108 

432446 AA542845 Hs.294088 GAJ protein 108 

417866 AW087903 Hs.62772 collagen, type XI, alpha 1 107 

431093 AB031038 Hs.301704 eomesodermin (Xenopus laevis} homotog 107 

408116 AA251393 Hs.289052 Homo sapiens. Similar to RIKEN cDNA 5430 107 

45 449569 AI656634 Hs.195389 ESTs 107 

429999 AI761902 Hs.99597 ESTs 106 

420552 AK000492 Hs.98806 hypothefcal protein 106 

423175 W27S95 H6.347310 hypothetical protein FU 14627 105 
408137 NM_000179*i1omo sapiens mutS(E.coIi)h 105 

50 413833 Z15005 Hs75573 centromere protein E (312kp) 105 

450375 AA009647 adisintBgrinandm^attoprateinasedoma 105 

409066 AA062980 Hs.66960 ESTs 105 

425700 AF076292 Hs.159251 forkheadtxucHI 105 

432359 AA076049 Hs.Z74415 Homo sapiens cDNAFU10229fls, done HE 106 

55 409093 BE243834 Hs.50441 CGMM protein 105 

418054 NM-002318 Hs.833&4 tysytoxldase^ike2 104 

408446 AW450669 Hs.45068 hypotheOcat protein OKFZp4341143 104 

417115 AW952792 Hs.334612 smaQnudearritxuiudeopiiotetnpolypept 104 

429840 AM59699 Hs.99496 ESTs 103 

OO 409717 AW452871 Hs.56043 CGI-115preteln 102 

418113 At272141 H5.83484 SiW (sex detennining region Y)-tx» 4 102 

448275 BE514434 Hs.20830 kinesin4ike2 102 

432731 R31t78 Ks.287820 (ibronectini 102 
405157 Nh/L003213*:Homo83piensTEAdomainfama 102 

65 425274 BE281191 Hs.155462 mlnfchiofnosonisinainbnancedeflctentN 101 

423739 AA398155 Hs.97600 ESTs 101 

421310 AW630087 Hs.1 03315 trinudeothie repeat containing 1 100 

457107 AA418246 Hs.185796 ESTs. Weakly similar to Z184.HUMAN ZINC 2.00 

„ 437257 AI2830BS Hs,290931 ESTs. Weakly similar to YFJ7.YEASTHYPar 100 

70 407259 L02256 ^iiuman Fab fragment bindteg syncytial 100 



TABI£50B: 

Pkey: Unique Eos probeset Uenlifier number 
75 CAT numlier Gene duster mmiber 

Accesston: Genbank accession numbers 

Pkey GATNumtier Access to n 

80 432866 144 7 AA55858S AAS65499 AI380576 AW204089 AA991648 AA864939 

423458 30480 1 8001 8070 BG702493AI204212AA46092gAA993606BF926635AA226938BG1 90705 BG16&4g6AW291 865831833^ BG214S39 
BG215094 B6198887 BG196332 BG208220 BG212418 
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430676 

427521 

427466 
412537 



443068 
436812 
418477 

41B378 
422689 
43SS14 
439780 
434609 
454679 
434414 



406687 
457191 
410704 



413646 
442032 
411022 
429228 

445093 
420218 

435096 
422468 
415684 
447350 
410276 
437908 
418866 
424765 



400195 



L2 

513212L1 

684159J 
14066J 



18695.17 
659779J 
4172.1 

1227421.1 

874209.1 

132288.1 

49082.1 

14739J 

174325.1 

35978.1 



409517 4537J 



0.0 

1389162L1 
1054873.1 



1525656.1 
15407^1 
1066666.1 
215430.1 

175963.1 
191547.1 

125215J 

216674.1 

18695.18 

2267324.1 

641443.1 

13266 11 

245947.1 

6857.1 



414883 8371.2 



443748 
415989 



10194.1 
16894.2 



450375 16559.3 



BG433950 BE061583 T05808 BE144813 AW612038 BE144812 AW812040 AW812041 AU124350 BE061602 BE081604 8F922595 BE061603 
AI352469 BE061601 BI062752 AW818206 BFe87722 

AW973352 BF222929 AWD1 6853 BF059130 Ai651829 BE551767 AA5584 14 At339359 BF059801 AI961 162 AI341422 Ai206248 At2061 65 

AA5487^ AA768578 A1539081 AW025957 AA736837 N79575 AW594357 AA480892 

BF510715 BE673055 BE464111 AW590620 AI637939 AA404324 AW236441 AI650952 BF056796 AA974433 

AK025201 AM25472 AI694282 BG057305 AA907787 A1286170 AI684577 AJ420494 A1809865 BF058095 AI478773 AI160445 AU)441 14 

AW665529 A1129239 AW297152 AI268215 AI469807 A1969353 BE552356 N66509 AA736741 AA382555 AW075811 AV759188 BI259364 

BF445142 BG232065 AI141758 A!631202 A1167566 A1208445 AA889823 6F982682 N90322 Bi090882 BF208005 AW953918 AL0441 13 A1016793 

AA382556 AW235763 AA927051 AI862075 BE886691 BE619282 

AV752763 A1032142 N30308 N22181 H95390 AW67S632 

AW978773AW298067 AA810101 AW194180 AA731645At690673 

BC022538 A1990847 6F478249 BG217896 B6212702 BG182057 AW569883 BF000085 AAg939e9 BG479023 6G220014 BG679466 BE907092 

AI623855 AA223956 AA223917 AW022983 AW090580 AW573219 BF514491 6F445397 AAB84705 AI910424 

AA218925 AW962081 AA354237 

AW9S4733 AA3150Q6 AW856&65 

AA683356 AW592804 AI1502B7 

AL10968BR23665 R26578 

AF147390 R76593 R76594 

AW813110BF771370BF771371 AW813113AW003381 

AF134164 BF609407 AA216567 BF842863 AI267168 BF876178 66999253 AVI/861651 AW858362 AI817548 BF771300 AA11392d AA223422 
AA055556 BF773400 BF998869 BE081333 BE073424 BE142245 H59S71 H59570 BFB715S8 BF871064 6E001 132 6F826831 AW754298 
AA223267 BG997895 BG997897 AW991957 AA534354 BG319501 BF736309 AI694265 AA045564 BG950256 AI829309 BG987850 6E093175 
BF854337 

NH.000363 X54163 M64247 A1255781 AI760600 AI367238 BE1 40258 AW207185 AI657074 C03333 All 9391 1 C05024 C03193 AI95021 5 C05070 
C05613 W17389 C05351 AA31 1399 C04180 C04896 C05502 C05482 C04456 C04543 G04558 C04551 C031 14 C03103 AI369979 At652255 
T12391 T12073 W19390 CQ2994 002730 C04434 W07138 RS7607 C03339 
M31126 

AI216469 Ai3547B9 AA446136 H24336 AA446443 AI376228 R4e940 

AW877458 AW877524 BE076922 BE166912 AW840534 BE076754 AW797829 BE166905 BE166926 AW877452 BE166927 BE166932 AW877523 
BE166917AW877529BE166928BF351394AW877S22AW877528BE166861 BE166866BE166913BE166919AWB77456AW877537 BE076866 
AW840571 

BE155042 BE155040BE154987 BE155012 

BF223060 BF222B18 AI950472 AW016788 AI207 136 AI969730 BF222890 AI533857 AI96B71 1 AA974235 AI352637 
AW936378 AW936544 AW813513 

BG676155 BM009591 A)479075 At025794 Ai017967 AA448270 BE466ai2 AA8S3422 AI392649 BG952034 AAS13384 BF840124 6E714620 

AW969605AI553633 

AI207197BF773544AW196462 

AW958037 R42557A!337047AA948360AI63800SAA459950AI624915At638047Ai46785SAI521826AA86030^ 

AW779380AA609879 AI634791 AI493770AI56S211 Z41 145 AI627952AA303734BE349457AW1 96765 AA256S27BE0897Z7 

H30075 AA684977 AW975278 

AW962701 AA310998 AW962699 

BF666746 D59356 BG678312 N55640 AA166861 

AI375572 A1480404 6F430912 T06882 

AA083514AI554545AW169852AI363822AI633826A1656026AI765624AA147545AA147552 
A1740566 AA771806 BE500998 AW204531 A)082424 AI033879 BF093176 AA771764 038676 
T65754AA229858AA229857 

AK021881AU145974AU145787C16964AM28211AU119698AA9932&4BF999192AW903017AA346559AU119446AW5^^ 
AW8981 65 AW388878 AW890957 Z18340 

AF274943 B6494894AI719075AA908783AI935150AM22691AA910844AA583187BM272167Ai826996M 

AA772418 A)033892 AA100926 AU154749 A)459432 AI423513 AI094597 AA740817 AI991988 AI090262 A)312104 BI256707 AA459522 AA416871 
AI075239 AI339995 AA701623 AI139549 AI336880 AA633648 A1983380 AI362835 AA399239 AI146955 BF51 4270 N92892 AI348243 A1278897 
AA459292 AI494230 BF507S31 AI492600 AA962596 AW613002 AA293140 AA235549 BF108854 AA954344 N49682 AI457100 AW589407 
AW300758 BE220715 66220698 BE569091 BM00g647 8F900351 At537692 A)203723 AI857576AA584410AW371667 BM172363 
6M467830 A1084433 AW206447 Ai400g76 AI248530 R16553 

8C013389 BC017398 AI023543 AA191424 AI267700 AM69533 AW958465 AW953397 AA172056 BE940298 BF909208 BF909980 BF095153 
BG285837 AI720344 BF541715 AA355086 AA172236 

eM4775&4 BM423967 BC020979 AF067656 NM.007057 61869291 60468263 BG760599 61261788 AA855060 BE257094 6F212452 6E868249 
BI259219 AW409765 BEC89556 AL564377 BI258884 AW440401 AL578460AL578434AL555136BG036804AL531381 AW371767 BG610641 
BF1025S2 BE294929 BF792282 BG1216S7 BQ502285 BG777493 ALS&4510 AW770358 AA573448 AA564001 AA969560 AW078946 AW750065 
AU73860 AA143778 H99221 AA969210 AW103401 AW750073 

BG570706 BG572749 AW606284 H04021 AA151166AWg54405AA131254BG056461 W46291 H01532 H04384 K03231 AA852876H04410 
H59605BE157601AA113758 



TABLE 50C 

Pksy: Unique number corresponding to en Eos probeset 

Ref: Sequence source. The 7 digit numbers In this cohimn ere Gent)ank IdenOfier (GO nuint)efs. 13unham, et aL* refers to the publication entitled The DMA sequence of 

human chromosome 22" Dunham, et al (1999) Nature 402:489495. 
Strand: Indicates Dl^ strand from which axons were predicted. 
NLposifion: Indicates nudeoQde positions of predicted exorn. 



Ptey 


Ref 


Strand 


Ntj}0$ilion 


404996 


6007890 


Rus 


37999-38145.38652-38998.39727<39872.4055 


406547 


7711513 


FAinus 


172780-174358 


402145 


8018280 


Plus 


113088-114600 


403780 


8076989 


Rus 


93160-93409 


403432 


9719611 


Minus 


68204-68392 


401704 


3097841 


Rus 


24712-25374 


403433 


9719611 


Minus 


72225-72437 


401220 


9929324 


Minus 


4607948279 


406542 


7711499 


Rus 


117335-11B473 


402099 


8117697 


Rus 


121553-12174^123265-123423 


406137 


9166422 


Mbnis 


30487-31058 



596 



wo 03/042661 



PCTAJS02/36810 



405157 99G622B Plus 156363-156502.157573-157746 



5 

TABLE 51A: 

Pkey: Uniqua Eos probeset IdenGlier number 

ExAccn: Exemplar Accession number. Genbank aooession numbef 

UnioenelD: Unigene number 
, U Unigene TiOo: Unigene gene fitle 

R1: RaOo of senrinonistoustesQoiilsr cancer oonipsfed to noniul adult tissues 



Pl«y ExAccn UnlgenelD Unigene Tllle R1 

IS 418896 AW959433 Hs.326290 iiypotheScal protein HJI 2561 58.62 

432666 AW204069 ESTs^Ws^ similar to unnannedprolBln 49.00 

432730 Ai066520 Hs.1313S8 ESTs 37.64 

426534 U58096 HsiOSI testis specffic protein, Y<4nited 37.60 

428664 AK001666 Hs.169095 similar to SAai (sal pDSophOaHOe 32.70 

20 420387 AA259090 Hs^028 ESTs 29.96 

420347 AU>33539 Hs.97124 Human DMA sequence from ctoneRP1-309H1 5 26.50 

437052 AA861697 Hs.120591 ESTs 26.42 

407710 AW022727 Hs.23616 ESTs 23.85 

420528 AF130728 Hs.98566 doubiesex and m3b-3 related tFansctiplto 23.12 

25 424578 AK001973 Hs.lSOSgO hypothetical protein 22.27 

420759 T11832 Hs.127797 Homo saptenscDNAaJl1381fis. done HE 22,06 

417407 AA923278 Hs.290905 ESTs, Weakly similar to protease [H.8api 20.46 

429486 AF155827 Hs^03963 hypothetical protein 1^10339 18,44 

434649 AA738254 Hs.165390 ESTs, Nighty similar to A40350lransciip 15.92 

30 430252 A1638774 Hs.105328 testes developmenirclated NYD-SP20 . 15.44 

423458 A1204212 ESTs 15.28 

438915 AA280174 Hs.285681 WilSams-Beuren syndrome chromosome regi 15.26 

427711 M31659 Hs.180408 sohile carrier fanAy 25 (mitochondrial 14.84 

^. 427667 AK001279 H$.180171 Homo sapiens CONAFU10417 lis. ctonB^f^ 12,98 

35 426427 M86699 Hs.169840 TTK protein kinase 12.44 

420401 AK001907 Ks.97464 hypothetical protein 12.40 

406937 U14622 gb:Kumanlranstetolase4il(e protein gene 11.60 

430521 NIA.016383 HsJ!42183 HOM-TES-SS tumor antigen 11.55 

425769 U72513 Hs.159486 Human RPL13-2p8eudogenemRN\ complete 11.52 

40 418477 AW022983 gb:df46h12.y1 Morton Fetal Cochlea Homo 10.94 

434551 BE387162 HS.2808S8 ESTs. Highly simiiar to A35661 DNAexcis 10.78 

436812 AW298067 gb:U]-H-8WO-aip-g-09-0-U1.8l NCLCGAP.Su 10.54 

437789 AI581344 Hs.127812 ESTs. Weakly dmilar to T17330liypotheti 10.40 

433800 AI034361 Ks.135150 lung type4 cell menibranMSSodated gly 10.32 

45 421241 Xgi817 Hs.102866 trar^ketotase^ikB 1 10.14 

410102 AW248508 Hs.279727 ESTsihomotogueofPEM^fCtonasavignyf 10.02 

418134 AA397769 Hs.66617 ESTs 9.76 

433159 AB035698 H8.150587 tonesto-Gke pnM 2 9.56 

433975 AAg71953 Hs.122055 ESTs 9.36 

50 422956 BE&45072 Hs.122579 ECT2 protein (Eptthellal cell tTansfomd 9.30 

410561 BE540255 Hs.6994 Homo sapiens cDNA:FU22044 Us. CtoneH 9.22 

431494 AAS91355 Hs.298312 hypothetical protein DKFZp434A1 31 5 9.16 

436899 AA764852 ESTs 8.76 

426083 AW962712 Hs.126712 ESTs, Weakly simBar to AF1 91 020 1 E2IG5 a76 

55 424905 Nl\^002497 Hs.153704 Ntl^ (never In ntitosis gene a)-reiatedk a64 

408908 BE296227 Hs.2S0d22 sertneAhreonine kinase 15 aso 

413627 BE182082 Hs.246973 Intronof BicaudalDhomotog 1 8.42 

425572 AB011076 HS.1S8307 untfiSerenfiatod embryonic cell Iranscrl 8.30 

415857 AA866115 H3.127797 Homo sapiens cOMAFUl 1381 fe.ctone HE ai4 

OO 408728 AL137379 Hs.47125 hypotheOca) protobi FU13912 ai4 
406547 Target Exon a02 

424153 AA451737 Hs.t41496 MAGE-4ike2 7.90 
434699 AA643687 HS.14942S Homo sapiens CDNARJ11960 lis. done HE 7.G4 
^_ 437421 AA917062 ESTs 7.53 

05 409731 AA125985 Hs.56145 thymosin, beta, Uentlfied in neuroblast 7.S0 
419423 D26468 Hs.90315 K1AA0007 protein 7.38 
428227 AA321649 Hs.2248 smaii indudble cytokine subfamSy fi (Cy 7.38 
431840 AA534908 Hs.2860 POU domain, dass 5, transcr^n facto 7.32 
„^ 430676 AF084866 gb:Homo sapiens envelope protein RIC-3( 7.29 

70 436608 AA628980 Hs.192371 doumsyndfomecrtttoalfegtonpiDtebiDS 7.25 
435206 A1432364 Hs.160594 ESTs 7.20 
414972 BE263782 H5.77695 KIAA0008 gene produd 7.12 
407340 AA810168 Hs.28428g vitillg&^ssodatBd protein VIT-1 7.10 
426516 Z43039 Hs.170198 K1AA0009 gene product 7.10 

75 436513 AJ278110 H&125507 DEAO^iox protein 7.04 
427521 AW973352 ESTs a98 

423673 BE003054 Hs.1695 matrix metalloproteinase 12 (macrophage 6.92 
422232 D43945 Hs.113274 transcription factor EC a90 
420047 A1478658 Hs.94631 brefeUlnArinhibitedguanbienudeoSde aB3 
80 431041 AA4g0967 Hs.197955 K1AA0704 pretesn a76 
427335 AA448542 Hs.251677 GBn6sen7B a56 
422797 ABa33064 Hs.236463 KIAA1238 protesn 6.55 
418379 AA218940 Hs.137516 (idgeBn4De1 a46 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



423905 


AW579960 


HS.1351S0 


lung typ6-l fnsfnbranB-assoclatBd Qly 


&45 


433764 


AW753676 


Hs.3g982 


zinc finger protein RINZF (NM_0K92^ 


&44 


422655 


AJ011812 


Hs.119016 


transcription (actor NRF 


&38 


433701 


AW445023 


Hs.15155 


ESTs 


&34 


436909 


AA9O7120 




EST8 


&2B 


423728 


AW891294 


Hs.132136 


sohdo carrier fomOy 4, sodiwn fatetxn 


627 


429228 


A1553633 




ESTs 


a26 


419384 


AA490866 


Ks.39429 


ESTs 


&23 


435514 


AW592804 




ESTs 


&08 


434334 


AA912475 


H5.1 16750 


Homo sapiens cONA RJ13221 6s. done NT 


5.90 


430835 


AI240006 


Hs.192326 


£ST8 


5.89 


438188 


AA779975 


KS.12B859 


ESTs 


5.88 


429120 


AK001673 


H5.196530 


hypotiie6cd protein FU10811 


5.80 


408758 


NM_003666 


Hs.47504 


exonudease 1 


5.78 


424081 


NA^.008413 


Hs.139120 


ribonuctea5eP(30kO} 


5.70 


432359 
428153 


AA076049 


H8i74415 


Homo saptems cDNA FU 10229 fis. done HE 


5.67 


AW513143 


Hs.98367 


SRY (sex determining region YHnx 17 (S 


5.64 


422689 


AW856665 




gb:RC3-CT0297-^100^134l03 CT0297 Homo 


5.56 


428301 


AW628666 


Hs.98440 


ESTs. Wealdy similar to 138022 hypothsG 


S.S8 


419556 


U29615 


Ks.91093 


chiiinase 1 (chitotiiosidase} 


5.55 


438494 


AA908678 


Hs.130183 


ESTs 


5.52 


421974 


AA301270 




gb:EST14192 Testis tumor Homo sapiens cO 


5.52 


427510 


Z47542 


Hs.179312 


smaD nuclear RNA actlvaSng complex, po 


5.48 


412265 


AA101325 


Hs.86154 


hypothetical protein FU12457 


5.45 


413623 


AA82S721 


Hs.246973 


intron erf Bicauda) D iwmolog 1 


5.36 


402145 






Target Exon 


5.30 


414136 


AA812434 




SMC2 (structural mainlenance of chronnso 


5.28 


428479 


Y00272 


Hs.334582 


ceil division cyde 2, 61 to S and (^2 to 


5.22 


426949 


AA442153 


Hs.104744 


hypothetical protei) OKFZp434J0617 


5.16 


408460 


AA054726 


Hs.285574 


ESTs 


5.14 


415947 


U04045 


Hs.7e934 


mutS (E. coli) homolog 2 (colon cancefi 


5.12 


420900 


AL045633 


Hs.44269 


ESTs 


5.08 


426496 


D31765 


Hs.170114 


KIAAOOSI protein 


5.01 


407122 


H20276 


H5.31742 


ESTs 


5.00 


422938 


NM-001809 


Hs.1594 


centromere protein A (17i(0) 


4.95 


402199 






Target Exon 


4,90 


409103 


AF251237 


Hs.1 12208 


XAGE-I protein 


4.90 


416859 


H43437 


Hs.80305 


hypolhelica! protein MGC14258 


4.84 


410166 


AK001376 


Hs^9346 


hypothetic^ protein FLJ10514 


4.82 


410929 


H47233 


Hs.30643 


ESTs 


4.73 


417886 


AA214584 




ESTs 


4.73 


426223 


AW977812 


Hs.130391 


ESTs 


4.72 


409421 


AA1998B3 


Hs.67624 


ESTs 


4.72 


428249 


AA130914 


Ks.183291 


zinc finger protein 268 


4.71 


429999 


AI761902 


Hs.g9597 


ESTs 


4.68 


431721 


AB032996 


Hs.268044 


KiAA1 170 protein 


4.68 


408321 


AW40S882 


Hs.44205 


cocDstalin 


4.67 


419197 


N48921 


Hs^441 


KIAA1615 protein 


4.66 


428329 


AA428091 


HS.9B453 


ESTs, Moderately similar to R27328 2 [H. 


4.64 


418235 


BE072634 




gb:PM4^T0540-17129^1-h08 BT0548 Homo 


4.64 


427119 


AW880562 


Hs.272525 


ESTs 


4.64 


414812 


X72755 


Hs.77367 


monokine Induced tiy gamma interfenxi 


4.64 


414034 


U89277 


Ks.305985 


eaiiy devetofnnanl Fegdator 1 (homolog o 


4.64 


409068 


AA082980 


Hs.66960 


ESTs 


4.62 


416201 


AA467752 


Ks.195161 


ESTs 


4.53 


433330 


AW207084 


Hs.132816 


hypothetical protein MGC1 4601 


4.52 


429629 


BE501732 


Hs.30622 


Homo sapiens cONA FU 13010 lis. done NT 


4.50 


437099 


N77793 


Hs.48659 


ESTs. HIghiy simiiar to S14458 laminin a 


4.46 


415799 


AA653718 


Hs.225841 


DKFZP434D193 protein 


4.46 


412530 


AA766268 


Hs,266273 


hypothetical protein FU13346 


434 


418221 


Z45514 


Hs.83775 


DiGeorge syndrome gene D 


4.32 


418971 


AA360392 


Hs.87113 


ESTs 


4.30 


416111 


AA033813 


Hs.79018 


chromatin assemtily factor 1 . subunit A ( 


4.29 


423175 


W27595 


Hs^7310 


hypotheticai protein FU14627 


4.21 


415717 


AA167270 


Hs.130435 


ESTs 


4.18 


423198 


M81933 


H5.1634 


ogO (fivlsion cyde 2SA 


4.12 


433849 


BE465884 


Hs^80728 


ESTs 


4.12 


436211 


AK001581 


Hs.334828 


hypothetical protein FLI10719; KIAA1794 


4.11 


432840 


AK001403 


Hs.27g521 


hypothetical protein FU20530 


4.07 


421307 


BE539976 
AA769791 


Hs.103305 


Hdmo sapiens mRNA; cDNA Oi(FZp434B0425 (f 


4.07 


414725 




ling finger protein 21, InterfBTOo^BSpo 


4.05 


408291 


AB023191 


Hs.44131 


KIAA0974 protein 


4.05 


408332 


H91230 


H3.234794 


Homo sapiens ml^ cONA OKFZp564BG83 (fr 


4.04 


416773 


AK000340 


Hs.79828 


hypothetical protein FU20333 


4.04 


427584 


BE410293 


Hs.179718 


^-wij/b avian niydotilastosis vird oncogen 


4.03 


421917 


AB028943 


H8.109445 


KIAA1020 protein 


4.02 


430647 


AC003682 


Hs.127988 


ESTs, Weakly similar to 221 IJtUMAN ZINC 


4.02 


430287 


AW182459 


Hs.125759 


ESTs. Weakly similar to LEU5 JtUMAN UEUKE 


4.01 


436360 


A1962796 


Hs.156100 


ESTs 


4.ra 


438624 


AA889055 


Hs.123468 


ESTs 


3.99 


434609 


R76593 




gb9i60c11.r1 Soares placenta Nb2HP Homo 


3.92 


411945 


AL033SZ7 


H3.92137 


L-4nyo-2pn)tein(MYa2) 


3.90 


408085 


AW954272 




gb£ST366342 MAGE resequences. MAGCHonn 


3.90 
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413833 Z15005 K$.75573 centromere protein E (31 2kO) 3.90 

421010 AW974553 Hs.267124 ESTs, WesUy8lmS8rlDALU6.HUMANALUS 3.88 

438458 AA913381 Hs.20994 ESTs 3.86 

. 424687 J05070 H&151738 matrtc meiaDoprot^nase 9 (geiaQnase B 3.87 

5 412537 AL031778 nuclear transolpiion factor Y. alpha 3.88 

418651 NM_001949 Hs.1189 E2F transcription factor 3 ^ ^85 

408750 BE294069 Ks.93581 ftypothaticalprot^Fy 10512 3.83 

422094 AF129535 Hs^2027 IMxa only protein 5 3,82 

419839 U24577 Hs.93304 phosphonpas8A2. group VII (platelet-ac 3.80 

10 416350 AF18862S Hs.189507 phosphoHpase A2. group IID 3.78 

426054 U12431 Hs.1661Q9 EUV{enit»yontole8iat.abriomdvlsi^ 3.76 

401435 C1400039r:^4998981piilIT33295hypoth a76 

424557 AA343057 Hs.164588 ESTs. Moderately simnarlo neuronal Ihr 3.74 

422631 BE2ie919 Hs.118793 hypothetical prolan aJ10688 3.70 

15 409089 NM.014761 Hs.50421 KIAA0203 gene proctuct 3.70 

426067 AW664691 Hs.g7053 ESTs 3.67 

415684 D59356 sorblto) dehydnDgenase 3.66 

429469 M645go Hs.27 glycine dehydfogenasefdecartioxylalino; 3.62 

424590 AW966399 Ks.46821 hypothelicel protein FU20086 3.62 

20 427761 AA412205 Hs.140998 ESTs 161 

433641 AF080229 gbiHuman endogenous retravinfiK done 1 3.60 

418216 AA662240 Hs.283099 AF15q14 protein 3,59 

438180 AA808189 Hs.2721S1 ESTs 158 

424281 AA766243 gb:oa13b1U1 Na.CGAP_GC81 Homo sapiens 3.56 

25 421379 Y15221 Hs.103982 smaQIndudblecytotdne subfamily B(Cy 3.55 

428878 AA435884 H3.48926 ESTs 3.54 

438885 AI886558 Hs.1849B7 ESTs 3.53 

416445 AU)43004 H5.79337 KIAA01 35 protein 3.52 

424381 AA285249 H5.146329 protein Idnase Chk2(CHEK2) 3.51 

30 432415 T16971 H&289014 ESTs, Weakly similar to A43932mucin 2 p 3.49 

427298 AA400495 ESTs 3.48 

420218 AW958037 ribosomal protein L4 3.40 

407300 AA102616 Hs.120769 gb3n43e07^1 StratageneHeUce]ls393 3.40 

410420 AA224053 Hs.172405 ceIldhrlsioncycle27 3.40 

35 432809 AA565509 Hs.131703 ESTs 3.36 

424085 NM.002914 Hs,139226 replication factor C(adM)r 1)2 (40 3.34 

421373 AA808229 Hs.46677 ESTs . 3.34 

423354 AB011130 Hs.1Z7436 calcium channel. voItagMlependent, alph 3.31 

418830 BE513731 Hs.88g59 hypotheOca) protein M6C4816 3.30 

40 431077 AI669133 Hs.1 15660 hypothetical protein aJ12810 3.30 

418049 AA211467 Homo sapiens, Similar to nuciaarlocaliz 3.26 

420949 AA934063 Hs.13836 ESTs, Wealdysindlar to 138022 tiypothetl 3.22 

432407 AA221038 Hs.13273 gb2i03nir1SlratageneNT2neurondpr 3,21 

434288 AW169Q75 Hs.116265 mMttZ 3,20 

45 418295 AW970043 Hs.238039 hypothetica} protein FU1 1090 3.19 

429714 BE561801 Hs.2484 T-cellteukemia/lymphomalA 3.17 

421350 AW301608 Hs.278188 ESTs, Moderately stmilar to 154374 gene 3.17 

420161 A1683069 Hs.120817 ESTs 3.17 

414618 AI204600 Hs.96978 hypolhetica) protein M6C1 0764 3.16 

50 430253 AK001514 Hs.236844 hypottietical pralein HJ10652 3.14 

414747 U30872 Hs.77204 centromere protein F(35tf400((O.initosin 3.14 

423419 R55336 Hs.23539 ESTs 3.13 

410275 U85658 Hs.61796 transcription factor AP-2 gamma (acQval 3.12 

408092 NM.007057 Ks.42650 ZW10 interactor 3.12 

55 423685 8E350494 Hs.49753 uveal autoantigen with coBedccSdomai 3.12 

436378 AW970S29 H5.86434 hypothetical protein FU21 81 6 3.12 

415912 H08859 Hs.206469 ESTs. WeaMy similar to AlUejtUMAN ALUS 3.12 

412140 AA219691 Hs.73625 RAB6 interacting, Idnesld-lilce (rabUnes ail 

420552 AK000492 H5.98806 hypothetical protein 3.11 

60 402408 NM.030920*:HQmo sapiens hypoOteltoalpio 3.10 

432281 AK001239 Hs.274263 hypothetical protein FU10377 3.10 

415629 AW450198 Hs.163742 ESTs 3.09 

423739 AA398155 Hs.97600 ESTs 107 

418459 RB5436 Hs.268814 ESTs 3.07 

65 421972 IM1818S Hs.1454 gastric inhibitory polypepiide 3.07 

437257 AI2B3085 Hs.290931 ESTs,Wealdy8imnartoYFJ7JfEASTHYP0T 3.06 

429830 AI537278 H8.225841 DKFZP4340193 protein 106 

420524 AB010575 Hs.98547 amitoride^ensitivecalion channel Ite 106 

433023 AW864793 thrombospondln 1 104 

70 421633 AF121B60 Hs.108260 soring nexbi 10 104 

420507 AF093408 Hs.98397 A kinase (PRKA) anchor protein 3 3.04 

432938 T27013 Hs.3132 steroidogenic acute regulatory protein 103 

414598 A1094221 Hs.135150 lung type-l cen membrane-assodaled gly 103 

419835 NN/L005033 Hs.91728 potymyosms&derodenna autoantigen 1 ( 103 

75 425312 AA354940 Hs.145958 ESTs 102 

425474 Z48054 Hs.156084 peroxisome receptor 1 101 

411027 AF072099 H3.67846 leukocyte immunoglobulin^ receptor. 101 

432446 AA542845 Hs.294088 GAJ protein 101 

424513 BE385864 Hs.149894 mitochondrial translaQonal initialion f 3.00 

80 436902 AW247145 H3.192729 ESTs 100 

422789 AK001113 Hs.120842 hypothetical protein FU102S1 100 

430056 X97548 Hs.2280S9 Kf^AB-essodatod protein 1 Z98 

427617 042063 Hs.199179 RAN faindbtg pTOtein 2 2.98 
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406367 NM.022357:H0ino sapiens putaOvemetaBop Z97 

416866 T65754 gb7c11c07.s1 Stratagene lung (937210) H 2.97 

435916 AF263538 Hs.86232 growth differenliaSon factor 3 Z97 

436511 AA721252 Hs.291502 ESTs 2,96 

5 402660 Target Exon 296 

414161 AA136106 Hs.184852 KIAA1553 protein 295 

427239 BE270447 ub'K)ui&i earner protein 295 

433863 Aiai7723 Hs.22678 hypothetical protalnFU21 832 294 

417576 AA339449 Hs.82285 phosphoribosylglydnamUefonnyllransler 294 

10 402299 Target Exon 292 

420697 AA827705 H8.26605 ESTs 290 

427719 Ai393122 Hs.134726 ESTs 290 

419131 M406293 K5.109526 ESTs 289 

_ 410048 W76467 Hs.343874 pronne oxidase hom(*)g 289 

15 427314 AB033024 Hs.175475 KIAA1198 protein 289 

424315 AW814850 Hs.193384 putatallve 28 kOa piotein 288 

430335 D80007 Hs.239499 K1AA0165 protein 287 

410351 BE391804 Hs.62661 guanylate binding protein 1, interferon- 287 

413686 AI469213 Hs.71404 ESTs 287 

20 429183 AB014604 Hs.lOTOSS KtAA0704 protein 288 

430292 AK000634 H3.238270 hypothefical protein FU20627 286 

422726 U11690 Hs.1572 faciogenita) dysplasia (Aarslcog-Scott sy 286 

437834 AA769294 gbnc38g03.s1 NO.CGAP.GCBI Homo saptens 286 

435159 AA668879 Hs.116649 ESTs 2.84 

25 428361 NM.015905 Hs.l83858 transcripUona) intemiedlary factor 1 284 

430388 AA356923 Hs.240770 nuciearcsp binding protein subunit 2 2 284 

434070 AF1 16652 H3.270087 hypothetical protein PRO0813 283 

429323 NM.001649 Hs.2391 aplcd protein. Xenopuslaevis^e 283 

433247 AB040948 Hs.142856 KIAA1515 protein 282 

30 415884 H22965 Hs.13471 ESTs 282 

427668 AA298760 iHs.180191 hypothetical protein FU14904 282 

437162 AW005505 Hs.5464 thyroid hormone receptor coactivating pr 281 

401091 decay accelerafing iactor tor complement 281 

425601 AW62948S Hs.140720 GSK-3 binding protein FRAT2 279 

35 428597 AK000147 Hs.295909 hypothetica} protein FU10700 279 

417705 AW134952 HS.17S220 hypothetical protein FU14541 279 

438243 AI581311 ESTs 278 

418203 X54942 Hs.B3758 CDC28 protein kinase 2 278 

410704 BE076754 gb:CM1-BT0601.18Q200-121-b10 BT0601 Homo 277 

40 429063 AW363B45 Hs.322903 ESTs. WeaWy similar to A46010 X-^lnlted 278 

427147 AA398587 Hs.97414 ESTs 276 

430552 AA176374 Hs.2438e6 nudearautoanligenlc sperm protein (his 276 

437660 W31708 Hs.55304 ESTs 274 

425237 U07695 H8.155227 EphB4 272 

45 419335 AW960146 Hs.284137 hypothetical proldnFUl 2888 272 

426386 AA748850 Hs.125d30 bladder cancer overexpressed protein 270 

423123 NM_012247 Hs.124027 SEt^NOPHOSPHATE SYNTHETASE ; Human selen 270 

430968 AW972830 gb:EST384925 MAGE resequences. MAGL Homo 270 

420698 NM.002692 Hs.99185 polymerase (DNAdirected),epsik)n 2 268 

50 419741 Kl^.007019 Hs.93002 ubiquitin carrier protein E2-C 268 

401464 histonedeacetylaseS 268 

411856 K67899 Hs.4190 Homo sapiens cDNA:FU23269fis,ck>neC 268 

411365 M76477 Hs.289082 GI\A2 gangOoside activator protein 268 

419029 AA233397 Hs.326290 hypothetical protein FU 12581 267 

55 421654 AWt63267 H5.106469 suppressor of vart(Sx»BVi8tee) 341(8 266 

421535 AB0023S9 Hs.105478 phosphoribosylfomiylglydnanMlnesyntha 266 

423453 AW450737 Hs.128791 CGH)9 protein 266 

412673 A)J)42957 Hs.31845 ESTs 265 

410006 AW732308 H3.57783 eukaryotic translation initiation factor 265 

60 434159 AW135214 Hs.191828 ESTs 265 

427260 AA663848 gb:ae70b06.s1 Stratagene schizo bt^ SI 264 

439053 BE244588 Hs.6456 chaperonin containing TCP1. subunit 2 (b 264 

414706 AW340125 Hs.76989 KIAAOOg? gene product 264 

^_ 433979 AA620999 gb:ag03a0a8l SoarBS_testis.NHT Homo sap 264 

65 403989 ENSP000000346632tic finger protein 131 264 

420582 8E04787B Hs.99093 Homo sapiens chromosome 19, 006nddR2837 264 

418355 L42563 Hs,1165 ATPase. H7 tf ansporilng. nongastrfc. alp 263 

411127 AA668995 Hs.218329 hypotheUcai protein 262 

437205 AL110232 Hs.279243 Homo sapiens mRNA; cONA DKFZp^4O2071 (f 262 

70 412123 BE251326 Hs.73291 bypolhetical protein FU10881 261 

436481 AA379597 Hs.5199 HSPC150 protein similar to ubkiuiOn^n 260 

408446 AW450669 Hs.45068 hypothetical protein DKFZp434l1 43 259 

437033 AW248364 Hs.5409 RNA polymerase I subunit 258 

418592 X99226 Hs.264153 F8nconiBneniia,comptementatlongroupA 258 

75 415585 R59946 Hs.184852 KIAA1 553 protein 257 

424600 AU035588 Hs.1S3203 MyoO famOy inhltiitor 257 

426470 AA528794 Hs.128644 ESTs 257 

426919 AL041228 ELAV(embfyonic lethal, abnomialviskin. 256 

„^ 421209 AJ010230 HS.102S76 ret fii«er protein^ 1 anSsense 256 

OO 437498 AA452378 Hs.146668 Homo8aptensmRNA:cONADKFZp547J125(lr 256 

401837 NiyL025109:Homo saptens hypothetkal prol 256 

428743 AL080060 Ks.301549 HomosaptensmR^l^cO^IAOKFZp564H172{fr 256 

422809 AK001379 H5.12102B hypoSietical protein RJ10549 255 
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418648 AW979223 Hs^47B ESTs 2.55 

423020 AA383092 Hs.1608 replication protein A3 (14M)) 2.54 

430345 AK000282 Hs.239681 hypothetics) prolein FU2027S 2^ 

424075 AI807320 Hs.227630 REI-saenringtransciipaonlaclor 2.54 

5 428728 NM.016825 H5.191381 hypothetic^ pfoteln Z53 

423755 AB037735 HS.132SG0 hypo(heGcdpioleinFU10312 2.52 

424051 AL1 10203 Hs.138411 Honn sapiens mRNA;d)NADKFZp58SJ1922(f 152 

418734 H81213 KS.1482S ESTs. Weakly sNIer to ICIAA1 503 pnM 152 

422406 AF025441 Hs.1 16205 Op^lnleractlng protein 5 2,52 

10 433226 F28212 Hs.14953 K1AA1491 protein 2.51 

411943 BE502436 Hs.7962 ESTs. Weakly similar to S44608 C02F5.6p 2^1 

426181 AA371422 Hs.334371 hypotheOcd protein M6C13096 2^ 

423642 AW452650 Hs.1 57148 tqrpothetical protein MGD 3204 2.50 

411571 AA122393 Hs.70811 hypothetical protein FU20516 2.48 

IS 419750 AL079741 Hs.183114 Homo sapiens cDIM FU14238 fis. done NT 2.48 

408209 NM_004454 Hs.43697 ets v9iantg8ne5(ets-relaledinorecule 2.47 

435726 BE535787 Hs.1 13170 ESTs 2.47 

404088 TaipetExon 146 

403137 N^4_005381*:Komo sapiens micl8din(NCL), 146 

20 434276 AF1235S9 Hs.93605 leucine z^. putative tumor suppresso 146 

422283 AW411307 Hs.1 14311 CDC45(ceadhrtsioncyde45.8£erevis 2.46 

429652 AA766810 Hs.259290 ESTs 145 

416204 AW972270 Hs.l95161 ESTs 145 

414713 BE485243 Hs.12664 ESTs 144 

25 425910 AA830797 Hs.184760 CCAAT-faoxpbtnding transcription factor 2.44 

408875 NM_015434 Hs.48604 DKFZP434B168 protein 144 

435244 N77221 Hs,1 87824 ESTs 144 

402679 NiyL000478:Hofno sapiens dicaanephosphat 143 

413943 AW294416 Hs.144587 Homo sapiens cDNAFU12981fls, done NT 142 

30 433914 AF108138 Hs.1 12160 Homo sapiens DMA helicase homoloo (PIF1) 141 

437812 Ai582291 Hs.16646 ESTs, Weakly similar to 04HU01 debrlsoqu 2.41 

410855 X97795 Hs.66718 RAD54 (S.cerevlslaeHDw 141 

423232 BE244625 Hs.1 25742 teudne^ neuronal protein 140 

427578 AI591305 Hs.ie9084 ESTs, Higlily similar to TUI^.HUMAN TUBBY 140 

35 409934 R91601 Hs.190466 hypothetical protein FU22584 139 

423787 AJ295745 Ks.236204 nudear pore connplex protein 139 

420892 AWg75076 Hs.172589 nudear phosphoprotein similar to S.cer 139 

438869 AF076009 gbiHomosapIensfiin length insert cONA 138 

434981 AW182577 Hs.293077 ESTs 138 

40 417911 AA333387 Hs.82916 chaperodn containing TCP1,sut)Unil6A( 138 

409210 AA251812 Hs.51120 catheTicidlnanlimlcfotjialpepBde 137 

424425 AB031480 Hs.146824 SPR1 protein 137 

411885 AA452636 Hs.131057 ESTs. Moderately similar to CR6DJ1UMAN 6 137 

421567 AJ272137 Hs.198265 matrix metalloproleinase 25 137 

45 425159 NML004341 Hs.154868 caibamoyl^Jhosphale synthetase 1 espart 2.37 

418678 NM^001327 Hs.167379 cancerAesBs antigen (NY-ESO-1) 136 

431197 AL038596 Hs.250745 polymerase (RNA) III (ONA dtreded) (62k 136 

411630 U42349 Ks.71119 Putative prostate cancer tumor suppresso 136 

410968 AA199907 Hs.67397 homeotJOxAl 136 

50 421305 6E397354 Hs^24830 diptheria toxin leslstanca protein requl 138 

417153 X57010 Hs.81343 cdlaoen,typen.dpha1 (prtmaiyoste 136 

412389 AW947655 gb:RC0^0003-1403004)31-b07 MT0003 Homo 135 

419359 AIJ0432Q2 Hs.9ro73 chromosome segregaSon 1 fyeast homoiog) 135 

403780 C40017S9:gi|133250|splP19474|RO52.HUMAN 134 

55 437681 AI2079S8 Hs.1665S6 Homo saptons, Simitar to TEA domain fami 134 

400205 NK^006265*:Hon« sapiens RA021 (S. pombe) 134 

433160 AW207D02 Hs.134342 TASP for testis^pediic adriamydn sens 134 

432606 NM_002104 H5.3066 granzyme K (serine protease, granryme 3; 134 

425331 AW962128 gb:EST374201 MAGE resequences, MAGG Homo 133 

60 430606 BE266026 Hs31476 Homo 8a|^cONAFU13872fis. dona TH 133 

424308 AW975531 H5.154443 idnlduomosome maintenance delident(S. 132 

418821 AA436002 Hs.183161 ESTs 132 

437437 AA226869 hypothetical prot^DKFZp762U)311 132 

413437 BE313164 HsJ5361 gene from NF2/hnen]ngioma region of 22q12 131 

65 425848 BE242709 Hs.15g637 valyl-tRNA synthetase 2 130 

435532 AW291488 Hs.1 17305 Homo safdens. done IMAGE:3882908.mRNA 130 

430183 BE010038 gb:PM3^N0176.100400^1-g04BN0176Homo 130 

409342 AU077D58 Hs.54089 BRCA1 assodaled RING dom^n 1 129 

430504 H52761 ' Homo sapiens, done MGC:12617,mRNA, com 129 

70 427726 AI3S9144 Hs.1 43688 Homo8apienscDNA:FU23031 fis,doneL 129 

417115 AW952792 H5.334612 small nudear ribonudeoprotein polypept 128 

412721 AW183165 Hs.95600 ESTs 128 

404071 C12000514*:gii7302471teblAAF57556.1|(AE 127 

413762 AW411479 H3.848 FK506Wing protein 4 (59kD) 126 

75 425811 AL039104 Hs.159557 karyopherindpha 2 (RAG cohort l.lmpor 126 

424935 AI655010 Hs.120363 hypothetlcaiproteinli/l6C15634 126 

415791 H09366 Hs.78853 uradH)NA glyoosylaso 126 

431667 AA812573 Hs.246787 ESTs 126 

„^ 424169 AA336399 Hs.153797 ESTs 125 

80 438540 6E397032 Hs.1 4466 hypothe£ca! protein MGC1 4226 125 

418113 AI272141 H8.83484 SRY (sex determining region YVbox 4 124 

403242 . Target Enm 124 

414732 AW410976 HS.771S2 mlr g d u umo so me maintenance defident (S. 124 
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4210Q2 AF11G030 Ks.100932 transcf«pBon factor 17 124 

438833 BE612940 Hs.88252 ESTs 2.24 

420333 AJ001383 Hs.97084 lymphocyte antigen 94 (mouse) homoiog (a Z23 

_ 433844 AA610175 Hs.179S47 Homo sapiens cDNA FU 121 95 fis. done MA 223 

5 427528 AU077143 Hs.179565 mintehiomosome maintenance deficient (S. 123 

430289 AK00t952 Hs.238039 liypotheScal protein RJ11090 123 

421016 AA504583 Hs.101047 transcrip6onlactor3(E2Alininuno^tnil 123 

436251 BE515065 Hs.296585 nucleolar protein (KKE/D repeal} 123 

41B826 AK000375 Ks.88820 HOCMC28P protein 123 

10 428812 AA770001 ESTs 122 

433220 An76192 Hs.131933 ESTs 122 

422225 8E245652 Hs.118281 zinc finger protein 266 122 

437549 AA7S914g Hs.128757 gtrah70e03^1 Soare$_test]s.NHT Homo sap 122 

409299 AA045650 Hs.53125 small nudear ribonudeoproteln D2 polyp 122 

15 408665 T68845 Hs.1 12200 ESTs, Wea)(ly similar to ALU7_HUMAN ALU S 122 

408116 AA251393 Hs.289052 Homo sapiens, Similar to RlKENcONA 5430 121 

420062 AW411096 Hs.94785 TGF(l)elaHnduced transo^ tadof 2 121 

432820 A)554057 Hs.152477 ESTs 121 

_^ 430255 AK000703 Hs.323a22 Homo saptensmRNA for IdAAl 551 protetn. 121 

20 420337 AW295840 Hs.1 4555 Homo sapiens cONA: FU21513 lis. done C 120 

407275 AI384188 gb:qw34h07j(1 Na_CGAP.Ut4 Komo sapiens 120 

416209 AA236776 Hs.79078 MA02 (mitotic arrest deftdent. yeasl h 120 

423675 Aig90509 Ks.1 31342 small Indudble cytoidne sut)famay A (Cy 120 

433698 H24201 Hs.247423 addudn 2 (beta) 119 

25 409101 NI\A_004297 Hs.S0612 guanine nudeotide binding protein (G pr 119 

435541 AA687361 Hs.221318 ESTs 119 

412019 AA485890 H$.69330 Homo sapiens cDNAFU13835fis. done TH 119 

418753 BE217818 Hs.87016 hypoOwfical protein aJ22938 119 

435461 AI075846 H8.133996 ESTs 119 

30 402260 NIVL001436*:Homo sapiens Mlarin (FBI 118 

421098 AI697901 Hs.192425 ESTs 2.18 

400587 C10000649*:giI72965741gblAAF51857.1)(AE 118 

407832 AW976516 H5.2B3707 Homo sapiens cONA:FU213S4 lis. done C 118 

427159 U80735 Hs.173854 PAX transcription adv^n domain inte 117 

35 405770 NM.002362:Homo sapiens melanoma anUgsn, 117 

412722 A1343300 Hs.l5091 ESTs 116 

414334 AA824298 Hs.21331 hypottwlicai protein FU10036 116 

438192 Ai859065 Hs.293607 Homo sapiens AFG3L1 (soTomi 1 ml^ pail 116 

417420 T85150 Hs.268814 ESTs 116 

40 421308 AA667322 Hs.192843 ieucine zipper protein FKSG14 116 

412851 AI826502 Hs.97269 ESTs 116 

414702 L22005 Hs.76932 cell division cycle 34 116 

409670 Al3e8109 KIAA1 855 protein 116 

419926 AW900992 Hs.93795 DKFZP58602223 protein 115 

45 417863 AB000450 Hs.82771 vacdnia related kinase 2 115 

434750 BE019254 Hs.4112 t-«ompIex1 115 

410252 AW821182 Hs.61418 miaofibrflto-assodated protein 1 115 

418574 N26754 Mi)hase phosphoproteki 9 115 

409019 AW385412 myosin regulatory light chain 2, smooth , 115 

50 416608 R11499 Hs.189716 ESTs < 114 

436027 AI8640S3 Hs.39972 ESTs. Weakly similar to 138588 reverse t 114 

408161 AW952912 Hs.300383 hypothetical protein MGC3032 113 

422805 AA436989 Hs.121017 H2Ahistone family, member A 113 

410284 U50939 K$.61828 amyidd beta precursor protein^lngp 113 

55 434274 AA828539 Hs.116252 ESTs. lifoderately similar to AlOJlJIUf^ A 112 

430935 AW072916 zinc linger protein 131 (done pHZ-10) 112 

433252 AB040957 Hs.151343 K1AA1 524 protein 112 

416619 U77735 Hs.802O5 pim-2 oncogene 112 

437218 AL117497 Hs^185 ESTs. WeaWy similar to T42727proiifera 112 

60 407239 AA076350 Hs.67846 leui8)cyt0immunogtobuDn^ receptor, 112 

433947 AA927996 Hs.112876 ESTs. Wbddy simitar to AF1 29535 1 F^ 111 

424727 AW590378 Hs.152519 hypotheticd protein FU20674 111 

435703 AW630133 Hs.63313 GK003prolein 111 

420297 AI628272 Hs.88323 ESTs. Weakly similar to ALUIJtUlWAN ALUS 111 

65 422192 AA30S1S9 H8.113019 ns485 111 

407961 AW672939 Hs.41G94 origin recognltton complex, subunit 2 (y 110 

410193 AJ132592 Hs.59757 zinc finger protein 281 110 

414151 AW976468 Hs.257245 ESTs 110 

434789 AW292515 Hs.194317 ESTs. Weakly shnDar to T08680hypolhetl 110 

70 424196 AL133660 Hs.142926 Homo sapiens beta cysteine String protd 110 

408831 AF090114 Hs.48433 endocrine regulator 110 

414733 BE514535 Hs.77171 mlnfchromosofT» maintenance ddldent (S. 109 

434523 AA703709 Hs.23410 trensloc^ of inner mitochondrial membr 109 

__ 409637 AA323948 Hs.55407 Homo sapiens mRNA; cDNA DKFZp434K0621 (f 109 

75 403532 NM_024638:Homo sapiens hypotheScal pro! 109 

432141 BE410964 Hs.272736 nudear receptor binding protein 108 

409014 H83115 Hs.49760 origin recognitton complex, subunit 6 (y 108 

410575 BE207480 Hs.6994 Homo sapiens cONA:FU22044fis. done H 108 

415071 AK002197 Hs^270 Komo sapiens cDNAFUl 1335 fis. done PL 108 

80 418755 Y14443 Hsi8219 zinc finger protein 200 108 

406137 NM.000179*:Homo sapiens mutS(E COR) h 107 

409893 AW247090 Hs^lOl mbikhramosome maintenance defident(S. 107 

421413 AI826128 Hs.55209 ESTs.W^eim0artoA43384 S9pfotei 107 
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Hs.80624 


hypolhetical protein M6C2560 


102 


408638 


BE294925 


Hs.46680 


CGI-12 protein 


102 


420005 


AW271106 


Hs.133294 


EST8 


Z02 


412783 


BE276738 


H3.7457B 


DEAO/H (Asp-Qii-AIaAsp/His) box polypep 


2.02 


415319 


AA659823 


Hs.34955 


Homo sapiens cDNA FU13485 lis. clone PL 


102 


425358 


AL0796S8 


Hs.338207 


FKS08 trtndng protdn 12'4Bparnycin essoc 


2.01 


409617 


6E003760 


H8.55209 


Homo sapiens cONA DKFZp434K0514 (f 


2.01 


438450 


A]050866 


Hs.65853 


nodal, mouse, honiolog 


ZOO 


431629 


AU077025 


HS.265B27 


interferon, alpha-inducible protein (cb 


ZOO 


424934 


U75370 


Hs.153880 


polymerase (RNA) mltochondttel (ONA dire 


ZOO 


436291 


BE568452 


Hs.344037 


protein regulator of cytokinesis 1 


ZOO 


414251 


AL042306 


Hs.97689 


VASA proton 


ZOO 



TABLE 51B 
Pkey: 

CATnumben 
Accession: 



Unique Eos probesel Menta* number 
Gene duster number 
Genbank accession numbers 



Pkey 

432666 
423458 

418477 

436812 



437421 
430676 



436909 
429228 

435514 
422689 
421974 
414136 



417886 
418235 
414725 



434609 
408065 
412537 



41S684 
433641 



424281 



CAT Number Accession 



144J 
30480.1 

417^.1 

659779_1 
1000797J 
978554.1 
6083SJ 



427521 51321?„1 



696835J 
215430.1 

132288.1 
874209.1 
864120.1 
30243J 



1031334.1 
886897.1 
19377J 



14739.1 

101881.1 

14066.1 



18895.18 
35983.1 



AA558585 AA565499 AI360576 AW204069 AA991648 AA864939 

BC01807O BG702493 AI204212 AA460929 AA9936Q6 BF826635 AA226938 BG190705 6G1 86496 AW291 865 BG183340 BG195301 BG214539 
6G215094 B6198867 BG196332 BG206220 BG212418 

BC022538 AI990847 8F478249 BG217996 B6212702 6G182057 AW589883 6F000085 AA993969 BG479023 B6220014 BG679466 BE907092 

AI623B55 AA223956 AA223917 AW022983 AW090560 AW573219 BF514491 BF445397 AA884705 AI910424 

AW978773 AW298067AA810101 AW194180 AA731645A1690673 

AA764852AA738937 

AA917062 AA7573G9 AW592218 

BG433950 BE081583 T05B08 BE144813 AW812038 BE144812 AVV812040 AW812041 AU124350 6E081602 BE081604 BF922595 BE061603 
AI352469BE081601 BI062752AW3182066F687722 

AW973352 BF222929 AW016853 BF059130 At651829 BE551767 AA558414 AI339359 6F059601 AI961162 AI341422 AI206248 AI206165 
AA548736AA768578A1539081AW025957AA738837 N7957SAW594357AA480892 
AW102S70AAg071S0AA907120AA737l88AI248890AW9n353 

BG6761556M009591 AI479075 AI025794 A1017967 AA448270BE466812AA853422 AI392649BG962034 AA513384 BF840124 BE714620 

AW969S05AI553633 
AA683356 AW592804 AI1502B7 
AW954733 AA3150C6 AW856665 
AA301270 AA301379 AA301366 

AJ420453 AL526740 AWg6B449 AA459140 AA843893 AI566516 AW971760 AA430089 AI753216 AA854268 AA743075 AI864957 AA4S8920 
Ai565634 AA211796 BG615512 BE169275 BF983253 BF969462 AA766261 A1769894 AA135833 AI831542 N63376 AA214392 AU154486 
AW605017AW450072AA446459 6E8B1875AI061423 AA598549AW4391S1 AA426273Z40087AA812434 AA135965 K04812 
AA210967 057294 AA214584 AA207006 056572 
BE072634 6E072653 AA830615 AA214736 AA331718 

NM-OS8166AF220030AL043894 AVW74257 AA625445 AU153502AI650537AVV612116AI672377 AW772451 BE892241 BE501740 AA718936 
AI650276 AI654206 BE503226 AI6S1327 AW873562 AW271269 AW271S65 AI873518 AI2071 50 A1338826 AI650258 AI628362 AA2271 17 Ai207149 
AVV052076AI470776AA588100AVV235652 AA769791 AI701653 AK027664 AI984770AU153469 BE222^^ 
61054316 

AF147390R76593 R76594 

81603077 AWg54272BI598724AI0031S4AAOS9300AAO46911 B1GG9907 6160096661668987 
Aia)25201AA425472AI6942B2BG057306AA907787AI288170AI6845nAJ420494AI809865BF058095AI4^^ 
AVV665529AI129239AW297152AI268215 AI469807 AI989353BES52356N66509AA736741 AA3825S5AV^ AV7S9188BI2S9364 
BF445142 BG232065 Ai141758 AI631202 AI167566 Ai208445 AA889823 BF982682 N90322 BI090882 BF208005 AW9S3918 AU)44113 AI016793 
AA382556 AW235763 AA927051 AI862075 BE886691 8E619282 
6F666746 059356 BG678312 N56640 AA168881 

AF080229AF080232 U87S93U87592 U87591 U87590AI638743 AI833818 AW206802AI583718 AF080231 AF080234 AF080233AL535594 
AI81 8326 AF080230 S46404 AI970376 AA463992 AW665466 BF512210 U87S95 U87S89 6E550633 AI672S74 BE467547 A1680833 AW614951 
N29988 N25695 H89001 U875g8 BE673974 AI797496 AiTOI 526 AA703395 AW139734 H92278 N66048 BE219539 BH671 665 Ai624817 6E46661 1 
AI206344 AA574397 BF593413BG231271 BF773517 U67594 BF082180BE466420 AI887798 BF674385 AA204735 AW496808 AA204833 
AA207155 BI004756 AA206262 AI365204 H77608 AW59051 1 
AA338252AA338213 

603 



wo 03/042661 



PCTAJS02/36810 



43B243 
410704 



430988 
4Z7260 
433979 
426919 



412369 
400205 



2532601.1 
1054673J 



12371 15 J 
11272.50 
2076469.1 
347372.1 

52134.1 

1174403.1 

2538.1 



427298 115241.1 AA933717BR)61697AW6283Z7AA541788AM00495 

420218 191S47J AW9S8037 R42557 A1337047 AA948360 AI638005 AA459950 A1624915 AI638047 AI467858 A1521826 AA86O305 AI932315 AW003092 AW271756 
AW7793B0AA609S79A1634791 AI493770A1565211 Z41145AI627952AA303734BE349457AW19676SAA2S6527BE089727 

418049 12052.4 AJ314547NM 052868 B1494693AA835065At634477Al336678AIB07696BF477887AI701 147 Z391 87 R38979 F02234AA98471^ BI222234 
AV731417 R42408 H04996 T98498 R12489 R12S77 R42405 

433023 3970 8 BE999987BF438599AVV864793AI802899BE815132AW468888AI672189AI062004BF112024AA772335AVV? 
A1966683AA846676 AA927355 H80424AW973295 R88209 F2g868BE928871 

418866 24S947J T65754AA22g658AA2^B57 

427239 20459 2 AL532360 BE794750 AA582906 AI015a67 AW271034 BG271638 AW075177 AWD71374 AI345565 A1307208 BE138953 BEO490B6 AI334881 
AW075Q06 AW075181 AA4640ig AW302733 AW07S100 AWD73433 AI8D28S4 A)334909 AIB02853 At34503S AI348921 A)340734 AI307478 
AI2512a9 AW3D2327 AW072520 AI312145 AW073656 AW072513 AW071289 AI307559 AAB76186 T29587 AI307493 AI255068 AI252868 AI252839 
AW074809 AJ252926 A1252160 AI251 662 A1251 262 A1610913 AI270787 AI2701S6 AI252075 AW073459 AW072901 AW072496 AW071420 
AI305762 AI254764 AI802837 A)2S1264 AW073049 AW071311 A1340643BE1 38965 BE138S02 AW073456 AI334733 AI054335 BE139260 
A)054302AI054060A]054057AI053722AI269711BE139228AW470478AVV271039AW302085BE041872AI2W^ 
BF718773 8F71 8645 AW074868 BE857822 
437834 294580.1 BG110129 AW749287 BE535498 AW749299 AW749293 AW749302AW749298 AW749291 AW749294 AW749289AW749288 AW749296 
AA769294 AW749297 AW749295 AW749292 6E002573 
AI581311 AA781682 AA781678 

AW877458 AW677524 6E076922 BE166912 AWB40534 BE076754 AW797829 6E166905 BE166926 AW877462 6E166927 6E166932 AW877523 
6E166917 AW877529 BE1 66928 BF351394 AW877522 AW67752B BE166861 BE166866 BE166913 BE16e919 AW877456 AW877537 BE076865 
AW840571 

AW972830 AA489820 AA527647 AA570362 
AA401424AA400100AA663848 
NS04S4AA620999T16375 

BI917595A1203314AL041228AV727959 061361 D82004BI753157AA961066AI990307BF439651 AI453076A1376075AI014836Ai01830B 
AW163530AA393346AA935601AA628633AI150282AI028574AI217182AA431478AVV087473AVV900295 
AF075009R63109 R63068 
AW947655AW984020 

N^L006265 D38551 X98294 BM477931 BM461 566 AU123557 AU133303 AU134649 AW500421 BM172439 AWS00587 AW50366S AW50435S 
AW503640BM152454AW505260AI815984AVV504075AW500716AL597310BC001229BM474371 AA984202AU135205BE090841AW1 63750 
BF747730 BF898637 AI206506 AV660870 AV692110AW386830AV656831 N84710 AW993470 BF086802 BF758454 BG960772 BF757769 
81870853 BE018627 C75436 AW148744 BF757753 6G822067 BEg09924 AA708208 BG530266 BF968015 AW992930 BFB88862 BG536528 
AA143164 AW748953BG498922BF885190BF889005BRS4781 BF800003BM476529AI627668AWD28126AL046011 BF590e68AI017447 
AA579936 AI367597 AA699622 BE280597 AI124620 A1082548 AW27498S AA6n670 A1056767 BE551689 AA287642 H94499 Ar752427 AI65236S 
AW002374AW062651 AA360834N68822AU135442AU125960Z78334BE545ai3AI092115BF312771 BF242859 B6S33616 BG533761 
BG164745 BG492433 BM473183 AA172043 AA172069 AU157092 AU151353 AU155318 BE30221 1 A1375022 AA085641 AU157923 H88858 
AA1 32730 AA1 151 13 AA909781 AI475256 AA424206 AW572383 AW084296 AI184820 AI469178 AA782432 H92184 AA340562 BF195818 
AA852821 AW576342 AA827107 AA173317 AW180014 AI918514 AA729372 AA72971B AlOSSgSS AA331424 6E328601 AA5156g0 81018896 
AW528277 AA748388 AA626222 BG492636 AW380520 BF800058 AW370956 AA290909 R25857 BG952995 BF801 437 AA1 72077 AU 1 55890 
AU149783 A1720g04 AAg02936 AA865727 AI470830 AV740677 AA1 42982 AA482485 AU145485 AW576399 AU1 56042 R63448 BF246427 
BE928472 D25910 BF758439 BF988785 BE565238 AA3S5981 AI905&07 6G291148 BG533096 6G532888 BF030886 BG6137S6 BE928471 
BG574501 AA187596 AA361 196 T95557 6GS31446 BG527242 BG527513 BG61 1106 AA08599S BF847252 BG024608 BE540261 BG531236 
AL57g993 BG108733 BG483503 BG571032 BG492S05 
425331 1227464.1 AA427363 AW962128 AA355353 

437437 6087.1 BC009352 BC014630 AU131857 AL527140 AU131768 B1769362 BI753220 AU1 29886 AU128771 AA314135 AU1 26819 AI333799 AA479336 

AA258503 AL597351 A1359619 BG697218 BI254283 AI743846 AA236444 AA397533 AA247450 AI051464 AI224533 AU153442 AU151001 
AU152621AU151829AU1S3069AW26g958AU154195AI862754AI589780AW273839A)338155Al126632B£^^ 
AI625961 AI222288 AI280054 AA973329 Al 524262 Ai242371 AA295517 AI567885 AIS90681 AJ34661 6 AW247913 AI4220S1 AI475352 AI689531 
AW469308AW198034AA936939AU15105gAU148134AA486419AU151953At830986B1493265BI493264AU149861BE2M^ 
AW962827 BM48O30O AA226869 AL52936B 8M451957 AU132714 BIB71319 AA380739 Bt91 1351 6F795906 6E548853 AW579751 

430183 17316 1 AK055746AAO399O9BE183282 W6O721 AA464867AA398986T67280BF995651AI67S065BG001051BF764727BF^ 
BF173139BE010038 

430504 5477.8 BE219720 BF475241 A)5717236E21 9848 61789268 AI2248g9AA724884AVmi467AA480255AW84561 6 A1440295 H52800 66218790 AI681^ 

AW300064AW262133 H21568AI363015Ai884914 H86948 
428812 1383169.1 AAHOOOI AA431112AA432126 

409670 8882.8 AI625045AW504152A1469088AA905873AW504662AW136114AI927270BE041754A1646386AA66265^ Ar744934 

AI400147 A)381657 AA676551 AA974367 AW1 17437 A1570383 A)242456 A1274581 AA678138 R49939 AI393926 AA345854 AW6058S0 A18e9780 
AW391171 R77044 

418574 12009 2 AW955043AI990326AA776406AI016250AW451882AA843878BF916900AW945895AI979339 N23129 W70051^ 

409019 32320.4 Biyi480413 N28908 H39792 BE240826 BE682093 6E240827 AW868637 BF739795 AA700a34 AA769597 AA489668 AW968808 AW085196 

A)093280AI218457AA063138AI632958AW515005AI570S30Z41724AA748789AI696584AA062544AA773643A^ 
430935 15297 3 BC017923AA789302AW466994BF513878A!819642A1184913AW469044AI220572AW072916AI280239AM^^ 

AA489195 N59350 AA693435 B6531 204 AM84243 AV\^14092 
424677 2518.37 U09414 NM.003438 AA503545 AI022449 AA043458 AA766074 AA765442 AAB05052 AI02821 1 AW60970B 



TABLE 51C 

Pkey: Uniqua number oonesporafing to an Eos probesdt 

Rsf: Sequence source. The 7 (figH numben in this column are Genbank Idenfitor (Gi) numbeis. 'Dunham, et al' refers to the publication entitled The DMA sequence of 

human chromosome 22* Ounhanv et at. (1999) Nature 40248^95. 
Strand: Indicates DMA strand from wWch exons were predicted. 
NtjKJsltion: Indicates nudeottde positions of predicted axons. 





Rsf 


Strand 


Ntjxsilxjn 


406547 


7711513 


HAtmis 


172780-174358 


402145 


8018280 


Phjs 


113086-114800 


402199 


8576116 


KAmis 


84187-84744 


401435 


8217934 


Mmis 


54508-55233 


402408 


9796239 




110326-110491 


408367 


9256126 


Minus 


58313^9 


402680 


8113438 


Rus 


137634-137768,139702-139893.140475-14059 


402299 


6693370 


Plus 


23367-25175 


401091 


9958240 


Rus 


94760-94898 



604 



wo 03/042661 



401464 


6682291 


Minus 


17068^170834 


403989 


8569909 


Plus 


91Z97-oi379,3Z4u>329UO 


401837 


7630990 


Minus 


lOiAQC 191CCn I^IYOO 




JluO0£l 


Mnus 


1814^18766 


403137 


9211494 


Minus 


92349-92572.929S8-93084.93579-937l2.9394 


402679 


8113438 


Pius 


132079.132216 


403760 


8076989 


Plus 


33160-93409 


404071 


7210053 


Minus 


167354-167B59.168810-168920.16g00O-16910 


403242 


7637817 


^Anus 


11297-12511 


402260 


3399665 


Monjs 


113765-113910.115653-115765.116808-11694 


400587 


9887626 


Plus 


25435-25588.25668-25747 


405770 


2735037 


Plus 


61057.62075 


403532 


8076642 


^^^us 


81750^1901 


406137 


9166422 


Minus 


30487-31056 


402677 


8113438 


Plus 


22135-22309.23083-23238 


402678 


6113438 


Plus 


37395-37S14.3786&37981 



TABLE 52A: 

Pkey. Unique Eos probeset IdenGfier number 

ExAccn: Exernfilar Accession number, Genbanit accession number 

UnigenelD: Unigene number 



Unlaene Unigene gene Utte 






R1: 


Ratio of tesBcular cancer (non^semlnomatous and SemlnDmalous) compared lo nonnal adUlt le 


Siucular flss 


PItey 


ExAocn 


UnigenelD 


Unigene Tltie 




414438 


AI879277 


Hs.76136 


Otioredoxin 


51.77 


424247 


X14008 


Hs.234734 


tysozyms (renal amyloidosis) 




416680 




Hs79S16 


brain abundant, membraiw attactied signal 


49.20 


412948 


6E243313 


Hs.334851 


UM end 8H3 protein 1 


44.46 


438091 






nuclear receptor subfamily 1, group 1. m 


An Tfi 


406658 


AI920965 


Hs.77951 


msfor histocompaOblRty oomptex. class 


39.64 


418174 


L20688 


Hs 63656 


Riio GOP dissociaGon inhibtlor (GOf) bet 


J0.70 


409038 


T97490 


H5^002 


fiiTtflH faiducthtfi cvtoldna sufafamdlv A (Cv 

INIUlll UHJUMIMD IrJIWIWIV wMVlMilUtJ \WJ 


38.25 


413063 


AL035737 


Hs.75184 


cMQnase ^-Gke 1 (cartQage gtycopfote 


37.50 


430542 


"tow 


Hs.119122 


rihnsflmal ivclBin L13a 


37.22 


428928 




no. 


mdh^rin 1 Ivm 1 E^sdherin febHhsli 


35.98 


432730 






ESTs 


^.25 


444562 


/vMDu/ la 




RJKEN cDNA 9130422N19 fifina 


31.69 


446525 


AVV9OrU0!l 






31.33 


417088 


M54915 


Ks.81170 


pliivl oncogene 


31.20 


41B870 


AF147204 


Hs^14 


chemoidne (C-X-C motiQ, receptor 4 (lus 


29.93 


433800 


A)034361 


HS.1351S0 


lung type-i ceil membrane-associated gty 


29.35 


426295 


AW367283 




zinc finger protein 6 (CMPX1) 




406856 


AW515336 


Hs.29797 


ribosomai protein LIO 


28.93 


417139 


M69043 


Hs.81328 


nuclear factor of Icappa light polypeptid 


27.99 


440207 


A!371978 


Ks.1 28326 


ESTs 


27.75 


422578 


AF239666 


Hs.1545 


caudal type homeo box transcription fact 


28.95 


432359 


AA076049 


HS274415 


Homo sapiens cONA FU10229 fis, clone HE 


26.90 


420367 


AA259090 


Hs.257028 


ESTs 


26.50 


429978 


AA249027 




ribosoma) protein 86 


26.43 


440440 


Z28925 


Hs.7168 


sema dom^n, immunoglobulin domain (Ig). 


26.36 


423873 


BE003054 


Hs.1695 


matrix metaDoprateinasa 1 2 (macrophage 


2673 


412638 


NM_004415 




desmo(da)(in(DPl, DRI) 


26.15 


435538 


AB01154O 


Hs.4930 


low density lipoprotein receptor'ieieiBd 


25.25 


446899 


NM.005397 


Hs.16426 


podocalyxin-Hto 


25.25 


442562 


BE379584 




doQchyl-dipliosphooOgosaccharidei)rotd 


25.15 


406658 


M16714 


Hs.89843 


major hlstocompaGbiGty complex, dass 


25.13 


446619 


AU076643 


Hs.313 


secreted phosphoproteln 1 (osteoponBn. 


24.78 


423961 


013666 


Hs.136348 


pertostin(OSF-2os) 


24.48 


425543 


R23313 


h)s.334895 


ribosomai protein LI Oa 


24.38 


420676 


AI434780 


Hs.4248 


vav2oncogeno 


24.18 


406820 


AI223968 


Hs.108124 


ribosomai protein 84, X-ltriked 


23.96 


440859 


NkL014297 


Hs.7486 


protein expressed In IhyroU 


23.80 


447526 


AL046753 


Hs.303649 


small [ndudblQ cytokine A2 (monocyte ch 


23.56 


414587 


NM.004862 


Hs.76507 


U^induced TNF-atpha factor 


23.22 


446827 


A1973016 


HS.1S725 


)VpotheticalpraNn&B6146 


22.93 


449571 


AW016812 


Hs^266 


ESTs 


22.83 


413787 


A)352558 




tyrosine 3-monooxygenase/tryptophan 5-mo 


22.81 


410315 


A1638871 


Hs.17625 


Homo sapiens cONA: FU22524 fis. done H 


22.68 


414092 


Z14244 


Hs.75752 


cytochrome c oxidase subunil Vllb 


22.45 


422714 


AB018335 


Hs.1 19387 


KIAA0792 gena product 


22.45 


439180 


A1393742 


Ks.199067 


v^rb-b2 avian eiythroblasQc leukemia v 


22.30 


444784 


D124B5 


Hs.11951 


ectonudeoQde pyrophosphatase^k^hosphocfi 


21.69 


406648 


AA563730 


Hs.277477 


major htstocompatibility complex, dass 


21.58 


448588 


AI970278 


HsJ56905 


KIAA1676 


21.23 


433423 


BE407127 


H3.8997 


heal shodc 701(0 protein 1 A 


21.19 


429490 


AI971131 


Hs.23889 


ESTs, WeaMy similar to AtJU7JIUMAN ALU S 


20.70 


432606 


NM.002104 


HsJ086 


granzyme K (serine protease, granzyme % 


20.60 


407862 


BE548267 


Hs^7988 


Homo sapiens dMA aJ10934 lis. done OV 


20.57 



wo 03/042661 



PCTAJS02/36810 



420754 W79431 Hs.346911 r9»somal protein L22 2a40 

425769 U72513 Hs. 159486 Human RPII 3-2 pseudogenemRNA. complete 2ai5 

4248X AL035588 Hs.153203 MyoD fafrfly Inhibitor 20.10 

412915 AWD87727 Hs.74823 NKL004541:Ho(nosaptens NAOHdehydrogenas 2a01 

5 452322 BE566343 Hs2Q988 ^utaredoxln (thWlransferase) taB9 

410143 AA188169 KIAA11 91 protein 1141 

420759 T11832 Hs.127797 Homo ss«BenscONAaJ1 1381 lis. doRB HE 19.08 

430253 AK001514 Hs.236844 hypothetical protein FU1Q652 19.03 

425535 AB007937 Hs.158287 K1AA0468 gene product ia7B 

10 411573 AB029000 Hs.70823 KIAA1077 protein - 18.63 

452874 AK001061 Hs.3092S hypolheticat proton faJ10199 18.53 

408869 A1493591 Hs.78146 platel^endotheRalceladhesbnmolec 18.52 

421379 Y15221 Hs.1 03982 sma]llnducn))ecytoidne subfamily 6 (Cy 18.50 

426083 AW982712 Hs.126712 ESTs. Weakly similar to AF1910201 E2IG5 18.50 

IS 429183 AB014604 Hs.ig7955 K1AA0704 protein 1^48 

4500X A)952797 Hs.10888 hypolheGcai protein FU21709 18.44 

450377 AB033091 K1AA1265 protein 18.40 

430255 AK00O7O3 Hs.323822 Homo sapiens mRNA for KIAA1551 protein. 18.15 

440528 BE313555 Hs.7252 KIAA1224 protein 18.05 

20 444381 BE387335 Hs.283713 ESTs.WeaWy slmDartoS64054hypoIheU 17.98 

420028 AB0146B0 H5.8766 carbohydrate (N-acetytotucosdmlne-6^} s 17.80 

414682 AL021154 Hs.76884 bihibitof of DNA binding 3. dominant neg 17.75 

428782 X12630 Hs.193400 interteuUn 6 receptor 17.48 

415221 W07418 Hs.78225 annexlnAI 17.47 

25 429614 AI371172 Hs.211539 hypothelical proteb MGC4248 17.40 

418707 U97502 Hs.87497 butyropl^lin.subfamlly 3, member A2 17.30 

412025 Ai827451 H5.24143 WlstottV^ldrtch syndrome protein Interac 17.14 

417407 AA923278 H8.290905 ESTs. Weakly simlter to protease (Ksapi 17.13 

424326 HM.014479 H5.145296 ADAWMItedislntegilh protease, decysin 17.10 

30 446921 AB012113 H$.16S30 smaH inducibte cytokine subfamily A (Cy 17.03 

425996 W67330 hypothetical protein All 101 15 16.98 

402474 NM.004079:Homo sapiens calhepsin S (CTSS 16.98 

450937 R49131 Hs.26267 ATP-dependantlnterfefoniesponseprotei 16.98 

427521 AW973352 ESTs 16.93 

35 421181 N^L005574 Hs.184585 UM domain only 2 (rhomboOn^Ike 1) 16.93 

443523 AK001575 Hs.9536 hypothefical protein I=U1071 3 16.53 

449338 H73444 Hs.394 adrenomeduIBn 16.36 

429469 M64590 Hs.27 glycine dehydrogenase (decarboxytelbig: 16.23 

425945 AW410669 Hs.164280 solute carrter family 25 (mitochondiy 16i1 

40 430332 R51790 H3.239483 Human done 23933 mRNA seqpcnce 16.15 

427691 AW194426 Hs.20726 ESTs 16.13 

406786 AW161679 Hs.111334 (errilin. light polypepWo 16.11 

431639 AK000680 Hs.26617S phosphoprotein assodated with GEMs 16.10 

451106 BE382701 Hs,25950 N-MYC oncogene 16.09 

45 408380 AF123050 Hs.44532 diublqulfin 16.00 

445863 R12234 Hs.13398 Homo sapiens done 25028 mRNA sequence 15.93 

456236 AF045229 Hs.82280 regulator of Opiotein signalling 10 15.70 

408791 AI220684 K3^7939 hemoglobin, alpha 2 15.69 

414020 NM.002984 Hs.75703 small IndudWe cytokine A4 (homologous 15.64 

50 440273 AI805392 Hs.325335 Homo safto cONA: aJ23523 lis. done L 15.55 

417640 D30857 Hs.82353 protein C receptor, endotheria](EPCR) 15.55 

446108 AL036596 Hs.42322 A kinase (PRKA)andH)r protein 2 15.53 

410185 BE294068 Hs.737 immediate early protein 15.49 

422105 AI929700 Hs.1 11680 endosulfine eipha 15.23 

55 415899 X76992 Hs.78909 butyrflte response tedor 2 (EGF^esponse 15.23 

428227 AA321649 Hs.2248 small indudUe cytddne subfamily B (Cy 15.05 

427820 BE222494 Hs.l80919 Inhibitor of DNA binding 2. dominant neg 15.02 

426552 BE297660 Hs,170328 moesin 14.96 

422241 Y00062 Hs.170121 protein tyrosine phosphatase, receptor t ' 14.88 

60 436860 H12751 Hs.5327 PR0191 4 protein 14.85 

418509 AB028624 H5.85539 ATP synthase, H transporting. RiHodiondr 14.84 

444060 AA340Z77 Homo 8apten8cONAFU20167fis, done CO 14.78 

412623 R28898 H3.74170 metaQothionein lE(funetionaO 14.70 

408989 AW361666 Hs.49500 K1AA0746 protein 14.53 

65 425234 AW1S2225 Hs.165909 ESTs. VfeaUysbntter to 136022 hypolheO 14.48 

417144 AA382104 Hs.81337 tedn. gdadoside-Unding, sohAte. 9 14.31 

410325 AB023154 Hs.62264 K1AA0937 protein 14.23 

415938 BE363507 HS.7B921 A kinase (PRKA) anchor protein 1 14.20 

_^ 433412 AV653729 Hs.8185 CGM4 protein; sdlide dehydrogenase y 14.19 

70 418151 AA864238. Hs.83563 adlnietetedprateina3oon)ptex.8Ubun 14.18 

426996 AW966934 Hs.173108 Homo sapiens cONA:FU218g7fis. done H 14.13 

447211 AL161961 Hs.l7767 K1AA1554 protein 14.08 

417426 NM_002291 Hs.82124 laminin, betel 14.08 

__. 414420 AA043424 Hs.76095 immediate early response 3 14.04 

75 444051 N48373 K3.10247 acSvstedteucocyteceSadhesiinmdecu 14.02 

454413 A16S3672 Hs.40092 PNAS-123 13.93 

452651 A1218916 H5.30209 K1AA0854 protein 13.85 

450581 AF081513 Hs.2S195 TGF.bete4 13.85 

420962 NM„005904 Hs.100802 MAD (mothers against decapentaplegIc.Dr 13.78 

80 407112 AA070801 Hs.51515 ESTs, Weakly siniar to ALU7J4UMAN ALUS 13.63 

410598 A1817130 H3.9195 HonK)8aptenscONAaJ136S8fis.donePL 13.59 

428664 AK001666 H5.189095 steiltertoSALLI (sd(DrosophlteHike 13.57 

448412 AI2190B3 Hs.42532 ESTs. Ittoderately similar to ALU8J(UMAN A 13.53 
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430268 AX000737 Hs^480 hypolheScd prM aJ20730 13.43 

445055 BES128S6 Hs.109051 SHSdorrah binding QhitanifcacitM^ 13^1 

447534 AW953935 Hs.288655 ESTs 13J3 

408822 AW500715 Hs.57079 Honu) sapiens CONARJ13267fis. done OV 1131 

5 428065 AI534046 Hs.157313 ESTs 1330 

4252B9 AW139342 hs.15S530 intetfenin.8anin»lnducfltepnMi16 13.28 

436398 KB7136 H&5174 ribosomal protein S17 1118 

453856 AA804789 Hs.19447 P0Z-4JM protein mystique 1Z93 

452436 BE077546 HsJ1447 ESTs. Moderately simiiar to A46010X4in 12.90 

10 445817 NM.003642 Hs.13340 histone acetyltransfersse 1 12.90 

408437 AW957744 Hs^8469 lacrimal ptoDneridiprol^ 12^ 

435522 N64214 Hs.9774 synovia) sarcoma transtocaOon gene one 12.89 

415857 AA866115 Hs.127797 Homo sapiens cDNA FU11381 fis. done HE 12.83 

406743 AA911568 Hs.279860 tumor protein, lianstationaQy-controlie 12.79 

15 407951 W77762 Hs79015 antigen iden&fted by monodona) antitXKl 12.78 

435080 AI831760 Hs.155111 hypothetical protein FU 14428 12.75 

418299 AA279530 Hs.83968 integrtn,t)eta 2 (antigen C018(p95).iy 12.73 

430630 AW269920 Ks.2621 cystitln A (stetin A) 12.68 

409208 Y00093 Integrtn. alpha X(Bn6genC011C(p1 50), 12.65 

20 422511 AU076442 H3.117938 collagen, type XVII, alpha 1 12.50 

438915 AA280174 Hs.2a5681 VVlHlams-Beurensyndrorne chromosome ragi 12.48 

437374 At359571 Hs.44054 ninein (GSK38 interacting protein) 12.43 

433793 AW975959 Hs.107S13 ESTs. Moderately simHar to KIAA1056pio 12.43 

409963 AA133590 Hs.250857 caldum/calmodutin-dependent protein Mn 12.41 

25 412247 AF022375 Hs.73793 vascular endothefial growth factor 12.41 

413497 BE177661 0b:RC1-HTO5984y2O3OOO11-hO2HTO598Hon!O 1Z40 

436876 Ai124756 Hs.5337 isodtrate dehydrogenase 2 (NAOP).mitoc 12.38 

432409 AA806538 Hs.130732 KIAA1575 protein 12.33 

453020 AL162039 Hs.31422 Homo sapiens mRNA:cONADKFZp434M229(fr 12.33 

30 419384 AA490866 Hs.39429 ESTs 12.33 

410275 U85658 Hs.61796 transcripQon factor AP-2 gamma (acttvat 12.32 

432805 X94630 Hs.3107 CD97 antigen 12.32 

416975 NM.004131 Hs.1051 granzymeB(granzyme 2, cytotoxic T-iymp 12.25 

450719 AI096837 Hs.21349 ESTs. WeaMy similar bRB8BJHUMANRAS-R 12.13 

35 423753 Y11312 Hs.132463 phosphdnositide-3-kin8se.dass2.t)ela 12.12 

418460 ii426315 H5.85258 COS antigen, alpha poIypeplId8(p32) 12.03 

402145 Target Exon 12.01 

407179 AA206465 thymosin, beta 4, XGhramosome 12X)0 

433208 AW002834 Hs.24095 ESTs 11.95 

40 447735 AA775268 Hs.6127 Homo sapiens cDNA: RJ23020 fis, done L 11.90 

408912 AB011084 H3.48924 KIAA0512 gene product; A1£X2 11.83 

422068 AI807519 Hs.104520 Homo sapiens cDNARJ13694 fis. done a 11.75 

431427 AKQ00401 Hs.252748 Homo sapiens dlNAFU20394fis. done KA 11.76 

427761 AA412205 H8.140996 ESTs 11.68 

45 449246 AW411209 H5.23363 hypothetical protein FU10983 11.58 

436075 BE090176 Hs.179902 tiansporter-lilte protein 11^ 

440774 AI420511 Hs.153934 ESTs 11.35 

430594 AK000790 Hs^46885 hypothetica! protein nJ20783 11.25 

419223 X60111 Hs,1244 C09 antigen (p24) 11.08 

50 424528 AW073971 HsJ238954 ESTs, Weakly similar to WAAI 204 protein 11.08 

444656 Ai277924 Hs.145199 ESTs 10.98 

420943 AI71B702 Hs.279930 major histocompatibiQty complex, dass 10.98 

450294 H42587 K5i38730 hypothetical protein M6C10823 10.92 

413688 A1469213 Hs.71404 ESTs 10.83 

55 406701 AA780613 Ks.62954 ferritin, heavy pdypeptide 1 10.78 

424687 J05070 Hs.1S1738 matrix melaOoprt^einase 9 (gelalinaseB 10.75 

407252 AA659037 Hs.163780 ESTs 10.75 

445929 Ai089860 Ks.323401 dpy-304ike protein 10.70 

451864 N20370 Hs.69547 ESTs 10.69 

60 429307 AU076592 Hs.198951 jun B proto.onoogene 10.64 

434280 BE005398 gb:CM1-BN0116-150400-189.h02BN0116Homo 10.63 

447519 U46258 Hs339665 ESTs 10.63 

417365 050683 Hs.82028 transforming growth factor, beta recepto 10.59 

418945 BE246762 Hs.89499 aradiidonate ^lipoxygenase 10.55 

65 406776 T16206 Hs.237164 ESTs. Highly simiter to LDHHJ1UMANL-UC 10.54 

437103 AW139408 Hs.152940 ESTs 10.50 

449961 AW265634 Hs.1331O0 ESTs 10.50 

441244 BE612935 Hs.1840S2 PP1 201 protein 10.49 

450139 AK001838 serunV)iliicooGriicoid regulated Idnase 10.48 

70 427202 BE272922 Hs.173936 InterieuMn 10 receptor, beta 10.48 

449944 AF290512 HS.5B215 Homo saptens.Simlter to ritoteldn. done 10.47 

446682 AW205632 Hs.211198 ESTs 10.43 

413886 AW958264 Hs,103832 simflar to yeast UpD. variant B 10.43 

430068 AA454964 gb3x80f10.s1 Soares ovary tumor NbHOT H 10.40 

75 424950 AA602917 Hs.156974 ESTa 10.40 

434442 AA737415 ESTs 10.33 

438089 W05391 nudear receptor subfamily 1, group I, m 10.33 

432559 AW452948 Hs.257631 ESTs 10.30 

„^ 414191 AW250089 Hs.75807 POZ and UM domain 1 (elfin) 10.30 

80 434649 AA738254 Hs.165390 ESTs. Highly similar to A40350transalp 10.28 

424321 W74048 Hs.1765 lymphocyte^pedfic protein tyrosine Mn 10.27 

452568 AA805634 Hs.300870 HomosaptensmRNA;cDNADKFZp547M072(ir 10.27 

419490 NM.006144 HS.907D8 granzyme A (granzymel, cytotoxic T^ 10.23 
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445245 AB032973 

445488 AB037782 

410611 AW954134 

425875 AU077333 

416926 H03109 

420225 AW243046 



445577 
411975 
447644 
408784 



N40696 
AI916058 
AW861622 
AW971350 



444795 Ai193356 

407110 AA018O42 

400440 X63957 

414829 AA321568 



427711 
426827 
456362 
446795 
424201 
422627 
419904 
451129 



M31659 

AWD67805 

AW973003 

AI797713 

L33075 

BE336857 

AA974411 

BE072881 



414405 Ai362533 

418840 A»21614 

453716 AA037675 

415323 BE269352 

415189 L34657 

408380 AI806090 

423024 AA593731 
434423 



Hs.12461 

Hs.15119 

Hs^4 

Hs.160483 

H5J263395 

Hs.282076 

Hs.t37064 

Hs.144583 

Hs.108646 

Ks.63386 

H3.160316 

HS.2S208S 

Hs.83870 

H5.77436 

Hs.180408 

Hs.172665 

Hs.179909 

Hs.155471 

Hs.1742 

Hs.118787 

Hs.16672 



Hs.185831 

Hs.152675 

Hs^49 

Hs.78146 

H8.44344 

Hs.325823 



437469 

416078 AL034349 

410397 AF217517 

422603 BE242587 



KW.006769 Hs.3844 
AW753112 Hs.15514 



434524 
422960 
414774 
411960 



AA63S931 
AW890487 
X02419 
R77776 



428818 AI131291 
408161 AVV952912 



Hs.79005 
Hs.63042 
Hs.118651 

Hs.249716 

Hs.77274 
Hs.18103 
Hs.102308 
Hs.3003e3 
Hs7854 
Hs.14317 



441455 AJ271671 
433271 6E621697 

436823 AW749865 Hs.1 17077 

427968 A1857607 Hs.181301 
AF161486 



420059 

410730 AW3688&0 

431958 X63629 

417315 A10e0042 

421098 A1597901 

422689 AW856665 

433156 R59206 



Hs.94769 

Hs.2877 

Hs.160450 

Hs.192425 



425246 
440268 
414821 
407254 



Hs.17519 
H5.155321 
Ks.336959 
Hs.77424 
Hs.181165 
HS.171B25 
Hs.95140 
Hs.152983 
88.165960 
Hs.195841 
Ks.31314 
AW967807 Hs.13797 
AW373713 Hs.146324 
AW294522 Hs.l49991 
AW971146 Hs.293187 
AA807346 Hs.288581 



AI085561 
BE270030 
M63635 
AW129401 
6E24S550 
420099 D80011 
424768 AA353895 
441438 AW137772 
448019 AW947164 
437886 BE264111 
430556 
450147 
442808 
431167 
449971 
417018 
422451 
419839 
409493 
432314 
414591 
415825 
429500 
420337 



M16038 

AA310753 

U24577 

AA386192 

AA533447 



Y18024 
X78565 



Hs.80887 

Hs.42491 

Hs.93304 

Hs.193482 

H8.31238g 

Hs.55902 

H3,7B877 

Hs.289114 



452679 Z42387 
437100 AA434054 



AW29S840 H8.14S55 
Hs.83883 
HS.B0624 



417228 
425593 



AL134324 
AA27e921 



422618 6E300330 
436380 AW502384 



Hs,7312 
HS.190B 
HS.11872S 



429109 
444933 



AL008637 Hs.1963S2 
NA1.01624S HS.121S0 



LCHNpralein 
KIAA1381 protein 
KIAA1628 protein 

erythrocyte membrane protein band 7.2 (8 
HT01 8 protein 

Komo s^ens mRNA for KIAA1G50 protein. 
cytop)8snife polyadenylation etemenl bind 
ESTs 

Homo sapiens cONA aJ14S34 6s. dons PL 

ESTs 

ESTs 

PradeT'WOIi/Angelman syndrome-S 

nebuiin 

pteckstitn 

sohite canier family 25 (mib)chondriat 
methyienetetrahydrofolate dehydrogenase 
iiypotlieiical protein FU22995 
ESTs 

IQ mo^ containing GTPase actlvaling pr 
transfomiing growth factor. betEhinduced 
ESTs 

gb:RC2-BT0548-200300^12<09 BT0548 Homo 

KIAA0306 protein 

ESTs 

ESTs 

neutrophil cytosolk: factor 2 (65kO, chr 

platetet^othenal cell adhesion motec 

hypothetical protein FU20534 

ESTs, IWIoderatelysImilarto ALU5JHUMAN A 

UM domain only 4 

hypothetical prot^ M6C3260 

protein tyrosine phosphatase, receptor t 

DKFZp564J157 protein 

heroatopoleticdly expressed homeobox 

ESTs 

cadheiin 13, K<»dherin (heart) 
plasminogen ecthrater. urokinase 
ESTs 

potassium bwardl^rectifying channel, s 
hypothec protein MGC3032 
zinc/iron regulated transporter-liice 
nudeoter protein family A, member 3 (H/ 
ESTs, VtesMy similar to 136022 hypdheO 
calhepsinS 

RAB23, member RAS oncogene family 
DnaJ (I4sp40} homolog. subfamily B, membe 
cadherin 3, type 1. P-cadherin (plaoente 
ribosomal praldn 824 
ESTs 

gb:RC3CT)297-2901004)13<i03 CT0297 Homo 
HomosaptenscDNA: FU22539fis,doneH 
serum response factor (c-fos senim respo 
Homo sapiens, done IMA6E:3677165, mRNA 
Fc fragment of IgG. high afRnity la^ re 
eukaryoQc translation etongafton factor 
basic heOx-loop-herix domain conteMno 
KIAA01 89 gene product 
HUS1 (S. pon(be)checkpoinl homolog 
ESTs 

ESTs. Moderately similar to 138022 hypot 
rotlnoUastoma^iinding protein 7 
ESTs 

CX;M45 protein 

ESTs 

ESTs 

Homo saptens cDNA FU14296 lis, done PL 
v-yes-1 YamagucH sarooma viral lelated 
ESTs. Weaidy slmiiar to S65657 aipha-1 O 
phoqihdipase A2, group VII (piatetet^ 
Homo saptens cOHA FU11903 lis. done HE 
ESTs 

ESTs. Weakly similar to ALU8.HUMAN ALU S 
inositd 1,4.5-tnsphosphate 3-ldnase B 
hexabrachion (tenasdn C. cytotectln) 
HoRio saptens cDNA: FU21513Gs.doneC 
transmembrane, prostete androgen induced 
hypothetical protein MGC2560 
ESTs 

proteoglycan 1, secretory granute 
setenophosphate synthetase 2 
gteUI-HF^ROp^l-IZmrl NIHJAGC.5 
neubophU cytosolic factor 4 (40kQ) 
reOnal shortchain dehydiogenaseftediict 
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10.15 
10.14 
10.07 
10.05 
10.04 
10.03 
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9195 
a93 

a90 

9.90 
9.88 
9.68 
9.85 
9.83 

a7a 

9.67 
ft65 
a63 
a63 

ass 
a53 

9.50 

aso 
a4s 

9.45 
9.43 

a43 
a43 

9.42 

as? 

9.37 

a36 
ass 
a32 

9.30 

a28 

9.28 

a27 

9.27 
9.25 
a23 

a23 
a23 
ai8 
ai6 
ai8 
ai8 
ai7 
ai7 
ai5 
ai4 
ai3 

9.12 

aio 

9.08 
9.08 

ao8 

9.08 

ao7 
ao5 
ao5 

9.05 
9.03 
9.03 
9.02 
9,00 
a99 
8.98 
8.95 
8.94 
8.93 
a90 
8.90 
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430592 
445612 
427254 
428970 
425190 
430162 
421684 
446659 
447198 
437457 
401091 
442832 
442495 
428467 
449924 
447674 
425580 



AJ224e64 

N94126 

AL121523 

BE276691 

AW028302 

AW450843 

BE281591 

AI335361 

061523 

AA757900 

AW206560 

AI184717 

AK002121 

W30681 

BE27G640 

L11144 



449655 AA002008 
412093 BE242691 



Hs.9688 

H5.12969 

Hs.97774 

Hs.194691 

Hs.155079 

Hs.346348 

Hs.106768 

HS.226376 

Hs.283435 

Hs^70823 

Hs.253569 

H3.1844S5 

Hs.146233 

Hs.igi92 

Hs.1907 

Hs.188633 

H8.14947 



407833 
411979 
437134 
430333 



425284 
441623 
442622 
441021 
446630 
417621 

433655 AL036559 



AW955632 
X85134 
AA349944 
S70114 
AI979168 
AF155568 
AA315805 

NM_000435 Hs.e546 
AW576716 Hs.7644 



H5J2984 
Hs.42915 
HS239489 
H8.344096 



449335 
446975 
436797 
414662 
414601 
406699 
443884 
442821 
418522 
435968 
440327 
409327 
435684 
430299 
427523 
407151 
448094 
421395 
431574 



AW384793 H5.15740 
AV654694 Ks^2316 
Hs.3463 
Hs.345728 
HS.16G95 
Hs.334477 
Hs.76807 
Hs.301417 
Hs.182979 
H5.194397 
Hs.8752 
HS7149 
Hs.111577 
H3.191146 
Hs.53563 



AW1 50717 
BE246446 
AA731491 
ALi)36058 
AV660804 
LD6505 
N20617 
86391929 
AA605038 
AW1614B1 
R12581 
U1162 
NM>001290 Hs.4980 
W26673 H5.106747 



6E242779 

H2S836 

H24387 

D90084 

AW5726S9 



412645 AW444433 
423523 AW299828 



426759 
426780 



A1590401 
BE242284 



426215 AW963419 
435748 AA899756 



443351 
447500 



AW016763 
AI3dig00 



407949 W21874 
428728 



HS.17M26 

Hs.301527 

Hs^2061 

Hs.1023 

H&261373 

Hs.136061 

Hs.193580 

H3.21213 

Hs.172199 

Hs.155223 

Hs.117335 

Hs^799 

Hs,159212 



434511 

411852 

424875 

419378 

449523 

425277 

451631 

443303 

407013 

429999 

445493 

413420 

422392 

453485 

434159 

432666 

430915 

425913 

448776 

438763 

435905 

408663 

427395 



NM.016625 H5.191381 

R28982 Ks.18108 

AA528140 Hs.107515 

Ai187945 Ks.199310 

R24922 Hs.90078 

NM.000579 HSJ4443 

NM.001241 Hs.155478 

NM_001674 Hs.460 

U87319 Hs^216 



U35637 
AI761902 
AI915771 
AW410235 
NM_005908 Hs.1 15945 
BE620712 KS33026 



H5^870 
Hs^S97 

Hs.75348 



AW135214 

AW204059 

AA488953 

AA365799 

BE302464 

AI583207 

AW997484 

U246B3 

AW298741 



HS.19182B 



Hs,30057 
Ks.99029 
Hs.5003 

Hs.97881 



leukocyte membrene anGgen(IRC1) 8.83 

hypotheOcd protdn 8.80 

ESTs 8.80 

reOnolc acid induced 3 8.80 

pnrt^phosphadas82,regulalofysubuni 6.79 

ESTs B.75 

hypothetical pro(einaJ10511 6.73 

ESTs 8.73 

ESTs 8.73 

ESTs, Weakly simflar to S656578tpha-1& 8.70 

decay accelerating factor for oomplemenl 8.68 

ESTs 8.68 

ESTs 8.63 

ttypolhetica} protein FLI11259 6.63 

Homo sapiens cDNA:FU22130fis. clone H 8.61 

cyciiivdependent kinase 2 6.59 

galanln 8.55 

ESTs 6.55 

ESTs 8.54 

ESTs. Weakly sirrater to S19560prolln^r 8.54 

retlnc^aslonia-tjincfing protein 5 8.53 

ARP2(actirHeIated protein 2» yeast) ho 8.51 

TIAlcyto(oxkigranUle4SSOcialedRNA4)i 6.45 

glyooprotein (transmembrane) nnd> 6.45 

NSI-associatedprotetol 145 

desmoflleln 2 8.43 

Notch(Dn3sophiIa)homolog3 8.42 

H1hi5tonefamily.member2 6.40 

Homo sapiens mRNA;cONADKFZp434E033{fr a40 

interferon^uced. hepatiOs Oassodat 6.35 

ribosomal protein S23 6.33 

STAT induced STAT inhibitor 3 6.32 

ubiquItin-acUvaling enzyme El-fflce 8.31 

hypothefk^al protein M6C14879 6.30 

me^or histocon^JsGbirity complex, dass 8.30 

AHI^ nudeoprotein (desmoydWn) 6.29 

ribosomal protein L12 &'28 

teptln receptor 6.28 

transmembrane protsin 4 8126 

Homo sapiens cDNA: FU219S0 fis. done H 6.24 

integral membrane protein 3 &23 

ESTs 6.23 

collagen, type iX, alpha 3 6.22 

Ul^doman binding 2 6.16 

serine carboxypeptklase 1 precursor prot 6.15 

upregulated by 1 ,25Klihydroxyvitamin D-3 6.14 

ESTs, Moderately similar to unknown (H.8 8.13 

ESTs. Weakly similar b 138022 hypotheti aiO 

pyruvate dehydrogenase (lipoamkle) alpha 6.09 

hypothetk^t protein dJ434014.3 8.06 

Homo sapiens, SbnBar to hypothetalpr 8.07 

ESTs 6.03 

ESTs 8.03 

adeny1atocydase7 8.03 

stannk)catein2 8.03 

ESTs 6.(X) 

Homo sapiens cONA FU13471 lis. dona PL 6.00 

ESTs 6.(X) 

ESTs.Weakly8imilartD210926OABcen 6.00 

hypothetica] protein 8.00 

ESTs 7.99 

ESTs, Weakly similar to T00329hypotheti 7.98 

ESTs 7.95 

nucleotki&«ugar transporter slmOar to 7.95 

chemddne(&C motif) receptor 5 7.93 

cydInT2 7.91 

acGvating transcriptton factor 3 7.90 

caspase 7. apoptosis-related cysteine pr 7.90 

gb:Human nebufin mRNA. paitd cds 7.90 

ESTs 7.90 

mela!tot)donein1E(functtonaO 7.89 

pfoteasome (prosome, macropabi) acOvato 7.68 

mannostdase. beta A. lysosomal 7.88 

hypothe&al protein PP2447 7.87 

ESTs 7.85 

ESTs. Weakly sfarflar to unnamed protein 7.83 

gb:aa55e05.r1 Na„CX3AP.GC81 Homo sapiens 7.83 

SEC22.vesWetraffiddno protein (S.c 7.80 

MRS2(S.cerevtslae)^ike, magnesium hom 7.80 

CCAAT/enhancer bintfing protein (OEBP), 7.79 

K1AA0456 protein 7.78 

Immunogtobunn heavy constant mu 7.78 

ESTs, Moderately slmBar to 138022 hypol 7.78 
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446272 BE288912 Hs.14G0t h8matopQlel)cce&«peci})cLynsubstrst 7.7$ 

438962 BE046594 gb:hn41c11jt1 NCLCGAF_RDF2 Homo sapiens 7.75 

434983 AW974957 Hs:!86719 Homo sapiens cDNAFU12142fis, done MA 7.73 

422900 AA641201 Hs.222051 ESTs 7.73 

5 432598 AI341227 Hs,157108 ESTs 7.72 

449322 A1638616 Hs.196566 ESTs 7.71 

416987 D86957 Hs.80712 iaAA0202 protein 7.67 

4100)0 BE2B0421 Hs.94499 ESTs 7.67 

416801 X98834 Hs.79971 sal (DrosopMa)-Uke 2 7.67 

10 437442 T65104 Hs.222779 ESTs, Moderately sindlsr to similar to N 7.65 

407137 T97307 gb:ye53h05.s1 Soaresfetallivef spleen 7.65 

401466 vestde^assodated membrane protein 4 7.65 

408870 AA075144 gb3in86l06^1 Siratagene ovarian cancer 7.64 

408558 AW015759 HS.2357D9 Homo sapiens mRNA; cDNA DKFZp667B071 1 (f 7.63 

15 4572S0 AA811987 H5.125779 ESTs 7.63 

412949 A1471639 Hs.71913 ESTs 7.63 

406819 AA906472 gb:og82a10£l Na.C6AP.0v8 Homo sapiens 7.62 

441612 AI802629 H5.113560 Homosa|)tenscONAFU11631fi5,doReHE 7.62 

414799 AI752416 Hs.77326 kisuHn^ growth factor binding prole 7.61 

20 435937 AA830893 Hs.119769 ESTs 7.60 

447197 R36075 gb:yii88bOU1 Soares placenta Nb2HP Homo 7.60 

407719 AW963866 H&44021 Homo sapiens mRNA for FU00065 protein. 7.60 

417336 R70429 HS.8198B disabled (DfOSopMla) homotog 2 (mBogen 7.56 

418134 AA3977e9 Hs.86617 ESTs 7.55 

25 451812 X81689 Hs.152151 plakopliilin 4 7.55 

412347 AW970026 HS.7361B ubiquinol-cytochromecfBdudasa hinge p 7.54 

429083 Y09397 Ks.227817 BOZ-related protein A1 7.54 

414004 AA737033 HS.71S5 ESTs, Moderately similar to 211 5357ATYK 7.52 

423905 AW579960 Ks.1351S0 king type^ cen membrane>assodated gly 7.52 

30 407784 AW139585 Hs.12708 ESTs 7.52 

425762 BE244076 H3.159578 AT-fwok transcription factor AKNA 7.50 

418452 BE37974g Hs.85201 C-type(ca!dum dependent, carbohydrate- 7.50 

438459 T49300 Hs.35304 Homo sap]eracDNAFU13655fis. done PL 7.48 

446013 AI360167 Hs.152774 ESTs 7.48 

35 429281 AA830856 Hs.29808 Homo sapiens cDNA: RJ21 122 Its, done C 7.46 

415526 N76536 Hs.265591 ESTs. Wealdy similar to ALU IJiUMAN ALUS 7.45 

417450 AA314435 Hs.17519 Homo sapiens cDNA: RJ22539 lis, done H 7.45 

431773 BE409442 Hs^68557 pled(slrinhomoiogy-lil« domain, family 7.44 

447082 T85314 Hs.54629 tWoredoxl^Ulw 7.43 

40 441962 AW972542 Hs.289008 Homo sapiens cDNA:r^J21814 lis, done H 7.43 

429056 AF138863 Hs.35254 hypotheOcal protein Fl^6421 7.43 

439971 W32474 Hs.301748 RAP2A. member of RAS oncogene f amity 7.43 

442233 AW9S7149 Hs.28439 ESTs, Wedldy similar to 138022 hypotheQ 7.43 

436394 AA531187 Hs.126705 ESTs 7.39 

45 452248 AA093668 Hs.28578 musdebOnd (I3rosopMlaHi(e 7^ 

446258 A1283476 Hs.263478 ESTs 7.38 

410570 AI1330g6 Hs.64593 ATP synthase. Htran5poi€ng,m!tDdiondr 7.37 

447484 AA464839 Hs.292566 hypothetical protein RJ 14687 7.36 

435541 AAfi87361 Hs.221318 ESTs 7.35 

50 453932 AWD08303 Hs.329296 ESTs, Weddy similar to (deiiinenolava 7.35 

408067 6E244580 Hs.342307 hypolhetica) protdn HJ10330 7.35 

427307 AF117947 Hs.174795 POZdomalrHxmtainIng guanine nudeoBde 7.35 

418336 BE179882 glutathione peroxidase 3 (plasm^ 7,35 

448877 AI583696 Hs.253313 ESTs 7.35 

55 443195 BE148235 Hs.193063 Homo sapiens cDNA FU14201 fis. done KT 7.35 

444838 AV651680 Hs.208556 ESTs 7.33 

422693 6E300073 Hs.279860 tumor protdn, tianslallonall^oontrolls 7.31 

424677 U09414 zinc finger proteto 137 (done pH^SQ) 7.30 

441878 AI801869 H3.127982 ESTs 7.29 

60 406542 C19000728*:gII12585552|sp|Q9Y2Q1|Z257.HU 7.28 

408418 AW963897 Hs,44743 KIAA1435 protdn 7.28 

425367 BE271188 Hs.1 55975 protdn tyrosine phosphatase, receptor t 7.26 

442492 AA528489 Hs.234518 ribosomal protein 123 7.25 

424541 AW392S51 Hs.l60559 ESTs. Weakly simiiar to AS61 94 thromboxa 7.25 

65 452852 AK001972 Hs.30822 hypothetted protein FU11 110 7.25 

425501 AW043782 Hs.293616 ESTs 7.25 

411251 R19774 Hs.22835 HH6P protein 7.25 

444870 H58373 Hs.332938 hypothelkal protdn MGC5370 72S 

„^ 418117 AI922013 H5.e3496 linker for actlvaSon of T cells 7.24 

70 441384 AA447849 Hs.288660 Homo sapiens cONA:FU22182 Us. done H 7.24 

434817 AA082118 Hs.102737 gdtath protdn 7.23 

419970 AW612022 ESTs 7.23 

432290 AK001099 Hs.274273 Homo sapiens cONAFU10237 Us. done HE 7.23 

426647 AA243464 Hs.294101 pre-B-ceO leukemia transcription fador 7.23 

75 433891 AA613792 gb:no97li03.sl Na^CGAP_Pr2 Homo sapiens 7.21 

454038 X06374 Hs.37040 plaldet-derived growth tador dpha pd » 7.21 

430314 AA369601 HS.23913B pre^ncdony-enhandngfador 7.20 

443247 BE614387 Hs.333893 oMyc target JP01 7.20 

_^ 441224 AU076984 Hs.7753 caiumenin 7.18 

80 447188 H65423 H5.17631 hypothetical protdn DKFZp434E2135 7.18 

447687 AA114050 Hs.19949 caspase 8, apoptosis^daled cysteine pr 7.15 

447341 AF106941 Hs.18142 arrestin.beta2 7.15 

408113 T82427 Ks.194101 Homo88pien8cONA:FU20869fis.doneA 7.14 
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418696 AWg59433 Hs.326290 hypO&ieBcdpRilefaiFU12SB1 7.14 

434699 AA643687 H5.149425 Homo sapiens cONAFUl 1980 fls. done KE 7.13 

421633 AF121660 Hs.106260 sorting neidn 10 7.10 

410668 BE379794 Hs.159651 hypoMcal pratsin 7.09 

5 435812 AA700439 H5.188490 ESTs 7.08 

414476 AA301667 Hs.76224 EGF^tadng OnMkd extrecdliila 7.08 

408331 NM_007240 Hs.44229 dua!8peQ8dtyphospha!as8l2 7.08 

417165 R80137 Hs.302738 Homo sapiens cONA:FU21425 6s. dons C 7.06 

408805 AF025374 Hs.46465 T^O, Immune regulator 1 7.06 

10 416401 N80139 Hs.266916 ESTs 7.05 

415799 AA653718 Hs.225841 DKFZP434D193piotBin 7.05 

415995 NM-004573 phosphdip8seC.beta2 7.05 

414612 X72755 Hs77367 nvinoklne induced by ganvnainteifBron '7.05 

417535 AA203569 Hs.191482 ESTs 7.04 

15 449567 A!990790 Hs.1B8614 ESTs 7.03 

429355 AW973253 Hs.292689 ESTs 7.03 

442460 NM_014135 H$.8345 PRO0541 protein 7.03 

453187 Ai161383 Ks.34549 ESTs. HigMy similar to S94541 1 done 4 7.03 

430260 AA3612S8 Hs.237866 Intsrieukin 7 receptor 7.03 

20 426124 A)268389 Hs.250697 phosphaQdylinosItolg)ycan.das8F 7.02 

442685 AB033O17 Ks.6594 KIAA1 191 protein 7.01 

433735 AA608955 Hs.109653 ESTs 7.00 

416003 X98001 Hs.78948 Rab geranylgeranyltrBnsfersse. beta subu 6.98 

424415 NM_001975 Hs.146580 enolase 2. (gamma, neurond) 6.96 

25 416655 AW966613 Hs.79428 BCL2/adenovinjsE1B 19k[Mnteraciingpro 6.95 

409956 AW103364 Hs.727 inhibin. belaA(adiv(nA, adMnABa 6.9S 

407136 T6489S Hs.2874:»> HofflOsapienscONAFU115331is.doneHE 6.93 

425235 AA353113 Hs.112497 Homo sapiens d)NA:FU22743 lis. done H 6.93 

451853 W16193 ESTs. Ittoderately similar to HERC2[H.sap 6.93 

30 439444 AI277652 K5.5457B ESTs, Weakly similar to 138022 liypolhetl 6.93 

451838 AW005866 Hs.193969 ESTs &91 

436612 AW298067 gb:U141-BW0-ai)>^9M1.8l NCI.CGAP.Su 6.90 

443749 R36828 Hs.143463 ESTs 6.90 

434584 D57341 Hs.188361 Homo sapiens cONAFU12607 lis. done NT 6.90 

35 427919 AA173942 Ks.326416 Homo sapiens mRNA:cONAOKFZp564H1 91 6 (f 6.90 

431840 AA534908 Hs.2880 POU domain, dass 5. tratscripllonfecto 6.90 

435655 AW105863 Hs.6947 HSPC069 piotein 6.90 

427640 AF058293 HB.18roi5 (Vdopadtfome tautomefsse 6.85 

418259 AA215404 ESTs 6.85 

40 407244 M10014 fibrinogen, gamma pdypepQde 6.85 

418832 X04011 Hs.88974 cytodirome b-245, beta pdypepUe (duo &83 

441321 H17182 Hs.7771 Bh^ assodated protein 6.80 

433162 AI025842 ESTs &80 

425410 AA310974 Hs.156828 Homo sapiens CONAFU10522 lis. done NT 6.60 

45 434372 AA531373 gb:np86d)1.s1 NCLCGAP_Thy1 Homo sapiens ^80 

456629 AW891965 histone deacetytase 3 6.78 

430283 BE391688 RAB7, memberl^S oncogene MIy 6.77 

418300 AI433074 Hs.6&682 Homo sapiens cDNA: FU21578 (is. done C 6.76 

406858 At865720 Hs.29797 iteomalprat^LIO 6.75 

50 429582 Ai56g068 Hs.22247 ESTs 6.75 

401 113 sdule canfer family 22 (orgadccatton 6.75 

449576 AW014631 Hs.225068 ESTs &75 

432588 X92715 Hs.3057 zinc linger protein 74 (0)s52) 6.72 

417558 AF045229 Hs.82280 regulator of (Hvotein signalling 10 6.72 

55 430451 AA836472 Hs.297939 calhepsinB 6.72 

410503 AW975746 Hs.188662 KIAA1702 protein 6.70 

415682 AB47128 Hs.191870 ESTs 6.70 

410102 AW248508 Hs.279727 ESTs:homdogueof PEM^lOonasavignyi 6.70 

414217 Ai309298 Ks279898 Homo sa{denscONA:FU23165 lis. done L 6.70 

60 457073 AA233210 Hs.179943 ribosomalpratofaiLII a69 

442232 AI357813 Hs.337460 ESTs. Weakly similar to A47582B^ngr 8.68 

436137 AI056769 Hs.133512 ESTs ^68 

425787 AA363867 Hs.155029 ESTs &67 

437802 AI475995 Hs.122910 ESTs &65 

65 432636 AA340864 HS.276562 daudto7 6.65 

407340 AA610168 Hs.284289 viffligo^ssodaltod protein VTT-I 6.65 

418036 Z37976 Hs.83337 talent transtbnning growth IMrbetob 6.65 

423494 AV\^6S Hs.24143 WIskottAtdridisyiNlromepretoh Inters 6.63 

441355 AI822034 H8.137097 ESTs 6.63 

70 430968 AW972830 gb:EST384925 MAGE reseqiienoes.MAa Homo 6.63 

434551 BE3S7162 H$.280856 ESTs. Highly similar to A35661 DMA exds 6.63 

447232 AW499834 Hs.327 hterteuUn 10 receptor, alpha 6.62 

422310 AA316522 Hs.98370 cytodvome P450. subfamily IIS, polypept 6.60 

449217 AA278536 K8.23262 ribonudease. RNasa A famity, k6 6.60 

75 449057 AB037784 Hs.22941 KIAA1363 protein &60 

446979 AIS54443 Hs.197683 ESTs 6l60 

452382 N38902 Hs.211539 hypothetical protein MGC4248 6.60 

424868 A1568170 Hs.96866 ESTs 6.59 

„^ 409485 860990 H5.252136 ftcoTn (cdlagenMnogen domabvcont 6.58 

80 451603 BE379499 Hs.173705 Homo sapicnis cONA: FU220S0fis, done H 6.58 

426158 NM_001982 Hs.199067 v«rb^2 avian eryUvoblastictoultemiav 6.58 

452472 AW957300 Hs.294142 EST8,VfeaklysimBartoCS5663dlgodend 6.57 

450256 AA28SB87 Hs.24724 MFH«np!ified sequences wSh teudne^ 6.56 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



451589 
444207 
418459 
427667 
406745 
446173 
436556 
423825 
443441 
428403 
431971 
450219 



AA424791 H55734 
A)565004 

R85436 Hs^14 

AK001279 Hs.180171 

AW511970 Hs.279860 

BE56584g Hs.14l58 

BE545586 Hs.278712 

NM_004402 Hs.133089 

AW2911S6 Hs.92195 



At393048 
BE274907 



442618 

422773 

413863 

418905 

40S086 

448520 

407284 

447296 

443963 

426495 

422303 

440119 

451658 

435918 

439979 

412220 

436716 

413703 

413326 

441970 

430835 

414890 

418113 

414768 

422340 

407198 

432S86 

432729 

420012 

432879 

429732 



Ai610447 
R55222 



BE247585 
BE539674 

AB002367 
AI539227 
AW243814 
AAB78183 
D31765 
AW410362 
AA865455 
AW195351 
AF263538 
AWB00291 
BE350058 
A1433540 
BE1 58360 
H88621 
AW959918 
A)240008 
BE281095 
AI272141 
AW376g89 
AW296219 
K91679 
AA568548 
AK000292 
AW9579B5 
AW815932 
U201S8 



Hs.326159 

Hs.77385 

H3.224624 

Hs.48778 

H&26514 

HsJ015S2 

Hs.75462 



Hs.21355 

Hs.214039 

H3.18063 

Hs.17448 

H8.170114 

H8.27556 

Hs.125331 

Hs^50520 

Hs.86232 

Hs.6823 

Hs.36787 



Hs.19762 

H5J3737 

Hs.1923% 

HSJ7573 

Hs.83484 

Hs.259855 

Hs.115325 



416082 AA160000 
437296 AA350994 



Hs.130732 
Hs.99014 
Hs.173734 
H$^488 
Hs.137396 
Hs^81 
Hs.180655 
Hs.251946 
Hs.47094 
Hs.301819 
HS.8S226 
Hs.134460 
Hs.95612 
Hs.8906 

Hs.155376 
H3.184367 
Hs.279518 
Hs.60389 
Hs.93795 
Hs.342849 
H3.32112 
HS.S8589 
Ks.178391 
Hs.79172 
Hs^83722 
Hs.139262 
NM.005651 Hs.183671 
AW291487 Hs.213659 
AA12S985 Ks.56145 



AW411425 

AA250970 

A1277367 

AWS03116 

AA332941 

H73972 

Aie85516 

Z45822 

M93425 

T53088 

AB011110 

W16518 

AA010319 

AW900992 

AWg36259 

AI123478 

U94362 

AW973119 

AA157634 

BE293127 



427747 
445873 
410387 
413677 
418458 
443634 
409453 
443035 
432841 
410532 
428453 
410597 
458965 
419926 
426797 
412528 
410079 
427477 
416297 
435961 
424090 
428311 
451061 
409731 
440129 AA865818 
428773 
436372 
440719 
406685 
421305 
450988 
456659 
406806 
420151 
413441 
449317 
421568 



AW972301 

AA150869 

M18728 

6E397354 

BE618571 

AW749895 

AWD88535 

AA2S5931 

Ai929374 

AW293413 

W8S8S8 



H&193163 
Hs.310288 
Hs.26257 

Hs.324830 

Hs.429 

Hs.332520 

Hs.186704 
Hs.75367 
Hs.132908 
Hs.99604 



niBidn^omd expressed anSgen 5 (hyaluion 
cdihepsln D (lysosomal sspartyl protease 
ESTs 

Homo sapiens cONA (^10417 fis, done NT 
tumor prot^. translaQonaliy-oontroIle 
copine HI 

Homo sajte cONA FU1 1074 fis, done PL 
DNA fragmentation (actor. 40 kD. Ijeta po 
ESTs 

leudne rich repeat (in FUl) interacHn 
myosin, DgM pdtypepOde 6. eOcali, smo 
ESTs 

niban protein 
ESTs 

KIAA1039 protein 
BTG family, member 2 
adinln.aipha4 

NKL006662':Homo sapiens SnG-relaled CSP 
doubiecorfin and CaM Mnase^ 1 

hypolhetted protein fTJ23555 

Homo sapiens cDNA FLJ10768 fis, done KT 

Homo sapiens cDNA FU13818 fs. done PL 

KIAA0061 protein 

hypothetical protein aJ22405 

ESTs, Moderately simiiar to unknown (He 

ESTs, Moderately simlar b 138022 hjpA 

growth differentiation factor 3 

hypolh8fica)pn)l^FU10430 

chromodomain helicase DNA binding protei 

gb:ti69g05.x1 Na^CGAP_Kld11 Homosapien 

gb:PM1-HT0383-131299^1-h08 HT0383 Homo 

ESTs. Weakly similar to K1AA1140 protein 

ESTs 

ESTs 

uridine phosphorylase 

SRY (sex determining region YVbox 4 

elongation fador-2 khiase 

RAB7. member f^S oncogene famHy-Iike 1 

gb:yv04a07.s1 Soares fetal liver spleen 

ESTs 

hypolhellcal protein FU20285 

Homo sapiens, done IMAGE:3632168. mRNA 

ESTs. Weakly similar to ALULHUMAN ALU S 

lymftfiocyle cytosoik: protein 2 (SH2 doma 

ESTs, WbaUy similar to JCS238 galactosy 

K1AA1700 

serine/threorine kinase 12 
poly(A}-binding protein, cytoplasniic 14 
ESTs 

zinc finger protein 146 

Ipase A. lysosomal add, cholesterol es 

ESTb 

ESTs 

Homo sapiens dona 24889 mRNA sequence 

protein tyrosine phosphatase, non>fecept 

hemoglobin, beta 

GTPase acGvaling protein4iks 

amyk>ld beta (A4) precuisor-lika protein 

ESTs 

DKFZPS86D2223 protein 
AOP-ffbosyiation factor-like 5 
ESTs 

glycogenin 2 
ribosomd protdn L44 
solute earner family 25 (nritochondria! 
GTT1 protein 
XIAP associated factor-1 
tryptophan 2.3^oxygenase 
ESTs, Weakly similar to KiAAl 357 protein 
thymosin, beta. Uenlified In neuroblast 
ESTs. Weakly similar to S71886 Ste204'k 
|}ridgbtg integrator 1 
ESTs 

ATP-dependant interferon response protel 



dipiheria toxin resistance protein reqd 
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AA234592 AA890223 AA766824 B1259822 AA333631 AW968840 BE940639 TB3865 BE762742 AW697470 W05809 N41323T87376 R58544 
H88711 Bt087136 

409208 10117^ AK074047 AI144342AW014280BM14512BN28267AW206231 AA989041 H93197AW594063BG236296AW23e606AW081O31 AA765843 
BM144372 AA989341 AI824838 AI963970 A1637671 AW196330 BG427526 BM148789 BF893844 BF881946 
BE177661 H06215BE144709BE144829 
6E005398 AA628622 AA994155 

AK001838AU135179AU134241AV651702AV650032AV651304AV650101AV651263AV651888AV651866AA^ 
AV651172 AW856145 AU1 17599 AU135388 BG254665 AA166919 86483981 AVyf809606 BG494194 AA62281 1 A1876158 AA687804 AJ701729 
AU133725 AW961387 AU144387 AU151757 AA551031 BE675412 N34769 AA713483 A1890079 A1588918 AI361889 AI209020 AA668981 AI240990 
AA741 144 AA490899 Ai200221 AW589574 W96201 BG154182 AV655159 AA328145 N36348 AI081357 N76715 AA693346 AA742488 AI269719 
AA897483 AI886459AU155B73H0425SAW243986AA557749A12e6227 R68691 R33453AW388097 
AA947566 

AA662701 AA533929 AA737415 

BM475665 8E644917 AW770789 AW952971 N64863 BM263259AI224545 All 84856 N69114 AW518902 AM40169 AA809472 AV654440 
AA281642 AUl 85230 AW337382 A1872923 AI537113 N73882 183378 H63731 BF671764 AWB97824 AI81 1204 AA344646 BE0091 12 B6899564 
H91240R60548 N41701 
BE072B81 AI762181 BE072948 

AU)47596 AA393792 AI670731 AI037957 AW874364 AI038137 NG2286 AI241379 BE501096 AW090e98 AI927369 AI669226 AI369437 At371075 
AW612409 AI688711 AI183289 AA477717 A1076122 AA635190 AA700984 AA781508 081020 BF575223 AI356183 D79312 AI3755Se H61111 
BG283489 BE090666 BE090664 BE090662 H26545 
422960 11862 2 BF090249 AW954947 AW890487 AI305236 D60845 D60537 AA825429 W36294 AWB90410 AW088235 BF740240 AA448709AI350279 AA879119 

AA319510 8E702077 BE699015 BE702046 AW901293 T99319 081708 6F475488 D60383 081751 BE699260 
410730 114639 1 BG427950AA826016AI903441 BG939868AW979154AA640872BG547134AA457091 AW368860VV88852R^^ 
422689 B74209ll AW954733AA31S006AW856665 

614 



410143 MH1244_8 



450377 12109.1 



425998 138046J 



413497 
434280 
450139 



430068 
434442 



451129 
414405 



1518002.1 
1474904J 
34017J 



1177709..2 

111738.1 

22448.4 



1495511.1 
112669.1 



wo 03/042661 



PCTAJS02/36810 





436980 


917280.1 




442495 


92871 6 J 




425264 


3834.5 


c 








441623 


3362.1 










445493 


423456.1 


15 


432686 


144.7 


430915 


197844.1 




425913 


4123.2 


20 


438962 


195763.1 


406670 


0_0 




406819 


0.0 




447197 


2175805J 




418338 


58817.1 


25 


424677 


2518.37 




419970 


13569.1 




433891 


647290J 


30 


415995 


2117_1 







35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



451553 MH115364.1 



436812 
418259 



433162 
434372 



659779.1 
133853.1 



2167905.1 
858779.1 
207.22 



430283 1418.1 



AW502384AA828822A1982587 
A1184717 AW518883 AP121 173 

BM47612D BE672181 AI697553 AA938187 A1280879 AW340123 A1912727 AI081775 AI089556 All 91349 AIB71604 AI631607 AI890800 AI701917 

AVmi624AAK3041 AI991575AI160B22AA771763 88)89784 BE089788A1222942AW41B516AA329211A109573^ 
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AW371849BE153241 BC017410AI337912At090244 AWD90300BE219837A1623661 BES015766E501734AI742232AI023964AI458424 

AA975373 AI288904 A1984583 AA890325 N32562 AI358102 AW241694 A1038448 AI672071 A1018389 AA575391 AA977874 AW189392 W37448 

AA612894 AI2n548 H89551 A1699774 HB9365 AA315805 AW579188 BC014584 BC014581 AW780125 AI672414 BE32ai45 AW600919 BF031306 

AW172758 BE708322 AA345675 BE875779 H28241 H25318 BFM0913 B6179688 BF1 10202 AA528775 W37573 BE041B44 AW366504 BM129522 

BM129822 Ai122760 BE718200 AW887496 AA149420 BE706307 BE539385 8E748765 AI373653 R7S904 BF979185 BFG91393 BG495S95 

BI094458 BE706702 6G496S59 6F246373 BG494800 
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AI207343 BF813684 6F92B775 AA828585 

AA075144 

AA908472 

R36075 R36167 AI366546 
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AI65B779 A1675997 AW665991 AI459263 AH20121 Z38874 AAS7011S AW301008 AA21 6257 BF082662 Bi772789 H05989 BF0B5523 BI001277 

U09414 NiyL003436 AA503545AI022449AA043458AA766074AA765442AA805052At028211 AW609708 

AKD55270BE348291 Al 1 90289 AW61 2022 AI269508Al2e6578AK69676AW271406BEB79851 BF574163BM97126AW903775 81917368 
AW150900BF244813H79201 

AW182329AA613792T05304AWB5B385 ^^^^^ 
M95678 NiyL004573 AL530754 AI439623 AW374413 BFB98680 AW6309S9 BF875526 AW402206 BF818e90 BF893068 AW504110 AW408O49 
BG002913 AL530753 AI524084 AW769231 AA464970 AA293723 A!095051 AI953375 AI982938 BM146050 AW575804 AA962489 AI655426 
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AA248197 AW381373 AW177325 AW806879 AA935217 8E067498 BE083742 BE067470 AWB94935 BE082529 A)24881 1 BE179917 BE002200 • 
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AWB02049 BG675859 AV658871 BG678060 AI565004 AWB19026 BE843092 AV686437 AV723049 BG616948 AI91 1647 At743490 AI091096 
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AI625428 AA5a07g2 AA582038 AA581668 AA658065 AAB2ai56 AA657160 AA936103 AI14933S AA936925 AA581684 AA954198 AW238461 
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AW971213 AA493925 AA493567 AAS76839 AA934462 
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TABLES2C 

Ploey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digtt numbers in this column are Genbanltldentllter (GO nuinbers. 1)unham,eteL'refeis to Uiepubticaiion entitled niieDNA sequel 

human chromosome 22r Dunham, el at. (1999) Nature 402:489^9^ 
Strand: ImScatesDNAstraidlromwMchexDns Mere predicted. 
Ntjiosiflon: Indicates nudeoddeposiBons of predicled axons. 
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TABLE 53A: 

Pkey: Unique Eos probeset Identltier number 

ExAccn: Exemplar Accession nunter, Genbank accession number 

UnlgenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of testicular cancer (non-seminomatous and semmomalDus) compared to normal adult fisaies 



Pkey 


ExAccn 


UntgenelD 


Unigene Tifle 


R1 


432666 


AW204069 




ESTs. Weakly slmSar to unnamed protein 


78.20 


452838 


U65011 


Hs.30743 


preferenBaliy expressed enOgen in moia 


75.45 


418896 


AWg59433 


Hs.326290 


hypothetical protoln FIJ125B1 


58.80 


432730 


AI066520 


Hs.1 31358 


ESTs 


51.80 


450581 


AF081513 


K8.25195 


TGRtela4 


4a40 



616 



wo 03/042661 



PCTAJS02/36810 



426534 U56096 HsiOSI testis spedOc protein, Y-«nked 44.05 

423458 A1204212 ESTs 36.60 

428S64 AK001666 Hs.189095 sbrdar b (ssi (OrosophilaHike 34.65 

_ 420367 AA259090 H3.257028 ESTs 32.60 

5 451106 BE382701 Hs.25960 tOflYC oncogene 30.10 

437052 AA861697 Ks.120591 ESTs 29.35 

417407 AA923276 Hs.290905 ESTs. Weakly similar lo protease IH.sapi 29.05 

420759 Tt1832 Hs.1 27797 Homo sapiens cONAaJ11381 lis. done HE 28.45 

420347 AL033539 H5.g7124 Human DMA sequence lramckineRPl-309HlS 28.25 

10 40ni0 AW022727 HsJ?3816 ESTs 26.86 

448981 At968719 Hs.195387 ESTs 26.40 

429468 AF155827 Hs.203963 hypotheteal protein FU10339 25.55 

420528 AF130728 Ks.g8568 <loul}l8sex and mab^ related transcripfo 25.10 

425769 U72513 Hs.l59486 Human RPll3-2pseud09ensmRNA. complete 23.70 

IS 4302S2 Ai638774 Hs.105328 testes devdopmenkelated NYI>SP20 21.95 

454077 AC006952 Hs.37062 tnstMite3(I^GeII} 21.73 

428227 AA321649 Hs.2248 small inducOde cytokine 5ut>faniilyB(Cy 20.15 

434649 AA738254 Hs.l65390 ESTs. Highly simitar taA40350lranscrip 19.65 

^ 424578 AK001973 Hs.150890 hypothetical protein 19.16 

20 427335 AA448542 Hs.251677 Ganligen7B 19.05 

438915 AA280174. Hs.2856B1 WilUams-eeuTen syndrome chromosome regf 18.95 

432938 T27013 Hs.3132 steroldogeflic acute regulatory protein 18.86 

449322 At&3861S Hs.198566 ESTs 18.30 

430891 C14187 K3.157208 arlstales&related homeobox protein ARX 18.00 

25 430576 AF084866 gb:Homo sapiens envelope protein RIC-3( 17.96 

440119 AA865455 Hs.125331 ESTs, Moderately sindlar to unltnovmias 17.41 

418756 AA2522M Hs.226949 ESTs 17.20 

410102 AW248508 Hs.279727 ESTs;homologueof PEM^IQionasav^nyi 16.20 

447534 AW953935 H3.288655 ESTs 16.04 

30 407122 H20276 Hs.31742 ESTs 15.95 

446979 AI654443 Hs.l97663 ESTs 15.90 

406547 Target Exon 15.70 

427711 IUI31659 Hs.180408 sotute carrier family 25 {mitochondrial 15.65 

456847 AI360456 Hs.37776 ESTs 15.50 

35 448776 BE302464 Hs.30057 MRS2 (S. cerevl5lae)-like, magnesium horn 15.00 

452291 AR)15592 Hs.2e853 CDC7(ceII division <^e 7, S.cerevlsl 14.95 

408908 BE296227 H5.250822 serine/threonine Idnase 15 14.65 

418007 M13S09 Hs.63169 matrix metdlloprotelnasa 1 (inteistiOal 14.20 

422828 AL133396 prion proteb) 2 (dubiel) 14.08 

40 433330 AW2070B4 Hs.132816 hypotheOcai protein MGCl 4801 14.05 

410561 6E540255 Hs.6994 Homo sapiens cONA: aJ22044fo. done H 14.05 

427667 AK001279 H8.180171 Homo sapiens cDNAFU10417 lis, done NT 13.90 

418134 AA397769 Hs.86617 ESTs 13.95 

454438 AA2240S3 Hs.l72405 ceU division cyde 27 13.70 

45 449032 AA045573 Hs.22900 nuclear fador(erythrold-deiTved 2)^ 13.40 

426427 M86699 H$.16S840 TTK protein Idnase 13.35 

4377B9 Ai581344 Hs.127812 ESTs, Weakly similar to T17330hypothef] 13.20 

419384 AA490866 Hs.39429 ESTs 13.10 

418477 AW022983 gb:df46hl2.y1 Morton Fetal Codilea Homo 12.85 

50 453922 AFQ53306 Hs.36708 budding uninhibited by benzlmldazoles 1 12.80 

447188 H6&423 H3.17631 hypothetks! protein OKFZp434E2135 ' 12.78 

430521 NM_016383 Hs.242183 HOM-TES-65 tumor antigen 12.72 

443068 AI188710 ESTs 12.65 

437099 N77793 H3.48659 ESTs. Highly similar to 814458 laminta a 12.60 

55 420401 AK001907 Hs.97464 hypothetical protein 12.50 

410381 BE391604 H5.62661 guanylate binding protein 1. tnteileron- 12.50 

431494 AA9gi355 Hs.298312 hypotheticai protein DKFZp434A1 31 5 12.45 

408937 U14622 gb:Human transketolase-fike protein gene 12.25 

^ 439451 AF086270 H5.278554 heterodiromalirv^e protein 1 12.10 

60 404996 Target Exon 11.86 

424905 NM„002497 Hs.153704 t^lMA (never in mitosis gene aHdatedk 11.85 

444619 BE538082 Hs.8172 ESTs, Moderately simitar to A46010)Un 11.60 

434551 BE387162 Ks.28085d ESTs. Highly simflar to A35661 ONAexds 11.55 

421241 X91817 Hs.l02866 transketdaso^ike 1 j 11.50 

65 414972 BE283782 Hs.77695 KIAA0008 gene produd 11.45 

426666 U02330 Hs.172816 neuregutini 11.37 

433159 AB035898 Hs.150587 Mnesin^ protein 2 11.35 

433800 Ai034381 H5.13S150 lungtype^oeDmembranMsoddedgly 11.24 

„^ 440207 At371978 Hs.128326 ESTe 11-12 

70 407276 A19S1116 H8.326/36 Homo sapbns breast cancer anBgenNy-BR 11.10 

450142 AW207469 H3.24485 dtondrdSnsuifab proteoglycan 6 (bama 11.05 

449576 AW014631 Hs.225068 ESTs 10.95 

414251 AU)42306 Hs.97689 VASA protein 10.95 

422956 BE545072 Hs.122579 ECT2protdn (Epithelial ceDtranstami lOJO 

75 436812 AW2g8067 gb:UI-KeW0-8iP'g^)9^s1 NCI.CGAP.Su 10.85 

427521 AW973352 ESTs 10.81 

408728 AL137379 Hs.47125 hypolheOcd protein FU 13912 10.80 

442832 AW206560 Hs.253569 ESTs 10.62 

436899 AA764852 ESTs 10.60 

80 428949 AA442153 Hs.104744 hypothetical protein DXFZp434J0617 10.55 

409731 AA125965 H8.56145 Oiymosln. beta. UenOSed in neuroUast 10.45 

435206 AI432354 Hs.160594 ESTs 10.15 

433975 AA9719S3 K5.1220S5 ESTs 10.10 
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448791 AI632278 Hs.195922 ESTs 10.0S 

422232 D43945 K5.1 13274 tranxripSon factor EC 10.00 

420047 A1478658 Hs.94631 brefddin A-lnhibited guanine nudeoQde ^71 

- 434334 AA9t2476 Hs.118750 Homo sapiens cDNAFUl 3221 fis. done NT GlSO 

5 423673 BE003054 Hs.1695 matrix metaOoprotelnasa 12 (macrophage 9.44 

438168 AA779975 HS.1288S9 ESTs 9.30 

418973 AA233056 Hs.191518 ESTs 9l25 

413627 BE162082 Hs.246973 intron of Bicauda) 0 homolog 1 9.2S 

422689 AW856685 gb:RC3-CT0297.29010(M)13<)03 CT0297 Homo 9.15 

10 436808 AA626980 H$.192371 down syndrome critical region protein DS 9.11 

434699 AA643687 Hs.14942$ Homo safaenscDNAFUl 1980 lis. done HE 9.08 

426518 Z43039 Hs.170ig8 KIAA0009 gens product 9.05 

440968 N36327 gb7x82b06j1 Soares melanocyte 2NbHM Ho 9.05 

440952 Ai291804 Hs.118101 ESTs 9.05 

15 427489 AM03084 H5.269347 ESTs. Weakly slmnar to 2109260ABcea 9.05 

442618 R56222 Hs^14 ESTs a96 

419423 D26488 Hs.90315 KIAA0007 protein &95 

428153 AW513143 Hs.98367 SRY (sex detennlnino region Y)to 17 (S &80 

439979 AW600291 Hs.6823 tiypolhetica! protein FU 10430 8.76 

20 444971 A1651116 Hs.148659 ESTs ^75 

436513 AJ278110 HS.1255G7 DEAIMxix protein &60 

427466 AA974433 fibroblast growth factor 4 (heparin secr &S9 

415857 AA886115 Hs.127797 Homo sapiens cDNAFU 11 381 Ss, done HE &S8 

428847 AI954833 H3,98881 ESTs ft67 

25 408465 AW198940 Hs.253277 ESTs aS4 

443523 AK001575 Hs.9536 hypotheDca) protein FU10713 6.53 

440527 AV&57117 Hs.184164 ESTs. I^todefstely simOar to S66657 alptia &S0 

439570 T79925 Hs.269165 ESTs. Weakly similar to ALUIJ^UMAN ALUS 8.50 

450480 X82125 Ks.25040 zinc finger protein 239 8.50 

30 425266 J00077 Hs.155421 alpha^eloprotein 8.50 

453884 AA35592S Hs.36232 KiAAOl 66 gene product 8.42 

413318 AU076607 Hs.75285 Inter-alpha (gtobuOn) InHbltor. H2 pd 8.35 

430835 A1240006 Hs.192326 ESTs 8.33 

416859 H43437 Hs.80305 hypothetical protein M6C1 4258 aSO 

35 423905 AW57g960 Hs.1351S0 lung type-l cell membrane-assodatedgly 8.26 

407340 AA810168 Ks.284269 vitiligo-assodaled protein VIT-1 a2S 

449260 AA741180 Hs.29B79 ESTs a25 

430255 AK00O7G3 Hs.323822 Homo sapiens mRNA for KiAAl 551 protein, aiB 

448844 AI581519 Hs.177164 FGENESH predicted novel cell surface pr ai7 

40 431840 AA534908 Hs.2860 POU domain, class 5, transcription fado ai4 

428479 Y00272 H3.334562 cefl division cyde 2. G1 to Sand G2 to ai4 

426083 AW962712 Hs.126712 ESTs.WeakiysimilartoAF1910201 E21G5 a03 

425572 AB011076 HS.1S8307 imrfifterentialedefhbryoRiccdltranscrl 7.98 

410420 AA224053 Hs.172405 Gendh4s]oncyde27 7.90 

45 453878 AW964440 Hs.19025 DC32 7.75 

430287 AW182459 Ks.125759 ESTs. Weakly simflar to t£U5_HUMAN LEUKE 7.65 

453913 AW004683 Hs.78934 mutS(Ecoll)homolog 2 (cdon cancer. 7.65 

421974 AA301270 gb:ESTt 41 92 Testis tumor Homo sapiens cO 7.65 

432840 AK001403 H5.279521 hypothetical protein FU20S30 7.65 

50 451950 AW292317 Hs.213307 ESTs 7.60 

412265 AA10132S Hs.86154 hypothetical proldn FU12457 7.59 

435514 AW592804 ESTs 7.55 

431041 AA490967 Hs,197955 K1AA0704 protein 7,55 

432415 T16971 Hs.289014 ESTs. WedWy similar to A43932mudn 2 p 7.51 

55 416830 BE513731 Hs.68959 hypothetical protein ^ftGC481 6 7.38 

409421 AA199883 Hs.67624 ESTs 7.35 

449433 AI672096 Hs.9012 ESTs. Wealdy similar to S266500NA^indi 7.35 

458570 AW971698 m.12627 TJ6 protein 7.30 

_^ 441287 AW293132 Hs.131373 ESTs 7.30 

60 434609 1176593 gbiylSOcllJI SoaresptacenteNb2HPHomo 7.25 

432239 X61334 Hs.2936 matrix metalopfoteinase 13 (oollagenase 7.25 

441425 AA933590 Hs.28937 hoflfieobox protein from AL590526 7.25 

446293 AI420213 Hs.149722 UlUi domain transcription factor UM-I (h 7.21 

^- 414136 AA812434 SMC2(slTuchira)mdintananc8 0fchromoso 7.20 

65 409089 NIhi.014781 Ks.50421 KIAA0203 gene produd 7.19 

422938 NM.001809 Hs.1594 centromere protein A (171(0) 7.18 

441421 AA356792 ' H3.334824 hypothelica) protein FU 14825 7.15 

452226 AA024a98 H3.157103 ESTs 7.15 

435918 AF283538 Hs.86232 growth dUterenlialion factor 3 7.14 

70 418861 (^001949 H5.1189 E2Ftransciiptton(ador3 7.10 

436380 Ai962796 Hs.156100 ESTs 7.10 

442950 Ai500417 Ks.46764 ESTs 7.00 

415684 059356 sorbitol dehydrogenase 7.00 

446336 1^3848 Hs.44976 ESTs 7.00 

75 453183 AWD88185 Hs.2238S6 ESTs 7.00 

444434 NIUI.004849 Hs.11171 APG5(8Utophagy5.S.cerevisteeHlte 6.95 

422665 AJ011812 Hs.1 19018 transcription factor NFIF 6.95 

437421 AA917062 ESTs 6.95 

^. 428916 Afmm Hs.194562 tetomeric repedl binding factor (NIMA^n &94 

SO 408045 AW138959 HsJ245123 ESTs a90 

448588 AI970276 Hs.156905 IQAA1676 a89 

433764 AW753676 Hs.39982 zinc finger protein fUNZF (NM..023929) a85 

439780 AL109888 gb:Homo sapiens mRNA hiDtength Insert a85 
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449911 AI262106 H3,12653 ESTs &85 

417791 AW985339 Hs.1 11471 ESTs 6.80 

424085 NM_002914 Ks.139226 rspOcaOon factor C (activator 1)2(40 ^75 

. 453160 AI263307 H2B histone family, memtw L 6.75 

5 453392 U23752 H5.32964 SRY (sex delBrmining region YHwx 11 6.75 

4254Z7 AI652662 Hs.1 57205 tygncfied chain atrino t r a ns f er ase 1. cytos 6l73 

447254 NM.004153 Hs.17908 origin recognition complex, subunil 1 ^ 6.70 

41B379 AA21B940 Hs.137516 lidgetin^iics 1 6.70 

407386 AF026942 Hs.1 7518 gb:Homo sapiens d933mRNA,pa&l8equ 6.70 

lU 414618 AI204600 Hs.96978 tiypothetical protein MGC10784 6.69 

417153 X57010 Hs.81343 caOa8ei|,tn»a. alpha 1 (primary osiB 6.66 

428743 A1080060 Hs.301549 Homo sapiens mRNA;cOKADKFZp564H172(fr 6.65 

442717 f«8362 Hs.180591 ESTs. Weakly sinflsr to T23976 hypotiieO 6.65 

433247 AB040948 Hs.142856 KIAA1515 protein 6.65 

IS 430647 AC003682 Hs.127988 ESTs, Weakly stmOar to Z211_HUMAN ZINC 6.65 

417686 AA214584 ESTs 6.64 

432169 Y00971 Hs.2910 phosphoribosylpyrephosphata synthetase 6.62 

412537 AL031778 nuclear transcitpOon factor Y, alpha 6.61 

_^ 426614 AA411925 Hs.301960 ESTs 6.57 

20 457465 AW301344 Ks.122908 DNA replication factor 6.52 

430253 AK001514 Hs.236844 hypofheOcd protein HJ10652 6.50 

440801 AA9063G6 ESTs 6.50 

453116 AI276680 Hs.146066 ESTs 6.50 

436909 AA907120 ESTs 6.50 

25 402199 TaigetExon 6.50 

419556 U2g615 Hs.91093 cMlinase 1 (chitotriosidase) 6.46 

421285 NM.000102 Hs.1363 cytodiroma P450. subfamily XVil (steroid 6.41 

438494 AA906678 Hs.130183 ESTs ^41 

416592 X99226 Hs.284153 Fanconl anemia. comptementaBon group A 6.40 

30 408758 NM.003686 Hs.47504 exonud&asel 6.40 

442671 AI005568 Hs.130673 EST 6.40 

432281 AK001239 H5.274263 hypothetical protein FU10377 6.38 

413833 Z1S005 Hs.75573 centromere protein E (31 2ltD) 6.35 

424081 NM.006413 Hs.139120 ribonucleaseP(30itD) 6.33 

35 441878 Ai801869 Hs.127982 ESTs 6.31 

429120 AK001673 Hs.196530 hypothetical prot^ FU 10811 6.31 

418221 Z45514 Hs.B3775 DtGeorge syndrome gene D 6.30 

410166 AKD01376 Hs.59346 hypothetical protein FU10514 6.30 

421650 AA781795 Hs.122587 ESTs 6.30 

40 453932 AW006303 Hs.329296 ESTs. Weakly similar to (defllne not ava 6.28 

408291 AB023191 Hs.44131 K1AA0974 protein 6.26 

438180 AA60816g Hs.272151 ESTs 6.25 

412026 AA383618 Hs.73073 tesQs-spedficankyrin motif containing 6.25 

427510 Z47542 Hs.179312 small nudear RNA activating complex, po a20 

45 423642 AW452650 Hs.157148 hypothetical protein MGC1 3204 a20 

416111 AA033813 Ks.79018 chromatin assemtily factor l.subunit A ( 6.18 

407300 AA102616 Hs.120769 gb2n43e07^1 StratageneHeUce!ls393 6.12 

426223 AW977812 Hs.130391 ESTs 6.10 

445038 AI635444 Hs.143917 dJ467N11.1 protein 6.10 

50 419197 N48921 Hs.27441 KIAA1615 protein 6.09 

453775 NM.002916 Hs.3S120 repIkrationMirC (activator 1)4 (37 6.05 

436902 AW247145 Hs.192729 ESTs &05 

429228 AI553633 ESTs 5.99 

457065 AI476318 H3.192480 ESTs 5.90 

55 426572 AB037783 Hs.170623 hypothetical prolelnFUl 1183 5.90 

449132 BE045641 Hs.197573 ESTs 5.90 

432359 AA076049 Hs.274415 Homo sapiens cDNAFU10229 Us, done HE 5.69 

423728 AW891294 Hs.132136 solute canter femOy 4. sodium bfcarbon 5.65 

457289 AW573204 Hs.137078 ESTs 5.65 

60 433849 BE4658B4 Hs.280728 ESTs 5,85 

412642 BE244598 Hs.e09 hepatocyte growth factor (hepapoletin A; 5.85 

438450 AI050866 Hs.65853 nodal, mouse, hcmolog 5.81 

428301 AW628665 Hs.98440 ESTs. Weakly similar to 138022 hypothefl &80 

408750 BE294069 HS.935B1 hypotheficai protein FU1051 2 &77 

05 415947 U04045 H5.78934 mutS coC) homOlog 2 (colon cancer, 5.77 

408460 AA054726 H5.285574 ESTs 5.75 

442461 AW082564 HS.2B5833 Homo sapiens cONA: FU221 35 Its. dtuieH 5.75 

416747 AWB75523 Hs.15929 hypothetical proteb FU12910 5.75 

_^ 428249 AA130914 Hs.1 83291 zinc finger protein 268 5.74 

70 419635 NM.005033 Hs.91728 polymyosilisfedarodermaautoantigen 1 ( 5.72 

402145 Target Exon 5.71 

447178 AW594641 Hs,l92417 ESTs 5.70 

458814 AI498957 Hs.170861 ESTs. Weakly similar to Z1 95 JiUMAN ZINC 5.65 

442980 AAB57025 Hs.8878 kinesi^fikel 5.55 

75 419131 AA406293 Hs.109526 ESTs 5.60 

450254 NI\L004865 H3.99231 neuropeptkle 6 protein^»upled recepton 5.60 

441627 AA947552 Hs.58086 branched chain aminotransferase 1, cytos 5.60 

440304 6E159984 K5.125395 ESTs 5.60 

440553 AA889416 Hs.344043 Homo sapiens CDNAFU14459 lis. done HE 5.58 
80 442333 AI650877 Hs.129302 ESTs 5.58 
453941 U39817 Hs.36820 Bloom syndrome 5.57 
415799 AA653718 Hs.225841 DKFZP43401 93 protein 5.57 
413623 AA825721 Hs.246973 bitronofOcaudaiDhoffldQgl 5.55 
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427147 


AA398587 


Hs.97414 


ESTs 


5.55 




451050 


AW937420 




EST8 


5.55 




450113 


Ai683098 


Hs.200886 


ESTs. Moderslely slmnartoALU7_HUMAN A 


5.54 


5 


416G78 


NM.C01327 


H&187379 




S.54 


437812 


AI582291 


Hs.16846 


ESTs. WeaUy simOar to 04HUm defarisoqu 


&S3 




431354 


BE046956 


Hs.251673 


DMA (cytoskie-S-Hnethyttrensferase 3 be 


5.51 




449592 


AI655494 


Hs.195718 


ESTs 


5.50 




445517 


AF208855 


Hs.12630 


hypothoQcs) protdn 


5.50 


10 


416658 


U03272 


Hs.79432 


fibrillin 2 (oongenBal oonlractural sra 


5.48 


430044 


AA464510 


Hs.152812 


ESTs 


5.47 




437036 


AI571514 


Hs.133022 


ESTs 


5.47 




423006 


U29700 


Hs.123014 


anl>-MuQer1an hoonone receptor, type D 


5.48 




409103 


AF251237 


Hs.112208 


XA6E-1 protein 


&45 


15 


420900 


ALJ045633 


Hs.44269 


ESTs 


&45 


437257 


Ai263085 


Hs.290931 


ESTs, WeaMy simDar to YFJ7.YEAST HYPOT 


5.45 




440738 


AI004650 


Hs.225674 


WD repeat domain 9 


5.45 




412723 


AA64845g 


H3.33S951 


hypothetical protdnAF301222 


5.45 




441122 


H56777 


Hs.121084 


eppuv3 


5.42 


20 


414151 


AW97&468 


Hs.257245 


ESTs 


5.40 


435663 


AI023707 


Hs.1 34273 


ESTs 


5.40 




448986 


H42169 


Hs.347310 


tiypothetical protein FU14827 


5.39 




433701 


AW445023 


Hs.15155 


ESTs 


5.39 




443486 


NM_003428 


Hs.9450 


zinc linger protein 84 (KPF2) 


5.35 


25 


440842 


AA907288 


Hs.1 30173 


ESTs 


5.35 


432407 
401837 


AA221036 


Hs.13273 


gb:zi03ri2Jl Stratagene NT2 neuronal pr 
NM.025109:Homo 8^1^ hypothetical prot 


5.34 
5.32 




423739 


AA398155 


Hs.97600 


ESTs 


5.31 




424315 


AW514850 


Hs,193384 


putatstive 28 UDa protein 


5,31 


30 


453900 


AW003582 


Hs.226414 


ESTs, Wealdy8lmilartoAUJ8.HUMAN ALUS 


5.30 


415717 


AA167270 


HS.13043S 


ESTs 


5.30 




428329 


AA426091 


HS.984S3 


ESTs. Moderately similar to R27328 2 [K 


5.26 




427119 


AW880562 


Hs.272525 


ESTs 


5.25 




432117 


AL036195 


Hs.2909 


protamine 1 


5.24 


35 


446837 


AW273055 


Hs.156598 


ESTs 


5.23 


442007 


AA301116 


Hs.142838 


nucleolar phosphoprolein Nopp34 


5.21 




422797 


AB033064 


Hs.236463 


KIAA1238 protein 


5.19 




446258 


AI283476 


Hs.263478 


ESTs 


5.18 




445577 


N40696 


Hs.1 37064 


cytopiasmic poiyadenytaiion element bind 


ai7 


40 


445413 


AA151342 


Hs.12677 


CGI-147 protein 


5.17 


449670 


F07693 


Hs.85603 


Homo sapiens mRNA: cOI^ OKFZp434K2172 (f 


5.16 




436211 


AKD01581 


HS.33462B 


hypothetical proteb FU10719; KIAA1794 


5.15 




429629 


BE501732 


Hs.30622 


Homo sapiens cDNA FU13010 fis, ctone NT 


5.15 




424235 


NVL003181 


Hs.143507 


T brachyury (mouse) homotog 


5.15 


45 


448038 


AW015073 


Hs.232026 


ESTs. Weakly similar to R(^J1UMAN 52 KD 


5.15 


430272 


)a)4898 


Hs.237658 


8po8popro(elnA-ll 


5.14 




422094 


AF129535 


Hs.272027 


F-box only protein 5 


5.13 




420424 


AB033036 


H8.97594 


K1AA1210 protein 


5.13 




447924 


AI817226 


Hs.313413 


ESTs. Weakly similar to T231 10 hypothed 


5.10 


50 


422631 


BE218919 


Hs.118793 


hypotheticd protein FU10668 


5.10 


453446 


AL036710 


HS.2095Z7 


ESTs 


5.10 




438378 


AW970529 


Hs.86434 


hypoihetcal protdn FU21816 


5.06 




418235 


BE072634 




gb:PM4«T0548-1712994)01*h08 BT0548 Homo 


5.05 




427961 


AW2g3165 


Hs.143134 


ESTs 


5.05 


55 


441553 


AA281219 


H5.121296 


ESTs 


5.05 


429999 


Ai761902 


Hs.99597 


ESTs 


5.04 




426496 


031765 


Hs.170114 


KIAAOOSI protein 


5.02 




410929 


H47233 


H5.30843 


ESTs 


5.01 




448757 


Ai366764 


H5.48820 


TATA box bteding protein (T6P}4ssodate 


5.01 


60 


457107 


AA418246 


Hs.1657g6 


ESTs. VfeaUy stmlter to Z184_KUIMAN ZINC 


5.00 


408332 


H91230 


Hs.234794 


Homo sapiens mRNA; CDNA OKFZpS64B083 (ir 


5.00 




440138 


AB033023 


Hs.318127 


hypothetical protein FU1D201 


5.00 




407568 


AA740964 


Hs.62699 


ESTs 


5.00 




409798 


AA248567 


Hs.30237 


ESTs, simiter to ALUB.HUMAN lill 


5.00 


65 


431215 


AA496078 


Hs.121554 


Human DNA sequence from ctone RP1 1-218C1 


5.00 


416350 


AF188625 


H5.189507 


phosphbUpase A2. group tlD 


4.99 




452197 


AW023595 


Hs.232048 


ESTs 


4.98 




420333 


AJ001383 


Hs.97084 


lymphocyte antigen 94 (mouse) homdog (a 


4.97 




403760 


AW962081 




C4001759:^13^SO|sp|P18474|R(^:UlUMAN 


4.97 


70 


418378 




gb:EST374154 MAGE lesequenoes. MA6G Homo 


4.95 


418894 


W73921 


Hs.50743 


ESTs 


4.95 




426623 


AA382826 


Hs.132793 


ESTs 


4.95 




443537 


D13305 


Hs.203 


cholecystoklnin B receptor 


4.94 




414812 


X72755 


Hs.77367 


monoidnQ induced by gamma interferon 


4.94 


75 


453716 


AA037675 


Hs.152675 


ESTs 


4.90 


402299 






Target Exon 


4.90 




411945 


AL033527 


H5.92137 


L-niyc-2 protein {MYa2) 


4.89 




414034 


U89277 


Ks.305985 


eariy devetopmenl regulator 1 (homdog o 


4.87 




409066 


AA062980 


Hs.66960 


ESTs 


4.85 


80 


437496 


AA452378 


Hs.146668 


Homo sapiens mRNA; cONA DKFZp547J125 (fr 


4.85 


416661 


AA634543 


Hs.79440 


IGF-amRNA^ngpfotdn3 


4.85 




450375 


AA009647 




a disintsgiln and mdaUoprotelnaso doma 


4.85 




416201 


AA467752 


Hs.195161 


ESTs 


4.85 




420348 


AL13738S 


Hs.97140 


Homo saptens mRNA: cONA DKFZt)434M 1 1 28 (f 


4.84 
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423138 


M81933 


Hs.1634 




424687 


J(^70 


Hs.151738 




418971 


AA3603g2 


Hsi7113 


*• 


411571 


AA122393 


Hs.70ai1 


5 


409517 


X90780 






424322 


AL157491 


HS.14S211 




443169 


AI038687 


Hs.133338 




438624 


AA889055 


H$.123468 


10 


442562 


BE379584 




412530 


AA765268 


Hs.266273 




443715 


AI583187 


H3.9700 




423123 


N>A-012247 


Hs.124027 




451105 


AI761324 




15 


444431 


AW513324 


Hs.42280 


440591 
424281 


AA431599 
AA766243 


Hs.1 32799 




447175 


Ai365208 


Hs293806 




408101 


AW968504 


HS.123D73 


20 


430183 


BE010038 




416445 


AL043004 


Hs.79337 




429652 


AA766810 


H3^59290 




426054 


U12431 


Hs.166109 




418618 


U66097 


Hs^6724 


25 


445537 


/U245671 


HS.12B44 


427298 


M400495 






412663 


M121673 


Hs^9757 




446700 


AW206257 


Hs.156326 




418839 


U24577 


Hs.93304 


30 


446751 


AA766998 


Hs.79126 


432656 


NM.Q00246 


H5.3076 




434283 


AW235341 


Hs.58715 




437915 


At637993 


Hs.202312 




421830 


AA789269 


Hs.1 22509 


35 


440006 


AK000517 


K5.6844 


450719 


A1096837 


Hs.21349 




431721 


AB032996 


Hs.268044 




423175 


W27595 


Hs.347310 




453529 


AA036729 


H5.335639 


40 


416209 


AA236776 


Hs.7g078 


444386 


BE085183 






428976 


AL037624 


Hs.194695 




449510 


AI653154 


Hs.328147 




414725 


AA769791 




45 


424153 


AA451737 


Hs.141436 


414466 


AA349211 


H5.76205 




448966 


AW372914 


Hs.86149 




458443 


AV647010 


Hs.27 




453289 


AI188161 


Hs.144627 


50 


433641 


AF060229 




440196 


N72847 


Hs.125221 




452338 


AW608920 


H5^159 




428855 


AS435901 


Ks.89563 




416734 


H81213 


Hs.14825 


55 


442240 


AI79ie83 


Hs.292719 


421917 


AB028943 


Hs.109445 




420949 


AA934063 


Hs.13836 




449676 


AW380579 


Hs.209657 




433163 


AF231338 


Hs.222024 


60 


439314 


AA382413 


Hs,178144 


425312 


AA354940 


Hs.145958 




427584 


BE410293 


Hs.179718 




430444 


AW296421 


Ks.121035 




416773 


AK000340 


Hs.79826 


65 


421010 


AW974553 


Ks.267124 


418216 


AA662240 


Hs.283099 




450351 


BE547267 


Hs.59791 




454073 


AW206286 


Hs.1 18727 




417006 


AWS73806 


Hs^75B 


70 


417576 


AA339449 


HSJ228S 


448877 


AI5836g6 


Hs.253313 




421379 


Y15221 


K5.103982 




411630 


U42349 


Hs.71119 




430292 


AK000634 


H&238270 


75 


427778 


AA412323 


Hs.105323 


418768 
409268 


T39310 
AA625304 






442010 


AI032680 


Hs.132213 




452807 


AA028933 


Hs.162434 


80 


401435 






447519 


U48258 


83.339665 




421307 


BE539976 


Hs,103305 




424590 


AW966399 


Hs.46821 




453909 


AW004045 


H5.203365 



cead]vi5iancycls2SA 4.82 

matrix metdDoproteinasa 9 (gelallnase B 4.81 

ESTs 4.80 

Iqrpothelical protein FU20516 4.80 

boponbi I, CfffdtfW? 4.80 

Homo8ap(ensrnRN^-cO^IADKFZp434K1l11(f 4.60 

ESTs 4.60 

ESTs ^ 4.79 

dolichyl-diphosphooGgosaocharfdoiirold 4.76 

hypoO»Iical protein FU13346 4.76 

cydhEl 4,76 

SELENOPHOSPHATE SYNTHETASE ; Human selen 4.75 

gb.-wi6Qb1lj(1 Na.CGAP.Co16 Homo sapiens 4.71 

Homo sapiens, done MGC:9010. mRNA oomp 4.71 

hypothetical pmte!nFU23451 4.71 

gb:oa13b11.8l NQ.CGAP.GCB1 Homosapiens 4.70 

ESTs 4.70 

C0C2-retated protein kinase 7 4.69 

gb:PM3-BN0176-10O40(M)01^O4 BN0176 Homo 4.68 

K1AA01 35 protein 4.66 

ESTb 4.65 

ElAV (emtxyonic leiha), abnomial vision, 4.65 

GTP cydotiydrolase 1 (dop^responsiva dy 4.64 

EGF-Gke^tomain.muUipte6 4.62 

ESTs 4.62 

zinc finger protein 281 4.62 

Human DNA sequence from done RP1 1-1451.2 4.61 

phospfioTipase A2, group VI] (platelets 4.60 

Human DNA sequence from done RP11-16L21 4.60 

MHC dass II transactivator 4.60 

ttiiamlnepyrophosphoMnase 4.60 

Homo sapiens done Nil NTersZDI teratoca 4.60 

ESTs. Weakly sln^ar to dJ1018D12.3 [H.S 4.59 

NALP2 protein; PYRIN-Containing APAF1-B 4.58 

ESTs. Weakly simitar to RB8B.HUMAN RAS-R 4.58 

KIAAIITOprolein 4.55 

hypothetical protein FU14627 4.55 

ESTs 4.55 

MAD2 (mitotic arrest deficient, yeast, h 4.55 

gb: RC1.BT0 314-020200-012-c04 BT0314 Homo 4.55 

IBS homdog geno famliy, msmber i 4.55 

ESTs 4.55 

ring linger protein 21, interteron-respo 4.54 

MAGE>like2 4.53 

cytochrome P450, subfamily XIA (choteste 4.52 

phosphdnosttd 3^hosphatfrtindIng prot 4.50 

glycine dehydrogenase (decarboxytating; 4.48 

ESTs 4.48 

gb:Human endogenous retrovifus K done 1 4.45 

ESTs 4.45 

zinc finger protein 75 (D8C6) 4.45 

nudear cap binding protein subunlt 1, 8 4.45 

ESTs,Weakiysimi]3rtDKIAA15D3pn}tein 4.45 

ESTs 4.45 

KIAA1020 protein 4.45 

£STs,Weddy8lmilarlDl38022hypotheB 4.44 

ESTs 4.43 

transcription fedorBMAL2 4.40 

ESTs 4.40 

ESTs 4.39 

v-myb avian mydoblaslosis viral oncogen 4.39 

ESTs 4.35 

hypothetfcd protein FU20333 4.35 

EST8.W^8im{lartoAliJ6_HUMANALUS 4.34 

AF1Sq14prot^ 4.32 

hypotheticatprotelnMGC13183 4.32 

ESTs 4.30 

aspartyMRNA synthetase 4.30 

phosphoribosyt^ydnamlde fbrmyllransfer 4.30 

ESTs 4.28 

small indudble cytokine subfamily B (Cy 4.27 

Putathre prostate cancer tumor suppresso 4.25 

hypothet)cdpfQie)nFU20627 4.25 

ESTs 4.25 

gb.7a04809j2 Stratagene bng (937210) H 4.25 

ESTs 4.25 

ESTs 4.24 

ESTs 4.23 

C1400039r:gil7499896tpir1fT33295hypoth 4.23 

ESTs 4.21 

Homo sapiens mRN^ cONA OKFZp434B0425 a 4.21 

hypothetical protein FU20086 4.20 

ESTs 4.20 
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431126 AR)B5243 Hs.283819 zinc finger protein 238 4.20 

429828 H09604 Hs.13268 ESTs 4^ 

415969 AI267700 ESTs 4^ 

421373 AA808229 Hs.46677 ESTs 4.20 

J 433979 AAS20999 gb:ag03a08.s1 Soares.tesQs.NHT Homo sap 4.20 

408321 AW405882 Hs.44205 coitistalm 4.19 

410193 AJ132S92 Hs.59757 zinc finger pmlein 281 4.17 

430335 080007 Hs.239499 KiM0185prot^ 4.17 

408031 AA081395 Hs.42173 Homo sapiens cDlMFlJ10366 lis. done NT 4.16 

10 438885 A18B6558 Hs.184987 ESTs 4.15 

451578 NMJ018323 Hs.26663 cycDivC binding pmtsini 4.15 

432446 AA542845 K5.294088 GAJ protein 4.13 

445076 AI206866 H5.154131 ESTs 4.11 

420218 AW958037 ribosomal protein L4 4.10 

IS 453628 AW243307 Hs.83937 hypotheScal protein 4.10 

416459 R85436 H5.268814 ESTs 4.10 

418866 T65754 gb:yc11c07.s1 Stretagenebng (937210) H 4.08 

440404 AI015881 Hs.324527 mitochondnal ra)osomai protein S5 4.06 

426300 U15979 Hs.169226 deltas ttomolcg (DrosophOa) 4.06 

20 446223 BE300091 H5.119699 hypoUietica) protein FU12969 4.05 

429984 AL050102 Hs.227209 hypothefical protein FU2161 7 4.05 

449687 W68520 tntenonedlale filament protein syncoOin 4.05 

452109 AI525873 Hs.61164 liypotlietical protein FU 14909 4.06 

401484 IvstonedeacetylsseS 4.05 

25 444670 H58373 Hs.332938 liypothetical protein MGCS370 4.05 

415884 H22966 Hs.13471 ESTs 4.05 

442066 BE502147 Hs.128418 ESTs 4.04 

402098 ENSP0D000217725*:Lamininalpha-1 drain p 4.02 

404287 FGENESH predicted noveiCUWomaincorta 4.01 

30 422756 AA441787 Hs.1196e9 glycoprotein liormones, alptia polypeptide 4.01 

449704 AK000733 Hs.23900 GTPase activaflng protein 4.00 

445685 AW779829 gbihnOSaOSjd Na_CGAP.KId11 Homosapien 4.00 

444379 N99035 Hs.30352 ESTs 4.00 

435373 AW665538 Hs.117689 ESTs 4.00 

35 424557 AA343057 Hs.1&458e ESTs. Moderately similar to neuronal thr 4.00 

413646 BE155042 gb:PMO-HT0349-1012994)02^04 HT0349 Homo 4.00 

418648 AW979223 Hs.292478 ESTs 4.00 

446074 AA079799 Hs.343103 hypothetical protein FU1 1896 4.00 

447353 AI375701 Hs.25884 ESTs 4.00 

40 410100 AA081636 Ks.271916 ESTs. Weakly similar to S41 044 chromosom 4.00 

428856 AA436735 Hs.183171 hypottraScal protein FU22002 4.00 

445140 AI650599 Hs.197913 ESTs, WeaMy similar to SCP3 MOUSE SYKAP 4.00 

406367 NM.022357:Homo sapiens putative metallop 3.99 

437834 AA769294 gb:nz36g03.8l NCLCGAP.GCBI Homo sapiens 3.99 

45 453985 N44545 Hs.251865 ESTs 3.98 

408446 AW450669 H5.45068 hypothetical protein DKFZp434l1 43 3.97 

408562 AI436323 Hs.31141 roundabout (axon guidance receptor. Dros 3.97 

414713 BE465243 H$.12664 ESTs 3l96 

_^ 426067 AW664691 Hs.97053 ESTs ZM 

50 456497 AW967956 Hs.123548 ESTs, Weakly similar to AF1 08450 1 uhinu 196 

454679 AW813110 gb:CM4-ST0189^51099^21-f05ST0189Homo 3.95 

451865 H43737 Hs.33186 ESTs. Weakly similar to unknown protein 3.95 

403137 NM.005381*:Homo88ptensnudeoRn(Na). 3.95 

445730 AI624342 Hs.179082 ESTs 3.95 

55 451993 AA765776 Hs.122983 ESTs 3.95 

428819 AL135623 H5.193914 KIAA0575 gena product 3.92 

433683 Ai817723 Hs.22678 hypothetical protein FU21832 3.91 

420812 AA715303 Hs.107369 ESTs 3.90 

423806 AA331247 Hs.86617 ESTs 3.90 

60 437205 AL110232 Hs.279243 Homo sapiens mRNA; cONA DKFZp564O2071 P 190 

449211 AI922972 Hs.196073 ESTs 3.90 

409757 NM.001898 Hs.123114 cystallnSN 3.90 

438027 A1664053 Hs.39972 ESTs. Weakly sln^ar to 138588 reverse t 3.89 

432512 NM_003284 H3.3017 transitton protein 1 (during Hstone k) 3.89 

65 440840 AW629866 ESTs. Weakly simtar to S640S4hypotliea 3.88 

449099 A1629041 Hs.46908 ESTs 3.88 

408092 NM.007057 Hs.42650 ZWIObteractor 3.85 

423909 AJ223183 Hs.135194 Immunogtobullnsuperfamily, member 6 3.65 

437162 AW005SO$ Hs.5464 thyroid honnone receptor coacSvatingpr 3.64 

70 424381 AA28S249 Hs.146329 protein kinase Chk2(CHEK2) 3.B3 

433023 AWB64793 thrombospondtn 1 3.82 

452571 W31518 Hs.34665 ESTs 3.81 

421413 AI826128 Hs.55209 ESTs. W^ similar to A493&4 59 piotel 3.80 

440953 A)683036 KS.12413S Homo sapiens cDNA FU13051 fis, done IfT 3.80 

75 420697 AA827705 Ks.26605 ESTs 3.60 

407275 AI364186 gb3iw34h07j(1 Na.(XSAP.Ut4 Homo sapiens 3.80 

422789 AK001113 Hs.120842 hypothelteal prol^ FU10251 3.80 

411856 H67899 Hs.4190 Homo sapiens cONA-FU23269fia. done C 3.80 

449529 AI990559 Hs.232033 ESTs 3.80 

80 447444 AK000318 Hs.18616 hypothetical protein FU20311 3.7B 

444656 AI277924 Hs.145199 ESTs 3.77 

448674 W31178 Hs.154140 ovaiy-spedlksaddic protein 3.77 

415829 AW4S0198 Hs.163742 ESTs 3.76 
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4361B8 


AK001049 


Hs.48712 


hypothetical protein FU2073& 


3.75 




402178 






C19001998*:g!I6453813|reflNPJD08926.2) b 


175 




418179 


X51630 


Hs.1145 


VV2nis tuiiuf 1 


3.75 


5 


423545 


AP000692 


H5.129781 


dinxnosofne 21 open readins firame 5 


3.7S 


4290S3 


AW363845 


Hs.322903 


ESTs, WeaMy similar to A48010 X4ink6d 


3.75 




437440 


AA646804 




ESTs 


3.75 




427366 


AA885108 


Ks.223806 


TATA box binding protein (TBP)*8ssQdaIe 


3.74 




438455 


AA913381 


HS.20S94 


ESTs 


3.73 


10 


418821 


AA436002 


Hs.183161 


ESTs 


3.73 


417918 


AA209205 


Hs.163754 


hypolheDcal protein FLJ126(ffi 


3.73 




415912 


H08859 


HS.20&469 


ESTs, WeaMy Mar to AtU6_HUMAN ALU S 


3.71 




423020 


AA383092 


Hs.1608 


repDcatton protein A3 (14i(0} 


3.70 




409928 


AL137163 


Hs^7549 


hypolhelfea) protein dJ473B4 


3.70 


15 


414206 


AW276887 


Hs.46509 


ESTs 


3.70 


427761 


AA412205 


Hs.1 40996 


ESTs 


3.69 




428728 


NM.016625 


Hs,191381 


hypott)e6cal prot^ 


3.68 




452631 


A)188658 


Hs.87496 


ESTs 


3.68 




427719 


AI393122 


Hs.134726 


ESTs 


3.68 


20 


431869 


AA521136 


Hs.190176 


ESTs 


167 


429830 


AI537278 


H8.225841 


OKFZP434D193 protein 


3.67 




420297 


AI628272 


Hs.88323 


ESTs, Weakly slmRartoALUIJIUMAN ALUS 


3.66 




421972 


M1B165 


Hl1454 


gastric intdbitofy polypeptide 


3.66 




403433 






NM.(K)1622:Homo sai^ens sjpha-2*HS''9iycop 


3.65 


25 


456030 


AA136106 


Ks.184852 


KIAA1553 protein 


165 


402408 






NM_030920*:Koino sapiens hypothelica! pro 


165 




452387 


A1680772 


Hs^6094 


trIffiicMde repeal contalntno 12 


165 




416808 


R11499 


HSJ89716 


ESTs 


165 




417553 


L09190 




trtchohyaiin 


165 


30 


408065 


AW954272 




gb:EST366342 MAGE resequences. MAGC Homo 


165 


431077 


AI669133 


H$.1156G0 


hypothetical protein FU12810 


164 




452461 


N7B223 


Hs.108106 


uenscnpuon lacior 


160 




437660 


W31708 


Hs.55304 


ESTs 


160 




420552 


AK000492 


Ks.98806 


hypotheOca! protfib) 


160 


35 


419926 


AW90O992 


Hs.93796 


DKFZP586D2223 protein 


159 


420161 


AI683089 


Hs.1 20817 


ESTs 


159 




414747 


U30872 


Hs.77204 


centronsro protein F (3S(y400kD, mltosin 


3.59 




449571 


AW016812 


Hs^266 


ESTs 


156 




424727 


AW590378 


Hs.152519 


hypothetical protein FU20674 


155 


40 


441820 


AA969119 


HS.143S02 


ESTs, Weakly similar to envelope protein 


155 


423685 


BE350494 


H3,49753 


uveal autoantigen with coiled coil domai 


3.55 




427532 


AA442152 


Hs.104744 


hypothetical protein DKFZp434J0617 


155 




437700 


AA766060 


Hs.301209 


rnyetold/iymphoid or n^ed-iineaQe teukem 


155 




438176 


AW138970 


Hs.122113 


ESTs 


155 


45 


453062 


AW207538 


Hs.61603 


KtAA1677 


3.55 


447064 


AB002350 


H8.17262 


KIAA0352 gene product 


155 




430056 


X97548 


Hs.228059 


KRAB-assodated protein 1 


154 




416049 


AA211467 




Homo sapiens, Sininar to ntKtear localiz 


154 




434288 


AW189075 


Hs.1 16266 


f)brillln3 


154 


50 


439176 


AI446444 


Hs.190394 


ESTs, W^y simOar to 826096 One-I pr 


152 


421350 


AW301608 


Hs.278168 


ESTs, Moderately similar to 154374 gene 


3.62 




413943 


AW294416 


Ks.1 44687 


Homo sapiens cDNA FUI^I fis, done NT 


3.52 




412123 


BE251328 


K5.73291 


l^thetical protein FU10861 


3L51 




430968 


AW972B30 




{^:EST38492S MAGE lesequenoes. MAGL Homo 


ISO 


55 


449467 


AW205006 


Hs.197042 


ESTs 


150 


405935 






Target Exon 


150 




429782 


NIVL005754 


K3.220689 


Ras-GTPas&>activallng prot^ SH3-dQmain 


150 




411027 


AF072099 


Hs.67846 


leukocyte immunogtobulin^ receptor, 


150 




412140 


AA219691 


Hs.73625 


RAB6 interacting. Unesln^lke (rabkines 


149 


60 


429183 


AB014604 


H8.197955 


KIAA0704 protein 


149 


428876 


AA436884 


H5.48926 


ESTs 


149 




418203 


X54942 


Hs.83758 


CDC28 protein kkiase 2 


149 




435088 


H16262 


Hs.31415 


ESTs 


148 




442573 


H93368 


Hs.7567 


branched chain amfnotransferose 1, cytos 


148 


65 


451085 


AW295132 


Hs.222231 


ESTs, Weakly sImDar to granule ceD mar 


148 


419741 
406542 


NM_007019 


Hs.93002 


ubkiuilln carrier protein E2<C 
C19000728*:gli12585552i$p|Q9y2Q1|Z257.HU 


148 
147 




422406 


AF025441 


Hs.116205 


Opa-lnteracting protein 5 


146 




402099 


AKD00375 




ENSP00000217725*:Laniinh alpha-1 chain p 


145 


70 


418826 


Hs.88820 


HOCMC28P protein 


145 


424513 


BE385864 


Ks.1 49894 




145 




427617 


D42063 


Hs.199179 


RAN binding protein 2 


145 




428381 


Nf^015905 


Hs.183858 


transcripflond Intermediary factor 1 


145 




400268 






NM.003292:Kofno sapiens Iranstocated prom 


145 


75 


443596 


AW026048 


Hs.134124 


ESTs 


145 


442875 


BE623003 


Hs.23625 


Horra sapiens dons TCCCTA00142 mRNA sequ 


145 




416031 


T3029D 


H9.107515 


ESTs. Weakly similar to T00329 hypolhefl 


145 




435244 


N77221 


Hs.187824 


ESTs 


145 




423354 


AB011130 


H3.127436 


calcium channel, voltage-dependent, atph 


3.45 


80 


453785 


A)368235 


Hs.283732 


ESTs, Moderately sin^ to ALU1.HUMAN A 


145 


420686 


Ai950339 


Hs.40782 


ESTs 


144 




429467 


NM_004477 


Hs.203772 


FSHD region gene 1 


143 




448769 


N66037 


Hs^173 


ESTs 


143 




423453 


AW450737 


Hs.128791 


GGMfi protein 


141 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



417705 


AW134952 


H117S220 


hypotheSca) protein FU14541 


3.41 


410252 


AW821182 


Hs£1418 


nvcrolibri1lar<85SOci^6(S pfo^n 1 


3.41 


404068 






Target Exon 


a40 


401644 






T8i||etExon 


3.40 


422364 


AFD67800 


Hs.1 15515 


(Myiw (calcium dependent cartwhydrale- 


3.40 


452907 


6E256966 


Hs.31652 


ESTs, Moderately dmflar b 154374 gene 


3.40 


4202B1 


AI623693 


Hs.323494 


Predicted cafion efbot pump 


3^ 


452404 


AW450675 


Hs.212709 


ESTs 


139 


452256 


AK000933 


Hs.28661 


Homo sapiens cONA FL) 10071 fis. clone HE 


139 


420892 


AW975076 


Hs.172589 


nudear phosphoprotein similar to S. cer 


139 


440606 


AI828751 




ESTs. Weakly similar to (38022 hypotheli 


138 


425474 


Z48054 


Hs.158084 


peroxisome receptor 1 


137 


429714 


BE561801 


Hs.2484 


T-ceD leukemla/lymphoma 1A 


137 


446214 


AK001322 


Hs.14347 


tiypolheQca) protein FU 10460 


136 


434808 


AF155108 


Hs.256150 


Homo sapieris, SimSar to RtKEN cONA 2810 


136 


446789 


BE539108 


Ks.22051 


hypothetical protein MGC15548 


136 


421633 


AF121860 


Hs.106260 


sorting nexin 10 


136 


438192 


AI85g085 


H8.293807 


Homo sapiens AFG3L1 isofomi 1 mRMA, part 


136 


436511 


AA721252 


HS.291S02 


ESTs 


135 


402680 






Target Exon 


135 


414598 


AI094221 


Hs.135150 


hing type-l ceD menll)ranfr«ssociated gly 


135 


449477 


A)652602 


K5.197043 


ESTs 


135 


413686 


AI469213 


Hs.71404 


ESTs 


135 


401091 






decay accelerating (actor for complemant 


135 


418295 


AW970043 


Hs^8039 


hypothetical protein FU1 1090 


135 


433220 


AI076192 


Ks.131933 


ESTs 


134 


453200 


AA033832 


Hs.212433 


ESTs 


133 


427239 


BE270447 




ubiquttin carrier protein 


133 


418355 


L42563 


H5.1165 


ATPase. H? transporting, nongasliic, alp 


133 


421535 


AB002359 


Hs.105478 


phosphoritmytformylglydnamidine syntha 


131 


441243 


Ar7670S6 


Hs.193002 


ESTs 


130 


440716 


AW105245 


Hs.146509 


ESTs 


130 


400587 






C10000649*:gI|7296574|gb|AAF5l857.1|(AE 


130 


401148 






Target Exon 


130 


411752 


AW236047 


Hs.126497 


ESTs 


130 


433252 


AB040957 


H6.151343 


K1AA1524 protein 


130 


434008 


AA740878 


Hs.112982 


ESTs 


130 


444565 


BE613126 


Hs.47783 


B aggressive lymphoma gene 


130 


458067 


AA393603 


Hs.36752 


protein tdnase anchoring protein GKAP42 


130 


410340 


AW182833 


H5.112188 


hypothetical protein FU13149 


129 


452761 


BE244742 


Hs.30532 


CGI-77 protein 


129 


451418 


BE387790 


Hs.26369 


hypothetic^ protein FU20287 


129 


429323 


NM_001&49 


Hs.2391 


apical protein, Xenopus laevls-tiitt 


128 


432809 


AA565509 


Hs.131703 


ESTs 


127 


449426 


T92251 


Hs.198882 


ESTs 


127 


425174 


D87450 


Hs.154978 


KIAA0261 protein 


125 


435159 


AA668879 


Hs.1 16649 


ESTs 


125 


446597 


AK001334 


Hs.15470 


putative ring zinc finger protein NY-REN 


125 


411554 


W22895 


Ks.1 12360 


promIntn(mouseHilc8l 


125 


447555 


AI391662 


Ks.160963 


Homo sapiens, clone IMGC:12318, mtVIA, com 


125 


426931 


Nl^003416 


H5^76 


zinc finger pntain 7 (KOX 4, done HF.1 


125 


445093 


AI207197 




ESTs 


125 


410275 


U85658 


Hs^1796 


transcription factor AP-2 gamma (ac&vat 


124 


424568 


AF0Q5418 


Hs.150595 


cytochrome P450. subfamBy XXVIA, polype 


124 


453293 


AA382267 


Ks.10653 


ESTs 


124 


421654 


AW163267 


Hs.106469 


suppressor of varl (Sxerevisiae) VSSxb 


123 


430552 


AA176374 


H9^43886 


midear eutoantigenlc spemi protein (his 


122 


411975 


A)916058 


Hs.144583 


ESTs 


122 


448140 


AF146761 


Hs,20450 


BChMOte mentrane protein precursor 


122 


403432 


AJ278111 




NM.001622:Hamo sapiens alpha-2*HS^]lycop 


121 


436515 


Hs.195292 


puta&ve tumor antigen 


121 


456505 


AA504595 




ESTs 


121 


427668 


AA298760 


Hs.160191 


hypothetical protein FU14g04 


121 


452794 


AI192444 


HS.2S892 


ESTs. W^aidy similar to 137356 epitheOa 


120 


427314 


A6033024 


Hs.175475 


KIAA1 198 protein 


120 


424051 


AL1 10203 


Hs.138411 


Homo sapiens mRNA; cDNA DKFZp586J1922 (f 


120 


452028 


AK001B59 


Hs.27595 


hypothetical protein FLJ21142 


120 


421002 


AF116030 


Hs.100932 


transcription factor 17 


120 


422225 


BE245652 


Hs.1 18281 


zinc finger protein 266 


120 


437549 


AA759149 


H3.128757 


gb:ah70e0i8l SoaresJesfisJlHr Homo sap 


120 


418524 


AA300576 


Hs.85769 


addle 82 kOa protein mRNA 


120 


427642 


R40761 


Ks.9834 


ESTs 


120 


442765 


BE557353 


Hs.99480 


ESTs 


120 


410048 


W76467 


Hs.343874 


proHne addase homotog 


120 


412008 


N»yL0O1841 


H5.73037 


cannablnoid receptor 2 (macrophage) 


120 


423675 


AI990509 


Hs.131342 


small tndtx^ible cytokine subfarnily A (Cy 


120 


453895 


AAa39843 


Hs.61948 


Homo saptens, ctone MGC:16466. mRNA, com 


120 


424144 


AA454033 


Hs.41644 


AKAP-assodated sperm protdn 


119 


428612 


AA770001 




ESTs 


119 


422805 


AA438989 


Hs.121017 


H2A histone family, member A 


119 


444371 


BE540274 


KsJ239 


fbrfcheadbOKMI 


118 


427528 


AU077t43 


HS.179S65 


minlchromosofflo mdntenanoe defldenl (S. 


117 


451684 


AF216751 


Hs.26813 


C0A14 


117 



624 
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452031 


AA741314 


451230 


BE546208 


416000 


R82342 


444823 


BE262989 


44852B 


AU076640 


430388 


AA356923 


447801 


K88923 


424188 


AW954552 


436941 


AA860383 


400592 




437642 


AL079309 


450405 


AI694913 


414161 


AA136106 


440129 


AA865618 


438538 


AA832203 


441013 


A1125252 


450431 


AW135797 


423755 


AB037735 


434769 


AA648884 


449523 


NM.000579 


410704 


BE076754 


412673 


AL042957 


420507 


AF093408 


423419 


R55336 


426925 


AW242474 


426108 


AA622037 


439398 


AA284267 


449138 


AW294215 


441795 


Nseiis 


456053 


S57498 


403810 




421281 


AI299139 


429274 


AI379772 


438243 


AI581311 


424800 


AL035568 


429469 


M64590 


416443 


N69469 


421230 


AW958439 


427906 


AA8&4330 


414706 


AW340125 


441703 


AW390054 


445679 


A1343868 


434456 


AW452621 


433228 


F28212 


419335 


AW960146 


415227 


AW821113 


421184 


NM_003616 


438869 


AF075009 


422726 


011690 


429302 


AU076674 


416975 


NM.004131 


433914 


AF108138 


411365 


M76477 


435726 


BE535787 


426386 


AA748850 


420596 


NM.002692 


427953 


AM17g44 


433612 


AF078164 


421305 


BE397354 


446046 


BE281291 


434776 


AA648988 


414132 


AI801235 


430491 


AL109791 


433493 


AA594915 


452606 


N45202 


453416 


NM.003037 1 


TABLE 53B 


Pkey: 


Unique Eos pr 


CAT number Gene cluster n 


Accession: 


Genbankacce 


Pkey 


CAT Number 


432666 


144.7 


423458 


30480.1 


430676 


60836.2 


422828 


227063 1 


418477 


4172_1 


443088 


18695.17 


436812 


659779.1 



Hs.855 

Hs.26090 

H5.79856 

Hs.12045 

Hs.15243 

Hs.240770 

H5.270247 

Hs.142634 

Hs.292791 



Hs.279637 
HS.1848S2 

HS.2919S5 
H5.126419 
Ks.266041 
KS.132S60 
Hs.134278 
Hs.54443 

Hs.31845 
Hs.98397 
H3^9 



Hs.166468 
Hs.221504 
Hs.195631 
H5^1137 
Hs.76252 

H5.17517 
Hs.99206 

Hs.153203 
Hs.27 
H3.194225 
Hs.105633 
Hs,166520 
Hs.76989 
Hs.192843 
Hs.301059 
Hs.1 16832 
Hs.14953 
K!l284137 
Hs.72402 
Hs.102458 

Hs.1572 
Hs.198899 
Hs.1051 
Hs.l 12160 
Hs.289082 
Hs.1 13170 
Hs.125830 
Hs^9185 
Hs.44331 
Hs.61188 
KSJ24830 
Hs.170408 

Hs.48480 
Hs^41559 
Hs.155087 
Hs.90012 
H5^2970 



RAP1A, member of RAS oncogene family 

hypolheOcal protein RJ20272 

ESTs, Weakly similar to S65657 alpha-IC- 

putalive protein 

nucleolar protein 1 (120I(P) 

nuclear cap binding protein subunil 2. 2 

Homo sa^riens cONA OJI 1977 lis. done HE 

zinc linger protein 

ESTs 

Target Exon 

gbrHomo sapiens mRNA fOtl length Insert 
ESTs 

KIAA1553 protein 

ESTs. Vtfealdy sknaar to S71886 StB20-liI( 

ESTs 

ESTs 

ESTs 

hypoUietical protein RJ10312 

Homo sapiens cONA aJ12676 fs. done NT 

chemoUne (C-C motiO receptor 5 

gb:CM1-BT0601-180200.121-b10 BT0601 

ESTs 

A kinase. (PRKA) anchor protein 3 

ESTs 

ESTs 

programmed ceil death 5 

ESTs 

ESTs 

AO024 protein 

endothelin receptor type A 

C3001 199:gij7494834|p{rl|T15308 hypothet 

ESTs 

ESTs 

ESTs 

lUlyoDfantity inhibitor 

glycine dehydrogenase (decartxngflaiing; 

ESTs 

ESTs 

ESTs 

KIAA0097 gene product 
leucine zipper protein FKSG14 
hypothetical protdnFU12468 
ESTs 

K1AA1491 protein 
hypothetical protein FU12888 
ESTs 

survhral of motor neuron protein Inlerac 
gb:Homo sapiens full tenglh Insert cONA 
fadogenltal dysplasia (Aarskog^tt sy 
eukanrotic translation Initiatkm factor 
granzyme B (granzyme 2, cytotoxic T-tymp 
Homo sapiens DNA helicase homotog (PIF1) 



ESTs 

bladder cancer overexpressed protein 
polymerase directed), epsIkNi 2 
ESTs 

Homo saptens Ku70*lndlng protein 0<UB3) 
dlplheria toxin resistance protoki requi 
ESTs. Moderately simiIartoA47S82&«d 
gb:ns4ini^1 Na.CGAP.6CB1 Homo saptens 
ESTs 

Homo saptens mRNA fun tength Insert cON 
ESTs 

hypothetical protein FU23441 
s^naOng lymphocytic acOvalkm motecut 



117 
116 
116 
115 

lis 

115 
115 
115 
115 
115 
115 
115 
115 
114 
114 
114 
114 
113 
113 
112 
112 
111 
111 
111 
111 
110 
110 
110 
110 
110 
110 
110 
110 
110 
109 
108 
108 
107 
107 
106 
106 
106 
105 
105 
105 
105 
105 
105 
104 
104 
104 
104 
104 
103 
103 
102 
101 
101 
100 
100 
100 
100 
100 
100 
100 
100 



AA558585 AA5654g9 A1360576 AW204069 AA991648 AA864939 

BC018070 BG702493 AI204212 AA460929 AA993606 BF926535 AA226938 BG190705 BG186496 AW291865 BG183340 BG195301 BG214539 
BG21 5094 6G1 98867 BG196332 BG208220 8G212418 

BG433950BE061583 T05808 BE144813 AW812038 BE144812 AW812040 AWB12041 AU124350 BE061602 BE061604 BF922595 8E061603 
AI352469 BE061601 B1062752 AW818206BF887722 wtwwiwt or««w ocuoigiM 

BE671981 BE503379 AI655440AI337054AI28B920AI242370AI825182AA756081 BFB55141 BF091068 

^SE^/^SSS?! ^^^JB^^ ^^^^^5 B6212702 B6182057 AWS89883 BF000085 AAg33869 BG479023 BG220014 BG679466 BE907092 
AI623855 AA223956 AA223917 AW022983 AW090580 AW573219 BF514491 BF445397 AA864705 AI91(^4 
AV752763AI032142 N30308 N22181H95390AW675632 
AW978773AW298067 AA810101 AWig4180 AA731645AI690673 

625 
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427521 513212^1 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



440968 

427486 
421974 
43SS14 
434609 
414136 



415684 
437421 
439780 
453160 



417886 
412537 



440601 
436909 
429228 

451050 

418235 
418378 
450375 



4511D5 
424281 
430183 

427298 
444386 
414725 



418768 



433979 
420218 



449687 



445685 
413646 
437834 

454679 
440840 



433023 
437440 



1000797J 

874209J 

518029.1 

684159J 
864120J 
132288 1 
14739J 
30243_1 



18695.18 
978554.1 
49082.1 
6028.5 



1031334.1 
14066_1 



2635916.1 
596835J 
215430.1 

11847.4 

886897.1 
1227421 1 
16559 3 



409517 4537.1 



442562 39593.1 



1145037.1 
892055.1 
17316.1 

11624U 
1490237.1 
19377.1 



433641 35983.1 



2293204.1 

109625.1 

10194.1 

2076469.1 
191547.1 

245947.1 
25369^ 



381678.1 
1525656.1 
294580.1 

174325_1 
29686.1 



3970.8 
2497201.1 



AW973352 BF222929 AW016853 8F059130 AI651829 BES51767 AA558414AI339359 8^059601 A1961 162 AI341422 AI206248 AI206165 
AA548738 AA768578 AI539081 AWQK957 AA736837 N79575 AW5943S7 AA480892 
AA7648S2AA736937 
AVV954733 AA31S006 AW856665 

A»WroiAI634418AW236545A1824880BF223710AW1396B6AI6ra^^ 

BFS1071 5 BE673055 BE4641 1 1 AW590820 AI637939 AA404324 AW238441 A16509S2 6F0S6796 AAg74433 
AA301270 AA301379 AA301386 
AA683356 AW592804 AI15Q287 
AF147390 R76593 R76594 

AL526740 AW968449 M4591 40 AA843893 ^ 
AI566634 AA211796 BG615512 BE169275 BF983253 BF969462AA766261 A1769894AA1 35833 AI831542 N63375AA214392AU154486 
AW605017AW4S0072AA446459BEB81875AI061423AA598549AW439151AA426273Z40087AA812434A^ 
BF666746 D59356 B6678312 N56640AA1 66861 
AA917082AA757369AW592218 
AL109688 R23665 R26578 

^^^^^^2 AV760377 AL601008 BI604131 BE645918 BG187760 BG181525 BG210634 66192999 AI263307 
'^'^"^ ^^^562 AA7221 83 R79452 H7^^ 
AA084581 AA033810 AV742510 AV735788 R08336 i<wwi 
AA210987 D57294 AA214S84 AA207006 D56572 

AVffi65529All2^9 AW297152AI268215AI469807Ai9693a BE552356 N66509 AA738741 AA382555AW)75811 A>raiMIB^^ 



AA382556 AW235763 AA927051 AI862075 BE886691 BE619282 
AA906366 AA938956 AI910938 
AW102570 AA«)7150 AA907120 AA737188 AI248890 AW977353 

AWSdSdOo /u553633 

BM453041 AA760783BE21B582AI340046 AW166131 BF51 5854 AI630296AA461 307 AI090881 AW023059AA155797AA11548BALS97396 
AW889004AW937420AA137082AA013374BG619478BG401B39 A««oai Av™zjuw/v«»/9/AAn548BAL5g7396 

BE072634 BE072553 AAB3051S AA214736 AA331718 
AA218925 AW962081 AA354237 

SJ??.?™/^^'^^^^ AA151166 AW954405 AA131254 BG056461 W46291 H01532 H04384 H03231 AA852876 H04410 
no9o09 BEl 57601 AAl 1 3758 

C05613 W17389 C05351 AA31 1399 00418004896 0^502 C05482 0)4456004543 C04558C^^ 0031 14 003103 Al3699ra^SK 
T12391 T12073 W19390 O02994 002730 004434 W07136 R57607 003339 w«iw«oaw««ooz«» 
t^r^ BG399272 AA187835 BF821903 AV66055O AV660556 AV660502 BG564397 BE379584 BF446961 AI653056 AW973709 AI553173 
BG054997 AI266043 BI054879 AI656750 AI492830 AW021 142 AI4721 84 AW170056 A1082443 AI167921 059940 BI492088 H74180 AW130886 
^S1S;^37®?!.^^^^^^ ^^^203 AAl 15535 AI264790 AW205074 AA860452 AA554902 AI000715 062102 BE544768 AI376090 059939 
AW242249AA525421R34211 R34328BF248064BF241437BF572759BF218832 «io/omw uaww 

BI015205 Ar761324 AWB80937 AWB80941 
AA338252AA338213 

^flJS^^S ^^''^^ T67280BF995651 AI675065 8 

or 173 139 BEOi0038 

AA933717 BF061897 AW628327 AA641788 AA400495 
BE065183 AI144398 BE06S367 BF377924 

NMJ58166 AF220030AL043894AVV974257AA625445 AU153502 AI650537 AW612116AI672377 AW7724S1 8E892241 BE501 740 AA71 8938 
AI650275AI654206BE503226AI651327AVV873562AW271269AVW71565A]873518AI207150AI338B26AI6^ 

61054316 

^SS^ ^^SB^ ^^^5^ ^^590 AJ638743 A1633818 AW206802 AI583718 AF080231 AF080234 AF080233 AL635594 

A181 8326 AF080230 S46404 AI970376 AA463992 AW665466 BF512210 U87595 U87589 BE550633 AI672574 BE467547 AI68Q833 AW614951 
!f^^.???^.5S2?^ ^^^^^ BE673974 AI797496 AI701 526 AA703396AW1 39734 H92278 N66048 BE219539 BE671665 AI624817 BE466611 
tS^^^SS!^ ^'^^^^^^^231271 BF773517 U87594W^^ 
AA207155BI004756AA2C8262AI365204H77608AW590511 
T39328 T39310 T39303 T39284 

BE8933S6 AA625304 AI765607 A!6248S8 R76060 AA069651 BG998885 R35783 BFD86499 AA428755 AI245055 

BO013389 BC017398 AI023543 AA191424 A1267700 AI469633 AW958465 AW953397 AAl 72056 BE940298 BF909208 BF909980 BF095153 

BG2B5837AI720344BF541715AA355086AA172236 

N50454AA620999T16375 

^JIS^^^^SI'^^^^^ '^^^^^^'^^59950 AI624915A1B38047W^^^ 

AVV779380AA609879A!634791AI493770A!565211241145AI627952AA303734BE349457AW96765AA25«^ 
T65754 AA229658 AA229857 

^^^^^ ^'^^^^ AW300558 BG058755 AA058343 A1554842 AW207438 BF509981 BF444954 AW026234 AI6201O4 
^J^^!E!SSS^!*^!i^^^^ ^"^^^^ AI682952 A1167202 AA631394 AI421915 AI222883 BF477519 A120B777 AA765849 AI675076 

M^IJ iiiSSl'^^^^ ^^^'^22 AW023555 H06192 AA91 0222 AI660021 AI032525 AI375480 

A1361B60 AI032919 AA833599 80057928 AA56391 3 AW235737 AA002124 AA913636 F04607 AI867699 AA648100 BE091446 AM85378 

^r^J^^!^^}^ ^^^'^^^ ^^7^°* B''«^6878 BG290981 AW954251 AA757126 H11545 AA353384 N48448 

AA37984S AA004943 AA379928 AA002123 BM4701 18 AL598847 ALS98830 B6899239 R57470 BF939179 AI650642 AI75B851 BF352505 We8422 
VV35297H11435AA937499AI783996R12500AI819557N39093 Z41619H22849AA004942R09436^^^ 
BG029683AI248120BI850480AW779829 N22494 
6E1S5042 6E155040 6E154987 BE15S012 

AW749287 6E535498 AW749299 AW749293 AW749302 AW749298 AW749291 AW749294 AW7492B9 AW749288 AW749296 
AA769294 AW749297 AW749295 AW749292 BE002573 
AW8131 10 BF771370 BF771371 AW813113 AW003381 

BI862319AA204955BF240507BG212143AW205739BI760647BM129481 B1760482AW300025AI288591 AW236114AI302852AI03854a 
AI797M7 AA5344968G188194AA921877 B6191846 BG182959 BE620243 BF217428 BC009514 BM463015 AL529077 BM051B74 66773269 
BM3143518M314660AW629566AA316207AI623431AA504153AA3i4700BG195449BG614101 «wdio«du/m^w 
?.^?*?L™?'^^"^^^2^ ^^^5132 AW468888 AI6^^^ 
AI968683AA846676AA927355H80424AW973295 R88209 F298688E928871 
AA846804 AA757581 A1050950AI092024 AA838807 



626 
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417553 25B857_1 AL545411 AI096389BF431750AI130948VV60065 W80663AA258580 W3279VV76156VV80862AV^ N30878 

AI769345 R7i250 A1363766 R22777 R17009 R27985 R28243 
408085 101881.1 B1603077 AW954272 Bi598724 At0031 54 AA0S9300 AA04691 1 61689907 BI600966 81669987 

418049 12052.4 AJ314647NM_0S2888 61494693 AA835065AI534477AI336678At807696BF477887AI701 147 Z391 87 R38979 TO B1222234 

AV731417 R42406 H04996 T9B498 R12489 R12577 R42405 
430988 1237115J AW972830AA489820AA527547AA570382 

400268 840.4 U69668AM48366X631058C01 8514 BE694436AI655840AVV235355BG427984AA61 2862 AA448223BM1 45813 BMIM^ 

BM148408 AA448232AA454176 AA740959 AA88439t AA808545 AWD70759 BM144223 N75518 BE542983 BE241942A1 124022 AA761 687 
BF908518 BF907890 Rl 1490 AL536642 BF109180 AA953881 A)783716 BE622908 A1621005 AW148784 AI590114 AW275000 AI765790 BF222859 
AW167268 Ai990460 AW300443 AA779660 At620568 BF115024 BE504703 AW628332 AI922851 BE006638 AU158376 AI166279 AA809916 
A1489757 AA630828 AA830388 N64324 AI049683 AA970275 6F4773&4 BG261301 AA326388 AU1 50565 AU158374 AA687987 N58510 Ai8S0450 
AL549572 BF349280 eF349269 BM463016 AW836798 AL120958 AW836891 AW385525 BE175733 BE175727 BE175723 BF092430 BI061782 
AU135358BE17573t BE 1 75754 BE1 75756 BE8417478F798384AU 128251 BF095246BG223262 AW847833AL§36643AW366516AW391532 
BE934857 BF925057 AW438446 R86246 AWi79270 BE087782 BI832144 

440606 10075_1 BC017350 BC021031 A1220219 AI828751 AW134498 BE139642 AAB94554 AI278594 AV747315 BE561 749 B1085890 T801 17 H69682 N70904 
AV741999 H70098 

427239 20459 J AL532360 BE794750 AA5B2906 AI015067 AW271034 BG271636 AW075177 AW071374 AI345565 AI307208 BE138953 BE049086 AI334881 
AW075006 AW075181 AA464019 AW302733 AW075100 AW073433 AI802854 AI334909 AI802853 AI345036 A134B921 AI340734 AI307478 
AI251289 AW302327 AW072S20 AI312145 AW073856 AW072513 AW071289 A1307559 AA876186 T29587 A1307493 A1255068 AI252868 AI252839 
AW074809 AJ252926 AI252160 AI251662 A1251282 A!610913 AI270767 AI270156 A1252075 AW073469 AW072901 AW072496 AW071420 
AI305762 AI254764 AI8Q2837 AI251264 AVra73049 AVV07131 1 AI340843 6E138965 6E138502 AWO^ 

AI054302A1054060AI054057At053722 AI2B9711 BE1 39228 AW470478AW271039AW302085 BE041872A1254494 AI2714g8AI252427 
BF718773 BF718645 AW074866 BE857822 
445093 175963J AI207197 BF773544 AW196462 

456505 15472.2 BC017965AW969075AA279982AA504511 A2219979AA504595Al245579AA278iei BG485019B1049312 
428812 1383189.1 AA770001 AA431112AA432126 
437642 77594.1 . AL079309AA281819 
440129 260788^1 AI732997 AA977633 AA865818 

410704 1054673.1 AW8n458 AW877524 BE075922 BE166912 AW840534 BE076754 AW797829 BE166905 BE166926 AW877462 BE166927 BE166932 AW877523 
BE166917 AW877S2g BE166928 BF351394 AW877522 AW877528 BE166861 BE166866 BE166913 BE166919 AW877456 AW877S37 BE076866 
AW840571 

438243 2532601.1 AI581311 AA7B1682AA781678 
438889 52134.1 AF075009R63109R63068 
434776 118129J AW974599AA648988 R98760 



TABLE 53C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham, et d.' refers to the pubUcaHon entitled "The DMA sequence of 

human chromosome 22" Dunham, et a). (1999) Nature 402:489^95. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: bidicates nucleotide positions of predicted exons. 



Pkey 


Ref 


SIrand 


Ntposilion 


406547 


7711513 


Minus 


172780.174358 


404996 


6007890 


Plus 


379g9^145.38652-38998.39727-39872,40S5 


402199 


8576116 


Minus 


84187^744 


402145 


8018280 


Plus 


113086-114800 


401837 


7630990 


Minus 


12O993-121095.121660-121729 


403780 


8076989 


Rus 


93160-93409 


402299 


6693370 


Rus 


23367-25175 


401435 


8217934 


Minus 


54508-55233 


401464 


6682291 


Mfaius 


170688-170834 


402098 


8117697 


Minus 


4418&44330 


404287 


2326514 


Rus 


53134-53281 


406367 


9256126 


Minus 


58313-58489 


403137 


9211494 


Minus 


92349'92572.92958-93084.93579-93712.9394 


402176 


K75912 


Rus 


391138-391711 


403433 


9719611 


Mbitjs 


72225-72437 


402408 


9796239 


Minus 


110326-110491 


405935 


6768795 


Minus 


163112-163652 


406542 


7711499 


Plus 


117335-118473 


402099 


8117697 


Phis 


121553-121742.123265-123423 


404088 


3168621 


Minus 


18123-18768 


401644 


8576138 


Rus 


82655^59 


402680 


8113438 


Rus 


137634.137768.13970M39B93,140475-14059 


401091 


9958240 


Rus 


94760-84896 


400587 


9887626 


Rus 


25435-25568.25668-25747 


401148 


2547238 


Minus 


22521-23053 


403432 


9719611 


Minus 


68204.68392 


400592 


9887642 


Minus 


24642-24815 


403610 


8308266 


Plus 


157705-157860 



TABLE 54A: 

Pkey: Unique Eos probeset idenGfier nuniber 

ExAccn: Exemplar Accession number, Genbank aocesskm number 

UnigeneiD: Unigene number 

Untgene Title: Untgene gene title 

Rl: Ratio of norma) test] to normal adult tissues 

R2: Ratio of 'average* normal test! to 'average' testlcidar cancer 

Pkey ExAccn UnlgenetO Unigene TUe Rl R2 
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401979 






421825 


AA298758 


Hs.183747 


408493 


BE206854 


Hs.46039 


441728 


AI797395 


HS.1697S7 


452215 


AK002043 


Hs^472 


415211 


R64730. 


Hs.155988 


418877 


S83308 


Hs.87224 


449108 


A1140683 


Hs.98328 


420437 


AA99Z768 


Hs.97633 


438632 


AA724299 


Hs^4020 


418409 


AA219332 


Hs.120869 


408409 






427060 


AW378993 


Hs.90288 


427310 


AI613480 


Hs.47152 


427166 


AM31576 


Ks.99154 


427178 


AA398888 


Hs.97542 


410694 


AL137538 


Hs.65500 


424310 


AA338648 


Hs.50334 


427441 


AA41260$ 


Hs.343879 


438057 


AW294544 


HS.1257B5 


422163 


AA431698 


Hs.112794 


427293 


AA705799 


Hs.183714 


444983 


AI916973 


Hs.213603 


428608 


At699329 


HS.9916B 


453178 


AA498068 


H5.61648 


428618 


AAB85360 




401741 






422086 


AW182930 


Hs^50182 


426604 


H53354 


Hs.97141 


442373 


A1377758 


Hs,164799 


427455 


AF173081 


Hs.178215 


437248 


AW449340 


Hs.93090 


426608 


AA444162 


Hs.99344 


427297 


AW292593 


Hs.334907 


422358 


AL133030 


Ks.1 15429 


451610 


AW118804 


H5^7126 


410830 


8E044562 


Hs.266847 


426677 


AW949856 


Hs.97165 


437558 


AI126471 


Hs.124112 


423088 


NM.006687 


Hs.123530 


426476 


NM.003296 


Hs^ 


421952 


AA300900 


H8.98849 


429877 


W37337 


Hs.103014 


413114 


AI825838 


Hs,75206 


412026 


AA383618 


Hs.73073 


411844 


AI607681 


H5.144656 


436868 


AA974253 


Hs.120319 


426599 


AW183574 




426683 


AI073430 


Hs.146775 


426930 


AA393442 




4Z7836 


AA416842 


H3.116176 


407721 


Y12735 


Hs.38018 


430822 


AJ005371 


Hs^48017 


434150 


BE047007 


Hs.116116 


422789 


AKD01113 


H5.120842 


422116 


H54205 


HS.111BS0 


433724 


AI827749 


Hs.144924 


410187 


AA860341 


Ks.104680 


419584 


AF053356 


H5.283764 


456182 


A1147996 


Hs.155633 


418885 


T19204 


Hs.195685 


426848 


AA3B2787 


Ks.122713 


420349 


NM»016611 


Hs.97174 


426624 


A)125222 


Hs.96712 


420710 


NM_007009 


H3.99875 


434317 


A1S74095 




443432 


AI056863 


Hs.339871 


425709 


AA383076 


HS.1S9274 


426670 


AA383047 


Ks.310210 


408813 


AW242086 


Hs.253987 


452235 


AL039743 


Hs.28514 


434133 


A)655275 


Hs.236635 


427294 


AA412594 


HS.12S902 


427262 


AA448509 


Hs.128852 


429851 


AA459835 


Hs.120573 


406376 






425865 


AA393491 


H5.183740 


428665 


NM.017481 


Hs.169184 


439379 


AA8350Q2 


H$.125611 


427520 


BE467881 


H5.97489 


458940 


8E149824 


H$.132888 


426620 


AW450252 




429516 


AI653299 


Hs.99354 



C17000767:gl|1 1990770|Binb|CAC19651.1| <A 

ESTs. Moderately similar to CALB J<UMAN C 

phosphoglycarata mutase 2 (musdB) 

Honio sapiens BOUL£ (BOUl^ rnRN\ cofiiplet 

hypoihefical protein FU1 1181 

OVm (Asp^iKAlaAspftlis} box pcHypep 

SRY (sex detendning region Yybax. 5 

hypolhetrcd protein MGC13040 

Aklnase (PRKA) anchor protein 4 

ESTs. WeaMy sbnilar to CRTCHUMAN CAU^ 

ESTs. Weakly similar to R107.HUMAN HfiEV 

Target Exon 

ESTs 

Iek6n3 

ESTs 

Hotno sapiens testisi-dev^pment rdated 

Homo sapiens mRNA; cONA OKFZp434N2019 (f 

testes devebpmenkelated Nyi>^P22 

SPANX famDy. memt)er C 

ESTs. Weakly similar to CORB MOUSE CORNI 

Human ONA sequence from done 1088E13 on 

ESTs 

ESTs 

ESTs. Weakly similar to AF1 32972 1 CGI^ 
ESTs 

Target CAT 
Target Exon 
ESTs 

ESTs, Weakly similar to hypothetical pro 
testes development^elated NYD^P17 
Vertetirate UN7 homotog 1. Tax IntBradi 
ESTs 

hypothetica! protein PRT0-NY3 

Homo sapleiis, clone MGC:17333. mRNA, com 

Homo sapiens mRNA for KIAA1666 protein, 

ESTs 

ESTs. Weakly mar to KIAA1 214 protein 

ESTs 

ESTs. lUtoderatoly similar to HSJZJIUMAN D 

acDn-Uke 7A 

testis specific protein 1 {prot» H4-1 p3 
dynein 8ght chain 28 (DNLC2B) 
ESTs 

protein phosphdase 3 (fomierty 28). cat 
tesHs-spedfic ankyrfn mo6f containing 
ESTs. Weakly simflar to T17257 hypotheti 
Homo sapiens autoimmune infertility-rela 
ESTs 

ESTs, Weakly sirrilar to T30993 hypoUieO 

ESTs 

ESTs 

dual-6pecifk:ity tyros!ne^>i)hosphoryl 
glycer8!dehydo-3f hosphato dehydrogenase 
tesQs specific, 10 
hypothetical protein FIJ102S1 
mitochondria capsule selenoprotein 
serfneAhreonine protein kinase SSTK 
ESTs 

F-lx)x only protein 24 

ESTs, We^ similar to splioeosomal pro 

ESTs 

ESTs 

potassium inwardly'recQj^ chsnnel, s 
hypothetical protein [)KFZp434H0311 
zona pelludda Urufing potein 
ESTs 
ESTs 

outer dense fibre of sperm t^ 1 

ESTs 

ESTs 

testes developmenWelated im)-SP21 
ESTs, Weakly similar to ATHUB aclln bete 
ESTs 
ESTs 

hypothetk:al prol^ DXFZp434K1172 

NM.02124r:Homo5aptens protemine 3 (PRM 

ESTs 

iiblquIiinS 

ESTs 

ESTs.W^8irrtotoB28098Qn8-1 pr 

KIAA1674 

ESTs 

ESTs, WeaMy dmOar to hyperpolaitzatto 



10.08 


43.3 


10.35 


38.8 


5.74 


32.0 


10.54 


24.5 


4.88 


22.0 


6.93 


19.8 


4.94 


18.5 


7.94 


17.5 


.16.75 


15.6 


6.51 


14.9 


4.96 


14.6 


a98 


14.3 


3.56 


14.2 


4.S0 


14.0 


4.28 


13.9 


10.19 


13.7 


5.76 


13.4 


5.24 


13.3 


10.45 


1Z6 


18.57 


12.3 


5.18 


12.3 


10,79 


12.1 


ai8 


12.1 


15.40 


11.8 


4.13 


11.8 


7.53 


11.5 


10.41 


11.5 


4.39 


10.7 


7.04 


10.6 


8.23 


10.3 


3.03 


10.1 


9.06 


10.1 


3.68 


10.0 


9,70 


9.8 


11.85 


9.7 


5.63 


9.7 


4.38 


9.5 


&58 


9.3 


4.47 


9.2 


15.07 


6.9 


18.55 


8.9 


1333 


8.8 


6.97 


8.7 


3.78 


8.6 


22.03 


8.5 


7.34 


8.3 


4.16 


8.2 


6.29 


8.1 


10,89 


8.0 


5.08 


8.0 


4.79 


8.0 


5.11 


7.9 


21.26 


7,6 


4.85 


7.6 


10.29 


7.5 


9.12 


7.5 


22.24 


7,4 


3.03 


7.4 


6.43 


7.4 


9.90 


7.3 


7.14 


7.3 


7.03 


7.3 


14.90 


7.3 


3.71 


7.1 


20.78 


7.1 


3.98 


7.1 


3.46 


7.0 


23.21 


7.0 


6.92 


7.0 


5.77 


6.8 


9.23 


6.7 


7.46 


6.7 


a44 


6.7 


5.66 


6.5 


9.01 


6.5 


3.96 


6.4 


9.15 


&4 


11.07 


6.4 


5.06 


63 


9.29 


6.2 


3.11 


6.2 


12.27 


6.2 


9.15 


6.1 
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426736 


AA43161S 


Hs.1 30722 


427843 


AC00S622 


H3.1B0943 


426639 


AI799059 


Hs!l 12807 


438637 


6E500941 


Hs.126730 


433795 


AI216683 


Hs. 122599 


441232 


AI656050 


Hs.7086 


433943 


AA992805 


Hs.44865 


426955 


AA393869 


Hs93S0Q4 


428918 


A1036967 


HsJ2324 


427851 


AA846543 




428208 


AA442327 


Hs.104854 


422207 


AI828862 


Hs.1 0964 


431153 


AVV972343 




419350 






427107 




lie lAO'UR 


429461 


AI188219 


na.99jl 1 


432512 


NM 003384 


Me 


434451 


AW4dSm 


n9.i« 1*100 


420348 


All 37385 


Uti 071^ 


427214 




ns. 1 / 0£ I J 


456658 


AI301117 




457034 


AA398061 


no. £9090/ 


423120 


AW1 60551 


Ue 19^01 
no. 


438983 


AF0858B4 


ns.anu.9 


426619 


A)357194 




440822 


At554897 




416205 


AA17639S 




426712 


AW173177 




427840 


A)216654 


Hs.98251 


439314 


AA382413 


Hs 1711144 


426943 


BE551631 




409209 


AA46016Q 

fWtUWlUU 


Uc 7^91 7 


441710 


AI1878a3 

rV Iwf OWw 


Us i97cin 


420571 






428563 


AA431616 




433994 


AU042483 




441856 


AI574774 




427789 


AA412428 


Hs.46842 


418967 


NM 001735 




430232 


AA489940 


u. inn94 

no. IwQg&'f 


421850 


AW274576 


Ue 191(191 


449436 




Uq 07Wf\7 


426699 


AA3B3337 


Uc 1919fiQ 
nS. IZlAOi; 


426827 




Ue 1QRCBR 


427285 




He Q77A4 


423693 


Al 133633 


ns.lalf 19 


457019 






405264 






450606 




ns.Dvoou 


421378 


L77564 




431215 




Uq 191 CC^ 


427423 




Ue 177Q9fi 
ns. 1 f /7£0 


438756 




Uq qn^Q9^ 


424197 




Ue I^OQAQ 


423284 




He 19R4QR 


432117 






424426 


AM7fi41fi 


He 1^9ftM 


437387 


All 98874 


Hs. 28847 


420718 




Uo QQAfll 


420768 




He 909Km 


423677 


M8Bfi08 


H« imiftl 


436661 


All 25270 


H« 19Mfia 


427749 


BE04S979 


Ue QonQc 


441830 


AA3a31(U 


H« 490^ 


427877 


AW1 38725 


Ue 17flnR7 
no. 1 rOUOi 


426623 




He 1^97Cn 


429965 


ALj040379 


ns.anmi 


451099 


RS2795 




417592 


nntSJ'Wv'i 


ns,ia24Jf 


421938 






424144 




n$.4lD44 


426710 


BB)41517 


Hs 143893 


428710 


Ai690919 


Hs.126780 


438641 


AW1384&4 


Hs.190653 


420614 


AL110291 


Hs.99384 


422705 


NM.006686 


Hs.119287 


421805 


AIJ042716 


Hs.130947 


448963 


AA459796 


Hs.331247 


426738 


AA421097 


Hs.29ig02 


440403- 


AW665135 


Ks.130531 


456085 


A)184560 


Hs.130352 


439594 


AI24S026 


Hs.1 11099 


428909 


Alig0714 


Hs.98945 



ESTs 

hypoUnOcsl protein R30953L1 
ESTs 

ESTs. Weakly similar to KIAA1214 protein 
ESTs. WeaWy similar to ALU7.HUMAN ALU S 
hypothsScai protein MGC1243S 
lymphoid enhanc6r-IMit» factor 1 
ESTs 

protamine 2 
ESTs 
ESTs 
ESTs 

ttypolhelica) protein FU21343 

Homo sapiens chromosome 19, cosrrM R2666 

ESTs 

ESTs. Wealdy similar to HSJ^^HUMAN DNAJ 
transilton protein 1 (during hbtone to 
ESTs 

Homo sapiens mRIW: cDNA DKF2p434iyi112S {f 
ESTs 

Homo sapiens ciuomosome 21 segment HS21C 
soggy-igene 

proaoosin bindtog protein sp32 precuso 
ESTs 

Homo sapiens clone 191B7 placenta expres 

hypoUietical protein |ylGC5356 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Wealdy simitar to ENC^HUIWAN ECTOD 

Human DMA sequence Itam ctone RP1-39G22 

ESTs 

ESTs 

ESTs 

iiypotiieticai protein fU23093 
tiaclerteidal/penneatiity-increasing pro 
ESTs, Moderately similar to fmJH fenf t 
ESTs 

i)ypothetlcd protein OKFZp434i21 17 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cOMA Dia7p434E2118 (f 

iiypolhetical protein 

NM_030813*:Homo sapiens suppressor of po 

ESTs, Moderateiy similar to ALU6,HUMAN A 

serineAhieonine idnase 22B (spenniogene 

Human ONA sequence ftwn dona RP11-218C1 

exonudease NEF^p 

hypotlietlcal protein DJ<FZp434L17l7 

gerni cell spedfic Y-box twntfing protein 

Homo sapiens ctvomosome 19. cosmid R3134 

protandnel 

KIAA1674 

AO026 protein 

lactate dehydrogenase C 

ESTs, Wealdy similar to T47142 tiypotheO 

pyruvate dehydrogenase (npoamide) alpha 

ESTs. Wealdy simOar to T1 9142 hypothec 

Homo saptens cDNA: FU230S2 lis, done L 

hypothetical protein 0KFZp564D0372 

ESTs 

ESTs 

Homo sapiens cONA iiJ1 1789 fis. done HE 

interleuldn 13 receplor. dpha 2 

ESTs. WeaMy similar to 154383 chromosom 

gb2u66c01.r1 Soares.teslis.NHT Homo sap 

AKAP-assodated sperni protein 

ESTs 

ESTs, Weakly slmIlartoT12519 hypothetl 
ESTs 

putative transmemtirane protein 
acSn-likeTB 

hypothelica] protein DKFZp434N1415 
Homo sapiens, done tMAGE:3610712, mRN^ 
ESTs 
ESTs 

ESTs. Wealdy SffnOar to A47582 &<«n gr 
hypothetical protein MGC10974 
ESTs 

629 



3.58 


6.1 


6.34 


6.1 


6.93 


6.0 


3.04 


6.0 


10.45 


&9 


4.27 


5.9 


6.87 


5.8 


4.75 


5.8 


38.40 


18 


15.87 


5.8 


6.34 


5.7 


6.43 


5.7 


9.24 


5.7 


14.94 


&6 


6.25 


5.6 


3.92 


5.6 


22.03 


5.6 


7.89 


5.5 


11.26 


S.S 


8.41 


5.5 


4.35 


5.5 


11.29 


5.4 


8.88 


5.4 


22.69 


5.4 


7.07 


5.4 


3.60 


5.4 


10.26 


5.4 


8.17 


5.3 


6.44 


5.3 


8.35 


5.3 


6.31 


5.2 


7.85 


5.2 


7.73 


5.1 


9.39 


5.1 


14.94 


5.1 


6.84 


5.0 


a74 


5.0 


4.29 


5.0 


4.14 


4.9 


10.66 


4.9 


1Z27 


4.9 


4.50 


4.9 


5.67 


4.9 


20.66 


4.9 


4.72 


4.8 


&03 


4.8 


3.67 


4.8 


4.48 


4.8 


3.76 


4.8 


7,35 


4.8 


a66 


4.7 


19.27 


4.7 


12.05 


4.7 


17.70 


4.7 


7.50 


4.7 


55.33 


4.7 


5.41 


4.6 


5.04 


4.6 


9.18 


4.6 


5.70 


4.6 


10.93 


4.5 


3.82 


4.5 


6.45 


4.5 


11.29 


4.5 


4.09 


4.5 


26.62 


4.5 


13.25 


4.5 


3.88 


4.4 


3.46 


4.4 


4.69 


4.4 


19.15 


4.4 


5.57 


4.4 


11.86 


4.4 


6.19 


4.4 


6.68 


4.4 


9.73 


4.4 


5.89 


4.4 


6.77 


4.3 


3.49 


4.3 


6.97 


4.3 


6.30 


4.3 


7.85 


4.3 


7.79 


4.3 
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426735 


T78716 


Hs.120445 


esTs 


5.10 


4.3 


438653 


AW188099 


Hs.131813 


ESTs 


5.29 


4.3 


443038 


AI968058 


H5.209206 


EST3, Weakly sbnOar to S38782 acQn bet 


7.29 


4.2 


428677 


AI657119 


Hs.120036 


troponoil, cardiac 


10,73 


4.2 


424220 


AK000869 


Hs.143251 


hypothetica] protefai 


9.13 


4.2 


426299 


H93373 


Hs.169222 


acrosoma) vesicle protein 1 


4,87 


4.2 


42B871 


AA913840 


Ks.38903 


ESTs 


3.76 


4.2 


410163 


AF151977 


Hs.59260 


NTTSpratsin 


7.29 


4.2 


433133 


AB027249 


Hs.104741 


POZ-t)ini9n9 Idnase; T-ceP origlnaied pr 


5^ 


4.2 


427757 


A1142295 


Hs.129794 


ESTs 


3.57 


4.1 


426721 


AA383588 


Hs.131816 


ESTs. similar to T2901 2 hypotheti 


11.13 


4.1 


401692 






C16000122*:gi15689527]dbj1BAA83047.1| (A 


6.37 


4.1 


403783 






NM 031956:Homo&apiensNYO-SP1 4 protein 


3.74 


4.0 


421611 


AM59641 


H8.97309 


ESTs 


11.51 


4.0 


404271 






ENSP00000244792':Phosphoglycerate kinasa 


4.02 


4.0 


441800 


AW027571 


Hs.7973 


t^ypolhetica) protein DKFZp434G156: KIAA1 


8.88 


4.0 


423000 
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440182 


AA868919 


Hs.250110 


EST8 


3.03 


13 


435517 


AA926626 


Hs.130177 


ESTs 


3.64 


13 


448309 


BE044261 


Hs.149774 


ESTs 


3.52 


13 


420338 


AA825595 


H5.88269 


Homo sapiens, done MGC:17339i mRNA. com 


4.23 


13 


433829 


AI190715 


Hs.102021 


ESTs 


6.08 


13 


4294B5 


AW197086 


Hs.99338 


ESTs 


3.14 


13 


423058 


AWg64568 


Hs.111591 


ESTs 


3.36 


13 


433B22 


AI218809 


Hs.112772 


ESTs 


3.63 


13 


442268 


BE2780&4 


Hs.8179 


hypothefcal protein, done 2746033 


3.00 


13 


434298 


AA660090 


Hs.116290 


ESTs 


3.71 


13 


440720 


AW662776 


Hs.336943 


Human DMA sequence Irom done RP1 1-60H7 


3.08 


13 


427554 


AW246578 


Hs. 179615 


hypothetical protein FU10058 


3.39 


13 


414251 


AL042306 


Hs.97689 


VASA protein 


1U5 


13 


430254 


AI609S20 




ESTs 


3.27 


13 


437418 


AI47B954 


Hs.53459 


ESTs 


3.43 


12 


437522 


AA983844 


Hs.121383 


ESTs 


4.15 


12 


422808 


AA449014 


Hs.121025 


chromosome 1 1 open reatfing frame 5 


3.09 


12 


436895 


AA72S655 


Hs.120480 


ESTs 


5.26 


12 


422247 


U18244 


HS.113G02 


sdule canicf fandly 1 (Idgh sfGntty a 


&78 


12 


421625 


AA405386 


Hs.178004 


ESTs 


3.22 


12 


434807 


AA364183 


Hs.323443 


hypothetical protdn FU11806 


5.51 


12 


428174 


AA913321 


Hs.126778 


ESTs 


3.09 


12 


409735 


AL035295 


HS.S6175 


Ksaplens gene fnm PAC 1 06H8. simitar t 


3.11 


12 


444467 


AI1S0388 




ESTs 


3.81 


12 


433832 


AA918018 


Hs.172516 


ESTs 


6.94 


12 


440038 


AW593295 


Hs.210956 


ESTs 


5.87 


12 


415240 


AA161411 


Hs^8668 


chromosome 21 open reading bame 57 


3.66 


12 


432538 


BE258332 


Hs^8352 


male-enhanced antigen 


3.56 


11 


440882 


Ai20S777 


H5.129S38 


ESTs 


3.83 


11 


436605 


A1187742 




ESTs 


3.41 


11 


422990 


AF035620 


Hs.122764 


6RCA1 associated protein 


5.66 


11 


432174 


AW590264 


Hs.132806 


ESTs 


3.05 


11 


409267 


NM.012453 


Hs.52515 


bansducn(beta)-iilce2 


5.46 


11 


423021 


AIJ036111 


Hs.292767 


ESTs 


3.87 


11 


439012 


BE383814 


Hs.6455 


RuvB (E coli homologHite 2 


5.25 


11 


433812 


AA72S026 


H5.97165 


ESTs, Weakly sImBarto T31611 hypothea 


3.62 


11 


456924 


A)631510 


Hs.196956 


ESTs. Highly simitar to matdi to EST AA3 


4.38 


11 


437249 


AA432202 


Hs.103147 


hypothetical protein FU21347 


3.38 


11 


426038 


AA388101 


Hs.99052 


ESTs 


3.03 


11 


427065 


AA397903 


Hs.236635 


gb2t89f12.r1 Soares.tes6s.NHT Homo sap 


3.23 


11 


428824 


W23624 


Hs.173059 


ESTs 


107 


11 


428224 


X54017 


Hs.183088 


acrosin 


3.16 


11 


438954 


AA740151 


Hs.130425 


ESTs 


3.20 


11 


444470 


AA412195 


Hs.13740 


ESTs 


4.27 


11 


457579 


AB030816 


Hs.36761 


HRAS-like suppressor 


5.30 


11 


427886 


AA417083 


Hs.104789 


ESTs 


3.49 


11 


439273 


AW139099 


Ks.269701 


ESTs 


3.83 


11 


434318 


AW207552 


Hs.116328 


ESTs. Wealdy sln«ar to A3g564 lianscrip 


4.01 


11 


427015 


AA397520 




ESTs 


4.28 


11 


421598 


AW630942 


H8.108061 


RD RNA-binding protein 


3.30 


11 


427236 


AA399959 


Hs.148271 


ESTs 


3.07 


11 


434520 


AA205273 


H5.177011 


hypotheBcal protein 


3.19 


10 


456051 


T85626 


Hs.76239 


hyix)the&al protein FLJ20608 


3.11 


10 


418322 


AA264166 


Hs.84113 


cycDiHtependent ksiase Inhibitor 3 (COK 


8.41 


10 


451807 


W52854 




l^theOcal protein FU23293 similar to 


3.52 


10 


448984 


AW751955 


Hs^53 


hypothelicd protein FU22318 


4.73 


10 


420464 


W32963 


Hs.98269 


VRK3 for vaodrda relaled khase 3 


3.86 


10 


414181 


AK000476 


Hs.75798 


hypotheflcal protein 


3.33 


10 


424558 


AF038847 


Hs.1504g0 


FKSO&lrindIng proldn 6 {36k(^ 


7.70 


10 



TABI£548 

Pkey: Ui^ue Eos probeset MenSfler number 
CAT number Gene duster number 
Aooession: Genbank aocesskm nuiAbers 



Pkey 


CAT Number 


Accession 


428616 


2668.1 


BO017998 Bt826643 86715794 BG722697 Bi460787 B6773459 H52859 AI652853 AI990773 AW665193 AW340801 AA913806 AI337099 






BE045942 AW572790 AV^15652 H15004 AA9091 15 BI465310 BI462024 BI561578 BI46307S BG722527 R86003 BG623286 H15003 BI562131 






6G435272 


426599 


10110J 


BE736800 BM471423 A1J57221 BG763302 BF742ig6 BF991016 BG200112 BF920027 8G576409 BG332214 61830957 AI827504 AW183574 






AI805171 A1126491 AA448257 A1090641 AW183329 AAS94873 AI203683 BE041513 AA382260 AA382261 ALS54887 BE273483 


426930 


1310779J 


A1809889 AA393442 AI1S0574 AI200886 AI221 692 AAe08977 AA813213 


434317 


599587,1 


AJ209094AJ377740AW1l7382AW182289A!674095AWl8a019AA897352AA931314AA923336AW665317AA629314AA7^^ AA906846 






AA974625 AA684357 AI808590 


426620 


142987J 


AUM2392Al147451AA758821AW450252AA39g310AI656343A1636668AW515660AI190733Ai025812AA723M^ 






AA398244 AA382463 AH 39837 


419350 


13086.1 


A1218809 BC014809 BG724383 AI0243S9 AA904573 A1138595 AA868885 AI768931 BI828436 BG717350 86719800 AW182303 AA44d181 






BI826670 BI827131 BI830254 61824155 81831745 T19ig0 81830415 


440822 


532606.1 


BG207562 8G1921 13 AA977616 AW274024 AI554B97 AI221379 AA969158 AA906867 AI873494 AI015039 


421938 


863689.1 


AA412383 AA300675 8G773248 AA412243 AA405951 



633 



wo 03/042661 



PCTAJS02y36810 



442S89 33097J2 



436SB8 
431985 
420431 
436601 
408415 

427748 
440115 
428524 
440933 
427829 
438735 
449333 



433792 
442122 
433760 
426956 
413372 



430254 
444487 
436605 
427015 
451807 



2470836.1 

76926J 

29290.1 

10131J 

4581.6 

137262?^1 
34.3 

1382164.1 

980517.1 

1373537.1 

10316.1 

36378.1 



2204621.1 
2684549.1 
584982.1 
657337.1 
32896.1 



13102.1 
1008400.1 
1008207.1 
683123 1 
1775812 



N1UL004157 X14968 66480488 8E409869 BG72389B 86476313 AU121626 BE386516 AI969297 AW172340 AA889668 R23438 AI01S037 
BE250556 BF590945 AW385993 BF983000 AA070235 AL556082 M78388 AW504473 AW370139 8691 3697 8E699098 B6827945 BE741233 
AI015465AW370169BE297350AA093249 
AI122828 AA»)9991 AA759233 

AL591713 BF197609 A1985094 AW448916 A1243277 AL449530 AL449S29 

AW241405AW205071 AI671586AJ652354AI638465AW590359AW662771 AW594067BE502532A1218894BE466416BF056295A1247366 
AI990484 AI917746 AW665925 A)216456 AW182169 AA989884 AA723888 AI01841 9 

BI602176 B1603138 BI459895 Br755030 AW418788 AAB83999 AA724858 AI48031 1 AW196355 AI004813 AI651117 BE814363 AW58985& 
AA4481 24 AA447982 BI461166 AA405629 
AI208966AA421041 AA81S377 AA411954 

BF980398 R51074BF979883A1S39370BM128735AA993397AI611039AW593985 R41808 
A1208080 AA442862 AA42S772 

AI125404 AW593312 AI247364 AI208217AA910021 AI915307 
A1190292 AI188225 AA416673 AA416596 AA952888 AA972172 AA906874 

M78676 NM.022571 6G772522 BF516449At537465 AW517245 BF762536AA634446AW1 96331 AI203035 B6722281 

AK0S6320 AL522040 BI793043 AW071691 AI433682 AAB85414 AA702664 BI792794 H96879 R52351 AA211126 AA442875 N25725 AA482S63 

N33446 N25222 Z41110 N26507 N73447 N24077 N20492 AWZ75550 H99819 ALS18306 ALS22041 AW959849 AL518307 AA72S907 A165S113 

AI309906 F10184 BM451081 BE257595 6G721625 8)828509 BG70047D F12568 Z45396 BI829288 AA364618 AA364aS1 AA421448 T74231 

R52350 AMe2415 AI203021 T88948 AI565842 

AI024286 AA769898 AA776661 AA868972 AA609S24 

AI932330 AI190707 AI376782 AA976947 

AA609179AW592321 AA758282Ai214437AW072537AA781937 

BI831 486 AW190479 AI472793 AA460217 AA459937 BF082576 AA393673 AA398702 

NM-006001 L11645A1205504AI207994Ai187362AA709ig0BI462421 BG772170 BG722772 AA436991 8G771655B)553260BF1 26025 
BF125857 61462670 BG724164 61562424 BG721652 61559662 8G7224S5 B159641S 66717561 66722138 66773507 BG720572 C03887 At016802 
AL042663 AA770436 AA435720 T19365 AA626698 AA759057 AI208021 BF507644 AI20805B AA41 2719 AA426374 AI208775 AA977217 AA758055 
F34585AA180062 

B6700885 AA888017 AW341719 AA971332 AI688794 Z20462 AI808145 AW665263 AA884952 AA906136 
AW663704 AI150368 A1216464 

AI125340 A1125684 AI377949 AI126470 A121 8351 AW665355 Ai243952 AW653454 AI240503 A1187742 AA884214 AA723933 
AAB57437 AI968733 A1968939 AA992784 AA397520 AW235244 

BM479185 AL552795 AL577722 BF038888 BM127617 BF510346 AW450652 AA865478 AW4495ig 6M127314 AI606S39 AW449522 AA993634 
AI827626AA904788 



TABLE 54C 

Pkey: Unique number corresponding lo an Eos probeset 

Re^. Sequence somce. The 7 digit mimbeis In tills cdumn are Gentiank Identifier (61) nunnbefs. 'Dunham, el at.' refers (o the pubOcaBon entitled The DNA sequence of 
human chromosome 22' Dunham, et bL (1999) Nature 402:489495. 
Strand: liMficatss DNA strand from which exons were precScted. 
Nt_posilion: tndcales nucleotide positions of predictad exons, 



Pley 




Strand 


Nt poriMon 


401979 


2828776 


Minus 


75693-75851.76977-77112 


406409 


9256364 


Minus 


141101-141256 


401741 


2982169 


Plus 


19668&>196823.200241-200381 


406378 


9256142 


Minus 


126408.126800 


405264 


7329374 


Plus 


28556-28684 


401692 


3540172 


Phis 


26365-26510 


403783 


8081824 


Rus 


128412-128635 


404271 


9828129 


Minus 


56392-57645 


401798 


6730720 


Phjs 


22831-23448 


402857 


9801539 


Plus 


13402-14133 


405528 


9581957 


Minus 


22418-22687 


403328 


8469086 


Minus 


120428.120703 


401712 


6682593 


Minus 


76410.76527.76692-76B29.78737-78866.8024 



TABl£55A: 

Pi^ey: Ur^ue Eos probeset identirier number 

ExAocn: Exen^iar Accession number, Genbahk accession number 

UnigeneiD: linigene number 

Uhigene T(lle:Uniiene gene thle 

R1: Ratio of non^eminomatous ndxed germ cell lesScular cancer compared to normd aduH testicular tissues 



Pkey 


ExAccn 


UnigeneiD 


Unigene Title 


R1 


416680 


AW245&40 


Hs.79516 


brtf n abundant, ntendvane attached s^nai 


48^0 


423961 


013666 


Hs.136348 


peftosfin(08F>2Ds) 


43.30 


412948 


BE243313 


Hs.334851 


UM and SH3 protein 1 


4150 


428928 


eE409838 


Hs.194657 


cadherin 1 , type 1, E-cadheiin (epithel 


38.80 


424247 


X14008 


Hs.234734 


lysozyma (renal amylohlosis) 


29.80 


414438 


A1879277 


Hs.76136 


Ihioredoxh 


29.40 


406656 


A!920965 


Hs.77961 


msjor histocompafiiillty oomptoc, dass 


29.25 


446899 


NhL0OS397 


Hs.16426 


podocatyxin-liie 


28.70 


411573 


A602g000 


Hs.70823 


KlAA1077prot^ 


28.40 


432730 


AI066520 


Hs.131358 


ESTs 


28.00 


418870 


AF147204 


H&89414 


chemokine (C-X-C moliQ, receptor 4 (fus 


28.00 


444381 


6E3B733S 


Hs.283713 


ESTs. Wealdy simliar to S64054 hypoM 


27.90 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteoponOn, 


26.60 


447526 


ALJ048753 


Hs.303649 


smali InduciblB cyioldne A2 (monocyte ch 


£.00 


416174 


L20688 


Hs.83656 


Rho 60P dissoctaOon inhaiitor (601) bet 


24.15 
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406856 


AVI^15338 


Hs.29797 


414682 


AU)21154 


Hs.76884 


440440 


Z28925 


K$.7188 


417139 


M69043 


Ks.81328 


417426 


NM.002291 


Hs.82124 


407862 


BE548267 


Hs.337986 


439180 


A1393742 


Hs.199087 


413787 


A)352558 




408669 


AI493591 


Hs.78146 


412636 


NhL004415 




414092 


Z14244 


Hs.75752 


414587 


NIWL004882 


Hs.76507 


412915 


AW087727 


Hs.74823 


408646 


AA563730 


Hs.277477 


412247 


AFD22375 


Hs73793 


446108 


AL036596 


Hs.42322 


424800 


AL035588 


Hs.153203 


413063 


AL035737 


Hs.75184 


415314 


N88802 


Hs,5422 


406656 


M16714 


Hs.69643 


426295 


AW367283 




406820 


AI223958 


Hs.108124 


444562 


AA186715 


Hs.336429 


417088 


M54915 


Hs.81170 


449338 


H73444 


H5.394 


444784 


D12485 


Hs.11951 


414420 


AA043424 


HsJ6095 


425543 


R23313 


Hs.334895 


452679 


Z42387 


Hs.83883 


420754 


W79431 


Hs.346911 


410397 


AF217517 


Hs.63042 


417640 


D30857 


HS.823S3 


415899 


)C7B992 


HS78909 


406786 


AW161678 


Hs.111334 


422105 


Ai929700 


Hs.1 11680 


422714 


AB018335 


Hs.119387 


444051 


N48373 


Hs.10247 


426996 


AW968934 


Hs.173108 


445863 


R12234 


Hs,13396 


429614 


AI371172 


Hs.211539 


410185 


BE294068 


Hs.737 


409038 


T97490 


HS.50X2 


410611 


AW954134 


Hs.20924 


448588 


AI970276 


Hs.156905 


421406 


AF179897 


Hs.104105 


420962 


NH.005904 


H5.100602 


425234 


AW152225 


Hs.165909 


452322 


BE566343 


Hs.28988 


454413 


AJ653672 


Hs.40092 


415221 


W07418 


Ks.78225 


425535 


AB007937 


Ks.158287 


450000 


AI952797 


Hs.10888 


418151 


AA864238 


Hs.83583 


414799 


AI752416 


Hs.77326 


412025 


AI827451 


Hs.24143 


445055 


SE512856 


Hs.109051 


412490 


AW803564 


HS.2888S0 


408437 


AW957744 


H3^76469 


430542 


AIS57486 


H5.119122 


424670 


W51215 


Hs.116651 


432409 


AA806538 


Hs.130732 


425945 


AW410669 


Hs.164280 


425996 


W67330 




449961 


AW26S634 


Hs.133100 


435522 


N64214 


Hs.9774 


407179 


AA20&465 




412623 


R28898 


Hs.74170 


429978 


AA249027 




450377 


AB033091 




418S09 


A8028624 


H3.85539 


440869 


NKL014297 


Hs7486 


417144 


AA382104 


Hs.81337 


451106 


BE382701 


Hs,25960 


426552 


BE297660 


Hs.170328 


433423 


BE407127 


Hs.8997 


442622 


NM^000435 


Hs.8546 


407951 


W77762 


HS79015 


449944 


AF290512 


Hs.58215 


423873 


BEW3054 


Hs.1695 


435055 


AW023337 


Hs.5422 


406743 


AA911568 


Hs.279860 


422627 


BE338857 


Hs.1 18787 


420676 


AI434780 


Hs.4248 



rfbosomal protein 110 


23.66 


Inhibitor of DMA binding 3. dominant neg 


23.60 


SQJTB dofn^, fnununogtobutin domdn {Ig), 


23.60 


nuclear factor of kaf^ Dght polypeplid 


22,35 


laininln, beta 1 


21.60 


Homo sapiens cONA FU10934 fis, doneOV 


21.55 


v-ert)-b2 avian eiythroblaslic teukemia v 


21.50 


tyrosine 3^nonooxygenase/tryptophan &fln 


21.50 


platetet/endolheDa) cell adhesion molac 


20.90 


desmQpl3kin(DPI, OP{0 


2a80 


cytochrome c oxidase subunit VHb 


20.30 


LPS^nduced TNF-a!pha factor 


19.45 


NM_004541:Homo sapiens NAOH dehydrogenas 


19.30 


major histocompaSblli^ oomptex, class 


19J0 


vascular etKlothellal growth factor 


17.45 


A idnase (PRKA) anchor protein 2 


17.40 


MyoD family inhQjRor 


17.20 


chitinase 3^ 1 (carriage glyoofHOte 


16.90 


glycoprotein M66 


i6!eo 


major histocompaiibility complex, class 


16.75 


zinc finger pretBin 6 (CMPX1) 


16.25 


ribosomal protein 84. XMei 


16.00 


RIKEN cDNA 9130422N19gene 


15.85 


pim-1 oncogene 


15.60 


adrenomedutlin 


15.51 


edonucleolide pyiophosphatase/phosphodi 


15.33 


immedlats early response 3 


15.30 


nbosomal protein LlOa 


15.10 


transmembrane, pn>state androgen Induced 


15.00 


ribosomal protein 122 


14.92 


DKFZp564J157 protein 


14.86 


protein C receptor, endothelial (EPCR) 


14.70 


butyrate response lactor 2 (EGF-response 


14.60 


fenilin. light polypeptide 


14,57 


endosulfine alpha 


14.57 


KIAA0792 gene product 


14.25 


activated leucocyte ceD adhesion mdecu 


14.05 


Homo sapiens cDNA: FU21897 lis, done H 


14.00 


Homo sapiens done 25028 mRNA sequence 


14.00 


hypothefical protein M6C4248 


13.95 


immediate early protein 


13.85 


small inducible cytokine subfamily A (Cy 


13.60 


K1AA1628 protein 


13.60 


KiAA1676 


13.40 


Mels (mouse) homolog 2 


13,10 


MAD (mothers against decapentapleglc, Dr 


13.00 


ESTs, Weakly similar to 138022 hypotheO 


13.00 


ghitaredoxin (tMoltransferase) 


13,00 


PNAS-123 


12.90 


annexin A1 


12.89 


K1AA0468 gene product 


12.48 


hypothetical protein RJ21709 


12.45 


actln related protdn 2/3 complex, subun 


12.40 


bisulbvlike growth lactor k)Indlng prote 


12.19 


Wiskott-Aldrich syndrome protein tnterac 


12.12 


SH3 domain binding glutantic ackl-rich pr 


12.07 


Homo sapiens cDNA: FU2252B lis, ctone H 


11.90 


lacrimal prolina rich prakln 


11.60 


fitxjsomd prot^ LI 3a 


11.51 


epitheRal V-like antigen 1 


11.50 


KlAA1575pmleln 


11,50 


solute carrier family 25 (mitochondrial 


11.44 


hypothetical protein AL1 101 15 


11,40 


ESTs 


11^40 


synovid sarcoma transkicaGon gene on c 


11.25 


Ihynusln, beta 4, X chromosome 


11^21 


metalbthtondn IE (ftjncQonal) 


11,10 


ribosomal protein 86 


1l!lO 


K1AA1265 protein 


11.10 


ATP synthase, H tansporting, mllochondr 


11.10 


proton expressed h thyroid 


1100 


Iec6n. galactoside4)inding. sohibie, 9 


11.00 


N-MYC oncogene 


10.93 


moesin 


10.91 


heat shock TOkO protein 1A 


10.90 


Notch (Drosophiia) homolog 3 


10.80 


anOgen Uentllied by monoclonal anObod 


10.80 


Homo sapiens, Similar to rhotekin, cfone 


10.75 


matrix metaDoproteinase 1 2 (macrophage 


10.70 


glycoprotein M6B 


10.70 


tumor protein. translaOonally-controIlB 


10.70 


transforming growth factor, lietaMuced 


10.65 


vav 2 oncogene 


10.60 
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S.SO 


ritx>somat protein 823 


5.50 


straOfin 


5.42 


Homo sapiens cDNA FU13267 (is. done OV 


5.40 


nldogen 2 


5.40 


K1AA0746 protein 


540 


hypothetfcal protein FU1125g 


5.40 


RAP2A. memtwr of RAS oncogene family 


5.40 


Homo sapiens cONA: FU22130 (b. done H 


5.40 


gbx}g82a10.sl NCI_C6AP.Ov8HOmo sapiens 


5.39 


6CL2/adenov{n]s E1B igkUnterBcflng pro 


5.36 


meningioma expressed antigen 5 (hyaluron 


5.34 


collagen, type IX, alpha 3 


5.34 


caOiepsin S 


&32 


junBprotoonoogene 


5.30 


ESTs 


5.30 


keratin 18 


5.30 


odlagen, type V, alpha 1 


6.27 


ESTs. Moderately 8lndlartoZ195_HUMANZ 


5.27 


gdactoskJase, alpha 


5.26 


fierritln, heavy polypeptide 1 


a26 


uncharactertzed hernatopoieOc stem/jproge 


5.24 


acOnin, alpha 4 


5.23 


hypothetica) protein FLI10833 


5.22 


Homo sapiens mRNA: cDNA OKFZp434G227 (fr 


5.20 


pre-B-oeQ cdony-enhandng fiador 


5.20 


molecule possessing ankyrfn repeats Indu 


5.20 


hypotheOcal protein FU10902 


5.20 


eufcaryolic translalion etongaikm tetor 


5.20 
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5 

10 

15 

20 

25 

30 

35 
40 
- 45- 
50 
55 
60 
65 
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75 
80 



409SO4 
432561 
430556 
431315 



AW444448 
AU0764S5 
AW967807 
AVW72227 



425190 AW028302 
454227 AW983897 



4»357 

452191 

419073 

442295 

443030 

421B78 

424875 

427641 

442806 

442495 

439941 

444961 

452472 

418117 

431824 

417558 

410570 

431805 

430333 

428494 

419541 

443834 

452852 

444621 

422200 

416003 



AB007e67 
AU076408 



Hs.49124 

HSJ78441 

Hs.13797 

Hs.163986 

Hs.155079 

Hs,44743 

Hs^78311 

Hs.28309 



AW372170 H3.183918 
AI827248 Hs.224398 
Ks.9236 
H5.111498 
Hs,199310 
Hs.146116 
Hs.149991 



AA299652 
Ai187945 
AI270591 
AW294522 
AI184717 
At392640 
AWB55398 
AW957300 
A)922013 
AW972B42 
AF045229 
AI133096 



Hs.18272 
Hs.12210 
Hs.294142 
Hs.83496 

HS.822B0 
Hs.64593 



KML014053 Hs.270594 
S70114 Hs;239469 



AA233439 
BE170548 

H73972 

AK001972 

AA298065 

AA080895 

X98001 



425428 AL110261 



Hs.164634 

Hs.118190 

Hs.1 34460 

H5.30822 

Hs.11465 

Hs.347969 

Hs.78948 

H5.157211 



452063 


R53185 


Hs.32366 


414821 




Hs.77424 


429556 


AI391454 


Hs 207251 




X92715 


HS.30S7 


433162 


A)025842 




406797 


AI432224 




412347 


AW970026 


Hs.73818 


422392 


NHL005908 


H5.115945 


447197 


R36075 




447832 


AI433357 




420932 


AW374605 


Hs.11607 


-413593 


-AA205248 - 




451658 


AW195351 


Hs,250520 


441224 


AU076964 


Hs,7753 


408067 


BE244580 


Hs.342307 


424971 


AA479005 


H3.1S4036 


417308 


H60720 


Hs.81892 


438763 


Ai583207 


Hs.99029 


418113 


AI272141 


Hs.83484 


414191 


AW2S0089 


Hs.75807 


421748 


NM-01471B 


Hs.107609 


427486 


AA974433 




424263 


M77640 


Hs.1757 


406867 


AA157857 


Hs.182265 


449378 


AW664026 


Hs.59892 


427202 


BE272922 


Hs.173936 


416945 


BE246762 


Hs.89499 


448966 


AW372914 


Hs.86149 


426711 


AA383471 


H5.343800 


433681 


AI0O4377 


Hs.200360 


441321 


H17182 


Hs.7771 


448896 


AL157484 


Hs.22483 


447232 


AW499834 


Hs.327 


449317 


AW293413 


Hs.132906 


436372 


AW972301 


HS.3102B6 


422082 


AA016188 


Hs.111244 


432465 


DS6165 


Hs.275163 


417924 


AU077231 


Hs.82932 


446659 


A1335361 


Hs.226376 


414829 


AA321568 


Hs.77438 


418036 


Z37976 


Hs.83337 


417677 


NM_016055 


H8.82389 


443559 


At076765 


Hs.269899 


423766 


AA303799 


Hs.300141 


432407 


AA221036 


Hs.13273 


453485 


BE620712 


Hs.33026 


452973 


H88409 


Hs.40527 


427816 


AA1S9248 


Hs.180909 


406794 


AI890243 




449475 


AI348027 


HS.1085S7 



ESTs 5.2D 

KIAA0015 gene product 5.16 

ESTs 5.16 

Homo sapiens cONA: FU22765 8s. done K 5.16 

prot^ phosphatase 2. regiMwysubwd 5.15 

KIAA1435protdn 5.15 

ptexinBI 5-12 

UDP-ghicose dehydrogenase 5.11 

Homo sapiens cONAaJ12797fis, done NT 5.11 

Homo sapiens cDNA FU11469 fis. done HE 5.10 

hypothetical protein FU23516 5.10 

Homo88pienscDNARJ11643(ls.dQneHE 5.10 

ESTs 5.10 

ESTs 5.10 

ESTs 5.10 

ESTs 5.10 

amino add transporter system A1 5.10 

hypolhelica) protein FU13732 similar to a09 

ESTs. Weakly similar to C55663cl^odend 5.07 

linker for activation of Testis 5.06 

gb:EST364937 MAGE resequences. MAGL Homo 5.06 

legtdator of &prQte!n signaling 10 5.06 

ATP synthase, H transporting, mitochondr 5.03 

aVCR protein 5.00 

TIA1 cytotoxic granule-assodated RNA-tri 5.00 

hypotheOcd protein RJ20005 5.00 

Komo8apienscDNA:FU210811!s.doneC 5.C0 

ESTs 5.00 

hypothetk:ai protein FU11110 5.00 

glutathione-S'transtieraselike;glutath) 4.97 

cytochrome c oxidase subunit IV 4.95 

Rab geranytgeranyttransferase, beta subu 4.94 

DKFZP586B0621 protrin 4.94 

ESTs. Weakly simitar to TWST.HUMAN TWIST 4.93 

Fc fragment of IgG, high affinity la. re 4.93 

nucleolar auloantigen (55kO) similar to 4.92 

zinc finger protein 74 (Cos52) 4.82 

ESTs 4,92 

ribosomal protein L6 4.91 

uWqulnokytochrome c reductase hbige p 4,90 

mannosidase, beta A, lysosomal 4,90 

gb7h88b01.8l Scares placenta Nb2HP Homo 4.90 

ESTs 4.90 

ESTs, Weakly slmilartoT21697 hypolheli 4.90 

-gb2q78c12.r1StrBtagenehNT-neuron(937 - —4.90 

EST8.Moderately8lmilartol3802ahypol 4.90 

cahimenin 4.90 

hypotheQcdprot^FU10330 4.90 

tumor suppressing subtransfBrablecandU 4.90 

KlAAOIOIgeneprodiwt 4.90 

CCAAT/enhancer binding protein (C/EBP), 4.89 

SRY (sex determining regtonY>box 4 4.69 

PDZ and UM domain 1 (etfin) 4.88 

K]AA0726geneprodud 4.87 

fibroblast growth factor 4 (heparin seer 4.88 

LI cell adhe^n molecule (hydrocephalus 4.82 

keratin 19 4.81 

ESTs 4.81 

interteuUn 10 receptor, beta 4.80 

arad^donate5-llpQxygen3se 4.80 

phosphdnositDl 3-plwsphale^ing prol 4.80 

conserved gene ampnfied In osteosarcoma 4.80 

Homo sapiens cONA FU 13027 fis. done HI 4.79 

B-ceO assodaled protein 4.75 

Homo sapiens mi^ dMA OKFZp762M127 (tr 475 

interieukln 10 receptor, alpha 4.73 

19A24 protein 4.73 

ESTs 4.71 

hypothetkal protein 4.70 

non^netastaOc cells 2, protein (NM236) 4.70 

cydin 01 (PRA01: parathyroid adenomatos 4.70 

ESTs 4.70 

pledtstrin 4.70 

latent tiansfonning growth factor beta b 4.70 

CGi-110 protein 4.70 

ESTs. Moderately similar to AIU8JHUMAN A 4.70 

ribosomd)»^elnL39 4.70 

gb2r03f 1 2.r 1 Stratagene NT2 neuronal pr 4.69 

hypothetical protein PP2447 4.67 

ESTs 4.67 

perux l ie d o xi n 1 4.67 

ribosomal protebiL6 4.66 

hypothetical protein PP1057 4.65 
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427157 U51166 K5.173824 thymine^ g^ycosylase 4.64 

425410 AA310974 Hs.156828 Homo sapiens cONARJ10522f)S. done NT 4.62 

407874 AI76&311 Hs.289047 Homo8aptenscONAFU14059fis,donaKE 4.61 

- 417535 AA2035G9 Hs.191482 ESTs 4.61 

5 421818 AWS92g76 Hs^038 NM_002489iiQmo sapiens NADHdehydrogenas 4.61 

408491 AI0880&3 K5.7682 ESTs 4.60 

428398 AI249388 H5.98SS8 ESTs 4.60 

410295 AA741357 fddogen (enactin) 4.60 

. ^ 407198 H91679 gbYv04307.s1 Soares fetal Bverspteen 4.60 

10 440327 R12581 Hs.igi146 ESTs 4.60 

417353 AA375752 Hs.348140 Homo sapiens mRNA; cONA OKFZp586F1B22 (f 4.60 

426141 005686 Hs.293972 ESTs 4.60 

412922 M60721 Hs.74870 H2.0(DrDSoph!iaHitehomeob(Kl 4.59 

434449 AW953484 Hs.3849 hypdheflcai protein RJ22041 similar to 4.58 

IS 416987 086957 Hs.60712 KIAA0202 protein 4.57 

418883 BE387036 H8.1211 acid phosphataso 5. tartrale resistant 4.57 

416581 H66276 Hs.108268 ESTs 4.56 

440983 M20681 Hs.7594 solute canter My 2 (fadiitaladglu 4.56 

450988 BE618571 Hs.429 ATP synthase, H transporting. mRochondr 4.56 

20 413663 BE247585 Hs.75462 BTGfamUy. member 2 4.55 

410342 R31350 Hs.743 Ixtragment of IgE. high affinity I. rec 4.54 

424321 W74048 Hs.1765 iymphocytMpedricpfQlein tyrosine Kin 4.53 

453330 AI268081 H5.342389 peptidytpioiyllsomeraseA(cyc(ophifo 4.52 

417750 Ai267720 Ks.250523 synovia) sanxmtranslocaled to X duo 4.51 

25 440774 AI420611 H5.153934 ESTs ^ 4.51 

422693 BE300073 Hs.279860 tumor protein, translaiionally-controlla ' 4^1 

411125 AA151647 Hs.68877 cytochrome b-245. alpha polypeptide 4.51 

449267 AI638640 Hs.220624 ESTs 4.51 

432606 NM.002104 Hs.3086 granzyme K (serine protease, granzyme 3; 4.50 

30 432706 NM.013230 Hs.286124 C024 antigen (small cell tung carcinoma 4.50 

436823 AW749865 Hs.117077 ESTs. Weakly similar to 138022 hypotheU 4.50 

437489 AW753112 H5.15514 hypothetical protein MGC3260 4.50 

413703 BE158360 gb:PM1-HT0383-1312gM01-hD8HT0383Homo 4.50 

415526 N76536 H5.265591 ESTs. Weakly similar to ALUIJtUMAN ALUS 4.50 

35 406623 X6g392 Hs.91379 ribosoma) protein 1^ 4.49 

456642 AW451623 Hs.109752 putative c-MycHesponsive 4.47 

406653 AAS74074 Ks.77961 m^rhistocompatibliity complex, dass 4.47 

408307 AI761786 Hs.204574 ESTs 4.46 

428297 AA236291 Hs.1 83583 serine (or cysteine) proteinase tnhibito 4.46 

40 447296 AW243614 Hs.18063 Homo sapiens cDNA FU10768 fis. done MT 4.45 

416801 X98834 Ks.79971 sal (DrosophHaHOce 2 4.45 

406870 AA075144 gb'2m86f06^1 Stratagene ovarian cancer 4.45 

446291 BE3977S3 Hs.14623 Intorferan. gamm»todiicB)Ie protein 30 444 

416297 AA157634 Hs.79172 solute carrier farrily 25 (mOochondrfal 4.44 

45 - 428773 -BE256238 - Hs.-193163 — bridging integratoj-l ■ 4^- 

427640 AF058293 Hs.180015 [><)opachn3me tautomemse 4.43 

437223 C15105 Hs.330716 Homo sapiens cDNAFU14368fis. done HE 4.43 

412265 AA101325 Hs.86154 hypothetical piotdn FUt2457 4.43 

442232 A1357813 Hs.337460 ESTs. Weakly a'mHv to A47582a«ellgr 4.42 

50 441612 AI802829 Hs.1 13660 Homo sapiens cDNA aJ11631 fis. done HE 4.41 

424866 A1568170 Hs.98886 ESTs 4.41 

408380 AF123050 H$.44532 diubiquitin 4.40 

411960 R77776 Hs.16103 ESTs 4.40 

428782 X12830 Hs.193400 interleukln 6 receptor 4.40 

55 408360 AI806090 Hs.44344 hypothetk:al protein FU20534 4.40 

456629 AWB91965 histone deacetylase 3 4.40 

414416 AW409985 Hs.76084 hypothetical protein M6C2721 4.40 

422499 At268866 Hs.19631 ESTs. Weakly similar to 138022 hypotheU 4.39 

^. 414219 W20010 Hs.75623 ALLI-fiised gene from dvomosomelq 4.39 

60 427779 AA906997 Hs.180780 TERA protein 4.38 

422340 AW298219 HS.11532S RAB7, member f^S oncogene tamily^ike 1 4.37 

413276 Z24725 Hs.75260 mitogen indudbte 2 4.36 

452651 AI218918 Hs.30209 KIAA0854 protebi 4.35 

^_ 453467 A1535997 Hs.30089 ESTs 4.35 

65 435961 BE293127 Hs.283722 GHI protein 4.35 

415691 AW963979 Hs,24723 ESTs 4.34 

435968 AW161481 H5.111577 integral membrane protein 3 4.34 

420099 D80011 Ks.g5140 KlAA0189geneprodud 4.33 

_^ i 421522 R48681 H3.102991 hypothetfcal protein FU13956 4.33 

70 457073 AA233210 Hs.179943 (fbosomal protein L11 4.31 

427337 Z46223 Hs.176663 FcfragmenlongG, tow affinity lin».f 4.31 

420732 AA769133 Hs.63525 ESTs 4.30 

432731 93\m Ks.267820 fanonectlnl 4.30 

437275 AW976035 Hs.292396 ESTs. Weakly simflar to A47S82B^llgT 4.30 

75 408784 AW971350 Hs.63386 ESTs 4.30 

441962 AW972542 H$.269008 Homo sapiens eONA: FU21814 lis, done H 4.30 

419926 AW900992 Hs.93796 DKFZP566D2223 protein 4.30 

429058 AF1388&3 Hs.35254 hypothetk:al protein FLB6421 4.30 

434963 AW974957 Hs.288719 Homo sapiens cONAFU12142 fis, done MA 4.30 

SO 413677 AW503116 Hs.301619 dnc finger protein 146 4.29 

428970 BE276891 Hs.194691 re&idc add induced 3 4.28 

408896 AI610447 Hs.48778 niban protein 4.26 

433550 AAS890S1 Hs.177376 ESTs 4.26 
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406230 
435655 
448717 
437366 
4167S9 
447341 
410423 
409453 
428453 
444681 
416072 
450937 
447595 
418452 
430594 
425259 
431560 
439403 
409245 
437296 
406877 
419652 
406661 
452432 
448782 
407110 
422960 
432841 
415857 
420298 
419726 
426075 
430255 
418699 
443035 
457415 
412220 
427509 
444633 
441384 
431956 
422310 
439815 
417930 
418456 
424464 
417035 
412627 
414890 
452248 
450887 
444224 
451351 
407792 
439864 
408745 
409132 
410597 
409485 
426398 
417777 
446979 
416000 
426647 
436394 
409956 
414502 
446013 
452404 
444736 
438590 
451638 
449632 



AW10S663 
R67419 

W524S2 

AK000978 

AF106941 

AW402432 

AI885516 

AB011110 

AJ243937 

ALl 10370 

R49131 

AW379130 

BE379749 

AK000790 

AL0492eO 

BE244135 

BE265745 

AA361037 

AA350994 

AA226392 

AL157485 

X66975 

AW206006 

AL050295 

AA018042 

AW890487 

M93425 

AAfi66115 

AI199510 

U50330 

AW513691 

AK000703 



Z45622 

AKOOOOlO 

BE350058 

M62505 

AF111713 

AA447649 

X63629 

AA316622 

AA206079 

H81136 

AA332941 

R68537 

AA1924S5 

BE3gi959 

6E28109S 

AA093668 

AA011518 

AV648599 

AW058261 

AI077715 

AI720078 

AW936356 

AJ224538 

W165ie 

S80990 

BE256390 

AI823763 

AI654443 

R82342 

AA243464 

AA531187 

AW103364 

AW830088 

AI360167 

AW450675 

AA533491 

AAB11465 

AW00S866 

AA694264 



Target Exon 

Hs.6947 HSPC069 protein 

Hs.21851 Homo sapiens cDNA FU12900 Gs. ckme KT 

ribosomal protein L10 

H5.79741 hypothetical protein FU10116 

Hs.18142 8nBStin.bela2 

H5.63489 prote&i (yroshe phosphatase. noiHecept 

Hs.95612 ESTs 

HS.1B4367 GTPase activating proleln^ike 

Hs.268316 chromosome 6 open reading firama 9 

Hs.79000 growth associated protein 43 

H3.26267 ATP-dependanI interferon response protd 

HS.189S3 phosphodiesterase 9A 

Hs.65201 C4ype (calchjm dependent, cartfohydrate- 

Hs.246885 hypothetical protein FIJ20783 

Hs.155397 Homo sapiens mRNA: cONA OKFZp564K143 (fr 

Hs.260238 hypotheficai protein FIJ10842 

ESTs. WeaUy similar to AUiC_KUIi4AN I'll 

IRNA isopentenytpyniphosphatB transferas 

Hs.20281 KIAA1700 

Hs.179943 rilwsomal protein L11 

Hs.91973 hypotheticsl protein 

Hs.172550 poiypyrtmidine tract Imtding protein {he 

Hs^76 Homo8aplenscDNA*FU21778(i8,c)oneH 

K1AA0758 protein 

Hs,252085 Prader-Wnii/Angelman syndrome-5 

cadherin 13. l+«adhertn (heat) 

Hs.62 protein tyrosine phosphatase, non-recept 

Hs.127797 Homo8aplenscONAFU11381 fi8,cloneHE 

Hs.267912 ESTs. Wealdy8imBartoALU7JiUIWAN ALU S 

Hs.1 274 Inne morphcflenefic protein 1 

Hs.2701 49 ESTs, Wealdy similar to 2109260A B cell 

Hs.323822 Homo sapiens mRNA for KIAA1551 protein, 

Hs.173030 ESTs, Weakly similar to ALU8_HUMAN ALU S 

Hs.8906 Homo sapiens done 24889 ml^ sequence 

Hs.258798 hypothetic^ protein FU20003 

Hs,36787 chromodomam hellcase DMA trfnding protei 

Hs.21 61 complement component 5 receptor 1 (C6a I 

Hs.286218 juncfona) adhesion molecule 1 

Hs^88660 Homo sapiens cONA: FU22182 fis. done H 

Hs.2877 cadherin 3. type 1, Podherin (placenta 

Hs.98370 cytochrome P450, subtamily IIS, potypept 

Hs.6693 hypothetical protein FU20420 

Hs.334604 Homo sa;^ mSm for KIAA1870 proteh, 

H5.85226 Epase A, lysosomal add. chdesteroi es 

Hs.17962 ESTs 

H5^8 Homo sapiens done IMAGE:451939. mRNA se 

Hs.74276 chloride Intracellular channel 1 

Hs.77573 uridbe phosphotylase 

H5^S78 musdet)Iind(DrosophilaHilce 

H5.271778 E5r8,WeaUy8imliartoi38022hypo»efi 

Hs.199438 ESTs 

Hs.321435 ESTs, WeaWy similar to ALU1_HUMAN ALU S 

Hs.39384 putative secreted ligand homologous to f 

Hs^1997 ESTs. Weakly similar to A47582 &<»D gr 

Hs.300925 EST8.WeaUy8im3artoA46010X-linked 

K5^732 protein kina$8,AIMP-adhratBd.beta2no 

Hs^9518 amytold t)eta (A4) precwsor-flke protein 

Ks^52136 ficoHnMlaganffibrinogendomdn^t 

Hs.169716 caiponin2 

HS.70S5 EST8.WeaUy8imilartoi788858erineAh 

Hs.197683 ESTs 

Hs.79856 ESTs,Weakly8lmnartoS65657alpha.1C- 

Hs^lOI pre-B^I leukemia transcriptton factor 

Hs.126705 ESTs 

Hs.727 inhIbin.betaA(adiv{nA,activ!nABa 

Hs.76550 Homo sai^ens mRNA; d)NADKFZp56481264(f 

Hs.152774 ESTs 

Hs^12709 ESTs 

Hs^17 hypothetic^ protein FU14681 

K5.123375 ESTs 

Hs.193969 ESTs 

Hs.60049 ESTs 



4^5 
4^ 
4^4 
4.24 
4.23 
4.22 
4.22 
4.22 
422 
4.21 
4.20 
4.20 
4.20 
4^ 
4.20 
4.20 
4.20 
4.20 
4.18 
4.17 
4.17 
4.15 
4.15 
4.14 
4.14 
4.14 
4.13 
4.12 
4.11 
4.11 
4.11 
4.10 
4.10 
4.10 
4.10 
4.10 
4.10 
4.10 
4.10 
4.09 
4.09 
4.08 
4.07 
4.08 
4.08 
4.06 
4.06 
4.08 
4.05 
4.05 
4.05 
4.05 
4.04 
4.04 
4.04 
4.03 
4.01 
4.01 
4.01 
4.01 
4.01 
4.00 
4.00 
4.00 
4.00 
4.00 
4.00 
4.00 
4.00 
4.00 
4.00 
4.00 
4.00 



TABLE 55B 

Pl«y: Unique Eos prot)esetMenliriernuml)er 
CAT number Gene cluster number 
Accession: Genbanlt accession numbers 



Pkey 



CAT Number Aooession 
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413787 761i.l BC003047 S80794 NM.0O34O5 X7ai38 AY0O7i32 L08439 AW340648 AW1 31 665 AJ082748 AI470204 BI71 1 078 BF350700 BI496963 AI087141 

AA720664 AA862331 AA605146 BM313650 At089749 A1359738 N69ta7 AW995424 AI086917 AI063995 AW340217 N99662 AI629449 AI089839 
A1608761 A1342365 Altg9076 AA908944 AI248943 AU160053 AI191245 A!2t8477 A1077943 AA864930 AI310394 AA872478 AI279782 W61343 
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N21313 6F6746tO H02874 AW975323 R16904 AA328030 AA054671 R79546 BF832310 AI249109 

413703 376077J BG015794 BE158357 BE1S8353 BE15835B BE158360 BE158352 BE158351 BE1S8355 

408870 0.0 AA075144 

458629 207.22 AW891965 AW604749 BE080872 R15559 BE177623 AW883520 AW945343 AI246167 T07082 AW80S679 W96278 AA135796 W32615 AW995418 

AW801688BE003837 AW801621 AW385721 AW385742AVV385714AVVB04757VV87409AVV604738AW385757AW580796AVV801247BEM^^ 
BE003183 AA8471 12 AW580975 AW604760 AW385727 BE164590 BE003090 AW382791 AW604759 AW886589 AW60475B ^M4337 AI378548 
AW890438 AAOni72 AI288883 AA229639 AA091945 AW945454 AA063829 AA702504 AW861 938 AW83481 6 AW580841 AA094372 T06399 
AVV88568eBE244086BE005035AVV861913AA551773AW858460AW370926AW54352AVV889^^^ 

AA248197 AW381373AW177325AVVB0687aAA93S217BED67498BEOa3742BE067470AVV894935BEO^ BE179917 BE002200 

AW607506 AW392889 AW894560AW381360AI904206AW863533 C00609 AW381372BE082530AW898120 BE075323 AW392799 AWS01420 
AI695314 BE083790 AW858558 AW945550 BE177153 AW970506 BE350419 AI906919 AW360794 AI906917 AW885979 AW794240 AW945566 
AI688683 AI688694 AW009660 AW601421 AW350793 BE06B524 BE083901 AW369847 AW381871 AW935435 AW5645S2 AW877775 AW838449 
BE1B0466AWB58501 BE180484AI371163AA778231 AI174991 BE011720AW877776AW877BOOAW877795T19900AW85636SAW8980g9 
BE011715BE167842BE011718BE011724AW363639AVW8658AW878662AV«94887BE082356AVV3892t1AW^ 
AVV610318AW996909AVVB10296AVV901923AVV880003AI762171Am^ 

AW842095 AI243049 AW902074 AW062592 AW176684 AW751 692 BE087703 A1907439 BE009686 BE1721 1 5 BE077030 AW608556 AW835577 
AI909628 BE077029 AW175241 BE077552 BE160370 BE1 602TO AW835658 AW606765 AW506770 AW835678 AW606758 AW606778 AI907484 
BE172821 AW606768 AW999517 AWB44165 BE171738 AW751683 AW610493 BE177484 BE177487 AA090510 AW844117 BE173387 AW999878 
AI124870 BE163472 AW841B23 AW3ra762 AW893297 AI290296 BE089132 AA610287 AW176676 AW6D7622 BE172639 AW893232 AA329629 
BE089008 BE17B350 BE178214 BE063291 AW820236 AW999653 BE089486 BE173126 BE171775 BE1857B7 AA5582B0 AI174840 AW999112 
BE218391 BE172734 BE178021 BE172738 BE173324 AW603494 AL036722 R38192 R60905 H53721 H41052 AL037917 R37795 AW99B972 
AA767189 AW044272 H50689 AA768399 AA767764 AI087888 H44202 BE222792 N90597 W81396 N90615 AI935353 BE501168 F10945 
AW118215 AI970480 AI627641 AW236081 AA574090 AI627652 A16B1913 AI759983 N69591 N69276 BE467722 AW392780 BE172467 H92861 
AI524921 F02989 Z39328 F027D5 F01414 T88678 A1215165 H87220 AW374781 

437386 5541.2 BM476605 B!5450D4 BI834636 BG112453 BI199049 BG112759 AA149846 H97925 AA306121 AA313204 V«2451 AI734997 AA931168 AA429766 

N47913AA584321 BF940241 AI083648 AI089410 AI347705 AI343661 AI186232AI889031 N98464AI820039AI459034AV6S2512AA622990 
BE857200 AA932998 AA740573 AI826264 AA865683 AI3445S0 AI027349 AI056087 AA442777 AA603724 AA873347 AI056717 AI092185 A)032895 
AA535589 BF806025 BF806061 BF805985 BF746099 BF746097 AI309259 AI597603 BF806066 AI090653 AI12920S AI248410 H72993 AW615341 
BF80S990BF805982AAg93819T34373 T35604 H56242AA648145T35907BF808691 N94015AV703438BG774276 H82341 R76371 

439403 4937J0 AK056692 AF086220 AI375066 AA2B4293 W32566 AW797961 AA960897 AA504145 

409245 3199.2 AF0302348C017465BG008526AW505550BM460141 N47324 AA361 037 AA321632 N45B06AV7S2798AV6571 16 AA29e832AU 137857 

AW467027AI742080AI624350 H58206AA478S18AW439997AW393555AW393523AI559753AI808732R568 

AW980845 BKM66252 AW956813 BE768647 AV658853 BM055248 BF372070 BF372055 BF372051 AA347852 AA905863 BG505078 AV654024 
BF093291 AW021929 H22650 AA459715 BG496341 BE697763 BI254209 BG499543 H42946 BI059780 BI086741 H87B96 H87599 BF691752 
BE768511 BG940948 W37195BF372041 BE883796BF372082BF367329BF909744 AW986003AV71 4014 BI49286B 81495144 AA921845 
AI693426 AI652147 A)435449 N47325 AI434429 AA573137 AI1B3429 AI82g962 AI332526 6F513937 AI189S61 Ai22ig62 AI378034 AW1 16897 
AW6e5247 AW340077 N41605 AA478519 AA463875 At858260 AA483379 A1292305 BE045947 AA971089 All 25820 BG940947 A)060245 
AA884954 AI125702 AI382934 AA931835 AI358631 AW439905 AI027833 AI39964B AI014533 AA347851 AA738261 N67374 N69081 AI768667 
AA948472 AI819214 AA293133 At186725 AA8a9214 AI222635 B1495143 N29805 N48812 AA769041 AI492769 DSSHI AA095911 BE222062 
D56772 AW372265 BM054985 012465 BG534562 AW00351 1 H87486 H42880 AW1 90293 BFS94697 BF37761 1 H22043 BI255749 81492848 
H16217H21980 H22651 H88179H87354 K44052 H25165 H44128 

448782 34980.1 AB018301 AL050295 BF513128 AW385080 AL551708 AI352542 AIB29703 AI81938B AW629019 AW073189 AW273B57 AW118768 AI453845 

AI452494 AA886341 AI057144 AA904647AI423547 AW263913 AI094774AI434419A)039546 AHJ02491 A124O412Z25099AA995178AW050649 
AW026140AI796309 AI584012BE166666Ar767991 AI309041 AA724059 A1695284 AI245095 T63971 Z40627 BEta6681 BG570071 BF921915 
BI5627O2BG5065O2AV658086 R48378AA121543Al0g6938AA61813t H40993 R48277 A1352281 BG540263 6GS38901 N95226A1356752 
At221152 Z2B777 R16574 AW966449 AA044116 AW797518 81010405 AA044288 AI093508 8E1401 69 764039 BG433106 AW1 30367 AW1 30361 
N73937 AA127680 AW044037 AI096437 AA384077 BFg41499 TS3764 BG003285 

422960 11862J BRB0249 AW9S4847 AW890487 AI30S236 D60845 D80537 AA825429 W36294 AW890410 AW088235 BF740240AA448709 AI350279 AA879119 
AA319510 BE702077 8E69901S BE702046 AW901293 T99319 D81706 BF475488 060383 DB1751 BE699260 

TABLE 5SC 

Pkey: Unique number conesponding to an Eos probese! 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gi) numbers. "Dunham, el d.* lefeis to the pubUcation enQHed The DMA seauenoe of 

human chromosome 22" Dunham, et al. (1 999) Nature 402:489495. 
Strand: Indkates DMA strand from which exons were predicted. 
NLposIfion: indicates nucleotide positions of predicted exons. 

PIcey Ref Strand NtjJosKion 

401466 6682292 Plus 28748-29023 

401192 9719502 Minus 69559-70101 

402474 7547175 mma 53526^28.55755-55920.5753^57757 

402145 6018280 Plus 113086-114800 

406230 4760409 Pkis 71716-72515 



TABLE 56A: 

Pksy: Unique Eos probesel Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelO: Unigene number 

Unigene TllteUr^ene gene title 

R1: Ra0o of semlnomaloustesficutar cancer oornpaied to normal aduUtesficute 

Pkey ExAccn UnlgenelD UnigeneTllle R1 

414438 AI879277 Hs.76138 (Moredoxbi 47J0 
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424247 X14008 Hs^34734 lysozyme (renal amyMdosb) 44.B0 

438091 AW373062 midear receptor subiarnlyl, group I. ffl 40.10 

412948 BE243313 Hs,334851 UM and SH3 protein 1 34.90 

_ 417088 M54915 Hs.81170 plm-1 oncogene 31.10 

5 430542 AI557486 Hs.119122 ribosomalp(Oteinl13a 29.60 

412915 AWD87727 H5.74B23 NM.004541:HQmos^]iensNAOHdehydrDgen85 29.10 

418174 120688 Hs.83656 Rho GDP dissociation inNbRor (GDI) bet 28.15 

406820 At223958 Hs.108124 rteomal protein S4. X-finkad 28.13 

433800 AI034361 Hs.135150 hii« type^cenmembrane^ssodatedgiy 28.10 

10 406658 AI920965 Ks.77g61 ms^ histoconipatibllily complex, dass 27.65 

416680 AW24S540 Hs.79516 brain abundatd. memtirane attached signai Z7.70 

446525 AW967069 HS.2115S6 hypolheScal protein MGC5487 27.20 

432359 AA076049 Hs.274415 Homo sapiens cONAFUl0229fis. done HE 26.30 

422578 AF239666 Hs.1545 caudal type homeo box transciipfan fart 25.80 

15 429978 AA24g027 (ibosomal protein S6 25.40 

418870 AF147204 Hs.89414 d»ni(^((>X«mota),iec8plDr4(lis 24.60 

440207 AI371978 Ks.128326 ESTe 24.50 

425543 R23313 Ks.334895 rtbosoma) protein LI Oa 24.30 

442562 BE379584 ddldiyWiphosphooBgosaocharide^jrotd 24.10 

20 444562 AA186715 H5.336429 RIKENcDNA9130422N19gene 24.05 

413063 AL035737 Hs.75184 ddtlnaseS-Ilkel (cartilage glycoprote 24.00 

409038 T97490 Hs.50002 small tndudble cytokine subfamily A (Cy 22.90 

420367 AA259090 Hs.257028 ESTs 22.90 

406856 AW515335 Hs.29797 ribosomal protein LI 0 tlV 

25 417139 MS9043 Ks.81328 midear factor of kappa fight polypeptid 22.75 

412636 NM_004415 desmopiakIn(DPl.DPn) 22.40 

420676 AI434780 Hs.4246 ' vav 2 oncogene 22.10 

440869 NM 014297 Hs.7486 protein expressed in thyroid 21.40 

446627 AI973016 Hs.15725 hypothetical protein SBBI48 21.20 

30 410315 AI638871 Hs.17625 Homo sapens cDNA: RJ22524 fis. done H 21.10 

420754 W79431 Hs.345911 ribosomal protein L22 20.98 

435538 AB011540 Hs.4930 low density lipoprotein receptor-related 20.90 

440440 Z28925 Hs.7188 sema domdn, immunoglobulin doman ()g), 20.80 

429490 AI971131 Hs.23889 ESTs. Weakly similar to ALU7.HUMAN ALU S 20.40 

35 425769 U72513 Hs.159486 Human RPL13-2pseudogenemRNA. complete 19.50 

422714 AB018335 Hs.119387 KlAA0792geneprodud 19.15 

430253 AK001S14 Hs.236844 hypothetical protein FU 10852 19.00 

413787 A13S2558 tyrosine 3Hnonooxygenase/tryptophan &mo 18.50 

4S2874 AK001061 Hs.30925 hypothetical protein FU 10199 18.50 

40 430255 AK000703 Hs.323822 Homo sapiens mRNA for KIAA1551 protein. 18.00 

432606 t^M_002104 Hs.3066 granzyme K (serine pTOtaase, granzyma 3; 17.90 

448588 AI970276 Hs.156905 KIAA1676 17.70 

444784 D12485 Hs.11951 edonudeotMapyrophosptiatase/tihasphodi 17.50 

428782 X12830 Hs.193400 Interteutdn 6 receptor 17.40 

45 414092 Z14244 Hs.75752 cytodvome c oxidase subunlt Vllb 17.20 

425945 AW410669 Hs.164280 solute carrier family 25 (mitodiondrial 17.15 

420759 T11832 Hs.127797 Homo sapiens cONAFUl 1381 fis. done HE 17.10 

440528 BE313555 H8.7252 KIAA1224 protein 17.06 

410143 AA188169 WAAI 191 protein 17.05 

50 421181 NM-005574 Hs.184585 UM domain only 2 (fhonibo8n-ne 1) 16J0 

426083 AW962712 H5.126712 ESTs. We^ similar to AF1910201E2IG5 16.70 

429183 AB014604 H3.197955 KIAA0704 protein 1^70 

450937 R49131 Hs.26267 ATP-dependant interferon response protel 16.60 

449571 AW016812 Hs.200266 ESTs 16^0 

55 432730 AI065520 Hs.131358 ESTs 1R20 

426295 AW367283 zinc finger protein 6 (CMPX1) 16.15 

439180 AI393742 Hs.199087 v-erb-b2 avian CTfliroblBSticIeckemiav 15.90 

420028 AB014680 Hs.8786 carbohydrate (N-acetylgIucosan^nfr6^ s 15.80 

421379 Y15221 Hs.103g82 smaBWudWecytoldne subfamily B(Cy 15.80 

60 455238 AF045229 Hs.82280 rogulatorof G-proteinstgnalling 10 15.70 

429469 M64590 Hs.27 gtydne dehydrogenase (decarboxytaling; l&SO 

428928 BE409838 Hs.194657 cadhertn1.type1,EMadherfn{e^Ii 14.90 

452322 BE566343 Hs.28988 ghitarodoxin (thiottransrerase) 14.90 

406656 M16714 HS.B9843 major histocompatlblDty complex, dass 14.85 

65 414020 NIWL002984 Hs.75703 small lndudblecytddneA4(homdogou5 14.71 

450377 AB033091 WAAI 265 protein 14.70 

425998 W87330 hypothetical protfilnALl 101 15 14X0 

430332 R51790 Hs.239483 Human done 23933 mlVM sequence 14.60 

427691 AW194426 H5.20726 ESTs 14.42 

70 428814 AI371172 H8.211539 hyPoQwScaipn>1dnM6C4248 14.35 

451106 BE382701 Hs.25960 N4y)YC oncogene 14.21 

422241 Y00062 Hs.170121 protein tyrosine phosphatase, receptor t 14.13 

436860 H12751 Hs.6327 PR01 914 protein 13^ 

446899 NM.005397 Hs.16426 podocalyxWike 1350 

75 450000 Ald52797 Hs.10888 hypothetical protein FLi21709 13.75 

408380 AF123050 Hs.44532 dhlbiquHin 13.70 

447526 AL048753 Hs.303649 small indudUe cytokine A2 (monocyte ch 13.40 

427521 AW973352 ESTs 13.30 

410598 AI817130 Hs.9195 Homo sapiens CONARJ13698 fis. done PL 13.25 

80 428664 AK001666 Hs.18g095 dmOartoSAai (sal(DtosaphiIaHika 13.23 

408822 AW500715 Hs.57079 Homo sapiens CONAFU13267 fis. done OV 13.23 

425269 AW139342 K3.155530 Interferon, gammaMudble protein 16 1Z70 
426552 BE297660 H5.170328 ffloesin 12.69 
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415657 


AA666115 


Hs.127797 


436398 


H87136 


Hs.5174 


41B1S1 


AA664238 


Hs.83583 


453020 


AL152039 


Hs.31422 


410275 


U85658 


Hs.61796 


414587 


NM_004862 


HSJ6507 


425875 


AU077333 


Hs.160483 


41S938 


BE383507 


Hs.78921 


419384 


AA490866 


Hs^29 


410185 


BE294088 


Hs.737 


407862 


BE548267 


Hs.337986 


433793 


AW975959 


Hs.107513 


406743 


AA91t568 


Hs.279860 


408989 


AW361666 


HS.49S00 


430268 


AK000737 


Hs.237480 


447735 


AA775268 


Hs.6127 


410325 


AB023154 


Hs.62264 


445617 


NM.003642 


Hs.13340 


418299 


AA279530 


H&.B3968 


445863 


R12234 


H5.13396 


454413 


A)653672 


Hs.40092 


418460 


M26315 


H5.8525B 


428085 


AI634046 


H5.157313 


432805 


X94630 


Hs.3107 


422511 


AU076442 


H5.1 17938 


444060 


AA340277 




452436 


BE077546 


Hs.31447 


409963 


AA133590 


Hs.250857 


402474 






407112 


AA070801 


Hs.51615 


408786 


AW161678 


Hs.! 11334 


428227 


AA321649 


Hs.2248 


444656 


AI277924 


Hs.145199 


453856 


AA804789 


Hs.19447 


440774 


A!420611 


Hs.153934 


408669 


A)493591 


Hs.78146 


431639 


AK000680 


Hs^175 


444381 


BE387335 


Hs,283713 


418509 


AB02B624 


Hs.85539 


437374 


AL359571 


Hs.44054 


415899 


X78992 


Hs.76909 


450719 


AI096837 


HS21349 


424800 


AL035568 


Hs.153203 


446682 


AW205632 


Hs.211198 


447211 


AL161981 


Hs.17767 


425234 


AW152225 


Hs.165909 


422105 


AI929700 


Hs.111680 


417144 


AA382104 


H3.81337 


422068 


AI807519 


Hs.104520 


452651 


AI218918 


Hs.30209 


418707 


U97502 


H3.87497 


426996 


AW968934 


H8.173108 


427761 


AA412205 


Hs.140995 


443523 


AK001575 


Hs.9538 


402145 






413686 


AI469213 


Hs.71404 


446488 


AB037782 


Hs.15119 


449246 


AW411209 


Hs.23363 


424321 


W74048 


Hs.1765 


407179 


AA206465 




413497 


BE177661 




430068 


AA464964 




446795 


AI797713 


Hs.156471 


435522 


N64214 


Hs.9774 


451864 


N20370 


Hs.69547 


419490 


NM_006144 


Hs,90708 


419904 


AA974411 


Hs.18672 


427711 


M31659 


HS.18O40B 


409208 


Y00093 




424950 


AA602917 


Hs.166974 


447534 


AW953935 


Hs.288655 


419223 


X60111 


H$.1244 


423873 


BE003054 


Hs.1695 


422980 


AW890487 




412025 


AIB27451 


Hs.24143 


408784 


AW971350 


Hs.63386 


417407 


AA92327B 


Hs.290905 


432409 


AA806538 


Hs.130732 


440273 


AI805392 


Hs^5335 


407110 


AA018042 


Hs^52085 


408161 


AW952912 


Hs.300383 


434260 


BE005398 




434524 


AA635331 


Hs^49716 



Homo8apienscONAajl1381fis,c)oneHE 12^ 

ribosomat protein SI 7 12.50 

adln related protein 2/3 complax, sutmn 12.30 

HonK)wpiOT5niRNA;cDNADKFZp434M229(fr 12.30 

transcrqiiion factor AP-2 gamma (acGvat 12.28 

IPS^xed TNMpha (actor 12.25 

erythrocyte membrane protein band 7.2 (s 12.25 

A kinase (PRKA)andx)r protein 1 1Z20 

EST8 1Z20 

immediale early protein 12.15 

Homo sapiens cDNA aJ10934 fis. done OV 12.05 

ESTs. Moderately similar to KlAAl 058 pro 12.00 

tumor protein, iFBnslaSonally-controlle 1 1 .90 

KIAA0746 protein 11.80 

hypottteOcal protdn FU20730 11.80 

Homo sapiens cDNA: aj23020 fis, done L 1 1.80 

K1AA0937 protein 11.70 

histone acetyttransferase 1 1 1 .70 

integrin, beta 2 (anGgen C01 8 {pg5). ly 1 1.61 

Homo sapiens done 25028 mFVJA sequence 11.60 

PI^S.123 11.60 

COS antigen, alptia potypeplide (p32) 1 1.40 

ESTs 11.40 

C097 antigen 11.36 

collagen, type XVIi, atpha 1 1 1.35 

Homo salens cONA aJ20167 fis, done CO 11.30 

ESTs. Moderately similar to A46010)Wln 11J0 

caidumfoalmodulin-dependent protein kin 1 1 .1 1 

NM_004079:Homo saj^ns calhepsin S (CTSS 11.00 

ESTs, Wealdy simlar to ALU7_HUMAN ALU 8 11.00 

ferritin, Bgtit polypeptide 10.95 

small indudble cytokine sublamlly B (Cy 10.90 

ESTs 10,90 

PDZ-UM protein myslique 10.85 

ESTs 10.82 

platetettendothelial ceD adhesion mdec 10.80 

phosphoprotelnassodatedwfihGEMs 10.80 

ESTs. Weakly sinular to S64054 hypotheO 10.80 

ATP synliiase, H transporfrig, mitochondr 10.70 

ninein (GSK3B interacSng protein) 10.65 

butyrate response bdor 2 (EGF<esponse 10.60 

ESTs, Weakly sMIar to RB8B J1UMAN RA&A 10.43 

MyoD family inhibitor 10.40 

ESTs 10.40 

K1AA1S54 protein 10.31 

ESTs, Weakly similar to 138022 hypolheli 10.30 

endosulRne alpha 10.21 

lectin. ga!actos{dfr^]nding.solubto, 9 10.20 

Homo sapiens cDI^ FU13694 fis. done PL 10.20 

K1AA0854 pmteln 10.15 

butyrophiOn, subfamily 3. member A2 10.11 

Homo sapiens cDNA: aJ21897 fis, done H 10.10 

ESTs 10.10 

hypotheft:al protdn nJ10713 ^90 

Target Exon 9.82 

ESTs 9.80 

KIAA1361 protein gi60 

hypothetical protein FU10983 9.80 

lymphocyt&«pedfic protein fyioslne kin 9.80 

thymosin, beta 4, X chromosome 9.72 

gb:RCl-HT0598^2030W)1 1-h02 HT0598 Homo 9.70 

gb:zx60(10.8l Soares ovary tumor NbHOT H 9.70 

ESTs a70 

synovid sarcoma transtocalion gene one 9165 

ESTs 9.65 

granzymsA{granzyme1,cytolaxiGT-lymp 9.60 

ESTs m 

sduts carrier family 25 (milochondrf al a60 

Integrin, alpha X (antigen C011C (pi SO), 9.52 

ESTs 9.50 

ESTs 9.50 

C09antlBen(p24) 9.41 

matrb metsDi^rateinase 12 (macrophage a40 

cadheiin U HH»lherin (heart) 9.33 

Wiskott-AUitoh syndrome protein tnterac 9.32 

ESTs 9.30 

EST 6. similar to protease (H.8Bpi 9.30 

KIAA1575 protein 9.30 

Homo sapiens cONA: FU23523 (is. done L 9.30 

Prader.Wan/AngeImansyndrome-5 9.22 

hypothetk^ protein MGC3032 9.20 

gb:CM1-BN0116.150400-1894)02 6N0116Komo 9.20 

ESTs 9.20 
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450294 

4072S4 

434442 

440268 

447519 

433156 

410730 

438823 

442808 

433271 

445577 

408437 

420962 

431187 

42109B 

424528 

446108 

401091 

433412 

438089 

4319S8 

452568 

414191 

411979 

414829 

430162 

448412 

423753 

407833 

416975 

433208 

428970 

425284 

437108 



H42587 

AW129401 

AA737415 

BE270030 

U46258 

R59206 

AW368860 



BE621697 



Ks£185 



Hs^877 



426827 
453716 
418840 
434649 
449656 
425535 
409493 
432559 
436797 
420099 
417640 
420337 
434423 
437886 
418522 
433655 
435968 
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423523 
409327 
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444785 
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442821 
412347 
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4134^ 
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437134 
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Hs.238730 hypolheOcal pnt^ MGC10823 9*20 

HS.1B11G5 eukayoficttanslaaonelongaaon factor 9.10 

ESTs 9.10 

Hs.336959 Homo sapiens, done IMA6E:3677t85.inRNA 9.03 
Ks.339665 ESTs 

HS.17S19 Honiosapi8iiscONA:FU22S39fis.don8H 8.98 

OnaJ(Hsp4O)ho{n)lo0, subfamily B.fnembo 8.90 

AW749865 Hs.1 17077 ESTs, Weakly simQar b) 138022 hypotheG 8.90 

AW294522 H5.149991 ESTs 8-90 

Hs.14317 nucleolar protein family A. member 3 (H/ 6.89 

Hs.137064 cyloplasmtepolyadenylailon element bfaid 8.81 

Hs.278469 lacrimal praline rich protein 8.80 

NM_005904 Hs.100602 MAD (moQiers against decapenlaptegicDr 8.80 

AW971146 Hs.293187 ESTs 8.80 

Hs.192425 ESTs 8.70 

Hs.238954 ESTs. Weaidy similar to KIAA1204 protein &70 

Hs.42322 A kinase (PRi<A} anchor protein 2 8.70 

decay accelerating factor for complement 8.62 

CGM4 protein; sulfkie dehydrogenase li 8.60 

niHitear receptor subfamily 1 , group t, m 8.60 

cadherin 3, type 1, P-cadheHn (placenta 8.60 

Hs.300870 Homo sapiens mRNA;cDNADKi^547M072(fr a59 

Ks.75807 PDZ and UM domain 1 (elfin) ^56 

Hs.72984 retinoblastomablnding protein 5 8.50 

Hs.77435 pleckstrin 8-50 

AW450843 Hs.346348 ESTs 8.50 

A1219083 Hs.42532 ESTs, Moderately simQar to ALU8_HUMAN A 8.50 

Y11312 Hs.132463 phosphoinosltid8-3«ndse,class2.beta 8.45 

AW955632 Hs.66656 ESTs.Vteaklysimnarto SI 9560 prolines 8.43 

NM_004131 H8.1051 granzymeB(granzyme2.cytoto3dcT4ymp 8.40 

AVW)02834 Hs.24095 ESTs 8.40 

H5.194691 reUndc acid induced 3 8.38 

NSI-assodatedproteb 1 8.33 

Hs.80624 hypotheses protein MGC2560 8.33 

Hs.44344 hyi)othetiGal protein FU20534 8.30 

Hs.172665 methytenetetrahydrofotate dehydrogenase 8.30 

Hs.152675 ESTs 8.30 

Hs.185831 ESTs 8.20 

Hs.165390 ESTs,Highlys!milartoA40350tr8nscrip 8.20 

H3.188633 ESTs 8.20 

Hs.158287 KIAA0468 gene product 8.17 

Hs.1934a2 Homo sapiens CDNAFU11903 lis. done HE 8.13 

Hs.257631 ESTs 8.10 

Hs.334477 hypolheticd protein MGC14879 8.10 

H3.95140 KIAA0189 gene product 8.01 

Hs.82353 protein C receptor, endothelial (EPCR) 8.00 

AWM5840 Hs.14555 Homo sapiens cDNA: aJ21 513 fis. done C aOO 

NM_006769 Hs.3844 LIM domain only 4 8.00 

BE;B4111 Hs.31314 retinoblastoma4)inding protein 7 aOO 

Hs.7149 Homo satins cDNA: FU21950 fis. done H 7.92 

H3.3463 ribosomai protein S23 7.89 

Ks.l 1 1 577 integral membrane protein 3 7.89 

Hs.18106 ESTs 7.88 

Hs.193580 ESTs 7.86 

HS.53S63 coBagen, type IX. alpha 3 7.84 

Hs.18103 ESTs 7^ 

AW135214 Hs.191828 ESTs 7.80 

AI381900 Hs.159212 ESTs 7.80 

L06505 Hs.182979 ribosomal protein L12 7.75 

8E242587 H5.118651 hematopoieftoally expressed homeobox 7.68 

AI590401 Hs.21213 ESTs 7.66 

T16206 H5.237164 ESTs. Highly shnBar to LDHHJIUMANL-IAC 7.62 

AW856665 gb:RC3.CT0297-2g0100413^03 CT0297 Homo 7.60 

AI1933S6 Hs.160316 ESTs 7.60 

U24683 bnmunogtobuRn heavy constant mu 7.S9 

BE391929 HS.87S2 Iransmembrane protein 4 7.56 

AW970D26 Hs.73818 ubiquinokytochromec reductase hinge p 7.52 

AA659037 Hs.163780 ESTs 7.50 

AI362S33 KIAA0306 protein 7.50 

AW298741 Hs.97861 ESTs, Moderately sbnibr to 138022 hypol 7.50 

Ar76l902 HS.99S97 ESTs 7.50 

AW137772 Hs.185980 ESTs 7.50 

AW861622 Hs.108646 Homo saf^nscDNAFU 14934 fis. done PL 7.50 

At718702 Hs.279930 ma|orhistDcompalibflity complex, dass 7.46 

BE270640 Hs.19192 cydln^Iependent kinase 2 7.43 

AW410235 Hs.75348 proteasome (prosome, macropainl acOvato 7.42 

AA310753 Hs.42491 ESTs. Weakly similar to S65657dpha-1C- 7.41 

AA34g944 Hs.42915 ARP2 (actin-relaled protein 2. yeasQ ho 7.40 

A8011084 Hs.48924 KIAA0512 gene product; Al^ 7.40 

U24577 Hs.93304 phosphdIpaseA2, group VU(pteletelraB 7.40 

AK000401 Hs.252748 Homo sapiens cDNAFU20394 fis. done KA 7.40 

AW753112 Hs.15514 hypothettoai protein MGC326D 7.40 

AI341227 Ks.157108 ESTs 7.38 
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W05391 

X63629 

AA805634 
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441878 

443247 
412645 
417315 
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445245 
414812 
445055 
410397 
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Hs.12461 

Hs.77367 
Hs.1 09051 
HS.63D42 



447484 AA464a39 Hs.292566 hypolhefical protein aj 14697 7.34 

441612 AI802629 Hs.113560 Homo sajdenscONAFUl 1631 lis, done HE 7.30 

408067 BE244580 Hs.342307 hypothetical protein FU10330 7.30 

434963 AW974957 Hs^719 Homo sapiens cDHAFU12t42fls, done MA 7.30 

437103 AW139408 Hs.152940 ESTs 7.30 

442495 AJ184717 ESTs 7.30 

445929 AI089560 H5.323401 (Jpy-3IWike protein 7.30 

446013 At360167 Hs.152774 ESTs 7.30 

436075 BE090176 Hs.179902 transporter-fike protein 7.20 

450139 AK001838 serom^Bh)cocor6coid regulated kinase 7.20 

423905 AVW579960 HS.1351S0 lung typeJ ceil memtjTBne-assoclatBd gly 7.17 

406819 AA90S472 gb:og82al 0^1 Na.CGAP_0v8 Homo sapiens 7.15 

407719 AW963866 Hs.44021 Homo sapiens mRNAibraJ00065 protein. 7.12 

425593 AA278921 Hs.1908 proteoglycan 1, secretory granule 7.10 

413886 AW958264 H$.103832 simHar to yeast Up<3, variant B 7.10 

422616 BE300330 Hs.1ie725 selenophosphatB synthetase 2 7.10 

424677 U09414 zinc finger protebi 137 (cbnepHZ>30) 7.10 

427254 AL121523 H5.97774 ESTs 7.10 

427307 AF1 17947 Hs.174795 PDZ dDmaln-oonl**>g guanine nucteotWe 7.10 

gb:Ul.HF-BR0p^M2-O-Ul.f1 NIH.MGC.5 7.10 

gty:RC2-BT0548-200300.012'^BT0548Homo 7.10 

Hs.127982 ESTs 7.09 

Hs.333893 o-Myc targelJPOl 7.04 

Ks.1 36061 Homo sapiens. Similar to hypotheOcd pr 7.00 

H8.180450 ritiosomal protein S24 7.00 

Hs.35254 hypothetical protein F1J6421 7.00 

Homo sapiens cDNA: FU21122 lis. done C 7.00 

LCHN protein 7.00 

monokine Induced t)y gamma interferon 7.00 

SH3 domain binding glutamteacid-ridipr 6.97 

DI^564J157 protein 6.96 

Hs.326290 hypotheticai protein FU1 2581 6.96 

Hs.146233 Homo sapiens cDI^FU22130fis, clone H 6.95 

Hs.86517 ESTs 6.90 

Hs.152983 HUS1(S.pombe)d)ed(pointhomdog 6.90 

Hs.9216 caspase 7, apoptosis-related cysteine pr 6.90 

Hs.107515 ESTs. WeakJy similar to T00329hypothetl 6.89 

Hs.193969 ESTs 6.88 

Hs. 1 55975 protein tyrosine phosphatase, recepitor t 6.87 

Hs.33026 hypothetical protein PP2447 6.85 

veside-assodated membrane protein 4 6.84 

Hs.179943 ribosomal protein L11 6.83 

H5.14947 ESTs 683 

HS.234S18 ribosomd protein 123 6.83 

Hs.288581 Homo sapiens CDNAFU14236 lis. done PL 6.83 

Hs.268557 pleckstrinhoffldogy-likBdoffldn, family 6.82 

Hs.268916 ESTs 6.80 

Hs.293616 ESTs 6.80 

HS.15S111 hypolheficd protein FU14428 6.80 

Hs.5337 Isodtrate dehydrogenase 2 (NADP).mItoc 6.80 

Hs.54443 diemokine(OCmotiO receptor 5 &80 

ESTs, Wealdy similar to unnamed protein 6.79 

Hs.1742 IQfflottfcontainbigGTPaseacGvafingpr 6.77 

Hs.155478 cydinT2 6.72 

Hs.1 55321 serum response fador (c-fos senim respo 6.70 

Hs.191381 hypothetical protdn 6.70 

H$.108747 serine carboxypepfidase 1 precureor prot 6.70 

Hs.109653 ESTs 6.70 

Hs.13797 ESTs 8.69 

HS.1914B2 ESTs 6.69 

Hs.83496 linker for aclivafton of T ceils 8.67 

Hs.82280 regulator of G-protein signatiing 10 6.65 

HS.1B0559 EST8,Weakty8lmfl8rtoAS61941hrombaKa 6.62 

Hs.18142 arrestin.beta2 6.61 

Hs.247057 ESTb, Weakly similar to 2109260AB cell 6.60 

Nl\4_014135 Hs.8345 PRO0641 protein 6.60 

AI131291 H3.102308 potassium inwardly-fectifying channel, 8 6.59 

Hs.329296 ESTs. Weakly dnAar to (deflbie not ava 6.S7 

Hs.78225 annexinAI 6.56 

Hs.24724 MFH^plified sequences with leudnoKie &S4 

Hs.2886% Homo saj^ns cDNA: FU22182 lis. done H 6.51 

HS.10&768 hypotheteal protein FU10511 6.50 

Hs.7753 cahimenin 6.50 

Hs.143463 ESTs 6.50 

Hs.32061 ESTs. Weakly similar to 138022 liypotheO 6.50 

Hs.79971 sal{DrosophilaH3»2 6.40 

ESTs 6.40 

Hs.106260 sortognexinIO 6.40 

Hs.119769 ESTs 6.40 

Hs.12969 hypothetical protein 6.40 

ESTs. Moderately dmiiar to HERC2IH.sap 6.40 

Hs.287420 HoRU) sapiens cDNAFU11S33fis, done HE 6.40 

Hs.279860 tumor protein. Iranslationdly^xmlione 6.39 

648 



449924 

418134 

424768 

443303 

411852 

451838 

425367 

453485 

401466 

457073 

412093 

442492 

449971 

431773 

416401 

426501 

435080 

435876 
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453932 
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448094 
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U09414 
AL121523 
AF1 17947 
AW502384 
BE072881 
At801869 
BE614387 
AW444433 
AI080042 
AF138863 
AA830856 
AB032973 
X72755 
BE512856 
AF217517 
AW959433 
W30681 
AA397769 
AA353895 
U67319 
AA528140 
AW005866 
BE271188 
BE620712 

AA233210 
BE242691 
AA528489 
AA607346 
BE409442 
N80139 
AW043782 
AI831760 
AI124756 
NM_000579 
AW2D4069 
IJ3075 
NM.001241 
AI085561 
Nl^016625 
W26673 
AA608955 
AW967807 
AA203569 
AI922013 
AF045229 
AW392551 
AF106941 
W21874 
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BE281591 

AU076964 

R38828 

H24387 
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AA215404 
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AA830893 

N94126 

W18193 

T64896 

BE300073 
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414476 
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431840 
435812 
401113 
408418 
412220 
426780 
427202 
434699 
447887 
449576 
444933 
432841 
411975 
452652 
433162 
449322 
430333 
440327 
442832 
456362 
427968 
414662 
418113 
406870 
416003 
445493 
424687 
427477 
422499 
443441 
413677 
405797 
406857 
410387 
410503 
441982 
425762 
406877 
407784 
416297 
446272 
412949 
420059 
435758 
451658 
441623 
416926 
425190 
441244 
421305 
430504 
422310 
406605 
433891 
406542 
406858 
408331 
432729 
439451 
455263 
441321 
429083 
406806 
416987 
450988 
428773 
406794 
457752 
435511 
451589 
439979 
412528 
424875 
426981 
447711 
449961 
416759 
415082 
422773 
441455 
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AA301867 
AA310974 
AA534906 
AA70O439 

AWS63897 



8E242284 

BE272922 

AA643687 

AA114050 

AW014631 

NM_016245 

M9342S 

AI916058 

AK001972 

AI025842 

AI638616 

S70114 

R12581 

AW206S60 

AW973003 

AI857607 

AL036058 

A1272141 

AA075144 

X98001 

AI915771 

J05070 

AW973119 

AI268666 

AW291196 

AWS03116 

AI432224 

AA613726 

A1277367 

AW975746 

AW972542 

BE244076 

AA226392 

AW139585 

AA157634 

BE268912 

AI471639 

AF161486 

At418466 

AW195351 

AA315805 

H03109 

AW028302 

BE612935 

BE397354 

H52761 

AA316622 

AR)2S374 

AA613792 

AI865720 

NM.007240 

AK0C0292 

AF0B6270 

AW981702 

H17182 

Y09397 

AW088535 



BE618571 

BE256238 

AI690243 

AI821270 

AA683336 

AA424791 

AW600291 

AI123478 

AI187945 

AL044675 

A1459554 

AW265634 

AK000978 

AA160000 

AB028962 

AJ271671 



Hs.102737 

Ks.76224 

HS.156B28 

Hs^SSO 

HS.18B490 

Hs.44743 

Hs^787 

H5.172199 

Hs.173936 

Hs.149425 

Hs.1994g 

H5.225D88 

Hs,12150 

H5.62 

Hs.144583 

Hs.30822 

Ks.196566 

Hs.239489 

Hs,191146 

Hs.253569 

Hs.179909 

Hs.181301 

Hs.76807 

Hs.83484 

Hs.78948 

Hs.151738 

Hs.178391 

Hs.19631 

Hs.92195 

Hs.301819 

Ks.29797 

Hs.47094 

K5.188682 

Ks.289008 

H5.159578 

Hs.179943 

Hs.12708 

Hs.79172 

Hs.14601 

Hs.71913 

H5.94769 

Hs.33665 

Hs.250520 

Hs.263395 
Hs.155079 
HS.1B4052 
Hs.324830 

Hs.98370 
Hs.46465 
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Hs.44229 
Hs.130732 
Ks.27e554 

Hs.7771 
H5.227817 

Hs^0712 

Hs.429 

Hs.193163 

Hs^5643 

Hs.189046 

Hs.5734 

Hs,6823 

Hs^2112 

HS.19931D 

H5.173081 

H5.161286 

Hs.133100 

Hs.79741 

H5.137395 

HSJ01552 

HSJ654 
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440119 


AA665455 


H5.125331 


ESTs. Moderately similar to uniuiown [H^ 


5.63 


424326 


NM.014479 


Hs.145296 


AOAMie disintegrtn protease, decysin 


5.60 


431770 


BE221880 


K5.268555 


5^*3rexorit)anuclease2 


5.60 


436511 


AA721252 


Hs.291502 


ESTs 


5.60 


446630 


AW384793 


Hs.15740 


Homo sapiens mRNA; cDNA OKFZp434E033 (fr 


5.60 


406623 


X69392 


Hs.91379 


nboscma) protein L26 


5.60 


452382 


N38902 


Hs.211539 


hypoltteUcal protein M6C4246 


5.57 


416047 


BE439894 


Ks.78991 


DNA segment, numerous coptes. expressed 


5.56 


437296 


AA350994 


Hs.20281 


K1AA1700 


5.56 


453985 


N44545 


Hs.251665 


ESTs 


5.56 


443351 


AW016783 


Hs.30799 


Homo sapiens cDNA FU13471 fis. done PL 


5.55 


448877 


A1583696 


Hs.253313 


ESTs 


5.53 


435748 


AA699756 


H5.1 17335 


ESTs 


5.52 


420732 


AA789133 


HS.63S2S 


ESTs 


5.51 


421818 


AW992976 


Hs.50098 


NM.002489:Homo sapiens NAOH dehydrogenas 


5.50 


430915 


AA488953 




gb:aa55e05.r1 Na.CGAP.GC61 Homo sapiens 


5wS0 


436716 


AI433S40 




gb:ti69g05j(1 ND.CGAP.Kidll Komosapien 


&50 


437442 


T85104 


Hs.222779 


ESTs, Moderately similar to similar to N 


5.50 


449625 


NM.014253 




odz (odd OzAen^n. DrosopltOa) homolog 1 


5^ 


456497 


AW9S7956 


Ks.123848 


ESTs,WeddyslmiIartoAF1084601 ubinu 


S50 


451287 


AK002158 


Hs.26194 


fikfity homolog of mouse InvnuT^'-assoda 


5.50 


433701 


AW44a)23 


Hs.15155 


ESTs 


S.49 


427640 


AF058»3 


HS.18W1S 


[>dopactuome (automerase 


5.47 


420552 


AK000492 


Hs.98806 


hypothetical protein 


5.45 


449338 


H73444 


Hs.394 


adrenomedutlin 


5.42 


427176 


AW381569 


Hs.40334 


ESTs 


5.42 


409945 


AWD15935 


Hs.122642 


ESTs 


5.40 


421568 


W85858 


Hs.99804 


ESTs 


5.40 


423961 


D13666 


Hs.13634e 


penosiin(OSF-2os) 


5.40 


440719 


AA150869 


Hs.26267 


ATP-dependanI interferon response protel 


5.40 


443035 


Z45822 


Hs.8908 


Homo sapiens dona 24889 mRNA sequence 


5.40 


458659 


AW749895 


Hs.332520 


Homo sapiens mRNA: cDNA Di<FZp434A1014 (f 


5.40 


424415 


NM^001975 


Hs.146560 


endase 2. (gamma, neuronal) 


5.40 


420137 


AA306478 


Hs.95327 


CD30 antigen, delta polypeptide (TiT3co 


5.39 


422163 


AF0Z7208 


Hs.112360 


prominin (mous6>-l]Ke 1 


5.3B 


439815 


AA206079 


H5.6693 


hypothetical protein FU20420 


5.37 


452432 


AW206008 


Hs.283378 


Homo sapiens cONA: FU21778 fiSi done H 


5.37 


457465 


AW301344 


Hs.122908 


DNA replication factor 


5.37 


412935 


BE267045 


Hs.75064 


tubuBn^spedfic chaperone c 


5.38 


409485 


880990 


Ha^52136 


tioofin (cdlagen/Gbrtnogen domainnnnt 


5.35 


430283 


BE391688 




RAB7, member RAS oncogene femlly 


5.33 


406814 


AA642947 


Hs.1 19122 


ribosomal protein LI 3a 


5.33 


409019 


AW385412 




myosin regulatory Oght chain 2, smooth 


5,30 


410561 


BE540255 


Hs,6994 


Homo sapiens cONA: FU22044 fis, done H 


&30 


412623 


R28898 


Hs.74170 


melaDothionein IE (functional) 


Sl30 


417450 


AA314435 


Hs.17519 


Homo sapiens cONA: FLJ22539 fis, dons H 


&30 


416702 


BE268388 


HS.6694S 


ESTs. Wbakly similar to A46010 X-Onked 


&30 


419926 


AW900992 


Hs.93796 


DKF2'586D2223 prot^ 


5.30 


422900 


AA641201 


Hs.222051 


ESTs 


5.30 


423494 


AW504365 


Hs.24t43 


Wiskott-Aldrich syndrome protein interac 


5.30 


427667 


AK001279 


Hs.180171 


Homo sapiens cONA FLJ10417 fis. done NT 


5.30 


427774 


AA278583 


Hs.160737 


Homo sapiens done 23664 and 23905 mRNA 


5.30 


430177 


AW969233 


Hs.302746 


MSTP028 protein 


5.30 


430835 


AI240008 


Hs.192326 


ESTs 


5.30 


433009 


AA761668 




gb3u24c08^1 NCl.CGAP_GCei Homosaplens 


5.30 


439776 


AL360140 


Hs.176005 


Homo sapiens mRNA ftiil length insert cON 


5.30 


447082 


T65314 


Hs.54629 


thtoredoxin-Gke 


5.30 


415995 


NI1.004573 




pho^)hdIpase C, beia 2 


5.29 


424578 


AK001973 


HS.1S0890 


hypothetical protein 


5.27 


441303 


AW293081 


Hs.241601 


ESTs 


5.27 


427816 


AA159246 


H8.180909 


peroxiredoxin 1 


5.27 


443963 


AA878163 


Hs.17448 


Homo sapiens cONA FU13618 fis. done PL 


5.26 


450273 


AW2g6454 


H5.24743 


hypothetical protein FU20171 


5.24 


444708 


AW971049 


Ks.11774 


protein (peptidyHnoly) ds/Irans Isome 


5.23 


415121 


060971 


Hs.34965 


Homo sapiens cDNA FU13485 fis, done PL 


5.21 


458079 


Ar7g6870 


Hs.54277 


DNA segment on chromosome X (unique) 992 


5.21 


405086 






NM.006662*M)mo s^ens Snf2-rdated C6P 


5.20 
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Hs 268814 


ESTs 




AW170ft1Q 


Uc 119110 




A'iRAQK. 




M« 170114 




«Wlr15l 




H« 306799 


l^vno saolens mRKA: cDNA DKFZd564F133 (fr 




nnOOIOr J 




fltnrn&BoOl^i NCI CGAP Thvl lisffio sapiens 


40001c 






ab*UI-H-BW&-rfD-a-4)941-UU1 NCLCGAP Su 


44 low 




IX. 1^1 171; 


ESTs 




R34910 


Hs.1 19172 


ESTs 
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MlCOf (£U 






•J510I1 
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Hs75643 


nudear tactor (ervthn^d-deilvad 2). 45 




A94490 


Lk> ll/'lfiR 

ns.! mooD 




432465 


LK>t)i09 


u. 971:1 




41<ifOo 






olnnnalinn fw4nr«9 IritUKA 


AAT^'i'i 


mW4990«M 




Inlprtoiikin inncpnlor fllnha 


430478 


Nh/L014349 


Hs.241535 


apdlpopiot^ L, 3 


420151 


AA255931 


Hs.186704 


ESTs 


434274 


AA628539 


H5.116252 


ESTs. Moderately similar to AIU1.HUMANA 


419317 


AA236282 


Hs.172318 


ESTs 


424699 


AW206227 


Hs^87727 


hypothetical protein aJ23132 


428403 


A)393048 


Hs^26159 


teudne ricii repeat (in FUl) interactin 


430968 


AW972830 




gb:EST384925 MAGE resequences. MAGI Homo 


431709 


AF220185 


Hs.267923 


uncharacterteed hypothatemus protein HTO 


436137 


A1056769 


Hs.133512 


ESTs 


440946 


AW188311 


Hs.128619 


ESTs 


448497 


BE613269 


H5.21893 


hypothetical protein DKFZp761N0624 


416655 


AW9&8613 


Hs.79428 


BCL2/8denovinis E1B 19K[)^teracUng pro 


417228 


AL134324 


Hs.7312 


ESTs 



Hs.96886 ESTs 

aclinin,al|4ia4 
Hs.143688 Homo saplenscONA: FU23031 Us. done L 
Hs.26514 ESTs 

Hs.13137 UV radiation resistance assodalsd gene 

libosomal protein L13a 
H5.41639 prograniinedcelldeath2 
AW797724 Hs.130350 ESTs 
Ks^10286 ESTs 
Ks.14158 coplnelll 

H3.342389 peplidylproly) Isomerase A (cydophiftn 

gbuiy97f1U1 Na^CGAP.GCBI Homo sapiens 
Hs.44672 hypolhetica! protein FU 10470 
Hs.64593 ATP synthase, H transporting, mitochondr 
Hs.94499 ESTs 
H5.192013 ESTs 

AW815832 Hs.173734 ESTs, Wteldy similar toAlUIJftJMAN ALUS 
AW105663 Hs.6947 HSPC069 protein 
H5.114104 ESTs 
Hs.126705 ESTs 

Hs.35304 Homo sapiens CONAFU13655 lis, done PL 
H3.337460 ESTs.W^sininartoA47582&«engr 
Hs.8594 KIAA1191pmteln 
Hs.11183 sorting nexin 2 
H$332938 hypothetical protein MGCS370 

gb:yh88b0U1 Soares placenta Nb2HP Homo 
Hsi00866 ESTs. Moderately similar to AlOff^HUMAN A 
H$.8S092 thyroid hormone receptor Interactor 1 1 
Hs.271778 ESTs, Weakly simaar to 138022 hypotheti 
AW»9^ Hs.101150 Homo sapiens, done tMAGE:4054156.mRNA. 
libosomal prot^ L6 

406793 AW264291 Hsi662 guanine nudeotide binding protein (G pr 
439864 A1720078 Hs^ig97 EST8,We3ldysbnBartoA47582&«e!lgr 

Hs^7912 ESTs, Weddy8bnilartoALjU7_HUMAN ALUS 
gb1e64e10j(1 SoaresJIFUr.GBCLSI Homos 
EosContrd 
Hs.76095 ImmecSaleeady response 3 



424868 A!S68170 

418905 BE539674 

427726 AI359144 

442618 R56222 

445715 AB0129SB 

406813 AW276131 

454128 AL031259 
440709 

436372 AWg72301 

446173 BE56S849 

453330 AI268061 

418876 AA740616 

408405 AK001332 

410570 AI133096 

410800 BE280421 
431451 
432879 
435655 
435919 
436394 



AA761378 



AI0521B9 
AA531187 



438459 T49300 
442232 AI357813 



442685 
444454 



AB033017 
BE018316 



444670 H58373 
447197 R36075 



450113 
450511 
450887 
452056 
457068 



A16B309S 

Ra7423 

AA011518 



420298 AI193510 

440638 AI376551 
400281 

414420 AAD43424 





415799 


AA653718 


H5.225841 




434666 


AF151103 


Hs.112259 




449057 


AB037784 


Hs.22941 


75 


448625 


AW970786 


Hs.178470 


451598 


N29102 


Hs.118078 




409686 


AK000002 


Hs^9 




410597 


W16518 


Hs^9518 




446291 


BE397753 


Hs.14623 


80 


447150 


A1439011 


Hs.86386 


418458 


AA332941 


H$.85228 




422627 


BE336857 


HS.1187B7 




437166 


AA338305 


Hs^72 




415825 


Y18Q24 


Hs.78877 



hypotheticdl protein FU20173 
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406781 


AA539388 




449B10 


A6008681 


Hs.23994 


410323 


AI241708 


Hs.2g6322 


444652 


BE513613 


Hs.11538 


422340 


AW296219 


Hs.115325 


400424 


/U276316 


Hs^87374 


411573 


AB029000 


Hs,70823 


421045 


BE144608 


Hs.55533 


42S235 


AA353113 


Hs.112497 


430387 


AW372884 


Hs.240770 


438590 


AA811465 


Hs.123375 


442071 


BE048433 


Hs.276043 


449567 


AI990790 


Hs.188614 


453213 


AA0826S0 


Hs.6217 


440129 


AA865818 




437802 


AI475995 


Hs.122910 


409461 


AA382169 


Hs.544e3 


421932 


W51778 


Hs.323949 


426453 


AB011110 


Hs.184357 


413441 


AI929374 


Hs.75367 


446560 


AK001567 


Hs.311002 


435541 


AA687361 


Hs.221318 


410557 


AA085803 


Hs.192997 


412766 


BE544475 


Hs.54347 


415526 


K76536 


Hs.265591 


418973 


AA233056 


Hs.191518 


421433 


AI829192 


Hs.22380 


432925 


AA878324 


HSJZ64750 


436869 


AF075009 




442233 


AW967149 


Hs.28439 


447198 


061523 


Hs.283435 


448552 


AW973653 


Hs.20104 


444681 


AJ243937 


Hs,288316 


414598 


A1094221 


Hs.135150 


447817 


6E620775 


Hs.4866 


416062 


AA724811 


Hs^791 


406661 


X56975 


Ks.172550 


424582 


AF0268i)9 


Hs.150922 


411165 


NIUL000169 


Hs.69089 


435905 


AW997464 


H5.5003 


445776 


NM.001310 


H8.13313 


424730 


NiyL0033S8 


Hs.23703 


414747 


U30872 


Hs.77204 


410668 


BE379794 


Hs.159651 


406774 


AW518383 


Hs.177592 


406648 


AA563730 


H5OT477 


407951 


W77762 


Hs.79015 


415682 


A1347128 


Hs.191870 


417621 


AV654694 


H&82316 


419970 


AW612022 




'420012 


AW957965 


Ks.99014 


431574 


AW572659 


Hs^61373 


432586 


AA568548 




437438 


AL359620 


Hs.14217 


441355 


AI822034 


Hs.137097 


444539 


AI95576S 


Hs.146907 


458965 


AA010319 


Hs.60389 


408655 


M21533 


Hs.277477 


414915 


NM.002462 


Hs.76391 


414821 


M63835 


Hs.77424 


423766 


AA303799 


Hs,300141 


451351 


AW058261 


H3.321435 


450043 


AA88S699 


Hs.24332 


447742 


AF11392S 


HS.1940S 


433339 


AF019226 


Hs.8036 


426395 


BE151985 




418300 


AI433074 


H&86682 


423799 


AW028300 


Hs.132906 


445093 


AI207197 




428044 


AA093322 


H8.301404 


453968 


AA847843 


Hs,62711 


414194 


6E175494 


Hs.75811 


427747 


AW411425 


HS.1B0655 


406745 


AW11970 


Hs^79860 


407013 


U35637 


Hs.83870 


407198 


H91679 




414646 


AA3S3776 


K5.901 


429887 


AIS75749 


K5.211608 


436566 


BE545588 


HU78712 


437634 


AW293046 


Hs.255158 


439971 


W32474 


Hs.301746 


442485 


BE092265 


Hs^724 


445B73 


AA25Q870 


Hs.251946 



gb:nq88b06£l Na.CGAP^Co9 Homo sapiens 

acIMn A receptor, type IIB 

Homo sapiens cDNA: RJ22844 6$. done K 

aclln lelaled praleln 2/3 complex, subun 

RAB7, member RAS oncogene fan^-like 1 

zinciinger protein 304 

K1AA1077 protein 

ESTs 

Homo safto cDNA: FU22743fis, done H 

nudear cap binding protein eubunn 2, 2 

ESTs 

ESTs 

ESTs 

Homo sapiens cONA aJ12521 fis. done MT 
ESTs. WeaUy sin«ar to S71886 Ste204ik 
ESTs 

N-myc {and STAT) inleractor 
kangal 1 (suppression of tumorigenidty 
GTPase acOvating protdn4]«e 
Src-IIkB-adapter 

Homo sapiens cDHA FU 10705 Qs. done KT 
ESTs 

ESTs, Moderately sinular to 178885 serin 
ESTs 

ESTs. Wealdy similar to ALU1.HUD4AN ALU S 

ESTs 

ESTs 

ESTs 

gb:Homo sapiens fuD lenglli insert cONA 
ESTs. WeaMy simBar to 138022 hypothetl 
ESTs 

hypolhelical protein FLJ00052 
chromosome 6 open reading frame 9 
lung type-) cell membrane-assodated gly 
Homo sapiens cDNA FU143B7 lis. done HE 
Homo sapiens cOl^ FLJ14609 fis. done NT 
potypynmidine trad binding protein (he 
BCS1 (yeast tiomologHike 
galaclDsldase, etpha 
KIAA0456protdn 

cAMP responsive element binding protein- 
ESTs. Moderately simiiar to CEGT J1UMAN C 
centromere protein F (35Q/40QkD. mltosin 
hypothetical protein 
nlKKoma) protein, large. PI 
major histocompatibiiity complex, dass 
antigen idenfified by monodonal anObod 
ESTs 

hterferoivlnduced, hepatitis Oassodat 
ESTs 

Homo sapiens, done IMAGE:3632168« mfWA 

hypothetical pr€teindJ434014.3 

ESTs 

hypothetfcd protein DKFZp762P2111 

ESTs 

ESTs. Weakly similar to 2004399A duwnos 
ESTs 

nvjor histocoaipafibility oomptea^ dass 
mpcovirus (Inftimnza) redslanoe 1, homo 
Fc fragment of IgG, htgh affinity la. re 
ribosomsl protein 139 

ESTs, WeaMy similar to ALJU1_HUMAN ALU S 
OGI-26prolein 

caspase recruitment domain 4 



hypotheScal protein FU23316 

Homo sapiens cONA: FLJ21578 63. done C 

19A24pro(dn 

ESTs 

RNA binding motif protein 3 

High mobility group (nonhistone diromoso 

N-acytsphingosine amldohydrdase (addc 

serine/threonine kinase 12 

tumor protein, translalionally-conbdie 

gbiHuman nebuHn mRNA, partial cds 

gb7v04a07^1 Scares fetal Gver spleen 

C048 antigen (&«ell membrane poteln) 

nudeoporin 1531(0 

Homo sapiens cDNA FU11074 6s, done PL 
ESTs 

member of RAS oncogene liamily 
hypothetkal protein FU13187 
p(dy(/^4)in(fing protein, cytopbsndc \4 
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450497 


H64159 


Ks.15328 


ESTs 


4.50 


417497 


AW402482 


Ks.82212 


C053 antigen 


4.50 


447667 


AL117611 


H$.19150 


Homo saptens mRNA; cONA DKFZp564A2164 (f 


4.49 


413856 


013639 


Hs.75588 


cyclinD2 


4.49 


419556 


U29615 


Ks.91093 


chifmase 1 (chMosUase) 


4.49 


429617 


X89984 


Ks.211563 


B-oe!l CLU^homa 7A 


4.48 


427157 


U51166 


Hs.1 73824 


thymhe-ONA glycosytase 


4.48 


446021 


BE389213 


Hs.286 


fOxifionial protein L4 


4.47 


413822 


R08950 


H5.272044 


ESTs, slmDar to ALULHUhMN ALU S 


4.46 


412819 


T2S829 


Hs.24048 


FK508 bin(Sng protein pfecusor 


4.46 


448717 


R67419 


Hs^1851 


Homo sapiens cONA aj12900 fis, done m 


4.45 


401846 


AW410382 




NM_000988*:Honio sapiens ribosomal piotel 


4.44 


422303 


Hs.27556 


hypottietical protein aJ22405 


4.43 


442358 


BES67985 


Hs.18565 


ESTs. Moderately sbnBar to ALU4J1UMAN A 


4.43 


436623 


A1417073 


Hs.107265 


ESTs 


4.42 


412146 


M92444 


Hs.73722 


APEX nuclease (multifuncfional DMA repai 


4.42 


437042 


AK000702 


Hs5420 


hypolhslica! protein FU2069S 


4.42 


416754 


H07145 


H5.6799 


ESTs, Weatdy sbnUar to T12483 bypolhetl 


4.41 


436671 


AW137159 


Hs.183291 


ESTs 


4.40 


410079 


U94382 


HS.585B9 


0lycoQenin2 


4.40 


420150 


AA648712 


Hs.29798 


KIAA1712 protein 


4.40 


424723 


BE409813 


Hs.152337 


protein arginine N^hyltransferase 3(h 


4.40 


428931 


AAg94979 


Hs.98967 


ATPase, H(Hransportlng, lysosomal, non 


4.40 


429109 


A1J008637 


Hs.196352 


neutrophil cytosoflc fector 4 (40M}) 


4.40 


430280 


AA361258 


Hs.237868 


intedeukin 7 receptor 


4.40 


438330 


AW450572 


Hs.257316 


ESTs 


4.40 


438962 


BE046594 




gb:hn41c11ji1 Na.0GAP^R0F2 Homo sapiens 


4.40 


444794 


A1419991 


Hs.145225 


ESTs 


4.40 


445100 


AW186205 


Hs.12311 


Homo sapiens done 23570 mRNA sequence 


4.40 


449659 


R60031 


Hs,198899 


eukaryotic translation iniGafion factor 


4.40 


449832 


AA594264 


Ks.60049 


ESTs 


4.40 


452404 


AW450675 


Hs^12709 


ESTs 


4.40 


447296 


AW243614 


Hs.18063 


Homo sapiens cDNA FU10768 lis. done MT 


4.39 


425097 


N^4_014247 




PDZ domain containing guanine nudrotide 


4.37 


441607 


NMjOOSOIO 


Hs.7912 


neuronal ceQ adhesion molecule 


4.37 


406742 


AI468091 


Hs^79860 


tumor protBin, translaUonaUy-controDB 


4.35 


425095 


AWO14160 


HS.18258S 


KIAA1276 protein 


4.34 


410342 


R31350 


Hs.743 


Fc fragment of IgE, high sffinHy l» rec 


4.34 


442333 


AIS50877 


Hs.1 29302 


ESTs 


4.33 


424971 


AA479005 


Hs,154036 


tumor suppressing subtransferable candid 


4.32 


415912 


H08859 


Hs.206469 


ESTs. WeaUy sin^ to ALU8.HUMAN ALU S 


4.32 


437386 


W524S2 




ribosomal |wotdn L10 


4.31 


408558 


AW015759 


Hs.235709 


Homo sapiens mRNA; cDNA DKFZp6676071 1 (f 


4.30 


408875 


NMJ15434 


Hs.48604 


DKFZP434B168 protein 


4.30 


409604 


AW444448 


Hs.49124 


ESTs 


4.30 


418866 


T65754 




gib.*yc11c07^1 Stralagene lung (93721Q) H 


4.30 


419423 


D26488 


Hs.90315 


KtAA0007 protein 


4.30 


422017 


NM_003877 


Hs.110776 


STAT induced STAT inha^or.2 


4.30 


422797 


AB033064 


H5.236463 


KIAA1238 protein 


4.30 


428467 


AK002121 


Hs.184465 


hypothetical pfDtelnFU11259 


430 


432290 


AKW1099 


H5^74273 


Homo sapiens cDNA FIJ10237 tis, done HE 


4.30 


434551 


BE387162 


Hs.280856 


ESTs. Highly similar to A35661 DNA exds 


4.30 


436138 


H53323 


Hs.25717 


Homo sapiens cDNA: FU23454 lis, done H 


4.30 


438915 


AA280174 


Hs.285681 


Wiliiams-Beuren syndrome chromosome legl 


4.30 


449217 


AA278536 


Hs.23262 


ribonudease, RNase A family. k6 


4.30 


452994 


AW962597 


Hs.31305 


K1AA1547 protein 


4.30 


418863 


BE387036 


Hs.1211 


add phosphatase 5. tartrate resistant 


4.30 


437250 


BE257342 


Hs.94576 


hypothetical protein MGC3082 


4.29 


440910 


H97875 


Hs.1 17974 


ESTs 


4.29 


406653 


AA614553 


Hs.252259 


hypothetical proteto FLJ23059 




432295 


BE091049 


Hs.343665 


ribosomal protein S15a 


4.28 


400244 






Eos Control 


4.28 


413518 


BE14945S 


H3.75415 


bela-2-micFogIobulin 


4.28 


409132 


AJ224538 


Hs.50732 


protein Idnase, AMP-acfivated, beta 2 no 


4.27 


406746 


AA580395 


Hs.279860 


tumor protein, translatlonaily<ontroDe 


4.26 


400395 


AF111167 




v-fos FBJ murine osteosarcoma viral onco 


4.26 


443229 


AI057129 


Hs.133396 


ESTs 


4.26 


450201 


T97838 


Hs.25722 


ESTs 


4.25 


409636 


AA30S729 


HS.1B272 


amino add transportsr system A1 


4.25 


422082 


AA016188 


Hs.111244 


hypothetical protein 


4.24 


444099 


D87432 


Hs.10315 


solute caiTter femUy 7 (caOonic amino 


424 


453902 


BE502341 


Hs.3402 


ESTs 


4.24 


415189 


134657 


H8.76146 




4.22 


404854 






Target Exon 


4.21 


406653 


AA574074 


Hs.77961 


m^ histocompatibility oomptex, dass 


4.20 


400440 


X83957 


Hs,83870 


nabuHn 


4.2D 


415049 


N67334 


Hs.50158 


ESTs 


4.20 


418304 
423180 
424684 


AA215702 
AR)68302 
AW752714 


HS.12S031 
Hs^174 




4.20 
4.20 
4.20 


fitx)Somd potein 817 


429412 


NM.006235 


H5^407 


PCXJ dom^ class 2, assodaiing factor 


4.20 


438141 


AW946871 




gb:RC2-ET0O22^50O4)12'd02 ET0022 Homo 


4.20 


436607 


AW080237 


Hs.252884 


ESTs 


4.20 
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451952 

455397 

417116 

453247 

430451 

414283 

452248 

450746 

444797 

44571B 

425783 

414837 

406710 

424436 

422343 

416207 

406724 

449475 

413828 

416819 

436574 

405266 

408996 

41Q704 

420851 

423096 

428328 



433308 
443559 
450850 
453785 
406854 
410768 
419612 
434203 
439237 
441374 
443415 
424338 
422305 
400233 
421959 
442622 
424795 
446231 
452933 
427681 
409061 
413891 
414004 
417035 
410584 
417353 
42364S 
430048 
431113 
434170 
434584 
435391 
446768 
448019 
451831 



AL120173 
AW936332 
Z43916 
TB0198 
AA836472 
AWSGOOII 
AA093668 
082673 
AB018333 
H79791 
AI026740 
U24266 
AI708347 
AW81842fl 
A162B633 
NM_014745 
C14071 
AJ348027 
L19067 
U77735 
AA72S002 

Al97gi68 
BE076754 
AA281062 
AA732684 
AA426080 
AW973253 
AA5B2718 
A1076765 
AA648886 
A)388236 
AA613705 
AF038185 
AI498267 
BE262677 
AW408158 
AA043696 
A1066523 
W78816 
A1928242 

AW751497 
NM_000435 
AW102850 
NWL002163 
AW391423 
AB01B263 
A1204994 
BE271020 
AA737033 
AA192455 
AB011112 
AA375752 
AI215632 
T6S054 
AK000673 
AA626509 
D57341 
AA704588 
AV660305 
AW947164 
NM.001674 



H5^1663 ESTs 



Hs.7634 
Hs.111806 
Ks.297939 
K5.154993 
HS.28S78 
Hs^78S8g 
Ks.12002 
Hs.15227 
Hs.1948 
Hs.77448 
Hs.184014 
Hs.4953 
Hs.346823 
Hs.79077 
Hs.234518 
H5.108557 

Hs.80205 
Hs.272018 

Hs.344096 

Hs.29493 
H3.278428 
H$.292812 



H8^1650 
Ks.269899 
Hs.151999 
Hs.283732 
Hs^52259 
H3.66187 
Hs.1 10613 
HS283558 
Hs.318893 
Hs.7822 
Hs.133472 
Hs.4d943 
H&293438 

Hs^70 
Hs.8546 
m.153177 
Hs.14453 
Hs^88555 
Hs.180338 
Hs.7874 

Hs.7155 
Hs^2968 

Hs.348140 
Hs.147487 
Hs.73605 
Hs.274337 
Hs.122329 
Hs.188361 
Hs5B934 
H3.1102B6 
KS.19S641 
H8.460 



gb:QV4-OT0021-28129W)7&fl01 DT0021 Homo 
hypolhaGcal protein RJ12287 simiiar to 
ESTs 

caOiepsinB 
ESTs 

musdeUind posopMa)^ 
generd transci^Oon fador U, 1 
KiAA0790pretaIn 
ESTs 

nbosomai protein S21 

aldehyde dehydrogenase 4 fannOy, meinber 

ribosomai prolan L31 

goigi aubanfigen. golgin subfamily a. 3 

gb:ty77d05jcl Na.CGAP.Kidl1 Homosaplen 

Homo sapiens. ck)ne iyK3C:290e, mRNA. comp 

ribosom^ protefn L23 

hypothoficd protein PP1057 

>Kel avian reSculoendolheliosIs vfral 

pinv2 oncogene 

low molecular mass Ubiquhone-binding pr 
Target Exon 

glycoprotein (transmembrane) nmb 
gb:CM1-8T0601-1 80200-1 21-b10BT0601 Homo 
hypothetical protein FIJ20142 
progestin induced protein 
ESTs. Weakly similar to 138022 hypotheti 
ESTs 
ESTs 

ESTs. IWoderateiy simHar b ALU8_HUMAN A 
ESTs 

ESTs. Moderatety similar to ALU LHUMAN A 
ribosofnal piotftin S3 

Homo sapfens done 23700 mRNA sequence 
KIAA0421 protein 
hypothetical protean PR01 855 
ESTs. Weaidy similar to A47582 M\ gr 
HofiK) sapiens mRNA: cDNA DI^564C1216 (f 
ESTs 

ESTs. Weakly didlar to S6S657 alpha-IC- 
ESTs, Highly sintOarto AF198488 1 trans 

Eos Control 

cyk)chrome P450, subMy IIS. pdypepi 
Notch (Drosophila) homolog 3 
ribosomai protein S28 
interferon consensus sequence binding pr 
Homo sapiens cONA: FLJ22425 fis. clone H 
tumor necrosb factor receptor superfani 
Homo salens cOKA* nJ21435 lis, done C 
tumor suppressor deleted In oral cancer- 
ESTs, Itoterateiy similar to 21 15357A TYK 
Homo sa^^s done 1MAGE;451939. mRNA se 
K!AA0540protdn 

Homo sapiens mRNA; cONA OKR!:p586F1822 (f 
ESTs 
ESTs 

hypothetical protein FU20666 
ESTs 

Homo sapiens cDNA FU12807 lis. done NT 
ESTs 
ESTs 

ESTs. Moderatdy simflar to 138022 hypot 
adhraQng transcripDon factor 3 



4.20 

4.20 

4.19 

4.19 

4,19 

4.18 

4.18 

4.16 

4.16 

4.15 

4.15 

4.15 

4.15 

4.14 

4.13 

4.13 

4.12 

4.12 

4.11 

4.11 

4.11 

4.10 

4.10 

4.10 

4.10 

4.10 

4.10 

4.10 

4.10 

4.10 

4.10 

4.10 

4.10 

4.09 

4.09 

4.08 

4.08 

4.08 

4.08 

4.07 

4.07 

4.06 

4.08 

4.06 

4.05 

4.05 

4.05 

4.05 

4.03 

4.03 

4.02 

4.02 

4.01 

4.00 

4.00 

4.00 

4.00 

4.00 

4.00 

4.» 

4.00 

4.00 

4.00 



TABU56B 

Pkey: IWque Eos probesel Uentifier number 
CAT number. Gene duster number 
Accession: Genbank aocesston numbers 



Pkey 
438091 



CAT Number Accession 

22448 1 AKD54860 AV652198AV652192 AV65213a AV552127 AV652194 BE935919 AV652017 AV651995AV651548AV54W63J«ffiS19^^^ 

^SsAV^WroX^ X56197Ali03014 AW953629BM263548 BE550772AA7010B4 AI681352AA^ 

AV^0AWra78TO/K 

ASS8MS^48n^ 

A^47TbE6Sk99444AI707W^ 

Wffil 26 BF362770 AI268939 AI435818 BF475318 AI024767 BE174213 AA757598 AA513019 AA902959 AI880794 AI334784 BF108411 

fl^ AVraW^^ BF095608BFO95601 BF095458 AW890091 BF095753AW243400AW898607 AW898616BI^^^^^^^ 

AiS9^AU^8fi2S BE^6^ B&l74ig8 AW102923 D5Z715 BE6994S6 D52477 OS5017 BFg55933 BG623563 AV646254 AA463522 Bi003244 
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BF999056 RB3430 Z29922 T85791 W03942 H63269 A!091537 BF0B65B3 AA345570 H48870 H80720 T83523 BK)39626 BI037700 R00353 
BF155184 N98343 N79072 H01B12 T55581 
429978 35194 2 BE738425BE738323BM126944AVV629678AW265195AI916735AI394255AI573090A1354442AVV512857 A13 

AI354441 A1308821 AA772275 AW055215 A1569705 AI336532 AA805547 AV6B21 25 H93575 AW071 172 AW7e9904 A18839B5 AW265018 
5 AW196655 D79662 BE042393 N75017 AW014741 C75509 BE748621 H92431 AW079261 AW901780 AA329482 AW960115 BI260621 AI767525 

R31663 BI918664 AVyf963196 006195 AI67801 8 
442562 39583 1 AK0S6685BG399272AA187835BF821903AV660550AV560556AW60502 B6564397BE379584BF44ra^^ 

BG054997 AI266043 B1054879 A1656750 AM92830 AW021 142 AI472184 AW170056 AI082443 Al 167921 D59940 BI492088 H74180 AW130886 
AI348677 AI278577 AA761517 A1698203 AA115535 AI264790 AW205074 AA860452 AA554902 AI000715 D62102 BE544768 AI376090 059939 
10 AW242249 AA525421 R34211 R3432B BF248064 BF241437 BF572759 BF21B832 

412636 1438 1 M77830NM-004415AF139065BG6B111SBG740377B1712964 B6000656AA128470B1438^^ 

AW370813 J05211 BG698855 BG740734 BG680618 BG73977B BI765B07 BM353403 BM353248 AW177784 AW205789 AW951576 AWB48592 
BE182164 BF149266 BE940187 BI060445 BI06M44 BF350983 BE720095 BE720059 BE715154 BE082584 BE082576 BE004(M7 AA857316 
B1039774 BE713818 BE713548 AW170253 BE160433 B1039775 AW886475 BNM52S04 BE931734 BF149264 AA340777 BF3B11B3 BG621737 
AU1Z7260 AW364859 BF993352 BG223489 BE819009 BF381 184 BE715956 R58704 AA852212 AW386566 81090358 BF087707 BE819C46 
BE8igQ05 AA377127 BE073467 BE819069 BE819048 81036306 BG990973 BI040954 BF919911 AU1401S5 A19S1766 A1434516 AW804674 
BRS2969 BE837009 BE92SB26 BF149265 AW99561S BE614264 Bt039782 AU140407 BE144243 BE709863 BF98S642 BE001923 BF933510 
AW265328 BG436319 BE182166 AW365175 AWB47688 BE818280 AW177933 BF873679 AW178000 BE082526 BF476866 BF086994 BF592276 
BE082507 BE082514 BE082505 BF873593 AW058840 AW847678 8F804153 AW365157 BEB13930 BE002030 AW365153 BE184941 BF749421 
6E184920 BFB39562 BE184933 BFB42254 BE698470 BE931048 6F999889 BF388816 BE184924 BE15Q646 BE714632 BE184948 BG98S845 
AA131 128 AA093891 W39488 C04715 BF098124 BE8GS341 AW79g3{)4 ALJ603116 BE149760 BE705987 BE705966 BE70596B AW848723 
AW376699 AW376817 AW376697 BG005097 BF751 115 BE696084 AW848371 AW376782 AWB4a789 AW849074 AW361413 BF927725 BF094211 
AW997139 BE865474 BE1 851 87 BE1 56621 BE715089BE713297BE713298BE179915AW799309BF872345BF088676BE705939AW752599 
BG005197 BF350086 BE715196 BE715155 BF752396 BF093817 BF831 190 BF752409 BE006561 BG959922 BF094833 BF094748 BF094583 
AW377699 AW607238 BED82S19 AW377700 BF349467 AI190590 AI554403 AI392926 AU15e477 BI457252 AU159919 AI760816 BF082516 
A1439101 AA451923 AI340326A1590975B1791553AI700963AI1428B2 AA039975AA946936 AA644381 BM314884 AA702424AI417612 
AW190555 AI220573 A1304772 AI270345 A1527383 AA552300 AI911702 AW166807 AI346078 W95070 AA149191 AA026864 AI830049 AW780435 
AI078449 AI819984 A1858282 B1468588 AI860584 A1025932 AA026047 AA703232 AA658154 AA515500 AW192085 AA918281 T77B61 A1927207 
AI205263 BF082491 AW021347 AI568096 BE939862 AA088866 D12062 AA066527 AA7821D9 W19287 W02156 AW1 50038 AA022701 T87181 
H44405 AI910434 BF082513 AM94069 AI270027 AI635B7B AA128330 BG681425 BE706078 R20904 BG680059 BG676647 BF764409 AA026654 
AV745530 81762796 BG287391 AW79a780 BE706045 BE926470 AW7991 1 8 BF087996 BE002273 AWB79451 AI571075 BE067786 AV721320 
AI022862 N29754 C03378 N84767 AA131077 H30146 BE714290 AI686869 A1568892 AI915596 AW105614 AI887258 AI538577 BE926474 
BE067737 BG319486 AA2476B5 AW798883 AW103521 BF989173 AW860878 BE939707 BE185750 BE714064 BE713903 BE713868 BE713763 
BG950164 BE713810 AW365151 BG955489 BE005272 BF915937 AW36514B AI905927 BF992780 AW853812 BG954443 B1770853 86679406 
BG740832 BG681087 BG698430 AA455100 T87267 BE696209 BE696210 B1089483 BE005273 BE872225 AW391912 BE925515 BG677012 
BG741970 AAD26480 BE705999 BG677157 BE009090 BG681378 BE712291 86961498 BG678984 BI040941 AA337270 AW3B4371 AW847442 
BI0S8659BE813665W95048 W25458AW177786AA025851 BE931733BF1 54837 B6949393BE71 4441 AW996245BE711801 AI284090 
BE064323 BE719390 BE940148 BG991212 BF375714 BF349522 BG996267 T48793 81013292 BE001925 AW36S156 AW3651 54 AW606653 
BF763109 BE931637 BE167181 BE713879 BF354008 BF67B726 H90899 AW355145 W38382 A14984B7 

BC003047 S80794 NM 003405 X78138 AY007132 L08439 AW340648 AW131665 A1082748 AI4702O4 BI711078 BF350700 BI496963 AI087141 
AA720684 AA862331 AA605146 BM313650 AI089749 AI359738 N69107 AW995424 AI086917 AI083995 AW340217 N99662 AIB29449 AI089839 
A1608761 AI342365 AI199076 AA908944 AI248943 AU160053 AI191 245 A121 8477 AI077943 AA664930 AI310394 AA872478 AI279782 W61343 
AAS6S9S5 W4659& AA126874 AA223241 AA491574 R84813 AA491520 BG055114 A)469689 BE4645S0 AW864539 H67097 AL534332 C21397 
AH)85941 AW028427 B6939820 AI697089AI039008AI125315 A1655561 AW150042 L20422 X57345 BI458375 AU142B52 81566601 BEB88276 
AU119302 BI603754 BG705953 B1598754 BE296713BG002538BF951911 N29226 BE909424 AV698274 AV683116 AV708195 AA127798 
AI124697 054224 F08031 AA340253 BF923383 BM467B08 81546644 BG777200 BG705941 BG468577 AU127209 AW403970 B1597630 BW58091 
AV689560 BI669267 BG506219 BI837163 B1667275 AA343750 BE7e3112 BF671405 BF954720 H67636 H69456 AA484894 BI869271 BF998207 
N31547 BF945817BF947918N906306G980194AA1S6681 BI493502 AW2731 18 A1473820AA608688A1359337AV7 12091 AA084101 6F592038 
H13301 A1B64305AA505883A1423963AW084401 AI917740 R69858 AA033631 N79982BE88S276AI635674AA096126AA700018AV707753 
A1082545AU145681 AA629032 AI421 367 AA740589AA150830A1248541AA9B8608AA1 50478 W65437BM31 0234 AA262704 T28031 AI811116 
BM272753 H21979T15405AA938406 F04963A!188295AW152629AA905196 B6223058AI831016AI765457AI8111^ 
AA775958 AI261476 AA219489 AI688035 AI872093 BE537084 AW189078 082630 AI123121 AL583492 BE350791 R69901 W65436 BE155392 
81089081 BE155394 AL120538 AW166100 AI359620 AI174338 N20527 W47413 AA155615 AI272249 H25293 BE61455B H69864 AW383484 
H78021 H1 1517 R56892 H23204 N21630 R82499 AW383522 AA774536 AL534331 W94127 W46459 AI866231 AA513281 AA192465 H69844 
W85827 AW383642 AW383529 AA171496 AL537424 BE814866 BF823254 R82553 AI809817 AI559406 

AK000357 AI571830 AA579613 AA668790 BF939495 BF1 98886 AI990982 BF591561 AI809189 AW410232 BF739769 AI144392 8F438721 
A1707495 A1423359 N52503 BE855784 N94367 AI023931 AA563662 AI744624 AI221 298 AI299523 AI299520 BI49131 2 AI45271 1 AI360730 
AA622643 AW195955 AA470473 AW392767 AI218127 AA873430 AA906005 AA748160 H89523 AW0202S2 AW68498B AA970424 AAB79079 
AI581622 A1361357 AA468498 AW008694 AA903275 AI187724 A)187718 AAB90505 BF092771 A1066655 BE714172 BG289028 AW080390 
BE001197BF132049 

BE880923 BG390191 AW470082 AW014585 AI423255 BI714731 86054894 AW780248 N316B3 AW654132 AW467353 AI983152 AA617918 
BF447795 A1088357 AA807328 AA576970 AI741153 A1755003 AM74016 A1422030 AI3481 14 AW997085 BM271753 AI363147 BM31 1311 AI146640 
AI246771 AW512619 A1359020 BG054897 AI292234 A121 5830 AI283836 C06205 AW503423 AW272680 N33205 AW873021 AA070724 Ar753886 
AW192487 AI087151 AA658909 AI346368 AI335677 AAB25442 AW440066 AW1 31357 AW513210 AI082314 AI085455 BE551404 AA7B0704 
AW008596 AI796964 AA917471 AM00531 AA668626 N72207 AI306482 AW440562 A1084687 AA347280 AAD63536 BF477389 Ai241652 AA931643 
AA484310 AAB12486 AI032216 AA665779 AI916336 AI350590 BF198106 AM33377 AI300838 AI672626 AI282741 AI351487 AW105544 AA973627 
AW517914 AA715424 AA508454 BF334080 AI274616 AW357201 AW572619 AW469088 AA382095 A1368364 AI146934 AI3571B0 AI361181 
81911347 81871044 AA138325BF084010BF084007AA335788AI92087BAA809614BE932941 AI678261 C75308AI148479BE178174W88513 
BMU)13827 

AB033091 A1^20743BE811B13N53332 N99716AL661910AA280555BE710392AV705100 AVV293978AW44455^ AI873695 
BG700891 BI553517 R8051B BG779771 BG534451 AA479402 AW981 580 BF061430 A1857643 A1768486 AW5121 1 8 AA479302 AW770384 
AW072470A!041596A1049699AW592885 AA976261 BEB79747 BG114119 AW183811 AA909938 BF571621 BF350794 BF351375 BE925699 
AL050294 BC010371 BF982270 AL042656 BF095732 AW812618 8F095731 BG212397 BF678765 BI038602 BG388664 AW575337 BG289398 
BE939598 AW805570 AA527097 AW150540 Ai693720 AA743364 AI91 5793 N48185 AI573107 AA043474 A1351 61 5 AI969490 AI910763 R50866 
AI699181 N73808 K08164 

BM006551 AA367152 AW953705 AI631B33 AW237429 AW027804 AA729038 BE503409 AI521 935 BF739953 AA702982 AA557633 AA780065 
Ai218139 AW194264 020120 AI08271 5 AI969980 BEB576B8 BE32671 1 AW953706 A1393749 A1383821 W67199 A1431759 AI798526 AI521794 
AI796380 AW1 17545 A1749857 AI537634 N50122 

AW973352 BF222929 AW016853 BF059130 AI651B29 BE551767 AA558414 AI339359 BF059601 At961 162 AI341422 AI20624B A1205165 
AA548736 AA768578 AI539081 AW025957 AA736837 M79575 AW594357 AA480892 
444060 6315 1 AKD55070AK055612AKS00174BG619806BG108086A1741949A1004176BF891936BF3765^ 

AA490531 AW874981 AA740788 AW27475eBF51 2523 AI521 278 AA548759AI802431 AW0516B2 A1528247 AI799606 AW0B81O3 AA236549 
AI191529 AVITO168 A1168451 AVW73812 AVV090611 AVra03593 A1215845 AITB^^ 

655 



15 
20 
25 
30 
35 

40 413787 *m2J 

45 
50 
55 

410143 MH1244.8 

60 

426295 510J 

65 

70 4503n 12109.1 

75 

425998 138046.1 

427521 513212.1 

80 



wo 03/042661 



PCT/US02/36810 



413497 
430068 
409208 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



434280 
434442 
410730 



422669 
41440S 



44249S 
450139 



406619 
424677 



451129 
432666 
418259 



433162 
406870 
445493 
406797 
441623 



430604 

433891 
45S263 



406794 
431155 
414150 
430915 
436716 
449625 

430283 



1518002.1 
1177709 -2 
10117J" 



422960 11862.2 



1474904 1 
111738J 
114639J 
22448.4 



425284 3834.5 



874209.1 
112689.1 



928718.1 
34017.1 



0_0 

2518 37 
917280 1 
1495511 J 
144 7 
133853.1 



451653 MH1 15364.1 



2167905.1 
0.0 

423456.1 

0,0 

336^.1 



5477.6 

647290.1 
26143.1 



0.0 



0^0 

1235742.1 
2305332.1 
197844J 
2472838.1 
249224.1 

1418.1 



AA989444 AI080438 AAS52S00 AW237538 AA481Q80 AI246378 AA565227 AA3S8921 AA207051 AA72137B BF438608 AI0862g5 AI8a6630 
AA^1 12 AI864588 AW271985 AW078868 AA725342 BF326598 AA84^ AI082536 AA766664 AI453279 AA435673 A)61 9515 AA879080 
AA234S92 AA890223 AA766824 BI259822 AA393631 AW968640 BE940639 T83865 BE762742 AW897470 W05809 N41 323 T87376 R68544 
H68711 BI087136 

BE177661 H06215BE1447098E14482g 
AA947566 

AKD74047 AI144342 AW014280 BM14512B N28267 AW206231 AA989041 H93197 AW594063 BG236296 AW236608 AW081031 AA765843 
BM144372 AA989341 AI824838 AI953970 AI637671 AW196330 6G427526 BM148789 8F693644 BF881946 

BF090249 AW954947 AW8904a7 AI305236 060845 D60537 AA825429 W36294 AW890410 AW088235 Bf740240 AA448709 AI350279 AA8791 19 ' 
AA319510 BE702077 BE69g015 BE70204& AWg01293 T99319 D81708 6F475488 D6D383 D817S1 BE699260 
BE005398 AA628622 AA994155 
AA66Z701 AAB33929AA737415 

6G427950AA826016A)903441 BG939868AW979154 AA640872BG547134 AA457091 AW368880 W88852 R94779 AA088823 
BM47S665 BE644917 AW770789 AW952971 N&4863 BM2832S9 AI224545 A1184866 N89114 AV«18902 AI440169 AA809472 AV654440 
AA281642 AU185230 AW337382 AI872923 AI53T113 N73882T83378 H63731 BF671764 AW897824 AI811204 AA344646 BE009112 BG899664 
H91240 R60548 N41701 

BM476120 BE672181 AI597653 AA938187 AI280879 AW340123 AI912727 AI0B1775 AI089556 AI191349 AI871604 AI631607 A1B908W AI701917 
AW771624 AA663041 AI991576 AI160622 AA771763 BE089784 BE089788 AI222942 AW418516 AA32921 1 AI095736 BE550294 AA738345 
BE218683 BE670546 BG149S05 BF061776 D19821 BE005786 BE178892 6E005728 BF841237 BE005648 R27634 
AVVK47^ AA315006 AW856^S 

AL047596 AA393792 AI670731 AI037957 AWB74364 AI038137 N62286 AI241379 BE501096 AW09069S AI927389 AI669226 AI369437 At371075 
AW612409 AI68671 1 AI183289 AA477717 AI076122 AA635ig0 AA700g84 AA761508 D81020 8F57S223 AI35618^ 
BG283489 BE090666 BE090664 BE090662 H26545 
AI184717AW518883AF121173 

AK001838 AU135179 AU134241 AV651702AV650032AV551304 AVB50101 AV651253 AV651888 AV651866 AA628554 AV651355 AV651174 

AV651172AW856145AU117S99AU135386BG254665AA166919BG483981 AVV809608BG494194AA622811AI676156AA^ 

AU1 33725 AW961387 AU1 443B7 AU151757 AA551031 BE67541 2 N34769 AA71 3483 A1890079 AI588918 AI361889 AI209020 AA668981 AI240990 

AA741144AM90899AI200221AVW89574W96201BG154182AV655159AA328145N36348AI081357 N76715AA6 

AA897483 AI886459 AU155873H04255AW243986 AA557749AI286227 R88691 R33453 AW386097 

AA908472 

U09414NM.003438 AA503545 Al022449AA043458AA766074AA765442AAB05052AI02e211 AW609708 
AWS02364 AA828822 AI982587 
BE072881 AI762181 BE072946 

AA558585 AA565499 AI360576 AW204069 AA991648 AA864939 

BM310925 AA426110 BM310629 BF434288 AW015091 BF47599SAW1 18867 BE675186 AI688568 AI453594AW590589 AI652425 AI827969 
BF056946 AI802866 A1393360 A1476224 AW59083g AW136271 A)4582S2 AI524726 AA843768 AA782158 AI336058 AI097532 AW451563 
AA459408 AA459633 AA418444 W23607 BG940150 AI493445 AW054729 AI221929 A!868744 AA215405 AA766713 AA62154S BF928317 
BE464132 AI990909 AW271459 A1262061 AA215404 M74332 BG940151 BG952261 AA972115 W96315 AA589586 R69057 BF766886 BE769254 
W05240 

BG1 19563 AW976776 BG498478 BE245304 AW450934 BF508792 AA599489 AA599477 AI805268 AA055489 Al 128758 AA761425 AA731858 
AI302271 BE219432 AA687294 AA018972 AW629429 Z45675 AW961626 T30940 R34554 T84712 BG986694 
AI742311A1025842AA578843 
AA075144 

AV71 1317 AI809938 A1808768 AI240593 A1915771 

AI432224AW276890A1499346AA937014AA653573AI318525A)246219AA961S91AI270640 

BC022413 BE395396 BF75417S AAS06621 BE706665 BE706678 AA723159 BE153169 BE706729 BE706558 BE153312 BE706705 AW371853 
AW3718496E153241 BC017410AI337912AI090244AW0903006E219837AI623661 BE501576BE5D1734AI742232AI023g64 AI458424 
AA975373 AI288904 AI984583 AAB90325 N32562 AI358102 AW241694 A1038448 AI672071 AI01 8389 AA576391 AA977874 AW189392 W37448 
AA612894 AI277548 H89551 AI699774 H89365 AA315805 AW579186 BC014584 BC014581 AW7B0125 AI672414 BE32B145 AW600919 BF031306 
AW172758 6E708322 AA345675 BE875779 H28241 H2S316 BF540913 BG179688 BF1 10202 AA528775 W37573 BE041644 AW368504 BM129522 
BM129822 AI122760 BE718200 AW887498 AA149420 BE706307 BES39395 BE748765 AI373653 R75904 6F979185 BF691393 B6495S95 
BI094458 BE706702 B6496559 BF248373 BG494800 

BE219720 6F475241 AI571723 6E2ig848 6!789268 AI224899 AA724864 AW771467 AA4802S5 AW845616 AI44029S H52600 BE218790 At681575 
AW300064 AW262133 H21588 AI36301S AI884914 H8694e 
AW182329 AA613792 T05304 AW856385 

6G216963 BG164885 BG213710 BG204771 BG193014 BG197556 BG217461 BG19818S BG183594 BI59642S AA115605 A1589156 BF439839 
BG188832 AI3S9615 BG1 90473 AI024233 BF439574AW1 18085 AI672797AA610042 BG212008 AI204382 R70913 AA033534 AA781036 A1627278 
AA307285 AA03421 8 BG482749 AW162429 BI602460 AA721989 AM76516 AA476416 AA903019 BFl 10864 AA307286 AA1 15471 AW964555 
AA423828AA1 15129 AI419107 

AW088535AI889321 AA954221 AI337552 R42581 AW194670 AA0S4862 AW001147AA864374 AA630699 AW276176AA676615AA857g65 
A)625428 AA580792 AAS82038 AA581668 AA658065 AA828156 AA857160 AA9361 03 AI149335 AA936925 AA581684 AA9541 98 AW238461 
A)281504 AI265812 AA583267 AW236162 AA876535 AW304266 AW474334 A1559415 A1589241 AI660952 AA641 137 AI431696 AI688844 
AA552513AA564954AW090553AI205612AI245753AA954883AA947909A1866014 
A1890243 

AW971213 AA49392S AA493567 AA876839 AA934462 
T40644AA136139AA136026 

BG425760 BF997600 N48516 W73454 BF816344 BF997601 AA4B8953 
AI433540AA804981 AA728984 

BI918168 AW779760 N48674 AI375997 AA235370 BG699146 A1913631 A!49e402 AI016320 AA323193 R49021 059344 BG986750 N45526 
BG986917 T61382 R49391 R45432 AI203107 R35004 F07491 R25094 R35360 

BC013728 BI0&4032 BF090365 BE410706 N36391 W80436 BFB13124 Rg0857 R62778 K00591 K02329 AA355285 AK000826 6C008721 
BG744004 BG479141 BG823856 BG479061 AU121103 AU138176 B6702567 81599840 BG422775 BG700944 BE280747 AU138529 BE269929 
B6704110AU123329AU1220S8AU138135B6723106BG722291 BG831041 BG705239BE740517BE207133BE252867AU139772BG714385 
BF568538 BE304393 BF971755 BE267985 BE272012 AU135396 AU124535 BIS51682 BG700612 A!815488 BG475195 BE388Z73 6E391517 
BM148991 BF929247 AV653435 AW250299 AW249189 BF093150 BF093173 AW20532S BF057101 BF000551 AI341 108 AL162009 AK026135 
BE907957 W73238 B171 1457 BG926027 A1816428 AA837518 AI754405 BG179142 AA481485 AW023435 BE855747 AI885101 N52163 AI016096 
AK)90204 BE677045 AI523320 AI126855 N2&501 AA043680 AA976459 At039S90 AI937917 A1381000 W94866 A1375797 AI07g601 AI168236 
AAS99882 AA084368 A1342635 A)190294 N26093 AI085234 AI298169 6Fg39715 AI223164 H98704 BE21892S N2g394 AI918735 N41520A1 147784 
AI918796 AA854317 N22133 AA199850 AI149728 AA121263 AI051074 AA565116 AI097349 N22209 AA552917 N331S1 D52422 BF477483 
AA476599 AA525787 AI279198 D53353 AA738063 AA558406 BI496334 AA999948 AA42S847 81496335 AA909624 BF197591 AW023259 A(652819 
T31424 AA088213 T31 1 15 A1206650 AA978796 AI948989 AW248762 AA449265 AA290587 AI682521 AA310227 238743 AA935369 AW1 19141 
BF941087 A1470657 A1349451 AW079338 045665 N21640 T30071 AI446705 R60220 BE833481 R49680 R70049 R41223 R32402 R69984 R701 1 1 
BM476906 BE293815 BE382443 BF1S5692 BE720638 BE931983 BE720594 8F08589D AA336953 BF081638 AA3S8915 AA3841 16 AA360142 

656 



wo 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



433009 
415995 



434372 
436812 
406799 
430968 
418905 



406813 
418876 
447197 
457068 



440638 
400281 

406781 
440129 



419970 

432586 
426395 



445093 
43B962 
425097 



437386 



418886 
400244 



2142268.1 
2117 1 



858779_1 
659779^1 
0_0 

1237115J 
517.1 



B1667664 B6823235 BE31 5559 BE301 958 6E891 114 BG826267 BE253680 BG979094 AV722757 N67629 BG997927 BI915769 BG660692 R62777 
BE2S1 1 16 H56358 AW369586 BG677759 B1044604 R75787 BE770980 H69529 R69983 AA259238 R37827 AA310379 W01 103 AI873384 
AL554578 BR)38102 K87182 K87517 K01574 TS2S73 N28881 AA301397 T9237S R68401 AW800466 A128B172 BE676949 D54019 AI909769 
AW798415 BI222383 AI393171 C15260 N269S9 H17129T53095 TS2494 R68602 A1364765 BF687869 AI617035 AW105354 BE293820 H14206 
BI093066 C14083 AL534349 BE255B83 BE254068 AA428399 AV\^79360 AW579381 NS3144 D60748 D54020 BG2921 06 H96705 D52423 T36174 
D54161 R73016 D55021 B16572ra H83444 BI016954 D54163 R15563 B1818664 BE1S2207 B1046S02 BF885667 8E613212 BE165773 8F149332 
AW607O45BE305200 

409019 32320.4 BM480413 N28908 H39792 6E240826 BE682093 6E240827 AW868837 8F739795 AA700834 AA769597 AA48966B AW968806 AW085196 
AI093280 A1218457 AA063138 A1632958 AW51 5005 A1570S30Z41 724 AA7487B9 AI696584 AA062544 AA773643 AA4gQ285 
AA76166BAA573621 R09670 R92814 

M9567BNM.004573Al^754AI439623AVW74413BF898880AW630959BF875526AW402206BF818690BF89^ 
BG002913 AL530753 A)524064 AW769231 AA464970 AA293723 AI095051 AI953375 A1982938 BM14G050 AW575804 AA962489 AI655426 
BM146046 T28538 BE241936 T89023 Bi910963 A1416S66 AI7671 1 1 AI422290 AW468260 6E676853 AI655771 AI961 755 F04675 AA682826 
AA737606 BM1 94382 AA91 2021 AW183098 BE676682 A1962227 AI591366 AA621765 AA293724 R51642 F10194 BI909727 BFB92632 AW950600 
AW950138 BG258567 6F892649 AI800647 BFB92710 AA353176 BF894726 AA465038 Bi040B89 B1837749 BE244320 H18054 n4300 AW797026 
AA926790 F32814 AW751 282 AI933994 AA578823 T78372 BF899896 BF882808 BF974989 
BG622121 AA631424AA988296 AA631373 
AWg78773AW298067AA810101 AW194180 AA731645AI690673 
AA90B548 

AW972830 AA489820 AA527647 AA570382 

BM469076 AA533027 AI127512 A)368802 AAS33141 AA700560 AW576028 A)6108S1 AI435361 BM129172 AW474544 BM128899 AI814292 
AW502039 AA531243 BF941858 AW502037 AA702337 AI419854 AA662755 AA934364 AI3W510 A)291 136 AAS05263 A1144527 A1076919 
At633534 AI242473AA938561 BG0S5372 AA512894AI671356 AA962403BF808010AA663911 AA847056AA513301 AA359069AA377265 
BG291206 AA402298 AA665766 AW601002 AA302290 AI305842 AW800873 AA302492 AA478427 A(B17291 AW801 104 AWB0102B AA665744 
BF1 55979 A)374743 AA476431 AI159846 AI369757 A)800672 BF435788 AA255451 Ai937707 AW006198 AJ280363 BF062434 AW801 1 15 
AI919181 F28413 W04214 AW152380 AW901567 AW901570 AA888371 AA384251 AI302846 BE701902 AA931606 H42673 R33703 AW901556 
AA009816 AW90156d AW352200 AA25BS58 H15928 B1087170 AW800530 AA36906e N98562 H28552 N34644 K97650 H00956 W70039 AI142831 
AAOOgSI? F37136 N70289 AA531347 R72374 H27488 R66605 
0.0 AW276131 
121279J AA740616AA654854AA229923 
2176805.1 R36075 R36167AI366S46 

1196.1 BC022444 BC004138 NM.000970 6C020679 X69391 D17SS4 BC013863 6G779630 BG574189 6G571986 BG494603 BG575713 AV702244 

AU145720 A156B177 BE350654 BG943995 AA730918 AA63W24 AV744573 AI61 1573 AI613381 AI61 3403 Af250148 AW301759 AI31 2307 
AI376488 BE139S82 AV743357 AI371 951 AIB13376 AI6t 3570 AI345719 A131 2458 AI61 2697 AI334453 AW268747 AW26831 8 A1343025 AI33S454 
A)31 3096 AI30801 1 AW075176 AI3487B2 AI334479 AW268332 AI61 1433 A1371906 A)312459 AI371647 AI370051 AI335217 AI312262 A1610971 
AI348798Ai613376AI612698AI348772Al345434AI345283AI312292AA775853AI611363AI345286A»^ 

Ai802857 A1609452 A1583301 A)34925B AI345645 AI34501 1 AI340442 At335410 AI310907 BF055B80 BF0S5774 BF055754 BF054712 AI583261 
AI309664 BF054905 AJ370092 A1349212 A)345583 A)334554 AI312242 BF055856 AI612686 AI611382 AI610952 AI345301 Ai312235 AI610936 
A1609836 AI340650 AI312472 BF05484O AI802838 A1345448 A)336508 AI313098 AI31 2293 AI31 2277 A)254912 AW3020gi AWa74788 A!61 1966 
A)61 1948 At583382 AI5832B4 AI583245 A1379398 AI371935 AI340554 AW301840 AI862239 AI612025 A)611869 A16Q9852 AI345449 A)312375 
A1312318 AI311993 AI310895 BF057960 BE178604 AI802880 Ai612115 A161 1866 AI370059 AI313286 Ai602656 AI611438 AI311987 Ai802821 
A]3129976F05471lAI6118B9At3764B3AI252445A)611577AI349l39BF054925AI802840AI583286Al340739At81^^^ 
A!308909 AI61 1883 AI313275 BF477097 At609466 AI348792 AI340466 6G944254 BG841457 AW302007 BG944413 B6941475 058042 AA328426 
60429596 BE395392BM424161 BF663310AV727384 6M473884BG531178AA307591 BF96S435 
6G009500 A1376551 AA897445 T67714 

BE564506 Z47727 BF028489 BI597545 AA152273 AA730753 W07504 AA406507 AA316220 AA424583 N35844 N72473 AA442109 WOUSO 
AA747605 AA235894 N41438 AA382786 Aig06934 AI906935 BF091 197 N85572 N75678 AV758098 BE719745 BE719744 BE719735 
AA639388 AA584945 AA776364AA776365AA865528 
AI732997 AA977633 AA86581B 
AF075009RB3109R63068 

AK055270 BE348291 A)190269 AW612022 AI269S08 AI266578 AI2G9675 AW271406 8E879851 6F574163 BI497126 AW903775 61917368 ' 
AW15Og0OBF244813H79201 

BC022881 AU1S0944 BG750763 AW754175AW857737 AI91165g AI050036 AA554053 AI826259AA568546 
AK057067 8F111915 AW249706 AI222199 AI887251 AtB18753 AI244687 AW135782 BF476693 BF080935 AI197928 AW251092 AI685636 
AI685630 6E67345B AA932894 AW205200 AA720784 BG236866 AI583152 SE677464 AW149338 AI926439 BG031285 BF915924 BG9825B3 
BI060025 60991393 BE830027 eE151985 8E152208 BE152002 BE818337 BE717581 BE81B354 BE830030 AA377477 T51255 BF920761 
66170636 N90052 
AI2D7197 BF773544 AW196462 
AI207343 BF613684 BF92B775 AA828585 

AF070570 BF439282 BF109960 AI480268 AI038060 AW082339 BF516290 6E218214 A)469956 AA039955 6E644674 AI861871 AA766231 
AA845840 W8571 6 AA676253 AI087188 AA022908 AW9S31 78 T33195 BF59471 1 AA48B969 R55652 D81245 080778 D81560 AW960933 
BF930897 BE698103 AA040024 eF515980 6E168475 AA453247 AI267601 R60894 R44223 T33194 AA114936 W31640 W38B29 W39109 
AA0O4849 H419S2 W86634 BF031932 BM423354 AU)41825 H29654 At908178 WB57S4 AI90S762 AA309860 CO4540 AA340246 H84669 
BM476605 61545004 BI834636 6G112453 61199049 BG1 12759 AA149&46 H97925AA3061 21 AA313204 W52451 At734997 AA931 168 AA429766 
N47913AA5B4321 BF940241 A]0B3648AlO8941OAl347705 A]343861 AJ186232A1889031 N98464 AI820039AI459034 AV652512 AA622990 
BEe57200 AA932998 AA740573 AI826264 AA665683 AI344550 AI027349 A1056087 AA442777 AA603724 AA673347 AI056717 AI0921 85 AI032895 
AA53S689 BF806025 BF806081 BF805985 BF746099 6F746097 A1309259 AiS97603 eF806066 AI090653 AI129205 AI248410 H72993 AW615341 
BF605990BF805982AA993819T34373T3S604 H56242AA648145T35907BF808691 N9401 5 AV703438 66774276 H82341 R76371 
T657S4 AA229658 AA2298S7 

X79449B(»17853AL121035BF198384AW119044A!028023AW451110AI971911AW015069AI079170AI37^ 
AA449679AA740864N^L001111 U18121 AL567297 BG773801 BF973874 AV687104AA527579AA843525BE706355AI074589AI523475 
BE690249AW40S263 BE074258AV729485BF8D98106G0566igAA677244BE179338AA622264AI460106AA740411 AI499168AI078223 
AI682923 BE695559 AW375385 AA788739 BG384978 Z40874 T17054 F09869 AW844043 U 10439 BI71 1870 AW245957 AU158567 AA67930S 
AA679316 W72510 A1346029 66059762 AW25t062 AA132373 A1925621 A)860230 A!3401 72 AW1 92691 Ai707960 A!094937 AI0421 1 5 AI200901 
6E32B452 AA644578 AA551209 6E351065 AA970781 N68609 AW002028 AA160826 A1422774 AW8731 14 AW073597 AW664483 AI21 871 0 
AW0205S0 AW190607 AI984545 A1871921 A1333970 A1452887 A1818335 AA398655 AI554424 AI274187 6E465703 AW512940 AW241356 
AI923954 AA576649AW16B294AA813181 AA912168AI049738AW514073 AA548255AI569630BE710031 AA244182 AI341697 AA563904 
AI537990 AW517908 AW172943 Z39498 AI750294 AW150414 A1253293 BE825720 T31860 AW150775 D20310 AA150892 AU133933 BE781 148 
AL0389S7 6F910g79 AA3S2297 BG988142 AW372175 BF229106 AW665705 BE093482 BG990396 AI499917 AA054452 H05484 AI628502 
BM467331 AU140570 AI135417 BF947202 AW391926 BE813418 BFg98473 T92021 6i021048 BM048783 AW501366 AW501342 AW501S49 
6E939021 BE707147 6E1609746E305207 N49011 AA9471ig AA678801 BE538876AW897428 66329646 BG81 8540 BE542344B19192S0 
61253018 AW130996 BE074249 6E895428 61034862 8E083277 6F952166 
416304 1093209.1 BEB83520BI057842AA215702AA215703AA36BQ066E008876 66066655 
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371165.1 
9758.4 

0.0 

2607882.1 

52134.1 

13569.1 

6633.1 
22291.3 



175963L1 
195763.1 
23175.? 



5541.2 



245947.1 
12188.1 
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438141 1 173217 J AA778849 AW946B71 AW946782 AW946955 

S 94aT"^ S^^^S?7AU124990BG829759AU143705BG701663BG69W81AU131718BE515^ 
Sr(287MBM2S3692AU099013AI241350U8B316AV«5^ 

AU148749 AI620647 AU151769 AU153404 AWOnJ6S6AM57^AU153077 N99966 AVV050940 BG055674 AA506657 AW301529 AI91^46 
A)61123S A1266081 AI334542 AVWJ71277 AI312434 AW3031 14 A1435544 AI436466 R22972 A1472987 M77886 AW072883 AA318683 019761 
AW050566 AI312433 AA32B444 R72435 AM30721 AI142599 AA582290 AU148896 AA721233 AW628132 R93935 AA10O71O B1520773 AI934172 
BG222461 H85359 AW074639 AA0171 17 B1025412 AL582142 AA443547 AA586793 AA777535 AA693844 AI018661 AA577422 AA522800 
BF054818 AA102378 AA757993 AA687769 R55540 AA505784 BI82070S BI767939 
410704 1054673 1 AW877458 AW877524 88076922 BE166912 AW840534 BE076754 AW797829 BE166805 BE166926 AW877482 BE166927 BE166932 AW877523 
iuy»o/J.i 0E156928 BF351394 AW877522 AW877528 BE166861 BE166866 BE166913 BE166919 AW877456 AW877537 BE076866 

AVVB40571 

400233 11259 1 BC004324 NM 001020 BC007977 M60854 BM050628 BG829809 BE385504 BG744451 B1826914 BE440007 BI2^56 BE395117 BE389334 
" BE255792B1194159B166a21BB1194376BG715213BG714408BE392513AW22219AW328077B^fW24171Bl82B2^ 

BE392943 BE394033 B1B58915 B1558334 BE621019 BG706995 BE791985 BF987484 BI193635 B67618S9 BM466537 BG747165 BG827488 
AI133550 BM01151 1 BI2272B2 BG489212 BG478388 BE727789 BI160880 BG831707 BG324692 BM470427 B1083889 BG831605 BG7541 14 
BG420536BF308210BE384213 BG832271 BG828032 BG481641 BF205675 BE899041 BE27155B B1193807 B1159866 BG473786 BG397178 
BI194428BI117210BG768326BG7S9507BF97564SBF343657BM020598BG831082BG829943BG8a501 BF306557 BE562511 BM050145 
BM017978 BI193934 BI160764 BI160371 B6754991 BF973348 BF663234 BF032537 BE388168 BM009051 BI192794 BG831002 BG830459 
BG764737 BG761608 BG481705 BG104314 BM464565 81261500 BG831857 BG831684 BG829852 BG765030 BG760419 B6760268 BG749762 
BG480900 BG419627 BG248771 6F975542 BM042233 81161149 BG831302 BG830033 6G829626 BM050064 BI193014 BI161 360 B6822729 
BG110091 BG106500 BI258369 B6831982 BM45B301 BM019513 BI161350 Bll 14178 BG481969 8G474870 BF974048 BF971122 BE741405 
BE3952e9 BG832027 BG831469 BG490895 BHW13638 BG943529 BG831012 BG829471 BG686284 BG337575 BG338551 BF208677 B1258301 
B1160946 BG105893 BF183072 BM459542 BI193881 BG832043 BG831323 BI194545 B1160958 BG755930 BG706018 BE743885 BM465145 
BG831227 B6774290 BF683451 BE907161 BM045391 81194396 BI161269 86747091 BG546643 BF984853 BI160208 B1226402 BI226336 
AW328236 BG339458 BF972634 BE909808 81160988 BI160251 8G828764 BG8268606G758360 8F568228 81818282 81457127 8G831491 
BG759864 AIB30010 BF56B381 BE907238 BI161172 Bll 16773 86827153 BG825088 BG335419 BG109404 A192906B BE906354 8E408564 
BM045000 BG339617 BG282794 8G335767 BE907263 BF568921 BG829951 8G479305 BG260397 AI922228 BE301975 AW516055 BG480919 
8G480626 AW196817 BG335261 BE906157 BE395717 BE391427 81192954 B6829757 86476379 BE301536 BE394727 BE257695 BE905344 
AI433577 BE894416 8E886992 BE409223 BF034768 BE904O77 BGB3C886 BE909153BE907998 8E395767 AI871751 8E744523 81192663 
8G831 669 AI000225 8E743838 BE272515 AA62B07B BM463802 BE393375 BE393033 AW170187 BE730961 BE395410 8E744572 BE392297 
BE391448 8E390780 BE388821 BE258477 BE90S970 BE901567 BE898833 BE880325 BF726889 BE910504 8E390753 BE390131 AA550542 
BE744156 8E394125 BE742207 8E395255 BE392942 BE894335 BE37B222 8E906928 BE904650 BE393704 BE620999 BE515162 BE378753 
BE272370 BE907458 BE612801 8E392484 8E907636 BE907353 BE910491 BE909796 BE905331 AW248173 AI683576 BE908826 BE620180 
8F037570 8E908312 8E615015 BE256977 BE746875 BE394133 BE391478 8E910068 BE907185 BE742109 AA995746 BE561195 BE908825 
BE905472 BE906509 BE906017 BE910442 8E514657 BI261969 BE741707 BE392216 B»«MZ793 BF570283 812621 19 BE395707 BE37B298 
AW327827 BE394422 8F569178 BE263240 A1700512 BG830290 BF569308 8F569156 BI194587 BE390831 BG745096 AI681675 8E39S674 
AA136372 BE279892 AA442822 8E384898 AA313519 A1878886 AA305904 F33366 BE394852 F29153 F33618 AI133637 AA300009 F34063 
F29455 AU099691 A1905085 AI906656 AA343249 BE388691 AW404280 AA379888 F29022 BF089981 F31013 F24305 
413891 823 1 8E271020 AI925430 AI806151 AW129911 AA828002 AW003539 8E042625AI287859AW77B973AI621 173 AI991000AA846016AW1 50029 

AW169748 AA649945 AI358496AI470921 BF434211 AW513748AW451232 A1953739AI249448AI040580AI65S280AI637976AW194345 
AW61 1997 AI367197 BF064039 F29558 AI537342 BF593207 AW879538 AA973211 A1674328 AW879559 BF06ig61 AA481914 AA426532 
AAf426653'AA480106 AW243290 BF5131 02 BF346057 AI763358 AW003726 A1139045 AI570748 AW237602 T57492 BE887212 A198931 1 
AA133045 F23464AA576416T15590 AI650891 A1950958 A19&3931 AW515101 AI650820 H81989AA508473 
410584 35319 1 BF828833 A)968217 A1651409 AI760574 AI147562 AW001418 A1146791 A1650589 A1952939 AI432373 AI964094 Ai963870 AM20438 A1336803 
AA809634 BF590828 AA741075 8171 2639 AL1 34637 BM254338 AA527993 AI867208 A1439038 A1684987 AI631698 AI587126 AI637622 AI651931 
AI867525AI783674 AI638281 AI825752AI339197 AI653411 AI341372 AI673213 AI673191 AVm9768Al527W 
Ar741624 AI401569 AW190430 AW196390 AI829182 AI523816 AI760522 8E505014 A1917343 8G818909 AW009307 AA927544 AA825621 
AA829400 AA52730/ AI887999 AI865022 AA885063 AA653458 AA483816 AAB36167 AA505879 AA421004 AA252526 A1380678 AW195980 
AA649133 AI742276 AVW15700 AAS95019 AAB77835 AI701658 AA729793 AA535004 AA326792 AA5051 13 AA603726 W6B390 N90130 AA489461 
AAB30462 



TABLE sec 

Ptey* UniQU8 number oorrespondbiQ to an Eos piobsset 

Reft* Sequence source. The7dlgitnumberelnthlscoluninareGenbankldentiilBr(61)numbei8. TDimham.elal/refeiBtothepuWicafloncnliOed'TheDNA^ 

human chromosonw 27 Dunham, el aL (1999) Nature 402:48W95. 
Strand: Indicates DNA strand from which exons were predeted. 
NUx^ition: Indicates nucleotide positions of predicted exons. 



Pitey 


Ref 


Strand 


NLposiBon 


402474 


7547175 


K^us 


53526^28,55765-55920.5753fr5n57 


402145 


8018280 


Phis 


113086-114800 


401091 


9958240 


Plus 


94760-94898 


401466 


6682292 


Plus 


28748-29023 


401113 


9966541 


Minus 


19419-19959 


406542 


7711499 


Plus 


117335-118473 


405086 


8072509 


Plus 


73664-73841.74081-74217,7461&.74779.7492 


401846 


7712190 


Minus 


82775^2823,82912-83022 


404854 


7143420 


Pha 


14260-14537 


40S266 


4156171 




63337-63552 



TABl£ 57A: 703 genes upregulated In tesScdar cancer rdaOve to rnrnd 
TaWe 57A Rsls atwul 703 genes upregufaled in tesQcular cancer relative to non^ 

oeolides. or antibodies. These genes were selected frorn 59580 piot)eset8 on the Eos/Affyrnetrw Gene expression data for each piobeseloManedftom this 

m*sls was expressed as werage IntensHy (AI), a iiom^ 
don^ indicalJ« of hare oncogenic ftmcBM 

Ttm, phosiphalase. or iocLlransportBr). Certabi predicted protein dom^ are noted. 
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Ptey: Unique Eos probesetUenlifier number 

ExAcca* Exemplar accession number. GenBankanes^ number 

UnlGendD: UniGene number 

PrBd.f^LDomsIn$: CetieSn predieted protein domains. AbbrevteSons used: TM« fransmembiane domain; SS, signal sequence; sY. veiy likely to conldn; sM, 

Skeiy to oonlabc other protein domain abbreviaSans are from PFAM (Nudefc Adds Research. 20Q2. 30:276-280). 
IMiGeneTiOe: UniGene gene tiOe 

95th pensnOe of testicular cancer Ah Med by 8ie SOth peio^ 
from both the numerator and denominator 



R1 



10 Pker, ExAocn: UnigcnelD; Unigene Tlfle; Pred.PraLDomains; R1 

424687; J0507D; Hs.15i738; matrix melaltoprateinase 9 (gelalinase B, 92liD gelaflnase, 92W) type IV coflagenase); matrix metalloproteinase 9 (gelallnase B; 31 23 
440119;AA865455; Hs,125331;ESTs, IWoderately similar to unknown (H-sapiens); ESTs. Moderately similar to unknovim 27.37 
421241; X91817; Hs.102866; transkelotase^ 1; transketotase^ 1; 26.89 
1 D 43t840: AA534908; Hs.2860; POU dbma&i, ciass 5, transcription factor 1; POU domain, dass 5, transcrtplton facto; 25.03 
435918; AF263538; Hs,86232; growth dHferentlaaon factor 3; gnwth tfifferBnOallon factor 3; 19.88 

432666; AW204069; Hs.351 1 1 8; ESTs. WeaWy sinwlar to unnamed protein product [H^sajriensl; ESTs, WteaWy slmBar to unnamed protein : 17.74 

419556; U2g615; Hs.91093; chlflnase 1 (chItotriosidase);chitlnase 1 (chilotriosidase); 17.64 

452838; U6501 1 ; Hs.30743; preferenOdly expressed antigen in melanomaE ptsferentSally expressed anSgen In mela; 17.06 
20 417886; AA2145B4;; ESTs; ESTs; 15.95 

412265; AA101325; Hs.e8154; hypothetical protein FU12457; hypothetical protein FU12457; 15.93 

425572; AB01 1076; Hs.158307; undiffereneated embryonic ceB transcription factor 1; undifferentiated embryonic cefl transcrl; 15.82 

423905; AVV579960; Hs.135150; lung lype-l cell membrane-assodaled glycoprotein; lung typo-l cell membrane-assodated gW; 15.11 

419741; NM.007019; Hs.93002: ubfqulUn canter protein E2-C; ubiquien carrier protein E2^ 15.08 
ZD 427584; BE410293: Hs. 179718; v^nyb avian myetoblaslosb viral oncogene homdog^ 2; v-myb avian myetoWaslosis viral oncogen; 14.17 

418696; AW959433; H3.326290; hypotheUcal protein FU12581; hypothetical protein FU125ei; 13.58 

416819; U77735; Hs.B0205; pim-2 oncogene; pim-2 oncogene; 13.20 

414034; U89277; Hs.305985; early devetopment regulator 1 (homolog of pdyhomeotk: 1); early developnienl regut^ 1 (homolDg o; 12.93 
454077; AC005952; Hs.37082; InsuOrvJike 3 (Leydig cell); insu}in4ikB 3 (Leydi^ cell); 12.90 
3U 432730; AI066520; Hs.13135B; ESTs; ESTs; 12,84 

446293; AI420213;Hs.149722;UM domain Ifanscripfion factor UA*.1(h^^ (h; 12.74 

423354; AB01 1 130; Hs.127438; caldum channel, voUage^lependent. dpha 2Alelta subuntt 2; cafeium channel, vollage^pendent dph; 1^46 
450581;AF081513;Hs.25195;TGF.beta4;TGF^ta4; 12.42 

450719; AI096837; Hs.21349; ESTs. Weakly similar to RB8B_HUMAN RAS^EIATEO PROTEIN RAB-BB (H,saplensl; ESTs, Weakly simiar to RB8B Wmi RAS^^ 12.26 
431462; AW583672; H$.25631 1; granin^lte neuroendocrine peptMe precursor, grahln-iike neuroendocrine peptide precu; 11.96 
431354; BE04G956: Hs.251673; DNA (cytasine^)Hnetl9llrBn5to^ 
402199; ; ; Target Exon; Target Exon; 1 1.85 

424578; AK001973; Hs.150890; hypothetical protein; hypothetical prolan; 11.81 
416350; AF188625; Hs.189507; phosphollpase A2, group IID; phosphoSpase A2, group IID; 11.67 
4U 439979; AW600291; Hs.6823; hypothetical protein FU10430; hypothetwal protein FU10430; 1 1.57 
410048; W76467; Hs,343874; prdine oxidase homolog; proline oxMase homdog; 1 1.42 

442573; H93366; Hs.7567; branched chain aminotransferase 1, cytosolfc; branched chain antinolransferase 1, cytos; 11.42 
414812; X72755; H577367; monokine hiduced by gamma Interferon; monokine Induced by gamma interferon; 1 1.38 
421917; AB028943; Hs. 109445; KIAA1020 protein; KIAA1020 protein; 11.15 
HD 440006; AK000517; Hs.6844; NALP2 protein; PYRIM-Contalnlng APAFI-tike; NALP2 protein; PYRIN-Containlno APAFI-li; ia92 
414683; 878296; Hs.76888; hypothetical protein MGC12702; hypothetical protein MGC12702; 10.91 

423673; BE003054; Hs.1695; matrix melaltoprotelnase 12 (macrophage elastase); matrix metaPoproteinasa 12 (macrophage ; 10.74 
433800: A1034381; Hs.135150; lung typeJ cell membranMssodated glycoprotein; kmg lype4 cell membrane-associated gly; ia68 
429120; AK001573; Hs.195530; hypothetfcal protein FLJ1081 1; hypothetteal protein FUtOSlI; 10,48 
D\) 444371; BE540274; Hs.239; fotkhead box ^A1; forWiead box Ml; ia46 
441553; AA281219; Hs.121296; ESTs; ESTs; 10.37 

426534; U58098; Hs.2051; teste spedfic protein. Y-Bnked; testis spedlic protein, Y-ilnked; 10.28 
441878; AI801869; Hs.127982; ESTs; ESTs; ia06 

- - 432117; AL036195; HsJ?909; protamine 1; protamine 1; 10.01 

:>:> 425427; AI652662; H5.317432; brand^ed chain aminokranslarase 1, cytosdto; brandied chain amlnotransl^ 1. cytos; 9.97 
416201; AA467752; Hs.195161: ESTs; ESTs; 9.97 
410929; H47233; H5.30643; ESTs; ESTs; 9.91 
427486; AA974433; Hs.362432: fibroblast growth fodor 4 (heparin 

427239;BE270447;Hs.356512;ublqui8ncariierprotdn;ubtquItacanierprold^^ ^ ' ii««Ha«Biuw«iwaor*»inBpann5ecr;».oi 

OU 402680; ; ; Target Exon; Target Exon; 9.68 

409208; Y00093; Hs.1 72631; Wegrin. alpha X (antigen C011C (p150), alpha pdypcptide); Integitn, alpha X (antigen C011C (pl5(J),; 9.46 
443426; AF098158; Hs.9329; chromosome 20 open reading ftme 1; chromosome 20 open reading frame 1; 9.42 
440207; AI371978; Hs.128326; ESTs; ESTs; ^41 

433001; AF217513; Hs.279905; done HCi0310 PR00310p1; done HQ0310 PRO0310p1: 9.41 
OD 447534; AW953935; Hs.288655; ESTs; ESTs; 9.33 
442333: AI650877; Hs.129302; ESTs; ESTs; 9.28 

421307; BE539976; Hs.103305; Homo sapiens mRNA; cONA OKFZp434B0425 (from done DKFZp434B0425); Homo saptons mRNA; cONA OKFZp43480425 (f 9 24 
423458; AI204212;Hs.3511l3; ESTs; ESTs; 9.23 

4319S8; X63629; Hs.2877; cadherfn 3. type t. P-cadherin (placental); cadherin 3. type 1 . P-cadherin (placenta; 9.23 
/U 422938; NM_001B09: Hs.1594; centromere protein A {17kD); centromera protdn A (17kD); 9.21 

41 1027; AF072099; Hs.67846; leukocyte fmmunogtobunn^ikB receptor, subfamily B (with TM and ITIM domains), member 4; leukocyte Immunodobdln-llke reoeotor : 9.21 
425397; J04088; Hs.l56346; topdsomerase (ONA) II alpha (170kD); topdsomerase (DNA) 11 dpha (170kp); 9.18 
428664; AK001666; Hs.1B9095; similar to SAUl (sal (Drosophila)-like: similar to SAUI (sal (DfosophilaHke; 9.17 

- - 428970; BE276891; Hs.194691; retinofc add Induced 3 (RAIGI); matabolropicglutamale family GPCR; relinote add induced 3(RAIG1); metaboc 9 11 
/ J 447733; AF157482; Hs.19400; MA02 (mitoOc anest defidenl. yeasl. homdog)^ 2; MA02 (mitoCc arrest defident. yeast h; 9.11 

422310; AA316622; Hs.98370; cytochrome P450, subfamily IIS, polypeptide 1; cytodvoma P45a subfamily IIS, polypept; 9.t0 
449722; BE280074; Hs.23960; cydin B1; cydin Bl; 8.86 

441560; F13386; Hs.7888; v-erb-a avian erythroWastte teukemfa viral oncogene homdog-Rka 4 (HER4); v-ert>^ avian erythroblastic leukemia vi; 6.86 
0^ 440983; M20681;Hs.7594;sduto carrier family 2 (faoTitated glucose bansporteO. member 3; 
oU 409342; AU077058; Ks.54089; 6RCA1 assodated RlNGdomaln 1; BRCA1 assodated RlNGdomain 1; 8.83 

420367; AA259090; Hs,K702B; ESTs; ESTs; a82 

415947; U04045; Hs.78934; mulS (E. coli) homdog 2 (odon cancer, nonpdyposis type 1); mutS (E. cdi) homdog 2 (coton cancer. ; ^73 
418613; AA744529; Hs.86575: mitogeiMdivated pretdn kinase kinase kbiase Unase 1; mitogen^aled piotdn kinase kinase ; a71 
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417407; AA923278; Hs.290905; ESTs, WeaMy similar to protease (Rsapiens): ESTs. WeaMy simaar to protease ^64 

407239: AA0763S0; Hs.67846; teukocyte ImmunoglobtMDce reoeptof. subfamily B (with TM and iTlM domains), member 4; leukocyte immuno^buOn-Gka receptor. ; B.58 
436461; AA379S97; Hs,5199; HSPC150 protein simOar to uttquiSrvconlugattng enzyme; HSPC1S0 protein sin^ to ut^ilinnnn; 8.55 
. 412140; AA219681; Hs.73625; RAB6 interacOno. Unesln^ke (rabkinesin 6); RA66 interacting, kinesij>4jke (rabUnes; a52 
J 438091; AW373fl62: Hs.351546; nudear receptor subfamily 1 , group I, men\ber 3; nudear receptor suManily 1, group I, m; B.51 
424800; AL035588; Hs.153203; MyoD famfly inhibitor. MyoD family Inhibitor, 8.45 
447188; H65423; Hs.17631; hypotheiical protein OKFZp434E2t35; hypothetical protefri DKFZp434E2135: 8.45 
430056; X97548; Hs.228059; KRAB-essodaled protein 1; KRA&«sodated protein 1; a42 
- 417389; BE260964; Hs.82045; midkine (neuifte 8rowthi)romotin9 factor 2); midkine (neurite growt^promoting factor, 8.40 
i U 430676; AfW866; H5.372585; gb;Hbmo sapiens envelope protein R103 (env) gene, comptete cd; gbtHomo sapiens envetope protein RlC-3 (; 8.38 
420759; Tl 1832; Hs.127797; Homo sapiens cONA FU1 1381 fis, ctona HEMBA1000501; Homo sapiens cDNA FU11381 fis. done HE; 6.38 
406621; X57B09: Hs.181 125; trnmunoglobuiln lambda locus; immunogbbu&n lambda locus; 6.37 
453914; NM.000507; Hs.574; fhictose>1.6-blsphosphatesa 1; fructose-l.&^bisphosphatase 1; 8.25 
^ . 42319a' M8t933; Hs.1634; ceil division cyde 25A; cell diviston cycte 2SA; ai9 

ID 418299; AA279530; Hs.83968; tolegrin, beta 2 (antigen CD18 (p9S). lymphocyte funclton^assodaled antigen 1; macrophage antigen 1 (mac-1) beta subunilj: integrln. beta 2 (antigen 
C018 (p%), ly; 8.17 

453968; AA847843; Hs.6271 1 ; High mobffity group (nonhistone chromosomal) protein 4; High mobility group (nonhistone diromoso; a 16 
453985; N44545; Hsi51885; ESTs; ESTs; ai4 
451 108; BE382701: Hs.25960; N44YC oncogene; N-MYC oncogene; aiO 
ZU 420347; AL033539; Hs.97124; Human DMA sequence from dona RP1-309H15 on duomosome 6p22.1-2Z3 0)ntalns a gene similar to HOGF (hepatomaK^eilved growth factor fhloh- 
mobilitygroupproteinWte)).EST8.8TSs,6SSsandaCpGlsl:HiimanONAseque^ B^wwu, i«»r inign. 

415857; AA86611 5; Hs.127797; Homo sapiens cDNA RJllSBI lis. done HEMBA1O0O5O1; Homo sapiens cDNA FU11381 fis, done HE: a02 
425601; AW629485; Hs.140720; GSK-3 binding protein FRAT2; GSK-3 binding protein FRAT2; 7.90 

421015; AA504583; H3.101047; transcription factor 3 (E2AimmunoglobuRnenitancerbin(fing fectois E12/E47); Iransciipfion factor 3 (E2Aimmunogtobul; 7 89 
Z J 432407; AA221036; ; 9b2r03fl2j-1 Stratagene NT2 neuronal prccuisor 937230 Homo sapiens cD^W done 5' similar to SWiPOU_BA£VM P10272 POL POLYPROTEIN • mRNA 
sequence; gb3i03f 12.r1 Stratagene NT2 neuronal pr, 7.83 

422845; BE51 3934; Hs. 1583; nautrophO cytosoUc factor 1 (47M}.chrontogr8ntitomatous(Bsease.autoso^ 1); neutrophil cytosolte factor 1 (47kD.chr 7 80 

433228; F28212:Hs.14953;KIAA1491 protein; KIAA1491 protein; 7,73 ^ "h> 7 w v ^u.^nr.i.ou 

446528; AU076640; Hs.15243; nudeolar protein 1 (120kD); nutiediar protein 1 (120kD); 7.71 
J\J 447350; A1375572: Hs,172634; v-ertw avian eiythrofalastiG teukemia vN oncogene homotog^ike 4 {HER4); v-erb-a avian erythroblastic teukenva vi; 7 71 

448775: AB025237; Hs.3B8; nudn (nudeoslde diphosphate linked moiety X)-type motif 1; roidlx (nudeosUe diphosphate linked md; 7.71 

430253; AK001514; Hs.236844; hypothetical protein FU1065^ hypothetical protein FU10652; 7.70 

444784; D12485; Hs.11951; ectonucleotkle pyrophosphatase/phosphodiesterase 1 (Plasmwen membrane glycoprotein PC-1); ectonudeoUde pyrophosphataseMiosohodi' 7 62 
443537; D13305;Hs.203;diolecystokIninBreceptondKJlecy5tokinlnB receptor; 7.57 wiop«»pn«^^ 
J J 446291; BE397753; Hs.14623; interferon, gamma4ndudble protein 30: interferon, gamm&*nludble protein 30; 7.55 

410008; AW73230B; Hs.57783; eukaryoUc Iranslafton faMon fector 3, subunit 9 (eta. 1 161(0); eukaryotic translation iniHalton factor; 7.53 
430255; AK000703: Hs.323822; Homo sapiens mRNA tor KIAA1551 protein, partial cds; Homo sapiens mRNA for KIAA1551 protein, ; 7.52 
41 1975; A1916058; Hs.144583; yUTR of. dead ringer (Drosophfla)-fikB 1 ; 3'UTR of. dead ringer (Drosophfla)^ ; 7.50 

439864; AI720078; Hs.291 997; ESTs, Weakly sWlar to A47582 aceD growth factor precursor (H.sapiens]; ESTs. WeaWy similar to A47582 B-cell « 7 47 
^ !I?2IS ^^Bi^-^JJ^Ii!^^ ^^^"^ ^ 2700083B06 gene, done MGC:4669. mRNA. complete ods; Homo sapiens, Similar to RIKEN cOKA 2700; 7.47 

417705; AW134952; Hs.175220; hypotheBcal protein FU14541; hypothetical protein FU14541; 7.47 

425387; BE271188; H5.155975; protein tyrosine phosphatase, receptor type. C^sodated protein; protein tyrosine phosdwtese. receptor 1 7.47 
407710; AW022727;Hs.23616; ESTs; ESTs; 7.45 
. 445093; A1207197; Hs^74149; ESTs; EST5;7.41 

40 418113; AI272141; Hs.83484; SRY (sex detemnlnlng region YHxw 4; SRY (sex determiiBng region Y>*ox 4; 7.39 

417900; BE250127; Hs.82906; COC20 (ceO dhfiston cyde 20. S. cerevisiae, homdog); COC20 (ceil division cyde 20, S. cerevi; 7.37 

n??!n? i^llll^}' fiVcine dehydrogenase (decarboxytefing; glydne decarboxylase, fllydne cteavage system protein P); glydne dehydrogenase (decarboxyiatlng: : 7.33 
422726; U11690; Hs.1572; fadogenitol dyspl8sia{Aar5kog^8yridrome):fedogBniteIdy8plasfa 

430504; H52761; Hs.44095; Homo sapens. done MGC:12617. mRN^ complete cds; Homo saptens, dona MQC:12617, mRNA, com; 7.32 
50 448981; Al968719;Hs.t95387; ESTs; ESTs; 7.28 

413762; AW41 1479; Hs.848; FK506'binding protein 4 (59kD); FHSOS-binding protein 4 (59kD); 7.26 

AL137310; H5.4749; Homo sapiens mRNA; cONA DKFZp761E13121 (from done OKFZp761E13121); partial cds; Homo sapiens mRNA; dDNA OKF2p761E13121 1 7.25 
434414; AI798376; ; gb:tr34b07 jil Na,CGAP.<M3 Homo sapiens cONA done J shifilar toTR:015475 015476 UNNAMED HERV-H PROTEIN mRNA sequence; ab*34b07j(1 
cc Na_CGAP_Ov23 Homo sapiens; 7.24 

D 5 428977; AK001404; Hs.194698; cydin 62; cydin 82; 7.19 

434274; AA628539; H8.57783; ESTs. h/toderately shitUar to ALU1,HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY (Rsaptensl; ESTs. Moderatehr 
simHar to ALU1 J<tiMAN A; 7.19 

44570ft AW206257; HS.15B326; Human DNA sequence torn done RP11.14SU22 on dirofflosome6p21.32-2Z2. Gonteins the gene formyeRritoRgodeiid^glyo^^ M06. 
(Partof)lhegeneforanowlKRABboxconlalntogC2H2lypedncf;HumanONAscquencefro^ 
OU 420524; AB010575; H8.98547; amitoride-sensllive catton diaimel 3. tesQs; amitoride^enslSve catton dennel 3, to; 7.15 

439053; BE2445M; Hs.6456; chaperonin containing TCP1, subunR 2 (bete); chaperonln oonteNng TCP1, subunit 2 (b; 7.14 

445076; AI206888;Hs.l54131: ESTs; ESTs; 7.14 * i • 

446588; AI970278;Hs.156905:KIAA1676; KIAA1676;7.13 
, - 429486; AF155827^ Ks.203963; hypothetical protein FU10339; hypothetical protein FU10339; 7.10 

00 441362; BE61441(k Hs.23044; RAD51 (S. cerevisiae) homotog (E coR RecA homdog); RAD51 (S. cerevisiae) homotog (E cell Re; 7J04 

433914; AF108138; Hs.1 12160; Homo sapiens DNA hefcase homotog (PiFI) mRNA. partial cds; Homo saptens ONAhelicase homdog (RFl); 7.(C 
413278; BE563085iH8.833;inteifen»«timulaled protein, 15kDa;lnterteron-8timulatedpiotehi.15kOa;7.02 

423765; R2385a- Hs. 143375; Homo sapiens, done (MAGE:3840937, mRNA, partial cds; Homo sapiens, clone IMAGE:3840937. mRNA.; 6.96 

416658; U03272; H5.79432; fibrfllin 2 (congei^tal contractura) aradinodadyly); fibrillin 2 {congenital contractural are; a92 
/VI 438450; AI050866; Hs.65853; nodal, mouse, homotog; nodal, mouse, homdog; 6.90 

415323; 6E269352; Ks.949; neutrophil cytosoQc factor 2 (65kO, chronte granulomatous disease, autosomal 2]; neutrophil cytosoHc factor 2 (65kD. chr aSD 

444381; BE387335; Hs.283713: hypolbeticai protein 8C014245; hypothetical protein BO014245; 6.89 

447582; 6E29352(k Hs.18910; prostete cancer overexpressed gene 1; prostete cancer overexpiessed gene 1; a89 
- _ 424779; AL046851; Hs.153053; CD37 entigen; CD37 antigen; a89 

/J 443907; AU076484: Hs.9963: TYRO protein tyrostoe kinase binding protein; TYRO protein tyrosine kinase binding pro; 634 
427298; AA400495; ; ESTs; ESTs; 6.82 

414732; AW410976; Hs.77152; mintohromosome maintenance deftetent {S. cerevlstee) 7; mintohromosome maintenance deficient (S.; asi 
424959; NM.0057B1; Hs.153937: acfivated p21cdc42H5 kbase; acfhratod p21cdc42Hs kinase; ast 
426866; Ua2330: N5.1728ia neureguEn 1; neuregulto 1; aSO 
OU 418203: X54942; HS.B3758; COC28 protein kinase 2; COC28 protein Wnase Z a80 
427521; AW973352: : ESTs; ESTs; 6.75 

430397; AI924533; Ks.105607; btoaibonate transporter reteted protein 1; btosrbonate traisporter related proteto ; a75 
427719; AI39312% Hs.134726; ESTs; ESTs; a74 
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439453: BE264974; H$.6566; IhymW hoonons fBCeptof hieractof 13: thyroid homwoB receptor Interactor 13: 6.72 
453348; BE272318; Hs.8595; hypolheOca! protein RJ12438; hypothetical protein FU12438; ^71 

4461 13: AW967553: H5,323518; Homo sapiens mRNA for aJ00083 protein, partial cds; Hbmo sapiens mRNA for FU00083 protein. ; 6.70 
. 445383; NMJD05993: Hs.12570: tubuQivepedfic chapeione d; tubuRn^spedlic chaperone d; &70 
5 433701: AW445023; Hs.15155: ESTs; ESTs; 6.69 

418054; NM_002318: Hs.83354: lysyl ojddase^ilcB 2; lysyl oxidaso4ike 2; a63 
441031; All 10684; Hs.7645: fibrinogen, B beta polypeptide: fibrinogen. B beta polypepDde; 6.62 

439237: AW408158: Hs.318893: ESTs. WeaWy similar to A47582 growth factor precursor (H^fi^itens): ESTs. V^teaWy simiiar to A47582 Mi w. 6.59 
424381; AA285249:H5.146329; protein ldnaseChk2 (CHEK2); protein Hnase Chk2 (CHEK2);a58 
414821; M63835: Hs.77424: Fc ftagment of tgG. h^h afJWty la, receptor for (C064); Fc fragment of IgG^ high affmity la, re; 6J8 
418883; BE387035; Hs.1211; add phosphatase 5, tartrate resistant add phosphatase 5. tartrate resistant 6.57 
414161; AA135106; Hs.184852; KIAA1553 protein; KIAA1 553 protein; 6.57 

437437; AA226869; Hs.351623; hypothetical protein OKFZp782L0311; hypothetica! prot^ DKFZp762L0311; 6.65 
425769; U72513; Hs.159486; Human RPL13.2 pseudogene mR^i^ complete cds; Human RPL13-2 pseudogene mRNA, ccmplele ; tSS 
ID 452363;AI582743: H5.94953; Homo sapiens, SliiJilar to complement component l.qsubcompoTOnU polypeptide, done MGC:17^^^ 
to coitpem^t comp; 6.55 

422765; AW409701: Hs.157B; bacutovlral lAP repeat^tabilng 5 (survMn}; bacutovM lAPiepeal-contalnfng 5 (sun 6.51 

409B93; AW247090: Hs.57l01; minlchromosome maintenance defident (S.cerBvblae) 2 (n^tofin); minichromosome maintenance delident (S : &50 

418918; X07871; Hs.89476; C02 antigen (p50). sheep red Wood cell receplon 0)2 antigen (p50), sheep red Wood cell ; 6.49 

41 791 1; AA333387; H5.8291 6; diaperonin containing 1CP^ . subunit 6A {zeta 1); chaperonin containinQ TCPl. subuntt 6A fc 6.44 

427747; AW41 1425; H5.180655: serineAhieonlna kinase 12; serinaAhreonine Unse 12; 6.43 

409142; AL136877; Hs.5075B; SMC4 (structural maintenance of dvomosomes 4, yeaslHite 1; SMC4 (sbuctural m^tflnance of chromoso: a42 
430280; AA361258;Hs.237868;lnterieuWn 7 receptor, mterieukin 7 receptor; 6.42 * 
432938; T27013: HsJ132; sterddogenic acute regulatory protein; steroidogenic acute regulatory prDteln; 6.42 
ZD 422809; AK001379; Hs.12l028; hypothetical protein FU 10549; hypothetical protein FU10549; 6.40 

l^^l?^"?; ^S?"*!^*' ^ ^ TUU J^UMAN TUBBY REIATEO PROTEW 3 (H^apiens); ESTs. Highly similar to TUL3.HUMAN TUBBY; 6.40 
4171 15; AW952792: Hs.334612; smaD nudear ribonudeoprotein potypepSda E; small nuclear ribonudeoprotein polypept; 6.39 
428227; AA321649: Hs.2248; small Indudble cytokine subfamily B (Cys-X-Cys). member 10; small IndudWa cytokine subfamily B (Cy; 6.39 
441384; AA447849;Hs.288660;reilnolc add Induced 3; retinolc add Induced 3; a38 

438915; AA280174; Hs.285681; WilBams-Beuren syndrome chromosome regton 14; Winiams-Beuren syndrome diromosome regi; 6.34 
416773: AK000340; Hs.79828; hypolhetical protoln FU20333; hypothelJcal protein FU20333; 6.34 

151?!? AW176401; Hs.27424; OEADW (Asp-Qu-Aia-Aspftfis) box polypeptide 1 1 (Sxerevlslae CHLl-like heHcase); DEAOm (Asp-Glu-AlaAspftfls) box pdypep; 6.31 
444159; AF116846: Hs.10431: dead ringer (DrosophllaHke 2 (bright and dead ringer); dead ringer (DrosopWIaHIke 2 (bright ; 6.31 
425274; BE281 191; Hs.155462, minJchramosome maintenance deftelent (fnls5, S. pombe) 6; minichromosome m^ntenance defident (mi; 6 31 
^ J 434649; AA738254; Hs.165390; ESTs. Highly similar to A40350 transcription repressor protein YYl [Ksaptonsl; ESTs, Kighty dmilar to A40350 transcrto' 6 30 
420507:AF093408:H8.98397;Aktoase(fWA)andK)rprD(e!n3:Akln^ * .n^ 7«ni>anDAw*utransatp.6.30 

415829: AW450l98;Hs,163742; ESTs; ESTs; 6.28 s ^ y, ^ 

410342; R31 350; Hs.743; Fc fragment of IgE. high affmHy I. receptor for; gamma pdypeiwde! Fc fragrnent of IgE, Mcdi affinl^ I, rsc 6.26 
446839; 6E091926; H$.16244; mitotic spindle coHed^l related protein; mitotic s(^e cdted^on related prot: 6.25 
4U 457465; AW301 344; Hs. 122908; DMA replication factor; ONA replicafion fetor; 6.25 
428918; AL036967; Hs.2324; protamine 2; protamine 2; 6.24 

443523: AK001575; Hs.9535; hypothetical protein FU10713: hypolhetlcd protein FU10713; 6.19 

424415; NM_001975: Hs.146580; enolase 2. (gamma, neuron^; enolase 2, {gamma, neuronal); 6.19 

407245; X90558; Hs.172004; tittn; titin; 6,18 
4^ 458627; AW088642; H5.97984; SRY (sex determining region Y)-box 17 (80X17); SRY (sex determining regton Y^box 17 (S; 6.18 

421379; Y15221; Hs.103982: small Inducible cytokine suWamily B (Cys-X-Cys), member 11; small Indudble cytoWne subfandy B (Cy; 6,16 

435099; A0004770; H8.4755; fiap strudure-spedSc endonudease 1 ; flap structore^pedfic endonudease t; 6.13 

424308; AW975531: H6.154443; n^nichromosome maintenance defideni (S. cerevislas) 4; mtntchramosome maintenance defident (S.: 6.12 

430521; NM_016383:H3.242183;HOM-TES^lumor8nt»geri;HOM-TE8«tomormtigen;6^ ^ 
D U 444823; BE262989; H5.12045; pulaihre protein; putative protein; 6.10 

^SiS^'" ^1 04032; Hs. 1 84601 ; sdute carrier fentily 7 (cationic amino add Iransporter, y system), member 5; solute carrier femHy 7 (catbnte amino ; 6.09 
; 436*:Homo sapiens fibrillarin (FBI), mRNA. IranscripO m mRWA.; NM.001436*:Homo sapiens librillarfn (FBL; a09 

441321; H17182;Hs.7771;B^»ll associated protein; B^ellassodated protein: 6.05 

D D 431629; AU077025; H3.265827; interferon, alphfrindudWa protein (done FI-6-16); interttom, alpha^ndudbte protein (do; 6X)4 
402678;;; Target Exon; Target Exon; 6.03 r- v-^ 

453884; AA355925: H5.3823?, KIAA0186 gene product WAAOl 86 gene produd; 6.01 

439753; BE262233; H5.7423; hypothefical proteto fhxn EUROtMAGE 2168212; hypothetkal protein from EUROIMAGE 2168; 6.01 

420596; NM„002692; H8.99185; polymerase (DNA directed), epdton 2; pdymerase (DNA directed), epsiton 2; 6.01 
OU 420676; AI434780; Hs.4248; vav 2 oncogene; vav 2 oncogene; 6.00 

418756; AA252254; H6.226949; ESTs; ESTs; 5.99 

454438; AA224053; Hs.l72405; cell dhrtslon cyde 27; ceO divkton cyde 27; 5.98 

407618; AL021938: H$.40164: JumonJI (mouse) homotog; fumori{i (mouse) homotog; 5.98 
^ 413313; NM_002047; Hs.29388S;fltycyl-tRNA8ynthete5e;gtycyl-tRNA synthetase; 5.96 
OD 424247; X14008: Hs.234734; lysozyme (renal amytoWosls); l^sozyma (renal amytoUosis); 5.95 

JlgsSAiewwJteig^^ 

43657&! AI45821^ Hs!77542: ESTs; ESTs; sio 

-70 iSIJ? MSSS?!?«*^*^?1I?i:."!!!!S^ m^om»assodated antigen p97 (melanotransferrin) mRNA, J flank; Human melanoma-assodated antigen p97 (m; 5.89 
/U 420005; AW271106;Hs.133294; ESTs; ESTs; 5i9 

417208; S67773; Hs.81665; v-Wt Hardy-Zuckerman 4 feline sarcoma viral orwogene homotog; v-kit Hardy-Zuckennan 4 feline sarcoma v 5.88 

; i C»0147r:gq5809678Igb|AAB41848.2| (U64675) sperm meinbrane protein BS^3 (HortwsapiensIH; C2001472*:giI5809678teblAAB41 848.21 (U64- 5 87 

44873tfcAB032983;Hs.21894;KIAA1157proteln;KIAA1157prolein;5.87 
- - 406137; : ; NM_000179^Hoino saptens mulS (E coIQ homdog 6 (MSH5). mRNA. VERSION NMJ)00178.1 Gl; Niy4^000179*:Homo saptens mulS (E odD h- 5 85 
fO 423787; AJ295745;H5.236204; nudear pore complex protein; nudear pore compIaxproteln5&B5 

425126; N32759: Ks.l72944; chorionto gonadotropin, beta pdypeptide: choriorA: gonadotropin, beta polypeptide; 5.84 

452796: AB01 1 10ft Hs.30656f K1AA0528 gene product; KIAA052a gene product; 5.84 

418322; AA284166; H3.841 13; cydirvdependent kinase inhibitor 3 (COK2-Qssodated duai spedficity phosphatase); cydbHlependaiit kinse bUdbitor 3 (COK: 5 80 
447359; Nfct012093;Hs.l8268; adenylate kinase 5; adenylate kinase 5; 5.79 " 
oU 420344; BE463721; Hs.97101; putaSve G proteliHXiupted reoeptx, puteliva G protebcoupted receptor; 5.78 

ALuThumAn'aST^^^^ WefiWy similar to ALUIJttJMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY IH.8aptens); ESTs. WMMy sWIar to 
414761; AUd77228; Hs.77256; enhancer of zeste (Drosophfla) homotog % enhancer of zeste (Drosophna) homotog Z 5.75 
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430287; AW18245St Hs.125759C ESTs. Weakly Mar to L£U5_HUMAN LEUKEMIA ASSOCtATEO PROTEIN 5 [Rsaplsnst ESTs. Weakly simltv to LEU5 HUMAN LEUKE: 5 74 
436251; BES1506S; K5.296565; nucleolar pcutein (KK&D repeal}; nucieoiar protein (KKE/D repeat); 5.73 

421535; AB002358: Hs. 105478; phosphoribosyRbmiylglycinan^ synthase (FGAR amWolrBnsfefBse); phosphoribosylfbrrnytalydnarnidine syntha; 5.71 
4148B3; AA926860: Hs.34868% CDC28 protein kinase 1; COC28 prot^ kinase 1; 5.69 
^ 425159; NM.004341; Hs.154866; carbamoyHihosphala synthetase 2. aspait^ transcartamytase, and dthydioorotase; csrbamoyVphosphate synthetase 2, aspart 5.69 
401704; ; ; NM_021 ISSVHomD ssftem daudln 6 (CL0N6). mRNA. VERSION NKL020382 1 GJ; NM-021 195* Jtemo sapiens daudin 6 (CLDN6; 5.60 
425358; AlJ07g658; H3.338207; FKS06 binding proton 12-fBpamyctn assodated protein 1; FK506 binding protein 12-r8pamydn assoc; 5.65 

402677; ; ; NM,OOW7a'Homo sapiens aOcailne phosphatase. Bver/bonaflddney {ALPL), mRNA. alpha-hytfraxyfase). polypepGde 1 {CYP7B1), mRNA.; NM.000478:Homo saoiens 
^ alkaline phosphal; 5.64 

1 U 409264; NM_014937; Hs.52463: KIAA0966 protein; KIAA0966 protein; 5.63 

432185; AA221032; HS.272B38; hypothetical protdn FU10494; hypothetical protein FU10494; 5.63 

409012; AL117435; H5.49725; 0KFZP434I216 protein; 0KFZP434I216 protein; 5.63 

430252; AI638774; fe.105328; testes developmenketeled NYIVSP20; testes devekjpmenNetated NYO-SP20; 5.61 

419359; ALD43202; Hs.90073; chromosome segregation 1 (yeasl homologVlilce; chromosome segregalGon 1 (yeast homolog); 5.61 
15 4S2816;AA131789;Hs.61 509; ESTs; ESTs; 5.60 

402679; ; ; NH.000478:Homo sapiens alkaiine phosphstee, HverAxmeflddney (ALPL), mRNA. dpha^dnnqrlase). polypeptide 1 (CYPTBIX mRNA.; NJiL000478:Homo sapiens 
alkaline phosphal; 5.59 

414291; A}269ai9; Ks.13040; G proteirHXK/pled receptor 86; G proteirKoupled receptor 86; 5.58 
453028; AB006532; Hs.31442; RecQ proteiMike 4; RecQ proteln-liko 4; 5.58 
IV 453905; NM.002314; Hs.36S66; LIM domain kinase 1; UM domain kinase 1; 5.56 

41 1263; BE29760% Hs.69360; UnesbvGke 6 (mitolto centromere-assodated kinesin); k!nesin4ikB 6 (mitotk: centiomere^assoc; SJSS 

419660. 8E280337; H9.194893; sdute canter fantily 7 {caSotdc amino add transporter, y system), member 7; solute canter famBy 7 (caUonh: amino : 5.55 

446979; AI654443; Hs.197683; ESTs; ESTs; 5.54 

424321; W74048; Hs.1755; lymphocyte^pedfic protein tyrodne kinase: lymphocyte^ecHic protein tyrosine kin; 5.53 
25 416962; AA714835; Ks.271863; ESTs; ESTs; 5.53 

447388: AW630534; Hs.76277; Homo sapbns. done MGCS381 » mRNA, complete cds; Homo sapiens, done MGC:9381. mRNA. comp; 5.52 

427247^^504221; Hs.174103; htegrin, alpha L (anOgni CD11 A {p180), lymphocyte function-assodalBd antigen 1; alpha polypeptide); integrin, alpha L (antigen CD1 1A (p180).: 

449322; AI538616; Hs.196566; ESTs; ESTs; 5.51 
3 {) 428450; NM.014791: Hs.184339; KtAA0175 gene produd; KIAA0175 gene product; 5.5t 

415141^M«a)9^ ECTs. Weakly simRar b ALU7^HUMAN ALU SUBFAMILY 8Q SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiensJ; ESTs, WbaWy similar 

4M04B; H05626; Hs.6921; ESTs; ESTs; 5.46 

417079; U65590; Hs.81 134; Interleukin 1 receptor antagonisl; Interieukin 1 receptor antagonist; 5.45 

434699; AA643687; Hs. 149425; Hbmo sapiens cDNA FU11980 fls, done HEMBB1001304; Homo sapiens cONA FU11980 fis, done HE; 5.44 
414334; AA824298; Hs.21331: hypothetkai protein FU10036: hypothetical protdn FU10036: 5.44 

452291; AF015592; Hs.26853; CDCf (ceU dhndon cyde 7, S. cerevisiae. homdogHike 1; CDC7 (ceH diviskm cyde 7, 6. cerevisi; 5.44 

438564; AA381553; Hs.198253; major histKompatibaity complex, dass H, DO alpha 1; histocompatibilily complex, dass ; 5.44 

427668; AA298750; Hs.180191; hypothetical protein FU14904; hypothetical protein FU14904; 5.43 
HK) 449437; AI702038; Hs.100057; Homo sapiens cDNA: FU22902 fis, done KAT05581; Homo sapiens cDNA: FU22902 (is. done K; 6.41 

453633; AA357001; Hs.34045; hypothetical protein FU20754; hypolheticd proteta FU20764; 5.40 

450746; D82673: Hs.278569: general transcription fador II. i; general transcriplten fador II. I; 5.40 

425966; NN^001761; H3.1973; cydin F; cydln F; 5i39 

418134; AA397769: Hs.866l7; ESTs; ESTs; 5.38 
45 432141; BE410964; Hs.272736; nudear receptor binding protein; nudear receptor binding protein: 5.37 

417141; U22662; Hs.347353; nudear receptor subfamily 1, group H, member 3; nudear receptor subfamily 1, group H, m; 5.38 

428329; AA426Q91; Hs.98453; ESTs. Moderately slmflar to R27328 2 [H.8apiensJ; ESTs. Moderately similar to R27328 2 (H,; 5.35 

40681 1 ; U82979; Hs.67846: leukocyte immunogUbulin^ leceptor, subfamily B (with TM and ITtM domains), member 4; leukocyte {mmunogtobuOrMike receptor. ; &34 
415819; AU077330; Hs.360791 ; transcriptton elongatton fador A (Sll). 1; transcriptton etongatton fador A (Sll).: 5.33 
D\J 448133; AA723157; Hs.73769; folate receptor 1 (aduM); folate receptor 1 (adiril); 5.33 

424762; AL1 19442; K3.183684; eukaryollc translation Initiation factor 4 gamma, 2; aukaryotic translation initiatton lador; 5.32 
421959; AW751497; H8.98370; cytochrome P450. subfamily US. polypeptide 1; cytodwome P450, subfarrdly IIS. pohrpept; 5.32 
429271; ^039850: Hs.198515; dead rfnger (DrosophOa)^ 1; dead linger (DrosophDaHike 1; 5.32 

- 456373; 6E247706: H5.86693; merrtbiBne^penhtng 44omains, subfamDy A. member 2 (CD20 antigen}; membrane^pannlng 4-domalns. subfamily A; 5.30 
D J 414907; X90725; Hs.77597; pdo (Drosoph{a)-like kinase: pdo (Drosophte)-Iika kinase; 5.30 

422997; BE018212; H3,122908; DNA repGcalton fador. DNA replteation fador, 5.29 

440014; AW960782; Hs.68S6; 8sh2 (atjsent small, or homeolic OrosophHa. homofog)^*ke; 8sh2 (absent small, or homeotte. Orosop; 5.28 

416399; AF131701; H&.64753; hypothetical protein FU12442: tvpoB^etical protein FU12442: 5.26 

416178; AI808527; Hs.192822; serologlcalty defined breast cancer anOgen NY-BR-81: serdogteaitydeSned breast cancer antk 5.21 
OU 450377; AB033091; Hs.355925; KIAA126S pro*dn; KIAA1285 protein; 5.20 

409670; AI368109; Hs.375604; NAAI 856 protein; K1AA1856 protein; 5.20 

429083; Y09397: K5.227617; 6CL2-re(ated protein A1; 6CL2^fated protein A1; 5.20 

449523; NM.000S7% Hs.54443; chemoMne (&C motif} receptor 5; chemoklne ((VC mdiO leceptor 5; 5.20 
^ 408908; BE296227; K3.250822; sertoeAhreonlne kinase 15; serinefthieodne kinase 15; 5.19 

OD 429732; U20158: Hs.248B; lymphocyte cytosdlc protein 2 (SH2 domdn-cont^ng teukocyte protein of 76kD); lymphocyte cytosoDc protein 2 (SH2 doma: 5.19 

438089; W05391; Hs.351546; nudear receptor subfamily 1, group I, men*er 3; nudear receptor subfamily 1, group I, m; 5.16 

437623; O63860; Hs.571 9; chromosome condensation-related SMC-assodated protein 1; chromosome condensation-rdated SM&asso; 5.15 

448161: AF272833: Hs.279763; hypolhettoal protein FU10504; hypotheScd protein FU10504; 5.15 

436540: BE397032; H3.14468; hypothetkal protein MGC14226; hypothetted protdn MGC14226; 5,14 
/ U 422241; Y00062; Hs.170121; protein tyrosine phosphatese. receptor type, C; protein tyrosine phosphatase, receptor 1 5.14 

426752; X69490; Hs.172004; iien; tifin; 5.13 

415007; BE244332; Hs.77770: adaptor-related protein oompfex 3, mu 2 subunil; adaptor-rdated protein comptex 3^ mo 2 ; 5.13 
400ffi3: ; H8.75309; Eos Control; Eos Control; 5.13 

- ^ 437099; N77793; Hs.48659: ESTs. Highly similar to S14458 laminin a)pha-1 chdn procuisor [H.8aplen$]; ESTs. Highly similar to S14456 teminin a: 5.12 
/ O 427209; K06509; Hs.92423; KlAAl 566 protdn; KIAA1566 protdn; 5,10 

407347; AA829847; ;gb:od40d07.s1 Na.CGAP.G(»1 Homo sapfensdDNA done tMAGE:1 37041 3 ST dnto to oonfainsAhjrepetithre dement;, mRNA seqw 

Na_CGAP_GC81 Homo sapiens; 5.10 -"i 
458933; A1638429; H5.24763; RAN binding protdn 1; RAN btodlng protein 1; 5.10 

- 450431; AW136797; Hs.266Wl; ESTs; ESTs; 5.09 

OO 434608; AA805443; Hs.179909: hypothe8cd f»otdn FU22995; hypolhdicd protdn FU22995; iOB 

410423; AW402432; Hs.63489; protdn tyrosine phosphatase, non-receptor type 6; protein tyrodne phosphatase. noiHecepI; 5.08 

41792$: R2721 9; Ns.74647: Human T-cdl receptor adne dpha<han mRNA from JM cdl line, oomptete cds; Human T-«el receptor adive dphachaln; 5.05 

412723; AA6484S9: H8.335951; hypothefcal protein AF301222: hypotheOcd protein AF301222: 5.05 
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44732t: AW271217; Hs.281434; Homo saptens cONA FU14026 fis, done HEM9A10C3838; Homo sapiens cONAFUl 4028 6s, done HE; 5.03 

417866; AW067903; Hs^2772; coDagen. type XI, afpha 1; conagen. type XI. atpha 1; 5.03 

410082; AA081594: Hs.1 5831 1; Musashl (DrosophOa) homolog 1; MusasN (DiosophOa) tramoJog 1; 5.02 

43359i NM_004642; Hs,3438; deteled In oral cancer (mouse. t»molog) 1; deleted in oral cancer (mouse, homotog) ; 5.02 

42581 1; A1039104: Hs.1 59557; karyopheiin alpha 2 (RAG cohort 1. knportin alpha 1); karyopherin alpha 2 (RAG oolwt 1. Impon 5.00 

425237: U07595: Hs.l55227; EphB4; EphB4: S.00 

414809; AM34599: Hs.77356; transfenin receptor {p90, C071); transferrin receptor (p90. C071); 4.99 
402145; ; ; Target Exon; Target Exon; 4.99 
432126; AA865239; Hs.37195; ESTs; ESTs; 4.99 

408279; AF2169B5; Hs.44095; Homo saptens, done MGC::12617, mRNA, complete cds; Homo sapiens, done MGC:12617, mRKA, com; 4.98 
419525; T79257; Hs.1259; aslaloglycoprottin receptor 2; asabglycopratein receptor 2; 4.97 
424439; AA579635; Hs.1770; ligase I, DNA. ATP-^pendent Ugase i. DNA, ATP-dependent; 4.97 

427667; AK001279; Hs.180171: Homo sapiens cDNA FU10417 fis. done NT2RP1000112; Homo sapiens cONA aj10417 fis, done MT; 4.96 . 
457313; AF047002: Hs.241520; transcripCofla) coadivstor; transctiptionaJ coacfivalor; 4.96 

448569; BE382657; Hs.21488; signal transducer and adivator of transcription 1, 91kO; stgnal transducer and activator d trans; 4.94 
426427; M86599: H5.169840: TTK protein kinase; TTK protein kinase; 4.91 

440129; AA86581B; H$.369523; ESTs, Weakly similar to S71886 Ste20^iKe protein kinase [H.saplens]; ESTs, WeaUy similar to 871886 Ste20-lik; 4.91 
453922; AF053306; Hs.36708; twdcfing ur^tiibited by l)enzimidazoies 1 (yeast homdog). tHidding uninhibited tjy benzimidazoles 1 ; 4.90 
435602; AF217515; Hs.283532; undiaraderlzed bone marrow protein BM039; uncharaderized bone mamw protein BM03; 4J9 
452698; NM-001295; H8.301921; chenwklne (C-C mo80 receptor 1; dwmoWne (OC motif) receptor 1; 4.88 

445333; BES37641; H8.44278; hypoihe6cal protein FU12536 similar to ras^elated protein RAB17; hypofhelica] protein FU12538 similar to; 4.88 

437162; AW00SS05; Hs.S464; hormone receptor coadivating protein; thyroU hormone receptor ooadhrating pr 4.87 

441600; AA939347; Hs.1 27223; Homo sapiens cysteine knot protein (ZSIGS1) mRNA, comptete cds; Homo sapiens cysteine knot protein (ZSIG; 4.87 

421350; AW301608; Hs.278188; ESTs. Moderately similar to 154374 gene NF2 protein [H.sapiens]; ESTs, Moderately simiiar to 154374 gene ; 4.87 

409093; BE243834; Hs.S0441; CGM)4 protein; CGMM protein; 4.86 

424304; NIA.00139S; Hs.144879; dual spedfidty phosphatase 9; dual spedSdty phosphatase 9: 4.86 

437696; Z83844; Hs.S7gO; hypothetical protein dJ37Et6.S; hypothetical piotebt dJ37E16.5: 4.86 

417381; AF164142; H3.82042; solute canier family 23 (nudeobase transporters), member 1; solute carrier family 23 (nucleobase tra; 4.84 

416445; AL043004; Hs,79337; KIAA0135 protein; K1AA0135 protein; 4.83 

448939; BE257795; H3.22595; hypothelted protein FU10637; hypolhefcal protein RJ10637; 4£2 

413566; AW504451; Hs.285814; sprouly (Drosophila) homoteg 4; sprouty (DrosopWIa) homotog 4; 4.82 

424081; NM»005413; Hs.139120; n*bonudease P (30kD); ribonudease P (30kD); 4.81 

425474: Z48054; Hs.158084; peroxisome receptor 1: peroxlsorne receptor 1; 4.81 

412276; BE262621; H3.73798: macrophage n^ratlon inhibitory factor (glyoosylation-hhibiting factor); macrophage migration Inhibitory factor (; 4.81 

422689; AW856665: Hs.299797; gb:RC3^T0297.290100^)13Kl03 CT0297 Homo sapiens cDNA, mRNA sequence; gb:RC3<n'0297.29010^013<J03 CT0297 Homo; 4.80 

409101; NNL004297; Hs.50612; guanine nucleotide binding piotein (G protein), alpha 14; guanine nucleotide binding protein (G pr; 4.79 

412760; AW379030; Hs.41324: ESTs; ESTs; 4.79 

447250; AI878909; Hs.17883; protein phosphatase 1 G (formerly 2C}, magnesiunMlependent gamma isofonn; protein phosphatase 16 (fonneily 2C). ma; 4.79 
429345; R11141; Hs,199695; hypothetfcal protein; hypothetical protein; 4.78 
448950; AF288687; Hs.9275; CX5H52 protein; Ca-152 protein; 4.78 

412926; A1879076; Hs.75061; macrophage myristoylated danine^fch C kinase substrate: macrophage myristoylated danine^ C ; 4.78 
412641; M16660; Hs.74335; heat shodc 90kD protein 1, beta; heat shock 90kD protein 1, beta; 4.76 

420261; AW206093; Hs.748; fibrtibtast growth fador receptor 1 (fms^ated tyrosine kinase 2. PfeSler syndrome): fibrdilast growth factor leoeptor 1 (ftns; 4.76 
421905; AI660247; Hs.32699: ESTs. WeaHy similar to UV-1 protein [H.8apiens]; ESTs, WeaWy similar to UV-I protein IK' 4J5 
413860; AI660842: Hs.l 10915; interleukin 22 receptor; Interieukin 22 receptor; 4.75 

424905; NNL002497: Hs.1 53704; NIMA (never in mitosis gene a><eiated kinase 2 (NEK2): NIMA (never in mitosis gene a)-rBlaled k; 4.74 
418355; U2563; Hs.1165: ATPase. H? transporting, nongastric. alpha polypeptide; ATPase. H? transporting, nongastric, alp; 4.74 
435905: AWS97484: K9.5003: KtAA0456 protefai; K1AA0456 protein; 4.74 

428024; Z29067; Hs.2236: NIMA (never in mitosis gene a^elated Unase 3; NIMA (never in mitosis gene a}^latod It; 4.74 
421846; AA017707: Hs.1432: protein kinase C substrate 80K-H; protein kinase C substrate 80K-H: 4.72 
419136; U48508; Hs.89631; ryanodine receptor 1 (skdetaO: ryanodine receptor 1 (skeletal]; 4.72 
437296; AA350g94; Hs.20281; KIAA1700; KIAA1700: 4.70 

450142; AW207469E Hs.24485; diondrciOn sul^ proteoglycan 6 (bamacan); chondroiOn sdlato proteoglycan 6 (bama; 4.70 

421733; AL119671; Hs.1420; fibroblast growth fador receptor 3 (adiondroplaste. thanatophoite dwarfism); fibroblast growth tector receptor 3 (adi; 4.69 

449475; A1348027; H8.129826; hypothetical protein PP1057; hypothetfcal protein PP1057; 4.69 

420062; AW41 1096; Hs.94785; TGF(beta)-lnduced transcription fador 2; TGF(bete)-lnduced transcription factor 2; 4.69 

429336; Afi0Q5O38; Hs.19927Q; cytochrome P450, subfamily XXVllB {25-hydroxyvitamin D-l-alpha^iydraxytase). polypeptide 1; cytochrome P450, subfamily XXVItB (254iy; 4.69 
438856; A1469355: Hs.127310: ESTs; ESTs: 4.68 

406937; U14622; ; gb:Kuman transketolasfr^ike proldn gene, partial cds.; gb:Human transketdasfrllke proteto gene; 4.67 

41 1296; BE207307; Hs.10114; gnwrth suppressor 1; grwrth suppressor 1; 4.57 

426726; AA488915: H3.171955; trophlntn associated protein (tastin); Irophlntn associated protein (tasQn); 4.67 

4091 32; AJ224538: Hs,50732; protein kinase. AMP-adivated, beta 2 norKatelytIc subunil; protein kinase, AMP-adivated, beta 2 no; 4.67 

449230; BE613348; Hs.356392; melanoma cell adhesion molecde; mdanoma ceO adheston mdecute; 4;66 

431681; AK000378: H3.267566; hypothetical protein FU20371; hypotheticd protein FLJ20371; 4.65 

443823; AA34S519; Hs.9541; complement component 1. q subcomponent a^ha pdypepUde; oomplenienl component 1, q subcomponent ; 4.65 
441595; AW206035: H3.356457; ESTs; ESTs; 4.64 
423419; R55336; H3.23539; ESTs: ESTs; 4.64 

415724; NM.003580; Hs.78687; neutral sphingomyelinase (N-SUase) adivaiton assodated factor; neuird sphlngomyeilnase (N-SMase) activ; 4.63 
435045; BE297155; Hs.143698; ESTs; ESTs; 4.62 

424441; X14850: Hs.147097; H2A histone famHy, member X; H2A histone famly, member X; 4.62 
414972; BE263782; Hs.77695: KiAAOOOB gene product; KIAA0008 gene product; 4.62 

436685; W28661; H3.5288; Homo sapiens mRNA; cDNA OKFZp434M245 (from done DKFZp434M245); Homo saptens mRNA; cONA DKFZp434M245 {fr 4.62 
449515: A1653378; Hs.302012; ESTs; ESTs; 4.61 

425998; AU075629; Hs.165950; fibroblast growth fartor receptor 4; fibroWast growth fador receptor 4; 4.61 

420027; AF009746; Hs.94395; ATP-hlndlng cassette. sub-famQy D (ALD). member 4; ATP-«nding cassette, sub4an<iy D (ALD); 4.61 

436469; AK001455; H5.5198; Down syndrome critical regton gene 2; Down syndrome criUcd regton gene 2: 4.61 

413441; AI929374: Hs.75367; Sn>{ks^apter; Src^ke-adapter; 4.60 

456847; Ai360456; Hs.8^88; ESTs; ESTs: 4.58 

421508; BE302798; Hs,105097; thymidine kinase 1. sotubte; thymidine kinase 1, sdubte; 4.57 

426935; NM.000088; H3.172928: coSagen. type 1, dpha 1; coOagen. type 1. dpha 1; 4.57 

42878% X12830: Hs.193400: intertetddh 6 receptor. interteuMn 6 recepton 4.56 

409430: R21945; HS.34673S; spBdng fador. afglnlne^rine^ 5; splidng fador. arglntoefterine^kh 5; 4.56 

41 2773: HI 5785: Hs.74573! Mter to vacdnia virus Hindlll K4L ORF; dmiter to vacdnia virus HindlQ K4L OR; 4^ 
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443C58: AI188710; H5.374480: ESTs; ESTs; 4.55 

441607; NM.0OSO1O; Hs.791 i neuronai ceQ adheston imlecule; neuronal ceD adhesion molecute: 4.54 
453227; AW135862; Hs.243991; ESTs; ESTs; 4.52 

438459: T49300: Hs.35304; Homo sapiens cONA aJ13655 lis. done PLACE101 1503; Homo sapiens cONA FU13655 fis. done PU 4.51 
422555; BE259035; Ks.1 18400; singoJ (t>osophSa}4Ilce (sea urdiln bsdn homdog Ske); dnged (DrasophflsHilGe (sea wdiin fas: 4.51 
453613; F06838; Hs,374476; ESTs; ESTs; 4,50 

413900; AW409747; Hs.75612: stress-induced-phosphoproteln 1 (HspTOMspSOofganlxing protein): sIressMiced-pliosphoprDteIn 1 (Hsp7Q/H; 4.50 
412507; L36645: Hs.73964: EphM; EphA4; 4.50 

419034: NM.00211(): HS.89S55: bemopdeOccea kinase: hemopdeflc cdl kinase: 4.49 

413431; AW246428; Hs.75355: utriqunbvconjugaQng enzyme E2N (homdogous to yeast UBCI^; ubfquilbvcQniugaling enzyme E2N (homolo; 4.49 
406547; ; ; Target Exon; Target Exon; 4.49 

443216; WB0487: K5.324521 ; hypoOieCca] protein DC50; hypothelkx! protein DCSQ: 4.48 

417497; AW402482: H3.82212: CDS3 antigen; CD53 antigen; 4.47 

448595; AB014544; H8.21572; KIAA0644 gene product; KIAA0644 gene produd; AA7 

445350: AF052112; Hs.12540; iysophosphoOpase 1; lysophosphdipase 1; 4.46 

446238; NM„006293; Hs.301; TYR03 protein tyrosine kinase; TYR03 protein tyrosine kinase; 4.46 

429150; AF120103; Hs.197366; smoothened (Drosophfla) homolog; smoothened (Oosophia) honx)k)g; 4.46 

420340; NM.000734; Hs.97087: CD3Z antigen, zeta polypeptide (TTT3 complex); C03Z antigen, zeta polypeptide (T]T3 com; 4.46 

41 3426; U88837; Ks.75354; GCN 1 (general control of an^add synthesis 1 , yeasl)^ 1; 6CN1 (gsnerd contrd of andno^ synl: 4.44 

421819; KM013403; Ks.108665; ztnedin; zlnedbi; 4.44 

40951 2; AW979187: Hs.293591; mdanoma dlfferenOathm assodaled protein^; melanoma dlfferenOaOon associated prot; 4.44 

428995; AW00497S; Hs.194716; MAO (mothers against decapentqpleiiic Drosophlla) homdog tnteradlng protein, receptor acOvatkx) anchon MAO (mothers against decapentaptegic. 
Or; 4.43 

434551: BE3B71 6% Hs.280B58: ESTs. Highly similar to A35681 DMA excision repair cros^comptemenGng protein ERCC3 (Ksapiens]; ESTs. Highly similar to A35661 DMA exds: 
4.43 

418295; AyV970043; Hs.236039: hypoihetk»l protein FU11090( hypothetkal piotdn FUllOgO; 4.42 
409243; AB037761: Hs.51743; KIAA1340 protein; K1AA1340 protein: 4.42 
437103; AW139408; Hs.152940; ESTs; ESTs; 4.42 

413186; AU077141; Ks.374548; solute canter family 16 (monocart)oxylic ackl transporters), member 1; sdute canierfaniDy 16 (monocarbaxyQc; 4.42 
456362; AW973003: Hs.179g09; hypothetical protdn FU22g95; hypothetkal protein FU22995: 4.40 
424078; AB006625; Ks.1 39033; paternally expressed 3; paternally expressed 3; 4.39 

448153; Y10805;Hs.20521: HMT1 (hnRNP methyltransferase. S. cerevisiae)4ke 2; HMT1 (hnRNPmethyltransfBrase,S.cefevl:4.37 
410134; U58140: Hs.58927; nuclear VCP-like; nudear VCP-tike; 4.36 

435523: T62849; Hs.1 1090; membrane-spanning 4^omaIns, subfamily A, member 7; membrane-spanning 44lomains, subfandty A; 4.35 
400440; X83957; hls.83870; nebuTin; nebufin; 4.35 

437218; AL117497; Hs.58185; ESTs, Weakly similar to T42727 prdifsraaon potenflakelalBd protdn - mouse [Mjnuscdus): ESTs. Weddy slntiiar toT42727 prefifera; 4.34 
430478; Nlit014349: H$.241535; apoQpoproteln L. 3; apoDpoprotein U 3; 4.34 
432744; AA988835; H3.38664; ESTs; ESTs: 4.33 

423173; AA442655; Hs.124942; protdn phosphatase 2A 48 kOa regulatory subunit; protein phosphatase 2A 48 kOa regulatory: 4.33 
41 5995; Ni\t004573; H$.35588B: phospholipase C. beta 2; phosphdipase C, beta 2; 4.33 
408728: AL137379: Hs.47125; hypdhetfcd pmtdn FU13912: hypothefkal protdn aj13912; 4.30 
431222: X56777; Hs.273790; zona pdhidda gtyooprotdn 3A (sperm reoeptor); zona peDudda glyooprotein 3A (sperm re; 4.30 
427792; M63928: Hs.180841; himor necrosis factor reoeptor superfamiiy. member 7; tumor necrosis factor receptor supeifamh, 4.29 
426227; U67058; Hs.154299; Human proteinase activated receptor-2 mRNA, 3'UTR; Human proteinase acUvated receiptor-2 mR; 4.29 
, 427337; Z46223; Hs.176663: Fc fragment of IgG, tow affinity Ulb. receptor for (C016); Fc tragment of IgG. tow affinity lllb. r, 4.29 
434826: AF156661; Hs.22265: pyruvate ddiydrogenase phosphatase; pyruvate dehydrogenase phosphatose: 4.29 
412314; AA825247; H5.356064: downstieam of: G protelivooupled receptor 27 (GPI^ (SREB1); downstream of: G protein^pled recepto: 4.28 
447827; tJ73727; Hs.19718; protein tyrosine phosphatase, reoeptor type. U; protein tyrosine phosphatase, receptor t; 4.28 
426106; AA622037; Ks.166468; programmed cdl death 5; programmed ceB death 5; 4.28 

428820; AA438187; Hs.172631: integrtn, alpha M (comptetnent component receptor 3, alpha: also known as GDI lb (p170). macrophage antigen alpha polypeptide); integrin, dpha M 

(complement component ; 4.27 
437908; AI082424: H$.351043; ESTs: ESTs; 4.27 

444664; N26362; Hs.11615; map kinase phosphatase^ protein MK-STYX: map kinase phosphatose^ protdn MK^ 4.27 
429002; AW248439; Hs.2340; iundton plakogtobin; junctton pld«)globin: 4.26 
439334: A1148976; Hs.1 12082; ESTs; ESTs; 4.26 

425308; M97639: H$.155585: receptor tyrosine kinase^ orphan receptor 2; receptor tyrosine kinase^Qee orphan leo; 4.25 
413869; NM.000878; H3.75596: interteukln 2 receptor, beta: iderieukln 2 receptor, bete; 4.25 
453648; W21493; Hs.380100; hypotheticd protein FU14005; hypothettod protein FU14005; 4.24 

447200; BE543145; Hs.261434; Homo sapiens cDNA FU14026 lis. done HEMBA10O3838; Homo sapiens cDNA FU14028 lis, clone HE; 4.24 
447528; AI612027; Hs.76277: Homo sapiens, done MGC:9381, mRNA. comtdete cds; Homo saptons. done MGG:9381, mRi^. comp; 4.23 
452721; AJ269529; Ks.301871: solute carrier family 37 (glycerd^phosphate transportei). member 1; sdute cam'er fanfly 37 (glyoero)-3i)ho: 4.22 
449810: AB008681; Hs.23994; acfivln A receptor, type IIB; adivin A receptor, type liB; 4.22 
447196; 061523; Hs.283435; ESTs; ESTs: 4.22 

416714; AF28377Q; Hs.79630: C079A antigen (immunogtobulln-assodated dpha); CD79A antigen Cimmunogtobulin-assodated; 4.22 
425356; BE244879; Hs.155939; inositd pdyphosphate^phosphatase, 145kD; Inositol potyphosphate-Siihosphatase. 14; 4.22 
422605; H16646; Hs.118666: hypothetical protein PP591: hypothettod protein PP591; 4.21 

444535: AF011466: Hs.122575: E0&4 (endothdld dUferentiatton, lysophosphaitoto add G-proteln<oupled receptor, 4); ED&4(endotheQd differentiaGon. lys; 4.21 
417088; M5491S; KS.B1170; pim-l oncogene; pbn-1 oncogene; 4.20 
421707: NiyL01492l: Hs.107054: tedomadto-2: IedDmedIrv2: 4.20 

408717; AF045458; Hs.47061; unC'51 (C. degans^like kinase 1; uno-51 (C. degans)^ike Mnase 1; 4.20 
438485; W57578; H3.378718; RAB7. member RAS oncogene family; RAB7, member RAS oncogene family: 4.19 
419452: U33635; Hs.90572: PTK7 protein tyrosine kinase 7; PTK7 protein tyrosine ktoase 7; 4.18 
418755; Y14443; Hs.86219: ztoc finger protdn 200; zinc finger protein 200: 4.18 
417212: AW952823: Hs.351547: NSI-binding protein; NSI-binding proteto; 4.17 
413686; AI469213; Hs.71404; ESTs; ESTs; 4.17 

419344; U94905: Hs.277445; diacylglycerd kinase, zete (104kD): diacylglycerd kinase, zeta (104kD); 4.16 

418870; AF147204: Hs.89414: chemokine (C-X-C motH). receptor 4 (fusin): chemoklne (OX-C moQf). receptor 4 (he; 4.16 

408482: NH.O0G676: H8.45743; adenosine A2b receptor, adenosine A2b receptor. 4.16 

422391: 083479: Ks.115907: diacylglycerd kinase, delta (ISOkD); diacylglycerd kinase, ddta (130kD); 4.15 

409421; AA199883; H3.67624; ESTs; ESTs; 4.15 

415938; BE383507; Hs.78921; A kinase (PRKA) anchor protein 1; A kinase (PRKA) andw protein 1; 4.14 
415198: AW009480; Hs.943; ndurd kiOer cdl traiscrlpt 4; naturd killer cell transcript 4; 4.14 
424685; mi223: Hs.151734: nudear transport fador 2 (piacenld protein 15): nudeai transport 

664 



wo 03/042661 



PCT/US02/36810 



42B479: YQ0272; HS.334S62; eel) division cyds 2. 61 to S and G2 to M: cell division cyde ^ G1 to S and G2 lot 4.13 

423728; AW891294; H5.132136; sotuta carrier family 4, sodium Ucarbonaie cotransporter, member 8; solute carrier liamily 4. sodium bicarton; 4.13 

433435; BE545277: Hs.340959: Ts trsnslafion ekmgaiion factor, mitochondrial; Ts translalton elongalton factor, mitoch; 4.13 

4202S3; AI8S6055; Ks.96200; neighbor of A^ase anchoring protein 95; neiglibor of Apktoase anchoring protein 9; 4. 12 

41 2SB4; X54670; Hs.74085; ONA segment on chromosome 12 (unique} 2489 expressed sequenoe;^ DMA segment on chromosome 12 (ur^ue) 24; 4.12 

400205; ; Hs.81 848; Nlwl_XS265':Homo sapiens RA021 (& pombe) homobg (RA021), trSm. (APO-1/C095 {Fas>-as50cialed phosphatase} {PTPN13), mRNA.; 

m 006265*:Homo sapiens RAD21 (S. pombe); 4.12 
425322; U63630; HS.15S637; protein kinase. DNA^tivated. catalytic pdypepQde; protein kinase, DNA-activated. catalytic: 4.1 1 t' 
41 9607; R52557; H5.91579; Homo salens done 23783 mRNA sequenco; Homo sapiens ctono 23783 mRNA sequence; 4.10 
459035; AW291109; HS.332S63; ESTs. Weakly sNlartoT31611 hypoflielicd 

4.10 

432512; NKI.003284; Hs.3017; transition protein 1 (during hlstone to protamine replacement); b-ansition protein 1 (during histone to ; 4.10 

452875; BE275760; Hs.30928: DNA segment on chromosome 19 (unique) 1177 expressed sequence; DNA segment on chromosome 19 (unique) 1.1; 4.09 

423804; AW403448; Hs.1706; interferon-sMated transcription (actor 3, gamma (48kD); Interferoivstlnwlated transcription fact; 41)9 

413745: AW24725% Hs.75514; nudeoskle phosphorylase; nucleoside phttphorytese; 4.09 

442091; AW770493; Hs.162874; guanine nudeoQda binding protein (G protein) alpha 12; guanine nucteotUe Undng protein (G pn 4.09 

425003; AF1 19046; Ks.154149: apurink:/apyrimidlnic endonuclease(APex nudeaseHito 2 protefai; apurfnic^mMinlc endonuctease(APEX ; 4.08 

421859; AA356620; H8.108947: K1AA0050 gene product; KIAA0050 gene product; 4.08 

432841; M93425; Hs.62; proteh lyrastne phosphatase, noiweceptor type 1% protein tyrosine phosphatase, norweoept 4.08 
452069: A8028949; Hs.183994: KIAA1026 protein; KIAA1026 protein; 4.08 

425069; AA687465; Hs.29ai84; potassium voltage-galed channel, shaker-related subfamily, beta member 2; potasstom vdtage-gated channel, shaker-; 4.07 
418526; BE019020; Hs.85838; solute carrier family 16 (monocarboxyHc add iransporters), member 3; solute canier family 16 (monocarboxytic; 4.07 
424517; A)539443; Hs.137447; Homo saptens cONA FU 12169 lis. done MAMMA1000643; Homo sapiens cONA FU12169 fis, done MA; 4.07 
434224; AA380731 ; Hs.64; interieukin 2 receptor, gamma (severe combined immunodelidnicy): InterieuMn 2 receptor, gamma (severe oo; 4.06 
446791; AI632276; HS.19S922; ESTs; EST8;4.06 

432065; AA401039; Hs.2903; pmtein phosphatase 4 (formerty X). catalytic subunit; protein phosphatase 4 (fbnneriy X). cata; 4.06 

431 194; D43704; Hs.250712: cddum channel. vdtagoHJependent. bete 3 subunit; catctum channel, voitageKtependent beta; 4.06 

418751; BE389014; Ks.372548: phosphoinosiiide-3-kinase. regulatory subunit. polypeplkJe 3 (p55. ganvra); phosphoinosUkie-S-kinase, regUtetoiy su; 4.06 

425923; NM.005026: Hs.162808; phospholnosItkte-3-klnase, catalytic delte pdypepOde; phosphoteosBDde^ase. catelytlc. do; 4.05 

408692; AL040127; Hs.34074: dipeptWylpeptidase VI: dlpepSdylpeplidase VI: 4.04 

446272; BE2&B912: Hs. 14601; hematopoietic ceU-spedfic lyn substrate 1; hematopoietic ceO-speciric Lyn substrat; 4.04 

439176; AI446444; Hs.190394; ESTs. Weakly similar to B28096 Dne-I protein OPfl [H.sap{ens]; ESTs, Weakly simHar to B28096 line-l pr; 4.04 

417880; BE241595; Hs.82B48; setectin L (lymphocyte adheston molecute 1): selecGn L (lymphocyte adheston motecute; 4.04 

410068; AI633888; Hs.5M35; FYN-blnding protein (FYB-1 20/130); FYN-blndIng protein (FYB-1 20/1 30); 4.03 

410639; BE269047; Hs.65234; hypothetfcal protein FU20596: hypolheteal protein FU20596; 4.03 

427716; 1^8951; Hs.180446; kaiyopherin (importin) beta 1; karyopherin Omporttn) bete 1; 4.03 

451050; AW937420: Hs.351869; ESTs; ESTs; 4.02 

449687; AB023227; Hs.23860: KIAA1010 protein; KIAA1010 protein; 4.02 

448499; BE613280; Hs.77550; p53^u]ated D0A3: pSS^uteted DDA3; 4.01 

437527; AI241019; Hs.145644: ESTs; ESTs; 4.01 

425118; AU07661 1; Hs.154672; methylene tetrahydrofbteto dehydrogenase (NAD dependent), methenyltetrahydrofblalte cydohydrotese; metttytene tetrahydrolblate detydrogenase; 
4.00 

451931; AK000208; Hs.27267; Homo sapiens cDNA FU20201 fis, done COLF1210; Homo sapiens cDNA FIJ20201 fis. dona CO; 4.00 
412939; AW411491; Hs.75069; eukaryotic Iranstetton etongation factor 1 gamma; eukaryotic translation elongallon factor; 4.00 
409581; U66243; Hs.55039; mitogen-actrvated protein kinase 12; mttogervactivated protein kinase 12; 3.99 
433577; AW007080: Hs.284ig2; ESTs; ESTs; 3.99 

439963; AW247S29; Ks.6793; platetet-activating factor acetylhydrdase, isofonn lb, gamma subunit (29kD); ptetelet-acfivaQng factor acetylhydrola; 3.99 
418629; BE247SS0; Hs.86859; growth factor receptor-bound protein 7; growth factor receptor-bound protein 7; 3.99 
448633; AA311426; Hs.21635: tubulin, gamma 1; tubulin, gamma 1; 3.98 

402398; ; ; C19000263:gi}31080231gblAAC1S755.1| (AC004659) BC62940.2 (Homo88piens]Il66335; Cig000263.-gq3108023]gb|AAC15755.1| (AGO; 3.97 
408414; Ail 14688; Hs.193400; ESTs, Weakly simiiar'to 210926QA B cdl growlh factor [Ksaptens]; ESTs. W^ simiter to 2109260A 8 ceB ; 3.97 
415012; NM.004383; Hs.77793; tyrosine kinase; c-src tyrosine kinase; 3.97 

416084; 116991; Hs.79006; de(»yihyniidy(ate kinase (Ihymic^tete kinase); deoxythynAfytate kinase (thynvdytete kin; 3.95 

422051; AW327546; Ks.1 1 1024; solute canier family 25 (ntitochondriai carrier; dtrate transporter), member 1 ; solute canier family 25 (mitochondrial ; 3.95 

447887; AA114050; Hs.21 1610; caspase 8, apoptosis-rel^ cysteine protease; caspase 8, apoptosis-related cysteine pr; 3.95 

430770: AA765694; Hs.123296: ESTs; ESTs; 3.94 

442994; AI026718; Hs.16954; ESTs; ESTs; 3.94 

420333; AJ001383; Hs.g7064; lymphocyte antigen 94 (mouse) homolog (activating NKfeoeptor. NK-p46); lymphocyte antigen 94 (mouse) homdog (a; 3.94 
438456; AA913381; Hs.279763; ESTs; ESTs; 3.94 

422599; BE387202: Hs.1 18638; non^etastaiic cells 1, protein (NM23A} expressed in; notHnetasteSc ceQs 1, protein (NM23A) ; 3.93 
420162; BE378432; Ks.95S77; cydirvdependent kinase 4; cydin-dependent kinase 4; 3.93 

424829; NM_002507; Hs.1827; nerve growth fiactor receptor (TNFR superfamiiy, member 16); nerve growth factor receptor (TNFR super; 3.93 

447574; AF162666; Hs.18895; tousled^ kinase 1; lousted-fike kinase 1; 3.93 

425797; AF002986; Hs.1 59545; platelet actuating receptor homdog; platelet aciivaSng receptor homotog; 3.93 

421910; NM.014586; Hs.109437; hormonally upregutated neu tomor*associated ktoase; honnonaliy upregdated neu tumor-assoda; 3.92 

413781; J05272; H3.850; IMP (inoslne monophosphate) dehydrogenase 1; IMP (Inosine monophosphate) dehydrogenas; 3.92 

434334; AA912476; Hs.116750; Homo sapiens cDNA FU13221 fis. done Mr2RP4002075; Homo sapiens cONA FU13221 fis, done NT; 3.92 

400262; ; Hs.75309; Eos Control; Eos Control; 3.90 

424977; AA349289; H8.100057; Homo sapiens cONA: RJ229Q2 fb. done KAT05581; Homo saptens cDNA: FU22902 fis. dona K; 3.90 
409799; D1 1928; Hs.76845; phosphoserine phosphatase-like; phosphoserine phosphatese-Gke; 3.90 
435206; AI432364; Hs.160594; ESTs; ESTs; 3.90 

439863; BE547830; Hs.375208; paired immunogtobufin^ike receptor beta; paired immunogiobuiln-iike receptor beta; 3.90 
413627; BE182082; Hs.246973; bitron of Btoaudal 0 homolog 1; intron of Bcaudd D homdog 1; 3.90 
426265; AA421069; Hs.97896; ESTs; ESTs; 3.89 

451063; AW163702; Hs.25911; HIA-B assodated transcrip(r2: HLA-B assodated tjanscript-2: 3.89 
407013; U35637; Hs.83870: gbiHuman nebuQn mRNA, partial cds; gb:Hum8n nebuftn mRN\ partial cds; a89 
437239; AW503395; Hs.5541; ATPasa. Ca transporting, ubk^uitous; ATPase. Ca transporting, ubk^uttous; 3.88 
400281; ; Hs.1802: Eos Gontrd; Eos Control; 3.88 

450447; AF212223; HS.2S010; hypothetteal protdn PI 5-2; hypothetted protdn P1S-2; 3.88 

42Z293; X94453: H3.1 14366; pyrrdine-S«^boxytete synthetese (gtetarnate gamma^emiaidehyde synthetase); pynQline-&<arl)oxylate synthebse (glut 3.87 
414K1; AL042306; Hs.97689: VASA protein; VASA protein; 3.87 

417767; BE242241; Hs.8254^ acytoxyacyl hydrolase (neutrophil); acytoxyacyl hydrolase (neutrophiO; 3.87 

414443: AU0n268: Hs.76144: platetet-^ierlved growth fedor receptor, bete pdypepSde; ptatetet^terived growth fador receptor^ 3.87 
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444099; 087432; Hs. 10315: solute canter lamOy 7 (caflonic amino acid transporter, y system), member 6: solute carrier fan^ 7 (cationk; amino : 3.66 

419696; BE379320; Hs.9144et MKP-I Eke pioteln lyniane phosphatase; MXP-1 like protein tyrosine phosphatase; 3.86 

427022; AW245839; H5.173^; small nuclear ribonudaopnitein polypepQde A; smafl nudear rit)om)deoprots!n pdypept; 3.66 
- 443861; AA336609; Hs.10862: Homo sapiens cDNA: FU23313 6s.donB HEP11919; Homo sapiens cONA: FU23313 fis. done H; 3.86 
D 408056; AA312329; Hs.42331; ephrm-A4; ephr1frA4; 3.86 

410552; X66945: H3.748; Olvoliiast growth (actor rec^ 1 (fms-related tyrosine kinase 2. Pfeiffer syndrame); fibroblast orowlh laOar receptor 1 (Ims; 3.85 

450778; U6137S; ^25450; solute carter family 29 (nudeosUe transporters), number 1; sotute carrier famify 29 (nudeoside be; a8S 

418978; T85295: Hs.268605; ESTs; ESTs; 3.84 

429903; AL134197; Hs.93597; cydlnKfependent kinase 5, regulatory subunil 1 (p35): cydirHlependenl kinase 5^ regulaton su 184 
lU 442980; AA857025;Hs.8878;kinesln^ke1;kInesln4ikBl;a84 

447232; AW499834; Hs.327; Interteukin 10 receptor, alpha; inteiteuWn 10 receptor, alpha: 3.84 

^'^^ (cartjoxy-termlnai domins^ RNA pdymeiase II, pdypeptUe A) phosphatase, subunil 1; CTD (carboxy-terminal doming RNA pdyme; 3.83 

432539; AL138169; Hs.278378; karyopherin beta 2b. Iransportin; karyopherin beta 2b, transportin; 3.83 
, - 41 5684; 059356; Ks.374480; sorbitol dehydrogenase; sorbild dehydrogenase; 3.83 
1 5 451598; N29102; Hs.79658; ESTs; ESTs; 3,82 

449433; AI672098; Hs.901 2; ESTs, WeaMy simBar b S26650 DNA4}inding protein 5 (Ksapiens); ESTs, Vfeddy sinto to S26650 DNA-bindl; 3.82 

428359; AA376409; Hs.10862: Homo sapiens cONA: RJ23313 fis. done HEP1 1919; Homo sapiens cDNA: FU23313 Us. done H: 3.81 

435160; AB002374; Hs.4791; KiAA0376 protein; KIAA0376 protein; 3.80 

443402; U77845; Hs.9295; eiastin (supravalvular aorfic stenosis, Winiains4euren ayndrone); elastin (supravahnHar aortn stenosis. : 3.80 
ZU 42Z753; A1928995; Hs.1575; small nudear ribonudeojuoteln 03 pdypepQde (18kO); small nuclear ribonudeoprolein 03 polyp' 179 
421508; NM.004833; H3.105115: absent in melanoma 2; absent In melanoma 2; 3.79 
414806: 014694; Hs.77329; phosphalidytserine synthase 1; phosphatldytserine synthase 1; 3.79 

428023; AL038843; Hs.374530; Homo sapiens cONA: fU23602 U, done ING15735: Homo sapiens cONA: FU23602 fis, done L; 3.79 

421654: AW163267; Hs.106469; suppressor of varl (S.cerevlslae) ma 1; suppressor of varl {S.cerevisiae) 3^ 3.79 
ZD 439668; AI091277; H8.302634; frizzled (Dnsophlla} homolog 8; fHzzled (Drosophila) homdog 8; 3.79 

41 1125; AA151647; H3.68877; cytochrome b-245, alpha polypeptide; cytxhrome b-245, alpha polypepfide; 178 

429170; NM_001394; Hs.2359; dual epedficity phosphatase 4; dual specifidly phosphatase 4; 178 

425354; U82027; Hs.1 55935: oompJement oonnponent 3a receptor 1; comptemenl component 3a receptor 1; 3.78 

42390% AJ223163: H8.135194: Immunogtobulln supeifanily. member 6; Immunoflbbulin superfemily. member 6; 178 
JU 429687; AI675749; Hs.211608; nudeoporin 153fcD; nudeopoiin 153kO; 177 

414177; A1351355; Hs.356303; uncharaderlzed hypothalamus protein HARP11; undiaraderized hypothalamus protein HAR; 177 

414135; NM^004419; Hs.2128; dual spedfidty phosphatase 5; dual spedfidty phosphatase 5; 177 

445817; KM.003642; Hs.13340; histone aoetyllrensferase 1; hbtone acetyttransferase 1; 177 

410422; ALM2014; Hs.63348; Homo sapiens, clone MSC:15203. mRNA. complete cds; Homo sapiens, done MGC:15203, mRNA, com; 176 
J3 412146; M92444; Hs.73722; APEX nudease (multifiindtonal DMA repair enzyme); APEX nuclease (mdlifuncyooal DNA repd; 176 
413011; AW0681 15; Hs.821; biglycan; Wglycan; 176 
428157; AI738719; H3,198427; hexoklnase 2; hexoklnase 2; 176 

400288; X06256; Hs.149609; integrln. alpha 5 (filxonedin receptor, alpha polypepUde); Integrih, alpha 5 (fibronedin receptor,; 175 
. . 427378; BE515037; Hs.177556; melanoma antigen, family 0, 1; mdanorra anUgen, f^ D, 1; 3.75 

4U 405484; ; ; C3002124*:gi|12737280|refp(P,005682.2|keraBn 18 (Homo sapiens]fl6633; C3002124':gij12737280|reflXP 006682.21 k; 175 
450998; BE387614; Hs.25797; spfidng lador 3b. subonit 4, 49kD: spfidng fador 3b, subunit 4, 49kD; 175 
432460; H12912; Hs.274691; adenylate kinase 3; adenylate kinase 3; 3.75 

42881 6; AA004986: Hs.193852; ATP-Wnding cassette. sub-tamBy C (CnWMIV). member 2; ATP-bindIng cassette, 8ub4My C (CfTR; 174 
45 ^^^^toM^S^HUM^^^ 3^?* ^ ALUBJiUMAN ALU SUBFAMILY SX SEQUENCE COMTAiyBNATtON WAITING EMTRT [Ksapfensl; ESTs. Vlteakly stirdar 

453329; T972(K; Hs.193400; ESTs, Weakly similar to 210926aA B cell growth fador [Ksapiens]; ESTs. Vteakly similar to 2109260A 8 ceil ; 174 
426440; BE382755; Hs.159902; solute canter family 2 (fadStated ghicose bansporter), member 1; sohita canier farely 2 (facililated glu; 174 
438330; AW450572; Hs.257316: ESTs; ESTs; 3.74 

419911; L15301; H5.1276; BN51 (BHK21) temperature sensilhrttycomptemenling; Bl^51 (BHIC21) temperature sensitivity com; 3.74 
jU 452695; AW780199: Hs.30327: mitogen-adhraled protein kinase-adivaled protein kinase 5; mitogen-adrvaled protein Wnase-adwal; 173 

442233; AW967149: Hs.28439: ESTs, Weakly sliiillar to 136022 hypothetical protein {H.sapiens]; ESTs. Weakly simQar to 138022 hypotheti' 3 73 
451295; AI557212; Hs.17132; ESTs. Moderately similar to 154374 gene NF2 protein [H.saplens]; ESTs, Moderately similar to 154374 gene ; 173 
410772; BE275297; Hs.194685: Homo sapiens done 24675 mRNA sequence; Homo sapiens done 24675 mlVIA sequence; 3.73 
426251; M24283: Hs.168383; Interoenular adheston molecule 1 (CD54), human rhinovinjs recepton Intercellular adhesion molecule 1 (C054); 172 
0 J 449843: R85337: Hs.2403tfc sduta carrier family 31 {copper transporters), member 2; solute caniar family 31 (copper transpo; 3.71 
423523: AW299828:Hs.193580; ESTs; ESTs; 3.71 

413407; AI356293; Hs.75339: inositol polyphosphate phosphatase-like 1; Inosild polyphosphate phosphatase-fike ; 171 
448336; R53848; Hs.44976; ESTs; ESTs; 3.70 

422083; NM_001 141; Hs.111256; arachWonate 15-flpoxygenase, second type; aradiWonale 15-lipoxygenase, second lyp: 3.70 
OU 416087; AF045184; Hs.79008: SKHNTERACT1NG PROTEIN; SKHNTERACTING PROTQN; 3.70 
442200: AWS90572: Hs.235768; ESTs; ESTs; 3.70 
414280; BE410769; Hs.75873: zyxin; zyxln; 169 

409354; N68ie8: Hs.159472; Homo sapiens cONA: FU22224 6s, ctone HRC01703; Homo sapiens cONA: FU22224 fis, clone H; 169 
415276; U88666; H3.78353; SFRS protdn kinase 2; SFRS protein kinase 2; 169 
0-> 439659; AW970780; Hs.59483; leudne^lch repeat-containing G protein^aupled receptor 6 (LGR6) mRNA; leudn&4di repeat'«ontaining G protein; 3.69 
446522; NM.003876; Hs.15196; putative receptor protein; putative receptor protein; 169 
422785; AI824114; H3.289088; heat shod 90kD proteki 1. dpha; heat shock 90kO proteb) 1, alpha; 168 

401083; : ; NM.016582*:Homo sapiens peptide transporter 3 aOC51296). mRNA. VERSION NM-016579.1 Gi; NM-OieSB^rHomo sapiens peptUe tmspor; 168 
413048; M93221:Hs.75182;manncse receptor. C type 1;mannosereoBptof.Cbrpe1; 168 «r. r-r- 

70 452690; Ai538070; Hs.15085; ESTs; ESTs: 168 

428981^6^E3J3077^ W^sMy similar to ALU2JiUMAN ALU SUBFAMILY SB SEQUBJCE CONTAJyilNATlON WARNING ENTRY (Ksaplens); ESTs. Weakly simHar 

415010; NK^004203; Hs.77783; membrane-assodated tyrosine- and Ihreodne-spedfic cdc2-inhlbttory kinase; mefnbran»as80dated tyrosine- and threon- 3 68 

428579; NM_005756; Hs.184942; G protdrvcoupled receptor 64; G proteIrH»upled receptor 64; 168 
/ D 446430; AA345837: H3.15075; hypothetical protein DKFZi)434E2216; hypothetteal protein OKFZp434E2216; 166 

442013; AA508476; Hs.375009; Human ONA sequence from done RP1 1-35301 8 on chromosome 20 Contains ESTs. STSs. GSSs and CpG Islands. Contains the NIFS gene for 
cysteine desuKurase. two genes for novel proteins and the gene for the; Human ONA sequence fiwn done RP1 W53C1; 3.66 

41 6602; NM_006159; Hs.367895: Protein kinase Obindtng protein NELL2; PTOteto kinase ^binding protein NELU: 165 
80 ^^^^•^^563042: Hs.118820: Homo saptons, Ma to RIKEN cONA 0610012G03 gene, done MGG14132. mRNA, complete cds; Homo sapiens. Sirrfiar to RIKEN cDNA 0610s 

413076; U10564; Hs,75188; weel {S. pombe) homdog; weel (S. pombe) homolog; 165 

429303; AW137635; Hs.44238; ESTs. Weakly similar to S65657 alpha-IC-adreneiglc receptor splice ibmi 2 (H.sapten3]; ESTs, Weakly similar to S65657 alpha-IC-: 3.65 
452060; W2e980: H8a49089: ATP-binding cassette. sidMamily F (GCN2fl), member 2; ATP^ding cassette, sub-iamBy F (GCN2; 165 
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451 558; NM_001089: Hs.26630; ATP-«nding cassete. sub-famBy A (ABCt), member 3; ATP^Wing cassette^ wMamBy A (ABCl; 3.65 
414774; X02419; Ks.77274; plasminogen activator, ureidnasa; plasminogen acdvator. urokinase; 3.65 
417426; NK4_002291; Hs.B2124; Mnfai, beta 1; lamlnin. beta 1; 3.65 

450296; A1041949; Hs.24756; hepatocyle growth fector4eguIaIed tyrosine Idnase substrate; hepalocyte growth (ador-fegulaled tyros; 3.64 
D 437659; AI358105; H3.1 23164; ESTs, WeaWy similar to match to ESTs AA6679» (Ksaplensj ESTs, WeaHy sbiBar to match to ESTs AA; 3.64 
414029; BE297731; Hs.75709; mannos^efhosphate receptor {caBon dependonQ; mannoseh6fhosphate receptor (cafion dep; 3.64 
444388: AB033058; Hs.11101; iaAA1232prote!n; K1AA1232 protein; 3.64 

425910; AA830797; Hs.184760: CCAATte^Jinding Iransaiplion factor. CCAAT-box^ng transcription factor; 3.63 

451484; AV648896; Hs.283771; hypothetical protein; hypothetical protein; 3.53 
1 U 444613; H29627; Hs.79092; hypothetical protein aJ14427; hypothetical protein aJ14427; 3.63 

447495; AW401864; H5.18720; programmed cell deadi 6 (apoptosis4ndudng factor); programmed oeQ death 8 (spoptosis^uc; 3 62 

408101; AW968504: HS.Z78348: CDC2.<etated protein kinase 7; CDC2.ro!^ protein kinase 7; 162 

424732; D80001; Hs.152629; KIAA0179 protein; KIAA0179 protein; 162 

41 1165; NM_000169: Hs.e9089; galactosidase, alpha; galactosfaJase, alpha; 3.62 
1 D 422112; BE540240; Hs.111783; t.sm1 protein; Lsmi protein; 3.62 

453020: AL162039; Hs.31422; Homo sapiens mRNA; cONA DKFZp434M229 {from done DKFZp434M2»); Homo sapiens mRNA; cONA OKFZp434M229 (1^ 161 

438795; AA825792;Hs,377119;gb,*od84b1U1 NCI_CGAP.0v2 Homo sapiens cDNA done, mRNA sequence: gb:od84b11 .si NQ CGAP Ov2 Homo sapiens -3 61 

445515; BE388665;Hs.179999; Homo sapiens, done IMAGE:3457003. mRNA; Homo sapiens, done IN^ ' * 

407797; AK000524; Hs.39850; hypothetical protein aJ20517; hypothetical protein FU20517: 160 
Z\) 423217; NM_000094; H5.1640. collagen, type VII, alpha 1 (eptoennolysb bullosa, dystrophic, dominant and recasshre); odiagen. type VII. alpha 1 (epkiemralys; 160 

444985; AI677737;Hs.380100: hypothetical protebRJ14005:hypothefcaI protein FU14 # /i~ \ 

433764; AW753676; Hs.39982; zinc finger protein RINZF (NM.023929): zinc linger protein RIN2F (NM 023929); 189 

427857; AL133017; Hs.288679: hypothetical protein FU22865; hypothetkai pro^n FU22B65; 159 

415020; BE249915; Hs.293533; Human DMA sequence from dona RP1 1-1271^ on chromosome 10. Contains ESTs. STSs. GSSs and CpG isfands. Confalns the gene for a novel 
jlO gtutathione-S-transferase and five novel genes; Human DNA sequence from done RPI 1-127^ 

415149: X12451; Hs.78056; cathepsin calhepsin I; 3.57 

458715; AK000973; Hs.1706; hypothetical protein FU10111; hypolhelkjal protein aJIOIII; 3.57 

423576; NM.000383; Hs.129829; autdmnume regulator (auionimmune pdyeaidocrinopathy candidiasis ectodermal dystrophy); autoimmune regulator (aulomimmune ndven- 3 57 
440270; N^L015986;Hs.7120; cytokine reoeptor^lkemdecde 9; cytokine reoeptor^kemdecute 9: h-/ . • 

jU 434883; AW381538: Hs.19807; hypothefica! protein »^GC12959; hypotheOcal protein M6C12959: 157 

404975; ; ; NI\^014323*:Homo sapiens zinc linger protein 278 (ZNF276). tnnscrfpl variant 1. mRNA; NM-014323':Homo sapiens zlhc finger orot 157 
449656; AA002008;H5.188633; ESTs: ESTs; 156 
413795; AL040176; Hs.1420O3; ESTs; ESTs; 156 

406859; AI581 134; H5.181357: lambiin receptor 1 (67kO. tibosomal protein SA); bmMn receptor 1 (67kD, ribosomal prot: 156 
35 411030:BE387193;H8.67896;7^0protein;7.60proteln;3.56 

447079; AA280057; Hs.105280; ESTs, Weakly similar to dJg63K212 [H.8apiensl; ESTs. Weakly sMter to dJg83K212 {Ksa: 156 

424263; M77640: Hs.1 757; LI cell adhesion molecule (hydrocephalus, stenosis of aqueduct of SyMus 1, MASA (mental refardation, aphasia, shuffling gdt and adducted Ihumbs) 
syndrome, spastto parapiegfa 1); LI cdl adhesion mdecde (hydrocephahjs; 155 

413472; BE242870, Hs.75379; sohite canter family 1 (gDai high affinity glulamale Iransporter), member 3; solute carrier family 1 (alial high affi: 155 
4U 443465; BE243123; Hs.321045: IKK-relaled kinase epsilon; indudbte IkappaB kinase; IKK-relaled kinase epsaon; indudble Ik; 3.55 

426745; J03626: Hs.2057; uridine monophosphate synthetase (orotate phosphoribo^ translerase and orotkllne-5'-decartx»qflas8]i; uridine monophosphate synthetase (orotat; 155 

450931 ; N25156; Hs.25648: tumor necrosis factor receptor superfamliy, member 5; tumor necrosis factor receptor superfamb 155 

425836; AW955696:Hs.90980: ESTs; ESTs; 154 «^ k« 

. - 441054; AA913591; Hs.126480: ESTs; ESTs; 154 
45 440592: AL137258: Hs7285: KIAA0759 protein; K1AA0759 protein; 154 

458946; AA009716; Hs.42311: ESTs: ESTs; 3.53 

449027; AJ271216; Hs.22880: dipepQr^pUdase 111; dtpepSdytpeptidase 111; 153 

421652; NM_014141 ; Hs.105552; cell recognlfion mdecute Caspr2; cell recogidfton motecule Caspr2: 3.53 
- 422732; AA577455: Hs.24937; transfonner-2 alpha (htra-2 alpha); transfonner-2 alpha (htra.2 alpha): 153 
50 424870; T15545;Hs.244624; ESTs; ESTs; 152 

442794; AI744130: Hs.356753; hypothetical protdn MGC2975; hypotheticd proteh M6C2975; 152 

417640; 030857; Hs.82353; protein C receptor, endothefial (EPCR); protein C receptor, endothdld (EPCR); 151 

41 W71; AA400027; Hs.295234; ESTs. WeaWy similar to T31 613 hypothetkai protein Y50E8A1 - CaenorhabdiOs etegans {Cetegansl; ESTs. We^ similar to T31613 hypotheU: 3 51 
410257; BE244044;Hs.61469;hypothdicd protdn; hypothetical protein; 151 i » i. . j 

D D 424837; BE2761 13; H8.333034; N-acetyltnansferase. homdog of S. cereviaae ARDl; N-ace^transferase. homdog of S. cerev; 151 

421921; H83363: Hs.355993: tianstocase of inner milDchondrtd membrane 10(yeasO bomolog; banstocase of Inner mitochondridmembr. 150 
454128; AL031259; Hs.367g0O; programmed oeO death 2; programmed cdl death 2; ISO 

434049; AA501430; Hs.5771 ; amyolrophte lateral sderosis 2 fluvenlte) chromosome region, candidate 2; annobopKte laterd sderosis 2 (hivenih 150 

453641; AA444140;Hs.9096O. ESTs; ESTs; 150 
OU 429592; A602904t; HS.209&46; KIAA1118 protdn; KiAA1118 protein; 149 

430547; AC003682; Hs.12798a: ESTs. WOddy simiter to Z21 LHUMAN ZINC RNGER PROTEIM 21 1 (Ks^iiens) ESTs. similar to Z21 1 HUMAN ZINC ; 147 

410B55;X97795;Hs.66718;RAD54(S«revlslae)^;RAO54(SxerevisiaBHke;lM » ^ > ' .nw«.««« u.a^r 

413372; H55532: Hs.349695; tubdln, alpha 2; tubulin, alpha 2; 3.07 

437224; All 17628; Hs.97808; ESTs; ESTs; 177 
05 430439; AL133561; ; OKFZP434B061 protein; DKFZP434B051 protein; 176 

435897; AF269223; Hs.128322; komjitex 1 1 {a murine top homdog): l-complex 1 1 (a murine top homdog); 153 

VJ^ ^ dnwnsome segregatton protein smcl (H.8aptensl; ESTs. WeaWy similar to 154383 diromosom: 2.46 

412026; AA383618; Hs,73073; lesbs-spedfic ankyrin mdif confaining protebi; testis^pedfto ankyrin rodif contddng; 2.35 
422789: AK001 1 13; H3.120842: hypothefical protein FU10251; iiypdhelfed protein FU10251; 233 
70 426627; AF012359; Hs.195685; ESTs; ESTs; 2.12 

438983; AF085864: Hs.20029; proacrosin binding protein sp32 precursor; proacrosin binding proteh sp32 precurso; 2.07 
425709; AA383076; Hs.159274; outer dense fibre of spenn fails 1; outer dense fibre of spenn tails 1; 1.99 
433724; AI827749; Hs.144924; serine^reodne protein kinase SSTK; serineflhreodne protein kinase SSTK; 1.68 
420710. NM.007009; Hs.g9875: zona pdtodda binding protdn; zona pdludda Undmg protein; 1.54 

TABLE57B 

Pkey: Udque Eos probesetUenOiter number 
^ ^ CAT number Gene duster number 
oU Aocesston: Genbank accesston numbers 

Pkey CAT Number Aocesston 
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417886 1031334.1 AA210SB7 DS7294AA214564 AA2O7006 D56572 

432407 MH142J.12 8G03667SBF7r2005BF771868BG960386BG960381 NM.005712AF110315BB)74S34BE182776BE1580008E157999BE714315AVra^ 

d5S^!52SS5»^.2SI*JI^^"^'^*^^'^ BF90215SAI732411 BG778834 BG283641 BE7482^ 

^!S1^5f?1Sg? BTO^ Bffl155 AKD572a3 BB61466AA663341 AM57591 BG949294 AW392B86 AA07112? AA?^7)liia 
AA584918BG9S9570BF7734B6AL04ie98BF959013R87170C16859BF770411 BF771298 AI075321 L13823AA216700BF77ie84AW861tB9 
BES37068 C18935 AA155719 BF771 172 BR69107 BF804964 AWB18172 AW818143 AW392930 AW817057 AW858044 BF74621 1 AAI^M 
AW861687 AW82ie26 BI055726 BF242643 AA207189 BF770412 BF771 157 BG430030 AA0^2 
^lSJlSi22S5S!^HH^'^^"'^*'^"^*'"8'5S99253AVTO61851AW85ra 

AA0555S6 BF7734flO »«8869 BE081333 60)73424 BE142245 HS9571 H59S70 BFB71558 BF871064 BE001132 BF826831 AW754298 
AA2a267 6(399789586997897 AW991957 AAS343S4 B6319501 BF736309 Aie94a5AA04S564 BG950256AI829309BG987850BE093175 
8FB54337 

115541_1 AA933717 BF061897 AW628327 AA641788 AM0O495 

51321^1 AVV973352BF222929AW016853BF059130AI651829BE551767AA558414AI339359BF059601AI9611K^^ 

AA548736AA768578A1539081 AW025957 AA736837 M79575AW594357AA480892 
810943J T23514At6S5785 

6750J AL133561 AL117481 AL1 22069 AW439292AI968826AU)41090 



434414 35978 1 



427298 
427521 

407347 
430439 

TABLE 57C 



Pkey: Unique nuntfnr conesponding to an Eos probeset 
Ref: Sequence source. The 7 dlgrt numbers in this cotumn are Genbank UenQlier (GQ 
sequence oC human chromosome 22,' Dunham L et al., Nature (1999) 402:489^5. 
Strand: Indicates DMA strand from which exons were predicted. 
NLposHion: Indicates nucleotide posfScra of pre^ded axons. 



numbers. "Dunham I. el aL' leteis to the publicalion enGfled "The DNA 



Pkey 


Ref 


Strand 


NLpositton 


402199 


8576116 


Minus 


84187^744 


402680 


8113438 


Plus 


137634-137768.139702-139693,140475-14059 


402260 


3399665 


f4!nus 


113765.113910.115653.115765.116808-11694 


402678 


8113438 


Plus 


37395-37514.37866^7981 


403171 


9838164 


Mnus 


74502-74703 


406137 


9166422 


KAnus 


30487-31058 


401704 


3097841 


Rus 


24712-25374 


402677 


8113438 


RU5 


22135-22309.23063-23238 


402679 


8113438 


Rus 


132079-132216 


402145 


8018280 


Rus 


113086.114800 


406547 


7711513 


A^nus 


172780-17435B 


402398 


4092817 


Mnus 


24019-24973 


405484 


5922025 


Rus 


199214-199579.199672-199920,200262-20049 


401083 


3242744 


Rus 


33192-33360 


404976 


3419864 


Mhws 


139625-140632 



Pkey: 
ExAocn: 
UnKSenelO: 
Pred.ProlDofnalns: 

UnlGeneTUte: 
R1 



TABLE S6A: 434 genes upregulated In bladder cancer relative to nom^ body tissues 

J^iSSS^^S^^S^^"^^^ Geneexpresslondataforeachprobesetobtainedfromthisan^ls 
^i^^!^J!^^.^^ (A!) a nomiafeed >^refiedlng the letafive level of mRNA expression. The protein products of these genes often contain one or more domains 
IndteaGve of have oncogenic luitction or of transducing intracellular sr^ais. or of being modulatable by smaU molecules, peptides, or antibodies (e.g. pkinase. death^lomain 7tm 
phosphatase, or kMUtransporter). Cerlain predicted protein dom^ are noted. • h-h . « •« aeauKjomarn, run. 

Unique Eos probeset identiiier number 
Exemplar accession number, GenBank acoesskm nunriier 
UniGene number 

Cert^ predicted protrin domabe. AbbreviaBons used: TM. transmembrane domain; SB. sigrtd sequence; =Y, very Skely b oontein; =M. likely to contain; other 
prDtelndomainabbfBvlatoaiBfromR=AM(NucldcAckbRe5aarch,2002.30:»^ y 7 .««iiy»wn«jn,«iwr 

UniGene gene title 

9001 percenGte d bladder tumor Als dlvMed by the 50th pensnSle of no^ 
Pkey; ExAccn; UnigenelD; Unigene TtDe; Red.PRrii3Qn)alns; R1 

430630; AW269920; Hs.2621; cystatin A (stefin A); cystatin:TIW=iW;; 35.25 

422282; AF019225; Hs,114309: apolipoprotefn U MolA^ExbB;TM=Y;SS=M; 33.25 

414555; N98569; Hs.76422; phospholipase A2. group IIA {platelels. ; phosiip;TM=M;SS=Y; 31.68 

415192; D17793: Hs.78183; aldo-keto reductase bmily 1. member C3 : aldo_keLred:TM»M;: 31.04 

417771; AA804698; Hs.e2547; retinoic acid receptor responder (tararo; none,none: 28.50 

nH^I^V^^}??^' '''^^ ^ erythroblastic leukemia v: Furln-like,pkinase,Recep_LdomainJ^urWlke.pkInase,RecepJ^domain.PBpM^^ 27.43 
417079; U65590; Hs.81134; Inteileukin 1 receptor antagonist IL1;SS=M; 25.98 .r««vj^u«m«i.rBp™B5BjM^.zf.« 
413859: AW992356; Hs.8364; Homo sapiens pyruvate dehydrogenase Hna; SAM-PMT,nor\e; 25.38 
418818; AA228899; Hs.101307; Homo sapiens HUT11 protein mRNA, partial; UT,none: 25.28 
425397; JO4088; H8.156346; topofsomerase (DNA) II alpha {170kO); ONAjyiaseB.ONA_topoteolV.HATPase c;SS=Kl 23.58 
421733; All 19671; H5.1420; fibroblast growth factor receptor 3 (ach; ig.pklnase;TIW=Y;SS=M; 2U4 
424008; R02740; Hs.137555; putative chemoWne receptor, GTP-blnding; 7lm_1:TM=Y;S$=M; 20.45 
447343; AA2S6641; Hs.236894: ESTs. Highly slmOar to S02392 ^ha-2Hn: none.none; 19.78 
408243; Y00787: Hs.624: interleukin 8; HLH.PASMTM»M:: ia90 
427490; Z95152; Hs.178695: mitogen-acSvated pmlein kinase 13: pkinase;TM=M,-; 18.75 
410687; U24389; Hs.65436; lysyloxWas^ike 1; LysyLoxldase;SS=W; 18.63 
444381; BE387335; H5.283713; ESTs, Weakly slntfar to S64054 hypolheB; CollaQen;TM=M;SS=IVt; 18.60 
452747; BE153855; Hs.61460; Ig superfamlly receptor LNIR; lg,Rhabdjlycop;TM»Y;SS=M; lasS 
415444; BE24729^ te.78452; sohile carrier temiy 20 (phosphate tran; PH04,UM;TM=M;; ia25 

668 
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413132; NhL006823: Hs.75209; proJein kti^ (cAMPntependenl, caayli; PK!;SS=^ 17.73 
427337; Z46223: Hs.176663; Rj (ragmenl of IgG. tow aJ&rily IHb, n lg;TM=Y:SS=M; 1 7.68 
450746; 082673; HS.27BS89; general transcription factor II, i; none,SH3,PX; 17.12 
418945; BE246762; Hs,89499; arachidonate S^ipoxygenase; Dpaxygenase,PlAT:TiyNyt: 16l8B 
i^!' !:?i22?' Hs.100724; peroxisome proitferalive acUvated recep; honnonejB(^zK:4;TM=M;: 16.78 
439941; AI392640; Hs.18272; antino add transporter system A1; A£Ltrans;TM=Y;; 1^75 
431848; BE019924; Hs.271580: woplakin IB; lr8nsmembrane4;TM=Y;SS=M; 16,55 

AM ?lSSc^ amyloWosIs); ly8.1g,FAD_SynllMdMdh.C.pWnase;SS=M; 16,43 

414883; AA926950; ; COC28 protein Kinase 1; CKSs 1&20 

438091 : Am73062; ; nuclear receptor subfamily 1. group I, m; homionBjBC*C4.none; 15.80 
12?!?! AW247529; H5.6793; platelet^tivating factor acetylhydrota; PAF-AHJb.Upase GOSUTM=M;: 15.70 
428450: NM.014791; Hs.184339; KlAA0175flene prodwt; KA1.pWn8se;TM=M;; 15M 

* (Orosophila) homotog 6; F2,Frtaled.7tnL2:TM=Y;SS=J^ 15,45 
1*; i3^NM_004445;Hs.3796:EphB6;EPHJbdMpttwse^ 

™^ EGF,teclln_c^ushi:TW=M:SS=M; 15.28 

Jt^J: ^mSL^'VoS?^ pot^hfln Int^atefemall conductance; Ca^fflO.SK.ch3nnel.iotl.tr8ns:TM»Y;SS«M: 15.10 

416305; AU076628; Hs.79187; coxsadde virus and adenovims receptor. lg;TM=Y;SS=M' 14 90 

432306; Y18207: H5.303090; protein phosphatase 1, regutetory {inhib; C8M 21:TM=M:: 14.80 

413076; U10564: H8.75188: weol (S. pombe) homotog; pWnase;TM=|yi;; 14.73 
/U 429345; R11141; Hs.199695: hypolhetJca! pcotein; K_telra.SAM; 14.58 

BE613348; Hs.211579: melanoma cefl adhesion molecule; tg,bodh,RibosomaLL6,Rbox:TM=Y:SS=M: 14.55 

421508; NM_004833;Hs.105115; absent in melanoma 2: PAAD tWPIN.HIN;TM=l!t14 

446005; NM.004403; Hs. 13530; deafness, autosoma! dominants; none.THA=M;SS=IWI; 14.35 

429556; AW139399: Hs.98988; ESTs; RonB:™=M;; 14.18 
Z3 416224; MM.002902; Hs.79088; retlculocalbin 2. EF-hand calcium bindin; e1hand:SS=IW; 14.13 

429673; AA884407; Hs.211595; protein tyrosine phosphatase, non^ecept; Y j>h06phata5B.Band 41.PDZ:SS»iy|- 1190 

426657; NM_015865; Hs.171731; solUtecanler family 14 {ure^ transport UtItX^-IsS 

428157; AI738719: Hs.198427; hexokinase 2; hexokinase.hexoldnase2.non©; 13.80 

n liJS^rSS^^u''^,^^?^^'^^'' P.'n3.WljBcepUldI.recepUgranuriri.BNf^TI^Y;SS=M; 13.78 

JU 439453; BE264974;Hs.6566;lhyrDid hormone receptor Interactor 13; AAA^^C tran.CoaE:TM=iy4:- 1338 
432314;AA533447: Hs.312g89; ESTs; Xlink.none; 13.25 -"^.uoaciM ivi., ij.jb 

413109; AW389845; Hs.110855: ESTs; PH04,none; 13.15 
424490; AJ278016; Hs.55565; ankyrin repeat domain 3; ank,pklnase;TIVt=M;; 13.13 
^ _ 426490; Nft4-0O1621; Hs.l70087: ary! hydrocarbon receptor; PAC,PAS;TKN\/I;; 1Z93 

425177; AF127577; Hs.155017; nuclear receptor InteracQng protein 1; none;SS=M; 1Z38 
404942; U30825; ; spRclng factor, arginineteerine^ich 9; C036;TM=Y;SS=M; 12.03 

40 SSl^^u^iS^^ EGFJNFR_c6.gr8nuIln.CUB,Keraen_B2.TIUTM=M:SS-M: 11,93 

4U 443195:BE148235:Hs.193063:HomosapienscOI^FU14201fls.ckme|yrr;Aa.lran5.none;11^ 

41218JAA205588:Hs.1551*60; Splicing factor, aiglnlnefc^ reczK4,sug8r tn;11.85 

. 433470; AW960564; ; transmembrane 4 superfamlly member 1 ; none;TM=Y;SS=M; 11 .80 
^ J 425118; AU07661 1; Hs.154672; methylene letrahydrofbiate dehydrogenase; myb DNA- 

bindlii9,THFJ}HG.CYH,THFJ)HG_CYK_C,CAP„GLYW.L0N.PBpMdie C9,b2lP,M,xan ur_permease.H003 cotransD'THfeM-- 11 69 
418870; AF147204: Hs.89414: chemokine (OX-C motif), receptor 4 (fus; VtnTl Jlm3;™=Y:Srn:M ^'^ 
426761; AI015709; Hs.172089; Homo sapiens mRNA; cONA DKFZp586l2022 (f; none;TM=Y;SS=M; 11.48 

SO a1:S?S5J t?*?«?^' '^"^ '"^ ll«H.C.IMPDitN.C8SJntegrfiL3.RlcliL$jBdin; 11.38 

DU 446742; AA232119;H3.16085:putalhreG-prot^coupted receptor; none;TM=Y;SS=^^ <j».ni«vj»jwun, 11.00 

436729; BE621807; ; transmembrane 4 superfamlly member 1; none;TM=Y:S8=Mt 11 18 

40996ft BE261944: Hs.339673; hexoWnase 1; none.none; 11.02 

426539; AB01 1 155; Hs.170290; discs, large (OrosophPa) homotog 5; SH3,PD2.Guanyble Jdii:TM=M:- ia78 
417821; BE245149; Hs.82643; protein tyrosine kinase 9; cofiliii!;S)F;SS=M; ^^^|^-^ 
DO 427654; AA410183: Hs.137475; ESTs; tofLtrans.m.lGFBP,tspJ; 10.58 

418004; U37519; Hs.87539; aWehyde dehydrogenase 3 farvStf, member ; aJdedh;TM=M;SS=M; 10.53 
ilSS' ^ 8^^^ nucleotide exchange factor SH3.PHRho6EF;TM=M;; ia53 

449437; AI702038: Hs.100057; Homo sapiens cDNA: FU22902 lis. done K; non&none: 10.52 
436856; AM69355; Hs.127310; ESTs; pkinase.rnn;TM=M;: 10.48 
OV 451035; AU0767e5: Hs.430; plasfln 1 (1 Isofomi); efhand.CH^^lliLN;SS=M; 10.38 

S^,??!^ Hb.83988; Inlegrin, beta 2 (antigen C018 (p95). ly; lnlBgrin_B.EGF,PSI;TM=Y;SS=l«; 10,35 
SIS? ?i!^'.!J*;?^l??^= 2 (C049B. alpha 2subunl; vwa.lntegriiLA.FG.GAP;TM=Y;SS=M; 10.34 

' '^8tfca! protein FU2051ft AAA.NB^C,PAAD^OAPIN;NA;NA; 10.25 

Hs.97101; putative G proteln^xxipled recepton M8lhyltransf_5;TM»Y:SS=M; 10.18 
437852; BE001836; Hs.256897; ESTs. Weakly similar to dJ365012.1 [Rsa; GPS.7tm 2;TM=Y;; iai3 

'^A^^u^'^^-.^^'"^!'^'' EGF.fn3,ldUwepLa.WUece^Lb.gr3nulin,BNRTM»Y;SS»li* ia08 
SS? AA262294; Hs.180383; dual specHicity phosphatase 6; Rhodanese,OSPt;TM=Mu 10.05 

400496; ;;ENSP00000224716'."GTP4)lndingproletoSAR;none:TNNY,-; 10.01 
413899: AF083892: Hs.75608; tight Junctfon protein 2 (zona occludens; SH3.PDZ.GuanylaleJdn:TM=lift: laOO 
404568; ; ; NM.02207r:Homo sapiens hypothetical pro; SH2;TM=iA-; 10.00 
444823; BE262989; Hs,12045; putatNe pcotein; Mra1,MB0AT;TM=M;SSaY; a93 
421425: AK001564: H5.104222; tqrpolheSca! protein FU10702; efhand.kazalMfas.7lnu1:TNNM:; 9J0 
- - 413441; AI929374: Hs.75367: Sre-nkMdapter; SH2.SH3;TMsM,-; 9.90 

f NM.^546; Hs. 1846; tumor protein p53 (U-Fraumeni syndrome); P53,WD40JRK;TM=i^; 9.88 
452289; BE568205; Hs.28827; mftogeivactivated protein kinase kinase ; pWnase;TM=M;; 9.85 
439223; AW238299: Hs.250618; U116 binding protein 2; kfljBcepUPKD,MHCJ;TM=M;SS=Y: 9.83 

!?1.00284ft HS.19828B; prolato tyrosine phosphatase, receptor I; Yj»hosphatase;Tlyl=Y;SS=M; 9.80 
Sin V^^^* NM.004419; Hs.2128; dual spedfidty phosphatase 5; Rhodanese,DSPo,Yj»hosphaiase;TM=M- 9.73 
OU 452239; AW379378; Hs.170121; protdn tyrosine phosphatase, receptor t none.nofl8; a73 

418322; AA284165; Hs.84113; cydirnJependent kinase Inhibitor 3 (CDK; Y ohosphatasfcOSPteTMal*: 8.72 
403912;;;C5000394^gi|127372B0iretp«^00668Z2Ik;none:TM=^^ 

427315: AA17994ft Hs.175563: Homo sapiens mRNA; cONA DKFZp664N0763 (f; non8,8pectrin.SH3.PaCH: 9.70 
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428428; AU)37544: Hs.1B4298; cycOrHtependenl Unasa 7 (homotog of Xe; pkinase:TiyNyi= 9.68 

^""^ Hs.170121: protebi tyrosine phosphalase. receptor t Wnesin.fn3,Y jtepha!as8;TiyNiyfc a63 
429083: Y09397; Hs.227817; BCU-robted protein Al; Bcl-2:TM=M.- 9.63 

- 4^2: U63630; Hs.155637; protein Unase. DNA«tivatBd, catalylic; H3.Pt4 Jdnase.FAT.FATC:TM=M;; 9.55 
^ 1™* BE550182; H5.127826; RafGEMke protein 3. mouse homotog: fh3.RA,RasGEF;TM=A4;SS=M; 9.50 

42852ft AA331901: H8,t84736: hypdheOcal protein FU10097: none;TM=M;; 9.50 
418969: W33191: Hs^907: hypothetical protein FU20258; SH3;TI«=M;; 9.50 

^®^Ca«.PHWmTFU^^ 
1 0 414911: NmIoOOIOT: Hs.77502: damage^pedfic ONA binding protein 2 (4; WD40,homeobox.UM:TWNte 9.48 

IS™' A1557212: Hs.17132; ESTs. Moderately similar to 154374 gene ; pl(in3S8.DAGLre-bind^kinase COPRjwme: 9.45 
402328; :: Target Exon; pWnas8:TM=M,- 9.44 

443710: AI928136; Hs.9691; Homo sapiens cONA: FU23249 fis. done C; G^pha^none; 9.42 

- 414987: AA524394; Hs.294022; hypothetica) protein FU14950: SH2:TM=M;; 9.42 
J?g^^^|^91ftTO^ 

440675; AW005054; Hs.47883; ESTs/wfealUy similar to KCC1_HUI^ CALCfc pl(!na5e.nQne; 9.35 
433376; AI249361; Hs.74122: caspase 4. apoplosis-felaled cysteine pr, CARD.ICEj)10.ICEi)20:8S=M: 9.28 
410668; BE379794; Hs.65403; hypolheecal protein; dealh,TNFK.c5;TM=Y;SS=M; 9.25 
ZU 430024; AI808780: Hs.227730; Integrin, alpha 6; lntegrfnJ^FG-GAP;TM=Y;SS=M: 9.23 
452698; AI826645; H5,21 1534: ESTs; ArfGap.PHM.GuanybtBjIn,PDZ.SH3: 9.13 
434263: N34895; Hs.44648. ESTs; lg.none: 9.13 

407949: W21874: Hs.247057: ESTs. Weakly similar to 2109260A B ceB ; Rib08omaLS14,ank,pWn3se.dealKnonB: 9 10 

429332; AF030403;Hs.199263;Slo-20 related Wnase;pkinase.metallhlo;TM=M;SS=^ 
ZD 417426; NM_002291; Hs.82124: laroinin. bela 1; laminin_EGFJamininJiterai.integrin B;SS=M: 9.08 

443951; F13272: Hs.111334;ferriSn, Ightpolypeplida; PMP2SLaawlln.Rone: 9.07 " 

414368: W70171; Ks.75939: uridine monophosphate Mnase; PRKCbaE;; a98 

451292; AB037716: Hs.26204: KIAA1295 protein; SH3;TM=M;: 8.93 

438000; AI825880; H3.5985; non-Wnase Cdc42 effector protein SPEC2; none;TM=M;; 8.90 
JU 446520; AA128808; Hs.179902; (ransporter-like protein; none;TI\/l=Y;SS=M; 8.90 

436075; BE090176: Hs.179902: Iransporter-lilte protein; none;TM=Y;SS==M; 8.88 

437056: AI147061; ; gb»k33a11.5l SoanBS_N5F„F8_9W^OT.PA^P^S; none,spectm.SH3.PH,CH; 8.78 

445496; AB007860; Hs.12802; devetepmenl and dilferentiallcn enhandn; SH3^lcPHAfl3ap:TH#5M= 8.78 

418203:X54942:Hs.83758;CDC28proteinldnase2;CKS;:8.75 
J:> 434608; AA805443; Hs.179909: hypothetical protein FU22995: none;TM=)W;: a70 

445033; AV652402; Hs.72901; mudn U epieieOet fransraembrane: ank;; 8.68 

417640; D30857; H5.82353; protein C receptor, endothelial (EPCR); none;TM=M;SS=M; 165 

432841; M93425; Hs.62; protein tyrosine phosphatase, non-recept Y_phosphaiie;SS=M: 8.65 
, ^ 430397; AI924533: fe.105607; bicarbonate transporter related protein ; HC03 cotr3nsKTM=Y;- 8.64 

448888; AW196663; Hs.200242; caspase reciultment domain protein 6; CARD:TM=M;; a60 

442994; AI026718; H5.15954; ESTs; enk.pWn8se,dealh,Ribosomal S14; 8.60 

429109: AU)08637; Hs.196352; neutropKI cytosofc factor 4 (40kD); SH3,0PR,PX:TM=M:; 8 60 

455439; AW945484; Hs.184252; ESTs. WeaJdy similar to ALli8_HUIVIAN ALU S; none,7toLl: 8.55 

437763; AA469369; Hs.5831 ; (issue Inhilntor of metalloprotelnase 1 ; TIMP,pldnase.DAG PE-blnd,R8D; B 43 
*fO 417035; AA192455;Hs.22968; Homo sapiens clone lMAGE:451939,mRNAse:none.none; 8.40 

418478; U38945; Hs.1174; cyclinHJependenI kir\ase inhibitor 2A (me; ank;; 8.39 

448209; AW160489: Hs.20709; tetraspan 5; fransmembrans4:TM=Y;SS=M: a33 

450139; AK001838; Hs.296323: serum^glucocoriicold regulated kinase; none,none; a33 

445350: AF0521 12; K$.12540; lysophosphonpase 1; abhydrobse.2:TM»Mn 8.31 
D U 427509; M62505; H8.2161; complement component 5 receptor 1 (C5a I; 7lm 1 ;TM=Y:SS=H/I; a30 

445633; AI453386; Hs.17287; ESTs, Weakly similar to S26689 hypotheti; IRK,none; a28 

446719; W39500; Hs.301872; hypothetteal protein MGC4840; AAA.SXl:Tiyi=M;: a23 

424762; AL119442: Hs.183684; eukaryoSc translafionUtlaBon facton non8,none; a20 

- 427274; NH^C05211; Hs.174142:cotonystiinulallnglartor1 receptor. fo:1g,pklna5e:TiyNY:SS=l^ a IB 
33 416094; AW995512; Hs.225977; nudear receptor coadivator 3; none,none: ai5 

419262; AA834664; Hs.29131; nudear receptor coadivator 2; PAS,2f.C2H2,SET; ai5 

417386; AL037228; Hs.82043; D123gane produd; NU0IX.8ecY,E1_dehydrog,lransketjynTM=Y:SS=M: ai3 

431236; AV656840; Hs.285116: Interieukto 13 roceptor. sipha 1; lh3:TM»Y;^ aiO 

439658; AA332057: Hs.e639: hyptrihetieal protein M6C15440: none;TM=|yi;SS°IVI; 8.09 
OU 425424; NWL004954; Hs.157199; EUa motif kinase; pWnase,UBA,KA1;TM=:M;; 6.08 

422573; AW297985; Hs.295726: integrin, alpha V (vitronecfin receptor; FG.GAP,inteflrfn /Lnone: a05 

408912; ABOl 1084; Hs.48924; KIAA0512 gene produd: ALEX2; Amiadilto seg:TM=M;sSM; a03 

447674; BE270&40; K5.19192; cydiiHJependent Uhase % pUnase;SS=M; 8.03 
^ ^ 404891; ; ; Taigel Exon; none.none: 7.95 

03 414278; AA3301 16; Hs.77273: Human glucose transporter pseudogene; none,none: 7.93 
452888; AW955454; Hs.30942; ephrin-B2: Ephrin.fn2;TM=«Y;SS=M; 7.93 
433211: H11850: Hs.12808; MAfW; pklnase,UBA.KA1:SS=l^ 7,91 
438485; WS7578; H5237955; RAB7, member RAS oncogene family; pkinas8.ABC1,none; 7.90 

70 ilSl? JSSJ }^'VJF'' ^ ^ ^^^^ ^ HLH,dealh.TNFR.c6,AcyWoA.hydro: 7.90 

/U 417381;AF164142:Hs.82042;sdutecarTterfamily23(nucIeobasetra:xan urj«miease,RA;7M ' ^ 

^^^.^JSi^^^^^' ^P*^ (c-eb^>2rFurirv4ike.pWnase,Recep.ljJomaln.YIJ>:TWNY;S^^ 7 J6 

405036; ; ; N^(L021628•:Homo sapiens arachidonate Bp; lipoxygenase,complax1 49Kd,PUT;TM=M;; 7.83 

- - 431912; AI660552: H5.76549; ESTs, Weakiy similar to A56i54 AW subst noneAJyW&A^dhAM-CoA^dhjM^Scvl-^ dh N* 7 80 
75 432981; NM_002733; Hs.3136; protein Wnase. AMP-ecSvated, gamma I n; (»C2SBn^^ 

i?^? ^ITi^' [?b:RC1.BT0606^500^15^04 BT0606 Homo; Sec7,PH,ANFjeceptorJlg.dian.WO40.IRK: 7.78 
446636; AC002563: Hs.l5767: dlron (rtK>Werading. serinaflhreonin; CNH,DAG^PE-blnd,PKlnvolucrin,l\4;TM=M;; 7.78 
431 183; NVL006855; Hs.250S96; KDEL (Lys^Asp^u^) endoplasmic relic; EiUumen jecept;TM=M;SS=M: 7.78 

V. ^!Jk^lS?;!?°^ softnm^Blated: EQ^.fn3JdUecepLaJdljBCepLhflram]lin.BNR;TM=Y;SS=M: 7.73 
40914i AL136877: Hs.50758: SMC4 (structural maintenance of chramoso; ABC_lraigii.SMC.N.SMC CDUF164.non8; 7 73 

^^-^5355; ublquilinK»njugafing enzyme E2N (bomoto: U(JjDon.Yjiho8phalase.SH2: 7.70 
403212; ; ; NM.019595:Homo saptefs Inteisectin 2 (TT; SH3,Blhand,C2.PaRho6EF;TI*4(l:; 7.70 
441190; H09073; K125046; ESTa; E1^TP8se.CaBoiUVTPaso.QCaItofUVTPasaLXHydrolase^ 7.68 
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JSSSS^i'.^^ L^^^^ ChnDmosome 7 Feld Brain cONA; 7lnLl.z«XmC4MSPRrj0^.zW2H2jw.zW b«:TM»Y-SS=M: 7ii8 
42W4ft M15990:Hs.194148: v-yes-l Yemaguch) sarcoma viral onoogene; SHZSHSjitfSsaSsSZyK^ ^^^^^^^^ 
447898; AW9B9638: Hs.tl2318: 6.2 kd protein: nonB.non8: 7.65 war«jwnase:K>=M. 7.65 

425289: AW139342; Hs.1 55530: Intefferon, gamma^nduciWe protein 16: PAAD QAPIN.HIN-SSdLi- 7 fei 
401927: ; ; C17000914-:gil8394367IrBllNP^05854aiI s: none:V7.60 

42148^ Af922821: Hs.32433: ESTs: non8Wn.OXPI-PLC-Y,C2; 7.53 

430486; BE062109; Hs.241551; chloride channel, caldum activated, fern; none:TM=Y-SS=M- 7.53 

444252; R21 135; Hs.54985: ESTs; none.non8; 7.47 

• Hs-77613; ataxia telangiectasia and Rad3 ralalEd; FAT,FATC.PI3 J14J*asB:TM=M:- 7 47 

i!S5^ "'l^ cluomosoma 20 open readhg Ihrna 1; r»^^u, {t^-^'*^' 

'^^^ L^iJ^? ' NCI.CGAP.M Homosaitas ; vwa,li.l8flrtn J^.FfrGW.nofle: 7.45 
43^60; AA333063: H8.279S98: Homo sapiens cDNA: FLJ23165 Us, done 1.; MneSArNA: 7 43 

4^ ^ P°^^ ^ SPX.EXS:TM=Yn 7.38 

IS aaS; 2*™ S SfS'^^'^^e 4 Bupertamily member 6; tr8nsmembrane4:TM=Y;SS=M; 7.32 

ij2?T ^ISSS H^Sli.^r'^ honnonereoeplor.assocl3ted prot; none;TM=Ms7.23 

4528 \ 7; AA322859: Hs.284275: Homo sapiens PAK2 mRNA, convtete cds; pUnase PBD-TM=Mr 7 19 

407^:NM.000910:Hs.37125;neuropeplideYrec8ptofYS^^ 

"P"^^ (Drosophila) homolog 4; SH2,SH3:TM=M;SS=M: 7.18 
SH3 domain binding protein 1; SHS-TIM^M;; 7.18 
403344; ; ; NM_000341:Homo sapiens solute carrier fa; aIpha-ain¥tese:TM=Yr 7.1S 

i^!? iV^^o.^u ^' sotute carrier family 2 (fadDlated gtu; sugarJnTM=Y;8S«M: 7.14 
404875; ; ; NM.022819*:Homo sapiens phosphoflpase A2; phoslip:SS4ft 7.11 
433618; AA602539: Hs.345494; ESTs; G^haAjdramSffi^ 

inSil* M PJfii^?.^'^^*^^^ P^P"*'^ "^^P^o^ 1: 7tm^2,HRM,CSD;TM=Y;SS=A4: 7.03 
SJ?;!: TSSSli "^21^^* ?°'««som8 (prosome. macropain) subunit; pfoteasome:TM=M:: ^93 

S2ir;'^:iKlf^^^^ 

iS^feS uS^? tSi^^eeg; ESTs; pWnase*.bZlPAnnadillo.5eg.none: 6.88 

i^S S«?u V2^;te'*°'^P^ A2. group IVA (cylosollc. ; C2.PU^B:TH/}=M;; 6.85 

^nftifl: JifiS^?^?if^P^^ ®' aPoPtosls^lated cysteine pr ICEj)1 0.ICE j)20.DED;TM.li;; 6.80 
iS15?;i^i?.^5?°^^°2*:CDNAFU11591fc.^ ^ .«uJ'«"n?;o.«i 

22^11: ^°!i?ii,SiJl??^^ * 0'Bxln.SH2.STAT.STAT.b!nd.STATj)rollon_6w5PAW»AS,none: 6.78 

MfliliiWBlaled; EGF,fn34dUec8pLa,ldLwDepLbianS5^^ 6 73 

404440; ; ; NNL02l048:Homo sapiens melanoma anllgen,; MAGE;TM=M.-: 6.73 

P^^^ dehydrogenase kina; Eb.SAiyLPNT:TM=M:; 6.70 

408056; AA312329: Hs.42331;ephrinW4; Ei»hrin;TM=M;SS=M: 6.61 

iJiSS* Sf eukaryotfc translation elongation factor; totLtrans.lQ:TM=Yn 6.60 

^^h^^^^' ^^^^^ ^"^^ FU1 3357 lis. ctone PL; non8.none: 6.60 
woa^, ; ; Nft4_020975*:Homo sapiens ret prot&oncooe; cadh8rln.Dklnase-TMaY'SS=M: 6.fia 
418562: l»0B69: Hs.124831; CGI^ proteb; i^o^i^^ '^'^ 
jS!?? • flb:DKFZp547F15aj1 547 (synonym: Wbrl) ; PKneCnone; 6.55 

VS^^^^i H«™> SPfe™ cDNA DKFZP547C136 (fr; ABC.Iran.GTP.EFTU>VBCjnembranfl.none- 6 54 

S iJ^?^' S^o''^i°^ pr^DKFZp434N185: F5,F8.lype.(;pklnase.Ets.F5.FB.|ype.C.pkJnase.Ets; 6.47 
Ras-GTPaseacevaDng protein SHSdom^; mn.ffTF2:TM=M:: 6.46 

441712: AW391927; Hs.7946: KIAA1288 protein; AIP3:TM=M:: 6.44 

Soo?? acyM>)enzyme AoxWase 1. paimitoyt; pkln8se.7tnLl; 6,43 

?Fn^i"°^."'-^^^' hypothetical protein FU10986: FGGY.QTM^M;; 6.43 
1^? ^S?' S^SS »'W»»«fcal proteh FU10803: none;TM=M:SS4t &40 
438941; AF075047: Hs.31884: ESTs; C8j*anneLB.SH3,arf,none: 6.40 
^'^^'^ ^STs: pklnasejW2H2,KRAB.non8; 6.33 

^^527 protein; MHCK.EFUinase:TM=lW:SS=M: 6.33 
427801; AW979155; Hs.29827S: amino add transporter 2; A«.trBns;TM»M;5S=iyi; a32 
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407970; AW403814; Ks.41714: BCL2-associated alhanogene: uUquitin,6AG,Tiopomyo5in:: 6.30 
438464: AA669735; Hs,324743; protein phosphatase 4 regulatay sutwnil; nona^na; a30 
453977; AA8860Q6: Hs.250427: ESTs; pldn8S8.P2)Oeceptor.E1.E^^TPase.HydiDlase; 6^ 
412491; W31589; Hs.n957; RAB5A. meniberfV^S oncogene (airilr.fas.arf,PP2C;TiyNiyt^ 6.30 
0 413235;BE243445;Hs.75248;topofeomerBse(ONA)«beta(180l(p):D^^ c.DNAjyraseB.DNA-topolsoIV,HATPase c:6.29 

404342; : ; CT002192*:gip299207|gb|AAF54404.1| {AEO: non8;TM=M;; S.27 r - 

409274: Nft^.003930: Hs.52644; SKAP55 homotogue; SH3,PH;SS=M: a25 

419693; AA133749: Hs.301350; FXYD domabHsntaining Ion Ifansport reg; ATP1G1 PLM,MAT8;TM=Y;SS»M: 6.23 
405429; ;;TaigelE»njYj)hosphalase,non8;a23 
1 U 404975; AL042279; ; uncharacterlzed hypothalamus protein HTO: lcringl8;TM=Y:SS=M: 6.20 
452929; AW954938; Hs.172816: neureguGn 1; N8uregtinn.EGFjg,NeiBgijnn,EGFJg: &18 
446883; AW452756; Hs.15364; hypothetical protein FU10955; DEAD,heiic8se.C»fnn.»to.Cys.taiol.71Uwa.vvw^ 

lwta,DUF139,TPaDSPc.tspJ,RibosoniaI.S21.fvp:TM=M:; 6.18 
433233; AB040927; Hs.301B04: KIAA1494 pratetn; SH3.2M:3HC4;Tiyt=M;; 6.15 
1 D 458791; BE615453; Hs.346509;dedicalorof cyto-Wnesis 1; non8;TW3Y;; 6.15 

421836; AF109219: HS.10B787; phosph^dyHnosito) giycan, cJass N; none.non©; 6.14 
447727; A1421079; ; tumor necrosis factor receptor superfami; ncne,synaptobfBvin; 6.13 
401536; : : NM-002530*:Homo sapiens neurotrophic tyr. lg,pkin8se,tJlR.LRRNT.tJ?Ra;TW=R*SS=IW; 6.11 

flll^S?' ^'^^'^ ESTs. Wealdy similar lo PU^U plasmin [H.; PANJ(ringl8.ljypsin,PW>LC.X.C2.SH2.PH.SH3.PI-PLC.Ym 6.10 
427557; NM-002659: Hs.179657; plasminogen acthrator. urokinase recepto; UPAI^LY5.ET,PlA2fnh;SS=IA 6.08 
450447; AF212223; Hs.25010: hypothetical protein P15-2; NTF2;TM=l\/i;; 6.08 
450107; AI873287; Hs.257812: ESTs; ICE p20,DED;TlVl=M;; 6.05 
418175; AW967054; Hs,20631 2; ESTs. WeflHy similar to 138022 hypotheti; rf.C2H2.BTB.I^tetra,Syntaxin,none: 6.05 
- 408983; NM_000492; Hs.663; cystic fibrosis transmembrane conductanc; ABC^tran ABCjneml)rane.PRK,Bac e)tporL3;TW=Y;; 6 05 
ZD 439738; BE246502; Hs.9598; sema donwln. Immimcglobuiin domdn (jg),; Sema,PSI,inlegrin BJU^fz 6,03 
427625: AF008216: Hs.2850t3; putalhre human HLA class II associaled p; none.non8: 6.03 " 
409430; 1^1945; Hs.346735: splicing factor, arglnine/^erine-rich 5; DSPo,Rhodanese.none; 6.03 
446550; AB016625; H8.15813: sohite earner family 22 (organic cation; 8tjgarJnTMaY;SS=M: 6.03 
405102; ; ; C15001220*:gi|4469558lgb|AAD2131 1.1| {AF; DAG„PE-bInd.PH.RhoGEF.DC1;SS=l\4 6.03 
JU 400121; ; ; Eos Cbntiot; SH3.PH,RhoGEF.Terpene,synlh;TIW=M;; 6.03 

415327; H22769; ; gbTn54c02.r1 Scares infant brain 1N1B H; SH3PDZ.GuanytateJdn;SSNUI; 6.03 
404148; ; ; NM_002944':Homo sapiens v^s avian UR2 ; fn3,pldnase.DUF139;TMaY:SS=M; 6.03 
405531; ; ; Target Exon; PDZ,CARD,Guanylate_ldn;TM=M;; 6.00 
433363; AA584829; Hs.275l63; non-metastatic cells 2, protein (NM23B) ; NDK,nonB: ROO 
J!) 427270; H47921: Hs.174139; chloride channel 3; voltaQ©.aCC8S.none; 5.99 
423774; 139064; Hs.1702; Interieuldn 9 recepton noneiTM^SS^M; 5.98 
424124; AA335609; Hs.7589; ESTs, WeaMy similar to A46010 XMed ; pkInase.TBC; S.98 
41 1040; AF007393: Hs,177574; protein-Wnase. Interferon-inducible ddir. HLHjTI**!;: 5.95 
423422; AC005175; Hs.128425; NY-REN-24 antigen; SuWbtransf&rJtnLl^e; 5.88 
40 453902; BE502341: H8.3402; ESTs; none.none; 5.88 

409636; AA305729: Hs.1B272; amino add transporter system A1; Aa.trans;TM=Y;: 5,86 
436154; AA764950; Hs.119898; ESTs: efh8nd.OAG_PE-Wnd,0AGlCa.PHD.DAGKc.PSI,none; 5.85 
455358; AW902641; ; gb:QV3.NN1024-100500-181-d08 NN1024 Homo; 8ulfata5e.Somatom6din 6«Ph08phodie5i.none: 5.83 
414108; A1267592: Hs.75751: SFRS protein kinase 1; anlcPH.Oxyslerot BPiJWnase;TM»M;: 5.82 
45 452547; AA335295: H8.74120; adipose specific 2; LEA:TM=M;; 5.82 

415204; T27434: : gb:hbc2294 Hunnan pancreatic Islet Homo s; Na_Ca Ex.Cabc-beta,none; 5.80 
435563; AF210317: Hs.95497; sohite canrfer family 2 {faclTitated gtu; sugar tnTM=Y;; 5.78 
456097; C15702; Hs.288028; ESTs, Moderately similar to 154374 gene ; dsmFKBP; 5.78 

422445; M231 14; H3.1526; ATPase, Ca transporting, cardiac muscle,; E1.E^TPase,Cation_ATPasB CCationATPase N.Hydrola8e*TM=Y- 5v77 
50 436246; AW450963;Hs.119991; ESTs: none,DNAjyraseB,DNA.topolsolWPa^^ " 

422953; AA488860; Hs.24S043; hypothetical protein FLJ14297; ABC tran,PRK;TM=Y:SS=IWt 5.75 * 

425854; AA749190: ; ESTs; RhoGAP,SH2,pkinase,POLO box.none;~6,74 

424160; T74062; ; gbv^SlfOl.rl Soaies infant brain 1NIB H; ROMone; 6.70 

417389; BE260964: Hs.82045; mWkine (neurite growth-promoling factw; PTN.MK;TM=M;SS8Y; 5.69 
J J 437613; R19892; Hs.10267: MIL1 protein; none,none; 5.68 

410820; BE391493; Hs.16475; Human DNA sequence from clone RP5^2M4 ; TBC;SS=M: 5.68 

423393; R37772; Hs.21420; p21-actlvated protein kinase 6; pkinase.PBO;TI\/NM:; 5.66 

426500; NM-014638; Hs.170156; KIAA0450 gene product; C2,PM'LC-Y;TMsM;; &63 

406930; U04691; ; gb:Human olfactory receptor (OR17-219) g; none:TM=Y:SS=Mt 5.60 
OU 401044; ; ; Target Exon; none.lC^20JCEj)10,CARD,PeptidaseJ41; 5.55 

428479; Y00272; Hs.334562; cell division cycle 2. G1 to S and G2 to; pkinase.lCEj>10JCEj>2(]tTI*=l«:SS=M: 5.55 

421970;AF227156;Hs.110103;RNApolymeraseltranscriptionractorRR; 

aaj)errneases,pyridoxaljdeC,bromodcmain,PHD,MBD^TJiook.[)DT.H3J»l4Jdnase,FAT.^^^ 
^ 426248; T1898a;Hs,293668; ESTs; pklnaso.none; 5.50 

0:> 418426; NM_003804; Hs.296327; receptor (TNFRSFHnteracfing serin&4hn pkinase,death;TI«=Hfc 6,43 

417086; AA194446; ; ESTs. Weakly simiiar to S55024 nebufin, ; ank.death,ZU5,EGFJ(ring!e.trypsIn,NebulIn,UM;SS=M; 5.43 
447437; U07225; Hs.339; purinergic receptor P2Y, Girotein coupl; 7lnLl,SH2;TM=Y;SS=M; 5.40 
41 2247; AF02237S; Hs.73793; vascular endothelial growth facton P06F:SS=M; 5.40 

434938; AW500718; Hs.B1l5: Homo sapiens, cbne MGC:16169, mRNA. com; pkinase,TBC,Rhodanese;TIW=H^;; 5.38 
/U 419355; AA428520; Hs.90061; progesterone binding protein; heme J;TM=Y;SS=M; 5.35 

411188; BEt61168; ; gb:PMO+fr0425-1 701 0^002^10 HT0425 Homo; adenyiateWnasewe; 5.35 

422461; NS4.003417; Hs.1 1 7077; zinc finger protein 264; zf<;2H24<RAB,TRIS;TM=M;; 5.28 

428348; BE466S86; Hs.17433; hypolhe»ca) protein FU20967; none,none; 5.25 
_ _ 429170; NM-.001394; Hs,2359; dual spedlicity phosphatase 4; Rhodanese.DSPcYjjhosphatase,RibosomaL S3 N'B/NM;' 5 23 
/J 429592; AB029041:H5.209646;I<!AA1118 protein; Troponln.Exo_endoj)hos.l(XTM=^^ 

434821;AA159111;Hs784281; Hunan putat^eribosoinalproteS SI mRNA; EfUu^ L11 N;THil»Y;SS=M: 5.21 

434368; AW519020;Hs.73893; dopamine receptor D2;pkinase,SH3,none; 5.15 ♦ " 

405586; : ; Nk^OOOSgg-itenio sapiens ptakopMOn 1 (ec; ArmadBh3LSeg;TM=M;; 5.13 
_^ 408176; AK001S53; Hs.43436; adenylate kinase 3 alpha Hke; adeny1atekinase,none; 5.12 
oU 407443; AF227138; ; gbiHomo sapiens candWate taste receptor; none;ThA=Y;SS=M; 5.11 

416817; AA398045: Hs.104679; ESTs; Furin-fike,pWnase,Recep Ldomaln,fh3,none; 5.10 

401886: : ; NM_021783.Homo sapiens XEDAR pCEDAR), mR; TNFfl.c6JM=M;SS=M: 5.08 

410314; AW8607Q8: Hs.18851: hypothetical protebi FU10875; mybJ)rM4dndingPAKBAHJ)roiiiodomain.Pm.SET;Th^ 5.08 
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401579: AL031447; ; Hoim saptens. dons iMA6E:4053044. mRNA.; Neur chan LBD.Neiir_chan.m8mt),non8: 5.05 

44S873; AA^TO; Hs.2S1946; poiy(AHiinding protein, cytoplasmic 1-1: PABP^pkinase.14^ 5.05 

417529; AA203634; ; glxzx58b09j1 SoaresJel3l.liver_spl8eiu pkinase.UBA.KAl4K}n8; 5.03 

417527; AA203524; ; ob2x56e10j1 Soares.fetaLDvecspieenj SK3;SS=M: 4.98 

407722; BE252241: Hs.38041; pyitioxai (pyridoxine, vilamb) G6} kinas; pfk6;TM=M;: 4.96 

431321; AW136372; HS.1B52; acid phosphatase, prostate; add jihosphat,none; 4.93 

404298; ; : C6001238^:gil1 21715|spIP26697lGTA3^CHICK: none.GST»aGST.N.pWnasa: 4.85 

407603; AW955705; Hs.62604; Homo saptens, done IMAGE:4299322. mRNA.; nonB;TM4*: 4.82 

424099; AF071202; Hs.139336; ATP-bindbig cassette, suMamOy C (CFTR; ABCLIraivABCjiwnhfBn8;TMsYK 4.73 

422366; 183882; Ks.97927: ESTs; pkinase,none; 4.64 

424905; NM.002497; Hs.153704; NIMA (never in mitosis gene oHetAted k; pldnase;TM=M;; 4.48 

426925; 1S6; Hs.315689: Homo sapiens cDNA: RJ22373 lis, done H; Esterase.enol3se.Peplidase_S9:TM=M;; 4.45 

439606; W79123; Hs.58561; G pfoteirvcoupled receptor 87; 7tiTu1;TI^Y;SS=M; 4.40 

400749; ; ; NNL003105*:Homo sapiens softilin^elaled; EGFMIcILrecepLa4dl^recepLbt9ranulin.Blffi:TMaY;SS8M; 4,38 

425721; AC002115; Hs.1S9309; UToplaldn lA; transmeml]Fane4;TM»Y;SS=tlfl; 4.33 

444008; BE39S085: H5.IOO86; type I Irsnsfnembrana protein Fn14: ldUeoepLaPKD,MHCJ;TiyNIM;SS=Y; 4.31 

400751; ; ; NM.003105':Homo sapiens sortiOwelated; EGF,fh3MjecepLa,W_recepLb.9ranulln.BNR;TM=Y;SS=M; 4.18 

408908; BE296227; Hs.250822; serine/threonine kinase 15; pkinase;SSeM; 3.90 

422152; AAg09249; Ks.1 12282; sdute canter fam&y 30 (zinc transport; nonejione; 3.68 

45876ft AI498631; Hs.1 11334; fenitln. light polypepUde; cystalin,feiTitl^histone.HC03^cotrBnsp,SH3,RhoGAP;tan_urj)ennease^ a85 
441218; BE327561: H5.202345; ESTb: none.W040.E1-E^TPase,CationJ<TPase_C.CationJ\TPase^N.Hyd^ 3.78 
419073; AW372170: H5.183918; Homo sapiens cDNA FU12797 lis, done MT; daalh.2U5;SS=M; 3.76 

451385; AA017656; ; flb2e39Ii01.r1 Scares retina N2b4HR Homo ; Atrophin-1.endase;UropWn-1,Yjhosphatese.SH2,fibi!nogen.C.TIMt 3.60 

412604; AW978324; H3.1904; protein kfnase C. bta; pMnase,OAG_PE^nd,pldnase,C,OPR:TNNM:; 130 

409582; R27430; Hs.271S65; ESTs; none.Neur^diffli_l3D,Neur_d>anjnemb; 3,28 

441155; AW161008; Hs.7719; GABA(A) receptor-assodated protein; m>\ LC3:SSTi^- 3.23 

436740. AW975133; ; gb:EST387239 MAGE resequences, MAGN Homo; none.EPHJbd.fn3,pldnase,SAM; 3.20 

418319; AW611703; Hs.190173; ESTs, Weakly ^Har to A46010 X-Rnked ; non8.tRK; 3.20 

409744; AW675258; Hs.56265; Homo sapiens mRWA; cONA DKi=Zp586P2321 (f, none;NA;NA; 113 

418764; N30531; Hs.42215; protein phosphatase 1, reguiatoiy subuni; none,none; 3.10 

400646; ; ; sortniiMelaled receptor, (jpiB, class) ; EGF.fn3.idi jecepLaldl jecepLb.gTBnidin,BNR;TIW=Y;SS=M; 3.09 

422005; BE266556; Hs.110702; Homo sapiens mlWA; cDNA DKFZp761E212 (fr; none,Na H-Exdiangen 3.03 

426440, BE382756; Hs. 169902; sdute canrter family 2 (fadfitated glu; sugar^tr;TM=Y;SS=M; 3.02 

424187; AA335561; Hs.17287; ESTs, Weakly similar to S26689 hypotiieti; IRK.none; 2.98 

425852; AK001504; Hs. 159651; death receptor 6, TNF supeifamiiy member; death,TNFRj£;TM=Y;SS=l^; 2.93 

401279; ; ; CI 300035 1*:gii2494033)splQ64398|KDGO.MES: nooe.none; 2.88 

406671; AA129547; Hs.285754; met proloKjncogene (hepatocyte growth fa; Sema.pkinase.TlG,PSI,none; 183 

447081; Y13896; Hs.17287: potassium imvaidly-rectifying channel, s; IRK;TtyNY;; Z60 

453619; H87&48; Hs.33922; Homo sapiens, done MGC:g084. mRNA. comp; pkina5e:T]\MWI:; 2.75 

441699; AW511126; Hs.127572; ESTs; none>\a_trans; 2.73 

458781; A!444821; Hs,63085; ESTs, Weakly similar to MPP3.HUH^ MAGUK; SH3.PD2.6uanytate.kin.l77;TWNIVI;; 2.73 

446913; AA430650; Hs.1 6529; transmembrane 4 superfamfly member (tetr, transmembrane4;TM=Y;SS=l\^ 2.70 

453487; R31770; Hs.56562; ESTs; 7tm_1.none; 2.6B 

421279; AW664878; Hs.106645; ESTs; pklna5e.non6; 268 

419720, AA249131; Hs.337778; hypothetical protein FU11068; none.none; 2.65 

452345; AA293279; Hs.29173; hypothetlcaJ protein FU20515; DSPc;TM=M;; 163 

422247; U18244; H5.1 13602; sotute carrier family 1 (high affinity a: SOF;TM=Y;; 2.52 

42521% AW962253: Hs.171616; ESTs; p1dnase.none; 2.60 

427344; NM_000869; Hs,2142; S^iydraxytryplaminB (serotonin) receptor. Neur.chan.t^D,Neur chan memb;TM=Y;SS=M; 2.58 

423829; AW021173; Hs.18612; Homo sapiens d}NA: FU21909 fis, done H; vdtage aC,C6S,none: Z5S 

456737; BE247203; Hs.1 24831; (Xa-67 protein; 8bhydrolase_2;TM=M;SS=M; 2.53 

421585; U95626: Hs.302043; chemoklne (C-G motif} receptor-fike 2; 7tm.1;TM=Y;SS=M; 2.53 

424028; AF055084: Hs.153692; Homo sapiens cONA FU14354 fis, done Y7: none.none; 2.50 

448324; AB71356; Hs.34174; ESTs, Moderatdy similar to ALU8_HUH4AN A; ICEj)20,CARO,ICEj)10.none; 2.50 

402256; ; ; Target Exon; pkinase,U6A,none; 2.43 

452256; AK000933; Hs.28661; Homo sapiens cDNA FU10071 fis, done HE; GOUtnuLnone; Z40 

419169; AWB51980; Hs.262346: ESTs, Weakly simBar to S72482 hypotheti; none,spectiin,SH3,PH.CH; Z40 

420634; S42457; Hs.1323; cydic nudeotide gated channel dpha 1; cNMP_binding,ion_(rans:TM»Yn 2.35 

419630; W57756; ; gb:zd20g10j1 SoaresJetalJeartNbHHIOW; zf^HC4,none; 2.35 

426227; U67058; Hs.154299; Hun^ proteinase activated receptor-2 mR; 7tnLl:TM»Y:8&4yi: Z35 

400704; ; ; Target Exon; rig_chan,SBP^bac^3/NF,/eceptor;TM=Y;SS=M; Z33 

400149; ; ; Eos Control; acid j)hosphal;TM=Y;SS=M; Z30 

459327; AW149706; Hs.7859; gb:xf41d02j(1 Na.CGAP.Bm50 Homo sajrfen; PHOfWWP.SET,pWnase,ig: 2.30 
452220, BE158006; Hs.212296: ESTs; inlegrinJ^FG-GAP.none; Z25 
416690; H84078; Hs.108551; ESTs; pkinase.non8; Z23 
408354; AI382803; H3.159235; ESTs; none.none; Z23 

452203; )(57522; ; transporter 1, ATP-blndlng cassette, sub; ABC_tran.ABCjnembrane,SRP54,ThymWytatoJdn;TMsY:SSsM; Z21 

405093; ; ; C12001 101*:gI|7522643|plr|fT32733 AMPA g; none.none; Z20 

412723; AA648459; H3.335951; liypolhelfcal protdn AF301222; none;TM=M;; Z20 

418738; AW388633; Hs.6682; solute carrier family 7, (calionfc amino; none,none; Z18 

417185; NM.002484; Hs.81469; nudeotUe binding protein 1 (Ecdi Min; ParA,fBr4_NifH,Af8AuATPase;TM=M;; Z18 

433222; AW614472; Hs.238415; dk^kkopf (Xenopus laevis) homok}g 4; none,PH04; Z18 

413627; BE182082; Hs.246973; ESTs; Annadinoj5eg.lBB;TM=Mn Z18 

407415; AF073328; ; gb:Homo saddens tetracyOne transporter-; none.none; Z15 

450592; AI701S55; H3.202562; ESTs; pkin3se.none; Z1S 

428767; AI421972; Hs.g8802; ESTs, Moderately similar to T14342 N5D1 ; none.pkin85e.fg: Z15 

429012; AW629596: Hs.194726; BCL2-assodated athanogena 4; Sm,BAG;SS=fi« Z15 

419122; AI401380; Hs.44410; ESTs; ABC_tran>^BCjnembranejione; Z10 

446420; AW015693; Hs.135614; ESTs; k)n.trans.none; Z05 

420076; AA827860; Hs.293717: ESTs; OUF59.pkInase: Z05 

409416; AW388359: Hs.10667: ESTs; transnrembrane4;Tiyi'Y:S&«t Z03 

428766; AM77989: K5.98800; ESTs; TPR.7tnu1: 2.03 

427001; NU.00648% Hs.173135: duat-spedfidty ^rosIna^YKphosphoryl; pkinase;TM=lit Z03 
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453709; All 19133; ; protein kinase C substrate 80K-H; none^ne; 2.03 
423341; AW242394; Hs. 108650: ESTs; none.none; ZOO 

456772; AW604297; Hs.l29711: hepatitis A vlius cellular receptor 1:tg;TM=Y:SS=M; 1.88 
427786; BE407853; Hs.258871: ESTs; none.FG-GAPJtnul; 1.65 
423508: AW604297; Hs.12971 1; hepaCtis A virus ceflular receptor 1; ig;TRA=Y;SS=M; 1.0) 
447993: AW139525I Hs.17036^ ESTs; none^xme: 1.00 

TABLE 5BB 

Pkey: Unique Eos probeset kfentifier number 
GAT number. Gene cluster number 
Aocesston: Oenbank accession nunibefs 

Pkey CAT Number Aocesston 

414883 8371 J AF274943 8G494894 AI719075 AA908783 AI935150 AM22691 AA910644 AA583187 BNC72167 A1828996 AA527373 AW972459 AI831360 AA772418 
AI033892 AA100926 AU154749 AI459432 AI423513 AI094597 AA740817 AI991988 A1090262 AI31 2104 BI256707 AA459522 AA416871 A1075239 
AI339996AA701623 AI139549AI336880AA633648 AI989380AI362835 AA399239 A1146955 BF514Z70 N92B92AI348243 AI278887AA459292 
AI494230 BF507531 AI492600 AA962596 AW613002 AA293140 AA235549 BF108854 AA954344 N49682 AI457100 AW5B9407 AW300758 BE220715 
BE220698 BE569091 BM009647 BF800351 AI537692 AI203723 AiB57576 AA584410 AW371667 BM172363 

438091 22448J AK054860 AV652198AV652192AV652138AV652127 AV652194 BE935919AV652017 AV651995 AV551548AV646063 AV651985 AV646184 
AV646179 AW880409 AA345002 BF155189 BE068931 X56197 AIJ03014 AW953629 BM263546 BE550772 AA701084 AI681 352 AA358689 
AW938841 BF438147W05391 H75313BF326185AV646335AV651589AV646340AV651992AV646384AV646364AV687497BF155183AVB46370 
AW797876A1906821 X56196 BE833835 AA628440 BE833808 BF224205 AA709126 BE673807 AI923886 AA947932AI2761 25 All 85720 AW51 0698 
AA987230 BE467708 AW898628 AW898544 AI146984 AW043642 A1288245 AI186932 AI635262 AI139455 A1298739 AlBI 3854 AI024788 BE699445 
BE699444 AI707807 D52654 AI214518 AI004723 AI698085 AW087420 A!565133 AA845571 AW898622 BF110144 AW513280 AI081 126 BF362770 
AI268939 AI435818 BF475318 AI024767 BE174213 AA757598 AA513019 AA902959 AI860794 AI334784 BF10841 1 BM310532 AW513771 AI951391 
At337671 BF09S606BF095601 BFOg5468AWB90091 BF095753AW243400AW898807AW898616BF362762AI922204AWB986KBE699468 
BE174196 AW102923 D52715 BE699456 D52477 D55017 BF955933 BG623563 AV646254 AA463522 B1003244 AI299190 W40186 BE174210 
BF939091 BF434180 AW579001 T55662 H0181 1 T52522 BF945037 BF955938 D54679 D53933 R67100 BG925552 BF999055 R83430 229922 T85791 
W03942 H63289 AI091 537 BF086583 AA345570 H48870 H80720 T83523 BI039626 BI037700 R00353 BF1 55184 N98343 N79072 HOI 812 T55581 

433470 6624J X75684 A1^73167 AI445461 AI453743AI983655AI564644 AA977180AI694111 AI591358 AW071625 AI578712AI720939 AI927769 BE439796 
AI963432 AA292956 AWig2593 AI86S838 AI696905 At424384 A1161312 AI91 1921 AIS97801 BI494959 A1240988 AI4d2S54 AW262737 BE044033 
AW008570 AW629505 BI494958 AA08&439 AA708057 BF222820 BFS93608 BE501957 AA524526 BE044134 AW572531 AW015724 BE3491B6 
, AA043217 BE219784 AI799814 AA129575 AI671727 AI470033 BE646195 AW779725 AA903050 AAU7228 AA404570 AI075878 W38161 A1972739 
AWS73152 AA723200 C06123 BF057147 AA627686 AA1S7944 AI990245 AA662517 T32487 AI800106 AI333170 AI859160 W45410 A1990B27 
AW275048 AA182640 AA478328 At298935 AW08S158 AW471421 AW103470 AW300456 AW191997 AI823466 AA962397 AA136658 A125181 7 
AW339104 AA724739 AA41 1 100 AA191349 AA757735 AAD37696 AI769516 AWn2283 AA010631 AI892B46 AI061065 H80983 R79933 Ai9S0693 
AI245632AI349390AA148284AI798502AA487893AI621320AW1 94272 C06385AA953883BE858936AI91 8523 

AI189366 AW338678 A1261359 AI500576 BF477735 AI032569 A1972899 AI985583 Z28771 AI363829 AI693030 AA603586 BE773488 AW339301 
BE773489 8E773462 BE773495 At650338 BE773499 AI745717 BE81 1475 BE81 1470 BE81 1464 BE811418 BE81 1415 BE81 1400 BE811398 
BEB1 1388 BE811352 BE773501 BE773494 BE773486 BE773474 BE773473 BE773470 BE773481 BE81 1350 BE811337 BF593W7 B6055071 
AW875302 BF003088 AA719173 BE811348 Ai582462 A[688240 BE773500 AI244845 AI565439 AI918453 AI472527 AI446740 AA035576 AAigi414 
AW674145 C0S782 AI589264 D57558 AI468237 AI432033 AA989662 R21752 BF002457 AA988297 AL57409S AL676200 Al^1074 AL574525 
AL57B810BG498381 AI92B364 BE879732AA479834AA479712C17732BM091258 BF843901 AW820230 CI 7476 BE327120AA129574AA1 36645 
BF84390O AW806193 AA502832 AA649494 AL568520 AL547950 BE706937 BE81 1350 BE773498 BE81 1401 BE773484 BE81 1437 BE81 1380 
BE811399 BF997171 BF757734 BE926037 AI377595 006111 AW088968 BE81 1404 BEB11472 AI855912 AI925607 AI871950 AI093510 BE905927 
BE81 1435 AA191387 AW772000 BEB1 1453 BE614379 BF844522 BI044896 AI744233 AW984527 C17504 BF843883 A1248307 BE773483 AI567995 
W60075 BF941 183 Ar738844 BE811458 BE773481 AI262930 AA948565 BE706942 BE155360 T65026 AW242958 AW1 97954 BE905184 AA722206 
AI344943 AI348877 AI334860 BE621857 BE156280 AA454099 AA037722 BF843897 AW806183 AA043216 BG482896 AA182734 AA877242 
AW372926 H27252 R381 14 BFB51858 BE156214 AA190427 T91762 AA035067 AA837326 T10930 BF906587 BI755027 BG506731 BC008442 
BC010166 AL550134 AL553096 ALS48700 AL550751 AL547978 AL545286 AL540643 AU1 18627 AL601379 BI259821 BG741786 BI868522 AU135866 
BI552770 BI259210 B1256520 BI255S69 BG485098 B1258228 BG498501 BM044512 AU1 33984 AL5565e6 BE745111 B1222633 AU133917 B6288151 
Bt260715 BI550550 BG500773 BI551761 BG707601 BI818593 BF691383 B6721 129 BG541578 BE906666 BG751098 BI224135 BG400746 B6478065 
BE790436 AW080238 AU137549 BG429896 BE392486 AW961586 BG721056 BE908365 8E546556 BG541235 AW583735 B6S28290 81260895 
AW651691 BM048974BMO43805BG142185AA315188AI446615C06300BG497644AA088544A1815987BG528831 BE619182AW239185 
AW062910 AW082902AA347236F11933 AA488005AA301631 AA37680OO56120AA343532 AA30B636 F00242AA376086AA316968AA343799 
BI870221 BE910282BG538748AW960564AV732879D16854AA192519BP922148AA216013BG624091 BE5443876G507008AW176446 
BF790033 BE088925 BE08d854 AA921353 R21800 AA01 1222 Tg7525 
436729 6624.1 X75684 Al^73167 AI445461 AI453743 AI983655 A1564644 AA977180 AI694111 AI591358 AW071625 A1878712 AI720939AI927769 BE439796 
AI963432 AA292956 AW1 92593 AI865838 AI696905 A1424384 AI161312 AI91 1921 AI597801 81494959 AI240988 AI492554 AW262737 BE044O33 
AW008S70 AW629505 BI494958 AA088439 AA706057 BF222820 BF593608 BE501957 AA524526 BE0441 34 AW572531 AW015724 BE349186 
AA04321 7 BE2197e4 AI799814 AA129575 AI671727 AM70033 BE646195 AW779725 AA903050 AA147228 AA404570 AI07587B W381 61 AI972739 
AW673152 AA723200 0061 23 BF057147 AA627686 AA157944 AI990245 AA662517 T32487 AI800108 AI333170 AI859160 W4M10 AI990827 
AW275048AA182640AA47B32BAI298935AWOBS1S8AW471421AW103470AW300456AW19ig97AI823466A^ 
AW339104 AA724739 AA411100AA191349AA757735AA037698 AI769516AW772283 AA010631 AI892846 AI061065 H80983 R79933 AI950683 
AI245632 AI349390 AA148284 AI798502 AA487B93 A1621320 AW194272 C06365 AA953883 BE858938 AI918523 AI872628 AI927217 AI453453 
AI189366 AW338678 AI261 359 A1500576 BF477735 AI032569 AI972899 AI985583 Z28771 AI363829 AI693030 AA603586 BE773488 AW339301 
BE773489 BE773462 BE773495 A1650338 BE773499 AI745717 BE81 1475 BE81 1470 BE81 1464 BE81 1418 BE81 1415 BE81 1400 BE811398 
6E81 1368 BE811352 6E773501 BE773494 BE773486 6E773474 BE773473 BE773470 BE773461 BE811350 BE811337 BFS93847 B6055071 
AWB75302 8F003068 AA719173 BEB11348 AI582462 AI686240 BE773500 AI244845 AI565439 AI918453 AM72527 AI446740 AA035576 AA191414 
AW574145 C05782 AI589264 057558 AI468237 AI432033 AA989662 R21752 BF002457 AA988297 AL574095 AL576200 AL571074 A1574525 
AL578810 BG498381 A192B364 BEB79732 AA479834 AA479712 C17732 BM091258 BF843901 AW820230 C17476 BE3271 20 AA1 29574 AA136645 
BFB43900 AW806193 AA502832 AA649494 AL568520AL547960 BE706937 BE811360BE773498 BE811401 BE773484 BE811437 BE811380 
BEB11399BF997171 BF757734BE926037AI377896 a8111 AVira8968BE811404BE811472AI865912AI925607A1871950AI0935lOBE 
BEB1 1435 AA191387 AW772000 BE81 1453 BE61 4379 BF844S22 BI044898 AI744233 AW984527 C17504 BF843883 AI248307 BE773483 AI56799S 
W60075 BF941 183 A1738844 BE81 1458 BE773481 AI262930 AA948565 BE706942 BE156360 T65026 AW242958 AW197954 BE905184 AA722206 
AI344943 A1348877 AI334860 BE621857 BE1 56280 AA454099AA037722 BF843897 AW805183 AA043216 BG48289B AA182734 AA877242 
AW372926 H27252 R381 14 BF851858 BE156214 AA190427 T91762 AA035067 AA837326 T10930 BF906587 B1755027 BG506731 BC008442 
BO)10186AL550134AL553096AL548700AL550751AL547978AL545286A1340643AU118G2^ BG741786B1868522AU1 35866 

BI552770 BI2S9210 BI256520 BI2S55G9 86485098 BI2S8228 80498501 BM044S12 AU133984 AL556586 BE7451 11 BI222633 AU133917 BG288151 
81260715 81550550 BGSOOnS 81551761 BG707601 81818593 BFe91383 BG721129 6G541578 BE906666 86751096 BI224135 BG40074& BG478065 
BE790436 AW080238 AU137549 8G429896 BE392486 AW961 686 BG7210S6 BE9083G5 BES46656 BG541235 AW5B3735 BG528290 81260895 
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400211 



453826 
447727 
400121 



423387 2612.2 



353^1 



AW651691 BMD489748Am3805BG142185AA31518BAM46615(»6300BG497644AA088544AI81S987BGS2^ 6E619182AW239185 
AWD62910 AW062902 AA347236 F11933 AA468005 AA301631 AA376800 D56120 AA343532 AA308636 F00242 AA376086 AA316968 AA343799 
B»70221 BE910282BG538748AW960564AV732879D16854AA192519BF922148AA216013BG624091 6E544387 66507008 AW176446 
BF790033 6E088925 BEG88854 AA921 353 R21600 AA01 1222 T9752S 

437056 428504.3 AW976398 A) 147061 AA785223AA743360AI603927 

422940 58443J 60012771 8G3971S3 BF366196 AA3372n AA319285 AW843252 

409745 MH1944.5 B1030997AA921874AW188822BI027B62AI347618AI361453AI088754AW207491 AA077391 8G01 2775 BC5997382AA286833AA1 50722 81007625 
B1027884 61009100 61006275 81006270 BI031000 61029884 B1008277 BI007627 BI006266 61006991 6IOQ6990 61007763 BI007762 8G997377 
AA150760 81033518 BI027818 66015769 81033807 AA341445 

407347 81G943J T23514 AI655785 

418546 242836.1 T59708AA224^T59343BE1 56903 

432407 MH1429.12 BGC36675 BF772005 BF771B66 BG960386 BG960381 NM_005712 AF110315 BE074534 BE182776 BE158000 8E157999 8E714315 AWB18104 
AWB47519AA099426 AW817981 AW856396BG981122AA224498AA308542AW821833BF902155AI732411 6677883466283641 6E748279 
BE748870 BG319540 BE748864 BF739224 BG9B6155AK057283 BI861466 AA663341 AA457591 66949294 AW392886 AA071122 AA227849 
AA584918B6959570BF773486AL041698BF959013 R87170C16859BF770411BF771298AI075321 LI 3823 AA21 6700 BF771864AW861 859 
BE537068 C18935AA1 55719 BF77 1172 BF769107 BFB04964 AW81 8172 AW81 8143 AW392930AWS1 7057 AW858044 BF746211 AA179928 
AW861687 AW821B26 61055726 8F242643 AA207189 BF770412 eF771 157 66430030 AA055592 

L13288 AA92878S AI608912 AW872978 AA5656S5 AIQZ2915 AI304920 AI564366 AI668793 AI084557 T60038 R72302 H45409 AA508805 R463S6 
/V^18798 BM129553 BM129126BM1292926M128865AI808418AI689932AI8065736F431808AW872985AW 

R73391 R72085 R72840T83751X752998F7M348 R94105AW449B39R73300NM_004624AI797007BE045543BF110021 BF754250 T83923 
AW884084 AA903895 AA418962 L20295 R72351 H45098 AA961010 R73210 R464S1 AW884085 BI022902 61763932 BI910138 
NM.003899 D63476 BM456434 AA778936 AA452871 A!052466 AW014136 AA448725 BE673088 AW028198 61856378 eM150466 6M150674 
BM148451AVV500880AA180228 6E243S078M144903AA333656AW503767AA305470AW504819AAg781M 
AA704686 AA652189 AA179463 AL535925 6E275744 BE27770B 6E275715 AW504259 AA354483 BE244197 6E246232 D17055 AW013876 
AW014877T09464 T08407AA830246 AW897881 eE501192 6E501195AL044534AA258853 61037915 AA448037BKW61769 BI825965 BE763352 
AW167531 Z45588 AV721881 AA5Z7273 AI573219AA457036 AW4396S1 AW28441B AA577618 AI802954 AA302292AM68752AI380374 AA722690 
AI867708 AA916982 AI291576 AW190427 AI338089 AI653744 AI306665 AW513541 AW440077 AI370014 AA904269 AW188378 A1571644 AW193386 
A1261832 AA775336 BF43681 1 A1582703 A1278835 6E440186 AA6178g8 AA646948 61491837 6F59a311 AA448833 F27048 F37022 AW77081 9 
AA258808 AI369564 AW503675 AAn7194 6ES01048 6F222087 AA042973 AI888087 AA91 1460 Z41274 AI9ig082 T16746 AA447634 A1282427 
F22456 T15901 AA825298 AW007436 BE934303 BG9B1939 AW373814 BM1 51638 AW956921 BM150080 BM153173 BM147451 BF953992 AA916698 
AW44493S M7839a AW581 147 AW608258 AA651910 AA132152 AW806295 T30326 D20054 AA310837 T06543 BM194508 BM1 93225 BM4e9348 
AW964920AA325930 B1833627 AW952193 AA738189AA321051 BG987199 BF953967 T08890 BE869543 BG742B57 BG9BB685 AA4S5880 
86001842 BF809452 AWB920B3 BF944342 T49551 W69981 BF764519 T1 5869 AA1 32030 
400158 2166.1 BC000486 80)01930 U1500B NM.004597 86326561 AV761269 BF212132 BG113313 B1838038 BG944198 BG255198 BG164334 BM013507 
AV882717 BF028751 BG755489 B6030141 BG105499 BE440103 BF727449 BM454228 BG912181 BM015197 BG395995 AI126579 AW248204 
BI857394 BI857319 BG944205 BG323958 BI092930 AV756576 AA2B0959 BF033078 BE539360 AW247174 BG107B36 BE796975 BG170920 A!148142 
AI092009 A1336349 AW13409 BF685691 8E616029 AW13166 BE909458 BE743868 BM018218 A1340153 BE792567 BE206806 
BE20858S BI862286 C18760 86219364 All 22677 8E207297 BF754798AA314384 BEB73951 BG207294 All 48198 AI336189 All 41094 Ai308985 
BF220098 AI143289 BE856397 AA644001 8E675402 AJ346708 AI052809 Al 140082 BF081542 AI342428 AI479525 AA278456 AI750163 BE707501 
BG944510 AA314616 AA315774 BF360721 8E738404 AA814194 AA477195 AA932107 F3685B AA363159 BE865458 AA354147 AW904224 AI055855 
AA380365 F28474 059140 AA341946 AAg93339 AA323350 F24344 AA374436 BG942061 AA636015 AA852gi9 T31486 F36334 66943190 T30319 
BG943153 BG941836 T32569 T31 160 AA31 1749 BE710086 BE70991 1 W23826 BG941639 BR69034 AA026407 AA229555 AV7S6340 AV714380 
AL138129 BG003205 AL136179 61021773 BE064231 

AL520734 81666852 BF870571 BF945361 BF945354 A!770086 81861138 A)421079 A1043200 66683612 AW946265 
NM.003899 063476 BM456434 AA778935 AA452871 AI052466 AW014138 AA448725 BE673088 AWD28198 81656378 6M150466 6M1S0674 
BM148451 AW500880 AA1 80228 8E243507 BM144903 AA333656 AW503767 AA30S470 AWS04819 AA978194 AW500776 BE872488 AH)32663 
AA704686 AA652189 AA179463 AL535925 BE275744 BE277708 BE275715 AW504259 AA354483 BE2441 97 BE246232 D17055 AW013876 
AW014877 T09464 T08407 AAB30246 AW897881 8E501 192 8E501195 AL044534 AA258853 B1037915 AA448037 BM461769 BIB25965 BE763352 
AW167531 Z45588 AV721881 AA527273 AI573219 AA457036 AW439651 AW264418 AA577618 AI802954 AA902292 AA468752 AI380374 AA722690 
AI8S7708 AA916982 AI291 576 AW190427AI338089AI653744AI306666AW51 3541 AW440077 AI370014 AA904269 AW188378 At671644 AW193386 
AI261832 AA775336 BF436811 AI582703 AI278835 8E440186 AA617898 AA648948 61491837 8FS90311 AA448633 F27048 F37022 AW770819 
AA258808 A1369564 AW503875 AA777194 BE501048 BF222087 AA042973 AI868087 AA91 1460 Z41274 AI919082 T16746 AA447634 AI282427 
F22456 T15901 AA825298 AW007436 BE934303 BG981939 AW373814 BM151638 AW956921 BM150080 BM153173 BM147451 BF953992 AA916696 
AW444935 M78398 AW581 147 AW608258 AA65191 0 AA132152 AW806295 T30326 D20054 AA310837 T06543 8M194508 BM19322S 6M469348 
AW964920 AA325930 81833627 AW952193 AA7a8189 AA321051 6G987199 6F953967 T08890 6E869543 86742857 B698868S AA4S6880 
86001842 8F809452 AW892083 BF944342T49551 W69981 BF764519T15869AA132030 

BM048470 66983877 N92089 Z4354S F05783 R35182 H22769 H71928 H63921 R94394 AW195830 AA630220 F02030 R60760 R0B204 H71929 
BE454731 R62568 H$3520 H77840 AL51 5501 F01971 T71768 N55275 AA883505 AW608824 AA704682 N74193 W57593 AW753775 H60439 
AI076352 R31485 8E160931 AW130730 BE160869 AW580256 AW902073 BE702158 AW382079 N76884 8F431013 H38156 AV648937 H80573 
455358 1160035.1 AVV9Q2569AVVg02S57AVV902654AVV902641 AWg026S0AVV902741AVV^^ 
415204 186S508J Z25288R85077R»323 082802 T27434 

425854 2638.3 BF1 84345 AA384846 81494468 AA775693 8F19S801 AA749190AI4802236E327332 A1470017 AW628345AI917472 AI972264 A1375571 A1652583 
AI556167 AI9831106E5S0378 AVV01S085eE552318AI631475AM33447 AA534540AI866480 A1468119BF110875 AW51^ AA827482 

RS9357 

424160 5320.3 AW370493AA923304 AI66082BAA502611 BI017370AA039704 W65393BE938514W652956F919826 61759440 BI838879AA336207F12441 
BF1711876E185031T74062AW904410AA280078 8F919&29 

417086 1154.2 AK05660S AKD56969 AW380724 AA195950 F21243 Z17422 AA176595 AA1769SS AA194350 BC005933 BC017866 AA196396 Z24810 AA181361 

AA1931 15 AA086465 F25194 F30130 AA180013 /VI346143 F20589 F20820 F29371 F32439 F25153 F32146 F261 10 F28506 F25307 F27533 F20375 
F32370 AA653078 BF574897 Z24852 F00172 T30560 AW449825 AI620346 86986374 BE706521 C02691 AL596834 F31902 F26078 AA6700g9 
8F475S55 F30818 F37524 AJ346558 F28050 F17g33 F31637 C03413 A1092152 AA180743 AA085730F21998 F20854 F18944 F31180 F37937 F37738 
AA193162Z17344AA192546 

411188 1072487J AW82 1 260 BE 1 62466 BE161 168 

417529 3302462.-1 AA203634 

417527 2431831_1 AA203524 W88451 

451385 8S022J AA01 9761 AA01 7656 AA01 7374 

436740 1239008,1 AW975133AA805813AA729943 

419630 37310.5 BF223647 AA204668 AA2S60e6 BF574707 W57756 AA249070 81906494 AA24801 1 

400149 16458.1 BC003160NM-001610X12548BG386685 BI760866BI559619B6323829AU135543B1834101 AU142120AU124511 AU1 24889 AL558171 AU1172B6 
BI824000 BG386610 8I7532B5 81223475 AU134828 BM126369 BE206493 BF751498 AL544274 AY007137 81828921 BE870130 BF771242 BI835451 
81765655 81820955 81223344 BG015924 BI759894 AL527413 BF310588 N31870 N23974 BE514914 T48863 AW8602S7 BF334625 AAB83850 
AU144168 AA442562AU159491 AU148353 AA564123 AU148667 AI377256 AW664004 AI871712 A1141486AI3323S1 AI339094 BE206109 AW519033 
AI817729AI332490AI149455AI857411AI7631S4AI751608AI377222AI0819S6AVV664229AI27S672A^ 

AI269164 AW105344 8M1930B1 6E550930 AI082116 AAB54691 AI056249 AI221082 AI290113 D51818 AA732409 660551251485878 AUl 561 21 
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366136.1 
10123.14 
353SL1 



415327 9792J 
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BF093871/ _ 

. AI094208AA028946AA975078R16815BG151132Atr76»7b5^^ 
452203 2630.1 6^14081 NM.OW593 X57522 L21208Ui207 L2t20BL21205l^l204AL561404Al546423 

BG742981 AA2798B5 AA847441 AA313737 BF172639BF897216 BF914190 BF903647 S70277 AI569694 AVV073296 AI3614M MS^B^ 
BE858232AA838610AI539114AI719375A1829129BG057675AI423422AU15B^K 

"^ISS ^^2301 BG875994 AA05445B AA353161 AI940434 BEelwa^KJXASjS^^ 

AIB19778A1346733AVV771150AW512525A1249904AA279809AI352549AW512517bS^^ 

AV660t90AL556475AI62002DAW08988BAW079179Z21518AA^7ro^^ 

453709 S9915J AK057669 AK054977AU19747BE893744B^«S 

^^SJS5^!S]SX^.?J^'^'«^2355BF439899AW055166A1096957 



^^nl??D«^«S™IfS^ BG057324BF001339AA910169 AA765133A1360722AI701849 AI365083 H9r97TAI8M377X3T2M^ 

ooll^P1^^^^^^^218SAA705746BE379632 W93803AI440333AW^^ 
?SS.\^Sf ^^^^'^^^^^ W23495 mil90 BF995236 BF9^^^ 
AVV367707AA630879AA428420R76236BG567847 N25931AA173568AI073567>^M7^ 

M^SI BE181002W60239BF98759BBF995279W 



TABLE 560 



Pkey: Unique number conesponding to an Eos probesel 

^' ^."^^^s"^' TThf 7digltnumb^lnlWsco^^ "Dunham L el ai; refers to lhe«it)llralk)nentltW 

sequence of human chromosome 22.' Dunham I. et al., Nature (im 402:489-495 Fuwwwn mv\m i ne una 

Strand: Indicates DNA strand from which exons were predicted. 
NtjnsKion: Indlcdes nitcteotide positions of predicted exons. 

PKey Ref Strand NLposltion 

400843 9188605 Plus 5863.5970,7653-7784.8892.9023.9873^7 

404942 7382153 Plus 92095-32252 

400752 7331445 Minus 36215-36461 

400496 9743564 Rus 4151W1695 

404568 9966995 Minus 92893-93116 

403912 7710730 Minus 72000-72290,72431-72700.72929-73199 

402328 4464283 l\«nus 13758.13922,14558-14752 

404891 7329392 Plus 84974-85125 

405036 7543748 Minus 121957-122129 

400845 9188605 Plus 34428-34612 

403212 7630897 Minus 156037-156210 

401927 3873185 Minus 112000-112137 



TABLE 59A: ABOUT 1 170 GENES UP-REGUIATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 

Ptey^ Unique Eos probeset Identifier number ' 

ExAocn: Exemplar Accession number, Genbank accession nuniber 

UnigenelD: Unigene number 

Unigene TiOe: Unigene gene title 

R^' Ratio of prostate tumor to normal adult body tissue 



Ptoy ExAccn UmgenelD Unigene Title 

L22524 Hs.2256 matrix metalloproteinase 7 (matrilysln. 

AA53K10 HS.17199S kallikrein 3, (prostate specffic antigen 

AF097021 Hs.273321 differentlafly expressed In hemab^ 

H40164 Hs.80296 PUfldr^ceHpn)le&i4 

AA506324 Hs.ie52 add phosphatase, prostate 

M57399 H3.44 pleiotrophin (heparin binding growth fac 

AF151103 Hs.l 12259 TceO receptor gamma locus 

AW582256 Hs.91011 anterior gradient 2 (Xenepu? teevis) hom 

AVW92356 H5.83S4 Homo sapiens pynwate dehydrogenase kina 

Ai093155 Hs.95420 JM27 protein 

AA503115 H3.183752 mioosemiroprotEin. bela- 

S39329 HS.1813S0 kallikrein 2, prostatic 

Nl\1.006159 Hs.79389 nel (chktenHite 2 

AI249368 Hs.98558 ESTs 

AW292425 Hs.163484 ESTs 

AF109298 Hs.1 18258 prostate cancer associated prol^ 1 

AA604698 Hs.62547 retinote add receptor responder (lazvo 

AF179274 Hs.22791 transmembrane protein with EGF-Gke and 



428330 
426747 
420923 
416854 
425075 
431103 
434666 
419551 
413859 
420154 
428338 
400287 
416602 
428398 
432441 
414569 
417771 



R1 

108.1 

1015 

90.5 

79.8 

71.6 

70.6 

69.1 

66.4 

66.4 

63.9 

61.4 

59.7 

54.6 

54.6 

54.6 

54^ 

54J 

53.7 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



437145 


AF007216 


Hs.5462 


sohitfi csrrtcf fsn^ly 4, sodium bicsrtxin 


49.6 


446057 


A1420227 


Hs.149358 


ESTs. WeaJdy similar to A46010 X-lirdced 


48.0 


431579 


AW971082 


H5.222888 


WB;ddvfiirT^h)TRH¥ HUMAN TRICH 


46.8 


400299 


X07730 


Hs.171995 


ItnlliVmln ^ frvnctnto cnM^!if> onftnpn 

MUUAIClII J| yNUdmiC 9p«UilC tfllBJCII 




427674 


NM.003528 


Hs.2178 


H2B histono fandly, ntembef Q 


44.9 


432374 


W68815 


Hs.301685 


Homo saoiens cDNA FU1 1 346 lis. dons PL 


44,5 


422487 


AJ010901 


Hs.198267 




44.5 


415314 


N88802 


Hs.5422 


^ycoprotfiln M6B 


43.2 


431958 


X63629 


Hs.2877 


csdhsrin 3, typs 1, P-csdhcrfn {p(3ccnl3 


41.9 


409262 


AK000631 


Hs 52256 






423582 


6E000831 


HsJ3837 


Homo saoiens cDNA FIJ1 1 612 fis ckmaHE 


40.5 


400298 


AAD32279 


Ks6163S 


^ht IfonsmftmhninA AfilfhAlbri flnlkuM nf 

OM UniMIIOIIIUIClllOtl|JIUIDilal dlimOIIUI 




439176 


AI446444 


H5.19Q394 


f STfi Weaklw clinfbtr tn R7Rnqfi Snn.1 nr 




416565 


X54162 


Hs 79386 


ktiMTwfrfi 1 /emnnik i*iiierfa\ 
RuoiiiMUi 1 \e(iiiwtn muawo^ 




447726 


AL1 37638 


Hs,19368 


mablQn 2 


38.6 


426174 


AA547959 


HS.115B38 


ESTs 


38.1 


425071 


NM 013989 


Hs 154424 


dfllnrflnASA InrintKvmnlno tvno II 


OO.v 


417067 


AJ001417 


KS 81086 


fiohltn Mnlpr fnmth/ 97 foytnmRirmnal 
eVHUW wiliKSi loIliU/ fX (WUoTlBUIIJTIal 


Of.O 


414217 


A)309298 


Ks^79898 


Hnrnn sanians cDNA! FLJ23165 Ik rinnn 1 


37.0 


433043 


W57554 


Hs 125019 


ijriiifjiHjiu iiM«M34ii |Nwm yxjnr-'tf nmno 




400302 


N480S6 


Hs.1915 


Msto hvrimlflSfl fMBsliitai^nefHlp mMnh 




453096 


AW294631 


Hs.11325 


ESTs 




419239 


AA468183 


Hs 184598 


Homo saoians crHAi FLJ23241 lis. chma C 




427665 


AF134803 


Hs 180141 


enftlin 9 /rrvKptnl 
wvtuiii c \iiiu3i*ni^ 


U Q 


415539 


At733881 


HsJ2472 


BMP-RIB 


34.9 


428411 


AW291464 


Hs.10338 


ESTs 


34.6 


428819 


AL1 35623 


Hs. 19391 4 


KlAAfK7') npnn nrrytiv4 




401424 






NM 001 172*Hnfnil RaniAiM nrnfn»«A KmA tl 


34.0 


452114 


N22687 


Hs.8236 


ESTs 


34.0 


426990 


AL044315 


Hs. 173094 


Homo sapiens mRfiA for KIAA1750 protBin, 


33.8 


448045 


AJ297436 


Hs 2016S 


nmclafo cipm paII onitnan 




421552 


AF026692 


Hs.105700 


socrolod fi1zz!6d*fGlal&d proton 4 


32.9 


432101 


AI918950 


Hs.123642 


EphA3 


32.7 


427398 
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453510 


At6g9482 


Hs.42151 


407894 


AJ278313 


H3.41143 


433908 


AW298141 


Hs.157975 


440529 


AW207640 


Hs.16478 


413249 


AF167160 


Hs.75251 


408430 


S78876 


H$.44926 


430665 


BE350122 


Hs.157367 


447752 


M73700 


Ks.105938 


426044 


AAS02490 


HS.33869S 


403362 






427982 


NM.016156 


Hs.181326 


407634 


AW016569 


Hs.136414 


448362 


AA641767 


H3.2101S 


433226 


AW503733 


Hs,9414 


427472 


AA522539 


Hs.131250 


453861 


AIQ26838 


Hs.30120 


456082 


AW978811 


Ks.314451 


409264 


NM_014937 


Hs,52463 


432729 


AK000292 


Hs.278732 


422270 


AF114494 


Hs.1 14082 


410339 


A1916499 


Hs.298258 


431992 


NM_002742 


Hs.2891 


424432 


AB037821 


Hs.146858 


431933 


AI187057 


HS.132S54 


440749 


W22335 


Hs.7392 


416434 


AW163045 


Ks.79334 


450382 


AA397658 


HS.602S7 


407021 


U52077 




413786 


AW513780 


Hs.13500 


437866 


AA1%781 


Hs.74170 



hypolhetical protein MGC2747 15.9 

RAB1 8, member RAS oncogene iatr&f 15.9 

KM_004930*:Homo sapiens capping protein 15.6 
HoniosapiensctoneDT1PtB6mRNA.CAGfep 15.6 

ESTs 15.7 

ESTs 15.7 

ESTs 15.7 

peroxlrodoxin 3 15.7 

Homo sapiens HUT11 protein inRrM,p9fial 15.7 

EST8 15.7 

glycoprotefn M6B 15.7 

sterile-alpha mofif and teudne zipper c 15.6 

hypotheScal protein FU21988 15.6 

ESTs 15.6 

ESTs 15.6 

six transmembrane epithelte) antigen of 15.6 

ESTs. WealdysiinBartoALULHUMAN ALU S 15.6 

ESTs 15.5 

protein tyrosine pliospiatese. receptor 1 15.5 

ESTs 15.5 

ESTs 15.5 

Horrto sapiens piostein mRNA, complete cds 15.4 

ESTs 15.4 

ESTs 15.4 

ESTs 15,3 

KIAA0217 protein 15.3 

ESTs, Weaidy simiter fo AUJ4J{Uli/)AN ALU S 15.3 

iiypothetica) protein FLJ20729 15.3 

thrombospondln 1 15.3 

prominin (mouseHQce 1 15.3 

dautfin 12 1&2 

ESTs 15.2 

hypothetical protein FLJ13782 15.2 

ESTs 15.2 

chromosome 1 open reading frame 21 15.2 

chromosome 1 open reading frame 21 15.1 

plelomorphic adenoma gene4tke 1 15.1 

ESTs 15.1 

homolog of yeast ubiquIUn-proteln Ggas 15.1 

UM domainH^onteining prefened transloc 15.1 

ESTs 15.1 

Homo sapiens cDNA: FU22696 lis. done H 1 5.1 

small nuctear protein PRAC 15.0 

DKFZP434G032 protein 15.0 

Homo sapiens cONA: FLJ22528 8s, done H 15.0 

ESTs. Weaidy simiter to A46010 X^tnked 15.0 

ESTs 15.0 

bjmican 150 

hypothelical protein OKFZp56401278 14.9 

protein phosphatase 1, regulatory frnhb 14.9 

ESTs 14.9 

zinclinoer protein 200 14.8 

ESTs 14.8 

ESTs 14,8 

phosphoinosilide-spedficphospholipase 14.6 

ESTs 14.8 

Homo sapten3cONA:FLI21718fis, dons C 14.8 

OEAD/H (A6p^u-AlaV\sp«Is) box binding 14.8 

dipeptldytpeptidase IV (C026, adenosine 14.7 

ESTs, Weakly simiter to 178885 serineAh 14.7 

tactetransfenin 14.7 

ESTs 14.7 

NMJ)01615*:Hb(no saptens actin, gamma % 14.7 

KIAA1073 protein 14.7 

UDP-GlcNAc:beteGalbetEh1,3'H«»tylgluc 14.7 

hypothetical protein DKFZp664L0884 simii 14.7 

K1AA1488 protein 14.6 

transposon^erlved Buster3 transposas&4 14.6 
ESTs. WeaWy similar to NUCL.HUMAN NUCLE 14.6 

ESTs, Weakly similar to ALU1_HUMAN ALU S 14.6 

KIAA0966 protein 14.6 

hypothetical protein FLI20285 14.5 

protein tyrosine phosplialase^ (proli 14.5 

ESTs 14.5 

protein tdnaseC,mu 14.5 

proiocadheiffi 10 14.5 

ESTs 14.5 

hypothetical protein M6C3199 14.5 

nudear factor, inteiteukin 3 regulated 14.5 

Homo8aftocONARJ13598fis.donePL 14.S 

gb:Human marteerl fransposase gena, comp 14.5 

ESTs 14.5 

meteltoltiionein IE (funcSonal) 14.5 
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454078 


AA601518 


HS.222D9 


eocfttlarf imirfiilBr jtalptiirn-htmniiA nmtatn 
ScwoIdO iiwuulBf liowui If DlnQliiy piwlBlil 


14.4 


411990 


AW96i3824 


H3.31707 


ESTs. WbaUv dmBar to YEW4 YEAST HYPOT 


14.4 


443280 


AA299688 


Ks^4183 


ESTs 


14.4 


416838 


D54745 


Hs.80247 




14.4 


436860 


H12751 


Hs.5327 


PR01914 Dfoteln 


14.3 


425174 


D874S0 


Hs.154978 


K1AAQ261 Dn^in 


14.3 


442242 


AV547906 


Hs.90424 


Homo sanifins cDMA: Fl. J939ft5 ftt cImia H 


14.3 


443790 


NM_003500 


Ks.9795 


8cyl«Co€nzyni6 A oiddsso 2, brsnchsd chsi 


14.3 


418994 


AA296520 


Hs 89546 


splficfin F fftflrinlhnlbrf nHhpicirvi nvJmil 


14.2 


419563 


AA526235 


Hs 193162 


Homo fiadens cDNA FUI 19R3 fte donn HF 


id 9 


437718 


AI9279BS 


Uc 1QR77Q 


ESTs 




412B26 


AL133396 


Hs.74621 


nrion nraifdn f n97-^) IC^mihtfUAAsAcrAi 

yu iwi 1 yt UUmJ 1 W**- ' ^JVJ \vl OUUICHrwalUlU 


1i 9 


432712 


A8016247 


H3^B8031 


cieroL/^S-riiKaiurfKA ffimnat PRf^^ riplt» 


14.2 


433332 


AI3S7347 


Ks.44898 






451533 


MM 004657 


Hfi 7(iS\Q 


ooTum oBpnvowoii losponso yjnoopnsuoyi 




424701 




H& 151988 


itrftftniwmpHif al »H nmtotn Irtnaea IfinsKA 
llHUiyBl i*aCliVo(cu piuiiBin KuioSo KlnaSo 


a 1 


446494 


AA463276 


Lie TfiftOrK 


WW iwiiKuii-iAnHwnuig wii8 


14 1 
14.1 


456508 




ns. 1^0409 


CO 1 5, WodNy wnuiar 10 Ar^UOoOd 1 dwhii 


l4.U 


432251 






pdycythcfnla rubra vers 1^ cell suffaco 


14 A 

14.Q 


426423 


NM_012446 


Hs 169833 


9H^K7*aUCUIIMw^ViWUIlHMl^ ^UiCIII 




442281 


N34742 




Homo Rflnipnc cHNA PI 11 \^Q9 fie f^tmwi M 


14 A 


442145 


AI022650 


Hs.81 17 


Hiuu^Miiiciacuny pruiBin ci>Dti>i 


14 Q 


418848 


AI820961 


Hs 193465 


ESTs 




447072 


061594 


Hs.17279 


h/rasviomtpin fiiilfnlranRforaRA 1 


139 


439516 


W76326 




ab:zd6Qd04 r1 Soares fetal heart NhHH19W 


139 


443967 


AW294013 


Hs.20(^2 


ESTs 


l<>.9 


424775 


AB014540 


Hs.1 53026 


SWAP-70 protein 


13.9 


411190 


AA306342 


Hs.69171 


protein kin3se O^ks 2 


13.9 


447384 


A1377221 


Hs.40528 


ESTs 


13.9 


444880 


AW1 18683 


H3.1541S0 


ESTs 


13.9 


433409 


AI278802 


Hs.25661 


ESTs 


13.9 


423201 


NM_000163 


Hs.125180 


9T0wlh hormone receptor 


13.9 


413724 


AA131466 


Hs93767 


Kvnnlhelietri rmlpbi PI J i^SBR 


14 0 


454219 


X75042 


Hs.44313 


T1VI UVlall IBIIMJIIKIHIUUICUDSIS Vlioi 


14 R 


429165 


AW009886 


Hs.1 18258 


prostate cancer associated protein 1 


13.8 


430598 


AK001764 


H3.247112 


hypotheScai protein FUI 0902 


13.8 


444172 


6E147740 


Hs,1 04558 


ESTfi lUbideratatv simDa^ to 138022 hvfW 


13.8 


416774 


AI005169 


Hs!28274 


Homos8Dfsnscl}fJA:FLJ22049fis clonaH 


13.8 


410310 


J02931 


Hs.62192 


ooaoulafim faefor III flhmmhnnkuttn 


14 ft 


428738 


N^L000380 


Hs.1 92603 


xeioderma pigmentosunii Domplenientafon q 


13.8 


400296 


AA305627 


Hs 139335 


rM r^MliUllly M13901U1| OULflalllllj V 1 1\ 


14 ft 
10.0 


407939 


W05608 


Hs.312679 


ESTs We^slmHartDA49019dvn8lnl» 


13.8 


452281 


T93500 


Hs.28792 


HonosaDtenscDNAFLIIKMI fis-donaPL 


13.7 


408622 


AA0S6060 


Ue 909577 


Homn «x^ns rnNA Fl t191fiAfk rimnUA 
imnu 9c)|jR}ii9 Guixn rw i£ loo ns, uwib win 


14 7 

ij.r 


426054 


A)948688 


Hs.266619 


ESTs 


147 


444636 


T96667 


Hs.1 7877 


ESTs 


14 7 


416292 


AA17g733 


49'^Qn 


iKnopnoiyngcoi Ccuwiiionia suscupuDniqf 


14 7 


442315 


AA173992 


Hs.7956 


ESTe Moderatelv filmaar In 7KJQ1 MttMAN? 


14 7 
10./ 


452260 




u« 9079A 


rvso?! iiiciTiijei rvsa oncogsna idiTiuy 


7 
10.1 


428966 


AF059214 


Hs 194687 




14 7 
10./ 


450316 


WB4446 


Hs.226434 




14 7 
10. 1 


419839 


U24577 


Hs.93304 


nhosDhflflrtasn A2 nmitn Vfl fnbrtAlALnr 


13.7 


443634 


H73972 


Hs.134460 


F<!Tn 
CO lo 


14 7 
lO.I 


449474 


AA019344 


Mr 9A<v<t 


uoiquiurhacuvcHinQ buzjuto ci ^idsfi an 


44 7 


422278 


AF072873 


Hs.1 14218 


frizzled (Drosophiia) ho^TK409 6 


13.7 


445817 


NM.003642 


Hs.1 3340 


Ill^lUliV tMdJ(ll<M 10161090 1 


14 7 


434629 


AA^9081 


Hs.4029 


alloma-afTiQOfied saaiifinca-41 


13.8 


416795 


AI497778 


Hs.20509 




13.6 


410001 


AB041036 


Hs.67771 


koftlWmin 1 1 


14 A 
10.D 


452242 


R509S6 


Hs.159993 




■ 141% 
IAD 


413231 


D87461 


Hs.75244 


Ort OJikft 2 


14fi 
I0.« 


404641 






nliVi^UA taQO Jwiriv bcipiolla pilUSpnoyHJCuiflUia 


14 R 
IJ.g 


428648 


AF052728 


Hs. 188021 


fxitasstuni vottaaa-aatsd channel Atihfami 


13.6 


414279 




ru>> 1 w tuu* 


GCN5 (Qeneral control of 8inino>8cid synt 


14 R 
10.0 


411573 


AB029000 


Hs.70823 


KIAA1077 Dmtein 


IOaI 


417632 


R20855 


Hs!5422 


QlycopfOtBbi M6B 


13.5 


431467 


N71631 


Me 9lK^QA 


nomo sapiens mKNA, cuna ui\rcp4<)4cU5^ [t 


13.5 


450313 




Ue 119R^^ 


Daroertiieui synaruine « 


13.5 


445472 


nOUUOOOl 




nomo sapiens mnrV\ tor iwvvo^ j gene, par 


13.5 


419264 


AA877104 




PVtit WacAIw efanlbr In Al 1 IR UIIMAMIIII 


14 ft 
1il.O 


416863 


AK001100 


Hs.4ie90 


^fesfmpolHn 3 


13.5 


452277 


AL049013 


H3^783 


KIAA1223 protein 


13.5 


447896 


AI436124 


Hs.294flS9 


Homo sapiens cDNA FLJ13384 lis, done PL 


13.5 


447082 


T85314 


Hs.42644 


th)oredo]dn-tika 


13.5 


451468 


AW503398 


Hs.293663 


ESTs. Moderately similar to 138022 hypot 


13.4 


415443 


T07353 


Hs.7948 


ESTs 


13.4 


427212 


AW293849 


Hs.58279 


ESTs, Weakly similar to ALU7.HUIMAN ALU S 


13.4 


442113 


BE622187 


Hs,99670 


ESTs. Wealdy sinto to 138022 hypolheti 


13.3 


433517 


AW022133 


Hs.189838 


ESTs 


13.3 


430829 


AW451999 


Ks.194024 


ESTs 


13.3 


453111 


AB014598 


Hs^1720 


hephaesUn 


13.3 


435839 


AF249744 


Hs^951 


Rho guardna nudeolide exchange lactor ( 


13.3 


429113 


P28235 


Hs.ig6384 


pro6taolandii>«ndopefQxlde synthase 2 (p 


13.3 
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422728 


AW937826 


Ks.103262 


427288 


AW732802 


Ks.2132 


437323 


AA371145 


Hs.226627 


421878 


AA299652 


H5.1 11496 


433037 


NK^014158 


Ks.279938 


407938 


AA905097 


HS.850S0 


400860 






41t031 


W37943 


Hs.34692 


438797 


AA731491 ■ 


Hs.336454 


409277 


T05556 


Hs.156880 


434036 


AI659131 


Hs.197733 


448520 


AB002367 


HS.2135S 


450755 


AA010964 


Ks.159464 


450649 


NM.001429 


Hs.25272 


408495 


W68796 


Hs.237731 


429732 


U20158 


Hs.2468 


428839 


AI767756 


Hs.62302 


416987 


D86957 


Hs.80712 


453006 


A)362575 


Hs.167133 


421750 


AK000768 


Ks.107872 


41S752 


BE314524 


Hs.78776 


411529 


AA430348 


Hs.317596 


417318 


AW953937 


Hs.12891 


410763 


AF279145 


Hs.8966 


426034 


A)2769B9 


Hs.56123 


447966 


AA340605 


HS.105B87 


428218 


AA424266 


Hs.123642 


409799 


011928 


HSJ6845 


448779 


BE042877 


Hs.177135 


448944 


AB014605 


Hs.2259g 


417315 


AI080042 


Hs.336901 


429697 


AW296451 


Hs.24605 


432674 


AA641092 


Hs.257339 


445309 


AL157474 


Hs.12504 


451099 


R52795 


Hs.25954 


424806 


AA382523 


Hs.105689 


442343 


AA992480 


Hs.129874 


432244 


A1669973 


Hs.200574 


459513 


AI032946 




452512 


AW363486 


Hs.337635 


415079 


R43179 


H5.22895 


422365 


AF035537 


Hs.1 15521 


411067 


A)681006 


Hs.71721 


442501 


AA315267 


H5.23128 


434466 


AB037829 


Hs.3862 


437052 


AA861697 


Hs.1 20591 


433234 


AB040926 


Hs.65366 


453830 


AA534296 


Hs^0953 


409995 


AW960597 


Hs.30164 


414290 


AI566801 


Hs.71721 


417248 


AA32944g 


Hs^47302 


418624 


A1734080 


Hs.104211 


412854 


AU)93480 


Hs.29263 


450253 


AL133047 


Hs^4715 


446619 


AU076643 


Hs.313 


427078 


At676062 


Hs.1 11902 


420838 


AW118210 


Hs.5244 


449784 


AW161319 


Hs.12gi5 


431325 


AW026751 


Hs.57g4 


415276 


U88666 


Hs.78353 


426110 


NM.002913 


Hs.166563 


400301 


X03635 


Hs.1657 


434926 


BES432G9 


Hs.50252 


426316 


NM.002430 


Hs.268515 


420345 


AW295230 


Hs^5231 


432205 


AI806583 


Hs.1 25291 


451893 


AW192083 


Hs.290855 


435266 


AK001942 


Hs.4863 


427003 


U19487 


Hs.20g0 


404642 






414241 


AA425085 


HS.4M7 


429716 


R25685 


Hs.211933 


420871 


AA702972 


Hs.65300 


448072 


A1459306 


H5.24908 


441269 


AW015206 


Hs.178784 


427761 


AM12205 


Hs.140996 


425322 


U63830 


Hs.155637 


417733 


Atj048678 


Hs.82503 


436521 


AW203986 


H3.213003 


406920 


AL120071 


HS.4899B 


446332 


AK001635 


Hs.14838 


453994 


BE180964 


Hs.1655gQ 


420397 


NM.00701B 


Ks.97437 



ESTs. Weakly sinflar to ZN91JHUMAN ZINC 13.3 

epidermal growth factor mptor pathway 13.3 

lepQn receptor 13.2 

Homo sapiens cONA FU11643 lis, done HE 13.2 

HSPC067 protein 13.2 

phospholarnban 13.2 

Target Exon 13.2 

K1AA1323 protein 13.2 

hypothefical protein MGC14e79 13.2 

ESTs ^32 

hypothetical protein MGC2849 13.2 

dout)lecortin and CaM kinases 1 13.2 

ESTs 13.1 

ElAUflding protein p300 13.1 

ESTs 13.1 

lymphocyte cytosolic protein 2 (SH2doma 13.1 

Homo sapiens CONAFU14814 lis, done NT 13.1 

IQAA0202 protein 13.0 

ESTs 13.0 

hypothetical prot^aJ20761 13.0 

putaOvetransmemtwane protein 13.0 

Homo sapiens cDNA aj12927 lis. done HT 13.0 

ESTs 13.0 

hypothetical protein iaJ21 776 1 Z9 

Homo sapiens cDNA FU13443 fis, done PI 12.9 

ESTs.WeaIdyslmItertoHomdogofralZ 12.9 

EphA3 12.9 

phosphoserine phosphatase-Glce 12.9 

ESTs 12.8 

alrophin-l Interacting protein 1; aciivl 12.8 

rilxtsomaJ protein S24 1Z8 

ESTs 12.8 

ESTs, Weaidysimllar Id 138022 hypofhetl 128 

lately orlholog or mouse Afttadia 12.8 

interteukin 13 receptor, alpha 2 12.8 

MSTP031 protein 1Z7 

ESTs 117 

ESTs 12.7 

gb:ox06g09.s1 SoaresJetelJlver_spleen_ 1Z7 

ESTs 12.7 

hypolhelicd protein FU23548 12.7 

REV3 (yeast homoiogHite. catalytic sut) 12.7 

ESTs 12.7 

ESTs 12.7 

regulator of nonsense transcripts 2: DKF 12.7 

ESTs 1Z7 

KIAA1495 protein 116 

ESTs 1Z6 

ESTs 1Z6 

ESTs 1Z6 

twisted gastnitafnn 1Z6 

ESTs 1Z6 

hypothetical protein FUllSge 1Z6 
Homo saptens mRNA; cDNA DKFZp434D0215 (f 1Z6 

secreted phosphoprotein 1 (osteoponQcu 1Z5 

ESTs 1Z5 

ESTs 12.5 

ESTs 12.5 

ESTs, WeaMy similar to 2109260A B cell 1Z5 

SFRS protein kinase 2 12.5 

replication factor C {activator 1) 1 (14 1Z5 

estrogen receptor 1 1Z5 

mitochondital ritxisomal protein 132 12.5 

meningioma (disnipted in balanced transl 12.5 

ESTs 1Z4 

ESTs 12.4 
ESTs ■ 12.4 

hypothsQcal protein OKFZp566At524 1Z4 

prostaglandin E receptor 2 (sut)typeEP2) 1Z4 
NM_021965^Homo saptens pho^iho^lucQfnite 12.4 

Sarootemmai-associated protein 12.4 

coflagen, type XID, alpha 1 12.4 

ESTs 1Z4 

ESTs 12.4 

ESTs 1Z3 

ESTs 1Z3 

protein kinase. DN/^«IIvated.catalytie 1Z3 

H.saptens mRNA tor 3tJTR of unknown prat 1Z3 

ESTs 1Z3 

Qironecfin teudne rich transmembrBne p 12^ 

hypothetical protein FU10773 1Z2 

rfbosomaiprotdhS13 1Z2 

centrosomd protein 1 1Z2 
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447471 AH)39843 Hs.18676 8piDU^(DiDSophSa}hofnobg2 12.2 

442562 BE379584 Hs^789 doOchyUI^hosphooBgosaochar^^ 112 

400880 NM.0006ir:Komo6apiensCO59anQgenp18 112 

_ 425920 ALJ049977 Hs.162209 daudinS \U 

0 419743 AW408762 H5.5957 Honx) sapiens dons 2441 6 mRNA sequence 12.2 

414222 AL135173 H&878 sorttlol dehydrogenase 111 

427051 BE178110 K5.173374 Homo sapiens cONAFUIOSOOfis. done NT 12.1 

452994 AW962597 Hs.3130S KIAA1547protdn 111 

425154 NM.001851 Hs.154850 conagen. type IX, alpha 1 111 

10 437124 AA554458 Ks.197751 KIAA0666protabi 111 

411450 H49619 H9.127301 ESTs 111 

421559 NM.014720 Hs.10S751 S(e2afelaledserlneAhreonIne kinase 111 

449625 NM.014253 H5.23796 odz (odd Oztown, Drosophfla) homolog 1 111 

419459 AW291128 Hs^78422 DKFZP586G1122proldD 111 

15 426252 BE176980 Hs^917 ESTs 111 

414171 AA360328 Hs.865 RAP1^ member of RAS oncogene tartly 111 

436169 AA888311 Hs.17602 Homo sapiens cONA RJ12381 Gs, done MA 110 

435047 AA454985 Ks.54973 cadherin^ protein VR20 12.0 

417625 U59305 Hs.44708 Ser-Thr protein Wnase related to the my 12,0 

20 439820 AL360204 Hs,283853 Homo sa^^nsmRNAfuHenglh insert cDN 110 

443646 A)085198 Ks.164226 ESTs 110 

434874 N62448 Hs.293970 melhylmatonate^semialdehydedehydTogenas 110 

443837 A)934625 Hs.9684 spindle pole body protein 110 

451640 AA195601 Hs.26771 Human DMA sequence torn done 747H23 on 110 

25 424232 AB015982 Hs.l43460 protein kinase C. nu 11.9 

414555 N96569 Hs.76422 phosphol]paseA2. group IIA(platetels. 11.9 

446874 AW968304 Hs.56156 ESTs 11.9 

454119 BE549773 Hs.40510 uncoupfing protein 4 11.9 

436746 AA730045 Hs.187866 ESTs 11.9 

30 432359 AA076049 Ks.274415 Homo sapiens cDNAFU10229 lis. done HE 11.9 

416508 R39769 Hs.2O6088 ESTs, Moderetely similar to ALU8_HUMAN A 11.8 

444758 AL044878 Hs.11699 34iydrDxy-3-methylglutaryK)oenzymeAre 11.6 

413991 m4725 Hs.42683 ESTs 11.8 

431645 AF078849 Hs.266483 dynein light chain-A 11.8 

35 421406 AF179897 H5.104105 Meis (mouse) homolog 2 11.8 

408826 AF216077 Hs.48376 Homo sapiens done HB-2mRNA sequence 11.8 

424903 T26477 Hs.22883 ESTs. Weakly sinular to 138022 hypotheQ 11.8 

4129n AA125910 HSw191461 ESTs 11.8 

. 426981 AL044675 Hs.173081 KIAA0530 protein 11.8 

40 410853 H04588 Hs.30469 ESTs 11.7 

444670 H58373 Hs.332938 hypothetical protein MGC5370 11.7 

444381 BE387335 Hs.283713 ESTs. WeaMy similar to S64054hypolhetl 11.7 

434398 AA121098 Hs.3838 sanmvlndudbte Unase 11.7 

438032 BE045624 Hs.152992 ESTs 11.7 

45 433212 BE218049 Hs.121820 ESTs 11.6 

424330 AW073953 Hs.333396 Homo sapiens cDMAFU13596fis,ctone PL 11.6 

452679 Z42387 Hs.838d3 transmembrane, prostate androgen Induced 11.6 

430929 AA489166 Hs.156933 ESTs 11.6 

423782 A1472209 Hs.323117 ESTs 11.6 

50 425770 NKt014363 Hs.159492 spastic ataxia Of (Mevdx-Saguenayls 11.6 

437108 AA434054 Hs.B0624 hypothetical protein M6(2560 11.6 

430200 BE613337 H5.234896 geminin 11.5 

453857 AL080235 HS.35B61 DKFZP585E1 621 protein 11.5 

^_ 411096 U80034 Hs.68583 mitochondria Intermediats peptidase 11.5 

55 422805 AA436989 Hs.121017 H2Ahistonefamily« member A 11.5 

422538 NKL006441 Ks.118131 5. lO^neOtenyttetrabydroldatB synthetase 11.5 

412677 AW029608 Hs.17384 ESTs 115 

421896 N62293 Hs.45107 ESTs 11.5 

453930 AA419466 Hs.35727 hypothetical protdn FU10903 11.5 

60 443123 AA094538 H8.27280B putative tansciip8on regulation nudea 11.5 

415293 R49462 Hs.106541 ESTs 11.5 

443161 AI038316 gb:ox48d)8.x1 So3re$.totaLIetus.Nb2HF8. 11.5 

420185 AL044056 Hs.l58047 ESTs 11.5 

^- 445527 W39694 HS.832B8 ESTs. Weakly similar to S14747sphlngomy 11.5 
05 428829 R140S0 H5.194061 Homo sapiens niRNA;cDNADKFZpS66B213(tr 11.5 
416852 AF283776 H$.802B5 Homo sapiens mRNA;d}NADKFZp586C1723(f 11.5 

449919 AI674685 Hs.200141 ESTs 11.5 

414844 AA296874 K5.77494 deoxyguanosine kinase 11.5 

^ 449543 AF070632 Hs.23729 Homo sapiens done 24405 mRNA sequence 11.4 

70 429784 M89796 Hs.30 membraft»spanning44omdns,8ubtanilyA 11.4 

407766 AA587538 K$.38972 tetraspani 11.4 

414407 AA147026 Hs.76704 ESTs 11.4 

441247 AW118681 Hs.l2805l Homo sapiens thymic stnmallymphopoieti 11.4 

456804 AI421645 Hs.139851 caveoIln2 11.4 

75 422546 AB007969 H&.301478 KiAA0500 protein 11.4 

417350 U50928 Ks.82001 polycystn kUney disease 2 (autosomal d 11.4 

417683 AWS66008 H5.239154 oikyrln repeal lantily A (RFXATMca), 11.4 

424800 AL035588 Hs.153203 MyoDtamily inhibitor 11.4 

433Sn AWD07080 Hs.8817 ESTs 11.4 

80 453935 AI633770 Ks.42572 ESTs 11.4 

415060 AJ223810 Hs.43213 ESTe. Weakly sinfiar to lEFS.HUMAN TRANS 11.4 

425710 AF030880 Hs.159275 sdute cantor (amOy. member 4 11.3 

448766 A1473827 Hs.31793 ESTs 1U 
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444151 


AW972917 


Ks.1^49 


a^a-meOiylacyl-CQA racemase 


11.3 


410108 


AA0816S9 


Hs.318775 


OSSP-reialed protein 6 


1U 


418450 


R84397 


Hs.193651 


£ST$. WeaUy similar to allematively $p 


11.3 


415457 


AW081710 


Hs.7369 


ESTs, Viteakly shnOar to AIU1.HUMAN ALU S 


1U 


452110 


T47667 


Hs.28005 


Homo sapiens cDNA FIJ11309 lis. done PL 


11.3 


414212 


M136569 


Hs.10848 


KlAA0187genepreducl 


1U 


401519 






C15000476*:9ll12737279trerp(P.012163.1| 


1U 


458509 


AA654650 


Hs.2829(K 


ESTs 


117 


416913 


AW934714 




gb:RC1-OT0001-031299^11<a11 DT0001 Homo 


11.2 


452784 


BE463857 


Hs. 151258 


hypothetical protein FLJ21062 


lU 


436758 


AW977167 


Hs.155272 


ESTs 


11.2 


409401 


A!201895 


Hs.181309 


prDteasome (piosomBi maciop^n) subunili 


115 


434715 


6E005346 


Hs.1 16410 


ESTs 


11.2 


427283 


All 19796 


H5.1 74185 


ectonudeofide pyrophosphst3se/|^c»phod] 


11.2 


428342 


AJ739168 


Hs.1 31798 


Homo salens cONA FU 1 3458 fis, clone PL 


11.2 


419465 


AW500239 


Hs.21187 


Homo salens cONA: FU23Q68fis.cloneL 


11.2 


430519 


AF1 29534 


H5.49210 


F>box or^ protein 4 


11.2 


418838 


AW385224 


Hs.35198 


fictonudsoHda nwroohnsiihatafiBMinsnharli 


11.1 


429163 


AAB84766 




ab2m20a10.s1 Soobs NFL T 6BC SI Norm s 


11.1 


434158 


T66534 


Ks.14372 


ESTs 


111 


414341 


080004 


KS.7S909 


KlAA0182orDte!n 


11.1 


414650 


AA150435 


Hs.72063 


ESTs 


1l!l 


448475 


BE613134 


Hs.247474 


hypothetical protein FLI21032 


111 


437575 


AW954355 


Hs.36529 


hypothetical protein MGC1 1242 


111 


423566 


AW976434 


Ks.3623 


hypotheficad protein FU 1 1 220 


111 


42S289 


AW139342 


Hs.155530 


trUerfeion. ganvi»inducible protein 16 


111 


452598 


AI831S94 


Hs.68647 


ESTs, Weaidy shnHar to AUJ7J{UMAN ALU S 


111 


417433 


BE270266 


Hs.82128 


6T4 oncofetal tnophoblast ^{yooprolein 


11.0 


438379 


N23018 


Hs.1 71391 


C-terminal Idndir^ protean 2 


110 


433230 


AW136134 


Hs,220277 


ESTs 


110 


412622 


AW664708 


Hs.171959 


ESTs 


110 


450353 


AI244661 


Hs.103298 


ESTs, WeaMy simiter to S65657 aipha-IC- 


110 


434349 


NM_015678 


Hs.3821 


neurobeachin 


11,0 


430261 


AA305127 


Hs.237225 


hypothetica} protein KT023 


11.0 


420544 


AA677577 


Hs.98732 


Homo sapiens Chromosome 16 BAG done CIT 


110 


424099 


AF071202 


Hs.139336 


ATP-btraftng cassette. suthfemBy C (CRR 


110 


442082 


R41823 


Ms.7413 


ESTb 


110 


422890 


Z43784 


Hs.75893 


ankyrin 3, node of Ranvler (ankyrin G) 


110 


437162 


AW005505 


Hs.5464 


thyidd hormone recefrior ooacSvalBng pr 


110 


450244 


AA007534 


Hs.1 25062 


ESTs 


11.0 


417169 


R13550 


Hs.246773 


ESTs 


110 


421481 


AW391972 


Hs.1 04696 


K1AA1324 protein 


10.9 


431576 


M76665 


Hs.275215 


hydroxystsroid (ll^beta) dehydroQenase 1 


10.9 


408988 


AL119844 


Hs.49476 


Homo sapiens done TUAS C>i-diKhat regl 


10J 


439706 


AW872527 


Hs.59761 


ESTs, We^ slmliar to DAP1_HUI^ DEATH 


10.9 


435133 


AJ010462 


Hs.31412 


Homo sapiens cDNA FUl 1422 fe, done HE 


10.9 


433656 


L03678 


Hs.156110 


immunoglobulin kappa constant 


10.9 


428364 


AA426565 


Hs.311103 


ESTs ModerMr similar to ALU1 HUMANA 


10.9 


441540 


C01367 


Hs.1 271 28 


ESTs 


10.9 


431154 


AW971228 


Hs!290259 


ESTs Weaklv similar to 138022 hvDotheti 


10.9 


416777 


AF146760 


Hs.79844 


DKFZP564M1 416 protein 


10.9 


439556 


A)623752 


Hs.163803 


ESTs 


10.9 


428280 


K05541 


Hs.183428 


asRxispan (Kras onooQeno-assodated Qene 


lOB 


453942 


AW190920 


Hs.19928 


hypotheficd prtfdn SP329 


10.8 


447982 


H22953 


Hs.137551 


ESTs 


lois 


422779 


AA317036 


Hs.1 18787 


transformino arowth factor beta4nduced 


10.8 


447595 


AW379130 


Hs.18953 


phosphodiesterase 9A 


10«8 


427115 


AW972B53 


Hs.1 12237 


ESTs 


10.8 


448019 


AW947164 


Hs.195641 


ESTs. Moddrately sImQar to 138022 hypd 


10.6 


419326 


W94915 


Hs.42419 


ESTs 


lOJ 


435163 


AA668884 


Hs.1 91 55 


ESTs 


10.8 


417578 


T91443 


Hs.193983 


ESTs 




407930 


AA045847 


Hs.188361 


Homo Buttons cDNA FLI12807 lis, done NT 


10.8 


450206 


AI796450 


Hs.201600 


ESTs 


10.8 


426413 


AA377823 




gb:EST9Q605 Synovial sarcoma Homo sapian 


10.8 


421977 


W94197 


Hs.110165 


ribosomai protebi L26 honubg 


10,8 


437967 


BE277414 


Hs.5947 


md transfdmlng oncoQene (derived from 


10.8 


408243 


Y00787 


Hs.624 


foterleuMn 8 


10.8 


439492 


AF086310 


Hs.103159 


ESTs 


10J 


413492 


087470 


Hs.75400 


K1AA0280 protein 


10.8 


436962 


AW377314 


Hs.5364 


DKFZP564{052 protein 


10.8 


435572 


AW976339 


HSJ239828 


ESTs, Weakly similar to GAG2.HUMAN RETRO 


10J 


422583 


AA410506 


Hs.27973 


KiAA0874 protein 


m 


417665 


AW852858 


H5.22882 


ESTs 


10.7 


433285 


AW975944 


H3.237396 


ESTs 


10.7 


419693 


AA133749 


Hs.301350 


FXYD domdrKCM^rung ion transport reg 


ia7 


424878 


H57111 


Hs.221132 


ESTs 


10.7 


449659 


R&0031 


Hs.1 98899 


eukaryotic translation IniSation factor 


10.7 


415910 


U20350 


H5.7B913 


chemokine (&X3-C) receptor 1 


10.7 


433556 


W56321 


Hs.1 11460 


cddumfcabnodulln^pendenl protein Un 


10.7 


452241 


A1.050204 


Hs.28540 


Homo sapiens mRNA; cONA DKFZp5B6F1223 0 


10.7 


450330 


AW500775 


Hs.24817 


hypothefical protein FU20136 


10.7 


430589 


AJ002744 


KS.24631S 


UDP-N^e^Mpha-D^iatodosarineipdyp 


10.7 
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433923 


AI823453 


Hs.146625 


418196 


AI745649 


Hs.26549 


416155 


AI807264 


Hs.205442 


430057 


AW450303 


Hs2534 


429350 


AI754634 


H$.131987 


418601 


AA279490 


Hs.86368 


437267 


AW511443 


Hs^UO 


432169 


Y00971 


H5.2910 


421982 


AF206019 


H$.1 10347 


414085 


AA114016 


Hs.75746 


416662 


AI801098 


Hs.151500 


449685 


AW296669 


Hs.66095 


441111 


Aie06867 


H9.126594 


436671 


AW137159 


Hs.146151 


447974 


R76866 




417916 


NI\iL006416 


Hs.82921 


424562 


Ai420859 


H$.1S0557 


408331 


NM.007240 


Hs.44229 


443015 


R33261 


Hs.6614 


439584 


AA838114 


Hs.221612 


426298 


AW9650S8 


Hs.1 11583 


420929 


AI694143 


Hs.296251 


418318 


U47732 


HS.B4072 


414565 


AA502972 


Hs.1 83390 


418329 


AW247430 


Hs.84152 


424433 


H04607 


Ks.9218 


420230 


AL034344 


Ks.284186 


419441 


AW023731 


Hs.274368 


426011 


AW996096 


Hs.58924 


451900 


AB023199 


Hs.27207 


424692 


AA429834 


Hs.1 51791 


439999 


AA1 15811 


Hs.6838 


424368 


AB037766 


Hs.146085 


402076 






413886 


AW958264 


Hs.103832 


445041 


T641B3 


Hs.282982 


428927 


AA441837 


Hs.90250 


414831 


M31158 


Hs.77439 


407902 


AL117474 


Hs.41181 


426429 


X73114 


Hs.169849 


452055 


AI377431 


Hs.141693 


436913 


AI380429 


H8.172445 


443684 


AI681307 


Hs.55098 


429281 


AA830656 


Hs.29808 


421040 


AA715026 


Hs.135280 


425277 


NiyL001241 


Hs,155478 


418564 


AA631143 


H8.278695 


440099 


AL080058 


Hs.6909 


434958 


T99949 


Hs.303428 


419972 


AL041465 


Hs.182982 


416182 


Nm_004354 


H5.79069 


418365 


AW014345 


Hs.161690 


452268 


AI358570 


Hs.123933 


430361 


A1033965 


Hs.239926 


446716 


AA435575 


Hs.16602 


419544 


AI909154 




433023 


AW864793 


Ks.87409 


409151 


AA306105 


Hs.50785 


449115 


AW959952 


Hs.37528 


445715 


AB0129S6 


Hs.13137 


443228 


W24781 


Hs.293798 


424051 


AL1 10203 


Hs.138411 


453496 


AA442103 


Hs.33084 


452741 


BE392914 


Hs.30503 


428728 


NM.016625 


Hs.191381 


42390S 


AW579960 


Hs.135150 


413328 


Y15723 


Hs.75295 


453020 


AL162039 


Hs^1422 


411359 


H8e088 


Hs.22635 


420000 


AB036063 


Ks.94262 


412282 


W%406 


Hs.295923 


419875 


AA853410 


Ks.33557 


414422 


AA147224 


K8.337232 


426581 


AB040956 


Hs.135890 


424649 


BE242035 


Hs.151461 


407317 


AI204033 


Hs.271461 


440426 


AI15S800 


Hs.7181 


43349S 


AW373784 


H8.71 


423250 


8E061916 


Hs.125849 


438459 


T49300 


Hs.35304 


451593 


AF151879 


Hs.26706 


435106 


AA100847 


Hs.ig3380 


438157 


AW137011 


Hs.49576 



ESTs 

KlAA1708pn}tein 

ESTs. Weakly Mar to T34036 hypotheO 

bone morphogenefo protein receptor, lyp 

ESTs 

cabnegin 

ESTs 

ptosptoribosyl pyrophosphate synthstdse 

REV1 (yeast homQiog)-GkQ 

aldehyde dehydrogenase 1 tamSy, member 

ESTs 

ESTs 

ESTs 

ESTs 

gb:yi64b01s1 Soares placenta Nb2HP Homo 

solute carder family 35 (CMP-sIanc ad 

tjasic transcdpflon element \Mng prel 

dual spedrid^ phosphatase 12 

ESTs, Weakly simBar to A43932 mucin 2 p 

ESTs 

ESTs. W^ similar to 138022 hypothec 
programmed cell death 4 
Iransmemtirane 4 superfamfly memt)er 3 
hypolheltcal protein FU 13590 
cystathionlne-beta-synthase 
ESTs 

fofkheadboxCI 
MSTP032 protein 

ESTs. Weakly similar to JC5594 {erfcy gen 

KIAA0982 protein 

KIAA0092oeneprQdwt 

ras homotog gene family, member E 

KIAA1345 protein 

C5002020':g[I1082876|plriIS55467 Iropomy 
similar to yeast Upf3, variant B 
solute canler 
ESTs 

protein kinase. cAMP-dependent regut^ 

Homo sapiens mRNA; cONA DKFZp727C191 (fr 

myosin-bindino protein C, stow-type 

hypothetical proteih MGC10858 

ESTs 

ESTs 

Homo salens cmk FU21122 lis. done C 

ESTs 

cycllnT2 

Homo sapiens prosteln ml^. complete cds 

DKFZP564G202 protein 

Homo 8a;»ens cDNA FU14832 fis. done OV 

golgin-67 

cyclinG2 

ESTs 

ESTs. Weakly slirilar to ZN91 J1UMAN ZINC 

ste(d-C4^thyl axUase^ 

ESTs 

gb:QV<eT20(H)10499W 8T200 Homo s^ 
Ihromboepondin 1 

SEC2^ veslde trafiiddno protein (8. c 
ESTs. Weakly sitfiilar to AFD90944 1 PRO06 



K1AA1710 protein 
Homo sapiens mRNA; cONA DKFZp586J1922 (f 
sdute canier family 2 (fiadliteted gh) 
Homo sapiens cDNA FU1 1 344 lis. done PL 
hypothetk:al protein 

lung typ8^ cell membrane-assodated gly 
guanytete cydase 1 . sduble. alpha 3 
Homo saptens mRNA; d)NA DKFZp434M229 [fr 
ESTs 

pSS^dudble ribonudeoSde reductase s 

seven h absentia (OrosophBa} homdog 1 

proenkephafln 

HomeoboxA13 

KIAA1523 protein 

embiyonic ectodemi devetopment 

ESTs, Weakly simBar to 138022 hypothetl 

Homo sapiens cDNA FU 13663 fis. done PL 

dpha-^glyooproteinl.zbic 

chromosome 8 open reading frame 2 

Homo sapiens cONA FU13655 fis. done PI 

CGI-121 protdn 

ESTs, Highly similar to AF174600 1 F<box 
ESTs 
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443181 


AI039201 


H5.283316 


449125 


AI671439 


Hs.196029 


411479 


AW&4B047 




446553 


AB021179 


Hs.15299 


418278 


Ai088489 


Hs.83937 


419791 


AI579909 


Hs.105104 


418866 


T65754 




413753 


U17760 


Hs.75517 


445564 


ABQ289S7 


H$.12896 


435021 


AA922192 


Hs.54709 


457498 


AI732230 


Hs.191737 


425628 


Nfi4.004476 


Hs.1915 


419589 


AW973708 


Hsi0192S 


446162 


AI631319 


H163841 


414523 


AU076633 


Hs.76353 


427393 


AB029018 


Hs,l77635 


408741 


M73720 


Ks.646 


437743 


AI383497 


Hs.131811 


414888 


A1039185 


Hs,77558 


448172 


N75276 


Hs.135904 


421863 


AI952577 


Hs,108972 


444618 


AV653785 


Hs.173334 


413450 


Z99716 


Hs.75372 


413431 


AW246428 


HS.7S355 


410150 


AW382942 


Hs.6774 


444100 


AA383343 


Hs^116 


448148 


N1VL016578 


H$.205O9 


443476 


AW068594 


Hs.133878 


433345 


A1681545 


Ks.152982 


423661 


AB023215 


Hs.131525 


450205 


A1219746 


H5.11356 


408374 


AW025430 


Hs.155591 


446035 


NM_0Q6558 


Ks.13565 


432589 


AL135725 


Hs.131708 


435604 


AA625279 


Hs^892 


411997 


AW673478 


Hs.108323 


419672 


AA465113 


H5.23853 


444564 


AI167877 


Hs.143716 


426931 


N^^003416 


Hs.2076 


451424 


AI862026 


Hs.302810 


433730 


AK002135 


Hs^2 


431616 


AA508552 


Hs.ig5839 


427359 


AWQ20782 


Hs.79881 


425683 


AB037813 


Hs.159200 


442760 


BE07S297 


H5.6614 


418884 


AA230228 


Hsi9197 


444821 


AA053564 


Hs.12040 


451989 


AF169797 


Hs.27413 


439731 


AI953135 


H3.45140 


453127 


AI696671 


Hs.294110 


435706 


W31254 


Hs.7045 


410748 


8E383616 


Hs.12532 


418250 


U29928 


Hs.e3918 


408832 


AWD85690 


Hs.63428 


418339 


AA639902 


Hs.104215 


452387 


U71207 


H5.29279 


432887 


AI926047 


Hs.162859 


430291 


AV660345 


Ks.238126 


440370 


AA884000 


Hs.8173 


412350 


AI659306 


Hs.73826 


428825 


AI084336 


H3.12a783 


441054 


AA913591 


Hs.126480 


415675 


AA894876 


Hs^ 


453078 


AF053551 


Hs.31584 


409203 


AA7B0473 


Hs.687 


446720 


AI439136 


Hs.140546 


418475 


AI858732 


Hs.30443 


417708 


N74392 


HsJi0495 


407870 


AB032990 


Hs.40719 


420568 


F09247 


Hs.247735 


418858 


AW961605 


Hs,21145 


453548 


AL079983 


Hs.116774 


427308 


026067 


Hs.174905 


434579 


T55958 




434826 


AF155661 


H8.22265 


427528 


AU077143 


Hs.179565 


413174 


AA7235&4 


H5.191343 


443250 


AI041530 


Ks.132107 


451957 


At796320 


Hs.10299 


451408 


AI694320 


Hs.6295 


422299 


AK000181 


Hs.1 14556 


427635 


BE397988 


Hs.179982 


416533 


BE244053 


Hs.79382 



ESTs 

Homo sapiens mRNA lor KIAA1657 protein. 

0b:IL3-CTO214.2912994)52-A12a0214 Homo 

HMBA^udbte 

hypotheOcai protdn 

ESTs 

gb^cl 1c07^1 Strstagene lung (937210) H 

lannintn. beta 3 (niceln {125kD), kalinin 

KIAA1034 protein 

ESTs 

ESTs 

foWe hydrolasa (prostato^pedfic memb 

Homo sapiens cDNA FU13446 fis, done PL 

hypothaficat protein 0KFZp434E2318 

serine (or cystdne) proteinese Inhibito 

KIAA1095proten 

carboxypepHdase A3 (mast ceil) 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

thyroid hormone receptor tnteiaclDr 7 

ESTs 

Homo sapiens mRNA; cDNA DKFZf>434P228 «r 
Ea-RELATED RNA POLYiWlERASE II. ELONGATI 
i^tylgalactosamlnidase, alpha- 
utiiquian-coniugallng enzyme E2N (homolo 
ESTs 

C0C14 (ceO division cycle 14. S. cerevi 

HBVpX associated protein^ 

ESTs. Weakly slmflar to YCD1J<UMAN HYf^T 

iiypoihetlcal protein FU13117 

Homo sapiens mRNA; cONA 0KF2p434E1 99 (fr 

ESTs 

Ibilthead Ixjx F1 

SameWflce phospiiotyrosine protein. T-ST 
ESTs 

uncharaderized bom manow protein BM04 
u!}Iquil&H»i4iig8tlf)geKEyme E2E 2 Qwm 
|STs, Weakly similar lo A34615 profilagg 

zinc finger protein 7 (KOX 4. done HF.1 
Novel human gene mapping todiomosome 20 
hypotlietical protein FU11273 
ESTs. Weakly similar lo 138022 hypotheU 
Homo sapiens cDNA: FLJ23006 fis. done L 
hypottwCcal protein DKFZp762K222 
ESTs, Weakly similar to A43932 mudn 2 p 
ESTs 

STE2arike kinase 

adaptor protein containtng pH doman. PT 
hypothetk^I protein FU14084 
ESTs 

G1004 protein 

chromosome 1 open reading frame 21 
adenosine monophosptots deaminase (isolb 
ESTs. Weakly similar to Z195.HUMAN ZINC 
ESTs. Moderately sindar to SPCNJiUMAN S 
eyes absent (Dn»ophaa) tiomdog 2 
ESTs 

CGI-49protebi 

hypotheitoal protein FU10803 
protein tyroshe phospliatase. non-fecept 
ESTs. WdaUy similar to 138022 tiypotheO 
ESTs 

protein pliosphatase IB (formeriy 2C), ma 
meta]dn2 

cytodirome P450. subfamily IVB. poiypept 
ESTs 

sentitn/SUMO-spedtk: protease 
ESTs 

iiypoU«lk^ protein KIAA1164 
protocadtierin alpha 10 
hypothefical protein RG083MOS.2 
integrin. alpha 1 
KIAA0033 protein 

gb7b3Sf05.r1 Stralagene fstal spleen (9 
pyruvate dehydrogenase phosphatase 
minichromosome maintenance deficient (S. 
ESTs 
ESTs 

Homo sapiens cONA FU13545 lis. done PL 
ESTs. Weakly similar to T17248 hypottwti 
hypotheOcal protein FLI20174 
tumor protein p53-binding protein 
retinoblastom&^2(p130) 
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430)91 


AA830144 


Hs.135613 


418575 


AA22S313 


Hs.222886 


432426 


AW973152 


Hs.31050 


404571 






444427 


H25094 


H5.293663 


439183 


AW970600 


Hs.303261 


432886 


B£159028 


H3.279704 


432689 


AB018320 


Hs.278826 


417333 


AL157545 


H5.42179 


429302 


AU076674 


H3.198899 


412324 


AW978439 


Hs.69504 


422072 


AB016255 


Hs.111138 


431318 


AAS02700 


H5.293147 


417171 


BE613486 


Hs^1412 


421709 


AA159394 


HS.1070S6 


415155 


XB4908 


Hs.78060 


436446 


AWD16B09 


Hs.323795 


447482 


AB033059 


Hs.18705 


439726 


AW449893 


Hs^07 


42952S 


N92540 


Hs.205353 


409180 


AU076536 


Hs.50984 


419111 


AA234172 


Hs.137418 


452487 


AW207659 


Ks.6630 


416039 


AA376g89 


Hs.78989 


408239 


AA0S3401 


Ks.271827 


419241 


AA523939 


Hs.165258 


409752 


AW963990 




453894 


AW937825 


Hs.56847 


439671 


AW162840 


Hs.6641 


407230 


AA157857 


Hs.1 82265 


426083 


AW962712 


Hs.126712 


452221 


C21322 


Hs.288057 


434263 


N34895 


Hs.44648 


412766 


BE544475 


H5.54347 


418723 


AAS04428 


Hs.10487 


443970 


AI280341 


HS.16G571 


424534 


D876B2 


HS.15Q275 


419407 


AW410377 


Hs.41502 


447889 


AW469180 


Hs.170651 


429504 


X99133 


Hs.204238 


433165 


AA578904 


Hs^92437 


418700 


A1963808 


Hs.86970 


440282 


BE262386 


Hs.7137 


411630 


U42349 


Hs71119 


408915 


NM.016651 


Hs.48950 


448552 


AW973653 


Hs.20104 


432682 


A1376400 


Hs.159588 


435603 


Z44194 


H5.4994 


437444 


H46008 


Hs.31518 


410252 


AWB21182 


Hs.61418 


438497 


AA808725 


Hs.291712 


449226 


AB002365 


Hs.23311 


449465 


N[A.004a80 


Hs.23598 


420969 


Ai636310 


Hs.28310 


4(9845 


AW971183 


Hs.9683 


434415 


BE177494 




420805 


L10333 


Hs.99947 


415083 


AI632683 


Hs.27179 


430387 


AW372B84 


Hs.240770 


445280 


AW055053 


Hs.306088 


425657 


T89839 


Hs.1 19471 


426250 


AW809208 


Hs.183297 


453293 


AA382267 


Hs.10653 


412446 


A(768015 


H8.92127 


441102 


AA973905 


Hs.331328 


421689 


N87820 


Hs.106826 


448073 


W19789 


Hs.336635 


422244 


Y08890 


Ks.113503 


452323 


W44356 


Hs.292812 


441499 


AW298235 


H3.101689 


420164 


AA163408 


H5.95665 


420061 


AVt/024937 


Hs.29410 


420337 


AW295B40 


Hs.14555 


417407 


AA923278 


Hs.290905 


450580 


N40087 


Hs.15248 


419926 


AW900992 


Hs.93796 


421928 


AF0137S8 


Hs.109643 


412639 


AWg61284 


Hs.296235 


429786 


AU080232 


Hs.220696 


446131 


NM.000929 


Hs.290 


451418 


BE387790 


H3.26369 


416658 


U03272 


Hs.79432 


425139 


AW630488 


Hs^5820 



ESTs, Moderately sbnDar to 138022 hypot 9.8 

ESTs.WealdyslinIl8rtoTRHY.HUMANTRICH a7 

ESTs 9.7 

NMJ)15902^HGmo sapiens progestin induce 9.7 

ESTs, Moderatelyslmilar to 138022 hypot 9.7 

ESTs 9.7 

chromafin aocessibflity oomptax 1 9.7 

Aig/AbHnleracling protein ^9BP2 9.7 

bramodom^ and PHD finger oontaMng, 3 9.7 

eukoyoSc tran^aSon iniliafion Eactor 9.7 

ESTs 9.7 

KIAA0712 gene product 9.7 

ESTs. Moderately similar b A46010 X-ltn 9.7 

Bplnl 9.7 

CE(>6 protein 9.7 

phosphoryiase Hnase, beta 9.7 

ESTs 9.7 

KIAA1233 protein 9.7 

ESTs, Weakly similar to 138598 zinc ling 9.7 

ectonudeoside triphosphate diphosphohyd 9.7 

sarcoma amplifled sequence 9.6 

ESTs 9.6 

Homo sapiens cDNA FU 1 3329 fis. done OV 9.6 

alcohol dehydrogenase 5 (class III), chl 9.6 

ESTs. Moderately similar to ALU7.HUMAN A 9.6 

ESTs 9.6 
gb:EST376063 MAGE resequences, MAGH Homo 9.6 

Homo sapiens cONA FU12874 fis, done NT 9.6 

kinesln family member SC 9.6 

keratin 19 9.6 

ESTs, Weakly sMlartoAF191020 1 E2IG5 9.6 

hypothetical protein RJ22242 9.6 

ESTs 9.6 

ESTs 9.6 

Homo sapiens, done tMAGE:3954132, mRNA. 9.6 

ESTs 9.6 

KiAAQ241 protein 9.6 

hypothetic^ protein FU21 276 9.6 

ESTs 9.5 

llpocailn 2 (oncogene 24p3) 9.5 

ESTs 9.5 

ESTs, Moderately similar to ALU5_HUMAN A 9.5 

clones 23667 and 23775 zinc finger prote 9.5 

Putative prostate cancer tumor suppmsso 9.5 

heptaceHutar cardnoma novel gene^ pro 9.5 

hypothetical protein FU00052 9.5 

ESTs 9.5 

transducer of ER6B2. 2 as 

ESTs 9.5 

micn>tibriI^>assodated protein 1 9.5 

EST8.WeakIystmIlartoI36022hypo(heti 9.5 

KiAA0367 protein 9.4 

CREB binding protein (f^binsteln-Taybi s 9.4 

ESTs 9.4 

DnaJ (Hsp40) horooiog, eubfamiiy C, meirbe 9.4 
gb:RC6-HT0596-27030a011-C05 HTOSOe Homo 9.A 

retlcuton 1 9.4 

Homo sapiens eDNAaJ12933 lis. done NT 9!4 

nudearcap binding protein subunit2, 2 9.4 

v-cric avian sarcoma virus CTIO oncogene 9.4 

ESTs 9.4 

OKFZP566F2124 protein 9.4 

ESTs 9.4 

ESTs 9.4 

bitermedlatB filament protein syncOilln 9.4 

KiAA1696 protein 9.4 

Homo sapiens, done IMAGE:41 79482. mHNA 9.4 

kaiyopherin (impoitin) beta 3 9.4 

ESTs, Weakly similar to T3346e hypotheti 9.4 

ESTs a4 

hypotheQcd protdn 9.4 

ESTs 9.4 

Homo sapiens cONA:aJ21513(i3. done C > 9.4 

ESTs. Weakly simOar to protease [H.sapi 9.3 

ESTs 9.3 

DKFZP58502223prot^ 9.3 

poiyadenylate binding protein^nteractin 9.3 

ESTs 9.3 
Homo saptens mRN^ cONA DKFZp5e8A081 (tr 9.3 

phosphdipase A2. group V 9.3 

hypotheticat protein FU20287 a3 

fibi90n 2 (congenital contractural era a3 

protease, serine, 23 a3 
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44B807 
412505 
412314 
445704 
40^ 
423465 
42^ 
430027 
450580 
424850 
433862 
429227 
401558 
428634 
437678 
416653 
431117 
450608 
431187 
421202 
429597 
430669 



A1571940 
AA974491 
AA825247 
AI493742 

AW970976 

AI148006 

AB023197 

AF131784 

AA151057 



448822 
456161 
423800 
432189 
434361 
443247 
419879 
411928 
448704 
448430 
451246 
418821 
427213 
419103 
450654 
434194 
447033 
420757 
428593 
430223 
444984 
445674 
430294 
419852 
440349 
432161 
405523 
416662 
422763 
446183 
432897 



449523 
407204 
413554 
414917 
434539 
441805 
412S30 
424683 
426158 
413132 
426490 



441079 
437231 
431316 
403137 



418489 
453344 
410227 
417173 
408161 
420495 
433213 
451573 
408393 
434725 
418876 



AI961456 

AA811845 

AA829860 

AA768553 

AF003S22 

NM.004460 

AW971146 

AF193339 

NM.003816 

AW969657 

AF255910 

BE149845 

BE264645 

AA331156 

AA527941 

AF129755 

BE614387 

Z17805 

AAB88624 

AW080932 

AI500642 

AW189232 

AA436002 

AW007211 

Z40229 

AJ245587 

AF1ig847 

AI357412 

X78592 

AW207440 

NH.002514 

H15474 

BE410347 

A1538228 

AW503756 

AA884196 

AK000400 

T25853 

AA033699 

AA354991 

AF155099 

Y09763 

NM.000579 

R41933 

AA319146 

C04863 

AW748078 

AA285136 

AA766266 

NB7519 

NM,001982 

NM_006823 

NM.001621 

T15767 

AW150697 

BE303000 

AA502663 

AI821784 

U76421 

BE349075 

AB009284 

U61397 

AW952912 

A1338247 

AW865130 

AW130351 

AW01S318 

AK000796 

AA740616 



Hs.7549 EST8 

H5JI734 ESTs 

Ks.250899 heat shock MvbindiRg protein t 

Ks.165210 ESTs. Moderately sImBar to 138022 bypot 

NM.X1839*i{on» sapiens csifmln 3, add 

K$.2936S3 ESTs 

Hs.222120 ESTs 

Hs.227743 KIAA0980 protdn 

Hs.25318 Homo sapiens done 25194 mRNA sequence 

Hs.1 53498 chromosome 18 open reading tramel 

Hs.3610 KIAA0205 gene product 

Hs^l27S hypothetical protein FU1 1 01 1 

ENSP00000220478*:SECRETOGRAN]N III. 

Hs.106290 Keteh moUf containing protein 

Hs.122834 ESTs 

Hs.74170 metaitothloneln 1E (functional} 

HS.25OSO0 delta (DrosophnaHDce 1 

Hs.41 8 fibrotiast sctivalion protein, etphs 

HS.2931B7 ESTs 

HS.102S06 eidtaryofic translation InlDalion factor 

Hs.2442 a dlsintegrin and meteDoproteinase doma 

Hs.291029 ESTs 

Hs.54650 junctional adhesion molecule 2 

Hs.289038 hypothetical protein M6C41 26 

Hs.282093 hypothetical protein FU21 91 B 

gb:EST35034 Endxyo. 6mek. subtracted ( 
gb:nh30c04Al NCLCX5AP^Pr3 Homo sapiens 

Hs.117772 ESTs 

Hs.333893 c-Myc target JPOl 

Hs.93564 Homer, neuronal Immediale early gene. 2 

Hs. 197289 rab3 GTPase-activaling protein, norvcata 

Hs.249247 heterogeneous nuclear protein similar to 

Hs.289067 Homo sapiens cDNA RJ1 1404 Us, done HE 

Hs.39140 cutaneous T-celllymphoma tumor antigen 

Hs.ie3161 ESTs 

Hs.16131 hypothetical protein aJ12876 

Hs.96423 hypothetical protein FU23033 

Hs.25275 KruppeMype zinc linger (votein 

HsJSSMO Homo sapiens PRO1550mRN^ partial ods 

Hs.157601 ESTs 

Hs.99915 androgen receptor (dihydrotestcsterone r 

Hs.185973 degenerative spermatocyte (hom^ Droso 

Hs.235935 nephrobtastoma overexpressed gene 

H5.132698 fatty add desaturase 1 

Hs.13063 transcription factor CA150 

Hs.32976 guanine nucleotide binding protein 4 

Hs.286184 hypothetical protein dJSSIDlS 

Hs.31476 Homo sapiens cDMA FU13872 Us, done TH 

H5.292807 ESTs, Weakly similar to envetope [Rs^ 
C8001409«:gi)7441226lpir)jS31212 coOage 

Hs.7538 ESTs 

Hs.83938 ESTs. Moderately slmliar to MASZ^HUMAN M 

Hs.14222 HoiiK) sapiens mR^^^•cD^WOKFZp761P019(fr 

HS.2797B0 NY.REN.18 an«gen 

Hs.22785 gammaamlnobulyric add (GABA) A recepto 

HS.S4443 diemokine(C-Cmo80 receptor 5 

Hs.140237 ESTs, WeaMyslmHartDALUlJflJMAN ALUS 

Hs.75428 secretogranln II (chromogranln Q 

Hs.47191 ESTs 

Hs.21 4410 ESTs. Weakly similar to MUCZJtUMAN MUQN 

Hs.301914 neuronal spedticbanscrlpaon factor D 

Hs.266273 hypothetical protein FU13346 

Hs.27196 ESTs 

Hs.1 99067 v-erb-b2 avian erythroblastic leukemia v 

Hs.75209 protein kinase (cAMP^lependenl catalyti 

Hs.170087 aiyl hydrocartion receptor 

Hs.22452 Homo sapiens mRNA for KIAA1737 protein, 

Hs.107418 ESTs 

HS.2B8929 hypothetk^l protein FU13^ sbnlar to 

HS.14S037 ESTs 

NK^00538r:Homo sapiens nudeoOn (NCg. 

HS.188S78 ESTs 

Hs.85302 adenosine deaminase. RNA-spedite. B1 (h 

Hs.44571 ESTs 

Hs.61 1 52 exostoses (multipIeHike 2 

Hs.81424 ubiqullifvlike 1 (sentrin) 

Hs.300363 hypothetcal protein MGC3032 

Hs.98314 Homo sapiens mRNA; cONAOKFZb5B6L0120(f 

Hs.137190 ESTs 

Hs.243549 ESTs 

Hs.23165 ESTs 

Hs.4104 hypotheOcai protein 

gbaTy97f11.8l Na^CGAP.GCBI Homosa^jlens 
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AWl 01975 


HS.16S892 




UlJlOO 


H8.^002 




AI940675 


Hs.^14 




/mo1d0o6 


Hs.296341 




/Wviyonw 


1 4 eo4 P 1 

Hs.158154 


•W 1000 


Dc4o4341 


LJ. A4 AA4 

Hs.21201 


AAMiAl 


ni471B 


Hs.11S06 


444090 


Al 199730 


Hs.208275 




AW450OOO 


Hs.257359 




AAOoUOoO 


11^ 4 AJn J M 

Hs. 124344 




NM.005100 


Hs.7B8 


410275 


U85658 


Hs.61796 


425465 


LI 8964 


H$.1904 


424113 


Ar743880 


HS.12S76 


459324 


AVVuo0953 




451684 


AF216751 


Hs.26813 


451244 


AW008798 


Hs.267448 


439170 


AA332365 


H5.165539 


424238 


AA337401 


Hs.137635 


449686 


AW072813 


Hs^0668 


440658 


H29142 


H3.143032 


410870 


U81599 


Hs.66731 


458229 


AI929602 


Hs.177 


429831 


AA5d4488 


Hs.137526 


453468 


W00712 


Hs.32990 




AFZ61088 


Hs.154721 


453345 


AA302862 


HsJ0Q63 


AMone. 
49£4CUo 


AW34U281 


Hs.33074 


440486 


ob243513 


Hs.7212 


4499^9 


OC4AtMCe 

Bk14w8D 


Hs.14831 




AF1o9011 


Hs.49163 


424668 




Hs.151573 


ill 971 0 


AW016d10 


Hs.129911 


423872 


AB0203i6 


Hs,134015 


408242 


A A f\C4 f Ail 

AA25i594 


Hs.43913 


4ouo4o 


AVVD21726 




4^14Sft£ 


BE17dS90 


H5.104916 


4430 y J 


T97999 


Hs.18214 


410442 


X73424 


Hs.63788 


425913 


AA3o57S9 


H$.50785 


425704 


U79293 


Hs.1 59264 


427723 


AI359260 


Hs.279789 


441663 


BES64214 


Hs.102946 


447335 


8E6 17695 


Hs.286192 


428259 


AA424793 


Hs.24144 


433104 


AL043002 


Hs.126246 


420041 


ABra5142 


Hs.94592 


432432 


AA541323 


Hs.1 15831 


452335 


AW188944 


Hs.61272 


412673 


AL042957 


HS.31&45 


430335 


D80007 


Hs.239499 


419904 


AA974411 


Hs.18672 


442402 


NM_000954 


Hs.8272 


433759 


AA680003 


Hs.109363 


434064 


AL049045 


Hs.180758 


401785 






410407 


At ^AA040 

AL044B18 


Hs.84928 


419647 


A A 4 J AA A^ 

AA346947 


Hs.91816 


425907 


A A lOPYf A 

AA365752 


Hs. 155965 


452017 


Ar4 AA'MXA 

AF109302 


Hs.27495 


AOAA^A 

424494 


H67U2B 


Hs.146861 


400303 


A A AJ A^rn 

AA242758 


Hs.79136 


414660 


AA743331 


Hs.272572 


413992 


VV2o275 


Hs.1 36075 


415071 


AK002197 


Hs.284270 


451230 


AAA 

BE546208 


Hs.26090 


441790 


Ai Aim J nnn 

AW294909 


Hs.1 32208 


441124 


T97717 


Hs.1 19563 


438160 


AA779332 


Hs.1 22871 


433264 


085762 


Hs.3229 


434851 


AA808164 


Hs.1 16502 




DC9*H}Zr/ 


Hs.i 03104 


426535 


AU077012 


Hs.268562 


411667 


BE1601S8 




454058 


AI273419 


Hs.135146 


435614 


R09718 


Hs.20403 


446595 


T57448 


H8.15467 


43S677 


AA684142 


Hs.293726 


447894 


AW204253 


Hs.21912 


452682 


AA456193 


Hs.9071 


418372 


AA311833 


Hs.84318 


447231 


AK001293 


Hs.330208 


416206 


AW206248 


Hs.111092 



ESTs a8 

endotheOn receptor ^6 aS 

hypothsfical prctdn fUTZOSB Z3 

adenylyt cydasa^ssodatad protein 2 8^ 

ESTs. WeaMysimlarb granule cell mar aS 

necOn 3; DKF2P56580846 protein a8 

Human clone 23589 mRNA sequence a8 

ESTs, Weakly similar Id AlUAuHUMAN !!U a8 

ESTs ae 

ESTs a8 

A kinase (PRKA) anchor protein (gravfn) a6 

transaiptkm (actor AP-2 gamma (acSvat aS 

protein kinase Ciola aS 

ESTs as 
glKxc28c12j(1 NQ_CGAP_Co1B Homo sapiens a8 

C0A14 a8 

hypottietica) protein inj20039 as 

ESTs. Weakly similar to 138022 iiypotiietl a8 

ESTs as 

ESTs. Moderately similar to ALU4_HUMAN A a8 

EST 8, Weakly similar to neuronal thread 6.8 

homeo box B13 aS 

phosphatidy&)ositolgIycan.classH aS 

ESTs as 

DKFZP566I^ protein aS 

acomtase1.8olul)ie 8.8 

neuiocaldn delta as 

Homosapiens.cioneIMAGE:3606519, mRNA, as 

hypothetteal protein PP1044 a7 

Homo sapiens. Similar to zinc linger pro a7 

putaHvB rOxxuidease lH 8.7 

cryptochrome 1 (photoiyas&m 8.7 

ESTs a? 

uronyl 2-su!fotranster8se a7 

PIBF1 gene product a7 

gb:dt27e02.y1 litoiton Fetal Cochlea Homo a? 

hypoBetlcal protein FU21 940 6.7 

ESTs. Weakly similar to B34087 hypotheti 8.7 

propionyl Coenzyme A cartxjxylase, beta p 8.7 

SEC22. vesicte tratScking protein (S. c a7 

Human dona 23948 mRNA sequence 8.7 

histonedeacelyiaseS 8.7 

ESTs 8.7 

hypo(hettealprotdlnFU20940 8.7 

ESTs a7 

ESTs, Moderate^ similar to unnamed prot a? 

Idotho 6.7 

ESTs a7 

ESTs a7 

ESTs 87 

KIAA0185 protein 8.7 

ESTs ae 

prostaglandin D2 synthase (21kO. brain} as 

Homo sapiens cONA: FU23603 fis, clone L 8.6 

hypothetical protein PRO0082 a6 

NM_002275*:Homo sapiens kera&i 15 (KRT1 8.6 

nuclear transcription factor Y, beta a6 

hypothetical protein a6 

ESTs 8.6 

prostate cancer assodaledprot^ 7 8.6 

hypothetical protein FIJ20560 8.6 

UV-1 protein, estrogen regulated a6 

hemogkibin. alpha 2 8.6 

RKA.U2small nuclear 8.6 

HoroosaplenscDNAFU11335fis,ctonePL 8.6 

hypotheDcal protein FU20272 8.6 

ESTs 8.6 

ESTs 8.6 

ESTs a6 
cysteine dioqrgenase. type I .8.6 

ESTs a6 

ESTs a6 

ESTs.Weaklyslmllartotibh|u{lousTPRm a6 
gb:QV1-HT04mi0200<)59^HT0413Hbmo aS 

hypothetkal protein FU13984 a5 

ESTs a5 

hypothetica) protein FLJ20725 8.5 

ESTs. Weakly s&ililar to TSGA RAT TESTIS a5 

ESTs a5 

progesterorw membrane binding protein 8.5 

repllcalion protein A1 (70kO) aS 

crystaII]n.zetB(quinonereductase>4ik 8.5 

hypothetfealprotdnnJ22332 as 

689 
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434747 


AA83708S 


Hs.220585 


ESTs 


as 


415938 


BE383507 


Hs.78921 


A kinase (PRKA) anchor protein 1 


as 


450813 


AI73982S 


Hs.203376 


ESTs 


as 


413956 


AI821351 


Hs.193133 


ESTs, sintflar to ALU7 JUiMAN ALU S 


as 


408699 


AA056614 


Hs.108200 


ESTs. Moderately similar to ZN91_HUMAN Z 


as 


440840 


AW829866 


Hs.283713 


ESTs. simnar to S64054 hypoM 


8.5 


407891 


AA486620 


Hs.41135 


endoiTUicii>-2 


8.5 


411019 


AW993097 


Hs.48617 


Komo sapiens cONA FU12540 lis, done 


85 


432620 


AA777749 


HS.5978 


UM domain only 7 


8.5 


424511 


BE300512 


Hs.193557 


ESTs, Moderately simBar to ALU7.KUMAN A 


as 


408170 


AW204516 


Hs.31835 


ESTs 


as 


452670 


AF068227 


Hs.30213 


cenjid-tipofuscinosiSi nourond 5 


as 


450516 


AA902656 


Hs.21943 


NIF3 {Ngg1 Interacting fiactor 3, S.pombe 


as 


417560 


U73338 


H3,82283 


S-methyttetrahydrotolate-liomocysteine mo 


as 


430606 


BE266026 


Hs.31476 


Homo sapiens cOKA FU13872 1)3. done TH 


a4 


436953 


AW959074 


Hs.23648 


HoHK) sapiens cDNA FU 13097 fis, ctom NT 


a4 


418120 


AA213437 


Hs.192249 


ESTs 


a4 


419436 


AA991639 


Hs.242413 


iiypothetica] protein 0KFZp434K1421 


8.4 


432600 


AI8210d5 




Qb:ns95a12.y5 Na„CGAP.Pr3 Homo sapiens 


a4 


44B474 


AI792014 


Hs.13809 


hypothetical protein FU10648 


a4 


434384 


AA631910 


Hs.162849 


ESTs 


a4 


452234 


AW084176 


Hs.223296 


ESTs. Viteakly stoiOar to 138022 hypolheti 


a4 


411562 


ALJ050201 


Hs,70769 


hypotheScei protein OKFZp586E1923 


a4 


421622 


AB037748 


Hs.106204 


KIAA1327 protein 


a4 


431160 


AW971174 


Hs.324330 


ESTs 


a4 


449567 


AI990790 


H8.188614 


ESTs 


8.4 


411088 


BE247S93 


Hs.145053 


ESTs 


a4 


452185 


AA120761 


Hs.28307 


WW domain l)indlng protein 4 (tbrmln bind 


as 


437872 


AK002015 


H3.5B87 


RNA binding motif protein 7 


8.3 


429393 


AA383024 


Hs.201603 


Homo sapiens mRNA; cDNA DKFZp434D0917 (( 


a3 


425692 


D90041 


HS.1S5958 


N-acetyftrsnsferase 1 (aiytomina N-acety 


8.3 


438440 


AA807228 


HS.22S161 


ESTs 


a3 


431952 


AL049385 


Hs.272251 


Honw sapiens mRNA; cONA OKFZp586M141B (f 


as 


422295 


AF051151 


H3.1 14408 


tolWilte receptor 5 


as 


440146 


AW014231 


Hs.90790 


Homo sapiens cONA: FU22930 fis. ctone K 


as 


425242 


013635 


Hs,155287 


KIAAOOIO gene product 


8.3 


432769 


AA620814 


Hs.144959 


ESTs 


8.3 


43361S 


AA732982 


Hs.269607 


ESTs. Weakly similar to ALUI.HUMAN ALU S 


as 


427229 


AI799751 


Hs.5635 


ESTs 


a3 


446947 


AF146747 


Hs.232165 


polycythemia rubra vera 1; ceQ surface 


as 


403149 






NH.001450:Homo 8a;»ens four and a half L 


as 


436679 


AI127483 


Hs.120451 


ESTs. Weakly similar to unnamed protein 


as 


421991 


NM_014918 


Hs.1 10488 


KIAA0990 protein 


8.3 


436476 


AA326108 


Hs.33829 


bHLH protein 0EC2 


8.3 


434980 


AW770553 


Hs.14553 


sterol O-acyitransferase (acyl-Ooenzyme 


as 


407385 


AA610150 


Hs.272072 


ESTs. Weakly similar to 138022 hypothsD 


as 


453121 


At968264 


H&232286 


ESTs 


as 


420630 


AL133101 


KS.99S08 


Homo saptens mRNA: cDNA OKFZp434O0921 (f 


aa 


426101 


AL049987 


H3.166361 


Homo sapiens mRNA; cDNA 0KFZpS64F112 (Ir 


6.2 


419517 


AF052107 


Hs.90797 


Homo sapiens done 23620 mRNA sequence 


8.2 


425707 


AF115402 


Hs.11713 


E74-like factor 5 (ets iamsb transcripl 


a2 


440080 


AW0S1597 


Hs.143707 


ESTs 


a2 


422049 


W25760 


Hs.77631 


glycine deavage system protein H (amino 


a2 


4301S4 


AW583058 


Hs.234728 


serine (or cysteine) protetoase biMbito 


a2 


432695 


063480 


Hs.278634 


KiAA0146 protein 


a2 


407300 


AA102616 




gb2n43e07.s1 Stratagene HeLa caO sS 93 


a2 


408964 


AF201468 


Hs.49349 


beta-site APP-deaving enzyme 


a2 


418051 


AW192S35 


Hs.19479 


ESTs 


a2 


412125 


Y17114 


Hs.73393 


eyes absent (Drosophlte) homotog 4 


a2 


441523 


AW514263 


Ks.301771 


ESTs. Weakly similar to ALUFJiUMAN lUI 


a2 


417956 


AA767382 


Hs.193417 


ESTs 


a2 


452040 


AVV973242 


Hs.2936g0 


ESTs, Weakly slmRar to 138022 hypothsU 


a2 


430468 


NM.00467a 


H8.241519 


angtopotetin-Gke 1 


a2 


407802 


D84145 


Ks.39913 


novel RGOcontaining protein 


a2 


420556 


AA278300 


Hs.124292 


Homo sapiens cONA: FIJ23123 Ss, clone I 


a2 


414138 


AA135884 


Hs.3772 


ESTs 


a2 


438209 


AL120659 


Hs.6111 


aryl-hydrocarbon receptor nudear transi 


a2 


408267 


AW380525 


Hs^2675 


hibuQn^pedSc diaperone e 


8.2 


429692 


N48422 


Hs.9977 


ESTs. Weakiy similar to B34087 hypolheO 


8.2 


408108 


AI560492 


Hs/2743 


hypothettoai prol^ 


a2 


408418 


AW963897 


Hs.44743 


KIAA1435prot^ 


a2 


430334 


AI824719 


Hs.328700 


ESTs 


a2 


422083 


Nlil.001141 


Hs.111256 


arachhtonate 15-tIpoxygenase, second typ 


a2 



TABLE 59B 



Pkey: Unk^ue Eos probeset identifier number 

CAT number Gene duster number 

Accesston: Genbank accesston numbers 



Pkey CAT Number Accesstons 

409385 112523.1 AA071267T6594OT64515AA071334 
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PCT/US02/36810 



4097S2 


115301.1 


AW96399D AA078ig6 AW749482 AA077468 BE151571 AA376917 


411479 


1247077.1 


AWB48047AW848202 AW848631 AW848142 AW848702AW648121 AW846632 AW848140AW848571 AW648009 AW848067AW848069 AW848905 






AW848214 


411667 


1253334J 


BE160198AW935898T11520AW935930AW856073AW861034 


416913 


163001.1 


AW934714 BE161007 BE 1 62500 AW749902 AW7498&4 BE1624g8 BE1 61 005 AA1 90449 AW513465 6E161006 BE162499 


418866 


179788^1 


T65754 AA229357 AA229658 


418876 


179960.1 


AA740616AA654854 AA229923 


419536 


185688 1 


AA603305 AA244095 AA244183 


419544 


185760.2 


AI909154 AA526337 AA244193 AI909153 


423800 


232161J 


AA33 1 1 56 AA331 1 57 AA33 1 1 55 


426413 


266650J 


AA377823 AW954494 AI022&88 


429163 


300543 1 


AA684766AW974271 AAS92975AA447312 


430848 


324621.1 


AW021726 AA487752 AA488085 


431121 


328275 1 


AW971157 AA492575 AA492520 


432189 


342819.1 


AA527941 AI810608 AI620190 AA635266 


432600 


3S09S9J 


A1821065AW973464AA554802AiB21831 AA657438AA640756 AA65Q339 


434415 


385931.1 


BE177494 AWZ7e909 AA632849 


434579 


38916_1 


T55958 T57205AF147346 


439518 


47334 1 


W76326AF086341 W72300 


443161 


561305J 


A1038316AI344631 Ai261653 


447974 


745643.1 


R76886A14S3674R77049 



TABLE S9C 



Pkey: Unique numbor corespondlng to an Eos ptobesel 

Ref: Sequence sotiice. The 7 digit numbers in this cohmm are Genbahk ktenUfier (Gl) number. tXmham I. et al.' refers to Bie puMicaBon entitled The DMA sequence of 

human chromosome 2Z* Dunham 1. et ai„ Nature (1999) 402:489495. 
Strand: Indicates DMA strand from which exons were predicted. 

Nt_position: Indicates nudeoUde positions of predicled exons. 



Pkey 


Ref 


Strand 


NUposilton 


400860 


9757499 


Minus 


151830-152104.152649-152744 


400880 


9931121 


Plus 


29235-29336,36353.36580 


401197 


9719705 


Plus 


176341-176452 


401424 


8176894 


Rus 


24223-24428 


401519 


6649315 


Plus 


157315-157950 


401558 


7139678 


Phis 


103510-104090 


401747 


9789672 


Minus 


118596-116816.119119-119244^19609-119761,120422-120990,130161*130381.130468-130593.13109^^ 








131932. 1 32451-1 32575,133580-13401 1 


401785 


7249190 


Minus 


16577&'165996,166189-166314.166408-166S69.167112-167266,167387-167469,168634-168942 


402076 


8117410 


Plus 


128316-128627 


402812 


6010110 


Plus 


25026-25091.25844-25920 


402855 


9662953 


Minus 


59763-59909 


403047 


3540153 


Minus 


59793-59988 


403137 


9211494 


l^us 


92349-92572.92958-93084.93579-9371^93949-94072.94591.94748,95214J95337 


403149 


9799833 


Rus 


25034-25185 


403362 


8571772 


Rus 


64099^260 


404210 


5006246 


Phis 


169926-170121 


404571 


7249169 


Minus 


112450-112648 


404641 


9796810 


Minus 


32247-32362 


404642 


9796810 


Rus 


102999-103145 


405523 


9454643 


Plus 


11455O-114688,117265.117407.11949W19599,123237-123395,13114O.131217 



Table 60A ttsts about 1 166 genes up^egulated in prostate cancer compared to norma) adult tissues. These were selected from 59680 probesets on the Aftymetrlx/Eos Hu03 
GeneChIp array such that the rafbof 'averBgO* prostate cancer to 'average* nonnal adUt tissues was greater than or equal to 3.0. The 'average' proslale cancer level was set to the 
75"> percentite amongst prostate cancers. The 'average* norma) adiA Sssue level was set to Die 85*> percentile anrangst ncn^nalignanl tissues. In order to remove gene-specific 
background levels of non-spedfic hybrMization, the lO^peroenlilevdue amongst the rnrHieHgnant tissues was subtracted from nuRierator and the denominator before the 
rattowasevahiated. 

TABIE60A: ABOIJT 11 66 GENES UP-lgGUUTED IN PROSTATE CANCER COMPARED TO NORMAL AIM^^ 

Pkey: Unkiue Eos probeset UenlHIer number 

ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelO: Unigene number 

Untgene Title: Unigene gene Site 

R1: Ratio ctf prostate tumor to normal adult body Qssue 



Pkey 


ExAccn 


Unigene ID 


Unigene TKIe 


R1 


426747 


AAS3S210 


Hs.171995 


kallikFein 3, (prostata speciTic anOgen 


56.6 


419526 


AI821895 


Hs.193481 


ESTs 


56.2 


420154 


AI093155 


K5.95420 


JM27 protein 


44.0 


432441 


AW292425 


Hs.163484 


ESTs 


41.9 


414569 


AF109298 


Hs.118258 


prostate cancer associated protein 1 


39.9 


431616 


AA508552 


Hs.195839 


ESTs. Weakly similar to 138022 hypotheti 


37J 


400299 


X07730 


Hs.171995 


kallikrein 3, (prostata specilic antigen 


38.0 


446057 


AI420227 


KS.14935B 


ESTs.WaaMy6lmBartoA46010Xfinked 


32.9 


425075 


AA5D6324 


Hs.1852 


ackf phos|4iatase^ proslale 


31.1 


400298 


AA032279 


H3.6183S 


dx transmembrane epithelial antigen of 


30.0 


439176 


A1446444 


Hs.190394 


ESTs, V\toaUy sbiiHar b B28096 line-1 pr 


29.8 



691 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



400302 


K48056 




431579 


AW971082 


Hs.222888 


432101 


A19189S0 


H8.i23642 


408369 


R38438 


Hs.182575 


453096 


AW294631 


Hs.11325 


400287 


S39329 


Hs.181350 


409262 


AK000631 


Hs^56 


428819 


AL135623 


Hs.193914 


401424 






453370 


AI470523 


Hs.139336 


400292 


AAS0737 


Hs.72472 


407202 


N58172 


Hs.109370 


415989 


Ai267700 




407709 


AA456135 


Hs.23023 


40716B 


R45175 


Hs.1 17183 


433444 


AW975324 


Hs.129816 


409731 


AA125985 


Hs.56145 


428336 


AAS03115 


Hs.183752 


410929 


H47233 


HS.30&43 


424099 


AF071202 


Ks.139336 


418961 


AW967646 


Hs.23023 


452792 


AB037765 


Hs.30652 


403047 






433466 


AA508353 


Hs.105314 


418396 


AI765805 


Hs.26691 


433647 


AA603387 


Hs.222294 


432240 


A)694767 


Hs.129179 


430487 


D87742 


Hs.241552 


440260 


AI972867 


Hs.7130 


429170 


NM.001394 


Hs^9 


429220 


AW207206 




416370 


N90470 


Hs.203897 


416819 


AA22e776 


Hs.191721 


421513 


X00949 


Hs.105314 


429918 


AW8739B6 


Hs.119383 


420092 


AA814043 


HS.8&045 


432473 


AI202703 


Hs.152414 


450693 


AW450461 


Hs^3965 


431548 


A1B34273 


Hs.9711 


432674 


AA641092 


Hs.257339 


430167 


AI799909 


H$.158989 


441690 


R81733 


H&33106 


408621 


AI970672 


Hs.46638 


448999 


AF179274 


Hs.22791 


450642 


R39773 


Hs.7130 


430376 


AW292053 


Hs.12532 


432966 


AA650114 


Hs.325198 


410330 


AW023630 


Hs.159425 


434666 


AF151103 


Hs.1 12259 


400297 


AI127078 


Hs,306201 


423073 


BE252922 


Hs.123119 


431474 


AL133990 


Hs.190642 


416829 


AA516531 


Hs.55999 


453661 


AI02663a 


Hs.30120 


434217 


AW014795 


Hs.2334g 


418432 


M14156 


Hs.e5112 


428398 


AI249368 


Hs.98558 


421059 


AI654133 


Hs.30212 


447313 


U92981 


Hs.18081 


443980 


A1093577 


HS.2S5416 


448290 


AK002107 


Hs.20843 


436032 


AA150797 


Hs.109276 


432527 


AW975028 


Hs.102754 


434792 


AA649253 


KS.1324S8 


420424 


A8033036 


.Hs.97594 


440529 


AW207640 


Hs.1 6478 


424846 


AU077324 


Hs.1832 


448519 


AW175665 


HS.27889S 


418339 


AA639902 


Hs.104215 


432600 


AI821085 




413597 


AW302885 


Hs.117183 


446100 


AWg67109 


Hs.13804 


431448 


AL137517 


H5.306201 


424701 


NH.005923 


Hs.151968 


432435 


BE216886 


Hs^82070 


429290 


AF203032 


Hs.198760 


410467 


AF102546 


Hs.63931 


439444 


AI277652 


H8.54578 


450362 


AA3976S8 


Hs.60257 


452340 


NM-002202 


Hs^ 


417332 


AW972717 


Hs.288482 


407021 


U52077 




419563 


AA526235 


Hs.193162 



ftsSata hydrolase (prostatMpedScmemb 2a9 

ESTs.Weakly8lni9artoTRHy.HUMANTRiCH 27^ 

EphA3 26J 

sohitseanier family 15 (H777|raRspor1 26.2 

ESTs 25.7 

kaHikfein 2, prostatic 25.5 

hypothetical protein FU20624 24.6 

KIAA0575 gene product 24.5 

NM.001172Hon)o sapiens arginase, type n 24.5 

ATP-Wnding cassette, sub-family C (CFTR 22.4 

BMP^IB 2Z3 

ESTs 22.3 

ESTs 20.2 

ESTs 20,0 

ESTs 19,6 

ESTs 19.4 

thymosin, beta, identiSed (n neuroblast 19.1 

microsemlnoprDtetn. beta- 18.7 

ESTs 18.0 

ATP-bindlng cassette, sub-family C (CFTR 17.9 

ESTs 17.7 

K1AA1344 protein 17.5 

NM„005656*:Homo sapiens transmembrane pr 17.3 

reladn 1 (HI) 173 

ESTs 17.1 

ESTs 16.9 

Homo sapiens cOKA RJ13581 lis, done PL 16.7 

KIAA0268 protein 16.5 

copine IV 16.0 

dual spediicity phosphatase 4 15.7 

ESTs 15.7 

ESTs. Weakly simOar to 138022 hypotheli 15.3 

ESTs 14,8 

relaxinl(HI) 14.6 

ESTs 14.5 

ESTs 14.5 

ESTs 14.4 

ESTs 14.4 

novel protein 14,4 

ESTs. Weaidy similar to 138022 hypotheli 14.3 

ESTs 14,3 

ESTs 14,3 

chromosome 1 1 open reading frame 8 14.2 

transmembrane piotEin with EGF-like and 13.7 

copine IV 13.5 

chromosoflfie 1 open reading frame 21 13.4 

ESTs 13.4 

ESTs 13.4 

T cell receptor gamma locus 13.1 

hypothetical protein OKFZp56401278 13.0 

MAO (mothers against decapentaple^ Or 12.8 

ESTs 12.6 

NKhomeobax(Diosophaa),fem]ly3,A 12.5 

ESTs.WeaI(IyslmRartoNUCLHUMANNUCLE 12.5 

ESTs 12.3 

InsuQn-like growth factor 1 (somatamedl 1^3 

ESTs 12.2 

thyroid recqriorinlerac&igprotdn 15 12.2 

Homo8aplenscloneDT1P1B6mRNA.CAG(ep 12.2 

hypothetica) protein FU21986 1Z1 

Homo sapiens cONA FU1 1245 fis. done PI. Ill 

lalexin protein 12.0 

ESTs 12.0 

ESTs liO 

K1AA1210 protein 11.9 

Homo sapiens cONA: FU21718 fis. done C 1 1.9 

neuropeptide Y 11.9 

Homo sajtons prostein mRhtA. complets cds 1 1.9 

ESTs. Moderately similar to SPCNJ1UMAN S 11.8 

gb:ns953lZy5 ICLCGAP_Pr3 Homosaplens 11.8 

ESTs 11.8 

hypothetical protein dl462023.2 11.7 

hypothetical protein 0KFZp554O1278 11.6 

mitogen-ectivated protein kinase kinase 11.6 

ESTs lU 

neurofilament, heavy polypeptide (200kP) 1 1 .5 

dachshund (Drosopfiila} hornolog 1 1 .5 

ESTs. Weakly sintfar to 138022 hypothefl 11.5 

Homo sapiens cDNA FU13598 fis. done PL 1 1.5 

ISL1 transcriptton factor. UMnrnneodoma 1 1.1 

hypothetical protein FU21511 11.0 

gb:Human marfnerl Irensposase gene, comp 1 \J(i 

Homo sapiens cDNAaJ11983 fis, done HE 10.8 
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432729 AK000292 Hs.130732 hypothetical protein RJ20285 10.6 

400301 X0363S Hs.1657 estrogen receptor 1 10.5 

419536 AA603305 gb:np12d1U1 Na.CGAPJPr3 Homo sapiens m 

^ 445472 AB006631 Hs.12784 Homo sapiens mRNA far KIAA0293 gene, par 105 

5 415539 AI733881 H5,72472 BMP-RIB 10.4 

419264 AA877104 Hs.293672 ESTs.WbaMysIfniIartDAlUB.HUMANini 10.4 

424432 AB037821 H5.146B58 prolocadherlnID 10.3 

439518 W76326 gb:zd60d04.r1 Soaras_tBtaU)8ait.NbHH19W 10.3 

^_ 434038 A1659131 H5.197733 hypdheGcd prot^ MGC2849 103 

10 415263 AA948033 Ks.130853 ESTs 10.2 

431217 NML013427 HS.2S0830 Rho GTPssa xthrafing protein 6 10.2 

444805 AB007899 Hs.12017 hontologofye8slut}lquitbH)roteinngB5 10.1 

437252 AI433833 Hs.1641S9 ESTs. Weakly similar to ALU I.HUMAN ALUS 10.1 

444190 A1878916 Hs.10526 cysteine and gtyctne-rtch protein 2 10.0 

15 418293 A1224483 Hs.16063 hypolheticat protein FU21Bn 10.0 

432244 AI869973 H&200574 ESTs 10.0 

451640 AA195601 Hs.26771 Human DMA sequence from done 747H23 on 9.9 

450164 A!239923 Hs.63931 ESTs 9.8 

447072 D61594 Hs.17279 lyrosytprotein sulfotransferase 1 9.8 

20 415079 R43179 Hs.22a95 hypotheticai protein FU23548 07 

409799 011928 Hs.76B45 phosphoserine phosphafBse^ike 9.6 

433234 AB040928 Hs.65366 K1AA1435 protein 9.5 

400236 AA305627 Hs.139336 ATP-blnding cassette, sub-family C(CFTR 05 

422728 AW937826 Hs.103262 ESTs. WeaWy sbrolar to ZN91.HUMAN ZINC 05 

25 425154 NM.001651 Hs.154S50 collagen, type IX. alpha 1 04 

433927 A1557019 H$.116467 smafl nuclear protein PRAC 04 

444670 K58373 Hs.332938 hypothetical protein MGC5370 04 

449625 NVL014253 odz (odd OzAen^ Orosophita^ homolog 1 04 

^- 450325 Ar935962 Ks.26289 ESTs 04 

30 427761 AA412205 Hs.1409g6 ESTs 04 

453930 AA419466 Hs.36727 hypothetical protdn aj 10903 04 

416795 AI497776 Ks.20509 HBV pX associated protein^ 03 

422805 AA436989 H8.121017 H2Ahistonerandly. member A 03 

^- 418848 At820961 Hs.193465 ESTs 9.2 

35 408430 S79876 Hs.44926 dIpeptidylpeplkJase IV (CD26, adenosine 02 

415788 AW628686 Hs.78851 K1AA0217 protein 02 

404641 NM.021965*:Homo sapiens phosphoghicomuta 01 

432437 W07088 Hs.293685 ESTs 01 

436396 AI683487 Hs.152213 wmgless-typeMMTV integration Bits fami 01 

40 436962 AW377314 Hs5364 DKFZP564I052 protein 01 

410765 A1694972 Hs.66180 nucleosome assembly protein l-blce 2 9.0 

428342 A1739166 Homo sapiens cDNAFU13458 lis. done PL 00 

452281 T93500 Hs.28792 Homo sapiens cDNAFUl 1041 lis, dona PL OO 

433043 W57554 Hs.125019 lymphoid nudear protein (LAF-4}mRNA 8.8 

45 442082 R41823 Hs.7413 ESTs 08 

436671 AW137159 Hs.146151 ESTs 07 

415785 AW419196 Hs.257924 hypothetical protein FU13782 07 

440774 A1420611 Hs.153934 ESTs 07 

420120 AL049610 Hs.95243 banscripfion elongation factor A (811). 07 

50 427138 ^n7624 Ks.173717 phosphatidic add phosphatase type 28 07 

430722 AW968543 Hs.203270 ESTs. Weakly similar to ALULHUMAN ALUS 06 

422487 AJ010901 Hs.198287 mudn 4, tradieobrandOai 06 

443646 AI085198 Hs.164226 ESTs 06 

414565 AAS02972 Hs.163390 hypothetical protein FU13590 05 

55 428728 NM.016625 H$.191381 liypolhelical protein 05 

434926 BE543269 Hs.502S2 rrdtodiondrlal ribosomal protein L32 05 

437162 AW005505 Hs.5464 thyroid homwne receptor ooadhralinBpr 05 

437866 AA1S6781 metaltothionein IE (functional) 05 

^- 453006 AI382575 Hs.303171 ESTs 04 

60 418723 AA504428 Hs.10487 Homo sapiens, done tlMAGE:3954132.mRNA. 03 

421040 AA71S026 Hs.135280 ESTs 03 

414212 AA136569 Hs.10848 K!AA0187geneprodud 03 

417916 NM.006416 Hs.B2921 solute canterfamity 35 (OA'-siallc ad 02 

^_ 440749 W22335 Hs.7392 hypothalicai protein MGC3199 02 

65 412715 NKIL000947 H5.74519 prima5e.pdypep8de2A (581(0) 8.2 

428829 R14050 Hs.194051 Homo sapiens mRNA;d3NADNFZpS66B213(fr 02 

447158 AW274731 Hs.157920 ESTs 01 

428330 L22S24 K3.2256 mabix metafioprotetnase 7 (matrilysin, 01 

417412 X16898 Hs.B2112 interteuldn 1 receptor. ^ I OO 

70 424692 AA429834 Hs.151791 KIAA0092geneprQdud OO 

435981 H74319 Hs.188620 ESTs 7.9 

442081 AA401863 Hs.22380 ESTs 7.9 

434988 Ai416055 Hs.161160 ESTs 7.9 

407910 AA650274 Hs.41296 fibroneclin teudno rich bansmembrane p 7.8 

75 420345 AW295230 Hs.25231 ESTs 7.8 

412324 AWg78439 Ks.69504 ESTs 7.8 

420757 X78S92 H3.99915 androgen receptor (dibydioteslosteroner 7.8 

408374 AW025430 Hs.155591 forfcheadboxFI 7.8 

435604 AA625279 Ks.26892 uncharaderized bone marrow protein BM04 7.8 

oO 453160 A)263307 Hs.239884 H2B histone famity. member L 7.8 

424051 AL110203 Hs.138411 Homo sapiens mRNA:cONAOKFZp586J 1922 (f 7.7 

416182 N^.004354 Hs.79Q69 cydinG2 7.7 

452110 T47667 Hs.28005 Homo sapiens cOtMFUl 1309 as, done PL 7.6 
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451099 


R52795 


Hs^5954 


(nterteiddn 13 leceptor, dpha 2 


7.6 


427003 


U194Q7 


Hs,2090 


prostagtamfin E reoeptar 2 (subtype EP2) 


7.5 


443180 


R15875 


Hs^58576 


ciau(fm12 


7.5 


454119 


BE549773 


Hs.40510 


uncoupQng prot^ 4 


7.5 


418693 


Ar750878 


Hs.87409 


QuxxntjospofuEn 1 


7.5 


426581 


AB040956 


Hs.135890 


KlAA1523prot^ 


7.5 


417683 


AW5&6008 


Hs.23gt54 


snkyrin repeal famOy A (RFXANK4ikB), 


7.5 


439569 


AW602166 


H5.222399 


CEGP1 protein 


7.4 


416140 


A1918035 


Hs.301198 


roundabout (axon guidance receptoTi Dros 


7.4 


451230 


BE546208 


Hs.26090 


hypoUielicsi protein FIJ20272 


7.4 


400294 


N9S796 


Hs.278695 


Hbnio sspisns prostsin inRNA, completB cds 


7.4 


436873 


N23874 


Hs.50477 


RAB27A, msflibor RAS onooQsne fonriBy 


7,4 


419743 


AW408762 


Hs^957 


Homo sapiens clone 24416 mRNA sequence 


7.3 


428600 


AW863261 


Hs.242413 


hypolhafica) protein OKFZp434K1421 


7.3 


410252 


AW821182 


H5.61418 


n^Crofibrillar-associated protein 1 


7.3 


424649 


BE242035 


Hs.151461 


embryonic edodsnn developinsnl 


7.3 


428218 


AA424266 


Hs.123642 


EphA3 


7.3 


434158 


T86534 


Hs.14372 


ESTs 


7.3 


413249 


AF167160 


Hs.76251 


DEACVH (Asp^u-Ala^p/Ks) box binding 


7.3 


430261 


AA305127 


Hs^37225 


hypothafical prot^ HT023 


7.3 


429597 


N|yL003816 


Hs.2442 


8 disintegrln snd metaHoprrtefawse dome 


7.3 


423566 


AW976434 


Hs^623 


hypothetical protein FIJ1 1220 


7.2 


433556 


W56321 


Ks.1 11460 


c^unVcalmoddin-dependent protein idn 


7.2 


420871 


AA702972 


Hs.65300 


ESTs 


7.2 


418278 


A1088489 


Hs^937 


hypotheOca! proton 


7.1 


456516 


BE172704 


Hs^22746 


KlAA1610pf^ 


7.1 


417511 


AL049176 


Hs.82223 


choniin>lii(e 


7.1 


411667 


eE160198 




gb:QV1-Hr0413-010200^94i03 HT0413 Homo 


7.1 


414422 


AA147224 


Hs.249195 


Homeot}oxA13 


7.1 


407118 


AA156790 


Hs.262035 


ESTs, Wealdy similar to Z223.HUMAN ZINC 


7.1 


421566 


NM_000399 


Hs.1395 


early growth response 2 (Krox-20 (Drosop 


7.0 


418564 


AA631143 


Hs,278695 


Horno sapiens prostein mRNA, complete cds 


7.0 


437872 


AK002015 


Hs.5887 


RNA binding motif protein 7 


7.0 


448148 


NM_016578 


Hs.20509 


HBV pX assodated protein-8 


7.0 


449523 


NM_000579 


Hs.54443 


cbemotdne (C-C motif) receptor 5 


6.9 


431657 


AI345227 


H&.105448 


ESTs, We^ similar to B34087 bypotheti 


6.9 


457728 


AW974811 




gb:EST386916 MAGE resequences, MAGN Homo 


6.9 


415910 


U20350 


Hs.78913 


cbemokina {C>X3<^ receptor 1 


6.9 


421977 


W94197 


Hs.110165 


nixisomal protein 1 26 homdiog 


6.9 


432161 


AK000400 


Hs.341181 


ESTs, Weaidy shnOar to envebpe [H.8a|rf 


6.9 


432887 


A1926047 


Hs.1 62859 


ESTs 


6.8 


444931 


AV652066 


H8.75113 


genera) transcription fiactor IIIA 


6.8 


421823 


N40650 


H8^8625 


ESTs 


6.8 


426981 


AL044675 


Hs.173081 


KIAA0530 protein 


6.8 


425170 


AU077315 


Hs.1 54970 


transcr^tion factor CP2 


6.8 


411479 


AW846047 




gb:ll-3-CT0214-2912994J52-A12 CT0214 Homo 


6.8 


420954 


AA282074 


Hs.237323 


N-ace^g)uoosanune-phosphato mutase 


6.8 


439492 


AF086310 


Hs.103159 


ESTs 


6.8 


449919 


AI67468S 


Hs.200141 


ESTs 


6.8 


452221 


C21322 


Hs.288057 


hypolhetica) prolan FU22242 


6.8 


431555 


AI815470 


Hs.260024 


ojc42 effector prolan 3 


6.7 


441111 


AI806867 


Hs.126594 


ESTs 


6.7 


457498 


A1732230 


H8.191737. 


ESTs 


6.7 


41 5293 


R49462 


Hs.1 06541 


ESTs 


6.7 


429163 


AA884766 




^:am20a10.s1 Scares GBC SI Hanv}8 


6.7 


433908 


AW298141 


Hs.1 57975 


ESTs 


6.7 


446896 


T15767 


Hs.22452 


Homo salens mRNA for )^2AA1737 prx^in, 


6,7 


431770 


BE221680 


Hs.266555 


S'-y exoritwnudease 2 


6.7 


418575 


AA225313 


Hs.222888 


ESTs, Weakty similar to TRHY_HUMAN TRICH 


6.6 


443745 


AB039670 


Hs.9728 


AL£X1 protein 


6.6 


427958 


AA418000 


Hs.98280 


potassium IntBimfldifltBtemaB conductance 


6.6 


447805 


AW627932 


Hs.302421 


gem]n4 


6.6 


440995 


T57773 


Hs.10263 


ESTs 


6.6 


422173 


BE385828 


Hs.250619 


phort)ol]n-iii(e protein MOS019 


6.6 


431992 


NWL002742 


Ks.2891 


protein kinase C, mu 


6.6 


437052 


AA861697 


KS.120S91 


ESTs 


6.6 


444030 


AW021254 


Hs.135055 


ESTs 


6.6 


416836 


D54745 


Hs.80247 


cboiecystoidnin 


6.6 


447033 


AI357412 


Hs.157801 


ESTs 


6^5 


428927 


AA441837 


HS.902S0 


ESTs 


6.5 


432189 


AAS27941 




gb:nh30c04j1 NCI.0GAp_Pr3 Homo sapiens 


6.5 


448779 


BE042877 


Hs.177135 


ESTs 


6.4 


430598 


AK001764 


Hs.247112 


hypothetical protein FIJ10902 


6.4 


411630 


U42349 


Hs.71119 


Putathre prostate cancer tumor suppresso 


6.4 


432662 


M376400 


Hs.159588 


ESTs 


6.4 


441499 


AW298235 


Hs.101689 


ESTs 


6.4 


441676 


6E564206 


H3.49889 


ESTs 


6.4 


421077 


AK000061 


Hs.101590 


hypothe&d prot^ 


6.4 


452784 


BE463857 


Ks.151258 


hypotheOcsl protein FU21062 


6.4 


452055 


AI377431 


Hs.141693 


hypotheficai protein MGC10858 


6.4 


431578 


M76665 


HS.27521S 


hydroxysteroid (11-beta) dehydrogenase 1 


6.3 


418994 


AA296520 


Hs.89546 


selecOn E (endothelia) adheston molecul 


&3 


452277 


AU>49013 


H5.28783 


KIAA1223 protein 


&3 
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412953 Z45794 Hs.238809 ESTs ^3 

441247 AW118881 Hs.128051 Homo sapiens thymic stromdlymphopGie^ &3 

453390 AA862496 Ks^8482 ESTs 6.3 

- 418450 R84397 Hs.193651 ESTB,WeaWy6iiralartDaIlOTafivefy8p &3 

D 407829 AA045084 Hs,29725 hypothetical prolan FU13197 &3 

425704 U79293 Hs.159264 Human done 23948 mRNA sequence 6^ 

433610 AA806822 H5.112547 ESTs 6J 

446552 AW973853 Hs.20104 hypothetlcai proleb) FU00052 &3 

412977 AA12S910 Hs.191461 ESTs 6.3 

lU 441217 AI922183 Hs.213245 ESTs &3 

443912 R37257 H$.184780 ESTs &3 

429281 AA830856 Hs.29808 Homo8apIeRscONA:FU21122fis,clon8C 6.2 

435021 AA922192 Hs.54709 ESTs 6.2 

425465 L18964 H5.1904 protein kinase Ci tola 6.2 

15 418821 AA436002 Hs.1B3161 ESTs &2 

424511 BE300512 H8.193557 ESTs, Moderately sbnUar to ALU7.HUMAN A 6.2 

448108 AI800470 H5.171941 ESTs &2 

408418 AW963897 Hs.44743 KIAA1435 protein a2 

_^ 450728 AW162923 Hs.25363 pTesenilln2{Alzhsfeii8r(fisease4) 6.2 

20 427078 Ai676062 Hs.111902 ESTs 6.2 

426931 f«1.003416 Hs.2076 zinc finger protein 7 (KOX 4. done HF.1 6.2 

449907 AA004825 Hs.103281 ESTs 6.1 

458509 AA654650 Hs.282906 ESTs 6.1 

437323 AA371145 Hs.194397 leplin receptor 6.1 

25 407902 AL117474 H3.41181 Homo sapiens inRNA;cONAOK!=Zp727C1 91 (fr 6,1 

429302 AU076674 H5.198e99 eidcaryotictranstalionbilialionfiacto &1 

412673 AL042957 Hs.31845 ESTs 6.0 

410150 AW382942 Hs.260024 ESTs 6.0 

421853 AI952677 H5,108972 Homo sapiens niRNA;cONADKFZp434P228(fr 6.0 

jU 446947 AF146747 Hs.232165 potycylhemianitMaveral; cell surface 6.0 

408968 A1652236 Hs.49376 hypothelical protein FU20644 6.0 

409047 AW961434 Hs.31539 ESTs 6.0 

418601 AA279490 Hs.86368 calmegln 6.0 

425710 AF030880 H5.159275 sdutecarier family, member 4 6.0 

35 439820 AL360204 fe^83853 Homo sapidnsmRNA full length insert d)N 6.0 

414905 H40873 Hs.175971 ESTs 6.0 

450295 AI766732 Hs.210628 ESTs 6.0 

412505 AA974491 Hs.21734 ESTs 6.0 

... 428730 AA625947 H8.25750 ESTs 6.0 

40 445413 AA151342 Hs.12677 CGI-147 protein 6.0 

451424 AI662026 Hs.302810 Novel human gene mapping to chomosome 20 5.9 

425628 NM.004476 Hs.1915 folate hydrolase (prostale^peciiic memb 5.9 

435677 AA694142 Hs.293726 ESTs. W^simflar to TSGA RAT TESTIS 5.9 

431359 AW993522 Hs.292934 ESTs 5.9 

45 404632 NM_022490:Komo sapiens hypothetical prat 5.9 

423748 AI149048 Hs.30211 hypothetical proton aJ2231 3 5.9 

449118 R67477 Hs.23103 Betl (S. cerovisiae) homdog 5.9 

405523 C8001409*:gi|7441226Iplri|S31212cdtage 5.9 

448807 AI571940 Hs.7549 ESTs 5.8 

50 404842 NM-021965*:Homo sapiens phosphoghicomuta 5.8 

452598 AI831594 ESTs. Weakly simBar to ALU7.HUMANAIAJS 5.8 

427212 AW293849 Hs.58279 ESTs, Weakly similar to ALU7JHUIi^ ALUS 5.8 

435047 AA454985 Hs.54973 cadheriivlikB protein Vl^ 5.8 

408622 AA056060 Hs.202577 Homo sapiens cONARJ12166fls, done MA 5.8 

55 419038 AW134924 Hs.190325 ESTs S.8 

416913 AW934714 gbiRCI-DTOOOWCI 299^)1 l-ell OT0001 Homo 5.7 

432432 AA541323 Hs.1 15831 ESTs 5.7 

435937 AA830893 Hs.119769 ESTs ' 5.7 

414528 AA1489S0 Hs.188836 ESTs 5.7 

OO 432712 AB016247 HS.28B031 steroK^&desaturase (fungal ERG3. ddte 5.7 

412783 BE276738 Hs.74578 DEAIVH (Asp^u-AlaAsp/Hls) box polypep 5.7 

408826 AF216077 Hs.48376 Honro sapiens done HB-2mRfJA sequence 5.7 

419083 A1479560 Hs.98613 Homo8dpienscOKAFU12292fis.doneMA 5.7 

420184 AA188408 H3.95665 hypothetical protein 5l7 

05 428493 AK001745 Hs.184628 hypothetical protein l=U10863 5.7 

443250 A1041530 Hs.132107 ESTs 5.7 

443324 R44013 Hs.164225 ESTs 5.7 

448164 R61680 Hs.26904 ESTs. Moderately similar to Z195J1UMANZ 5.7 

448172 N7S276 Hs.135904 ESTs 5.7 

/U 433332 AI387347 Hs.44898 HomosapiensdoReTCCCTAOOISI mRNAsequ 5.6 

453455 AA063553 Hs.221931 EST8»WeaMy8imitertoJC1087RNAhdic 5.6 

452242 RSOgse Hs.1S9993 gycosyltransf^ 5.6 

410141 R07775 Hs.287657 Homo sapiens dDNA:RJ21 291 fis, done C 5.6 

418019 R68911 Hs.1762TO ESTs 5.6 

75 450813 AI739825 Hs.203376 ESTs 5.6 

419239 AA468183 Hs.184598 Homo saptens cONA: RJ23241 lis, done C 5.6 

411096 U80034 Ks.68583 mftodiondnal intermediate peptidase 5.6 

451684 AF216751 Hs.26813 C0A14 5.6 

427674 NM.003526 Ks.2178 H2Bhistone family, member Q &6 

OU 407275 A1384186 gbxiw34h07j(1 NCLCGAP^Ut4Homo6apfens 5.6 

417958 AA767382 Hs.193417 ESTs Sl6 

418407 AL044818 Hs.84928 nudear transcription factor Y. beta 5.6 

435176 AA744875 Hs.169413 ESTs 5.5 
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428465 AW970976 Hs.293653 ESTs 5.5 

431316 AA502663 H5.145037 ESTs 5.5 

434804 AA649530 H5.348148 gb:ns44ro5j»1 NCLCGAPJOvl Homo sapiens 5.5 

411990 AW963624 HsJITO? ESTs. Weakly similar to YEW4 YEAST HYPOT 5^ 

416653 AA768553 Ks.193t45 (net3l)oOiion8in1E(funciknal)~ 5.5 

427359 AWD20782 Hs79881 Homo sapiens cONA:FU23006fis. done L 5.5 

432169 Y00971 Hs.^10 phosphoribosytpyiDphosphatesynOielase S.5 

416239 A1038450 Hs.48948 ESTs 5.4 

421470 R27495 Hs.1378 annexinA3 5.4 

408177 AI241733 Hs.43871 ESTs 5.4 

424084 A194Q875 Hs.20914 hypolheOcd protelh FU230S6 5.4 

414171 AA360328 Hs.865 RAP1A. member of RAS oncogene Mly 5.4 

448072 AW59306 Hs.24908 ESTs a4 

420397 NM_007018 Hs.97437 centrosomal protein 1 54 

429250 H56585 Hs.198308 tryptophan rich basic protein 5.4 

453070 AK001465 Hs.31575 SE063. endoplasmic reticulum Iranslocon 5.4 

429165 AW009886 Hs.1182S8 piDStste cancer assodated protein 1 5.4 

418866 T65754 gb:ycl1c07.s1 Stralagene lung (937210) H 5.4 

428364 AA42B555 Hs.311103 ESTs.htoderalelysintilartoALUI HUMANA 5.4 

431467 N71831 Hs.256398 Homo sapiens miWA; cONA Di<FZp434E0528 (f 5.4 

436679 AI127483 H8.120451 ESTs. Wealdysimnar to unnamed protein 5.4 

447188 H85423 Hs.17631 hypotheScal pioleln OKFZp434E2135 5.4 

457211 AW972S65 Hs.32399 ESTs, Weali^ similar to S51797vasQdilat 5.3 

412877 AW029608 Hs.17384 ESTs 5.3 

430412 AW341754 Hs.189305 ESTs 5.3 

423250 B£0$igi6 Hs.125649 chromosome 8 open reading frone 2 5.3 

421433 AI829192 Hs.22380 ESTs 5.3 

435572 AW975339 Hs.239828 ESTs. Wealdy Similar to GAGLHUHAAN RETRO 5.3 

458571 AV653731 H5.282829 ESTs. Moderately Similar to PC4259feiTl 5.3 

441054 AA913591 Hs.126480 ESTs 53 

431725 X65724 Hs.2839 Norrie disease (pseudogitoma) 5.3 

433409 AI278802 Hs.25661 ESTs 5.3 

441102 AA973905 intermediate filament protein syncoifin 5.3 

453387 A!990741 Hs,252809 ESTs &3 

421928 AF013758 Hs.109643 pcJyadenylate binding protein-interactin 5.3 

433923 AI823453 Hs.146625 ESTs 53 

426503 AA380153 gb:EST93093 Skin himor I Homo sapiens cD 5.2 

447574 AF162666 Hs.18895 tousled^ Idnase 1 5.2 

421896 N62293 Hs.45107 ESTs 5.2 

410870 U81599 Hs.66731 homeoboxB13 5.2 

420729 AW964897 H3.290825 ESTs 5^ 

425066 M82882 Hs.154365 E74^ factor 1 (ets domain transcript 5.2 

429467 NM_004477 Hs.203772 FSHD region gene 1 5.2 

447816 Ni^007233 Hs.274329 TP53 target gene 1 5.2 

445553 AB021179 H5.15299 HMBArlndudUe 5,2 

453308 AW9S9731 Hs.323099 ESTs 5.2 

452576 AB023177 Hs.29900 KIAA0960 protein 5.2 

421437 AW821252 Hs.104336 hypothetlcel protein 5.2 

422295 AF051151 Hs.114408 tolMikeieceptorS 5.2 

453942 AW19092O Hs.19928 hypothetic^ protein SP329 5 2 

400424 AJ276316 H5.287374 zinc linger protehi 304 5.2 

430387 AW372884 Hs.240770 nuclear cap binding protein subunit 2, 2 5.2 

452588 AA889120 H8.110637 homeoboxAlO 52 

448944 AB014605 Hs.22599 abophln-llntoracQng protein 1; actlvl 5.1 

413991 H44726 Hs.176090 ESTs &1 

444454 BE018316 Hs,11183 sorting nodn 2 5.I 

446795 Ai797713 Hs.156471 ESTs 51 

407300 AA102616 H5.120769 gban43e07.8l StratagenaHeUceIls393 5.1 

432340 AA534222 0b:r^21d02.8l Na.CGAPJ\A1 Homo sapiens 5.1 

446469 BE094848 Hs.15113 homogentlsate 1.2-dioxygenase (homogenti 5.1 

453293 AA382267 Hs.10653 ESTs 5.I 

408920 AL120071 Hs.48998 fibronectin leucine rich transmembrane p 5.1 

409643 AW450866 HS.2573S9 ESTs &1 

427723 AI355260 H3.279789 histone deacetylase 3 5.I 

438157 AW137011 Hs.49576 ESTs 51 

420344 BE463721 H3.97101 putative G protei^coupled receptor sil 

432251 AW972983 Hs.232165 polycythemia rubra vera 1;celi surface 6.1 

418051 AW192535 Hs.19479 ESTs 5.I 

419926 AW900992 H5,93796 DKI^586D2223 protein 5.1 

425843 BE313280 Hs.1 59827 death assodaled protein 3 51 

440594 AW445167 Hs,126036 ESTs &1 

452449 AW068658 Hs.20943 ESTs 51 

429769 NM_004917 Hs.218366 IcalGlc/ein 4 (prostase. enamel mdbix. p $A 

428898 AB033070 H$.19440B KIAA1244 protein 5.1 

415339 NM_015156 Hs.78398 KIAA0071 protein 5.I 

450671 Ai356967 Hs.43086 ESTs. WeaMy sinto to A46010 X-flrikod 5.1 

444484 AK002126 H3.11260 hypolheecal protein FU 11264 5.0 

433023 AW864793 Hs.87409 thrombospondin 1 5.0 

433862 086950 H5.3610 K1AA0205 gene product 5.0 

451244 AW008798 Hs.343877 hypothefical protein FU20039 5.0 

418624 AI734080 Hs.104211 ESTs 5.0 

430291 AV660345 Hs.238126 C6M9 protein 5.O 

432261 R42216 Hs.12342 Homo sapiens done 24538 mRNA sequence 5.0 
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429922 


Z97630 


Hs.226117 


HI histone famBy, msfnber 0 


S.0 




420218 


AW95e037 


Hs^ 


ribosomal protein U 


5.0 




425242 


013835 


Hs.1 55287 


K1AA0010 gene product 


5.0 




427176 


AW381569 


Ks.40334 


ESTs 


5.0 


5 


431585 


BE242803 


K5262823 


hypotheScd protein FLJ10326 


5.0 




439699 


AF08S534 


Hs.1 87561 


EST&Mod8ratB)v similar bALUI HUMANA 


5.0 




412350 


AI659306 


Hs.73826 


protein tyrosine phosphatase, nocHBcept 


5.0 




450244 


AA007534 


Hs.125062 


ESTs 


5,0 


10 


439731 


AI953135 


K5.45140 


hypothetical protein RJ 14084 


SuO 


434237 


AF1 19908 


Hs.235516 


hypolhetica) protein PR02955 


5.0 




416700 


AI9G380B 


H5^970 


ESTs, Mocterately simQar b ALU5.HUMAN A 


5.0 




445666 


H20899 


H5,13399 


Homo saoians done 2S032 mRNA sMuen» 


4.9 




408331 


NM_007240 


Hs,44229 


dual specHidty phosphatase 1 2 


4.9 


15 


417407 


AA923278 


Hs.290905 


EST s, V\fe3kty simSaf to protease (H.ssp) 


4.9 


431325 


AWD26751 


Hs.5794 


ESTs, Weakly similar (o 2109260A B ceQ 


4J 




444151 


AVV972917 


Hs.128749 


flbdia^nfiDnttacvt-CoA racemasa 


4.9 




401451 






NiyL004496^^Horno sapiens hepatocyte nude 


4.9 




416434 


AW163045 


Hs.79334 


nudear foctor, inteHeuldn 3 fBQulated 


4.9 


20 


419839 


U24577 


Hs.93304 


(dN}sph(dlp8se A2, QfDup VII (platetet-8C 


4.9 


442281 


N34742 


Hs.170065 


Homo sapiens CI3NA FU13492 lis, done PL 


4^9 




420608 


BE548277 


Hs.103104 


ESTs 


4.9 




420911 


U77413 


Hs.100293 


(Kinked N-aoe^)9lucosamine (QcNAc) tr 


4.9 




424332 


AA338919 


Hs.1 01615 


ESTs 


4.9 


25 


430523 


AW45138S 


Hs.161954 


ESTs 


4.9 


449300 


AI656959 


Hs.346514 


ESTs 


4.9 




426657 


NlilL015B65 


Hs.1 71731 


solute carrief (afnily 14 (urea transport 


4.9 




422^9 


AK000181 


Hs.114556 


hypottietical protetn FLJ20174 


4.9 




438321 


AA576635 


Hs.6153 


CGi-48 protein 


4^9 


30 


452744 


A1267652 


Hs,246107 


Homo sapiens mRNA; cONA DKFZp434E082 (fr 


4.9 


418818 


AA228899 


Hs.1 01 307 


Homo sapiens HUT1 1 protein mRNA, parliai 


4.8 




425174 


D87450 


H5.1 54978 


i<lAA0261 protein 


4.6 




430458 


AA479300 


Hs.225706 


ESTs, Weaidy similar to 138022 hypotheti 


4.8 




455497 


AAl 12573 


Hs,278695 


Homo sajriens prost^ mRNA, oomptete cds 


4.8 


35 


416208 


AW291168 


HS.4129S 


ESTs, Wealdy similar to MUC2.HUMAN MUON 


4!8 


449539 


W80363 


Hs.58446 


ESTs 


4.8 




450654 


AJ2455B7 


Hs.25275 


Kfuppet-type zinc finger prot^ 


4.8 




434228 


Z42047 


Hs.283978 


Homo salens PR02751 mRNA, complete cds 


4.8 




407385 


AA610150 


Hs.272072 


ESTs, Wealdy similar to 138022 fiypoiheli 


4.8 


40 


420210 


Ai557257 


Hs.44811 


ESTs 


4^8 


425905 


AB032959 


Hs.318584 


novel C3HC4 type Zinc finger (ring finge 


4^8 




428647 


AA830050 


Ks.124344 


ESTs 


4.8 




434015 


AA84451d 


H&300876 


hypotheScaS protein FLJ13386 


4.6 




441879 


AI521936 


Ks.107149 


novel pnriein siniSar to archasal, yeast 
^^ox and t6udne*fteh repeal protein 4 


4.8 


45 


408990 


AL022395 


Hs.49526 


4.8 


416030 


HI 5261 


Hs.21948 


ESTs 


4.6 




420948 


AB016898 


Hs.1 00469 


inveti^d/lvmnhf^ or fnixettflneaae leukeni 


4.6 




423749 


U09848 


Hs!l 32390 


zinc finger protetn 36 (KOX 18) 


4.8 




410268 


AA316181 


Hs.61635 


six frflnsn idTibfonft Gpithoflot BntfQcn of 


4.7 


50 


452323 


W443S6 


H$.292812 


ESTs, weakly sbrto to T33468 hypotheQ 


4.7 


443684 


AI681307 


Hs.55098 


ESTs 


4.7 




415621 


AI648602 


Hs.55468 


ESTs 


4.7 




420851 


AA281062 


Hs.29493 


hypothefical protein FLJ20142 


47 




424433 


H04607 


Hs^lB 


ESTs 


47 


55 


438703 


AV^80614 


Hs.146381 


RNA hindlnfl mnfrf nrotf^n X chmmasflina 


4.7 


451806 


NM.003729 


Hs!27076 


RNA 3i «4Bnrinal ohosohata cvdasfi 


47 




433293 


AF007835 


Hs.32417 


hypolheticd protein MGC4309 


47 




434973 


AW449285 


Hs.313636 


EST 


4.7 




433560 


AI925195 


H8.130891 


hypothetic^ protetn M6C4400 


4.7 


60 


431587 


Nh4.016179 


Hs.262960 


tisnsient receptor potential channel 4 


4.7 


452260 


AA453208 




RAB9, mgmbcr RAS oncogene famfly 


4.7 




412846 


AW981245 


Hs.55896 


Homo sapiens PAC done RP5>978E18 from 7 


il 




416836 


At655499 


Hs.161712 


ESTs 


4.7 




401558 






ENSP00000220478*:SECRETOGRAN1N 111. 


4.7 


65 


441647 


AA534210 


Hs.285280 


Homo sapiens cOt*^ FU22096 fis, done H 


4J 


443881 


R64512 


Hs.237146 


hypothetical proton FLJ12752 


4.7 




417159 


R13550 


Hs.246773 


ESTs 


4.7 




423349 


AF010258 


Hs.1 27428 


hontfio box A9 


4.7 




458611 


AI266407 


Hs21145B 


w\^v|JowiHf uajniiiviiiuiaiiu j/iuioiii 


47 


70 


444951 


AI783767 


Hs 148635 


ESTs ModeralehrslmnartoALUB HUMAN 1 


4.6 


416774 


At005169 




Ifarm sflriRiifi rDNA' FL J99(WIQ fia ckmn H 

■ tu*imf OB^iiBTw W Unn> rliaK/Vig 119, MWtO n 


4.6 




419465 


AW500239 


Hs.21187 


HomosaptenscONAi FU2306B fis, done L 


4!6 




423242 


AL039402 


Hs.125783 


OEME-6 protein 


4.6 




452527 


A!122843 


Hs.184319 


ESTs. WkUy similar to KIAA1 006 protein 


4.6 


75 


419005 


T86358 


H3.19393t 


ESTs. similar to 154374 gene NF2 


4.6 


403046 






NM.00S656':Homo sapiens transmembrane pr 


4.6 




416602 


Nli4.006159 


Hs.7g3B9 


nel(chidcenHike2 


4.6 




407426 


AF129533 




glKKomo sapiens F-booc protein Ft)l3b (FBL 


4.6 




426011 


AW996095 


Hs.58924 


ESTs. Weakly similar to JC5594 )er1ty gen 


4.6 


80 


442138 


AA445973 


Hs.13303 


Homo sapiens cONA: FU21764 lis. done H 


4.6 


446015 


T30968 


Hs.13531 


hypoOntical protein FU10971 


4.6 




452994 


AW982Sg7 


Hs^1305 


KIAA1S47 protein 


4.6 




446091 


AW022192 


HSJ20D197 


ESTs 


4.6 




432908 


At861896 


Ks.304505 


ESTs 


4.6 
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430226 


6E245562 


Hs.2551 


8drBn8i9ic, b6l&-2*, receptor, surfscs 


4.6 


431673 


AW971302 


Hs,293233 


ESTs 


4.6 


446163 


AA354991 


Hs 14999 


Hnmn un^JM mRNA' cHNA nKP7n7A1Pfl1Q ffr 


4.6 


452355 


N54926 


Hs.29202 


0 p^otet^^oupted rocQptor 34 


4.6 


432093 


H28383 




ah'vlS9en3 r1 Sfiares hraast SNhHRst MonM 


4.6 


446402 


BE244226 


Hs.21094 


RAB1S mernher RAS onemeiiA ItMrdhf 


4.6 


409019 


AW3SS419 


Hs.9615 


cnyosn icoidsAoiy iQht chsin 2i smooth 


Alt 


413812 


AW1 88687 


H<t 4474fl 


ESTs 




414343 






oostcd vGsi^6 cnendvsno piol^ 


AR 


434423 


NM_006769 


Hs.3844 


UM Anmsin onhi 4 


45 


4073S5 


AA846203 


HS.193S74 


ESTs. MteaMtf similar to At U1 HUMAN Al US 


4.5 


421724 


AB037832 


Hs 107287 


KIAA1411 nmfabi 


4.5 


412125 


Y17114 


Hs.73393 


Oyes oOSoiU (UiiidO|J|iII^ iHNIIUKiy 4 


A Ik 
1.0 


437330 


AL353944 


H&S0115 


Mnnn cflnlAiM mRMA* eTMh. nieC7n7fi1 11119 /f 
rHMiiO SopicIlS nlnlws CUfin Uf\r£pf 0141 1 Is \l 


A C 


446720 


lOD 


n9« i^iMto 


CO 19 


A C 


446664 


AI879317 


HS-334S91 


ofMung Muiui 0B| ouminii it i^uku 


A K 


415083 


nIOOcDOO 


Ub 7T17Q 


nodio Sopiens cuna rtJifsoo ns, ciora n i 


A C 


423412 






jNusiow coiK^r BssoMdicu pniiBin q 




433507 


A1817336 


Hs.191791 


ESTs 


A^ 


426966 


AF059214 


Hs.194687 


chofestpfal 9S-hvrimxvln!:p 


AS 


423782 




Hs "^n^ 17 


ESTs 


A f\ 


456362 


AW973003 


Hs.1 79909 




45 


415451 


HI 9415 


Hs 288720 


FSTs ModBratfiitfsifnBartoALUI HUMAKIA 


Ail 


419745 


AFD42Q01 


Hs.93005 


stiin /f^hJcon hfwwdnn) vIm* finnpr iwnl 
akuy ^uiiiMsn iMiHMig^ zinc Niiyai 


AC 


428715 


AW293716 


Hs 53126 


ESTs 


4.5 


429857 


AF089897 


Hs 294030 


lotv^b^viwfaeftjniateH liiMlinn nmlnn A 
vu^AWK/iiKHoavtoiauu iuikmiuii inwusui '* 


4.5 


459561 






ESTs 


A c 


416155 






colo, YfBoNy suiuiar 0} i otuoo nypmiHii 


AC 


414272 




H<t 46Q88 


ESTs 


A C 


424690 


6ES3a356 


Hs 151777 




4.5 


417350 


US0928 


Hs 82001 




A s 


450094 


Ai174947 


Hs^95789 


Homo saoians mRNAr cDNA DKF7n<ifi4D1 1fi4 ff 


4.5 


459324 


AWD80953 




obaa^lZjtt NO CGM> ColSHamBaaiens 


4.5 


429784 


M89796 


Hs^ 


infirnbrsnO'^psinhiQ 4^oinsbi8i subfend^ A 


4.5 


401519 






C1SOQO47fi*-oni973797Qtr0fiyp fl191fi.11l 


A c 

•1.0 


417576 


AA339449 


HS.822&5 


nhasoharihQSvkilvcInafnidfi favmvttrnnsfRf 


4.5 


427464 


N32859 


nSiOr«00 


fittj^l^qr rA/«Anlnr eiiHfomiKr 1 nrnttn ^ m 

nuweaT iBCspQir Buoiamiiy i, giuup u, m 


A K 


417061 


/M67S944 


Ua IBffiiQI 


HhnM sar^ms rHMA Pi l19n_T1 fie rhiui UP 

mnu aopicns wnn ru i«uoo ns, cnne nc 


A A 


418858 




H&.21145 


IwmlhAn^iri nmloln RHnmunii 9 
nypuuioiKoi piuiBin rsuiiCMivivs*^ 


A A 


433517 


AW022133 


Hs]l89838 


ESTs 


4.4 


429559 


AI965345 


Hs.26425 


ESTs 


A A 
1.1 


452338 


AWG0RQ90 




ziiic HngBi piuiein / o ^uooo/ 


A A 
1.1 


409190 


AU076536 


Hs.50^ 


MrtfTineaf^n«rT^l^fRM1?7nfl« HnnaPI 


A A 

1.1 


444922 


nJ9c 1 f 3V 




null IV Sapiens cunn riij lOrSZ ns, ClOno rL 


A A 
1.1 


416547 


H62914 


Hq 9fiflQ4R 


P^^Tr Wnalclv ftlmtbr in Pr^9(;Q fpfritln 


AA 
1.1 


430320 






uiiLndTdCionioQ ny(MUiaianus piuLBin nuu 


A A 
1.1 






Me 'nRAn 


nomo sapiens cuna rU ioU?/ tis, ciono nl 


A A 

4.4 


438497 


AAB0B725 


Us. 9Q1719 


CO 1 S| VTBaKly SBinlar to lODUu nypOulBD 


A A 
1.1 


434384 


AARjtlQIA 
/vwOi9lv 


Ue IfiOfUQ 


PQTc 
CO IS 


A A 


444564 


All 67877 


no. ittji lo 


ESTs 


A A 
1.1 


447500 


AI381900 




PQTb 
cola 


4.4 


425259 


ALD49280 


Hs 1S53Q7 


noiTw/ sopiBiis miMini cunn ui\r£p90*H\ i*)o yi 


A A 
1.1 


408380 


AF1 23050 


Hs 44532 


rfhihintiHin 

UlUUUfUIUII 


A A 

1.1 


448766 


AI473827 




ESTs 


A A 
1.1 


417421 


AL1 38201 


Hs 82120 


mictpnr rpcftnhv ciihfAmllv d nminA m 


44 


432810 


AA8634X 




CO IS 


A A 
4.4 


437612 


AI582291 


Hs 16846 


FSTs WesUvdmilflrtoQdHUnidf^iriMnii 
CO to, vvcHMy miiiuol VJ wtnuui uconsoijiJ 


A A 

1.1 


447247 


AW359351 


Hs.287955 


Hftfnfl ^flniftns cHMA PI JIMQfl ftt idnmiMT 


4.4 


434022 


R16374 


H« 117Q(Vi 

TiOt 1 1 (930 


ESTs 


A A 
4.4 


422365 


AF03S537 


Hs 115521 

rio* 1 1 WW& 1 


ncvw \ycaai iiuiiiuiuy/*iu\t7i waiaiyuw eUU 


A A 

1.1 


417728 




no<^^f 9w 


KIAAI*!?^ nmlnhi 


A A 
1.1 


420407 


AAfi1473? 




B|M^iy9KC€uiu9^pcviiiC rcaponao o*u 


A A 

1*1 


425610 


AI923627 


Hs.31903 


ESTs 


4.4 


432882 


NM_013257 


Hs 279698 




A") 
1.0 


448550 


AL161983 


Hs 2141S 


ikma sanlens mRNA* itflNA nKP7ii7S1K9fl94 ff 


A^ 
1.0 


410227 


AB009284 


Hs 81159 


exostoses (tniA^]te)~likB 2 


A 1 

1.0 


417845 


nLI WhOI 


Ue R971Q 


rtOino sapiens nwlAi cuna Ul\r^KwOr I02c (T 


A 4 

4.J 


452335 






CCTe 

CO IS 


A Q 

4aI 


429716 




nsu( 11900 


mhII^vom Kjma VI tl AtnKA 1 

Cuiisgon, qfpe ajh, aipna i 


A 9 
4,0 


^OUOIO 


AL135018 




nypoinsucBi proiein Muv^Mo 


4.0 


408527 


Hs 33074 


Hnmo fisntsns dfinfl llUlAfiP*3fiOBS1Q mRMA 


4,3 


419852 


AW503756 


HS.286184 


hypothetical protein dJ5510Z5 


4!3 


424534 


D87682 


Hs.150275 


KIAA0241 protein 


4,3 


424850 


AA151057 


H&153498 


chromosome 16 open reading (rame 1 


4.3 


435023 


Ar692SS2 




gb:wd73f12j(1 NCLCGAPJUi24 Homosapiefls 


4.3 


442914 


AW188551 


Hs.99519 


hypothefica} protein FIJ14007 


4.3 


452679 


Z42387 


Hs.83883 


transmembrane, prostate androgen induced 


4.3 


421163 


AA375974 


Hs.32450 


ESTs, We8My8imQartDT23762hypolhefi 


4,3 


420405 


AA743396 


Hs.189023 


ESTs 


4.3 


450330 


AW500775 


H3.24817 


hypolheHcal protein FU20136 


4.3 


423998 


H29138 


.H8.157113 


coenzyme Q. 7(rat yeast) lK)molog 


4.3 


439764 


726535 


HsJa744 


hypothetical protein M6C13105 


4.3 


446351 


AW444551 


Hs^O 


X 001 protein 


4.3 
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431583 AL042613 Hs562476 S-adenosytmelhionlne dscartxixylase 1 4.3 

414312 AA155694 H8.191080 ESTs 43 

438379 N23018 Hs.171391 C4enninal binding protein 2 4.3 

41192B AA688624 Hs.197289 rab3 GTPase-acflvafing pioWa notwata 4.3 

414880 AA74333I h8mog!ob^^ alpha 2 4.3 

423104 AJ005273 Hs.123847 antigenic detennbiant Of iQcA prolan (m 4.3 

440840 AW629666 ESTs. VfeaWy similar to S64054 hypofteB 4.3 

449802 AW901804 Hs53984 hypotheticat protein FU20147 4^ 

434874 N62448 Hs.293970 methyWonate^einialdehydedehydregenas 

431429 AF072B13 rotictdonS r- i ^ 

413492 D87470 Hs.75400 KIAAOSBO protein 4.2 

419929 U90268 Hs.93810 cerebral cavernous malfDrmaSons 1 42 

435846 AA700870 Hs.14304 ESTs 42 

441523 AW514263 Hs.301771 ESTs. Wealdy similar to ALUFJftJMANHH 4!2 

450546 AA010200 Hs.175551 ESTs 42 

414222 AL135173 H8.678 soititol deiiydrogenase 4!2 

416445 AU)43004 Hs.79337 WAAOI 35 protein 42 

409151 AA306105 Hs^785 SEC22, veslde trafficking protein (S. c 4.2 

412707 AW205373 Hs.16443 Homo sapiens cDNA: FU21 721 fis. done C 4.2 

442991 8E281238 Ks.8866 hypottieOcal protein FU20424 4.2 

447439 AA313565 HS.14S020 ESTs. Weakly similar to KIAA1205 protein 4.2 

449885 AW296669 Hs.66095 ESTs 42 

407204 R41933 Hs.140237 ESTs, Wbaldy similar to ALUIJiUMIAN ALUS 42 

450377 AB033091 fe.74313 K1AA1 265 protein 42 

419847 AA34B947 H5.91816 iiypoawtJcal protein 4.2 

442049 AA310393 Hs.190044 ESTs 49 

430669 AW9696S7 Hs^1029 ESTs 42 

424181 AL039482 Hs.142517 Homo sapiens mRNA; cDNA OKF2p434P0810 (f AA 

443834 H73972 Hs.134460 ESTs 41 

426215 N77630 Hs.13895 Homo sapiens cONAFU11654 Us. done HE 4.1 

422634 NM_016010 Hs.1 18821 CGI^ protein 41 

445895 E>29954 Hs.13421 KIAA0056 protein 41 

447659 AL049985 Hs.19180 Homo sapiens mRMA:cONAOKF2pS64E1 22 (fr 4.1 

407198 H91679 9b:yvO4a07.s1 Soaresfelfll liver spteen 4.1 

447752 M73700 Hs.105938 lactolransferrin 41 

^2855 NI^001839*:Horoo sapiens calponin 3, acid 4.1 

443161 AI038316 ob:ox48c08j(1 Soares.toteLfctus Nb2HF8 4.1 

415827 H17462 Hs.23079 ESTs ' 41 

408161 AW952912 Hs.300383 hypotheticat protein M6C3032 4!l 

418365 AW014345 Hs.161690 ESTs 41 

429525 N92540 Hs.205353 ectonudeoslde triphosphate diphosphohyd 41 

435401 R44477 Hs.10056 hypothetical protein FU 14621 4.1 

446657 A1335191 Hs.260702 ESTs, WtoMy similar to 21 09260AB cell 4.1 

421141 AW117261 HS.12S914 ESTs 41 

430335 D80007 Hs.239499 KIAA01 85 protein 4.I 

435020 AW505076 Hs.301855 DiGeofge syndrome critical region gene 8 4.1 

445309 AL157474 Hs.12504 fikelyorthotog of mouse Arkadia 4.1 

443547 AW271273 H5^3767 hypothetical protein FU12666 4.1 

410763 AF279145 Hs.8966 hypolheOcal protein FU21 776 4.1 

410592 R94088 Hs.43569 ESTs 41 

423698 AA329798 Hs.1098 DKFZp434J 181 3 protein 41 

428634 AA811845 Hs.106290 Keteh motif containlnfl protein 4.I 

430253 AK001514 Hs.236844 hypotheUcal protein FU10652 4.1 

425211 M18667 Hs.18B7 progaslricsln (pepsinogen Q 4.0 

443273 AI042063 Hs.132156 ESTs 40 

428055 AA420564 Hs,101760 ESTs 4*0 

425707 AF115402 Hs.1l713 E74^ factor 5 (ets domain transcftot 40 

451294 AI457338 Hs.29894 ESTs 40 

430519 AF129534 H5.49210 F^ only protein 4 40 

441765 R53790 Hs.23294 hypothetical protein FU14393 4.0 

431117 AF003522 Hs.250500 delta (DrosophilsPa 1 4.0 

446354 AW449650 Hs.346335 ESTs 40 

451468 AW503398 H3.293663 ESTs. Moderately sbnHar to 138022 hypo! 40 

409706 BE15S773 Hs:213207 ESTs 40 

447082 T85314 . thioredoxin^iKe 40 

418594 A1732083 Hs.187619 ESTs 40 

426501 AW043782 H5.2936t6 ESTs 40 

436797 AA731491 H8.334477 hypothetical protein MGC14B79 4.0 

41K8B H51299 flb7p07c06.8l SoaiBS breast 3NbHBst Homo 40 

419855 NM.007020 Hs.93502 UI-snRNPUndino protein homotoQ(70kD) 4.0 

425920 Ati)49977 Hs.162209 daudinB / 

416658 U03272 Hs.79432 fibrillin 2 (congenitelconlraetoialara 40 

452367 U71207 HsJ?9279 eyes absent (D/osophSa) homoiog 2 4.0 

433209 AB040907 HS.27B436 KIAA1474 protein 40 

428801 AW277121 HsJt54881 ESTs 40 

419629 AB020895 Hs.91662 K1AA088B protein 4.O 

451061 AW291487 HS.2136S9 ESTs, WeaWy similar to WAAI 357 proteb 3.9 

420544 AA677577 Hs.98732 Homo sa^^ensaromosome 16 BAG done QT 3.9 

^00695 C11002514*:gi|11280151!plr||E82756brta. 3.9 

417173 U61397 Hs^l424 uUquiOnJIke 1 (sentito) 3.9 

419985 H66373 H5.58S6 ESTs. Highly simiter to l»A393J16.3IH8a 3.9 

4^0 N3S938 Ks.199429 Homo sapiens ii^cONAOKFZp434M2216(f 19 

433759 AA6B0003 H5.109363 Homo saptens cONA: FU23603 fis, done L 3.9 
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428882 


NM 000346 




RRY fw ripltwiiili^Mi mflfam YLhm Q leak 


3.9 


413450 


Z99716 


H5.75372 




3.9 


427615 


6E4 10107 


Uc 17Qfl17 
riit. 1 f 90 1 f 


nf^.fl9 nmfnln 


3^ 


450649 


NM.001429 


Hs^5272 


E1A WndinQ protBin p300 


3^ 


445798 




Hs. 13321 


itssfTsnQcd Lwnyc Vision S6(]U6nc6 


3J 


416198 


H27332 


Hfi!99S98 


njiwuniuca ihuusui ivhs^amo 


3.9 


431147 


AI7677$1 


Hs.20300 


ESTs 


3.9 


439192 


AW970536 


Hs 105413 


ESTs 


3.9 


446157 


6E270828 


Hs 131740 


Hnnvi Rar^nc oHMA* Fl I994i9 lifi rlmw \i 
nviNiiU od|Mcii9 M/iw r wcODc Hot uum n 


19 


451900 


AB023199 


Hs.27207 


KIAAD9&2 Dmloln 


19 


420802 


U22376 


Hs.1334 


tiLmiih m/!tm mimlnMaclnBte wiral nnnmiBfi 

TTiiyo twiaii iiqrcwuiaSHiMS vuu wmyon 


As 


426083 


AW962712 


Hs 126712 


ESTs Waa]dvfifaiiQartoAF1910%1 E^GS 


3.9 


442320 


A1287817 


Hs. 129536 


ESTs 


19 


447541 


AK00O26d 


Hs 18800 




3.9 

A9 


444636 


T96667 


Hs 17877 


ESTs 




432103 


T15803 




oi'olotn nhnentifltami 4 IfnmiAilu 9R\ Mil 

piotein pnu9|inaicBO q (iDiiiiBny co/t gbi 




412576 


AA447718 


Hs.107(K7 


ESTs 


19 


423952 


AWS777R7 


ns. 1 joiu* 


KIAAnfliv) nm)Atn 


J.9 


437916 


RFSS674Q 


H$.20999 


hvnnthpilr^ fwntoln PI 19^1^9 
nypouicucul piuiBin tljaj mc 


1 Q 


442242 


AV647908 


Hs.90424 


Homo ^aniAns cDNA! PI I9^9fi'> fi« rinnn M 


0.9 


447280 


BPfi17Qfl7 


n3.9f 


CO IS 


9 0 

0.9 


421750 


rWUW 1 WW 


Ue 107879 


hvrmfliplirfll nmfpin PI l9n7R1 


0.0 


450580 


N40087 




ESTs 


o«o 


432359 


AAD76049 


H5.27441S 


Hanio«anipfKcrMAP1J1099Qfk rbuwHP 


0.0 


429323 


NH.001649 


Hs!2391 


a^iitOi ^vwmiI) /NCiivyua leoviatiivo 


1R 


407332 


Ai80156S 


Hs.200113 


HQninsaDieKcDNAPlJ1137<)fi<; cbrrnHP 


3.8 


426413 


AA377823 




<]b'EST9080S Svnovtal csicama Homo Rsnlftn 


3.8 


412652 


AI801777 


HS.28Q024 


ESTs 


18 


426225 


AA769045 




ob»^0h07^1 HCt RfiAP fiPRI Honn snnlprM 


3,8 


437816 


At823445 


Hs 280699 


ESTs 


3.8 


444534 




Hs.42294 


CO IS 


J.0 


459241 


AA032276 


Hs.99010 


ESTs Moderately &!niitar!oT14342NSD1 


3.8 


408242 


AA2S1594 


H143913 


PiBFI Qsno product 


0,0 


443484 


AI091458 


H5.134559 


ESTs 


3.8 


422083 


MM 001141 




aitKiHuuiioio 19-iipuAyyanaaB, Scconu lyp 


1 ft 

0.0 


428043 


T92248 


Hs.2240 


iiismolobin 


3.8 


407192 


AA609200 




Qbiflf17o0?.s1 SosTSS testis NHT Homossp 


3.8 


421590 


AF004715 


H110S940 


ifiricv f maiKA) homolnoJkti 


3.8 


453020 


AL162(^ 


Hs. 31422 


Hmm earAiiM mRMA' rHMA rWP7nA.1dM99Q ffr 
noino oapRiiS nii\iv>, cuiv\ Unrcp't^mca \ff 


1ft 


437372 


AA323968 


Hs 283831 


hvDottifi&:d Dfnteh DKF7nS47G1fi3 


3.8 


432278 


AL1 37506 


Hs 274256 






407917 


U63139 


Hs.41587 




0.0 


408547 


AA574291 


Hs>57837 


CO IS 


1 fl 

0.0 


427635 




Ua 17QqQ9 
n9.1f 970« 


tumor protein p53-bindlng protein 


0.0 


430968 


AW972830 






1ft 
OiO 


452234 


AWD84176 


Ha 223298 




1 A 
0.0 


453898 




Hs.42746 


ESTs 


1 A 
0.0 


408875 




ns.*}ooirf 


ni^P7P^^R1Rn nmlotn 




431197 


AL038S96 


Ha 25074'i 


fMlvmpr»CA fRMA) III /nNA HimrtMfl /R9k 


1ft 
0.0 


434072 


H70854 




nuniO Sa|wn9 r 1 UOc nirVtni wmpiOlB 009 


1ft 
0.0 


438138 


R98299 


H< 177(;n9 


P<:Te 
COI5 




447908 




Ue 1QQQQ 


u}\r^r90oi\u^ proiein 


0.0 


415992 


C05837 




hvrmthnllrfll rvniptn PI 11 TC<U 


17 
0.r 


435655 


AW10S663 


Hs.6947 


HSPC069 orateln 


17 


430807 


AW973521 


Hs!247324 


ilUttJUIwIIQira] IIUOSQIIial pfOwlll O It 


17 

0.f 


439398 


AA2fi42fi7 


Ue 991 end 


CO IS 


1 T 
0./ 


448515 


H68441 


13528 




3.7 


445242 


BE156478 


H5.2IIO8 


pCiTs WRotrh/ ebntbtr fn All 11 UIIMAMAIIIC 


17 

o.r 


418576 


AW96B159 




cpiuiaiiol mjMum cnannei 4, vQ 1 "Uiw n 


IT 
0.f 


436024 


AI800041 




ESTs 


17 


407783 


AW99G872 


H5.I72O28 


3 tfsinlBQnn find rt^lflitopj^i^tfljfmso domfi 


17 


436207 


AA334774 


H3!l2845 




17 


442861 


AA243837 


Hs S7787 


ESTs 


17 


448207 


AI47549Q 




CO IS 


1 7 
0.1 


450628 


AW382S84 


^204715 


ESTs 


1^7 


432606 


MM 009104 


H5.3O68 


ftrflfiTuma W fcorintt nmf Aaen nranvuftia 9* 

SiiMUjino f\ ^senno pnweasoi QionzyniB 


17 
0./ 


438613 


C05569 


Hs 2431 99 




17 
0./ 


443031 


AW134fi<)6 


Hs.49418 


ESTs 


1 7 
0.1 


447608 


AW905D49 


Uq laqcc 

IK). 10999 


nuiiiO Sapiens cunn ru«uoor ms, cions rsn 


1 7 
0.r 


4flAai9 


nrclMOId 




Inlmwlhilfv mmwlmmn »Mmw<iliili«4 mvLJk 

inutfwffililicw iiiijiiiWilllU^tfiQUurolnil CSiCIU 


17 


412777 


AI335773 


Hs.270123 


ESTs 


17 


424602 


AK002055 


Hs.151046 


liypothetical protein RJ1 1193 


17 


424882 


AI379461 


H3.153635 


{ar upstream etement (FUSE) Irindlng prot 


17 


425898 
443323 


AA385&49 


Hs.269478 


ESTs. Weakly similar to PC4259 Min 


17 


BE560621 


Hs.9222 


estrogen receptor tUnSkiQ sits assodate 


17 


452129 


AW291379 


Hs,212827 


ESTs 


17 


453927 


AA082465 


Hs.125031 


duQne/ethanolaminephosphctransfefase 


17 


432336 


NM.002759 


Hs.274382 


proiein Mnase. intefferon-indudtile dou 


17 


425988 


BE045e97 


Hs.274454 


ESTs. Weakly similar Id 138022 hypotheU 


17 


443123 


AA094538 


Hs.272808 


putative transcription regutalkxi nudea 


17 


445467 


AI239832 


H5.15617 


ESTs, Weakly slniilar to ALU4J1UMAN AlOJ S 


17 


423855 


AA331761 


H5.254859 


ESTs 


17 


403790 






NM.001334*:Hano saptera cattepsin 0 (CT8 


17* 
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434470 AA63481B Hs.29813B ESTs 17 

4S2032 BE24400S Hs.27610 mtinoto sck}- 8fui tnteifen^ 3.7 

420026 AI831190 HS.16G676 ESTs 3.7 

421838 AW88108g H5.IO88O6 Homo sapiens mRNA:cONADKFZp566M0947(r 3.7 

D 449916 T6052S Hs.299221 pyruvate dehydrogenase kinase, isoenzyme 3.7 

452270 AW9750t4 Hs.26 fetncheiatasa (pnibporphyrta) 3.7 

445941 A)267371 Hs.172636 ESTs 3.6 

416882 A)633044 byptophanyitRNA synthetase 2 (mitochon 16 

412533 AA679863 Hs.69606 ESTs 3.6 

10 431740 N75450 Hs.183412 ESTs. Moderately similar to AF1 16721 67 3.6 

442710 At015631 Hs.23210 ESTs 3.6 

448212 AI475858 gb:tc87d07j(1 NQ.CGAP.Cai Homo sapiens 16 

413431 AW248428 H5.75355 utnquiiinK»i^aBng enzyme E2N(homob 16 

453038 AW292415 Hs.20509 HBV pX associated protein^ 16 

IS 427661 AA410292 Hs.104761 ESTs 16 

420923 AF097021 Hs.273321 differentially expressed in liematopQieli 16 

414844 AA296874 Hs.77494 deoxyguanosine kinase 16 

441866 BE464341 Hs.21201 neciin 3; DKFZP566B0a46 pfotsin 16 

406815 AA833930 Hs.288038 tRNAlsopentenytpyrophosphatetransferas 16 

20 423482 BE280172 Hs.12922B galactoklnase 2 3.6 

424677 U09414 Hs.151689 zinc linger protein 137 (ctone pHZ-SO) 16 

430160 AW968210 Hs.293g^ ESTs. Weakly simiiar lo AIUC_HUMAN lltl 16 

433572 BE281165 Hs.288038 TlS-assodated serlne^ne protein 1 16 

433887 AW204232 Hs.279522 ESTs 16 

25 452670 AF068227 H$.30213 cemid-lipofusdnosis. neuronal 5 16 

411890 H92738 Hs.75811 N-acylsphlngosine amidotiydrolase (add c 16 

432728 NIA.006979 Hs.27e721 Hl^dassil region expressed gene KE4 16 

431724 AA514535 Hs.283704 ESTs 3.6 

435703 AW630133 Hs.83313 610)03 protein 16 

30 445674 BE410347 Hs.13063 tFansotption factor CA150 3.6 

414178 AW957372 H5.46791 ESTs, Weakly similar to 138022 hypotheU 16 

416111 AA033813 Hs.79018 ctiromatin assembly (actor l.subunit A ( 16 

432840 AK001403 Hs.279521 hypothetical protein FU20530 16 

439726 AW449893 Hs.293707 ESTs. Wb^ similar (oP8598 zinc fing 16 

35 445704 AI493742 Hs.167700 ESTs. Moderately similar to 138022 hypot 16 

449115 AW959952 Hs.37526 ESTs, Weakly 8ln«artoAF0909441PRO06 16 

439379 AA835002 Hs.125611 ESTs 16 

414646 AA353776 Hs.901 C04B antigen (B^ membrane protein) 16 

434579 T55956 gb7b35f05.r1 Stratagene fetal spleen (9 16 

40 451367 AA923729 Hs.26322 cell cyda related kinase 16 

420285 AA258124 Hs.293878 ESTs. Moderately siiifilar to ZN91JHUMANZ 16 

431263 AW129203 Hs.322915 ESTs 16 

431952 Z70695 Hs.272240 Homo sapiens cDNAFUl 1086 fis. done PL 16 

439092 AA830149 gb:oc44fOB.s1 NCLCGAP.GCBI Homo sapiens 16 

45 411562 AL050201 Hs.70769 hypothetical protein DKFZp586E1923 15 

417771 AAB04698 Hs.62547 retinotoacM receptor responderflazaTO 15 

419440 AB020689 Hs.90419 KiAA0882 protein 15 

417333 AL157545 Hs.173179 bromodomain and PHD linger containing, 3 15 

419241 AA523939 Hs.165258 ESTs 15 

50 410762 AF226053 Hs.66170 HSKM-B protein 3.5 

413384 NM_000401 Hs.75334 exostoses (mulQple) 2 ' 15 

424081 NM.006413 Hs.139120 ribonudease P (30kD) 15 

, 426004 AA449563 Hs.151393 glutamat&cystelnelgase. catalytic 8ub 15 

420390 AA330047 Hs.191187 ESTs 3.5 

55 416662 T25853 Hs7538 ESTs 15 

433280 AA581404 Hs.269037 Homo sapiens cDNAFUl41358s« done MA 15 

433285 AW975944 Hs.237398 ESTs 15 

421991 NM_014918 Hs.1 10488 KIAA0990 protean 15 

413950 AA249096 Hs.32793 ESTs 15 

60 431318 AA502700 Hs.293147 ESTs. Moderately similar to A48010X4n 15 

400289 X07820 Hs.2258 matrix melalk)protelnase 10 (stromelysin 3.5 

411149 N68715 H3.269126 ESTs 3.5 

417601 NM.014735 Hs.82292 KIAA0215 gene product 3.5 

^ _ 418334 AA319233 Hs.5S21 ESTs 15 

65 422583 AA410506 Hs.27973 K1AA0874 protein 15 

425717 X07282 Hs.171495 reSnote acid receptor, beta 3.5 

425856 AA384906 Hs.98927 hypothetical protein FU13993 15 

434569 A1311295 Hs.344478 K1AA0196 gene product 15 

419436 AA991639 Hs.242413 hypothetical protein DKFZp434K1 421 3.5 

70 437296 AA350994 H&20281 KIAA1700 15 

432697 AW975050 H3.293892 ESTs. Weakly simDar to ALU4^HUMAN ALUS 15 

439605 AF086431 Hs.22380 ESTs 15 

442145 AI022650 Hs.8117 eibb2-lnterac(ing protein ERBIN 15 

432589 AL135725 Hs.131708 ESTs 15 

75 425910 AA8307g7 Hs.184760 CCAAT-box-binding transcription factor 15 

412095 A1624707 Hs.5921 Homo sapiens cDNA:FU21 592 lis, done C 15 

421129 BE439899 Hs.89271 ESTs 15 

429433 AA452899 Hs.213588 ESTs. WeaUysbnilar Id KIAA1353 protein 15 

434839 AI743069 Hs.134736 ESTs 15 

SO 435166 AI391470 Ks.l586t8 ESTs 15 

448986 AW372914 Hs.86149 phosphoinosltol 3i)hosphate4)indIng piot 15 

449318 AW236021 Hs.78531 Homo sapiens, Sindv to RlKENcONA 5730 15 

453311 AW104911 Hs.126707 tvpothe&al protein RJ1 1457 15 
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456497 


AW9679S6 


Hs.123648 




413786 


AW613780 


H8.13S00 




430935 


AW072916 




5 


439904 


AW892676 




449909 


AA004681 


Hs.59432 




445817 


NM_003842 


Hs.13340 




419594 


AA013051 


Hs.91417 




417251 


AW015242 


Hs.99488 


10 


423545 


AP000692 


Hs.129761 


438527 


AI969251 


Hs.1 15325 




453916 


AW974874 


Hs.75212 




430200 


BE613337 


HsJ234696 




4371B7 


AL080208 


Hs.306325 


15 


423645 


AI215632 


Hs.1474e7 


432370 
434966 
404571 


AA308334 
AA657494 


Hs.274424 




418727 


AA227609 


Hs.94834 


20 


436374 


AA400709 


Hs.96716 


452721 


AJ269529 


Hs.301871 




427472 


AA522539 


Hs.131250 




418245 


AA088767 


Hs.B3883 




402031 






25 


407137 


T97307 




449603 


A1855662 


Hs.197698 




425692 


D90041 


Hs.155956 




436B05 


AA731533 


Hs.270751 




437387 


AA749316 


Hs.271879 


30 


407908 


BE37975a 


Hs. 110853 


430144 


At732722 


Hs.187694 




416292 


AA179233 


Hs.42390 




423800 


AA331156 






439375 


AA689526 


Hs^249 


35 


429084 


AJ001443 


HS.19S614 


432886 


BE159028 


Hs.279704 




408267 


AW380525 


Hs.343564 




408832 


AW085690 


Hs.63428 




422564 


Ai14e006 


Hs,222120 


40 


424842 


AA034127 


Hs.153487 


429984 


AL050102 


Hs^7209 




435256 


AF193765 


Hs.13872 




435137 


A1056769 


Hs.133512 




440348 


AW01S802 


Hs.47023 


45 


442910 


A)365130 


Hs.11307 


413242 


6E243910 


Hs.9082 




445469 


AW298370 


Hs.153714 




449517 


AW50O106 


Hs.23643 




425354 


U62027 


Hs.155935 


50 


433037 


Nl\/L014158 


Hs.279938 


423044 


AA320829 


Hs.97266 




434128 


W93170 


Hs.284164 




449845 
423590 


AW971183 
AW952412 


Hs.9663 
Hs.65674 


55 


440370 


AA884000 


Hs.8173 


438825 


BE327427 


Hs.79953 




446874 


AW968304 


Hs.56156 




408101 


AW968504 


Hs.1 23073 




419131 


AA406293 


Hs.41167 


60 


423176 


A1033140 


Hs.124983 


433764 


AW753876 


Hs.39982 




440658 


H29142 


Hs. 143032 




411558 


AA102670 


Hs.70725 




452815 


AA418841 




65 


421234 


AA907153 


Hs.israso 


424075 
412045 


A1807320 
AA099802 


Hs.227630 
H$.83883 




420041 


AB005142 


H3.94592 




426174 


AA547959 


H5.1 15638 


70 


427083 


NM_006363 


Hs.1 73497 


441021 


AW578716 


Hs.7644 




436664 


AW197887 


Hs!2S3353 




441124 


T97717 


Hs.t19563 




444169 


AV648170 


Hs.58756 


75 


421476 


AW953805 


Hs.21887 


422165 
455100 


AU)41199 
BE160198 


Hs.1481 




450747 


AI0&4821 


Hs.318535 




401197 






80 


433404 


T32982 


Hs.102720 


422546 


AB007969 


H8J0147B 




443884 


N20617 


Hs.194397 




414341 


D80004 


H8.75909 




425292 


NiyL005e24 


HS.155S45 



ESTs, W^aUy simDar to AF108460 1 ubinu 


3^ 


EST8 


3J 


zinc finger protdn 131 (done pHZ-lp) 


3.5 


gb:CM3-(M)004-260300>131-c12 NN0004 Homo 


3JS 


ESTs 


3^ 


histono scctyilrBiisfBfsSB 1 


15 


topofsomerase (DMA) 1) Unding protein 


3.5 


ESTs. Weakly similar to YK54_YEAST HYPOT 


15 


chrcxnosotne 21 open reading frame 5 


15 


RAB7, member RAS oncogene femily^ike 1 


15 


Ofnllhtne rfBCBibpjtytesB 1 


3^ 


gembiln 


3.4 


Homo sapiens mRNA; cONA DKFZp585C1523 (f 


3.4 


ESTs 


14 


N'Sce^neunsnlnlc edd phosphate synthe 


14 


G^uil66iD4.s1 NCLCGAPJ'rS Homo sapiens 


3.4 


NM.01S902*:Honv> sapiens progestin induce 


3.4 


ESTs 


3.4 


ESTs, Weakly similar to T17210 hypofheii 


3.4 


solute carrier family 37 (glycerol-3-pho 


14 


transposon^lerived BusterS trsnsposiase^ 


3.4 


transmemhrsnB Dmslata andmoen inHirrpH 




ENSP00000251056*:Rasina membrane csddum 


3.4 


gb:ye53h05.s1 Scares fetal Hver spleen 


14 


ESTs 


14 


N^ce^ttransfeiBse 1 (ar^amb^ N*ace^ 


3.4 


ESTs 


14 


ESTs. Moderatdy similar to ALU1_HUI\AAN A 


14 


uncharacterized l)ematopoietic stem/|)ioge 


14 


ERGL protein; ERQO-53-fike prntetn 


14 


nasopharyngeal carcinoma susceptibility 


3.4 


gb:EST35034 Embryo, 6 week, subtracted ( 


14 


stenid dehydrogenase honnk^ 


3.4 


spncbig factor 3b, subunit 3, 13(^D 


14 


chromatin ecoessibility comi^ex 1 


14 


tubuTtn-spedfic chaperone e 


3.4 


ESTs, Weakly simDar to Z195 HUMAN ZINC 


3.4 


ESTs 


3*4 


signal bansducing adaptor mdecule (SH3 


3]4 


hypothetical protein FU21617 


14 


cytokine-fike protein C17 


14 


ESTs 


3i4 


ESTs 


14 


ESTs, Weakly simSar to T19326 hypotheQ 




nudeoporln p54 




complement^lQ tumor necrosb factor-rel 




serineAhreor^ne protein kinase MASK 


3.4 


cwJiptoi luinl component 3a roceptw 1 




HSPC087 prol^ 


34 


protocadherin 18 


14 


protein x 0004 


3.4 


Dnal f Hsd40) homoioa suhfamSv nwunhA 


AH 


ESTs, Weakly similar to A40348 Elav/SeX' 


3.4 


hypothefical protein RJ 10803 


3.4 


ESTs 


34 


ESTs 


3.4 


C^C2-re!3ted protein kinase 7 


14 


ESTs 


3.4 


Homo sapiens mRNA; cONA DKFZp564C142 (fr 


14 


ESTs 


14 


ESTs, Weakly similar to neuronal thread 


14 


aamma>amlnol)utvjic fGABA) A mmitn 


13 


gb2w0l6l1.s1 Soares NhHMPu SI Homo sap! 


13 


ESTs 


13 


RE1*s{l6ncing bsnscilpQon factor 


13 


trans memhrann nm&lsito nnrfmnon hutttrcA 




Motho 




ESTs 


3.3 


Sec23 fS cerevlsiae) hnimlna B 


AO 


HI hlstonfifaniHv fnAmher? 


AO 


ESTs 


13 


ESTs 


13 


ESTs 


13 


ESTs 


13 


MsUdine decflifroxylasft 


13 


^.QV1-HT041^O1020(M)S9^03 HT0413 Homo 


13 


ESTs, Kghly similar to 1818357A EWS gen 


13 


ENSP0000Q229263':HSPC213. 


13 


ESTs 


3.3 


KIAAOSOOprotoln 


3.3 


leptbi ioceplor 


13 


KIAA01B2 protein 


13 


37 kDateucin»4toh repeal (U^ prol^ 


13 
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414407 AA147026 Hs76704 ESTs a3 

420324 AF163474 Hs.96744 prostata androgen-reoulated transcript 1 3.3 

423453 AW450737 Hs.12B791 CGt-OS protein 13 

444170 AW513879 Hs.10240B ESTs 3.3 

445474 AI240014 Hs^9558 ESTs 13 

450562 AI339732 Gflch RNA sequence bimSttg factor 1 13 

451246 AW1B9232 Hs.39140 cutaneous T-ce!l lymphoma tURioranfigen 13 

434194 AF119847 Ks.283940 Homo sapiens PRO1550mRNA.partidod8 13 

444324 A1301330 H5.143838 ESTs 13 

427581 NM_0147B8 Hs.179703 KIAA0129 gene product 13 

429258 AA448765 obw10B09,rt Soare5.lotaUalus.Nb2HF8, 3.3 

400B80 NM.000611*:Homo sapiens CD59 antigen p18 13 

445525 BE1498G6 Hs.14831 Homo sapiens. Similar to zinc linger pro 13 

451968 H56195 Hs.26409 Homo sapiens mRNA; cOIW DKFZp547K204 ffr 13 

411190 AA306342 Hs.69171 protein Idhase C4I)ce 2 13 

410660 AI06111B Hs.65328 Fancon) anenia. complementailon group F 13 

420493 AI635n3 Hs^70366 ESTs. WealdyslmSar to 178885 serineAh 3.3 

423740 Y07701 Hs.293007 amlnopepHdase puromycin sensitive 13 

428523 AW974540 Hs.98626 ESTs 13 

428743 AlMJOeO Hs.301549 Homo sapiens mRNA;cDNAOKFZp564H1 72 (fr 3.3 

432363 AA534489 gb:nf76glU1 NQ.CGAP.CoS Homo sapiens 13 

435102 AW899053 HS.76917 Ftocniy protein 8 13 

438147 AW250553 H.2K binding factor-2 13 

445808 AV655234 ESTs, Moderately similar lo PC4259 feni 13 

453058 AW612293 H5.288684 Homo sapiens cONAFU11750 lis, clone HE 3.3 

453286 AA034319 Hs^l Homo sapiens cOI^ FU 141 77 fis, done MT 13 

453802 AL134757 gb:OKFZp547G1090_r1 547 (synonym: hfbri) 3.3 

422017 NW_003877 Hs.1 10776 STAT Induced STAT inliibitor-2 13 

411373 BE326276 Hs.8861 ESTs 13 

435114 AA775483 Hs.288936 mitochondrial ribosomal protein L9 3.3 

443674 A1081330 Hs,145008 ESTs 13 

427528 AU077143 Hs.179565 mlnlchromosome maintenance deficient (S. 13 

433800 Ai034361 Hs.135150 lung type^ cell membrane-associated gly 13 

420969 AI636310 Hs.28310 ESTs 33 

421654 AW163267 Hs.106459 suppressor of varl (Sxerevlsiae)34ke 13 

409542 AA503020 Hs.36563 hypothetical protein FU22418 13 

410119 F07841 Hs.13926 ESTs 13 

417379 AA196390 db:zp99b1()ls1 Slratagenemuscle 937209 H 3.3 

423201 NM_000163 Hs.1251B0 giDWth homtone feceptor 13 

442787 W93048 Hs.250723 hypolhefical protein M6C2747 13 

407930 AA045847 Hs.188361 Homo sapiens cONAFU12807 lis, done Ml 13 

423427 AL137612 H5.28S848 KIAA1454 protein 13 

424903 T26477 Hs.22883 ESTs, W^stmPar to 138022 ItypotheQ 13 

426775 AA384564 ESTs 

432378 AM93046 Hs.146133 ESTs 33 

438875 AA827640 Hs.189059 ESTs 13 

452959 AI933416 Hs.189674 ESTs 13 

453124 AI139058 85.125790 teudno^ch repeat-containing 2 3.3 

417560 U73338 Hs.82283 &inethyltetr8hydrofQlale.homocy8teineme 12 

433002 AF048730 Hs.279906 cycIinTI 32 

430929 AA489166 Hs.155933 ESTs 12 

437444 H46008 Hs.31518 ESTs 12 

446297 AI346330 Hs.149728 ESTs 32 

407870 AB032990 Hs.40719 hypothetical protein MAA1 164 12 

429377 AA813192 Hs.200596 KIAA0547 gene product 12 

426030 BE243933 Hs.108642 zinc finger protein 22 {KOX 15) 12 

420583 H77659 Hs.65450 re6cubn4 12 

425264 AA3S3953 Hs.20389 ESTs, Weakly slfrtiar to gonadotropin Ind 12 

442562 BE379584 dolichyWlphosphoollgosaccharidfriJrotal 12 

418871 NM.001608 H5.1209 acyU}oenzyme A dehydrogenase, long chai 3.2 

421029 AW057782 H5.293053 ESTs • 

452997 N64777 Hs.44B56 ESTs ' 12 

435035 BE568487 Hs.47668 x008protdn 12 

408412 AW193033 .'Hs.124436 ESTs 12 

421684 BE281591 Hs.106768 hypothetical protein FU10511 12 

421887 AW161450 Hs.1 09201 CGI^ protein 12 

432409 AA806538 Hs.1 30732 KIAA1 575 protein 3.2 

434378 AA631739 Hs.335440 EST 12 

409401 A1201895 Hs.181309 proteasome (prosome. macropain) subunll. 3.2 

420195 N44348 Hs.26243 HomosaplenscDNAFU11177fis,donePL 12 

410264 AKO01853 Hs.61508 Homo sapiens cDNA FU10991 Os, done PL 12 

410296 AW630675 Hs.271946 ESTs 12 

413838 AV661185 H3.75574 mitochondrial ribosomal protein L19 12 

414342 AA742181 Hs.75912 KIAA0257 protein 12 

414709 AA704703 Hs.77031 Sp2transa^ factor 12 

425475 W56339 Hs.107057 ESTs 12 

437814 AI088192 Hs.135474 ESTs. Wealdy similar to 0DX9_HUiyWIATP-0 12 

449897 AW819642 Hs.24135 transmembrane protein vozatm; hypotheB 12 

452295 BE379936 Hs.28866 programmed cOD death 10 12 

437517 A19Z7675 H5.244482 Homo sapiens, done IMAGE:361 1719, mRNA, 12 

432960 AW150945 Hs.1447SB ESTs 12 

452498 AK000101 H5.29700 hypothetical protdnFU20094 12 

427299 AA830210 H5.214263 ESTs, Moderately sindar to AUJ1.HUIMAN A 12 
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421645 


AA974127 


Hs.129777 


ESTs 


3.2 


457489 


A!693815 


H3.127179 




3.2 


415691 


AW963979 


Hs^4n3 


ESTs 


12 


433852 


AI378329 


Hs.126629 


ESTs 


3.2 


439735 


AI835386 


HS.142B48 


t^fpotheOcal protein 


3.2 


428279 


M425310 


Hs.1 55766 


ESTs. WsaUy a'n^lar to A47582 BhxO gr 


3.2 


427715 


BE245274 


HS.1B0428 


KIAA1 181 protein 


3.2 


408899 


AA056614 


Hs.106200 


ESTs. Moderatety slmiiar to ZN91.KUMAN Z 


3.2 


415715 


F30364 


Hs.302204 


ESTs 


3.2 


426172 


AA371307 


H3.125056 


ESTs 


3.2 


434825 


N87549 


Hs.125287 


zinc linger protein ZNF14(Mike protein 


3.2 


436246 


AW450963 


Hs.119991 


ESTs 


3.2 


436401 


AI087958 


Hs.29088 


ESTs 


3.2 


408784 


AW971350 


Hs.63386 


ESTs 


3.2 


432125 


AW972667 




Homo sapiens cONA aJ123ro Us. done MA 


3.2 


426716 


NWL006379 


Hs.171921 


sema domsin, innnunoQloiniiin domdn (lg)t 


3.2 


405558 






Target Exon 


12 


434747 


AA837085 


Hs^85 


ESTs 


3.2 


414591 


Ai888490 


H5.55902 


ESTs. Weakly slmflar to ALU8 JIUMAN ALU S 


3.2 


428991 


AK001536 




Homo sapiens cONA FU10874 Tis. done NT 


3.2 


426171 


At128606 


Hs^557 


zinc finger protein 161 


12 


402802 






NM.001397:Homo sapiens endothelin conver 


12 


412530 


AA766266 


Hs.266273 


hypotheOrai protein fU13346 


12 


433050 


A1093930 


Hs.163440 


HomosaptenscDNA: FU21000 lis. done C 


12 


439221 


AA737106 


Hs.32250 


ESTs. Moderately similar to 178885 serin 


12 


440351 


AF030933 


Hs.7179 


RA01 (S. pomlM) homolog 


12 


445218 


T79971 


Hs.12432 


Homo sapiens done 24407 mRNA sequen(» 


12 


448568 


AA149121 


Hs.71947 


ESTs 


12 


420131 


F08286 


Hs.95262 


nudear factor related to kappa B bindtn 


11 


429299 


AI620463 


Hs.347408 


liypothe&al protein M6C13102 


11 


407913 


BE393767 


Hs.41569 


phosphaSdic add phosphatase type 2A 


11 


415009 


C75253 


Hs.220950 


ESTs 


11 


406627 


764904 


Ks.163780 


ESTs 


11 


438666 


AW014493 


Hs.126727 


ESTs 


11 


419875 


AA853410 


Hs.93557 


proenkephsfin 


11 


456660 


AA90924g 


Hs.112282 


sdute carrier family 30 (zinc transport 


11 


416701 


AA814948 


Hs.g6343 


ESTs. Weakly sinular to AlUC.HUMAN !!ll 


11 


425071 


KM_013989 


Hs.154424 


delodinase, iodothyronlne. type II 


11 


440300 


N39760 


Hs.8859 


Homo sapiens, Similar to f^KEN cONA 5830 


11 


444100 


AA383343 


Hs,22116 


C0C14 (cell division cycle 14, S. cerevi 


11 


431416 


AA532718 


Hs.178604 


ESTs 


11 


422662 


BE274778 


Hs.119007 


RAB4. niemt)er RAS oncogene famSy 


11 


449543 


AF070632 


Hs.23729 


Homo sapiens done 24405 mRMA sequence 


11 


437083 


AW082597 


Hs.244862 


ESTs 


11 


430294 


AI538226 


Hs.32976 


guanine nudeotkJe i)inding protein 4 


11 


417380 


T06809 


Hs^2086 


ESTs 


11 


419965 


H16382 


Hs.70258 


ESTs 


11 


430039 


BE253012 


H5.153400 


ESTs. Weakly similar to ALU1_HUMAN ALU S 


11 


430188 


AL049242 


Hs^34794 


Homo sapiens mRNA; d)NA DKFZpS64B083 {fr 


11 


446146 


A1287539 


Hs.148078 


ESTs 


11 


426647 


AA243464 


Hs^94101 


pre-B-cell leukemia transcdpQon fiactor 


11 


418555 


A(417215 


Hs.87159 


hypothetical protein FU12577 


11 


453496 


AA442103 


Hs^084 


sdute canter family 2 ffetiStsted glu 


11 


423947 


AW451954 


HS.13S941 


K1AA1048 protein 


11 


451693 


AW192083 


Hs.290855 


ESTs 


11 


447726 


AL137638 


Hs.19368 


matrilln 2 


11 


415068 


Z19448 


Hs.131887 


ESTs. Weakly dmilar to T24396 hypotheli 


11 


453439 


At672438 


Hs.32g76 


guaitfne nudeoOde binding protdn 4 


11 


443331 


A1052026 


Hs.14g995 


ESTs 


11 


410340 


AW182833 


Hs.112188 


hypotheacal protein FU13149 


11 


414372 


AA143654 




gbzo65a02.r1 Stralagene pancreas (93720 


11 


427421 


AA402414 


Hs.3059 


coatomer protein complex, sutxjnlt tieta 


11 


429399 


AM52244 


Hs.16727 


ESTs 


11 


430092 


AI821399 


Hs.16514 


ESTs 


11 


433577 


AW007080 


Ks.284192 


ESTs 


11 


433730 


AK002135 


Hs.3542 


hypothettoal protein HJ11273 


11 


436894 


K80696 


Hs.233313 


ESTs 


11 


4^56 


AA767537 


Hs.197096 


ESTs 


11 


438979 


AW976218 


Hs.32565 


ESTs 


11 


441791 


AW372449 


Hs.61271 


hypothelxal protein FU21 159 


11 


457718 


F18572 


HS.2297B 


ESTs, Weakfy slmDar to ALU4 JtUMAN ALU S 


11 


452017 


AF109302 


Hs.27495 


prostate cancer assodaied protein 7 


11 


419030 


T79957 


H5.188466 


ESTs 


11 


438308 


AI343469 


H3.127685 


K1AA1627 protein 


11 


433571 


AA765256 


Hs,135191 


ESTs, Weakly simBar to unnamed protein 


. 11 


421685 


AF189723 


Hs.106778 


ATPase, Ca transporfing, type 2C, mOTiiier 


11 


428678 


AA436884 


Hs.48926 


ESTs 


11 


432205 


AI806563 


H5.125291 


ESTs 


11 


440099 


AL080058 


Hs.6909 


DKFZP564G202 protein 


11 


428259 


AA424793 


H3^4144 


ESTs 


11 


434614 


AI249502 


Hs^669 


ESTs 


11 


420380 


AA640891 


Hs.102406 


ESTs 


11 


433323 


AAB05132 


H8.159142 


ESTs 


11 
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43^4 


W26354 


Hs.28891 




412088 


AI689496 


Hs.108932 




434361 


AF129755 


Hs.88474 


5 


400664 






438354 


AI879252 


Hs^15l 




449474 


AA019344 


Hs^55 




410595 


AW529223 


Hs.54794 




41S245 


N59650 


Hs^7252 


10 


418647 


AA226198 




434808 


AF155108 


Hs^561S0 




453716 


AA037675 


Hs.152675 




432952 


AA813887 


Hs.188173 




408705 


AA31213S 


Hs.46g67 


15 


430280 


AA361^ 


Hs^37868 


445919 


T53519 


Hs^92 




441790 


AW294909 


Hs.132208 




414085 


Ml 14016 


Hs7S748 




452619 


AW298597 


Hs.61884 


20 


422788 


AL117352 


Hs.77196 


439703 


AF086538 


Hs.196245 




444489 


A1151010 


Hs.157774 




453878 


AW964440 


Hs.19025 




440193 


AW902312 


Hs.7037 


25 


433880 


AI805366 


Hs.199945 


426383 


M58524 


Hs^025 




447595 


AW379130 


Hs.16953 




426044 


AA502490 


Hs.170290 




431962 


AL049385 


Hs^72251 


30 


445084 


H38914 


Hs^50848 


452737 
458229 


AK001680 
AI929602 


Hs.30488 
Hs.177 




435712 


AAe94607 


Hs.178956 




407378 


AA299264 


Hs^7776 


35 


414993 


AW819403 


Hs.77724 


419459 


AW291128 


H5^78422 




421988 


AW450481 


Hs.161333 




432833 


NS1075 


Hs.110028 




445210 


H09323 


Hs.27133 


40 


447620 


AW290951 


Hs.224965 


449375 


R07114 


Hs.271224 




426695 


Af355647 


HS.1899S9 




443967 


AW294013 


Hs^00942 




441224 


AU076964 


Hs.7753 


45 


448822 


88149845 


Hs.289038 


435688 


H72286 


H3.128387 




441889 


AI090455 


Hs,268371 




411704 


At49g220 


Hs.71573 




415954 


AA171850 


HS.422S1 


50 


419544 


A1909154 




420077 


AW512260 


Hs.87767 




443475 


A!06647a 


Hs.132809 




447231 


AK001293 


K8.330208 




438436 


AA807168 


Ks2715S2 


55 


427871 


AW9924C5 


Hs^9822 


443280 


AA299688 


Hs^4183 




448264 


AI478933 


Hs.188260 




428873 


AW601325 


Hs.337757 




453843 


025215 


Hs^5804 


60 


400303 


AA242756 


Hs.79136 


445943 


AW898533 


Hs.181574 




432286 


AW327432 


H3.255843 




431707 


R21326 


Hs.267905 




432675 


AI79165S 


Hs.105884 




443162 


T49951 


Hs.9029 


448073 


W19789 


Hs.336635 




419713 


AWg68058 


Hs.92381 




447966 


AA340605 


Hs.105887 




423583 


AL1220S5 


Hs.129836 


70 


418304 


AA215702 




417206 


AA291183 


Hs.81648 




410011 


A6020641 


Hs.57856 




411850 


AK002033 


Hs.72782 




438986 


AF085888 


Hs.269307 


75 


445921 


AW015211 


H8.146181 


447124 


AW976438 


Hs.17426 




452953 


At932884 


H8.271741 



hypoUetical protein FU1136(k8rteniisp 3.1 

ESTs 3.1 

ESTs 3.1 

iai.002425:Homo sapiens matrix melalopro 3.1 

RAN binding protein 7 3.1 

ubiquitln-actlvallnQ enzyme El (A1S9T an ai 

zinc finger protein 183 (RING tbtger, C3 ai 

ESTs 3,1 

gb:nc26a07.s1 Na.CGAP.Pr1 Homo sapiens 3.1 

Homo sapiens. Similar to RIKEN cONA 2810 ai 

ESTs 3.1 

Homo 8^nscDNAaj12187 lis, done MA 3.0 

HSPC034 protein 3.0 

tntarieuldn 7 receptor 3.0 

hypothetic^ protein MGC14141 aO 

ESTs 3.0 

aldehyde dehydrogenase 1 family, member ao 

Homo sapiens, done 1MAGE:4298026. mRNA, ao 

Human DNAsequencelromdoneRPS^BIO aO 

ESTs ao 

ESTs ao 

DC32 ao 

Homo sapiens done 24923 mRNA sequence aO 
ESTs . ao 

transforming gronrth factor, beta 3 aO 

phosphodiesterase 9A ao 

ESTs ao 

Homo sapiens mRNA; cDNA DKFZp5861M1418 (f 3.0 

hypothetical protein FU14761 3.0 

DKFZP434F091 protein 3.0 

phosphafidyBnositol glycan, dass H 3.0 

ESTs 3.0 

ESTs. Moderately similar to 138022 hypol 3.0 

KIAA0586 gene product 3.0 

DKRP586G1 122 protein 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs ao 

ESTs ao 

ESTs ao 

purineigtc receptor (family A group 5) ao 

ESTs ao 

caiumenfaY aO 

hypothetical protein MGC4126 aO 

ESTs ao 

hypothetical protein FU20274 aO 

hypothetical protein FU10074 aO 

ESTs ao 

gb:QV-9T20041049»007BT200Homo8^ ao 

ESTs ao 

ESTs ao 

crystalDn, zeta (quinone redudaseHk aO 

ESTs ao 

Homo sapiens, done IMAGE:3507281. mRNA. 3.0 

ESTs ao 

ESTs ao 

Homo sapiens mRNA; cDNA OKFZp566M063 (fr 3.0 

hect domain and RLD 3 3.0 

UV-I protein, estrogen regulated aO 

ESTs ao 

ESTs 3.0 

hypothetical protein FU10422 3.0 

ESTs ao 

DKFZP434G032 protein aO 

Homo sapiens, done (MAGE:4179482, mRNA 3.0 

nudix (nucleoside diphosphate linlced mol aO 

ESTs, Weakly similar to Homdog of rat Z 3.0 

KIAA1028 protein aO 

Qb2i97g10.r1 Na.CGAP.GCBI Homo sapiens aO 

hypothetical protein FU11021 similarto aO 

PFTAIRE protein kinase 1 ao 

hypofheticat protein FU11171 aO 

ESTs ao 

ESTs ao 

RBPI-tite protein aO 

EST8.Wealdy8lmilartoA46010X^nked aO 
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TABLE 609 



Pkey: Unique Eos probesetldentliiermimber 
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CAT number Gene duster number 
Accession: Genbankaocesskm numbers 



Ptey" 



CAT Number Accessions 



1253334.1 
143909.1 
147525.1 

156454J 



411479 1247077J AWB48047 AW848202AW848631 AW848142 AW846702AW848121 AW848632 AW848140 AWM8571 AWB48009 AW8480$7 AW84e069 
AW848905AW848214 

BE160198AW935898T11520AW935930AW856073AWS61034 
AAt43654 AW753140 AA213770 AW970865 AA569075 AM921 32 

AA743331 AA837388 AW664540AA775711 AA150965BE46S475 AA988994 AASBSgOS Ai141054 AW194991 AA252147AA31 9500 Ai 184288 
AA708749 AA644620AA652769AA242975AA1 51074 T19890 

AI267700 AI720344 AA191424 A!Q23543 A1469633 AA172056 AW%B465 AA172236 AW953397 AA3S5086 
H51299 H44619 H46391 R86024 H51892 T72744 

Ai633044 AW016212 AW241 143 AA76905B R43272 AW068958 AA21D918 AA293774 A174881 5 At763294 Ai3331 14 A)277384 AI088297 AI468477 
A1824624 AW189606 AI631751 Z40749 At984673 AI671316 AA189024 AW235412 BE178426 R24677 R40635 H05100 R40597 
AVV934714BE161007BE182500AW7499Q2AVV749864BE162498BE161005AA19044SAVV513465BE16100^ 
AA196390AA507837AA19646B 
AA215702 AA368006 AA215703 BE0665S5 BE006878 
AA226198 AA226513 AA3B3773 
T657&4 AA229857 AA229658 
AA603305 AA244095 AA244183 
A)909154 AAS26337 AA244193 AI909153 
AF109300 AI299378 AI202654 
AA331 156 AA331 157 AA3311 55 
AA769045 AA3725gO AW963633 
AA377823 AW954494 AI022686 
AA380153 AA380233 AW983S29 

AA3e4564 AW966475 H02121 N41297 D53213 AA886888 A1922414AW044240AWig6808AH]76736AA59g294 A]954433AW117617AI640323 
H96134 

AK001536AA191092AW510354At5M2S6AI.353968AA134266 

AI739166 AA426249 A)19S636 AW505198 AW977291 AA824583 AA883419 AA724079 AI01S524 Ai377728 AW293682 AI928140 AA73143B 
A)092404AI085630AA731340 
AA884766 AW974271 AASS2975 AA447312 
AW207206 AW341473 AA448195 AI951341 
AA448765 C04967 C03045 AA658293 

AW072916Ai184913AA489195AW466994AVMG9044N59350Ai819642AI280239AI220572AA789302Ai4^ 
AW972830 AA527647 AA489B20 AA570362 

ARi72813 AF1192g7 AA362885 AF059524 NM.006054 AA157365 AW163623 AA056148 AA227062 AA4ia057 AA227076 AA0787S3 AA233594 
D58629 AA232373 AA233577 T35956 BE618035 AA354497 AA359082 T32010 AA134519 BE299901 BE2680g6 BE396826 AA324268 AL120308 
AA187561 AA311680 
H28383 AW972670 H28359 AA525B08 
AW972667AA526539A1057032AW167842 
AA527941 AI810608Ai620190AA535266 
AAS34222AA632632 T81234 
AA534489 AW970240 AW970323 

AI821085 AW973464 AAS54802 AI621831 AA657438 AA640756 AA650339 

AA863400 AI991439 AW01G017 AW014704 At367512 H17550 AA744752 R461B7 AW471324 Ai126670 AA626033 At276287 Ai094253 Ai286003 
AI147163 AI911443 AW512612 AAS72102 AA999975 AI664428 At335035 063102 At524234 Ai539156 AA565542 
T55958T57205AF14734B 
AA657494A1582663A1581639 
AI692552AI393343A1800510AI377711 F24263AA661B76 

AA1567ei AW293839 U52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE16404Q BE164012 BE163972 BE163974 BE163992 
AAB37481 AW46B444 BE185091 AW468002 AA687333AA811830AA581806Ai866686AI572124AA043777AAD40926D20160A1536733 
AA812489AW874142AI47188 
AW2S0553 L07876 Z36843 R3Q893 A1190097 AW965317 
AA830149 AWg78407 M85983 AWS0d637 
W76326AF086341 W72300 
AWB92676 AA853877 D44747 

AW629666 AW959831 AW20S739 BE620243 AA412367 AW30002S AW051920 AI288591 AW2361 14 A1302852 AI038548 AA534496 AI797207 
AA921877 

AA973905 Ai2998e8 AA917019 K63235 T90771 

BE379584 R34211 BES44768AW973709Ai653056A1853173A]266043Ai658750 H74180AI492830AI376090AI4721M 
AI0d2443 AW021 142 AI167921 AI348677 AI276577 AW130886 AA761517 AI696203 AA1 1553S At264790 R34328 D59939 AW205074 AA554902 
062102 A10007 
AI038316AI344631AI261653 
AV655234 AW986332 AA340239 
T85314 AI360684 T85528 T91254 
AI475858AW969013 

NM_014253 AF100772 BEDB8769 AL022718 BE161779 AWB63569 BE161 640 AL039060 BE168542 AW296554 AA323193 AA235370 AW779760 
N48674 Ai375997 R45432 D59344 A1203107 F07491 R35360 R25094 Ai913631 A1498402 T61362 AI016320 N45526 T61415 AA331486 
450580 83929.1 N40087H12925AA460779AA096372 6E160847 AWB16890AW816893AW816577AW816891 AW816882AW81 6868 AW816889AWB1 6940 
AW816892 AWB16941 AW816578 AA029183 T83320 H86850 AA01029S 6E160823 Z25353 AA15a863 AW887764 AW023806 AW022095 
AA164518 AA730973 W00417 W65303 
83933J AI339732 AA010300AVK1S041 AA768334 N29860AA42S874 AA425118 AA865629 AW936878 
9074J AA453208MO04251U44103AI671547N57463AVm7521A1479232 T82809R67109AVV7^ 

BE326331 AI934576A1922378AW276431 A)718466 N38566 AA904753BE464245AI338752Ai659875AW272336AI423138AI089270A1160904 
AA6643&4BE1 

452S98 92338.2 AI831594AW970867AW027959AI129800 AI927949AI650270A]825105 AW514661 AI708393AL138076BE180510AI926721 Ai399955AA749139 
AI862160 AW874011 AI242763AA262795AA039864 H73499 AI033249BE245651 A1816834 N25206AA82B301 AI084565Ai302816 AA026g05 
AA77255 

93255.1 AA418841 AI452657 AI768876AAD28973 BE179873C00215 AA418930 
981589.1 AL134757AW079131 

1253334.1 BE160198 AW9358g8 T11520 AW935930 AW856073 AW861034 

706 



411667 
414372 
414680 

415969 
41G28B 
416882 

416913 
417379 
416304 
416647 
418866 
419538 
419544 
423412 
423800 
426226 
426413 
426503 
426775 

426991 
428342 

429163 
429220 
429258 
430935 
430968 
431429 



432093 
432125 
432189 
432340 
432363 
432600 
432810 

434579 
434966 
435023 
437866 



438147 
439092 
439518 
439904 
440840 

441102 
442562 



443161 
445808 
447082 
448212 
449625 



450582 
452260 



452815 
453802 
455100 



^1 

162718.1 

163001J 
167238.1 

173658.2 
177521,1 
179788.1 
185688.1 
185760.2 
228001 1 
232161 1 
262918J 
266650.1 
268283.1 
271683.1 

27415.1 
290035.2 

300543.1 
301384.1 
301917.1 
325772.1 
326269.1 
33313.1 



341283.1 
341776.1 
342819.1 
345248.1 
345469.1 
350959.1 
354375.1 



11 
396504.1 
388093.1 
44433J 



45074.1 
468554.1 
47334 1 
479942 1 
50357.2 

509604 1 
54500J 



561305.1 

65133.1 

707248.1 

755099.1 

8113.1 



wo 03/042661 PCTAJS02/36810 



457728 393853.1 AW974811 AA651634AAG50072 



TABl£60C 



Pkef. Unique number oorrespcuKSng to an Eos prebeset 

Ref: Sequence source. The 7 digit numbera in Uiis oohimn are Genbanic Identifier (Gi) numbers. Dunham 1. et aL* refers to the pubticalton entitled The ONA sequence of 
human chromosorne 22.* Dunham L et al. Nature (1999) 402:489^95. 
Stran± Imfic^es 01^ strand from which exons were predicted. 

NLoosHion: indicales nucleotide positions of precfided axons. 



Pkey 


Ref 


Strand 


Ntj)06ition 


400664 


8118496 


Plus 


13558-13721.13942.14090.14554-14679 


400695 


7249150 


PhJS 


160456-160567.164757-164873 


400B80 


9931121 


PhB 


29235'29336.36353-36580 


401197 


9719705 


Plus 


176341-176452 


401424 


8176894 


Rus 


24223.24428 


401451 


6634068 


Minus 


119926-121272 


401519 


6649315 


Plus 


157315-157950 


401558 


7139878 


Rus 


103510-104090 


402031 


7656761 


Phis 


33080-33283.33939-34094.36103^7 


402802 


3287156 


Minus 


53242-53432 


402855 


9662953 


Minus 


59763^09 


403046 


3540153 


Minus 


55707-55659.56369^6511 


403047 


3540153 


Minus 


59793-59968 


403790 


8084957 


Muuis 


8782&«7947,89835*90002 


404571 


7249169 


Minus 


112450-112646 


404632 


9796668 


Plus 


45096^5229 


404641 


9796810 


Minus 


32247-32382 


404642 


9796810 


Plus 


102999-103145 


405523 


9454643 


Phis 


114550-114688.117265.117407,119490-119599,123237-123395,131140-131217 


405558 


1621110 


Plus 


4502-4644,5983-6083 



TaUe 61A lists about 440 genes uiwegulaled tn prostate cancer compared Id nonnal prostate and nornial adult tissues. These were selected from 59680 probesets on the 
Affymetrix/Eos Hu03 GeneChtp array such that the ratio of 'average' prostate cancer to 'average' nomia) prostate tissue was greater than or equal to 2.0 and the raOo to nomtai adult 
tissues was greater than or equal to 2.0. The 'average* prostate cancer level was set to the 75^ percentile amongst pn3state cancers^ The 'average' normal proslalB tissue level was 
set to the 95*> percentile amongst prostate Sssues. The 'average' nonnal aduit Gssue level was set to the 8?' percenOa amongst noTHnallgnant fissues. In order to remove gene- 
specific background leveb of non-specific hybridlzatton, the lO*" percentite value amongst the non-malignant tissues was subtracted ftom both the numerator and Ibe denominator 
b^ore the ratio was evaluated 

TABt£ 61A: ABOUT 440 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE AND NORMAL ADULT TISSUES 

Pl^ Unique Eos probesel Identifier number 

ExAocn: Exemplar Accession number. Genbank aooesston number 

UnigenelD: Unigena number 

Unlgene THIecUnlgene gene title 

R1: RaOo of prostate tumor to normal adult body tissue 

R2: Ratio of prostate tumor to nomtai prostate tissue 



Pkey 


ExAocn 


UnigenelD 


Un^ne Title 


R1 


R2 


444151 


AW972917 


Hs.128749 


atphwnelhylacyt-CoAracemasa 


42.2 


AU 


421566 


NM.000399 


Hs.1395 


early growth response 2 (Krox-20 (DroGOp 


3.2 


7BA 


420729 


AW964897 


Hs.290825 


ESTs 


3.7 • 


15.8 


401197 






ENSP00000229263':HSPC211 


ao 


12.6 


450096 


AI682088 


Hs.79375 


holocarboxytase synthetase (btolin-[prop ' 


12.4 


12.4 


449156 


AF103907 


Hs.171353 


prostate cancer antigen 3. non-coding DD 


12.3 


12.3 


443271 


BES68568 


H3.19S704 


ESTs 


11.6 


11.6 


434078 


AW880709 


Ks.283883 


difomosome 8 open reading frame 4 


Z1 


11.2 


431117 


AF003522 


Hs.250500 


dena(Dro3ophil^^1 


9.4 


9.4 


417315 


A1080042 


Hs.180450 


ribosomal protein S24 


13 


9.0 


416162 


N^L004354 


Hs.79069 


cydin G2 


a4 


U 


421913 


Ai934365 


Hs.109439 


osteoglycin (osteoinducth/e factor, mime 


2.2 


84 


434217 


AW014795 


Hs.23349 


ESTs 


8.3 


8.3 


425782 


U66468 


Hs.159525 


cell growth regulatory with EF-hand doma 


8.3 


8.3 


442501 


AA315267 


Ks.23128 


ESTs 


2.0 


&3 


429984 


A1050102 


Hs.227209 


hypothetical protein aJ21617 


ai 


ai 


449048 


245051 


Hs.22920 


slmQar to S68401 (catOe) glucose Indue 


3.1 


8.0 


417407 


AA923278 


Hs.290905 


ESTs, Weakly dmllar to protease [H4»pi 


34.0 


7.9 


420120 


AL049610 


Hs.95243 


transcriptton etongaSon factor A (Sil)- 


7.5 


7.5 


419078 


M93119 


H3.89584 


insufinoma-assodated 1 


7.4 


7.4 


425905 


AB032959 


Hs.318584 


novel C3HC4 type 23nc finger (rfng tinge 


9.4 


7.3 


425016 


BE24S277 


Ks.154196 


E4FtF8nscrip6on (actor 1 


7.2 


7.2 


452221 


C21322 


H5.288057 


hypothetical prot^ FU22242 


7,0 


7.0 


421552 


AF026692 


Hs.105700 


secrtied litzzled-related protcfri 4 


4.0 


6.6 


409799 


D11928 


H5.76B45 


;^vo5phoserine phosphatase^ke 


6.3 


&3 


411373 


BE326276 


HsX861 


ESTs 


12 


&3 


423445 


NM.014324 


Hs.128749 


alph»<nethylacyl-CoA racemase 


6.7 


6i2 


422583 


AA410506 


Hs.27973 


KIAA0874 protein 


2.3 


6.2 


438209 


AL120659 


Hs.6111 


aryMiydrocartxin receptor nuclear IransI 


6.1 


6.1 


437147 


AL049964 


Ks.8358 


hypottietlcsil prot^ FLJ20366 


Z6 


6w0 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



432415 


T16971 


Hs.289014 


ESTs. similar to A43932 mtMjn 2 p 


6.0 


&0 


446898 


T15767 


Hs.22452 


Homo sapiens mf^ for KIAA1737 pretBbi, 


6.2 


&0 


449118 


R67477 


Hs^103 


Betl (S. cerevisiae) homolog 


5.9 


&9 


427472 


AA522539 


HS.1312S0 


transposofHlerlved 6uster3 trensposase^ 


3.3 


S.9 


43154S 


A1834273 


Hs,9711 


novo) prot^ 


15.7 


5.8 


417153 


X57010 


Hs.81343 


coQagen, type 11. alpha 1 (primary osta 


4.2 


5.8 


410889 


X91662 


Hs.66744 


twist prosophUa) homoloo (acrocephalos 


3.5 


5.8 


429597 


NK4.003816 


Hs.2442 


a (fisintegrin and metafloproleinase doma 


7.9 


&8 


422538 


NM.006441 


Hs.n8131 


5.1&4nathenyItetrahydrofblate synlhetasa 


5.7 


5.7 


414812 


X72755 


Hs.77367 


monokine induced bf gamma interferon 


2.5 


&6 


448552 


AW973653 


H5.20104 


hypolheOcal protdn FU00052 


5.6 


5.6 


416111 


AA033813 


Hs.79018 


chromafin assembly factor 1, sutHinlt A { 


5.5 


5.5 


414516 


AI307802 


Ns.135560 


ESTs. Weakly sbnOar to T4345d liypotheli 


17 


5.5 


414343 


AU)36166 


Hs.323378 


coated veside menibrans prated 


&4 


5.4 


451684 


AF216751 


Hs.26813 


C0A14 


3.9 


5.4 


421470 


R27496 


Hs.1378 


annexinA3 


5.3 


5.3 


426350 


NM 003245 


Hs.2022 


transglutaminase 3 (H polypepfide, prote 


5.3 


5.3 


436476 


AA326108 


Hs.33829 


t)Hli^ protein DEC2 


2.8 


5.3 


437571 


M760894 


Hs.153023 


ESTs 


5.2 


5.2 


447072 


061594 


Hs.17279 


tyrosylprotein suUbtransierase 1 


5.2 


5.2 


444917 


R66551 


Hs.144997 


ESTs 


5.1 


5.1 


417318 


AW953937 


HS.24Q845 


ESTs 


4.6 


5.1 


434170 


AA626S09 


Hs.122329 


ESTs 


5.1 


5.0 


432897 


AF155099 


H3^9780 


NY4^18 antigen 


3.3 


5.0 


433560 


A1925195 


Hs.130891 


tiypolhefoal protein M6C4400 


4.8 


5.0 


421623 


N40850 


H8.28625 


ESTs 


5.0 


5.0 


451244 


AW006798 


Hs.343877 


hypothetical protein FU20039 


4.9 


4.9 


453370 


A1470523 


Hs.139338 


ATP-binding cassette, sub-fandy C (CFTR 


29.8 


4.9 


425242 


D13635 


Hs.155287 


KtAAOOIOgenepradud 


t2 


4.8 


439024 


R96695 


Hs.35598 


ESTs 


5.4 


4.8 


407235 


D20S69 


Hs.169407 


8A(^ {suppressor of aciin mutaBons 2. y 


4.8 


4.8 


421663 


A1952677 


Hs.106972 


Homo sapiens mi^ cDNA DI07p434P228 (fr 


4.7 


4,7 


437916 


BE566249 


HSt20999 


hypothetica] protein FU23142 


3.6 


4.7 


451952 


AL120173 


Hs.301663 


ESTs 


4.7 


4.7 


431676 


AI685464 




gb:tt88f04 j(1 Na_CGAP jy28 Homo sapiens 


9.1 


4,7 


452242 


RS0956 


Hs.159993 


gyoosyttransfeFBse 


4.7 


4,7 


401519 






C15Q00476':gi|12737279|ref|XPjD12163.1) 


4.6 


4.6 


414342 


AA742181 


Hs,75912 


KIAA0257 protein 


3.4 


4.6 


426197 


AA004410 


H5.100009 


acyt-Goenzyme A oxidase 1, palmitoyi 


3.3 


4.6 


429467 


NVL004477 


Hs.203772 


FSHO region gene 1 


2.1 


4.6 


408380 


AF123050 


Hs.44532 


diubiquitin 


4.2 


4.6 


425907 


AA365752 


Hs.155965 


ESTs 


2.2 


4.6 


427078 


AI676062 


Hs.11ig02 


ESTs 


4.8 


4.5 


419465 


AW500239 


H5^1187 


Homo sapiens cDNA: FIJ23068 fis, done L 


Z7 


4.5 


429250 


H56585 


Ks.198308 


tjyptoplian rich bssxc i^otein 


4.5 


4.5 


431585 


BE242803 


Hs.262823 


hypothetical protein FiJ 10326 


15 


4.5 


443250 


A1041530 


Hs.132107 


ESTs 


8.1 


4.4 


423545 


AP000692 


H$,129781 


chroniosome 21 open roadng frame 5 


4.4 


4.4 


423698 


AA329796 


HS.109B 


DKF2^434J1813pr(M 


4.4 


4.4 


442281 


N34742 


Hs.170085 


Homo sapiens cDHA, FU13492 fis, clone PL 


4.4 


4.4 


428826 


AL048842 


Hs.1 94019 


attractin 


2.0 


4.3 


447188 


H65423 


Ha.17631 


hypotheHcai protdn DKFZp434E2135 


4.3 


4.3 


418701 


AA814948 


Hs^6343 


ESTs. WeaUy slmttar to ALUCJHUMAN 


4.2 
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HhDC*1 $uifotr9n$f6FBSo 


2.1 


25 


452741 


BE392914 


HsJ0S03 


Homo sapiens cDNA FU11 344 (is, done PL 


2.5 


25 


447702 


AI420256 


H5.161271 


ESTs 


2.1 


24 


412043 


BE156622 


Hs.333371 


HAtno sanlens dnnA TAflt tmtfBWiJHted irAN 

liwtllV9<l|JI6(mtWilO l/TIV UIHIIHiaiGHBU lliivi 


2.4 


24 


421413 


A1826128 


HsS7637 


ESTs Weddvsiirilar Id A49364 59010^ 


24 


24 


429128 


AA446669 


H5.119316 


ESTs 


2.0 


24 


429258 


AA448765 




^3x10e09.r1 Soares.totBUelua.Nb2HF8_ 


2.4 


24 


435102 


AVra99053 


H5.76917 


F-faox only protein 8 


2.4 


24 


458161 


BE264545 






3.1 


24 


407021 


U52077 






7.3 


24 


437517 


AI927675 


Hs.244482 


HomosaDiens ctanelMA(^3611719 mRNA. 


2.1 


24 


408267 


AVV380525 


Hs.343564 


bdjuEn-spedfic di^psfora e 


4^ 


24 


447713 


AI420733 




ESTs 


2.4 


24 


456848 


AL1 21087 


Hs.296406 


KIAAOfiSS nana fvoduet 


2,2 


24 


437617 




Hs.5716 




Z6 


24 


432378 


AI493046 


Hs 146133 

rw« i^w 


ESTs 


5.2 


24 


443180 


R15B75 


Hs.258576 


daudin 12 


10.1 


2!4 


445929 


AI089660 


Hs 323401 




2.2 


24 


424701 




r». 19 1900 


rT«tnnan_aHiws4oH nmfoln Irinmo IrtnocA 
ITnUJyclraCUValcu piuwui nIuSo IUIIoSo 


2.3 


24 


445255 


NM_014841 


Hs. 12477 


fivn;iatosfMnai>a&sactated Mnlein 91 kDa 


2.5 


24 


409731 


AA12S985 




Ihi/nvKin hpin MftnliflpH In npiimMflst 

UIJIIMWIli MSUlt lUCllUltCU III IICUIUUUmI 


27.5 


24 


421709 


AA1 59394 


Hs 107056 
noi ivf w«>u 




2.3 


24 


412707 


AW206373 


Hs. 16443 


Hotno sspisns cONA: FLJ21721 fis* clone C 


47 


24 


418882 


NM 004996 


Hs.89433 


ATP«Und!no cnssatta siiwfamlltf C ICFTR 


2.4 


24 


451071 


AW138807 

riff IwUOUv 


Hs 204939 


FST< H!oMv similar In A54677 homaofic 


2.4 


24 


420344 


BE463721 


Hs 97101 


Duialtva G nmtaliwYMinli^ mcentfif 


3.1 


23 


433037 


MM 014158 




HSPCOfi? omfefat 

ri Wi VAAJ f pi viwU 1 


2,4 


23 


418329 


AW247430 


HS.841S2 




4.1 


23 


419544 


AI909154 




^aV-8T20(M)104994)07 BT200 Homo sullen 


23 


23 


424602 


AK002055 


Hs 151046 


ESTs ^"^^^ 


2.1 


23 


450693 


AW450461 


Hs^03965 




2.3 


23 


452387 


AI680772 


Hs 306094 


trintH^aofldn reaaflt eonl;)lnlna 1 9 
UHiin«tvwuuw icpcoi vViiUMiiiiiy i& 


2.3 


23 


412045 


AA099802 


Hs.83883 


trsnsmembrsne, prostate aiK^ogen induced 


4.1 


23 


428180 


All 29767 




ntianifM niv^MiUHn Nivflnn nmtpin tG nr 


10 


23 


419438 


AA406400 


Hs.12482 


yij«<dU4iB)iiiua^icsui x^oujiuoiioiciaso 


2,7 


23 


443194 


AI954968 


Hs!279009 


cnatrix Oa protein 


2.1 


23 


414869 


AA 157291 


Hs 21479 


tihmtidaln 1 


2.5 


23 


407300 


Ml 02616 


Hs.1 20769 


gbizn43e07^1 Strategene Heta ceB 83 93 


23 


23 


436396 


A1683487 


Hs 152213 


iiiinntp<i!t.1vna h/lMTV InhnraMon die fanri 


23 


23 


439221 


AA737106 


Hs 32250 


ESTs, Moderately stinflar to I78B85 serin 


23 


23 


453802 


AL1 34757 




nh*nKF7n&17dOQfl r1 547 fsvnonvm! hfhrit 


4.1 


23 


421059 


A)654133 


Hs.30212 


thvmiri rpTAnhv lr)tpr:v*.tlnn nmtaln 1 5 


7.9 


23 


426413 


AA377873 




yUpWW 1 ^v\J\Jv OJIIWMl} MlwUlllCl Ill/lllU ocyioii 


22 


23 


431147 






ESTs 


5.8 


23 


442677 


AI5S7914 




rwnykninid ncvUHaA thioastarafia 


2,5 


23 


446091 






F«>T< 
COlo 


4.0 


23 


431563 




Uq 190019 


CO 18 


22 


23 


439478 


AFQ4Q4fin 




HafriimAH anMamial aiitnmniilafmv farlnr 

uaiumicu epnjorniai puuAoyiiKiujfj idcm 


21 


23 


413049 


NM Q021S1 


Hs.823 


hanciln MranciTiflmhrana nmtaa$a saHna 1 


3.5 


23 


405685 






r:9nn9R7Q-a!li^n7fiRq(mflNP nn.19QR1ltni 
V>^W£0£9.ylp3Uf 009pcI{rl~^UlM^90. I| ud 


23 


23 


407917 


U63139 


Hs.41S87 


RAD5Q /S cefRvislaa) honrioloa 


2.3 


23 


450378 


AW249181 


Hs.19954 


ESTs. Weaktv simnar to T1 9873 hvoothefl 


23 


23 


429303 


AW1 37635 

riff 1^ f Uvw 


Hs.44238 


ESTs Weaklv slrntar to S6S657 atoha^l C> 


23 


23 


417248 


AA329449 


Ue 947^09 




23 


22 


420277 


AW452D16 


Hs 1278B3 


ESTs Weaklv slmijar to SFR4 HUMAN SPUC 


26 


2.2 


431555 


A1815470 


Hs.260024 


Cdc42 effector protein 3 


4.5 


22 


440099 






DKF7P5546?02 Ofotein 


29 


22 


416653 


AA768553 


Hs.193145 


inetaliothioneto IE (functional) 


14 


22 


431542 


H63010 


Hs!s740 


ESTs 


7.0 


22 


401556 






ENSPQ0Qfl022047B*'SECf^OGRANIN III 


22 


22 


418281 


U09550 


Hs.1 154 


ntfUuelal nhieriniBtiAi 1 19DkD (fnncbi fi 


22 


22 


422295 


AFQS1151 


Hfi 114408 


MUllcn mrnnhv !! 


22 


22 


432453 




Hs.27172 


PST« MoriarfltplvstmRarlnPCd2S9feiri 

1 9, vn/WiwKiJ OHIHMt H/ r VVCvw mil 


22 


22 


447499 




H« 1d7(i7A 


fwntfv^ivlfM>rin hfltfi AR 
pioiucaQncnii uouj lu 


22 


22 


AUTViA 






IHAAIIAT nmloin 
IMIVMOHf piulBUi 


4.0 




4S(n9<; 






CO 19 


11.9 


22 


422390 


AW450893 


Hs.1 21830 


ESTs, Weakly similar to T42682 hypolheti 


21 


22 


448789 


BE5391(» 


HS.22051 


hypothetical protein MGC15548 


26 


22 


435679 


AI127483 


Hs.120451 


ESTs, Wealdy slmOar to unnamed protein 


11.8 


22 


425263 


NM-001197 


Hs.155419 


8CL2-interaciino kilter (apoptosis-induc 


3.2 


22 


451939 


U80456 


Hs.27311 


slngto-intnded (OrosophBa) homotog 2 


13.6 


22 


410252 


AW821182 


Hs.61418 


microfibriBar-associsted protein 1 


4.2 


22 


439699 


AF086534 


Hs.187561 


ESTs, Moderately sto«ar to ALU1J1UMAN A 


48 


22 


422576 


BE&48555 


H3.1 18554 


C6^83 protein 


21 


22 


433293 


AF007835 


H5.32417 


hypothetical proteto MGC4309 


11.0 


22 


410733 


D842B4 


Hs.56052 


CD38 antigen (p45) 


3.1 


22 


421379 


Y15221 


Hs.103982 


smafl indudhte cytoMne subtiBmily 8 (Cy 


21 


22 


442995 


AAS32511 


Ks.288455 


Homo saptens cONA: FU23270 Os, ctone C 


22 


22 


408833 


AW612232 


HS.254B35 


ESTs 


3.0 


22 



711 



wo 03/042661 



PCT/US02/36810 



422424 


AI186431 


Hs.296638 


prostate diff erenliaBon (actor 


4.4 


12 


433444 


AW975324 


Ks.129816 


ESTs 


11.7 


12 


4463S4 


AW4496S0 


Hs.346335 


ESTs 


2.6 


12 


438869 


AF07S009 




gb:Homo sapiens full length insert cDNA 


5.9 


12 


425628 


NM.004476 


Hs.1915 


folals hydrolase {prostate^pedfic menA) 


12.8 


12 


424038 


AA770688 




H2A hislone family, msniber L 


3.8 


12 


440146 


AW014231 


Hs.90790 


Homo sapiens cONA: fnJ22930 fis. dona K 


4.8 


12 


404641 






NM.021965':Homo sapiens phosphoghjaxnuta 


2.2 


12 


407253 


AA411175 


Hs.141939 


ESTs. Moderately similar to S65657 a^ 


2.2 


12 


406068 






C2002008:g!I7303957igb|AAF59000.11 (AEOO 


ai 


11 


415162 


AF035718 


Hs.78061 


transcription factor 21 


Z1 


11 


423349 


AF0102S8 


Hs.127428 


homeot)oxA9 


6.9 


11 


424534 


D87682 


Hs.150275 


K1AA0241 protein 


Z3 


11 


407756 


AA116021 


H5.38260 


ut)iQUi6n specific protease 18 


Z5 


11 


440658 


H29142 


H3.143032 


ESTs, Weakly similar to neuronal thread 


1$ 


11 


416547 


H62914 


H5.268946 


ESTs. WeakJy similar to PC4259 ferritin 


Z1 


11 


432281 


AK001239 


Hs^4263 


hypotheticat protein FIJ10377 


^^ 


11 


447533 


NM_004786 


Hs.18792 


thioredoxin-6Ice, 32kD 


^^ 


11 


448966 


AW372914 


Hs.86149 


phosphdnositol 3-ptrasphate4iinding pro! 


z\ 


11 


449897 


AW819642 


Hs^4135 


transmembrane protein vezatin; hypolhsti 


11 


11 


452959 


AI933416 


Hs.169674 


ESTs 


2.1 


11 


427083 


NM^006363 


Hs.173497 


8ec23 (S. oerevtsiae) homolog B 


Z8 


11 


408291 


AB023191 


Hs.44131 


KIAA0974 protein 


2.1 


11 


418727 


AA227509 


Ks.94834 


ESTs 


2.1 


11 


440491 


R352S2 


HS.1305S8 


ESTs, Weaidy simitar to 2109260A B ceQ 


^^ 


11 


433466 


AA508353 


Hs.105314 


relax!n1(H1} 


17.3 


11 


432695 


D63480 


Hs.278634 


KIAA0I46 protein 


Z3 


11 


427982 


NM_016156 


Hs.181326 


KtAA1073 protein 


a2 


11 


445873 


AA250970 


Hs.251945 


po}y(A}-blnd1ng protein, cytoplasmic 1*1 




11 


434569 


A1311295 


Hs.344478 


KIAA0t96 gene product 


^^ 


11 


441791 


AW372449 


Hs.61271 


hypothetical protein FLJ211S9 


2.1 


11 


438520 


AA706319 


Hs.98416 


ESTs 


2.7 


11 


414269 


AA298489 


Hs.303171 


olfactory receptor, family 51, subfamily 


17.3 


11 


410660 


Ai061118 


KS.6S328 


Fanconi anemia, complementation group F 


3.5 


11 


456660 


AA909249 


Hs.1 12282 


solute carrier family 30 (zinc transport 


7.4 


11 


450861 


A1S23898 


Hs.17617 


ESTs 


Z4 


11 


419193 


D29643 


Hs.34789 


dolichykliphosphooligosaocharide^tel 


Z1 


11 


423784 


AK000039 


Hs.132826 


Homo sapiens cDNA FU14913 fis, done PL 


3.2 


11 


416208 


AW291168 


Hs.41295 


ESTs, Weaidy simHar to MUC2_HUMAN MUCIN 


&4 


11 


407182 


AA312551 


HSJ230157 


ESTs 


11 


11 


421669 


N87820 


Hs.106826 


KIAA1696 protein 


11 


11 


432833 


N51075 


H5.1 10028 


ESTs 


11 


11 


425170 


AU077315 


Hs.154970 


transcription factor CP2 


16 


10 


436278 


BE396290 


Hs.5097 


synaptcgyrin 2 


18 


10 


438719 


AA357129 


Hs^9625 


integral membrane protein 2B 


13 


10 


432728 


NM.006979 


H3^78721 


Hl^ class II region expressed gene KE4 


ae 


10 


410310 


J02931 


Hs.62192 


coagulation factor III (thromboplastin, 


10 


10 


429716 


R25685 


Hs.211933 


ooDagen. type XIU, alpha 1 


16 


10 


424081 


NM 006413 


Hs.139120 


ribonudeaseP (301(0) 


10 


10 


433800 


A!034361 


Hs.135150 


lung type-l cell membrane-assodated gly 


10 


10 


413627 


eE182082 


Hs.246973 


ESTs 


3.3 


10 


414133 


AW022188 


H5.41167 


ESTs 


13 


10 


411096 


U6Q034 


Ks.68583 


nvtochondrlal intonnedials pepOdase 


14.7 


10 


450244 


AA007S34 


Hs.125062 


ESTs 


ai 


10 


422493 


AW474183 


Hs.250173 


hypothetical protein FU131S8 


16 


10 


409219 


AA393383 


Hs.133331 


ESTs 


11 


10 


419986 


A1345455 


K3.78915 


(SArblnding protein transcrlptton factor. 


10 


10 


421969 


AF0S2217 


Hs.110099 


cwB'tiinding factor, runt domaiiii alpha 


10 


10 


434128 


Wg3170 


Hs.284164 


protein X 0004 


17 


10 


434503 


T96231 


Hs.17762 


ESTs 


10 


10 


443292 


AK000213 


H$.9196 


hypothetical protein 


10 


10 


443555 


N71710 


Hs.21398 


ESTs, Moderately similar to A Chain A. H 


2.0 


10 


448172 


N75276 


H5.13S804 


ESTs 


6.0 


10 


452039 


AI922988 


H&172S10 


ESTs 


10 


10 


400290 


H16838 


Hs.31608 


hypothetical protein FU20041 


6.2 


10 


449625 


NM_014253 




odz (odd Oz/ten-m, DrosopNIa) homolog 1 


10.1 


10 


441406 


Z45957 


Hs.7837 


phosphoprotdn regulated by mitogei^ pa 


13 


10 


451369 


AA017321 


H9.26g691 


ESTs 


11 


10 


434011 


AW953437 


Hs.5486 


donePLB5214 


11 


10 


408608 


AW847814 


Hs.28g005 


Homo sapiens cONA: FU21532 Its, done C 


5.4 


10 


433388 


AI432672 


Hs.288539 


hypothetical protein FU22191 


17 


10 


432653 


N62096 


Hs.293185 


ESTs. Weaidy similar to JC7328 amino ad 


5.5 


10 


446783 


AW138343 


Hs.141867 


ESTs 


4.4 


10 


437323 


AA371145 


Hs.194397 


ieptin receptor 


15 


10 


410076 


T05387 


Hs.7991 


ESTs 


19 


10 


421501 


M29971 


Hs.1384 


0-6-methytguanin&^NA melhyllransferase 


11 


10 


419733 


AW382955 




Homo sapiens cONA FIJ14415 lis. dona HE 


15 


2.0 



TABLE 61B 



wo 03/042661 



PCT/US02/36810 



Pkey: Unique Eos prObeset Uentifier number 

CAT numben Geno duster number 

Accession: Genbank accession numbers 



Pkey 


PAT Khimhor 


AA^^^ AAMnsn AUURAAUn A&7TC711 AAli^nQR.^ RF^R'tdTtt AAQf^QOd AAflRROfl'^ AlldlfliU AWIQdQQI AAO^OiAl AA!)1QSD0 /UlftdTflA 


414DoU 


147525.1 


AAf 40tf01 IVVMf tfOO AvV0O494U/v\f f Of DC^Ov'tfO rvvraOTS't fvw00W9MI IH lUW nfV rvuuclH/ n/WivwWM lOt^oO 






AATHATM AAM^ft^TI LARtTTfR AA9A9Q7^ AA1<«in7d TIQ}^ 


*tlOOOc 


I0<f IO_l 


AIR't'VUA AU/niR919 AW9A1 ^A1 thJKOTAR QADTO AUUnfiAQ'tA AA91flQ1R AA9Q.T77A A17dflfi1 ^ AI7fijt?4d AfT^II^ AI977^4 A10S8297 AUfifU77 






AWItlQCnR AIC^ITIII 7>in7>tQ AIQft^ftT^ AiR71^1fi AA1RQn9A AW9^<td19 RP17ftd 
fxiOc'ttU'* AW I09DUO AlOOl >0i £.'iu/*}9 AlsO'tOf J AlOf IJia /v\l09U£l MfTCOOS l£ DC l/O^ 


410910 


iDJUUt.l 


AWQ^71>I nciRinrVT DC1C9Cnn hWlAOtXn AU/7JQARA RP1R9AQft RPIRiraK AAlOn^Q AW^I^fiit RPIRinnR RP1R9dQQ 


•119944 




AloruMM AAVunvr AkOAAAfsn Aionom 

Amilv 104 AAwfiOvw' AA£44l9v AIvU«I1mO 




407CDfi 4 
l07909.1 


AUne90ftC UnURfl AiruneCA unulOfiO 1MU9m U979^1 TAdfi94 UftHIti WMSSl WSSfftR AARSQRQI AA.S1^?n9 VUR37Rn RPQA(U19 TQI^QR 
AWjO<SnO nKHOO AIU4UD0O VVDUa^ 104040 nrOnO WVi*90r WOJOf D AAOOWM fvW l*MU£ fVOJ f 03 1 « 191090 




Aiociorrn Aunu>i9^^ MonniQ AurceiCiiflTQniiA aH^qqa? aaa/WRA") AAftAfi9D AAtUtlQfiR AA7RQnn9 
AliK>l9/U AVVU444M N^UUiO AVVD0094^ laUl 14 Al 10354/ AA0U3D40 AA0SD£O*AA00l3OO nnfOOW* 


424036 


23460.1 


AA770688 HI 5373 AW161070 BE304523 BE378517 AA989300 AA9D4029 BE254211 AA449148 A1268420 A130049S AI215637 AI300494 AI268S51 




AA928971 AA179427 AA947684 BE3937g2 H98018 A1885781 AI188557 AI290658 C15404 


426413 


266650.1 


AA377823 AW954494 AI022688 


4292S8 


301917.1 


AA448765 004967 G03045 AA658293 


431676 


336411J 


AI685464 AW971 338 AA513587 AAS25142 


432689 


35275.1 


AB01 8320 H56457 AA24791 6 N83488 N87920 AA095553 T19856 AL134279 AA094167 AI673378 At000340 R47500 W16595 AW152297 AI62S937 






AA0a2027 A1814851 AA902666 At039729 AW975053 6E302243 A1240793AA1 93203 N5561 


438869 


46651.1 


AF075009R63109R63058 


439518 


47334.1 


W76326AF086341W72300 


442677 


548626_1 


A1557914 W81031 AW473764 At814081 W81068 AW182826 AW173296 AI376594 AI220500 BE257195 BE246486 R55637 C20768 AI014407 






AI248353AW028015 


445808 


65133J 


AV655234 AW966332 AA340239 


449625 


8113.1 


NH.014253 AF100772 BE088769 AL022718 BE161779 AW863569 BE161640 AL039060 BE168542 AW295554 AA323193 AA235370 AW7797eO 






N48674AI375997 R45432D59344AI203107 F07491 R35360 R25094AI913631 AI498402 T613 


450580 


83929.1 


N40087 H12925 AA460779 AA096372 BE160847 AW816890 AW816893 AW816577 AW816891 AWB16882 AW816868 AW816689 AW816940 






AW816892AW816941 AW816576AA029183 T83320HB6850AA01029SBE160823 Z25353AA1S0883AW8 


453802 


981589.1 


AL134757AW079131 



TABLE 61C 



Pkey: Unique number corresponding to an Eos probese! 

Ref: Sequence source. The 7 (figil number In this column are Genbank Menfa'fier (Gl) nunibefs. "Dunham 1. e( al.' refeis to Sie pubOcallon enOOed The DMA sequence of 

human chromosome 2V Dunham I. et al., Nature (1999) 402:489^95. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt position: Indicates nucleotide positions of predkrfed exons. 



Pkey 


Ref 


Strand 


Ntjwsitlon 


401197 


9719705 


Pius 


176341-176452 


401519 


6649315 


Pius 


157315-157950 


401556 


7139678 


Plus 


103510-104090 


402802 


3287156 


Minus 


53242-53432 


403047 


3540153 


Mnus 


59793-69968 


404641 


9796810 


Minus 


32247-32362 


405685 


4508129 


Minus 


37956-38097 


406068 


9114084 


Plus 


382^ 



Table 62A (ists about 600 genes up^egulated in prostata cancer compared to normal adult tissues. These were selected from 59860 probesats on the Affymetilx/Eos Hu03 GeneCMp 
array such that the ratio of 'average" prostate cancer to 'average' nomia) aditft tissues was greater than or equal to 3.0. The 'average' preslate cancer level was set to the 75°' 
percentile amongst prostate cancers. The 'average' normal prostate tissue level was set to the 95* percentile amongst nonnal prostate tissues. In order to remove geno^peciilc 
badcground levels of non-^&c hybridtealion, the 10* peroenfiie value amongst norwnaBgnant boif tissues was subtracted from both the numerslor and the denominator before the 
ratio was evaluated. 

TABLE 62A: ABOUT 600 GENES UP-REGUIATED IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE TISSUES 

Pkey: Unk)ue Eos probeset UenGfier number 
ExAocn: Exemplar Accession nunnber, Genbank accession number 
UnigenelD: Unigene number 
IMIgene Title: Unigene gene GUe 



R1: 


Ratio of prostate bimor to normal prostate tissue 




Pkey 


ExAocn 


UnigenelD 


Unigene Title 


R1 


444151 


AW972917 


Hs.128749 


alpha-melhytacyl-CoA racemase 


4Z2 


421566 


NM.000399 


Hs.1395 
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AABwjBI 


Hs.315111 


41R7A9 


LjoimS 


H$.79888 


AiltrM 
410fUl 


AA014948 


Hs.96343 


AiOnH'i 
4l8UoO 




Hs.98613 


A^oonn 


AA4o0421 


HS.3087S 


AVTcnt 
40/D72 


AW7482d5 


HS5741 


439518 


W76326 




450661 


AW952160 


Hs.146550 


443907 


AU076484 


Hs.9963 


409151 


AA306105 


Hs.50785 


430968 


AW972830 




456804 


A1421645 


Hs.139851 


414291 


At289619 


Hs.13040 


420544 


AA677577 


Hs.98732 


448430 


A1500642 


Hs.289067 


429570 


BE242256 


Hs.2441 


444100 


AA383343 


Hs.22116 


443622 


AI9t1527 


Hs.11805 



N^tyigalactosamlnldase; alpha- 
Homo sapiens mRNA; cONA DKF2p434P22B (f r 
hypothetical protein FU23142 
ESTs 

ESTs, WeaWy sinilaf to T34036 hypothefl 

acOvallng transcripSon factors 

stspl)5pQcmgleetarSLU7 

ESTs 

ESTs . 

ESTs 

gb:tl88f04jc1 NCLCGAPJ»»28 Homo sapiens 
protein tyrosine phosphatase, receptor t 
gycosyttransferase 

C150C0476':fli|12737279|feIIXP_012163.1| 
KIAA0257protebt 

acyt-Ooenzyme A oxidase 1, pabnttoyl 
FSHD region gene 1 
KIAA0182 protein 
dkiliiquiiin 

ESTs. Highly slmBar to cytokine reoepto 

ESTs 

ESTs 

hypothetical protein PR01331 

smatt inducible qrtoklne A4 (homologous 

ESTs 

ESTs 

Homo sapiens cDNA: RJ23068 fis« clone L 

HSPC034 protein 

tryptophan rich tiaslc protein 

hypothetical protein FU10326 

ESTs 

ESTs 

eutcaiyotic translafon InitiaBon factor 

Homo sapiens mRNA; cONA DKF^pS66M063 (fr 

apoKpoprotein E 

doIichyl4>^IManSGIcNAc2-PP^JblichytgI 
putaOve transcription regulation nudea 
ESTs 
ESTs 

gb:zk65d12.r1 Soaresj>regnanLuteros.NbH 

chromosome 21 open reabig frame 5 

DKFZp434J1813proteh 

ATPase. aminoptnsphoBpU Iransporter-li 

RalGEF-like protein 3, mouse homolog 

extracellular matrix protein 2. female 0 

ESTs. similar to 138022 hypothell 

Homosapien5cDNAFU13492fis.clonePL 

ESTs 

ESTs 

ESTs 

KIAA01Q1 gene product 



basement menlbraneMioed gene 
hypotheOcat protein DKFZp434E2135 
ESTs 

imegrin, beta 2 (anSgen C016 (p95). ly 
mefhytenetetrahydrotiolale dehydrogenase 
senim/ghicoooilicoid regulated hinase^ 
Homo sapiens cDNA; FU22743 lis. clone H 
ESTs 

hypothetical protein DKFZp761F2014 
hypothetical protein FU22584 
ESTs 

FBJ murine osteosarcoma viral oncogene h 
ESTs, Moderately similar to A46010 X-lin 
nuclear receptor co-repressor/HDAC3 conv 
ribose S>phosphale isomeiase A (ribose 5 
ESTs. VfeaUy similar to ALUCJ1UMAN till 
Homo sapiens cONA FU 12292 lis. done MA 
ESTs 

flavohemoprotein bS7 

gb:2d60d04j1 SoaresjBtaLheail.NbHH19W 
ESTs 

TYRO protein tyrosine kbiase Undlng pro 
SEC22, vesfcle traffiddng protein (& c 
gb:EST384925 MAGE resequences. MAGL Hbmo 
caveonn2 

6 protan-ooupted receptor 86 

Homo sapiens Chromosome 16 BAG done err 

Homo sapiens cONA FU11404 63. done HE 

KIAA0022 gene product 

C0C14 (cen division cycle 14. S. cerevi 

ESTs 

716 
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436576 


AI458213 


H5.n542 


443958 


BE241660 


Ks.10029 


410037 


ABQ20725 


H5^009 


4119B8 


AI207410 


Ks^280 


421129 


BE439899 


Hs.89271 


421492 


BE176990 


Hs.104916 


428966 


AF059214 


Hs.194687 


430383 


Ai861854 


Hs.210778 


431475 


AI567669 


Hs.40342 


446466 


AI76S890 


Hs.16341 


448664 


A1879317 


Hs.334691 


404390 






408815 


AW957974 


Hs.25465 


417512 


X76S34 


K5.82226 


414888 


AL039185 


Hs.77558 


418295 


AW970043 


HS^38039 


432242 


AW022715 


Hs.162160 


433887 


AW204232 


Hs^79522 


436556 


AI364g97 


Ks.7572 


430157 


BE348706 


Ks^8543 


451259 


NM-006052 


Hs^6146 


458776 


AV654978 


Hs.19904 


400301 


X03635 


Hs.1657 


443846 


At085198 


Hs.164226 


451491 


AI972094 


H5^66221 


452234 


AW084176 


Hs^23296 


430389 


AL1 17429 


Hs^40845 


413275 


W94748 


Hs.48821 


418432 


M14156 


Hs^112 


432261 


R42216 


Hs.12342 


433409 


AI278802 


Hs^5661 


452186 


AA120761 


Hs.28307 


417187 


AB011151 


Hs.334659 


421921 


K83363 


Hs.6820 


447247 


AW359351 


Hs.287g55 


415710 


J04543 


H8.78637 


457794 


AA689292 


HS.24685D 


451124 


A)186203 


Hs.31432 


424401 


H67220 


Hs.l6g681 


435905 


AW997484 


Hs.5003 


442731 


AI868167 


Hs.131044 


418650 


6E386750 


Ks.86976 


423749 


U09648 


Hs.132390 


439394 


AA149250 


Hs.56105 


442287 


AW952703 


Hs.8182 


436758 


AW977167 


HS.1S5272 


451030 


H40697 


Hs.181426 


405934 






439231 


AW581935 


H5.1414eO 


444755 


AA431791 


Hs.1 13823 


447276 


AL049795 


Hs,17987 


431912 


AJ660S52 


Hs.76549 


428493 


AK001745 


Hs.184628 


412978 


A)431708 


Hs.820 


414962 


AF273304 


Hs.235376 


447482 


AB033059 


Hs.18705 


449590 


AA694070 


H3.26d835 


450203 


AF097994 


Hs.301528 


450899 


T77447 


H3.177BM 


423337 


NM_004655 


Hs.127337 


452S88 


AAe89120 


Hs.l 10637 


422661 


NM_014700 


Hs.1 19004 


437231 


BE303000 


Hs.288929 


450506 


NM.004460 




407317 


AI204033 


Hs^71461 


410330 


AW023630 


Hs.159425 


444143 


AW747g96 


Hs.160999 


416156 


Xa4908 


H&78060 


422975 


AA347720 


Hs.122669 


403100 






422449 


AF168418 


Hs.116784 


424163 


N»^007071 


Hs.142245 


424806 


AA382S23 


Hs.105669 


425878 


AW9848Q6 


HS.3808S 


432363 


AA534489 




443441 


AW291196 


ks.92195 


432529 


A1989507 


Hs.162245 


432908 


AI881896 


Hs.304505 


424723 


BE409813 


Hs.152337 


424655 


AW204725 


H5.25560 


414664 


AAS87775 


Hs.66295 


437162 


AW005505 


H3.5464 


421828 


AWB91965 


Hs.279789 



ESTs 4.1 

calhepsinC 4.1 

K1AA0918 protein 4.1 

Homo8api8ns,donelMAGE:3636299.inRNA, 4.1 

ESTs 4.1 

hypolhe6c3lpr(M)FU21940 4.1 

cholesterd 25%droxyt85e 4.1 

hypdhefical protein FU10989 4.1 

putaSvsnuctearpR^Bin 4.1 

MAWOtMngprotdn 4.1 

splicing factor 3a. sutHinit 1, 1 20kD 4.1 

Target Exon 4.1 

hyp^hetea) protein FU22341 4.1 

glycoprotein (tfansmemtH'ane) nmb 4,1 

OiynAt hofmono feceptor Interactor 7 4.1 

hypotheOca) protein HJ11090 4.1 

ESTs. WeaUjfsifnilar b ALU4.HUMAN ALU S 41 

ESTs 4.1 

ESTs 4.1 

ESTs 4.1 

Down syndrome crilical region gene 3 4.1 

cystathIon8SG(cystatMon]negamn»4yase 4.1 

estrogen receptor 1 4.1 

ESTs 4.1 

Homo sapiens cDNA FU13741 (is. done PL 4.1 

ESTs.Weai(!y6liD!larb66022hypothe(i 4.1 

DKFZP434D146 protein 4.0 

ESTs 4.0 

InsuDn-Iike growth factor 1 (somatomedl 4.0 

Homo sapiens done 24538 mRNA sequence 4.0 

ESTs 4.0 

WW domain tdnding protein 4 (tbrmintiind 4.0 

tiypothetical protein MGC14139 4.0 

translocase of inner mitocliondna) memlw 4.0 

Homo sapiens CDNA FU13090fis, dona NT 4.0 

annexinA? 4.0 

ESTs 4.0 

cardiac anityrin repeat protein 4.0 

deatti effector domaln^ontaiidng 4.0 

K1AA0456 protein 4.0 

ESTs 4.0 

prolyl endopeptidase 3.9 

zinc finger protein 36 (KOX 1 8) a9 

ESTs 3.9 

synaptic nuclei expressed gene 1 b 3.9 

ESTs 3.9 

ESTs. Moderately similar to YSHUT tiveon 3.9 

Target Exon 3.9 

Homo sapiens mRNA;cDNADKF2p434N079(fr 3.9 

QpX (caseinolyttc protease X. E. coH) 3.9 

tiypottietical protein MGC1 203 3.9 

ESTs, Weaidy similar to A561S4 At)i sM 3.9 

l^tlietical protein FLJ10883 3.9 

homeot)oxC6 3.9 

XP|yiC2protdn 3.9 

KiAA1233 protein 3.9 

ESTs a9 

L-l(ynurenine/^ptia.aminoa(fipateamMa 3.9 

ESTs 3.9 

axin2(conduciln,axi]} 3.8 

tiomeotxixAlO 3.6 

KiAA0665 gene product 3.8 

hypothe&al protein FU1325B8lmilar1o as 

fbm\A3sX activafion proteb, afpfia 3.8 

ESTs. Wealdy similar to 138022 fiypotheti 3.6 

ESTs 3.8 

ESTs, Moderately similar to A56194 ihrom 3.8 

ptusphofylase kinase, twta ae 

KIAA0264 protein aS 

C20Q1027':gll7296271Igb|AAF51562.1| (AEO ao 

ttiyroid homnone receptor interactor 4 aS 

HEf^-HLTR.assoctating3 aS 

k4STP031 protein 3.8 

Homo8aplens.SlniilartoRIKENcOHA0810 3.8 

gbuif76g11.s1 Na_CGAP.Co3 Homo sapiens aS 

ESTs as 

ESTs 3.8 

ESTs as 

protein arglnlne N-methyI[ransferase 3(h aS 

ESTs 3.8 

mu!ti-PDZ-doma!rvconta!ning protein . aS 

thyroid hormone receptor ooactivating pr aS 

Mstonedeacetylase3 as 

717 



wo 03/042661 



PCT/US02/36810 



10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



451524 
418821 
422049 
453160 
4ia)72 
424906 
446554 
449523 
452259 
417348 
430289 
435401 
448807 
413403 
418323 
449125 
408089 
428242 
439209 
434927 
431735 
456619 
458062 
411213 
414709 
415339 
418203 
427283 
452260 
439506 
420077 
451009 
436473 
413198 
443067 
452281 
452561 
433269 
445817 
447033 
414171 
427138 
442991 
434974 
443399 
401807 
413979 
414680 
417380 
425710 
441247 
445183 
426931 
417873 
419594 
446419 
449006 
456719 
432101 
453555 
449217 
412922 
450066 
429302 
451246 
415528 
435513 
440643 
409068 
439132 
451173 
422673 
401597 
401744 
432723 
449465 
450628 
432125 
435411 
434629 
437158 
416375 
419441 



AKD01466 

AA436002 

W2S760 

AI263307 

All 10370 

A(5660&6 

AA151730 

NM.000579 

AA317439 

AI940507 

AK0019S2 

R44477 

AI571940 

AA12910S 

NM_002118 

AI671439 

H59799 

H55709 

AF067993 

H46612 

AW977724 

AV547917 

AW978811 

AA676939 

AA704703 

NM.015156 

X54942 

AL119796 

AA453208 

<A1361238 

AW512260 

AA013140 

AI193122 

AW157712 

AI077389 

T93500 

AI692181 

AI343543 

NM_003642 

A)357412 

AA360328 

N77624 

BE281238 

AA778711 

AI452659 

BE218090 

AA743331 

T06809 

AF030880 

AW118681 

AB007877 

NM.003416 

BE266659 

AA013051 

AV658299 

AW578003 

Z437B4 

AI918950 

BE298808 

AA278536 

M60721 

H56499 

AU076674 

AW189232 

R17236 

AW404075 

M78878 

AW236991 

H72803 

AI755082 

N5g027 

AAt72108 

029677 

NNL004380 

AW382884 

AWg72667 

AW444619 

AA789081 

AW090198 

H95567 

AWD23731 



H5^516 hypoOstical protein FU106O4 3.8 

Hs.183161 ESTs as 

Hs.77631 glycine cleavage system protein H (amino 3.8 

Hs.239884 H2Bhiston6 family, member L 3.8 

H5.7g0OO growth associated protein 43 3.7 

Hs.153716 Homo sapiens mRNA for Hmob33prolBln. J 3.7 

HsJ01789 nudlx (nucleoside dIplwsphaleBnkedmol 3.7 

Hs.54443 c)temokine(OCmolii} receptor 5 3.7 

Hs.28707 signal sequence receptor, gamma (Iranslo 3.7 

Ks.318526 hypothetical prDt^FU12G61 3.7 

Hs.238039 hypothetical protehFU11090 3.7 

Hs.10056 hypothetical protein FU14621 ZJ 

Hs.7549 ESTs 3.7 

Hs.198425 ESTs. Moderately similar to S65657 alpha 3.7 

Hs.1 162 m^ hlstocompatitiifity complex, class 3.7 

Hs.196029 Homo sapiens mRNA for KIAA1657 protein. 3.7 

Hs.42644 tWonadoxi^Eke 3.7 

Hs.2250 teulwnilainhitdtoiy factor (choTinergic 3.7 

Hs.gi954 ESTs 3.6 

Hs.293815 Homo sapiens HSPC285 mRI^ partial cds 3.6 

H3.75968 thymosin, beHa 4, X chromosome 3.6 

Hs.107153 irOwbilorofgrowthfamlly.memtwl-a 3.6 

HSJ14451 ESTs, Wealdy similar to ALUlJiUIylAN ALUS 3.6 

Hs.69285 neuropin 1 3.6 

Hs.77031 Sp2trBnscriptton factor 3.6 

Hs.783g8 KIAA0071 protein 3.6 

Hs.83758 COC28 protein kinase 2 3.6 

Ks.174185 ectonudeoiidepyrophosphatase/lihosphodi 3.6 

RAB9. member RAS oncogene fanDy 3.6 

Hs.41136 ESTs 3.6 

Hs.87767 ESTs 36 

Hs.115707 ESTs 3.6 

Hs.132275 ESTs 3.6 

Hs.47534 ESTs, WeaWy similar to 138022 hypolheO 3.6 

Hs.269818 ESTs. Weakly simOar to Z195.HUMAN ZINC 3.6 

Hs.28792 Homo8aplen8cDNAFU11041fls.donsPL 3.6 

Hs.49169 K!AA1634prDtdn 3.6 

Hs.126890 ESTs 3.5 

Hs.1 3340 histoneaceMtranstbrasel 3.6 

Ks.157601 ESTs 3.6 

Hs.865 RAP1A, inemt)er of RAS oncogene family 3.6 

Hs.173717 phosphaSdfo acM phosphatase type 2B 3.6 

Hs.8886 hypothetical protein FU20424 3.6 

Hs.4310 eukayotic translation Initiation factor 3.5 

Hs.61 1 53 proteasome (prosome, macropain) 26S subu 3.5 

C700135Q:(^|6578126|9b|AAF17706.1|AF0496 3.5 

Hs.279453 ESTs 3.5 

h8mogtobin,a)pha2 3,5 

Hs.332088 ESTs 3.5 

Hs.159275 solutB carrier family, memlwr 4 3.5 

HS.126Q51 Homo sapens thymic stromal lymphopoleti 3.5 

H3.1238S KIAA0417 gene product 3.S 

HSJ2076 zinc finger protein 7 (KOX 4. ctoneHF.1 as 

Hs.293859 Homo sapiens. Similar to Rl KEN cONAA430 3.5 

Hs.91417 topo{somer8se(DNA)(l binding protein 3.5 

Hs.163959 ESTs 3.5 

H3.22826 tropomodulln3(ubiquitoiis) 3.5 

ankyrin 3, node of Ranvler (anWrin G) 3.5 

Hs.123642 EphA3 3.S 

Hs.33363 OKFZP434N093 protein 3.5 

Hs.23262 ribonuctease. RNase A family. k6 3.5 

H5.74870 H^O(OvosophilaHikehomeobax1 aS 

Hs.252692 ESTs. Weakly similar to 138022 hypolhetl aS 

Hs.198899 etdcaryoSo translation IniOalkm factor a5 

Hs.39140 cutaneous T-ce9 lymphoma tumor anSgen 3.5 

9b:yg12c04/1Soares intent brato INS H 3.5 

Hs.42785 DC11 protein 3.S 

Hs.7314 KIAA0614 protein 3.5 

Hs.102495 EST8.Moderalelyslmilarfol38022hypot 3.5 

Hs.38363 ESTs 3.5 

Hs.48317 ESTs aS 

gb.7v59d1 1 j1 Soares fetal 0ver spleen 3.4 

RagCprotein 3.4 

Target Exsn a4 

Hs.3085 KiAA0054 gene product: Helfoase 3.4 

Hs.2359d CREB binding protein (Rubinsteln-T«yUs 3.4 

Hs.204715 ESTs a4 

HomosapienscDNAFU12300fis.cfoneMA a4 

Hs.13e211 ESTs a4 

Hs.4029 glioma-ampIifledsequencMI a4 

K1AA1 ISO protein a4 

HS.1247W ESTs a4 

Hs.274368 MSTP032prDteh a4 
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432741 At732358 Hs.185118 ESTs. Moderately similar to A3741 3 calU 3.4 

439192 AW970538 Ks.105413 ESTs 3.4 

440474 Ai207936 HS.719S g8mma-8iT^bu^add(QABA}AT«»pto 3.4 

428466 AF151063 Hs. 184456 hypothetkal prolefo 3.4 

S 447397 8E247676 Hs.18442 E'1enzyn» 3.4 

426828 NM.000020 Hs.172S70 acDvlnArece(}tortn»IMika1 3.4 

418293 AI224483 Hs.16063 hypolhefical imM aJ21877 3.4 

424779 AL046B51 Hs.153053 C037 8nl!gen 3.4 

415015 NM_002109 Hs.7779a hisWyKtRNAsynlhctase a4 

10 425900 AK000166 Hs.288809 hypo(heUcalprotebiFU20159 3.4 

408113 T82427 HS.1941D1 Honu) sapiens cDNA: FU208e9 as. dona A Z.A 

452852 AK001972 H9.30822 hypothetical protein RJ1 11 10 3.4 

414671 AA832479 Hs.72080 ESTs 3.4 

431689 AA305688 Hs.267895 UDP-Gaht)etaGtcNAcbeta1.3^tosy1tr 3.4 

IS 444480 AI150730 - Hs.265640 ESTs, hAoderat8ly6ini9artoA47582B^ 3.4 

403389 C3001393*:^3327090[dfap^1613.11(AB 3.3 

440749 W22335 Hs.7392 hypothe&a) protein MGC3199 3.3 

408374 AW025430 Hs.l55591 IwkheadbaxFI 3.3 

410494 ^A38564 Hs.64016 protdnS (alpha) 3.3 

20 426997 BE620738 Ks.173125 peptldytpro)ytisonieraseF(cyctopMlin 13 

433505 AWS04027 Hs.15301 Homo sapiens COIMFU12596 lis. done NT 3l3 

400277 Eos Control 13 

411031 W37943 Hs.34892 KIAAl 323 protein 3.3 

417295 AW993524 H3.43148 ESTs 3.3 

25 438459 T49300 Hs.35304 Hoino sapiens CDNAFU13655 lis, done PU 3.3 

432169 Y00971 Hsi910 phosphoribosy) pyrophosphate synthetase 3.3 

437872 AK002015 Hs.5887 RNA Ming motif protein 7 3.3 

443912 R37257 Hs.1&4780 ESTs 3.3 

445270 AI762154 Hs.54982 Homo sapiens cDNAFU14014 (is, done HE 3.3 

30 425836 AW955696 Hs.90960 ESTs 3.3 

414565 AA502972 Hs.183390 hypothetical protein FU13590 3.3 

418700 AI963808 Hs.88970 ESTs. Moderately sMar to AlAJ5_HUMAN A 13 

428186 AW504300 H5.295605 mannosUase, alpha, dass 2A. menit)er 2 3.3 

436499 AJ276678 H5,283102 HEF like Protein 3.3 

35 448164 R61680 Hs.26904 ESTs. Moderately similar to Z195_KUItAANZ 3.3 

426848 H72531 Hs.36190 ESTs 3.3 

439653 AW021103 Hs.6631 hypothetical protein Fy20373 3.3 

450094 AI174947 Hs.295769 Homo sapiens fnRNA:fiDNADKFZp564D1 164 (f 13 

402474 NH.004079:Homosap{enscathepslnS(CTSS 13 

40 424800 A1J035588 Hs.153203 MyoOfamOy Inhibitor 13 

430280 AA361256 Hs.237868 tnleileukin 7 receptor 13 

408968 AI652236 Hs.49376 hypothetica! protein FU20644 13 

453548 AL079983 Hs.116774 bilegiln. dpha 1 13 

451387 AA923729 Hs.26322 cdl cycle rdaied kinase 12 

45 444445 AA342329 Hs.115920 Homo sapiens cONA:FU22816fis. done K 12 

445472 AB006631 Hs.12784 Homo sapiens mRKA for K1AA0293 gene, par 3.2 

400658 ENSP00000237081*:KIAA1217 PROTEIN (FRAGM 12 

408101 AWg6B504 Hs.123073 C0C2-re}ated protein kinase 7 12 

410366 AI2675B9 Hs.302689 hypothetical protein 12 

50 411048 AK001742 Hs.67991 hypothetical protein DKFZp434G0522 12 

421246 AW582962 Hs.102897 CGI-47 protein 12 

422763 AA033699 H&83938 ESTs. Moderately simnar to MAS2_HUMANM 12 

426227 U67058 Hs.154299 Human proteinase acthraled reoeptor-2 mR 3J2 

439047 AW979177 gb:EST391287 MAGE resequences, MAGP Homo 3.2 

55 444451 AV650179 Hs.282431 ESTs 12 

420568 F09247 H8.247735 protocadherin alpha 10 12 

.406973 M34996 H5.19B253 ma|orhistocompatibaity complex, dass 12 

402802 NM_001397:Homo sapiens endotheOn conver 12 

431304 BE157283 gb:RC4.HT0373-13020a011-a03HT0373Homo 12 

60 406247 Target Exon 12 

411653 AF070578 Hs.71168 Homo sapiens done 24674 mRNA sequence 12 

412639 AW961284 H3.296235 ESTs 12 

419168 AI336132 Hs.33718 Homo sapiens cONAFU 12641 lis. done NT 12 

433561 BE540937 Hs.20104 hypothetk:al protein FU00052 12 

65 436207 AA334n4 Hs.12845 hypothetical protein MGC13159 12 

438461 AW075485 Hs.286049 phosphoserine ambwtfsnsferase 12 

455473 AW984788 gb:RC1-HN0015-1204004»U07HN0015Homo 12 

441683 BE564214 Hs.102946 ESTs 12 

417386 AL037228 Hs.82043 0123 gene produd 12 

70 427288 AW732802 Hs.2132 epUennaigimvth factor receptor patfiw^ 12 

442355 AA456539 Hs.8262 lysosomd-assodded membrane protein 2 12 

422473 U94780 Hs.117242 menlngtoma expressed anUgen 6 (coiled^ 12 

407816 AW500857 Hs.40137 anaphase-promoting complex 1; meiollcdi 12 

458809 AW972512 H$.20985 slnS^odated pdypeptide. 30kD 12 

75 453884 AW021407 Hs.2106B liypolli8fk»l protein 12 

448944 AB014605 Hs.22599 atiophln-1 faiterading protein 1; adM 3.2 

435129 Ai381659 Hs.267086 ESTs 12 

424894 H83520 Hs.153878 reproducfonS 12 

416959 D2B459 Hs.606l2 ubiquiSivconiugating enzyme E2A (RA06 h 12 

80 410185 BE294068 Hs.737 ImmediatB early proton 12 

451149 AL047586 Hs.10283 RNAbindinsmdif protein 88 12 

410531 AW7529S3 gti:OVaCT0224-2610994)3&«02CT0224Homo 12 

411954 AA652523 Hs.269496 ESTs. WeaUysimBar to 138022 trypotheS 12 
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10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



423746 
444489 
453124 
452413 
409390 
453078 
453024 
439444 
441124 
416128 
445101 
407949 
413492 
421044 
422603 
427399 
434804 
442906 
445840 
446084 
435293 
400750 
416852 
423566 
409558 
400194 
400479 
402895 
403423 
408161 
410886 
414922 
420121 
430519 
447147 
422481 
430487 
409549 
423554 
431286 
430226 
408051 
420090 
400296 
414279 
440426 
423872 
420154 
447560 
453293 
434398 
414135 
441527 
423957 
444001 
411315 
414178 
441401 
457718 
429500 
424240 
447476 
419440 
421786 
433285 
409997 
417874 
420956 
445345 
450671 
427315 
434953 
438469 
428130 
414839 
445784 
446677 
426448 
445921 
408673 
430387 
427157 
435907 



Ai149048 

Ai151010 

AI13905B 

AW082633 

AI927212 

AF053551 

AVV846787 

AI277652 

T97717 

AA173632 

T75202 

W21874 

087470 

AF061871 

BE242587 

NM-014883 

AA64g530 

AW298888 

AI277811 

ALI 37658 

AI040777 

AF283776 
AW976434 
8E253407 



AW952912 

AW809324 

000723 

AW968271 

AF129534 

AA910353 

AL050163 

D87742 

AB029015 

M90516 

AW263476 

BE245562 

AI623351 

AA220238 

AA305627 

AW021691 

AI159800 

AB020316 

Ai093155 

AF065214 

AA382267 

AA121098 

NM_004419 

W19504 

AW978309 

A)095087 

AW836547 

AW957372 

AI824338 

F18572 

X78565 

AB023185 

BE293466 



AI188653 

AW975944 

AI906055 

BE616160 

AA351584 

AWD03850 

AI356967 

AA179949 

8E049102 

AK001455 

AW444985 

X63692 

AI253155 

A1800311 

R06054 

AW015211 

M34996 

AW372B84 

U51166 

AW304025 



Hs.30211 


hypoOieficai proton FU22313 


X2 


H3.157774 


ESTs 


U 


Hs.125790 


leudnMich repeatoxvlaining 2 


3.2 


K5.212715 


ESTs 


3.1 


Hs.3734 


ESTs 


3L1 


HS.31S84 


(netaxin 2 


3.1 




gb:aV3^0194-181099meO1 CT0194 Homo 


3.1 


Hs.54578 


ESTs, WeaMy simOar to 138022 hypoM 


3.1 


Hs.119563 


ESTs 


3.1 




C0C14 (cell division cycle 14. S. cerevl 


3.1 


H3.12314 


Koax) sapiens cnRNA; cONA DKFZp586C1019 (f 


3L1 


Hs.247057 


ESTs. Weakly &irralar to 2109260A B ceO 


11 


Hs75400 


IOAA0280 protein 


3.1 


Hs.101302 


Human DMA sequence from done RP1-238D15 


3.1 


Hs.1 18851 


hematopoletically expressed homeobox 


11 


Hs.177664 


K1AA0914 gene product 


11 


Hs.348148 


eb:ns44105^1 NQ^CGAPJUvl Homo sapiens 


11 


H5.170939 


ESTs 


11 


Hs.146291 


ESTs 


11 


Hs.13768 


Homo saptens mRNA; cONA 0KFZp434i1216 (f 


11 


Hs.117170 


ESTs 


11 




Target Exon 


11 


HS.8028S 


Homo sapiens mRNA; cDNA DKFZp586C1723 (1 


11 


Hs.3623 


liypothetica) protein FU11220 


11 


Hs.72363 


Homo sapiens mRNA for FU001 16 protein. 


11 




Nlil.003763:Homo sapiens synlaxin 16 (STX1 


11 




Target Exon 


11 




ENSP000002521 17:MyeIin transcription lac 


11 




Target Exon 


11 


Hs^0383 


t^ypoihetical protein MGC3032 


11 




gb:MR4^T0121-141099^0^.1 ST0121 Ho 


11 


Hs.77631 


glycine cleavage system protein H (amino 


11 


Hs.191534 


ESTs. Wealdy simnar to ALULHUMAN ALU S 


11 


Hs.49210 


F-box only protein 4 


11 


Hs.292815 


ESTs, Wealdy similar to T23482 hypolheti 


11 


Hs.117339 


DNAX-flctivdtion protein 10 


11 


HS.2415S2 


KIAA0268 protein 


11 


Hs.54886 


phosphoQpase C, epsllon 2 


11 


H3.1674 


glutamine-ftuclose-6-pliosphate transamin 


11 


H$.44268 


myefin gene expression factor 2 


11 


KS.25S1 


adreneiglc beta-2-, receptor, surface 


11 


Hs.172148 


ESTs 


11 


Hs.94986 


ribonucleaseP{38icD) 


11 


Hs.139336 


ATP-blndlng cassette, sub-family C (CFTR 


11 


Hs.7181 


GCN5 (general control of amino-acid synl 


11 


Homo sapiens cONA FLJ13663 % done PL 


11 


Hs.134015 


uronyt 2-^uift^nsferase 


11 


Hs.95420 


JM27 protein 


11 


Hs.18858 


phosphollpasa A2, group IVC (cytDSoBc. 


11 


Hs.10653 


ESTs 


11 


Hs^38 


senim-indudbte kinase 


10 


Ks.2128 


dual specificity phosphatase 5 


10 


Hs.7884 


solute canter family 21 (organic anion 


10 


Hs.136235 


Homo sapiens cONA FLJ13542 fis, done PL 


10 


Hs.152299 


ESTs, Moderateiy sImilBr to S65657 alpha 


10 




gb:PM34.T0032-03010^006^08 LT0032 Homo 


10 


Hs.46791 


ESTs, Wealdy similar to 138022 hypotheO 


10 


Hs.126891 


ESTs . 


3.0 


Hs.22978 


ESTs, Wealdy sInflartoALU4.HUMAN ALU S 


10 


Hs.289114 


hexabracHon (tenasdn C cytotactin) 


10 


H3.143535 


caldum/calmodunn^pendent protein kin 


10 


Hs^880 


ESTs, Weakly similar to 138022 hypolheti 


10 


Hs.90419 


KIAA0882 protein 


10 


Ks^1351 


ESTs 


10 


H8.237396 


ESTs 


10 


Hs.57749 


synaptic nuclei expressed gene 2; KiAAtO 


10 


K$.82829 


proldn tyrosine phosphatase, non-recept 


10 


Hs.100543 


Homo sapiens done 24505 mRNA sequence 


10 


H5.12532 


chromosome 1 open readSrig frame 21 


10 


Hs.43086 


ESTs, W^ similar to A46010 X-flnked 


3.0 


Hs.175563 


Homo sapiens mRNA: cONA DKFZpS64N0763 (f 


3.0 


Hs.179982 


ESTs. Wteddy simBar to TRHY.HUMAN TRICH 


10 


Hs^198 


Down syndrome crtticd region gene 2 


10 


Hs.77603 


ESTs 


10 


Hs.77462 


DNA (cytoslne^)-mathyllransfera5e 1 


10 


Hs.146065 


ESTs 


10 


Hs.156291 


ESTs 


3.0 




gb:ye89g07j1 Soaresfela) Over spleen 


10 


Hs.146181 


ESTs 


10 


H5.198253 


mafor hbtocompatiljlli^ comptex, dass 


10 


Hs.240770 


nudear cap binding protein subunit 2, 2 


10 


Ks.173824 


thymine-ONA glyoKylasa 


10 


Hs.258397 


ESTs 


10 



720 



wo 03/042661 



PCT/US02/36810 



^'^^^^^ HomosapIensmRNA;cONADKFZp56401262(f lo 
tl^l '^^^ tfyptophanylWNAsyntl»lase2(initocto 10 

45^ ""^^ EST8.WbakV8lmflartoimn8m!ipio^ Jo 



TABLE 62B 



I Unique Eos probesel fdenfitier number 

i U CAT number Gene cIusIbt number 



Accession: Genbank accession numbera 



Pksy 

410531 
410886 
411315 
411962 
414279 



CAT Number Accession 



415528 
416128 
416882 

422873 
426448 
430935 
430966 
431304 
431676 
432125 
432363 
437158 



439047 
439518 
4S0506 



15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



403423 
403651 

OU 405141 
405934 
406038 
406247 



1207200.1 

122S822_1 

123857DJ 

t26744J 

143227J 



414680 147525.1 



1539409.1 

157163.1 

162718.1 

219674.1 
267323 1 
325772^1 
326269 1 
331266 1 
336411 J 
341776.1 
345469.1 
43392.5 



437866 44433.? 



450580 
452260 
453024 
455473 
456719 

TABLE 62C 



468139.1 
47334.1 
836_1 

63929.1 

9074.1 

944876.28 

1292917.1 

222707.1 



AW752953 H88044BE156092 
AW809324 BE144977 BE144956 
AWB35547 AW835513 AW838587 
AA099050AA099526 T47733 

^^™«^E'^.S.^f'^^^ «AIW««AW242437AA618148 
R17236RS2580F11642 

AA173632 AI17485B AA5B1361 Ar700024 AA173988 BE165417 AI386964 

N59027 AA314694 N53937 R08100 a»v«04u Btuw 

R06054 AA378769 AW956453 

SoSK'iS 

BE157283 BE157287 AA502438 
AI6BS464 AW971 336 AA51 3587 AA525142 
AW972667 AAS26539 AI057032 AW167642 
AA534489AW97Q240AW970323 

W76326AF086341W72300 

7^«^S?!JSl'^?l!'i?^^ AVV9M821 AWM48»AW ^^^^ 

AW196918AA948287AI953735AI263TOAA319189AW964436AI90M«AV«94^ 



Ptey: 
Ret 

Sbaid: 
WLBOslllon: 



Unique numbef canEspondins to oi Eos pnbesal 

Indicates ONA strand from which exons were predicted. 
Indicates nucieotide positions of pfKfided exons. 



Pkey 

400479 

400658 

400750 

401197 

401519 

401597 

401744 

401807 

402474 

402802 

402895 

403100 



Ref 


Stiaid 


8439786 


Minus 


8118459 


Minus 


8119067 


Plus 


9719705 


Phis 


6649315 


Rus 


3293210 


Rus 


2576349 


Pius 


7331536 


Plus 


7547175 


Minus 


3287156 


Minus 


9967547 


Rus 


8954402 


Minus 


9438331 


Minus 


7105492 


Phis 


7708872 


Rus 


8667030 


Minus 


8960911 


Rus 


6758795 


Plus 


8389537 


Rus 


7417725 


Minus 



NUposHion 

115386-116348 

73525-73644 

18899M99168.199316-199548 

176341-176452 

157315-157950 

65838^031 

14595-14751 

152325-152912 

53526.63628.55755-55920^530^57 
53242-S3432 

85537-65671.86379-664e9 

13683'13874 

163415-163634 

6934069615 

22733-23007 

39624-40072 

9986M00054 

159913-160605 

3776«7877 

46234^6461 



721 



wo 03/042661 



PCTAJS02/36810 



Table 63A fists about 366 genes sIgniOctntty dowrHegulated In prostate cancer compared to nomial prostate. Tl»se were selected from S9680 probesels on the Aliymelrix/Eos Hii03 
GeneChIp array swh that the ratio of •average' normal prostate to •average* prostate cancer tissues viras greater than or equa) to 2.5. The "avefage" normal prostate level was set to 
Die 75°" percentile of normal prostate tissues. Tlie 'average* prostate cancer level was set to the 75* percentile amongst tumor samples. In order to remove gene^spedfic 
background levels of non-spectfichybrfdizatton. the 10* pa was subtracted brim both the numerator and the der»minatorbelbre to 



TABLE 63A: AB OUT 366 GENES SIGNIFICANTLY DQVW-REGULATH) IN PROSTATE CANCER COMPARED TO NORMAL PROSTATE 

Pkey: UMque Eos probeseHdentllter number 
ExAocn: Exemplar Aocesston number, Genbahk accession number 
UnigenelD: Unlgene number 
Un^ene Title: Unigene gene title 
JM: Ratk) of normal prostate fissue to prostate tumor Bssue 

Pkey 
425932 
407245 
425545 
426752 
400440 
412519 
413778 
420813 
415337 
407013 
433331 
432117 
453863 
431847 
408855 
448059 
403612 
405001 
441490 
435805 
401917 
455649 
450216 
404606 
432326 
459708 
440808 
458339 
454278 
417032 
434352 
439175 
439749 
417364 
400831 
416935 
442082 
452625 
440965 
456815 
430611 
415981 
441040 
442764 
411426 
433081 
401896 
445323 
458895 
417729 
431627 
420721 
449519 
442089 
436781 
402797 
404267 
442931 
418626 
423772 
457136 
436404 
458840 
403649 
435866 
423871 

722 



ExAccn 


llntnpnA IH 


1 Inlnnna Ttlla 


D1 
Kl 


M81650 


Ks.19K 


oviiiwiivumji 1 

6tjn 


MA 
Otf.*) 


X90568 






N98529 


Hs.156295 


HofnAfi9fliAn& dfinA Mr^r^19^1 mRMA mm 




X69490 


Hs 172004 


tilin 


£9.9 


X83957 


H5.83870 


nehufin 




AA196241 


Hs739B0 


tiTWWntn T1 slrplptol filnui 
uu|JUiun 1 ii 5ise«Bidi« oiiiw 


14 Q 


AA0g0235 


Hs.75535 


mvosin. Haht ootvoeDiida Z reaidalQiv 


12.0 


X51501 


Hs.99949 




11 A 


244881 


Hs.9012 


PQTs UUbsitrli/ etmnar in C^RRCTt nKIA-Mn«41 

CO 1 9, weoKiy strnuar vo oxoow UN/W}inOi 


10.2 


U35637 




gu.nijnian neuuiiii mnriA, paniai cos 


9.1 


Ar73881S 




PQTs 


8.9 


AL036195 


Ks.2909 


protamlfie 1 


n 4 


X02544 


Hs.572 


orosomucoid 1 


D 9 
9.C 


A1791314 




yu.ao*tog 1 oirauigonB lung csrcinoiTia 


7.4 


T83061 




nomo sapiens mwjA wr kiaai rzi protetfii 


7.1 




^170425 


CO IS 


6.6 






Tfltnpf Pynn 


tiA 
0.^ 


U58196 




Intfirlpiiktn enhanMr hindinB fiwbv 1 


R 9 


N46901 


Hs 266720 


ESTs 


R 9 




Lie iRiQR 


CO IS 


6.0 


AL050149 




RAM HnHiim nmlAfai ^ 


5.9 






go.r%ui--oiiui«>iiuoiA>*vif ^Uh diujio nomo 


5.7 


R02018 


Hs.1 68640 


Bnkvlasifi. omnnxsivR ffmiMAN hnmr^m 


a.D 






Tomol CvAn 
1 dlycl CAUil 


0.0 






amiloride^ensithre cation charmel 2, ne 


5.5 




ns.ioooi« 


nomo sapiens cuna rLi 14 io< ns, cnne ma 


5.5 




Hs.7432 


hvnnlhAlWd nn^n n HfU?? 

nypoinoiicai pruiein ruiwr r 


5.5 


AW976B53 


Hs 172843 


ESTs 


R 9 


/^217525 


H5.49OO2 


rVtum ci/fi/linma noil flrlliAelMft vnAlBmila 


C 9 


AA1Q9AfiO 


n5.£r 1000 


CCTs 

tolS 


5.2 






smau musoe pnnein, AHinKea 


5.1 


AFQBfi021 
nruoouci 


Urn 97111? 


CO IS 


e A 




LU imn 


Homo sapiens mRNA full length Insert cDN 


4.9 


N73749 




CO IS 


A D 

4.9 








4.8 




gozpof lUiM 1 ouaiagene neLa ceo so 89 


4.8 




Ua 7114 


CCTb 
COlS 


4.6 


AA724771 


Hs.61425 


ESTs 


4.7 


AI523646 


Ks.169859 


ESTs 


47 


NM.013348 


Hs.144011 


potassium inwardly^iifying channel,su 


4.7 


AA837120 


Hs.1564ai 


ESTs 


4.6 


R35694 




gb.'yg67b04.r1 Scares Infant brain t NIB H 


46 


AW449782 


Hs.178803 


ESTs 


4.6 


A1762254 


Hs.131122 


ESTs 


4.6 


BE141714 




gb:avO-HT0101-0610994>32<04HT0101 Homo 


4.6 


Z85986 


Hs.186757 


Homo sapiens, done MGC:5564, mRNA, comp 


4.6 






Target Exon 


4.5 


AW207282 


Hs.213049 


ESTs 


4.5 


A1535663 


Hs.39379 


ESTs 


4.5 


Z43798 


Hs.6777 


ESTs 


4.5 


AW609720 


Hs.265540 


HSPC042 protein 


4.5 


AA927802 


Hs.159471 


ZAP3 protein 


4.4 


W04244 


Hs.49829 


ESTs 


4.4 


AI801500 


Hs.128457 


ESTs 


4.4 


AI914535 


Hs.221377 


ESTs 


4.4 






Target Exon 


4.4 






NM^00434S*:Honx) sapiens lunt^elated tra 


4.3 


AIQ24376 


Hs.150473 


ESTs 


4.3 


AW299508 


Hs.135230 


ESTs 


4.3 


AA306637 




EAP30 subunit of ELL complex 


4.3 


AA428240 


Hs.126083 


ESTs 


4.3 


AW968556 


H5.137240 


Homo sapiens mRNA for par&l 3UTR. seq 


4.3 


A1560296 


Hs.174782 


ESTs^WeeMy sinotar to KIAA1437 protein 


4.3 






TatgetExon 


4.2 


AA704S38 


Hs.119740 


ESTs 


4.2 


AA331908 




gb:EST35B05 Embryo, 8 week 1 Homo sapien 


4.2 
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436190 


AK001059 




glKHomo sai^ens cONA FU10197 fis, dona 


4.2 


400164 






pne ContfCJ 


4.1 


435519 


AI218950 


Hs IT^SI 




4.1 


425094 


A)955956 


Hs.21417 


ESTs 


4.1 


41592B 


R46799 


H$.23966 


ESTs 


4.1 


415746 


AA16TC70 


H5^1413 


solBto csnfer tBnfly |2i {potssdunvcM 


4!l 


426663 


AI02B767 


Hs^2603 


ESTs 


4^1 


453752 


AL120800 




9b:OKFZp762E152_r1 762 (synonync hfTiBi2) 


4,1 


404260 






Target Exon 


4.0 


453412 


AJ003290 




gbA)003290 Sdxted chromosome 21 cONA 


4.0 


401411 






B^P00000247l72*:HyPOTH£TICAL 12&2 kOa 


4.0 


415925 


H09474 


Hs.202341 


Honu) sspiens cDNAi FU23573 fis, dono L 


4.0 


402817 


H241S5 








449233 


BE046401 


Hs.195511 


ESTs 


3.9 


435457 


AA682421 


HS.5912S 


ESTs 


3.9 


444105 


AW189097 


Hs.166597 


ESTs 


3.9 


430610 


AI821465 


Hs.188810 


ESTs. Weakly similar to ALU6_HUIi/IAN ALU S 


3.9 


411320 


AW836646 




Gb'PM3-LTQQ32-09010[MK)8^QS LTm3? Hqitv) 


3.8 


409679 


BE250521 




hfifnnlflo rpaa femilv nnAfTihpr A 


w.O 


415386 


Z43087 




gb:HSC13A121 normalized in^ brain cDN 


3.8 


446175 


AL036568 


Hs.291 




3.8 


459707 


AA631362 


Hs. 120866 


□b-nnBSbOI^I NCI CGAP Thvl HomasaDiens 


3.8 


406917 


X65964 




9b'H.sapiens nestin geno. 


3.8 


455887 


BE154173 




ab-PM1-HT0340.2012fiftD04iif12 HTfOdO Hum 


0.0 


443313 


A(796730 


Hs.55513 


ESTs 


3.8 


452351 


AA025647 




Qb!2efi5d01j^1 Soams feta) haal NbHH19W 


3.8 


400499 






C10001858:gi|6679124iref|NP_032759.1| no 


3.7 


411629 


AW865749 




ab-O\/3.SKIQ09t.1QflS00.1B&cha SMflQ91 Homn 
ywiWv<^o nuvc i* luuovi^ iii9ifUi> 0111AI& 1 nutiiu 


Off 


408229 


AW176091 




Ob:OV(V6T0107-2S0899'007-fa08 BT0107 Homo 


3.7 


427318 


AF186081 


Hs.1 75783 


7inc trflnsnnrtAT 




417620 


R02530 


Hs.191198 


ESTs 


3,7 


404660 






Cg00Qfl41**oil12G54G91fahlAAHQ11&S ItAAHO 


3.7 


409144 


AVV341187 


Hs.279714 


ESTs 


3.7 


438524 


AA922236 


Hs!221037 


ESTs 


3.7 


459200 


Y09306 


Hs.30148 


hofitaodofnafn-lntefadinn nretaln tdnasA 3 


3.7 


418833 


AW974899 


Hs.292776 


ESTs 


3,7 


424887 


AA34ai12 


HS.2S9461 


ESTs 


3.7 


404984 






Tsgat Exon 


3.7 


454549 


AW806910 




gbXlV4-8T0023-160400-172-fl)4 ST0023 Homo 


3.7 


426736 


AA43161S 


Hs.1 30722 


ESTs 


3.7 


434552 


AA639618 


Hs.325116 


Homo salens. doM MG&SSfiS mflNA. comn 


3.6 


438040 


AI222422 


Hs.121646 


ESTs 


3.6 


443618 


T82009 


Hs.3fM1700 

1 l0»VVtf 1 WW 


htfOBthaficfll Dffllrin F1J90797 


3.6 


442097 


AW015799 


Hs.1 26474 


ESTs 


3.6 


435186 


.AL1 19470 


Hs.145631 


ECTs 


3.6 


429853 


A6020625 




hirtvmnhninJikA "i 


1 R 
O.D 


456521 


AVV373450 


Hs^8fi212 


hvnoihAti«)l nmipin F1 I1179Q 

IIJpVUICW<01 piVlBIII rldtl III £9 




453282 


AK00Q043 


Hs.32922 


hvnolhetica] ointein FU20Q36 


3.6 


400749 






NM 0Q31fl'!*'HfimnRan!AnR£nrinin-m)afnH 


3.6 


450295 


AI766732 


H3.210628 


ESTs 


3.5 


421154 


AA284333 


Hs.287631 


Homo salens cONA FU14269 fis. dona PL 


3.5 


434378 


AA631739 




EST 


0.3 


421342 


AA504749 




nh*nfl{UfnR r1 ClUkP CiTRi Unmn eaeitmi 


0.3 


406704 


M21665 


Hs.929 


llljuatll) iR3<JVy |MIJ|lc)>UUo f , CaiUldC fHUS 


0,3 


429043 


AI824977 


Hs!l45319 


ESTs 


3.5 


438211 


T08401 


Hs.290856 


ESTs. ModeralBiy similar to ALm.HUMAN A 


3.5 


405258 






Target Exon 


14 


440207 


AI371978 




ESTs 


0«*l 


445045 


AI652676 


Hs 147256 


ESTs 




406177 






Tamfil 


3.4 


438535 


109078 




gb.Homo sapiens iiiRNA frsgmsnt 


3.4 


405735 






ENSP000002S2164*-KIAA1S78 omtdn (Fratm 




445797 


AI253414 




Qb-8a14f04j(1 Slaniev Fmntd NS oool 2 


3.4 


444286 


At625304 


Ks 190312 


ESTs 


3.4 


442027 


AI652926 


Hs.1 28395 


ESTs 


3.4 


447889 


AW469180 


n9>^M90 


CO 19 




405152 






Tomot f-ynn 


0.1 


402460 






C1D01261''Qil2&959791efnbICAA7fi8<>d llfYO 


3.4 


448516 


AW8S8S95 




abd^C1«NNi0073-28040(M)1 1<o09 NN0073 Honu 


3.4 


450588 


AA010319 


rl5.0VM09 


C9I5 




447600 


AI420990 




ESTs 


a4 


406085 






Target Exon 


3.3 


417959 


AWg77642 


Uj 291742 


ESTs 


3.3 


416872 


L44284 


Hs.12915 


ESTs 


3.3 


446593 


W79572 


Hs.13277 


hypolbeHcal prddn FU22054 


as 


446516 


AW996692 


Hs.257557 


ESTs. WeaMy similar to 138022 hypothati 


as 


415896 


H08311 


Hs.14822 


ESTs. Weakly to 178885 seilneAh 


as 


410140 


AL134435 


Hs.247837 


neuron 3 


a3 


458539 


AI733837 


Hs.145681 


ESTs 


as 


422399 


AW41038D 


Hs.116056 


mesendiymai stem ceQ protein DSC43 


as 


413381 


BE09Q690 
AKD0t494 




gb:RC1-eT0720-280300-01 1-g02 BT0720 Homo 


as 


429422 


H$.202S96 


Homo sapiens cOHA FU10632 fis, dona l«rT 


as 


428561 


AW973652 


Hs.283105 


ESTs 


a2 
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445608 
449017 
415908 
457733 
449340 
428134 
434407 
429973 
418092 
411356 
443366 
422069 
420778 
403451 
451686 
423837 
422731 
450049 



401Z7B 
418087 
421813 
402490 
448001 
445316 
405150 
413784 
425198 
433224 
421894 
419386 
402422 
435849 
405422 
420543 
410627 
409079 
406600 
408025 
426349 
444576 
430661 
441335 
459647 
435713 
407275 
456103 
435200 
449245 
428132 
443454 
423732 
411944 
432718 
435534 
430348 
410289 
453126 
459421 
441543 
442252 
419254 
443727 
419896 



415098 
415131 



433942 
400461 
442640 
438814 
438361 
422482 
407142 
454319 
442907 
456075 
438690 
452570 
408493 
447556 
451632 



AI830851 Hs.200014 EST5.WeakfyslmlbrtoAaDICPR0UNE^ 

AW002425 Hs,224142 ESTs 

H08623 Hs^833 ESTs 

AW974812 HSJ91971 ESTs 

AW235786 Hs.195359 hypolhetical protein MGCl 0954 

AM21773 Hs.161008 ESTs 

AW815333 gbK3V^ST02154»0100^B3fl01ST02l5H«no 

Ai423317 Hs.164680 ESTs 

R4S154 H3^8439 ESTs 

H45377 gb.7n99h03.r1 Soaras aduit brain N2b5HB5 

AI053501 Hs^8869 ESTs, Moderately simflwta 21 09260A Be 

AJ010063 Hs.343603 tifbvcap (telelhonin) 

AW970512 gb£ST382593 MAGE resequences. MAGK Homo 

Target Exon 

AA059246 Hs.110293 ESTs 

AW937063 Hs.275150 gb:PM3^)T0037-23129M01-g11 DT0037 Homo 

A1.138411 gb:DKFZp434A1229.r1 434 {synonym: htesS) 

AI681234 Hs.258509 EST 

AB037798 Hs.ie8790 KIAA1 377 protein 

TaigelExon 

AA961613 Hs.127838 ESTs 

BE048255 gb;tz49b05.y1 Na.CGAP.Bm52Homosap{en 

Target Exon 

AW293237 H3.202037 ESTs 

AI219833 Hs.166767 ESTs 

TarigetExon 

BE165819 Hs:W7664 ESTs 

AA352090 Hs.1280C3 hypothetic^ protein FU21 213 

AB040919 Hs.210958 KIAA1 486 protein 

AI418464 H$.190836 ESTs 

AA236667 ESTs. WlBaMy sinflar to 138022 liypolheti 

TacQetExon 

BE305242 Hs.16098 claudin2 

ENSP0000021665B':HYPOTHEnCAL 133.5 t(Da 

AA278221 Hs.173344 ESTs 

AA1B1339 Hs.929 myosin, heavy polypepSde 7. cardiac mus 

WB7707 Hs.82065 InterleuMn 6 signal transducer (gpl 30. 

Target Exon 

AI692784 Hs.41767 PTD002 protein 

AI308855 Hs.301497 arginytlransferase 1 

AI400974 Hs.18»)45 ESTs 

AC00S551 Hs.130714 Homo sapiens HSPC323 mRNA. partial cds 

6E222470 Hs.150825 ESTs. Wealdy simitar b antigen NY-Ca33 

R34107 Hs.321450 pregnancy 8pec8icbeta>1-glycqprotein1 

AA699313 Hs.114071 ESTs 

A1364186 gb:qw34h07j(1 NCLC6AP_Ut4 Homo sapiens 

Z39430 Hs.72350 ESTs 

AA670310 Hs.145903 ESTs 

AI636539 Hs^4296 ESTs 

AA421765 gbzu24g02.s1 SoaresJihHMPlLSI Homo sap! 

AI057494 Hs.133421 ESTs 

AF0580S6 Hs.132183 solute carrier family 16 (monocaiboxyfic 

AW877139 gbX3V2-PT0010-160400.133fl01 PTOOlOHomo 

AA563943 H5.244371 ESTs 

AA830927 Hs.1 17306 ESTs 

AA476915 Hs.189225 ESTs. WeaWysimnar to 138022 hypothetl 

AWg01618 Ks.61935 Homo sapiens mRNA; cONA OKFZp761l071 (fr 

AA032155 H5J61622 ESTs 

AA233154 KS.12S32 chnamosomel open reading frame 21 

Ar733014 Hs^69715 ESTs 

AI733395 Hs.129124 ESTs 

Ai469453 Hs.222760 ESTs 

Z2S389 Hs.16459 ESTs 

299362 ob:HSZ99362 DKFZphamyl Homo sapiens cOKA 

AW003512 Hs^2770 arachldonate f^genase 3 

D59687 gb:HUM0S6E10B Cbntech humanfetd brain 

0611 19 gb:HUM158C1 IB Oontech human fetal brain 

Target Bcon 

AWZ72166 Hs.123465 ESTs 

Target Exon 

AI205646 Hs.147220 ESTs 

AA826278 Hs.249596 ESTs 

AA805666 Hs.146217 Homo sapiens cONA:FU23077fis. done L 

AI439905 Hs.344476 gbii57g08.x1 Na.CGAP.Lym12 Homo sapien 

AA412535 gb2t99b10Al Soares.testi8j4HT Homo sap 

AW247736 Hs.101617 EST8.WeaMysimDartoT32527hypoth8Q 

AI023763 Hs.79707 ESTs 

N73442 gb:yz31h09j1 Soaresjnuniplejscterosis. 

AA815031 Hs.123598 ESTs 

AW881293 Hs.336940 Homo sapiens cONA: FU20861 fe, done A 

BE206854 Hs.46039 phosphoglycerate mutase 2 (musde) 

AI88S187 Hs^10710 ESTs. Moderately similar to AUUS HUMANA 

BE005934 HS.31062S EST 
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426461 
401656 
407269 
435754 
433565 
451004 
425223 
405770 
456227 
454445 
4194S4 
427639 
428079 
406337 
401864 
406881 



450044 

403630 

445514 

446362 

432492 

430869 

434316 

413155 

433329 

446523 

449923 

453826 

405578 

432789 

455791 

449109 

436255 

415984 

402844 

456666 

414620 

404979 

412318 

424361 

412542 

441975 

457021 

457861 

439204 

415642 

446847 

443359 

447336 

449045 

422185 

436030 

449589 

436092 

415054 

412908 

409583 

4104 B3 

435383 

426629 

415831 

412281 

434898 

422229 

447518 

458546 

438648 

450399 

420833 

453903 

443650 

427419 

423741 

451577 

441358 

402706 

436054 

402749 

442472 

445762 

441420 

405564 



AW963941 

AJ24S210 

AA700752 
AAS99763 
AA044967 
AA3S282S 

TB4239 

AW749432 

WD1060 



Ks.117341 
Hs.112520 

Hs.146065 

Hs,189788 
Hs.3*382 



AW444530 Hs,105362 
AA421020 Hs.208919 



016154 

AA442884 

R66444 

AI241280 

AW612481 

AW275110 

U22491 

AW411330 

BE067952 

AF015041 

NM.003063 

BE258051 

AL138129 

026361 

BE090689 

AW270992 

F01143 

R19046 

AA452512 
AA150120 

AW936911 

AK001551 

AW961516 

AW173248 

AW968934 

AW450205 

AF087987 

U19878 

T51454 

AI792583 

AW139383 

BE072483 

AL1 17530 

R02287 

AW752437 

AI345995 

AI733907 

AA121913 

AW440117 

BE163567 

R61083 

AI203933 

H1S145 

AI810054 

AW500458 

AF134414 

TB0061 

AI215667 

AA813125 

AW5t1049 

R47948 

AW299606 

AI699330 

NM_000200 

AA330362 

N69101 

AW173212 



Hs.201201 
Hs,51B91 

HSJ48906 
Hs.104105 
Hs,271106 
Hs^48117 
Hs.118796 

H3.199291 
H5334629 



Hs.3104 

Hs.120949 
H8.284284 
Hs.5010 

Hs.76719 
Hs.244621 

Hs.326729 

Hs.145944 

H8.95097 

H5^285 

Hs.173108 

Hs.305890 

Hs.42696 

Hs^224 

Hs^5 

H3.135354 

Hs.245437 

Hs.278337 

Hs.112822 

Hs.121052 

Hs.135258 

Hs.127383 

Hs.293898 
Hs.256879 



Hs.97142 
Hs.30509 
Hs.14119 
Hs.29956 
Hs.113271 

Hs.175044 
Hs.146335 
Hs^2007 
Hs.188732 
Hs.232777 
Hs.151444 
Hs.177888 

Hs.40730 
Hs.129041 



AI076282 Hs.119813 



AW806859 

AI734002 

AA932872 



HS.264S90 



flb:EST375014 MAGE resequances. MAGH Homo 
TaigslExon 

gtKHomo saptens mRNA for Imnunoglobuiln 

ESTs 

ESTs 

gb2f53d09.r1 Sosres retina N2b4HR Homo 
tibf STB0880 AcSvBted Th»!1s XX Homo sa 
NM.002362:Homo sapiens melanoma antroen. 
ESTs 

gb:RC3^T038&3012994)1 1-a09 6T0388 Homo 

ESTs 

Homo sapiens, done MGC:18257» mRNA. com 

ESTs 

C14000021:giI7242973|dbjlBAA92547.li (AB 
Target Exon 

gb:Human gene for cytochrome P'450c11. e 

ESTs 
ESTs 

C3001708*:ol|4758028|fBliNP.00436ai| co 

ESTs 

ESTs 

ESTs 

G protein-coupled receptor 7 
ann8xinA6 

gb:CMO.BT0365-061299-122^09 BT0365 Homo 

numb (Drosophlla) homotog-like 

sarcolipin 

gb:601111034F1 NIHJA6CJ6 Homosaplensc 
gb:OKFZp547F152j1 547 (synonym: Itfbri) 
CX001454:gi|8393794|rBf]NP_058681.1| myo 
K1AA0042 gene product 

gb:RC1-BT0720-28030aoi l-f08 BT0720 Homo 
ESTs. Weakly slmHar to ALU7^HUMAN ALU S 
zinc finger 1111 

gb:yg2ni1.r1 Scares Infant brain 1NIB H 
C1000118':gij9951913{rBflNP.05283Z1| pr 
U6 snRNArassoclaied Snvlika protein 
itbosomal protein S14 
Target Exon 

hypolheOcal protein MGC1 1082 

Homo sapiens cDNA FU10689 Its, done MT 

ESTs 
EST 

Homo sapiens cDNA: FU21897 (Is. dona H 

BCU^iltel 

EST 

transmentrane protein with EGF-ain and 
Homo sapiens cDNA: FU21930 lis. done H 
ESTs 
ESTs 

Hemp sapiens cDNA FU1 1537 lis, done HE 

DKFZP434B172 protein 

ESTs 

ESTs, Weakly slmilartoALU7JiUMAN AIOJ S 
ESTs 

gb2o86hOayS Stratagene ovsiisn cancer 
pregnancy-assodated plasma protein-E 
ESTs 

gb:QV3.HT0460-230200-101^ HT0460 Homo 

weel (S. pombe] homdog 

ESTs 

ESTs 

ESTs 

KlAA0460protdn 

ABO blood group (transCBrase A. dpha 1- 

gb7d22fD8.s1 Soares fetal Over spleen 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

hlstfilln 3 

Ob;EST34074 Embryo, 12 week D Hbmo sad 

ESTs 
ESTs 

Target Exon 
ESTs 

Target Exon 

gb:MRO^T002^081 199004<03 ST0020 Homo 
ESTs. Moderately similar to AUJ5^HUI\/!AN A 
gb3)o57d07^1 Na.CGAP.Lu5 Hbmo sapiens 
Target Exon 
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18 
Z8 
IB 
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ZB 
2.8 
ZB 
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Z7 
17 
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17 
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2.7 
2.7 
17 
2.7 
11 
17 
17 
17 
17 
17 
17 
2,7 
17 

17 

17 

17 

2.7 

2.7 

17 

17 

17 

17 

17 

17 

17 

17 

17 

2.7 

2.7 

2.7 

2.7 

17 

2.7 

17 

17 

17 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

2.6 

2.6 

IS 

IB 

IB 

IB 

IB 

IB 

IB 

IB 

IB 
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40S003 






Tsr^el Exon 


Z6 


4595S4 


A)910884 


Hs.346429 


ESTs 


Z6 


441597 


AW13S032 


Hs.203625 


ESTs 


Z6 


411280 


NS0617 


Hs.80506 


smdl RuctesrribonudeQpnjtein polypept 


Z6 


420509 


M83554 


H5.1314 


tumor necrosis factor receptor supertami 


Z6 


445060 


AA830811 


K5.282908 


ESTs 


^6 


436260 


BE172762 


Hs.292710 


ESTs. Weakly simaar to ALU5„KUMAN ALU S 


16 


428717 


T78001 


Hs.133546 


hypothetical protein aJ21 1 20 


Z6 


401716 


W03955 




C16000902:Sil403440]9blAAA7316&1| (M817 


16 


416628 




gb3a82tf04j1 Scares fetal Over spleen 


IS 


443864 


N37059 


Hs.36250 


ESTs. Weakly stmHar to 138022 hypolheti 


2.6 


440702 


AA904178 


Hs.148233 


ESTs 


Z6 


456310 


AA225522 




gb:nc25c06.r1 Na.CGAP.Pr1 Horro sapiens 


16 


451255 


AAQ20857 


Hs.90744 


ESTs 


16 


455737 


BE072248 




gb:QV4^T0538.271299O59-b12 BT0536 Homo 


16 


445326 


AI220072 


Hs.344672 


ESTs 


16 


408432 


AW195262 




gbacn67b05j(1 Na_CGAP.CML1 Homo sapiens 


16 


445874 


BE326671 


Hs.170058 


ESTs * 


16 


441053 


AA913819 


Hs.188025 


ESTs 


16 


455505 


AW970640 


Hs.309071 


ESTs 


16 


453491 


AU)40177 




gbdDKFZp434F0213.r1 434 (synonym: htes3) 


16 


45S04B 


AW852749 




Ob:PM1-CT0247-08010a0084i09 CT0247 Homo 


16 


456990 


NM^004504 


Hs.171545 


HIV-1 Rev bindinQ protein 


16 


444130 


Ai125263 


Hs.170410 


ESTs 


15 


422210 


BE269319 


Hb.171937 


steroid dshydrogenase-Iike 


IS 


424241 


AW995948 


Hs.182339 


Homo sapiens pynivSte dehydrogenase hina 


IS 


451606 


AAOIOTOI 


Hs.7945 


AIE-75 liMlng prot^ protein 


15 


400427 


AB044934 


Hs.287388 


histamine H4 receptor 


IS 


410443 


BE062906 


Hs.28338 


K1AA1546 protein 


15 


455210 


AW8665g9 




gb:QV4^024'21040O'181-b1O SN0024 Homo 


15 


429909 


AW9770S0 


Hs.184860 


CGI-203 protein 


15 


441191 


AI683930 


Hs.148816 


ESTs 


15 


413489 


BE144228 




gb:MR0^0165-140200-009^04 HT0165 Homo 


15 


448215 


N34740 


Hs.6658 


ESTs 


15 


419354 


M62839 


HS.12S2 


apolipoprotein H {beta-2-^iycoproteb 1] 


15 


447610 


AW296286 


Hs.255534 


ESTs 


15 


450724 


R55428 




gb:yI79b05.r1 Scares breast 2NbKBst Homo 


15 


414523 


AU076633 


Hs.76353 


serine (or cysteine) proteinase tnhibito 


15 


414776 


AA155S98 


HS212839 


hypothetcal protein FU14195; KIAA1714 


15 


447730 


A14212S1 


Hs.114085 


Homo sapiens mRKA for KIAA1755 protein. 


15 


434152 


AI792665 


Hs.291190 


ESTs 


15 


412671 


AW977734 


H3.37931 


gb:EST389963 MAGE resequences, MAGO Homo 


15 


456401 


W28146 




gb:43f1 1 Human retina cONA randomly prim 


15 


404678 






Target Exon 


IS 


408520 


AA225063 


Hs.161614 


ESTs 


15 


411332 


AW837212 




gb:QV2.LT0038-140300^081-c01 LT0038Homo 


15 


448920 


AW408009 


Hs.22580 


atkylglyoerone phosphate synthase 


15 


429906 


AL0B0137 


Hs.193743 


ESTs 


15 


433712 


AF090887 


Hs.306562 


Homo sapiens done HQ0OB5 


15 


430353 


BE539951 


HS.30899& 


Homo sapiens, done iMAGE'^7D73, mRNA. 


15 


446224 


AW450551 


Hs.13308 


ESTs 


15 



TABI^63B 



Pkey: tjnique Eos probeset Menfitier number 
CAT number Gene cluster number 



Accession: 


Genbartkaoces 


ision numbers 


Pkey 


.CAT Number 


Accession 


408229 


1048462.1 


AW175091 H24234 


408432 


1058667J 


AW195262 R27668AW811262 


409679 


114787J 


BE:S0$21 AA076837 BE249870 AA984291 AWS02442 AW501551 AI221491 AA194239 D63045 AA193426 AA773243 AA193293 


410483 


1204995.1 


BE163567 88)73689 BE073747 BE073760 6E073739 BE073748 BE163495 AW7&0176 BE163491 BE073763 BE073671 


411320 


1238624.1 


AW836646AW836580AW836610AW836636AW835603AW836632 


411332 


1239102.1 


AW837212 AW837408 AW837285 AW637380 AW837213 AW837411 AWB37418 


411356 


1240273.1 


H45377 H21137AW838640 


411426 


1245515J 


BE141714 AWB45993 AW845989 


411829 


1260309 1 


AW865749 8E179419 BE179492 


411944 


1266482.1 


AW8771 39 AW8771 35 AW877018 AW991835 AW877128 AW877108 AW877017 AW877107 


413155 


1351148J 


BE067952 6E067945 BE067942 BE067943 BE067949 BE067954 BE067944 BE067953 BEQ87956 BE067946 


413381 


1365950.1 


BE090690 BE090688 BE090881 BE090693 BE090875 


413489 


1373392.1 


BE144228BE144291 


415054 


151827J 


Ar733907 AA1S9708 Ai732614 


415098 


1522174 1 


059687 [)59694 D59656 D59589 


415131 


1523680.1 


D61 119 081508 D81734 


415386 


1535560 1 


Z43087 F07410 H15506 H54108 F^33 H98000 


415981 


1564242.1 


R35694H12035f^12 


416628 


1604848 1 


W03955 H82332 H59247 H72466 


416935 


163179.1 


AA190712 AA190665 AA252564 


419386 


184356.1 


AA236667 AA237066 AA354236 AW957759 H08961 


419896 


1888662.1 


Z99362 Z99363 


420n8 


196389.1 


AW970512 AA280251 AI652287 8E466438 AI650725 AA551654 AA281574 AW571481 
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421342 
421813 
422731 
423741 
423772 



423871 
426481 
428132 
431847 
434407 
435383 
436190 
438535 
441420 
442472 
445797 
447518 
447500 
448516 
449923 
450724 
451004 
452351 
453412 
4S3491 
453752 
453826 
454445 
454549 
455048 
455210 
455649 
455737 
455791 
455887 
456075 
456310 
456401 



201427J 
207654_1 
220507^1 
231582-1 
23188 1 



232749.1 

267878 1 

287430.1 

338402.1 

385744J 

405360^1 

41555.1 

45946.1 

51674aLl 

543371.1 

650943,1 

724787.1 

728288J 

766241.1 

81926.1 

844585J 

B5453J 

91233.1 

986264l1 

969172..1 

979899.1 

982669.1 

1204468.1 

1223789.1 

1250599.1 

1260650 1 

1348708 1 

1353892 1 

1365954 1 

1380836J 

1476756.1 

177089.1 

1844649.2 



AA504749AA287498 
BE04825S AA313083 AA298419 
AL138411AL138412AA315860 
AA330362 AW96252S H87796 

AA306637 NM.007241 AF156102 BE388339 66614027 AW935016AA31 5261 AW376418 AW304951 At859820 AA367436 AW516957 8E614519 
A)453152A!4S3149AI453139AW166378AI139491AI538368AW486227Ai680027AWD90513AA662830F3^ 

CX}2215A16525 
AA331906AA332484 

AW963941 AW963944 AA379825 AA379564 

AA421765 AA456076 AI290275 AA455579 

AI791314AI791434AA516511 

AW815333AW815409AA632563 

R61083 R1 3743 AA679174 AA679193 Z42903 

AK0010S9AA633055 

L09078 L03145 L09094 L09098 L03165 109102 
AA932B72W28068W28643 T96110 
AW806859AW808852AF049S82 
AI253414AI366014 R34822 
T80061 AI382804 
A)420990A1399725AI401757 

AW89aS95 AW898588 AW8g8590 AW898663 AW898592 AI525093 

BE258051 R45758AA004732BE255126 

R5S428 AI820704 Ai732283 R54383 

AA044967 H86327 AA013079AA058776BE242713AA019987 

AA025647R45716AW753786 

AJ0032gOAJ003288AW276947 

AL040177 

AL120800BE378580 
AL138129AL138179BE064231 
AW749432 AW749434 R47332 

AW806910 AW866461 AW866396 AW866373 AW86661 1 AW866616 

AW852749 AW8S2755 AW852620 

AW866599 AW866294 AW666468 AWB66467 

6E065051 BE155165 BE064764 BE155231 BE064648 BE054671 6E064636 

BE072246 BE072229 8E07222S BE072210 BE072221 BE072256 BE072211 BE072242 

BH090689 6E090685 BE0906g7 BE090880 BEQ90691 BE09Q6g6 BE090698 BE090686 

6E154173 BE154098BE154096 

N73442 R98100BE410380 

AA225522 AA225465 Ata2097g AW973965 A1791935 AAS58735 
W28146 W28187 



TABLE 63C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 dtgil numbers in this cotumn are Genbanit idenOfler (GI) numbeis. Dunham I. et at' refers to the pubtcation entiOed The DMA sequence of 

human chromosome 7Z' Dunham I et a)., NaUre (1999) 402:489^95. 
Strand: Indicates DMA strand from which axons were predicted. 
NLPOsitlon: Indicates nucieoBde positions of predicted exons. 



Pkey 


Ref 


Strand 


Ntjxisition 


400461 . 


9929654 


Plus 


32727-32846.32929^1 


400499 


9796071 


Minus 


14849S.148806 


400749 


7331445 


Minus 


9162-9293 


400831 


8576271 


Minus 


56502-57407 


401278 


9799936 


Plus 


98428-98573 


401411 


7799787 


Minus 


144144-144329 


401656 


9100664 


Minus 


1-382 


401716 


6715703 


Plus 


174722-174911 


401884 


8140731 


Minus 


89182-90053 


401896 


8569194 


Plus 


115129-115294 


401917 


9502466 


Plus 


25054-25229 


402422 


9796344 


Minus 


32843^08 


402460 


9796884 


Minus 


108901-109254.1 10246-1 1 0581,1 13613-1 13960 


402490 


9797648 


Rus 


149982-150929 


402706 


8894426 


Minus 


148640-148805 


402749 


9212740 


Minus 


68787-68882,76602-76768 


402797 


3421043 


Minus 


15758-15930 


402817 


6822166 


Minus 


48611-49012 


402844 


9369286 


Pius 


54958-55313 


403451 


9838240 


Rus 


77382-78300 


403612 


8469060 


Minus 


94723-94859 


403830 


8569999 


Minus 


13909-14466.15251-15760.16898-17431.41742-42440 


403649 


8705159 


Minus 


27141-27247 


404260 


9366879 


Rus 


51396^1513 


404267 


9581792 


Minus 


12209-12313,18241-18397 


404606 


9212936 


Mnus 


22310-23269 


404660 


9797068 


Plus 


168215-168916 


404678 


9797204 


Plus 


115196-115448 


404979 


4160139 


Minus 


8776^88217 


404984 


6939882 


Rus 


87221-87505 


405001 


6015406 


Minus 


104646-104819 


405150 


9929756 


Plus 


12647S-126773 
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405152 


9965561 


Minus 


137682-137969 


405258 


7329310 


Rus 


129330-130076 


405422 


7243869 


Minus 


101938-102079.102261-102443,102896-103202 


405564 


2114222 


Minus 


16766-17344 


405678 


4079670 


Plus 


15182M52D27 


405735 


9931101 


Minus 


29854-29976 


405770 


2735037 


Plus 


61057-62075 


406003 


8247800 


Plus 


4207942516 


406085 


9123888 


Plus 


18665-18843 


406177 


7279760 


Minus 


18930-19148 


406337 


9213455 


Plus 


90117-90337 


406398 


9256276 


K£mis 


118691-118959 


406600 


6248616 


Minus 


36296-36610 



Table 64A Dsts about 703 genes up^ulatsd In 8PH compared to nonnal adult tissues. These were selected frofn 59680 probesets on the Aflymetiix/Eos Hu03 GeneChIp array 
such lhat the ralb of 'average* 6PH to 'average' norma) adult tissues was greater than or equal to 3.0. The 'average' BPH level was set to Ihe 7S^ percentile amongst BPH tissues. 
The "average" nomna) adult tissue level was set to the 85>^ penasniae amongst non-malignant Ussues. In order to remove gene-specUic background levels of non-specific hybridizaton, 
the 10"* peroenfite vsltue arnongst the fton^nalignant CBSUes was subtF^ 

TABLE 64A: ABOUT 703 GENES UP^REGULATED IN BENIGN PROSTATIC HYPERPUSIA COMPARED TO NORMAL ADULT TISSUES 

Pkey: Unique Eos probeset Idenfifier number 

ExAccn: Exemplar Accession nunter, Genbank accesskm number 

UnlgenelD: Unlgene number 

Unigene Title: Unigene ger>e t'tls 



R1; 


Ratio of BPH (issue to normal adult body tissue 




Pkey 


ExAccfl 


Unigene ID 


Unigene Title 


R1 


420154 


AI093155 


Hs.95420 


JM27 piDtein 


49.6 


426747 


AA5%210 


Hs.171995 


kafltkrcln 3i (prostde spocHic antigen 


47.2 


419526 


AI621895 


Hs.1 93481 


ESTs 


43.6 


432441 


AW29242S 


Hs.1 63484 


ESTs 


42.7 


431616 


AA508552 


Hs.1 95839 


ESTs, Weakly sirnlla' to 138022 hypothed 


30.2 


407202 


N58172 


Hs.1 09370 


ESTs 


26.1 


432101 


AI916950 


Hs.123642 


EphA3 


25.8 


400299 


X07730 


Hs.171995 


kaUBcrein 3, (prostate specific anl^n 


25.5 


425076 


AA506324 


Hs.1 852 


acid phosphatas8i prostate 


24.6 


414569 


AF109298 


Hs 118258 


pmstate cancer associated protein 1 


24.4 


410929 


H47233 


Hs 30643 


ESTs 


21.1 


400287 




Hs 1813^ 


kainicrein ? oroslafic 


20.3 


446057 




Hs 14Q3SA 


ESTs, Weakly similar to A46010 X-linked 


19.6 


423073 


QCOS9Q99 


Hs 123119 


MAD f molherB aa^nst decaoBidaoleok^ Dr 


18.6 


415989 


A1267700 




ESTs 


17.8 


428336 


AAS03115 


H3.183752 


nucroseminopmteln. beta- 


17.3 


450693 


AW450461 


Hs.203965 


ESTs 


16.7 


400298 


AA03227g 


Hs.61635 


six transmembrane epithelial antigen of 


16.7 


407168 


R45175 


Hs.117183 


ESTs 


15.5 


408369 


R38438 


Hs.182575 


sdlute canier family 15 (H??7 transport 


15.5 


454119 


BE549773 


Hs.40510 


uncoupSng protein 4 


14.5 


428819 


AL135623 


Hs.193914 


KIAA0575 gene product 


14.5 


400302 


N480S6 


Hs.1915 


folate hydrolase (proslat&^pecilic memb 


14.4 


400297 


AI127076 


Hs.306201 


hypoOwficd pratebi DKFZp66401 278 


14^ 


418432 


M141S6 


Hs.85112 


bisulkvfike growth factor 1 (somatomedi 


14.2 


432674 


AA641092 


Hs.257339 


ESTs, Weakly similar to 138022 hypotheti 


14.2 


433444 


AWg75324 


Hs.129816 


ESTs 


13.8 


432240 


AI694767 


Hs.129179 


Homo sapiens cDNA iaJ13581 fis, cbne PL 


13.5 


428342 


AI739168 




Homo sapiens CDNA RJ13458 fis, ctone PL 




401424 






NM-001172:Homo sapiens arginase, type II 


12.7 


432435 


BE218886 


Hs.282070 


ESTs 


1^5 


448999 


AF179274 


Hs.22791 


transmembrane protein with EGF-tike and 


1Z3 


446100 


AW967109 


Hs.13804 


hypotheiicd protein dJ462023.2 


12.0 


425154 


NK1.001651 


Hs.154850 


coOagen, type IX, alpha 1 


11.9 


407275 


AI364186 




gb:qw34h07j(1 NCt_CGAP_Ut4 Homo saplere 


11.8 


452340 


NhiL002202 


HS.S0S 


ISL1 transcriplion factor. UM/homeodoma 


11.7 


432473 


A)202703 


Hs.152414 


ESTs 


11.3 


410330 


AW023630 


Hs.159425 


ESTs 


11.2 


431579 


AW971082 


Hs.222886 


ESTs. Weakly simitar to TRHY JIUMAN TRICH 


11.2 


452792 


AB037765 


Hs.30652 


KIAA1344 protein 


11.2 


418848 


AI820961 


Hs.193465 


ESTs 


10.9 


400292 


AA250737 


Hs.72472 


BMP-RIB 


10.9 


433847 


AA603367 


Hs.222294 


ESTs 


10.8 


453160 


At263307 


Hs.239884 


H2B histone family, merriber L 


10.8 


409262 


AK000831 


Hs.52256 


hypothelk»I pratoin FLJ20624 


10.6 


431474 


AL133990 


Hs,190842 


CEGP1 protein 


10.3 


429220 


AW207208 




ESTs 


10.3 


428134 


AA421773 


Hs.161008 


ESTs 


10.2 


408622 


AA056060 


Hs.202577 


Homo sapiens cONA aJ12165 lis. ckme MA 


10.1 


456497 


AWg679S6 


Hs.123648 


ESTs, Weakly similar to AF108460 1 ubinu 


as 


434792 


AA6492S3 


Hs.132458 


ESTs 


9.7 


433466 


AAS08353 


HS.10S314 


relaxin 1 (HI) 


9.5 


439176 


AI446444 


Hs.190394 


ESTs. Weakly similar to B28096 llne-1 pr 


9.5 


428398 


AI249386 


HS.98S58 


ESTs 


9.4 
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4^290 


AT W^HAm 


Urn 40fl7RA 


reuionioiiwiii, neavy pnypepood (jeuwDi 


9.3 




407709 




Hs.23023 


CO IS 


9.3 




415293 


R49462 


rVt, iVODt I 


ESTs 


O 1 

9.1 


5 


429918 




ns. 1 19JO9 




9.1 


440260 


A)972887 


Hs7130 


nviinn l\/ 


9.1 




453096 




ns.iiJcO 


CCTs 
CalB 


9.1 




410467 


AF102546 


na.oJ9ji 


ddchshund (DrosophOs) homoloQ 


6.9 




456516 


BE172704 


Hs22274S 


KlAAlfilO nmlnn 


0.0 


10 


430722 


AW968543 


Hs 203270 


FSTr Wftskhrfihnnartn AI 111 HI IMAM All IQ 


0.O 


456072 




n9.«r i9cO 


nonxi sapiens cuna: rLf^^f o9 us, dons K 


8.6 




453006 




Ue VL1171 

nStOiMif 1 


CCTe 
CO IS 


8.7 




418339 






CO 1 5, Mooeraray siRwar 10 orUN^nUMAN 0 


8,7 




420424 


nOUOOUJO 


Hs.97594 


mAA19in nmlafn 
lUnMl^lU pfOlcul 


8.7 


15 


450642 


R39773 


Hs.7130 


mtvna l\/ 

MlpUiD IV 


0./ 


434036 




Lie 1Q77H 

ns. i9f f oj 


nypouioDca proiein MuLuco^ii 


8.7 




428927 


AA441837 


HS.902S0 


ESTs 


tl.O 




418829 


AA516531 


UeCCQQQ 


ni\ nomBODux ii/rosopnua^ lanuiy o, a 


8.5 




420345 




Ue 9^971 
nS.£9£9 1 


CO 15 


8.5 


20 




nVVO lOUOO 


ns.ioi£Di 


CCTe 

bolS 


8.4 


419743 


AW408762 




HrtfiiA enraom rlnno 9AA1(\ mDMA enniiafiMs 
nUlHl otipions uOna £,'n 10 fimnn ScI|UBI10B 


0.4 




428728 




Re iqi^i 


hypoOisticsl pfotoin 


0.0 




417169 


R13550 


Ue 94fi77'* 


CO IS 






453387 




Ue 9t;9anq 


CO IS 


8.2 


25 


434656 


API SI in.*) 

/^T 191 lUJ 


Uq ii99(;q 


T ccQ rBCfiptor Qun lUk locus 


8.1 


4179SB 


AA767362 


Hs. 193417 


ESTs 










ns,ii:i9i9 


Honx) sapiens dons 2^1 placfinta expres 


7.9 




444609 


AWS71fiffl 




CO 18 


7 a 




424099 


AF0715n9 
nriJl itMc 


Ue IWUA 

ns. 1 09000 


ATD>JTwtStwi 4*ve^a eiik-famllu P fPCTD 

A 1 r-oinainQ cassouB, suiviamiij l |Ur i K 


7.8 


30 


433923 


AI823453 


Hs 146625 


ESTs 


7 7 

f.f 


437587 




Ue 79^c 
ns.f a0c9 


nuiiMii UNA Sequence nuni cione Kri'ior ji i 


7.7 




441690 


R81733 


Hs 33106 


ESTs 


7 ft 
r.D 




453370 




nst. 1 090vo 


Ai r*Miwing vaaSouBt suiricaiiiiy V \wr i n 


7 R 




441247 


AW11B6B1 


Hs 128051 


Hofno sflptens Ihymte stromsl ^ffnpfiopoisfi 


7 <t 


35 


439444 




U. CM7P 


CCTe tWaaMu elmilw In nUVyi kutwrfhalt 

cois, wsoKiysifnuai 10 MiHizznypotneii 


IJS 


431448 




nSiOUOcui 


iiTPuuiBiicai proiein uiv^^poo'tuifro 


1 A 




413597 


AVU3Q28&S 


Hs 1171Ra 
no. 1 1 r 1 Ov 


ESTs 


7 A 




404592 






NM 029739**Hnim sariAns F^ iihtmrfdn Rn 


1.0 




409557 




n9.& III 9w 


ESTs 


7 ^ 


40 


420352 


6E2S8835 




fib-fi01117374F1 NIH MGC 16 Hnrio snt^M r 


7 ^ 
r.J 


438231 


r\VV99')009 


ns. l33D0a 


CCTb 
COIS 


7.3 




440529 




n9.1D4rO 


nOtTK) SSptenS CUNA. r LJ21 f IB !», 01008 U 


7.1 








Ue QJCfil^ 


CCTe 


7.1 




417332 


AUt/Q79717 


Us Tflfuim 


kwfiAlkat^^ nmlain CI 191 C41 

nypouiBDca pniiBin ruzioii 


7.1 


45 






Ue ^6797 

n9.oorc/ 




7.1 


449249 


T52285 


Ue lO^IIC 

ns. 190 1 19 


noiuo sapiens nwiA rji iuaai r04 piQuBin, 


7 ft 
f.U 




430187 


Anrqqqnq 


Ue i cQqaq 
ns. 190909 


LOIS 


7.0 




434217 




Ue TITAQ 

ns.<oo<^9 


CCTe 
CO IS 


7.0 




431217 


MM 01^97 


He 9SnA^ 

ns><9uoou 


Rho GTPsse sctivatinQ pfoieln 6 


6.9 


50 


430188 


AL049242 


H>! 7347Q4 

n9.AO^f 9^ 






415766 


AW41Q1% 




hvnnfhARnil fwnlRin n l1Y7ft9 


0.9 








Ue 


thyroid leceptor hteracUng protein 15 


6.8 




426334 


A1^7^7A 


Ue iniciio 
r1S.OU00O£ 


CCTe 
COlS 


6.6 




458509 




Ue OQOqnR 

ns>£0<9uo 


ESTs 


0.0 


55 


449821 


Alfi71lAl 


Ue 911199 

ris.ziii££ 


CCTe 

cO IS 


6.8 


407118 


AA1<%fi7qn 


V4q 9fi9n'?fi 
nsucDcuoo 


CO IS, weany sunUor lo cGU_nUMArl uNU 


0.0 




443250 


AI041530 


Hs 132107 


ESTs 


ft7 
0./ 




428330 


1 99'i94 
Uc£9«4 


U« 99tt 


niatrix metaltoproleinase 7 (matrilystn, 


5.7 




408197 




nsh luii iv 


CSTn WfKitchr cimllnr In A^AniH yjlntmri 

CO 1 8, wodiuy simuor uj a'hju iu A-iinK60 


A A 


60 


454457 


AW753456 




nh'0\/7.rTfi9fil.9fiinqQ-ni1ji11 fTIWRi Hnmn 


0.0 


1 10^ 




Ue OOTMI 


nofDo sapiens cdna puiazd? ns, dono PL 


6.5 




429433 


AA4S9&QQ 


Hs OiViHR 
ns.£ I0900 


CO i s, weaioy somiar u iMnAiON pivisin 


R C 
0.0 








Ue 

rlS.^9t>09() 


nomo sapiens itikna; ouna Ui\r2p4o4c!0o2o (f 


6.5 




424433 


H04607 


H3.9218 


ESTs 


a5 


65 


450164 




Ue ttxm 
ns.o090i 


CCTe 
COIS 


6.5 






Us lft91 
na.oiMU 


nypoinBBCdi pfoien ru iizzu 


e e 
0.5 






nQ4i99 


nS.1M£0 


CCTe 

cols 


6.5 




425312 




Ue 1Ji;0l3) 
ns. 149990 


CCTe 

CO is 


e A 
0.4 






AC 1^1 70(1 
/V 101 fOO 


Ue ^AVTCa 

nS.o4ofOO 


Homo sapiens clone 25119 mRNA sequence 


6.4 


70 




AAOnOCTR 




ESTs 


6.4 


412350 


AiecQM: 

nl099dUQ 


Hs.73826 


pntebi tyrosine phosphetese. mmmoc^ 


6.3 




432600 


A1821085 




ymiOSOd l£*/9 n\A^MnrJ^O % wiiiw 8d}<mna 


6.3 




435375 


AI733810 


Hs.187832 


ESTs 


&3 




416370 


N90470 


Hs.203597 


ESTs. WeaMysimnar to 138022 hypotheti 


6.3 


75 


431359 


AW993522 


H5.292934 


ESTs 


6.2 


424736 
403667 


AF230877 


Hs.152701 


rnlcrotutKdB^tefacQnQ piot^ that ess 
Target Exon 


ai 

6.1 




424846 


AU077324 


H3.1832 


neuropepUds Y 


6.1 




439569 


AW602166 


Hs.222399 


CEGP1 protein 


6.1 


80 


438396 


A1663487 


H5.152213 


wtng)ess4ype MMTV {ntegraSon sits (and 


&1 


448004 


AW451477 


HS.2574S6 


ESTs 


6.1 




431770 


BE221B80 


Hs.268555 


flixoribonudeasB 2 


ao 




453861 


AI026838 


Hs.30120 


ESTs. WtaUy similar to NUOJftJMAN NUCLE 


&0 




415621 


AI648602 


Hs.55468 


ESTs 


ao 
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436673 N23874 m.50477 RAB27A. member RAS oncogene family 6.0 

442481 N99828 ob2a32c04/1 Soaresfelalllver spleen 6.0 

432527 AW975028 Hs.102754 ESTs 5.9 

_ 447156 AW274731 Hs.157920 ESTs 5.9 

3 404003 Taig^Exon 5.9 

424853 BE549737 Ks.132967 Human EST done 122887 mariner transpose 5.9 

427726 A1359144 Hs.l43688 Homo sapiens cDNA: RJ23031 fis. done L 5.9 

433043 W57554 Hs.125019 lymphoid nudear protein (lAM) mRNA 5.9 

438138 R98299 Hs.177502 ESTs 5.9 

10 440129 AA865818 Hs.174938 ESTs.WeddyslinQarloS71B868te204ik 5.8 

434973 AW449285 Hs.313636 EST 5.8 

408527 AL1350ie Hs.33074 Homo sapiens, ctonelMAG£:380651 9, mFU^ 5.8 

459241 AA032276 Hs.99010 ESTs, Moderately similar to T14342NS01 5.8 

^- 434485 AI823511 Hs.118567 ESTs 5.8 

15 429716 R25685 Hs.211933 coIlaoen.typeXHI.alptia1 5.7 

443960 AI093577 Hs.255416 hypothetical protein FU21 966 5.7 

453200 AA033832 Hs^12433 ESTs 5.7 

428002 AA418703 gbzv98c03.s1 Soares.NhHMPu.S1 Homo sapi &7 

432956 AA6S0114 Hs.325198 ESTs 5.7 

20 448148 NM..016578 Hs.20509 HBVpXassodatedprotein^ 5.7 

420948 AB016898 Hs.10O469 myeldd/iymphoidwfnixed^ageleukero 5.7 

• 425810 AI923627 H5.31903 ESTs 5.6 

414312 AA155694 Hs.191080 ESTs 5.6 

404571 NM»015902*:Homo sapiens progesfin Induce 5.6 

25 424850 AA151057 hls.153498 dtfomosome 18 open reading frame 1 5.6 

439092 AA830149 gb:oc44l08.s1 NC1.CGAP.GCB1 Homo sapiens 5.6 

427078 AI676062 Hs.111902 ESTs 5.6 

422805 AA436989 Hs.1210l7 H2Ahlstone family, member A 5.6 

417511 AU)49176 Hs.82223 chordln^ike 5.6 

30 449625 NM.014253 odz (odd OzAen^. Drosophila) homolog 1 5.6 

433927 AI557019 HS.11&467 small nuclear protein PRAC 5.5 

441676 BE664206 Hs.49889 ESTs 5.5 

421513 X00949 Hs.105314 relaxinl(HI) 5.5 

432682 AI376400 Hs.159588 ESTs 5.5 

35 435021 AA922192 Hs.54709 ESTs 5.5 

454171 AW8S4632 flbK3V2-CT0261-201D994)11-f05CT0261 Homo 5.5 

426581 AB040956 Hs.135890 KIAA1523 protein 5.5 

439079 AF065937 H&38348 ESTs 5.6 

443635 AI080230 Hs.134214 ESTs 5.5 

40 400080 Eos Control 5.5 

410196 AI936442 Hs.59838 IvpottieQcsj protein RJ1080B 5.5 

436578 AI091435 Hs.134859 ESTs 5.5 

424051 AL110203 Hs.138411 Homo sapiens mRNA:cDNADKf^586J1 922 (f 5.4 

450325 AI935982 Hs.26289 ESTs 5.4 

45 421863 AI952677 Hs.108972 Homo sapiens mRNA;cONAOKFZp434P22d(fr 5.4 

432161 AK000400 Hs.341181 ESTs. W^simllarto envelope [Ksapi 5.4 

435072 AW592176 H$.116932 ESTs 5.4 

426647 AA830050 Hs.124344 ESTs 5.4 

441111 AI806867 Hs.126594 ESTs 5.4 

50 433087 AI720666 Hs.152520 ESTs 5.3 

450244 AA007534 Hs.125062 ESTs 5.3 

400294 N95796 Hs.278695 Homo sapiens prostelnmRNA.oomptBtecds 5^3 

415539 AI733881 Hs.72472 BMP^IB Si3 

437267 AW511443 Hs.258110 ESTs 5.3 

55 441916 AA993571 Hs.129075 ESTs 5.3 

452784 BE463857 Hs.151258 hypothetica) protein RJ 21 062 5.3 

446715 AI337735 Hs.173919 ESTs, Moderately similar to ZN91_HUMANZ 5.3 

452531 AA42g462 Hs.293946 ESTs. Weakly similar ID 138022 hypolhefi 5.3 

426503 AA380153 gb:EST93093SKin tumor I Homo sapiens cD 5.3 

60 423101 MB3941 Hs.123642 EphA3 &3 

445704 Ai493742 Hs.167700 ESTs, Moderately sImSar to 138022 hypot 5.3 

450613 Af739625 H$.203376 ESTs 5.3 

409079 W87707 H8.82065 interleukin 6 signal transducer (gp130, 5.2 

^. 415890 H06225 Ks.268712 ESTs 5.2 

65 454968 AW849046 gb:it3-CT0214'l5030OO85-H06CT0214Homo 5.2 

441054 AA913591 Hs.126480 ESTs 5.2 

408385 AF055634 Hs.44553 unc5(C.eleganshomolog}c 5.2 

437752 AA767376 H5.291631 ESTs, Moderate^ similar to S6S6578^ha 5.2 

448072 AI459306 H5.24908 ESTs &2 

70 419536 AA603305 gbmp12d11.s1 NO.CGAPJ>l3Hon»sapfen$ S.2 

421040 AA715028 Hs,135280 ESTs 5.2 

446495 D60923 Hs.153460 ESTs 5.2 

415263 AAg48033 Hs.130853 ESTs 5.2 

426748 AL048409 Hs.97177 ESTs, Wesridysinto to ALU1.HUMAN ALUS 5.2 

75 436338 W92147 Hs.1 18394 ESTs 5.2 

445238 AA883971 Hs.187506 ESTs 5.1 

450362 AA397658 Hs.60257 Homo sapiens cONAFU13598 lis. done PL 5.1 

457211 AW972S65 Hs.32399 ESTs, Weafc^ similar to S51797vasodil8t 5.1 

450582 AI339732 &rid)RNA sequence Unding factor 1 5.1 

SO 438447 AI082883 Hs.30732 hypotheScal protein FU13409:KIAA1711 5.1 

430598 AK001764 Hs.247112 hypothetical piOteln PU 10902 5.0 

440995 T57773 Hs.10263 ESTs 5.0 

41Z775 AA709046 Ks.27552 Homo8apfeflsmRNA;cONADKFZp586N2424 (f 5.0 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



42S2S9 


ALJ049280 


Hs.155397 


418334 


AA319233 


HS5521 


435401 


R44477 


Hs.10056 


418564 


AA631143 


Hs.278695 


407198 


H91679 




403696 






452891 


N75582 


Hs.212875 


457374 


AA493862 




430261 


AA305127 


Hsi3722S 


408829 


N^.006042 


Hs.48384 


448041 


AW292769 


Hs^28 


451244 


AW008798 


Hs.343877 


443160 


R15875 


Hs.258576 


414422 


AA147224 


Hs^49195 


418866 


T65754 




420851 


AA281062 


Hs.29493 


428715 


AW293716 


Hs.53126 


435937 


AA830893 


Hs.119769 


435136 


R27299 


Hs.10172 


438132 


AA907076 


H&122060 


447058 


AI939456 


H8.160870 


419187 


AA234852 


Hs.44693 


433523 


H29882 




420871 


AA702972 


Hs.65300 


450317 


AI692689 




453843 


D25215 


Hs.35804 


430172 


AA468591 


Hs.161889 


431657 


A)345227 


Hs.105448 


452843 


A1796769 


HS.20B320 


458229 


AI92g602 


Hs.177 


420954 


AA282074 


Hs.237323 


452031 


AA741314 


Hs.865 


440354 


AA889386 


Hs.125468 


427212 


AW293849 


H8.58279 


443361 


AI792628 


Hs,133273 


409731 


AA125985 


H8.56145 


420026 


AI831190 


Hs.166676 


450330 


AW500775 


Hs.24817 


415788 


AW626686 


HSJ8851 


417173 


U61397 


Hs.81424 


420S31 


AFD44ig7 


Hs.100431 


434406 


AI031771 


Hs.132586 


454024 


AA9935Z7 


Hs.293907 


447805 


AW627932 


Hs.302421 


438875 


AA827640 


Hs.169059 


419511 


AM29750 


HS.7S113 


445242 


BE156478 


Hs.21108 


428218 


AA424266 


Hs.123642 


452277 


AL049013 


Hs.28783 


418835 


AI655499 


H3.161712 


417601 


NM.014735 


HS.B2292 


430897 


AA484207 


Hs.211867 


430701 


AI760833 


Hs^3971 


437252 


AI433833 


Hs.164159 


407204 


R41933 


Hs.140237 


420133 


AA426117 


Hs.155543 


431740 


N7M50 


Hs.183412 


439752 


T78968 


Hs.14411 


447816 


NM_007233 


HsJ74329 


431060 


AF039307 


Hs.249171 


445372 


N36417 


Hs.144928 


452055 


AI377431 


Hs.141693 


416602 


NM_006159 


Hs.79389 


418019 


R68911 


Hs.176275 


423352 


AA324808 


Hs.193576 


438042 


AWa5971 


Hs.180810 


452978 


AA029994 


HS.61S23 


412843 


AW971239 


Hs.136433 


417412 


X16896 


Hs.82112 


445210 


H09323 


Hs^7133 


416706 


AA314576 


Hs^88945 


420608 


BE548277 


Hs.103104 


428249 


AA130914 


H&183291 


442242 


AV647908 


Hs^24 


444030 


AW0212S4 


H9.135055 


435655 


AW10S663 


Hs^947 


400533 






435285 


AW272603 


HSJ266134 


450216 


AA873345 


Hs.60226 


432960 


AW150945 


HS.14475B 


459527 


AW977556 


K5.291735 


426413 


AA377823 




430535 


AW968465 





Homo sapiens mRNA:cONAOKF2p564K143(Ir 5.0 

ESTs 5.0 

hypoUietica) protein FU 14621 5.0 

Homo sapiens pnBteintnRNA.oonipi8tecds 5.0 

gb:yv04a07^1 Soares fetal liver spleen 5.0 

C4001 100*:gq5852342|gblAAO54015.1| (AFO 5.0 

ESTs. Weakly simOar to DYH9_HUMAN OUA 5.0 

gb:nh05d12£l NQ.CGAP_Tliy1 Homo sapiens 5.0 

hypothetical protebi HT023 5,0 

h^)3ran sulfate (gluoosamine) S-O^utfbl 5.0 

ESTs 5.0 

hypotheScal protein FU20039 5.0 

daudin 12 4.9 

KomeobQKA13 4.9 

gb^ci 1c07.s1 Stralagene lung (937210} H 4.9 

hypotlwiical protein FU20142 4.9 

ESTs 4.9 

ESTs 4.B 

ESTs 4.8 

ESTs 4.8 

ESTs 4.8 

ESTs 4.8 

ESTs 4.8 

ESTs 4.8 

gb»fd86g05j(1 Na_CGAP^Lu24 Homo sapiens 4.8 

tied domain and RLD 3 4.8 

ESTs 4.8 

ESTs. Weakly simiiar to B34087 iiypolheti 4.7 

ESTs 47 

phospiiatidylinosiiot glycan, class H 4.7 

N<acetylgIiicosaminefhosphalenHJtase 4.7 

RAP1 A, member of RAS oncogene family 4.7 

ESTs 4.7 

ESTs. Weakly similar to ALU7.KUMAN ALU S 4.7 

ESTs 4.6 

thymosin, beta^ MenQfied In neuroblast 4.6 

ESTs 4.6 

hypothetical prolefei FU20136 4.6 

KiAA0217 protein ' 4.6 

ubiquitin-like 1 (sentiln) 4.6 

small inducible cytokine Bsub!amliy{Cy 4.6 

ESTs 4.6 

hypothetical protein FU23403 4.6 

gemln4 4.6 

ESTs 4.6 

general transcrfptkjn factor IIIA 4.6 

ESTs, Weakly similar to ALU1_HUMAN ALU S 4.6 

EphA3 4.6 

K1AA1223 protein 4.6 

ESTs 4.5 

K1AA0215 gene product 4.5 

ESTs 4.5 

ESTs 4^ 

ESTs. Weakly similar to ALU1 HUMAN ALU S 4^ 

ESTs, Weakly similar to ALU1_HUMAN ALU S 4.5 

ESTs 4.5 

ESTs, Moderately sNiar b AF1 16721 67 4.5 

ESTs 4^ 

TP53 target gene 1 4.5 

homeolX)xA11 4.5 

ESTs 4.5 

hypothetical protein MGC10858 4.5 

nel(chickenHB(e2 4.4 

ESTs 4.4 

ESTs 4.4 

ESTs 4,4 

ESTs 4.4 

ESTs 4,4 

interieukin 1 receptor, type I 4.4 

ESTs 4.4 

hypothetical protein FU13448 4.4 

ESTs 4.4 

zinc finger protein 266 4.4 

Hbmo sapiens cDNA: FLJ23285 its, dorte H 4.4 

ESTs 4.4 

HSPC069 protein 4.4 

ENSFW000209376*:PRED85 protein (Fragmen 4.4 

ESTs 4.4 

Homo sapiens, done IMAG£:3621638. mRNA, 4.3 

ESTs 4.3 

ESTs, Weakly sin^ to 178885 serineAh 4.3 

gb:EST90805 Synovial sarooma Homo sapien 4.3 

gb:EST380561 MAGE resequences, MAGJ Homo 4^ 
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416312 W02640 Hs.16247 ESTs.We^ysindarto2004339AGttfonios 4.3 

454159 AW96e065 Hs.44143 polybromol 4.3 

435176 AA744875 H3,189413 ESTs 4.3 

451061 AW291487 Hs.213658 ESTs, W^sImOar to KIAAI 357 protein 4^ 

5 407910 AA650274 Ks.41236 fibtttnectin leudne rich transmemtrane p 4.3 

420111 AA255652 gb:zs21h11.r1 Na.0GAP.GC61 Homo sapiens 4.3 

420757 X76592 Hs.999l5 aridiDgenreceplorCdjhydrotestosteioner 4.3 

459045 N59101 Hs.40730 ESTs 4.3 

405348 C7001654:gllWe980611dbj|BAB21849.1|(AB 4.3 

10 422906 U80773 Hs.121580 Human EST done 42944 mariner transposon 4.3 

449603 AI655662 Hs.19769a ESTs 4.3 

409705 M37762 Hs.56023 brairHierived neurotrophic factor 4.3 

423782 AI472209 Hs.323117 ESTs 4.3 

4328B7 AI926047 H5,162859 ESTs 4.3 

15 420905 AA521307 H&18G651 ESTs 4.2 

416814 AW192307 H5^2 dolid)yl^-Glc:M3n9Gk:KAc24>P-^^ 4.2 

427908 AA417272 H5.24122 ESTs 4.2 

432765 AJ003429 gb:AJ003429 Selected diromoGome 21 cONA 4.2 

444246 H932B1 Hs.10710 hypothetical protein HJ20417 4.2 

20 450597 AI701635 Hs.207077 ESTs 4.2 

434022 R18374 Hs.1179S6 ESTs 4.2 

427761 AA412205 Hs.140996 ESTs 4.2 

433209 ABO40907 Hs.278436 KIAA1474 protein 4.2 

^_ 413525 BE145699 ob:MR0-HT0208-221 299-204^1 OHT0208 Homo 4.2 

25 436679 At127483 Hs.120451 ESTs, WeaidysimBar to unnamed protein 4.2 

449655 AI02ig87 Hs.S9970 ESTs 4^ 

440774 AI420611 Hs.153934 ESTs A2 

427521 AW973352 Hs^OSSS ESTs 4.2 

440594 AW445167 Hs.126036 ESTs 4.2 

30 458912 AI911066 ESTs 4.2 

410790 AW803357 gb.U2-UI^«)079^300mA08Uti«)079Honio 4.1 

415245 NSgesO Hs.27252 ESTs 4.1 

435743 T66861 Hs.12962 ESTs 4.1 

442786 H50733 Hs.256261 ESTs. ModefalBiy6lmllartoALU8_HUMANA 4.1 

35 436714 AA7289S4 Hs.293399 ESTs 4,1 

423789 AK002084 Hs.132851 hypothetical protein FU1 1222 4.1 

408177 AI241733 Hs.43871 ESTs 4.1 

424200 AA337221 gb:EST41944Endom^ tumor Homo sapte 4.1 

430523 AW451385 Hs.161954 ESTs 4.1 

40 445206 AI350199 Hs.269990 ESTs 4.1 

452335 AW188944 Hs.61272 ESTs 4.1 

438431 AW207860 Hs.293116 ESTs 4.1 

449907 AA00482S Hs.103281 ESTs 4.1 

430487 087742 Hs.241552 K1AA0268 protein 4.1 

45 448152 AI741053 H3.170770 ESTs 4.1 

446515 H66441 Hs.13528 hypothetical protein FU 14054 4.1 

423242 AU)39402 Hs.125783 DEME-6 protein 4.1 

412715 NML000947 Hs.74519 prlmase. polypeptide 2A (581(D) 4.1 

400746 Target Exon 4.1 

50 413081 BE064415 gb:RC4^T031 1-241 199^1 2-b03BT0311 Homo 4.1 

446416 AV658299 Hs.163959 ESTs 4.1 

451640 AAig5601 Hs.26771 Human DNA sequence from done 747H23 on 4.1 

427203 AW629517 H3.244655 ESTs 4.0 

440840 AW629866 ESTs. Weaidy similar to S64054hypotheli 4.0 

55 442338 At761976 Hs.156080 ESTs 4.0 

446307 T50083 Hs.22247 ESTs 4.0 

455276 BE176479 gb:RC3-HT0585-16030aO22-bO9 KT0565 Homo 4.0 

415861 Z43123 Hs.144513 ESTs 4.0 
418259 AA215404 ESTs 4.0 

60 418818 AA226699 Hs.101307 Homo sapiens HUT11 pnMmlV^ partial 4.0 

432229 AW290976 Hs.143587 ESTs 4.0 

418310 AA814100 Hs.86693 ESTs 4.0 

416760 H85182 Hs.191327 ESTs. Highly similar to KIAA1 102 protein 4.0 

^. 419083 A1479560 H3.98613 Homo sapiens cDNAFU12292fi3. done MA 4.0 

65 421312 AA824627 Hs.291670 ESTs 4.0 

448131 Ai67S0&4 Hs.200481 ESTs 4.0 

439731 A1953135 Hs.45140 hypothetical protein FU 14084 4.0 

418243 W51873 Hs.171857 Homo sapiens testis protein mRN^ parfia 4.0 

_^ 434072 H70854 Hs.2B3059 Homo sapiens PRO1082mRNA,complelecds 4.0 

70 434844 AF157116 Hs.22350 hypothetical protein LOC55757 4.0 

452221 C21322 Hs.288057 hypothetical protein FU22242 4.0 

432697 AW97S050 Ms.293892 ESTs. Weakly stmOar to ALU4_HUMAN ALU S 4.0 

446354 AW449650 H3.346335 ESTs 4.0 

__ 447072 061594 Hs.17279 tyrosyiproleinsuUotransferase 1 4.0 

75 417333 AL157545 Hs.173179 bromodomain and PHD finger containing. 3 4.0 

441568 AI733322 Hs.127176 ESTs 4.0 

441736 AW292779 Hs.8182 ESTs 4.0 

457498 AI732230 H3.191737 ESTs 19 

„^ 433234 AB040928 Hs.65366 K1AA1495 protein 3l9 

80 434222 AF119886 Hs.283941 Homo sapiens PR02S91 mRNA. complete cds 3L9 

448568 AA149121 Hs.71947 ESTs 3l9 

415068 Z19448 HS.131B87 ESTs,WeddysiinilartoT24396hypott«ti 3.9 

424940 AA985308 HS.283902 ESTs 19 
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416288 


K51299 






420301 


AA767526 


Hs.22030 




42S066 


AA868555 


HS.178Z22 


5 


435878 


R08330 


Hs^1S2 


446862 


AV660697 


Hs.282700 




447530 


AW192063 


Hs.248665 




4S2619 


AW298597 


(^.61684 




401403 






10 


448779 


BE042877 


Hs.177135 


420533 


AI809510 


Hs.116971 




411084 


T18987 


Hs.125472 




419629 


AB02069S 


Hs.91662 




423453 


AW450737 


Hs.128791 


15 


434833 


AF156548 


Hs.192989 


455646 
411479 


BE064420 
AW848047 






419038 


AW134924 


H5.190325 




421129 


BE439899 


Hs.89271 


20 


424332 


AA338919 


Hs.101615 


441766 


R53790 


Hs.23294 




447033 


A)357412 


Hs.157601 




439306 


BE220199 






410352 


AW969725 


Ks.150444 


25 


407981 


AW672g39 


Hs.41694 


410252 
439560 


AW821182 
8ES6S647 


Hs.61418 
Hs.74899 




440450 


AI333129 


Hs.156147 




456611 


AI268407 


Hs.211458 


30 


419589 


AW973708 


Hs.201925 


431576 


M76665 


Hs.275215 




447280 


BK17907 


Hs.97635 




438379 


N23018 


Hs.171391 




416009 


Z43062 




35 


416534 


H69043 


Hs.224961 


423044 


AA320829 


H5.97266 




424701 


NM.005923 


Hs.151988 




433507 


A1B17336 


Hb,191791 




437718 


A1927288 


Hs.1 96779 


40 


419831 


AW448930 


Hs.5415 


424830 


AW270580 


Hs.189311 




42G981 


AL044675 


Hs.173081 




431447 


AA505138 


Hs.291341 




435932 


W03928 


Hs.114524 


45 


442447 


AA999723 


Hs.129607 


403242 








433908 


AW298141 


Hs.1 57975 




452323 


W44356 


Hs.292812 




412095 


At624707 


Hs.5921 


50 


418759 


AA227879 


Hs.167621 


422299 
452462 


AK000161 
BE173515 


Hs.114556 




423096 


AA732684 


Hs^78428 




454037 


AW998718 




55 


428055 


AA420564 
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412988 


RPfldRRRfl 

DCwtDOOU 






3.0 


452679 


242387 




UdnsrncnibrsnOi prostete sncbooBii biducsd 


3.0 


416642 


T96118 


Hs 226313 


ESTs 


3.0 


418946 


AI217097 




abnd43h07 x1 Soaes heart NbHH19W 


3.0 


426174 


AA547959 


HS.115B38 


ESTs 


3,0 


430459 


BE1 78539 


Hs 278634 


ESTs 


3.0 


439182 




Hs 91691 


hvnnfhplira) nmlalfi nKF7n7fi9n07fi 


3.0 


446258 


AI283476 


Hs^63478 


ESTs 


3.0 


448686 


AAlCpRCQ 
#VM90D39 




hynoniAfirjil twtiAt Fl MATAA 
nypouiBUiKU pOflein ru i^r it 


3.0 


453455 


AAQ635S3 


Ik 991931 


EST& WbaSdv sindar to JC1087 RNA heDc 


3,0 


414441 


AA234759 


Hs 139950 


ESTs 


3.0 


427302 


AA400540 


Hs13S282 


Homo saclens cDNA FLI11SS4 fis. done HE 


3.0 


437048 


AA743240 


rKltV 1 \f04m 


ESTs 


3,0 


450963 


AI864666 


H5.48832 


ESTs 


3.0 


432336 


MM fln97*iQ 


U«97i9R9 


pcotein kfaissOi bitsffston^ndticiblB dou 


3.0 


459535 








3.0 


448734 


K3CD ItUlU 


M« 19AA1fi 




3.0 


451468 




Hs 993663 




3.0 


407829 


AA04S(ffl4 


Hs 29725 


hvnothatic^ ivntein FL113197 


3.0 


420407 


AAfi 147^9 




lipwpOiySaCvaKlD'DpBCiliC roSpunSc 9*1) 


3,0 


426743 


nnOOOOOO 


Hq 9^<V199 


CO 19 




442326 






IIJ^HJUIOUWJI |flWlB(ll mUMl^Olf 


3.0 


449913 






CO 19 


o,v 


454096 








3.0 


437323 


AA371145 


Hs 184397 




3.0 


407137 


T97307 




QUayD9Jl1U9i9 1 wOSlcS vSvM IIVeT a|li6of1 


3.0 


450580 


H40087 




ESTs 


3.0 


429084 


AJ001443 


Hs.195614 


spOdng factor 3b, subunit 3. 130kD 


3lO 


418365 


AW014345 


H&161690 


ESTs 


3.0 


423784 


AK000039 


Hs.132826 


Homo sapiens cDNA FU1491 3 fis, done PL 


3.0 


435677 


AAe94142 


Hs.293726 


ESTs. Wbahfy sfanDar to TSGA RAT TESTIS 


3.0 


414951 


AW794931 


Hs.100861 


hypOtheOcal prot^ FIJ14600 


3.0 


418819 


AA228776 


Hs.191721 


ESTs 


aio 


428634 


AA811845 


H5.106290 


Kelch tnofif containing praldn 


10 


431869 


AA521136 


Hs.190176 


ESTs 


ao 


435008 


AF150262 


Hs.162898 


ESTs 


10 


448880 


AW205507 


Hs^2360 


ESTs. Highly sbnHar 6> 138587 reMni 


10 


451391 


AA017410 


HS.40S68 


ESTs 


10 


452959 


At933416 


H8.189874 


ESTs 


10 



TABLE 64B 



Pkey: IMtud Eos probeset IdenliSer nunAer 

CAT number. Gene duster number 

Accession: Genbsnk aocesston numbets 



Pkey CAT Number Accession 

410790 1221131 1 AW803357 AW803423 AW812233R06814 

410869 1225123 J AW808361 AW608404 AWB08368 AW808594 AW808654 AW808813 AW808551 AWB08676 AW808350 AW808406 AW808694 AW808934 
AW808829AW808385 AW608422AW808401 AW808409AW808760AW80B863AW80B521 AW808539 AW8066O9AW808472 AW608739 
AW808704 AW8085S6 AW808714 AW808420 AW8 

411438 1245660.1 AW846433AW846159AWB46377AW046528 



736 



wo 03/042661 



PCTAJS02/36810 



411479 1247077J 



412988 
413081 
413525 
415989 
41G009 
416288 
416882 

41B259 
418866 
418948 
419536 
420111 
420352 
423412 
424200 
426413 
426503 
426991 
428002 
428342 

429163 
429220 
430535 
432600 
432765 
433523 
434763 
43629S 
436326 



439092 
439306 
440640 

440947 
442481 
445432 
448044 
449343 



449570 



450317 
450580 



450582 
450687 
452462 
453682 
454037 
454096 
454171 
454457 
454860 
454968 
455276 
455646 
455710 
457374 
4SB912 



1342150J 
1348553.1 
1374635J 
156454.1 
1566379J 
1585983J 
162718J 

173388.1 
179788J 
180808.1 
185688J 
190755_1 
192979.1 
228001.1 
236595.1 
266650.1 
268283J 
27415.1 
285602.1 
290035.2 

300543.1 
301384.1 
319643.1 
350959.1 
353907J 
368873.1 
392847^1 
41733.1 
41795J 



437866 44433.2 



488554.1 
47088.1 
50357J 

505904.1 

543588.1 

63943.1 

747196.1 

80517J 



81018.1 
8113.1 

831956.1 
83929.1 



83933.1 

84327.1 

918580.1 

977454.1 

996287.1 

1007449.1 

1049240.1 

1207274.1 

1237732.1 

1247029.1 

1272M1J 

1348557.1 

1352368.1 

328758.1 

823104^1 



AW848047 AW848202AW848631 AW848142 AW848702 AW848121 AW848632AW848140 AW848571 AWB48009 AW848067 AW84806g 

AW848g05AW848214 

68)46660 BE046738 BE044958 

BE054415BE064430BE064448 

6E145699 BE145848 6E145849 BE145653 6E145927 BE145925 

AI267700 AI720344 AA191424 A1023543 A1469633 AA172056 AW958465AA1 72236 AW953397 AA3SS086 
Z43062R13213H14422 

HS1299H44619 H46391 RB8024 H51892T77744 

AI633044 AW016212 AW241143 AA769058 R43272 AW066958 AA2109ie AA293774 AI748815 AI763294 At333114 AI277384 At068297 AI468477 
AI824624 AW189606 Ai631751 Z40749 AI984673 A)671316 AA189024 AW235412 BE178426 R246n R40635 K05100 R40597 
AA215404 AI990909 BE464132 AW271459 N74332 AI262061 
T65754AA229857AA229658 

AI217Q97 AW886090 W38035 W38792 AA232835 AW938043 
AA603305 AA244095 AA244183 
AA2556S2 AA28091 1 AW967920 AA262684 

BE25883S AWg66316 AA256918 AW843305 R14744 Ai580386 BE071923 R36280 

AF109300AI299378AI202654 
AA337221 AA336756 AW966196 
AA377823 AW954494 AI022688 
AA3801 S3 AA380233 AW983529 

AK001536 AA191092 AW510354 AI554256 AL353968AA1 34266 
AMt8703 AA418711 BE071915 BE071920 BE071912 

AI739168 AA426249 AI199636 AW5051 98 AW977291 AA824583 AA883419 AA724079 AI015S24 A)377728 AW293882 A1928140 AA731438 

At092404A!085630AA731340 

AA884766AW974271 AA592975AA447312 

AW207206 AW341473 AA44ei95 AI951341 

AWg68485 AW968670 AA480922 6E350425 

AI821085AW973464AA554802AI821831 AA657438 AA640756 AA650339 
AJ003429AJ003367AA564825 

K29B82 AW665533AW149901 AI572917 AA5985QOAI686466AI336390AW864390AW864320 

AA648618AW974389H51771 

N73895AJ001872 

6E085236 BE085317X04236AAS77934 AA578392AAS0Z836 AAS95852AA57825BAW270791 AA507151 AA559152T57040BE503281 
AW593405 A1825755 At350499 AI655710 AI972281 AI654949 BE073961 6E073962 BE041399 AW750214 AA228488 BE074016 AI908706 
AW270601AW8732a2 

AA156781 AW293839 U52054 AA024963 AA778446 BE073977 AW444304 AW602S74 BE164040 BE164012 BE163972 8E163974 BE163992 
AA837481 AW468444 BE185091 AW468002 AA687333 AA811830 AA581806 A1866686 Ai572124 AA043777 AA040926 D20160 AI536733 
AA812489AW874142At47168 
AA830149 AW978407 M85983 AWS03637 
6E220199 W01813 AF086118N70760 BE221405 

AW629666 AW959831 AW205739 BE620243 AA412367 AW300025 AW051920 AI288591 AW236114 AI302652 AI038548 AAS34496 AI797207 

AA921877 

AA910403 A1B15593WS6381 AW1 62520 AI816550 
N93828 BE079B73 All 10738 AF074645 
AV653771 BE0B9370 

A1458682 H24240 R14537 R18426 AW857082 

A1151 41 8 W60401 AW631238 A1346938 AA862B55 W60310 N72501 H90060 BE150445 AW380821 AI540906 C04881 W03542 AA641764 H97053 

AVI^9353 AA521308 M001203 VV92828 A1207788 AA746655 R78710 VV24617 AA024805 C01747 AW173095 

H00789R76925AW1828 

AA001793AA001871 

NM_014253 AF100772 BE088769 A1022718 6E1 61779 AW863569 BE161640 AL039060 6E168542 AW296554 AA323193 AA235370 AW779760 
N4B574 AI375997 R45432 059344 AI203107 F07491 R35360 R25094 AI913631 AI498402 T61382 Ai016320 N45526 T61415 AA331486 
Al6926e9R14223R1B395 

N40087 H12925 AA460779 AA096372 6E1 60847 AW816890 AW81 6893 AW816577 AW81 6891 AW816882 AW816B68 AW8168B9 AW616940 
AyV816892AVV816941AW816578AA029183T83320H86B50AA010295BE160823 Z25353AA15088^ 
AA164518AA730973W00417 W65303 

AI339732 AA010300AW515041 AA768334 N29860AA42S874 AA425116 AA86S829 AW936878 

AA495800 AA495737 AA010736 AA654716 AA640726 

BE1735156E173560AI902860 

T79703 TB6307 A1j07972S 

AW998716AW022148K68020 

AW062757AW176890AW062758AW176895AW176869AW176900AW176897AW17692BAW176868AW1768^^ 
AW654832 AW654798 AW654857 AW854816 AW854834 AW854617 
AW753456 AW753036 AW6&4888 AW854882 
AW835767 AW835537 6E160187 

AW849046 AW8479S6 AW849039 AW847957 AWB48279 AW848698 AW849034 AW849033 

BE176479 BE176678 BE176357 BE176550 AW8d607g BE17^6 6E176615 6E176555 BE176489 BE176610 BE176362 

8E064420 BE064435 BE064429 BE064414 BE064400 BE064517 

6E072049 BE069471 BE069489 BE069478 BE069479 BE069492 BE069485 BE072122 BE072124 BE069491 BE069486 BE069464 

AA493662 AW897396 BE154814 

AI91 1 066 AI933734 AI680888 AJ003S99 



TABLE 64C 



Pksf. Unique number conesporKfing to an Eos probesd 

Ref: Sequence source. The 7 digUmindieis in this Golunm are Gen^^ "Dunham Let^'fBfeis to the puUicalionenb'tled The ONA sequence of 

human chromosome 22.' Dunham 1. el al.. Nature (1^ 402:489^95k 
Strand: litdicalesONA strand (ioni which exons ware predicted. 
N^,_posl8ont trHficates nucteOSde positloitt d predidsd exons. 



Pkay Ref Strand NLposlDon 



737 



wo 03/042661 



PCT/US02/36810 



400533 


6981826 


MRUS 


277132-277595 




7329326 


Mimis 


147703- i47B9d 






Minus 


o5o79-o5735 




7710968 


Plus 


i4dioU-i4o<94 






Minus 


12i45o-i2iD% 




81TC894 


Plus 


Z42Z3>24^ 




7637817 


Minus 


1 1297-1251 1 


403481 








403510 


7652047 


Plus 


61866^2027 


403667 


6650483 


Minus 


1344-1442.1545-1697 


403896 


3135242 


Minus 


143467-143634 


404003 


8855948 


Plus 


198349-199098 


404561 


9795380 


Minus 


69039-70100 


404571 


7249169 


Minus 


112450-112648 


404592 


9943965 


K^us 


39067-39225 


404848 


6248647 


Minus 


23955-24034^143-25264 


405264 


7329374 


Plus 


28556-28684 


40534B 


2914717 


Minus 


43310-43462 


405510 


7630909 


Minus 


101028-101174 


405548 


1532158 


Plus 


11552-11686 



Table 65A lists about 347 genes up-regulated in BPH comijared to piostate cancer tissues. Tliese were selected from 59680 probesets on ttie AfTymetrix/Eos Hu03 GeneChip array 
such that the ratio of 'avejage' BPH to 'average' prostate cancer fissues was greater than or equal to 3.0. Tf« 'average" BPH level was sel to the 75* percentile amongst BPH 
tissues. The 'average* prostate cancer tissue level was set b the 85*^ percentile amongst maTignant prostate tissues. In order to remove gene-specific background levets of non- 
specific hybridization, the 10<» percentile value amongst non-malignant body tissues was subtracted from both the numerator and (he denominator before the ratio was evaluated. 

TABLE 6SA: ABOUT 347 G£NES UP-1^6ULATED IN BENIGN PROSTATIC HYPERPLASIA COMPARED TO PROSTATE CANCER TISSUES 



Pl^ Unique Eos probeset Identifier number 
ExAccn: Exemplar Accession number. Genbank accession number 
UnlgenelD: Uhfgane nuirder 
Unigene Title: Umgene gene tite 



R1: 


Ratio of BPH Bssue to prostate tumor tissue 




Pkey 


ExAccn 


Unigene 10 


Unigene TiOe 


Rl 


428134 


AA421773 


Hs.161008 


ESTs 


a4 


446336 


AW81S036 


HS.1S1251 


ESTS 


9.3 


456373 


BE247706 


Hs.89751 


membrane-spanning 4-<lomain3, subfamily A 


8.9 


458072 


AI890347 


Hs.271923 


Homo sapiens cONA: FU22785 6s, ck)ne K 


a8 


400533 






ENSP00000209376*:PREO65 protein (Fragmen 


a7 


418310 


AA814100 


Hs.86693 


ESTs 


8.7 


404592 






N^t022739':Homo sapiens E3 ubiqullin lig 


8.1 


454457 


AW753456 




gb:QV2.CT0261-26109W)11<l11 CT0261 Homo 


7.7 


400080 






Eos Control 


7.4 


459646 


AW88396B 


Hs.321190 


gb:QV3-OT0063-290300-13&€04 OT0063 Homo 


7.4 


420352 


BE258835 




gb:601117374F1 NtHLMGCJB Homo sapiens e 


7.3 


436231 


AW594539 


Hs.155689 


ESTs 


7.3 


418387 


R1608S 




gb:yg16b12.r1 Soares Infant brain INIB H 


7.2 


442481 


N9g828 




gb2832c04j1 Soares fetal Ever spleen 


7.1 


449249 


T52285 


Ks.193115 


Homo sapiens mRNA fior KIAA1764 prot^. 


7.0 


457653 


A1820719 


H8.154662 


DnaJ (Hsp40) homolog, subfamily A, membe 


6.9 


404967 






Target Exon 


&9 


430535 


AWg66485 




gb:EST380S61 MAGE resequences, MAGJ Homo 


6.9 


454860 


AW835767 




gb.'aV4-LT001»4O20O-11&bO8 LT0016 Homo 


6.9 


423789 


AK002084 


Hs.132851 


hypothetical protein FLJ11222 


6.8 


412988 


BE046680 




gbiin42M3j(1 NCLCGAPJtDF2 Homo sapiens 


6.8 


400440 


XB3957 


Hs.63870 


nebulin 


6.6 


454171 


AW854832 




gbK3V2-CT0261-201099411-f05 CT0261 Homo 


6.6 


400086 






EosConbol 


6.4 


440911 


AA909536 


Hs.143562 


ESTs 


6.4 


425312 


AA354940 


Hs.145958 


ESTs 


6.4 


426140 


AF131798 


Hs.343768 


Homo sapiens done 251 19 mRNA sequence 


6.4 


419015 


T79262 


Hs.14463 


ESTs 


6.3 


453789 


AA628517 


H5.118S02 


ESTs 


6.2 


424940 


AA985308 


Hs.283902 


ESTs 


6.1 


403867 






Target Exon 


6.1 


429014 


A{800518 


Hs.118158 


ESTs 


6.0 


417758 


U27699 


Hs.82535 


solute canferfatitDy 6 (neurotransmitle 


6.0 


419999 


AI760942 


Hs.191754 


ESTs 


6.0 


405348 






C70016&4:gi|12698061|dl]jIBAB21849.11 (AB 


ao 


404003 






Target Exon 


a9 


453200 


AA033B32 


Hs.212433 


ESTs 


a7 


428002 


AA418703 




gb2v98c03.s1 Soares.NbKMPu.S1 Homosapl 


a7 


432319 


Awsiono 


Hs.128386 


ESTs 


a7 


445444 


AA380876 


Hs.270 


pieckstrin homology. Sec7 and coiledAxi 


&6 


443361 


AI792628 


H5.133273 


ESTs 


5.6 


411518 


AWB50246 




gb:IL3<^T0219-29109»)21*E07 CT0219 Homo 


5.6 


439079 


AF085937 


H3.38348 


ESTs 


a5 


422081 


AW13^20 


Hs.196011 


ESTs 


as 


408197 


AA282262 


Hs.107410 


ESTs. WfeaUy AvStar \o A46010 X^iidced 


a5 


423529 


TB7318 


Hs.120411 


ESTs 


as 



738 
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PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



436578 
450920 
435072 
414403 
443744 
433087 
441916 
414818 
452531 
454988 
415890 
408385 
437752 
446495 
435375 
426748 
424994 
405321 
416706 
438206 
445238 
455747 
420533 
457374 
440354 
440388 
421188 
403481 
438132 
403333 
450317 
433597 
430172 
452643 
452031 
427302 
442123 
438727 
420931 
442160 
442295 
434589 
404995 
422906 
405549 
430697 



455801 
444800 
403371 
438431 
421926 
439752 
414441 
454665 
454585 
452978 
436042 
416059 
451469 
433072 
443318 
424686 
416312 
415467 
459045 
415417 
409111 
441620 
427908 
432765 
448108 
436345 
409500 
41352S 
403305 
436338 
427521 
430124 
410790 
434360 
416385 
442786 



A1091435 

AA011626 

AV«92176 

AW969551 

A1084326 

AI720686 

AA993571 

BE541217 

AA429462 

AW849046 

H08225 

AF055634 

AA767376 

D60923 

AI733610 

Alj048409 

AW954525 

AA314676 
AA780385 
AA883971 
BE074910 
A1809510 
AA493662 
AA889386 
AI6g3520 
AA2846S8 



AA907076 Hs.122060 



A)692689 

AA708205 

AA468S91 

AI798769 

AA741314 

AA400540 

A16977g0 

AW978756 

AF044197 

AI337127 

AI827248 

AF147363 

U80773 

AA484207 
AI620661 
BE140643 
AW119071 

AW207860 

AA300591 

T78968 

AA234759 

AW612866 

6E069128 

AA029994 

AW296971 

AA211566 

NM_014809 

AI928037 

AI051603 

AA345504 

W02640 



N69101 
F12038 
AL043362 



AA417272 

AJ003429 

AW300021 

AA873008 

U08098 

BE145B99 

NiyL008825 

W92147 

AW973352 

AW204994 

AW803357 

AW01S415 

H54253 

H50733 



HS.1348S9 ESTs 5^ 
Hs.133324 ESTs 

Hs.116932 ESTs 5.4 

H3.76064 ribosoma) protein L27a &4 

Hs.271548 ESTs, Weakly similar to 178885 serine/th 5.4 

Ks.152520 ESTs &3 

Hs.129075 ESTs 5.3 

H5.23606 ESTs 5.3 

Hs.293946 ESTs, WesMy similar to 138022 hypothsli 5.3 

gb:tl3^0214.15030IM)85'H06 CT0214 Homo 5.2 

Hs.268712 ESTs 5.2 

Hs.44553 unc5(C.eIeganshomoiog)c 5.2 

Hs.^1631 ESTs. Moderately slRolar to SG5657al|)ha 5.2 

Hs.153460 ESTs 5.2 

Hs.187832 ESTs 5.2 

Hs.97177 ESTs. Weakly similar to ALULHUMAN ALUS 5.2 

gb:EST386595 MAGE resequences. MAGC Homo 5.2 

Target Exon 5.1 

Hs.288945 hypothetical proteh) aJ 13448 5.1 

Hs.187885 ESTs 5.1 

Hs.187506 ESTs 5.1 

gb:RCSaT0580>170300-021.F12 BT0580 Homo 5.1 

Hs.1 18971 ESTs 5.1 

9b.-nh05d11s1 Na_CGAP.Thy1 Homosapiens 5.0 

Hs.125468 ESTs 5.0 

Hs.223000 ESTs 4.9 

H5.261493 ESTs 4.9 

Target Exon 4.8 

ESTs 4.8 

NM^002518*:Homo sapiens neuronal PAS dom 4.8 

gb:wd86g05jc1 NQ.CGAPJii24 Homo sapiens 4.8 

Hs.100343 ESTs 4.8 

Hs.161889 ESTs 4.8 

Hs.208320 ESTs 4.7 

Hs.865 mnfii, member of RAS oncogene family 4.7 

Hs.135282 Homo sapiens cONAnJ11554 lis, ctone HE 4.7 

Hs.159961 EST 4.7 

HS.20S679 ESTs 4.7 

Ks.100431 small IndudbiecytoHneB subfamily (Cy 4.6 

Hs.156325 ESTs 4.6 

Hs.224398 Homo sapiens cONAIOJ 11 469 fis, done HE 4.6 

gb:Homo sapiens fuQ length insert cONA 4.6 

ENSr>00000251890':Monocyticteukemiazlnc 4.6 

Ks.121580 Human EST done 42944 mariner transpQSon 4.5 

C7001976*:gl|475S712HNP.0046S9.1|a! 4.5 

Hs.211867 ESTs 4.5 

HsJ296276 ESTs 4.5 

gb:RCO-HF001&^1059M)16 HT0015 Homo sap 4.5 

Hs.153287 ESTs 4.5 

Target Exon 4.5 

Kb.293116 ESTs 4.5 

gb:EST1 3437 Testis tumor Homo sapiens cO 4.5 

Hs.14411 ESTs 4.5 

Hs.132950 ESTs 4.5 

gb:RC3CT0186^100^)17-b03 ST0186 Homo 4.5 

gb:QV3-BT0379^101Q04171-g06 6T0379 Homo 4.4 

Hs.61523 ESTs 4.4 

Hs.180610 ESTs 4.4 

gb:zn56d05.s1 Stratagene musde 937209 H 4.4 

H3.26441 KIAA0319 gene product 4.4 

Hs.158832 ESTs 4.4 

Hs.133141 ESTs 4.4 

gb:EST51 529 Gall bladder li Homo sapiens 4.3 

Hs.16247 ESTs, Weakly simitar to 2004399Achromos 4.3 

Hs.260274 ESTs 4.3 

H8.40730 ESTs 4.3 

Ks.140970 EST8.W^8imiIartoAliJ6JftiMANALU8 4.3 

Hs.7984 pleckslHnhonvA)gy.Sflc7andooliedfcol 4.3 

Hs.26675 ESTs 4.2 

H8.24122 ESTs 4.2 

gbAX}03429 Selected chromosome 21 cDNA 4.2 

Hs.170685 ESTs 4.2 

Hs.121572 ESTs 4.2 

Ks.54576 sulibtransferase. estrogen^ferring 4.2 

gb:MRimr0208-221299-204-b10HT0208Hoino 4.2 

transmembrane protein (63kO). endoptasmi 4.2 

Hs.1 18394 ESTs 4.2 

Hs.290585 ESTs 4.2 

Hs.253450 ESTs 4.2 

gb:IL24Jiy!00794»(mM)60^8UUX179Homo 4.1 

Hs.127780 ESTs 4.1 

KS.20S241 ESTs.WeaMy8lm]lartoS65657alpha-1C- 4.1 

Hs.256261 EST8.MDderatolySlmBartoAtU8.HUMANA 4.1 
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445206 AI3S0199 Ks.269990 ESTs 4.1 

424200 AA337221 db£ST41944 Endometrial tumor Homo saple 4.1 

448152 AI741053 H5.170770 ESTs 4.1 

. 4S3713 R20640 Hs.79133 cadherin6,typ82 4.1 

5 400746 Target Exon 4.1 

413081 BE064415 gb:RC4.BT031V-241ig9^^()03BT0311 Hoino 4.1 

442338 Ar761976 H$.156080 ESTs 4.0 

455368 AW93S234 8b:QVO-DT00204)90200>10&g05 DT0020 Kbnu> 4.0 

455276 BE176479 gb:RC34IT0585.16030^022*09 HT0585 Homo 4.0 

10 421312 AA824627 HsJ91670 ESTs 4.0 

431421 AW969118 H110B144 ESTs. WeakfysbnOBr to unnamed pioteln 4.0 

448882 AJWISSI Hsi2404 protease, serine, 12 (Reun)lrypsin.moto 4.0 

441568 A[733322 Hs-127176 ESTs 4.0 

455646 BE0&4420 gb:RC4.BT0311-24119W)12^8BT0311 Homo 3,9 

15 424765 AA428211 HS.2342S6 hypothetical protein FU 14033 similar to 3.9 

454024 AA993527 Hs.293907 hypothetical protein RJ23403 3.9 

453387 AI990741 Hs.252809 ESTs 3,9 

434222 AF119686 Hs.283941 Homo sapiens PR02591 inRNA,completBCd5 3.9 

454806 AWB72430 Hs.273743 ESTs 3.9 

20 429066 AA888555 Hs.178222 ESTs 3.9 

435878 R08330 HS.201S2 ESTs 3.9 

453055 AW291436 Hs.31917 Homo sapiens, done MGC:S658. mRNA. comp 3.9 

447530 AW192063 Ks.248865 ESTs. Moderately similar to JC5238g8lac 3.9 

407834 AW084991 Hs,26100 ESTs 3.9 

25 400398 AF137396 Hs.283879 ubiqu]iln3 3,9 

421353 AW2g2857 H$.2S5130 ESTs 3.9 

401459 C14000482*:gq9790241[refINP.062628.1|8 aO 

403433 NIUL001622:Homo sapiens atpha-2-HS^IycQp 19 

^. 444911 U06117 Hs.250 xanthene dehydrogenase 3.9 

30 436350 AA713661 Hs.121091 ESTs 3.9 

447930 R44574 Hs.107510 ESTs 3.9 

410559 AW754192 ob:RC2'CT032M312994)12.a04CT032rKomo 3.8 

452320 AA042673 H1160412 ESTs 3.8 

402145 Target Exon 3.8 

35 458438 AI141520 Ks.151464 ESTs, Weakly GimnarbALUC.HUI\4AN!!!! 3.8 

440450 A1333129 Hs.156147 ESTs 3.8 

437587 A1591222 Hs.72325 Human DNA sequence from done RP1-187J11 3.8 

416009 Z43062 gb:HSC12E041 nomiaiized infant brain d}N 3.8 

434381 AA631834 gbnip77h05.s1 Na_CGAP_Pr2 Homo sapiens 3.8 

40 416534 H69043 Hs224961 Homo sapiens cDNARJ14366fis. done HE 3.6 

433523 H29B82 ESTs 16 

411436 AW846433 gb:QV0^0179m300-143^2CT0179Homo 3.8 

424830 AW27O580 Hs.189311 ESTs.WealdysimnartDput8th«p150[H 3.8 

431447 AAS05138 Hs.291341 ESTs 3.8 

45 435932 W03928 Hs,114524 ESTs 3.8 

442447 AA999723 Hs.129507 ESTs 3.8 

419831 AW446930 Hs.5415 ESTs 3.8 

403242 Target Exon 3.8 

455490 AW953477 gb:EST365547 MAGE resequences. MAGB Homo 18 

50 449264 A1637649 Ks.196105 ESTs 18 

443635 AI080230 Hs.134214 ESTs 17 

428200 A1039624 Hs.98388 ESTs 17 

452462 BE173515 gb:RC2.HT05e0.21020OO12-«D3HTO560Homo 17 

418759 AA227879 Hs.187621 ESTs 17 

55 450497 H64159 Hs.15328 ESTs 3.7 

447785 AL041765 H5.340375 ESTs 17 

451746 M66178 Hs.311258 ESTs 17 

416321 H94331 Hs.34024 ESTs 3,7 

448135 AI470874 Hs.343799 ESTs 17 

60 405510 ENSP00000233779-:Hypothe8cal610kOap 17 

407437 AF220264 gb:Homo8apierBMOST.1mRNA.compl8tBcd 3.7 

451859 H44491 Hs.252938 ESTs. Weakly similar to ALU1.HUI«AN ALU S 17 

455135 AW857989 gb:PM2-CT0328-2812994)01e04Cr0328Homo 17 

^. 409189 AA12S984 gb2n27h06.r1 Stratageneneuroeplthelium 17 

65 420300 AA25824S Hs.127573 Homo sapiens FKSG41 {FKS641)mRNA.oampl 17 

427726 A1359144 H$.143688 Homo sapiens cDNA: RJ23031 fo. done L 17 

434306 AW081757 m.44241 Homo sapiens cDNA:FU21447 Its. dona C 17 

456354 X56411 Hs,1219 aloohd dehydrogenase 4 (dass II). pi p 17 

419261 X07876 Hs.89791 wngless-type MMTV integraBon site fami 17 

70 422899 D16471 HS.121S71 Human mRNA. Xq terminal portion 17 

447458 AI741082 HSw156961 ESTs 17 

411421 BE272110 Hs.21177 ESTs 17 

439075 AF085933 Hs^20 ESTs 17 

__ 440947 AA910403 ESTs 17 

75 417565 AI203405 Hs.47B31 ESTs 16 

430096 U91935 Hs.233321 RetIn»JeiivedPOl>4om^«ador-1 16 

446658 H71739 Hs.200227 ESTs, Moderately similar to A53959ttirom 16 

404561 trichorhinophalangeal syndrome 1 gene (T 3.6 

^- 429073 AA446167 Hs.47385 ESTs 16 

80 419002 T7B625 Ks.268594 ESTs 16 

450630 AA010429 Hs.191939 ESTs 16 

455087 AW854538 gb:RClCT02S5.20010&024W CT0255 Homo 16 

412768 AW996044 Hs.26239 Human DNA sequence tan dona RP11438B2 16 
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439481 AF088294 HS.12S844 ESTs 3.6 

419622 AA452054 Hs.119338 ESTs 3.6 

449821 AI671141 K5.211122 ESTs 3.6 

451193 N23850 Hs.44098 ESTs 3.6 

5 412701 AW984757 ob:RC1-HN001&040400-011-g10HN0015Homo 16 

419611 AB031479 H&.91600 SEEKIproteh 3.6 

433563 At732837 Hs.277901 ESTs 3.6 

427235 AI1262B8 Hs.192232 ESTs 3.6 

443045 BE072463 Ks.276337 Homo sapiens cONAFU11537 its. done HE 3.5 

10 401132 C120005ir-.fli)4758712IrefINP.004659.1l3 3.5 

421105 AA766501 Hs.125113 ESTs 3.5 

405264 NM_03081 3*:Homo sapiens suppressor of po 3.5 

417675 AI808607 Hs.3781 similar to murine leuctne-rich r^)(eat pr 3.5 

449988 AW372066 Hs.201420 ESTs,Moderale}ysimn3rbALU7.HUMANA 3.5 

15 434763 AA548618 gb:n507a11.f1 NCLCGAP_Ewl Homosa;»8ns 3.5 

416379 N38857 H5.203933 ESTs 3.5 

407557 Z83803 gbiRsapiensmRNAfbraiconeniaidynelnhe 3.5 

437956 AA773283 Hs.203559 hypoOiedcal pioteln aJ12701 3.5 

403510 Target Exon 3.5 

20 451722 H86374 Ks.40861 ESTs 3.5 

435681 AA694192 Ks.148979 ESTs 3.5 

427125 AA683362 Hs.97612 ESTs 3.5 

424287 AL133105 Hs.144633 hypothetical prot^DKFZp434F2322 3.4 

434497 AI821803 Hs.136580 ESTs 3.4 

25 4536B2 T79703 gb:yd71e08.r1So3res fetal Dver spleen 3.4 

404848 ENSP00G00240769*:BG153O3.1 (slmaar to C 3.4 

417435 NA4.005181 Hs.82129 carbontoanhydrase 111. musde specific 3.4 

442231 W02434 H5.222413 ESTs 3.4 

450297 AW901347 Hs.38S92 hypothefical protein FU23342 3.4 

30 436340 R42246 Hs.21606 ESTs 14 

441596 AA939300 Hs.206768 ESTs 3.4 

456995 T89832 Hs.170278 ESTs 3.4 

456304 AI820973 ob:nc21c02.y5Na.GGAP.Pr1 Homo sapiens 3.4 

427033 AI457449 H8.192817 ESTs 3.4 

35 400379 NM.01&432 Homo sapiens Ofvarian cancer related prol 3.4 

410551 1^5730 Hs.21B16 ESTs 3.4 

423357 AI285124 Hs.157505 ESTs 3.4 

450582 A)339732 &f1ch RNA sequence binding factor 1 3.4 

437662 AA765387 Hs.145095 ESTs 3.4 

40 442388 AW663442 Hs.129485 ESTs 3.4 

445004 AI204616 Hs.148701 ESTs 3.4 

450597 A]701635 Hs.207077 ESTs 3.4 

438801 AA825971 Hs.124284 ESTs 3.4 

445432 AV653771 gb:AV653771 GLC Homo sapiens cDNA done 3.4 

45 450003 AA777809 Hs.191995 ESTs 3.4 

431408 AA504757 Hs.105738 ESTs 14 

428923 BE047698 Hs.188785 ESTs 14 

407344 A103802S Hs.2714ie 0b:ox29f07jc1 SoaresJolaUelii8j<b2HF8_ 13 

_^ 425523 AB007948 Hs.158244 KIAA0479 protein 13 

50 427473 AW274439 HS.2S2709 ESTs 13 

437115 AA744703 H5.129030 ESTs 13 

434520 AA205273 Ks.177011 hypothetica) protein 13 

447282 A(989963 Hs.197505 ESTs 13 

453326 AW292635 Hs.346145 ESTs 13 

55 418985 AI042330 Hs.87128 hypoOielical protein FU23309 13 

449256 AADS9050 Hs.59847 ESTs 13 

432550 AW297206 H3.164018 ESTs 3.3 

423101 M83941 H5.123642 EphA3 13 

424853 BE549737 Hs.132967 Human EST clone 122887 manner transpose 3.3 

60 410700 AA352335 Hs.65641 liypotheOcal protein FU20073 13 

418719 AW975S90 Hs.161707 ESTs 13 
402703 Target Exon 3l3 

455710 BE072049 gb:PM4-BT0532-170100«)4^BTD532Homo 13 

452011 AW628911 Hs.211429 ESTs 13 

65 453973 AI291895 Hs.61993 ESTs. Wesft^sirrllarto ALU1JHUMAN ALU 8 13 

420355 AW958263 Hs.123126 ESTs 12 

41B898 T66847 Hs.194040 ESTs. Weakly similar to 138022 hypothefl 12 

418986 AI123555 Hs.81796 ESTs 12 

449570 AA001793 gb:zh86c08/1 Soaresjetaljver.spleen. 12 

70 447516 W0S355 Hs.102971 hypotiutfcal protein FU 14751 12 

408304 AW810279 gb:MR4-ST0125-151299O29-a09ST0125Homo 12 

402454 C100250r:fli|129092|sp|P23270lOi^.RATO 12 

411552 AW851255 gb:IL3^T0220.16020OO66-H02 CT0220 Homo 12 

407896 D76435 K3.41154 Zic Mly member 1 (oddi)aifedOrosophi 12 

75 435786 H09175 H5.26085 ESTs 3.2 

447597 AI686036 Hs.213675 ESTs 12 

432625 Ai243598 Hs.94830 ESTs. Moderately similar to T03094A^n 12 

459535 AV6S4907 gb'AV6S49076LC Homo sapiens cONA done 12 

_^ 447183 A1554733 Hs.173182 ESTs 12 

80 426629 AI203933 Hs,97142 ESTs 12 

447892 AI435848 H5.172g78 ESTs 12 

457136 AM28240 H9.126083 ESTs 12 

443585 AW46e983 Hs.283949 enameBn 11 
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451399 
450687 
416812 
452102 
434512 
458251 
439950 
429569 
403605 
422666 
408799 
429350 
430454 
441617 
427773 
415901 
445758 
403291 
436295 
455772 
418037 
438142 
416423 
448765 
457021 
420301 
420721 
436013 
441492 



419386 
435628 
444326 
413774 
434352 
434269 
416422 
416642 
439182 
418948 
433796 
427813 
405733 
426743 
449913 
454096 
440129 
410929 
458187 
429430 
451391 
414951 



AL042110 

AA495800 

H91010 

U04343 

AW139932 

AL040927 

AW937417 

AA454993 

AA677981 

AA059412 

AI7S4634 

AW469011 

AW969706 

AA412290 

K08396 

R60715 

N73895 
W2B799 
A1990212 
T90309 
H54375 
R15337 
AW968g34 
AA767526 
AA927802 
AA703419 
Ai149998 
AW808361 
AA236867 
W88732 
AI939357 
AA131782 
AF129505 
AK001991 
H60457 
T96118 
AF0B6030 
AI217097 
. AAB10867 
D29833 

AA383833 

AA004696 

AW062757 

AA865818 

H47233 

DS6919 

AI381837 

AA017410 

AW794931 



Hs.326728 ESTs 



Hs.44940 

HS.27S54 

Hs.188941 

Hs.210422 

Hs^93561 

Hs.138343 

Hs.1ig023 

Hs.47986 

Hs.131987 

Hs.105635 

H5.293332 

Hs.98124 

Hs.76118 

H8.25804 



Hs.88447 
Hs.269651 
Hs^8921 
H3.21958 
Hs.173108 
Hs.22030 
Hs.159471 
Hs.287749 
Hs.146346 



Hs.36107 

Hs.270710 

Hs.182314 

Hs^6492 

H&3781 

Hs.226313 
H8.21821 

Hs.186997 
Hs^07 

Hs^45022 
Hs.333016 

Hs,174936 
Hs.30643 
HS.26S848 

Hs.155335 

Hs,40568 

Hs.100861 



gb2wOSb07^1 So8res.NhHMP0.S1 Homossp) 
ESTs 

C086 antigen (CD28 antigen figand 2, B7- 
ECTs 

ESTs ^ 
ESTs 

ESTs, WeaMy similar to 178885 serineAh 
Targel Exon 

SMC2 (structural maintenance of chromoso 

hypothetical piotein M6C10940 

ESTs 

ESTs 

ESTs 

ESTs 

Utjiqulfin cartioxyMennlna) esterase 11 
ESTs 

Tai^etExon 

9b:za62d06^1 Scares fetalDver spleen 

gb:52g1 1 Human retina cONA randomly prim 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp547D086 (fr 
Homo sapiens cDNA: RJ21897 tis. done H 
paired box gene 5 (6^ lineage specif 
ZAP3 protein 

Homo sapiens cI3NA: F1J23593 fis, done L 
ESTs 

gb:MR1^TO11M11099m(04ST0111 Homo 

ESTs. W^aldy sintilar Id 138022 tiypotheti 

ESTs 

ESTs 

ESTs 

small muscle protein. X-ilnked 
similar to murine leudne-fidi repeat pr 
ESTs. Moderately similar to ZN91J1UMAN Z 
ESTs 

hypothetical protein DKFZp762O076 
gb:qd43h07,x1 SoafBsJetaI,hearLNbHH19W 
ESTs 

salivary prortne-rich protein 
NMJX21140*:Homo sapiens ubiquitously tra 
ESTs 

ESTs 

gb:CMO-CT0103-120B99-037-g07 CT0103 Homo 
ESTs. Wealdy similar to S71686 Ste20^!( 
ESTs 



ESTs 
ESTs 

hypoiheOcal protein FU14600 



3l1 
11 
3.1 

ai 
ai 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 

10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
3.0 
10 
10 
10 
10 
10 
10 
10 
10 
10 



TABLE 65B 



Pksf. Unique Eos probeset identifier number 
CAT number Gene duster number 
Accesston: Genbank accession numbers 



Pkey 
408304 



409189 

410559 
410790 
410869 



411436 
411518 
411552 
412701 
412988 
413081 
413525 
416009 
416422 
418059 
418387 



CAT Number 
1050848J 



110687.1 

1208283J 
1221 131J 
1225123 1 



1245660 1 

1248692.1 

1249255.1 

1322288.1 

1342150J 

1348563 1 

1374635 1 

1566379,1 

1593811J 

171879J 

174731.1 

180808.1 



AVV810279 BE146684 BE146693 BE146694 BE146679 AVV810472 AVV810208 AW810356 AW810193 AW178838 AW 
AWB10330 AW810514 AW810441 AW810358 AW17e8S2 AVV810359 AW810322 AVV810327 AVIQl^^ 
AW810263 AW810325 AW81 0443 AW8 

AA125984AA127189AA065075AA070377AA100017AA079891AA113255AA075168AA082764AA083380N84829AA 
AA085119 AA085208 AA085045 

AW754192 W00554 AW8577g7 AW754203 AW7M197 AW754193 
AW803357 AW803423 AW812233 R06814 

AW808361 AW808404 AW808386 AW806594 AW808654 AW808813 AW808551 AW808676 AW808350 AW808406 AW808694 AW808934 

AW808829 AW808385 AW808422 AW808401 AWa08409AW808760AW808863AW808521 AW808539AW808609AW608472AWB08739 

AW80d704 AW808556 AW808714 AW808420 AW8 

AW848433 AW8461 59 AW846377 AW846528 

AW850246AW850251 AW850302 

AW851255AW851432AW850955 

AW984757 AW984797 AW984734 AW984745 

BE048680 6E046738 BE044958 

BE064415 BE064430 BE06444B 

BE145899 BE145848 BE145849 BE14S653 BE145927 BE14S925 

Z43062R13213H14422 

H6(M57 H6S709 H7352B HS433S R87154 

AA211586F35799AA211641 F29720AW937387 AW937408 

R18085 AA219028 R17712 Z44345 

Ai217097 AW886090 W38035 W38792 AA232835 AW936043 
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4i9md 


lo4430_1 


AA236oq7 AAZJrObD AAJM2W AVVsdf rd9 nUKwl 


420352 




BE25Bo35 AWSoBolD AA2aB9lQ KNviSSta K14r44 AldciilJllo bcUr 182i} tu02oU 


421920 




AA300591 AW9o3o93AA3«M93 


424ZUU 




AA337221 AA33o7m AW3oo19o 


424666 


24Z4o5.i 


Ak.'ider/XA AA^ACtKA AXk.mCi'iA'i 

AA345504 AA34525 1 AVV3oo243 


424!»4 


245788_1 


AW^525 Aij72oo5 AAo495Ul /uj7Zdo7 Hi 0x4 


428002 


Zo5o02_1 


• AJ«tnA4 AAJ4II^44 BPfn4 ME nCAV4n'W\ DCfn4A4 Q 

AA418703 AA418711 BE07191S dc071920 ocOr 1912 


430535 


31 9543 J 


AW%8465 AW968670 AA480922 BE350425 


4J2/DO 


353907_1 


AJQu3429 AJ0D33o7 AA5o4o25 


433523 


3ooo73.i 


Lnnaoo avaiccccio AtAi4iirmn4 AicmiT AACDfiCfM AtCBe/ftfi AmtnOft AUmtU^QA AUUft(Sil90n 
ru9oo2 AVVbD5533 AWl 4^01 AI5r291 7 AAsSosOU AlboMoo /UooOJw AWoo4aaU AW00402U 


434381 


3851 55_1 


A A^^40^J A A^<VlJAf A AP4AAPP A A nfM*944 

AA631 834 AA533425 AA532455 Al79231 2 AI7923 1 1 


434569 


38929.1 


AF147363 T47219 T47218 


434763 


392847.1 


A Af AO04O AftAJr\^J'>OA 

AA648618 AVV974389 H51771 


436295 


41733.1 


At^40nf A IAA4D*VO 

N73B95 AJ001872 


440947 


505904.1 


AA910403 A]B15593 W58361 AW16K20 AIS16550 


442481 


543588.1 


N99828 BE079873 A1110738 AF074645 


445432 


63943.1 


A%i^P«^^WA nv*AAA4V*A 

Av653771 BE089370 


449570 


61018.1 


AA0Ot793AAOO1871 


450317 


831956.1 


A1692689 R14223 R18395 


450582 


83933.1 


AtAAA^«tA AAAAAAAA AftAl^J^AAA A A^£>AAAA AlAAAMt A A A Af An J AAJAe440 A AOCCBAA AlAIA^ffO^O 

Af339732 AAOl 0300 AW51 5041 AA768334 N^60 AA425B74 AA4251 1 B AA865o29 AW93o878 


450687 


84327.1 


AA495800 AA495737 AAOl 0736 AA654716 AA640726 


452462 


918580.1 


n^4^Atff4P m~A'9Aff*^A AtAA^A^A 

6E173515 BE1 73560 Ai902880 


453682 


977454.1 


T79703 T96307 AIJ07972S 


454096 


1007449.1 


AW062757 AW176890 AW082758 AW176895 AW176o69 AW17o900 AW176o97 AW17d92B AWudooS AW17dB92 


454171 


1049240.1 


AUmcja40 AtAtOrJ^nO A\AIOttOCI A\AI&CJ04e AIU0eJ04J AlAfQCJQ4T 

AVvo54832 AW85479B AVvo54857 AWB54o1 6 AVVo94o94 AWo54o1 7 


454457 


1207274.1 


AiAno^ce AijmcMoe aia/oc^bco aiafocjiom 
AW75345b AW75303o AW854o58 AVvo54BoZ 


454585 


122565<.1 


Bpncn4ia acnennno AXAiotwne 

BE06912O Bc069023 AW0O9375 


454665 


1 228599.1 


AtAlft4A(IMS AtAMI4'WJfi AIAIA4<n.4^ A1AHM4II01 iklAI044VC4 At>fQ< 

AWB12866 AW812746 AWo12747 AW8iKo4 AWoiZroJ AWai2722 


454860 


12377^1 


AUHMCVm AIAIB4ef4Y BC4dMm 

AVVo357o7 AWB35537 BEi601o7 


454988 


1247029 1 


AW849046 AW8479Sfi AWB49039 AW847957 AW848279 AW848898 AW&49034 AVV849033 


455067 


1252050ll 


AW854538 AW85441B AW854412 


455135 


1254729J 


AW857989 AW858016AW661677 AW861689AWB61691 AWB58056 


45S276 


1272541.1 


BE176479 BE176878 BE176357 BE176550 AW888079 BE176676 BE176615 6E176555 BE1764a9 BE176610 BE176362 


455388 


1287904.1 


AW936234 AW936074 AW936181 AW936l7g AW936217 AW936077 AW935227 AW936191 


455490 


1297826.1 


AW953477 Z41970 F12435T73989 T09387 


455646 


1348557.1 


6E064420 BE064435 BE064429 BE064414 BE054400 6E064517 


455710 


1352368J 


Be072049 8E069471 BE0&9489 BE069478 BE069479 6E069492 BE069485 BE072122 BE072124 BE069491 6E069486 BE069464 


455747 


1355877.1 


BE074910 BE074913 BE07491 1 BE074903 BE074892 BE074935 


455772 


1363114.1 


W28799BE088078 


455801 


1370508.1 


BE140643 6E140645 BE140644 BE140657 BE140660 BE140659 BE140661 


456304 


176820.1 


A1820973 AI734077 A!820984 AA225796 AA225080 AA225101 


457374 


32B758J 


AA493662 AW697396 BE154814 



TABLE 65C 



Pkey: Unique number conesponding to an Eos probesel 

Ref: Sequence source. The 7 digit aumbers in liiis cdumn are Genbank IdentiRer (Gl) numbers. "Dunham i. et al/ refers to Ihe pubncaGon entitled The DMA sequence of 

human chromosome 22.* Dunham L el aL, Nahire (199^ 402:489^95. 
Strand: Indicates D^IA strand from which exons were predielad. 

NtjxaHlon: Indicates nucleotide positions of oredlded exons. 



Pltey 


f^ 


Strand 


I^Lposltlon 


400533 


6981826 


Minus 


277132-277595 


400746 


7329328 


Minus 


147703.147896 


401132 


8705350 


Mnus 


85679^5 


401459 


9212270 


Mmus 


18200M83323 


402145 


8018280 


Phis 


113086-114800 


402454 


7534025 


Minus 


14826-15803 


402703 


8705069 


Minus 


15335*15500 


403242 


7537817 


Minus 


11297-12511 


403291 


7230870 


Rus 


95177-95435 


403305 


8099945 


PhJS 


114632-114805 


403333 


8566833 


Minus 


124794-124941 


403371 


9087278 


Plus 


105655-106050 


403433 


9719611 


Minus 


72225-72437 


403481 


9965004 


Plus 


93496-93633 


403510 


7652047 


Plus 


61866^2027 


403667 


6850483 


Minus 


1344-144Z1545-1697 


403805 


8140491 


Minus 


51483^1742.53429.53511 


404003 


8655948 


Plus 


198349-199098 


404561 


9795980 


K/5nus 


69039-70100 


404592 


9943965 


Minus 


39067.39225 


404848 


8248647 


Minus 


23955-24034.25143-25264 


404987 


7523744 


Nfinus 


89944-90729 


404995 


6006247 


Minus 


154015-154123 


405264 


7329374 


Plus 


28556-28684 


405321 


3419846 


Minus 


4465445210 


405348 


2914717 


Minus 


43310^62 


405510 


7630909 


Minus 


101028-101174 


405549 


1552494 


Phis 


10876-11048 


405733 


9884689 


PhB 


124632-125051 



743 



wo 03/042661 
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Table 66A Dsts 8bout370 genes up-regidated fn 6PH compared to prostate cancer and nonnal adull tissues. Tliese were selected from 59680 protiesets on the AfTymetrix/Eos Hu03 
GeneChip array sudi thai the ratio of 'average' BPH to 'average' prostate cancer and norma! adult tissues was greater than or equal to 3.0. The 'average' BPH level was set to the 
75^ pereentHa amongst BPH tissues. The 'average' prostate cancer and nonnal adult tissue level was set to the 65"* percentile amongst maOgnanl prostete tissues and nonnal adull 
tissues. In order toiemove gene-spedSc background levds ofnofMpeclGc hybridlzaflon, Oie lO^percentite value amongst noo-fliaBgnant body tissues was subtracted from both the 
numerator and Ihe denominator befoTB the rsGo was evahiated. 

TABLE 66A: ABOUT 370 GENES UP-REGULATED IN BENIGN PROSTATIC HYPERPIASIA COMPARED TO PROSTATE CANCER AND NORMAL ADULT TISSUES 

TW- UrtJqu6 bos t)r6ft6d6t iMHlihtf ftOrttW 

ExAcca* Exemplar Accession number. Genbank accession number 
UnigenelD: Unlgene number 
Ur^ena Trite: Unlgene gene title 

R1: Ratio of BPH tissue to prostate tumor and normal body tissue 



Pkey 


ExAccn 


UnigenelD 


Unlgene HUe 


R1 


410929 


H47233 


Hs. 30643 


ESTs 


21.1 


450S93 


AW450461 


Hs.203965 


ESTs 


16.7 


41 8432 


M14156 


HS.B5112 


insdln^Ike growth (actor 1 (soroatomedl 


14.2 


432473 


AI2O2703 


Hs.152414 


ESTs 


11.3 


446330 


AVV815036 


Hs.151251 


ESTs 


10.9 


407275 


AI364186 




Sbxiw34h07jt1 NCI.CGAP.UM Homo sapiens 


10.7 


428134 


AA421773 


Hs.161008 


ESTs 


10.2 


400297 


AI127076 


Hs.306201 


hypotheticall protein OKFZp56401278 


9.8 


433456 


AA508353 


Hs, 105314 


reiaxin 1 (Hi) 


9.S 


415293 


R49462 


H5.106541 


ESTs 


9.1 


458072 


AI890347 


Hs.271923 


Homo sapiens cDNA: RJ22765 lis, done K 


6.8 


428927 


AA441837 


Hs.g02SO 


ESTs 


8.6 


420345 


AW295230 


H5.25231 


ESTs 


8.5 


453387 


AI990741 


HS.2S2809 


ESTs 


a2 


454457 


AW753456 




gb:QV2-CT0261'261099^11<d11 CT0261 Homo 


7.7 


441247 


AW1 18881 


Hs.128051 


Homo sai^ thymic stromal lymphopoleti 


7.5 


431576 


M76665 


Hs.275215 


hydroxysierold (ll-bete) dehydrogenase 1 


7.4 


400080 






Eos Control 


7.4 


431448 


AL137517 


Hs.306201 


hypothetical protein DKi7p56401278 


7.4 


404592 






NM-022739*:Homo sapiens E3 ublqUiQn tig 


7.3 


420352 


pg25883S 




gb:6011 17374F1 NIHJMGC_16 Homo safriens c 


7.3 


438231 


AVV594S39 


Hs.1 55689 


ESTs 


7.3 


410330 


AW023630 


Hs.1 59425 


ESTs 


7.2 


449300 


A1&56959 


Hs.345514 


ESTs 


7.1 


449249 


T522B5 


HS.19311S 


Homo sapiens mRNA for KiAA1764 protein, 


7.0 


426384 


AI472078 


Hs.303662 


hypothetical protein FLJ13189 (FU13189) 


6.8 


454171 


AW854832 




^:QV2-CT026l-201099^11't05CT0261 Homo 


6.6 


408197 


AA282262 


Hs.1 07410 


ESTs, Weakly similar to A46010 X-Hnked 


6.6 


429732 


U20158 


Hs.2488 


lymphocyte ortosolic protein 2 (SH2 doma 


6.6 


421154 


AA284333 


Hs.287631 


Homo sapiens cDNA FU14269 lis. done PL 


6.5 


431467 


N71831 


Hs.256398 


Homo sapiens mRNA; cDNA DKFZp434E0528 (f 


6.5 


424433 


H04607 


Hs.9218 


ESTs 


&5 


442481 


N9982B 




gb:za32c04/1 Soares fetal liver spleen i 


6.4 


425312 


AA354940 


Hs.145956 


ESTs 


&4 


426140 


AF131798 


H8.343768 


Homo sapiens done 251 19 mRNA sequence 


6.4 


440911 


AA909538 


Hs.143562 


ESTs 


6.4 


400533 






ENSP00000209376^PRED65 protein (Fragmen 


6.2 


410910 


AAol4100 


n5.oo693 


ESTs 


0.2 


403867 






Target Exon 


6.1 


436396 


AI683487 


Hs.152213 


wingtess-type MMTV integration site fam) 


6.1 


404003 






Target Exon 


5,9 


424853 


6E54g737 


Hs.132967 


Human EST done 122887 mariner transposo 


5.9 


438138 


R9B299 


Hs.177502 


ESTs 


5.9 


424940 


AAg85308 


Hs.283902 


ESTs 


5.8 


434485 


AI623511 


HS.116S67 


ESTs 


5.8 


453200 


AA033832 


H8.212433 


ESTs 


5.7 


428002 


AA418703 




gb3v98d)3.s1 Soares.NhHMPu.S1 Homosapi 


5.7 


424850 


AA151057 


H9.153498 


chromosome 18 open reading frame 1 


5.6 


445444 


AA380876 


Hs.270 


pleckstrin homologyi Sec7 and ooQedAid 


5.6 


443361 


AI792628 


Hs.133273 


ESTs 


5.6 


421513 


X00949 


H3.105314 


relaxto1(H1) 


5.5 


439079 


AF085937 


Hs.38348 


ESTs 


&5 


430535 


AW968485 




gb:EST380561 MAGE resequences. MAGJ Homo 


5.5 


436578 


AI091435 


H3.134859 


ESTs 


&5 


424051 


All 10203 


Hs.138411 


Homo 6af)iens mRNA; cONA DKFZpS88J1922 (T 


5.4 


421663 


AI952677 


Hs.108972 


Homo sapiens mRNA: cDNA DKFZp434P228 {fr 


5.4 


435072 


AW592176 


Hs.1 16932 


ESTs 


&4 


435375 


AI733810 


Hs.187832 


ESTs 


5.4 


444609 


AW57t659 


Hs.278081 


ESTs 


5.4 


416602 


NM_006159 


Hs.79389 


nd (chlckenHiite 2 


5.4 


433087 


AI720686 


HS.1S2520 


ESTs 


5.3 


439092 


AA830149 




gb-.oc44i08.s1 NQ.CGAP.GCBI HoRtosafriens 


5.3 


437287 


AW511443 


Hs.258110 


ESTs 


5.3 


441916 


AA993571 


Hs.129075 


ESTs 


5.3 


452784 


BE463857 


H8.151258 


hypothstksl protein FLJ21062 


5.3 


452531 


AM294&2 


Hs.293946 


ESTs. Weddy similar to 138022 hypothefi 


5.3 



744 
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MaJ941 


nS.1ZJ042 


E|rfiA3 


&3 






AI591222 


Hs.72325 


« « nvtA » - - -« DtM 4fln 14 4 

Human DNA sequence from done Hri-io/ Ji 1 


Sl2 








nS.^Dorl£ 


ESTs 




5 




A\KIQAOt\AC 

AvVo4tflMD 




gDnL34^TU2i4>l3U}0UH}»-nuo \/IU4i4n0fn0 






/\rUK>tM4 


nS.44&DJ 


unc5 {Celegans homolog} c 








AA767376 


nS291o31 


ESTs, Mooerately sinvar to Sd5o57 alpna 


5.2 






UoUiiZj 


riS.lao4ou 


CCTs 
CbiS 


c ■> 






U6U//0 




Human EST done 42944 mariner transposon 




in 






ns. 1 *iODoo 


nOmo 5a|jcns cuiw ruzouj 1 ns, MunB l 


ai& 


426748 


AL048409 


Hs.97177 


ESTs. Weaxty similar to AIU1_KUmAN ALU S 


5.2 






vva4i4/ 


Ue 1 1 fi'iOA 








416706 


AA314676 


Hs.288945 


hypothelical protein FU13448 


&1 




440129 


AA8o5618 


118.174936 


ESTs. WeaWy similar (o S71B86 Ste20-uk 


5.1 


1 s 


445238 


AAfl83971 


Hs. 187506 


ESTs 


5.1 


450582 


AI339732 




G-fich RNA sequence binding facUv 1 


C 4 

9.1 




420533 


AI809510 


Hs.1 18971 


ESTs 


5.1 




438447 


AIuoZ0o3 


nS.30/32 


lu«ajKl><An<Mj amIaJa CI 141 A AO. I/IAA1714 

nypouieuca pfotetn rUlo4u9; iwVAirii 


5.1 




440354 


AA8o93oo 


Hs.1 25468 


ESTs 


5.0 


90 


452891 


N75582 


HS.212d75 


poTi. tAi;aaL4.< ai_na. 1^ n\/ijn Ul 11 1 A Al /^l > ■ A 

tSTs, weaWy simuar to DYn9_HuMAN ulIA 


5.0 


457374 


AA493o62 




go:nm)5d12.s1 NCI_CGAP_Tny1 Homo sapiens 


5.0 




40^29 


NM„0Uw42 


LI- aOMjI 

n5.4ooo4 


heparan swan (guoosamine] 9>o-6unot 


C A 

9.V 




448041 




Ub OACyiQ 


ESTs 


c n 
3.0 




457653 


Aio2D719 


nS.154oo2 


DnaJ (nsp4u} nomolog, suoTamiiy A, menwe 


4.9 


9S 


450497 


H641S9 


Hs.15328 


ESTs 


4.8 


JIB* Vt 

438132 


A A nmnte 

AA3u7076 


H3,12206U 


ESTs 


A 0 

4.0 




414818 


oc54lZ17 


nS.23oOo 


ESTs 


A 0 

4.0 






A AXOQQQ 


U« 049COC 


CO 1 St weaKiy snTuiar lo i\iaai joo proiein 


Jl D 

4.0 










CCTe 
CbIS 


A a 
4.0 


JV 


450317 


Aid92oB9 




gDwdooguaJci NujiAaAr.LiJZ4 Homo sapiens 


4.0 


443635 


AI0BO23O 


LJ_ 

Hs.134214 


ESTs 


4.8 




430172 


A A jicoeiM 

AA4oB591 


Hs.1o1oo9 


ESTs 


4.0 




452843 


AI79o7d9 




bblS 


Jl 7 

4./ 




427302 


AA40U540 


nS.1 95282 


UaUMft OHHlaaB «nAIA CI 14 4 CCA Aa alaaa UC 

n0m08apKn$CUnArLJii354ii5, done He 


4.7 




452031 


AA741314 


Hs.865 


RAP1A, member of RAS oncogene family 


Jl 7 

4./ 


423789 


AKu02uo4 


Un 4'1')DC4 

. Hs.1 32651 


nypotneiicai protein FUl 1 222 


4.7 




417173 


U01397 


U« QiAOA 

nS.Dl424 


UDcquituHike i (seninnj 


4.6 




44U90I 


/\rU441?r# 


nS.iUlKK}i 


small Inducible cytokine B sutifamBy (Cy 


4.0 




454024 


AA»35Z7 


Hs.293907 


nypotneucal protein FU23403 


4.6 


A(\ 
HKi 


405348 






^7fW14CCA<a:i4 0liAOAfi4Uk2inAD04aJn 41 f AD 

C7D0iD64:gi|1269o0Dl|dpjlDAD21 849.11 {no 


A e 
4.0 


419511 


A A jiofVTen 

AA4297aO 


Hs.75113 


general transcnptlon factor IIIA 


Jl c 

4.D 




447058 


AI939450 


Ua 4CA07A 


ESTs 


Jl e 
4.0 




428218 


AA4242GO 


Hs. 123642 


EphA3 


A e 
4.0 






A AilOAnAT 

AA4o4A)f 


Uf. 04 4 DC7 


ESTs 


Jl c 
4.0 






A ACC14iinil 

AA5dl404 


nS.2o903r 


Homo sapiens cuna ru i4ij3 nSi done ma 


4.5 


449821 


AI871141 


Hs.211122 


ESTs 


A C 

4.0 




420905 


AAS21307 


Hs. 186651 


ESTs 


4.5 




439752 


T7B968 


Hs.14411 


ESTs 


4.5 




452055 


A1377431 


Hs. 141 693 


koaalkatlaal aM»n>a llf^r^lADCQ 

nypotneucai protetn mqlioooo 


4.5 


SO 


430188 


Au)49242 


Hs .234794 


Homo sapiens mRMA; CUNA OKFZp564B083 (n^ 


4.4 


423352 


A AOOiiDnO 

AA324OU0 


Ua 4fi9CTC 

HS.19357D 


ESTs 


4.4 




436042 


AVv29o971 


Lta. 4one4n 
H3.I6O6IO 


ESTs 


4.4 




452978 


AA029994 


Hs.61523 


ESTs 


4.4 




420154 


AI093155 


Ua ACA4A 

Ks.95420 


m07 ar>«ta>a 

JM27 piDtein 


4,4 




42a24a 


A A 4 9nQ4 A 


Ua 400004 


zinc finger protdn 268 


4.4 


442242 


AVo47908 


H8.90424 


> 1 1 _nAiA. n tiiMBe a— t § i 

HoflW sapiens cDNA: RJ23285 ns, done H 


4.4 




452498 


AK0OO1O1 


mJZSffw 


nypoweticai proiein ru^OASn 


4.4 




459527 


AW977556 


Hs.291735 


PCVm IAI^.a1Jab -t — 4_ nOfMkf »:Jk J^^kJIL 

ESTs, Weady similar to I78B85 sennenn 


4.3 








Ua 4AA1>I'3 

nS.1UUo4J 


tots 


Jl t 

4.0 


ou 


416312 


Wu2d40 


Hs.16247 


ESTs, WeaMy sbntlar to 2004399A cnromos 


4.3 


420111 


AA255d52 




0D3821fii1.n NCUCGAP.GC81 Homo sapiens 


4.3 




iiconic 


Afeain<t 
N091UI 


Ub Aivnn. 
nS.4073U 


CCTb 

coTS 


A 9 

4.J 




409705 


M37762 


Hs.56023 


brain-deilved neurotrophic factor 


4.3 




433433 


AIo92623 


Hs.121513 


Homo sapiens done 73>1 placento expies 


4.2 


^s 


427908 


AA417272 


Hs.24122 


ESTs 


4.2 


432765 


AJra3429 




gb*Ai003429 Selected chromosome 21 cDNA 


4.2 




413525 


BE145899 




gb:MfV)-nr0208-2212^2Q4-o10 HT020B Homo 


4.2 




420355 


AW968263 


Hs.1 23126 


ESTs 


4.2 




427521 


AW973352 


Hs.290685 


ESTs 


4.2 




458912 


Ai911056 




ESTs 


4.2 


410790 


AVVB03397 




9b:ll.24iM00794)90300H)50iAuo UM0079 Homo 


4.1 






N59D9U 


Hs. 27252 


ESTs 


Jl 4 

4.1 




434360 


AWD15415 


Hs.127780 


ESTs 


4w1 




442766 


H50733 


Hs.256261 


ESTs, Moderately simflartoALUBJIUMANA 


4.1 


75 


450597 


AI70163S 


Hs.207077 


ESTs 


4.1 


424200 


AA337221 




gb:EST41944 Endometrial tumor Homo sapie 


4w1 




445206 


AI3S0199 


Hs.269990 


ESTs 


4.1 




438431 


AW207860 


Hs.293116 


ESTs 


4.1 




448152 


AI741053 


Hs.170770 


ESTs 


4.1 


80 


436875 


AA827640 


Hs.169059 


ESTs 


4.1 


400746 






Target Exon 


4.1 




413081 


BE064415 




gb:RC4^T031 1-241 1994}12-b03BT0311 Homo 


4.1 




404967 






Target Exon 


4.0 




427203 


AW629S17 


Hs.244655 


ESTs 


4.0 
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442336 






ESTs 


%w 




455276 


68176479 




Qh!RC3-HTDSSS-16Q30[U)994i09 HTQSB5 Homo 


4.0 




452303 


AI953425 




CO 1 Of vwadNj ouiuhji n loouu ny)wui8u 


4.0 




435136 


R27299 


Hs.10172 


ESTs 


4.0 


5 


416760 


HB5182 


Uq IQI'177 

ns.19 1 J«f 




4.0 




421312 


A AD5AC77 




CO la 


I.U 




441558 




Ue 19717R 
ns.i£r irD 


pen** 

CO 19 


Ati 










CO 19 


A n 


10 


448882 


A inn I'M! 


Ue 994nA 






434222 


nr 1 19000 






0.9 










pidisd box QBiiB 5 (B'HScO Oneaso specif 


0.9 






AADepcce 


Ue I7fl999 

nS.lf 0£££ 


CD IS 


0.9 




435878 




ns.«ui9c 


CD 19 


0.9 


15 


447530 






PQTe IUlnAsra4(^ eim{1« ir(t9Vlft^»< 
Cw 1 9, nHMBMUCiy 91 1(11 Im ID Jl^&OO ydoC 


0.9 




DCiMODOU 




gD.niW£rNioj(i Nw_oiaw%,KLJr* nOmO sapcens 


9 Q 
0.9 








nSi99OJ0 


ii/pouicuCw protein ru luouo 


0>«l 




411084 


T18987 


H« 17<>d79 
ns. fc9^f4 


PCTe Mrvlpmli>lu ctmRw in KIAAflRT? rwn 
CO 1 0, tWMVfwKij siinRsi n iwvwor r piu 


4 Q 

0.9 




413629 




Uc QiRR9 


WIAAnftftn fifMlain 

iwv\uooo praioHi 


A.9 


20 


455646 






nh*Rri.RT(m 1.9d1 1QAil19WlR RTfVtl 1 Hnmn 


0.9 






nS. 1 j£7UU 


CO IS 


0.9 




425810 






CO IB 


0.9 




III 




Ue 7aRA 


(^Bckstifi hoinolDoyt S6c7 snd coited/bd 


0.9 




411479 






nh-II V*Tn914.9Q19<WJW9JM9 010914 Hnmn 


3.9 


25 






u« mfiiA 


PQTc 
CO IS 


0.0 


428342 


AI7')qi6R 




Hnmn fssmipne rHMA P 11 ^4*^ fie rlnno PI 


3.8 




431421 


AWQRQ11R 

nVf9U9 1 Id 




CO 1 a, vvaoAij ouidkm 10 uriiMinov] pioiotn 


3.8 






243052 
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AA495800 AA495737 AA010736 AA654716 AA640726 
BE17351S 6E173560 A1902860 
T79703 T96307AL079725 
AWgg8716 AW022148 N68020 

AW06Z757 AW176890 AW062758 AW176895 AW176869 AW176900 AW176897 AW176928 AW176868 AW176892 

AW854832 AW854798 AW854857 AWB54816 AW854834 AW854817 

AW753456 AW753036 AW854868 AWB54062 

AW812868 AW812746 AW812747 AW812884 AW812763 AW812722 

AW835767 AW835537 BE160187 

AWB49046 AW847956 AW849039 AW847957 AW848279 AW848698 AW849034 AW849033 
AWB54538 AW8544 1 8 AWB5441 2 

BE176479 BE176678 BE176357 BE176S50 AW8a6079 BE176676 6E176615 6E176555 BE176489 BE176610 6E176362 
BE064420 BE064435 BE064429 BE064414 BE0644DO BE084517 

BE072049 BE089471 BE069489 BE069478 6E069479 BE069492 BE069485 BE072122 BE072124 BE089491 BE069486 BE069464 

AA493662 AW8973g6 BE154814 

AI911066Ai933734AI680888AJ003599 



TABLE 66C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 dlgil numbeis In this cotumn are Genbank IdenGiier (G!) numbers. T)unham I. el el.* refers to the publication entitled The DMA sequence of 
human chromosonie 22.* Dunham L eft al, Nabie (19^ 402:48&495. 
Strand: Indicates DNA strand from which axons were predicted. 
Nt jwslfion: Indicates nucleotide positions of predicted exons. 



Pkay 


Rsf 


Strand 


Ntposition 


400533 


6981826 


Minus 


277132-277595 


400746 


7329328 


Minus 


147703-147896 


401132 


8705350 


Minus 


8567945795 


402145 


8018280 


Hus 


113088-114800 


403242 


7637817 


Minus 


11297-12511 


403481 


9965004 


Plus 


93496.93633 


403510 


7652047 


Plus 


6166642027 


403667 


6850483 


Minus 


1344-1442,1545-1697 


404003 


8655948 


Pius 


198349-199096 


404561 


9795980 


Minus 


69039-70100 


404592 


9943965 


^Anus 


3906749225 


404848 


6248647 


Minus 


23955-24034,25143-25264 


404987 


7S23744 


Minus 


89944-90729 


404995 


6006247 


Mnus 


154015-154123 


405264 


7329374 


Plus 


28556-28684 


405321 


3419846 


Minus 


44654-45210 


405348 


2914717 


Minus 


4331043462 


405510 


7630909 


Minus 


101028-101174 


405548 


1532158 


Phis 


1155M1686 



TABLE 67A: 689 GENES UP^GULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 

Table 67A Hsts about 689 genes up-regutated In prostate cancer compared to normal adult Sssues. These were selected from 9009 probesets on the Affymetrix/Eos Hu03 GeneCh^} 
array that showed some expression In human xenograft tumors and cell Ones. The 689 genes were selected such tlial the ralio of 'average' prostate cancer to 'average' nonnal adult 
tissues was greater than or equal to 2.5. The 'average* prostate cancer tevai was set to the 75th percentile amongst prostate cancers. The "average' normal adutt Ossue level was set 
to the 85th percentile amoi^t non-ma&gnant Bssues. Inonler to remove gene^pediic background levels of non-specific hyfarUlzalk)n, the 10th percentile vatue amongst non- 
malignant Gssues was subtracted from both the numerabir and the denomina^ 

Pkey: Unique Eos probeset Mentifier number 

ExAocn: Exemplar Aocesskm number, Genbank accession number 

UnigenelD: Un^ne nuniber 

Un^ene THte: Un^ne gene filte 

R1: of tumor to nonnal body tissue 
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5 

10 
15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



Pkfiy 


ExAccn 




1 hiloflnn TIDb 


R1 


448999 


AF1 79274 


Hs.22791 


tmnsmfij 1 iliranfl nnlrin unlh Ff^Jilm mvl 

UOmildlMNBiW |HWUNII Willi CVT^UW OliU 


73^ 


426747 




Me 171001; 


WglTrVmln 3 fnfftftlflta ftnaHfli* untfaafk 
KaUlNnn 0, yHWiBiB 9|KMUG BlH^Bn 




432441 


AW3Q242S 


na. ItMWi 


CO 19 


30.V 


446057 


AI42Q227 




Tfp'pS t/BAsisnl BBOflptof potsfibdl C8ti 




400302 


N48056 




Rjtalti hydfolsso (jtfostid^spflcific inonib 


48.1 


414569 






fVfwtfllA rani^M' iwinnrtnlnii imitulii 1 

piuslcUa ConcBi oaauwaicu imwibuI i 


11 A 


432240 


A)fi947fi7 


Hs 129179 


Hnmn fitmipm cDNA FlJISUSftl fis rVww PI 




419526 


At821895 


Hs 193481 


ESTs 




400298 


AAfli3797Q 




idv trancmpinhninft POilhftibil fwilinpn nf 

OW UflllSlllCMlUKUn? CpiJHiUCSI tBiuydi M 


3S7 


400299 


X07730 


Hi; 171Q(K 


ksffikroin 3t (prostflte specific sntiQBn 


3A Q 


453370 




Hs 139338 


nl r^MHOiny CoaSauBi SUOianVj \j ^wT 1 r\ 


M.9 


409731 


AA12^85 




fhwmACin Kala MantHtofl In natimKlael 

uijmosui, uci0» lucnuiKO in ncurijoiBsi 




424099 




Hs 1t933R 


nl r^NiHIUiy GaS5ollB| oUlrlainnj w \wr 1 n 




400292 




H8.72472 


RUPJIIR 


on A 


400287 




Hs iRi^m 


k^Gkiidn 2^ pfostsBc 


OAfi 


425075 


AAS06324 




ficid phosph3tes6| prostste 


M tt 
fO.9 






Uq CA41C 


stne ocuus noniBOwox (mosopniiai noinoio 


97 fl 

<r.o 


415539 


AI733fiB1 


Hs.72472 


DlVir*fMO 


97 A 


428819 


At l^iifiT^ 


ns. 1909 1«# 


nana nm/4iH>l 

i\uv\\ati) gene pnxjuci 


9117 
£9.1 


409262 


AKDDQfi31 










WOOOlO 


ns.ouiooo 


Homo sapisns cDNA FU11346 lis, clone PL 


9jI a 

£4,0 




WO 1 f 9 


14e 117101 
nS.1 1/100 


CO IS 


24.5 


400298 




Hs 139336 


ATPahlnriinfl m<Q<u>}tp RtiK>fnfnI)v P fPPTR 
m r*uuiuii(\j MiaocuOi out^iaiiiiij w 1 1\ 


93 ft 


428338 




no. lojf 04 


fnicrosGntuioproti^ni b6t3* 


99 K 


433043 


W57554 


Hs.125019 


IvfnnhnU nitrlnnr nmfpin fl AF«^\ itiRNA 

l|III^IWJU IIUWIOOl |MUU7«I \W\r'^J lill\IV\ 




403047 






MU An'sR'tfi**Mnnnfl cantone (rancfnomhrano nr 

l1IV\_UUUUOU •rHJilKJ ati^ClR} UaRMllOlnUIOllO 


93 A 


444484 


AK002126 


Hs.11260 


hvnoiheiicsi nmlfun PI J 1 1 


22JJ 


423073 


6E252922 


Hs.123119 


[WW \rnoui6i9 B^oinai ucCaponiopicS'vi ur 




407709 


AA456135 


HS.23Q23 


ESTs 


91 


401424 






lifV^^VU 1 1 f A«nUIIIU SMlJJlOlia OiyUKOOi Ij^nl 11 




448290 


AK002107 


Hs.20843 


Homo saoisns cDNA PLI1 1 245 fis. dniB PL 


19.7 


434666 


AF151103 


Hs.112259 


1 wju icccpiDr gonniki locus 


10.9 


415989 


A1267700 




ESTs 


17 Q 
If .9 


437052 


AA861697 


Hs.120591 


ESTs 


17.8 




U80456 


Hs.27311 


smgioHiinucO (Uiusopniiaj nomuiog « 


ie s 


431548 


Ai834273 


Hs.fl711 


novBi fHuunn 


1ft ft 

1D.D 




NM_004476 Hs.283946 


fotalB hydrolasB (pfostBl8'^)edfic mcmb 


ie R 


428882 


N^^000346 Hs.2316 


j^RY t^ptt itotenrinina mninn V1.1mv Q fnn 


16.5 




A!682088 


Hs.79375 


ngiuCaKXMyiasB synuieiaso (Dnwir (prop 


Ifi R 
1D.9 




AW302885 


Hs.l 17183 


PCT« 
COIS 


1ft 1 
1D.1 


1) lUsO 


U80034 


H5.68583 


niiuxTiunofiai inuanipuiaw pepuoase 


11:7 
19./ 


435677 


AAe94142 


Hs.293726 


coio, iVDduj ouiHuH ID 1 o\3n nni icoiio 


1<«3 


419743 


AW408762 


Hs.5957 


nuuiu sapiens vioiiq £'r't lo rnnnM s^uenco 


1ft 9 
19.J 


mil til, 


H20276 


Hs.31742 


cois 


le 1 

19.1 


427958 


AA41600O 


H5.98280 


IMiaaaiuiTi inwniKiuiawidiTiau mnoimoim 


1A1 


418829 


AA516531 


HS.S5999 


MK hnmnnlvw ffWnqmhlfert fiwnlkt ^ A 
nn uMiHuiMH (urasopniiajt lanuif o, n 


13Q 


426501 


AW043782 


Hs.293616 


COIS 


110 
1J.9 




AB033070 


Hs.194408 


w#v\i4*w proicni 


19 7 

1*7 


418961 


AW967646 


■Hs.23023 


ESTs 


13 3 
10.0 




AW17566S 


Hs.278695 


Homo sapiens prostdn mRNA, conqiistB ods 


19 9 

13.2 




AI620961 


H3.193465 


CCTb 


19 1 


428398 


A1249368 


Hs.98558 


ESTs 


13n 




AW207205 




CCTe 
CO IS 


19 7 


401451 






niv^w^m •nuiuo oapwns nspmocyva nucra 


19 A 


424012 


AW368377 


Hs.137569 


tumor protein 63^ k03 vrith stronQ homoloo 


l«.9 


419076 


M93119 


Hs.e9584 


insuKnoma-BssociaiBa i 


19 9 
1«>0 


450382 


AA397658 


Hs.60257 


Hnmn cimlnnQ rTlMA PI 1 1 ViQR fie <4nna PI 


11 Q 
11.9 


430376 


AW292053 


Hs.12532 


wHuiiHjauiiiB 1 upon lOaUiny Mania £ 1 


11 n 

1 1.D 


408826 


AF216077 


Hs.48376 


Homo mofsm dom HR«9 ihRNA ftPmnnnB 


11.4 


429918 


AW873986 


Hs.119383 


ESTs 


11 3 
1 1«0 


427212 


AW293849 


Hs.58279 


P<>Ta Wpakhf dmlfarhkAl 117 Ul tUAM AltIO 
CO 1 8, WcBKIj SnUUei OI/Uil/r,/1UIWnn nW 9 


11 1 

II. 1 


412446 


AI768015 




COIS 


11 1 
1 1.1 




T32982 




PQTe 
COIS 


1AO 


420757 


X7B592 


Hs.99915 


anuiugun iBcepuir (wnyoroiBSttiSwranB i 


ino 


Of 1 


NH.003528 Hs.2178 


ruxs nisiuno ituiuiy, nienwcT u 


in ft 

IU.D 


444151 


AW972917 


Hs.128749 


fl1nKa..mAfh\iff^^wl.^AA r^^^mapo 
o^jna-fuouiyiaCyrVAJn idUBinaSc 


in A 


«Wf«M4 


AI199268 


Hs.19322 


nomo sipens, oimuar to ruKcN cuNA 2010 


10.3 


418278 


AI068489 


Hs.83937 


nypDlrooca protein 


10.3 


430226 


BE24S562 


Hs.2551 


flrffBftSf^fc^ boti^^*! focoptof| suriisco 


10.2 


433927 


A1557019 


Hs.1 16467 


small nudear protein PRAC 


10.2 


408000 


L1 1690 


Hs.ig8689 


tJuBous pemphigdd antigen 1 (230/24QkO) 


10.1 


449825 


NM.014253 


odz (odd Oz/teiHn. OresophSa] homoloo 1 


10.1 


431117 


AF003S22 


Hs.250500 


delta (DresoptiDaHikel 


10.0 


439444 


A1277652 


Hs.54578 


ESTs. VVeaMy slmRar b 138022 hypotheD 


9.8 


437718 


Ai927288 


Hs.196779 


ESTs 


9.8 


440529 


AW207640 


Hs.16478 


Homo sapiens cDNA: RJ21718 fis, done C 


9.7 


453160 


A1263307 




H2B histone family, n)enil)er L 


9.7 


432715 


AA247152 


Hs.200463 


ESTs. VVsddy similar to KIAA1 074 protein 


9.7 


450325 


AI935962 


Hs.9ig73 


ESTs 


9.6 


434170 


AA626509 


Hs.122329 


ESTs 


9.6 


428600 


AW663261 


Hs.138860 


hypotheScal pratdn DKFZp434K1421 


9.6 
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447033 A)357412 K&157601 ESTs 9.5 

401747 Homo sapiens keraSn 17 (KRTIT) 9.3 

434423 NM.006769 Hs.3844 UM(iomanon)y4 9.3 

. 411887 AW1B2924 Hs.128790 ESTs 9.2 

D 457211 AW972565 Hs.32399 ESTs,VfesMyslmI]8rtoS51797v85odiIat 92 

415263 AA948033 HS.13Q853 ESTs 9.2 

432729 AK000292 H$.130732 hypoOieScal prot^ RJ20285 9.2 

421896 N62293 Hs.45107 ESTs 9.1 

432101 AI918950 Hs.123642 EphA3 ai 

10 451640 M195601 Hs.26771 Human DNA sequence bom done 747H23 on 9.0 

428046 AW812795 Hs.337534 ESTs. Moderstely similar to 138022 Itypot &9 

420218 AW958037 rfbosomal protein 14 8.8 

434926 BE543269 HS.S02S2 mitochondria) ribosomal protein 1^2 8.6 

424692 AA429834 Hs.151791 KIAA0092 gene product &6 

15 417333 AL157545 Hs.173179 bromodomabi and PHO finger conldning. 3 a6 

452594 AU076405 Hs.29981 &otidecaiTierfarrtity26(sui»eiransp a5 

446100 AW967109 Hs.13804 hypoltieficdprotBbdl462023.2 aS 

400294 N95796 Hs.278695 Homo sapiens prostelnmRNA.oanpletscds as 

416182 NM.004354 Hs.79069 cyclInG2 aS 

20 431542 H63010 Hs.5740 ESTs a4 

447397 BE247676 Hs.18442 E-1 enzyme a3 

432674 AA641092 Hs.257339 ESTs, Weakly similar to 138022 hypotheQ ai 

456660 AA909249 Ks.! 12282 soiuts canter famSy 30 (zinc transport ai 

443822 A1087412 Hs.143611 ESTs. Weakly sindar to 20043g9AchroniDs ai 

25 451684 AF216751 Hs.26813 C0A14 ai 

437124 AA554458 K1AA0666 protein aO 

416239 Ali)38450 Hs.48948 ESTs aO 

432415 T16971 Hs.289014 ESTs, We^ly similar loA43932iniJdn 2 p 7.9 

424800 AU)35588 Hs.153203 l^yoO family inhilritor 7.9 

30 428728 NM_016625 Hs.191381 hypothetical protein 7.9 

418564 AA631143 Hs.278695 Homo sapiens piosteinmRKA, complete cds 7J 

442049 AA310393 H5.190044 ESTs 7.8 

429597 NM.003816 Hs.2442 adisintegrinandmetalloproteinasedania 7.8 

410889 X91662 Hs.66744 twist (Orosophiia) homolog (acrocephaios 7.7 

35 443180 R15875 Hs.256576 ciaudin12 7.5 

433285 AW975944 Hs^37396 ESTs 7.5 

. 410870 U61599 Hs.66731 homeoboxB13 7.4 

418836 A16S54g9 Hs.161712 ESTs 7.4 

444922 AI921750 Hs.144871 Homo sapiens cONAFU13752 lis, done PL 7.3 

40 431992 NM.002742 Hs.2B91 protein kinase C. rmi 7.3 

425905 AB032959 Hs.3185B4 novel C3HC4 type Zinc finger (ring finge 7.3 

432S86 AA568548 ESTs 7.3 

431325 AW026751 Hs.5794 ESTs. WbaMy similar to 2t09260AB cell 7.3 

435047 AA454985 Hs.54973 cadheiliv4ikB protein VR20 7.3 

45 447476 6E293466 H5.20880 ESTs. Weakly similar to 138022 tvpothati 7.3 

422728 AW937826 Hs.103262 ESTs. Weakly similar to ZN9LHUIWIAN ZINC 7.2 

426108 AA622037 HS.16&468 programmed cell death 5 7.1 

433323 AA805132 Hs.159142 ESTs 7.1 

423349 AF010258 Hs.127428 homeot»xA9 7.0 

50 416815 U41514 Hs.80120 UOP-N-acetyl-elphaO^alactosaminKpQlyp aO 

425154 NI4^001851 Hs.154850 coQagen. type IX elpha 1 a9 

438869 AF07S009 gb:Homo sapiens ftiD length insailcONA a9 

418329 AW247430 Hs.84152 cystalhtonlne^ta^ynthase 6.8 

452367 U71207 Hs.29279 eyes absent (Orosophiia} homoiog 2 aO 

55 418601 AA279490 Hs.8636d calmegin aO 

429769 NiyL004917 Hs.218366 kaHikran 4 (prostase, enamel metrix. p 6.8 

436962 AW377314 Hs.5364 IKFZP564I052 protdn 6.8 

425465 L16964 Hs.1904 protein kinase C, lota ao 

450377 AB033091 KIAA1265 protein a6 

60 451418 BE387790 Hs.26369 hypoihetical proteh FU20287 &6 

418250 U29926 Hs.83918 adenosine monophosphate deaminase (Isolo a6 

416111 AA033813 Hs.79018 chromatin assembly factor l.subunlt A ( as 

426350 NM.003245 Hs.2022 transglutaminase 3 [E polypepOde, prole as 
440774 AI420611 Hs.153934 ESTs 6.5 

65 413992 W26276 Hs.136075 RNA. U2 smaO nudaar a4 
421928 AR)13758 Hs.109643 polyadenylata binding protein^nteracSn a4 
433293 AF007835 Hs.32417 hypdhetteal protein MGC4309 a4 
452340 NM_002202 Hs.505 ISL1 transcripOonfector.UIWhQnieodoma a4 
451027 AW518204 Hs.40808 ESTs a4 
70 436063 AK000028 ribosomal protein S24 a3 

440749 W22335 Hs.7392 hypoOietkal protein liAGC3199 a3 
432103 T15603 Hs.272458 protein phosphatase 3 (fonnerly 2B), cat a3 
431958 X63629 H3.2877 cadhertn3. type I.P-cadherfn (placenta a2 
411019 AW993097 Hs.48617 Homo sapiens cONAFU12S40iis. done NT a2 
75 421566 NM.000399 Hs.1395 eariygrovrth response 2 (IOox-20(DrDsop a2 
452744 AI267652 Hs.246107 Homo sapiens mRNA;cONADKFZp434E082(fr ai 
425259 AL049280 Hs.155397 Homo sapiens mRKA;cDNADKFZp564K143(ir ai 
442772 AW503680 H3.5957 Homo sapiens done 24416 mRNA sequence ai 
448045 AJ297436 Hs.20166 prostate stem ceil antigen aO 
80 428342 AI739168 Homo sapiens cONAFU13456fi5. done PL aO 

453439 AI572438 Hs.32976 guanine nudeoOde binding protdn 4 aO 
433517 AW022133 Hs.189838 ESTs aO 
424036 AA770688 H2A Mstone family, member L 19 
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420092 AAB14043 K5^045 ESTs S.9 

410268 AA3161B1 KS.6163S slxtraisniembfaneepittieaalan6genol 5.8 

42S071 NM_0139e9 H5.154424 deiodinase. lodothyronine. ^ II 5.8 

^ 441866 BE464341 K5.21201 necfin 3; OKFZP566BQ846 protein 5.8 

5 450244 AA007534 Hs.125062 ESTs 5.8 

429165 AW0098B6 Hs.118258 prostate cencersssodalBd protein 1 5.7 

451952 AL120173 Ks.301663 ESTs 5.7 

429084 AJ001443 Hs.195614 spQdn9factor3b.6utxinil3, 130kO 5.7 

445873 AA250970 Hs.251946 po!y(A).blndlng protein, cytoplasmic 1-1 5.7 

10 412935 BE267045 Hs.75064 tubutuvspecific chaperons c 5.7 

415068 Z19446 Hs.131887 ESTs. similar bT24396tiypottiea 5.7 

434826 AF155661 Hs.22265 pynjvste dehydrogenase phosphatase 5.7 

415621 AI648602 Hs^68 ESTs 5.7 

^_ 416653 AA768553 Hs.193145 metaUtonein IE (functional) 5.6 

15 433332 AI367347 Hs.44898 Honio sapiens done TCCCTA00151 mRNA sequ 5.5 

455497 AA112573 Hs.278895 Homo sapiens prosteinmRNA.conipi8tecds 5^ 

418245 AA088767 Ks.83883 trensmembrene, prostata androgen induced 5.4 

426S93 AW207440 Hs.185973 degenerative spemiatocyte (homok}g Droso 5.4 

425211 M16667 Hs.1887 progastricsin {pepsirx)gen C) 5.4 

20 456508 AAS02764 Hs.123469 ESTs. Weakly similar to AF206855 1 BM-01 5.4 

458809 AW972512 Hs.20985 stn3^x)^ed polypeptide. 30kD 5.4 

410762 AF226053 Hs.66170 HSKM^ protein 5.4 

440594 AW445167 Hs,126036 ESTs 5.4 

413326 Y15723 Hs.7529S guanyiate cyclase l.solutrte, alpha 3 &3 

25 447805 AW627932 Hs.302421 gemln4 5.3 

434808 AF155108 Hs^56150 Homo sapiens. SirnOar to RIKENcDNA 2810 &3 

453308 AW959731 Hs.323099 ESTs 5.3 

421470 R27496 Hs.1378 anr^exInAS S.3 

^_ 418555 AI417215 Hs.87159 hypothetical protein FU 12577 5.2 

30 404532 NM.022490:Homo sapiens hypothetical prol 5.1 

450861 A1523B98 Hs.17617 ESTs 5.1 

431583 AL042613 H$.262476 S-adenosytmethionine decartwxylase 1 5.1 

400303 AA2427S8 H&.7gi36 UV-1 protela estrogen regutated 5.1 

400290 H18836 Hs.31608 hypolheticd protein FU20041 5.1 

35 410037 AB020725 Hs.S8009 K1AA091 8 protein 5.1 

410240 AL157424 Hs.612B9 synapto]anin 2 5.0 

403046 NM_005656*:Homo sapiens transmemt)ranepr 5.0 

419647 AA34B947 Hs.gi816 hypothetical protein SlO 

450203 AF097994 L-kynurenine/atph^aminoadipateeminotra 5.0 

40 450164 AI239923 Hs.63931 ESTs 5.0 

417318 AW953937 Hs.240845 ESTs 4.9 

417153 X57010 Hs.81343 collagen, type i{» alpha 1 (primary oste 4.9 

412350 AI659306 Hs.73826 protein tyrosine phosphatase, non^aoept 4.9 

456088 BE177320 Hs.156148 hypoths6cdl protein FU 13231 4.9 

45 433852 AI378329 1^.126629 ESTs 4.9 

452679 Z42367 Hs.83883 transmembrane, prostate androgen induced 4.9 

430387 AW372884 Hs.240770 nuctearcap binding protein subunit 2, 2 4.9 

416276 U41060 Hs.79136 UV-1 prot^ estrogen regulslled 4.8 

441021 AW8716 Hs.7644 HI histon6fanlly.member2 4.8 

50 453171 R76472 Hs.65646 ESTs 48 

416795 AI497778 Hs.20509 HBV pX associated protein-S 4.6 

427615 BE410107 Hs.179817 CGI^ protein, PSW^I 4.8 

419440 AB020689 Hs.90419 K1AA0882 protein 4.8 

422048 NM.012445 H3.288126 spondin 2. extraceButarmatitx protein 4.8 

55 453082 H18835 H$.31608 hypothe&a) protein FU20041 4.8 

420948 AB016898 Hs.100469 myeloid/lymphoid or mixed-fineage leukem 4.8 

442592 6E566623 Ks.2989g ESTs, Weakly similar to G02075transcrip 4.8 

421685 AF1B9723 Hs.106778 ATPase. Ca transporting, type 2C, member 4.7 

^_ 431724 AA514535 Hs.283704 ESTs 4.7 

60 423242 AU)39402 Hs.125783 OEME-6 protein 4.7 

434485 AI623511 Hs.118567 ESTs 4.7 

407103 AA424681 Hs.256301 hypothetical prot^ MGC1 31 70 4.6 

414085 AA114016 Hs.75746 aldehyde dehydrogenase 1 family, member 4.6 

^_ 422486 BE5144g2 Hs.117487 gene near HD on 4p16.3 with homok)gy to 4.6 

65 422083 NM.001141 Hs.1112S6 arachkkmate 15>4»xygenase. second ^ 4.6 

439735 AI635388 Hs.142848 hypoih^ protein 4.6 

420039 NIA.004605 Hs.94581 sulfbtransferase family. cytosoOc, 2B. 4.6 

449845 AWg71183 Hs.6019 Dn8J(H5p40) homolog. subfamily Cmembe 4.5 

423445 NM.014324 Hs.128749 alpha^thylacyt^ racemase 4.5 

70 433730 AK002135 Hs.3542 hypotheficdl protein FU11273 4.5 
426006 R4g031 HS.22627 ESTs 4.5 
451982 F13036 Hs.27373 Homo sapiens mRNA; cONA DKFZpS6401763 (f 45 
415082 AA160000 Hs.137396 ESTs. Weakly simSar to JC5238galaclosy 4.5 
409799 D11928 Hs.76845 phosphosettne phosphatasa-Ske 4.5 
75 423583 AL122055 Hs.129836 KIAA1028 protein 4.4 
452017 AF109302 Hs.27495 prostate cancer assodaled protein 7 4.4 
437162 AW005505 Hs.5464 Oiyroidhonnone receptor ooacQvalingpr 4.4 
451468 AW503398 Hs.^63 ESTs. Moderately similar to 138022 tvpol 4.4 
429467 NM.004477 H$.203772 FSHDregtongenel 4.4 
80 451752 AB032997 KIAA1 171 protein 4.3 

430294 AI538Z26 Hs. 32976 guanine nudeoDdebMigpnM 4 4.3 
452259 AA317439 Ks.28707 signal sequence receptor, gamma (transio 4.3 
456382 AW973003 Hs.179909 hypothetical protah FU22g9S 43 
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449048 Z4S051 HsuOSZO sbnOar to S68401 (catUe)gluoosa indue 4.3 

440300 N39760 H5.8859 Homo sapiens. SbnBar to RIKENcONA 5830 4.3 

427308 026067 Hs.174905 K1AA0033 protein 4.3 

- 43570& W31254 Ks.7045 GU)04 protein 4.3 

5 416854 H40184 Hs.80296 Purtdnje ceQ protein 4 4.2 

451406 Affi94320 Hs.6295 EST8.W^8lniBartoT17248hypoth8tl 4.2 

410076 T05387 Hs.7991 ESTs 4.2 

410275 U85658 H5.61796 transcrlpSon factor AP-2 gamma (acGval 4.2 

419239 AA468183 Hs.184598 HonH) sapiens cOHA:FU23241 Us. done C 4.2 

10 442501 AA315267 Hs.23128 ESTs 4.2 

436761 A1817776 H5.236557 ESTs 4.2 

420380 AA640891 Hs.102406 ESTs 4.2 

436556 AI364997 Hs.7S72 ESTs 4.2 

458440 Ai095468 Hs.1352S4 Homo sapiens done 1 thrombospondinmRNA 4.2 

IS 446416 AV658299 Hs.163959 ESTs 4.2 

417873 BE266659 HS.2936S9 Homo sapiens. Sin^ to raKENcONAA430 4.2 

430598 AK001764 Hs.247112 hypotheticd protein FU 10902 4.1 

421305 BE397354 Hs.324830 dtptheria toxin resistance protdnrequi 4.1 

429299 AI620483 H5.347408 hypothetical protein MGC1 3102 4.1 

20 407894 AJ278313 Hs.41143 phcspha'nositid&^pediicphosphollpasa 4.1 

414664 AA587775 Hs.66295 multi-POZ-domairKontdining protein 4.1 

408418 AVV963897 K5.44743 ICIAA1435 protein 4.1 

430945 U80669 Hs.55999 NKhomeotx}x(Orosoph9a).l3mily3,A 4.1 

432363 AA534489 gb:nf76g11.s1 Na_CGAP_CD3 Homo sapiens 4.1 

25 4*7574 AF162666 Hs.18895 totisled^ike Idnase 1 4.0 

447595 AW379130 Hs.18953 phosphodiesterase 9A 4.0 

453857 AL080235 Hs.35861 DKFZP586E1 621 protein 4.0 

444108 R55784 H8.140942 ESTs 4.0 

422690 Z43784 ankyiln3.nodeof Ranvier(8nl(yrinG) 4.0 

30 417379 AA196390 gb:zpg9b10.s1 Stratagenennisde 937209 H 4.0 

418818 AA226899 Hs.101307 Homo sapiens HUTU protein mRNA,p3r«at 4.0 

410102 AW248508 Hs.279727 ESTs; homologue of PEN4^ (dona savignyi 4.0 

418821 AA436002 Hs.183161 ESTs 4.0 

438825 BE327427 Hs.79953 ' ESTs 4.0 

35 433233 AB040927 H5.301604 KIAA1 494 protein 4.0 

415276 U88666 Hs.78353 SFRS proteinldnase 2 4.0 

440146 AW014231 Ks.S07SO Homo salens cDNA: FU22930fis. done K 4.0 

407819 R42185 ESTs 4.0 

450402 BE218027 Ks.89969 ESTs 4.0 

40 432527 AW975028 Hs.102754 ESTs 4.0 

419733 AW362955 Homo sapiens cONAFU14415 lis. done HE 3.9 

422134 AW179019 Hs.112110 mitochondrial ribosomai protein L42 3.9 

432435 6E218686 Hs.28207D ESTs 3.9 

450680 AF131784 Hs.25316 Homo sapiens done 25194 mRNA sequence 3.9 

45 436420 AA443986 H3.31595 ESTs 3.9 

430122 NM.013342 Hs.233765 TCF3(E2A) fusion partner (in ddUhood 3.9 

411031 W37943 H3.34892 KIAA1 323 protein 3.9 

429259 AA420450 Hs.292911 Ratophilm 3.9 

407613 AL120247 Hs.40109 KiAA0e72 protein 3.9 

50 417067 AJ001417 Hs.81086 solute canier family 22 (extraneurond 19 

410227 AB009284 Hs.61152 exostoses (muifiple)^ 2 ' 3.9 

417958 AA767382 Hs.193417 ESTs 3.9 

427176 AW381569 Hs.40334 ESTs 19 

448826 Ai580252 Hs.293246 ESTs, Weaidy similar to putative pi 50 [H 3.8 

55 447966 AA340605 Hs.105887 ESTs, Wealdy similar to Homdog of rat Z 3.8 

432675 AI791855 ESTs 18 

419713 AW9680S8 Hs.92381 nudix (nucleoside diphosphate llnl(ed mol 3.8 

427479 BE410092 Hs.17B471 KiAA079Bgeneprodud 3.8 

^. 443162 T49951 Ks.9029 Di<FZP434G032 protdn 18 

OO 413950 AA249098 H3.32793 ESTs 3.6 

439983 AW247529 Hs.6793 platdd^valing fxtor acetylhydroia 3.B 

419083 AM79S60 H8.98613 Homo sapiens cDNAFU12292fis, done MA 3.8 

448148 NM.016578 Hs.20509 HBVpXassodated protein^ 3.8 

419465 AW500239 Hs.21167 Homo sapiens GONA:aJ23068fis. done I 3.8 

65 432274 AK000382 Hs.274251 hypotheti6dpratolnFU20375:KlAA1797 18 

447620 AW290951 ESTs 16 

430589 AJ002744 Hs.246315 UDP-N-acetyMpha-D^aidCtosaminerpolyp 16 

418004 LI37519 Hs.67539 aldehyda dehydrogenase 3 family, member 18 

^. 416655 AW968613 Hs.79428 Ba2MenovirvsE1B19M>MerBdingpro 18 

70 459284 AF1S5660 H8.300496 rnKodiondridsdule carrier 18 

421829 AB018330 Hs.108708 cflidumfcatmodulinKlependenl protein Un 17 

437252 A1433833 Hs.1S4159 ESTs. Weakly similar to AIU1_HUMAN ALUS 17 

437967 BE277414 H8.5947 mel transfaming oncogene (derived from 17 

„. 417061 AI675944 Hs.188691 Homo8apien5cDNAFU12033fis.doneHE 17 

75 423337 Nik4.004655 Hs.127337 axm2(oondudin.axil) 17 

418838 AW385224 Hs.35198 edonudeolidepyrophosphatosa/phospbodi 17 

453469 AB014533 Hs.33010 KIAA0633 protein 17 

408063 BE086548 Hs.42346 caidneurin-btnding protein cdsardn-1 17 

422072 AB01825S Hs.111138 KiAA0712geneprodud 17 

80 420297 AI626272 Hs.88323 ESTs. WeaktyslmBar to AIUIJIUMAN ALUS 17 

420522 AW957137 Hs.98541 hypotheOcd protein 16 

408833 AW612232 H8.254835 ESTs U 

426647 AA243464 H3.294101 pi&eceBteukenilalransoipltonfador 16 
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456177 Nlit012391 Hs.79414 piosUte epOheQuHMpecific Eb transcr 3.6 

425689 Wt6460 H924283 ESTs.iyfaieia! t » si f nila rtorad«adaxy 3l6 

432370 AA308334 Hs.274424 N-ac6lytimursm!nic acid phosphate synOia 3u6 

- 422424 AI185431 K3^6638 prostate (fiRerenOation factor 3^6 

5 448157 BEZ70828 Hs.131740 Homo sapiens cDNA:aJ22S62fis. done H X6 

433339 AF01S226 HS.B036 g^oUastomaoverexpressed 3.6 

421887 AW161450 K5.109201 CGI^pRM 3.6 

435114 AA775483 Hs^68338 niitochondn3lnt)OSQmalpnlehL9 3.6 

412520 AA442324 Hs.795 H2AhIstono family, member 0 3.6 

10 436476 AA326108 H5.33829 bHm protein DEC2 3.6 

441224 AUa76964 Hs.7753 catumenhi 3.5 

404922 NM.003071:Homo sapiens SWI/SNF related, 3.5 

43227B AL137506 Hs.274256 hypothetical protein FU23563 3.5 

452747 BE153855 Hs.61460 iQsupeffamity receptor mi R 3.5 

15 429686 AI671613 Hs.28538 Homo sapiens cDNA:RJ21086 (Is, done C 3.5 

408001 AA04S458 Hs.95296 ESTs aS 

428330 L2^24 Hs.2256 matrix metalloproteinasa 7 (matrilysin, ^5 

418113 AI272141 Hs.83484 SRY(sexd6temiinlngfe^nYH)ox4 3l5 

428055 AA420564 Hs.101760 ESTs 3.5 

20 418827 BE327311 Hs.47166 KT021 3.5 

431103 M57399 Hs.44 plelotrophin (heparin binding growth tac 3.5 

427701 AM1 1101 Ms.243886 nudear auloanligenlc sperm protein (his 3.5 

419168 AI336132 Hs.33718 Homo sapiens cDNAFU12641fis. dona KT 3.5 

409151 AA306t05 SEC22. vesicle trHfTtddng protein (S.c 3.5 

25 439671 AW162840 Hs.6641 lunesin family member 5C 3.4 

452721 AJ269529 Hs.301871 solute canier family 37 (glycerol^ho 3.4 

449933 AW157098 Hs.324104 Human DMA sequence from done RP1-63M2 o 3.4 

432815 AA557191 HS.S5028 ESTs, Vlfea»y similar to 154374 gene NF2 3.4 

443884 N20617 Hs, 194397 lepSn receptor 3.4 

30 403752 NM_002753*:Homo sapiens mitogefraciivate 3.4 

427723 A)355260 Hs.279789 histone deacetylase 3 3.4 

421878 AA299652 Ks.111496 Homo sapiens cDNARJ1 1643 (is. done HE 3.A 

440494 BE618768 Hs.7232 aoetyK^oenzymeAcartxixylasealpha a4 

426716 NM.006379 K$.171921 sema domain. Immunoglobulin (fomain(lg), a4 

35 433647 AA603367 Hs.222294 ESTs 3.4 

407137 T97307 gb:ye53h05.s1 Soares fetal fiver spleen 3.4 

432886 BE159028 Hs.279704 diromatin accesslbifity complex 1 ^4 

427715 BE245274 Hs.180428 MAAII 81 protein 3.3 

437617 AI026701 Hs.5716 KIAA0310 gene product 3.3 

40 447818 W79940 Hs.21908 Homo sapiens done 24670 mRNA sequence 3.3 

414407 AA147026 Hs.76704 ESTs 3.3 

421537 BE383488 Hs.105547 neural proliferation, difterentiation an 3.3 

435655 AW105663 Hs.6947 HSPC069 protein 3.3 

427871 AW992405 HS.S9622 Homo sapiens, done IMAGE:3507281.mRNA. 3.3 

45 421662 NM.014141 Hs.108552 cell recogniOon molecule C8spr2 3.3 

407204 R41933 Hs.140237 ESTs. Weakly slmnarbALU1_HUMAN ALUS 3.3 

407908 BE379758 Hs.110853 uncharactertzed hematopoietic stem/|)n9ge 3.3 

407846 AA426202 Hs.40403 Cbp/pSOO^nteradingtransacthrator.wit 3.3 

420568 F09247 Hs.247735 pro^herln alpha 10 3.3 

50 436873 N23874 Hs.50477 RAB27A. member RAS oncogene family 3.3 

411145 6E439553 H$.2S0528 Komo sapiens, done IMAG£:4098694.mRNA. 3.3 

435017 AA336522 Hs.12854 angiotensin II. type I receptor-assodat 3.3 

435020 AV^5076 Hs.301855 DiGeorge syndrome critical region gene 8 3.3 

443991 NM.0022SO Ks.10082 potassium Intemiediate/^ conductance 3.3 

55 409960 BE261944 hexoMnasel 3.3 

433891 AA613792 gbmS7h03.8l Na_CGAPJ>r2 Homo sapiens 3.3 

430389 AL117429 Hs.240845 DKFZP434D146 protein 3.3 

432242 AW022715 Hs.162160 ESTs. WeaMy similar to ALU4jnMAN ALUS 3.3 

419972 AL041465 Hs.182982 gdgin^7 3.3 

60 445707 AI248720 Hs.114390 ESTs 3.3 

412628 AI972402 Hs.306051 bypotheticd protein M6C2648 3.3 

425263 NM.001197 Hs.155419 BCU-lnteiBding UDer (apoptosis^nduc 3.3 

419879 Z17805 Hs.93564 Homer, neuronal immediate early gene. 2 3.3 

433345 At681545 Hs.152982 liypothetical protein Fyi31 17 3.3 

65 417515 L24203 Hs.82237 fllaxla^elangiedasia group [>essodated 13 

414922 D00723 glydne cleavage system protein H {amino 3.3 

437898 W81260 H5.43410 ESTs U 

426126 AL1 18747 Hs.26691 ESTs 12 

422089 AA523172 Ks.103135 ESTs.WeaidysimDarioSFR4_HUMAN SPUC 12 

70 450649 NM.001429 H5.25272 ElAUndlng protein p300 12 

404210 NM-005936:Homo sa|^ myelold/lymphotd 12 

434954 AF161455 Hs.284295 Homo sapiens HSPC337 mRNA. partial cds , 12 

401519 C15000476*:gi|l2737279|rBf|XP_012163.1| 12 

419517 AF052107 Hs.90797 Homo sapiens done 23620 mRNA sequence 12 

75 431585 6E242803 Hs.262823 hypothetica] protein FU10326 12 

427461 AA531527 H$.332040 hypothetical protein MGC1 3010 12 

453288 AW583292 Hs.274412 similar to yeaslUpO. variant A 12 

412513 AA322599 Hs.5163 ESTs. Weakly similar to AF1 51 840 1 CGI-S 12 

432579 AF043244 Ks.278439 nudeolar protein 3 (apoptosis repressor 12 

80 439593 BE073597 H5.124863 ESTs 12 

451945 BE504055 Hs.211420 ESTs 12 

434614 AI2495D2 K53869 ESTs 12 

445525 BE149866 Hs.14831 Homo sapiens. SbrSar to zinc finger pro 12 
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419991 AJ000098 Ks.942tO eyes absent (OrosQphfla) homobg 1 3.2 

403101 AW966504 Hs.123073 COC2-r6lated protein Unase 7 3.2 

425810 A1923627 Hs.31903 ESTs U 

f 443123 AA094538 H5.Z72808 putaSve transcripfon regutaiion nudea 3.2 

J 447439 AA3t35&5 Ks.145020 ESTs. Weakly similar to KtAA1 205 protein ' 3.2 

452167 N75238 Ks.1307S Homo sapiens cDNA:FU23013 its. done L 11 

440789 AB007857 Hs.7416 KIAA0397 gene product 11 

436087 BE300296 K5i054 CQ-1 33 protein 11 

414222 Al.135173 sorbitol dehydrogenase 11 

10 419749 X73608 Ks.93029 sparc/osteonecSn. cwcv and kazaMike d 11 

418559 AA225048 Hs.104207 ESTs 11 

408784 AW971350 Hs.63386 ESTs 11 

424285 BE207168 Ks.144630 nudaarreceptorsubfamilyZgroupF.m 11 

417193 A1922189 Hs.288390 hypothetical protein FU22795 11 

15 428738 NI\4.O0O38O Hs.192803 xeroderma pigmentosum, comptementalnn g 11 

436469 AJ272269 Hs.121429 zim^bindtno protein Rboc728 11 

433008 BE242758 Hs.190223 ESTs. Moderately similar to T29285hypot 11 

426991 AK001536 Homo sapiens cDNA FU10674 fis. ctone NT 11 

445943 AW898533 H8.181574 ESTs 11 

20 434194 AF119847 KomosapiensPROISSOmRNA. partial cds 11 

432426 AW973152 Ks.31060 ESTs 11 

416974 AF010233 Hs.80667 RAUPI associated Eps domain oontdning 11 

451131 AI267586 Hs.268012 (atty-add-CoenzymeAiigase.kmg^hain 11 

423453 AW450737 Hs.128791 CGW9 protein 11 

25 417215 BE253161 Ks.81687 non^T^etastatk: cells 1 protein expresse 11 

438278 BE398290 Hs.5097 synaptogyrin 2 11 

^2908 AI861895 ESTs 11 

•43^2 AA813887 H$.188173 Homo sapiens cONARJ12187Gs. done MA 11 

438705 AI049624 Hs.2833g0 ESTs. Weakly similar to 2109260AB cell 11 

30 427982 NIA_016156 Hs.181326 K1AA1 073 protein 11 

431578 AB037759 H5.261S87 GCN2elF2aipha kinase 11 

409757 NM.001898 Hs.123114 cyslaUnSN 11 

407136 T64896 Hsi87420 Homo sapiens cDNAFU 11 533 fis. dona HE 11 

434293 NM.004445 Hs;J798 EphB6 11 

35 410001 AB041035 Hs57771 kanikrdnll 11 

434970 AW272262 Hs.225767 ESTs 11 

432800 BE391046 Hs278962 AlM-1 protein 11 

423392 AA195037 Hs.169341 HTPAPptotein 11 

432205 AIB065B3 Hs.l2529t ESTs 11 

40 448807 AI571940 HsJ549 ESTs 11 

439979 AW600291 Hs.6823 hypolhetteal protein FU 10430 11 

435561 AA351978 Hs.4943 hepatocellular carcinoma assodatedprot 10 

427315 AA179949 Hs.175563 Homo sapiens mRNA:cONADKFZp564N0763(f 3.0 

410082 AA081594 Hs.158311 Musashi(DrDsophlla)homQlog1 3.0 

45 432621 AI2g8S01 H3.21192 ESTs, Weakly similar to T46428hypo»ietI 10 

429638 AI916662 Hs.211577 kinectin 1 (klnesin receptor) 10 

452908 AB001451 Hs.30g65 neuronal She adaptor homolog 10 

426030 BE243933 Hs.108642 zinc finger protein 22 (KOX 15} 10 

407688 W25317 Hs.37616 Human I}9spBce variant BmRNA. complete 10 

50 434958 T99949 Hs.30342B Homo sapiens cDNAFU14e32fis. done OV 10 

418610 AW245993 Hs,223394 hypothetical protein MGC2742 10 

451815 AW974911 H5.184793 Homo sapiens cDNA: FU21880 65, done H 10 

433556 W56321 Ks.111460 cahdunn/balmodi^ependent protein Un 10 

413010 AA393273 Hs.75133 tianscriptkm factor 64ike 1 (mitodwnd 10 

55 449459 BE546846 Hs.195048 ESTs 10 

438523 H66220 Hs.276177 ESTs 10 

434263 N34895 Hs.44648 ESTs 10 

446825 BE266822 Hs.344097 filamln A, alpha (actirvblnding protein- 10 

408681 AW953853 Hs.292833 ESTs. Weakly dmiiar to 138022 hypotheti 10 

OO 429966 BE081342 Hs.283037 HSPC039protdn 10 

412652 AI801777 ESTs 10 

442409 BE208843 Hs.129544 hypothetical protein MGC1 5438 10 

450832 AW970602 Hs.105421 ESTs 10 

^_ 440074 AA863045 Hs.10669 ESTs, Weakly similar to TOOOSOhypotheQ 10 

05 436032 AA150797 Hs.109276 latexin protein 10 

419594 AA013051 Hs.91417 topoisomerase (DNA) 11 binding protein 3.0 

421674 T10707 Ks.2g6355 hypothetfeal protein FU23138 2.9 

432302 AA345857 Hs.274307 KIAA1442 protein 19 

415172 AF079529 KS.7B106 phosphodiesterase 8B 29 

70 412928 AI879076 H5.7S061 macrophsge nryristoylated a!anin&^ C Z9 

413142 M81740 Hs.75212 ornithine decarboxylase 1 2.9 

437179 AA393508 serotogteaOy defined coloncanceranOg 2.9 

418372 AA311833 H5.84318 repHcatkin protein A1 (70kD) ZS 

439609 AW971945 Hs.293236 ESTs 29 

75 427398 AW390020 H5.20415 diromosome 21 open reading (rams 11 19 

447958 AW796524 Hs.68644 Homo sapiens microsomal sfgnd peptidase 29 

409340 BE174629 HsJ21130 hypolhelkd protein MGC2771 29 

420061 AW024937 Hs^9410 ESTs 29 

431663 NM.016569 H5.267182 TBX3^ protein 29 

oO 417622 AW29B163 Hs.82318 WAS protein fandly.meniber 3 29 

420344 BE463721 Hs.97101 putaGve 6 protobKOupled receptor 29 

446791 AI632278 Hs.19S922 ESTs 29 

433313 W20128 Hs.296039 ESTs 29 
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428465 AW970976 Hs.293S53 ESTs 2.9 

457489 AI693815 Hs.127179 oyptegene 19 

452092 BE245374 Hs.27842 hypotheOca! protein FU 11 210 19 

446880 AI811807 H3.10B846 Homo sapiens cDNAFUl 4934 fis. done PL 19 

5 419829 Ai824228 Hs.115185 ESTs, Moderalely simitar (oPC4K9fsn! 19 

437396 BE140396 Ks.21621 hypotheftca! protein DKFZp762O076 19 

413125 BE244589 H5.752a7 glyoxaiase) 19 

401785 NM.002275':Hon» sapiens keralln 15 (KRTl 19 

433495 AW373784 Hs,71 a!pha.2^Jycoprotein1,33nc 19 

10 452099 BE612992 Hs.27931 hypothetical protein FU 10807 8lni3ar to 19 

437296 AA3S0994 Hs.20281 KIAA1700 19 

421254 AK001724 Hs.102950 ooal protein gamnocop 19 

445109 AF039916 Hs.12330 ectonudeoside triphosphate diphosphohyd 19 

423551 AA327598 Hs.89633 ESTs 19 

IS 421154 AA284333 Hs.2B7631 Homo sapiens cONAFU14269 lis, done PL 19 

410193 AJ132S92 Hs.59757 zinc finger protein 281 19 

435854 AJ278120 HS.499S putative ankyriiwepeat containing prote 19 

423396 A)382555 Hs.1279S0 bramodonnain-contBinIng 1 18 

442202 BE272862 Hs.106534 hypothetical protein FU22625 16 

20 441345 AW06e57g Hs.7780 Homo sapiens mRNA; cDNA DKFZp564A072 (fr 18 

444387 H54892 Hs.10974 hypothetical protein FU22390 18 

422522 AI023428 H5.34549 ESTs. H^hly similar to S94541 1 done 4 18 

443337 Y07604 H3.9235 nofHtietastatlc celts 4, protein expresse 18 

418166 AI7544t6 Cdc42 effector protein 3 18 

25 448734 BE614070 Hs.326416 Homo sapiens mRNA; cDNADKFZp564H1916(f 18 

413550 W03011 Ks.306881 MSTP043 protein 18 

426170 6E161065 Hs.167531 m6thylcrotonoyi-CoenzymeAcaitoxyIase2 18 

444101 R19175 Hs.169793 ritnsomd protein U2 18 

425320 U29344 Hs.83190 fatly add synthase 18 

30 431631 AA548906 Hs.122244 ESTs 18 

448804 AW512213 Hs.342849 AOP-ril)Osylatlonfedor-inw5 18 

431416 AA532718 ESTs 18 

447881 . BE620886 GCN1 (generd control of amino-addsynt 18 

445309 AL157474 Hs.12504 lilcelyorthdog of mouse Ariiadta 18 

35 453186 AK001708 Hs.32271 hypothetic^ protein FU10846 18 

439778 AL109729 Ks.g9364 put^ transmenfbrans protdn 18 

425010 T16837 H3.4241 ESTs 18 

432840 AK001403 Hs^79521 hypothetical protein FU20530 18 

45054S AA010200 Hs.175551 ESTs 17 

40 431674 AAOg8901 Hs.301642 &protBln counted receptor 17 

422017 NM_003877 Hs,110776 STAT Induced STAT lnhib«or-2 17 

4431B1 AI039201 Hs.283316 ESTs 17 

448913 AA194422 Hs.22564 myo^VI 17 

440193 AW902312 Hs.7037 Homo sapiens done 24923 mRNA sequence 17 

45 452941 AL1 10347 Ks.31074 N-sulfogtucosamine sutfohydidase (8uI£b 17 

451338 AW612322 Hs.19131 transcription factor Dp-2(E2Fdimedzdl 17 

411190 AA306342 Hs.69171 protein idnase Ofike 2 17 

434829 AA789081 Hs.4029 gOoms^mpniied 5e(iuence41 17 

407192 AA60g200 gb:an2e02^1 So3resJesQs_NKr Homo sap 17 

50 434747 AA837085 ESTs 17 

428171 AA489323 Hs,182825 ribosomat protein L35 17 

426657 NhL015865 Hs.m73t solute carrier famfly 14 (uroa transport 17 

447147 AA910353 H5.7S432 EST6.W^8lmQartoT23482hypothea 17 

452955 AW390282 H$31130 trsnsmen4)rane78uperfamilymeml}er2 2.7 

55 407748 AL079409 Hs.38176 KIAA0606 protein; SCN Orcadian Osdlat 17 

445596 RB9543 Hs.12942 veslde tiaftiddng protein 17 

452268 NM.003512 Hs.26777 K2A histone famBy. memt)er L 17 

446494 AA463276 Hs.288906 WW Domab<>mtaining Gens 2.7 

438157 AW137011 Hs.49576 ESTs 17 

60 452260 AA453208 H$.330994 RAB9. member RAS oncogene femily 2.7 

409648 AW451449 Hs.57749 ESTs 17 

401866 Target Exon 17 

400301 X03635 Hs.1657 estrogen receptor 1 17 

453390 AAB62498 Hs.28482 ESTs 17 

65 426514 BE616633 Hs.170195 bone morphogenetic protdn 7 (osteogenic 17 

42S294 AFD33827 Hs.1S5553 HNK-t sulfotranslerase 17 

405387 NM.022170*;Homo sapiens Wniiams-Beuren 17 

441266 H15966 Hs.293845 Homo sapiens, done IMAGE:3502329, mRNA, 17 

432388 X15218 Hs.2969 v-sid avian sarcoma viral oncogene homd 17 

70 450937 R49131 Hs.26267 ATP-dependant interferon resporBeprotel 17 

447210 AF035269 Hs.17752 phosphaOdylserine^pedficphosphafipas 17 

445098 AL0S0272 Hs.12305 OKFZP566B1 83 protein 2.7 

451404 AA460775 HS.629S ESTs. WeaMy similar to T17248hypolheU 17 

409550 T08490 Hs.288969 HSCARG protein 17 

75 452707 AI093823 Hs.4507O ESTs 17 

451134 AA318315 HS.2S999 hypothetical protein FU22195 17 

452826 BE245286 Hs.301638 peroxisomal biogenesis factor 6 17 

420816 AW989635 Hs.33032 ESTs. Weakly similar to 1207289A revere 17 

438510 AL080220 Hs.6285 OKFZP586P0123 protein 17 

80 406627 T64904 Ks.163780 ESTs 17 

442315 AA173992 Ha.7956 ESTs. Moderately sbnOsr to ZN9LHUMANZ 17 

421717 AF230924 Hs.107187 divdent cation tolersnt protein CUTA 16 

42S910 AA830797 Hs.184760 (mi-box^Ming transorifrtion factor 16 
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440695 AWD88363 Hs.246240 ESTs 18 

416941 BE000150 Hs.48778 niban protein 2.6 

422717 Ai557623 Ks.1 19475 cold inducOite RNMiindlng proteki 16 

426110 NM.002913 K5.166563 repQcalion factor C (activator 1) 1 (14 16 

5 422027 AL043100 Hs.306319 fatty add amida hydrolase 16 

401197 ENSP00000229283-'i«PC2a 16 

442607 AAS07576 Ks.2e8361 Homo sapiens cONA:FU22696 lis. donsH 16 

410297 AA14B710 lumlcan 16 

^_ 447963 A)4S2973 Hs.165900 ESTs. We^slmnar to ALU1 HUMAN ALU S 16 

10 433101 AV\^2317 Hs.12082 Homo sapiens mRNA; cONADKFZpS66L203(fr 16 

437546 AW074836 Hs.173984 T-boxl 16 

443725 AW245680 Hs.9701 growth arrest end DNAdamago^nducible. 16 

436213 AA325512 Hs.71472 hypothetica] protein FU10774; KIAA1709 16 

408157 AA047685 Hs.62946 ESTs 16 

15 420805 L10333 Hs.99947 leliculonl 16 

454144 BE280478 Hs.182695 hypothetical protein MGC3243 16 

426761 AID15709 H5.172089 Homo sapiens mRNA; cONAOKFZp586l2022(f 16 

428695 AI355647 Hs.189999 puriner^ receptor (family A group 5) 16 

431725 X65724 Hs.2839 Norne disease (pseudofllioma) 16 

20 425174 D87450 Hs.154978 KtAA0261 protein 16 

410467 AF102546 H&63931 dachshund (Dmsophila) homotog 2.6 

413435 X51405 Hs.75360 caiboxypeptidase E 16 

428293 BE250944 Hs.183558 solute canter family Uneutratarrtra a 16 

428180 Ai129767 Hs.182874 guanine nucleotide l>lndii)g protein (G pr 16 

25 439680 AW245741 Hs.58461 ESTs, Weakly simitar to A3S659krueppel- 16 

424395 AA165082 H5.146388 mbotubule-assodated protein 7 16 

406670 W79632 Hs.2S6301 hypotheficd protein MGC1 31 70 16 

413762 AW411479 Hs.848 FK5064]inding protein 4 (SSkO) 16 

429922 Z97630 Hs.226117 H1 histone family, member 0 16 

30 449518 BE395253 Hs.30861 ESTs 16 

445919 T53519 Hs.334692 hypothetical protein MGC14141 16 

429343 AK000785 H5.199480 Homo saj^ns. SlmOar to opsin 3, cbne 16 

438552 AJ245820 Hs.6314 type I transmenrOirane receptor (seizures 2.6 

431472 AK001Q23 H5.256549 nucleotide binding protein 2 (E. coll Min 16 

35 409299 AA04S650 Hs.53125 smaQ nudear ribonudeoprotein 02 polyp 16 

434672 AW294020 Hs.117721 ESTs 16 

443015 1^33261 Hs.6614 ESTs. Weakly sindar to A43932 mucin 2 p 16 

452576 AB023177 Hs.29900 KtAA0960 protein 16 

412843 AF007555 Hs.74624 protein tyrosine phosphatase, receptor 1 16 

40 430375 AW371048 Hs.93758 H4 hlstone family, member H 16 

408196 AL03454e Hs.43627 Sf^ (sex determining region YHxuc 22 16 

424339 6E257148 endoglycan 16 

416292 AA179233 Hs.42390 nasophaiyngeelGareinoma susceptibility • 16 

451040 AA324743 Hs.40808 ESTs 16 

45 445636 AW105401 ribosomal protein 129 16 

419175 AW270037 KIAA0779 protein 16 

422000 M30S99 Hs.110637 homeoboxAlO 16 

44112B AA5702S6 ESTs. VfeaUy similar to T23273hypotheG 16 

446237 AW270S15 Hs.149596 Homo sapiens. Similar to RIKENcONA 2310 16 

50 431797 6E169&41 Hs.270134 hypothetk^l protein FU20280 15 

406789 AI041403 ribosomai protein 1^9 15 

423217 NM.000094 Hs.1640 collagen, type VU, alpha 1 (epidermolys 15 

423115 AA421973 Hs.169119 ESTs. similar to T25731 hypotheO 15 

420460 AA262331 Hs.48376 Homo sapiens ctoneKB-2 mRNA sequence IS 

55 436165 AI373544 Hs.331328 Intermediate flianent protein synooiHn 15 

430542 Ai557486 Hs.119122 ribosomal protein L1 3a 15 

452827 AI571635 Hs.55468 ESTs 15 

431609 AW792792 Hs.264330 N-acy!sphtegoslne8middiydrolase(addc 15 

431108 AA991508 Hs.l05317 ESTs 15 

60 410310 J02931 Hs.62192 coagulaGon factor III (thromboplastin. 15 

430333 S70114 Hs.239469 T1A1 cytotoxic granule-assodaled RN^bi 15 

439864 AI720078 Hs.291997 ESTs, Weakly similar to A47582 B^stl gr 15 

438821 AA826425 Hs.192375 ESTs 15 

^_ 421091 W22821 rftx)Somal protein U6 15 

65 439414 NM.001183 K8.6551 ATPase, H transporting, lysosomal (vacuo 2.5 

400263 Eos Control 15 

451428 AW083384 Hs.11087 ESTs. Highly similar to T46395 hypotheli 15 
449051 AWg61400 Hs.333526 HER2 receptor tyrosine kinase (oerb-b^. 15 
431615 AW295859 Hs.235860 ESTs 15 
70 433037 NM_014158 H5.279938 HSPa67 protein 15 
409504 AA304g61 Hs.699 peptidylproiyi isomerase B (cydopMBn 15 
409330 AK001231 Hs.53940 hypothetical protein FU 10369 15 
438299 AK000767 Hs.5111 hypoOietfeal protein nJ20729 15 
__ 450628 AW382884 Hs.204715 ESTs 15 
75 426793 X89887 Hs.172350 HIR (hlstone cellcyderegulatk}ndefec 15 
431122 AI2675g3 Hs.250535 Homo sapiens mRNA; cDNADKFZp434N241 2 (f 15 
407236 W79485 Hs.173980 nuclear n^prot^NMP200 related to 15 
421109 1^2 Hs.101642 AT-blndIng transcriptton factor 1 15 
408770 AW270608 Hs.170195 bone morphogeneOc protein 7 (osteogenic 15 
80 416737 AF15433S Hs.79691 UM domain protein 15 
414889 AA157291 Hs.21479 ubinudeini IS 
431222 XS6777 Hs.273790 zona peOucida glycoprotein 3A(speim re IS 
403532 NA4.024638M»nosaptenshypotheltealprol 15 
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418S49 


Al(»8485 


Hs.169341 


ESTs. Moderately Mar to S65657 alpha 


Z5 


443804 


AL135352 


Hs.255883 


ESTs, Weakly similar to 138022 hypotheQ 


Z5 


407887 


AA579668 


Hs.41072 


seHne (orcyste!rte) |Hoie!naseinhn)ilD 


2.5 


414556 


AW975063 


Hs.343443 


ribosomal protein L36 


Z5 


446911 


N27605 


HS.1&492 


OKFZP564G2022 protein 


ZS 


435126 


AI393666 


Hs.42315 


plO^ndino protdn 


2.5 


421666 


M24470 


Hs.1435 


guanosine nwnophosphate reductase 


2.5 


426768 


U66615 


Hs.172280 


SWUSNF related, matrix assoctated, acli 


2.5 


447050 


NM.016314 Hs.17200 


STAM(e protein conning SK3 eni TTA 


Z5 


408461 


AB0377S6 


Hs.45207 


ttypotheDca) protein K1M1335 


2.5 


421594 


R4S689 


Hs^1889 


Homo sapiens cDNA FU12978(]s, done NT 


2.5 


443837 


AI984625 


Hs.9884 


spindte pole body protein 


2.5 


421867 


AA481078 


HS.109G45 


hypothetical protein FU10498 


2.5 


435021 


AA922192 


Hs.73962 


ESTs 


15 


435750 


A8029012 


Hs.4990 


KIAA1089 protein 


2.5 


43S02S 


T08990 


Hs.4742 


anchor attachment protsln 1 (Gaalp, yeas 


2.5 


4072S5 


AA012992 


HS.2S6301 


hypothetical protein MI6C13170 


2.5 


425976 


C75094 


Ks.334514 


NG22 protein 


Z5 


449458 


AI805078 


Hs.208261 


ESTs 


2.5 


428013 


AF151020 


Hs.181444 


hypothetical protein 


Z5 


424369 


R87622 


Hs^714 


KIAA1831 protein 


2.5 


431604 


AF175265 


Hs.264190 


vacuolar protein sorting 35 (yeast homol 


2.5 


452779 


AA418775 


Hs.47234 


ESTs 


2.5 


433585 


T85301 


H8.194397 


fib:yd78d06.8l Soares feta) Over spleen 


2.5 


438682 


AA354489 


Hs.222103 


EBP50.PDZInteractor<rf64kO 


2.5 


427515 - 


T79S26 


Hs.179516 


Integral type 1 protein 


Z5 


418700 


AI963808 


HS.B8970 


ESTs, IModerately similar to ALU5_HinMAN A 


2.5 



TABI£67B 

Pkey. Unique Eos prot}eset Mentifier number 
CAT number. Gene cluster number 
Accession: Genbank accession numbeis 



Pkey CAT Number Accession 

415989 10194 1 BC013389 BC017398 AI023543 AA191424 AI267700 A1469633 AW958465 AW953397 AA172056 BE940298 BF909208 BF909980 BF095153 

BG285837 Ai720344 BF541715 AA3550B6 AA172238 
429220 15103.7 AW341473 AA448195 AW207206 Ai951341 AA969259 

412446 I 63467 1 BC021735AI669212AL120184AI769949BE701002BE184363BE819031 BG702238BF090049BF963318BF961912BF943013 AA9^^ 
AA151245 BF960659 AA987907 Z41449 BF908059 BF908053 BF908049 BE699424 BF908060 BFg62832 BF952020 BF963134 BI035538 
BF908052 BF908057 BF090026 BF943158 AI632924 BF5123*0 BF952021 BF950775 BF943437 BF942847 AI768015 F09778 F04816 F02721 
AA102645 AI633838 AA517929 BF947Q01 BI035448 BE935876 AW690837 AW898604 6F957405 BF963433 BG70481 5 

433404 7392.1 BC022980 BF224081 BG149908 AW672842 BE670687 A1702161 AW341B32 BE222503 N71838 Ai026061 AW953116 AW083132 A1979261 

AV725377 AI423298 AI640707 AW67551B Ai032611 AI818044 A1299508 AI911386 Ai270418 BE219257 BM141826 AA826491 Z25159 AA587421 
N59447 Z39436 T32982 R54110 BF115783 F09044 BF808433 

449825 249224L.1 BiSI 6168 AW779760 N48674 A1375997 AA235370 BG699146 Ai913631 Ai498402 Ai016320 AA323193 R49021 059344 BG986750 1445526 
BG986917 T61382R49391 R45432A1203107 R35004F07491 R25094 R35360 

453160 6028.5 BC009612NM.003526BIS97616AV761592AV760377AL601008BI604131 BE645918BG187760BG181525BG210634BG192999A1263307 

AA344186 AW952966 AA033609 AA037S62 AA722183 R794S2 H70775 BF674991 BE769437 BG007856 AA037483 AW572535 At 143991 
AA084581 AA033610 AV742510 AV735788 R08336 

420218 191547.1 AW958037 R42557Ai337047AA948360AI638005AA459950AI624915Ai638047Ai467856AI521826AA860305^^^^ 

AW779380 AA609879 A1634791 AI«3770AI565211 Z41145AI6279S2 AA303734BE349457 AW1967GSAA2S6527 BE089727 

437124 59408L1 A1050013BG939SOOAW969191 AA769925Ai377973At625545 AA811365AA521114 N24705Ai379579AA424899 AI684671 AA829715 AI453010 
N35401 AA677452 AA504340 At209149 AA883574 AI379062 AI084455 AI280147 AA644327 BF432S08 N27B73 N47364 N34880 AI147024 T86860 
A1219716 AAg60926 H25544 81857123 AW9804e9 AA599099 AA765246 AA328537 BG434703 AA890373 AA424765 AI292318 AA829886 N95742 
AI218758 H25568 N36282 AA024987 N36687 BI919187 mg471 AA889g70 AW1 66152 AA468546AI262504Ai452782AA554458AA807080 
AA724542AW102730AA909978AW118134AW827241 W56431 BM127381 BF43G987AI016509AW683972BM1276B6C15552 N63435 N51744 
T93800 N56980 BG106636 N49381 R49886 D61278 Bt756612 AA508234 R49885 BF8S0422 

432586 6633 J BC022881 AU1 50944 BG750783 AW754175 AW857737 AI91 1659 AI050036 AA554053 Ai826K9 AA568546 

438869 52134.1 AF075009 R63109 R63068 

450377 12109.1 AB033091 A1^20743BE811813N53332 N99716Ali61910AA280655BE710392AW05100AVV293978AW444556^ 

BG700891 BI553517R80518BG779771 BG534451 AA479402AW961580 6F061430AI857643Ai768486AW512118AM79302 AWn0384 
AW072470 Ai041 596 A1049699 AW592865 AA976261 6E879747 BG1 141 19 AW1B3811 AA909938 BF571621 6F350794 BF351375 BE925699 
AL050294 BC010371 BF982270 AL042656 BF095732 AW812618 BF095731 B6212397 BF676765 BI038602 BG388664 AW675337 BG289398 
BE939598 AW805570 AA527097AW1 50540 AI693720AA743364AI915793 N48185 AIS73107 AA043474 AI351615 AI9694g0 AI910763 R50866 
Ai699181 N73808H08164 

436063 5483.1 AK000O28 AA494483 Ai298674 AA720773 AV761 529 AI884670 Ai936202 AW294235 D616S2 BFBS1 184 AV71 13B4 N27154 AI926970 AV734g70 

N40094 N28596 AAB84747 AA512890 BG436593 

428342 671Z.1 AK056315 AI015524 AA724079 BI713619 A1377726 AW293682 AI928140 AI092404 AI085630 AA731340 BM469629 AW968804 AA4^658 

AA769094 BF446026 AW11 8719 AI332765 AW50D888 AW576558 AI85957 1 AW49g664 AW614573 AW629495 AW505314 W74704 AI356361 
AI923640AVV070509Ai521500AL04209SAA609309AA761319Ai381489 H45700AA761333AVV265424 

A1392620 Z40708 AI985S64 AW263513 AA913892 Ai693486 AW263502 Ai806164 AW291137 BI061872 BI0S9496 AA134476 AW084888 AA036867 

AW370823 T55263 BI002756 AA489664 BF827261 W74741 BFg63166 
424036 6226.1 NiyL033445BC001193At885781 BF794032AA476620AA810906AA810905AI291244At88S097AI359708AI335629 H97388AI344589AA300377 

AA457566 AW771833 BE465621 AI364068 A1364452 AI648505 AI918342 Ai928670 AA886580 AL531029 AA886344 Ai186419 BG32909B 

BM045465 AL531028 BG437151 6E868021 AA179427 
45D203 19009 1 AK055952 B6182168 66220105 BG191569 BG188964 BG187388 BG220104 BG183714 &E645998 Ai819354 AW974068 AI393835 A1580846 

Ai0247g6 AW020098 81491 127 AI393644 N74993 AW472959 BM478854 BI597437 H12165 81456612 BE543192 
451762 10408.5 AB032997A1141676AW978722BE467119AI761408BF727385AW237035AI934521 eF436248Ai47g668Z40632AA832081 AW295801 

BF057835 BE46S977 A1621269 BE465983 8F756369 N74056 A1817896 AA716S67 AA934774 K8260OH09497 BF943762 BE395335 BEB83333 
432363 1234917.1 AW970240AA534489AW970323 



758 



wo 03/042661 



PCT/US02/36810 



10 



422B90 61426.1 



417379 1610005J 
407819 7392J 

419733 7612L3 



432675 1237917_1 
447620 687223^1 
15 409151 -4123.1 



20 409960 39576J 



25 433891 647290.1 
414922 1563J2 



30 414222 18695.1 

35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



426991 29771J 



434194 62680J 
432908 452541.1 
412652 18858.2 



437179 12239.1 



418166 18856.1 



431416 120918J 
447881 44623.1 



407192 22002aL1 
434747 117643.1 
410297 2990.1 



424339 50559.1 



AK057805 AW1 62343 All 90479 AI093318 BE048820 All 98397 AA654667 BE219303 Z39851 P02655 Z28734 T16575 F10145 245266 AV«72911 
AW984438 AW004030 AI632565 BES02530 BI7923B3 BF056928 AA449241 A)651 825 AAB05324 A1264663 AWt%91 8 AA948267 A!953735 
A1263703 BFCS6387 AW594171 AI867447 AA319159 AI903440 AW9561 10 A1356013 AI867923 BG91 1906 081 142 C15616 AL538697 Z2S032 
Z43784 R13382 AV746924 AA449369 AA318815 BF364265 H17038 HI 0064 R)4161 T67230 R4089& AW204071 81819428 AA683393 AA683378 
Z43192T74078T05103F12527 T77951 H10116K17037 BF85S4Q7 R19603 
AA196390AA507837AA196468 

AK056628 AI800896 6F939022 BE64471B AI954754 6E218177 BE34B567 AI962406 AW293122 AI968798 AI457321 BE327228 BG913S31 
AW939055 T30260 R54166 Z43366 BG819153 BFOOS1 19 BE646274 BF940^1 R18246 R421&5 

AK027321 W53676 W53789 BH)4S412 BFl 14614 BE645183 BM126230 AWD44233 A1951970 AW663548 A1139947 AA514302 AA846232 

BM126251 AA789002 AA581985 AA809643 AW1 88870 BE706664 6E708539 BE153177 BF084g25 AL13377g AW961768 AA6SS693 AA347970 

AA295134 AAS26O37AA4492B2BG190454 H61476 T91398N20018T90114 H75644 8E710738 BF687723 H28581 AA249370 BF726698 

BE841554 BI04S099 T84625AW1 29676 6G770826 

AWg73a34 AI791932 AI7918S5 AI732640 AA558833 AA5S9897 AI821610 

AI973051 AI400921 Ar796154AW241817 AW290951 

NK4_004892 AF047442 BE275338 BF724863 BI917208 6E276993 At602308 AA306105 BM152S05 BC001364 AW993471 AW993481 AW993283 
AW992919 AW992921 AW992980 AW992B61 AW993220 B6573124 AA45638S AA234796 AI902726 AA354813 61092644 BG778400 B1260001 
BG007325 A)267455AA42^4 AI160782AI472186 AA255500AA434006 BG435520AI356111 H00525 AV74g060 BG944497 6G292031 
AA902153 C04925 AA902160 AA383100 AW073533 AA256706 AA150809 R65766 AW958448 BE090972 BF69319S AV736979 R65855 R80136 
AA484677 

BE64475B A1082238 BF940027 A1201079 AI436035 AW275g66 AI085394 A1291655 AW070441 AI474134 AI268978 At769279 AI567682 AA693941 
BF477668AVV664149AA283782BF509538AVV296866AI2689nAI166133BM352065A1262769BF941976 M 6F763S97 
AL565888 BM352383 AA427768 AA385346 All 88988 AA931 831 AA134g72 BF217480 BF111012 AA908246 AA319849 AA318136 A1514271 
BF364291 AL515057 AV70231 2 AA377395 AL544217 AI341000 AW193583 AI350789 AA888338 BF945380 AW87g092 AA130839 T91066 N92326 
A!004389 AA078832 A1572370 W04622 BE314003 AW960808 BM360872 AA319160 AA130778 AL514257 
AW182329 AA61 3792 T05304 AW858385 

BG107484 AA632009 A1432670 A1656S60 AI650884 A1521919 A12&4653 AW150793 AW61 1894 A1917098 BI091245 AI651454 6F434B89 AI5802B8 
AI880735 BE301995 A1392959 AW613965 BM023828 AW515374 A1460102 BM023318 BE328188 AI952820 AI581363 AA557165 At695677 
AL562079 AI700926 AM70561 BF063058 AW196387 AU132984 BI064046 A1970157 R02122 H55924 A1521721 AA808206 AA725223 AJ766003 
AW339821 AA805951 AI287969 AW5S4827 

BC021085 AL527872 AL526296 AL557087 BI255090AU1 43499 AL560355 BG823170 BE736988 AU141388 AL580262 BI764173 AU120299 
AU141755 AU14ig88 AU142941 AU141330 AU141784 61770885 61911394 BE901426 61918039 6G760842 6E883026 61254740 eE7326g0 
BG430761 BE792868 AW32fl267 All 351 73 AAl 02674 6F726986 AL564735 BE155962 BE155979 BF741879 H67776 H59234 H89665 AW117774 
AW274435 L29008 NM.003104 U07361 BF002824 81222949 AI458045 AW951 537 BF196474 At66g049 A!042523 BE463g28 AU16012S AU160560 
AW069877 AW01S214 AI946716 8E219706 AI953605 BE217755 AI991382 A)245005 BF431 179 AI521843 BE048908 A!420449 AW275385 
AA336950 BE501521 BF740566 AA311404 AA384639 B1772535 BG473076 BE8912g8 BE246928 6M012986 BE2426g3 BE901 342 6G746358 
BF374053AL564430AU143835AI635707BF195492At280559 8F741685AA3852578E247655 W94974 BE163702AI02S167A1827118 N78641 
AL581093 AU158964 AU156917 A1282516 AU146399 AA713947 AI285028 AA101228 A1338522 AA832316 Al284g86 AA857926 BF372568 
AA570172 AI753B25 AA171586 AU159257 BM194320 W93390 AW132101 AA5S0898 BI2S8878 AA522S54 N55172 AW013929 AI828274 AI871237 
C75260 AA934846 AA555036 AAS26579 AAS26466 N8Q270 AI538347 AW615805 AU158875 AU158883 AU159396 AI016667 AU1S9238 AI282517 
AA406317 AI2&5043 N53050 AA969446 N577ie H93323 W93374 A)873751 H5601 1 AI936174 AA937B30 AW438877 AI800550 AW32B268 
Ai244886 AA948424 AA700604 AA832317 AA01155S H97671 AA98871 1 AW1S0529 N70983 H44325 AV657614 AA902238 AA64401B AA0340S0 
T71053 N71444 AA831158 T40892 AA706106 

AK001536 AK0S6135 8M474813 BE887303 AK022914 AW581996 AW812945 6E882302 AA13426& 81043873 AA019433 81862088 BM468657 
AU128438 BE384458 AL353967 81657117 8F686525 BI46S223 BM460132 AU12g877 81222283 8G171592 81043544 8G496295 86750710 
BI256542 BG108520 AU1S0719 AW510354 A15542S6 AL3S3g68 AA191092 BF132635 
AF1 1 9847 AA437261 AA436987 AI132965 
AF150424 AI861896 AAS70057 AV736855 

A1801777 BE67n62AW008210 AWD09441 BE350994 8E207949 AI091475 AI802774 A1827533 AI075363 Al^ 
AA670056 AA602411 AA683472 At4360S8 AA612826 AI038932 BG057726 AI1 67355 BF449023 AI289476 AW074381 BF97291 2 AI991780 
AA689119 AI537472 Z39730 AI868953 At192337 BE812978 BE812939 AA1 15248 1499006 AI91 5784 F08973T1674B D20468 AA609899 BF081234 
AA115336 BE812876 BE812972 BE8129a2 BE813006 BE813019 R43883 BE012981 

AK055109 BC01908S AA187684 BG656226 BM023227 AI932311 AW264381 AA398371 6M021483 AI432433 AI375777 AI129580 AW262782 
AA134107 BM023515 AA977504 AI659222 AI348454 R6972S AA97S26B BM021207 AL080074 AA129218 AW207842 N90581 AA77igi9 AI092259 
AI028416 A1074114 BG656536 BE501677 aW193419 AA917040 W90430 AI342984 AI378957 AL036486 AW020068 B1491093 BF476021 R41226 
R69631 F04125 C02343 AA1 15589 R56480A)400988R54266 R31422 

AK055915 8E8672S2 A(523348 AA765350 6F446858 Z43875 R19529 AL133637 BG389444 AW382942 BE702956 AA081961 BE835247 BE635308 
eE835295 AA376302 BE645790 AA375690 AA376S92 AW954423 N65732 AA249770 AI75441 6 AA213816 6F592044 AA81 1729 AW514842 
A1633486 AI096810 AW183016 AI635738 N27524 6E&45916 A)247842 AWg91230 A17&4277 H1681 4 Ar766692 6F108422 A1800361 T951 29 
F1 1313 R97946 R93989 AA375242 BF109388 AI860939 AI68O06O AW953899 6F971486 A}g72337 AW953841 AA372437 AI216746 H11384 
R38484AA249043AA249732 

AW969151 AA532718AA504784 AI791194 A1821930AI8214B5F37127 AA654206 F27974 

AK074291 AW293424 BE676135 At832125 BE019146 BE465019 AI761124 AA617778 AI279232 AW575897 A!672039 F28818 BFg24261 
AA722184 BF934174 6E004328 AV749301 BE880282 BI019798 B1019389 BF928776 AW813409 AV726604 AA077560 BE272975 BF949119 
AWB141 95 BE879126 A1697926 BF594155 BE205787 BF06351 3 N35828 A)948557 AI433839 AI379679 BG0561 82 A1589094 N23123 AA588805 
AW316561 A1080272 A1421980 AI483316 BF194830 N87S90 AA49S993 N32996 AA699844 H96845 K96S92 N2B741 61035539 8F747723 
8F1 71 066 W01350 H0549S At24378S Z39622 AA887432 AI350659 R46102 
AA602964AA609200 

AW976537 A1033582 AA837085 AA745261 AA648395 

BC013939 B1494890 61491211 A!928393 AA843540 80938644 AU185628 61495842 AW173255 BM05Z709 Ar743999 AI690144 AI922209 AI740907 
AW340368AA928769AVV118737BF513970AA707807BF43529SAI339463At373842A14338098E222392AA« 

AW862760 AI888087 AI342098 AA722418 W78151 N64382 BE221848 AWOZSgOl AA452120 A1150479 AID16168 AA779515 AA661791 BAM74307 
81911 169 BG575154AW953303 T33604 D59141 AA385785 AA148648 6M461961 8G681168BI602483BE889592AW9S4311 6M0S2986 61962693 
AI989299 Z42328 BF029504 T35668 BG402602 AU1 85770 A(023271 AA147719 AI434079 A1569000 A1276488 AAg92453 AA342821 AA648303 
AI349364A1051008AA926941 AA350894 AW071451 N22249AI784138 AA083847 N22258 AW440825 AA661570 AA376667 AA659125 A1356299 
R70463 AI383586 AAB271B9 BI494872 AW021094 BI494871 AA9DS500 AA460923 81492041 AW026965 AI62461 1 BG271780 AI497723 H88862 
059856 N89979 AA656425 N81154 062341 AI274437 N66697 H96993 A1370663 AA728850 H05232 RS9374 Ht2223 AI935759 AI362S53 D60006 
N29572 AI916833 N75273 AA148710 B15971 17 BG740471 AA332671 AA333874 AA643052 AW0201 75 R70550 BG623469 AA452342 AW965441 
W19723 R32g66 6E883841 R610O3 AI910374 AI885262 R55325 A1468927 R34681 H9621 1 Z39607 BFg54386 

NM.01S720 AF219137 ALS34420 ALS24055 AL537346 AL538442 BG765888 AL530054 AL525377 8G474596 8G473144 BE251553 BG706099 
AL538039 8G703131 BE255806 8F8052S6 F12128 AL566773 81828686 8F761480 Ai204971 8G818818 81199246 ALS34816 8F529941 AA324163 
AL523285 8G914330 K07952 AL534815 8E769903 AI867802 BM310135 ALS337Q2 8E254484 BFS28852 6E867462 BE740130 AL1341&4 
AL5671 15 AL533701 AL524054 AL515904 AL523284 A1568203 AL534419 BF981 162 BE257148 AL561833 
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445636 8561 5 BF339388A1345516BG3916576E708967BG026034BE261703HS6716H65572 H93801 T46830 R96953R9698^ 

AA090672 

419175 35068 1 A6018322B(»12480 81524873 AW665554AI934469At479916BF09617BBF096162 81^132 M744972AI95ig8^ 

AA886998 A15705e5 A]gi6688 AI67881 1 AI693109 AI308135 AA669046 AAS61054 AI018082 H60&18 BE221942 R52609 AI915164 AA355626 
Z44671 Bt052776 BF882486 B6286184 A1589558 AA931683 AA534979 AI275392 AI273455 RS2553 AA629920 H80652 AA360728 F10618 
AW953665 AW176773 K85527 AA765570 AA081927 BF093262 BG743753 AL037576 AA534314 BE814984 BE973713 N4g493 6E006634 
BE008630 AW270O37 AA234765 AI334004 BF057179 AI857450 AI341 191 AI434143 A1917449 AW517207 AA25S424 AW008334 AAB47572 
AA994211 AAB61901 AA5B1873 A1580157 AI364353AW242357 AW235291 N55645 AA319869 R36911 AA256551 AW044188 AI203159 N49403 
F02090 AI187299 AI609644 Z40516 AW952314 

441128 2093^1 BC014072 BE328850 AI356567 AI148171 AI022165 BG149661 BF000671 AA233101 AA573721 AA447991 AW016855 AI005068 AAS5407I 
BF478215 AA906902AW014761 86905651 8E512923BM047129 AA243852 AA232991 AA12755DAA127S51 AA570256AI4732376F033706 
N90525 Am3623 A1359627 B6674574 Be903322 

406789 0.0 AI041403Z49148 

421091 2494U AK057700 BC015899 BE867108 AL525926 AL527436 BG91 3023 A1884867 BE858461 AI885227 AI935218 BE645396 A1922406 AA77B161 
BF345973 AW195853 At687121 AI336147 A1091364 AI769203 AW627859 BE677432 BF439954 BG820230 AA283686 
AA932022 BE222881 AW016109 BM352667 BFS28099 BG818616 AI935916 AA912295 BG910887 AL568301 AIJ567278 81522445 BI7543B4 
BG81937S BFBSSgSO AA323718 AAB60955 AI069847 AL520776 AL526045 A1538507 AL567798 AL536876 AL574332 6F834531 8F3401 16 
BF835954 AL573997 ALSB1252 BF957667 AL580170 AW952324 AL569382 AW148695 AL559234 BG034270 8(913839 AL5813Z7 AL565842 
H43568 BF94S188 AL5596B6 AL539326 T1S481 AW895092 AL582684 BI519896 

400263 18977 1 Z11692 X51466 NM_001961 M19997 81224253 8G830478 AU122147 AU123437 BG113591 BG752624 8E8B6804 BI868669 6G337216 AW629935 
BM01652SAIS60409AL562666A1909178BFB4g556AA371735BF038841 BF727115BC0065476G757526AL5S5664BI261304BG770095 
81033486 B1517580 BGB76486 81011828 AI313235 BG831724 BF869862 BG998348 61011834 BF888337 BF896627 8FD92380 AW803215 F01241 
6F805719 BG876487 AW49B536 BF988886 BG998849 AA248724 BG829202 BG756456 BG032392 Bt859267 BM016990 BG332369 BE9335S5 
BE15675BBM4S2445 AI937808AW026128 N23684AW006041 AI337621 F33111 8F344301 BG1 05450 86387343 BF569547BF 154671 
BM007368 BF569385 BE772007 81199487 BF761700 B!261519 BF944452 BF898S06 Ai038390 8M044934 AW361 142 BG743618 BE769206 
8E893973 BI01S047 BF886479 BF761350 BE769769 BG766117 8F847365 BE397834 AW371121 BF089125 8E082996 BF1B3193 BG180964 
BF08S940 BI000274 6G255503 BG674499 BG774174 BI015084 



TABLE 67C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identirer (Gi) numbers. Dunham I. et a).' refers to Ihe publication entitled Hits ONA 
sequence of human chromosome 22/ Dunham I. et al.. Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predlctfld. 
Ntj}OSltion: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


403047 


3540153 


Mnus 


59793-59968 


401424 


8176894 


Plus 


24223-24428 


401451 


6634066 


Minus 


119926-121272 


401747 


9789672 


Minus 


118S96-118816.119119-119244.119609-11976 


404632 


9796668 


Plus 


450S&45229 


403046 


3540153 


Minus 


55707-55859,56369-5651 1 


404922 


7341893 


Rus 


13248-13428 


403752 


7678857 


Rus 


33704-33828 


404210 


5006246 


Plus 


169926-170121 


401519 . 


< 6649315 


Plus 


157315-157950 


401785 


7249190 


Minus 


165776-165996.1661B9-166314.166408-16656 


401866 


8018106 


Phis 


73126-73623 


405387 


6587915 


Minus 


3769-3633,5708-5695 


401197 


9719705 


Rus 


176341-176452 


403532 


8076642 


Minus 


81750^1901 



TABLE 68A: 995 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES 

Table 66A lists about 995 genes up-regulated in prostate cancer compared to normal adult tissues. Theso were selected from 9009 probesets on (he AfiymetiWEos Hu03 GeneChIp 
array that showed some expression In human xenograft tumors and ceO lines. The 995 genes were selected such that the ratio of 'average' prostate cancer to 'average' normal adult 
tissues was greater than or equal to 2.5. The 'average' prostate cancer level was set to the 85th pementOe amongst prostate cancers. The 'average' normal adult tissue level was set 
to the 85th percentile amongst nofwnaOgnant Sssues. In order to remove gene-specific background levels of non-spedfto hybrldizaQon, the 10th percentile value amongst norh 
mdignanl tissues was subtracted from both the nurneralor and the denon^ 

Pkey: Unkjue Eos probeset Uentiiler number 
ExAccn: Exemplar Accession number, Genbank aooesslon number 
UnigenelD: Unlgene number 
Unlgene Title: Unlgene gene liOe 



R1: 


Ratio of tumor to normal body fissue 




Pkey 


ExAccn 


UnigenetO 


UnigeneTIfle 


R1 


448999 


AF179274 


H5,22791 


transmembrane piolein with EGF-Cke and 


85.5 


426747 


AA535210 


Hs.171995 


kaliitoeln 3, (prostato specific antigen 


68.6 


446057 


AI420227 


Hs.149358 


Trp>p8 transtent receptor potenSal cat! 


65.6 


400302 


N480S6 


Hs.283946 


folate hydrolase (prostal&spedfk: memb 


61;9 


432441 


AW292425 


Hs.163484 


ESTs 


60.4 


419526 


AI821895 


Hs.193481 


ESTs 


45.9 


414569 


AF109298 


Hs.118258 


prostate cancer associated prot^ 1 


45.7 


424099 


AF071202 


Hs.139336 


ATP4iinding cassette, sut>>fSamny C (CRR 


45i 


432240 


A1694767 


Hs.129179 


Homo sapfenscDNA FU13581 fis. done PL 


44.9 


400298 


AA032279 


Ks.61635 


six transmembrane epQheBal anfigen of 


40.9 


453370 


AI470523 


Hs.133338 


ATP-fabidlng cassette. siMmOy C (CFTR 


39.6 
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409731 AA125985 Hs.56145 thymosin, beta. Uentifted In neuroblast 37.7 

400299 X07730 Hs.l71995 kalBkrein 3, (prostate spedlic antigen 34.9 

400296 AA305627 Hs.139336 ATP-binding cassette, sub-family C(CFTR 31.7 

400292 AA2S0737 Hs.72472 BMP-RIB 31.4 

5 42S075 AAS06324 Hs.1652 add phosphatase, prostate 31.4 

409262 AK000631 Hs^2258 hypothetical protein FU20624 31.4 

428819 AL135623 Hs.193gi4 K1AA057S gene product 31.2 

409361 N(yL005982 Hs.54416 sine ocuQs homeobox (Drosophils) homolD 30.3 

4D7168 R45175 Hs.117183 ESTs 29.6 

10 400287 839329 Hs.1B1350 katiikrein 2. prostatic 29.6 

415539 AI733881 Hs.72472 BMP-RIB 28.8 

432374 W68815 Hs.3018dS Homo sapiens cDNAFUl 1346 fis. done PL 28.8 

433043 W575S4 Hs.125019 tymphdd nudear pmtein (LAF-4) mRNA 28.1 

423073 BE2S2922 Hs.123119 MAD (mothers ag^nstdecapentaptegic. Dr 27.5 

15 403047 NM.00S656*:Homo sapiens transmembrane pr 27.4 

444484 AK002126 Hs.11260 liypolhelical protein HJ11264 25.7 

428336 AA503115 Hs.1837S2 micraseminopioleln. beta- 25.0 

401424 NM-001172:HQmo sapiens aiginase, type II 24.9 

407709 AA456135 Hs.23023 ESTs 24.7 

20 407122 H20276 Hs.31742 ESTs 24.6 

42S628 NH.004476 H5.283946 Mate hydrolase (prostalo^peciflcmemb 212 

415989 A1267700 ESTs 23.1 

448290 AK002107 Hs.20843 Homo sapiens cDNAFU11245 lis. done PI 22.7 

437052 AA861697 Hs.120591 ESTs 22.6 

25 411096 U80034 Hs.68SB3 mitodK)ndriai Intemfiediate pepUdase 22.5 

451939 U80456 Hs.27311 sinel&^nded (Drosophlla) homdog 2 21.2 

450096 AI682068 Hs.7937S holocarboxylase synthetase (bio(in-[prop 20.8 

419743 AW408762 Hs.S957 Homo sapiens done 24416 mRMA sequence 20.5 

427958 AA418000 Hs.g8280 potassium intermedlatefemalloondudanoe 20.5 

30 431548 Ai834273 Ks.9711 novel protein 19.8 

434666 AF151103 Hs.112259 T cell receptor gamma locus 19.4 

426501 AW043782 H$.293616 ESTs 19.3 

413597 AW302885 Hs.117lB3 ESTs 18.9 

428862 NM 000346 Hs.2316 , SRY (sex detemilning region Y)-box 9 (ca 18.7 

35 435677 AA^142 Hs.293726 ESTs, Wealdy similar to TSGA RAT TESTIS 18.0 

429220 AW207206 ESTs 17.9 

408826 AF216077 Hs.48376 Homo sapiens done HB-2 mRMA sequence 17.6 

418961 AW987M6 Hs.23023 ESTs 17.3 

428898 AB033070 Hs.194408 K1AA1 244 protein 17.3 

40 420757 X7B592 Hs.99915 androgen receptor (dihydrolestosteiDne r 16.8 

, 428398 AI249368 Hs.98558 ESTs 16.3 

419078 M93119 Hs.89584 insuiinoma-assodated 1 15.8 

450382 AA3976S8 Hs.60257 Homo sapiens CDNAFU13S98 lis, clone PL 15.4 

449625 m^014253 odz(oddOz/le(Hn.Drosoph!la)homolog 1 15.2 

45 401451 NM_004496':Homo sapiens hepatocyte nude 14.9 

447033 AI357412 Hs.157501 ESTs 14.9 

452594 AU076405 H5.29981 solute carrier famSy 26 (sulfate transp 14.7 

448519 AW175665 Hs.278695 Homo sapiens prosteinmRNA. complete cds 14.6 

416829 AA516531 HsJ5999 NKhomeobox(Drosophni4.f8mOy3.A 14.5 

50 430376 AW292053 Hs.12532 chromosome 1 open reading frame 21 14.4 

418848 AI820961 Hs.193465 ESTs 14,3 

429916 AW873S86 Hs.1ig383 ESTs 14.2 

424012 AW368377 Hs.137569 tumor protein 63 kOa with strDnghomotog 14.1 

418278 AI088489 Hs.83937 hypothetical protein 14.1 

55 408000 L11690 Hs.198689 bullous pemphigoid antigen 1 (230f240kD) 13.9 

443822 Ai087412 Hs.143611 ESTs. Weakly slmBar to 200439gAchromos 13.3 

432101 Ai918950 Hs.123642 EphA3 13^ 

447342 AI199268 Ks.19322 Homo sapiens. Sbribr to RlKENd)NA201O 13.1 

412446 AI768015 ESTs 12.7 

60 437718 AI927288 Hs.196779 ESTs 12.6 

433404 T329B2 ESTs 12.6 

431089 BH041395 Ks.156110 ESTs. Weakly similar to unknown protein 12.6 

427212 AW293849 Hs.58279 ESTs^WeaklysImllarlo ALi;7.HUMAN ALUS 12.5 

^_ 429597 NM.003816 Hs.2442 adislntegrin andmetalloprotelnasedoma 12.6 

65 444151 AWg72917 Hs.128749 a)pha^thylacy1-CoA racemase 12.4 

428046 AW812795 Hs.337534 ESTs. Moderately simitar to 138022 hypot 12.4 

453160 A)263307 H2B hlstone family, member L 12.2 

433927 AI557019 Hs.116467 smaQ nudear protein PRAC 12.1 

„ 431117 Af=003S22 Hs.2SOSO0 delta (Dro6ophila)-&kB 1 12.1 

70 424600 AL035588 Hs.153203 MyoD family inhiUlDr 11.9 

427674 NM.003528 Hs.2176 H2Bh]stDne family, mernberO 11.9 

424692 AA429834 Hs.151791 KIAA0092 gene product 11.7 

415263 AA948033 Hs.130853 ESTs 11.3 

416182 NM.004354 H$.79069 cydinG2 11.3 

75 420216 AW956037 ribosomal prolein L4 11.3 

411887 AW182924 HS.12S790 ESTs 11-2 

425154 NM.001851 Hs.154850 odiaeen, fype DC. alpha 1 11.2 

430226 BE245562 Hs.2551 arfrenergic. beta-2-, receptor, surtee 11.2 

450325 A1935962 Hs.91973 ESTs 11.1 

80 439444 A1277652 Hs.54578 ESTs. Weakly similar to l38Q22hypotliefi 11.1 

440529 AW207640 Hs.16478 Homo sapiens cONA:FU21716 Us. done C 11.0 

457211 AW972565 Hs.32399 ESTs, Weakly similar to S51797vasodilat 11.0 

428600 AW863261 Hs.138860 hypolheDcal protein OKFZp434K1421 11.0 
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422728 


AW937826 


Hs.103262 


ESTs. VMdy sfanSv b ZN91J1UMAN ZINC 


11.0 




417333 


ALI 57545 


Hs,1 73179 


bnxnodom^ and PHOfiUQOf contsining, 3 


10.7 




434423 


NM_006769 


H5.3844 


UM domain only 4 


10.6 




434170 


AA62&509 


Hs.122329 


ESTs 


10J 


432729 


AK000292 


H5.130732 


hypoOffitJcal protein FU 20285 


10.5 




446100 


AW967109 


H5.13804 


hyjnlheQcai protein dJ462023.2 


10.4 




432715 


AA247152 


Ks.200483 


ESTs, Wealdy slmaar to KIAA1074 protein 


10.4 




421896 


N62293 


Hs.45107 


ESTs 


10.3 


1 n 
lU 


401747 






Homo sapiens keidGn 17 (KRTIT) 


10.1 


451640 


AA195601 


Hs.26771 


Human DNA sequence from done 747H23 on 


10.1 




431542 


H63010 


Hs.5740 


ESTs 


10.0 




447397 


BE247676 


Hs.18442 


E-1 enzyme 


10.0 




433265 


AW97S944 


Hs.237396 


ESTs 


10.0 


15 


431325 


AW026751 


Hs^ 


ESTs. Vteakty simOar to 210926QA B ceS 


9.9 


410889 


X91662 


Hs.66744 


twist (DrosophSa) homoiog (acrocephdos 


9.6 




432674 


AA641092 


Hs.257339 


ESTs. Weakly simDar to 138022 hypdttieti 


9.6 




434926 


BE543269 


Hs.50252 


mitodiondilai ribosomal protein L32 


9.6 




442049 


AA310393 


Hs.190044 


ESTs 


9.6 


20 


443180 


R15875 


Hs^58576 


ciaudin12 


9.5 


406964 


M21305 




FGENHS predated novel secreted protein 


as 




426350 


NM.003245 


Hs.2022 


transgiulaminase 3 (E polypeplide. prote 


9.5 




456660 


AA909249 


Hs.1 12262 


sotute canler family 30 (zinc transport 


9.5 




448045 


AJ2g7436 


Hs^166 


prostate stem cell antigen 


9.5 


25 


451684 


AF216751 


Hs.26813 


C0A14 


9.5 


440594 


AW445167 


Hs.126036 


ESTs 


9.4 




432415 


T16971 


Hs.289014 


ESTs. Weaidy similar to A43932 mucin 2 p 


a3 




418601 


AA279490 


Hs.86368 


calmegin 


a3 




433332 


AI367347 


Hs.44898 


Homo sapiens done TCCCTA00151 mRNA sequ 


9.2 




437124 


AA554458 




KIAA0666 protein 


9.2 


428728 


NMJ16625 


H5.191381 


liypotlielical protein 


9.2 




438869 


AF07S009 




gb:Homo sapiens fuH length insert cDNA 


9.1 




431992 


NM„002742 


Hs.2891 


proldn kinase C, mu 


&9 




413992 


W26276 


Hs.136075 


RNA.U2sman nuclear 


a9 


35 


428342 


AI73916e 




Homo sapiens xXm FU 13458 lis, done l\ 


8.9 


418836 


At655499 


H3.161712 


ESTs 


aa 




409799 


011928 


Hs.76845 


phosphoserine phosphatas&^ilte 


as 




400294 


N95796 


Hs.278695 


Homo sapiens piosteln mRNA, complete cds 


a? 




434826 


AF155661 


Hs.22265 


pynivate dehydrogenase phosphatase 


8.7 


40 


418250 


U29926 


Hs.83918 


adenosine monophosphate deaminase (isofo 


a7 


425905 


A8032959 


Hs.318584 


novel C3HC4 type Zinc finger (ring Tinge 


a? 




418329 


AW247430 


Hs.84152 


cystalhionine4)eta^lhase 


a7 




450203 


AF097994 




L-lcynurenlne/^pha^noadipate amlnotra 


a7 




425465 


L18964 


Hs.1904 


protein kinase C. iota 


as 


45 


416239 


AL038450 


Hs.48948 


ESTs 


8.5 


436962 


AW377314 


Hs.5364 


DKFZP564I052 protein 


a4 




450164 


AI239923 


Hs,63331 


ESTs 


a4 




452744 


AI267652 


Hs.246107 


Homo sapiens mRNA; cDNA DKFZp434E082 (fr 


a4 




440774 


AI420611 


Hs.153934 


ESTs 


a3 


CA 

50 


444922 


AJ921750 


Hs.144871 


Homo sapiens cONA aJ13752 lis. cM f*L 


a3 


410870 


081599 


Hs.66731 


homeot)oxB13 


a3 




435047 


AA454985 


Hs.54973 


cadheifn-Dke protein VR20 


a2 




418564 


AA631143 


Hs.278695 


Homo sapiens pnsslein mRNA, complete cds 


a2 




451027 


AW519204 


Hs.40808 


ESTs 


a2 


55 


445873 


AA2S0970 


Hs.251946 


pdy(AH3inding protein, cytoplasmic 1-1 


a2 


423349 


AF0102Sd 


Hs.127428 


homeol)oxA9 


ai 




421928 


AF013758 


H3.109643 


pdyadenylato btodong protsin^ntBracOn 


ai 




432586 


AA566548 




ESTs 


ao 




426108 


AA622037 


Hs.166468 


progiammed cdl deatli 5 


ao 


iCA 

oO 


417153 


X57010 


HS.B1343 


coDagen, type 11. alpha 1 (primary osto 


ao 


416653 


AA768553 


H5.193145 


metaOothionein IE (functional) 


7.9 




447476 


Be293466 


Hs.20880 


ESTs. Weakly similar to 138022 Ivpotheti 


7.9 




450377 


AB033091 




KIAA1265 protein 


7.8 




429769 


NWL004917 


H&218366 


kallikrein 4 (prostase, enamel matrix, p 


7.7 


65 


416815 


U41514 


Hs.80120 


U0P-N-ac8tyl-alpha^alactosam{ne:polyp 


7.7 


433323 


AA805132 


HS.159142 


ESTs . 


7.7 




451952 


A1120173 


Hs.301663 


ESTs 


7.7 




452367 


U71207 


Hs.29279 


eyes atisenl (Drosophila) homdog 2 


7.7 




421470 


R27496 


Hs.1378 


annexInAS 


7.7 


70 


420092 


AAai4043 


Hs.88045 


ESTs 


7,6 


416111 


AA033813 


Hs.79018 


dvomsSn assembly factor 1, sutunit A ( 


7.6 




440749 


W22335 


Hs.7392 


hypothetical protein VIGC3199 


7.6 




441866 


BE464341 


Hs.21201 


necQn 3; DKFZP56660846 protein 


7.6 




411019 


AW993097 


Hs.48617 


Homo sapiens cDNA FU1 2540 lis. dene NT 


7.5 


75 


421566 


NM.000399 


Hs.1395 


eaily growth response 2 (Krax>20 (Drosop 


7.4 


451418 


BE387790 


H5.26369 


hypotheOcd protein FIJ20287 


7.4 




433517 


AW022133 


Hs.189838 


ESTs 


7.4 




415621 


AI648602 


Hs.55466 


ESTs 


7.4 




442592 


BES66623 


Hs.29899 


ESTs, WeaMy stmOar to GO2075 transcrip 


7.3 


80 


452340 


NM_002202 


H$.S05 


ISL1 transcripUon factor. UM/homeodoma 


7.3 


450244 


AA007534 


H$.125062 


ESTs 


7.2 




433293 


AF007835 


Hs.32417 


hypothetical protein MGC4309 


7.2 




425259 


AL049280 


Hs.155397 


Homo sapiens mRNA; cONA OKFZp564K143 (fr 


7.2 




434485 


AI823511 


Hs.1 18587 


ESTs 


7.2 
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442772 AW503680 Hs^957 Homo sapiens done 24416 mRNA sequence 7.1 

425211 M18667 Hs.1B87 progastiicsin (pepsinogen C) 7.1 

43195B X63629 HS.2S77 cadherin 3, type 1, P-cadherin (placenta 7.0 

408418 AW963897 Hs.44743 K!AA1435pretein 7.0 

S 436083 AK000028 ribosomal proleln 824 7.0 

412350 A)6S9306 Hs.73826 protein tyrosine phosj^hdase. noiHecepl 6.9 

414864 AA587775 H5.68295 muiO-POZKtoni^n^tabiing protein 6L9 

428465 AW970976 Hs.293653 ESTs a9 

436476 AA326108 Hs.33829 l)HLH protein DEC2 &8 

10 431585 BE242603 Hs^2823 hypotheScsl prol^ RJ10326 &8 

434808 AF155106 Hs.256150 Homo sapiens. SimOarloRlKENcONA 2610 a7 

432103 T15803 Hs.272458 protein phosphatase 3 (formerly 28). cat &7 

456088 BE177320 Hs.156148 hypothetical protein aJ13231 6.6 

429064 AJ001443 Hs.195614 splicing factor 3b. subunit 3. 130kO 6.6 

15 422486 BE514492 Hs.1 17487 gene near HD on 4p16.3 with homology to 6.6 

410227 A8009284 Hs.61152 exostoses (mutttpteKiltt 2 6.6 

453439 AI572438 Hs.32976 guanine nucleofkle binding protein 4 6l5 

410037 AB020725 Hs.58009 KiAAC»18 protein 6.5 

420297 A1628272 Hs.e8323 ESTs, Weakly siniilar to ALU1.HUMAN ALUS 6.4 

20 409648 AW451449 H$.57749 ESTs &4 

448146 NiyL016578 Hs.20609 HBVpX associated protein^ 6.4 

410268 AA316181 Hs.61635 six transmembrane epithelidanllgen of 6.4 

422134 AW17g019 Hs.112110 mitochondrial rtboscnna] protein 142 6.3 

415066 Z19448 H$.131887 ESTs, WeaMy similar to T24396hypoM 6.3 

25 404632 NN/L022490:Homo sapiens hypothetical prot 6.3 

412935 BE267045 Hs.7S064 tobidn^pedficchaperonec &3 

453308 AW959731 Hs.323099 ESTs &3 

449048 Z45051 Hs.22920 similar to S68401 (catlle)gtiJ005e Indue &3 

410752 AF226053 Hs.66170 HSKM-B protein ' 6.3 

30 436032 AA150797 Hs.109276 latexin protein &3 

456508 AA502764 HS.1234G9 ESTs. Weatdy similar to AF208855 1 BM-01 &3 

413328 Y15723 HS.7S295 guanylate cyclase l.sohibie. alpha 3 6.2 

425071 NM_013989 Hs.154424 deiodinase. bdothyronine. type II 6.2 

424036 AA770688 H2A histone family, member L 6.2 

35 418245 AA088767 Hs.83883 transmembrane, prostate androgen Induced 6.2 

450861 A152389B H3.17617 ESTs 6.1 

418821 AA436002 Hs,183161 ESTs 6.1 

458809 AW972512 Ks.20985 8hi3^odated polypeptide. 30KD 6.1 

432527 AW975028 Hs.102754 ESTs 6.1 

40 433730 AK002135 Hs.3542 hypothetical protein FU 11 273 6.1 

449845 AW971183 Hs.6019 DnaJ 0fep40) homotog. subfamBy C. membe 6.1 

420948 A8016898 Hs.100469 myeloid/lymphoid or mixed-Gneageteukem 6.1 

425704 U79293 Hs.159264 Human done 23948 mlVlA sequence 6.0 

458440 A1095468 Hs.135254 Homo sapiens done llhrombospondin mRNA &0 

45 419647 AA348947 H5.91816 hypothetical protein ao 

455497 AA1 12573 Hs.278695 Homo sapiens prosleln mRNA, complete cds aO 

428593 AW207440 Hs.185973 degeneralhfe spennalocyte (homolog Oroso a9 

400303 AA2427S8 Hs.79136 UV-1 protein, estrogen regulated 5.9 

446416 AV6S82g9 Hs.163g59 ESTs 5.9 

50 407819 R42185 ESTs a9 

430387 AW372884 Hs.240770 nuclear cap binding protein subunit 2. 2 5.9 

410102 AW248S08 H&279727 ESTs; homdogue of PEM-3 [Qona savlgnyi 5.9 

429165 AW009886 Hs.118258 prostate cancer assodflted protein 1 a9 

417873 6E26e659 Hs.2936S9 Homo sapiens. Similar to RIKENcDt^A430 aS 

55 410240 AL157424 Hs.61289 synaptcjanin 2 a8 

421305 6E397354 Ks.324830 diptherta toxki rasistancs protein requi 5.8 

451406 AI694320 Hs.62g5 ESTs, Weakly sbnilar to T1 7248 hypotheti 5.8 

416795 Ai497778 Hs.20S09 HBVpXassodatedproteirv^ 5.7 

^_ 427701 AA411101 Hs.2438B8 midearautoantigenic sperm protein (his a7 

60 451468 AW503398 Hs.2g3663 ESTs, Moderately simQar to 138022 hypot a7 

433852 AI37B329 Hs.126629 ESTs a7 

432882 NM.0132S7 Hs.27g696 senrni/glucocorficokl regulated kinase^ a6 

419440 AB020689 Hs.90419 KIAA0882 protdn a6 

435706 W31254 H3.7045 GUMM protein 5.6 

65 431583 ALJ042613 Hs.262476 S^enosytmethlontne decarboxylase 1 a6 

442409 BE208843 Hs.129544 hypotheSca) protein MGC1 5438 5.6 

453171 R76472 Hs.65646 ESTs 5.6 

437252 AI433B33 Hs.164159 ESTs, Weakly similar to ALUI^HUMAN ALUS 6.6 

422083 NM_001141 Hs.1112S6 aradildonate 15^pQxygenase, second typ 5.6 

70 429638 AI916662 Hs.211577 Mnectin 1 (Mnesbl leoeptoi) 5.5 

403046 NM>005656*'i1omo sapiens transmembrane pr as 

447805 AWB27932 Hs.302421 gemln4 S.5 

4431B2 T49951 Hs.9029 DKFZP434G032 protein 5.4 

418555 A)417215 Hs.87159 hypothetical protdn FU12577 a4 

75 437162 AW005505 Hs.5464 thyroM homrone receptor ooadivaling pr a4 

419465 AW500239 Hs.21187 Homo sapiens cDNA:FU23088fis. done L a4 

440146 AW014231 HS.S0790 Homo sapiens cDNA:FU22930fis. dona K a4 

427308 026067 Hs.174905 KtAA0033 protein a4 

409151 AA306105 SEC22, veside traffiddng protdn (S. c 5.4 

80 432840 AK0ai4O3 Ks.27g521 hypothetical prol^ FU20530 5.4 

423242 AL0394a2 Ks.125783 DEME4 protein a3 

453082 H18835 Hs31608 hypotheflcd protein FlJ2004t a3 

441021 AW578716 Hs.7644 HI histoneramDy,mefhber2 aS 
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415276 U88866 Hs.78353 SFRSpiDlein kinase 2 5.3 

400290 H1B836 Hs.31608 hypothetical protein RJ20041 5J 

432435 BE21888S Hs.282070 ESTs 

417318 AW953937 Hs.240845 ESTs W 

5 429467 NM_004477 Hs.203772 FSHO region gene 1 5^ 

416276 U41060 Ks.791% UV-I protein, estrogen regtMed &3 

410076 T05387 Hs.7991 ESTs W 

420039 NM„004605 Hs.94581 sutfotrenstsrase family, cytosoiic. 2B. 5.2 

423445 NM.014324 Hs.12874g slphs^thylacyl-CoAracemase 5.2 

10 436420 AM43966 ite.31595 ESTs 5.2 

414085 AA114016 Hs.75746 aldehyde defiydngenase 1 famDy. member 5.1 

422048 Nlit012445 Hs.288126 spondln 2. extreceHUtar matrix proton 5.1 

427176 AW381569 Hs.40334 ESTs 5.1 

450832 AW970602 Ks.l05421 ESTs 5.1 

15 448807 AI571940 Hs.7549 ESTs 5.1 

420568 R)9247 Hs.247735 prolocadherin alpha 10 5.1 

421685 AF189723 Hs.106778 ATPase,Ca transporting, type X,memt)er 5.0 

427615 BE410107 Hs.179817 CGi-82 protein, PSDR1 5.0 

430598 Aia)01764 Hs.247112 hypothetical protein FU10902 SlO 

20 452576 AB023177 Hs.29900 KiAA0960 protein SlO 

431724 AA514535 Hs.283704 ESTs SlO 

409757 NM.001898 Hs.123114 cystaSnSN 5.0 

432242 AW022715 Hs.162160 ESTs. WeaWy simDar to ALU4_HUI^ ALU S 5.0 

407103 AA424881 Hs.25e301 hypothetical protein MGC1 31 70 5.0 

25 421154 AA284333 H$.287631 Homo sapiens cDiMFU14269fis. done PL 5.0 

400301 X03635 i43.1657 estrogen receptor 1 5.0 

426006 R49031 Hs.22627 ESTs 5.0 

452679 Z42387 Hs.83883 transmembrane, prostate androgen Induoed 4.9 

446880 A1811607 i4s.108&46 Homo sapiens cDNAHJ14934fi8, done PL 4.9 

30 444108 1^784 Hs.140942 ESTs 4.9 

452259 AA317439 Hs.28707 signal sequence receptor, gamma (transto 4.9 

419083 A1479560 Hs.98613 Honro sapiens cDNAaJ12292 lis. done MA 4.9 

416854 H40164 Hs.80296 PurWnja cell protein 4 4.9 

422890 Z43784 ankyrin 3.nodeof Ranvler(anl(yrin 6) 4.9 

35 428330 L22524 Hs.2256 matrix metaOoproteinase 7 (mabilysin. 4.9 

419168 AI336132 Hs.33718 Homo sapiens cDNAFU 12641 lis. done NT 4.9 

452017 AF109302 Hs.27495 prostate cancer assodated protein 7 4.9 

456362 AW973003 H8.179909 hypothetical protein RJ22995 4.9 

442501 AA315267 Hs.23128 ESTs 4.8 

40 439735 A1635386 Hs.142846 hypothelica) protein 4.8 

451752 AB032997 K1AA1 171 protein 4.8 

423583 AU22055 Hs.129836 KIAA1 028 protein 4.8 

446157 BE270828 Hs.131740 Homo sapiens cDNA:aJ22S62 lis, done H 4.8 

432363 AAS34469 gb3Tf76g1 1 .si Na_CGAP.Ce3 Homo sapiens 4.8 

45 433313 W20128 H5.296039 ESTs 4.9 

408101 AW968504 Hs.123073 CDC2-related protein kinase 7 4.8 

453857 AL080235 Hs.35861 OKFZPS86E1 621 protein 4.8 

451982 F13036 Hs^7373 Homo sapiens mRNA:cDNAOKFZpS6401783 a 4.7 

447574 AF162666 Hs.18895 tousled^to kinase 1 4.7 

50 429299 AI620463 Hs.347408 hypothetical protein MGC1 3102 4.7 

420522 AW957137 Hs.98541 hypoOwtical protdn 4.7 

417379 AA196390 gb:zp99b10.&1 Strdlagene musde 937209 H 4.7 

411031 W37943 Hs.348g2 KIAA1323 protein 4.7 

431725 X65724 Hs.2839 Nonie disease (pseudogitema) 4.7 

55 441224 AU076964 Hs,7753 cahimenln 4.7 

407813 AL120247 Hs,40109 KiAA0872 protein 4.7 

440074 AA863045 Hs.1D669 ESTs. Weakly similar to T00050 hypolheti 4.7 

430294 A1538226 Hs.32976 guanine nudeotide t)tnding protein 4 4.7 

415082 AA160OOO Hs.137398 ESTs. Weakly similar to JC5238galadosy 4.7 

60 414922 D00723 glydnedeavage system protein H (amino 4.7 

430945 U80869 Hs.55g99 NKhom9obox{Drosophila).family3,A 4.7 

438825 BE327427 Hs.79953 ESTs 4.7 

425174 D87450 Hs.154978 KIAA0261 protein 4.7 

420380 AA640891 Hs.102406 ESTs 4.7 

65 410193 AJ132592 HS.S9757 zinc finger protein 281 4.7 

440300 N3g7e0 H8J859 Homo sapiens. Sindar to RiKENcONA 5630 4.6 

436761 A»17776 Hs.236557 ESTs 4.6 

410275 U85656 Hs.61796 transcriptton factor AP-2 gamma (adhoi 4.6 

404922 NM-003071:Komo8apiensSWI/SNF related. 4.6 

70 438556 AI364997 Hs.7572 ESTs 4.6 

429302 AU076674 Hs.198899 eukaryotk: translation tnlfialtonfador 4.6 

447595 AW379130 Hs.18953 phosphodiesterase 9A 4.6 

433006 BE2427S8 Hs.190223 ESTs, Moderately similar to T2928Shypot 4.5 

407894 AJ278313 Hs.41143 phosphdnosiUdwpedficphosphdipase 4.5 

75 416818 AA228899 Hs.101307 Homo sapiens HUT11 protein mRNA. partial 4.5 

433556 W56321 Hs.1 11460 cddumilBalmodulIivdependent protein Hn 4.5 

443123 AA094538 Hs.272808 putafive trenscitption regulation nuctea 4.5 

419991 AJ000098 Hs.94210 eyes absent {Drosophila) homotog 1 4.5 

419594 AA013051 Hs.91417 topoisomefase (DNA) H binding protdn 4.5 

80 453390 AA8824g6 Hs.28482 ESTs 4.5 

417061 AI675944 Hs.188691 Homo sapiens cDNAaJ12033fi5. done HE 4.5 

419239 AA46B163 HS.184S98 Homo sapiens cDNA: FU23241 fis. done C 4.5 

435873 N23874 Hs.50477 RAB27A. member RAS oncogene famBy 4^ 
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408001 AA046458 Hs.S5296 ESTs 4.4 

413950 AA249096 Ks,32793 ESTs 4.4 

416655 AW988613 Hs.79428 BCL2/adenovinJS E1B ISkD^terading pro 4.4 

- 418004 U37519 Hs.87539 aldehyde dehydn)g6n8se 3 family, mear^ 4.4 

5 419733 AW362955 HomosapienscONAFU14415fis.donsKE 4.4 

433233 AB040927 H3^01804 KIAA1494 protein 4,4 

439979 AW6002gi Ks.6823 hypothetical prot^ FU 10430 4.4 

4305S9 AJ002744 Hs.246315 UOP^acety)^pha'09a!actosaniin8:potyp 4.4 

417958 AA767382 Hs.193417 ESTs 4.4 

10 450680 AF131784 Ks.25318 Homo sapiens done 25194 mRNA sequence 4.4 

435020 AW505076 Hs.301855 IKGeorge syndroms criOcal region gene 8 4.4 

437396 BH140396 Hs.21621 hypotheiicd protdn DXFZp762O076 4.4 

425810 AI923627 Hs.31903 ESTs 4.4 

426028 NKtOOmo Hs.172028 a(fis!ntegr!nandmetaltoproteInasedoina 4.3 

15 452747 BE153855 Hs.61460 )g superfamSy receptor LNIR 4.3 

429259 AA420450 H8.292911 PlatopMIn 4.3 

452092 6E245374 Hs.27842 hypoMcd protein FU11210 4.3 

437967 BE277414 Hs.5947 met transfonnlng oncogene (derived fnxn 4.3 

459284 AF155660 Ks.3004g6 mltDchondiial solute carrier 4.3 

20 417067 AJ001417 Hs.81086 solute canter (anuly 22 (extraneufonal 4.3 

423453 AW450737 Hs.128791 CGI^ protein 4.3 

407137 T97307 gb:ye53h05.8l Soares fetal Ever spleen 4.3 

448826 At560252 Ks.293246 ESTs, Weakly similar to putative pi 50 [H 4.3 

419713 AW988058 Hs.92381 nudlx (nucleoside diphosphate linked mol 4.3 

25 427359 AW020782 Hs.79881 Homo sapiens cONA:aJ23006 lis. done L 4.3 

450402 BE218027 Hs.89969 ESTs 4.3 

428055 AA420564 Hs.101760 ESTs 4.3 

429441 AJ224172 Ks.204096 IqxiphlOn B (uterogfobin family memtier) 4J 

432875 AI791855 ESTs 4.2 

3U 439963 AW247529 Hs.6793 platelel-acthraiing tedor aoetylhydrola 4.2 

445707 AI248720 Hs.1 14390 ESTs 4.2 

410297 AA148710 lunta 4^ 

416737 AF154335 H5.79691 UM dom^ ptX}tBin 4.2 

_^ 430255 AK00O703 Hs.323822 Homosaptens mRNA (or KIAA1 551 protein, 4.2 

35 421629 AB018330 Hs.108708 C8lduniA:atmodurnKlependent protein kin 4.2 

447620 AW290951 ESTs 4.2 

426647 AA243464 Hs.294101 piB-&«eO leukemia transcription factor 4.2 

432886 BE159028 Hs.279704 chromSlinaocessibifi^ooniitexl 4.2 

446237 AW270515 Ks.149596 Homo sapiens. Similar to RIKENcONA 2310 4.2 

4U 421878 AA299652 Hs.111496 Homo sapiens cONAFU11643 lis, done HE 4.2 

423337 NiyL004655 Hs.127337 axin2(condudin,axi!) 4.2 

435854 AJ27B120 Hs.4996 putaSveankyifiKepeat containing prots 4.2 

427723 AI355260 Hs.279789 histone dea^ase 3 4.2 

430122 NM.013342 Hs.233765 TCF3(E2A) fusion partner fm childhood 4.2 

45 416140 A1918035 Ks.30119S roundabout (axon guidance receptor. Dros 4.2 

428231 U17989 Hs.183105 nudear eutoanOgen 4.2 

447966 AA34Q605 Hs.105887 ESTs, Weakly similar to Homolog of rat Z 4.1 

432274 AK0OO382 HS.2742S1 hypothetical protein nj2037S;KIAA1 797 4.1 

432621 Ai298501 Hs.21192 ESTs. WeaMy simitar to T46428hypotheli 4.1 

50 408063 BE086548 Hs.42346 caldneurin^nding protein calsardn-1 4.1 

407192 AA609200 gb:sf12e02.s1 SoaresJesGs.NHT Homo sap 4.1 

426716 NK1.006379 Hs.171921 semadom^aimmunoglobuiin domain (Ig), 4.1 

427479 BE410092 Hs.178471 KIAA079d gene produd 4.1 

434194 AF119847 Homo sap«nsPRO1550 mRNA, partial ods 4.1 

55 422072 A6018255 Hs.11113a KlAA0712gen8predud 4.1 

411145 BE439553 Hs.250528 Homo sapiens, done IIAAGE?(0988B4, mRNA. 4.1 

449459 6E546846 Hs.195048 ESTs 4.1 

421662 NML014141 Hs.106552 ceil recognition mdecute Caspf2 4.1 

445309 AL157474 Hs.12504 likely orthotog of mouse Ariffldla 4.1 

60 412520 AA442324 Hs.795 H2Ahistonef8mSy.memt)6rO 4.0 

446663 BE614599 hypothefical protein M6C1 4797 4.0 

425689 W16460 Hs.24283 ESTs. Moderately simitar to reduced expr 4.0 

433647 AA603367 Hs.222294 ESTs 4.0 

^_ 426030 6E243933 Ks.108842 zinc finger protein 22 (KOX 15) 4.0 

05 418838 AW385224 Hs.35198 edonudeoiidapyrophOGphatase/^phoA 4.0 

439820 AL360204 Hs.283853 Homo sapiens mRNA lull iength Insert cON 4.0 

422634 N^^016010 Hs.1ie821 C6i^2protein 4.0 

453469 AB014533 Hs.33010 K1AA0633 prdBin 4.0 

418827 BE327311 Hs.47168 HT02t 4.0 

70 446791 AI632278 H8.19S922 ESTs 4.0 

443884 N20817 H8.194397 leptinrQoeptor 4.0 

416857 AA188775 Hs.292453 ESTs 4.0 

401519 C15000476*:8il127372791feqXPJ12163.1| 4,0 

407846 AA426202 Hs.40403 ap/p300^nteradingtransadhrator,wil 4.0 

75 431669 AA305688 HS.267G95 UDP-GdOietaGtoNAcbetBl^gdladDsyltr 3.9 

408784 AW971350 Hs.63386 ESTs 3.9 

435655 AW105663 Ks.6947 HSPC069 protein 3.9 

432615 AAS57191 Hs.55028 ESTs. Weakly simOar to 154374 gene NF2 3.9 

448804 AW512213 Hs.342849 AOP-(fi)osytafion factor-liks 5 19 

SO 430389 AL117429 Hs.240845 OKFZP434D146 protein 3.9 

432278 AL137SQ8 HS.2742S6 hypothetical protein FU23S63 19 

4148G9 AA157291 HsJh479 ufainudeini 3.9 

429163 AA884766 gb»n20a1QLs1 SoaresJiFU.T.G8C.St Homos 3.9 
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444130 AI8789t8 Hs.10526 cysteine end gtycbiMfchpnteln 2 19 

443991 NM.002250 Hs.10082 potasshimlntemediatefernalloondudanoe 19 

433339 AF019226 Hs.6036 gliobtastomaovefexpressed 19 

_ 435021 AA922192 H3.73962 ESTs 19 

5 412834 R77123 Hs.79881 Homo sapiens cONA:FU23006 lis, dons L 19 

442607 AA507576 Hs.288361 Homo saptenscONA:FU22696fis. done H 3.9 

408833 AW812232 Hs.254835 ESTs 19 

432952 AA813887 Hs.188173 Homo sapiens cONARJ12187fis. done MA 19 

456177 Kht012391 Hs.79414 prostate eptttieHunvspedSc Ets transcr 18 

10 433345 AI681545 Hs.152982 hypolhalicai protein FU131 17 18 

421687 AW161450 Hs.109201 CG^ protein 18 

434614 AI249502 Hs.29669 ESTs 18 

425910 AAB30797 Hs.1847e0 CCAAT4«x-ttnding transcription factor 18 

432370 AA308334 Hs.274424 N-acetylneursm!nlc add ptiosphate synttta 18 

15 413010 AA393273 H5.75133 transcription factor 6#e 1 (mnodrand 18 

435114 AA775483 K&288936 mitodnndrlal ritnsomal protein L9 18 

439024 R96eg6 Hs.35598 ESTs 3.8 

426991 AKD01535 Homo sapiens gDNARJ 10674 fis. dons NT 18 

427871 AW992405 Hs.59622 Homo sapiens, done iMAGE:3507281.mRNA, 3.8 

20 439871 AW162840 Hs.6641 Idnesin family memlier 5C 3.8 

416113 AI272141 Hs.83484 SRY(sexdetemiiningfegtonYH)Ox4 18 

422424 AI166431 Hs.2g8638 pmstate differentiation factor 18 

447439 AA313565 Hs.145020 ESTs. We^ similar to iOAAl 205 proldn 18 

414178 AW957372 Hs.46791 ESTs, Weakly amilar to 138022 hypoliieti 3.8 

25 427099 AB032953 Hs.173560 odd Ozften^ homolog 2 (Orosoptiila. mous 3.8 

407908 BE379758 Hs.1 10853 uncharacterizsdliemjopdeScstem/itraoe 18 

440695 AW088363 Hs^46240 ESTs 16 

429686 Ai871613 Hs.26538 Homo sapiens cONA: FU21086 lis, dona C 3.6 

447533 NM_004788 Hs.18792 tttbredoxin-l3(S. 32I(D 18 

30 411190 AA306342 Ks.69171 pmtsin Idnase 2 18 

426126 AL1 18747 Hs.26691 ESTs 18 

434263 N34895 Hs.44648 ESTs 18 

414407 AA147026 Hs.76704 ESTs 17 

432426 AW973152 Hs.31050 ESTs 17 

3S 425770 N^L014363 Hs.159492 spastic ataxia of Ctiarievoix-Saguenay (8 3.7 

432579 AF043244 Hs.278439 nudeotar protein 3 (apoptosis repressor 17 

444101 R19175 Hs.169793 ritnsomat protein L32 17 

432908 AI661696 ESTs 17 

432800 BE391046 Hs.278962 All^l protein 17 

40 429966 8E081342 Hs^83037 HSPC039 protein 17 

419972 AtiM1465 Ks.182982 golgiiv67 17 

410001 AB041036 Hs.57771 italtllrolnll 17 

432205 AI806583 Hs.125291 ESTs 17 

401197 ENSP000002292B3'ai8PC211 17 

45 420061 AW024937 Hs.29410 ESTs 17 

431103 M57339 H5.44 pleiotropiiin (heparin binding giowtti fee 17 

433577 AW007080 Hs.284192 ESTs 17 

435017 AA336522 Hs.12854 angiotensin II, type I receptor-assodat 17 

_^ 420805 L10333 Hs.99947 reticutoni 17 

50 401866 Target Exon 17 

457183 H91882 Hs.116569 DvMiindlng protein lDAX(lnMI)i&)n of 17 

427982 N]i^016156 Ks.181326 KIAA1 073 protein 17 

405387 NM.022170^Homo sapiens Wiiliams^uren 17 

449933 AW157098 Hs.324104 Human DNA sequence liom done RP1-63M20 17 

55 432388 X15218 Hs.2989 avian sarcoma viral onoogenehomd 17 

438157 AW137011 Hs.49576 ESTs 17 

451945 BE504055 Hs.211420 ESTs 17 

452721 AJ269529 Hs.301871 solute canter famBy 37 (gtycero^ho 3.6 

454229 AW957744 Hs.278469 lacrtmd proline ridiprot^ 16 

60 416974 AF010233 Hs.60667 RALBP1 assodated Eps domain containino 16 

421537 BE383488 Hs.105547 neural proliferation, differenliation an 16 

428738 NlkjL000380 Hs.192803 xerodemra pigmentosum, complem&itaflon g 3.6 

440494 BE618768 H8.7232 acetyl^enzymeAcartwxytasealptia 16 

407748 AU)79409 H3.38176 K1AA0806 protdn; SCN Orcadian Osdilat 16 

05 453049 BES37217 Hs.30343 ESTs 16 

«2089 AAS23172 Hs.103135 ESTs.Vfeakly8lnAartDSFR4.HUMANSPUC 16 

408096 BE250162 Hs.83765 (fitiydrofolate redxtase 16 

407204 R41933 Ks.140237 ESTs. Weakly slmBar to AI^1.HUUANAiaiS 16 

450849 NM.001429 Hs.25272 E1At)indlno protein p300 16 

70 436489 AJ272269 H8.121429 zira^tiindlng protein n)oc728 16 

447818 W79940 85^906 Homo sapiens done 24670 mRNA sequence 16 

424285 BE207168 Hs.144630 nudearrece|^sutifami}y2,8roupF.m 3.6 

452260 AA453208 Hs.330994 RAB9. memt)er RAS oncogene famOy 16 

420344 BE463721 Hs.97101 putaBve G protdn^xiupled receptor 16 

75 42ni5 BE24S274 Hs.180428 KIAA11 81 protein 16 

426793 XB9887 Hs.172350 HIR{til^one cell cyde regulation defte 16 

419879 Z17805 Hs.93564 Homer, neuronal Immediate eariy gene, 2 3.6 

403752 NM-002753^Homo sapiens Rilbgen^ctivale 3.6 

4185S9 AA22S048 Hs.104207 ESTs 16 

oO 438523 H68220 Hs.278177 ESTs 16 

438705 AI04g624 Hs.2B3390 EST8.WteaUyslmBvlD2109260A6ce!l 16 

443960 AI0g3577 Hs.255416 hypothetical protein FlJ219e6 15 

412513 AA322599 Hs^163 ESTs.W^8tmQartoAF1S18401 OGU 15 
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428695 AI3SS647 Hs.189999 purinei^ receptor ^amiiy A groups) 3.5 

419517 AR)52107 Hs^797 Homo sapiens done 23620 mRNA sequence 3.5 

446494 AA463276 H&288906 WW DomaiivContainho Gene 3.5 

417515 U4203 H&82237 daxla^elangteclBsia group CVessodated 3.5 

S 433891 M613792 6b:no97hOas1 Na.CGAPJ>r2 Homo sapiens 3.5 

427315 AA179949 Hs.175S63 Homo sapiens niRNA;GONADKFZpS84N0763(f as 

445525 BE1498G6 Hs.14831 Homo sapiens. SniQar to zinc finger pro 3.5 

434872 AW294020 Hs.1 17721 ESTs 3.5 

453288 AV\^83292 Hs.274412 sbnltartoyQastUplS^vsrlanlA 3.5 

10 434970 AW272282 Hs225767 ESTs 3.5 

417193 AI922189 H12B8390 hypothetical protein faJ22795 3.5 

437617 AI026701 Hs.5716 tOAAOSIO gene product 3.5 

445943 AW898533 Hs.181574 ESTs 15 

412628 A!972402 Hs.306051 hypothetical protein MGC2548 3.5 

15 452167 N75238 Hs.13075 Horrw) sapiens cDNA: i^23013tis. done L 3.5 

410467 AF102546 K3.63931 dachshund (Drosopi^) homobg 3.5 

440133 AW902312 Hs.7037 Homo sapiens done 24923 mRNA sequence 3.5 

451815 AW974911 Hs.184793 Homo sapiens cDNA: RJ21 880 tis. done H 3.5 

429170 NM.001394 Hs.2359 dua) specifidty phosphatase 4 15 

20 410082 AA081594 Hs.158311 (tash) (Diosophtla) homolog 1 3.5 

ASm BE300298 HS.S054 C6t.133 protein 3.5 

437898 W81260 Hs.43410 ESTs 3.5 

451131 AI267586 Ks^68012 fatt^add-CoenzymeA0gase.lon9H:h8ln 3.5 

439609 AW971945 Ks293236 ESTs 3.5 

25 443555 N71710 Hs.21398 ESTs, Moderately similar to A Chain A. H 15 

418610 AW245993 H5.223394 hypothetica) protein MGC2742 3.5 

409980 eE»1944 hexoldnasel 15 

425263 NIIC001197 Hs.1S5419 BCL2^nteradingUner(apoptDsisMuc 15 

404210 NM-005936M]mo salens myeldd/lymphoid 3.5 

30 442323 AW016G69 Ks.29190 ESTs 15 

452707 A1093823 Hs.45070 ESTs 15 

421437 AW821252 Hs.104338 hypothetical protein 15 

426514 BE616633 Hs.170195 bone morphogenefteprolelh 7 (osteogenic 14 

450628 AW382884 Hs.204715 ESTs 3.4 

35 446825 BE266822 H5.344097 fitaniin A, alpha (acGn^inding protein- 14 

421867 AA481078 Hs.109045 hypotheticai protein FU10498 3.4 

434954 AF16145S Ks284295 Homo sa(densHSPCd37mRNA. partial cds 14 

452908 AB001451 Hs.30965 neuRinal She adaptor homdog 14 

429922 Z9763D Hs.226117 HI histonefamBy, member 0 14 

40 448734 BE614070 Hs.326416 Homosai^ensnnRNA;cONADK|:Zp564H1916(r 14 

434293 NM-004445 H8.37g8 EphB6 14 

414222 AL135173 sotbitol dehydrogenase 14 

421674 T10707 Hs.296355 hypothetical protein FU23138 14 

438279 AA805166 Hs.154762 HIV-1 rev binding protein 2 3.4 

45 411078 AI222020 H3.182364 CocoaCrtep 14 

419749 X73608 H5.93029 spaR:/Qsteoneclin.aivcvandl(azdl^kad 14 

432302 AA345857 Hs.274307 K1AA1 442 protein 14 

407944 R34008 Hs.239727 desmocdrin2 14 

431674 AA098901 Hs.301642 Gfroteln coupled receptor 14 

50 40K91 AB023191 Hs.44131 K1AA0974 protein 14 

431578 AB037769 Hs.261587 GCN2elF2aJpha kinase 14 

440789 ABQ07857 Hs.7416 K1AA0397 gene predict 14 

434958 T9g949 Hs.303428 Homo sapiens cONARJI 4832 68, done OV 14 

452295 BE37g938 Hs.28886 programmed cefl death 10 14 

55 417229 AA975096 Hs.19522 hypothetical protein PR02849 14 

439593 BE073597 Hs.124863 ESTs 14 

407136 T64896 Hs.287420 Homo sapiens cONAFUII 533 fis. done HE 14 

421044 AF061871 Hs.101302 Human DNA sequence from done RP1-238D15 14 

427461 AA531527 Hs.332040 hypothetical protein MGC1 3010 13 

60 417315 AI080042 Hs.160450 ribosomal protein S24 13 

423392 AA195037 H5.169341 HTPAPprdteIn 13 

412652 AI801777 ESTs 13 

415172 AF079529 Hs.78106 phosphodiesterase 8B 13 

418372 AA311833 Hs.84318 replication protein A1 (7Qt(0) 13 

65 • 4479SB AW796524 Hs.68844 Homo sapleflsmicrosarnal signal pepQdase 13 

425010 T16837 Ks.4241 ESTs 13 

420460 AA262331 Hs.48376 Homo sapiens done KB-2fnftNA sequence 13 

443181 AI039201 Hs.283316 ESTs 3.3 

442064 Ai422867 HsJ8594 ESTs 13 

70 431663 NIil.016569 H5.267182 TB}<34so protein 13 

412926 AI879076 Hs.75061 macrophage myitstoytated alanine^ C • 13 

420818 AW969635 Hs.33032 ESTs, Weatdy^mDar to 1207289A reverse 3.3 

408829 NM_008042 Hs.48384 heparan sulfate (glucosamine) 30tfot 13 

408681 AW953853 H3.292833 ESTs, Wealdyslmnar to 138022 hypotheO 13 

75 436278 BE396290 Hs.5097 synaptogyiin 2 13 

424560 AA1SS727 Hs.150555 protein predated by done 23733 13 

417215 BE253181 Hs.81687 non^tastaOccaOsl protein expresse 13 

418168 Ar754416 Odc42 effector protein 3 13 

434629 AA7890B1 Hs.4029 gUoma^plilied sequenoe^l 12 

80 427398 AW390020 Hs.2041S chromosome 21 Open reading frame 11 12 

457489 A169381S H3.127179 cryptic gene 12 

434747 AAB37085 ESTs 12 

413125 BE244S89 H&75207 glyoxatasel 12 
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435581 AA351978 Hs.4943 hepatDceButar cardnoma associated prot 3ui 

447210 AF035269 Hs.17752 phosphi^xfytsertn&^pedficphosphol^ 3^ 

422522 AI02342B H3.34549 ESTs. Highly s!ni!lar to S94541 1 done 4 3.2 

423336 AI382555 Hs.127950 bnrnodom^ivcont^ning 1 3.2 

4017B5 Nh^00227S*:Hamo sapiens keraOn IS (KRT1 3.2 

441345 AW068579 Hs.7780 Homo sapiens fnRNA;cONADKFZpS64A072((r 3.2 

451404 AA46a775 Ks.62g5 ESTs,W^ similar to T17248hypotheO 3.2 

422173 BE38582B Hs.250819 phorboGtv-ika protein MOS019 3.2 

452099 BE612992 Hs^931 hypotheMpratEtoaJIOSO? similar to 3.2 

452626 BE245266 H9.301636 peroxisomal biogenesis factor 6 3.2 

419829 A1924228 K8.115165 ESTs, Moderately filniOar to PC4259 fent ' a2 

407688 W25317 Hs.37616 Human D9splioe variant BmRNA,Gafnpbto 3.2 

406214 NM_016371:Homo6apien8hydroxystBrald(1 3.2 

416292 AA179233 Hs.42390 nasopharyngeal cardRomasuscepSbility 3.2 

438714 AA814859 Hs.160074 £ST8 3.2 

433213 AW665130 Hs.137190 ESTs 3.2 

412170 D16S32 Hs.73729 very low density Upoptoteinieceptor Z2 

409340 BE174629 Hs.321130 hypolheiica! protein MGC2771 3l2 

413142 MB1740 Hs.75212 ornithine decarfaoxyiase 1 a2 

451338 AW512322 Hs.19131 transcripiion factor Dp-2(E2Fd!mefizat 3^ 

437179 AA393508 serologicany defined colon cancer antig 3.2 

418700 AI963808 Hs.66970 ESTs. Moderately simiiar to AIU5.HUMAN A 12 

438361 AA805666 Hs.146217 Homo sapiens cDNA:aJ23077 lis. ctoneL 3.2 

450546 AA010200 Hs.175551 ESTs 32 

409995 AWg60597 Hs.129206 ESTs 12 

408739 W01556 Ks.44685 ESTs, Moderately similar to 138022 hypot 312 

438821 AA62642S Hs.192375 ESTs 3.2 

403532 NKt024638dHomo sapiens bypotheticdlprot 3.2 

452941 AL1 10347 Hs.31074 fkulfogluoosaminesultohydRilase (sulfa 11 

414341 D80004 Hs7S90g KIAA01 82 protein 11 

423044 AA320829 Hs.97266 protocadheiln 18 11 

429716 Ft25685 Hs.211933 coDagen, type XIII. alpha 1 11 

444367 H54892 Hs.10974 hypothetical protein RJ22390 11 

423551 AA327598 Hs.89633 ESTs 11 

453186 AK0dl708 Hs.32271 hypdheScal protein FU10846 11 

413550 W03011 Hs.306881 MSTP043 protein 11 

431933 AI187057 Hs.132554 ESTs 11 

439778 AL109729 Hs.99384 pulathre transmeml>r8ne proteb) 11 

448044 AI458882 gbnk13e01j(1NQ.CGAP.U24 Homo sapiens 11 

422017 NM.003877 Hs.110776 STAT Induced STAT bihll)Hor-2 11 

421254 AK001724 Hs.102950 coat protein gammfrcop 11 

424339 BE257148 endoglycan 11 

445109 Aro39916 Hs.12330 ectonudeoslda triphosphate diphosphohyd 11 

425320 U29344 Hs.83190 fatty add synthase 11 

426170 BE161065 K5.167531 methy1crotonoyM:o6nzym9Acaitoxylas92 11 

408770 AW270608 Hs.170195 bone morphogenetic protein 7 (osteogenic 11 

442202 8E272862 Hs.106534 hypothetical protdn FU22625 11 

417622 AW298163 Hs.82318 WAS protein family, member 3 11 

439864 AI720078 HS.2919S7 ESTs. Weakly slmSar to A47582B«eDgr 11 

433038 AA574091 Hs.105964 ESTs 11 

447050 NM.016314 Hs.17200 • STAKMike protein containing SH3 and ITA 11 

436299 AK000767 Hs.S111 hypothetical protein FU20729 11 

408380 AF123050 Hs.44532 diubiquitin 11 

407910 AA65Q274 Hs.41296 «bronec6n teudne rich transmembrane p 11 

445098 A1050272 Hs.12305 DKFZP566B183 protein 11 

431797 6E169641 Hs.270134 hypothetical protein FU20280 11 

4372S6 AA350994 Hs.20281 K1AA1700 11 

452627 AI122843 Hs.184319 ESTs. Wealdy similar to KIAA1006 protein 11 

448913 AA194422 HsJ22564 myosInVl 11 

415079 R43179 H3.2289S hypotheScal protein FU23548 10 

408157 AA047685 Hs.62946 ESTs 10 

443337 Y07604 Hs.9235 nonnnetastetlcoells 4. protein expresse 10 

410310 J02931 H3.62192 coagidalten factor III (thrombopbslin, 10 

437546 AW074836 Hs.173984 T-boxl 10 

451134 AA318315 Hs.2S999 hypothetical protein FU22195 3.0 

423932 T95633 Hs,189703 ESTs 10 

433495 AW373784 H8.71 a!pha-2-giyooprote!n l.zfaic 3.0 

453293 AA382257 Hs.10553 ESTs 10 

428171 AA489323 H3.182825 ribosomal protdn 135 10 

BE250944 H^1835S6 soluto canter family 1 (neutral amino a 10 
NM.015865 Hs.171731 sdutocanterlantlly 14 (urea transport 3.0 

406627 T64904 Hs.163780 ESTs 10 

418259 AA215404 ESTs 10 

426110 NM_002913 Hs.166563 repDcaflon fador C (activator 1) 1 (14 10 

411089 AA4584S4 c8Ddhristoncyda24iks1 (PITSIPEpr 10 

431416 AA532718 ESTs 10 

431631 AA548906 Hs.122244 ESTs 3.0 

409251 R10723 H8.20573 ESTs 10 

434974 AA778711 eukeryoOo translation inlSation factor 10 

447147 AA910353 HS.7S432 ESr8.Wealdy8lnflartoT23482hypolheti 10 

450937 R49131 Hs.26267 ATB4ependanllntertefon response protei 10 

43Z72B NM.008979 Hs.278721 HLAdass II region expressed g8neKE4 10 

433037 NM.014158 Hs779938 H$PC087 protein 10 
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411598 


BE336654 


HSJ0937 


H3 Mstoire (8n%, member A 


3.0 




43Q26B 


AK000737 


Hs^7480 


hypothelteal pro^ FU20730 


SlO 




443725 


AW24S680 


Ks^ 


giowth arrest and DNA-damago^ucible, 


10 


5 


431155 


AW971213 




gb:EST383301 MAGE resequences, MAGL Homo 


3.0 


407355 


AA846203 


K3.193974 


ESTs. We^ sMlar to ALUUKUMAN ALU S 


3.0 




438209 


A1120659 


Hs.6111 


eryMiydrocertiQfl receplof oudear t^j^wJ 


ao 




434392 


AW9837Q9 


Hs^S0824 


Homo sapiens cDNA: aJ2343S 6s, done H 


ao 




432810 


AA863400 


Hs^3054 


ESTs 


ao 


10 


430333 


S70114 


Ks.239489 


TIA1 cytotoxic granuIe-assodalBd RNAti 


ao 


445596 


R89543 


Hs.12942 


vesicle trafficldi^ protein 


ao 




406670 


W79632 


Hs^6301 


hyp^ficd protein MGC13170 


ao 




452779 


AA416775 


Ks.47234 


ESTs 


ao 




439680 


AW245741 


Hs.58461 


ESTs. Weaidy stmilar to A35659 kmeppel- 


ao 


15 


406288 


AI74S32S 


Hs.271923 


Homo sapiens cONA: FLJ22785 fis, done K 


ao 


424395 


AA165082 


H8.146388 


niterohibule>assodsied protein 7 


2.9 




423472 


AF041260 


Hs.129057 


b(B8St cflfdnonw smpSSed se^uenco 1 


Z9 




425920 


AL049977 


H8.162209 


deudn 6 


2.9 




409927 


T69981 




gtiryc19d03i1 Stifllagene lung (93721tQ H 


2.9 


20 


447881 


BE620866 




GCijI (general oontrol of ambtMCiitsynl 


Z9 


439584 


AA838114 


H8.221612 


ESTs 


2,9 




417601 


ra4_014735 


Hs.82292 


IQAAD215 gene product 


Z9 




452268 


hlM.0Q3512 


Hs.28777 


H2A histone faiT%, msnto^ L 


Z9 




422576 


BE548555 


Hs.1 18554 


C6l-83pn)tdn 


2.9 


25 


438510 


AU)80220 


Hs.6285 


DKFZPS86P0123 protein 


Z9 


423217 




Hs.1640 


ooDagen, type Vli, alplia 1 (epidennol^ 


Z9 




4Z7581 


NM.014788 


Hs.179703 


K1AA0129 gene proctot 


2.9 




447963 


AI452973 


KS.16S900 


ESTs. Vlteddy sNIar to AUil JKUMAN ALU S 


Z9 




443015 


R33261 


Hs.6614 


ESTs. Weakly ^lar to A43932 mudn 2 p 


2.9 


30 


429351 


AK001490 


H3.200016 


hypothetical protein FLJ10Q8 


Z9 


42S294 


AF033827 


Hs.155553 


HNK-I suifiDlransferasd 


Z9 




452955 


AW390282 


Hs^llSO 


transmemtnana 7 supeifanfly member 2 


2.9 




416439 


AA180383 


Hs.118769 


ESTs 


2.9 




442315 


AA173992 


Ks.7956 


ESTs, Moderately dmilartoZNd1_HUMAN Z 


Z9 


35 


446911 


N27605 


H3.16492 


OKFZPS64G2022 protein 


Z9 


428801 


AW277121 


Ks.254881 


ESTs 


2.9 




430462 


AIS84156 


H8.105640 


Homo sapiens, done IMAGE:4139775, mRNA, 


2.9 




420552 


AK00O492 


Hs.98808 


hypotheOcal piotdn 


Z9 




426180 


AI129767 


HS.1B2874 


guanine nucteoiide tending protein (6 pr 


2.9 


40 


418222 


A)675881 


Hs.86538 


ESTs 


2.9 


433009 


AA761668 




gb:nz24c08.s1 NCI^CGAP.GCBI Homo sapiens 


Z9 




432140 


AK000404 


Hs.272688 


hypotheltoal protebi FU20397 


2.9 




434517 


AA635690 


H3.337251 


hypotheticat protein IMGC2487 


2.9 




448718 


AA220235 


Hs.153959 


hypothetical protein MGC1&436 


Z9 


45 


408196 


AL034548 


Hs.43627 


SPY (sex detemv'ning region Y)4)0X 22 


2.9 


419849 


BE041436 


Hs.93379 


eukaryotic franstaOon inifiaSon foctor 


2.9 




452256 


AK000933 


H5.28661 


Homo sapiens cDNA FIJ10071 lis, done HE 


Z9 




430375 


AW371048 


H$.93758 


H4 histone fandly, memt)er H 


Z9 




409650 


T08490 


H3.288969 


HSCARG protein 


Z9 


50 


413586 


AAg71014 


H5.75432 


IMP (inosine monophosphate) dehydrogenase 


Z9 


412719 


AW016610 




ESTs 


2.9 




438552 


AJ245820 


Hs.6314 


type 1 tfsnsmemtuane receptor (sdnire-r 


2.9 




439621 


AI200281 


Hs.123910 


ESTs, Highly similar to B34087 hypotheii 


Z9 




437050 


AA766420 




ESTs 


2.9 


55 


414256 


AW41Q03S 


Hs.75862 


IMAD (mothers against decapentapleglc, Or 


Z9 


429343 


AK000785 


Hs.199480 


Homo sapiens, Sin^ar to ej^ 3, done 


Z9 




416941 


BE000150 


Hs.48778 


niban protein 


Z9 




421594 


R4566a 


Hs^B89 


Honu) saddens cONA FLJ12978 lis, done NT 


Z9 




436213 


AA32S512 


H8.71472 


hypolhefical protein FU1tI774; K1AA1709 


Z9 


60 


436823 


AW749865 




ESTs, Wesddy simRar to 1380!^ hypotheO 


Z8 


441266 


H15968 


Ksi93845 


Homo saltans, done II^GEi350Z329, mRMA, 


Z8 




426761 


AI015709 


Ks.172089 


Homo saptens mRNA; cONA Di<FZp586l2022'(f 


Z8 




413746 


AA133243 


HS.171SS3 


ESTs. Modwatety simOa'to CLO_HUMAN C 


Z8 




457733 


AVVS74812 




ESTs 


2.8 


65 


400860 






Target Exon 


Z6 


. 436165 


At373544 


Ks.331328 


tntemiediate fiament protein syncoifin 


ZB 




433312 


AI241331 


H5.131765 


ESTs. Moderately sirnQar to 138937 ONA/R 


ZB 




431604 


AF175265 


Hs.264190 


vsairAitr nmlflin sralinf] 3S Iwxd iuvnal 


Z8 




421717 


AF230924 


Hs!l071B7 


divalent caHon tderant protein CITT A 


Z8 


70 


453912 


AL121031 




SWI/SNF related, matrix assodated, acQ 


ZB 


435703 


AW630133 


Hs^13 


GiC0O3 protein 


ZB 




431108 


AA991508 


Hs.105317 


ESTs 


Z8 




451040 


AA324743 


Ks.40808 


ESTs 


ZB 




443837 


At984625 


H3.9884 


Epindia pde tndy protein 


ZB 


75 


418198 


AI745649 


H3,26549 


KlAA170a protein 


ZB 


450447 


AF212223 


Hs^SOlO 


hypothelica) protein P15-2 


ZB 




409330 


AK001231 


Hs^940 


hypolheCcal protein FLJ10369 


ZB 




413762 


AW411479 


Hs.848 


FK50&t>ind!ng protein 4 (59kO) 


ZB 




433586 


T85301 


H8.194397 


gb:yd76d06^1 Soares tetal Over spleen 


ZB 


80 


424720 


M89907 


Ks.152292 


SWI/SNF related, matrix assodated. acQ 


ZB 


422027 


AL043100 


Hs.306319 


fstiy add amide hydrdase 


ZB 




422717 


A1557623 


Hs.1 19475 


odd indudbia RNA^inding protein 


ZB 




412843 


AR»7SSS 


H&74624 


proteto tyrosine phosphatase, receptor t 


ZB 




434237 


AF119908 


Ks^lS 


hypothetical protein RRQ29S5 


ZB 



769 
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407255 AA012992 K3.S6301 hypothetical praldnMGCl3170 U 

431472 AKD01023 Hs.256549 nucMdsUndng protein 2(6x00 Mb) 2A 

42331B AW487064 Hs.5740 ESTs W 

454144 BE280478 Hs.182695 hypo(h6ticalprotelaMGC3243 2^8 

5 452827 AI571835 Hs.55468 ESTs 18 

452040 AW973242 HS.2936S0 EST&. Weekly Mar to 138022 hypolheli Z8 

418830 BE513731 Ks.88959 hypalheOcal protein MGC481 6 2.8 

417517 AR)01176 Hs.82238 P0P4 (processing of precursor , S.cerev 2.8 

419286 AA236005 H3.128764 ESTs ZB 

10 430287 AW182459 HS.12S759 £STs.WeaMy8bnllatoL£U5.HUMU<LEUKE 18 

420911 U77413 Hs.100293 OMed N«etyi^aniiii8 (GlcNAc) 2.8 

428054 AI948888 Hs.2G6619 ESTs 18 

409504 AA3049B1 Hs.699 peptidytprotyl isomerese B (cycbphifin 18 

440080 AW051597 ESTs 18 

IS 431609 AW792792 Ks.264330 ria:/tsphIrigo6ineamUohydrolase{acidc 18 

447896 AI436124 Hs.294069 Homo sapiens CONAFU13384 lis. done PL 18 

4315K AA5056S6 Hs.6185 KIAA1557 protein 18 

442995 AA53K11 K3.288455 Homo sapiens cONA:FU2327Dfls, done C 18 

433101 AW572317 Hs.12082 Homo sapiens inRNA;d}NAOKFZpS66l203{fr 18 

20 449052 AW0295O7 Hs.161102 ESTs U 

418663 AiawilOO Hs.41690 desmoooRin 3 18 

442013 AA506476 Hs.10600 Human ONA sequence fiom done RP11-353C1 18 

449051 AW961400 Hs333526 HER2 receptor tyrosine Unase(c«TW 18 

453038 AVW92415 Hs.20509 HBV pX assodated prol^ 18 

25 419175 AW270037 KIAA0779 protein 18 

426788 U66615 Hs.172280 SWl/SNF related, matrix 8S8odated,aca 18 

421091 W22821 ritnsomal protein 126 18 

413435 X51405 Hs.75360 carfaoxypeptidase E 18 

432651 AW973744 Hs.293100 ESTs 18 

30 452700 Ai859390 Hs.288940 fiv&^ transmerrtbrane protein MBS 18 

454148 AW732B37 Hs.42390 r^sopliaryngeal cardnoma suscepSfalBly 18 

419452 U33635 Hs.90572 PTtapmtein tyrosine idnase 7 18 

425491 AA8d3316 Hs.255221 ESTs 18 

414556 AW975063 Hs.343443 ribosomai protein L36 IB 

35 435126 AI393666 Hs.42315 p10-bimSng protein 18 

435602 AF217515 H&283532 uncharacterized bone marrow protein 6M03 17 

441128 AA570256 ESTs. We^ similar to T23273hypollietl 17 

440340 AW895503 H8.12SZ76 ESTs 17 

410339 Ai916499 Hs.29825B ESTs 17 

40 430542 AI557486 Hs.119122 libosomal protein LI 3a 17 

406789 AI041403 ribosontil protein 129 17 

424130 AL050138 Hs.140945 Homo sapiens ifRNA; cDl^ DKF2p586L141 (fr 17 

449518 BE395253 Hs.30861 ESTs 17 

410099 AA081630 KIAA0036 gene product 2.7 

45 422000 M30599 Hs.110637 trameoboxAlO 17 

407110 AA018042 H3^2085 Prader-Wilii/Ange)mansyndnm« 17 

415443 T07353 Hs.7g48 ESTs 17 

432745 AI82ig26 gtnit78fD5J(SNCI.CGAPJ>|[3 Homo sapiens 17 

409299 AA04S650 Hs53125 smaOnudearribonudeopreteinD2polyp 17 

SO 424369 R87622 Ks.28714 KIAA1 831 protein 17 

417831 H16423 H3.82685 C047 antigen (Rtwelated ant^en. integr 17 

424723 BE409813 Hs.162337 protein arginine N^neltiyttransfeiase 3(ti 17 

451428 AW083384 Hs.11067 ESTs,HigMy8lnflartoT46395liypotl)eO 17 

400283 Eos Control 17 

55 435750 AB029012 Hs.4990 KIAA10B9 protein 17 

443804 AL135352 Hs.2S5883 ESTs. Weaidy simitar to 138022 tqrpoOiaS 17 

421709 AA159394 Hs.107056 CEM protein 17 

421077 AK000061 Hs.101590 Itypottietlcal protein 27 

416359 AU)42210 Hs.1649d hypothetical protein DKFZp762N231 6; KIAA 17 

60 442104 L20971 Hs.188 phosphodiesterase 4B,cAMP'«pedfic (dun 17 

411605 AW006831 ESTs 17 

437133 AB018319 Ks.5460 K1AA0776 protein 17 

407236 W79485 Hs.173g80 nudear matrix protein NMP200 related b 17 

432125 AW972667 Hs.183006 Homo 8apfensd)NAFU1 2300 fts. done MA 17 

65 427657 AV652249 Hs.ie0107 polymerase (DNA directed), beta 17 

445636 AW105401 ribosomal protein L29 17 

443801 AW206942 Hs.253594 intron of:tr{dK)rt)inophalangeat syndro 17 

420223 N27807 ribosomal protein L4 17 

427515 T79526 Hs.179516 Integral typo I protdn 17 

70 447673 AI823987 Hs.182285 ESTs 17 

411960 R77776 Hs.18103 ESTs 17 

433212 BE216049 Ks.121820 ESTs 17 

448244 BE613416 Ks.336425 Homo 8aptens.doneMGC:17296,mRNA, com 17 

441551 AA318224 Hs^6141 ESTs 17 

75 421883 XS5079 Hs.1437 ghioostdasQi alpha; add (Pompe disease, 17 

411562 AL050201 Hs.70789 hypothetic^ protein DKFZp586E1923 17 

434851 AAB06164 Hs,116502 ESTs 17 

442875 BE623003 Hs.23625 Homo sapiens done TCCCTA00142mRNA8equ 17 

421109 U2B32 HsJ01842 AT-tdnding tjanscripOon fador t 17 

80 432450 AI990739 HS.1581B4 ORF 17 

426174 AA5479S9 HS.115B38 ESTs 17 

445919 T53519 Hs.334692 hypotheflcal prdein M6C14141 17 

423115 AA421973 Hs.169119 EST6.WBddydmilartoT25731 hypolhsQ 17 
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431222 X56777 Hs^73790 zona peDucUa glycoprotein 3A(5pennfe 2.7 

451558 NM_001089 Hs.26630 ATP-tiindins cassette, sub-fiamily A (ABC1 Z7 

431615 AW29S859 Hs.235660 ESTs 2.7 

^ 418649 AI098485 Hs.169341 ESTs. Moderately similar b S65657 alpha Z7 

5 431122 A1267593 H5.2S0535 Hono sapiens mRNA;cONAOKFZp434N241 2 {f tl 

438882 AA354489 Hs.222103 EBP50-PD2lnteractDrofS4kD Z7 

424954 NM_000546 Hs.184& tumor protein p53(l>Fraumenl syndrome) 2.7 

458082 AW978B11 Hs.314451 ESTs. Weakly shnnar to ALU INHUMAN AIA)S Z7 

427087 BE073913 Hs.173515 uncharadedzed tiypolhalamus protein HTO 2.7 

10 412524 AA417813 Ks.44208 hypothetical protein FU23153 Z7 

435176 AA744875 Hs.189413 ESTs 2.7 

419390 M701162 Ks.g02Q7 hypotheScalpral^MGCIIIdS 2.7 

428013 AF151020 Ks.181444 hypothelica] protefai 2.7 

441094 1/33819 H3.7647 MYC-assodatedzbic finger protdn(puri 2.7 

IS 451356 AA748418 Hs.164577 ESTs 2.7 

408461 AB0377S6 Hs.45207 hypothetical protelhKlAA1335 Z7 

418516 NM.006218 Hs.65701 phosphobrasHUe^ase. catalytic al 2.7 

423476 A1J035633 Human DMA sequence from done RP&-1046G1 Zl 

444034 AL161957 Hs.10177 pteckstrin homology domain Interacting p 2.7 

20 404913 NM.024408*MnTO sapiens Notch (DrosophO Z7 

407378 AA299264 HsJ7776 ESTs, Moderately dmDar to 138022 hypot Z7 

441054 AA913591 ESTs 2.6 

428500 AI81S39S Hs.184641 fatly acid desaturase 2 Z6 

430512 AF182294 Hs.241578 U6 snRMA-assodated Sm^ protein LSm8 2.6 

25 433301 AW2962&0 Hs.152016 Homo sapiens cONA:FU22140 lis. done H Z6 

444381 BE387335 Hs.283713 ESTs, Weakly slmHar to S64054hypo1hefi Z% 

443666 At458179 Hs.41528 ESTs, Weakly similar to neuronal thread 2.6 

427376 BE51S037 Hs.177556 mdanoma enagen, family D. 1 2.6 

439414 NM-.001183 Hs.6551 ATPase. H transportng, lysosomal {vacuo 2.6 

30 421866 M24470 Hs.i435 guanoshie monophosphate reductase 2.6 

421605 8E440108 Hs.106127 RNA polymerase 1 16 kDasubunil 2.6 

426797 AW93625B Hs.342849 ADP-ritxsylaGon Cactor-«ke 5 2.6 

410935 BE067395 Ks.66881 dyneln. cytoplasmte, intennedlate polype 2.6 

4078B7 AA579568 H5.41072 serine (or cysteine) proteinase bthitjJto 2.6 

35 430341 NM_00634B Hs.239831 golgi transport complex 1 (90 W)a8ubuni 2.6 

424956 AW19B103 Hs.18724 ESTs, Weakly slnular to granule cell mar 2.6 

452748 AB011128 H5.30512 Homo saj^ensmRNA for KlAA0556p(oteln, 2.6 

408212 AA297567 Hs.43728 hypothetical protein 2.6 

407893 BE408359 Hs.43621 Homo saf^ens, Similar to hypotbeOcalpr Z6 

40 425976 C75094 Ks.334514 NG22 protein 2.6 

448424 AW009892 Hs.31924 ESTs 2.6 

420164 AW339037 Hs.24908 ESTs 2.6 

443444 AW952619 H8.17235 Homo sapiens dona TCCaA00176mRNA8equ 2.6 

412774 AA120865 Hs.23136 ESTs 2.6 

45 445922 A1337316 Hs.147998 ESTs 2.6 

429925 NM_000786 cytochrome P450. 51 (lanosterol 14-alpha 2.6 

419222 AD001S28 Hs.89718 spenntoe synBiase 2.6 

417327 NM.004922 H&81964 SEC24(SLcefevlsiae)fetatBdgenefarrll 2.6 

420337 AW295840 Hs.14555 Homo sapiens cDNA:FU21513fis, done C 2.6 

50 408743 AL110246 H8.47367 KIAA1785 prOteh ZB 

434011 AW953437 HsJ5466 doneFlfi5214 2.6 

434976 Aig63821 ESTs 2.6 

417454 NM-000202 HS.1724S6 Uannato2«ultotase (Hunter syndrome) 2.6 

448789 6E539108 Hs.22051 tiypofhelto8tprotdnMGC15548 Z6 

55 441049 WB8920 Hs29341 hypothettod protein FU22376 2.6 

437271 AL137445 Ks.28846 Homo sapiens mRNA; cONA DKFZp5660134 (f^ 2.6 

425017 AL119305 Hsi6409 ESTs 2.6 

440333 AI378424 H3.2B8761 hypothetic^ protein FU21749 16 

434551 BE387162 HsJ280B58 ESTs. Highly simnar to A35661 ONA ends 2.6 

60 413132 N(iL006823 H3.75209 protein knuse(cAMP-dependentcaialyll Z6 

409619 AK001015 Hs.55220 Ba2-assodatedathanogena2 ZS 

444550 BE2S0716 Hs.87614 ESTs 2.6 

446258 AI283476 H8.263478 ESTs 2.6 

440191 AI990417 tubulin, beta 5 2.6 

65 453187 AI161383 Hs.34549 ESTs, Highly simiter to 8945411 done 4 Z6 

412133 U83460 Hs.73614 sdute carrier family 31 (copper transpo 2.6 

426841 A1052358 Hs.193726 ESTs 2.6 

435937 AA83C893 Hs.119769 ESTs 2.6 

425960 AW41C646 H3.164849 hypothetlcd protein DKFZp434H247 16 

70 456161 BE264645 Hs.282093 hypotheQcal protein aJ21916 16 

456236 AF045229 K5.82280 regulator of G-proteinslgnalHng 10 16 

43502$ T08990 Hs.4742 anchor attadtmant proteto 1 (Gaalp^ yeas 16 

416751 T48130 Hs.5897 Homo sapiens mRNA;c(mADKFZt)5B6P1622(f 16 

458948 AA009716 Hs.42311 ESTs 16 

75 424762 AL119442 Hs.163684 eukaiyo&etranslaOontolQaftonfedDr 16 

449456 AI80S078 Hs.208261 ESTs 16 

432409 AA806538 H3.130732 KIAA1575protdn 16 

425184 BH278288 K5.155046 loilheranblood group (Aubenierb antigen 16 

442338 AW340958 Hs.7572 ESTs 16 

80 445622 AF106697 Hs.12971 thtoredoxlnradudasebeta 16 

440409 AW294316 ESTs 16 

444610 AI174763 gb:HA2S01 Human fetol Over cONAQnaiy 16 

425863 U43604 Hs.159g01 Human unUenSfiedmRNA.p8rtidsequenc 16 
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436972 AA284679 Hs^5640 c!aiidin3 2.6 

426809 BE313114 H3.29708 ESTs 2.6 

431183 KM.005655 Hs.250896 KOa (Ly&V^p^Leu) endoplasniic reSc 16 

457638 AI792670 H5.144405 ESTs 2.6 

442821 BE391929 Hs.8752 transmenibraro piol^ 4 2.5 

435522 N64214 Hs.9774 synovid sanxima translocaSon gene on c 2.5 

406764 AA429825 Hs.343443 ribosoma) piotein L38 2.5 

422032 AA47G966 Hs.110857 polymerase (RNA)D1(DNA directed) poly 25 

409125 R17268 H5.343567 axonal transport ol synaptic vosides 23 

416801 X98834 Ks.79971 sal (OrosophnaHikB 2 2£ 

446091 AW022192 ESTs 2.5 

419848 AA251242 Hs.103238 ESTs 15 

415015 NM.002109 Hs.77798 NslidyMRNA synthetase 2.5 

431836 AF178532 Hs.271411 bela^te APP-deaving enzyme 2 2.5 

414987 AAS24394 H5.294022 hypoOwQcal prot^ FU14950 Z5 

447087 AW403870 HsJ01872 hypoflietlc8lprate{nMGC4840 2.5 

4S6050 R79445 Ks.76230 ribosomd protein S10 2.5 

426178 AA371409 H5.105664 hypotheOcal protein FLJ22294 Z5 

414528 AA1489S0 Hs.186836 ESTs 2.5 

426494 AL1 19528 H3.170098 KIAA0372 gene pnxtud 2.5 

407656 AW747988 Hs.37443 Komo sapiens mI^cONAt}KFZp434B21 19 (f Z5 

437330 AU53944 Hs.50115 Homo sapiens mRNA;cDNADKFZp761J11 12 (f 2.5 

449139 BE288315 K3^3111 phenyialanina^i^ synthetase^ 2.5 

448153 Y10805 K3,20521 HMT1 (hnRNPmelh^lranslOTe,acerewl 15 

417388 AL037228 1^.82043 D1 23 gene product 2.5 

409420 Z15008 H5.54451 laminIn.gamroa2(nIceln(100kO).k8AnI 2.5 

408796 AA688292 Hs.170345 ESTs 25 

407627 AI419020 Hs.62820 chrofnosome 6 open reading tionel 2.5 

421535 AB002359 Hs.105478 phosphodlnsyiromiyigiycinairidinesynlha 15 

415862 R51034 Hs.144513 ESTs 15 

426418 1^490464 ite.169825 ooHagen. (ype IV. alpha 5 (Alport syndr 15 

436557 W15573 Hs5027 ESTs. Weakly similar to A475B2B^«Ugr 15 

413517 N76712 Hs.44829 ESTs, Weakly similar to 138022 hypotheti 15 

449S0O AW956345 Hs.12926 ESTs 15 

457090 AL080243 Hs.180920 rfbosomal protein S9 15 

445389 NM_016831 Hs.12592 period (Drosophtia) homolog 3 15 

410006 AW73230B Hs.57783 eukaiyotk: translation IniSatton factor IS 

452908 BE207039 H8.306123 MAGEF1 protein 15 

450341 N90956 Hs.17230 hypotheiteal protein FU22087 15 

419430 A!878942 Hs.90336 ATPase, H transporting, lysosomal (vacuo 15 

440099 Ali)80058 Hs.e909 DKFZP564G202 protein 15 

452518 AA280722 Ks.24758 ESTs. Weakly similar to 138022 hypothec IS 

431843 AA516420 ESTs. Weakly similar to 138022 Ivpotheti 15 

424795 AW1028SO Hs.153177 ritnsomal protein S28 15 

448019 AW947164 HS.19S641 ESTs. Moderately sinillar to 138022 hypot 15 

450704 H85157 Hs.40696 ESTs IS 

400262 Eos Control 15 

435266 AK001942 Hs.4863 hypothetk:8l protein DKFZp566A1 524 15 

418039 R06859 Hs.l93172 ESTs. Weakly slmflar to 138022 hypothafi 15 

429491 NKL012111 Hs,204041 chromosome 14 open reading frame 3 15 

456844 A1264155 Hs.152981 CDP-toylglycerol synthase (phosphafida 15 

413781 J05272 Hs,850 IMP (iix)^ monophosphate) dehydrogenas 15 

437450 AL3901S4 Hs.26954 Homo sapiens mf^ cONA 0KFZp7626123 (fr 15 

424005 AB033041 Hs.137507 vang (van gogh. OrosophiIa)Jike 2 15 

429332 AF030403 Hs.199263 Ste-20 related kinase 15 

438572 BE267017 Hs.6315 acetytserotoninCMnethyltFansferase^ 15 

425692 D90O41 Hs.155956 N-acety(transf erase 1 (arytairine N-aoety 15 

431168 W05656 Hs,159755 ESTs 15 

408803 AF248953 Hs.24049 golgi autoanfigen. golgin 8utrfan% a. 2 15 

414482 S57498 Hs.76252 endotheltn receptor type A 15 

437762 T78028 Hs.154679 synaptotagmln I 15 

425722 AI659a76 Hs.97031 hypotheScal protein MGC1 3047 15 

433881 AI004377 Hs.200360 Homo sapiens cDNAnJ13027 lis, ckme NT 15 

451585 AK001171 Hs.326422 hypothetk»i protein MGC4549 15 

424959 NM.005761 Hs.153937 aclivaledp2todc42HsUnase 15 

407151 H25836 Hs.301527 ESTs. ModeratdydmHar to unknown [H.S 15 

446054 AB014537 Hs.13604 KIAA0637 gene product 15 

452556 H78517 Hs.33905 ESTs 15 

421026 AU>47332 Hs.101067 GCN5 (general control of aminos synt 15 

407687 AK002011 Hs.37S58 hypothetical protein FU1 1149 15 

433683 AI817723 Hs.22678 hypotheScal protein FU21 832 15 



TABLE 688 

Pkey: LMqueB)sprobesetMen81l8rnumber 
CAT number (3ene cluster mmiber 
Accession: Genbank accession numbers 



Pkey CAT Number Aocesston 
415989 
429220 



10194 1 BC013389BCO17398AI023S4dAA191424Al267700AMe9833AW958465AWg53397AA172056BE94O298BF^ 

8G2B5637A1720344BF541715AA35S088AA172236 
15103 7 AW341473 AA448195 AW207206 Ai951341 AAS692S9 



wo 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



449625 249224.1 BI91 8168 AW779760 N48874 AI37S997 AA235370 BG699146 AI913631 A1498402 AI016320 AA323193 R4g021 D59344 BG986750 N45526 
BG986917 T61382 R493S1 R45432 A1203107 R3S004 F07491 R2S094 R35360 

412446 63457.1 BO021735A(669212AL120184Af769949BE7O1002eE184363BE61d031 BG702238BR)90049BF963318er961912BP943013AA934514 
AA151245 BF9S0659 AA9S79Q7 Z41449 BF908059 6^908053 BF908049 BE699424 6F908060 6F962832 6F9S2020 BF963134 BI035538 
8F5080S2 6F908057 BF090026 6F943158 AI832924 BF512340 BF9S2021 BF960776 BF943437 BF942847 Ar768015 F09776 F04816 F02721 
M102645AI633838AAB17929BF947001B10354486E935876AVV890837AVV8986046F9574056F96M 

433404 7392.1 80022980 BF224081 BG149908 AW672842 6ES70687 A1702161 AW341832 BE222S03 N71836 AI026081 AW9S3116 AW083132 AI979261 

A\^5377 AI423298 A1540707 AW675518 AI032611 A1818044 A1299508 AI911385 AI270418 BE219257 BM141K^ 
N59447 Z39438 T32982 R54110 BF115783 F09044 BF808433 
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Z11692 X51466NM.001961M19997BI224253BG830478AU122147AU123437BG113591 BG752624BEra6804BJ868669BG33721fiAWR?QQis 
Sfo^2^?^^^^^^^^^^^"^^^^735BF03884lBF727115BC00^^^ 

^'^"^ A'313235 BG831724 BF869862 BG998348 BI011834 BF88B337 BF8^!bR^80aK5 F01241 
BF805719 BG876487 AW498536 BF968866 BG998849 AA248724 BG8a202 BG756456 BG032392 BI859287 BmS^^^^^s 
IfSSS.^^i^.^SSi'X ^337621 F33111 BF344301 BGlcS^^ B^Sfw^f^' 

6M007368 BFS69385 BE772007 BI199487 BF761700 BI261519 BF944452 BF698506 AI038390 BM044934 AW381 142 BG743618 6^69208 
BE893973 BI015047 BF886479 BF761350 BE769769 BG7661 17 BF847355 BE397834 AV^mi E^1» BEO^ 
BR»9W0 BI000274 BG255503 BG674499 BG774t74 B(015084 ^^-^mui Dinx»i£> ebuszsw BP1B3193 BG180964 

2S^2iJI^n!*15?!lf??^° ^"^^^^ BM460488 W56819 B1042183 B6977498 BE767451 BF870009 BG477472 R61137 R14274 R20259 
SS?^SJ?^'^^^^^^'^^™^^^^^^^^^23683AW006S31 B^^^ 



8^9388 AI345516 BG391657 BE708967 BG026034 BE261703 H56716H65572HS3801 T48830 R96953 R969e9R397076E667^AA^10 
N27807AA256634BE276324 

AI221207 A1659856 AA913591 AI220302 

NM_000786 U23942 BI601050 BG771947 BG773455 BI561558 BI460206 BG714348 BM126447 AU12941 1 AU129401 AL1 19303 AV7243M 
AU099323 AU127089 BG701614 BM150364 D55653 AV702235 BE090408 AUlS^S AW149651 B^754^^^^^^^ 
S.^^22^^'^^^^^^'^^^^^^^^"^28040AI693998AW615411AW0^ 



U47727 BG773392 N58531 AA226460 BG679564 AA292178 W56883 BE813131 C03646 B6287974 AA386261 H47580 R4885B TOemiZ^ 
B1869421 BG502073BG425943VW7290W31363BE004451BF208311BI048717 N78122AA^ 
AI9B3821AA657925AA935438AW975068 ~^^^#Mla«KW 

8000222 AL136871 NM.032261 BC009497 B^M61705 BG470749 BG826905 B6285127 BI253235 BE397026 BG704967 AW961225 BM3S2B17 
SS^A^^'^^^^^'^31549AA326504BC019924BG257230BI^ 
BE260391AA599M2BF6B8779AI75S222AA4S2272AVW41170AU156565 F28259T16319AA362506^^ 
Ay729565BE268285BE867433BM011110BG285856AI922439BE270975AL119^ 

^1S55?^HS21^'^^^^^ AA243218AW498837 AA748311 AI754395T1572B AA775369AA858195 W73625AA216mMre«^^ 
A1061112 AI783806AL569622 AA070466BF229936 N58159 H8028B N32598 HB0293 H80279 AI581253 AV^BsJSsBI^^^^ 

S?nH?^S^??l?®^*^^ BE293868BF792321 81^92258 BG107176T56604BM193566 H0^8M61804S 

AAW7083B!224818AW204722A1309186AI215122AI200785K^ 



D20132T55734 T60692 R76858AA022948AW770291 R50934AA205918 R97811 H40328l^52AA936029AI813809«^m7XimMs 

^^c^P^S?F ^^^"^ ^'223147 AW381001 AA448464AW^^ 

All 7 *»7o3 R1 Z27 1 KB3569 
AI274757AI559500AW022192 

S^^JSS?^^'®®^*^^™*^**^^^ °S0181 D81004 D60382C1S876N91070C14815C15068 D80763C14818 
LlaiDl [jou1o4 uoUo% 

bII^ SI"*® ^^^^^^ "^^^ ^'^^253 BG830478 AU122147 AU123437 BG113591 BG7^^ 
8NW16525AI5604O9Alie2B66AI909178BFB49556AA37l735BF038841BF7W^ 

Bra486 BI517580BG876486BW11828 AI31323SBG831724BF869852BG998348 BI011834 BF888337 BF89^FM23TOAVV8Sr)1241 

^^^24 8G829202 BG756456 BG032392 BI859287 BMO^B^sSs^^ 
BE166758BM4S2445A?937808AVVD26l28N236e4 A\ftW60^ AI337621 F33111 BF344301 BG105450BG38^BFW95^^M??^^ 
BM007368 BF56938S BEn2007 81199487 BF761700BI2B1519BF944452 BFB98506 AI038390BM^934AVV^ 



Ptey. Unique number corresponding to an Cos piobesel 
Ref: Sequence souree. Be 7 digit numbers in this colunm are GentenkMenSSer^^^^ 
sequence of human chmnosome 22.* Dunham I. etd. Nature (1999) 402:489^95. 
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strand: Indtcatas ONA strand from which wsns wcro prccficted. 
NLpodOon: tndicfltfistnideoOde portions of predctedexons. 





Ref 


Strand 


N^posiOon 






Minus 






81 76834 


PtlK 

nus 




401451 


6634068 


Minus 


119926-121272 


401747 


9789672 


Minus 


118596.116816.119119-119244.119609-11976 


404632 


9796668 


Rus 


4509&45229 


403046 


3540153 


Minus 


5S707-5585giS638»56511 


404922 


7341893 


Rus 


13248-13428 


401519 


6649315 


Rus 


157315-157950 


401197 


9719705 


Rus 


176341-176452 


401B66 


6018106 


Rus 


73126-73623 


405387 


6587915 


Minus 


3769^833^5708^895 


403752 


7678857 


Rus 


33704-33828 


404210 


5006246 


Rus 


169926-170121 


401785 


7249190 


Minus 


165776-16599^166169.166314,166408-16658 


406214 


7342036 


Rus 


8632086523 


403532 


6076842 


Minus 


8175081901 


400860 


9757499 


Minus 


151830.152104,152649-152744 


404913 


7341740 


Rus 


97717-97976 



Table 69A (isls about 200 genes thai are upregutated in human umbOica) vein endothelial ceDs (KUVEQ compared to nonnal body tissues. These genes v/ere seleded from 59680 
piDbesets on the Eos/Affymetrix Hu03 GenecKp array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (Al), a nomidized 
value reflecting the relative level of mRMA expression. 

Table 70A lists at}Out 146 genes that are downregutated using the mean of vessels (veins and arteries) compared to the mean of HUVEC. These genes were seiedad fro^ 59680 
prebesets on the Cos/AfIymetrlxHu03GenecNp array. Gene expression data Ibr each probeset obtained from thb analysts w» 
value reflecting relative level of mRhlA expression. 

TABLE 69A: 

PIcey: Unique Eos probeset idenSfier number 

ExAocn: Exemplar Accession number, Genbenit accession number 

UnigenelCh Uragene number 

UiHiene Title: Unlgene gene title 

R1: RaOo of the mean of HUVEC Al's to the mean of the nonnal body tissue APb 



Rcey ExAccn UnlgenelD Unigene Title R1 

424806 AA382523 Hs.1D5689 MSTRSI protein 1.51 

418038 Z37976 H8.83337 tatenttransfomnlng growth factor beta b 1.94 

444381 BE387335 Ks.283713 ESTs^Weddy similar to S64054hypothe0 1.60 

442802 AL133035 H3.8728 hypothetical prolan DKFZp434G171 1,73 

417944 AU077196 Hs.82985 collagen, type V, alpha 2 1.48 

410480 R97457 Hs.63984 cadherin 13. Hcadherin (heart) 1.91 

452516 AA056630 Hs.29789 RNA POLYMERASE I AND TRANSCRIPT REl£ASE 1.S9 

425139 AW630488 HS.2S338 protease, serine, 23 ^ 1.63 

402463 NM.014624:Homo sapiens 8100 caldunvUnd 1.50 

423798 AR)47033 Ks.132904 solute carrier family 4, sodium bicaibon 1.31 

412568 AI878826 Hs.74034 caveoSnl.csveOlae protein, 221(0 1.53 

419948 AB041035 Hs.93847 Nft4.016931:HomD sapiens NADPH oxidase 4 ( 2.06 

447526 AL048753 H8.303649 BmaS Inducible cybldneA2 (monocyte ch ^10 

446500 U78093 Hs.15154 susM-repeatoontainingproteiniXchrom 1.67 

441457 AW998651 Hs.43836 ESTs 1.30 

408296 AL1 17452 Hs.44155 DK{7PS86G1517 protein 1.61 

418994 AA296520 Hs.6g546 selecfin E (endothelial adhesion molecul 1.87 

413795 ALO40178 Hs.142)03 ESTs 1.64 

441689 A1123705 Hs.289068 ESTs 1.44 

412567 AI750979 Hs.74034 Homo sapiens dona 24661 mRNA sequence 1.75 

424432 AB037821 Hs.146858 protocadherin 10 1.65 

432673 AB028859 HS.Z78605 Onai (Hsp40) homotog. subfamily B. membe 1.31 

418883 U90908 H8.87241 hypotiietical protein from clones 23549 a 1.42 

453085 AW954243 KtAA0251 protein 1.47 

438887 R68857 Hs.265499 ESTs 1.49 

436729 BE621807 transmembrane 4 superfantHy member 1 1.91 

400494 ENSP00000238970'K:iG30(FrBgmenQ. 1.34 

442506 BES66411 ESTs 1.54 

425023 AWSS6689 Hs.154210 endofheOai diftierenfiation. sphfaigoSpi 1-55 

414476 AA301867 Hs.76224 EGR^ontainingtibiAMikeextraceliuta Z65 • 

452933 AW391423 Hs.288555 Homo sapiens cDNA:FU2242Sfis. done H 1.62 

422389 AF240835 Hs.115897 protocadherfn 12 1.38 

417124 BE122762 H3.25338 ESTs ^13 

433681 AI004377 Hs.200360 Homo sapiens cONARJ13027 Us. done (It 1.31 

419983 W55956 Hs.94030 Homo sapiens ndmA:cDNADKFZp586E1 624? 1.54 

414776 AA1S5598 Hs.212839 hypotheScai protein FU1419S; KIAAm4 1.51 

450534 AI570189 HS.2S132 KIAA(M70 gene product 1.43 

449618 AI076459 Hs.15978 KiAA1272 protein 1.42 

446098 AWD72215 Hs.a08470 ESTs 1.53 

413281 AA861271 Hs.222024 transotption factor 8MAL2 1.40 

448674 W31178 Hs.154140 ovar^spediie acidic pTOtaln 1.47 

407081 X97748 gb:R8aiptensPTX3 gene pforootor region. 1.33 
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434846 AW29S389 Hs.119766 ESTs 1.29 

408570 AL046406 Hs.103483 K1AA1798 protein 1.47 

436772 AW975688 metaOothionein 1E (funcOonat) 1.B1 

453789 AA628517 Hs.1ie502 ESTs 1.41 

S 424259 AK001776 Hs.143954 hypoQieGcsi protein RJ10914 1.35 

441020 W79283 Hs.35962 ESTs 1.62 

401234 mitogen-acOvated protein Unase 8 inter 1.48 

420174 AI824144 H3.199749 ESTs 1.75 

445684 AK001696 Hs.13109 Ran binding protein 11 1.43 

10 418893 A)750878 H$.87409 Onombospondin 1 1.89 

426535 AU077012 Hs^85B2 ESTs^W^MIarlDidiiquiteusTPRin 1.77 

412330 NM.005100 H&788 A ldnase(PRKA) anchor protein (gravln) Z02 

412646 NM.006825 Hs.74368 transmembrane protein (63kD), endopiasmi 1.64 

434523 AA703709 Hs.23410 transiocase of inner mitxhondrialmembr 1.26 

15 433376 AI249381 Hs.74122 caspase 4. apoptosis-ieteted cysteine pr 1.49 

422099 AA156022 Hs.111518 tvpoOietical protein 1.56 

449394 AA004368 Hs.18160 Homo saptenscDNAFU11550fis, done HE 1.54 

445119 AF035121 Hs.12337 kinase Insert domain receptor (a type II 1.40 

438142 T90309 Hs.269651 ESTs 1.45 

20 437478 AL390172 Hs.317432 brandted chain ambtotransferasa 1, cytos 1.44 

422809 AKD01379 H8.121028 hypoQmlical protein FU10S49 1.42 

453459 BE047032 HS.2S7789 ESTs 1^1 

442711 AF151073 Hs.6645 hypothetical protein 1.42 

428428 mZim Hs.184298 cydin4ependentldnase7(homoiogofXe 1.32 

25 400288 X0&256 Hs.149609 Integrin. alpha 5 (fHininecijn roceptor, Z67 

417640 030857 Hs.82353 protein C receptor, endothenBl(EPCR) 2.47 

407252 AA659037 Hs.163780 {A8075832}KIAA19S2 protein (Homo sapien 1.76 

418825 AA228881 Hs.22394 hypotheticai protein FU 10893 1.40 

417805 U38545 Hs.82587 pttosphoitpaseDl.phophaQdyldwtlne^pe 1.35 

30 426828 NM.000020 Hs.172670 adivln A receptor type IMikel 1.55 

415714 NM_0022gO Hs.78672 laminin, alpha 4 1.90 

446984 AB020722 Hs,16714 1^ guanine exchange tactor(GEF) 15 1.71 

421778 AA428000 Hs.283072 actin related protein 2/3 complex, sutnin 1.88 

421218 NM.000499 Hs,72912 cytochrome P450. subfamily I (aromatic c 1.75 

35 452298 AI039243 Hs.278585 NM-024756; Homo sapiens hypothetical pro 1.86 

434598 T59538 gfaYb65g1 2.81 Stratageneovaiy (937217) 1.52 

412351 AL136960 Hs.73828 T-ceH acute lymphocyfic teutemla 1 1.61 

433374 AI821409 Hs.304471 EST 145 

429113 D28235 (te.196384 prostaglandin^operoxide synthase 2 (p Z14 

40 423099 NM.002837 Hs.123641 protein tyrosine phosphatase, receptor t 1.50 

435705 AA782114 Hs.28043 ESTs 1.53 

456999 AA319798 Hs.298581 eulcaiyotic transifliion Olongation factor 1.52 

453983 K94997 H8.16450 ESTs 194 

406508 TaigetExon 1.97 

45 424503 NK^002205 Hs.149609 Integrln, alpha 5 (libronedin receptor, 232 

452924 AW580939 Hs.97199 oomptement component CI q receptor Z36 

426698 AW363332 Ks.171844 NM.006605Homo8aptenspo«ovinisreoept 1.61 

418941 AA452970 H8.239527 E1&65kO»«ssocIatod pTOtain 5 1.41 

450152 AI138635 Hs.22968 intronofVEGFR 1.70 

50 437269 AA334384 Hs.149420 ESTs 1.60 

445279 R41600 Ks.22245 ESTs 1.59 

409509 ALJ038923 Hs.322710 ESTs 1.52 

410240 AL157424 Hs.61289 $ynaptoian]n2 1.61 

424711 Nlit005795 Hs.152175 c^dtonln recepter-&l« 1.96 

55 449318 AW236021 Hs.78531 Homo sapiens, SlmDar to RlKENcONA 5730 181 

407662 BE548267 Hs.337986 Homo sapiens cONAFU10934fis, done OV 1.91 

410057 R66634 Hs^107 mulOmertn 1.85 

446727 AB011095 H3.16032 KIAA0523 protein 1.67 

412564 X83703 cardiac ankyrin repeal protein 1.51 

60 414786 AI246482 Hs.243010 Homo sapiens IBS bomolog gene MIy. me 1.71 

410278 AI554545 Hs.71832 ang]op0te8n.2 1.91 

406627 T64904 Hs.163780 ESTs 1.76 

405025 Bone morphogenetic protein 6 1.69 

422648 086983 Hs.118893 Melanoma assodated gene 233 

65 452852 AK001972 Hs.30822 hypotheScal protein RJ11110 1.78 

427912 AIJD22310 Hs.181097 tumor necrosis (actor (l|gdnd)6uperfami 1.79 

432004 BE01B302 Hs.2894 ptacentd growth factor, vascdarendoth ' 2.02 

430129 BE301708 Hs.233955 hypotheticd protein FU20401 1.93 

„^ 429276 AF056085 Hs.198812 6 protein.coupled receptor 61 1.65 

70 407704 BE31S072 Hs.78768 maiionantceaexpresslon^hanoedgene/ 1.48 

448694 AA476756 Hs.194477 E3iAiqui8nQg8seSMURF2 2.01 

434398 AA121098 Hs.3838 senmvMdbte Idnase 1.83 

416851 AW963951 Hs.85618 ESTs 1.65 

412420 AU035668 Hs.73853 bone rooiphogenefic protein 2 1.69 

75 416179 R19015 H8.79067 iyiAD (mothers against decapenteptegte, Or 1.71 

450689 A1369275 Hs.243010 Homo sapton3cONAFU14445fis, done HE 1.86 

416658 U03272 Hs.79432 fibrillin 2 (congenitd contraduid ara 1.65 

425235 AA353113 H5.1 12497 Homo s8ptenscONA:RJ22743 lis, done H 1.83 

42^ AB020643 Hs.183006 KIAA0836 protein 1.77 

80 412755 BE144306 KS.17S891 ESTs. Vfealdy similar to P4HAJ1UMANPR0LY 200 

456977 AK000252 Hs.ieS7S8 hgrpotheScai protein aJ20245 1.61 

458977 AK000252 Hs.169758 Iiypofhe8c8lprot^aj20245 1.61 

427897 NiyL017413 H8.303084 apdin: peptide flgand tor APJ receptor 1.94 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



42S34 




KS.165S09 


444409 




nS.492D5 


421340 


POTTOS 


(^.1369 


445701 




n5.13i31 


444009 




nS.135i04 


409^ 




Hs.51233 


416940 




Hs.43157 


418922 






431548 




nS.8f II 


434927 




nS.293o15 


434558 






438183 




Hs.6101 


426438 








A An?S91 4 


Hs.17404 


liDUgO 


/WVlDlOOZ 


ns.83381 




AAn<mio 


Hs.1 88746 




AI(nC0Q7 


Lis ^nnoo 






ns.i2o4o5 






HS.B2902 






Ks^1595 






Ks,76118 


AiOiAR 


AA91QftO-1 


lis 700C 




t OAQTH 

Lzuy/ 1 


Hs.188 




JU4U00 


rv.1ao340 


418506 


AAnftA9AR 




413745 




nS.r00i4 








425811 






415765 


MU fl(MA9^ 


Li- WMMil 

nS.fooZ4 


431089 




nS.iOOilO 




AtOTPflCV 
AlOrOOOr 


ns.l097w 


408669 




k4e 7MAR 


416815 




nSJWlZU 


418203 


)ffiA!U9 


nSJSofoo 


408243 


TvUf Qr 


UsR9A 

ns.Dz4 






ns,w354 




ALAatnttn 

/WIOOOZU 


Hs.41 135 






HS.4113S 




AA150797 


nS.10927D 




bu4roo4 


nS.1 03070 


^10179 




KS.Z2120 








AA7tiR(\ 




n&.16o58 


^191 9ft 




LI. 4n0F#\4 

Hs.29&5u2 


*»1 f4« 


NM_002291 


Ks.82124 






nS.23349 




Ai A^Ain 


Hs.6906 




AWz4<m2Z 


H5.95835 






Ll> 4vnAAp 

HS.Z79905 


418741 




Hs.8881 






Hs.7567 




AOniOOCQ 


Hii.118140 




bc2oUUr4 


H3.23960 




ArZ34532 


Hs.61638 


422603 


RP9A9RR7 


U» 440Ci;4 








448231 


AirniQiA 


Urn <M4C/M 


408989 


/WYOOIODD 


U- JQCAA 

nS,49w0 




MD00r4 


Ua 44 4C07 

ns.211oo7 


418738 


AVVJOIiOiM 


nS.D0B2 






Ub 4JT4«ie 


4174011 
*tl f)WO 




Hs.81892 


407975 


yilQA9R 
AD94ZD 


nS.4171o 


420256 


U84722 




400666 






431728 


NM.007351 


Hs266107 


440088 


NM.005402 


HS288757 


413132 


NM-.006823 


H8.75209 


451979 


FD6972 


HS27372 


415138 


CI 8356 


Hs.295944 


414577 


AK}56S48 


Hs.72115 


444330 


Ai597655 


Hs.49265 


422424 


Ai166431 


Hs.296638 


400289 


XO7820 


H52258 


418007 


M13509 


Hs.83169 



EST8,Wieakly similar to 08022 hypottwti 1.74 

ESTs 1.88 

decay aoceferaOng fiactor for complement 1.74 

lymphocyte adaptor protein us 

ESTs 1.90 

tumor necrosis factor receptor superfami 2.16 

ESTs 2.03 

ESTs 2.09 

novel protein i.8i 

Homo sapiens HSPC285 mRMA, pailiai ods 214 

ESTs 1.93 

tiypo!helicaIpfOt^MGC3178 2.05 

endolheSn 1 2.46 

S0X7SRY (sex tfelemiining region YHwx 220 

guanine nucleotide binding protein 1 1 2.26 

ESTs 1.93 

ESTs 2.39 

iiypollietical protein FU12604; KIAA1692 1.84 

Ihymidylale synthetase 1.70 

ESTs » 1.97 

ubiquIQn cartnxyl-temiinal esterase LI 1.93 

1^6 interacting. Idnesin^lffi (rabldnes 1.80 

phosphodiesterase 4B. cAMP^^pecIlic (dun 203 

topoisomeiase (DMA) 11 alpha (1701(0) 1.5B 

G protein-coupled receptor 39 1.61 

nucleoside phosphofylase 1.99 

mitogen-activaled protein Unase 8 inter 299 

kaiyopherin alpha 2 (RAG cohort 1 . Impor 1.76 

tyrosine kinase with Immunogloburm and 2.13 

ESTs, Wealdy similar to unknown protein Z71 

hematological and neurologk^al expressed 1.65 

platelet/endothelialcellaAiesbnmoiec Z39 

UDP-N-aoety)^ha^8lactosamine:polyp 1.77 

CDC28 protein kinase 2 1.67 

interieuldnS 2.09 

iysy{oxid3S&4ike2 2.42 

endomucin-2 234 

endomucln-2 2.34 

latexin protein Z24 

ESTs Z07 

ESTsislmilar to TRANS^€MBRANE 4 8UPERF 2.00 

ESTs. Weakly similar to S72482 hypotheS 2.33 

phosphoiipase A2. group IVC (cytosofic, 211 

ESTs 2.27 

laminin, l»ta 1 2.II 

ESTs 2.03 

Homo sapiens cONA:FU23197 lis. ckmeR 2.08 

ESTs, Weakly similar to unnamed protein 1.57 

cion8HQ0310PR00310p1 1.60 

ESTs. Weakly similar to S41044chranosom Z37 

branched chak) aminotransferase liCytos Z55 

KiAA0716 gene product 2.09 

cycflnBI 1.76 

myosin X 2.01 

hemalQpoietically expressed homeotiQX 215 

gkHuman alpha setellite and sateSte 3 a08 

ESTs 2.27 

KiAA0746 protein 1.43 

phosphoiipase A2, group IVA (cytosolic, 267 

sokite canter family 7, (catkmtc amino Z18 

Merceilularadhestan molecule 2 Z60 

KIAA0101 gene product 1.66 

endothellal ceH-spedfic molecule 1 292 

cadherfn 5, type 2, V&cadherin (vascula Z92 

NM_002425:Homo sapiens matrix metaJk)pro 259 

mullimaiin 2.72 

v-ral simian leukemia viral oncogene horn 225 

protein kinase (cAMP^ependenl cata^ 1.76 

endothelial tyrosine kinase (EtK) (BMX), 3.37 

Bssue factor palhweylnhiliitor 2 3.11 

hypothetfeai protein FU20992slmiIarto ail 

ESTs a08 

prostate ditferentlallon fector Z96 

matrix metalloproteinase 10 (stmnelysin 4.84 

matrU metafloproteinase 1 (inteistilial 11.91 



TABLE 6SB 

Pkey: Unique Eos prot)eset identifier number 
CAT number Gene duster number 
Accession: Genbank access numbers 



Pkey 



CATNumber Accession 
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PCTAJS02/36810 



453085 10017.1 



436729 6624.1 



442S06 29197J 



436772 
434596 
412564 



442006 



1239464J 

14701J 

18571J 



1239046J 



8C017336 B6716430 BG501286 B1458528 A)582223 N9S532 A1338138 AI273442 AW102617 AA831177 AA745842 AA412583 AA35S375 
BG547492 AW954243 B1765546 6G057641 A1192435 AI338935 A)31 2651 A!708679 AI19112S AI206832 AA676899 Aia78010 AI888718 AA452830 
BF445542 N69930 M71S017 BF446713 BED4^ AW7719Q9 AA907729 AI143749 Ar761290 AA890233 BF925759 AI733713 AI767257 AA814536 
W5B778 AA91 8481 BG743526 BE645242 AI02532B AI298436 A1290445 HTHIO AI475034 86740023 fimm AI340003 81602481 W38495 
AI183314 AI927418 80397181 AAS78310W19369 W56507 C057S1 AW360760 AW380770 AW380790BF830729 H28425 AA037326 AA37S605 
X75684 AL573167 AI445461 AI453743 AJ983655 AI564644 AA977180 AI6941 1 1 AI591358 AW071 625 A167871 ^ 

AI363432AA292956AW192S83A!865838A1655905Ai424384Ai161312Ai911921 A1597801 BI494959AI240988AI492554 AW262737BE044033 
AW008570 AW629505 81494958 AA088439 AA706D57 BF222820 BF5936C8 6E501957 AA524S26 BE044134 AW572531 AW015724 BE349166 
AA043217 BE219764 At799814 AA129575 AI671727 AI4700336E646195AW779725AA903050AA147228 AA404570A!07587^ AI972739 
AW673152 AA723200 C06123 BFD57147 AA627685 AA157944 AI990245 AA662517 T32487 AI800106 AI333170 AI859160 W45410 A1990827 
AW275048 AA182640 AA47B328 AI298935 AW085158 AW471421 AW103470 AW300456 AW191997 AI823456 AA962397 AA136658 A1251817 
AW339104 AA724739 AA411 100 AA191349 AA757735 AA037696 Ar769516 AW772283 AA010531 AI692846 AI061065 H80983 R79933 AI950693 
AI245632 AI3493S0 AA148284 AI798502 AA487893 AI521320 AW194272 C06365 AA953883 BE858938 AI918523 AI872628 A1927217 AI453453 
A! 1 69386 AW33867B A1261359 A1500576 6F477735 A!032569 AI972899 A1985583 Z28771 AI363829 Ai693030 AA603585 8E773488 AW339301 
BE773489 BE773462 BE773495 A1650338 6E773499 AI745717 BE811475 6E811470 BE8114G4 BE811418 6E811415 BE811400 BE811398 
BE81 1388 BE8113S2 BE773501 6E773494 BE773486 8E773474 8E773473 BE773470 BE773461 8E811350 BE811337 8F593847 8G055071 
AW675302 BF003088 AA719173 BEB11348 AI582462 A1686240 BE773500 AI244a45 AI565439 AI918453 A1472527 AI446740 AA035576 AA191414 
AW8741 45 C0S782 A1589264 057558 AI468237 AI432033 AA989862 R21752 81^457 AA988297 A1574095 A^^ 
AL578810 BG438381 AI928364 BE879732 AA479834 AA479712 C17732 6M091258 BF843901 AW820230 C17476 6E3271 20 AA1 29574 
AA136645 BF843900 AW806193 AA502832 AA649494 ALS68520 AL547960 BE708937 BE81 1360 8E773498 8E61 1401 BE773484 BE81 1437 
BE81 1380 BE811399 8F997171 BF7S7734 BE926037 A1377596 a61 1 1 AW088968 BE81 1404 BE81 1472 A188ra^ 
85905927 BE811435 AA191387 AW772000 BE81 1453 BE614379 BF844522 81044898 At744233 AW984527 C17S04 BF843883 AI248307 
BE773463 A!567995 W60076 BF941163 Ar738844 BE611458 BE773481 At262930 AA948565 BE706942 BE1S6360 T65026 AW242958AW1 97954 
BEg05184 AA722206 At344943 AI348877 A1334850 BE621857 BE156280 AA454099 AA037722 BF843897 AW8061 83 AA043216 86482896 
AA182734 AAB77242 AW372926 H27252 R38114 BF651858 BE1S6214 AAig0427 T91762 AA035067 AA837326 T10930 6F906587 BI755027 
BG506731 BC00B442 BC010166 ALj550134 AliS3096 AL548700 Al^51 AL547978 A^ 

B6741786 BI868522 AU135866 81552770 8I2S9210 81256520 81255569 86485098 BI258228 B6498501 BM044512 AU133984 ALS56586 
BE74S111 B1222633AU133917 86288151 81260715 B1550550 86500773 BI551761 B6707601 61818593 BF6913d3 66721129 86541576 
BE906666 86751096 BI224135 B6400746 B6478065 BE7g0438 AW080238 AU137549 B6429896 BE392486 AW96168B 66721056 BE908365 
BE546656 B6541235 AW583735 B6528290 6)250895 AW651691 BM048974 BM043805 86142185 AA315186 AI44661 5 C06300 B6497644 
AA088544 A1815967 B6528831 BE6igi62 AW239185AW062910AW062902AA347236F11933AA488005AA301631 AA3768Q0 D56120 
AA343532 AA308636 F00242 AA376066 AA316968 AA343799 81870221 BE910282 86538748 AW960564 AW3287g 016854 AA192519 BF922148 
AA216013 86624091 BE544387 BQ507008 AW176446 BF790033 BE088925 BE088854 AA921353 R21800 AA01 1222 T9752S 
NM.002640 L40377 86227962 AI925897 W60100 66980023 AA853498 AA360401 AW9551 68 W61361 A1074846 AA3737g8 86193973 66499418 
BF816267 AI675932 66611734 66777123 AW4494856F9120166E566411 BM068728AW976385AL121194AI767324BM054^^ 
AA156151 Z25109 C05177 AW97568B AA731063 H670B4 
AF1 47374 T59538 T59589 T59598 T59542 

X83703 KM.014391 80018667 AU139209 6E924924 BE924899 eE924977 BE924g85 6E924947 eF2290S5 BE924917 BE924938 eE812621 
BE924937 AA969164 BE924898 6F229066 BF2290&4 BF081 177 BEg24934 BF229040 BE925037 BE92501 1 66943776 AA488072 AA486364 
Z36249 AW455806 AW5B2753 6F081 131 BE924910 6F229058 BF061172 BE9249S6 AW378093 BF081132 BF081168 6E812736 6E812738 
BF081 1 65 G041 60 €04483 BF22g048 
AW9751 83 AA973583 A)365103 At699495 AI301787 



TABLE 69C 

Pkey: Unique number corresponding to an Eos probeset 

M. Sequence source. The 7 digil numbera (n this colunm are 6enbank tdenlSier (6Q numbeis. Tunhanv et refers to (he pufaQcalron entitled The DNAsequenoe of 
human chromosome 22* Dunham, et aL (1999) Nature 402:489495. 
Sfrand: Indicates DMA strand from witich axons were predicted. 
p osfljoiii Indicstes nudeoSdo positions of predlctBd exonSa 



Pfcey 


Ref 


StrarKl 


Ntj>osiGon 




402463 


9796896 


Minus 


8818^2 




400494 


9714719 


Rus 


169845-170272 




401234 


9929842 


RU8 


120173-120337 




406506 


7711374 


Minus 


6843^ 




405025 


7107727 


Plus 


105267-105343,106184.106294,106387-10653 




405121 


8102330 


Minus 


35816.36004.36587^64 




400666 


8118496 


Plus 


17982-18115,20297-20456 




TABt£7QA: 








Pkey: 


Unique Eos probeset Mentifler number 




ExAccn: 


Exemplar Accession numb 


er, 6enbank accesskm number 




UnigenelD: 


Unlgene number 






Unlgene Tlfia:Uiiigene gene title 






R1: 


Rslio of the mean of the vessel ATs to the mean of the HUVEC ATs 




Pkey 


ExAccn 


UnigenelD 


Unlgene Title 


R1 


428928 


BE409838 


Hs.194657 


cadheiln 1, type 1, E'Cadhafta (epHhsB 


0.99 


439180 


At393742 


Hs.199067 


v-eib^2 avian erythF0bias8c teutemia v 


1.06 


412636 


NVL0O4415 




(tesmoptakin(DPI.DP]l) 


1.25 


426158 


NM.001982 


HS.19S067 


V'eii>-b2 avten erythroblasSc teuken^ V 


1.36 


414320 


U13616 


Hs.75893 


ankyrtn 3, node of Ranvier (ankyiin 6) 


1.50 


417678 


U90916 


Hs^2845 


Homo sapiens cONA: FU21930 fis. done H 


1.55 


414S72 


AU077174 


Hs.288181 


caHiepsh H 


1.64 


415314 


N88802 


Hs.5422 


glyooproteln M6B 


1.70 


431103 


M57399 


Hs.44 


pletotrophin (heparfn binding growth fac 


1.75 


406973 


NQ4998 


Hs.198253 


mafor histocompatibiQy complex, class 


1.88 


456974 


M12529 


Hs.169401 


epoHpopiotdn E 


1.90 


430560 


Z28942 


H&243960 


N-mycdownstreaiiHegulated gene 2 


1.91 


406828 


AA41d202 


Ks.84298 


C074 antigen invariant polniepOde of tn 


1.97 


422048 


NiyL012445 


H&288126 


spondin 2. exbace&idar mabfx pralein 


1.99 
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406826 AW51600$ Hs.B4298 CD74anSg8n(]nvaiantpoiypep6deofm Z02 

453023 AW028733 Hs.31439 serine protease hhlbKor, Kunltz typo, Z03 

413391 AI223328 Hs.75335 gjydns am idi n olran sfef ase (L-arglnfl^g 2.12 

406824 AV\^1S961 Hs.84298 CD74 antigen (Invariant polypepSde of m 2.15 

5 418299 AA279S30 HS.8396B inte9rin.beta2(8iTQgenC01B(p95)*ty 2.17 

437536 X91221 Hs.144465 ESTs 2.21 

442402 NM.000954 Hs.B272 piostaglandinD2 synthase (21kD.fardn) 2.28 

446945 AI193115 Hs.16611 tumor protein D52>{ike 1 Z27 

422639 A(929377 Hs.173724 creatine kinase, tirain 2.29 

10 427451 AI690916 Hs.178137 (ians(}ucerof ER8B2. 1 2.31 

454042 H22570 hypotheSca] protein FU20093 2^ 

416283 S79895 Hs.83942 cathepsin K (pycrwdysostosis) 138 

424247 X14008 H5.234734 iysozyTTO (renal amytoidoste) 139 

422103 AAS84330 Hs.1 11676 protein IdnaseHII; small stress protein 141 

15 415994 NM.002923 Hs.78944 reguiatorof&prote{n8i9naliing2.24)c 143 

427393 AB029018 HS.17763S KIAA1 095 protein 147 

414217 At309298 H3.279898 Homo sapierBcDNA:RJ23165 lis. titonoL 149 

414063 H26S04 Hs.75736 epoHpoprat^ D 150 

426716 NJyL006379 Hs.171921 sema domain, irnrnunog!ot)ullndorndn(lg), 151 

20 414803 X03100 Hs.914 HurnanmRNAfbrSBclassnhistocompatit)] 151 

410532 T53088 Hs.155376 hemogbUn. beta 156 

431369 BE1B4455 Hs.251754 secretory leul(ocyte protease InNtAorf 158 

437952 D63209 Hs.5944 solute canfer family 11 (proton^oupled 158 

^_ 453655 AW980427 Hs.342874 transforming giowtli factor, l)ela recepto 159 

25 453655 AW960427 H3.342874 transforming growth factor, beta recepio 159 

453655 AW960427 H&342874 transfornting growth factor, b^reoepto 159 

427980 AA418305 Ks.303205 EST 159 

442695 AC002425 Hs^603 p8 protein (cancfidata of metastasis 1) 162 

^_ 425751 T19239 Hs.1940 crystallln, alpha B 163 

30 414194 BE175494 HS.75B11 N-acylsphingosine emldohydrolase (add c 163 

443907 AU076484 Hs.9963 TYRO protein tyrosine Icinase binding pro 166 

453464 A18&4911 Hs^9 receptor (calcitonin) acSvitymocfifying 167 

410677 NM.003278 Hs.6&424 tetranectm (plasminogen-binding protein 167 

452685 A1634651 Hs.3025O v-mafmusculoaponeuiotb fibrosarcoma (a 167 

35 425869 AA524&47 Hs.160318 FXYD domain-containing ion transport reg 173 

406643 N77S76 Hs^7939 henwgiobin. alpha 2 173 

453767 AB011792 Ks.3S094 exIraceQular matrix protein 2, female o 174 

424614 X54486 Hs.151242 serine (or cysteine) proteinase inhiUto 180 

444195 AB002351 Hs.10587 K1AA0353 prolan 181 

40 417233 W25005 Hs.24395 small inducible cytokine subfamily B(Cy 188 

44249B U54617 H3.8364 i^omo sapiens pyruvate dehydrogenase kina 162 

412810 1^1574 Hs.74615 platelet-de/ived growth factor receptor. 193 

440820 ^m^m ptakophinn4 194 

429123 AB011099 Hs.196847 K1AAD527 protein 197 

45 411529 AA430348 Hs^175g6 Horho sapiens cONAaJ12927fis,ck)ne NT aOI 

413305 NM_000426 Hs.323511 Homo sapiens cONA:IU23176fls.ctoneL 3.01 

425622 AW360847 Hs.16578 ESTs aOI 

420195 N44346 Hs.251385 Homo sapiens cDNAfU 11 177 fis. done PL a04 

446584 U53445 Hs.15432 downreguiated In ovarian cancer 1 3.04 

50 407669 AI827976 Hs.24391 hypotheScal protein fU13612 a06 

423915 AP039018 Hs.135281 alpha^cQnin-2-«ssoclated Ul^ protein a07 

410036 f^171 Hs.57975 calsequestrin 2 (cardiac muscle) aiO 

412047 AA934589 Hs.49696 ESTs 112 

414840 R27319 H823823 hairytenhancePoMfftltielalsdwilhYRP 3.14 

55 424651 AI493206 ESTs 3.17 

443932 AW888222 Hs.9973 tensin . 3.18 

420859 AW468397 Hs.100000 S100 caIc]unM)lnding protein AS (calgran 3.20 

434025 AF114264 H8,216381 HoriK) sapiens clone HH409 unknown mlWA 3.20 

^. 421834 BE543205 Hs288771 DKI=ZP586A0S22 protein a20 

60 423961 D13666 Hs.136348 periostin(OSF-2os) 3^4 

447384 AI377221 Hs.40528 ESTs 3.24 

425383 D83407 Hs.156007 Down syndrome critical regkM gene 1-lik 3.25 

447023 AA356764 Hs.17109 tirtBgral membrane p(0leh2A 3.25 

420136 AW801090 Hs.195851 actln. alpha 1 smooth musde, aorta a25 

05 406972 M32053 gbiHumanHIO RNAg6ne,c(Hnpletecds. 3.26 

410132 NM.003480 Hs.300946 Microfibrll-assoclaledglycoprotein-2 a27 

430998 AF128847 Hs.204038 indoIethyiainineN-methyltransferase a2B 

456898 NMJH)1928 Hs.165597 D component of complement (adlpsln) a29 

443823 AA345519 Hs.g641 complement component 1,q subcomponent, 3.33 

70 410614 AI091195 Hs.65029 growth errest^pediic 1 a35 

451529 AI917901 HS.20B641 ESTs a35 

430310 U60115 Hs.239069 four and a half UM domains 1 a37 

424897 063216 Hs.1S3684 frizzled^elatad pnsteln 137 

. 427111 AA351026 Ks.173594 serhie (or cystine) pmleinaselnhibito a38 

75 443604 C03577 K3.9615 myosin regulatoiyGght chain 2. smooth a41 

421913 AI934365 K3.109439 ostaogiydn (ostedndudhra factor, mime 143 

451331 AKD02039 H3.251385 Homo sapiens cONARJI1 177 fis. done PL 3.44 

416908 AA333990 Hs.80424 ooagutation factor XIIL A1 pdypepBdo 3.46 

407938 AA905097 Hs.850S0 phospholamban 148 

OO 41B0Q5 Ai1B62^ Hs.83164 cotiagen. type XV. alpha 1 asi 

452877 AI250789 H3.32478 ESTs a52 

419577 L36531 H$.91296 (ntegr1n.a^6 a52 

404277 NhL0191irM»noBapiensm^hislocompa a54 

780 



wo 03/042661 



PCTAJS02/36810 



40781S AW373880 Hlie38a) hypothetical protein FU20Zn aSB 

421853 A1117472 Hs.108924 SH3^fc)main protein 5 (ponsln) a64 

447111 AI017574 Ks.17409 cystein&^ protein 1 (intestinai) 3^8 

426935 NM-000068 Hs.172928 collagen, type I. siptia 1 3.70 

418371 M13S60 Hs.84298 C074an1i8en (invariant polypeptide of m 171 

440Z74 R24595 Hs.7122 sctepieresponsivs protein 1 3.73 

427818 AV611222 Hs.193765 ESTs 3.73 

433465 AV657778 Hs.3314 selenoprotein P. plasma, 1 3.75 

458568 AI769067 Hs.127824 ESTs. WeaWy similar to T2B770hypothefi 3.77 

459290 Nlld 001548 Hs.34853 int)Q)itor of DMA IMng 4, dominant neg 3.80 

424206 NIUL003734 H5.198241 amine ojddase. copper containing 3 (vase a83 

453874 AW591783 HsJ6131 collagen, type XIV. alpha 1(unduliii) 186 

428957 N1UL003881 Hs.194679 WNT1 indudbie signaling pathway protein 189 

434868 R50032 Ks.159263 conagen,typeV1.8lpha2 3.90 

417043 NM_004369 Hs.80988 DoQagen.1ypeVl.aIpha3 195 

417512 )C76534 Hs.82226 glycoprotein (transmemtjrane) csflb 195 

452669 AA216363 Hs.262958 hypotheScd protdn DKFZp434B044 197 

421314 BE44Q002 Hs.1 80324 Homo sapiens, dona IMAG£:4183312. mRNA. 4.01 

449925 AI342493 Hs.24192 Homo sapiens cONAaJ20767fis, done CO 4.03 

453500 AI478427 Hs.43125 esophageal cancer rel^ gene 4 protein 4.07 

430572 U33114 H$.245188 tissueinhil)itorofmetanQp(Oteinase3 4.09 

421823 m0850 Hs.28625 ESTs 4.17 

422550 BE297626 H$.296049 mtcroMar-assodated protein 4 4.25 

443060 D78874 H5.6944 procoDagen OendopepOdase enhancer 2 4.26 

450912 AW939251 Hs.25647 v4osFBJ murine osteosarcoma viral onco 4.29 

432306 Y18207 Hs.3O3090 protein phosphatase 1, regulatory (inhib 4.34 

419938 AU07677^ Hs.1279 complement component l.rsttomponenl 4.37 

415165 AW887604 Hs.780&5 complement component 7 4.37 

414176 BE140638 H8.75794 endolheaaldifferen6a1]on.|yso|rtiospha 4.41 

452114 N22687 Hs.8235 ESTs 4.43 

406850 /^624300 Hs.172928 cdlagen. type I. dpha 1 4.43 

428411 AW291464 Hs.10336 ESTs 4.48 

412802 U41518 H5.74602 aquaporin 1 (diannel-fomiing bitegrd pr 4.51 

406849 AA454809 Hs.172928 ooOaeen, type I. alpha 1 4.54 

453084 R40334 Hs.69463 potassium large conductance caidunKaca 4.57 

429500 )C78565 Hs.2B9114 hexabradilon(tenasdnC.cytotactin) 4.57 

433335 AR}17986 Hs.31386 secreted ftlzzled^elated protein 2 4.68 

421614 L12350 Hs.108623 thramtmpondbi 2 4.73 

427373 AB007972 Hs.130760 myosin phosphatase, target subunit 2 4.76 

414290 AI568801 Hs.71721 ESTs 4.98 

415274 AF001548 Ks.78344 myosin, heavy polypepGde 11. smooth mus 5.1? 

417130 AW276858 Hs.61256 ' S100 caldunvtindhtg protein A4 (caldum S.14 

416784 AA334592 Hs.79914 lumlcan 5.20 

453355 AW295374 Hs.31412 Homo sapiens cONAFU11422 lis, done HE S.50 

408221 AA912183 Hs.47447 ESTs 6.65 

430223 •NM..002514 Hs.235935 nephroUaslomaoverexpressedgone 5.87 

416585 X54162 Hs,79386 leicmodin 1 (smooth musde) 6.47 

422737 M26939 Hs.1 19571 coilagen. type III. alpha 1 (Ehlers-DanI 6.65 

426406 AI742501 Hs.16g756 complementcomponent 1.8 subcomponent 7.94 

448429 D17408 Hs.21223 calponlnl. basic, smooth musde a90 

413624 BE177019 Hs.75445 SPAROGke 1 (masQ. hsvin) 9.91 



TABLf 708 

Pkey: Unique Eos probeset IdenHSer number 
CAT number. Gene duster number 
Aocesskxc Genbank accosston numbers 

Pkey (^TKumber Accession 

412636 1438 1 M77830NM 004415 AF139065BG681 115 BG740377 6^2964 BG000656AA128470BI438324 H27408BE931630BE167165AVW 

AW370813 Jb5211 BG698865 BG740734 BG680618 BG73977B 61765807 BM353403 BM353248 AW177784 AW205789 AW951576 AW848592 
BE182164 BF149266 BE940187 6I06O445 81060444 BF350983 BE7200g5 6E720069 BE7151S4 BE0825B4 BEt)82S76 BE004047 AA857316 
BI039774 BE713818 BE713548 AW170253 6E160433 61039776 AW866475 BM462504 BE931734 BF14g264 AA340777 BF381183 6G621737 
AU127260 AW364859 BF9933S2 86223489 BE619009 6F381 1 84 BE71S956 R58704 AA85221 2 AW366556 BI090358 BF087707 BE819046 
BE819005 AA377127 BE073467 B£ei9069 BE61 9048 BI036306 6G990973 BI040954 6F91991 1 AU140155 AI951768 AI434518 AW804674 
BF752969 BEB37009 BE92S82B BF149265 AW^IS BE814264 61039782 AU140407 BE144243 BE70g863 BF985642 6E001923 BF933510 
AW265328 6G436319 BE182166 AW365175 AW847688 BE8ie280 AW177933 BF67367g AW178000 BE082526 BF476866 BF086994 BF592276 
6E082507 BE082514 BE082S05 BF873693 AW06884O AW847678 BF804153 AW365157 BE813930 BE002030 AW365153 6E184941 BF749421 
BE1B4920 6F8395626E164933 BF842254 BE698470 BE931048 8F999B89 BF368816 BE184924 BE159846 BE714632 BE184948 BG988845 
AA13112B AA099891 W39488 O04715BFQ96124 BC885341 AW799304 AL£03116 BE149760 BE705g67 BE705g66 BE70S988 AW846723 
AW376699AW376817AW376697BG0050976F751115BE696084AW848371 AW376782AW848789AW849074AW361413 BF927725 BF094211 
AW997139 BEB65474 BE185187 BE156621 BE715089 BE713297 BE713298 BE179915 AW799309 BF872345 BF088676 BE705939 AW752S99 
BOmW 6F350086 BE715196 BE715155 6F7523g6 BF093817 BF831 190 BF752409 BE006561 6G959922 BF094833 6F094748 BF094583 
AVV377699AW607238BB182S19AW377700BF349487A1190590A!554403A1392926AU158477BM67252AU1^^^ 
AI439101 AA451923 AI340326 A1590975 81791553 AI700963AI142882AA039975AA948936AA644381 BM314884 AA7Q2424 AI417612 
AVV190555AI220573AI304772AI270345At627383AA552300A1911702AVn66807AI346078V^ 

AI078449 AI819984AI858282BI468588 AI860584AI02S932AA026047 AA703232 AA656154 AAS15500AW192^ T77861 AI927207 

AI205263 BF082491 AW021347 AI568096 8E939862 AA088866 012062 AA056527 AA782109 W19287 WD21 56 AW1 50038 AA0227O1 T87181 
H44405 AI910434 BF082513 AI494069 AI270027 AI635678 AA128330 BG68142S BE706078 R20g04 BG680059 6G676647 BF764409 AA026654 
AV745530 81762798 6G287391 AW798780 BE708045 BC926470 AW7g9118 BF087996 BE002273 AWB79451 AI571075 BE067766 AV721320 
AI022862 H29764 G03378 N84767 AA131077 H30146 6E714290 AI686889 A1S68892 A1915S96 AW10S614 AI8872S8 AI538577 BE926474 
BE067737 BG319486 AA247685 AW798883 AW103S21 BF989173 AW660878 BE939707 BE1857S0 BE714064 BE713903 BE713868 BE713763 
8G9S0164 BE713810AW3651S1 B6955489 8E00S272 BF91S937 AW365148 AI905927 BFg92780 AW853812 BG954443 BI770853 BG879406 
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BG740832 BG681067 BGfi98430 AA455100 T87267 BE696209 BE696210 Bt089483 BE006273 BE672225 AW391912 BE92551 5 B6677012 
BG741970 AA026480 8E705999 BG877157 BE009090BG581378 BE712291 BG951498 BG578984 BI040941 AA337270 AW384371 AW847442 
6I05B859 BE813665 W95048 W25458 AW177788 AA025851 BE931733 6F154837 BG949393 BE714441 AW996245 BE711601 AI284090 
BE064323 BE719390 BE940148 BG991212 BF375714 BF34g522 BG9S6267 T48793 B1013292 BE001925 AW3651S6 AW365154 AW60G653 
8F763109BE931637BE167181 BE713879BF354008BF678726H9Q899AW365145W38382Al4g8487 

454042 30254 1 AJ420458AI018523AA708888Bm9633AL119553BF945g60AI081305AA041432AI921013AI684910AI654^ 

AW241426 R43035 T66767 AW103715 W28478 BF953052 H4S926 BFB07568 AW903943 BE170143 B)040435 BF931S89 Bt600000 AV722350 
W27787 H45331 Bt549761 RS3955 61549855 BG991583 B1491075 AW020049 AW12g293 H45263 AA410309 AA340613 R42410 AA707199 
A1431587 BEB58879 AW292267 A142167B AA041195 8E466753 A1243913 A13S8894 AW1 37296 AI366468 N84350 AA779107 AW025989 R49056 
AA347011 R55722 AW771108F04989Z38381 F01659H17396 61493714 AI880103AW771447A1202S61 AA7e8851 Ai4944368F856114 H22570 

440820 3091 1 AW964381 BG007409 BM314056 AA465642 T30661 T331 1 1 Z42834 C04542 BF948152 BF944325 BF858B95 AA935284 AI267350 ^64249 

W67500 F07982 AA322394 BHd9987 BE64496S 6M313782 AA910364 A)809246 AAB36750 BF115228 AA829730 N27413 BM141766 A!742325 
AA455261 AA939708 AI42Q241 AllSOOSS BF222341 BF941301 AA77t807 Z4t034 BF447457 BF447426 AW515347T03874 FQ2360 F02302 
N34898 A1678586 AA807B24 AA948556 AW204638 BMt42045 BF446775 AV648364 A1801368 AA971739 AI017351 AA7G0722 AI460007 AM58383 
AI694152 AA22G536 BE467282 N47808 BE348825 BE630581 BE830583 N53009 N59351 N41056 W67501 AA016246 H43293 F21282 RS9121 



424G51 46029.5 B6435302BM0836B7AA904b35AM88889 W76175AA761874H28767AAgi0081 AA837086AIS2182SB6986378AI47B562AA743152AA746092 
H88863 BG888375 AA63S644 AI493206 AA6G9979 BE245127 66986430 BG986529 BF665e73 BG030157 BG622575 AA76S495 

TABU70C 

Pkey: Uiriquo mmber oonesporufino to on Eos pnibesel 

Reh Sequence sotiico.'nie 7 <Iigit numbers In Udscdunm are Genbatlk Identifier (Gi) numbeis.'Dunham, et d.' refers to the publicallon entaied The DMA sequence of 
human chromosome 22* Dunham, et aL (1999) Nahire 402:489^95. 
S\sw± trufcales DMA strand from which exons ware predicted. 
NUK«Hion: indicates nudeoQdeposlfions of predicted exons. 

Pkey Ref strand NLposition 
404277 1834458 Minus 91665-91946 



TABLE 71A:774GENES UP-REGUIATEDIN EWING SAROOMACOiyiPAREDTO NORMAL ADULT TISSUES 

Table 1 A lists 774 genes up^gulaled in Ewing Sarooma compared to nonnai adult tissues. These were selected from 35371 pmbesets on the Affymetitx/Eos Hu03 GeneChip array 
such Bial the ralto of 'average' Ewing Saix»ma to 'average* normal adult tissues was greater than or equal to 2.1 . The 'average' Ewing Sarcoma level was set to the 75th percenlBe 
amongst numerous Ewing saitomas. The 'average* nomtd adult (issue level was set to the 85th percentile amongst various noiHnallgnant tissues. In oider to remove gene^pecific 
badc^und ieveU of non^pecffic hybridizBllon, the lOlh percentile vahie amon^ 
ratio evaluated. 

Pke/: Unique Eos probeset identifier number 

ExAocn: Exemplar Accession number, Genbank aooesston number 

UnlgenelD: Unlgene number 

Unigens Title: Un^ne gene title 

R1: 75th percentile of ewing sannma to 85th percentile of body map 



Pkey 


ExAocn 


UnigeneiD 


UnigeneThle 


R1 


104659 


AW969769 


HS.10S201 


ESTs 


70.3 


101447 


M21305 




gb:Human alpha satelfita and satellite 3 


64.7 


105782 


H09748 


Hs.57987 


CLLAymphoma 118 (dnc finger pro 


50.6 


115681 


NM.0057S6 


Hs.184942 


G proteli>-cou|4ed receptor 64 


46.8 


121792 


AWg69726 


Hs.98381 


ESTs, Wealdy ^milar to serine protease 


46,7 


101104 


AW86225B 


Hs.169266 


neuropeplkfe Y receptor Y1 


48.6 


110278 


AP061573 


Ks.19492 


piotocadherinO 


46.1 


126645 


AA316181 


Hs.61635 


sbt Iransmemiyane epHheOa! enflgen of 


41.4 


116752 


AL0O8583 


Hs.91622 


neurond pentraxin receptor 


40.9 


119717 


AA918317 


HsOT87 


&C8tl CLUIymphoma 11B (zinc finger pro 


39.6 


104691 


U29830 


Hs.37744 


Homo sapiens beta-1 adrenergic receptor 


38.9 


110728 


AA737106 


HS.322S0 


ESTs. Moderately shnltar to 178885 serin 


35.5 


121362 


AF0S0147 


Hs.97932 


chondromodilDn 1 precursor 


317 


131291 


NMJ)04350 


Hs.170019 


runt-related traitscr^&m foctor 3 


33J) 


101083 


D54745 


Hs.80247 


chotecystoldnin 


31.7 


121619 


AA528339 


Hs.176062 


ESTs. Weak^ sbrOar to phosphaBdylseit 


28.7 


122651 


AW975398 


Ks.293836 


ESTs 


28.0 


100299 


D49493 


Hs2171 


growth dtftotiafion factor 10 


26.5 


129977 


NM.000399 


Hs.1395 


eaity growth response 2 (Krex-20 (Drosop 


26.2 


123619 


AA602964 




gbm)97c02.8l NCL0GAPJ>r2 Homo sapiens 


26.1 


124006 


A1147155 


Hs.279727 


ESTs: homotogue Of PEAM ICtonasavlgnyi 


23.1 


116301 


AW969706 


H3.293332 


ESTs 


22.6 


121231 


AA814948 


Hs.98343 


ESTs. Weakly aindlar to ALUCJ1UMAN Ml 


22.3 


106533 


AL134708 


MS.14S998 


ESTs 


2Z3 


109166 


AA219691 


Hs.73625 


RAB6 tnteracOng. kinestn-Uke (rabklnas 


21.3 


131313 


R96^ 


H8.75874 


ribosomal protein L44 


20.8 


116790 


AW161357 




ndcrotubule-as^xiated protein tau 


18.7 


105316 


AI671245 


Hs.24835 


hypothetical protein FU14594 


18.2 


102123 


NM_001809 


Hs.1594 


oentron\ere pretdn A (17kl7) 


17.8 


126218 


A1049601 


Hs.13649 


^tovel huiran gane mqifAig to dnmosonnB 13 


17.8 


119791 


AA554907 


Hs.58291 


ESTs 


16.7 


113003 


AW282315 


Hs.7215 


ESTs 


16.3 


102836 


U94320 


Ks.158330 


neuropeptide Y receptor Y5 


16.3 


126799 


AW753865 


Hs.74376 


dfiactomedin related ER tocaOzed protel 


16.3 


105298 


BE387790 


Ks.26369 


hypoOteScalpnrtein FU20287 


15.5 


107160 


AA314490 


Hs.27669 


K1AA1563 protein 


14.8 


115313 


AA808001 


Hs.184411 


albumin 


14.6 


123308 


C14187 


Hs.103538 


ESTs 


14.2 
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126077 M78772 Hs^10836 ESTs U.0 

100638 AA884407 H5.211595 protein tyrosine phosphatase, noo-recept 13.9 

106821 NM.006594 Hs.28298 edaptor-ielatad protein complex 4, beta 13.9 

110288 H40665 H&31564 ESTs 13.8 

S 106438 A1221919 hypolhefica) protein aj10S82 13.8 

112134 R41823 Hs.7413 EST5»3!syntBnin-2 13.7 

115265 AA334274 K3.18368 DKFZP$64B0769 piotein 13.4 

126639 AW582962 Hs.102897 CGM7 proton 13^ 

125698 AF078847 Hs.191356 general transcription factor UK polype 1Z8 

10 105200 AA328102 K5.24641 cytosketeton associated protein 2 113 

119456 AA248897 Hs.487B4 ESTs 12.2 

135155 AI207958 KS.166S56 Homo sapiens, SimBar to TEA domain fanti ^Z^ 

106614 AA64B459 Hs.335951 tiypolhefical protein AF301222 12.0 

131941 BE252983 Ks.35086 ubiquilin specific protease 1 110 

IS 132968 AF234532 Hs.61638 myoslnX 11.7 

132725 NML008276 Ks.184167 apliidng factor. aiginIneJtefin»fich 7 11.7 

111353 W20090 Hs.6616 ESTs 11.4 

123289 AA495904 Hs.103316 ESTs 11.3 

104388 AI249502 Hs.29669 ESTs 11.1 

20 123532 AA60B733 gb:8e56f06.s1 Stiatagene lung caidnoma 11.0 

104173 AA084273 Hs.76S61 ESTs. W^aldy similar (DS47072 finger pr 10.7 

125556 AB033084 Hs.236463 KIAA1238 protein 10.7 

127240 AJ005683 Hs.86998 nudearfaclorofadivstedT-cails5.t 10.6 

123049 e£O476B0 Hs.211869 (fic)d(opf(Xenopusiaevis))iomoloo2 10.6 

25 132315 AF091086 Hs.44563 hypothetical prat«n 10.5 

134321 BE538082 Hs.8172 ESTs, Moderately slmBar to A46010 X4]n 10.5 

128609 NIUL003616 HS.1024S6 sunrtval of motor neuron protebitnterao 10.4 

131945 NM_002916 Hs.35120 replication factor C (activator 1) 4 (37 10.4 

110730 N67555 Hs^11 ESTs 10.3 

30 119188 AI97gi47 Hs.10126S hypothetical protein FU22593 m 

115909 AW872527 Hs.59761 ESTs, Weakly similar to OAPIJIUMIAN DEATH 10.3 

. 121916 AW1 17207 Hs.98523 ESTs 10.2 

131402 Y08B90 H8.113503 kaiyopherin (impoitin) l)eta 3 10.1 

101879 AA176374 Hs,243886 nudear autoantJgenk: spemi protein 0^ 10.0 

35 133181 X91662 Hs.66744 twist (DrosophSa) hornolog (acrocephatos 10.0 

131307 NM„000025 Hs.2549 adrenergic, tieta^. receptor 10.0 

113849 AA457211 Hs.885B txomodom^ adjacent to zinc finger dome 9.9 

103446 X98834 Hs.79971 sal (Drosoph9a)^ 2 9.7 

105091 AA148859 Hs.179909 hypothefical protein RJ22995 9.6 

40 106146 W07268 Hs.118392 ESTs. Weakly sbnSar to lllHUERtgEli: re 9.5 

127003 AW816515 Hs.173540 ATPase, QassV.typelOO 9.5 

115414 AA662240 Hs.283099 AF15q14 protein 9.4 

117B29 BE160952 H8.247117 ESTs, Moderately similar to AlUFJftJlMANl 9.4 

126098 M79088 gb:EST01 236 Subtracted Hippocampus, Stia 9.3 

45 121910 AI204600 Hs.96978 hypothetical prot^MGCI 0764 9.2 

130625 AF176012 H3.260720 J domain containing protein 1 9.2 

129755 R42216 Hs.7759 Homo sapiens ctone 24538 mlV^A sequence 9.1 

100335 AW247529 Hs.67g3 piatetet^cfivafing factor acefyihydRda 9.0 

111109 AI940675 Hs20914 hypothetical protein FU230S6 9.0 

50 120217 AA882257 Hs.66035 ESTs 9.0 

112891 T03927 Hs.293147 ESTs. Moderately similar to A46010)Win 8.9 

131801 NM.007115 Hs.29352 tumor necrosis factor, alphaMuoed pro 8.8 

126262 AI085153 Hs.143764 ESTs, WeaidysimitartounkiKwnlHjsapto &6 

113903 A1368034 Hs.38679 ESTs. Weakly similar to A46010}(lhked 8.7 

55 105253 AW^7464 Hs.5003 iaAA0456 protein &6 

102581 AU077228 Hs.77256 enhancer of zeste(Drosophila)homotog 2 8.6 

105172 AA187554 Hs.300496 ndtochondrtal soiuto canfer a6 

100380 082343 Hs.74376 neuroblastoma (nenre tissue) protein BA 

129229 AF013758 H8.109843 polyadenytate tmufing pnrteli>4nterac8n aS 

60 102725 AB026187 Hs.159156 protocadherin 11 8.5 

130298 AI347487 Hs.132781 dass I cytokine receptor 8.4 

132294 AB023191 Ks.44131 KIAA0974 protein 8.2 

118644 AA443241 rilxjsomal protein L44 8.2 

106575 AW970602 Hs.105421 ESTs a2 

65 133142 AWg52412 HS.6S874 ESTs, Weakly sliriter to A40348ElairiSex- ai 

119499 AI918906 Hs.55080 ESTs &1 

100248 NM_015156 Hs.78398 KIAA0071 protein ai 

114837 BE244930 Hs.166895 ESTs ao 

107098 AI823593 Hs.27688 ESTs ao 

70 123980 AW082862 Hb.287733 hypothaOcsl protein RJ231 89 7.9 

105280 AA894838 Hs.14600 ESTs 7.9 . 

115147 AA745781 Hs.38399 hypothetfcal protein 1^602454 7.9 

134570 U66615 Hs.172280 SWUSNF rotated, matrix assodatBd.ac6 7.9 

133083 AI654133 Hs.30212 thyroid receptor Interacfing protein 15 7.8 

75 105730 AW377314 Ks.5384 DKFZP564I052 protein 7.8 

126426 AA125984 gb3n27h0ar1 Strategene neuroepilhelium 7.8 

107254 Z45998 Ks.22543 Homo sapiens mRNA:cONADKFZp76111912(f 7.8 

134367 AA339449 Hs.82285 phosphortbosylgtyclnamldetormyttTansfBr 7.8 

119082 AF252297 Hs.91546 cytochrome P450 retinoid meteboBzing pr 7.7 

80 12S4Q0 AL110151 Ks.128797 DKF2P5B6D0824pioteln 7.7 

129413 AW377610 Hs.11123 DXFZPS64G092 prot^ 7.7 

116766 AI808857 Hs.950g7 ESTs 7.7 

129075 BE2S0162 Hs.83765 dihydrofolate reductase 7.6 
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10S143 


AI368835 


Hs.24808 




132438 


AW383687 


Hs^l6 




118036 


A)471862 


Ks.196008 




131170 


NM_014253 




5 


104546 


R3^98 


Hs.91559 




131495 


AA812434 






132S43 


BE568452 


H5.344037 




124040 


U23752 


Hs.32964 


10 


127695 


AA714731 


Hs^1457 


132559 


AP1 19846 


Hs.270863 




101050 


AU077324 


Ks.1832 




111823 


R35252 


HS.13055B 




116401 


AW893940 


Hs.59698 


15 


105127 


AA045648 


Hs^1957 


128478 


AA708205 


Hs.100343 




110456 


H52348 


HsJ6636 




118846 


AVV299598 


Hsi0895 




120934 


AA226ig8 




20 


128132 


AA22S632 




127431 


AW771958 


H$.175437 




108157 


W37943 


Hs^892 




117852 


AW877787 


HS.1361Q2 




106213 


N45016 


Hs3769 


25 


118013 


AI674126 


HsJ4031 


120147 


Ai917116 






118267 


N34905 


Hs.44653 




132581 


AKO00S31 


H&52256 




120325 


AA195651 




30 


133276 


AW978439 


Hs.6gS04 


127742 


AW293496 


H3.180138 




127664 


AA806164 


Hs.1 16502 




108778 


AF133123 


HS.S0847 




121553 


AA412488 


Hs.48820 


35 


107059 


8E614410 


Hs.23044 


133658 


AA319146 


Hs.75426 




115197 


R18856 






112268 


W39609 


Hs.22003 




103346 


X87613 


Hs.5464 


40 


126967 


AA205976 




118499 


N67274 


Ksi0141 




105301 


AW352357 


Hs,7457 




113617 


AI869372 


Hs.17207 




127968 


AA830201 


H8.124347 


45 


134719 


AA852985 


Hs.89232 


110748 


AW581992 


Hs.301434 




115844 


AI373062 


Hs,332938 




121885 


AA934883 


Hs.98467 




102398 


U423S9 




50 


130882 


AA497044 


Hs.20887 


132256 


AI078645 


H5>431 




106383 


AA447453 


HS.Z7860 




135186 


U73328 


Hs.172648 




105510 


Z42D47 


HSk283978 


55 


109261 


6E466639 


^61779 


114357 


R41677 


Hs.6107 




128501 


AL133572 


H3.199009 




109747 


AI223001 


Hs.22969 




128361 


AW172570 


Ks.14600 


60 


115773 


AW445044 


Hs.38207 


104689 


AA420450 


Ks.292911 




113575 


AW138168 


HS.1S671 




121830 


AW394055 


H$^27 




119298 


NM.001241 


Hs.155478 


65 


109841 


K01052 




115622 


AK)88691 


Hs.209414 




122959 


AW821252 


Hs.104336 




109872 


R65841 


H3J28653 




114208 


AL049466 


Hs.7859 


70 


113494 


T91451 


H9.88538 


127684 


AA668831 


K5.159971 




128300 


W94ig7 


Hs'.110165 




127489 


AA650250 






133479 


W01556 


Hsu08797 


75 


10SS09 


AA195191 




101255 


BE385654 


Hs.149894 




134676 


W28Q51 


Hs.87819 




130895 


AA641767 


Hs^lOIS 




126395 


AI468004 


Hs.276956 


80 


130723 


BE247676 


Hs.18442 


107230 


AI034487 


Hs.34850 




102745 


AW753865 


Hs.74376 




108899 


AA121514 


Hs.70832 




128080 


F12310 





ESTs,Wea)(lysini!lartot36022hypoiheg 7.6 

chaperonlncontdrrfn9TCP1.BUbunil6A( 7.6 

HomosaplenscONAFU11723(is.cioneHE 7.6 

odz(oddOi/t8n^DrDsoptdb)hamQlog1 7.5 

ESTs 7.5 

SMC2 (structural m^tenance cf chremoso 7.5 

protein re^Mir of cytoldnesisi 7.4 

SRY (sex determining region YHkw 11 7.4 

ESTs, Weakly similar to heterogeneous ri 7.3 

hypoOielicai protein PRO1580 7.3 

neuropeptide Y 7.2 

EST$,W^8lmilarto210926QA8ceO 12 

ESTs 7.1 

nudbc (nucleoside diphosptete linked mol 7.1 

ESTs 7.1 

ESTs 7.1 

ttorneo box C4 7.1 

gb:nc26a07.s1 N(3.0GAPJ>r1 Homo sapiens 7.1 

gb:nc08a07 Jl Na^(X5AP^Pr1 Homo sapiens 7.0 

ESTs. Moderately slmOar to PC4259 fani 7.0 

KIAA1323 protein aS 

KIAA0853 protein 6.9 

tiypothelicai protein DKFZp761J17121 6.9 

ESTs 6.9 

liemoglot)ln. beta 6l8 

Komo8apienscDNA:aJ226696s.clon8H 6.8 

tiypofhetlcal protein FU20624 6.8 

AP-2 beta transoIpSon factor 6.8 

ESTs as 

ESTs 6.7 

ESTs 6.7 

general transcdptk}n factor IllC. polyp 6.6 

TATA box binding protein (TBP}-associal0 6.6 

i^l (S. cerevisiae) homoiog (E coQ Re 6.6 

secretogranin ii (chromogranin Cj 6.6 

ESTa 6.5 

sohjte canter fandy 6 (neuroiiansmltte 6.5 

thyioMhonnonereceidorooacQvaBngpr &4 

gbucq48a10.r1 Stratagene hNT neuron (937 M 

EST 6.4 

MAGE1 protein ^3 

Komo8ai^nscONAFU11922fis.doneHE aS 

ESTs 6.3 

chfomobox homoiog 5 (DrosophllaHPIaiph 6.3 

K1AA1387 protein 6.3 

hypothettea) protein MGC5370 6.3 

ESTs. Highly similar to AF257737 1 cOia a 2 

gb:HiimanN33 protein form 1(N33)genOi a2 

hypoOwtical protein RJ10392 6.2 

muitne leukemia viral (bmi-1) oncogene h a2 
Homo sapiens mRNA; cONA DKF^iS86M0723 (f 6.2 

diste)^homeobox4 a2 

Homo sapiens PROZ761mRNA, complete Gd8 &2 

Homo sapiens cONA PU13591 H done PL ai 

Homo8aptenscDNAaJ14839fis.doneOV 6,1 

protein centring CXXC dom^ 2 a 1 

ESTs. Weakiy stmOar toZ141JtUMAN MQ ai 

ESTs ai 

Human DNA sequence from done RP4^15 6.0 

PlakophlUn ao 

ESTs. Weakly slntilar to KBF3JtUMAN NUOE 6.0 

ESTs.WeaktysimDartol38022hypothelI 5.9 

cycIinT2 SJ 

gb:yi32h01.s1 Soares placenta Nb2HP Homo a9 
(^sa{ri6n5mRNA:cONADKFZp564D0472(f 5.8 

hypothetical protein aO 

ESTs a8 

ESTs a8 

ESTs a8 

K1AA0379 protein aS 

fibosomaIprotdnL26haniotog 5.8 

ESTs 5.8 

ESTs, Moderately skniar Id 138022 }iypot 5.8 

hypoffie^ protein FU20729 &6 

roilDdiandrtaltransiattonaliniiiattonf 5.6 

Homo sapiens, done MGC:2492.mRNA.comp a? 

hypothetic8SpfotrinOKFZp664L0884stma a? 

hypothetical protein FU12929 a7 

E<1 enzyme a? 

ESTs a7 

oUadomedin rdated ERIocafized protel a6 

ESTs a6 

0b:HSC38O041 normalized inlM brain CON aO 
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117357 


N24829 




120827 


AA38252S 


Hs.132967 


123494 


AW179019 


H3.II2IIO 


100438 


AA013051 


Hs,91417 


126165 


AI741816 


Hs.1 25897 


125934 


M193325 


Hs.32646 


124370 


A1243499 


Hs.170552 


106135 


1 AL1 17474 


HS.411B1 


10^8 


R49390 


H3^54129 


132967 


AA316181 


Ks.61635 


108806 


243371 


Hs.7012 


115816 


BE042915 


Hs.22572 


125188 


AA61Q620 

fvyv 1 lAi&w 


Hs.181165 


131185 


BE280074 


Hs.23960 


111227 


T06701 


Hs.1 2268 


105516 




30738 




MM BDa&A 


Hs.43697 


1160(^ 


179153 


Ks!48589 




242993 


H&.25320 


115121 


A1634S49 


Hs.325422 


125464 


N71807 




115536 


AW298S97 


Hs.61684 


119040 


R02394 


Ks.269436 


128040 


AW500486 


Hs.180610 


110151 


Hie835 


H3.31608 


105539 


AB040884 


Hs,109894 


130567 


AA383092 


Hs.1608 


111077 


N41387 


Hs.173002 


120830 


AI568170 


Hs.96886 


106516 


AL137311 


Hs.234074 


103316 


X83301 


Hs^24728 


116129 


AF189011 


Hs.49163 


131253 


R71802 


Hs^4853 



116680 AWg02848 Hs.273829 

123949 AA621665 Hs.208957 

105511 AB037829 H3.3862 

125847 AW161885 Hs.269745 

108301 AA06g728 H&184562 

110799 AI089660 Hs.323401 

104899 AA054728 Hs.285574 

125972 AI927475 Hs.35406 

107869 AW975998 Hs^95 

121309 AA293834 HsS7312 

125321 T86652 Hs.178294 

102627 AIJ021918 Hs.158174 

104448 AF084555 Hs.7351 

126020 H79863 Hs.114243 

116814 H50834 Hs.77899 

130622 A1S82291 HS.16B46 

110818 AL157503 H3.27552 

127765 AA971148 Hs.129074 

108768 AF161721 Hs.61345 

120484 AA253170 Hs.96473 

114518 AW163267 Hs.106469 
129012 R81936 

124973 AI476066 Hs.102243 

124648 AA249088 Hs.125034 

126904 AA948033 H3,130853 

131526 AI005169 Hs.28274 

130637 AA356764 Hs.17109 
126769 AA083456 
126086 H75661 



100169 


AL037228 


Hs.82043 


130262 


D63216 


H3.153684 


109260 


AW978515 


Ks.131915 


103120 


BE410731 


Hs.74050 


111099 


R15337 


Hs.21958 


126142 


H86261 


HS.4056B 


126802 


AW805510 


Ks.97056 


105848 


AW954064 


Hs.24951 


127987 


AI022103 


Ks.124511 


129706 


AA443241 




129598 


N30436 


H3.11556 


126629 


AL096739 


H8.107260 


111348 


AA034922 


HSJ9585 


100448 


AF234887 


KS.S76S2 


127542 


AA703684 


Hs.179833 


132452 


AW973521 


H3J47324 


123778 


AW352149 


Hs.102314 


126521 


A1475110 


Ks.203933 


110343 


AW138703 


Hs.17268 


102963 


mm 


Hs^4534 


127207 


AA377165 


Hs.44833 



gb:yx98h12£l Scares metsnocyte 2N1)HM Ho 
Human EST dons 122887 fnariner transpose 
mitochondria itaomal protein L42 
topoisomafBse (DNA) D binding protein 
ESTs 

hypo0ielicalpnMFU21901 
ESTs 

Homo sa;fens mRNA; cONA OKFZp727C191 (Ir 
K1AA1678 

Six transmembrane epItheQal anSgen d 
ESTs 

Homo sapiens cONA aJ13675 Gs^ done PL 
major hisbsompaGbSly comfJIex. dass 
cydinBI 
ESTs 

Ivpottialicai protein aJ10407 

els variant gene 5 (ets-relatBd moiectde 

zinc finger pDtdn 228 

Homo sapiens done 25142 mRNA sequence 
ESTs 

flh:yz29d09x1 SoaresLnniIllple.8defO^_ 

Homo sapiens, done tMA6E:4298Q26. mRNA, 

ESTs. Moderately simflar to PC4259 ferrt 

spOdng factor prdlneitlhitamine dch ( 

liypothetical protein FU20041 

KiAA1451 protein 

repMon protein A3 (141(0) 

ESTs, Weakly similar to 138022 iiypotheB 

ESTs 

Homo sapiens mRN^ cONA DKFZp761G02121 ( 
SMA5 

putaOvaribonudeaseni 

ESTs 

ESTs 

EST 

regulator of nonsense transcripts 2; DKF 
ESTs 

rib08omaiproldnL24 
dpy^liln protein 
ESTs 

ESTs, Higtily similar to unnamed protein 
ESTs. Weakiy similar to 138022 hypotheS 
ESTs 
ESTs 

zinc linger protein 184 (KruppeMiln) 
cycOc AMP plYOSplioproiein. 19I(D 
ESTs 

g|b7p86a10^1 Soaresfatd liver spleen 
ESTs, Weaidy sImQar to 04HUD1 ddirlsoqM 
Homo saptens mRNA; cONA DKFZpS88N2424 (f 
ESTs. Moderately dmOar to MEG1 MOUSE M 

RU2S 
EST 

suppressor of varl (S.cer6visiae) 3-liiu 

(fbcGomd protein L44 

ESTs. W^aldy similar to 178885 seiineAli 

tiypolheOcalprot^FU13340 

ESTs 

Homo saptens cONA: nj22049 fis. done H 
integral membrane protein 2A 
gtm09g08.r1 Strategena hNT neuron (937 
gb7r77gD1 /I Soares fetel Iver spleen 
D1 23 gene product 
fiizzted^etated pmtetn 
KIAA0863 protein 

fiMar lympiioma valiant translocaSo 

Homo sapens mRNA; cONA DKFZp547O086 ^ 

ESTs 

hypotheOca! prol^ FLI21634 

ESTs 

ESTs 

rilMSomei protein L44 

Homo saptens cDNA FIJ12566 fis. done NT 
tiypotlnfica) protein DKFZp586H0823 
ESTs 

cadfierin. EGF lAG seven-pass G-type rece 

ESTs. Moderete|ys]m!lartoALU5.HUMAN A 

mitochondrial ribosomal protein 814 

ESTs 

ESTs 

ESTs 

c^tanln^elated pdypepfide. bete 
ESTs 
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113974 AW969756 Hs.34145 
133098 AW629223 Hs£4794 



109920 


H05840 


Hs.111323 


103487 


AA743503 


Hs.172108 


125353 


AB033043 


Hs.149377 


100893 


BE245294 


Hs.1 80789 


135047 


All 34197 


Hs.93597 


111148 


AB020690 


H5.7782 


107681 


BE379594 


Ks.49136 


108480 


AL133092 


Hs.68055 


132742 


AA025480 


Hs.292812 


104912 


AA813192 


Hs 200596 


110223 


HI 9836 


Hs^1697 


113047 


AI571940 


Hs.7549 


101031 


J05070 


Hs.1 51736 


113002 


BE243513 


Hs.7212 


105^ 


AW134924 


Hs.1903K 


112386 


AA8317ffi 


Hs. 171914 


106489 


AA452054 


Hs 119338 






Ha.99519 


112661 


AAS9S802 


Hs.33410 


111346 


AW97Q976 


Hs.293653 


12^96 


AAtt1130 


Hs.112640 


125447 


A)582222 


Hs 128686 


106178 


AL049935 


Hs^1763 


102250 


AI249361 


HsJ4122 


105868 


AA376760 


H3i34642 


127496 


AI031850 




127315 


AF1 16622 




110193 


AI004874 


Hs.310764 


105012 


AF098158 


Hs.9329 


129445 


W52452 


Hs.29797 


114721 


D61939 


Hs.103822 


120922 


AA461003 


Hs.97128 


115167 


AA749209 


H3.43728 


102407 


AW602154 


Hs.82143 


106111 


AW875398 


Hs.6451 


123829 


AF251237 


Hs.112208 


103816 


NNL002647 


Hs.32971 


100269 


NM.001949 


HS.11&9 


112728 


R91913 


Hs^72104 


135098 


AW274526 


Hs.277721 


106034 


AW9S2005 


Hs.14928 


133571 


BE515037 


Hs.177556 


106246 


AL036917 


Hs.288621 


125724 


AL360190 


Hs.318501 


132206 


AA425204 


H8.334721 


130227 


BE397151 


H8.153003 



129650 AF109298 Hs.1182S8 

130382 NtVL003450 Hs.155204 

106073 AL157441 Hs.17834 

105403 AI473827 H8^1793 

118138 AA3747S6 Hs.93560 

134126 NM.003747 Hs.131814 

115041 AA252457 Hs,86543 

106012 A1240665 

116732 AW1S2225 Hs.165909 

130832 AW838006 Hs.20082 

125960 At754693 Hs.145968 

133916 AIJ039185 Hs.77558 

106232 AB037828 Hs.15370 

129228 U40714 Hs^307 

109929 AA7731B7 Hsi94027 

123729 AL039779 Hs^8672 

101266 L36645 Hs.73964 

132389 AA310393 Hs.190044 

124320 H95749 Hs.102342 

103023 AW500470 Hs.117950 

116133 AW449S97 Hs.313652 

129001 AA443323 Hs.107812 

119271 AI061118 H3.65328 

114767 Ai859865 Hs.154443 

126107 H75477 Hs^3361 
115333 6E241812 Hs^7860 
100571 L14561 Hs.78546 
134104 L352S3 Hs.79107 
120150 BE005771 Hs,153746 
134470' X54942 Hs.83758 
129948 AI537162 Hs^88 
106882 AA14g537 Hs.26994 
126199 AW088278 Hs.125829 
112727 T91029 Hs.15069 
118672 AI039201 Hs^16 



EST8. WB^SlmnartDB49S47 GTP-bMI 
dncfingerproteln 183 (RING flnger, C3 

ESTs 

mideopodn 88kD 

hypoff^ticst piot^ DKFZp761L0424 
S1&4protdn 

cycSrHfependent kinase 5. regutatvy su 

paraneoplasOc anUgsn MA2 

ESTs, Moderately stmilar to AtJU7_HUMAN A 

liypoUtelical protein OKFZp434t0428 

ESTs, Weddy simlar toT33468 hypothefi 

KIAA0547 gene product 

ESTs 

ESTs 

matrix met^Doprolelnase 9 {gelatinase B 
hypothetical pRitBinPP1044 
ESTs 

Homo sapiens cONA FU14209 fis. done ^^' 
ESTs 

hypothetic^ protein FU 14007 

ESTs. VfeaMy dmnar b 11 7279 hypotheti 

ESTs 

EST 

ESTs 

KiAA0554 protein 

C8spase4, apopto^s^elated cysfdne pr 

ti4njl]n dpha 1 

ESTs 

gb:Homo sapiens done aB4217 mRNA seque 

Homo sapiens mRt^ d)NA DKFZp434M082(fr 

chromosome 20 open leading 6ame 1 

ribosomal protein L10 

ESTs 

ESTs 

hypothetical protein 

E744ke factor 2 (ets domain transcript 

PRO0659 protein 

XAGE-1 protein 

ph<^hotnosnide^ldnase, dass 3 

E2F transcription fador 3 

ESTs. Moderately similar to AUUIjnjMAN A 

membrane component, chromosome 17. surfa 

hypolhetlcai protein FU12903 

melanoma antigen, fanrily D, 1 

KIAA1638 protein 

Homo sapiens mRNA full length insert gON 

Ivpotheticai protein aJ13391 

Gerir»AhieonIneldnase16 

prostate cancer assodated protein 1 

zinc finger protein 174 

downstream ndgtibor of SON 

ESTs 

HomoaapiensmRNAIbrKIAAITTI prolBbi, 
l8nkyiBsa,TRF1-interading anltyrlHela 
ESTs, Moderately 6{ni3rtoT00256hypot 
ESTs 

ESTs, Weddy dmDarto 138022 hypoOtefi 

zine finger protein 3 (A&^l) 

ESTs 

Oiyrold hormone receptor interactor 7 
IOAA1407 protein 
lyrosyt-tRNA synthetase 
ESTs 

mamtirane component chromosome 11. surfa 

EphA4 

ESTs 

EST 

muIGfimdiond pdypep&da simDar to S 
EST, Wealdy simiiar to 136022 hypothetic 
6P0Z protein 

Fanconi anen^ oomptementafion group F 
minlchromosome maintenance deficient (S. 
ESTs 

protein tyrosine phosphatase, notwacept 
ATPase, Ca^ transporting, plasma membia 
mttogen^^dWated protein idnase 14 
hypotheflcdprotebiFU22490 
COC28 protein kinase 2 
ESTs 

hypothetxal proldn FLJ20477 

ESTs 

ESTs 

ESTs 
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131263 AU077W2 Hj24950 regulator of GiWrtelnslgnalllno 5 4.0 

134417 NM.006416 Hs.82921 sdute carrier family 35 (CMP^ ad 4.0 

113609 T93263 Ks.16675 EST8.Wea((lysimnar to 823650 retrovini 4.0 

^ 127153 AI732303 Hs.186518 hypothsQcalpR)tBlnM6C2599slmIlv 4.0 

5 124105 H11484 Hs.79133 ESTs 4.0 

100031 AFFX control - DapX-M 4.0 

108897 AF033023 Hs.167496 RAM tending protein 6 4.0 

128859 AW630087 Hs.103315 tiinudeoQde repeal containing 1 4.0 

133928 AW403084 Hs.7766 ublquilinKwnlugaling enzyme E2E 1 (homo 4.0 

10 126985 AI470S23 Ks.139338 ATP-bindlno cassette. suthCamBy C (CFTR 4.0 

103100 NM.005574 Hs.184585 UM(tomabionly2(rhonlbo1in^1) 4.0 

119005 AL038511 K$.125316 ESTs^WeaktysMlar to S33990 linger pr 4.0 

118751 N74210 lte.50454 ESTs 4.0 

128797 NM_002975 Hs.10S927 stem ceUgrovrthfecton lymphocyte secr 4.0 

15 111387 A1244489 HsJ285724 ESTs 4.0 

118876 ri45312 ^.48506 ESTs 4.0 

107348 AW973753 Ks.182426 rttxjsomal protein S2 4J) 

120528 AI923511 Hs.104413 ESTs 4.0 

115652 BE0935B9 Hs^178 hypotheecal protein RJ23468 4,0 

20 102935 BE561850 Hs.80506 smeU nuclear ritxHtudecprotein poiypepl 4.0 

115399 AF151S34 Hs.92Q23 core Hskme macrDH2A2.2 4.0 

133615 M62843 Hs.75236 ElAV (embryonic lethal atmormalviston. 4.0 

118129 N57493 gb7y54c08ji1 SoaresjnulilplejBderosls. 4.0 

126522 W31912 Hs.21168 gb2c7Gd03.s1 PancreaQcbletHofnosapi 4.0 

25 131965 W79283 Hs.35962 ESTs 3.9 

102507 U52154 Hs.193044 potassium liwanJIy-recfiiying chanriel 3.9 

126029 AA704253 H5.16g3S9 ESTs 3.9 

107168 W57578 Hs,237955 RAB7, member RAS oncogene family 3.9 

105808 AI133161 Hs.286131 CGI-101 protein 3.9 

30 134087 U51166 H5.17^4 thymine^NAglycoeylase 3J 

133195 AI434760 HsJ!79949 KIAA1007 protein 3.9 

112996 BE276112 H5.7165 dnc finger protdn 259 3^9 

129428 AA25e906 Hs,111364 ESTs. Wealdy similar to ubiquitous TPRm 3.9 

118967 At668670 Hs.216758 ESTs 3.9 

35 127335 AAB30210 Ks.214263 ESTs, Moderately Mar to ALU1.HUMAN A 3.9 

106636 AW958037 Hs.286 libosomal protelii U 3.9 

133529 W45623 Hs.74571 ADP-ftoytdtion factor 1 3.9 

102266 U29725 Hs.3080 mitogen-acQvated protein kinase 7 3.9 

125905 A1678638 Hs.6456 chaperonin containing TCPl.subunil 2 (b 3.9 

40 103080 AU077231 Hs.82932 cycSnDI (PRADIiparathyrotdadenomatos 3.9 

131971 BE567100 Ks.154938 hypotheScal protein MDS025 ^9 

129703 6E388665 Hs.179999 Homo sapiens, done IMAGE:3457003,mRNA 3.9 

128895 AW467000 Hs.106985 ESTs 3.9 

110924 AW058463 Hs.12940 zin&flngers and homeoboxes 1 3.9 

45 112921 R91095 Hs.4276 WAAITOI protein 19 

120820 AA347417 Hs.95869 EST 18 

106459 AA789081 H8.4029 gHoma^IHled sequenco^l 3.8 

114253 BE149866 H5.14831 Homo sapiens, StinDar to 2irc finger pro 18 

105476 AL117352 Hs,77196 Human DNA sequence from done RP&876B10 3.8 

SO 122682 AA984531 Hs.159293 ESTs 3.8 

130580 N32388 Hs.334370 uncharacterized hypoOialamus protein HBE 3.8 

130174 M29551 Hs.151531 protein phosphatase 3 (fbmierly 2B), cat 3.8 

llSSn AA393167 Hs.41294 ESTs 3.8 

129785 H1900B Hs.184780 ESTs 3.8 

55 126127 N95428 gb:zb80d09.8l Soares.senescentJbrablas 3.8 

109793 F13088 Hs.8040 heparan suSate (glucosamine) 3-0«llfbt 3.8 

103175 X69089 HsJ9227 myomesin (M^xotein) 2 (165kO) 3.8 

100154 H60720 Hs.81892 KIAA0101 gene product 3.8 

106140 AB006624 Hs.14912 KIAA0266 protein 3.8 

60 129052 8E275031 Hs.158210 hypothetical protein M6C2S55 3.8 

103319 X63492 Hs.82359 tumor necrosis factor receptor superfanii 3.8 

106319 W22335 Hs.7392 hypothetical protdn MGC3ig9 3.8 

102391 AA296874 Hs.77494 deoDcyguanosine Idnase 3.8 

127262 AA828125 gbMd71a09.s1NCLCGAP.Ov2 Homo sapiens 3.7 

65 126872 AW450979 gb:UI-H^»alM.124Hn.s1 NCLCGAP.Su 3.7 

107315 AA316241 HS.90G91 nudeophosmtnMudeoplasmin 3 3.7 

113119 T47910 gb:yb16b11.s18tratagene fey spleen (9 3.7 

126962 R12014 H8.20976 ESTb 3.7 

116203 AW137166 Hs.873C6 ESTs 3.7 

70 123377 AW969183 Hs.271297 ESTs 3.7 

122798 AW366288 85.145696 8piidngf^r(CC1^ 3.7 

107780 AA018927 Hs.269213 ESTs 17 

132733 AB020631 Hs.123654 PCFIIphomolog 17 

120849 AA687322 Hs.192B43 leudne zipper protein fn<SG14 3.7 

75 131163 AAD99524 Hs23754 ESTs 17 

126708 AW982593 Hs.135260 ESTs 17 

117417 AI241733 Hs.43871 ESTs 17 

106272 AW959731 Hs.323099 ESTs 17 

110834 AW273860 Hs.5759 ESTs 3.7 

80 123663 AL039482 Ks.142517 Homo sapiens mRNA;cONAOKFZp434P0810(f 17 

124250 AA350256 EST. Wea]dyGbnlarto210926QA8celg 17 

127038 AA233014 HS.10S985 ESTs 17 

107711 W96141 Wasm ESTs 17 

787 
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130850 AB040922 Hs^Z37 DKFZPSG6Cm protein ZJ 

119126 R45175 HS.1171B3 ESTs 3.7 

125466 R08234 Hs.180461 ESTs 3.6 

1051S0 AAB31977 Hs.155995 KIAA0643pratB!n 3.6 

5 103163 AU077016 Hs.3235 keratin 4 3.6 

104495 AW975687 Hs.292979 ESTs 3.6 

107599 AVy664072 H5.60136 ESTs 3.6 

113577 A13a0699 Hs.111334 PR00470 protein 3.6 

102681 Y06B90 Hs.113503 toyopheiin (impoitin) beta 3 3.6 

10 109411 R98881 Ks.109655 6excombonm!dieg(DrosophI)aHite1 3.6 

127563 AB035B98 Hs.150587 kinesiivlike pTOtebi 2 3.6 

131112 H15302 Hs.168350 Homo sapiens niRNA;cDNM)KFZp566A1046{f 3.6 

114118 F01598 Hs.175930 ESTs 3.6 

103937 AAd34063 Ks.lSSSG ESTs, VfeaHysirnBar to 138022 hypotheli 3.6 

15 125174 W51835 H3.231082 EST 3.6 

104799 AA029703 gb:z09SliO8.8l Soares.(etaLheajLNbHH19W 3.6 

126952 AID43463 Hs.6755 RaP2 interacting protdn 8 3.6 

135191 X16668 Ks.333497 cytochrome P450, subfamily IID (debilsoq 3.6 

124367 AI683183 H5J9348 distaMess homeo box 5 3.6 

20 113560 T9101S ESTs 3.6 

119232 AI655226 H8.117659 ESTs. Wbaldy similar to T46481liypolheO 3.6 

113988 W87536 Hs.36473 ESTs, WeaMy similar to JC5238oalaclosy 3.6 

115173 BE612940 Hs.88252 ESTs 16 

126500 AA69^9 Hs.191385 ESTs 3.6 

25 127256 AI738610 Hs.267967 ESTs. Moderately stmllar to ALU8_HUMAN A U 

123419 T66087 H8.112482 Homo sapiens unloiownnil^ sequence 3.6 

102025 U04045 Hs.78934 mutS(&coli)lK)molog 2 (colon cancer. 3.6 

100661 BE623001 Hs.132748 Homo sapiens ribosomai protein L39 mRNA, 3.6 

132339 D80030 Ks.45127 chondroitln sulfate proteoglycan 5 (neur 3.6 

30 115691 AW190215 Hs.62348 hypottietlcsl protein FU1 1753 3.6 

134853 BE266328 HS.902B0 5<nilnoln)idazoto44»boxamiderfboniide 3.6 

134075 Nl\t012201 Hs.78979 Gotgi apparatus protein 1 3.6 

128468 T23625 Hs.150580 putaSve translation bMion factor 3.6 

127229 AA316181 Hs.61635 sbc transmembrane epithelial antigen of 16 

35 108451 AA079195 fib3m92h12.s1 Stiatagene ovarian cancer 3.5 

133421 AF134160 Hs.7327 ctaudlnl 35 

135332 AW393883 Ks.98968 hypotheScal protein FU23058 3.5 

132520 AA257992 Hs.50651 Janus kinase 1 (a protein tyrosine Idnas 3.5 

125562 A1494372 Hs.98968 hypothetical protein aJ23058 3.5 

40 126996 BE161065 Hs.167531 methylcrotonoyl-Goenzyme A carboxylase 2 3.5 

125558 1^9305 gb7h16c10.r1 Soarosbtfentbrdn 1NIBH 15 

117265 AA4S1966 RAB9^ protein 3.5 

130215 BE301683 Hs.152707 gttoblastoma amprified sequence 3.5 

109482 AI002238 Hs.11482 8p!k:lng factor, arglnlne/^erine^ 11 3.5 

45 133726 A1803188 Hs.252716 oxysterol^^nding protein-rolated protei 3.5 

132317 BE262438 Hs.44592 beta-1.4 manni^transferase 3.5 

117613 AW029507 Hs.161102 ESTs 3,5 

100944 107518 mucin 6. gastric 3.5 

105226 R58958 Hs.26608 hypothefical protein M6C1 5880 15 

50 125032 T74884 gb:yc58d02.s1 Strstogene liver (937224) 15 

123720 AA609734 Hs.1 12755 EST 15 

128846 AA730767 K5.285753 S(X3104iIcei)roteIn 15 

116443 AW962196 H8.339808 LBP protein 32 15 

128770 AB01S982 Hs.143460 protein UnaseCnu 3.5 

55 106916 AA076049 • Hs.274415 Homo sapiens cDNAFU10229^.cton8 HE 15 

131244 AI638429 K3.24763 RAN btoding protein 1 15 

128765 AFO73310 Hs.143848 InsuHn receptor substrate 2 15 

111223 AA852773 Ks.334a38 KIAA1B66protein 15 

104857 A1920g02 Hs.19068 ESTs, Moderately simitar to S65657dpha 15 

60 105395 AI580880 H5.268149 putaOve methyttransterase 15 

133582 BE391579 Hs.75087 Fas-oGvated serine/threonine Idnase 15 

128527 AA504583 Hs.101047 transcripibn factor 3 (E2Aimmunogtobul 3.5 

106367 AA504747 Hs.136102 K1AA0853 protein 15 

103392 X94563 gb:R8ap(ensdbitebp9eneexon1&2. 15 

65 101086 AA362524 Hs.2S09S9 histatini 15 

133423 T840e4 K5.ig6008 Homo sapiens cDNAFU11723fis. done HE 15 

117714 N45226 Hs.46495 EST 15 

110986 AL133117 Ks.81376 Homo saptens mRNA; cONADKFZp586L1121(f 15 

114096 AF080219 Hs.27007 chromosome condensation 1-like 14 

70 117147 AW901347 Hs.38592 hypothetical protdn aJ23342 14 

133347 BE257758 Hs,71475 add ctoster protein 33 3.4 

132456 AB011084 Hs.48924 KiAAD512genaprodud:AtBC2 14 

133765 M62194 Hs.75929 c8dheim11,type2,C3B<8dheiin(osteob 14 

109734 Ai927212 Hs.3734 ESTs 14 

75 132788 BE083422 H5.56851 hypothefical protein MGC2668 14 

106685 AII076617 Hs.16251 cteavageandpotyadenytoOonspodScfa 14 

105593 AA279341 H8.174151 atdahyde oxidase 1 14 

131381 M92642 K3^208 collagen, type XVI. dpha 1 14 

108887 BE544475 Hs.54347 ESTs 14 

80 113115 AI141426 Hs.8705 ESTs 14 

107234 AA972447 Hs.288833 Homo sapiens mRNA; cDNADKFZp434K087(l^ 14 

123110 AA4862S6 Hs.103510 EST 14 
131019 W28614 chortodc somatom a mm o tropin honnone1(p 14 
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129079 AK000157 Hs.108502 hypofheOca! pnM FU20150 14 

111122 N63753 H3.16492 DKFZP564Q2022 protsht 3.4 

103934 BE278111 Hs.134200 DKFZP564C188 protein 3.4 

^ 128671 A1885045 Hs.211588 ptospholhosilU8^nase.regulalOfysu 3.4 

S 123258 AM90929 Hs.105274 ESTs. Weakly sMlarbRMSIJUiMANREGUL 3.4 

128826 Z40313 Hs.10633D Homo sa;^ done IMAGE:23371.ntf^seq 3.4 

120149 AA227609 Ks.94834 ESTs 3.4 

128817 BE395776 Hs.168640 snkytosis. progressive (mousd) homOlog 3.4 

110679 AA004798 Hs.108311 ESTs, WeaUy similar to T00351 hypoOieli 3.4 

10 122385 AA813546 H3.99034 GTP-binding protein Rho7 3.4 

134455 BE378152 H5.83S30 hypothelicat protein 3.4 

115506 AB037756 KS.4S207 hypotheQcal protein K1AA1335 14 

134267 Ai174596 KS.1S6209 RAE1 (RNA export 1.S.ponte}homalog 14 

^. 106691 M443164 Hs.232S9 hypothetica] prot^ FU13433 14 

IS 105169 BE245294 H3.180769 S164 protein 14 

120120 BE547267 Hs.59791 Ivypotheflcal protein M6C1 3183 14 

126638 AA649257 Hs.66998 ESTs 14 

128531 H03721 Ks.63236 libosomal protein S15a 13 

108650 A1049951 Hs.22370 Homo sapiens mRNA;cONADKFZpS64O01 22 (f 13 

20 108859 AL121500 ESTs 13 

112276 R53442 Hi26038 ESTs. WeSkty similar to 138022 hypotheS 3.3 

125693 H23989 Hs.169743 Homo sapiens done 25121 neuronal 13 

113474 R50752 H5.23656 hypothetical protein MGC5297 13 

132994 AA112748 Hs,279905 don8HQ0310PRO0310p1 13 

25 124691 R05635 Hs.110153 ESTs 13 

112511 AW97M20 dtynaCOn 2 (pSO) 13 

132985 AIJD45579 Hs.62113 KIAA0717prot^ 13 

125743 H17151 Hs.7416 gb7m37a05.i1 Scares Infant brain 1N1BH 13 

^. 133363 Ai866288 Hs.7ig62 ESTs, Weakly similar to B36298prQlinef 13 

30 107908 AF087999 ESTs 13 

105312 BE613348 Hs.211579 melanoma oeOadlwslonmolecde 13 

110256 H63947 K$.23795S RAB7, member RAS oncogene famOy 13 

132073 AA2g5052 Hs.38S16 Homo sapiens, done MGC:15887.mRl^ com 13 

107090 AW809208 Hs.183297 0KFZP566F2124 protein 3.3 

35 105463 AA825974 Hs.d2646 hypotheScal protein FU21901 13 

109592 A11980S9 Hs.26370 ESTs 13 

113649 N94768 Hs.16400 ESTs, Weakly sbnilar to KIAA1435 protein 13 

103898 AA248884 gb:k3517.8eq.F Human fetal heart. Umbda 3.3 

116439 AA251594 Hs.43913 RBFl gene product 13 

40 129535 AA397972 Hs.1 12603 cNmerin (chlmaeiin) 1 13 

113283 T66813 Hs.12947 EST 13 

12S992 K04150 Hs.107708 ESTs 13 

133160 N54968 Hs.G6309 hypothetic^ protein M6C1 1061 13 

134076 AR)86215 Ks.78980 gb:Homo sapiens ftiH length insert cDNA 3.3 

45 128301 U90552 H3.2&4283 butyrophilin. subfamily 3, member A1 , 13 

127728 AW404061 protein kinase C, beta 1 13 

126516 R95872 Hs.117572 chemokina Ending protein 2 13 

127506 T61039 Hs.76067 ifbosomal piotetn LlOa 3.3 

104769 AA025887 Hs.293943 hypothetical protein M6C1 1266 13 

50 126666 AA648886 Hs.151999 ESTs 3.3 

130453 U80735 Hs.173854 PAX transcription activatjon domain We 3.3 

107131 AW961605 Hs.21145 hypothetk:aI protein RG083M05.2 13 

130422 AW160614 H8.100034 deavagasthnuIallonMir.Srpre^ 13 

121292 AA401807 g|b:zv8Sf11.s1Soares_totaU^.Nb2HF8. 13 

55 123284 AA488g88 Ks.293796 ESTs 13 

130734 AW137091 Hs.18624 KiAA1052 protein 13 

105400 AF198620 Hs.10283 RNA binding motif protein 8A 13 

105014 AA121123 Hs.269267 EST6,WeaUyslmBartoAF1613611 HSPCO 13 

114988 AA251089 gb3s04105.8l NCLCGAP.GCB1 Homo sapiens 13 

60 132867 AF226667 H5.58553 CTPsynftaseP 13 

124169 BE079334 Hs.271630 ESTs 13 

114652 AI521936 Hs.107149 novel protein similar to archaeal, yeast 13 

113876 AI799751 H5.5635 ESTs 13 

111520 AI985369 Ks.301134 ESTs 13 

65 121748 BE536911 Hs.234545 hypothetk^at protein NUF2R ' 3.3 

106834 AIJ044182 Hs.2B070 KIAA0753geneprDdud 3.3 

128869 AA766242 Hs.80618 hypothetical protein 13 

129619 AA209534 Hs.284243 tetraspan ^ET•6 protein 13 

126770 AI2g2320 Hs.81361 heterogeneous nudear (ibonudeoprotdn 13 

70 115734 AW900992 Hs.93796 OKFZP586D2223 protein 3.3 

100253 D38024 H5.157425 double homeobox. 2 3.3 

130999 AA326683 Hs.21992 likely ortholog of mouse vailanlpoiyade 13 

114092 H81213 Hs.14825 ESTs. WeaUy similar to KIAA1503 protein 13 

__ 129284 AA318224 Hs.296141 ESTs 13 

75 119127 AA708035 Hs.12248 ESTs 13 

123910 AA621262 Hs.179923 ESTs. WeaUysMbr to 866657 8lpha>1C- 13 

132664 Ai740461 Hs.S4542 ESTs 13 

118397 BE139479 Hs.161492 ESTs 3.3 

^. 107003 AW138437 Hs.24790 KIAA1 573 protein 13 

80 126735 mi13 Hs.226795 g^Aalhione S^transfersse pi 12 

130847 A1672483 Hs.20220 Ppase protein 12 

101188 AA020956 Hs.17S881 core-binding factor, beta subunU 12 

113626 T94318 Hs.17359 ESTs, Moderately sinflar to RL44J1UMAN 6 12 
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121782 AW452957 Hs.63348 Homo sapiens, done MGC:15203,mRHA.COffl 3.2 

11Sa63 AA081218 HS.SB608 Homo sai^enscDNAFlJ14205 lis. done NT 3.2 

106103 6E62D779 Ks.12094 mitochondria) rlbosomal protein L30 3^ 

123608 AA620552 gbseSSglUISbalaeenetuno carcinoma 3^ 

5 133761 AF041430 Hs.75922 brain protein 13 3.2 

110561 AA379597 Hs.5199 HSPC1 50 protein Similar to uttquilhw»n 3.2 

115239 BE25132B Hs.73291 hypothefcai protein FU 10881 3.2 

117457 N29682 Hs.44071 ESTs.WeaWysimnaf toALU5J<UMANALUS 3.2 

128862 BE250742 Hs.106673 eukaryoCctTanstaOoninliiaSon factor 3.2 

10 129673 D3BS52 Hs.1191 K!AA0073 protein 3.2 

105154 AA307279 Hs.35947 nethyKHiG binding domain protein 4 a2 

126722 N66146 H3.1112S HSPO033 protein 3.2 

109966 H09103 Hs,30897 EST 3.2 

129315 N014563 H3.174038 spondyloepiphyseal dysplasia, late 3.2 

IS 109517 AI631874 Hs.1S5140 casein kinase 2, alpha 1 poiypepCde 3.2 

105252 ABO39670 HS.972B ALEXI protein 3.2 

101754 S70114 HS.2394B9 TIA1 cytotoxic granulMSSOciatedRNA^i 3.2 

110384 H452a2 Hs.2687g8 ESTs 3.2 

134118 BE336680 Hs.182877 WAAO1 16 protein 3.2 

20 134869 AL157518 Hs.90421 PR02463 protein 3.2 

100780 BES6195B H8.293441 bnmunogtobuOn heavy constant mu 3^ 

125728 AW954565 Hs37987 B^sliaUlymphomallB (zinc linger pro a2 

129794 AF161399 hypothetical protein FU13433 3.2 

129056 AI769958 Hs.108336 ESTs, WeaJdy similar to ALUEJ^UMAN All 3.2 

25 107129 AC004770 Hs.4756 flapstnA^ture-spediicendonuciease 1 3.2 

103038 AA926960 COC28 protein kinase 1 3.2 

130553 AF062649 Hs.252587 pituitery lumor-lransfonnlng 1 3.2 

125394 BE178502 H8.173772 ESTs, WeaWy similar to 178885 sorlnafth Z.i 

132305 AI806090 Hs.4«44 hypothetical protehiFlJ20534 3.2 

30 131136 AB033099 Hs.23413 KIAA1273 protein 3.2 

102983 6E387202 Hs.118638 nofrmeta8tellicoeRs1,pfOtdn(NM23A) a2 

117639 AA377165 Hs.44833 ESTs 3.2 

127076 A1422951 Hs.146162 ESTs 3.2 

126153 K85692 Hs.40730 ESITs 3.2 

35 132676 N92589 Hs.261038 ESTs. WeaHy simBar to 138022 hypotheS 3.2 

104948 AW242407 Hs.73848 carctnoembryonlc anllge^relatfid cefl ad 12 

122110 AI123000 Hs.301240 melanooor^ 1 receptor (alpha melanocyt 12 

127705 AJ003322 gbAI003322 Selected diromosome 21 cDNA 12 

109442 AW296134 Hs.86999 ESTs. Weakly sImHar to S65657 alpha-lC- 12 

40 100474 NM.000699 Hs.30O280 amytase. alpha 2A; pancmatk: 12 

115536 AK001468 Hs.62180 anillin (Drosophfla Scraps homolog), act 3.2 

119651 AW863858 H8.333513 sm^indudbte cytokine subfamily E. me 12 

127211 AA30552D Hs.108812 hypothetical protein inJ22004 12 

134964 At803516 Hs.272891 hippocatelivlike protein 4 11 

45 105551 AW005822 Hs.25292 r9x)nudease HI, large subunit 11 

119750 A1538880 Hs.94812 ESTs 11 

125590 R23858 Hs.143375 Homo sapiens, done IMAG£:3840937, mlV^ 11 

104590 AW373062 nudearreceplorsubrarniiy l.graupKm 11 

110724 AW016783 Hs.30799 Homo sapiens cDNAFU13471ll8. done PL 11 

50 116429 AF191018 Hs.279923 putaSve nudeoQde binding protein, est 11 

133915 AAB15092 H8.77554 Homo sapiens CDNAFU14967 lis, done TH 11 

128721 AW403911 Hs.266175 phosphoproteinassQdated wIhGEMs 11 

128536 R44214 Hs.101189 ESTs 11 

129179 AW69025 Hs.109154 ESTs 11 

55 108793 AA129395 Hs.71139 ESTs 11 

108807 AI652236 Hs.49376 hypotheflcal protein FU20844 11 

133461 NM.000762 Hs.334345 cytodiromeP450, subfamily IIA(phenobar 11 

132571 AW674699 Ks.5169 suppressor of G2dlBte of SKP1, 8. oere 11 

134937 AI251449 Hs.171939 ESTs 11 

60 125756 BE174587 Hs.28g721 growth arrest spedfic transcript 5 11 

112369 AW956243 Hs,4243 hypothetical protein RJI 2650 11 

127002 AL353940 Hs.24979 hypoOiefical protein DKFZp761P1010 11 

128179 AW293689 Hs.127116 ESTs 11 

117121 H95044 Hs.321386 EST 11 

65 126556 AF2S5303 Hs.112227 membranMSSOdatednudete add binding 11 

128403 AI908006 Ks.295382 Homo saptenscDNAHJ 144591b, done HE 11 

127930 AA809672 Hs.123304 ESTs 11 

114250 AI914699 H5.13297 ESTs 11 

108828 AKD01693 Hs.273344 DKF2P564O0463 protein 11 

70 105225 AA211777 |^2n57d02.8l Strategenemusde 937209 H 11 

117997 N52090 Hs.47420 EST 3.1 

104558 f^78 H8.88959 hypothetteal protein MGC481 6 11 

124777 R41933 Hs.140237 ESTs. Weakly similar to ALU1_HUMAN ALUS 11 

106035 N35568 Hs.5245 hypothetical prot* FU20643 11 

75 127621 AW297206 Hs.164018 ESTs 11 

120215 AF109219 Hs.108787 phosphaBdyBnositol glycan. dass N 11 

119403 AL117554 Hs.119908 nudeolar protein NOP5/NOP58 11 

105024 AA126311 Hs^879 ESTs 11 

123485 AI308876 Hs.103849 hypotheflcd protein DKFZpT61D1 12 11 

80 109416 BE268388 Hs.8694$ ESTs.Wealdy8lniflartoA46010X4teked 11 

132490 NK4.O0129O Hs.4980 UM domain Uiiding 2 11 

115348 AA281S62 Hs.768 ESTs 11 



117297 AW779829 gbJinB8a05J(1 NQ.CGAPJQdll Homosapien 11 
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112501 
107532 
130753 
132425 
1151B8 
129707 
108474 
106472 
102687 
117602 
105522 
133650 
134624 
112435 
134654 
106651 
109597 
132342 
106057 
105378 
128660 
115467 
128604 
126219 
126698 
133966 
119155 
131566 
100237 
105515 
123073 
111375 
130724 
129928 
118922 
131902 
125165 
122219 
132195 
102298 
103286 
130166 
126997 
128902 
117186 
126096 
106711 
114046 
135029 
112003 
105735 
126628 
133350 
110312 
125659 
129601 
123423 
128695 
123470 
109252 
101445 
102126 
115475 
106573 
103106 
130755 
112246 
126887 
117960 
133826 
113179 



AA972447 Hs.288833 Homo sapiens mRNA:cONAOKFZp434KDB7(fr 3.1 

AA443473 Hs.l73684 Homo salens fnRNA;cDNAOKFZp762G2D7(fr 3.1 

AA205223 H3.189 phosphodiesterase 4(;cAMP-6pdSc (dun 3.1 

N87549 Hs.125287 zinc finger piotelnZNF14(ma protein 3.1 

AKDQ0219 Hs.88367 hypolhetica]pn)lBlnFU20212 3^1 

AW572317 K3.12082 Homo sapiens mRNA;cONADKFZ0S66l2O3(lr 11 

BE383668 Hs.42484 hypolhetical protdn FU10618 3.1 

A1207162 Hs.3815 staawrin-nkeiirDteln RB3 3.0 

NM_007019 Hs.93002 ub{qu!fin canier pioteln E2-C 3.0 

N35020 Hs.44685 C3HC44ike zbic finger protein 3.0 

W19789 Hs.336635 Homo sapiens, done IMAGE:4179482.mRNA ao 

D84294 Hs.1 18174 tetralricopeptkte repeal dom^ 3 3.0 

ARJ35119 H3.8700 deteted in fiver cancer 1 3.0 

NM.013255 H$.288791 muslceBn LintraceDidar mediator conta 3.0 

AK001741 Hs.8739 hypothetical protein FU10879 3.0 

AA460421 Hs.30875 ESTs 3.0 

AA989362 H9.293780 ESTs 3.0 

AW162758 Hs.341729 ESTs. Weakly similar to ALUSJiUMAN ALU 8 3.0 

BE614474 Hs.289074 F-box only protein 22 3.0 

AA824298 Hs.21331 hypothetical protein FU10036 10 

AA011597 Hs.177398 ESTs 10 

AI366784 Hs.48820 TATA box Iilmfing protein (TBP)«soclatB 10 

Ai87g099 H3.1 02397 GlOT-3 for gonadotropin inducible transc 3.0 

A1804331 Hs.99423 ATPnlependent RNA heOcase 10 

AI221147 HS.14S088 ESTs, Wealdy similar Id T1S936hypotheti 10 

6E280478 Hs.182695 hypothetical proton MGC3243 IP 

R61715 Hs310598 ESTs. Moderately simOar to ALUI.HUMAN A 10 

AA450352 Hs.26966 MAAl 171 protein 10 

D3071 5 Human PAP (pancreaGOs-assodaled prots 10 

T24968 Hs.23038 HSPC071 protein 10 

AA485061 Hs.105652 ESTs 10 

H56499 Hs252692 ESTs. Weakly similar to 138022 hypotheti 3.0 

AKD01507 H8.44143 Homo sapfenscfc3neFLB6914PR01 821 mRNA. 10 

AI338993 Hs.134535 ESTs 10 

AW206193 hypothetical protein DKFZp761B2423 ■ 10 

AA180145 Hs.34348 Homo sapiens mRNA:cDNADKFZp434P0235(f 10 

W45350 g^2c81h08.&1 PancreaSc Islet Homo sapi 10 

AA4360Q2 Hs,183161 ESTs 10 

BE018717 Hs.42124 ESTs 10 

AA382169 Hs.54483 N^nyc (and STAT) interactor 3.0 

D38616 Hs.54941 phosphoiylase kinase, dpha 2 (liver) 3.0 

AK001389 Hs.15144 hypotheticai protein DKFZp564O043 3.0 

AI3771S0 H8.150914 EST$ 10 

AA036637 Hs.107052 ESTs 10 

H96988 Hs.42612 ESTs, WeaMy similar to ALUIJiUMAN ALUS 10 

F08208 Hs.283844 sln^torattricarboxytatecantor-^l 3.0 

B£3g0125 H8.143187 hypotheScal protein 10 

BE0186S8 H8.141003 Homo sapiens cDNA:FU21691 lis, ctoneC 3.0 

H58818 hydroxysferotd(17-betfli) dehydrogenase 7 3.0 

AW97e731 Hs.301824 hypothetical protein PR01 331 10 

R77698 H5.337778 ESTs 10 

N49776 Hs.170994 hypothetical protein M6C10946 10 

AI499220 Hs.71573 hypothetical protetoFU10074 10 

BE256988 HS.11B96 hypothetical protoInFU12089 10 

T57693 Hs.87929 Homo sapiens cDNAFU13707fis, done PL 10 

AB032964 Hs.115728 KIAA1138 protein 10 

AAS98464 * gb»e36l04^1 GessIerWtlms tumor Homos 10 

N(yL003478 Hs.101299 cUlHnS 3.0 

AW303285 Human DMA sequence fmm done RP11-110H4 10 

BE440157 Ks.85944 ESTs 10 

M21259 gb:HumanAhi repeals b the regkm 5* to 10 

AW950870 Ks.78961 protein phosphatase 1. regulatory (Inhlb 10 

AA733050 Hs.334612 small nudearribonucteoproleinpolypepl 10 

AA223447 H3.12835 A kinase (PRKA) anchor protein 7 10 

W27172 Hs.1857 phosphocfiasterase 6G, cGMP-spedHc, rod 3.0 

BE293520 Hs.18910 prosiBto cancer overe}q}ressed gene 1 10 

1^1321 Hs.25780 Homo sapiens cDNAFU 12252 lis, done MA 10 

K20832 gb7m48d01s18o8mlnfBntbrain1NIBH 10 

AA310417 Hs.47044 ESTs 10 

AW836130 Hs,75277 hypotheJfcal protein FU 13910 10 

BE622021 Hs.152571 ESTs. Highly similar to IGFm mRNA4)lnd 10 

H09232 Hs.26484 HIRA^teracUng protein 3 10 

AA143045 HsJIGSS v^HaRly-Zudcennan4fiBDne8aioomav 11 
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TABLE 71B 

Pkey: Unique Eos probeset idenliiier number 
CAT number. Gene duster number 
Accession: Genttank accession nunters 



Pkey 



CAT Number 
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108451 


13766 27 


107908 


46987J 


123819 


371681.1 


131495 


142008.1 


101445 


1650.-5 


100944 


25682.2 


117265 


10421J 


117297 


647718J 


108859 


137143.1 


125165 


18S2047_1 


116790 


19864.1 



103038 15024.1 



126086 
126098 
125464 
126127 
125558 
126426 



127262 
127315 
103898 
126769 
120147 
128080 
104590 



127496 
126872 

126887 
126132 
126987 
120325 
112511 



1606216.1 

1629789_1 

168460_1 

1205826.1 

1703083.1 

110687.1 



118644 81501.1 



231725.1 

37938.1 

187213.-3 

119008.1 

386607_2 

1540039.1 

44964.1 



340470.1 
142696.1 

1572189.1 
177108.1 
169750.1 



17406^ 



AAD79195AAD84955AA125308AA0M956 _ 

Aro87999 N29296 AI92B858 AAB46757 N20229 AA026894 080634 AW388923 AW388802 AW388957 AW571771 AW388839 

AI686682AW389079AA602099AW134567AV»17843A!682674AM74874AA374167 

AA602984AA609200 

AA812434 AI831542 AA766281 AI769894 N63376 AA2143S2 AA135933 AW605017 AA135965 AW450072 H04812 AA446459 
AW4391 51 AM26273 AI061423 AA598549 Z40087 AI753216 AA743075 AAB54268 AA458920 AAB43893 Ai566516 AI884957 
A)566634AW971760AA211796AA430089 
M21259 

L07518AW815215 

AA4S1966 NM_016370 AB036693 AL139228 R5B124AI634847 ALII 9333 W07356At334284 H2g050A] 192685 AA552438 
AW172B43W19794 N21460AI743862AW130622AA991348AI204553AA992664 N8084aAA699329AI824676R2^ 
AW779829 N22494AI248120 

AL1 21 500 AW291763 AI732432 AA129708 AA133309 AI733750 AI225224 
W45350W45406 

AVV161357 AI879062 A1928938 AW161097 AW161167 BE314465 AA351715 F07096 AA179034 FOBSIO F00653 AI936671 
AA476718 AW772454 AI807703 R44253 AA976667 AI9B5186 A!650254 H38942 R84829 AA018724 AA001000 H8S934 AA019126 
H85609 AA017000 AA339355 AW950556 D51397 AA213981 BE548002 AI056359 AA001560 AW952113 AA317769 A1857477 
AI857475AW249771 AW162561 H38943AA01B628 R85885A1984613AI934765AI796172AW157488AI929191 R85523 051221 
053851 H85610 AI749674 F21582 M323145 AA019127 AA687444 T06745 AI699293 H29532 AA214029 AA223656 NM.016834 
X14474 R19697 H09695 R17455 R1 3812 R190S6 AI881231 A1590200 R37671 AA861828 AI990O23 At93566S AW005021 AA324581 
H17335 R37659 R42802 R46242 R60936 R59731 H28993 AA479907 R44570 AI890696 AA308884 AA507078 R41 274 AI365507 
T16348 AI560453 F03259 F04722T16312 AA016081 AW073061 BE314824 W28930 R44098 R51045 
AA926960 AA926959 W76521 W24270 W21526 AA037172 BE267636 H83186 AA459909 N86396 AA001348 BE535736 AA081745 
BE566245 AAD82436 H72S25 H77575 N49786 W80555 H78746 BE569085 W04339 R9B127 T5593B BE279271 AW960304 T29812 
AA476673 BE297387 AA2927S3 AA177048 NM.001 826 X54941 BE314366 AA908783 At719075 88270172 BE26981 9 AA889955 
AI204630 W2S243 At9351S0 AA872039 W72395T99830AI422691 H98460 N31428 BE2S5916 K03265 A(857576 AA776920 
AA910644 AA459522 AA293140 AW514667 R75853 AW662396 AA662522 AI865147 AI423153 AW262230 AA584410 AA583187 
AW024595 AW069734 AI828996 AA282997 AA876046 AW613002 AA527373 AW972459 A1831360 AA621 337 AA100926 AA772418 
AA594628 A1033892 W95096 At034317 AA398727 A)085031 N9521 0 AI459432 A!041437 AA932124 AA627684 AA935829 
AI004827 A1423S13AI094597 H42079 R54703AI630359AA617681 AA978045AA643280W44561 AI991988AI537692AI090262 
AA740817 AI312104 AI911822 AA416871 AI1B5409 AA129784 AA701623 AI075239 AI139549 AA633648 AI339996 AI336880 
AA399239 AI07B708 AI085351 AI362B35AI346618A1146955AI989380 A1348243 N92892 AA7B5850 AI494230 A1278887 AA962595 
AI492600 W80435 AA001979 R97424 AI129015 N24127 AA157451 AA235549 AA459292 AA037114 AA12g785 AI494211 
AW059601 AW886710 R92790 N59755 AI361128 AW589407 H47725 H97534 H48076 H48450T99631 AW300758 H03431 R76789 
AA954344 H77576 R96823 AI457100 N92845 N49882 H42038 BE220698 BE220715 ^9552 AA701624 N74173 R54704 H79520 
H72923 H03266 BE281919AA769633AA480310 AA507454AA910S86AI203723AW104725 W25611 W25071 188980 H03513 
T77589 R99156W9S095 R97470AA702275T77551 AA911952 H82958NB3673 AA2B3672 
H75681 H70975 
|yt79088 N88221 
N71B07AA2C3399 
N95428 W24040 AW751366 H81987 
R59305 R19748 

AA125984 AA127189 AA065075 AA070377 AA100017 AAIJ79891 AA113255 AA0f75168 AA082764 AAD83380 N84829 AA084752 

AA076512 AA085119 AA085208 AA085045 ^««^ 

AA443241 R23784 R67255 R00047 AI457612R63254 R28645 R27816H01310R78329 R76048R76055AA369734N57914H94884 
AI953638 R31669 AW769278 R82398 AA131925 R21776 R79031 AI129553 N7O340 AI276116 AA7093B1 N63734 AI342605 
AA327133 AI805880 AA005377 AW611716 T86946 AA369083 R67250 HQ0240 R32578 W8627g R80248 R23734 H00977 R21732 
R78932 AA368675 R16780 R25523 H40161 W93671 R99823 R77956 H61601 R98785 R0973B H78317 H53537 H81056 T53780 
H12600 R82345 AA653499 AW953717 R63207 R33888 H54071 H66993 T48748 AI702300 R30775 H68996 AI014957 AA389082 
AR)75351 A1110886 AI742050 AW954245 AI768458 C19062 AI393674 C18911 C18029 AA708613 AI168432 R67389 AI168453 
H00188R21233 H03055 R53822 AA367558 R75872 W04161 AI2208S9 AI090290 AA368730 R94434 H81153H70950AA36n83 
H81514 HS3536 016968 C17797 C17677 C17064 AA082581 C17044 063217 CI 8791 D78839 D63290 078838 N91085 H54070 
R80360 H78318 H40199 R33887 R02529 H94918 R00900 T87029 AA1 31924 R99891 AA004267 D78764 078757 H51600 C18815 
H60972 R98784 W86323 R09737 C17492 D78811 R57494 078776 AA366244 H12650 D79043 AW957082 R82769 C19002 H01715 
AA828125AAB34883AA330555 
AF116622AI114S07AA640834AA377999 
AA248884 

AA083456AA127028AA084934AA120g72AA08S317 
AI917116 AA633698 AIS80389 Z39483 

F12310T66402 K07020 _ 
AW373062 T55662 AI299190 BE174210 AW579001 H01811 W40186 R87100 AI923888 AW952154 AA628440 AW898607 
AW898816AA70912SAVV896628AW898S44AA947932AW898625AVV898822Al276125Alie5720AW^ 
BE487708 AW243400 AWD43642 A12B8245 AI186932 D52654 DSS017 052715 052477 053933 054679 AI298739 AI146984 
AI922204 N98343 BE174213 AA845571 AI813854 AI214518 At635262 AI139455 AI707807 AI698085 AW884528 AI024768 
AI004723 AW087420 A1565133 r®4964 AI268939 AW513280 AI061126 AI435818 AI859106 AI360508 AI024767 AA513019 
AA757598 X56196AA902959 A1334784 AI860794 AA0102ff7 AW890091 AW513771 A1951391 Ai337671 T52499 AA890205 
A1640908H75g86AA463487AA35B688A(961767A186629SAA780994Aig8S913BE174196AA029094AW59^^ N79072 
At611201 AA910812 AI220713 AW149306 Ar758412 AA045713 R79750 N76096 

AKJ31650AA860882AI458212AA524453 _ ^^^^ 

AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AAB09054 AW238036 BE01 1212 BE01 1359 
BEO11387BEO113666E011362BEO112156EO11365BEO11363 
H20832Z40184 

AA225632 Ai820970 A1820952 AA226472 AI7321 40 A1732D59 AA226307 AA225S00 
AA20S976AA205930 

JwV^M A^ill AA811738 AVV301029BEZ^ AAfl39581 AI424444 AAB10925AA749267BE205843 

AI380184 BE205945 AI761798 AI185049 R94e06 AI343947 AI565B14 AI341735 AI749190 AW259588 AI39314S AI620080 
AW003752 AI750035 AI824837 AI797658 AA679066 N87192 AI937537 AW003753 R72610 R72574 AA757401 AA553744 AA460166 
AA535727 AW304422 R68273 AW027615 C01651 AI090327 AI538258 AW003744 AA93B372 AW051486 AA505513 BE466452 
AA931719 AI174548 AW182752 T84462 R1 1 149 AI767627 BE218556 BE217988 BE222697 AI782382 BE299599 AA987212 
AI767136 A!268928 AA090761'AA329290 AI700593 AA460766 BE222710 BE550651 R11148 AI979278 AW235B19 AW874095 
AW196492 
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106012 96214.1 



1 27705 966283.2 
127726 339B.S 
129012 22280.4 



106498 24S223J 



30 
35 

40 

45 

50 

55 

60 

65 

70 

75 



120934 
115197 



177521.1 
42406.1 



129706 81501.1 



131019 223488.3 



131170 
123470 



123532 
125032 
123808 
102398 
118129 
109841 
118822 



113119 
104799 
113560 
129794 



124250 
101447 
117357 
103392 
135029 
105225 
121292 



100237 
114988 
123423 



8113.1 

8785JZ9 



geitbanMAG08733 

Q8nbanlOr74884 

QenbardUW\620552 

entie^U423S9 

genbanKJ^493 

QenbanlLH01052 

220660.1 



genbanK_T47910 
genban)UVA029703 
genbariCTOIOIS 
39565.1 



314220.1 

entrsz.M21305 

genbank.N24829 

entreO<94563 

H58816.8t 

fienbanlLAA211777 

genbanKJVA401807 

82640.1 



Bntre^D30715 

9enbanlLAA251089 

genbanU\AS98484 



AI240665 T53681 N77488 H51833 AA147247 R75732 C18450 R73999 AI095755 T49904 H03868 AA41 1 580 R33395 AA410586 
T48869 063292 R31 981 HI 2498 H02668 AA035018 R75957 AI803329 R27528 R38203 AI809932 AI808765 R78948 AM1 1449 
AA976929 AI378760 AI378620T48B70 R73906 R75632 H03612 AA909684 N50695 H02580 H12B39 N58781 AA742532 AI360919 
K03502BE208298R685B8A13S0463R3193SAW069127AM11S21 R25671 R3810SH12451 H03869 H51263AAO3548SR251O9 
R25110AA147933 
AJ003322AJ003324 

AW404061 RD9854 T67160 N50566 K532S9 

R81936 078695 D7B806 D78780 C17009 AA004406 AA122102 R70625 AA148932 H59583 H81146 H80378 H49883 H21 182 
H46534 H51478 H20702 846515 H402O0 850046 H23647 H50439 H18383 H21846 H23840 H40182 R21924 H01290 H01283 
H42464 W31947 W39660 AA04S219 H03539 H01741 R22008 AA04391 1 AA1 56838 D78832 R36616 R66039 AM92481 AA088608 
R68918R35334 HB0281 N58483AI075154 A1086754AA595787 H81051 H01187 AI057251 T96992 H59584 T47016 R31800 
H13847 H01193 N74560 AA156601 K03455 R65040 R81937 R92416 H89486 R36617 R55795 A1088338 AJ373324 H66992 R96235 
AI494132 R16678 AA088178 AA705356 AA962143 AA148933 R09231 AI160937 R70525 T46980AI200046 H02301 AA367587 
R35968 T97108 D78703 M78072 078668 063268 R28197 AA085579 RB3766 RS2415 W00938 RB07e5 R67875 R27583 R09343 
H13646 R27682 T69007 

AI221919 Z19967 AA348760 AW964077 AW166028 BE540193 N94800 AA452388 N99604 AI341345 AW298800 AA724961 
AA931156 AI741227 AIB06660 AI982626 D812&3 053937 D52496 AA974487 AW043854 N50483 Z39997 A1492g61 AI361526 
R)4002 AA452141 T23551 AI472655 AI193657 A1341984 N92658 T32B70 R52664 N50428 AW089291 A1934175 Ai423737 D60665 
AA2261S8 AA226513 AA383773 

R1 6856 AW968014 AA262288 AW968002 X93079 AJ002788 R51324 At3816W 080031 AW500520 AW593740 F09382 AA810597 
AA262174 AA810595 AA810S96 

AA443241 R23764 RS7255 R00047 AI457612 R53254 R28645 R27616 H01310 R78329 R76046 R76055 AA369734 N57914 H94864 
A195363aR31659 AW769278 R82398 AA131925 R21776 R79031 AI129553 M70340 AI276116AA709381 N63734 AI342605 
AA327133 AI805980 AA005377 AW61 1718 T86946 AA369083 R67250 K00240 R32578 W86279 R80248 R23734 H00977 R21732 
R78932 AA368675 R16780 R26523 H40161 W93671 R99823 R77956 H61601 R98785 R09738 H78317 H53537 H81056 T53780 
H12600R82345 AA653499 AW953717 R63207 R33888 H54071 K66993T48748 AI702300 R30775 M68996 A1014957 AA369082 
AF075351 AI110886 AI742050 AW954245 AI76845B C18062 AI393874 C18911 C18029 AA708613 AI168432 R67389 AI168453 
H0O188 R21233 H03055 R53822 AA387558 R75872 VWWISI AI220869 A10»290 AA368730 R94434 H81 153 H70950 AA3577B3 . 
H81514H53536C16968C17797C17677C17064 AA082581 CI 7044 063217 CI 8791 D78839 063290 078838 N91 085 H54070 
R80360 H78318 H40199 R33887 R02529 H94918 RD0900 T87029 AA131924 R99891 AA004267 078764 078757 H61600 C18615 
H60972 R98784 W86323 R09737 C17492 D78811 R67494 D78775 AA38B244 H126S0 D79043 AW957062 R827S9 C19002 H0171S 
W28614W27435AI983043AA364395AW572472 AW190386AI129278AI913081 AW473549AA830713AI982871 AI638647 
A1828466AW572486 N52583 K89687AVV075567AI571047A1887479AI559469AI685802AA8052S6AI45877^ 
AI8e6032 A1823925 AI823566 AW198135 AI287510 AIS&S910 AA765775 AI866019 A1263697 A13558K 
AW105585 Ai824555 AW339175 020479 

NM.0142S3 AF100772 BE088769 AL022718 8E161779 AW8635S9 BE161640 AUS9080 BE168542 AW296S54 AA323193 
AA235370 AW779760 N48574 A1375997 R45432 D59344 AI203107 F07491 R35360 R25094 AI91 3631 A1498402 T61382 AI01 6320 
N45526T61415AA331486 

AW3032B5 AW117396 AA888929 BE174517 Z707M AL042292 K5495B AA310063 AW499643 AW501380 AW501202 AW501 293 

AW501596 AW630166 AL041326 AA780690 U0517 U21656 A1093182 AW062487 AA883387 AA931302 BE252601 BE258381 

AA827330 AL043930 AA3S6337 AW962830 AW393814 Z70r03 AW389484 T62231 AW780049 AU)43931 BE149744 AW963292 

BE439669 BE091737 BE091657 AA484004 AI673095 AA599106 AW601S45 AI538739 AI538730AI521786 AW366369 AVW^^ 

AA362S76 A12899Z7 AA382592 

AAe08733 

T74884 ' 

AA620552 

U42359 

N57493 

K01052 

AW206193AW137594 AI953685AI919143 AI424371 AW007698AW025681 AI954289R40438AW070384A!679653AI6790B1 

AI623213 AI652310 AI631411 AI565384 AI566734 AA149597 AI538172 AI040831 AI770021 AI914287 AA27931 1 W73001 AI433117 

AI693374 AI802007 AI990645 N29752 Rg4559 AAS06475 AA806828 R90998 R94S58 AW3001 12 W24097 AW003016 N91920 

T47910 

AA029703 

T91015 

AF093097 A1869509 T27070 AA326718 AA024743 W23922 AA479593 AI765668 W23908 U92986 AA081632 M5057B AA065245 

AW355046 AW365014 AA961091 AA150231 H53425 AA234651 N50S22 T79343 T87399 N9185B H41179 AW009453 AA024744 

727069 AI346379H88431AA152269 n45373AA4n432AI74S607AI807602mi152AA085244AI242S69AI^ 

F03857 H88369 AI174488 A1002B86 853427 T67293 N9ie69 W3S270 AA453723 AA453705 AW071829 AI393866 AW071807 

W23592A1342074 D31168AA833756AI991896AA447791 AI86412SAW377239AA29S365AI687970R34076F07607 AA521310 

AW49g885R94584R21283 

AA350256 H68126 

M21305 

N24829 

X94563 

H5B818 

AA211777 

AA401807 

AA195191AA190S78AI632307AA195227AA743633A]823408AIB32203AI653114AA205307AVVO2191^ 

AI675563 AI417935 AA707350AA135157 AI434721 AI151038 AI038305 R52843 AA780141 AW207646 D19691 AI474370 

AA401739 N22905 N70378 

D30715 

AA2510B9 

AA5g8484 



Table 72A8hoNS the Seq 10 No^ Ptey.ExAocn.Unig8nelD.^ 

80 Pter. Unique Eos probeselidenfonomba- ' 

Exf£cn: Exon^Acoesslbn number. Genbardtanesslonmtnto 

UmeenelO: Unigene nuniber 
IMgenaTItiei UnioenegeneliOe 
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SecflONo: 


Seq to number oonelaOon for (hose sequences in Tdbb 73 


Pkey' 


ExAocn 


UnigenelD 


Unigene Title 


103080 


AU077231 


H$J2932 


cyCQn 01 (PRA01: parathyroid adenomatos 


410102 


AW248508 


Hs.279727 


ESTs; homologue of PBM (Pona savlgnyi 


101104 


AWB62258 


Hs,169266 


neuropeptida Y receptor Y1 


447761 


AF051573 


Hs.19492 


protocacihenn 8 


428183 


AW969726 


Hs.98381 


ESTs, WeaMy simOar to serine protease 


439221 


AA737106 


HS32250 


ESTs. Moderately similar to I7B365 serin 


121619 


AA528339 


Hs.178062 


ESTs, WeaMy sbr^ to phosphalMylseri 


104659 


AW969769 


K5.105201 


ESTs 


105782 


H09748 


Hs^987 


CLUymphonia 1 1B (zinc finger pro 


1^77 


NM.000399 


Hs.1395 


eariy growth response 2(Kr}x-20 (Dr^op 


100299 


D49493 


Hs.2171 


growth differeniiaQon (actor 10 


116301 


AW969706 


Hsi93332 


ESTs 


106533 


AL134708 


Hs.145898 


ESTs 


131313 


R96290 


HSJ5B74 


ribosomal protein L44 


105316 


AI671245 


H3.24835 


trypotheOcal protfibi FU14S94 


113003 


AW292315 


Hs.7215 


ESTs 


102836 


U94320 


HS.1S8330 


.neuropepQite Y receptor Y5 


102745 


AW753885 


Hs.74376 


olfactomedin related ERIocdiized protsl 


123308 


C14187 


Hs.157208 


ESTs 


120147 


AI917116 




hemoglobin, beta 


123049 


BE047680 


Hs^11669 


dickkopf ptenopustaevis) homotog 2 


119082 


AF252297 


H&.91546 


cytoctvTsns P450 le&ioid metaboDdng pr 


10S301 


AW3S2357 


Hs.7457 


MAGE1 protein 


128478 


AA708205 


Ks.100343 


ESTs 


106111 


AW875398 


Hs.6451 


PRO0659 protein 


131307 


Ni^000025 


Hsi549 


adrenergic b6ta-3-. receptor 


120830 


A1566170 


H5.98886 


ESTs 


127664 


AA806164 


Hs.116502 


ESTs 


102725 


AB026187 


Hs.159156 


protocadherin 11 


132520 


AA257992 


Hs^l 


Janus tdnase 1 (a protein tyrosine Unas 


130637 


AA3567&4 


Hs.17109 


integral noerobrane protein 2A 


117602 


N35020 


Hs.44685 


C3HC4-lil;e zinc finger protein 


128797 


NM.002975 


Hs.105927 


stem celi growth (acton lymphocyte seer 


129703 


BE388665 


Hs.179999 


Homo sapiens, done tMAGE'^57003, mRNA 


125770 


AA143045 


HS.8166S 


¥4dt Hardy-Zucicennan 4 feline sarooma v 


414761 


AU077228 


Hs.77256 


enhancer of zeste (Drosophila) homolog 2 


420462 


AF050147 


Hs,97932 


chondnxnoduDn i precursor 


410268 


AA316181 


HS.6163S 


sb( transmembrane epithelial antigen of 


104691 


U29690 


Hs^44 


Homo sapiens beta-1 adrenergic receptor 


416838 


D54745 


Hsi0247 


cholecystoldnln 



SeqIDNo 

SeqlDNoB1&B2 

SeqlDNoB3&64 

Seq(ONoB5&B6 

SeqIONoB7&B8 

8eqIONoB9&B10 

SeqiD(4oB11&B12 

SeqtDNoB13.B14.&615 

SeqIDNo 616 

SeqlDNoB17&618 

SeqiONoB19&B20 

SeqIONoB21&B22 

SeqlONo823&B24 

SeqlDNoB25^27 

SeqiDNoB28&B29 

SeqlDNoB30&B31 

SeqtOMo832 

SeqlDNoB33&B34 

SeqlDNo635-B40 

SeqIDNoB41&B42 

SeqlDNoB43 

SeqtDNoB44&B45 

SeqlDNoB46&B47 

8eqiDNoB48&B49 

SeqlDKoBS0^53 

SeqiDNoB54&B55 

SeqlONoBS6&BS7 

SeqlDNoB58&B59 

8eqiD^k>B60 

SeqlDNoB61&B62 

SeqlDNoB63&B64 

SeqlONoB65&B66 

SeqlONo867&B68 

SeqlDNo669&B70 

SeqiONo671&B72 

SeqlDNoB73&B74 

SeqroNoB75&676 

8eqtDNo677&B78 

SeqlDNoB79&680 

SeqlDNoB81&e82 

SeqlDNoB83&B84 



Table 72B 

Rfflf. Unkpie Eos piabeset identifier number 

CAT number Gene duster number 

Accession: Genbank aocesskm numbers 

Ploey CAT Numtier Aooesson 

131307 3138 1 NIWL000026)C72881 XTOBII M29932X70812S53291 AW015148AW581776AI276134 

131313 93372.1 RS6290H02411 C18327AA367688 AA367657 H89632C17954 AA568850AI75K83 AA699451 H04260 AI128118 AW193364 N94503 

AAD29995T40536AA368746AI189909BE0473&4AA747591 R22855Ai032539 AI151343AA148534 H63941 T49595 AA694405H74226 
A1200363 R79731 AA702947 AM00076 Ai088494 C17938 AAS99478 H02962 R77665 C17370 R65618 H7371 1 R58545 D79189 AW265710 
R77664 T62101 AW953745 AW953739 079107 AA029105 AW953738 AA456487 RB7000 AA156623 AA368336 H63662 

116301 52669J2 AW969706AA456256 AM91881 BE501639 062113 AVVg69710AI591236A)379869AVVg68997AA040053At807206AVV663gi7AA454^ 

AA489238 BE241958 AA743491 

103080 17092-1 AU077231 AA852219 M74092 X59798 M64349 NM_0017S8 AA226806 M73554 BE409154 AA160096 BE384352 AA160820 BE382880 

BE261734AA1 13821 BE407745 AA1 56380 BE390287BE390020AA1 00854 AA1 271 52 AW794066AW3871 01 AW367093 U47703AI347077 
VVDS266A1824103A149g061AA642944AI042556AA906539W60380AI571777AL135581AA112340^ 

AA126942A)023701 A1873252AA1S6319Alig0822 W60289A1274886R81309AA100801 AA227161 AI568929AA1 60603 AI074344Ai344S61 
AI150778AA85221BAA158286N20142AA622148AA884225AA576367AW182124T89175AI758455AA78057 

AI674613 AA642544 AW503909 AA1288S1 W39350 N40420 AA113a72 BE168116 AI620604 AI298125 BE075272 N40078 BE0751(» BE080779 
AI918938 BE168117 BE087369 AW995539 BE080949 BE080727 BB075271 BE075108 BE080955 BE089655 BEC81115 AW750304 HB6084 
A1146884BEO75154AVV992247A1186525AI752230AVV26314OVV03329N26O56AA94808OAAim^ 

AA148042 AI740837 BE082728 AA149570 W44495 6E089351 AA375im N26775 H27771 AA064705 BE091204 RB9337 N32676 N27141 
BE164704 H98049 W67603 AM25549 W31090 AAB07411 BE173280 BE000178T09020 W238S2 AA062709 BE167894 AA076515 R97329 
BE541980 N42088AA102307AA113772BE276181H20622VV44436VV57604VW6412AVmi1« 

BE396090 BE613371 BE61 2645 W46650 W95203 W92651 AI087288 R76299 AW604781 N55320 AI912334 AA403248 AW169156 H24970 

AW298822 AW080982 Ai073747 W24123 AA577S96 H21716 H27925 H26436 AI288304 AA148043AA2046766E047090 W48631 AA908347 

AA599485 Ai276505 AI953979 AA563710 H2S674 H51747 AA425389 AA516104 AK)95335 T77237 AA1 51696 T92084 AI689037 AI624162 

W49709AVV514883AA100676A1366087AA069474AA525859AVV771076AA029402AAg94114AI351505A 

AI346589 AA487700AI081104AA513344 AI377520A1284911 AJ311390AA622062 AI055890A!660B81 Ai366117 AA4O3090 AI272818 

AI073353 W46300AA062689A1755078Ar753397 AI633564 AI273471 AI339890AA6995MAA983722AI079968 AI752231 AA076431 AA113245 

AI168564AA916385AI0664d4A1123599AI921518VV94586AA535600AA054665AA705388AA064623AAS8250^ 

AW27S281 A)280632 T29597 W48728AVV9S4336VV38317VV94768AI084717 W46S67AI245845 AVV30K01 N722D1 AW$10663 AW079132 

AA207064 AI143740AW440672AA632154AI290286AI350704 AI271377AA025369AI864756T77451 H97348AA852165AI932951 N98526 

AA487488 R92970 AA934071 AI080448 AA0832S7 C05786 N99099 R42969 AA887065 AA662686 AA533833 AA662304 H51 748 BE539444 

A1382164 Ai814595 BE537043 AI168307 BE40893S AA4S3606 R89428 AA936527 AA936890 AW369618 AW264602 R16074 Ai474189 

AW372354A10943S8R37210AA948510AA226g09BE172527AI086652BE408324AVV2d2848Ai766962BE5407ra 
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AA2S7992 AA317029AA382097 017042 H4810OAW8381O7 AW838106AW838110AW838231 AW510241 AW838442 AA045061 AW384991 
Ban0988 AA1 31806 BE180577 AVm8269 AUJ39831 AI754380 a)6M1 AA1 31737 

AA379252 AWB38282 AA484027 AI423413 A16aS064 BE328307 A1241857 AWB38281 AW838284 AI6712S7 AIB14928 AA828387 A1476306 

AI27O180 AA622362 At076497 AI635919 A1444994 AI194028 AI8691S9 AI92B131 AA448B53 AI350143 AW079289 AW467807 AA480442 

AA522935BE180570AVV360087AW380109AI081015A1690818AI589485A1698510AA6420^^ 

C21398AA257993A13Q23S3Al669O18AWnO194AW75375OAI079164BE550338AAS59B51T161W 

AA045062AI95422SAA768SG9AA709308AW958363 

N35020 

AB025187 NM_014522 U79247 F13304 AA224524 F10902 R39431 

AW753865 F07644 F1 1280 F08355 F08138 U79299 F07459 F08750 F12419 F12842 F06488 F08585 F13403F05921 F05512 F11683 F05416 
F12841 F12810 F05418 F12850 R24551 F06276 H11803 H10237 H11542 H22894 RSSS63 H19351 H18722 F11237 F08507 H23123 R61595 
F07796 F06»1 F12289 F07107 1781 13 F08734 F13344 F05760 F11784 F08780 F05835 R20588 F07739 F11497 R17410 H17414 R20440 
T66090 R^292 T66238 T78766 T79908 R2S286 R60071 C14761 AW905192 AA331914 AW955291 H09000 R)5212 F08313 F05825 K06399 
H15135 H1 1378 R66424 R61541 AA235405 AI205O41 R59564 H23124 H1B638 R37359 AI571275 R417B0T66174 T66159 R54102 W07657 
H10794RS2337 R42890H14354 R40952T17391 H12088R56797 R51000R60011 R37617R39434 H29286T1$403 H09129R49114H15161 
H11729 R391G0 H10466 H24454 H11432 R54433 H08768 H24042 H09S11 H17261 H15292 R37849 AI991965 R52815 R428S0T17390T16438 
T17388 T16285 RA89B3 R507B2 R44247 R45729 R99558 R4069S R43575 R44420 T17403 R37754 R46637 R51039 R40513 T23785 F10433 
T16350 H10191 R43688 T23S43 H08S91 R44351 R37S75 R49508 F04379 F04056 F0S067 F02010 F09158 F10036 F08900 R)2559 F09914 
F09434 F05034 H17415 F02644 H10850 F10451 H14589 F10444 T16440 R44517 R46643 F03993 F10443 F02170 F09343 F10414 F04794 
F1 1 122 F04997 F03894 AW005932 F02767 F10941 F01777 R45260 F03386 R45261 T65009 F10990 R)8942 F01673 K23253 R)4533 F02082 
F01669 F03689 H09568 R407G3 K06400 R50958 N80570 AI082296 A1198226 T23763 AA81 4486 AW01 5823 AI886272 H06514 AA515989 
F10D49At984267At886279AA558516F02071 AI783843AI804884F01468AA730438AF035301 R38389 R41440RS9468R14437 T89036 
R43560 H10821 RS2850 RS6722 R14S59 T09299 H10467 R25829 F13S46 R56796 R51 151 082657 T32952 T66257 H0S569 F12430 T74326 
H08489 R34430 R13083 K23364 R21134 T77098 AW9S7157 H24150 H1 1403 R52338 F12830 T77173 T74281 T77663 T74738 H10649 H12087 
R20092T74735 R54530 R17$52 R59557 R54529 H17280 175171 H14694 774975 HI 5291 R18574 R54157 R13448 H1S221 R35120 R35031 
R21277R19632 R35820 R50899R13923R36771 R26485R33061 R20100 H24426 H09184H159S8R19253H15196H09510R36032R19691 
H14402 H1488S T17068 

AF252297 N27888 F13415 R34552 R14377AA323962AVV611514 AVV614097AW19S487 AtQe6O4OAI097653 AA012834AVV969885 AA98819^ 
R490&9 AWB73673 R37774 F08840AA484668 AW873G64AA170805AA1 70825 AA585275AA585280AA012833 
U94320NMJ006174U66275 

AA143045AVV2718(»AKM7093AA450183VV938g4AI8e5044Ai890068AVV264522AVVO^ 

A1457891 AW46B7B4AI5G7693 AW884460A1424892AA401298AW244098AI122938 N24824 AAS52932AA873164 AI308810 N26D98 H818S2 
N20798 AW263947 AI698267 AA449422 AW877967 
AI917116 AA633698 AI580389 Z39483 

AW292315 A109040BAW517041 AI928e95AI092060 AA716741 N98235T23829 

AW969769 AA484137 A1291362 N84753 AL133784 AA007153 6E046923 059990 C15166 059991 060194 AA648108 N76160 061027 060630 
U29690 AF272890 AA708166 AA010748 R23839 H59572 R23333 AI625747 AA011 176 AW66S324 H02674 HS9573 AA757931 H12502 R66851 
AA702310 AA748431 AA749352 C01127 AI003679 T96352 

AW352357 AA852419 T80568 AA332510 AW149659 AA341609 AA809550 A1163690 AA73D875 T31719 AW965361 AA779709 T05461 
AA564806 AI142536 AI553857 053384 053747 054101 0551 12 AW071935 AI082447 AJ29091 1 AW168872 AA1 99743 A1091546 AI246702 
AI271662 AW1 17296 AI032138 AI741489 AI5387C5 AA233541 AI682685 AW138268 A1689641 AA599916 AI174443 AI432176 AI131 328 
AW136277AA852418AA243035 T03902C15651 052660 H08108AA844525A!42Q548AI282445Am8986 
BE2B2936 

A1671245 D81062 A1085879 AI1932S2 At870535 N23132 Hg7661 H18098 AASS2173 AA233924 AA503379 H19618 060732 F04762 H1 1343 
AW590283 

AA806164 AA678564 AW978234 AW023951 AA651859 A1699785 AA913686 AW974569 AI661287 

AW87S398 A1064861 NM.014138 AF090943 BE071887 BE547232 BE622599 Ai345700 AA284315 AA147700 AW376340 BE220170 BE348475 
AA025037 AA0572B4 AAD25038 T16183 AW130196 AW594516 AI989991 AI139091 AW675747 AA83043S AI924977 AI670778 A1051887 
AA912S74 AW130213 AI82B335 AI570587 AI33Z760 A)3131 15 AA804653 AA477920 AA147645 AI916912 AW768839 AA715098 AA292723 
AA732421T33122AI631560A1698688AI18S386 R43829AA421638AA961400(»0503AA604025AA554^ 

AA708205H15963 R20851 L13829AI934608A1^63H92693At267243N66614H28g83R42031 H08198AI468628 H92779L13828 T34896 
K09748 R61554 AA350215 R40551 

AI568170 AA731212 AW294848 AW486878 AAS68908 AI2B1804 AI923633 AA347885 AI718854 AA714940 
NM 002975 AF020044 AB009244 BE620464 AA5416B3 AW000991 AW089850AVV664612AI63» 

AI49g682 Ai570905 AI376008 A1871 364 AI074230 At38371 9 AA905016 AA977446 AA844732 T78922 AW244007 AI924747 A1968230 AI824575 
A]699224AI569490AW379856AW292314A!016874AW292037 

AL134708AW206610AVV665919AW594227AI917093AI498361AA635141AA453786 054178 053224 Al^ 
AA4S3903AI278054A)334339AI380498AW23S939 D60373 D80993 D81246 
AAS28339AI017218AA416911 

BE047680AF177395NM.014421 AB033208AB035181 W55979A)093106AI663392AA715452AA300480AW022313AA648103 AA720544 
061856 AA481794 AA7371 12 AW3165S1 AA714937 AW974378 AI370632 Ai184289 A1333g62 AA714945 

BE388665 AA740814 At394542 AA484451 AW070675AAB77654AI360981 AI475799BE349560AL037351 N29437AI246453AW083444 
AA844441 A)150871 AI440477 Aira9061 AA682422AA660053 AA740212AI65872BAA490S89AW243892A1249450AI304448 AA534421 
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AA278541 R76523 AA565187 K85083 F24755 R7772g AA135137 BE250395 R76239 AW79482S AW794404 K61957 AA308197 AW996031 
AA345524 N5Z726 AA354360 AA179785 N57375 R16447 AA160845 W21481 

049493 NM_004962 049492 AA948479 AA993843 AA017558 AA992567 AA912897 Alie5383 AA016254 N93253 AA297544 T19187 R53287 

AA069337 AA297489 N62906 AW023929 AA018253 AA017615 W25110 R52085 R52633 W16549 H88916 H89134 AA069338 

NIyL0QO399 AA018140 AF139483 J04076 AW390694 AW361 756 AL07S830 A) 190587 AA731059 AA742743 A1635915 AA87B209 AI262612 

AI264201 AA446300 AA71S016 AA446027 AI274101 AAS93330 AAS93338AVV0153B8AI681724A)347984AL04527^ 

AW014396AW769445At588889AW338296A1801548AI367702AI888410 

C14167 AA496211 080456 AA484051 080455 AAS28446 

AW882258L07615 

AA356764 NM-004867 AF038953 W16828 R50284 AI929589 R01594 AA010474 W37917 N64212 N78175 R5751 1 AA329322 W77834 
AA169869 AA285240 AA356S70T93146 AA325779AA3570S4AA328151 AA306939AA306801 N31 260 AA306202AA31 2530 BE242703 
AA354371 N422S9AL021766 H50g85H3803BAI655965AIS66732AI9S39S2AWg72835AI077300AI948482A!760^ 
N32815AA768073 AW338161 AVif001742 AI810896AI015668N58011 AA677S84 All 63750 AW105S14W37875AW301 679 AA775257 
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410102 117657J 
10 410268 11B97J 



414761 14860J 



15 



20 
25 

30 447761 7355.1 



416836 16233.1 
420462 19379.1 



426163 287965.1 
439221 46993.1 



M730357 AI128690 AA630149 AW241515 AI147003 AI183499 N90923 AI968582 AA3S4145 A1147297 BE550502 AI085777 AI709087 
AA7r3947 AM69192 AI199289 AI127085 H89232 N30719 Aro28165 AW029469 AITWSU AA780834 AA778057 HB9125 A1554762 T93061 
N58822 AI360625 AA719982 AI248242 AA233925 AW262512 AW438666 AA010378 AM84010 AW438790 N22875 AA48371B All 85407 
AA736720 AA664587 AA648572 AA503406 AA5081 23 AA503140 H46638 N7581 2 AA730938 060548 AA483039 AASOHOS AA480448 
AI768654AA558955AA730803AA33D312AVV021127BE172raiAW961052AA339070AA648172BE172892AAU 
AW116733 AA4B4130 AA730783 AA730510 08^ 062828 AI469294 AI801343 NS3447 AA740226 A(831983 AAS16146 079799 AI758212 
AA484040 K3e039 AI038772 At039410 AAS2B387 AA730576 AA527231 AA74811 1 062946 U38439 AA721708 H22130 AA559229 H51620 
AA283355 AA730970R01595 N72450 AA469123 AA579389 AA480477 AA283305 AA347822 AA328013 

AW2485C8 AI278454 AI369343 AW291235 AI343946 AI347189 AA970152 AI279668 AI971894 AA664359 AI191039 AA084598 AA081 641 
AA316181 BE439545AC005053AF186249AW386101 AC004969AA730199AA032221 A1686139 AI167942AA809228AI184070AI394674 
AW969977AA032279AW079284AA513174AA888312AI453179AA483363AAS28432AA579511 

AU077228 U81 145 AL044256 AA206185 AL04497$ Nli/L0044S6 X95653 AW890905 AA3328Q6 AW965453 BE299226 U52985 BE275009 
AA428252 AA769212 AA312778 AA8109ai AA461315 AI989286 AI953658 AW296248 AI291422 AA704747 244716 BE541235 AW608806 
AA5885B3 AW403788 AW408386 AA053521 T87571 AA581 163 BE183241 T97908 R89149 A1808196 BE618521 A1970278 AI652680 AW130143 
Ai970272 AW197^ BES50449 A1628403 AAD10143 AW005t07 AA251815 AA805836 N53459 AA055309 AA643678 AI345616 AI345618 
AI635728 AI871 286 A1018437 At262S14 H75287 At033155 AI2D338a AA205802 AI475265 AW594186 AI696D59 A1333312 AW264710 AA6767B4 
AA864892AA173156A!292068 T79837VV93995AA506427AA815135AI376342AA010144AI31M^ 

AI0g0913 N55098 AA101673 AA626665 AA251840 K68857 Al8g0053 H05930 AI474199AI436166 AA628810 N64452 AI205538 AA055310 
BE244495 At385179T99536 AI203088 Z40549 AW139856 AA599627 R91847 AA483681 AW088982 H51746 AA091201 H13030 AA459732 
T99430AI051764 R45991 

054745 AU077189 KM_000729 R71188 HB7563 R80701 AW956319 AA376289 RB2442 H46452 R22481 R64323 AW043690 UW3S4 AW005571 
H19813 AI745597 AI769202 AI205982 AW205841 R64324 AI201975 H46993 C18067 R71 189 N29729 AI799838 AI091209 HB7455 R82443 
AW970995AW072136N40B41 H13148 A1219052 N30191 T28007AI817428AA503602 

AroS0147 NM_ra7015 AB006000 AB005999 6E21 81 14 A)703147 AI9B5891 BE222034 BE549774 BE327618 AI651459 BE549987 At657194 
BE222414 BE046984 AW470357 AW339338 C02525 AW590220 AI298082 AW296998 A1333135 AI867816 AA73051 4 6E348513 AA405500 
AW136179 AA327e34 BE327350 BE327366 

AW969726AA424028 A)256636AI271571 AA423988 A1266634AA7027e0 AA452037 Ai4246S6 AA742931 AA483361 

AA737106 N35765 N35779 AF066045 N30152 AW374028 AI478237 AA923562 AI382236 AAB46668 AI627389 A1371709 H98083 H9S983 

H2657& H95g84 AA745021 AA639180 H26575 

AF061573 NIUL002590 AA780138 AW135620 AI197884 AA483348 R05468 AW969724 H29216 
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Table 73 1 Sequences 

Seq ID NO: Bl DMA sequence 
Nucleic Acid Accession ft-. NM_053056.1 
^ Coding sequence: 210.. 1097 

I 11 21 31 41 51 

I 1 I I r I 

CACACGGACT ACAGGGGAGT TTTGTTGAAO TTGCAAAGTC CTGGAGCCTC CAGAGGGCTG 60 

^ TCGGOSCAGT AGCAGOGAGC AGCAGAGTCC QCA0GCTC03 QCXjAOGGOCA GAAGAGCX3CX3 120 

10 AGGGAGCGOG GGGCAGC310A AGCGAQAGCC QAQOSCGGAC CCAGCCAGGA CCCACAGCCC 180 

TCCCCAGCTG CCCAGGAAGA GCCCCAGCCA TGGAACACXyV GCTCCTOTGC TGCQAAGTGO 240 

AAACCATCOQ OOGOGCGTAC CCOGATGCCA ACCTCCTCAA 0GAC0GGGT6 CTGOGGGCCA 300 

TGCTGAAGGC OGAGQAQACC TGCGCGCCCT CGGTGTCCTA CTTCAAATGT GTGCAQAAGO 360 

AGGTCCTGCC GTCCATGOGG AA6AT0GT0G CCACCTQGAT GCTGGAQGTC TGCGAGGAAC 420 

15 AGAAOTGCGA GGAGQAGOTC TTCCCGCTGQ CCATGAACTA CCTGGACCGC TTCCTGTOGC 480 

TGGAGCCCGT GAAAAAGAGC OGCCTGCAGC TGCTGGGOGC CACTTOCATG TTOGTOGCCT 540 

CTAAGATGAA GGAGACCATC OCCCTGACOO CCGAQAAGCT GTGGATCTAC ACOOACAACT 600 

CCATCaSGCC CQAGGAGCTG CTGCAAATGG AGCTGCTCCT GGTQAACAAG CTCAAGTGGA 660 

ACCTG8C06C AATGACCCOG CAOQATTTCA TTGAACACTT CCTCTCCAAA ATGCCAGAGG 720 

20 CGGAGGAOAA CAAACAQATC ATCCGCAAAC AOGCGCAGAC CTTCGTTGCC CTCTGTGCCA 780 

CAGATGTGAA GTTCATTTCC AATCCGCCCT CCATGOTOGC AGCGQGQAGC GTGQTGGC06 840 

CAGTGCAAGQ CCTOAACCTG AGGAQCCCCA ACAACTTCXTT GTCCTACTAC CGCCTCACAC 900 

GCTTCCTCTC GAOAGTGATC AAGTGTGACC CAGACTGCCT COKMCCTQC CAQQAOCAGA 960 

TOGAAGCCCT QCTGGAGTCA AGCCTGCX3CC AGGCCCftGC3V QAACATGGAC CCCAAGGCCG 1020 

25 CCGAGGAGGA GGAAGAGGAO GAGGAGGAGG TGGACCTGGC TTGCACACCC ACOGACOTGC 1080 

GGGACGTGGA CATCTGAGGG CGCC3VGGCAO GCGGGCX3CCA CCGCCACCOQ CAGCX3AGGQC 1140 

OOAOCCGGCC OCAGGTGCTC CCCTGACAQT CCCTOCTCTC OGQAGCATTT TGATACCAGA 1200 

AGGGAAAGCT TCATTCTCCT TGTTGTTOGT TGTTTTTTCC TTTGCTCTTT CCCCCTTCCA 1260 

TCTCTQACTT AAOCAAAAGA AAAAGATTAC CCAAAAACTG TCTTTAAAAQ AGAGAGAGAG 1320 

30 AAAAAAAAAA TAGTATTTGC ATAACCCTGA GCGGTGGGGO AOGAGGGTTG TGCTAC AGAT 1380 

6ATAGAQQAT TTTATACCCX: AATAATCAAC TCGTTTTTAT ATTAATGTAC TTGTTTCTCT 1440 

GTTGTAAGAA TAGGCATTAA CACAAAGGA6 GCGTCTCGGG AQAGOATTAG GTTCCATCCT 1500 

TTAOOTcrrrr aaaaaaaagc ataaaaacat tttaaaaaca tagaaaaatt cagcaaacca iseo 

TTTTT A AAGT AOAAGAGGGT TTTAGOTAGA AAAACATATT CTTGTGCTTT TCCTGATAAA 1620 

35 GCACAGCTGT AGTGGGGTTC TAGGCATCTC TGTACTTTGC TTQCTCATAT GCATGTAGTC 1680 

ACTTTATAAG TCATTGTATG TTATTATATT COQTAGGTAG ATGTGTAACC rCTTCACCTT 1740 

ATTCATGGCT GAAGTCACCT CTTGGTTACA OTAGGGTAGC OTGGCOGTGT GCATGTCCTT 1800 

T6CGCCTQTQ AOCACCACOC CAACAAACCA TCCAGTGACA AACCATCCAG TGGAGGTTTG 1860 

. _ TOGGGCACCA GCCAGOCSTAC CAGGGTCGGG AAAOOCCAOC TOTCCCACTC CTACGATAOG 1920 

40 CTACTATAAA GAGAAQACX5A AATAGTOACA TAATATATTC TATTTTTATA CTCTTCCTAT 1980 

TTTTGTAGTG ACCTGTTTAT GAGATGCTGG TTTTCTACCC AACGGCCCTG CAGCCAGCTC 2040 

ACX3TCCAGGT TCAACCCACA GCTACTTGGT TrOTGTTCTT CTTCATATTC TAAAACCATT 2100 

CCATTTCCAA GCACTTTCAG TGCAATAGGT GtAGGAAATA G06CTGTTTT TGTTGTGTGT 2160 

GCAGGGAGGG CAGTTTTCTA ATGGAATQOT TTGGGAATAT CCATOTACTT GTTTGCAAGC 2220 

45 AGGACTTTGA GGCAAOTGTQ GGCCACTGTG GTGGCAGTGG AGGTGGGGTG TTTGGGAGGC 2280 

TGCGTGCCAG TCAAGAAQAA AAAGGTTTGC ATTCTCACAT TGCCAQQATG ATAAQTT CCT 2340 

TTCCTTTTCT TTAAAGAAGT TGAAOTTTAG GAATCCTTTG 6TGCCAACTG GTGTTTGAAA 2400 

GTAGGGACCT GAQAaOTTTA CCTAGAGAAC AGGTG G TTTT TAAOGOTTAT GTTAOATGTr 2460 

TCACAOCGOA AC G ' lTi ' TTA A ACACTAAAAT ATATAATTTA TAGTTAAGGC TAAA AAOTA T 2520 

50 ATTTATTGC3V QA6GAT0TTC ATAAGGCX3\G TATGATTTAT AAATGCAATC TCCCCTTGAT 2580 

TTAAACACAC AGATACAC3VC ACACACACAC ACACACACAC AAACCTTCTG CCTTTGATGT 2640 

TAGROATTTA ATACAGTTTA TTTTTAAAGA TAGATOCTTT TATAGGTGAG AAAAAAACAA 2700 

TCTGGAAGAA AAAAACCACA CAAAGACATT GATTCAOOCT GTTTGGOQTT TCCCAGA6TC 2760 

ATCTQATTQa ACAGOCATGO OTGCAAGOAA AATTAGGQTA CTCAACCTAA GTTOGGTTCC 2820 

55 GATGAATTCT TATCXXTCTGC CCCITCCTTT AAAAAACTTA GTQACAAAAT AGACAATTTQ 2880 

CACATCTTGO CTATGTAATT CTTGTAATTT TTATTTAGGA AGTGTTGAAG GGAGGTGGCA 2940 

AGAGTGTG6A GGCTGAGGTG TGAGGGAGGA CAGGOGGGAG GAGGTQTGAO GAGGAGGCTC 3000 

00GAGGG6AA GGGG0G8TGC CCACACCQ69 GACAGGGCGC AOCTCCATTT TCTTA TTGQS 3060 

CTGCTACXar TGACTTCCAO GCAGGGTTTG GAAATATTCA CATCGCTTCT GTSTATCTCT 3120 

60 TTCACATTQT TTGCTGCTAT TGGAGGATCA GTTTTTTGTT TTACAATGTC ATATA CTGCC 3180 

ATGTACTAGT TTTAGTTTTC TCTTAGAACA TTGTATTACA GATGCXTTTT TTGTAGTTTT 3240 

riT ' i ' lT ' im ' ATGTGATCAA TTTTGACTTA ATGTGATTAC TGCTCTATTC CAAAAAGOTT 3300 

GCTGTTTCAC AATAOCTCAX 6CTTCACTTA G0CATOGTG6 ACCCAG0GG6 CAGGTTCTGC 3360 

CT6CTTTGGC GGGCAGACAC G06G60QG6A TCCXACAC3U3 GCTGOCGGGG 6C066CCC00 3420 

65 AGGCOG O GTO OGTGAGAACC GOOCCOGTGT CXrCCAOAGAC CAGGCTGTGT CCCTCTTCTC 3480 

TTCCCTGCGC CTGTQATGCT GGGCACTTCA TCTGATOGGG GGCGTAOCAT CATAGTAGTT 3540 

TTTACAGCTQ TOTTATTCTT TGCGTGTAGC TATQQAAGTT GCATAATTAT TATTATTATT 3600 

ATTATAACAA OTGTOTCTTA OSTGOCACCA CSGGCaTTGTA CCT GTAOG AC TCTCAT TOGQ 3660 

GATGATTOQA ATAGCTTCTG GAATTTGTTC AASTTTTOQa TATOTTTAAT CTGTTATOTA 3720 

70 CTAOTGTTCT GTTTGTTATT GTTTTOTTAA TTACACCATA ATGCTAATTT AAAGAQACTC 3780 

CAAATCTCAA TQAAGCCAGC TCACAGTGCT GTGTGCCCCQ GTCACCTAGC AAGCTGCCGA 3840 

ACCAAAAGAA TTTGCACCCC GCTGCXSGGCX: CACX5TGGTT0 GGGCCTTGCX: CTGGCAGGGT 3900 

CATCCTGTOC TCGGAGGCCA TCTOSaGCAjC AGGCGCACCC 06CCCCACCC C TOCAQA ACA 3960 

CGGCTCAOGC TTACCTCMC CATCCTGGCT OOOGOGTCTa TCTGAACCSiC GCGGGGGCCT 4020 

75 TGAGGGAOOC TTTGTCTGTC GTGATGGGGC AAGGGCACAA GTCCTOOATG TTGTGTOTAT 4080 

CGAGAGGCCA AAGGCTGGTG GCAAGTGCAC GGGGCACAGC GGAGTCTQTC CTGTOAOGOJ 4140 
CAAGTCTOAG GGTCTGGGOO OCXWGOGGCT GOOTCTQTGC ATTTCTGGTT GCACCX30GGC 4200 
GCTTCCCA6C ACCAACATGT AACCGGCATG T1TGCA6CA6 AAGACAAAAA 6ACAAACAT0 4260 
AAAOTCTAGA AATAAAACTG GTAAAACCCC AAAAAAAAAA AAAAAA 

oO 

Seq ID NO: B2 Protein sequence 
Protein Accession NP_444284.1 



I 11 21 31 41 SI 
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MEHQLUXBV ETIRRAYPZ>A NLLNDRVLRA MLKABBTCAP SVSYPKCVQK BVLPSMRKIV 60 

ATWILBVCBB QKCBBBVFPIi AMHVU)RFLS X£PVKKSRLQ LLGATCMFVA SXMKBTIPLT 120 

- AEXLCIYTDN SIRPEELLQM BIiLLVNKLKN NIiAAMTFHDF IBHFLSKMPB AEEHKQURK 180 

5 HAQTFVALCA TDVKFISNPP SMVAAGSWA AVQGUnjRSP MNFItSVYRLT RFLSRVIKOD 240 
PDCLRACQBQ IEALLB8SLR QAQQNMDPXA AEBBBEBBBB VDIACTPTDV RUVDI 



Seq ID NO: B3 DNA sequence 
. Nucleic Acid Accession ft> XH_04416€ 
10 Coding sequences 1..1576 

1 11 21 31 41 51 

I I I I I I 

crmxfmt; qgcatgccta gtctaotggt atctggaata aigoaaaqaa atgggqqctt 6o 

IS TGGA6AACTA GGAT6TTT0G GGGGAAG06C TAAGQACX:aA G60CTGCT06 AAQAOGA GOS 120 

CGCCCTTCAG CTGGCTCTCO ATCAACTCTO CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 180 

CAGGGCX3G0C GAGGACGGGO GA60TGGGGQ GGGOGQCGCX: CCCGCGCAGC CGACAQCCCC 240 

CCCGCAGCCG GCGCCGCOGC 06CCQCCOOC G60GCCXXXX3 GCCGCCCOSA CGACGGCCCC 300 

COCAOOGCAG AOGCCCCAGC CCCGCAOGBC CCCCAAAGGG G06A60GA06 GCAAGCTCZG 360 

20 CGCTCTCTAC AAAGAGGCOO AGCTGOGCCT OAAiGGGCAGC AGCAACAOCA CGGAGTGTGT 420 

TCCOGTGCCC ACCTCXX3A0C A0GTQGCCC3A GATOOTGGGC AGQCAAGGCT GCAAOATTAA 480 

GGCCTTQAGO 6CCAA6ACCA ACACCTACAT CAAOACACCQ 0TGAGGGGCX3 AGQAACCAGT 540 

GTTCATGGTG ACAGGGCOAC GGGAGGACOT GGCCACAGCC 0QGCX5GGAAA T CATC TCAGC 600 

AGCGOAGCAC TTCTCCATGA TCCGT60CTC OOOCAACAAO TCAOGOGOOQ CCTTTGGTGT 660 

25 GGCTCCTGCT CTGCCOGGCC AGGTQACCAT CCGTGTGOGO OTGCCCTACC G06TG6TGGG 720 

QCTGGTGGTQ GGCCXX^AAG GGGCAACCAT CAAGOGCATC CAOCAGCAAA CCAACACATA 780 

CATTATCACA CCAAGCCGTQ ACCGCX3ACCC CGTGTTCQAG ATCACXK3GTG CCCOIGGCAA 840 

CaTGOAaCOT GOGGGCGAGG AGATCGAOAC GCACATOGCG GTGOGCaiCXG GCAAGATCCT 900 

CGAGTACAAC AATGAAAACG ACTTCCXGOC GGOOAGCCCC GAOOCAGCAA TOGATACOOO 960 

30 CTACTCCGAC GCCTGGCGGG TGCACCAQCC 0G0CT6CAAG CCCCTCTOCA CCTTC0G6CA 1020 

GAACAGCCTG GGCTGCATCG GCGAGTGOGG AOTGQACTCT GGCTTT6AGG CCCCAOGCCT 1080 

GGGTGAGCAG GGCGGGGACT TTGGCTACXM CGGGTACCTC TTTCCGGGCT ATGGOGTOOQ 1140 

CAAQCAQGAT GTGTACTACG GCGTGGOOGA GACTAGCCCC CCXKTTGTGGG CGGGCCAGGA 1200 

GAACGCCAOG CCCACCTCCG TGCTCTTCTC CTCT GO CTOC TCCTCCTOCT CCTCTTCCX3C 1260 

35 CAAGQCCOGC OCTGGGCCCC CGGGCXSCaCA COQCTCCCCT GCCACTTCOG CGGGACCOSA 1320 

GCTGGCCGGA CTCCOGAGGC GCCCCCOGGG AGAOCOGCTC CAGGGCTTCT CTAAACTTGG 1380 

TQGGGQCGGC CTGCGGAGCC COSGOGGCGG GCXK3GATTGC ATGGTCTGCT TTGAGAGCOA 1440 

AOTQACTGCC GCCCTTGTGC CCTGCGGACA GAACCTGTTC TGCATGGAGT GTGCAGTACG 1500 

CATCTGCGAG AGGACGGACC CAGAOTGTCC OGTCTGCCAC ATCACAGCCA CQCAAGOC AT 1560 

40 CCX3AATATTC TCCTAAGCCC CGTGCCOCAT Q0CTCCXX3GG CCCACTCCAC T0G60CCACC 1620 

CTGGACCTGT TTTCCACTAA AGCCTTTTGG AAAGCGGTQA TTTGAGGGGC AAGGTGCTTA 1680 

GAQATACTCX3 CTCGCTGGGG AAGGGGGGAG GGAGGCAGTQ GTGGCTGGAG GGTGCX3CCAC 1740 

TTTCAGAGCC TCTGGTCACC CTGTCCTGGA AAGATTGGGA GGGGGCCAGA CTGAAAATTT IBOO 

TACTAGAOTT ACAACTCTOA TACCTCAAC3V CACCCTTAAA TCTG6AAGGA QCTAAGAQAA 1860 

45 ACTTTTGTTT TCCCAGAGGT GGCCACTAAG GCATTCTGAC GCCCTCTQCC CACCTCCCCC 1920 

GGTGTGTGTC ACTCCACCCC TTCTTCOQAG GAGGGGQTGG GTAAAAGGGA QAGGGAGAAT 1980 

TACCACCXGT ATCTA0A06T GCTCTTTGCA ATCCCTAAGC CCTCTGGTCC TQACCTCOGA 2040 

CCTCCCAGCT CTGTCTTGTT CCrTOTCTTT GTCTTTCTTC CCTTCCCCCT GCOCCT GCCC 2100 

CTACCAGCCC AGCTTTGGGG ACACCATCCT TCTGGGGAGA AGTAQQGGGA GQAATATTTG 2160 

50 GATGQTCCCT CCATTCCTCT TCAGGCATCT GGAGGCCCTC TCCXXX»CTC CTOCAAAGAA 2220 

ACATCTCAAA TTATTGATGG AATGTATCCC CATTCTCAGT GAAAATGTGA GGAGGGGACT 2280 

AATACTGGGG TAAAGGOTCA AACCCCCACC TTCATCACTA TGOOCATTAT ATTT AGGGAG 2340 

TAGTTCTTGG GCTGQATTTT CTGGTTGTGG AAGTGGGGGC GCCAGAGTAG TOTQTCXGCT 2400 

AXTTAAAGQA GC3W3GAAAGG GCGTGAGGCA GQAOGAGAGA CTGGTGGAGG G AAGAG CTGC 2460 

55 TOCTCCCATO CAGTGCCCGA CTCCCTGCAC CCCTCTCAAC CTGACXTTGAA CCTTTATTOA 2520 

ATOCTTATTA GCITGAATGC TTATTAGCTT GAATCCTCCA TGCAAATCAT GGAGTCTGTG 2580 

TCCCACCTGA TGTGOTTGAa QAOAAGCCAG GTCTTCAAAO AGGGGTCAGC CTGGGGCAAA 2640 

GCAGGACTGO GQGGAGGTGG GCAGOVGGGC CTATTCTGAO AATCACATAT TGTTACAGGC 2700 

CTTGCACCCC CTTTGCTGCT TCCCTCCTQC TCATTTGOGG CTGCCACCAG CTCTCCACCC 3760 

60 TCCTG6TT0C aCTGGCCQGa CXaAGAGAGG ATGQAGGGAT GGGAGTCCCA GGAQATCCTT 2820 

GTAAATAST6 6GGT6GGACT GTTCTGAGTO ATCACCOGAG CACTTAAAGC TCCAGAGTCC 2880 

aVTTCTTCCT GGATGOAGCA GGTGGAGGTG CAGAGGGGAT TTCCTCCTCT CCTTCCTCCT 2940 

OTOQAQAATT AACACCTCTC CACAGCCTTC CCCTCCAGAA CACCAGCCAG GGAGGGQTGO 3000 

GGAAGGAGOT CACAGCCAAG AAAACIGCCC TGTGAOQACT TCCXTTCCTTC COGOCTATOT 3060 

65 GAGCCATCCT GAGAT6TCT0 TACAATAGAA ACCAAACCAA ATGGGCACCC TCGGTTGCC6 3120 

GGG6GCA0GT GGGGAGGGGG GTGQGAAGAA GGGATGTCTG TCTGTOGTCC CCCTCGCXXTT 3180 

CTOCACTCTT TACCCACAAA GOCAGAAGAC TGTTACACTA GOGGGCTCAG CAAATTCAAT 3240 

CCCACCCTTA CCAATTGAGC CAAACCTAGA AACAAACACA AAACAOGAAT A6TGAGAGAC 3300 

AAAATAGAGG AGAQAAAGAG AGCATGAGAG GGAGCX3AGAC AGGCGACCAA CACAGAGGAG 3360 

70 A6AAAACAAA AATAGCAAAA AAAAAAAAAA AAA6CA0TTC TTTATAATTT AATATTCTAT 3420 
TTTAATAAAO GOQTTTATTA GCATATAAAT GTASCAAAGA ACCTGG6CTA ATATGAA 



75 



8eq ID NOt B4 Protein sequence 
Protein Accession «t XP_044166.2 



1 II 21 31 41 51 

111)11 

PCFAMPSLW SOIMERNQGF GBLGCFGGSA KDRGLLEDER ALQLALDQLC LI/3LGEPPAP 60 

- RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQFPTA PRQASDAXI^ 120 

80 ALYXEAELRL KGS8NT7BCV PVPTSBHVAB IVGRQGCKIK ALRAKTNTYI KTPVRGEEFV 180 

PMVT6RRBDV ATARRBllSA AEHPSMIRA8 RNKSGAAFGV APALPGQVTI RVRVPYRWO 240 

LWGPRGATZ KRIQQQTNTY IXTPSRDRDP VPBITGAPGM VERARBEIET HIAVRTGKIL 300 

BYNNENDFLA GSPDAAIDSR YSDAMRVHQP GCKPL6TFRQ NSLGCIGBOO VDSGFBAPRL 360 

GEQG(3)FQYG GYLFP6YGVQ KQDVYYGVAE TSPPIiWAGQB NATPTSVLP8 8ASSSSS88A 420 

798 



wo 03/042661 



PCT/US02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



KARAGPFGAH R8PATSA0PB LAOLPHRPPG BPLQ6PSKL0 GOaLRSPGGO RDCMVCPESB 
VTAM.VPOQH NLFCKBCAVR ICERTDPBCP VCXXTATQAI RZP8 

8eq ID NOi B5 DNA sequence 

Nucleic Acid Accession 9i KM_000909.1 

Coding sequencet 209.. 1363 ~ 



480 



1 
I 

CATTCCCACC 
GTTCA6TTCA 
AATAAQAATA 
ATAATCTATA 
TCATTCAOTC 
TGATTSTCAT 
CATTCTTSOT 
6AGAAATGTT 
CATGTOTCTC 
GATGTGTAAO 
GGTTCTCATT 
TAATAGACAT 
GCCTTTCCTG 
GTACAAAfSAC 
TACCACTCTC 
CTTCAAGATA 
TAAGTACAGG 
ATTTGCAGTC 
GA7CATTGCT 
ATCCACTTGT 
GCA6TTCTTC 
CATGTCCAOQ 
ATTTAAAAAA 
GGTCXXX3QAT 
TCTCCCAAGG 
TTACTGCTTT 
TCTTCTGGAA 
AATATAAAQA 
GCTQACTTCA 
TTAGA TTAG A 
TAQTGTGrTTA 
QAAGTCATTC 
TTTTTTCACC 
TTAAAAATGA 
ACAGGAATGA 
AGAGCATTTT 
TATATTTATT 
TTGTCAAGCT 
ACAAATATOQ 
CATCTTTCAA 
AGGQAAAAAT 
GrGTGACTTG 
GTTAATGTGC 
AAACTATATT 
TGTTTQATTT 
GGAQTCTCTT 



11 
I 

AGGGAAT6AA 
AGCTC3AAC3VG 
ACAACCAAAC 
CACTGTAATT 
CTGOCCTTQG 
GTCTCTGQAA 
ACCAACATCC 
CCCTTTACAT 
TTGAATTCTT 
GCTGTGGAAC 
GCTTATOTAa 
ATCTACXZAAO 
AAATAG6TGT 
CTCTTG6T6C 
TATATACGCC 
TCCAGTGAAA 
T6CTGGCTCC 
ACCT6CAACC 
GTCAA CXXXa 
TTCAACTTTT 
AT6CACACAQ 
ATCAACAACA 
6ACATCTGTT 
AATGGG6TT0 
TOTTGTAOTT 
ATA6TTTTGA 
CTTTTATACT 
GAAGTACCTG 
TTAGATTGTC 
CAATAGTAAC 
AQAAGTCGTT 
TTAAGGGAGG 
ATAAAAAGAC 
AOAGAGAAAO 
AQAOTAATTA 
TGAATTQATG 
TCTGOTCTAA 
TTTTCCATAC 
AGGAAGTAAC 
ACAOVAAAAC 
TGGOOTCTTA 
CTAATTTCAT 
TTAAAGAAGA 
TAAAAQOaOG 
TCAGTTTCAT 



21 
I 

TAATAAGCAG 
GAATTCAGAA 
TTQACCTGCT 
CAATCAAAAT 
TCTCAGAGAA 
CCATQATATT 
ACCTGGCCTT 
TGATTGTQAA 
TTGTCTACAC 
TTGT6CAAT6 
GAC3VTCAGCT 
GTATTGCTGT 
TAATGACTGA 
GCTTTGATCA 
TQCAOTATTT 
TAAAAAGGAG 
CCAAAAGAAT 
CTCTTACCAT 
ACAATCTGTT 
TATTTTATGG 
GT6ATTTCCG 
ATOTTTCCAA 
ATOATGATAA 
TAAAAAC3UV6 
AAATCATTTG 
GTCATAATTA 
CCAGACATCT 
GTACTTATTQ 
CCATCCAATA 
AACAGATTGG 
AOTATGCAAA 
TQAGGTTTCT 
CTTTCATTTC 
ATACT7CTCA 
CAOCTCCGCA 
ATTTAATAAA 
GTCAAGAGAT 
TATGTACTCG 
A6CAGTGCCT 
AGCAAG6TAC 
TGCAGATACT 
TAAATAATX3C 
0TATCTT3TA 
AGACATACTT 
GACAT TTTAT 
TTTTTCTCAT 



31 
I 

GAGOGAAAAA 
TAATTTTGGT 
TTGAAGAAAC 
GAATTCAACA 
GAATGCCCA6 
TACCTTAGCT 
GATCATAATC 
CCTTTCCTTC 
ATTAAT60AC 
TGTTTCAATC 
GATAATCAAC 
GATTTGGGTC 
TQAOCCGTTC 
ATTTCCATCG 
TGGTCCACTT 
AAACAACATG 
CAATATCATO 
CTTTAACACT 
ATTGCTGCTC 
GTTGCT6AAC 
GTCTOGGGAT 
AACTTCTTTG 
TGAAAAAATC 
CACAACCTGC 
AAAAtGACTA 
CATTTGGAAC 
TTGAAOTOCT 
GAATGAAATT 
CGQTCATTAO 
GCCATCCTTA 
A6CAGCATTC 
GTTTTTTGGT 
CTCCCGACTQ 
GCTGCAAATA 
ACTTCAAAAC 
GTAAATTAGT 
TTTCCATTTT 
AAAGACTTTC 
ATATAGTGAC 
AATOTTAAAG 
TCATATAGCC 
ACTGTAAAGA 
ATCATGATTO 
CAAT6TATTA 
TAAAA3GAAT 
CCCAXGACTT 



Seq ID NOt B6 Protein sequence 
Protein Accession tti NP 000900.1 



I 

MN9TLF8QVB 
LIIIILKQKE 
CVSITVSIFS 
DEPFQMVTLD 
RKNKMDKMRD 
LFLIiCHLTAM 
XTSLKQA8PV 



11 

1 

NHSVHSNPSE 
MRNVTNILIV 
LVLIAVBRRQ 
AYKDinrVCFD 
NKYRSSETKR 
ISTCVNPIFY 
AFKKINNNDD 



31 
1 

KNAQLLAFEM 
NLSFSDLLVA 
LIINP8GWRP 
QFPSDSHRLS 
INIMLL6ZW 
GFLNRNPgRD 
NEKI 



31 
I 

DDCHLPLAMI 
IMCIiPPTPW 
NNRHAYVGZA 
YTTLLLVLQY 
AFAVCIfLPLT 
liQFFFNFCDP 



CAAAATOTAA 
GACTCTCATA 
TOTTTTATAT 
ATGGACAAGA 
CTGCTCTCCA 
GT G T T T G ATT 
T6CCACCTCA 
AAAAACTTCX 
GATGATTATG 
AAGCAAGCAA 
TOAAACTACT 
AACATACTTT 
AGATTTTCTT 
AAAAGGTGTG 
TTTTGTGAAT 
TCTTTAAAGT 
ATTGGGTCAT 
CTTTATGATA 
AGGAGC06AA 
GGTTTTTGTT 
ATTGTCACTT 
TTATGGAGAA 
CATTT7G6TA 
ATT6CTGCAA 
TTTTACAGAC 
OQCTTACAAT 
TGATTTTAAC 
GAATATTCAC 
CATTTTAACT 
TTACTGAATA 
AGCCTCAGAA 
TACAGATAAA 
ATTGTTTTTG 
CCCTCOQATO 



41 

I 

FTLALAYGAV 
TLHDHHVF6B 
VIHVLAVASS 
FGPLCFIFIC 
IFNTVFDHNH 
RSBUDDYBTI 



51 
I 

AAAGAGGATT 
CAATATCGGO 
TTTGTCTAAA 
AGGTTGAAAA 
TTGAAAATGA 
GAGCT6TGAT 
AAAAGGAGAT 
TTGTTGCCAT 
TTGGT6AGGC 
TTTTCTCTCT 
GGAGACCAAA 
CTTCTTCTTT 
CACTTGATGC 
GGTTGTCTTA 
TTATTTGCTA 
TGAGAGACAA 
TTGTG6TAGC 
G6AATCATCA 
CAGCAATGAT 
AGAQAGACTT 
AAACAATA6C 
GCCCAGTCGC 
TATAGCCTAT 
(3ATTACCT6T 
GTCTTOCTTT 
GGCTTT6GGG 
TTATGCATAT 
ATTACGATGC 
CTTGATTAGA 
GGCATCATTT 
AGATAGTCTT 
TOmTTTTT 
AAATCAAAAT 
TTG6GCACCC 
CCT6ACAACA 
ATAGCTAAAT 
TGTTCAGTQT 
TTGTAGAAAC 
TTTCAATQTC 
TTTACCTAGC 
TGTATAAACT 
GTTGTGTCAT 
TCATTTGGAG 
GTATTACATO 
CTTTTTCTGA 
GT 



51 
I 

IILGVSGNLA 
AMCKLHPFVQ 
LPPHYQVMT 
YPRIYIRLKR 
QIZATQIBNL 
AMSTMHTCVS 



Seq ID HOi B7 DNA sequence 

Nucleic Acid Accession #i NM_003S90.2 

Coding sequence! 204.. 3416 " 



1 

I 

6CTTCTGTAA 
CTGGAGGCTC 
TTCCQAGAGC 
GGATGBCACT 
TGCCCTTGCA 
GATACAOCAC 
TGCATATGAA 
TQCTCOQGGT 
AOOGGCTGTG 



11 
I 

GAGAOGGAGA 
GGAGCTQCTA 
CTGAGACTGA 
TQA6GCTGGA 
GCTCTTCAGC 
CTTOGAGGAO 
AOTATCGGGT 
0CXKX3AAGGC 
TGGCCAGQCC 



31 

I 

GGCGCAGAGT 
CCCGCAQACT 
OGCCCGAOCT 
GGCATGAGTC 
CTCTGCTGG6 
QATOCCCCOG 
GACACAA6CT 
GACGGGCAGC 
COSCAOTOOO 



31 
I 

GAGGGOGGGT 
TCTCCGGCAC 
GGAAACCAGA 
CrOTQAGGOG 
TGCTCTCAGT 
GCAOGGTCAT 
TOC30CCTGAT 
TGAC0GTG6G 
TGCrGGCCTT 



41 

I 

COGOGCGTCC 
AGGGCTG6CA 
GAAGACTTCC 
TTO06GCAGC 
GGCCCAGAGC 
OQGGACCCTG 
GAAGCAATTC 
GGA0GC0G6C 
C3ATGTGGTC 



60 
120 
160 
340 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1660 
1920 
1960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



60 
120 
180 
240 
300 
360 



51 
I 

TCAGAOCCCG 
AAGAG06TGA 
TTAGCCTTTC 
COCTGCCTTT 
AAAACAGTCC 
GCOQAGGACX: 
AACAGCTCTC 
CTGGAC0Q09 
AOCTTCTOGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 



799 



wo 03/042661 



AGGAlGCASTT COO G CTGOTO CftCGTGGAGG TAOAGOTGAG GQACGTCAAC GACCA06CGC 600 

0G06CTTCCC CAOGGOCCAG ATCCOQGTAG AGGTGTCOGA GGGTSC6GCA 0TGGGCA06C 660 

GCATCOCCTT GGAGGTGCOG OTGGAO^AGG ACX3TGGG0QC CAAOGGGCTG CAGACCGTGC 720 

GCCTGGCCGA GCOGCACAGC OCCTTTOGOG TGGAGCTGCA GAOGCGAGCG OACGGOGCTC 700 

AGTQOGa\GA CCTGGT6CTG CTCCAGGAGC TGQACOGOQA GAGCCAGGCC GCCTACAGCC 840 
7GGAGCTGGT GGCCCAOQAC GGCQQOCGCC GGCGGOGCTC OQCXACGGCT GCCCTCAGCG 900 
TOOOOOTCCT GQATGOQAAT QACCACAGCC OGGCXriTCCC OCA0GGCX3CC GTGGCCGAAQ 960 

TGGAGCTGGC GGAA6ACX5CX5 CCCXnGGGCT CCCT6CTTCT CX3ACCTGGAC GCAGCCGACC 1020 

CXX3ACGAGGQ ACCTAACGGC GACGTGGTGT TCGCATTTGG CGCCOGCACC CCGCOGGAGG 1080 

O606CCGCCT CTTTOGGCTT GAOXGOGAT CAGGGGOCCT CACCCTG6CC GGGCCOGTGG 1140 

ACTAGGAGGQ TCA6GACACC TAOGAGCTGG A0OTG0GG6C GCAGGACOGC G6ACCCGGGC 1200 

CC0G06CTGC CACCTGCAA6 GTCAT08TGC GCATCGOAGA QGTCAATGAC AA06CACCCG 1260 

ACATOGCCAT CACCCCGCTG GCCQCCCCAO GCGCGCXX5GC AACCTCACCC TTCGCCGCTG 1320 

COGCOGCCGC CGCTGCACTC GGGGQAGOGG AC3GCTAGCrC OCCGGCQGGA GCCGGGACGC 1380 

CGGAGGCT6G TGCCACTTOG CTGGTGCOGG AGGGGG0G6C GOGOGAGAGC CTGGTGGCXX: 1440 

TGGTCAGCAC CTCGQACAGG GACT0GGG08 CCAA0G66CA AGTGCGCTGC GCCCTCTATG 1500 

GGCACGAGCA CTTCCX3GCTG CAGC O GGCCT AGG0G6GCA0 CTACCTGGTG GTGACGGCGG 1560 

CGTC6CTGGA CC6CGAA0GC ATC6C06A6T ACAACTTGAC GCTGGTGGCC QAGGATCGCG 1620 

GOGCGCCCCC GCTGCGCACA GTGCGGCCCT ACAOOOTGCG TGTGGGCGAC QAGAACGACA 1680 

AOGGGCCGCT CTTCACGCX3G CCGQTCTATG AGGTGTCGGT GCG0GA6AAC AACCCGCCAG 1740 

GOOCXTACCT 06CCACX5GT6 GCOGCC060G AOOGGGACCT GGQCOGCAAC Q6CCAGGTCA 1800 

CCTACCGGCT GCTG6A6GCC GAGGTGG6CC GCX3CCX3GGG6 06CCGTGTCC ACTTATGTCT 1860 

CGGTGGACCC AGCTACCGGA GCCATCTACG CGCTQOGCAG CTTCX3ACTAT GAGACX5CTGC 1920 

GCCAACTOGA CGTTOGCATC CAAGCTAGCG ACGGCGGCTC CCCTCAGCTT TCCAGCAGCG 1980 

CCCTAGTGCA AOTGCGCGTG CTGGACCAGA ACGACCATGC GCCAOTCCTQ GTGCACCCGG 2040 

CGCCA6CCAA TGGCTCCCTA GAACrrGGGGQ TGCCTG66C3G CACCGCAAAQ GACACGGTTG 2100 

T66CC0STGT GCAGGCCOGG GATGCAGAOG AG66A6GCAA GGGGGAOCTO GOQTTOQAGC 2160 

TGCAGCAGCA GGAGCOGCGC GAAGCCTTOQ CCAT0GGCXX3 CCX3CA0GGGG GAGATACTGC 2220 

TCACCGGOGA CCTCTC3GCAG GAGCCACCCG GTOGCGTGTT CAGOGCGCTC CTGGTCATAT 2280 

CCGACX3G0GG CCGTCCCCCG CTCACCACCA COQCAACTGT CAGCTTCGTG GTAACAGCAG 2340 

OGGGGGQGOG TGGGCGGGCT GCGCCTGCCA GTGCAGGAAG CGGGOftOOGT TCCCGCCCGC 2400 

eraOCTCTCG GCTCGGGGTG TCCGGGTOGG TGCTGCAATG GGACACGCCO CTGATCOTCA 2460 

TCATOGTGCT GGCCGGGAGC TGCACGCTGC TGCTOGCOGC CATCATOGCC ATCGCCACCA 2520 

CCTGCaiACOG CCGCAAGAAG GAGGTGOSCA AAGGGGGGGC CCTCCGGGAA GAGCGGCCCG 2580 

GGGCX3GGGGG CX3GOGGA6CC TOGGCTCCCG GCTCCXTOGGA GGAGGCCX3CC GGGGGAGCCG 2640 

GGCCCAOGCC CAACATGTTC GACGTGCTCSV CCTTCCCIGQ CACGQGCAAA G06CCCTTTG 2700 

GCA6CCC0QC G60QQA06GG CCT00GCCT6 OQGTC60CGC GGCGGAAOTG G0663CTCAG 2760 

AGGOCGOCAG CGCCACTGGG GAAAGC6CCT GTCACTTCGA GGGGCAOCAQ OGOCTOCGCG 2820 

GC606CAOGC CGAGCCCTAC GGTGCCTCCC OGGGTTTTGO AAAGGAGCCG GOGCCCCCTG 2880 

TGGCGGTGTQ GAAAGGACAC TCCTTCAACA CCATTTCTGG CAGAOAAGCA OAGAAGTTCA 2940 

GOGGCAAAOA CA60G6TAAA GGGGACAGTG ATTTCAAOGA CAlGOQATTCC QACATCAGCX3 3000 

GGQA06CTCT GAAAAAOOAT CTCATCAACX: ACAIGCAGAa TGGACTGTGG GOGTGCACCG 3060 

CTGAOTOTAA GATCCTGGGC CACTCTGACC GCTGCTGQAG CCCATCCTGC AGCGGGCCCA 3120 

ACX3CACATCC ATOGCCTCAC CCACCAOCCC AGATGTCAAC CTTCTGTAAG AGCACGTCAC 3180 

TGCCTCX3GQA TCCTCTGOGC AGGGACAATT ACTACCAGGC CCAGCTGCCC AAGACAGTGG 3240 

GGCTGCAOAG CX5TCTATQA6 AAAGTACTGC ACAGAGACTA TGACAGGACA GTCACTCTGC 3300 

TCrCCCCTCC CCGTCCAGGG AGGCTCCCAG ACCTGCAG6A QATTGGAGTA CCCCTCTACC 3360 
AGTCCCCTCC TGQCAGGTAC CTGTCCCCX3A AGAAGGGAGC CAATGAAAAT GTGTAATCCX: 3420 
ATGCTGCATG TCTTCACACA TATAC3\GGTC ACTCCGAGAA GCCCCTAAGT TATTGACOGO 3480 

TTTCAGTGTT GTATATATAA ATATCCAAGA TGTGCCTTAC AATOAAGTTG TTGGA AGCTA 3S40 
TTTGCAATCA CATTGGTACT GTTTG Q TATT TGCAAACAAA AATGTAGTTA ATGTAATTTT 3600 
TATGAAATGT GTGCAGTATT TAATTTTTCT TAT6CTATTG ACTTTGATTT CAATT6CG6C 3660 
TTGCCATTTT CTTAGTGTTT TTAACCTGTA CATT6TGTAA TGTAATGTTT GTATATAATG 3720 
AAATTTTGTT ATATTTTTAT ATAATAAAAG CTAAAGTGGA AOTTATTGCC AAAGGAACTG 3780 
TCTGTAAGAC AAAAAACAAA ACAT6TTGGA ATTACTTAAT TGAAATTTAT CTTTCACCTG 3840 
AAACAACCTA GTGTTTGAGA AATTTTGGCT TGCCAAGTAT AACTTQTATA TCTTGACTCT 3900 
GTGQTAGATT TCAAGTTCAA T8TTATTTAA TTACATTTGG TTTTCCGTAA ACCGTGTCAC 3960 
TTATAAOCAC AGTAATAAAA OATTTGTCAT GTGTTTGAAG AAAAAAAAAA AAAAAAAAAA 4020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID MOi B8 Protein sequence , 
Protein Accession 41 1 NP_002581.2 

1 11 21 31 41 51 

I I I I 1 I 

MSPVRRW6SP CZiFPliQLFSL CHVLSVAQSK TVRYSTFEBD AFGTVIGTXA SDIiHMKVSQD 60 

TSFRLHKQFN SSIiXiRVREGD GQLTVGDAGL DRERLCX3QAP QCVLAFDWS FSQEQFRIiVB 120 

VBVEVRDVND HAPRPPRAQI PVEVSBQAAV OTRIPLBVPV DBDVGAMGIX) TVRLAEPKSP 180 

FRVBLQTRAD GAQCADLVLL QELDRESQAA YSLELVAQDG GRPPRSATAA LSVRVLDAND 240 

HSPAFPQOAV ABVBLAEDAP VGSLliU3LDA ADPDBGPNGD WFAFQARTP PEARRLFRLO 300 

PRSGRLTIiAO FVDYBRQDTY ELDVRAQDBG PGPRAATCRV IVRJRDVNDM APDIAITPLA 360 

APGAPATSPF AAAAAAAALG GADASSPABA OTFBAGATSIj VPBQAAHESL VALVSTSDRD 420 

8GANGQVRCA LYGHEHPRLQ PAYAGSVIiW TAASLDRBRI AEYNLTLVAE DRGAPPLRTV 480 

RPYTVRVGDB HDNAPLPTRP VYEVSVRENN PPGAYLATVA ARDRDLGRNG QVTYRLLEAE 540 

VGRAGGAVST YVSVDPATGA ZYALRSFDYS TLRQIDVRIQ ASDGGSPQLS 8SALVQVRVL 600 

OQNDBAPVLV HPAFAMGSIiB VAVPGRTA2CD TWARVQARD ADB8ANGBLA FEliQQQEPRB 660 

AFAI6RRT6E ILLTGD1*SQB PPGRVPRALL VISDGGRPPL TTTATV8FW TAGGGRGPAA 720 

PASAGSPERS RFPGSRLGVS GSVLQWDTPIi IV2IVLAGSC TLLLAAIIAI ATTCNRRKKB 780 

VRKGGAIiREB RPGAAGGGAS APQSPBEAAR GAGPRPNMFD VLTPPGTGKA PFGSPAADAP 840 

PPAVAAAEVP QSB6GSATGB 8AC3IFEGQQR IiRGAHAEPYG ASFGFGKBPA PPVAVWKGHS 900 

FMTX86REAB KFS6XDSGK0 DSDFNDSDSD ISGDALKKUL INHMQSGLWA CTABCKIL6H 960 

SDRCHSPSC8 GPNAHPSPBP PAQMSTFCKS TSLPRDPLRR ONYYQAQLPK TVOI^QSVYEK 1020 
VUmOYDRTV TLLSPPRPOR LPDLQBIGVP LYQSPPGRYL SPKKQANBMV 
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Seq ID HOt B9 DNA sequence 

Nucleic Acid Accession «: AL121939.12 

Coding sequence! 165.. 1426 

1 11 21 31 41 51 

1 I I I I I 

A609C3A6TST CAGTGQC6A6 CTCOGGGTGC T6TGGCCCG0 CCrTGGCGOG GCGGCCTCCQ 60 

GCTCAGGCTG GCTGAGAGGC TCCCAGCTGC AOOGTCCCCG CCXGCCTCCT OGGQAOCTCT 130 

QATCTCAGCT GACAGTGCCC TCGGGGACCA AACAAGCCTG GCAGOACAAA ATTAOAAOAT 180 

CAAAATGGAA AATATGCT6C TTTGGTTGAT ATTTTTCACC CCTGGGTGGA CCCTCATTQA 240 

TQ6ATCTQAA ATGGAATGOQ ATTTTATQTO 6CACTTGAGA AAGGTACCCC GGATTOTCAQ 300 

TSAAAOGACT TTCCATCTCA CCAGCX2006C ATTTGAGGCA GATGCTAAGA TGATQGTAAA 360 

TACAGTGTGT GGCATCGAAT GCCAGAAAGA ACTCCCAACT CCCAGCXTTTT CTGAATTGGA 420 

GGATTATCTT TCCTATGAGA CTQTCTTTGA QAATGGCACC OQAACCTTAA CCAGGGTGAA 480 

AiGTTCAAGAT ' ril.b ' HCliU AGCOQACTCA AAATATCACC ACAAAGGGAO TATCTGTTAG 540 

GAQAAAGAGA CAGGTOTATG GCACG6ACAG CAGGTTCAGC ATCTTGGACA AAAGGTTCTT 600 

AACCRATTTC CCTTTCAGCA CAGCTOTGAA OCTTTCCACG GGCTGTAOTO GCATTCTCAT 660 

TTCCXXrrCRG CATGTTCTAA CTQCTGCCCA CTGTGTTCAT GATGGAAAGG ACTATQTCAA 720 

AGGGAGTAAA AAGCTAAGGG TAGGGTTGTT GAAGATOAGG AATAAAAGTG GAGGCAAGAA 780 

ACGTCGAQGT TCTAAGAQQA GCAGOAGAGA AGCTAGTGGT GGTQACCAAA QAGAGGGTAC 840 

CAOAOAGCAT CIGCAGGAGA GAGOSAAGGG TGGGAOAAOA AGAAAAAAAT CTGGCOGGGG 900 

TCAGAGGATT GCCGAAOGGA GGGCTTCCTT TCAGTOGACC 06GGTCAAGA ATACCCACAT 960 

TCCGAAGQGC TGGGCAOGAG GAGGCATGGQ GGAOGCTACC TTGGACTATG ACTATGCTCT 1020 

TCTGGAGCTG AAGCGTGCTC ACAAAAAGAA ATACATGGAA CTTGGAATCA CCCCAAOSAT 1080 

CAAGAAAATO CCTQGTGGAA TGATCCACTT CTCAGGATTT GATAAOQATA GGGCTGATCA 1140 

GTTOQTCTAT G9GTTTTGCA GTGTGTCYGA CQAATOCAAT GATCTCCTTT ACCAATAYTQ 1200 

OQATGCTGAO TOQGGCTCCA CCGGTTOQGO G0TCTATCT6 COICTGAAAG ATCCAGACAA 1260 

AAAGAATTGG AAGCGCAAAA TCATTGOGOT CTACTCAGGG CACCAGTGGG TGGATGTCCA 1320 

CGGGGTTCAQ AAGGACTACA ACGTTGCTGT TOGCATCACT CCCCTAAAAT ACXSCCCAOAT 1380 

TTGCCTCTGG ATTCACGGGA ACGATGCCAA TTQTGCTTAC GGCTAACAGA GACCTGAAAC 1440 

AGG6GG0TGT ATCATCTAAA TCACAOAGAA AACCAGCTCT GCTTAOOSTA OTGAOATCAC 1500 

TTCATAOOTT ATOCCTGGAC TTGAACTCTG TCRATAOCAT TTCWACATTT TTCAAAATCA 1560 

GGAGATTTTC GTCCATTTAA AAAATGTATA GGTGCAGATA TTGAAACTAQ GTGG GCACT T 1620 

CAATGCCAAG TATATACTCT TCTTTACATG GTGATGAGTT TCATTTQTAG AAAAATTTTG 1680 

TTGCCTTCTT AAAAATTAGA CACACTTTAA ACCTTCAAAC AGGTATTATA AATAACATGT 1740 

GACTCCTTAA TGOACTTATT CTCAGGOTGC TACTCTAA6A AGAATCTAAT AOGATGCTGG 1800 

TTGTGTATTA AATGTGAAAT YGCATAOATA AAGGTAGATQ GTAAAGCAAT TAGTATCAGA 1860 

ATAQAQACAG AAAGTTACAA CACAGTTTGT ACTACTCTGA GATGGAYCCA TTCAGCTCAT 1920 

GCCCTCAATG TTTATATTGT GTTATCTGTT GGGTCTGGGA CATTTAGTTT AGTTTTTTTG 1980 

AAQAATTACA AATCA6AAQA AAAAGCAAGC ATTATAAACA AAACTAATAA CTGTTTTACT 2040 

6CTTTAAOAA ATAACAATTA CAATGTGTAT TATTTAAAAA TGG6AQAAAT A0TTT6TTCT 2100 

AT6AAATAAA CCTAGTTTAO AAATA6QQAA GCTGAGACAT TTTAAGATCT CAAOTTTTTA 2160 

TTTAACTAAT ACTCAAAATA TGGACTTTTC ATGTATGCAT AGGGAAQACA CTTCACAAAT 2220 

TATGAATGAT CATQTGTTGA AAGCCACATT ATTTTATGCT ATACATTCTA TGTATGAGQT 2280 

GCTACATTTT TAGGACAAAG AATTCTGTAA TCTTTTTCAA GAAAGAGTCT TT TTCTC CTT 2340 

GAC3UUUmX AGCTTTTOTA TOAGGACTAT AflGGTOAATT CTCTOATTAQ TAATTTTAGA 2400 

TATGTCCTTT GCTAAAAATG AATAAAATTT ATGAATATGA CTTA AAAAA A AAARHOGACa 2460 

CGGCOSOQAA TTTAOTAGTA GTAGTOGACC GGGQAATTCC GQAOCGGTAC CTGCAGGGGT 2520 
ACGAGC 

Seq ID NO I BIO Protein sequence 
Protein Accession #: CAC35071.1 



1 
I 

NENMXiLHXiXP 
VC30IEOQKEL 
KRQVYOTDSR 
SKKLRVGLLK 
BIABGRPSPQ 
RMFGGMZHF8 
NWKRKIXAVY 



11 
I 

FTPGHTLIDG 
PTP8LSBLKD 
PSILDKRPLT 
MRKK6GGKKR 
WTRVKNTHZP 
6FDNDRADQL 
8GHQWVDVKG 



21 
I 



YLSYErVFEH 
NFPPSTAVKL 
RG5KRSRREA 
KGWARGGMGD 
VYRPCSVSDB 
VQKDYMVAVR 



31 
I 

UUCypRIVSE 
GTRTLTRVKV 
8TGCSGILI8 
SGG0QRE6TR 
ATLDYDYAIiL 
SKDXiLYQyCD 
ZTPUOAQIC 



41 

I 

RTPHLT8PAF 
QDLVLBPTQH 
PQHVLTAABC 
EHLRERAKGO 
ELKRAHKKKY 



LWIB6HDANC 



51 
1 

EADAKMMVNT 
ZTTKGV8VRR 

VHDGKDYVKG 
RRRKKSGRGQ 
KELGZSPTZK 
YUUiKDPDKK 
AYO 



60 
120 
180 
240 
300 
360 



Seq ID NO: 11 DHA sequence 

Nucleic Acid Accession #t NM_002035.1 

Coding sequence t 108 . • 1106 



1 
I 

AOGGGCCOGC 
COGGGGGCCC 
CTGCOGCCTT 
CCAAGCXCCT 
6GAAGTGCAT 
ATGAGGATAA 
AGOTGGTGCT 
TAAAACAAGC 
CAGTC3TCAGG 
ATTACCTGGG 
TGGGCAGGAT 
CCTACTCT6C 
AGCCATATAA 
COGAAGAAAA 
GCAAACCAOA 



11 

I 

C06CCQCGG6 
COGCCOQGCC 
CCTCGTGGCC 
OGCCCTGCCC 
T6CTAT0QAG 
GCTGCTGCAG 
TTGCATATCA 
ACAGGAQAAA 
AAAATTTQAA 
CAGCGTGTAC 
OGTOTTTGTO 
ATCCAA6TTT 
TGTCTACATC 
CAGAAC3UU^ 
ACAGGTGQCC 



21 

I 

TGATTCTCGC 
GCTCOGGGCC 
TT0GT6CTGC 
GGGOOGCATG 
TGCTATAAAC 
GCAAAGAAAO 
GTTGATGTAT 
CTG GGTC CAQ 
GATCTTOAAO 
CCCAGCCGGG 
TCCTCCCAGG 
GCCATAAGGG 
ACAQTTGCTT 
0CTTTGGA6A 
AAACAAATTO 



31 
I 

CTG6C0GCAQ 
CCXGGCCCCO 
TGCTGTACAT 
TGGTCGTTAC 
AAGGAGCTTT 
AAATTGAAAT 
CTCAA6ACTA 
TGGACATQCT 
TTAGTACCTT 
CCXTTGATCAC 
CAGGACAGTT 
OATTQGCAGA 
ACCCACCAGA 
CTOQACTTAT 
TTAAA6AT0C 



41 

I 

CCCAOCCCTO 
OGGAGGSATG 
GGTOTCTCOG 
AGGAGGTTCC 
TATAACTCTG 
GCACTCTATT 
TAACCMgTA 
GGTAAATTGT 
TGAAAGGTTA 
CACCATGAAG 
GGGATTATTC 
AGCTrroCAG 
CACAGACACA 
TTOVGAGACC 
CATACAAOGA 



51 
1 

06GGCCTTGC 
CTGCZGCTGQ 
CTCATCAGCC 
AGT6GCATCG 
OTTGCAOGAA 
AATGACAAAC 
QAQAATGTCA 
GCAGGAATGG 
ATGAOCATCA 
GAGCX3CGGGG 
GGTTTGACA6 
ATGGAGGT6A 
CCTGGCTTTO 
ACATCTGTGT 
AATTTCAACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
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O TTCCCn'G G CTCAGATGGG TACATGCTCT OGGOXnmC CTGTG6GAT0 GCTCCAGTAA 960 

CTTCTATTAC TGAGGGGCTC CAGCAGGTGO TCACCATGGG CXnTTTCOGC ACTATTGCTT 1020 

TOTTTTACCT TGGAAOTrTT OACAGCATAG TTOG TOGCTG CATCATGCAO AGAOAAAA AT 1080 

CTQAAAATGC AGACAAAACT GCCTAATCTT CTTACCCCTT GGAAGAAGAC wiT l'iXIAAA 1140 

5 TAATTTGAAC AGCTTGCTGC TAAATGGGAC CCAATTTTTQ GOCTATAOAC ACTTATGTAT 1200 

TGTTTTCGAA TACGTCAGAT TGGACCAQTG CTCTTCAGGA ATGTGGCTGC AAGCAAGGGO 1260 

CTAGAAGTTC ACCTCCTGAC AGTATTATTA ATACTATGCA AATATGGAAT AGGAGACCAT 1320 

TTOATTTTCT AGGCTTTOTG GTAGAGAGGT GAAGGTATOA GAATTAATAG CXTTOTGAA CA 1380 

AAGTAAAGAA CAGGATTCCA GAATGATCAT TAAATTTGTT TCTATTTATT CTTTTT TQCC 1440 

10 CCCCTAGAOA TTAAOTCCAO AAATGTACTT TCTGGCACAT AAAGAAATCT TGAGGACTTT ISOO 

6TTTAAACCT TCCATAAAAA AACAATTTTC GQTTTCTCGQ GTT CTCTC TC T CTGTC TCTC 1560 

TOTCTCTCTO TCTCTCTGTC TCTCTGTCTC TCTCTCTCTC TCTCTTTCTT TCTTTG TGTA 1620 

TTTTATTCAA GATGAGTTGG ACCCATTGCC AGTGAGTCTG AATGTCACTG ACA GCCC TGT 1680 

GTTGTGCTCA GGACTCACTC TGCTGCTGGT GGAAACTCAT GOCTTCTCTC TCl'Cr rrGAT 1740 

15 CCCATAAAGC TAOGAGGGGQ AOGGQAGAGG GCAGTGCAAT QGGAAGTAAA GAGATATTTT 1800 

CCAGTAGGAA AA6CAATGCT TTCTTOTCTT TAGACTCAAA TGCTTAGGGA AOGTTTCATT 1860 

TCTCATTCAT 6QGGAAAGGC AGCCTCCTTA AATGTTTTCT QAAGAQOQGT AAAATCTAGA 1920 

AOCTTAAGAA TTTACAGTTC CTTCAATAAC CATQATGACC TGAAOTTCAC CT ATCC CATT 1980 

TTAGCATCTA CTTGTTTTTC CCATCTCTTC CTTTCCAATT TTGCTTATAC TOCTGi aATA 2040 

20 TTTTTOTAAA AAAAAAAAAA AAGGAAAAAA AAGACCAGCT AAAATTTTOG ACTTGACTTT 2100 

TTAACTTAAC TCATGAATTA ATTAAAGCAR. ATGAAAAAAT TAAAAAGTGT GACTTTTTCT 2160 

06GAOCATAT ATQTAGCTTT TAGOAAAGGC TGATGATGGT ATAAAGTTTG CTCATTAAGA 2220 
AAAAAAGACA AGGCTGATTT TQAAGAGAOT TGCTTTTGAA ATAAAATGAT CA 

25 Seq ID NOt B12 Protein sequence 
Protein Accession #: NF_002026.1 

1 11 21 31 41 51 

I 1 1 i i ' 

30 MLLLAAAPLV AFVIiLLYMVS PLISPKPLAL PGAHWVTGG SSQIGKCIAI ECYKQGAPIT 60 

LVARNEDXLL QAKKEIEMHS XHDKOWLCI SVDVSQDYNQ VENVIKQAQB KLGPVDKLVN 120 

CRGMAVSGKP EDLBVSTFBR LMSIMYLOSV YPSRAVITTM KERRVGRIVP VSSQAGQIiGL 180 

FQPTAYSASK FAIRQIAEAI* QMBVKPYNVY ITVAYPPDTD TPGPABENRT KPLETRLISB 240 

TTSVCKPEQV AKQIVKDAIQ GNPNSSLGSD OYMLSALTGO MAPVT8ITEG LQQVVTMQLF 300 
35 RTIALFYLGS FDSIVRRCMM QRBXSENAOK TA 

Seq ZD NOt B13 DNA sequence 

Nucleic Acid Accession 8: CAT Cluster 

40 1 11 21 31 41 51 

I 1 I I I I 

CTTGGGATGC ATTATATATT OCATTATATT TGCCGGTAAA ACTCGGTTAA TTTTTAAAAT 60 

COGCAAAATA TTGGTGCCTT TCCCGAAATT TGGTCCCGGG CTCCCTTATA GGATAATTGG 120 

TTTGGATTGG TTAAGTCCAA TTATTAAATG CTGOGGTTTC AAATTCCCAG CTGOAAGGAC 180 

45 CACCCATTTA AAAACTTCAO AAGGCAGQAT CCTOCTCAAT T TATAA GGCT TTGGAAAATT 240 

CCAGGCATTG GTTTGACATA TTTCGAOAOC TCAAATCIGC A(3TQTTCCAC A TGCAC ATAC 300 

AAGATOCAQA 6TCTCAT0TT AAAATCACTT ACATACCCAQ AAAOACCACC ACTTTGCAGG 360 

TATAATATTG TACTTAAAGA CAGAGAGGAA GTOTTTCTTA ATCCAAACAC ATGTA CACCA 420 

AAGAACACAT AAQATOCCTT CTTCCATCAA ATGCACTTGC TTGTGAATTA ATGGACTTGT 480 

50 AAATGAAACA ATGCAATCAG TCTTTTATAA TGCACTGTTC AATTTGABAT TCAAGTATTT 540 

CTATTTCTTQ GAAAAAATTT TAAOAATCAA AAATAAAGAA AATAAAAAGT GCATACAOTT 600 
AAACATTCCA AAAAAAAAAA AA 



55 Seq ID NOt B14 DNA sequence 

Nucleic Acid Accession tt: XMJ)86767.1 
Coding sequence: 276.. 611 

1 11 21 31 41 51 

60 I 1 1 I I I 

CTTOTTCATQ OCATCTTTAG AAACAAACTG CAATTTTATT TCATTTCCTT GTCGTTCATA 60 

OVAAGATTAC AAGACTAGCT TATGTGTOOA CTOTGACTOT TTTAAGGAAA AATCATOTCC 120 

TOGGCTGGGT TATCAAGCCA AGCTATTTAA AGGTGTTTTA AAAGAAAOGA TGGAAGGAAG 180 

ACCTCTTAGG ACGACTGTGT TTTTGGATAC AAGTGGTACA TATCCATTCT GTAGCCTATT 240 

65 ATTTTOTTCT CAGTATAATT GTTCCAGATA AAACTATGAT Q6ATG0CT0G TTTTCATTTA 300 

AATTATTAAA TCAOCTTGGA ATGATTGAAO AOCCAAGGCT TTATGAAAAG AACAAACCAT 360 

TTTATAAACT TCAAGAAOTC AAOATTCTTG CTCAATTTTA TAATQACTTT OTAAATATTT 420 

CAAGCATTGG TTTQACATAT TTCCAGAGCT CAAATCTGCA GTGTTCCACA TGCACAT ACA 480 

AQATCCAGAG ACTCATGTTA AAATCACTTA CATACCCAGA AAGAOCACCA CTTT6CAG0T 540 

70 ATAATATTGT ACTTAAA6AC AGA6AGGAAG TQTTTCTTAA TCCAAACACA TGTA CACCAA 600 

AGAACACATA AGAT6GCTTC TTCCATCAAA TBCACTTGCT TGTGAATTAA TGGACTTGTA 660 

AATOAAACAA TGCAATOVGT CTTTTATAAT GCACTGTTCA ATTTQAOATT CAAOTATTTC 720 

TATTTCTTGQ AAAAAATTTT AAGAATCAAA AATAAAOAAA ATAAAAAATO CATACAGTTA 780 
AACATTCC 



75 



8eq ID KOt BIS Protein sequence 
Protein Accession «: XP_086767.1 



1 11 21 31 41 51 

80 1 1 I 1 t I 

KNDGSFSFKL LNQL6MIEBP RLYSKZIKPFy lOiQBVKZLAQ FYNDFVNISS lOLTYFQSSH 60 

lOCSTCryia orlmlxslty pbrpplcryh ivlkdrbbvp lnfntctprk t 
seq ID NO I B16 DMA sequence 

802 



wo 03/042661 



Nucleic Acid Accession #t CAT cluster 

1 11 21 31 41 51 

I 1 I I I i 

I ' n ' i ' iiTri ' T ' rrrrmTrf ttaaatgcaa aggagcaaca aaatttatto actgaattaa 6o 

ACACAACAGT AAAATGGCAO TGTTGTAATT TCATTTTCAQ ATGTTTQAAT GGAACAAQAA 120 

aagtgctatt agcccaagct tcttacattc attaaaagag tgactatcaa aaacagcaac I bo 

ATOCACAATG QTACATATGC ACAAAATGGA ATTATATCAA CAAAT ATAC A AAATACCCAA 240 

AATAAAATAT TTACAGGTTT AAAAATATAA ACATTGATTC CTCTATCCCA TTAAACCATT 300 

GGAGTGGAGA AA6GAG6AAA GACCCTATTO CTATTTAQAA TCCTTTTTAA AACAAGTTTT 360 

TAAAACATAG AATTAGTTCT A6GAGACAAT TTTrOATGTT TTTCAGGGGT TTAACATTCT 420 

ATTATAAAAA TAAC»TCTAT AAACCTACTA ACAATTTTCC TCCTGTGCAC AAAAATAATA 480 

CTGCCAAAAC CTGTCCTCAA AQACATGCCT QACTTTCAGG AAAGCTAATT ATGG AAATGG 540 

AGTTTCTOCrr TTGGGTTATC TTTGTTACTA TTTTCAAATA ACCAGCAACT CCCTATATTA 600 

CACTOAGATA CTTTATATAA ATAAOGTGGG CGAAACCTGA AGTTCACAAT GAGCCTGCTA 660 

GGTAGCTGGT GTCAAGTACA AAT6ATAGGA ATTOACTTTO CCAACAACAT CAAAAGCATT 720 

TTCCCTGATA TTCCTOATAa ACCTACCACT ATCftOATCCT CCATATTCAA TAAGATTTAT 780 

CCTGGAAGCA ATGAAAATGT TAAATATTAC TTTOCTAOAO TTTCTCCTCC TTTATTTAGA B40 

AATAAATGTO TAGTGGGGAC CAGIGOTTQT AATGTAGATA CTTGAGAAGT TTCATTGATT 900 
CCTTCAGACC CATGCCG 

Seq XD NO: B17 DNA sequence 

Nucleic Acid Accession «i NM_022698.1 

coding sequence: 268.. 273 9 

1 II 21 31 41 51 

I I 1 I 1 I 

TTTTCTTGCT TTTCTTCCCT ' m ' rm ' CTT TTTGCAAACA AAACAAAAAA CAGCATAGAA 60 

GAAAQAGCAA AATAAAGAAG AAGAAGAGGA GGAAGAOAQO GAAAGAGAGG AAGGGAAAAA 120 

AAACAGCAAC C0GGGCA6AG GAGGAQOTGC OGCGGOGGCG G06GCGGCGG CAGOGGCGGC 180 

AOCGGCGOGa 0QGCG6CTCG GACCCCCTOC CCGGGCTCCC COCATCAGTO CAOCTCTCCG 240 

GGOGATGCCA QAATAGAT6C CGGGGCAATO TCCOQCOGCA AACAGGGCAA CCCGCAGCAC 300 

TTOTCCCAQA GGGAGCTCAT CACCCCAOAG GCTGACCATG TGGAGGCCGC CATCCTCGAA 360 

GAAOAOQAGG GTCTGGAGAT AGAGGAGCCA AGTGGCCTGG GGCTOATCGT GGGTGGCCCC 420 

GACCCTGACC TGCTCACCTQ TGGCCAOTGT CAAAT6AACT TCCCCTTGGG GQACATCCTG 480 

GTTTTTATAO AOCACAAAAa GAAGCAGTGT G60GGCAGCT TGGOTGCCTG CTATGACAAO 540 

OCCCTGGACA AGGACAGCGC GCCACCCTOC TCACJSCTCCG AGCTCAGGAA A0TGTCC6AG 600 

CCGGTG6AQA TCOGGATCCA AGTCACCCCC GAOGAAGATO ACCACCTGCT CTCACCCACG 660 

AAAGGCATCT OTCCCAAGCA GGAGAACATT QCAGGTAAAG ATGAGCCTTC CAGCTACATT 720 

T6CACAACAT GCAAGCAGCC CTTCAACAGC GCGTGGTTCC TGCTGCAGCA CGOQCAGAAC 780 

A08GACG6CT TCCGCATCTA CCTGQAGCGC GG0CC6G0CA GCAOCTCOCT CACQCCGCGG 840 

CTCACCATCC OGCCGCCGCT OQGGCOQaAG OCOGTGOOQC AOTCCCCGCT CATQAATTTC 900 

CTGGGOQACA GCAACCCCTT CAACCTGCTG OGCATGACGG GCCCCATCCT GCGGGACCAC 960 

CCGGGCTTOG GCGAGGGCCQ CCTGCCGGGC AOOCCGCCTC TCTTCA6TCC CCCGCCGOGC 1020 

CACCACCTGG AOCCOCACOG CCTCAGTQOC GAGGAGATGG GGCTOGTOGC CCAGCACCCC 1080 

AOTGCCTTOG ACOQAOTCAT GOGCCTQAAC CCCATG60CA TGQACTOQCC OGCCATGQAC 1140 

TTCTCGGG6C GGCTCCSOQA OCTGGCGGGC AACAGCTCCA C6CG600GCC CGTGTCCC06 1200 

GOCCGOGGCA ACCCTATOCA CCGGCTCCTG AACCCCTTCC AGCCCAQCCC CAAGTCCCCG 1260 

TTCCTGAGCA CGCCGCCGCT GCCGCCCATG CCCCCTGGCG GCACGCOGCC CCOQCAGCCQ 1320 

CCAGCCAAGA OCAAGTCGTO CGAGTTCTGC GGCAAGACCT TCAAGTTCCA QAGCAATCTC 1380 

ATOGTGCAOC OGGGCAQTCA GAOOGOOSAO AAOCCCTAGA AGTGGCAGCT GTGG6ACCAC 1440 

OCOT OC rCGC AGGCCAGCAA GCTCAA606C CACATQAAGA OSCACATGCA CAAG6CCGGC 1500 

TOGCTGGCOG GCCGCTCCGA CGAOGGGCTC TCGGCCGCCA GCTCCCCCGA GCCCXX3CACC 1560 

AGCGAGCTGO CGGGOGAGGG CCTCAAGGCG GCCGACGGTG ACTTCCGCCA CCAOQAOAGC 1620 

GACCG6TC6C TG6GCCACQA GCCGGAGGAG GAGQACGAGG AOGAGGAGGA GGAGQAGGAG 1680 

GAGCIGCTAC TQOAGAACOA GAGCCQGCCC GAOT06AGCT TCAQCATGGA CTCGQAOCTG 1740 

AGCCGCAACC GGGA6AA06G G6GTGGTGGG GTGOCOGGGG TGCOGGQOQC GGGGGGOGGC 1800 

GCGGCCAAGG CGCTGGCTGA CGA6AAGG0G CTGGTGCTGQ GCAAGGTCAT GGAGAACGTG 1860 

QGCCTAGGCG CACTGCCGCA GTAOGGCGAO CTCCrGGCCG ACAAGCAOAA GCGCGGCQCC 1920 

TTCCTGAA6C GTGCGQCGGG C6G<jGGGGAC G0GG6CGACG ACGACGAOGC GGGC66CTGC 1980 

6GGQA0QCGG GC00GGGC06 0QGQ6TCAAC GGGCQCQGGG GCGQCnOQC O0CAG6CACC 2040 

6AGCCCTTCC COGOGCTCTT CC0GC3CAAQ CCCGOGCOOC TGCCCAGCCC GGG6CTCAAC 2100 

AGCGCOGCCA AGCGCATCAA GGTGGAQAAG GACCTGGAGC TGCOQCCCGC CGOGCTCATC 2160 

COGTCCGAQA AOQTGTACTC GCAGTOGCTQ GTGGGCTACO CGQOGTCGCG GCACTTCATG 2220 

AAGGACCCCT TCCTGGGCTT CAOGGAOGCA C6ACAGTCGC CCTTOQCCAC 6T0GTC0GAG 2280 

CACTGOTGOQ AGAAOQGCAa CCTQC6CTTC TCCA06C00C OO^GGACCT GCTGQACGGC 2340 

GGCCTCTCGG GC0GCAG060 CAOGOCCAOC GGA0GCA6CA CCOCGCACCT GGGC6GCCCG 2400 

GGCCCOGGOC GGCCCAGCTC CAAGGAGGGC CGOCGCAGCG ACAOrrGOGA GTACTG0C3GC 2460 

AAGGTGTTCA AGAACTGCAG CAACTTQACG GTOCACCQGC GGAGCCACAC C6G0GAG0GG 2520 

CCTTACAAGT GOGAGCTOTQ CAACTAOSCG TGGGC6CAGA GCAGCAAGCT CAG60GCCAC 2580 

AT6AAGACGC ACGGGCAGAT GQQCAAGGAa OTOTAGCGCr QOQACATCTG CCA GATGO OC 2640 

TTCAGOGTCT ACAGCACCCT GGAOAAACAC ATGAAAAAGT GGCAOGGOGA GCACTTGCTG 2700 

ACTAAOQAOO TCAAAATCGA GCAGGCOSAG AGGAGCTAAG CGOGOGGGCC COSGCGCCCC 2760 

GCACCTGTAC AGTGGAACOG TTQCCAACOG AGAGAATGCT GACCTQACTT GCCTC CGTG T 2820 

CACC6CCACC C06CACCCC0 OGTGTCCCOQ GGGCCCAGGG GAGG06GCAC TCCAACCTAA 2880 

CCTGTGTCTG C3GAAGTCCTA TGGAAAOC03 AGGGTTOATT AAGG CAGTAC AAATT qTOGA 2940 

GCCTTTTAAC TOTGCAATAA TTTCTGTATT TATTGGGTTT TGTAATTTTT TTGGCATOTO 3000 

CAGGTACTTT TTATTATTAT ' m ' i ' TCiXJri ' TGAATTCCTT TAAOAQATTT TGTTGGGTAT 3060 

CCATCCCTTC ' mXiTrm ' T TTTTAACCOG GTAQTAOCCT GAG CAATGA C TCGCA AGCAA 3120 

TGTTAGAGGO GAAOCATATC TTTTAAATTA TAATTTGGGG QGAGGGOTGG TGCTGCTTTT 3180 

TTGAAATTTA AGCTAAGCAT OTGTAATTTC TTOIGAAOAA 6CCAACACTC AAATGACTTT 3240 

TAAAGTTOTT TACTTTTr CA TTCCTTCCTT TTTTTTGTOC TQA AATAAA A AOTGOCATOC 3300 

AGTTTTTTTT TTAATTATTT TTTAATTTTT TTTTTGGTTT TTOTl'lTO jC GGTGGG6G6T 3360 

OTGQATOTAC AGCGGATAAC AATCTTTCAA GTC6TAGCAC TTTGTTTCAO AACIGGAAXQ 3420 



803 



wo 03/042661 



PCTAJS02/36810 



GAGATGTAGC ACTCftTGTOG TOCOSAGTCA AGOGGGCTTT TCTGTGTTQA TTTCGGCTTT 3480 
CATATTACAT AAGGGAAACC TTGAOTCGTO GTGCTGQSO G AGGCACCCCA CAQACTCAGC 3S40 
GCCGCCA6AQ ATAGGGTTTT TGGAGGGCTC CTCTGGQAAA TCGCCCXSACA GCATTCTGAG 3600 
GTTGTGCATG ACX^WSCAGAT ACTATCCTGT T GG TQTGCCC TGGGGTGCCA TGGCTGCTAT 3660 
5 TCGCTGTAGA TTAGGCTACA TAAAATGGGC TGAGQGTACC TTTTPGGGGA GATGG GGTGG 3720 
CCTOCAOiaA CACAGAAAiGG AAGAAACTAO O S g l tlTT Cn ' TTAQ60QTTT TCTGGCTTGA 3780 
OGGCTTCTCT CTTTTTTTAA ATCACCCCCA CCACATAAAT CTCAAATCCT AT0TT6CTAC 3840 
AAGGGGTCAT CCATCATTTC CCAAGCAGAC GAATGOCXITA ATTAATTGAA GTTA GTGTTC 3900 
TCTCATTTAA TGCACACTGA TGATATTQTA OGGATGGOTG GGGTGGGGAT CTTQCAAATT 3960 
10 TCTATTCTCT TTTACTGAAA AAGCAGGGGA TGAGTTCCAT CAGAAOGTGC CCAGOQCTAC 4020 
TTCCXAOQTT TTTATTTTTT TTTTCCTATC TCATTAGOTT GGAAG GTACT AAATATT GAA 4080 
CTGTTAAQAT TAGAC A TTT G AATTCTOTTG AGCOGCACTT TAAAGCTTTT OTTTGCATTT 4140 
AAATTAAATG GCTTCTAAAC AAQAAATTQC AGCATATTCT TCrCTTTGGC CCAGAGGTGG 4200 
OTTAAACTGT AAGGQACAGC TOAQATTGAG TGTCAGTATT GCTAAG CGTG OCATTCACAA 4260 
15 TACTOGCACT ATAAAGAACA AAATAAAATA ATAATTTATA OGACAGTTTT TCTACTGCCA 4320 
TTCAATTTOA TGTGAGTGCC TTGAAAACTO ATCTTCCTAT TTGAGTCTCT TGAGACAAAT 4380 
GCAAAACTTT TTTTTTGAAA TQAAAAGACT TTTTAAAAAA GTAAAACAAO AAAAGTACAT 4440 
TCTTTAGAAA CTAACAAAGC CACATTTACT TTAAGTAAAA AAAAAAAAAA TTCTGGTTGA 4500 
AGATAGAGGA TATGAAATGC CATAAQACXT AATCAAATGA AGAAATAAAC CCAGCACAAC 4560 
20 CTTGGACATC CATTAGCTQA ATTATCCTCA GCCCCTTTT6 TTTTTGGGAC AACGCTGCTT 4620 
AOATATGGAG TGGAGGTQA7 TTACTGCTGA ATTAAAACTC AAGTGACACA A QTTACAAO T 4680 
TGATATCX3TT GAATGAAAAG CAAAACAAAA AGAATTCAGQ AACAAOGGCT AATTTTTTCT 4740 
AAAGTTAAAT TTAGTGCACT CTGTCTTAAA AATAOOTTTA CAGTATTGGG TACATACAAG 4800 
GGTAAAAAAA AAATTGTQTG TATGTGTGTT GGAGCGATCT TTTTTTTTCA AAGTTTGCTT 4860 
25 AATAGOTTAT ACAAAAATGC C31CAGTGGCX: G03TGTATAT TOTTTTCTTT TGGTGACGGG 4920 
GTTTTArSTAT ATATTATATA TATXAAAATT TCTTGATTAC TOTAAAAOTC GACCAOTATT 4980 
T6TAATAATC GA0AAT6CCT GGQCATTTTA CAAAACAAGA AAAAAAATAC CCTTTTC TTT 5040 
TCCTTCAAAA TSTTGCAGTA AAATTTAAAT GGTGGQTCTA TAAATTTGTT CTTGTTACAG 5100 
TAACTGTAAA GTC6GAQTTT TAGTAAATTT TTTTCTGCCT TGGGTGTTGA ATTTTTATTT 5160 
30 CAAAAAAAAT OTATAGAAAC TTGTATTTGO GGATTCaAAG GGGATTGCTA CACCATGTAG 5220 
AAAAAGTATG TAGAAAAAAA OTGCTTAATA TTGTTATTGC TTTGCAGAAA AAAAAAAAAT 5280 
CACATTTCTS ACCTGTACTT ATTTTTCTCT TCC06CCTCC CTCTGGAATG GATATA TTOQ 5340 
TTGGTTCATA TGATGTAGGC ACTTGCTGTA TTTTTACTGO AGCTOOTAAT TTTTTAACTG 5400 
TAAGCTTGTC CTTTTAAAGG GATTTAATGT ACCTTTTTOT TAGTGAATTT GGAAATAAAA 54 60 
35 AGAAAAAAAA AACAAAAACA AACAGQCT6C CATAATATAT TTTTTTAATT TGGCAGGATA 5520 
AAATATTGCA AAAAAAACAC ATTT G T A TGT TAAGTCCTAT TOTACA GGAQ AAAAAGGGTT 5580 
GTTTGACAAC CTTTQACSAAA AAGAAACAAA AGGAAGTAOT TAAATGCTTT GGTTCACAAA 5640 
TCATTTAGTT GTATATATTT TTTGTCGGAA TTGGCCTACA CAGAGAACOG TTCGTGTTGG 5700 
GCTTCTCTCT GAACGCCCCG AACCTTGCAT CAAGGCTCCT TGGTGTGGCC ACAGCAGACC 5760 
40 AGATQOGAAA TTATTTGTGT TGAGTOGAAA AAAATCAGTT TTTQTAAAGA TGTCAGTAAC 5820 
ATTCCACATC OTCSCTCCCTT TCTCTAAGAO 6CCATCTCTA AOATGTCAQA TGTAQAGQAa 5880 
AGAGAG06AQ AGAACATCTT CCTTCTCTAC CATCACTCCT GTGOCGGTCA CCAC CACCAC 5940 
CTCTCXXXKX: CTTACCAOCA GAAAGCAATG CAAACTGAQC TGCTTTAGTC CTTGAGAAAT 6000 
TCTGAAACAA ACACAAATAT CATAAAAGGA GCTGGTGATT CAGCTGGQTC CAOGTGAAGT 6060 
45 GACCTGCTGT TGAQACCGGT ACAAATTGGA TTTC31QQAAG QAGACTCCAT CACA GCCAGG 6120 
ACCTTTCOTG CCATGGAGAG TGTTGGCCTC TTGTCTTTCT TCCCTGCTTT GCT6CTTTGC 6180 
TCTCTGAAAC CTACATTCOG TCAGTTTCCG AATGCX3A6GG CCTGGGATOA ATTTGGTGCC 6240 
TTTCCATATC TOGTTCTCTC TCCTTCCCCT GCXSOTCCTC TCCATCCTTC ATCCTCCATT 6300 
GGTCCTTTTT TTTTCTTTCA TTTTTTATTT AATTTCTTTT CTTCCTGTCT GTTCCTCCCC 6360 
50 TAATCCTCTA TTTTATTTrT ATTTTTTGTA AAGCCAAGTA 6CTTTAAGAT AAAGTGGTGG 6420 
TCTTTTGGAT GAGGGAATAA TGCATTTTTA AATAAAATAC CAATATCAG6 AAOCCATTTT 6480 
TTATTTCAGG AAATQTAAQA AACCATTATT TCAGGTTATO AAAGTATAAC CAAGCATCCT 6540 
TTTGGGCAAT TCCTTACCAA ATGC3W3AAGC TTTTCTGTTC GATGCACTCT TTCCTCCTTG 6600 
CCACTTACCT TTGCAAAOTT AAAAAAAAGG GGGGAGGGAA TGGQA QAGAA AGCT GAOATT 6660 
55 TCAGTTTCCT ACTGCAGTTT CCTACCTGCA GATCCAGGGG CTOCTGrTGC CTTTGGATGC 6720 
CCCACTGAGG TCCTAGAGTG CCTOCAOGGT GU TtTAtXl ta TAOTCATAAC AGCTAOCCftG 6780 
TCCTCACOVG CTTACCAOAT TOCCftGGACT AAGCC ATCCC AA flOCAC AAG CATTGTGTGT 6840 
CTCTGTGACT 6CAQAGAAGA QAQRATTTTG CTTCTGTTTT GTGTTTAAAA AACCAACACG 6900 
GAAGCAGATO ATCCCGAGAG AGAGGCCTCT AGCATGGGTG ACCCAGCCQA CCTCAGGCCG 6960 
60 QTTTCCGCAC TCCCACAACT TTGTTCAAAO TTQCCCCCAA TTGGAACCTQ CCACTTGGCA 7020 
TTAGAGGOTC TTTCATGGGO AGAGAAOGAG ACIGAATTAC TCTAA6CAAA AT GTGAAA AG 7080 
TAAGGAAATC AGCCTTTCAT CCCGGTCCTA AGTAACCGTC AGCCGAAGGT CTCGTGGAAC 7140 
ACAGGCAAAC CCGTGATTTT GQTGCTCCTT GTAACTCAGC CCTOC AAAGC AAAGTCCCAT 7200 
TCATTTAAGT TGTTTGCATT TGTACTGGCA AOGCAAAATA TTTTTArTAC CTTTTCTATT 7260 
65 ACTTATTGTA TGAGCXTTTQ TTGTTTACTT GGAGGrTTTO TCTTTTACTA CAAGTTT GGA 7320 
ACTATTTATT ATTGCTrGOT ATTTGTGCTC TOTTTAAOAA ACAG6CACTT TTTTTTATTA 7380 
TXSGATAAAAT GTTGAGATGA CAGQAGGTCA TTTOVATATQ GCTTAGTAAA AXATTTATTa 7440 
TTCCTTTATT CTCTGTACAA GATTTTGGGC CTCTTTTTTT CCTTAATOTC AC3VATGTTGA 7500 
GTTCAGCATC TCTCTGCCAT TTGATTrGTA aSCTTQTTCA AAACCAAGTT TGTTCTGGTT 7560 
70 TCAAGTTATA AAAATAAATT GGACATTTAA CTTGATCTCX: AAA 

8eq ID NOs B18 Protein sequence 
Protein Accession #5 NP_075049,1 

75 1 11 21 31 41 51 

KSRRKQC3JPQ HLSQRELITP EAZfflVEAAXL BBDE6LE2BB PSGLGLMVGG PDPDLLTOGQ 60 

CQMNPPLGDI LVFIEHKRKQ CGGSLGACYD KALDKDSPPP SSRSELRKVS EPVEIGIQVT 120 

PDEDDHLLSP TKGICPKQEN lAOKDEPSSY ICTTCRQPFN SAWPLLQHAQ NTHGPRiyLB 180 

80 PGPASSSIjTP RLTIPPPLQP EAVAQSPI*1N PLODSNPPNL LRMTGPILRD HPGFGEGRLP 240 

6TPPLF8PPP RBHLDPHRI«S AEEMGLVAQH PSAFDRVNRL NPMA IDSP AM DPSRRXiRBLA 300 

GNSSTPPPVS PGRGMPMHRL UIPFOPSPKS PPIiSTPPliPP MPPGOTPPPQ PPAKSKfiCBP 360 

COKTPKFQSN LIVHRRSHTG BKPYK0QM3) HACSQASKIiK RHM KTHMHICA GSIAGRSDDO 420 

LSAASSPEPG TSELAGBQLK AADGDFRHHB SOPSLGHBPB EBDBEBEEEB BBLLLQIBSR 480 

804 



wo 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



PB8SPSMDSB LSRNRENGGG GVPGVPGAGG GAAKALADBR ALVLQKVMEN VGLQALFQYG 540 

BLLADKQKRG AFLKRAAGGQ DAODDDDAGO CGDAGAGGAV NGRGGGFAPG TEPFFGLFPR 600 

KPAPLPSPGL NSAAXRIKVE XDLBbPPAAL IPSB9VYSQW ItVOYAASBaP NKDPFU3FTD 660 

ARQSPFATSS EKSSENOSIA FSTPPGDLLD GGLSGKSGTA SG6STPRLGG PGPGRPS8KB 720 

GRRSDTCBYC GKVFKNCSNL TVHRRSRTGE RPyKCELOnf ACAQSSKLTR HMKIHOQIGK 760 
SVYRCDIOQM PFSVYSTIiEK HMKKHHQEHL LTODVKZEQA CRS 

Seq ID NOt B19 DMA sequence 
Nucleic Acid Accession St NM_000399.2 
'Coding sequence: 339.. 1769 ~ 

1 11 21 31 41 51 

I I I i I I 

TAACTQAOOQ AGOAGCAArr OATTAATAGC TOG60GAGGG GACTGACT6A CTQTTATAAT 60 

AACACTACAC CAGCAACTCC TG6CTTCCCA GCAGCOSGAA CACAGACAGG AGAGAGTCAG 120 

TGGCAAATAQ ACATTTTTCT TATTTCTTAA AAAACAGCAA CTTGTTTGCT ACTTTTATTT 180 

CTGTTQATTT ' IT iT ri ' Cn ' tJ GTGTGTGTGG itJGTm ' rrn ' TAAGTGTGGA GGGCAAAAGQ 240 

AQATACCATC CCACSGCTCAG TCCAACCCCT CTCCAAAAOO GCTTTTCTGA CACTCCAGGT 300 

AOCOAGGGAG TTGGGT C T C C AGGTTGTQOO AGGAGCAAAT GATGACCGCC AAGGCOGTAG 360 

ACAAAATCXX: AGTAACTCTC AGTGGTTTTG TQCACCAGCT GTCTOACAAC ATCTACC06G 420 

TGGAGGACCT OGCOGCCAOQ TCOOTGACCA TCTTTCCCAA T6C0QAACTG GGAGGCCCCT 480 

TTQACCAGAT GAACGGAGTO GCCX^GAGATO GCATGATCAA GATTQACATG ACTGGAGAOA 540 

AOAGGTOGTr GGATCTCCCA TATCCCAGCA OCTTTGCTCC OGTCTCTGCA CCTAGAAACC 600 

AGACCTTCAC TTACATGGGC AAGTTCTCCA TTGACCCTCA GTACCCTGQT QCCAGCTGCT 660 

ACCCAGAAGG CATAATCAAT ATTGTGAQTQ CAGGCATCTT GCAAGGGQTC ACTTCCCCAG 720 

CTTCAACCAC A6CCTCATCC AGCGTCACCT CTOCCTCCCC CAACCCACTG GCCACAGGAC 780 

CCCTGGGTGT GTGCACCATQ TCCCAGACCC AOCCTGACCT GGACCACCTG TACTCTCCGC 840 

CACOJCCTCC TCCTCCTTAT TCTGGCTGTG CAGGAGACCT CTACCAGGAC CCTTCTGCGT 900 

TCCTQTCAGC AGCCACCACC TCCACCTCTT CCTCTCTGQC CTACCCACCA CCTCCTTCCT 960 

ATCCATCCCC CAAGCCAGCC ACGGACCCAG GTCTCTTCCC AATGATCCCA GACTATCCTG 1020 

GATTCTTTCC ATCTCAGTGC CAGAGAGACC TACATQQTAC AGCTGGCCCA QACCGTAAGC 1080 

CCTTTCCCTG CCCACTOGAC ACXXTGCGGG TGCCCCCTCC ACTCACTCCA CTCTCTACAA 1140 

TCCGTAACTT TACCCTGGGQ GOCCCCAQTG CTGGGGTGAC OGGACCAGGG GCCAGTGGAG 1200 

GCAGCGAGGG ACCCGGGCTQ CCTGGTAGGA GCTCAGCAGC AOCAGCAGCC GCOGCOSCCG 1260 

CCGCCTATAA CCCACACCAC CTGCCACTQC GGCCXATTCT 6AGGCCTC6C AAGTACXXXA 1320 

ACAGACCCAG CAAGACGCCG GTGCACGAGA GGCCCTACCX: GTGCCCAGCA GAAGGCTGCG 1380 

ACCGQCGQTT CTCCOGCTCT GACGAGCTGA CACGGCACAT COGAATCCAC ACTGGGCATA 1440 

AGCCCTTCCA GTGTCGGATC TGCATGCGCA ACTTCAGCCO CAGTGACXavC CTCACCACCC 1500 

ATATCOGCAC CCACACCGQT QAGAAGCCCT TCGCCTGTGA CTACTGTGGC OGAAAGTTTG 1560 

CCC6GA6TGA TGA6AGGAAG CGCCACACCA AGATCCACCT GAGACAQAAA GAGCGGAAAA 1620 

6CAGT6CCCC CTCTGCATCQ GTGCCAGCCC GCTCTACAGC CTCCTGCTCT GGGGG06TGC 1680 

A6CCTGGG6G TACCCTG76C AGCAGTAACA GCAGCAOTCT TGGGGGAGGG CCGCTCGCCC 1740 

CTTGCTCCTC TCGGAC0C6G ACACXTTTQAG ATGAGACTCA GGCTGATACA CCAGCTCCCA 1800 

AAGOTCCCGG AGGCCCTTTG TCCACTGGAG CTGCACAACA AACACTACCA CCCTTTCCTG 1860 

TOCCTCTCTC CCTTTGTTGG GCAAAGGGCT TTGGTGGAGC TAQCACTGCC CCXnTTCCAC 1920 

CTAQAAOCAG OTTCTTCCTA AAACTTAGOC CATTCZAGVC TCTCTTAGGT GAGTTGACTA 1980 

TCAACCCAAG 6CAAA0GGGA GGCTGAGAAO QAGGIGCTGT 6GG6ATGCCC TGGOCAAGAG 2040 

GGCTGAGGTC TGACCCTGCT TTAAAGGGTT GT7TGACTA0 OTTTTGCTAC CCCACTTCCC 2100 

CTTATTTTGA CCCATCACAG GTTTTTGACC CTGGATGTCA GAGTTGATCT AAQACGTTTT 2160 

CTACAATAGG TTGGGAGATG CTGATCCCTT CAAGTGGGGA CA6CAAAAAG ACAAGCAAAA 2220 

CT6ATGTGCA CTTTATQOCT TGGGACTGAT TT66GGGACA TTGTACA6TG A6TGAAGTAT 2280 

AGCCTTTATG CCACACTCTG TGGCCCTAAA ATG6TGAATC AGAOCATATC TAGTTGTGTC 2340 

AACCCTTGAA GCAATATGTA TTATATACTC A6A6AACAGA AGTGCAATGT GATGGGAGGA 2400 

AOGTAGCAAT ATCTGCTCCT TTTOGAGTTG TTTGAQAAAT 6TAGGCTATT TTTTCAGTOT 2460 

ATATCCACTC AGATTTTGTO TATTTTTGAT GTACCCACAC TGTTCTCTAA ATTCTGAATC 2520 

TTTGGQAAAA AATGTAAAGC ATTTATGATC TCAGAG6TTA ACTTATTTAA GGGGGATGTA 2580 

OITATTCTCT GAAACTA66A TGGATQCAAT TO TOT TG GAA tf i tU' L V n ' W TC GCC i TGW 2640 

TGATGTAGAC AAATGTTACA AGGCT6CATG TAAATGGGTT GCX:TTATTAT GGAGAAAAAA 2700 

ATCACTCCCT GAGTTTAGTA TGGCTGTATA TTTATGCCTA 7TAATATTTG GAATTTTTTT 2760 

TAGAAAOTAT ATTTTTOT A T GCTTTOTTTT GTGACTTAAA AGTGTTACCT TT6TA6TCAA 2820 

ATTTCAGATA AGAATGTACA TAATGTTACX; GGAGCTOATT TGTTTGQTCA TTAOCTCTTA 2880 

ATAaTTGTQA AAAAATAAAT CTATTCTAAC GCAAAACCAC lAACTGAAOT TCAGATATAA 2940 
TGGATGGTTT GTGACTATAG TGTAAATAAA TACTTTTCAA CAAT 

6eq ID NO: B20 Protein sequence 
Protein Accession fit NP_000390.2 



I4MTAKAVDRI 
HXDMTGEKR8 
LQGVT6PAST 
LYQDPSAFLS 
TAGPDRKPPP 
AAAAAAAAAY 
ZRIKT6KXPF 



11 
I 

PVTLS6FVHQ 
LOIiPyPSSFA 
TASSSVT8A8 
AATTSTSSSL 
CPLDTLRVPP 
NFHHLPLRFI 
QCRXCMRNFS 
PSASVPAPST 



21 
I 

LSDNIYPVED 
PV8APRHQTF 
FNPLATt^LO 
AYPPPPSYPS 
PLTPLSTIRN 
LRPRKYFNRP 
RSDHLTIHIR 
ASCSG0VQP6 



31 
I 

LAATSVTIFP 
TYM0 KF8ID P 
VCIMSQTQPD 

PKFATDPGLF 
FTLGGPSAGV 
SRTFVHERFY 
TBTGEKPFAC 
6TLCSSN88S 



41 

I 

NAELGGPFDQ 
OYPOASCYFE 
USBLYSPPPP 
PMIPDYPGPF 
TGP0ASGG8E 
PCPAEGCDHR 
DYOGRKFARS 
LGGGPLAPC8 



51 
1 

MNGVAGDGMI 
OimiVSAOI 
PPPYSGCAGD 
PSQOQHDLHO 
GPRLPGSSSA 
FSRSDBlfTRH 
DERKRRTKZR 
8RTRTP 



60 
120 
180 

240 
300 
360 
420 



Seq ID NOi B21 DNA sequence 

Nucleic Acid Accession #t NM_004962.2 

Coding sequence: 457.. 1893 



11 



21 
I 



31 
I 
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CKCkCkOBGQ CGCAOGCACA GGGCASCCGG G0CAO0SAC6 ACCCTGTCAG CTGCRGCCCC 60 

AGAGOTCOGQ OGOOOSCAaC OG G GTCCC CT OSAGGGCGCA GCCGGC06CC OC3CCXX3QCC 120 

CCTOGAAGCA GCCGGGC06G GOCSCGCAOTG OGCTACAAAC TTTCGCAGOG 0QAGTCCX3CC IBO 

AAOGCAGCGC GCXXykCTOGG QCT0G6CTCJ5 GCTCTGOGCT GCTCCGGACQ GCTOTGACCQ 240 

CTGGC06GGG GCTCGGQCCG COGGTACCCA OGGACOQOBC GCCOGGOTOC CTGCTCCGCT 300 

AAO C COCT O Q COOOGCOCGQ ACCTGCXSTAT CCAGC600CT GCTGCOOQGG CTCTCGCCGC 360 

GOOCCCTACT GCCOCOACGT CAGTOOGCAG CCTCOGQTGC GCCAOOQCTC GCCTTOCTCC 420 

TCCTGGACTT CGGCCCTTTG COGCCCTCAC CAOGCCATGQ CTCATQTCCC CGCTCGGACC 480 

AGCOOGGGAC CCX3GGCCCCA GCTGCTGCTG CTGCTGCTGC OGTTGTTTCT GCTGTTGCTC 540 
OGGGATGTGG CCGGCAGCCA CAGGGCCCCC GCCTGGTCCG CACTGCCOOC GGCCGCCGAC 600 
G0GCIGCA6O GG6ACAGGQA TCTCCAGG6G CACCCTGGG6 AGGOGGCGGC CAG6TTGGGC 660 
CCCAGOOOCC AGGACATGOT C6CTGTCCAC AT6CACAGGC TCTATQASAA GTACA6CCGG 720 
CAGGGCQOGC GGCC6GGAGO GGGCAACAOG GTCOQCAGCT TCAGGGCCAG GCTGGAAGTG 780 
QTOQACCAGA AGGCOOTOTA TTTCTTCAAC CTQACTTCCA TGCAAGACTC GGAAATGATC 840 
CTTAOGGCXa^ CTTTCCACTT CTACTCASAG COGCCTOGGT GGCCTOGAGC GCTCGAGGTG 900 
CTATOCWVGC CQOOGOCXSVA GAAOQCTTCA GGCCGCCOGC TGCCCCTGGG OCCQCCCaCA 960 

C6CCAGCACC TGCTCTTCOG CAGCCTCTOG CAOAACACGG CCACACAGGG GCTACTCCGC 1020 

GGGGCCATGG CXX^TGGCGCC CCCACCGOOC GGCCTGTGGC AGGCCAAGGA CATCTCCXXX: 1080 

AT06TCAAGG CXX3CCXX3CXX3 GGATG6CX3AG CTGCTCCTCT CCGCCCAGCT GGATTCTGAG 1140 

GAGAGGGACC CGGGGGTGCC COGGCCCAGC CCCTATGOGC CCTACATCCT AGTCTATGCC 1200 

AACQATCTGQ CCATCTOGGA GCCCAACM3C GTG6CAGTGA GGCTGCAGAG ATAOGAOCCC 1260 

TTGCX:rGCC3Q GAGACCCCGA GCCC0GCX3CA 60CCCCAACA ACTCA6CGGA COXOGOGTG 1320 

CGCCXSAGCCG 06CAGGCCAC TGGGCCCCTC CAGGACAACG AGCT6CCGGG GCTQQATGAQ 1380 

AGGCCXSCOSC GOGCCCACGC ACAGCACTTC CACAAQCACC AGCTGTGGCC CAGCCCCTTC 1440 

CGGGCX3CTGA AACCCOQGCX: AGGGOOCAAA GAC06CAGGA AGAAG6GCCA GQAGGTGTTC 1500 

ATGGCOGCCT OGCAGOTGCT GGACTTTGAC GAOAAOAOQA TQCAGAAAGC COQaAOQAAG 1560 

CA6TGGGATQ AGCC0AGG6T GTGCTCCOGQ AOGTACCiaA AOGTGGACTT GGCAGACATC 1620 

QGCTGGAATG AATGGATAAT CTCACCGAAA TCTTTTGATG CCTACTACTO OGCGGGAQCA 1680 

TGTGAGTTCC CCATGCCTAA GATCXSTTCGT CXy^TCCAACC ATGCCACCAT CCAGAGCATT 1740 ■ 

GTCAGGGCTO TGGGCATCAT CCCTG6CATC CCAGAGCCCT GC7GTGTTGC CGATAAGATO 1800 

AACTCCCTTO GGGTCCTCTT CCTOQATGAG AATGGGAATG TGGTTCTGAA GGTGTACCCC 1860 

AACATOTCCO TGGACACCTG TGCCTGC O GG TGAGAOCACT CCA6GGTGGA AAGAAGCCAC 1920 

GCCCAGCAGA GCTGCCTTCT CGGAGCCTTC TGCAACCAOO ACTTQTGGTG CAGCTGCAGA 1980 

CACAGAGCAC AGCTCATGGQ CAACATCACT GGGGCCCAGA QAGAGCTGTC CX3CCAGTGCA 2040 

TCATTAGGGO GTCTTTCATT GCTAGTGACT AGCCCCTTAA ATOCXaOCCT GAOTACCTOA 2100 

AGGAATCTGO GAATTAGCCC TOGCCTQAAA GTGGCOCATC ATTCATACCC ACTGTTCTGA 2160 

AGGCTTQAAA ACAAAACATA TCCACAACAT TGGCTTGATG TGATCATGAT CTCATAACTO 2220 

AGCAAQAAGA CTATGCAAAT CTTA0GGCX5C TOGCTCCCTG CACAOGGAAA GAACTCTGTT 2280 

TAAATGCTCA GTTCAOAACA CTTTGGGCCA CATAGTGATT TTGGAAAACA GGATAATCGT 2340 

GGTGTAAATG AgT G 'lTTCCT TTCAAAGTCC ACTGCAOAGC TTTTATOCAT ATGGTATGCA 2400 

CATQTAGCCA ATATTQOTTT CTTTTTCTTA ATATATATAT TTTATTTTAA AACAACAAAA 2460 

AGGGAGGGCG rrOACAGCAT TCCGCACA6A GATAGTCATG CTGAGTOTGG OTTGTTXAAA 2520 

CATGCATATT GAAATAACAC ATATAOTAAC GTGGQAATAC TAAAAAATAA OCAAQATTTT 2580 

ATATTTTTGT AAATTATACT TTCTATACTG TAGATTGTGT ATGTTATGTQ TTTTTATG6A 2640 
AAGCTAATAA ATTAAAGGTA CAGT6GTATC TTOA 

Seq ID NOt B22 Protein sequence 
Protein Accession «t NP_004953.1 

1 11 21 31 41 51 

I I I I I 1 

MARVPARTSP GPGPQLLLIiL liPLFLLLIOU) VAGSHRAPAW SAIiPAAADGL QGDRSLQRHP 60 

OJAAATLQPS AQDMVAVHMH RliYEKYSRQO ARPGGGNTVR SFRARLBWD QKAVYPPMLT 120 

SMQDSEMILT ATFHPYSEPP RWPRALEVLC KPRAKNASGR PLPLOPPTRQ HLLPRSLSQN 180 

TATQGLLRGA MALAPPPRGL KQAKDISPXV KAARRDGBLL LSAQLDSEER DPGVPRPSPY 240 

APYILVU^ LAISEPNSVA VTLQRYDPFP AQDPSPRAAP NNSADFRVRR AAQAXGFLQD 300 

NELPOU>ERP PRAHAQHPHK HQLWSPFSA UCPRPGRKDR RKRGQBVPMA A8QVU)PDBK 360 

TMQKAIffiKQH DBPRVCSRRY LKVDFADIGW NEWZISPKSF DAYyCAQACB FPMPKIVRPS 420 
NHATZQSZVR AVOIZPOZPB PCCVPDKMN8 MVIjFLDEMR NVVLKVYPHM SVDTCACR 

6eq ID NOi B23 DNA sequence 

Nucleic Acid Accession «i AX026322.1 

1 11 21 31 41 51 

I I I I I j 

ATTCTTTAAA TACTTAAACA TQGTTATAAC AOCTGTGTTT TAAGCTOCTT GTCTGTTAAT 60 

TCCATCATCT GTCATTTTTO CATCT G TTTT TATTGACTAA CTTTCTTCCT CATCATGGTC 120 

AC3VTATCCTT GCTTCTTCCC ATATTTAGTA AAOTTTAGTT GGATGCTGGA TTGTTTTAAT 180 

TTAGCATTTT GGAOTGACTQ CTTTTGCACT TGCXTTTTAAA AGTATTGGAC TTCGTTTTGA 240 

TAOTTQAATT ACTTGCAAAT CAGCTTGATC ATTCTOAiaAC riWfTTTAA CTTTTOTOGA 300 

AAAOGCTATG CTACTTCAAO XATAATAAAA OCTAfiTTTTA GTTTTATCCT ATAACTAAGQ 360 

CATGTGTTCA ATTTQAATGT CTCCAAOCXrT GTGTGAACTC TOAAAATTGT TCAOCTCCCA 420 

ATTTCOCAOT AATTTTTTTT GTTCAQCCTT GCAGTCTCAT CCTACTCAAG TGTGGCTCTG 480 

TATCCAACAA CAGTCTTGGA QATCTCATGA AGATTTCTGA AACTTTTGCT CTGCAGOATA 540 

CCTTCCTTTG TGGTTATOTa CCCTQAAAAC TCCACCCTCC GTGTCTTCAA TTCAGTGAAA 600 

CTACTACTCT GCTTGGATTC GCCTCTTATA CAATGGTCTA C»AGTGACT GCAAACAQAA 660 

ATCTAAGGGA ATTTCAOGAO TCACCTCATT TATTTTCCTT TTTTTCAGGG ATTATA0TA6 720 

TACTATACCA CXTOTCAGCC AATOTQTQAG AATGGTGGTT TCATACATTT TCTCCAATTC 780 

TTTCTTGTTA CAGGAGCCTC CATTACTTTT CAAACAGTAA TGCAGTTGCA QTTGCCTCTC 840 

CTTTTCTTCA TTATGTQTTT ATCTCTGGCA GTTTGAGCCA AGAOAGGGCA CGGAGAAGTA 900 

CAATGACXAG AGAGCACTTC T Q TTGAGGCT CATTCATTGA COCCTACOOC AGTQCTTTAT 960 

GAATOTBIGC TGCAGATGTC ATACA6CATC ACAGCTTCTT CTCTAATTTA TGAOCCATAA 1020 

I ' i ' lf ' l ' l ' fm TGTATTTTTA TTGTATTACC TGCTTGOITO CAAOAGGATO' ATGAGGAOGA 10 BO 

CTACTATAGQ ATAAATTTGT TTTTATAGAG CAATTTCTGA TOGGTCAOGA GQGATAOTAC 1140 



I 



806 
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ICOVTTTTCC TCkCTQATAT CQOCTTCATC ATCTOASTCA OTQOaCTCRT TOTGQAAACA 1200 
TATGQAATGC TTTATGCTTT TTCTGAGGCT GCTTCIGTTA CATOAAATAA AOOCAGAOTT 1260 
QATTOTGAAA AAAAAAAAAA AAAA 

Seq ID NO: B34 DNA sequence 

Nucleic Acid Accession li cat cluster 

1 11 21 31 41 51 

1 I I 1 I 1 

TGPTATACATT CCTITCAAAT AAAOACCrrO AGAAAACAGC AGAGCCAAOT GAAlGATCACC 60 

TAAAGMICTT TGT66CTAAT TTATACTTCA TGTAOTAGCA GTGGGTACTG G6CAGG6TCT 120 

CTTTOCCACT CTQATQATTT OTQCTCTTAT TTTTCCTASA TTTACCTCAT CTAGGC5CATA 180 

TTCTTTTCCC TCTTCCTCTT TACXTTTTCCT GGTCCTGATC CCTCTGTACT CAOTTCCCCT 240 

AAATTATTGG ACTACAAACT AATATACTAO AAAAOCATAC ACTTATTTTA TTTGAATGCA 300 

GAAATGCTAT CTATCA6TAT ATATACATAA GAATGTATAT TACAOTATAT CTATATATAC 360 

TTAACACTCT AACTTTCJ^ ATTCCCX9\GT TAOGOTAOCT AACTCTCCTG TGGGTTATGT 420 

TAATTCTTAT TAQACTACTA GAGAAAAAGC AACTGGCAGT TTGCTAAGCA TATCTACTGO 480 

TGTTQTTTCT GCGCCCTCTT TTGGCTAATT QATGTAATTA TACTGGCTCT AAAGATTTAC 540 

TGCCCCATAA GTAAATACJTA TAGCCACATT CTGAACATAT CAAAAGTACA AACTTAGQAO 600 

GAGTGTATGT ACAAAAATGT AAAATTTTAT GAAAATGAAC ATGTTTTTAT GATGTTATTT 660 

CTAGTTCATA AGAATQTGAT GACTGCTTTG CTTOITTTAT GTACOTTGOC ATTATATTCT 720 

TGCTOTCAAT CAATCACAAA TTTATATCAG ATTAGGATAA ACTAAGCCAT TTTATGTATT 780 

TTATTTTAAA CCTTATTTTQ GCAGAGTAAT TCCTTAGAAT TGGAAAAGCT OTTACTTTGA 840 

AATTACXSUVT TTATTACAAA ACATAGAAAT GTATTQTAGC TACAAAGACA ACCAAGCATT 900 

TTCTGTGTTT TAATGAATAT CTAAAAAACT ACATTTA6TT TATTTTACTC AGTTTTGAAA 960 

TGATTTTTTT ACTGGCTCTA TTOCCTTAAA ATAACTAAGA GATTAATGAT TCTTTGTATA 1020 

ATTTTCCTTT TCTTTGTTCT TTTTTTTACA TTTOGCftGAa TTATATCTAT AOTTTTAGTA 1080 

ACAATTTCTT ATOTATTCTQ QATAACTQAA AACAACTAAA GGTQTTGGGC ATTAGAAAAT 1140 

AATTOTGAGC AOTAAOATTA CTGATGTAAT ATGTATQTTQ GACTOAAQTA TTTCTTTATA 1200 

AACATTCTAT TTGATTTTAA GCAAAATGTA TOTTAAAGCA TGTTTTTACA TCAGTAAAGT 1260 

CArrTGTCGA CCTTCTGGAA ATGAAAGGTT TTTACCTAOA TACTGTAAGT TACACCTOCT 1320 

TAACAATCAT ATTTGTCATT GTTGTTTTCT GCAAACAAAA ATOTTTATGG GCTTCATGTA 1380 

GGCTTAAGAT TOTAOGCAAA AATGGACTGA GTTCAGGACC CTTCAAGCAG TAGGCATTCA 1440 

GTTACAOAOC AGTTGGTACT TTGTAACCCA QACTTACAGT TTAAAAATAT CAAGTTAGCT ISOO 

GATGTTTCAT TATAATAAAA ATACTATTTT GCTTAAGAGT TGTATTACAA ATATTTQTGC 1560 

TTAACATTAG AAATAGCTQT TTTAAATTGT AGTTAACATA TTAACTTrTT CA6AAAAAAA 1620 

GCATGGTTTA TTTTTAATAA TGAAATAGAQ AACATAATAC GTAATGTTCA GTATAACAGC 1680 

TGAGTTAAAA CATCTGCCAG GATTAACATC AGTGOGTTTT TQCCAATGCA TAGAGGCATT 1740 

TTTCrCTAAO TATGATGGCT AATQATAACT ATTCTTTQTT ACACATTCAA GTCACTCCCA 1800 

TACAA6TAAC TAGTGGGTGA TATGTTTGAC TCCAAAGGGT GTATTAATTC TGAATGCTAA 1860 
TCATGAAGAC TTAAGTTA60 ACAACACTTC AAACCAGOAA QTOTQAACTG ATTTA 

Seq ID nOt B25 DNA sequence 

Nucleic Acid Accession Hi FGENESH predicted 

Coding sequence t 1..1144 

1 11 21 31 41 51 

I I I i i I 

ATGGAGCG06 GGATGCACCT CX3GTGCAG0G QCCGCO^GCG AGQACGACCT CTTCCXGCAC 60 

AAGAGCCTQA GOQCCTCCAC CTCCAAGCGC TTGGAAGCGG CTTTCOGCTC CAOGCCCCOG 120 

GGCATGGACC TGTCCCTGGC GCOQCCXSCXrr OGGGAAOGCC CGGCGTCCTC CTCCTCGTCO 180 

CCCCTGGGCT GCTTOGAGCC GGCTGACCCC GAGGGGQCAG GGCTGCTQTT GCCGCCXJCCT 240 

GGA6GAGGCG GCGGCX3GCAG 06CGGGAAGT GG0QG0GQO3 GOGGOGGCGG GGTGGGTGTC 300 

CCCGGGCTGC TAOTAOOTTC AOCOGGCXTTT GGGGGCGACC CTAGCCTAAG CAGCCTGCCG 360 

GCOGGGGCOG CCCTTTGCXrr CAAGTACGGC GAAAGOGOGA GCOGGGGCTC GGTGGCOGAG 420 

AGCAGOGGOG GCGAGCAGAG CCCOGAOGAC GACAGCX3A0G GTCGCTGOGA GCTCGTGCTG 480 

GGGGC0GGA8 TAGCOGACCC GOGGGCCTCC C0GGGAGCXK3 QAGGTGGTGG CGCGAAGGCA 540 

6CCQAGG6CT GCTCCAATGC CCACCTCCAC GGOQGCGCCA GOGTCCCCCC GGGGGGCCTG 600 

GQCGGCGGCG GOGGCGGGGG TAGCAGCAGC GGTAGCA6TG GCX3GCGGTGG OGGTAGCGGT 660 

AGOGGCAGOG G0GGCA6CAG CAGCAGGAGC AGCAGCAQCA GCAAGAAATC CAAAGA6CAA 720 

AAGGOGCTQC GGCTTAAGAT CAATGCCOGA GAG0600GGC GGAT6CACQA CCTGAACQAC 780 

G06CTGGACG AGCTG06CGC GGTGATCCCC TACGOOCACA GCCCCTOGGT GCGAAAGCTC 840 

TCCAAGATOG CCA0QCT6CT 6CTCGCCAAG AACTACATCC TCATGCAGGC GCAGQCCCTG 900 

GAOGAOATGC GGCGCCTAGT CGCCTACCTC AACCAGGGCC AGGCCATCTC GGCTGCCTCC 960 

CTGOCCAGCT CGOCGGCTOC AGOGGCAOCA GCTGCTGCCC TGCACCCGGC GCTCGGOGCC 1020 

TAOGAGCAGG CAGCCGGCTA CCOGTTCAGC GCCOGACTGC CCCOGGCTGC CTCCTGOOOG 1080 

GAGAAGTGOG CCCTGTTTAA CAGOGTCTCC TCGAGCCTCT GCAAACAGTG CACGGAGAAG 1140 
CCTT 

Seq ID MO I, B26 Protein sequence 
Protein Accession ii FGENESH predicted 

1 11 21 31 41 51 

1 I 1 I I 1 

MERQ»(HLGAA AAGSDDLFUI K8LSAST8KR LEAAFRSTPP GMDXiSLAPPP RERPASSSSS 60 

PX/3CFEPADP EGAGLLLPPP 6GGGGGSAGS GGQGGGGVGV PGLLVGSAGV GGDPSLSSLP 120 

AGAALCLKYG ESASRGSVAE SSGGEQSPDD DSDGRCELVL RAGVADPRAS PGAGGGGAKA 180 

AEOCSNAHIia GGASVPPGGL GGGGGGGSSS GSSGGGGGSG 8GSGGSSSSS SSSSKKSKEQ 240 

KALRUIIMAR BRRRMHDLND AU7SLRAVIP YAHSPSVRKL 6KIATLLLAK NYILMQAQAL 300 

BBKRIUiVAYIj NQ0QAI8AAS LP8SAAAAAA AAALMPALGA YEQAAGYPPS AGLFFAASCP 360 
BKCAI.FNSVS SSLCKQCTEK P 

Seq ID NO: B27 I»IA sequence 

Nucleic Acid Accession «t cat cluster 
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PCTAJS02/36810 



1 H ai 31 41 51 

I I I ) I I 

' mTiTl ' lTr TTTTTTTTAA ATOTTTGATC AAAATATTTT AATAAAGATT CTTTCTGJVCA 60 

5 TAGATAOVCA TACAAATGGT OGTACATACC TGTCATAOTC TGATTGACCT ATTTAATATA 120 

TATATGATTC TTTACACATC CAAAACCOGC CAACAGATCC ATCACAGCTC CCAACTCACC 160 

ATCCAAOCTQ ACAAACTGAA TTTGTATTAT CTGCAAOOAG TGQAAAATAO CAGGACTCCA 240 

TTTTTAAAAA AGATTTTCTT GATTTTCATA GGGCGGAAAG GCAGTCAAAC AGCCATGCAA 300 

AACTAAAACr GAAAGCTCAC TTTGGGTAAA TAGCTTCTTG TTCTTCCTTA GTTTTCTTTC 360 

10 TTTTTAAATT TTATTTTTTT A6AAAAATAA CAAAOGATTT CACACCATAG GCAAATCAAA 420 

CCAOTCTTTT AACTTAAATA ATTCTCCACA GTTAAAATAA CATATATGTA CATATATATT 480 

AAAAOCAATT AAATTAGACC TTTTAAAAAT GCACAGCACA GCCTGGAAAA ATAT7TGCTT 540 

AGCATGTTCT TACGTATAGC AACTATTGCT GTGATOTTTT TCCCTTTTGG AATGTAAAGG 600 

AGTCCCCTTT CAAAAAAGAQ ATCAATTCAT TCATCAATTA AOAATACACC TTTCCTOTAA 660 

15 TTTTTGGACT GAAGCAATTT ATTAAAGCTC AATTTAAATA CAGGGATGAT GCAACTGAAA 720 

ATATCCAGGT GACCTTTCAT AAAOCTAAGC AGCTCAGATA CATCAATATT TCTCTTCATA 780 

CTTGTTGGCA AATAAACCTT TAAACACTTG GCACACAOCA TAAGTAATCT ATAAAACAAT 840 

TTTAGAGGCA TTAAAAAAAT CTGCACATAA GACCCATGAC TTTAACACAT OATAAATACT 900 

GTTGATGTGG AAGGQTCATT QAAOAATAAC AAATAAATAC CATGAATTGT TAATACATCA 960 

20 TTGCAQAGTA GAAAGTAACA A6GTGCACAT AAATATTTTT AAATGCAATT CTTTCAGCCA 1020 

CAGTCAGTTT TTTTATATCA CTCTOSCCAA AACTTTOAGC ATTTTCAGA6 GATTAAAOTT 1060 

CAGAOACAAT AAAAAATACA AOT C TTT CA T AOTAACATGT TCTCTCTCTC TG6CTCT0G6 1140 

TTTTATTTCC CCCTCACTTA AOTCAACATT TCAAGTrTTT CTCCTOGCTC AGAATCAAAA 1200 

TTTATTTTCA AQTGCCCTTT CTOATTTOTC TOAATOAATA TTCCATCTCT CATGCTACOG 1260 

25 ATCG6CT 



30 



Seq ID NOt B28 DNA sequence 

Huclelc Acid Accession %t NH_002581.2 

coding sequence i 366 .. 5251 



1 II 21 31 41 51 

I 1 i I 1 I 

GGGGGAGGGA ATTCAGCGGA TCAQTCTTAA QAOGAOCTTT TTTTTGGAGC GAGAAATCAT 60 

ATAAAATAAA ATQAAATAAA ACAAGGAGGA AGGCAACCAG CTGT TAGGG G GAAAA TAAGG 120 

35 CAGATAAAGG A6CGGGGAGA GAAATTAATT GCCAACCAGG AGGAGTTGGG CTGTATTTTT 180 

CAAAGGTOGQ QAOAQTGGAG CACACACCTT GA06AGGAAA GGGAGAAAGA AAAGAAAAAA 240 

6CAA6TGAAG 6GGGGCTCGC CCAAGAAGGG TOAAQAAGOO AAGAAAOTOG AGGCGCCX3A0 300 

GCTCCCAAAG CTG6CAGCTC OGGGTGGOGG TGCAGGGGCG AAGGGGGGGG C6GGGGGAAC 360 

GT0G6ACATG OGGCTCTGGA GTTGGGTGCT GCACCTGGGG CTGCTGAGCG CCQOGCTGGQ 420 

40 CTGGGGGCTG GCCQA60GTC CCCX3CCGGGC COGGAOAGAC CCGCX3GGC0Q GCC6ACCCCC 480 

G080CCCGCC GCOaOOOOOO CCACCT60GC CACCC60GGC C06CG6CGGC OSOQCCTGGC 540 

CGCOGCCGCC 6CC30CCGCO6 GGGOTGCCTG GGAAGCOQTG CQ08TCCCCC GGGGGOGGCA 600 

GCAGOSGGAO GCGAGGGGCG CX3VCCGAGGA GCCGAGCCCG CCX3AGC0GGG CGCTCTATTT 660 

CAGOGGGCGA GGCGAGCAGC TGC3QAGTCCT CCGGGCOQAC CTOQAGCTGC CCXX3GGA0GC 720 

45 6TTCAGQCTG CAAGTGTGGC TG06AGCX3GA GGGOGGCCAG AGGTCTCCGG CAGTGATCAC 780 

AGGGCTOTAT GACAAATGTT CTTATATCTC AOQTQAOGGA GGATGGGTOQ TGGGCATTGA 840 

CACCATCAGT GACCAAGACA ACAAAGAOCC ACGCTACTTT TTCTOCTT6A AGACAGACCG 90d 

AGCCCGGCAA QTOACCACCA TCAATGCCCA CCQCAGCTAC CTCCCAGGCC AGTGGGTATA 960 

CXTTAGCTGCC ACCTATGATG GGCAGTTCAT GAAGCTCTAT OTGAATGGTG CCCAGGTGGC 1020 

50 CACCTCTGGG GAACAAGTGG 0TG6CATATT CA6CCCACT0 ACCCAGAAGT GCAAAGTGCT 1080 

CATGTTAOGG GGCAGT6CGC TGAATCACAA CTAOCXSGGGC TACATCX3A6C ACTTCA8TCT 1140 

GTGQAAGGTG GOCAOQACTC AGGGGQAGAT ACTGTCTG^C ATG6AAACCC ATGGCGCCCA 1200 

CACTGCTCTA CCTCAGCTCC TCCTCCAGGA GAACTGGGAC AATOTGAAOC ATGCCTGQTC 1260 

CCGCATGAAG GATGGCAGCA GCCCCAAAGT GGAATTCAQC AATGCCCACG GCrTTCTGCT 1320 

55 GGACA06AGT CTGGAGCCTC CTCrOTGCGO ACA6AGATTG TGTGACAACA CAOAGGTCAT 1380 

TGCXAGCTAC AATCAGCTCT CAAOTTTOCn OCAOOOCAAa 0TGGTGC3GCT AOOG QGTGGT 1440 

CAACCTCTAT GAAGATOATC ATAAOAACCC 6A0GQTQA08 GGG6AGCAG0 TG6ACTTCCA 1500 

GCACCATCAG CTGGCTGAGG CCTTCAAGCA ATACAACATC TOCTGaOAOC TGOACGTGCT 1560 

GGAGOTGAGC AACTCCTCCC TTCQCOQCCG CCTCATCCTG GCCAACTGTG ACATCAGCAA 1620 

60 QATTGGGGAT GAGAACTGTG ACCCGGAGTG CAACCACAOQ CTQACX30QCC AOGA0GGCX» 1680 

GGATIGCGGC CACCTGGOCC ACCCTGOCTT GGTGAA8AA0 CA6CACAAC8 GG6TGTGIGA 1740 

CATGGACTGC AACTATGAAC GGTTCAACTT TGATGGTGGA GAOTGCTOTG ACCCTGAAAT 1800 

CACCAATOTC ACTCAQACTT GCTTTGACCC OSACTCTCCA CACAGAGCCT ACTTGGATGT 1860 

^ TAATQAGCTG AAOAACATTC TTAAATTGGA TGQATCAACA CATCTCAATA TTTTCTTTGC 1920 

65 AAAATCCTCA GAGGAGGA6T TGGGAGGAOT AGGAACTTGO CCATGGGACA AGGAGGCCCT 1980 

GATQGACTTA GQTQGCATT6 TCTTGAACCC ATCTTTCTAT GGCATGCCIG GGCACACCCA 2040 

CAOCATGATC CATGAGATTG 6TCAGASGCT GGGOCTCTAT CAOOTCTTCC OAGOCATCTC 2100 

AGAAATCCAO TCCTGCAGTG ACCCCTGCAT GGAGACAGAG CCCTCCTTOG AGACTGGAGA 2160 

CCTCTGCAAT QATACCAACC CAGCCCCTAA ACACAAGTCC TGTQGTGACC CAGGGCCAGG 2220 

70 AAATGACACC TGTGGCTTTC ATAfiCTTCTT CAACACTCCT TAau«»ACT TCATGAGCTA 2280 

TQCAOATOAC OACTGTAOGG ACTCCTTCAC GOCCAATGAA GTGG0C3UZAA TGCACTOTTA 2340 

CCTGQACCTQ 6TCTACCA6G GGTGGCA6CC CTCGAOOAAA CCAOOQOCTG TTGCXXTTOOC 2400 

CCCCCAAGTT CTGGGCCACA CAAC6GACTC TGT6ACACTG GAGTGGTTCC CACCTATAQA 2460 

TGGCCATTTC TTTGAAAQAG AATTGGGATC AGCATGTCAT CTTTGCCTGQ AAGGQAQAAT 2520 

75 CCTGOTGCAG TATGCTTCCa ACGCTTCCTC CXXAATGCCC TGCAGCCCAT CAGGACACTG 2580 

OAGCCCTOGT OAAGCAGAAO GTCATCCTOA TGTTGAACAO CCCTGTAAOT CCAOTSTCOQ 2640 

CACCTGGAGC CCAAATTCAG CTOTCAACGC ACACAOGGTT CCTCCAGCCT 6CCCTGAGCC 2700 

TCAAOOCTGC TACCTCGAGC TGGAGTTCCT CTACCCCTTG GTCCCTGAGT CTCTGACCAT 2760 

o TTG G GTGACC TTTGTCTCCA CTGACTGGGA CTCTAaTGQA GCTGTCAATG ACATCAAACT 2820 

80 6TTGGCTGTC AGTOGGAAGA ACATCTCCCT GGGTOCTCAG AATQTCTTCT GTGATGTCCC 2680 

ACTGACCATC AGACTCTGGG A0QTG660GA GGAGGTOTAT QQCATCCAAA TCTACAOGCT 2940 

GGATQAGCAC CTGGAGATOQ AT6CTGCCAT GTTGAOCrCC ACTGCAGACA CCCCACTCTG 3000 

TCTACAGrrST AAGCCCCTGA AGTATAAGGT GOTGCGGOAC GCTOCTCFOC AGATGGATOT 3060 

OGCCTCCATC CTACATCTCA ATAGOAAATT OOTAGACATG GATCTAAA1C TTGGCAGTGT 3120 

808 
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QTACCAOTAT TGGGTGATAA CTATTTCAGG AACTQAAOAO AGTQAGGCKr CACCTGCTGT 3180 
CACATAOVTC CATGOAOOTG OGTACTGTGG COATGGCATT ATACAAAAAO ACCAA6GTGA 3240 
ACAATGOGAC QACATGAATA AGATCAATOG TOATGOCTOC TCCCrTTTCT GCCGACAAGA 3300 
AGTCrCCTTC AATTGTATTG ATGAACCCAO COGOTQCTAT TTCCATGATQ GTGATGGGGT 3360 
ATGTOAQGAQ TTTQAACAAA AAACCAGCAT TAAGOACTGT GGTOTCTACA COCCCCAGGO 3420 
ATTCCTGGAT CAOTGGGCAT CCAATGCTTC AGTATCTCAT CAAGACCAGC AATGCCCAGG 3480 
CTGGOTCATC AT0Q6ACAGC CAQCAGCATC CCAGGTGTGT CGAAGCAAGG TQATAOATCT 3540 
CACTGAAGGC ATTTOXAGC AT6CCTGGTA CCCTTQCACC ATCA6CTACC CATATTCCCA 3600 
GCTGGCTCAO ACCACTTTTT GGCTCCGGOC GTATTTTTCT CAACCAATGG TTGCXX5CAGC 3660 
TGTCATTGTC CACCTQGTGA 0GGATGG6AC ATATTATGGQ QACCAAAA6C AGQAQACCSVT 3720 
CAGGGTGCAO CTGCTTGATA GCAAAGATCA GAGCCftOGAT CTAGGCCTCC ATOT CCTGAG 3780 
CTGCAOGAAC AATCCCCTGA TTATCCCTGT GGTCCAT6AC CTCA6CCAGC CXTTCTACCA 3840 
CAGCCAGGCG GTACGTGTGA GCTTCAGTTC GCCCCTGGTC GCCATCTCGG GGGTGGCCCT 3900 
CCGTTCCTTC GACAACTTTQ ACCCCQTCAC CCTGAGCAGC TGCCAGAGAG GGQAQACCTA 3960 
CAGCCCTGCC 6AGCAGAGCT GCX3T6CACTT CGCATOTGAO AAAACTGACT 6T0CAGAGCT 4020 
GGCTGTGGAG AATGCTTCTC TCAATTGCTC CA6CAGGQAC C6CTAOCA0O GTGCCCAGTO 4080 
TACTGTGAGC TGCCGGACAQ GCTAOGTGCT OCAGATAOGG CGGGATOATG AGCTGATCAA 4140 
OAGCCAGACG GGACCCRGCG TCACAQTGAC CTGTACAOAG GGCAAGTGGA ATAAGCAGGT 4200 
GGCCTGTQAa CCAGTCGACT GCAGCATCCC AGATCACCAT CAAGTCTATG CTGCCTCCTT 4260 
CTCCTGCCCT GAG6GCACCA CCTTTGGCAG TCAATOTTCC TTCCAGTGCC GTCACCCTGC 4320 
ACAATTQAAA GGCAACAACA GCCTCCTQAC CTGCATGGAO GATGGGCTOT GGTOCTTCCC 4380 
A6AG6CCCTG TQTGAGCTCA TGTGCCTCOC TCCACCX3CCT GTQCCCAATO CAGACCTCCA 4440 
GACOGCCCGG TCCCGAQAGA ATAAGCACAA GGTGGGCTCC TTCTGCAAAT ACAAATGCAA 4500 
GCCTGGATAC CATGTGCCTG GATCCTCTCG QAAOTCAAAQ AAACGGGCCT TCAAGACTCA 4560 
GTGTACCCAG GATGGCAGCT GGCAQGAQGQ AQCTTGTGTT CCTGTGACCT GTGACCCACC 4620 
TCCACCAAAA TTCCATGGGC TCTACCAGTQ TACTAATGGC TTCCAOTTCA ACAGTGAGTG 4680 
TAGGATCAAG TGTGAAGACA GTOATGCXTTC CCAGGGACTT GGGAGCAATG TCATTCATTG 4740 
CCGGAAAGAT GGCACCTGGA ACGGCTCCTT CCATQTCTGC CAGGAGATGC AAQOCCAOTG 4800 
CTCGGTTCCA AACGAGCTCA ACAGCAACCT CAAACTQCAG TGCCCTGATG GCTATGCCAT 4860 
AGGGTCGGAO TOTGCCACCT CGTGCCTGGA CCACAAC3\GC GAQTCCATCA TCCTGCCAAT 4920 
GAACGTGACC GTGCGTGACA TCCCCCACIG GCTOAACCCC AChOGGGTAG AaAOAGTTGT 4980 
CTGCACTGCT GGTCTCAAGT GGTATCCTCA 0CCT6CTCTG ATtCACTGTQ TCAA AGGCTG 5040 
TGRGCCCTTC ATGGGAGACA ATTATTGTQA TGCCATCAAC AAOCGAQCCT TTTGCAACTA 5100 
TGA06GTX3GG GATTGCTGCA CCTCCACAGT GAAQACCAAA AAGGTCACCC CATTCCCTAT 5160 
GTCCtGTGAC CTACAAGGTG ACTGTGCTTG TCGGQACCCC CRGG CCCAAG AACACAGOCG 5220 
6AAA6ACCTC CGGGGATACA GCCATGGCTA AGQAAOOACA AOAAGTTGTC AAAOAATTOC 5280 
CAAOQCCAQO ACCCACATCC CTTTGGTATT GATTTCACAG TCAGCTGCTC AACGGAATGG 5340 
CCTCTCCACA CCAGGGATCC TTAGCACCCA ACC0GTCT6C CTTTAATTTT ACCCAGOAAG 5400 
QACTCACATT GGG6CGAATG AACCAA6TTT CGCCATGCTG GATGATGAAA TGGATTCCCA 5460 
TCCCAAAOTC TGAQATGQAT TGCATATACA GTGTGCAQTC CCAGAGCCTC CTAAAATTCT 5520 
AGCCATTTGT GACACAACCA GAGCAAAAAA AAA 

8eq ID MO I B29 Protein sequence 
Protein Accession #i HPJ>02572.1 

I II 21 31 41 51 

I I I I I t 

^!RLHSHVLHL 6LL8AALG0G LABRPRRARR DPRAGRPPRP AA6PATCATR GPRPPRLAAA 60 

AAAAGRAWBA VRVPRRRQQR EARGATBBPS PPSRALYFSG RQBQLRVLRA DLBIiPRDAPT 120 

LQVWLRAEGG QRSPAVITGL TOKCSYISRD ROWWOIHTI SDQDNKDPRY PFSLKTDRAR 180 

QVTTINAHRS YLPGQWVYLA ATYDGQFMKL YVNGAQVATS GBQVGGIFSP I*TQKCKVXM^ 240 

GGSALNHNYR GYIEHPSLWK VARTQREILS DMBTHGAHTA LPQUjLQENW DNVKHAWSPM 300 

KDGSSPKVEF SNAHOFLUXT SLBPPLOGOT LCDNTEVIAS YNQLSSFRQP KWRYKWNL 360 

YEDDHKNPTV TREQVDPQHH QLAEAPKQYN ISWBLDVLEV SNSSIiRRRLI LANCDISKIG 420 

DENCDPBCNH TLTGKZXXHX: RHLRHPAFVK KQHNGVCDMD CNYBRPMPDO GECa>PBITM 480 

VTQTCPDPDS PHRAYLDVNE IiKNXLKLDGS THUIIFPAKS SBEBLAGVAT WPW PKB ftliMH 540 

LGGIVUIPSP YGMPGHTHIM IHBIGBSLQL YHVFRGISEI QSCSDPCMBT EPSPBTGDLC 600 

MXmfPAPKHK SOGDPGPGMD TOSPHSPFNT PWNFMSYAD DDCTDSPTPN QVARMHCYLD 660 

LVYQGHQPSR KPAPVALAPQ VLGHTTDSVT LEWFPPIDGH FPBRELGSAC HLCLEGRILV 720 

QYASNASSPM PCSPSGHWSP REAEGHPCVE QPCKSSVRTW SPNSAVNPHT VPPAGPEPQG 780 

CYliELBPIiYP LVPESLTIWV TFVSTDWDSS GAVNDIKLLA VSGKNXSI/SP OKVPCDVPLT 840 

IRLWDVGBEV YGIQIYTIDE HLEIDAAMLT STADTPLCLQ CKPLKYKWR DPPLQMDVAS 900 

ZLBLNRXFVD MDLNLGSVYQ YWVITISGTE ESBPSPAVTY IHGRGYCGDQ IIQKDQGBQC 960 

DDMNKHIGDG CSLFCRQBVS FNCIDEPSRC YPHDGDGVCB BPEQKTSIKD CGVYTPQGPL 1020 

DQWA8MASVS HQDQQCPGWV IIGQPAASQV CRTKVIDLSE GIS QHAW YPC TISYPYSQLA 1080 

QTTPWLRAYF SQPMVAAAVI VHLVTDOTYY GDQKQBTISV QLLDTKDQSH DUSLBVLSA 1140 

MKPLIIPWU DLSQPPYKSQ AVRVSFSSPb VAISGVALRS FDNFDPVTLS 80QRGBTYSP 1200 

ABQSCVHPAC BKTDCPBLAV BNASLHCSSS DRYHGAQCTV 8C3m3YVLQI RRDDBLIKSQ 1260 

TQPSVTVTCT EQKHNXQVAC BPVDC8IPDH HQVYAASPSC PBGTTFGSQC SFQCRHPAQL 1320 

KGNNSLLTCM HJGLWSPPBA LCBLMCLAPP PVPNADLQTA RCRENKHKVG 8FCKYKCKPQ 1380 

YHVPGS8RKS KKBAFKTQCT QDGSWQEGAC VPVTOJPPPP KPHGLYQCTM GFQPNSBCRX 1440 

KCEDSDASQG liGSNVIHCSKK DGTHNGSFHV GQBMQGQCSV PNEUSSNIiKli QCPDGYAXOS 1500 

BCATSCLDHN 8BSIILPNNV TVRDIPHWLN PTRVERWCT AGLKWYPHPA LIHCVXGCEP 1560 

FMGDNYCDAI NNRAPCNYDG GDCCTSrVKT KIWTPPPHSC DXiQGDCACRD PQAQEHSRXD 1620 
LRGYSHG 

6eg ZD NO: B30 DNA sequence 

Nucleic Acid Accession ftt NN_032808.1 

Coddng sequences 61.. 1008 

1 11 21 31 41 51 

I 1 I I I I 

CTAGTCTATC TCCGCTTOCT CRA C CTCTCC TACAACCOCA TCA6CR0CAT T QAGQ GCTOC 60 

AT0TTGCAT6 AGCTGCTCOQ GCTGGAG6A6 ATGCA0CTG3 TGGOCGQOCA GCTQGC06T6 120 



809 
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GTGGA6C0CT WQCCTTC OG OGGCCTCAAC TACCTGOGOQ TQCTCAATGT CTCTQGCAAC 180 

CAGCTGACCA CACTGGAGGA ATTAGTCTTC CACTCQGT GG GCAACCTGGA QACACTCATC 240 

CTQGACTCCA ACCOGCTGGC CTGOQACTGT OGGCTCCTCT GQ Q T G TTOO O GCGCCX5CTGG 300 

CGGCTCAACT TCAACOGGCA CCAGCCCAOG TGOCCCACGC CCGAGTTTGT CCAGGGCAAG 360 

OAOTTCAAGQ ACrrCCCTOA TGTGCTACTG CCCAACTACT TCACCIGG06 COG OGCC OGC 420 

ATOOGGGACC GCAAGGCCCA GCAGGTQTTT GTGGAOQAGG GCCACAOOGT 6CA0TTTGTG 480 
TGGGGGGGOG ATGGOGAGOC GCCGCC06CC ATCCTCTGGC TCTCAOCCOG AAAGCACCTG 54 0 
GTCTCAGCCA AGAGCAATQG GCXX5CTCACA OTCTTOCCTG ATGGCAOGCT G6AGGTGCGC 600 
TAOGCCCAGG TACAGGACAA OGGCAOGTAC CTQTQCATOO OGGCCAACGC GGGOGGCAAC 660 
GACTCCATGC COQCCCACCT GCATGTOOGC AGCTACTOGC OOGACTGGCC CCATCAGCCC 720 
AACAAOAOCT TCGCTTTCA7 GTCCAAOAG COGGGCX3AaO GAOAGGCCAA CAGCAOCOQC 780 
GCCACTGTGC CTTTCCCCTT GGACATCAAG ACCCTCATCA TG60CACCAC CATGGOCTTC 840 
ATCTCTTTCC TGGGCGTCGT CCTCTTCTQC CTG6TGCTGC TG'nT C l'CTG GAGCCGGGGC 900 
AAGGGCAACA CAAAGCRCAA CATOGAQATC GAGTATGTGC CCCQAAAGTC GGACGCAGGC 960 

ATCA6CTC06 C06AC6CGCC CC6CAAGTTC AACATGAAGA TQATATGAGG C0GG060GGG 1020 

GG6CAGGGAC C00C6GG060 CGGGGGAOQO GAAGQGGCCT GGCOGOCACC TGCTCACTCT 1080 

CCAOTCCTTC CCACCTCCTC CCTACCCTTC TAC3VCA0GTT CTCTTTCTCC CTCCC30CCTC 1140 

OGTCCCCTGC TOCCCCCOGC CAGCCCTCAC CACCTGCCCT CCTTCTACCA GGACCTCAGA 1200 

AGCCCAGACC TGGQGACCCX: ACCTACACAG GQGCATTGAC AGACTGGAGT TGAAAGCCGA 1260 

CXSAACCGACA CX3CX3GCAGAG TCAATAATTC AATAAAAAA6 TTA06AACTT TCTCTGTAAC 1320 

TTaOGTTTCA ATAATTATQO ATTTTTATGA AAACTTGAAA TAATAAAAAG AGAAAAAAAC 1380 

TATTTCCTAT AOCTAGTCGG AAT6CAAACT TTTGACOTCC TQATTGCTCC AGGGCXJCTCT 1440 

TCCAACTCAQ TTTCTTGTTT TTCTCTTCCT CCTCCTCCTC TTCTTCCTCC TTTCTCTTCT 1500 

CTTCCCCAGT GGGGAGGGAT CACTCAGGAA AACAGGAAAG GAGGTTCCAG CCCCACCCAC 1560 

CIGCCCACCC CGCCCCAGGC AGCATCAGQA GCA6GCTAGG GGQCA6GCCT GGGCCCA6CT 1620 

CCX3GGCIGGC TTTTTGCAlGG GC6CA00TGG AOGGGACAGG TCTGCCGATG GGGGTGGGAG 1660 

CCTCTCTCCT GGGCTGCCAG GOGGCACCaC TOCAAOGGGT GGGAGCCTGG C TTGGGTGT G 1740 

GCTGAQACTC TGQACAGAGG CTGGGGTCCT CCTGGGGGAC AGCACAOTCA GTGGAGAQAO 1800 

CCAGQGGCTG GAGGTGGGGC CCACCCCAGC CTCTGGTCCC AGCTCTOCTG CTTCACTTGCT 1B60 

GTGTGGCCTC AAGCAGGTCA CIGGCCTCTC TGGGCCTCAG TCTCCACATC TGTACAAATG 1920 

GGAACATTAC CCCCTGCOCT GCCTACCTCA CAGGGCTOTT GTQAGGAATT QATGAGATGA 1980 
TGTATGTGAA ACACTTTOTA ACCTGTAAAG CGCTGTGCAC ACGTG 

Seq ID KO: B31 Protein sequence 
Protein Accession tti NP_116197.1 

1 11 21 31 41 51 

I I I I 1 1 

MLHELLRLQB IQLVGGQLAV VEPYAPRGLN YLRVLNVSGN QLTTIiEELVF HSVGNLETLI 60 

ZJ3SNPLACDC RIiLWVFRRRW RLNFNRQQPT CATPEFVQGK BFKDFPOVLL PNYFTCRSAR 120 

IRDRKAQQVP VDE6RTVQFV CSADGDPPPA ZXiHIiSFRKaL VSAKSNGRLT VFPDGTLEVR 180 

YAQVQDllGTy IiCIAANAOI3T OSMPABLRVR SYSPDWPHQP KKrFAFZSMQ PQBGEAMSTR 240 

ATVPFPFDZK TLZIATmGF Z8FLSWLFC LVUiFLWSRG KQETrKHNZBl BWPRKSDAG 300 
ZSSADAPRKP MMXMZ 

Seq ZD NO I B32 DMA sequence 

Nucleic Acid Accession #t CAT cluster 

1 11 21 31 41 51 

t i I t I I 

CAOCACATAC AAGAAACATA CAGTGTAC3CT GAAAAGGGGC CCTTGAAATG TCATCAAAGG 60 
OTAATAACCT AGTGAQTGAG TTGTGATGTC ATCTGQAACA TAGGAAAT60 GGCTCTTAGC 120 
GTATTCX3GTA CGAAGGAAGC CAGGCTGGTC CTGGCAGGAA GTAAATGATA ATCTTTGGGA 180 
AACCAGQACC CTGCCTCCCA GCCCAGAGGT GGAGGAGGGC GQTCAGGGTG GGGTCTACAO 240 
TGGGACAGCA CTGACAAAGO TAGAGGGAAA TGTAATAGCA CATCTAGGCT GCAOTCTGGT 300 
GAAA0TG60C OQQOTGGT C C TTGGAAAACA GTrGQGCTGT TCTTGOCAGG AATTAGTGAC 360 
AGC tTO ' CO a TCACGGGCAG GGAOGGCTTQ ATTTAAAAAA AATAAATAAA TAAAAOGTCT 420 
GGGTATAGAA A 

Seq 10 MOt B33 DNA sequence 

Nucleic Acid Accession ftt NM_006174.1 

Coding sequences 71.. 140 8 

1 11 21 31 41 51 

I I I I I I 

GAAAGGCTAT GGGTAACAAC TGACCTGCCA CAAAOTTAGA AGAAAGGATT GATTCAAGAA 60 

AGACTATAAT ATGGATTTAO AGCTCGAOGA GTATTATAAC AAQACACTTG CCACAGAGAA 120 

TAATACTGCT GCCACTCGGA ATTCTQATTT COCAGTCTGO GATOACTATA AAAGCAGTGT 180 

AGATQACTTA CAQTATTTTC TGATTGGQCT CTATACATTT GTAAGTCTTC TTGGCTTTAT 240 

GGGGAATCTA CTTATTTTAA T6GCTCTCAT 6AAAAAGGGT AATCAGAAGA CTAOOGTAAA 300 

CTTGCTCATA GGCAATCIGG CCTTTTCTGA TATCTTGQTT GXGCTaTTTT GCTCAOCTTT 360 

CACACTGACG TCTOTCTTGC TGGATCAOTG GATGTTTGGC A AAGT OHGT GCCATATTAT 420 

GCCTTTTCTT CAATGTGTGT CAGTTTTGOT TTCAACTTTA ATTTTAATAT CAATTGCCAT 480 

TGTCAGGTAT CATATQATAA AACATCCCAT ATCTAATAAT TTAACAGCAA ACCAT GGCTA 540 

CTTTCTGATA GCTACTGTCT GGACACTAGG TTTTGCCATC TOTTCTCCCC TTCCAGTGTT 600 

TCACAQTCTT GTGGAACTTC AAQAAAGATT T06TTCA6CA TTGCTGAGCA OCAOGTATTT 660 

ATGTGTTGAG TCATGGCCAT CTGATTCATA CAGAATTGCC TTTACTATCT CTTTATTGCT 720 

AGTTCAGTAT ATTCTGCCCT TACTTTGTCT TACTOTAAGT CATACAAGTG TCTGCAGAAG 780 

TATAAGCTGT OGATTGTCCA ACAAAOAAAA CAGACTTGAA QAAAATOAGA TGATCAACTT 840 

AACTCTTCAT CCATCCAAAA AOAGTGGGCC TCAOGTGAAA CTCTCTGGCA GCCATAAATG 900 

GAOTT A TTC A TTCATCAAAA AACACAGAAO AAGATATA6C AA6AA0ACA0 CATOTOTOTT 960 

ACCTGCrCCA GAAAGAOCrr CTCAAGAGAA CCACTCCAOA ATACTTCCAO AAA ACTTT OG 1020 

CTCIGTAAGA AOTCAOCTCT CTTCATOCAO TAAGTTCATA CCAGGGGTCC OCACl"iGU' l'T 1080 

TGAGATAAAA OCTGAAGAAA ATTCAGATGT TCATGAATTG AGAGTAAAAC GTTCrGTTAC 1140 
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AA6AATAAAA AAGASATCTC QAAGTGTTTT CTACAGACTQ ACCATACTGA TATTAGTATT 
T6CIGTTAGT TGQATGCCAC TACACCTTTT CCATGTGGTA ACTGATTTTA ATGACAATCT 
TATTTCAAAT ACSGCATTTCA AGTTGGTGTA TTQCATTTOT CATTTGTTGO GCATGATQTC 
C l VntiTC T r AATCCAATTC TATATGGGTT TCTTAATAAT GGGATTAAAO CTGAITTAQT 
QTCOCTTATA CACTGTCTTC ATATGTAATA ATTCTCACTO TTT 

Seq ID NO: B34 Pxoteln seqtuence 
Protein Accession fts NP_00616S.l 

1 11 21 31 41 51 

I I I I I 1 

MDLBIDBYYN KTLATEMNTA ATRNSDFFVN XJDYKSSVDDL OYFLIGLYTF VSLLGFMGKL 60 

LIU-IALMKKR NQKTTVMFLI GNLAFSDILV VLPCSPPTLT SVLLDQWMFG KVMCHIMPFL 120 

QCVSVLVSTL ILI81AIVRY HMIKHPISNN LTANHQYPLI ATVWTIiGFAI CSPLPVFHSL 180 

VBLQBTFGSA LLSSRYLCVB SWPSDSYRIA FTISIiLLVQY ILPLVCLTVS HTSVCR8ISC 240 

aX«SNXBNRXiB WEKIHLTVR P8KKS6PQVK LSG8HKHSYS PIKXHRRRYS KKTACVLPAP 300 

BRPSQBtlHSR ILPENFGSVR SQLSSSSKPX PGVFTCPEXK PBBtlSDVHEIi RVKRSVTRIK 360 

KRSRSVPYRL TILXliVFAVS WMPLHIiPBW TDFMSKLISN RHPKLVYCIC KLLGMMSCCL 420 
NPZLYOFUm 6IKADLVSI.I HCLRM 

Seq ID NOs B3S DMA sequence 

Nucleic Acid Accession fit NM_014279.1 

Coding sequence t 286.. 1689 

1 11 21 31 41 51 

I ) I I I i 

G0GC6GGGGA GCCATTAGGA CX50QA0GAQA OAOGAGOOCG CAGCTCCCGC CCAGCCCAGC 60 

CXrroCCCAGC CCTGCCCGGA GQCAGACXSCXS COGGAACOGG GACX3CDQATAA ATATGCAGAG 120 

CG6AGGCTTC GCX3CAGCAGA GCCCGCGCOC CGCCCGCTCC QGGTGCTGAA TCCAGGCGTQ 180 

GGGACACGAG CCAGOOGCOO COGCOGGAGC CAGOGQAGCC GGGGCCAGAG CCGGAGC300Q 240 

TCGGOQTCCA CGCAGC06CC GGCOGGCCAO CACCCAGG6C CCTOCATGCC AGGTCGTTGQ 300 

AGGTOOCAQC GAOACATGCA CCOGGCCOGG AAOCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AOTGCTGOCC ACCAACCCTG AQQAQA6CTQ GCAGGTGTAC 420 

AGCTCTQCCC AGGACAGCGA GQGCAOQTGT ATCTGCACAG TQGTCGCCCC ACAGCAGACX: 480 

ATGT6TTCAC GGGATGCCCG CACAAAACA6 CTGAGGCAGC TACTGGAQAA OGTGCAGAAC 540 

ATGTCTCAAT OCATAGAGGT CTTGGACAOG CX3GACCCA6A GAGACTTGCA GTACGT06AG 600 

AAOATGOAGA ACCAAAT6AA AGGACTGGA6 TCCAA6TTCA AACAGGTGGA GGAGAGTCAT 660 

AAGCAACACC TGGCCAGGCA 6TTTAAGGGG ATAAAAQ06A AAATGGATQA ACTTAGGCCT 720 

TTGATACCTG TaTTGGAAOA GTACAAGGCC GATGCCAAAT TGGTATTGCA GTTTAAAQAQ 780 

QAGOTCCAQA ATCTGACGTC AGTGCTTAAC GA0CTGCAA6 AQGAAATTGG CXSCCTATGAC 840 

TACX3ATGAAC TTGAOAGCAO AGTGTCCKAT CTTGAAOAAA GGCTC06T0C ATGCAT6CAA 900 

AAACTA8CTT GOGOGAAGTT GAOGGGCATC AGTQACCCOQ T8AC7GTGAA OACCTCOGGC 960 

TOGAG0TTCX3 QATCCTGGAT GACAGACCCT CTCXSCCCCTO AAGGCGATAA C CGGG TGTGG 1020 

TACATGGACG GCTATCACAA CAACOGCTTC GTACGTGAGT ACAAGTCCAT GGTTGACTTC 1080 

ATGAACACGG ACAATTTCAC CTCCCACOGT CTCCCCCACC CCTOaTOGGG CAQGGG GCAG 1140 

GTGGTCTACA AO GOT TCTAT CTACTTCftAC AAGTTOCAGA 6CGACATCAT CATCAGGTTT 1200 

GACCTQAAGA CAOAGACCAT CCTCAAGAGC C6GAGCCTGG ACTATGCOGG TTACAACAAC 1260 

ATGTACCACT ACGCCT6GGG TOOCCACTOO GACATCQACC TCATGGTGGA CGAGAGOSGG 1320 

CTOTQGOCOG TOTACX3CC3VC CAACCAGAAC GCTGGCAACA TOOTGGTCAG TAGGCTGGAC 1380 

CGOGTGTCCC TGCAGACCCT GCAGACCTGG AACACOAQCT ACCCCAAGCG CAGCX3CCGGG 1440 

GAGGOCnCA TGATCXGCQG GAOGCTGTAC GTCAGCAAOQ OCTACTGAOQ GG6TACCAA0 1500 

GTCCACTATG CATACCAGAC CRATGCCTCC ACCTATGAAT ACATC3QACAT COCATTCCAQ 1560 

AAO^TACT CCCACATCTC CATGCTGGAC TACAACCCX3V AGGACCX3GGC CCTGTATQCX: 1620 

TGGAACAACG GCCACCAGAT CCTCTACAAC GTGACCCTCT TCCAOGTCAT CCX3CTCCGAC 16B0 

GAGTTGTAGC TCCCTCCTCC TGOAAGCOVA GGGCCCAC3GT CCTCACCACA AAGGQACTCC 1740 

TOTQAAACTQ CT6CCAAAAA GATACCAATA ACACTAACAA TAOOaAT CTT QAAAAATC KT 1800 

CAGCAGTGCSa GATTCTGAC3V TCGAGGGATO GCATTACCTC C6TGTTTCTC CCTTTOGAGC 1860 

OGGCQGOCCA CAGACGTCGQ AAQAAACTCC CGTATTTGCA GCTQGAACTO CAGCCCAOGG 1920 

OGOCXrGGTT TTCCTCCCCG CCCTGTCOCT CTCTGGTCAA ACAACATACT AAAGAGGOGA 1980 

GGGAATGACT GTTGGCCAGT TCTCACOGGG GAAAAACCCA CTGTTA66AT GGCATQAACA 2040 

TTTOCTTAGA TCX^TGGTCAO CTCOGAGQAA TGTQOC36TCC AOGCTCTTTO AG AGCC ATGG 2100 

GCXOCACCOG GCOGTAOGCT AGTGTAACTC GCATCCCATT GCAGTGOGOT TTCTTOACTG 2160 

TGTTGCTCTC TCTTAQATTA ACCGTGCTGA GGCTCCACAT AGCTCCTGGA CCTGTGTCTA 2220 

GTACATACTG AAGCOATGOT CAGAGTGTGT AGAGTGAAGT TGCTGTGCCC ACATTGTrTG 2280 

AACTOGCGTA CCCOGTAGAT ACATTGTGCA ACOTTCTTCT GTTATTCCCT TGAGQ l^YA 2340 

ACTTOOTATO TTCAGTTTAT GGQAXOATTa TTGTAAATGC A ATQOO GTAG TTTGGATTAA 2400 

TAAGTQQATQ O ' lTrrX ' GTT T CTAAAAAGAA AAAAAAAATC AOTOTTCACC CTTATAOAGA 2460 

CATAGTCAAG TTCATGTTGA TAATAATCAA AGGAATTACT CTCTTCTTGT TAAATTAGCT 2520 

AAATCATGTA ACOQCAGATA OQAAGGGCTC ACCTGGGGAA ACTC TGGTTT CCGATGGGAC 2580 

A6GAAAGTCA TACGGGCAAC A6TATG0GGA AAOTAOGTrP TTTTAAOTAA AAAACAAAGG 2640 

CAAACTTTOT ACTATCCAGT TATCTAAOQA ACAATAAAAA CATTAOQAOA AAAAAAAAAA 2700 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: B36 Protein sequence 
Protein Accession #i NP_OS5094.1 

1 11 21 31 41 51 

I I I I I I 

MPORWRWQRD MHPARKLLSL LPLIU^GTBL TQVLPTNPBB SWQVYSSAQD SBGRCICTW 60 

APQQTMCSRD ARTKQLRQbL BKVQNMSQSI EVUJRRTQRD LQYVEKMENQ MKGLESXPKQ 120 

VBBSHRQHXA RQPKAIKAXM DEliRPLIPVL BBYKADAKLV LQFKEEVQNXi TSVLHBLQBB 180 

XGAYDYDELQ 8RV8NLBBRL RACKQKLAOG KLTGXSOPVT VXTS0SRPQ8 WWD PLAP EQ 240 

DNRVWYMDOY HNNRFVRBYK SMVDPMNTDH PT8HRLPRPW SGIGQWYNO SXYPNKPQSH 300 

XZXRFDUCTB TILKTRSUDY AOYNNMYHYA IftlQHSDIDUl VDESGIMAVY ATNQNAGNIV 360 
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VSRU)PVSLQ TLQIWITSyP KRSASEAPII COTLYVTMOY SOOTKWHYAY QTOASTYKfl 420 
DIPFQtncySH ISMLDYNPKD RALYAHNNGH QILYNVTLPH VZRSDBXi 

Seq ID HOt B37 DNA sequence 

Nucleic Acid Accession fit NM_006334.l 

Coding sequencei 286.. 693 

1 11 21 31 41 51 

) I I I I i 

G0Q0GC3GOQA 6CCATTAG6A GGOGAGGAGA GAGGAGGG08 CA0CTCCCX3C CCAGCCCAGC 60 

CCT6CCCA6C CCTGOCCO Q A GGCAGACGOQ COGGAAOCQG 6AC60GATAA ATATGCAGAG 120 

OQOAGGCTTC OOGCAOCAOA OCCOSOGOGC COCCGGCTOC GGSTGCTCAA TCCAGG06T0 180 

GGGACACGAG CCAGGCGCGG C0GCO6GAGC CAGCGGAGCC GGGGCCAGAG CCGC3AGCG0G 240 

TCCGCGTCCA CGCAGCCGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGGTOGTTGG 300 

A6QTGGCAGC GAGACATGCA CCOGGCCCGQ AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGG6CACTG AACTCACTCA ASTGCTGCCC AOCAAOCCTQ AOQAGAGCTG GCAGGTGTAC 420 

AGCTCTGCCC AGGACAGGGA GG6CAGGTGT ATCTGCACAQ TGOTOGOCCC ACAQCAGACC 480 

ATOTGTTCAC GGGATGCCCX5 CACAAAACAO CTGAGQCAGC TACTGGAGAA GGTGCAGAAC 540 

ATCrrCTCAAT CCATAGAGGT CTTGGACAGG CGGACCCAGA GAGACTTGCA GTAOGTGGAG 600 

AAGATGQAGA ACCAAATGAA AGQACTGOAO TCCAAGTTCA AACAGGT6GA GGAGAGTCAT 660 

AAGCAACAOC TGGCCAOOCA OTTTAAOGGC TAACTTAAAA GAGTTTTTTC A ATGCTOC RO 720 

TGACTGAAOA AGCA8TCCAC TCCCATOTAA CCATGAAAGA GAGCCAOAOA QCTTTTTGCA 780 

CCATGCATTT TTACTATTAT TTTCCAATAC TTAGCACCAT TTCACTAAGG AACCTTGAAT 840 

ACAACCAGGA TCCTCCTTTG CATGC3GACTG TAQCTGCATT TCATGAATAQ TTTGAACCCT 900 

T6TCAATGCA TTTTTTGAAA AA6AAAQAAA AAAAAAACTT CX3TGTATGTG ACTCAAAGCA 960 
TOTAACCTTA AC5ATGTTGCA TTCTAAACTO ACAATAAAGA CCTTTCCCC 

Seq ID NO: B3B Protein sequence 
Protein Accession Ss NP_006325.l 

1 11 21 31 41 51 

I I I I t t 

MPGRWRWQRD MHPARKLL8L LFLIIMOTEL TQVLBmVEB SWQVYSSAQD 8EGRCICTW 60 
APQQTMCSHO ARTKQLRQIiL EKVQNMSQSI EVU)RRTQRD LQYVBKMQIQ MXGLESKFXQ 120 
VEBSHKQHLA RQFXO 

Seq ZD NO: B39 DHA sequence 

Nucleic Add Accession «: NM_05ai99.1 

Coding sequence > 286 . . 795 

1 11 21 31 41 SI 

I 1 I I I 1 

GCGCGGGGGA GCCATTAGGA GGCGAGGAGA GAOaAGGGCG CAGCTCCOGC CCAGCCCAGC 60 

CCTGCCCAGC CCTGCCCGGA GGCAGAOGCG COGGAACCGG GACGCGATAA ATATGCAGAG 120 

OGGAGQCTTC G06CAGCAGA GCCCGCGOGC GGCCCGCTCC GGGTGCTGAA TCCAGGCGTG 180 

GG6ACA0QAG CCAGOCGCCG CCOCCGGAOC CAGOQGAGCC GGGGCCAGAG COQGAGCGCQ 240 

TGOGOQTCCA CGCAOCOGCC GGCCGGCCAG CACCCAGGGC CCTGCATGCC AGQTCGTTGQ 300 

AGGTGGCAGC GAGACATGCA CCCGGCCOGG AAGCTCCTCA GCCTCCTCTT CCTCATCCTG 360 

ATGGGCACTG AACTCACTCA AAATAAAAGA GAAAACAAAG CAOAGAAGAT GGGAGGGCCA 420 

GAGAG06AGA GGAAGACCAC AGGAGAGAAG ACACTGAAOO AGCTTCCCTT GTTTTGCCTG 480 

GAAGOOCAGO CTGGCTCCCT GGCTCTOCCC AGGAT6T6CA GTOCAAATOC CAATCCAOCA 540 

QTGGGGTTAT GTCSTCCOGC TTACCCTCAO AOCCCTTCTC CTQGTG CTGC CCAGAOGATC 600 

AGCCAGTOOC TCCTQGASAG GTTCTOCATQ GCCTCTAGGA GAGAAGTTTT CTTGGCCCCA 660 

GGAAGQCCTG GTGGAGGGTG GTGGTTGTGC ACTGTTGCTG GACAGATGCA TTCATTCATG 720 

TGCACACACA CACACACACA TGCACACACA GGGGAGCAGA TAC CTOCAGA GAAGAGCCAA 780 

CGAaGTOCro ATTAGTGGCA AGCT6CCGCA CAAAGGGCTA T6CCT0IQTC TrATTQAOAC 840 

AOCTTGGCAA AQAOATGCCT GATTCrGGOT GGTCCTOQAC ATGGC08CAC CCAAGGGCCC 900 

TOCAA6CCTT AATOGCACCC TQAAGCCTCC ATGCCCAGGC CAAAAOAIGC TTTTCCTCOC 960 
TAAAAAAAAA AAAAAAAAAA 

Seq ID NOt B40 Protein sequence 
Protein Accession «i NP_47ei06.1 

1 II 21 31 41 51 

11)11! 

MPGRHRHQRD MKPARKLLSL LFLXXMQTEL TQHKRBNKAB KMGGPESERK TTGBRTUIBb 60 
PLFCLEAHAO SLALPRHCSP NPNPAVGLCR PAYPQSPSPG AAQTI SQSL L ERFCMA8RRB 120 
VFLAPGRPGG GWHZ«CTVAGQ MHSFMCTBTH THAHTGBQIP AEKSQPGPD 

Seq ID NOt B41 DMA sequence 

Nucleic Acid Accession ftt AY038071.1 

Coding sequence: 1..1686 

1 11 21 31 41 51 

I I I I 1 I 

AT6AGCAATC AGTACCAGQA GGAGGOCTGC TCOGAOAGGC CC6AGTGCAA AAGTAAATCT 60 

CCAACTTTOC TCTCCTOCTA CTGCATOGAC AGCATCCTGG GCOGGAGGAG CCOQTOCAAA 120 

ATGOGGTTGC TGGQAGCOGC GCAGAGCTTG CCTGCTCOGC TGACCAGCOG OGCCGACCOQ 180 

GAAAAGOCCG TGCAAGGCTC CCCTAAGAGC AGCAGOGCCC CGTTOQAGGC OGAGCTGCAC 240 

C roC OGCCCA AOCTGGOGOO CCTGTACGGC COGGOCGGGG GCCGCCTCCT TCAGGGTGCG 300 

QCAGOGQOQO GOQCGOOGGC GGOG G C6 Q0Q GGGGCAQCGG C060CA00GC CAOGQCSOaGT 360 

CCACQCGGGG AOQCCCCTCC OCCGOCACCO CCAACOQOGC GOCOOGGGQA ACGGCCGQAC 420 

GGCQGAGGGG COOCCGOQOC AOCCGCGGCC GOGGCOGOCG CGQCCTOGGA CACGCTCAAG 480 

ATCAGOCAGO C600SCA0GT GA6CATCA0C CGCAOCAAOT OGTACCOGGA GAAOQGGGCG 540 
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PCTAJS02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CGCTTGGTGC 
COGQAGGAGC 
ACGGGCACOG 
GAGGAACTGC 
AAGQAOCCCC 
GCAGTGGGOV 
GACGCTQAGa 
6AGGGGCTGC 
GAGGAACTGG 
CTGGCCATGA 
GCCAA0TG6C 
C06GGGC0GC 
C06CACCACC 
TTOCOGAGCC 
GGGCTGA6CA 
TT006CAGGC 
CTQAGACAGC 
GCCAOGGOGG 
QAGCACXXGO 
GTGTGC 



GG006CC6GC 
GCCTOGGCGT 
AGGA06ACGA 
TGGAGGACGA 
GGGGCT6TCC 
CAGAGGG0G6 
GCAAGGACX3G 
TGAAAGGCAA 
AGOGGQCCTT 
GGCTGGACTT 
GCAAGOGGGA 
TCTCOGCCAC 
CGGC6CT0GA 
TACCTCCGCC 
CTTTCCTCGG 
TCTTTTCCAC 
0CACACC06C 
GOQCAGACAG 
OQCAOCTCAC 



GQCOQ G CGC C 
GGAGQAGCTO 
CQAGQAGGAa 
TGTGGCCGCC 
GGAGCTGTCA 
OQA6GACAGC 
ACAGAGGOGC 
CCAGAAQACX3 
GACC0AC3GCC 
QAAGGCAOOC 
CCACCOGCTC 
CTCCGCTTGG 
TCOGGGCTOQ 
AGCX36CAGTG 
AATQQCCOCC 
OGTGGAGGGC 
AGGOGCCTCT 
GCAGCTCAAC 



QAGCTGQOOQ 
OC900GCA600 
CTOGAGGACG 
CTGCTGGAGG 
ACTGGG6CC0 
COCAAQQAGG 
GTQT QC CTCT 
TACOGCACCA 
CACTACCCGG 
06AGTCCAGG 
GOOCAGACOC 
AGCCCCTACC 
ACT6C0GCT0 
GCCA6CCTGC 
TTCOQACACC 
CTGACCAGOO 
GGAGTGGCAT 
AGCATAGCCG 
ATCCT6C0GQ 



G OCOBG GGGG 
GCCOGGCTGC 
AAGAAGAT6A 
AOQAOQCCOG 
TG6CCX3CAGC 
AOC TGCT GCT 
CTGOGOGCAG 
OGTTCACCAO 
ACGTCTTCAC 
TCTGGTTCCA 
ACCOCCCTGG 
TGGACXSCC3Va 
CCGC06C0GC 
OGCCCAGCGG 
CAGCTTTCAT 
COTCGACCGC 
GQGGC6C0CT 
GQCT6AGGCT 
GCftCCAGCM; 



OGTCAGCXAC 
GGQTGOTGGC 
GGACGAGQAA 
CGCGCTGCTC 
AGCTGCOOCT 
GCACOOGGAA 
06ACTGG6AG 
CTACCAGCTG 
CAGGGAGGAA 
6AACG0T06G 
GCTGOCCTTC 
CCCCTTCOCT 
OGCOGOOGCC 
GGC6C06CT0 
CAGCCOQGCA 
G6CCGCGCTC 
OGCCGAOCCG 
CAAGGCaUU3 
OGQCAAGGAO 



600 
660 
720 
780 
B«0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



Seq ID NO: B42 Protein sequence 
Protein Accession tt> AAK93901.1 



MSKQYQBBGC 
EKAVQGSPKS 
PRGEAPPPPP 
PFVPPPPALD 



LAMRLDLTEA 
PHHPALDSAW 
PGRUFSTMAP 
BKAAQltTQLN 



11 
I 



SSAPPBAEUI 
PTARPGERPD 
BLQ0PG6VTH 
LLBDDAItALL 
VdiSAGSDSB 
RVQVWFQNRR 
TAAAAAAAAA 
LTSASTAAAL 
ILPGTSTGKE 



21 
I 

PTLLSSYCID 
LPPKLRRLYG 
GAGAAAAAAA 
PEERLGVAGG 
KBPRRCPVAA 
EGIiLKRKQRR 
AKWRKREKAG 
PPSLPPPPGS 
LRQPTPAVEG 
VC 



31 
I 

SILGRRSPCK 
PGQGRXiLQGA 
AAAAAHDTLK 
PGSAPAAGGG 
TQAVAAAAAA 
YRTTPTSYQL 
AQTHPPQLPF 
ASLPPSGAPL 
AVASGALADP 



41 
I 

MRliMAAQSL 
AAAAAAAAAA 
ISQAPQVSIS 
TGTBODBBBL 
AVATEGGBLS 
EELERAPQKT 
PGPLSATHPL 
GLSTFLGAAV 
ATAAADRRAS 



51 
I 

PAPLTSRADP 
AAAAATATAG 
RSKSYRENGA 



Seq ID MO I B43 DNA sequence 

Nucleic Acid Accession «i CAT cluster 



1 

I 

cGCxrrGCCCC 

TTCCGCOGOT 
CTQCTCTCCT 
GCTOGTTTCT 
GTCTGTGTTT 
CCTGTTTTGQ 
TTGGTGCTTT 
ATCTTCTGTT 
OCTGAGOTTG 
CAGTCTCCAT 
AATGTGGAGA 
TQAGCATTGA 
CTTTCTTGGA 
TCTGGACATG 
TGQATTTTAA 
AGCATAGCCC 



11 
I 

CCCGTTGCTT 
TCCGTGCGCC 
CCX3CTCTCAT 
CTTQTTCCTC 
GTQGGTOSTC 
TCCCCCTGGT 
OCTCCTGCTC 
TTCTCTTTTT 
GGGAGAOATA 
GCCTGCTCAO 
CCTOTQAGCC 
AQAGAATAT6 
TQTTATTCCA 
GAGGCTGGAA 
GGCTGGAAAT 
TTOCTGATGQ 



21 

I 

CTCTCTCTTC 
CTCTCTGGGC 
GCCCCTTTCA 
TCCTTCCCQG 
CTCTGCGTGC 
TCTCGCGCGG 
TCTGTCTGGT 
Cl ' mTITLTi ' 
A08CTGTAAA 
CCAAGAAGQA 
TUTUTCOQGC 
TGGGAGAACA 
TGATAGATAG 
GAGTCAGGOT 
CTGAGGGTCA 
CAA 



31 
I 

CCTTCTGCCT 
TTOGGCTTCA 
GCTCCTACrC 
TTOTTOrrCT 
TGCCTCTTTC 
TTCTOTGTGC 
GTTCTCCGCC 
CTTOTTTTTT 
CTTTTATTTT 
GCTCACTGTO 
CCTGAACTCT 
AAACA6AAAC 
TAGGGGCAGG 
GTGATTCGGA 
QTQGTCCAAG 



TGGTGTOGTC 
GCTTCTTOGT 
TCTTGOGTTT 
GCGGGCCGCC 
GTCTGTGTCT 
TTTGTCTTTQ 
TCA66AAATC 
GGCACCAQAG 
CAAGCACAGG 
TGAAAGAATA 
AGTGAGAGAG 
GAOGGGCATO 
TCACTCAGAG 



PKRRIiTtTiHPB 
HYPDVPTREB 
SPYLDASPPP 
FRHPAPISPA 
SIAALRLKAK 



51 
I 

CCGTCTTTGG 
GGCTGTOGCT 
CCCCTOCCTC 
GTTCCCTCTC 
C0GGG06CTT 
CCOTTCTTTC 
TTCTCCTTTT 
TTTTCTCTTA 
TGGAAACCTA 
ACAGGGACCC 
GCAGGCTTCC 
TGCAAGGTGT 
GCTGACTAGG 
AOAAGQAAGQ 
ACAGAATCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NOt B44 DNA sequence 

Nucleic Acid Accession i^t NM_014421.1 

Coding sequences 716.. 1497 



GCA060GTCC 
GCGCTGOCAG 
AGTGOGOGTG 
TCCTGATCCC 
TTGACACATA 
GCTGATGCGG 
CCAAGGAGGT 
TGCACCTTGG 
TTGCAQAACT 
CCTGAGGCGT 
AACACAATCT 
CTTTGCATTC 
GCOGGGTTGA 
ATGGTGGAGA 
CTGGGCX3GGG 
GCATTGGGOQ 
AAOOAGTGTG 
GTGTQTGGGA 
TGCAATAATG 



11 
I 

GGOGGGAGCC 
0QATT06CTG 
TGGGGGACCT 
CTACTAAAGC 
AA6GATAGGA 
GCCTCCTQAT 
GCGGGG CAAG 
GTCAOGCCTC 
TCCTGTCCTG 
GAGACTGAGT 
GCAAGCCCCC 
CXACCCCTCC 
TGOGGAGCAA 
GCTCACAGAT 
AGACGCCTGG 
GCAGTAAGAA 
AAGTTGGGAG 
OAAAAAAOAA 
QCATCTGTAT 



21 
I 

O60GG0GA6C 
TCTCTTGT6A 
CXSATTCTCTT 
GCCTGC7AAC 
AAAQAQAGGA 
CAATTAAQAO 
AAGGAAOGGA 
CTTGGGGAGA 
GTGGAOAAGC 
TCATAGGGTC 
GOQACOCAAO 
GGGCTTTGOG 
GGATTOGTCC 
0GGCAGTT06 
TCAGGCCGCX: 
GGGCAAAAAC 
GTATTGOCAC 
GOOCTGCCAC 
CCCAGTTACT 



31 

I 

GTAGOGCAAO 
GTCA6GG6AC 
CAGATCTOQA 
TTTQAAAAGO 
GA6AAAAGCA 
GAGAOTTAAA 
AGOGGTGCGA 
AAGOQCCrOG 
GGGTCTGGCT 
CTGOGTCGCC 
TGAOOGGCOC 
TCTTCCTGGG 
TGCTGCCTOC 
CGGGCCAAAC 
AAT06ATCTG 
CTGGGQCAGG 
AGTCCCCACC 
GOAGATGGCA 
OAAA8CATCT 



41 

I 

TOOGCTCCCT 
AA06CTT0GG 
GGATTOGGTC 
AOCACTGTGT 
ACTQASTTGA 
COOOOQAOAT 
TCCACAGG6C 
CATTTQATTG 
TGGGTTCCGC 
OAACCAGGAA 
COTGTTGGGG 
GACCCCCTGG 
TCCTACTGGC 
TCAACTCCAT 
CX3GGCATGTA 
CCTACGCTTO 
AAOQATCATC 




TAAOCOCTCA 



CTTCCAGTTA 
TAATTTCTGT 
GGOTTQAGQa 
TOCTCCCTCC 
COOGGAGATQ 
OGOGGTGCTG 
CAAGTCCTCT 
CCAAGGACTG 
TAfiCAGlGAT 
GGCCTOCATG 
CAGTAGCGGC 
CATC0036CT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



813 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CTQGATGGTA CTCJGCCACAO AGATCGAAAC CAOSGTCATT ACTCAAACCA TGACTTGGGA 1200 
TOGCAGAATC TAGGAAOACC ACACACTAAG ATGTCACATA TAAAAGOGCA TGAAGGAGAC 1360 
CCCTGCCTAC GATCATCAQA CTOCATTQAA GGGTTTTGCT GTGCTCGTCA TTTCTCC3ACC 1320 
AAAATCTGCA AACCAGTGCT OCATCAGGGG QAAGTCTOTA CCAAACAAOG CAAGAAGGGT 1380 
TCTCATGGGC TGOAAATTTT CCAOOOTTQC GACTGT6CSA A£5G6CCTGTC T TGCAA AGTA 1440 
TGOAAAOATO CCACCTACTC CTCCAAAGCC AGACTCCATO TGTGTCAQAA AATTTGATCA ISOO 
CCATT6AGGA ACATCATGAA TTGCAGACTG TGAAOirGTO TATTTAATGC ATTATAGCAT 1560 
GGTGGAAAAT AAGQTTCAGA T6CAGAAGAA TGGCTAAAAT AAGAAAOGTO ATAAGAATAT 1620 
AGATGATCAC AAAAAGGQAG AAAGAAAACA TGAACIOAAT AOATTAGAAT GGGTGACAAA 1680 
TGCAGTGCAG CCAGTGTTTC CATTATGOKA CTTGTCTATG TAAATAATGT ACACATTTGT 1740 
GOAAAATGCT ATTATTAAGA GAACAA6CAC ACAGTGGAAA TTACTQATGA 0TAGCAT6TG 1800 
ACTTTCCAAG AGTTTAGGTT GT6CTGGA0G AGAGGTTTCC TTCAGATTGC TGATTGCTTA 1860 
TACAAATAAC CTACATGCXSV GATTTCTATT CAAOGTTAGA GTTTAACAAA ATACTCCTAG 1920 
AATAACTTGT TATACAATAG GTTCTAAAAA TAAAATTGCT AAACAAQAAA TGAAA ACATG 1980 
GAQCATTGTT AATTTACAAC AGAAAATTAC CTTTTGATTT GTAACACTAC TTCTGCTGTT 2040 
CAATCAAGA6 TCTTGOTAGA TAAGAAAAAA ATCAOTCAAT ATTTCCAAAT AATTGCAAAA 2100 
TAATGGCCAG TTQTTTAGGA AGGCCTTTAG GAAGACAAAT AAATAACAAA CAAACAGCCA 2160 
CAAATACTTT TTTTTCAAAA TTTTAGTTTT ACCTGTAATT AATAAOAACT GATACAAGAC 2220 
AAAAACAGTT CCTTCAGATT CTAOGQAATQ ACAGTATATC TCTCTTTATC CTATGTGATT 2280 
CCTGCTCTOA ATGCATTATA TTTTCX3UIAG TATACCCATA AATTGTGACT AGTAAAATAC 2340 
TTACACAOAO CA6AATTTTC ACAGATGGCA AAAAAATTTA AAQAT6TCCA ATA TATGTG G 2400 
GAAAAQAOCT AACAQAOAGA TCATTATTTC TTAAAGATTO GCCATAACCT GTATTTTGAT 2460 
AGAATTAGAT TGGTAAATAC ATGTATTCAT ACATACTCT6 TOOTAATAGA GACTTGAGCT 2520 
GGATCTGTAC TGCACTGOAG TAAGCAAGAA AATTGGGAAA ACTTTTTCGT TTGTTCAGGT 2580 
TTTGGCAACA CATAGATCAT ATGTCTGAGO CACAAGTTGG CTGTTCATCT TTGAAACCAG 2640 
GGQATGCACA GTCTAAATGA ATATCTGCAT GGGATTTGCT ATCATAATOT TTCCTATOCM 2700 
GNTGAATTCN GTGTGAGGTC CTGTGTCCGT CCTATCCTCa AATTATTTAT TTTATAGTX^C 2760 
TCAGATCCTC AAATAATCTC AATTTCXSGAG GTTTCACAAA ATGGACTCCT GAAGTAGACA 2820 
GAGTAQTOAQ QTTTCATTGC CCTCTATAAG CTTCTGACTA GCCAATGGCA TCATCCAATT 2880 
TTCTTCCCAA ACCTCTGCAG CATCTQCTTT ATTGCCAAAO GGCTAGTTTC GGTTTTCTGC 2940 
CAGCCATT6C GGTTAAAAAA TATAAGTAGG ATAACTTGTA AAACCTGCA,T ATT6CTAATC 3000 
TATAGACACC ACAGTTTCTA AATTCTTTGA AACCACTTTA CTACTTTTTT TAAACTTAAC 3060 
TCAGTTCTAA ATACTTTGTC T6GAGCACAA AACAATAAAA GGTTATCTTA TAGTTGTGAC 3120 
TTTAAACTTT TGTAGACCAC AATTCACTTT TTAGTTTTCT TTTACTTAAA TCCCATCTGC 3180 
AGTCTCAAAT TTAAGTTCTC CX:AaTAGAGA TTGAGTTTGA GCCTGTATAT CTATTAAAAA 3240 
TTTCAACTTC CCACATATAT TTACTAAGAT GATTAAGACT TACATTTTCT GCACAGQTCT 3300 
GCAAAAACAA AAATTATAAA CTAOTCCATC CAAGAACCAA AOTTTOTATA AACAGGTTGC 3360 
TATAAGCTTQ GTGAAATGAA AATGGAACAT TTCAATCAAA CATTTCCTAT ATAACAATTA 3420 
TTATATTTAC AATTTGGTTT CTGCAATATT TTTCTTATQT CCACCCTTTT AAAAATTATT 3480 
ATTTGAAGTA ATTTATTTAC AGGAAATGTT AATGAGATGT ATTTTCTTAT AGAGATATTT 3540 
CTTACAGAAA GCTTTGTAGC AGAATATATT TOCAGCTATT OACTTTGTAA TTTAGGAAAA 3600 
ATGTATAATA AGATAAAATC TATTAAATTT TTCTCCTCTA AAAACXGAAA AAAAAAAAAA 3660 
AAAAAAAAAA A 

Seq ID NOt B45 Protein sequence 
Protein Accession It KP_0S5236.1 

1 11 21 31 41 SI 

I I I 1 1 1 

MAAUIRSiCDS SCCUjLLAAV LKVESSQIGS SRAKLNSIKS SLGGBTPGQA AMRSAGtlYOG 
LAFGGSKK6K NLGQAYPCSS DKECEVQRYC HSPBQGSSAC MVCRKKKXRC HSDGMCCPST 
RCMNGICIPV TESILTPHIP ALDGTRBRDR NHGHY8NHDL GHQNLGRPHT KMSHIKGHBG 
DPCLR8SDCI B6FCCARBFH TKICKPVLHQ QBVCTKQRKK GSBGLEIFQR CDCAKGL8CK 
VWKDATYSSK ARLHVOQRI 

Seq ID NO: 46 DNA sequence 
Nucleic Acid Accessloo #> NM_019885.1 
Coding sequence t 1..1S39 



60 
120 
180 
240 



1 
I 

ATGCTCTTTG 
TCCGTGAOOC 
OGOGACAAGA 
GAGACCGGCC 
GGCAACGTGT 
AACGTGOGCA 
ACC06CATGT 
AAGCGCAAGQ 
ATCX!AGCTGG 
GTGTACCAGG 
AGCATOCCTG 
GTCITCTCCC 
CAGATCCTGC 
AAGGACTACT 
ATQACCATGC 
A066CCAG06 
AAGCTGOGGG 
GGCACACT6C 
GTCAT60QCC 
CTTQATOGTT 
GACACAGOGC 
GGGAGGGAGG 
TGCCTGGQCA 



11 
I 

AGGGCTTGGA 
TGCTGCTGGC 
GCTGCAAGCT 
ACTGGCTGCT 
TCAAGACGCA 
AGATCCTCAT 
TGCTGGGCCC 
TCTTCTCCAA 
TGATCCAGGA 
AGGOQCAGAA 
AGGAGGACCT 
TGOCTGTOGA 
AGAAGGGGCT 
TGGACGCXXT 
AGGAGCTGAA 
CCAGCACCTC 
ATGAGCTGCO 
GCCTGGACAC 
TGTTCACGCC 
TCCAQATCCC 
CCGTGTTCAA 
ACAAGGATGO 
AGCACCTG6C 



21 

i 

TCTGOrGTOQ 
.CGTGTCGCAG 
GCCCATCCCC 
GCAGGGTTCT 
TTTGTTGGGG 
GGGCX3AGCAC 
CAACA0GGT6 
GATCTTCAGC 
CACACTGCGC 
GCTGACCTTC 
TGGGCACCTC 
CCTGCXOTTC 
GGAGAAGGCC 
GQACCTCCTC 
GGACGGGACC 
ACTCATCATG 
GGCrCATGGC 
GCTCAGTGGQ 
CATTTCCGGC 
CAAAGGCTGG 
AGACGTGAAC 
COOCTTOCAT 
CAAGCTGTTC 



31 
I 

GOOCTGGCCA 
CAGCTGTGGC 
AAGGQATCCA 
G6CTTCCAGT 
OGGCCGCTGA 
CACCTCGTGA 
TCCAATTCCA 
CAOQAGGCCC 
QCCIGGAGCA 
OQCATGGCCA 
TTTGAGGTCT 
AGTGGCTACC 
ATC06G6AGA 
ATTGAGAGGA 
CTGGAGCTGA 
CAGCTGCTGA 
ATCCTGCACA 
CT6CGCTACC 
GGCTACCGCA 
AGTGTCATGT 
GTGTTCGACC 
TAGCTCCOQT 
CTGAAGGTGC 



41 

I 

CCCTOGCX3C3C 
AGCTGOGCTG 
TGGGCTTCCC 
0GT0GGGGA6 
TAOQC30T0AC 
GCACGGAGTG 
rrGGOQACAT 
TGGAGAGTTA 
GCCACCCOQA 
TG0GGGT6CT 
ACCAGCAGTT 
GGOQQGOCAT 
AGCTGCAGTG 
GCAAGGAQCA 
TCTTTX50G6C 
AGCACCCCAC 
GTGGCGOCTG 
7GQACTG06T 
CTGTGCTGCA 
ATAGCAIGOQ 
COGATOQCTT 
T06GTGGC330 
TO QO GOtGGA 



f 

GTGCCTGGTG 
GGCCGCCACT 
GCTCATCGGA 
GGAGAAGTAT 
OGGOOGGOAG 
GCCT06CA6C 
CXJVCCGCAAC 
CCTGCCCAAG 
GGCCATCAAC 
GCT6G6CTTC 
TGTG6ACAAT 
TCAGGCTOGG 
CACACAGGGC 
CGGGAAGQAG 
CTATGCXAOC 
TGTGCTGQAO 
CCCCTGCGAG 
CATCAAGGAG 
QACCTT06AG 
6GACACCCAT 
GAGGCAGGGQ 
TGTCXX3GACC 
6CTGGCTAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



814 
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ACCAiSCOGCT TTGAGCTGGC TACACGGACC TTCCCCCGCA TCACCTTGGT CCCOSTCCTQ 1440 

CACCCCGTGQ ATOQCCTCAG CX3TCAA0XTC TTTG6CCTGG ACTCCAACCA GAA0C2AQATC 1500 

CTGCOlQAaA CJGQAGGCCAT GCTGAGGGCC ACAGTCTAAC CCAAGACCCA CXraCCTCAO 1560 

CCCAGCCCAG GCA60GGGGT 60TGGTTGTQ OSAGOTAGAA ACCTGTGTGT GGGAGGGG6C 1620 

5 CGOAAOOOGa AGC5GC0AGTG GCSXTCATAC TTGCCCTCCC TTGCTCCCCC TCCCTGGCAA 1680 

ACCCTACCCA AAGCCAGTGG GCCCCATTCC TAGGGCTGGG CiVCCCnCT G6CTCCAGCT 1740 

TCCCTCCAGC CACTCCCCAT TTACCATCAO CTCAGCCCCT GGGAAGGGOG TGGC AGGOQ C 1800 

TCTGCATGCC CGTGACAGTG TTAGGTGTCA GC3GCGTGCTA CAOTGTTTTT QTOATGTTCT 1860 

GAACTGCTCC CTTCCCTCCG TTCCTTTCGG ACCCTTTTAG CIGGGGTTGG GGGACSGGAA 1920 

10 QAOCOGTGCC CCCTTGGGOO CACTCTTCAG CGTCTCCTCC TCCTGCQCCC CCACTQOGTC 1980 

TOCOCAGGAA CAGCATCCTG GGTAOa«»A CAGGAOTCAA CCTTGGCGGQ GOGGGGGCTG 2040 

GGTGCAAGCT GQAGATTGCC CTTCCCTATG CCACGGTTCC CACCCTCCCT CACCAGTTTG 2100 

GACAATTTGA AATTACCTAT TGCTGCTACT TGTTCTGTCC TCTGACCTTQ GGGCAAAGGA 2160 

GCCCCAGGCC CTGTCTCCCC AQCATCCTCC CTGOTOGCCC TGGGCAGGTG CACTGACACC 2220 

15 CCCACCTTCC CATCCCCTGC TGAACCAGGC CCTGTTACAC ACAGCXGCCT AAG6COCXKX3 2280 

GCTCATGTGC TGCCGGCCCC CATATTTATT CACTGATAGA GAATCTTGGG QATGCTGGGG 2340 

TCTG6A0TQA ACATCTCCTC CCCTTCATGC CCTAGCCTGT GTTCTAGCTQ TCCTGGCX3AC 2400 

ACTTCTGTGA OTGAAGAGGA AGGGGTCTCT OGTCAAACCC AOOCCCCAGG GCCTAGGGTT 2460 

QAAAGCCTTC CCOGGCTCOQ GGCATTATTT GGGTTTAATC TOGGAGCCTC ACTCCTGGAC 2520 

20 TGAAGTCCGG TGCCTCTGCC TTATCCCTGG TGOAGATGGA ATGTGGCCCA TTGCCTCCTC 2580 

CCTCTCCTOT CAAAAACCCT GATCAGGTAG ATTrGGAGGC GGCCACGATT TCCTGTTTGG 2640 

CCCXTTOTTCA CCCCAGTGCA CTGGCCCTGA CTCCAGGCQT GAGTATGCGQ AAGGATACGG 2700 

GTTCTTCTGA CGGGGAGCAA GGGCCTCCGT CTTCCCTTCC TTAACTCTCC CCCTTTGCCC 2760 

TCOQCCCTQA AAAAGGTQTC CTTGAAQTCC CTTCCACCTC TATGCCACPQ TCTGCTTAGC 2820 

25 CCAGCTCAGG GGTGGGGAAG AGGGQAAAGC OTGGOGGAGG TGAGCGCAGC GGCAGTTCTG 2680 

CCTOQQAGCT GATTOCAGGG CCCT8TGTGG TCTCCOGACA GCTGCX^AA GGCTGCCGCA 2940 

GCTGAAGCTG AAGAGGCG6C TACGT60GGT TTGTCAGGGO GATTGGGTTG AAAACTGGCC 3000 

AGTCGGGATG ACTGGGTQAA AGAGGAGTAG CTCCTGCCAC TGGCXSTTTTG AGTGTTGGCA 3060 

ATTTGGQATG CCTCCTGGGG AAGGTTTCOO GGOGTTTGGT GAGTCTCTAa ATTTTTCCTT 3120 

30 GCTTTCTGTG TTTATTGGTT TTTGATGTTG TAAAAGCAAT GAATCCCCTT TACAAGAAAA 3180 

TOGAAAACAC AGAAGAATGA AGGACATGCC AGTCCCCGAT OQCTOCTOTO AGCACCTCAO 3240 

TG6CTCCCTC AGACCAOATC CCJGTAGQCAG CCCCACAQAC OGACCCTGAC CCCACTCACA 3300 

GCCACCCTGA A6ATAGACTA TAGGAACGGG CCCATACCAC ACAGACTGCT CTCCAATCCC 3360 

TGAGTCTCAG ATGTTTCATT TATTTCCTAC TTTTCCACTA CTAAAAAACA GTGTGGAATA 3420 

35 GACATTATTG GCAAAATTGC TCATCCCTAA TCCTGAAAAA CAGGCCAGAA TGGGTAAAGA 3480 

CTTGTCAAAG CTTGCAACAT AGCTACATGG TGCACCOQGA CCTGTACCCC CTCCCCCCAA 3540 

CAC3UVAA0CA OTGTCTQGQA GGTTCATTTT CCTTTAAACT GATCCAGCTO GCCCTGAACC 3600 

AATTOTTTTT GACTGAGTAT CTAGGAGAGC AGTAAOTGQA ACTTCAGACA AGCCCACTGG 3660 

GTCTGGTC3CA GGTGAGGGGC AGGGGQCATG GGGCTGGGAG GTCTCAGGGG CCTTCCCTGG 3720 

40 GGQTGOGCAG CCTGGTAGGG GGCAGAGAAO GAAAAQCTGA GGGGGGTCCC TGTGAGGGAG 3780 

GAAAGAAGOA TCATTTGCCC CGCTGGGTCT CMMOCMTt GAGAAGAOAG CTGAAGAAA6 3840 

CTCTGOCTGO CTGACAOGAT CXXTiVlVrrQ TAATTGOTCC CTCCTTTCAO CTCTCTAGTO 3900 

AGATGCCCGT GTCTGTGCGT GTGCX5TGTGT GTTTCATACA 6CTAGCATTA GATGGGTGAT 3960 

GTTTCTTACT TATCATCCCT AACTATTGCA ACTTGACCTT AAAAAGACAA AACCCCACAA 4020 

45 AACTCTTCCT GCX3VCGGGCT TQCAGATTGA AGCACTTTOO ATGTTGGGCG CTGGCGTTTG 4080 

TGTTCTGGQC ACCACCGTGA CCCTOCCCAa ATOGCTATAA TATTATTTTA TACACAAAOC 4140 

TTTTTTTTCA TAAATGTTAT AATTTTGTGT CTOTCTTTAT AAACTATTAT A AGTACTA TT 4200 

TTTGTTATAA TTCAAAATAQ ATATTTAGTA TAAAGTTTTT GCTGTTAAAT ATTTGTTATT 4260 

TAGTAAAATA TGAATTTTGC TCTATTGTAA ACATGGTTCA AAATATTAAT ATGTTTTTAT 4320 

50 CACAGTC6TT TTAATATTGA AAAAGCACTT GTGTGTTTTO TTTTGATATO AAACT6GTAC 4380 

GGTOrOAGTO TTTTTGCTOT OSTOGTTTTA ATCTGTATAT AATATTCCAT 6TTGCATATT 4440 
AAAAA 

Seq ID NOt B47 Protein sequence 
55 Protein AcceBslon #t NP_063938.1 

1 11 21 31 41 51 

11)111 

MLFEGLDLVS ALATLAACLV SVTLLLAVSQ QLWQLRWAAT RDKSCKLPIP KGSMGPPLIG 60 

60 ETGSWLLCKSS GFQSSRREKy GMVFKTHLLG RPLIRVTGAB NVRKILKOEH HLVSTEHPRS 120 

TRMLIiSFSITV SMSIGDZIimi KRKVP6KZP8 HBALESYLPK XQLVZQDTLR ANSSHPBAZM 180 

VYQEAQKLTF RMAIRVLLGP SZPBBDXjGRL FBVYQQFVDH VFSLFVDLPF SGYRRGIQAR 240 

QIIiQKGZtEKA IREKIiQCTTOG KDYLDALDLL IES6KBKGKE MTMQBLKDGT LBLIFAAYAT 300 

TASASTSIjIM QLLKHPTVLB KLRDEItRAHO ILHSGGCPGB OTLRLDTLSG LRYLDCVIKB 360 

65 VMRLFTPISG GYRTVLQTFB IDGFQIPKGM SVMYSIRDTH DTAPVFKDVN VFDPDRFSQA 420 

R8EDXDGRFR YLPP66GVHT CU3KHLAXLP LRVLAVELAS T8RFEIATRT FFRITLVPVL 480 
HFVDGLSVKF FGLDSNQNBI LPETEAMLSA TV 

Seq ID MO: B48 DMA sequence 
70 Muclelc Acid Accession #i AB040527.1| AL136582.1 
Coding sequences 94.. 2319 

1 11 21 31 41 51 

nc \ I I i I ' 

/5 GOSGCrGOGG GGTTCXrPGGT GCTGAGGAOG GACGCCATTG GAGTTCCOGA GAAOCATGGC 60 

T6AGGQAAGC TTCA60GTGC AATCX3GAAAO CTACAGTGTT OAAOACATGO ATQAGGGTAG 120 

0GACX3AAGTC GGGGAGGAAG AGATGGTTGA AGGCAA06AC TATQAAGAAT T06GTG0GTT 180 

TGGTGGCTAT GQCACCCTCA CCAGCTTTGA CATCC3VTATC CTCAGAGCCT TOGGAAGCTT 240 

GGGTCCAGGC CTTCGCATCT TATOQAATGA OCCCTGGGAA CTGGAAAACC CTGTGCTGGC 300 

80 CCAGACCCTG GTX3GAGGCAT TGCAGCTGOA TCaSGAAACA CTT6CCAATG AGA0GGC06C 360 

COOT G CT G OC AAOGTAOCOC GCGCOGGGGC CTCCAACOOT G066CTGGGG CGGCTGCGGC 420 

CGCTQCCX3GT AGC36CCTrCA GTCAGGTGGrr OGCTAGCCAC OQGGTGGCCA 06C0QCAGGT 480 

CTCAG6AQAG GATACCCAGC CCAOOACCTA OSC06C06AO 0CTCA66GGC CCACXX3CTGA 540 

GCCAOCCCTT GCTTCTCC30C AflACCTCCCA GAT6TTA6TC ACCAGTAAGA TGGCTOOCCC 600 

815 
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OGAGGCTCCG GCAACCTCOQ CACAOTCCCA GACAGGCTCC CCX30CCCAG0 AGGCTCCT3VC 660 
TGAGGGCCCT AGTA606CCT CT IX X TTTCTC TCAGGCTCCG TGTGCCAGGQ AGGTGOAOQC 720 
CAACOGGCCC AGCACAQCCT TCCTGGGCCA GAATGATOTC TTOSATTTCA CTCAGCOGGC 780 
AGOTGTCAGT GGCATGGCCT TCCOGOGCCC CAAfiAQAOCT GCCCCAC3C0C AAGAG GCTQ C 840 
CACAGAGGGC CCCAGTGCTG CCXXrTGGTGT GCCCCAGACO GQACCTQOCA GGOAGGTGGC 900 
AGCCACCOGG CCCAAGACCA CCAAOTCGQG OAAGOOGCTG GCCAAGACTC GGTGGGTGGA 960 

GCCrCAGAAT GTTGTGGCAO CAGCTGCTGC CAAGGCCAAG ATGGCCA03A GCATCCCTOA 1020 

GCOGGAGGGT GCAGCTGCTO CCACTGCTCA GCACAGXaCT GAGCCCTGGG CCAGGATGGG 1080 

AGGCAAGAGG ACCAAGAAGT CCAAGCACCT GGATGATGAG TATGAGAOCA GGOAGGAGGA 1140 

QAGAGAGACT CCCGCGGTCC CACCCACCTG GAGAGCATCA CAGCCCTCAT TGACGOTGCG 1200 

GGCTCAGTTG QCCOCTCGGC CCCCGATOGC CCCQAGGTCC C3«3ATACCCT CAAGGCAOGT 1260 

ACTOTOCCXO CCCCCCOGCA AOGTQACCCT TCTGCAGGAG AGGQCAAATA AGTTGGTQAA 1320 

ATACCTGATO ATTAAGGACT ACAAGAAGAT CCCCATCAAG CX3CGCAGACA TGCTGAAGOA 1380 

TGTCATCAGA GAATATGATG AACATTTCCC TQAQATCATT QAAGGA6CAA GGTACACGCT 1440 

GOAAAAGAAG TTTGGGATCC ACCTGAAGGA GATCGACAAG QAAGAAOVCC TGTATATTCT 1500 

TGTCTGCACA OGGGACTCCT CA6CT0GCCT CCTTGGAAAA ACCAAGGACA CTCCCAGGCT 1560 

GAGTCTOCTC ITGGT6ATTC TGGGOGTCAT CTTCATGAAT GGCAAOOOTG CCAGCGAQGC 1620 

TGTCCTCTGG GAG6CACTAC GCAAGATGGG ACTGOGCCCT GGGGTGAGGC ACCCATTCCT 1680 

CGG0GATCT3 AGGAAQCTCA TCACAGATGA CTTTGTGAAG CAGAAGTACC TGQAATACAA 1740 

GAAGATCOCC AACAGCAACC CACXTTGAGTA TOAATTCCTC TGGGGCCTGC GAGCCCGCCA 1800 

TGAGAOCAGC AAOATQAGGG TCCTGAQATT CATOGCCCAQ AATCAGAACC GAGACCCCCQ 1860 

GGAATGGAAO OCTCATTTCT TG6AGGCTGT OOATQATGCT TTCAAOACAA TGGATGTGGA 1920 

TATGGCCGAG GAACATGCCA GGGCCCAGAT GAGGGCCCAG ATGAATATCQ GGGATQAAGC 1980 

GCTGATTGGA CGGTGGAGCT GGGATGACAT ACAAGTCGAG CTCCTGACCT GGQATQAGGA 2040 

CGGAGATTTT GGCX3ATGOCT GGGCCAOGAT CCCCTTTGCT TTCTGGGCCA GATACCATCA 2100 

GTACATTCTO AATAGCAACC OT6CCAACAG GAGG6CCA0G TGGAOAGCTG GOQTCAGCAG 2160 

TGGCAOCRAT GGAGGGGCCA GCACGAOCGT CCTAOATGGC CCCAGCACCA GCTCX:A CCAT 2220 

CGGGACOVGA AATOCTOCCA GA6CTGG0GC CAGCTTCTTC TCCTGGATCC AGCACCQTTO 2280 

ACQAACTGCA GCGATCTTAC TGGCCAAGCC AOAGCGCCTC CTCTCAQATT CCTTCTCQAC 2340 

ACAGCACCCr AGGCGGCTTC TTCCTGTCAO TOGGAGGTGG CATGCAAQAT GAAGCTCTCT 2400 

TTGCTCTTOC TGCTTTCATT TTGTGCTTTT Cenm i ailT TCATOTTTIG GOTATCAGTG 2460 

ttacattaaa gttgcaaaat taaa 

8eg ID NOi B49 Protein sequence 

Protein Accession «: BAB33378.lj CM66517.1 

1 11 21 31 41 51 

I 1 1 I I I 

MAEGSPSVQS ESYSVEDMDE GSDEVGBEBM VEGNDYEEFG APGGYGTLTS PDIHILRAFG 60 

SLGPGXjRILS NEPWELENFV IAQTLVEALQ LDPBTLANET AARAANVARA AASNSAAHAA 120 

AAAARTAFfiQ WASHRVATP QVSOEITIQPT TYAAEAQGPT PHPPLASFQT 6QMLVTSKMA 180 

APBAPATSAQ SQTaSPAQBA ATEQP88AGA F8QAPCAREV DANRPSTAFL GQNDVFDFTQ 240 

PAGVSGMAPP RPKRPAPAQE AATEGPSAAS GVPQTGPQRB VAATRPKTTK SGKALAKTRW 300 

VEPQNWAAA AAKAKMATSI PEPEQAAAAT AQHSAEPWAR KGGKRTKKSK HLDDBYESSE 360 

EERBTPAVPP THRASQPSLT VRAQLAPRPP MAPR8QIPSR HVLCLPPRNV TLLQERANKL 420 

VKyiMIXDYK KIPIXRADML KDVIRBYDEB PPBIIERATY TLBKKFGXHXi KBIDKBEHLY 460 

ILVCTRDSSA RLX^GKTKDTP RLSLUiVXliG VIPMNGNRAS EAVLWBALRK MOLRPOVRHP 540 

PLGDLRKLIT DDPVKQKYLE YKKIPNSNPP BYBPI*WGLRA RHETSKMRVL RPIAQNQNRD 600 

PREWKAHPLB AVDDAFKTMD VDMAEEHARA QMRAQMNIGD EALIGRW8WD DIQVELLTWD 660 

EDGDFGDAWA RIPPAFWARY HQYILMSNRA NRRATMRAGV SSGTUGGAST SVLDGPSTSS 720 
TIRTRNAARA OASPFSWIQB R 

8eq ID NOi B50 DHA sequence 

Nucleic Acid Accession #: XM_084965.l 

Coding sequence t 3 56.. 2014 

1 11 21 31 41 51 

I I I 1 I I 

GACCTAGCGT GTGCTa«;CT CTGGACAAGA CATOGATOTA OGCAOACATQ CTTCCTQCCT 60 

6CTGGGCTTA GiaCACCAAT 0CTAA6GACA GACATOAAGG GCTTGGGAAA GGCATGAAGA 120 

TGCAOGGAOT CTAGGAGAGA GGATAAGAAA AACAAGCSiTT TTOTOOTGG G TTOAATTGCO 180 

■ nmufrm ' tagaggcatt tgotoatoca tqottgcgct ccggtgaaga gggagtctao 240 

AGAAGGAATC C6AAQ0GQC0 G0GGC6A6CO OGQCTCAGGT TGArTTAGAA TACGGGTQAC 300 

AGTGGCCTGG CGC3GAGCCCA CTGCTOAOaA AAOOGGCTTA TCCOGCGCGG TTTCCATGGA 360 

QA06A0CGGG AOCCQOQGCQ 60GG0GG66C TGTCAGGGAG GGOGGGQGAG CTOGOOCGTC 420 

OGTGGGGGTC TGCAGQAfiGA AGGOGQAGOC C9GX»300GGO ACOGGOACOC TOGOBGCAGA 480 

CATGGACTTO CATTOTQACT GTGCCQCOQA AACGCCG6CC GCOGAGCOGC 0GTCG6GGAA 540 

GATTAATAAA GCTGCCTTCA AATTATTCAA GAAGAGOAAA TCX300TQGCA CCATGCCCAG 600 

CATTTTTGGG GTCAAAAACA AAGGGGAOGO GAAAAGCTOG GGTCOGAOGG GGCTGQTGAG 660 

OAGCAGGACC CA08AQGGAC TTGCXSSAGGT GCIG6T6GTG QA6AG06GCA OGAAG OAOQ A 720 

GOCG0GCG6C 6GGGG0GACA QCXSGGGQOGG OGQOGQOGGQ GGOCOQAACC OOQQSCCCCC 780 

CAGAGC06CA G0QCG0G60G GGGGCTCOCT 06CCAGCAGC TGG6TGGGCA AGTOGOkCAQ 840 

CTTCTTCTOG CTGCTGAAGA AGAAOGGGOG CTOOGAAAAC OGCAAGOQAQ AGCCTGTGGA 900 

OGOGAGCAAG GCCGGCGGCA AACAAAAGOG GGGGCTGCXKJ GGGCTGTTCA GCGGCATGOQ 960 

CTGGGACAGG AAAGACAAGC GGGCCAAGGC GGA6GC06CG GAGGGGOQGQ COCCOGGGGO 1020 

OGGCTTGATC CTACCCGGCT OGCTCACOGC CAOCCTGQAO TGOGTCAAGQ AG6A6AGGGC 1080 

CAOAGCCGOO GQGGA6CCG0 AGQAGCCGAG CCA6GAGGCC COGCGAGAOC CAGCAGGCTG 1140 

TGGAGATATT ATTGCAGACC AAGAGGAAGA GGCAGGTCCC AGCTGTGAC3V AGCATGTCCC 1200 

CGGGCCAGOC AAGCOGGCTC TGTCTAAAAA GAAOCCCGGC GTGGTGGCCT ACCAAGGAGG 1260 

G6GGGAAGAG ATGGCCAGCC OBGAOQAGGT GQAOQACACC TATCTACAGG AGTTCTGGQA 1320 

CATOCTCTCC CAGACOGAOQ AOCAGG6ACC 0GA8000CAG GAGGG060GQ CTAAOQ TQGC 1380 

AGCTG06CTG GAAAGCAAGO TGGTGOCOQA OAOOOOOUVA QACA CCAOO T GTGTGGAAGC 1440 

GOCCAAGGAC GCGTCCT03G TCAAGC6CA0 GAGGCTCAAC OQQATTCCCA TGQAGOCCCA 1500 

TCCTAAGGAO GAGCCCAAGC AGCOGGAGAA GQMZCAQCAO 0AAGG09TCC CGAACA60GA 1560 
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CGAGQQCTAC TGGGACTCCA CCACGCCAOG CCCAGACGAA GACAGCTCGA GCA6CGGGAA 1620 

GAAOGOGGGC ATCCCCCGGG ATACCTACAO CSSOGGAOGCa CTCTATGATC TCTATGCTGA 1660 

CCOGGAOGGA AGTCCAGCAA CCCTTCCTCQ AGGQAAGQAC AAOCSAOOAQA OGTCCTCCCT 1740 

GTCCOGGTTA AAGCCCGTAT CTGCAGGCAC CATCACCTOT GCACTGOOAA C3VCCA0GCA0 1800 

CTTGCTGAAG GACTCTAAGA TCCCTATTAG CATCAAQCAC CTGACCAACC TTCCATCTAG 1860 

CCATCCCGTG GTGCACCAOC AACCCTOCAG GA6TGAQATG CCCAGAACAA AAATCCOGGT 1920 

TTOCAAAGTG CTGGTCCGCA GAGTCAGCAA COGGGGCTTG QCTGGGACCA CCATCAGAGC 1980 

AACX3GCX:T6C CACQACAGTO CCAMiMarrT GTXSAGGTCTT CCAGGCCAAG GTGGATGGGC 2040 

CCCATGCCAA GQAATACAAC TTTTCCCTGG AAACCACTAA AOTAAOTTTT GCTTTTCCTA 2100 

AAOAAAGTCT TTTAGOACAC CACCCOTCCX: CCGCCCTQCT CCAQAGCGTG GACCGAGGAG 2160 

QTCmOTGC CCTQAGCAGO GACOGGATAA CACCAGAAAG .AGGGATGCTA CA0QGGG6TT 2220 

TCT C CTCTCA AGATAAGTCC CTGAGAATTA TTTTCAAQCA Cnvm ' CrA ' TTTTACCTTT 2280 

AAGTTTTTCT TCCTTTTGCT TTAATATACT GAACACTTGG AAGTCACCTT TACTTGCCTT 2340 

TQCAGAAAAC AGAACTTAGC CAAACCTAAG TAAGAQTCAT GCCTOOATAT TGGGATAAGC 2400 

CAOTGTCTAO AGGCCTQAAG QAACOGCTGA AGAACCAGA6 GAGATCTCCT CTTCCAAGAC 2460 
AOSTTTCCTT CCTTOCCITC TATCVl 'C A ' rf CCTTTCCAGT T 

Seq ID NOt BSl Protein sequence 
Protein Acceasion «t XP_08496S.l 

1 11 21 31 41 51 

I I I t I I 

KBTSRSR6GG GAVSERGGAQ ASVGVCRRXA EAGAGTGTLA ADMDIiHCDCA AETPAASPPS 60 
GKINKAAFKL FKKRKSGGTM PSIFGVKKKO DGKSS6PTGL VRSRTHDGLA BVLVLESGRX 120 
EEPRGGGDSG GGG6GRFNPG PPRAA6P6GG 8IA8SSVAKS HSFFSIjLKXN GRSBNGKGEP 160 
VDASKAGGKQ KRGLRGIiFSa MRNBRIOnCRA KABAAEQRAP GGGLXIiFGSL TA8LECVKEB 240 
TPRAAREPBE PSQXUkPRDPA GOCmilADQB EBAOPSCDXB VPGPGKPALS KKNP6WAYQ 300 
GGGEEMASPD EVDOTYLQCP WDMLSQTBEQ GPEPQBGAAK VAAALETKW PBTPKDTRCV 360 
EAAKDASSVK RRRLKRIPIE PHPKBBPKHP BKEQQEX3VFN SDBGYWDSTT PGPEEDSSSS 420 
GKKAGIPRDS YSGDALYDLY ADPDGSPATL PGGKDNEETS SLSRLRPVSP QTITCPLRTP 460 
GSLLXDSKIP ISIKHLTNLP SSHPWBQQP SR8EMPRTKI PVSKVLVRRV SNRGLAGTTX 540 
RATACRDSAK KL 

Seq ID NO: B52 DNA sequence 

Nucleic Acid Acceseion #j FGENESH predicted 

C3oding sequence; 1..2016 

1 11 21 31 41 51 

I I 1 I I I 

ATOQAGACGA GC0GGA0CX» OGGGGOCGOC GGGGCTGTCA GOGAGCGOGG CGGAGCTGGC 60 

OGGTCOGTGO GGOTCTGCAG GAGGAAGGGQ GA0GCCX3GQQ COGGGAOGOO GAOC CTOG OG 120 

GCAGACATGG ACTTGCATTQ TQACTGTGCC GCCGAAAOSC OSGCOGCOGA GCGGG06T06 180 

GGGAAGATTA ATAAAGCTGC CTTCAAATTA TTCAAGAAGA GGAAATCGGG TGGC3VCCATG 240 

CCCA6CATTT TTGGGGTCAA AAACAAAGGG GAOGGGAAAA GCTCGGGTCC GAOGGGOCTG 300 

6TGAGGAGCA GGACCCAC6A CXX3ACTTGCC OAGOTOCTGG T6CTGGAGAG CGGCAOGAAG 360 

QAOQAGCOGC GOQGCGGGGG CGACAG06GC G6GGGCGG0G G6GGGC6GCC GAACGC3QGQ0 420 

CCCCCCAGA6 C06CAG6GCC OGGOGGGGGC TCCCTOOCCA GCAGCTCGGT GGCCAAGTOS 480 

CACAOCTTCT TCTCGCTGCT GAAGAAGAAC GGGCGCTCGG AAAAOGGCAA GGGAGAGOCT 540 

GTGGAOGCGA GCAAGGCXX3G GGGCAAACAA AAGCX3GGGGC TGOGGGGGCT GTTCAGGGGC 600 

ATGCGCTOGC AGAGGAAAOA CAAG0C3GGCC AAGGCGGAGG CCGOGGAGGO GCX3QGG6CGC 660 

GGGGGOQGCT TGATGCTACC OGGCTGGCTC AOGQCCAOCC T0GAGIGCXIT CAAGGAGGAO 720 

A06CCCAGAG COQ08C300GA GCXSGGAGGAO CCCAGCX3U36 ACGCCCXI60Q AQACCCAGCA 780 

GGTGAGCCCX3 CAGGGGGAGA 6GAGGTGCCG GCCCC06CC6 ACCX3CGC0CC AOCXSCXKSAGC 840 

T0CCX3AGAGG CAGAGGGCCT aSOSCACCCC GGCGACACCG G06CC0GGG6 AGAGGACGCC 900 

GCGGGGCATC GGC6CX3CC6A G006GGGCCC GGGGAGGTCC QCAGQGCAGA GGA06CTTCC 960 

AG6ACX3GGGG COGTTCCGGT AAASAGGGTC GOCCTTGTOQ ACTCGOAAGO GGGOVGOGGC 1020 

OQGGGGCCOS COQCCCCAGA CCCTGCCTCT 6T06ATGCAC GCTCAQAGCC GTGG6CAGAT 1080 

CGTATTTOTT TGATGTTTTC TGACGTGACT TCACTGAAAA QCTTTGACTC TCTTACAOGC 1140 

TGTGGAOATA TTATTGCAOA CCAAGAQGAA GAGGCAGGTC CCAGCTGT6A CAAGCATGTC 1200 

CCCGGGCCAO GC3U\GC0GGC TCTGTCTAAA AAOAACCCGQ GOQTGOTGGC CTACCAAGQA 1260 

G6G6G0QAA0 AQATGGCCAO CCOGGAOQAO GTGGAGQACA CCTATCTACA GGAGTTCIGG 1320 

GACATGCTCT CCCAGACGGA GGA6CAGGGA OCOSAGCCCC AGGAGGGOSC 6GCTAAGQTG 1380 

GCAGCTGCGC TX3QAAACCAA GGTG6T6C0C OAGACCCCCA AAGACAGCA6 6TGTGTGGAA 1440 

GCX3GCCAAGG A0G06TCCTC G6TCAAG06C AGGAGGCICA ACCGGATTOC CATOQAGCCC 1500 

CATCCTAAGG AGQAGCCCAA QCACCOSGAG AAGQA6CAGC AOGAAGGOGT CCCCAACAGC 1560 

OACOAGGGCT ACTGGGACTC CS^CCAOGCCA GGCCCA6AGG AAGACAGCTC QAGCA60GGQ 1620 

AAGAAGGOQQ GCATCCCCOS G6ATA6CTAC A60GGG6A0G OOCTCTATGA TCTCTATGCT 1680 

GACXXXKSAOa QAAGTCCAGC AACCCTTCCT GGAGGGAAGG ACAAC6AGGA GAOGTCCTCC 1740 

CPGTCCCX3GT TAAAGCCOOT ATCTCCAGGC ACCATCACCT GTCCACTGOG AACACX3U3GC 1800 

A6CTTGCT6A AGGACTCTAA GATCOCTATT AGCATCAAGC ACCTGACCAA CCTTCCATCT 1860 

AGCCATCGOG TG6TGCACCA GCAAOOCTCC AGGAOTQAGA TGCCCAGAAC AAAAATGOOG 1920 

GTTTCCAAA6 TGCT6GTC06 CAGAQTOVGC AACOGGGGCT TGOCTGGGAC CACCATCAQA 1980 
GCAACGGCCT GCCA0QACA6 TGCCAAAAAG TTOTOA 

Seq ID NOt BS3 Protein sequence 
Protein Accession ftt FGENBSH predicted 

1 11 21 31 41 51 

1 I i i I I 

KETSRSRQGO GAVSERGGAG ASVGVCRRRA EAGAGTGTLA ADMDLHC2XA AETPAASPPS 60 
GKINKAAFKL FRKRK6G6IH PSIPGVKNKO DGKSSOPTOL VRSRTHDGLA BVLVLBSGRX 120 
BEPRGGGDSO GGGGGRPNPO PPRAAGPGGG SLASSSVAK8 RSPFSLLKKN GRSOnSKGEP 180 
VDASKA06XQ KRGLRGLF8G MRNHRXDKRA KAEAAEGRAP GGOLXLPGSL TA8LBCVKBB 240 
TPRAAREPEB PSQOAPRDPA 6BPAGGBBVP APADRAPAR8 CREAEQLABP GDTQARGBDA 300 
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PCT/US02/36810 



ACHRRABFGP 6BVRTAEDAS RTGAVPVKTV PLVDSEOGSG RAPAJVPDPAS VDPPSDPSAD 360 
RICIMFSDVT SLKSFDSLTG OGDIIADQBB EAGPSCDKHV PGPGKPALSK KNPGWAYQO 420 
GOBEMASPDS VDDTYLQBFW DMLSQTEEQO PBPQEGAAKV AAALST KWP BTPKDTRCVE 480 
AAKDASSVKR RRLNRIPIEP HPKBBPKHPB KBQQBQVPNS DEOYWDSTTP GPBBDSSSSG 540 
5 KKAOIPRDSY SGDALYDLYA DPDGSPATIiP CQKDHEBTSS LSRUCPVSPQ TITCPIiRTPG 600 
8LLKDSKXPI SIKHLTNLPS SHPWHQQPS RSEHPRTKIP V8KVLVRRV8 NRGLAOTTIR 660 
ATACBDSAKX h 

Seq ID NO I B54 DNA sequence 
10 Nucleic Acid Accesaion ft< NM_014138.1 
Coding sequence t 60.. 854 

1 11 21 31 41 51 

15 CTCCAGAGAC TTCCCAGOAA GGTCCAGOOC CCTCTCAGCC TTOGTACTCA GAACAGCCQA 60 

TOATGGGCCT CAOTAACCTO AiGCCCCXSGTC CXQGCCCCAG CCAGGCCQTG CCTCTCCCAG 120 

AQGGGCTOCT C0GCCA6CGG TACAGAGAGG AOAAGACCCT GGAA6AGOGG CGGTGGOAGA 180 

GGCTGGAGTT CCTTCAGAGQ AAQAAAGCAT TCCTGOGGCA TGTGAGGAGG AOACACOGCG 240 

ATCACATGGC CCCCTATGCT GTTGGGAGGQ AAGCCAOAAT CTCCCCATTA GGTGACAGAA 300 

20 GTCAGAATCX3 ATTCC6AT0T GAATOTCX3AT ACTGCCAGAG CCACAGGCCO AATCTTTCTG 360 

GGATOCCTGO OOAGAGTAAC AGGGOCOCAC ATCCCTCCTC CTGGQA6ACG CTGGTGCAGG 420 

GCCTCAGTGG CTTGACTCTC AGCCTAGGCA CCAACCAGGC OGGGCCTCTG CCTGAAGOOG 480 

CACTCCAGCC ACAGGAGACA GAGGAGAAGC GCCA6CGAGA GAG6CAGCAG GAGAGCAAAA 540 

TAATOTTTCA GAGGCTGCTC AAGCAQTGGT TAGAQOAAAA CTGAGACGTG CACXXCCATG 600 

25 GGATGGAGAC CCGAAG6GAC TCAOAOGGAG COGCOGTGTT GGCAGCGCCT GGOTGTGGGC 660 

CCATTTTGGG 6ACCAAACAG CAAGCTGTOO TOGQATQAOT GCCAGGACCT GTQTA CCGGG 720 

ACAiOGTGGGA GTCCTCCCRG CATGATGCTT QACTQAOCOG AGQAAGGTCC TCATGTTTCG 780 

TGCCTGTCAT TCTOGGATGG CTGTGAGGCA TTCCTTGOCA AQGQAOGCTG CGTACXSVGCG 840 

GTCCTCACCG CATCTCACAT GGCTCCTQTG ATOCATGTTG TOSCTTTCCC ACCCGOQATC 900 

30 TCCATCTCTC TTCXXTTTCCT GCTGTCAGTA AGAGATCAC3V TOTCTGTGTA GTGTGAATGC 960 

CT T O TOGCTG TCCTGTQCTT TTOCAOCATT GAGTTGACTG CCTCXGAGAA GC3«JCACTAG 1020 

GCCTGTTGAA ATGCAATGTG CTGCCCTGAO ATCCAGTTTC AAGAATGGGC AGGTAAAOGC 1080 

AOTGTGGGAA AGGAATGTGG AATQAGAACT TGOTGGTTCA CCGCTGTACT ATTTGTGTAA 1140 

ATGTTTACXJT ATGTGATAAG CTACATOTAT GTAAATGTTG CAATACCCCT AACAGTCGAG 1200 

35 TAGTAGTCTC CCTTAC3VGGA ATTTTTQAOG GGGTTCCTCA TCATCAATAC CAAATAAATA 1260 

TATGTAGGAA TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1320 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1380 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

40 Seq ID NO: BSS Protein sequence 
Protein Accession #i NP_0S4 857.1 

1 11 21 31 41 51 

45 HMOliSNLSPG PGPSQAVPLP EGLLRQRYRE BKTLEERRWB RLBFLQRKKA PLRHVRRRHR 60 
DHHAFYAVGR EARXSPLGDR SQNRFRCBCR YCQSHRPHLS 6XP0ESNRAP HPSSNETLVQ 120 
GXiSGLTIiSLG OMQPGPLPEA ALQPQBTBEK RQRERQQBSK IMPQRUiKQW LBBN 

Seq ID NO J B56 DNA sequence 
50 Nucleic Acid Accession 9* NM_O00O25.1 
Coding sequence i 198.. 1424 

1 11 21 31 41 51 

^^1 I 1 i 1 I 

55 GCTACTCXrrC CCCCAAGA6C GGTGGCACCG AGG6AGTTGG GGTGGGGGGA GGCTGAGCGC 60 

TCTQQCTOG O AGAGCCA6A0 AAGATGGCCX: AGGCTOGGGA AGTCX3CTCTC ATGCCTTGCr 120 

GTCCXXTCCC CTQAGCCAGG TaATTTGGQA QACCCOCTCC TTCCTTCTTT CCCTACOQCC 180 

CCACGCXKX5A CCCGGGQATG GCTCCX3TGGC CTCACGAGAA CAQCTCTCTT GCCCCATGGC 240 

CGGACCTCCX: CACCCTGGCX3 CCCAATACOG CCAACACCAG TGGGCTGCCA GGGOITCCXST 300 

60 GGGAGGCGGC CCTAGCOGGG GCCCTGCIGO 0GCTGGG6GT GCTGGCGACC GTGGG AGGCA 360 

ACCP Q CTOOT CATOGTGQOC ATGGCCTGQA CTOOQAGACT OCAOAOCAIG AOCAAOGTGT 420 

TCQTGACTTC 6CTGGC0GCA GCOGAGCTGG TGATGGGACT OCT GQ T G QTO C0GC0G GC3GG 480 

CCACCTTGGC GCTGACTGGC CACTGGCCOT TGGOOGCCAC TQGCTGCGAG CTGTGGACCT 540 

OGGTGGACGT GCTGTOTGTG ACCGCCAGCA TOGAAACCCT GTGCXSOCCTG GCXX3TGGACC 600 

65 GCTACCXGGC TGTQACCAAC CCX5CTG0GTT AOGGOQCACT GGTCAOCAAG CGCTGOGCXX: 660 

GGACAQCTGT G6TGCTGGTG TQGGTGQTGT CGG006GGGT GTOaTTTGOQ COCATCATGA 720 

GCCAGTGGIG GOOOOTAGGO GCOSAOGOOG AGGOGC3U30G CTGCCACTCC AACCOGOGCT 780 

QCrrGTGCCrr CXSCCTCCAAC ATGCOCTAOG TGCTGCTGTC CTCCTCCGTC TCCTTCTACC 840 

TTCCTCTTCr CCTQATGCTC TTCGTCTAaS C3G0GGGTTTT CGTGOTGGCT AOGOGCCAGC 900 

70 TGOQCTTOCT GCX3CGGGGAG CTGOGGOGCT TTO06CCC3GA GQAGTCTCOG C0GG05C0GT 960 

06GQCTCTCT GGGOOOGGOC COGOTGGGQA 0GTO06CTCC GCOCQAAGGO GTGCCOQGCT 1020 

GCGGCCGGOQ GCC0Q060QC CTCCTGCCTC TCCQ6GAACA COOOOCOCrO TOCACCTTGO 1080 

GTCTCATCAT QGGCACCTTC ACTCTCTGCT GOTTQCCCTT CTTTCTGGCC AACQTGCTGC 1140 

GC6CCCTGGG GGGCCCCTCT CTAGTCCOGG GCCCGGCTTT CCTTGCCCTO AACTGGCTAG 1200 

75 GTTATOCCAA TTCTGCCTTC AACCOOCTCA TCTACTGCJCG CAGCCCGGAC TTTCGCAOOG 1260 

CCTTCOGCOQ TCTTCrGTOC OOCTGOGGCC GTOGCCTOOC TOOOOAGCOC TGOGCGGGOG 1320 

CGGGCGOGGC C VA'CnCCOC TOQQGOGrTC CTGOGGCCOO GAOCA GCCC A 606CAGGCCA 1380 

GGCTTTGCCA AQSGCTCGAC GOOGCTTCTT GGGQAOTTTC TTAGGCCTGA AGGACAAGAA 1440 

^ GCAACAACTC TGTTQATCAG AACCPOTGQA AAACCTCTGQ CCTCTGTTCA GAATGAGTCC 1500 

80 CATGGGATTC CC0G6CTGT0 ACACTCTACC CTCCAGAACC TGACGACTGG GCCATOTQAC 1560 

CCAAGQAGGO ATCCTTACCA AGTGGOTTTT CAOCATCCTC TTGCTCTCTO TCTQAGAGAT 1620 

GITTTCIAAA COCCRGCCTT GAACTTCACT 0CT0CCTCR6 TGCTAOTGTC CAGGTGCOOT 1680 

GQAGCAGCftO OCTGOCTTTO GTA06GGCAC CCATCACCOO OCTTGOCTGT GC AGICAG TG 1740 

A U ' W Cl'l'AGG 6CAAAGAGAG CTCXrCCTGGT TCGATTCCTT CTGCCACOCA AAGCCTGATG 1800 
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AGAOCTTAGT OTTCTCCAGO CTCTGTg OO C CAGOCTGACA GCAQCAGGGT AQAAAAGACC 
AAGATTTGGG GTTTTATCTC TGGTTCCCTT ATTACTGCTC TCAAGCAGTG GCCTCTCTCA 
CTTTAGCCAT GGAATGGCTC CGATCTACCT CACACCAGTG TCAGAAQGAC TTOGCCAGGO 
TTTTOGGAGC TCCAGGGTTC ATAAGAAGGT GAACCATTAG AACAOATCCC TTCTTTTCCT 
TTTGCAATCA OATAAATAAA TATCACTGAA TQCAOTTCAT CCTOGGCCCA CTTTCCCTCC 
GTTTOTTTTC TTTrCATAAT CC»CTTACTC OCTTCOCTTC TACTCTGOQC TOQCTTTTGA 
C3WSAG6CAGT AAATTAGGCC TAATCCTCAC ' IVmTCl IC CTAAT CTTCA TCAAACAAAA 
AATOAAAAGT CTGTCTG O AC GAAGGGOAGT GAGCTTGAGC CTTTGATATC TTGCTCCCCC 
ACCCTTCCTG AAACTCTTGA AATCCAGTTG CCATTGAOTA GCAAAGCCAC GCTCCCCACA 
GGACTT6GAC AGAGGGGCCA CAGGGGGATG GGCTGGCTGT GGCCAGGTTT AG66CAGG66 
GCATTTGTCC CCTCCATGCT ATAATCCAGT GGTGCXrTTAC ATGGTSTGTQ TOIOTOTGTO 
TGCXST G T G TG TGTGT G TOTG TGTGTCT06A G6CACAGGCA CA AAGC ATTG CTTGGGTTGG 
TCAAATQTCT TGTGTCATAA ATATATTCTG ATGTTTCCCA 6CCTTTCCAC AACCTCTACC 
TTCCCACTCA CCTTCCCCAG CTACAAAAAT CTOTATTATC CTCTTAAAGT AAAACTGGAG 
TTAC 

Seq ID NOi B57 Protein sequence 
Protein Accession fi: IIP_000016.1 

1 11 21 31 41 51 

I I I I 1 1 

MAPWPHEIVSS LAPHPDLPTL APHTANTS6L PGVPHEAALA GALLALAVLA TVG(an<I«VIV 60 

AIAHTPRLQT MTNVFVTSLA AADLVMGLLV VPPAATLALT GKHPUIRTGC ELWTSVDVLC 120 

VTASIBTIiCA LAVDRYIiAVT NPLRYGALVT KRCARTAWL VWWSAAVSP APIMSQWWRV 180 

GADABAQROl SNPRCCAPAS NMPYVLLS8S VSFYLPLLVM LFVYARVFW ATRQLRI«LRG 240 

BIiGRFPFEES PPAPSRSLAP AFVOTCAPPB GVPAGORRPA HLLPIiREHRA LCTLGLIMGT 300 

FTLOOiPFFL AHVLRALGGP SLVPGPAFLA LNWLGYANSA PNPLIYCRSF DPRSAFRRLL 360 
CRCGRRLPPE PCAAARPALP PSGVPAARSS PAQFRLOQRb D6ASW6VS 

Seq ID NO: B56 DUA sequence 

Nucleic Acid Accession «i NM_032553.1 

Coding sequence : 37 . . 1038 

1 11 21 31 41 SI 

) I I I I 1 

CACGATTAGO CAAAGATAGT TTCTCTAGAG AGAATCATGC CT6CTAATTA CAOGTGTAOC 60 

AG6CCAGATG GAGACAATAC AGATTTTCOA TACTTTATTT ATQCAGT6AC ATACACIGTC 120 

ATTCTTGTGC CAGGTCTCAT AGGGAATATA TTAGCCCTGT GGGTATTCTA TGGTTATATQ 180 

AAAQAAACAA AACX3AGCTGT GATATTTATG ATAAACTTAG CCATTGCTGA CTTACTACAA 240 

GTTCTTTCCT TGCCACTGAO GATCTTCTAC TACTTGAATC ATGACTGGCC ATTTGGGCCT 300 

OOTCTCTOCA IWCrO TTT CTACCTGAAG TATGTOACA TQTATGCAAO CATCTACTTC 360 

TTGGTCTGCA TCAGTGTOOQ ACGATTTTGG TTTCTCATGT ACCCCrTTOO CTTCCATGAC 420 

TOCAAACAGA AATATGACCT GTACATCA6C ATTGCTOGCT GGCTGATCAT CTGCXTTTOCC 480 

TGTGTACTCT TTCCACTCCT CAGAACCAST GATGATACCT CTGGCS^AIAG GACCAAATQC S40 

TTTGTGGATC TTCCTACCAO GAATOTCAAC CTGGOCCAGT CX3GTTGTTAT 6ATQACCATT 600 

GGCCSAGTTQA T T OQG TTTGT AACTCOQCTT CTGATTOTOC TATATT6TAC CIGGAAOAOG 660 

OTTTTATCAC TGCAAGATAA ATATCCCATO OCOCAAGATC T T0GAG A6AA ACAQAAAGCC 720 

TTQAAGATOA TTCTAACCTG TGCAGGGGTA TTCCTAATTT GCTTTGCACC TTATCATTTC 780 

AGTTTTCCTT TAGATTTCCT GGTGAAGTCC AATGAAATTA AAAGCTGCCT AGCCAGAAGQ 840 

6TGATTCTAA TATTTCATTC TGTGGCATTG TOTCTTGCTA GTCTGAATTC ATeXCTTOAC 900 

OCAOTCATAT ACTACTTTTG GACTAAIOAa TTGC38AAGAC GGCTTTCAAO ACAAGATTTO 960 

GATGACAGCA TCCAACTCXSl TGCAAAATCC TTTGTGA6TA ACCATACAGC TTCCACCATO 1020 

ACACCTGAAT TATGCTAAAA CAAAAAACCA AACTGAATGT GACCTGAAAT GCAAGTACAT 1080 

CAGAACATAT CTGCAATACC CAAGCCACAG GQAAGAACTT GCAAAACAAC ACAGCTTTTC 1140 

A0TTCT6CTC TATCTTACTG CTATGGGOAA TTCACTTCTT CAAAOCAGGA CCTATTTGGA 1200 
GCATTAOQAT CCAGQATTAT TGATOTTOAC ATOTGCATOT AGTAATTTTT CTTCAA0T 

8eq ID NOt BS9 Protein sequence 
Protein Accession it NP_115942.1 

1 11 21 31 41 51 

II) 111 

MPANYTCTRP DCTNTDPRYP lYAVTYTVIL VPGLIGNILA LWVPYGYMKE TKRAVIPMIN 60 
liAXADLLQVL SLPLRIFYYL NHDWPPGPGL CMPCFVUCYV NMYASIYPLV CISVRRFWPL 120 
MYPFRFHDCK QKYDLYISIA GWLIICLACV LPPUiRTSDD T8GNRTKCFV DXJPTRNVNLA 180 
QSWMMTI6B IiIGFVTPLLI VLYCTHRTVL SLQDKyPMAQ DX/IBKQKAIiK HZIiTCAGVFL 240 
ICFAPYHFSP PLDPLVKSNE IKSOARRVI LIPHSVALCL ASUISCLDPV lYYFSTNBPR 300 
RRLSRQDLKD 8IQLHAKSFV SKHTASTHTP BLC 

Seq ID NOi B60 DNA sequence 

Nucleic Acid Accession fft CAT cluster 

1 11 21 31 41 SI 

III) 11 

QATTOGQATT TTAATC03AC TCACTATAOG GAATTTTGGC CCTOGA GGCC AA GAATT OGG 60 

COOGAGGGTT AGACATTTAA CTTGAGTCTT TTGATATGAC GOtATGCTTT TAT TTTAC OC 120 

TCATATATAA TTAATTTTCA TCTGGGCATA TATTTCCAGA TTAGAAATCA TTTCTTTTCA 180 

TACATTTTAA OQAATTTATA CATTTGCATC TQAAATTGTA TTCATTATCT TCT AQAAACC 240 

AAGAGT6GAC TTGGTATCOG TCTGATTATA rri ' i ' Crn ' q T TGGTA AAGAG CCTCTTTTTT 300 

CCCTTTTTAT TTTCATCTTC TCTTTCTCTA GTGTTTATAA TCTTATTTrT ATQAATGTAG 360 

TCTAATTATT TAGAACACXG TCTOOOAOTA TOTOTTTATT I'G G glTfOTG TTTTGAACAT 420 

TGCCTCCAGO TGOGAGTTTA TGTGTOTTTA TGTTTCCTAT TTQAGQTT6Q ACACTTAGAA 480 

AATTTTCTCA ATTCGAAAAC TATTTTOCTT CAOCACTCAG AAATTTTTAT CAAAG ATTAT 540 

TTCC A OOATT ATTTCCTGTC Crmm ' T ' lTi OOIOTTCTTA AGCTAGAAIT CAATTGTTTA 600 
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ATCTIACACC TTACAOVTCT TCAATGTATC ATATATATTT CACTCATATT ATTTATTGTC 660 

ATATCTTTTQ GACTATTGTG TGAAAGCAAT AAATGTATAT TACTTTAAAA AAAAAAAAAA 720 

AAAACTHCAT TTTCAAATTT AATACACACA AATTOTGCAT ACCCACACAC ATATACCACA 780 

GATATATTCA TTTTCAGGAT TTCTTTAAGT GTTATTTTAA AAATAATCAT ATTCTTATTT B40 

TCTQAACTGT GGGATTTCTT QAAA LTITI ' A GAAAACTCTC ATTA GT CTGT GTTTGTGTGG 900 

TTTCITTAAA TTATCTCTTC CTTCAQAATT ACTTTTOTGT TTTGAATTTG AOTCTTTGCC 960 

ATTCATAAGO OGGGCTTTCC TTGAATACCA GATAACTCTG AOTTTOCTOT TCATATTTAA 1020 
GAATAAATAA CCTAATTTGA AAAAAAAAAA AAAAA 

Seq ID NO: B61 DNA sequence 

Nucleic Acid Accession tti NM_014522.1 

Coding sequence! 846.. 3911 

1 11 21 31 41 51 

CTGOT GO TOC AGTAOCTCCA AAQATATOQA ATACACTCCT OAAATATCCT QAAAACTTTT 60 
TTTTTTCAGA ATCCTTTAAT AAGCAGTTAT GTCAATCTOA AAQTTOCTTA CTTGTACrTT 120 
ATATTAATAG CTATTCTTGT TTTTCTTATC CAAAGAAAAA TCCTCTAATC COCTTTTCAC 180 
ATGATAGTTQ TTACCATGTT TAGGCATTAG TCACATCAAC CCCTCTCCTC TCCCAAACTT 240 
CTCTTCTTCA AATCAAACTT TATTAGTCCC TCCTTTATAA TGATTCCTTG CCTOGTTTTA 300 
TCCAGATCAA TTTTTTTTCA CTTTGATGCC CA0A6CTGAA GAAATGQACT ACTOTATAAA 360 
TTATTCATTO CCAAGAGAAT AATTGCATTT TAAACCCATA TTATAACAAA OAATAATGAT 420 
TATATTTTGT GATTTGTAAC AAATACCCTT TATTTTCCCT TAACTATTGA ATTAAATATT 480 
TTAATTATTT GTATTCTCTT TAACTATCTT GGTATATTAA AGTATTATCT TTTATATATT 540 
TATCAATGGT GGACACTTTT ATAGOTACTC TGTGTCATTT TTGATACTGT AGOTATCTTA 600 
TTTCATTTAT CTrPATTCTT AATOTACGAA TTCATAATAT TTOAT TCfta A ACRAATTTAT 660 
CACTAATTAA CAGAGTGTCA ATTATGCTAA CATCTCATTT ACTGATTTTA ATTTAAAACA 720 
GTTTTTGTTA ACATGCATGT TTAGGGTTGG CTTCTTAATA ATTTCTTCTT CXnXrTTCTCT 780 
CTCTCCTCTT CrTTTGGTCA GTGTTGTGCQ GGTTAATACA ACAAACTGTA ACAA0T6TAC 840 
CTGGTATGOA CTTOTTGTCC OQGACGTACA TTTTOGCGGT CCTGCTAGCA TGCGTGCTCT 900 
TCCACrCTGG CGCCCAGGAO AAAAACTACA CCRTCXXSAOA AGAAAT6CCA G AAAAQg TCC 960 
TGATAGGCGA CTTGTTaAAA OACCTTAACT TGTCGCTQAT TCCAAACAAG TCCTTGACAA 1020 
CTQCTATGCA GTTCAAGCTA GTGTACAAGA C0GGA3ATGT QCCACTGATT CGAATTGAAG 1080 
AGOATACTCG TGAGATCTTC ACTACTGGOO CTOGCATTGA TCGTGAGAAA TTATOTGCTG 1140 
GTATCCCAAG OOATGAOCAT TGCTTTTATG AAGTGGAGGT TGCCaTTTTQ CCGGAT GAAA 1200 
TATTTAGACr GGTTAAOATA CGTTTTCTQA TAGAAOATAT AAATOATAAT OCACCATTGT 1260 
TCCCAGCAAC AGTTATCAAC ATATCAATTC CAOAQAACTC GGCTATAAAC TCTAAATATA 1320 
CTCTCCCAGC OGCTGrTGAT CCTGACGTAG GAATAAAOGG AGTTCAAAAC TAOQAACTAA 1380 
TTAAGAGTCA AAACATTTTT GQCCTCGATG TCATTGAAAC ACCAQAAQQA GACAAGATGC 1440 
CACAACTGAT TGTTCAAAAG GAGTTAGATA GGQAAQAGAA GGAT ACCTAC GTOATGAAAG 1500 
TAAAGGTTGA AGATGGTGGC TTTCCTCAAA QATCCAGTAC TGCIATTTTQ CAAaTQAGTO 1560 
TTACTGATAC AAATGACAAC CACCCAGTCT TTAAGGAGAC AOAOATTQAA GTCAGTATAC 1620 
CAQAAAATGC TCCTQTAGGC ACTTCAGTGA CACAGCTCCA TGOCACAGAT GCTGACATAO 1680 
GTQAAAATGC CAAGATCCAC TTCTCTTTCA GCAATCTAGT CTCCAACATT GCCAGGAGAT 1740 
TATTTCACCT CAATGCCACC ACTGGACITA TCACAATCAA AQAACCACT6 GATAGGGAAG 1800 
AAACACCAAA CCACAAGTTA CTGGTTTTOG CAAGTGATGG TGGATTGATQ OCAGCAAGAO I860 
CAATGGTGCT GGTAAATQTT ACAGATGTCA ATGATAATGT CCCATCCATT GAC3VTAAGAT 1920 • 
ACATCGTCAA TCCTGTCAAT GACACAOTTG TTCTTTCAGA AAATATTCCA CTCAACACCA 1980 
AAATTGCTCT CATAACTGTG ACGGATAAGG ATGOGGACCA TAATQOCAGQ GTGACATGCT 2040 
TCACAGATCA TGAAATCCCT TTCAGATTAA QGCCAGTATT CAGTAATCAG TTC CTCCTOT 2100 
AQACTGCAGC ATATCTTGAC TATGAGTCCA CAAAAGAATA TGOCATTAAA TTACTGGCTG 2160 
CAGATGCTOG CAAACCTCCT TTGAATCAOT CAGCAATGCT CTTCATCAAA GTGAAAGATG 2220 
AAAATCACAA TGCTCCAGTT TTCACXTCAGT CTTTCX3TAAC TOTTTCTATT CCTGAQAATA 2280 
ACTCTCCTGO C3VTCCAGTTG AOGAAAGTAA GTGCAATGQA TOCAGACAGT GGGCCTAATG 2340 
CTAAGATCAA TTACCTQCTA GGCCCTQATG CTCCACCT6A ATTCAGCCTG QATTGTCGTA 2400 
CAGGCATGCT GACTGTAGTG AAGAAACTAG ATAGAGAAAA AGAGOATAAA TATTTAT TCA 2460 
CAATTCTGGC AAAAGATAAC GGG6TACCAC CCTTAACCM CAAT6TCACA GTCrrTGTAA 2520 
GCATTATTGA TCAGAATGAC AATAGOCCAG TTTTCACTCA CAATOAATAC AACTTCTATG 2580 
TCCCAGAAAA CCTTCCAAGG CATGGTACAG TAGGACTAAT CACTGTAACr GATCCTGATT 2640 
ATCGAGACAA TTCTGCAGTT ACGCTCTCCA TTTTAGATQA QAATGATQAC TTCACCATTG 2700 
ATTCACAAAC TGGTGTCATC CGACCAAATA TTTCATTTOA lAGAGAAAAA CAAGAATCTT 2760 
ACACTTTCTA TGTAAAGGCT GAGQATGGTG OTAGAGTATC ACSJTTCITCA AOTGCCAAAO 2820 
TAACCATAAA TOTGGTTGAT GTCAATGACA AC3UU^CCAGT TTTCATTCrrC CCTCCTTCCA 2880 
ACTCTTCTTA TGAATTGGTT CTACOOTCCA CTAATCCAGG CACAGTGGTC TTTCAGGTAA 2940 
TTGCTGTTGA CAATGACACT GGCATGAATG CAGAGGTTCG TTACAGCATT GTAGGAGGAA 3000 
ACACAAGAGA TCTGTTTGCA ATCGACCAAG AAACAGGCAA CATAACATTQ ATQOACAAAT 3060 
OTGATGTTAC AGACCTTGGT TTACACAGAG TGTTGGTCAA AGCTAATGAC TTAGGACAGC 3120 
CTOATTCTCT CITCAGTGTT QTAATKJTCA ATCTGTTOGT GAATGAGTOG GTGACCAATG 3180 
CTACACT6AT TAATGAACTG QTGCGCAAAA GCACTGAAGC ACCAGTGACC CCAA ATACT G 3240 
AOATAOCTOA TGTATCCTCA CCAACTAGTG ACTATGTCAA 0AT0CTG6TT GCA6CTGTTQ 3300 
CTGOCACCAT AACTGTCGTT GTAGTTATTT TCATCACTGC TOTAGTAAOA TGTC6CCAG0 3360 
CaCCACACCT TAAGGCTOCT CAGAAAAACA AGCAGAATTC TOAATOOGCr ACCCCAAACC 3420 
GAGAAAACAO GCAGATGATA ATGATGAAOA AAAAGAAAAA GAAGAAGAA6 CATT CCCCTA 3480 
AGAACTTtSCT 6CTTAATTTT GTCACTATTG AAQAAACTAA GGCAGATGAT OTTGACAGTG 3540 
ATC6AAACAO AOTCACACTA GACCTTCCTA TTGATCTAGA AGAGC A AACA A TGGGA AAGT 3600 
ACAATTGGQT AACTACACXTT ACTACTTTCA AGCCCGACAG CCCTGATTTO GCCC6ACACT 3660 
ACAAATCTOC CTCTCCACAG CCTGCCTTCC AAATTCAOCC TOAAACTCCC CTGAATTCGA 3720 
AGCACCACAT CATCCAAGAA CTQCCTCrCO ATAACACCTT TGTGGCCTGT OACTCTATCT 3780 
CCAAGTOT T C CTCAAQCAGT TCAGATCXXTT ACAGOGTTTC TOACTGTGGC TATCCAGTGA 3840 
OGAOCrroGA GGTACCTGTG TCCGTACACA CCAGACOOGT AGGTAT CX3UI OTTTCTAACA 3900 
CAACTTTCTA ACTATTTTTT TATTATTATT TTCAGXTOAT GTAOAACITT ACAAAATCTA 3960 
TTOACTTCAA AOAGGGATCA AAACAATCAT ATTCTACAGA TGTAOCX»AT AGAT ATATO G 4020 
ATTCAATTAA GTTTGQTAOA AGATGAGAAC AAAATAACTA CTOATTTAGO AAA ATTGGAT 4080 
GCAGAATAAT AATTATAGTA GGGGCAATTT TGTCTOTAGA TGGCAGTAtG ACAATTCTTO 4140 
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CTAQAGAATA TATTGAAAAA AACTTCAACA CAAAOOOTTO TAGCACTGTC CTCAG TACCA 4200 
TT G TGTGCAT GAGQATCAGA ATAGTCTGGG CTAGATACAT CACATTAAAG CTTTTCAGAA 4260 
TCTGATAAAT AGCTCTAAAT ACTAATGATA TTOACAAOCC TAOCTTCACT TGGGAAAATC 4320 
TGTGGCTOTT CACAQAAATT CA6CACCAAG TTATTGOCCC CATACTCTAC CAOOOCTTCA 4380 
GGTOCTCATA AAGAAAAGTG TOGTTTTCAG ATTAQGAACT CAAAATTATT TTGOTGCATC 4440 
AAATCTACAG TCACACAATA TAACAAGAAT GGQATTAGAA AAAT6AAA6C CTACTCATTC 4500 

TCATCTTTAA GCCAGAGAAT QAAATATATA TGAGGTCTCT GGATAGCTAT TTAAATATTT 4560 
6CATATTTAT GCAAGGTATT TTOAGCOCTT CASAA6ACAT TCT 

Seq ID NO I B62 Protein sequence 
Protein Accession ft: NP_055337.1 

1 11 21 31 41 51 

I I I 1 1 1 

MDLLSGTYIP AVLLACWFH SQAQBXHYTI RBEMPENVLI GDLUa>Lm.S LXPNKSLTTA 60 

MQPKLVYKTG DVPLIRIEED TGEIPTTGAR IDREKLCAGI PRDBHCFYBV EVAILPDSIP 120 

RLVKIRFLIB DINDNAPLFP ATVINISXPE NSAIKSKYTL PAAVDPDVOI NGVQNYELIK IBO 

SQNIFOIiDVI ETPE6DKMPQ LZVQKELDRE EKDTYVMKVK VEDGGFPQRS STAILQVSVT 240 

DTNDNHPVFK BTBZEV8IPB NAFVGTSVTQ LHAHMDZOE KAKIHFSPSli LVSNIARRLP 300 

HLHATTOLIT IKEFLDREBT PNRKIiLVLAS DGGLMPARAM VLVMVTDVND NVP8XDIRYI 360 

VNPVNDTWL SENIPUJTKI ALITVTDKDA DHNGRVTCFT DHBIPPRLRP VPSNQPLLET 420 

AAYLDYES7K EYAJKIjLAAD AOKPPZiNQSA MLFIKVKDEM DNAPVFTOSF VTVSIPENN8 480 

POIQLTKVSA KDADSGPNAK INYLLGPDAP PEPSLDCRTG MLTWKXLDR BKEDKYLFTI 540 

LAKDNGVPPIi TSNVTVFVSI IDQNDNSPVP THNBYNFYVP ENLFRBOrVG LITVTDPDYa 600 

DNSAVTLSIL DENDDFTIDS QTGVIRPNIS PDREHQBSVT PYVKABDGGR VSRSSSAKVT 660 

INWDVNDNK PVFIVPPSNC SYELVIiPSTN PGTWPQVXA VDNDTOMNAE VRYSIVGONT 720 

RDIiFAIDQET GNITLMEKCD VTDIjaLHRVL VKANDLGQPD SLFSWIVNL PVNESVTNAT 780 

LIKBLVRKST EAPVTPNTEI ADVSSPTSDY VKXLVAAVAG TITWWIPI TAWRCRQAP 840 

HIiKAAQKNXQ NSEWATPNPE NRQMIMMKKK XKXKXHSPKN IiLUNFVTXEE TKADDVDSDG 900 

NRVTLDLPXD LEBQTMGKYN WVTTPTTFKP DSPDLARHYK SASPQPAFQX QPETPUfSKH 960 

HIXQELPIiDN TFVACSSXSK CSSSSSDPYS VSDGQYPVTT FEVFVSVBTR PVGIQVSNTT 1020 
F 

Seq ID NOt B63 DKA sequence 

Kucleic Acid Accession ft: XM__0591B0.2 

Coding sequence: 276.. 3740 

1 11 21 31 41 51 

I I I I I I 

GCX3G066CCG CGGAGTATCC TGGAGCTGCA GACAOTGOGG GCCTGOGCCC AGTCCCGGCT 60 

GTCCTCOCOQ CGACCCCTCC TCAGCCCTGG GOSCQOGCAC GCTGGGGCCC OGCGGCGCTG 120 

GCOGCCTAGC GAGCXTTGCCG GTCQACCCCA GCCAGOGCAG CGAOGQGGOQ CTGCCTGGCC 180 

CAGGOGCACA CX5GAAQTGCG CTTCTCTGAA GTAGCTTTGQ AAAGTAGAGA AGAAAATCCA 240 

GTTTGCTTCT TGGAGAACAC TGQACAGCTG AATAAATGCA GTATCTAAAT ATAAAAGAGG 300 

ACTGCAATGC CATGGCTTTC TGTGCTAAAA TGAGGAGCTC CAAGAAGACT GAGGTQAACC 360 

TGOAGQCCCC TGAGOCAGGG QTGGAAGTGA TCTTCTATCT GTCGGACAGG GAGCCCCTCC 420 

06CTGGGCAG TG6AGA6TAC AC»GCAGAGG AACTGTGCAT CAGGGCTGCA CAGGCATGCC 480 

GTATCTCTCC TCTTTGTCAC AACCTCTTTG CCCTGTATGA OGAGAACACC AAGCTCTGGT S40 

ATGCrcCAAA TOGCACCATC ACOGTTGATG ACAAGATQTC CCTCXX3GCTC CACTACCX3GA 600 

TGAG6TTCTA TTTCACCAAT TGGCATQGAA CXAAOQACAA TGAQCAGTCA GTGTGGCGTC 660 

ATTCTCCAAA 6AAGCAGAAA AATGGCTACO AGAAAAAAAA GATICCAGAT GCAACCCCTC 720 

TCCTTGAXGC CAGCTCACTG GAGTATCTGT TTGCTCAGGG ACAGTATQAT TTGGTGAAAT 780 

G0CTG6CTCC TATTCGAGAC CCCAAGACCQ AGCAGQATGG ACATQATATT GAGAACXSAflT 840 

GTCTAGGGAT GGCTGTCCTG GCX3VTCTCAC ACTAT6CCAT OATQAAOAAG ATGCAOTTOC 900 

CAGAACTOCC CAAGGACATC AGCTACAA6C 6ATATATTCC AGAAACATTG AATAAGTGCA 960 

TCAGAC3U6A6 GAACCTTCTC ACCAGGATGC GGAIAAATAA TGTTTTCAAG GATTTCCTAA 1020 

AGOAATTTAA CAAGAA6ACC ATTTGTGACA GCAGCGT6TC CAC6CATGAC CTGAAGGTGA 1080 

AATACTTGOC TACCTTGGAA ACTTTGACAA AACATTACGG TGCTGAAATA TTTGAGACTT 1140 

CCATGTTACr GATTTCATCA QAAAATGAGA TGAATTGQTT TCATTCX3AAT GAOGGTGGAA 1200 

AOGTTCTCTA CTACGAAGTO ATGGTGACTG GQAATCTTGG AATCCAGTGQ AOGCATAAAC 1260 

CAAATGTTGT TTCTGTTGAA AAGQAAAAAA ATAAACTGAA G08GAAAAAA CTGGAAAATA 1320 

AACACAA6AA GGATGAGGAG AAAAACAAGA TCOGGGAAGA GTGGAACAAT TTTTCTTACT 1380 

TCCCTQAAAT CACTCACATT GTAATAAAGG AOTCTGTGGT CAOCATTAAC AAOCAGGACA 1440 

ACAAGAAAAT GGAACTGAAG CTCTCTTCCC AOQAGGAGGC CTTGTOCTTT GTGTCCCTGG 1500 

TAOATGGCTA CTTCOGGCTC ACAGC3VGATQ CCCATCATTA CCTCTGCACC GAOQTGGCCC 1560 

CQOOffi T OAT 08TCCACAAC ATAGAOAATO GCTGTCATGG TCCAATCTGT ACAGAATA06 1620 

CCATOUVTAA ATTGCQGCAA GAAGQAAGCG AGQAGGGGAT GTACX3TGCTO AGOTGGAGCT 1680 

GCAC06ACTT TGACAACATC CTCATGACOS TCACCTGCTT TGAGAAGTCT GAGCAGGTGC 1740 

AGGGTGCCXIA GAAGCAG7TC AAGAACTTTC AGATOQAGGT GCAGAAGGGC OSCIACAGTC 1800 

TGCAOGGTTC GGACOGCAGC TTCCCCAGCT TQGGAGACX:T CATQAGCCAC CTCAAOAAGC 1860 

AOATCCTGCXS CAOGGAXAAC ATCAGCTTCA T6CTAAAA0Q CT6CTGCCA6 CCCAAGCCCX: 1920 

GAGAAATCTC CMCCTGCT6 GT06CTACTA A6AAAGCCCA GQAGTGGCAG COCGTCTACC 1980 

CCATGAGCCA GCTGAGTTTC GATOGQATCC TCAAGAAGGA TCTGGTGCAG GGOGAGCACC 2040 

TTGGGAGAGG CACQAGAACA CACATCTATT CTGGGACCCT GATGGATTAC AAGGATGACG 2100 

AAGGAACTTC TGAAGAGAAG AAOATAAAAG TGATCCTCAA AOTCTTAGAC CCCAGCCACA 2160 

GGGATATTTC CCTGGOCPrC TTOQAGGCAG CCAGCATGAT GAGACAOGTC TCOCACAAAC 2220 

ACATOGTGTA CCTCTATG6C GTCTOTQTCC OOQAOGTGQA GAATATCATG GTGQAAGAOT 2280 

TTOTGGAAGO GOGTCCTCTG OATCTCTTCA TOCACOGGAA AAGOQATGTC CTTACCACAC 2340 

CATGGAAATT CAAAOTTGCC AAACAGCTGG CCAOTQCCCT GAGCTACTTG GAGOATAAAG 2400 

ACCTGGTCCA TGGAAATOTO TOTACTAAAA ACCTCCTCCT GGCCCGTGAG GGCATOGACA 2460 

GTGA6TGTGG CCCATTCATC AA0CTCAGT6 ACCOCGGCAT CCCCATTAOG GTGCTGTCTA 2520 

GGCAAGAATO CATTOAAGGA ATCCCATGGA TTGCTCCTGA OTGTGTTGAG GACTCCAAGA 2580 

ACCTGAGTGT GGCT6CTQAC AAGTGQAOCT TTQOAACCAC GCTCTGGQAA ATCTGCTACA 2640 

ATGGOQAOAT CCCCITQAAA OAC3UU3AOQC TOAITGAOAA AOAQAOATTC TATGAAAQCC 2700 



821 



wo 03/042661 



PCTAJS02/36810 



GGTGCAGGCC AGTGACAOCA TCATQTAAGG AGCTGGCTGA CCTCATGACC C6CTGCATGA 2760 

ACTATOACCC CAATCAGAGG CCTTrCTTCC GAGCCATCAT GAGAGACATT AATAAGCTTQ 2820 

AAGAGCAQAA TCCAGATATT GTTTCAGAAA AAAAA0CA6C AACTGAAOTG GACCCCACAC 2880 

ATTTTOAAAA QOGCTTCCTA AAGAGGATCC GrCACTTGOG AGAGOGOCAC T TTGGQ AAOQ 2940 

5 TTGAGCTCTG CAGGTATGAC CC0GAA66G0 ACAATAGAGG 0QAGCAGGT6 GCTGTTAAAT 3000 

CTCTOAAGCC TGAQAGTGGA OOTAACCACA TAOCTGATCT GAAAAAGQAA ATOGAGATCT 3060 

TAAGQAACCT CTATCATGAQ AACATTOTOA AQTACAAAGG AATCTGCACA GAAGACGGAQ 3X20 

GAAATG6TAT TAAGCTCATC ATGGAATTTC TGCCTTOGGG AAGCCTTAAG GAAT ATCT TC 3180 

CAAAOAATAA GAACAAAATA AACCTCAAAC AGCAGCTAAA ATATQCOGTT CAGATTTOTA 3240 

10 AGGGQATGGA CTATTTGGGT TCTOGGCAAT ACXTTTCACXXS GGACTTGGCA GCAAGAAATQ 3300 

TCCTTOTTGA GAGTGAACAC CAAGTGAAAA TTGGAGACTT CGQTTTAACC AAAGCAATTG 3360 

AAACOGATAA GGAGTATTAC ACCGTCAAGG ATGACOGGGA CAOCCCTGTG TTTTGGTATG 3420 

CTCCAGAATO TTTAATGCAA TCTAAATTTT ATATTGCCTC TGACX5TCTGG TCTTTTGGAG 3480 

TCACTCTOCA TGAGCTGCTG ACTTACTGTG ATTCA6ATTC TAGTCCCATO OCTT TGTTC C 3S40 

15 T6AAAATGAT AGGCCCAACC CATGQCCAGA TGACAOTCAC AAGACTTOTG AATAOGTTAA 3600 

AAGAAGGAAA AOGCCTGCOG TGCCCACCTA ACTQTCCAGA TGAGGTTTAT CAACTTATGA 3660 

GGAAATQCTG GGAATTCCAA CCATCCAATC GGACAAQCTT TCAGAACCTT ATTGAAGGAT 3720 

TTGAAQCACT TTTAAAATAA GAAGCATGAA TAACATTTAA ATTCCACAGA TTATCAAGTC 3780 

CTTCrCCTGC AACAAATGCC CAAOTCATTT TTTAAAAATT TCTAATGAWV QAAGTTTGTS 3840 

20 TTCTGTCCAA AAAGTCACTO AACTCATACT TCAGTACATA TACATGTATA AG6CACACTG 3900 

TAOIGCTTAA TATOTOTAAQ GACTTCCTCT TTAAATTTGG TACCAGTAAC TTAGTOACAC 3960 

ATAATGACAA CCAAAATATT TGAAAOCACT TAAGCACTCC TCCTTGTGGA AAGAATATAC 4020 

CACCATTTCA TCTGGCTAGT TCACCATCAC AACTGCATTA CCAAAAGGGG ATTTTTGAAA 4080 

AOGAGGAGTT GACCAAAATA ATATCTGAAQ ATGATTGCTT TTCCCTGCTQ CCAGCT6ATC 4140 

25 TGAAATGTTT TGCTGGCACA TTAATCATAG ATAAAGAAAG ATTGATGGAC TTAGCCCTCA 4200 

AATTTCAGTA TCTATACAQT ACTAGACCAT GCATTCTTAA AATATTAQAT ACCAGGTAGT 4260 

ATATATTGTT TCTGTACAAA AATGACTOTA TTCTCTCACC AGTAGGACTT AAACTTTGTT 4320 

TCrCCAGTGO CTTAGCTCCT G ri ' C C Tri' GG GTGATCACTA GCACCCATTT TTGAGAAAGC 4380 

T6GTTCTACA TGGGQGQATA GCTQTGGAAT AGATAATTTG CTGCATGTTA ATTCTCAAGA 4440 

30 ACTAAGCCTG TGCCAGTGCT TTCCTAAGOV OTATACCTTT AATCAGAACT CATTCCCAGA 4500 

ACXTTGOATOC TATTACACAT GCTTTTAAQA AACGTCAATG TATATCCTTT TATAACTCTA 4560 

CCACTTTGGG OCAAQCTATT CCAGCACTGG TTTT6AATGC TGTATGCAAC CAGTCTGAAT 4620 

ACCACATACG CTGCACTGTT CTTAGAGG6T TTCCATACTT ACCACCGATC TACAAGGGTT 4680 

GATCCCTGTT TTTACCATCA ATCATCACCC TGTG6TGCAA CACTTGAAAG ACCCGGCTAG 4740 

35 AGGCACTATG GACTTCAGGA TCCACTAGAC AGTTTTCAGT TTGCTTGGAG GTAQCTGGGT 4800 

AATCAAAAAT GTTTAOTCAT TGATTCAATG TGAACGATTA CGGTCTTTAT GACCAAGAGT 4860 

CTGAAAATCT T TTTGTTATG CTGTTTAGTA TTCGTTTQAT ATTGTTACTT TTCACCT6TT 4920 

GAGCCCAAAT TCAGQATTGG TTCAGTGGCA GCAATGAAGT TGCCATTTAA ATTTGTTCAT 4980 

AGCCTACATC ACCAAGOTCT CTGTGTCAAA CCTGTGGCCA CTCTATATGC ACTTTOTTTA 5040 
40 CrCTTTATAC AAATAAATAT ACTAAAGACT TT 



8eq ID NO I B64 Protein sequence 
Protein Accesaion #i A3 9577 
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60 
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1 
I 

MOYLNZKBDC 
CIRAAQACRI 
DNEQSVWRHS 
D6HDIENECL 
NNVFKOFItKB 



BEWNNFSFFP 
HYLCTDVAPP 
CFEKSEQVQG 
XRCX!QPKraE 
TU1DYXDDEG 
VQ9IMVBBFV 
LLAREGIDSB 
TTLWEICYNG 
IMRDIKKLEB 
TQEQVAVKSL 
8GSLKBYLPK 
DFGLTKAIET 
DSSPMALFLK 
SFQNblEGFB 



11 
I 

HAMAFCAXMR 
SPLCHNLFAL 
PKKQKMQYBK 
GMAVLAISKY 
PNNKTICD8S 
LYyEVKVTGN 
EITHIVIKES 
IiIVKNIQKGC 
AQKQFIQIFQI 
ISNLLVATKK 
TSEBKRIKVI 
BGGPXiDIiFMH 
CGPFIKLSDP 
BIPLKDKTXiI 
QNPDIVSRKK 
KPBSGGNRXA 
NKNKINLKQQ 
DKBYYTVKDD 
MIGPTHGQMT 
AIiLK 



21 
I 

S8KKTEVHLE 
YDENTKLWVA 
KKIPDATPIiL 
AMMKXMQXiPB 
VSTHDLKVKY 
UGIQHRBXFN 
WSINKQD29K 
HGPICTBYAI 
EVQKGRYSIiH 
AQEHQPVYPM 
LKVlDPSBXtD 
RKSDVLTTPW 
GIPITVIiSRQ 
BKERFYESRC 
NQPTBVDPTB 
DUOCBISZLR 
UCYAVQICKG 
RDSPVFVnfAP 
VTRLVNTLKE 



31 
I 

APBPGVBVXF 
PNRTITVDDK 
DAS8LEYLFA 
LPKDISYKRY 
XjATZjETLTKH 
WSVBKBRIIK 
XMBLKIiSSHE 

GSDRSFPSU3 
SQLSFDRZIiK 
ISLAFFEAAS 
KFKVAKQIiAS 
ECIERIPWIA 
RPVTPSCKEL 
FBKRFZiKRIft 
NIiYHENrVKY 
MDYLQSRQYV 
ECU4QSKFYI 
GKRLPCPPNC 



41 

I 

YLSDREPLRXi 
MSIiRLHYRMR 
QGQYDbVKCL 
IPBTIiKKSIR 
YGABIFETSM 
LKRKKUalkb 
EALSFVSLVD 
GMYVLRWSCT 
DLMSHLKXQI 
KDLVQGEHL6 
MMR0V8HKRI 
AL8YLEDKDL 
PBCVEDSKNIf 
AD IiWTRCyai Y 
OLSEGBFGKV 
KGICTBDGGN 
HRDLAASMVL 
A6DVW8FGVT 
PDBVYQLMRK 



51 
I 

GSGEYTAEBL 
FYFTNWHGTN 
APIRDPKTEQ 
QRNLLTRMRI 
IiLISSENEMN 
KKDEEKNKIR 
GYFRLTADAH 
DFDNILMTVT 
LRTDNISFML 
R6TRTHIYSG 
VYLYGVCVRD 
VHGNVCTKNL 
SVAADKHSFG 
DPNQRPFFRA 
ELCRYDPEDN 
OIKLIMEFLP 
VESEKQVKIG 
LHELLTYCDS 
CHEFQPSNRT 



6eq ZD NO I B65 ONA sequence 

Nucleic Acid Accession #t NM_004867.1 

Coding sequence: 140.. 931 



1 
I 

QATCCCA6AC 
TAACAGGATC 
OOCAGCCCGA 
0GAGGAGGCX3 
GACOGGCAAG 
GCTTACTCTC 
TTACAAGTAC 
TGAGQATCCT 
GGCTQACATT 
TGATAGTOAC 
CTTOTTGCTG 



11 



TCCTCTTGCA 
AGATTGACTA 
CGGCAAGACO 
GAGCTC OGAG 
TTAGGCCTTT 
TTCATGCCCA 
GCAAATTCCC 
CGTaAGQATO 
CCTGCAGCAA 
GGGAACTGCT 



21 
I 

AGTAGTGTTA 
GTCTGCAQOC 
TGGTGAAAAT 
TGGAGGCCCr 
TTOCCACCCA 
CATTCATCTT 
AGAGCACCAT 
TTGQTGGAGO 
ACAACATTOC 
TTATTCATQA 
ATCIGAT60C 



31 
I 

QACTGAAGAT 
CAGGACX3CT0 
G6CCTTCAAT 
CCTGAGCGOC 
OGAAAAAGAG 
GGCA6GACTT 
TTAGGGTGGA 
AGAGOCTAAC 
AATCATTQAT 
CTTTGAAAAO 
CCTCAATACT 



41 

I 

AAA6TAAGT0 
ATTCCAOCAG 
A0CCCTAC06 
ACX3GTCAGAA 
GGCTCCTCTG 
ATTGTTGGTG 
GAGATGTGCT 
TTCCTGCCTQ 
GT6CCTGTGC 
GGAATGACTO 
TCTATT6TTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
040 
900 
960 
1020 
1080 
1140 



51 

I . 

CTGTTTGGQC 
GGCCTTACOG 
C0GT6CAAAA 
CTCAGATACT 
GGAGATGTAT 
GAGCCTGCAT 
TTTTTGATTC 
TGACTGAGOA 
CCAGTTTCTC 
CTTACCTGGA 
TGGCTCCAAA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



AAATCTGGTA 
G0TTGGA6AA 
TATTTACCAA 
GOGTTTCAA.C 
TATTGTTX3A0 
AAGAAGTCAO 
TTACTCATQC 
AA 



GAGCTCTTTG 
GACCTAGTTG 
CTTTGCAATA 
AAACGTGCCA 
ACCAAGATCT 
AGATTTACAA 
ATTTACTCTA 



6CAAACTGGC 
CTGTGGAGGA 
ACAGAAAGTC 
TTC3ATAAATQ 
OTCAAGAGTA 
TATGACTTTA 
TTGCTTAT6C 



GAGTGGCAQA 
AATTOGTGAT 
CTTCGGCCrr 
CTG6AAGATT 
AGAGGCAACA 
ACATTAAGGT 
OOTAAAAAAA 



TATCTOCCTC 
GTTAGTAACC 
OGTCOCAOAG 
AGACACTTCC 
GATAOAOTQT 
TTATGG6ATA 
AAAAAAAAAA 



AAACTTATGT 720 

TTGGCATCTT 760 

ACCTCTTGCT 840 

CCAAOSAATT 900 

CCrrCGTAAT 960 

CTCAAGATAT 1020 

AAAAAAAAAA lOBO 



Seq ID NO: B66 Protein sequence 
Protein Accession #: NP_004B5e.l 

X 11 21 31 41 SI 

I I i 1 1 ) 

MVKIAFNTPT AVQKBEARQD VEALLSRTVR TQILTGKBLR VATQBKBGSS GRCMLTU/31* 
SFZLAGIiXVQ GACIYRyPMP KSTZyROaCC PFDSEDPANS LRGGEPNFLP VTEEADIRED 
ONZAIZDVPV PSFSDSDPAA XIBDFBKGMT AYIiDXiLIiGNC YLMPLNTSIV MPPKNLVELP 
GKLASGRYLP QTyWREDLV AVEBIBDVSN LOXPXYQLai NRXSFRLRRR OLLLGFNKRA 
XDKCHKIBKF ENBFIVETKI OQB 

Seq ZD NOt B67 DHA sequence 

Nucleic Acid Atecession #: XM_0B3862.1 

Coding sequencet 121. .813 



60 
120 

leo 

240 



AOQOC3G6A6G 
CCTTGAACCT 
ATGGGACAOC 
6CAAAACATG 
AQAGTAGCT6 
CTCTTTGAGG 
GTCTGTACCA 
TACCAGTTTA 
AGGTCCACCT 
CTTTGGATCX} 
GGGGGGGCTG 
TGGAGTQATC 
TCTTGGOTGG 
GCTGATGAGG 
TTGTGG6CTC 
ACAG6AAAAT 
TTAAAAATAA 



11 
I 

TCT6AGCTGT 
GG6AAACTAT 
AAATTTCGGA 
TTA08TTGGT 
AGCTTAATGA 
TACAACCTGG 
AGATTAACAA 
TTCAACTTTA 
CTOAAQAACC 
GAAGGGTGAA 
ACCTCATOCT 
GACACAGGAA 
TATCAGATGC 
CA6GCCAGCC 
AAATATTT6G 
CCATTTTCCC 
ACTTCCCATG 



21 
I 

GGGCIGAGGC 
OTQAAGCAAC 
TCAGACACAG 
TGGAGAGAGT 
TOTAAOGGC^ 
GTCTGATTCC 
AAGCAGTGGC 
TAAAGATATC 
TOATGAAAAC 
GCAGCTOACC 
OCCTTOTOCT 
TTGCCCTATT 
ACCCACTQAA 
CCACAGGCCA 
TCATG6GQQA 
CACTCTTTTA 
TCTTCC3UVAA 




ATTGTGGAGG 
ACAAOTCAAG 
TCATCXrrCTG 
GATQAGGAGO 
CACAGCTTTT 
tgt06cctac 
GATGATATGG 
TGACCTTGAA 
AGAATOCAGQ 
TTTTT6CIAT 
AAAAAAAAAA 



41 

1 

CT606CCAGG 
TTGAAAGACA 
ACAAOTTACC 
CTTATGAAGA 
CTGGCCAGGA 
GGAAAGTGGT 
CATCAOGGAT 
CAGCAGGAGT 
TAACATCTTG 
AGTOrrGTAT 
GTCAGAAGTO 
AGATGACIGG 
CTAACTATAT 
QTGAAAGTCT 
8TTGTG0CAC 
TCTGATCATT 



51 

I 

GTG06GGATG 
TCTTTTCATC 
AGAAAAAGTA 
ATTTCTOGGO 
AAAACATCTT 
TGTAOC^SGTG 
CATGAATTTA 
ATOGGCACAG 
TCAGGCXAGT 
CTGTATGGAT 
TATTGATAAA 
AGCAAATGAA 
TCTTAACATG 
TCTGTTGCTA 
TOGCAOGAC 
TtfrCCOOCTT 



Seq ZD NOt B6B Protein sequence 
Protein Accession #i XP_0B3B62.1 

21 



11 21 31 41 SI 

i I i I i t 

MGQQZSDQTQ LVZNKLPEKV AKHVTLVRBS GSLTYBEFLG RVABLNDVTA KVASGQBKHL 
LFEVQPGSDS SAFWKWVRV VCTKZNKSSO ZVBASRZMNIj yQPZQLYKDZ TSQAA6VLAQ 
SSTSEEPDQI 8SSVTS0QAS ZA3M6RVKQLT DBBBCCZOfD GRADLZLPCA HSFOQKCZDK 
WSDRHRNCPI CRLQMTGANB 8WWSDAFTB DDMANYZLNM ADEAOQPHRP 

Seq ZD NOt B69 DNA sequence 

Nucleic Acid Accession «i NM_002975.1 

Coding sequences 1B0..1151 



OQAGCAAGGG 
GGAACCTGGG 
ACCCCTCCCA 
TGCAGGCAGC 
GGGCT06GGQ 
GGGAGAGGGA 
GGGGGGATQA 
AOCAGGGGGA 
CTCCCACCCC 
GCCTGCACCA 
AOQGGCT OO G 
AGGAGGGQCA 
TGCGCCTGGG 
AGGOGCGGTG 
AGGOGCTCAC 
6G3TGCA0GA 
TCTTGGCCTG 
ATCOGCTCAO 
CTGACGAOGO 
TCCCCTTCTA 
TGOGGGBtGC 
GCTCCXX30AC 
TTOTTAQTGT 
IQTGQAATCT 



11 
I 

AOOGGACAGA 
GGCTAGTGAC 
CCCC ftQACA T 
CTGOCTTTTG 
AGCAGAGAGO 
GGCCCTGATG 
GAATCCTGCX; 
GGAAGAGGAG 
TGAGGACATC 
GCTGCA08TC 
GC3VGCTGCX36 
GGGT0GG6CC 
CCACAAGTGC 
CA0GG0G0G6 
TOQGTACCTO 
TGQGCGOGCC 
GCAT06CTCA 
CCOQGACCAO 
CTCCTGGTGG 
G0G6GG00GG 
OCAGCCTOCT 
CCOCAOTCTG 
CTTTCCTTGA 
QAAAAAAAAA 



21 
I 

GAOGAGGAGA 
CTGCAGACAG 
CCAQACATCT 
GGGGCTTTGG 
QAGTGGGAGO 
CTOA AOCATC 
GGAACTGTTO 
GAGGAA6CAA 
GTCACTTACA 
OOTCTGC3106 
AA0G0G6CA6 
GA0C60QAGC 
TTCCTGCTCr 
GGOGGQAGCG 
OG0606GGGC 
GAGGGCCTCT 
GCCC6CCCGG 
CCCAACGGTG 
GACCAOGACT 
TAOOCX36CCT 
COQGAKTOOC 
GGOGCTTCIG 
AGGOGOGGOC 



31 

I 

GGAACAGGAA 
GGCA GQGGC A 
GGAACTTTGG 
TGOTCCXXXai 
GAGGCTGGGG 
TGCAGGAAOC 
AGGQAAAAGA 
CGCCAACCCC 
TCCTGGGCCG 
CGTTGGACAC 
GCQAGACOOS 
AOGGCGGCTT 
CGOGOSACTT 
TGGOGCAGCC 
TG6CTCCCTA 
ACCTCTTOQA 
AGCTG6606C 
GCACSCTCQA 
GCCAGOGGCG 
CCTTG0CCA7 
CCTTCCCTTC 
GGAGGGCTCT 
ACCAGGCTAO 



41 

I 

GAQAQAAGCT 
CTCG6CAGTT 
GTGCCAAGAG 
QCTCTTGGGC 
AGGTGCOCAG 
CCTAOGACTG 
GGACTGGOAG 
ATCCTCCGGC 
CCTGGCXX3GC 
CGGOGTGGTC 
08AT6C0GTG 
GGAGGGCTGC 
OQAAGCTCAG 
GGCAGACCGC 
CAACTGGCCC 
AAAOQGGCAO 
CCAGCC CAOC 
GAACTGOOTG 
TCTCTACTAC 
CCCACCACCC 
CTGGCCACGA 
TGOGGTGCOG 
GTCOGOTGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 



51 
I 

GGGAGAATOO 
CCCA6AGGCC 
TCCAGCTTAA 
TTTGGCCATG 
GAGG AQGAG C 
CCTQCTGGGA 
ATGGAGGAGO 
CXX3W3CCCCT 
CTGGACX3CAG 
GAQCTQACCC 
CAAGCCCT6C 
CTGAAGG6GC 
GOGGCG606C 
CAGCAGAT6G 
GTGTGGCrOQ 
GGOGTGTOCT 
GCCTGGCOOC 
G06CAGGGCT 
GTCTGOQAGT 
GGCCTTTCCC 
ATG6CAGGGT 
GCACTCCTCC 
AATAAATCCT 



60 
120 
IBO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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Seq ID HO I B70 Protein sequence 
Protein Accession it NP_00a966.1 

1 11 21 31 41 51 

KQAANLLQAL WPQLUSFGH GARGAEREHE GGWGGAQEEB REREAU4LKH LQBAI/^LPAG 60 

RGDBHPAGTV BGKEDWEMEE DQGEBBBEEA TPTPSSQPSP SPTPEDIVTY XLGRLAGLDA 120 

GLHQLHVRLH ALDTRWBLT QGLRQLRHAA GDTRDAVQAL QBAQ6RAERE KORLBGCLRG 160 

LRL6HKCFLL SRDFEAQAAA QARCTARG6S LAQPADRQQH BALTRYLRAA lAPYNWPVHIi 240 

6VHDRRABGL YIiFENGQRVS FFAWHRSPRP ELOAQPSASP HPLSPDQPNG GTLENCVAOA 300 
S0D6SWMDUD OQRRIfYYVCB FPF 

Seq ID NOt B71 DKA sequence 

Hucleic Acid Accession 9t BC000839.1 

I 11 21 31 41 51 

CTGQCCAGGG CmCTGGOGG ATAAGGTCTT GTGOGTGQCC TOGAGQCTTA AAAOTAGCAO 60 

TGGGGCTTTQ TGAAGOACAA AATGGCQATQ G0SGGC06TG TAGQTOCCCC TTCCTATQAT 120 

GAGGACCTTT TCACAOACCT GTACTGAGCT CXXTTGAGGAT AAGTAACTCT GAGGAGATGG 180 

GCCCreCAAG CCTCCTTCTT AGCCGTCTGT TCAOAAAATA OCXJTTTTOaA AATGCCCTGA 240 

GTTGACCTAA TGTCTTATTG GGCTCCTGTC TGCAC3GATTT AC3G06CAOGT TGGAACOOAA 300 

GAGAQCTCTG TTGTTGCAAT GTTCAGCCCA CAAGAGCTTA CTGGTGAAOG AATGGGA CAA 360 

GACCX3VTCTT TATGCAAAGC CAGCGTTACA GTAATGTTCC AiGCATCTCAT AATCTATCXTT 420 

GGGQAATTCA GCT6CCTCCC AGGGTGAATA CAGGTATTCC TOATQACAQT CTGCCTCTAT 480 

CTTACAGAGC AGCTTGTTGC TATATACCAT TGAAAAGCCT TCAGAGCTGA GAGGTACTAC 540 

TAACCAATAA CCTGCTTGGC TCAAAGGGCC AGCACCTTCT CTCTAAAGCC CAAGAGGAGT 600 

TTGAGGAAAA CTAGGTGTCT GTGTTCACTC CAG6CTGAAS TTACAOGTCT GAGCAAATAA 660 

GGTGTATAAA AAATGGAATC TGTCTTGGAG GACATCAGAA GGTGAATTTT CCAA6TTCTT 720 

GGACAACCTA GCTGTTGAAA AGCTTTCTOG GTTTGGGGOQ TATTTCACSAT GTA CCTTA AA 780 

GTGTTAGCAO ATACAGATTA AGACACTGGG AGCCAATGAA ACAGCAGTTO AGGGTTTOCT 840 

GTGTATCACA TTTCTGTATT TTATCACCCX: CTTCCTGCAA CATTATTTAT CTGGAAT CTA 900 

CXTTQCXXnTT TGTTTTTTAQ ATACAAGGGC TTGGTTTTQT TACCCAGGCT GSTTT^AGG 960 

CCATAOCTTT AAGAGATCCT CTCACCACAQ ATTTCCAAAG TGCTGGGATT GCAGGT6TGA 1020 

TTCATGGCAC CCAGACTTTG CTGCCTTTCT TACATGATCC AGGCCCAGAA CCCAAACTCA 1080 

GGGACTGTAT AGATGACCAC TTTCGTAAAC TACTGACCTA GCTTGTTGCC AATTGTTQAT 1140 

TGAACTTCCC ATAACTCCAC TTCOrGTCTQ TTCCTCTGTA TAC AOCCAC C TTCT6TTCCC 1200 

GTCATGAGCC TTTAGQTCTC CATTT6CATA TTGCAAATAC TA'fGTTCCAT OTAOQT AGCT 1260 

CATTCAGGGC CTTGCTCTTC ACTTCAAAAA AGGTTCCCTT GAGOACTOOC TGTCAATTTO 1320 

TGTTGCTGTG TlW n WX ' G ATGAAAATAA TAAAATGATT GATTACATAA AAAAAAAAAA 1380 
AAAAAAA 

Seq ID NOt B72 DNA sequence 

Nucleic Acid Accession #i CAT cluster 

1 11 21 31 41 51 

i I I I I I 

TGCTAGTAGG GCCTGGGTTA AT0GGCCX3AG GGTGGCTTGQ TGQTCTTTAT AGCTGTTACT 60 

CTTTOTACTT GTCTTTTTCT TTTATTTTCT TTTGAQCGAT TGTGOGAACA TAGCATAGCA 120 

OBCSICTATGG CSIOQAOGCTC GTGCXGCCTO G0CA6GGC6A CTGGOGGATA AGGTCTTOTG 180 

0GTGGCCT06 AGGCTTAAAA GTAGCAGTGG GOCTTTOTOA AGGACAAAAT GG0GATGGCX3 240 

OGCCGTGTAO OTCCCCCTTC CTATGATGAG GACCTTTTCA CAGACCTGTA CTG AGCTC OQ 300 

TOAQQATAAA TAACTCTGAG GA6ATGGGCC CTGCAAGCCT CTTGCTTAGC 0GTCT6TTCA 360 

GAAAATAGCG TTTTOGAAAT GOCCTGAGTT GACCTAATGT CTTATT6GGC TCCTOTCTGC 420 

AGQATTTAOO CX3CA06TTGG AA00GAA6AG AGCTCTGTTG TTOCAATGTT CAGCCCACAA 480 

GA6CTTACTG GTGAAGGAAT GGGACSVAGAC CCATCTTTAT GCAAAQCCAG OGTTACAGTA 540 

ATGTTCCAGC ATCTCATAAT CTATCXH^GG GAATTCAGCT GCCTCCCAGQ GTQAATACAO 600 

GTATTCCTQA TGACAGTCTG CCTCTATCTT ACAGAOCAGC TTGTTGCTAT ATACCATTGA 660 

AAAOCCTTCA QAGCTGAQAO GTACTACTAA CCAATAACCT OCTTGGCTCA AAGGQCCA6C 720 

ACCTTCTCTC TAAAGOOCAA GAGQA tflTW AGGAAAACTA GGTGrCTGTG TTCACTCCAG 780 

GCTQAAGTTA GAGOTCTOAG CAAATAAGGT GTATAAAAAA TGGAATCTGT CTTGGAGOAC 840 

ATCAGAAGGT GAATTTTCCA AGTTCTTGQA CAAOCTAGCT GTTQAAAAGC TTTCTGGQTT 900 

T6GGGGGTAT TTCAGATGTA CCTTAAAGTQ TTAGCaVGACA CAGATTAAGA CACrGQGAOC 960 

CAATGAAACA GCAGTTGAGG GTTTGCTGTG TATCACATTT CTOTATTTTA TCAOCCCCTT 1020 

CCTGOACAT TATTTATCTO OAATCTACCI GCCCTTTTGT TTTTTAGATA CAAGGOCTTG 1080 

GTTTTGrrAC CCAGGCTGGT TTCAAGOOCA TAGCTTTAAG AQATCCTCTC A CCACA QATT 1140 

TCCAAAGTGC TGGGATTGCA GOTOTGATTC ATGGCACCCA GACTTTGCTG CCTTTCTTAC 1200 

AIGATCCAOG CCCAGAACCC AAACTCAGGC ACTOTATAGA TGAOCACTTT OGTAAACTA C 1260 

TGACCTAGCT TGTTGCCAAT TGTTGATTQA ACTTOXATA ACTCCACTTC GTGTCTGTTC 1320 

CTCTGTATAC AGCXSUXJTTC TGTTCCOQTC ATGAQCCTTT AGGTCTCCAT TTGCATATTG 1380 

CAAATACTAT GTTCCATOTA GGTAGCTCAT TCAGGGCCTT GCTCTTCACT TCAAAAAAGG 1440 

TTCCCTTQAG GACTGGCTOT CAATTTGTGT TGCTGTGTTG GTTOTTGATG AAAATAATAA ISOO 

AATGATTOAT TACATAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACAAA CACAAAAAAC 1560 
GGCCQ06CTT TTTCCOQGGC GCACAAAGTT ATAAAA060C QTCCATC 

Seq ID NOt B73 sequence 

Nucleic Acid Accession St NM_000222.1 

Coding sequence: 22.. 2952 

I II 21 31 41 51 

II 1111 

GATCCCAT09 CA0CTA008C GATGAGA6GC GCT060QG0G CCTGOGATTT TCTCTGCXmf 60 
CXCCTCCTAC TGCTTOGCGT CCA0ACA06C TCTTCTOUkC CATCTGIGAG TCCAGGGQAA 120 
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COGTCTCCAC CATCCATCCA TCCAGQ AAAA 
ATTAGGCTOT TATOCACTQA TCCGGQCTTT 
AOQAATGAOA ATAAGCAOAA TGAATGQATC 
AAATACAOOT QCAGCAACAA ACAOSaCTTA 

5 ccTGocAAoc vrritX ' i ia T tgacogctcc 

GTCOSCTGTC CTCTCACAQA CCCAQAAGTG 
AACCCTCTTC CCAAGGACTT GAGOTTTATT 
AGTGTQAAAC GCGCCTACCA TOOGCTCTGT 
TCA QT Q C TGT OGQAAAAATT CATCCT6AAA 
10 GTGTCTGTGT CCAAAGCAAG CTATCTTCTT 
ACAATAAAAO ATGTGTCTAO TTCTOTOTAC 
AAACTACAGO AGAAATATAA TAQCrCGCAT 
ACGTTGACTA TCAGTTCAGC GAQAOTTAAT 
AATACTTTTO GATCAQCAAA TGTCACAACA 
15 AATATCTTCC CCATQATAAA CACTACAGTA 
ATTGTrOAAT ATQAAGCATT CCCCAAACCT 
ACCTTCACTG ATAAATGGGA AGATTATCCC 
GTAAGTGAAC TTCATCTAAC GAGATTAAAA 
GTGTCCAATT CTGAOGTCAA TGCTGCCATA 
20 GAAATCCTGA CTTACGACAG GCTCGTGAAT 
CCAGAGCCCA CAATAGATTO 6TATTTTTGT 
6TACT6CCA0 TGGATGTGCA GACACTAAAC 
GTTCAGAOTT CTATAGATTC TAGTGCATTC 
TACAACXSATG TGGGCAAGAC TTCTGCCTAT 
25 GAGCAAATCC ATCCCCACAC CCTOTTCACT 
OGCATGATGT GGATTATTGT GATQATTCTG 
GAAGTACAOT GGAAGGTTGT T0AGGA6ATA 
ACACAACTTC CTTATGATCA CAAATGGGAG 
ACCXrrGGGTG CTGGAGCTTT CQGGAAGGTT 
30 TCAGATGCGG CCATQACTGT CGCTGTAAAG 
OQGQAAGCCC TGAT6TCTGA ACTCAAAOTC 
6TQAATCTAC TTOGAOCCTO CACCATTGGA 
T6CTATGGTG ATCTTTT6AA TTTTTTGAQA 
CAGQAAGATC ATGCAGAAOC TGCACTTTAT 
35 TQCAGCGATA GTACTAATGA GTACATOGAC 
ACCAA060CQ ACAAAAGGAO ATCTGTQAOA 
GOOOCCATCA TOOAGGATGA OGASTTGGCC 
TACXyWSGTGQ CAAAGGGCAT GGCTTTCCTC 
GCAGCCAQAA ATATCCTCCT TACTCATGGT 
40 6CCAGAGACA TCAAGAATGA TTCTAATTAT 
AAOTGGATGO CACCTGAAAO CATTTTCAAC 
TCCTATGQGA rmTCTr T Q GGA6CTGTTC 
CCQGTCGATT CTAAGTTCTA CAAGATGATC 
CACXSCACCTG CTGAAATGTA TGACATAATG 
45 AGAGCAACAT TCAAGCAAAT TGTTCAGCTA 
CATATTTACT CCAACTTAGC AAACTGCAGC 
TCTOTGCGGA TCAATTCTGT CX3GCAGCA0C 
GAC6ATGTCT GA6CAGAATC AaTOTTTGGO 
OCTTCCATOA TGGTTATTTT CTTTTCTTTC 
50 CCCCACTGCA ATCCTGTCTT TCTGAOCACA 
CACCATGCTA TTGCAAAGGT TCCAACTGTA 
ATGAACAOAA AAC3VTTCTGA TTT6GAAAAA 
CGTTTCCAA0 GCTTCTCC3VA TTCTGCCCAA 
GGTAGTAATC ACS^GTTGGOC TTCAGAACCA 
55 A6CTQAAAAC CTAAGTCCTT TATQTGGAAA 
AXGAACACCT GGGCTTAAGA AATCTAOTAT 
AAAAAAtGAT CCCCAA6TQT GAACAAAAQA 
TATACTACOO ACCTG6TTTT TAAATAQAGT 
GCCTCCXTTAG CCAGCACTTG TATATAOOCA 
60 GGGGAAAACA CCATAAG6TT TOGTT TCTGT 
TAGAAOTAGA TTAAGAGCCA TATAAOmG 
AACAATATAA CCACAAAGCA CAOTTTGAAC 
TCTGTAGATT CTXTrOGAACA AGCCTATCAO 
GATQCTGTTT OACAAAQTTA CTOATTCACT 
65 GCGATCTTAQ TTTGGATTCT TATGTAGCAQ 
AGGOOGTCC TOGAG ACOM OOCftgTAT CT 
AGACAAATAT TTGGAGGGGT ATTTTT60CC 
AAGTAACTTQ GCTTTCATTA TTAGTACTOC 
TAGCTTACCA GAAGCTTCCA TA0TX3GTQCA 
70 TTTGCAGTTC ACCTGCACTT AAGGOCTCT 
OCTTAGACCT TCCATAATGC TACTGTCTCA 
GTA6CCTGGA TATTATTCTT GTAGTTTACC 
AACTCCCCTT CCTCACTOCC CAATATAAAA 
TTGTCTTGAA AQATTCAGGT ATGTTGCCTT 
75 TACATTTAGA GAACTG TGGC CGTT ATCTGG 
CTOQCACCTT TGCAAAOTTA ACAGATTTTO 
TTGCCATACr TTGTCTGAAA AATTCCTTTG 
AAOTGOTTGT TA6TTATAQA TGTCTAGOXA 
TTGCCATACT TTGTCTGAAA AATTCCTTTG 
oO AAOTGQTTGT TAOTTATAGA TGTCTAGQTA 
AATGTCTTTT OAATATTCCC AAG0CX31TGA 
AACTTTATQT GTAAATACAT AAGGGQOQTA 
TTATTGCXOT ATGTT6TCCA ATTGTT6ACA 



TCAGACTTAA TAGTCOGCGT GGGOQACGAO 180 
QTCAAATGGA CTTTTGAGAT CCTGGATGAA 240 
A06QAAAAGG CAGAAGCCAC CAACACOGGC 300 
AGCAATTCCA TTTATGTGTT T6TTAGAGAT 360 
TTGTATG6GA AAGAAGACAA 0GACA06CTG 430 
ACCAATTATT CCCTCAAGGO GTGCCAGGGO 480 
CCTGACCCCA AGGCGGOCAT CATGATCAAA 540 
CIGCATTGTT CTGTGGACCA GGAGG6CAAG 600 
GTGAGGCCAO CCTTCAAAGC IXyiXSCC f Q'A'T 660 
AGGGAAGGGQ AAGAATTCAC AGTGA0GT6C 720 
TCAACGTGQA AAAGAGAAAA CAGTCAGACT 780 
CACGGTGACT TCAATTATGA AOQTCACGCA 840 
QATTCTGGAQ TGTTCATGTG TTATGCCAAT 900 
ACCTTGGAAG TAGTAGATAA AGGATTCATT 960 
TTTQTAAAOO ATGGAGAAAA TGTAGATTTG 1020 
GAACACCAGC AGTGGATCTA TATQAACAGA 1080 
AAGTCTGAGA ATGAAAGTAA TATCAGATAC 1140 
GGCACCGAAO GAG6CACTTA CACATTCCTA 1200 
GCATTTAATG TTTATGTQAA TACAAAACCA 1260 
OGCATGCTCC AATGTGTGGC AGCAGGATTC 1320 
CCAGGAACTQ AGCRGAGATG CTCTGCTTCT 1380 
TCATCTGGGC CACOGTTTGG AAAGCTAGTG 1440 
AAGCACAATG 6CAC6GTTGA ATGTAAGGCT 1500 
TTTAACTTTG CATTTAAAGQ TAACAACAAA 1560 
CCTTTQCTGA r m UTT T CGT AATOGTAGCT 1620 
ACCTACAAAT ATTTACAGAA ACCCATGTAT 1680 
AATGGAAACA ATTATGTTTA CATAGACCCA 1740 
TTTCCCA6AA ACAGGCTGAG TTTTGGGAAA 1800 
GTTGAGGCAA CTGCTTATGG CTTAATTAAG 1860 
ATGCTCAAGC CGAGTGCCCA TTTOACAGAA 1920 
CTGAGTTACC TTGGTAATCA CATGAATATT 1980 
GGGCCCACCC TGGTCATTAC AGAATATTGT 2040 
AGAAAAOOTG ATTCATTTAT TTGTTCAAAG 2100 
AAGAATCTTC TGCATTCAAA GGAGTCTTCC 2160 
ATGAAACCTG GAGTTTCTTA TOTTOTCCCA 2220 
ATAGQCTCAT ACATAGAAAG AGATGTGACT 2280 
CTAGACTTAG AAGACTTGCT GAGCTTTTCT 2340 
QCCTCCAAGA ATTGTATTCA CAGAGACTTQ 2400 
OGGATCACAA AGATTTGTGA TTTTGGTCTA 2460 
QTGGTTAAAG GAAACQCTCQ ACTACCTGTG 2S20 
TGTQTATACA CGTTTGAAAG TQAOGTCTGG 2580 
TCTTTAGGAA GCAQCCCCTA TCCTGOAATO 2640 
AAGGAAGOCT TCCGGATGCT CAGCCCTGAA 2700 
AAGACTTGCT GGGATGCAGA TCCCCTAAAA 2760 
ATTQAGAAGC AOATTTCAOA QAGCACCAAT 2820 
CCCAAGC6AC AGAAGCOOGT GGTAGACCAT 2880 
GCTTCCTCCT OOCAGCCTCT GCTTGTQCAC 2940 
TCACCCCTCC AGGAATGATC TCTTCTTTTG 3000 
AACTTGCATC CAACTCCAGG ATAOTQGGCA 3060 
.CTTTAGTGGC CGATGATTTT TGTCATCAGC 3120 
TATATTCOCA ATA8CAAC6T AOCTTCTACC 3180 
GAQAGOQAGO TATQGACTGG GGGCCAGAGT 3240 
AAATATGGTT GATA6TTTAC CTGAATAAAT 3300 
TCCATAGTAG TATGATGATA CAAOATTAGA 3360 
ACAGAACATC ATTAGAACAA AGGACAGAGT 3420 
TTCATQCTGG GAATGAGACA TAGGCCATGA 3480 
TOCTCTTCTO TGGACCACTG CATGA6CTTT 3540 
TTGCTATTAG AGCATTQAAT TGOAGAGAAO 3600 
TCTATAAATT OTCCGTGTTC ATACATTTGA 3660 
ATACAACCCT GGCATTATGT GGACrGTGTA 3720 
AAGGAAACAG TTAATACCAT TTTTTAAGGA 3780 
AAAATCTCCT CTTTTAGCTO ATGAACTTAT 3840 
CTTCAGAATG QCATTGTACT CAATGQATTT 3900 
GCATGGCTCC CACAGGA6TG GGAAAACACT 3960 
GAAATAAAGT ATAGQTTTA6 CCTCCTT06C 4020 
ATATAT6TGT ATGTAOGTTT 6TATGTGTGT 40BO 
TGAGTCCAAG AGQGTCCTTT AGTACCT6AA 4140 
TCTTOTTTCT TTTCACATAG CTGTCTAGAG 4200 
GAGGAAGTGG AAGGCATCAG TGCCTATGTA 4260 
GTTATTTA6A CTCATCTTAC TSTACCTGTT 4320 
CTGAAACATT TAAATTTCAC CCTTTAGACT 4380 
TCTTTAAAAA CAAAACAAAA CAAAACAAAA 4440 
GGCAAAT6TG TACATGGCAG AGTTTGTGTG 4500 
TATGGTTTCC CCCTTCTACA TTTCTTAOAC 4560 
AAGTAACCAT TTOCACTGGA 6TTCTATGCT 4620 
GGaTTGTGTT GTCM3CCAAG AGATTGTTGT 4660 
TQTTTCTATT GACTTCAATQ ATAGTAAGAA 4740 
CTTCAGGGGC ACTTCATTOA 6A6TTTT0TC 4800 
TGTTTCTATT GACTTCAATG ATAGTAAGAA 4860 
CTTCAGGGGC ACTTCATTGA GAGTTTTGTC 4920 
GTOCTTGAAA ATATTTTTTA TATATACAGT 4980 
AQTrTAAAGQ ATGTTGGTGT TCCAOGTGTT 5040 
GTTCTGAA6A ATTC 
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PCT/US02/36810 



Seq ZD MO: B74 protein sequence 
Protein Accession 0t NP_000213.1 

^1 11 21 31 41 51 

Sill III 

HRGARGA1«DP XiCVLLLUiRV QTGSSQPSVS PGEPSPPSIH PQKSDLIVRV GDEIRLLCTD 60 

PGFVKNTFBI LDBTNQIXQN BHITEKAEAT KTGXmmiK HOLSNSiyVF VHDFAKLFLV 120 

DR8LYGKEDN DTLVRCPLTD PBVTNYSLKG CQGKPLPKDL RPIPDPKAOI MIKSVKRAYH 180 

RLCLHCSVDQ BGKSVI*SBKP ILKVRPAFKA VPWSVSKAS YLLRBCEBPT VTCTIKDVSS 240 

10 SVYSTWKREN SQTKLQEKyN SWHKGDFmrB RQATLTZSSA RVNDSGVFHC YAMNTFQSAN 300 

VTTTLBWDK 6PINIFPMIH TTVFVNDGEBI VDliIVEYBAF PKPBHQQWXY MNRTFTDKHE 360 

DYPKSBHESN IRYVSELHLT RUCaTEGGTY TFLVSHSDVN AAIAFNVYVN TKPBILTyDR 420 

LVNGHLQCVA AGPPEPTIDW YFCPGTBQRC 8ASVLPVDVQ TUISSGPPFG KLWQSSIDS 480 

SAPKHNGTVB CKAYNDVGKT SAYFNFAFKG NNKEQIHPHT LFTPUjIGFV IVAOWCIIV S40 

15 MIIjTYXYIjQK PMYBVQWKW EBINGNNYVY IDPTQLPYDH KWBFPRNRLS POiCn/SAGAP 600 

GKW&ATAY6 LZRSDAAMTV AVXMLKP8AH I*TSRBAZiNS& LKVLSYLQIB MNZVNIiLOAC 660 

TIGOPTLVIT BYCCYGDLU9 FLRRKRDSFI CSKQEDHAEA ALYKNLLHSR SSSCSDSltIB 720 

YMDMKFGVSY WPTKADKRR SVRIGSYIER DVTPAIMEDD EXiALDLEDLL SFSYQVAKGM 780 

AFI*ASKNCIH RDLAARKILL THGRITKIOD FGLARDIKND SNYWKGNAR LPVKWMAPES 840 

20 ZFNCVYTFES DVKSYGIFLN ELFSLGSSPY PGHPVDSKFY KNIKBGFRML SPEKAPAEKY 900 

DZMKTCHDAD PLXRPTFKQZ VQLZBXQI8B STKHZYSNLA NC8PNRQKPV VDHSVRXN8V 960 
6STASSSQPL LVHDDV 



Seq ZD NOi B75 DNA sequence 
25 Nucleic Acid Accession #t NM_004456 
Coding sequence! 58.. 2298 

1 11 21 31 41 51 

in ' ' ' ' ' ' 

3U GAATTCOGGG 0QACGGGC6G GAACAAGQOG AOTOOGGGCG OGGQACGAAG AATAATCATG 60 

GG0CAC3ACTG GGAAGAAATC TGA6AAGGGA CCAGTTTGTT 0G0GGAAGCX3 TOTAAAATCA 120 

6AGTACATGC GACTGAQACA GCTCAAGAGG TTCAGAOQAO CTGAT6AAGT AAAGA6TATG 180 

TTTAGTTCCA ATCOTCAGAA AATTTTGGAA AGAACGGAAA TCTTAAACCA AGAATGGAAA 240 

CAGCGAAGGA TACAGCCTGT GCACATCCTG ACTTCTGTGA GCTCATTGOO CQGGACTAGG 300 

35 GAGTGTTOGG TGACCAGTOA CTTGGATTTT CCAACACAAG TCATCCCATT AAAGACTCTG 360 

AATGCAGTTG CTTCAGTAOC CATAATOTAT TCTTGGTCTC CCCTACA6CA QAATTTTATG 420 

GTGOAAGATG AAACT G ' i 'TTT AGATAACATT CCTTATATGG 6ASATGAAGT TTTAGATCAG 480 

QATQOTACTT TCATTGAAGA ACTAATAAAA AATTATGATG GGAAAGTACA OGGGGATAGA 540 

QAATGTGGGT TTATAAATGA TGAAATTTTT GTGOAGTTGG TQAATQCCCT TGGTC3UITAT 600 

40 AATQAT6ATQ ACGATGATQA T6ATG6AQAC GATCCTQAAG AAAGAGAAGA AAAGCAGAAA 660 

GATCTGOAOO ATCAC08A0A TGATAAAGAA AOGOGGCCAC CTOQOAAATT TCCTTCIGAT 720 

AAAATTTTGG AGGCCATTTC CTCAATOTTT CC3U3ATAAGG OCACAGCAGA AGAACTAAAO 780 ' 

QAAAAATATA AAOAACTCAC CX3AACA6CAG CTCCXaVGGCG CACTTCCTCC TOAATG TACC 840 

CCCAACATAG ATGQACCAAA TGCTAAATCT GTTCAOAOAG AGCAAAGCTT ACACTCCTTT 900 

45 CATAOGCTTT TCTOTAiOOOG ATOTTTTAAA TATGACTGCT TCCTACATCC TTTTCATGCA 960 

ACACCCAACA CTTATAAGC3G QAAGAACACA GAAACAOCTC TAGACAACAA ACCTTGTGGA 1020 

CCACAGTGTT ACCAGCATTT GOAGGGAGCA AAGGAGTrTG CTGCTGCTCT CACOGCTGAO 1080 

CGGATAAAGA CCCCACCAAA AGGTCCA6GA GGCOGCAQAA GAGGAOGOCT TOCCAATAAC 1140 

AGTAGCAGGC CCAGCACCCC CACCATTAAT GTGCTQGAAT CAAAGGATAC AOACAGTGAT 1200 

50 AGQGAAGCAO GGACTGAAAC GGGGGGAGAG AACAATX3ATA AAGAAGAAGA AGAOAAGAAA 1260 

QATOAAACTT 06AGCTCCTC T6AA0CAAAT TCTOQQTSTC AAACACCAAT AAA QATGA AG 1320 

CCAAATATTO AACCTCCTGA GAATGTOGAG TGGA0TQGT6 CTGA AGCC TC AATGTTTOQA 1380 

GTCCTCATTO GCACTTACTA TQACAATTTC T6T0CCATT0 CTAOGTTAAT TGGGACCAAA 1440 

ACATQTAGAC AGOTOTATGA OTTTAGAGTC AAAGAATCTA GCATCATA6C TCCAGCTCCC 1500 

55 GCTGAGGATG TGGATACTCC TCCAAGQAAA AA6AAGAGGA AACAC OQQTT GTGGGCTGCA 1560 

CACT6CA6AA AGATACA0CT QAAAAAGGAC G6CTCCTCTA ACCATGTTTA CAACTATCAA 1620 

COCTQTGATC ATCCAOGGCA GCCITGTGAC AGTTOGTGOC CTTGTGTGAT AOCRCAAAAT 1680 

TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAC3TGTCAAA ACOGCTTTCC GGGATGCOaC 1740 

- TGCAAAGCAC AOTQCAACAC CAAGCA6TGC CXX3TGCTACC TGGCTQTCOO AGAGTGTQAC 1800 

60 CXTTQACCTCT GTCTTACTTa TGOAGCOGCT OAOCATT06G ACRGTAAAAA TGTGTCCTGC 1860 

AAOAACTGCA GTATTGAOCa GGGCTCSCAAA AAGCATCTAT tGCTGGCAOC ATCTQAOQia 1920 

GCAG6CTGGG GGATTTTTAT CAAAGATCCT OTGCAGAAAA ATGAATTCAT CTCAGAATAC 1980 

TCtTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTQTA TGATAAATAC 2040 

ATGTGCAGCT TTCTGTTCAA CTTGAACAAT QATTTTaTGO TGGATGCAAC COGCAAGGGT 2100 

65 AACAAAATTC QTTTT6CAAA TCA7TC6C3TA AATCCAAACT GCTATGCAAA AGTTATGATG 2160 

QTTAAOQOTQ ATCACAGQAT AGGTATTTTT GGCAAGAGAG GCATOCAOAC 7GG08AAGA6 2220 

CTGTTTGTTG ATTAGAGATA CA60CAGGCT GAT6CCCTGA AGTATGTGSG CATOQAAAXSl 2280 

GAAATGQAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTOAAACA GCTGCCTTAQ 2340 

CTTCAOGAAC CTOGAGTACT GTGGGCAATT TAGAAAAAOA ACATGCAOTT TGAAATTCTG 2400 

70 AATTTGCAAA GTACTOTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA ACTTTTTATT 2460 

GCCTTCTOU: CAGCTGCAAA GWmWi ' A CCAGTGAATT TTT8CAATAA TQCAOTATGG 2520 

TACATTTTTC AACTTTQAAT AAAGAATACT TOAACTTGAA AAAAAAAAAA AAAAAA 



75 



Seq ID NOt B76 Protein eequence 
Protein Accession #i NF 004447 



1 11 21 31 41 51 

} ) I I 1 I 

MGQTGKKSEK GPVCWRKRVK SEYMRIiRQLK RFRRADBVKS MPSSNRQKIL ERTBZLNQEH 60 

80 KQRRZQPVHZ LTSVSSLRGT RECSVTSDU) PFTQVZPLKT LNAVASVPZM YSWSPLQQHP 120 

MVEDBTVZ^ IPYMGDEVIiD QDOTFIESLX RHYDGKVHGD RB06FINDBI FVBLVNAl^ 180 

YMDODTOODO DDPBBRBBKQ KPtiBDHRPDK 8SRPPRXPPS DKILBAZSSM FFDXGTAEBL 240 

RBKYJOSLTEQ QLPGALPPBC TPNZDGFHAK SVQREQSI<HS FBTLFGRRCF |CYDCFLBPPH 300 

ATPNTYXRXH TETALDNKPC GPQCyQHLEQ AXBFAAALTA ERIKTPPKRP GGRRRGRLPN 360 
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NSSRPSTPTI NVLBSKDTDS DREAGTETGG ENNDKEBBBK KDETS9SSBA NSROQTPIKM 420 

KPNIBPPENV EWSGAEASMF RVtilGTYVDN FCAIARLIOT KTCRQVYBFR VKESSIIAPA 480 

PABDVDTPPR KKKRKHRLWA AHCRKIQLKK DGSSNHWHY QPCDHPRQPC DSSCPCVIAQ 540 

NFCBXFOQCS SBOQMRPPGC RCKAQOITKQ CPOTLAVREC DFOLOiTGOA ADRHDSKNVS 600 

CKNCSZQROS KRHLLIAPSD VAGKOXPIKD PVQXNEPZSB YGGBI2GQDB ADRRQK VYDK 660 

YMCSPLFNW NDFWDATRK GNRZRPAMHS VKPNCYAICVH HVNGOHRiaX FAXRAIQTGB 720 
BLFVDYRY8Q ADALKYVCXB REKEIP 

Seq ID NO: B77 DKA sequence 
Nucleic Acid Accession tf: NM_007015 
Coding sequence: 1..1005 

1 11 21 31 41 51 

I i I I I I 

ATGACAGAGA ACTCCGACAA AGTTCCCATT GCCCTGGTGG GACCTGATGA OQTGGAATTC 60 

TCCAGCCCCC CGGCGTAOGC TACGCTGACX3 GTGAAGCCCT CXAQCCCCXSC OCQGCTGCTC 120 

AAGGTGGGAG COOTGGTCCT CATTTOGGOA GCTGTGCTGC TGCTCTTTGG GGCCATOGGO 180 

GCCTTCTACT TCTGGAAGGG GAGCGACAGT CACATTTACA ATOTCCATTA CACCATQAOT 240 

ATCAATGGGA AACTACAAGA TGGGTCAATG GAAATAGACG CTGGGAACAA CTTGGAGACC 300 

TTTAAAATGG GAAGTGGAGC TGAAGAAGCA ATTGCASTTA ATGATTTCCA GAA3GGCATC 360 

ACAGGAATTC G TTTTGCTGG AGGAGAGAAG TGCTACATTA AAGOQCAAGT GAAGGCTCX3T 420 

ATTCCrOAGO TGGGCGCCOT GACCAAACAG AGCATCTCCT CXAAACTGGA AGGCAAGATC 480 

ATGCCAQTCA AATATGAAQA AAATTCTCTT ATCTGGGTGG CTQTAGATCA GCCT6TGAA6 84 0 

QACAACAGCT TCTTQAOTTC TAAOGTGTTA GAACTCTGOS GTQACCTTCC TATTTTCIGG 600 

CTTAAACCAA CCTATCCAAA AGAAATCCAG AGGGAAAGAA GASAAOTGGT AAGAAAAATT 660 

GTTCCAACTA CCACAAAAAG ACCACACAGT GQACCACGQA GCAACCCAGG C36CrGGAAGA 720 

CTGAATAATG AAACCAGACC CAQTGTTCAA GAGGACTCAC AAGCCTTCAA TCCTGATAAT 780 

CCTTATCATC AGCAGGAAGG GOAAAGCATO ACATTOQACC CTAGACTGGA TCACG AAGGA 840 

ATCTQTTGTA TAGAATGTAG GOGGAGCTAC ACCCACTGCC AOAAOATCTG TGAACCCCTG 900 

GGGGGCTATT ACCCATGGCC TTATAATTAT CAAOQCTGCC GTTOSGCCTO CAGAGTCATC 960 

ATGCCATGTA GCTGGT60QT GGCCCGTATC TTGGGCATGG TGTGAAATCA CTTCATATAT 1020 

CAOOTGCTGT AAAATAAQAA CTAQCTGAAG AGAC3VACCAA AGAAGCATTA AGGCAGGTTG 1080 

ATGCTGATGG GACCATAAAA TATTTTTACA CGCAGCCTGA GOGGTTATTC TTGACACTCI 1140 

TAACAGAATT TTTTTAATOG TTTTCCAGAA CTTTAGTATA TGCAAATGCA CTOAAAGGGT 1200 

AGTTCAAGTC TAAAATGCCA TAACCCOQTT ATTTGTTATT TTTTATTTGC ATTOATTTGC 1260 

CATAAGTCTT CCXTTTGCTTG CATCTTCCAA AOCTATTTGQ AAATAAACAC GAAAATTTAC 1320 
AOTTTGCC 

Seq ID KO: B7B Protein sequence 
Protein Accession ffi NP_008946 

1 11 21 31 41 51 

I I I I I I 

MTENSDKVPI ALVGPDDVEP CSPPAYATLT VKPSSPARLL KVGAWLISG AVLLLFGAIG 60 
AFYFHKGSDS HIYNVHY*IMS INGKLQDGSM EIDAGNJ^BT PKMGSGAEEA lAVNDFQNGI 120 
TGIRFAOGEK CYIKAOVKAR IPBVGAVTKQ SISSKLEOKI MPVKYBENSli IWVAVDQPVK 180 
DH8FX«S8KVL BLGGDIiPIFH IiKPTYPKBIQ RERRBWRKI VPTTTKRPH8 GPRSNPGA6R 240 
UOIETRPSVQ EDSQAFNPDN PYHQQB6B8M TFDPRU3HEG ZCCIECRRSY THOQKICBPL 300 
6QYYPWPYNY QGCRSACRVI HPCSHWVARI LtSMV 

Seq ID NOi B79 DNA sequence 

nucleic Acid Accession «i S]H_0l2449.l 

Coding sequence I 66.. 1085 

1 11 21 31 41 51 

I I 1 I I I 

CCGAOACTCA COGTCAAGCT AA060GAAGA GTGGGTGGCT 6AAGCCATAC TATTTTATAG 60 

AATTAATGGA AAGCAGAAAA GACATCACAA ACCAAGAAGA ACTTTGGAAA ATGAAGCCTA 120 

GGAGAAATTT AGAAGAAGAC GATTATTTGC ATAAGGACAC GGGAGAGACC AGCATGCTAA 180 

AAA6A0CT6T GCTTTTGCAT TTGCACCAAA CA0CCCAT6C TQATQAATTT GACTGCCCTT 240 

GAGAACTTCA GCACACACAG OAACTCTTTC CACA6TGGCA CTTGCCAATT AAAA TAGCT G 300 

CTATTATAGC ATCTCTGACT TTTCTTTACA CTCTTCTGAO OQAAGTAATT CACCCTTTAG 360 

CAACTTCCCA TCAACAATAT TTTTATAAAA TTCCAATCCT GGTCATCAAC AAAGTCTTGC 420 

CAATGOTTTC CATCACTCTC TTGGCATTGG TTTACCTQCC AGGTOTGATA GCAGCAATTG 480 

TCCAACTTCA TAATGGAAOC AAOTATAAGA AGTTTCCACA TTGGTTGGAT AAGTGQATGT 540 

TAACAA6AAA GCAGTTTGGG CTTCrCAOTT TCTTTTTTGC TGTACTGCAT OCAATTTATA 600 

GTCTGTCTTA CCCAATQAGG OGATCCTAGA GATACAAtJTT GCTAAACTGG GCATATCAAC 660 

AGGTCCAACA AAATAAAQAA GAT6CCTGGA TTGAGCATGA TGTTTQGAGA ATGGAGATTT 720 

ATGTGTCTCT GQGAATTGTG GGATTGGCAA TACTOGCTCT GTTGGCTGTG ACATCTATTC 780 

GATCTGTGAG IGACTCTTTO ACATGGAtSAG AATTTCACTA TATTCAGAGC AAGCTAGGAA 840 

TTOTTTCCCT TCTACTGGGC AGAATACACO CATTOATTTT TOCCTOOAAT AAOTGQATAG 900 

ATATAAAACA ATTTQTATGO TATAGAOCTC CAACTTTTAT GATAOCTGTT TTCCTTCCAA 960 

TT Q TTOTCCT GATATTTAAA AQCATACTAT TCCTGCCATG CTTGAGGAAO AAGATACTGA 1020 

AGATTAGACA TGGTTGGGAA GACGTCACCA AAATTAACAA AACTQAGATA TGTTCCCAGT 1080 

TGTAGAATTA CTOTTTACAC ACATTTTTGT TCAATATTGA TATATTTTAT CACCAACATT 1140 
TCAAOTTTOT ATTTSTTAAT AAAATGATTA TTCAAGGAAA AAAAAAAAAA AAAAA 

Seq ZD NO: 380 Protein sequence 
Protein Accession Si NF_036581.1 

1 11 31 31 41 51 

t 1 I I I 1 

MBSRKDZtllQ BBLHKMKPRR MI^mDYUIK DTGET8MLKR PVLLBLBOTA RADBFDCP8B 60 

LQHTQBLFPQ WHLPZXZAAZ lASLTPLYTL LREVZHPLAT SHQQYFYKZP ZLVUHCNOiPN 120 
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VSITLLALVY LPGVIAAIVQ UJNGTKYKKP PHWLDKWMLT RKQPGLLSPP PAVUIAIYSL 180 

SYPMRRSYRY KLUNWAYQQV QQNKEDAWIB HDWIRMEIYV SLQIVGIAIL ALLAVT8IPS 240 

VSDSLIWRBF HYIQSKU3IV SU*U3TIHAL IPAHNKWIDI RQPVWYTPPT FMIAVPLPIV 300 
VLIPKSILPL PCLRKKILKI RHGHEDfVTKI HKTBICSQL 

Seq ID NOi B81 DNA sequence 
Nucleic Add Accession fit iaM_000684 
Coding sequence i 87.. 1520 

1 11 21 31 41 51 

1111)1 

TCCTACOTC GCCCXSGQCTT CTGGGGTGTT CCCCAACCAC OGCCCAOCCC TGCCACaCCC 60 

CCCGCCXXXX5 GCCTCCX3CAG CTOGGCATGG GCGO GG GGGT GCTCXSTCCTG GGCXXXTTCOQ 120 

AGCCOGOTAA CCTOTOSTOG GCO6CAC00C TCCC0QAGG6 0606GGCACC G0GGCX3GGGC 180 

TGCTGGTQCC CGCGTOGCOG CCCGC C TOOT TGCTOCCTCC OGCX3W30GAA AGCCCOSAGC 240 

OGCTGTCTCA GCAGTOGACA GCGGGCATGG 6TCTGCTGAT GGOSCTCATC GTGCTGCTCA 300 

TOGTGGOGGG CAATGTGCTG GTGATCGTGG CCATCQCCAA GA06C060GG CTGCAGAOGC 360 

TCACCAACCT CTTCATCATG TCCCTGGCCA GOSCCGACCT GGTCATGGGG CTGCTGGTQO 420 

TGCCGTTCOG GOCCACCATC GTG6TGTGGG GC06CTGGQA GTAOSQCTOC TTCTTCTGOO 480 

AGCTGTGGAC CTCAGTGGAC QTGCTGTQCG TGAOGGCCAG CATOGAGACC CrOTGTGTCA 540 

TTQCCCTGOA CCOCTACCTC GCCATCACCT CGCCCTTCCG CTACCAGAOC CTGCTQACX3C 600 

6060GCGGGC GCXSGGGCCTC GTGTGCACCG TdXOGGCXSVT CTCGGCCCTG GTGTCCTTCC 660 

TGCXXaXCCT CATGCACTGQ TGGCXMGC6G AGAGCGACX3A GGCGCQCOSC TGCTACAACG 720 

ACXXCAAGTG CTGCGACTTC GTCACCAACC GGGCCTAOSC CATOGCCTOG TCGGTAOTCT 780 

CCTTCTAOGT GCX:CCTaTGC ATCATGGCCT TOGTGTACCT GCGGGT6TTC 060GAGGCCC 840 

AQAAGCAGGT 6AAGAAGATC GACAGCTGOQ AGCQCOQTTT CCTCGGCGGC CCAGOGCGGC 900 

CGCX:CTCX3CC CTCX5CCCTCG CCCGTCCCCG CGCCCGCGCC GCOGCCCGGA CCCCCGCGCC 960 

CCGCCX3CXX5C C5QCCGCCACG GCCCCGCTGG CCAACGGGCG TGCXSGGTAAG OGGCGGCCCT 1020 

CGCGCCTCGT GGCCCTACGC GAGCAGAAGG OGCTCAAGAC GCTGGGCATC ATCAT0G600 1080 

TCTTCACGCT CTGCTGGCTG CCCTTCTTCC TQGCCAACGT GGTGAAGGCC TTCCACOGOQ 1140 

A6CTGGT6CC CGACGGOCTC TTCOTCTTCT TCAACTGGCT GGGCTAOGCC AACTCGGCCT 1200 

TCAACCCCAT CATCTACT6C CQCAGCCCCG ACTTCCGCAA GGCCTTCCAG GGACTGCTCT 1260 

GCTGCX3CGCG CAGGGCTGCC CGCCGGCGCC ACGOGACCCA CX3GAGACCGG CCX3CG0GCCT 1320 

CGGGCTGTCT GGCCCXK3CCC GGACCCCCGC CATCXKXXX3G GGCCX3CCTCG GA06AGQAO0 1380 

AGGACGATGT CGTCGGGGCX: AOGCCGCCOG CGCXSCXTTGCT GGAGCCCTGG GCCGGCTGCA 1440 

AOGGGGOGGC GGOGGOGGAC AGOQACTGQA GCCTGGAOGA GC06TGC0GC CCGGGCTTCG 1500 

CCTGOGAATC CAAOOTOTAG GGCC0QGCX3C GGGGCGOGGA CTCOQGGCAC GGCTTCCCAG 1560 

GGGAACGAGG AGATCTGTGT TTACTTAAQA COGATAGCAG GTGAACTCGA AGCCCACAAT 1620 

CCTOGTCTGA ATCATCCQAG GCAAAGAGAA AAGCCAOGGA CGGTTQCACA AAAAGGAAAO 1680 
TTTGGGAAGG GATGGGAGAG TGGCTTGCTG ATGTTCCTTG TTQ 

Seq ID NO I B82 Protein sequence 
Protein Accession #i NP_000675.l 

I 11 21 31 41 51 

I I I I I I 

MaAaVLVLGA 8BPGNI>6SAA PLPDGAATAA RLLVPA8PPA SLLPPASBSP EPLSQQWTAG 60 

MGLLMALIVL LIVAGNVLVI VAIAKTPRLQ TLTNLPIMSL ASADLVKGLL WPPGATIW 120 

WGRWEYQSFF CBLWTSVDVL CVTASIETLC VIALDRYXiAI TSPFRYQSI*L TRARARGLVC 180 

TVWAISALVS PIiPIIiMHWWR AESDEARRCY NDPKCCDFVT NRAYAIASSV VSFYVPLCIM 240 

AFVYLRVFRE AQXQVXKID8 CBRRFZiGGPA RPPSPSPSFV PAPAFPPGPP RPAAAAATAP 300 

XiANGRAGKSR PSRLVALRBQ KAItRTIiQIIII (SNVTUOHUBF FLANWKAFH RELVPDRLFV 360 

FFHWIiSyAKS AFHPXIYCRS POPRKAFQGL LGCARRAARR RBATBGDRPR AS6CLARPGP 420 
PPSPGAA8DD DDDDWGATP PARLLBPWAO CMGOAAADSD 88UDBPCRPG FA8B8KV 

Seq ID NO t 683 SUA sequence 

Nucleic Acid Accession «i NM_000729.2 

Coding sequence : 2 . . 421 

1 11 21 31 41 51 

I I I I I I 

OGCTCAOCro CCGOGCTGCT COGGTTOGAA AG8CCAAGCC AGCTGCCGTC CTAATCCAAA 60 

AQCCAIGAAC AGOGGOQTGT GCCTGTGCGT GCTGATGGCG GTACTGGCGG CTGGCGCCCT 120 

GAOGCAGCOG GTGCCTOCCG CAGATCCCGC GGGCTCCGGG CTGCAGCGGG CAQAGQAGGC 180 

OCCCOGTAGG CAGCTGAGGG TATCGCAQAG AAOGGATGGC GAOTCCOGAG CGCACCTGGG 240 

CQCCCTGCTO 6CAAGATACA TOCAGGAOGC CCGQAAAGCT CCTTCIGGAC GAATGTCCAT 300 

OOTTAAOAAC CrGCAOAACC TGQACCCCAG CCACAOOATA AOTGACOGGO ACTACATGGG 360 

CTOGATGGAT TTTGGCOCTC GCAOTGCOGA GGAGTATGAG TACCCCTCCT AGAGGACCCA 420 

GCOGCCATCA GCCCAACGQA AGCAACCTCC CAACCCAQAG QAGGCRGAAT AAQACAACAA 480 

TCACACTCAT AACTCATTGT CTGTGGAGXT TGACATTGAA TGTATCTATT TATTAA6TTC 540 

TCAATOTGAA AATTQTGTCT GXAAGATTGT OCAGTGCAAC CACACAGGCT CACCA OAAOT 600 

TCTGCAAACT QAAGACAAAA CWrm ' CTf CATCTOTQAC TCCTOTTCTO AAAATCTTOT 660 
TATGCTATTA AAOTGATTTC ATTCTGCC 

8eq ID NOt B84 Protein sequence 
Protein Accession #: NP_000720.l 

1 11 21 31 41 51 

) I I I 1 1 

MMSGVCLCVL MAVLAAOALT QPVPPADPAG 6GLQRABBAP RRQIAVSQRT DGESRAHLGA 60 
UARYIQOAR KAPSQRMSIV RNLQHU>PSH RISDRDYHGW MDFORRSABB YEYPS 



828 



wo 03/042661 



PCT/US02/36810 



TABLE 74 Gene Sequences Mp- or Down-ReguXated In Cancer 



endothelial differentiation, sphingollpld G -protein- coupled receptor, 1 
HS.X54210 
M31210 
H31210 
251-1396 



Anglogenesia 

hi mA SEQUEWCB 
Gene name: 
Unigene number: 
Probeeet Accession ftt 
MucXeic Acid Accession ftt 
Coding sequence: 

TCTAAAGGTC GGGGGCAGCA GCAAGATGCO AAGCGAGCOG TACAGATCCC GGGCTCTCCG 60 

AACGCAACTT OOCCCTGCTT GAGCGAGGCT GOGGTTTCOG AGOCCCTCTC CAGCCA AGGA X20 

AAAOCTACAC AAAAAGCCTG GATCACTCAT CQAACCACCC CTQAAGCCAG TQAAG6CTCT 180 

CTOGCCTCGC CCTCTAGOGT TOGTCTGQAG TAGCGCCACC CCGGCTTCCT GG6GACACAG 240 

GOrrrOQCACC ATGGGGCCCA CCAGCGTCCC GCTGGTCAAa GCCCACCGCA GCTCGGTCTC 300 

TGACTAOGTC AACTATGATA TCATCGTC06 GCATTACAAC TACAOGGQAA AGCTGAATAT 360 

CAGOGOGGAC AAGQAGAACA GCATTAAACT GACCTGQGTG GTGTTCATTC TCATC TGCT G 420 

CTTTATCATC CTGGAGAACA TCTTTGTCTT 6CTQACGATT TGGAAAACCA A6AAATTCCA 480 

CCGACCCATO TACTATTTTA TTGGCAATCT G6CCCTCTCA GACCIGTTGa CAGGAGTAGC 540 

CTACACAGCT AACCTGCTCT TGTCTGGGQC CACCACCTAC AAGCTCACTC CCGCCCAGTG 600 

GTTTCTGCX3Q GAAGGQAGTA TGTTTGTGGC CCTGTCAGCC TCOGTGTTCA GTCTOCTCGC 660 

CATOGCCATT GAGCGCTATA TCACAATGCT 6AAAAT0AAA CTCCACAAOG GQAGCAATAA 720 

CTTCC6CCTC TTCCTGCTAA TCAOOGOCTO CTGQGTCATC TCOCTCATCC TCGGTOGGCT 780 

GCCTATCATG GQCTGGAACT GCATCAGTOC GCTOTOCAOC TCCTCCACCG TGCTGCOGCP 840 

CTACCACAAG CACTATATCC TCTTCTOCAC CAOGGTCTTC ACTCTGCTTC TGCTCT CCAT 900 

CXJTCATTCTQ TACTGCAGAA TCTACTCCTT GGTCAGGACT OGGAGCOGCC GCCTGACGTT 960 

CCGCAAGAAC ATTTCCAAGG CCAGCOGCAG CTCTGAGAAT GTGGCGCTGC TCAAGACCQT 1020 

AATTATCGTC CTGAGCX5TCT TCATCGCCTG CTGQGCA003 CTCTTCATCC TGCT CCTG CT 1080 

GGATGTGGGC TGCAAGGTGA AGACCTGTGA CATCCTCTTC A0A6CGGAGT ACTTCCT6GT 1140 

GTTAGCTGTG CTCAACTCCG GCACCAACCC CATCATTTAC ACTCTGACCA ACAAOGAGAT 1200 

G06TGQGGCC TTCATCCGGA TCATGTCCTG CTGCAAOTGC CCQAGCGGAG ACTCTGCTQG 1260 

CAAATTCAAG OGACCCATCA TCGCOGGCAT GGAATTCAGC OGCAGCAAAT CGGACAATTC X320 

CTCCCACCCC CAGAAAGAOG AAGGGGACAA CCGAGAQACC ATTATQTCTT CTG6AAA0GT 1380 

CAACTCTTCT TCCTAGAACT GGAAQCTGTC CACCCACOGG AAGOGCTCTT TACTTGGTOG 1440 

CTGGCCACCC CAQTQTTTGG AAAAAAATCT CTGGGCTTCG ACTGCTGCCA GGQAGGAGCT 1500 

GCTGCAA6CC AGAGGGAGQA AGGGGGAGAA TAOGAACAGC CTGGTGGTGT CGGGTGTTGG 1560 

TGQGTAGAGT TAGTTCCTGT GAACAATGCA CTGGGAAGGG TGGAOATCAG GTCCCGGCCT 1620 

GQAATATATA TTCTACCCCC CTGGAGCTTT GATTTTGGAC TGAGCCAAAO OT CTAG CATT 1680 

GTCAAGCTCC TAAAOGGTTC ATTTGaCCCC TCCTCAAAGA CTAATGTCCC CATGTGAAAO 1740 

OGTCTCTTTG TCTGGAGCTT TGAG6AGAT0 TTTTCCTTCA CTTTAQTTTC AAACCCAAGT 1800 

QAOTgTGTGC ACTTCTGCTT CTTTAGGGAT GCCCTGTACA TCCCACACCC CACCCTCCCT 1860 

TCCCTTCATA CCCCTCCTCA ACQTTCTTTT ACTTTATACT TTAACTACCT GA6AQTTATC 1920 

AOAGCTGGGG TTGTGGAATG ATCQATCATC TATAGCAAAT AGGCTATGTT GAGTACGTAG 1980 

GCTGTGGGAA GATGAAGATG QTTTGGAGGT GTAAAACAAT GTCCTTGGCT 6AGGCCAAAG 2040 

TTTCCAIGTA AQCGGGATCC GTTTTTTGGA ATTTGGTTaA AGTCACTTTG ATTTCTTTAA 2100 

AAAACATCrr TTCAATGAAA TGTQTTACCA TTTCATATCC ATTGAAGCOa AAATCTGCAT 2160 

AAGQAAGCCC ACTTTATCTA AATGATATTA GCCAGGATCC TTGGTGTCCT AGGAGAAACA 2220 

QACAAGCAAA ACAAAGTGAA AACCQAATGG ATTAACTTTT GCAAACCAAG G QAGAT TTCT 2280 

TAQCAAATQA OTCTAACAAA TAT6ACATCC GTCTTTCCCA CTTTrGTTGA TGTTTATTTC 2340 

A6AATCTTGT OTGATTCATT TCAAGCAACA ACATGTTOTA TTTTGrTGTQ TTAAAA GTAC 2400 

TTTTCTT6AT TTTTGAATGT ATTTGTTTCA GGAAGAAGTC ATTTTATGGA TTTTTCTAAC 2460 

C03TGTTAAC TTTTCTAGAA TCCACCCTCT TGTGCCCTTA AQCATTACTT TAACTGGTAG 252 0 

GGAAC6CCAG AACTTTTAAG TCCAGCTATT CATTAGATAG TAATTGAAGA TATOTATAAA 2560 

TATTACAAAG AATAAAAATA TATTACTGTC TCTTTAGTAT GGTTTrCAaT GCAATTAAAC 2640 

OGAGAOATGT CTitym TTI TAAAAAGAAT AGTATTTAAT AGGTTTCTGA GTTTTGTGOA 2700 
TCATTTTGCA CATAOCTTTA TCRACTTTTA AACATTAATA AACTGATTTT TTTAAAG 



A2 Protein sequence t 
Gene namei 
lAilgene nuniber: 
Probeeet Accession #t 
Protein Accession tt: 
Signal sequences 
Transmenbrane domains: 
Cellular Local izat ion t 



endothelial differentiation, sphingollpld G-proteln-coupled receptor, 

Hs. 154210 

K31210 

AAA52336 

none found 

50-71, 92-110, 122-140, 160-177, 202-222, 251-269, 283-301 
plasma meiribrane 



1 11 21 31 41 51 

I I i I I I 

MGPTSVPLVK AHRSSVSDYV NYDIIVRHYM YTGKLNISAD KEMSIKLTSV VFILICCFII 60 

IjENIFVLLTI WXTKKFHRPM YYFIGNLALS DLIiAGVAYTA NLLIiSaATTy KLTPAQWPLR 120 

EGSMFVAIiSA SVF8LLAIAI ERYITKLKMX LHNOSNNFRI* PLLISACHVZ SLZLGGLPIM 160 

GHNCISAL8S CSTVLPLYHK HYILPCTTVP TLLLLSIVIL YCRIYSLVRT RSRRLTFRKM 240 

ISKASRSSEH VALLKTVIIV LSVFIACNAP LPim JOVO CKVKTCDZLF RAEYFLVLAV 300 

LHSOTHPIIY TXilKKEMRRA FIRIMSCCKC PSODSAGKPK RPIIAGMBP8 RSXSDNSSHP 360 
QKDEGDNPBT IMSSGNVNSS 8 

A3 DMA SSQUB^CE 

Gene namei <3 protein-coupled receptor 51 

unigene nuoiberi Hs. 198612 

Probeset Accession «i AA4S2928 
Huclelc Acid Accession fti HM_004624.1 

829 



wo 03/042661 



PCTAJS02/36810 



OoAlng sequence: 



1-2826 (underlined sequences correspond to start and stop codona) 



1 11 21 31 fl 51 

I I I i ) I 

ATCGCTTCCC CGCGGAGGTC CGGGCAGCCA GGGCGGCCGC CGCCGCC6CC ACOSCOGCCC 
6GQC6CCIGC TACTGCTACT GCTQCTGCOG CTGCTGCTGC CTCTCGCGCX: CGGGGCCTGO 
GQCTGGOOGC GQGGCGCCCC CCX3GCOGCOG CCCAOCAGCC OGCOGCTCTC CATCATGGGC 
CTCATGCCGC TCACCAAGGA GGTGGCXaUVG GGCAGCKTCQ GGCQCQOTOT QCTCCCOG CC 
GTGGAACTGG CCATOQAOCA OATCCGCAAC GAGTCACTCC TQOQCCCCTA CTTOCTCGAC 
CTGGG6GTCT ATGACACGGA GTGCGACAAC OCAAAAGGGT TGAAAGCCTT CTACGATQCA 
ATAAAATAOG GGCCGAACCA CTTGATGGTG TTTGQAGGCG TCTOTCCATC OSTCACATCC 
ATCATTGCAG AGTCCCTCCA AGGCTGQAAT CTGGTGCAGC rn'tTIUTGC TGCAACCAOG 
CCTCTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACOOTCCC ATCAOAa^AT 
GCGGTOAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGOAAGCG CGTG QGCAO G 
CIGAC6CAAG ACGTTCAGAG GTTCTCTGAQ GTGCGGAATG ACCTOACTGG AGTTCTGTOT 
GGOSAGGACA TT6AGATTTC AGACACCGAQ AGCTTCTCCA ACGATCCCTG TACCAGTOTC 
AAAAAGCTQA AGGGGAATGA TGTGCGQATC ATCCTTGGCC AGTTTGACaV GAATATGGCA 
GCAAAAGTGT TCTCTTOTGC ATA06AGGAG AACATOTATG GTAGTAAATA TCAGTOGATC 
ATTCOGGQCT GGTAOGAGCC TTCTTGGTGQ QAGCAOOTGC ACACGGAAGC CRACTCATCC 
OGCTGC C TCC GGAAGAATCT GCTTQCTGCC ATGQAGGGCT ACATTGGOGT OQATTTOSAG 
COCCIQASCT CqUWSCAflAT CAAGACX^ITC TCAGOAAAQA CTCCACAGCA GTATGAGAGA 
OAGTACAACA ACAAGCGGTC AGGCGTGGGQ CCCAGCAAGT TCCACGGOTA OGCCTAOQAT 
GGCATCTOGG TCATOGCCAA GACACTGCAG AGGGCCATGO AGACaCTGCA TOOCAOCAGC 
OGOCACCAGC 66ATCCAGGA CTTCAACTAC ACGGACCACA OGCTGGGCAG GATCATOCTC 
AATGCCATGA AC33AOAOCAA CTTCTTCGGG GTCACGGOTC AAGTTGTATT CCGGAATGGG 
GAGAGAATGO GOACCATTAA ATTTACTCAA TTTCAAGACA GCAGGQAGGT GAAQGTGGQA 
GAGTACAACX5 CTGTGGCC6A CACACTGGAQ ATCATCRATG ACACCATCAO GTTCCAA GGA 
TCCQAACCAC CAAAAGACAA GACCATCATC CTGGAOCAGC TGCGGAAGAT CT CCCTAO CT 
CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCXK3QATGA TCATGGCCAG TGCTTTTCTC 
TTCTTCAACA TCAAGAACC6 OAATCAGAAG CTCATAAAGA TGTOGAGTCC ATAC ATGA AC 
AACXTTTATCA TCCrKSGAQQ OATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTOAT 
GGATCCTTT6 TCTCTQAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 
ACCGTOGGCr ACACGACCGC TTTrGGGGCX: ATGTTTGCAA AGACCTGQAG AGTOCAOGCC 
ATCTTCAAAA ATGTQAAAAT GAAGAA6AAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 
OIGGGGGGCA T6CTGCT6AT C6ACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC 
CTGOGAAGGA CAGTGOAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GQATATCTCC 
ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCC ATAT QA CCA TCTGQ CT TGGCATOOTC 
TATGCCTACA AGGGACTTCT CATQTTGTTC GGTTGTrTCT TAGCTTGGGA GACCCGCAAC 
GTCAGCATCC CCX3CACTCAA OGACAGCAAG TACATOSGGA TQAGTQTCTA CAACGTGGGG 
ATCATOTOCA TCATCGGQGC CQCrGTCTCC TTCCTOACCC GGGACCA6CC CAATGTGCAG 
TTCTGOVTCG TGGCTCTGGT CATCA T CTTC TGCAOCAOCA TCACCCTCTG CCTGGTATTC 
GTGCXX3AAGC TCATCACXXTT QAGAACAAAC CCAGATQCAG CAACGCAGAA CAGGCGATTC 
CAGTTCACTC AGAATCAQAA QAAAGAAOAT TCTAAAAOGT CCACCTCGGT CACCAGTGTG 
AACCAAGCCA GCACATCCXX3 CCTGGAGGGC CTACAGTCAG AAAACXSITCG CXrTGOGAATG 
AACavrCACMS AGCTGGATAA AQACTTGOAA GAGGTCACCA TGCAGCTGCA GGACACACCA 
GAAAAQACCA CCTACATTAA ACRGAACCAC TACX3UUSAGC TCAATGACAT CCTCAACCTG 
GGAAACTTCA CTCAOAGCAC AGATGGAGGA AAGOCX»TTT TAAAAAATCA CCTCGATCAA 
AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAOAA 
GATATAAACT CTCCAGAACA CATCCAGOGT OGGCTGTCCC TCCAGCTCCC CATCCTCCAC 
C310GCCTACC TCCCATOCAT 0B6A6G0QTG OAGGCCAGCT GTGTCAGCCC CTOOQTCRGC 
CCCACCGCC31 GCCCCCGCCA CAGACATGTG CCfiCCCSCCt TCOGAGTCAT GGTCTCQ6QC 
CTGTAA 



60 
120 
IBO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
ISOO 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



A4 Protein sequence i 
Gene name: 
Utxigene nunben 
Probeset Accession #i 
Protein Accession #: 
Signal sequence t 
Pfom domains: 
Tranamsnfl>rane domains i 
Cellular iiocalization* 



G protein-coupled receptor 51 

H8. 198612 

AA452928 

NP 005449.1 

1-42 

7tm 3 [481-754] , ANP receptor [130-204] 

22-44. 477-499, 517-539, 552-574, 595-617, 653-675, 692-713, 
plasma mend>rane 



722-744 



1 
I 

MASPRRSGQP 
LMPLTKEVAK 
IKYGPNHLKV 
AVMPAZLKIiL 
fCKLIQSNDVRZ 
RCLRKNLLAA 
GZWVIAKTLQ 
ERMOTXKFTQ 
LYSZI«SAI>TI 
GSFVSEXTFB 
VGGMLLICLC 
YAYKGLXiMIiP 
PCrVALVIlF 
HOASTSRIfBG 
GHFTBSTDGG 
HAYLPSIGGV 



11 
I 

GRPPPFPPPP 
GSIGRGVLPA 
FGGVCPSVTS 
KHYQHXRVGT 
IliQQFDQimA 
MEGYIGVDFE 
RAMSTLHA88 
PQDSREVKVG 
LGMZMASAFL 
TLCTVSmili 
ILICWQAVDP 
GC7LAWBTRN 
C8TITLCLVP 
LQSENRRLRH 
KAILKNBLDQ 
OASCVSPCVS 



21 
I 

ARLLLLLLIiP 
VBLAIEQIRN 
XZAESLQGWN 
LTQDVQRFSB 
AKVFOCAYEB 
PLSSKQXKTI 
RHQRIQDFNY 
EYNAVADTLE 
PFNIKMRNQK 
TVCYTTAFGA 
LRRTVEKYSM 
VSIPAX^SK 
VPKLITLRTN 
KITBU3KDLB 
NPQI41MNTTB 
PTA8PRHRHV 



31 
I 

LLIiPLAPQAH 
BSLLRPYFLD 
LVQLSFAATT 
VRNDLTGVLY 
NMYGSKYOffZ 
SOKTPQQYER 
TDHTLGRIIL 
IINDTIRFQG 
LIKMSSPYMN 
KFAKIHRVEA 
BPDPAGRDIS 
YIGMSVYNVG 
PDAATQMRRP 
BVTMQIiQDTP 
PSRTCKDPIE 
PP8FRVKVSG 



41 . 
I 

GWARGAPRPP 
LRLyDTBC33N 
F\rLADKKKyP 
QEDIBZSDTB 
ZFGHYEPSIfW 
EYHNKRSGVG 
NAKNETNFFG 
SEPFEDKTII 
KLXZLGGMLS 
ZFKNVKMKKK 
ZRPLLEBCEH 
IKCIIG AAV8 
QFTQNQXXED 
EKmrZXQNH 
OXNSPailQR 
L 



51 
I 

PSSPPLSZMG^ 
AKGUCAFYDA 
YPFRTVPSDN 
SF8MDPCTSV 
BQVHTEANS8 
P8KFHGYAYD 
VTGQWFRMQ 
LEQLRKISLP 
YASIFLFGLD 
IIKDQKLLVI 
THMT1WI/3XV 
FLTRDQPNVQ 
SKTSTSVTSV 
YQBX2a>ZUni 
RLSLQIiPILH 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



830 



wo 03/042661 



PCT/US02/36810 



AS DHA SEQUEWCB 
Gene name: 
Unigene number: 
Probeset Accession Si 
Nticleic Acid Accession 8: 



BSTs 

H8. 293616 
AW043782 
none found 



coding sequences 38>1075 (underlined sequence corresponds to start and stop codon) 



1 11 21 • 31 41 51 

10 I I I I I I 

AGCAAOGAOG CGOGGCAGiOG GQAOGGGOBG OOOOGC CATG TGGCTGCTGO GGGCGCTGTQ 60 

CCTGCTGCTQ AGCAGOGCGG €GQAQAGCCA GCTOCTCCCC 06GAACAACT TCACCAATGA 120 

GTGCAACATA CCAGGCAACT TCATGTCCAG CAATGGACQG TGCATCCC3GG GOGCCTGGCA 180 

0TGTGA06GG CTGCCTGACT GCTTOGACAA GAGTGATGAG AAG6AGTGCC CCAAGGCTAA 240 

15 GT0GAAAT6T GGCCCAACCT TCTTCCCCTG TGCCAGCG6C ATCCATTGCA TCATTGGTOG 300 

Cri ' OUUG TGC AATG U U ' m ' G AGGACTGTCC OQATG0CA6C GATGAAGAGA ACTGCACAGC 360 

AAAOCCTCTG CTTTGCTCCA COGOCGGCTA CCACTGCAAO AAG66CCTCT GTATTQACAA 420 

GA6CTTCATC TGCX^ATGGAC AGAATAACTG TCAAGACAAC A6TGATGAGG AAAOCTGTGA 480 

. - AAGTTCTCAA GAACCCGGCA GTGGGCAGGT QTTTOTGACT TCAGAGAACC AACTTGTGTA 540 

20 TTACCCCAGC ATCACCTATG CCATCATOQG CAOCTCOGTC ATTTTTGTGC TGGTG6TG6C 600 

CCT6CTGGCA CTG0TCT7GC ACCACCAGCO QAAGCGGAAC AAGCTCftTGA CGCTGOCOGT 660 

GCAC06GCTG C7U3CACCCTG TGCTGCTGTC CC6CCTGGTG GTCCTGGACC ACGCCCACCA 720 

CTGCAAOGTC ACCTACAACG TCAATAAT6G CATCCAGTAT GTGGCCAGCC AGGOQGAQCA 780 

QAATGOGTCG GAAGTAGGCT CCCCACCCTC CTACTCCQAG GCCTTOCTQG ACCAGAGOCC 840 

25 TGGGTGGTAT GACCTTCCTC CACOGCCCTA CTCTTCTGAC ACGGAATCTC TGAACCAAGC 900 

OQACCTGCCC CCCTACOGCT CC06GTCG6Q 6AGIGCCAAC AGTGOCAGCT CCCAGGCAOC 960 

CAGCAGCCTC CTGAGOSTGO AAGACACCAG CCACA6CCCG GGGC318CCTG GCCCCCAGGA 1020 

GGGCACTGCT GAGCCCAGGG ACTCTOAOCC CAOCCAQGGC ACTGAAGAAG TATAAGTCCC 1080 

AGTTATTCCA AAGTCCaVTAT GGGTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

30 TGCTCATGGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGGATATT 1200 

AACTATCTCT GC ATTCC CCT CCTCCCCCAfl ACTTCAOAGA TOrAYiTO 'G GOCT CTCAGT 1260 

TGACATGATC T6TT0TG0GT CTTTTCTOTC AGGTCACTCT TCCCTTGGGA CCOGAGATCA 1320 

CACCCTCATT TTTCACATTA TTCTGTTTCT QTrGGAQAGA CAGCATATAA AACAGTATT6 1380 

AAATAGGCTG GGAGAGAGCA ATGTTTCTGT GCTATATTGG AT6CTCAGAA GTGCAGGAGA 1440 

35 CGCTGGACCC AATTCTCTCT GCTGGGTAGT TACCTTATAO CATTTGGGGA TTTGGGTTAG 1500 

ATGATCTAAC CAGGAGGGCA TCACTGGATG QTCACCCCCC CAAAAAAATT CCATTTGAGC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTTG ATGCCCCCAG TTGAGCAGAG TCAGTGGCCA 1620 

AAGAAAACTT TGGACOTGAO TAACACCCTT CAQCAQTCX3C AACQTTATTT TGGTTTTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740 

40 CTCATTCTQA GAQCTTTCCT CAGCAGCATA TATCATCAOC CTCATCCTAA AATAOQCAGO 1800 

GAGCOCCTCC CATGAGTTTA TCOUU3TTCT CAGCTCCTAA AATGCAGGCT GGCAAGACCC 1860 

TACACCTGCX: CTGGCTCTAC A6CCACTTAC CTGGTTTCTG GACIGTCACC CTCXX»GCTO 1920 

ACCTGCCCGT AGCCAAGGAA TGAGGACCTA ACTTQAGTTQ GCCCAAAGTC TGACCTGGCT 1980 

. GTATGTCCCr QTGGCCCACA CCCAGCCTGT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

45 CTCGAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTACCT OTGCATTTGQ ACTTQAGGAC 2100 

ACTGGTTrCT ATGACAGGTO AGAGCCAIOr TCAATACCTC CAGCAAOCTC TOCTG G CTCC 2160 

CTGCACTGTG GACQCTCCTC TTCCCAAGQT CCCAATACCA OCACCTCTAO TTAQAGTTAQ 2220 

GGTCAGGGTC ACGCCTCTCC CAACATCCOV GTAGTTTCTC CTCTGAGACA CATGG6CAA0 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340 

50 TGAAACAGTG TOrnxymT TTCCCTTCTA GTTAAGGGAC TATTTATATG TCTATAGGAA 2400 

AGCTG TCTC T TATmy m ' TTOC TTTAAC AAGGTCXAAA GAAAGATGCA AAAOGAO ATC 2460 

ACACCCTTGC CCOQCTGAaC CCOGT0ATAA CAAOTCACTC GAGACTAAGC T6T0TGCCA0 2520 

ACATTTGTGC ATTGTT6CAC TTTGAGGTTA TTATTTATCA A6TTCTTGAA G6AAGCAGAA 2580 

_^ AQAGGQACTC CTCTCTCCCT COOTGTATAG TCTCTATOTT TGTGCTAGTT ' mXTrn ' m ' 2640 

55 TTCTCTQTGT CCAGTCAGCG ACAGGGCCOG CCTCCCTGCA GGAATAA6GG GTAAAAC!GTT 2700 

AGOTOTTOTT TQGCAAGAAA OCACACXGAC TGATGAOGQa TAAAATGGAA GCAGGTAGAG 2760 

OCACTCGGGG CAGCTGTCAC CCATTCAGAA CTTCTTTG08 CA0CT6AAGA AATGTTCAGT 2620 

AACCTGTTTG AOGCTAATTA AAACAGAGCC TGCAGGAAGT GGG6CTAAAG TGGCATTCAG 2880 

TGATCCTGTT CTGTAGACTT TTCTTTCTTT TTTTAACCAA ATCCAAAGGA TQTTACAQAA 2940 

OO AAGCTA6CCA CTGGTATTTT GTTTTQTTTA AAAAAAAAAA GAAA6AAAGA AAGAAA6AAA 3000 

AAOGGAAAGO AACCTA6CI0 GCTOTATCTT TCATTTTTAA AATA8C2ACTT GAGTTATTTT 3060 

CTGAGTAATC CAATAAAGAA CTTTTGATGA CAOCCAGAAT GTGTTAGAAC TCTGGCTGAA 3120 

CATTTCATCT CCTGTQAGTC AGAAGGGCTT TATTTCTCCC TTTGATGGGG CCCCTTCTTC 3180 

- TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA AAGAATTCTA ATTTTAATTA ATTGOGC3V6T 3240 

05 GAGTTAATGT CACT06CTTT TCTGCTTCCA GGCATCTTAG GAAAAACAAA TGGTTTTAGT 3300 

AG ATAAGGQA TGCCTACTAA TGCTTTTTTA AAACAAACAG GQACATTTTT ATTATA GATT 3360 

TOATTTTTTT AATGAATGTT TTTAAAAATA TATAAATAGG ACACCAAAGC GGCAG66TTT 3420 

TTTTtGGGGG GAG6GGGTTT GTTTTCCAAC TCAAGATGQC ACATTAGTGG CCA6CAATAT 3480 

TTTTTAACTC ATTCCAACCA GGAAGCTTTT TTATACATTG CCTAAATCTA CX50CAACCAG 3540 

70 AAAATAGTCT CATCTCTTTT TTTCTCAAAT GAQATCXSSTG TTTTATTTTA GCATTAAATT 3600 

AOTTACACTG TGATGACTGG CCTATTACCT GACTCAGCTC CCTCTAOCTT GAAATTGACA 3660 

TTTTTAAAAA ATGCAACTAA GTGGTTAATA GTGTGTGAOQ CtCAAAGTTA AT6TAAACTQ 3720 

GAAAGGTTGT GTOTCGTTGC TTTTTGTGTT TTGGTTAGGC TTGOTTTTaT TTTTTAATTT 3780 

TTATACTTTC TAATAAATTT GCAGTrTCAT TCTTTCTGTT TGTGCAAAWG GWMCTAMARM 3840 

75 AAMMAAAAAC AWyWTTGGGG GG6CTTGGGC CTOGGAAAAA GTTTTTAACA CCACTTOGGG 3900 

TGGGGGGGCG GGGCCCAGGT AGGTA0G6CG ACCACGOGGG CCCAAACGGG ACCCCAGAAG 3960 

GAAAGOCTGG CCAAGAAAAA GGTGQOQAGA ATTCTCCACA OCSGAAAAAA AC60GCOOGG 4020 

GGAAAOXSCA GAGTGTTGC!G TAAACX3UAC COGAAGAGAG AACTCAGAAG CACACAAOOO 4080 
OGACTCAACX: AGGAGQACXX AAGG3AAC0C 6ATAGAGTAC 6 



A6 Protein sequence t 



BSTs 



831 
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UAigene rnnnber: 
probeset Accession ffi 
Protein Accession ft: 
Signal sequence: 
Transcneinbrane domains: 
LDba domains: 
Cellular Localixationi 



1 11 21 

I i 1 

MHLLQFXiCLL LSSAAESQLL PGNNFTNEQI 
EKECPKAKSK OGPTPPPCAS OIHCIIGRFR 
KNGLCIDKSP lOXKJNNCQD NSDEESCESS 
VIFVLWALL ALVLHHQRKR MNLMTLPVHR 

yvasqabqua sevgsppsys gauiDqrpaw 

NSAS6QAASS LLSVEDTSHS PGQPGPQEST 



Hs. 293616 
AH0437e2 
none £ound 
1-17 
169-191 

26-66. 70-lOB* 112-149 
plasma membrane 



31 41 51 

I I I 

IPOJPMCSNG RCIPQAWQCD QLPDCPDKSD 60 

CSIGFBDCPDG SDEENCTANP LLCSTARYHC 120 

QEPG5GQVFV TSQlQIiVyyP SITYAIIGSS 180 

I1QHPVZ.LSRL WUJHPHHCN VryNVMNGIQ 240 

YDLPPPPYSS DTBSUIQADL PPYRSRSGSA 300 
ABPRDSSPSQ GTBEV 



A7 DMA 8EQUBNCB 
Gene namex 

Unigene number: 
Probeset Accession ft: 
Nucleic Acid Accession ft: 
Coding sequence: 



Integrin, alpha 5 (fibronectin receptor, alpha polypeptide) 

Hs. 149609 
X06256 

NM 002205 ^ ^ ^ „i 

1-3150 {underlined sequences correspond to start end stop coaonsj 



1 11 21 31 41 51 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCOGTOC AGCTGOGCTG GGGCCCCCGQ 
raCCGACCCC CGCTSSTGCC GCTGCTGTTG CTQCTSSTGC CGCCGCCACC CAGGGTCGGG 
GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGO GOCCCCCGGG CTCCTTCTTC 
GGATTCTCAG TCGAGTTTTA OOGGCOGGGA ACRGAiOGGGO TCAGTGTGCT GGTGGGAGCA 
CCCAAfiGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTO GTGCTOTCTA CCTCTGTOT 
TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 
CTGGAGTCCT CACTGTCCAG CTCAGAGGQA GAGQAGCCTG TGGAGTACAA GTCCTTGCAQ 
TGGTTCGGGG CAACAQTTCG AQCCCATGGC TCCTCCATCT TGGCATCOGC TCCACTOTAC 
AGCTGGCGCA CAGftGAAGGA GCCACTGAGC OAOCCCaTOJ GCAOCTGCTA CCTCTCCACA 
GATAACTTCA CCCGAATTCT GOAGTATOCA CCCTGCOGCT CA6ATTTCAG CIGGGCAGCA 
GGACAOGGTT ACTGCCAAGG AGGCTTCAGT GCOQAGTTCA CCAAGACTGG CCaTGTQGTT 
TTAGOTGOAC CAGQAAOCTA TTTCTGQCAA GGCCAGATCC TGTCTQCCAC TCAGGAGCAG 
ATTOCAGAAT crTATTACCC CGAGTACCTG ATCAACCTGO TTCAGGGGCA GCTGCAGA CT 
CGCCAGGCCA QTTCCATCTA TGATGACAGC TACCTAQGAT ACTCIGTGGC J^^GTOAA 
TTCAOTGQTG ATGACACAGA AGACTTTGTT GCTQGTGTGC CCAAAGGGRA CCTCACTTAC 
OGCTATGTCA CCATCCTTAA TGOCTCAGAC ATTCGATCCC TCTACAACTT CTCftGGGGAA 
CAOATGQCCT CCTACTTTGQ CTATGCAGTG GCCGCCACAG ACOTCAATGQ GQACGGGCTG 
GATCACrroC TCGTGOGGGC ACCCCTGCTC ATQ6ATCGGA CCCCTOACGG OOGGCCTCAG 
C5Af3GTGGGCA GGGTCTACGT CTACCTOCAO CACCCAGCOO GCATAOAGCC CAOQCCCACC 
CTTACCCTCA CTGGCCATQA TGAGTTTGGC OQATTTGGCA OCTC CTTGAC CCCCCTGQGO 
GACCTCOACC AGGATGGCTA CAATGATOTG GCCATCGGGG CTCCCmGG TGGG6AGACC 
CAGCRGGGAG TAGTCTTTGT ATTTCCTGGG GGCCCAGGRG GOCTGGGCTC TAAGCCTTCC 
CAGGTTCTGC AGCCCCTGTQ GGCAGCCASC CACACCCCAO ACTTCTTTGO CTCTOCCCTT 
CGAGOAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TQATTOTOGO OTCCTTIGGT 
GTGGACAAGG CIGTGGTATA CAGGGGCCGC CCCATOGXOT C0GCTA6TGC CTCCCTCAOC 
ATCTTCCCCQ CCATGTTCAA CCCAOAGOAG OGGAGCTGCA GCTTAQAQGG GAACCCTGTG 
OCCTGCATCA ACCTTAOCTT CTGCCTCAAT GCTTCTGQAA AACACOTTGC TGACTCCATT 
GGTTTCACAO TGQAACTTCA GCTCGACTGO CAGAAGCAGA AGGGAOGGGT ACGGCGG6CA 
CTOTTCCrGG CCTCCAGGCA QGCAACCCTG AOCCAGACCC TGCTCATCCA aAATGGGOCT 
OGAGAGQATT QCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCA6AATT TCXSAQACAAA 
CTCTGGC06A TTCACATOGC TCTCAACTTC TCCTTGGACC CCCAA6CCCC AGTGGACAGC 
CACG6CCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 
ATCTTGCTGG ACIGTGGAGA AOACAACATC TGTGTGCCTO ACCTQCA6CT GGAA6TGTTT 
GGGGAGCAGA ACCATCTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC "TOWTGCC 
CAOAAlOTGa OTGAGGGTGG CGCCTATQAO GCTGAGCTTC GGOTCAOOGC CCCTOCRGAO 
GCTGAGTACX CAGQACTCOT CAGACACCCA GGQAACTTCT CCAGCCTGAG CTGTGACTAC 
TTTOCOG'rQA AOCAGAGCCO CCTGCTGGTO TGTGACCTGG GCAACCCCAT QAAGGCAGGA 
GCCAGTCTGT GGGGTGGCCT TCOGTTTACA 6TCCCTCATC TCCGGGACAC TAAGAAAACC 
ATCCAGTTTG ACTTCCAGAT CCTCAGGAAG AATCTCAACA ACTCGCAAAO CaAOBrQGTT 
TCCTTTCGGC TCTOCGTGGA GGCTCAGGCC CAG6TCA0CC TOAACGOTGT CTCCAAGCCT 
GAGGCA0T6C TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA QAAGGAGGAO 
GACCTGGGAC CTGCTOTCCA CCATGTCTAT GAGCTCATCA ACCAAGQCCC CAGCTCO^TT 
AGCCAGGGTG TGCTGGAACT CRGCTGTCCC CAGGCTCTGO AAGGTCAGCA GCTCCTATAT 
GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC ^^^^^^^ 
OAGTTGOATC COGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGC00CA6C 
TCTOCXTOCT OGGOACCTCA GATCCTGAAA TGCCOGGAOG CTGAGTGTTT CAGGCTGCGC 
TOT0A6CTO0 OGCCCCTGCA CCAACAAGAG AGCCAAAQTC TGCAGTTGCA TTTC^AGTC 
TCGGCCAAGA CTTTCTTQCA GCGGQAGCAC CAGCCATTTA GCCTGCAGTG TOAGOCTGTO 
TACAAAGCCC TGAAGATGCC CTACOGAATC CIGCCTCGGC AOCXGOCCCA M^^^ 
CAGGTCGCCA CAGCTOTGCA ATGGACCAAG GCAGAAGOCA OCTATQGCOT CCCACTOTM 
ATCATCATCC TAGCCATCCT GTTTGGCCTC CKJCTCCTAO QTCIACTCAT CTA^TOCTC 
TACAA6CTTG QATTCTTCAA AOGCTCCCTC CCATATGGCA COGOCATOGA AAAAGCTCAO 
CTCAAOGCTC CAOCCACCTC TQATOO CTQA 



60 
120 
IBO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



832 



wo 03/042661 



PCT/US02/36810 



A8 Protein sequence; 
Gene name: 
Unigene number: 
Probeset Accession tti 
Protein Accession ttt 
Signal 8e<iuencet 
Transmembrane domains 
Integrln alpha domalnsi 56-115. 268-316 
Cellular Localisation: plasma membrane 



Integrln, alpha S (fibronectin rec^tor, alpha polypeptide) 
Hs. 149609 
X062S6 
NP 003196 
1-42 
996-1020 

322-384, 3e8>444, 452-503, 1022-1036 



1 11 21 31 41 51 

I I I I i i 

M6SRTPBSPL HAVQLRWGPR RRPPLXiPLLL LLLPPPPRVO OFNIiDAEAPA VIiSGFPGSFP 60 

6FSVBPYRF0 TDGVSVLVGA PKANTSQPGV LQGGAVYLCP HGASPTQCTP IEFDSK6SRL 120 

LBSSLSSSEO EEFVSrKSXiQ HFGATVRAHG SSILACAPLY SWRTEKBPLS DPVGTCYLST 180 

DMPTRILBYA PCRSDFSWAA GQGYOQGGFS AEFTKTGRW LGGPGSYFKQ GQIL8ATQBQ 240 

lABSyyPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVOE PS6DDTEDFV AGVPKGNLTY 300 

GYVTILNGSD IRSXiYHPSGB QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVCaivyvYLQ HPAOXBPTPT LTLTGHDBFG RFGSSLTPLG DmOGYNDV AIGAPFGGET 420 

QQGWFVFPG GPG6LGSKPS QVLQPLWAA8 HTPDPFGSAL R6GRDLDGNG YPDLIVGSFG 480 

VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSIiEGNFV ACINLSFCUI ASGKKVADSX 540 

GFTVELQLDW QKQKG6VRRA LFLASRQATL TQTLLXQNGA REDCRB1KIY LRNBSEFRDK 600 

LSPIHIALNF SLDPQAFVDS HGLRPALHYQ SKSRIBDKAQ ILLDGOBDNI CVPDLQLBVF 660 

GEQNHVYLGD KNALHLTFHA QNV6BGGAYE AELRVTAPPB AEYSGLVRRP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLW6GLRFT VFHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

SFRLSVEAQA QVTIiNGVSKP EAVLFPVSDW HFRDQPQKEB DLGPAVHHVY ELIMQQPSSI 840 

8QGVLELSCP QALBGOOLLY VTRVTGLNCT TNHPIKPRGL EliDPEGSLHH QQKREAPSR8 900 

SASSGFQILR CPEAECFRLR CELGPLKQQE SQSLQUIFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQNTK AEGSY6VPLH IIZLAXLFGL LLljGLLXYXL 1020 
YXLGFFKRSL PYOTAMBKAQ UCPPAT8DA 



A9 DNA SEQUENCB 

Gene namet Integrln, beta 1 (fibronectin receptor, beta, antigen CD29 includes MDF2, MSK12) 

Unigene number t Hs. 267797 

Probeset Accession X07979 
Nucleic Acid Accession #: nm_002211.i 

Coding sequence; 1-2397 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I 1 I 

^QAATTTAC AACCAATTTT CTGGATTGGA CTGATCAGTT CAGTTT6CTG TGTGTTTGCT 60 

CAAACAGATO AAAATAGAT6 TTTAAAAGCA AAT6CCAAAT CATGTGGAGA ATGTATACAA 120 

GCAGGGCCAA ATTGTGGGTQ GTGCACAAAT TCAACATTTT TACAGGAAGG AATGCCTACT 180 

TCT6CACGAT GTGATGATTT AGAAGCCTTA AAAAA6AAGG GTrGCCCTCC AGATQACATA 240 

GAAAATCOCA 6AGGCT0CAA AGATATAAAG AAAAATAAAA ATGTAACCAA GG9TAGCAAA 300' 

G6AACAGGAG AQAAGCTCAA GCGAOAGQAT ATTACTCAGA TCCAACCACA GCAGTT6OTT 360 

TTGCGATTAA GATCAGGQQA GCCACAGACA TTTACATTAA AATTCAAOAG AGCTGAAGAC 420 

TATCCCATTG ACCTCTACTA CCTTATGGAC CTGTCTTACT CAATGAAAQA CGATTTGGAO 480 

AATGTAAAAA GTCTTGGAAC AOATCTOAXO AATQAAATGA GGAGQATTAC TTCGOACTTC 540 

A6AATTGGAT TT6GCTCATT TGTQQAAAAG ACTGTQATGC CTTACATTA6 CACAACACCA 600 

GCTAAGCTCA GOAACCCTTO CACAAOTQAA CAGAACT6CA CCAGCCCATT TAOCTACAAA 660 

AATGTGCTCA GTCTTACTAA TAAAGGAQAA GTATTTAATG AACTTQTTGG AAAACAGCGC 720 

ATATCTGGAA ATTTGGATTC TCCAGAA66T GGTTT06ATG CCATCATGCA AGTTGCAGTT 780 

TGTGGATCAC TGATTGGCTG QAG6AATGTT ACAGGGCTGC TGGTGTTTTC CACAGATGCC 840 

GGOTTTCACr TTOCTUQAGA TG6GAAACTT GGTQGCATTO TTTTACCAAA TGATGGACAA 900 

TGTCAOCTGG AAAATAATAT 6TACACAAT0 A6CCATTATT ATGATTATOC TTCTATTGCT 960 

CACCTTGTCC AGAAACTGAG TGAAAATAAT ATTCAGACAA TTTTTGCAGT TACTGAAGAA 1020 

TTTCAGCCTG TTTACAAGGA GCTGAAAAAC TTGATCCCTA AOTCAGCAOT AGGAACATTA 1080 

TCTGCAAATT CTAGOUITGT AATTCAGTTG ATCATTGATO CATACAATTC CCTTTCCTCA 1140 

GAAOTCATTT TQQA A AACQQ CAAATTGTCA QAAGGACTTAA GAATAAOTTA CAAATCTTAC 1200 

TGCAA0AA06 GGOTGAATGQ AACA6G6GAA AATG6AA6AA AATOTTCCAA TATTTOCATT 1260 

GGAGATGAGG TTCAATTTGA AATTAGCATA ACTTCAAATA AGTGTCCAAA AAAGGATTCT 1320 

GACAGCTTTA AAATTAGGCC TCT6G6CTTT ACG6AGGAAG TAGAGGTTAT TCTTCAGTAC 1380 

ATCTGTGAAT GTGAATQCCA AA6C6AAG6C ATOCCTGAAA GTCCCAA6TG TCATGAAGGA 1440 

AATGGOACAT TTQAGTOTGG C6CGTGCAG0 TGCAATOAAO Q QO OTqTT G Q TAOACATTOT 1500 

GAATGCAOCA CAGATGAAGT TAACAGTGAA GACATGQATO CTTACTGCAG GAAAGAAAAC 1560 

AGTTCAGAAA TCTGCAGTAA CAAT6GAGAG TGCGTCTGCG GACAOTGTGT TTGTAGQAAO 1620 

AGGGATAATA CAAATGAAAT TTATTCTGGC AAATTCTGCG AGTGTQATAA TTTCAACTGT 1680 

GATAGATCCA ATG6CTTAAT TTOTGGAGQA AATOOTOriT GCAAGTGTCO TGTGTGTGAO 1740 

TGCAACCCCA ACTACACTGG CAGTGCATGT GACIOTTCTT TGGATACXAO TACTIGTGAA 1800 

GCCAGCAAOQ GACAGATCTO CAATOG0060 GGCATCTGCG AGTGIOQTGT CTGTAAGTGT 1660 

ACAGATCCGA A6TTTCAAGG GCAAAOSTGT GAGATGTOTC AGACCTGCCT TGGTGTCTGT 1920 

GCTGA6CATA AAGAATGTGT TCAGTGCAGA GCCTTCAATA AAGGAGAAAA GAAAGACACA 1980 

TGCACACAGG AATGTTCCTA TTTTAACATT ACCAAGGTAO AAAGTOGOQA CAAATTACCC 2040 

CAOCOQGTOC AACCTGATCC TQTGTOCCAT TOTAAOOAGA AGGAT(3TTGA CGACT CT T GQ 2100 

TTCTATTTTA OGTATTCAOT GAATOGQAAC AACQAOQTCA TGOTTCATOT TGTOGAGAAT 2160 

CCAGAGTGTC CCACTGOTCC AGACATCATT CCAATTGTAO CTGGTGTGGT TGCTGGAATT 2220 



833 
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GTTCTTATTG GCCTT G CMT ACTGCTOATA TGGAAGCTTT TAATGATAAT TCATGACAGA 2280 
AGQGAGTTTG CTAAATTTGA AAAGGAGAAA AT6AATGCCA AAT606ACAC GGGTGAAAAT 2340 
CCTATTTATA AGAGTGCGQT AACAACTGTO GTCAATCC6A AGTAT6A6G0 AAAATQA 



Alo Protein eecwence; 
Gene name: 
Ukilgene nun4>ert 
Probeset Accession 0: 
Protein Accession 9i 
Signal sequence: 
Transmembrane domains t 
ZNB domain; 
PSI domain t 

Cellular Localization: 



Zntegrin, beta 1 (fibronectin receptor, beta, antigen CD29 includes MDF2« MSK12) 

B8. 287797 

X07979 

NP_002202.1 

1-21 

732-754 

34-464 

26-76 

plasioa membrane 



1 11 21 31 41 51 

i I I I I I 

MMLQPIFWIG LISSVCCVFA QTDENRCLKA NAKSCGBCIQ AGPNCGWCTN STPLQEGMPT 60 

SARCSDLEAL KKKGCFPDDI ENPR6SKDIK KNKNVTKRSK GTAEXLKPBD ITQIQPQQLV 120 

LRLR5GBPQT FTLKFKRAED YPZDLYYLMD LSYSMXDDLB NVKSLGTDLM NEMRRIT8DF 180 

RIOPOSPVEK TVMPYISTTP AKLRNPCTSE QNCrSPPSYK NVLSLTNKGB VFNELVGKQR 240 

ISGNLDSPEG GFDAIMQVAV 06SLIGWRNV TRLLVFSTDA GPHFAGDGKL GOIVLPNDGQ 300 

CHLEanWYTM SHYYDYPSIA HLVQKLSBNN IQTIFAVTEB FQPVYKBLKN LIPKSAVQTL 360 

SANSSNVIQL IIDAYNSLSS EVILENGKLS EGVTISYKSY CKNGVNGTGE NGRKCSNISI 420 

GDBVQPEISI TSKKCPKKDS DSFKIRPLQF TBEVEVILQY ZCECBCQSBO ZPBSPRCXEG 480 

NGTPECQACR CKE6RVGRHC EC8TDEVNSE DMDAYCRKEN SSEICfiKNGE CVCGQCVCRR 540 

RDNTNBIYSG KPCEC3»IPNC DRSNGLIOGG NGVCKCRVCE OIPNYTGSAC DCSLDTSTCB 600 

ASKGQIC21GR GICECGVCKC TDPKPQGQTC BMCQTCLGVC AEHKECVQCR AFNKGBKKDT 660 

CTQBCSYFNI TRVBSRDKLP QFVQPOPVSH CKEIOIVDDCW FYFTYSVNGN NEVMVKWBH 720 

PECPTGPDII PIVAGWAGZ VLZ6LALLLZ WKLLMZZBDR REPAKPBKEK MMAXHDTGEN 780 
PIYKSAVTTV VNPKYBGK 

All DNA SEQUBWCB 

Gene namei selectin E (endothelial adhesion molecule 1} 

UDigene number: Hs. 89546 

Probeset Accession ft: N24736 

Nucleic Acid Accession #t KM_000450 

coding sequence: 1-1833 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGATTGCTT CACAGTTTCT CTCAGCTCTC ACTTTGGTGC TPCTCATTAA AGAQAGTGGA 60 

GCCTGGTCTT ACAACACCTC CAGGOAAGCT A7GACTTAT0 ATGAGGCCAG TQCTTATTGT 120 

CAGCAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAG AAGAGATTGA GTACCTAAAC 160 

TCX3VTATTGA GCTATTCACC AAGTTATTAC TGQATTQQAA TCAQAAAAGT CAACAATGTQ 240 

TGGGTCT6G0 TAGOAAOCCA 6AAACCTCTG ACAGAAGAAG CCAAGAACTG GGCTCCAOGT 300 

6AACCCAACA ATAGGCAAAA AGATGAGGAC TGC6TGQAGA TCTACATCAA GAGAOAAAAA 360 

GATOTGGGCA TGTGGAATGA T6A6AGGT0C AGCAAGAAGA AGCTT6CCCT ATGCTACACA 420 

GCTGCCTGTA CCAATACATC CTGCAGTGGC CAOGGTGAAT GTGTAGAQAC CATCAATAAT 480 

TACACTTGCA AGTGTGACCC TGGCTTCAOT OOACTCAAGT GTGAGCAAAT TOTGAACTQT 540 

ACAGCCCrGG AATCCCCTGA 6CAT66AAGC CTGCrTTTGCA GTCAOCCAC7 GGQAAACTTC 600 

A6CTAGAATT CTTOCTGCTC TATGAGCIQT aATAGGOOTT ACCIGCCAAO CA6CATGGA6 660 

AOCATGCAGT GTATGTCCTr T6GAGAATG6 A G TGCTOCTA TTCCAGCCTG CAATGTGGTT 720 

GAGTGTGATG CTGTGACAAA TCCAGCCAAT GGGTTOGTGG AATGTrTCCA AAACCCTGGA 780 

AOCTTCCCAT GGAACACAAC CTGTACATTT GACTGTGAAG AAGGATTTQA ACTAATGGGA 840 

GGCCAGAGCC TTCM3T6TAC CTCATCTGGG AATTGGQACA ACQAGAAGCC AACGTOTAAA 900 

GCIGTGACAT GGAGGGCOQT CCGCCA6CCT CAGAATGQCT CIGTGAGGTG CA6CCATTCC 960 

CCTGCTGOAG AQTTCACCTT CAAATCATGC TGCAACTTCA CCTGTGAGGA AGQCTTCATG 1020 

TTOCAGGOAC CAGCCCA6QT TGAAT6CA0C ACTCAAGGGC A6TGGACACA GCAAATCCCA 1080 

OTTTOTGAAG Ciri 'C C A GTO CACAGCCTTG TCCAACCCCG AGCGAGGCTA CATGAATTGT 1140 

CTTCCTAGTO CTTCTGGCAQ TTTCC6TTAT GGOTCCAGCT GTQAGTTCTC CT6TGA6CAG 1200 

GGTTTTGTGT T6AAGGSGATC CAAAAGGCTC CAATGTGGCC GCACAGGGGA GTGQGACAAC 1260 

GAGAAGCCCA CKTGTGAAGC TQTGAGAXQC GATGCTGTCC ACCAGCCCCC QAAaGGTTTS 1320 

GTGAGGTGTG CTCATTCCCC TATTCGAOAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 1380 

TGTGA6GAG6 GATTTGAATT ATATGQATCA ACTCAACTTG A8T0CACATC TCAGGGACAA 1440 

TGOAGAGAAO A6GTTCCTTC CTGCCAAGTG GTAAAATGTT CAAQCCTGGC AGTTCCGGGA ISOO 

AAQATCAACA TQAGCTGCAO TOGGGAOCCC GTOTTTGGCA CTOT O T Q C A A OTTCGCCrGT X560 

CCTOAAGGUVT GGA08CTCAA TGGCTCT6CA GCTCGGACAT GTGGAGCCAC A6GACACTG6 1620 

TCTOGCCTGC TACCTACCTG TOAAGCTCCC ACTGAGTCCA ACATTCCCTT GGTAGCTGGA 1680 

CTTTCTGCTG CTGGACTCTC CCTCCTQACA TTAGCACCAT TTCTCCTCTG GCTTCGGAAA 1740 

TGCTTA066A AAGCAAAGAA A TTT G TTCCT 6CCAGCA6CT 6CCAAAGCCT TGAATCAGAC 1800 
G6AAGCTACC AAAAGCCTTC TTACATCCTT TAA 



A12 Protein sequence; 
Gene namei 
TItiigene number: 
Probeset Accession St 
Protein Accession #: 
Signal sequence: 
Transmembrane domains: 
C- lectin domain: 



Selectin B (endothelial 

K8.B9S46 

M24736 

NP_000441 

1-22 

555-573 

23-139 



adhesion molecule 1) 



834 



wo 03/042661 



PCT/US02/36810 



Cellular Localization i plasma membrane 



1 11 21 31 41 51 

1 t I I i I 

MIASQFLSAli TLVLLIKESG AWSYNTSTBA HTyDBASAYC QQRYTHLVAI QKKEEIEYU9 60 

SIIjSYSPSYY WIGIRKVNNV WVWVGTQKPL TBBAKNWAPG EPNNRQKDED CVBIYIKRBK 120 

DVGMHMDERC SKKKLALCYT AACTNTSCSG H6SCVETIMM YTCKCDPGPS GLKCBQIVMC 180 

TALBSPBUGS I.VCSHPL6NF SYNSSCSISC DROYLPSSHB TMQCHSSGEW SAFZPACNW 240 

BCDAVTOPAN GFVBCFQNPO 9PPHNTTCTP DCEBGFBLM6 AQSLQCTSS6 NWDMEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGBFTFKSS CNFTCBEGPM LQGPAQVBCT TQGQWTQQIP 360 

VCEAFQCTAL SNPBRGYMNC LPSASGSPRY GSSCEFSCEQ OFVLRGSKRL QCGPTGEWDN 420 

EKPTCEAVRC DAVHQPPKOL VRCAHSPXGB PTYXSSCAFS CEEGFELYGS TQLECTSQGQ 480 

WTEEVPSOQV VKC8SLAVPG KINM8CSGBP VFGTVCXFAC PBQMTUfOSA ART08AT6RW 540 

SGUiPTCBAP TBSNIPLVAG I.SAAOI1SWT LAPFLUrLRK CLRXAKXFVP ASSOQSLBSD 600 
GSYQKPSYIL 



A13 DNA SEQUENCE 
Gene name: 
unigene nundoer: 
Probeset Accession Ut 
Nucleic Acid Accession 4: 
coding sequence: 



0 protein- coupled receptor 39 

Hs. 85339 

AA349893 

NM_001508 

1-1362 (underlined sequences correspond to start and stop codons) 



1 

I 

ATGQCTTCAC 
CCCGAGTTTG 
TTCGT6ATGQ 
AAAGGATACT 
TTGQTGTTCC 
ACQTCCAGCT 
GCTACGCTGC 
TTCAGOTACA 
GTCACCTCCG 
GTGAAC6TGC 
CAGCCCGAGA 
CAGTCCAGCA 
AIGTGCTOQA 
A0GCG6CCTC 
ACCATCATCT 
ATTCGGAGGA 
6G0TACATGA 
CGGCTCCTGT 
TGCCGCCTGT 
ACCACCOACA 
TCTGCAAGGA 
TCTAAGTCCC 
AATTCTGCTG 




CCAGCCACCG 
CCTCCAATAT 
TCTTCQGCGC 
ACATGATGCA 
GGCAGCTGAG 
TCCTQAGGCT 
TCATGGCTGC 
TCCTCCrCCC 
ACACOGTGTC 
OGCTGCAGGA 
GCGCCOGCTT 
GAACTGAGAA 
AQTCATTGAO 
CAGAGAATOQ 



21 

I 

GGGCAGTGAC 
CTGGATCAAA 
GAACAGOSCC 
GGTGACAGAC 
GCCCATGQAG 
CTGCAAQCTG 
GACACTCAQC 
GGGACCTTGC 
ACTGCCCTTG 
GGGTCTCACT 
GTCCATCTGT 
CTTCGTGGTC 
GOTGCTGATG 
GAAGTCOQAO 
QATTGTTGTQ 
GGCCAAACCC 
CTTCTOGGAO 
CTCGCAGCAG 
GGCCAACCAC 
TGTGCAGC6C 
GATTTTCTTA 
TCTOGAGTCA 
TTTTCAGGAG 



31 

I 

TGCTCCCAAA 
ATCACCCTTA 
ACCATTGGGG 
CACATG6TGA 
TTCTACAGCA 
CACACTTTCC 
TTTQAGOGCT 
CAGOTGAAGC 
CT0TTT6CCA 
TGCAACCGCT 
ACCAA CCTC T 
TACCTOGTGG 
AAAAGGCAGA 
AGCGAAGAGA 
ACATTGGCOG 
AAGCACGACT 
AOGTTTTTCT 
TTT0660GGO 
GAGAAGOGCC 
COGTTGCTCT 
AGCACTTTTC 
CTA6AGCCCA 
CATGAAaTTT 



41 

t 

TCATTGATCA 
TTCTGGTGTA 
TCACCCA6GT 
6TTTGGCTT0 
TCATCTGGAA 
TCTTCGAGQC 
ACATCGCCAT 
TGCTGATTGO 
TGGGTACTGA 
CCAGCACCCQ 
CXaWSCCGCTG 
TCCTGCTCTC 
AGQGCTOQCT 
GCAGGACCGC 
TATGCTGGAT 
GGACGAGGTC 
ACCTCAOCTC 
TCTTCOTOCA 
TG06GGTACA 
TOGCGTCCOG 
AGAGOGAGGC 
ACTCAGGOGC 
GA 



51 
I 

CAGTCATGTC 
CCTGATCATC 
GCTOGAOAAG 
CTGGGACATC 
TCCCCXGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCQTCTGO 
GTACCCCCTG 
CCACCACGAG 
GACCGTGTTC 
06TAGCCTTC 
GGCCQGGGGC 
CAGGAGGCAG 
GCCCAAGCAG 
CTACTTCCGG 
GGTCATCAAC 
OQTGCTQTQC 
T60GCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAQCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



A14 Protein sequence! 
Gene name* 
unigene nun^r: 
Probeset Accession (Hi 
Protein Accession Ot 
Signal sequences 
P£am domains: 
Transmeiobrane domains t 
Cellular Localizationi 



G protein- coupled receptor 39 

Bs. 85339 

AA349893 

NM_001508, NP_001409 
none found 

7tO_l C72-172, 224 -344 J 

32-54, 68-90. 111-133, 151-173, 

plasma membrane 



221-243, 280-301, 320-342 



1 11 21 31 41 51 

I I I I I 1 

MASPSLPGSD CSQZIDHSHV PBFBVATWIK ITLILVYLII FVMOIiLQTSA TIRV TQVU?K 60 

KGYLQKBVTD HMVSIACSDI LVPLIGMPME FYSIIWNPLT T8SYTLSCKL HTFLFEACSY 120 

ATLIiHVLTLS PERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL X*FAMGTBYPL IBO 

VNVPSHRGLT OIRSSTRHRB QPBTSNMSIC TNLSSRWTVF QSSZFGAFW YLWLLSVAF 240 

MCHNMMQVLM RSQKGSLAOG IRPPQLRXSE SEESRTARRQ TIIFLRLIW TLAVCWMPNQ 300 

IRRZMAAAKP RHDNTRSYFR AYMILZtPFSB TFFYLSSVZN FLLYTVSSQQ FSRVFVQfVLC 360 

CRLSLQHANB BSRIAVHABS TTDSARFVQR PLLFASRRQS 8ABRTEKIFL STFQSBABPQ 420 
SKSQSXiSLBS LBPNSOAXPA NSAABIGFQE REV w 

AT Prostate 

AIS DNA sequence 

Gene name: CEGPl 

Utkigene nunibert Hs. 222399 

Prdbeset Accession «t AA256485 

Nucleic Acid Accession fti AJ400877 

Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

835 



wo 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



I I I I 1 I 

GGOOTOOOGO CACACCTCCC OO00CO SC XX3 0C6CCIVCGQC OOKACTCOG CCX3CCTCTGC 
COGOU IOOGC TGAGCCKTCC ATGG GGG TOG 0QG6C06CAA CUG l 'OCaS GG GOGOCCTGGg 
CGGTGCTGCT GCTGCTGCTG CTGCTGOOGC C3VCT6CTGCT GCTGGOSGGG 6C0GT0C0QC 
CGGGTOSGCSO COQTQCCXSOQ GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAO 
ATOACTGCCh TGCOOAOGCX: CTGTGTCAGA ACACACCCAC CTCCTACAAO T6CTCCTGCA 
A8CCTGGCTA CCAAQOO QAA GGCft QQCftQ T aTOAaOACAT GQATQAAT6T 6GAAAT6AGC 
TCAATGGAGG CTGTGTCCAT 6ACTGTTTGA ATATTCCAGG CAATTATG6T TGCACTTGTT 
TTGATGQCTT CATOTTGGCT CATOAOGGTC ATAATTGTCT TGATGTQQAC GAGTOCCTGG 
AGAACAATOO COQCTOCCAa CATACCTOTO TCAACGTCAT GGGGAGCTAT QAGTGCTGCT 
GCAAGGAG6G GTTTTTCCTQ AGTGACAATC AGCACACCTO CATTCACOGC T06GAAGAGG 
OGCXGAGCTQ CATGAATAAO 6ATCA0GGCT OTAGTCACAT CTOCAAGGAG G0CCCAAG6Q 
OCaGOGTCOC CTGTQAGTGC AG6CCTGGTT TTGA6CTGGC CRAGAACCAO AOAGACTGCA 
TCrrOACCTQ TAACCATQGG AAOGGTGGGT GCCAGCACTC CTGTGACGAT ACAOCCQATG 
GCCCAGAGTG CAQCTGCCAT CC31CAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 
AGOGAGASGA CACTGTGCTG GAGGTGACAO AGAGCAACAC CACATCAGTG GTGGATGGGG 
ATAAAOGGGT GAAAaSGOQG CTGCTCATOG AAAOGTGTQC TGTCAACAAT 0GAGGCT6TG 
A0CX3CACCTG TAAGGATACT TCS3ACAG6TG TCCACT6CAG T'ltrA' C Clim' GGATTCACTC 
TCCAOTTGGA TGGGAAGACA TGTAAAGATA TTGATGAaTG CCAGACCCX3C AATGGAGOTT 
GTGATCATTT CTGCAAAAAC ATOGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 
AATTATTAAC AGATGAGAAG TCTT6CCAA0 ATGTGGATGA GTGCTCTTTG GATAGGACCT 
GTGAOCACAO CTGCATCAAC CAGOCTOGCA GATTTGCTTG TGCTTGCAAC OGAGOGTACA 
CGCTOTATGQ CTTCACCCAC TGTGQAGACA CCAATGAGTG GAQCATCAAC AACOGAOGCT 
OTCAGCAOGT CTGTOTGAAC ACAGTGQGCA GCTATOAATG CCAGTGCCAC CCTGGGTACA 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 
CACCCOGTGT GTCCCTGCAC T6CGGTAA0A GTG6TGGAG6 AGA06GGTGC TTCCTCAGAT 
GTCACTCTGO CATTCACCTC TCTTCAGATG TCAGC31CCAT CAGGACAAGT GTAACCTTTA 
AOCTAAATOA A6QCAA0TGT AQTTTGAAAA ATQCTQAOCT GTTTOOOQAO GOTCTGCQAC 
CAGCACTACC AGAOAAGCAC AGCTCAGTAA AAGAQAGCTT COSCTAOGTA AACCTTACAT 
GCAGCTCrOG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AA6CACCCCT AAGQAAATGT 
TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 
TQAGCTQCAT 0GTAAA60QA ACOGAGAAGC GGCTCCGTAA AGCCATC06C AGGGTCAGAA 
AGQCOGTCCA CAGGGAGCAG TTTCACXrTCC AGCTCTCAGO CATGAACCTC GACGTGGCTA 
AAAA6CCTCC CAQAACATCT QAAOGCCAGG CAGAGTCCTQ TGGAGTGGGC CAGGGTCATQ 
CAGAAAACCA ATOTOTCAOT TGCASOGCTG GQACCTATTA TGATQQAQCA C3GAGAACGCT 
GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 
GCCCA AGAOC AGQAAATTGT OQGQ OCrTGA AaACCOGAGA A6CTTGQAAT ATGTCTOAAT 
6TGGAGGTCT GTGTGAACCT GQTQAATATT CTGCAGAT6Q CTTT6CACCT T6CCAGCTCT 
GTGCCCTGGQ CACGTTCCAO CCTGAAGCTG GTCXSAACTTC CT6CTTCCCC TGTGGAGGAG 
GCXTTGCXaiC CAAACATCAG GOAGCTACTT CCTTTCAGOA CTQTGAAACC AGAGTTCAAT 
6TTCA0CTGG ACATTTCTAC AACACCACCA CTCAC06ATG TATTCGTTGC CCAGTGGGAA 
CATACCAOGC TOAATTTGGA AAAAATAATT O T O r r TC Xl 'q CXXAGGAAAT ACTAOOACTO 
ACTTTGAT6G CTCCACAAAC ATAAOCCAGT GTAAAAACAG AA6AT6TGGA GGGGAGCTOG 
GAGATTTCAC TGGGTACATT GAATCCOCAA ACTACCCAGG CAATTACCCA GCCAAC3VC00 
AQTGTACX3TG GACX^TCAAC CCACCCCCCA AGCQCOSCAT CCT6AT0GTG GTCCCTQAGA 
TCTTCCTGCC CATAGAG6AC 6ACTGTGGG6 ACTATCTGGT GATG06GAAA ACCTCTTCAT 
CCAATTCTGT GACAACATAT GAAACCTOGC AGACCTAOGA ACX3CGGCATC 6CCTTCACCT 
CCAGGTCAAA GAAGCTGTG6 ATTCAGTTCA AGTCCAAT6A AGGGAACAGC GCTAGAGGGT 
TCCAGGTCCC ATAOGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAQTTC 
GA6ATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGQAT AAQAAACTTA 
TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACA6CCCAG6 
AGTO OOQAOA GATGTTTOCA AOATGOTTCA TCOGATTGCT AOSTTGCAAA OTOTCCAGGT 
TTTT6AGACC TTACAAATGA CTGA6CCCAC GTGCCACTCA ATACAAATGT TCT6CTATAG 
GOTTGOTGGG ACAGAGCTGT CTTCCTTCTG CATQTCA0C3V CAGTOGGOTA TTGCTGCCTC 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 
GAACXTGOTT TTTCTTTCCC AGGArCOTGG ATGTAGACTG AGAATGGCTT TGAGTOGCAT 
CAQCTTCTCA CTGCT6TGGG GGGATGTCTT 6GATAGATCA OQGGCTGGCT GAGCTGGACT 
TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGOGOTC TTACTCCTCC TCAAGGAGTC 
TGTAGTGGAA AGQAGGCCAC AGAATAA6CT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT OGTGTGCAGG CTCTGACCAQ GCAGAACAGG 
CAAGAGGGGA GGQAAGQAGA CCCXTTGCAGO CraCCTCXAC CCAOCTTGAG ACCTGOGAGG 
ACTCAGTTTC TOCACAGCCT TCTGCAGCCT GTGTQATACA AGTTTGATCC CAGGAACTT6 
A8TTCTAAGC AGTGCTOGTO AAAAAAAAAA QCAaAAAGAA TTAOAAATAA ATAAAAACZA 
AGCACTTCTG GAGACAT 



A16 Protein eequence 
Gene namei 
Unigene nuiriber: 
Prcd>e6et Accession #t 
Protein Accession #i 
Signal sequence* 
Transmendsrane doroainst 
PFAM 



CBGPl 
HB. 222399 
AA2564e5 
CAB92285 
1-31 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2560 
2640 
2700 
2760 
2820 
2860 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 



Cellular Local izationt 



EGF-like domains {49-84, 132*167«177-213, 286-321. 407-442) 
CUB_domain [809-918] 
may be secreted 



80 



1 11 21 31 41 51 

I t I I I I 

KGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDBCA QGLDDCHADA 60 

L0QNTPT8YK C8CKPGYQGE GRQCBDIDEC QIELNGGCVH DCLHIPGNYR CTCFDQFMLA 120 

RDGHNCLDVD ECLENllGaOQ HTCVHVKOSY b60CKBQFPL SDNQBTCXBR SBBGLSCMNX 180 

DHGCSBZCXB APRGSVACBC RPOPBtiAKKQ RDCILTCNRG NGGOQHSCDD TADGPECSOi 240 

PQYKKIITDGR SCLERBDTVL BVTBSNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 

836 



wo 03/042661 



PCT/US02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



STGVHCSCPV 
SCQDVDECSL 
TVGSYECQCH 
SSDVTTIRTS 
PC5APGRPSTP 
FHLQLSGMNL 
TFQNEEGQMT 
PBAORTSCFP 
KNNCVSCPGN 
PPPKRRILIV 
IQPKSNEGNS 
ZABPQNYFinr 



QPTLQLDGKT 
DRTCDHSCIN 
POYKLHNNKK 
VTPKXNEGKC 
KEMPITVEPB 
DVAKKPPRTS 
CBPCPRPOIS 
0G60LATKHQ 
TTTDFDGSTM 
VPBIFItPIED 
ARGPQVPyVT 
TAQB8REMPP 



CKDIDEOQTR 
HPGTFACACM 
DCVEVKGLLP 
SUCNAEIiFPB 
LBTKQKEVTA 
BRQAESOGVG 
GALRTPEAKM 
GATSFQOCET 
ITOCKNRROS 
DCGDYLVMRK 
YDEDYQELIE 
R8PXRLLR8R 



NGGCDHPCKN 
RQYTIiYGPTH 
TSV8PRV6IA 
GLRPALPBXH 
SCDLSCIVKR 
QGHAENQCVS 
MSEOQQLCQP 
RVQCSPGRPy 
GBLGDPT3YI 
TSSSNSVTTY 
DXVRDGRI»YA 
VSRFXiRPYK 



rVGSFDOGCK 
CQ0TNEC8IN 



SSVKBSPRYV 
TEKRLRKAIR 
CRAGTYYDGA 
GEYSADQPAP 
NTTTHRCIRC 
ESPNYPGNYP 
BTCQTYBRPl 
SBIHQEILKD 



KGFKLLTDEK 
N660QQVCVM 
FLRCHSGIHL 
NLTCSSGKQV 
TLRKAVHRBQ 
RERCILCFNG 
OQLCALGTFQ 
PVGTYQPBFO 
ANTECTWTIN 
AFTSRSKKLH 
KKLIKALPOV 



360 
430 
480 
540 
600 
660 
720 
7B0 
640 
900 
960 



M7 DNA sequence 
Gene name> 
Unigene nunibert 
Probeset Accession 8t 
Nucleic Acid Accession fit 
Coding sequence: 



ESTs 

Hs. 293102 

AI951118, AVI170035, AL044691. AI908272 
none found 

1-489 (entire sequence is an open- reading -frame) 



1 
I 

CAAAAAGAAA 
CrOAAGGCTC 
ATGCAAACTT 
AT6CAAAAGT 
GATOIQATGT 
GAQAGTCTCC 
AAAGAAATGO 
GATACAGTTC 
TCAGGAAAA 



11 
I 

TAOATAAAAT 
CCTGCAGAAT 
TCAAAQCAQA 
CTGTTCCAAA 
TGOCTTCAGA 
GTQAGACTGT 
ATAAAATAA8 
ATTCTTGTGA 



21 
I 

AAATG6A2VAA 
GAAAGTTTCT 
GCCTCCOGAG 
TAAAQCCTTG 
ATCAAAACAA 
TTCAC3U3AAG 
TGOAAAATTA 
AAOAACAAGO 



31 
I 

TTAGAAGAGT 
ATTCCAACTA 
AAGCCATCTG 
GAATTGAAGA 
AAGAASGTTO 
OATOTGTGIO 
GAAGATTCAA 
OAACTTCAAA 



41 
I 

CTCCT6ATAA 
AAGCCTTAQA 
CCTTOGAGCC 
ATQAACAAAC 
AAQAAAATTC 
TACCCAABGC 
CTAGCCTATC 
AAGACCCCTG 



51 

I 

TGATGGTTTT 
ATTGATGGAC 
TGCCATTGAA 
ATTG AGAGCA 
TTGGGATTCT 
TACAOITCAA 
AAAAATCTTQ 
TGACOCAGGT 



60 
120 
180 
240 
300 
360 
420 
480 



A18 Proteip sequence 
Gene name: 
Unigene number: 
Probeset Accession fti 
Protein Accession #: 
Signal sequence i 
Transmenbrane domains t 
Cellular Localization t 



ESTs 

KS. 293102 

AI951116, AW17003S« 

none found 

none 



AL044e91, AI908272 



nuclear 



1 XI 21 31 41 51 

I I I I I } 

QKEIDKXNGK LEESPDNDGF LKAPCRMKVS IPTKALELMD MQTPKAEPPE KPSAFEPAIE 
NQKSVFNKAL ELKNEQTI.RA DQMFPSESKQ KXVBEHSWDS BSLRETVSQK DVCVPKATHQ 
KEHDKISGKL BDST8LSKXL OTVH8CERTR BLQKDPCDPR S6K 



60 
120 



Breast* 

A19 DMA SEQUENCE 



Unigene n\traber< 
Probeset Accession fft 
Nucleic Acid Accession 8: 
Coding sequence i 



TMPR8S3a 
Ha. 2 98241 
AIS36613 
AB038157 

302-1566 (underlined sequences correspond to start and stop codons) 



AO0GG6CACC 
OQAAAGGGCT 
CCRTCTACAT 
AGAGGTCCTG 
TCATTOOGAT 
6ATGCIGTT6 
ATTGGGATCA 
TCAQGGAAOT 
GGAGTCTOGG 
AATGCOSTGC 
AAGGOTCACT 
QATAAGCTCA 
CACCTCTTGC 
TQTGCCTCTG 
AGCTCA06CA 
CTTCAGTTCC 
ACTOCTGCAC 
CTAGTTTCCC 
A6CAAGTACA 
CTCAOQTTCA 
QATGGAAAAG 
TCOOCIOTCC 
QAOGTGTACG 



11 
1 

GQAiOSOCTOG 
GTOTTTATG6 
TTTTQGGACT 
AAATAGTCAC 
CGCTTTTTGG 
CTGCACAOAT 
TTGCATTGAT 
ACAGATGTCG 
ATTGCAAAGA 
TCCAQGTGTT 
AOSCAAAfGT 
GAGTQAGCTC 
CAQATGACAA 
GCCAOGTGGT 
T0QTGGGT60 
AGQGCTAOCA 
ACTGTGTTTA 
TQTTGGACAA 
AGCCAAAGAG 
ATQAAAT6AT 
TGTGCTGGAC 
TGAACCAOQC 
GTGGCATCAT 



21 
I 

OGTACTTTCG 
6AA6CCA6TA 
CGGGAATTAT 
CATGQ GGGAA 
CCTTGATGAT 
CCTOTCACTG 
ATTAGCACTG 
CTCATCCTTT 
OGGGGAGGAC 
CACA6CTGCT 
TGCCTGTGCC 
GCTGGAGG6G 
GGT6ACTGCA 
TACCTTGCAO 
AAACATGTCC 
CCTQTGCQQG 
TGACTTGTAC 
TCCAOCCCCA 
GCTGGGCAAT 
CCAGCCTQTO 
GTCAOGATGG 
GGCGGTOCCT 
CTOOOCCTC C 



31 

I 

TTCTTAATTA 
ACACTGTGGC 
GAGGTAGAGG 
AATGATCCGC 
TTGAAAATAA 
CTGCCATTGA 
GCCATTQOTC 
AAGTOTATOG 
GAGTACOGCT 
TGGTGGAA6A 
CAACTGGGTT 
CAOTTCOGGG 
TTACACCACT 
TGCACAGCCT 
TTQCTCTOGC 
GGCTCTGTCA 
CTCCCCAAGT 
TCCCACTTGG 
OACATCGCCC 
TGCCTGCCCA 
GGGGCCACAG 
TTQATTTCCA 
ATQCTCT60Q 



41 

I 

GGTCATGCCC 
CTACTATCTC 
TGGAGGCGGA 
CTGCTGTTGA 
GTCCTGTTGC 
AGTTTTTTCC 
T6GGCATCCA 
AGCTQATAGC 
GT6TC0GGGT 
CCATGTGCTC 
TCCCAAGCTA 
AGGAOTTTGr 
CAGTATATGT 
GTGGTCATA6 
AGTGGCCCTG 
TCACGCCCCT 
CATGGACCAT 
TGGAiGAAGAT 
TTATGAAGCT 
ACTCTGAA6A 
AGQATGQAGC 
ACAAGATCTG 
CGGGCTACCr 



51 
I 

(tTGTGAGCCA 
TTCOGTGGTG 
GCOGGATOTC 
A6CC0CCTTC 
ACCAGATGCA 
AATCATOGTC 
CTTOQACTGC 
TGQAT07GAC 
GGGTQGTCAG 
CQATQACTGG 
TGTGAGTTCA 
GTCCATCGAT 
G ABgQM GG^ 
AAGQG6CTAC 
GCAGQCCAGC 
GTGQATCATC 
CCAQGTGGGT 
TGTCTAOCAC 
GGGOGGGCCft 
GAACTTCCCC 
AGGTGAOQCC 
CAACCACAOG 
OACGGGTG6C 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



837 



wo 03/042661 



PCT/US02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 

50 
55 
60 
65 
70 
75 
80 



OTtSQACAGCT 
AAGTTAOTGG 
TACAGC06TG 
ACCTGAAGAQ 
TCCTCOCCTO 
TTCOGGOICC 
CAAGCTGCTT 
AOTQCAGTCSG 
TTGCCTCAGC 
TTGTATTTTT 
CCTCAAAT6A 
AOOCCTAGOC 
AGGQOGGCCT 
ACGAGATAAO 
OCACCAGCCX: 
CCAAACCCAC 
ATGACTOOTT 

rmTCcmg 

AAAAAAAA 



GCCAGG6GCA 
GA6CQACCAG 
TCACCTCCTT 
GAAGGGQACA 
GACTCCCOTO 
AGTAGCAGGC 

■ rrmriTm ' 

CQAAATCCCT 
TTCCCCAGTA 
AGTAGAGACA 
TGTGCCTGCT 
TCACX3CTCCT 
TTCCCACTGQ 
CAGTTATQTG 
AOAAOTGCAO 
CCTTTCTACT 
TAAGGCCTAT 
ATTCCAAATA 



CAGOGGGGGG 
CTTTGGCATC 
CCT0QACT60 
AGTAGCCACC 
TAGGAACCTG 
CCGAAA(3AG0 
GTTTTmGA 
GCTGACTQCA 
GCTGGOACCA 
QGGTTTCACC 
TCA6CCTCCC 
TTCT6ATCTT 
TOCATCTGOT 
ACCTCAC6TG 
AACTOCAGTC 
TOCAAGACTT 
TTTdTGATT 
ATAT6TTTCC 



CCCCTGGTGT 
G6CTGCX3CAG 
ATCCACGA6C 
TGAGTTCCTG 
CACACGAGCA 
CACCCTTCCA 
GGTGQAGTCT 
OC C'lCU iC XT 
CAQGTGCCOQ 
ATGTTGGCCA 
ACA6T6CTGG 
CACTAAGAAC 
TTTCTCTCCA 
CAAAGCCACC 
ACTOCACOTT 
ATTTTCACAT 
TCTTTGTAGC 
TTCCCTCAAA 



GTCAAGAGAG 
AGGTGAACftA 
AGATQOAGAG 
A06TQATGAA 
GACACCCTTG 
TCTGATTCCA 
OGCTCTGTTG 
CCCTGGTTCA 
CCACCACACC 
GQCTGCTCTC 
6ATTACAG6C 
AAAAQAAGCA 
66G6TCTTGC 
AACAGCCACT 
TTCATCTCTA 
GTGGGGAGQT 
ATTTGGTGCT 
AAAAAAAAAA 



GAOQCT GTGG 
GCCTGGG6T6 
AGACCTAAAA 
GACAGCCGQA 
QAGCTCTOAO 
GCACAACCTT 
CCCAGGCT6G 
AGOGATTCTC 
CAACTAATTT 
AAACCCCTGA 
ATGGGCCACX; 
GCAACTTGCA 
AAAATTCCTG 
CAGAAAA6AC 
GQGACCAGAA 
TAATCTAGGA 
TQAGGTATTA 
AAA A AAAAAA 



1440 
ISOO 
1560 
1620 
16B0 
1740 
1800 
1860 
1920 
19B0 
2040 
2100 
2160 
2220 
22B0 
2340 
2400 
2460 



A20 Protein sequence i 
Gene name: 
Unlgene nunibert 
Probeset Accession 9: 
Protein Accession 8: 
Signal sequences 
Transmendorane domains s 
Tryp_8Pc domain: 
Cellular Localisations 



MGBNDPPAVE 
LALAIOLGXH 
TAA6HKTMC8 
VTALHKSVYV 
LCGGSVITFL 
LGNDIALMKL 
AVPIiXSNXlC 
F6XGCAEVNK 



11 

I 

APF6FRSLFG 
FDCS6RYRCR 
DDHKGBYANV 
REGCASGHW 
WIITAAHCVY 
AGPLTFNQ4I 
NHRDVYGGII 
PGVYTRVTSP 



21 
I 

LDDLKISPVA 
SSFKCIELIA 
ACAQU3FPSY 
TliQCTAGGHR 
DLYLPKSWTI 
QPVCLFNSEE 
SPSMLCAGYL 
LDWZHBQMBR 



TMPRSS3a 

He. 298241 

AI538613 

BAB20077 

none found 

43-65, 239-261 

216-444 

not detexndned 



31 41 51 

) I I 

PDADAVAAQI LSUjPLKFFP IIVIGIIALI 60 

RCDGVSDCKD GEDEYRCVRV GGQNAVLQVP 120 

VSSDNIAVS8 LBOQFREBFV SIDHLLPDDK 180 

RGYSSRIVGG NMSLLSQWPM QAStQFQOyH 240 

QVGLVSLLDN PAPSHLVEKI VYBSKYKPKR 300 

NFPDGKVCWT SGWGATED6A GDASPVIJIUA 360 

TQGVDS0C2GD 8G6PLV0QBR RIiWKliVGATS 420 
DLKT 



ESTs; opposite Strand to TRPSl 



A21 DNA SEOOBWCB 
Gene names 
imigene number: 
Probeset Accession #: AA426090 
Nucleic Acid Accession tft AA428090 

Coding sequence: 1-S5B (underlined sequences correspond to start and atop codons) 



1 

I 

ATQARGCCCA 
CAAGTTGTTG 
TTTTCA6ACA 
GATTCTGTT6 
GGAAAAACAA 
CTATT6TTTA 
AAGTGTTTTG 
AAGG6CATTG 
GAAAAAGGGA 
CCCAAAAATG 
CAAGCAAATC 

GTTCTGCCTC 
TTTAACCAAA 
QGAATTOCAT 
TTTTTACACA 
CTGAATGTTT 
CTTTTTACAC 
ACAAOTTGAG 
GATTTCACCA 
TATATATATA 
AAA6TGAATC 
TATTCAAGTT 
CACCAAGAAC 
A06AAAAAAT 
ATCAGGACTA 
ATTTTCTGCT 
CCCCTCATGC 



11 

I 

GTTTGAAACA 
GAGAAGAAAT 
CAGAT6ATCT 
GGCTTCCTGG 
CTAACCTCAG 
AGCTCCAGGG 
TTCAGCTTGC 
CCATTATGAA 
AAAAAGAAAA 
AOAAATGAAG 
TTCTGAA6CA 
GTTAAAAACA 
ATTTCTCTTT 
CAAATAAGAT 
CATGTTCACA 
TCGTAACATA 
TTGAAATAAA 
AAAATATATC 
A6AGTCCAGG 
TTTCAATTCA 
TATATATATA 
CATATAAACC 
TA6ATQTAAC 
ATGTTTOTGA 
TTCATOTOTT 
CAAATTQAAT 
QGTQAATATT 
CTCCCCCAAA 



21 

I 

CTGQGCTQAT 
TCATCTACAA 
TATGTCCATA 
TAGAGCTGGA 
CCATTTGCCC 
GTCATACTAT 
AGTATTAAAA 
AOCCCCTCAA 
AAAAATAATA 
ATTGTATAAT 
ATCAATTATT 
TGCAG6TGAG 
CAAAGTCATC 
ATTTQATATA 
TGAGTATAOC 
ATCATTACCC 
CACATTTTTA 
AGTGAGAGAG 
CTTTATCAAT 
6CAGAGCCTG 
TATATTACAA 
ATACCAAOGC 
AGACATCTTT 
GTAGAAATGA 
GTAAGAACAO 
TCTTTTTCTT 
GCTOTAAGTT 
QTTTTCCATQ 



31 
I 

ATAAAAAT6T 
GTAGTATCTG 
GTGGAGGAAT 
GACAGACTGC 
AATTTCCACC 
GGCAATCTAC 
CAAAAAAAAC 
AGACTCTCTO 
ATTAGAA06A 
GA0GGGA6AT 
TATATACTTT 
TCTTGCCAAC 
TTTCAGGGAA 
TTAATTTAAA 
GAATTCAAAO 
AATACTCGAC 
TQTTATCTCT 
TGTTTGTTTA 
ATAAGTAATA 
TATATATATA 
TGATCTGTAT 
CGTTATGTGT 
GCTGCCT6AA 
ACATGCACTA 
AACTATTATA 
AGCAACATGA 
AAATTTTACA 
TGGTTGTCAA 




CTTGCCCTGA 
CTTTTTGAGA 
TTAAACTTCA 
ACTCAATATT 
CT6GAGAAAG 
AGAAAAAAAA 
ATTTTTTAGA 
TATATATATA 
TTCCTATTGC 
AACTGOTGOT 
GATTGTTTGC 
T6AAAACAAA 
GCCAACATTC 
AATCATTCCA 
TTGGCATTTT 
ATAGTC06C 



51 

I 

AGACCAAAGQ 
TGTAGAAAAC 
CTACAACOQT 
AAAACCCA6A 
TCAGAGTTCA 
TATTCT6AGC 
GCTGCAAAGC 
AACATGGAAA 
AATTTGGGCT 
TTAAACTCTC 
TTTTTGTATT 
ATCTGGATCT 
TTAATTTGAT 
TQATTGATTA 
TAAGCAGGAG 
TGATACTCAA 
TAGTATATAT 
TCAAAGCAGA 
ATGGTGATTT 
TATATATATA 
TAGAAGGATG 
AAAACTTTAT 
ATAAGAAATA 
ATAAAATAAA 
TAGTATTCAA 
TATOAAAGAC 
6AGAT6TTCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



A22 Protein sequence: 



838 



wo 03/042661 



PCTAJS02/36810 



5 

10 
15 

20 

25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



Gene naotei 
unigene nunibers 
Probeset Accession 8: 
Protein Accession 
Signal sequence i 
Transmembrane dootainsi 
Cellular X^ocalizationt 



BSTs; Opposite strand to TRPSl 

none 

AA428090 

none found 

none found 

113-129 

not determined 



1 U 21 31 41 51 

11)1)1 
MKPSLRHMAD IRMFSBIOQR QWGBBZilLQ WSVSYLVEN FSDTDDLHSZ VBBSDSCXMR 
DSVGLPGRAO DRLQPKTKPR GKTTNLSHIiP NFHLASYQSS LLFKLOGSYY GNLLVBPILS 
KCFVQLAVLK QKKHCLQLQS K6IAIMRAPQ RLSAIFKTNK BKQKXEKKXZ ZRRXCSLIWA 
PKNEK 



60 
120 
180 



A23 DMA SEQUENCE 
Gene namet 
Unigene number t 
Probeset Accession #: 
Nucleic Acid Accession fti 



ESTs 
Hs. 29383 
AW207206 
AL133619 

Coding sequence! 1>2070 (underlined sequences correspond to start and stop codona) 



ATGAGOGGTG 
CGGCGCCGGC 
CTCAGQCAGA 
CAGCAGCAGC 
GAAAACAAGO 
ACACTGCCQC 
6GG6GAACAC 
CCTGTATGCC 
AGGCOTGGCT 
GQGCCTGAGQ 
CCAAOZAQAO 
CCTCAGATTQ 
ATGCTGGGGG 
GCAGCAACCA 
GCACTTGCCC 
GCTCACTTCC 
TGOAGCCAGC 
GACATaOAOA 
CTGTTCTGGG 
GACAOQACAC 
CCCTCCTSCT 
GGCQCTOQCr 
AGGCTGAAGO 
GGCGGTAGOQ 
TCTOTCAAOT 
TCCTTCAACA 
CCCCTACTTC 
GAGAAAGCAG 
AGGCAGATGO 
ACCACACTTA 
ACCCAAGA6C 
CCGGAGGAAG 
AAGAGCCTCT 
CTQAAOCAGA 
AAACG6C6CC 



11 
) 

OGGGGGTGGC 
GCCAGCGCCC 
GCGACCCGCA 
ACTCGQAGAT 
GTGAGCOSGC 
TCCOGCAQCA 
AGGACGGGGA 
AACCCAGTGG 
GGACGATGTT 
TCATTGCAGG 
CTQAAATGGO 
CTGCTGTGGC 
CCCAGGGGAT 
TGGGGA CAAA 
ATCCT6ACA6 
CATTATCT7T 
CTGGQAACAT 
AQGOGGTTQA 
CAAAGTGTGG 
G6QAA0AGGC 
TTCCAGATGO 
GGGTCTGCAT 
AGGGCTCCTC 
OCGACACTOT 
CCATCTCTAA 
AGCAAGATTC 
ACAACA6CAA 
AGGCCTCTAA 
GGGGG6GGGC 
GGCAGTaOQA 
T6066CACCT 
CTAGCTTTCC 
CCAAGAAATG 
CCCOGAAGAA 
TGCATOGCTC 



21 

) 

GQCTGGGAGQ 
CTCTGTGGGC 
GAAACGGAAC 
GCTGQCCAAG 
GCGGGGCCCT 
CAGAAACACA 
GCCCCTCCAG 
GTACAGGTTC 
ATGCAGCCAA 
GCGGCAGGTO 
AAGOAAOCCC 
CAGGC0CA6G 
ATGGACACAC 
GGGAGGAAOC 
OQGCCCCCAC 
GGGGCTGGGO 
CGCAGCTGGG 
GGGAGGGCCC 
C0CAAG70Q0 
CATGCTTTOC 
CCCCTCAGQA 
CAACGGAGTG 
AOGGACACAC 
GOGCTCTCCT 
TTCAOCaVAC 
AAAAGCTGAC 
6CTGGACAAA 
TGCAGGAGCT 
ACACCCCCCA 
AOTGCTCATC 
CAAGTCCCTC 
CAGGGACCAA 
CCTGAQCCCA 
CAACTTTGCC 
AOTGCTTTGA 



31 
I 

CGGCOCCCCA 
GTCCAGTCCT 
CTGGACCTGG 
CTCCAT6AGG 
AGGGOGQCCC 
GCCATOSACT 
ACTQTCCTTG 
TGGGGGACCT 
GCACAGCAC6 
GCCACAQGGT 
TGGGACAGCC 
ATTTCCAGCC 
TCCATCC3iGG 
AGA6T0CTGT 
CCAGCCCA60 
CTQACATCAO 
GCAOTOCCTA 
TTCCCTAGCC 
CA0CCCCA6C 
CTCGGGACCT 
AACCAOCTTT 
TGGGTAGA6C 
AGGCCAGGAG 
OCAGACAGCC 
TCTCAAOOCA 
OTCTCCCAIQA 
GTTOCTGGGG 
GCCTGTATGG 
ATGATCCTQC 
CGCC3A6CTOT 
CTGGAAGGGA 
GAAGCOVOGC 
CCrOTGGOGG 
GAQAGGCAGA 



41 
I 

GCT06CC6AC 
TGAGGCG6CA 
AGAAAAGCCT 
AGAT06AGCA 
TGGCTCOCCA 
OCAGCACACG 
CCCACCTGGC 
GGACAGATGC 
TGCTGCTCTC 
GCTCOCCAiQA 
CCT6GCCT6C 
CTATGGCTCT 
GATCCCTTCC 
TTCCTTQCCA 
ATCCTOGQCT 
GAGGACATCT 
GGGCTCTCCC 
GCTGTGGCAA 
CCTGCAGTGC 
GCTOTTCCAT 
CCAGG6CCTC 
OOGGAGGACC 
GCAAGOGTGG 
TCTCCATGTC 
AGGCCAGGOC 
AGGG6GACCT 
TACAAGGGCA 
GGAACAGCCA 
CCCTTCCCCT 
06AATACCAA 
GCCAOAGGCC 
ATTTCCOCAA 
AGCX3T0CCAT 
AGAGGCTGCA 



51 
) 

CCOQGGCTCT 
GAGCCCGCAG 
GCAOTTCCTG 
TCTGAAG06G 
GOCACACTCA 
CCTGGGCTCA 
TGCACTGGCC 
CQCTACCTCT 
GGGAAGCCCA 
CCTCC CTOCT 
TAGATCTTTG 
GAGTCCTCAC 
TGCCATCTGG 
CTTGTCCAAG 
GTGQTCTCAA 
GACTGGTGGA 
TTCCCAGGGA 
CTCCAGTGAG 
TGGGGAGGCT 
GTOTCCCAAG 
TGCTCCCTTG 
CAGCCCTGCC 
GCGTCTTGOG 
AAGCTTCCAG 
CCAOCCOGQC 
GGAAGAGGAG 
GGCCAGAAAG 
GCACCAGGGC 
G08AAAGCCC 
OCTOCIGCAO 
CCA6GCAGCC 
GGTCTCCACC 
CCT6CC0GCA 
GGCAATGCAO 



A24 Protein sequence; 
Gene namei 
Unigene nuinbert 
Probeset Accession #s 
Protein Accession 9t 
Signal sequence t 
Transmembrane domains i 
Cellular Localization! 



ESTs 
Hs. 29383 
AH207206 
T434S7 
none found 
303-322 

not determined 



1 
I 

MSGAGVAAGT 
CXKIHSEMLAX 
GGTQDGBPLQ 
GPEVIAGRQV 
MLGAQGIWTH 
AHFPL8LGLG 
LFHAKOGPSR 
GARHVCINGV 
SVKSISNSAN 



11 
I 

RPPSSPTPGS 
LHEEZBHUCR 
TVLABLAAIiA 
ATGC8PDLPP 
SIQGSLPAIH 
LTSGGHLTGG 
QPQPCSA6DA 
WVBPGGPSPA 
SQGKARPQPG 



21 
I 

RRRRQRPSV6 



PVCQFSGYRF 
PSRAEHGRNP 
AATMGTKGGS 
HSQPGHIAAG 
DRTRBBAML8 
RLXEOSSRTR 
SFNKQDSKAD 



31 
I 

VQSLRPQSPQ 
RPALPFQAHS 
HGTWTDAATS 
WDSPCPARSL 
RVLFPCHLSK 
AVPRALP8QG 
XiGTCCSMCPK 
RPGGXRGRIiA 



41 

I 

LRQSDPQKRN 
TLPLPQKRNT 
SRGirmLCSQ 
PQIAAVARPR 
ALPHPDSGPH 
DMEKGVEGGP 
PSCFFDOPSO 
GGSAOTVRSP 
PUOISKLDK 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
IBOO 
1860 
1920 
1980 
2040 



SI 




1 

LDLERSLQPL 


60 


AZNSSTRLGS 


120 


AQBVLLSGSP 


160 


ISSPMALSPH 


240 


PAQDPGLHSQ 


300 


PP6R0GN8SE 


360 


MBLSRASAPL 


420 


ADSLSMSSFQ 


480 


VPGVQGQARK 


540 



839 



wo 03/042661 



PCTAJS02/36810 



EKABASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPXiRXP TTUtQCEVLI RSLWMTNIJdQ 600 
TQELRHX«KSL LBGSQRPQAA PBEA5PPRDQ BATHFPiCVST XSLSKKCLSP PVABRAZLPA 660 
LKQTPKNNFA ERQKRLQAKQ KRRLRRSVL 



A25 DNA SEQUEWCE 
Gene name I 
unigene nuniben 
Probeset Accession ft: 
Nucleic Acid Accession #: 
Coding sequence t 



1 

I 

GGAGGCTGAA 
CTCCCTCTTT 
ATGTCCTCGT 
TAAAGAACTC 
GTGAACTTCT 
CATCTGCAAC 
TACCTCCTGG 
GCTGGTGGAG 
QGGAAGGAGA 
GOCACTTTGG 
CTAACCAOAT 
AGCCAGACCC 
ACCTGTGGAT 
TCCTGTATGA 
8GCAGCAAAC 
TT06CT0CAA 
TACCTAGTGA 
CTOTCATCTQ 
GCATCTTTCC 
AGOGCAAGTC 
ACTATGAGGA 
TGTCAGTCCA 
ACACTGACTC 
AACCCCAG6C 
CTOAAACAAC 
TTCAT6CT6G 
CCAGATCCTC 
CACOGGCCAC 
AGTCTAGAGA 
CTQAOCAGGA 
CCTTQQATTA 
AACAGAGAOA 
ATX3CCAAGGT 
CTAAAAAOQT 
ATCAAGCTQA 

TGGOTGorrr 

ACTAATGQAA 
AATGCTCAAG 
TTTTTAACCA 
AACTGTGATT 
AAAAGCACAC 
GCTTTCATGA 
AAGATGACAA 
AACTGCATAA 
AAAGAAAGAA 
TTGCTQATAT 



BSTs; prolactin receptor 

Hs. 25252 

AA057193 

NM_000949 

2e5>2153 (underlined sequences correspond to start and stop codons) 




A26 Protein sequence t 
Oene nanei 
unigene nuntben 
Probeset Accession #t 
Protein Accession #: 
Signal sequence; 
Transtnerabrane domains t 
Fm domains t 
Cellular Uocallzationi 



ESTs; prolactin receptor 

Hs. 25252 

AA057193 

NP_000940 

1-23 

237-253 

28-112« 127-215 
plasma membrane 



1 11 21 31 41 51 

) I 1 I t I 

MKENVASATV PTLLLPLNTC LUJGQLPPGK PBIFRCRSPN KETFTCWWRP GTDGGLPTJTY 60 

9LTYHREGET UIHECPDYIT GGPNSCHPGK QYTSMWRTYI MMVNATNQMG SSPSDBLYVD 120 

VTYIVQPDPP LBIiAVEVKQP EDRKPYLWIK WSPPTLIDLK TGHFTUiYBI RLKPBKAAEW 180 

BIHPAOQQTB FKILSLHPGQ KYLVOVRCXP XTKOYWSAHSP ATFIQIPSDF TKNDTTVWI8 240 

VAVLSAVXCL XIVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLBKOKSB BLLSAL60QD 300 

PPPT8DYEDL LVSYLBVDDS BDQHLKSVHS KEKPSQGMKP mJ^mDSQ RGSCD8PSLL 360 

SBRCBEPQAN PSTPyDPEVX EKPBNPBTTH TNDPQC18NB GKIPYPKAGG SKCSTWPLPQ 420 

840 



wo 03/042661 



PCTAJS02/36810 



PSQ HNPR SSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQRBVB 480 

SPHSBTDQDT PWLLPQEKTP PQSAKPLDYV BIHKVNKDGA LSLLPKQRKN SGKPKKPQTP S40 

QINKEYAKVS GVMDNNILVL VPDPHAKNVA CFEBSAKEAP PSLEQNOABK ALANFTATSS 600 
KCRLQLGGU) YLDPACPTHS PH 



A27 DNA SBQUEWCB 

Gene nantet Human neuropeptide Y receptor Yl (NFYYl) 



Unigene nuniber: 
Probeset Accession fii 
Nucleic Acid Accession §: 
coding sequences 



Bs. 169266 
L07615 
NM_000 909.1 

209-1363 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 I 1 t I I 

CATTCCCACC CTTCCTTCTT TAATAAOCAO GAGOGAAAAA GACAAATTCC AAAQAGOATT 60 

GTTCAGTTCA A660AATGAA 6AATTCAGAA TAATTTTGGT AAATGGATTC CAATATCGGG 120 

AATAAGAATA AOCTQAAGAO TTQACCTGCT TTGAAOAAAC ATACTOTCCA TTT6TCTAAA IBO 

ATAATCTATA ACAACCAAAC CAATCAA AAT GA ATTCAACA TTATTTTCCC AGGTTGAAAA 240 

TCATTCAGTC CACTCTAATT TCTCAQAQAA QAATGCCCAG CTTCTGQCTT TTGAAAATGA 300 

TQATT6TCAT CTGCCCTTCG CCATGATATT TACCTTAGCT CTTGCTTATG OAGCrGTOAT 360 

CATTCTXGGT GTCTCTGQAA ACCTG6CCTT GATCATAATC ATCTTGAAAC AAAAG6AGAT 420 

GAGAAATGTT AOCAACATGC TGATTOXOAA OCTTTCCTTC TCAGACTT6C TTGTTGCCAT 4 BO 

CATOTOTCTC CCCTTTACAT TTGTCTACAC ATTAATGQAC CACTGGGTCT TTGGTGAGGC 540 

GATGTGTAAO TTGAATCCTT TTGTGCAATG TOTTTGAATC ACTOTGTCCA TTTTCTCTCT 600 

GGTTCTCATT GCTGTGGAAC GACATCAGCT GATAATCAAC CCTCX3A(3GGT GGAGACCAAA 660 

TAATAGACA7 GCTTATGTAa GTATTGCTGT QATTTGGGTC CTTGCTGTGG C T TCri ' CTTl ' 720 

60CTTTCCTG ATCTACCAAG TAATOACTGA TCAGCCOTTC CAAAATGTAA CACTTGATGC 780 

GTACAAAGAC AAATA06TGT GCTTTGATCA ATTTCCATOS QACTCTCATA GGTT6TCTTA 840 

TACCACTCTC CTCTTGGTGC TGCAGTATTT TGGTCCACTT TQTTTTATAT TTATTTGCTA 900 

CTTCAAGATA TATATACGCC TAAAAAGGAO AAACAACATG ATGGACAAGA TGAOAOACAA 960 

TAAGTACAGG TCCAQTGAAA CCAAAAGAAT CAATATCATG CTGCTCTCCA TTOTGGTAGC 1020 

ATTTOCAGTC TOCTGGCTOC CTCTTACCAT CTTTAACACT GTGTTTGATT GGAATCATCA 1080 

GATCATTGCT ACCTOCAACC ACAATCTGTT ATTCCTOCTC TGOCACCTCA CAGCAATQAT 1140 

ATCCACTTGT GTCAACCCCA TATTTTATGG GTTCCTGAAC AAAAACTTCC AGAGAOACTT 1200 

OCAGTTCTTC TTCAACTTTT GTGATTTCCO OTCTOGGGAT GATGATTATG AAACAATAGC 1260 

CATGT OCACG ATGCACACAG ATGTTTCCAA AACTTCTTTG AAGCAAGCAA QCCCAGTCGC 1320 

ATTTAAAAAA ATCAACAACA ATGATGATAA TQAAAAAATC TGAAACTACT TATAGCCTAT 1380 

G6TCCCX3GAT GACATCTGTT TAAAAACAAO CACAACCTGC AACATACTTT OATTACCTGT 1440 

TCTCCCAAGG AATGGGGTTG AAATCATTTG AAAATGACTA AGATTTTCTT GTCTTGCTTT 1500 

TTACTGCTTT TGTTQTAGTT OTCATAATTA CATTTGGAAC AAAAGGTX3TQ GGCTTTGGGG 1560 

TCrrCTGQAA ATAGTTTTGA CCAGACATCT TTGAAGTGCT TTTTaTQAAT TTATGCATAT 1620 

AATATAAAGA CTTTTATACT GTACTTATTG GAATGAAATT TCTTTAAAGT ATTA06AT6C 1680 

6CTGACTTCA GAAGTACCTG CCATCCAATA OGGTCATTAG ATTOGOTCAT CrTGATTAGA 1740 

TTAOATTAGA TTAGATTGTC AACAGATTGQ GCXATCCTTA CTTTATQATA GGCATCATTT 1800 

TAQTQTGTTA CAATAQTAAC AOTATQCAAA AGCAGCATTC AGGAGCX3SAA AGATAGTCTT 1860 

OAAGTCATTC AGAAGTGGTT TGAGQTTTCT QTTTTTTGGT GGTTTTTGTT T G Tr mTiT 1920 

TTTTTTCACC TTAAOGGAGG CTTTCATTTC CTCCXX3ACTG ATTGTCACTT AAATCAAAAT 1980 

TTAAAAATGA ATAAAAAGAC ATACTTCTCA GCTGCAAATA TTATGGAGAA TTGGGCACCC 2040 

ACAGGAATGA AGAGAGAAAG CAGCTCCXXZA ACTTCAAAAC CATTTTGGTA CCTGACAACA 2100 

AGAOCATTTT AGAGTAATTA ATTTAATAAA GTAAATTAGT ATTQCTGCAA ATAGCTAAAT 2160 

TATATTTATT TGAATTGATG GTCAAQAGAT TTTCCATTTT TTTTACAGAC TGTTCAGTGT 2220 

TTGTCAAGCT TCTGGTCTAA TATGTACTaS AAAGACTTTC OGCTTACAAT TTGTAGAAAC 2280 

ACAAAXATOO TTTTCCATAC AGCAGTGCCT ATATAGTGAC TGATTTTAAC TTTCAATGTC 2340 

CATCTTTCAA AGGAAGTAAC ACCAA6GTAC AATGTTAAAG QAATATTGAC TTTACCTA6C 2400 

AGGGAAAAAT ACACAAAAAC TGCAGATACT TCATATAGCC CATTTTAACT TGTATAAACT 2460 

GTSTGACTTO TGGOSTCTTA TAAATAATGC ACTQTAAAGA TTACTGAATA GTTGTGTCAT 2520 

GTTAATGTGC CTAATTTCAT GTATCTTGTA ATCATGATTG AGCCTCAGAA TCaTTTGOAG 2580 

A AACTA TATT TTAAAGAACA AGACATACTT CAATGTATTA TACAGATAAA GTATTACATG 2640 

TGrXTGATTT TAAAAGGG06 GACATTTTAT TAAAATCAAT ATTGTTTTTO CTTTTTCT8A 2700 
GGAOTCTCrr TCAGTTTCAT TTTTTCTCAT OCCATGACTT GCCT0C6ATG GT 



A28 Protein sequence i 
Gene natnei 
Ubigene number: 
Probeset Accession #< 
Protein Accession #« 
Signal sequence: 
Pfam domains: 
Transmembrane domains t 

Cellular Localizations 



Human neuropeptide Y receptor Yl (NPYYl) nOtNA, exon 2-3 and conplete cds 
Hs. 169266 

L07615 
NP_000900.1 
none found 
7tm 1 CS7-91) 

39-61« 77-99« 118-139, 157-179, 212-234, 264-286, 
300-322 

plasma roenibrane 



1 11 21 31 41 51 

I t 1 I i I 

MirSTLPSOVB NHSVHSNPSS KHAQLLAFSN DOCHLPLAMI PTLAIiAYGAV IILGVSGNLA 60 

LIIIILKQKE MRNVTOILIV NLSFSDLLVA IMCLPPTFVY TLMDHWVP6E AMCKLNPPVQ 120 

CVSITVSIPS LVLIAVERHQ LZINPRGWRP NNRHAYVGIA VIWVLAVASS LPFLIYQVMT 180 

DEPFQNVTLD AYKDKYVCPD QPPSDSHRLS YTTLLLVLQY PGPLCPIPIC YFKIYIRLKR 240 

RHNMMDKMRD NKYRSSBTKR INIHIiIiSIW APAVCWLPLT XFNTVFDWIR QIZATCNHNL 300 

LFLLOILTAM I8TCVNPIFY GPUfKHPQRD LQPPFNFCDF RSSDDDYETI AMSTMHTDVS 360 

841 



wo 03/042661 



PCTAJS02/36810 



KTSLKQASPV AFKKINNNDD NBRI 



A29 DMA SEQUBMCE 

Gene name: Homo eapiena G protein > coupled receptor (H0T7T17S) 
Utilgene number: Hs. 30822 9 

Probeset Accession Aiai9198 
Nucleic Acid Accession #t NM_033551.1 

coding sequence I 1-1197 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I i I ) I I 

ATOCA CACC G TTCCTACGTC CXK3ACCCAAC GOGTCCTGGG GGQCACOGGC CAAOGCCTCC 60 

GGCTGCCCGQ GCTGTGGCGC CAAOGCCTCG QACGGCCCAG TCCCTTOQCC GCGGQCCOTG 120 

GA06CCTQ0C TOGTGCCGCT CTTCTTCXK3G GCGCTQATGC TGCTGGGCCT GGTGGGQAAC 180 

TCGCTGGTC3V TCTACGTCAT CTGCOOCCAC AAGCOGATGC GGACCGTQAC CAACTTCTAC 240 

ATC6CCAACC TGGCGGCCAC GGACGTGACC TTCCTCCTGT GCTGCGTCCC CTTCACGGCC 300 

CTGCTGTACC CGCTGCCCGG CTGOOTQCTG GGOQACTTCA TGTGGAAOTT COTCAACTAC 360 

ATCCA6CAGG TCTOSGTGCA GGCCAOGTGT GCCACTCTGA COGCCATGAG TGTGGACX3GC 420 

TGGTAOGTGA CX3GTGTTCCC 0TTGCGCX3CC CTGCACOGCC GCAOGCCCCG CCTGG06CTG 480 

GCTGTCAGCC TCAGCATCTG GGTAGGCTCT GCGGCGGTGT CTGCOCOQGT GCTOGCOCTG 540 

CACXaSCCTOT CACCCX5GGCC GCGCGCCTAC TGC3W3TGAGG CCTTCCCCAG C0 a C 3 G O CC ro 600 

6AGC6CXKXT TCGCACTGTA CAACCTQCTG QCXJCTGTACX: TGCTGCOGCT QCTOGCCACC 660 

TGCX5CCTGCT ATGCGGCCAT GCTGC6CCAC CTGGQCOSGG T0GCC3GTG0G CXXCGOGCCC 720 

GC0GATA60G CCCTGCAOGG GCAGGTGCTG GCAGA00GCX3 CAGGCGCOGT G0GGGCCAA6 780 

GTCTCGCGGC TGGTGGCQGC CGTGGTCCTG CTCTTCGCCG CCTGCTGGGG CCCCATCCAG 840 

CTQTTCCTGQ TGCTGCAGGC GCTGGQCCCC GOGGGCTCCT GGCACCCACX3 CAGCTACXKX: 900 

GCCTACGCGC TTAAGACCTG GGCTCACTGC ATGTCCTACA GCMiCTCOSC GCTQAACCOQ 960 

CTGCTCTAOQ CCTTCCTGGG CTCGCACTTC COACAGGCCT TC0GCCGCX3T CTGCCCCTQC 1020 

QCX3CCGCGCC 6CCCCC0CCG CCCCCX3CCG0 CCXXJOACCCT CGGACCCOGC A6CCCCACAC 1080 

QOQQAOCTGC ACGGCCIGGO 6TCCCACCCG GCOCOOGCCA GGGCQCAGAA GCCAGGQAGC 1140 
AGT6GGCTG0 C0G08CGG66 OCTGTGOGTC CTGGG6GAGG AGAAOBCCGC TCT CTGA 

A30 Protein sequence 

Gene name: Homo sapiens G protein- coupled receptor (HOT7T175) , roRNA 

Unigene number i Hs. 2 0822 9 

Protein Accession #: AI819198 

Signal sequence: none £ound 

Pfam domains: ''tm^l [59-323] 

Transmembrane domains: 43-65, 86-108, 122-144, 159-181, 203-225, 260-282 

cellular Localization! plasma membrane 



1 11 21 31 41 51 

I I I I I I 

MHTVAT8GPK ASWGAPANAS GCP6CGANA8 D6FVP8PHAV BAWLVPZiFFA ALMLLQLVGN 60 

SLVIYVICRH KPMRTVTOpy lANLAATDVT FLLCCVPFTA LLYPLPGWVL GDPMCKPVNY 120 

IQQVSVQATC ATLTAMSVDR WYVTVFPLRA UiRRTPRliAL AVSLSIMVGS AAVSAPVLAL 180 

HRLSPGPRAY CSEAPPSRAL ERAFALYNLL ALYLLPLLAT CACYAAMLRH LGRVAVRPAP 240 

ADSALQGQVIi AERA6AVRAK VSRLVAAWL LFAACHOPZQ ItPLVLQALOP AGSHUFRSYA 300 

AYALKTWAHC MSYSNSALNP UiYAPLGSBF RQAFRRVCPC APRSPRRPRR PGPSPPAAPK 360 
AELHRLGSHP APARAQKPGS SQLAAHOLCV U3EX»IAPXi 

A31 DMA SEQireWCE 

Gene namei CH22_FaQ8B8.706f cadherin, BSF LAG seven-pass G-type receptor 1, Clamlngo (Drosophila) homolog 
(CELSRl) 

unigene number: Hs. 252367 

Probeset Accession #i MM_014246 

Nucleic Acid Accession i}t NM_014246 

coding sequence I 1-9045 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 I I I I 

ATGSCGCCGC CGCGQCGGCC COTgCl' G CCC O WCIXSCT G C T0CI06CCGC CGCOGCOQGC 60 

CTGCOGGCGA TGGGGCT600 AOCQOCOQCC TGGGAGCCQC GO0TAOCO9O OOGGACCOGC 120 

GCCrrOGCCC TCCGGCCCGG CTGTACCTAC GCGGTGGGOQ CCGCTTGCAC GGCCCGGGOG 180 

C0G0GG6AGC TGCTGOAOGT GGGCCGOGAT GGGOGGCTGG CAOGACQTOG G0GCQTCTCX3 240 

QGCGCGGGGC GCCCGCTGCC GCTGCAAGTC OQCTTGGTGG CCGOCAGTGC CCGQAOGGCQ 300 

CTGAGCCGOC GCCTGOQGGC GGGCACGCAC CITCG0G6CT GOQQAOOOCQ TOOOCGGCTC 360 

TGCGGAACGG OTQGCCGGCT CTGG6GG6CQ CTCTGCTTCC COGTCCOOGG CGGCTGCGCG 430 

GCCGCGCAGC ATTCGQCQCT CX3CAGCTCCG AOCACCTTAC CCGCCr GC O Q CTGCCCGCOO 480 

CGCCCCAGGC CCCGCTGTCC CGGCCGTCCC ATCTGCCTGC CGCCGGGC6G CTCGGTCCGC 540 

CTGOGTCTGC TG7GCGCCCT GCGGCGCGCG GCTGGCGCCO TC0GGGTG6G ACTGGC6CTG 600 

QA6GC0GCCA C0QCG6GGAC GCCCTCCGCG TCGCO^TCCC CAT06CCG0C CCTGCOGCOQ 660 

AACTTGCCOO AAOCCOGGGC G6GGC0GG0Q OGAOGQGCCC GG0G6GGCAC GAGCG6CA0A 720 

GGGAGCCTGA A0TTTCC6AT GCCCAACTAC CAGGTGGCOT TQ TTTGAQAA CGAACOGGCS 780 

GGCACCCTCA TCCTCCAGCT GCACX3CGCAC TACACCAT06 AGG6CGAGGA 6GAGCG0GTG 640 

AGCTATTACA TGGAGQGGCT GTTCQACGAG CGCTCCCGGG GCTACTTCOO AATCQACTCT 900 

GCCAOGGGOO COGTQAGCAC GGACAGOGTA CTGGACOGCG AGACCAAGQA GACGCACGTC 960 

CTCAGGGTGA AAGCOOTGGA CTACAOTACG CCGCCGCGCT CG6CCACCAC CTACATCACT 1030 

GTCTTGGTCA AA6ACA0CAA OGACCACAGC COGGTCTTCG AGCA6T0GGA GTACCGC6AG 1080 

OQOQTGOGGQ AGAACCTGGA GQTOQGCTAC GAOOTGCTQA CCAT0060GC CA6GGACO0C 1140 
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GACTOGCCCA TCAAOGCCAA CTTGCGTTAC GGGGTQTTGG 6QGG0 G0 QTG GQAajTCTTC 1200 

CAGCTCAACG AGAGCTCTGG aSTGGTGAOC ACAOSGGGGO TGCP6GA00G GGAGGAGG06 1260 

GCCGAGTACC AOCTCCTGOT GGAGGCCAAC GACCAGGGGC GCAATgOGGG CCOOCTCAOT 1320 

GCCACX»CCA COOTGTACAT OGAGGTGOAG GAOGAQAACG ACAACTACCC CCAGTTCAGC 1380 

J GAOCAGAACT AOGTGGTCCA GGTGCCC6AG GAOGTCQGGC TCAACA0Q6C TGTOCTGOGA 1440 

GTGCAGGCCA CQGA COGQOA OCAGGGCGAO AAGGG6GCCA TTCACTACAG CATCCTCAGC 1500. 

GGGAAOGTGQ G06GCCAGTT CTACCTGCAC TOGCTGAQCO GGATCCTG6A TGT6ATCAAC 1560 

CCCTTGGATT TOGAQOATGT CCAGAAATAC TCGCTGAGCA TTAAGGCCCA GGATGOGGGC 1620 

OGGCCCCCGC T CATC AATTC TTCAQQGGTG GTGTCTOTGC AGGTGCTGGA TGTCAAOGAC 1680 

AACGAGCCTA TCTTTGTGAG CAGCCXTTTC CA0GCCA06G TGCXGOAOAA T6TGCC0CTG 1740 

G6CTACCC0G TGQTOCACAT TGAGOGGGTG GAGGOBGACT CTGGAGAQAA OGCCOGGCTO 1800 

CACTATOGCC TOGT66ACAC GGCCTCCACC TTTCT0G6GG GOGGCRGCGC TGGGCCTAAO 1860 

AATCCTGCCC CCACCCCTGA CTTCCCCTTC CAGATCCACA ACAGCTCXX3G TTOGATCACA 1920 

^ _ 6TGTGTGC00 AGCTGGACCG CGAOQAGGTG GAGCACTACA GCTTOGGGGT GGAGGCGGTQ 1980 

10 GACCACGGCT OGCCCCCGAT GAGCTCCTCC ACCAGOGTGT CCATCAGGGT GCTGGACGTG 2040 

A ATGACAA OG AGCGGOTGTT CAOGCAGCXr ACCTAOQAGC TT06TCTGAA TGAGGATGOG 2100 

GCOGTOGGGA GCAGOGTGCT GACCCTGGAG GCC060QACC GTQA08GCAA CAGTOTQATT 2160 

ACCTACCAGC TCACAGOCGG CAACACCCGO AACOGCTTTG CACTCAGCAO CCAGAGAOGQ 2220 

GGCXSGCCTCA TCACXCTGGC GCTACCTCTO GACTACAAGC AGGAGCAGCA GTACGTGCTG 2280 

G0G6TGACAG CATCOGAOGG CACAOGGTCG CACACTGCGC ATGTCCTAAT CAAOGTCACT 2340 

CATGCCAACA CGCACAGGCC TGTCTTTCAO AGCTGCCATT ACACAGT6AG TGTCA6T6A0 2400 

GACAGGCCTG TGGGGACCTC CATT6CTACC CTOlGTGCCA A0QATGA6GA CACAGGAGAG 2460 

AATGCCXX3CA TCRCCTACXTT GATTCAGGAC CCOGTGCCGC AGTTCCGCAT TGACCCCX3AC 2520 

AGTQGCACCA TOTACACCAT GATGGAGCTG GACTATGAOA ACCAGQTCGC CTACACGCTG 2580 

ACCATCATOG CCCAGGACAA 06GCATCC0G CAQAAATCAG ACACCACCAC CXH'AGAQATC 2640 

CT CATC CTOG AT60CAATQA CAATGCACCC CAQTTCCTGT GGQATTTCTA CCAGGGTTOC 2700 

ATCTTTGAGG ATOCTOCACC CTOGACCAGC ATOCTCCAGG TCTCTGCCAC GGACOGGOAC 2760 

TCAGGTCCCA ATGGGOGTCT GCTX3TACACC TTCCAQGGTG GGGACGAOGG CGATGGGGAC 2820 

TTCTACATCG AGCCX»CGTC OGQTGTGATT CGCACCCAGC GCCGGCTGGA CCGGGAOAAT 2880 

3U GTGG CXX3TGT ACAACCTTTG GGCTCTGGCT OTGGATOGGG GCaCTCCCAC TCCCCTTAGC 2940 

GCCTOGGTAG AAATCCAGGT GAOCATCTTG GACATTAATG ACAATGGCCC CATOTTTOAG 3000 

AAG6A0GAAC TGGAGCTGTT TGTTGAGOAG AACAACCCAG TGOGGTOGGT GGTG6CAAAG 3060 

ATTCQTGCTA AOOACCCTGA TGAAGGCCCT AATGCCCAOA TCATGTATCA GATTGTGGAA 3120 

GGGQACATGC GGCATTTCTT CCAGCTGGAC CTQCTCAACQ GGQACCTGCG TGCCATGGTG 3180 

GAGC TGGACT TTGAGGTCCG GOGGGAGTAT QTQCTGGTGG TGCAGGCX3VC GTOGGCTCCG 3240 

CTGOTQAGCC GA6GCACG0T GGACATCCTT CTOGTGGACC AGAATOACAA CCOOCCTGTO 3300 

CTGCCCGACr TCCAOATCCT CTTCRACaUlC TATQTCACXa ACAASTCCAA CAOTTTCCCC 3360 

ACCGQCGTGA TCXXKrrGCAT CCOGGCCCAT GACCCCGACG TOTCAGACAa CCTCAACTAC 3420 

ACCTTCGTGC AGGGCAACGA GCTQOGCCTG TTGCTGCTGG ACCCOGCCAC GGGCGAACTG 3480 

CAGCTCAGCC GCGACCTGGA CAACAACOGG COGCTGQAGG CQCTCATGQA GGTGTCTGTG 3540 

TCTOATGGCA TCCACAGOGT CACGGCCTTC TGCA0CCT6C 6TGTCA0C3VT CATCAOGGAC 3600 

QACATGCTGA CCAACAGCAT CACTOTCCQC CTGGAOAACA TGTCOCAGGA GAA6TTCCTG 3660 

TCCCOGCTGC TGGCCCTCTT CX3TGGAGGGQ GTGGCOGCCG TGCTGTCCAC CACCAAGGAC 3720 

GACGTCTTOG TCTTCAACGT CCAQAACQAC ACCGACGTCA GCTCCAACAT CCTGAAOGTG 3780 

ACCTTCTCGG COCTGCTGCC TGG0GGCX5TC CGCGGCCAGT TCTTCCOGTC GGAGGACCTG 3840 

CAGGAG CAQA TCTACCTOAA T0QQACX3CT0 CTGACCACCA TCTCCACX3CA GOGCGTGCTG 3900 

OCCTTOGAOG ACAACATCTG CCTGOGOGAG CCCTGOGAGA ACTACATGAA GTQCGTGTCC 3960 

GTTCTGCX5AT TCGACAGCTC OGOQCCCTTC CTCAGCTCCA CCACOGTGCT CTTCCGGCCC 4020 

ATCCACCCCA TCAACGGCCT GOGCTGCCGC TQCCCGCCCXS GCTTCACOGG CGACTACTGC 4080 

GAGACGOAGA TCQACCTCTG CTACTCXXSAC CCGTGCGGCG CCAACGGCCG CTQCCGCAGC 4140 

0G0QA6GGOQ GCTACAOCTG OQAOTGCTTC GAGGACTTCA CTGGAGAGCA CTGTGAGGTG 4200 

GAT0CCC6CT CA60C0GCTG TGCCAA06GG GTGTGCAAQA ACGGGGGCAC CT G CGTGAAC 4260 

CTGCTCATOG GOGGCTTCCA CTGOOTOTGT CCTCCTGGCX3 AGTATGAGAG GCCCTACTGT 4320 

GAGGTGACCA CCAGQAGCTT CCCJGCXCCAG TCCTTOGTOV CCTTCCXSGGO CCTGAGACAG 4380 

OGCTTCCACT TCACCATCTC CCTCAOGTTT GCCACTCAGO AAAGGAACGG CTTQCTTCTC 4440 

TACAAOGGOC GCTTCAATGA 6AAGCACX3AC TTCATCQCCC TGGAGATOGT GGACGAGCAG 4500 

GTOCAGCTCA CCTTCTCTGC AOGCGAGACA ACAACGACXX3 TGGCACXX3AA GGTTCCCAGT 4560 

GOTGTQAGTG ACGGGCGGTG GCACTCTGTG CAGOTGCAQT ACTACAACAA GCCCAATATT 4620 

GGCCACCTGO QCCTOCCCCA TGGGCCGTCC GGGQAAAAGA TGGCOGTGGT GACAQTGGAT 4680 

Q ATTG TGACA CAACCATGGC TGTGOGCTTT GGAAAGGACA TOGGGAACTA CAGCTGCGCT 4740 

GOOCAGGGCA CTCAGACOGG CTOC3Utf5AAG TCCCTGGATC TGACOJGCCC TCTACTCCTG 4800 

GGGGOTGTCC GCAACCTGCC AGAAOACTTC CCAGTGCACA ACCGGCAGTT CGTCGGCTGC 4860 

ATOCGQAACC TGTCAGTCGA OGGCAAAAAT GTGGACATOG CCGGATTCAT OGCCAACAAT 4920 

GGCACCOGGQ AAGGCTGOQC TQCTOGGAGO AACTTCTGOG ATGGGAQGCX3 GTGTCAGAAT 4980 

GGAGGCACCT GTGTCAACAG GTGGAATATG TATCTQTGTO AGTQTCCACT C0GATTCX3GC 5040 

GQGAA6AACT GTQAGCAAGC CATGCCTCAC CCCCAGCTCT TCAGCGGTGA GAGOGTCGTG 510 0 

TCCTGGAQTQ ACCTGAACAT CATCATCTCT OTGCCCTGGT ACX^XSGGGCT CATGTTCCX3G 5160 

ACCCGGAAGQ AGGACAOCXST TCTQATGGAG GCCACa^GTG GTGGGCCCAC CAGCTTTCGC 5220 

CTCCAGATCC TGAACAACTA CCTCCAGTTT GAGGTGTCCX: AOGGCCCCTC C3QATGTGGA0 5280 

TOOGTGATGC TGTCCGGGTT GOOGGTGACC GAOGGGGAGT GGCACCACCT GCTGATOGAG 5340 

CTGAACAATO TTAAGGAGGA CAQTQAGATG AAGCACCTGG TCACCATGAC CTTGGACTAT 5400 

0GGATG6ACC AGAACAAGGC AOATATOGGO GGCATGCTTC COQGGCTGAC GGTAAGGAGC 5460 

GTGOTGGTCG GAGGOGCCTC TGAAGACAAG GTCTCOGTGC GCOGTGQATT CCGAGOCTGC 5520 

ATGCAGGGAG TGAGGATGGG GGGGAOGCXX: ACCAAOGTOG CCACCCTGAA CAT6AACAAC 5580 

GCACTCAAGO TCAGGOTGAA GGAOGGCTGT GATGTGGAOG ACCCCTGTAC CTCOAGCCCC 5640 

TtSTCCCCCCA ATAOCOGCTO CCAOGAOGCC TQGOAGGACT ACAGCTGOST CTGTGACAAA 5700 

GGQTACCTTO GAATAAACTG TGTGGATGCC TGTCACCTGA ACCCCTGOQA GAACATGGGG 5760 

GCCTGCGTGC GCTCCCCCGO CTCCXXIGCAG GGCTACGTGT GCGAGTGTGG GCCCAGTCAC 5820 

TACGGGCOGT ACTGTGAGAA CAAACTCGAC CTTCOSTGCC CCAQAGGCTG GTGGGGQAAC 5880 

COOGTCTGTO GACCCTGCCA CTGTGOOGTC AGCAAAGGCT TTGATCCXX5A CTOTAATAAO 5940 

A0CAA0G6GC AGTGCCAATG CAAGGA6AAT TACTACAAGC TOCTAOCCCA GQACACCTGT 6000 

CTGOCCTGCQ ACTGCTTCCC CXIATGGCTCC CACAGCOOCA CTTGOGACAT GGOCACOOGO 6060 

CAGTGTGCCT GCAA0CCO6O CQTCATOGGC OGCCAGTGCA AOOOCTGCXIA CAACCXXITTT 6120 

G0CX3A0QTCA OCAOGCTC3G6 GTGTGAACSIO ATCTACAATO GCTGTCCCAA AGCATTTGAG 6180 
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GCOGGChTCT GOIGGCOiaV GACCAASTTC GOGCAGCOOG CIXSOUO WC C AT6CCCTAA6 6340 . 
GGATCO Q TT G GAAATGOGOT COQAOVCTOC A0CGGGGA6A AOGGCXGQCT 6CCCCCAGAG 6300 
CTCTTTAACT GTACCACCAf CTCCTTOGTG GACCTCAGGG CCS^TOAATGA GAAGCTGA6C 6360 
CGCAATGAGA 06CAGGTGGA OGGOGCCAQO GCCCTGCAGC TGQTQAGQGC 6CTGCGCA0T 6420 
D GCTACftCAGC ACACGGGCAC GCTCTTTG6C AATQACXTTGC GCACGGOCTA CCAGCTGCTG 6480 
GGOCAOOTCC TTCAGCAOQA QAGCTQGCAQ GhGGQCTTGG A0CTG6CAGC CA0SCAG6AC 6540 
GCOQACTTTC ACGAG6A0GT C A TCCACTOQ GGCftOCGCCC TCCTGOCOCC AOCCACCAGG 6600 
QCGOCGTGGG AGCAGATCCA GOGGAGGGAG GGCGGCACGG CACAGCTGCT C0GGCX3CCTC 6660 
^ GAGGGCTACT TCAGCAACGT GGCAOGCAAC GTGCGOCQGA CGTACCT0CX3 GCCCTTCGTC 6720 
lU ATC3GTCACCG CCAACATQAT TCTTGCTGTC GACATCTTTQ ACAAGTTCAA CTTTAOGGGA 6780 
GCCAGGOTCC OGCGATTGGA CACCATGCAT aAAGAOTTGC CCAGGGAGCT QGAOTCXTrCC 6840 
GraXXYitX! CAGOCGACrr CTTCAGACCA CCTGAACAAA AAOAAGGCCC CCTGCTGAGO 6900 
COGGCTGGCC GGAGGACCAC CCCGCAGACC ACGOGCCCGG GGCCTGGCAC OGAGAGGGAG 6960 
GCCCC5QATCA GCAGGCGQAG GOSACACCCT QATQACOCTG GCC3«3TTC3GC OOTCGCTCTO 7020 
\5 GTCATCAITT ACC6CACCCT GGGGCAGCTC CTGCCOQAGC GCTA06AC0C CXMCXXSTGGC 7080 
AGCCTCOBOT T6CCTCAC0S GOCCATCATT AATACXXX3QA TGGTGAGCAC GCTGGTGTAC 7140 
AGCQA6GG60 CTCGGCTCCC GAGACOCCTG GAGAGGCC06 TCCTGGTGGA GTTG6CCCT0 7200 
CTGGAGGTGQ AOGAGOGAAC CAAGCCTQTC TGC6TGTTCT GGAACCACTC CXnGGCCGTT 7260 
0GTGGGAt3GG GAGGQTGGTC TGCCCXWGGC TGCGAQCTCC TGTCCAGQAA CCGQACACAT 7320 
20 GT06CCXGCC AOTGCAGCCA CACA6CCAGC TTTGOSGTGC TCATGGATAT CTGCAGG06T 7380 
GAGAAC0G66 AGGTCCT60C TCTGAAGATT GTCACCTATO CGGCTGT6TC CTTGTCACTa 7440 
GCAGCCCTGC TGGTGGCCTT C 3GTCCTCXTG AGCCTGGTCC GCATGCTGCG CTCCAACCTG 7500 
CACAGCATTC ACAAGCACCT CGCCQTGGOG CTCTTCCTCT CTCAQCTQGT OTTCGTGATT 7560 
GGGATCAACC AGACGGAAAA CCCX3TTTCTG TGCACA6TGG TTGCCATCCT CCTCCACTAC 7620 
2,5 ATCTACATOA GCACCTTTGC CTGGACCCTC OTGQAQA6CC TGCATGTCTA CC6CATGCT0 7680 
AO0GAG6TGC 6CAACATC3GA CAGOGGGCCC ATGOGGTTCT ACTAOGTOGT GGGCTGGOGC 7740 
ATCCOGGCCA T TCTCAC AGQ ACTGGOGGTC GG0CTG6ACC CCCAGGGCTA 0GG6AACCCC 7800 
GACTTCTGCT GGCTGTOGCT TCAAQACACC CTGATTTGQA GCTTTGCGGO GCCCAT06GA 7860 
QCTGTTATAA TCATCAACAC AGTCACTTCT GTCCTATCTG CAAAGGTTTC CTGCCAAAGA 7920 
OU AAGCACCATT ATTATGGGAA AAAAGGGATC GTCTCCCTGC TQA6GAC06C ATTCCTCCTG 7980 
CTGCTGCTCA TCAGCQCCAC CTGGCTGCTrG OGOCTGCTGO CTGTGAAC06 CGATGCACTG 8040 
AGCTTTCACT ACCTCTTOGC CATCTTCAGC GGCTTACAGG QCCCCTTOQT CCTCCrrTTC 8100 
CACT6C6T6C TCAACCAGGA GGTC06GAAG CACCTGAAGG GOGTGCniGG 06GGAGGAAG 6160 
CTGCACCTGG AGGACTCXaSC CaVCCACCAGG GCCACCCTGC TGACGCGCTC CCTCAACTGC 8220 
35 AACACXACCT TCGGTGACGG GCCT6ACATG CTG06CACAG ACTTGGGCGA GTCCACOGCC 8280 
TOGCTGSACA GCATOGTCAG GGATGAAOGO ATOCAGAAGC TOGGO G TSTC CTCTGG GC I G 8340 
GTGA6GGGCA GCCACX3GAGA GCCA6ACGCG TCCCTCATGC CCAGGAGCTO CAAGOATCCC 8400 
CCTGGCCACG ATTCCQACTC AGATAGCGAG CTGTCCCTGG ATGAGCAGAG CAGCTCTTAC 8460 
, GCCTCCTCAC ACTCGTCAQA CAOOQAGGAC GATGGOGTQQ QAGCTOAGOA AAAATGGGAC 8S20 

4U CCGGCCAGGO GCGCCGTCCA CAGCACXXCC AAAGG6GAC0 CTGTQGCCAA CCACGTTCCG 8580 
GCOGOCTGGC COQACCAOAO CGTQGCTGAO AGIGACAOTG AOQACCGCAO OGGCAAGCCX: 8640 
OGCCTGAAGO T6GAGACCAA G0TCA808TG GAGCTOCACC GOQAGGAGCA 66GCAGTCAC 8700 
CGTQGAQAGT ACCCCCCGGA CCAGGAGAGC GGGGGCGCAO CCAGGCTTGC TAGCAGCCAG 6760 
CCCCCAQAGC AGAGGAAAGQ CATCTTQAAA AATAAAGTCA CCTACCCGCC GCCGCTGACG 6820 
HJ CTGACX3GAQC AQACGCT6AA GGGCC3QGCTC 0GGGAGAA6C TGGC0GAC7G TGAGCAGAGC 8880 
CCCAGATCCT GGC6CA0OTC TTCCCTQQGC TCTGGG8QCC COQACTQGQG CAXCACAGTC 8940 
AAGAGCCCTG GGA666AG0C G66GCGTGAC CACCTGAAGG GGGTGGCCAT GAATGTG06C 9000 
ACTGGGAGCG CCCAGGCCGA TGOCTCOQAC TCTQAOAAAC OGTGAGGCAA GCCOGTCACC 9060 
CCACACAGGC TGCGGCATCA CCCTCAGACC TTGGAGCCCA AGGGGCCACT GCCCTTQAAG 9120 
5U TGGAGTGGGC GCAGAGTGTa 6CGGTCCCCA TGGTGGC3\GC CCCCC6ACTG ATCATCCAGA 9180 
CACAAAGGTC TTGGTTCTCC CAGGA6CTCA 0G6CCTGTCA QACCTGOTGA CAAOTGCCAA 9240 
AGQCCACAGG CATGA6GGA0 6CGTGGA0CA CT66GCCAGC AC06CTGAGT GCTAAGACTQ 9300 
CAGTCAAAGC CAGAACTQAG AGGGGACCCC AGACTGQGCC GAGAGGCTGO CCAGAGTTCA 9360 
GQAACQCCGG GCACAGACCA AAGACCGOGG TCCAGCCOCG CCCAGGCGGG CATCTCATGG 9420 
J J CAGT6CGQAC COGTGGCTGG CAGCCOGGQC AOTCCTTTGC AAAGGCACCC CrTOTCTTAA 9480 
AATCACTTCO CTATGTQGQA AAGGTGGAQA TACTTTTATA TATTTGTATG GGACTCTGAG 9540 
GAGGTGCAAC CTGTATATAT ATTGCATT03 TGCTGACTTT GTTATCCCGA GAOATCCATQ 9600 
CAATGATCTC TTGCTGTCTT CTCTGTCAAQ ATTGCACAGT TOTACTTGAA TCTOGCATGT 9660 
GTTGACQAAA CTGGTGCCCC AGCAGATCAA AGGTGGGAAA TAOGTCAGCA GTGGGGCTAA 972 D 
OU AACCAAG06G CTA6AAGCCC TACAGCTGCC TTOQGCCAGG AAGTOAGOAT GGTGTGGGCC 9780 
CTCCCC6C0G OCCCCCTGGQ TCCCCAGTGT TCGCIGTGTG TQ0GTTT6TC CTCTGCTGCC 9840 
ATCTGCCCC6 GCTGTGTGAA TTCAA6ACAG GGCAGTGCAG CACTAGGCA6 GTGTGAGGAG 9900 
CCCTGCTQAQ GTCACTGTGQ GGCACGOTTG CCACAOGGCT QTCATTTTTC ACCTGGTCAT 9960 
^ TCTGTGACCA CCACCCXrCTC CCCTCACCQC CTCCCAGOTG GCCCOGQAQC TQCAGGTGGQ 10020 

OJ GATGGCTTTG TCCTTT G CTC CTGCrCCCCG TGGGACCTGG GACCTTAAAG C G TTGCAGOT 10080 
TCCTGATTTG GACAGAGOTG TQGGGCCTTC CAGGCCGTTA CATACCTCCT GCCAATTCTC 10140 
TAACTCTCTG AGACTGCGAG GATCTCCAG6 CAGGOTTCTC CCCTCTGGAO TCTGACCAAT 10200 
TACTTCATTT TGCTTCAAAT GGCCAATTGT GGAOAGGOAC AAAGCCAOVG CCACACTCTT 10260 
CAAGGGTTAC CAAACTGTTT TTQGAAATTC ACACCAAGOT CGGGCCCACT GCAQGCAGCT 10320 
7U GGCA CAGOOT QGCCO GAGGG GCIGTGGAAC GGGTOCOGQA ACTGTCAGAC ATGTTTGATT 10380 
TTAGOGTTTC CTTTGTTCTT CAAATCAGGT 6CCCAAATAA GTGATCAGCA CAGCTGCTTC 10440 
CAAATAGGAO AAACCATAAA ATAGGAT6AA AATCAAGTAA AATQCAAAGA TGTCCACACT 105 00 
GTTTTAAACT TGACCCTGAT GAAAAT6T6A GCACTGTTAG CAGATGCCTA TGGGAGAGGA 10560 
AAAGOGTATC TGAAAATGGT CCAGGACAGG AGGATGAAAT GAGATCCCAG AGTCCTCACA 10620 
75 CCTGAATGAA TTATACAT6T GCCTTACCAG GIGAOTGGTC TTTOQAAGAT AAAAAACTCT 10680 
AGTCCCTTTA AACGTTTGCC CCTGGCOTTT CCTAAGTAOQ AAAAGGTTTT TAAGTCTrOQ 10740 
AACAGTCTCC TTTCATOACr TTAACAGGAT TCTQCCOCCT GAGGTGTAAT TrrmX ^ 'n'C 10800 
TATTTTTTTC CACGTACTCC ACAGCCAACA TCACC3AG0TG TAATTTTTAA TTTGATCAGA 10860 
Q ACTGTTACCA AAAAACAACT GTCAGTTTTA TTGAGATGGG AAAAATGTAA ACCTATTTTT 10920 
oU ATTACTTAAO ACTTTATOQG AGAGATTAOA CACTQGAGGT TTTTAACAGA AOnGTATTT 10980 
ATTAATGTTC AAAACACTGG AATTACAAAT GAGAAGAGTC TACAATAAAT TAAGATTTTT 11040 
aAATTTGTAC TTCTO OQOTQ CTG0TI7TTC T CCAC AAACA CCCC06CCCC TCCCCATGCC 11100 
CAGGGTGGCC GTOQAAGGGA OQGTTTAGGG AOGTGCAGCT GAGCTaTOOQ TOTCOCATGC 11160 
TOOCTCAGCC AOTOOAACQT G0G6GAACTT TTTQTOCATT GCCTAOTAGO CCTGOCACAG 11320 
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CCTAGA TCGO CAGTTTTTGT CTTTCACCAA ATTTGAGGAC Xmirim TOCCATTATT 11380 
TCTTCAGTTT TCTTTTCTTG CACTOATCTT TCTOCTCTCC TTCTOTGACT GCAGTGACTC 11340 
AQAOSTTAQA CCTCTTOATQ TTTTCCCACr G0TCGCTGA6 GCTCTOTTC 
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MAPPPFPVLP VLLLLAAAAA LPAMQLRAAA HEPRVPGGTR AFALRFGCTY AVQAACTPRA 60 

PRELLDVORD GRLAGRKRVS GAGRPLPLQfV RLVARSAPTA LSRRLRARTH LPG06ARARL 120 

OGTGARLCGA LCPPVPGQCA AAQHSALAAP TTLPACRCPP RPRPRCPGRP ICLPPGGSVR 180 

LRLLCALRRA AGAVRVGLAL EAATAGTPSA SPSPSPPLPP NLPEARAGPA RRARROTSGR 240 

GSLKPPMPNY QVA LPEWBP A GTLILQLHAH YTIEGEEERV SYVMEGLFDB RSRGYPRIDS 300 

ATGAVSTOSV LDRBTKBTHV LRVXAVDYST PPRSATTYXT VLVKHmiBS PVFEQSEYRE 360 

RVRENLEVGY 6VLTZRASDR DSPZNANLRY RVLGOANDVP QLNBSSGWS TRAVLDREEA 420 

AEYQLLVBAN DQGRNPGPLS ATATVYIBVE DENDNYPQFS EQNYWOVPE DVOLNTAVLR 460 

VQATDRDQGQ NAAIHYSILS GNVAGQPYLH SLSGILDVIN PLDFEDVQKY SLSIKAQDGG 540 

RPPLINSSGV VSVQVLDVND NBPIPVSSPF QATVLENVPL GYPWHIQAV DADSGENARL 600 

HYRLVDTAST FU2QG5AGPK NPAPmPPP QIBH8SGWIT VCABLDREEV EHYSFGVBAV 660 

DH6SPPMS8S TSVSITVLDV MDNDPVFTQP TYELRUIEDA AVGS8VLTLQ ARDRDANSVI 720 

TYQLTGGNTR NRFALSSQRG GGLITLALPL DYKQBQQYVL AVTASDGTRS HTAHVLINVT 780 

DANTHRPVPQ SSHYTVSVSE DRPVOTSIAT LSANDEDTQE NARITYVIQD PVPQFRIDPD 840 

SGTMYTMMEL DYEMQVAYTL TIMAQDNGIP QKSDTTTLEI LILDANDNAP QFLWDFYQGS 900 

ZFEDAPPSTS ZUIVSATDRD S6PNGRLLYT FQGGDDGDGD PyXEPTSOVZ RTQRRLDREN 960 

VAVYNLWALA VDRGSPTPL8 ASVEZQVTZL DZNDNAPMFB KDBLBLFVEE NNPVGSWAK 1020 

ZRANDPDBOP KAQIMYQIVB GDMRHFFQLD LLNGDLRAMV BLDPEVRREY VLWQATSAP 1080 

LVSRATVHIL LVDQNDNPPV IiPDFQILFNN YVTNKSNSFP TGVIGCXPAH DPDVSDSLNY 1140 

TFVQGNBLRL LLLDPATGEL QI*SRDLDNKR PLBALMEVSV 8DGIHSVTAF CTLRVTZITD 1200 

DMLTNSITVR LENMSQEKFL SPLLALFVBO VAAVL8TTKD DVFVFNVQND 1DVSSNZLNV 1260 

TF8ALLPGGV RGQFFP8EDL QEQIYLNRTL LTTZ8TQRVL PFD017ZCLRB PCENYMKCVS 1320 

VLRFDS9APF LSSTTVLFRP ZHPINGLRCR CPPGPTGDYC BTBIDLCYSD P03ANGRCRS 1380 

RBGGYTCECF EDPTGEHCEV DARSGRCANG VCKNGGTCVN LLIGGFHCVC PPGBYERPYC 1440 

BVTTRSFPPQ SFVTFRGLRQ RFHFT18LTF ATQ3SRNGLLL YNQRPNBKHD FIALEIVDEQ 1500 

V QLTFS AGBT TTTVAFKVPS GVSDORHHSV QVQYYNXPNl GKLGLPRGPS GBXMAWTVD 1560 

DCDTTHAVRF GKDZG!IYSCA AQ6TQTGSXK SUSLTGPLLL 6GVPNLPBDF PVHNRQFV6C 1620 

MRKLSVDGKN VDMAGPZANN GTRBGCAARR NPCDGRRC3QN GGTCVNRWNM YLCBCPLRFG 1680 

GKNCBQAMPH PQLFSGESW SWSDLKXZI8 VPWYLGLMFR TRKBDSVLME ATSGGPTSPR 1740 

LQILNNYLQF EVSHGPSDVE SVMLSGLRVT DGEWHKLLZE LXNVXBDSEM KHLVTMTLDY 1800 

QKXmMilQ GMLPGLTVR8 VWGQASEDK VSVRRGFRGC MQGVRHGGTP THVATUmTOf 1860 

ALKVRVXDGC SVDDPCTSSP CPFNSROIDA WBDYSCVOSK GYLOZKCVDA CSUIPCEKMO 1920 

ACVRSPGSPQ GYVCBC6PSH YGPYCENKLD LPCPRGWWGH PVOGPCHCAV SKGFDPDCNK 1980 

TNOQOQCKKN YYKLLAQDTC LPCDCFPHGS HSRTCDMATG QCACKPGVIG RQOIRCDNPF 2040 

AEVTTLGCgV ZYNGCPKAPB AOXWWPQTKF GQPAAVPCPK GSVGNAVRHC SGEKOWLPPE 2100 

LFNCTTZ8FV DLRAMNEKLS HM ETQV PGAR ALQLVRALRS ATQHTGTLPQ NDVRTAYQLL 2160 

GRVZ^KBSWQ QGPDLAATQD ADFHEDVZBS 6SALLAPATR AANEQIQRSE GGTAQLLRRL 2220 

EGYFSNVARN VRRTYLRPPV ZVTANMZLAV DXFDKFNFTG ARVPRPDTIH EEPPRKLBSS 2280 

VSPPADPPRP PBEKEGPLLR PAGRRTTPQT TRPGPQTERE APZSRRRRHP DDAGQFAVAL 2340 

VZ XYRTLGQL LPBRYDPDRR 8LRLPHRPIX UTPMVSTLVY 8B6APLPRPL BRPVLVEFAL 2400 

liBVBERTXPV CVPHNK8LAV GGTGGHSASO CELLSRKRTH VA0QC8BTAS FAVLMDZSRR 2460 

BMGEVLPLKZ VTYAAV8LSL AALLVAFVLL 8LVRMLRSHL HSZHKHLAVA LPLSQLVPVZ 2520 

GZNQTEMPFL CTWAILLHY XYMSTFAWTL VESLHVYRML TEVRNIDTGP MRFYYWGWG 2580 

IPArVTGLAV GLDPQGYGNP DPCWLSLQOT LIWSPAGPIG AVIIINTVTS VLSAKVSCQR 2640 

KHHYYGKKOZ VSLLRTAPLL LLLZSATHLL GLLAVNRDAL SFHYLFAIF8 GLQGPFVLLF 2700 

HCVZ^BVRK BLIQSVLQGRK LKLEDSATTR ATLLTR8UIC 1ITTF(B)GFDM LRTDLGBSTA 2760 

SLDSZVRDEO ZQKLGVSSGL VRGSBOEPDA SLMPRSCKDP FGRDSDSDSB LSLDEQSSSY 2820 

A5SHSSDSH) DGV6ABBKWD PARGAVH8TP KGDAVANHVP AGWPDQ8LAE SDSEDPS6KP 2880 

RLKVETKV8V BLHRBBQGSH RGBYPPDQSS GGAARLASSQ PPEQRKGILK NKVTYPPPLT 2940 

LTEQTLKORL REXLADCEQS PT8SRT8SLG 8GGPDCAZTV KSPGRBPGRD HLHGVAMNVR 3000 
TGSAQADQSD SEKP 



A33 nNA SEQUENCE 

Gene name: CXCR3 (G protein-coupled receptor 9) j chemolcine (C-X-C) receptor 3 
Ctaigene nunA>ert Bs. 198252 

Probeset Accession it X95B76 
Nucleic Acid Accession 9t X95876 

Coding sequence: 69-1175 (underlined sequences correspond to start end stop codons) 

1 li 21 31 41 51 

I t I I I I 

CCAACCAGAA QCA OCAA ACC AGAQGGQCAO GCAGCACACC ACGCAGCA6C CAGAGCACCA 60 

GCCCAGCGMMSpTOCniAO GTGAOTQAOC ACCAAGTGCT AAATOACSCC GAGGITGCCQ 120 



845- 



wo 03/042661 



PCTAJS02/36810 



CCCTCCTGGA GAACTTCAGC TCTTCCTATO ACTATGQAGA AAACQAGAOT QACT08TGCT 180 

GTACCTCCCC GOCCTGCCCA CAGOACTTCA GCCTQAACTT OQACOOGGCC TTCCT G CCAG 240 

CCCTCT ACAG CXTTCCTCTTT CTGCTGGGGC TGCTGGGCAA OGGOQCQOTO GCAOCOGTGC 300 

TGCTQA6C06 GCGGACAGCC CTOAOCAGCA CCOACACCTT CCTGCTCCAC CTAGCTGTAG 360 

CRQACa OGCT GCT6GTQCTG ACACTGCOQC TCTGGGCAGT GGAOGCTQCC GTOCAQTGGG 420 

TCTTTGGCTC TGGCCTCTGC AAAOTG6C3UJ QlXaCCCIVrT CRACATOVAC TTCTAOGCAQ 480 

GAGCCCrCCT GCTGGCCrOC ATCA6CTTTG ACGGCTAGCT GAACATA6TT CATGCCACCC 540 

AGCTCTACOQ CCGGGGGCCC COGGCCCGCG TQACCCTCAC CTGCCTGOCT OTCTGGOGGC 600 

TCTGCCTGCT TTT06CCCTC CCAGACTTCA TCTTCCTGTC GGCCCACCAC GA0GAGCX5CC 660 

TCAAOQCCAC CGACTG CCRA TACAACTTCC CACAGGTGGG CCGCAOGGCT CTGOGGGTGC 720 

TGCAGCTGGT 0GCTG6CTTT CTGCrGCCCC TGCTGGTCAT GGCCTACTGC TATQGCCACA 780 

TCCTGGCOGT GCTGCTGGTT TCCAQGGQCC AGOGGOGCCT 60GGGCCATG OaGCTGGT GG 840 

TGCTGOTOGT OGTGGCCTTT 6CCCTCTGCT GGACCCCCTA TCACCTGGTG OTGCTGGTGG 900 

ACATCCTCAT GGACCTGGQC GCXTTGGCCC GCAACTGTGQ CCGAGAAAGC AGGGTAGACG 960 

TGO CCAAGTC GGTCACCTCA GGCCTGGGCT ACATGCACTQ CTGCXrTCAAC CCOCTOCTCT 1020 

ATGCCTTTGT A6GG0TCAA0 TTGGGGGAGC GGATGTGGAT GCTGCTCTTG OGCCTGGGCT 1080 

GCCCCAACCA GAGA0Q6CTC CAGAGGCW3C CATCGTCTTC CXXJCOGGSAT TCATCCTGOT 1140 

CTOAOACCTC AQAGGCCTCC TACTOGGGCT TGTGAGGCOG GAATCOGGQC TCCCCTTTa} 1200 

CCCACAGTCT GACTTCCCOG GATTCCAGGC TCXncCCTCC CTCTGCCXSGC TCTGGCTCTC 1260 

CCCAATATCC TOGCTCCCGG GACTCACTGG CAGCCCCAOC ACCACCAGGT CTCCOGGGAA 1320 

GCCACCCTCC CaGCTCTOAO GACTGCACCA TTGCTGCTCC TTAGCTGCOV AOGCCXATOC 1380 

TGCCGCCCGA 6GTGGCTGCC TOOAGCCCCA CTGCCCTTCT CATTTGGAAA CTAAAACTTC 1440 

ATCTTCCCCA AGTGOGaOQA GTACAAGGCA TGGCX3TAGAQ GGTGCTGCCC CATGAAGCCA 1500 

CA GCCCA GGC CTCCAGCTCA GCAGTQACTG TGQCCATGGT CCCCAAQACC TCTATATTTG 1560 

CTCTTTTATT TTTATXSTCTA AAATCCTGCT TAAAACTTTT CAATAAACAA GATC6TGAGG 1620 

ACCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA BHCLITBRAT 1680 

UREBHCBIiAS TPBHCX)mHO LCX3UESK0US ECXCRMYLEV SERQVLDASD FAPLLENSTS 1740 

PYDYGBNESD F8DSPPCPQD FSI2IPDRTPL PALYSLLPLL GLLGNGAVAA VLLSQRTALfi 1800 

STOTPIiLHLA VADVLLVLTL PLWAVDAAVQ WVPGPQLCKV AGALFNINFY AQAPLLACIS 1860 

FDRYLSIVHA TQIYRIUJPRV RVALTCIWW QLCLLPALPD FXYLSANVDQ RLNATHCQVH 1920 

PPQVGRTALR VLQLVAOFIiL PLLVMAYGTA HILAVLLVSR GQRRFRAMRL WVWAAFAV 1980 

CWTPYRLWL VDIIM>VaVI( ARNG6RESKV DVAK8VT8(»I GYMHCCLNPL LYAPVGVXFR 2040 
EQMWKLFTRL GR8DQRGFQR QPS8SRRBSS NSBTTBASYL QL 



A3 4 Protein sequence 
Gene namai 
Ukilgene number: 
Protein Accession #« 
Signal sequence t 
Pfom domains I 
TraneiRendbrane domains s 
DRY boxi 

cellular Localization: 



CXCR3(G protein-coupled receptor 9) ;chemokine(C-X-C) receptor 3 
Ks. 198252 

P49682 
none £ound 
7tm_l {70-318] 

57-78, 92-113, 129-147, 169-190, 222-243, 257-275, 307-323 
148-149 

plasma raenft}rane 



1 11 21 31 41 51 

i 1 I I I I 

MVLBVSDHQV LNDAEVAALL QIFSSSYDYG EMESDSCCTS PPCPQDFSLN PDRAPLPALY 60 

SLLFLLQLLG NGAVAAVLLS RRTALSSTDT FLLBLAVADT LLVLTLPLHA VDAAVQWVFG 120 

SGLCKVAaAL FNINFXAaAI* IiLAdSFDRY UTIVHATQLY RRGPPARVTL TdAVHGLCL 180 

LFALFDFZFL SABHDERWA TROQIYKPPQV GRXAliRVLQL VAOFLLPLIiV MAYGYARZLA 340 

VLLVSRGQRR LRAMRLVWV WAFALCWTP YHLWLVDIL MDLGiOiARNC 6RBSRVDVAX 300 

SVTSGLGYMH CCU7PXAYAF VGVKFRERMW MLUALQCPN QRQLQRQPSS SRRD8SWSET 360 
SBASYSGL 



A3S PNA SEQOEWCE 
Gene namet 
Unigene nuznben 
Probeset Accession ft: 
Nucleic Acid Accession #i 
coding sequence t 



Di££erentially expressed C016 gene (clone MaCt5257) 

Hs. 69517 

AA447522 

BC001291 

44-541 (start and stop codons are underlined) 



1 11 21 

I I I 

G0GG60GC0G CGOGCTQACC CT C 0CT6GGC 
GCTQCTGGTC GTGGGCCTAC OGCSGGTGTG 
AQATCCAGAG GACTCCGAGC 6AA0GGAOQA 
TGAGAGAOAA AACACTTTOG AGTGCCAGAA 
CTGOOTT A TA GOQGCOSTOA AAATATTTOC 
OGCTGGTTGT GCAGCQAT66 AOAGACCCAA 
GCCCATGCCC TTCTTTTACC TCAAGTGTTG 
ACCTATCAAC TCATCAQTOT TCAAAGAATA 
GCTGTGGCTQ GCCATCCTCC TGCTGCTGGC 
AGCCACQGQA CTGOCACAOA CIGAGCCTTC 
ACCrGTTOCA TTAAACTTGT TTTCTOTTQA 
GGGATGGGAO AOTGGGGATC AGGTGCAGTT 
ACATTCAGAG OAAGTCCAGA TCTCCTGAGT 
AAATCAAACC TTGTAACTCA TTTATTGCTG 
CCTCTOAGGG CTTCAOTATT (XATGaQGAaa 
TGCTGAOATO CTTGCQAOCT TTCAGOTGAC 
GGGTGAAOAC ATC0CT0GA6 T6AAG6ACTC 
AGOGCTQCCC CGATTCCA(?r GCntSGAOOCG 



31 41 51 

I I I 

ACCGCTGGGG ACGATGGOGC TGCTOGCCTT 60 

GACAOACQCC AACCTGACTG COAGACAACO 120 

GGGTGACAAT AQAGTGTGGT GTCATGTTTG 180 

CCCAA6GAO0 TQCAAAT6GA CAGA6CCATA 240 

AOGTTrrTTC AT6GTT60GA AOCAOTGCTC 300 

OCCAGAOQAa AA0CGGTT7C TCCTGGAAQA 360 

TAAAATTCGC TACT6CAATT TAGAGGGGCC 420 

TGCTGGGAGC ATGGGTGAGA GCTOTGOTGQ 480 

CTCCATTOCA QCOGGCCTCA GCCTOTCTTG 540 

OGGftSC ATOQ ACTC Q C TCCA GACOGTTGTC 600 

TTACCTCTTO QTTTOACTTC CCAGGGTCTT 660 

GGCTCTTAAC CCTCAAGGGT TCTTTAACTC 720 

AGTGATTTTG GTOACAAGTT T'lT Cltrrm 780 

ATGGCCACTC TTTTCCTTOA CTCCCCTCTG 840 

OA0QCCXAAO TAOCACTCAT GGAGAGTATQ 900 

(XAGOAACAC TGOOGOAOTC TOAATQATTO 960 

CTCAGCATOG GQGGCA8TQG G6CACACQTT 1020 

CrGTGGATOQ C mcmTCU TCAAOCTTTC 1080 
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CTACCAGATT CCAOOAGOCA QAAGATAACT AATTGTGTPG AAGAAACTTA GACTTCftCCC 1140 

ACCAGCTC3GC ACAGGTGCAC AGATTCATAA ATTOOCACAC OTQTQTQTTC AACATCTOAA 1200 

ACTTAGGCCA AOTAQAGAGC ATCAGGOTAA ATGGOGTTCA TTTCTCTOTT AAGATGCAGC 1260 

CATCCATGGG GAGCTGAGAA ATCAGACTCA AAGTTCCACC AAAAACAAAT ACAAGGGQAC 1320 
TTCAAAAGTT CACGAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 



A3 6 Protein sequence i 

Gene namet Differentially expressed G016 gene (clone M6Ci5257> 

Unigene numben Hs. 69517 

Probeset Accession ft: AA447S22 

Protein Accession ft: AAK01291 

Signal sequence: 1-17 (first underlined sequence) 

Transmembrane domain t 146 - 162 

Cellular localization i plasma membrane 



1 11 21 31 41 51 

till)) 

M MiLA LLLW ALPRVWTDAN LTARQROPBO SQRTDE(a>NR VWCHVCEREN TFBCQNPRRC 60 
KHTEPYCVIA AVKXPPRPFM VAKQC8AGCA AMERPKPBBK RFLLEBPMPF PYLXCCKXRy 120 
CHLGGPPINS SVFKByAGSM GESOGQXjWLA XLIiXiLASXAA GLSLS 

A37 DNA SEQUENCE 

Gene name: ESTs 
Ubigene number 1 Hs. 2 93 616 

Probeset Accession ftt AN043782 
Nucleic Acid Accession ftt none found 

Coding sequence! 38-1075 (underlined sequence corresponds to start and stop codon) 



1 11 21 31 41 51 

I i I I I I 

A6CAACQAC0 CCGGGCAGCG GGAGCGQCGG CCGCGCCATG TGGCTGCTGG G8CCGCTGTG 60 

CCTGCTGCTG AGCAGCOCOG CGGAGAGCCA 6CTGCTCCCC GGGAACAACT TCACCAATGA 120 

GTGCAACATA CCAGGCAACT TCATGTGCAO CAATQGACOG T6CATCCCGG GOSCCTGGCA 180 

GTGTGACX3G0 CIGCCTGACT GCTTCGACAA GAGTQATGAQ AAGGAGTGCC CCAAGGCTAA 240 

GTOGAAATGT GQOCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTCG 300 

CTTCOGGTGC AATGG GTTTG AGGACTQTCC CGATGGCAGC GATGAAGAOA ACTGCACAOC 360 

AAACCCTCTQ CTTTGCTCCA COOCCCQCTA OCACTGCAAG AACGGCCTCT GTATKSACAA 420 

OAGCTTCATC TGCGATGGAC AGAATAACTG TCAAGACAAC AGTGATX3AG0 AAAGCTGT6A 480 

AAOTTCTCAA GAACCCGGCA GTGGGCAGGT GTTTGTGACT TCAGAGAACC AACTTGTGTA 540 

TTACCCCAGC ATCACCTATO CCATCATCGG CAGCTCCGTC ATTTTTGTGC TGGTGGTGGC 600 

OCTOCTGGCA CTGOTCTTGC ACCACCAOCQ GAAGCGGAAC AACCTCATGA CGCTGCCOGT 660 

GCACCGGCTG CAOCACCCTG TXJCTGCTGTC CCX5CCTGST0 8T0CI0GAGC ACCCGCACCA 720 

CTGCAAOGTC ACCTACAACG TCAATAATGG CATCCRGTAT GTOGCSCAGCC AGG0QGA6CA 780 

GA ATGC GTOG GAAOTAQQCT CCCCACCCTC CTACTCCGAG GCCTTOCTGG ACCAGAGGCC 840 

TOOQTGGTAT GACCTTCCTC CACOGCCCTA CTCTTCTQAC ACGGAATCTC TGAACCAAGC 900 

OGACCTGCCC CCCTACOQCT CCOGGTCCGG GAGTGCCAAC AGT6CCAGCT CCCAGGCAGC 960 

CAGCAGCCTC CTGA6CGTGG AAGACACCAG CCACAGCCCX3 GGOCAGOCTG 60CC0CA6GA 1020 

GGGCACTGCT GAGCCCAGGQ ACTCTGAGCC CAG0CA6GGC ACT6AAGAAG TATAAOTCOC 1080 

AGTTATTCCA AAGTCCATAT GGGTTAATCT GCTCTQACTT GTTGCCATTC TAACAATTTG 1140 

TGCTCATQGG AAGCTCTTTA AGCACCTGTA AGGATGTCTC AAGTTACAGT TTGGOATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCCAO ACTTCAOAGA TGTTTTTCTG GCGTCTCAQT 1260 

TGACATGATC TGTTGTGCGT CTTTTCrGTC AGOTCACTCT TCCCTTOQQA CCOQAGATCA 1320 

CACCCTCATT TTTCACATTA TTCTOTTTCT GTTGGAaAOA CAGCATATAA AACftOTATTG 1380 

AAATAGGCTG GGAQAGAGCA ATQTTTCTGT GCTATATTGQ ATGCTCAQAA GTGCAGGAGA 1440 

C6CTGGAGCC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGQA TTTGGGTTAG 1500 

ATQATCTAAC CAGGAGGCCA TCACTGGATG GTCACXICCCC CAAAAAAATT CCATTTGAGC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTTO ATGCOCOCAG TTCAGCAGAG TCAGTGOCCA 1620 

AAGAAAACTT TGGACGTGAG TAACACCCTT CAGCAOTCGC AAOOTTATTT TQQTTTTGTG 1680 

AAGGACTCTG AAA CCAT CTA CCCTGTATAA ATTCTGGCTT TAQAAATTTG CCCAAGAATO 1740 

CTCATTCTGA GAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAGGCAQG 1800 

GAGCCCCrCC CATQAOTTTA TCCAAOTTCT CAGCTCCTAA AATGCAGGCT GCCAAGACCC 1860 

TACACCTGCC CTGGCTCTAC AGCCACTTAC CTGQTTTCTO GACIQTCAOC CTCCGAGCTG 1920 

ACCTGCCOOT AGCCAAGGAA TOAGGACCTA ACTTGAOTTO GCCCAAAGTC TQACCTOOCT 1980 

GTATGTCCCT GTGGCCCACA CCCAGCCTOT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

CTCC AAAG TT CCCTTAACAC TTGCAAAGTC CTTTTTACCT GTGCATTTGG ACTTGAGGAC 2100 

ACTGGTTTCT ATCACAGGTG AGAGCCATOT TCAATAGCTC CAGCAAGCTC TCCTGGCTCC 2160 

CTGCACTGTO CACGCTCCTC TTOCCAAOOT OCCAATACCA QCACCTCrAO TTAGAGTTA6 2220 

GGTCAGGGTC AGGCCTCTCC CAACATCCCA GIAGTTTCTC CTCTGAGACA OITGGGCAAG 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG QATCXATTTT AAATCTTTTA GAAATGCATT 2340 

TGAAACAQTO TCWUTriT TTCCCTTCTA GTTAAGGGAC TATTTATATG TGTATAGGAA 2400 

AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGOTCCAAA OAAAOATGCA AAAGGAGATC 2460 

ACACCCTTGC COCGCTGAGC OCCGTGATAA CAA8TCACTC GAGACTAAOC TGICrTQCCAO 2520 

ACATTTGTGC ATTGTTQCAC TTTQAOOTTA TTATTTATCA AGTTCTTGAA GQAAGCAGAA 2580 

AGAGGGACTC CTCTCTCCCT CCGTGTATAG TCTCTATGTT TGTGCTAOTT Vint.TrrnT 2640 

TTCTCTGTGT CCAOTCAOCC ACAGGOCCOJ CCTCCCTGCA GGAATAAGGO GTAAAACOTT 2700 

AGGTGTTGTT TGGCAAGAAA CCACACTGAC T6ATQAGGGG TAAAATG6AA CCAGGTAGAG 2760 

CCACT CCGGG CAGCTGTCAC CCATTCaGAA C.nX,TntXXi CAQCTGAA6A AATOTTCRGT 2820 

AACCTGTTTG AOGCIAATTA AAACAQAGOC TQCASOAAOT GOOGCTAAAG TQGCATTCAO 2880 

T6ATCCT0TT CTGTAQACTT Ht ' ll ' lCm ' TTTTAACCAA ATOCAAASOA TGTTACAOAA 2940 

AA6CTAGCCA CTGGTATTTT OTTTTGTTTA AAAAAAAAAA GAAAOAAAOA AAGAAAGAAA 3000 
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AAOGQAAAGG 
CTGAGTAATC 
CATTTCATCT 
TTTCTOQTOC 
OAOTTAATCT 
ASATAAOGGA 
TGATTTTTTr 
TTTTTOGGGG 
TTTTTAACTC 
AAAATAOTCT 
A6TTACACTQ 
TTTTTAAAAA 
6AAAGGTTGT 
TTATACTTTC 
AAMMAAAAAC 
TGGGG066CG 
GAAACCCTGG 
GGAAACOGCA 
GGACTCAACC 



AACXTTAGCTG 
CftATAAAGAA 
OCTGTGAGTC 
TCTQGAAGTT 
CACTCGCTTT 
TGCCTACTAA 
AATGAATGTT 
GAGGGGGTTT 
ATTCCAACCA 
CATCTCTTTT 
TQATGACTGQ 
ATGCAACTAA 
GTGTCXsTTGC 
TAATAAATTT 
AWYWTTGGGG 
GGGCCCACGT 
CCAAGAAAAA 
GAOTOTTOGQ 
AOGAOGACCC 



CCTOTATCTT 
CTTTTGATGA 
AGAAGGQCTT 
GTTTAGAGGA 
TCTQCTTCCA 
TGCTTTTTTA 
TTTAAAAATA 
GTTTTCCAAC 
GGAAGCTTTT 
TTTCTCAAAT 
CCTATTACCT 
GTGOTTAATA 
TTPTTGrOTT 
OCAGTTTCAT 
GGGCTTGGGC 
AGGTAOGGCG 
GGTGQGGAOA 
TAAACCACAC 
AAQQQAACCC 



TCATTTTTAA 
a^CCAGAAT 
TATTTCTOOC 
AAOAATTCTA 
GGCATCTTAG 
AAACAAACAO 
TATAAATAGQ 
TCAAGATG6C 
TTATACATTG 
GAOATCCGTG 
GACTCAO CTC 
GTOTGTGAOG 
TTOGfTTAGGC 
TCTTTCTGTT 
CTOGCrAAAAA 
A0CA060QG0 
ATTCTCCACA 
CanAOAGAO 
GATAGAGTAC 



AATA6CACTT 
GTGTTAGAAC 
TTTGATGGQO 
ATTTTAATTA 
GAAAAACAAA 
OGACATTTTT 
ACACCAAAGC 
ACATTAGTGG 
CCTAAATCTA 
TTTTATTTTA 
OCTCTAGCTT 
CTCAAAGTTA 
TTGGTTTTGT 
TGTQCAAAWO 
GTTTTTAACA 
CCCAAA06G6 
0CA6AAAAAA 
AACTCAGAAO 
O 



GAGTTATTTT 
TCTGGCTGAA 
OCCCTTCTTC 
ATTGCGCAGT 
TGGTTTTAGT 
ATTATAGATT 
GGCA6GGTTT 
CCAGCAATAT 
OGCCAACCAG 
GCATTAAATT 
GAAATTGAGA 
ATGTAAACTO 
TTTTTAATTT 
GWMCTAMARM 
CCACTTOGGQ 
ACCCCA6AAG 
A06080CGGQ 
CACACM0CX3 



A3 8 Protein eecmencei 
Gene name: 
Unigene number: 
Probeset AcceeBion #i 
Protein Accession 
Signal sequence t 
Transmembrane domains i 
LDLa domains s 
Cellular Localization t 



ESTs 

Ha. 293616 
AW043782 
none found 
1-17 
169-191 

2B-66, 70-108, 112-149 
plasma membrane 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 



I " 21 31 41 51 

MHLLGPLCLI, LSSAABSQLL PGNNPTWECN IPGNFMCSNG RCIPQAWQCD OLPDCFDKSD 60 

EKECPKAKSK CGPTPPPCAS GIHCIIGRFR CNGFEDCPDG SDEENCTANP LLCSTARYHC 120 

KNOLCIDKSF ICDGQNNCQD NSDBESCBSS QEPGSGQVFV TSEMQLVYYP SITYAIIQSS 180 

VIFVLWALI, ALVLHHQRKR NNUITLPVHR LQHPVLLSRL WLDHPHHCM VTYNVNNQIQ 240 

WASQABQNA SEVGSPPSYS EALLDQRPAN YDLPPPPYSS OTBSUiQADL PPYHSRSGSA 300 
NSASSQAASS LXfSVBDTSBS FGQPQPQEOT AEPRD8BP8Q GTBBV 



A3 9 DMA sequence 

Gene namei 

Unigene number t 

Probeset Accession $t 

Nucleic Acid Accession #t 

Ooding sequences 1-1749 

1 11 21 

i I 1 

ATGCTGTCTG GCTTCTTGAT QAGTOCCAOT 
GGAAAGAAAC TTCCXSTOOOA 60CTTCCATC 
GACCGGGAGA GGGAOAGCOO GCOOQAGGCT 
GGGGAGGOQQ AOAAGGGGAA CQ3GG6CX3AO 
CC6GGGC06C CGOCAQCTGO GCAGGCTCCC 
OGCCTGOGTC CTGOACGTTC C06GG6GAGG 
TCOGGACQAC AGCCCOQQGG GOCRCXGAC 
ACTCATAAGQ CAGT0CX:TAA QQO&iCCQQQ 
GCTCCTGOAC CTAGGGCCfCXJ OOOTCGTCGC 
COGOGOGQAA AGCGCCGCGG GACAGTCAGT 
CTTCTOQGAG ACXX5TCCTGC OCTCTCTGGA 
TGTGGOOOGC T0GCGGCT08 TOOCTCTCCT 
TGCTGCTGGC TGCGCTGCTO G0G6GGGG0G 
TGGCTQOAOa CXJCAGGGOGT CTGOCGCATC 
GGCGA0QCX31 CXaiTCTOCTG OGGCAGCTGC 
GCQOGCXTGG ACCAGGGCGG CTOOQACAAT 
C6GGGGGACA AAOAGGGGCC GGQAOGGCTC 
QQAGACQGCG AGGOTGCXSCC CCCACCCGTG 
TCCCCX3AAAG GAAGGCAGCT CCTC3VGGQCT 
OGCGGATTCC CATCTTCTCC ACXSOGGCGGC 
ATCTAOGTGC OOTTCCTCAT TGTTGQCTCC 
CTGGTGGGAG OCTOTTGCTO CAGATGrCTC 
GCCC CAOGGG GTAA006CTT GATGOAGACC 
CGGOGGTCGT CCTCA06CCA GTCCAGCACA 
GOGGCCCGGG CX5CCCCCAAC AAGQTCACAG 
AACAAOQTGT ATOTCAACAT 6CCCACGAAT 
CAGATTGTOC CACAICAAGQ GCAQTATCTO 
CAOOACTCTO TGOCCATGAC AGCTSTGOC A 
AQGCAGATTC AGTCCCCCTT CCCTCACACC 
ACTQTATAAC OQAGAGTCAC TGGTGGGTTC 
QATTCTC6AG GTG6AAGTCC GCACATGTOQ 
CCTTCATTTO CCCCCAGACT GTATQAAAAC 
CATCCAGGGT ATCATTOATT TATGATGGAA 
TTGCTGATCO OTOTATAACa AATOCTPGAO 
AAGAATTTTA TAAACTOATA AATTAAGGAT 



ESTs 

H8.128B99 

AA9832S1 

AA9832S1 

(underlined 

31 

I 

ACCCAGCACA 
GGTGOGCACA 
GG06GGCTCC 
COQCCOQCCT 
GOGACTOCGG 
GTC0G6TTBC 
TGCATOOOS^ 
CCACOQGCTO 
CTCCTGGGOG 
GAOQAGGCCC 
GA06CGCTGT 
CATCCIOGAA 
0GA66GCCCA 
GGCTTCCAGT 
QOQTTGCGCT 
GAOCGCCAGC 
OOCAOGGCTT 
AGG6CCTG6C 
TTCCOGGGGC 
CCCTCTCCCC 
GTGTTTGTCG 
OGGCCTAAGC 
ATCCCCATGA 
OCTGCCAGTT 
ACCRACTQTT 
TTCTCTGTQC 
CATCCCCCAT 
CCTTTCATGG 
AACAGTGAAC 
CTTTACTGAA 
6TGGTATTTA 
ATCTCOGAAT 
AAOCGGCCTC 
TCOQAAGTOC 
TTTTATTATO 



sequences 
41 



correspond to start and stop codons) 
51 



QAGCACAGTA 
0CTCC0BAG6 
TGTGGGAC03 
GQATCaSCGC 
CTGGGGGCGC 
CAGTGAAACC 
GATTTOCATC 
AGGA086GGA 
TOGOGGCAGA 
GGGGGTOGOC 
CXX30Q 0CCAG 
OGCOGCTTOQ 
OOGGOCSVGTA 
GTCCOGAGOG 
ACTGCTGCTC 
AGGGOQCTGQ 
CATQTCTTAG 
AGOQGTGCTC 
TGCTGOCCOG 
TQCAGOGGCC 
CCTTTATCAT 
AGGATOCCCA 
TCCOCAGTOC 
CCAGCTCCAG 
GCTTGCOQGA 
TQAACIGTCA 
AOOrGGGGTA 
AOGGOCTGCA 
AOAAQATGTA 
GGQAGAOGAA 
T6GCA0GATT 
TAGCATTTCr 
AGCTGGAGAT 
OCTTGAGATA 
TTOTTATTAT 



CACTOCGGGA 


60 


GOGAGGCAOC 


120 


06CTGCAGCC 


180 


CCAGCAGCAO 


240 


GCAGGACXXrr 


300 


TCCAQAGGCT 


360 


A60GAGTGCA 


420 


TGGCTTAGQA 


480 


GGGGAGTGGC 


540 


GGGGCCACGA 


600 


GGTGGTGCCA 


660 


CTCCTGGAGC 


720 


CTGCCA06GC 


780 


CTTOGACGGC 


840 


CAG0GCa3AG 


900 


CX3AGCCTGGC 


960 


GGGTACCCAA 


1020 


CGCT6AA66C 


1080 


TGCCAOAOGC 


1140 


OGCCTTGCCC 


1200 


CXTGGGGTCX: 


1260 


GCAGAGCCGA 


1320 


CAOCACCTCC 


1380 


OGCCAACTCC 


1440 


AGGGACCATG 


1500 


GCAGOCCACC 


1560 


CACXX3TGCAG 


1620 


GCCTGOCTAC 


1680 


CCCAGCX3GTG 


1740 


GGCAGGGGTG 


1800 


OCTTTGGATO 


1860 


GGATATGTTT 


1920 


GACTGXGATO 


1980 


TGaTTQAOGA 


2040 


TATTTCTTTT 


2100 
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TTGTTQTTGA CTGCACAG6A TCAAAATGCC 
TTTTTTTTTT TTTTTTTTAA TCAGACAOGG 
TGGT6CGATC TCGGCTCACT GCAACTTCAQ 
AGCCTCCCAC OTCGCTGGGA TTACAOOTGC 
TGTAGAGATG GG6TTTC71CC ATGTTGGCTQ 
TCTGC ClXfTC TCAGCCTOOC AAAGT6CTG6 
TGAGCCrm TTTTTTTCTA ATGCATCCAA 
ATTCTAAAAG GAAACCTCTT TGAACTCTGT 
GGCACACCTT AATTTCATTG TAAAAAQATA 
GGGCCTATTT TGTGCTTTTT TACCTTAT6T 
TTAAAAAQA6 ACTGAAATAA ATTCTATAGT 
T6GGATGATT TTAATCTTGA A0TAGTAG6T 
TTGATTGTAT CTTAATTTTC TGGCTTTAAQ 
TATATTQAftA TCATAAACTA TCACCCGCTQ 
TGGTTATGGT TTGGOSTTTC CTTCTGTTTO 
GAOIO CaUOT AATTACT ATA CTTQTAAAT6 
AAAAATTTTC TTGTCTTAGT TATAAAAATT 
TAQCTCCTCA GCX3VTAACCT QAGACTTGGG 
GCAGATCATA AOGCTTTTTA TACTCTTGTT 
OATTGTTAAG AGAAAAGCTT TTCftAOSAAG 
ATTTCCTCTC TCTTTCAGQC CTCAACAOGC 
GAAATTCAAQ TGAATTTATT TGTOTGTTCT 
ATQTGCAAGT ACATTTCCAA CTGCTA6CAC 
GTATGGTATG GTCTTCTACA C3VTTTATGTC 
TGTTTCAT6A C7GAATAAT0 TAAAACCAGT 
TTTTTTAATA ACCAAA66CA GQGGAAAATC 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



TOTTATCTOC CTTTTACTGG GACTTTTTTT 3160 

TCITGCICro TTGCCCAGGC TGGAQTGCAG 2230 

CCTCCTQQAT TCAGGCAACA CTCCTGCCTC 3280 

CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 

GOCTQGTCTC ACTCTCCTQA CCTCAA6CAA 2400 

GATTAGA06C GIOAGCCACC GOCCCCftOCC 3460 

GGTTAAGOGG AA6A06CAAA TAACAG6ACT 3530 

6AQATCAGTC ATCAGTCTCA OTATTCC^CA 2580 

TATATATTTT GTCTATTTTT GTGCTTTTOG 2640 

AGAQATCTTA TTACAAAGTG ATTTTCTACA 2700 

TACTTAACTA ATGAftGACAT TTCAQAACTC 3760 

G6TATAGTCA TAAAACCATT CATC C C C TTC 3820 

GTGACATCTa AOAaOTAATG CATTCTTTTT 3880 

CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 3940 

GrTTTGAGAG CCCCATOTCT ATATA6TCCT 3000 

AAGATCAGTA TTTCTGCCTA GATCTQATAA 3060 

CAAAC2AAATG TGTTACAAAQ ATACTTAGTA 3130 

ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 

QATTGCCTTT CTTCTCCCSlC ACTOTTCTTG 3300 

ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

TTACTTATAT AAAAAAAGAT AACTTTAAG6 3420 

AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 

TATAQATATC TATOGATCAT CTTTCTATTC 3540 

GTTGGCAATT 6GTATCATCA ATQATACTCA 3600 

ATTTTACTTA TTAATAAATA TTTTATGATG 3660 



A40 Protein sequence 
Gene nanet 
unigene nundber: 
Protein Accession U : 
Signal sequence: 
Transuieinbrane domains i 
Cellular localizationt 



BST8 

Ks. 138899 

none found 

1-11 
403-424 

not determined 



21 31 
I I 



41 

1 


51 




DRERSSRPBA 


1 

AGLLHDRAAA 


60 


RLRPGRSRGR 


VRLPVKPPBA 


120 


APGFRARRRR 


LLGVAAEGSG 


180 


CGALAARPSP 


HPGTPLRSCS 


240 


GDATICOGSC 


ALRYCG8SAB 


300 


GDGBGAPPPV 


RAHQRCSPEG 


360 


lYVPFLIVGS 


VFVAFIILGS 


420 


R0SS8RQSST 


AASS8S8ANS 


480 


QIVPHQGQYL 


BPPYVGyrVQ 


540 


TV 







A41 DMA SEQUBWCBt 

Gene name: ESTb, Weakly eimilar to JE0350 Anterior gradient -2 [H. sapiens] 

unigene number t Hs. 100686 

Probeset Accession #t AA487468 

nucleic Acid Accession fis AA487468 

Coding sequence: 55-555 (underlined sequences correspond to start and stop codons) 



1 11 31 31 41 51 

1 I I I I I 

GQGCACCAAG AGCA CTGQCC AAGTCAGCTT CTTCTGAGAG AGTCTCTAGA AGACATGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCQTCACAG TTTCTTCCAA CCTTGCCATT 130 

GCAATAAAAA AGGAAAAQAG GCCTCCTCAG ACACTCTCAA GAGGATGGGO AQATGACATC 180 

ACTTGGGTAC AAACTTATGA AGAAQOTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGQTTATTC ATCACCTGGA GGATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAfl AAATACAAGA AATGOCTCAG AATAAGTTCA TCATGCTAAA OCTTATGCAT 360 

QAAACCACTG ATAAGAATTT ATCACCTGAT GGGCAATATG TGCCTAGAAT CATOTTTCTA 420 

GAOCCTTCTT TAACAGTTAO AGCTGACATA GCTGGAAGAT ACTCTAACAG ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAGTCAOAGC T ATAA6 AGAT GATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAA6AAAACC TCTGGCACAT TGAGAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACFATTTA GTTTTTTTAA TGTGTTT6CA ATAGTCTTAT TAAAATAAAT GTOYITA AA 730 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



A43 Protein sequence; 
Gene name: 
unigene number: 
Probeset Accession ft: 
Protein Accession tf: 
Signal sequence i 
Transmembrane domains t 
cellular Localization 1 



BST8« Weakly siodlar to JE03S0 Anterior gradient-3 (H. sapiens] 

Hs. 100686 

AA48746B 

none found 

1-33 

none found 
secreted 
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} 11 21 31 41 51 

I I I I I I 

MMLHSALGI#C LLLVTVSSNL AIAIKKBKRP PQTLSRGWGD DITWVQTYBE GLFYAQKSKK 60 
PLMVIHBLED OQYSQAUCKV FAQNEBIQBM AQNKPIMX2»i MHBTTDKNliS PDGQYVFRIM 120 
FVDPSLTVRA DIAGRYSNRL YTYBPRDIiPIi I*IENMXRALR LIQSBL 



A43 DMA SEQUBWCE 

Gene namei ATPase, Ca++ transporting, type 2C, menter I 

unigene nunibert Hs. 106778 

Probeset Acceasion ft: N51919 
Nucleic Acid Accession AF189723 

coding sequence i 1-2712 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 SI 

I I i I I I 

ATGATTCCTG TATTCACATC AAAAAAA6CA AOTGAATTAC CAGTCAGTGA AQTTGCAAGC 60 

ATTC ICCaAG CTQATCTTCA GAATGGTCFA AACAAAT6TG AAGTTAGTCA TAOaOQAGCC 120 

TTTCA ,TGGC T G6AATGA0TT TQATATTAGT GAAGATSAGC CACTGTGGAA GAA6TATATT 180 

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTOGCTT CTQCAOTCAT CAGTGTTTTA 240 

A TGCATCAG T TTOATGATOC OGTCAGTATC ACTQTGGCAA TACTTATOST TGTTACAOTT 300 

GCXrrTTGTTC AGGAATATOG TTCAGAAAAA TCTCTTGAAQ AATT6ASTAA ACTTGTGCCA 360 

GCAGAATGCC ATTGTOTGGS TQAAGGAAAA TTGGA6CATA CACTTGCCCG AGACTTGGTT 420 

CCAGOTQATA CAGTTTGCCT TTCTGTTGGG OATAGAGTTC CTGCTGACTT AOGCTTQTTT 480 

aAGGCTQTGQ ATCTTTCCAT TGATOAGTCC AGCTTGACAG GTQAGACAAC GCCTTOTTCT 540 

AA GGTGA CAG CTCCTCAGCC AGCTGCAACT AATGGAGATC TTGCATOSAG AAGTAACATT 600 

OCCTTTATGG GAACACTGGT CAGATGTGGC AAAGC3\AAGQ GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA GGTTTTTAAA ATGATGCAAO CAGAAGA06C ACCAAAAACC 720 

CCTCTGCAGA AGAGCATGGA CCTCTTAGGA AAACAACTTT CCTTTTACTC CTTTGOTATA 780 

ATAGGAATCA TCAT6TTGGT TG6CTG0TTA CTGGQAAAAG ATATCCTGGA AATGTTTACT 840 

ATTAGTGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTOT GGTCAGAGTG 900 

AOGCTAG CTC TT06TGTTAT QAGAAT6GTG AAGAAAAG6G CCATTGTGAA AAA6CTGCCT 960 

ArrOTTGAAA CTCTGGGCTO CTOTAATQTG ATTTGTTCAO ATAAAACTGO AACACT6A0Q 1020 

AAQ AATG AAA TGACIGTTAC TCACATATTT ACTTCAQATG 6TCTGCATGC TQAGGTTACT 1080 

GGAGTTGGCT ATAATCSVATT TGGGGAAOTG ATTGTTOATG GT6ATGTTGT TCATGGATTC 1140 

TATAACCCAG CTGTTAGCAG AATTGTTGAG GCGGGCTOTQ TQTGCAATQA TGCTGTAATT 1200 

AGAAACAATA CTCTAATGGQ GAAGCCAACA GAAGGOGCXTT TAATTGCTCT TGCAATGAAO 1260 

ATGGGTCTTG ATGGACTTCA ACAAGACTAC ATCAGAAAAO CTGAATACCC TTTTA6CTCT 1320 

G AOCAAA AGT GGAT6GCTGT TAAGTGTQTA CACOQAACAC AOCAGQACAG ACCAQAGATT 1380 

TGTTTTATGA AAGGTGCTTA COAACAAGTA ATTAAGTACT GTACTACATA CCAOAGCAAA 1440 

GGQCAGACCT TGACACTTAC TCAGCAGCAG AQAGATGTGT ACCAACAAGA OAAGGCAOGC 1500 

ATGGGCTCAO OQGGACTCAG AGTTCTTGCT TTGGCTTCTG GTCCTGAACT GGGACAGCTG 1560 

ACATTTCTTO GGTTGGTG6G AATCATTQAT CCACCTAGAA CTGGIOTaAA AGAAGCTGTT 1620 

ACAACACrCA TTGCCTCAGG AGTATCAATA AAAATGATTA CTGGAGATTC ACAGQAGACT 1680 

GOVGTTGCAA TC6CCAGTCG TCTGGQATTG TATTCCAAAA CTTCCCAGTC AGTCTCAGGA 1740 

GAAGAAATAG ATGCAATGGA TGTTCAGCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 1800 

TTTTACAGAG CTAOCCCAAO GCACAAGATG AAAATTATTA AGTOGCTACA GAAGAA06GT 1860 

TCAgr mTAG CCATQACAOG AGATGGAOTA AATGATGCAO TTQGTCTGAA GGCIX3CAQAC 1920 

ATTGGAGTT6 OQATQGGCCA GACTGGTACA GATGTTTGCA AA6AG0CAGC AGACATGATC 1980 

CTAGTGGATG ATGATTTTCA AACCATAATG TCTGCAATCG AAGAGOOTAA AGGGATTTAT 2040 

AATAACATTA AAAATTTCGT TAQATTCCAG CTGAGCACGA GTATAGCAGC ATTAACTTTA 2100 

ATCTCATTGO CTACATTAAT QAACTTTCCT AATOCTCTCA ATGOCATGCA QATTTTGTGG 2160 

ATCAATATTA TTATQQATGG ACCCOCAGCT CAGAGCCTTG GAGTAGAACC AOTGGATAAA 2220 

6ATGTCATTC GTAAAGCTOC T0GCAACT60 AAAOAGAOCA TTTTGACTAA AAACTTQATA 2280 

CTTAAAATAC TTGTTTCATC AATAATCATT GTTTGTGGGA CnTGTTTOT CTTCTGGOQT 2340 

QAGCTACGAG ACAAT QTGAT TACACCTOGA GACACAACAA TGACCTTCAC ATGCTTTQTG 2400 

TTTTTTGACA TGTTCAATGC ACTAAGTTCC AGATCCCAOA CGAAOTCTOT 0TTT6AQATT 2460 

GGACTCTGCA G TAATA QAAT OTTTTQCTAT GCAOTTCTTG aATOGA-rcAT GGGACAATTA 2520 

CTAOTTATTT AC TTTC CTOC GCTTCAOAAO GTTTTTCAGA CTGAaAOGCT AAQCATACTQ 2580 

GATCTGTTGT TTCTTTTGOG TCTCAOCTCA TCACTGTGCA TA6TG6CAGA AATTATAAAO 2640 

AAGOTTOAAA GGA6CAGG6A AAAGATCCAG AA6CAT6TTA OTTGGACATC ATCATCTTTT 2700 
CTTGAAGTAT OA 



A44 Protein sequence i 
Gene name: 
Ukiigene number t 
Probeset Accession #: 
Protein Accession #t 
Signal sequence i 
Transmenibrane domainsi 
Pfam domains t 
{744-8893 

Cellular Localization: 



ATPase* Ca4-f transporting, type 2C« 
Hs. 106776 
N51919 
AAP27813 
none found 

60-82. 86-108, 252-274, 282-304, 757-779, 827-849, 856-878 

cation ATPase H [6-83) , B1-E2 ATPase [89-324] , Hydrolase [328-649] , Cation ATPase C 
not determined 



1 11 21 31 41 51 

I 1 I I I i 

MIPVLTSXXA SBLPV8BVAS ILQADLOKOL NKCBVSKRRA FHGWNEFDIS BDBPLHKKYI 60 

SQFKNPLXML UASAVISVI. MHQFDDAVSX TVAILIWTV AFVQBYRSEK 6LBBLSKLVP 120 

PBCHCVREOR LBRTIAHOLV PGDTVCLSVO DRVPAOIAtF EAVDbSIOBS SLTGBTTPCS 180 

KVTAPQPAAT MOTLASR6NZ AFMGTLVROO KAKGWIGTG EN8BFGBVFX KHQABBAPKT 240 

PLQRSMDLLO KQLSFYSFOZ IGZIMbVOWL LQXDILSIFr ZSVSIAVAAZ PBGLP2VVTV 300 
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nALGVMRMV KKRAIVKKLP IVBTLGCCNV ICSDICTOTLT KNEMTVTOIP TSDGLHAEVT 360 

GVOYNQPSBV IVDCSDWHGF YNPAVSRIVE AGCVCNDAVI RNNTU<QKPT EGALIALAMK 420 

MCTJGLQQDY IRKAEYPFSS EQKWMAVKCV HRTQQDRPBI CFMKGAYBQV IKYCTTYQSK 480 

WLTLTQQO RDVYQQBKAR MQSAGLRVIA LASaPBWSQL TPLGI*VGIID PPRTCVKEAV 540 

TTLIASGVSI KMITGDSQST AVAIA8RLGL YSKTSQSVSG BBIDAMDVQQ LSQIVPKVAV 600 

FTJASPRHKM KIIKSLQKNO SWAMTGDGV NDAVAI^KAAD lOVAKGOTCT DVCKEAADMI 660 

LVDDDFQTIM SAIEEC3KQIY NNIKNFVRFQ LSTSIAALTL ISLATLMNPP NPLNAMQILW 720 

INIIMDGPPA QSLGVEPVDK DVIRKPPRNH KDSILTKNLI LKILVSSIII VCOTLPVPWR 780 

ELRDMVITPR DTTMTPTCFV PPDMPWALSS RStyTKSVFEX OLCSNRMFCY AVLGSIMGQL 840 

LVIYPPPWIK VP(JTBSLSIL DLLPLM5I.TS SVCIVABUK KVBRSRBKIQ KHVSSTSSSF 900 



A45 PWA seouanca 



Gene 

ITtilgene number: 
Probeaet Accession Ut 
IVucleic Acid Accession #t 
coding Sequence i 



BSTs 

Hs. 157601 

N07459 

AC005383 

328-2751 (underlined sequences correspond to start and atop codons) 



1 11 21 

I I I 

OACA GTGTTC GCGGCTOCAC CGCTCGGAGQ 
T TTTA TTTGC AGACCTGGGC CGATGCCGCT 
CCXaOCGQTA GTTCCTCCGA CXTTCAGCCGG 
ACAAACAGOT GTCCCACGTQ GCA6CCGCGC 
CCCCCTGGCC CX3AGC0GCGC CCGGGTCTGT 
TOOCCGCTCT CCTTCCGTTA TATCAACATG 
GTTTTCCTOT TTTCCAGAGT GCCCCCATCT 
GAAACCRTOQ GGAAOATTTC AGCTGCCAGC 
AT CATQT TTC TOTTAGATGG GTCTAACAGC 
CACTTTGCCA TCACAGTCT6 TGAOGGTGTG 
GCATTCCAGT TCAGTTCCAC TCCTCATCTO 
CAGG AAGTQA AGGCAAGAAT CAAGAGGATG 
CTTGCTCTGA AATACCTTCT GCACAGAGGG 
CAGATCCTCa TCATCGTCAC TGATGGOAAO 
CA6CTGAAGG AAAGGGGTQT CACTGTGTTT 
GAGCTGCATQ CACTGGCCaG OQAOCCTAGA 
GAGGATQCCA CCAACGGCCT CTTCAGCACC 
ACGCCAGACT GCAGGGTCQA OGCTCACCCC 
GAGTTCX3CTG QCAATGCCCC ATGCTGGAGA 
GCACACTGTC CCTTCTACAO CTGGAAGAGA 
AGGACCACCT GCCCAGQCCC CTGTGACTCG 
CCAGAAGGAC TGGAOSGCTA CCAGTGCCTC 
TGTGCXXTGA AGCTGAGCCT GGAATGCAGG 
GCGGGCACCA CTCTGGACGG CTTCCTGCGO 
GCCXntSCTGA GCQAQOACTC TOQQGCCCGA 
CTGGTGGCX5G TGCCTOTOGG GGAOTACCAO 
GGCATTCCCT TCOGTGGTOG CCCCACCCTO 
CX3TGGCTTCG GGAOCGCCAC CAGOACAOGC 
CTCACTGAGT CACACTCXX3A GGAT6AG6TT 
GAGCTGCTCC TQ C TG GG TGT A60CA0TGAO 
GGCAGCCCAA AOCATOTGAT GGTCTACT06 
GAOCTGCAOG GGAAGCTGTG CAGCOQGCAG 
CTCGTCTTCA TOTTGGACAC CTCTGCXTCA 
AQCTTTGTGA GAAGCTGTGC CCTCCAOTTT 
CTGGTGGTGT ATGGCAGGCA GGTGCSUiAiCT 
GCTGC3GATGC TC0GGGCC31T TAGCCAGGCC 
ACOGCCCTGC TGCACATCTA TGACAAAOTG 
QTCCCCAAAQ CTGTGGTGGT GCTCACAGGC 
GCCCAOAAGC TQAGGAACAA TGGCATCTCT 
AGTGAGGOTC TGOGGAGGCT TGCAOGTCCC 
GCCGACCTOC GQTACCACCA GGAOGTGCTC 
CCAGTCAACX: TCTGCAAACC CAGCCOOTGC 
GGGAGCTACC GCTGCAAGTG TCGQGATGGC 
TGGAGCTCTT GCTCTGTATG TGTGAGCGAG 
ATGGCTCCGO TGCAGGAOOG GAGCAGCCGT 
GGCACTGAAA TGGTGCCTAC CTTCTGQAAT 
TTCCCJGCaST GGCX31GQACC ACTATTCTCA 
ATGCTGCTTA GAGACAAGAA AGCAGCTQAT 
TTGATGTGTA AGTAAATACC CACTTTCTGT 
CTGCCACC3T TCOCTTGAGQ ATAAACAAGG 
CGTTCCTTTG CACACAATCA ATGCTOOCCA 
AGGCCTTTAC TAQAGCATCC TTTGQACGGC 
GCAGCTTTTC CACTTCCCCA QAGACATTCT 
CTTQAGGQAC GTTTGTGACT TCTTGGCGAC 
GGPTCTCAGAC TGAATGTQAC CAATTAACX:a 
TOTGCATCOO COCAGGTCTG GAG6GCCAC98 
AOCTTGAAGG TCTTC 

A46 Protein sequence 
Gene 



31 
I 

CTGGGTGACC 
TTAAAAAACG 
GTC6GGTCGT 
CCC6GGCGCC 
GAGTAGAGCC 
C C CCCTTTCC 
CTCCCTCTCC 
AAAATGATGT 
GTCX»3GAAAG 
GACATCAQCC 
G AATT CCCCT 
OTTTTCAAAG 
TTGCCTGGAG 
TCCCAGGGGG 
GCTGTGGG6G 
GGGCAGGAGG 
CTCAGCAGCT 
TGTGAGCACA 
GGAT06CGGC 
GTGTTOCTAA 
CAGCOCTGCC 
TGCCCOCTGG 
GTCGACCTCC 
QCCAAA6TCT 
QT GG Q TO TUG 
GATGTGCCTG 
ACX3GGCAGTG 
CAGGACCX5GC 
GC6GGCGCA6 
GCaSTGOGGQ 
QATCCTCAGG 
CGGCCAOGGT 
QTAGQGCCCG 
6AGGT6AACC 
GCCTTOGOGC 
CCCTACCTAO 
ATGACCGTCC 
GGGAGAGGOG 
GTCTTGGTCG 
OGGQATTOCC 
ATTGAGT6GC 
ATQAATGA60 
TGGGA6GGCC 
GQATtSGATTC 
ACCCCTCCCa 
GTCTGTGCCC 
CTSAGGGAGG 
GTCACCCACA 
AGCTX3CT6TG 
GGTOCIGAAQ 
GAATOTTGTT 
GAAGGCCACG 
GGATGCATTT 
TGCCTTTTGT 
GCTTGOTTGA 
TAAAATOOTT 



41 
I 

0G0GTA6AAG 
CQAGGGGCTC 
GCCGCCCTCT 
CXrrcCTGTGA 
GOOOG GGCAC 
TGTTGCTGGA 
AGGAAGTCCA 
GGTGCTCGGC 
GGAOCTTTGA 
GOQAGAOQGT 
TGGATTCATT 
GAGGGOQCAC 
GC3W3AAATGC 
ATGTGGCACT 
TCAOGTTTGC 
TGCTGTTGGC 
CGGCCATCTG 
GGAC6CTGGA 
GGACCXTTTGC 
OCCACCCTGC 
AGAATQGAGG 
CCTTTGGAGQ 
TCTTCCTGCT 
T0GTGAAGCX3 
CCACATACAG 
ACGTGGTCIO 
CCTTGCGGCA 
CACX3TAGAGT 
OGOGTCAGGC 
CA6A6CTGGA 
ATCTGTTCAA 
GCXX3GACACA 
A6AATTTT0C 
CTGACGTGAC 
TGGACACCAA 
GTGGGGTGGG 
A6AGGGGTGC 
CAGAGGATOC 
TGQGOGTGGG 
TGATCCAOGT 
TGT6TGQAGA 
GCAGCTGOGT 
CCCACTGC3GA 
TTGAOAOQCX: 
GCAACTACAG 
CflGOTCCTTA 
AGGATGTCCC 
AAOGATGTTQ 
CCTTGTTGAG 
ACTTAAATTT 
GAC3U31GTAA 
GCCTTTCAAG 
QCATTGAGTC 
GTOTGOAAGA 
TGATGGGGGA 
CTGAOTOOTG 



51 
I 

TGAAGTACTT 
TAT6CACCTC 
CCCAGGAGAO 
TCCCGTAGCO 
CX3AG0GCTGG 
GGC0GTCT6T 
TGTAAGCAAA 
TGCAGTGGAC 
AA6GTCCAA0 
GAQA6TGGGA 
TTCAACCCAA 
GGAQACGGAA 
TTCTQTGCCC 
GCCATCCAAG 
CAGGTGGGAG 
T6AGCAGGTG 
CTCCAGOGCC 
GATG0TCXX3Q 
GQTGCTGGCT 
CACCTQCTAC 
CACATGTGTT 
GGAG6CTAAC 
GGACAGCTCT 
GTTTGTGOGG 
CAGGGAGCTG 
GAGCCTOGAT 
GGOG GCAGAG 
GGTGGTTTTO 
AAGGGCGCGA 
GGAGATCACA 
CCAAATCCCT 
AGCXXTGGAC 
TCAGATGCAO 
ACAGGTCGGC 
ACCCACCCGG 
CTCAGCOGGC 
CCGGCCTGGT 
AGCOGTTCXrr 
GCCTGTCCTA 
GGCAGCTTAC 
AGCCAAQCAG 
CCTGCAQAAT 
GAACCOTGAG 
CCTGAGGCAC 
AQAAGOCCTQ 
GAATGTCTGC 
AACTGCAGOC 
TTGAAAAGTT 
QCTATGTCAT 
AGOGGCCTGA 
TGCOCAGCAG 
ATGGAAA6CA 
tX3AAAGG6GO 
GACTT6GAAA 
GGGGCTGAGT 
AGCAGTGTCC 



60 ' 
120 

lao 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
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tAnigene nundsen 
Protein Accession #: 
Signal sequence: 
TransmeoibranQ domains t 
VOW domains t 49*239; 341 
B6P domains I 398-333; 71! 
Cellular l«calizationt 



Hs. 1576 01 
none found 

1-17 

none found 
-516; 529-706 
i-74e 

secreted 



1 11 21 31 41 51 

I i I I i I 

MPPFLLLBAV CVFLPSRVPP SLPLQBVHVS KBTIGKISAA SKMMWCSAAV DIMFLLD6SN 60 

SVGKGSPERS KHPAITVCDO LDISPBRVRV GAPQPSSTPH LEPPU>SPST QQBVKARIKR 120 

MVFKOGRTBT ELAUnritLBR OLPGORNASV PQILIIVTOG KSQGDVALPS KQLKBRGVTV 180 

PAVGVRPPRW BEUIALASEP RGQHVLLAEQ VEDATNGLPS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REPAGNAPCW RGSRRTIiAVL AAHCPPYSWK HVPLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAPGGBA NCALKLSLEC RVDLIiPUiDS SAOTTliDGFL 360 

RAKVPVKRPV RAVLSEDSRA RVOVATYSRB LLVAVPVGBY QDVPDLVWSL DGIPPRGGPT 420 

LTOSALRQAA BRG POSA TRT GQDRPRRVW LLTESHSEDB VAOPARHARA RBLLLLGVGS 480 

EAVRABLBBI TOSPKHVMVY SDPQDLPNQI PBLQGKLC8R QRPGCRTQAL DLVPMLDTSA 540 

SVGPENPAQM QSFVRSCALQ PEVNPDVTQV GLWYGSQVQ TAFGU9TKPT RAAMLRAZGQ 600 

APYLGGVGSA GTALLHIYDK VKTVQRGARP GVPKAVWLT GGRGAEQAAV PAQKIiRMiaOX 660 

SVLWGVQPV LSBGLRRLAG PRDSLIHVAA YADLRYKQDV LIEWLOGEMC QFVMLCKPSP 720 

CKNEQSCVI4 NGSYRCKCRD OHEQPHCEKR BH8SC8VCV8 QGWZLBTPIiR HMAPVQEOSS 780 
RTPPSNYREQ L6TB1VPTPW NVCAPGP 



COIOW 



A47 DNA SEQUENCE I 
Gene name: 
unigene number > 
Probeset Accession ft: 
Nucleic Acid Accession #: 
Coding sequence: 



BSTb, Weakly similar to JE0350 Anterior gradient -3 (H. sapiens) 

Hs. 100666 

AA4B7468 

AA48746B 

55-555 (underlined sequences correspond to start and stop codons) 



GG6CA0CAA6 
CTACACTCAG 
GCAATAAAAA 
AGTTGGGTAC 
ATOGTTATTC 
CAAAATGAAO 
GAAACCACTG 
GACCCTTCTT 
TATGA6CCTC 
CaOTCAaAGC 
GAAGAAAACC 
TTACTATTTA 
TCTGAAAAAA 



11 
I 

AGCACTGGCC 
CTTTGGQTCT 
AGGAAAAGAG 
AAACTTATQA 
ATCACCTGGA 
AAATACAAGA 
ATAAGAATTT 
TAACAOTTAG 
GGGATTTACC 
TATAAGAGAT 
TCTGGCACAT 
GTTTTTTTAA 
AAAAAAAAAA 



21 

1 

AAOTCAGCTT 
CTGOCTCTTA 
6CCTCCTCA6 
AGAAGOTCrC 
GGATTGTCAA 
AATGGCTCAG 
ATCACCTGAT 
AGCTGACATA 
CCTATTQATA 
GATAGAAAAA 
TQACAAATAC 
TGTGTTTGCA 
AAAAAAAAA 



31 
I 

CTTCTGAQAG 
CrOOTCACAO 
ACACTCTCAA 
TTTTATGCTC 
TACTCTCAAG 
AATAAGTTCA 
OGGCAATATG 
GCTGGAAGAT 
GAAAACATQA 
AGCCTTCACT 
TAAATGTGCA 
ATAGTCTTAT 



41 

I 

AGTCTCTAQA 
TTTCTTCCAA 
GASQATGOQO 
AAAAAAGTAA 
CACTAAAOAA 
TCATGCTAAA 
TGCCTAGAAT 
ACTCTAACAO 
A6AAAGCATT 
TCAAAGAAGT 
AOTATATAGA 
TAAAATAAAT 



51 
I 

AQACATGATG 
CCTTGCCATT 
A6ATQACATC 
GAA6CCATTA 
AGTATTTGCC 
CCTTATGCAT 
CATGTTTGTA 
ATT6TACACA 
AAGACTTATT 
CAAATTTCAT 
TTTT6TAATA 
GTTTTTTAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



A48 Protein sequence: 
Gene name: 
Unigene number: 
Probeset Accession ft: 
Protein Accession fit 
Signal sequence: 
Transmembrane domains t 
Cellular Localization: 



E8Ts« HeaUy similar to JBO3S0 Anterior gradient-2 [H. sapiens] 

Hs. 100686 

AA487468 

none Sound 

1-23 

none found 
secreted 



} 11 21 31 41 51 

I I I I I i 

MMIiHSAI/3LC LLLVTVSSNL AIAXKXEXRP FQTLSROHGD DITHVOTYEB GXiPYAQKSKK 60 
PLMVIHHLEO OQYSQALKKV PAQMEBIQEM AQNXFIMUOi tUHBTTDRiniS PDGQYVPRIM 120 
FVDPSLTVRA DIAGRYSNRL yTYBFRDLPL LZEZIMKKAZiR LIQSBL 



A49 DNA SEQUENCE 

Gene name: o protein^- coupled receptor 56 
unigene number: Bs.6537 
Probeset Accession ft: AA4 78599 

nucleic Acid Accession ft: HH_005682 

Coding sequence: 163-2244 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 I ! I t 

CGQCAGCAGG GTCTCGCTCT GTCACACAG6 CTGGAQTOCA QT SO TGTGAT CTTGGCTCAT 60 

CGTAAC CTCC ACCTCCOGGG TTCAAGTGAT TCTCATOCCT CAOCCTCCOO AOTAGCT6GG 120 

ATTACAGGTO GTGACTTCCA AOAOTOACTC 09T0GQAGQA AAATCACTCC CCAOTO OC I Q 180 
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CTGCAGAOGH CRC lW TOC r GCTQAGTCT6 C l tT I XXT GO TGCAAGGTGC CCAOSGCAGG 240 

CGCCACAGGG AAGACTTT06 CTTCTOCAGC CAGC6QAACC AGAOICACAO GAGCA6CCTC 300 

CACTACAAAC CCACACCAC5A CCTGCGCATC TCCAT0QAC3A ACTCCGAAOA GGCCCTCACA 360 

GTCCATGCCC CTTTCCCTGC AGCCCACOCT GCTTCCCGAT CCTTCCCTQA CXXXAGGGGC 420 

CrCTACCACT TCTGCCTCTA CTGGAAC08A CATGCTGGGA GATTACATCT TCTCTATGOC 460 

AAGOQTGACT TCTTQCTOAO tGACAAAGGC TCTAGCCTGC TCTGCTTCCA GCACCAGGAG 540 

GAGAGCCTOG CTCA GGGC CC O CUU C IUTTA 6CCACTTCT0 TCACCTCCTG OTGGAGCXXrr 600 

CAQAACATCA GCCTGCCCAG T6C0GCCA0C TTCACCTTCT CCTTCCACAG TCCTCCCCAC 660 

ACXSGCCOCTC ACAATGCCTC GOTGGACATO TGOCAGCTCA AAAGGGACCT CCAGCTGCTC 720 

AGCCAGTTCC TGAAGCATCC CCA6AAGGCC TCAAGOAOGC CCTCGGCTGC CCCOGCCAGC 760 

CAGCAO TTBC A6AGCCT0GA GTCGAAACnG ACC TC TOTOA GATTCAT6GG GGACATGGTG 840 

TCCTTOQAGG AGGACCGGAT CAACX30CA0Q GTATQGAAGC TCCAGCCCAC AOC O O G OCTC 900 

CAGGACCTGC ACATCCACTC C0GGCAGGA6 GAGGAGCAGA QOQAGATCAT GGAGTACTCG 960 

CTGCTGCTGC CTCX3AACACT CTTCCAGAGG ACQAAAGQCC OQAGOGGGGA GGCTGAGAAO 1020 

AGACTCCTCC TGGTGGACTT CAGCAGCCAA GCCCTGTTCC AGGACAA6AA TTGCAGCCAA 1080 

GTGCTG66TG AOAAGG TCTT GGGaATTGTG GTACAGAACA GCAAAGTAGC CAACCTCAOG 1140 

0A6CCGGTGG TGCTCACTTT CCA6CACCAG CTACAGGG6A AGAAT6T6AC TCTOCAATGT 1200 

GTOTTCTGGO TTGAA6ACCC CACATTGAGC AGCCOGGGGC ATTGGAGCAG TGCTGGGTGT 1260 

GAGACCGTCA GGAGAGAAAC CCAAACATCX TQCTTCTGCA ACX31CTTQAC CTACTTTGCA 1320 

GTGCTGATGO TCTCCTCQGT GGAGGTGGAC GCX3QTGCACA AGCACTACCT GAGCCTCCTC 1380 

TCCTAOGTGO OC TOTCTC QT CTCTGCCCTG QOCIGCCTTa TOVOCATTGC CGCCTACCTC 1440 

TGCTCCAGGQ TGCCOCTGCC GTOCAGGAGO AAACCTGGGG ACTACAOCAT CAAGGT6CAC 1500 

ATGAACCTGC T0CTQQC06T CTTCCTQCTG GACACX3AGCT TCCTGCTOWS CGAGCCX5QTG 1560 

GCCCTGACAG GCTCTQAG6C TGGCTGCCGA GCXAGTGCCA TCTTCCTGCA CTTCTCCCTQ 1620 

CTCACCTGCC TTTCCTGGAT GGGCCTCGAQ GGGTACAACC TCTACOQACT CGTGGTGGAG 1680 

GTCTTTGGCA CCTATGTCCC T6GCTACCTA CTCAAGCT6A GCOOCATOaO CTGGGGCTTC 1740 

CCCATCTTTC TGGTQA06CT GOTGGCCCTG GTGQATGTGO ACSUlCTATGG CCCCATCATC 1800 

TTGGCTGTGC ATAGGACTCC AGAGGGCGTC ATCTACCCTT CCATGTGCTG GATCCQGQAC 1860 

TCCCTGGTCA GCTACATCAC CAACCTGGGC CTCTTCAGCC TGGTGTTTCT GTTCAACATG 1920 

GCCATGCTAG CCACCATGGT GOTOCAQATC CTGOSGCTGC GCOCCCACAC CCAAAAGTGG 1980 

TCACAT6TGC TGACACTGCT GGGCCTCAGC CTGGTOCTTG GOClGCCXrrG GGCCTTGATC 2040 

TTCTTCTCCT TTGCTTCTGG CACCTTCCAO CTTGTOGTCC TCTACCTTTT CAGCATCATC 2100 

ACCTCCTTCC AAGGCTTCCT CATCTTCATC TGGTACTGGT CXATGOGGCT GCAG6CX:CX3Q 2160 

GGTQQCCXXrr CCCCTCTQAA 6AGCAACTCA GACTGCGCCA GGCTCCCCAT CAGCTCX3GGC 2220 

AGCACCTCGT CCA6C0GCAT CTAGGCCTCC AGCCCACCTG CCCATOTOAT GAAGCAQAGA 2280 

TGOOOCCTGQ TOGCACAGTG CCFGTG6CCC CGQAGOCAGG CCCAGCCCCA GGCCAGTCAG 2340 

gOCRGACTT TG6AAAGCCC AAOGACCATO GA0AGATGG6 CGGTTGCCAT GQTGGACX3GA 2400 

CTCCCGG6GC TGGGGCTTTT GAATTGGCCT TGGGGACTAC TCX3GCTCTCA CTCAGCTCCC 2460 

AOGGGACrCA GAAGTGCX3CC GOCATGCTGC CTAGGGTACT GTCCCCACAT CTGTCCCAAC 2520 

CCAGCTGGA6 GCCTGGTCTC TCCTTACAAC CCCTGGGGCC A3CCTCATTG CTGGGGGCCA 2580 

GGCGrrGGAT CTTGAGGGTC TGGCACATCC TTAATGCTGT GCCCCTGCCT GGOACAGAAA 2640 

TQTQGCTOCA GTTGCTCTGT CTCTOQTGGT CiACCCTGAOG GCACICTGCA TCCTCTOTCA 2700 

TTTTAACCTC AGGTGGCACC CAGGGOGAAT GGGGGCCAOS GC3VGACGTTC AOGGGCAOAG 2760 

CCCTGG0G6A GGAGAGGCCC TTTGCCAGGA GCACAOGAGC AGCTCiGCCTA GCTCTGAGCC 2820 
G 



A50 Protein sequence 
Gene namet 
TJiaigene number t 
Protein Accessioa ii 
Signal sequence t 
GP8 domain: 
Pfam domain t 
Transmembrane domains: 
Cellular liocalisation: 



Q protein-coupled receptor 56 

Hs . 6527 

im_O0S682.1 

1-26 

342-394 

7tra_2 1400-665) 

410-432, 446-468, 482-504, 517-539, 
plasma mentbrane 



575-597, 608-630, 638-659 



1 11 21 31 41 51 

I I t I I I 

MTPQSLLQTT IiFLIiSIiIiFLV QGAHOtGHRE DFRFCSQRKQ THRSSLHYKP TPOLRISIEN 60 

SBBALTVHAP FPAAHPASRS FPOPROIiYHF CLYWNRHAGR LBLLYGKRDF UiSDKASSLL 120 

CPQHQBBSLA QGPPLI.ATSV TSWWSPQNIS LPSAASPTPS FHSPPHTAAH NASVDMCELK 180 

R0WLLSQPL KHPQKASRRP SAAPASQQLQ SLBSKLTSVR PKGDMVSFEE DRINATVWKL 240 

QPTAGLQDIJI ZH8RQBEBQ8 BIMEYSVUiP RTLFQRTKOR SGEAEKRLLL VDFSSQALFQ 300 

DKWSS QVI^ B KVLGIWQNT KVANLTBFW LTFQRQLQPK NVTLQCVFWV EDPTLSSPGH 360 

WSSAGCBTVR RBTQTSCPOI HLTYPAVU1V SSVBVDAVHK HYLSLLSYVG CWSAIiACLV 420 

TIAAYWZSRV PLPCRRKPRD YTIKVHMNLL IAVPLU>TSP LLSEPVALTG SEAGC31ASAI 480 

PLHFSUiTCL SWMOLEGYNL YRLWBVPOT YVPGYLLKLS AMGWGPPIPL VTLVALVDVD 540 

MYGPXXLAVH RTPE6VZYP8 MOTIRDSLVS YXTNLGLFSL VFLFMMAMLA TMWQILRLR 600 

PUTQKSrSKVL TUiOLSLVLQ LPKALIFFSP A8GTFQLWXi yi.F8IIT8FQ GFIiZFIWyWS 660 
MRLQARGGP8 PLRSNSDCAR LPZ8SGSTSS SRI 



A51 DMA SEQUENCB 

Gene namet Hypothetical protein FLJ20063 

Unigene number i Bb.5940 

Probeset Accession #> AA053660 

nucleic Acid Accession ftt AA053660 

coding sequence t 218-1360 (underlined sequences corraspond to start and stop codons) 



1 11 21 31 41 51 

t I 1 I I I 

CCCCCATGAC TTTGTACAaC TACTTCACTG CTTCCCXX3CA ATTAGTACAC ATAOTTCCTC 60 

853 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 



CACAATTCCT 
AOCTACTGCT 
AATCAC06CT 
TAACAATGAA 
OGCTTTATTQ 
TCCCTQTGCA 
AGAAGGGTAT 
TTCAGTGACA 
CTTGCATAOT 
GACT6TAATT 
CAAGTTTGTT 

gactqtgact 

TOATTTGACC 
TGGTTXA6CA 
CGTT6CTT0C 
AAAGAAGAGT 
TGGOAACTGC 
GCTGATCCTC 
ATTGATTOTC 
TGA09AAGAC 
AGGGAGOGTC 
CTATTCAAGA 
CGCCATGOCC 
QAAOTGAGCA 
CCTCTCIGAA 
TCTATQACAT 
TCAGTACAAA 
CAACAAGAAC 
CATTGCTCTT 
TCOGAGGGGA 
AAGTCAGAAT 
CA6CCCAQAQ 
CTTTTTCAGG 
ACTAAAAATG 
GGCAGAGAGT 
AGGATCAATT 
CAGCAAAGGA 
AACAAAAAAT 
CAAAGGOTGC 
TTTCTQAGGT 
ACCAACACAC 
CAAGCTTTTA 
GTTATTTTAT 
AAGTGCCTTT 



ACACCTGCTC 
GCAGACAGTG 
TCATCTCCAA 
AT6TOCC0CA 
GAGACCAGCA 
GATAATTOGT 
TACTACAACT 
GTATCAGAAA 
GAAA TTAC TA 
CTTACTGTAA 
GATGTAACAA 
GAGAAAATTA 
CTTCGGT6T6 
TQOQATTGCA 
AGTCTCAAOT 
GGTGGGGCCC 
CAAAAGTGTQ 
ACTATTGTGG 
ACAOGAAGAT 
TTTCAAAATC 
TTTCCTAAGQ 
CACAGCAGCA 
CCCAACCAAT 
GCAGTAAAGA 
TGGAAGTT6T 
TAAATGTAQT 
GTACTGAGAC 
CATTTCCAAT 
GAGTTAAGTG 
TGAGAAGGGA 
AGGGACACTG 
CTTGCAACCT 
AG6CGTGCCT 
TGCACTATGG 
AAAAAACATG 
AGAGAGGAGG 
6CACTTCTCT 
CCAGGAGTAA 
GA6AGGGAGC 
0TGCCATT03 
AACCACACAC 
CAAATQTTAT 
TTGTTATTAT 
OTTAOATGGT 



CCCCCATAAT 
AGTCAACCAC 
ATXjATGGATT 
CCACRflAAGA 
CCCTAAACAG 
TATGTQTTAA 
CTTCTACATG 
CATTTGAGCC 
GCTTGTTTAA 
GCACATCTCT 
TAGTAACAAT 
ATAAAGCAAT 
ATTATTATGG 
AATCTGAOCT 
GTGCTGATGC 
CTQAGTGTGC 
CATTTGGCTA 
6CACCAT0GC 
CAAATAACAA 
TAAAACTGOQ 
TCAGGATAAC 
TGCCCOGCCC 
G TACA AGCTA 
TCTOOOCTCC 
GAAT6TTTGC 
AGATGCTATT 
AATGGTTAGG 
CTAGAGGAAA 
AOCTAATTCC 
TACCCACCAT 
CTTCTATCCC 
AGCCTCACCC 
GGGAATCCAG 
GGTGCACTCT 
ACCTGGTAGA 
CACCTGGQAT 
AATCATGCCC 
GAGCCTTAGG 
TTGT8CTCA0 
GGCCTCAGCC 
ACACAACCAC 
TAQTGTCCTT 
TATTTOTTCT 
GAAAAAAAAA 



CAGTACACAT 
AAATGTAAAT 
AATCACAATG 
CAATCAATCA 
CACAGGTCCC 
GCTGCATAAT 
TAAGAAAGGA 
AGAAGAGAAA 
ACATQTATTT 
GTCACCAAQA 
TTTGGCAGAA 
TAGAAGTAGC 
CTGTAACCAG 
QCAAAGOGCT 
CTGCAAGGCA 
QTQOGTGCCC 
CAGTGGACTC 
TGGCATTGTC 
AAOQAAGCAT 
GT06ACAGGC 
GGCCrCCAGA 
TGACTATTAG 
TTATTTAGAQ 
GGGSTTTTTC 
AACX3AATGCA 
AGCGCTTOTC 
GTTGTTTTCT 
GCTC CCCAGC 
0CTGGGA6AC 
CTTTCAAGGG 
TCCAATGGAG 
AAOAAGACTG 
GAACTTTTTQ 
ACAOTTTTTG 
AGGAAGAOAO 
CCACCTTCTT 
TCCCGAAGAC 
TCAGTTTGAA 
GAQTCCAGCX: 
TTCTCTGGTG 
ACACACAAAT 
TTTTATTTCT 
TGACTGTTAA 
AAAAAAAAAA 



AOTTCCTCCA 
TCATTAGCTA 
UriX X TfCl tJ 
TCAGGGGCTC 
AGCAATOCTT 
ACAAGTTTTT 
AAGOTATTCC 
CATTOCATX30 
GGCACATCTG 
TCIGAAATGC 
ACCACAAGTG 
TCAAGCAACr 
ACTGOGGATG 
AAOCCACftGA 
CAGCACAAQC 
GGCTACCAGG 
GACTOTAAGG 
ATTCTCAGCA 
ATTGAAGAAG 
TTCACCAATC 
GACAGCCAGA 
AATCATAAGA 
TGTTTAGAAA 
TTCCATCTGA 
GCTCACTTGC 
AGA6AGGTGG 
TAATTCTTTT 
ATTGCTTGCT 
ATACGCATCA 
TCACAAOCTC 
AGATTCTQQC 
GAAAGAGACA 
ATGCTAATTA 
AAATGCTAGG 
QCAAAGGAAA 
CCTTAGGTCC 
TGGCTGGGA6 
ATTGGAGACA 
GTCCAGCCTC 
ACAQAGGCTC 
GGGGGCAAOC 
AATGCCTTQT 
TTGTGAATGO 



CAATTCCTAT 
CCTCTQACAT 
AAACACAAAG 
CCACrGGOkC 
GCCAAGATGA 
GCCTGTGTTT 
CTGGGAAGAT 
CCTATCAAGA 
TTTATGGACA 
6TGCTGATGA 
ACAATGAGAA 
TTCTAAACTA 
ACTGCCTCAA 
GCCCTTTCTO 
AATGCTTAAT 
AAGATGCTAA 
ACAAATTTCA 
TGATAATTGC 
AGAACTTGAT 
TTGGAGCAGA 
TGCAAAATCC 
ATGTGGAACC 
GACT6ATGGA 
CATCTGCCAG 
TAAATAAGAA 
TTTTCTTCAA 
CCTGGTAOGG 
CCTGGGCAAA 
ACTGTGGAGG 
ACTCTCTGAC 
AACCTTTGAA 
TATCTCTCAO 
GAAGGCCTGG 
AGGCAGAAGG 
CTGGGTGGGG 
CCTCCTCCAT 
AAG6TTTAAA 
AACTGTCTGG 
GGGGTGTAOG 
AGCTGT6GCC 
ACATGCAOTA 
CCTCTTAAAA 
TAAT6CAATA 
A 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
IBOO 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



A52 Protein sequence* 
Gene nansi 
Unlgene minO^en 
Probeset Accession #: 
Protein Accession S: 
Signal sequence? 
Transmeoibrane domains t 



SBA domain I 

Cellular Local Izationi 



Hypothetical protein FLJ20063 

Hs.5940 

AA053660 

none found 

none £oimd 

289-311 

45-74 

80-196 



60 
65 
70 
75 
80 



1 
I 

MVPSETQSNN 
NTSPCLCLEG 
PGTSVYGQTV 
SSSNFLMYDIi 
AQHKQCItlKK 
VILSMIIALI 
RDSQKQNPYS 



11 
I 

EMSPTTEDNQ 
YYYNSSTCKK 
ILTVSTSL8P 

TLRcmnrGCii 

SGGAPECACV 
VTARSMNKTK 
RKSSMPRPDY 



21 31 

I I 
SSOPPTGTAL LETSTLNSTG 
GKVPPGKISV TVSETPDPEE 
R8EMRADDKF VDVTXVTILA 
QTADDCUIOL ACDCKSDLQR 
PGYQEDAMGN OQXCAFOySO 
KZEBBNLIDB 0FQNLKLR8T 



41 51 

I I 

PSNPOQDDPC ADNSLCVKLH 60 

KHSMAYQDLH SBITSLFKDV 120 

BTTSONBKXV TBKIMKAIRS 180 

FNPQSPFCVA 8SUCCF0AGN 240 

LDCXDKFQLZ LTZVGTZAaZ 300 

GFTMLQABQS VPPXVRZTA8 360 



A53 DNA SEQ|UEWCB 
Gene name: 
Unigene number: 
Probeset Accession #: 
Nucleic Acid Accession #: 
Goding sequence t 



11 



TMPR8S3a 
Hs. 298241 
AI538613 
AB038157 

202-1566 (underlined sequences correspond to start and stop codona) 



, 21 31 41 SI 

I I i I I I 

ACOGGGCACC GGAOGG CTCQ OGTACTTTCO TTCTTAATTA GGTCATGOCC GTGTGAGCCA 
GGAAAOGGCr GTQTTTA TXXS GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGOTG 
CCATC TACAT TTTTGGGACT OQGGAATTAT GAGOTAGAGG TGGAGGOGGA GCCGGATGTC 
AGAGOTCCTG AAATAGTCAC CATGGGOQAA AA1QATC0QC CT6CTG1TGA AGCCCCCTTC 

854 



60 
120 
180 
240 
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PCTAJS02/36810 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



TCATTCOGAT 
GATGCTGTTG 
ATTOGQATCA 
TCAGGQAAGT 
OOAGTCTCGG 
AATGCCGTGC 
AAGGGTCACT 
GATAACCTCA 
CACCTCTTOC 
TGTGCCTCTG 
AGCTCA08CA 
CTTCAGTTCC 
ACTGCTGCAC 
CTAGTTTCCC 
AOCAAiOTAGA 
CTCA0C5TTCA 
GATGGAAAA6 
TCCCCTGTCC 
QAOOTGTAOO 
OTGQACAGCT 
AAC3TTAGTGG 
TACACCCGTG 
ACCTGAAGAQ 
TOCTCOCCTG 
TTCC66CACC 
CRAGCTGCTT 
AGTGCAGTGO 
TTGCCTCAGC 
TT6TATTTTT 
CCTC3UUITGA 
AOQCCTAGCX: 
AGGGCGGCCT 
ACGAGATAAQ 
GCACCAOOCC 
CCAAACCCAC 
ATGACTOSTT 
TTGTCCTTTQ 
AAAAAAAA 



OSCTTTTTGG 
CT6CACAGAT 
TTGCATTGAT 
ACAGATGTOO 
ATTGCAAAQA 
TCCAGGTGTT 
ACGCAAATGT 
6AGTGAGCTC 
CAGAT6AGAA 
GCCACGTGQT 
TCGTGGGTGG 
AGGOCTACCA 
ACTOTGTTTA 
TGTTGGACAA 
AGCCAAAGAO 
ATGAAATGAT 
TGTOCTGQAC 
TGAACCACGC 
GTGQCATOkT 
GOCAGGGGGA 
OAGCGACCAG 
TCACCTCCTT 
GAAGG6GACA 
GACTCCOGTQ 
AGTAGCAGGC 

T iimrmT 

CGAAATCCCT 
TTCCCCAGTA 
AGTAGAGACA 
TOTGCCTGCT 
TCACGCTCCT 
TTCCCACTGG 
CA6TTATGTG 
AQAA6TGCAG 
CCTTTCTACT 
TAAG6CCTAT 
ATTCCAAATA 



CCTTQATXSAT 
CCTGTCACTG 
ATTAGCACTG 
CTCATCCTTT 
GGGGGAGQAC 
CACAGCTGCT 
TGCCTGTGCC 
OCTG GAGGGG 
GGT8ACT6CA 
TACCTTGCAG 
AAACATGTCC 
CCTOTGCGGG 
TGACrrOTAC 
TOCAGCCCCA 
GCTGG6CAAT 
CCAGCCTGTQ 
GTCAGGATGG 
6GCGGTCCCT 
CTCCCCCTCC 
CAGOGGGGGG 
CTTTGGCATC 
CCT6GACTGG 
AGTAGCCAGC 
TAGGAAGCTG 
CCGAAAGAGa 
GTTTTTTTGA 
GCTCACTGCA 
GCTGGGACCA 
GGGTTTGACC 
TCAGCCTCCC 
TTCTQATCTT 
TCCATCTGGT 
ACCTCA06TG 
AACTGCAGTC 
TGCAAGACTT 
TTTCATOATT 
ATAT6TTTCC 



TTGAAAATAA 
CTGCCATTGA 
GCCATTGGTC 
AAGTGTATOQ 
6AGTAC06CT 
TCGTGGAAGA 
CAACTOGGTT 
CAGTTCCGGG 
TTACAOCACT 
TOCACAOOCT 
TTGCTCTCGC 
GaCTCTGTCA 
CTCCCCAAGT 
TCCCACITGO 
GACATOGOCC 
TGCCTX5CCCA 
GGGGCCACAG 
TTQATTTCCA 
ATOCTCT GOQ 
C0CCT6GTGT 
GGCTGGGCAG 
ATCCACQAGC 
TGAGTTCXTO 
CACAOGAGCA 
CACCCTTCCA 
GOTGGAGTCT 
GCCTCCXSCTT 
CAGGTGCCOQ 
ATGTT060CA 
ACA6TGCTGG 
CACTAAQAAC 
TTTCTCTCCA 
CAAAOCCACC 
ACTGCACGTT 
ATTTTCACAT 
TCTTTGTAGC 
TTCCCTCAAA 



GTCCTGTTGC 
AGTTTTTTCC 
TGGGCATCCA 
AGCTQATAGC 
GTGTC06GGT 
CCATOTGCTC 
TCCCAAGCTA 
AGGAGTTTGT 
CAOTATAIOT 
OTGGTCATAG 
AGTQGCCCTG 

TCACGcxxxrr 

CATOGACCAT 
TGQAGAAGAT 
TTATGAA6CT 
ACTCTGAAGA 
AGGATGGAGC 
ACAAGATCTO 
C36GGCTACCT 
0TCAAGAGA6 
A6GTGAACAA 
AGATGGAGAG 
A6GTGATGAA 
GACACCCTTO 
TCTGATTCCA 
CGCTCTGTTQ 
CCCTGGTTCA 
CCACXACACC 
GGCTGCTCTC 
GATTACAGGC 
AAAAGAAGCA 
GGGGTCTTQC 
AACAQCCACT 
TTCATCTCTA 
GTGGGGAGGT 
ATTTGQT6CT 
AAAAAAAAAA 



ACCAGATGCA 
AATCATOGTC 
CTTCGACTGC 
TOQATGTGAC 
GGGTGGTCAG 
CGATOACrGG 
TGTGAGTTCA 
GTCCATOGAT 
GAGGGAGGGA 
AAGGQGCTAC 
GCAGGCCAGC 
GTGGATCATC 
CCAGGTGGGT 
TQTCTACCAC 
GGCCGG6CCA 
GAACTTCCCC 
AGGTGA06CC 
CAACCACAGG 
GAOOGGTGGC 
GAGGCTGTGG 
OCCIGOGGRG 
A6ACCTAAAA 
GACAGCCaSA 
GAGCTCTGAO 
GCACAACCTT 
CCCAGGCT66 
AGOGATTCTC 
CAACTAATTT 
AAACOCCTGA 
ATGG6CCACC 
GCAACTTOCA 
AAAATTCCTG 
CA6AAAAGAC 
GGGACCAGAA 
TAATCTAGGA 
TQA06TATTA 
AAAAAAAAAA 



AS4 Protein aeouence; 
Gene namet 
Unigene number i 
Probeset Accession #: 
Protein Accession fti 
Sigxial sequence t 
Transmembrane domains t 
Tryp_SPc domain: 
Cellular Localization: 



TMPR8S3a 

Ha. 298241 

AI5386X3 

BAB20077 

none found 

43-65, 239-261 

216-444 

plasma nenbrane 



1 
I 

MGENDPPAVB 
LALAIGLGIH 
TAASWKTMCS 
VTALHHSVYV 
MXSGSVITPL 
LGNDIALMKL 
AVPLISNKIC 
FGIGCAEVNK 



11 
I 

APFSFRSLFG 
FDCSGKYRCR 
DDHKGR7AMV 
REGCASGHW 
HIITAAHCVY 
AGPLTFNENI 
KHRDVYGGZZ 
P6VXTRVTSP 



21 
1 

LDDLKISPVA 
SSPKCIELIA 
ACAQLGFPSY 
TLQCTAC3GHR 
DLYLPKSWTI 
QPVCLPK8EB 
SFSMZjCAGYL 
LDWIHBQMER 



31 
I 

PDADAVAAQI 
RCDGVSDCKD 
VSSDNLRVSS 
ROYSSRIVGG 
QVGIiVSLLDN 
NPPTORyCKT 
T6GVDS0QGD 
DIiKT 



41 

I 

LSLLPLKFFP 
GHDBYRCVRV 
LBQQFRBBFV 
NHSLLSQWPW 
PAPSHLVEKI 
S6HGATEZXSA 
8G0PLV0QES 



51 
I 

IIVIGIIAIil 
GOQNAVLQVF 
8IDHLLPDDK 
QASLOFOGYH 
VYH8KVXPKR 
6DA8PVUIHA 
RLWKLVGATS 



300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
X960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



60 
120 
180 
240 
300 
360 
420 



ASS DMA SEQUENCE 

Gene name* Putative O protein-coupled receptor OPCR150 
Unigene mudMn Ka. 97101 

Probeset Accession «t AA215333 
Nucleic Acid Accession #; m 014373 

coding sequence. 322-1338 (underlined sequences correspond to start and stop codons) 



1 

I 

GTGGCCTCGA 
GCCTCCGGAG 
ATGTCTCOQA 
AAATAACATA 
GACCCACTQG 
TCTTGAAATT 
CAGTTA06TC 
OOGAAAATAT 
TTTATGGAAT 
TCCATTATAT 
ATCTGCCTAT 
CTGACAQCTT 
10TCAAAAAT 



11 

I 

GOTGGTGQCA 
GCAGGTTOGG 
OCTTACTCAC 
ATT6AAGGCA 
AGAGGACAGA 
TAACTAAAAA 
AAACAAACCA 
TATTAAA TAT 

ArrrrxxjcAT 

TOTATTTCAG 
TTACTCAAAT 
G TATAG ATTA 
TATTTTATCT 



21 
I 

GGGCOGCCCC 
CTGQAAGGAA 
ATAGCATATT 
GTAAAAGTGA 
AAATGAA6CA 
TATGACTGCT 
GCCCCTAGAC 
CCTTACACTA 
TTCACTAOCA 
GGATTTTOTA 
TATTTCCTTT 
TT6CCTQAAT 
CTTTACAGTA 



31 
I 

CTGCAOTCOG 
CCGCTCTOGC 
GGTATATCAA 
AATTAAATAG 
GTGTTTTATC 
CTCTCTTCAO 
OTTAACTATC 
GQAATGAOAA 
TTCQTTGATC 
CTTTTAAGCA 
ACTTATGGCT 
TTCTC TAAAA 
ATTTTAATTT 



41 
I 

QAGAOGAAOQ 
TTOGTCCTAC 
AATGAAATGC 
GAAGATCATC 
ATGTGTATTT 
AGAACT6CTC 
TGCTATTCTT 
GAAAAAACAC 
TTTTACTTTT 
TTAGGTTCAC 
TTTTGCATTA 
CA ACCAAG CT 
GOATTTCAGT 



51 
I 

CACQOACGGG 
ACTT60GCAA 
AAOGAACCAA 
AGTCAAG6AA 
CAGC AGOTCT 
TTTTCAGTAC 
GATCATACTT 
CTGTCAAAAT 
GGTAAACATT 
TAAA TACCA C 
TOCAOTTTTC 
TTCATTTAAG 
CCTTGCTTAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



855 



wo 03/042661 



PCTAJS02/36810 



GTTTTGGGAG ACCCAGCCAT CTACCAAAGC 

TGi-or m x:! atgtcaocat tcaoagttac 

TTTGTAGCTT TCATAACCTG TTGGGAAGAA 
ACTTCCTATA TGAATGAAAC TATCTTATAT 
GTGAGATCTA AAAAAATATT CTTATCCAAQ 
CCATTTOTAC TACTTCAC3GT AATCATTGTT 
GAGATQAATA TTCCCTOGTT ATACTTTGTC 
TTTAATTGTC ACAAQCTTAA TTTAAAAQAC 
TGGAAOTOCT GCTTCATTCC ACTTACAATT 
TCAATAATGA TTrCTTAATA TTATTAATTA 
TATGAACAOA AA6AACTCAG GACATATTAA 
CCCTQ ACTGA TAGCATTTOV (3AATGT6TCT 
GTTTTATTTT AAAAACAAAA TAATTCCAAG 
TTACAAATAT TACTTTGTTA TTAAC^CAAA 
QATGTTTGTQ TTACTCAAAA AAACTACTGG 
ACTGACAACT TTAAGATATC AACX^IAAACA 
AAAAAAAAA 



CTG AA06C AC AGAATQCTTA TTCTOGTCRC 840 

T6GCTGTCAT TTTTCATGOT OATGATTTTA 900 

GTTACTACTT TGGTACAOGC TATCAGGATA 960 

TTTCCTTTTT CATCCCACTC CAGTTATACT 1020 

CTCATTOTCT GTTTTCTCAO TACCTGOTTA 1080 

TTAC TTAAAG TTCAOATTCC AGCATATATT 1140 

AATAGTTTTC TCATTGCTAC AGTGTATTGG 1200 

ATTGGATTAC CTTTGGATCC ATTTGTCAAC 1260 

CCTAATCTTG AGCAAATTGA AAAGCCTATA 1320 

AAAGTTACAG CTQTCATAAG ATCATAATTT 1380 

AAAATAAACT GAACTAAAAC AACTTTT6GC 1440 

TTTGAAG6GC TATAOCAGTT ATTAAATAGT 1500 

AAOTTTTTAT AGTTATTCAG GGACACTATA 1560 

AAGTGATAAQ AGTTAACATT TGGCTATACT 1620 

ATGCAAACTG TTATGTAAA7 CTGAGATTTC 1680 

rrrrTATTAA atgttcaaat otaagcaaga 1740 



A56 Protein sequence 
Gene namei 
Unigene mnnbert 
Protein Accession #t 
Signal sequence t 
Transmembrane domains : 
Cellular Localization: 



Putative G protein-coupled receptor GPCRISO 

Hs. 97101 

NP_055ie8 

none found 

23-45, 59-81, 97-119, 138-160, 184-306, 241-363, 276-397 
plasma membrane 



1 11 21 31 41 51 

I I I I i i 

MTALSSBNCS FQYQLRQTNQ PLDVNYLLFL IILGKIIiUfI LTLGMRRKMT CQNFMEYFCI 60 
SLAFVDUiLL VNXSIZLYFR DFVLLSXRFT KXHICLFTQX ZSFTYGFLHY PVFLTACIDY 120 
CLNFSKTTKL SFKOQKLFYF FTVILIHISV LAYVLGDPAI YQSLRAQNAy SRHCPFYVSI 180 
QSYHIjSFFMV MILFVAFITC WBEVTTLVQA IRITSVMNET ILYFPFSSHS SYTVRSKXIF 240 
L9KLIVCFLS TWLPPVLLQV HVLLKVQIP AYIEMMIPWL YFVK6FLIAT VYHFNCHXLN 300 
LKDIGLPIiDP FVNWKCCPXP LTIPNIiEQXB KPZSINXC 



A57 DNA SEQUEWCB 

Gene name: ESTs, HeaXly similar to CX;Ba7L collagen alpha I(IZX) chain precursor [H. sapiens] 

unigene number: Hs. 19322 

Probeset Accession ftt AA08a458 
Nucleic Acid Accession $i AA088458 

Coding sequence t 862-1995 (underlined sequences correspond to start and stop codons) 



1 11 31 31 41 51 

I I t I I I 

GCCCTTGGAC ACTGACATGG ACTGAAGQAG TAGAATGQAG CAOGAGGACA CTGACATGQA 60 

CTGAAQAAAA AGGAGCTGGA GCAGGAGAAQ GAGGTGCTGC TGCAGGGTTT GGAGATGATG 120 

GOGOGGGGCC GC6ACTGGTA CCAGCAGCAO CTQCAAC6AG TGCAGGAGOG CCAG06C0GC 180 

CTGGGCCAGA GCAOAGCCAQ CQCCQACTTT GGGGCTGCAG 6GAGCCGCGQ CCCACIGGGG 240 

GGGCTACTGC CCAAGGTACA A6AG6IG6CC OGGTGCCTGO GGGAGCTQCT GGCTGCA6CC 300 

TGT6CCAGCC GGGCCCTGCC OCOGTCCTCC TOCGGGCCCC CCTGCCCTGC CCTGACGTCC 360 

ACCTCACCCC CGQTCTGGCA GCAGCAGACC ATCCTCATGC TGAAGQAGCA QAACOGACTC 420 

CTCACCCAGG AG3TGAG0GA GAAGAGTQAG 06CATCA0GC AGCTGGAGCA GGAGAAGTOG 480 

GGGCrCATTA AGCAGCTGTT T6AO0OCC0C GGCCXGAGCC AGCAGGACQG GQQACCTCTG 540 

GATTCCACCT TCATCTAGTC CTT6T6G6CC GGGTGGGCCC CCAGG6CCAG CCXGGCACTC 600 

AGCCCTTCQA GGGTGGGOQC CCCATCGCAC CCACCCTCTC TGGCTGGAGA CCCCCGGCAG 660 

QOCCAGGCAC AGTCCOGGAG TGGGCGCCTT CCTGCCGCCC TT6CCAGATG GGCTCCCCAG 720 

GCCTGCCCCC GOCTGGTCCC OGCAOCGAGC QCTTGACTOC GTTTKGGCTC CTOGTT G YTG 780 

ACATOGGCIG GGGGCTCTCT TGAGTCOGCA TAGTC06CAG CIACTACTOO CC S CTQTC A Q 840 

TGGACAOTOO OGTACCCCTC CATQAGTTAQ COTOCCCCOO TTTCCAGOGO TGCCGCCCTG 900 

GGTCCCATCT TCAGGGAAAO GCACTGCCCA CGCCAGGCTO CACTTCCAAC AACGOQCAGC 960 

AGAGGGOQOG GGGCGGCTCC GACG06GGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGA0GA6G TGGCTGTAGC TCGGAOGGAC GOAAGTAOAT GGAGGOGGTG GGGAOGGCCT 1080 

OTAAOGSaQO OOTQCCTGCC TGGCTGGGGA GCCCCAGOGA TA60GGTCGG ACTTCAGGTT 1140 

CTGGCGAAGG CTGAGGGACC CTGGCTGCAO CGQATOGGCA COCCGGOTGG GCXSAGAGCTT 1200 

GGCCTGCATQ TGCCTCCCAC AQACCCTCGG GTGATGGCCP TCCCCCTCTT GGCOGGGACG 1260 

TTOCCCCACG TTQAGTCCCA CACAACATCC TGTGAGCCTG GCTCCCCAGO AGGGCCCCCA 1320 

GACAGCTOCC AGGCAOGTCA TAGGCAAAGC CT G T TTC CCC OGACTOIQGA TTTCCAAGGC 1380 

CTGGOGTOCT GCTCACCGCC CXTTGCTCTC AOGCCCAGCC TGTCCOCAGG TTTCAGCrGG 1440 

QAGAGGCCAC CTCCCTCAGC CAAGGAAAAC GAOAACCCCC A0G6TACAGG AGGAGGCT60 1500 

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG OCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTQQGGG GGCCCTGCTC AGCCCAACCT GGAQGGTCCC AGTGTCACCA 1620 

GAACCAGOGG CACGGCAACA GCATCGATGG GrTCTGCAOC CCAGGGCCCC 0GATO06GGG 1680 

TCAGTGTGXa TG6GG0QCA6 G6CCTC0GAT GCGGGGTCA6 TGGGTG6GGG GOGCAGGGCC 1740 

CCOGATOOGG GGTCAGTGOG TG6GGGG0GC AGGGCCCCCT CGTGTGCAGG GCACTTTG6T 1800 

ACAC TQTCCC ACAAGGCACC TQTCTCAGAG QAQGGGCCCT GGCAGGCRGC GTGGCAACTC 1860 

CCTTCCGGAQ CCCAGCTCCA TGCTAACCTG CCCACAGCAA CCCaACAOAa CCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGQ G G T G TCCCAG GAOVGGCCCA AGTCAGOCCA GCATGCAGCT 1980 

6CCCTCCTAC OCTQAAGATQ GQAGTQGOCT TTCCAGOGQA GATAAOQATO TCAG6CCTG0 3040 

ACCTCCTOOO CAGGAAAGGO TGCA6GTCCT QAG O GCCTOT GCCCCACAGC CCCAGCAOOC 2100 

AGGTGGACTG CAGOGCAOTQ GGTOQGCCAO TGGCA6CCA0 GOAGAAGCOC CCOQTCAQCA 2160 

GGCTGOOOTC XGCCCACOia GGGCrCCCCA C6TCTGCCTT TGAGGGtGCC TOCCATGCCC' 2220 



856 



wo 03/042661 



PCT/US02/36810 



5 
10 
15 



TGGGGGATCC 
GGTGACTTCA 
QAQACAGGCT 
AAAGAAATAO 
C^OQAGGGOA 
GCAGACCCTQ 
GAGCAGC3GTC 
G08TQCACAC 
CAGAAGTGTC 
TTTTGTGTTG 
CTGGAATCCC 
CCCCATCTCT 
TCATAAACAC 
TAGACCCAGA 
AQAAATAAAA 



TGGCATCTTT 
TCAQGAGACC 
GGCACCTCCO 
OTCCTOCCAa 
GAATTTAAAO 
OCTGQAGCCT 
CCTGGGCTCT 
TGTGATGACA 
COCA6TTGAO 
ATCAAOTTCC 
AGCACTTGAG 
ACAARAAAAA 
CACAAGGAAA 
TACTAGAATT 
GAGATTTCTG 



ACTGGACTGG 
6CCCACATAG 
GAAAAACTGC 
TTTACAGCTT 
GCCC0G6CTG 
GCCCTAGGAC 
ATCCGOGAGO 
GCGG6AAAT0 
AATCT G CCCC 
AAGGAAAAGG 
GCCAGGAQTT 
AAAAAGAAAO 
CAATACACTA 
ATCAGAGAOA 
GAAACATGAA 



AAQCAGGAGA 
AGCTGGACCC 
CTTTCAGCCT 
GAAATCAOGC 
GCAGGGTCTA 
GCTGGGOGGG 
TGCCACTAGC 
TCTCAGQAT6 
AGAOGAACAC 
AACATCTCAG 
CCAGAGCA6C 
AAAGAAAATO 
TQAGACCCAG 
ATATAAAGTA 



CAGAACAGT6 
OGCAGCTGAA 
TGGTGTTCOG 
TAGTGAGTOO 
GGTGGCTGGC 
TCAGTCTCCO 
GTOTGCAGGT 
TTGAAATGTG 
ACCCACACCA 
CO6GG0GT6G 
CTGG6CAA0G 
AGAOATCCAO 
CAGAAGCAAC 
AGAOTGTTTT 



TCTGTCCaSG 
60GGAAATGT 
TCCAA GGTGA 
OCX^TGGAGAC 
AGAG6CACAT 
TGC3«3GATGT 
ACATACACGT 
TCCTTGGGGG 
GOCCTCAGGA 
TOGTTCAOGC 
CA6TGAGA6A 
GTTTAAAAAT 
AGATTGACTC 
ATATATCTAA 



2260 
2340 
2400 
2460 
2520 
2S80 
2640 
2700 
2760 
2820 
2680 
2940 
3000 
3060 



20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



A5B DMA sequence 
Gene sianei 
Ukiigene number: 
Probeset Acceseion #t 
Nucleic Acid Acceesion St 
Coding Sequence: 



BST8 

Bs. 157601 

W07459 

AC005363 

328-2751 (underlined sequences correspond to start and stop codons) 



GACAGTGTTC 
TTTTATTTGC 
CCTGGGGGTA 
ACAAACAGGT 
CCCCCTGOCC 
TCX3CCGCTCT 
OTTTTCCrOT 
OAAACCATOS 
ATCATGTTTC 
CACTTTGCCA 
GCATTCCAGT 
CAGGAAGT8A 
CTTGCTCTOA 
CAGATCCTCA 
CAGCTGAAGG 
GAGCIGCAT6 
GAGGATOOCA 
ACGCCAGACT 
GAOTTCGCTG 
GCACACTGTC 
AGQACCACCT 
CCAGAAGGAC 
TGTGCCCT6A 
GCGGGCACCA 
GCCXTTGCrGA 
CTG6TGGCX30 
GOCATTCOCr 
0GTG0CTT06 
CTCACTGAQT 
GAGCTGCTCC 
GGCA6CCCAA 
GAGCT GCAGQ 
CTCGTCTTCA 
AGCTTTGTGA 
CTGQTOGTQT 
GCTGOQATGC 
ACOQCCCTGC 
GTCCCCAAAG 
OCCXIAGAAGC 
AGTGAG6GTC 
GCOGACCTGC 
CCAGTCAAOC 
GGGAGCTACC 
TGQAGCTCTT 
ATGGCTCCCQ 
G6CACTQAAA 
TTCCCGCOGT 
ATGCTGCTTA 
TTGATGTQTA 
CTGCCACCTT 
CX3TTCCTTTQ 
AGGCCTTTAC 
OCAGCTTTTC 
CTTGAGGGAC 
GGTCTCAGAC 
TGTGCATGGG 
ACCTTGAAGG 



11 

I 

GOGGCTGCAC 
AQACCTGGGC 
GTTCCTC06A 
GTCCCACGTG 
CGAGCCGCGC 
CCTTCCOTTA 
TTTCCAGA6T 
GGAAGATTTC 
TGTTAGATGG 
TCACAGTCTG 
TCAGTTCCAC 
AGGCAAGAAT 
AATACCTTCT 
TCATOGTCAC 
AAAG6GGTGT 
CACTGGCCAO 
CCAAG66CCT 
GCAGGGTC6A 
GCAATGCCCC 
CCTTCTACAG 
GCCCAGGCCC 
TGGAOGGCTA 
A6CTGAGCCT 
CTCTGGACGQ 
GCGAGQACTC 
TGCCIGTGG6 
TCCXSTGOTGa 
GGAG06CCAC 
CACACTCCGA 
T6CTGGGTGT 
AGCATGTOAT 
GGAAGCTGTO 
TGTTGQACAC 
GAAGCTGTGC 
ATGGCAGCCA 
TGCGGGCCAT 
TGCACATCTA 
CTQTGGTGGT 
TGAGGAACAA 
TQCGGAGGCT 
GGTACCACCA 
TCTGCAAACC 
GCTGCAAGTG 
GCTCTGTATG 
TGCAGGAGGG 
TGGTGCCTAC 
G6CCAGGACC 
GAGACAAGAA 
AGTAAATACC 
TCCCTTGAGG 
CACACAATCA 
TAGAGCATCC 
CACTTOCCCA 
OTTTGTGACT 
TQAATGTGAC 
CCCAGOTCTO 
TCTTC 



21 
I 

G6CTCGGAGG 
OGATGCOGCT 
CCTCAGCCGO 
GCAGCOGCGC 
CCGGGTCTGT 
TATCAACATG 
GCCCCCATCT 
AGCTGCCA6C 
GTCTAACAOC 
TGACGGTCTG 
TCCTCATCTG 
CAAGA06ATO 
GCACAGAGGO 
TGATGGGAAQ 
CACTGTGTTT 
CGAGCCTAGA 
CTTC3VBCACC 
GGCTCACCCC 
ATGCTGGAGA 
CTGGAAGAGA 
CTGTGACTOG 
CCAOTGCCTC 
G6AATGCAGG 
CTTCCTGOQG 
TCX5GGCCCGA 
GGAGTACCAQ 
OCCXACCCTO 
CAGGACAGGC 
GGATGAGGTT 
AGGCAOTGAG 
GGTCTACTOO 
CAGCOGGCAO 
CrCTGCCTCA 
CCTCCAGTTT 
GGTGCAGACT 
TAGCCAGGCC 
T6ACAAAGTG 
GCTCACAGQC 
TGGCATCTCT 
TGCAGGTCXC 
GGACGTGCTC 
CAGCCOGTGC 
TCGGGATGGC 
TG7GAGCCAG 
CAGCAGCOGT 
CTTCTGOAAT 
ACTATTCTCA 
AGCAGCTGAT 
CACTTTCTGT 
AXAAACAAGQ 
ATGC TOGCCA 
TTTGGAOGOC 
GAGACATTCT 
TCTTGGOGAC 
CAATTAACCA 
GAGGGCCACQ 



31 
I 

CTGGGTGACC 
TTAAAAAAOS 
GTCGGGTCGT 
CCCGGQOSCC 
GAOTAGAGCC 
CCCCCTTTCX: 
CTCCCrCTCC 
AAAATGATOT 
GTCGGGAAAG 
GACATCAGCC 
GAATTCCCCT 
GTTTTCAAAO 
TTGCCTGGAO 
TCCCAGG6GG 
GCTGTGGGGG 
GG6CAGCA00 
CTCAGGAGCT 
TGT6AGCACA 
GGATOGCXSGC 
GTGTTCCTAA 
CAGCCCTGCC 
TGCCCGCTGG 
GTCGACCTCC 
GCCAAAGTCT 
GTGGGTGTGG 
GATOTGCCTG 
ACGGGCAGTO 
CAGGACCGGC 
GCGGGCCCAG 
GCCGT60GGG 
GATCCTCAOG 
CGGCCAGGGT 
GTAGGGCCOQ 
GAGGTGAACC 
GCCTTOGGGC 
CCCTACCTAG 
ATGACOGTCC 
GGGAGAGGCG 
GTCTTGGTCG 
CGGGATTCCC 
ATTGAGTGGC 
ATGAATGAGG 
TGGGAGGGCC 
GGATGGATTC 
ACCCCTCCCA 
GTCTGTGCCC 
CTGAGGGAGG 
GTCACCCACA 
ACCTGCTGTG 
GGTCCTGAAG 
QAATQTTGTT 
GAAGGCCAOS 
GGATOCATTT 
T6CCTTTTGT 
aCTTGGTTGA 
TAAAATOGTT 



41 

I 

0GO3TA6AAG 
O GAOGGGCTC 
GCOQCCCTCT 
CCTCCTQTQA 
GCCOGGGCAC 
TGTTQCTGGA 
A0GAA6TCCA 
GGTGCT06GC 
GGA6CTTTGA 
OOQAGAGGGT 
TG6ATTCATT 
OAGGGGQCAC 
GCAGAAATGC 
ATGTGGCACT 
TCAGGTTTCC 
TQCTGTTOGC 
OGGCCATCTG 
OGAOOCTGQA 
GGACCCTTGC 
GCCACCCTGC 
AGAAT6GAGG 
CCTTTGQAGa 
TCTTCCTGCT 
TOGTGAAGCG 
CCACATACAG 
ACCTGOTCTG 
CCTTG06GCA 
CACGTAGAGT 
OQCOTCACGC 
CAGAGCTGGA 
ATCTGTTCAA 
GCCGGACACA 
AQAATTTTGC 
CTGAOGTGAC 
TGGACACCAA 
GTOGGC?rGGG 
AGAGGG6TGC 
CAGA66ATGC 
TGGGCGTGGG 
TGATCCAOQT 
TGTGT6GAGA 
GCAGCTGOOT 
GCCACTG03A 
TTGAGAOSCC 
OCAACTACAO 
CAGGTOCTfA 
AGGATGTCCC 
AA06ATGT7G 
CCTTOTTGAG 
ACTTAAATTT 
GACACAOTAA 
GOCTTTCAAG 
GCATTGAGTC 
OTGTGGAAGA 
TGAT6GGGGA 
CTQAGTOGTC 



51 

I 

TGAAGTACTT 
TATGCACCTC 
CCCAGGAGAG 
TCCCX3TAG0G 
CGAGCX3CTGG 
GGCOGTCTGT 
TGTAAGCAAA 
TGCAGTGGAC 
AAGOTCCAAG 
CAGAGTGGGA 
TTCAACCCAA 
GGAGACG6AA 
TTCTGTGCOC 
GCCATCCAAG 
CAGCnCGGAG 
TGAGCAGGTG 
CTCCAGOGCC 
GATGGTCGGG 
GGTGCTGGCT 
CACCTGCTAC 
CACATOTGTT 
GQAGGCTAAC 
GGACAGCTCT 
GTTT6T6GGG 
CAGGGAGCTG 
GAGCCT06AT 
GGCGGCAGAG 
GGTGGTTTTO 
AAGGGCX30QA 
GGAGATGACA 
CCAAATCCJCT 
AGCCCTGGAC 
TCAGATGCAG 
ACAGGT OGQC 
ACCCACCCGO 
CTCAGCCGGC 
COGGCCTGQT 
AGCGGTTOCT 
GCCTQTOCTA 
GGCAGCTTAC 
AGCCAAGCAG 
CCTGCAGAAT 
GAACOBTGAG 
CCTGAGGCAC 
AOAAGGGCTG 
GAATOTCT6C 
AACTGCAGCC 
TTGAAAAGTT 
GCTATGTCAT 
AGOGGCCrOA 
TGCCCAGCAG 
ATGGAAAGCA 
TGAAAGQGGG 
GACITGQAAA 
GGGGCTGAOT 
AGCAGTGTCC 



60 
120 
160 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
1560 
1620 
1660 
1740 
1600 
1860 
1920 
1960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2560 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
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PCT/US02/36810 



AS9 Protein gequence 

Gene name: BSTs 
Ottigene number: He. 157601 

Protein Accesaion 8i none found 

Signal sequence t 1-17 
Transmenlbrane domains t none found 
VGW domains: 49-223; 341-518; 529-706 
EOF domains: 298-333; 715-748 
Cellular Localization t secreted 

1 11 21 31 41 51 

111)11 

MPPPLLLBAV CVFLPSRVPP SLPLQEVHVS KBTIOKISAA SKMMHCSAAV DIMFLLDOSN 60 

SVQXQSFBRS KHFAITVCDG LDZSPERVRV GAFQP88TPH LBFPZiDSPST QQBVKARIXR 120 

MVFKGGRTBT GLAZiKyXiLHR QLPGGRNASV PQILIXVTDG KSQGDVALPS KOLKERGVTV 180 

PAVGVRFPRW BELHALASEP RGQHVLLAEQ VEDATNGLPS TLSSSAICS9 ATPDCRVEAH 240 

PCEHRTLEMV RBPAONAPCW RGSRRTLAVL AAHCPFYSWK RVPLTHPATC YRTTCPGPCD 300 

SQPOQKGOTC VPEGLDQYQC LCPLAFGQEA NCALKLSUSC RVPLLFLLDS SAOTTLDOFL 360 

RAKVFVXRFV RAVLSSDSRA RVGVATYSRB LLVAVFVGBy QDVFDLVH8L DGIPFRGGPT 420 

LTQSALRQAA ER6FGSATRT GQDRPRRVW LLTESHSEDB VAGPARKARA RBLLLLGVOS 480 

EAVRAELEEI TQSPKHVMVY SDPQDLFNQI PBLQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENPAQM QSPVRSCALQ PBVNPDVTQV GLWYGSQVQ TAFGLiyrKPT RAAMLRA2SQ 600 

APYLGGVGSA GTAUJUYDK VMTVQRGARP GVPRAVWLT G6RGAEDAAV PAQRLR2DIGI 660 

SVLWOVGFV LSEOLRRLAO PRDSItlHVAA YADLRVHQDV LZEHLOGBAK QFVNLCKPSP 720 

CMNE6SCVLQ NGSYRCKCRO GWBGPKCENR BWSSCSVCV8 QGNZU5TPLR HNAPVQEGSS 760 
RTPPSNYREG LOTEMVPTPW MVCAPGP 

A60 DMA SKQUEWCE 

Gene name: Homo sapiens type ZI roeiribrane serine protease mRNA 

Ubigene number: Hs. 63325 

Probeset Accession AA411502 
Nucleic Acid Accession #: MM_016425 

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I i I I 

ATQT TACAGQ ATGCTQACAG TGATCAACCT CTGAACAGCC TCQATGTCAA ACG0CT6CGC 60 

AAAC COOSTA TCCCCATGGA GACCTTCAGA AAGGTG6GGA TCOOCATCAT CATAGCACTA 120 

CTQAQCCTGG OQAGTATCAT CATTGTGGTT GTCCTCATCA AOGTQATTCT GGATAAATAC 160 

TACTTCCTCT GCGQGCAGCC TCTCCACTTC ATCOOQAGGA AGCAGCTGTO TOACQGAaAG 240 

CTGQACTQTC CCTTGOQQGA GQACGAGGAG CACTQTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTOGGCCACA 360 

OGGAACTGGT TCTCTGCCTG TTTCQACAAC TTCACAQAAG CTCTCGCTGA GACAGCCTGT 420 

AG6CAGATG6 QCTACAGCAG CAAACCCACT TTGAQAGCTO TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

OGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG 6AAGAGCCTG 600 

AAGACCOCCC GTGTGGTGGG TGGGGAGQAG GCCTCTGTGG ATTCTTOOCC TTGGCAGOTC 660 

AGCATCCAOT AGGACAAACA GCACGTCTGT GQAGGGA6CA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG COCACTGCTT CAOGAAACAT A0CGAT3T0T TCAACTGGAA GGTGOGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATT6AA 840 

T TCAAC CCCA TOTACCCCAA AGACAATGAC ATCGCCCTCA TGAAQCTQCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAC GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCOCAC TCTGGATCAT TGGATGQGGC TTZAOGAAGC AGAATGGAGG 6AAGATGTCT 1020 

GACATACIGC TGCAGGCGTC AGTOCAGGTC ATTGACAGCA CACGQTGCAA TGCAGACGAT 1080 

GCX3TACCAGG GGGAA6TCAC 06AGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGOC AGGGTGACAG TGGTGGGCCC CTGATOTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATOG TTAGCTGGGG CXATGGCTGC 66GGGCCCGA GCACOCCAOG AGTATACACC 1260 
AAGGTCrCAG CCTATCTCAA CTGOATCTAC AATOTCTGGA AGGCTQAGCT GTAA 

A61 Protein sequence: 

Gene ziamei Homo sapiens type Zl nenbrane serine protease tdRKA 

unigene nuiriber: Bs. 63325 

Probeset Accession «i AA411S02 

Protein Accession #: NP_057509 

Signal sequence: none found 

Transmen^rane domains: 31-53 

XJDLa domain: 54-94 

Tryp_8Pc domain t 204-429 

Cellular Local izat Ion i plasma membrane/ER 

1 11 21 31 41 51 

I I I I I I 

MLQDPDSDQP LNSU>VKPLR KPRZPKBTFR KVGXPZIZAL LSLASIIIW VLIKVILDKY 60 
YFLOGQPLHF IPRKQLCDGB LDCPLGEDEB HCVK9PPEGP AVAVRL9KDR STLQVLDSAT 120 
GNWFSACPDN FTEALAETAC RQMGYSSKPT PRAVBIQPDQ DLDWBITSJ 6QBLRMRNS8 180 
GPCLSOSLVS LHCLAOGK6L KTPRWGGBB ASVDSHPWQV 8ZQYDKQHVC GGSXLDFHWV 240 
LTAAHCFRKH TDVFNWKVRA GSDKLSSPPS LAVAKZZZZB FHPKYPKDKD lALMKLQFPL 300 
TF8OTVRPZC LPPFDBBLTF ATPLHZZGWG FTXQIIGQIGCS PZLLQASVQV XD9TRCNADD 360 
AyoOEVTEKN NCAOZPGGQV DTOQSDSGGP LMYQSDQmiV VQXVSHGYGC GGPSTPCVYT 420 
KVSAYUntZY NVWKABL 
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A62 DWl SBOOBNCB 

Gene name« sSTa; Weakly similar to I.ITHOSTATHINE 1 BETA PRECURSOR [H.eapieas] 

Unigene nuTn]t>eri Hs. 105484 

Probeset Accession #: AA314779 
Nucleic Acid Accession #i none found 

Coding sequence! 103-579 (underlined sequences correspond to start and stop codons) 



1 11 31 31 41 51 

I I t I I I 

CCAAAC AGAT TTGCAQATCA AtSGAGAACCC AGGAQTTTCA AAGAAGOGCT AGTAAGGTCT 60 

CTQAQATCCr TGCACTAGCT ACATCCTCAG GGTAGOAGQA AGATGGCTTC CAQAAGCATQ 120 

CGGCTQCTCC TATTOCTOAO CTGCCTGGCC AAAACAGGAO TCCTGGGTGA TATCATCATG 180 

AOACCCAQCT GTGCTCCTGO ATGGTTTTAC CACAAGTGCA ATTGCTATGG TTACTTCA60 240 

AAGCTGAOGA ACTGGTCTGA T6C0QA6CTC (»OTGTCAGT CTTAOGQAAA aSOAGCCCAC 300 

CTGGCATCTA TCCTGAGTTT AAAGGAAGCX: A6CACCATAG CAC3AGTACAT AAGTGGCTAT 360 

CAGAGAA6CC AGCOQATATQ OATTQGCCTG CAOQACCCAC AGAA6AGGCA GCAC3TGGCA0 420 

TGGATTQATG GGGCCATGTA TCTGTACAGA TOCTGGTCTG GC3lAl6TCCAT GGGT6G6AAC 480 

AAGCACTOTG CTOAGATOAO CTCXIAATAAC AACTTTTTAA CTTGGAGCAa CAACGAATGC 540 

AACAAGGOCC AACACTTGCT GT6CAAGTAC OBACCATAQA GCAAGAATCA AGATTCT6CT 600 

AACTCCTOCA CCAGCCCOBT CCTCTTCCTT TCTGCTAGCC TGGCTAAATC TGCTCATTAT 660 

TTCAGAGGGG AAACCTAGCa AACTAAQAGT GATAAGGGCC CTACTACACT GGCTTTTTTA 720 

GOCTTAGAGA CAGAAACTTT AGCATTGGQC CSCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780 

CCCXX3CCATC CCTTTCCACA GTATCCTTCT TOCCTOCTCC CCTOTCTCTO GCTGTCT08A 640 

GCA8TCTAGA AGAGTGCATC TCCAGCCTAT GAAACAGCTG OGTCTTTGGC CATAAGAA6T 900 

AAAGATTTQA AQACAGAAOG AAGAAACTCA GGAOTAAaCT TCTAQAOCCC TTCAOCTTCT 960 

ACACCXrrrCT GCCCTCTCTC CATTGCCTGC ACCCCACCCC AGCCACTCAA CTCCTGCTTG 1020 

TTTTTCCTXT GGCCATAGGA AGGTTTACXIA GTAOAATCCT TOCTAGGTTO ATGTGGGCCA 1080 
TACATTCCTT TAATAAACXA TTOTOTACAT AAGAGAAAAA AAA A AAAAAA AAAAAAAAAA 



A63 Protein eequenee: 
Gene names 
Unigene nuiobert 
Probeset Accession 
Protein Accession 
Signal sequence i 
Transmeinbrane doznains: 
C-type lectin domain t 
Cellular Localizations 



BSTs; Keakly similar to LITHOSTATHINE 1 BETA PRECURSOR [H. sapiens] 

Hs. 105484 

AA314779 

none found 

1-22 

none found 

47-156 

secreted 



1 11 21 31 41 51 

I I I I I I 

MASRSHRLLL LLSCLAKTGV IiGDXIMRPSC AFGHFYHKSN CYOYFRKLRN WSDAELEOQS 
YGNGAHLASI LSLKBASTIA BYISGYORSQ PZWIGUIDPQ KRQQWQHZIXS AMYLYRSHSO 
KSMSGNRRCA EMSSMMNFIiT NSSNBCNKRQ BFLCKYRP 



60 
120 



A64 DBA SEQUfiWCB 
Gene names 
Uhigene numbers 
Probeset Accession #: 
Nucleic Acid Accession Ui 
Coding sequence: 



Cadherin 3« P-cadherln (placental) 

Hs,2877 

X63629 

X63629 

54-2543 (start and stop codons are underlined) 



1 
I 

GOGGAACACC 
TCCCTOGTCQ 
CCTCC6A6CC 
06GAGCAGGA 
CAQCTCTOTT 
AAAGAAOGTC 
GAAOACACAA 
CCTTCCCCCA 
ACAGCATCAC 
AQACAGGCTG 
TCTTTGGCCA 
TCATGOTGAC 
GTGTCTTAGA 
ATQAT6CCAT 
AGGACCCACA 
CCAGT6GCCT 
TOGATGGOGA 
ACAATGCTCC 
GCCATGAGOT 
GTGCCACCTA 
CTQAOAGCAA 
AGCACACGCT 
OCACAGCCAC 
CCVGCAAAGT 
CTGCAOAAOA 



11 

I 

GGCCCGCOGT 
ACCrCTOGOG 
OTGCGGGGOG 
GCC06GCCAG 
TAGCACTOAT 
ACTGAAGGAA 
OAGAGATTGG 
GAQACTGAA7 
GGG6CC6GG0 
GTTGTTGITG 
OGCTGTGTCA 
OQACCAGAAT 
GG6AGTCCTA 
CTACACCTAC 
OGACCrCATG 
GGACC6GGAA 
CG QCTCCA CC 
CATOTTTGAC 
GCAGAGGCTG 
CCTTATCATO 
CCA GGGCA TC 
GXAGOTTGAA 
CATAOTOGTC 
COTTOAGQTC 
CCCTGACAAG 



21 
I 

CGCG6CAGCT 
TCTCrCCTCC 
GTCTTCAQGG 
GCGCTGGGGA 
AATQATGACT 
AGGAATCCAT 
OTGOTTGCTC 
CAGCTGAAOT 
GCAGACAGCC 
AATAAGCCAC 
GAGAATGGIQ 
GACCACAAGC 
CCA6GTACTT 
AATGGG0TG6 
TTCACAATTC 
AAAGTCCCTO 
A0CACGGCA6 
CCCCA6AAGT 
A06GTCACT0 
GG06GTGA0Q 
CTGACAACCA 
GTGAOCSUiOO 
CAOGTGGAGG 
CAGQAOQGCA 
GAOAATCAAA 



31 

1 

GCTTCACCGC 
TTCTOCAOGT 
AQOCTQAAOT 
AAGTATTCAT 
TCACTGTGCG 
TGAAGATCTT 
GMTATCTGT 
CTAATAAAOA 
OCCCTGAGGG 
TGGACCGGGA 
CCTCAGTGQA 
GCAAGTTTAC 
CTGTGATGGA 
TTGCTTACTC 
ACOGGAGCAC 
AGTACACACT 
TOGCAGTAGT 
A06AG6CCCA 
ATCTGGACGC 
AOGGGGACCA 
GGAAGGGTTT 
A G QO O OC n T 
ATGTGAATGA 
TCOOCACTGO 
A0ATCA6CTA 



41 



TCTCTCXGCA 
TTGCTGQCTG 
QACCTTGQAG 
GGGCTGCCCT 
GAATGGCGAG 
CCCATCCAAA 
OCCTGAAAAT 
TAOAGACACC 
TGTCTTOGCT 
GGAGATTGCC 
GGACCOCATG 
OCAGGACACC 
(3GTGACA6CC 
CATCCATAGC 
AGGCACCATC 
GACCATCCAG 
GGJUSVTOCTT 
TGT6CCTQA0 
CCCCAACTCA 
TTTTACCATC 
GGATTTTGAG 
TGXGCTGAAG 
GGCACCTGTO 
OGAGGCTGrO 
COGCATOCTO 



51 
I 

GCCATGGGGC 
CAG'fGOGGGO 
GOGGGAGGOQ 
GGGCAAGAGC 
ACAGTCCAGG 
OSTATCTTAC 
OQCAAGGGTC 
AAQATTTTCT 
GTAGAOAAGG 
AAGTATGAGC 
AACATCTCCA 
TTCOBAGGQA 
AGAOATGAGG 
CAAGAACCAA 
AGCGTCATCT 
GCCACAQACA 
GATGCCAATO 
AATGGAiSTGO 
CCAGOGTGGC 
ACCACCCACC 
GCCAAAAAOC 
CTCOCAAGCT 
TTT O TCC CA C 
TQTGTCIACA 
AGAGACCCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
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CAGGGTGGCT A60CMOGAC CCA6ACAQT6 G6CAGGTCAC AGCTGTOGQC ACCXTTCQACC 1S60 

GTGAGGATGIl GCAGTIT G T G AGOAACAACA TCTATGAAGT CATGGTCTTO 6CCATGGACA 1620 

ATGOAAGCXX: TCCCACCACT GGCAaSGGAA CCCTTCTOCT AACACTOATT GATOTCAAOG 1680 

ACCATGGCCC AGTCCCTGAG CCCCQTCAGA TCACCATCTG CAACX»AAGC CCTGTGCGCC 1740 

AOGTGCTGAA CATCAOGGAC AAGGACCTGT CTCCCCACAC Cl Va.ViT i' C CAQGCCCAGC 1800 

TCACAGATGA CTCAGACATC TACTGQAOQO CAGAGGTCAA aSAGGAAGGT GACACAOTGO 1860 , 

TCTTQTCCCT GAA6AAGTTC CT0AAGCAG6 ATACATATGA OOIGCACCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAQAGCAO CTGACGOTQA TCAGGGCCAC T6T0TG0QAC TGCCATGGCC 1980 

ATGTCGAAAC CTGCCCTGQA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTQ 2040 

TCCTGGCTCT OCTOTTCCTC CTGCTGGTOC TQCTTTTGTT G0TGAGAAA6 AAGOGGAAGA 2100 

TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCOSTGA CAAOGTCTTC TACTATG606 2160 

AAGAGGGGGG TGGCQAAGAG 6ACCAGGACT ATQACATCAC CCAGCTCCAC CGAGGTCTGG 2220 

AOGCCAGGCC GGAGGTGGTT CTCOGCAATO AOQTQGCACC AACCATCATC CCGACACCCA 2260 

TGTACOGTOC TAGGCCAGCC AACCCAGATO AAATOSGCAA CTTTATAATT GAGAACCTGA 2340 

AGGOGQCTAA CACAGACCCC ACAGCOCOGC CCTAOQACAC CCrCTTGGTO TTGQACTATQ 2400 

AQGGCA6060 CTCOQAGGCC GOGTCCCTGA QCTOOCTC A C CTCCTC060C TCGOACCMG 2460 

AOCAAGATTA OQATTATCTG AAC6A0TGG0 GCAG0C6CTT GAAQAAGCTG GCAOACATGT 2520 

AGGGTGGCGG GGAGGACGAC TAGGOGGCCT GCCTQCAGGG CTGGGGACGA AACGTCAGGC 2580 

CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CXXTTTCAGCT GAGGACTTOG GAGCTTGTCA 2640 

GGAAGTGGCC GTAGCAACTT GGOGGAGACA GQCTATOAGT CTGA06TTAG AGT6GTTGCT 2700 

TCCTTAGCCT TTCAGGATOO AG6AAT6TG6 GCAOTTTQAC TTCAGCACTG AAAACCTCTC 2760 

CAOCTGGGCC AGGCTTTGCCT CAQAGGCCAA GTTTCCAGAA GOCTCTTACC TQGGGTAAAA 2820 

TGCTCAACCC TGTGTCCTGa GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

CTG6AATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT ■ rm - ^yxyn ' AATOCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCA6A GCTGCTGGGC CCACTGGCOS 9000 

TCCT6CATTT CTOQTTTCCA GACCOCAATG CCTCCCATTC GGATGGATCT Cl I S OG rrm ' 3060 

AT ACTGAG TG TGCCTAOQTT OGOOCTTATT TTTTATTTTC CCT O TTGOOT TQCTATAGAT 3120. 
aAAGQQTQAO GACAATOaTQ TATATQTACT AGAACTTTTT TATTAAAGAA A 



A6S Protein aeguenee; 
Gene naniet 
Ubigene nuniber: 
Probeset Accession # : 
Protein Accession Ut 
Signal sequence t 
Transmeinbrane domain t 

Cellular localizations 



Cadherin 3« P>cadherin (placental) 

Hs.2877 

X63629 

CAA45177 

1-24 

659-675 (second underlined sequence) 
plasma mendbrane 



1 11 21 31 41 51 

I I 1 I I i 

MGLPRGPLAS LLLLQVCWLQ CAA8BPCRAV FRBABVTLBA GGAEQBPGQA liGKVFMGCPG 60 

QEPALPSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENQ 120 

K3PPPQRI2JQ LKSNKDRDTK IFYSITGPGA DSPPEGVPAV EKETGWUiLN KPLDREEIAK IBO 

YELFGHAVSB HGASVEDPMN ISIIVTDQHD KKFKFTQDTF RGSVL£X3VLP GTSVMQVTAT 240 

DEDDAiyrm GWAYSIHSQ EPKDFRDLHF TIHR8TGTI6 VZSSGIiDREK VPByTIiTI(2A 300 

1DMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPBN AVGHEVQRLT VXDLDAPNSP 360 

AWRATYLIMG GDDGDHPTIT THPBSNQGIt* TTRKOLDPBA KNQHTLYVBV TKBAPPVUCL 420 

PTSTATIWH VEDVNEAPVP VPPSKWBVQ BGIPTGBPVC VYTAEDPDKE NQKISYRILR 480 

DPAGHIAMDP DSGQVTAVGT LDREDBQFVR NNXYEVMVLA KDNGSPPTTO TOTLLLTLID 540 

VMDHGPVPBP RQITIGNQSF VSKVUniDK DLSPBTSPFQ AQLTDD8DXY HTABVNEBOD 600 

TWLSLRKFL RQDTVSlVHIiS LSDHGNKEQIi TVIRATVCDC HGBVBTCPGP HKOOFZLFVb 660 

GAVIiAIiLFLL LVLLLLVR2CK RKIKEPUiLP EDDTRSKVFY YGBBGGGEED (2DYDXTQLBR 720 

GIjEARPEWL R2IDVAFTIIP TPMYRPRPAM PDEIGNFIIB NLKAANTDPT APPYDTXiLVF 780 
DYEGSGSDAA SLSSLTSSAS OQOQDYDYLH ENGSRFKKLA DMYGGQEDD 



A66 DMA 8BQUBWCB 

Gene names ATPaee, Ca** transporting, type 2C, member 1 

Oblgene numbers Ks. 106778 

Probeset Accession 9s N51919 
Nucleic Acid Accession fts AP189723 

Goding sequences 1-2712 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I 1 

ATGATTCCTG TATTQACATC AAAAAAAGCA 
ATTCTCCAAG CTGATCTTCA GAATGGTCTA 
TTTCATG6CT GGAATGAGTT T6ATATTA8T 
TCTOUSTTTA AAAATCCOCT TATTATGCTO 
ATOCATCAGT TTGATGATGC OGTCAGTATC 
GCCTTTOTTC AOGAATATOO TTCAGARAAA 
CCAQAAT6CC ATTGTGTGOQ TGAAGGAAAA 
CCAGGTGATA CAGTTTQCCT TTCTGTTGGG 
GAGGCXQTQG ATCTTTCCAT ItSATGAaTOC 
AAGGTSACAO CTCCTCAGCC AOCIOCAACT 
GGCTTTATGG GAACACTGGT CAGATGTGGC 
QAAAATTCTO AATTTGG6GA GOTTTTTAAA 
CCTCTGCM3A AGAGCAT6GA OCTCTTAGGA 
ATAQQAATCA TCATGTTGGT TOGCTGQTTA 



31 41 51 

I I I 

AGTGAATTAC CAOTCAGTGA AGTTGCAAGC 60 

AACAAATGTG AAGTTAGTCA TAGGCGAGCC 120 

GAAGATQAGC CACTG1GQAA GAAGTATArT 180 

crix:itjGLTr ctqcaotcat cagtgtttta 240 

ACTGTG6CAA TACTTATOGT TGTTACAGTT 300 

TCTCTTGAAG AATTGAGTAA ACTTGTGCCA 360 

TTGGAGCATA CACTTGCCX3Q AGACTTGGTT 420 

GATA6A6TTC CTOCTGACTT AG6CTT0TTT 480 

AGCTTGACAO GTGAOACAAC GCCTTOTTCT 540 

AATGGAOATC TTGCATOGAO AAOTAACATT 600 

AAAGCAAAGG GT G TTGTCAT TGGAACAGGA 660 

ATGATQCAAO CA6AAGAGGC ACCAAAAACC 720 

AAACAACTTT CCTTTTACTC CTTTGGTATA 780 

CTGGQAAAAO ATATOCTGGA AATGTTTACT 840 
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PCT/US02/36810 



ATTA6TGTAA 
AG8CTA8CTC 
ATTGTTGAAA 
AAOAATGAAA 
GQ21QTT66CT 
TATAACXXAG 
AGAAACAATA 
ATCGGTCrro 
G AGCAAA AGT 
TGTTTTATGA 
OGOCAGAOCr 
ATGQGCTCAO 
ACATTTCTTG 
ACAACACTCA 
6CA0TTGCAA 
GAAG AAATAQ 
TTTTACAGAQ 
TCAGTTGTAG 
ATTQQAOTTG 
CTAGTGQATG 
AATAACATTA 
ATCTCATTGO 
ATCAATATTA 
GATGTCATTC 
CTTAAAATAC 
GAGCTAOSAG 
TTTTTT6ACA 
GGACTCTGCA 
CTAGTTATTT 
GATCTGTTG T 
AAGGTTGAAA 
CTTGAAGTAT 



GTTT6GCTGT 
TTGGTGTTAT 
CTCTGGGCTG 
TGACTGTTAC 
ATAATCAATT 
CTGTTAGCAG 
CTCTAATGGG 
ATGGACTTCA 
GGATGGCTGT 
AAOQTGCXTA 
TGACACTTAC 
OGGGACTCAG 



TTGCCTCAGG 
TOGCCAGTGO 
ATGGAATGOA 
CTA6CCCAAG 
CCATGACAGO 
CQATGGGCCA 
ATGATTTTCA 
AAAATTTOGT 
CTACATTAAT 
TTATGGATG6 
GTAAACCTCX: 
TTGTTTCATC 
ACAATOTGAT 
TGTTCAATGC 
GTAATAGAAT 
ACTTTCCTCC 
TTCTTTTGQQ 
GGAGCA6GGA 
GA 



AGCAGCAATT 
GA6AATQGTG 
CTGTAATGTG 
TCACATATTT 
TGGGQAAGTG 
AATTGTTGAG 
GAAGOCAACA 
ACAAGACTAC 
TAAGTGTGTA 
CGAACAAGTA 
TCAGCAGCAG 
AGTTCTTGCT 
AATCATTQAT 
AGTATCAATA 
TCTGGGATTG 
TGTTGAGCAa 
GCACAAGATG 
AOATGGAGTA 
GACTGGTACA 
AACCATAATG 
TAGATTCCAO 
GAACTTTOCT 
ACCCCCAGCT 
TCX3CAACTGG 
AATAATCATT 
TACAOCTCGA 
ACTAAGTTCC 
GTTTTGCTAT 
GCTTCAOAAQ 
TCTCACCTCA 
AAAGATGCAG 



CCTQAAGGTC 
AAGAAAAGGO 
ATTTGTTCAG 
ACTTCAGATQ 
ATTGTTQATG 
GCGGGCTGT6 
QMOGGGCCt 
ATCAGAAAAO 
CACCGAACAC 
ATTAAGTACT 
AGAGATGTGT 
TTGGCTTCTG 
CCACCTAGAA 
AAAATGATTA 
TATTCCAAAA 
CTTTCACAAA 
AAAATTATTA 
AATGATOCAG 
GATGTTTGCA 
TCTGCAATCG 
CTGAGCACGA 
AATCCTCTCA 
CAGAGCCTTG 
AAAGACAGCA 
GTTTGTGGGA 



AGATCCCAGA 
GCAOTTCTTQ 
GTTTTTCAGA 
TCA6T6TGCA 
AAGCATGTTA 



TCCOCATTGT 
CCATTGTGAA 
ATAAAACTGG 
GTCTGCATGC 
GTGATGTTGT 
TGTGCAAT6A 
TAATTGCTCT 
CTGAATACCC 
AGCAGQACAG 
GTACTACATA 
ACCAACAAGA 
GTCCTGAACT 
CTGGTGTGAA 
CTGGAGATTC 
CTTCCCAGTC 
TAGTACCAAA 
AGTCGCTACA 
TTGCTCTGAA 
AAGAGGCA6C 
AAGAGG6TAA 
GTATA6CA6C 
AT6CCAT6CA 
GAGTAGAACC 
TTTTQACTAA 
CTTTGTTTGT 
TGACCTTCAC 
CCAAGTCTGT 
GATCCATCAT 
CTGAGAGCCT 
TAGTGGCAGA 
GTTOGACATC 



6GTCACAGTG 
AAAGCTGCCT 
AACACTGACG 
TQAGGTTACT 
TCATGQATTC 
TGCTQTAATT 
TGCA ATGAAG 
TTTTAGCTCT 
ACCAGAGATT 
CCAGA6CAAA 
GAAGGCA06C 
GGGACAGCXG 
AGAAGCTGTT 
ACAGGAGACT 
AGTCTCAQGA 
GQTTGCAGTA 
GAAGAACGGT 
GGCTGCAGAC 
AGACATGATC 
AGGGATTTAT 
ATTAACTTTA 
GATTTTGTGG 
AGTGGATAAA 
AAACTTGATA 
CTTCTGGOGT 
ATGCTTTGTG 
6TTTGAGATT 
GGGACAATTA 
AAGCATACTG 
AATTATAAAG 
AlCATCTTTT 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



A67 Protein sequence t 
Gene nanie: 
T&iigene murbert 
Probeset Accession 
Protein AcceBsion #i 
Signal sequence i 
Transmeinbrane domains: 
Pfam domains I 
(744-889] 

Cellular Localization t 



ATPase, Ca** transporting, type 2C, mentber 1 

Hs. 106778 

N51919 

AAP27813 

none £ound 

60-82, 86-108, 252-274, 282-304, 757-779, 627-849, 856-678 

Cation ATPase N C6-83] , E1-B2 ATPase [69-324] , Hydrolase [328-649] , Cation ATPase C 

not determined 



MIPVLTSKKA 
SQFKNPLZML 
PBCHCVREGK 
KVTAPQPAAT 
PLQKSMDLLG 
TLALGVMRMV 
GVGYNQFGBV 
KGLDGLQQDY 
GQTLTLTQQQ 
TTLIASGVSI 
FYRASPRHKM 
LVDDDFQTIM 
IMIIMDGPPA 
ELRDNVITPR 
LVIYPPPLQK 
LEV 



11 
I 

SELPVSBVAS 
LLASAVISVL 
IiEHTLARDLV 
NO^LASRSHI 
KQLSPYSPGI 
KKRAIVKKLP 
JVDGDWHGF 
IRKAEYPFSS 
RDVYQQEKAR 
KMIT6DSQET 
KIIKSLQKNG 
SAIEEGKGIY 
QSLGVEFVDK 
OTTMTFTCPV 
VFQTESLSIL 



21 

1 

IIiQADU2NGL 
MRQFDDAV8I 
PaDTVCLSVG 
AFMGTLVRCX3 
IGIIMliVGHIf 
IVETLGCCMV 
YKPAVSRIVB 
EQKNMAVKCV 
MGSAGLRVLA 
AVAIASRLGXi 
SWAMTGDGV 
HNXXNFVRFQ 
DVZRKPPHNW 
FFSMFNALSS 
DLLFLLGLTS 



31 
I 

NKCEVSHRSA 
TVAILIWTV 
DRVPADLRLF 
KAKGWIGTG 
LGKDILEMFT 
IC8DKT0TLT 
AGCVCNDAVI 
HR7QQDRPBX 
LA8GPELGQL 
YSKTSQSVSG 
NDAVALKAAD 
LST8IAALTL 
KDSILTRNLI 
RSQTKSVFBZ 
SVCZVAEIIK 



41 

I 

FHQWNBFDIS 
AFVQBYRSBK 
EAVDLSIDBS 
BMSBFGEVPK 
ISV8LAVAAI 
KKEHTVTBIF 



CFMKGAYBQfV 
TFIiGIiVGXZD 
E6IDAMDVQQ 
IGVAMGQrar 
ISLATIiMilFP 
IiKILVSSXII 
GLCSNRMFCY 
KVERSREEKXQ 



51 

I 

EDEPLHKKYI 
SL6BL8XLVP 
SLTGETTPCS 
MMQAEEAFKT 
PEGLPIWTV 
TSDGLHAEVT 
EGALXALAMK 
XKYCTTYQSK 
PPRTGVKBAV 
LSQIVPKVAV 
DVCXBAADNI 
NPLNAHQXLW 
VOGTLFVFWR 
AVIiGSIMGQL 
KKV88TS8SF 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



A68 DMA SEQOaiCS 



Ubigene nuntbert 
Probeset Accession #t 
Nucleic Acid Accession %t 
coding sequence I 



bone norphogenetlc protein 7 {osteogenic protein 1) (BMP7) 

Hs. 170195 

BB616633 

NM_001719 

123-1418 (underlined sequences correspond to start and stop codons) 



GGGCGCAQOG GGGCCOGTCT 6CAGCAAGTG 
CTGCCACCTG GGGCGGTGOG GGCCCGGAGC 
OQATGCACGT GCGCTCACTG CX3A6CTGCGG 
CCCTGTTCCT GCTGCGCTCC GCOCTGGCOO 
GCTTCATCCA COGGCGCCTC OSCAOCCAGO 
CCATTTTGGO CTTGCCCCAC 0G0CCG08CC 
CCATGTTCAT GCTGGACCTG TACAACGCCA 
GCCAGGGCTT CTCCTACCCC TACAAGGCOQ 
GCCTGCAAGA TAGCCATTTC CTCACOQACG 
TGGAACATQA CAAGOAATTC TTGCACCCAC 
mCCAAGAT OOCAGAAGQO GAAGCTGTCA 



31 41 SI 

I I I 

ACOGAOGGCC GGGAGGGCGG CCTGCCCGCT 60 

COGGAGCCCG GGTAG0606T AGAGC0GGO8 120 

0GC06CACAG CTTCGTG6C6 CTCTGGGCAC 180 

ACTTCA6CCT GGACAAC6AG GTGCACTOQA 240 

A008606GGA 6ATQCAG08C 6AGATCCTCT 300 

CQCACCTCCA GGOCAAaCAC AACTOGGCAC 360 

TGGGG6TGGA GGAGGGOGGC GGGCCCGG03 420 

TCTTCAGTAC CCAGGQCCCC CCTCTGGCCA 480 

CCGACATGGT CATGAGCTTC GTCAACCTOQ 540 

GCTACCAOCA TGGAGAOrTC OG G TTT GA TC 600 

OGGCAQCCGA ATTCOQGATC TACAAOGACT 660 



861 



wo 03/042661 



PCTAJS02/36810 



ACATGOQGCSA AOGCTrOQAC AATQAQAOQT 
PGCACmOQ CA6GGAAT0G GATCTCTTCC 
AGGAGGGCTQ GCTGGTGTTT 
GGCACAACCT GGGCCTGCAO CTCTaSGTQO 
AGTTGGOGGO GCTGATTGGG CGGCAOGGGC 
TCTTCAAGGC CAOOGAGGTC CACnCOQCA 
GCCA6AAC0G CTCCAAGACG CCCAAGAACC 
AQAAOIGCAO CAGOCSACCAG A6GCAG6CCT 
GAGACCTQGG CTGGCAGGAC T6GATCATC0 
AGGGGGAGTO TGCCTTCCCT CTGAACTCCT 
AGACGCTGOT OCdACTTCATC AACCOGGAAA 
A6CTCAATGC CATCTCCOTC CTCTACTTOQ 
ACAGAAACAT GGTGGTCCGO OCCTOTQGCT 
TTGGGGCXyUV GTTTTTCTOO ATCCTCCATT 
CTGCCTTTTO TGAGACCTTC CCCTCCCTAT 
AAACATQAQC AGCATATGGC TTTT6ATCA0 
TCCTACAAGC TGTGCA6GCA AAACCXA6CA 
6C0GG0CCAQ GTCATTGGCT GGQAAGTCTC 
TTATGAGOQC CTACCAGCCA GGCCACCCAG 
G6GCACATTQ GTGTCrGTGC GAAAGGAAAA 
CAATAAAAOQ AAT6AAT0 



TCOQQATCAG COTTTATCAO OTOCTCCAOG 720 

TGCTOQACAO OOGTACCCTC TGGGCCT060 760 

CCACCAGCAA OCACTGGGTQ GTCAATCC6C 840 

AGAOOCTOGA TGGGCAQAGC ATCAACCCCA 900 

CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960 

GCATCOGGTC CA0GG6GAGC AAACAG06CA 1020 

AGGAAGCCCT G0GGATG6CC AA0GT6GCAG 1080 

GTAAGAAGCA OGAGCTGTAT GTCAGCTTCC 1140 

OGCCTGAAGG CTA0GC06CC TACTACTGTG 1200 

ACATGAACGC CACCAACCAC GCCATOSTGC 1260 

OQGIGCCCAA OOOCT QC T G T GOQGCCAOOC 1320 

ATGACAOCTC CAAOSTCATC CTGAAGAAAT 1380 

GCCACTAGCT CCTCXX3AGAA TTCAGACCCT 1440 

QCTCGC CTTO GCCAOGAACC AQCAOACCAA 1500 

COCCAACTTT AAAGGTOT6A QAGTATTAGG 1560 

TTTTTCAOTQ OCAGCATCCA ATQAACAAQA 1620 

GGAAAAAAAA ACAAOGCATA AAGAAAAATG 1680 

AGCCAT6CAC GGACTOGTTT CCAGAGGTAA 1740 

CXX3T0GGAGG AAGGGGGOTr GGCAAQGGOT 1800 

TT6ACC0GQA AGTTCCTGTA ATAAATGTCA 1860 



A69 Protein sequencer 

Gene name: bone morphogenetlc protein 7 (osteogenic protein 1) (BMP7) 



Iftiigene nuiiiber> 
Probeset Accession <|i 
Protein Accession #: 
Signal sequence t 
Pfam domains I 
Transmenibrane domains: 
Cellular XiOcalizatiom 



B8. 170195 
BB616633 
NP_001710.1 
1-30 

TGEbjpropeptide (37-261] 

none found 

secreted 



1 
I 

MHVRSLRAAA 
ILGLPHRPRP 
LQDSHFLTDA 
XRERFDNBTF 
HNL GLQL SVB 
QNRSKTPKNQ 
GECAFPLNSY 
RNMWRAOGC 



11 
I 

PHSFVALWAP 
HLQGKHNSAP 
DMVMSFVNIiV 
RISVYQVLQB 
TXiDGQSINPK 
EALRMANVAE 
MNATNHAIVQ 
H 



21 
I 

LFLLRSALAD 
MFMLDLYNAM 
EHDKEFFHPR 
HLGRESDLFL 
LAGLIGRHGP 
N8SSDQRQAC 
TLVRPXNPET 



31 

1 

FSLDNEVHSS 
AVEEGGGPGG 
YKKREFRFDL 
LDSRTLMA8B 
QKXQPFMVAF 
XKHBLYVSFR 
VPKPCCAPTQ 



41 
I 

FIKRRLRSQE 
QGFSYPYKAV 
SKIPEGBAVT 
EGHLVFDXTA 
FKATEVHFR8 
DLOHQDHIIA 
LNAISVLYTO 



51 
I 

KREKQREILS 60 

FSTQGPPLAS 120 

AAEFRIYKDY ISO 

TSMHWWNFR 240 

ZRSTGSRQRS 300 

PBGYAAyyCB 360 

DSSNVIIiKXy 420 



Cervical 



A70 DMA sequence 
Gene namet 
VOigene number i 
Probeset Access4.on fti 
STueleic Acid Accession it 
coding sequences 



bone inoxphogenetic protein receptor IB (ALK-6) 

Rs. 87223 

AA250737 

MH_001203 

274-1782 (underlined sequences correspond to start and stop codons) 



GGOGGGGOGC 
GAGGACGCGG 
QT6AAAGGAA 
CATAACCATT 
TGCCATAA6T 
AATGTGGGCA 
TTGCGTTGTA 
QACGQATATT 
GGTTGOCTAO 
AGAAGATCAA 
CTGCCTOCAT 
ATATCTGTQA 
TATAAAAGAC 
ATTCCTCCTQ 
TCA6GCCTCC 
ATTGGAAAAG 
GTGAAAOTGT 
ACAGTGTTGA 
G6GTCCTGGA 
TATCTGAAGT 
AGTGGCTTAT 
CATOQAOATC 
QACCTGGGCC 
ACTCGAOTTG 
AATCACTTGC 
GTTGCTAOGA 
CTAGTGCCCA 



11 
I 

GQA6TCGGCQ 
GA6CCGGGA0 
AGGAAGATCA 
TQQCTCTQAG 
GAGAAGCAAA 
CCAAGAAAQA 
AATGCCACCA 
GTTTCACQAT 
GACTAGAAGG 
TTQAATGCTG 
TGAAAAACAO 
CTGTCTGTAO 
AAGAAACCAG 
GA6AATGCCT 
CTCTGCTGGT 
GT0GCTAT6G 
TCTTCACCAC 
TGAGG CATGA 
OCCAGTTGTA 
CCACCACCCT 
GTCATTTACA 
T8AAAAGTAA 
TGGCTGTTAA 
GGACCAAAOG 
AOTCTTACAT 
GATQVGTATC 
GTQACOCCTC 



21 
I 

GG6CCTC6CG 
CQCAOGOGOQ 
TTTCATGCCT 
CIATGACAAG 
CTTCCTTGAT 
GGATGGTGAG 
CCATTGTCCA 
GATAGAAGAG 
CTCAOATTTT 
CACAGAAAGG 
AGATTTTGTT 
TTTGCTCTTG 
ACCTOSATAC 
GAGAGACTTA 
CCAAAGQACT 
GGAAGTTTGG 
AGAGQAAGCC 
AAACATTTTQ 
CCTAATCACA 
AGACSQCTAAA 
CACAGAAATC 
AAACATTCTO 
ATTTATTA6T 
CTATATGCCT 
CATGOCTQAC 
AGGAGGTATA 
TTATQAGQAC 



31 
I 

GGACGCG6GC 
GGGTGGAGTT 
TGTTGATAAA 
AGAGQAAACA 
AACATGCTTT 
AGTACAGCCC 
GAAGACTCAG 
GATGACTCTG 
CAGTOTGGGG 
AACGAATGTA 
GATGGACCTA 
GTCCTTATCA 
A6CATTGGGT 
ATTGA6CAGT 
ATA6CTAAGC 
ATGGGAAA6T 
AGCTGGTTCA 
GGTTTCATTG 
6ACTATCATG 
TCAATGCTGA 
TTTAGTACTC 
GTGAAGAAAA 
GATACAAATG 
CCAGAAGTGT 
ATGTATAGTT 
GTGGAAGAAT 
ATQAGGGAGA 



41 
I 

AGTGC6GAGA 
CAGCCTACTC 
GGTTCAGACT 
AAAAGTTAAA 
TGC6AAGTGC 
CCACCCCCCG 
TCAACAATAT 
GGTTGCCTGT 
ACACTCCCAT 
ATAAAGACCT 
TACACCACAO 
TATTATTTTO 
TAGAACAGGA 
CTCAGA6CTC 
AGATTGAGAT 
GGCGTGGOQA 
GAGAGACAGA 
CTGCAGATAT 
AAAATGGTTC 
AGTTAGCCTA 
AAGGCAAACC 
ATGGAACTTG 
AAGTTGACAT 
TaaACQAGAO 
ntSGCCTCAT 
AGCAGCTTCC 
TTOTOTOavX 



51 




1 

GCGCGGCGCT 


60 


TTTCTTAGAT 


120 


TCTGCTOATT 


160 


CTTACAAGCC 


240 


AGGAAAATTA 


300 


TCCAAAGGTC 


360 


TTGCAGCACA 


420 


GGTGACTTCT 


480 


TCCTCATCAA 


540 


ACACCCTAGA 


600 


GQCTTTACTT 


660 


TTACTTCGGQ 


720 


TGAAACTTAC 


780 


AGQAAGTGGA 


640 


6GTGAAACAG 


900 


AAAGGTAGCT 


960 


AATATATCAG 


1020 


CAAAGGGACA 


1080 


CCTTTAIGAT 


1140 


CTCTTCTOTC 


1200 


AGCAATTGCC 


1260 


CTGTATTGCT 


1320 


ACCACCTAAC 


1380 


CTTGAACAOA 


1440 


CCTTTGGGAa 


1500 


TTATCATQAC 


1560 


CAAOAAGTTA 


1620 



862 



wo 03/042661 



PCTAJS02/36810 



GGCCOCTCAT TCGCAAACOG GTGGAOCAGT GATOAOIGTC TAAG6GAGAT GGGAAAftCTC 1660 

ATQACAOAAT OCTGGGCTCA CAATCCTGOV TCAAGGCTGA CAGCCCTGOO GGTTAAGAAA 1740 

ACACTTQCCA AAATGTCAGA GTCXX3U3GAC ATTAAACT CT GAT AQGAOAO GAAAAGTAAG 1600 

CATCTCTGCA GAAAGCCAAC AGGTACTCTT CT0TTTGT60 GCAGAGCAAA A6ACATCAAA 1860 

TAAGCATCCA CAGTACAAGC CTTQAACATC GTCCTGCTTC CXy^QTGGQTT CAGACCTCAC 1920 

CTTTCAGGGA GCGACCTG66 CAAAGACAGA GAAGCTCXXA GAAGGAGAGA TTGATCGGT6 1980 
TCTQTTTGTA 6600GAGAAA C0GTTGG6TA ACTTGTTChA GATATGAT6C AT 

A71 Protein sequence 

Gene name: bone moxphogenetic protein receptor IB (ALK-6) 

Unigene number t Hs. 72472 / Be. 87223 

Probeset Accession «i AA250737 / U89326 

Protein Accession #: NP_001194 

Signal sequence: 1-13 ~ 

Transmembrane domains: 126-144 

PPAM domainst activin_receptor t30-lll) , protein kinase [204-491] 
Cellular Localizations ~ plasma membrane 



1 11 21 31 41 51 

I I I i I I 

MU*R8AGKLN VGTKKEDGBS TAPTPRPKVL RCKCHHHCPB DSVNNIC8TD GYCF1T1IEBD 60 

DSGLPWTSG CLQLEGSDFQ CRDTPIPHQR R8IBCCTERN ECNKDLHPTL PPLKNRDPVD 120 

GPIHHRALLI SVTVCSLLLV LIILPCyPRY KRQETRPRYS IGLEQDBTYI PPGESLRDLI 180 

EQSQSSGSGS GIiPLLVQRTX AKQXQHVKQZ GKGRYGEVWM GKNR6BKVAV RVFPTTBEAS 240 

ffPRBTBIYQT VLMRKENZLG PIAADIXOTQ SWrOLYLZTD YHSNSSLYDY LKSTTU2AXS 300 

MLKLAYSSVS GIiOILHTEIP STQGKPAZAH RDLKSIQIILV XKNOTCXJZAD LGLAVKPISD 360 

TNEVDIPPNT RVGTKRYMPP BVLOBSUOUI BPQSYXNADM YSFOLIXiHEV ARRCVSGGIV 420 

EEYQLPYHDL VPSDPSYEDM REZVCIKKLR PSPPNRWSSD ECLRQH3XLM TECNAHNPAS 480 
RLTALRVKKT LAKMSBSQDI KL 



Bladder 

A72 DNA SEQUENCE 

Gene names Homo sapiens type ZZ membrane serine protease mRNA 

T3^gene nunberx B8.6332S 

Probeset Accession #i AA411502 

Nucleic Acid Accession « : NM_016425 

Coding sequence: 1-1314 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I i 1 

AT6TTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TC6ATGTCAA ACCCCTGCGC 60 

AAACCCCQTA TCCCCATGOA GACCTTCAGA AAOGTGGGGA TCCCCATCAT GATAQCACTA 120 

CTGAOCCTGG OGAGTATCAT CATTGTG6TT GTCCTCATCA A6GTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCOGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CQAAGOGCCT 300 

GCA6T6GCAG TGCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTGGGCCACA 360 

GG6AACTGGT TCTCTOCCTG TTTCGACAAC TTCACABAAO CTCT06CXGA GACAQOCTQT 420 

AGGCAQATG6 GCTACAOCAO CAAAGOCACT TTCA6AGCT0 TGGASATTGG C0CA6ACCAG 480 

GATCTGQATG TTGTTOAAAT CACAGAAAAC AGCCAGGA6C TTOOCATOOG GAACTCAA6T 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGOCTGTGG GAAOAGCCTG 600 

AAGACCCCGC GTGTGGTGGQ TGGGGAGQA8 QCCTCTGTGG ATTCTTQGCC TTGOCAQGTC 660 

AGCATCCIUiT AG8ACAAACA GCA06TCTGT GQAGOGAGCA TCCTOGACCC GCACTGGGTC 720 

CTCAC66CA0 CCCACTGCTT CAGOAAACAT ACCQAT6TGT TCAACTGQAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGQCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCCAA AQACAATGAC ATOGCCCTCA TGAAGCTGCA QTTCCCACTC 900 

ACTTTCTCA6 GCACAGTCAO GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTGCA 960 

GCCACOCCAC TCTGOATCAT TGGATQGGGC TTTA06AA6C AGAATGQAGG GAAGA1GTCT 1020 

GAGATACT6C T6CAGG0GTC AGTCCAGOTC ATTGACAGCA CACaOTGCAA TGCA0ACX3AT 1080 

GCGTACCAGG GGGAAGTCAC GQAGAAGATG ATOTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

QACAOCTGCC AGGGTGACAG TGGTGGGCCC CTOATGTACC AATCTGACCA GTQOCATGTG 1200 

GTGGQCATGG TTAGCTGGGG CTATGGCT6C GGGGGCCCGA GCACCCCAGG AGTATACACC 1260 
AAGGTCTCAO CCTATCTCAA CTGGKICTAC AAT9TCTGGA AGGCT6AGCT GTAA 



A73 Protein sequence! 
Gene names 
Unigene nunbers 
Probeset Accession #s 
Protein Accession 0: 
Signal sequence: 
Transmembrane domains: 
UDLa domain: 
Tx>TL.^Pc domains 
Cellular Localization: 



Homo sapiens type II 

H8. 63325 

AA4 11502 

NP_057509 

none found 

31>S3 

54-94 

204-429 

plasma meirbraBe/BR 



serine protease mlRNA 



1 11 21 31 41 51 

I I 1 I I I 

MLQDPDSDQP WSLOVKPLR KPRIPHBTPR KVGIPIZIAL LSLASXXIW VLZKVILOKY 60 
ypLCOQPLHP XPRKQLCDGB LDCPLGBDBB HCVKSFPBC3P AVAVRLSKDR 8TLQVU>SAT 120 

863 



wo 03/042661 



PCTAJS02/36810 



GNHPSACFDN PTBMABTAC RQNGYSSXPT FRAVEXGPDQ DLDWBXTQf GQELRMRNSS 180 

GPCLSGSLVS LHCIAC3CKSL RTPRWGQEB ASVDSHFHQV SIQYDKQHVC GGSZLDPHWV 240 

LTAAHCFRKR TDVFNWKVRA GSDKLGSFP8 LAVAKIIXIB FI7PMYPKDKD IAU4KLQPPL 300 

TFSOTVRPIC LPPFDEELTP ATPLWIIGWO FTKQNGGKMS DILLQASVQV XDSTROIADD 360 

AYQ6BVTBRM MCftGIPBCSGV DTCQGDSGGP LMYQSDQWiV VGIVSNGYGC GGPSTPOVYT 420 
KVSAYLNWZy NVHKABL 

A74 DMA SEQUENCE 

Gene name: EST8, Weakly similar to 0GHU7L collagen alpha l(liz) chain precursor [H. sapiens] 
Unigene nuitibers Hs. 19322 

Probeset Accession #> AA08845B 
Nucleic Acid Accession tf: AA088458 

Coding sequence t 662-1995 (underlined sequences correspond to start and .stop codons) 

1 11 21 31 41 51 

I I t 1 I 1 

GCCCTT6GAC ACTGACATGG ACTGAAGGAG TAGAATGGAG CACGAOGACA CTGACATG6A 60 

CTGAAGAAAA A66AGCTGGA GCAGGAGAAG GA6GTGCTGC TGCAGGGTTT GGAGATOATG 120 

GCOCGGGGCC GCGACTGGTA CCA6CA6CAG CTGCAACGAG T6CAGGA6GQ CCAGCGCCGC 180 

CIGGOCCAOA GCAOAOCCAG GGOOQACTTT G6G6CTGCA0 6GAGCCCCG0 GCGACIGGGO 240 

CGGCTACTGC CCAAGGTACA AGAGGTOQCC OGGTGCCTGG G6GAGCTGCT G6CTGCAQGC 300 

TGT6CCAGCC GGGCCCTGCC CCOGTCCTCC TCCGGGCCCC CCTGCCCTOC CCTGACGTCC 360 

ACCTCACCCC OGGTCTCGCA GCAGCAGACC ATCCTCATGC TGAAGGAGCA GAACOGACTC 420 

CTCACCCAGG AGGTGACOGA GAAGAGTGAG OGCATCAGGC AGCTG6AGCA G6AGAA6TC6 480 

608CTCATTA AGCAGCTGTT TGAGGGCOGC GCCCTOAGCC AGCAGGAOGG GGOAGCTCTQ 540 

GATTCCACCT TCATCTAOTC CTTGTGG6GC 60GTX3G6GCC CCAGGGCCAG OCTGGCACTC 600 

A6CCCTTCGA GGGTGGOOOC CCCATCQCAC CCACCCTCTC TGGCTGGAGA CCCC06GCAG 660 

GCCCAGGCAC AGTCCC3GGAQ TGGGCGCCTT CCTGCCGCCC TTCCCAGATG GGCTCCCCAQ 720 

GGCTQCCCCC GGCTGGTCCC CGCACGQAGC OCTTGACTCC GTTTKGGCTC CT QQ TTOYTQ 780 

ACATGGGCTG GGGOCTCTCT TGAGTCOGCA TAOTCOGCAO CTACTACTGG CCGCTGTCAO 840 

TGGACAGTG6 GGTACCCCTC CATGAGITAO CGTCCCGCCG TTTCCAGCGO T6CC6GCCTG 900 

GGTOCCATCT TCAGGQAAAG GCACT6CCCA CGCCAGGCTG CACTTCCAAC AACGGGCAGC 960 

AGAGGGOGCQ GGGCGGCTCC GACGCGGGTC CAAGGGCAGC TTCCCGCTCA ACCAGGGCAC 1020 

CAGGACGAGG TGGC TGTAGC T0GGA06GAC GQAAOTAGAT GGAGGGGGTG GGGACGGCCT 1080 

OTAAO0Q66G GQ1X3CCTGCC TG6CTGGGGA 6CCCCAG0GA TA60G6TCGG ACTTGAOGTT 1140 

CTGGCCAAGa CTGAGGOACC CTG6CSGCAG C6GAT0GGCA 06CCQGGTGG GOQAOAGCTT 1200 

GGCCTGCATG TGCCTCCCAC AGACCCTGGG GTGATGGCCT TCCCCCTCTT GGCOGGGAOG 1260 

TTGCCCCACO TTQAOTCCCA CACAACATCC TGTGAGCCTG GCTCOCCAGG AGGGCCCCCA 1320 

GACAGCTCCC AGGCACGTCA TAGGCAAAOC CTGTTTCCCC OQACTCAGGA TTTCCAAGGC 1380 

CTGGGGTGCT GCTCACCCGC CTTTQCTCTC AGSCCCAGCC TOTOCGCAGO TTTCAGCTGG 1440 

GAGAG6CCAC CTCCCTCAaC CAAOGAAAAC OAOAACCCCC AGGGTACAGO AOQAGGCTGO 1500 

GGCAGGTCCC CTTGGGTGTC ACTCCCTCAG CCCCTGCCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTACGCA CTGGTGGGGO GGCCCTOCTC AGCCCAACCT GQAGGQTCCC AGTGTCACCA 1620 

GAAC CAGG GG CACGGCAACA GCATC6ATX3G GTTCTGCA6C CCAGGGCCCC 06ATGCGGGG 1680 

TCAOTGTGTG TGGG6C6CA6 OGCCTCOGAT GGQGG6TCAG TG0QT6GG6Q 0CGGAG6G0C 1740 

CCCGAT6CG6 GGTCAGT6C6 TGGG6G6CGC AQGGOCCCCT OGTGTCCAGG OCACTTTGGT 1800 

ACACTGTCCC ACAAGGCACC TGTCTCAGAG GAGGGGCCCT GGCAGGCAGC OTGGCAACrc 1860 

CCTTCCGGAG CCCAGCTCCA TQCTAACCTG CCCACAGCAA CCCCACAQAO CCACATTCCC 1920 

T GCTGC ACCT GGTCTGCAGG GGTGTCCCAO GACAGGCCCA A6TCAGCCCA GCATGCAGCT 1980 

GCCCTCCTAC C CTOAft QATQ 6QAGT66GCT TTCCAGGGQA CA33UU38ATG TCAGGCCTGG 3040 

ACCTCCTGGG CAGGAAAG6G TGCAGGTCCT GAGG6CCTGT GCCCCACM3C CCCAGCACCC 2100 

AGOTGOACTG CAQOGCAGTG GOTGGGCCAG TGGCAGCCAG GGAGAAGCCC CCOGTCAGCA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA OGTCTGCCTT TGAGGGTGCC TGCCATGCCC 2220 

TQGGGQATCC TGGCATCTTT ACTG6ACZGG AAGCAGGAGA CAGAACM3TG TCTG3XXXX3G 2280 

GGTQACrrCA TCAGGAGACC GCCCACATA6 A6CTGGACGC CQCAGCTGAA GOGGAAATGT 3340 

GAGAGAGGCT GGCACCTCCG GAAAAACT6C CTTTCA6CCT TGGTGTTGOG TGCAAOQTGA 2400 

AAAGAAATAO GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGTGG CCCTGOAGAC 2460 

CA0GA6GGGA GAATTTAAAG GCCCGGGCTG GCA6GGTCTA GGTGGCTGGC AGAGGCACAT 2520 

GCAGACCCTG OCTGGAGCCT GCCCTAGGAC GCTGGGOGGG TGAOTCTCOG TGCAGGATGT 2580 

GAGCAGG8TC CCTGGGCTCT ATOCG06A66 TGCCA6TAGC GTOTGCAGGT ACAXACAOGT 2640 

GCSTGCACAC T6TGATGACA CC08QAAAT0 TCTCAGGATG TTGAAATGTQ TCCTTGGGGG 2700 

CAGAAGTGTC CCCAGTT6AG AATCTGCCCC AGAGGAACAC ACCCACACCA GGCCTCAGGA 2760 

TTTTOTGTTG ATCAAGTTCC AAGGAAAA6Q AACATCTCAG CCGGGCGTGG TGGTTCftOQC 2820 

CTGGAATCCC AGCACTTGA8 GCCAGGA6TT GCAGAGCAGC CT6GGCAAG0 CAGTGA6AGA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AOAGATCCAa OTTTAAAAAT 2940 

TCATAAACAC CAOVAOOAAA CAATACACTA TGAGAOOCAG CAOAAGCAAC A0ATT6ACTC 3000 

TAGAOCCAGA TACTAGAATT ATCAGA6AGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 

AOAAAtAAAA GAQATTTCTG OAAACATGAA AAAAAA 

A75 OKA SliiQUiaiCB 

Gene name: 8CCA2 or 2b, serine (or cysteine) proteinase inhibitor 

Unigene nuinben Hs. 227948 

Probeset Accession 8: AB03S089 
Nucleic Acid Accession Ut AB035089 

coding sequence I 9845-10219 (underlined sequences correspond to start and stop codons) 



1 11 21 ' 31 41 51 

I i I I I I 

GG6CATGCAO CCATCGGGGA AAATCCATAG T6CAGATAAA 0CAA6GAGSA AGAAGAAGGA 
GAGTTCZAOT AAAACGGAOA ACATCAATAT AGGATOTTTC TTAQCAATAG AAAAAGAAGG 
CCAAOAGQAA TTAGGQAGAO AOTTATAAGA GATCAGCAAa GGOACAaOGT TAOATTTOGT 
TTGGTTTQAA AGCATACAGT AAATATQAIG TCTOTOOCTG GCAOTOTTGG CAGAGTAGOA 
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AGQAGGAAOQ aAOGCAAOAG ATAATATCM mCi C A t yro CTOCAACTOT ACTTACATAT 300 
OAGACTATTT C C CrCTCTGC TTTTCAAACC TTACTOOAOT TOTTTTCCCT CATQAAAACC 360 
AAGAAAOGAA AGCTAGTTAG TCTTGTTCTO AQQTTOTTCA ATGTATACAT ATCTATATCT 420 
GTAGACAGAA TCCTTGGGAA TACAOTAATT GACATATATT CTGTTATTTO ATGCTTGAAA 480 
AATCTCCTCC ACTAACCAGT TTCOCTATAG ATTGOCACAA QCACATAATA AGAAACAATA S40 
AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CA6CCTTCAT 600 
AAGGAAGTQA GGTCCAG6AA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAA6GCATTA 660 
GTAAT6ACAG QATATTTCCT GAAAGTGTAA TTTCCCATTQ AGGATTTGTT TTTAATTTCT 720 
GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 
CAAGTOTTCA TATGCAAAAA CTTCTTOOAA TTTCTGAGTT CTCTOTGaCA ATATATGACA 840 
TCAG6ATATG TCCAGTCTCA CACACCAOGA TATGTOCTTT CTAGOCTOTC TATCACAT6C 900 
TAOGAGAACT ATTTAGGAAC AGAAAAAAAT 6CCTGAAATG ATTTCTCATT TQAACTCATC 960 
CAAGCTTTCT CTAAATTTAA GCAAACTOCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 
TTCAACCTTC AGGGCAAACC TCOGTGCCTC AGA OG T T T A G CCATAGTCTG AAATTCTCTT 1080 
CCATAGATTG GTCCX:CTOTA ACCCCOOTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 
CCTOOVTTCC CAQGATGA6C TT6TT0CTTC T6T0CTATGA GACATTAOAT TCCTTTTCTT 1200 
TGQTACC06A OTAAATOCAT CCTACTCCAA TAOAGGAAGO TCCATTTTTG TCTTATAGOG 1260 
CTGGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTC3VGATA 1320 
TTTCCTCTTC TTTCTTTTTC TTCTATCTTT OOATTTTTAG TCCATCAAOS CCXCMTTMyt 1380 
CTATTCrCCO ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 
ATATA7ATTG ATACAG6AGA CCTAAGAAOA GCATGTCTTG 6GGGTTGA00 AAACAGGCAG 1500 
GT6AGAAATT TCCAQATTGG AAACACA6CT TCCTTTCTCC CATCC3V6CCC CTA C TTTC A G 1560 
CCTATQTGTT TCTQGCACCT TGTTGTAGAT AAATCTCTCT TGACTTTGTO ATGTGCTGAG 1620 
AAAACAAACT CAOGGCTGOT GTTAAAAAGG GCCCATQACA ATACCAAGTG TTGGGGAGAA 1680 
TGTG6AGAAA TCAGAACTCT ATTGA066TC GGTTGGAATG CACACTTGTO CAGAATTCTA 1740 
TGOAQAAGAG TCTOGCATTT CCTCAAAATO TTAAGCTGQA TTTACCATAT QACCCAGCQA 1800 
TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA T6CCATGCAA AAAAATGTAC 1860 
ATQAAAGGTC ACAACATCAT TATTCATAAT AOTAAAAGGA TGGAAACAAC ACAAATOTCC 1920 
ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TG6AATATTA TTCGACCACA 1980 
AAAAGGAATG ATGTACTGAT CCATGCAAT6 ATGTGGACAA AGCATGAAAA TAACACTAGA 2040 
TTAAAGAA6C CA6TCACAAA AGGACTTACT GTATGATTCC ATTrACCTGA AATGTTTG6A 2100 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAA6G6A 2160 
AQAATGAAGT ACAAGATTTC TTTTQGAGGT AGTQAAATTG TFGTGQAATQ AGATCATGAT 2220 
6ATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAOTT GAATTTATGG 2280 
TATATAAATT ATATGTTAAT AAAAAGGGGG TGCACAAAAC AAACAGCCCC CCACTCTGGT 2340 
TGTCAGGGAO ATATTGGATT AAATGGCCTT aOAOVACAAC CCCTCTGCCT GGGCACAGAC 2400 
ATTCTTCAGA TTACAAQATA TTCCAGGGQA AACACTGGAA T6AGTCI0AA GCCAGGTGCT 2460 
AAAGAOAAGG ACCATTGAQA AATGTTGTQA TCCTQACAGG TCAAOCAATT TATTTTTOGG 2520 
CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCX3VTTTAAA ATGGCCCGTC TGTTTCAATT 2580 
OCTCTTCTCA OTGTCAGCCT 6TTAACTCAA TGTGTTAGTC TGTTTTCATQ CTGCTQATAA 2640 
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGO GCTTAGAOTT CCAGGTGATT 2700 
GGGGAGOCCT CAGAATCACA 6TAGGAGGCA AAAGTTATTC rTACATGOTG GCTGCAAGAG 2760 
AAGATGAGQA AGAAGCAAAA 6AAGAAACCC CTGATAAACC CATGGGATCT CCTGAGGCTT 28?0 
ATTAACTATC ATGAGAATAG CACAAGAAAO ACCGGCOCCC ATGATTCAAT TACCTCTACC 2880 
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940 
GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 
AACAGAAAAC CATCTGGGAT GGTTOTAAGG GGCACAGGAA OTGACTGGTA OQATCACTGC 3060 
CAAAGCTGAO CACTCAGQAG AAGGC3\ATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 3120 
ACTGAAGTAC ACTTCTTCAC TATCTCTTTG QACTTAGAAT TAOCACTACA TTCCTTGTTA 3180 
TACAGAAAAA TTACTAAGGA AATTCAXAGG ATGACAAAAA CTTTCA6AAC TGAAAAACAG 3240 
GAAATGTAAG CTTTTTAGTT CrTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 
CAAGAAAAGA ATGOTGOaGT r m WAlXfr TIX SG TTTT Q T TTTTOTTTTA CAOCTOGAGT 3360 
AGAATACAAA GGGATGGAGT TGAAACAAAT QAQAGQAAAT TGGAATTCTA AACTTATTCT 3420 
CATTGGCATT AGAAAGGC3VC CTACATGTAT TTCACATQAG aXSGTQACTG CTGACTTGCA 3480 
TTCTTATTTT TTCCCTATAO ATTAAAAAGO AGGTACAAT6 GTAGAACTGT AATCCTQTCC 3540 
TTTOTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 3600 
T6A0TAACTT CAAATACTAA CCACA6AGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 
GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 
GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 
AATAATX3ATA CTGTCTTCTA TATCATGCAT CTCCTGCATT CTGTCrGATT ATATTTTACT 3840 
TATTCTGCCA QAGCAAAATT AAAATACCTA TTTCATCTGA TTTOTCCTTT ATCTAAATTG 3900 
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 
GAGAGTCTTO AAGGAGATGT CAGG6A0GCA TCTTAACAGC TGGTTGGATQ TGATCCACAG 4020 
AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTQT AACCAGCAAT 4080 
GAAAGAAAGA TAAAGAQGGT OQATTACTTA TTTACAATAO TCTTTAAAAA OQTAGTTTTG 4140 
TAA6CCTTCT AATTAGGACA TTAATATATT TAATATAT6C ACATTGTAGA AAGATTGAAG 4200 
GGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 
TGATGCTTTT CXaiGQAGTTC CAGATCACAT CGAGTTCACX: ATGAATTCAC TCAGTQAAGC 4380 
CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAO AGAACAACAT 4440 
CTTCTATTCC OCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAO GAGCCAAAGA 4500 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATrAOGTTGT CCTGTTGCAO 4560 
rrTTTCTCTG GTrCXXTTOGG CTAGCAOGCA GATGGTAATA CATGTGGTGG TCTGATGGGT 4620 
AGGACAG6G0 GCT6TGCAGG AATTCCCATA ACTGTGAGAC GACT6ACTTA AACAGATCTT 4680 
TTGAGTAAAO TTTTCTTGTC OOOCTTCATO TCTCTTCCAG GTTCTTCACT TTGATCAAOT 4740 
CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATQTO AGTCACAGAO CACTCTQATT 4800 
CAGCTTTAGA TCCCTGAACA GOTCATAGTT TAAACCTGOA ACTTCACAAA AACTAAGAAA 4860 
AGGCX:AGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAQAQT GGGTTGGCAT 4 920 
TTCATGGCAC ATAATTATTA TTCCTCATTT CTGOGTTACT AAAAQACAGT CA0CACT6TA 4960 
CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGOTTCAO ACTCCAQCTT TGCTCTTCAC 5040 
AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTOX AGTGACCTCA TCCCAGAAAA 5100 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATOT TTACAACAGT 5160 
6CCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5230 
TAATCATAAT AATGTGAAAA TAATTTAATT TTCATT6AGT CATTAATQAO ATTCAGAGGA 5280 
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ATAAGOVCAA GTCCAAOTAT ATTTTGGAAA ATQATTGCTA TOQAATATAT TOGTTTAGAG 5340 
CCTTAATAGT GCAAAATGCT TTGCTG Q AAG GTA6AAAGTT CTAGATTTAA ACA6GCTTA0 S400 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA G UU ' lTl ' liii CCXXSVTTCTC 5460 
TQAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAOACTTA GAOTTACCXyV TOTAAAGTOC 5520 
TTAOCCATOQ ACCTOGCATA CACTCTTCTT ACX3TGCA0A0 AATGACCATC ATGAGGAAA6 5580 
A6GCACA6AT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAOOGCTT 5640 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TOGTATOCTT GATGACTGTT 5700 
AGAAGTGAAA TATGQTCCTT GCCCATAAGG AOCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 
AACCCTTTAA ACCAACAAGG AOAAAATCTA CTGGTAGACA GCGCTGCRTC TTTAGTTCAG 5820 
AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAA6AA0 TCAAATATAA 5880 
GGTGGATTTT GAAGGGTATT T8AGGTGAAA TACACCAATT ATCAOGGAAT AACATCAAAO 5940 
GTCCTCAATG AGACTACCAG CATTTAGGGA CTQATCTAAC AGACTTAGCA TGGGTTTAOT 6000 
ATTTACATTG ATACACCAAT TGAATGATCT CCTrmTGA TGTTT6AAGG TTGATAGGTC 6060 
AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 
ATATOAGCTG AAGATOGCCA ACAAGCTCTT OSGAGAAAAG A06TATCAAT TTTTACAGGT 6180 
AATTTCAGCT G6CCTACCCA CATTTCATTT GCATGCTQAT GTCTGTOTCT CTOnOTGGCC 6240 
AAATQQAAOA AAOCAAOQCA GATGAGCCTG GCCGACCCAO GTGQA6A6CA TTTACTCIV6A €300 
6TGCATTAGC TCCATTTCCA CAACTCTCCC OCACTGQAGT QTCCCAGACC CCAACGATAC 6360 
ATCACTQAAO TQT6GATTTA GGQATAATCT TGTQATAAAA GAGGAGGTTG TGTAATAOAO 6420 
TGAGTAAGAG TAATAAOTAA TAAGATACCA TCX3ATAAACT GGCACTGACT CAGTCACATA 6480 
OQATACATCT TGGTGGGAAA TGTATGACTA ATG6QATATT ATTGGAATG6 GCAGGCTTGG 6540 
GTGA8TTCCT GAGAATAGTT GAGOAAGTAC OUSQAAATAT TGAAT6CACA GGATGAAAGA 6600 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT QAGAAGCAAT 6660 
GAATCTCCTT CAG6GAAGCC TGCTCTGCAG TTT6CAAACC ACAGCCTCTT CTGCTTCTGC 6720 
CTTTTGCCAA GATGATATTG ACCTTCAGTO ACCTCTTTCT TOTGCCAGCC CACATTCCCC 6780 
TTTTQCATTG CCTACATGAC ACCTOTATAA AAATATCCAT GOACAGQAOA TACTGCATCT 6840 
ATTCA6(X3TC TGGATTCAGC TTACT6TTGT TACAAATAA6 TAAGTTT6GT AATATATAGT 6900 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGQAAT ATTTAGATOC 6960 
CATCAAQAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCXyVGAAQA 7020 
AAGTOQAAAG AAGATTAACT OCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7060 
TTATAGAAAC ACTTTTQAGA AACCTATGCX: AGTGAGCCTT GTGCTTGACA CTGCATOGGG 7140 
GAACAG6TGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGG6CA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAA6AGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 
GCCAATTAGA TGGAAACACA TCTGGA6AAT TATTTGCTTA TGGCX:CTOCA TGACAATAGC 7320 
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAG06TC TCTOGTCTOC 7440 
TTTACTAATT GGGAAACAAG CAQCTCTCTQ OTAAATCACC CTTTTGTCTC TGAGCTG6AG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCT6CAGGGA TTATCACAGC TCTCTTCCCC 7560 
A-TCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCTCATTAGA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATAGCCCGG 7660 
GAAGTAOTGT CTGACAGCAC AGQACATGGQ TTTCATATTA CAGAGCTCAA QTCACTCATC 7740 
CTAAAATOCA A7CAQG0CCT CCTTCCTCT6 AATGGG6ACC CCX3TAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGG6A66 6A6AAAGGAA AGACACAT6T TGGAAGAGTA GACAAAATCA 7860 
GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAQAGAA CGTGAACrCC 7920 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATA06ACT GGAGCCTIGA 7980 
AGAAAGCTGC ATCTGGTGAA OCACTGGTCA GIGAGTCTAA CAATTGAAAG ATCAAABTCA 8040 
GTGAQTCTCA AGCAGGGATT TGGGTCAATA ATTAAOGATC AGTCACQAAC ATTTGCAAAG 8100 
CATCTTCCA6 ACAAGCCATT T6TAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
6T6AA06CAA TCTATTTCAA AGGGCASTGG GAQAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGQAAAAAT TTTGGCOUU^ CAAGGTATTO TCXA7ATTTT ATTTATATAG TGTAATATGT -8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATrTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
GAAAAACTAT TGTTTCXAAC TCATGQAATT CCTGGOTTAT TTCTTAOAAO AAGAAGGATG 8520 
TGTTQCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCAOGTGTTA TTTGTTGGAC 8580 
ACA TTQATT T ATTGCAQAAT ACATACAAAT CTOTACAGAT GATQAGGCAA TACAATTCCT 8640 
TTAATTTT6C CTTGCTGGAG GATGTACAGG CCAAGGrTCCT GGAAATACCA TACAAA6GCA 8700 
AAGATCTAAG CATGATTGT6 CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 8760 
CTTGCATCTA CAACTCTTCX: TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 
CAAG6TAAAA GCTTATGACC QAGTTGCCTC AAAAT6ATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACICACC TTCATATTAC AAATATTTGA GCATA6G6CC TGACACAAAC TGAAAGCTTA 6940 
GTTTTTGTTT tfllWmtJT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGQGTAC 9000 
ATGTGCACAA TGTGCAGOTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120 
CACCXX31CAA CAGTCCTCAO AGTGTGATGT TACCTTCCTO TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTAT6ATTT AATT CCATCT ATGGCTTAOT TAATQATTAA TTTAT TAGAG 9340 
TTACATQCAT TGGATATCAA TTTQATQATA TTATTATGCA OCAATTTAAA CTTGACTGG6 9300 
AGAAATATAT ACCAATGTGA GGAAA6TTTA CAAATA6GCC GAGTAGAAAA GGGAATACAA 9360 
ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAATATX3 ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCA7 9540 
ACSUnrOTAT ACATACATAT GTACACATAT ACATATAOGT AAAAACATGA riCI W rTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATCTAT TTATATATTC AGGACTCATA 9660 
TTTTACCTAT TAGAATAATA ATQTCTATTA AAOT6AACCT TCTQTATTTC ACATTTATTG 9720 
CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATGGACAGT 9780 
TAGTCATATC AGTTTCnTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTQAQAA 9640 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAQAQAGACA TGTSTaSATT TACACTTACC 9900 
TOGGTTCKaA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 
QAATATCTTC AATGGGGATG CAGACCTCTC AGaCATOACC TGGAGCCACO GTCTCTCAGT 10020 
ATCTAAAGTC CTACACAAGG CCTTT G TGGA GGTCACTGAG GAGGGA6TGG AAGCTGC3VGC 10080 
T6CCACX36CT GTAGTAGTAG TOIAATTATC ATCTCCTTCA ACTAATGAAO AGTTCIGTTO 10140 
TAATCACCCT TTCCTATTCT TCATAAG6CA AAATAAOACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAOT CTGTCACTCC ATTTAOAAAA TQTTGAGCTA 10260 
GAGQTOTTCT QGTAAACT6A TTQCTG6CAA CAACAGATTC I fi'lta UC 'ltJ A TATTTCTTTT 10320 
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CTATCTCATC TTGATGATGA TAGTCATCftT GAAGAATTTA ATGATTAAAA TAGCMGOCT 10380 
TTCTCTCTTT CTCTTAATAA GOCCRCATAT AAATGTACTT TTCCTTCCAO AAAAATTTGC 10440 
CTTGAGGAAA AATCTCCAAG ATAAGATOAA TCATTTAATA COGTGTCTTC TAAATTTGAA lOSOO 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 10560 
5 ATTTAGCAAC CTAGAAACAC ACATTTCTTT QAATTTAGOT OATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTT6TGATTC TATAAAACAC ATCATCAATA AAA1AATGAC ATAAAATOVT 106B0 
TTTTGCTTTA CCrOTTTTCT CTCIGGAAAG QGOMStOrtC CAGTTACACA TAGC3AAAQAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGA6TA0 7TCATGATGA 10800 
GCCTGGAGTA QAAGGCATAT CCCAGAACAG GAGGAQCCTT GTAAACCACA TAGGAACTTC 10860 

lU CTATTTTATG CTAAAG6GAT AAGAAACTCA TTACAOOCTT TGATG0TT6T TTCTCAAAGA 10920 
G6G6CATAAA ATTATCATAT CCACATCTA6 AAAATACATC TCTGGCTA08 CT6ATATCAA 10980 
TGQATOOQAO GAAAGAACA6 TGT60TTACC ATATATAAAT TAGQAAATCA TTAGAGTATT 11040 
G66AGTGGAA ATGGAGA6AA AGAAA6AGCC T6GGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAT CTAAOTTGCT GACCTATCTG ACTGGATGGA TGGAAQAATA TCTTGTTTCT 11160 

15 GAGA6AAAAA AAGACTTTGG 6TTTAAATTT 6TACTTGATG AATTAA6GTA CTTTTAATAT 11220 
TCAAATGOAT TTGCC1G6CA G6CACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT OTQATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATGGTAOTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAQAAGAAT CTAQAATT6A CCTTQAAGTA TATCAGCATG TGTAAAGATC AOGAATTQAT 11460 

2v CATTTTTATT TTCCAGAAA6 TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 

A76 Protein sequence t 

25 Gene namet SCCA2 or 2h, serine (or cysteine) proteinase inhibitor 

Uhigene nuntbert Hs. 227948 

Probeset Accession ff: AB0350B9 

Protein Accession BAB21525 

Signal sequence: none found 

30 Transmembrane domains t none £ound 

Serine Proteinase Inhibitor domain t 13-390 

Cellular Iiocalizationt secreted 

35 1 11 21 31 41 51 

I I i I I I 

MNSIiSEANTK PMPDLFQQFR KSXENNXFYS PXSITSALGM VXiLGAKDirrA QQISKVLHFD 60 

QVTENTTEKA ATYHVDRSGN VHHQFQKLLT BFNK8TDAYB LKIANKLFGG KTyOFLQEYL 120 

DAIKKFYQTS VESTDFAHAP EESRKKINSW VESQTUEKIK KLFPDGTIGN DTTLVLVNAI 180 

40 YFKGQHENKF KKBMTKBEKF WPNKNTYKSV QHMRQYNSFN FALLEDVQAK VLEIPYK6KD 240 

LSMIVX2.PKB IDOLQKLBEK LTABKUHEWT 8LQNMRETCV DLBLPRFKME ESyDUGXTLR 300 

TTtGMVNIFNG DADLS6MTWS HGLSVSXVIiK XAFVBVTEBQ VEAAAATAW WEIiSSPSTO 360 
EBPCCNHPFL FPIRQNKTMS ZLFYGRPSSP 

45 A77 DMA SEQUENCE 

Gene name: hypothetical protein FLJ13459 

unigene number: none found 

Probeset Accession Ut XM_047266 

Nucleic Acid Accession : XM_047266 

5U coding sequence: 485"1471 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

) i I i I 1 

CrOACCTCAA GTGATCCACC CACCTCTGCC TCCCAAAGTG CTGGGATTAC AGGTQTOAGC 60 

55 CACCATGCCA GGCCCTCTTA ACCTCTTCAA GTCTGTTTTC TCATCTGCAA AACAGAGGTA 120 

ATAAGATCAG TATCTTCTTA ATQGAAOCAC CTGGACTACA TTTTTTTGAT TCATTGTTAT 180 

CATAAATGAG GACTAACCTG TCTCCCGTTG GGAGTTTTQA ACCTAQACCT CAT6TCTTCA 240 

TGACGTCATC ACTGCCCCAG GCCCAGCTGT GTCCCTACAC CAGCCCCAGC TGA06CATCT 300 

TCTTTTTCTO CCTGTAQAGA TGGTTACAAT GCCTGGOGTQ ATGCATTCTG GCCTTOGCAG 360 

60 ATCCTGGOaO GGCTGTGCCA AOQCTGTQGC CTCCCT60CC CTGAATACOG AOGOQGTGCr 420 

GTCAAGGTGG GCAGCAAAGT CTTCCTGACA CCACOGGAGA C0C16CCCCC AGGGATCTCT 480 

TCACATQTGQ ATT6ACATCT TTCXTTCAAQA TOTQCCTGCT CCACCCCCAO TTQACATCAA 540 

GCCrCGGCAG CCAATCAGCT ATGAOCTCAO AGTTGTCATC TGGAACAOGG AGGATGTOQT 600 

TCTGGATGAC GAOAATCCAC TCACCGGAGA GATGTOQAGT GACATCTATG TGAAGAGCTG 660 

65 (3GTGAAGGGQ TTQGAGCAT6 ACAAOCAGGA GACAGACaTT CACTTCAACT CCCTGACTGG 720 

GGAGGGGAAC TTCAATTGGC GCTTTGTGTT CCGCTTTGAC TACCTGCCCA CGGA60GGGA 780 

GGTGAGOGTC T0Q08CAGGT CT6GACCCTT TGCCCT6GA0 GAGGCGGAGT TCOGGCAGCC 640 

TGCAGTGCTG GTCCTGCAGG TCTGGGACTA TGACOGCATC TCTGCCAATG ACTTCCTTGG 900 

ATCCCTGGAG TTGCAGCTAC CAGACATGGT GCGTOGGGCC CGGG6CCCC0 AGCTCTGCTC 960 

70 TGTGCAGCTG GCCCGCAATG GGGCOGGQCC GAGGTOCAAT CTGTTTOQCT GCOQCCGCCT 1020 

GAGGGGdGG TGGCOGGTAG TGAAGCTGAA GGAGGCAGAG GAOGTGGAGC GGGAGGCGCA 1060 

GGAGGCTCAG GCTQGCAAGA AGAAOCQAAA GCAGAGGAGG AG6AAGGGCC GGCCAGAAGA 1140 

CCTGGAGTTC ACAGACATGG GTGGCAATGT GTACATCCTC AOGGGCAAGG TGGAGGCAGA 1200 

QTTTQAQCTO CTGACTGTGQ AGGAGGCGGA GAAA0G6CCA GTGGGGAAGG GGCGGAAGCA 1260 

75 GCCAGAGCCT CTGGAGAAAC CCAGCCGOCC CAAAACTTCC TTCAACTGGT TTCTGAACCC 1320 

GCTGAAQACC TTTGTCTTCT TCATCTGGOG CCGGTACTGG CGCACCCTGG TOCTGCTGCT 1380 

ACT O GTGCTO CTCACOGTCT TCCTCCTCCT GGTCTTCTAC ACCATCCCTG GCCAQAtCAG 1440 

CCAGQTCATC TTCCGTCCCC TCCACAA GTQ A CTCTCGCTG ACCTTGQACA CTCACCCAGO 1500 

rtrt 6TQCCAACCC TTCAATGCCT GCTCCTGGAA GTCTTTCTTA CCCATGTGAa CTACCCCAQA 1560 

80 GTCTAGTOCT TCCTCTGAAT AAACCTATCA CAGCCACTO 



A7B Protein sequence i 

Gene name: hypothetical protein FLJ134S9 
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Unigene nuniber: 
Probeset Accessicm #i 
Protein Accession 9* 
Signal sequence: 
Transmembrane domains* 
C3 domain: 

Cellular Localization! 



none found 
XM 047266 
XpI047266 
none found 
291-313 
27-86 

plasma membrane / ER 



1 11 21 31 41 51 

I 1 1 1 t I 

MMXDIFPQDV PAPPPVDIKP RQPZSYEIiRV VIWNTEDWL DDENFLTGEM SSDZYVKSWV 60 

KGLEHDKQBT DVHFNSLTOB GNFNWRPVFR FDYLPTEREV SVWRRSQPPA LBBABFRQPA 120 

VLVLQVWDYP RI8ANDFLGS LELQLPDMVR GAR6PBLC8V QLARN6AGPR CI^jFRCRRLR 180 

GWWPWKLKB AEDVEREAQB AQAOKKKRKQ RRRXORPBDL EFTDMGGNVY ILTGKVEAEF 240 

BLLTVBSAEK RPVGKGRRQP BPLEKFSRPR TSFNWFVNPL KTFVFFINRR YWRTLVLLLL 300 
VLLTVFLLLV FYTIFGQISQ VZFRFLHK 



A79 mh SEQUEKCE 

Gene namet Homo sapiens mRNA; CDNA DRFZp434K0322 (£roro clone DKFZp434K0322) 

Unigene nun^ert Hs. 161031 

Probeset Accession AL137708 
Nucleic Acid Accession AL137708 

Coding sequence I 1315>1791 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1111 I 1 

GGCATTGATG CTGTGTGOQC GTGOOTGTOT QTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 60 

GTOTOTOTOT GTCrGGACTC ATGGCAGGOT COCi T J f CTC l ' Cltyi'Ci'CCT T QCTCIOCCCC 120 

A6ACTG0GG6 6CTGCAGAGG TGAGGGTATC TCGCCTCAAC AGCTGCTTAT TCCCGATGGG 180 

AT66CCT6GG CTGGGCCCCT OAGGCCAGGC TGACTT6GAC ATOQCAAOAO 6GGTCCCAGG 240 

CTCTTGTGGG CAAAGCAGGO GAGGCGCCAA TGT6GAGGAA CAGAGTCTCC TGGCTGGCTG 300 

CT6CTCTCTG GAGGGGGTGG AGTCAGG6AA OAGCTGAGCT 6GGGAGTCAC CCTGG6CCTG 360 

QGQTCACCOT AGGCCOCATG XAOCACCCIO OTTGCCCTGC CIGTAOGTOA CAGGAGCCAG 420 

CCCA6CCAGG TGTQGTCCCT CCCCA6GCCC TAOGCAGGGG G6IACA0GG0 CCA6CA0CTG 480 

OOCGCQCCCC AGCTTGCTTC CCACCCACAT GCOQAAGQGT 66CCAGGCAG GCA6GT6QAC 540 

GAOTCCAGQC AGCGGCTGAO TCAGTGTGTG TGGAATGTTC TGGCCGCTCC CAGCTGCACC 600 

CTGOCCCTAC CTGCCACCAC CTCACCTTCA TCCTCAGGCG CT806GCCCT GAGCCCCTGC 660 

CA6QAATGCA CCTTTAGCCC A6GCCTGCTC AGT6AGCTCC GCCGACAGCC AGCCCTGCTC 720 

CTC008CCAT GACCCTGCAG AGCCCTCTGG GCTTCCAAGT TCCTGG6GGC TGCAGTQAAC 780 

ATQCTCCACC TGCATGGCTG GQAAACCATG GTGGGCCCCA GCTGTGQTOC OTGCTGOGQT 840 

AGAGGCAAGG AAGTGATGGG ACC6CAGAGA TGAGACCCCC AGGGATQAGA TGGGACCOCC 900 

AOGCAGGGCC CAGGOTCCAQ GGCCCAGGAG AGAGAAGCAG GGAGGGAGAO AGCTTCCTGG 960 

TGGAOGAOGC ATCCTACAGT GGGGGCAAGG GTGCTCTGAG GTC06GTGAA GGCAGGGACT 1020 

AOGCTOCCCA GGCCCTGCCr GCTTGGCTGO GGCTGGGGGC TGCTG6QAGG TGGCTGGQAG 1080 

GCTGGGCCTG GGCAGCTAAG CTGQAGCTTT GGCCAQGGTC CAOAGCCTCC CTCCCTTCAG 1140 

CTTCCTGCTG CACAGAACCC TCGCCCCTGQ CCACCCCGTQ CTGCCTCCTT GCCCTGGCAG 1200 

ACCCAGCACr GGCTGCT6CT AGTCAGATGG' GGTAGOQGGC AGGGGCOGGA GGGGCCACCC 1260 

TCCCAGCTXSA CCCAGCCTCC TGGGCCGCTT CTTCCAAACC AGCAGGGTAO AAAGATGGGQ 1320 

CACCCACCAG TCTCTCCCAG TGCCCOGGCC CCAOCTGGCA CCACAGCTAT ACCTGGGCTT 1380 

ATTCCAQACC TTGTCGCCGG GACCCCCTGT GAGTrOTGGQ ATTCCCAAGA GGGGTGTGGQ 1440 

OATAACCCAQ CCAAQTGG6G GCTGCAGCTG TCCACAQATG CACTCAGCCT GQCCTCTACC 1500 

CCAGGGCCCC GCTGGGCTCT CATTGCCGGC GCCCTTGCCG CGGGCGTCCT CCTOGTCTCC 1560 

T6CCTCCTCT GTGCTGCCTG CTGCT6CTGC CGCOGGCACA GGAAGAAOCC CAGQGACAA6 1620 

GAQTCOGTGG GTCTGGGCAG T6CCCGOGGC ACCACCACCA CCCACCTGGT GAGGAGCG6C 1680 

TCCTTGCTCA CTCAGTCCAG AGA6GGCTTG AAATCCAGGC TCCAGAGCCC AGGGCAGCGA 1740 

GGOGAQTTCA GCCCCAGGGA TGGTTTAACC CCCACAOAGO CAGGGCG TTG AQ GACCTTCC 1800 

TGGCAGGGAA AGTGGGTGAA CAQAGGTGAO AAGGAGGCCA TGCAACAGGO GCIGCCCCAT 1860 

GGGCC06AGG GAOCCACAOC GOaTTCTTOA GGAAGGCAGG GGGTACCCCA GAT6GCACGT 1920 

TTTGGOTGGG TTTGGCOGGT CTCACAGAGC GAAGCOGAOG ATTTGTGCCT GTTGGGTGGC 1980 

CTGGCCTGGA GGCGGGGGGT CTTGACCCAT OTCATOCAAG QGCTGCCCQG GAGCCCAGGG 2040 

CTCTGAT6AG GCATGATGTC AGCACCACCT GCCCCTTGTC CCAACTCACT CCAGGTGCAA 2100 

CCTGATGTGG AT6GCCTGGA GTCCAGCCOQ GGGGAT6CTC AGCAATGGGG 6TGCCTGCAG 2160 

CTCTCCCTGG AGTTOGACTT TGGAAOCCAG GAGGTGAAGO GCGCC6CTGC GCAGGACCAG 2220 

C0GTTCTGO6 AGTTTCG6GA AAGGGTGAOS GGGGAAGGGC AGAGCCCATO CCCTGGGTGG 2260 

TGGGGAGCTO ACAGGGCAGG GGCCCT7GGC TGAGCCCACC COGCTGGCTC CCAGATCAGQ 2340 

GTGGGCCTQA G6CA6GCAGC CGACCTGAGO CCTGGGGGCA CGGTGGACCC CTATGCCCGQ 2400 

GTCAGOGTCT CCACCCAGGC OQGACACAOA CATQAOACAA AAGT6CAC00 AGGCACOCTC 2460 

TGCCCOGTGT TTGAOQAGAC CTGCTGCTTC CAOGTQAGTC AGGGATGGTC GGCTGGGTGG 2520 

aCCTGGACGG CTGGATGGQC CT6GGCTGGQ TGOGCCTGGG CAGCTGGGTO GGGCTGGGCA 2580 

GCTGGGTGGG CCTGAGCTAO GGCAGCAGGG CCTGGCTCAC GCCGCTGCCT CAGATCCCGC 2640 

AGGOGGAGCT GCCAGGGQCC ACCCTGCAGG TGCAGCTTTT CAACTTCAAG OQCTTCTCGG 2700 

GGCATGAGCC CCTGGGTOAO CTCCGTCTGC CACTGGQCAC COTGGATCTG CAGCATGTTC 2760 

TGGAGCACTG GTACCTGCTQ GGCOCGOCGG CTGCCACTCA GGTGAGGTGC TGGTCACCA6 2820 

GCCACAGCCC AAGGCAGAGC TGGCAGGQAC CCTGCCCTAT GGGCCATOGG AAA0ACA6GC 2880 

CTGATGGGCA GCATTTTCGG GGGTCTGAOC CCCAACTOGQ CCAGAATCAC CCTOCCGGGC 2940 

TGAAOOCCCT CTTGCTGCCC ACA6CCOGAG CAGQTCGGGG AOCTOTGCTT CTCTCTCOGO 3000 

TAOGTOOCCA GCTCAGGCGG GCTQACOGTG OTQGTOCTGG A6GCT0GAGG CCTGOGTCCA 3060 

GGACTTGCAO AOCCCTAOGT GAAOOTCCAO CTCATGCTGA ACCAQAGGAA GTGaAAGAAG 3120 

AGAAAGACAO CCACCAAAAA GGGCAOGOGO GCCCCCTACT TCAATQAGGC CTTCACCTTC 3180 

CTGGTQGGCT TGAGCCAOGT CCAGAATGTG GACCTG G TQC l^GGCTGrCTq GGACCBCAOC 3240 

CTG C OGCTOC OAACTGAGGC CGTAOGCAAO OTOCACCTGO G TOO CO G GGC CTCGGGGCAG 3300 
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CCCCTGCAGC ACTGQOCAflA CMGCTGGCC CA0GC005 G C G6CCCATT6C CCAGOOGOVC 3360 

CCCGTG0S6C CA0CCA6GGA GGTGGACOGC ATGCTQ6CGC TGCAGCXCOQ CCTTGGCCTO 3420 

CGCCT O OOCr T6CCCCACTC CTGAATGCAC CACATGCCIC TGTCTCC0C3G CTGAGCCCA6 3480 
GOVCTTOCOC AGGCCQCCCT GCAGQAOCAC TGOUVTAAAC GOCTTCTGCT GOC 



ABO Protein sequence t 
Gene names 
Unigene nuniben 
Probeset Accession #x 
Protein Accession #i 
Signal sequence i 
Transnieiidbrane domains: 
Cellular Localization t 

1 11 21 31 41 51 

1 I I I I I 

MGHPPVSPSA PAPAGTTAIP GLIPDLVAGT PCELWDSQEG OQDNPAXNGL QLSTDALSLA 
STPGPRWALI AGALAAOVLL VSCLLCAACC CX3(RHRXKPR DKESVGL6SA RGTTTTHLVR 
SOSLLTQSRB OLRSRLQSPG QRGEFSPIUX3 LTPTEA6R 



Homo sapiens RiRNA; CDNA DKPZp434K0322 (from clone DXFZp434K0322) 
He. 161031 
AL137708 
CAB708B5 
none found 
69-85 

plasma meoibrane 



60 
120 



A81 DWA SEQUENCB 
Gene name : 
Ubigene number: 
Probeset Accession #s 
Hucleic Acid Accession 8i 
Coding sequence i 



Cadherin 3, P-cadberin (placental) 

H8.2877 

X63629 

X63629 

54-2543 (start and stop codons are underlined) 



1 11 21 31 41 51 

I I t I I I 

GOGGAACACC GGCCOGCOQT C8G6GCA6CT GCTTCACCCC TCTCTCTGCA GC CATQG QQC 60 

TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGOGG 120 

GCTCC6AGCC GTGCCGGGCG GTCTTCAGGO AGGCTGAAGT GACCTTGGAG GCGGGAGG06 180 

CGGAGCAGGA GCCCGGCCAG G06CTGGGGA AA6TATTCAT GGGCTGCCCT GGGCAAGAGC 240 

CA6CTCTQTT TAGCACTGAT AATGATGACT TCACTGTQOQ GAATGQOQAQ ACAGTCCAGG 300 

AAAGAAG6TC ACTGAAGGAA AGGAATOCAT TGAAOATCTT CCCATCCAAA OGTATCTTAC 360 

GAAQACACAA GAGAQATTGG GTGGTTQCTC CftATATCTQT CCCTGAAAAT GGCAAGG6TC 420 

CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTQAGGG TGTCTTOGCT GTAGAGAAGG 540 

AGACAGGCTG GTTGTTQTTO AATAAGCCAC TaGACOQGQA GGAGATTGCC AAGTATGAGC 600 

TCTTTGGGCA OQCTGTOTCA (3AGAATCQTQ CCTCAOTGQA GGACCCCATO AACATCTCCA 660 

TCATOQTGAC C8AGCAQAAT GACCACAAOC CCAAGTTTAC CCAGOACACC TTC0GA6GGA 720 

GTGTCrrAQA GGGAGTCCTA CCAG6TACTT CTGTQATGCA GGTGACAGCC ACAGATGAGG 780 

ATGATGCCAT CTACACCTAC AATGGGOTGG TTGCTTACTC CATCCATAQC CAAOAACXIAA 840 

AGGACCCACA COACCTCATO TTCACAATTC ACCGGAGCAC AGGCACCATC AGOGTCATCT 900 

CCAGTGGCCT GGACCGGGAA AAAGTCGCTG AGTACACACT OACCATCCAG GCCACAQACA 960 

TGGATGGGGA OGGCTCCACC ACCAGGGCAO TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 

ACAAT6CTCC CATGTTTGAC CCCCAQAAGT ACGAGGCCCA TGTGCCT6A0 AATGCAGTGG 1080 

GCCATGAGQT GCAGAGGCTG ACGGTCACTG ATCTGGACQC CCCCAACTCA CCAGCGTGGC 1140 

GTGCCACCTA CCTTATCATG GGCGGTGAOG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 

CTGAGAGCAA CCAGGGCATC CTGACAACCA OGAAGGOTTT (SGATTTTGAO GCCAAAAACC 1260 

AGCACACCCT GTAOGTTGAA GTQACCAAGG AOGCCCCTTT T6TGCTGAAG CTCCCAACCT 1320 

CCACAGCCAC CATAGTGGTC CAGQTGGAGG ATGTGAATGA GGCACCTGTG TTTOTCCCAC 1380 

CCTCCAAAGT CGTTGAGOTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 

CT6CAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA COQCATCCTO AGAGACCCAS ISOO 

CAGGQTGGCT AGCCATGQAC CGAGACAGTG GOCAGQTGAC AGCTGTGGGC ACCCTCQACG 1560 

GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTAT6AAGT GATGGTCTTQ GCCATGGACA 1620 

ATQOAAGCCC TCCCACCACT GGCA06GGAA CCCrTCTGCT AACACTQATT GATGTCAACG 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTO CAACCAAAGC CCT6TGCGCC 1740 

AOGTGCTGAA CATCAOGGAC AAGQACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAOC 1800 

TCACAGATGA CTCAQACATC TACTGGACGG CAGAGQTCAA CGAGOAAGGT GAOICAGTGG 1860 

TCTTQTCCCT QAAOAAGTTC CTGAAGCAGG ATACATATGA CQTGCACCTT TCTCT6TCTG 1920 

ACCATGGCAA CAAAGAGCAG CTGA06(?IGA TCAGGGCCAC TGTGTG06AC TGCCATGGCC 1980 

ATOTOGAAAC CTGCCCTGGA CCCTGOAAAO QAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 

TCCTGGCTCT GCTQTTCCTC CTQCTOaTGC TGCTTTTOTT OGTGAGAAAG AAGOGGAAGA 2100 

TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCQ 2160 

AAOAGGGGGG TGGCGAAGAO GACCAOOACT ATQACAICAC CCAGCTOCAC CGAGGTCTGG 2220 

AGGCCAGGCC GGA6GTGGTT CTC06CAATG AOGTG G CACC AAGCATCATC COGACACCCA 2280 

TQTACGGTCC TAGGCCAGCC AACCCAGATG AAATOGGGAA CTTTATAATT QAGAACCrOA 2340 

AGGOGGCTAA CACAGACCCC ACAGCCCOGC CCTAOGACAC CCTCTPGGTG TTOGACTATG 2400 

AGGGCAGOGG CTCCGAC6CC GCGTCCCTGA GCTCCCTCAC CTCCTCCX5CC TCCGACCAAG 2460 

ACCAAGATTA CGATTATCTO AACQAOTGGG GCAGCOGCTT CAAGAAGCZXl GCAGACATGT 2520 

ACGGTGGCG6 G6AGGAC6AC TAGGOGGCCT GCCTGCAGGQ CTGGGGACCA AAC6TCAGGC 2580 

CACAGAOCAT CTCCAAG6GG TCTCAQTTCC CCCTTCAOCT GAGGACTTOQ GAGCrTGTCA 2640 

GQAAGTGGCC GTAGCAACTT GGOGGAGACA GGCTATGAOT CTGACGTTAG AGTGGTTGCT 2700 

TCCTTAGCCr TTCAGGATGG AGGAATGTGG GCAGTTTQAC TTCAGCACTO AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAGGCCAA CTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 

TGCTCAACOC TGT G TCCT GG GCCTGGGCCT GCTGTQACTG ACCTACAGTG GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT OCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAA06TT AGAGAAAGTT CTTCAA^KAGT GCAGCCCAOA GCTOCTGGGC CCACTGGCGG 3000 

TCCTGCATTT CXGGTTTCCA GACCCCAATO CCTCCCATT C O OATQQ ATCT ClMOJ e mTL * 3060 

ATACTGAOTG TGCCTAGGTT GCCCCITATT TTTTATTTTC CCT8TT60GT TGCTATAGAT 3120 
QAA66GT6AG GACAATOGTO TATATGTACT AOAACTTTTT TATTAAAGAA A 
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A82 Protein sequence: 
Gene names 
unigene nuraiber: 
Probeset Accession 8: 
Protein Access ion 9i 
Slgnel sequence: 
Transmenibrane doraains 

Cellular localizationi 



Cadherin 3, P>cadherin (placental) 

H8.2877 

X63629 

CAA4S177 

1-24 

659-675 (second underlined sequence) 
plasma niendbrane 



1 11 21 31 41 51 

I I I I I 1 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FRBAEVTLEA GOABQEPGQA LGKVPMOCPG 60 

QEPALFSTDN DDFTVRNGET VQBRRSLKER NPLKIFPSKR ILRRHKRDHV VAPISVPENQ 120 

RGPFPQRLKQ liRSNKDSDTK IFYSITGPGA DSPPEGVFAV EKBT6WLLW KPLDREBIAK IBO 

yEtiFGHAVSB KGASVEDPMN ISIIVTDQND KKPKFTQDTP RGSVXiEGVIiP GTSVMQVTAT 240 

DEDDAXYTXN GWAYSIKSQ EPXDPHDLMF TZBRSTGTIS VISSGLDREX VFEYTLTIQA 300 

TDMDGD6STT TAVAWEILD ANDNAPMFDP QKYBAHVPEtT AVOKEVQRLT VTDLDAPNSP 360 

AS«iATYI.IMG GDDGDHPTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPPVLKL 420 

PTSTATIWH VBDVNBAPVF VPPSKWEVQ BGIPTGEPVC VYTAEDPDKE NQKXSYRILR 480 

DPA6WUU4DP DSGQVTAVGT UiREDEQWR NHIYBVMVIA MDN6SPPTTG TGTUJiTLZD 540 

VNSHGPVFEP RQITZCaVQSP VRHVLNITDX DLSPBTSPFQ AQLTDDSDIY HTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNXEQL TVIRATVCDC H6HVBTCPGP WKGGFIUVL 660 

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGBEGGQBED QDYDITQLHR 720 

GIiBARPBWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEOSGSDAA SL88LT8SAS DQDQDYDYUI EHGSRFKKLA DMYGGGEDD 

A83 DNA SEQUENCE 

Gene name: putative Q-protein coupled receptor 
unigene number: Hs.l6085 
Probeset Accession ttt F079S3 
Nucleic Acid Accession #t NM.016334 

Coding sequence! 104-1471 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

! I I i I I 

A6CACCTGGQ AAAAGGCAGA CCGTGTGAGG GGGCCTOTOa CCCCAGC3TG CTGTGGCCTC 60 

GGGQAGTGGG AAOTGOAGGC AGGAGCCTTC CTTACACTTC GC GATGA GTT TCCTGATOGA 120 

CTCCAGCATC ATGATTACCT CCCAAATACT ATTTTTTGGA TTTGGGTGGC TTTTCTTCAT 180 

OOQGCAATTG TTTAAAGACT ATGAOATACG TCAOTATGTT GTAC ACqTGA TCTTCT C OGT 240 

GAG8TTT6CA TTTTGTTGCA CCAT6TTTGA GCTCATCATC TTTGAAATCT TAGGAGTATT 300 

GAATAGCAGC TCCOGTTATT TTCACTGGAA AATGAACCTG TQOGTAATTC TQCTGATCCT 360 

GGTTTTCATG GTGCCTTTTT ACATTGQCTA TTTTATTGTQ AGCAATATCC GACTACTGCA 420 

TAAACAAOGA CitiCi ' ll ' m ' CCTGTCTCTT ATQGCTQACC TTTATGTATT TCTTCTGQAA 480 

ACTAOaAGAT GOCTTTCCCA TTCTCA6CCC AAAACATGGG ATCTT A TCCA TAGAAO^GCT 540 

CATCAGC066 6TTGGTGTGA TT6QAGTGAC TCTCATGGCT CTTCTTTCT Q GATTTGGT6C 600 

TQTCAACTQC CCATACACTT ACATOTCTTA CTTCCTCAGG AATGTGACTG ACACAGATAT 660 

TCTAGCCCTO GAACGGCGAC TGCTGCAAAC CATGGATATG ATCATAAGCA AAAAGAAAAG 720 

GATGGCAATG GCACGGAGAA CAATGTTOCA GAAGGGGGAA QTGCATAACA AACCATCAGG 780 

TTTCTCGK3GA ATGATAAAAA CrTGTTAOCAC TTCAGCATCA GGAA6T6AAA ATCTTACTCT 840 

TATTCAACAG GAAGTGGATG CTTTGGAA8A ATTAAGCAGG GAGCTTTrTC TOQAAACAGC 900 

TQATCTATAT GCTACCAAGG AGAGAATAGA ATACTCCAAA ACCTTCAAGG GGAAATATTT 960 

TAATTTTCTT GGTTACTTTT TCTCTATTTA CTGTGTTTGG AAAATTTTCA TGGCTAOCAT 1020 

CAATATTGTT TTT0ATC6A0 TTQGGAAAAC 6GATCCTGTC ACAAGAGGCA TTGAGATCAC 1080 

TGT(»ATTAT CTGGGAATCC AATTTGATGT GAAOTTTTGG TGOCAACACA TTTCCTTCAT 1140 

TCTTGTTGGA ATAATCATCG TCACATCCAT CAGAGGATTO CTGATCACTC TTACCAAGTT 1200 

CTTTTATGCC ATCTCTAGCA GTAAGTCCTC CAATGTCATT GTCCTGCTAT TAQCACAGAT 1260 

AATGGGCATG TACTTTGTCT CCTCTGTGCT GCTGATCCGA ATGAGTATGC CTTTAGAATA 1320 

CCGCACCATA ATCACTQAAO TCCTTGGAGA ACTGCAGTTC AACTTCTATC ACCGTTGOTT 1380 

TGATGIOATC TTCCTGGTCA GG(3CTCTCTC TAGCATACTC TTGCTCTATT TGGCTCACAA 1440 

ACAGGGACGA GAGAAGCAAA TGGCACCTTG AACTTAAGCC TACTACAGAC TGTTAG AGGC 1500 

CAGTGGTTTC AAAATTTAGA TATAAQAGGQ GGGAAAAATO GAACCAGGGC CrQACATTTr 1560 

ATAAACAAAC AAAATGCTAT GGTAGCATTT TTCACCTTCA TAGCATACTC CTTCCCCCTC 1620 

AGGTGATACr ATGACCATGA GTAGCATCAQ GCAGAACATG AGAGGGA6AA CTAACTCAAO 1680 

ACAATACTCA GGAOAOAaCA TCOOOTOTGQ ATATGAGGCT G6TGTAGAGG CGQAGAGGAO 1740 

CCAAGAAACr AAA66T6AAA AATACACTGQ AACTCIGGGO CAAGACATGT CTA TCCTA GC 1800 

TQAGCCAAAC ACQTAGOATT TCGQTTTTAA GGTTCACATa QAAAAG6TTA TA6CTTTQCC 1860 
TTGAGATTQA CTCATT AAAA TCAGAGACTG T 



A84 Protein sequence 
Gene name: 
Unigene mimbert 
Protein Accession #: 
Signal sequence t 
Transmendbrane domains: 
424-446 

Cellular Localizations 
1 11 21 



31 



putative G-protein coupled receptor 
H8. 16085 

IIP_05741B.l 
none £ound 

5-27, 42-64, 76-98. 110-132, 144-166. 289-311. 342-364. 380- 
plasma membrane 
41 51 



870 



wo 03/042661 



PCT/US02/36810 



1 

MSPLIDSSIM 
EILGVLNSSS 
MYPFWKLODP 
VTDTDILALB 
SBNLTLXQQB 
ZFmTINIVF 
ITLTKPPyAI 
FYHRHFDVXF 



ITSQILPFGF 
RYFHWKMNLC 
PPILSPFO^QI 
RRLLgriMDMI 
VPA IiEEL SRQ 
DRVGKTDSVT 
SSSRSSNVZV 
LVSALSSZLP 



GWLFFMRQLF 
VILLILVPMV 
LSIEQIilSRV 
I8KKKRMAHA 
LPLETADLYA 
ROZEITVNYX. 
XiIiLAQZKGMY 
LYLABKQAPE 



I 

KDYEIRQYW 
PFYIOYFIVS 
GVIOVTLMAL 
RRtKFQKGGV 
TKERIBYSKT 
GZQFDVXFU8 
FVSSVLLZRM 
KQMAP 



1 1 

QVIFSVTPAP SCTMFBLIIP 60 

NIRLLHKQRL LPSCLLHliTP 120 

LSGFGAVNCP YTYMSYPUm 180 

HNKPSGFKGM IKSVTTSASG 240 

PKGKYFNPW yPPaZYCVWK 300 

QHZSPZLVOI ZZVTSZRGLL 360 

SHPliEYRTXZ TBVLOBLQFII 420 



ABS Dm 8EQUEWCB1 
Gene namei 
Unigene zninibert 
PrcAeset Accession |}: 
Nucleic Acid Accession ft: 
Coding sequences 



TTK protein kinase 
Hs. 169840 
M86699 
NM_003318 

1026-3551 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I 1 I 1 I I 

GQAATTCCTT TTTTTTTTTT TTTGAGATGO AGTTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAATGGCACA ATCTCAGCTT ACTGCAAGCT GGGCCTCCGO GGTTCAAGGO ATTCTCCTGC 120 

CTCAQCCTCT CAAGTAGCTG GQATTACAGQ CATOTQCCAC CACCCCTG6C TAACTAATTT • 180 

CTTTTCTATT TAOTAGAGAT GGOGTTTCAC CATGTTGGTC AGGCTGGTCT TQAACTCXn^ 240 

ACCTCA0GT6 ATCCACTT6C CTTG6CCTCC CAAA6T6CTA GGATTACAOC COTGAAACTG 300 

TGCCTGGCTG ATTCTTTTTT TGTTGTTGGA TTTTTGAAAC AGGQTCTCOC TTGQTCGCCC 360 

A6GCTGGAGT GOiaTGGTGC GATCTTGGCT CACTATAACC TCCACCTCCT GGTTTCAAGT 420 

GATCCTCCCA CTTTAGCCTC CTGAGTAGCT GTQATTACAO QOGTGCACXA CCACACCOGG 480 

CTAATTTTTG TATTTTTATT AGAGACAGGO TTTCACCATG TTGGCCAGGC TQTTCTCAAA 540 

CTCCT G GACT CAAGGGATCC GCCTGCCTCC ACTTCXXAAA GTCXZCGAGAT TAC3VGGTGTG 600 

AGTCACCATG OCTGAGCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTA6 660 

GGTGCTCACA ACAAATCTGC ATTAGGCGOT ACAATAATCC TTAACTTCAT 6ATTCACAAA 720 

AGQAAGATGA AGTGATTCAT GATTTA6AAA GGGQAAGTAG TAAGCCXACT GCACACTCCT 780 

GGATQATGAT CCTAAATCCA GATAGA6TAA AAATGGQGTA T6GGAAGGTA GAATACAAAA 840 

TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TG6ATACATT GTTGATGTGA 900 

ATGTAAaACT GTACAOACTT CCTAGAAAAC AGTTTGOGTT CC31TCTTTTC ATTTCCOCAO 960 

T6CAGTTTTC T6TAGAAATG GAATCCX3AGG ATTTAAGTOG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTGAGA GACATTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080 

AACTAAGCTT GAATAAAATT TCTGCTGATA CTACAGATAA CTOGGGAACT GTTAACCAAA 1140 

TTATGATGAT GGCAAACAAC CCAGAGGACT GGTT6AGTTT GTT6CTCAAA CTAGAGAAAA 1200 

ACAQTOTTGC GCTAAOTaAT QCTCTTTTAA ATAAATTGAT TGGTOOTTAC AGTCAAGCAA 1260 

TTGAAGCGCT TCCCCCA6AT AAATATGGCC AAAATGAGA6 TTTTGCTAGA ATTCAA6TGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAGAOC CAGATGATGC ACX5T6ACTAC TTTCAAATGO 1380 

CCAGAGCAAA CTGCAAGAAA TTTGCTTTTQ TTCATATATC TTTTGCACAA TTTGAACTGT 1440 

CACAAGGTAA TGTCAAAAAA AGTAAACAAC TTCTTCAAAA AGCTGTAGAA 0GT6GAGCA0 1500 

TAOCftCTAGA AATGCTGQAA ATTOCCCTGC GGAATTTAAA OCTCCAAAAA AAGCAGCTGC 1560 

TTTCAGAGGA GGAAAAQAAG AATTTATCAO CATCTAOGGT ATTAACTGCX: CAAQAATCAT 1620 

rrrcOGGTTC ACTTGGGCAT TTACAGAATA GGAACAACAG TTQTGATTCC AGAGQACAGA 1680 

CTACTAAAGC CAGGTTTTTA TATGGAGAGA ACATGCCACC ACAAGATGCA QAAATAGGTT 1740 

ACOQGAATTC ATTQA6ACAA ACTAACAAAA CTAAAGAGTC ATGCCCATTT GQAAGAGTCC 1800 

CAQTTAACCr TCTAAATAGC CCA6ATTGT0 ATGIGAAGAC AGATGATTCA GTTOTACCTT 1860 

GTTTTATGAA AAGACAAACC TCtAGATGAG AATGC06AGA TTTGGTTGTG CCTGGATCTA 1920 

AACCAAQTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTGTTCAA AATAGTCATT 1980 

TCAAGGAACC TCTGGTGTCA GATGAAAAQA GTTCTGAACT TATTATTACT GATTCAATAA 2040 

CCCTGAAGAA TAAAAGGGAA TCAAQTCTTC TAGCXAAATT AGAAGAAACT AAAGAOTATC 2100 

AAGAACCAGA GGTTCCAGAO AGTAACCAGA AACAOTGGCA A6CTAAGAGA AAGTGAGAGT 2160 

GTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGGA GATTCOGGAG TTAGCC08AA 2220 

AAGTTAATAC AGAGCAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AG7CACCACC AATATCAACA TCTAAATGGT TTGACCCAAA ATCTATTTOT AAQACACCAA 2340 

GCA6CAATAC CTTGGATGAT TACAT6AGCT GTTTTAGAAC TCCAGTTGTA AAGAATGACT 2400 

TTOCACCTGC TTGTCAGTTO TCAACAOCTT ATGGOCAACC T6CCTGTTTC CA6CAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTAGAGGT TTTA6CATCT TCTTCAGCAA 2520 

ATGAATGCAT TTCGGTTAAA GGAAGAATTT ATTGOVTATT AAAGCAGATA GGAAGTGGAG 2580 

GTTCAAGCAA GGTATTTGAG GTGTTAAATG AAAAGAAACA GATATATGCT ATAAAATATG 2640 

TGAACTTA6A AGAAGCAGAT AACCAAACTC TTGATAGTTA CGGGAACQAA ATA6CTTATT 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCOSACT TTATOATTAT QAAATCAOGG 2760 

ACCAGTACAT CTACATGGTA AT6GA6TGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 

AGAAAAAATC CATTGATCCA TGGGAACGCA A6AGTTACTG GAAAAATATQ TTAGAGGCAG 2880 

TTCACACAAT CCATCAACAT GGCATTGTTC ACAGTGATCT TAAACCAGCT AACTTTCTGA 2940 

TAGTTGATG6 AAT6CTAAAG CTAATTGATT TTG6GATTGC AAACOUUITO CAACCAGATA 3000 

CAACAAGTOT TOTTAAAGAT TCTCAOOTTG GCACAOTTAA TTATATGCCA CCAGAAGCAA 3060 

TCAAAGATAT GTCTTCCTCC AQAGAGAATG OGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120 

GTGATGrrro GTCCTTAGGA TGTATrTTOT ACTATATQAC TTAOGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAGATT TCTAAATTAC ATGCCATAAT TGATCCTAAT CATGAAATTG 3240 

AATTTGCXX3A TATTCCAGA6 AAAGATCTTC AAGATGTGTT AAAGTGTTGT TTAAAAA6G0 3300 

AOOCAAAACA QAGGATATOC ATTCCTQAOC TCCTOGCTCA TCCATATGTT CAAATTCAAA 3360 

CrCATCCAGT TAACCAAATG GCCAAGGGAA GCACTGAAQA AATGAAATAT GTTCTGGGCC 3420 

AACTTGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATATGAAC 3480 

ACTATAGTGG TGGTGAAAGT CATAATTCTT CATCCTCCAA GACTTTTQAA AAAAAAAGGG 3540 

GAAAAAA ATO AT TTGCAQTT ATTCQTAATG TCAGATAGGA GGTATAAAAT ATATTGGACT 3600 

GTTATACTCT TGAATCOCTQ TGGAAATCTA CATTTGAAOA CAACATCACT CTGAAGTGTT 3660 

ATCAOCAAAA AAAATTCACT QAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGGAACCAC 3720 

TTATGGCACT GTATATATTG TAOACTTOTT TTCTCTGTTT TATGCTCTTO TGTAATCTAC 3780 

TT6ACATCAT TTTACTCTTG GAATAGTGGO TGQATAGCAA GIATATTCTA AAAAACTTTO 3840 
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TAAATAAAGT TTTOTGGCTA AAATQA 



A86 Protein sequence i 
Gene name: 
Unigene nxiniben 
Probeset Acceesion 9i 
Protein Accession S: 
Signal sequence t 
Transmeinbrene domains: 
Protein Kinase Domain t 
Cellular Localization* 



TTK protein kinase 
HS.169B40 
N86699 
NP_003309 
none £ound 
none £ound 
S10-77S 
cytoplasmic 



and nuclear 



1 

I 

KNKVRDIKNK 
VPLSDALLNK 
ANCRKFAFVB 
EEEKKNLSAS 
NSIiRQTNXTK 
SGNDSCBZiRN 
PEVPBSNQKQ 
PPISTSKWFD 
QILATPLQNL 
LBEADNQTU) 
KSIOPHBRKS 
SWXDSQVGT 
XIHQZSXUIA 
PVNQMAKGTT 
K 



11 

I 

FKNEDLTDKL 
LZaHYSQAZB 
I8FAQFELSQ 
TVLTAQESFS 
QSCPFGRVPV 
LKSVQNSHFK 
WQAXRICSECX 
PKSICKTPSS 
QVLAS9SANB 
SYRNEIAYLN 

ynKNMLEAva 

VITYMPPEAIK 
ZIOPNHBIBP 
BBMKYVLGQIi 



21 

I 

SLNKXSADTT 
ALPPDinrOQN 
GNVKKSKQLL 
GSLGHLQNRN 
NLUISPDCDV 
EPLVSDBKSS 
HQNPAASSNH 
HTLDDYMSCP 
CISVKSRIYS 
KLQQHSDKll 
TIHQHOIVHS 



PDIPEKDLQD 
VGLNSFNSZL 



31 
I 

DNSGTVKQIM 
ESPARIQfVRF 
QRAVBRGAVP 
NSCD9RGQTT 
KTDDSWPCP 
EXiIITDSITL 
WQXPBUUUCV 
RTPWKNDFP 
ILKOIGSGGS 
RLYDYEITDQ 
DLKPANFLIV 
SXSXX8PXSD 
VLKCCLKRDP 
KAAKTLYBHY 



41 

1 

MHANNPEDWL 
ABLKAXQEPD 
LaOiBXAUW 
KARFLYGOm 
MKRQT8R8EC 
KNKTESSIiLA 
NTEQXHTTFB 
PACQIiSTPYG 
SKVPQVLNEK 
YIYMVMEOGN 
DGMIiKLIDFG 
VWSLGCILYY 
KQRISXPBUi 



51 
I 

SLLLKLEKNS 
DARDYFQNAR 
LNXiQRKQLLS 
PPQDAEIGYR 
RDLWPGSKP 
KLEBTKBYQE 
QFVFSVSKQ8 
QPACFQQQQR 
KQIYAZKYVN 
IDLNSWLKKK 
XAKQMQPDTT 
HTYGKTPFQQ 
AHPYVQIQTB 



60 
120 
180 
240 
300 
360 
420 
400 
540 
600 
660 
720 
780 
840 



A87 DMA SBQUEHCB 

Gene name: 
Unigene number t 
Probeset Accession #: 
Nucleic Acid Accession #t 
Coding sequence! 



bone morphogenetlc protein 7 (osteogenic protein 1) (BHP7) 

Hs. 170195 

BB616633 

NM_001719 

123-3.418 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 I i I I 1 

QGGGGCA009 6GQCCCQTCT <3CAGCAA3TO AOOGAOGQCC GGQAGGGCCG CCTGCOCCCT ' 60 

CTGGCACCTO GGGCGGTG06 GGCCCOQAGC CCG6AGCC0Q G6TA60G0GT AGAGCC6G08 120 

OGATQCACOT GOGCTCACTG CGAGCTQCGG CQCOQCACAG CTTOGTGGCG CTCTGQGCAC 180 

CCCTGTTCCT GCTGOGCTCC GCCCTGGCOG ACTTCAGCCT GQACAACGAG GTGCACTCGA 240 

GCTTGATCCA CGGGCGCCTC CGCAGCCAGG AGGGGOGGGA GATGCAGOGC GAQATCCTCT 300 

CCKTTTTQGO CTT6CCCCAC C6C00606CC GGCAOCTCCA GG6CAASCAC AACTGGGCAC 360 

COVTGTTCAT GCTQQACCTG TACAAOGCCA TGGGGGTGGA GGAGGGCGGC GGGCCOGQCQ 420 

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCA6TAC CCAOGGCCCC CCTCTGGCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATOAGCTTC GTCAACCTOG 540 

TGQAACATQA CAAGGAATTC TTCCACCCAC GCTACCACCA TOQAGAOTTC CGGTTTGATC 600 

TTTGCAAOAT CCCAGAAGG6 GAA6CT6TCA CGGCAGOOQA ATTCOOGATC TACftAGGACT 660 

ACftTCOQGGA A06CTTOGAC AATGASA06T TCOGOATCAG OGTrTATCAO GTQCTCCAGQ 720 

AGCACTTGGG CAGGQAATCO 6ATCTCTTCC TGCTCGACAG CCX3TACCCTC TGGGCCTCGG 780 

AQGAGGGCTG GCTGGTGTTT GACATCACAO CCACCAGCAA CCACTGGGTG OTOATCC6C 040 

GGCACAACCT GGGCCTGCAG CTCTOGGTGG A6A06CTGGA TGGGCAGAGC ATCAACCCCA 900 

AGTTG60GGG GCXGATTGGQ OQ6CAC3GG6C CCCAGAM3A 6CAGGCCTTC ATGGTG6CTT 960 

TCTTCAAG6C CACGGAGGTC CACTTC06CA GCATCCGGTC CACGGGGAGC AAACAGCGCh 1020 

6CCRGAACCG CTCCAAGACG CCCAAGAACC AGGAAGCCCT GOGQATGQCC AACGTGGCAG 1000 

AQAACAGCAO CA6CGACCAG A6GCAGGCCT OTAAQAAQCA CQAQCTGTAT GTCAGCTTCC 1140 

GAQACCTGGG CTGGCAGGAC TGQATCATOO OGCCTGAA6G CTA06CCGCC TACTA CTOTO 1200 

ACSGQOQAOTO TQCCTTGCCT CTQAACTOCT ACAIGAAOQC CftCCAACCAC GCCATCSTOC 1260 

AGA06CIGGT CCACTTCATC AACC0G6AAA OGGTGCCC A A 6CCCTGCTGT GCGCCCA06C 1320 

AGCTCAATGC CATCTCCGTC CTCTACTTOO ATGACAGCTC CAACGTCATC CTGAAQAAAT 1360 

ACAGAAACAT GQTGGTCCGG QCCTGTGQCT GCCACTAGCT OCTCOQAGAA TTCAQACCCT 1440 

TTGGGGCCAA GITTTTCT6G ATCCTCCAT7 GCrCGCCTTG GOCAGQAACC AGCAGACCAA 1500 

CraCCTTTTG TGAOACCTTC C C CTC C CTAT CCCCAACTTT AAAGOTGTOA (3AGTATTAGG 1S60 

AAACATGAGC AGCATATG6C TTTT6ATCAG TTTTTCAGT6 GCA6CATCCA ATGAACAAGA 1620 

TCCTACAAGC TGTGCAG6CA AAACCTAGCA GGAAAAAAAA ACAAOGCATA AAGAAAAATG 1600 

GCOQGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCAC GGACTOQTTT CCRGAGGTAA 1740 

TTATQAGOSC CIACCAGCCA GGOCACCCAG CCGTGGGAGO AAGGOGGOaT GGCAAGGGGT 1800 

6GGCACAST6 GTCTCI G TGC GAAAOGAAAA TTGACC06GA AGTTGCTGTA ATAAATGTCA 1860 
CAATAAAACG AATGAATG 



A08 Protein aequencet 

Gene name* bone morphogenetic protein 7 (osteogenic protein 1) (BKP7} 



Tftiigene nunbert 
Probeset Accession tti 
Protein Accession 8: 
Signal sequence t 
P£am domains: 
Transmembrane domains i 
Cellular iiocalixationi 



Hs. 170195 
BB616633 
NP_001710.1 
1-30 

TGFb^ropeptide (37-281] 

none found 

secreted 
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1 11 21 31 41 51 

I t I I 1 I 

MHVRSLRAAA FH8PVALWAP X^FLLRSAUVD PSLDKEVHSS FIURIUjRSQB RREMQREJLS 60 

ILQLPHRPRP HLQOKHNSAP MFKLDLYNAM AVEEGGQPGG QGPSYPYKAV PSTQGPPLAS 120 

LQDSHFLTDA DMVMSFVNLV BHDKBFFHPR YHHHEPRFDL SKIPB9BKVT AAEFRZYKDY 180 

XRBRFDNBTF RZSVYQVIiQE HUSRBSDLFL LDSRTLWASB BQHLVFDITA TSNHHWNPR 240 

HNLGZ>QZ»SVB TLDGQSINPX LAGLIGRK6P QIHCQPFMVAP PKATEVHFRS IRSTGSKQRS 300 

QNRSIOTKNQ BAIiRMANVAB NSSSDQRQAC KKHELYVSPR DLGWQDHIIA PEGYAAYYCE 360 

GECAFPUiSY MNATNHAZVQ TLVHFINPET VPKPCOVPTQ LNAISVLYFD DSSHVZLKKY 420 
HNMWRACGC H 

A89 DNA SEQUBHCE? 

Gene name: Hotro sapiens nflUlAi cDNA DKFZp56401763 (from clone OKFZpS 6401763) 

Unigene number i Hs.27373 

Probeset Accession Ot F13036 

Nucleic Acid Accession 8: AC012478 

coding sequences 1-1896 (underlined sequences correspond to start and stop codons) 



ATGO GOGCCG 
GCTCCOSCOQ 
GGCX3AST0QC 
600GOGOQOG 
ATTTCCATCC 
TTTACCACCC 
AAGAAGACAC 
CTAAATGAA6 
TCCTT6C06G 
GTGCCCCCAC 
GGT6GAATCA 
TCAGCTGCAA 



TCAQACTGTC 
TTTGGOCATC 

CTGAATCTCA 
GCCAGGTGCT 
CCTTG9TQQC 
AGTAGCTTGQ 
AGAGCAOTQA 
GCCCTAQAGA 
ACCCATCCrO 
AGGGTGTTTC 
TGCCTTTTGG 
ATCTGTCTCC 
GTCCTTGOGT 
AAACGAGGGC 
GOQAACCTGC 
CACCCTGGAG 
GAT66CAGAT 
TOTQAGAGGC 
TCCG0C30SAC 



11 

I 

TGCCGCTGCC 
CCOGCGCCAQ 
GGCCACOGCC 
OCAGOGGCAO 
TCCTCOGCGA 
TCCTCATCX3C 
GCAAGTATGA 
AGGATGATGA 
CT6CACIGA0 
CCTTCATCCT 
GACCTGGTAA 
CCTGGGGTQT 
GQAACX9TTAT 
ACTGGCAAaC 
CCTTTAAAOT 
TGGAAAAGCT 
TGCCACTGGT 
ACTTCAGCGC 
6CTTGGATQT 
CTAAGOTTCT 
GTGGQTTTCC 
TCAQOTTGGC 
GGOGTCCGGO 
TTTTGAAGAT 
CCTGCTGTGC 
CATTTGAGCA 
CCA6TGGGCA 
AGACAATTCC 
GAGACTT6GA 
GCCAGAAGAT 
TCAC AGgT TC 
AGGCCCT6TT 




CTCCAGCTCC 
CCTAOCCACC 
CTGCCTGCTG 
TATCATCACC 
AGATGAGQAC 
AGGTCA6CTQ 
OGACATTGAC 
AACCTGTTTC 
GAAGGACTGG 
GTATAAAACC 
T06TTTCCAC 
GCCCCCTACT 
GGATTCCTCT 
CCTGGCAGAA 
CACAGOCTCT 
TTTCTGTGGT 
CCAGGGTAOC 
CXATCATCTC 
TOGTTCAGAT 
GCAGTCTCTG 
TCTGTTGAGG 
CGTQQAACAC 
6AGCCCACAA 
ATTGACCAGA 
AGATACCCAA 
TQGOGTQOCA 
GGTCCTGATG 
CCATCACTTC 
TCFQTCCAGG 



31 

I 

CTGCOGCTGC 
TCCGTCTOOG 
CCXGGGAACA 
AACM3CA6TO 
CTCAAGGCAG 
CTGOQOQTCT 
ACTCCAGCAG 
TCGACAGTAT 
GCAGGGTGCC 
CTTCCAGCAA 
CCAGCCTGGT 
ACCTGGAAGC 
CCA6CTCCAT 
GTCAOCACAA 
TCTACTCCCC 
GCCTTACGCA 
ATGQCGGCTO 
CCAATAAAAA 
GOCGGCCAGC 
TCTTTCTGCA 
AGGCTTCTCA 
OCCCGGGGAC 
CATGGCGGAG 
CGCCATCCTC 
CTAOGOGAAO 
AAGGCAOCTO 
CACACATGCC 
GGCXAGGAAG 
AATTTCTATT 
TCT6AGGAA0 
T CCAG CCATT 
GCCTQA 




CCACCCGGTT 
GC9QACG0CTT 
CCGTGATOST 
TCAGGTOGGQ 
AGCGA6TGGA 
TOQACATCAA 
AGACGCTACT 
GATGCAGTOG 
GGCATCCTGT 

ccTCTTGayr 

0QTGG6T6TC 
T6GAGTTGCT 
ATOQTTTTCO 
GAAACACCCG 
CTQAAAGTGA 
CCCTTTACAC 
GGGGCAGCTT 
AAGAGCTGOO 
GAGAGTGTCC 
AAGCCAGCCT 
GGTCAGCGGG 
ACCTTGACCT 
CCAAGA6AAG 
CTGCCCACGG 
CTGGCTGGGG 
GCCCAOGTGA 
TGOAGGAAGA 
GGCCACCTAG 
CCAAOTCTTQ 




AAAGAGGTTA 
AATGGCGCCA 
ATAGSUSAGTQ 
GACABTTCCT 
AAGGCCT6AT 
GGAAAGTTGG 
CGGAGGTOTT 
AGGCATCTGC 
TCTGC3CAGCC 
ACAACTGCAG 
GGCTCCATCT 
CCTTCCAAAT 
ACAAACCAT6 
TTGTOAAQAC 
CTGGAAGCCA 
TCCGCTGAGC 
OACGGGGAGG 
TACOGCAACT 
CTTCTACAAA 
CTCAGTGACT 
GGAGCCTGTC 
GATCACACAT 
GQATGTCACT 
GGGTTTOCAG 
TTTGACAGQA 
OTOCTTOCIT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
X680 
1740 
1800 
1860 



A90 Protein aequeneet 
Gene namet 
Unigene number: 
Probeset Accession #t 
Protein Accession ft : 
Signal sequence > 
Transmembrane domains i 
Cellular Localization! 



Homo sapiens ftiRNA; cDKA DRFZp56401763 

HS. 27373 

FX3036 

FGEHBSH predicted 
1-27 

94-115, 448-469 
not determined 



(£rom clone DKFZp56401763) 



1 11 21 31 41 51 

I I I t I I 

KRAVPLPAPL LPIiLLLAZiLA APAARASRAE SVSAPWPEPE RESRPPPGPG PGNTTRFGSG 60 

AAGOSGSSSS NSSGDALVTR ISZLIiRDLPT LRAAVIVAFA FTTIjLIACLI« LRVFRSOKRL 120 

KKTRKYDIIT TPAERVEMAP UfEEDDEDBD STVFDIKYRV SLPAAIiRRQL PGCQTLLTVP 180 

VPPPFILDID LPARCSGRPD GOIRPGKTCF PAHWHPVESW SAATWGVKDW TWKPSCVGGV 240 

ETKTNVMYICr PAPSCVSGIC SDCHWQARFH VTTMBLLLPP PGHPFKVPPT STPHGFRQLQ 300 

UTLMEKLDSS ALRRNTRAPS ARCI«PLVI*AE MAAABSDLFN PWWHPSATQS PIKTLYTQTM 360 

STLGLDVFCG AGQRGTFCED KAVTKVLQGS 8FSKQLRWKP ALBSGFPHHL RLLRECPPLS 420 

.THPVRLAR8D ARGQASLTGR RVFRRPRQSL HGGG8AGTAT CLLVLKILLR RHPHLDLFYK 4 BO 

ICLPCCAVEH LRBAKRSSVT VLASFEQSPQ KAAAAHGSPV KRGPSGQLTR HTCPGWGITH 540 

ANLQTIFDTQ GQBGPREDVT HPGGDLDGVA HFYLBEBOFQ ZXHROQKMVIM SBEOPPSL7G 600 
CERLTGSHHF SSHSKSHSFb SPRQPLFLSR P 

SWING 



A91 DMA SBQUEWCB 

Gene name: G protein-coupled receptor 64 
imigene number i Bs. 184943 



873 
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Probeset Accession fit 
Nucleic Acid Accession fti 
Coding sequence! 



AA43S577 
NM 005756 

73-3117 (underlined sequences correspond to start and stop codons) 



1 11 31 31 41 51 

I I I I I t 

AGCCAGCCOG AGGACJGCGAa C06CA66TGT GCACAOAGGT TCTCCACTTT GTTTTCTGAA 60 
CTCGOGGTCA GGATOGnTT CTCTGTCAGQ CAGTGTGGCC ATGTTGGCAG AACTGAAQAA 120 
GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATPTCTC TTCATGTOGT TCTGG TAACA 180 
TCCCTGGAAQ AAOATACTQA TAATTCCAOT TTGTCAOCAC CACCTGCTAA ATTATCTQTT 240 
GTCAGTTTTG CCCCCTCCTC CAATGftGGTT 6AAACAACAA 6CCTCAATQA TGTTACTTTA' 300 
AGCTTACrCC CTTCAAACGA AACAGAAAAA ACTAAAATCR CTATAOTAAA AACCTTCAAT 360 
GCTTCAGGOG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTO CAATGACTCA 420 
GCATTTTTTA GAGGTGAGAT CATQTTTCAA TATGATAAAG AAA6CACTGT TCCCCAGAAT 480 
CAACATATAA OQAATGGCAC CTTAACTQQA GTCCTGTCTC TAASTGAATT AAAAOGCTCA 540 
GA6CTCAACA AAACCCTGCA AACCCTAABT 6AGACTTACT TTATAAT6TG TGCTACAGCA 600 
GAGQCCCAAA OCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 660 
TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCCIAC CAATQQAACA CTOCTQCTGT 720 
TCTGTCAGGA TAGCCTGCCC TTCCTCCCCA GAASASTT6G 6AAASCTTCA GTGTGACCTa 780 
CAGGATCCCA TTGTCTGTCT TGCTGAOCAT CCACSTGGCC CACCATTTTC TTCCft GCCAA 840 
TCCATCCCAQ TGGTGCCTOO GGCCACTGTG CTTTCCCAiGG TCCOCAAAQC TACCTCTTTT 900 
GCTORaCCTC CAGATTATTC ACCTGTCACC CACAATGTTC CCTCTCCAAT AGGGOAQATT 960 
CAACCXXTTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTOCCAT TGACATGCCC 1020 
CCACAGTCTG AAACGATCTC TTCCOCTATG CCCCAAACCC ATQTCTCCGG CACCCC3VCCT 1080 
0CTGT6AAA0 CCTCATTTTC CTCTCCCRCC GTGTGiaGCC CTGOGAAIGT CAACRCTACC 1140 
A6CGCA0CTC CTGTCCAOAC A0ACATOOTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200 
CRAGTGTTGC AGATGGAGAA GGCTCTGTCC TT0GGCA6CC TGGAGCCTAA CCTOGCAGGA 1260 
GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCOSC CTGACATGCT GGCCCCTCTO 1320 
GCTCAAAGAT TGCT6AAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACAOO 1380 
ACTATAAGTC TAACCTCGCX: TTCTTTGGCT CTGOCTCTOA TCAGAOTQAA T6CCAOTAGT 1440 
TTCRACACAA CTAOCTTTOT GGOXAAGAC CCTGCAAATC TTCAGGTTTC TCTGOAAACX: 1500 
CAAGCTCCTG AGAACAOTAT TGGOVCAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560 
CCAGCTCATG ACATGQAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 
TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTQATCA GCTAOGTCAT ATCATCX3AGT 1680 
GTTGCAAACC TQACGGTCAO 6AACTTGACA AOAAAOQTGA GAGTCSUSVTT AAA6CACATC 1740 
AACOXSAOCC AGQATGAGTT AACAGTORGA TGrOTATTTT GGOACTTGGO CAOAAATGGT 1800 
GGCAGAGGAG GCTGGTCAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC 1860 
ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTQC TGGACCTATC TAGGACATCT 1920 
GTGCTGCCTG CTCAAATGAT GGCTCTGACO TTCATTACAT ATATTGOTTG TGGGCTTTCA 1980 
TCSUITTTTTC TGTCAOTOAC TCTTGTAACC TACATA6CTT TTGAAAAGAT CGGGAGGGAT 2040 
TACOCTTCCA AAATCCTCAT CCAGCTGTX3T GCTGCTCTGC TTCTQCTQAA CCTGGTCTTC 2100 
CrCCTGQACr COTGQATTOC TCTOTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 
TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAQAAGC ATTCCATATG 2220 
TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATC08AA AATACATCCT TAAJVTTCTGC 2280 
ATTGTOGGT T GGGGGGTACC AGCIGTGQIT GTGACCATCA TGCTQACTAT ATCCCCAGAT 2340 
AACTATGGGC TTGOATCCTA TGGGAAATTC CCCAATGGTT CAOOGGATGA CTTCTGCTGQ 2400 
AXCAACAACA ATGCAQTATT CTACATTACG GTGGTGQQAT ATTTCTGTGT GATATTTTTG 2460 
CTGAAC3STCA GCATGTTCAT TGTGGTCCTG GTTCA6CTCT OTOQAATTAA AAAGAAGAAG 2520 
CAACTQGGAG CCCA6CGAAA AACCA3TATT CAAGACCTCA GGAGTATG6C TGGCCTTACA 2580 
TTTTrACTGG GAATAACTT6 G6GCTTTGCC TTCTTTGOCT GGGOACCAGT TAAG8TGA0C 2640 
TTCATQTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 
TQTGTGGCCA AAOAAAATGT CAGOAAGCAA TGGAGGOGGT ATCTTTGTTG TGGAAAGTTA 2760 
CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA QAAGCAQACT 2820 
OTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC 2880 
TCCACX:aCAC TGCTAGTGAA TAATGATTGC TCAGTACAOQ CAAGOGOOAA TGGAAATGCT 2940 
TCTACAGAGA GQAATGQGOT CTCTTTTAGT GTTCA6AATG GMKIQ^SGTIG CCTTCAOGAT 3000 
TTCACTGGAA AACAQCACAT GTTTAAOQAQ ATUSGAAOATT CCTGCAATGG GAAAQGCOGT 3060 
ATGGCTCTCA QAAaOACTTC AAAGOGGGGA AGCTTACACT TTATT6A6CA AAT GTGATT C 3120 
CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTOCAATT TTACCTTTTA 3180 
CACAATGTGA GATGTATGAA AATCRACTC» TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 
AAGCTAATTA AQGGCX3ATGA TTATTATTAC AAGAAGAAAC CAAQACATTA CACCATGOTT 3300 
TTTAGACATT TCTGATTTGQ TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 
ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAQTGAACTT TCAQCTACCT 3420 
TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAA6 CAACTGTTGA CTTCAGOCTG 3480 
TTGGTGAGTT TAGTTGTOCA TGCCTTTGTT GTATATAAGC TAAA7TCTAG TGACCCAT8T 3540 
GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600 
TTGTAGAATC ATGGTTGTTT TGTCTCACQT GATAATTCAG AAAATCCTTG CTCGTTCCGC 3660 
AAATCCTAAA GCTCCTTTTQ GAGATGATAT AGGATGTGAA ATACAQAAAC CTCAGTGAAA 3720 
' TCAAGAAATA ATGATCCCAO CCAQACTGAG AAAATGTAAG CAGACA6TGC CACAGTTAGC 3780 
TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACIGGGCASA CACAGCCCTA 3840 
TGGGTCATGG TTTQACAAAC AGAGTGAGAO ACCATATTTT AGCCXXACTC ACCCTCTTGG 3900 
GTGCAOQACC TGTACAGCCA AACACRGCAT CCAATATGAA TACCCATCCC CTQACCGCAT 3960 
CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACX: TTGGCCACAO 4020 
AGAGQGATGA ACTGTCATCC AGACCATGTG TCAGGAAAAT TGTGAAOGTA GATX5AGGTAC 4080 
ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAOAGXAG ACTAGGATTC 4140 
CTTCTCrrAA AAAGGTAC31T ATATATOGAA AAAAATCATA TTOCOGTTCT TTAAAAGGCA 4200 
ACTGCATGOT ACATTGrTGA TTGTTATGAC TGGTACACTC TGGCCCAOCC AGAGCTATAA 4260 
TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAO GGGAOTAGCT ATTGGGAACA 4320 
GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380 
TATACAG6QT CTATCTTGCT TOCIACCTAC ATCTGCTTGA GCAOTGCCTC AAGTACATCC 4440 
TTATTAGGAA CATTTCAAAC COCTTTTAOT TAAOTCTTTC ACTAAGOTTC TCITGCATAT 4500 
ATTTCAAaTO AATGTTGGAT CTCM3ACTAA CCATAGTAAT AATAC ACATT TCTQTQAGTG 4560 
CIQACTTQTC TTTOCAATAT TTCT T T' X C m ATTTATTTAA TTTTCTTCTA TTTATAT6TT 4620 
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AAAATCAAAA ATQTTAAAAT CAATQAAAIA AAITTGCAQT TAAGA 
A92 Protein sequence 

Gene name: G protein- coupled receptor 64 

Unigene numbers Hs. 184942 

Protein Accession ft: NP_005747.1 

Signal sequence t 1-38 

GPS domain: 564-615 

Transmembrane domains: 624-646. 660-682, 688-710, 733-755. 783-805, 828-850, 858-880 

Cellular Localization: plasma membrane 

1 11 21 31 41 51 

I I I I i I 

MVPSVRQOGH VGRTBEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP PAKLSW8FA 60 

PSSNBVETTS UIDVTl^LLP SNETSKTKIT IVKTFNASGV KPQSNICNLS SICNDSAFFR 120 

GBIMFQYDKB STVPQNQHIT NGTLTGVLSL SELRRSEUTK TLQTLSETYP IMCATABAQS 180 

TLHCTFTXKli NIITMNACAAI AALBRVKIRP KEHCCCSVRX PCPSSPEBLG KLQaXiQDPI 240 

VCLADHPRGP PPSSSQSIPV VPRATVLSOV PKAT8PABPP DYSPVTHNVP SPIGBIQPLS 300 

PQPSAPIASS PAIDHPPQSB TI8SPMPQTH VSGTPPPVKA SPSSPTV8AP ANVNTTSAPP 360 

VQTDIVMT8S ISOLEMQVLQ MEKAIiSLOSli EPNIAGSflN QVSRLLHSPP DMLAPLAQRI. 420 

UCWDDXGLQ LNP8NTTIBL TSPSZALAVI RVKASSPNTT TFVAQDPANL QVSLETQAPE 480 

NSIQTITLPS SLMNNLPAHD MELASRVQPH FPBTPAIiPQD PSLENIiSLIS YVISSSVANL 540 

TVRNLTRMVT VTLKHINPSQ DELTVRCVFW DLGRNGGRGG WSDNGCSVKD RRLKETICTC 600 

SHX.TSFGVLL DLSRTSVLPA QMMALTFITY IGOQLSSIFL SVTIjVTYIAF EKIRRDYPSK 660 

XLIQLCAALL LLNIiVFLLDS WIALYKMQGL CXSVAVPLHY FLLVSPTHMG LGAFKMYLAL 720 

VKVTimrXRK YXliKFCIVGW GVPAVWTIX LTISPONyGL OSYGXFPNGS PDDFCHllQQI 780 

AVFYXTWOy PCVXFLI^S NPIWLVQbC RXKKKKQLQA QRKTSXQDLR SXAOLTFIiUS 840 

ITWGPAFPAW GPVNVTPMYL PAXFNTLQGP FIFXPYCVAK KNVRKQWRRY LCOGKLRLAE 900 

NSDHSKTATN GLKKQTVNQG VSSSSNSIiQS SSNSTKSTTL LVKNDCSVHA SGNGNASTBR 960 
N6V8F8VQNG DVCLHDFTGK QHMFNEKEDS CNGK6RMALR RT8KR6SLHF XEQM 



FibroaiB 



A93 DMA SEQUENCE 
Gene name: 
unigene number: 
Probeset Accession #t 
Nucleic Acid Accession ft : 
Coding sequence: 



1 

1 

C06CAGA6GA 
CGCCCCTGCC 
OQAOTGGAOC 
GGGTGCGGCC 
ATGCCTCTQC 
GGGAACQCGG 
TGTCACTATO 
T6TGAAGCTA 
AGATGCTTTC 
AAACCCOGGC 
CTCAGTGGCC 
ATAAACTGTC 
TCAGGACTCC 
GGTAAAGTCA 
AAATGTCACA 
AATGAATGTA 
GGGTCCTTCA 
ATCOCTGAAA 
AAGAAGTTGC 
CCAQAACCCA 
ATAGTTTCCA 
6AGGGGCTTG 
AQOCTQCQAO 
CTGGTCCAAA 
GACTGCAGCT 
TQGAATCCTO 
GGTCACAAGA 
AACTTCIOTT 
TTTGTQAAAA 
TGGAAGACAG 
GAAGCAGAAC 
TCAGGCTTAT 
TT6ACTTT0T 
TTAOAATTAC 
TCTTGTATAA 
TTTCTGAATC 
CAOTATATCT 
TAOAAAAAAA 
TATQACATCA 
TGTATATTTA 



11 
1 

GCCTCGGCCA 
GCGGTGCCTG 
GGAGQAOCCG 
G6C6CCCTCC 
CCTGGAGCCT 
CCAGTGCAAG 
GAACTAAACT 
CATGCGAACC 
CAGGATACAC 
CATGCCAAGA 
ACATGCTCAT 
AGTAGAGCTQ 
CCCTGGCCCC 
TCTGTCGCTA 
TTGGTTTOGA 
CTATQGATAG 
AGTGTAAATO 
ATTCTGTQAA 
TTGCTCACAA 
CCAGGACTCC 
GAGGOQGOAA 
A66ATGA6AA 
OAOATGTGTT 
6GAAA6C6CT 
TCAATCATGO 
CTGATOGAGA 
AAGACATTGO 
TGCTCTTTOA 
ACAGTAACAA 
GQAAAATTCA 
GTGGCAAGGG 
GTCCAGATAG 
ATOTCAOTTC 
TAGCTGAAAA 
GATATOCCAA 
TTTCCACATT 
GATTTGTATA 
ASCACAQAGA 
AAGATAGACT 
AATTCTTT G T 



21 
I 

GGCTAGCCAG 
QCCTCCCCTC 
AGCGGCTQAG 
06AGGGGGGC 
TGCGCTCCCQ 
GCATCACGQG 
GGCCTGCTGC 
TGGATGTAAG 
C6GGAAAACC 
CAQATOTGTG 
GCCA6AT6CT 
TGAAOACACA 
AAATGGAAGA 
CAATGOAAQA 
ACTGCAATAT 
CCATACGTGC 
CAAGCAX3GGA 
GGAAGTCCTC 
AAACAOCATQ 
TACCCCTAAO 
CTCTCATGGA 
AAGAGAAGA6 
TTTCCCTAAS 
AACTTCCAAA 
GATCTGTGAC 
TAATGCTATT 
CCGATTQAAA 
TXACGQGCTQ 
TGCCCTGOCA 
GTTGTATCAA 
CAAAACOGGC 
CCTTTTATCT 
CCTGGTTTTT 
ATTGTAATGT 
TATTT6CTTT 
ATATTATAAA 
AOTAAGTTGA 
AATOTTTAAC 
TTTOCCTAAG 
AATAATAATA 



BGP-like-domaini multiple 6 

Hs. 12844 

N67551 

NM_015507 

241-1902 (underlined sequences correspond to start and stop codons) 

31 41 51 

I I I 

GGOGCCCCCA GCCCCTCCCC AGQCCGCQAG 60 

CCAOACTGCA GGGACAQCAC CGGGTAACTG 120 

QAGAOAOGAG GGSGCGOCTT AGCTGCTACQ 180 

TCAGGAGGAG GAAGGAQGAC COGTGOGAGA 240 

CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

TTQTTAGCAT CGQCAOOTCA GCCTGGGGTC 360 

TAGGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TTTGGTQAGT GGGTOGGACC AAACAAATOC 480 

TGCAGTCAAG ATGTGAATGA 0TGTG6AAT6 540 

AATACACAOG GAA6CTACAA GTQCTTTTQC 600 

A0GT6TGTGA ACTCTAGGAC ATGTGCCATG 660 

OAAGAAGGGC CACA8TGGCT GTGTCCATCC 720 

GACT8TCTAG ATATTGATGA ATGT6CCTCT 780 

TGTGTQAACA CATTTGGAAG CTACTACT6C 840 

ATCASTGGAC QATATGACTO TATAGA7ATA 900 

AGCCAOCATQ CCAATTGCTT CAATACCCAA 960 

TATAAAGGCA ATGGACTTOG 6TGTTCTGCT 1020 

AGAGCAOCTG GIACCATCAA AGACAOAATC 1080 

AAAAAGAAOO CAAAAATTAA AAATGTTAGC 1140 

OTOAACTTGC AGCCCTTCAA CTATOAAQAO 1200 

GGTAAAAAAG 6GAATGAAGA GAAAATGAAA 1260 

AAA6CCCTGA AGAATGACAT AGA6GAG0GA 1320 

GIGAATQAAG CA6GTQAATT CGGCCTQKIT 1380 

CTQGAACATA AAQATTTAAA T A TCTOG O TT 1440 

TGGAAACAGG ATA6AQAAGA TGATTTTGAC 1500 

GGCTTCXATA TGGCAGTTCC GGCCTTGOCA 1560 

CTTCTCCTAC CTGACCTGCA ACCCCAAA6C 1620 

GCOSGAGACA AAOTGGGQAA ACTTOlAOra 1680 

TaOQAGAAGA CCACQAOTGA GGATGAAAAO 1740 

GGAACTGATQ CTAOCAAAAG CATCATTTTT IBOO 

GAAATOQCAO TGGATGGCGT CnXjCnvn ' 1860 

GTGQATQA CT QAA TGTTACT ATCTTTATAT 1920 

TTGATATTOC ATCATAGGAC CrCTGGCATT 1980 

ACCAACAQAA ATATT A TTGT AAGATGCCIT 2040 

AAATATCATA TCACTOTATC TTCTCAGTCA 2100 

ATATGQAAAT GTCAGTTTAT CTCCCCTCCT 2160 

TGAGCTTCTC TCTACAACAT TTCTAQAAAA 2220 

TGTTTQACTC TTATQATACT TCTTGGAAAC 2280 

TGGCTTAOCT GGGTCTTTCA TAGCCAAACT 2340 
TCCAAATCAT CAAAAAAAAA AAAAAAAA 
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5 

10 
15 
20 
25 
30 

35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



A94 Protein sequence i 
Gene name: 
Unigene niunber: 
Probeset Accession St 
Protein Acceesion 9t 
Signal sequence t 
Transmembrane domains t 
MAM domain; 
EGF domalni 

Cellular Localisation t 



EGP-like-doniain; oniltiple 6 

Hs. 12844 

N67551 

HP 056322.2 

1-21 

none found 
402-546 
BO-259 
secreted 



1 

I 

KPLPNSLALP 
CBATCEPGCK 
LSGHMLMFDA 
GKVICPVNRR 
GSPKCKCKQQ 
PBPTRTPTPK 
SLRGDVPFPK 
HNPADRGNAI 
FVKNSNNALA 
SGLCPDSLLS 



11 
I 

LLLSHVA66F 
FGECVGPNKC 
TCVNSRTCAM 
CVOTFGSYVC 
YKGNOLRCSA 
VNLQPFHYEB 
VNEAOEFGLI 
GFYMAVPALA 
WEKTTSBDEK 
VDD 



21 
I 

QNAASARHHG 
RCPPGYTQKT 
INCQYSCBDT 
KCHIGFBLQY 
IPENSVKEVL 



LVQRKALTSK 
GHKXDZGRIiK 
WKTQKIQLYQ 



31 

I 

LLASARQP6V 
CSQDVNECGM 
EBGPQCLCPS 
ISGRYDCXDI 
RAFGTXXDRI 
QKKQNBEKMK 
tiEHKDUIISV 
LLLPDLQPQS 
GTOATKSIZF 



41 

I 

CRYGrKLACC 
KPRPCQHRCV 
SGLRLAFNGR 
NECTMDSHTC 
KXLLAHKNSM 



DCSFNHGIO) 
KFCLLFDYRL 



51 
I 

YGWRRNSK3V 
NTHGSYKCPC 
DCU>IDECAS 
SHHANCFNTQ 
KKKAKIKNVT 
KALKNDIEER 
WKQDREDDFD 
AGDKV6KLRV 
BZAVD6VLLV 



60 
120 
IBO 
240 
300 
360 
420 
480 
540 



GLIOBLASTOMA 

A95 DMA SEQUENCE 
Gene name: 
unigene xmihben 
Probeset Accession 
Nucleic Acid Acceesion H: 



BSTs 

Ks. 293616 
AW043782 
none found 



Coding sequence j 38-1075 (underlined sequence corresponds to start and stop codon) 



1 
I 

AGCAA06A0G 
OCTGCTGCIG 
GTGCAACATA 
GT6TGACGG0 
GrCX3AAATGT 
CTTCOGGTGC 
AAACCCTCTO 
GAGCTTCATC 
AAGTTCTCAA 
TTACCCCAGC 
CCT6CTGGCA 
GCACCGGCTG 
CTGCRACGTC 
GAATGCGTCX3 
TOOGTGGTAT 
CGACCTGCCC 
CA6CAGCCTC 
GGGCACTGCT 
AGTTATTCCA 
TGCTCATGGG 
AACTATCTCT 
TGACATGATC 
CACCCTCATT 
AAATAGGCTG 
OGCTGGACCC 
ATGATCTAAC 
ATC3UUUICCT 
AAGAAAACTT 
AAGGACTCTG 
CTCATTCT6A 
QA6CCCCTCC 
TACACCTGCC 
ACCTGCCCGT 
OTATGTCCCT 
CTCCAAAGTT 
ACTG6TTTCT 
CTGCACTGTG 
GGTCA6QGTC 
AGACAATTTO 
TGAAACASTG 
AGCTOTCTCT 
ACACCCTTGC 
ACATTT6TGC 
AGAGGOACTC 
TTCTCTGTQT 
AaOTQTTGTT 
CCACTCCGGG 
AACCrGTTTO 



11 
I 

CCGGGCA6CG 
AGCAGCX3CCG 
CCAGGCAACT 
CTGCCTGACT 
GQCCCAACCT 
AATGGGTTTG 
CTTTGCTCCA 
TGCGATGQAC 
QAACCCGGCA 
ATCACCTATO 
CTGGTCTTGC 
CAGCACCCTG 
ACCTACAACG 
GAAGTAGGCT 
GACCTTCCTC 
CCCTACOGCT 
CTGAGOGfTGQ 
GA6CCCAGGQ 
AAGTCCATAT 
AAGCTCTTTA 
OCATTCCCCT 
TGTT6TGCGT 
TTTCACATTA 
GGAGAGAGCA 
AATTCTCTCT 
CAGGAGGCCA 
GCTTTGCACA 
TGGAOQTSAO 
AAACCATCTA 
GAGCTTTCCT 
CAT6AGTTTA 
C7GGCTCTAC 
AGCCAAGGAA 
GTG6CCCACA 
CCCTTAACAC 
ATCACAGGTG 
CAOGCTCCTC 
AGGCCTCTOC 
GAOTCAAGAT 
TGTTTGTTTT 
TT ' rm ' UTIT 
CCCGCTGAGC 
ATTGTTGCAC 
CTCTCTCCCT 
CCAQTCAGCC 
T66CAAGAAA 
CA6CT0TCAC 
AC6CTAATTA 



21 

1 

GGAGCGOOGG 
CGGAGAGCCA 
TCATGTQCAG 
GCTTCGACAA 
TCTTCCCCTG 
AGQACTGTCC 
CCGCCCGCTA 
AGAATAACXO 
6TGGGCAGGT 
CXATCATCGG 
ACCACCAGCG 
TGCTGCTGTC 
TCAATAATGO 
CCCCACCCTC 
CACCQCCCTA 
CCCX3GTCCGG 
AAGACACCAG 
ACTCTQAGCC 
GGGTTAATCT 
AGCACCTGTA 
CCTCCCCCAG 
CTTTTCTOTC 
TTCTQTTTCT 
ATGTTTCTGT 
GCTGGGTAGT 
TCACTOQATQ 
ATCCTATTTG 
TAACACOCTT 
O0CT6TAXIVA 
CA6CAGCATA 
TCCAAGTTCT 
AGCCACTTAC 
TOAGGACCIA 
CCC3\GOCT6T 
TTGCAAAGTC 
AGA6CCAT0T 
TTG0CAAG6T 
CAACATOOCA 
TTTOCATTTG 
TTCOCTTCTA 
TTCCTTTAAC 
CCOGTGATAA 
TTT6AGGTTA 
CCX3TGTATAG 
ACA6GGCC0Q 
CCACACTGAC 
CCATTCAGAA 
AAACAGAGGC 



31 
I 

COGCGCCATG 
GCIGCTCCCC 
CAATGGA06G 
GAOTGATGAO 
TGOCAGCGGC 
OGATGGCAGC 
CCACIGCAAG 
TCAASAC AAC 
GTTTGTQACT 
CAGCTCOQTC 
QAAOCQGAAC 
CGGCCT6GT0 
CATGCAGTAT 
CTACTCOGAG 
CTCTTCTGAC 
GAGTOCCAAC 
CXyVCAGCCCQ 
CAGCCAGGGC 
GCTCTGACTT 
AGGAT6TCTC 
ACTTCAGAGA 
AG6TCACTCT 
GTTG6A6AGA 
GCTATATTGG 
TACCTTATAG 
GTCACCCCCC 
AT600CCCAO 
CAGCTOT CGC 
ATTGTGGCTT 
TATCATCAGC 
CAGCTCCTAA 
CTGGTTTCTO 
ACTTQAGTT8 
CTTGCTC3VTT 
CTTTTTACCT 
TCAATACCTC 
CCCAATACCA 
OTAOTTTCTC 
GATCTATTTT 
GTTAAGGGAC 
AAGGTCCAAA 
CAAGTCACTC 
TTATTTATCA 
TCTCTATOTT 
CCTCCCTGCA 
T6ATGAGGG0 
CTTCTTTOOG 
TGCAOOAAGT 



41 
I 

TGGCTGCTGG 
GGGAACAACT 
TGCATCGC3SQ 
AAG6A0TGCC 
ATCCATTGCA 
GATGAAGAGA 
AAGQGCCTCT 
AGTQATGAG6 
TCAGAaAACC 
ATTTTTQTGC 
AACCTCATGA 
GTCCTGQACC 
GTG6GCAGCC 
GCCTTGCTOT 
AGGGAATCTC 
AOTOCCAGCr 
GG6CAGCCTG 
ACTGAAGAAG 
GTTGCCATTC 
AAGTTACAGT 
TGTTTTTCTG 
TGCCTT6GGA 
CAGCATATAA 
AT6CTCAGAA 
CATTTGGGGA 
CAAAAAAATT 
TTCAGCAQAG 
AAOGTTATTT 
TAGAAATTTO 
CTCATCCTAA 
AATGCAGGCT 
GACTGTCACC 
6CCCAAAGTC 
CATGCAGCCT 
GTGCATTTGG 
CAGCAAGCTC 
GCACCTCTAO 
CTCTGAGACA 
AAATCTTTTA 
TATTTATATG 
GAAAGATGCA 
CAGACTAACC 
AGTTCTTGAA 
TGTGCTAGTT 
GGAATAAGGG 
TAAAATGGAA 
CAGCTGAAOA 
GGGGCTAAAG 



51 

I 

GGCOGCTGTO 
TCACCAATQA 
G080CIOOCA 
CX»AGGCTAA 
TCATTGGTCG 
ACTGCACAGC 
GTATTGACAA 
AAA6CT0TQA 
AACTT6TGTA 
TGGTGGTGGC 
CGCTGCCC3GT 
ACCCCCACCA 
AGGC6GAGCA 
ACCA6AGG0C 
TGAACCAAGC 
CCCAGGGAGC 
GCCCCCAGQA 
T ATAAG TOOC 
TAACAATTTO 
TTGGGATATT 
GCQTCTCAGT 
CXX36AGATCA 
AACAOTATTG 
GTGCAGQAGA 
TTTGGGTTAG 
CCATTTQAGC 
TGAGTGGCCA 
TOOTTTTGrO 
CCCAAGAAT6 
AATAGGCAGG 
GCCAAGACCC 
CTCCCAGCTG 
TGAOCTGGCT 
CAAGACTGGC 
ACTTGAGGAC 
TCCTGGCTCC 
TTAGAGTTAG 
CATGGGCAAO 
GAAAT6CATT 
TGTATAGGAA 
AAAGGA GATC 
TGTGTGCCAO 
6GAAGCAGAA 

GTAAAAOQTT 
CCAGGTAGAG 
AATGTTCAGT 
TGGCATTCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
3520 
2580 
3640 
2700 
2760 
2820 
2880 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



TGATCCTGTT 
AAGCTAGCOl 
AAOGGAAAGG 
CTQAGTAATC 
CMTTCATCT 
TTTCTOGTGC 
GAGTTAATCT 
AGATAAG6GA 
TGATTTTTTT 
TTTTTGG0G6 
TTTTTAACTC 

aaaataqtct 
agtta cactg 
tttttaaaaA 

GAAAfiGTTG'i* 
TTATACTTTC 
AAMMAAAAAC 
T6GGGCGGGQ 
GAAACCCTGO 
6GAAAC06CA 
GGACTCAACC 



CTGTAGACTT 
CTGGTATTTT 
AACCTAGCTG 
CAATAAAGAA 
CCT6TQA0TC 
TCTGGAAGTT 
CACTCGCTTT 
TGCCTACTAA 
AATGAATGTT 
GA6GGGCTTT 
ATTCCAACCA 
CATCTCTTTT 
TGATGACTGG 
ATGC AACTA A 
GTGTCGTTGC 
TAATAAATTT 
AVnfWTTGGGO 
GGGCCCACGT 
CCAAGAAAAA 
GAGTQTTGOG 
AGOAGGACCC 



TTCnrCTTT 
GTTTTGTTTA 
CCTGTATCTT 
CTTTrOATQA 
AGAAGGOCTT 
GTTTAGAOGA 
TCTGCTTCCA 
TCCTTTTTTA 
TTTA AAAATA 
GTTTTOaAC 
0C2AA6CTTTT 
TTTCTCAAAT 
CCTATTA CCT 
0T8GTTAATA 
TTTTTOTOTT 
GCAGTTTCAT 
GG6CTTGGGC 
AGOTAGGOCXS 
aOTGGOGAGA 
TAAACCACAC 
AAGGGAAOCC 



TTTTAACCAA 
AAAAAAAAAA 
TCATTTTTAA 
CAGOGAGAAT 
TATTTCTCCC 
AAGAATTCTA 
GQCATCTTAG 
AAACAAACAO 
TATAAATAGO 
TCAACATQGC 
TTATACATTO 
GAGATCOGTO 
GACTCAG CTC 
GTQTGTGAOQ 
TTG8TTAGGC 
TCTTTCTGTT 
CTOGQAAAAA 
ACCA0G06GG 
ATTCTCCACA 
COSAAGAGAG 
GATAGAGTAC 



ATCCAAAGGA 
GAAAGAAAOA 
AATAGCACTT 
GTGTTAGAAC 
TTT6AT0GGG 
ATTTTAATTA 
GAAAAACAAA 
QGACATTTTT 
ACACCAAAGC 
ACATTAOTGG 
CCTAAATCTA 
TTTTATTTTA 
CCTCTACXnT 
CTCAAAtSTTA 
TTGGTTTTGT 
TGTGCAA AWG 
OTTTTTAACA 
CCCAAAOGGG 
CCAQAAAAAA 
AACTCAGAAO 
O 



TOTTACAGAA 
AAQAAAQAAA 
GAOTTATTTT 
TCTGGCTGAA 
CCCCTTCTTC 
ATTGCX3CAGT 
TGGTTTTAGT 
ATTATAOATT 
GGCAGGGTTT 
CCAGCAATAT 
06CCAACCAG 
GCATTAAATT 
GAAATTGACA 
ATGTAAACTG 
TTTTTAATTT 
GWMCTAMABM 
CCACTTOGGG 
ACCCCAGAAG 
ACG06CCGGG 
CACACAAGCQ 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
34B0 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 



A96 Protein sequence i 
Gene name: 
Ubigene number: 
Probeset Accession #i 
Protein Accession #t 
Signal sequence t 
Transmenddrane domains t 
LDLa domains: 
Cellular Localisation! 



1 11 31 

I I 1 

MWLLGPLCLIi LSSAAESQLL PGNNPTNECM 
EKECPKAKSK CGPTPPPCAS GIHCIIGRFR 
KNGLCIDKSF ICDGQNHCQD NSDBBSCBSS 
VIPVLWALL ALVLHHQRKR NHIiMTLPVHR 
yVASQABQlTA SBVGSPPSYS EAXiU)QRPAN 
NSAS80AASS LLSVEDTSR8 P6QPGPQEQT 



ESTs 

Hs. 2 93 616 
AH043782 

none found 

1-17 

169-191 

28-66« 70-106, 112-149 
plasma membrane 



31 41 51 

\ I I 

IPGNPMCSNG RCIPGAWQCD GLPDCPDKSD 
CNGFBDCPDG SDEENCTANP LLCSTARYHC 
QEPGSGQVPV TSEHQLVyyp 8ITYAIIG3S 
UJHPVLLSRL WLDHPHKCM VTOJVNNGIQ 
YDLPPPPY8S DTBSm QADIi PPYRSRSGSA 
ABPRDSBPSQ QTEEV 



60 
120 
160 
240 
300 



LUWG 

A97 DMA SEQUENCE 

Gene name< putative GPCR, Weakly similar to dJ365012.l 

Ukiigene nuirtben Hs. 2 5 68 97 

Prdbeset Acceasioo #i BB001836 

Nucleic Acid Accession ft* BB001836 

Cddlng sequences x-a95S (underlined sequences correspond to start and stop codons) 



1 
I 

ATGGCOOTCA 
ATGTTAAGTA 
GGTTTAGOCC 
GGGAATATTG 
AACATAQATG 
GAGAAGTCAA 
QAATATCAGC 
AATTTTCTQA 
AOAGCAAAGG 
GACAGCTACA 
GCTGGAGCAC 
TGTOAGAQAA 
TTGAATTCAT 
AAAG CATATG 
A6CATCX3TTG 
ATTQAACATG 
GGCTCTTTCA 
AAGGATGATG 
TOTGAGTCCT 
CTGAAC3UU3A 
GTGCAAAATC 
TOQGTOQTGT 

tccaattcaa 
gtaaccaact 
acattagaaa 
aaattcattg 
tatcagatta 
gqgtcagacc 

CTGGQQAACA 



11 
I 

AATCTTTTTC 
TQATAGGGAT 
CACATAGCAC 
TTCACTCTAA 
CTTCCTCTTT 
ATTCTGACAC 
TGCTGCTTCA 
AOCTCTTGAA 
CTACCACAGA 
CCTGOTTTCC 
TCCCAAGCT6 
CAAAGATTTG 
CTTCXGCTAT 
AAAGAATTCA 
CTGGGTATQA 
TTGGC36AGAA 
GAQTGTTCGG 
AATATACCCT 
CTGGOTGGCA 
ATTTCAGTAT 
TTTCTGTCAT 
CGATTCTGAG 
CAATG6AGGA 
GQACAGTCTT 
ACATCAGCAC 
ACTGGAAAOG 
AAATGTGTCC 
AATTCCAGAG 
TTCTACCOGT 



21 
I 

CTTCAACACT 
CAGTGGCCCA 
TAAAGGTCTT 
AAAAQATGTT 
OCTGATTAGT 
AATCCAGCAA 
GGT6ACCTAT 
GCCTCCATTA 
CTGCAACAGC 
TCOCTCATGC 
TGAATGTCAT 
GGGCaCTTTC 
ATACTCC AAA 
AGQTTTTGAlO 
AGTTGTTG6C 
GGCTAAGACA 
AAAAGCCCAG 
QCCCTGCA6C 
GGTCATCAGG 
6ATT6TAGGC 
CATTCQQCAA 
CAATATTTCA 
TGTCATCAGT 
ACIGOOGGAA 
TCTGGT6CCT 
GATTCCAGTO 
CCAAAATACA 
ATCCCTTCCA 
TTCCAAAAAT 



31 
I 

GGGAATQATG 
GCTGGTGTGG 
GAACATGCAA 
TCTATTAGAA 
GATOGCCATC 
GTAACTATAA 
AGAGATTCCA 
TTATGGTCAC 
CTQA ATGGAG 
CTTQATCCCC 
CTCAACAACC 
AAAATTAATG 
TATGCAAATG 
TOGGTTCAGG 
TCCAGCAGTQ 
OCCCTTCACA 
TGTAATQACA 
AGTG6CTACA 
QAGACTTGTO 
AAT6CCACTG 
AACCCATCAA 
TCTCTOTCAC 
ATA6CTGACA 
GAAAAGTATG 
COtSACAGCTC 
AACAAAA6CC 
TCTATTOCCA 
GAAACTATTA 
GGAAATGCTC 



41 
1 

GGCTAGACCX: 
AAGACAAGTO 
TTAGAA6CTC 
TTTACTTACT 
AGTTAACCCA 
AAACTGATGG 
AGGAGAAAAG 
ATGGGCTAAT 
TCCTGCAGTG 
AGAACTGCTA 
TCAGCCAGAO 
AAAGGTTTAC 
GAATTGAAAT 
TC31CCCAATT 
CATCTOAACT 
AGCTGTTTCC 
TTGTCTTTGO 
GGGGAAACAT 
TGCTCTCTCT 
AGGCAGCTQT 
CCACAGTGGG 
TGGCCAGCCA 
ATATCCTTAA 
CCAOCTCACG 
TTCCTCTGAA 
AACTCAAAAO 
7CAGAG6CCQ 
TCAGCATGGC 
AGGTCAATGO 



51 
I 

A6ACXXAATC 
GQTCATACTO 
AAATGTTQAC 
CTTTCATGAA 
AGTGCACTCA 
CCGAGTGGAA 
AGATTTQAGA 
TAGAATTATC 
TACCTOTGAA 
CCTTCACACX5 
T8TCAATTTC 
AAAT6AGCTT 
TCAACTTAAA 
TOGAAATGGA 
GCTGTCAGGC 
ATTAGAAOAC 
ATTTGGGTCC 
CACAGCCAAG 
OCTTGAAGAA 
GTCATCCTTC 
GAATCTG6CT 
T7TCAGGGT0 
TTCAGCCTCA 
GTTACTAQAG 
TTTTTCTOOG 
GGGTTACAGC 
tgtottaatt 
ctcgttqact 
AOCTGTGATA 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



TCCAGGGTTA 
TCAAACCTQA 
GCAGGCTGCC 
ACCTCCTTCT 
TGGATCACCT 
QAGQCTTTGT 
AT6GTQAACA 
GTGGACACCA 
TTCTACCTCT 
ATCCTOGTOT 
TATGGGTGCC 
TAGAAAAGOA 

r nunvixx; 

ACAAA6CTCT 
ATCG6GGTG0 
GGAATAGCSAA 
AATGCATTOC 
CAACTTCTGT 
TCATCAGATT 
GGCCATTATa 
QTCTCAAATQ 



TTCAAAACTA 
G0CM3CCTCA 
ACCTAGTGAA 
COVTATTOAT 
ATOT GGGACT 
TTTGGAAGCA 
TAGCCCTGTC 
OGOTGAACCC 
CTTTGTTCTT 
TCCATCACAT 
CTCTCATTAT 
AAGATGTGTG 
CTGCACTGGC 
G6AGGCCGAC 
G6AAGAGCCT 
CAATAGTGGA 
AGGQATTTTT 
TCAACAAGTT 
TATCTGCCAA 
CATTTTCTCA 
AATAA 



rrCCATAAAT 
TTGTG Wrrf 
TGAAACTCAA 
GTCACCTTTT 
GGGTATCTCC 
C3ATTAAAAAA 
CCTCTTOATT 
TTCTGQACTC 
CTQQATQCTC 
G60CCA6CAT 
ATCTOTCATT 
TTGGCTTAAC 
TATTGTGGCT 
T6TXX3Q66AA 
CCTCATTCT6 
CAGCXaOAAT 
TATCTTATGC 
OTCTOCXriTA 
ACCCAAATTC 
TACTGQAGAT 



QAAGTrrrcc 

TGOGATTTCA 
GACATOGTQA 
OIGCCCTCTA 
ATTGGAAGTC 
A60CAAACCT 
GCTGATGTCT 
TGCACAGYTQ 
ATQCTTGGCA 
TTQAltSATGO 
ACCATTGCTG 
TOOTCCAATO 
GTGAACTTCG 
AGACT6A0TC 
ACOCCTCTGC 
CTGGCTTGGC 
TTTGQAATAC 
AGTTCTTGGA 
TCAAA60CTT 
TCCTCCGACA 



TATTTTTTTC 
OTCATTTGCA 
GGTGCCAATO 
CAATCTTOOC 
TCATTTTATQ 
CTCACACACQ 
GGTTTATTGT 
CTOTGTTCTT 
TGCTGCTGGC 
CTQTTGQATT 
TCACGCAACC 
GAAOCAAACC 
TTGTQGTGCT 
6GGATGACAA 
TAOOOCTCAC 
ATGTTATTTT 
TCTTGQACAG 
AOCAAACAGA 
TCAACCCACT 
ACATCATOCT 



CAAQATAGAG 
GTGGAACGAT 
TACTCACTTG 
OGTTGTAAAA 
CCTGATCATC 
TCOTATTTGC 
TGGTGCCACA 
TACACACTTC 
TTACOQQATC 
TT6CCT0GGT 
TAGCAATACC 
ACTCCTGGCT 
GCTAGTTCTC 
GGCC AO CATC 
CTGGG6CTTT 
TGCTTTACTC 
TAAGCTGCGA 
AAAGCAAAAC 
GCAAAACAAA 
AACTCAGTTT 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



A98 PROTEIN SEQUEHCB 
Gene name: 
Unigene number: 
Protein Accession #t 
Signal sequences 
Pfaro domain I 
Transmenibrane domains t 
Cellular Localization: 



putative GPCR, Weakly similar to dJ365012.1 
Hs. 256897 

none found 

1-17 

7tin_2 (561-820) 

545-567, 571-593, 605-627, 641-663, 677-699, 721-743, 767-789 
plasma membrane 



1 

I 

MHALLLCPSV 
LIRIIRAKAT 
QSVNPCBRTK 
QFRNG8ZVAQ 
FGFQSKDDBY 
AVSSPVQNLS 
LNSASVTNWT 
KRGYSYQIKM 
NGPVI8TVIQ 
QCTHLTSFSl 
TRRICMVMIA 
LAYRIILVFB 
KPLLAF9VPA 
LTWGFGIGTI 



11 
I 

l^OASGLSLL 
TDCMSLKGVL 
IWGTPKXNBR 
YEWGSSSAS 
TLPCSSGVItO 
VHRQNPSTT 
VLLRBEKYAS 
CPQNTSIPIR 
NYSINEVFLF 
LMSPFVP8TI 
LSZJiIADVWF 
HMAQHLMMAV 
LAIVAVNFW 
VDSQNLAUHV 



21 
I 

QSPVBBYQLL 
QCTCEDSYTW 
PTNDLLMSSS 
BLLSAIEHVA 
NITAKCESSG 
VGNLA8W81 
SRLLBTLENZ 
GRVLIGSDQF 
FSKIBSNLSQ 
PPWKWITW 
IVQATVDTTV 
GFCLGYGCPL 
VLLVLTSCLWR 
IFALLMAFQV 



31 
I 

IiQVTYRDSXB 
FPPSOiDPQN 
AIYSKYAtYGI 
EKAiCTAXiKKL 
WQVIRETCVL 
LSNZSSLSIA 
STLVPPTALP 
QRSLPETIIS 
PHCVFWDPSE 
GL0Z8ZGSLX 
NPS6VCTAAV 
IISVITIAVT 
PTVGSRLSRD 
RTVTITYCIV 



41 

I 

KRDLRNFLKIi 
CYLHTAGALP 
EIQLKKAYER 
FPLBDGSFRV 
8LLEBLNXNF 
8HFRVSNSTM 
LNFSRKPIPW 
MASLTLGNIL 
LQHNDAGCHL 
LCLI IEAIiPW 
FPTOFFYLSL 
QPSNTYXXUa) 
DKATIIRVGK 
K 



51 
I 

LKPPIiLWSHG 
SCECKLNNLS 
IQGFKSVQVT 
FGKAQCNDIV 
SMZVGKATEA 
EDVZSIADNI 
KGIPVNKSQL 
FV8KKGKAQV 
VNBTQDIVTC 
KQIKKSQTSB 
FFWNLMLGIL 
VGWLNWSNGS 
8LLXLTPLL0 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A99 DMA 3EQPEWCE 

Gene name: putative G-protein coupled receptor 
Unlgene number: Hs. 16085 

Probeset Accession ffi F07953 
Nucleic Acid Accession ft: KM 016334 

Coding sequence I 104-1471 (underlined sequences correspond to start and stop codons)' 



1 
I 

A6CACCTGGG 
G6GGA0TGGG 
CTCCAGCATC 
GOQCCAATTG 
6A0GTTTGCA 
GAATAGCAGC 
GGTTTTCATG 
TAAACAACGA 
ACTAGGAQAT 
CATCAGCCGG 
TGTCAACTGC 
TCTAGCCCTG 
6ATGGCAATG 
TTTCraGGQA 
TATTCAACAG 
TOATCTATAT 
TAATTTTCTT 
CAATATTGTT 
TGTGAATTAT 
TCTTGTTGGA 
CTTTTATOCC 
AATGG6CAT0 
CCGCACCATA 
TQATGTGATC 



11 
I 

AAAAGGCAQA 
AASTGGAGGC 
ATGATTACCT 
TTTAAAGACT 
TTTTCTTGCA 
TCCOGTTATT 
OTOCCTTTTT 
CTGCTTTTTT 
COCTTTCOCA 
OTTQOTGIGA 
CCATACACTT 
GAACGGC6AC 
GCACGGAQAA 
AT8ATAAAAA 
GAAGTGGATG 
GCTACCAAGO 
GGTTACTTTT 
TTTGATOQAQ 
CTGGGAATCC 
ATAATCATOS 
ATCTCTAGCA 
TACTTTGTCT 
ATCACTGAAG 
TTOCTGGTC A 



21 

I 

C0GTGTGAG6 
AGGAQCCTTC 
CCCAAATACT 
ATGA GATAO G 
CCATGTTTGA 
TTCACTGGAA 
ACATTGGCTA 
CCTGTCTCTT 
TTCTCAGC9CC 
TTGGAGT6AC 
ACATGTCTTA 
TGCTGCAAAC 
CAATGTTCCA 
GTGTTA0C31C 
CTTTGGAAGA 
AGAGAATAQA 
TCTCTATTTA 
TTGGGAAAAC 
AATTTGATGT 
TCACATCCAT 
GTAAGTCCTC 
CCTCTCTOCT 
TCCTTGGAGA 
GOSCTCrCTC 



31 
I 

GGGCCTGTGG 
CTTACACTTC 
ATTTTTTGGA 
TCftOTATGTT 
6CTCATCATC 
AATQAACCTG 
TTTTATTGTO 
ATGGCTGACC 
AAAACATGGO 
TCTCATGGCT 
CTTCCTCAGG 
CATGGATATG 
GAAGGGG6AA 
TTCAGCATCA 
ATTAAGCAGO 
ATACTCCAAA 
CTGTGTTrGG 
OGATCCTGTC 
GAASTTTTGG 
CAGAGGATTO 
CAATOTCATT 
GCTGATCCQA 
ACTGCA6TTG 
TAOCATACTC 



41 

I 

CCCCAGOGTG 
GC CATGA GTT 
TTT6GGTGGC 
GTACAGGnrSA 
TTT6AAATCT 
TQCGTAATTC 
AGCAATATCC 
TTTATGTATT 
ATCTTATCCA 
CTTCTTTCTQ 
AATGTGACTG 
ATCATAAGCA 
GTGCATAACA 
0GAAGT6AAA 
CAQCTTTTTC 
ACCTTCAAGG 
AAAATTTTCA 
ACAAQA6GCA 
TCCCAACACA 
CTGATCACTC 
GTCCTGCTAT 
ATGAGTATOC 
AACTTCTATC 
TTOCTCTATT 



51 




1 

CTGTGGCCTC 


60 


TCCTGATOOA 


120 


TTTTCTTCAT 


180 


TcrxcrcoGT 


240 


TAG6AGTATT 


300 


TGCTGATCCT 


360 


GACTACTGCA 


420 


TCT7CTGGAA 


480 


TAGAACAGCr 


540 


GATTTGGTXSC 


600 


ACACAGATAT 


660 


AAAAGAAAAO 


720 


AACCATCAGG 


780 


ATCTTACTCT 


840 


TGQAAACAGC 


900 


GGAAATATTT 


960 


TGGCTACCAT 


1020 


TTQAGATCAC 


1060 


TTTCCTTCAT 


1140 


TTACCAAGTT 


1200 


TAGCACAGAT 


1260 


CTTTAGAATA 


1320 


ACGGTTGOTT 


1380 


T6GCTGACM 


1440 
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ACAGGCACXyv 
CAOTGOTTTC 
ATAAACAAAC 
AGGTGATACT 
ACAATACTCA 
CCAAGAAACT 
TOAOCCAAAC 
TTGAGATTGA 



GAOAAGCAAA 
AAAATTTAGA 
AAAATGCTAT 
ATGACCATGA 
GCAC3AGAGCA 
AAAGGTGAAA 
ACQTAGGATT 
CTCATTAAAA 



TGGCACCTTO 
TATAAGA6GO 
GGTAGCATTT 
GTAOCATCAG 
TCCC3GTGT G G 
AATACACTGG 
TCCGTTTTAA 
TCAGAGACTS 



AACITAAGCC 
GOGAAAAATQ 
TTCACCTTCA 
GCA6AACATG 
ATA1GA06CT 
AACTCTGGGG 
GGTTCACATQ 
T 



TACTACAGAC 
OAACCAGGGC 
TAGCATACTC 
AGA6GGAGAA 
GGTGTA6AGG 
CAA6ACATGT 
OAAAAGOTTA 



TGTTAGAGGC 1500 

CTGACATTTT 1560 

CTTCCCCCTC 1620 

CTAACTCAAG 1660 

OGOAGAGGAG 1740 

CTAT6GTAGC 1800 

TAGCTTTGCC 1860 



AlOO Protein sequence 
Gene namet 
Unigene number t 
Protein Accessloii 8t 
Signal sequence i 
Transmenibrane domains t 
Cellular Local izat ion t 



putative G>protein coupled receptor 
Hs. 16085 
IIP_057418.1 
none £ound 

5-27, 42-64, 76-98, 110-132, 144-166, 289-311, 
plasma membrane 



342-364, 380-402, 424-446 



1 
I 

MSFLIDSSIM 
EIIiGVLMSSS 
MYFPWKLGDP 
VTDTDILALB 
SENLTLIQQB 
XFMATINIVF 
ZTLTKFPyAZ 
FYBRMFDVIF 



11 
I 

ITSQILPF6F 
RYPHWKMNliC 
FPILSPKHGI 
RRLLQinDMI 
VDALBBIiSRQ 
DRVQKTDPVT 
S8SK8SNVXV 
LVSALSSILF 



21 
I 

6WLFFMRQLF 
VILLTIiVFMV 
LSIEQLISRV 
Z6KKKRMAMA 
LPJUETADLYA 
ROZBXTVNYL 
tLLAQZMSHY 
LYIiAHKQAPB 



31 
I 

KDYBIRQYW 
PPYIGYPIVS 
GVIGVTLMAL 
RRTMFQKGBV 
TKERIEYSKT 
GIQFDVKFWS 
FVSSVIiLZRM 
XQMAP 



41 

I 

QVIPSVTFAP 
NIRLLBKQRL 
LSGFGAVNCP 
KNKPSOFWGM 
PKGKYPNFLG 
QHISFIIjVGX 
SMPLBYRTZZ 



51 
I 

SCTMFBLZIP 60 

LF8CLLHLTF 120 

yTYNSYPLRN 180 

XKSVTTSASa 240 

YFFSIYCVWK 300 

ZIVTSIRGLL 360 

TEVUGELQPN 420 



AlOl DMA 8EQUBWCE 

Gene name: ESTs, Weakly similar to 0GHU7L collagen alpha l(III) chain precursor [H. sapiens] 
unigene number i Hs. 19322 

Probeset Acceesion #: AA088458 
Nucleic Acid Accession #: AA088458 

Coding sequence > . 862-1995 (underlined sequences correspond to start and stop codons) 



TGGGGGATCC 
GGTGACTTCA 
GA6ACAGGCT 
AAAQAAATAO 
CA06A66GGA 
OCAGACCCTG 
GAGCAGOGTC 
GCGTGOUIAC 
CAGAAOTGTC 



11 

I 

ACTGACATGO 
AGGAGCTGQA 
GC6ACTG6TA 
GCA6AGCCA0 
CCAA6GTACA 
QGGCCCTGCC 
CGGTCTGGCA 
AGGTGACOQA 
AGCAGCTGTT 
TCATCTAGTC 
G6GTGGGCGC 
AGTCCC6QA0 
GGCTGGTOOC 
GGGGCTCTCT 
GGTACCOCTC 
TCAGGGAAAO 
GGGCGGCTCC 
TGGCTGTAGC 
G6T6CCTGCC 
CTGAGGGACC 
TGOCTCCCAC 
TTGAOTCCCA 
AGGCA06TCA 
GCTCACCCCC 
CTCOCTCAGC 
CTTGQOTQTC 
CTGGTGGGG6 
CAQ3GCAACA 
TGGG6C6CAG 
GGTCAGTGCX3 
ACAAGGCACC 
CCCAGCTCCA 
GGTCTGCAGG 
CCTGAAGATG 
CAG6AAAGG0 
CAGOGCAGTG 
TOCCCACCAQ 
TGGCATCTTT 
TCAGGAGACC 
GGCACCTCCG 
GTCCTCCCAG 
GAATTTAAAQ 
CCTGGAGCCT 
CCTGGOCTCT 
TGTQAT6ACA 
CCCAGTTaAa 
ATCAA6TTCC 



21 
1 

ACTGAAGGAG 
GCAGGAGAAG 
CCAGCAGCAG 
CGCCGACTTT 
AGAGGTGGCC 

GCAGCAGACC 
GAAGAGTGAG 
TGAGGCCOGC 
CTTGTGGGCC 
CCCATCGCAC 
TGGGOGCCTT 
CGCACCOAGC 
TGA6TCCGCA 
CATGA GTTAG 
GCACTGCCCA 
GAOG06GGTC 
TCGGACGGAC 
TGGCTGGGGA 
CrGGCTGCAG 
AGACCCTGGG 
CACAACATCC 
TAGGCAAAGC 
CTTTQCTGTC 
CAA6GAAAAC 
ACTCCCrCAG 
GGCCCTGCTC 
GCATCGATG6 
GGCCTCCOAT 
TGGGGGGOGC 
TGTCTCAGAG 
TGCTAACCT6 
GGTGTCCCAG 
G6AGTGG6CT 
TGCAGGTOCT 
66TGGGCCAG 
GGCCTCCCCA 
ACTGQACTGa 
GCCCACAXAO 
OAAAAACTQC 
TTTACAQCTT 
OCCCOGGCTG 
GCCCTAGGAC 
ATCOQCQAGG 
CCC O O AA A T Q 
AATCTGGCCC 
AAOaAAAAGG 



31 
I 

TAGAATGGAG 
6AG6TGCTGC 
CTQCAAC6AG 
GGG6CTQCA0 
GGGTGCCTGG 
TCCGGGCCCC 
ATCCTCATGC 
08CATCA00C 
GCCCTGAGGC 
OOGTGGQCCC 
CCACCCTCrc 
CCTGCOGCCC 
GCTTGACTCC 
TAOTCCGCftG 

easTccccccu 

OGGCAGGCTG 
CAA6GGCAGC 
GGAAGTAGAT 
GCCCCAOGQA 
OSQATCGGCA 
GTGATGOCCT 
T6TGAGCCTG 
CTGTTTCCCC 
AGGCCCA6CC 
GAGAACGCCC 
CCCCTGCCCA 
AGCCCAACCT 
GTTCTGCAGC 
0CGCS6GTCAG 
AGGGCCC C CT 
6AG66GCCCT 
CCCACAGCAA 
GACAGGCCCA 
ITOCnOOGQA 
GAGG6CCTGT 
TGGCAGCCAG 
OGTCTGCCTT 
AAGCAGGAGA 
A0CTG6ACCC 
CTTTCAGCCT 
QAAATCAGGC 
GCAGGGTCTA 
GCT8G60QG0 
TGGCA6TACC 
TCTCAOQATG 
AGAOGAACAC 
AACATCTCAG 



41 

I 

CACGAGGACA 
TGCAGGGTTT 
T6CAGQAGCG 
OQAQCCCCCO 
GGGAOCTGCT 
CCTOCCCTOC 
TGAAGGAGCA 
AGCTGQAGCA 
AGCAGOAOGG 
CCAGGGCCAG 
TGGCTGGAGA 
TTGCCAGATG 
GTTTKGGCTC 
CTACIACTGG 
TTT0CAGC60 
CACTTCCAAC 
TTCCCGCTCA 
GGAGGGGGTG 
TAG0Q6TGQ0 
00CGG6GT6O 
TCCCCCTCTT 
GCTCOCCAGG 
OQACTGAGGA 
TGTCCOCA(M 
AGGGTACA6G 
GGCCCACTCC 
GGAGGGTCCC 
CCAGGGCCCC 
TGC8TG6G66 
COTGTCCAOG 
GGCAGGCA6C 
CCCCACA6AG 
AGTCAGCCCA 
CATAAGQAIG 
GCCCCACAGC 
GGAGAAGCCC 
TX3AGGGT6CC 
CAGAACA6TG 
CGCAGCTGAA 
T GG TO T l' Om 
TAOTQAQTGG 
GGTGGCTGGC 
TCAOTCTCCG 
GTGTGCAGGT 
TTGAAATGTG 
ACCCACACCA 
C0 GGG 0STG6 



51 
I 

CT6ACATGGA 
GGAQATGATG 
CCAGC6C06C 
CCCACTGGGG 
GGCTGCA6CC 
CCTGACGTCC 
GAACOQACTC 
GGA6AA6T0G 
OOSAOCTCTO 
CCTG6CACTC 
CCCGC6GCAG 
GGCTCCCCAG 
CTQQTTGYTQ 
COGCTGTCAG 
TQ006CCCFG 
AAGGGGCAGC 
ACCAGGGCAC 
GGGA06GCCT 
ACTTCAGGTT 
GC8AQAGCTT 
G600G6GA0G 
AGGGCCCCCA 
TTTCCAAGGC 
TTTGAOCTQG 
AG6A6GCTG6 
OGCTGQTGCT 
AGTGTCACCA 
CQATG0GGG6 
GC6CAGG6CC 
GCACTTTGGT 
GTGGCAACTC 
CCACATTCCC 
QCATGCAOCT 
TCAGGCCTGG 
CGCAGCMXC 
CCOGTCAGCA 
TGCCATQCCC 
TCTGTCCCGG 
GCGGAAATGT 
T6CAAG0TGA 
CCCrGGAQAC 
AGAOGCACAT 
TGCAGGATGT 
ACATACAOQT 
TCCTTGOGGG 
GOOCTCAGGA 
TOGTTCACGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



879 



wo 03/042661 



PCTAJS02/36810 



CTGGAATCCC AGCACTTGAO GCCAOGAOTT CCAGAGCAGC CXGGGOVAOO CAGTGAGAfifi 2880 

CCCOVTCTCT ACAARAAAAIl AAAAAGAAAG AAAGAAAATG AGAGHTOCAG GTTTAAAAAT 2940 

TCATAAACAC CACAAGQAAA CAATACACTA TQAGACCCAO CAOAAGCAAC AGATTGACTC 3000 

TAGACCCAOA TACTAQAATT ATCAGAGAGA ATATAAAGTA ACAGT6TTTT ATATATCTAA 3060 
AGAAATAAAA GAQATTTCTG GAAACATGAA AAAAAA 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



A102 DMA SEQUBfCB 
Gene name: 
Uhigene number: 
Probeset Accession fi* 
Nliclslc Acid Accession 9i 
Coding sequence! 



TMPRSSBa 
Hs. 298241 
AIS38613 
AB03B157 

202-1566 (underlined sequences correspond to start and .stop codons) 



1 
I 

ACCGG6CACC 
6GAAAG0GCT 
CCATCTACAT 
AOAGGTGCTa 
TCATTCCX3AT 
QATGCTGTTG 
ATTGG6ATCA 
TCA6G0AAGT 
GGAGTCT0G6 
AATGC08TGC 
AAGGGTCACT 
GATAACCrCA 
CACCTCTTGC 
TQT6CCTCTG 
AGCTCAOOCA 
CTTCAGTTCC 
ACTGCTGCAC 
CTAGTTTCCC 
AOCAASTACA 
CTCACGTTCA 
GATGGAAAAO 
TCCCCTOTCC 
GAOGTGTAOS 
GTGQACAGCT 
AAGTTAGTGG 
TACACCOGTG 
ACCTQAAGAG 
TCCTCCCCTG 
TTCOGGCAOC 
CSIAGCTGCTT 
AQTQCAGTQG 
TTGCCTCAGC 
TTQTATTTTT 
OCTCAAAltZA 
AG8CCTAGCC 
AG0Q06GCCT 
ACGAGATAAG 
GGACCAOCGC 
CCAAAOCCAC 
ATOACTOGTT 
TT G TCC mU 
AAAAAAAA 



11 

I 

GGACGGCTCG 
GTOTTTATGG 
TTTTGGQACT 
AAATAOTOIC 
CGCITTTTGG 
CTGCACAGAT 
TTGCATTGAT 
AGAGATGTCX3 
ATTOCAAAGA 
TCCAGGT6TT 
AOGCAAATGT 
GAGTGAGCTC 
CAGATGACAA 
GCCAOGTQOT 
T0GTGG6TGQ 
AGGGCTACCA 
ACTGTGTTTA 
TGTTGGACAA 
AGCCAAAGAG 
ATQAAATQAT 
TGTGCTGGAC 
TGAACCACX3C 
GTG6CATCAT 
GGCAG6GGGA 
GAOOQACCAG 
TCACCTCCTT 
GAAGGGGACA 
6ACTCCCX3TG 
AGTAQCAGGC 

OGAAATCCCT 
TTCCCCAQTA 
AGTAGA6ACA 
TGTQCCTGCT 
TCA06CTCCT 
TTCCCACTGG 
CAGTTATGTQ 
AGAAlOTGCAG 
GCTTTCtACT 
TAAGGCCTAT 
ATTOCSUUITA 



31 
I 

GGTACTTTOG 
GAAGCCAGTA 
C6GGAAT7AT 
CATGG GGGAA 
CCTTGATQAT 
CCTGTCACTQ 
ATTAGCACTG 
CTCATCCTTT 
OSGGGAOGAC 
CACAGCTGCT 
TGCCTGTGCC 
GCTGGAGGGG 
GGTGACTGCA 
TACCTTGCAG 
AAACAT6TCC 
CCTGTGCX3G6 
TGACTTGTAC 
TCCAGGCXXA 
GCTGGGCAAT 
CCAGCCTGTG 
GTCAGGATGG 
GGCX3GTCCCT 
CTCCCOCTCC 
CAGOGGOOOQ 
CTTTGGCATC 
CCTGGACTGG 
AGTAGCCACC 
TAGQAACCTG 
C0QAAAGA06 
6TTTTTTT6A 
GCTCACTGCA 
GCTGGGACCA 
GGGTTTCACC 
TCAGCCTOOC 
TTCTQATCTT 
TCCATCTGGT 
ACCTCAOGTQ 
AACTGCSUm: 
TCCAAOACTT 
TTTCAIQATT 
ATATGTTTOC 



31 
I 

TTCTTAATTA 
ACACTGTG6C 
GAGOTAGAGO 
AATCATCOGC 
TTOAAAATAA 
CTGCCATTGA 
OCCATTGGTC 
AAGTGTATCG 
GAGTACG8CT 
T08TGGAAGA 
CAACTGGGTT 
CAOTTCCGGO 
TTACACCACT 
TQCAGAGCCT 
TTQCTCTOSC 
GOCTCTGTCA 
CTCCCCAAGT 
TCCCACTTGG 
GACATC30C0C 
TGCCTGCCCA 
GGGGCCACAG 
TTGATTTCCA 
ATGCTCTG06 
CCCCTGOTGT 
60CTGCGCAG 
ATCCACXSAGC 
TQAQTTCCTG 
CACACQAGCA 
CACCCTTCCA 
GOTGGAGTCT 
6CCTCCGCTT 
OVGGTGCCCG 
AT6TTGGCCA 
ACAOTGCIGG 
CACTAAGAAC 
TTTCTCTCCA 
CAAAGCCACC 
ACTGCACX5TT 
ATTTTCACAT 
TCTTTGTAGC 
TTCCCTC A AA 




GATTACAGGC 
AAAAGAAGCA 
GGGGTCTTGC 
AACAGCCACT 
TTCATCTCTA 
GTGGGGAGGT 
ATTTGGTGCT 
AAAAAAAAAA 



GCGGGATGTC 
AGCCOCCTTC 
ACCAQATGCA 
AATCATCQTC 
CTTOGACTGC 
T06A7GTGAC 
GGGTGGTCAG 
OGATGACTGG 
TGTGAGTTCA 
GTCCATOGAT 
GAGGGA0G6A 

I AAGGGGCTAC 
GCAGGCCAGC 
GTGGATCATC 
CCAGGTGGGT 
TGTCTACCAC 

.GGCOOGGCCA 
OAACTTCCCC 
AGGTGACIGCC 
CAACCACAGG 
GAOGGGTGGC 
GAG6CTGTGG 



AGACCTAAAA 
GACAGCCCGA 
GAGCTCTGAG 
GCACAACCTT 
CXXAGGCTGG 
AOCGATTCTC 
CAACTAATTT 
AAACCCCTGA 
ATG66CCACC 
GCAACTTGCA 
AAAATTCCTG 
CAGAAAAGAC 
GGGACCAGAA 
TAATCTAGGA 
TGA06TATTA 
AAAAAAAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



A1D3 Protein sequence i 
Gene name: 
Unigene nuirber: 
Probeset Accession #i 
Protein Accession 9t 
Signal sequence: 
Transmembrane domains: 
Tryp_SPc domain: 
Cellular I«ocalization: 



TMPRSS3a 

H8. 298241 

AI538613 

BAB20077 

none £ound 

43-65, 239-261 

216-444 

plasma aentbrane 



MGENDPPAVB 
XAIAI0L8XH 
TAASHKTMCS 
VTALHHSVYV 
LOGGSVITPL 
XiGMDIALMKL 
AVPLISMXIC 
POXGCAEVHK 



11 

I 

APP6FRSLFG 
FtXISGKYRCR 
DDWRGBYANV 
RE6CASGHW 
WIITAAHCVY 
A6PL7FNEMI 
NHRDVyGOII 
PGWYTRWTSP 



21 
1 

U)DUCISFVA 
SSFKCIELIA 
ACAQLGPPSY 
TLQCTA06UR 
DliYLPKSWTI 
QPVCLPNSBB 
SPSMLCAOn. 
LOHIHBQMER 



31 

1 

POADAVAAQI 
ROXSVSDCKD 
VS9DNLKVSS 
RGYSSRIVGG 
QVGLVSLLDN 
NPPDGKVCWT 
TG6VDSCQGD 

Ducr 



41 



51 



LSLLPLKFPP ZIVIGIIALI 60 

GEDEYRCVRV GGQNAVLQVF 120 

LEGQFRBEPV SIDHLLPDDX 180 

NMSLLSQWPW QASLQFQGYH 240 

PAPSHLVEKI VYHSKYRPKR 300 

SGHGATEDGA GDASPVLNHA 360 

SG6PLVC3QER KLHKLVGAT8 430 



880 



wo 03/042661 



PCT/US02/36810 



A104 may SBQUEKCB 

Gene namei Homo sapiens 6 protein-coupled receptor (B0T7T175) 
Unigene nuniberi Hs. 2 0832 9 

Probeset Accession fli AI819198 
Nucleic Acid Accession ii nm 032551.1 

coding sequencei 1-1197 (underlined sequences correspond to start and stop codons) 

] " 21 31 41 51 

I i I I 1 I 

ATGCACACCG TCGCTACGTC CGOACCCRAC 0C3BTCCT0GG GCSGCACCG6C CAAOGCCTCC 60 

GGCTGCCCOQ GCTGTGGCGC CAAOGCCTCG C5ACGGCCCAG TCCCrTOGCC GCX3GGCCGTQ 120 

°AOGCCTGGC TCGTGCCGCT CTTCTTCGCQ GCGCTGATGC TGCTOGGCCT GGTGGGGAAC 180 

TOGCTGGTCA TCTACGTCAT CTGCCGCCAC AAGCOGATGC C3GACCGTGAC CAACTTCTAC 240 

ATCXJCCAACC TGGCGGCCAC GGAOOTQACC TTOCTCCTOT GCTOCQTCCC CTTCACGGCC 300 

CTOCTOTACX: CX3CTGCCCGG CTGOOTGCTG G60SACTTCA TGTGCAAGTT CGTCAACTAC 360 

ATCX3^GCAGG TCTCG6T0CA 0QCC3VCGTGT GCCACTCTGA CCGCCATGAG TCTGGACCGC 420 

TGQTAOOTGA OGGTQTTCCC GTTGCQCGCC CTGCACCGCC GCACGCCCCO CCTGGCGCTG 480 

GCTGTCftGCC TCAOCATCTO GOTAGGCTCT GCGGOGOTGT CTOCOCOGGT GCTCGCCCTG 540 

CACCX3CCTGT CACCCGGGCC OOQCQCCTAC TGCA0TGAG3 CCTTCCCCAG CCG06CCCTG 600 

GAGCGOGCCT TCGCACTQTA CAACCTGCTO GCGCTOTACC TGCTGCOGCT GCTCOCCACC 660 

TGCGCCTGCT ATGCQGCCAT 0CTGCX3CCAC CTGGGCOGGQ TCOCOGTGCXl CCCCGCGCCC 720 

GCCX3ATAQCQ CCCTGCAGGQ GCAGGTGCTQ QCAQAGCGCG CAGGCXKXGT GCGGGCXaUVG 780 

OTCTCGCGGC T6GTG0CG0C CGTGGTCCTO CTCTTCGCC6 CCTGCTGGGG CXXTCATCCAG 840 

CTGTTCCTGO TGCTGCAGGC GCTGGGCCCC GOGOOCTCCT GGCACCCACO CRGCTAOGOC 900 

GCCTA0GC6C TTAAGACCTO GGCTCACTOC ATGTCCTACA GCAACTCC3GC GCTGAACCCX3 960 

CTGCTCTACQ CXTTTCCrGGG CTCGCACTTC CGACAGGOCT TCCX5CC000T CTGCCCCTGC 1020 

GOGCCGCGCC GCCCCCGCCG CCCCCQCCGQ CCCGGACCCT CGGACCCX»C AGCXXXACAC 1080 

GCXSOAGCTGC ACXXSCCTGGQ 6TCCCACCCG 6CCCC06CCA GGGCQCAGAA GCCAGGGAGC 1140 
AGTGGGCTGG CCGOOCXSCGG GCTOTGOGTC CTGGQOGaUSG ACAACGCCCC TCTCTQA 

A105 Protein sequence 

5!??^ ^ sapiens G protein-coupled receptor (HOT7T175) , niRNA 

Unigene nuiribers Hs. 20822 9 

Protein Accession #: AZ81919e 

Signal sequencei none found 

Pfam domains: 7tin 1 {59.333] 

Transmeirbrane domains I 43-6S, 86-108, 122-144, 159-181, 203-225, 260-282 

cellular Localizationi piasina membrane 



60 



1 11 21 31 41 51 

( I I i i I 

MHTVATSGPM ASNGftPAMAS GCFGOaAHAS DGFVPSPRAV DAHLVPLPPA ALMLLGLVGN o„ 

8LVIYV1CRH KPMRTVTOPy lANIAATDVT PLLCCVPPTA lAYPLPGHVI. GDFMCKFVNY 120 

IQQVSVQATC ATLTAMSVDR WYVTVFPLRA LHRRTPRLAL AVSMIWVGS AAVSAPVLAL 180 

HRLSPGPRAY CSBAPPSRAL ERAPALVNLL ALYLLPLLAT CACYAAMLRH LGRVAVRPAP 240 

ADSALQGQVb AERAGAVRAK VSRLVAAWI. LPAACWGPIQ LPLVLQALGP AGSMHPRSYA 300 

AYALKTWAHC MSY8HSALNP LLYAPUGSHP FQAPRRVCPC APRRPRRPRR POPSPPAAPH 360 
AELBRIXSSHP APARAQKPGS SGLAARGLCV LOEDNAPL 

A106 DMA 8BQ0BNCE 

Gene name: integrin, beta 8 

Unigene number i Hs. 52620 

Probeset Accession dt AA479726 

Nucleic Acid Accession #t mm 002214 

coding sequence. 680-2990 (underlined sequences correspond to start and atop codons) 

1 II 21 31 41 51 

1 I I I I 



TCTGCCTGCA AAAC0ACO5G CGAGGTCCC6 CCTCGTTCCT CTGGGCAGCC 



AGCCAGGAOG 


60 


AACTAATCSGT 


120 


CAGACTTTTT 


180 


GAATGTACAT 


240 


TGGCTCTTGG 


300 


CTCTTTTCTT 


360 


GCCGGGCCCT 


420 


GAGCCGGOAG 


480 


ACC6CTTGCT 


540 


CrCQCCCGOG 


600 


CGGCGGGOGC 


660 


QCTGCATTTG 


720 




780 


AATGCAGCAT 


840 


GAGGATTTCA 


900 


AGCAAAGGCT 


960 


GAAAATQAAA 


1020 


GCCGAAGCTA 


108O 


TATCTTGTTG 


1140 


AAOGATTTAT 


1200 


TACGTTGATA 


1260 


TGCAGTGACT 


1320 


ACAQAGAACA 


1380 


GATACACCAG 


1440 



881 



wo 03/042661 



PCT/US02/36810 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



AAGGAGGTTT 
AAGAGOCTAA 
QCAAATTGGC 
ACOTCAAATC 
ACAACAACAT 
TTCTACOOCT 
ATAATTTOCST 
ACCAGQTACA 
CAGGCArOGA 
TTACAA TGAA 
GTTTTAATGA 
AOUSAGQACX: 
GTGATQAGAA 
ACAAGGATCA 
ACAAAATTAA 
CATATCAOCA 
GCTTCAGTG6 
TCAATTCAAA 
GCACCGATCC 
OCAAGQAAAA 
TTGATCAGTG 
CAGAATGTTT 
TCTTGATTGQ 
ATAAAATTAA 
TGCAAAGTGT 
TGGAXATCAG 
TTAAACACTT 
AAGTCACAGG 
GAAGACTQAC 
AAAATGTGTC 
ACTTATTTAO 
TAOCTGTTAT 
CACTACAAGG 
TATATTCTAA 
AIGAATAAAT 
AAAGATTATT 
TTT6CAAGAT 
TTTTTACAGG 
TACTGCCATA 
GAATOTTAA 



TGACGCCATe 
AAGATTGCTQ 
AGGCATAGTG 
GACAACCATG 
TAATGTCATC 
CTTGCCAGGC 
AGTGGAAGCC 
AGGCATCTAT 
AGQATGCAGA 
AAAATGTGAT 
AACC6CTAAA 
TAAAG6AAA0 
TAAATGTCAT 
GCCTGTTTGC 
6CTTGGAAAA 
TGGAAATCTO 
CTGGQAAGGT 
GGGCCAAOTG 
CAGQAGCATC 
CT6GAATTGT 
CAAAACCTCA 
CTCCAGCCCA 
GTTGCTTAAA 
GTCCTCATCA 
TTGCACAAGA 
GAAATTAAAT 
AATGGGAAAC 
AGGAGACAAA 
AAGTATCCTC 
TTACTACTGT 
ATCAGCATAO 
OCCTAOGCTT 
QTACAGTAAT 
GOTTGCCAAA 
QATTCGTGTT 
QCTTTTTAAA 
GGATACTAAT 
ATAAQTTTAT 
AAAAACTAAT 



CTTCAG6CA0 
CTGGTGATGA 
GTGCCCAATG 
GAACACCCCT 
TTTQCAGTTC 
ACCATTGCTO 
TATCAGAAGC 
TTTAACATTA 
AA0GTGACX3A 
GTCACAGGAO 
ATTCAT ATAC 
TQTGTAQATO 
TTTGATGAAG 
AOTGGTCGAG 
GTGTATGGAA 
T8TGCTGG6C 
GATOQATCCC 
TGCAGTGGAA 
GQCOGCTTCT 
ATGCAATGCC 
TQTGCrCTCA 
A6CTACTTGA 
OTCCTGATCA 
GATTACAGAO 
GCAGTOlCCT 
GCTCATGAAA 
TGGAATTGTT 
TTGCTCA06G 
ATCAT6ATGT 
TTGAGACTAO 
AATGTAGATC 
CCCAQAGAQA 
CCCTGCACTG 
CACTTCAACA 
TCACTCTTTC 
OXOTGTAOTT 
TCCAGCATTC 
QTATGTCACA 
AATACAATQT 



CrgTC T OTOA 
CAGATCAGAC 
AOGQAAACTG 
CACTAGGCCA 
AAGGAAAACA 
GXGAAATA6A 
TCATTTCAGA 
C06CCATCTG 
GCAATGATGA 
QAAAAAACTA 
ACAGAAACT6 
AAACTTTTCT 
ATCAGTTTTC 
GAGTTTGTGT 
AATACT6TGA 
ATGGAGA6TG 
AOTOCOCTTC 
GAGGCAOGTG 
OTGAACACTG 
TTCACCCTCA 
TGGAACAACA 
GAATATTTTT 
TTAQACAG6T 
TGTCAOCCTC 
ACOGAGGTGA 
CTTTCM39T3 
AATAATT6CT 
TCATOCCAGT 
QACTCACATA 
TGTCGTTGTA 
CTCTGAAGAG 
ACAAT6CTGT 
GACATGT6AG 
GTTGGTGGTT 
AAiGAGGTGAA 
TTATGCATGT 
TCTCCTCTTT 
OATGACTGOA 
CACTTTATCA 



AAOTGATATC 
GTCTCATCTC 
TCATCTQAAA 
ACTTTCAGAG 
ATTTCATT6G 
ATCAAA6GCT 
AGTGAAAGTT 
TCCAGATGOT 
AGTTCTTTTC 
T6CAATAATC 
CA6CTQTCA0 
A6ATTCCAAG 
TTCTQAGAGT 
TTGTGGGAAA 
AAAQGATGAC 
T6AA6CA0GC 
AOCAOCAGCC 
TGTGTGTGGA 
CCCXavCCTQT 
CAATTTGTCT 
GCATTATGTC 
CATCATTTTC 
GATACTACAA 
AAAAAAGGAT 
GAAGCCTGAA 
CAACTTCTAA 
CCTAAAOATT 
TGCTGGTTOT 
QCTGCTGACT 
GCACTTTACT 
CACTGATTAC 
GAGAOAGTTT 
GAAAAAAATA 
GAATAGACAA 
CA GATA CAAC 
GT6TTTATG0 
GCX.T1TATGT 
TTAATTAAGT 
GAATACTAQT 



O QATGG OGAA 
CCTCTTGATA 
AACAAOrrCT 
AAATTAATAG 
TATAAGGATC 
6CAAACCTCA 
CAGcnt36AAA 
TCCAGAAAGC 
AATQTAACAO 
AAAGCTATTO 
TGTOAGGACA 
TGTTTOOVCST 
TGCAAGTCAC 
TGTTCATGTC 
TTTTCTTGTC 
AGATO CCAAT 
CAOCACTGTG 
AGGTQTGAGT 
TATACAGCCT 
CAQQCTATAG 
6A0CAAACTT 
ATAOTTACAT 
TGGAATAGTA 
AAGTTGATTC 
GAAATAAAAA 
AAAAAGATTT 
ATAATTTTAA 
ACACTCX5AAC 
TTTTCAGAGA 
GTAATATATA 
ACTTTACAOG 
AGCATTGTGT 
ATCTGGCAAG 
GAACAGCTAG 
CTTAATCTTA 
TTTGCTTATT 
TTTGTTTTCT 
GCTAAGTTAC 
TTTAAAAGCT 



A107 Protein sequence ; 
Gene naroei 
unigene nunber: 
Probeset Accession #t 
Protein Accession Ut 
Signal sequence t 
Transmeiribrane domains t 
EGF domain: 



Cellular Localization: 



Integrin, beta 8 

He. 52620 

AA479726 

NP 003205.1 

1-39 

682-704 

552*564 

54-469 

plasma menbrane 



1 
I 

M06SALAFFT 
LGFE06HCVQ 
GEVSIQLRPO 
SRDFRLGFGS 
VHRQRI8GNI 
VPNDGNCHLK 
TIAGEIESKA 
NVTSNDBVI,? 
CVDETFIiDSK 
VyGRYCEKDD 
CSGRGTCVOQ 
CAU1BQQHYV 
DYRVSASKKD 



11 
I 

AAFVCLQNDR 
EDPI8GGSR6 
AEANFMLKVB 
yVDKXVSPYI 
DTPBGGFDAM 
NNVYVKSTTM 
AHLNNLWBA 
NVTVTNKKCD 
CFQCTENKCH 
FSCPYHHGNL 
RCECTDPR8I 
DQTSECFSSP 
KLIIQSVCTR 



21 
I 

RGPA8PLWAA 
BRCSIVSNLI 
PXiKKYPVDLY 
SIBPBdUNQ 
LQAAVCB8RZ 
EHPSLGQL8B 
YQKLI8BVKV 
VTOGKNYAII 
FD6DQP8SB8 
CAGRGEGBAO 
GRFCEHCPTC 
Sn*RIFFIIF 
AVTYRREKPB 



31 
I 

WVF8LVLGLG 
SKGCSVDSIE 
YIiVDVSASMH 
CSDYNLDCMP 
GHRKBARRLL 
KLIDNNIKVI 
QVENQVQGIY 
KPIGFNBTAK 
CKSHKDQPVC 
ROQCFS6WEG 
YTACKHJWNC 
IVTPLIGLLK 
BIKMDI8XUV 



41 

I 

QGEDKRCASS 
YP9VHVIIPT 
NNIEKINSVG 
PHOYIBVLSL 
LVMTDQT8HL 
FAVQGXQFHW 
FNITAICPDG 
IBIHRNCSCQ 
SGRGVCVC36K 
DROQCPSAAA 
MQCLHPHNLS 
VLIIRQVILQ 
ABETFRCNF 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2620 
2680 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



51 
I 

NAASCARCLA 
EHEZNTQVTP 
NDLSRXMAPF 
TENITEFBKA 
ALOSKLAOIV 
YKDLLPLLPG 
8RKPGME0CR 
CEDNRGPKGK 
CSCHKIKIiGK 
QHCVKSKGQV 
QAXLDQCKTS 



60 
120 
160 
240 
300 
360 
420 
460 
540 
600 
660 
720 



AlOB DNA aequence 
Gene name: 
lAiigene nizniberi 
Probeset Accession fft 
Nucleic Acid Accession fft 
Coding sequence i 



ESTs 

B8. 128899 

AA983251 

AA983251 

1-1749 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I 1 I I 

ATOCTGTCrO GCTTCTTGAT QAGICCCAGT ACCCAOCACA 6A6CACAOTA CACTCCCGGA 
GQAAAOAAAC TTCCGTGGOA GGCTTC C ATC GGTGOGCACA CCTCCCGAOO GOQAOGCAGC 
GAOCQGOAGA GQGAOAGCOQ GCOOGAOGCT GCCGQGCrCC TQTGGGAGG6 GGCTGCAGCC 
GGGGAGGOOO AGAAGGGGAA C06GG6G6AG CCGCCC6CCT GGAT0C606C CCA6CA6CA0 

882 



60 
120 
180 
240 
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COGCGGCCGC CGCCAGCPGQ QCAGGCTCCC GGGACTGCGQ CTGGGGQCGC GCAGGACCCT 300 

CX3CCTQCGTC CTGGACGTTC COGGGQQAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 

TCOGGACQAC AGCCCCtSGGO QCCTTCTGAC TOCATCCCGA GATTTCCATC AOOaAGTGCA 420 

ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGOACGGGGA TGGCTTAGGA 480 

GCTCCTGGAC CTAGGGCCOQ GCGTC3GT0GC CTCCTGGGCG TOGCGGCAGA GGGQAGTGGC 540 

COGCQCGGAA AGOGCCGOGG GACAGTCAGT GAOQAGGCCC GGGGGT OS OC GGQGOCAteA 600 

CTTCTCGOAO ACCXSTCCTGC GCTCTCTGGA GACXK3GCTOT COOOSCCGAO OGTGGTGCCA 660 

TGTGGGGC3GC TOGCCGCTOG TCCGTCTCCT CATCCTGGAA C0CXX3CTTC0 CTCCTOCAGC 720 

TGCTGCTGGC TGOGCTGCTQ QCGGCGGGGQ OQAOGGCCCA GaJGaSAGTA CTGCCAOGGC 780 

TGGCTGGAOG CX3CAGGG00T CTGGC36CATC GOCTTCCAGT GTCCOQAGOG CTTOGACOGC 840 

GGOGAOCCCA CCATCTOCTG OGGCAGCTGC GO GTOO GCT ACTGCTGCTC CRGCXSCOGAG 900 

G06C36CCTGO ACCAGOGOGG CT60GACAAT QACXXSOCAGC AG0606CTGO OQAGGCTrGGC 960 

OOGGCGGACA AAGACGGGCC COGAOSGCTC G0CAG6GCTT CATGTCTTAG GGGTACCCAA 1020 

GGftGACGGCG AGGGTOCGCC CCCACCOSTG AOGOCCTGGC AGOGOTGCTC CCCTGAAGGC 1080 

TCCCCGAAAG QAAGGCA6CT CCTCAGG6CT TTCCC3GGG6C TGCTGCCCOG TGCCAOACGC 1140 

CGOGGATTCC CATC TTCTCC AGG0Q0CG6C OCCT C TOCCC TGCA6GGGCC GGCCTT6C0C 1200 

ATCTACGTGC OSTTCCTCAT TGTTGGCTCC U 'l V m' U 'l'OU CCTTTATCAT CTTOGOGTOC 1260 

CTGGTQGCAO CCrGTTGCTG CAGATGTCTC OGGCCTAAGC AGGATCCCCA GCAGAGCGGA 1320 

6CCCCAGGGQ GTAACXGCTT GATGGAQACC ATCOCCATGA TCCCCAQTGC CAGCACCTCC 1380 

OQGGGGTCOT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 

GGGG0C06GQ OGCCCCCAAC AAGGrTCACAG AGGAACIOTT GCTTGC08C3A AG6GACCATG 1500 

AACAAOGTGT ATGTCAACAT GCCCAOGAAT TTCTCTGTGC TGAACTGTCA OCAGGCCAOC 1560 

CAGATTGTGC CACATCAAGG GCAOTATCTO CATCCCCCAT ACX3TG G G G TA CACGOTGCAG 1620 

CACXSACrCTG TGCCX»TGAC AGCTGTGCCA CCTTTCATGG ACGGCCT6CA GCCTGQCTAC 1680 

AGOCAQATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AOAAGATGTA CCCAGC3GGT0 1740 

ACTQT ATAAC OSAG AGTCAC TG6TGGQTTC CTTTACTX3AA GGGAGAOBAA GOCAGGGOTO 1800 

GATTCTCGAO OTOaAAGTCC GCACATGTOG GTQGTATTTA TGGCAOGATT CCTTTGGATG 1860 

GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCOGAAT TAGCATTTCT GGATATOTTT 1920 

CATCCAGQGT ATCATTQATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 

TTGCTGATGO GTGTATAACA AAT6CTTGAG TCOGAAGTGC CCTT6AGATA TGGTTGACX5A 2040 

AAQAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATQ TTGTTATTAT TATTTCTTTr 2100 

TT6ITGTTGA CT8CACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGO 6ACTTTTTTT 2160 

TTTTprrTTT TTTTTTTTAA TCAGAC3VGGG TCTTGCTCTQ TTGCCCAGGC TGGAGTGCAG 2220 

TGQTGOGATC TOGGCTCACT GCAACTTCAQ CCTCCTGQAT TCAOQCAACA CTCCTGCCTC 2280 

AGCC TCCCAC GTQGCTG6GA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TT6TATTTTT 2340 

TGTAG AGATG GGGTTTCACC ATGTTG6CTG QGCIGGTCTC ACT C TCCTGA CCTCAA6CAA 2400 

TCTOCCTGTC TCAGCCTOCC AAAGTGCTGG GATTACAG6C GTGAGCCAOC 6CCCGCAGCC 2460 

TCAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA QTATTCCACA 2580 

GQCA CACCTT AATTTCATTG TAAAAAGATA TATATATTTT OTCTATTTTT GTGCTTTT6G 2640 

GQSCCTATTT TOTOCTTTTT TACCTTATGT AGAGATCTTA TTACAAAOTG ATTTTCTACA 2700 

TTAAAAA6A0 ACTOAAATAA ATTOTATAGT TACTTAACTA ATGAA6ACAT TTC3U3AACTC 2760 

TGGQATGATT TTAATCTTGA AQTAGTAGGrC 66TATAGTCA TAAAACCATT CATCCCCTTC 2820 

TTQArrGTAT CTTAATTTTC TOGCTTTAAG GTGACATCTG AQAGGTAATG CATTCTTTTT 2880 

TATAT TGAAA TCAT AAACTA TCACXXX3CTG CTTCTCTGAQ TTACTTITAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCGTTTC CTTCTOTTTG GTTTTCAGAO CCCCATGTCT ATATAOTCCT 3000 

GftGTG CAAOT AATTACTATA CTTGTAAATG AAGATCAQTA TTTCTGCXTTA GATCTGATAA 3060 

AAAAATTTTC TT6TCTTAGT TATAAAAATT CAAAGAAATO TOTTACAAAG ATACTTAQTA 3120 

TAGCTCCTCA 6CCATAACCT GAQACTTGGO ATGAAATTTA AACCAGATAC 6ATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTT6TT ATCAAAATQG CTTATTTTTC AGOCACTAAQ 3240 

0ATT6TTAA0 AGAAAAGCTT TTCAAOSAAG GATTGCCTTT CTTCTCCCAC A C TGTT C TT G 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTOTATTCa TTQCCAA«5T TCCAAATTAT 3360 

CAAATTCAAO TGAATTTATT TGTQTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATOGCT 3480 

GTATGQT ATG GTCTTCTACA CATTTATGTC TATAQATATC TATOGATCAT CTTTCTATTC 3540 

mWC ATGA CTGAATAATQ TAAAA0CM3T OTTGGCAATT OQTATCATCA ATOATACTCA 3600 

TTTTTTAATA AOCAAAGGCA GGGOAAAATC ATTTTACTTA TTAATAAATA TTTTATOATQ 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A109 Protein eequence 

Gene name: ESTe 

Uhlgene nundser: Hs. 126899 

Protein Accession #t none £ound 

Signal sequence t l-ll 

Transneinbraiw dooalnst 402-424 

Cellular Localizations not determined 



1 11 21 31 41 51 

I 1 I I I I 

MLSGFLMSPS TQHXU^QYTPG GKRLPHEASI GAHTSRQRGS DRBRESRPEA AGLLKDRAAA 60 

GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRFGRSRGR VHLPVKPPEA 120 

SGRQPRGPSD CIPRFPSASA THKAVPKOTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS 0BAR6SPGPR LLGDRPALSG DALSAPRWP OGALAARPSP HPGTPUtSCS 240 

CCNLRCHRSG RGPSGBYCHG WLDAQOVMRI GFQCPBRFDG GCATZCCGSC ALRYCXTSSAE 300 

ARLDQGGCDN DRQQGAGBPG RADKDGPRRL GRASCLRGTQ CD6BGAPPPV RAHQRC8PBQ 360 

8PKGRQLXJUV PPGLLPRARR RGFPSSPRGO PSPLQRPALP lYVPFLIVGS VFVAPIXZX3S 420 

LVAACCCRCL RPKQDPQOSR APGGNRLMET IPMIPSASTS R6S8SRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPB6W HKVYVUMPTO PSVIiNOQQAT QIVPHQGQYL KPPYVQVTVQ 540 
HDSVPMTAVP PPMD6LQP0Y RQIQSPPPHT NSBQWWPAV TV 

Alio raiA SEQUEWCBi 

0«ne namet ESTs, Neakly similar to JB0350 Anterior gradient-2 (H. sapiens) 

883 
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Ubigene nuiift>er: Hs. 100686 

Probeset Accession St AA48746B 

Nucleic Acid Accession lit AA487468 

CSoding sequence I 55-555 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I i I 

COaCACCAAQ AGCACTGGCC AAGTCAGCTT CTTCTQAOAG AGTCTCTAGA AGACATGATO 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA ASQAAAAGAO GCCTCCTCAO ACACTCTCAA GAGOATGOGG AGAT6ACATC 180 

ACTTGGGTAC AAACTTATQA AGAAGGTCTC TTTTATGCTC AAAAAAGTAA GAA6CCATTA 240 

ATGOTTATTC ATCACCTOGA GGATTGTCAA TACTCTCAAG CACTAAAOAA AGTATTTGCC 300 

CAAAATQAAG AAATACAAOA AATGGCTCAG AATAAGTTCA TCATGCTAAA OCTTATGCAT 360 

GAAACCACTG ATAAGAATTT ATCACCTQAT GGGCAATATG TGCCTAGAAT CATGrXTOTA 420 

GAGCCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAOAT ACTCTAACA6 ATTGTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAQCATT AAGACTTATT 540 

CAGTCAGAGC TATAAGAQAT OATAGAAAAA AGCCTTCACT TCAAAOAAOT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TQTOTTTGCA ATAOTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



Alll Protein aeciuencei 
Gene namet 
Uaigene number : 
Probeset Accession S: 
Protein Accession #t 
Signal sequence t 
Transmembrane domains: 
Cellular Xiocalisationt 



Weakly similar to JE03S0 Anterior gradient-2 [H. sapiens) 
Ha. 100686 
AA4e7468 
none found 
1-23 

none found 
secreted 



1 11 21 31 41 51 

I I i 1 I I 

MMLHSALCHiC IiLLVTVSSNL AXAIKKBKRP PQTLSRGHGD DITNVQTYEB GLFYAQKSXK 60 
PLMVIHHLED C3QY8QALKXV FAQNEBIQEM AQNXFZMLNL KKBTTDXNLS POGOYVFRIM 120 
FVDPSLTVRA DIAGRYSNRL yTYEPRDLPL LIBMMKKALR LIQSBL 

A112 DMA SEQOEWCE 

Gene name: Homo sapiens type II membrane serine protease mRNA 

unigene number: Hs. 63325 

Probeset Accession #> AA411502 
Nucleic Acid Accession NH_016425 

coding sequence: l'>1314 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

t i I 1 I i 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACA6CC TC3QAT0TCAA ACCCCT6CX3C 60 

AAAOCCOGTA TCOOCKTOWi GAOCTTCAGA AAOGTGQGGA TCOCCATCAT CATAGCACTA 120 

CT6AGCCTGG 0GA6TATCAT CATTGTGGTT GTCCTCATCA AGGTGATrCT GGATAAATAC 180 

TACTTCCTCT GOGGGCAGCC TCTCCACTTC ATCCCX3AGGA AGCAGCTGTO T6ACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGAOGAGGAG CACTGTGTCA AGAGCTTCCC OGAAGGGCCT 300 

GCAGTGGCAG TCGQCCTCTC CAAGGAOCGA TCCACACTGC AGGTGCIGGA CTOQGCCACA 360 

OGQAACTOGT TCTCTQCCIG TTTC6ACAAC TTGACAGAAO CTCTCJGCTOA OACAGCCTGT 420 

A06C3U3ATG6 GCTACAGCAG CAAACCCACT TTOVQACKTrG TGGAGATTGO GCCAGACXAO 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGOAGC TTCX3CATG0Q 6AACTCAAGT 540 

GGGCXXTTGTC TCTCAGGCTC CCTGGTCTCX: CTGCACTGTC TTGCCTGTGG GAAOAGCCTG 600 

AAGACGCCCC GTOTGGTGGG TGGGGAGGAG GCCTCTQTGO ATTCTTGGC^ TTGGCAGGTC 660 

AGCATGCAGT A06ACAAACA GCACGTCTQT GBAGGGAGGA TCCTGGACGC CCACTGGGTC 720 

CTCACX3GCA6 CCCACTGCTT CAGQAAACAT ACOGATGTGT TCAACTGGAA GGT6CX3GGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CATCATTGAA 840 

TTCAACCCCA TGTACCCX:aA AGACAATGAC ATCGCCCTCA TGAAOCTGCA GTTCXX3«rrC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA QCTCACTCCA 960 

GCCACCCCAC TCTGQATCAT TGGATGGGGC TTTAGSAAGC AOAATQaAGG GAAGA7GTCT 1020 

GACATACTGC T6CAGG0GTC AiOTCCAGGTC ATTOACAGCA GAGGGTGCAA TGCAGAOQAT 1080 

GOGTACCAGG GGGAAGTCAC CGAGAAGATO ATGTGTGCA6 QCATCCCQGA AGGGGGIGTG 1140 

GACACCTGCC AGGOTGACAG TGGTGGGCCC CTOATOTACC AATCTGACCA GTGGCATGTQ 1200 

GTGGGGATOG TTAGCTGGGG CTATGGCTGC GGGGGCC06A GCACCCCAGG AGTATACACC 1260 
AAGQTCrCAO CCTATCTCAA CTQGATCTAC AATOTCTG QA AGGCTGAGCT GTAA 



AH3 Protein sequence: 
Gene name: 
unigene number: 
Probeset Accession #: 
Protein Accession $: 
Signal sequence i 
Transmenibrane domains: 
LDLa domain t 
Tryp^SPc domain: 
Oellular Localizationi 



Homo sapiens type J I 

Hs. 63325 

AA411S02 

NP_057S09 

none fouxul 

31>53 

54-94 

204-429 

plasma membrane/ER 



serine protease mRNA 
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1 11 21 31 41 51 

I 1 I I I I 

MIflDPDSOQP XUSliOVKPLR KPRXPMBTFR KVOIPIIIAX, LSIA8IXIW VLIKVILDKY 60 

VFLOSQPLHF IPRKOLCaME LDCPLOTDBB HCVKSFPBOP AVAVRIiSKDR STLQVU5SAT 120 

QWPSACPDM FTBAliAETAC RQMGY8SKPT PRAVBIGPDQ DLDVVEITE2I SQBUIMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPKWGGBB ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCPRKH TDVFNWKVRA 0SDKL6SFPS LAVAKIIIIE PNPMYPKDND lALMKLQPPL 300 

TPSGTVRPIC bPFFDSBtiTP ATPLWIIOWO PTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGBVTEKM MCAGIPBGGV 0TCQGZ>SG6P LMYQSDQHHV VGIVSWQYGC GGPSTPGWT 420 
KVSAYLNWry NVWKABL 



A114 DNA SEOUBWCEi 

Gene namet ttk protein kinase 

Iftiigene nuirbert He. 169840 

Probeaet Accession Si M86699 

Nucleic Acid Accession #: NM_003318 

Coding sequence: 1026-3551 (underlined 



sequences correspond to start and stop eodons) 



1 11 21 31 41 51 

I I I i I I 

GOAATTCCTT TTTTTTTTTT TTTGAQATQG AQTTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAKTGGCACA ATCTCAGCTT ACTGCAACCT CX^GCCTCJCCXS GGTTCAAGCG ATTCTCCTGC 120 

CTCAGCCrCT CAAGTAGCTG GQATTACAGO CATQTGCXIAC CACC C CTG G C TAACTAATTT 180 

CTTTTCTATT TAOTAGAGAT GGGOTTTCAC CATGTTGGTC A6GCTGGTCT T6AACTCCT0 240 

AO CTCA GGTG ATCCACTTGC CTTGGCCTCC CAAAGTGCTA 66ATTACA0C OGTOAAACTG 300 

TOCCTGGCTQ ATTCTTTTTT TGTTGTTGGA TTTTTQAAAC AOQOTCTCCC TTGOTCGCCC 360 

AGGCTGGAGT GCAG TOGTQC GATCTTGGCT CACTATAACC TCCACXTTCCT GGTTTCAAGT 420 

OAT CCTCCCA CTTTAGCCTC CTGAGTAGCT GTOATTACAO QCGTQCACCA CCACACCCGG 480 

CTAATTTTTO TATTTTTATT AGAGACAGGG TTTCAOCATG TTG6CCAGGC TGTTCTCAAA 540 

CTCCTGGACT CAAGGGATCC GCCTGCCTCC ACTTCOCAAA GTOCGQAGAT TAOIGGTOTG 600 

^TCACCATG CCTQACXnTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAO 660 

GGTCCrCACA AC31AATCTGC ATTAGGCGGT ACAATAATCC TTAACTTCAT GATTCACAAA 720 

AGGAAGAT6A AGTGATTCAT GATTTAGAAA GGGGAAGTAG TAAGCCCACT GCACACTCCT 780 

GGATG ATGAT CCTAAATCCA GATACA6TAA AAATGGGOTA TGGQAAGGTA GAATACAAAA 840 

TTTGOTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TGQATACATT GTTOATGTGA 900 

ATG TAAOACT GTACAGACTT CCTAGAAAAC AQTTTGGQTT CGATCTTTTC ATTTCCCCAG 960 

TOCAOTTTTC TGTAGAAATO GAATCOGAGG ATTTAAGTGG CAGAOAATTG ACAATTGATT 1020 

CCATAATOAA CAAAGTGAQA GACATTAAAA ATAAQTTTAA AAATGAAGAC CTTACTOATG 1080 

AACTAAGCTT GAATAAAATT TCTGCTOATA CTACAOATAA GTGQGGAACT GTTAAOCAAA 1140 

TTATGATQAT GGCAAACAAC CCA6AGGACT GGTTQAGTTT GTTOCTCAAA CTAOAOAAAA 1200 

ACAGTGTTCC GCTAAGTGAT GCTCTTTTAA ATAAATTGAT TGGTCGTTAC AGTCAAGCAA 1260 

TTGAAQCXSCr TCCCCCAGAT AAATATGGCC AAAATQAGAG TTTTGCTAGA ATTCAAOTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAQAQC CAGATOATGC AOGTGACTAC TTTCAAATGG 1380 

CCAGAGCAAA CTGCAAGAAA TTT G C iYriti TTCATATATC TTTTGCACAA TTTOAACIGT 1440 

CACAAGGTAA TGTCAAAAAA AOTAAACAAC TTCTTCAAAA AGCTGTAGAA COTOQAOCAO 1500 

TACCACTAGA AATGCTGGAA ATTGCCCTGC GGAATTTAAA CX:TCCAAAAA AAOCAGCTCC 1560 

TTTCAGAGQA GQAAAAGAAG AATTTATCAG C3VTCTACGGT ATTAACTGCC CAAGAATCAT 1620 

TTTCCGOTTC ACT TGGGC!A T TTACAGAATA GOAApAACAG TTOTGATTCC AGAGGACAGA 1680 

CTACTAAAGC CftOQTTTTTA TATGGAGAQA ACATGCCACC ACAAGATGCA GAAATAlQGTT 1740 

ACOGGAATTC ATTGAGACAA ACTAACAAAA CTAAACAQTC AT Q COC A TTT GQAAGA6TCC 1800 

CAGTTAACCr TCTAAATAGC CCAGATTOTG AT6TGAAGAC AGATOATTCA GTTOTACCTT 1860 

GTTTTATGAA AAQACAAACC TCTAOATCAG AATGCCGAGA TTTGGTTGTG CCTGGATCTA 1920 

AACCAAGTQG AAATGATTCX: TGTGAATTAA GAAATTTAAA GTCTCTTCAA AATAOTCATT 1980 

TCAAGGAACX: TCTGGTOTCA GATOAAAAGA OTTCIGAACT TATTATTACT 6ATTCAATAA 2040 

CCCTOAAGAA TAAAAOGGAA TCAAGTCTTC TA6CTAAATT AGAAOAAACT AAAGAOTATC 2100 

AAGAACCAGA GGTTCCAGAO AOTAACCAGA AACAOTGGCA AGCTAAGAGA AAGTCAOAOT 2160 

OTATTAACCA GAATCCTGCT GCATCTTCAA ATCACTGGCA GATTCOGQAG TTAGCCCGAA 2220 

AAGTTAATAC AGAGCAGAAA CATACCACTT TTOAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AQTCACXaCC AATATCAACA TCTAAAT6GT TTGACCCAAA ATCXATTTOT AAGACACCAA 2340 

GCAGCAATAC CTTGGATOAT TACATGAflCT OTTTTAGAAC TCCAGTTOTA AAOAATGACT 2400 

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATQGCCAACC TGCCT Q TTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAGCATCT TCTTCA0C3VA 2520 

AT6AATGCAT TTCGGTTAAA GQAA6AATTT ATTCCATATT AAA6CAGATA GGAAGTGQAQ 2580 

GTTCAAGCAA OOTATTTCAG OTCTTAAATG AAAAOAAACA GATATATGCT ATAAAATATG 2640 

TGAACTTAGA AQAAGCAGAT AAOCAAACTC TTQATAGTTA CGGQAAOGAA ATAGCTTATT 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCQACT TTATGATTAT GAAATCAOGQ 2760 

ACCAGTACAT CTACATGGTA ATGGAQTGTG GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 

AQAAAAAATC CATTGATCCA TGGGAAOSCA AQAGTTACTO GAAAAATAT6 TTAGAGQCAQ 2880 

TT CACA CAAT CCATCAACAT GOCATTOTTC ACAOTOATCT TAAAOCAGCT AACTTTCTOA 2940 

TAGTTOATGG AATGCTAAAG CTAATTGATT TTOGOATTGC AAACCAAATG CAACCAGATA 3000 

CAACAAGTGT TGTTAAAGAT TCTCAG6TTQ GC3VCAGTTAA TTATATGCCA CCAQAAQCAA 3060 

TCAAAGATAT GTCTTCCTCC AGAGAQAATG GGAAATCTAA GTCAAAGATA AGCCCCAAAA 3120 

GTGATGTTTG GTCCTTAGGA TGTATTTTGT ACTATATGAC TTACGGGAAA ACACCATTTC 3180 

AGCAGATAAT TAATCAOATT TCTAAATTAC ATGCCATAAT TGATOCTAAT CATGAAATTG 3240 

AATTTCCOGA TATTCCAGAG AAAOATCTTC AAaATGTGTT AAAOTGTTOT TTAAAAA0G6 3300 

ACCCAAAACA GAGGATATCC ATTCCTGAGC TCXrTGGCTCA TCCATATCTT CAAATTCAAA 3360 

Cr CATCCAQ T TAAC CAAATG GCCAAGGGAA CCACTQAAGA AATGAAATAT GTTCTGGGOC 3420 

AACrrGTTGG TCTGAATTCT CCTAACTCCA TTTTGAAAGC TGCTAAAACT TTATAIXSAAC 3480 

ACTATAOTQQ TOQTQ AAAOT CATAATTCTT CATCCTOCAA GACTTTTOAA AAAAAAAGGG 3540 

QAAAAAAATGJVTTTOCAOTT ATTOGTAATO TCAGATAG6A OGTATAAAAT ATATTOGACT 3600 

QTTATACTCT TGAATCCCTO TGGAAATCTA CATTTQAAQA GAACATCACT CTGAA8TGTT 3660 

ATCA6CAAAA AAAATTCMST OAGATTATCT TTAAAAGAAA ACTGTAAAAA TAGCAACCAC 3720 
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TTATGQCACT GTATATATTO TAGACTTGTT TTCrCTGTTT TATGCTCTTG TQTAATCTAC 3780 
TTGACATCAT TTTACTCTTG GAATAGTGGG TG6ATAGCAA 6TATATTCTA AAAAACTTTQ 3840 
TAAATAAAGT TTTGTGCCTA AAATGA 
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Alls Protein seqwencei 
Gene namet 
Unigene nuinber: 
Pzobeset Accession 8 : 
Protein Accession ft: 
Signal sequence i 
Transtnenibrane domains: 
Protein Kinase Donaini 
Cellular Localisation i 



TTK protein kinase 

He. 169840 

M86699 

KP_003309 

none found 

none found 

510-775 

cytoplasnic and nuclear 



1 

I 

MNKVRDIKNK 
VPIiSDAliUnC 
ANC3CKFAFVH 



NSLRQTNKTK 
SGNDSCELRN 
PBVPESNQKQ 
PFI8TSKWFD 
QILATPIiQNL 
LEEADNQTLD 
KSIDPWBRKS 
SWKDSQVQT 
XiNQISXIiHA 
PVNQMAKGTT 
K 

OVARIAN 



II 

I 

FKNEDLTDEL 
LIQRYSQAIE 
ISPAQFELSQ 
TVLTAQBSF8 
QSCPFGRVPV 
LKSVQNSHPK 
HQAKRKSECI 
PK8ICKTPSS 
QVLAS8SANE 
SYRNBIAYLN 
YWKNMLEAVH 
VNYMPPEAIK 
IISFNKBIEF 
EEMKYVLGQL 



21 

I 

SLNKISADTT 
ALPPDXYGQN 
GNVKKSKQLL 
GSLGHliQintN 
NLLNSPOCDV 
BPLVSDEKSS 
NQNPAASSNH 
NTLDDYMSCF 
CI8VKGRIYS 
KLQQBSDKII 
TIHQHQIVHS 
DM5SSRENGK 
PDIPEKDLQD 
VGLNSPNSIL 



31 
I 

DNSGTVNQIM 
ESFARIQVRF 
QKAVERGAVP 
NSCDSRGQfrT 
KTDDSWPCF 
ELIITDSITL 
WQIPELARKV 
RTPWKNDFP 
ILKQIGSGGS 
RI.YDYEI7DQ 
DLKPAHFLIV 
SKSKISPKSD 
VLKCCLKRDP 
KAAKTLYBHY 



41 

! 

MMANNFEDHIi 
AELKAIQEPD 
IiENLBIALRN 
XARFLYGEIM 
MXRQTSRSEC 
KNKTESSLLA 
NTEQKHTTFE 
PA0QLSTPY6 
SKVFQVUNBK 
YIYMVKEC6N 
DGMUaiDFG 
WfSLGCILYY 
RQRISXPELL 



51 
I 

8LLLKLBXKS 
DARDYFQMAR 
LNLQKKQLLS 
PFQDABXOyR 
RDLWPGSKF 
KLCETKBYQB 
QPVFSVSKQS 
QPACFQQQQH 
KQIYAIKYVN 

iDiiNSHijaac 

lANQMQPDTT 
MTYGKTPPQQ 
AHP YVQIQ TH 
SKTPEKRR6K 



60 
130 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
640 



A116 DMA SEQPBtTCB 



Unigene nunlber: 
Probeaet Accession fti 
Nucleic Acid Accession #i 
Coding sequence t 



Q protein-coupled receptor 39 

He. 85339 

AA349893 

NM_00150B 

1-1362 (underlined sequences correspond to start and stop codons) 



ATGG CTTCAC 
CCCGAGTTTG 
TTCQTGATOG 
AAAGGATACT 



ACQTCCAGCT 
QCTACOCTGC 
TTCAGGTACA 
OTCACCTGOO 
GTGAAC6TGC 
CAQCCOQASA 
CftGTC CAGCA 
ATGTGCTG6A 
AC60G6CCTC 
ACCATCATCT 
ATTCGOAGGA 
GC6TACATGA 
CG8CTCCTOT 
TGCOGCCTOT 
ACCACCOACA 
TCTGCAAGGA 
TCTAAGTCCC 
AATTCTGCTG 



11 

I 

CCAGCCTCCC 
AGGTGGCCAC 
GCCTTCTQ6G 
TGCAOAAGGA 
TCATCGGCAT 
ACACCCTGTC 
TGCACQTQCT 
AOOCTQTOTC 



CCAGCCACCG 
CCTCCAATAT 
TCTT0G6CGC 
ACATGATGCA 
OQCAGCTGAG 
TCCTGAGGCT 
TCATGGCTGC 
TCCTCCTCCC 
ACAOGGTQTC 
CGCTGCAGCA 

GAACTGAGAA 
AGTCATTGAO 
CAGAGAATGO 



21 
I 

GGGCAGTGAC 
CTGOATCAAA 
GAACAGG6CC 
GGTGACA6AC 
GCCCATGQAG 
CTGCAAGCTO 
GACAC TCAGC 
GGGACCTTGC 
ACTGCCCTTO 
GGGTCTCACT 
GTCCATCTGT 
CTT08TGGTC 
OaTOCTCATQ 
GAAGTCCGAO 
GATTOTTGTO 
GGCCAAACCC 
CTTCTCGGAG 
CXGGCAGCAG 
GBCCAACCAC 
TGTGCAGOQC 
GATTTTCTTA 
TCTOGAGTCA 
TTTTCAQGAG 



31 

I 

TGCTCCCAAA 
ATCACCCTTA 
ACCATTCGGG 
CAGATGGTQA 
TTCTACAGCA 
CACACTTTCC 
TTTGAGCGCT 
CAGGTGAAGC 
CTGTTTGCCA 
TGCAACCGCT 
ACCAACCTCT 
TACCTOGTGG 
AAAAGCCAOA 
AGOQAAGAGA 
ACATT6GCCS 
AAGCAOGACT 
A06TTTTTCT 
TTTCGGGGGO 
GAGAAGC6CC 
COGTTGCTCT 
AGCACTTTTC 
CTAGAG CCCA 
CAT6AAGTTT 



41 

I 

TCATTGATCA 
TTCTGGTOTA 
TCACCCAGGT 
OTTTGGCTTG 
TCATCTGOAA 
TCTTOQAGGC 
ACATOGCCAT 
IGCTGATTGG 
TGGGTACTGA 
CCAGCACCCG 
CCAOCCGCTG 
TCCTGCTCTC 
AGGGCTOGCT 
GCAGGACC6C 
TATGCTGQAT 
GGA06AGGTC 
ACCTCAGCTC 
TQTTOGTGCA 
TG0GC6TACA 
TCGCGTCCCG 
AGAGCGAGGC 
ACrCAGGOGC 
GA 



51 

I 

CAGTCATGTC 
CCTGATCATC 
GCTGCAGAAO 
CTCGGACATC 
TCCCCTGACC 
CTGGAOCTAC 
CTGTCACCCC 
CTTOGTCTGG 
OTACCCCCTQ 
CGACCAOQAO 
GACOGTOTTC 
CGTAGCCTTC 
GQCCGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CXACTTCOQG 
GGTCATCAAC 
GGTGCTGTGC 
TQOQCACTCC 
GCGCCAGTCC 
C3AGCCCCAG 
GAAACCAGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



A117 Protein sequence! 
Gene namet 
tniigene nunbert 
Probeaet Accession #t 
Protein Accession 9t 
Signal sequence i 
Pfam domains! 
Trazisnienibrane domains t 
Cellular Localization: 



G protein-coupled receptor 39 

HS. 85339 

AA349893 

NM_001S08, NP_001409 
none £ound 

7tin 1 (72-172, 224-344] 

32-54 « 68-90. 111-133 » 151-173, 

plasma meanbrane 



221-243. 280-301, 320-342 



1 n 21 31 41 51 

I I I I t i 

MASPSLPGSD CSQIIUKSKV PEPBVATWIK XTLILVYLIZ PVN8LL6KSA TIRVTQfVLQK 

886 



60 



wo 03/042661 



PCT/US02/36810 



KGYXiQKEVTD RMVSIACSDI LVFLZGMPNE FY8IXNNPLT T8SYTZ«SCRL HTPLFEACSY 130 

ATLLBVLTLS FBRYIAIW FRYKAVSGPC QVXLLIGFVH VTSALVALPL LFAMGTBYPL 180 

VNVPSHRGLT CWRSSTRHHE QPBTSNMSIC INLSSRWrVF QSSIPGAPW YLWLLSVAP 240 

MCWNMMQVLM KSQKGSLAGO TRPPQLRKSB 8EESRTARRQ TIIPLRIilW TLAVCW4PNQ 300 

IRRIMAAAKP KHDWTRSYFR AYMILLPPSB TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 360 

CRLSLQHAMH BKRXiRVHAKS TTDSARFVQR PU<FASRRQS SARRTEKIFL STFQSEABPQ 420 
SKSQSLSLBS LBFMSGAXPA NSAABNGFQB HBV 



AliB DMA sequence 
Gene name i 
Unigene nuaiber: 
Probeset Accession #t 
Nucleic Acid Accession 4s 
coding sequence: 



bone rooxphogenetic protein receptor IB (ALK-6) 

Hs. 87223 

AA2S0737 

liM_001203 

274-1782 (underlined sequences correspond to start and stop codons) 



C60GG6G0GC 
GAGGACG060 
GTGAAAGGAA 
CATAACCATT 
TOCOVTAAGT 
AATGTGGGCA 
TTGOQTTQTA 
GA06GATATT 
GGTTOCGTAO 
AGAAGATCAA 
CTGOCTCCAT 
ATATCTGTGA 
TATAAAAGAC 
ATTOCTC C TG 
TCRGGCCTCC 
ATT6GAAAA0 
GTGAAAGTGT 
ACAOTGTTGA 
0GGTCCTG6A 
TATCT6AAGT 
AGTG6CTTAT 
CATCQAGATC 
GACXnXKSGCC 
ACTGQAGTTO 
AATCACTTCC 
GTTGCTAGQA 
CTAOTGCCCA 
CGCCCCTCAT 
ATOACAOAAT 
ACACTT6CCA 
CATCTCTGCA 
TAAGCATCCA 
CTTTCAGGGA 
TCTGTTTGTA 



11 
I 

GGAGTO6GO0 
GAGCCGOGAG 
AGGAA6ATGA 
TGGCTCTGAG 
GAQAAGCAAA 
CCAAGAAAGA 
AATGCCACCA 
GTTTCAOGAT 
QACTAGAAQG 
TTGAATGCTG 
TGAAAAACAG 
CTGTCTGTAO 
AAGAAACCAO 
GAGAATCCCT 
CTCTQCTGGT 
GTOGCTATGG 
TCTTCACCAC 
TOAO GCATGA 
GCCAGTTGTA 
GCACCAOCCT 
GTCATTTACA 
TGAAAAGTAA 
TGGCTOTTAA 
GCACCAAAOQ 
AGTCTTACAT 
GATGTGTATC 
GTGACCCCTC 
TCCCAAAC08 
GCTGGQCTCA 
AAATOTCAGA 
GAAAGCCAAC 
CAGTACAAGC 
GCGACCTGGG 
OOOQGAGAAA 



31 

I 

GGOCCTCQCG 
CGC3VCGCX3CX3 
TTTCATGCCT 
CTATQACAA6 
CTTCXTTTQAT 
G6ATGGTGA0 
CCATTGTCCA 
GATAGAAOAG 
CTCAGATTTT 
CACAQAAA60 
AGATTTTGrr 
TTTGCTCTTG 
ACXrrOGATAC 
GAlQAGACTTA 
CCAAAGQACT 
GGAAGTTTGO 
AGAGGAAGCC 
AAACATTTTO 
CCTAATCACA 
AGACGCTAAA 
CACAGAAATC 
AAACATTCTG 
ATTTATTAGT 
CTATATGCCT 
CATGGCTGAC 
AGGAGGTATA 
TTATGAGGAC 
GTOGAGCAGT 
CAATCCT6CA 
GTCCCAGOAC 
AGGTACTCTT 
CTTQAACATC 
CAAA6ACAGA 
COOTTGGGTA 



31 
I 

GGA0GCGG6C 
GGGTGGAGTT 
TGTTQATAAA 
AQAGGAAACA 
AA CATGC TTT 
AGTACAQCCC 
GAAGACTCAO 
GATGACTCTG 
CAQTGTGGGO 
AACX3AATQTA 
GATGGACCTA 
GTCCTTATCA 
A6CATTGGGT 
ATT8AGCAGT 
ATAGCTAAGC 
ATGGGAAAGT 
AGCTGGTTCA 
6GTTTCATTG 
GACTATCATQ 
TCAATGCTGA 
TTTAGTACTC 
GTGAAGAAAA 
GATACAAATG 
OCAGAAOTar 
ATGTATAGTT 
GTGGAAGAAT 
ATOAGGGAGA 
QATQAGTGTC 
TC!AAGGCTGA 
ATTAAACTCT 
CTGTTTGTGG 
GTCCTGCTTC 
GAAGCTCCCA 
ACZTGTTCAA 



41 

I 

A6TG0G6AGA 
CAGCCTACTC 
GOTTCAGACT 
AAAAOTTAAA 
TGOGAAGTGC 
CCACCCCCCO 
TCAACAATAT 
GGTTGCCTGT 
ACACTCCCAT 
ATAAAGACCT 
TACACCACAG 
TATTATTTTO 
TAGAACAGGA 
CTCAQAGCTC 
AOATTCAOAT 
G6C6TGGCGA 
GAGAGACAGA 
CTGCAGATAT 
AAAAT6GTTC 
AGTTAGGCTA 
AAGGCAAACC 
ATGGAACTTG 
AAGTTGACAT 
TOGR OOASAG 
TTGGCCTCAT 
ACCRGCTTCC 
TTGTGTGCAT 
TAA GGCAGA T 
CASCX3CTG0S 
GATAGGAGAG 
GCAGAOCAAA 
CCAGTGGGTT 
GAAGGAGA6A 
OATATGATGC 



51 
1 




COGCGGOGCT 


60 


TTTCTTAOAT 


120 


TCTGCTGATT 


ISO 


CTTACAAGCC 


240 


AGGAAAATTA 


300 


TCCAAAGGTC 


360 


TTGCAGCACA 


420 


GGTCACTTCr 


480 


TOCTCATCAA 


540 


ACACCCTACA 


600 


GGCTTTACTT 


660 


TTACTTCCGG 


720 


TQAAACTTAC 


780 


AGGAAGTGGA 


640 


GGTGAAACAG 


900 


AAAG0TA6CT 


960 


AATATATCAG 


1020 


CAAAGGGACA 


1080 


CCTTTATGAT 


1140 


CTCTTCTGTC 


1200 


AGCAATTGCC 


1260 


CTGTATTGCT 


1320 


ACCACCTAAC 


1380 


CTTQAAGU3A 


1440 


CCTTTGGGAG 


1500 


TTATCATGAC 


1560 


CAAGAAOTTA 


1620 


GGGAAAACTC 


1680 


GGTTAAGAAA 


1740 


GAAAAGTAAG 


1800 


AQACATCAAA 


1B60 


CAGACCTCAC 


1920 


TTGATCOGTG 


1980 


AT 





All 9 Protein sequence 
Gene name: 
Unigene number i 
Prbbeset Accession fft 
Protein Accession fti 
Signal sequences 
Transmembrane domains t 
PFAM domains* 
cellular Xiocalisationi 



bone morphogenetic protein receptor IB (ALK-6) 
Hs. 72472 / Hs. 87223 
AA2S0737 / U89326 
NP 001194 

1-13 
128-144 

activin^receptor [30-111] , protein kinase [204-491] 
plasma nenbrane 



1 11 

t I 
MLLRSAGKLN 
DSGLPWTSG 
GPIHHRALLI 
EQSQSSG8GS 
HFRBTBIYQT 
MLXLAYSSVS 

TOEVDiprar 

EByQXiPYBDL 
RLTALRVXRT 



CLGLEGSDFQ 
SVTVCSLLLV 
GliPUiVQRTI 
VUmHENILG 
OXXSnJiTBIP 
RVGrXRYMPP 
VPSDPSYSDH 
LAKM8EGQDI 



21 
I 

TAPTPRPKVIi 
CRDTPIPHQR 
LIILFCYPRY 
AKQIQMVKQI 
PIAADIK6TG 
STQOKPAXAH 
BVLDESLNXQI 
RBIVCXKKLR 
KL 



RSIBCCTERN 
KRQBTRPRYS 
GKGRYOEVWM 
SWTQLYtilTD 
RDLKSXHILV 
HFQSyiNADM 
PSFPMRWSSD 



41 
I 

DSVNNICSTD 
ECNKDLBFTL 
lOLEQDETYI 
GXNRGEKVAV 
YHaOQSLYDY 
KX H GTOC I AD 
YSFOLIIiHBV 
BCLRQKOKLM 



51 
I 

QYCPTMIEED 
PPLKNRDFVD 
PPGESLRDLI 
KVFPTTEBAS 
LXSTTXiDAKS 
LGXAVKFI8D 
ARRCVSGQIV 
TECKAHNPAS 



60 
120 
180 
240 
300 
360 
430 
480 



A120 DMA SEQUENCE 

Gene name: LIV-I protein, 

Unigene number: 

Probeeet Acceeeion #i 

Nucleic Acid Accession #: 

Coding sequence i 



estrogen regulated 
Hs. 79136 
U41060 
NM_012319.2 

138-2405 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



51 

887 



wo 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



I I I I I I 

CTCQTQC06A ATTOGGOICS AiQACCGOGTG TTOGOOGCTG 0TAGA6ATTT CTCQAAGACA 60 

CCA6TGGGCC 0GTGTG6ARC CAAACCTGCG CGCaTGGCOQ GOCOOTGGOA. CAAOGAGGOC 120 

GOGGAQAOQA AOOCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC IBO 

TCTCTGTCAC AAATCCCCTT CATQAACTAA AAGCAGCTGC TTTCCOCCAO ACCACTGAOA 240 

AAATTAGTOC GAATTGQQAA TCTG6CATTA ATGTTGACTT GGCAATTTOC ACACX36CAAT 300 

ATCATCTACA ACAQCTTTTC TAC06CTAT6 GAOAAAATAA TTCrTTGTCA GTTGAAGGGT 360 

TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCAOQACCA TCACTCAGAC CAOaAGCATC ACTCAOACXav TGAGCGTCAC TCAGACCATG 480 

AGCATCACTC AGACCAGGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 

C LX i Cr iC XQa TAAAAATAAO OQAAAAQCTC TTTOGGCAOA OCATGACTCA GATAGTTCAG 600 

GTAAAGATCX: TAQAAACAGC CAGGGGAAAG GAOCTCACOG AGCAGAACAT GCCAGTGGTA 660 

GAAGGAATGT CAAGGACAGT GTTAGTGCTA OTQAAGTOAC CTCAACT6TG TACAACACTG 720 

TCTCTQAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 

CCAAAGAT6T AAGCAOCTCC ACTCCAGCCA GIGTCACATC AAAOAGCCGO OTQAGCOQGC 840 

TGGCTGQTAa 6AAAACAAAT GAATCTOTGA OTGAOCCC06 AAAAGGCTTT ATOTATTCCA 900 

GAAACACAAA TGAAAATCCT CAGGAGTGTT TGAATGCATC AAA6CTACT0 ACATCTCAT6 960 

GC3VTGGGCAT CCAGGTTCCX3 CTGAATGCAA CAGAGTTCAA CTATCTCTQT CCAGCCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATAC3UVG TGAAAAOAAG GCTGAAATCC 1080 

CTCCftAAGAC CTATTCATTA CAAATAGCCT GGGTTGGT6G TTTTATAOCC ATTTGCATCA 1140 

TCAJBTTTCCT GTCTCTGCTG GGGQTTATCT TAGTGOCT C T CA7GAAT0G0 GTOTTTTTCA 1200 

AATTTCTCCT GA6TTTCCTT GTGGCACTGG CCGTTGGQAC TTTGAGTGGT GATOCTTTTT 1260 

TACaCXrrrCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 

CAAT6GAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTGAAAAC ATAGAAOAAA 1380 

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT A6GAGGCCT6 TATTTCATQT 1440 

TTCTTQTTGA ACAT6TCCTC ACATTGATCA AACAATTTAA AOATAAQAAG AAAAAOAATC 1500 

A6AAGAAACC T6AAAATOAT GATGATGTGG AGATTAAQAA GCAGTT6TCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAA6TA6 ATACAGATQA TGGAACTGAA GGCTATTTAC 1620 

GAGCAGACTC ACAAGAGCCC TCCCACTTTO ATTCTCAGCA GCCT6C3W3TC TTGQAAGAAG 1680 

AAQAGQTCAT GATAGCTCA7 GCTCATCCAC AGGAAGTCTA CAAT6AATAT 0TACCCAGA6 1740 

GGTGCAAGAA TAAAT6CCAT TCACATTTCC ACX3ATACACT OQOOCAQTCA QAOOATCTGA 1800 

TTCACCACCA TCATQACTAC CATCATATTC TCXaVTCATCA CCACCACCAA AACCaVCCATC 1B60 

CTCACAGTCA CAGCCAGCGC TACTCTCGGG AOGAOCTGAA AGATOCCOQC GTOOCCACTT 1920 

TGGCCTGGAT GQTGATAATG GGTGATGGCX: TGCACAATTT CAGOSATGGC CTAGCAATTG 19 BO 

GTGCTGCTTT TACTQAAGGC TTATCAAQTG GTTTAAGTAC TTCTGTTSCT QTOTT CTGTC 2040 

ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAOGCTQGC ATGACXX3TTA 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC QTATCTTGGA ATOGCAACAG 2160 

GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTQCTQ 2220 

GCTTATTCAT GTATOTTGCT CTGGTTGATA TQGTACCTQA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATQTAGCCGC TXKSGGGTATT TCTTTTTACa GAATGCTGGQ ATGCTTTTGG 2340 

GTTTTGGAAT TAT3TTACTT ATTTCCATAT TTQAACATAA AATOGTGTTT CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TOVTAGTTTC AGTAGGTCAT 2460 

AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGQT TTTGTGATrT 2520 

TTGTATTGAA TATTOCTGTC TGTTACAAAO TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 

TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATO 2640 

TAAACAAQAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GQAAAAATGT CTTTAATGCT 2700 

TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTQAA GAACTGCTGG 2760 

TGTTTAG6AA TAAGAATOTS CATGAAOCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 

AGCAAAGAAA TAAA0GA6AA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 

AAAAATCACA AAATTTGrTG TAAATTAGAG GGGAQAAATT TAGAATTAAO TATAAAAAGG 2940 

CAQAATTAOT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC COGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 

AAATATATTT AAT6AATTCA AGCAATATAC ACTT6ACCAA QAAATTGGAA TTTCAAAATG 3120 

TTOQT606GG TTATATACCA 6ATGAGTACA GTGAGTAOTT TATGTATCAC CAQACTGGGT 3180 

TATTGCC3UU3 TTATATATCA CCAAAAGCTG TATQACTGQA TGTTCTGGTT ACCTGGTTTA 3240 

CAAAATTATC AGAQTAQTAA AACTTTGATA TATATGAGQA TATTAAAACT ACACTAAGTA 3300 

TCATTTGATT COATTCAOAA AGTACTTTOA TATCTCTCAQ TGCTTCAGTG CTATCATTGT 3360 

GA6CAATTGT CTTTATATAC OGTACTGTAa CX3VTACTAG6 OCTOTCrOTG GCATTCTCTA 3420 
GAIOTTTCrr TTTTACACAA TAAATTCCTT ATATCAQCTT G 



A121 PROTEIN SEQUEWCg 
Gene name* 
Unlgene nuniberi 
Protein Accession #: 
Signal sequence! 
Pfam domain: 
Transmenibrane domains: 
Cellular Localisation t 



NARRLSVILX 
PYRYGEiniSI* 
EHHSDKDHHS 
SVSASEVTST 
NBSVSBPRKG 
RSCLXBTSEK 
LVALAVGTLS 
THKGLTALGG 
EBKVDTDDRT 
KSUFHDTLGQ 
MGDGLBNFSD 
NALSAHX«An* 



11 

I 

LIPAIiSVTNP 
SVEGFRKLLQ 
HHNHAASGKN 
VYNTVSEGTH 
PMYSRNTNEN 
XABIPPKTYS 
GDAFLHLLPH 
LYFMPLVEHV 
BOYLRAOSQB 
8DDLIHUHKD 
GLAIGAAFTB 
GMATOIPZGB 



21 

I 

LHELKAAAFP 
HXGIORIKRI 
KRKALCPOHD 
PLBTIBTPRP 
PQECFNASKL 
LQIAWVGQFI 
8BASHHHSHS 
LTLIKQFKDK 
PSHPDSQQPA 
YHHILHHHHH 
GLSSGLSTSV 
YABIVSMHXF 



LIV-1 protein, estrogen regulated 
He. 79136 

HP 036451 

1-ai 

Zip (591-743) 

330-346, 352-368, 427>444, 663>679, 
plasma raeiaibrane 



31 



688-703, 730-745 



QTTSKI8PNW 



SDSSOKDPRN 
GKLFPKDVSS 
LTSHGHQIQV 
AI8IISFLSL 
HEEPAKEMKR 
KKKNQIOaPEN 
VLEEEBVMIA 
QNHHraSHSO 
AVPCKBLPHB 
ALTAaXiPKYV 



41 


51 




1 

ESGINVDLAI 


1 

StRQYBLQQL 


60 


DHBHHSD3ER 


HSDHEHHSDH 


120 


SQGKQABRPE 


HA8GRRNVKD 


180 


8TPP8VT8K8 


RV8RLAGRKT 


240 


PU<ATEFNYb 


CPAIINQIDA 


300 


LGVILVPLMN 


RVPPKKLL8P 


360 


GPLF8HLSSQ 


NZEE8AYFDS 


420 


DDOVBIKKQL 


skyesq:.stii 


480 


HAHPOBVYNB 


YVPRGCKNKC 


540 


RYSREBLXDA 


GVATLAHMVZ 


600 


L6DFAVLLKA 


GMTVKQAVLY 


660 


ALVDMVPEKL 


BHDASDHGC8 


720 



888 



wo 03/042661 



PCT/US02/36810 



RWOYPFliQIlA GMU16F6ZML LI8IPBBRZV FRXNF 



Gene namei 
Unlgene number 1 
Proboaet Acceeaion #: 
Nucleic Acid Accession fit 
Coding sequences 



BGF- like-domain; multiple 6 

H8-12844 

N67551 

NM_015507 

341-1902 (underlined sequences correspond to start and stop codons) 



CCGCAQA06A 
OSCCCCT G OC 
08AGTGGA6C 
GGQTCCGGCC 
ATQC CIX.TGC 
GGOAACGOGG 
TGTCACTATG 
TGT6AAGCTA 
WSATGCTTTC 
AAACCCOGGC 
CTCAGTG6CC 
ATAAACIGTC 
TCA6GACTCC 
6GTAAAGTCA 
AAAT6TCACA 
AATGAATGTA 
OGGTCCTTCA 
ATCCCIGAAA 
AAOAAGTTQC 
CCAQAACCCA 
ATA8TTTCCA 
GAG6G6CTT0 
A6CCTGCX3AO 
CTGOTCCAAA 
GACTQCAGCT 
TGGAATCCTQ 
GGTGACMOA 
AACTTCTGTT 
TTT6T6AAAA 
TC3GAAGACAG 
GAAOCSUSAAC 
TCAG6CTTAT 
TTQACTTTQT 
TTAGAATTAC 
TCTTOTATAA 
TTTCTGAATC 
CAOTAIATCT 
TAGAAAAAAA 
TATGACATCA 
TOTATATTTA 



11 
I 

GCCTOGGCCA 
G06GTGCCT6 
GGAGGAOCOG 
GOOGOCCTCC 
CCTGGAGOCT 
CCAST6CAAG 
GAACTAAACr 
CATGOGAACC 
CAGGATACAC 
CATGCCAACA 
ACATOCTCAT 
AGTACAGCTO 
GOCTGGCCGC 
TCTGTOCCTA 
TTOGTTTOGA 
CTAT6GATAG 
AOTGTAAATG 
ATTCTGTGAA 
TTGCTCACAA 
CX3U3GACTCC 
GAGGCX36QAA 
A6QATGAGAA 
GW5ATGTGTT 
GGAAA6CX3CT 
TCAATCATGG 
CTGATCGA6A 
AAGACATTGO 
TGCTCTTTGA 
ACAGTAACAA 
GGAAAATTCA 
GTGGCAAGGG 
GTCCAGATAG 
ATGTCA6TTC 
TAGCTGAAAA 
GATATGCCAA 
TTTCCACATT 
GATTTGTATA 
AGCACAGAGA 
AA0ATA8ACT 
AATTCTTTOI 



21 

I 

G6CTAGCCAG 
OCCTC CCC YC 
AGOGGCTGAO 
CGAGGGGGGC 
TGCGCTCCCQ 
6CATCACGGG 
G80CTQCTGC 
T6GATGTAAG 
OSGGAAAACC 
CAGATGTGTG 
6CCAGATGCT 
TGAA6ACACA 
AAATGGAAGA 
CAATCGAAGA 
ACTGCAATAT 
CCATAOGTGC 
CAA6CAGGGA 
GQAAGTCCTC 
AAACAGCATG 
TACCCCTAAG 
CTCTCATGQA 
AAGAGAAGAG 
TTTCCCTAAO 
AACTTCCAAA 
OATCTOTGAC 
TAATGCTATT 
CXXSATTGAAA 
TTACCXjGCTG 
TGCCCTGGCA 
GTTGTATCAA 
CAAAACCGGC 
OCTTTTATCT 

ATTGTAATGT 
TATTTGCTTT 
ATATTATAAA 
AGTAAGTTQA 
AATGTTTAAC 
TTTQGCTAAG 
AATAATAATA 



31 

I 

GGCGCCCCCA 
CCAGACTGCA 
GAQAGAGGAO 
TCAGGA6GAG 
CTGCTGCTCT 
TTGTTAGCAT 
TACGGCTGGA 
TTTGGTGAQT 
TGCAGTCAAG 
AATACACACO 
A06TGT6TGA 
GAAGAAGGGC 
GACTGTCTAG 
TOTXyrOAACA 
ATCAGTGQAC 
AGCCACCATG 
TATAAAGGCA 
AGAGCACCTG 
AAAAAGAAGO 
GTGAACTTGC 
GGTAAAAAAG 
AAAGCCCTGA 
GTGAATGAA6 
CTGGAACATA 
TGGAAACAGG 
GGCTTCTATA 
CTTCTCCTAC 
GCOGQAGACA 
TGGGAGAAGA 
GGAACTGATO 
GAAATCGCAO 
GTGQATGACT^ 
TTQATATTGC 
ACCAACAGAA 
AAA7ATCATA 
ATATGGAAAT 
TGAGCTTCTC 
TGTTTQACTC 
TGGCTTAGCT 
TCCAAATCAT 



41 

1 

Gccxxnxxxx: 

GGGACAGCAC 
GCGGOGGCTT 
GAAGQAQGAC 
CCTG6GTGGC 
CX3GCA0QTCA 
GAAGAAACAG 
GCGTGGGACC 
ATGTGAAT6A 
GAAGCTACAA 
ACTCTAQGAC 
CACAGXGCCT 
ATATTGATQA 
CATTTGGAAG 
GATATQACro 
CCAATTGCTT 
ATGGACTTCG 
GTACCATCAA 
CAAAAATTAA 
AGCCCTTCAA 
GGAATGAAGA 
AGAATGACAT 
CAGGTGAATT 
AAGATTTAAA 
ATAGAGAAGA 
TQGCAGTTCC 
CTGAGCTQCA 
AACrrOGGGAA 
CCACXZAQTGA 
CTACCAAAAG 
TGGATGGCGT 
GAATGTTACT 
ATCATAGQAC 
ATATTATTGT 
TCACTGTATC 
GTCAGTTTAT 
TCTAC3UVCAT 
TTATGATACT 
GGGTCTTTCA 
CAAAAAAAAA 



51 
I 

AGGCOGCGAG 
CCGGTAACro 
AGCTGCTAOQ 
CCGTGCGAGA 
AGGTGGTTTC 
GCCTGGGGTC 
CAAGGGAGTC 
AAACAAAT6C 
GTGTGGAATQ 
OTGCTTTTGC 
ATOTQCCATO 
GTGTCCATCC 
ATGTGCCTCT 
CTACTACTGC 
TATAGATATA 
CAATACCCAA 
GTGTTCT6CT 
AQACAGAATX: 
AAATGTTAGC 
CTATGAAQAO 
GAAAATGAAA 
AGAGGAGCGA 
CGGCCTQATT 
TAT CTOGO TT 
TGATTTTGAC 
GGCCTTOGCA 
ACCCCAAAGC 
ACTTCGAGTG 
OQATGAAAAO 
CATCATTTTT 
CTTGCTTGTT 
ATCTTEATAT 
CTCTGGCATT 
AAGATGCX^TT 
TTCTCA6TCA 
CTCCCCTCCT 
TTCTAGAAAA 
TCTTGGAAAC 
TAQCCAAACT 
AAAAAAAA 



A123 Protein sequence 1 
Gene namet 
Unigene number: 
Probeset Accession #s 
Protein Accession #t 
Signal sequence: 
Transmenbrane domains s 
MAM doroaini 
EOF domain: 

Cellular Localization! 



B6F-like-domain; multiple 6 

Bs. 12844 

M67551 

NP__0S6322.2 

1-21 

none found 
402-546 
80-359 
secreted 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



MPLPHSXALP 
CEATCBPQCK 
LSGHMLMPDA 
GKVICPVNRR 
OSFKCKCKQG 
PEPTRTPTPK 
SLR8DVPFPK 
WNPADROHAI 
FVKNSmiALA 
SGLCPDSLLS 



11 
I 

I>LI*SWVAGGF 
FGECVGPNKC 
TCVNSRTCAM 
CVNTFOSYYC 
YKGNGLRCSA 
VNLQPFNYEB 
VNEAGEFGItl 
GFYMAVPALA 
NEXTTSEDBK 
VDD 



21 
1 

GNAASARHBG 
RCPPGYTGKT 
mCQYSCEDT 
KOIIGFELQY 
IPENSVKEVL 
IVSRGGKSHG 
LVQRKALTSK 
GHKXDIGRLK 
HXTGKIQLYQ 



31 

I 

LLASARQPOV 
CSQDVNECGM 
GEGPQCLCPS 
XSGRYDCIDI 
RAFGTXXDRZ 



LEHKDUJISV 
LLLPDLQPQS 
GTDATKSIIF 



41 

I 

CHYGTKLACC 
KPRPCQHRCV 
SGLRLAPNGR 
NECTMDSBTC 
KKLURXNSM 
EGLEDBKREB 
DCSFNHGICD 
NFCLLFDYIU^ 



51 
I 

YGWRKHSK6V 
NTHGSYKCFC 
DCLDIDECAS 
SBHAMCFNTQ 
KXKAKIKNVT 
ECALKNDIEER 
WKQDREDDFD 
A6DKVGKLRV 
EXAVD6VLLV 



60 
120 
160 
240 
300 
360 
420 
480 
540 



A124 DMA SEQUENCE 
Gene name : 
Ifoigene number 1 
Probeset Accession #s 
Nucleic Acid Accession «: 



bone morphogenetic protein 7 
Ha. 170195 
BB616633 
MM 001719 



(osteogenic protein 1) {aMP7) 



889 



wo 03/042661 



PCTAJS02/36810 



Coding eequencei 



123-1418 (underlined sequences correspond to start and stop codons) 



GGGCGCAGCG 
CTGCCACCTO 
OGATGCAOQT 
CCCTGTTCCT 
GCTTCATCXy^ 
CCATTTTGGG 
CCATGTTCAT 
OCCAGGGCTT 
GCCTC5CAAGA 
TGGAACATQA 
TTTCCAAQAT 
ACATCOGOGA 
AGCACTT6GG 
AGGAOGGCTG 
GGCAGAACCT 
AGTTGGGGGO 
TCTTOUVGGC 
GCCAGAACCG 
AQAACAGCAG 
GAOACCTGGO 
A6G6GGAGT0 
AGAOGCTGGT 
AGCTCAATGC 
ACA6AAACAT 
TTGGG6CCAA 

cixiOL ' iTrm 

AAACATQAGC 
TCCTACAAGC 
GCGGOGCCAG 
TTATGAGCQC 
OGQCACATTG 
CAATAAAACG 



11 

I 

GGGCOOGTCT 
GGGCGGTGCG 
GOGCTCACTG 
GCTGCGCTCC 
CCGGCXjCCTC 
CTTGCCCCAC 
GCTGGACCTG 
CTCCTACCCC 
TAGCCATTTC 
CAAGGAATTC 
CCCAGAAGGQ 
AC6CTTOGAC 
CAGGGAATC6 
GCTGGTGTTT 
GGGCCTSCAO 
CCTGATTGGa 
CAOQGAGQTC 
CTCCAAGACG 
CA6CGACCAG 
CTGGCAGGAC 
TOCCTTCCCT 
CCACTTCATC 
CATCTCOtSTC 
GGTGGTCCQG 
GTTTTTCTG6 
TOAGACCTTC 
A0CATAT6GC 
TGTGCAGGCA 
QTCATTGGCT 
CTACCAGCCA 
GTGTCTGTGC 
AATGAATG 



21 

1 

GCAGCAAGTG 
GGCCCGGAGC 
CGAOCTGCGG 
GCCCTGGCOG 
CX3CAGCCAGG 
OGCCOGOGCX: 
TACAAGQCCA 
TACAAGGGCO 
CTCACCGAOG 
TTCCACXXAC 
GAA6CTGTCA 
AATOAGACXST 
GATCTCTTCC 
GACATCACAG 

GGGCACGGGC 
CACTTC06CA 
CCC3VAGAACC 
AOGCAGGCCT 
T6GATCATCG 
CnjAACTGCT 
AACCCXaOAAA 
CTCTACTTOG 
GCCTQTGOCT 
ATCCTCCATT 
COCTCCCTAT 
TTTTGATCAG 
AAACCTAGCA 
GGGAAGTCTC 
GGCCACCCAG 



31 

I 

ACCGACGGCC 
CCGGAGCCCG 
CGCOGCACAQ 
ACTTCAGCCT 
A60QG06GGA 
CGCACCTCCA 
TGGCGGTGGA 
TCTTCAOTAC 
CCGACATGGT 
GCTACCACCA 
066CAGC0GA 
TC0C3GATCAG 
TOCTCGACAG 
CCACCAGCAA 
AGAOGCTGGA 
CCCAGAACAA 
GCATCOQQTC 
AGGAAGCCCT 
GTAAQAAGCA 
CGCCTGAAGG 
ACAIOAAOGC 
06GTGCCCAA 
ATGACAGCTC 
GCCACTAGCT 
GCTOGCCTTO 
CCCCAACTTT 
TTTTTCAQTQ 
GOAAAAAAAA 
AGCCATGCAC 
COGTGGQAGG 
TTGACC06GA 



41 

I 

GGGAOGGCOQ 
GGTAOCGCGT 
CTTOOTGGCG 
GGACAACQAG 
GATGCAGOOC 
GGGCAAGCAC 
GQAGGGCGGC 
CCAGGGCCCC 
CATGAGCTTC 
TGQAGAGTTC 
ATTCCG6ATC 
OOTTTATCAG 
COGTACCCTC 
CCACT6GGTG 
TGGGCAGAGC 
GCAGOCCTTC 
CAOGGGQAGC 
GCGGATGGCC 
OGAGCTGTAT 
CTACX3CCGCC 
CACX»ACCAC 
6CCCTGCTGT 
CAAOGTCATC 
CCTCCGAGAA 
GCCAGGAACC 
AAAGGTQTGA 
GCAGCATCCA 
ACAACX3CATA 
GGACTCGTTT 
AAGGG60G8T 
AGTTCCTQTA 



51 
I 

CCTGCCCCCT 
AGAGCCX3QCQ 
CTCTGGGCAC 
GTGCACTCGA 
QAOATCCTCT 
AACTOGGCAC 
GGOCCCGGOO 
CCTCTGGCCA 
GTCAACCTCG 
CGGTTTQATC 
TACAAGGACT 
GTGCTCCAGG 
TGGGCCTOGG 
GTCAATCOGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACA6CGCA 
AA06TGGCAG 
GTCAQCTTCC 
TACTACTGTG 
OGCATOOTGC 
GOGCCQICGC 
CTGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAOTATTAGO 
ATQAACAAGA 
AAOAAAAATG 
CCAGAGGTAA 
GGCAAGGG6T 
ATAAATGTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
60O 
660 
720 
780 
840 
900 
960 
1020 
1080 
1X40 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



A125 Protein oequencei 

Gene name; bone moxphogenetic protein 



7 (osteogenic protein 1) (BMP7) 



Unigene number i 
Probeset Accession Ut 
Protein Accession 4: 
Signal sequence : 
FCam domains: 
Transmembrane dooiainst 
Cellular Localization! 



He. 170195 
BB616633 
HP 001710.1 
1-30 

TGFbjpropeptide [37-281] 

none Eound 

secreted 



1 11 21 31 41 51 

I I I I I I 

MBVRSIiRAAA PHSFVAXiHAP LFLLRSALAD PSLDNBVHS8 FXRRRLR8QB RRBMQREZXiS 60 

IL6LFBRPRP HLQGKHHSAP MFMLDLYNAM AVBBGGGP66 QGFSYPYKAV FSTQGPPLAS 120 

LQDSHFLTDA DMVMSFVNLV EHDXEFFHFR YBHREFRFDL SRIPEGEAVT AABFRZYKDY 180 

IRERFD27ETP RI6VYQVI1QE HLGRESDLFL LDSRTLHASE EGHLVFDITA T8NHHWNFR 240 

KKLGLQLSVE TLDQQSINPK IiAOLIORHGP QNKQPFMVAF FKATEVKFRS ZRSTGSKQRS 300 

(^mSXTPXMQ BAXiRMANVAB HSfiGDQSQAC KKHBLYVSFR CLGWQSWIZA PEGVAAYYCB 360 

GECAFPUNSY MKATNHAZVQ TLVHFZNPET VPKPCXIAPTQ LNAZSVLYFD DSSNVILEOnr 420 
RNMWRACGC H 



A126 SNA SEQUENCE 
Gene namet 
U&igene nuinber: 
Probeset Accession #s 
Nucleic Acid Accession §t 
Coding sequence 1 



1 
I 

CCCAOAGCOO 
CTGCOQACTT 
QTTQQCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
T6TCCGGGAG 
TG6CCGTCQA 
QGCOOTAOGO 
CCGAGCGGCO 
GGCCCGGAGG 
GGGO OOGO CT 
TCTGCCTQCA 



integrin, beta 8 
Hs. 52620 
AA479726 
NM_002214 

680-2990 (underlined sequences correspond to start and stc^ codons) 




T06CCC6GGA 
OTTTTGCATT 
AAACOACOGG 



41 

i 

CTCCCTTGCC 
GCAAAGCTGC 
TGATGC6CCA 
ATCACCCAOT 
TQATTOTOTT 
GAAACAAAAG 
TCCAOTCGCC 

ccGOOGrracc 

GCTTACCTGC 
CCTAOOQACA 
GGCAGCCAGQ 
TTTTTTTACC 
CTGG0CA6CC 



51 

I 

AGCCAGGACG 
AACTAATGGT 
CAQACTTTTT 
GAAT6TACAT 
TGGCTCTTOG 
CTCTTTTCTT 
GCGGG60CCT 
GAGCOGGQAO 
ACCGCTTGCT 
CT0QCCCG06 
0GGCG06CGC 
GCTGCATTTO 
TCGOTOTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



890 



wo 03/042661 



PCTAJS02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CACTTGTTCT TGGACTGGGC CAAQGTGAAO ACAATAQATG TGCATCTTCA AATGCAGCAT 840 

CXTTCTGCXIAQ GTOCCTTGOG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAOSATTTCA 900 

TTTCAGGTGQ ATCWU3AAGT GAACXSTTGTG ATATTCTTTC C3UKTTTAATA AQCAAAG6CT 960 

GCTCAGTTQA TTCAATA6AA TACCCATCTG TGCATGTTAT AATACCCACT QAAAATGAAA 1020 

TTAATACCCA GGT6ACAGCA GGAGAA6T6T CTATCCA6CT GOGTCCAGGA OCOGAAGCTA 1080 

ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTOT GGATCTTTAT TATCUXJTltJ 1140 

ATGTCTCAGC ATCAATGCAC AATAATATAQ AAAAATTAAA TTCCGTTGQA AAOQATTTAT 1200 

CTAGAAAAAT GGCATTTTTC TCCOGTOACT TTOGTCTTGO ATTTGGCTCA TAOQTTQATA 1260 

AAACAGTTTC ACCATACATT AGCATCCACC COSAAAGGAT TCATAATCAA TGCAGTGACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT AC3VTCCATGT GCTGTCTTTO ACAGAOAACA 1380 

TCACTGAGTT TGAGAAAGCA OTTCATAGAC AOAAOATCTC TGGAAACATA OATACACCAQ 1440 

AAGGAOGrrr TGAOGCCATG CTTCAGGCAa CTGTCTGTGA AAGTCATATC GGATGGGQAA 1500 

AAQAGOCTAA AAGATTGCTG CTGQTGATGA CAOATCAGAC GTCTCATCTC GCTCTTOATA 1560 

GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGQAAACTG TCATCTGAAA AACAAOGTCT 1620 

AOGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAQAG AAATTAATAG 1680 

ACAACAACAT TAAT GTCATC TTTGCAOTTC AAOQAAAACA ATTTGATTGG TATAA GGATC 1740 

ttctacccgt cttgccagqc accattgctg gtgaaataga atcaaag6ct gcaaacctca 1800 

ATAATTTGQT AQTGQAAGCC TATCAGAAGC TCATTTCAOA AOTQAAAGTT CAGGTGGAAA I860 

ACCAGGTACA AGGCATCTAT TTTAACATTA COQCCATCTG TCCAGATGGQ TCCAGAAAGC 1920 

CA66CATGGA AGQATGCAGA AACOTGAOQA GCAATGATGA AOTTCrTTTC AAT6TAACA0 1980 

TTACAATGAA AAAATGTGAT QTCACAGQAa GAAAAAACTA TGCAATAATC AAACCTATTQ 2040 

G TTTTAATGA AACCGCTAAA ATTCATATAC ACAQAAACTG CAGCICnCAG TGrOAOGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCr AGATTCXZAAO T6TTTCCAGT 2160 

GTQATGAGAA TAAATQTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAOQATCA GCCTGTTTGC AOTOGTOGAG GAGTTTaTQT TTQTGGQAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GItSTATOGAA AATACTOTGA AAAGGATGAC TTTTCTTGTC 2340 

CATATCACXA TGOAAATCTG TGTGCTGQGC ATOQAOAOTO TOAAGCAGGC AOATOCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA OAGGCAOQTO TGTGTGTGGA AGGTGTGAGT 2520 

GCACOSATCC CAQGAGCATC GGCGGCTTCT 6T6AACACTG CCCCACCT6T TATACAGCCT 2580 

6CAAGQAAAA CTGGAATTGT ATGCAATGCC TTOVCCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGT6 CAAAACCTCA TGTGCTCTCA TGGAACAACA 6CATTATGTC 6ACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 

TCTTGATTGQ GTT6CTTAAA QTCCTGATCA TTAGACAGGT GATACTACAA TOGAATAGTA 2820 

ATAAAATTAA GTCCTCATCA 6ATTACAGAG TGTCA6CCTC AAAAAAGGAT AAGTTGATTC 2880 

T6CAAAGTGT TTGCACAAGA GCAGTCACCT AC08A0GTOA GAAGCCTGAA QAAATAAAAA 2940 

T6GATATCA0 CAAATTAAAT GCTCAT6AAA CTTTC3U3GTG CAACIT CTAA AAAAAGATTT 3000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AQGAGACAAA TTGCTCACGG TCATGCCAGT TGCTOOTTGT ACACTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAAT6TOTC TTACTACTGT TTQAGACTAO TGTCOTTGTA OCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTQAAGAO CACTOATTAC ACTTTACAGG 3300 

TACCTGTTAT CCCTACXSCTT CCOW5AGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTQAG GAAAAAAATA ATCTGGCAAO 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT 0ATTG6T6TT TCACTCTTTC AAOMXTTQAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCITTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA QATGACTGQA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAOCT 3780 
GAAT6TTAA 



A127 Protein sequence t 
Gene name { 
UAigene nunibert 
Probeset Aceesslon #i 

Protein Acceseion #: 
Signal seijuencei 
Transmembrane docnalns: 



Cellular Looalization: 



Integrin« beta 8 

HS. 52620 

AA479726 

NP_002205.l 

1-39 

682-704 

S53>S84 

54-469 

plasma membrane 



1 
1 

MOGSAIiAFFT 
LGPECXSWCVQ 
GBV8ZQLRFQ 
SRDFRLOFOS 
VHRQKISGNI 
VPNDGNC31LR 
TIAGBIESKA 
NVTSNDEVLF 
CVDETPU}SK 
VYGKYCBKDD 
CSGRGTCVC3G 
CAIMBQQHYV 
DYRV8ASKKD 



11 
I 

AAFVCLQNDR 
EDFISGGSRS 
ABANFMLKVH 
YVDKTVSPYl 
DTPEGGFDAM 
NNVYVKSTTM 
AMUIMLWEA 
NVTVTMKKCD 
CFQCDE3nCCH 
FSCPYHHGNIi 
RCECTDPRSI 
DQT8ECF88P 
KLILQSVCTR 



21 
I 

ROPASFLWAA 
ERCSIVSNLI 
PLKKSrPVDLY 
SZKPBRIHKQ 
LQAAVCESHI 
EUPSLGQLSB 
YQKLISBVKV 
VTOGKNYAIX 
FDEDQF88B8 
CAGHOECBAG 
GRPCBHCPTC 
SYLRIFFIIF 
AVTYRREKPB 



31 

1 

WVFSLVLQLG 
SKGC8VDSIE 
YLVDVSASMH 

GWRKEAKRLL 
KLIDMKINVI 
QVaiQVQGIY 
KPIOFIIBTAX 
CKSHKDQPVC 
RCQCFSGHEG 
YTACmiWNC 
IVTFLIGLLK 
BIKMDISKLN 



41 

I 

QGEimCASS 
YPSVHVIIPT 
NNZBKU78VO 
PKOYXHVLSL 
LVHItX3TSHL 
FAVQGKQFHW 
FNITAICPXXS 
IHZHRMCSOQ 
S6RGVCV0GK 
DRCQCPSAAA 
MQCLHPHNLS 
VLIIRQVILQ 
AaETFRCHF 



51 
I 

NAASCARCLA 
EKEINTQVTP 
MDIi9 RKMAP P 
TENITEFEKA 

ALOSKLAGIV 
YKDLLPIiLPG 
SRKPGMEGCR 



C8CBKZKLGK 
QHCVNSKGQV 
QAILDQCKTS 
NNSNRIKSSS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



A12a PHA SEQUENCE 

Gene namei a protein-coupled receptor 64 



891 



wo 03/042661 



PCT/US02/36810 



unigene nunben Be. 184942 

PrcibesQt Accession 9t AA43S577 

Nucleic Acid Accession ft% KN_005756 

Coding sequence: 73-3117 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

11)111 

AGCCAGCCCG AGGACGCXSAQ OGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTQAA 60 
CTOGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGQCC ATGrTGGCAG AACTGAAGAA 120 
CTTTTACTGA OGTTCAAOAT ATTOCTTGTC ATCMTTOTC TTCATGTOOT TCTGGTAACA 160 
TCCCTGGAA6 AAOATACTGA TAATTCCAGT TT6TCACCAC CACCTGCTAA ATTATCT6TT 240 
GTCAOTTTTO CCCCCTCCTC CAATQAGQTT QAAACAACAA GCCTCAATGA TOTTACTTTA 300 
AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 360 
0CTTCA6GCG TCAAAOXCA GAOAAATATC T6CAATTTGT CATCTATTTG CAAT6ACTCA 420 
GCATTTTTTA GAGGT6AGA7 CAT G TTTCAA TATGATAAAO AAA6CACT6T TOCOCAGAAT 480 ' 
CAACATATAA C6AAT6GCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAAGQCTCA 540 
QAGCTCAACA AAACCCTGCA AACCCTAAGT GAQACTTACT TTATAATGTG TGCTACAGCA 600 
GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATQCA 660 
TGTGCTQCAA TAGCCGCTTT 6QAAAQA0TA AAGATTCGAC CAATGGAACA CTGCTGCTGT 720 
TCTGTCAOGA TACCCTG C CC TTCCTCCCCA GAAGAGTTOG GAAAOCTTCA GTGTGACCTQ 780 
CA6GATCCCA TTGTCTGTCT TGCTGACCAT CCA0GT6GCC CACCATTTTC TTCCAGCCAA 840 
TCCATCCCAG TG6T6CCT00 GGCCACTGTG CTTTCCCAGG .TCCCCAAAGC TACCTCTTTT 900 
GCTGAGCCTC CAGATTATTC ACCTQTGACC CACAATGTTC CCTCTCCAAT AGGGQAGATT 960 
CAAOCCCTTT CACCCCAOCC TTCAGCTCCC ATAGCTTCCA GCCCT6CCAT TGACATGCCC 1020 
CCACAGTCTO AAAGOATCTC TTCCCCTATG CCCCAAACCC ATGTGT0CX3Q CACCCCACCT 1080 
CCTOTGAAAG CCTCATTTTC CTCTCOCACC GTGTCrOCCS: CIOCaAATOT CAACACTACC 1140 
AG06CACCTC CTGTCCAQAC AGACATOGTC AACACCAGCA GTATTTCTQA TCTTGA6AAC 1200 
CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAOCC TG6AGCCTAA CCTCOCAGGA 1260 
6AAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCC6C CT6ACATGCT G6CCCCTCTG 1320 
GCTCAAAOAT TGCTGAAAGT AGTGGAT8AC ATTGGCCTAC AGCTGAACTT TTCAAACAOG 1380 
ACTATAAOTC TAACCTCCCC TTCTTTOOCr CTQQCTGTGA TCAGAGTGAA TGCCAGTAGT 1440 
TTCAACACAA CTACCTTTGT GGCXXAAGAC CCTGCAAATC TTCAGOTTTC TCTGQAAACC 1500 
CAAQCTCCTO AGAACAQTAT TGGCACAATT ACTCTTCCTT CATOGCTGAT GAATAATTTA 1B60 
CCAGCTCATG ACATGGAGCT AGCTTCCAGO GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 
TTGTTTCAaG ATCCTTCCCT GQAOAACCTC TCTCTOATCA QCTAGOTCAT ATCATCGAOT 1680 
GTTGCAAACC TSACXXJTCAO GAACTTGACA AGAAACGTGA CAOTCACATT AAAGCACATC 1740 
AACXXX5AGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800 
GGCAGAGGAG GCTGGTCAGA CAATGGCT6C TCTGTCAAAG ACAGGAGATT GAATGAAACC 1860 
ATCrGTACCT GTAGCXyVTCT AACAA6CTTC GQCQTTCrGC TCGACX TTATC TAGO ACATC T 1920 
GT6CTGCCTG CTCAAATGAT GQCTCTOAOQ TTCATTACAT ATATTGOTTG TG6GCTTTCA 1980 
TC3WVTTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT OOGGAGGQAT 2040 
TACCCTTCCA AAATCCTCAT CCAGCTGTGT QCTQCTCTGC TTCTGCTGAA CCTGQTCTTC 2100 
CTCCTGGACT C6TGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGQCTGTA 2160 
TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG 6CCTA6AA6C ATTCCATATG 2220 
TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCOQAA AATACATCCT TAAATTCTGC 2280 
ATTGTOGGTT GGGGGGTACC A6CTGTGGTT 6TGACCATCA TCCT6ACTAT ATCCCCAGAT 2340 
AACTATGGGC TTGQATCCTA TCGGAAATTC CCCAATQQTT CACOGGATGA CTTCTGCTGG 2400 
ATCAACAACA ATGCAGTATT CTACATTAOG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460 
CTGAAOQTCA GCATGTTCAT TGTGGTCCTG OTTCAQCTCT GTOQAATTAA AAAGAAGAA6 2520 
CAACTGGOAG CCCAGOGAAA AACCAGTATT CAAQACCTCA GGAGTATCGC TQGCCTTACA 2580 
TTTTTACTGG GAATAACTTG GGGCTTTGOC TTCTTTGCCT GGGGACCAOT TAAOQTQACX: 2640 
TTCATGTATC TQTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 
TGTGTGGCCA AAGAAAATGT CAGGAAQCAA TGGAGGCGGT ATCTTTGTTG TGGAAAGTTA 2760 
OGGCTGGCTG AAAATTCTGA CTGQAQTAAA AC7GC7ACTA ATGGTTTAAA GAA6CAGACT 2820 
OTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCA6TAA CTCCACTAAC 2880 
TCCACCACAC TQCTAGTaAA TAATGATTQC TCAGTACACQ CAAGOGGGAA TGGAAATGCT 2940 
TCTACRGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCAOGAT 3000 
TTCACXGGAA AACAGCACAT GTTTAAC3SA3 AAGGAAGATT CCTGCAATGG GAAAGGC06T 3060 
ATGGCTCTCA GAAGGACTTC AAAGOGGGGA AGCTTACACT TTATTGAGCA AATG^GATTC 3120 
CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCC3U^TT TTACCTTTTA 3160 
CACAATGTQA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 
AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300 
TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 
ATACACTAAG AATGACTCCT ATAAAGAAAA CRAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 
TTTAAAGAGG CTAAGTTATC TTTGATAACA TOVTATAAAG CAACTGTTQA CTTCAGCCTG 3480 
TTGGTGAGTT TAGTTGTQCA TGC C m G TT GTATATAAGC TAAATTCTAO TGACCCATGT 3540 
GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTOTGTAA ATGTATTCCT 3600 
TTQTA6AATC ATGGTTGTTT TGTCTCACOT OATAATTCAG AAAATCCTTG CTCGTTCCGC 3660 
AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 
TCAAGAAATA ATGATOCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 37B0 
TCATACAGTG CCTTTGAGCA AQTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840 
TOGGTCATGG TTTOACAAAC AaAOTBAGAO ACCATATTTT AGCCCCACTC ACCCTCTTGG 3900 
GT0CA03ACC TOTACAOCCA AACACA6CAT CCAATATGAA TACCCATCCC CTGACOGCAT 3960 
CCGCAGTAGT CAGATTATAG AATCTGCACC AAGATOTTTA GCTTTATACC TTGQCCACAG 4020 
AGAOGGATGA ACTGTCATCC AGACCATGTG TCAGQAAAAT TGTGAA06TA GATGAGGTAC 4080 
ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AG6AGAGTAG ACTAGGATTC 4140 
CTTCTCTTAA AAAOGTACAT ATATATGGAA AAAAATCATA TTGCOGTTCT TTAAAAGGCA 4200 
ACTOCATGGT ACATT6TTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AQAGCTATAA 4260 
TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT ATTGGGAACA 4320 
GGGAACTGTC CTACACTGCT ATTGTTOCrA CATGTATCGA GCCTTGATTG CTCCrAOTTA 4380 
TATACAOGOrr CTATCTT6CT TCCTACCTAC ATCTGCTTOA GCAOTGCCTC AAGTACATOC 4440 
TTATTAGQAA CATTTCAAAC CCCTTTTAGT TAA6TCTTTC ACTAAGGTTC TCTTGCATAT 4500 
ATTTCAAGT6 AATGTTGGAT CTOU3ACTAA CCATAOTAAT AATACACATT TCTGTOAOTG 4560 
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CTGACTT6TC TTT6CAATAT TTCTTTTCTQ ATTTATTTAA TTTTCTTOTA TTTATATGTT 4620 
AAAATCAAAA ATQTTAAAAT CAATGAAATA AATTTGCMST TAAGA 

A129 Protein sequence 

Gene name: O protein- coupled receptor 64 

Unigeae number: Hs. 184 942 

Protein Accession #i 1IP_005747.1 

Signal sequences 1-38 ~ 

GPS domain 1 564-615 

Transmenibrane domains: 624-646, 660-682, 688-710, 733-755, 783-805, 826-850, 858-880 

Cellular Localization t plasma menibrane 



NVFSVRQCX3H 
PSSNEVETTS 
GBI MFOYD KB 
TUICTFTIKL 
VCLADHFRGP 
PQPSAPZASS 
VQTDIVNTSS 
LKWDDIGLQ 
NSIOTITIiPS 
TVRNLTRNVT 
8HLTSFGVLL 
ILIQLCAALI, 
VKVPNTYIHK 
AVPYITWGY 
ITWGPAFFAW 
NSDWSKTATN 
NGVSPSVQNQ 



11 

I 

VGRTEEVItLT 
U3DVTLSLLP 
STVPQNQHIT 
miTMIIACAAI 
PPSSSQSIPV 
PAIDMPPQSE 
ISDLEKQVLQ 
UJPSNTTISL 
8IMNNLPARD 
VTLKHINPSQ 
DLSRTSVLPA 
LUILVFLLDS 
YILKPCIVGW 
FCVIFUiNVS 
GPVNVTFMYL 
GLKKQTVNQG 
DVCLHDFTGK 



21 

I 

FKIPLVIICL 
SNETBKTKIT 
NOTLTGVZfSL 
AAIiERVKIRP 
VPRATVLSQV 
TISSPMPQTH 
MEXAX.SLGSL 
TSPSLALAVI 
MELASRVQFN 
DELTVRCVFW 
QMMALTFITY 
WIALYKMQGL 
GVPAVWTII 
MPIWLVQIiC 
FAIPNTLQGF 
VSSSSNSIiQS 



31 



HWLVTSLEE 
IVKTFNASGV 



MEHCCCSVRI 
PKATSFAEPP 
VSGTPPPVKA 
BPNLAGEMIK 
RVNASSFNTT 
PFBTPALFQD 
DLGRNGGRGG 
IGCGLSSIFL 
CISVAVPLHY 
LTISPDNYGL 
RXKKKKQLGA 
PIPIFYCVAK 
SSNSTN8TTL 
C3IGK6RMALR 



41 

I 

DTDNSSLSPP 
KPQRNIQ7LS 
TLQTLSBTyp 
PCPSSPBELG 
DYSPVTHNVP 
SFSSPTVSAP 
QVSRLLHSPP 
TFVAQDPANL 
PSLEKL8LZS 
HSDNGC8VKD 
SVTLVTYIAF 
PLLVSPTWMG 
GSYGKFPNGS 
QRKTSIQDLR 
ENVRKQWRRY 
LVNMDCSVKA 



51 

I 

PAKLSWSFA 
SIOTDSAPFR 
IMCATAEAQS 
KLQCDIiQDPI 
8PIGBIQPLS 
ANVNTTSAPP 
DMLAPLAQRL 
OVSLBTQAPB 
YVISSSVANL 
RRLNBTICTC 
BKIRRDYPSK 
LSAPRMYLAL 
PDDFCWINNN 
8IAGLTFLLG 
tiCCGXLRLAB 
S6NGNASTER 
ZBQM 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



A130 DNA SEQUENCE 

Gene namet G protein-coupled receptor 49 (OPR49) (EG38) (LGR5) 
Unigene nunibers Ks. 2 8 552 9 

Probeset Accession #: AA460530 
Nucleic Acid Accession fft NM_003667 

Coding sequence: 201-2924 {underlined sequences correspond to start and stop codons) 



GTG605GCAA 
TCAGQAACGC 
CGCC0QG08C 
GGCGCOCTAC 
TGCTGCTGCA 
GCCCCACACA 
ACCTGGGGCT 
GXATGAACAA 
AGTTAOQTCT 
ACAGTCTTAA 
TGCAGAATTT 
CCCCAAGCTG 
TAACAOAAAT 
CCCTGAACAA 
TTCTACATCT 
ACAGCCTAGA 
GGACACTCTC 
AGAAAGCATT 
AATTTGTTGG 
GTGCCTCACA 
CTTTAACTGG 
TCCAAOTGCT 
AAAAGCTTCA 
TCCAGCAGTT 
ACCCCAATGC 
TGTCGTCTTT 
ATGOCTTACA 
CTTATGCTTA 
AATGQAATAA 
TTCAGGCTCA 
AAGCCCTTCA 
TGCTTGATGG 
GTAATGCTTT 
TOTTAATTGO 
CTGGTGTGGA 
GG0TTG6TTG 
TGCTTACTCT 
GGAAAGCTOC 
CCATGGCOGC 

CTTTOCcrn* 

COCT T T O CT T 



11 

1 

CCGGCACCTC 
GGCGTCTGGC 
GCCA OGGCOC 
TTOGGGCACC 
GCTGGCGACC 
CTQTCATTGC 
CTCGGAGCTG 
CATCAOTCAG 
TGOGGGAAAC 
AGTTCTTATG 
GOQAAQCCTT 
TTTCAGTGGC 
CCCGGTCCAO 
AATACAOCAC 
CCATAACAAT 
GACTTTAOAT 
CAACCTTAAA 
TOTAOGCAAC 
GAGATCTGCT 
AATAACT6AA 
AGCACAGATC 
AGATCTGTCT 
GAAAATTGAC 
GCTTAGCCTC 
ATTTTCCACT 
TCCTATAACT 
GAGCTTGATA 
CCAGTGCTGT 
AGGTGACAAC 
AGATGAACOT 
TTCAGTGCAG 
CTGGCTGATC 

ggtgacttca 
ggtcatcgca 
tgcgttcact 
ccatgtcatt 
ggcagccctg 

ATTTTCTA6C 
AGTTCCCCTG 
TGGGQAGCCC 
OCTCATQATO 



21 
1 

AGTCCCOSCC 
GCTGCAGAOG 
OTAGCAGTCC 
ATGGACaCCT 
GGGGGCAGCT 
GAGCCOGACG 
CCTTCCAACC 
CTGCTGCOQA 
GCTCTGACAT 
CTGCAGAATA 
CAATCCCTGC 
CTGCATTCCC 
GCTTTTAQAA 
ATACCAGACT 
AGAATCCACT 
TTAAATTftCA 
GAACTAGGAT 
CCTTCTCTTA 
TTTCAACATT 
TTTCCTGATT 
TCATCTCTTC 
TACAACCTAT 
CTAAGACATA 
CGATCGCTGA 
TTGCCATCCC 
GGGTTACATG 
TCATCTGAAA 
GCATTTGGAG 
A6CAGTATGG 
OACCTTGAAG 
TGTTCACCTT 
AGAATTGGAO 
ACAGTTTTai 
6CAGTGAACA 
TTTGGCAGCT 
GGTTTTTTQT 
GAGCGTGGGT 
CTGAAAGTAA 
CTGGGTaOCA 
AGCACCA3GG 
ACCATTQCCT 



31 
1 

GCGCTTCTCC 
CCCGCTGAGT 
GGTGCTGCTC 
CCCGGCT06G 
CTCCCAGOTC 
GCAGGATGTT 
TCAGOGTCTT 
ATCCX2CTGGC 
ACATTCCCAA 
ATCAGCTAAG 
GTCTQQATGC 
TGA60CACCT 
GTTTATCG6C 
ATGCCTTTGG 
CCCT6GGAAA 
ATAACCTTGA 
TTCATAGCAA 
TTACAATACA 
TACCTGAACT 
TAACTGGAAC 
CTCAAACCOT 
TAGAAGATTT 
ATGAAATCTA 
ATTTGGCTTG 
TAATAAAGCT 
GTTTAACTCA 
ACTTTCCAGA 
TGTOTQAGAA 
AOQACCTTCA 
ATTTCCTX3CT 
CCCCAGGCCC 
TGTGGACCAT 
GATOCCCTCT 
TGCTCAOGGO 
TTGCACQACA 
CCATTTTTGC 
TCTCTQTQAA 
TCATTTTQCT 
OCAAQTATGa 
GCTACATQGT 
ACACCAAGCT 



41 

1 

TOGCCGCCCA 
TGCAGAAGCC 
TCX3GCC00GG 
TGTGCTCCT6 
TGGTGTGTTG 
6CTCAGGGTG 
CACCTCCXAC 
CA0TCTC06C 
GGGA6CATTC 
ACACGTACCC 
TAACCACATC 
QTGGCTGGAT 
ATTGCAAGCC 
AAACCTCTOC 
GAAATGCTTT 
TGAATTCCCC 
CAATATCAGG 
TTTCTATGAC 
AAGAACACTG 
TGCAAACCTG 
CTGCAATCAG 
ACCCAGTTTT 
0QAAAT7AAA 
GAACAAAATT 
GGACCTATCG 
CTTAAAATTA 
ACTCAAGGTT 
7GCCTATAA0 
TAAGAAAGAT 
TGACTTTQAG 
CTTCAAACCC 
AGCAGT7CTG 
GTACATTTCC 
AGTCTCCAGT 
TGGTGCCTGG 
TTCAGAATCA 
ATATTCXGCA 
CTOT G CCCT G 
G60CTCCCCT 
08CTCTCATC 
CTACTGCAAT 



51 
1 

CGCCGTGGGG 
CAGGQAGGGG 
TCOOGCTGQT 
TCCTTGCCTO 
CTGAGGGGCT 
GACTGCTCOG 
CTAGACCTCA 
TTOCTGGAOG 
ACT6GCCTTT 
ACAGAAGCTC 
A6CTATGTGC 
GACAATGCGT 
ATGACCTTG6 
AGCTTGGTA0 
GATGGGCTCC 
ACTGCAATTA 
TOGATACCTG 
AATOCCATCG 
ACTCTGAATG 
GA6A6TCTGA 
TTACCTAATC 
TCAGTCTGCC 
GTTGACACTT 
GCTATTATTC 
TCCAACCrCC 
ACAGGAAATC 
ATAGAAATGC 
ATTTCTAATC 
GCTGGAATGT 
GAAGACCTGA 
TGTGAACACC 
GCACTTACTT 
OC CATTA AAC 
GCCGTGCTGG 
TGGGAGAATG 

AAATT TGAAA 
CIGGCCTTOA 
CTCTGGCTGC 
TTGCTCAATT 
TTGGACAA60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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GAOACCTGGA GAATATTTGG OACTGCTCTA TGGTAAAACA CATTGCCCTQ TTQCTCTTCA 2520 

CCAACTGCAT CCTAAACTGC CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA 2580 

CATTTATCAO TGCTQAAGTA ATTAAOTTTA TCCTTCTOGT G6TAGTGCCA CTTCCTGCAT 2640 

GTCTCAATCC CCTTCTCTAC ATCTTGTtCA ATCCTCACTT T3VAGGAGQAT CTGGTGA60C 2700 

TCSAOAAAGCA AACCTAOGTC TGGACAAGAT CAAAACACCC AAOCTTQATQ TGAATTAACT 2760 

CTGATQATGT CGAAAAACAG TCCTGTQACT CAACTCAAGC CTTGGTAACC TTTACCAGCT 2B20 

CCAGCATCAC TTATGACCTG CCTCCCAGTT CCGTGCCATC ACCAOCTTAT CCAGTGACTG 2880 

AGA6CT0CCA TCTTTCCTCT GTOGCATTTG TCCCATGTCT CTAATTAATA TGTGAAGGAA 2940 

AATGTTTTCA AAOGTTGAGA AOCIGAAAAT GTOAGATTGA QTATATCAGA GCA6TAATTA 3000 
ATAAOAAGAG CT6AGGTGAA ACTOOOrTTA AA 



A131 Protein sequence 
Gene nanei 
lAiigene nuinlber: 
Protein Accession #: 
Signal sequence { 
Transraeinbrane domains t 
Cellular Localization i 



6 protein-coupled receptor 49 (GPR49) (HG38) (L6R5) 
He. 285529 

NP_0036S8.1 

1-22 

557-579, 596-618, 641-663, 687-709, 724-746, 770-792, 803-825 
plasma membrane 



1 11 21 31 41 51 

t I I I I 1 

MXSTSRLGVLL SLFVLLQIAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY IjDLSMNNISQ LLPNPLPSLR PLEBLRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQNNQIiRHVP TEALQNLRSL QSUOiDANUI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180 

AFRSLSAIiQA HTLAIiNKIHH IPDYAFGKIiS SLWLHLHNN RZHSLGKKCP DGLHSLETLD 240 

unnonj>EPP TAIRTLSNLR ELOFKSMNXR SIPBKAPVGN PSLITIHFYO NPIQFVGRSA 300 

FQHLPELRTL TLNGASQITE FPDLTGTANL BSLTLTGAQI SSLPOTVCKQ LPNLQVLDLS 360 

YNLLEDLPSF SVCQKLQKID LiRHNEIYBZK VDTFQQLLSL RSLNLAVINKI AIIHPNAFST 420 

LPSLIKLDLS SNLLSSFPIT GLHGLTHLKL TOIHALQSLI SSENPPELKV lEMPYAYQCC 480 

AFGVCENAYK I8NQWNKGDN SSMDDLHKKD AC94FQAQDER DLEDFLLDFE EDLKALHSVQ 540 

CSPSPGPFKP CBHLLDGWIjI RIGVWTIAVL ALTCSIALVTS TVFRSPLYZS PIKLLXGVIA 600 

AVNMLTGVSS AVLAGVDAFT FGSFARHGAW WEaiGVGCHVI GFLSIFASES SVFLLTLAAL 660 

ER6PSVKYSA KFETKAPFSS LKVZILLCAL LALTMAAVPL liGGSKYGASP LCLPLPPGEP 720 

STMGYMVALI LUTSLCFLMM TIAYTKLYCN LDKGOLENIW DCSMVKHXAL LLPTNCILNC 780 

PVAFLSFSSL INLTPZSPBV JKFILLVWP LPAGLNPLLY XLFHFHPKED LVSLRKQTYV 840 

NTRSKHPSLM SIHSDDVBKQ SCDSTQALVT FTSSSITYDL PP8SVPSFAY PVTBSCHLSS 900 
VAPVPOj 

A132 PNA SEQUnJCB 

Gene name: parathyroid hormone receptor 2 
Unigene nuniberi Hs. 159499 

Probeset Accession #i U25128 
Nucleic Acid Accession «i im_00S048 

coding sequence I 143-1795 (underlined eequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I ! I I 

GGCGGGTGGC CCOGGCCCGA CCACCCCA6C TGCGCOTCGT TACTGGOCAC AAGTTTGCTC 60 

TGGGCCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGG6C TCTG6AGGAG GGTCCCTGCT 120 

TCTTCCTACA GCOGTTCCGG GCATGGCOGG GCTGGGGGCG TCGCTCCA06 TCTGGGGTTG 180 

GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGOCCAGCTG GATTCTGAT6 6CACCATTAC 240 

TATAGAGGAG CAQATT6TCC TTSTGCTQAA AGOQAAAGTA CAATGTGAAC TCAACATCAC 300 

AGCTCAACTC CAGGAGG6AG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 360 

GCCCAGAGGA ACAGTGGGGA AAATATOOGC TGTTOCATGC CCTCCTTATA TTTATGACTT 420 

CAACCATAAA GGAGTTGCrr TCOSACACTQ TAACOCCAAT GOAACATGOa ATTTTATGCA 480 

CAGCTTAAAT AAAACATGGG CCAATTATTC AGACT6CCTT CGCTTTCTGC AGCCAGATAT 540 

CAGCATAGGA AA8CAAQAAT TCTTTQAACG CCTCTATGTA ATGTATACCQ TTG6CTACTC 600 

CATCTCTTTT GOTi'CCriXJ O CTGTOGCTAT TCTCATCATT GGTTACTTCA GAOGATTGCA 660 

TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GA6CTACAA0 720 

CATCTTTGTC AAAGACAGAG TA6TCCAT6C TCACATAGGA OTAAAGGAGC T6GA6TCCCT 780 

AATAATGCAG GATGAGCCAC AAAATTCCAT TGAGGCAACT TCTGTGGACA AATCACAATA 840 

TATGGGGTGC AAQATTSCTQ TTOTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTO 900 

GATCCT6GTG GAAGGTCTCT ACCTGCATAA TCTCATCTTT GTG6CTTTCT TTTGGGACAC 960 

CAAATACCTG TOOOOCTTCA TCTTGATAGG CXXXXKSGTTT CCRGCAGCAT TTGTTGCAGC 1020 

ATGGGCTGTG GCACGAGCAA CTCTGGCT6A TGCQAGGTGC TGGGAACTTA GTOCTGGAGA 1080 

CATCAAOTGG ATTTATCAAG CACGGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 1140 

TCT6AATA0G GTTAGAGTTC TAGCTAGGAA AATCTOGQAO AGCAATQCAO TTGGOCATOA 1200 

CACAA66AAG CAATACAGGA AACTGGCCAA ATCQAGACTG GTCCTQGTCC TAGT C TTTO Q 1260 

AGTGCATTAC ATCGTGTTOG TATGCCTGCC TCACTCCTTC ACTQGGCTCG GQTGGGAGAT 1320 

CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 1380 

CTQ CTAC TOC AATGGAGAGG TTCA6GCAGA OGTGAAGAAO ATGTGGAGTC G6TGGAATCT 1440 

CTOCGTGQAC TGOAAAAOOA CAOOOOCATO TQGCASCCOC AGATGOOQCT CAGTGCTCAC 1500 

CACOGTQAOS CACAGCAOCA GCAGCCAGTC ACAGGTOGOS OCCAGCAGAC .GCATGGT6CT 1560 

TATCTCTQGC AAAGCTGCCA AGATOQCCAG CAGACAOCCT QACAGCCACA TCACTTTACC 1620 

TGGCTATOTC TQGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGQAGAC 1680 

CAAGQAAGAT AGTGGGAGGC AGGGAQATGA TATTCTAATO 6AGAAGGCTT CCAGGCCTAT 1740 

6GAATCTAAC CCAGACACia AAGGATGCCA AGGAGAAACT OAGGATGTTC TCTOAATGGA 1600 

C A TTTOT Q GC TGACTTTCAT GGGCT66TCC AATQOCTGGT TGTQTGAGAG G6CTTGGCIG 1860 

ATACTCCTAT GCTTGAGTTC AAA6GCT6AA AATTCAGTTA AGGTGTTACT TAATAATAGT 1920 

TTTTAOOCTC CATQAATTGG CTOCTSTAAA TACTAAOQAC ATQAAAATOC AAGTSTCAAT 1980 
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GGAGTAGTTT 
GCTCTGTGAT 
GCTGTMSCTT 
ATTTTOCTTT 
ATTTATTTTO 
GATCTAA6AA 
TTATAACAAT 
ACATCCCTTC 
TTCTTTOTAA 
TTGATTTTGT 
TTTGAGCTGT 
T 



ATTACCTTCT 
TGTTCATTTT 
TCTCTCATAT 
TAGAAACTAG 
OCTGTGCATA 
CAAGTACTTO 
TACATGTGTT 
TTTTQAATGa 
ACCATGTCAT 
TTGTAATGTA 
TACTACAT7G 



ATTGGCATCA 
TTTCTGCTAC 
ATATCACCCT 
TATTGTCTTA 
GQAGCAATTA 
CTGGAAAATT 
TTTGG6AACA 

GTGGAAAl^T 
TTTTGATAGC 
TACATOGCAT 



AQTTTTCCTC 
TTTTGGGTAG 
AAATATAATG 
TTTCTTACTT 
GGATCTAAAA 
AGTTGGCTGG 
AGGAAAATTT 
ACCA6CCAGA 
TTCCTCAGTT 
AAATCATGCT 
GT6GGATCAA 



TAAATTAATG 
AAAAAAGATT 
AAGATCTTTT 
TAATOTACTT 
AAATATATGG 
ACATTGATAA 
CTCAAAAAAO 
CCTCAGGTCT 
AGTGAGCTTO 
GQ^TCTATAT 
TTAAAAATTT 



TATGGTATrr 
CAATTOCTTQ 
AGTGTGTATC 
CTATCACTGC 
GAAGATAAAA 
AATAATGCAT 
AATATTTCAC 
TCACTCTTTC 
TGTGTGCAAA 

cmT ivriu 

GTTTTAAAAA 



2040 
2100 
2160 
2230 
2260 
2340 
2400 
2460 
2520 
25B0 
2640 



A133 PROTBIW SEQUEMCB 
Gene name: 
Utaigene numbers 
Protein Accession 8: 
Signal sequence* 
Pfam domain t 
Transmembrane domains: 
Cellular local izatiom 



parathyroid honoone receptor 2 
H8. 159499 

1IP_005039.1 

1-25 

7tm_2 (141-420] 

177~197« 228-250, 253-275. 280-302, 
plasma menbrane 



320-342, 363-3BS, 398-419 



I 

MAGLQASLHV 
GNCFPEWDGL 
NYSDCLRFLQ 
KHXiFVSFMLR 
VMPIYPLATN 
LADARCWBLS 
LAKSTLVLVL 
QA&VKKMHSR 
lAS RQPDS HI 
GOQGETEDVL 



11 
I 

WGHLMLQSCL 
ICWPRGTVOK 
FDISI6KQEF 
AT8ZPVKDRV 
YYWILVEOLY 
AGDIKWIYQA 
VFGVHYIVPV 
WNLSVDWKRT 
TLPGYVW8NS 



21 
I 

IiARAQLDSOG 
ISAVPCPPYI 
FERLYVMYTV 
VHAKX6VKBL 
LRNLIFVAFP 
PILAAIGUTF 
CLPHSFTGLG 
PPOQSRRGQS 
EQDCLFR8FH 



31 
I 

TITIEEQZVL 
YDFMHKGVAF 
6YSI8F6SIA 
B8LXMQDDPQ 
SDTKYLWGFI 
ILFLNTVRVL 
HBIHMHCELP 
VLTTVTH8TS 
BETKBD8GRQ 



41 

I 

VLKAKVQCBL 
RHCKPNGTWD 
VAILIIGYFR 
H8IBATSVDK 
LIGW6FPAAF 
ATKIWETKAV 
PNSFQGPFVS 
SQSQVAASTR 
GDDILMBRP8 



51 




1 

NITAQLQEQB 


60 


FMHSLNKTWA 


120 


RLHCTKNYIH 


180 


6QYI6CKXAV 


240 


VAAHAVARAT 


300 


GHDTRKOYRK 


360 


IIYCYCNGEV 


420 


MVLI86KAAK 


480 




540 



A134 DNA sequence 
Gene name: 
Unigene number i 
Probeset Accession Mt 
Nucleic Acid Accession it 
Cbding sequence: 



ESTs 

Hs. 128899 
AA9632S1. 
AA963251 

1-1749 (underlined 



sequences correspond to start and stop codons) 



ATGCTGTCTG 
GGAAAGAAAC 
GAOOOGGAGA 
GGGGAG60G6 
COGCXSGCOGC 
CGCCTGOQTC 
TCOGGAOGAC 
ACTCATAAGG 
GCrCCTGQAC 
COQCGCGGAA 
CTTCTCGQA3 
TGTGGGGCGC 
TGCTGCTQGC 
TGGCTGQAOQ 
GG06AC3GCCA 
GCG06CCTG0 
OG6GCX3GACA 
GGAGACGGCX3 
TCCCCGAAAG 
OGOGGATTCC 
ATCTA06TGC 
CTGGTGGCAG 
GCCCCMSGGG 
OGGGGGTCGT 
GG6GCCCGGG 
AACAA06TGT 
CA6ATTGTGC 
CAOGACTCTG 
AGGCAGATTC 
ACTGTATAAC 
GATTCTOSAG 
GCTTCATTTQ 
CATCCAGGGT 
TTGCTGATGG 
AAGAATZTTA 
TTQTTQTTGA 

• XTrrn T m ' 

TG6TG06ATC 
AOOCTCOCAC 



11 

1 

GCTTCTTGAT 
TTCOGTGGOA 
GGGAGAGCCXt 
AGAAGGGGAA 
C6CCAGCTQG 
CTGGAOGTTC 
AGCCCCGGGG 
CAGTCCCTAA 
CTAGGGCCOQ 
AGCGC06CGG 
ACCGTCCTGC 
TOGCOGCTOG 
TGCQCTGCTG 
COCAOGGOGT 
CCATCTGCTQ 
ACCAGGGOSO 
AA6ACGGGCC 
AGGGTG06CC 
GAAGGCAGCT 
CATCTTCTCC 
OGTTCCTCAT 
CCTGTTGCra 
GTAACCGCTT 
CCTCACGCCA 
CGCXXXXZAAC 
ATQTCAACAT 
CACATCAAGG 
TGCCC3VTGAC 
AGTCCCCCTT 
COAGAGTCAC 
GTGGAAGTCC 
CCCCCAGACT 
ATCATTGATT 
GTGTATAACA 
TAAACTGATA 
CTGCACAGQA 
TTTTTTTTAA 
TG6GCTCACT 
0TGGCT6GGA 



21 
I 

GAGTCCCAGT 
GGCTTCCATC 
GC0GGA6GCT 
CCX3GGGCX3AQ 
GCAGGCTCCC 
CCGGGGOAGG 
GCCTTCTGAC 
GGGGAC06GG 
G0GTGGT06C 
GACAGTCA6T 
GCTCTCTGGA 
TCCGTCTCCT 
GCX3G0GGGGG 
CTGGCGCATC 
OGGCAGCTGC 
CTGCGAGAAT 
C06ACGGCTC 
CCCACCCGTG 
CCTCAGGGCT 
ACGOGGCGGC 
TGTTGGCTCC 
C3U3ATGTCTC 
GATGGAGACC 
GTCCAGCACA 
AAGGTCACAO 
GCCCA06AAT 
GCAGTATCTG 
AGCTGTGCCA 
CCCTCACACC 
TGGT6QQTTC 
GGACATOTOO 
GTATGAAAAC 
TATGATGGAA 
AATGCTTGAG 
AATTAAGGAT 
TCAAAATOCC 
TCAQACAGGG 
GCAACTTCAG 
TTACAGGraC 



31 
I 

ACCCAGCACA 
GGTGOGCACA 
GC0C3GGCTCC 
CCX3CXXX3CCT 
GGGACTGCGG 
GTCCGGTTGC 
TGCATCCOOA 
CCAC06GCTG 
CTCCTGG606 
GACGAGGCCC 
GA060GCTGT 
C!ATCCTGGAA 
0QA6GGCCCA 
GGCTTCCAQT 
GOGTTGCGCT 
GA0CX3CCAGC 
GGCAGGGCTT 
AGGGCCTGGC 
TTCCCGGGGC 
CCCrCTCCCC 
GTGTTTGT06 
0GGCCTAA6C 
ATCOOCATGA 
GCT60CAGTT 
ACCAACTGTT 
TTCTCTGTGC 
CATCCCC9CAT 
CCTTTCATGG 
AACAGTGAAC 
CTTTACTGAA 
GTGQTATTTA 
ATCTCa3AAT 
AACOGGCCTC 
TC06AAGTGC 
TTTTATTATG 
TQTTATCTCC 
TCTTGCTCTG 
CCTCCTG6AT 
CTOCCC CCA T 



41 



GAGCACAGTA 
CCTCCCGAGG 
TGTGGQAC06 
GGATCOSGQC 
CTGGGGGOGC 
CAGTGAAACC 
GATTTCCATC 
AGGACGGGGA 
T06066QU3A 
6GGGGTC6CC 
CCGCGCCCAG 
OGCOGCTTOQ 
OOQGOQAGTA 
GTCCOQAGCG 
ACTGCTGCTC 
AG GGOGCTG G 
CATGTCTTA6 
A00GGT6CTC 
TGCTGCCCCX3 
T6CAGCGGCC 
CCTTTATCAT 
AGGATOCX:CA 
TCCCCAQTGC 
CCAGCT0C31G 
GCTT QCOGGA 
TGAACTGTCA 
AOGTGGGGTA 
A06GCCTGCA 
AGAAGATGTA 
GGGAGAOQAA 
TQGCAOQATT 
TA6CATTTCT 
AOCTGQAGAT 
CCTTGAGATA 
TTGTTATTAT 
CTTTTACTG6 
TTQGCCAGGC 
TCAGGCAACA 
OOCTAATTTT 



51 

I 

CACTCCCGGA 
GCQAGOCAGC 
CGCTGCAGCC 
CCAGCAQCAG 
GCAGGACCCT 
TCXZAGAGGCT 
AGCQAOTGCA 
TGGCTTAGGA 
GGGQAGTGGC 
GGGGCXACX3A 
GGTG6TGCCA 
CTCCTGCAGC 
CTGCCACGGC 
CTTOGAOOQC 
CAG06C06AG 
CGAGCCTGGC 
GGOTACCCAA 
CCCTGAAGGC 
TGCCA6A06C 
CGCCTTGCCC 
CTTGGGOTCC 
GCAGAGG08A 
CA6CACCTOC 
CGCCAACTCC 
AGGGACCATO 
GCAGGOCACC 
GAGGGTGCA6 
GCCTGGCTAC 
CCCAGOGGTG 
GGGAQGGGTG 
CX2TTTGQATQ 
GQATATGTTT 
QACTQTGATG 
TGGTTGAOQA 
TATTTCTTTT 
GACTTTTTTT 
TGGAGTGCAG 
CTOCT G CCTC 
TTOTATTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



895 



wo 03/042661 



PCT/US02/36810 



TGTAGAGATQ GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTQA CXTTCAAGCAA 2400 

TCr O CCTOTC TCAGCCTCCC AAAGTGCTGQ GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA AT6CATCCAA GGTTAAGGGO AAGAOGCAAA TAACAQOACT 2S20 

ATTCTAAAAQ GAAACCTGTT TGAACTCTGT QAOATCAOTC ATCAGTCTCA 6TATTCCACA 2S80 

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT OT GC l l T l t iU 2640 

GGGCCTATTT IXnXiClTrri ' TAOCTTATQT AGAOATCTTA TTACAAAiSTG ATTTTCTACA 2700 

TTAAAAAGAO ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TC3GC5ATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCOCCTTC 2820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTO AGAGGTAATG CATTCTTTTT 2880 

TATATTQAAA TCATAAACTA TCA(XOGCTQ CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TQGTTATGGT TTGGCGTTTC Cn'CTC n TO QTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

QAGTOCAAOT AATTACTATA C T TGTAAATO AAGATCAGTA TTTCTGCCTA GATCTQATAA 3060 

AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAOAAATG TQTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT QAGACTTGGO ATOAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAO 3240 

GATTGTTAAG AQAAAAGCTT TTCAACGAAQ QATTGCCTTT CTTCTCCCAC ACTGTTCTTO 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 

CAAATTCAAG TXSAATTTATT TOTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 

ATGTGCAAQT ACATTTCCAA CTQCTAGCAC AACCAOTATT TTGTAATTAA ACAAATCX3CT 3480 

GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATOGATCAT CTTTCTATTC 3540 

TGTTTCATQA CTQAATAATO TAAAACCAGT OTTOGCAATT GQTATCATCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA 0GG6AAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A13S Protein sequence 
Oene name: 
Unigene number t 
Protein Accession #: 
Signal sequence: 
Transmeihbrane domains t 
Cellular Iiocallzatlont 



BSTo 

Kb. 128899 

none found 

1-11 
402-424 

not determined 



1 11 21 31 41 51 

I 1 I 1 I I 

MLSGFLMSPS TQHRAQYTFG GKKLPHEASZ GABTSRGRGS DRERBSRPBA AGLLHDRAAA 60 

OEAEXGNROE PPANIRAQQQ PRPPPAGQAP QTAAGGAODP RLRPGRSRGR VRLPVKPPEA 120 

SGRQPRGPSD CIPRFP8ASA THKAVPKGTG PPAEDGDGI/3 APGPRARRRR UiGVAAEGSG 180 

PRGKRRGTVS DEARGSPGPR LLGDRPALSO DALSAPRWP CX3ALAARPSP HPGTPLRSCS 240 

CXWLRCHRRG RGPSGBYCHO WLDAQGVWRZ GFQCPERPDG GDATICTGSC ALRYCCSSAE 300 

ARLDQGGOIN ORQQGAaBPQ RADXDGPRRL 6RASCLRGTQ OXSEQAPPPV RAVfQRCSPEO 360 

SPKSRQUiRA FPGLLPRARR RGFP8SPRGG PSPLQRPALP lYVPFLIVGS VFVAFIXLGS 420 

LVAACXXRCL RPKQOPQQSR APGGNRLMET IPMIPSA5TS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN PSVLNCQQAT QIVPHQGQYL HPPYVOVTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPPPHT NSBQKMYPAV TV 



A136 DMA SEQUENCE 

Gene namei select In E (endothelial adhesion molecule 1) 

Unigene nuinberi RS.89S46 

Probeset Accession tt< M24736 

Nucleic Acid Accession «t HM_000450 

Coding sequences 1-1833 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 I I I I 

MGATTQCTT CACAGTTTCT CTCAGCTCTC ACTTTGQtGC TfCTCATTAA AQAGA6TGGA 60 

6CCTG8TCTT ACAACACCTC CAGGGAAGCT ATGACTTATO ATGAGGOCAQ TGCTTATTOI 120 

CA6CAAAGGT ACACACACCT GGTTGCAATT CAAAACAAAO AAGAOATTQA 6TACCTAAAC 180 

TCCATATTGA GCTATTCACC AAGTTATTAC TGGATTGGAA TCAGAAAAC5T CAACAATGTG 240 

TQGGrrCTGGQ TAOOAACCCA GAAACCTCTO ACAQAAGAAa CCAA6AACT6 GGCTCCAGGT 300 

6AAGCCAACA ATAGGCAAAA AGATGAGGAC TGCGTGGAQA TCTACATOVA 6A6AGAAAAA 360 

GATGTGGQCA TOTGOAATGA TGAGAGGTGC AGCAAGAA6A AaCTTOCOCT ATGCTACACA 420 

GCTOCCTOTA CCAATACATC CTGCAGTGGC CAOGGTGAAT GTGTAGAGAC CATCAATAAT 480 

TACACTTGCA AQTGTGACCC TGGCTTCAQT GQACTCAAGT GTGAGCAAAT TGTQAACTQT 540 

ACAGCCCXGG aU^TCCCCIGA GCATGQAAGC CTGGTTTQCA GTCACCCACT GGGAAACTTC 600 

AGCTACAATT CriCCl' GC TC TATCAGCT8T 6ATA0GGGTT AOCIGCOM GAGGATGGAO 660 

ACCATGCAOT GTATGTCCTC TGGAOAATGG AQT6CTCCTA TTCCAGCCTO CftATGTGGTT 720 

GAOTGTGATO CTGTGACAAA TCCAGCCAAT GGGTTOGTGG AATGTTTCCA AAACCCTGGA 780 

AGCTTCCCAT GGAACACAAC CTGTACATTT GACTGTQAAG AAGGATTTQA ACTAATGGGA 840 

G0CGA6A6CC TTCAGTGTAC CTGATCIGGG AATTGGGACA A0GAGAA6CC AAGGTGTAAA 900 

GCTGTGACAT 6CA 0G6 000T CCGCCAGOCT CAGAATGGCT CTOTaAQQTa CAOCCATTCC 960 

CCT G CTGGAG AGTTCACCTT CAAATCATGC T6CAACTTCA GCTGTQAGGA AGGCTTCATO 1020 

TTGCAGGGAC CAGCCCAGGT TGAATGCACC ACTCAAGGGC AGTGGACACA GCAAATCCCA 1080 

GTTTOTGAAQ CTTTCCAGTG CACAGCCTTG TCCAACCCCO AGOGAGGCTA CATGAATTGT 1140 

CTTCCTAGTO CTTCTGGCAO TTTCOGTTAT GGGTCCA6CT GTGAGTTCTC CTGTGAOCAO 1200 

otf rrrimw tgaaggqatc caaaaggctc caatgtggoc ccacagggqa gtcoq acaa c 126o 

GAOAAGCCCA CATOTGAAGC TGIGAGATGC QATGCTGTCC ACCAGCCCOC GAAGGGTTTG 1320 

GTGAGGTGTG CTCATTCCCC TATTGQAGAA TTCACCTACA AOTCCTCTTO TGCCTTCAOC 1380 

TGTGAOaAGG GATTTGAATT ATATGGATCA ACTCAACTTO AGTGCACATC TCAGGGACAA 1440 

tggacagaag AGGTTCCTTC CTGCCAAQTG GTAAAATQTT CAAGCCTGOC AGTTCOGGQA ISOO 

AAOATCAAGA TQAGCIGCAO T0GGGA6CCC GTGTTTGGGft CTOTGTGCAA GTTOGCCTGT 1560 

OCTGAAG6AT GOAOGCTCAA TOGCTCTGCA GCTGGGACAT GTGGAGCCAC AG6ACACTGQ 1620 

TCTGG0CT6C TACCTACCTG TGAAGCTOCC ACTGAOTCCA ACATTCCCTT GGTAaCTGClA 1680 

t ' lT ' l 'C IG Cl G CTGOACTCTC CCTOCTGACA TTAGCACCAT ri-CTOCTCIQ CCTTOGGAAA 1740 



896 



wo 03/042661 



PCTAJS02/36810 



TGCTTAaSOA AAGCAAA6AA ATTTOTTGCT GOCAOCAGCT GCCAAAGCCT TGAATCAGAC IfiOO 
GGAAGCTAOC AAAAGCCTTC TTACATCCTT TAA 



A137 Protein sequence: 
Gene natnei 
Unlgene Humbert 
Probeset Accession Us 
Protein Accession fts 
Signal se^encet 
Transniendbrane domains t 
C- lectin domain: 
Cellular Localization: 



8e lectin B (endothelial 

BS. 89546 

M24736 

NP_000441 

1-22 

555-573 

23-139 

plasma menibrane 



adhesion molecule 1) 



1 11 21 31 41 51 

) 1 I I I i 

MIA8QFLSAL TLVLLIKES6 AW8YNTSTBA MTYDEASAYC QQRYTHLVAZ QNKBBIEYLN 60 

SZLSYSPSYY WIGIRKVNNV WVWVGTQKPL TBEAKNWAPG EPKNRQKDED CVEIYIKREK 120 

DVGMWNDERC SKIOCLALCyT AACTNTSCSO HGECVETINN YTCKCDPGPS GLKCEQIVNC 180 

TALESPaiGS LVCSHPL6I7F SYNSSCSISC DRGYLPSSME TMQGMSSGBW SAPIPACNW 240 

ECDAVTOPAN GPVECPQNPO SFPWNTTCTP DCEEGFELMG AQSIiQCTSSG NVTONEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGEFTPKSS CNFTCEEGFM LQGPAQVBCT TQGQMTQQIP 360 

VCEAFQCTAL SNPERGYMNC LPSASQSPRY QSSCEPSCEQ GPVLKGSKRL QCGPTGEWDN 420 

BKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYK8SCAFS CEEGFELYGS TQLECTSQGQ 480 

WTEBVPSCQV VXCSSIAVPO KINMSCSGEP VFOTVCKFAC PE6HTUIG8A ARTGQATOHW 540 

SGLIiPTCEAP TESNIPLVA6 LSAAGLSLLT IAPFLU9IfRK CLRXAKKFVP A8S0QSLESD 600 
GSYQKPSYIL 

A13B DNA SEQUENCE 1 

Gene name: ESTs, Weakly similar to JB0350 Anterior gradient-2 [H. sapiens] 

Unigene number: 113.100666 
Probeset Accession AA487468 
Nucleic Acid Accession #t AA467468 

Coding sequenpe: 55-555 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

OSQCACCAAO AQCACTGGCC AAOTCAGCTT CTTCXGAQAG AOTCTCTAGA AQA CATGA TQ 60 

CTACACTCAO CTTTG6GTCT CTGCCTCTTA CTOGTCACAO TTTCTTOCAA CCTTGCCATT 120 

GCSUiTAAAAA AGOAAAAGAO GCCTCCTCAO ACACTCTCAA GAGGATGGGQ AOATQACATC 180 

ACTTGGGTAC AAACTTATQA AOAAGGTCTC TTTTATGCTC AAAAAAGTAA GAAGCCATTA 240 

ATGGTTATTC ATCACCTGOA GQATTGTCAA TACTCTCAAG CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAQA AATG6CTCA0 AATAAOTTCA TCAT6CTAAA CCTTATOCAT 360 

(SAAACCACTG ATAAGAATTT ATCACCTGAT GQ6CAATATG TGCCTAGAAT CATOTTTGTA 420 

GAOOCTTCTT TAACAGTTAG AGCTGACATA GCTGGAAGAT ACTCTAACAO ATT(3TACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTATT 540 

CAOTCAQAGC TATAAGAGAT GATA6AAAAA AGCCITCACT TCAAAQAAGT CAAATTTCAT 600 

GAAGAAAACC TCTG6CACAT TGACAAATAC TAAATGTGCA AGTATATAGA TTTTGTAATA 660 

TTACTATTTA GTTTTTTTAA TGTGTTT6CA ATAGTCTTAT TAAAATAAAT GTTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 



A139 Protein sequence: 
Gene name: 
Ubigene number: 
Probeset Accession tf: 
Protein Accession 9i 
Signal sequence: 
Transmentorane domains: 
Cellular Localization: 



SSTs, Heakly similar to JE0350 Anterior gradient-2 [H. sapiens] 

Hs. 100686 

AA467468 

none found 

1-23 

none found 
secreted 



1 11 21 31 41 51 

I I I I i I 

MNUISALOLC Z«LLVTVSSNL AIAIKKEXRP PQTLSRGHGD DITWVQTYEB GLFYAOKSKK 60 
PLHVIHHXiBD OQYSQALXKV FAONBBIQSM A(3NXFIMLNL KHBTTDKNLS PXX3QYVPRIH 120 
FVDPSLTVRA DIAGRY8MRL YTYEPRDLPL LIBNNRXAIA XiIQSBL 

A140 DWA SBQOEMCB 

Gene name: • TMPRSS3a 

TJnigene number: Hs. 298241 

Probeset Accession tft AI53a613 

Nucleic Acid Accession fl: AB038157 

Coding sequence: 202-1566 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I 1 1 I I 

ACGQG6CACC GGAOGGCVCX3 G8TACTTTCG TTCTTAATTA 6GTCAT6C0C GTGTGAGCCA 60 

OGAAAOGGCT GTGTTTATQQ (SAA6CCAGTA ACACT6TGQC CTACTATCTC TTC06TG0TG 120 

CCATCTACAT TTTTQGGACT 0666AATTAT GAGOTAGAGG TOGAGGOGGA 60060ATGTC 180 

AGAGGTOCTQ AAATAGTCAC CATOG GQQAA AATQATOOQC CTGCTOTTGA AGCOCOCTTC 240 

897 



wo 03/042661 



PCT/US02/36810 



TCATTCCmT OGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAQATOCA 300 

GATGCrorra CTGCACACSAT CCTQTCACTG CTGCCATTQA AGTTTTTTCC AATCATCGTC 360 

ATTG GG ATCA TTGCATTGAT ATTAGCACTO G C X3VTTGGTC TGGGCATCCA CTTOBACTGC 420 

TCAOOQAAaT ACASATOTCS CTCATOCTTT AAGTGTATOG AGCTGATAGC T0GATGT6AC 480 

GGAGTCTCX30 ATK3CAAA6A CGGOGAGGAC GAGTAC06CT GTGTCOG GG T GGGTGGTCAO 540 

AATGCCX3TC5C TCCAGGTGTT CACAGCTGCT TCGTCQAAOA CCATGTGCTC CGATGACT6G 600 

AW3GGTCACT AC6CAAAT0T TGCCTGTGCC CAACTGG6TT TCCCAA6CTA TGTGAGTTCA 660 

GATAACCTCA QAGTGAGCTC GCTGGAGGGG CAGTTCOGGG AGGAGTTTGT GTCCATOQAT 720 

CACCTCTTOC CAGATQACAA aOTGACTOCA TTACAOCACT CAGTATATGT GAGGGAGOGA 780 

TGTGCCTCTG GCCACGTGGT TA CtTIG CAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACX5CA TCGTGGGTGG AAACATGTCC TTGCTCTCOC AGTGGCCCTG GCAGQCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG 6GCTCTGTCA TCACGCCCCT OTGGATCATC 960 

ACIGCTGCAC ACrGTGTTTA TQACTTGTAC CTCCCCAAGT CATQGACCAT CCA66TGGGT 1020 

CTAGTTTCCC TGTTGOACAA TCCAGCCCCA TCCCACTTGQ TG6AGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCX^CCC TTATGAA6CT GGCCGGGCCA 1140 

CTCACGTTCA ATOAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTQAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAQ AGGATGGAGC AGGTGACGCC 1260 

TCCCCTGTCX: TGAACCACGC GGC0GTCCX:T TTGATTTGCA ACAAGATCTQ CAACCACAGG 1320 

QAOQTGTACG GTGGCATCAT CTCCCOCTCC ATGCTCTGOQ GQGOCTAOCT GAOSGGTOGC 1380 

GTG6ACAGCT GCCAGGGG6A CAGCGGGGGG CCCCr G GTGT GTCAAGAGAO GAGGCTOTGG 1440 

AAGTTAGTGG GAGCOACCAO CTTTGGCATC GGCTGCGCAG AGGTOAACAA GCCTGGGGTG 1500 

TACACCCGTG TCACCTCCTT CCTGGACTGG ATCCACGAGC AGATGGAGAG AGACCTAAAA X560 

ACCTQAAGAO GAA6GGGACA AGTAGCX31CC TGAQTTCCTO AOGTGATGAA QACA6CCCQA 1620 

TCCTCCXXTTG GACTCCOGTG TAGGAACCTO CACACX3A6CA GAGAOCCTTO QAGCTCIGAO 1680 

TTCCX3GCACC A6TAGCAGGC COGAAAGAGO CACXXTTCCA TCT6ATT0CA OCACAACXTTT 1740 

CAAGCTGCTT TTTGTTTTTT GTTTTTTTGA GGTGGAGTCT CGCTCTGTTQ CCCAGGCTGG 1800 

AGTGCAGTGG OSAAATCCCT GCTCACTGCA GCCTCCQCTT CCCTGOTTCA AGCXSATTCTC 1860 

TTGCCTCAQC TTCCCCAGTA GCTGQGACXA CAGGTGCCCQ CCACX»CACC CAACTAATTT 1920 

TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTGGCCA GGCTGCTCTC AAACCCCTGA 1980 

CCTCAAATGA TQTGCCTGCT TCAGCCTCCC ACAGTGCTGG QATTACAGGC ATGGGCCACC 2040 

ACGCCTAGCC TCACGCTCCT TTCTGATCTT CACTAAGAAC AAAAQAAGCA GCAACTTGCA 2100 

AGGGCGGCCT TTCCCACTGQ TCCATCTGGT TTTCTCTCCA QQaOTCTTQC AAAATTCCTQ 2160 

ACOAGATAAG CAGTTATGTG ACCTCACX3TG CSUU^GCCACC AACAGCCACT CAGAAAAGAC 2220 

GCACCAGCCC AGAAGTGCAG AACTGCAGTC ACTGCAOQTT TTCATCTCTA GGGACCAGAA 2280 

CCAAACCCAC CCTTTCTACT TCCAAGACTT ATTTTCACAT GTGGGGAGGT TAATCTAGQA 2340 

ATGACTOGTT TAAGGCCTAT TTTCATGATT TCTTTGTA6C ATTTGGT6CT TGAOSTATTA 2400 

TTGTCCTTTQ ATTCCAAATA ATATGTTTCC TTCCCTCAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAA 

A141 Protein seq^ience; 

Gene namet TMPRSS3a 
Unigene number: Hs. 2 98241 

Probeset Accession AZ538613 
Protein Accession #i BAB20077 
Signal sequence: none found 

Transmeinbrane domains i 43-65, 239-261 

Tryp^SPc domain » 216-444 
Cellular Localization i plasma menibrane 



1 . 11 31 31 41 51 

I i i I I I 

MC^NDPPAVB APFSFRSLFG LDDLKISFVA PDADAVAAQI LSLLPLKPFP IIVIOIIAIiI 60 

LALAIOLGIH FDCSGKYRCR SSFKCIBLIA RCDGVSDCKD GEDBYRCVRV GQQNAVLQVP 120 

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDKLRVSS LEGQPRBBFV SIDHLUPDCK 180 

VTAIiHESVW REGGASGRW TLQCTAGGHR ROYSSRIVGO HMSLLSQWPN 0A8LQFQ6VB 240 

LOGGSVITPIi NZITAAHCVY DLYLPKSWTI QVOLVSLLDN PAPSHLVBKI VYHSKnCPKR 300 

LGMDIALHXL AGPLTFNEKZ QPVCLPNSEE NFPDGKVCWT SOHGATBDOA QDASFVUIHA 360 

AVPLI8NKIC NHRDVYGGII SPSMLCAGYL TG6VD80QGD 8GGPLV0QBR RLWKLVOATS 420 
FQIGCABVNK PGVYTRVTSF LDWZHEQMER DIiKT 

A142 PMA 8EQUEWCS 

Gene naioe} BSTs; Weakly similar to LXTKOSTATHZMB 1 BBTA PRECURSOR [R. sapiens] 

unigene nuntber: He. 105484 

Probeset Accession #s AA314779 
NUcleic Acid Accession #t none found 

Coding sequence I 103-579 (underlined sequences correspond to start and stop codons) 



1 11 31 31 41 51 

I I I 1 I I 

CCAAACAGAT TTGCAGATCA AGGAGAACCC AGGAGTTTCA AAGAA606CT AGTAAGGTCT 60 

CTQAGATCCr TGCACTAGCT ACATCCTCAG GGTAGQAGGA AGATGGCTTC CAOAAOCATQ 120 

C06CT6CTCC TATTGCTGAG CTGCCTGGCC AAAACAGGAG TCCTGG G TOA TATCATCAIJQ 180 

AGACCCAGCT GTGCTCCTGG ATGGTTTTAC CACAAGTCCA ATT6CTATGG TTACTTGAGO 240 

AAGCTGA6QA ACTGGTCTGA TGCOQAGCTC GAOTSTCAGT CTTACGGAAA OOGAGOCCAC 300 

CTGGCATCTA TCCTGAGTTT AAAGGAAGCC AGCACCATAG CAGAGTACAT AAQTGOCTAT 360 

CAGAGAAGCC A6CCGATATQ GATTGGCCrG CAOGACCCAC AGAAQA6QCA GCAGTGGCAG 420 

TGQATTQATO GGGCCATGTA TCTGTACAGA TCCTOGTCTQ GCAAGTCCAT GGGTGGOAAC 480 

AAOCACZOIG CrQAOATQAfl CTOCAATAAC AACTTTTTAA CTTOGAOCAO CAACQAATGC 540 

AACAAGC60C AACACTTCCT GTGCAAGTAC OOACCATAGA GCAAGAATCA AGATTCT6CT 600 

AACTCCTGCA GCAGCCCOGT CCTCTTCCTT TCTGCTAGOC TGGCTAAATC TGCTCATTAT 660 

TTCAGAGGGG AAACCTA6CA AACTAA6AGT GATAAOGGOC CTACTACACT GGCTTrTTTA 720 



898 



wo 03/042661 



OGCTTAGAOA C3VGAAACTTT AGCATTGGGC CCAGTAGTGG CTTCTAGCTC TAAATGTTTG 780 

CCCCGCCATC CCTTTCCACA GTATCCTTCT TCCCTCCTCC CCTOTCTCTG GCTGTCTCGA 840 

GCAGTCTAGA AOAGTGCATC TCCAGCCTAT GAAACA6CTG GOTCTTTGGC CATAAGAAGT 900 

AAAGAT7TGJV AGACAGAAGG AA6AAACTCA GGAGTAAGCT TCTAGAOCCC TTCAGCTTCT 960 

ACACXXrrrCT GCCCTCTCTC CATTGCCTGC ACCCCACCCC AOCCACTCAA CrCCTGCTTG 1020 

m Yi ' Ccrrr ggccatagga agotttacca gtagaatcct tgctagottg atgtgggcca iobo 

TACATTCCTT TAATAAACCA TTGTGTACAT AAGAGAAAAA AAAAAAAAAA AAAAAAAAAA 



A143 Protein sequence: 
Gene name: 
Unigene Humbert 
Probeset Accession 9: 
Protein Accession 8i 
Signal sequence: 
Transmembrane domains; 
C-type lectin domains 
Cellular Localisation: 



BSTs; Weakly similar to LITKOSTATBHIB 1 BETA PRECURSOR (H. sapiens] 

H8.10S484 

AA314779 

none found 

1-22 

none found 

47-156 

secreted 



1 11 21 31 41 51 

) I I I I 1 

MASRSMRLLL LLSCLAKTGV IX3DIIMRPSC APGHFYHKSN CyGyPRXXiRK H8DABLEGQS 60 
YGNGAHLASI LSLKEASTIA EYISOYQRSQ PIWIGLKDPQ XRQQNQWIOQ AMniYRSWSO 120 
KSMGGNKHCA EMSSKNNFLT WSSNEQTKRQ HFLCKYRP 



A144 DMA SEQUENCE 

Gene namet Cadherin 3, P-cadherin (placental) 

Uhigene number: He. 2877 

Probeset Accession S: X63629 

Nucleic Acid Accession ft: X63629 

Coding sequence: 54-2543 (start and stop codons are underlined) 



1 11 21 31 41 51 

I I I 1 I I 

GCGGAACACC OGCOOGCOGT C60G6CAGCT GCTTCACCCC TCTCTCTGCA GC CATGG GGC 60 

TCCCTCGTGG ACCTCTCGCG TCTCTCGTCC TTCTCCA6GT T TGCTGGC T G CAGT6C6C6G 120 

CCTCGGAGCC GTGCCGGGCG GTCTTCAGGO AGGCTGAAGT 0ACCTTG6A6 GOGGGAGGOQ 180 

CGGAGCAGGA GCCCG6CCAG GC6CTGGGGA AA6TATTCAT GGGCTGCCCT GGGCAAGAGC 240 

CAGCICTGTT TAGCACTGAT AATGATGACT TCACTGTQGO GAATGQOQAO ACA6TCCAGG 300 

AAAGAAOOTC ACTGAAGQAA AGGAATCCAT TGAAGATCTT CCGATGCAAA OGTATCTTAC 360 

GAAfiACACAA GAGAGATTGG G1GGTTGCTC CAATATCTGT CCCTOAAAAT GGCAAGGGTC 420 

CCTTCCCCCA GAGACTGAAT CAOCTCAAGT CTAATAAA6A TAGAGACACC AAGATTTTCT 480 

ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTOAGGG TGTCTT06CT GTAGAGAAGG 540 

A6ACAGGCT6 GTTQTTGTTG AATAAGCCAC TGGAG06GGA GGAGATTQCC AAGTATQAQC 600 

TCTTTGGCCA GGCT6TGTCA GA8AATGGT6 CCTCAGTGGA G6ACCCCAT0 AACATCTCCA 660 

TCATCGTOAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 

GTGTCTTAGA 6GGAGTCCTA CCAGGTACTT CTGTGAT6CA GGTGACAGCC ACA6ATGAGG 780 

ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAQAACCAA 840 

AGGACCCACA OQACCTCATO TTCACAATTC ACCGQA6CAC AGGCACCATC AGOQTCATCT 900 

CCAG TGGC CT GGAC06G0AA AAAG TCOCT G AGTACACACT QAOCAT CCAfl GCCACAGACA 960 

TGQATGGG6A CGGCTCCACC ACCACGGCAO TOGCAOTAOT aC3AGAT0CTT OATGOCAATG 1020 

ACAAT6CTCC CATGTTTGAC CCCCA6AAGT AOGAGGCCCA TOTGCCTGAG AATGCAGTGG 1080 

GCCATGAGOT GCAGAGGCTG AOBGTCACTO ATCTGGACGC CCCCAACTCA CCAG03TQGC 1140 

6TGCCACCTA CCTTATCATG GGGGGTGAOG AGGGGGACCA TTTTACCATC ACCACCCACC 1200 

CTGA6A6CAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GQAT TTTGAG GCGAAAAACC 1260 

AGCACACCCT GTAOGTTQAA GTQACCAAGG AGGCCCCTTT TGTGCTQAAG CTCGCAACCT 1320 

CCACAGCCAC CATAGTGOTC CACGTGQAGG ATGTOAATGA GGCACCT6TG TTTOTCCCAC 1380 

CCTCCAAAGT CQTTGAGGTC CAGOAGGGCA TCGCCACTGO GQAGCCTQTQ TGTOTCTACA 1440 

CTGCA6AAGA CCCTGACAAG GA6AATCAAA AGATCAGCTA COGCATOCTG AGA6ACCCAG 1500 

GAGOGTQOCT AGCCATGQAC CGUSACAGTQ GGCAGOTCAC AOCTOTOOOC ACGCTOGACC 1560 

GTGAGGATQA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTT6 6CCATGGACA 1620 

ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACQ 1680 

ACCATGGCCC AGTCCCTOAG CCCCGTCAGA TCACCATCTG CAACCAAA6C CCTGTGCGCC 1740 

A0GTGCT6AA CATCAOGGAC AA6QACCTGT CTCCCCAGAC CTCCCCTTTC CAGGCCCAGC 1800 

TCACAOATGA CTCAGACATC TACTGOACGG CASAOOTCAA CGA6GAAGGT 6ACACAGTGG 1860 

TCTTGTCCCT GAAGAA6TTC CTGAAGCAGG ATACATATGA C6T6CACCTT TC TCTO TC rG 1920 

ACCATGQCAA CAAAOAGCAG CTGAOGGTGA TCAGGGCCAC TOTGTGOQAC T60CATGGCC 1980 

ATGTCGAAAC CTGCCCTGGA CCCXaOAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 

TCCIGGCTCT GCTGTTCCTC CT GC TGGTGC TGCTTTTGTT G6TGA0AAAG AAGC6GAAGA 2100 

TCAAGOAOCC CCTCCTACTC CCAGAAOATG ACAOCOQTGA CAA08TCTTC TACTATGGOS 2160 

AAGA6GGGGG TGGCGAAOAG GACCAGOACT ATGACATCAC OCAGCTCCAC OQAGGTCTGG 2220 

AGGCCAQGCC GOAGGTOOTT CTCCGCAATG ACGTGGCACC AACCATCATC COGACACCCA 2280 

TGTACCOTCC TAGOCCAGCC AACCCAGATG AAATOGGCAA CTTTATAATT GAGAACCTGA 2340 

AG6CGGCTAA CACAGACCCC ACAGCCCOGC CCTAOGACAC CCTCTTQQTG TTCGACTATG 2400 

AGGGCAGCQG CTCOGACXICC GCOTCCCTQA 6CT00CTCAC CTCCTCOGCC TCOQACCAAG 2460 

ACCAAGATTA GGATTATCTG AACGAGTG6G GCAGCCSCTT CAAGAAGCTG GCAGACATGT 2520 

ACGGTGGCQG GGAGGA06AC TAGGOGGCCT GCCT6CA6G0 CTGGGGACCA AACGTCAGGC 2580 

CACA6AGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 

GGAAGTGGCC GTAGCAACTT G60GGAGACA GGCTATGAGT CTQAOGTTAO AGTGGTTGCT 2700 

TCCTTAGCCT TTCAGGATGG AGQAATGT6G OCAOTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTQOGCC AGGGTTGCCX CAGAOQCCAA GTTTOCAOAA GOCTCTTACC TGCOQTAAAA 2820 

TGCTCAACOC TGTGTCCTGG GCCTGGGCCT GCTOTGACTG ACCTACAGTG QACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAO GOCTCCTGGT OCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 



899 
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PCT/US02/36810 



TCAAAAOGTT AGAGAAAGTT CTTCAAAAfiT GCAGCCCAGA GCTGCTOO G C CCACTGGCCG 3000 

TCCTGCATTT CrGGTTTCCA GACCCCAATQ CXHXXXATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAGTG TGOCTAGGTT GCCXXTTATT TTTTATTTTC C CX ' G 'nx a CgT TGCTATAGAT 3X20 
QAAG06TGA0 GACAATOCTG TATAT8TACT AQAACTTTTT TATTAAAQAA A 



A14S Protein gequence: 



Unlgene nundben 
Probeset Acceealon <|: 
Protein Accession 0$ 
Signal sequence: 
Transnieinbrane domain: 

Cellular localizations 



Cadherin 3, P-cadherin (placental) 

Ha. 2677 

X63629 

CAA4S177 

1-24 

659-675 

plasma menbraoe 



1 11 21 31 41 51 

I I I 1 i I 

MQLPRGPIiAS LUiLQVCWLQ CAASEPCRAV FREABVTLEA GGABQBPGQA LGKVFMGCPG 60 

QEPALFSTDN DDPTVRNGET VQERRSLKER NPUUPPSKR ILRRHKRDWV VAPISVPENQ 120 

KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLU7 KPLDREEIAK 180 

YEIiPGHAVSE NGASVBDPMN ISIIVTDQND HKPKPTQDTF ROSVLBOVLP GTSVMQVTAT 240 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMP TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDKDGDGSTT TAVAWEILD ANDNAPMFDP QXYEAHVPEN AVGHEVQRLT VTDLDAPN8P 360 

AWRATYLIMG GDDGDHPTIT THPESNQGIL TTRKOLDPEA KNQHTLYVEV TNBAPFVLKL 420 

PTSTATIWH VEDVNEAPVP VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSQQVTAVGT LDREDEQPVR NNIYEVMVLA MDNQSPPTTG TGTLLLTLID 540 

VMDH6PVPEP RQITIOIQSP VRHVLtTITDK DLSPHT8PFQ AQLTDDSDIY HTABVMBEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP SfKGGPILFVL 660 

GAVLAIiLFLL LVIiLLLVRKK RKIKEPLLLP EDDTRX3NVFY YGBEGGGEHD QDYDITQLHR 720 

GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIXE NLKAANTDPT APPYDTLLVF 780 
DYBGSGSDAA SLSSLT8SAS DQDQDYDYLN BWGSRFKKLA OMYGGGEDD 

A146 DNA SBQUeUCE-. 

Gene name: TTK protein kinase 

Unlgene number: Hs. 169640 

Probeset Accession #t Me6699 

Nucleic Acid Accession « : KM_003318 

Coding sequences 1026-3551 (underlined sequences correspond to start and stop codons) 



1 II 21 31 41 51 

1 I lilt 

GGAATTCCTT TTTTTTTTTT TTTGAGATGG A6TTTCACTC TTGTTGGCCA GGCTGGAGTG 60 

CAATQOCACA ATCTCAGCTT ACTGCAACCT COQCCTCCCG QGTTCAAGOG ATTCTCCTGC 120 

CTCAGCCTCT CAAGTAGCTG GGATTACAGG CATGT6CCAC CACCCCTGGC TAACTAATTT 180 

CTTTTCTATT TAOTAGAGAT GGGGTTTCAC CAT6TTGGTC AGGCTGGTCT TQAACTCCIG 240 

ACCTCAGGT6 ATCCACTTGC CTTGGCCTCC CAAAOTGCTA GGATTACAGC OGTGAAACTG 300 

TQCCTGGCPG ATTCTTTTTT TGTTGTT6GA TTTTTGAAAC AGGGTCTCCC TTGGTOGCCC 360 

AGQCXGGAGT GCAGTaOTGC GATCTTGQCT CACTATAACC TCCACCTCCT OQTTTCAAGT 420 

GATC CTCCCA CT TTAGC CTC CTGAGTAGCT 6TGATTACA0 GCGTGCACCA CCACACCCGG 480 

CTAATTTTTQ TATTTTTATT AOAOACSUIGG TTTCACCATG TTGOCCAOGC TGTTGTCAAA 540 

CTCCTGC3ACT CAAGGGATCC GOCT G OCT C C ACITCCCAAA GTGCCGAGAT TACAGGTGTG 600 

AQTCACCATG OCTGACCTTA TAATTCTTAA GTCATTTTTT CTGGTCCATT TCTTCCTTAG 660 

GGTCCTCACA ACAAATCTGC ATTAGGOGGT ACAATAATCC TTAACTTCAT GATTCACAAA 720 

AGGAAQATGA A6TGATTCAT GATTTAGAAA G6GGAAGTAG TAAGCCCACT GGACACTCCT 780 

GGATGATGAT CCTAAATOCA GATACAOTAA AAATGGGOTA TOGOAAaOTA OAATACAAAA 840 

TTTGGTTTAA ATTAATTATC TAAATATCTA AAAACATTTT TQGATACATT GITGATGTGA 900 

ATGTAAOACT GTACAGACTT CCTAGAAAAC AGTTTGG6TT CCATCTTTTC ATTTCOCCAO 960 

TGCAGTTTTC TGTAOAAATG 6AATCCGAGG ATTTAAGTGG CAGAGAATTG ACAATTGATT 1020 

CCATAATGAA CAAAGTOAGA QACAXTAAAA ATAAGTTTAA AAATGAAGAC CTTACTGATG 1080 

AACTAAGCTT GAATAAAATT TCT6CTQATA CTACAQATAA CTCGGQAACT 6TTAACCAAA 1140 

TTATGATGAT QGCAAACAAC CCAOAGGACT OQTTGAOTTT GTTOCTCAAA CTAGAGAAAA 1200 

ACAGTGTTCC GCTAAGTOAT GCTCTTTTAA ATAAATTQAT TGGTOGTTAC AGTCAAGCAA 1260 

TTGAAGCGCr TCCCCCAGAT AAATATGGCC AAAATQAQAO TTTTGCTAQA ATTCAAGTGA 1320 

GATTTGCTGA ATTAAAAGCT ATTCAAQAGC CAGAT6ATGC AOGTGACTAC TTTCAAATGG 1380 

CCAOAOCAAA CTGCAAGAAA ITl l SCTfrrQ TTCATATATC TTTTSCACAA TTTGAACTGT 1440 

GACAAOGTAA TGTGAAAAAA AGTAAACAAC TTCTTCAAAA AGCTQTAOAA GOTGGAGCAa 1500 

TACCACTAQA AATGCTGQAA A TTG C CCTGC GGAATTTAAA CCTCCAAAAA AAGCAGCTGC 1560 

TTTCAGAOGA GOAAAAQAAG AATTTATCAG CATCTACGGT ATTAACTGCC CAAGAATCAT 1620 

TTTCOGGTTC ACTTGGGCAT TTAGAGAATA 0GAACAACA6 TT6TGATTCC AGAGGACAGA 1680 

CTACTAAAGC CM3GTTTTTA TATGGAGA6A ACAT6CCACC ACAAGATGCA GAAATAG6TT 1740 

ACOGGAATTC ATTQAQACAA ACTAACAAAA CXAAACA0TC AT6CGCATTT GGAAOAGTCC IBOO 

CAGTTAACCT TCTAAATAGC CCAGATTGTG ATGTGAAGAC AGATGATTCA GTTGTACCTT 1660 

GTTTTATGAA AAOACAAACC TCTAGATCAG AATGCCGAGA TTTGGTTGTO CCTGGATCTA 1920 

AACCAAGTGG AAATGATTCC TGTGAATTAA GAAATTTAAA GTCTOTTCAA AATAGTCATT 1980 

TCAAGOAACC TCTGBT G TCA GATGAAAAGA G'lTCl U AACT TATTATTACT GATTCAATAA 2040 

CCCTOAAGAA TAAAAOGGAA TCAAGTCTTC TA6CTAAATT AGAA6AAACT AAAGAGTATC 2100 

AAOAACCAGA GGTTCCAGAO AGTAACCAGA AACAGTGGCA AGCTAAGAGA AAGTCAOAGT 2160 

GTATTAACCA QAATCCTaCT GCATCTTCAA ATCACTGGCA GATTCOGGAG TTAGCXX53AA 2220 

AAGTTAATAC AGA6CAGAAA CATACCACTT TTGAGCAACC TGTCTTTTCA GTTTCAAAAC 2280 

AOTCACCAOC AATATCAACA TCTAAATGOT TTOACOCAAA ATCTATTTGT AAOACAGCAA 2340 
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PCTAJS02/36810 



GCAGC AftTA C CTTGGATGAT TACATGAGCT GITTTAGAAC TCCAGrTGTA AAGAATQACT 2400 

TTCCACCTGC TTGTCAGTTG TCAACACCTT ATGGCCAACC TGCCTGTTTC CAGCAGCAAC 2460 

AGCATCAAAT ACTTGCCACT CCACTTCAAA ATTTACAGGT TTTAOCATCT TCTTCAGCAA 2520 

ATGAATGCAT TTOGGTTAAA GGAAGAATTT ATTCCATATT AAAOCAGATA OGAAaTGOAQ 2580 

6TTCAA0CAA GGTATTTCAO GTGTTAAATO AAAAOAAACA GATATAT6CT ATAAAATATG 2640 

TGAACTTAGA AGAAGCAQAT AACCAAACTC TTQATAOTTA COGOAAOJAA ATAQCTTArr 2700 

TGAATAAACT ACAACAACAC AGTGATAAGA TCATCCGACT TTATGATTAT GAAATCACGQ 2760 

ACCAOTACAT CTACATSGTA ATGGAGTGT6 GAAATATTGA TCTTAATAGT TGGCTTAAAA 2820 

ASAAAAAATC CATTGATCCA TGGOAAOGCA A6A0TTACTQ 6AAAAATATG TTAQAGGCAG 2880 

TT CACA CAAT CCATCAACaT GGC3VTTCTTC ACAGTGATCT TAAACCAQCT AACTTTCTGA 2940 

TAGTTGATGO AATGCTAAAQ CTAATTGATT TTGGGATTGC AAACCAAATO CAACCAGATA 3000 

CAACAAGTOT TGTTAAAGAT TCTCAGGTTG GCACAOTTAA TTATATGCX3V CCAGAAGCAA 3060 

TCAAAGATAT GTCTTCCTCC AGAGAGAAT6 QGAAATCTAA GTCAAAQATA AGCCCCAAAA 3120 

GTGATGTTtO GTCCTTAGGA TGTATTTTGT ACTATATOAC TTAOGGOAAA ACAOCATTTC 3180 

AGCAGATAAT TAATCAQATT TCTAAATTAC ATOCCATAAT TQATCCTAAT CAT6AAATTG 3240 

AATTTCCCGA TATTCCAGAG AAAGATCTTC AAGATOTOTT AAAOTGTTGT TTAAAAAGGG 3300 

ACCCAAAACA GAGGATATCC ATTOCTGAGC TCCTGGCTCA TCCATATGTT CAAATTCAAA 3360 

CT CATC CAQT TAACCAAATG GCCAAGGQAA CCACTGAAGA AATGAAATAT GTTCTGG6CC 3420 

AACnGTTOQ TCTGAATTGT OCTAACTCCA TTTTQAAAGC TGCTAAAACT TTATATQAAC 3480 

ACTATAGTOG TGGXXSAAAGT CATAATTCTT CATCCTCCAA GACTTTTGAA AAAAAAAGGO 3540 

GAAAAAA ATG AT TTGCAGTT ATTCGTAATG TCAGATAOGA GGTATAAAAT ATATTGQACT 3600 

C3TTATACTCT TGAATCCCTG TGGAAATCTA CATTTGAAQA CAACATCACT CTGAAGTGTT 3660 

ATCAGCAAAA AAAATTCAOT OAGATTATCT TTAAAAGAAA ACTOTAAAAA TAGCAAOCAC 3720 

TTATGGCACT OTATATATTG TAGACTTQTT TTCTCTOnT TATGCTCTTG TGTAATCTAC 3780 

TTGACATCAT TTTACTCTTG GAATAOTGGG TGOATAGCAA GTATATTCTA AAAAACTTTG 3840 
TAAATAAAGT TTTOTaGCTA AAATGA 



A147 Protein eequencei 
Gene namet 
Unlgene nutnber: 
Probeset Accession #: 
Protein Accession ft: 
Signal sequence: 
Transmenbrane domains t 
Protein Kinase Domain t 
Cellular Localisation: 



TTK protein kinase 

H8. 169840 

M86699 

NP_003309 

none found 

none found 

510-775 

cytoplasmic and nuclear 



1 
I 

MNKVRDIKNK 
VPLSDALUJK 
ANCKKFAFVH 



NSUIQTNKTK 
SGNDSCBLRK 
PBVPESNQKQ 
PPISTSKMH) 
QZIATPLQNL 
t£BADNQTLD 
KSXDPWERXS 
SWKDSQVGT 
IINQISKX^ 
FVNC^fAIQBTT 
K 



11 

I 

PKKBDLTDBIi 
LIGRYSQAIB 
XSFAQFELSQ 
TVLTAQBSFS 
QSCPFGRVFV 
LKSVQNSKFK 
WQAKRKSBCI 
PKSICKTPSS 
QVLA8SSANB 
SYRNBIAYLN 
YWKNMLEAVH 
VNYMPPEAIK 
ZIDPNHEZEP 
EaflXYVLGQL 



21 
I 

SUJKISADTT 
ALPPDKYGQN 
6NVKKSKQLL 
GSLGHLQNRN 
NLLNSPDCDV 
BPLVSDEKSS 
KQNPAASSNH 
NTLDDYMSCF 
CXGVRGRIYS 
XLQQBSDKZI 
TIHQHGIVHS 
DMSSSREtTGK 
PDIPEKDLQD 
VGU78PII8XL 



31 

! 

DNSGTVNQIM 
ESPARIQVRP 
QKAVERGAVP 
NSGDSRGQTT 
KTDDSWPCF 
ELIITDSITL 
HQIPELAIUCV 
RTPWKNDFP 
ILXQIGSGGS 
RLY0YEITDQ 
DLKPANFLIV 
SKSKI8PKSD 
VLKCCLiORDP 
KAAICTLYEHy 



41 

I 

MMAHNPEDWL 
AELKAIQBPD 
LEMLEIALRN 
KARPLYGENM 
MKRQTSR8BC 
KNKTESSLiLA 
NTEQKHTTFB 
PAOQLSTPYG 
SKVFQVLNEK 
YIYMVMECGN 
DGMLKLIDFG 
VWSLGCILYY 
KQRXSIPELL 



51 

I 

SLLLKLEKNS 
DARCYFQKAR 
LNLQKKQIiLS 
PPQDAEIGYR 
RDLWPGSKP 
KLEBTKEYQB 
QPVFSVSKQS 
QPACFQQQQH 
XQXYAIKYVN 
XDLNSWLKKK 
XANQMQPOTT 
MTYGXTPFQQ 
AKPYVOXQIH 
SKTPBRKSOK 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



A14B DNA SEQUENCE 
Gene name : 
U&igene nundberi 
Probeset Accession ftt 
Nucleic Acid Accession #i 
Coding sequence: 



endothelial differentiation, lysophosphatidic acid 0-protein-coupled receptor. 
He. 2585 83 
MM 012152 
NmI012152 

43~1104 (underlined sequences correspond to start and stop codons) 



i 11 21 31 41 51 

I I I i I I 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATSAATGA GTOTCACTAT 60 

OACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120 

GQAACAAAGC TTOTQATrGT TTTGTOTOTT OaQAOGTTTT TCTQCCTGTT TATTTTTTTT 180 

TCTAA TTCT C T66TCAT0GC GGCAGTGATC AAAAAC3VGAA AATTTCATTT CCCCTTCTAC 240 

TA CCTG 'rrUQ CTAATTTAGC TQCTGCOGAT TTCTTOSCTQ QAATTGCXTTA TGTATTCXTrG 300 

ATGTTTAACA GAGGCCCAOT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCOTCAO 360 

GGGCrrCTQG ACAGTAGCT7 OACTGCTTCC CTCACCAACT TGCTGGTTAT OGCCGTGGAG 420 

AGGCACATGT CAATCATGAG GATGOQGGTC CATAGCAACC T6ACCAAAAA GAGGGTGACA 480 

CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCX3GTCCC CACACTGGOC 540 

TGGAATTGCC TCTQCAACAT CTCTOCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAQT 600 

TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGOT TGTGGTGTAC 660 

CTGOGGATCT ACQTOTAOGT CAAGAGGAAA ACCAAOGTCT TGT C TCOGCA TACAAGTGGO 720 

TO CATC AGOC GCOQGAGQAC AO0CA1GAAO CTAATGAA6A O6GT0ATGAC TGTCTTAGGG 780 

GCGTTTGTOO TATOCPQGAC OOOGGOCCTO GTGGTTCTGC TCCTOQAOGG CCTGAACT6C 840 

AGGCAGTGTQ GCGTOCAGCA TGTGAAAAGG TGOTTCCTGC TGCTGGOGCT GCTCAACTCC 900 

GTOGTQAACC CCATCATCTA CTCCTACAAG GAOGAGGACA TGTATGGCAC CATGAAQAAG 960 

ATGATCTGCT 6CTTCTCTCA OOAGAACCCA QAGAGGOOTC CCTCTCXJCAT CCCCTCCACA 1020 

GTCCTGAGCA GGAGTGACAC AGGCAGCCA6 TACATA6A60 ATAOTATTAG CCAAGGTGCA 1080 

901 



wo 03/042661 



PCTAJS02/36810 



GTCTGCAATA AAAGCACTTC CTAAACTCTO GATGOCTCTC GGCOCACOCA GGTGATGACT 1140 
GTCTTAGG 



A149 Protein sequence! 

Gene nanet 
Unlgene number: 
Probeset Accession Ui 
Protein Acceseion It: 
Signal sequence i 
Transmembrane domains: 
Cellular Iiocalization: 



endothelial diCferentiation. lysqphosphatidic acid G-protein-coupled receptor, 

Ho. 258583 

m 012152 

Np2036284 

none Cound 

31-53. 66-88. 150-172. 190-211. 239-261. 277-295 
plasma menibrane 



1 11 21 31 41 SI 

i I I I i 1 

MNECHYDKHM DPPYNRSMTD TVDDWTGTKL VTVLCVGTFP CLPIPPSNSL VIAAVIKNRK 60 

FHFPFYYLXA NLAAADPPAG XAYVFU4FNT GPVSKTIiTVN RWFI4RQGLI1D SSLTASLTNL 120 

LVIAVBRHMS IMRMRVHSNL TKK8VTLLZL XiVUAXAIFNO AVPTL6NNCL OIISACSSIA 160 

PIYSRSYLVP NTVSNLMAFL IMTWYLRiy VyVKRKXNVL SPHT86SISR RRTPMKLMXT 240 

VMTVLGAPW CWTPCXjWLL LDGWOtQCO VQRVKRHPLb LALLNSWHP XZYSYKDEDM 300 
YGTMKKMICC FSQEHPERRP SRIPSTVI«SR SDTGSQYXED SISQGAVCHK ST8 



Prostate 

Also DNA SEQUENCE 

Gene name: BSTs 

Ubigene nundtiert Hs. 293616 

Probeset Accession fft AW043782 

Nucleic Acid Accession 11 1 none found 

Coding sequence: 38-1075 (underlined sequence corresponds to start and stop codon) 



1 11 21 31 41 51 

111)11 

AGCAACGACG CCGGGCAGCG GGAGCGGOGG COGOQCCATS TGGCTGCTGG GGCOGCTGTG 60 

CCTGCTQCTG AGCAG0GCC6 CGQAGAGCCA GCTGCTOCOC GGGAACAACT TCACCAATGA 120 

6TGCAACATA CCAGGCAACT TCATGTGCAG CAATGQACG6 TGCATCCG6G G06CCTG6CA 180 

GTGTQAOGGG CTGCCTGACT GCTTCGACAA QAGTQATGAO AAOQAaTGCC CCAAOGCTAA 240 

GTCGAAATGT GQCCCAACCT TCTTCCCCTG TGCCAGCGGC ATCCATTGCA TCATTGGTOG 300 

CTTCCGGTGC AATGGGTTTO AGQACTOTCC OGATQOCAQC GATGAAGAGA ACTGCACAGC 360 

AAAGCCTCTO CTTTGCTCCA OOGOC O QCTA CCACTQCAAO AAOGGCCTCT OTATTGACAA 420 

GAGCTTCATC TGCGATGOAC AGAATAACTQ TCAAOACAAC AGTGATGAOa AAAGCTGTQA 480 

AAOTTCTCRA GAACCOGGCA 6TGGGCAGGT GTTTGTQACT TCA6AGAACC AACTTGTGTA 540 

TTACCCCAGC ATCACCTATG CCATCATCGG CAGCTCOGTC ATTTTTGTGC TGGTGGTGGC 600 

CCTGCTG6CA CTGGTCTTGC ACCACCAOGG GAAGCGGAAC AACCTCATQA CGCTGCCOGT 660 

GCACG8GCT6 CAGCACCCTO TGCTQCXOTC 006CCTGGT6 6TCCT0GACC AGCCCCACCA 720 

CTGGAAOSTC ACCTACAAGQ TCAATAATGG CATCCAGTAT GTGQGCAGCC AGG0GGA6CA 780 

GAATG08T09 QAAQTAGGCT CCCCACCCTC CTACTCOQAO QCCTTGCTGQ ACCAGAGGCC 840 

TGCGTGGTAT GACCTTCCTC CAOOGCOCTA CTCTTCTGAC AOGOAATCTC TGAACCAAGC 900 

CQACCTQCCC CCCTAC06CT CCCGGTCOGO GAGTGCCAAC AGTGCCAGCT CCCAGGCAQC 960 

CAGCAGCCTC CIQAGCOTGO AAOACACCAG CCACAGCCCQ GGGCA6CCTG GCCOCCAGGA 1020 

OGOCACTOCT GAGOCCAGGG ACTCTGA6CC CA6CCAGGGC ACTGAAGAAG TATAAQTCCC 1080 

AGTTATTCCA AAOTCCATAT GOOTTAATCT GCTCTGACTT GTTGCCATTC TAACAATTTG 1140 

TGCTCATGGG AAGCTCTTTA AGCACCTGTA AOQATGTCTC AAGTTACAGT TTGQQATATT 1200 

AACTATCTCT GCATTCCCCT CCTCCCCGAG ACTTCAGAGA TGTTTTTCTG GOGTCTCAQT 1260 

TGACATGATC TOTTGTGOGT CTTTTCIOTC AGGTCACTCT TCCCTTGOQA GCG6A6ATCA 1320 

CAGGCTCATT TTTCACATTA TTCTGTTTCT 6TTGGAGAGA CAGCATATAA AACASTATTO 1380 

AAATAGQCTG GGAGAGAGCA ATGTTTCTGT OCTATATTGG ATGCTCAGAA 6TGCAGGAGA 1440 

CGCTGGACOC AATTCTCTCT GCTGGGTAGT TACCTTATAG CATTTGGGGA TTTOGQTTAG 1500 

ATGATCTAAC CAGGAGOCCA TCACTGQATG GTCACCCOCC CAAAAAAATT CCATTTGAQC 1560 

ATCAAAACCT GCTTTGCACA ATCCTATTT8 ATGCCOCCAO TTGAGCAOAG TCAOTOOOCA 1620 

AAGAAAACTT TGGAOQTGAO TAACACCCTT CAGCAOTOGC AAOGTTAnT TGGTTrTGTG 1680 

AAGGACTCTG AAACCATCTA CCCTGTATAA ATTCTGGCTT TAGAAATTTG CCCAAGAATG 1740 

CTCATTCTGA QAGCTTTCCT CAGCAGCATA TATCATCAGC CTCATCCTAA AATAG6CAGG 180O 

GAGCCCCTCC CATQAGTTTA TCCAAGTTCT CAOCTCCTAA AATGCAGGCT GCCAA6A0CC 1860 

T ACAC CTGCC CTOGCTCTAC AGCCAGTTAC d ' OO r iTCrU GACTOTCACC CT00CA8CT0 1920 

ACCTGCC06T AGCCAAOOAA TGAGQACCTA ACTTGA6TTG GGCCAAA6TC TGAGCTG6CT 1980 

GTATGTCCCT GT6G0CCACA CCCAGCCTOT CTTGCTCATT CATGCAGCCT CAACACTGGC 2040 

CTCCAAAGTT CCCTTAACAC TTGCAAAGTC CTTTTTACCT OTGCATTTGG ACTTGAGGAC 21,00 

ACTGGTTTCT ATCACAGGTQ AQAGCCATGfT TCAATACCTC CAGCAAGCTC TCCTGGCTCC 2160 

CTGCACTGTG CAOGCTCCTC TTCCCAAGGT CCCAATACCA GCACCTCTA6 TTAOAGTTAG 2220 

GGTCAGGGTC AG6CCTCTCC CAACATCCCA GTAGTTTCTC CTCTGAQACA CATOOGCAAG 2280 

AGACAATTTG GAGTCAAGAT TTTCCATTTG GATCTATTTT AAATCTTTTA GAAATGCATT 2340 

TGAAACAOTG TGTTTGTTTT TTCCCTTCTA GTTAAGGGAC TATTTATATG TGTATAGGAA 2400 

AGCTGTCTCT TTTTTTGTTT TTCCTTTAAC AAGGTCCAAA GAAAGATOCA AAAGGAGATC 2460 

ACACCCTTOC GOCGCTOAOC GCGGTGATAA OVAGTCACTC CAGACTAACC TOTGrOCCAS 2520 

ACATTTGTGC ATTGTTGCAC TTTQAGGTTA TTATTTATCA AGTTCXTSAA GQAAGCAGAA 2580 

AQAOGGACTC CTCTCTCCCT CCGTQTATAG TCTCTATOTT TGTGCTAGTT mcmTrr 2640 

TTCTCTGTGT CCAGTCAGCC ACAGGGCCCO CCTCCCTGCA OQAATAAGGG GTAAAAOGTT 2700 

AGGTGTTGTT TQGCAAGAAA CCACACTGAC TGAT6AGGG0 TAAAATGGAA CCAGGTAQAO 2760 

CCACTCC6GG CA6CTGTCAC CCATTCAOAA CTTCTTTCGG CA6CTGAA6A AATGTTCA6T 2620 



902 



wo 03/042661 



AAOCTGTTTG AOGCTAATTA AAACAGAGCC 
TGATCCTGTT CTGTAGACTT TTCTTTCTTT 
AAGCTABCCA CTGCrTATTTT GTTTTGTTTA 
AAOSOAAAGG AACCTAGCTQ CCTGTATCTT 
CTOAQTAATC CAATAAAGAA CTTTTGATGA 
CATTTCATCT CCTGTGAGTC AGAAOGGCTT 
TTTCTGGTGC TCTGGAAGTT GTTTAGAGGA 
GAGTTAATCT CACTCGCTTT TCTGCTTCCA 
AGATAAGGGA TGCCTACTAA TQCTTTTTTA 
TGATTTTTTT AATGAATGTT TTTAAAAATA 
TTTTTGGGGO GAGOGGGTTT QTTTTCCAAC 
TTTTTAACTC ATTCCAACCA OGAAGCTTTT 
AAAATAGTCT CATCTCTTTT TTTCTCAAAT 
AGTrACACTQ TGATQACTOQ CCTATTACCT 
TTTTTAAAAA ATGCAACTAA OTGGTTAATA 
OAAAGOTTOT OTGTCGTTOC T TTITOWn ' 
TTATACTTTC TAATAAATTT GCA8TTTCAT 
AAMMAAAAAC AHYWTTGGGG GGGCTT60GC 
TGGGGCGGOO G(3GCCCACGT AQGTACX3GCG 
GAAACCCTGG CCAAGAAAAA G6TGG06AGA 
GGAAAGOGCA GAGTGTTGOQ TAAACCACAC 
G6ACTCAACC AGGAGGACCC AAGGGAAGCC 



T6CAGGAAGT 6GGQCTAAAG T66CATTCAQ 2880 

TTTTAACCAA ATCCAAAGGA TGTTACAOAA 2940 

AAAAAAAAAA GAAAGAAAGA AAGAAAGAAA 3000 

TCATTTTTAA AATAGCACTT GAGTTATTTT 3060 

CAGCCAGAAT GTGTTAGAAC TCTGGCTQAA 3120 

TATTTCTCXX: TTTQATGGGG CCCCTTCTTC 3180 

AAGAATTCTA ATTTTAATTA ATTGCXSCAOT 3240 

GGCATCTTAQ GAAAAACAAA TG Q TTTTAGT 3300 

AAACAAACAG GGACATTTTT ATTATAGATT 3360 

TATAAATAGO ACACCAAAGC GQGAGGGrrT 3420 

TCAAQATGGC ACATTASTGG CCAGCAATAT 3480 

TTATACATTO OCTAAATCTA C360CAACCAG 3540 

GAGATC0GT6 TTTTATTTTA GCATTAAATT 3600 

GACTCAGCTC OCTCTACXnT GAAATTGACA 3660 

GTGTGT6A0G CTCAAAOTTA ATGTAAACTQ 3720 

TTGGTTAGGC TTQGTTTTGT TTTTTAATTT 3760 

TCTTTCTOTT TOTGCAAAWG GHMCTAHARM 3840 

CTOGQAAAAA GTTTTTAACA CCACTTCGGG 3900 

ACCACGCGGG CCCAAACGGG ACCCCAOAAG 3960 

ATTCTCCACA CCAQAAAAAA ACG06C0G6G 4020 

OCGAAOAGAG AACTCAGAAO CACACAAGG6 4080 
GATA6AGTAC G 



A151 Protein sequence » 
Gene name: 
Unigene nuroben 
Probeaet Accession #t 
Protein Accession #i 
Signal sequence i 
Transmembrane domains: 
LDLa domains i 
Cellular Localisation: 



BSTs 

He. 293616 
AH043782 
none found 
1-17 
169-191 

28-66, 70-108, 113-149 
plasma raenibrane 



1 11 21 31 41 51 

I I I I 1 I 

IWhUSPLCLL LSSAAESQLL PGNNFTNECS IPGNTMCSNG RCIPQAWQCD GLPDCPDKSD 60 
EKECPKAKSK OQPTPPPCAS OIHCZIGRFR CNGPEDCPDG SDEENCTANP UiCSTARYHC 120 
KNGLCIDKSF ICDGQNNCQD NSDEBSCBSS QBFGSGQVFV TSENQLVYYP 8ITYAII0SS 180 
VIFVLWALL ALVLHHQRKR NNLMTLPVHR LQHFVLXiSRL WLDHPHKCN VTYNVNNGIQ 240 
YVASQAEQNA SEVOSPPSYS BALLDQRPAN YDLPPPPYSS OTBSLNQADZi PPYRSRSGSA 300 
1TSA88QAA88 LLSVEDTSHS PGQPQPQEOT AEPRDSSFSQ OTBEV 



A152 DMA SEQUEMCB 

Gene name: Transmenibrane protease, serine 2 (TMPRSS2} 

Unigene number: Bs. 105807 

Probeset Accession #i T48536 
Nucleic Acid Accession fi: I9M_005656.1 

coding sequence: 57*1535 (underlined'lBequences correspond to start and stop codons) 



1 II 21 

I I I 

GTCATATTGA ACATTCCAGA TACCTATCAT 
CTTTQAACTC AGGGTCACCA CCAGCTATTG 
GGGAAAACCC CIATCCCGCA CAGCCCACTQ 
CTCAOXACTA CCCSTCOCCC GTGGCCCAGT 
ACOCCGTCGT CTGCACGCAO CCCAAATCCC 
AGAAAGCACT GTGCATCAOC TTQACCCT6G 
CTGGCCTACT CTGGAAGTTC ATGGGCAGCA 
CCTCAGGTAC CTGCATCAAC CCCTCTAACT 
GGQAOQAOQA QA A T O GGTOT GTTC9QCCTCT 
CATCTCAGAG GAAGTCCTG6 CACCCT G TGT 
GGGOGGCCTG CAGGQACATG GGCTATAAGA 
ATXSACAGCGG ATCCACCAGC TTTATGAAAC 
ATAAAAAACT GTACCACAGT GAT6CCTGTT 
TAGGCTGOGG GGTCAACTTO AACTCAAGCC 
OGCTCCCGGQ GGCCTGGCCC TGGCAGGTCA 
GAGGCTCCAT CATCACCCCC GAGTGGATOG 
TTAACAATCC ATGGCATTGG AOGGCATTTO 
ATGGAGCOGG ATACCAAGTA CAAAAA6TGA 
AGAACAATOA CATTGCGCTO ATGAAGCTGC 
AACCAGTOTG TCTGCCCAAC CCAG6CATGA 
CGGGOTGGGG GGCCACCGAG GAGAAAQGGA 
TGCTTCTCAT TGAGACACAG AGATGCAACA 
CAGCCATQAT CTOTGCCGGC TTCCTGCAGG 
GA06GCCTCT GGTCACTTCG AACAACAATA 
GTTCTGGCTO TGCCAAAGCT TACAGACCAO 
ACTGGATTTA TCGACAAATG AAGGCAAAOG 
OGTTTTACAA GAAAACAATO GGGCTGGTTT 
ATGATTCAGA GGTCACTTCA TTTTTATTAA 
TGOCATACTO TGCAGGCTGC AOTGGCTGGC 



31 41 51 

I I I 

TACTGGATGC TGTT6ATAAC AQCAAQATGQ 60 

GACCTTACTA TGAAAACCAT GGATACCAAC 120 

TG6TCCCCAC TQTCTAOGAO GTGCATCOQG 180 

A06CCCOGAG G6T0CTQACG CAGGCTTCCA 240 

CATCOGGGAC AGTGTGCACC TCAAAGACTA 300 

GGACCTTCCT CGTGGGAGCT GCGCTGGCOQ 360 

AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

GGTGTGATGG CGTGTCACAC TGCCCOQQOG 480 

ACGGACCAAA CTTCATCCTT CAQATGTACr 540 

GCCAAQAOQA CT6GAACGAG AACZAOGGGC 600 

ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

TGAACACAAG TGCOGGCAAT OTCGATATCT 720 

CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780 

GCCAOAGCAG GATCGTGGGC GGTGAGAGCG 840 

GCCTGCAOGT CCAGAACGTC CACGTGTGCG 900 

TGACAGC06C CCACTGCGTG GAAAAACCTC 960 

COGOGATTTT GAGACAATCT TTCATGTTCT 1020 

TTTCTCATCC AAATTATGAC TCCAAQACCA 1080 

AOAAGCCTCT GACTTTCAAC GACCTAGTQA 1140 

TGCTGCAGCC AGAACAGCTC TQCTGGATTT 1200 

A6ACCTCAGA AGTGCTQAAC OCTGCCAAGO 12 SO 

6CAGATATGT CTATGACAAC CXOATCACAC 1320 

GGAACGTGGA TTCTTGCCAG GGTGACAGTG 1380 

TCTGGTGGCT GATAOGGGAT ACAAGCTGGG 1440 

GAGTGTAOGG GAATGTGATQ 6TATTCACGG ISOO 

GCTAATCCAC ATGGTCTTOO TCCTTOAOGT 1560 

TGCTTCCCCG TQCATQATTT ACTCTTAGAG 1620 

ACAGTGAACT TGTCTGGCTT TGGCACTCTC 1680 

CTGCOCAGCC TGCTCTOCCT AACCCCTTGT 1740 



903 



wo 03/042661 



PCTAJS02/36810 



CXX3CAAGGQQ TGATG6CGGG CTGOTTGTOG GCACTGGOGG TCAATTGTGG AAGOAAOAOG 1800 

QTTGGAGGCT GCCCXXy^TTG AGATCTTCCT GCTGAQTCCT TTCCAGOGGC CAATTTTGGA 1860 

TOAGCATGGA GCTGTCACTT CTCAGCTOCT GGATGACTTG A6ATQAAAAA GGAGAGAC^T 1920 

GQAAA6GGAG AOVGCCAGGT OGCACCTGCA GGGGCTGCCC TCTGGGGCCA CTTGGTAGTG 1980 

TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGOTTC TTAOAGCCTT AGCAGCCCTG 2040 

OATOGTGGCC AGAAATAAAO GGACCAGCCC TTCATGGGTG GTGACGTGGT AGTCACTTOT 2100 

AAOGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTQ CCCTTGGTGC 2160 

QAGGGAAGCA ATTGAAAAGG AACTTGCCCT QAGCACTCCT QGTQCAGGTC TCCACCTGCA 2220 

CATTG G GT GG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 

TCCTAGGACC CTGGAGAGTG AATGCOCCTT GGTCCCTGGC AOGGOGCCAA GTTTGGGACC 2340 

ATGTOGGCCT CTTCAG6CCT OATAOTCATT GGAAATTGAO GTOaVTQGGO GAAATCAAGG 2400 

ATGCTCA6TT TAAGGTACAC TGTTTCCAT6 TTATGTTTCT ACACATT6AT GGTGGTGACC 2460 
CTGAOTTCAA AGCXA.TCTT 



A153 Protein sequence » 

Gene namet Transmenibrane protease, serine 2 (TMPRS92) 

Unlgene nuRibert He. 105807 

Probeset Accession ftt T48536 

Protein Accession «i NP_00S647.1 

Signal sequence: none found 

Transnieinbrane domains: 85-107 

Cellular Localization: plasma menibrane 



I 
I 

MALHSGSPPA 
SNPWCTQPK 
DSSGTCINPS 



CLACGVNUIS 
PLNNPKHHTA 
VKPVdiPNPG 
TPAMICAGPL 
TDWIYRQMKA 



11 
I 

IGPYYENHGY 
SPSGTVCTSK 
NWCDGVSHCP 
XNNPXSSQGI 
SRQSRZVGGB 
FAGILRQSFM 
MMLQPEQLCW 
QGNVDSCQGD 
NO 



21 
I 

QPEMPYPAQP 
TKKAliCITLT 
GGEDENRCVR 
VDDSOSTSFH 
SALPGAWPWQ 
PYGAGYQVQK 
ISGWGATEEK 
SGGPLVTSNN 



31 

I 

TWPTVYBVH 
LGTFLVGAAL 
LYGPNFILQM 
KLNTSAGNVD 
VSLHVQKTVHV 
VISHPNYDSK 
GKTSEVLNAA 
MIHWLIQDTS 



41 
I 

PAQYYPSPVP 
AAGLLWKFMO 
ySSQRKSHRP 
XYKfCLYHSDA 
CX3GSZITPBH 
TXNMDIALKK 
KVLLIETQRC 
NG8GCAKAYR 



51 
I 

QYAPRVLTQA 60 

SKCSNSGIEC 120 

VCQODHNENY 160 

CSSKAW8LR 240 

XVTAAHCVEK 300 

LQKPLTFNDL 360 

NSRYVYDNLI 420 

PGVYGNVMVF 480 



A1S4 DNA SEQUENCE 

Gene name: Bono sapiens eONA FLJlSSei Cis, clone PLACB1009039 
Unigene numbers Hs. 12 917 9 

Probeset Accession #i AI694767 
Nucleic Acid Accession tt: AI694767 

Coding sequence i 130*1086 (underlined sequences correspond to start and stop codons) 



1 11 

I I 
CAGAGAGGCT GTATTTCA6T 
GGGGTCACAC ATTCCTTCCA 
AGCTTCTT CA TGA TGGTGGA 
ATAGGGCTGC CIGGTTTAGA 
TACCTTATTO CTOTGCTAGG 
CTGCATGAGC GCATGTATAT 
ACCTCATCCA TGCCCAAAAT 
GATGCTTGTC TGCTACAGAT 
CTQCTGQCCA TGGCTTTTGA 
GTACTTACGT TGCCTOGIGT 
CTOATGGCAC CCCTTCCTGT 
TCCCATTCCT ACTGCCTACA 
AATGTCGTCT ATGGCCTTAT 
TCCTTCTCAT ATCTGCTTAT 
AAGQCATTTG GCACTTGCGT 
ATTG6ATT6T CCATGGTGCA 
TTGGCCAATA TCTATCTGCT 
ACAAAGGAGA TTC6ACAGCX3 
CCCTAGGTGT CA6TGATCAA 
GTTAAGATTT TGGMVGACAO 
ATCCTTCAAA TATGAAACTG 
GTTTTCTTGC TACATATAAT 
TTTTCATTTT ACCAT6CAOT 
OAOATAAGAA TGGTACATCT 
TAAAGACAGA ATATAATAAA 
ACTCCCAACC ACATTGGATC 
AAATAATTTT TCCTCXGQAC 
AOAOTACATT TACCTAOGTT 
ATGGACCCTG TTTTTCCTAT 
TT31QTACCCT CATT8TAGCC 
GGGQTCATAC AAOTATAAAA 
GGAAOAACTG TTAAAGAGAC 
TTCTARAGGA GGTATTTAAT 
6CAACAGAAC TCATGGCTTT 
AATTACCTGT GTCTTGQAAO 
AGAAAGTCTX3 CATAGGGCTT 
TQATAOGCAO T6AGGTTA0G 
TGAAQATAAC ATTGGCCTTT 



21 31 41 51 

i I I I 

GCAGCCTGCC AS A CCTCTTC TGGA66AAGA CTGGACAAAa 60 

TACQOTTQAG CCTCTACCTG CCTCGTGCTG OTCACAQTTC 120 

TCCCAATGGC AATGAATCCA GTGCTACATA CTTCATCCTA 180 

A6AGGCTCA0 TTCTGOTTGG CCTTOCCATT GTGCTCCCTC 240 

TAACTTGACA ATCATCTACA TTGTGCGQAC T6AGCAGA6C 300 

ATTTCTTTGC ATGCTTTCAa GCATTGACAT CCTCATCTCC 360 

GCTGOCCATC TTCTGGTTCA ATTCCACTAC CATCCAOTTT 420 

GTTTGCCATC CACTCCTTAT CTGGCATGQA ATCCACAGTQ 480 

CCGCTATGTG GCCATCTGTC ACCCACTQOQ CCATGCCACA 540 

CAGCAAAATT G6TGTG6CTG CT6TGGTGCQ GGGGGCTGCA 600 

CTTGATCAAO CAGCTOCCCT TCTGCOOCTC CAATATCCTT 660 

OCAAOATGTC ATGAAGCTGG CCTGTGATGA TATCOGGGTC 720 

CGTCATCATC TCCGCCATTG GCCTGGACTC ACTTCTCATC 780 

TCTTAAGACT GTGTTGGGCT TGACACGTGA AGGCCMSGCC 840 

CTCTCAT6TG TGT6CTGTGT TCATATTCTA TGTACCTTTC 900 

TOGCTTTAGC AA606G0GTG ACTCTCCACT GCCGGTCATC 960 

GGTTCCTCCT GTGCTCAACC CAATTGTCTA TQGAGTGAAG 1020 

CATCCTTOQA CTTTTCCATG TGGCCACACA GQCTTCAGAG 1080 

ACTTCTTTTC CATTCAGAGT TCTCTGATTC AGATTTTAAT 1140 

TAarrCAGAAA AAAAATTTCC TTAATAAAAA TACAACTCAO 1200 

GTTGQOaAAT CTOCATTTTT TCRATATTAT rn'CTl ' ClTr 1260 

TATTAATAOC CT6ACTAGGT TGTGGTTGQA GGGTTATTAC 1320 

CCAAATCTAA ACTGCTTCTA CTGATGGTTT ACAQCATTCT 1380 

AOAGAACATT TGCCAAAGGC CTAA6CACAO CAAAGGAAAA 1440 

ATQAOATAAT CTAOCTTAAA ACTATAACTT CCTCTTCAOA 1500 

TCAGAAAAAT ACTGTCTTCA AAATGACTTC TACAGAOAAO 1560 

ACTAGGACTT AAGGGGAAGA TTGGAAOTAA AGCCTTGAAA 1620 

AATGAAAGTT GACACACTGT TCTGAGAGTT TTCACAGCAT 1680 

TTAATTTTCT TATCAAOCCT TTAATTAGGC AAAGATATTA 1740 

AIGOOAAAAT TGATOTTCAO laQOOATCAO TGAATTAAAT 1800 

ATTAAAAAAA AAAOACTTCA TOOCCAATCT CATATGATGT 1860 

CAACAGGGTA GTGGGTTAGA GATTTGCAGA GTCTTACATT 1920 

TTCTTCTCAC TCATCCAGTG TTGTATTTAO GAATTTCCTQ 1980 

AATCCCACTA GCXATTQCTT ATTOTCCTOa TCCAATT OCC 2040 

AAOTQATTTC TAGGTTCACC ATTATGGAAG ATTCTTATTC 2100 

ATAGCAAGTT ATTTATTTTr AAAAGTTCCA TAGGTGXTrC 2160 

GAGOCACCAO TTATQATGQG AAOTATGOAA T66CA6GT0T 2220 

TGAQTBTGAC TOGTAGCTGQ AAA0TGA6GO AATCTTCAGQ 2280 

904 
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PCTAJS02/36810 



ACCATGCTTT ATTTGQGGCT TTGTGCW?rA TGGAACAOGG ACTTTGAGAC CGGGAAACCA 2340 

ATCTGACTTA GGCATGGGAA TCAGGC3VTTT TTGCTTCTGA GGGGCTATTA CCAAGGGTTA 2400 

ATAGGTTTCA TCTTCAACAG GATATGACAA CAOTCTTAAC CAAOAAACTC AAATTACATA 2460 

TACTAAAACA TGT6ATCATA TATGTGGTAA tflT TC A TTTT CTTTTTCAAT CCTCAGGTTC 2520 

CCTGATATGG ATTCCTATOA CATGCTTTCA TCOCCTTTTO TAATGGATAT CATATTTCGA 2580 

AATGCCTATT TAATACTTGT ATTTGCTGCT GQACrGTAAG CCCATGAGGG CACTGTTTAT 2640 

TATTGAATGT CATCTCTGTT CATCATTGAC TGCTCTTTGC TCATCATTQA ATCCCCCAGC 2700 

AAAOTGCCTA QAACATAATA GTGCTTATQC TTGACACOGQ TTATTTTTCA TCAAACCTGA 2760 

TTCCTTCTGT GCTQAACACA TAGCCAGGCA ATTTTCCAGC CTTCTTTGAG TTGGGTATTA 2 820 

TTAAATTTTA GCCATTACTT CCAATGTGAO TGGAA6TGAC ATOTGCAATT TTTATACCTG 2880 

GCTCATAAAA CCCTCCCATG TGCAGCCTTT CATGrTOACA TTAAATGTGA CTTGGGAAGC 2940 

TATGTGTTAC AGAGAOTTAA TTAACOIGAA AGGCCTGGNA ATTTTTTGNN AANNAAACTG 3000 

TGaCCMNOAO OCXXNCAACC CTTTTmOIA ATTTQOCAAN NTCCCACTTT GTANITIGGT 3060 
AAG6AGGCCA GTTGGATAA6 TGAAAAATAA AQTACTATTG TGrTC 



A155 PROTBIW SEQUBTCE 
Gene name: 
Unigene number t 
Protein Accession ft: 
Signal sequence: 
Pfam domain: 
Tranemenibrane domains: 
Cellular Loealizationi 



Homo sapiens cONA FLJ13S81 fis, clone FIjACB1009039 
Hs. 129179 

not available 
none found 
7tra_l (43-293) 

29-51, 57-79, 82-104, 203-225, 239-261, 273-295 
not determined 



1 11 21 31 41 51 

I I i t I I 

MVDPNQNESS ATYPILIQLP GLEBAQFWLA FPLCSLYLIA VLGNLTIIYI VRTEHSLHEP 60 
MYIFLCMLSG IDZLI8T8SM PKMLAIFWFN 8TTIQFDACL LQMFAIHSLS GMESTVLLAM 120 
APDRYVAXCH PIjRKATVLTL PRVTKIGVAA WROAAXMAP LFVPIKQLPF CRSNILSHSY 180 
CLBQDVfMKLA GDDXRVNWY 6LXVXXSAZG LDSLItlSFSY LLILKTVXiGL TRBAQAKAFG 240 
TCVSKVCAVF IFYVPFXGIiS MVHRFSXRRD SPLFVXLANX YLLVPPVUfP ZVYGVKTKBl 300 
RQRXLRLFHV ATHASEP 

A156 DMA SBQUEWCE 

Gene name: vasoactive intestinal peptide receptor 1 

lAiigene number: Hs. 198726 

Probeset Accession #: X77777 

Nucleic Acid Accession #: NM_004624.1 

Coding sequence: 57-1544 (underlined sequences correspond to start and stop codons) 



TOGGAGCCTG CGGA6GGTGG TGGTGGTGGT GGTGGTGGCC CTCGCCCGCC TCACTCATGC 60 

CTCCTCCTCC TCTGCTCrCG CTCAGGCGCC TGGGTGGC6G TTGOTCQGCG GTTAC6C6GC 120 

TGGTGGT06C 6GCG6GC6G0 GCTCGCTCTC GG0GAG6C0G OqOO SQATCT OGOG GOGC AG 180 

GGGGCGGOBG COGAGGTGGG 6TOGCG0GGC GGAOGCGGCT CQAGCTTCGT GCTGCGOQCT 240 

COCTCTTGOG CTCCTOGCTG CAGGAGGAOT GTGACTATOT GCAOATGATC GA6GTGCAQC 300 

ACAAGCAGTG CCTGOAGGAG OCCCAGCTGG AGAATGAOAC AATAGGCTGC AGCAAOATOT 360 

GGGACAACCT CACCTGCTGO CCAGCCACCC CTOGGGQCCA 6GTAGTT0TC TTQGCCTQTC 420 

CCCTCATCTT CAAGCTCTTC TCCTOCATTC AAGGCGGCAA TGTAAGCCGC A6CTGCACCO 480 

A06AAGGCTG GA06CACCTG GAGGCTGGCC CGTACCCCAT TGGCTOTOOT TTGQATGACA 540 

AGGCAGOGAG TTTGGATGAG CAGCAGACCA TGTTCrACGG TTCTGT6AA6 AC06GCTACA 600 

CCATTGGCTA OGQCCTGTCC CTCGCCAOCC TTCTGGTOGC CACAGCTATC CTGAGCCTGT 660 

TGAGGAAGCT CCACTGCACO OGGAACTACA TCCACATGCA GCTCTTCATA TCCTTCATCC 720 

TGAGG6CTGC CGCTOTCTTC ATCAAAGACT TGGCCCTCTT GGACAGCGOG QAOTCGQACC 780 

AOTGCTCGQA GGGCTCG6T0 GGCTGTAAOO CAOOCATGOT CTTTTTCCAA TATTQ TGTCA 840 

TGGCTAACTT CTTCTGGCTG CTGGTGGAGG GCCTCTACCT GTACACCCTG CTTGCCGTCT 900 

CCTTCTTCTC TGAQOQGAAa TACTTCTGGG GGTACATACT CATCQaCTQG GGGGTACCCA 960 

QCACATTCAC CATGGTGTGG ACCATCGCCA GGATCCATTT TGAGGATTAT GGTCTGCTCA 1020 

GOIGCIGGGA CACCATCAAC TCCTCACTOT GOTGGATCAT AAAGGGCCCC ATCCTCACCT 1080 

CCATCTT66T AAACTTCATC ClXJ ' m ' A ' iTi ' GCATCAT C OG AATOCTGCTT CAGAAACTGC 1140 

GGCCCCCAGA TATCAGGAAO AGTGACAGCA GTCCATACTC AAGGCTA6CC AGGTCCACAC 1200 

TCCTGCTGAT CCCCCTOTTT GOAGTACACT ACATCATOTT CGCCTTCTTT CCGGACAATT 1260 

TTAAGCCTGA AGTGAAGATO GTCTTTGAGC TCGTOQTGGQ GTCTTTCCAG GGTTTTCTOO 1320 

TGGCTATCCT CTACT6CTTC CTCAAT8QT0 AaOTGCAGQC G6A6CTGA6G CQGAAOTOQC 1380 

GG0GCTG6CA CCT6CAGGGC GTCCTGGGCT GGAACCCCAA ATAC006CAC OCGTCSG G AG 1440 

GCAGCAA06G CGCCACOTGC AGCAOGCAGG TTTCCATGCT GACCCGCGTC AGCCCAGGTO 1500 

CCOOCOGCTC CTCCAQCTTC CAAGCCGAAO TCTCCCTGGT CTGACCACCA GGATCCCAGC 1560 

OCAAGCGGCC CCTCCC6CCC CTTCCCACTC GCA6CAGACG CCGGG6ACAG A66CCTGCCC 1620 

GGGOQCOCCA GCCCOGGCOC T6GGCT0QQA QQCIQCCCCC GGOCOOC T GQ TCTCTG O T C C 1680 

GQACACTCCT AGAGAA06CA GCCCTAGAGC CTQCCTGGAO OGTTTCTAGC AAGTGAGAGA 1740 

GATGGOAOCT CCTCTCCTGG AGGATGCAOG TQGAACTCAG TCATTAOACT CCTCCTCCAA 1800 

AGGCCCCCTA CGCCAATCAA GGGCAAAAAG TCTACATACT TTCATCCTGA CTCTGCCCCC 1860 

TGCTGGCTCT TCTGCCCAAT TGGAGGAAAG GAACaSGTGG ATCCTCAAAC AACACTGGTG 1920 

TOAOCTGAGQ GCAGAAAGOT TCTQCCOOGG AAOOTCAOCA GCACOVACAC CAOOCTAGTG 1960 

CCTGAAATTT CACCATTGCT GTCAAGTTCC TTTGQGTTAA GCATIAGCAC TCAG6CATTT 2040 

GACTGAAOAT OCAOCTCACT ACCCTATTCT CTCTTTAOGC TTAGTTATCA GCTTTTTAAA 2100 

GTGGGTTATT CTGGAGTTTT TGTTTGGAGA GCACACCIAT CTTAGTGG'ri' CCCCACOGAA 2160 

OTGGACTGGC CCCTQG6TCA GTCTGOTGGG AGQAGGOTGC AACCCAAGGA CTQAGGQACT 2220 

CTGAAGCCTC TGGGAAATGA 6AAGGGAGGC AOCAjaCSAAT GCTAGGTCTC GGACTAAGCC 2280 

TACCTOCTCT CCAAGTCTCA GTG6CTTCAT CTOTCAAOTQ GGACTCTOTC ACACCA80CA 2340 

TTCTTATCTC TC T g lt X TIXlT OaAAGOUVCA GGAATGAAGA GACTGOOCT C CTTQTCCACC 2400 

CACCTATGrO CCAACTQTTG TAACTAGGCT CAGAGATGTO CAOCCATGQO CTCTQACAQA 2460 
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10 

15 

20 

25 

30' 

35 



AAOCAGATCC TCACCCTGCT ACACATACAG GATTTGAACT CWSATCTGTC TGATAOGAAT 2520 

GTCAAAGCAC GGACTCTTAC TGCTAACTTT TGTGTATCGT AACCAGCCAQ ATCCTCTTGO 2580 

TTATTTGTTT ACCACTTQTA TTATTAAIQC CATTATCCCT GAATTCOCCT T6CCACC0CA 2640 

GOCTCCCrCO AGTGTQOCIO AGGAGGOCTC CATCTCATGT ATC A TCT GG A TAGGAGCCIXS 2700 

CT6GTCACAG CCTCCTCT G T CTGCCCTTCA CCOCAGTGGC CACTCnBCrT CCTAOCCACA 2760 

GCTCTGCCAO AAQATOCOCT CAGGACTQGA ACAGQCTTGT 6CAACAATAA ATGTTGGCTT 2820 
G6AAAAAAAA AAAA 



A157 Protein sequence: 
Gene names 
Unigene number t 
Probeset Accession #$ 
Protein Accession #t 
Signal sequence: 
Transmembrane domains i 
Cellular Localization t 



vasoactive intestinal peptide receptor 1 

HS.15B726 

X77777 

JC2195 

none found 

lBl-202« 214-236« 255-277, 290-311, 332-354, 377-399. 408-430 
plasma membrane 



1 
I 

MPPPPLLSLR 
RSLLGSSLQE 
CPLIFRLFSS 
YTIGYGLSLA 
DQCSE6SVGC 
PSTFTMVWTI 
LRPPDIRKSD 
WAILYCFLN 
GARRS8SFQA 



11 
I 

RLGGGWSAVT 
ECDYVQMIEV 
IQGRNVSRSC 
TIXVATAILS 
KAAMVFPQYC 
ARIHFEDYGL 
SSPYSRLARS 
GEVQAELRItK 
EVSLV 



21 
1 

RLWAAAGAR 
QHKQCLEEAQ 
TDEGKTHLEP 
LPRKLHCTRN 
VMANPPWLLV 
LRCWDTINSS 
TLLIjZPLFGV 
WRRWHLQGVL 



31 

I 

SR0GRGGSHG 
LENETIGCSK 
GPYPIACGLD 
YIHMHLPISP 
GGLYLYTLLA 
LHHIIKGPIL 
HYIMFAFFPD 
GWNPKYRHPS 



41 

I 

AGGGGRGGVA 
MWDNLTCWPA 
0KAA5LDEQQ 
ILRAAAVFIK 
VSPPSERKYP 
T8ILVNF1LF 
NFKPEVKMVF 
GGSHGATG8T 



51 
I 

RRRRLELRAA 
TPRGQWVIA 
TMPYGSVKTG 
DliALFDSGES 
WGYILIGWGV 
ICIIRILLQK 
ELWGSFQGF 
QVSMLTRVSP 



60 
120 
180 
240 
300 
360 
420 
480 



A158 DWA SBQUEMCS 
Gene names 

unigene number t Hs.29383 
Probeset Accession #i AW207206 
Nucleic Acid Accession #s AIil336l9 

Coding sequence t 1-2070 (luiderlined sequences correspond to start and stop codons) 



40 
45 
50 
55 
60 
65 
70 
75 
80 



ATGAQCGGTG 
CGGCGCCX3GC 
CTCAGGCAGA 
CAGCAGCAGC 
GAAAACAAGG 
ACACTGCOGC 
GGGGGAACAC 
CC TOTATG CC 
AGCCGTGGCT 
GGGCCTGAGO 
CCAAGTAGAG 
CCTCAGATTO 
AT6CTGGGGG 
GGAOCAACCA 
GCACTTCCCC 
GCrCACTTCC 
TGGA6CCAGC 
GACATOGAGA 
CTQTTCTGGQ 
GAGAOGACAC 
COCTCCTGCT 
GOOGCrCGCT 
AGGCTGAAGG 
GGOQQTAGOG 
TCTGTGAAGT 
TCCTTCAACA 
CCCCTACTTC 
GAGAAAGCAG 
AGGCAGATGG 
ACCACACTTA 
ACCCAAQAGC 
CCGGAGGAAG 
AAQAGCCrCT 
CT6AAGCAGA 
AAA06G06CC 



11 
I 

OQGGGGTGGC 
GCCAGOGCCC 
0O6ACO0GCA 
ACTCGGAGAT 
GTGA6C0G6C 
TCC0GCA6CA 
AGGACGGGGA 
AACCC AGTGQ 
GGAGQATQTT 
TCATTGCAGG 
CTGAAATGGO 
CTGCTGTGGC 
CCCAGGGGAT 
TOGGGA CAAA 
ATCCTGACAO 
CATTATCTTT 
CTGGOAACAT 
AGGGGGTTGA 
CAAAGTOTQO 
GGOAAGAGGC 
TTCCAQATGG 
GGGTCTCCAT 
AGGGCTCCTC 
CCQACACTOT 
CCATCTCTAA 
AGCAAGATTC 
ACAACAGCAA 
AGGCCrCTAA 
GGGCGGGGGC 
GGCAGTGOGA 
TGCGGCACCT 
CTAOCTTTCC 
CCAAOAAATQ 
CCCCQAAGAA 
TGCATOGCTC 



21 

I 

GGCTGGGACO 
CTCT6TGGGC 
6AAA06GAAC 
GCTGGCCAAO 
GG6GGGGCCT 
CA6AAACACA 
GCCCCrCCAG 
OTACAOGrrC 
ATGCAGOCAA 
GCGGCAGGTG 
AAGGAACCCC 
CAGGCCCAGG 
ATGQACACAC 
GGGAGGAAGC 
OGQCCCCCAC 
GGGGCTGGGG 
CGCAGCTQGG 
GGQAGGGCCC 
CCCAAOT060 
CATGCTTTCC 
OCCCTCAGGA 
CAACGQAGTG 
ACGGACACAC 
GCGCTCTCCT 
TTCAGGCAAC 
AAAAGCTQAC 
GCTGGACAAA 
TGCAGQAGCT 
ACACCCCCCA 
AGTGCTCATC 
CAAOTCCCTC 
CAGGGACCAA 
CCTGAGCCCA 
CAACTTTGOC 
AGT6CTTTGA 



31 
I 

CGOCCCCCCA 
GTCCAGTCCT 
CTGGACCTGG 
CTCCATGAGG . 
AG6CGSGCCC 
GCCATCAACT 
ACTGTOCTTO 
TGGGGGACCT 
GCACAQCACS 
GCCACAGGGT 
TGGGACA6CC 
ATTTCCA6CC 
TOCATCCAGG 
AGAGTCCTGT 
CCAQCCCAGG 
CTGACATCAG 
GCAGTGCCTA 
TTCCCTAGCC 
CAG0CCCA6C 
CTCG6GACCT 
AACCACCTTT 
TGGGTAGAGC 
AGGCCAGGAO 
GCAQACAGCC 
TCTCAAGGCA 
GTCTCCCAQA 
GTTCCTGGGO 
GCCTOTATGG 
ATQATCCTGC 
CGCGAGCTGT 
CTGGAAGGGA 
6AAGCCA0GC 
CCTOTGGOGG 
GAGAG6CAGA 



GQAATACCAA 
GCCAGAGGCC 
ATTTCCCCAA 
AGOSTGCCAT 
AGAG6CTGCA 



51 
I 

CCOGGGCTCT 
GAGCCCGCAG 
GCAGTTCCTG 
TCTGAAGG66 
G6CACACTCA 
CCTGGGCTCA 
TGCACXGGCC 
OGCTACCTCT 
GGGAAGCGCA 
CCTCCCTCCT 
TAGATCTTTG 
QAGTCCTCAC 
TGCCATCTGG 
CTTOTOCAAG 
GTGGTCTGAA 
GACTGGTGGA 
TTCCCAGGGA 
CTGCAGTGAG 
TGGGQAOGCT 
GTGTCCCAAG 
TQCTCCCTTG 
CAGCCCTGCC 
GOGTCTTOCG 
AAGCTTCCAG 
CCAGCCOGGC 
GGAAGAGGAG 
GGCCAGAAAG 
GCACCAQGGC 
GCGAAAGCCC 
CCTCCTGCAQ 
CCAGGCA6CC 
GGTCTOCACC 
CCTGCCGOCA 
G6CAAT8CA0 



A1S9 Protein sequence i 
Gene name: 
Unigene ntimbert 
Probeset Accession fit 
Protein Accession ttt 
Signal sequencer 
Transmembrane domains t 
Cellular Localizations 



BSTS 

Hs.29383 
AW207206 
T434S7 
none found 
303-322 

not determined 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



906 



wo 03/042661 



PCTAJS02/36810 



1 IX 21 

I I 1 

MSGAGVAAGT RPFSSPTF6S RRRRQRPSVG 
QQQBSEMLAK LHEEIEHLKR ENKGEPAROP 
GGTQDGEPLQ WLAHLAAIA PVOQPSQYKF 
GPEVIA6RQV ATGCSPOLPP PSRAEMGRNP 
MLGAQGZWTH 8XQQSLPAIW AATNQTRGQS 
AHFPLSLGLO LTSQGHLTQS WSQPGNZAAO 
LFWAKCX3PSR QPQPC8AGDA DRTRBEAMLS 
GARWVCINGV HVEPGGPSPA RLKEGSSRTH 
SVKSISNSAN SQGKARPQFO 8FNXQDSKAD 
EXABASNAQA ACMQN9QHQQ RQNOAaABPP 
TQBLRHLKSL LBGSQRPQAA PEEASPPRDQ 
LXQTPKNNFA ERQKRLQAMQ KRRLKRSVL 



31 41 51 

t I I 

VQSLRPOSPQ JiRQSDFQKRN LDLEKSUQPL 60 

RPAXiPPQAHS TLPLPQHRNT AINSSTRLGS 120 

HGTNTDAATS SRGWTMLC8Q AQHVLLSGSP 180 

UDSPCPARSL FQIAAVARPR ZSSPMALSPB 240 

RVZiFPCHl.SK ALPRPDSOPR PAQOPGIiHSQ 300 

AVPRALPSQG DMEXGVEGGP PP8RCX3N8SB 360 

LGTCCSMCFK PSCFPIX3PSQ NHLSRASAPL 420 

RPOGKRORLA GGSAZXTVRSP ADSLSMSSFQ 480 

VSQXADIiEEB PLI^SKLDK VPGVQGQARK 540 

MZIiPZmUCP TTLRQCBVLZ RSLinmiLLQ 600 

BATHPPXCVST KSLSXKCLSP PVAERAZLPA 660 



A160 DMA 9EQ0ENCB 

Gene namet LZV-1 protein, estrogen regulated 
Unigene number i Hs.79136 
Probeset Accession #t U41060 
Nucleic Acid Accession fti im_012319.2 

Ooding sequence: 138-2405 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

111)11 

CTOGTOCOSA ATTGGGCAGS AGACOGOGTG TTOGOGCCTG GTA6AGATTT CTCQAAGACA 60 

CCAGTGGGCC CGTGTGGAAC CAAACCTGOG OGGQTQGCOO GGCCGTQQGA CAACGAG6CC 120 

GOSGAGACXSA AGGCGCAATG GOGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 

AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 

ATCATCTACA ACAGCTTTTC TACCGCTATO GAOAAAATAA TTCTTTGTCA GTTGAAOGGT 360 

TCAGAAAATT ACTTCAAAAT ATAOQCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCACGACCA TCACTCAGAC CAOQAOCATC ACTCA6ACCA T6AGCX?rCAC TCAGACCATG 480 

AOCATCACTC AGACCAOQAO CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 

CTGCTTCTGO TAAAAATAAO GGAAAAGCTC TTTGCCXrAGA CCATGACTCA GATAGTTCAG 600 

6TAAAGAT0C TAGAAACAOC CAGOGaAAAO QAGCTCACCO ACCAQ AACAT GCCAGTOOTA 660 

GAAOQAATQT CAAGQACAGT GTTA6TGCTA GTGAAGTGAC CTCAACTGTG TACAACACT6 720 

TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAQACTCC AAGACCTGGA AAACTCTTCC 780 

CCAAAGATQT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAOAQCCGG OTGAGCCGGC 640 

TGGCTGGTAG GAAAACAAAT 6AATCTGTGA GT6AGCCCC0 AAAAG6CTTT ATGTATTCCA 900 

GAAACACAAA TGAAAATCXTF CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATO 960 

GCATGGGCAT CCAGGTTCCO CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCA6CCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAOAAG GCTGAAATCC 1080 

CTCCAAAOAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAOCC ATTTCCATCA 1140 

TCAGTTTCCT GTGTCTGCrG GGGGTTATCT TAGTGCCTCT GATGAATOGG GTGTTTTTCA 1200 

AATTTCTCCT GAOTTTCCTT GTG6CACTGG COGTTQGQAC TTTGAGTOGT GATGCTTTTT 1260 

TACACCTTCT TCCACATTCT CATGC3UU3TC ACCAGCATAG TCATA6CCAT GAAGAACCAG 1320 

CAATGQAAAT GAAAA6AGGA CCACTTTTCA OTGATCTOTC.TTCTCAAAAC ATAGAAQAAA 1380 

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTO TATTTCATGT 1440 

TTCTTGTTQA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAQAAO AAAAAGAATC 1500 

AGAAGAAACC TGAAAATQAT GATGATOTGG AOATTAAQAA OCAQTTGTCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

GAGCAOACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAOTC TTGGAAOAAG 1660 

AAQAGQTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACXX»0A3 1740 

QOTGCAAOAA TAAATGCCAT TCACATTTCC AOQATACACT OGGCCAOTCA GAOOATCTCA 1800 

TTCACCACCA TCATQACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACX3VTC 1860 

CTCACAGTCA CAGCCAGCGC TACTCTOOGG AOGAQCTQAA AGATGCOGGC GTCGCCACTT 1920 

TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CA606ATGGC CTAGCAATTG 1980 

OTGCTGCTTT TACTGAAGGC TTATCAAGTQ GTTTAA6TAC TTCTQTTGCT OTGTTCTGTC 2040 

ATQAGTTQCC TCATQAATTA GOTGACTTTQ CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 

AGC AGGC TGT CCTTTATAAT GCATTQTCAG CCATGCTGQC GTATCTTGGA ATCGCAACAG 2160 

6AATTTTCAT TGGTCATTAT GCTGAAAATO TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTQATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

GTGACCATGG ATGTAQCGGC TQGGGGTATT TCTTTTTACA GAATGCTGGQ ATGCTTTTGG 2340 

GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 

AGGGAQATQA GTTTGTATGC TOTACTATGC AGOGTTTAAA GTTAGTGGGT TTTGT^TTT 2520 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTAGGTTTTA ATATTTAAGT 2580 

TArrrCTATCr TGGAQATAAA ATCTOTATOT GOVATTCAOC GGTATTACCA GTTTATTATG 2640 

TAAACAAQAG ATTTGGCATG ACATGTTCTG TATGTTTCAO GGAAAAATGT CTTTAATGCT 2700 

TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGO 2760 

TGTTTAGGAA TAAQAATGTG CATOAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2620 

AGCAAAGAAA TAAAGGA6AA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2860 

AAAAATCACA AAATnGTTG TAAATTAGAO GGQAGAAATT TAGAATTAAG TATAAAAAGO 2940 

CAGAATTAGT ATAGAGTACA TTCATTAAAC A TTTTTGTCA GGATTATTTC COGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA CTAATTAGTO TACA7TTAAC TTTGTATAAT ACAQAAATCT 3060 

AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 

TTOST GOGG G TTATATACCA GATGAGTACA OTQAOTAGTr TATGTATCAC CAGACTGGGT 3180 

TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA Wn 'Cl' GG l A' ACCTGGTTTA 3240 

CAAAATTATC AGAGTAGTAA AACTTTGATA TATAT6AGGA TATTAAAACT ACACTAAGTA 3300 

TCATTT6ATT 0GATTCA6AA AGTACTTTOA TATCTCTCAO TGCTTCA6TO CTATCATTOT 3360 
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GAGOUVrrOT CTTTATATAC GGTACTGTAO GCATACTAGG CCltSTCTGTG 6CATTCTCTA 3420 
OMCTTTCTT TTTTACACAA TAAATTCCTT ATATCRGCTT G 



A161 PROTEIN SEQUENCE 
Gene name: 
Unigene nuniberi 
Protein Accession S: 
Signal sequence i 
Pfam domain I 
Transmenbrane domains: 
Cellular Localization! 



LIV-i protein* estrogen regulated 
Hs. 79136 

NP 0364S1 

1-21 

Zip(591-743) 

330-346. 352-368, 427-444, 663-679. 668-703, 
plasma neinbrane 



730-745 



MARKX^SVILI 
FYRYGEMNSL 



SVSASEVTST 
NE8VSEPRKG 
RSCLIHTSEX 
LVAIAV6TLS 
nnCOLTALGG 
EEKVIXTDDRT 
HSHFHDTLGQ 
KGDGLHNFSD 
NALSAMLAYL 
RMOYPFIiQtlA 



11 

1 

LTFAIiSVTNP 
SVEGFRXIiLQ 
UHNHAASGKN 
VYNTVSEGIH 
FMYSRNTNEN 
KAEIPPKTVS 
GDAFLRLLFH 
IiYFMFLVEHV 
EOYLRADSQE 
SDDLIHHHHD 
OLAIGAAPTE 
GMATGIPI6H 
GMLIiGFGIML 



21 

1 

LHELKAAAFP 
NIGZOKIKRZ 
KRKALCPDHD 
PLBTIBTPRP 
PQECFNASKL 
LQIAWVQQFI 
SHASHHHSHS 
LTLIKQFKDK 
PSHFDSQQPA 
YHHIIKHEHH 
GLSSGLSTSV 
YAQ)V8M»IP 
LISIFEHKIV 



31 

I 

QTTBKISPNW 
HIHHDHOHHS 
SDSSGKDPRN 
GKLFPKDVSS 
LTSHGMGIQV 
AISIISPLSL 
HEBPAMEMKR 
KKKNQKKPEN 
VI4EEEBVMIA 
GINHHPKSHSQ 
AVPCHELPHE 
ALTAGLFMYV 
PRIMP 



41 

I 

BSGINVDUa 
DHEKHS33HER 
SQGKGAHRPB 
STPPSVTSKS 
PLMATBPNYL 
L0VILVPU1N 
GPLFSHLSSQ 
DDDVEIKKQli 
HAaPQEVYNE 
RYSREELRZ3A 
LGDFAVIjLKA 
ALVDMVPEML 



51 

I 

STRQYHLQQL 
aSDHEUHSDH 
HASGRRNVKD 
RVSRLAQRKT 
CPAIINQIDA 
RVFFKFLLSF 
NIEBSAYFDS 
SKYESQLSTN 
YVPRGCKNKC 
GVATLAWMVI 
GMTVKQAVLY 
HNDA8DHGC8 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 



A162 DHA sequence 
Gene name: 
unigene number i 
E>robcset Accession 
Nucleic Acid Accession fti 
Coding sequences 



bone morphogenetic protein receptor IB (ALK-6) 

HS. 87223 

AA250737 

NM_001203 

274-17 82 (underlined sequences correspond to start and stop codons) 



1 

I 

CQOGGGGCGC 
GAGGA06CGG 
6TGAAAGGAA 
CATAACCATT 
TGCCATAAGT 
AATGTGGGCA 
TTX3GGTTGTA 
QACGGATATT 
GQTTGCCTAG 
AGAAOATCAA 
CTGCCTCCAT 
AXATCTOTQA 
TATAAAAGAC 
ATTCCTCCTO 
TCAGGCCTCC 
ArrGGAAAAG 
GTQAAAGTGT 
ACAaiGTTQA 
GGGTCCTGGA 
TATCraAAGT 
AGTGGCTTAT 
CATGQAGATC 
6ACCTG6GCC 
ACTOGAGTTG 
AATCACTTCC 
GTT6CTAG6A 
CtAGTGCCCA 
CG O CCCTCAT 
ATGACAQAAT 
ACACTTGCCA 
CATCTCTGCA 
TAAGCATOCA 
CTTTCAGGGA 
TCTGTTTGTA 



11 
I 

GGAGTC6GO0 
6AGC0QGQAO 
AGGAAGATCA 
TGGCTCTGAG 
GAGAAGCAAA 
CCAAQAAAGA 
AATGCCACCA 
GTTTCACGAT 
GACTAGAAGG 
TTQAATGCTG 
TGAAAAACAQ 
CTGTCTGTAO 
AAGAAACCAO 
GAGAATCCCT 



6TCGCTATGQ 
TCTTCACCAC 
TGA0GCAT6A 
CCCAGTTGTA 
CCACCACCCT 
GTCATTTACA 
TGAAAAOTAA 
TOGCTGTTAA 
6CACCAAA06 
AGTCTTACAT 
GATGTGTATC 
OTGACGCCTC 
TCCCAAACOO 
GCTGGGCTCA 
AAATOTCAGA 
GAAA6CCAAC 
CftOTACAAaC 
GCGACCTGGG 
GGCGGAGAAA 



21 
I 

GGGCCT060G 
CQCAGOOQGQ 
TTTCATGCCT 
CTATGACAAO 
CTTCCTTGAT 
G6ATGGTGAG 
CCATT6TCCA 
GATA6AAGAG 
CTCAGATTTT 
CACAGAAAGG 
AGATTTTGTT 

' nmc rc i T G 

ACCT0C2ATAC 
GAGAGACTTA 
CCAAAGGACT 
GGAAOTTTGG 
AGAGGAAGCC 
AAACATTTTG 
CCTAATCACA 
AGACGCTAAA 
CACAQAAATC 
AAACATTCIQ 
ATTTATTAQT 
CTATATGCCT 
CATGGCTGAC 
AGGAGGTATA 
TTATGAGQAC 
GTGGAlGCAGT 
CAATCCTGCA 
OTCCCAGGAC 
AGCTACTCTT 
CTTQAACATC 
CAAAOACAG A 
COGTTGGGTA 



31 
I 

GGA06CGGGC 
GGGTQGAOTT 
TGTTGATAAA 
A6AGGAAACA 
AA CATGC TTT 
AGTACAGCCC 
GAAOACTCAG 
SATGACTCTG 
CAGTGTCGGG 
AAOGAATGTA 
GATGGACCXA 
GTCCTTATCA 
AGCATTGGGT 
ATTGAGCAGT 
ATAGCTAAGC 
ATGOGAAAOT 
AGCTGGTTCA 
GOTTTCATTO 
6ACTATCATG 
TCAATGCTGA 
TTTAGTACTC 
OTOAAGAAAA 
GATACAAATG 
CCAGAAG7GT 
ATGTATAGTT 
GTGGAA6AAT 
ATQAGGQAGA 
GA1GAGTGTC 
TCAAGGCTGA 
ATTAAACTCT^ 
CTGTTTGTGG 
GTCCTGCTTC 
G AAGCTOCC A 
ACTTOTTCAA 



41 

1 

AGT60GGAGA 
CAGCCTACTC 
GGTTCA6ACT 
AAAAGTTAAA 
TGCQAAGT6C 
CCACCCCC06 
TCAACAATAT 
GGTT6CCTGT 
ACACTCCCAT 
ATAAAGACCT 
TACACCACAO 
TATTATTTTG 
TAOAACAGGA 
CTCAGAGCTC 
AQATTCAQAT 
GGCGTGGCGA 
GAGAGACAGA 
CTQCAQATAT 
AAAATGQTTC 
AGTTAGCCTA 
AAGGCA AACC 
ATGGAACTTO 
AAOTTQACAT 
TGGAOQAQAG 
TTGGCCTCAT 
ACCAGCTTCC 
TTOTOTGCAT 
TAAGGCAGAT 
CAGCCCTGOG 
^TAGGAGAG 
GCAGAGCAAA 
CCAGTGGGTT 
GAAGGAGAOA 
GATATQATOC 



51 
1 

CCGC66G6CT 
TTTCTTAGAT 
TCTGCTGATT 
CTTACAAGCC 
AGGAAAATTA 
TCCAAAGGTC 
TTGCAGCACA 
GGTCACTTCT 
TCCTCATGAA 
ACACCCTACA 
GGCTTTACTT 
TTACTTCOGO 
TGAAACTTAC 
AGGAAGTGGA 
GGTGAAACAG 
AAAGGTAGCT 
AATATATCAG 
CAAAGGQACA 
CCTTTATQAT 
CTCTTCTGTC 
AGCAATTGCC 
CTGTATrOCT 
ACCACCTAAC 
CTTGAACAGA 
CCTTTGGGAG 
TTATCATGAC 
CAA6AA0TTA 
GGGAAAACTC 
GGTTAAGAAA 
GAAAAGTAAG 
AGACATCAAA 
CAOACCTCAC 
TTGATCCGTG 
AT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 



A163 Protein sequence 
Gene namet 
U&igene number t 
Probeset Accession A: 
Protein Accession #t 
Signal sequence 1 
Transmembrane domains t 
PFAM domains I 
Cellular Localisation t 



bone morphogenetic protein receptor IB (ALK-6} 

Rs. 72472 / Hs. 87223 

AA250737 / U89326 

NP_001194 

1-13 

128-144 

aetivin^receptor (30-111). protein kinase (204-4911 
plasma meatorane 



908 



wo 03/042661 



PCT/US02/36810 



X 11 21 31 41 51 

i I I I I I 

KLliRSAGKLN VGTKKEDGBS TAPTFRPKVL RCKOiHBCPE DSVKNICSTD GYCFTMIEB} 60 

DSGLPWTSO CL6LEGSDFQ CRDTPIPHQR RSIECCTERN ECSnCDLBPTL PPLKNROFVD 120 

GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRY8 IGLEQDBTYI PPGESLRDLZ 180 

EQSQSSGSGS GLPLLVQRTI AKQIQKVKQI GKGRYGEWfM OKWRQBKVAV KVFFTTEEAS 240 

WFReTEIYQT VLMRHENILG PIAADIKGTO SWTQLYLITD YHEKGSLYDY LRSTTLDAXS 300 

MLKLAYSSVS GLC3ILHTBIP STQGRPAIAH RDLKSKHILV KXNGTCCIAD LGLAVKFISD 360 

TNEVDIPPNT RVGTKRYMPP BVLDBSLNRN HFQSYINADM YSFOLXLNBV ARRCVSGGIV 420 

EEYQIiPYHDIi VPSDPSYEDM RBXVCZKKLR PSFFNRNSSD ECLRQMGKLII TBCHAHNPAS 480 
RliTALRVKKT UOCMSBSQDZ Kb 



A164 DNA sequence 

Gene namei ESTb 
Xftiigene nuniberi Hs.X57601 
Probeeet Accession Iti W07459 
Nucleic Acid Accession «: AC005383 

coding Sequence! 32B-2751 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 SI 

I I I I I I 

GACAOTGTTC GCGGCTGCAC CGCTCGGAOG CrGGGTGACC OGCXyTAQAAG TGAAGTACTT 60 

TTTTATTTGC AGACCTGGGC OGATOCCOCT TTAAAAAAOQ COAOGGGCTC TATGCACCTC 120 

CCTGGCXK;TA GTTCCTCCGA CCTCAGCCGG GTCGGGTOGT GCCGCCCTCT CCCAGOAGAO 180 

ACAAACAGGT GTCCCACGTG GCAGCCX3CX3C CCCGGGCGCC OCTCCTGTGA TCCOGTAGCG 240 

CCCCCXX3GCC 0GAGC06CGC COGGGTCTGT GAGTAGAGCC GOCOGGGdC CGAG06CTGG 300 

TGGCOGCTCT CCTTCCGTTA TATCAACATG COCCXTTTTCC TCTrGCTGGA GGCGGTCTGr 360 

GTTTTCCTGT TTTCCAGAGT GCOCOCATCT CTCOCTCTCC AGGAAGTCCA TGTAAGCAAA 420 

GAAACCATOG GGAAGATTTC AGCTGCCAGC AAAATGATGT GGTGCTOGGC TGCAGTOGAC 480 

ATCATGTTTC TQTTAOATGG GTCTAACAOC GTCX3GGAAAG GGAGCTTTGA AAOGTCCAAO 540 

CACTTT6CCA TCACAGTCT6 T6AGGGTCTG GACATCAGCC COGAGAGGGT CAGAGTGGGA 600 

GCATTCCAOT TCAOTTCCAC TCCTCATCIG GAATTCCCCT TG6ATTCATT TTCAACCCAA 660 

CAGGAAGTOA AGGCAASAAT CAAGAG6ATG GTTTTOUWG GA0G0CX3CAC GGAGAOGGAA 720 

CTTGCTCTQA AATACCTTCT GCACAGAGGG TTGCXM3AQ GCAGAAATGC TTCTGTGCCC 780 

CAGATCCTCA TCATOQTCAC TGATGGGAAO TCCCAGGGGG ATGTGGCACT GCCATCCAAQ 840 

CAGCTGAAGG AAAGGGGTGT CACT6TGTTT GCTGTGGGGG TCAGGlTrCC CAGGTGGGAG 900 

GAGCTGCATa GACTQQOCAO OGAQGCTAGA GGGCAOCAOQ TGCTGTTGQC TOAGCAGOTO 960 

GAGGAT6CCA CCAA0G6CCT CTTCAGCACC CTCAOCAGCT OGGCCATCTG CTCCA60GCC 1020 

AOGCCAGACT GCAGGOTOGA GGCTCAGCCC TGTGAOCACA GGACXSCTGGA GATGGTCCGG 1080 

GAGTTCGCTG GCAATQCCCC ATQCTGGAGA GGATOGOGGC GGACCCTTGC GGTGCTGGCT 1140 

GCACACTGTC CCTTCTACAG CTOGAAGAGA GTGTTCCTAA CCCACCCTGC CACXTTGCTAC 1200 

AGGAOCAOCT GCCCAOGCCC CTOTOACTOQ CA0CCCT6CC AQAAIGGAGG CACATQTCITT 1260 

CCAOAAGQAC TGGAOGGCTA CCAOTGCCTC TGCCOGCTGG OCTTT GQ AGQ GGAGGCTAAC 1320 

TGTGOCCTQA AGCT6AGCCT GGAATGCAGG OTOGACCTCC TCTTCCTGCT GQACAOCTCT 1380 

GCX^GGCACCA CTCTGGACGQ CTTCCTGCGG GCCAAAGTCT TOGTQAAGCG GTTTQTGCGQ 1440 

GCCGrrOCTGA GGGAGGACTC TCX3GGCCCQA GTGGGTOTGO CXACATACAG CAGGGAGCTG 1500 

CTGGIGGGQ6 TQCCTQTGGG GGAOTACCAO OATQTGCCT O ACiCT OG TCTG GAGCX^KSQAT 1560 

GGCATTCCCT TCCGTGGTGG CCCCACOCTG AG66GCAGT6 CCTTGGQGCA GGOGGCAGAO 1620 

OGTGGCTTOG GGAG06CCAC CAGQACAGGC CAGGACCGGC CACXSTAGAGT GGTGGTTTTG 1660 

CTCACTGAGT CACACTCCGA GGATGAGGTT GOGGGCCCAG CGOGTCACGC AAGGGCGOGA 1740 

GAGCTGCTCC TGCTQGGTGT AGGCAOTGAG QCOQT0CX3GG CAGAGCTQGA GGAGATCACA 1800 

GGOUSCCCAA A0CAT6TGAT GGTCTACTOG GATOCTCAGG ATCTGTTGAA OCAAATCCCT 1860 

GAGCTQCAGG GGAAGCTGTG CAGC06GCA0 CXSGCCSUSGGT GCOGGACACA AGCXTCIGGAC 1930 

CTOGTCTTCA TGTTGGACAC CTCTGCCTCA GTAGGGCCOO AGAATTTTGC TCAGATGCAG 1980 

AGCTTTQTGA GAAOCTGTGC CXrTCCAGTTT QAGGTGAACC CTGAOGTGAC ACAGGTOGGC 2040 

CTGGTGGTGT ATGGCAGCCA GGTGCAGACT GCCTTO GG GC TG6ACACCAA ACCCACCOGO 2100 

GCIGOGATGC TG0GG6CCAT TAGOCAGGCC CCCTAGCTA6 GTGGGGTGGQ CTCAGCOSGC 2160 

ACCGCCCTGC TGCACATCTA TGACAAAGTG ATGACOSTCC AGAGGGGIGC C06GCCTG0T 2220 

GTCCCCAAAG CTGTGGTGGT GCTCACAGGC GG6AGAG6CG CAGAGGATGC AGCCGTTCCT 2280 

GCCCAGAAGC TGAGGAACAA TGGCATCTCT GTCTTGGTOG TGGGOQTGGG GCJCTGTCCTA 2340 

AGT6AGGGTC TGCGGAGGCT TGCAGGTCXX: CX3GGATTCCC TGATCCACGT GGCAGCTTAC 2400 

GCXSACCTGC GGTACGACCA aGAGGTGCrC ATTGAGIGGC TGTGTGQAGA AGCXAAGCAO 2460 

OOVGTCAACC TCTGCAAACC CA6CCGGT6C ATGAATGAGG GCAGCTGCGT GCT6CAQAAT 2520 

GGGAGCTACC GCTGCAAGTG TCGGGATGGC TGGGAGGGGC 0CCACT6CGA GAACOGTGAG 2580 

TGGAGCTCTT GCTCTGTATG TGTGAGCCAG GGATGGATTC TTGAGAOQCC CCTQAGGCAC 2640 

ATGGCTCCOG T6CAGGAGGG CAGCAGCOGT ACCCCTCCCA GCAACTACAG AGAAGGCCTO 2700 

GGCACT gAAA TGGTGCCTAC CTTCTGQAAT OTCIOIOCOC CAOGTCC TTA QA ATOTCTqC 2760 

TTCCGQCC6T GGCCAG6ACC ACTATTCTCA CTGAG66AG0 AGGATCTCXX; AACT6CA60C 2620 

ATGCTGCTTA QAGACAAGAA A6CAGCTGAT GTCACCCACA AACX3ATGTTG TTQAAAAGTT 2880 

TTGATGTGTA AGTAAATACC CACTTTCTGT ACCTQCTGTQ CCTTGTTGAG GCTATGTCAT 2940 

CTOCCACCTT TCCCTTGAGG ATAAACAAGG GGTGCTGAAG ACTTAAATTT AGCGGCCTQA 3000 

0GTTCCTTT6 CAGACAATCA ATGCTGOOCA GAATGTTGTT QACAGAOTAA T6CCCAGCA0 3060 

AOOC CTTTA C TAOAGCATCC TTT6GAGGGC GAA06CCAC6 GCCTTTCAAG ATGGAAAGCA 3120 

GCAGCTTTTC CACTT CCCCA GAGACATTCT GOATQCATTT GCATTOAOTC TGAAAGGGGQ 3180 

CTTGAGGGAC GTTTGTOACT TCTTGGCGAC TGCCTTTTGT GTGTGQAAGA GACTTGGAAA 3240 

GGTCTCAGAC TGAATGTGAC CAATTAACCA GCTTG6TTGA TGATGGGGGA GGGGCTGAGT 3300 

T GTOCAT GQG CCCAGGTCTO 6A06GCCACQ TAAAATOGTT CTGAOTGOTO AGGAOTOTOC 3360 
ACCTTGAAG6 TCTTC 



909 



wo 03/042661 



PCTAJS02/36810 



A165 Protein sequence 
Gene name: 
Unigene number t 
Protein Accession 9: 
Signal sequence t 
Transmenibrane domains s 
VGH donrainsi 
BQP domainal 
Cellular Local Ixat Ion t 



BST8 

Bs. 157601 
none found 
1-17 

none found 

49>-323; 341-S18; 529-706 
298-333; 715-748 
secreted 



1 11 21 31 41 51 

i 1 I ) I i 

MPPFLLLEAV CVFLFSRVPP SLPLQBVHVS KETIGKISAA SKMMMCSAAV DIMPLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPBRVRV GAFQFSSTPH LBFPLDSFST QQEVXARIKR 120 

MVFKG6RTET BLAIiXYLLHR GZiPOQRNASV PQILIIVTDO K8Q0DVALPS KQUCBROVTV 180 

PAVGVRFPRW EELHALASBP ROQRVLLABQ VEDATN6LP8 TL8S6AIC88 ATPDCRVEAB 240 

PCEHRTLEMV RBFAGNAPCW R6SRRTLAVL AAHCPFYSWK RVPLTHPATG YRTTCPOPCD 300 

6QPCQNGGTC VPEGLDQYQC LCPLAFGQEA NCALKLSLBC RVDLLFLLDS SAGTTLDGFL 360 

RAKVFVKRFV RAVLSEDSRA RVGVATYSRE X^LVAVPVGEY QOVPDLVWSL DGIPFROGPT 420 

LTOSALRQAA ERGFGSATRT GQDRPRRWV IiLTBSHSEDB VAOPARKARA BELLLUSVGS 480 

BAVRAELEBX TGSPKRVMVY SDPQDI.FHQI PELQGKLCSR QRFGCRTQAL DLVFMLDT8A 540 

SVGPENPAQM QSPVRSCALQ FBVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIYDK VHTVQRGARP GVPKAWVLT 6GRGAEDAAV PAQKLRNNGI 660 

SVLWGVQPV LSEQLRRLAG PRDSI.IHVAA YAOLRYHQDV LIEWLOQEAK QPVNLCKPSP 720 

OINEGSCVLQ NGSYRCKCRD GNBGPBCaiR EH8SCSVCVS QGHILBTPLR HMAPVQBGSS 780 
RTPPSNYRSQ LGTENVPTFH NVCAFGP 



A166 DNA sequence 

Gene namei 
Unigene number: 
Probeset Accession : 
Nucleic Acid Accession t 
Coding Sequence t 



Homo sapiens beta-1 adrenergic receptor nANA, 3* XTTR 

Rs. 37744 

AA011176 

AF272890 

87-1520 (underlined sequences correspond to start and stop codons) 



TGCTACCGQC 
CCCGCOCCOQ 
AOGCGGOTAA 
TGCTGGTQCC 
CGCTGTCTCA 
TCGTGGOGGG 
TCACCAACCT 
TGCCQTTCQQ 
AGCTGTGGAC 
TTGCCCTGQA 
G0G0GC6GGC 
TQCOCATCCT 
ACOCCAAGTG 
CCTTCTAOGT 
AGAAGCAGGT 
06CCCTCGCC 
CCQCOGCCGC 
O6C6OCT0GT 
TCTTCACGCT 
A6CTGGT6CC 
TCAACGCCAT 
GCTQGOCOCO 
CGGGCTOTCr 
ACGA0GAT8T 
ACGGCGGGGC 
CCTOGGAATC 
GGQAAC6A00 
GCTGGTCTGA 
TTTGGGAAGQ 



11 

I 

GCCCGG6CTT 
6CCTCC6CAG 
CCTOTCOTOG 
CGCGTCGCCG 
GCAGTGGACA 
CAATGTGCTG 
CTTCATCATQ 
GGCCACCATC 
CTCAGTGGAC 
CCQCTACCrC 
GOGGGGCCTC 
CATGCACTQG 
CTOOQACTTC 
GCCCCTGTGC 
GAAOAAGATG 
CTCGCCCTOG 
CGCCGCCACC 
GGCCCTAOGC 
CTGCTGGCTO 
CGACC6CCTC 
CATCTACTGC 
CAGOOCTQCC 
G6CG0QGCCC 
06T0GGGGCC 
GGCGGCG6AC 
CAAGOTGTAG 
AQATCTGTOT 
ATCATCOBAG 
OAIGGSAOAG 



21 
I 

CTGGGGTGTT 
CTCGG CATGG 
QCC6CA006C 
GCCGCCTCOT 
GCGGGCATGG 
GTGATCOTGG 
TGCCTGGCCA 
GTGGTGTOGO 
GTGCTGTGOG 
GCCATCACCr 
GT6TGCACC0 
TGGOGGGGGG 
QTCACCAACC 
ATCATGGCCT 
GACAGCTGCO 
CCCGTCCCCG 
GCOOO GC TGQ 
GAGCAGAAGO 
CCCTTCTTCC 
TTCGTCTTCT 
CGCAGCCCOG 

GOACCCOOGC 
AC0CC6OCGG 
AGCGACTCGA 
GGCCOGGCGC 
TTACTTAA6A 
GCAAAGAOAA 
TGGCTTGCTG 



31 
I 

CCCCAACCAC 
GOGCGGGGQT 
TCCCGQAOQO 
TGCTGCCTCC 
GTCTGCTGAT 
CCAT06CCAA 
GCGCOGAGCT 
6C06CTGGGA 
T6ACGGCCAG 
CGCCCTTCCG 
TGTGGGCCAT 
AQAGGQACGA 
QGOGCTAGGC 
TOGTGTACCT 
AGOGCCGTTT 
G6CC06CGCC 
CCAAGOOGCG 
06CTGAA6AC 
TGGCCAAGGT 
TCAACTGGCT 
ACTTCCGCAA 
AOOCQACOCA 
CATOGCOOQG 
CGCOCCTGCT 
GCCTGGAC6A 
GGGGOGCGGA 
CCQATAGCAG 
AA6CCAQGGA 
AT8TTCCTT0 



41 

I 

GGCCCAQCCC 
GCTGQTGCTG 
GGOQGCGACC 
CGCCAGOQAA 
GGC6CTCATC 
GAOGC06CX3G 
GGTCATGGGG 
GTAC66CTCC 
CATOGAGACC 
CTACGAGAGC 
CT06GCGCTG 
OQCGOQCOGC 
CATOQOCTGG 

CCTOGGOGGC 
GCCGCCCXsGA 
TGCQGOTAAG 
GCTGGGCATC 
GGTGAAGGCC 
GGGCTAOGCC 
GGCCTTCCAG 
QQQASACCQO 
GGC06CCT0G 
GQAGCCCrGG 
GCCGTGCCOC 
CTCCGGGCAC 
QTQAACTCQA 
C0GTT6CACA 
TTO 




T QCTA CAAOQ 
TCCGTAGTCT 
G6CGAGGCCC 
CCAGCGOGGC 
CCCC06CGCC 
C6G0GGCCCT 
ATCATGGG06 
TTCCACOQOG 
AACTOSGCCT 
GGACTGCTCT 

GAGQACQAC6 
GCOGGCTGCA 
CCOGGCTTCG 
GGCTTCCCAO 
AGCGCACAAT 
AAAAGGAAAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1330 
1380 
1440 
1500 
1560 
1620 
1680 



A167 Protein sequence 
Gene nantai 
Unigene number: 
Protein Accession #: 
Signal sequence: 
Transmenibrane domains: 
Pfan dotniam 
Cellular Localization i 



-1 adrenergic receptor irRNA« 3* UTR 



Homo sapiens beta- 
Hs. 37744 
AA011176 

none found 

62-84, 95-117, 135-157, 177-198, 236-248 
7tm_l [75-377] 



1 11 21 31 41 51 

I t 1 I I 1 

M6AGVLVLGA 6BP6NL8SAA PLPDGAATAA RLLVPA8PPA 8LLPPA8ESP EPLSQQHTAO 60 

MOLLMALIVL LIVAGNVLVZ VAIAKTPRI^ TLTNLFZMSL ASADLVKGUi WPFGATZW 130 

910 



wo 03/042661 



PCT/US02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



WGRWBYGSPP CBIiWTSVDVL CVTASIBTLC VIALDRYUVI TSPFRYQSLL TRARARGLVC 
TVWAISALVS PLPILMHWWR AESDBARRCY NDPKCOFVT NRAYAIASSV VSPYVPLCIM 
AFWLRVPRE AQKQVKKIDS CERRPLGGPA RPPSPSPSPV PAPAPPPGPP RPAAAAATAP 
LANGRAGKRR PSRLVALREQ KALKTLGIIM OVFTLCWLPF FLANWKAFH RBLVPDRIiFV 
FPNWU^ANS AFNPIIYCRS PDPRKAPQGL LCXIARRAARR RHATHGDRPR ASGCLARPGP 
PPSPGAASDD DDDDWGATP PARLLBPWAG OIGGAAADSD SSIjOEPCRFG PASBSKV 



180 
240 
300 
360 
420 



A16B DNA sequence 
Gene namei 
Unigene nuinber: 
ProbesGt Accession 9i 
Nucleic Acid Accession fii 



CEX^l 
Hs. 222399 
AA2564eS 
AJ400877 



Ooding sequence 1 ai-3080 (underlined sequences correspond to start and stop codons) 



1 11 21 31 

1 I I I 

6G0GTCCG0G CACACCTCCC OGOQCOG C CG CCOCCACXSC 
C06CAAC06C TGAGCCATGC ATGGGGGTOG 0GGGCG6CAA 
C3GGT6CTOCT GCTGCTQCTQ CTGCTGOCGC CACTGCTGCT 
CX^GGTOGGGG CCGTGCCGCG G G GCCGCAGG AGQATQTAGA 
ATGACTGCCA TGCCX3ACGCC CTOTGTCAGA ACACACCCAC 
AGCCrCGCTA CCAAGGGGAA GQCAG6CA0T GTGAGQACAT 
TCAATGQAGG CTGTGTCCAT GACTQTTTQA ATATTCCA06 
TTQATGGCTT CATGTTGGCT CATGAGQQTC ATAATTGTCT 
AGAACAATGG CGGCTQCCAG C3VTACCTGTG TCAAOGTCAT 
GCAAGGAGGG GTTTTTCCTQ AGTGACAATC AQCACACCTQ 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT 
GCAGOGTCOC CTGT6A6TGC AlSaCCTGGTT TTOAOCTGGC 
TCTTGACCTG TAACCATGGO AA O GOTQGGT GCCAGCACTC 
GCCCAGAGTG CAQCTGCXIAT CCACAQTACA AGATQCACAC 
AGCGAGAGGA CACTGTCCTQ GAGGTGACAG AOAQCAACAC 
ATAAACGGGT GAAAOGGGGG CTGCTCATGO AAACGTOTGC 
ACCGCACCTS TAAG6ATACT TGQACAG6T0 TCCACTGCAO 
TCCAGTTGGA T0G6AAGACA TGTAAAGATA TTGATGA6TG 
GTGATCATTT CTGCAAAAAC ATOGrTGGGCA GTTTTQACTG 
AATTATTAAC AQATGAGAAO TCTTGCCAAG ATGTXWATQA 
OTGACCACAQ CTQCATCAAC CACCCTGGCA CATTTGCTTG 
CCCTGTATGG CTTCACCCAC TQTOQAGACA CCAATGASTG 
GTCAGCAGGT CTOTGTGAAC ACAGTGGGCA GCTATGAATG 
AGCTCCACTO GAATAAAAAA QACTGT6T6G AA6TGAA6GG 
CACXXX3GTGT GTCCCTGCAC TQCGGTAAGA GTGGTGGAGG 
GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT 
AGCTAAATQA AGGCAAGTGT AOTTTGAAAA ATGCTQAOCT 
CAGCACTACC AGAGAAOCAC AGCTCAOTAA AAGAGAGCTT 
GCAGCTCTGG CAAQCAAOTC CCAGOAOCCC CTGGCOGACC 
TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA 
TGAGCT6CAT C6TAAAQOQA ACCGAGAAGC G6CTC0GTAA 
AGGCOGTCCA CAGGGAGCAO TTTCACCTCC AGCTCTCAGG 
AAAAGCCTCC CAGAACATCT GAACGCC3VG0 CAGAGTCCT6 
CAGAAAAOCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA 
GCATTTTATO TCCAAATGGA ACCTTCCAAA ATGAGQAAGQ 
GCCCAAGACC AGGAAAtTCT GGGGCCXnm AGACCCCAGA 
OTG6AG0TCT 6TGTCAACCT GQTGAATATT CTGCAGATGO 
GTGCCCTG6Q CACGTTCCAO CCTQAA0CT9 GTOGAACTTC 
GCCTTGCCAC CAAACATCAG GGAGCTACTT CXTTTTCAGGA 
GTTCACCTGG ACATTTCTAC AACACCACX3V CTCACCGATG 
CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG 
ACTTTGATQO CTOCACAAAC ATAAGOCAGT OTAAAAACAG 
GAQATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG 
AGT0TACX5TG GACCATCAAC CCACCCCCCA AGCGCCGCAT 
TCTTCCTQCC CATAQAGGAC GACTGTGGGG ACTATCTQOT 
CCAATTCT6T GACAACATAT GAAACCTGCC AGAGCTAQGA 
OCAGGTCAAA GAA6CT9TGG ATTCAOTTCA AQTOCAATGA 
TCCAGGTCCC ATA0GT6ACA TATGATGAGO ACTAOCAOGA 
6AGATGGCAG GCTCTATGGA TCTGAGAACC ATCAOGAAAT 
TCAAGGCTCr GTTTGATGTC CTGGCCCATC CCCAGAACTA 
AGT0CX3QAQA GATGTTTCCA AGATOOTTCA TC06ATTGCT 
TTTTGAGACC TTACAAATOA CTCA60CCAC 0TQ0C31CTCA 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA 
CX3GTATCAOT GACTCATTAO AGTTCAATTT TTATAGATAA 
GAACTTGGTT TTTCTTTCCC AGCATOGTGO ATOTAGACTG 
CAGCTTCTCA CTGCTGTGGG OGGATGTCTT GQATAQATCA 
TTGGTCAGCC TAGGTQAGAC TCAOCTGTCC TTCTQOGOTC 
TGTA6TGGAA AGGAGGCCAC A6AATAA6CT GCTTATTCTQ 
COQGCCCTCT CTAAGGGAOC CCTCTGCACT GGTGTGCAGG 
CAA6AGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC 
ACTCAGTTTC TCXJICAGGCT TCTCCAGCCT OTGTGATACA 
A61TCTAAGC AGtCCTCOTG AAAAAAAAAA GCAGAAAGAA 
AGCACTTCTG GAGACAT 



41 51 
I I 

COGCACTCOG COQCCTCTGC 60 

O06T0C0GGG G0G60CTG8G 120 

GCTGGOGGGG GCGGTCC06C 180 

TGAGTOTOCC CAAGGGCTAG 240 

CTCCTACAAG TGCTCCTGCA 300 

OQATGAATGT GGAAATGAGC 360 

CAATTATCOT T0CACTT6TT 420 

TQATOTGOAC QAQTQCCTGG 480 

GGGGAGCTAT GAGTGCTGCT 540 

CATTCACCXSC TCGGAAGAOG 600 

CTGCAAGGAG GCCCCAAGGG 660 

CAAGAAOCAO A6AGACT6CA 720 

CTGTQAOQAT ACAGCCGATG 780 

AGATGGGAOG AGCT6CCTTG 940 

CACATCAGTG GTGGATGGGQ 900 

TGTCAACAAT GGAGGCTGTO 960 

TTGTOCTSr r GGATTCACTC 1020 

CCA8ACC06C AATG6AGGTT 1080 

CGGCTGCAAG AAAGGATTTA 1140 

GTOCrCXTTO QATAGGACCT 1200 

TGCTTGCAAC OGAGGGTACA 1260 

CAGCATCAAC AACGOAGOCT 1320 

CCAGTGCCAC CXTTCGGTACA 1380 

GCrCCTGCCC ACAAGTGTGT 1440 

AGAOGGGTGC TTCCTCAOAT 1500 

CAGGACAAGT GTAACCTTTA 1560 

GTTTOCOGAO GOTCTGOQAC 1620 

C06CTA0GTA AAOCTTACAT 1680 

AAGCACCCCT AAGQAAATGT 1740 

GGTGACAGCT TCTTGTGACC 1800 

AGCCATCOGC AOGCTCAGAA 1860 

CAIGAACC7C GAOGTGGCCA 1920 

TGGAGTGGGC CAOGGTOVTG 1980 

TQATGGAGCA C6A6AAG6CT 2040 

ACAAATGACT TQTGAAOCAT 21O0 

AGCTTGGAAT ATGTCTGAAT 2160 

CTTTGCACCT TGCCAGCTCT 2220 

CTOCTTCCCC TGT6GA6GA0 2280 

CTGTGAAACC AGAGTTCAAT 2340 

TATTCQTTQC CX^GTGGGAA 2400 

CGCAQGAAAT ACTACX3ACTG 2460 

AAGATGTGGA GGGQAGCTGG 2520 

CAATXACCCA GCCAACAGOO 2580 

CCTGATCX5T6 GTCCCTGAGA 2640 

GATGOGGAAA ACCTCTTCAT 2700 

ACGCXXXaiTC GCCTTCACCT 2760 

AGGQAACAGC GCZAGAGGGT 2820 

ACTCATTGAA GACATAGTTC 2880 

ACTTAAGGAT AAGAAACTTA 2940 

T7TCAAGTAC ACAGCCCAGG 3000 

ACGTTCCAAA GTGTCCAGGT 3060 

AZACAAATST TCIGCTATAO 3120 

CAGTOGGGTA TTGCTGCCTC 3180 

TACAGATATT TTGGTAAATT 3240 

AOAATGGCTT TGAGTGGCAT 3300 

OGGGCTGGCT GAGCTGOACT 3360 

TTACTGCTCC TGAAGGAGTC 3420 

AAACrrCAGC TTCCTCTAGC 3480 

CTCTGACCAG GCAGAACAOG 3540 

CCACCTTGAG ACCTGGGAGG 3600 

AGTTTGATCC CAGGAACTTG 3660 

ITAGAAATAA ATAAAAACTA 3720 



A169 Protein sequence 
Gene name: 
Ublgene nuinber i 



CEQPl 
Bs. 222399 



911 



wo 03/042661 



PCTAJS02/36810 



Probeset Accession fit AA2S64B5 
Protein Accession #j CAB922es 
Signal sequence! 1-31 
Transmembrane domains: xione 

PPAM domaiosi B6P-like_doniaine (49-84.132-167,177-213,266-321,407-442} CUB_doinain 1809-918] 
Cellular Locallsatlont may be secreted 



1 11 21 31 41 SI 

I ) I 1 t I 

KGVAGRNRPG AAHAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QaU)OCKADA 60 

LOQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMIA 120 

KDGKNCXiDVD ECLENNGGGQ HTCVNVNSSY BCXX:KE6FFL SONQKTCIHR SBGGLSGMNK 180 

DHGCSHICKB APR6SVACBC RPGFELAKKQ RDCZLTCMHO NQGOOHSCDD TADGPECSCH 240 

POYKMHTDGR SCLEREDTVL EVTESMTTSV VDGDKRVKRR liLMKTCAVNN GGCDRTCKDT 300 

STGVHCSCPV GPTLQLDGKT CKDIDEOQTR NGGCDHTCKN IVOSPDOGCK KGPKLLTOEK 360 

SCQDVDEC8L DRTCDHSCIN HPQTPACACN RGYTLYGFTH OGDTNECSIN NGGCQQVCVN 420 

TVGSYEOQCH PGYXLHWNKK DCVEVK6LLP TSVSPRVSLU 0QKSGGGD6C FLRCHSGIKL 480 

8SDVTTIRTS VTFKLNBGKC SLKNAELPPB GLRPAI«PEiai SSVKBSFRYV NLTCSSGKQV S40 

PGAP6RPSTP KEMFITVBPB LBTNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHRBQ 600 

PHLQI*SGMML DVAKKPPRTS ERQAESOGVG QGHAENQCVS CRAGTYYDGA RBRCILCPNG 660 

TFQNEEGQMT CEPCPRPQfS GALKTPEAWN KSECGGLCQP GEYSADGPAP OQLCALGTPQ 720 

PEA6RTSCPP CGGG LATKHQ QATSFQDCBT RVQCSPGHFY NTTTHRCIRC PVGTYQPBFG 780 

KNNCVSCFGN TTTDFD6STN ITQCRNBROO GEXiGOFTGYX ESFNYPGNYP ANTECTWTIN 840 

PPPKRRZLXV VPBXFLPIBD OOSDYLVMRX TSSSNSVTTY ETCQTYERPX AFTSRSXKLH 900 

XQFKSNEGNS ARGFQVPYVT YDBDYQELXB DXVRDORLYA 8ENHQBXLKD KKLXKALFDV 960 
LAHPQNYFKY TAQESREHFP RSFXRLLR8K VSRFUIPYK 

A170 DMA sequence 

Gene namei DSMB-6 protein {KIAA0452} 

Unigene nuinber: Hs. 125783 

Probeset Accession #: AL039402 
Nucleic Acid Accession #x AP007170 

Coding sequence! 1-1725 [underlined sequences correspond to stop codon) 

1 11 21 31 41 51 

I 1 I I 1 i 

AAG6AGGCGQ CCTCCOGGAA AAQ06ACC6C AGGACTCCTG A6AGCAGCCT CCATGAGGCC 60 

CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAOTTCTC AGAAGCACTC 120 

AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTAOCACT CACT6ACATA TGCCACCATC 180 

CTG6AGATGC AQGCCATQAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC OGGCAACATG 240 

ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGQA AGTCTTCTGT AACAGATTCC 300 

TTCAGCAGCC TGQTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 

GAGGTCT6CT ATGCAGAGT6 CCTGCTGCAO GQAOCAGCCC TQACCTTCCT GCAOQACGAG 420 

AACATGGTGA GCTTCATCAA AGGCG6CATC AAAGTTOGAA ACAGCTACCA GACCTACAAG 480 

GAOCTGOACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCOGCACTTT 540 

GAAQQAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 

AGG ATCC TGA GGCTGTTGGA GTTTGTGGGO TTTTCAOGAA ACAAGGACTA TGGGCTGCTG 660 

CAGCTGGAGQ AGGGAG06TC AGGGCACAGC TTCCQCTCTG TGCTCTGTGT CATOCTCCTG 720 

CTQT6CTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG G6AACGTCAA CATCGAGGAG 780 

GCCGAGAAGC TCTTGAAGCC CTACCTGAAC OGQTACCCTA AGGGTGCCAT CTTCCTOTTC 840 

TTTGC^OTA GQATTGAAGT CATTAAAGGC AACATTQATG CAGCCATCOG GCGTTTOGAO 900 

GAOTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATQTGCTA CTGGGAGCTO 960 

ATGTOarOCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 

AGCAAGGAOA ACTOCTGOTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 

ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACQAOG AAGTGOAATT ATTTCGAGCT 1140 

GTGCCAGGCC TQAAGCTCAA QATTGCTGGa AAATCTCTAC CCAGAGAQAA GTTTGCCATC 1200 

CGGAAGTCCC 66CGCTACTT CTCCTCCAAC CCTATCTCGC TOCCAOTOCC TOCTCrOGAA 1260 

ATGATOTACA TCTGQAAOGG CTAOGCGGTG ATTGGGAAGC AGCC6AAACT CACGGATGGG 1320 

ATACTTGAGA ITATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAAOQAGTAC 1380 

TCAGTGGATG ACX5AGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 

OGTGTCCAGG AGGCOGAGGA GAATTTTAGG AGCATCTCTO CCAATGAAAA 6AAGATTAAA 1500 

TATGACCACT ACTTGATCOC AAACGCCCTG CTGGA6CTGQ CCCTGCTGCT TATCGAGCAA 1560 

GACAOAAAOO AAQAOGCCAT CAAACTTTTG QAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 

TCCATGGAGT CAAGGACACA CTTTOGAATC CAGGCA6CCA CACTCCAAGC CAA6TCTTCC 1680 

CTAGAGAACA GCAGCAOATC CATGGTCTCA TCA6TGTCCT TGTAGCTTTG TGCAGCAGTT 1740 

C OGGGCT GGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 

CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 

TQTATC0GT6 CftGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 

G6CA6A0CA6 OTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 

GTGATGTTTA A6AQAATGTA TGAACAOTTT ACATTTTCCT TAGAAATACA TTQATGGGAT 2040 

CACAGTTG6C TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 

TATCATCT60 AGGTAAATCT CTTTATATGA TX3ATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 

TCAGCAAGTT CTCAGCTTGT GTGACGQAAG GTCCTTCAQA GQACCTGAGG AATGCCTGGG 2220 

AGAOOCTAAO CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 

CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTOT AAATTTCCAA TTTAAAAATC 2340 

AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 

AC TAGAAA AT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 

AAATTTGGG6 OQCAGGAGQA GGTTCTCAGA ATCCAGTCTO TATCTTTGCT GTATGCCAAA 2520 

CT OAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 

CATTTGCTTA CTQACAGCAT TTTTGTTAAA ACXOTTATTC TTOAAAAAAA AAAAAAAAAA 2640 
AA 

A171 Protein sequence 



912 



wo 03/042661 



PCTAJS02/36810 



Gene naniet 
tftilgene number: 
Pr^beset Accesaion #« 
Protein Accession fti 
Signal sequence: 
Transmeicbrane domains t 
Cellular Localixationt 



DEME-6 protein (KIAA0452) 

Hs. 125783 

AL039402 

AAC39S82 

none 

210-226 

plasma membrane 



1 



11 



21 



31 



41 



51 



MTALDLPliTO QFSEALSYLK PRTKESMYHS LTYATIIiEMQ AMMTFDPQDI LLAGNMMKKA 60 

QMLOQRHRRK SSVTD8PSSL VNRPTLGQPT BBBIKAEVCY AECLLQRAAL TFLQOENMVS 120 

PIKGOIKVRN SYOmCELDS LVQSSQYCKG ENHraFBGGV KL6VGAFNLT LSMLPTRILR 180 

LXiBFVGFSGN KDYGI.LQLEB 6ASQISFRSV LCVMLLLCyH TFLTFVbOTO NVNIBEAEKL 240 

LKPYLNRYPK GAIPLFPAGR lEVIKGNIDA AIRRPEECCE AQQHWKQFHH MCYWELHWCP 300 

TYKGQWKKSY FYADLI>SREN CWSKATYXYM KAAYLSMFOK EDHKFFGDDB VELFRAVPGL 360 

KLKIAGRSXiP TEKFAIRKSR RYFSSKPXSL PVFALEMKYI KNGYAVIGKC^ PKLTDOXLEI 420 

ITKABBHLBK GPENBYSVDD ECLVXLXiKGL CLKYUSRVQE ABENFRSISA NERRIKYDHY 480 

LIPKALLBLA LLLMEQDRNB EAIKLLESAK QNYKNYSMES RTBFRZQAAT LQAKSSLB3S 540 
SRSMVSSVSL 



A172 DKA sequence 

Gene namet EST 

Unigene number: Hs. 200102 

Probeset Accession S: AL117406 

Nucleic Acid Accession #t none Sound 

Cbding sequences 1-4044 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 



ATGACTAGGA AGAGGACATA CTGGGTOOCC AACTCTTCTQ GTGGCCTOQT GAATCGTGGC 60 

ATOQAGATAG GOQATGACAT GGTTTCAOQA CTTATTTATA AAACCTATAC TCTCCAA6AT 120 

GOCCCCTGGA GTCAGCAAGA GAGAAATGCT GAGGCTCCAO GGAGGGCAGC TGTCCCACCG 180 

TG6GGGAA6T ATGATGCTGC CTTGAGAACC ATOATTCCCT TCOOTCCCAA GCCQAOQTTT 240 

CCTGCCCCCC AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCSVCXXTT GTCATGGCTC 300 

ACCOCGCTCA TGATCCAAAG CTTAOGGAGT CGCTTAQATQ AGAACACCAT CCCTCCACTQ 360 

TCAQTGCATG ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA 420 

GTCTCAAGGC GAOQGATTGA AAAAGCTTGA GTGCTTCTGQ TGATQCTQAG GTTCXAGAOA 480 

ACAAGGTTGA TTTTOGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAO TGTACTOSQG 540 

CCAATATTGA TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC 600 

CATGQAGTGG GACTCTGCTT TGCCCTTTTT CTCTCCQAAT GTQTQAAGTC TCTQAGTTTC 660 

TCCTCCAGT T OQATCATCAA CCAACGCACA QCCATC3VGGT TCOGAGCAGC TGTTTCCTCC 720 

TTTGCCTTTG AGAAGCTCAT CCAATTTAAO TCTGTAATAC ACATCACCTC AGGAGAGGCC 780 

ATCAGCTTCT TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCC!CTA 840 

OTACTGATCA CCTQC3GCATC GCTGGTCATC TOCAGCATTT CTTCXTTACTT CATTATTQGA 900 

TACACTGCAT TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCXXACTG6C GGTATTCATQ 960 

ACAAQAATQG CTGTGAAGGC TCAGCATCAC ACATCTQAGG TCAGGGACCA GCGCATCOQT 1020 

GT6ACCAQTG AAGTTCTCAC TT0CAT7AAG CTGATTAAAA TGTACAGATG GQAGAAACCA 1080 

TTTGCAAAAA TCATTGAAGG TATGGAAAGT CTGACTTTCT GCTCCAAACC TGGTGATGGC 1140 

ATGGCCTTCA GCATGCTQGC CTCCTTQAAT CTCCTT03QC TOTCAGTOTT Crri G TUC C r 1200 

ATTGCAOTCA AAGGT CTCAC GAATTOCAAG TCTGCAGTGA TGAGGTTCAA GAAGTTTTTC 1260 

CTCC AGQftOA GCCGTGTTTT CTATGTCCAO ACATTACAAG AOCCCAOCAA AGCTCTGGTC 1320 

TTT6AGGAGG CCACCTTGTC ATGGCAACAG ACCTGTCC06 G6ATG6TCAA TGGGGCACTG 1360 

GAOCTGGAGA GGAACGGGCA TGCTTCTGAO GGGATGACCA 06CCTAGAGA TGCCCTCGGO 1440 

CCAGAGGAAG AAGGGAACA6 CCTGGGOCCA GAGTTGCACA AQATCAACCT GGTGGTGTCC 1500 

AAG QGQATG A TGTTAGGGGT CTGCGGCAAC AOGGGGAGTG GTAAQAGCAG CCTQTTGTCA 1560 

GOCATOCTQG AGGAGATQCA CTTGCTOQAG GGCT06GTGG GOGTGCAGGG AAOCCTGQCC 1620 

TATGTCCCCC AGCAGGCCTG GATCGTCAGC GGGAACATO^ GGCSVGAACAT CXTCATGGQA 1680 

GQOQCATATG ACAAGGCCOG ATACCTCCAG OTGCTCCACT GCTGCTCCXT GAAT0C3GGAC 1740 

CTGOAACTTC TGCCCTTTGG AGACATGACA GAGATTGQAG AGOGGGGCCT CAACCTCTCT 1800 

GGGGGGCAGA AACAQAGGAT CAQCXTTOGCC OGOSCOGTCT ATTCCGACCO TCAGATCTAC 1860 

CTGCTGOAOG ACXXXXTGTC TGCTGTGGAC GCCX3WXSTGG GGAAGCACAT TTTTGAGGAG 1920 

TGCATTAAGA AGACACTCAG GG6GAAQACX3 GTG6TCCTGG TGACCCACCA GCTGCAGTAC 1980 

TTAGAATTTT GTGGCCAGAT CATTTTGTTG GAAAATGGOA AAATCTOTOA AAATGGAACT 2040 

CACAGTOAGT TAATGCAGAA AAAGGGGAAA TAT6CCCAAC TTATCCAGAA GATGCACAAG 2100 

GAAGCCACTT CGGACATGTT GCAQGACAGA GCAAAQATAO CAGAGAAGCC AAAGGTAGAA 2160 

AGTCAGGCTC TGGCCACCTC CCTGGAAGAG TCTCTCAACX3 GAAATGCTGT GCOGGAGCAT 2220 

CAGCTCACAC AGGAGGAGGA GATGGAAGAA GGCTCCTTGA GTTGGAGGGT CTACGACCAC 2280 

TACarCCAGG CAQCTGGAGG TTACATGGTC TCTTGCATAA rm tT r C IT OQTQGTGCTG 2340 

ATOGTCTTCT TAAOGATCTT GAGCTTCTGG TGGCTGAGCT ACIGGTTG6A GCAGGOCTGG 2400 

GGGACCAATA GCAGCCQAGA GAGCAATGGA ACCATGGCAO ACCTGGGCAA CATTQCAQAC 2460 

AATCCTCAAC TGTCCTTCTA CCAGCTGQTG TAOGGGCTCA ACGCCCTGCT CCTCATCTGT 2520 

GTGGGGGTCT GCTCCTCAGO GATTTTCACC AAAOTCACGA GGAAGGCATC CACGGCCCTQ 2580 

CACAACAAGC TCTTCAACAA GGTTTTCOGC TGCCCCATQA GTTTCTTTGA CACCATGOCA 2640 

ATAGGCCQGC TTTTGAACTG CTTOBOIGGG GACTTOQAAC AGCTGGACCA GCTCTTQOOC 2700 

ATCTTTTCAQ AGCAGTTCCT G G TCCTGTCC TTAATGOTOA TCX3C0GTCCT GTTQATT3TC 2760 

AOTGTGCTGT CTCCATATAT CCTGTTAATQ GGAGCCATAA TCATGOTTAT TTGCTTCATT 2820 

TATTATATGA TGTTCAAGAA GGCCATOGOT OTGTTCAAGA GACTGQAGAA CTATAGCCX5G 2880 

TCTCCTTTAT TCTCCCACAT CCTCAATTCT CTGCAAGGCC TQAGCTCCAT CCATGTCTAT 2940 

GGAAAAACTG AAGACTTCAT CAQCCAOTTT AAQAQGCTGA CTGATQC3QCA GAATAACTAC 3000 

CrGCTGTTGT TTCTATCTTC CACAOGATGG ATGGCATTGA GGCTGGAGAT CATGACCAAC 3060 

CnOTQAOCT TOGCTOTTOC CCTOTTOOTG OCTTTTGOCA TTTCCTCCAC CCCCT A CTOC 3120 



913 



wo 03/042661 



PCT/US02/36810 



TTTAAAOTOV 
CGGATTGGCT 
AAGATGTQTG 
CO^CAGCftTQ 
AOOGTGCTTC 
AGG/VOGGGCT 
GCAGGCCGGA 
TCCAAGCTCT 
CTAGATCCCT 
CTQACCAAOG 
GGAAACTTCT 
TCCAAQATCA 
ATCCA6CX3CA 
GTCACCACTO 
TTTGATG66C 
ACA6CCACTT 
CTCAGAGGTT 
ATQAGAACTT 
AGCCTG6AAT 
ATG6GATTCA 
AGGTAAAAOC 
TGTAAAATAT 



TGGCTGTCAA 
TGGAQACAGA 
TCrOGGAAGC 
aOOAAATCAT 
AC66CATCAA 
CT6GGAA6TC 
TTCTCATTGA 
CAGTGATCCC 
TTQACCGTCA 
CCATCTCAAA 
CTGT6GGGGA 
TCCTTATCCA 
CAATC06TGA 
TGCT6AACTG 
06GAGGTACT 
CTTCACTGAG 
CACACAGGTG 
CTCCTGGAAG 
AGGCTACTTQ 
6T6ATCATGT 
TTATAGTTTT 
AAAACTAAGG 



CAT0C3TQCTG 
GGCACAOTTC 
TCCTTTACAC 
ATTTCAGGAT 
OCTGACCATC 
CTCCTTGGGC 
CGGCGTGGAC 
TCAAGATCCA 
CACTGACCAO 
GTTCCCCAAA 
GAGGCAGCT6 
TGAAGCCACA 
A6CCTT0CAG 
TGACCACATC 
GCG6AAGAAG 
ATAAGGA6AT 
CAGCTTCGAG 
CA6G66TAAA 
ATGGCTCTCA 
GGTTCICCTT 
CTQATCTGTG 
AAAACTCAAA 



CAGCTGGOGT 
ACGQCTGTAG 
ATGGAAGGCA 
TATGACATQA 
0GCX3GCCA0G 
ATGQCTCTCT 
ATTTGCAGCA 
GTGCTGCTCT 
CAGATCTGGG 
AA6CT6CATA 
CTCTGCATTG 
GCCTCCATTG 
GGCTGCACG6 
CTGGTTAT6Q 
CCTGGGTCAT 
GTGGAGACTT 
GCCCACACTC 
TGTAGGGGGO 
AOACCTTAOA 
TTAACTTACA 
TTAGAAGT8Y 
AAAAAAAAAA 



CCAGCTTCCA 
AGAGGATACr 
CAAGTTGTCC 
AATACA6M3A 
AAGT GGTG GG 
TOOGCCTGGT 
TOQGCCTGGA 
CAGGAACCAT 
ATOCXZTTGGA 
CftGATGTGGT 
CCAGGGCTGT 
ACAT6GAGAC 
TGCTOGTCAT 
GCAATGQQAA 
TGTT06CAGC 
CATGGAGGCT 
TGCGACCTTC 
6TGGGGATTQ 
ACCTCAOAAC 
T6CIGAATAA 
TGOUVAT6CT 
AAAAAAA 



GQCCACTGCC 
GCAGTACATQ 
CCAGGG6TGQ 
CAACACAOCC 
CATGGTGGGA 
GGAGCCCATO 
GGACTTGCGG 
CAGATTCAAC 
6AG6ACATTC 
66AAAA00GT 
GCTTOGCAAC 
AGACACCCT6 
TGCCCACCGT 
OGTGGTAGAA 
GCTCATGGCC 
QGCAGCTGAG 
TTGTTTGGAG 
CTGGATGGAA 
CATCTAAQAC 
TTTTATAATA 
6TACTGACT7 



3180 
3240 
3300 
3360 
3420 
34B0 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 



A173 Protein sequence 
Gene nainei 
Uhigene nimiber: 
Probeset Accession ft: 
Protein Accession j|: 
Signal sequences 
Transroentorane domains: 
943,101B>1034 
PFAM domains I 

Cellular Localization: 



Hs. 200102 
AL117406 
none found 
none found 
169-185. 199-215, 



275-291, 304-320, 387-403, 770-786, 829-845, 907-923, 927- 



ABC_transporter [502-673] , ABCjneinbrane_region [163-432, 771-1060] 
ATP~binding_dooiains [508-516, 1139-11471 
plasma membrane 



1 
I 

MTRKRTYWVP 
UGKYDAALRT 
SVHDASDKNV 
FIX.IIPKILE 
FAFEKLIQFK 
YTAFIAILCy 
FAKXIEGMBS 
LQESPVFYVO 
PEEBGNSLGP 
YVPQQAWIVS 
GGQKQRISLA 
LRFGGQZZLL 
8QALATSLEE 
IVPLTIFSFW 
VGVCSSGIFT 
ZFSEQPLVLS 
SPLFSHILNS 
LVTLAVALFV 
KKCV8BAPLH 
RIGSGKSSLG 
UDPFDRHIDQ 
SKXZLXDBAT 
FDRPBVLRKX 



11 

1 

N8SGGLVMRG 
HIPFRPKPRF 
QRLHRLWEEE 
YSEBQLGHW 
SVIHITSGEA 
LliVFPLAVFM 
LTPCSKFGDO 
TLQDPSKALV 
ELHKINLWS 
CmiRBNILMG 
RAVYSDRQIY 
ENGKZCEENGT 
8UIGHAVPEH 
WL9YWLEQGS 
KVTRKASTAL 
LMVIAVLLIV 
LQ6LSSIHVY 
AFGI88TPYS 
MEGTSCPQQW 
MALFRLVEPM 
QZWDALERTF 
A8IDMBTDTL 
PGSLFAALHA 



21 
I 

IDIGDDMV8G 
PAFQPLDNAO 
VSRR6IEKAS 
HGVGLCFALF 
ISFFTGDVKY 
TSMAVKAQHB 
MAFSMUV8UI 
FEEATLSWQQ 
RGMMLGVCGN 
GAYDKARYLQ 
LLDDPLSAVD 



31 
I 

LIYKTYTLQO 
LFSYttTVSHL 
VIiLVMtRPQR 
LSECVKSLSF 
liFEGVCYQPI. 
T8EVSDQRXR 
IiUUj8VFFVP 
TCPGZVNGAL 
TGS6KSSLI.S 
VXiHCCSLNRD 
AHVGKHIFEE 
YAQIiZQKMaK 



41 
I 

GFH8QQERNP 
TPIMIQSIiRS 
TRLIFDAZiLG 
SSSHIIMQRT 
VLITCASLVI 
VTSBVIiTCIK 
lAVKdirmSR 



QLTQBEEMBB 
GTNSSRESNG 
HNKLFNKVFR 
SVLSPYIUM 
(KTEDFZSQF 
FKVMAVNZVL 
PQHGBIIFQD 
AGRILIDGVD 
LTKAZSKFPK 
ZQRTZRBAFQ 
TATSSLR 



7T4ADLGNIAD 
CPMSPFDTIP 
GAXZHVZCFI 
KRLTDAQNMY 
QLA8SFQATA 
YHMKmmP 
ICSZGLEDLR 
KLHTDWENO 
GCTVLVZAHR 



AILEEKHLLB 
LBLLPFGDMT 
CIKKTLRGKT 
EKISOKUCffyX 
YZQAAGGYMV 
NPQIiSFYQIjV 
IGRIiLKTCFAG 
YYMKFKXAIG 
LLLFLSSTRM 
RZGLBTBAQF 
TVLHOZNLTl 
SKL8V1PQ0P 
GNFSVOERQL 
VTTVWCDKZ 



51 
I 

EAPGRAAVPP 
RLDENTXPPIi 
ZCFCXASVLG 
AIRFRAAVSS 
CSXSSYFIIG 
IiZXHYTWERF 
8AVHRFKKFF 
GMTRPRDALG 
GSVGVQGSLA 
SZGERGIitniS 
WLVTHQLQY 
AKZABKPKVB 
SCZZFFFWL 
YGLNALUiZC 
DLEQLDQLLP 
VFKRItEHYSR 
raUALBZHTH 
TAVERZLQYM 
RGHEWGIVQ 
VLLSGTIRPN 
LCIARAVLRN 
LVMGNGKWB 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



A174 DWA sequence 

Gene name: ESTs 
Ukiigene number: Hs. 128899 

Probeset Accession #t AA983251 
Nucleic Acid Accession fts AA9B3251 

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I t I I I 

ATGCTQTCTQ QCTTCTTGAT GAGTCCCAQT ACCCAGCACA OAGCACAGTA CACTCCCGGA 60 

GGAAAGAAAC TTGCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCC36AGG GCGAGGCAGC 120 

GACCQGGAGA GQGA6A6CC6 6CCGGAQ6CT GG0GG6CTCC TGTGGGAC08 C6CT0GAS0C 180 

GGGGAGGGGG AGAAGGG6AA CCGGGGOGAO CCGCCOGCCT GGATCC6CGC OCAGCAGCAG 240 

COGOGGCCGC CGCCA6CTGG GCAGGCTCCC GG6ACTGCGG CTGGGGGGGC GCAGGACCCT 300 

OGCCTGCGTC CTGGAOGTTC CCGGGGGAGG GTCOGGTTOC CAOTGAAACC TCCAGAGGCT 360 

TCGG6ACGAC A6CCCCGGGG 6CCTTCTGAC TGCATCCCGA GATTTCCATC A60GA6TGCA 420 

ACTCATAAGG CAGTCCCTAA GG6GAC0GGG CCAGOGGCTG A06AO8G6QA TGGCTTAGGA 480 

GCTCCTGGAC CTA6GGCCCG G0GTCGT06C CTCCTGGGOG TOGGGGCABA GGGGAGTGGC 540 

C06CGCGGAA AOOGCCGCGG GACAGTCAOT GACGAGGCCC GOGGGTCQCC GGGGCCAOGA 600 

CTTCTCGGAG ACCGTCCTGC GCTOTCTGGA GACX3CGCTGT CCX3CGCCCAG GGTGGTGCCA 660 

TQTGGGGCGC TCGCOGCTCG TCCGTCTCCT CATCCTGGAA OGCGGCTTOG CTCCTGCAGC 720 

TOCTGCTGGC TGOSCTGCTO G0GG0G(X3GG CQAG6GCCCA GOGGCGAGTA CTGCCACGGC 780 
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TGCCTGGAOO 0GCAGGGCX5T CTGGOGCATC GGCTTCCAGT GTCCCGAGCQ CTTCOACGGC 840 

GGCX5ACGCCA CCATCTQCTG OGGCAGCTGC aCX3TTQCOCT ACTGCTGCTC CAGCGCCQAG 900 

G06CGCCTGG ACCAGGGOGO CTGOGACAAT QA006CCAGC A60G06CTG0 OGAGCCTGGC 960 

086G0GGACA AA6A0GGQ0C CCGAOGGCTC GQCKfiGGCTT CATGTCITAG GGOTACXCAA 1020 

GGAGA06G0Q AGGGTGO SC C CCCACCOGT G AGOGCCTGGC AG006TGCTC CCCTGAAGGC 1080 

TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCOGGGGC TGCTGCCC03 TGCCAGACXJC 1140 

CGCGGATTCC CATCTTCTCC ACGOGGCGGC CCCTCTCCCC TGCAGOGGCC CGCCTTGCCC 1200 

ATCTACGIGC OQTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAO CClOTrGCm CAGATGTCTC CX36CCTAAGC ACGATCXXCA GCAGAOCOSA 1320 

6CC0CAGGGG OTAACOGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 

OGGGGGTCQT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CJGCCAACTCC 1440 

GGGGCCOGGG CX5CCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG ISOO 

AACAAOGTOT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCA6GCCACC 1560 

CAGATIGTGC CACATCAA6G GCA6TATCTG CATCOCXXAT ACQTGGGGTA CAOQGTGCAG 1620 

CAOGACTCro TQCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCT6CA GOCTGGCTAC 1680 

AGGCAGATTC AOTCCCXXTTT CXICTCACACC AACAGTGAAC AOAAGATOTA CCCAOCGOTG 1740 

ACTGTATAAC CGAGA6TCAC TGGTGGGTTC CTTTACTGAA GGGAGAOGAA GGCAGGGGTG 1800 

GATTCTOQAO GTGGAAGTCC GCACATGTOS GTGGTATTTA TGGCAOGATT CCTTTGGATG 1860 

GCTTCATTTG CCCCCAGACT GTATQAAAAC ATCTCOQAAT TAOCATTTCT GGATATGTTT 1920 

CATCCAGGOT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTQTGATG 1980 

TTGCTGATGQ GTQTATAACA AATGCTTGAG TCGGAAGTGC CCTTGAGATA TGGTTGACXSA 2040 

AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTCTTATTAT TATTTCTTTT 2100 

TTGTTGTTGA CTQCACAGQA TCAAAATGCC TOTTATCTCC CTTTTACTQG OACTTTTTTT 2160 

TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGQAQTQCAQ 2220 

TGGTGCGATC TCOGCTCACT QCAACTTCAG CCTCCTGQAT TCAGGCAACA CTCCTGCCTC 2280 

AGCCTCCCAC GTGGCTGGQA TTACAGQTGC CTGCXXCCAT GGCTAATTTT TTGTATTTTT 2340 

TOTAOAOATa GGGTTTCACC ATGTTGGCTG GQCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAA6TGCTG0 GATTACAGGC GTGAGCCACX: GCCCCCAGCC 2460 

TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GQTTAAGGGQ AAGACGCAAA TAACAGGACT 2520 

ATTCTAAAAG GAAACCTGTT T6AACTCTGT GAOATCAQTC ATCAGTCTCA GTATTCXZACA 2580 

GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTG G 2640 

GGGCCTATTT ItflUL T r rrr TACCTTATGT AGAGATCTTA TTACAAA6TG ATTrTCTACA 2700 

TTAAAAAGAG ACTQAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 

TGGGATQATT TTAATCTTGA AGTA6TAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2 820 

TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTQ AGAGGTAATG CATTCTTTTT 2880 

TATATT6AAA TCATAAACTA TCACCOGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 

TGGTTATGGT TTGGCQTTTC CTTCTOTTTG OTTTTCAGAG OCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATO AAGATCA6TA TTTCTGCCTA GATCTGATAA 3060 

AAAAATTTTC TTGrCTTAGT TATAAAAATT CAAAGAAATO TGTTACAAAG ATACTTAGTA 3120 

TAGCTCCTCA GCCATAACCT QAQACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 

GCAGATCATA AGGCTTTTTA TACTCTTBTT ATCAAAATGO CTTATTTTTC AGGCACTAAG 3240 

GATTGTTAAG AGAAAAOCTT TTCAACGAAG qATT Q CCTTT CTTCTCCCAC ACTQTTCTTO 3300 

ATTTCCTCTC TCTTTCAGQC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCXAAATTAT 3360 

CAAATTCAAQ TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAQAT AACTTTAAGG 3420 

ATGTGCAAQT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCX3CT 3460 

GTATGGTATG GTCTTCTAf3V CATTTATGTC TATAGATATC TATOSATCAT CTTTCTATTC 3540 

T6TTTCATQA CTGAATAATO TAAAACCAGT OTTOGCAATT GGTATCAXCA ATGATACTCA 3600 

TTTTTTAATA ACCAAAGGCA 6GGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



A17S Protein sequence 
Gene namet 
Unigene ntiniber: 
Protein Accession #i 
Signal sequence} 
Transmend^rane domaines 
Cellular liocalizations 



ESTe 

Hs. 128899 
none found 
1-11 
402-424 

not determined 



1 11 21 31 41 51 

I I I I I 1 

MLSGPLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRQRGS DRERESRPEA AGIJ.WDRAAA 60 

GEAEKGNRQE PPAWIRAQQQ PRPPPAGQAP GTAAGQAQDP RLRPGRSRGR VRLPVKPPBA 120 

SGRQPRGPSD CIPRFPSASA THXAVPKGTQ PPABDGDGIiG APGPRARRRR LLGVAAEGSG 180 

PRGKRRGTVS OEARGSPGPR UiGDRPAZiSG DALfiAPRWP 0GALAARP8P KPGTPLRSCS 240 

CCWLRCWRRG RGPSGEYCRQ NIiDAQGVHRZ GFQCPBRPD6 GDATZ006SC ALRYCCSSAB 300 

ARLDQGGCDKf DRQQGAGEPG RAOKDGPRRL GRASCLRGTQ GDGBGAPPPV RAWQRC8PBG 360 

SPKGRQLLRA FPOLLPRARR RGFPSSPRGG PSPLQRPALP ZYVPFLIVGS VFVAFIIZXSS 420 

LVAACCCRCL RPKQDPQQSR APGGm(LMET ZPMZPSASTS RGS8SRQSST AASSSSSAKS 480 

QARAPPTRSQ 1NCCLPE6TM MMVYVNMPTM PSVUIGQQAT QZVPHQGQYL HPPYVGXTVQ S40 
RDSVFMTAVP PFMDGLOPOY RQIQ8PFPBT HSEQXMyPAV TV 

A176 DMA SEQUENCE 

Gene namei ESTs, Weakly similar to 0GKU7L collagen alpha l(lll) chain precursor (H. sapiens] 
Unigene nunibert Ha. 19322 

Probeset Accession fi: AA08845a 
Nucleic Acid Accession #t AA0B84S8 

Coding sequence! 862-1995 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I ) 

aCCCTTGGAC ACIGACATOG ACXGAAGOAG TAGAATGGAa CAOOAGQACA CTQACATGGA 60 

915 
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80 



CTGAAGAAAA AG6A6CTGGA GCAGGA6AA0 GAGGTGCTGC TGCAOGGTTT GGAOATGATG 120 

GCGCGGGGCC GOGACTGGTA CCA6CAGCAG CTGCAAGQAG TGCAGGftGCG CCAQ0GCXX3C I BO 

CTC3G6CCAGA GCAGAGCCAG CQCOQACTTT GQGGCTGGftO GGAGCCCCCO CCCACTOGGG 240 

CG6CTACTGC CCAAGGTACA AOAOGTGGCC OGGTGOCTGG GGGAGCTGCT GGCTGCAGCC 300 

TGTGCCAGCC GG6CCCTGCC CG06TCCTCC TCOGGQCCCC CCT G OCCTGC CCTGACGTCC 360 

ACCrCACCCC CGGTCTGGCA GCAGCaCACC ATCCTCATGC TGAAGQAGCA GAACOGACTC 420 

CTCACCCAGG AGOTGACCGA GAAGAGTGAG CGCATCACGC AGCTGOAGCA GQAGAAGTCG 480 

6GGCTCATTA AGCAGCTGTT TGA660CCGC GCCCTGAGCC AGCAGGACGO GQGACXTCTO 540 

GATTCCACCT TCATCTAGTC CTTGTGGGCC G08TGGG0CC CCAGGGCCAO CCTGGCACTC 600 

AGCCCTTOQA GG6TGGGG6C CCCATOBCAC CCACCCTCTC TCGCIGGAGA CCCC0GGCA6 660 

GCCCRG6CAC AGTCCCGGAO TGGGCGCCTT CCTGCOGCCC TTGCCA6ATG GGCTCOCCAC 720 

6CCTGCCCCC GGCTGGTCCC CGCACOOAGC GCTTGACTCC GTTTKGGCTC CTG G 'rroVTG 780 

ACATGGGCIG GGGGCTCTCT TGAGTCOOCA TAGTCOQCAG CTACTACTGG CCGCTGTCAG 840 

TQQACAGTGG 6GTACCCCTG CATGAOTTAG CGTCCCCCC8 TTTCCAGCGO TGCCGCCCTQ 900 

GGTOCCATCr TCAGGGAAAG 6CACT60CCA OGCCAGGCTG CACTTCCAAC AA06GGCA0C 960 

AGAGGGOGOG GGGCGGCTCC GACGCOGGTC CAAGGGCAGC TTCCO G CTCA ACCAGGGCAC 1020 

CAGGAOGAOO TG6CTGTAGC TCGGACGGAC GGAAGTAGAT GGAGGGGGTQ GGGACGGCCT 1080 

6TAAGCX3GGG GGTGCCTGCC TGGCTGGGGA GCCCCAGGQA TAGCGGTOGG ACTTCAGGTT 1140 

CTGGCGAAGG CTGAGGGAGC CTGGCTOC M 0GGATCX»3CA GOCOGGGTGQ GCX3AGAGCTT 1200 

GGCCTGCATG TGCCTCCCAC A6ACCCT0G0 OTGATGOCCT TCCCCCTCTT GOCOGGGAOG 1260 

TTGCCCCAOG TT6AGTCCCA CACAACATCC TGTQAOCCrO QCTCCCCAGQ AGGGCCCCCA 1320 

GACAGCTCCC AGGCACQTCA TAGGCAAA6C CTOTTTCCXX: CGACTCAGOA TTTCCAAQGC 1300 

CTGQGQTCCT GCTCACCCCC CTTTGCTCTC ACGCCCAGCC TGTCCCCAGG TTTCAOCTQQ 1440 

GAGAGGCCAC CTCOCTCAGG CAAGOAAAAC 6A6AACCCCC AGGGTACAGG AGGAG6CTGG 1500 

GGCAGGTCCC CTTGGGTQTC ACTCOCTCAO CCCCT6CCCA GGCCCACTCC CGCTGGTGCT 1560 

GGAGTAOQCA CTGGTGGGGG GGOCCTQCTC AGCCCAACCT GGAGGGTCCC AGTGTCACCA 1620 

GAACCAGGGO CAOGGCAACA GCATCGATGO 0TTCTGCA6C CXJ^GGGCCCC OGATGCGGGG 1680 

TCAGTGTGIQ TGQGGCGCAG GGCCTCGGAT GCX3GGGTCAG TGOGTGGGGO GOGCAGGGCC 1740 

CCOGATGOGG GGTCAGTGOQ TGGQGGOCGC AGGGCCCCCT OGTGTCCAGG GCACTTTGGT IB 00 

ACACT6TCCC ACAAGGCACC TGTCTCAGAG GAGGGGCXCT GGCAGGCAGC GTGGCAACTC 1860 

CCTTCOGGAO CCCAGCTCCA TGCTAACCTG CCCACAOCAA CCCX^CAGAG CCACATTCCC 1920 

TGCTGCACCT GGTCTGCAGG GGTGTCCCAG QACAGQCCX:A AGTCAGCCCA GCATGCAGCT 1980 

GCCCTCCTAC CCTGAAGATG GGAGTG6GCT TTCCAGGQGA CATAAGQATG TCA6GCCT66 2040 

ACCTCCTGGG CAGGAAAG6G TGCAGGTCCT GAGGGCCTGT OCCCCACAGC CCCAGCACCC 2100 

AGGTGGACTG CAGCGCAGTO GGTGGGCCAG TGGCAGCCAG G6AGAAGCCC CCC0TCA6CA 2160 

GGCTGGGGTC TGCCCACCAG GGCCTCCCCA OGTCTQCCTT TGAGGQTOCX: TGCCATGCCC 2220 

TGGGGGATCC TGGCATCTTT ACTGGACTGG AAGCAGGAGA CAGAACAGTG TCTGTCCOGQ 2280 

GGTGACTTCA TCAGGAGAOC GCCCACATAG AGCTGGACCC GGCAGCTGAA GOQQAAATQT 2340 

GAGACAGGCT GGCACCTCC6 GAAAAACTGC CTTTCAGCCT TGGTGTTCCG TGCAAGGTGA 2400 

AAAGAAATAG GTCCTCCCAG TTTACAGCTT GAAATCAGGC TAGTGAGT6G CCCTGGAGAC 2460 

CACGAGGGGA 6AATTTAAAG GCCCCGGCTG GCAGGGTCTA GGTG6CTGGC AOAGGCACAT 2520 

QCAGACXXTG CCTGGAGCCT GCCCTAGQAC GCTGGGCGGG TCAGTCTCCG TGCAGGATGT 2580 

GAGCA6CGTC CCTGGGCTCT ATCCGC3QAGG TGCCAGTAGC GTGTGCAGGT ACATACAOGT 2640 

GOQTGCACAC T6TGATGACA CCCGQAAATG TCTCAGGATG TTGAAATGTG TCCTTGGGGG 2700 

CAGAAGTGTC CCCAGTTGAG AATCT6CCCC AGAGGAACAC ACCCACACCA GGCCTCAGGA 2760 

TTTTGTGTTG ATCAAGTTCC AAOGAAAAGG AACATCTCAG CCXS0606TGG TGOTTCACXX: 2820 

CTGGAATCCC AGCACTTGAG GCCAGGAGrrT GCAGAGCAGC CTGGGCAA06 CAGT6AGAQA 2880 

CCCCATCTCT ACAARAAAAA AAAAAGAAAG AAAGAAAATG AGAGATCCAG OTTTAAAAAT 2940 

TCATAAACAC CACAAGGAAA CAATACACTA TGAGACCCAG CAGAAGCAAC AGATTGACTC 3000 

TAOACCCAQA TACTAGAATT ATCAGAGAGA ATATAAAGTA ACAGTGTTTT ATATATCTAA 3060 
AQAAATAAAA 6AGATTTCTG GAAACATGAA AAAAAA 



A177 DNA SEQTJEWCE 
Gene name: 
Iftiigene nuntbert 
Probeset Accession ftt 
Nucleic Acid Accession 9 s 
Coding sequences 



Cadherin 3, P-cadherin (placental) 

K8.2877 

X63629 

X63629 

54-2543 (start and stop codons are underlined) 



1 

I 

G0G6AACACC 
TCCCTOGTGG 
CCTCOGAGCC 
CXSGAGCAGGA 
CAGCTCTGTT 
AAAGAAGGTC 
GAAGACACAA 
CCTTCCCCCA 
ACAGCATCAC 
AGACAGGCTG 
TCTTTGGCCA 
TCATCGTGAC 
GTGTCTTAGA 
ATGATGCCAT 
AGGAOCCACA 
CCAOTGGCCT 
TGGAT6G6GA 
ACAATQCTCC 
GCCATGAGGT 
GTGOCACCTA 
CTGAGAOCAA 
AGCACACCCT 
CCACAGCXZAC 



11 

I 

G6CC06C0GT 
ACCTCTCGG6 
6TGCOGGGOG 
GCCCGGCCAG 
TAGCACTGAT 
AC7TQAAG6AA 
aAOAGATTGG 
GAGACTGAAT 
GGGGCGGGGO 
GTTGTTCTTO 
06CT6TGTCA 
CGACCAGAAT 
GGGAGTCCTA 
CTACACCTAC 
OGACCTCATO 
GGACCGG6AA 
OGGCTCCACC 
CATGTTTGAC 
GCAG AGGCTG 
CCTTATCATG 
CCAGGGCATC 
GTAOGTTQAA 
CATAGTQQTC 




GTCTTCAGGO 
GOQCIOGGGA 
AATGA1GACT 
AOGAATCCAT 
GTGGTTQCTC 
CAGCTCAAGT 
GCA6ACAGCC 
AATAAGCCSIC 
GAGAATGGTG 
QAGCACAAGC 
CCAGGTACTT 
AATGGGGTGG 
TTCACAATTC 
AAAGTCCCTG 
ACCACX^GCAG 
CCCC3VGAAGT 
AOGGTCACTO 
GGGQGTOAGO 
C7GACAACCA 
GT GACCA A06 
CACGTQGAGQ 



GCTTCACCOC 
TTCTCCAGGT 
AGGCTGAAGT 
AAGTATTCAT 
TCACTGTGCX3 
TGAAGATCTT 
CAATATCT6T 
CTAATAAAGA 
CXrCTGAGGG 
TOQACOGGGA 
CCrCAGTGGA 
CCAAGTTTAC 
CTGTGATGCA 
TTGCTTACTC 
ACGQGAGCAC 
AGTACACACT 
TGGCAGTAGT 
ACGAGGCCCA 
ATCTGGAOOC 
AGGOGGACOi 
ggaaggqttt 
AjOGCCCCTTT 
ATOTGAATGA 




GACCTTGGAG 
GGGCTGCCCT 
GAATQGOQAG 
CCC3VT0CAAA 
CCCTQAAAAT 
TAGAGACACC 
TGTCTTCGCT 
GGAGATTGCC 
GGAG00CAT6 
CCAGGACACC 
GQTGACAGCC 
CATCCATAGC 
AGGCACCATC 
GACCATCCAG 
GGAGATCCTT 
TGTGCCTGAG 
CXCCAACTCA 
TTTTACCATC 
G6ATTT1GA0 
TQIGCTGAAG 
GGCACCTGTG 



51 
I 

GO CATQQ GQC 
CAGTX30GOGG 
G0GG6AGGCG 
GGGCAAGAGC 
ACAOTOCAGQ 
OGTATCTTAC 
GGCAAGGGTC 
AAGATTTTCT 
GTAGAGAAGG 
AAGTATGAGC 
AACATCTCCA 
TTCOGAGGGA 
ACAGATGAGG 
CAAGAACCAA 
AGOOTCATCT 
6CCACAGACA 
6ATGCCAATG 
AATGCAGTGG 
CCA60GTGGC 
ACCACCCACC 
GgCAAA AACX: 
CTCCCAACCT 
TTTGTOCCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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CCTCCAAAGT C X 3TT G A0GTC CAGQAGGGOV TCCCCACTGG GOAGCCTOTO TGTGTCTACA 1440 

CTQCAGAAOA CXXTTGACAAG GAGAATCAAA AOATCAGCTA CGGCATCCTG AGAGACCCAQ ISOO 

CAGGGT66CT AGOCATGGAC CCAGACAGTG GGCAGGTCAC AGCTaTGOGC ACCCTCGACC IS 60 

OTGASaATGA 0CA tfmt3 T G AGGAACAACA TCTAT6AAGT CATXSGTCTTO GOCATGGACA 1620 

ATGGAAOCCC TCCCACCACT GGCACX3GGAA CCCTTCTGCT AACACTOATT GAT0TCAACX3 1680 

ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 

ACXTTGCTGAA CATCAC3GGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CACGCCX31GC 1800 

TCACAGATGA CTCAGACATC TACTGGAGGG CAGAGGTCAA CGAGGAAOOT OACACAOTGG 1860 

T C i m TOX:!' GAAOAAGTTC CTGAAGCAG6 ATACATATGA CGT6CAGCTT TCTCTGTCTG 1920 

ACCATGGCAA CAAAGAGCAG CFGACXX3TGA TCAGOGCCAC TGTGTG06AC TGCCATGGCC 1980 

ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAQOTTTCAT CCTCCCTGTG CTGGGGGCT6 2040 

TCCTGOCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 

TCAAGGA6CC CCTCCTACTC CCAGAAGATG ACACCCXnXSA CAAGGTCTTC TACTATGGOG 2160 

AAOAOGGaGG TGOCOAACSAO QAOCAGGACT ATGACATCAC CCAGCTCCAC CGAOGTCTGG 2220 

AG6CCAGGGC GGA6GTGGTT CTC06CAATG A0GT6GCACC AACX3VTGATC CCGACACCCA 2280 

TGTACCOTCC TAGGCCAGCC AACCCAGATG AAATCOQCAA CTTTATAATT GA6AACCTGA 2340 

AGGCGGCTAA CACAGACCCC ACAGCOXQC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 

AGGGCAGOGG CTGCGAOGCC 60OTCCCTGA OCTCXXrTCAC CTCCTC06CC TCCGACCAAG 2460 

ACCAAiOATTA 06ATTATCIG AAGGAOTGGG GCAGCOGCTT CAA8AAGCT6 GCAGACATGT 2S20 

ACGGTGG06Q GQAGGACGAC TAGGCGGCCT GCCTGCAGGO CTOGGGACCA AAOOTCAGOC 2580 

CACAGAOCAT CTCCAA8GQQ TCTCAGTTCC CCXTTTCAOCT GAGGACTTCG 6AGCTTGTCA 2640 

GGAAGTGGCC QTAGCAACTT GGCGGAGACA GGCTATGAGT CTGAOGTTAO AGTGGTTGCT 2700 

TCCTTAOCCT TTCAGGATGG AGQAATOTGO GCAOTTTGAC TTCAOCACTO AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAG6CCAA GTTTCCA6AA GCCTCTTACC T6CG6TAAAA 2820 

TGCTCAACCC TGTGTCCTGG GCCTGGGCCT 6CTGTGACTG ACCTACAOTO GACTTTCTCT 2880 

CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGG6C CCACTGGCCO 3000 

TCCTGCATTT CTGQTTTCCA GACCCCAATO CCTOXATTC GGATGGATCT CTGCGTTTTT 3060 

ATACTGAQTG TGCCTAGGTT GCCCCTTATT TTTT ATTTTC CCTOTTGOOT TSCTATA6AT 3120 
QAAGGGTaAO QACAATCGTG TATATGTACT AOAACTTTTT TATTAAAQAA A 



A178 Protein sequence t 
Gene namei 
imigene niuhbert 
Probeset Accession ftt 
Protein Accession 8i 
Signal sequence i 
Transmenibrane domain t 

Cellular localization i 



Cadherln 3, P-cadherin (placental) 

HS.2877 

X63629 

CAA45177 

1-24 

659-675 

plasma meiribxane 



1 

I 

MGLPRGPLAS 
QEPALF8TDK 
KGPFPQRXdIQ 
YEXiFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATZWH 
DPA6WLAKDP 
VKDHGPVPBP 
TWLSI^KKFL 
6AVLALLFLL 
OXiBARPBWL 
DYEQSGSDAA 



11 

I 

LLLLQVCWLQ 
DDFTVRNGET 
LKSNXDBDTK 
NOASVEDPMN 
GWAYSZHSQ 
TAVAWEILD 
GiaSGDHFTIT 
VEDVHEAPVP 
DSGQfVTAVGT 
RQITICHQSP 
KQDTYDVRLS 
LVLLLLVRKK 
RHDVAPTIZF 
8LSSLTSSAS 



21 

1 

CAASBPCRAV 
VQERRSLKER 
IPySITOPGA 
ISZZVTDQin} 
EPKDPHDI14P 
ANDNAPMPDP 
THPESMQOZL 
VPPSKWEVQ 
U)RB)EQPVR 
VRHVLNITDK 
LSDHGNKEQL 
RKIKBPIjLLP 
TPMyRFRPAN 
OQDQDYDYUI 



31 

I 

FREAEVTliEA 
NPLKIFPSKR 
DSPPEGVFAV 
BKPKFTQDTF 
TIHRSTGTIS 
QKYEABVPEN 
TTRKQLDFEA 
B8IPTGEPVC 
NmyEVMVIA 
DLSFRTSPFQ 
TVIRATVCDC 
SCEDTRSNVFY 
PDEZGNFZIB 
EHGSRFXXIA 



41 


51 




1 

GGAEQEPGQA 


1 

LGKVFMGCPG 


60 


ILRRHKRDWV 


VAPISVPENQ 


120 


BKBTGWLLliN 


KPLDREEIAK 


180 


RG5VLB6VLF 


QTSVMQVTAT 


240 


VI8SGLDREK 


VPEYTLTIQA 


300 


AVtSIBVQRLT 


VTDLDAPNSP 


360 


KNQirrLYVEV 


TNEAPFVLKL 


420. 


VYTAEDPDKB 


NQKISYHILR 


480 


MDKG8PPTTG 


TGTIiLLTLID 


540 


AQLTDDSDIY 


WTAEVNEEGD 


600 


HGHVETCPGP 


WKGGPILPVL 


660 


YGEEGGGEED 


QDYDZTQLHR 


720 


NliKAAMTDPT 


APPYDTLLVF 


780 


DMYGGGBDD 







A179 DNA BBQOBWCB 
Gene naioei 
Unigene nund^en 
Probeset Accession Si 
nucleic Acid Accession #i 
Coding sequence I 



endothelial differentiation, lyaophosphatidic acid G-protein-coupled receptor, 

Hs. 258583 

NN_012152 

in«roi2i52 

43-1104 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC 
GACAAGCACA TOGACTTTTT TTATAATAGG 
GGAACAAAGC TTGTGATTGT TT T GTGTGTT 
TCTAATTCTC TGGTGATGGC GGCAGTGATC 
TACCTOTTGG CTAATTTAGC TQCTGC06AT 
ATGTTTAACA CAGGCCCAGT TTCAA AAACT 

(jGGCTTCTGG acagtagctt gactgcttcc 

AGGCACATGT CAATCATGAO GAT6CGGGTC 
CPGCTCATTT TGCTTGTCTG G6CCATCGCC 
TOGAATTGCC TCTGCAACAT CTCTGCCTGC 
TACCTTGTTT TCTGOACAGT (TTCCAACCTC 
CTOCGQATCT ACGTGTAOQT (aUtfSAGGAAA 
TCCATCAGOC QCCGGAGQAC ACOCATQAAO 



31 41 51 

I I I 

TTCTTCrCCA CAATSAATGA GTGTCACTAT 60 

AGCAACACTO ATACTGTOGA TGACrCGACA 120 

GOGAOGTTTT TCT0GCT6TT TATTTTTTTT 160 

AAAAACAOAA AATTTCATTr OCCCTTCTAC 240 

TTCTTOGCTG 6AATTGCCTA TGTATTCCTQ 300 

TTQACTGTCA ACCQCTGGTT TCTCCOTCAG 360 

CTCACCAACT TGCTGGTTAT OGCCGTCGAG 420 

CATAGCAACC TQACCAAAAA QAGGGTQACA 480 

ATTTTTATGO GGGCGCSTCCC GAGACTGGGC 540 

TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 600 

ATQGCCTTCC TCATCATGGT TGTGGTGTAC 660 

ACCAACGTCT TGTCTCOQCA TACAAOTGGG 720 

CTAATQAAGA OQOT G ATOAC TGTCTTAGGG 780 
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GCGTTTGTGG TATGCTGQAC CCOOGGCCTG Ult SQ TiXrit JC TCCTOQACGG CCTGAACTGC 840 

AGGCAGTQTO GOGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGOGCT GCTCAACTCC 900 

GTCOTGAACC CCATCATCTA CTCCTACAAG 6AGGAGGACA TOTATQGCAC CATGAAC2AAG 960 

ATOATCTQCT GCTTCTCTCA GGAGAAOOCA GAGAGGOGfTC CCTCTCQCAT CCCCTCCACA 1020 

GTCCTCAOCA OOAGTGACAC AGQCAGGCAO TACATAGAOO ATAOTATTAO CCAAQGT6CA 1080 

6TCTGCAATA AAAGCACTTC CTAAACTCTO OATGOCTCTC GOCCCACOCA OOTGATQACT 1140 
GTCTTAGG 



AI80 Protein sequence: 
Gene name: 
Unigene number t 
Probeset Accession ft: 
Protein Accession 
Signal sequence i 
Transmenibrane domains: 
Cellular Localization t 



endothelial differentiation, lysophosphatidic acid G -protein -coupled receptor, 

Hs. 256563 

NM_0 12152 

1Ip20362B4 

none found 

31-53, 66-66, 150-172, 190-211, 239-261, 277-295 
plasma menibrane 



1 11 21 31 41 51 

I I I 1 ! I 

MHECHYDKHM DPPYNRSNTD TVDDWTGTKL VIVLCVCSTFF CLFIPPSNSL VIAAVIKNRK 60 
FHFPPyyLLA NIAAADFFAG XAYVFU4F1IT GFVSKTLTVN RWFLRQGLU) SSLTASLTNL 120 
IfVIAVERHMS IHRKRVHSNL TKKRVTLLIL LVHAIAZFM6 AVPTLGflNCL OTISACSSLA 160 
PIYSRSYLVF WTVSNLMAFL IMWVirLRiy VYVKRKIllVIi SFflTSGSISR RRTPMKUIRT 240 
VMTVLGAFW CWTPGLWLL LDGUIGRQCQ VQHVKRHFLL LALUTSWNP IZYSYKDEDM 300 
YGTMKKMXGC FSQENPERRF SRIPSTVLSR SDTOSQYIED SZSQGAVCNK ST8 



A161 DMA SEQUENCE 

Gene namei ESTs 

Unigene munberi H8.162B59 

Probeset Accession D; AA569531 

Nucleic Acid Accession ftz AA569531 

Coding sequence t 1-504 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

i I t I I I 

ATGA CCTACA OTTACTCATT TTTCAGGCCT GAGTTQATCG TTAATCATCT TAATTATGTT 60 

CATTCIGAAG OCAACAGGAO AACCAA6ACC AAAACTTTAT TQTCTCTQCT TTCATTTCTT 120 

GATQAAACCT CTGGACTAA6 CACACATCTT CCTTOTTTAT CTCTCTCAAA GGAGT6TGGA 180 

GTGCTTCATC TGQACATCCA CGGGAAGAAO GAAOACATGA GAATGACCCA ACAGTCTTCC 240 

CAGCTATACC TGTGGGACAT GGGTGGTTTT ACAATATTTA AGAACCTGTO GATGAGCCTC 300 

ATACCCAGAa GGAACAAAOO CTCCCCAAAA AGAGTTACAG AAACCATCCT GAGAGATTTT 360 

AAGCAGAAGC AAAGTrCAAA GATCCAAGAG GAGAGACGAA GAGAOTCTOC AGGACCAAAC 420 

CTCTCTTCAT TCT6GTTT6T GGGGAATGCT GGAAGAGGAG ACAGGCCCCA 6ATTTGGGCA 480 

GGAAGTAAAC AGTTTTCAGO CTQAQGCCAA TCTGAGCAGG AACATTCCAA TATTTCTTCA 540 

GCTAOGTTGT CCCAGCACTT CACTGGTTAA CCTTTTATGT CCACCATTTG TGGATTTCAC 600 

AGCTACTTOT CAATGGTGAA TATTGATCAT CATCATTATC TACTGAGCTG CTACCATATC 660 

CCAGCTACTC CTTGCATGTT GTTCATTATT TTCTCAACAC TCAGCATATT TGCAATATGT 720 

TATQTAATAT CACAOACAAG GAAACT6AAC GCAGAAATGT TTTATTTCTT GCCAAACATC 780 

AGATQAGGAT 6MVCAA7GAA AGGQATTTGA AACCAGQATT GTerOATTCC AACATCTCTG 640 
QOTGCTTTTT CACTCIGATA TGCTGGAATT AAAAAGCCAT TTCTAAGACT GT 



A182 Protein sequence; 

Gene names ESTs 

Unigene nunberi Hs. 162659 

Probeset Accession S: AA569531 

Protein Accession ft: none found 

Signal sequence: 1-46 

Transmembrane domains t none found 

Cellular Localization: not determined 



1 11 21 31 41 51 

I I I I I ) 

MTYSySPFRP ELIVNHUIYV HSBANRRTKr KTLLSLLSFL DBTS6LSTUL PCL9LSKE09 60 
VLHLDIK6KK EDMRXTQQSS QLYLHDHGGF TZFKNLHMSL ZPRGNKRSPK RVTETILRDF 120 
XQXQSSKZQB ERUtESAGEM LSSFNFVQNA ORGDRPQIWA GSKQFSG 



A163 DWA SEQUaiCE 

Gene name: ESTs 

Unigene number: Hs. 179809 

Probeset Accession #: N95796 

Nucleic Acid Accession #: XM_0S0197 

Coding sequence: 310-1971 (underlined sequences correspond to start and st^ codons) 

1 11 21 31 41 51 

I i I I I i 

TCACACGTOC CAAGGGGCTG GCTCAGC6QA ACCAGCCTQC ACGCGCTGGC TCCGGGTGAC 60 

AGCCGCGCGC CTCGGCCAGG ATCTGAOTGA TGAQAOQTOT CC0CACT6A6 GT6CCCCACA 120 

GCAGCAGGTG TTGAOCATGO GCTGAGAAGC TGGACCGGCA CCAAAGGGCT GGCAGAAATG 160 

GGCGCCT66C TGATTCCTAG 0CAOTTGG08 GCAGGAAaQA GGAGA06COG CAOCTTCTOG 240 

AGCAGAGCOO AOAOGAAGCA GTTCTGGAGT GCCTQAACGG CCOOCTQAGC CCTACOOGCC 300 
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5 

Id 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TGGCCCACTA^ 
CTCTTQCTGQ 
TATGTGCCGC 
GQCATTQGTC 
T6G06TGGAC 
CTGAGCCTCr 
AGGCCCCTGQ 
GTGTGCTTCA 
OGCCAGGCCT 
CTGCCrOCCA 
TGCCTCTTTQ 
GCTGA6QAGQ 
TCGCCCCACT 
0CC06GCTGC 
GAGCTGT6CA 
GAGGGGCTGT 
GATGAAGGCG 

AGTGTOQCAO 
GTGAGAGCTT 
ACACTGGCCT 
ACTGGAG6TG 
GGAGCTCOCT 
CC0GCX3CTCT 
ACOQAGGCGA 
GCCTTCCTGC 
CAGTCTGTCA 
GCTACACAGQ 
AGCACATTOG 
ATG6G0CTGC 
GCCACCCTGT 
CTCTCCCCAG 
TTATACA6GG 
ACXXAG6CTC 
GGOAOCTGAA 
0C5TTTAATGT 
ACATATGAAA 
CCTCA6CCCC 
TT 



TOGTCCAGAG GCTGTGGGTG 
TCAACCTGCT AACCTTTOGC 
CTCTGCTGCT GGAAGTGGGG 
CAGTGCTGGG CCTGGTCTGT 
GCTATGGCCQ CCGCO GG CCC 
TTCTCATCCC AAGGGCOJGC 
AGCTGGCACT GCTCATCCTG 
CTCCACTGGA GGCCCTGCTC 
ACTCTQTCTA TGCCTTCATO 
TTGACTGGOA CACCAGT6CC 
GCCTGCTCAC CCTCATCTTC 
CAGCOCTGGG CCCCACCGAO 
GCTGTCCATG COGGGCCCGC 
AOCAGCTGTO CTGCGQCATG 
GCTGGATGGC ACTCATGACC 
ACCAGGGOGT 6CCCAGA0CT 
TTGGGATG6G CA6CCTGGGG 
TCATGQAGOO GCTGGTGCAG 

c nri-cuL w r o gc to c oqqt 
CAGcaxxxrr cacogggttc 

CCCTCTACCA COQGGAQAAG 
CTAGCAGTGA GGACAGCCTG 
TCCCTAATGG AGAC6TGGGT 
GOGGGG C XnC T G CCTGTGAT 
6GGTGGTTCC G6600C3GQ6C 
TGTCCCA6GT GGCCCCATCC 
CTQCCTATAT GOTGTCTGCC 
TAGTATTTGA CAAGAGG6AC 
GGTG6A0GGC CTGCCTGACT 
0Q66CTGGCC GCCAOTTTCT 
GCTGCTGAGG TGCGTAGCTG 
TCTCTAGGGC TGCCTGACTG 
AGGCCA6AAG GGCTCCATGC 
AGGGTTAACA GCTAGCCTCC 
TAAACTCAGT CACCIGOTTT 
AGCTCTTGCA TGGGAGTTTC 
GTTATTTGTA GGOQAAGAGT 
ACAGGCACTG GTCTTTTTTG 



AGCCX5CCTGC 
CTGGAGGTGT 
GTAGAGGAGA 
GTCCOGCTCC 
TTCATCTGGG 
TG6CTAGCAG 
GGCGTGGGGC 
TCT6ACCTCT 
ATCAGTCTTG 
CT GGC CCCCT 
CTCACCTGCG 
CCAGCAOAAG 

COCOOGAOCC 
TTCAOGCTGT 
GAGCCGGGCA 
CTGTTCCTGC 
GGATTOQGCA 
GCCACAT6CC 
ACCTTCTCAG 
CAGGTGTTCC 
ATGACCAGCT 
GCT6GAGGCA 
GTCTCCQTAC 
ATCroCCTG O 
CTQTTTATGG 
GCAGGCCTGG 
TTGGCCAAAT 
GGGTCOCAGC 
(7TT0CTGCCA 
CACAGCTGGG 
GAGGCCTTCC 
ACTGGAATGC 
TAGTTGAGAC 
CCCATCTCTA 
TAGGATQAAA 
CCTGAGOGGC 
CTNGANTCCA 



TGGGGCAC06 
GTTTGGOOGC 
AGTTCATGAC 
TAGGCTCAGC 
CACTQTCCTT 
GGCrGCTGTQ 
TGCTGGACTT 
TOQGGGACCC 
GGGGCTGCCT 
ACCTOGGCAC 
TAGCAGCCAC 
GGCTGTCGGC 
GGAACCTGGG 
TG06C0GGCT 
TTTACACGGA 
CCGAGGCCrO 
AGTGCGCCAT 
CTCX3AGCAGT 
TGTCCCACAG 
0CCTGCA6AT 
TOCCCAAATA 
TCCTGCCAG6 
OTGGCCTGCT 
GTCTGGTGOT 
ACCT08CCAT 
GCTCCATTGT 
OTCTGGTOGC 
ACTCAGC GTA 

AAOTAATOTQ 

GGCTGGGGGG 
AAGGOGGTTT 
GG6GACTCTG 
ACACCTAGAG 
AGCCCCTTAA 
CACTCCTCXIA 
AACACACAAG 

coccccoccr 



GAAAGCCCAG 
AGGCATCACC 
CATGGTGCTO 
CAGTQACCAC 

GGGCATCxrra 

CCOGGATCCC 
CTGTG6CCA0 
GGACCACTQT 
GGGCTACCTC 
CCAGGAGGAG 
ACTGCTGQTG 
OCCCTCCTTG 
OQCCCTOCTT 
CTTOGTGGCT 

GAGACACTAT 
.CTCCCTGGTC 
CTATTTG6CC 
T GTGGCOGTG 
CCXGCCCTAC 
CCGAGGGGAC 
CCCTAAGCCT 
CCCACCTCCA 
GGOTGAGCGC 
CCTGGATAaT 
CCAGCTCAGC 
CATTTACTTT 
_GAAAACTTCC 
T0TTA6CCCC 
QCTCT C T GC T 
TCCCTCTCCT 
CAGTCTGGAC 
CAGGT6GATT 
AAGG6TTTTT 
CCT6CA6CTT 
TGGGATTT6A 
AACCAGOTCC 
CTTTACCCTT 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 



A1B4 Protein se<3Mence8 
Gene name: 
unigene nunben 
Probeaet Accession #: 
Protein Accession #x 
Signal sequence: 
Transmembrane domains t 
Cellular Localizations 



BSTs 

HS. 179809 
N95796 
XP_OS0197 
none 

21-43, 53-75, 90-112, 125-147, 
plasma menibrane 



160-182, 199-221, 323-345, 350-372, 379-401 



MVQRIMVSRIi 
PVLGLVCVPL 
ELALL1L6VG 
IDHDT8ALAP 
CCPCRARI4AF 
YOaVPRABPG 
APPVAAGATC 
ASSEDSIMTS 
RWPGRGICL 
WPDKSDLAX 



11 
I 

LRKRKAQLLL 
L6SASDRWR0 
LIiDFOSQVCP 
YLGTQEECLP 
RNLQALLPRL 
TEARBHYDEG 
LSHSVAWTA 
FLPGPKPGAP 
DLAILDSAPL 
YSA 



21 
I 

VNLLTFQLEV 
RYGRRRPPIW 
TPLEALLSDL 
GliLTLIPLTC 
RQUX3U4PRT 
VXtMQSIiGXiPL 
SAALTOPTFS 
ppnqbvgago 
LSQVAPSXjPM 



31 
I 

CLAAGITYVP 
ALSLGILLSL 
FKDFSHCRQA 
VAATLLVAEB 
LRRLFVAELC 
QCAI8LVPSI1 
ALQIIiPYTIiA 
SGLLPPPPAL 
GSXVQLSQSV 



41 

1 

PLXjLEVGVEB 
FLIPRAGWLA 
YSVYAFMISL 
AALGPTEFAE 
SWMALMTFTL 
VMDRLVQRF6 
8LYHREKQVP 
06A8ACDVSV 
TAYMVSAAGIt 



51 
1 

KFMTMVLQIO 
GLLCPDPRPL 
GGCLQYLLPA 
GLSAP8LSPH 
FYTDFVGBOli 
TRAVYLASVA 
liPKyRGDTGQ 
SVWGBPTEA 
GLVAXYPATQ 



60 
120 
180 
240 
300 
360 
420 
480 
540 



A185 DNA SEQUENCE 
Gene name: BSTs 
unigene number t Hs. 11260 

Probeset Accession Ut R73640 
Nucleic Acid Accession #1 AK002126 

Coding sequence! 1-1593 (underlined sequences correspond to start and stop codons) 



1 
I 

ATGQ TTOGOC 
TGCTGTGCTA 
CTGQCACTGC 
GAGTGGGAG6 
AAGQAGQAOC 
6CT6CTG6GC 
TTCCTOCACT 
TAT6CAGCAG 
CTTACCG6CC 
6CCATT6AAT 



11 
I 

GQQQ6CT0CT 

TCTCTGTCCT 
CGAGGGCCAA 
AGCAGCACCQ 
TGCAGGAGAG 
TGGGTCTGGA 
CGCAGGTGGA 
TGCCTTTCQA 
ACCCGQAGGA 
CA6CCTTGGA 



21 
I 

TQ0QTG6ATT 
GTACATGTTG 
CAGCCCCACG 
CAACTACQTG 
OAOTOAGCAa 
CAGGAGCCCC 
CAAGGCAGAO 
Ti^CTTTACT 
GAAGCCTGTO 
GACOCTGAAC 



31 

I 

TCC0GG6TG0 
GCCTGCACCC 
GGGAAGGAGG 
AGCAGCCTGA 
CTCA6GAAT0 
CCA0A6AAAA 
OTGAATGCTG 
CTACAGAAGG 
AGGAAGGACA 
AATOCTGCA6 



41 

I 

TGGTTTTGCT 

CAAAAGGTGA 
G0TACCAG6C 
AGOGGCAGAT 
GGCAGTACCA 
CCCAG6C0SA 
GOQTCAAQCT 
TGTACCAGCT 
AGGQGQATQA 
AGAACAGOCC 



51 
I 

GGTGCTCCTC 
CGAGGAGCAG 
CGTCCTTCAG 
CGCACAGCTC 
AaCCAaOGAT 
CCTCCTGGCC 
GGCCACAGAG 
GGAGACTGGC 
GTTGGTQGAA 
CAATCACCGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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CCTTACftOGG CCTCTGATTT CATAGAASGO ATCTACOGAA CAGAAAGGGA CAAAGG6ACA 660 

TTGTATGAGC TCACCTTCAA AGGGGACCAC AAACAOGAAT TCAAAOGGCT CATCTTATTT 720 

OQACCATTCO GCOXATCAT GAAAGTGAAA AATGAAAA6C TCAACATGGC CAACACGCTT 780 

ATCftATGTTA TOGTGCCTCT AGCAAAAA60 OTGGACAAOT TOCGGCAGTT CATGCAQAAT 840 

TTOUSGQAQA TGTGCATT6A GCAOGATGGG AGAGTCCATC TCACTGTTGT TTACTTTGGG 900 

AAA6AAGAAA TAAATGAAGT CAAAQQAATA CTT6AAAACA CTTCCAAAGC TGCCAACTTC 960 

AGGAACTTTA CCTTCATCCA GCTGAATGGA CAATTTTCTC GGGGAAAGGG ACTTGATGTT 1020 

GGAQCCQGCT TCTGGAAGGG AAGCAAGGTC CnViVXTlT TCTGTGATGT GOACATCTAC 1080 

TTCACATCTO AATTCCTCAA TAO G T GTA GQ CTGAATACAC AGOCAGGQAA GAAGGTATTT 1140 

TATCCAGTTC TTTTCAGTCA GTACAATOCT OGO^TAATAT A066CCACCA TGATGCAGTC 1200 

CCTCXXTTGG AACAQCAGCT GOTCATAAAG AAGQAAACTG GATTTTGGAO AGACTTTGGA 1260 

TTTGGGATGA CGTGTCAGTA TCGGTCAGAC TTCATCAATA TAGGTGGGTT TGATCTGGAC 1320 

ATCAAAOGCT QGQGCGGAGA GGAT6TGGAC CTTTATOQCA AOTATCTCCA CAGCAACCTC 1380 

ATAGTGGTAC GQA06CCTGT GGGAGOACTC TTCCACCTCT OQCAtXaAGAA G06CTGCATG 1440 

GAOQAGCTQA CCCCOSAGCA GTACAAGAT6 TGOVTGCAST CCAAGSCCAT GAAOQAOGCA 1500 

TCCCACOGCC AGCTGG6CAT 6CT0GTGTTC AGGGACOAiSA TAQAGGCTCA CCTTCGCAAA 1560 
GAGAAACA6A AQACAAOTAG C3UUUUU^CA TGA 



A1B6 Protein sequence » 
Gene name: ESTs 
Unigene number i Hs. 11260 

Probeset Accession #: R73640 
Protein Accession #i NP_060841 
Signal sequence* l-?6 
Transmembrane domainei iu>ne found 
Cellular X«ocalizatlons not determined 

1 11 21 31 41 51 

1 I t I I 1 

MVRSGLLAWI SRVWLLVLL CCAZSVLYML ACTPKGDEEQ LU<PRANSPT GKEGYQAVIiQ 60 
EWEBQHRNYV SSliKRQIAQli KEEl^BRSBQ LRNGQYQASD AAGI.GLDRSP PEKTQADLIiA 120 
FLHSQVDKAE VNAGVKLATB YAAVPFDSFT LQKVYQIiBTG LTRHPBBXPV RKDKRDELVB 180 
A1ESAI£TUI NPAENSPiOXR PYTASOFIGO XYRTERDXGT LYELTPKQDH KHBFKRLXLP 240 
RPFGPIKKVK NSKUIMAIITL INVXVPIiAKR VDKFRQFMQN FREMCZBQDG RVKLTWYFG 300 
KEBINEVKOI IiENTSKAANF RNFTPIQUIG EPSRGKGLDV GARFWKGSKV LLFPCDVDIY 360 
FTSBPIiMTCR LNTQPGKKVP YPVLPSQYNP GIIYQIHDAV PPLBQQLVIK KETQFWRDPG 420 
FGHTGQYRSD FZNIGOFOIJ) IKGWGGEDVH LYRKYLHSNL IWRTPVRGL PHLWHEKROI 480 
DELTPEQYKM CMQSKAMHBA SHGQL6NLVF SHBZEAHUtK QKQKTSSXKT 

A187 DNA SEQXJEWCB 

Gene name: ATPase, Ca** transporting, type 2C« mendtMr 1 

UOigene nuihben He. 106778 

Probeset Accession #t N51919 
Nucleic Acid Accession #i AF189723 

Coding sequence: 1-2712 (underlined sequences correspond to start and stop codons) 



* 1 11 21 31 41 51 

I } ) I i 1 

ATGATTCCTG TATTGACATC AAAAAAAQCA AOTQAATTAC CABTCAGTGA AGTTGCAAGC 60 

ATTCTCCAAQ CTGATCTTCA GAATGGTCTA AACAAATGTG AAGTTAGTCA TAGGOGAGCC 120 

TTTCATGGCT GGAATQAGTT TGATATTAOT GAAQATGAGC CACTGTGQAA GAAGTATATT 180 

TCTCAGTTTA AAAATCCCCT TATTATGCTG CTTCTGGCTT CTGCA6TCAT CAGTGTTTTA 240 

ATGCATCAQT TTOATGATGC CGTCAGTATC ACTGT6GCAA TACTTATCGT TGTTACAGTT 300 

GCCTTTGTTC AGGAATATCG TTCAQAAAAA TCTCTTGAAO AATTGAGTAA ACTTGTGCCA 360 

CCAQAATGCC ATTQTGTGCG TGAAGGAAAA TTGGAGCATA CACTTGCCXX3 AQACTTGGTT 420 

CCAGGTGATA CAGTTTGCCT TTCTGTTGGQ GATAGAGTTC CTGCTQACTT ACGCTTGTTT 480 

GAGGCTGTGG ATCTTTCCAT TGATGAQTCC AQCTTGACAG GTGAGACRAC GCCTTGTTCT 540 

AAGGTGACAG CTCCTCAGCC AGCTGCAACT AATGQAGATC TTGCATOQA0 AAGTAACATT 600 

GCCTTTATGQ GAACACTGGT CAGATGTGGC AAAGCAAAG6 GTGTTGTCAT TGGAACAGGA 660 

GAAAATTCTG AATTTGGGGA G6TTTTTAAA ATGATGCAAG CAGAAGAGGC ACCAAAAACC 720 

CCTCTGCAGA AGAGCATGGA OCTCTTAGQA AAACAACTTT CCTTTTACTC CTTTGGTATA 780 

ATAGQAATCA TCATGTTGGT TQG C TQ O TT A CTGGGAAAAO ATATCCTGGA AATOTTTACT 840 

ATTA6TGTAA GTTTGGCTGT AGCAGCAATT CCTGAAGGTC TCCCCATTGT GGTCACAGTG 900 

AOQCTAGCTC TTGGT6TTAT GAGAATG6TG AAGAAAAGGG CCATTGTGAA AAAGCTGCCT 960 

ATTOTTGAAA CTCTGGGCTG CTGTAATOTG ATTT6TTCA0 ATAAAACTGG AACACTGACG 1020 

AAQAATGAAA TGACT6TTAC TCACATATTT ACTTCAGAIG GTCTGCATGC TGAGGTTACT 1080 

GGAGTTGGCT ATAATCAATT TGGGGAAOTG ATTGTTGATG OTQATGTTQT TCATGGATTC 1140 

TATAACCCAG CTGTTA6CAG AATTGTTQAG G06G6CTGTG TGTGCAATGA TGCT6TAATT 1200 

AGAAACAATA CTCTAATOGG QAAOCCAACA GAAGGGGCCT TAATTGCTCT TGCAATQAAO 1260 

ATGGGTCrXG ATOGACTTOl ACAAGACTAC ATCAOAAAAG CTGAATACCC TTTTAOCTCT 1320 

GAGCAAAAGT GGATGGCTGT TAAGTQTOTA CAGGGAACAC AGGftGGACAO ACCAGAGATT 1380 

TQTTTTATOA AAQQTGCTTA OGAACAAGTA ATTAAGTACT OTACTACATA CCAOAGCAAA 1440 

GGGCAGACCT TGACACTTAC TCAGCAGCAG AGAGATGTGT ACCAACAAGA GAAG6CACGC 1500 

ATQGOCTCAa GQG8ACTCAG AGTTCTTGCT rrUUCTTCiU GTOCIGAACT GGGACAGCTG 1560 

ACATTTCTTG GCTTQ6TGGQ AATCATTGAT OCACeTAGAA CIOSTGTGAA AGAAGCTOTT 1620 

ACAACACTCA TTGCCTCAGG AGTATCAATA AAAAT6ATTA CTGGAGATTC ACAGGAGACT 1680 

GCAGTTGCAA TOGCCAOTCO TCTGGGATTO TATTCCAAAA CTTCCCAOTC AGTCTCAGGA 1740 

GAAQAAATAO ATGCAATGGA TOTTCAGCAG CTTTCACAAA TAGTACCAAA GGTTGCAGTA 1800 

mXACAQAO CTAGCCCAAG QCACAAGATG AAAATTATTA AOTOOCTACA GAAGAACGGT 1860 

TCAGTTGTAO CCAT6ACAG0 AGATGGAGTA AATGATGCAG TTGCTCtXlAA GGCTOCAGAC 1920 

ATTQQAGTTQ OQATGGGCCA GACTGGTACA GATGrTTTGCA AAGAGGCAGC A6ACATGATC 1980 

CTAGTGGATG AT6ATTTTCA AACCATAATQ TCTGCAATG5 AAGAGGGTAA A6G6ATTTAT 2040 
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AATAACATTA 
ATCTCATTGG 
ATCAATATTA 
GATGTCATTC 
CTTAAAATAC 
GAGCTACXSAG 
TTTTTTGACA 
GC5ACTCTGCA 
CTA6TTATTT 
GATCTOTTQT 
AAGGTTGAAA 
CTTCZAAGTAT 



AAAATTTOGT 
CTACATTAAT 
TTATGQATGG 
GTAAACCTCC 
TTGTTTCATC 
ACAATGrTGAT 
TGTTCAATdC 
GTAATAOAAT 
ACTTTOCTCC 
TTCTTTTGGG 
GGAGCAGGQA 
GA 



TAGATTCCAQ 
GAACTTTCCT 
ACCCCCAGCT 
T06CAACT60 
AATAATCATT 
TACACCTCGA 
ACTAAGTTCC 
GTTTTGCTAT 
GCTTCAGAAO 
TCTCACCTCA 
AAASATCCAO 



CTGAGCACGA 
AATOCTCTCA 
CAGAGCCTTG 



GTTTGTGGGA 
GACACAACAA 
AGATCCCAGA 
GCAGTTCTTG 
GTTTTTCAGA 
TCA0T6TGCA 
AAGCATGTTA 



GTATAGCAGC 
ATGCCATGCA 
GAGTAGAACC 
TTTTGACTAA 
CTTT g T WGT 
TGACCTTCAC 
CCAAGTCTGT 
GATCCATCAT 
CTOAGAGCCT 
TAGTGQCAGA 
OTTGGACATC 



ATTAACTTTA 
QATTTTOTGG 
AGTGGATAAA 
AAACTTOATA 
CTTCTGGGOr 
ATGCTTTGTO 
GTTTGAGATT 
GGGACAATTA 
AAGCATACTG 
AATTA TAAAO 
ATCATCTTTT 



2100 
2160 
2220 
22B0 
2340 
2400 
2460 
3520 
3580 
2640 
2700 



A188 Protein sequence: 
Gene name: 
Unlgene number: 
Probeset Accession 8: 
Protein Accession #: 
Signal sequence: 
Transroeiiibrane domains: 
Pfam domains: 
[744-689] 

Cellular Localization: 



Rienft>er 1 



ATPaee, C&** transporting, type 2C, 
HS. 106778 
N51919 
AAP27813 
none found 

60-82, 86-108, 252-274, 282-304, 757-779, 827-849, 856-878 

Cation ATPase K [6-83] , E1-B2 ATPase [89-324] , Hydrolase [328-649] , Cation ATPase C 
not determined 



i 11 21 31 41 51 

I ) I I I I 

MIPVLTSKKA SBLPVSBVAS ILQADLQNGL NKCEVSHRRA FHGWNEFDIS EDEPLWKKYI 60 

SQFKNPLIML LXiASAVISVL HHQFDDAVSI TVAZLIWTV AFVQEYRSEK SLBELSKLVF 120 

PBCHCVREGK LEHTbARDLV PGDTVCLSVG DRVPADLRLP BAVDZ«SJDES SLTGBTTPCS 180 

KVTAPQPAAT NGDLASRSMI AFMGTIiVRCO KAKOWIGTO ENSBFGBVPK MMQAEEAPKT 240 

PLQKSMDLLQ KQI*SFYSFGI IGIIMLVGWL LGKDILEMFT ISVSLAVAAI PEQLPIWTV 300 

TLALGVMRMV KKRAIVKKLP IVBTLGCCNV ICSDKTGTLT KNBMTVTHIP TSDGLHAEVT 360 

6V0YNQFGEV IVDGDWHGF YNPAV6RXVB A6CVQIDAVZ RNNTIiMGKPT EGAIiXAIiAMK 420 

HGLDGLQQDy ZRKABYPPSS EQXNHAVKCV KRTQQDRFEZ CFMRBAYGOV XKYCTTYOSK 480 

OQTLTLTOQQ RZJVYQQEKAR MGSAOliRVIiA LASGPELQQL TFLGLVGZZD PPRTGVKBAV 540 

TTLIAS6V9I KMITQDSQBT AVAIASRLGL YSKTSQSVSG KEIDAMDVQQ LSQIVPKVAV 600 

FYRASPRHKM KIIKSLQKNG SWAMTGDGV NDAVALKAAD IGVAMGQTGT DVCKEAADMI 660 

LVDDDFQTIM SAXEEGKGIY HHIKNFVRFQ LSTSIAALTL XSXiATLMNFP NPLNAMQZLW 720 

INIIMDGPPA QSL QVBP VDK XrVZRKPPRNW KDSXLTKNIiI IiKILVSSIZI VOnTiFVFWR 780 

BliRDNVITPR DTTMTPTCFV FFSMFMAIiSS R6QTKSVFBX OLCSHRNFCy AVLQSZMGQL 840 

LVXYFPPLQK VFQTBSLSIXi DUiFLLGLTS 8VCXVABXXK 1CVER6REKXQ XKV8ST8S8P 900 
LBV 



A189 DNA 8BQUBNCB 
Gene name: 

Unlgene number: 
Probeset AcceBoion #i 
Nucleic Acid Accession fti 
Coding sequence: 



BSTs 

HS. 293185 

N62096 

N62096 

1-1284 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

i i t I I I 

ATOGGCTACC AGAGQCAGGA GCCTGTCATC CCGC06CAQA QAGGAT7GCC TTATTCAATG 60 

AAGC AAGCTG GgrrrCCATA' GGQAATATTQ CTTTTATTCT GGGTTTCATA TGTTACAGAC 120 

TTTTCCCTTG TTTTATTGAT AAAAGGAGGG OCCCTCTCTQ GAACAGATAC CTACCAGTCT 180 

TTGGTCAATA AAACTTTCGQ CTTTCCAGGG TATCTGCTCC TCTCTGTTCT TCAOTTTTTG 240 

TATCCTTTTA TAGCAATGAT AAGTTACAAT ATAATAGCTG QAQATACTTT GAGCAAAGTT 300 

TTTCAAAGAA TCCCAGGAGT TGATCCTGAA AACGTQTTTA TTGOTCQCCA CTTCATTATT 360 

GGACTTTCCA CAGTTACCTT TACTCTOCCT TTATCCTTOT ACOGAAATAT AGCAAAGCTT 420 

66AAAGGTCT CCCTCATCTC TACAGGTTTA ACAACTCTGA TTCTTGGAAT TGTAATGGCA 480 

AGGGCAATTT CACTGGGTCC ACACATACCA AAAACAGAAG ACGCTTGGGT ATTTGCAAAG 540 

CCCAATGCCA T TCAAGCGGT OGGGGTTATO TCTTTTGCAT TTATTTGCCA CCATAACTCC 600 

TTCTTAOTTT ACAOTTCTCX AOAAGAACCC ACAGTAGCTA AGTGGTCCOQ CCTTATCCAT 660 

ATOTCCATGQ T6ATTTCTGT ATTTATCTGT ATATTCTTTO CTACATGTGG ATACTTGAGA 720 

TTTACTGGCT TCACCCAAGG GGACTTATTT GAAAATTACT GCAGAAATQA TQACCTGGTA 780 

ACATTTGGAA GATTTTGTTA TQGTGTCACT QTCATTTTQA CATACCCTAT GGAAT6CTTT 840 

GTOACAAGAG AGGTAATTGC CAATGTGTTT TTTGGTGGGA ATCTTTCATC GQTTTTCCAC 900 

ATT6TT0TAA CAGTGAT6GT CATCACT6TA GCCAOSCTTO TGTCATTGCT GATTGATTOC 960 

CTGGGGATAG TTCTAGAACT CAATGGTGTQ CTCTOTOCAA CTCCCCTCAT TTTTAICATT 1020 

OCATCAQ CCT GTTATCTGAA ACTGTCTQAA GAACCAAGOA CACACTCC6A TAAGATTATO 1080 

TCTTGTOTCA TGCTTCCCAT TGGTGCrGTQ GTGATGGTTT TTGG A ITC B T CATGGCTATT H40 

ACAAATA CTC AAGACTGCAC CCATGGGCAQ GAAATCTTCT ACTOCTTTCC TQACRATTTC 1200 

TCTCTCACAA ATACCTCAGA OTCTCATGTT CAGCAGACAA CACAACTTTC TACTTTAAAT 1360 
ATTAGTATCT TTCAACTCXiA GTAA 



A190 Protein sequence: 

Gene name: bsTs 

Qnlgene number: Hs. 2 93 185 

Probeset Accesslcm fti II62096 

Protein Accession #: none found 

Signal sequence: none found 

Transneidbrane domains: 28-50, 66-88-112-134, 142-164, 217-239, 260-282, 298-320, 327-348, 359-381 

921 



wo 03/042661 



PCTAJS02/36810 



Cellular liocalizatlon: plasna metabrane 



1 
I 

M6YQRQEPVI 
LVNKTFGPPO 
GLSTVTPTLP 
PNAIQAVGVM 
PTGFTQQDLP 
IWTVMVITV 
SCVMLPI6AV 
ISIFQLB 



11 
I 

PPQRGLPYSM 
yiiLLSVLQFL 
LSLYRNIAKL 
SPAFICHHNS 
ENYCRNDDLV 
ATLVSLLZDC 
VMVFGFVKAZ 



21 
I 

KQA6FPL6IL 
yPFIAMZSYN 
GKVSLISTGL 
FLVYSSLBBP 
TFGRPCYOVT 
LGIVLEUIGV 
TmQDCTHQQ 



31 

I 

LLFWVSWTD 
IIAGDTLSKV 
TTLILGIVMA 
TVAKWSRLIH 
VXliTYFMBCP 
LCATPLZFZZ 
BNFyCFFDNF 



41 

I 

PSLVIiLIKGG 
FQRIPQVDPB 
RAIS1/3PHZP 
MSIVISVPIC 
VTRBVIANVP 
PSACYLKLSB 
SLTMTSBSKV 



51 
I 

ALSGTDTYQS 
NVPIGRHPH 
KTEDAWVPAK 
IFPATCGYLT 
PGQJLSSVFH 
EPRTHSDKZM 
QQTTQIiSTXiN 



60 
120 
IBO 
240 
300 
360 
420 



A191 DNA SEQUENCE 
Gene name: 
Unigene nund>ert 
Probeeet Accession #; 
Nucleic Acid Accession #x 
Coding sequence: 



ESTs 

Kb. 293185 

N62096 

N62096 

1-1203 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC C06CC6CAGT r n XXfl'mT TTTATTGATA 60 

AAAGGAGGGG CCCTCTCTGG AACAGATACC TACCAGTCTT TGGTCAATAA AACTTTCGGC 120 

TTTCCAOGGT ATCTOCTCCT CTCTOTTCTT CAGTTTTTOT ATCCTTTTAT AGCAATGATA 180 

AGTTACAATA TAATAGCTGG AGATACTTTG AGCAAAGTTT TTCAAAGAAT CCCAGGAGTT 240 

GATCCTQAAA ACGTGTTTAT TG6TCGCCAC TTCATTATTO QACTTTCCAC AGTTACCTTT 300 

ACTCTGCCTT TATCCTTGTA CCGAAATATA 6CAAAGCTTG 6AAAGGTCTC CCTCATCTCT 360 

ACAOOTTTAA CAACTCIGAT TCTTGQAATT GTAATGQCAA GQGCAATTTC ACTGGGTCCA 420 

CACATACCAA AAACAGAA6A G6CTTGGGTA TTTGCAAAGC CCAATGCCAT TCAAGCGGTC 480 

GGGGTTATGT CTTTTGCATT TATTTGCCAC CATAACTCCT TCTTAQTTTA CAGTTCTCTA 540 

GAAQAACCCA CAGTAGCTAA GTGGTCCOGC CTTATCCATA TGTCCATCGT GATTTCTGTA 600 

TTTATCT6TA TATTCTTTGC TACATGTGGA TACTTQACAT TTACTGGCTT CACCCAAGGG 660 

GACTTATTTO AAAATTACTG CA6AAATGAT GAOCTOOTAA CATTTGGAAG ATTTTGTTAT 720 

GGTGTCACT6 TCATTTTGAC ATACCCTAT6 GAAT6CTTT6 TGACAAGAGA GGTAATTGCC 780 

AATGTOTTTT TTGGTOGGAA TCTTTCATCG GTTTTCCACA TTOTTaTAAC AGTGATGGTC 840 

ATCACTGTAG CCACGCTTGT GTCATTGCTG ATTGATTGCC TCGGGATAGT TCTAGAACTC 900 

AATGGTGTGC TCrGTGCAAC TCCCCTCATT TTTATCATTC CATCAOCCTO TTATCTQAAA 960 

CTOTCIGAAG AACCAAGGAC ACACTCCX3AT AAGATTATOT CTTOTGTCAT GCTTCCCATT 1020 

GGTGCTGTGG TGATQQTTTT TGOATTCOTC ATGGCTATTA CAAATACTCA A8ACTGCACC 1080 

GATGGGCAGG AAATGTTCTA CitiCmTXT OAGAATTTCT CTCTGACAAA TACCTCAGAG 1140 

TCTCAT6TTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT TCAACT06AG 1200 
TAA 



A192 Protein sequences 
Gene name: 
Uaigene number i 
Probeset Accesaion tf: 
Protein AccesBlon #i 
Signal sequence: 
TranBtnembrane dofnainst 
Cellular Localizations 



BSTa 

Hs. 293185 
N62096 
none found 

1-26 

45-61, 92-106, 117-132, 191-207, 274-290, 297-313, 335-351 
plastna membrane 



1 

1 

MOYQRQEPVZ 
SYNZIAGDTL 
TQLTTLlIiQI 
EEPTVAKS7SR 
GVTVZIiTYPM 
liGVLCATPLZ 
HOQEKFYCPP 



11 

I 

PPQPSLVLI.I 
SKVFQRZP6V 
VMASAZ8LGP 
L1HMSZVZ8V 
ECPVTREVZA 
FZZPSACYLK 
DNPSLTMTSB 



21 

I 

KGGALS6TDT 
DPENVFZGRB 

RZPicra}Awv 

PZCIPFATCG 
NVFFGGNLSS 
LSBEPRIBSD 
SHVQQTTQLS 



31 



41 

! 



51 



YQSLVMKTFG PPGYLLL8VL QFLYPFZAMZ 60 

FZZGLSTVTF TLPLSXiYRHZ AKLOKVSLZS 120 

PAXPNAZQAV 6VMSFAFZCH RNSFLVYSSL 180 

YLTFTGFTQG DLFB2IYCRND DLVTFGRFCY 240 

VPHZWTVMV ZTVATLVeUL ZDCLGIVLEL 300 

KZMSCVMLPZ 6AWMVFGFV HAITNTQOCT 360 
TI^ZSZFQLB 



A193 DMA SEQUENCE 
Gene name : 
Unigene nuniberi 
Probeset Accession tf: 
Nucleic Acid Accession ttt 
coding sequence I 



BSTS 

HS. 293185 

M62096 

N62096 

1-1140 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I t I 1 I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCOOCAGG TCAATAAAAC TTTCGGCTTT 60 

CCRGGGTATC TGCTCCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATGATAAOT 120 

TACAATATAA TA6CTGGAGA TACTTTGAGC AAAGTTTTTC AAAOAATCCC AG GAGTTO AT 180 

OCTGAAAAOO T0TTTATTG6 TCQCCACTTC ATIATTG(3AC TTTCCACAOT TACCTTTACT 240 

GTGGCTTTAT CCTTQTACOG AAATATAGCA AAQCTTGOAA AGGTCT C OCT CATCTCTACA 300 

(3(3TTTAACAA CTCTGATTCT TG6AATTQTA ATOOCAAGQa CAATTTCACT GGGTCCACAC 360 

ATACCAAAAA CAGAAGAGQC TTG6GTATTT GCAAASCCCA ATGCCATTGA AGOGGTCG(36 420 



922 
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PCTAJS02/36810 



GTTATGTCTT TTQCATTTAT TTGOOICCAT AACTCCTTCT TAGTTTACAG TTCTCTAQAA 480 

GAACCCACMS TAGCTAAGTQ tf lt-miCCli ' ATCCATATOT CCATCOTGAT TTCrGTATTT 540 

ATCTOTATAT TCTTTGCTAC ATCT5GATAC TTGACATTTA CTGOCTTCAC CCAAGGGGAC 600 

TTATTTGAAA ATTACTGCAQ AAATGATGAC CTGQTAACAT TTGGAAQATT TTGTTATGGT 660 

5 GTCACTGTCA TTTTQACATA CCCTATGGAA TGCTTTGTQA CAAQAOAGGT AATTGCCAAT 720 

G T OTlTX ' n ' G GTGGQAATCT TTCATGGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780 

ACTGTAGGCA CGCTTOTOTC A TTGCT G ATT GATTG C CTOG GGATAOTTCT AGAACTCAAT 840 

GGTQTGCTCT GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTOTTA TCTGAAACTG 900 

TCTOAAGAAC CAAQGACACA CTCCGATAAG ATTATGTCTT GTGTCATGCT TCCCATTGGT 960 

10 OCTGTG G TGA TGGTTTTTGQ ATTOGTCATG GCTATTACAA ATACTCAAGA CTGCACCCAT 1020 

GGGCAOGAAA TOTTCTACTG CTTTCCTGAC AATTTCTCTC TCAC AAATAC CTCAGAGTCT lOBO 
CAT6TTCA6C AGACAACACA ACTTTCTACT TTAAATATTA OTATCTTTGA ACTCGAGTAA 



15 A194 Protein sequence: 
Gene name: 
unigene nxmiberi 
Frobeset Accession ft: 
Protein Accession tft 

20 Signal sequence i 

Transmenibrane domains: 
Cellular iiocalizationi 



BST8 

H8. 2 93 165 
N62096 
none £ound 
none found 

24-40, 70-86, 95-111. 171-166, 253-269, 
plasma metnbrane 



276-292, 314-330 



25 1 11 21 31 41 51 

I I I I I I 

MGYQRQEPVI PPQVNXTFGF POYLIiLSVIiQ FLYFPZAMI8 yNIIAGDTLS KVFC^XPGVD 60 
PENVPIGRHF IIGLSTVTFT LPI*SLYRN1A KLGKVSLIST QLTTLILGIV HARAISLGPH 120 
IPKTEDAWVF AKPNAIQAVG VMSFAFICHH NSFLVYSSIiE EPTVAKWSRL IHMSIVISVP 180 
30 ICIPPATOGY LTPTGFTQGD LFEaiYCRKDD LVTPGRFCYG VTVILTYPMB O'VTRBVXAN 240 
VPFGGNLSSV FHZWTVMVI TVATLVSXjLI DCLGIVZtEUff GVXjCATPLIF IZPSACYLKL 300 
SEEPRTHSDX IMSCVMLPIG AWNVFGFVN AITNTQDCTH GQEMPYCFPD NPSItlNTSBS 360 
HVQQTTQLST LHISIFQIiB 

35 

A195 DMA SEQOBICBi 
Gene namei BSTa 
Unigene nuinben He. 293185 

Probeset Accession 8i N62096 
40 Kucleic Acid Accession #i N62096 

Coding sequence 1 1-1389 (underlined sequences correspond to start and stop codons) 



..1 11 21 31 41 51 

45 I I I I I I 

ATGGGCTAOC AOAOGCAGGA GCCZOTCATC C0G0CGCA6A GAGATTTAGA T GACAOftG AA 60 

ACCCTTGTTT CTGAACATGA GTATAAAGAG AAAACCTGTC A0TCT6CTGC TCTTTTTAAT 120 

GTTGTCAACT 06ATTATAGG ATCTGOTATA ATAGGATTGC CTTATTCAAT GAAGCAAGCT 180 

_^ GGGTTTCCrr TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACAGA CTTTTCCCTT 240 

50 G TTTT A TTGA TAAAAGGAQG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTGAAT 300 

AAAACTTTCG GCTTTCCAOG GT A TCTGCTC CrCfCTGTTC TTCAGTTTTT GTATCCITAT 360 

ATA6CAATGA TAAGTTACAA TATAATAGCT GGAOATACTT TGAGCAAAOT TTTTCAAAQA 420 

ATCCCAGGAG TTGATCCTOA AAACGTGTTT ATTGGTCGCC ACTTCATTAT TGGACTTTCC 480 

ACAGTTACCT TTACTCTGCC TTTATCCTTG TACCGAAATA TAGCAAAGCT TGGAAAGGTC 540 

55 TOCCTCATCT CTACAGOTTT AACAACTCTO ATTCTTGGAA TTOTAATGGC AAGGGCAATT 600 

TCACTGGGTC CACACATACC AAAAACAGAA GACGCTTOQG TATTTGCAAA GCCCAATGCC 660 

ATTCAAGGGG T06GGGTTAT GTCTTTTGCA TTTATTTGCC AOCATAACTC CTTCTTAGTT 720 

TACAOTTCTC TAOAAOAACC CACAC3TA0CT AAGTX3GTCCC GCCTTATCCA TATGTCCATC 780 

GTGATTTCTQ TATTTATCTG TATATTCTTT GCTACATGTG QATACTTGAC ATTTACTOGC 840 

OO TTCACCCAAG GGGACTTATT TGAAAATTAC TQCA6AAATG ATGAGCTGGT AACATTTG6A 900 

AQ AT TTTGTT ATGGTOTCAC TGTCA I TTT G ACATAOOCTA TGQAMGCTT TGTGACAAGA 960 

6AGGTAATT0 CCAATGTGTT TTTTGOTGGO AATCTTTCAT CGGTTTTCCA CATTGTTGTA 1020 

ACAGTGATGG TCATCACTOT AGCCACGCTT GTGTCATTGC TGATTGATTQ CCTOGGGATA 1080 

^- GTTCTAQAAC TCAATGGTGT GCTCTGTQCA ACTCOCCTCA TTTTTATCAT TCCATCAGCC 1140 

05 TGTTATCTGA AACT6TCTGA AGAACCAAGG ACAGACTCOG ATAAGATTAT GTCTTGTOTC 1200 

ATGCTTCCCA rrGGTGCTGT GGIGATGGTT TTTOGATTCQ TCATGGCTAT TACAAATACT 1260 

CAAGACTGCA CCCATGGGCA OGAAATOTTC TACT6CTTTC CTGACAATTT CTCTCTCACA 1320 

AATACCTCAG AGTCrCATGT TCAGCAOACA ACACAACTTT CTACTTTAAA TATTAGTATC 1380 
TTTGAATQA 



70 



80 



A196 Protein sequence i 

Gene namei BSTe 



Unigene nximber: Hs. 2 93 IBS 

75 Frobeset Accession ft: N62096 



Protein Accession fit none found 

Signal sequence! none found 

Transraenibrane doroalnsi 37-53, 66-82, 109-125, 155-172, 180-196, 255-271, 338-354, 361-377, 399-415 

Cellular Localization: plasma membrane 

1 11 21 31 41 51 

I I I I I I 

MOyQRQBPVZ PFQRDLODHE TLVSEHBYKB KTOQSAALFN WNSIZGSOI ZGLPYSHICQA 60 

923 
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PCT/US02/36810 



GFPLGILLLP WVSYVTDPSl. VLLIKGGALS GTDTYQSLVN KTFGPPOYLL LSVUJFLYPP 120 

lAMISYNIIA GDTLSKVPQR IPGVDPENVF IGRHPIIGLS TVTFTLPLSL YRNIAKLGKV 180 

SLISTGLTTL ILGIVMARAI SL6PHIPKTB DAHVPAKFMA IQAVGVMSPA PIC3IHNSFLV 240 

YSSIiBBPTVA RHSRLIKMSI VISVPICIPF ATOmiTFTG PTQCDLFOnr CRNDDLVTFQ 300 

RFCyCVTVIL TYFMECFVTR BVIANVPPGG NXiSSVPHIW TWVITVATL VSLLIDCLGI 360 

VLELNGVXiCA TPLZPZIPSA CYLKLSBBPR THSDXIMSCV MLPIGAWKV PGFVMAX71IT 420 
QOCTHQQEMP YCPPDNPSLT HTSBSBVQQT TOLSTLNISX PQ 



A197 DNA SEQUEHCE 
Gene name: 
Unigene number: 
Probeset Accession #t 
Nucleic Acid Accession #s 
Coding sequence: 



ESTs 

HS. 31608 

B18836 

NM_017636 

1-3501 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I 1 1 I ) 

ATGG AGGATG CCTTCG6GGC AGCOQTOGTO AC0GT6TGG6 ACAGGQAT6C ACACACCAOG 60 

GAGAAGCCCA CCGATGCCTA CX»3AGAGCTO CIACTTCACGG GG6COGGCXX3 CAAGCACAGC 120 

AATTTOCTCC GGCTCTCTGA COCVU^CQGMT CCAOCTGCAO TTTATAGTCT GOTCACAOGC 180 

ACATOGGOCT TGGGT6CCCC 6AACCTGGTQ GTGTCAGTGC TGGGGGGATC G6GGGGCCCC 240 

GTCCTCCAGA CCTGGCTGCA GGACCTGCTG OOTCOTGGGC TGGTGGCX3GC TGCCCAGAGC 300 

ACAGQAQCCT GGATTGTCAC TGGGGGTCTG CACACGGGCA TCGGC0GGC3^ TGTTGGTGTG 360 

GCTQTACGGG ACCATCAGAT GQCCAGCACT GGGGQCACCA AGGTGGTGGC CATGGGTOTG 420 

GCCCCCTGGG GTGTGGTCOG GAATAGAGAC ACCCTCATCA AC^2CAAGGG CTG6TTCCCT 480 

6CGAGGTACC GGT0606G6G TGACCCG6A0 QACG6GGTCC AGTTTCCCCT GGACTACAAC 540 

TACTCGGCCT TCTTCCTGGT GGACX^ACGGC ACACAOGGCT GCCTGGGGGO OGAGAACCGC 600 

TTCCGCTTGC GCCTGGAGTC CTACATCTCA CAQCAGAAGA OGGGCOTGGG AGGOACTGGA 660 

ATTGACATCC CTGTCCTGCT CCTCCTGATT 6ATGGTGATQ AGAAGATGTT QACGOGAATA 720 

GAGAACGCCA CCCAGGCTCA GCTOCOITGT CTCCT06T6Q CTG6CTCAGG GGGAGCTGOG 7 BO 

GACT6CCTGG GGGAGACCCT GGAAGACACT CT6GCCXX3UI GQAGTGG6GG AGCXAGGCAA 840 

GGOGAAGCCC GAGATCGAAT CAGGOGTTTC TTTCCCAAAG GGGACCTT6A GGTC3CTGC3Va 900 

GCCCAGGTGG AGAGGATTAT GACCCGGAAG QAGCTCCTGA CAGTCTATTC TTCTGAGOAT 960 

GGGTCTGAGG AATTCGAGAC CATAGTT7TG AAGGCCCTTO TGAAGGCCIO TGGGAGCTOS 1020 

GAG6GCTCAG CCTACCT6GA TGAGCTGOGT TTQGCTOTGG CrTGGAAGGO OGTGGACATT 1080 

GCCCAGAiOTG AACTCTTTCX3 GGG66ACATC CAATGGCGOT CCTTCCATCT 0GAA6CTTGC 1140 

CTCATGGA06 CCCT6CT6AA TGACCG6CCT GAQTTCGTGC GCTTGCTCAT TTCCCA06GC 1200 

ctcagcx:tgg Gccacrrccr gaccccxsatg cxjcctggccc aactctacao cgcgoosccc 12 6o 

TCGAACTCSQC TCATCOGCAA CCTTTT6GAC CAGGC6TCCC ACAG06CAGG CACCAAAGCC 1320 

CCAGCCCTAA AAGGGGOAGC TOOOGAGCTC GGGGCXXXTTG ACGTGGGGCA TQTQCTGAOG 1380 

ATGCTGCTGG GGAAQATQTO CSQCQCOQAGG TAGCCCTC08 GGGOGGCCTG G6ACCCTCAC 1440 

CCAGGCCAGG GCTTCX3GGGA GAGCATGTAT CTOCTCTOGG ACAAGGCCAC CTOSCXXSCTC 1500 

TGGCTGGATG CTGGCCTOGG 6CAGGCCCCC TGGAOCGACC TGCTTCTTTO GGCACTGTTG 1560 

CTGAACAGGG CACAGATGGC CATGTACTTC TGGGAOATGG GTTCCAAT6C AOTTTCCTCA 1620 

GCTCTTGGGG CCTGTTTGCT GCTCOSGGTG ATGGC3U36GC TGGAGCCTGA OQCTGAOGAO 1680 

GCAGCAOQQA GGAAAGACCT GGCX3TTCAA0 TTTQAGGGQA TGGGCGTTGA CCTCTTTGGC 1740 

GAGPTGCTATC GCAGCAGTGA GGTGAGGGCT GCCCGCCTCC TCCTCOOTCG CTGCCCGCTC 1800 

TGGGGGOATO CCACTTOCCT CCAGCTGQCC ATGCAAGCrG ACGCCOGTGC CTTCrPTGCC 1860 

CAOGATGGGG TACAOTCTCT GCTOACACAa AAGTGGTGGG 6AGA7ATGGC CAGCACTACA 1920 

CCCATCTGGa CCCTGGrTCT OQCCTTCTTT TGCCCSCCkC TCATCTACAC COGCCTCATC 1980 

AGCTTCAGOA AATCA6AAGA GGAGCCCACA 06GGAG6A6C TA6AGTTTGA CATGGATAGT 2040 

GTCATTAATG GGOAAGGGCC TGTCGG6AC6 GOGGACCCAG COGAiGAAGAC GCCGCTGGGO 2100 

GTCCCX3CGCC AOTCGGGCCG TCXX3GGTTGC TGCGQGGGCC GCTGCXSGGGG GOGCCGGTGC 2160 

CTAGGCCGCT GGTTCCACTT CTGGGG06C0 006GTGACCA TCTTCAT3GG CAAOGTGGTC 2220 

A6CTACCTGC TQTTOCTGCT GCTTTTCT09 OOQOTGCTGC TOGTOGATTT CCAGCCOQOQ 2260 

CG6CC0GGCT CCCTGQAGCT 6CTGCTCTAT TTCTG6GCTT TCAOGCrQCT GTG06A6GAA 2340 

CTGGGCCAGO GCCTQAGOGG AQGCX3GGGGC AGGCTG6CCA GOGGGGGCCC OGGGCCTGGC 2400 

CATGCCTCAC TGAGCCAGOQ CCTGCGCCTC TACCTOGC C G ACAGCTGGAA CCAGTGCX3AC 2460 

CT AQTGG CTC TCACCTGCTT GCTCCTGGGC GTG Q G C T G CC GOCTOACCCC GG0TTT6TAC 2520 

C3VGCI6G60C GCACTGTGCT CTGCATGGAC TTCATGGTTT TCA0GGT60G GCTGCTTCAC 2580 

ATCTTCACG6 TCAACAAACA GCTOGGGCCC AAGATOGTCA TOGTQAGCAA GATGAT8AAO 2640 

0A0ST6TTCT TCTTCCTCTT CTTCCTOGGC GTGTGGCTGO TAGCCTATOO OGTGGCCACG 2700 

GAGGGGCTCC TGAGGCCAOG GGACAGTGAC TTCX:CAAGTA TCCTGCGCCG CGTCTTCTAC 2760 

OQTCCCTACC TGCAGATCTT CGG6CAGATT CCCCAaOAQO ACATG6A06T GGCX:CTCATQ 2820 

GAGCACAGCA ACTGCT06TC GGAGC006GC TTCTGG6CAC ACCCTCCTGG GGCOCAGGOQ 2880 

GGCAOCT6C3G TCTCCCAGTA TGCCAACTGG CTGGTGGTGC TGCTCCTOGT CAT CW t X T U 2940 

CTCGTO6CCA ACATCCTGCT GGTCAACTTG CTCATTGCCA TOTTCAOTTA QVCATTCGGC 3000 

AAAGTACAGG GCAACA6C6A TCTCTACTGG AAGG0GCA6C QTTACXXSCCT CATCCGGGAA 3060 

TTCCACTCTC G6CCO0OGCT GGCCCOGCGC TTTATOGTCA TCTCCCACTT GOGCCTCCTO 3120 

CTCAGGCAAT TGTGCA660G A0CCG6GA0C GCOCAfiCOGT CCTCOCGQGC CCTG6A6CAT 3180 

TTC086GTTT AOCTTTCTAA G6AAGC08AG 0QGAA6CT6C TAAGGTGQOA AT0GGT6CAT 3240 

AAGGAGAACT TTCTGCT6GC ACGCGCTAGG GACAAGCGGO AGAGCXSACTC CGAGCX3TCTG 3300 

AAGCGCAGGT CCCAGAAGGT GQACTTGGCA CTGAAACAGC 7GGGACACAT COGOQAGTAC 3360 

6AACA0CGCC TGAAAOTGCT GGA6CGGGA6 GTGCAOCAOT GTA6CCX30GT CCTGGOGTGO 3420 

GTGQOCX3AG6 CCCTGA6CCX3 CTCTGCCTTG CTGCOOO C AG GTGGGCXX3CC AC3CCCCTGAC 3480 
CTGCCTGGGT CXJUWGACTQ A 



A19B Protein sequence: 

Gene namei ESTs 

unigene number: Hs. 31608 

Probeset Accession 8t Hie836 

Protein Accession 8t none Cound 



924 



wo 03/042661 



PCT/US02/36810 



Signal sequence t none found 

Transinenibrane domains; 214-230, 537-SS6, 642-662, 730-752, 760-7B2, 81S-B37. B42-864, 877-899, 973- 

Cellular Localization! plasma membrane 



1 11 21 31 41 51 

i I I I t ) 

HBDIIFGAAVV TVWDSDAHTT EKPTDAYGEIi DFTQAGIUCHS NFLRLSDRTD PAAVYSLVTR 60 

THGFRAFNLV VSVLGGSGGP VLQTWLQDLI. RR6LVRAAQS TQAHIVTGGXi KTOIGSHVGV 120 

AVRDRQMAST GGTKWANGV APWGWRina) TLINPRGSFP ARYRHRGDPB OGVQFPLOVN 180 

YSAFFLVZ3D0 THGCLGGErTR FRLRLESYIS QQKT6VG6T0 XDIPVLLLLI D6DEKMLTRI 240 

ENATQA.QLPC LLVAGSGGAA DCLAETLEDT LAPGSGGARQ GEAHDRIRRF FPKGDLEVLQ 300 

AQVBRIMTRR EXjLTVYSSED GSBBPBTIVL KALVKAGGSS EASAYLDELR UWAVINRVDI 360 

AQSBLPRGDI QHR8FHI£AS LMDALLNDRP EFVRLLISHO LSLGHFLTPM RIAQLYSAAP 420 

SNSLIRNLLD QASHSAGTKA PALKGGAAEL RPPDVOHVLR MLLGXMCAPR YPSQGAHDPH 480 

PGQGFGBSMY LLSDKATSPL SIiDAOLGQAP WSDLLLWaL LNRAQMAMYF WBMGSNAVSS 540 

ALGACLLLRV MARLEPDAEB AARRXDLAFK FEGMGVDLFG ECYRSSEVRA ARLItLRRCPL 600 

WODATCLQIA MQADARAFPA QDGVQSLLTQ KWWGDMASTT PIWALVLAFF CPPLIYTRLI 660 

TFRKSEEEPT REEIiEFDMDS VINOEaPVGT ADPAEKTPLG VPRQSGRPGC OGGROGGRRC 720 

LRRHFHFWOA PVTIFMGNW SYLLFLLLF8 RVLLVDFQPA PPGSLELLLY FWAFTI«LCBE 780 

LRQGLSGGGO SLASGGPGPQ HASLSQRLRL YLADSHKQCD LVALTCFLLG VGCRLTPGIiY B40 

HLGRTVLCIO FMVFTVRLLH XFTVNKQLOP KIVIVSKMMK DVFFPLFFLQ VWLVAYGVAT 900 

BGLIiRPRDSD FPSILRRVFY RPYLQIFGQI PQBDMDVALM EKSNCSSEPO FWAHPPGAQA 960 

GTCVSQYANW LWLLLVIFL LVANILLVNL LIAMFSYTPG KVQGNSDLYW KAQRYRLIRB 1020 

FHSRPALAFP FXVISHIiRLL LRQLCRRPRS PQPSSPALEK FRVYLSKBAB RKLLTWESVH 1080 

KENFLLARAR DKRBSDSBRXi XRT8QKVDIA LXQL6BXREY EQRUCVLBRE VQQCSRVIiOH 1140 
VABALSRSAIi bPPGGPFPFD liPOSXD 

A199 DNA SBQUEHCE 

Gene name: ESTs 
Unigene number i He. 40808 

Probeset Accession #: AA054237 
Nucleic Acid Accession «: AA054237 

Coding sequence: 1-894 {underlined aequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I i I 

ATGGAGCCGC GGGCGCTOGT CAOGGOSCTC AGCCTCX3GCC TCAGCCTGTG CTCCCTGGGG 60 

CTGCTOGTCA CX3GCCATCTT CACCGACCAC TGGTACGAGA CCGACCCCCG GOGCCACAAG 120 

GAGAGCTGCXS AGCGCA6CGG CGCGGGCGCC GACCCXCOGG ACCAGAA6AA COGCCTGATG 180 

OOQCTGTGGC ACCTGCGGCT GOOGOACTOG OGCCCGCTQO GGCX3C08GCT GCTCCOGGGC 240 

GGCGOGGGGC GCGCGOACCC CGAGTCCIGG C6CT0GCTCC TGGGGCTCGG CX3GGCTGGAC 300 

GCCGA6TGCG GCOGQCCCXrr CTTCGCCACC TACTCGGGCC TCTGGAGGAA GTGCTACTTC 360 

CTQQGCATCQ ACCGGGACAT CQACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCAOG 420 

6CX31TCAAGT ACCACTTTTC TCAGCCCATC CX3CTTGCGAA ACATTCCTTT TAATTTAACC 480 

AAOACCATAC AOCAAjOATQA OTGOCACCTG CTTCATTTAA GMGAATCAC TQ CTGGCTTC 540 

CTCX3GCATGG CK3GTAGCCGT GCTTCTCTGC GGCTGQkTTG TGGCCACAQT CAGTTTCTTC 600 

TGGGAGGAGA GCTTGACCCA GCAOGTGGCT GGACTCCTGT TCXTTCATQAC AGGOATATTT 660 

TGCACX:ATTT CCCTCTGTAC TTATGCCGCC AGTATCTCGT ATGATTTGAA CCGGCTCCCA 720 

AAGCTAATrr ATAGCCTGCC TGCTGATGTG GAACATGGTT ACAGCTGGTC CATCT TTTGC 780 

GGCTGQTGCA GTTTAGGCTT TATTGTGGCA OCTGGA6GTC TCTGCATCX3C TTATCCGTTT 840 
ATTA60CGGA CCAAGATTGC ACAGCTAAAG TCTGGCAGAG ACTCCAGGGT ATGA 



A200 Protein sequence: 
Gene namet 
unigene nuntoen 
Probeset Accession #x 
Protein Accession #: 
Signal sequence t 
Tran6meRft>rane domains i 
Cellular Localization: 



ESTs 
H8. 40808 
AA054237 
none found 
1-18 

179-201, 209-231, 257-279 
plasma nsnibrane 



1 U 21 31 41 51 

I I I I I I 

MEPRALVTAL SL6LSLCSLG LLVTAIFTDH WYETDFRRHK ESCERSHAQA DPPDQKNRLN 60 

PLSRLPLRDS PFLGRRLLFG 6P6RADPBSW RSLLGLGOLD AB06RFLFAT YSGLHRKCYF 120 

LGIDRDIDTL ILKQIAQRCT AIKYHFSQPI RLHNIPFNLT KTIQQDEWHL LHLRRITAQP 180 

LGMAVAVLLC GCIVATVSPF WEESLTQHVA GLLFIMIGIF CTISLCTYAA SISYDLNRLP 240 
KLIYSLPADV EHGYSW6IFC AWC8LGFXVA AGGLCIAYPF X6RTRXAQLK SGRDSTV 



A201 DMA SEQUENCE 
Gene name: CDA14 
Unigene number: Rs. 26813 

Probeset Accession «t N32912 
Nucleic Acid Accession «: NM_016570 

Coding sequence t 1- 1134 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGAGGCGAC T6AATCGQAA AAAAACTTXA AGTTTGQTAA AAOAGTTGGA TGCCTTTC06 60 
AAGGTTCCT6 A6A6CTATGT AGAQACTTCA 6CCAGTG6AG GTACAGTTTC TCXAATAGCA 120 

925 



wo 03/042661 



PCT/US02/36810 



TTTACAACTA TGGCTTTATT AACCATAAT6 GAATTCTCAG TATATCAAGA TACATGGATG 180 

AAGTATGAAT ACGAAGTAGA CAAQGATTTT TCTAGCAAAT TAAGAATTAA TATAGATATT 240 

ACTGTT6CCA TGAAGTGTCA ATATGTTGGA 60G6ATGTAT TGGATTTA6C AGAAACAAT6 300 

GTTGCATCra CAGATGGTTT A6TTTATGAA CCAACAGTAT TTGATCTTTC AOCACAGCAG 360 

AAAOAGTGGC AGAGGATGCT GCAGCTGATT CAGAC1TAG6C TACAAGAA6A GCATTCACTT 420 

CAAQATQTGA TATTTAAAAQ TGCTTTTAAA AGTACATCRA CAGCTCTTCC ACCRAQAOAA 460 

GATCATTCAT CACAGTCTCC AAATGCATGC AGAATTCATG 6CCATCTATA TGTCAATAAA 540 

GTAGCAGGQA ATTTTCACAT AACAQTGGGC AAGGCAATTC CACATCCTOO TGGICATGCA 600 

CATTTQGCAG CACTTOTCAA CCATOAATCT TACAATTTTT CTCATAGAAT AGATCATTTG 660 

TCTTTTGGAG AGCTTQTTCC AGCAATTATT AATCCTITAG ATGGAACTGA AAAAATTGCT 720 

ATAGATCACA ACCAGATGTT CCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 780 

TATAAAATAT CAOCAGACAC CCATCAGTTT TCTGTGACAQ AAAGGGAAOO TATCATTAAC 840 

CATGCTGCAG GCAGCCATGG AGTCTCTGGO ATATTTATGA AATATOATCT CAlOTTCTCTT 900 

ATGOTOACAG TTAC TGAOG A 6CACATQOCA TTCIGG CAOT TTTTTOTAAG ACTCrOTOGT 960 

ATTGTT6GA6 GAATCTTTTC AACAACAOQC A10TTAGAT0 QAATTGOAAA ATTTATAGTT 1020 

OAAATAATTT GCTGTaSTTT CAGACTTGGA TCCTATAAAC CTGTCAATTC TOUCCTm 1080 
GA06ATGGCC ACACAGACAA CXrACTTACCT CTTTTA6AAA ATAATACACA TTGA 



A202 Protein sequence; 

Gene namex CDA14 

Unigene number: Es. 26813 

Probeset Accession fit H32912 

Protein Accession fti NP_0S7654 

Signal sequence t none found 

Transmenddrane domains t none found 

Cellular Localisationt nuclear 



1 11 21 31 41 51 

I I I I I I 

MRRLNRKKTL SLVKELDAPP KVPESYVET8 ASGGTVSLIA FTTMALLTIM EPSVYQDTKM €0 

KYEYEVDKDP SSKLRINIDX TVAMKOQYVO ADVLDLAETM VASADGltVYB PTVFDLSPQQ 120 

KEHQRMLQLI QSRLQEEB8L QDVZFKSAFK ST8TALPPRB DDSSQSPNAC RIHQHLYVNK 180 

VAGNFHITV6 KAIPHPRQHA HLAALVNHES YNFSHRIDHL SF6BLVPAII NPLD6TEKXA 240 

IDHNQMPQYF ITWPTKLBT YKISADTHQF SVTEItERXIN HAAOSHOVSG XFMKYDI*SSL 300 

MVTVTEEKMP FWQFFVRLC6 XVGGIFSTTG MLHGI6KPXV EIICCRFRLO SYKFVNSVPF 360 
EDGHTDNHLP LLENKTH 



A203 DMA SEQUENCE 

Gene namet dipeptidylpeptidase IV (CD26, adenosine deaminase complexing procein 3) 

Unigene nuniber: Hs.44926 
Probeset Accession #: 879876 
Nucleic Acid Accession #i NM_00193S.l 

Coding sequence I 76-2301 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 t I 1 I i 

CGCGCSTCTC C6COGCC0GC GTGACTTCTG CCTGCX;CTCC TTCTCTGAAC GCTCACTTCC 60 

GAGGAGAOGC COACGATGAA GACACCQTGG AAGATTCTTC TGGGACTGCT GQGTGCTGCT 120 

GCGCTTGTCA CCATCATCAC CGTGCCCGTG GTTCTGCTGA ACAAAOGCAC AOATGATQCT 180 

ACAOCIOACA OTOGCAAAAC TTACACTCTA ACTGATTACT TAAAAAATAC TTATAOACTG 240 

AAGTTATACT CCTTAAGATG 6ATTTCA6AT CATGAATATC TCTACAAACA AOAAAATAAT 300 

ATCTTGGTAT TCAATGCTGA ATATGGAAAC AGCTCAGTTT TCTTGGAGAA CAGTACATTT 360 

GATGAGTTTG GACATTCTAT CAATOATTAT TCAATATCTC CTQATGGGCA GTTTATTCTC 420 

TTAGAATACA ACTAOSTGAA GCAATGGAGG CATTCCTACA CAGCTTCATA TGACATTTAT 480 

GATTTAAATA AAAQOCAGCT GATTACAGAA GAQAGGATTC CAAACAACAC ACAGTGGGTC 540 

ACATGGTCAC CAGTGGGTCA TAAATTGGCA TATGTTTGQA ACAATQACAT TTATGTTAAA 600 

ATTGAACCAA ATTTACCAAG TTACAGAATC ACATGGA06G GGAAA6AAGA TATAATATAT 660 

AATGGAATAA CTGACTGGGT TTATGAAGAG GAAGTCTTCA GTGCCTACTC TGCTCTOTGG 720 

TaGTCrCCAA AOGGCACTTT TTTAGCATAT GCCCAATTTA AOGACACAGA AGTCCCACTT 780 

ATTOAATACT CCTTCTACTC TGATQAGTCA CTQCAOTACC CAAAGACTGT ACGGGTTCCA 840 

TATCCAAAGG CAGGAGCTGT GAATCCAACT GTAAAGTTCT TTGTTGTAAA TACAGACTCT 900 

CTCAGCTCAG TCACCAATGC AACTTCCATA CAAATCACTQ CTCCTGCTTC TATGTTGATA 960 

GGGGATCACT ACTTGTGTGA TOTGACATGG GCAACACAAG AAAGAATTTC TTTGCAGTOQ 1020 

CTCA06AGGA TTCAQAACTA TTOGOTCATG GATATTTGTG ACTAT6ATGA ATCCAGTGGA 1080 

AOATGGAACT 6CTTAGTG6C ACGGCAACAC ATTGAAATGA GTACTACTGQ CTGGGTTGGA 1140 

AGATTTAOOC CTTCAGAACC TCATTTTAOC CTTGATGGTA ATAGCTTCTA CAAGATCATC 1200 

AGCAATGAAG AAGGTTACAG ACACATTTGC TATTTCCAAA TA6ATAAAAA AOACTGCACA 1260 

TTTATTACAA AAGGCACCTG GGAAGTCATC GQGATAGAAG CTCTAACCAO TGATTATCTA 1320 

TACTACATTA GTAATGAATA TAAAGGAATQ CCAGQAGOAA GGAATCTTTA TAAAATCCAA 1380 

CTTATTGACT ATACAAAAGT GACATGCCTC AGTTGTGAGC TGAATCCGGA AAGGTGTCAG 1440 

TACTATTCTG TGTCATTCA6 TAAAGAGGCX3 AAGTATTATC AGCTtSAGATG TTCCGGTCCT ISOO 

GGTCTGCCCC TCTATACTCT ACACAGCAGC GTGAATGATA AAGGQCTOAa AGTCCTG6AA 1560 

GACAATTCAG CTTTGGATAA AATGCTGCAG AATGTCCAGA TQCCCTCCAA AAAACTG6AC 1620 

TTCATTATTT TGAATGAAAC AAAATOTTGG TATCA6ATGA TCTTOCCTCC TCATTTTGAT 1680 

AAATCCAAGA AATATCCTCT ACTATTAGAT GTGTATQCAG GCCCATGTAO TCAAAAAGCA 1740 

GACACTGTCT TCAGACTQAA CTGOGCCACT TACCTTGCAA GCACAGAAAA CATTATAGTA 1800 

GCTA6CTTTG ATGGCAQAGG AAGTGGTTAC CAAGGAGATA AGATCATOCA TQCAATCAAC 1860 

AGAAGACTGG GAACATTTQA AGTTGAAGAT CAAATTGAAG CAGCCAGACA ATTTTCAAAA 1920 

ATGGGATTTG TQGACAACAA A0GAATT6CA ATTTOGGGCT 0GTCATATG6 AGGOTAOGTA 1980 

ACCTCAATG6 TOCTGOGATC GGGAAGTGGC GTQTTCAAGT OTGGAATAGC C6TGGC6CCT 2040 



926 
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GTATCCCCGT 
CCAGAA6ACA 
AAACAA0TT6 
TCAGCTGAGA 
ACTGATGAAG 
AGCCACTTCA 
AAGCTTATTA 
T6ATCTTTAA 
ATACCTATCA 
TTGAATTATC 
CAAATAGGAA 
TTTCTAACTQ 
AGTGATGTCA 
TG6CTGGGAA 
A6AAGA6CTG 
CAGGAAATCA 
AAAGAAATGT 
TGCTACAAAA 
TACTGATQTT 
CTGTTAAAA6 
TTTTTCTTAT 
TCATTTTAAA 
CTATGQAATT 



GG6A6TACTA 
ACCTTQACCA 
AGTACCTCCT 
TCTCCAAAGC 
ACCATGGAAT_ 
TAAAACAAT6 
AAACTCATTT 
AATACACACT 
TCTTAAOTAG 
GG6T0GGGTT 
TrGTTTTTAT 
GACTGQTTCA 
CTAGGGCAGO 
CCCAAOTCCA 
TTCACCAGQA 
AATATCQAAA 
AAGGGAAACT 
ACACAGCAAO 
CCTAGTGAAA 
ATGAAAATAT 
TTCATTTCTT 
AAATGOAACA 
TCTCCCAOTC 



TGACrCAGTG 
TTACAGAAAT 
TATTCATGGA 
CCTGGTGC3AT 
ACCTAGCAGC 
TTTCTCTTTA 
TTGTTTTCAT 
CAAATCAAGA 
GGACTTCTGT 
TTATTGTTTA 
GGAiCKSCTTTG 
AATGTTGTTC 
QACAGGATAA 
AGCATACCAA 
GACT6GCACA 
GCACT6ACTT 
GCCA6CAAGG 
GGTGATGGGA 
GAGGCAGCTT 
TTOTATCACA 
TGAGTGTCTT 
TAAAATACAA 
ATTTAATAAA 



TACACAGAAC 
TCAACAGTCA 
ACA6CAQATG 
GTTGCAOTGQ 
ACA6CACA0C 
CCTTAGCAGC 
TATCTCAAAA 
AACTTAAOOT 
CTTCACAACA 
AAATCATTTC 
CATAGATTCC 
TCTTCTTTAA 
GAGGGATTAG 
CAO6A0GAGG 
GTTTTCTGAG 
CTAAGTAAAC 
CAGCCCCCAG 
AAOCATTGTA 
GAAACTGA6A 
AATCTTAACT 
AATTAAAAGA 
TGTTATQTAT 
TGTGCCTTCA 



GTTACATGGG 
TGAGCAGAGC 
ATAACGTTCA 
ATTTCCAGGC 
AACATATATA 
TCAAAATACC 
CTGCACTGTC 
TACCrTTGTT 
OATTATTAGC 
TOCATCAOCT 
CTGAGCAGGA 
AGGGATG6CA 
GGA6AGAAGA 
CTACTGTCAO 
AAAOACTATT 
CACAGCAGTT 
GTGCCAGTTA 
AAT6TGCTTT 
TGTQAACACA 
TQAAGGAOTC 
ATATTTTAAC 
TATTATTCCC 
TTTTTTC 



TCTCOCAACT 
TGAAAATTTT 
CTTTCAGCAO 
AATGTG6TAT 
TACCCACATO 
ATGCCATTTA 
AAGATQATGA 
CCCAAATTTC 
TTACA6AAGT 
GCTGA AACAA 
TTTTAATCTT 
AGATGTGGGC 
TAGCAGGGCA 
CTGOCXnCGO 
CAAACAGTCT 
GAAAGACTCC 
TGGCTATAGG 
TAAAAAAAAA 
TCA6CTTGCC 
CTT6CATCAA 
TTCCTTGGAC 
ATTCTACATA 



3100 
2160 
2320 
2380 
3340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2020 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 



A204 Protein sequence; 
Gene namei 
Unlgene number: 
Probeset Accession l^t 
Protein Accession ft: 
Signal sequence: 
Transmembrane domains; 
DPPIV_N_term domain: 
Peptida8e_S9 domain i 
Cellular I«ocalization: 



dipeptidylpeptidase ZV (CD26, adenoeine deaminase conplexing protein 3) 

He. 44926 

879876 

IIP_001926.1 

none found 

6>28 

43-557 

558-635 

plasma membrane 



1 

I 

MKTPWKILLG 
RWISDHEYLY 
VKQWRHSYTA 
PSYRITWTGK 
YSDESLQYPK 
CDVTWATQER 
EPHFTLDGM3 
EYKGMPGQRN 
TLUSSVMDK6 
PLLLDVYAGP 
FBVEDQZEAA 
YVDSVYTERY 
KALVDVGVDF 



11 
I 

LLGAAALVTI 
KQENMILVFN 
SYDIYDLNKR 
EDIIYNGITD 
TVRVPYPKAG 
I8LQWLRRIQ 
FYKIISNEBG 
LYKIQLIDYT 
LRVLEDNSAL 
CSQKADTVFR 
RQFSKKGFVD 
MQLPTPEDNla 
QAMHYTDEDH 



21 
1 

ITVPWL12IK 
AEYGNSSVFL 
QIiITEERIFN 
WVYBBBVFSA 
AVNPTVKFPV 
HYSVMDICDY 
YRHICYFQID 
KVTCLSCEUI 
DXMLQNVQNP 
LNWATYLAST 
NKRIAIHGHS 



GXA8GTAHQH 



31 
I 

GTDDATADSR 
ENSTFDEFQH 
NTQWVTMSPV 
YSALHHSFNG 
VNTDSLSSVT 
DESSGRWNCL 
KKDCTFITKG 
PERCQYYSVS 
SKKLDPIILN 
E37IIVA8FDG 
YGGYVTSHVL 
RAEHPRQVEY 

iyhimshfik 



41 

1 

KTYTLTDYUC 
SIMDYSISPD 
GHKIiAYVWm 
TFXiAYAQFND 
NATSIQITAP 
VARQHXEMST 
TWEVIGIBAL 
FSKEAKYYQL 
BTKFWYQMIL 
RGSGYQGDKI 
GSGSGVFKOG 
LLIHGTAIXDN 
QCFSLP 



51 
I 

NTYRLKLYSL 
GQPILLEYNY 
DXYVKIEFNL 
TEVPWBYSP 
ASMIjIGDHYL 
TGKVGRFRPS 
TSDYLYYISN 
RC8GPGLPLY 
PPHFDKSKKY 
MHAINRRLGT 
XAVAFVSRWE 
VHFQQSAQI8 



60 
120 
IBO 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 



A205 DMA SBQOEWCB 
Gene names 
Unlgene number: 
Probeset Accession #t 
Nucleic Acid Accession #: 
Coding sequence! 



predicted exon 
none found 
none £ound 
none found 

l'-361 (underlined seqaencee correspond to start and stop codons) 



1. 11 21 3X 41 51 

I I I I I I 

ATGGCTCTGG C6AAGGTGAG GQAGCCAAAC GCAAATQACA ATGCCATCA6 AGTTGACAAC 60 
AGAAGT6TGA TTAAAOTGCXS TGCTAACCAG TGTTCCCTGC ATGAGGCRGA AAGTGAATCC 120 
AGAAAOCCTC A6QAQCTCTG GATGGGGCTQ CTOCTCTTGA TGGGGGTCCT AGAAGCATGT 180 
GTQGAAATGA QGOCTCTOTC AQTCTGGTCC CTOAGAaATQ ACAAGGAGCA GAGCTCCCAC 240 
CAGCCXaCAC TGGATGTCTA A 



A206 Protein sequence i 
Gene namet 

Unlgene number: 
Probeset Accession # i 
Protein Accession #t 
Signal sequence: 
Transfnenibrane domains i 
Cellular Localization! 



predicted exon 

none found 
none f oxuid 
none fotind 
none found 
46-68 

not determined 



1 11 21 

i I I 

MALAKVRBRI ANONAIRVDN RSVIKVRANQ 
VBMRPLSVH8 LRDDXBQSPH QPTLDV 



31 
I 



41 51 

I I 
R2IPQSLHM6L LLLK6VLEAC 



927 
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A207 DNA SEQOEMCE 

Gene name: BSTs 

Ukilgene nmnber: Hs. 222886 

Probeeet Accession #: AI672225 

Nucleic Acid Accession 8i none found 

Coding sequence i 1>462 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

11)111 

ATGCCAAATQ CTGAGTTAGA AGCAAAGAGC CTTGGAAGCA GTAAATGTTT AAAAACTQCT 60 

CTCATACTT6 CTGTATGTT6 TGOATCAGCA AATATAGTCA GCCCTCTACT TGAGCAAAAT 120 

ATTGATGTAT CTTCTCAAGA TCTGQACACA OGOOCAGAGA 6TATGCT8TT TCTAGTCATC 180 

ATCATGTOGA CCAGTTTTGT GGAAQACAAT CTTTCGAT66 GCTGOGOQAA GCTA6AAGAT 240 

TTTATCGCTA TTGAAGAAOA AATOAAGAAG CACGQAAGTA CTCATOTGGG ATTCCCAGAA 300 

AAOCTGACTA ATGGT6CG6C TGCTGGC3yVT GGTOATQATG GATTAATTCC TCCAAGGAAQ 360 

AGCAGAACAC CTGAAAGCCA GCAATTTCCT GACACTGAGA AT6AAGAGTA TCACAG6TTT 420 
GTCAAAGATC AGATAGTTGrT AGATATGGGG CGTTATTTCT GA 



A20B Protein sequence: 
Gene nacoe; 
Unigene number: 
Prcibeset Accession fit 
Protein Accession fi: 
Signal sequence i 
Transmembrane domains i 
Cellular Localisation: 



ESts 

H8.222B86 
AX672325 
none found 

none found 
16-38 

not determined 



1 11 21 31 41 51 

I I 1 I I I 

MPNAELEAKS LGSSKOiKTA LILAVGOGSA NIVSPLLEQN IDVSSQDLDR RPESMLFLVI 60 
IMWTSFVEDN I.SMGHGXLED FMAIEEENKK HGSTHVGFPB MLTNGAAAGN GDDGLIPFRK 120 
6RTPESQQFP DTEMEEYHRF VKDQIWDNR RYF 

A209 DWA 8BQUEWCB 

Gene name: odz (odd Oz/ten-m, Drosophila) homoXog 1 

Unigene nunnber: Hs.23796 

Probeset Accession fi: NM_0142S3 

Nucleic Acid Accession #t ~ NM_014253 

Coding sequence: 65-8242 (underlined sequences correspond to start and stop codona) 



1 11 21 31 41 51 

I 1 I I I I ■ 

GACTGCTTGC ATTAAAGGAC TTCCTCATCC TTTTTTTCAT GAAACTGAGC TTGCTTAATC 60 

AGAGATGGAG CAAACTGACT GCAAACCCTA CCAGCCTCTA CCAAAAOTCA AGCATGAAAT 120 

GGATCTAGCT TACACCAGTT CTTCTQATGA GAGTGAAGAT GOAAGAAAAC CAAGACAGTC 180 

ATACAACTCC AGG6AGACCC TGCACGAGTA TAACCAQOAG CTGAGGATGA ATTACAATAQ 240 

CCAGAGTAGA AAGAGQAAAG AAGTAGAAAA ATCTACTCAA GAGATGGAAT TCTGTGAAAC 300 

CTCTCACACT CTGTGCTCTG GCTACCAAAC AGACATGCAC AGCGTTTCTC OGCATGGCTA 360 

CCAOCTAOAO ATQQQATCTQ ATGTGGACAC AGAGACAGAA GGTQCTGCCT CACCTOACCA 420 

TGCACTAAGA ATGTGQATAA GGGGAATOAA ATCAGAGCAT AQTTCCTGTT TGTCCAGOOQ 480 

GGCCAACTCT GCATTATCCT TQACTGACAC TGACCATGAA AGGAAGTCTG ATGGGGAAAA 540 

TQGTTTCAAA TTCTCTCCTG TTTGTTGTQA CATGGAGGCT CAAGCT6GGT CTACTCAAGA 600 

TOTGCAQAOC AOCCCACACA ACCAOTTCAC CTPCAGACCC CTCCCACCGC CACCTCCGCC 660 

TCCTCATGCC TGCACCTGTG CCAGGAAGCC ACCCCCTGCA GCGGACTCTC TTCAQAGGAG 720 

ATCAATGACT ACCCGCAGCC AGCCCAGCCC AGCTGCTCCA GCTCCCCCAA CCAGCAC6CA 780 

GGATTCAGTC CATCIGCATA ACAGCT6GGT CCTGAACA6C AACATACCAT TGGAGACCAQ B40 

OCATTCCCTG TTCAAACATG QATCTGGTTC CTCTGOGATC TTCAGTGCAG CCAGTCAOAA 900 

CTACCCTCTG ACATOCAATA COGTGTACTC GCCCCCTCCC AGGCCTCTTC CTCGAAGCAC 960 

CTTTTOCOGA CCTGCCTTTA CCTTTAACAA ACCTTACAGG T6CTGCAACT GGAAGTGCAC 1020 

AGCATTGAGC GCCACTGCAA TCACAGTQAC TTTGGCCTTO TTACTAQCCT ATGTQATTGC 1080 

AGTQCATTTG TTCGGCCTGA CTTGGCAGTT GCAACCAGTT GAAGGAGAGC TGTATGCAAA 1140 

TGGAGTTAGC AAAGGGAACA (SGGGQACCQA GTCCATGOAC ACTACTTACT CTCCAATTGO 1200 

AOGAAAAGTT TCTGATAAAT GAQAQAAAAA AGTGrrTCAG AAGG6A06GG G6ATAQACAC 1260 

TGGAGAAGTT GACATTGGTG CACAGGTCAT GCAGACCATT CCA0CTG6TT TATTCTGGOG 1320 

TTTCCAQATT ACTATCCACC ATCCAATATA TCTGAAGTTC AATATTTCTT TAGCCAAGQA 1380 

CTCTCTGCTG GGAATTTAT6 GCA6AAGAAA CATTCCACCT ACACATACTC AGTrTOATTT 1440 

TOTAAAACrA ATGQATOGCA AACAOCTQOT CAAGCA6GAC TCCAAGG6CT CIQATGATAC 1500 

ACAQCACTCC CCTCGGAAOC TGATCTTAAC TTOGCITCAG GAGACAOGTT TCATA6AGTA 1560 

TATGGATCAA GGACCTTGGT ATCTGGGGTT TTACAATQAT GQAAAAAAGA IGGAGCAAGT 1620 

ATTOOTGTTA ACTACAGCAA TTGAAATAAT GGATGACTQT TCAACCAATT GCAATGGAAA 1680 

TGGAGAGTGT ATCTCTGGCC ATTGTCATTG TTTCCCAGGA TTCCTTGGAC CTGACTGTGC 1740 

TAGAGATTCC TQCOCTGrOC TQTGTGGTGG GAAT06AGAA TACQAGAAAO QACACTOTGT 1800 

CT6CC6GCAT GGCIGGAAGO GQCCAGAGTO TGAC6TTCC6 GAAQAACAAT 6CATTGATCC 1860 

AACATGCTTT GGCCAOGGCA CCTGCATCAT 6GGAGTCTGC ATCTGTGTGC GAGGATACftA 1920 

AGQAGAAATA TGOSAGGAAO AGQACTGCCT AGACCCAATQ TGTTCCAACC ATGGCATCTG 1980 

TGTAAAAGGA GAATOTCACT GTTCTACIGG CTGGGGAGGA 0TTAACT6T6 AAACACCACT 2040 

TCCTGTATOT CAAGAGCAGT GCTCAGGACA G6GAACTTTT CTTCTGGAG6 CTG6AGTATG 2100 

CAGCTQTQAT CCCAAGT6GA CAGQATCTGA CIGCTCAACA GA6CTGTGTA CCATQQAOTG 2160 

IGGTAGCCAT GGASTCT6CT CAAGAGGAAT TTGCCAGTGT GAAGAAGGCT OOGTAGGACC 2220 

AACATOTGAG GAACGCTCCT GTCATTCTCA TTGTACTOAa CATGGCCAAT QGAAAGATGO 2280 
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AAAATGTGAG TGTAGCCCTG OMGGGAGGQ COAOCACTGC ACAATTCCTC ACTACTTAOA 2340 
TGCTQTCOGA G A TGGCTGCC CAGGGCTCTO CTTTGGAAAT GGAOQATOTA C0CTG6ATCA 2400 
AAATGOTTGG CACTGTGTGT GTCAGGTGGO TTGOAOrGGO ft CAGG CTGCA ATOTTGTCAT 2460 
GGAAATGCTT TGTGGAGATA ACTTGGACAA TQATGGAGAT G0TTTAACCX5 ACTGTGTGGA 2520 
5 TCCTGACTGT TGTCAACAAA GCAACTGTTA TATAAOTCCT CTCTGCCAGG GCTCA CCAGA 2580 
TCCTCTTGAC CTCATTCAGC AAAGOCAAAC TCTCTTCTCT CAGCACACTT CAAOACTTTT 2640 
TTATGATCGA ATCAAATTCC TCATTGGCJU^ GGAGA6TACT CATGTCATTC CTCCTGAG6T 2700 
GTCATTTGAC AGCAGGOGTG CCTOTCTOAT TCGAGGCCAA GTGGTGGCCA TAOATGGAAC 2760 
TCCTCTAGTQ GOAGTGAATG TCAGTTTCTT GCACCACAGT GATTATGGOT TTACCATCAG 2820 
10 CCGGCAAGAT GQAAGCTTTG AC C TOGTGGC CATaSGTGGC ATCTCTGTCA TCTT AATC TT 2B80 
OQACOGATCC C C TTTOCT G C CXGAGAAGAO AACACTCTGG TT6CCTTGGA ATCAOTTTAT 2940 
TGTGGTAGAO AAAGTCACCA T6CAGAGAGT TGTATCAGAC C06CCATCCT QOQATATCTC 3000 
CAACTTTATC AGCXCAAACC CTATTGTGCT TCCTTCACCG CTCACATCAT TTGGAGGGTC 3060 
CTQTCCAGAG AGGGGAACTA TTGTTCCTGA GCTGCAGGTT GTACAGQAGG AAATTCCCAT 3120 
15 TCCCTCXAGC TTTGTGAGGC TGAfiTTACCT GA6CA0CCGC ACCCXH^GGT ATAAAACCCT 3180 
GCTACGGATC CTTCTGACAC ATTCAAOGAT TCOOGIAGGC ATGATAAAAO TACACCTCAC 3240 
AGTAGCTGTG GAAGGGOGAC TCACACAGAA GTOGTTTCCC OCOGCAATTA ATCTTGTCTA 3300 
CACATTTGCT TGGAAC3VAQA CCGATATCTA TGGACAGAAG GTTTGGGGCC TGGCAQAGGC 3360 
TTTGGTATCT GTGGGATATG AATATGAAAC QTGCCCTGAC TTTATTCTCT GGGAGCAAAG 3420 
20 GACAGTOGTT TTACAAGGTT TTQAGATGQA TGCTTCTAAC CTAGGAGACT GGTCTTTOAA 3480 
TAA6CATCAC ATTTTGAATC CTCAAAGTGG AATCATACAT AAAOGGAATG GAGAAAATAT 3540 
GTTCATTTCC CAGCAGCCCC OWSTCATATC AACCATAATG GGTAA TGQAC ACCAAAGGAG 3600 
TGTAOCCTGC ACCAACTGCA ATGGCCC3W3C CCACAACAAC AAACTCTTTG CTCCTGTOGC 3660 
CTTAGCTTCT GGCCCTGATQ GCAOTGTOTA TGTTGGOGAC TTCAATTTTG TAAGGAGAAT 3720 
25 ATTTCCCTOQ GGAAACTCOQ TTAGTATTTT GQAATTAAGC ACAAGTCCTQ CTCACAAATA 3780 
CTATCTGGCT ATGGAOOCTQ TGTCTGAATC ACTCTATCTA TCA6ACA0CA ATACT OQCAA 3840 
AGTCTACAAO TTGAAATCTC TT0TG6AGAC GAAAGATCTO TCCAA6AATT TTGAAflTaOT 3900 
GGCAGQAACT GGTGATCAGT GCCTTCCCTT TGACCAQAGT CATTGTGGAG ATGGTGGGAG 3960 
AGCATCGGAA GCTTCACTQA ATAGCCCTCG AGGCATCACA GTTGATAGGC ATGGATTTAT 4020 
30 TTACTTTGTQ GATGGGACTA TGATTCGCAA AATTGATGAO AATGCTGTGA TCACAACTGT 4080 
AATOGGCTCa AATGGTCTGA CTTGCACACA ACCACTGAOC TGTGACTCAG OAATGQACAT 4140 
CACTCAGGTG OGATTAGAGT GGCCAACAGA CCTT6CA0TA AATCCTATGO ACAATTCATT 4200 
GTATGTCTTG GATAACAACA TTQTGCTGCA AATTTCTQAG AACAGGOGTG TTCGQATCAT 4260 
CQCAGGACGC CXCATTCACT GCCAGGTGCC AGGCATCGAT CATTTCCTGa TCAGCAAGGT 4320 
35 AGCAATTCAC TCCACTCTAG AGTCAGOGAO GGCCATCAGT GTCTCCCACA GCGGGCTGCT 4380 
CTTCATAGCT QAAACAGAOQ AGAGGAAA8T AAACGGCATT CAOCAAGTAA CCACCAAT6G 4440 
GGAGATCTAC ATCATOOCTG 6TGCCCCCAC TGACTOTQAC TGCAAAATTG ATCCAAACT6 4500 
TQACTGTTTT TCAGGTOATG GTGGCTATGC C3VAAGATGCA AAGATGAAAQ CCCCTTCCTC 4560 
CTTAGCAGTG TC6CCTGATO GAACCCTCTA TGTGGCAGAC CTCGQAAATG TTCGAATTCG 4620 
40 TACCATCAGC AGGAACCAAQ CCCACCTQAA TGACATGAAC ATTTATGAGA TTGCTTCACX: 4680 
OGCTGATCAG GAACTGTACX: AGTTCACTGT AAATOQAACC CAOCTACACA CCCTOAACTT 4740 
GATAACAAGG GACTATGTTT ATAACITCAC CTACAATTCT OAAGGTGACT TGG6C6CGAT 4800 
TACCAGCAGC AATGGCAATT CAGTGCACAT TOGCCGTGAT GCAGGOQGAA TGCOGCTATG 4860 
GCTTGTGGTG CCTGGCGGAC AAGTATACTG GCTGACTATA AGCAGCAATG GAGTCCTGAA 4920 
45 AAGAGTGTCA GCCCAAGGCT ATAATCOGGC CTTAATGACC TATCCAGGAA ACACAGGQCT 4980 
TCTGGCTACC AAAAGTAAOQ AAAATGGATG GACAACC3OTT TATGAGTATG AOCO OQAOG Q 5040 
ACACCTGACC AATGCAAOGT TTCCCACTGG AGA6GTCAGC AGCTTCCACA QTOACCTGOA 5100 
GAAGCTGACA AAAGTGQAGC TAGATACTTC CAACCGTQAA AATOTCCTCA TQTCAACCAA 5160 
CTTOACGGCA ACTAGTACCA TATATATTTT AAAACAAGAA AATACTCAAA GTACCTA'TCG 5220 
50 GGTGAATCCA OATGGTTCCC TGCGTGTCAC TTTTGCCAGC GGGATGGAGA TOGOCCTCAG 5280 
CTCAQAGCCC CACATCCTGG CAGGGGCAOT CAACCCTACC CTGGGCAAAT GCAACATCTC 5340 
ATTOCCOGGA GAGCACAATQ CAAACCTCAT CQAGTGGCGO CAGAOOAAGG AOCAAAACAA 5400 
AGGCAATGTT TOQGCTTTTG AAAGGAGGCT GAGGGCCCAC AACAGAAACC TACTCTCCAT 5460 
- AGATTTTGAT CATATAACCX: GCACAGGAAA GATCTATGAT GACCATCX3AA AATTCAOCCT 5520 

55 TOGAATTCTT TATGACCAGA CTGGGCX5ACC CATTCTOTGG TCTCCTGTAA GCAGAT ATAA 5580 
TGAAGTGAAC ATCACATATT CACCTTOGGG ATTGGTGA06 TTTATTCAAA GAGQAAOSTO 5640 
GAATQAAAAA ATGGAATATG ACCAGAGTGO GAAAATTATT TCAAOAACTT GGGCTGATGO S700 
GAAAATTTGQ AGCTATACCT ACTTAGAAAA ATCTGTGATG CTTCTCCTAC ACAGCCAOOG 5760 
GOJTTACATC TTTGAGTATG ACCAATCAGA TTGCCTGCTG TCAGTTACCA TGCCTAGCAT 5820 
60 GGTGCOCCAC AGCTTACAAA CCATGCTTTC AGTGGGCTAC TACCGTAATA TCTACACOCC 5880 
ACXX3GACAGT AGCACTTCTT TTATCCAAGA CTATAGTCGA GATGGCOGAT TGCTA CAGAC 5940 
CCTGCATCTQ GGGACAGGGC GCAGAGTCTT ATAC3UU3TAC ACCAAGCAAG CAAGGCTTTC 6000 
TQAGQTTCTC TAT6ATACCA CTCAGQTCAC ATTAACATAT GAAGAGTCTT CTGGAGTGAT 6060 
TAAOACAATA CACCTQATGC ATGAOGGATT CATCTGCACA ATCAGATACA GGCAAACAGG 6120 
65 ACCTCTTATT GGAOGCCAGA TTTTCAGATT CAGT6AAGAA GGCCTTGTGA ATGCAaJGTT 6180 
CGACTACAGC TACAACAATT TCOGAGTCAC AAGCATGCAA GCTGTAATCA AT GAAA CCCC 6240 
TTTOCCTATA GATCTTTACC QATATGTTGA TGTCTCTGGC A6AACAGAGC AGTTTOGAAA 6300 
ATTCAGTGTA ATTAATTAOS ATTTAAATCA GGTCATAACT ACTACAGTGA TGAAACACAC 6360 
CAAAATCTTC AQTGOCAATG OACAAGTCAT TGAAGTCCAA TATGAAATCC TAAAGQCAAT 6420 
70 TGCCTACTG6 ATGACCATTC AATATGATAA TGTGGGCOGA CATGGTAATA T6T6CATAAG 6480 
GGTAGGAGTA GATGCCAATA TAACAAGGTA CTTCTATGAA TAOQATGCTG ATGGGCAACT 6540 
TCAOACTGTT TCTGTAAATG ACAAAACCCA GTGGCGTTAT AGTTAOGATC TGAATQGAGA 6600 
CATCAACCTC TTAAGCCA3G G6AAGA0I6C TOGTCTTACT CXTCTCCGAT A TGAC CTCCG 6660 
AGACCGCATC ACXAGATTAG GAGAAATTCA OTATAAAATG GATGAAGATG GCTTTCTOAG 6720 
75 GCAGAGGGQA AATGATATTT TTGAATATAA TTCTAATGGC CTTGCTGCAGA AAGCCTACAA 6780 
TAAGGCTTCT GGCTGGACTG TQCAOTATTA CTATGATGGG crTGGGOGAC GTQTOGOQAO 6840 
TAAGTCCAGC CTAGGGCAGC ACCTTCAGTT CTTTOTCGAC GCX3AC03CGA ACCCCATAAG 6900 
AOTTACTCAT TTGTACAAOC ACACAAGCTC GGAOATTACA TCTCTGTATT ATGATCTCCA 6960 
AGGTCACCTT ATTGCCATG6 AGTTAAGCAG TGGTGAAGAA TATTATGTAO CCTOTGATAA 7020 
80 TACAOGTACC CCACTAGCTG TOTTCAGCAG CCGAGGTCAG GTCATAAAGG AOATAC TATA 7080 
CACACCTTAT GGCGATATCT ATCATGACAC TTACCCTQAC TTTCAOOTCA TAATTGGTTT 7140 
TCAT6GAGGA CTCTATGATT TCXTTTACTAA ATTAGTGCAC CTGGGGCAAA GGGA TTATGA 7200 
TOTTGITQCT G6CAGATGGA CAACGGCCTA TCATCACATA TGGAAACAGT TGAACCTCCT 7260 
TCCTAAACCA TTCAACC7CT ACTCCrTTGA AAATAACTAC OCAGTTGGCA AAATTCAAGA 7320 
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TGTTGCAAAO TATACCACAG ACATCAGAAQ TTGGTTGQAG CTATTTQGTT TCCAATTACA 7380 
CAAT6TACTA CCTGGATTTC CCRAACCTGA ATTAGAAAAT TTAGAATTAA CTTA CGAGCT 7440 
TCTAOGGCTT CAOACAAAAA CTCAAGAGTG GGATCCTGQA AAGACTATCC TGGGCATTCA 7500 
GTGTGAACTC CAOAAACAOC TCAOaAATTT CATTTCCTTG GAOCAACTAC CTATQACTCC 7560 
5 CCOATACAAT GATGGAOGGT GCC?rTQAAOG AGGGAAGCAA CCAAGGTTTO CTGCTGTCCC 7620 
TT Cltyri T rr GGGAAAGGTA TAAAATTTOC CATCAAGGAT GGCATAQTAA CAGCTGATAT 7680 
TATAGQAGTA GCCAATQAAO ATAQCAQQOG GCTTGCTGCC ATTCTCAATA ATGCC CATTA 7740 
CCTGQAAAAC CTACATTTTA CXATAQAGGG GAGGGACACT CACTACTTCA TTAAGCTTGG 7800 
GTCTCTGGAG GAAGACCTGO TGCTCATOOO TAACACTGGO GGGAGGOQQA TTCTGGAOAA 7860 
10 T60T0TCAAT GTCACTGTGT CCCAGATGAC 'nxri'Ci'GTI O AATGGGAGGA CTAGACGGTT 7920 
TGCAGATATT CAGCTOCAGC ATGGAQCCCT GTGCTTCAAC ATCCGGTATG GGACAACTGT 7980 
CGAAGAGGAA AAGAATCAOQ TGTTGGAQAT TGOCAGACAG CXSOGCAGTGG CCCAGGCCTO 8040 
GACTAAGGAA CAAAQAAGGC TGCAAGAGGG GGAAOAGOGG ATTAGGGCAT G GACAGAAGG 8100 
GGAAAAGCAG CAGCTTTTGA GCACTGGGCX3 GGTrACAAGGT TAOGATGGGT ATTTTGTTTT 8160 
15 GTCTQTTGAG CAGTATTTAQ AACTTTCTGA CAOTQCCAAT AATATTCACT TTATGAGACA 8220 
GAGC6AAATA GGCAGGAG GT AA CAAAAATA TCTCTGCCTT TGCGTCACCA AAGACTGCCT 8280 
G TTTTTAAAA CATAAAATGG TTTATPGTAT TGGTTTTCTA 6ATCAGAACT CTOTATATGT 8340 
AAATATGGAG GAAAAACATA TCCAACTGCC TTTCAATGTG AOGGAAGATG GTATTTTAAT 8400 
ATTGTTTGTT TAAACTCTTT AAGAAATGAC AGAGATTTTT AGTTCTTGTG TGGC AQTAT T 8460 
20 CAAAATAACA CAAOTAGAAC TCAAACAGCT AAAAACAGTT TTCAGAAAGC ACCACTTTCA 8520 
ATTT6CGQAG CCATGCATAT GTTCCAATAT CCAQAAAGAA CCCAAGGTTC TCTATCTCTA 8580 
TraraAOAAG CAGTTTCATC CTTAACTGTT GGCAGAACTT ACG GQCTA TT TGAATAGGTG 8640 
GTGCAATAGT ATCTGAAACT TGCCTTTCGA AAQACTGCCA GCCCTTTGAC GTTTTCCAGA 8700 
TCTGTTATAG GAAACTTAAA AACAGGTGTA AAATGTCTTC AGCCACCATC TCCTA6A6TG 8760 
25 AGGACCCAAT TGCCXTTTCCT TCTTGATTAT TCCTCCTTGC TTGTTAAAGT AAATGCCATA 8820 
TTGTT6TGCT QT6TTTTGGC GTGTGGTGGC TGGGTTCTGT CTACCATQCT TCCCTQTGGG 8880 
T0TG6TAACC AGACTGTATA GCCGCTATTT GCTOQTGTGT ACATGATACC AAAGCAGCTG 8940 
GCCAG06TGA CCTCTCTCAC AOGACCTGTT TTGACTCAAT TTTTTACTAA AAGTTOTTCA 9000 
GCTGTATTGG TATCATGTAA ACATAGCTTT TATTAACCTO GQTAGGAATT TCTCATTTAT 9060 
30 ATATAGGATG TGTTTTGGTC ATAQTTTCAC ATTAGTGATT CAGTATCTAT ACACTGACGC 9120 
AATG6TTTTG TGCACATGAA CGGTAATTTA CTTAAAAGTA TGATTCTGOT ACAAAAACAA 9180 
ACAAAGQCTT TA6CA6GCAT AOGTGTCTGG GATQCCGATA CATACATTAA CTACTACTGC 9240 
AGAAATTCAT AAQAGCCAAA ACCTTAAAAA AATAGACCTQ GTA CTTAA GT GAAA6TACTA 9300 
AAGGGAAGAC CAGACCAAAC ATCACAGCAG TTQCTGCCAC ATTGTTTCAG CCCACTTAGA 9360 
35 TTTATCTTTC AAATGTAC3A TTCTQTATTG AACATCTCCC AGCCATCTTC AGGAAATCGA 9420 
ATGAAGTAAA TCCTTTCCAA COGAAAACAT TTCAACTAAC TATAQAGA60 CAGACTCATT 9490 
TTTACTAAAA TAATTTATAC AGTTAGTTAT TTTCGTTCTC CGTACTTACC CATTTATCTT 9540 
TATTTAATCG TCTCTACTGC CTAGGAAAAT AACTATTTTC CAGGACGGGT TATTTGTTCT 9600 
GCGATCATTT AAAATTTGGA GAAAGGTCAG GATTAQTGTT AATATCAGCT GCAGTTTCTC 9660 
40 AATCTCTAGO AATCCTOCAG TAAAACAAGC CCCTTGOTGA GCTGGAAOAT TTGTQCCCAG 9720 
TOACSVAAGAa ATAGTTTGTA AAATGCTGTG TAATTGTAAO TTACCACAAA TGAAAATACA 9780 
TQACAGCACA ATQTGGCCCQ TAOAAAATTC CCCTGAGCCA QCTTCTGCAC TTTCATCACX: 9840 
GAATCTGAAC ATTTGCTATG TCTGAAG6CA AATTTATGAT GGAATGTTAG TTTGGATTCT 9900 
TTCCAGATGC TACCTAAATG CAGTGTGGGG TCATTGCCTT GCTTTGCGAT GACAGTTTCT 9960 
45 TTGAAAATAT GCAAAQTCAT AAGCTCATGT TAAGGTTTTT CAAGAGTCTQ CCTCCTACTA 10020 
CACAAAGGAA AGCAAGGGAA AGGAAATQAC OCTOGGAAAC AGTAGGGAAO GGTGTATTCA 10080 
AACATTTCAT TTTCAAAAOC TTOOGOTTAfl AATACCACTT ACACATCTAT TCiaAGAOAC 10140 
AGAATTCATG AGGAACTCAT CTCTCTTTAT AACTQGAAAC ACACCAGCTT GATATATTGC 1O20O 
TAATCCATAC TAAAATCATA TTATTGGQTT TTTTCTGAAT CAGGCCTGTA TTAATGGTAC 10260 
50 AOTATTTATT CAQAATGGAA TTCTAAAATT ACTAACAAAC TTOTTGAAAA TTTGA ATACC 10320 
TOCACACCAA CCTAAAAATQ OACCTTAAGT TCCXAGAACC TCTGATGT7C TTTTAAATTA 10380 
ATGOAAAAAT AATTTGTQAA CTGTATATAO AGAGTGCaTT CATAAATGTG ATTATGTATT 10440 
TTATCACAAA TCCAAAATGT CAATATTAOA GTCTATTTTO CTTATATTTT AAGCAATTAT 10500 
ACGTTTTTGC AATTGATTGA TGATGTATCA TTTTCAAACT GCTTTAAATA TCCATTAGAA 10560 
55 ACAAATATTT GAAGCTTTTA CTTAATAGTG ATTACCTTGA ACT6TGCATT TCTAGTTTGT 10620 
AATAOSTATT TG6TTG0TTC GTGCCTTTAiQ TTTOTTAAAQ TTACATTTOT ATTATATTCA 10680 
G6AAATGCAC TTTTTATTAC TTACAGCTGT GOTTTTRATA CTGCCTTGAA CTATTATTAT 10740 
TCTTTTTACA ACTCCTAAAG CTTQAGGGAG QAAAGAAAAA AAAAACAAAA CTACTAATCA 10800 
^ ^MStlKPiKSZ GAAGAGAAAC ATTTTGGCAT TTCTTAAGAA QAAGATGGAa ATATTGAGTA 10860 

60 TATCACTTCC TATTCAGCTO AATAGAAAGA ATGCCTTCAT TGACTTGCAQ TTCTGCAGTT 10920 
TAAATTATTQ AAAGAACAAT TOGTTTGCAT TTCCTGATGA AAGTAAAAOC ATTTTTCAOA 10980 
GAAACATATG AATTTCTCAT ACCCy«3CA0A CftGATGGCTG ACACTGCACA GCCACACACC 11040 
ATTOQAGTAA GTTAAAGTGA OAGCATAGTA GTTGQACTCT OCTATGAAGA ACATTCTGG G 11100 
CTGGAGGCAG GGAATACTCC ATGGTTGTTT CTTTTTCCTA CTTAAGCCCA TTTTGTTTGT 11160 
65 GCTTTTCTGT TTTOTTTTGr TTTCACTCTT GCACTACAGT CTAOAGATCC AAATGAACTG 11220 
AAAAGTTCAA AGTTTAACAG ATTTAAATAT GtTTACTTTT AOTTGTCATT CTAATGGTTA 11280 
TTGATTAGAA GCATC5ACTCC TCAAGQAAAO OOAAATAAAT CTCAATTCAT ACTAACTTGC 11340 
AACAAAACAC TTTTACCATA TAAATAAGTA TATGATTTAT TTTTAACCCA AAAAATGTAT 11400 
AAAATAAGXG TGTCCTTTAC TGTCAATTTA TCGAGAAQAT CTATAATATA TA GACTA CAT 11460 
70 ATATATAATA TATACAACAT AGCCAAATGT ATQAAAACTT GACAATGTAT AATT TOQAAT 11520 
1CACAT6CTA CCTA.TGTAGA CAG6TATGAA ATTAAGTXAT AATTTTCATG AOACA'm"l'C 11580 
ATCACTGTT6 AGACAGTTTC AAGGCATTCC ATCATOTTAT TTTGACTCTT TTTCTTTTTT 11640 
TTTTCTTTAA AAATATATTT TTAACTAGAC CAGGCCCCAC TATAATATCA CTTAAGAGAG 11700 
TCAGGGCAAA GTTTTTGCAT TTATGAAOAT 6TGTTCATGT AAGGGTGATT GTAATGQAGT 11760 
75 TCATTGGTAA TAOAAGCAAA AGTACAGTAA 06AAGTATTG AAAAGAAAAT TTTGGAQACA 11820 
TTG6A6CATA TTATATATAO CrrOTGGAAA GACATAAGQC TACAOATGQA ATOGAACATT 11880 
CCTGTTTTCT TQAAOAAATT CACATACACA TAGCTGACCT GACTAGTACT TCAGCTCTTC 11940 
CACAGCCTTC TATAAAGGTT CTTTCTTCTG CAAAGAAAAC AAAACAAAAC AAAACAAAAC 12000 
AAAAAAAAAC AAAAAAAGOG CAAAAAACAA AAAAACAAAA AAAAGC3\AAG TAAAATTTAA 1206O 
oO AAATACA6AA AACAAACAAC AAAAAAGAAT TCAACCATAA ATAGTOACTTA TTATTTTCAG 12120 

' i xmyfc crf c atgtgaaagc tattaaggac caaatatact actgttcata agaaqaaatt 12180 

ACTTTCTAAA CAGTAACIOA AAATACTTAO AGTTAAACTT GCTGTGGATT TTGTCTTGGC 12240 
AOTTOTCATC TTACATTATT TGTCAAAOQA AATGTGTTTG CCAGTtAAAA ATCTTTCCTT 12300 
AGATTTAGTO GTGGACTTTA ACCTCTTAAA TAAATOTTAG TATATOIGAT TOTGrCCTTQ 12360 
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AAAAATATTT TACTTGTATG AATCATOACA A0C3TCTAAAT CTPTACTATT CTTCTGGCAA 12420 
AAGCATCAGT AAGAAAGAAG GOQAAAAAGA GAAGTATAQC CTTTATGTCA GAAAAACATT 12480 
CTTTTTAGCT GCTTACTTTC TCATGAAAA6 TAAAOATGTT TACAGTQTAT GCCAAGTTTT 12540 
CAGTTTCTGT ATAAGAACAG GTAGAGOTTC TAATCATATT GAAAATTGTO TTATAATGGT 12600 
CTGAQCCATG TT6CTAGQAA ACAATAOGTT CCAATTTTOT ATTCCTQCTC TCCTGTGCT6 12660 
AAAAGTGACT GGATACTGTA CAGGTTCATG TTCTCPGGCT GCAGTTAAAT GGTCnTTGC 12720 
ATTTTGCTCT GGCTTTCAGG CCAGAACC3VT QCATTTTTCT ACAAOAGCAT CACAACAACA 12780 
TGCTGTAAAT ATTTAAAGTT AAACATTATG T G TTGATATT TGAAAOAAAA GTACTTTGAA 12640 
TATTTCATTT TTAAAAAATA AAATTGOCAA TQAAAAAAAA 



A210 Protein sequence t 
Gene natnet 
Ukiigene zmniber: 
Prdbeset Accession tfi 
Protein Accession #s 
Signal sequence: 
Transmembrane domains t 
Cellular Localizations 



odz (odd Oz/ten-m, Drosophila) bomolog 1 
Hs. 23796 
NM_014253 
Np2055068 
none found 
316-340 

plasma nenibrane 



1 11 21 31 41 51 

1 I I I i I 

MEQTDCKPYQ PLPKVKHEMD LAYTSSSDES EDGRKPRQSY KSRETLHBYN QELRMNYNSQ 60 

SRKRKEVEKS TQEMEFCET8 HTLC8GYQTD MHSVSRHGYQ LBMGSDVDTE TBGAASPOHA 120 

LRMWIRGMKS EHSSCLSSRA NSAI*SLTDTD BERKSDGEKG FKF8PVCCEM BAQAGSTQDV 180 

QS8PHNQFTP RPLPPPPPPP KACTCARKPP PAADSLQRRS KTTRSQP8PA APAPPTSTQD 240 

SVHLHNSWVL NSNIPLETRH 8LFKHGS6S8 AIPSAASQMY PLTSNTVYSP PPRPUPRSTP 300 

8RPAFTFNKP YRCCNWKCTA LSATAITVTL ALLLAYVIAV HLFGLTWQLQ PVEGELYAK6 360 

VSKGNKGTES MDTTYSPIGG KVSDKSEKKV PQKGRAIDTG EVDIQAQVKQ TIPPGIiFHRP 420 

QITIHHPIYL KFNI8IAKDS LLQIYORRNZ PPTHTQFDFV KLMDGXQLVK QDSKGSDOTQ 480 

HSPRNLILT8 LQETGPIBYM DQGPHYIiAFY NDGKRMEQVF VLTTAZBIMD 0C8TNCNGNG 540 

ECX86HCRCP PGFIiGPOCAR DSCPVLOGGN GBYBKGHCVC RHGWKGFBCD VPEEQCIDPT 600 

CFGHOTCIMQ VCICVPGYXG EICEEBDCLD PMCSNHGICV KGECHCSTGW GGVNCETPLP 660 

VOQEQCSQHO TPLLDASVCS CDPK»9TGSDC STELCTMECG SHGVCSROIC QCEEGWVGPT 720 

CEBRSCHSRC TBBGQCKDOK CECSPGHEOD HCTIAHYLDA VRDOCPGLCP GNGRCTLDQH 780 

GWHCVCQVOW SGTGOTWMB MIjCX3DNIiI»<D QDGLTDCVDP DCGQQSNCYI SPLCQGSPDP 840 

U3LIQQSQTL FSQHTSRLPY DRIKPLIGKD STHVIPPEVS FDSRRACVIR GQWAIDGTP 900 

LV6VNVSFLH HSDYGFTISR QDGSFDI>VAI GGISVILIPD RSPFLPBKRT LWLPWNQFIV 960 

VERVTMQRW SPPPSCDISN PISPNPIVLP SFIiTSFGGSC PEROTIVPEL QWQEEIPIP 1020 

SSFVRL8YLS SRTPGYKTLL RXIiLTHSTIP VOIIKVHLTV AVEGRLTQKW FPAAINLVYT 1080 

PANNKTDIYG QKVWGLAEAI. VSVQYEYETC PDPILWBQRT WLQGFEMDA SNLGOWSLNK 1140 

HHILNPQSGI IHKCarCHNMF ISQQPFVIST IHGKGHQR8V ACTNCNGPAH NKKLFAFVAL 1200 

ASQPDGSVYV GDFNFVRRIF PSQNSVSILE I^STSPAHKYY LAMDPVSESL YLSDTNTRKV 1260 

YKLKSLVETK DLSKNFBWA GTGDQCLPFD QSHCGDGGRA SKASLNSPRG ITVDRHQPIY 1320 

FVDOTMIRKI DEWAVITTVI GSNQLTSTQP LSCDSOTOIT QVRLEWPTDL AVNPMDNSLY 1380 

VUnmiVliQI SENRRVRIIA QRPIHOQVPG IDHPLV8KVA IHSTLBSARA ISVSHSGLLF 1440 

lABTDERKVN RIQQVTTNGE lYIIAGAPTD CDCKIDPNCD CPSGDGGYAK DAKMKAPSSL ISOO 

AVSPDGTLYV ADLGNVRIRT ISKKQAHUID KNXYEIASPA DQELYQFTVN GTHLKTUnjI 1560 

TRDYVYNFTY NSEGDLGAIT SSNGNSVHIR RDAGGMPLWL WPGGQVYWL TI8SNGVLKR 1620 

VSAQGYNPAI. MTYPGNTGLL ATKSNEN6WT TVYBYDPBOH LTNATPPTGE V8SFHSDLBK 1680 

LTKVEU5TSN RENVLMSTNL TATSTIYILK QENTQSTYRV NPDGSLRVTP ASQMBIGLSS 1740 

BPHILAGAVK PTLGKOTISL PGEHNAHLIB WRQRKEQNKG NVSAFERRLR AHNRNLLSID 1800 

FDHITRTGKI YDDHRKPTIiR ILYDQTGRPI LWSPVSRYNE VNITYSPSGL VTFIQRGTWN 1860 

EKMEYDQSGK IISRTWADGK IHSYTYLEKS VMLLLHSQRR YIFEYDQSDC LLSVTMP8MV 1920 

RHSIiQTMLSV GYYRNIYTPP DSSTSFIQDY SRIX3RLLQTL HLQTQRRVLY KYTKQARLSB 1980 

VLYDTTQVTL TYEE8SGVIX TIHLMHDGFI CTIRYRQT6P LIGRQIPRP8 EEGLVNARFD 2040 

YSYNNFRVTS MQAVINETPI* PIDLYRYVDV SGRTEQFGKF SVINYDLNQV ITTTVMKHTK 2100 

IFSANGQVIE VQYEILKAIA YWMTIQYDNV GRHGNMCIRV 6VDAHITRYP YBYDADGQLQ 2160 

TVSVHDKTQW RYSYDLNGDI KLLSHGKSAR LTFLRYDLRD RITRLGBIQY KMDEZX3FLRQ 2220 

RGHDZPEYNS N6LLQKAYNK ASGHTVQYYY DGLGRRVASK SSLGQHLQFF VDATANPIRV 2260 

THLYNHTSSB ITSLYYDLQG KLIAMELSSO EEYYVACONT GTPLAVFSSR GQVIRBILYT 2340 

PYGDIYBDTY PDFQVIIGFB QGLYDFLTKL VHLGQRDYDV VAORWTTAYH BIWKQLNLLP 2400 

KPFNLYBFEN NYPVGKIODV AKYTTDIR6W LELFGFQLHK VLPGPPKPEL ENLELTYELL 2460 

RliQTKTQEWD PGKTILOIQC ELQKQXjRNFI SIiDQLPMTPR YNDGRCLEGG KQPRFAAVPS 2520 

VFGKGZKFAI XDGIVTADII GVANEDSRRL AAIUQTAHYL EBHiRPTIBGR DTHYPIRLGS 2580 

LBEDLVLIGN TGGRRZLEMG VKVTV8QMTS LLNGRTRRFA DXQLQBQALC FNXRYGTTVE 2640 

BBKNBVLBXA SQRAVAQAWT KEQRRLQEGS BQXRAHTBQB KQQLIiSTGRV QGyDOYFVLS 2700 
VSQYLBLSDS ANNXBFMRQS EXGRR 



A211 laiA SEQUEWCEi 
Gene name t 
Unigene number: 
Probeset Accession ftt 
Nucleic Acid Accession ftt 
Coding sequence! 



Solute carrier family 26, member 4 

B8.1S9275 

AF030880 

NM_000441 

225-2567 (underlined sequences correspond to start and stop codona) 



1 11 21 31 41 51 

I I I 1 I i 

CTCA6CCTTC CCGGTTCQGG AAAGGGGAAG AATGCAGGAG GGGTAGGATT 7CTTTCCTGA 60 

TAGGATOGGT TGGGAAAGAC CGCAGCCTGT GTGTGTCTTT CCCTTCCACC AAGGT G TCT G 120 

TTGCTCOQTA AATAAAACGT CCCACTGCCT TCTGAQAGCG CTATAAAGGC AGOGGAAGGO 180 

TAGTCCGCGG GGCATTCGGQ GOGGGGCGOO AGCAGAGACA GGTCATOQCA GCGCCAGGOG 240 

GCAGGT06GA GCG6C0GCAG CTCCCOGAGT ACAGCTGCAa CTACATGGTG T060GGC06G 300 

TCTACAGOQA GCTOGCTTTC CAQCAACA6C A06A6C6003 CCXX3CA0QA0 CQCAAGA06C 360 



931 



wo 03/042661 



PCTrtJS02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TGCGGOAGAG CCTGGCCAAQ TGCTOCAGTT GTTCAAGAAA GAGAGCCTTT GGTGTGCTAA 420 

AGACTCTTGT QCCCATCTTQ GAGTGGCTOC CCAAATACOG AiCTCAAGGAA TGGCTGCTTA 460 

GTGAOGTCAT TTOGGGAGTT AGTACT GG GC TAGTGGCXZAC GCTGCAAGGG ATGGCATATO 540 

GOCTACTAOC TGOUSTTOCT OTCGGATATQ GTCTCTACTC Iti C I TTTTTC CCTATCCTQA 600 

CATACTTTAT CTTTGOAACA TCAAOACATA TCTCAGTTG6 A CC TTTTCC A GTGGT G AGTT 660 

TAATQGTGGG ATCTGTTGTT CTGAGCATGG CCCCCQACGA ACACTTTCTC GTATCCAOCA 720 

GCAATGGAAC TOTATTAAAT ACTACTATQA TAGACACTGC AGCTAGAGAT ACAGCTAGAQ 780 

TGCTGATTGC CAQTGCCCTO ACTCTGCTGG TT66AATTAT ACAGTT6ATA TTTGGTGGGT fi40 

1GCAQATTQQ ATTCATAOTQ AGQTACTTQG CAOATGCTTT GGTTOQTGGC TTCACAACAO 900 

CPGCTGCCTT CCAAGTGCTG QTCTCACA6C TAAAGATTGr CCTCAATOTT TCAACCAAAA 960 

ACTACAATGG AGTTCTCTCT ATTATCTATA OGCTGGTTGA GATTTTTCAA AATATTGGTO 1020 

ATACCAATCT TGCTGATTTC ACTGCTGGAT TGCTCACCAT TGTOOTCrGT ATGGCA6TTA 1080 

AGGAATTAAA TGAT066TTT AGACACAAAA TCCXAGTCCC TATTCCTATA GAAGTAATTG 1140 

TQAGQATAAT TGCTACTGCC ATTTCATATO GAGCCAACCT GQAAAAAAAT TACAATGCTG 1300 

GCATT6TTAA ATCCATCCCA AGGGGGTTTT TGCCTCCTOA ACTTCCACXn GTGAGCTTGT 1260 

TCTCGOAQAT OCTGGCTGCA TCATTTTCCA TOQCTOTGGT GGCTTATGCT ATTGCAGTGT 1320 

CAGTAGGAAA AQTATATGCC ACX»AGTATG ATTACACCAT OGATGGGAAC CAGQAATTCA 1380 

TTGCXnTTGG GATCAGCAAC ATCTTCTCAG GATTCTTCTC TKrrTPTGTG GCCACCACTG 1440 

CTCTTTCCCG CACGOCCGTC CAGGAGAOCA CTGGAGGAAA GACACAGGTT GCTQGCATCA 1500 

TCTCTGCTGC GATT GTGATG ATCGCCATTC TTGCCCTGGG GAAGCTTCTQ QAACCXTO3C 1560 

AGAAGTCGOT CTTGGCAGCT GTTGTAATTG CCAACCTGAA AGGQATGTTT ATGCAGCTGT 1620 

GTGACATTCC TCX5TCTGTGG AGACAGAATA AGATTGATGC TGTTATCTGG GTGTTTACGT 1680 

67ATAGTGTC GATCATTCTG GG6CTGGATC TCGGTTTACT AGCTGGCCTT ATATTTGGAC 1740 

TGTTGACTGT GGTCCTGAGA 6TTCAGTTTC CTTCTTGGAA TGGCCTTG6A AGCATCOCTA 1800 

6CACAGA7AT CTACAAAAGT ACCAAQAATT ACAAAAACAT TGAAGAACCT CAAGGA6TGA 1660 

AGATTCTTAG ATTTTCCAOT CCTATTTTCT ATGGCAATGT CGATGGTTTT AAAAAATGTA 1920 

TOVAGTCCAC AGTTGGATTT GATGCXIATTA GAGTATATAA TAAGAGGCTG AAAGCGCTGA 1980 

GGAAAATACA GAAACTAATA AAAAGTGGAC AATTAAQAGC AAC3UVAGAAT GGCATCATAA 2040 

GTGATGCTGT TTCAACAAAT AATGCTTTT6 AGCXTTGATGA GGATATTGAA GATCTGGAGG 2100 

AACTTGATAT CCCAACCAAG GAAATAGAGA TTCAAGTGGA TTGGAACTCT GAOCTTCCAG 2160 

TCAAAGTGAA CGTTCCCAAA GTGCCRATCC ATAGCCTTGT GCTTQACTGT GGAGCTATAT 2220 

CTTTCCTGGA CGTT6TTGGA GTGAGATCAC TGCGGGTGAT TGTCAAAGAA TTCCAAAGAA 2280 

TTCATGTGAA TGTGTATTTT GCATCACTTC AAGATTATGT GATAGAAAAG ' CTGGAGCAAT 2340 

GCGGGTTCTT TGACGACAAC ATTAGAAAGG ACACATTCTT TTTGAOGGTC CATQATGCTA 2400 

TACTCTATCT ACAGAACCAA GTGAAATCTC AAGAGGGTCA AGGTTCCATT TTAOAAAOQA 2460 

TCACTCTCAT TCAGGATTGT AAAGATACCC TTQAATTAAT AGAAACAGAG CTGACGGAAG 2520 

AAQAACTTGA TGTCCAGGAT GAGGCTATGC GTACACTTGC ATCCTQAAAG TGGGTTCQGQ 2580 

AGGTCTCTAT GAGCAAGGAA TACAAGACAA AACTTCCTCA ATGCATTGAC TATTTCTTCA 2640 

GACTCAAAAC ACTCATTCTT TTTTCTATTA AGCCATTGAA AGAGAAGCAC TAAGACTGCT 2700 

TCTAGGCTTT ATTTATAAAA TAAACACCTT ATCCCTAACA TGGGCAAAAT G6CTAGAATT 2760 

ATTCAGACGA TTTGGCAGCQ TCCAGOQTAA 6CTGGTGTTA TAATACGCTG CTGATCTACA 2820 

TCACAGATTT GCTAATAATO TTCAOOTOQG COCTQGCATA TCTCTGTTCA GTTAGAGTGA 2880 

GTGCTGACCC AACAGCCTCT GTGGTCAAGC GAGTCACGAA T6ATTAA7CA TAAAGAAAAA 2940 

TCAGTTTrTG ACTGACCTGG ATATCCATGA GCTGCACTGA TCACCATGTA AGQTCACATT 3000 

TAGTAAATGC TGAAATAAAA TGATTAATGC ATT7ATCAAT AAAAGCCTTT GAAAATACTT 3060 

TGGATAATAA ATTGGAaTTT TAAAAATGCA AATTTGCTTA GTATCTAATA ATGAAGTGTT 3120 

ATTACATATA GCOGOAATTQ AGQATCTCTT TGATCCIGGA AATGGTTTAC CTAAAAGCTA 3180 

CAQAACCA6G CCAATATATT TTGAAATATT GATGCAGACA AATGAAATAA TAAAGAGATT 3240 

TTCATGGTTT ATAAAAATCT TTTTTGATAT GATAATAATC ATGATCACAA CTGAGATCAA 3300 

AAAAATATAT GACAGATTAT TTTGTTTAAA AATGCAGTTT TAATTATCTT AGTCTATAGA 3360 

AATOAT CATT GCATGOAOGC ATGTATAO OT ATOATCTGTO TAAAATCTGA CATAAAAACA 3420 

GTGCTATTCT GAGTGAAAAT TTTTTTQATG TGCTTACATA ACCAT6GTGA TTAAAATGAO 3480 

TTTATATTTT TTCTCAAAAA TTTTA6CA6T GTGTAAA6TA AGTAATCTTT AAC7GAACTC 3540 

TQACCACTTA AAAAAAAATC TAAAAATTGA ACTAOCTATA OTAGTCTOTG TTTAAAGTQA 3600 

ATTTTTAAAG ACAAAGCATT CTAAATGAAC TCAATATAAA AACATTCATT TGGAATGTAC 3660 

ATACTQAAA A ATACAGGTTT TTTTOACCAA AAOTTTTTAT ATCTTTTCTT TTTA TTTATT 3720 

TTTTTCCTAA GTGCCAACAA TTTTCTAGAT ATTATATACA ACACAGGCTT TGATCTTGGa 3780 

GACTTTTCCC ATATATTTCA CACTGQAGTO AATGAAGTTG TACTTCATTT CTAQAQAAAA 3840 

GTTATACCCA GGTCCCCAAT TGAGAATGTC TTGCTTGATr GAAAACGACA TCATCCCTTQ 3900 

6TATACTCCA GGGATTGGTT TCAGGACCCC TGCATTTACC AAAATTTOTG CACACTCAAG 3960 

TCXTTGCAGTC ACCCXnXSCCT AAAGATAfiAA TGGCTTCTCT GTTTTTCTTC TGAAATACAA 4020 

OCAGAAACAA TGTGTCTATT TCTGAAA6AA TAGGATTAAT GATCATACAA ATGGGTTAAT 4080 

CCTQAATTCT GQTTGTAAAT CTGGTTACAG CATAACTAGO ATTATAATGC TGCCTCAITT 4140 

TCACAGCACT ACTTGCTTAT ATTGACAACA AATCATCTOQ CTAAAGAGTG AATGTAGGCC 4200 

AGG060QGTG GCTCATGGCr OTAATCOCAO CACrTTGGGA GGCOQAGGOO GGTGGATGAC 4260 

GAGGICAG6A 6ATGQAGACC ATCCT06CTA ACATG6TAAA ACCCGGTCTC TACTAAAAAT 4320 

AGAAAAAAAG AAATTAGCCr AGOQTGGTGO CT6G0GG6CS CCrGTAGTCC CAGCTATTTO 4380 

G6AGGCTAAG GCAGGAGAAT GGCGTGAACC 066GAGGC36G AGCTTGCAGT GAGC06AGGT 4440 

OGTGCCACTO CACTOCAGGC TG6GCGACAG AGCAAGACTC OGTCTCAAAA AAAAAAAAAA 4500 

AAAAAAAAAA AGAGTGAATQ TAATAGTCTT GC31QAAAATQ AATGAATAGC TTTGTTC A AT 4560 

AAAGQAAATA TGCACTGCTC ACTTTTTTGA AOQAAATGCC AAAQTTAC8T TTTACAACAA 4620 

GQCTAQAQTT TOTAAATTCT GGGTTCATTT GTGATGACAT AAGTCA6CAA ACT60GGGAA 4680 

TACTGTCTCT TCTATGTATT TTGTGAATAG TAAGCATAAT TTTAGTTTTG TATTATCAAT 4740 

GAAAATTTCA CTTGAAATTA AAGCTGCCTT TTGTTATATT TTTAACCTAT AGGATAAGAT 4800 

TCXAGTATTG TATAT6AGTT TTAACAAATT AAAAAATCAA ATGATGTACA TTTGAAAATA 4860 

TTT6CACACA TTTAAAAATA AATOTAAAOT TOTCrrTTAA ACTACTQGOA TGTGTCCTTT 4920 
CT6AACAAAA 

A213 Protein sequence i 

Gene name: Solute carrier family 26, member 4 

YKiigene number t He. 159275 

Probeset AcceoBlon 9i AF030&80 
Protein Accession ftt 043511 
Signal sequence: none found 



932 



wo 03/042661 



PCT/US02/36810 



TransiRembrane domains: 
Cellular Localization: 



81>103« 109-131. 136-158, 185-207, 221-243, 245-267, 270-291, 295-317. 
347-369, 386-408, 420-442, 448-470, 486-508 
plasma nenibrane 



MAAPGGRSEP 
AFGVLKTLVP 
PPPILTYFIP 
RDTARVLIAS 
NVSTKMYNGV 
PIBVIVTIIA 
YAIAVSVGKV 
QVAGXISAAI 
IHVFTCZVSX 
BPQGVXILRF 
KN0IISDAV8 
DOGAISFLDV 
TVHDAILYIiQ 
QDEAMRTIAS 



11 
I 

PQLPBYSCSY 
ILEVfLPKVRV 
OTSRHISVGP 
ALTLLVOIIQ 
LSIIYTLVEX 
TAISYOANLB 
YATKYDYTID 
VMIAILAIiGK 
ZLGIiDLGIiLA 
SSPIFY6NVD 
TMNAFEPDED 
VGVRSLRVIV 
NQVKSQEGQO 



21 
I 

MVSRPVYSEL 
KEHLLSDVIS 
FPWSLMVGS 
LIF06LQI6F 
FQNZGDimA 
RNYNAGIVXS 
GNQEPZAFGI 
LLEPI4QKSVL 
GLXFGLLTW 
GFKKCZKSTV 
lEDLSELDZP 
KEFQRIDVNV 
SIZjETITIiIQ 



31 

I 

AFQQQHBRRL 
GVSTGLVATL 
VVL8M21PDEH 
IVRYLADPLV 
DFTAGLXiTIV 
ZFRGFXiPPEL 
SNIPSGPFSC 
AAWIANLKO 
LRVQFP8WNG 
GFDAIRVyNK 
TKBIEIQVDW 
yPASLQDYVI 
DCKDTIiELIB 



41 

1 

QERKTLRESL 
QGMAYALLAA 
FLVSSSNOTV 
GGPTTAAAFO 
VCMAVKBLND 
PPVSLF8EML 
FVATTALSRT 
MFMQLGDIPR 
LGSXP8TDIY 
RLKAZ^RKZQK 
NSBLPVKVMV 
BKLEQOOFPD 
TBLTBBBU7V 



51 
I 

AKCCSCSRKR 
VPVGYGLYSA 
LNTTMIDTAA 
VLVSQLKZVL 
RFRRKIPVPZ 
AASPSIAWA 
AVQESTGGKT 
LWRQNKIDAV 
K8TKNYXNIB 
LIK8GQLRAT 
PKVPIHSLVL 
DNIRKDTFFL 
QDEAMRTLAS 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



A213 DMA 8EQUBJCB1 
Gene name: 
Uhigene number 1 
Probeset Accession #t 
Nucleic Acid Accession Si 
coding sequence) 



ESTs, Highly similar to calcium- activated potassium channel r8K2 [R.norvegicus] 

Hs. 98280 

AA418000 

NM_021614 

458-2197 (underlined sequences correspond to start and stop codons) 



1 11 

1 I 
CGGCGGCAGC AGCCCATGCC 
GGGG60GGGA GATAACCTGT 
GACCCGCACC TCCT0GCC6C 
CCGGGGCAGC CAGCTCAATG 
GCAGCAGTAC GCGCAGCAGT 
CCACAGCCTG CAGCCCGCCG 
CCCGCTCTCG CACCACCACC 
GGG6G0CGQC QAGAOCAAGC 
GG6CGTCATG OGGCOG(CTCA 
CTCAOAGGCO CAGCOCCTQC 
CCCGTCTGCA GA06CTQC0G 
GTCTAA6CCC GAGCACAACA 
CAGCACTGGA GGAOGGGGGQ 
GTCCAGCAAA AAGAAAAACC 
C6AAAAGCGC AAGCGGCTCA 
CATGGTCATC GAGACCGAGC 
AGCTCTQAAA TGCCTTATCA 
CCAOQCCAGO GAAATACAGT 
CATQAGTTAT GAOOOTATTT 
CATACCTG06 AATTATACAT 
CACAACCACC GCTGATGTGG 
GATTGCCAGA QTCATGCTTT 
TGGAOCACTT A ATAAG ATAA 
ATaCCCAGGA ACT6TACTCT 
TGTCCGAGCT TGTGAAAGGT 
GATGTOGTTO ATATCAATAA 
ATACTGTG6A AAAGGAGTCT 
GGTOGTAGCT GTAGT08CAA 
TTTCAT6AT0 GATACTCAGC 
GGAAACATGG CTAATTTACA 
AAGAAAACAT aVACGAAAAT 
GCAGAGGAAA CTQAATGACC 
CATQTATQAT ATGATTTCTG 
TACCCTGGAA ACAAAACTAO 
AAGCCAQACC ATCAOOCAGC 
CAAGCAOGTC ACTTACAATG 
TTCGACAGCA OCACCAACTT 
TAAGACTTTT TGCCATCATA 
CTAATCAG06 TTATCOGGGT 
GGCCAAAATQ AOTGAAAACT 
TTGCTATATA GATTGTTCCT 
AACTTTACTA CTACATTATA 



21 31 

I I 
TCCGGTGCAA CAGCTGCGCC 
CCCTGCTGCT CC6CACCTCC 
T6T0GGGCTG GTCC TOCTQ C 
TGA6CGAGCT GAOGCOOTCC 
C0GC3CA0CA GTCQGCC3TCC 
CCAGCCCCAC GGGCAGCCTC 
ACGACC03CA CCCGGC3QCAC 
CCTTCACCQA AATAGC CATO 
GCAACTTGAG G60GTC0CGC 
AGCCCCCOQC 6TCTGT0GGA 
COGCOSCCGC TGTTTCGTCC 
ACTCCAACAA CCTGGCGCTC 
GOQGTGGAOO GAQGQGGCAC 
AQAACATG6G CTACAAGCTG 
GCQACTAC6C GCTCATCTTC 
TGTCGTGGGG CGCCTACGAC 
GTCTCTCCAC GATCATCCTG 
TGTTCATGOT GGACAATGQA 
TCTTCATCTG CTTGGAAATA 
TCACATGGAC GGCCCGGCTT 
ATATTATTTT ATCTATACCA 
TACATAGCAA ACTTTTCACT 
ACTTCAATAC Ad S ITTT G T T 
TGOTTTTTAO TATCTCATTA 
ACCATGATCA ACAGGATGTT 

crrrrcTCTC cattggttat 

GCTTACTTAC TGGAATTATG 

ggaaoctaoa acttaocaaa 

TGACTAAAAO AGTAAAAAAT 
AAAATACAAA GCTAGTGAAA 
TCCTGCAAGC TATTCATCAA 
AA6CAAACAC TTTGGTGGAC 
ACTTAAAGOA AAaaAaiGAA 
AGACTTTGAT TGGTAGCATC 
AGCAQAGAQA TTTCATTGAO 
CTGAGOGGTC COGGTCCTCG 
CATCAGAGAO TAGCTABAAQ 
TGGTCAATAT TTTAGCTTTT 
TCT6ATGTCA 6AATGCTG6G 

LYmTrriT ctttcagatg 

CCTGTAATTT CACTAACTTT 
TGATATATAA TAAAAAAAX3T 



41 

1 

TCCTCCGGTG 
TCGCCCGGCG 
TGCT6CTGCT 
A6CCAT6CCA 
GCCTCCCAGT 
GGCAGTCTGG 
CACCACCACC 
AGCAOGTGCA 
OGOAACCTCC 
OGAGGTGGOG 
TCAGCCCCOS 
TATGGAACOO 
GGCA0CA6GA 
GGCCACGGGC 
QGCATGTTCQ 
AAQGCGTCGC 
CTOQGTCTGA 
GCAO ATGACT 
CIGOTQTGTO 
GCCTTCTCCT 
AT3TTCTTAA 
GATGCCTCCT 
ATQAAQACTT 
TGGATAATTO 
ACTAGCAACT 
GGTGACATGG 
GGTGCTGGTT 
GCAGAAAAAC 
GCAGCTGCCA 
AAGATAGATC 
TTAAGAAOTG 
TTG6CAAAGA 
GACTTCQAGA 
CAOGCCCTCC 
GCTCAGATOG 
TCCAGGAG6C 
AGAATAAGTT 
ATTGXAAAGG 
AACCTGAACA 
CACAGGGAAT 
TTATTCATQC 
TAATTTOGQA 



51 

I 

GCGCCTTCCG 
GCT08TC6CG 



ACCACCAGTG 
GCTCOGOGCC 
ACCAGCCCCA 
GOTACAAOQG 
AOQAGATGGA 
QOQOQTCCTC 
AGATOGTGGT 



GTGGCACCAA 
GOGCCCTOTT 
GCATCGTGGT 
TGTATTCCTT 
TCATCGT6TA 
GGAQAATAGC 
CTATTCATCC 
ATGCCCCATC 
GACrCTATCT 
CTAGAA6CAT 
TAATGACTAT 
CCGCATGGAC 
TCCTTGGAGC 
TACCTAACAC 
G CACAG CCCT 
AOOTGGAGAA 
ATGTACTCAG 
ATGCAAAAGT 
TAAAAATGGA 
CCCAGAACAT 
AOAGOATTGT 
CT6GGCTCAT 
AGAGCTAOGA 
GGCG6TCCTC 
AACCACAAAA 
GOCTATGGTT 
CTAASTTTTA 
GCACCTATTA 
ACTTCAAACA 



60 
120 

leo 

240 

300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



A214 Protein sequence: 
Gene namsi 
Uaigene number: 
Probeset Accession fit 
Protein Accession #1 
Signal sequence: 
Transmembrane domains: 



BSTs, Highly similar to calcium-activated potassium channel r8K2 (R.norvegicus 

Hs. 98280 

AA418000 

MP_067627 

none found 

135-157, 168-190, 208-230, 254-276. 306-328, 342-364, 373-394 
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PCTAJS02/36810 



Calmodulin binding domain: 
Cellular Local i sat ion i 



412-4B8 

plasma menibrane 



1 

I 

MSSCRYKCGV 
SSAPEIWSK 
LGHRRALPER 
LLGLIIVYHA 
lAFSYAPSTT 
VMKTLMTICP 
YGDMVPNTYC 
NAAANVLRBT 
DLAKTOHIKY 
BAQMSSYDXR 



11 

I 

MRPIiSNIiSAS 
PEHNNSNNLA 
RKRLSDYAIil 
REIQLFMVDN 
TADVDXXLS2 
6TVLLVFSIS 
GKGVCLLTGI 
WLIYKNTKLV 
DMISDLNERS 
VTYNAERSRS 



21 
I 



LY0TG66GST 
PGMFGIWMV 
GADDWRIAMT 
FNPIiRLYLIA 
LWXIAAWTVR 
MGAGCTALW 
KKIDHAKVRK 
EDFBKRXVTL 



31 
I 

AQPLQPPASV 

GGGGGG06S0 
lETELSWGAY 
YBRIFFICLE 
RVMUiHSXIfF 
ACERYHDQQD 
AWAKKLELT 
HQRKFLQAIH 
STKIiETliIGS 
APPTS9ESS 



41 
I 

G0GGGA8SPS 
H6SSS0TRSS 
DKASLYSLAL 
ILVCAZHPIP 
113ASSR9XGA 
VT8NPLQAMW 
KAEXHVHKFM 
QliRSVKMBQR 
XBALPGLISQ 



51 
I 

ASAAAAAAVS 

KKKN(»IIGYK 
KCLISLSTII 
GNYTFTWTAR 

IiXSXTFLSIG 
MDTQLTKRVK 
KLNDQANTLV 
TIRQQQRDFX 



60 
120 

leo 

240 
300 
360 
420 
480 
S40 



A215 DKA SBQUEWCBi 
Gene names 
Xftiigene nuinber: 
Probeset Accession ftt 
Nucleic Acid Accession ft: 
Coding sequence: 



C0I'B6 protein 
Hb. 109201 
AW161450 
NM_016029 

22B-1097 (underlined sequences correspond to start and stop codons) 



1 
I 

CTGCGATCCC 
OGGCGTGCGC 
TGCTCCTGCT 
TATG66C0GA 
TGACT66AGC 
TTTCTCTTGT 
TAGAGAATGO 
CTGGTTGCCA 
TGOTCAACAA 
ACAGAAA6CT 
CTCACATGAT 
TATCTGTACC 
ATGGCCTTCQ 
GACCTGT6CA 
GCAATAATGG 
TCAGCAT66C 
CATATTTGTG 
AAAGGATTCA 
AGACAAAACA 
A AACATQ AAA 
ACTTTTTAAT 
AGATTGCCAT 



11 
I 

GCAGGGCAGC 
GGCC6CAATG 
CTTGGTGCAG 
GTGGCAGGGA 
CTCGAGTGGA 
GCTGTCAGCC 
CAATTTAAAA 
TGAA6CGGCT 
T60TGGAAT0 
AATA6AGCTT 
CGAOAGGAAG 
TCTTTCCATT 
AACAQAACTT 
ATCAAATATT 
A6ACCAGTCC 
CAATGATTTG 
GCAATACATG 
GAACTTTAAQ 
T6ACTGAAAA 
ACAGCAATCT 
AOATATQACT 
GAATCTTGCA 



21 
I 

6ACGCGACTC 
AACTGQGAGC 
CTGCTGCGCr 
GGAGQGCCAO 
ATTGGTQAGG 
AOAAGAGTGC 
GAAAAAGATA 
ACCAAAGCTG 
T0CCA6CGTT 
AACTACTTAG 
CAA6GAAAGA 
GGATACTGTG 
GCCAGATACC 
GTGGAGAATT 
CACAAGAT6A 
AAAGAAOTTT 
CCAACCTGGG 
AGTGGTGTGG 
GAOCAGCTGT 
TCTTATGCTT 
TTGCTTCCAA 
AA 




TCCTGA6GGC 
AATOGGAGCT 
A6CTGGCTTA 
ATGAGCXtSGA 
TACTTOTTTT 
TTCTCCAGGA 
CTCTGTOCAT 
GGAOGGTGTC 
TTGTTACTOT 
CTAGCAAGCA 
CAGOTATAAT 
CCCTAGCTGG 
CAACCAGTOG 
GGATCTCAGA 
CCTGGTGGAT 
AT6CAGACTC 
ACTTTTCAAO 
CTGAATAATC 
CATGGAATQA 



41 

I 

CGTCTTCTTC 
GCTGCTOGTa 
TGACGG06AC 
OACTGATATG 
CCAGTTGTCT 
AAGGGTGAAA 
GCCCCTTGAC 
GTTT6GTAGA 
GQATA CCAOC 
CTTGACAAAA 
QAATAOCATC 

AGTTTCTAAC 
AGAA6TCACA 
TTGTGT60GG 
ACAACCTTTC 
AACCAACAAG 
TTCTTATTTT 
CGACTGGAGG 
AAAGACTAAT 
AATAAAAAAT 



51 
I 

CCCCCGAGCT 
CTGTGCGCGC 
CT6ACGCTAC 
GTGQTGTGGG 
AAACTAGGAO 
AGAAGATGCC 
CTGACCGACA 
ATC6ACATTC 
TTQGATGTCT 
TGTGTTCTGC 
CTGGGTATCA 
GGTTTTTTTA 
ATTTGOCCAG 
AAQACTATAO 
CTGATGTTAA 
TTGTTAGTAA 
ATGGGGAAGA 
AAAATCTTTA 
QAQAAATQQA 
TTQTQATTTT 
AAATAATAAA 



60 
120 
IBO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



A216 Protein sequence t 
Gene name: 
Unigene nuiziber: 
, Probeset Accession # i 
Protein Accession #i 
Signal sequences 
Transmembrane donainss 
Cellular Localizations 



OGI>86 protein 
Hs. 109201 
AH161450 
NP 057113 
1-26 

183-206, 221-243 
plasma membrane 



11 



21 
I 



31 



41 

I 



51 



MNWBLLLWLL VLCALLLLLV QLLRFLRADG DLTLLHAEWQ GRRPBWELTD MWWVTQASS 60 

OIGEELAYQL 8KLGVSLVLS ARRVHBLERV KRRCLENGNL KEXDZLVLPL DLTDTGSHEA 120 

ATKAVLQEFO RXDXLVMNQG MSQRSLCMDT SLDVYRXLXB UIYLOTVSLT KCVLFHMXER IBO 

KQ6KXVTVKS XLGXXSVPL8 IGYCASKBAL RGFFNGLRTB LATYPGZIVS NXCPGPVQSN 240 

XVENSLAQBV TKT2GNNQDQ SKKMTTSRCV RLMLXSMAND LKBVWISBQP FLLVTYLHQY 300 
MPTHAHHXTM KMOXXRXBrTP KSGVDADSSY FXIFKTiaiD 



A217 DMA SBQUEHCBt 
Gene namet 
unigene nun&er: 
Probeset Accession #: 
Nucleic Acid Accession ft: 
Coding sequence: 



Homo sapiens iQRNA; cDNA DXPZp56401763 (from clone DKFZp56401763) 

Hs. 27373 

F13036 

AC01247a 

1-1896 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATOO GOGOOG T6CCGC1GCC G6C0C06CTC CT6CC6CTGC TGCT6CT0SC OCTCCTGGCC 60 

GCT0C06C0Q CCC8C6CCAG CAGAGOOQAG TC0GTCTC08 OOCOCnGQOC GQAACCCGAG 120 

GGGQAC3T0GC GGCCACOQCC OGGCCOGGCO CCCGGG A ACA CCAC00Q6TT TCGGTCTOOG 180 

GOOGOQGGCa GCAQCQGCAG CT0CA6CTCC AACAOCAOTG (jOGAGOCCTT GGTGACCOGC 240 
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5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



ATTTCCATCC 
TTTACOVCCC 
AAGAAGACAC 
CTAAATOAAa 
TCCTTGCOGO 
OTGCCCCCAC 
OGTGGAATCA 
TCAGCTGCAA 
GAAACCAAAA 
TCAGACTGTC 
TTT6GGCATC 
CTGAATCTCA 
GCCAGGTGCT 
CCTTGGTGGC 
AGTACCTTG6 
AGA6GA0TGA 
GCCCTACAGA 
ACCCATCCTG 
AGaOTOTTTC 
TGCCTTTTGG 
AT CTSTC TCC 
GTCCTTGOGT 
AAA06A6GGC 
GCGAACCTGC 
CACCCTGGAO 
GATOGCAQAT 
TGTQAGAGOC 
TOCCCCCGAC 



TOCTCOGCGA 
TCCTCATOGC 
GCAAGTATGA 
AGQATGATGA 
CTGCACTGAG 
CCTTCATCCT 
GACCTGGTAA 
OCTGGGGTGT 
06AACGTTAT 
ACTGGCAAGC 
CCTTTAAAOT 
TGOAAAAGCT 
T6CCACTGGT 
ACTTCAGOGC 
GCTTGGATGrT 
CTAAGGTTCT 
OTGOOTTTCC 
TCAGQTTGGC 
QGCOTCCGCG 
TTTTGAAGAT 



CATTTGAGCA 
CCAGTGGGCA 
AGACAATTCC 
6A6ACTTGGA 
0CCA6AA6AT 
TCACAGGTTC 
AGCCXrrOTT 



CCTACCCACC 
CTGCCTGCTO 
TATCATCACC 
AQATGAGQAC 
ACGTCAGCTG 
CGACATTOAC 
AACCTGTTTC 
GAA6GACTGG 
GTATAAAACC 
TOGTTTCCAC 
GCCCCCTACT 
GGATTCCTCT 
CCTGGCAGAA 
CACAGGCTCT 
TTTCTGTGGT 
CCAG6GTA6C 
CCATO^TCTC 
TCGTTCAGAT 
GCAGTCTCTG 
TCTGTTGAGO 
OGTGGAACAC 
GAGCCCACAA 
ATTGACCAOA 
AGATACCXSUV 
TGGGGTGOCA 
GGTCCTQATG 
CCATCACTTC 
TCTGTCCAGG 



CTCAAGGCAG 
CTGOGOOTCT 
ACrCCAGCAQ 
TCCACAGTAT 
CCAGGGTGCC 
CTTCCAGCAA 
CCAOCCTGGT 
ACCTGGAAGC 
CCA6CTCCAT 
GTCACCACAA 
TCTACTGCCC 
GCCTTA06CA 
ATGGGGGCTG 
CCAATAAAAA 
GC0GGCCA6C 
TGTTTCICCA 
AGGCTTCTCA 
6CCCGGGGAC 
CATGGOGGAG 
CGCCATCCTC 
CTACGGGAAQ 
AAGGCAGCTG 
CACACAT6CC 
GGCCA6GAAG 
AATTTCTATT 
TCTGAGGAAO 
TGCA6CCATT 
CCGTGA 



CCGTGATOGT 
TCAGGTGGGG 
AGOGAGTGQA 
T06ACATCAA 
AGAGGCTACT 
GATGCAGTGG 
GGCATCCTGT 
CCTCTTGOGT 
06TGCGTGTC 
TGGAGTTGCT 
ATGGTTTTCG 
GAAACACCOG 
CTGAAAOTQA 
OCCTTTACAC 
GGGGCACCTT 
AACAGCTGCG 
GAGAGTGTCX: 
AAGCCAQCCT 
GGTCAGCGGG 
AGCTTGACCT 
CCAAGAGAAG 
CTGCCCAOGG 
CTOGCTGGGO 
GCCCSVCXrTGA 
TOOAGQAAGA 
GGGCAOCTAG 
GCAAGTCTTG 



GGCGTTCGCC 
AAAGAGGTTA 
AATGGOGCCA 
ATACAGAGTG 
GACAGTTCCT 
AAGGCCTGAT 
GGAAAGTTGG 
OGGAGGTGTT 
ACGCATCTGC 
TCTGCCACCC 
ACAACTOCAO 
GQCTCCATCT 
CCTTCCAAAT 
ACAAACCATG 
TTGTGAAGAC 
CTGQAAGCCA 
TCCGCTGAGC 
GACGGGGAGG 
TACCGCAACT 
CTTCTACAAA 
CTCAGTGACT 
GGAGCXrrGTC 
GATCACACAT 
GGATGTCACT 
OGGTTTCCAO 
TTTGACAGGA 
GTCCTTCCTT 



300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 



A2ie Protein sequence i 
Gene name: 
Unigene number: 
Probeset Accession #: 
Protein Accession #: 
Signal sequence t 
TransmenOjrane domains: 
Cellular Localization t 



Homo sapiens ntRNA; CDNA DKPZp56401763 

Hs. 27373 

F13036 

FGQ^SH predicted 
1-27 

94-115. 448>469 
not determined 



(from clone DXF2p56401763) 



MRAVPLPAPL 
AAGGSGSSSS 
KKTRKYDIIT 
VPPPFILOID 
BT^NVHYKT 
UniMERLDSS 
STLGLDVPOG 
THFVRLARSD 
ZdiPCCAVEH 
ANIK2TIPXyiQ 
CERLTGSHHF 



11 

I 

LPLLLLALLA 
NSSGDALVTR 
TPABRVBMAP 
LPARC8QRPD 
PAPSCVSOIC 
ALRRNTRAPS 
AGQRGTFCED 
AR6QASLTGR 
LRBAKRSSVT 
OQEGPREDfVT 
SSHSKSHSFL 



21 

I 

AFAARASRAE 
ISILLRDLPT 



OOIRPGRIGF 
SDCBWOARPH 
ARCLPLVXiAE 
RAVTfCVLQGS 
RVFKRPRQSL 
VXASFBQSPQ 
HPGGDU3GVA 
8FRQPLFLSR 



31 
I 

8VSAPWPBPE 
LKAAVIVAFA 
SrVFUIKYRV 
PA»mHPVBSW 
VTTMBLLLPP 
MAAAESDLPN 
SFSKQLRWKP 
HG6GSAGTAT 
KAAA AHGEPV 
NFYIiEEBGFQ 
P 



41 

I 

RESRPPPGPO 
FTTLLIACLL 
SLPAALRRQL 
SAATW3VKDW 
FGHPPKVPPT 
PWWHFSATGS 
AZ«ESOFFHRL 
CLLVLKILLR 
KROPSGQLTR 
PGROQKMVUI 



51 
I 

PGNTTRPGSG 
LRVFRSGKRL 
PGCQTLLTVP 
TWKPSCVGGV 
SrraGFRQLQ 
PIKTLYTQTM 
RLLRECFPL8 
RHPHLDLFYK 
HTCPGHGITH 
SEBGPPSLTG 



60 
120 • 
180 
240 
300 
360 
420 
460 
540 
600 



A219 DNA SEQUEMCE 

Gene namei selectln E (endothelial adhesion molecule 1) 

Unigene numbers Ks. 89546 

Probeset Accession M24736 

Nucleic Acid Accession #: NM_0004SO 

Coding sequence t 1-1833 (underlined sequences correspond to start and stop codons) 



ATG ATTGCTT 
GCCTGGTCTT 
CAGCAAAGGT 
TCCATATTGA 
TGGGTCTGGG 
QAACCCAACA 
GAXGTGGGCA 
GCTGCCrGTA 
TAOICTTGCA 
ACAGCCCTGG 
AGCTACAATT 
ACCATGCAGT 
QAGTGTGATQ 
A6CTTCCCAT 
GCCCAGAGCC 
GCTGTGACAT 
CCTGCTGGAO 
TTGCAOGGAC 
GTTTGTGAAO 
CTTCCTAGTG 
GGTTTTGTGT 



11 

I 

CACAGTTTCT 
ACAACACCTC 
ACACACACCT 
6CTATTCACC 
TAGGAAOCCA 
ATAGGCAAAA 
TGTGGAATGA 
CCAATACATC 
A6TGTGACCC 
AATCCCCTGA 
CTTCCTGCTC 
GTATGTCCTC 
CTGTGACAAA 
GGAACACAAC 
TTCAGTGTAC 
GCAGGGCCGT 
AGTTCACCTT 
CAGCOCAOQT 
CTTTCCA6TQ 
CTTCTG6CAO 
TQAAGGQATC 



21 

1 

CTCAGCrCTC 
CACGOAAGCT 
GGTTGCAATT 
AA6TTATTAC 
GAAACCTCTG 
AGATGAGGAC 
TGAGAGGTGC 
CTOCAGTGGC 
TGGCTTCA6T 
GCATGGAAGC 
TATCAGCTQT 
TGGAOAATGG 
TCCAOCCAAT 
CTQTACATTT 
CTCATCTGGG 
CCGCCAOCCT 
CAAATCATCC 
TOAATQ CAOC 
CACAGCCTT6 
TTTCCGTTAT 
CAAAA06CTC 



31 
I 

ACTTTGGTGC 
ATQACTTATG 
CAAAACAAAG 
TOQATTGQAA 
ACAGAAGAAG 
TG06TGGAOA 
AGCAAGAA6A 
CA06GTQAAT 
GQACTGAAOT 
CTGGTTTGCA 
GATAGGGGTT 
A6TGCTCCTA 
GGGTTOGTGQ 
GACTGTQAAG 
AATT6GQACA 
CAOAATGGCT 
TGCAACTTCA 
ACTCAAGGGC 
TCCAACCC06 
GOGTCCAGCT 
CAATGTGGCC 



41 

I 

TTCTCATTAA 
ATGAG6CCAG 
AAGAGATTGA 
TCAGAAAAGT 
CCAAGAACTG 
TCTACATCAA 
AGCTTGCCCT 
0TGTAGA6AC 
GT6AGCAAAT 
GTCACCCACT 
ACCTaCCAAG 
TTCCAGCCTG 
AAT6TTTCCA 
AAGGATTTGA 
ACQAGAAGCC 
CTOTGAOGTO 
CCTGTGAGGA 
AffTGGACACA 
AOCQAGGCTA 
GTQAOTTCTC 
CCACAGQQQA 



51 




1 

AGAGAGTGGA 


60 


TGCTTATTGT 


120 


GTACCTAAAC 


180 


CAACAATGT6 


240 


GGCTCCAGGT 


300 


6A6AGAAAAA 


360 


ATGCTACACA 


420 


CATCAATAAT 


480 


TGTGAACTGT 


540 


G^AAACTTC 


600 


CAOCATGGAG 


660 


CAAT6TGGTT 


720 


AAACCCTOGA 


780 


ACTAATGGGA 


840 


AAGGTGTAAA 


900 


CAGCCATTCC 


960 


AGGCTTCATO 


1020 


GCAAATCCCA 


1080 


CATGAATTQT 


1140 


CTGTGAOCAO 


1200 


GTGGGACAAC 


1260 



935 



wo 03/042661 



PCTAJS02/36810 



GAGAAGCC^ CATGTGAAGC TCTGAGATGC GATOCTGTCC ACCAGCCCCC GAAGGGTTTG 1320 

GTGAGGTGTQ C TCATT CCCC TATTGGAOAA TTCACCTACA AGTCCTCTTG TGCCTTCAGC 13 BO 

TGrQAGOAGQ GATTTGAATT ATATGGATCA ACTCAACTTG AGTOCACATC TCAGGOACAA 1440 

TQGACAQAAO AQGTTCCTTC CXOCCAAOTG GTAAAATOTT CAAGCCTCGC AGTTCCOGGA ISDO 

AAGATCAACA TGACCT6CAG TOGOOAGCCC GTGTTTG6CA CTGTGTOCAA GTTOGCCTCT 1560 

CCTGAAGGAT GGAOG CTCAA TOGCTCTQCA GCTCGGACAT GTGGAGCCAC AGGACACTGG 1620 

TCTGGCCTOC TACCTACCTG TGAAGCTCCC ACTGAOTCCA ACATTCCCTT GGTAGCTGGA 16 BO 

CTTTCTGCTO CTGGACTCTC CCTCCTGACA TTAGCACCAT TTCTCCTCTO GCTTOGGAAA 1740 

TGCTTAGGGA AAGCAAA6AA ATTTQTTCCT GCCAGCA6CT GCGAAAGCCT ItSAATGAGAC IBOO 
GQAAGCTACC AAAAQCCTTC TTACATCCTT TAA 



A220 Protein sequence t 
Gene namet 
Ubigene nuirbert 
Probeset Accession #» 
Protein Accession #: 
Signal sequence: 
Transmembratte domains: 
C-lectin domain: 
Cellular Localization: 



Selectin E (endothelial 

Hs. 89546 

M24736 

NP^O 00441 

1-22 

555-573 

23-139 

plasma roentbrane 



adhesion molecule 1) 



} 11 21 31 41 51 

I ! I I I I 

MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI CNKEBIEYLN 60 

SIIiSYSPSYY WIGIRKVNNV HVWVGTQKPL TEEAKNWAPG BPNNRQKDBD CVEIYIKREK 120 

DVGMWNDERC SIOCKLALCyT AACTOTSCSO HGECVBTIKN YTCKCDPGPS GUCCBQIVNC 180 

TALESPEHGS LVCSHPLONF SYNSSCSXSC DRGYLPSSMB TKQCMSSGEH SAPIPAC37W 240 

ECDAVTNPAN GFVECPQNPG SFPWNTTCTP DCEBaFBLMG AQSLQCTSSG MWD»EKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGKFTFKSS CNFTCEEXSFM LQGPAQVECT TQGQHTQQIP 360 

VCEAFQCTAL SNPERQYMNC LPSASGSFRY GSSCEFSCBQ GPVLKGSKRL QCX3PTGEWDN 420 

EKPTC EAVRC DAVHQPPKGL VRCAHSPIGE PTYKSSCAPS CBEQFBLYGS TQLECTSQQQ 480 

HTBEVP80QV VKCSSIAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTOGATGHW 540 

SQIiLPTCEAP TE8NZPLVAG IiSAAQLSLIiT LAPFIibHIiRK CLRKAKKFVP ASSOQSLESD 600 
GSYQKPSyXlt 



Taxol Prostate 
A221 DWA SEQX3EWCB 

Gene naroei BSTb; Liprin A2 

Unigene nunibers Hs. 3 064 BO 

Probeset Accession #s N51002 

Kucleic Acid Accession #t N51002 

Coding sequence: 1-3793 (underlined 



sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I i I I 

ATGATG1GT0 AAGTGATQCC CACOATTAAT GAGGACAGCC CAATGAGCCA AAG6GGGTCC 60 

CAAAGCAGTQ GCTOSGACTC AGACTCCCAT TTTGAGCAGC TGATOGTGAA TATGCTAGAT 120 

QAAAGGGATC QTCTTCTAOA CACCCTTCGG GAGACCCAGG AAAGCCTCTC ACTTGCCCAG 180 

CAAAGACTTC AGGATGTCAT CTATGACCGA QACTCACTCC AGAGACAGCT CAATTCAGCC 240 

CTGCCR CAGQ ATATGQAATC CCTAACA6GA GGGCTGGCTO GTTCTAAGGG GGCTGATCCA 300 

CCGOAATTTQ CTOCACTGAC AAAAOAATTA AATGCCTGCA GGOAACAACT TCTAGAAAAG 360 

GAAGAAGAAA TCTCTGAACT TAAAGCTGAA A6AAACAACA CAAOACTATT ACTGGAGCAT 420 

TTGGAGTGCC TTGTGTCAOG ACATQAAAGA TCACTAAGAA TGACGGTGGT AAAAOGQCAA 480 

GCCC AOTCrC CCTCAGGAGT ATCCAGTGAA GTTGAAGTTC TCAA66CACT GAAATCTTTG 540 

TTTGAGCAOC ACAAGOOCTT GC3AT6AAAAO GTAAOGOAGC OACTGAGGGT TTCTTTAGAA 600 

AOAGTCTCTG CACTGGAAGA AGAACTAGCT GCTGCTAATC AQQAQATTQT TGCCTTGCGT 660 

GAACAAAATO TTCATATACA AAOAAAAATG GCATCAAGOO AGGQATCCAC AGAGTCAGAA 720 

CATCTTGAAO GGATGGAACC TGGACAGAAA GTCCATGAOA AGOGTTTGTC CAATGGTTCT 780 

ATAGACTCAA COGATGAAAC TAGTCAAATA GTTGAACTAC AAGAATTGCT TQAAAAGCAA 840 

AACTATGAAA TGOCOCAGAT OAAAOAAOGT TTAOCAOCOC TTTCTTOCOO AGTGGGAGAG 900 

6T6GAACA6Q AAGCAGAGAC AGCAAGAAAG GATCTCATTA AAACAGAAOA AATQAACACC 960 

AAGTATCAAA GGGACATTAO GGAGGCCATG 6CACAAAAGG AAGATATGGA AGAAAGAATT 1020 

ACAACCCTTG AAAAGCGTTA CCTCAGTGCT CAGAGAGAAT CTACCTCCAT ACATGACATG 1080 

AATGATAAAC TAQAAAATQA GTTA6CAAAT AAAGAAGCTA TCCTACG6CA GATGGAAQAG 1140 

AAAAACAGAC AGTTACAAGA AGGTCTTQAO CTAGCIOAAC AAAABTTGCA GCAGACCATG 1200 

AGAAAGOCTO AAACCTTGCC TGAAGTAGAG GCTGAACTGG CTCAGAQAAT TGCAQCCCTA 1260 

ACCAAGGCTG AAGAGAQACA TGOAAATATT GAAGAACGTA TX3AGACATTT AGAGGQTCAA 1320 

CTTGAAOAGA AQAATCAAGA ACTTCAAAGA GCTAGGCAAA GAQAGAAAAT GAATGAGGAQ 1380 

CATAACAAGA QATTATOOOA TACOQTTGAT AGACTTCTGA CTGAATCCAA TGAAOQCCTA 1440 

CAACTACACr TAAAOGAAAG AATGGCTQCT CTAGAAOAAA AGAATGTTTT AATTCAAGAA 1500 

TCAGAAACTT TCAGAAAGAA TCTTGAAGAA TCTTTACATO ATAAGQAAAO ATTAGCAGAA 1560 

QAAATTGAAA AGCTGAGATC TQAACTTQAC CAATTGAAAA TGAGAACTGG CTCTTTAATT 1620 

GAACCCACAA TACCAAGAAC TCATCTAGAC ACCTCAGCTG AGTTGCGGTA CTCAOTGGGA 1680 

TCCCTAGTGO ACAGCCAGTC TOATTACAGA ACAACTAAAG TAATAAGAAG ACCAAGGAOA 1740 

GGCOGCATGQ OTGTGOBAAa AGATGAGCCA AAGGTGAAAT CTCTTGGGGA TCAGGAGTGO 1600 

AATAGAACTC AACftGATTGO AOTACTAAOC AOCCACCCTT TTOAAAGTGA CACTGAAATO I860 

TCTOATATTO ATGATGATGA CAGAGAAACA ATTTTTAGCT CAAT6GATCT TCTCTCTCCA 1920 

AQTGQTCATT CCGATGCCCA GAOGCTAOCC ATGATGCTTC AGGAACAATT GQATGCCATC 1980 

AACAAAGAAA TCAGGCTAAT TCAGGAAGAA AAAGAATCTA CASAGTT6G0 T8CT6AAGAA 2040 

ATTGAAAATA QAOTGGCTAO TGTGAGCCTC GAAGGGCTGA ATTTG6CAA0 OGTCCACCCA 2100 



936 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GQTACCTCOl 
GGACACTCAA 
GTCATOACAC 
GATGGTCQAG 
GCCCTCAGAA 
TCTGTCTCTC 
CTTCACAAAQ 
AAAGAAAAAO 
GAGTCCCTGG 
AAGCATGAAC 
CCAACTOTGQ 
TGCCGAQCCA 
AOAiQAAATTG 
ATG6TTTCCC 
T6Q0TQACTC 
GAAGGAAOCT 
CATQAGT6GA 
TTTATGGAAT 
GTCCATTTAA 
TTAAAQA60T 
GAAATAAAAO 
GGACTTGOAO 

ctggatgaaa 
accx:ag6caa 

AGGCGACTGG 
TTTCCTCCTC 
GCTQGATTTA 
TCATCAAOAC 
G0GGC06CTT 



TTACTGCCTC 
CTCCAAAGCT 
TGCCAAGTGA 
AGGACAAAGC 
TGACTCACAC 
TTGAGCCAGA 
CCCXTCAAGAA 
CTCGACTTQG 
GGTTAGGCAA 
TTCTTGAAGA 
TCGCATGGCT 
AGGT6AAGAQ 
GAATCAGCAA 
TAACAAGTCC 
ATGAAGAAAT 
GGGCCCAGTQ 
TTGGAAATGA 
GCTTGGTAGA 
AAATGGTGGA 
TGAATTATQA 
AGGTQTTGGT 
AATATGCAAA 
ACTTTGACTA 
GGCAGATTCT 
ATGAAAGTGA 
GTGAAGTACA 
GGTTAACCAC 
TGCAGAOOTT 
TAA 



TGTTACAGCT 
CACCCCT06A 
TCTGAGGAAA 
AACAATTAAA 
TCTCCCTTCT 
AAaOCTOOGG 
GAAA0C3AATC 
GCAGCTCOQA 
ACTCGQAACT 
AGCTOGGAGA 
AGAGCTTTGG 
TGGTGCCATC 
TCCACTGCAT 
TTCAGCTCCT 
GGAAAATCTT 
TCOGQTTTTT 
ATGGCTTOCC 
TGCAAQAATG 
TAGTTTCCAT 
CAQAAAAGAA 
GTGGAGCAAT 
TAATATACTT 
CAQCA6CTTA 
TGAAAGAGAA 
TGACAAGAAC 
TGGAATCA6C 
AACCTCTOOG 
AQACAACTCC 



TCATCGCTGG 
A6CCCT6CCA 
CATOGGAGAA 
TGTQAAACTT 
TCCTACCACA 
CTT6GTAGTG 
AAGTCnCAA 
GGCTTTATGG 
CAAGCTGAGA 
AAGGGATTAC 
TTOGGAATQC 
ATGTCTGCTT 
CGCTTAAAAC 
CCAACATCTC 
GCA6CTCCAG 
CTACAGACOC 
AGCTTGGGGTr 
TTAQATCACC 
CGAACAAGTT 
CTAGAAAGAA 
GAC06AATTA 
GA6AGCGGTG 
ACTTTATTAT 
TACAATAACC 
TTCAQACGTG 
AT6ATGCCTG 
CAATCAAGAA 
ACTGTTOGCA 



CCAGTTCATC 
GGGAAATGGA 
AGATTGCAGT 
CTCCTCCTCC 
ATGATGCT06 
CCAACASCAG 
TAGGAOGTTT 
AQACTGAA6C 
AGGATCGAAG 
CTTTTGCCCA 
CTGOGTGGTA 
TATCT8ACAC 
TTOQATTAGC 
GAACTCCTTC 
CAAAAACX3AA 
TGGCTTATGQ 
TACCTCAQTA 
TAACAAAAAA 
TACAATATGG 
6ACG66AAGC 
TT0GCTG6AT 
TOCATGOCTC 
TACAGATTCC 
TCrrGQCCCT 
GATCAACCTG 
GGTCCTCAGA 
AAATGACAAC 
CATACTCATO 



TCCOCCCAGT 
TCGGATGGGA 
TGTGGAAGAA 
TACCCCTAGA 
AAGTAGTTTA 
CCAAGACTCT 
GTTTGGTAAA 
TGCAGCTCAG 
ACTAAAGAAA 
GTGGGATGGG 
0GTG6CAGCC 
TGAGATCCAG 
AATCCAGGAG 
AGGCAACGTT 
AQAATCTQAQ 
AGATATOAAT 
CAGAAGTTAC 
AOATCTCCOT 
AATTATGTGC 
AA6CCAACAT 
ACAAGCAATT 
ACTTATAGCC 
AACACAGAAC 
GGGAACTGAA 
6AGAAGGCAG 
AACATTAGCA 
AGATGTTGCT 
TCTGGAGTAA 



A222 Protein sequence: 
Gene namet 
Unigene nuiober: 
Probeset Accession #t 
Protein Accession ftt 
Signal sequence t 
Transinenibrane domains x 
AAA domain: 
SAN domains I 
Cellular Localization: 



ESTs; Liprin A2 
Hs. 306480 
N51002 

none found 
none found 
none found 
286-539 

895-964, 1017-1084« 
not determined 



MMCEVMPTIN 
QRLQDVIYDR 



FEHHKALDEK 
HIiEGMEPGQK 
VfiQEAETARK 
NOKLENBLAN 
TKABERHGHI 
QX^KBRMAA 
EPTIPRTHLD 
NRTQQIGVLS 
NKBIRLZQEB 
GHSTPKLTPR 
ALRMTHTLPS 
KEKARIiQQIiR 
PTWAWLELW 
MVSLT8P8AP 
HEHZGHEWLP 
LKRUTYDRKB 
LDENFDYSSL 
FPPREVHGI8 



11 

I 

EDTPMSQRGS 
DSLQRQLNSA 
RNNTRLLLEH 
VRERLRVSLE 
VHEKRLSNOS 
DLIKTEEKNT 
KEAILRQMEE 
BERMRBLEGQ 
LBBKNVLIQE 
TSAELRYSVG 



21 



LP^IESLTQ 
LECLVSRHBR 
RVSALEEELA 
IDSIDETSQI 
KYQRDZREAM 
KNRQIiQERLE 
LEEKNQGLQR 



31 

I 

FEQLMVNMLD 
OLAGSKGADP 
SLRMTWXRQ 
AANOEIVALR 
VELQEIiLEKQ 
AQKBDMEERX 
liAEQKLQQTM 



XB8TBLRABB 
SPAREKDRMO 
SYHNDARSSL 
GFMETEAAAQ 
LGMPAHYVAA 
PT8RTPSQNV 
SLGLPOYRSY 
LERRRSASQH 
TLXiLQIPTQH 
MMPGSSETLP 



SLVDSQSDYR 
SDIDDDDRET 
ZENRVASV8L 
VHTLPSDUIK 
6VSLEPBSLG 
ESLGLQKLGT 
CRANVKSGAI 
WVTHEBHENL 
FMBCLVDARM 
EIKZ3VLVWSN 
TQARQILER6 
AGFRLTTT86 



SLRDKERLAE 
TTKVIRRPRR 
IFSSMDLLSP 
EGUILARVKP 
KSRKXAWEE 
LGSANSSQDS 
QAEKDRRIiKK 
MSALSOTEIQ 
AAPAXTKBSE 
LDHLTKKDLR 
DRZIRHZQAI 
YIINLLALGTE 
QSRKMTIDVA 



41 
I 

ERDRLIiDTliR 
PEFAALTKEL 
AQSPSGVSSB 
EQNVHXQRKM 
NYEMAQMKER 
TTLEXRYLSA 
RKAETLPBVE 
HNKRLSDTVD 
EZEKIiRSELD 
GRMGVRRDEP 
SGHSDAiQTLA 
8T8ITASVTA 
DGREDKATZK 
LHKAPKKKQI 
KHELLEEARR 
RBZGZ8NPLK 
EGSWAQCFVP 
VHLXMVDSFR 
6LREYANNIL 
RRLDBSDDKH 
SSRLQRLDNS 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3760 



51 
I 

ETQESIiSLAQ 
NACRBQIiIiEK 
VEVLKALK8L 
ASSEGSTESE 
IiAALSSRVGE 
QRESTSZHDN 
AEXiAQRIAAL 
RIiLTESNERIi 
QLKMRT6SLZ 
KVKSLGDHEW 
MNLQEQIOAZ 
8SIASSSPPS 
CBTSPPPTPR 
RS8ZGRLFGK 
KQLPFAQWDQ 
RLKLRLAZQE 
XiOTLAYiaSMN 
RTSLQYOZMC 
ESGVHGSLZA 
PRRGSTWRRQ 
TVRTYSOiB 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



A223 DNA SEQUENCE 
Gene names CDA14 
Utiigene number: Hs.26813 
Probeset Accession #t N32912 
KUCleic Acid Accession ft: NM_016S70 

Coding sequence t 1- 1134 (underlined sequences correspond to start and stop codons) 



ATGAGGCGAC 
AAOQTTCCTO 
TTTACAACTA 
AAGTATGAAT 
ACTGTTGOCA 
QTTGCATCTG 
AAAGAGTG6C 
GAAOATOTGA 



11 
I 

TGAATCG6AA 
AGAGCTATGT 
TGQCTTTATT 
ACXSA AGTAG A 
T6AAGTGTCA 
CAOATOGrTT 
AGAGGATGCT 
TATTTAAAAG 



21 
I 

AAAAACTTTA 
AGAQACTTCA 
AACCATAATQ 
CAAGGATTTT 
ATATOTTOGA 
AGTTTATGAA 
GC3U3CTGATT 
TGCTTTTAAA 



31 

1 

AGTTTG6TAA 
6CX:AaTGGAQ 
GAATTCTCAG 
TCTAGCAAAT 
GOQGATGTAT 
CCAACAOTAT 
CAGAGTAGGC 
AGTACATCAA 



41 

I 

AAGAGTTOQA 
GTACA6TTTC 
TATATCAAGA 
TAAGAATTAA 
TG6ATTTAQC 
TTQATCTTTC 
TACAAGAAGA 
CRG C iVi'YOC 



51 
I 

TGCCTTTCGG 
TCTAATAGCA 
TACATGQATG 
TATAGATATT 
A6AAACAAT6 
ACCACAOCAO 
GCATTCACTT 



60 
120 
180 
240 
300 
360 
420 
480 



937 



wo 03/042661 
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GATGATTOVT CACAGTCTCC AAATGCATGC AGAATTCATG GCCATCTATA TOTCAATAAA 540 

CTTAGCAGGGA ATTTTCACAT AACAGTGGGC AAGGCAATTC CACATCCTCQ TGGTCATGCA 600 

CATTTGGCAQ CACTIGTCAA CCATGAATCT TACAATTTTT CTCATAOAAT AGATCATTTG 660 

TCTTTT66AO AOCTTGTTCC AGCAATTATT AATCCTTTAa ATGGAACTGA AAAAATT6CT 720 

ATAGATCACA ACCAGATGTT CCAATATTTT ATTACAGTTG TGCCAACAAA ACTACATACA 780 

TATAAAATAT CA6CA6ACAC CCATCAGTTT TCTGTGACAG AAAGGGAACG TATCATTAAC 840 

CATGCT6CAG GCAGCCATGG AGTCTCTGGQ ATATTTATGA AATATGATCT CAGTTCTCTT 900 

ATGGTGACA6 TTACTGAGGA 6CACATGCCA TTCTGGCAGT TTTTTGTAAG ACTCTGTGGT 960 

ATTOTTGQAO GAATCTTTTC AACAACAQ6C ATQTTACATO QAATTOGAAA ATTTATAOTT 1020 

GAAATAATTT 6CTGT0GTTT CAGACTTGGA TOCTATAAAC CTGTCAATTC TGTTCCTTTT 1080 
GAOQATQGOC ACACAQACAA OCACTTACCT CmTAGAAA ATAATACACA TTOA 

A224 Protein sequence: 
Gene names ' C33A14 

Ualgene n\iinber: us. 26813 

Probeset Accession ti: N32912 
Protein Accession ftt NP_057654 
Signal sequence: none found 

Transmembrane domains t none £ound 
Cellular Localizations nuclear 

1 11 21 31 41 51 

1 I I I I I 

MRRUIRKKTL SLVKBLDAPP KVPESYVBT8 ASQGTVSLIA FTTMALLTIM BFSVYQDTWM 60 
KYEYEVDKDF SSKLRINIDI TVAHKCQYV6 ADVLDLAETM VASADGLVYE PTVFDLSPQQ 120 
KEWQRMLQLI QSRIiQEEHSL QDVIFKSAPK STSTALPPRB DDSSQSPNAC RIHGHLYVNK 180 
VAGNFHITVG KAIPHPRGHA HLAALVNHBS YNPSHRIDHL SFOELVPAII NPLDGTE3CIA 240 
IDHNQMFQYF ITWPTKLHT YKISADTKQF SVTERERIIN HAAGSH6VS0 IFMKYDLSSL 300 
MVTVTBEHMP FVQFFVRLGG IVGGIFSTTG MLHGIGXFZV EIICCRPRIiG SYKFVNSVPF 360 
EDGHTDNHLP LLENNTH 

Uterine 

A225 DNA SEQUENCE: 

Gene name: BSTs, Neakly similar to JE0350 Anterior gradient -2 [K. sapiens) 

Unigene nunA>eri Hs. 100686 

Probeset Accession «i AA487468 
Nucleic Acid Accession it* AA4a7468 

Coding sequence: S5*-555 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

1 1 I I 1 I 

CGGCACCAAO AGCACTGGCX: AAC5TCAGCTT CTTCTGAGAG AOTCTCTAGA AGACRTGATG 60 

CTACACTCAG CTTTGGGTCT CTGCCTCTTA CTCGTCACAG TTTCTTCCAA CCTTGCCATT 120 

GCAATAAAAA AGGAAAAGAG GCCTCCTCAG ACACTCTCAA QAQGATGGGG AGATQACATC 180 

ACTTGGGTAC AAACTTATGA AGAAGGTCTC TTTTATGCTC AAAAAAOTAA GAAGCCATTA 240 

ATGGTTAT7C ATCACCTGGA GGATTQTCAA TACTCTCAAO CACTAAAGAA AGTATTTGCC 300 

CAAAATGAAG AAATACAAGA AATGGCTCAG AATAAGTTCA TCATGCTAAA CCTTATGCAT 360 

GAAACCACTQ ATAAGAATTT ATCACCTGAT QGGCAATATG TGCCTAGAAT CATGTTTGTA 420 

GACCCTTCTT TAACAGTTAO AGCTGACATA GCTGGAAQAT ACTCTAACAG ATTQTACACA 480 

TATGAGCCTC GGGATTTACC CCTATTGATA GAAAACATGA AGAAAGCATT AAGACTTArr 540 

CAOTCAGAGC TATAAQAGAT 6ATAGAAAAA AGCCTTCACT TCAAAGAAGT CAAATTTCAT 600 

GAAGAAAACC TCTGGCACAT TGACAAATAC TAAATGTGCA AGTATATA6A TTTTQTAATA 660 

TTAGTATTTA 6TTTTTTTAA TGTOTTTQCA ATAOTCTTAT TAAAATAAAT 6TTTTTTAAA 720 
TCTGAAAAAA AAAAAAAAAA AAAAAAAAA 

A226 Protein sequence: 

Gene name: ESTs, Heakly similar to JE0350 Anterior gradlent-2 [H. sapiens] 

Unigene number: Hs. 100686 

Probeset Accession #: AA487468 

Protein Accession #: none found 

Signal sequence: 1-23 

Transmembrane domains: ncme found 

Cellular Localization: secreted 

1 11 21 31 41 51 

) I ! i I i' 

MKLHSALGLC LLLVTVSSNL AIAIKKEKRP PQTLSRGWGD DITWVQTYEB GLFYAQKSKK 60 
PLMVZHHLED OQYSQALKKV FAONCEIQBM AQUKFIMLNL MHBTTDIOniS PDGQYVPRIJ4 120 
PVDPSLTVRA DIAQRYSNRL YTYEPRDLPL LXa9MKKALR LZQSBL 

A227 DNA SEQUENCE 

Gene name: g protein-coupled receptor 49 (GPR49) (H638) (LGR5) 
Unigene number: Hs. 285529 

Probeset Accession tf: AA460S30 
Nucleic Acid Accession #: NM_003667 

Coding sequence: 201-2924 (underlined sequences correspond to start and stop codons) 



938 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



1 11 21 31 41 SI 

1 I I I I I 

GT6GCGGCAA C0S6CACCTC AGTCCCCGCX: GOGCTTCTCC TCGCOGCCCA CGCCGTCGGQ 60 

TCAQGAACGC GGC6TCTGGC 6CTGCAGACX3 CC06CTGAGT TGCAGAAGCC CAOGGAGCGG 120 

CGCCGGGCGC GCCAGGGCCC GTAGCA6TCC G6TGCTGCTC TC06CC0G0G TC066CT0GT 180 

GGCCCCCTAC TTOGGOCACC ATOQAOVCCT CCOO G CTOGG T G T G CTCCTG TCCTT6CCXG 240 

TGCTGCIGOI GCTGGCX3ACC GGGGGCAGCT CTCCCAGGTC T GG TGTGTTG CTGAGGG6CT 300 

6CCCCACACA CTGTCATTGC GAGCCOGAOG GCA6GATGTT GCTCAGGGTG GACTGCTCCG 360 

ACCTGQGGCT CTCQGAGCTG CCTTCCAACC TCAOCGTCTT CACCTCCTAC CTAGACCTCA 420 

GTATGAACAA CATCAGTCAG CTGCTCCOGA ATCCGCTGCC GA0TCTG06C TTCCTGGAGO 480 

AGTTA08TCT T60QGGAAAC GCTCTGACAT ACATTCTCAA GGQAGCMTC ACTGGCCTTT 540 

ACAOTCTTAA AGTTCTTATG CTOCAGAATA ATCAGCTAAG ACAOQTACCC ACAGAAGCTC 600 

TGCAOAATTT GCGAAGCCTT CAATCCCTGC GTCTGOATGC TAACCACATC AGCTATOTGC 660 

CCCCAAGCTG TTTCAGTGGC CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT 720 

TAACAGAAAT CCCOGTCCAG GCTTTTAGAA GTTTAT0C30C ATTGCAA6CC ATGACCTTGG 780 

CCCTGAACAA AATACACCAC ATACCA6ACT ATQCCTr i GQ AAACCTCTCC AGCrTGGTAG 840 

TTCTACATCT CCATAACAAT AGAATCCACT CCCTG6GAAA GAAATGCTTT G AT GG G CTCC 900 

ACAaCCTAOA GACTTTAGAT TTAAATTACA ATAAC3CTTGA TOAATTCCCC ACTGCAATTA 960 

GGACACTCTC CAACCTTAAA QAACTAGGAT TTCATAGCAA CAATATCRGG TOJATACCTG 1020 

AGAAAGCATT TGTAGGCAAC CCTTCTCTTA TTACAATACA TTTCTATOAC AATCCCATCC 1080 

AATTTGTTGG GAQATCTGCT TTTCAACATT TACCTOAACT AAGAACACTG ACTCTGAATG 1140 

GTGCCTCACA AATAACTGAA TTTCCTGATT TAACTOOAAC TGCAAACCTG GAGAGTCTGA 1200 

CTTTAACTGG AOCACAQATC TCATCTCTTC CTCAAACX33T CTGC3VATC3\G TTACCTAATC 1260 

TCCAAGTGCT AGATCTGTCT TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAOTCTGCC 1320 

AAAAGCTTCA GAAAATTGAC CTAAOACATA ATGAAATCTA OGAAATTAAA GTTOACACTT 1380 

TCCAGCAGTT GCTTAGCCTC CGATOGCTQA ATTTGGCTTG GAACAAAATT GCTATTATTC 1440 

ACCCO^ATGC ATTTTCCACT TTGOCATCCC TAATAAAGCT OGACCTATOG TCCAACCTCC 1500 

TGTamrrrr tcctataact gggttacatc gtttaactca cttaaaatta acaggaaatc is60 

ATGCCTTACA GAGCTTGATA TCATCTOAAA ACTTTCCAGA ACTCAAGQTT ATAGAAATGC 1620 

CTTATGCTTA CCAGTGCTGT GCATTTGQAQ TGTGTGAGAA TGCCTATAAG ATTTCTAATC 1680 

AATGGAATAA AGGTGACAAC AGCAOTATGG AOQACCTTCA TAAQAAAQAT OCTGQAATQT 1740 

rrCAGGCTCA AGATGAACGT GACCTTGAAG ATTTCCTGCT TGACTTTGAG GAAGACXnGA 1800 

AAGCXxrrrcA ttcagtgcag tgttcacctt ccxx^ggccc cttcaaacxx: tgtgaacacc ib6o 

TaCTTGATGQ CTGGCT6ATC AQAATTGGAa TGTGQACCAT AGCAGTTCTG GCACTTACTT 1920 

QTAATOCTTT GGTGACTTCA ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC 1980 

TGTTAATTGG GQTCATCGCA GCAGTGAACA TGCTCAOGOQ AGTCTCCAQT GCOGTGCTGQ 2040 

CTGGTQTGGA TGCOTTCACT TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG 2100 

GGGTTGGTTG CCATGTCATT Ui ' mT l T U T CCATTTTTGC TTCA6AATCA TCTGTTTTCC 2160 

TOCTIACTCT GGCAGCCCTQ QAQO Q T OGO T TCTCTOTGAA ATATTCTQCA AAATTTGAAA 2220 

CGAAAGCTCC ATTTTCTAGC CTQAAAGTAA TCATTTTGCT CTGTGOOCTQ CTGGCCTTGA 2280 

CCATGGCOGC AGTTCCCCTQ CTQGGTOOCA GCAAQTATGO CGCCTCCCCT CTCTGCCTQC 2340 

CTTTGCCTTT TGGGGAGCCC AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT 2400 

C0CTTT6CTT GCTCATGATG ACCATTQGCT ACACXIAAGCT CTACTSCAAT TTGQACAAaQ 3460 

GA6ACCTGGA GAATATTTGG GACTQCTCTA TGGTAAAACA CATTGCCCTO TTGCTCTTCA 2520 

CCRACTGCAT CCTAAACTGC CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA 2580 

CATTTATCAG TCCTGAAGTA ATTAAOTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT 2640 

GTCTCAATCC CCTTCTCTAC ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC 2700 

TQAQAAAGCA AACCTACQTC TOaAGAAGAT CAAAAOUXX: AAGCTTQATa TCAATTAACT 2760 

CTGATGATGT CGAAAAACAG TCCTQTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT 2820 

CCA6CATCAC TTATGACCTG CCTCCCAGTT COSTGCCATC ACCAGCTTAT CXaOTQACTQ 2880 

AQA0CTQCC3V TCTTTCCTCT GIGQCATTTG TCCCATGTCT CTAATTAATA TGTGAAGQAA 2940 

AATGTTTTCA AAGGTTGAQA ACCTGAAAAT GTGAGATTGA GTATATCAGA GCAGTAATTA 3000 
ATAAGAAOAO CTGAGOTGAA ACTGGOTTTA AA 



A228 Protein sequence 
Gene namei 
xmigene nuniber: 
Protein Accession #i 
Signal sequence t 
Transmembrane domains t 
Cellular Localization! 



G protein-coupled receptor 49 (GPR49) (HG38) (LGRS) 
Be. 285529 
NP 003658.1 
1-22 

557-579, 596-618, 641-663, 687-709, 724-746, 770-792, 803-825 
plasma menibrane 



1 

I 

KDTSRLGVLL 
PSNIiSVFTSY 
LQNNQLRHVF 
AFRSLSALQA 

FQHLPBLRTL 
YNLLEDLP8F 
LPSLXKLDLS 
AFGVCBITAYR 
C8PSPGPFKP 
AVNMLTGVSS 
BRQFSVKYSA 
STMSYMVALI 
PVAFLSF88L 
HTRSKHPSLM 
VAFVFCL 



11 
I 

SLPVLLQLA7 
LDLSMNNZSQ 
TBALONl<RSL 
MTLAUnaHH 
TAIRTLSNLK 
TLNGASQITE 
SVCQKIiOKID 
SNLLS8FPZT 
ISNQWNKGDN 
CEHLLDGWLI 
AVLAGVDAFT 
KFETKAPF8S 
LUiSLCFLMH 
ZMLTFI8PEV 
SINSDDVBKQ 



21 
I 

GGSSPRSGVL 
LLPNPLPSIiR 
QSLRIiDANHI 
IPDYAPGNLS 
BliGFHSmilR 
FPOLTGTANL 
LRHNBZTBIK 
GLHOLTHUa* 
S8MDDLHKRD 
RIGVWTIAVL 
FGSFARKQAW 
LKVZILLCAL 
TZArrXLYQI 
IKFZZJiWVP 
SCDSTQALVT 



31 

! 

LRGCPTHCHC 
FliEEIiRIiAGN 
SYVPFSCFSO 
SLWIiBUQQI 
8IPHKAFVGN 
ESLTLTGAQI 
VDTFQQLLSI. 
TGNRALQ8LI 
AGNFQAQDER 
ALTQIALVTS 
NENOVGCRVI 
LALTMAAVPL 
UJXGDLENXW 
LFACLNPLLY 
FTSSSITYDIi 



41 

1 

BFDGRMLLRV 
ALTYIPKGAF 
XiHSliRBUniD 
RZHSLGKRCF 
P8LZTZHFYD 
SSLPQTVQIQ 
R8Un«AHNKZ 
SSENFPBLKV 
DXjEDFZjLDFB 
TVPRSPLYZS 
GPLSIFASBS 
LGGSKYGASP 
DCS MVKHZAL 
ZLFNFHFXED 
PPSSVPSPAY 



51 




1 

DCSDLGLSEL 


60 


TGLY8LKVLM 


120 


DNALTBZFVQ 


180 


DGLHSLETLD 


240 


NPZQFVGR8A 


300 


LFNLQVLDLS 


360 


AZIHPNAF8T 


420 


XEMPYAYQCC 


480 


EDLKALHSVQ 


540 


PZKLLZ6VZA 


600 


SVFLLTLAAL 


660 


LCLPLPFGBP 


720 


UiPTNClLNC 


780 


LVSLRKQfTYV 


840 


PVTESOlIiSS 


900 



939 



wo 03/042661 



PCT/US02/36810 



Taife 76A depWs Seq ID No; Ui^endft IWgBnBTWe; 
Table TBAbDnked by SeqID No to Table 7B. 



SeqlONo: 

Ptey: 

ExAocn: 

IMoenelO: 

UnlgensTItte 

PredSubceOLoc: 



Sequence ID No for sequences In table 

Unique Eos probesel idenfc number 

Exernplar Accession nuinber. Gotbank accession number 

Unigene number 



Pre^ded sutvceOularlocanzaUon 



SeqlONo 

Seql01&2 

SeqlD3S4 

SeqlD5&6 

SeqlD7&8 

SeqlD9&10 

SeqtD11&12 

SeqlD13&14 

SeqlD15&16 

SeqlD17&18 

SeqlD19&20 

Seql021&22 

Seql023&24 

8eqlD25&26 

SeqID27&28 

SeqiD29&30 

Seql031&32 

SeqlD33&34 

SeqlD35&36 

Seql037&38 

SeqlO39&40 

Seq[D41&42 

8eqlD43&44 

SeqtD45&46 

SeqI047&48 

SeqlD49&50 

Seql051&52 

Seql053&54 

SeqlDS5&56 

SeqlD57&58 

SeqlD59&60 

SeqlD61&62 

Seq{063&64 

SeqID65&66 

SeqlD67&68 

SeqID69&70 

Seqt071&72 

Seql073&74 

SeqlD75&76 

SeqlD77&78 

SeqlO79&80 

SeqlD81&82 

SeqlD83&84 

8eqlD85&86 

Seql087&88 

SeqtD89&90 

SeqlD91&92 

Seql093&94 

Seqt095&96 

8eql097&98 

SeqIO99&100 

SeqtO101&102 

SeqID103&104 

Seq(O105&10€ 

Seq(D107&108 

SeqiD109&110 

Seql0111&112 

SeqlD113&1U 

SeqtD115&116 

Seqt0117&118 

SeqlO119&120 

Seq 10 121 & 122 

SeqlD123&124 

SeqlD12S&126 

Seq ID 127 & 128 

SeqlO129&130 

Seq ID 131 & 132 

Seql0133&134 

Seq ID 135 & 136 

SeqlD137&138 



"PRiy ESccn UnigenelO unigene Title 



Pred Subceil Loc 



425023 
424503 
429423 
400289 
419172 
418007 
418007 
407836 
414577 
418738 
428368 
415138 
429276 
416994 
407975 
429113 
418506 
423961 
414812 
417433 
417433 
424399 
422867 
428227 
444381 
439569 
411558 
400303 
411789 
428698 
450098 
421552 
452747 
415539 
416636 
416636 
409079 
442082 
400297 
451398 
429220 
421524 
423242 
423242 
423242 
452190 
452190 
325372 
450375 
426215 
425247 
429353 
429353 
432201 
427585 
448163 
442117 
428179 
431211 
447033 
447033 
447033 
115522 
452679 
446051 
422048 
410418 
446342 
422260 AA315993 



AW%6889 
NH,002205 
A1016712 
X07820 
AW33862S 
M13509 
M13S09 
T79340 
AI056548 
AW388633 
BE440042 
C18356 
AF05608S 
AA296520 
X89426 
028235 
AA084248 
D13666 
X72755 
BE270266 
BE270266 
AI905687 
L32137 
AA321649 
BE387335 
AW602166 
AA102670 
AA242758 
AF246505 
AA852773 
W27249 
AF026692 
BE153855 
AI733881 
N32S36 
N32536 
W87707 
R41823 
A)127076 
AI793124 
AW207208 
AA312082 
AL039402 
AL039402 
ALA394Q2 
H2673S 
H26735 

AA009647 
AW983419 
NM.005940 
AL117406 
AL117406 
AI538613 
031152 
AA026880 
AW654984 
A1127772 
M86849 
A13S7412 
AI357412 
A1357412 
BE614387 
242387 
6E048061 
NM_012445 
031382 



Hs.154210 

Hs.149609 

Hs^87797 

H3^58 

Hs^120 

H$^169 

Hs.83ie9 

Hsi00272 

Hs.72116 

Hs.6682 

Hs.83326 

Hsl95944 

K8.198612 

Hs.69546 

Hs.41716 

Hs.196384 

KS35339 

Ks.136348 

Hs.77367 

Hs.82128 

Hs.62128 

Hs.1584 

Hs.2248 

Hs^13 

Ks.222399 

Ks.70725 

Ks.79136 

Hs.72157 

Hs.334836 

Hs.8109 

H3.105700 

Hs.61460 

Hs.72472 

Hs.42645 

K5.42645 

Hs.82065 

Ks.7413 

Kb^6201 

K8.144479 

H3.105445 
Hs.125783 
Hs.125783 
Hs.125783 
Hs.91668 
H8J1666 



Ks.155223 

Hs.155324 

Hs.335891 

H3.335891 

Ks^241 

Hs.179729 

Hs^252 

Hs.128899 

H&279696 

H3^23733 

Hs.157601 

Hs.157601 

Hs.157601 

Hs.333893 

Hs.e3883 

H&37054 

Hs.268126 

Hs33325 

Hs.14846 

Hs.105484 



endothelial dlRerentlaOon, sphingoOp) 
integrin, alpha 5 (iibnneclin rec^, 
bitegiln, beta 1 (fibranedln receptor, 
matrix metallopiDtetnsse 10 (stromelysln 
ESTs; similar to TRANSMEMBRANE 4 SUPERF 
matrix matalloproleinase 1 (interstitiat 
matrix metaOoprotelnBse 1 (inieistillal 
B-cell CLUIymphoma 6, menter B. zinc fi 
hypotheQca) protein FU20992 similar (o 
sotute canter fantily 7, (caGonlc amino 
matrix metaOoprotelnase 3 (stromdysin 
Qssue tector pafiwvay Inhibitor 2 
G protein-coupled receptor 51 
seledln E (endothelial adhesion molecul 
endolherial celi-specilic molecuie 1 
pnostaglamfin^doperoxide synthase 2 (p 
G protein-ooupled rsceptor 39 
periosOn (OSF-2o8} 
monokine induced tiy gamma Interferon 
5T4 oncofetal trophoblast glycoprotein 
5T4 oncofetal trophoblast glycoprotein 
AI905687:IL-BT095>190199>019 BT095HomD 
carOage oilgomerfe matrix protein (ipse 
small inducible cytokine subfamily B (Cy 
ESTs. Weakly similar to S64054 hypotheO 
CEGP1 protein 

gamma-aminobutyric add (GA6A) A reoepto 
UV-I protein, estrogen regulated 
AdDcan 

K1AA1866 protein 
hypolheticai protein FU21060 
secreted frizzled^lated protein 4 
tg superfamily receptor LNIR 
BMP-RIB 

solute canter femOy 16 (monocarixMcytic 
solute canter family 16 (monocarbo)^c 
InterieuMn 6 signal transducer (gplSO, 
ESTs 

hypothetical protein DKFZp56401278 
ESTs 

ESTs 

GDNF fiantily receptor alpha 1 
0EME4 protein 
DEM&6 protein 
DEME^proteh 

Honno sapiens done PP1498 unienown mRNA 
Homo sapiens done PP1498 unlmown mRNA 
Phase2&3Exons 

a dlslntegrin and metritopfote ln ase doma 
stanniocaldn 2 

matrix metaltoprotdnase 1 1 (stromelydn 
ATP-binding cassette transporter MF(P8 
ATP-blndlng cassette transporter MRP8 
Transmemtjrane proteasOt serine 3 
collagen, type X alpha 1 (Schnvd metaph 
prolacGn receptor 

ESTs: hypotheOcat protein for IMAGE:447 

senmi/glucocarficoid regulated kinase^ 

gap junction protein, beta 2. 26kD (com 

ESTs 

ESTs 

ESTs 

o-Myc target JP01 

transmembrane, prostata androgen Induced 
ephrin-A3 

spondin 2, extracdUar matrix protein 



l^asma 
secreted 



Intracell 



secreted 
secreted 

plasma memtirane 
plasma membrane 
secreted 
intraceS 



plasma rneml^rane 
plasma membrane 



plssmd memtxane 
plasma membrane 



intracell 



l^asma meirtirBne 



plasma irembrane 
l^ma membrane 
plasma membrane 



intracell 

plasma membrane 



plasma menftrane 

Intracell 

intracell 

intracdl 

Intracdl 

cytoplasmic 

plasma membrane 



secreted 



plasnra membrane 



plasma merhbrane 
iriasma membrane 

intracell 

plasma membrane 
secreted 



intraceQ 

fdasma membrane 
plasma membrane 



Hbmo sapiens mRNA; cONA DKFZpS64D016 {ft 
legeneraSiQ gene t)^ IV 

940 



plasma membrane 
plasma membrane 
secreted 
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SeqK}139ft140 


409041 


A6033aK 


Hs^l 


HypoUietical protein. XP .051860 (KIAA1 1 9 


secreted 


Seqi0141&142 


409041 


AB0330S 


Ks.50081 


HypoUieScal protein. XP_051860(KIAA119 


secreted 


SeqD143&t44 


111929 


AF027208 


Hs.1 12360 


pronM (mouseHite 1 


plasma membrane 


SeqlD145&146 


111929 


AF027a08 


Hs.112360 


prondnin (mouse)-Dte 1 


plasma membrarw 
plasma membrane 


SeqtD147&148 


111929 


AR)27208 


Hs.112360 


prendnin (mouse}^ 1 


8eq]D149&150 


104688 


AW939591 


Hs^940 


mudn 13, epittieSd Iransnienifareiie 


plasma men^jrane 


SeqlD151&1S2 


420159 


AI572490 


Hs.99785 


Homo sapiens cDNA: FIJ21245 fis. done C 


plasma membrane 
plasma membrarte 


SeqlD153&154 


422330 


D30783 


Hs.115263 


epIreguSn 


SeqID155&155 


452461 


N78223 


Hs.108106 


transcripQon factor 


intraceO 


SeqiD157&l58 


413324 


V00571 


Hs.75294 


oortiootropin releasinQ hormone 


secreted 


SeqlD1S9&160 


412420 


AL03S668 


Hs,73853 


bone (Ttorphogeneflc protein 2 


secreted 


8eq(0161&162 


416658 


U03272 


Hs.79432 


fMn 2 (congenital contractural era 


secreted 


Seqt0163&164 


407811 


AW190902 


Hs.40098 


cysteine knot superfan^ 1. BMP antagon 


secreted 


SeqlD165 


402230 






Finest! predicted: CYTOCHROME P450 4F5 ( 


Intracefl 


Seq[D166&167 


402230 






Fgenesh predicted: CYTOCHROME P450 4FS ( 


intracell 


Seq (0168(169 


432829 


W60377 


Hs.57772 


ESTs 


bdracell 


SeqlO170&in 


425721 


AC002115 


Hs.159309 


uroplaldnIA 


plasma menrtvane 


SeqI0172&173 


420370 


Y13845 


Ks.97234 


uropiaidn2 


plasma membrane 


Seq{D174&175 


437652 


BEooiese 


Hs.2S68g7 


EST8,Wea]dy^IartodJ365012.1 [Ksa 


(dasma memljrano 


SeqlD176&177 


402075 






ENSP00000251056*:Plasnia membiaie caldun) 


secreted 


SeqlD178&179 


421110 


AJ2S0717 


Hs.1355 


calhepsin E 


secreted 


SeqtD180&181 


451666 


Z43948 


Hs.326444 


caiinage acidic protdn 1 


plasma membrane 


SeqID162&183 


451668 


Z43948 


Hs^26444 


cardage addic protein 1 


secreted 


SeqlD184&185 


451668 


Z43946 


Hs^26444 


cartilage acidic protein 1 
interteuKin6 


Intracell 


SeqlD186&187 


408243 


Y00787 


Hs.624 


secreted 


8eql0168&189 


422282 


AF019225 


Hs.114309 


spo&popiotdn L 


secr^ed 


8eqlD190&191 


425852 


AK001S04 


Hs.159651 


death receptor 6. TNF superfamlly nember 


plasma men^ane 


Seq(D192&ig3 


439738 


BE246502 


Hs.9598 


seme domain, tmiTiiJnogl^lln domain (Ig), 


plasma membrane 


SeqlD194&ig5 


404875 






NM.022819';Homo sapiens phospholipase A2 


intraceK 


SeqtD198&197 


425883 


AL13r708 


Hs.161031 


Homo sapiens mRNA; cONA DKFZp434K0322 (f 


plasma memt}rane 


SeqlD198&199 


404977 






InsuOn^ike growth tactor 2 (somatomedi 


secreted 


SeqID200 &201 


420876 


AA918425 


Hs.177744 


ESTs 


(^asma memtKane 


SeqID202 &203 


427747 


AW4il425 


Hs.1 80655 


serine/threonine Idnase 12 


intracell 


SeqlD204 &205 


420281 


AI623693 


Ks.323494 


Piecficted cation efflux pump 


plasma memt)Fane 


SeqlD206 &207 


446573 


NigL016361 


Hs.15871 


IPAP for lysophosphatidic add phosphata 


IntraceD 


Seq(D208 


437553 


AI82g935 


Hs.130497 


ESTs, Weakly skniiar to MAT8_HUMAN CHLOR 


plasma membrane 


Seq{O209 &210 


437553 


AI829935 


Hs.130497 


ESTs. Weddy similar to MAT8_KUMAN CHU}R 


plasma memt)rane 


S6qt021U212 


437553 


AI829935 


Hs.130497 


ESTs. WeaUy sbnaar to MATS JtUMAN CHLOR 


plasma nieml)rane 


Seq(D213&214 


426900 


AW163564 


Hs.142375 


ESTs 


^asma membrsne 


Seq(D215&216 


426900 


AW163564 


Ks.142375 


ESTs 


plasma membrane 


S6qlD217&218 


426900 


AW163564 


Hs.142375 


ESTs 


plasma membrane 


SeqlD219&220 


405932 






C15000305:g!I3e08122jgt}|AAC69198.1| (AFO 


Intracell 


8eqtD221&222 


405932 






C15000305:gi|3B08122|gblAAC6918ai| (AFO 




SeqlD223 &224 


424008 


R02740 


Hs.137555 


putative diemoUne receptor; GTP-Uhdlng 


plasma membrane 


S6qlD225 &226 


424008 


R02740 


Hs.137555 


putative chemoMne receptor; GTP'^^ng 


plasma memtrane 


SeqtD 227 &228 


424008 


R02740 


Hs.137555 


putative chemoktne receptor; GTP-t)inding 


plasma membrane 


Seq 10229 &230 


444342 


NM-014398 HS.108&7 


similar to lysosome-assodated membrane 


secreted 


8eqlD231&232 


421379 


Y1S221 


H9.103982 


smaD Inducfiile cytokine subfamily B (Cy 


secreted 


Seql0233&234 


417079 


U65590 


Hs.81134 


interteiAin 1 receptor antagonist 


secreted 


SeqlD235 &236 


412986 


X81120 


Hs.75110 


cannablndd receptor 1 (Ja^n) 


plasma membrane 


SeqlD237 &238 


412986 


X81120 


Hs.75110 


cannabtnoid receptor 1 (brain) 


plasma membrane 


Se4lD239 &240 


412986 


mm 


Hs.75110 


cannabinoid receptor 1 (brain) 


plasma membrane 
tabaceB 


Seq ID 241 & 242 


447072 


D61S94 


Hs.17279 


tyrosy^proldn sulfbtrarsfBrase 1 


8eql0243&244 


419723 


AL120193 


H3.339810 


longeW^ assurance (LAG1 , 8. cerevisiaa 




Seql0245&246 


419723 


AL120193 


K3.339810 


bngevi^ assuranoe (lAGI, S. cerevisiaa 


plasma mentf)rano 


Seq{D247&248 


430890 


X54232 


Hs.2699 


glyplcani 


plasrna mernbrano 


SeqtD249&2S0 


456759 


BE259150 


Hs.127792 


deUa (OrosophBa)-lik0 3 


plasma membrane 


Seq ID 251 


429466 


M8Se35 


Hs.12827 


ESTs 


Seq ID 252 


429486 


M85835 


H3.12827 


ESTs 




SeqlD253 &254 


419721 


NM.Q01650 Hs^88650 


8quaporin4 


plasma tnembiane 


SeqtD255 &256 


407034 


U84540 




gt):Human dystrobre\rin isofbrm DTN^ (DTN 


8eqlD257&2S8 


413472 


BE242870 


H5.75379 


solute canier fanily 1 (glid high affi 


plasma li^^^^tffi^T^ff 


Seq 10 259 &260 


438380 


T06430 


K8.6194 


chondroltin suSato proteoglycan BEHAB/b 


secreted & plasma i 


Seq 10 261 & 262 


426271 


AF026547 


Hs.169047 


chondrolSn sulfate proteoglycan 3 (neur 


secreted * 


Seq ID 263 &264 


419704 


AA429104 


Hs.45057 


ESTs 


fntracell 


Seq 10 26S&266 


444471 


AB020684 


Hs.1 1217 


KIAA0877 protein 


plasma mem^ane 


Seql0267&268 


409395 


U46745 


Hs.336678 


dystiobrevin, sdpha 


secreted 


SeqlD269 &270 


413063 


AIJ035737 


H3.76184 


cfdtinase 3-^ 1 (cartilage glyooprote 


secreted 


Seq ID 271 & 272 


433600 


AI034361 


Hs.135150 


tung type-l cell memtvane-^sociated gly 


plasma rnentbrane 


Seq ID 273 


456435 


AI418718 


H3.144121 


ESTs, Weakly similar to T4691 6 hypotheO 




Seq ID 274 


458435 


AI418718 


Hs.144121 


ESTs. weakly sin^ to T46916 hypotheS 




Seq(0275&276 


424343 


AW956360 


Hs.4748 


adenylate cydase acSv^g polypepfide 


(dasma manbrane 


8eqlD277 &278 


424998 


U56515 


Ks.154138 


chilinase 3<(ike 2 


secreted 


SeqlD279 &280 


412709 


AL022327 


Hs.74518 


KIAA0027 protein 


plasma memtvane 


Seql0281&2S2 


435615 


Y15065 


Hs.4975 


potassium voltag&^ated channel, KQT-fik 


|dasma membrane 


SeqD283 &284 


404049 






NM.016937*:HomD sapiens protocadherfn be 


plasma membrane 


Seq(D28S&286 


416932 


134059 


Hs.89484 


cadiierin 4, type 1, R-cadtierin (refin^ 


plasma membrane 
plasma membrane 


SeqlD287 &288 


404029 






NM.01 8938':Homo sapiens protocadheibi be 


8eqiO289 &290 


436480 


AJ271643 


Hs.87469 


putathre add>sensing ion channel 


intracell 


SeqtD291&292 


452401 


NlilL00711S H&29352 


tomor necrosis factoft alpha-induced pro 


secreted 


8eqt0293&294 


452401 


NM 007115 Hs^352 


tumor necrosis fBctoft flIphaMuoed pro 


secreted 


SeqlD295&2g6 


436895 


AF037335 


Hs^ 


carbonic anhydrase XII 


plasma membrane 
plasma membrane 


8eqiD297 &2S8 


421471 


U9054S 


Hs^7179 


solute canier farnOy 17 (sodhsn phospha 



941 
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SeqrO29g&300 


428296 


NNL003058 Hs.183572 


solute canierfandly 22 (organic cafion 


plasma memtirsrte 


Seq(D301&302 


423508 


AW604297 


Hs.129711 


hepalltis A vinis cellutar receptor 1 


pissma memtxBfie 


SeqtD303 &304 


450001 


NNL001044 


Hs.406 


solute canter famDy 6 (neurotransnvQe 


plasma metrdirane 


SeqID305&306 


410407 


X66839 


H3.63287 


carbonic anhydrase IX 


plasma msmtjrano 


SeqlD307&308 


453496 


AA442103 


Ks^084 


sfitute canier fan^ 2 (faciKtated gbi 


plasma membrarte 


SeqID309 &310 


420737 


IJ08096 


Hs.998g9 


CD70; tutnornecrasis factor (ligand)s 


plasma m^nbrsns 


SeqI0311&312 


309931 


AW341683 




gb:hd13d01.x1 Soares.NFIJT.GBC.SI Homos 


plasma membrene 


SeqlD313&314 


412719 


AW016610 


Hs.816 


ESTs 


intraceU 


SeqID 315 & 316 


417034 


NM.006183 


Ks.80962 


neurotensin 


secT'^sd 


Seql0317&318 


430486 


BE062109 


Hs.241551 


chiGrido channeli cahAiin acfivsted, fam 


plasma mandvans 


SeqID 319 & 320 


413753 


U17760 


HS.75S17 


laaMiK beta 3 (nio^ (12SkD). kaDnbi 


secreted 


Seq ID 321 & 322 


425650 


NIUL001944 K5.1925 


desmo^dn 3 ^jemphlgus vulgaris anllQsn 


plasma membrane 


SeqlD323&324 


423673 


BE003054 


Hs.1695 


matrix metalloprotdnase 12 (macroph^a 


secreted 


Seq ID 32S & 326 


418663 


AK001100 


Hs.41690 


ddsmocol(in3 


plasma rnemtirane 


SeqlD327 &328 


418663 


AK001100 


Hs.41690 


desniocoliln 3 


plasma mentljrane 


Soq ID 329 & 330 


429610 


AB024937 


Hs.211092 


LUNX protein; PLUNC (patate hii« and nas 


Seq ID 331 & 332 


406690 


M29540 


H$i20529 


c&rcldoenibiyonte flntiQetwBlflted cell ad 


plflsma mwrtbrene 


SeqlD333&334 


431846 


BE019924 


Hs^1580 


uropiakin IB 


plasma membrane 


Saa ID 335 & 336 

14^ www Ql WWU 


422158 


L10343 


Hs.1 12341 


protease inhitiltor 3, sldn-deri\^ {SKAL 


secreted 


Sea(D337&338 

ti/ wwf ul www 


431958 


X63629 


Hs.2877 


cadhertn 3, type 1i P-cadherin ^llacenta 


plHwn3 memt}r8ne 


SeqlD339 &340 


437044 


AL035884 


Hs.69517 


diRersniially expressed in Fsnoonfs an 


plasms memtyHne 


Sea ID 341 & 342 


428484 


AF104032 


Hs.184601 


gpltrtQ canter family 7 (caQonIc anAio 


plasma membrane 


Seo ID 343 & 344 


429211 


AF052693 


Ks.ig8249 


98p jundbn protein, beta 5 (connexln 3 


plji^i^Q membrane 


SeatD345 &346 


417389 


BE260964 


H5.82045 


mUl^e (neuiite 9rowth*|Nomoting (actor 


secreted 


Seq)D347 &348 


431009 


BE149762 


HS.489S6 


gap Junciiofl proton, beta 6 (conrtexln 3 


plasma membrane 


SeqID 349 &350 


417542 


J041^ 


Ks.82269 


pioQestagen^assoctaled endometnai prote 


secreted 


Seq ID 351 & 352 


449230 


BE613348 


Hs^11579 


melanoma ceO adhesion molecule 


plasma memt)rane 


Seq ID 353 & 354 


410555 


U92649 


Hs.64311 


a dislntegrtn and metdloprotetnase dMia 


plasms membrane 


Seq ID 355 & 356 


410555 


U92649 


Hs.64311 


a dlsintegrin and metaOoproteinase doma 


plasma membfsne 


Seq ID 357 & 358 


424687 


JO5070 


Hs.151738 


matrix metaOoproteinase 9 (getaQnase B 


secreted 


Seq ID 359 & 360 


418462 


BE001596 


Hs.85266 


inte^, beta 4 


plasma memt)r8ne 


Seq ID 361 &382 


410274 


AA381807 


Hs.61762 


li^)oxia-lnductble prot^ 2 


secreted 


S6qlD363 &364 


439606 


W79123 


HS.S8561 


G prote!nKX}U{ded receptor 87 


plasma membrane 


SeqlD36S&366 


404877 






NM_005385:Homo salens melanoma anSgen, 


Intraoell 


SeqlD367 &36B 


444781 


NM»014400 Hs.11950 


GPI-anchored metastasfe-assocteted prote 


plasma membrane 


SeqlD369 &370 


109424 


NM.005329 Hs.85962 


hyaluronan synthase 3 


plasma membrane 


Seq ID 371 & 372 


415817 


U68967 


Hs.78867 


protein ^^rosine phosjdiatasei rsceptor~t 


plasma membrene 


Seq ID 373 & 374 


415817 


U88967 


HS.788S7 


piobdn tyiosfais phosphatsssi veoBptof't 


plasma membrane 


Seq ID 375 & 376 


415617 


U88987 


Hs.78867 


protein ^rostne ptiosphatesOt foc6ptor*( 


plasma mombrano 


Sea ID 377 & 378 


415817 


U88g67 


Hs.78867 


protein tyrosine phosphatase, receptor'4 


plasma membrane 


Seq ID 379 & 380 


415817 


088967 


Hs.78867 


protsn tyrosine ptiosphatase, receptor-t 


plasms msmt^sne 


Sea ID 381 & 382 


415817 


U88967 


H5.78867 


protein tyrosine phosphatasOi rec^(M''t 


plasma membrsne 




421817 


AF146074 


Hs.108660 


ATP-Mnding cassfittfl^ sub-fianiiiy C ((3*TR 


pteramembrane 


SealD385&386 

Wv*^ wvw w www 


418678 


NWL001327 H5.e7225 


cancerftes& antigen (NY-GS0>1) 




Seq ID 387 & 388 


418678 


NM.001327 Hs^7225 


cancerAes& anfigen (NY*CS(^1) 


intracell 


Sea ID 389 & 390 


409420 


Z15008 


Hs,54451 


taminin, gamma 2 (niodn (10(HcD), IcaDni 


sGcteted 


Sea ID 391 & 392 

%^wi| iw w9 1 O wwA 


332180 


AF134160 


Hs.7327 


daudin 1 


plasma memtxsnB 


Sea ID 3^ & 394 

Ovt| w9v vl Wirt 


408790 


AW5802Z7 


HS.47B60 


neurotaoplilc ^fiosAne Idnase, recep^ori 




Sea ID 395 & 398 


408790 


AW580227 


H5.47860 


neurotropttto ^po^ne kinasei f^ec^ptOTi 


plasma mc^nbraiw 




439223 


AW238299 


Hs.250618 


in 16 hindina nmlidfi 2 


plasma membfano 


Sea ID 399 & 400 

Www Iv www ul HVw 


409757 


Nh^001898 Hs.123114 


cvfitatin SN 


secreted 


Seo ID 401 &402 


4289^ 


AF120274 


HS.1946B9 


artemin 


^creted 


SealD403 &404 


428969 


AF120274 


Hs.194689 


arlemin 


secreted 


Seq ID 405 & 406 


428969 


AF120274 


Hs.194669 


artemin 


secreted 


Sea ID 407 & 408 

IV *fVf W "WW 


428969 


AF120274 


Hs.194689 


artemin 


secreted 


SeqID 409 & 410 


450701 


H39960 


HS.28&467 


hypotheflca) protein XP_0%151 (leucine 


secreted 


Seq ID 411 &412 


450701 


H39960 


Hs.288467 


hypoU^fical protein XPJ)98151 (leudne- 


secreted 


Seq ID 413 & 414 


414774 


X02419 


K5.77274 


]!4asniinoQ6n acSvstoft urokbutso 


secreted 


Seq ID 415 & 416 


407944 


R34008 


HS.2397Z7 


desmocoflin 2 


plasma membrane 


Sea ID 417 ft 418 


407944 


R34008 


Hs.239727 


desmocolfin 2 


pfasoia membrane 


Seq ID 419 & 420 


428486 


AW583497 


Ks.184604 


p8r)crsa6c poj^fpcpflde 


secreted 


Seq ID 421 & 422 


457489 


AI693815 


Hs.127179 


cryptic gene 


secreted 


Seq ID 423 & 424 


432874 


W94322 


Hs.279651 


rrfilanoma ii^nbitofy acBvi^ 


secreted 


Seq ID 425 & 426 


445891 


AW391342 


Hs.199460 


DPCR1 protein 


plasma membrane 


Seq ID 427 & 428 


445691 


AW391342 


Hs.199460 


DPCR1 protein 


plasma membrane 


SealD429 &430 

VW«| *TfcW «• "fwv 


404682 






C9001188*:gil12738842|reflNPj073725.1| p 




Sea ID 431 &432 


429547 


AW009166 


Hs.99376 


ESTs 


secreted 


Sea ID 433 & 434 


425921 


NM_007231 Hs.162211 


5(^ute canter family 6 (neurotransmitEs 


plasma mernbrans 


S6qlD43S&438 


407242 


M18728 




gbiHuman nonspecific cfossrBflctinQ antig 


plasma n^mbrarte 


&a ID 437 ft 438 


407242 


M18728 




^bcHuman nonspedSc tfuwsreacfli^ 8i4{g 


ptesms men^b^sno 


SeqlD439 &440 


407242 


M1872B 




glKHuman nonspeci&s crossrsacCng antig 


(riasma memlirane 


SeqlD441&442 


432596 


AJ224741 


Hs,278461 


malrflin3 


secreted 


SeqtD443 &444 


444006 


BE395085 


Hs.10086 


type 1 transmembrane protein Fn14 


plasma membrane 


SeqiD445&446 


423685 


6E350494 


Hs.49763 


lived autoanfigen with cofled co9 domd 


iniiaoell 


SeqlD447&448 


428392 


H1Q233 


Hs.2265 


secvstoiy granule^ neuroendocftne prold 


secreted 


SeqlO449&450 


429597 


NM_003816 H$^442 


a dlsintegrin and mstdloprotBinase doroa 


(toma membrane 


SeqlD451&452 


448030 


N30714 


Hs^5960 


memt>r8ne<5panning A-domska, 8ubfBn% A 


plasma membrane 


SeqlD453 &454 


422109 


S73265 


H5.1473 


gastTlrvreleasing pepOde 


secreted 


8eql0455&456 


419235 


AW470411 


Hs.288433 


neurotrirnbi 


plasma membrane 


SeqlD457 &458 


449048 


Z45051 


Hs.22920 


simSar to S68401 (ealQs) gtooose indue 


plasma membrane 


8eqiD459 &460 


427333 


AF067797 


Hs.176656 


aquspodnS 


pbsma memlirane 


SeqlD461&462 


417931 


W95642 


Hs.82961 


beloS toctor 3 (intesSnsQ 




SeqlD463 &464 


419216 


AU076718 


Hs.164021 


small faiduclbie cytokine sulibniOy B (Cy 


secreted 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



Seqt0465&466 
SeqtD467 &468 
Seqn)469 &470 
SeqlD471&472 
SeqtD473 &474 
SeqlD475 &476 
SeqlD477 &478 
SeqlO479 &4e0 
SeqID4B1&482 
SeqlD483 &4B4 
Seql048S&486 
SeqtD487 «488 
SeqiD489 &490 
SeqiD491&492 
Seql0493&494 
SeqID495 &498 
SeqlD497 &498 
Seq(D499&500 
SeqIDSOI&SOa 
SeqlO503 &504 
SeqlD505 &506 
SeqlD507 &508 
SeqlD509 &510 
Seq 10511 &512 
SeqIDSia 
Seq iD 514 
Seq ID 515 & 516 
Seq ID 517 & 516 
Seq ID 519 & 520 
Seq ID 521 & 522 
SeqlD523&S24 
SeqlD52S&526 
SeqlD527&528 
Seq ID 529 & 530 
Seqi0631&532 
Seq ID 533 & 534 
SeqlD535 &538 
SeqlD537&S38 
SeqlO539 &540 
SeqlD541&542 
S6qlD543&&44 
SeqlD545&546 
SeqlD547&548 
SeqlD549 &550 
SeqlD551&552 
SeqlD553 &554 
SdqlD555 &556 
SeqiD557 &558 
SeqlDS59&560 
Seq 10 561 & 562 
SeqlD563&S64 
Seq!D565 &566 
SeqlD567 &568 
SeqlD569&570 
SeqlD571&572 
Seq 105734574 
SeqlD575 &576 
SeqlD577 &578 
8eqlD579&580 
Seq ID 581 & 582 
SeqlD583 &564 
SeqlD585 &586 
SeqlD587 &588 
SeqID5B9&590 
Seql0691&592 
Seql0593&S94 
SeqtD595 &596 
SeqlD597&5S8 
SeqIOS99&600 
8eqID601&602 
SeqiD603 &604 
SeqlD605 &60S 
SeqID607&608 
SeqIO609 &610 
Seq 10611 &612 
Seq ID 613 & 614 
Seq ID 615 & 616 
Seq ID 617 & 618 
SeqtO619&820 
8eqt0621&622 
8eql0623&624 
8eql0625&626 
8eqtD627&628 



431629 

413554 

452194 

452194 

426322 

429010 

431462 

448243 

426427 

428187 

428579 

428579 

428579 

428579 

445537 

422278 

424620 

406400 

431130 

420440 

428450 

446619 

436982 

453392 

431989 

439820 

409178 

426514 

421478 

421478 

425775 

425776 

452097 

416530 

431515 

419452 

412078 

412078 

411773 

407792 

431616 

452792 

400294 

432653 

432653 

432653 

432653 

400290 

410001 

418398 

451027 

446057 

433466 

453370 

453370 

414569 

413435 

426501 



AU077025 

AA319146 

AI694413 

A1694413 

J05068 

Y18198 

AW5S3872 

AW363771 

K^6699 

AI687303 

NM.005756 

NM.00S756 

NNL005756 

/U245671 

AF072873 

AA101043 

NM.006103 

NM_002407 

NW.014791 

AU076643 

AB018305 

U23752 

AW972870 

AL360204 

BE393948 

BE616633 

AI683243 

AI883243 

U25128 

U25128 

AB002364 

U62B01 

NM.012152 

U33635 



408369 
412828 
403047 
403047 
408430 
445413 
451982 
427958 
421887 
425071 
432101 
407786 
416836 
416539 
131083 
131083 
131083 
131083 
422424 
428970 



439018 
420610 
425723 



NMj0067g9 

AI077715 

AA508552 

AB037765 

N95796 

N62096 

N62096 

N62096 

N62096 

H18836 

AB041036 

AI765805 

AW519204 

AI420227 

AA508353 

AI470523 

AI470523 

AF109298 

X51405 

AW043782 

AF179274 

R38438 



S79876 
AA151342 
F13036 
AA4180ro 
AW161450 
NM.013989 
AI9169S0 
AA687538 
054745 
Y07909 
Y09763 
Y09763 
Y09763 
Y09763 
AI186431 
8E276891 
122524 
AW300887 
AI683163 
NM.014420 



H5^65827 Interfenm. elpha^ndudfala protein (cb 

Hs.75426 &8crBtooraninll(chromogTanInC) 

Hs.332649 Ul]!qdtii>4ifQ9 piotein FAT1077? 

Hs332649 lAiquMQce prolan FAT1077? 

H&2012 transoobalamtnl(vQatnInB12bini£ngpr 

Hs,19472S one cut domain, fandymsntber 2 

Hsu%831 1 gninin4ikB neiuoendoortne pepUdo procu 

Hs.52620 Integrin, l)eta 8 

Hs.169840 TTK protein idnase 

Hs^B5529 G protevKOUpIed receptor 49 

Hs.184942 Gproteiivooupled receptor 64 

KS.1B4942 G protebvcoupled receptor 64 

Hs.184942 GproteiTKCOUpIed receptor 64 

Hs.184942 G proteirKX)up!ed receptor 64 

Hs.12844 EGRike^ain. imdiipie 6 

H8.114218 frizzled (DrDsophna)homoiog 6 

Hs.151254 kaflikreln 7 (ctiym^iypSc, sbatum com 
kanSoein 8 (neurop^nitavasln) (KLHB) 

Hs.2719 HE4;epidldyinis«pedlic.vvh8y^acUicpr 

Hs.97644 mammag1obln2 

H3.184339 KIAA0175 gene product 

H5.313 secreted pliosphoproteln 1 (osteoponSn, 

Hs^78 spondin 1, (tepondin) extraceliular mat 

Hs.32964 SRY(8exdetenninIngfegtonYH)ax11 

Hs^1089 ESTs 

HsJ^83853 Horrosaf^ensmRNAMIIengthinsertcON 

Hs.50915 kalGkreinS 

Hs.170195 t)one morphogeneOc protein 7 (osteogenic 

Ks.97258 ESTs. Moderately similar to S29539 ribos 

Ks.97258 ESTs. Moderately similar to S29539 rtbos 

Hs. 1 59499 parsUryrold hormone receptor 2 

Hs.159499 parathyroid homione receptor 2 

Hb.27916 adlsintBgi(n-IikoandfflstaDoprotease( 

Hs.79361 ksO!lcreln6{neuiosln,^rme} 

HS.2S6583 endothelial diflerentialion, lysophoapha 

Hs.90572 PTK7 protein tynisine kinase 7 

Hs.73149 paired box gene 8 

Hs.73149 paired box gene 8 

Hs,72Q26 pnstoase, serine, 21 (tesQsin) 

Hs.39384 putathre secreted tigand homologous to f 
Hs.195639 ESTs. Weakly simitar 10 138022 hypotheli 
Hs.30652 K1AA1344 protein 
HS.27869S Homo sapiens prasiebimRNA,oonnplelBcds 
H5^3185 EST8.WeaklysimnartoJC7328amtnoaci 
K&293185 ESTs, Weakly simitar to JC7328 amino ad 
Hs.293185 ESTs, Weakly simHar to JC7328 amino a:i 
Hs.293185 ESTs. Weakly simitar to JC7328 amino ad 
Hs.31608 hypotheticai protein RJ20041 
Hs.57771 kaiKkrdnll 
Hs^6691 ESTs 
Hs.40806 ESTs 

Hs.149358 ESTs, Weakly sintilar to A46010 X^nked 
Hs.105314 rBlax]n1(H1} 

Hs.139336 ATP-binding cassette. sutHfamily C (CFTR 
Hs.139336 ATP-binding cassette. sutMamilyC(CFTR 
H5.1 18258 prostate cancer assodafed protein 1 
Hs.75360 carboxypeptkfaseE 
Hs.293616 ESTs 

H&2279\ transmembrane protein With EGF'like and 
HS.182S75 sotute canier farnily 15 (H??7 transport 
Hs^06051 hypothetteai protein MGC2648 

f^005655*:Homo sapiens transmembrane pr 
N1^005656*:Homo sapiens transmembrane pr 
Hs.44926 dipepCdyipeptidase IV (C026^ adenosine 
Hs.12677 OGI-147pretoln 

Hs.27373 Homo8a|^n8mRNA:cDNAOKFZp56401763<f 
Hs.98280 potasskimlntennedlate/small conductance 
Ks.109201 CX3i-88 protein 
Hs.1 54424 detodinase. iodothyronine, type ti 
Hs.123842 EphA3 
Hs38972 tetraspani 
Hs.80247 cholecystokinin 
Hs.79368 epHheHai membrane protein 1 
Hs. 22785 gamm&aminobutyric add (GABA) A reoepto 
Hs2278S ganima«nlR6butyricadd(GABA)Araoepto 
HS.2Z785 gamne^mlnobutyrk; add (GABA) A reoepto 
H5.22785 gamma-smbiobuiyric add (GABA) A reoepto 
Hs^96638 prostate ditferanOaOon factor 
Hs.194691 retinok: add Induced 3 
KS.22S6 matrb(inetaaoprotBinase7(mabt)y6in. 
Hs.26638 m8ntbrane<pannIng44omain8, subfamily A 
Hs.99348 dIstdHesshomeoboxS 
HS.1S9311 dIdd(opr(Xenopt8l3evis)homolog4 
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secreted 



plasma membrane 
plasma membrane 



inliacefi 



(riasma membrane 
intraceO 

plasma memt)rane 



plasma menttvane 
plasma memtjrane 
pla^na mendyane 
secreted 



secreted 
secreted 
secTBted 
secreted 
intraceU 
secreted 
secreted 
intracell 



plasma membrane 
ptasma membrane 
(dasma membrane 
plasma membrane 



ptasma membrane 
plasma membrane 
IntraceB 
inbaceq 
secreted 



plasma membrane 
plasma membrane 



plasma membrane 
plasma membrane 
ptasma membrane 
plasn^ membrane 
plasma memt)rane 
secreted 

(rfasma membrane 
l^asma men^rane 
plasma membrane 
secreted 

ptasma memt)rane 
plasma memlxane 
plasma memtirane 



plasma memtirane 
plasma membrane 
plasnra memtjrane 
secreted 

plasma membrane 
ptasma fnembrane 
plasma membrane 



Intraceli 

plasma memlirsno 
plasma membrane 
secreted 

plasma RiemtNane 
ptasma membrane 
secreted 

plasma membrane 
plasma memtxane 
plasma memlvane 
plasma membrane 
plasma memtjrane 



secreted 

plasns membrane 
intraoeD 
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SeqlD629& 630 456662 NlyLW2448 Hs.1434 msh (DrosophOa) homeo box homokn 1 (fo MraoeB 

Seq(D631&632 41B281 U09550 Hs.1154 ovkfudalglycoprotdn 1. 120kD(mtjdn9 secreted 

Seq!0633& 634 429903 AL134ig7 Hs,93597 (ycfliHtepemtent kinase 5, regutoysu Mreoel 

Table 76B: 

Pkey: Unique Eos piobeset idemifier nuihbcr 

CAT number. Gene duster number 

Accession: Genbank accession numbers 



Ptey CAT Number Aocesston 

424399 238981J AI905687 AI905624 AI905837 AJ9C5623 AA340069 R75793 W72B37 B8}74512 AI905633 W72838 BE092421 AI127172 BE186013 AW070916 

A1139456AW176044AW291950 
429220 3D1384J AW207208AW34t473AA448195Al951341 

450375 83327.1 AA009647 AA131254 AA374293 AW954405 K04410 AW606284 AA151166 BE157467 BEt57601 H04384 W46291 AWS63674 H04021 H01532 

AA190993 H03231 H59605 H01642AA8S2876AA113758 AA626915AA746952AI161014 AA09g554 R69067 

Table 76G 



Pkey: Uitk^e number correspondlno to an Eos probesel 

Ref: Sequence source. The 7 dlgll numbers In thte column are Genbank W8nffliar(GI)numbef8. 1)unhaml.eteL-rBferatoihepublicatk)nenfiQed^DNA 

sequence of human chromosome 2Z' Dunham Ida).. Nature (1999) 402:489495. 
Strand: IndteatesDNA strand from whtehexons were predicted. 

Ntjiosffion: Indicates nucleotide positions of predk;tedexons. 



Pkey 


Ref 


Strand 


NLposHton 


402075 


8117407 


Plus 


121907-122035,122804-122921,124019-124161,124455-124610,125672.126076 


402230 


9966312 


Minus 


29782-29932 


403047 


3540153 


Minus 


5979^-59968 


404029 


7671252 


Plus 


108716.111112 


404049 


36B8074 


Minus 


75765-78155 


404682 


9797231 


Minus 


40977^1150 


40487S 


9801324 


Phis 


96588-98732.97722-97831 


404877 


1519284 


Phis 


1095-2107 


404977 


3738341 


Mnus 


43081-43229 


405932 


7767812 


Minus 


123525-123713 


406400 


9256298 


Phis 


1553.1712.1878-2140,4252-4385,5922-6077 



944 



wo 03/042661 



PCT/US02/36810 



77 provU8sPkBy:Seq ID No; Disease ln(ft»{Ions;an^ SeqlDNoMBlh6inEormaBontnTabla77toTabte7& 

SeqIDNo: Saquenoe 10 No for sequences tntebb 

Pkey. Unique Eos probeset UenQQer number 

Disease IndlcaQons: Diseases donated for ooverage as described in Teblel 

Prefarrod Utffijy: Preference of ufi%, based partly upon prediclfid localizaGon (Ab is anfibody; sm Is small motecute targ^ CTL Is vaccine target) 



SeqlONo Disease Indicatnns 



SeqlD1&2 425023 
Seql03& 4 424503 
SeqIDS&G 429423 
SeqI07&8 400289 
SeqlD9&10 419172 
SeqlD11&12 41B007 
Seql013&14 418007 
SeqlD15&16 407838 
SeqlD17418 414577 
8eqlD19&20 418738 
Seql021&22 428388 
SeqiD23&24 415138 
SeqlD25&26 429276 
Seq(D27& 28 418994 
SeqlD29&30 407975 
SeqlD31&32 429113 
SeqlD33&34 418506 
SeqlD35&36 423961 
Seq!D37& 38 414812 
SeqlD39&40 417433 
SeqlD41442 417433 
SeqlD43& 44 424399 
SeqlD45& 46 422667 
8eql047& 48 426227 
SeqtD49&50 444381 
SeqlD51&52 439569 
SeqlD53& 54 411558 
Seql055& 56 400303 
SeqlD57&S8 411789 
SeqlD89&60 428698 
SeqlD61&62 450098 
SeqlD63&64 421552 
Seqt065&66 452747 
$eqIDG7&68 415539 
SeqID69ft70 416636 
SeqiD7U72 416636 
SeqlD73& 74 409079 
SeqlD75& 76 442082 
SeqlD77& 78 400297 
SeqlD79& 80 451398 
SeqID8U 82 429220 
Seql083& 84 421524 
SeqlD65& 66 423242 
SeqlD87& 68 423242 
SeqlD89& 80 423242 
Seql091&92 452190 
SeqlD93& 94 452190 
SeqlD9S&96 325372 
SeqlD97& 98 450375 
Seq(D99&100 426215 
Seq ID 101 & 102 425247 
Seq ID 103 & 104 429353 
Seq ID 105 & 106 429353 
Seq ID 107 & 108 432201 
Seq ID 109 & 110 427585 
Seq ID 11U 112 446163 
Seq ID113&114 442117 
Seq ID 115 & 116 428179 
Seq ID 117 & 118 431211 
Seq ID 1 19 & 120 447033 
Seq 10 121 & 122 447033 
Seq ID 123 & 124 447033 
Seq ID 125 & 126 115522 
Seq ID 127 & 128 452679 
SeqlD129&130 446051 
Seq ID 131 & 132 422048 
Seq ID 133 & 134 410418 
Seq ID 135 & 136 446342 
Seq ID 137 & 138 422260 
SeqlD139&140 409041 
Seq ID 141 & 142 409041 
SeqlD143&144 111929 
Seq ID 145 & 146 111929 
Seq ID 147& 148 111929 



pancreas, prastatei snoiogenesiSk Uadder* bu^ 
angiogenesis 

angiogenesis, bladder, tung. cervical, ovarian, head & neck 
angiogenesis, renal 

angiogenesis, lunSt bladder, fibrosISi head & neck, pancreas, stomach, coton, ovarian 
angiogenesis, king, bladder, fitMO^ head & neck, pancieas, stcxnach, colon^ ovarian 
angiogenesis 



angfogenesb, lung, ovarian, bladder & stomach, panoeas, uterine 

angiogenesis. heal & neck, stomach 
angiogenesis, pancreas, &tomac^ lung, uterine 
angiogenesis. bladder, gftoblastoma 
prostate, angfogene^ 
angfogenesis. renal 
an^ogenesis, titadder and stomach 

angiogenesis, ovarian, gTtoblastoma, uterine, hing, bladder, pancreas 

breast colon, bladder, tung, fibrosis, pancreas, head and neck, ovarian 

breast, bladder, lung. Fibrosis, pancreas, colon, head and neck, cenrical. stomach, renal, ovarian 

pancreas, bre^ bladder, tung. head & neck, cen/foal, uterine, ovarian, stomach, renal 

pancreas, breast bladder, kmg. head & neck, ceivicai, uterine, ovarian, stomach, renal 

breast, uterine, head & nE»k 

breast, ovarian, prostate, pancreas, king, cokm. uterine 

breast lung, bladder, ovariarn head & neck, fibrosis, ooton, sk^ 

breast octodt triadder, lung, pancreas, head & neck, ovarian, stomaiA, utsrlne, renat angfogenesis 

breast androgen wlSidravva! prestale, prostate, bladder 

pancreas, prostate, stomacli, breast uterine, oervfoat ovarian 

breast ovarian, pros, stomach, uterine, bladder lung head & neck 

pancreas, lung, breast stomach head & neck, ovarian, uterine 

breast oofon, lung, pancreas, stomach, head & nebk, ovarian 

breast king, stomach, uterine 

breast, ovarian, pancreas, cervfcsl, uterine, prostate, king, stomach, head & neck 

breast bladder, hmg, head & neck, ovarian, stomach, uterine, pancreas 

breast prostata, ovarian, oeivfcat uterine 

breast, pancreas, uterine 

breast, pancreas, uterine 

t}reast prostata 

breast prostate, ovarian 

breast bladder, colon, proslate 

breast ovarian 

breast prostate, benign prostatic hyperplasia 
breast 

breast renal, ovarian, prostate, colon 
breast renei, ovarian, prostate, ooton 
breast renal, ovarian, prostate, coton 
breast stomach, pancreas 
txeast sfomach, pancreas 
breast 

breast ovarian, head & neck, pancreas, king, coton 
breast kmg, renel, ooton, ovarian, uterine 

breast ovarian, lui^, oofon, pancreas, head & neck, stomach, titerine, oenricat bladder 



breast prostate 

breast cofon, lung chemo, ovarian, 8tomac^ pancreas, uterine, cenrical 
breast kng, head & neck, pancreas, stomach, cofon. ovarian, oenricai 
breast cervteat uterine 

breast hmg, tdadder, pancreas, tiead & neck, stomachi ovarian, prostate 



colon, bladder, fong, pancreas, head & neck 
cofon, prostate, fibrosis, breast hmg pancreas 
colon, prostate, fitxosis, tireast fong pancreas 
colon, prostate, fibrosis, breast king pancreas 
coton, kmg, bladdv, pancreas 
prostate, cofon, pancreas. Taxol ptostala 
cofon, breast 
coton. pancreas, prostate 

cofon. bladder, king, ovarian, pancreas, heads neck 
uterine, cofon, prostate 
colon, ovarian mudnous 

uterine, ovarian, lung, coton, stomach, head & neck, breast pancreas 
uterine, ovarian, lung, cofon, stomach, head & neck, breast pancreas 
cokm. breast fibrosis 
colon, tirea^ Svosis 
ooton. breast fibrosis 

. 945 



PreferredUtility 

Ab.8m, CTL, imaging 
Ab, sRk bnagkig 
Ab,8m 

Ab, sm. CTL, diagnosfic 
Ab. sm, cn, Imaging 
Ab, sm, diagnostic 
Ab, 8IIV (fiagnostfo 
CTL 

Ab, CTUdiagnostfo 
Ab,5m. CTL. Imaging 
Ab.sm, diagnostic 
Ab, cn., diagnostic 
Ab,sm, Imaging 
Ab,CTUbnaglng 
Ab,CTUdiagnoslfo 
sm,CTl 
Ab. sm, imaging 
Ab, sm. diagnostfo 
Ab,CTL.di8gnosIfo 
Ab.CTL, imaging 
Ab.CTU imaging 
Ab, CTL, diagnostic 
Ab.cn. diagnostfo 
Ab, CTL, diagnostfo 
Ab, CTL, diagnostfo 
Ab, CTU diagnostfo 
Ab.sn\ imaging 
Ab.sm, CTL, imaging 
Ab,CTL,diagnoslte 
Ab, sm, bnaging 

cn 

Ab, sm, CTL, diagnostfo 
Ab. imaging 
Ab,sm, CTl, imaging 
Ab.sm, CTL. imaging 
Ab,sm, CTL. imaging 
Ab,sm, CTL. imagfog 
Ab, imaging 
Ab^sm. CTL, Imaging 
CTL 

Ab. CTL, imaging 
Ab.sni, CTL, imaging 
Ab. CTL. imaging 
CTL 

ca 

CTL 
CTL 
CTL 

Ab^sm, CTL, imaging 
Ab. CTL, diagnostfo 
Ab, sm, CTL, diagnostfo 
Ab, sm, CTL, imaging 
Ab.sm, CTL. imaging 
Ab,sm,CTL.knaging 

ca 

Ab, sm, Imagfog 

Ab, CTL, imaging 

8m,CTL 

Ab. sm, imaging 

Ab,CTL,d]agnosGc 

Ak^ CTL diagnosSc 

Ab, CTL, diagnostfo 

CTL 

Ab^CTU Imaging 
Ab,8m,CTL, in^ng 
diagnostfo 

Ab, sm, CTL imaging 
Ab, sm, CTL, imaging 
Ab, sm, CTL. diagnostfo 
Ab, CTL, diagnostfo 
Ab, CTL, diagnostfo 
Ab, snv Imaging 
Ab, snv imaging 
Ab, sn\ tmaglng 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



Seq ID 149 & 150 104888 
Seq ID 151 & 152 420159 
Seq ID 153 & 154 422330 
Seq ID 155 & 156 452461 
Seq ID 157 & 158 413324 
Seq ID 159 & 160 412420 
Seq ID 161 & 162 416658 
Seq ID 163 & 164 407811 
Seq ID 165 402230 
Seq ID 165 & 167 402230 
Seq ID 166 & 169 432829 
Seq ID 170 & 171 425721 
Seq ID 172 & 173 420370 
Seq ID 174 & 175 437852 
Seq 10 176 & 177 402075 
Seq ID 178 & 179 421110 
Seq ID 180 & 181 451668 
Seq ID 182 & 183 451668 
Seq 10 164 & 185 451668 
Seq to 188 & 187 408243 
Seq ID 168 & 189 422262 
Seq ID 190 & 191 425852 
Seq to 192 & 193 439738 
Seq ID 194 & 195 404875 
Seq ID 196 & 197 425883 
Seq ID 198 & 199 404977 
Seq ID 200 A 201 420876 
Seq ID 202 & 203 427747 
Seq ID 204 & 205 420281 
Seq{D206 &207 446673 
Seq to 208 437553 
Seq 10 209 & 210 437553 
Seq ID 211 & 212 437553 
Seq 10 213 & 214 426900 
Seq ID 215 & 216 426900 
Seq ID 217 & 218 426900 
Seq ID 219 & 220 405932 
Seq to 221 & 222 405932 
Seq ID 223 & 224 424008 
Seq ID 225 & 226 424008 
Seq ID 227 & 228 424008 
Seq ID 229 & 230 444342 
Seq 10 231 & 232 421379 
Seq ID 233 & 234 417079 
Seq ID 235 & 236 412986 
Seq ID 237 & 238 412986 
Seq ID 239 & 240 412986 
Seq ID 241 & 242 447072 
Seq ID 243 & 244 419723 
Seq ID 245 & 246 419723 
Seq ID 247 & 248 430890 
Seq ID 249 & 250 456759 
Seq ID 251 429466 
Seq ID 252 429466 
Seq ID 253 £ 254 419721 
Seq ID 255 & 256 407034 
Seq 10 257 & 258 413472 
Seq ID 259 & 260 438380 
Seq 10 261 & 262 426271 
Seq 10 263 & 264 419704 
Seq ID 265 & 266 444471 
Seq ID 267 & 268 409395 
Seq ID 269 & 270 413063 
Seq ID 271 & 272 433800 
SeqID273 458435 
Seq 10 274 458435 
Seq ID 275 & 276 424343 
Seq 10 277 & 278 424998 
Seq ID 279 & 280 412709 
Seq ID 281 & 282 435615 
Seq ID 283 & 284 404049 
SeqlD285 & 286 418932 
Seq ID 267 & 288 404029 
Seq ID 289 & 290 436480 
Seq ID 291 & 292 452401 
Seq ID 293 & 234 452401 
SeqiD295 &2g6 438895 
Seq ID 297 & 298 421471 
Seq ID 299 & 300 428296 
Seq ID 301 & 302 423508 
Seq ID 303 & 304 450001 
Seq to 305 & 306 410407 
Seq ID 307 & 308 453496 



colon, stomach, uterine 
tiladder, stomach 
pancreas, colon, bladder 

bladder, lung, head & neck, ovarian, gOoMastama. stomach 00^ 
bladder 

bladder, glioblastoma, tung. stomach 

hing, ovarian, uterine, Uadder 

bladder, pancreas, stomach, uterine, king 

bladder 

Uadder 

bladder 

bladder 

bladder 

bladder, lung 

bladder. lung, head & neck, cervical 

bladder, pancreas, stomach, ovarian^ lung 

Uadder, ovarian, bng 

bladder, ovarian, lung 

tiladder, ovarian, bng 

bladder, stomach, head & neck, cervical 

bladder, lung, head & neck 

bladder, lung, head & neck 

bladder, lung, cervical 

bladder 

tiladder. pancreas 
bladder, ovarian 



bladder, lung, ovarian, stomach 

hmg. bladder, ovarian, pancreas 

bladder 

bladder 

bladder 

bladder 

bladder, prostate 
bladder, prostate 
bladder, prostate 

bladder, lung, head & neck, cenrical 
bladder, lung, heed &fleck, cervical 
bladder, head & neck, stoniach, oervteal 
bladder, head & neck, stomach, cervical 
bladder, head & neck, stomach, cervical 
hepatitis C tong. fibrosis, bladder 

breast pancreas, head & neck, king, stomac^ bladder, cervtedl. colon 

bladder, hmg, head & neck, oervfcal 

gficdilastoma 

glioblastoma 

gBoblastoma 



gifoblastoma 

gttobtastoma. hmg. oervtosl, bladder 
gfioblastoma 



gSoblastoma, uterine 

gnobtastoma, lung 

glioblastoma 

gitoblastoma 

gnoUastoma 

gBottotoma 

gRobiastoma 

glioblastoma, hmg, coton 

gitoblastoma 

glioblastoma, ovarian, bladder, hmg 
glioblastoma, lung 
glioblastoma 
glioblastoma 

gitoblastoma, ovarian, uterine 

glioblastoma 

glioblastoma 



glioblastoma 
glioblastofna 
glioblastoma 
glioblastoma 

bladder, breast pancreas, head & neck, stomach, hmg, flrthrflb, lenal 
bladder, breast pancreas, head i neck, stomach, hmg, arthritis, rend 
breast renat ovarian, gltoblaslDma 

renai 
ronsH 

loiatkmg 

renat kmg, cotorv, stomadh, ovarian, uterine 
renat prostate 
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At), Imping 

Ab, snv CTU Imaging 

Ab.sm, Cn. knaglng, dlagnosSc 

CTL 

Ah. sm.CTL, diagnostic 

At), <Sagnos(to 

Ab,CTl.diagnosBe 

Ab, sm, (fiagnostto 

sro,cn 

5m,CTL 

CTL 

Abbima^ng 

Ab. CTLrimaging 

Ah. sm, CTL, Imaging 

(fiagnosQc 

Ab. sm, diagnostic 

Ab.8m,CTU diagnostic 

Ab.sm. CTL. diagnostic 

Ab,sm.CTU diagnostic 

Ab, diagnostic 

CTUd^nostic 

Ab.sm. CTL, imaging 

Ab, snv CTL, imaging 

sm,CTL 

Ab,CTL, ima^ng 

Ab, sm, CTU diagnostic 

Ab.smiCrL,imagtog 

8m.ca 

Ab,8m.bnaglng 

6m,CTL 

Ab,CTL. Imaging 

Ab,CTU imaging 

Ab,CTL. imaging 

AbiCTL, Imaging 

Ab,CTL. imaging 

Ab,CTL, imaging 

sm 

sm 

Ab,sni, CTL, imaging 
Ab,sm, CTU imaging 
Ab,sm, CR, Imaging 
Ab.CTL,diagnostto 
Ab. on. diagnostic 
Ab, diagnostic 
Ab,sm,CTL, imaging 
Ab,sm, CTL, imaging 
Ab,sm,CTU imaging 
6m,CTL 

Ab.CTl, dlagnosSc 
Ab,sm, CTU imaging 
Ab, CTL, Imaging, diagnostic 
Ab,smi CTU imaging 



Ab.8m. CTU imaging 
Ab. CTU diagnostic 
Ab,sm, CTU Imaging 
Ab. CTU diagnostic imaging 
Ab. CTU diagnostic 
8m,CTL 

AhjBm, CTU imaging 
Ab, CTU diagnostic 
Ab. CTU diagnostic 
Ab. CTU imaging 



Ab.sni, CTU Imaging 
Ab.CTUdi3gno66c 
Ab,sm,CTUimat^g 
AbpSm* CTU Imaging 
Ab,sm* CTU imaging 
Ab,sn\CTUknaging 
Ab,8m, CTU imaging 
8ni,CTL 

Ab. CTU diagnostto 
Ab, CTU diagnostic 
AbtSm, imaging 
Ab.8m. CTU imaging 
Ab.8m. CTU imaging 
Ab. CTU knaging 
Ab.sm. CTU imaging 
Ab.8m, CTU imaging 
Abm CTU imaging 



wo 03/042661 



PCT/US02/36810 



SeqIO309&31O 420737 
Seqt0311& 312 309931 
Seq{D313&314 412719 
SeqlD315&316 417034 
SeqID317& 318 430486 
Seq ID 319 & 320 413753 
Seq 10 321 & 322 425650 
Seq ID 323 & 324 423673 
Seq 10 325 & 326 416663 
Seq 10 327 &32S 416663 
Seq 10 329 & 330 429610 
SeqlD331&332 406690 
Seq 10 333 & 334 431846 
Seq ID 335 ft 336 422158 
Seq ID 337 ft 338 431958 
Seq ID 339 & 340 437044 
Seq ID 341 & 342 428484 
Seq ID 343 & 344 429211 
SaqlD345 &346 417389 
Seq ID 347 & 348 431009 
Seq ID 349 & 350 417542 
Seq ID 351 & 352 449230 
Seq ID 353 & 354 410555 
Seql0355&356 410555 
Seq ID 357 & 358 424687 
Seq 10 359 & 360 418462 
Seql0361&362 410274 
Seq to 363 & 364 439508 
Seq ID 365 & 366 404877 
Seq iO 367 & 368 444781 
Seq ID 369 & 370 109424 
Seq ID 371 & 372 415817 
Seq ID 373 & 374 415817 
Seq ID 375 & 376 415817 
Seq ID 377 & 376 415817 
Seq ID 379 & 380 415817 
SeqlD381& 382 415817 
Seq 10 383 & 384 421817 
Seq ID 385 & 388 418678 
Seq 10 387 & 388 418678 
Seq ID 389 & 390 409420 
Seq ID 391 ft 392 332180 
Seq ID 393 & 334 408790 
Seq ID 395 & 386 408790 
Seq ID 397 & 398 439223 
Seq ID 399 &400 409757 
SeqlD401&402 428969 
Seq ID 403 & 404 428969 
Seq (D 405 & 406 428969 
Seq ID 407 & 408 428969 
Seq ID 409 & 410 450701 
SeqlD411&412 450701 
Seq ID 413 & 414 414774 
Seq ID 415 & 416 407944 
Seq ID 417 & 418 407944 
Seq 10 419 & 420 428486 
Seq 10 421 & 422 457489 
Seq ID 423 & 424 432874 
Seq ID 425 & 426 445891 
Seq to 427 ft 428 445891 
Seq to 429 ft 430 404682 
Seqt0431 ft 432 429547 
Seq 10 433 ft 434 425921 
Seq ID 435 ft 436 407242 
Seq ID 437 ft 438 407242 
Seq ID 439 ft 440 407242 
Seq ID 441 ft 442 432596 
Seq ID 443 ft 444 444006 
Seq 10 445 ft 446 423685 
Seql0447& 448 428392 
Seq to 449 ft 450 429597 
Seq ID 451 ft 452 448030 
Seq ID 453 ft 454 422109 
Seq ID 455 ft 456 419235 
Seq ID 457 ft 458 449048 
Seq ID 459 ft 460 427333 
Seq ID461 ft 462 417931 
Seq 10 463 ft 464 419216 
Seq ID 465 ft 466 431629 
Seq ID 467 &468 413554 
Seq ID 469 ft 470 452194 
Seq ID471 ft 472 452194 
Seq ID 473 ft 474 426322 



renal 
tung 

limg. head ft neck, bladder, gEobtastoma. oeMcat 

tung. head & neck, aooufdeceracsf 

lung, bladder, head ft nedc. cervlca) 

lung, bladder, head ft neck, panoeas, ceMcsl, colon 

lung, head & neck, oenric^ bladder 

bladder, king, head ft nack, ovarian, pancreas, coton, slomacli, uterine, cen^ 
kmo, bladder, head ft neck. oenrlBal 
lung, bbdder. head ft neck, cervfcal 
tung 

lung, head & neck, pancreas, stomach, bladder, ookNi,cervk:al 

lung, bladder, head ft neck, utsrina, oenrica]. stomach, ovartan 

head ft neck, bladder, lung, cetvfeal, stomach 

kmo. bladder, cen^. head ft neck, ovartan. colon, prostate. panoBas. bteast 

head ft neck, cervical, king, Uadder, breast piostate. ovarian, stomach 

kmg, glioblastoma bladder, head ft neck, cokm, oeivfeal 

king, bladder, head ft neck, ceivicalk stomach 

ovarian, tung. bladder, utarine, cervical, pancreas, stomach 

lung, bladder, head ft neck 

lung 

lung, cenficd. head ft nedt, Uadder, ovarian, cobn 

lung 

tui^ 

head & neck, pancreas, lung, uterine, cervteal, colon, stomach 
lung, bladder 
King, renal 

tung, bladder, head ft neck, cervical . 
hmg, bladder 

kmg, bladder, head ft neck, cervteal 
bladder, king 

king, gfioblastoma, head ft neck, ceivical. fibrosis 

king, glioblastDma, head ft neck, cenrical. fibrosis 

kmg. glioblastoma, head ft neck, cenrical, fibrosis 

kmg. gOoblastoma, head ft neck, cenrical. fibrosis 

kmg, gDoblastoma, head ft neck, cervteal, fibrosis 

kmg, glloblastoina, head ft neck, cenrteal, fibrosis 

kmg, cenricai. head & neck & bladder 

kmg, bladder, stomach, ovarian, pancreas ft cervical 

tong, bladder, stomack ovailan, pancreas ft cervteal 

king, head ft neck, pancreas, stomach, cervteal, bladder 

king 

kmg 

tung 

tung, head ft neck, cendcai, bladder ft ooten 

pancreas, stomach, king, bladder, stomach 

hmg. cervical 

lung, cervical 

kmg, cervical 

kmg.cendcal 

kmg, head ft neck, pancreas, breast, stomach, ovarian, colon 
kmg. head ft neck, pancreas, breast, stomach, ovarian, ooten 
kmg, bladder, head ft neck, pancmas, stomach, ovarian 
kmg, head ft neck 
tong, head ft neck 



pancreas, prostate, king 
pancreas, stomach 
stomach, pancreas, ovaifan 
stomach, pancreas, ovarian 
pancreas 

pancreas, head & neck, lung, ovarian 
stomach, pancreas 

pancreas, coton. bladder, head ft neck, stomach, hmg. ovarian, cervteal 
pancrsas. coton, bladder, head ft neck, stomach, hmg, ovarian, cervteal 
pancreas, coton. bladder, head ft neck, stomach, hmg, ovariam, cervtedl 

pancreas, breast 

pancreas, coton. kng, ovarian ft oenrtod 

paiureas, uterine, coton 

pancreas 

pancreas, coton. stomach, tung 

pancreas, renal and stomack 

pancreas, hmg, coton 

pancreas, fibrosis, head ft neck ft kmg 

pancreas, ovarian, uterine. giteUastoma. head ft neckft lung 

pancreas, coton 

ovarian, pancreas, stomach, coton. uterine, prostata 

pancreas, lung, stomach, cendcal, prostate, head ft neck 

pancreas, uterine, cervicd, stomach 

pancreas. gOoblastoma 

stomach, pancreas, renal, colon 

stomach, pancreas, ren^ coton 

pancreas, bladder, stomach 

947 



Ab,sm,CTL,ima^ 
Ab,sn\ CTL, tma^ng 

ca 

Ab.CTUdi3gnostte 
Ab,sm, Imaging 
CTUdiagnQstte 
Abttmaglng 
Ab.CTl.d!agnostto 
Ab, imaging 
Ab. Imaging 
Cn.. diagnostte 
Ab, imaging 
At). Imaging 
diagnostic 
Ab,CTL,kTiaging 
Ab^bna^ng 
Ab. sm. Imaging 
Ab, Imaging 
Ab. diagnostic 
Ab. sm. imaging 
CTL. dagnostte 
Ab, Imaging 
Ab,sm, imaging 
Ab,8m. Imaging 
Ab,8m.dIagnosite 
At^ knaging 
diagnostte 
Ab.sm. imaging 
CTL 

Ab, Imaging 
Ab.sm,CTI^ Imaging 
Ab.sm, CTL. imaging 
Ab,sm, Cn, Imaging 
Ab,sm, CTL, Imaging 
Ab,an. CTU Imaging 
Ab,sm, CTU Imaging 
Ab.sm. CTUimaglng 
Ab,8m,CTL.bnagtog 
CTL 
CTL 

CTL, diagnostte 
Ab, sm, Imaghg 
Ab,8in.CTl,kraglng 
Ab.sm,CTL,bnaglng 
Ab. CTL. Imping 
Ab. CTL, diagnostte 
Ab, CTL. diagnostte 
Ab, CTL, diagiUDsfic 
Ab, CTL, diagnostte 
Ab, CTL, diagnostic 
Ab,Ca. diagnostte 
Ab. CTL, diagnostic 
Ab,8m, diagnostte 
Ab. sm. imaging 
Ab,8m, Imaging 
Ab,CTL.dI^nostte 
Ab, CTL. diagnostte 
Ab, CTU diagnostte 
Ab.CTl,lmagteg 
Ab, CTL, Imaging 
Ab, CTL, diagnostte 
Ab, CTL, diagnostic 
Ab,sm, CTL, Imaging 
AbiSm, CTL. imaging 
Ab,sm,CTL.bnaglng 
Ab,8m, CTL, fanning 
CTL 

Ab,CTUImagbn 
CTL 

Ab,CTl, diagnostte 
Ab,8m, CTL, bnaging 
Ab,smL Imaging 
Ab. CTL, diagnostte 
Ab, CTL, Imaging 
Ab,CTL,bnagtog 
Ab,sm, bnaging 
Ab, diagnostte 
Ab, CTL. diagnostte 
Ab,CTU diagnostte 
Ab,CTU diagnostte 
Ab,8in,CrU imaging 
Ab,snv CTL, Imaging 
Ab. diagnostte 
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Seq 10 475 & 476 429010 
Seql0477& 478 431462 
Seq ID 479 & 460 448243 
Seq 10*481 & 482 426427 
Seq ID 483 & 484 428187 
SeqI0465& 486 428579 
Seq ID 487 & 468 428579 
Seq to 489 & 490 428579 
Seq ID 491 & 492 428579 
Seq(0493& 494 445537 
Seq U>495& 496 422278 
Seq I0497& 498 424620 
Seq ID 499 & 500 406400 
Seq (0 SOU 502 431130 
Seq 10 503 & 504 420440 
Seq 10 505 & 506 428450 
Seq ID 507 & 508 446619 
Seq (0 509 & 510 436982 
Seq ID 511 & 512 453392 
Seq 10 513 431989 
Seq ID 514 439620 
Seq ID 515 & 616 409178 
Seq 10 517 & 518 426514 
Seq ID 519 & 520 421478 
Seq ID 521 & 522 421476 
Seq ID 523 & 524 425776 
Seq ID 525 & 526 425776 
SeqlD527& 528 452097 
Seq ID S29& 530 416530 
Seq 10 531 & 532 431515 
Seq ID 533 & 534 419452 
Seq ID 535 & 536 412078 
Seq to 537 & 538 41 2078 
Seq ID 539 & 540 411773 
Seq 10 541 & 542 407792 
Seq ID 543 & 544 431616 
Seq 10 545 £ 546 452792 
Seq ID 547 & 548 400294 
Seq 10 549&SS0 432653 
Seq ID 551 & 552 432653 
Seq ID 553 & 554 432653 
Seq 10 555 & 556 432653 
Seq ID 557 & 558 400290 
Seq 10 559 & 560 410001 
Seq to 561 & 562 418396 
Seq ID 563 8 564 451027 
Seq ID 565 & 566 446057 
Seq ID 567 & 668 433466 
Seq ID 569 ( 570 453370 
Seq ID 571 & 572 453370 
Seq ID 573 & 574 414569 
Seq ID 575 4576 413435 
Seq ID 577 8 578 426501 
Seq ID 579 & 580 446999 
Seq 105818 582 408369 
Seq ID 583 & 584 412628 
Seq ID 585 8 586 403047 
SeqlD587& 588 403047 
Seq to 589 & 590 408430 
Seq ID 591 & 592 445413 
Seq ID 593 & 594 451982 
Seq ID 595 & 596 427958 
Seq to 597 & 598 421887 
Seq ID 5998 600 425071 
Seq ID 601 & 602 432101 
Seq 106038 604 407786 
Seq 106058 606 416836 
Seq ID 607 & 608 416539 
Seq ID 609 & 610 131083 
Seq ID 611 & 612 131083 
8eqn>613&614 131083 
Seq 106158616 131083 
Seq 10 617 & 618 422424 
Seq 106198 620 428970 
Seq ID 621 & 622 428330 
Seq ID 623 8 624 439018 
Seq ID 625 8626 420610 
Seq ID 627 8 628 425723 
Seq ID 6298 630 456662 
Seq 10 631 8632 418281 
Seq ID 633 8634 429903 



pancreas, lung, glioblslonia 

ovarian, uterine, lung, stomach, head 8 neck. gfloblaslDma, p 
ovarian, lung, head & neck, cenM ootofi. utaine, storrech 
ovarian, uterine, colon, sfaxnach 
ovarian, Ewing, uterine, pancreas 
ovarian, Ewing, uterine, pancreas 
ovarian, Ewing. uterine, pancreas 
ovarian, Ewing, uterine, pancreas 

ovarian, bladder, uterine, breast king, head & neck, renal, fibrosis, pancreas, cervical 
ovarian, head & neck, bladder, ceivial, kmg 



ovarian, uterine 

ovarian, uterine 

ovarian, uterine, cenrical 

ovaiten, ceivksd, pancreas, king 

ovariait, iibrods, pancreas, head 8^ neck, kmg, cokm 

ovarian, fibrosis 

ovarian, lung, gikiUaslonGi 

ovarian 

ovarian, uterine^ cer^rfcat breast prostate 
ovariart breast * 

ovarian, colon, bladder, hntg^ oervtesl 

ovarian, bladder 

ovarian, bfaddef 

ovarian, uterine, lung 

ovarian, uterine, lung 

ovarian 

ovarian, uterine 

ovarian, prostate, king 

ovarian, prostate, king, breast uterine 

ovarian 

ovarian 

ovarian 

ovarian, uterine, cervical, pancreas 
prostate, pancreas, colon 
prostate, uterine, breast 
presto, taxol piost^ 
prostate, king 
prostate, lung 
prostate, lung 
prostate, kmg 
prostate, colon 

warian, prostate, uterine, oeivlcat kmg 
prostate 

prostate, uterine^ gtMbtasbma 



prostate 



prostate, gtoWastoina, pancreas 
prostate, breast glioblastomas kmg 
prostate, gOobtastoriQ 

prostate, lung, fibrosis, uterinei Q^totriastonis^ oervicait ovarian 
prostate 

prostate, tiladder, coton 
prostate, bladder, ookin 

prostate 

prostate, coton, uterine, ovarian, kmg, pancreas 
prostate, bladder 
prostate, gQot}lastonia 



prostate, colon, stomach, uterine, cervical, head & neck, pancreas 



prostate, coton, uterine, stomach, infiammatory bowel disease, ovarian 

Prostate. Bwlng, gEoblastoma 

Z01839 resistant cancers, head 8 neck 

androgen withdranvsd prostate 

androgen wQhdiayral prestats 

arxkogen wtthdrwral prostate 

androgen withdrawal prostate 

bladder, pancreas, prostate, angtogenesis. oobn, stomadi, kmg 
stomadi, pancreas, ookm 

uterim, ovarian, Gtxosis, prostate, pancreas, kmg, bladder, head & neck 
uterine, stomach, prostate 
uterine, ovarian endometriokl king 
ovarian endometriokl, uterine^ colon 
uterine, ovarian 



snvCTL 
Ab.di30nos6c 
Ab,sm.tm^^g 
8m,CTL 

Ab3n\ CTU fanaging 
Ab,sm,CTUknaging 
Ab,sm,CTUbnagihg 
Ab.sm,CTl, imaging 
Ab.sm.CTU imaging 
Ab^CTL, diagnostic 
Al^,kna^g 
Abb Cn, diagnostic 
Ab, CTU-dlagnostlc 
Ab, Cn, diagnostic 
Ab,C7U diagnostic 
sm 

Ab, diagnostic 
Ab,CTL, diagnostic 
CTL 



Ab. CTL, diagnostic 
Ab, CTL, diagnostic 
Ab, CTL, im^ng 
Ab, CTL, imaging 
Ab,sm, CTL. Imaging 
Ab.sm, CTU Imaging 
Ab, sm, diagnostic 
Ab,CTl,diagnosb'c 
Ab,sm, CTL, Imaging 
Ab,sm,CTl,imagk)g 
CTL 
CTL 

Ab,8m,CTL,diagnosGc 
Ab. CTU diagnostic 
Ab,sm,CTl, imaging 
Ab, CTL, bnaging 
Abysnt CTL, Imaging 
AbjBTn, cn, imagtaig 
Ab,8m. CTL, imaging 
Ab,smi CTL. imaging 
Ab,8n\ CTL, imaging 
AbisnvCTUknagkyg 
Ab, CTL, diagnostic 
Ab.sm, CTL, imaging 
Ab,8m, CTL, imaging 
Ab.sm, CTL, imaging 
Ab,CTL,d[dgnostlG 
Ab,8m. CTL, Imaging 
Ab,sm, CTL, bnaging 
Ab,&m, CTt, knaging 
Ab.sm. diagnostic 
Ab, CTl, Imaging 
Abjsm, cn, kn^ng 
AbisnvCn, bnaging 
Ab, CTL, diagnostic 
Ab,8m, CTL, knaging 
Abism, CTL, imaghig 
Abism, CTL. Imaging 
diagnosOc 
CTL 

Ab,8m, CTL, imaging 
Ab, CTL, imaging 
Ab, diagnostic 
Al^sn^ bnaging 
Ab,8nt Imaging 
Ab. CTL, diagnostic 
Ab, sm, CTL, knaging 
Ab, sm. CTL, bnaging 
AtK snv CTL, bnaging 
Ab^ STHb CTL Imaging 
Ab, sm. CTL, knaging 
Ab, CTU diagnostic 
Ab,snvkn3gbg 
AI\8m,CTUdi90nostIc 
Abysm, CTU kna^g 
CTL 

Ab, CTU diagnostic 
CTL 

Ab.CTU<n8gnosGc 
sm 
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Table 78 



5 

10 
15 

20 
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75 

80 



Seq ID NOx 1 DMA eequence 

Nucleic Acid Accession fit NM_001400 

Codisg sequence : 2 5 1 1 3 99 



1 11 21 31 41 51 

I 1 I I i I 

TCTAAAQGTC GGGC36CAGCA 6CAAGATGCG AAGCGAGCC6 TACAGATCCX: GGGCTCTCCG 60 

AACGCAACTT GGCCCTGCTT GA60GAG6CT GOGQTTT CO G AGQCCCTCTC CAGCCAAG6A 120 

AAAGCTACAC AAAAAGGCTO GATCACTCAT CQAACCACCC CTQAA6GCAQ TQAAGGCTCT 180 

CT06CCTCGC CCTCTAGCGT TOGTCI GG AG TAGCX3CCACC C06GCTTCCT aGGCSAGACAO 240 

GGTTGGCACC ATGGGGCCCA CCAOOSTCCC GCTGGTCAAG GCCCACOGCA GCTCGGTCTC 300 

TGACTACGTC AACTATGATA TCATOTrCCG GCATTACAAC TACAOGGGAA AGCTGAATAT 360 

CAGOGOQOAC AAGGA6AACA QCATTAAACT GACCTCX36TG GT6TTCATTC TCATCTGCTG 420 

CTTTATCATC CTQQAGAACA TCTTTGTCTT QCTQACCATT TGGAAAACCA ACAAATTCCA 480 

CCGACCCATG TACTATTTTA TTGGCAATCT GQCCCT C T CA GACCTQTTGO CA06AGTAGC 540 

CTACACAOCT AACCTGCTCT TOTCTGGGGC C3VCCACCTAC AAGCTCACTC CCGCCCAGTQ 600 

GTTTCTGOGG GAAGGGAGTA T G TTTGTGGC CCTGTC3VGCC TCCOTQTTCA GTCTCCTCGC 660 

CAT06CCATT GAGOGCTATA TCACAATOCT GAAAATGAAA CTCCACAAOQ GGAGCAATAA 720 

CTTCCGOCTC TTCCTGCTAA TCAGC6CCTG CTQGGTCATC TCCCTCKTOC TGGGTGOCCT 780 

6CCTATCATQ GGCTQGAACT GCATCAGT6C GCTGTCCAGC TGCTCCACOQ TGCTGCCGCT 840 

CTACCACAAG CACTATATCC TCTTCTGCAC CAOGGTCTTC ACTCTGCTTC TGCTCTCCAT 900 

CX3TCATTCTG TACTGCAGAA TCTACTCCTT GGTCAGGACT CXX3AGCXX3CC GCCTGACGTT 960 

OOQCAAGAAC AITTCCAAGG OCAGCOGGAG CTCTGAGAAO TGGCTGG06C T6CTCAAGAC 1020 

OQTAATTATC GTCCXGAGOG TCTTCATCGC CTGCXOOaCA OOGCTCTTCA TCCTOCTCCT 1080 

GCTGGATGTG GGCTGCAAGG TGAAGACCTG TGACATCCTC TTCA0A60GG AGTACTTCCT 1140 

GGTGTTAGCT GTGCTCAACT CCX3GCACCAA CCCCATCATT TACACTCTOA CCAACAAGGA 1200 

GATGCGTCGO GCCTTCATCC GOATCATGTC CTGCTGCAAG TGCCOGAGCG GAGACTCTGC 1260 

TGGCAAATTC AAGCGACCCA TCATOGCOGG CATG6AATTC AGC06CAGCA AATOQGACAA 1320 

TTCCTGCCAC CCCO^GAAAO AOGAAGGQQA CAACCCAGAO ACCATTATGT CTTCTGGAAA 1380 

C30TCAACTCT TCTTCCT A QA ACTOGAAOCT GTCCACCCAC OGGAAGOGCT CTTTACTTGG 1440 

TCX3CTGGCCA CCCCAGTGTT TGGAAAAAAA TCTCTGGGCT TCGACTQCTG CX»GGGAGQA 1500 

GCTGCTGCAA GCCAGAGGGA GGAAGGGGQA GAATACX3AAC AGCCTGGTGG T6TGGGGTGT 1560 

TGQTGGGTAQ AGTTAGTTCC TGTGAACAAT QCACTGGGAA GQGTGOAGAT C3U3GTCC(XK3 1620 

CCTGGAATAT ATATTCTACC CCCCTGGAGC TTTCATTTTa CACTGAGCCA AAG6TCTAGC 1680 

ATTGTCAAGC TCCTAAAGGG TTCATTTGGC CC CTOCT CAA AGA CTAAT GT CCOCATGTOA 1740 

AAGOGTCTCT TT6TCTGGAG CTTTGAGGAG ATGTTTTCCT TCACTTTAGT TTCAAACCCA 1800 

AOTGAOTaTQ TGCACTTCTG CTTCTTTAGG GATGCCCTGT ACATCCCACA CCCCACCCTC 1860 

CCTTCCCTTC ATACCCCTCC TCAAOGTTCT TTTACTTTAT ACTTTAACTA CCTGAGAGTT 1920 

ATCAOAGCTQ GGGTTGTGGA ATGATCX3ATC ATCTATAGCA AATAGGCTAT GTTGAOTAOQ 1980 

TAGGCTGTGG GAAGATGAAG ATGGTTTGGA GGTGTAAAAC AATGTOCTTC 6CXGAGG0CA 2040 

AAGTTTCCAT GTAAGCGGGA TCOQTTTTTT GQAATTTGGT TGAAGTCACT TTGATTTCTT 2100 

TAAAAAACAT CTTTTCAATG AAATGTGTTA CCATTTCATA TCCATTGAAG CCGAAATCTQ 2160 

CATAAQGAAG CCCACTTTAT CTAAATGATA TTAGCCAGGA TCCTTGGTGT GCTAGGAGAA 2220 

ACAGACAAGC AAAACAAAGT GAAAACGGAA TGGATTAACT TTTGCAAACC AA6GGA6ATT 2280 

TCTTAGCAAA TGAGTCTAAC AAATATQACA TCCQTCTTTC CCACTTTTOT TQATGTTTAT 2340 

TTCAGAATCT TGTGTGATTC ATTTCAAGCA ACAAC3VT6TT GTATTTTGTT GTGTTAAAAG 2400 

TACTTTTCTT GATTTTTGAA TOTATTTGTT 'TCAOaAAGAA GTCATTTTAT GGATTTTTCT 2460 

AACCCGTOrr AACTTTTCTA GAATCCACCC TCTTGTGCCC TTAAGCATTA CTTTAACTGG 2520 

TAGGGAACGC CAGAACTTTT AAGTCCAGCT ATTCATTAGA TAGTAATTGA AGATATOTAT 2580 

AAATATTACA AAGAATAAAA ATATATTACT GTCTCTTTAO TATGGTTTTC AGTGCAATTA 2640 

AACGGAGAOA WlXmVITr TTTTAAAAAG AATAGTATTT AATAGGTTTC T6ACTTTTGT 2700 
GGATCATTTT GCACATAGCT TTATCAACTT TTAAACATTA ATAAACTGAT TTTTTTAAAO 



8eq ID NOt 2 Protein sequence 
Protein Accession «t NP_001391 



KGPT8VPLVK 
LENIFVLLTI 
KGSMFVALSA 
GWNCISAL8S 
ISKASRS8BK 
VINSGTNPII 
PQKDBGDNPE 



11 
i 

AHRSSVSDYV 
WKTKKFHRPM 
8VF8LZAIAI 
C8TVLPLYHK 
SLALLKTVII 
YTLTNKEMRR 
TIMSSGNVN8 



21 
I 

NYDIIVRHYN 
YYFIGNLALS 
GRYITMIiKMK 
HYILPCTTVP 
VL8VPIACHA 
AFIRIMSCCaC 



31 
I 

YTGKLHISAD 
DLLAGVAYTA 
LHKQSNNFRI* 
TLLLLSZVIL 
PLFILLLLDV 
CPSGDSAGKF 



Seq ID NO: 3 DMA sequence 

Nucleic Acid Accession «i NM_002205.1 

Coding sequence i 1..3149 



ATGGGGAGCC 
CX3C06ACCCC 
GGCTTCAACT 
GQATTCTCAG 
CCCAAGGCTA 
TQGGGTQCCA 
CTGGAGTCCT 
TGGTTCGGGG 
AGCIG60GCA 
GATAACTTCA 



11 

1 

GGACGCCAGA 
CGCTSSTGCC 
TAGA CGCGGA 
TGGAGTTTTA 
ATACCA6CCA 
GCXXXIACACA 
CACTGTCCAG 
CAACAOTTOO 
CAQAGAAGQA 
CCOGAATTCT 




41 51 
I I 

KENSIKLTSV VFILICCPII 60 

NliLLSGATTY KLTPAQWFLR 120 

FLLISACWVI SLILGGLPIM 180 

YC31IYSLVRT RSRRLTPRKN 240 

GCKVKTCDIIt FRABYFLVLA 300 

KRPIXAQMBF SRSXSDNSSH 360 



51 
1 

GGGCCCCOGG 60 

CA6GGT0GGG 120 

CTCCTTCTTC 180 

GGTGGGAGCA 240 

CCTCTGTCCT 300 

CTCTCXXSCTC 360 

GTOCTTGCAG 420 

TOCACTGTAC 480 

CCTCTC CACA 540 

CTGGGCA6CA 600 



949 



wo 03/042661 
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10 
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GGACAGGOTT ACT6GCAAGG AGGCTTCAGT GCGQAGTTOV CCAAGACTGO CCGTGTGGTT 660 
TTAQ6T6GAC CAQCSAAGCTA TTTCTG6CAA 6GCCMSATGC TOTCTGCC A C TCAGGAGCAO 720 
ATTGCA6AAT CTTATTACCC CGAGTACCTG ATCAACCTGO TTCAOGGGCA GGTGCAQACT 760 
0GCXy«3GCCA GTTCCATCTA TQATGACAGC TACCTAGGAT ACTCTGTGGC TCTTGGTOAA 840 
TTCAGTGGTG ATGACACAGA AOACTTTOTT GCTGGTGTGC CCAAACGGAA CCTCACTTAC 900 
GGCTATGTCA CCAT CCTTAA TGGCTCRaAC ATTOGATCOC TCTACyulCTT CTC AGGG OAA 960 

CAGATGGOCT CCTACTTTGO CTATGCAGTG GGCGGCACAO ACGTCAATGG GGA0666CTG 1020 

GAT6ACTTGC TGGTGGGGGC ACCOCT6CTC ATGGAT06GA OCCCT6A060 GOGGCCTCAG 1080 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG QCATAGAGCC CACGCCCACC 1140 

CTTAGGCrcA CTGGCCATGA TGAGTrrGGC CGATTTGGCA GCTCCTT6AC CCCCCT GGGG 1200 

GACCTGGACC AGGATGGCTA CAAtGATGTO GCXATOCGGG CTCXXrmX M T060GAQACC 1260 

CAQCAGG6AG TAOTQTTTGT ATTTCCTQGQ OGCCCAGGAO G8CTG6SCTC TAAGCCTTCC 1320 

CACX5TTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CltSTGGTATA CAGGGGCCGC CCCATOQTGT COQCTAGTGC CTCCCTCACC 1500 

ATCTTCCCCG COVTGTTCAA CCXAGftOGAG OGGAiGCTGCA GCTTAOAGGQ GAACCCTQTG 1560 

GCCTQCATCA ACCTTAGCTT CTOCCTCAAT GC I TCTGGAA AACA0GTT6C TGACTCCATT 1620 

6QTTTCACAG TOGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT AC6GCGQGCA 1680 

CTGTTCCTQO CCTCCAOGCA GQCAACCXnX3 ACCCAQACCC TQCTCATCCA GAATGOGQCT 1740 

0GM3AG6ATT GCAGAGAGAT GAAGATCTAC CTCA6GAACG AGTCAOAATT TCGAGACAAA 1800 

CTCT06C0GA TTCACATOGC TCTCAACTTC TGCTTGGACC CCCAAOCGCX: AaTQGACAaC 1860 

OV066CCTCA 6GCCAQCCCT ACATTATCAG AGCAAGA6CC GGATAGAGQA GAAGGCTCAO 1920 

ATCTTGCTGQ ACTGTGQAGA AGACAACATC TGTGTGCCTQ ACCTGCAGCT GGAAGTGTTT 1980 

GCGGAGCAQA ACCAT0XX3TA CCTGGGTQAC AAQAATGCOC TGAACCTCAC TTTCCATGCC 2040 

CAGAATGTGG 6TGAGGGTGG 0GCCTAT6AG GCTGAGCTTC G6GTCACC6C CCCTCCAGAG 2100 

GCTOAGTACT CAGGACT06T CAGACACCCA GGGAACTTCT CCA6CCTGAG CTGTGACTAC 2160 

TTTGCXSTGA ACCA6AGCCX3 CJCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTOT GGGGTGOCCT TGOGTTTACA GTCCCTCATC TCOSGGACAC TAAQAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCX5CAAAG CGACX3TGGTT 2340 

TCCTTT06GC TCTCOSTGGA GGCTCA6GCC GAGGTCACCC T6AA0GGTGT CTCCAAOCCT 2400 

GAGGCAGTGC TATTCCCAGT AAGOGACTGG GATCCCCOAG ACCAGCCTCA GAAOGAGGAG 2460 

GACCT6GGAC CT6CTGTCCA CCATGTCTAT OAGCTCATCA AOCAAGGCCC CA6CT0CATT 2520 

AGCCAGGGTG TGCIQGAACT CAOCTGTCCC CAGGCTCTGQ AAGGTCAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC COQAQQQTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCA8C 2700 

TCTGCTTCCT OGGGAGCTCA GATCCTGAAA TGCCOGGAGG CTGAGTGrTT CAGGCTGOGC 2760 

TGT6AGCTGG GGCOOCTGCA CCAACAAGAG AQGCAAAOTC TGCAGTT6CA TTTCCGAOTC 2820 

TG6GCCAAGA CTTTCTTQCA GOGGGAGCAC CAGGCATTTA GCCTGC3VGTG TGAGGCTGTG 2880 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCCT 2940 

GAGGTGGGCA CAGCTOTGCA ATGGACGAAG GCAGAAGGCA GCTATGGOGT CCOICTGTGO 3000 

ATCATCATCC TAGCGATGCT GTTTG6CCTC CTGCTCCTAO GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG OATTCTTCSUl ACQCTCCCTC CCATATGOCA CXGCCATOGA AAAAGCTGAG 3120 
CTCAAGCCTC CAGGCAGCTC TGATGCCTGA 



8eq ID NOs 4 Protein eequence 
Protein Accession #i NP_002196.1 



MQSRTPESPL 
GFSVBPyRPO 
LBSSLSSSEG 
DNPTRILEYA 
lAESYYPEYL 
GyVTILNQSD 
EVGRVYVYIiQ 
QQGWPVPPG 
VDKAWYRGR 
GFTVELQU3W 
LSPXHIAUIF 
GBQNRVYLGD 
PAVNQSRLLV 
BFRLSVEAQA 
SQGVLBL8CP 
GASSGPQXLK 
YKALXMPYRI 
YKLGFFKRSL 



11 
I 

HAVQZiR»GPR 
TDGVSVLVGA 

EEPVEYXSLQ 
PCRSDF8WAA 
im<VQGQLQT 
IRSLYNFSGB 
BPAGIEPTPT 
GPGGL6SKPS 
FIVSASASLT 
QKQKG6VRRA 
SXiDPOAPVDS 
XHALNLTFRA 
CDLGNPMKAG 
QVTLNGVSKP 
QALEGQQLLY 
CPBABCFRLR 
LPRQLPQKER 
PYGTAMEKAQ 



21 
I 

RRPPLLPUiL 
FKANTSQPGV 
WFGATVRAHG 
GQGVCQGGPS 
RQAS8IYDDS 
QHASYFGYAV 
LTLTGHDEF6 
QVLQPLWAAS 
IPPAHFNPEE 
IiFLASRQATL 
BQIiRPAiayQ 
QNVGBGGAYB 
ASLWGGLRFT 
EAVLFFVSDW 
VTRVTGLNCT 
CEUBBJJBQQE 
OVATAVQHTK 
LKPPATSDA 



31 
I 

LLLPPPPRVG 
bOGGAVYLCP 
SSILACAPLY 
ABPTKTGRW 
YLGYSVAVGB 
AATDVNGDGL 
RFQS8LTFL6 
HTPDPFGSAL 
RSC8LBGKFV 
TQTLLIQUGA 
8XSRIEDXAQ 
ABLRVTAPPB 
VPHLRDTKKT 
HPRDQPQKEB 
TNHPINPKGL 
SQSLQLBFRV 
AB6SYGVPLN 



41 
I 

GFNLDABAPA 
HGASPTQCTP 

SNRTEKEFIiS 
LGGPGSYFWQ 
PSGDDTKDFV 
DDUiVOAPUi 
DLDQDGYNDV 
RGGRDXJ3GNG 
ACINI»SPCWI 



ZLLDCGBDNZ 
AEYSOLVRHP 

IQFDFQZLSK 
DZiGPAVHBVY 



HAKTFLQREH 
IIIZAZLF6L 



51 
I 

VLSGPPOSFP 
IEPD8KGSRL 
DPVGTCYLST 
GQILSATQEQ 
AGVPKGNLTY 
HDRTPDGRPQ 
AIGAPFGGET 
YPDLIVGSFG 
A8GKRVADSI 
LRNBSEFRDX 
CVPDLQLEVF 
GHFSSIjSCDY 
NUINSQSDW 
ELINQGPSSI 
QQXRBAPSRS 
QPFSIQCBAV 
LLLGUiZYIL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seg ID NO I 5 DMA sequence 

HUclelc Acid Accession ffi MM_002211.1 

Coding sequencei 1..2397 



ATGAATTTAC 
CAAACAGATQ 
GCAGGGCCAA 
TCTGCAOQAT 
QAAAATCCCA 
GOAACAGCAa 
TTGGGATTAA 
TATCCCATTQ 
AATGTAAAAA 



11 

I 

AACCAATTTT 
AAAATAGATO 
ATTGTGGGTG 
OTGATQATTT 
GAGGCTCCAA 
AGAAGCTCAA 
GATCAGGG6A 
AGCTCTACTA 
GTCTTGOAAC 



21 
I 

CTGGATTGGA 
TTTAAAAGCA 
GT6CACAAAT 
AGAAGCCTTA 
AGATATAAAO 
GCCAQA6QAT 
GCCACAGACA 
CCTTATGQAC 
A6ATCTGATG 



31 
I 

CTQATCAGTT 
AATQCCAAAT 
TCAACATTTT 
AAAAAGAAGG 
AAAAATAAAA 
ATTACTCAOA 
TTTACATTAA 
CTGTCTTACr 
AATOAAATGA 



41 

I 

CAG T ' X TGCr G 
CAT6TGGAGA 
TACAGGAAGG 
OTTGCCCTCC 
ATGTAACCAA 
TCCAAGCACA 
AATTCAAGAG 
CAATQAAAGA 
GOAGQATTAC 



51 

I 

TGTGTTTGCT 
ATGTATACAA 
AATOCCTACT 
AQATQACATA 
CCGTAGCAAA 
GCAGTTGOTT 
AGCT6AAGAC 
CQATTTGGAG 
TTCGGACTTC 



60 
120 
180 
240 
300 
360 
420 
460 
540 
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AGAATTGGAT 
GCTAAGCTCA 
AATGTGCTCA 
ATATCTGGAA 
TGTGGATCAC 
GGGTTTCACT 
TGTCACCTX3G 
CACCTTGTCC 
TTTCAGCCTG 
TCTGCAAATT 
GAA6TCATTT 
TGCftAOAAOQ 
6GAGATGAGG 
QACAGCTTTA 
ATCTGTGAAT 
AA1GGGACAT 
GAAT6CA6CA 
AGTTCA6AAA 
AGGQATAATA 
GATA6ATCCA 
T6CAACCCCA 
GCCAGCAACG 
ACAGATC06A 
GCTGAGCATA 
TGCACACAGG 
CAGC06QTCC 
TTCTATTTTA 
CCAQAGTOTC 
GTTCTTATTG 
AGGGAGTTTG 
CCTATTTATA 



TTGGCTCATT 
GGAACCCTTG 
GTCTTACTAA 
ATTTGGATTC 
TQATTGGCTO 
TTGCTGGAGA 
AAAATAATAT 
AGAAACTGAG 
TTTACAAGGA 
CTAGCAATQT 
TGGAAAAC360 
GOOTQAATGG 
TTCAATTTGA 
AAATTAGGCC 
GTGAATGCCA 
TTGAQTSTGG 
CAGATQAAOT 
TCTGCAGTAA 
CAAATQAAAT 
ATG6CTTAAT 
ACTACACT60 
GACAGATCTO 
AGTTTCAAGG 
AAGAATGTGT 
AATGTTCCTA 
AACCTGATCC 
CGTATTCAGT 
CCACTQOTCC 
GCCTTGCATT 
CTAAATTTGA 
AGAGTGCCGT 



TGTGGAAAAO 
CACAAGTGAA 
TAAAGGAGAA 
TGCAGAAGGT 
GAGGAATGTT 
TGGQAAACTT 
GTACACAATO 
TGAAAATAAT 
GCTGAAAAAC 
AATTCAGTTG 
CAAATTGTCA 
AACAOGGQAA 
AATTAGCATA 
TCTGGGCTTT 
AAGCGAAGGC 
OGGGTGGAGG 
TAACAOTGAA 
CAATGGAGA6 
TTATTCTGGC 
TTGTGGAG6A 
CA0TGC91TGT 
CAAT0GO086 
GCAAAOQTGT 
TCAGTGCAGA 
TTTTAACATT 
TGTOTCCCAT 
GAATG6GAAC 
AOACATCATT 
ACTGCTGATA 
AAAGQ AGAAA 
AACAACTGT6 



ACTGTQATGC 
CAQAACTGCA 
GTATTTAATG 
GGTTTOGATG 
ACACGGCTGC 
GGTGGCATTG 
AOOCATTATT 
ATTCAGACAA 
TTGATCCCTA 
ATCATTGATG 
GAAQOASTAA 
AATOGAAOAA 
ACTTCAAATA 
ACGGAQGAAG 
ATOCCTOAAA 
TGGAATGAAG 
GACATGGATO 
TGCGTCTGCG 
AAATTCTGCX5 
AATGGTGTTT 
GACTGTTCTT 
GGCATCT6CG 
GAQATOTGTC 
GCCTTCAATA 
AGCAAGGTAG 
TGTAAOGAGA 
AACGAGGTCA 
CCAATTOTAO 
TGGAAGCTTT 
ATGAATGCCA 
GTCAATCCQA 



CTTACATTAQ 
CCAOCCCATT 
AACTTGTTCQ 
CCATCATGCA 
TGGTGTTTTC 
TTTTACCAAA 
ATQATTATGC 
TTTTTGCAGT 
AGTCAGCAGT 
CATACAATTC 
CAATAAOTTA 
AATGTTCCAA 
AGTGTCCAAA 
TAGAGGTTAT 
GTCCCAAGTG 
GGOSTOTTGa 
CTTACT6CAG 
GACA6TGTGT 
AGTGTGATAA 
GCAAGTGTOG 
T GOATA CTftQ 
AGTOTGGTGT 
AGAGCTGCCT 
AAGGAGAAAA 
AAAGTOGGGA 
AGQATGTTOA 
T6GTTCATGT 
CTOarOTGOT 
TAATGATAAT 
AATGGGACAC 
AGTATGAGGG 



CACAACACCA 
TAGCTACAAA 
AAAACAGOGC 
AGTTGCAGTT 
CACAGATOGC 
TGATG6ACAA 
TTCTATTGCT 
TACTGAAGAA 
AGGAACATTA 
CCTTTCCTCA 
CAAATCTTAC 
TATTTCCATT 
AAAGGATTCT 
TCTTCAGTAC 
TCATGAAGGA 
TAGACATTGT 
GAAAGAAAAC 
TTGTAGGAAG 
TTTCAACTGT 
TGTGTGTGAG 
TACTTGTQAA 
CTGTAAOTGT 
T GGTGTCTGT 
GAAAGACACA 
OVAATTACGC 
0GACT0TTG6 
TGTGGAGAAT 
TGCTGGAATT 
TCATGACAGA 
GGGTGAAAAT 
AAAAT6A 



Seq ID NOt 6 Protein eeguence 
Protein Accession KP_002202.1 



1 
I 

MNLQPIFWIG 
SAROJDLBAL 
LRLRSGSPQT 
RI0F6SFVBK 
XSGMIiDSPEQ 
CHLENNMYTM 
SANSSNVIQL 
GDEVQFEI8I 
NGTFEGGACR 
RDNTNBZYSG 
ASNGQICNGR 
CTQECSYFNI 
FECPTGPDII 
PIYXSAVTTV 



11 

I 

LISSVCCVPA 
KKKGCPPDDI 
FTXiKFKRAED 
TVMPY18TTP 
GFDAXHQVAV 
SHYYDYPSIA 
IIDAYNSLSS 
TSZnCCFKKDS 
OIGGRVGRHC 
KFCECDNFNC 
GICEOGVCKC 
TKVESRDKLP 
PIVA6WAGI 
VNPKYBGK 



21 
I 

QTDENRCLKA 
EKPRGSKDIK 
YPIDLYYLMD 
AXUtKPCTSB 
GQSZtXGWRNV 
HLVQKLSENN 
EVILENGXLS 
OSFKIRPLGF 
BCSTDSVIISS 
DRSNQLIOGG 
TDPKFQGQTC 
QPVQPDPVSH 
VLZGLALLLI 



31 

I 

NAKSC6ECIQ 
KNKNVTNRSK 
LSYSMKDDLE 
QNCTSPFSYK 
TRLLVFSTDA 
IQTIPAVTEE 
EGVTISYKSY 
TEBVEVILQY 
DMDAYCRKEN 
NGVCKCRVCE 
EMCQTCLGVC 
CKSKDVDDCW 
WKLLMZIHDR 



41 

I 

AGPNOGWCTN 
GTABKLKPED 
NVXSliGTDLM 
MVIiSXiIMKOB 
0FKFAQ3GKL 
FQPVYKELKN 
CKNGVNGTGB 
ICECE0QSB6 
SSBIC8NNGE 
CNPNYTGSAC 
AEHKECVQCR 
FYPTYSVNGM 



Seq ID HOt 7 DMA eeguence 

Nucleic Acid Accession ft: NM_002425 

Coding sequence t 26.. 1453 



AAAQAAGGTA 
AOTCTGCTCT 
TGCCCAGOUV 
AAAGGACACT 
GGTGACAGGG 
TCCTGACGTT 
TACATACA60 
TGAGAAAGCT 
AGGAGAG6CT 
TGATGGCCCA 
TATTCACTTT 
OGTTGCTGCT 
TTTGAT6TAC 
TQATGTGAAT 
GaTGCXX:ACA 
QTCCTTCGAT 
TTGGCGAAGA 
CTCTCTTCCA 
TTTTAAAGGA 
AOGCATCCAT 
CAAGGAAAAG 
TAGCCAGTCC 
GCCTAAGQTT 
ACAOTTTGAG 
GTTACATTGC 
ATTATTCATC 



11 

I 

AGGGCA6TGA 
GCCTATCCTC 
TACCTASAAA 
AATCTCATTG 
AAGCTAGACA 
GGTCACTTCA 
ATTGTGAATT 
CTGAAAGTCT 
GATATAATGA 
GGACACAGTT 
GATGATGATG 
CATGAACTT6 
CCACTCTACA 
GGCATTCAGT 

aaatctgttc 
gcx:atcagca 

TCCCACTGGA 
TCATATTTQO 
AATGAGTTCT 
ACCCTGGGTT 
AAGAAAACAT 
ATGGAGCAAO 
GATGCTGTAT 

nroAoccxiA 

TAGGOQAOAT 
TAATOTATTA 



21 
I 

GAATGATGCA 
TGAGTGGGGC 
AGTACTACAA 
TTAAAAAAAT 
CTGACACTCT 
GCrCCTTTCC 
ATACAGCAGA 
GG6AAQAGGT 
TCTCTTTCXJC 
TGGCTCATGC 
AAAAATGGAC 
GOCACTCCCT 
ACTCATTCAC 
CTCTCTACX3G 
CTTCGGGATC 
CTCTGAGGGG 
AOCCTGAACC 
ATGCTGCATA 
GGGCCATCAG 
TTCCTCCAAC 
ACTTCTTPGC 
OCTTCCCTAG 
TACA6GCATT 
ATQCCAQGAT 
AGGGGQAAOA 
TQAGCCAAAA 



31 

I 

TCTTGCATTC 
AGCAAAAGAG 
OCTOGAAAAG 
GCAAGGAATG 
GGAGQTGATO 
TGGCATGCCG 
TTTGCCAAGA 
GACTCCACTC 
AGTTAAAOAA 
CTACCXZACCT 
AGAAGATGCA 
GGGGCTCTTT 
AGA8CT0GCC 
ACCTOCOC C T 
TGAQATGCCA 
AQAATATCTQ 
TGAATTTCAT 
TGAAOTTAAC 
A6GAAATGA0 
CATAAGGAAA 
AGOGGACAAA 
ACTAATAGCT 
TGQATTTTTC 
GGTOACACAC 
CAGATA TGGG 
T6GTTAATTT 



41 



CTTGTGCTGT 
GAGGACTCCA 
GATGTGAAAC 
CAGAAGTTCC 
CGCAAGCCCA 
AAGTGGAGGA 
GATGGTQTTQ 
ACATTCTCCA 
CATGGAGACT 
GQACCTGGGC 
TCAQGCACCA 
CACTCAGCCA 
CASTTCOGOC 
GCCTCTACTQ 
GCCAAGTGTG 
TTCTTTAAAO 
TTGATTTCTG 
AOOlOGaACA 
GTACAAGCAG 
ATTGATGCAG 
TACTGGACSAT 
GATQACTTTC 
TACTTCTTCA 
ATATTAAAQA 
TGTTTTTAAT 
TTOCTQCATO 



600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



51 
I 

STFLQBGMPT 
ITQIQPQQLV 
NEMRRITSDP 
VFNELVGKQR 
GGIVLPNDGQ 
LIPKSAVGTL 
NGRKCSKI8I 
IPESPKCSiBG 
CVOGQCVGRK 
DCSLDT8TCE 
AFNRGEKKDT 
NEVMVHWEN 
MtTAKHDTGEH 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



51 

I 

TGTGTCTGCC 
ACAAGGATCT 
AOTTTAGAAG 
TTGGGTTGGA 
GGTOTGGAOT 
AAACCCRCCT 
ATTCTGCCAT 
GGCTGTATGA 
TTTACTCTTT 
7TTATGGAGA 
ATTTATTCCT 
ACACTGAAGC 
TTT06CAAQA 
AGGAACCCCT 
ATCCTQCTTT 
AC3VGATATTT 
CATTTTGGCC 
GG6TTTTTAT 
GTTATOCAAa 
CTGTTTCTGA 
TTGATGAAAA 
CAGGAGTTGA 
OTQQATCATC 
GTAACAGCTG 
AAATCXAATA 
TTCTGTQACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
lOBO 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



951 
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GAAGAAGATG AGCCTTGCRG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 
ACTTGCTTTT GAATTGOVCT GAACAQAATT AAGAAATACT CAT6TGCAAT AGGTGAGAGA 1680 
ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCT6TTC 1740 
CTT 

Seq ID NOi 8 Protein aequence 
Protein Accession #: NP 002416 



MHLAFLVUjC 
KXQGMQKFLG 
PDLPRDAVDS 
HAYPP6FGLY 
FTELAQFRLS 
RGEYIiFFXDR 
ZRGNEVQAGY 
PRLIADDPPG 



11 
1 

LFVCSAYPLS 
LEVTQKLDTD 
AIEKALKVWE 
GDIHFDDDEK 
QDDVNGIQSL 
YFWRRSBHNP 
PROIBTZiGFP 
VEPKVDAVLQ 



21 

I 



TLEVMRXPRC 
BVTPLTFSRL 
WTEDASGTNL 
YGPPPASTEE 
EPEFHLZSAF 
FTZRKZDAAV 
AFGPFYFPSG 



31 
I 

DLAQQYLEKY 
GVPDV6HPSS 
YBGEADIMIS 
FIiVAAKELGH 
PIiVPTKSVPS 
HPSliPSYLDA 
SDKBKKKTYF 
8SQFSFDFNA 



41 
I 

YNLBKDVKQP 
FFOfFKWRKT 
PAVKEHGDPy 
SLQLFHSANT 
GSCMPAKCSP 
AYBVNSRDTV 
PAAOKyNRFD 
RMVTHIUCSH 



51 
I 

RRKDSNLIVK 
HLTTRIVNVT 
SFDGPGH8LA 
BALMYPLYMS 
AL8PDAI8TL 
FIFKGNEFHA 



Seq ID NOi 9 DNA sequence 

Nucleic Acid Accession «i XM_0S81B9.2 

ceding sequence: 169.. 774 



1 
I 

GAAOACCAOC 
AGTATTTCAC 
TATQAAGTGA 
AAGTGTGGAG 
AACATATTAT 
AACTIWGTGT 
ACAGTTCTTC 
AGCAAAAAAT 
GGATACTGCX: 
GATGGCTGGQ 
T66ATTCAGT 
CTCATAACCX: 
TCC3U«3ATAC 
ACAAAATGTT 
GACT TGAGGG 
TTTGCAGTCC 
AATAT6CATC 
' 6AATCTGTAC 
TGCAACATTA 
AGQATTAAGT 
ATAAAAAATA 



11 
I 

TCAGCTCTTC 
ACT6AGATTG 
AGGQCTCTGA 
GCTGCCTAAQ 
TGTATTTCCC 
G6TATTTTGA 
TGGTACTGGA 
ATGTQACACT 
TGOTCATCTC 
AGTATGCTTT 
6CCTGGAACC 
TCAGTGGGCT 
T6TGTGGAAG 
TTCCATTATC 
CAATATTGAA 
TCACTGCACA 
AGAAACTTCA 
TAGGGAAGTA 
TTTAATATTC 
AAAGAGTGGT 
AAGAGTACTA 



21 

I ^ 

AOTTOTTQAT 
TGGGCT606G 
CCCTGGAAGT 
TTGTTTGCTG 
GAATGGGCAA 
AGGAATCTGT 
GAATAATAAC 
OCTGTCAATT 
TGCCTTGGGT 
TGAAGGCACT 
T6CACAT0TT 
TCAAGTGATC 
CTATTCAGTG 
AAGACATGGC 
AT6ATGGTGC 
TGCAAOTATA 
GAAATACTTC 
AATAAGAATA 
TGGAAAAT1X3 
ACATACTGTA 
CATQQTTGTA 



31 
I 

CATTQTCTAT 
GTATATTCCA 
GGTTCTAAGC 
ATTCCGCTTG 
ACTTCCTATG 
TTCTCAGGCA 
AACTATAAAT 
ATCTTTTCTT 
CTTGTCCAAG 
GCTGGAC6TT 
GTGGAGTGGA 
ATCTGCCTCA 

ATcrrccAGC 

CATCTATCTA 
TTTCTGCATT 
CCACCCTTCC 
TGCCCTTTGA 
TGAGAGAAAC 
GAAACACGCC 
AATQTTTTCT 



41 
I 

TOTTCTGCAA 
ATTCCCCGTC 
AGGGCAAAAT 
CACTTTGGAG 
CATGCAGCAA 
TCAT6ATGCT 
6TTGGCAGA0 
CCCTOGGAAT 
GGCCATATTG 
TCCTTACAGA 
ACATCATTTT 
TCAGAGTAGT 
CTGGAATCAT 
AATATTATAT 
TGGTGTTTAT 
ATTTAGTAT6 
TCAAACAAAT 
CTTTATGCAA 
AAAATTCTAA 
QATATTAAAA 



51 

I 

AGAOTAAACC 
T0CTCAT6AA 
GGGGTCTOGG 
TATAATCGTG 
TAAACTCACC 
TATAGTAACA 
TGAAAACTGC 
TGCTTTTTCT 
CCGCACCCTT 
TTCTAGCATA 
ATTTTCCATT 
CATGCAACTA 
TTQAATAAGG 
CAACTGTGTA 
TTGTAA AAAA 
TTTTTTAAGT 
CC3VTTTCCAA 
ATATGTATAT 
ACTCAGAGGA 
AAAAAATTAA 



Seq ID KOi ID Protein sequence 
Protein Accession fti XP_0S8189.1 

1 11 21 31 41 51 

1111)1 
MGSRKOGGCL SCLLIPLALW SIIVNILLYF PNGQTSYASS NKLTNYVWYF EGICFSGIMM 
LIVTTVLLVL ENNNNYKCCQ SENCSKKYVT LLSIIFS81.G lAPSGYCLVI SALGLVQGPY 
CRTLDGHEYA FEGTAGRFLT DSSIWIQCSjB PAKWEWNIZ LFSILITLSG LQVIICLIRV 
VMQLSKIIiOG SYSVIFQPGI X 

Seq ZD NOi 11 DNA sequence 

Nucleic Acid Accession #: 1IM_002421.2 

Coding sequence t 1..1409 " 



ATGCACAGCT 
CC3VG06ACTC 
TACTACAACC 
GTTGAAAAAT 
GCTGAAACCC 
GTCCTCACTG 
TACAOGCCAO 
TGGAm-AATG 
ATATCTTTTQ 
CTTGCTCATG 
GAAAGGTGGA 
GGCCATTCTC 
ACCTTCAGTG 
GGACX3TTCCC 
AAGCTAACCT 
TTCTACATGC 
TGGCCACAAC 
OGQTTTTTCA 
OCCAAGGACA 
CTTTCTGAGG 
OATOAATAXA 



11 

1 

TTCCTCCACT 
TAOAAACACA 
TGAAOAATGA 
TGAAGCAAAT 
TGAAGGTGAT 
AGGGGAACCC 
ATTTGCCZAAQ 
TCACACCTCT 
TCAGGGGAQA 
CTTTTCAACC 
CCAACAATTT 
TT66ACTCTC 
GT6ATGTTCA 
AAAATCCTGT 
TTGATGCTAT 
GCACAAATCC 
TGCCAAATGG 
AAGGGAATAA 
TCTACAGCTC 
AAAACACTX3G 
AAC6ATCTAT 



21 

I 

GCTGCTGCTG 
AGAGCAAOAT 
TGGGAGGCAA 
6CAGGAA7TC 
GAAGCAOCCC 
TGGCTGGGAG 
AGCAGATGTG 
GACATTCACC 
TCATCGGGAC 
AGGCCCAGGT 
CAQAGAGTAC 
CCATTCTACT 
GCTAGCTCAG 
CCAGCCXS^TC 
AACTACGATT 
CTTCTACCOO 
GCTTGAAGCr 
OTACTGGOCT 
CTTTGGCTTC 
AAAAACCTAC 
GGATOCAOGT 



31 
I 

CTGTTCTGGO 
OTGGACTTAG 
GTTGAAAA6C 
TTTGGGCTGA 
AGATGTGGAG 
CAAACACATC 
GACCATGCCA 
AAGGTCTCTG 
AACTCTCCTT 
ATTGGAGGGG 
AACTTACATC 
GATATCGGGG 
GATGAC3VTTG 
6GCCCACAAA 
CGGGGAGAAG 
aAAOTTQAGC 
GCTTAOGAAT 
GTTCAGGGAC 
CCTAGAACTG 

TTCTTT o rro 

TATCCCAAAA 



41 

I 

GTGTGGTGTC 
TGCAGAAATA 
G6ABAAATAG 
AAGTGACTGG 
TGCCTGATOT 
TGACCTACAG 
TTOAGAAAGC 
AGGGTCAAGC 
TTGATGQACC 
ATGCTOVTTT 
6TGTTGG6GC 
CTTTGAT6TA 
ATGGCATCCA 
CCCCAAAAGC 
TQATGTTCTT 
TCAATTTCAT 
TTGCOQACAG 
AGAATGTGCr 
TGAAGCATAT 
CTAACAAATA 
TGATAGCACA 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 
180 



51 
I 

ACACAGCTTC 
CCTGGAAAAA 
T6GCCCAGTG 
GAAACCAGAT 
GGCTCAGTTT 
GATTGAAAAT 
CTTCCAACTC 
AGACATCATG 
TGGAGGAAAT 
TGATGAAGAT 
TCATGAACTC 
CCCTAOCTAC 
AGCCATATAT 
ATOTQACAOT 
TAAAGACAGA 
TTCTGTTTTC 
AGAT6AAGTC 
ACAOGGATAC 
OQATGCTGCT 
CTGGAGGTAT 
TGACTTTCCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



952 



wo 03/042661 



GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAOATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA AOGAAGAOAA TTTT6ACTCT CCAGAAAaCT 1380 

AATAGCTGGT TCAACTGCAG GAAAAATTAG 

Seq ID NOi 12 Protein eequence 
Protein Accession ft: NP_002412.1 

1 11 21 31 41 SI 

1 I i 1 I i 

KHS7PPLLLL LFW6WSHSF PATLETQEQD VDIiVQKYLSR YYNLKNDGRQ VEKRRNSGPV 60 

VEKLXQMQBP POLKVTGKFO AETLKVMRQP RG6VPDVAQP VLTB6NPRWE QTHLTYRXEN 120 

YTPDLPRADV DKAIEKAFQI« WSNVTPLTP7 KVSBGQADIM ISFVRGDHRD HSPPDGPGGN 180 

liABAFQPOPO IGGDAHFDED ERVITNNPREY MLHRVAAHEL GHSLOLSHST DIGALMYPSY 240 

TPSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTPDAITTI RGEVMFFKDR 300 

PYHRTNPPyP EVSliNFISVP HPQLPNGliEA AYEFADRDEV RPPKGMKlfWA VQGQMVLHGY 360 

PKOZYSSFOF PRTVKHIDAA LSBQ3T6KTY PFVANKYHRY DEYKRSMDPG YPXMXARDPP 420 
GZ6HKVDAVF HKD6FPYFFH GTRQYXPDPK TKRZLTIiQKA NSVFNCRKN 

Seq ID NO I 13 DNA sequence 

Nucleic Acid Accession ft: NM_002421.2 

coding sequence: 1..1409 

1 11 21 31 41 51 

I i I I I i 

ATGCACAGCT TTCCTCCACT GCTGCTGCXO CTGTTCTGGO G TGTGGTGTC ACACAGCTTC 60 

CCA606ACTC TAQAAACACA AGAGCAAGAT OIGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAA6C GGAGAAATA6 TGGCCCAGTO 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTG6GCTGA AAGT6ACTGG QAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGOAO TGCCTGATGT GGCTCAGTTT 300 

GTCCTCACT6 AQGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACA6 6ATTGAAAAT 360 

TACACGCCAO ATTTGCCAAG AGCAOATOTa GACCATOCCA TT6A6AAA6C CTTGCAACTC 420 

TGGAOTAATQ TCACACCTCT GACATTCACC AAGGTCTCT6 A6GGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540 

CTTGCTCATG CTTTTCyVACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGOGGC TCATGCCCTC 660 

G6CCATTCTC TTGGACTCTC CCATTCTACT GATAT06G6Q CTTT8AT3TA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTA6CTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780 

GGACGTTCCX: AAAATCXTIGT CCA6CCCATC GGCCCACAAA CCCCAAAAGC ATGTQACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TQATGTTCTT TAAAGAC3M3A 900 

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGQCCACAAC TOCGAAATGO GCTTGAAOCT GCTTAOQAAT TTGOOGACAO AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA GTACTGOGCT GTTCA6G6AC AGAAT6T6CT ACAGGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAOAACTG TGAAGCATAT CX3ATGCT6CT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTQ CTAACAAATA CTGGAGGTAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCGCAAAA T6ATAGCACA TGACTTTCCT 1260 

GGAATT8QCC ACAAAOTTQA TOCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATA CTAAT T TGATCCTAAA AGGAAGAGAA TTTT6ACTCT CCAGAAAGCT 1380 
AATAGCTGGT TCAACTGCAa GAAAAATTAG 



Seq ZD NO: 14 Protein eequence 
Protein Accession fti NP_002412.1 

1 11 21 31 41 51 

I I I I I ) 

MHSFPPLLLL LFW6W8HSP PATLKTQfBQD VDLVQRYLEK YYNLIQIDQRQ VBKRSN86PV 60 

VEKLKQKQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQP VLTBCS^RWE QTHIiTYRIBN 120 

YTPDLPRADV DHAIBKAFQL WSNVTPLTFT KV8EGQADIM I8FVRGDHRD NSPFDGPGGN 160 

LAHAFQPGPO IGGDAEFDED ERVfTNNFREY NLHRVAAHAL GHSLGI.SHST DIGALMYPSY 240 

TPSGDVQLAQ DDZDOZQAIY GRSQNPVQPI GPQTPKACDS KLTFDAZTTI R6EVMFPKDR 300 

FYMRTNPFYP EVBUIFZSVP NPQLPNGLEA AYEFADRDEV RPFK6NKYHA VQGQNVLHGY 360 

PKDZYSSFQP PRTVKHIDAA LSEENTGKTY PFVANmURY DEYKRSMDPG YPKMIAKDPP 420 
GIGHKVDAVP MKDGFFYFFH GTRQYKPDPK TXRZLTLQKA NSWFHCRKN 



Seq ZD N0( 15 DHA sequence 

Nucleic Acid Accession ft: FGENBSB predicted 

coding sequence: 141..15B0 

1 11 21 31 41 51 

I 1 1 I I I 

TCTGGGTGTG CCGGG(Kn!AG OGGCTGGAAG TCCTGGCTCT AGTTGCACCT 06GAAG6AAA 60 

AGGCAAACAO AGGAGGGAAG GC6TCTTA60 ACTGCCTGGA TCCA6A6CAC TTTCCTOGGC 120 

CTCTACAGOC CTGTGTCGCT ATGGGTTCCC CCGCCGCCCC GGAGGGAGCG CTGGGCTACG 180 

TCCGOQAGTT CACTGGCCAC TCCTCGGAOG TGCTGGGCAA CCTCAACGAG CTGCGCCIGC 240 

OCGGQATCCT CACTGACQTC AOGCTGCTGG TT06CGG6CA ACCCCTCAOA GCACACAAGO 300 

CAGTTCrCAT OOCCTGCAGT GGCTTCTTCT ATTCAATTTT CG6GGGCC6T G0GGGAGT06 360 

GGGTGGACGT GCTCTCTCTG OCCGGGGGTC COGAAGCQAG AGGCTTCOCC CCTCTATTGG 420 

ACTTCATGTA CACTTCGOGC CTGCGCCTCT CTCCAGCCAC TGCACCAGCA GTCCTAGOGG 480 

COGCCACCTA TTTGOVGATG GAGCACGTGG TCCAGGCATO CCACOGCTTC ATCCAGGCCA 540 

GCTATGAACC TCTGGGCATC TCCCTGCGCC CCCT6QAA6C AGAACCCCCA ACACCOCCAA €00 

OGGCCCCTCC ACCAGOTAOT CCCAGGOQCT CCQAAOGACA GCCAQACCCA CCTACTGAAT 660 

CTOQAAGCTO CAGTCAAGQC CCCCCCAGTC CAGCCAGCOC T6ACC0CMG GGCTGCAACT 720 

GGAAAAAOTA CAAGTACATC GTGCTAAACT CTCAGGCCTC GCAAGCAGGO AGGCTOOTCG 780 



953 



wo 03/042661 



PCTAJS02/36810 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



OGGAQAGAAG TTCT6GTCAA CCTTGCCCCC AA6CCAG0CT GCCCAGT6GA GACGAOGCCT 840 

CCAGCA6CAG CAGCAGCAGC ACCAGCAGCA GTGAAGAAGG AOOCATTCCT GGTCCCCftGA 900 

GCAGGCTCTC TCCAACTGCT GCCACKJTGC AGTTCAAATQ TGGGGCTCCA GCCAOTACCC 960 

CCTACCTCCT CACATCCCAG GCTCAAGAOV CCTCTGGATC ACCCTCTGAA OGGGCTGGTC 1020 

CACTACOQGG AAGTGAATTT TTCAGCT6CC AGAACTGTGA GGCTGTGGCA GGGTGCTCAT 1080 

OQGGGCTGQA CTCCTT60TT CCTGGOGAGQ AAOACAAAOC CTATAAOTGT CAGCTGT G CC 1140 

GGTCTTGGTT CCGCTACAAG GGCAACCTTG CCAGTCATCO TAC3VGT0CAC ACAGOGQAAA 1200 

AGCCTTACCA CTGCTCAATC TGCGGAGCCC GTTTTAACOG GCCAGCAAAC CTGAAAAOGC 1360 

ACAGCCGCAT OCATTCGGGA GAOAAGCCGT ATAAGTGTGA GACGTG0G6C TCGCGCTTTO 1320 

TACAGGTGGC ACATCTGCGG GCX3CACGTGC TGATCCACAC CGGGGAGAAO CCCTACCCTT 1380 

GCCCTAGCTO 06QAACC0GC TTCOGCCACC TOCAOAOCCT CAAOAGCCAC GTTGGCATCC 1440 

ACACCGGAGA GAAGCCTTAC CACT60SACC OCTGTGGCCT GCATTTCOGG CACAAGAGTC ISOO 

AACTGGGGCT 6CATCTGOGC CAGAAACACO QAGCTGGTAC CAACACXZAAA GTGCACTAGC 1560 

ACATTCrCGG GGGGCCCTAO CTGAGOGCAG GCCCAGGCCC CACTTGCTTC CrGC3GGGTGG 1620 

GAAA6CTGCA GGCCCAGGCC TTGCTTCCCT ATCAGGCTTO GGCATAGGGG TGTGCCAGGC 1660 

CACTTTCGTA TCAOAAATTQ CCACCCTCTT AATTTCTGAC TQGGGAOAOC AGQGGTGGCA 1740 

GATCCTGGCT AGATCTGCCT CTCSTTTTGCT GGTCAAAACC TCTTCCCCAC AA6CCAQATT 1800 

GirrCTGAGG AGA6AGCTAG CTAG6GGCTG GGAAAGGGGA GAGATTGQAG TCCTGGTCTC 1860 

CCTAAGGGAA TAGCCCTCCA CCTGTGGCCC CCATTGCATT CAGTTTATCT GTAAATATAA 1920 

TTTATTQAOa CCTTTOGOTO GCACOGQOGC CTTCATTCGA TTOCATTTCC CACTCCCCTC 1980 

TTOCACS^AGT GTGATTAAAA GTQACCAGAA ACACAGAAGO TGA6ATCACA OCTCTOCTGO 2040 

CAGAGATTAC TAGOCCTTGG CTCTCTOGTT TCGCTTGOaT ATTTTATATT ATTTCTGTCA 2100 

TAACTTTTAT CTTTAGAATT GTTCTTTCTC CTGTTTGTTT GCTTGTTAGT TTGTTTAAAA 2160 

TGGAAAAAGG GGTTCTCTGT GTTCTGCCCC TGTAATTCTA GGTCTGGAAC CTTTATTTGT 2220 

TCTAGGGCAG CTCTGGGAAC ATGOGGGATT QT6GAATTGG GTCAGGAACC CTCTCT6GTA 2280 

TTCTGGATGT TGTAGGTTCT CTAGCAGTCT AGAAATGGAT ACAGACATTT CTCTGTTCTT 2340 

CAAGGOTGAT AGGAACCATT ATGTTGAGCC CAAAATGGAA GTAATAATAA ATGCCTCCTG 2400 

GAGGCTGTGG GTGTGGGGGA TTCTGTATCT GGATTCCGTA TCACTCCAAC TGGAGGCTGT 2460 

GGGTGTGGGG GATTCTGTAT CTGGATTCOG TATCACTCCA AGTGGA6GCT GGCAGGTTTT 2520 

TCTGCAAGAT GGTCCAGAAT CTAAAATGTC CCATTAATCT GGTCACTTGG GTTTGGCTCr 2580 

GCTGTATCCA TCTATAGTGG TA6AGACCCA GCAGG6CTCA AGTOGAGTCC ATCATCCTCC 2640 

CAC6GGGGCC TGTTCTTAGC ACTGAGTTGA TOGCTCCATG GGGGAGAGAT CAGACATTCC 2700 

TTATCAGAGA TGATGTGACC TTTTCTGACT CTGCCCAGTC TCTATGAATO TTATQGCCTA 2760 

GGGAAGAATC ATGAAACTCT TTAGCTTGAT TAGATGGTAA ACAGTGTTAA CCCATCCTTT 2820 

ACTACAGAGG CATATGGGTT T6AATGTTAC CTGGGGTTCT CTCTATTGAG TTGAGCCCCT 2880 

TCTTOCTTTA GTQG6TTTTG GACATCTTCT GGCAAGTGTC CAGATQCCAQ AACCTTCTTT 2940 

TCCTCTAGAA GG6ATGGTGC TTGGTAACCT TACCTTTTAA AAGCTGGGTC T6TGACX:TGG 3000 

TCTTCCCATC CCTGCATTCC TGTCTGGAAC CAGT6AATGC ATTAGAACCT TCCATAGGAA 3060 

AAGAAAAOGG GCTGAGTTCC ATTCTGGGTT TGCPGTAGTT TGQTTGGGAT TATTOTTGQC 3120 

ATTACA6ATG TAAAAGATTG ACTAGCCCAT AGGCCAAAOG CCTGTTCTAG TTGACCAAGT 3180 

TTCAAGTAOG AT7AAGAGQT TQOTTGAGOQ GTGCAOTTTC TGGIOTAOGC CAQQTAGaTA 3240 

GAAAGTGAG6 AACA6G6TTG CCTCTTGGCT GG6TGGAGTC TCT6AAATGT TAGAAGAAGC 3300 

GCTGAAGCGT TQATTGATAG TTCTQCCCCT TGTTGCCCTO GGGCTTATCT GATTATGGGA 3360 

CGAGGGTAGA AAGTAAGAAG CACTTTTGAA TTTGTGGGGT AGAACTTCAA CAATAAGTCA 3420 

GTTCTAGTGG CIGTGGCCTG GGOACTAGTG AQAAAGCTAC TCTTCTCCCT CTTCCCTCTT 3480 

TCTCCCCATG GCCCCACTQC AOAATTAAAO AAGOAAQAAO OQAAQQOSQA GOAOTCTATA 3540 

AGAAG6AATC ATGATTTCTA TTTAGCAGAT TGGATGGGCA 08T8GAGAAT GCCTOGGOOT 3600 

ASAAATGTTA GATCTTOCAA CATCAOATOC TTGGAATAAA GAAGOCTCTC TGYGCWRAAA 3660 
AAAAAAAAAA AAAAAA 

Seq ID NDt 16 Protein eeqttence 
Protein Acceseion ft: F6ENB8B predicted 



MGSFAAPEQA 
OFPySIFRGR 
EHWQACHRF 
PPSPASPDPK 




FSOQNCBAVA 
CX3ARFNRPAN 
FRHLQTLKSH 



Seq ID NOx 17 DNA sequence 

nucleic Acid Accession tt XH_O39209 

Coding sequence t 1..2049 



41 

I 

TLLVGGQPLR 
XAXjSPATAPA 

PRRSEGHPDP 
PCPQARLPSQ 
AQDTSGSPSS 
GNLASHRTVH 
AHVLXHTGEK 
QXH6AATNTR 



51 
I 

ARKAVZiXACS 
VLAAATYLQM 

PTESRSCSQG 
DBASSSSSSS 
RARPLPGSEF 
TGEKFXHCSI 
pypCPTOGTR 
VHYBIUSGP 



60 
120 
180 
240 
300 
360 
420 



ATQCTGAAGA 
GGAGATGCTA 
AATGGGAAOC 
CTGCTGAGTO 
CXSGAOTGACA 
ACAGAATGTQ 
A6CCTGTTGC 
TGCAAAGACT 
CAAACAACTG 
CCAGATTTTC 
QA ATATQAC A 
GAGGTT8TGA 
OOTCTCTTCA 
TTCAAGQAGC 



11 
I 

TGCTCTCCTT 
AGTTTGGGGA 
CCCCGAAGOG 
GGGGAGAGAT 
GCCOGGGGCT 
GGAAOTTACT 
ACTCACCTQA 
ATTGCAAAGA 
OGGATGAGTT 
CAABAAAACA 
AAGTQ6AAGA 
OTOGGCTGGS 
TTCT GOAAA A 
CTTATTTGOA 



21 
I 

TAAGCTQCTG 
AAOAAAOGAA 
CXTTGAAAAGG 
GCTGTGOGGT 
AGGGOGCCTG 
GGAGGAAATC 
GAGAGAAGTC 
ATTCTTTTAC 
TTGCTTTTAC 
AGTCAGAGGA 
OATCAGCAOA 
GCAGCC06TT 
AOAAGGrTAT 
CATTCACAAA 



31 

I 

CTQCTGGCOG 
GGGA60GGAG 
AGAGACAGGA 
GGCTTCTACC 
GAQAATAAGA 
AAATGTGCAC 
TTGGAAAGAG 
ACTTGCOQAQ 
TATGCAAGAA 
CCAGCATCTA 
AAGC ACAAAC 
GGT0CCCT6C 
OTOAAGATAC 
CTTOTTCAAA 



41 

I 

TGGCTCTGGG 
CAAGGAGGAG 
GGATGATGTC 
CTCGGCTGTC 
TATTTTCTGT 
TTTGCTCTCC 
ACCTAGTACT 
GCX3VTATTCC 
AAGATGGT06 
ACTACTTGQA 
ACAACTGCTT 
ATAGTGGGGA 
TTACCGCTQA 
OTQGAATAAA 




CCAGATGGAA 
CTGTATTCAG 
TGGCTCX3C3U^ 
AGGAGAAATT 
GQGAGQAGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 



954 



wo 03/042661 



GAAAGA06AC TGCTAAGCCT OGCATTCQIT CCCAATTACA AGAAAAATGG AAA G TAVl'AT 900 

GTGTCCTATA CCACCAACCA AGAAOGGTQQ 6CTAT0GGGC CTCATGACCA CATTCTTAGG 960 

6TTGTGGAAT ACACA6TATC CAGAAAAAAT CCACACCAAG TTGATTTGAG AACAGCCAGA 1020 

Givrn tr rro aagttgcaga actccacaga aagcatctgg gaggagaact gctctttggc loso 

CCTGAOGGCT TTTTGTACAT CATTCrrGGT GATOGGATGA T7ACACTGGA TGATATGGAA 1140 

GAAATGGATG GGTTAAQTGA TTTCACAGGC TCAGTGCTAC GGCTGGATOT GGACACAGAC 1200 

ATGTGTAAOG TGCCTTATTC CATACCAAGG AGCAACOCAC ACTTCAACAO CACCAACCAG 1260 

CCCCCCGAAG TGTTTGCTCA TGGGCTCCAC GATCCAGGCA GATGTGCTGT GGATAGACAT 1320 

CCCACTGATA TAAACATCAA TTTAA06ATA CTGTGTTCAG ACTCCAATGG AAAAAACAGA 1380 

TCATCAGCCA GAATTCTACA GATAATAAAG GGGAAAGATT ATGAAAGTGA GCCATCACTT 1440 

TTAGAATTCA AGCCATTCAQ TAATGGTCCT TT GG TTGGT G GATTTGTATA COGGGGCTGC 1500 

CAGTCAGAAA QATTGTATGO AAGCTACGTO TTTGGAGATC GTAATGGGAA TTTCCTAACT 1560 

CTCCAGCAAA GTCCTGTGAC AAAGCAGTGG CAAGAAAAAC CACTCTGTCT OGGCACTAGT 1620 

GGGTCCTGTA GAGGCTACTT TTCCGGTCAC ATCTTGGGAT TTGGAGAAGA TGAACTAGOT 16 80 

GAAGTTTACA TTTTATCAAG CAGTAAAAGT ATGACCCAGA CTCACAATGG AAAACTCTAC 1740 

AAAATTGTAG ATCCCAAAAG ACCTTTAATG CCTGAGGAAT GCAGAGCCAC G6TACAACCT 1800 

GCACAGACAC TGACTTCAOA GT6CTCCAGG CTCTGTCGAA ACGGCTACTG CACCCCCACQ 1860 

GGAAAGTGCT GCTGCAGTCC AGGCTGGGAG GGGGACTTCT GC3W5AACTGC AAAATGTQAG 1920 

CCAGCATGTC GTCATGGAGG TGTCrGTQTT AGACCGAACA AGTGCCTCTG TAAAAAAGGA 1980 

TATCTTGGTC CTCAAT6TGA ACAAGTGGAC AQAAACATCC GCAGAGTQAC CAGGGCAGAC 2040 
ATCAGCTAG 



Seq ID NO: 18 Protein sequence 
Protein Accession #: XP_039209 

1 11 21 31 41 51 

I I I I i I 

MLKMLSFKLL LLAVALGFFE GDAKFGERKE GSGARRRRCL NGNPPKRLKR RORRMMSQLB 60 

UiSGGEMIiCX3 GFYPRIjSCCL RSDSPGLGRL aiKIFSVTNN TEOGKU^I KCALCSPHSQ 120 

8LFHSPEREV LERDLVLPLL CKDYCOCEFPY TCRGHIPGFL QTTADEFCFY YARXDGQLCP 160 

PDFPRKQVRG PASNYLOOME EYDKVBBISR KHXBNCFCZQ EWSQLRQPV OALRSGDGSQ 240 

RLFILEKEGY VKILTPEGEI FKEPYLDIHK LVQSGIKGGD ERGLLSLAFH FNYKKNOKLY 300 

VSYTTNQERW AIGPHDHIIiR WBYTVSRKN PHQVDLRTAR VFLEVABLHR KHLGGQLLFG 360 

PDGFLYIILG DGMITLDI^E EMD6LSDFTG SVLRLDVDTD KOtVPYSIPR 8NPHFNSTNQ 420 

PPBVFABGLH DFGRCAVDRH PTDINIKLTI LCSDSilGXNR 6SARXLQXIK GKDYBSBPSL 480 

LBFKFFSMOP LVGGFVYRGC QSERLYGSYV FGOBNGHFLT LQQSFVTXQIf QEXPLCLOTS 540 

GSCRGYFSGH ZLGFGEDEU3 EVYXLSSSKS KTQTBNGKLY KIVDPKRPIM PEBCRATVQP 600 

AQTLTSBC8R LCRNGYCTPT GK0CC8P6WE 6DFCRTAKCE PACSBGGVCV RPNKCLCKXG 660 
YLGFQCEQVD R29IRRVTRAD IT 

Seq ID NOi 19 DNA aequence 

nucleic Acid Accession #t NM_014331.2 

Coding sequence t 1..1506 

1 11 21 31 41 51 

I I . I I I I 

ATGGTCAGAA AGCCPGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TGCCTTCXXTT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAOTC TCCATTATCA TTGGCACX3VT CATTGGAGCA 160 

GGAATCTTCA TCTCTCCTAA GQQCX3T0CTC CAQAACAOSO GCAGCXSTQQO CATGTCTCTO 240 

ACCATCTGQA OGGTQT G TGQ QGTCCTGTCA CTATTTGGAG CTTTSTCTTA TGCTQAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTOTAOGAGT CTGGGTGGAA CTCCTCATAA TAOGCCCTGC AGCTACTOCT 420 

GTGATATCCC TGGCATTTGG A06CTAC3VTT CXX^GAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTQ 00ATCAA6CT CATTACAGCT OTGGGCATAA CTGTAOTQAT 60TCCTAAAT 540 

AGCAT6AGTG TCAGCTQGAG CX3CC0GGATC CAGATTTTCT TAACCTTTTG CAA6CTCACA 600 

GCAATTCTGA TAATTATAGT CGCTGGAGTT ATGCAGCTAA TTAAAGGTCA AAOGCAGAAC 660 

TTTAAAGAOG OGTTTTCAGG AAGAGATTCA A6TATTA0GC GGTTGCCACT GGCTTTTTAT 720 

TATGGAATGT ATGCATAT6C TG6CTGGTTT TACCTCAACT TTGTTACTGA AGAA6TAGAA 760 

AACCCTGAAA AAACCATTCC OCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGOC 640 

TATGT6CTGA CAAATGT66C CTACTTTAOG ACCATTAATG CTGAG6A6CT GCTGCTTTCA 900 

AATGCAGTGG CAQT6ACCTT TTCTGAGOGG CTACTGOQAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTQ CTTTGGCTCC ATGAAC6GTQ GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCX9 AGAGGGTCAC CTTCCAQAAA TCCTCTCCAT GATTCATGTC 1080 

CX3CAAGCACA CTCCTCTAOC AGCTGTTATT OTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCrGQAG ACCTG6ACA0 TCTTTT6AAT TTGCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

G6GCTGGCA0 TTGCTGGGCT GATTTATCTT GGATACAAAT GCCCAGATAT QCATOGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATT06GAC0C ATTTAGTACA GGGATTGGCT TOGT CATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATG6 GACAAGAAAC CCAOGRTGGTT TAGAATAATQ 1440 

TCAOAGAAAA TAACCAGAAC ATTACAAATA ATACT06AAG TTGTACGAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCT6CCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGQAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAQAA AGTGAATATO CAGTTATTCT AT6AGT0GGA CAATTCTTGA 1740 

GTCTCT6ATA CX^ACCTATT 0GGGTTA66A GAAAAGACTA GACAATTACT ATGTQGTCAT 1800 

TCTCTACAAC ATATGTTAGC AOGOCAAAGA ACXTTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGQ TTTTGTAAAG ATGOTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTO 1920 

TTTTCAATTC TGAAAAAAAO CATACATCAT 6ATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCGOCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACXX3k QAOCACTTTG AACAAAGOTC AGIGGG6ATT GTTQAATAC3i 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAOACACA TOCAGGAGTT ATGTTTAA6T 2160 

AAAAATCCTT GA6AATTTAT TATQTCAGAT GTTTTTTCAT TCATTATCAG GAAOTTTTAG 2220 
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TTATCTGTCA ■ nYrnHTX TCACATCAQT 
ACCAAtSAOTT AGTTTGGTAT TAAATCCTCA 
TACCCCTGAT QAOTCTATCT AAACATATGC 
TC5AGAGAAAT AACCAACAAA GAAGATGTTC 
5 CTACATOCAA TGTTAGTAAT TCTGAAGTTT 
TGAATTTTOA CAGTTTGT6C ATTTTCTTTA 
CTTCAGATGA AACTCTOCAG ATTAATTAGO 
AAGAAATCSTC GCTGTAAATA AGATTTACAA 
ATCTAGGCTT TGTCAGTAAT TTCCACACCT 

10 CTOATAAGAA OAAAATTGAA ATGAGAATCT 
GTTTTGOOVG TATTAGAAAA TACTGTGAaC 
GCACTTTGGO AGGCTGAGGG GGT06ATCAC 
CAACATGGAG AAACCCCATC TCTACTAAAA 
GCTGGTAATC TCAGCTATTQ AGOAGGCTQA 

15 GA6GTTGCAG TGAGCCAAGA TTGCACCACT 
CCATCTCCAA AAAAAAAAAA AAAA 



TTGATCAG6A AAGTGTATAA CACATCTTAG 2280 

TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

AAAATAATAQ TCCCATATCT QTAATCATAT 2460 

TTTAAATTTA TGGCTATTTT TACACOATOA 2520 

TACATTTTAT ATTCTTCIGT TAAAATATCT 2S80 

AAAAG6CATA TATTAACATA AAAATT6CAA 2640 

CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

6TGGATAAGT GTTTGTGTTC AGAAQATGTT 2620 

GGGGCATGOT GQCTTACATC TGTAATCCCA 2880 

CT6AG6T06G QAGTTCTAOA CCAGCCTGAC 2940 

ATACAAAATT AGCTQGGCAT GGTGGCACAT 3000 

GGCAGGAGAA TTGCTTGAAC CCGGQAGGCG 3060 

GTACTCCA6C CTQGGTGACA AAGTCAGACT 3120 



8eq ID KOi 20 Protein eequence 
Protein Accession ft: HP_055146.1 

20 

1 11 21 31 41 51 

I I I } I I 

MVRKPWSTl SKGGYLQGNV NGRLPSLGNK EPP6QBKVQL KRKVTLLRGV SIIIGTIIGA 60 
GIPISPKOVL C3NTGSVGMSL TIWTVCX^VLS LFGALSYAEL GTTIKKSGGH YTYILBVPGP 120 

25 LPAFVRVNVB LLIXRPAATA VISXiAPGRYI liEPPPZQCEI PELAIKLITA VGITWMVLN 180 
SMSVSHSARZ QIPLTPCKLT AXLIZIVPGV MQLXKQQTQN PRDAPSGRDS SITRLPLAFY 240 
YGMYAYAGHP YLNFVTEBVE NPEKTIPLAX CI5MAITIGV YVLT19VAYPT TINAEELLLS 300 
NAVAVTFSER LLGNPSLAVP IPVALSCFGS MNGGVFAVSR LPYVASREGH LPBILSMIHV 360 
RKHTPLPAVI VUiPLTMIML PSGDIJ5SLLN FLSPARWLPI GIAVAGLIYL RYKCPDMHRP 420 

30 PKVPLFIPAL PSFTCLFMVA LSLYSDPPST GIGFVITliTG VPAYYLPIIW DKKPRWFRIM 480 
SBKITRTLQI ZLEWPSEDK L 



50 



Seq ID NO: 21 DNA sequence 
35 Nucleic Acid Accession #: NM_002422.2 
Coding sequence x 64.. 14 97 

1 11 21 31 41 51 

/lA > ' ' » ' ' 

4U ACAA6GAGGC AGGCAAGACA GCAAGGCATA GAGACAACAT AGAGCTAAGT AAAGCCAGTG 60 

GAAATGAAGA GTCTTCCAAT CCTACTGTrG CrOTOCGTGG CAOTTTGCTC AOOCTATCCA 120 

TTGGATGQAG CTGCAAGGGG TOAGGACACC AGCAT6AACC TTGTTCAGAA ATATCTAGAA 180 

AACTACTAOG ACCTCAAAAA AGATGTGAAA CAGTTTGTTA GGAGAAAQOA CAGTGGTCCT 240 

G TTG TTAAAA AAATCCQAOA AATGCAOAAO TTCCTTGGAT TGGAGQTGAC GGGGAAGCTG 300 

45 GACTCCGACA CTCTGQAQQT GATGCGCAAG CCCAGGT6TG GAGTTCCTGA TGTTGGTCaC 360 

TTCAGAACCr TTCCTGGCAT CCCX3AAGT60 AGQAAAACOC ACCITACATA CAGGATTGTQ 420 

AATTATACAC CAOATTTGCC AAAAGATGCT GTTQATTCTG CTGTTQAGAA AGCTCTGAAA 480 

GTCTGGGAAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA GGCTGATATA 540 

ATGATCTCTT TTGCAGTTAG AGAACATGGA QACTTTTACC CTTTTQATGQ ACCTGQAAAT 600 

GTTTTGGCCC ATGCCTATGC CCCTGGGCCA GGGAT7AATQ GAGATGCCCA CTTTQATGAT 660 

GATGAACAAT GGACAAAGGA TACAACAGGG ACCAAXTTAT TTCTGGTTGC TGCTCATGAA 720 

ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTQ AAGCTTTGAT GTACCCACTC 780 

TATCACTCAC TCACAGACCT QACTCGGTTC CGCCTQTCTC AAOATGATAT AAATG6CATT 840 

CAQTCCCTCr ATGGACCTCC CCCTGACTCC CCTGAQACCC CCCTGGTACC CACGGAACCT 900 

55 GTCCCTCCAG AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTCCTT TOATGCTQTC 960 

AGCACTCTGA GGGGAGAAAT CCTGATCTTT AAAGACAGGC ACTTTTGG06 CAAATCCCTC 1020 

AGGAAGCTTQ AACCTGAATT GCATTTGATC TCTTCATTTT GGCCATCTCT TCXZITCAGGC 1080 

GTGGATGCCG CATATGAAGT TACTAGCAAG GACCTOGTTT TCATTTTTAA AGGAAATCAA 1140 

TTCTGGOCCA TCAGAGGAAA TGAGGTACXSA GCTGQATACC CAAGAGQCAT CCAC31CCCTA 1200 

GGTTTCCCTC CAACCGTGAG GAAAATOQAT GCAGCCATTT CTQATAAG6A AAAGAACAAA 1260 

ACATATTTCT TTGTAGAGGA CAAATACTGG AGATTTGATO AOAAGAGAAA TTOCATGQAO 1320 

CCAG6CTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 1380 

GTTTTTGAAQ AATTTGOOTT CTTTTATTTC TTTACTGGAT CTTCAC31QTT GGAGTTTGAC 1440 

CCAAATGCAA AGAAAGTGAC ACACACTTTO AAGAOTAACA GCTGGCTTAA TTGTTGAAAG 1500 

65 AGATATGTAa AAGGCACAAT ATQGGCACTT TAAATGAAGC TAATAATTCT TCACCTAAGT 1560 

CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TGACT03AGT CACACTCAAO 1620 

GQAACTT6A0 OSTGAATCTQ TATCTTGCGO GTCATTTTTA TGTTATTACA GGOCATTCAA 1680 

ATGGGCTGCT GCTTAGCTTG GACCTIGTCA CATAGAGTGA TCTTTCCCAA GAQAAGGSGA 1740 

AGCACTCSStCG TGCAACAOAC AAGT6ACTQT ATCTQTOTAG ACTATTTGCT TATTTAATAA 1800 

AQAOGATTTG TCAOTTQTTT T 

Seq ID NOt 22 Protein sequence 
Protein Accession ft: NP_002413 

75 1 11 21 31 41 51 

I I I 1 I I 

MKSLPILLLL CVAVCSAYPL DQAARGEDTS MMLVQKYLEN YYDLBKDVKQ PVHRKDSGPV 60 

VKKIREMQKP LGLBVTGKLD SDTI»EVMRKP ROGVPDVGHF RTFPQIPKWH KTHLTYRIVN 120 

YTPDLPKDAV DSAVEKALKV WEBVTPLTFS RLYBQBADIM I8FAVREHGD FYPFDGPGNV 180 

80 LAHAYAFGPG IK6DARPDDD EQWTKDTTOT NLFLVAAHEI GHSLGLFKSA OTBALMYPLY 240 

RSliTDLTRFR LSQDDINGIQ SLYGPPPDSP ETPLVPTEPV PPBPGTPANC DPALSFDAV8 300 

TLRGSILZFK DRKFWRKSLR KX<BPBX^I8 8FWP8LPSGV DAAYEVT8XD LVPIPKGMQP 360 

WAIRGNBVRA GYPRGIUTLQ FPPTVRKZDA AISDKEIQIKT YFFVEDKYWR FDBRRNSMBP 420 
GFPKQIABDP PQIDSKXDAV FBBFGPPYFP TGS8QLBFDP NAKKVTHTLK SNSWLNC 
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Seq ID NO: 23 DMA Sequence 
Nucleic Acid Accession 9t NM 006528 
Coding sequence t 57.. 764 ~ 

1 11 21 31 41 51 

I i . 1 I I 1 

GCOGCCAOCG GCTTTCTCGQ AOGCCTTGCC CAGOGGGCCG C300GACCCCC TGCACCATGQ 



60 



^ ACCG0GCTG6 CCCCCTOGGO CTGTCGATTC TSCTGCTTTT 0CTQACX5QAG GCTGCACTGG 120 

lU OOGATGCTGC TCAGGAGCCA ACAGGAAATA ACGOGGAOAT CTOTCTCCTQ CCCCTAGACT 180 

ACGGACCCTG CCGOQCCCTA CTTCTCCQTl' ACTACTACGA CAGGTACACG CAGAGCTGCC 240 

GCCAGTTCCT GTACGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGA6GCTT 300 

GCGA0C5ATGC TTGCTG GAGG ATAQAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG 360 

- . TGGA06AOCA GTOTGAGG06 TCCACAOAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

ID QTGAAA AATT CTTTTCOGGT GGGTOTCACC G6AAC0GGAT TGAGAACRGG TTTCCAGATG 480 

AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 540 

AAGATGAGGG ACTGTGCTCT GCCAATQTGA CTCGCTATTA TTTTAATCCA AOATACAOAA 600 

CCTGTGATGC TTTCACCTAT ACTGGCTC3T0 GAGGGAATGA CAATAACTTT GTTAGCAGGG 660 

on AG6ATTGCAA ACGTGCAT8T GCAAAAGCTT TGAAAAAQAA AAAQAAGATO CCAAAGCTTC 720 

ZU 6CTTTGCCA6 TAQAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC TTAATATGTC 780 

ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGOTTGT ATCTGAAGAA TAATATGACA 840 

GCATGAOOAA ACAA ATCATT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 900 

TTCAAAAATT T6GATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA QTCTACCATT 960 

^ - TTTAATTTAT GOTTCAACTO TTTGTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC 1020 

AAATATGACT CACTCATTTC TTTOGGTCGT ATTCCT6ATT TCAGAAQAG6 ATCATAACTG 1030 

30 



— — •'W'Mawn* A * w« « nn\. A \a Awav 

AAACAACATA AGACAATATA ATCATGTGCT TTTAACATAT TTGAQAATAA AAAOOACTAG 1140 
OC 



Seq ID NOi 24 Protein sequence 
Protein Accession #> NP_006519 

1 11 21 31 41 51 

I I I I I I 

_ MDPARPLOliS ZUjIiFLTEAA LCa>AAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYT^S 60 

jD CRQFLYGGCE GMAHNPVTWE AO^DACWRIK KVPKVCRLQV SVDDQCEGST EKYFPNLSSM 120 

TCEK FPSGGC HRHRiaiRFP DBATCKGFCA FRXIPSFCYS PKDBGLCSAN VTRYYFNPRY 180 
RTCDAFTYTG CGGNDNKFVS REDCKRACAK ALXKXKKNPK LRFASRXRKI RKKQF 

Seq ID KOt 25 DNA sequence 
4U Nucleic Acid Accession #: NM_00545B.l 
Coding sequence I 1..2825 ~ 

1 11 21 31 41 SI 

AK \ I I i 1 I 

43 ATGGCTTCCC CX3CGGAGGTC CXIGGCAGCCA GGQCGGCCGC COCCGCOSCX: ACCGOOSCCC 60 

GCGCGCCTGC TACTQCTACT GCTGCTGCCG CTGCTGCTGC CTCT G QCGOC 06GGG0CT66 120 

GGCTGQGC6C GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC C0OCX3CTCTC CATCATGQ6C 180 

CTCATGCCQC TCACCAAGGA GGTGGCCAAG GGCAGCAT06 GGOGOGGTGT GCTCCCX3GCC 240 

GTGGAACTGG CCATOQAGCA GATCCGCAAC GAGTCACTCC TGOSCCCCTA CTTCCTOSAC 300 

jU CTGCGGCTCT ATGACAOGGA GTGOGACAAC GCAAAAGGGT TGAAAGCCTT CTAGGATGCA 360 

ATAAAATAOS GGCCQAACCA CTTGATGGTG TTTOGAaOOG TCTOTCCATC GGTCACATOC 420 

A TCATTG CAQ AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTOC TGCAACCA03 460 

CCTQTTCTAG CCGATAAGAA AAAATACCCT TArTTCTTTC GGACCGTCCC ATCAGACAAT 540 

GOGOTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AOTGGAAGCG CQTQGGCAOS 600 

30 CTGACGCAAG ACX5TTCAGAG GTTCTCTGAG GTGOGGAATO ACCTGACTGG AGTTCTGTAT 660 

GGOSAGQACA TTGAQATTTC AGACACXX3AG AOCTTCTCCA ACOATCOCTO TACCAOTGTC 720 

AAAAAGCTOA AOGGGAATQA TOTGOGGATC ATCCTTGQCC AQTTTGACCA OAATATGOCA 780 

GCAAAAGTGT TCTGTTGTGC ATAOQAGGAO AACATGTATQ QTAOTAAATA TCAGTGGATC 840 

ATTCOQGGCT GGTAOBAGCC TTCTTGGTGG GAGCAGGTGC ACAOGGAAGC CAACTCATCC 900 

OU CGCTGCCTCC GGAAGAATCT GCTTGCTOCC ATGOAGGGCT ACATTGGCGT GGATTTCGAG 960 

CCCCTGA6CT CCAA6CA6AT CAA6ACCATC TCAGGAAA6A CTCCACAGGA GTATQAGAGA 1020 

QA6TAC3U^CA ACAAGC330TC AGGCQTGGQO OXAGCAAGT TCCAOGQGTA OGCCTACGAT 1080 

G6CATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGO AGACACTGCA TGCCAGCAGC 1140 

OGGCACC3tf3C OGATCCAGGA CTTCAACTAC AOQGACCACA OGCTGQGCRG GATCATCCTC 1200 

03 AATGCXaXGA AOQAOACCAA CTTCTTOGGG GTa«3GaGTC AAGTTOTATT CCGGAATGGG 1260 

GAGAGAATGG GGACCATTAA ATTTACICAA TTTCAAOACA GCAGGGAGOT 6AAGGTGGGA 1320 

GAGTACAAOG CTGTGGCCGA CACACTOOAO ATCATCRATO ACACCATCAO GTTCCAA6GA 1380 

TCCGAACCAC CAAAAGACAA GACCATCATC CTGQAGCAGC TGCGGAAGAT CTCXOTACCT 1440 

in CTCTACAGCA TCCTCTCT6C OCTCACCATC CT0QGQAT6A TCATGGCCAG TGCTTTTCTC 1500 

/U TT CTTC AACA TCAAGAACCG GAATCaVSAAO CTCATAAAOA TGTOGAGTOC ATACATGAAC 1560 

AACCT TATC A TCCTTGQAGO QAT OCTC TOC TATGCTTCOl TAXTTCTCTT T GGCCTTGAT 1620 

GGATCCTTTG TCTCTGAAAA GAOCTTTGAA AGACTTT6CA COGTCAGGAC CTGGATTCTC 1680 

ACOGTGGGCT ACACX3AC0GC TTTTGGGGCC ATOTTTGCAA AGACCTOGAO AGTCCAOOCC 1740 

— ^ ATCTTCAAAA ATGTGAAAAT GAAGAAQAAQ ATCATCAAGG ACX^GAAACT GCTTGTGATC 1800 

/3 OTOGGGGGCA TGCTGCTGAT 06ACCTOTGT ATCCTGATCT GCTGGCAGOC T0TG6ACCCC 1860 

CTGOSAAGGA CAGTGQAGAA GTACAGCATG OAGGGGGACC CSUSCAGGAGQ GGATATCTCC 1920 

ATCOQCCCTC TCCTGGAGCA CT6TGAGAAC ACCCATATGA OCATCTGGCT TOOCATOOTC 1980 

TATGCCTACA AGGOACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGQA GACOOGCAAC 2040 

GTCAGCATCC CCGCACTCAA CGACAOCAAG TACATCQGGA TOAQTGTCTA CAAOGTGGGG 2100 

0\J ATCATX5T0CA TCATCGGGOC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160 

TTCTGCATCQ TGGCTCTGGT CATCATCTTC TOCAOCACCA TCACCCTCTG CCTG6TATTC 2220 

6TGC0QAA6C TCATCACCCT GAGAACAAAC CCAGATGCAG CAA03CA6AA CAGGOQATTC 2280 

CAGTTCACTC AQAATCAGAA OAAAGAAOAT TCTAAAACGT 0CA0CT06GT CACCAOTGT O 2340 

AACCAAGCCA GCACATCCOQ CCTGGAOGGC CTACAGTGA6 AAAACCATOQ CXrroGOAATG 2400 
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AAGA.TCACAG AGCTGQATAA A6ACTTQGAA GAGGTCftCO^ TGCAGCTOCA GQACACAOCft 2460 

GAAAAGAGOV CCTACATTAA ACAGAACCAC TAGCAA6AGC TCAATGACAT CCTCAACCTG 3520 

GGAAACTTCA CTGAGAGaVC AGAT06AGGA AAGGCCATTT TAAAAAATCA CCTGQATCAA 2S8D 

AATCCCCAGC TACAGTGGAA CACAACAQAQ CCCTCTCGAA CATOCAAAGA TCCTATAGAA 2640 

GATATAAACT CTCCAGAACA CATCCAGOGT GGGCTGTCCC TCCAGCTCCC CATCCTCCAC 2700 

CACGCCTACC TCCCATCCAT GG6AG6GC3TQ GA06CCAGCT GTGTCAGCCC CTG0GTCA6C 3760 

CCCACOGCCA GCCCCOGCCA CAGACATOTG CCAOCCTCCT TC06A0TCAT G6TCT0GGGC 3630 
CTGTAA 

Seq ID HO: 26 Protein sequence 
Protein Accession «t NP_005449.1 

1 11 21 31 41 51 

I I I i I I 

KASPRRSGQP GRPPPPPPPP ARLLLLLLLP LLLPLAPGAH 6WARGAPRPP P8SPPLSXMQ 60 

LMPLTKBVAK GSIGR6VLPA VEIiAIEQIRN BSliIiRFYFU) LRLYDTBCDN AKOUCAFYDA 130 

ZKYGPKHUfV FG3VCPSVTS IIAESLQGMN XiVQLSFAATT FVLADRKKYP YPFRTVP8X>N 180 

AVMPAILXLL KHYQHKRVGT LTQDVQRP8B VRNDLTGVLY GBDIEISDTE SPSNDPCTSV 240 

KKLKGNDVRI ILGQPDQNMA AKVFCCAYBE NMYGSKYQWI IPGWYBPSWW EQVHTEANSS 300 

RCLRKNUAA ME6YI6VDFE PI#SSKQIKTI SGKTPQQYER BYNNKRSGVG PSKFRGYAYD 360 

GIWXAXTIiQ IIAHBTLHASS RHQRIQDPHy TDKniGRIZL NAKNBTNPPG VTGQWFRNG 420 

BRMGTZXFTQ FQDSREVKV6 BYKAVADTLB ZXNDTZRFQG SEPPKDRTII LEQLRKZGLP 480 

LYSILSALTI LGMIKASAFL FFNIKNRKQK LIKMSSPYMM NLIILGC^ILS YASIPLPGZiD 540 

GSFVSBKTFE TLCTVRTWII, TVGYTTAFQA MFAKTWRVHA IPKNVKMKKK IIKDQKLLVI 600 

VGGMLLIDIiC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLBHCEtl THMTZWliOIV 660 

YAYKOUiMIiF GCFLAWETRN VSZPALNDSK YZGMSVYMVQ ZMCZZGAAVS FLTRDQPNVQ 720 

PCZVALVIIF CSTZTLCLVP VPXLZTLRTN PDAATQNRRP QPTQNQKXED SKTSTSVTSV 780 

NQASTSRIiEG LQSENHRUW XZTELDKDLE EVTMQLQDTP BKTTYZXOHH YQBUIDZLIIL 840 

GKFTESTDGO KAILKNHLDQ NPQLQWMTTE PSRTCXDPIB DZNSPEHZQR RZiSLQXiFZLB 900 
HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSPRVMVSO L 

Seq ID NOs 27 DNA sequence 

Nucleic Acid Accession «: NM_0004S0.1 

Coding sequence: 117.. 1949 

1 11 21 31 41 51 

I I I I I I 

CXnrGAGACAG AOGCAQCAGT GATACCCACC TQAGAGATCC TffTGTTTGAA CAACTGCTTC 60 

CCAAAACGGA AAGTATTTCA AGCCTAAACC TTTOGGTGAA AAGAACTCTT GAAGTCATQA 120 

TTGCTTCACA GTTTCTCTCA GCTCTCACTT TGGT6CTTCT OVTTAAAGAG A6TGGAGCCT 180 

GGTCTTACAA CACCTCCACX3 GAAGCTATGA CTTATGATGA GGCCAGTGCT TATTGTCAGC 240 

AAA6GTACAC AC3VCXnt33TT GCAATTCAAA ACAAAGAAQA GATTGAQTAC CTAAACTCCA 300 

TATTQAGCTA TTCACCAAGT TATTACTGGA TTGGAATCAG AAAAGTCAAC AATGTGTGGG 360 

TCTGGGTAGG AACCCAGAAA CCTCTGACAG AAGAAGCCAA GAACTGQGCT CCAOGTGAAC 420 

COVACAATAG GCAAAAAOAT QAGQACTGCG TGGAGATCTA CATCAAGAGA GAAAAAGATG 460 

TGGGCATGTG GAATGATGAO AGGTOCAGCA AGAA6AA6CT TQCCCTAT6C TACACAGCTO 540 

CCTGTACCAA TACATCCTGC AGTGGCCA06 GTGAATGTGT AGAGACCATC AATAATTACA 600 

CTTGC3VAGTG TGACCCTGGC TTCAGTGGAC TCAAGTGTGA 6CAAATTGTG AACTGTACAG 660 

CCCTGGAATC CXXTTGAGCAT GQAAGCCTGG TTTQCAGTCA CCCACTGGGA AACTTCAGCT 720 

ACAATTCTTC CTGCTCTATC AGCTGTGATA GGGGTTACCT GCCAAGCAGC ATGGAGACCA 780 

TGCAGTOTAT GTCCTCIGGA GAATGGAOTO CTOCTATTCC AGCCTGCAAT 0TG3TTQA0T 640 

GTGATQCTGT GACAAATCCA 6CCAATGGGT TOGTGGAATG TTTCCAAAAC CCTGQAAGCT 900 

TCCCATGGAA CACAACCTGT ACATTTGACT GTGAAGAAGG ATTTQAACTA ATGQGAGCCC 960 

A6AGCCTTCA QTQTACCTCA TCTGGQAATT OGGACAACGA GAAGCCAACX3 TGTAAAGCTG 1020 

TGACAT6CAG GGCC0TCC6C CA6CCTCAGA A TGGC TCTGT GAGGTGCAGC CATTCCCCTO 1080 

CTGQAGAaTT CACCTTGAAA TCATGCTQCA ACTTCACCTO TQAGGAA6GC TTCATGTTOC 1140 

AGGGACCAGC CC31SGTTGAA TGCACCACTC AAGGGCAGTG GACACAGCAA ATCCCAI3TTT 1200 

GT6AAGCTTT (3CAGTGCACA GCCTTGTCCA ACCCXXSA0CX3 AGQCTAGATO AATTGTCTTC 1260 

CTAOTGCTTC TGGCAGTTTC OSTTATGGGT CCAGCTGTGA GTTCTCCTGT GAGCAGGGTT 1320 

TTGT G TTGAA GOGATCCAAA AGGCTCCAAT OTGGCCCCAC AGGGQAGT06 6ACAA06AGA 1360 

A600CACAT0 TGAAGCTGTG AGATGOGAZQ CIGTCCAOOV GCCCCOGAAO 6GTTTG6TGA 1440 

G6TGTGCTCA TTCCCCTATT GQAGAATTCA CCTACAAGTC CTCTTGTOOC TTGAGCTX3TG 1500 

AGQAGGGATT TGAATTATAT GGATCAACTC AACTTGAGTG CACATCTCAG GGACAATGGA 1560 

CAGAAGAGGT TCCTTCCTGC CAAGTGGTAA AATGTTCAAQ CCTGGCAGTT COGGGAAAGA 1620 

TCAACATGAO CTGCAGTGGG GAGCCOSTGT TTG6CACTGT QTGCAAOTTC GCCTGTCCTG 1680 

AAGGATGCAC GCTCAATGGC TCTGCA6CTC GQACATGTGG AGCCAGAGGA CA C TGQTCT Q 1740 

GCCTGCTACC TACCTQTQAA GCTCCGACTO A6TCCAACAT TCCCTTGGTA GCTGGACrTT 1600 

CTGCTGCTGG ACTCTCCCTC CTGACATTAG CACCATTTCT CCTCTGGCTT CXK3AAATGCT 1860 

TAOGGAAAGC AAAGAAATTT QTTCCTGCCA GCAGCTGCCA AAGCCTTGAA TCAGAOGGAA 1920 

GCTACCAAAA GCCTTCTTAC ATCCTTTAAG TTCAAAAGAA TCA6AAACAG GTGCATCTGO 1980 

GGA ACTA GAG GQATACACTG AASTTAACAO AGAaUBATAA CTCTOCTOGQ GTCT C TQQOC 2040 

CTTCTTGCCT ACTATGCCAO ATGCCTTTAT GGCTGAAACC GCAACACCCA TCACCACTTC 2100 

AATAGATCAA AGTCCAGCAG GCAAGGACGG CCTTCAACT6 AAAAGACTCA GTGTTCCCTT 2160 

TCCTACTCTC AGGATCAAGA AAGTGTTGGC TAATGAAGGG AAAGGATATT TTCTTCCAAG 2220 

CAAA6GTQAA GAOACCAAGA CTCTGAAATC TCAGAATTCC TTTTCTAACT CTCCCTTGCT 2260 

OGCXGTAAAA TCTTGGCACA QAAACACAAT ATTTTGTGGC ' mrriltTi ' TTGOCCTTCA 2340 

CAGTGTTTOQ ACAOCTQATT ACACAGTT6C TGTCATAAGA AIX3AATAATA ATTATCCAGA 2400 

GTTTAGAGGA AAAAAATGAC TAAAAATATT ATAACTTAAA AAAATGACAG ATGTTGAATG 2460 

CCCACAGGCA AATGCATGGA GG6TTGTTAA TGGTGCAAAT OCTACTGAAT GCTCTGTGCG 2520 

AGGGTTACTA TGCACAATTT AATCACTTTC ATCCCTATOG GATTCAGTGC TTCTTAAAGA 2560 

GTTCrrAAGG ATT6TGATAT TTTTACTTGC ATTGAATATA TTATAATCTT GCATACTTCT 3640 

TCATTCAATA CAAGTGTGGT AGG6ACTTAA AAAACTTGTA AATOCTGTCA ACTATGATAT 2700 

6GTAAAA0TT ACTTATTCTA GATTAGGOOC TCATTGTTTA TTAACAAATT ATCriTACATC 3760 

TGTTTTAAAT TTATTTCAAA AAGGGAAACT ATTGTCCCCT AGCAAG6CAT GATGTTAAOC 2830 
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10 
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20 

25 

30 
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80 



AOfUlTAAAGT 
AGTAAAAACT 

CACACAAAGO 
GCTTTGCATT 
TATGOAAGAT 
TTTAAOQAAT 
GCTCT6GAA6 
AACAATTCCA 
AGTAATTGCC 
CCATTAACrr 
AAOGACAAAG 
TTAAAGGGGC 
ATG6AATACA 
GCATTAGAAA 
TTTAAATTAT 
TCA6ACCTAT 



TCTQAGTGffT 
GAATGGAAG& 
AAACTTCCAT 
TATAATTTTA 
CCTACAAAOA 
TTTAAATTCA 
GAAGATGTCT 
AQAGGAATGC 
AAGGAATCTC 
AAAGCTGCTC 
A6CATGTGTT 
CCAACAGTCA 
A6AAAAACTC 
GTGTTATTTT 
TTAOCTGTOT 
AACTTAAAAT 
TTOACATAAC 



TTTACTACAO 
TTTGTATATT 
GAG6CCAAAC 
TGAATGTCTT 
TGTTTGTCAa 
CAATAGAAAC 
AATAGTTATT 
CTGTGTGAGC 
CAGTTTTCAG 
TAGCCTTGAG 
GAAAAAAAAA 
AAACAQAGAT 
TGGGAAATAA 
CTTTGAAATT 
QAAATACCAG 
ATTTTATAAT 
ACTATAAAGO 



GTCAGATATT 
GTTTTQAACT 
TGTTGGAAAA 
ATGTGATATG 
TCACCATGTA 
C0CTATTT6T 
AAGCATTTAT 
TTGATCACTG 
GA6TQTGAGA 
GTTTCAGAGA 
GTGATAAGGA 
GAGA6AACAA 
GTTTAAGTGT 
TGTGGTTTGT 
TTTTAAAGTA 
TTGACAATAA 



AAAAC ATGOT 
TTTTCAGAAA 
AATAAAAGCA 
GAATACAGAA 
TAAACATAAT 
AAAQAGTCAT 
TTTCTTCTGT 
GTTTATTTAT 
GCAATGAAAA 
ATCAAAACTC 
AGTTCTGOCT 
TCAGAACAGC 
CTACTOTGAT 
TGTAAATATT 
GTTTGAGTTT 
TATATTTATT 
ATGTGCTTAT 



AGAATTG0A6 
TATOTGQTTT 
TAAATGCAAA 
AGATGGATGT 
TCTTGTATAT 
CTGGTAQATT 
AT6TTA6GGT 
AAGCAGATTT 
ATTCTCAGTC 
TCCTACACTT 
GAACACTGGC 
AGAGGTTCTT 
CAGGCTATGT 
TATGTAAACT 
TATTGAOAAT 
TAA0CTTAT6 
GTTT 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



Seq ID NOi 28 Protein sequence 
Protein Accession #: NP 000441.1 



MIASQFLSAL 
SXLSYSPSYY 



TALESPEH6S 
ECDAVTMPAM 
AVTCRAVRQP 
VCEAFQGTAL 
EKPTCEAVRC 
HTBEVPSOQV 
SGUiPTCBAP 
GSYQKPSyiL 



11 
I 

TLVIiLIKESQ 
WXGIRKVNNV 
SKKXLALCYT 
LVCSKPLGNP 
GPVECFQNPG 
QNGSVRCSHS 
SNPERGYMNC 
DAVHQPPXGL 
VXCS5LAVP6 
TESNXPLVAO 



21 
I 

AWSYNT8TEA 
WVWVGTQKPL 
AACTOTSCSQ 
SYNSSCSZSC 
SPPWNTTCTP 
PAGBPTFKSS 
LPSASGSFRY 
VRCABSPIGE 



31 

! 

MTYDEASAYC 
TEBAXNWAPG 
BOECVETIXQI 
DRGYLPSSNB 



41 

I 

QQRYTHLVAI 



LSAAOLSUiT 



CNPTCEEQFM 
GSSCEFSCEQ 
PTYKSSCAPS 
VFGTVCXFAC 
LAPFLXiNLRK 



YTCXCDPGPS 
TMQCM8SGBW 
AQSLQCTSSG 
LQGPAQVECT 
GFVLKGSKRL 
CEEGFEIiyGS 
PE6WTUIQSA 
CLRXAKKFVP 



SI 

1 

ONKEEIEYLM 
CVBIYIKREK 
OLKCEQZVNC 
6APZPAC3IW 
NWDNEKPTCK 
TQGQWTQQIP 
QCQPTGEWDN 
TQLECTSQGQ 
ARTCX3ATGaW 
ASSOQSLBSD 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID HOt 29 PNA sequence 

Nucleic Acid Accession #t 11N_007036 

Coding sequence i 56. .610 

1 11 21 31 41 51 

I I 1 I 1 I 

CTTCXXrACCA GCAAA6ACCA CGACTGGAGA GCGQAQCOGG AGGCAGCTGG GAAACATGAA 
GAGCQTCTTQ CTGCTGACCA OGCTCCTCGT GCCTGCACAC CTGGTGGCCG CCTGGAQCAA 
TAATTATGCG GTGGACTGCC CTCAACACTG TGACAGCAGT GAQTGCAAAA GCAGCCCX3C6 
CTGCAAGAGG ACAGTGCTCG AOGACTGTQO CTGCTGC06A GTOTOCQCTG CAGGGCGGGG 
AGAAACTTGC TACOQCACAG TCTCAGGCAT GGATGGCATG AAGTGTGQCC CGGGGCTGAQ 
GTGTCAGCCT TCTAATGGGG AGGATCCTTT TGGTGAA6A0 TTTQQTATCT GCAAAOACTG 
TCCCTAOGQC ACCTTCGQOA TGGATTGCAG AGAGACXTTGC AACTGCCAGT CAGGCATCTG 
TGACAGGGGQ ACGGGAAAAT GCCTGAAATT CCCCTTCTTC CAATATTCAG TAACCAAQTC 
TTCCAACAGA TTTGTTTCTC TCAOGQAOCA TGACATGGCA TCTGGA6ATG 6CAATATTGT 
GAQA6AAGAA GTTGTGAAAG AGAAT6CTGC GGGGTCTCCC GTAAIGA6GA AATGOTTAAA 
TCCAOGCTGA TCCCGGCXGT GATTTCTGAO AGAAGGCTCT ATTTTOGTGA TTGTTCAACA 
CACAGCCAAC ATTTTAGGAA CTTTCTAGAT ATAGCATAAG TACAT6TAAT TTTTQAAGAT 
CCAAATTGTG ATGCATGGTG GATCCAGAAA ACAAAAAGTA GGATACXTAC AATCCATAAC 
ATCCATATGA CTGAACACTT GTATGTGTTT GTTAAATATT CGAATGCAT6 TAGATTT6TT 
AAATGTGTGT GTATAGTAAC ACTGAAGAAC TAAAAATGCA ATTTAGQTAA TCTTACATOG 
AOACAGGTCA ACCAAAOAGG GAGCTAGGCA AAGCTGAAGA COGCAGTGAO TCAAATTAGT 
TCTTTGACTT TGATGTACAT TAATGTTGGQ ATATGGAATG AAGACTTAAG AGCAGGAGAA 
QATGGGQAGG GGGTGGGAGT GGQAAATAAA ATATTTAGCC CTTCCTTGGT AGGTAGCTTC 
TCTAGAATTT AATTGTGCTT ' nTl ' m 'TTT TTrGGCTTTG GGAAAAGTCA AAATAAAACA 
ACCAGAAAAC CCCTGAAGGA AGTAAQATGT TTGAAOCTTA TGGAAATTTG A9XAAC1UVAC 
AGCTTTGAAC TOAGAGCAAT TTCAAAAGGC TGCTGATGTA GTTCCOGGGT TACCTGTATC 
TGAAG6ACGG TTCTGQGGCA TAGGAAAC3VC ATACACTTCC ATAAATAGCT TTAACGTATO 
CXaVCCTCSVGA GATAAATCTA AGAAGTATTT TACCCACTGG TGGTTTGTQT GTGTATGAAQ 
6TAAATATTT ATATATTTTT ATAAATAAA7 GTGTTAOTQC AAOTCATCTT CCCTACCCAT 
ATTTATCATC CTCTTGAGGA AAGAAATCTA GIATTATTTG TTQAAAATGG TTAGAATAAA 
AACCTATGAC TCTATAAGGT TTTCAAACAT CTGAGGCAtG ATAAATTTAT TATCCATAAT 
TATAGGAGTC ACTCTGGATT TCAAAAAATO TCAAAAAATG AGCAACAGAG GGACCTTATT 
TAAACATAAG TGCTGT6ACT TCX3GTGAATT TTCAATTTAA GGTATGAAAA TAAGTTTTTA 
GGAGGTTTGT AAAAGAAGAA TCAATTTTCA GCAGAAAACA TOTCAACTTT AAAATATAOQ 
TGGAATTAGG AOTATATTTG AAAQAATCTT AGCACAAACA GGACTGTTGT ACTAGATGTT 
CTTAGGAAAT ATCTCAGAAO TATTTTATTT GAAGTGAAGA ACTTATTTAA GAATTATTTC 
AGTATTTACC TGTATTTTAT TCTTGAAGTT G6GCAACAGA GTTGTGAATG TGTGTGGAAG 
6GCTTTGAAT GTAAAGCTGC ATAAGCTGTT AGGTTTTQTT TTAAAAGGAC ATQTTTATTA 
TTGTTCAATA AAAAAGAACA AQATAC 

8eq ID NOj 30 Protein sequence 
Protein Accession »x NP_008967.l 

1 11 21 31 41 51 

I I I I I I 

MK8VLLLTTL LVPAHLVAAH SNHYAVDCPQ HCDSSBCKSS PRCaCRTVUH) 06(XRVCAA0 
RGBTCYRTVS GMDGMKG6PG LROQPSHGO) PFGBBFOICK DCPYOTFCMD CRBTQIOQSO 
laSRGTGKCL KFPFFQYSVT RSSNRFVSZ-T EHDMASODGN IVRBEWKEK AAOSPVMRKM 

959 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 



60 
120 
180 
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LNPR 

Seq ID KO: 31 PNA sequence 

- Nucleic Acid Accession #: NM_000963 

J Coding sequence: 135.. 1949 

1 11 21 31 41 51 

1 I 1 I I I 

CAATTGTCAT ACGACTTGCA OT0AGOC5TCA GGAGCACGTC CAGGAACTCC TCAOCAGCGC 60 

^ - CTCCTTCAGC TCCACAOCCA GACGCCXntlA GACAGCAAAG CCTACCCCOQ 00CXX3OGCCC 120 

10 TGCCCGCCGC TOGGATGCTC GCCaSOGCXr TGCTGCTGTG CG0S6TCCT0 GOGCTCAGCC 160 

ATACAGCAAA TCCTTGCTGT TGCCACCCAT GTCAAAACC6 AGGTGTATGT AXGAGTGTGG 240 

GATTTGACCA GTATAAOTGC GATTGTACXX: GGACAGGATT CTATGGASAA AACTGCTCAA 300 

CACOGGAATT TTTGACAAGA ATAAAATTAT TTCTGAAACC CACTCCAAAC ACAGTGCACT 360 

ACATACTTAC CCACTTCAAQ GOATTTTGGA AOOTTQTGAA TAACATTCCr TTCCTTOOAA 420 

15 ATGCAATTAT GAGTTATGTC TTGACATCCA GATCACATTT GATTGACAGT CCACCAACTT 480 

ACAATGCTGA CTAT6GCTAC AAAAGCTG6G AAGOCTTCTC TAftCCTCTCC TATTATACTA 540 

GAGCCCTTCC TCCTGTGCCT GATGATTGCC CGACTCCCTT GG GTGTCRAA GGTAAAAAGC 600 

AGCTTCCTGA TTCAAATGAG ATTGTGGAAA AATTQCTTCT AAGAA6AAAG TTCATCCCTQ 660 

ATCXXCAGGG CTCAAACATG ATGTTTGCAT TCTTTGCCCA GCACTTCACG CATCAQTTTT 720 

20 TCAAGACAGA TCATAAGCGA GGGCCAGCTT TCACCAACGG GCTG66CCAT GGGGTGGACT 780 

TAAATCATAT TTAOQGTGAA ACTCTGGCTA GACAOCQTAA ACT0G3GCTT TTCAAGGATG 840 

6AAAAATGAA ATATCAGATA ATTGATGGAG AGAT6TATCC TCXXACAGTC AAAGATACTC 900 

AGGCAGAGAT GATCTACCCT CCTCAAGTCC CTGAGCATCT ACGGTTTGCT GTGGGGCAGG 960 

AGGTCTTTGG TCTGGTGCCT GGTCTGATGA TGTATGCCAC AATCtGGCTO CX3GGAACACA 1020 

25 ACAGAGTATG CGATGT6CTT AAACAGGAGC ATCCTGAATO GGGTGAT6A0 CAGTTGTTCC 1080 

AGACAAOCAO GCTAATACTG ATAOGAGASA CTATTAAQAT TOTOATTGAA GATTATGTGC 1X40 

AACACTTGAO TGOCTATCAC TTCAAACTGA AATTTGACOC AGAACTACTT TTCAACAAAC 1200 

AATTCCAGTA CCAAAATCGT ATTGCT6CTG AATTTAACAC CCTCTATCAC TGQCATCCCC 1260 

TTCTGCCTGA CACCTTTCAA ATTCATGACC AGAAATACAA CTATCAACAG TTTATCTACA 1320 

30 ACAACTCTAT ATTGCTGCSAA CATGGAATTA CCCAGTTTGT TGAATCATTC ACCAOGCAAA 1380 

TT6CT66CAG GGTTGCTOGT GGTAGGAATG TTCCACC06C AGTACAGAAA GTATCACAGQ 1440 

CTTCXATTGA CCAGAGCAGG CAGATQAAAT ACCAGTCTTT TAAT6AGTAC CGCAAA08CT 1500 

TTATGCTGAA GCCCTATGAA TCATTTGAAG AACTTACAGG AGAAAAGGAA ATGTCTGCAG 1560 

AGTTGGAAGC ACTCTATGGT GACATCGATQ CTGTGGAGCT GTATCCTGCC CTTCTGOTAO 1620 

35 AAAAGCCTGG GCCAGATGCC ATCTTTGGTG AAACCATGGT AGAAOTTGQA GCACCATTCT 1680 

CCTTGAAAGG ACTTATGGGT AATGTTATAT GTTCT0CT6C CTACTGGAAO CCAAGCACTT 1740 

TTGGTGQAGA AGTG QG TTTT CAAATCATCA ACACTGCCTC AATTCAGTCT CTCATCTQCA 1800 

ATAACGTGAA GGGCTGTCCC TTTACTTCAT TCA6TGTTCC AGATCCAGAG CTCATTAAAA 1860 

CAGTCACCAT CAATGCAAGT TCTTCCCGCT COGGACTAGA TGATATCAAT CCCACAGTAC 1920 

40 TACTAAAAGA ACGTTCGACT GAACTGTAGA AGTCTAATGA TCATATTTAT TTATTTATAT 1980 

GAAOCATGTC TATTAATTTA ATTATTTAAT AATATTTATA TTAAACICCT TATGTTACTT 2040 

AACATCTTCT GTAACAGAAO TCAG7ACTCC TQTTGGGGAG AAAGGAOTCA TACTTGTOAA 2100 

GACTTTTATG TCACTACTCT AAAGATTTTG CTGTTGCTGT TAAGTTTGGA AAACAGTTTT 2160 

TATTCTGTTT TATAAACCAG AGAGAAATGA GTTTTGACQT CTTTTTACTT GAATTTCAAC 2220 

45 TTATATTATA AGAACGAAAG TAAAGATGTT TGAATACTTA AACACTATCA CAAGATG6CA 2260 

AAATGCTGAA AGTTTTTACA CTGTG6ATGT TTCCAATGC3V TCTTCXIATGA T6CATTA6AA 2340 

GTAACTAATO TTTGAAAT^ TAAAOTACTT TTGOTTATTT TTCTGTCATC AAACAAAAAC 2400 

AGGTATCAGT GCATTATTAA ATGAATATTT AAATTAQACA TTACCAGTAA TTTCATGTCT 2460 

ACTTTTTAAA ATCAGCAATQ AAACAATAAT TTOAAATTTC TAAATTCATA GQGTAaAATC 2520 

50 ACXrrGTAAAA GCTTGTTTGA TTTCTTAAAG TTATTAAACT TGTACATATA CCAAAAAGAA 2580 

G CTGTCT TGQ ATTTAAATCT GTAAAATCA6 AT8AAATTTT ACTACAATEO GTTGTTAAAA ■ 2640 

TATTT7ATAA GTGATGTTCC TTTTTCACCA A6A6TATAAA CCTTTTTAOT GTGACTGTTA 2700 

AAACTTCCTT TTAAATCAAA ATGCCAAATT TATTAAGGTG GTGGAGCCaC TGCAGTGTTA 2760 

TCTCAAAATA AGAATATTTT GTTGAGATAT TCCAGAATTT GTTTATATGG CTOGTAACAT 2820 

55 GTAAAATCTA TATCAGCAAA AGGGTCTACC TTTAAAATAA GCAATAAOUV AGAAGAAAAC 2680 

CAAATTATTO TTCAAATTTA OOTTTAAACT TTTGAAOCAA ACT I TTT TO ' ATCCTTGTOC 2940 

ACTGCAGGCC TG6TACTCA6 ATTTTGCTAT GAGGTTAATG AAGTACCSU^ CT6TGCTTQA 3000 

ATAACGATAT QTTTTCTCAG ATTTTCTGTT GTACAGTTTA ATTTAGCAGT CCATATCACA 3060 

^ TTGCAAAAGT AGCAATGACC TCATAAAATA CCTCTTCAAA ATGCTTAAAT TCATTTCACA 3120 

OO CATTAATTTT ATCTCAOTCT TGAAGCCAAT TCAOTAaOTQ CATTOQAATC AAGCCTGGCT 3160 

ACCT6CATGC T G TT C CTTT T CT TT TC T TC f TTTAGCCATT TTGCTAAGAa AGACAGTCTT 3240 

CTCATCACTT CX3TTTCTCCT ATTTTGTTTT ACTAGTTTTA AGATCAQAGT TCACTTTCTT 3300 

TGGACTCTGC CTATATTTTC TTJ^CTGAAC TTTTGCAAGT TTTCAG6TAA ACCTCAOCTC 3360 

AGGACTGCTA TTTAGCTCCT CTTAAGAAGA TTAAAAQAGA AAAAAAAAGG CCCTTTTAAA 3420 

65 AATAGTATAC ACTTATTTTA AQTGAAAAOC AGAGAATTTT ATTTATAGCT AATTTTAGCT 3480 

ATCTGTAACC AAGATG GATQ CAAAGAGGCT AGTGGCTCAO A6AGAACTOT ACGG6GTTTG 3540 

TOACTGGAAA AAOTTACOTT CCCATTCTAA TTAATQCOCT TTCTTATTTA AAAACAAAAC 3600 

CAAATGATAT CTAAGTAGTT CTCAGCAATA ATAATAATGA CGATAATACT TCTTTTCCAC 3660 

^ ATCTCATTQT CACTGACATT TAATQGTACT GTATATTACT TAATTTATTG AAGATTATTA 3720 

70 TTTATGTCTT ATTAQGACAC TATGGTTATA AACTGTOTTT AAGCCTACAA TCATTGATTT 3780 

TTTTTTGTTA TSTCACAATC AGTATATTTT CTflt ^G G G TO ACCTCTCT6A ATATTAT6TA 3840 

AACAATCCAA AOAAATGATT GTATTAAQAT TTGTGAATAA ATTTTTAGAA ATCiaATTGG 3900 

CATATTGAGA TATTTAAGGT TGAATGTTTO TCCTTAGGAT AGGCCTATGT GCTAGOCCAC 3960 

AAAGAATATT GTCTCATTAG CCTGAATQTG CCATAAGACT GACCTTTTAA AATGTTTTGA 4020 

/5 GGGATCTGTG GATGCTTCGT TAATTTOTTC AGCCACAATT TATTGAOAAA ATATTCTGTO 4060 

TGAAGCACTG TaGGTTTTAA TATTTTTAAA TCAAACX2CTG ATTACAGATA ATAQTATTTA 4140 

TATAAATAAT TGAAAAAAAT TTTCTTT TO Q OAAOAGGGAO AAAATGAAAT AAATATCATT 4200 

AAAOATAACT CAGGAGAATC TTCTTTACAA TTTTACX5TTT AGAATQTTTA AGGTTAAGAA 4260 

A6AAATASTC AATATQCTTG TATAAAACAC TOTTCACTGT TTTTTTTAAA AAAAAAACXT 4320 

oO GATTTGTTAT TAACATTGAT CTGCTGACAA AACCTGG6AA TTTGGGTTGT GTATGOGAAT 4360 

OTTTCAGTOC CTCAGACAAA TGTGTATTTA ACTTATGTAA AAOATAAGTC TG6AAATAAA 4440 
TGTCT8TTTA TTTTTGTACT ATTTA 
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Seq ZD NOt 32 Protein sequence 
Protein Accession Ut NP 000954 
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1 
I 

MLARALLLCA 
TRIXLFLKPT 
GYKSHEAFSN 
NMMPAPP3\QH 
QIIDGEKYPP 
VLKQEHPEHG 
HRIAABFNTL 
W3GRNVPPAV 
YGDIDAVBLY 
GFQIUrrASI 
8TEL 



11 
I 

VLALSKTANP 
PNTVHYIIiTH 
LSYYTRALPP 
FTHQPPKTDH 
TVKDTQAEMI 
DEQLFQTSRL 
YHWHPIiLPDT 
QKVSQASIDQ 
PALLVEXFRP 
QSLICNNVXO 



21 
I 

CCSHPOQMSG 
PKOFHNWNN 
VPDDCPTPLQ 
KRGPAFTNGL 
ypPQVPEHUt 
ILZGBTIKIV 
FQIRDQKniY 
SRQMKYQSFN 
DAIFGETMVE 
CPPTSFSVPO 



31 
I 

VCMSVGPDQY 
ZPFLRNAZMS 
VXGKKQLPDS 
GKGVDLNHZY 
FAVGQGVFGL 
XEDyVQKLSG 
QQFZYNNSZL 
BYRKRFMLKP 
VGAPPSUCGL 
PELZKTVTIN 



41 
I 

KCDCTRTGFY 
YVLTSRSHLZ 
NBIVBKLLLR 
GETLARQRKL 
VPCL«MYAT1 
YHFKUCFDPB 
LEB6ZTQFVB 
YE8FEELT6B 
KGNVICSPAY 
ASSSRSGXiDD 



Seq ID NO: 33 DMA sequence 

Nucleic Acid Accession Hi NM_001508.1 

Coding sequence { 1..1361 



ATGGCTTCAC 
CCOGAGTTTG 
TTCQTOATGG 
AAAGGATACT 
TTGQTGTTCC 
ACX3TCCAGCT 
GCTACGCTGC 
TTCAOGTACA 
GTCA0CTCC6 
GTQAAOGTGC 
CAGCCCGAGA 
CAOTCCAGCA 
ATGTGCTGGA 
AGGGGGCCTC 
ACCATCATCT 
ATTCGGA6GA 
60GTACAT6A 
CGGCTCCT6T 
TGCCGCCTOT 
ACCAC06ACA 
TCTGCAAGGA 
TCTAAQTCXX: 
AATTCTGCTO 




ACAOGG'XGTC 
OGCTOCAGCA 
GCGCCCGCTT 
GAACTGAGAA 
AOTCATTGAQ 
CAGAGAATGG 



21 
I 

GGGCAGTGAC 
CTG6ATCAAA 
GAACAGOQTC 
G6TQACAGAC 
GCCCATOGAG 
CTGCAAGCTG 
GACGCTCAGC 
GG6ACCTTGC 
ACTGCCCTTG 
GGGTCTCACT 
GTCCATCTGT 
CTTCOTGGTC 
GGTGCTCATG 
GAAGTCGGAG 
GATTOTTGTG 
GGCCAAACCC 
CTTCTCGGAG 
CTOOCAGCAO 
OGCCAAGCAC 
TGTGCAGOGC 
6ATTTTCTTA 
TCTCXSAGTCA 
TTTTCAGGAQ 



31 

1 

TGCTCCCAAA 
ATCACCCTTA 
AGCATT06G0 
CACATGGTGA 
TTCTACAGCA 
CACACTTTCC 
TTTGAGCX3CT 
CAGGTGAAGC 

ciwr m c cA 

TGCAACOGCT 
ACCAACCTCT 
TACCTGGTGG 
AAAA6CCAGA 
AGOGAASAGA 
ACATTGGCCG 
AAGCACGACT 
ACGTTTTTCT 
TTT0GGCX5GG 
OAOAAGOGCC 
COGTTGCTCT 
AGCACTTTTC 
CTAGAGCCCA 
CATQAAGTTT 




AGGQCTCOCT 
aCAGGACCX3C 
TATGCTGGAT 

ggacx3aggtc 
acx:tcagctc 

tqttoqtgca 
TGGSOQTACA 

t0g0gtccc3g 
AQAQCGAGGC 
ACTCAGGCGC 
GA 



51 
I 

GENCSTPBFL 
DSPPTYKADY 
RKFIPDPQGS 
RLFKDGKMKY 
WLRERNRVCD 
LLFNKQFQYQ 
SFTR0ZA6RV 
KBMSAELEAL 
WKPSTFGGEV 
INPTVLLKER 



51 
I 

CAGTCATGTC 
CCTGATCATC 
GCTGCAOAAO 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCA CCCC 
CTTOGTCTGG 
GTACCCCCTG 
CCACCACGAG 
GACCGTGTTC 
OSTABCCTTC 
GGG06GGGGC 
CAGGAG6CA6 
6CCCAACCAG 
CTACTTCCGG 
GGTCATCAAC 
GGT6CTGTGC 
TQGGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



Seq ZD NO: 34 Protein sequence 
Protein Accession #t NP 001499.1 



MA8PSLPGSD 
KGYIiQKEVTD 
ATLLHVLTLS 
VKVPSHRGLT 
KCWHMMQVLM 
IRRIMAAAKP 
CRIiSLQHANH 



11 
I 

CSQZZDHSHV 
BMVSIiACSDZ 
FERYZAZCHP 



KHDWTRSYFR 
EKRIiRVHAUS 
LEPNSGAKFA 



21 
I 

PEPEVATHIR 
LVFLI04PME 
FRYKAVSGPC 
QPETSNMSZC 
TRPPQLRKSB 
AYMILLPFSE 
TTDSARFVQR 
N8AAENGFQE 



31 

I 

ZTLZLVYLZZ 
FY8ZZWNPLT 
QVKLLZGFVW 
TNLS8RWTVF 



TFFYLSSVIN 
PIiLFASRRQS 
HEV 



41 

I 

FVK6LLGNSV 
TSSYTLSCKL 
VTSALVALPL 
QSSZPOAFVV 
TZZPLRLZW 
PUiYTVSSQQ 
8ARRTBKZFL 



Seq ZD NO* 35 DMA eeguence 

Nucleic Acid Accession «t NH_006475.1 

Coding sequencei 28.. 2538 



1 
I 

AACAQAACTG 
TTGCTGCTTA 
AGTOQTATCA 
ACCAAAAAQA 
AAAAOGACTG 
TGCCCAGCAG 
ACAAOGOVGC 
TTCACTTACT 
GQTTTOGAGA 
AASA6AATGT 
TTGGGGCTTT 
ATCCATGGGA 
CAAATTGOTA 
GCA0CT60CA 
TTT6CTCCCA 
GGAGACAAAG 
TCTGAGTCTA 



11 

I 

CAAOGGAGAG 
TTGTTAACCC 
GGGOTGGGGA 
AATACTTCAG 
TTTTATAIGA 
TTTTGCCCAT 
GCTATTCTGA 
TTOCACCGAG 
GCAAOGTQAA 
TGACCAAGGA 
TCATTAACCA 
ACCAGATTGC 
CCTCAATTCA 
TGACATGGGA 
CCAATOAGQC 
TGGCTrCCQA 
TTATQQQAGO 



21 
I 

ACTCAAGATG 

tataaaogcc 
cx:aaggcoca 

CACTTGTAAO 
ATGrZGCGCT 
T6ACCATGTT 
CGCCTCAAAA 
TAATGAGGCT 
TGTTGAATTA 
CTTAAAAAAT 
TTATCCTAAT 
AACAAATGGT 
AQACTTCATT 
CATATTGGAG 
TTTTGAOAAA 
AGCTCTTATG 
A0C3UITCTTT 



31 

1 

ATTCCCTTTT 
AACAATCATT 
AATQTCTGTa 
AACTGGTATA 
OGOTAZATGA 
TATGGCACTC 
CTOAGGGAGG 
TGGGACAACT 
CTQAATGCTT 
GGCAT6ATTA 
GGGGTTGTCA 
GTTGTCCATG 
GAAGCAGAAO 
GCCCTTGQAA 
CTTCCAOQAO 
AAGTAGCACA 
OAQAGGCraO 



41 

I 

TACCCATGTT 
ATGACAAGAT 
OOCTTCAACA 
AAAAGTCCAT 
GAATGQAAGQ 
TGGGCATOGT 
AGATOQAGGG 
TGGATTCTGA 
TACATAGTCA 
TTOCTTCAAT 
CTOTTAATTO 
TCATTGACOG 
AT6A0CTTTC 
GAGACGCTCA 
OTOTOCZAGA 
TCXTAAATAC 
AAGOAAATAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



51 
I 

TZRVTQVLQK 
HTFLFBACSY 
LFAMGTBYPL 
YLWLZiSVAF 
TLAVGHMPNQ 
FSRVFVQVIiC 
STFQSEAEPQ 



60 
120 
180 
240 
300 
360 
420 



51 

i 

TTCTCTACTA 
CTTGGCTCAT 
QATTTTGGGC 
C1GTGGACAG 
AATQAAAG6C 
GGGAGCCACC 
AAAGGGATCC 
TATCaSTAGA 
CATQATTAAT 
OTATAACAAT 
T6CTXX3AATC 
TGTGCTTACA 
ATCTTTTAGA 
CTTCACACTC 
AA06TTCAT6 
TCTCCAOTGT 
AATTGAOATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



961 



wo 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GG^TGTGAOO 
GTGACAAATA 
CAAGTTATTG 
GGCTTCK3CAT 
GCATTTTCTG 
CAOVTATTGA 
ATCGGAG6CA 
TGCAT6GAGA 
ATCAAGCCAO 
TTCCTCAGCC 
ACATTATTTQ 
CTGATAGGGO 
TTCATTGGAA 
AAAATCTTTC 
TCTGACATCA 
OACAGACCTG 
CAAATTAAC3T 
AAAATTATAA 
ATTATCAAAA 
AGACTGATTA 
AAATACACCA 
GAA06AATCA 
ACAGAAQAAA 
GAAGGTGGTG 
ACACCOGTGA 
GAAGGT0C3TT 
AATAACCTGA 
ACAAA6AAGC 
GAAACAT6AG 
CACCTTACAC 
AGAAAAATCC 
CCATTQAAAA 
TATCTCTCCA 
TCAAAAGGCT 
CAACTAATTT 
TCTCSJkAOGT 
ATTGAOAAAA 



GTGACAGTAT 
ATGGTGTGAT 
AGCTGGCTGG 
CTGCTCTGAQ 
ATGATACTCT 
AAGTAAAAGT 
AACAGCTCAO 
AAGGGAGTAA 
CAQAGAAATC 
TACTTGAAGC 
TQCCAACCAA 
ACAAAAAT6C 
AAGGATTTGA 
TOAAAGAAGT 
TGACAACAAA 
TTGGAAATGA 
TTOTTOOTGG 
CCAAAGTTGT 
CTGAAGGACC 
AAGAAGGTGA 
AAATCATTGA 
TTACAGGTCC 
CTCTGAAGAA 
ATGGTCATTT 
GGAAGTTGCA 
CTCAGT6AAA 
CCTTAGAAAA 
AATCATCAAA 
GGAAATTGTO 
CCTTTTTCAT 
TTGTCACCAS 
GACCQAQCCT 
TGGGAA6CTA 
TTGCACATTT 
TGTACTCTCA 
TTC31ATAAAA 
ACTCAAQATT 



AACAGTAAAT 
CCATTTGATr 
AAAACAGCAA 
GCCACATGGA 
CASC AT G G TT 
T06CCTTAAT 
A6TCTTCGTA 
GCAAGGGAQA 
CCTCCATGAA 
TGCAGACTTG 
TGAT6CTTTT 
TCTTCAAAAC 
ACCTGGTGTT 
AAATGATACA 
TGGTGTAATT 
TGAACTOCTG 
TAGGACCTTC 
GGAACCAAAA 
CACACTAACA 
AAGAATAACT 
TGOAGTGCCT 
T6AAATAAAA 
ATTOTTACAA 
ATTTGAAGAT 
AGCCAACAAA 
ATCCAAAAAC 
TT6TGAGAGC 
TAATTCTGAA 
GAGTTAGCCT 
CTTGACATTA 
ATTCATTACA 
TGTAT6TAT0 
AGTTATAAAA 
CTATATQAGT 
GAATGTTTGT 
CCATTTTTCA 
TAAGTTAAAA 



GGAATCAAAA 
CATCAGGTCC 
ACCACCTTCA 
GAATACACTT 
CAGOGCCTCC 
GAGCTTTACA 
TATOGTACAG 
AAOGGTGOGA 
AAGTTAAAAC 
AAAGAGCTCC 
AAGGGAATGA 
ATCATTCTTT 
ACTAACATTT 
CTTCTOOTQA 
CATGTTGTAG 
6AAATACTTA 
AAAGAAATCC 
ATTAAAGT6A 
AAAGTCAAAA 
GAAGTQATCC 
6T6GAAATAA 
TACACTAGGA 
GAAGAGGTCA 
GAAGAAATTA 
AAAGTTCAAQ 
CAOAAAAAAA 
CAAGTTGACT 
CACAAATTTA 
CCTGTGGTAA 
AAAGTTCTGG 
ATTCAAAT06 
TTATGGATAC 
ATAGGTGCTT 
G6GTTTACTG 
CATAT6CTTC 
GATATAAAGA 
AGTGGTTTGG 



TGOTQAACAA 
TAATTCCTGA 

0GGATCTT6T 
TGCTGGCACC 
TTAAATTAAT 
AOGGGCAAAT 
CTGTCTGCAT 
TTCACATATT 
AAGATAAGCO 
1GACACAACC 
CTAGT6AAGA 
ATCACXrTGAC 
TAAAGACCAC 
ATGAATTGAA 
ATAAACTCCT 
ATAAATTAAT 
CCGTGACTGT 
TTGAAGGCAG 
TTGAAGGTGA 
ATGGAQAGCC 
CraAAAAAOA 
TTTCTACTGG 
CCAA6GTCAC 
AAAGACTGCT 
QTTCTAQAAO 
TGTTTATACA 
TCAGGAACTG 
ATATTTTTTT 
AGGAATTGAA 
CTAACTTTGG 
AAGAGTTQTG 
ATAAAAT6CA 
GGTGTACAAA 
6TAAATTAT0 
TTGCAATGCA 
GAATTACTTC 
ACTTGGGAA 



AAAGGATATT 
TTCFGCCAAA 
GGCCCAATTA 
TGTQAATAAT 
TCTGCAGAAT 
ACTGGAAAOC 
TGAAAATTCA 
COGOGAGATC 
CTTTAGCACC 
TGGAGACTGG 
AAAAGAAATT 
ACCAGGAGTT 
ACAAGGAAGC 
ATCAAAAGAA 
CTATCCAGCA 
CAAATACATC 
CTATACAACT 
TCTTCAGCXrT 
ACCTGAATTC 
AATTATTAAA 
QACAaSAOAA 
AGGTGGAGAA 
CAAATTCATT 
TCAGGGAGAC 
A0GATTAA6G 
ACCXTTAAGTC 
AAACATCAGC 
TTCTGAATGA 
GAAAATATAA 
AATCCATTAO 
AACTGTTATC 
C6CAAGCCAT 
ACTTTTTATA 
TTATTTTTTA 
TATTTTTTAA 
AAATTGAGTA 



Seq ID NOi 36 Protein sequence 
Protein Accession #: NP 006466.1 



1 

I 

MIPFLPMFSL 
XNWYKKSICG 
KIiREEIEGKG 
MGMZZPisMyN 



MKYHILNTLQ 
IDQVLIPD8A 
VQRLLKLILQ 
RNGAZBIFRB 
FKI^SEEKE 
TLLVNELKSK 
FKEIPVTVYT 
TEVIHGEPIZ 
QEEVTKWTKF 



11 

I 

LLLLIVNPIN 
OKTTVLYECC 
SPTYFAPSNE 
NLGLFINKYP 
RAAAITSDIL 
CSESIMGGAV 
KQVIBLAGKQ 
MHILKVKVOL 
IIKPAEKSLH 
ILIRDKNALQ 
ESDIMTTNGV 
TKIITKWEP 
XKYTKIIDGV 
lEGQDGHLFB 



21 

I 

AimBYDKILA 
PGYMRMEGMK 
AHDNLDSDIR 
NGWTVNCAR 
BALGRDGHFT 
FBTLEGNTIE 
QTTPTDLVAQ 
NEliYMGQZIiE 
EKLKQDKRFS 
NIILYHIiTFG 
IHWDKLLYP 
KIKVXBGSIiQ 
PVBITEKETR 
DEBIKRLLQG 



31 

I 

HSRIRGRDQG 
GCPAVLPIDH 
RGIiBSNVNVE 
IIKGNQIATN 
LFAPTNEAPB 
IQCDGDSITV 
LGIiASALRPD 
TIGGKQLRVF 
TFLSUBAAD 
VFIGK6FEPG 
ADTPVGNDQL 
FIXKTEGPTL 
VB&11TG9BX 
DTPVRKLQAN 



41 

1 

PNVCALQQIL 
VYGTLGIVGA 
ItUIAliRSHNZ 

KIiPRGVLBRF 
NGIKMVNKXD 
GEyTLLAFVN 
VYRTAVCZEtf 
LXELLTQPGD 
VTNIIiKTTQG 
LEILNKLZKY 
TKVKIEQSPB 
KyTRXSTGGG 



Seq ID NO: 37 PNA sequence 

Nucleic Acid Accession «< NM_002416 

Coding sequence: 40. .417 



ATCCAATACA 
TTCCTCTTGG 
AAGGGTOGCT 
GACCTTAAAC 
AAGAATGGAG 
AAGT6GGAGA 
AA6AAAGTTC 
ACCACTTCAC 
TTCCAAAGGA 
ATTACTCTGA 
TTGTTAAAGG 
TTAAG6CCAT 
CTCACAACA6 
TATCTGAGGC 
TOAAATTOAG 
CTACAGGCCT 
ACTGGAGATC 
TTOCATCTTG 
AAOTCAOCTC 
O^TATGCTCT 



11 
I 

GGAGTGACTT 
GCATCATCTT 
GTTCCTGCAT 
AATTTGCCCC 
TTCAAACATG 
AACAGGTCAG 
TQAAAGTTCX3 
CAATAA6TAT 
GGATGGCATA 
AATTGTAACT 
CTATGATTGT 
GATTTTA6CA 
CT6CCTG6AA 
ACATGTCAGC 
CTGGACCTCA 
CACACAOUVT 
ACCAGTGTGT 
CCCGCi'CAGG 
TTCTCCATGC 
GATTTATCTG 



21 
I 

GGAACTCCAT 
GCTGGTTCTG 
CAGCACCAAC 
AAGCCCTTCC 
TCTAAACXrCA 
CCAAAA6AAA 
AAAATCTCAA 
TCTGTQTTAA 
TAATACAAAG 
AAAGTTAGAA 
CTTTGTTCTT 
ATACCXZATGT 
6A0CAGCCX;r 
AAOTCCTAAO 

cxyuvGCTGcr 

GTGTCTGAGA 
aGCTTTCAGA 
CT6ACCACTT 
TAOCACAATG 
AOTCAACTGC 



31 
I 

TCTATCACTA 
AT7GGAGT6C 
CAAGGGACTA 
TG0GA6AAAA 
GATTCAGCAG 
AAGCAAAAGA 
OQTT C TOBTC 
AAATOTTCTA 
GCTTATTAAT 
AGTTOATTTT 
CTACCACCCA 
CTACACAGAT 
AG6CTTCCAC 
CCTGTTAGCA 
GTGGCCATCA 
GATTCATGCT 
GOCTCCTTTC 
TATTTCTTTT 
CAGTGCCTTT 
ITTCTCATCT 



41 
I 

TGAAQAAAAG 
AAGGAACCCC 
TCCACCTACA 
TTGAAATCAT 
ATGTGAAGGA 
ATGGGAAAAA 
AAAAGAAOAC 
TTTTAATTAT 
TTGACTAGAA 
AAGAATCCAA 
CCAGTTGAAT 
GTTCACCCAA 
GTACTGCA6C 
TGCTGGTGAG 
ACCTCTOTAT 
GATTGTTATT 
TGGCTTTGGA 
TGTTCOOCTT 
CTTCTCTCCA 
TGTOCOCAAC 



1080 
1140 

1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
IBOO 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



51 

1 

GTXKKYFSTC 
TTTQRYSDA8 
NKRHLTKDIiK 
TQIGTSZQDF 
MGDKVASEAL 
rVTNNQVIHL 
HAFSDDTLSM 



WTLFVPTNDA 
SKZFLKEVND 
IQZKFVRGST 
FRLIKEGETZ 
BTBETLKKLL 
REGR5Q 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



51 
I 

TGGTCTTCTT 
AGTAGTGAGA 
ATCCTTQAAA 
TGCTACaCTQ 
ACTGATTAAA 
ACATCAAAAA 
TACATAAGAO 
ACCQCTATCA 
AATTTAAAAC 
AOGTTAAGAA 
TTCATCATGC 
CCACATCCCA 
CTCCAGA6AG 
OCAAGCAGTT 
TTOAATCAGC 
GGGTATCACC 
AOOCATGTGA 
TGCTTCATTC 
OTGCAOCTGT 
ACXXXACAGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



962 



wo 03/042661 



AGTGCTTTCT TCTCCCAATT CATCCTCACT CAOTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260 

AAATAAACCT TTTrGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 

CACATGGOTO AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 

AGATTG TCA G CTCCTTGAGG GCAAGAGCCA CAGTATATTT CXXTTOTTTCT TCCACAOTGC 1440 

CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGOA 1500 

TGGCAACCAO ACC A TTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 

CTAGCCTCTO OTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGQ GACCAGGGAT 1620 

GATGCAACAT ai T l' Cm. T iT TTATGACAGO ATGTTTGCTC AGCTTCTCCA ACAATAAQAA 1680 

GCACX5TGGTA AAACACTTGC GGATATTCTG GACTGTITTT AAAAAATATA CAGTTTACCG 1740 

AAAATCATAT AATCTTACAA TGAAAAGQAC TTTATAQATC AGCCAGTGAC CAACCTTTTC IBOO 

CCAACCATAC AAAAATTGCT TTTCCOGAAO GAAAAGG6CT TTCTCAATAA GGCTCMSCTT 1860 

TCTAA6ATCT AACAAGATAG GCACOQAGAT CCTTATOGAA ACTCATTTTA GQCAAATATQ 1920 

AOTTTTATTG TCOQTTTACT TGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACX3V 1980 

TCTCCCATGA AGAAAGGGAA OGGTGAAOTA CTAAGOGCTA GAGGAAGCAG CCAAOTOGGT 2040 

TAGTGGAAGC ATGATTGOTO CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 

0GA6GTTCAO TGAATTQTGT AOGAGAGGTT GTCTGTGGCX: AGAATT TAAA O CTATAC TCA 2160 

CTTTCCCAAA TTGAATCACT OCTCAGACTO CTGATGATTT AGAGTGCTGT COOGTGGAGA 2220 

TCCCAOCOGA ACOTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 

AAAAATCTAA GTGTTTCATA AATTTQAGAG TCTGTOACXX: ACTTACCTTG CATCTCACAG 2340 

GTAGACA6TA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 

TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTGA 2460 

AAACAGTATT GACTTGTATA CCTTOTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 

Seq ID NOi 38 Protein sequence 
Protein Accession NP_00a407 

1 11 21 31 41 51 

I I I I 1 1 

MKKSGVLFLI. GIIIjLVLIGV QGTFWRKGR CSCISTKQGT IHU2SLKDIiK QFAPSPSCEK 60 
XBIXATLKNG VQTCLNPDSA DVKELXKKHE XOVSQXKXQR NGKRHQKKKV UCVRKSQRSR 120 
QXKTT 

I 

Seq ID NO: 39 DKA sequence 

Nucleic Acid Accession tfi MM_006670 

Coding sequence I 85*. 1347 

1 11 21 31 41 51 

I I i I I I 

CGGGCTCGCX3 CCCTCOGGGC CCAGCCTCCC GAGCCTTCGG AG0GGG06CC GTCCCAGCCC 60 

AOCTCOGGGG AAAGQOQAOC OGCiQATGCCT GG66GGT6CT GCOGGGGGOC CX3C06COGGQ 120 

GAC6G606TC TGOGQCTGGC GCGACTAGCX3 CTOGTACTCC TGGGCTGGGT CTCCTCGTCT 180 

TCTOXACCT CCTCQGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 240 

TCOGCCCAGC CCCCGCTGCC GGACCAGTGC CCCXKXSCTGT GCGAGTGCTC CGAGGCAQOG 300 

CX3CACAGTCA AGTGC3GTTAA CCGCAATCTG AGGGAGGTOC CCAOGQACCT GCCOGCCTAC 360 

GTGOGCAACC TCTTOCTTAC GQGCAACX»G CTGGOOGTGC TCCCTGCOGQ OGCCTTOGCC 420 

0GC006CGGC C6CTGGCX3GA GCTGGCOGCS CTCAACCTCA GCGGCAGCCG CCTQGAOGAO 480 

GTGOQOOOGG GCGCCTTCQA QCATCTGCCC AGCCTGCX^CC AGCTCGACCT CAGCCACAAC 540 

CCACTGGCCG ACCTCAGTCC CTTCX3CTTTC TCGGGCAGCA ATGCCAGOGT CTOGGCCCCC 600 

AGTCCCCTTG TGOAACTQAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCA6AAC 660 

OGGAGCTTOS AGGQCAT6GT GGTOG0 3Q0C CTGC TGGCX3Q GOGOTGCACT QCAGGGGCTC 720 

CGCCGCTTGG AGCTQGCCA6 CAACCACTTC CTTTAGCTQC OQOQQGATGT 6CTG6CCCAA 780 

CTGCCCAGCC TCAGGCACCT GGACTTAAQT AATAATTCGC TGGT6AGCCT GACCTACGTO 840 

TXXrTTCOGCA ACCTGACACA TCTAGAAAGC CTCXaiCCTGG AGGACAATGC CCTCAAGGTC 900 

CTTCACAATG 6CACCCTGGC TGA6TTGCAA GGTCTACCCC ACATTAG66T TTTCCTGGAC 960 

AACAATCCCT OGOTCTGOGA CTGCCACATG GCA6ACATGG TGACCXOGCT CAAGGAAACA 1020 

QAGGTAGTGC AGGGCAAAGA CX3QQCTC A CC TGTX^TATC CGGAAAAAAT GA6GAAT0GG 1080 

OTCCTCTTQQ AACTCAACAG TGCTGACCTG GACTGTGACC CDGATTCTPCX: CCCATCCCTG 1140 

CAAACCTCTT ATGTCTTCCT GQGTATTGrr TTAOCCCTQA TAGGCGCTAT TTTCCTCCTG 1200 

GTTTTGTATT TQAACGGCAA GGG6ATAAAA AAGTGGATGC ATAACATCAG AQATGCCTGC 1260 

AGOGATCACA TGGAAGGGTA TCATTACAOA TATGAAATCA ATGOGGACOC CA6ATTAACA 1320 

AACCTCAGTT CTAACTCGGA TGTCTGAGAA ATATTAGAGG ACAGAGCAAG GACAACTCTG 1380 

CATQAGATQT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440 

TAQATACAAC GGACTTTGAC TAAAAGCAGT GAAGGQQATT TGCTTCCTTG TTATGTAAAG 1500 

TTTCTCGGTG TGTTCTGTTA ATGTAAGAOG ATGAACAGTT GTGTATAOTG TTTTACCCTC 1560 

TTCTTTTTCT TGOAACTCCT CAACAOGTAT 66AGGGATTT TTCAGGTTTC AGCATGAACA 1620 

TGGGCTTCn GCTOTCTGTC TCTCTCTCA6 TACAGTTCAA GGTGTA6CAA GTGTACCCAC 1680 

ACAGATAGCA TTCAACAAAA GCTGtXTCAA CTTTTTCGAO AAAAATACTT TATTCATAAA 1740 

TATCAGTTTT ATTCTCATGT AOCTAAGTTO TQGAGAAAAT AATTGCATCC TATAAACTGC 1800 

CTGCAGA06T TAGCAGGCTC TTCAAAAIAA CTGCATGGTG CACAGQAGCA OCTGCATCCA 1860 

AGAGCAT6CT TACATTTTAC TGrTCTGCAT ATTACAAAAA ATAACTTGGA ACTTCATAAC 1920 

TTCTTTGACA AAGTAAATTA CTrTTTTQAT TGCAGTTTAT ATGAAAATGT ACIGATTTTT 1980 

TTTTAATAAA CTGCATGGAG ATCCAAOOQA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 2040 
ATTCTTAAAA GAA 

Seq ZD MO: 40 Protein sequence 
Protein Accession #i NP_00666X 

1 11 21 31 41 51 

I 1 t ) I I 

NPGGCSRGPA AGDGRLRIAR LALVLLGWV8 8S8PTSSASS FSSSAPFLAS AVSAQPPLFD 60 

QCPALCECSB AABTVXCVNR NLTBVPTDIiP AYVIUrLPLTO MQIAVLPAGA FARRPPLABL 120 

AAUIbSOSRL DEVRAGAFEH LPSLRQLDLS HNPLADLSPP AP8GSKASV8 AP8PLVBLIL 180 

MKIVPPEDSR QNR8FEQMW AALLAGRAUQ GLRRLBLASN HPLYLPSDVL AQLPSLRBLD 240 



963 



wo 03/042661 
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5 



15 

20 
25 



30 
35 



50 
55 



70 



80 



LSNNSLVSLT YVSPRNI^THL BSLHLEDNAL KVLHNGTLAB LQC3LPBIRVP LDNNPHVOX: 300 
KMADMVTWLK ETEWQGKDR LTCAYPEKMR NSVUiEUfSA DLDCDPILPP SLQTSYVFLG 360 
IVLALXGAIF LLVLYUIRK6 ZKKWMIDIIRD ACRDHMBGYH YRYBZNftDPR LTNLSSNSDV 

Seq ZD NOs 41 DMA sequence 

Nucleic Acid Accession li Eos sequence 

Coding sequence! 1..927 



1 11 21 31 41 51 

10 I I I I I I 

ATGCCTGGGG GC5TGCTCCCQ GGGCCCOGCC GCCGGGGAOG GGOCTrCTGCG GCTGGCGCGA 60 

CTAGOGCTGG TACTCCTGGO CTOaOTCTCC TCXTTCTTCTC CCACCTCCTC GGCATCCTCC 120 

TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCOX GCTGCXXJGAC X80 

CAOTGCXXXIG CGCTGTGCGA GT6CTC00AG GCAGCGCGCA CAOTCAAGTG CX3TTAAC0GC 240 

AATCTGACOS AGGTGCCCAC GGACCTGCCX: GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 

AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCOGCGGG AT6TGCTG6C GCAACTGGCC 360 

AGCCTCAGQC ACCTGGACTT AAOTAATAAT TOGCTGGTGA GCCTGACCTA C g l W C C l'TC 420 

CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 

AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GQACAACAAT 540 

CCCTGGOTCT OCGACTGCCA CATGGCAGAC ATGGTQACCT GGCTCAAGGA AACAQAGGTA 600 

GT6CAGGGCA AAGACOGGCT CACCTGTGC31 TATCOGOAAA AAATGAGGAA TOGGGTCCTC 660 

TTGQAACTCA ACAOTGCTGA CCTGGACTGT GACCOQATTC TTCCCCCATC CCTGCAAACC 720 

TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCX? CTATTTTCCT CCTGGTTTTG 780 

TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAOATOC CTGCAGGGAT 640 

C ACATGG AAG GGTATCATTA CAGATATGAA ATCAATGCCG ACCCCA6ATT AACAAACCTC 900 
AGTTCXAACT OGGATGTCCT 06AGTGA 



Seq ID NOt 42 Protein sequence 
Protein Accession #i Bos sequence 

1 11 21 31 41 51 

I I I I I I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS PSSSAPFLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR KLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SZiRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQOL PHIRVFLDNN 180 
PHVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEXMRNRVL LELNSADZiDC DPILPPSLQT 240 
SYVFLGIVLA LZOAIFLLVL YUIRRGIKKW HKNIRDACRD HMBOYHYRYS INADPRXjTNIi 300 
SSNSDVLE 

40 Seq ID NO I 43 DNA aequence 

Nucleic Acid Accession Mt NM_05B173 
coding sequence i 68.. 340 

.-1 11 21 31 41 51 

45 I I I I I I 

AGCBCCTTGC CTTCTCTTAG GCTTT6AAGC ATTTTTGTCr GTGCTCCCTG ATCTTCAGGT 60 

CAGCAGCATG AAGTTCrTAG CAGTCCTGOT ACTCTTGGQA GTTTCOVTCT TTCTGOTCTC 120 

TGCCCAGAAT CCQACAACAG CTGCTCCAGC TQACACGTAT CCAGCTACTG GTCCTGCTGA 180 

TGATGAAQCC CCTGATGCTG AAACCACTGC TGCTQCAACC ACTGCGACCA CTGCTGCTCC 240 

TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA QACATTCCAG TTTTACCCAA 300 

ATQGGITGGO GATCTOCOQA ATaGTAGAaT GTOTCOCTGA QATGQAATCA GCTTGAGTCT 360 

TCIGCAATTQ QQTCACAACT ATTCATGCTT CCTOTGATTT CATCCMCTA CTTACCTTGC 420 

CTAGGATATC C O CTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 
OOQAGCTAAC AT 



Seq ID NOt 44 Protein sequence 
Protein Accession #: NP 477521 



1 11 21 31 41 51 

60 I I I I I I 

KKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YFATOPADDE APDABTTAAA T1ATTAAPTT 60 
ATTAASTTAR KDIPVLPKWV GDLPNGRVCP 

^ Seq ID NOi 4S DNA aequence 

03 Nucleic Acid Accession NM_00009S.l 
Coding sequence: 26.. 2299 



1 11 21 31 41 51 

I 1 I 1 I I 

CAQCACCCAQ CTCCCC6CCA COQCCATGQT GCCOGACACC aCCTQOGTTC TTCTGCTCAC 60 

CCT6GCTGCC CTC66G60GT CCGGACAGGG OCAGAGCCOQ TTGGG CTCA g ACCTGGGCCC 120 

OCAGATGCTT CGGGAACTGC ABGAAAOCAA 08CG606CXO CAGQACGTGC GG6ACTGGCT 180 

G06GCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACO GTOATG6A6T 6TGACGCGTG 240 

_ CGGGATGCAG CAGTCAGTAC GCACCG6CCT ACCCA60QT6 CG6CCCCTGC TCCACTGCGC 300 

75 GCCOGGCTTC TGCTTCCCOQ GCGTGGGCTX3 CATCCAQACG GAQAOCQGOG GCOGCTGCGG 360 

CCCCTGGCCC GC GGGCTT CA CGG6CAAGGG CTCGCACTGC ACOGAOQTCA AC6AGTGCAA 420 

08CCCAC0CC TGCTTCCCCC GAGTCCQCTG TATCAACACC AOCCOGGGOT TC06CTGCGA 480 

G6CTTGCCCG COGGGGTACA GOOGCCCCAC CCACCA6G6C GTGGGGCTGG CTTTCGCCAA 540 

G6CCAACAAG CAGGTTTGCA OGOACATCAA CQAOTGTGAO ACCGGGCAAC ATAACTGOOT 600 

CCCCAACTCC GTOTGCATCA ACACCOGGGG CTCCTTCCAG TGOGOCCOOT GCCSUGCCCGG 660 

CTTCSTGGGC GACCAG60GT COGGCTGCCA GOQOQGCGCA CAQ06CTTCT OGGCGGAGOQ 720 

CTCOCCCASC GAOTOCCAOO AGCATGCA6A CT60GT0CTA QAaOSGQATO QCTOGOQQTC 780 

GTGOGTGTGT GGOQTTGGCT GGOCOGGCAA GGGGATCCTC TGTGGT08CG ACACTQACCT 840 

AQA0G6CTTC COQGAOQAGA AGCTGOGCTO GCGGOAGOOG CAGTOOOGTA AGGACMCTO 900 
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COTGACTGTQ CCCAACTCAO G0GAGGA6GUV TGTOGAOOQC GATOGChTGO GAOVOBCCTG 960 

OGATCCGGAT GC06A0G006 AO G G GG TCCC CAATGAAAAG OAOUVCTGCC CGCTGGTGOG 1020 

GAACCCAC5AC CAGCGCAACA CGGAOGAGQA CAAGTGGGGC GAT6CGTG06 ACAACTGCCO 1080 

GTCCCA6AAG AACX^CGACC AAAAOGACAC AGACCAGGAC GGCCGGGGCG ATGOCTTGCGA 1140 

CGAOGACATC GACGGCGACC GGATCOGCftA CCAGGCCX3AC AACTGCCCTA GGOTACCCAA 1200 

CTCAOACCAO AA06ACA0TG AT0GGGA1GG TATAGG06AT OCCTGTGACA ACTGTCCOC A 1260 

GAAGA6CAAC C0G6ATCAGG CGGATGTGOA CCAOQACTTT QTG6GAGAT0 CTTOTQACAO 1320 

CX3ATCAAGAC CAGGATGGA6 ACGGACATCA GGACTCTCGG GACAACT6TC CCAGGGTCCC 1360 

TAACAGTGCC CAGQAGQACT CAGACCACGA TGGCCAGGGT GAT6CCTGCG AGGACXSAOGA 1440 

OGACAATGAC GGAGTCCCTG ACAGTGGGGA CAACTGCCGC CTGOTQCCTA ACCCOSGCCA ISOO 

6GAGGACG0Q QACAGGGAOO G08TGGG0QA 0GT6TGCCAG GAOGACmO ATGCAGACAA 1560 

GGTG6TAGAC AAGATOGAOG TGTGTCOGGA 6AA0GCTGAA GTCAOGCTCA CGQACTTCAG 1620 

GGCCTTCCAG ACAGTOQTGC TGGACCCGGA GGGTGAOGCG CAQATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAAT6AAC AGCGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GCCTTCAAT6 GGGTGGACTT CX3A0GGCACG TTCCATQTGA ACAOG6TCAC 1600 

0QAT6A0QAC TATOCXSGGCT TC A TC I TT GG CTACCAGQAC AGCTCCAGCT TCrAGGTGGT 1860 

CATGTGGAAG CAGATGQAGC AAAOGTATTG 6CAGG03AAC CCXITTCOSTO GTGIGGCCGA 1920 

GCCTGGCATC CAACTCAAOG CTCTTGAAOTC TTCCACAGGC CCCG6GGAAC AGCTGCGGAA 1980 

OGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGOGG CTGCTGTGGA AOQAOC06CG 2040 

AAACGTG G GT T6GAAGGACA AGAA6TCCTA TOGTTGGTTC CTGCA6CACC GGCCCCAAGT 2100 

GGOCTACATC A6GGIGCGAT TCTATGAGGG CCCTGAGCT6 GTGGCG6ACA GCAAGGTGGT 2160 

CTTGGACACA ACCATGGGGG GTGGCGGCCT 660GGTGTTC TGCTTCTCCC AGGAGAACAT 2220 

CATCTGGGCC AACCTGCGTT ACC6CTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGQACCCOCC GGATGACMC CACCCTCACC 2340 

GCGGCTGQAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CX3TCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAG6ACAAA ATAAAGTGT6 TGTGCAGOG 

Seg ID KO: 46 Protein sequence 
Protein Acceeslon #i np_000086.1 

1 11 21 31 41 51 

I 1 » ' ' ' ^ 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVSD WLRQ(JVREIT 60 

FLKNTVMECT ACQMQQSVRT GLPSVRPLLH CAPCFCFPOV ACIQTBSGGR CGPCPAGFTG 120 

NGSHCTDVNE CNAHPCPPRV RCIIITSPQPR CEACPPGYSQ PTHQGVGLAP AKAH KQVCTD 180 

INECBTGQHN CVPNSVCINT RGSFQGQFGQ PGFVGD0A9G OQRGAQRPCP D6SPSBCREH 240 

AOCVLERDGS RSCVCRVGWA GNGILCGRDT DUX3FPDEKL RCPEPQCRKD NCVTVFNSGQ 300 

EDVDRDGIGD ACDPDADGDG VPNEKDNCFL VRNFDQRNTD EDKH6DACDN CRSQKNDDQK 360 

DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDQ DGIGDAOMTC PQKSNPDQAD 420 

VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDAOJD DDDNDGVFDS 480 

RDKCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKZDVC PENAEVTIjTD FRAFQTWU 540 

PEGDAQIDPN WWLMQGREI VQTMNSDPGL AVGYTAFNGV DFBGTFHVMT VTDDDYAGPI 600 

FGYQDSSSFY WMWKQKEQT YWQANPFRAV AEPGIQLKAV KSSTGPOSQL RNALNHTGDT 660 

ESQVRLLWXD PRMVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN WIiDTTMRGO 720 

RLGVFCFSQE NIIWANLRYR an>TIPEDYE THQZiRQA 

Seq ZD NOt 47 DWA sequence 

Nucleic Acid Accession #: nm_001S6S.1 

Coding sequence : 67 . . 3 63 

1 11 21 31 41 51 

I I I I I I 

QAGACATTCC TCAATTQCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 

AGCACCATGA ATCAAACIGC GATTCT6ATT TGCTGCCTTA TCTTTCTOAC TCTAAGTGGC 120 

ATTCAAGQAO TACCTCTCTC TAQAACOGTA CGCTGTACCT GCATCAGO^T TAGTAATCAA 180 

CCTGTTAATC CAAGGTCTTT AGAAAAACTT QAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 

0GTGTT6AGA TCATTGCTAC AATGAAAAAG AAGGGTGAQA AGAGATGrCT GAATCCAGAA 300 

TOQAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGO AAATGTCTAA AAGATCTCCT 360 

TAAAAGCAGA GGGGAGCAAA ATOSATGCAO TQCTTGCAA6 GATGGACCAC ACAGAGGCTG 420 

CCTCrCCCAT CACTTCCXTTA CATGGAGTAT ATGTCAAGOC ATAATTGTTC TTAGTT TGCA 480 

GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAOC TGCTACTACT CCTGTAGGAA 540 

GOTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 

GCTCTACTGA GGTGCTAT6T TCTTAGTGGA TGTTCTQACC CTGCTTCAAA TATTTCCCTC 660 

ACCTTTCCCA TCTTOCAAGO OTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 

TCAGAATCTC AAATAACTAA AAG6TATGC3^ ATCAAATCTG CTTTTTAAAO AATGCTCTTT 780 

ACTTCATGQA CTTCCACTQC CATCCTCCCA AGGGQCCCAA ATTCTTTCAG TGGCTACCTA 840 

CATACAATTC CAAACACATA CA6GAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 

CTTATTTAAT GAAAGACIGT ACAAASTATA AQTCTTAGAT GTATATATTT CCTA7ATTGT 960 

TTTCAQTGTA CATGQAATAA CATOTAATTA AGTACTATQT ATCAATGAGT AACAGGAAAA 1020 

TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGITACAT AAGATAAATG T0CTGAAT6G 1080 
TTTTCAAATA AAAAIGAGGT ACTCTCCT G Q AAATATTAAO 

Seq ZD HOi 48 Protein sequence 
Protein Accession «i NP_001556.1 

1 11 21 31 41 51 

1 1 I I 1 I 

KNQTAILICC LIPLTLSOIQ GVPLSRTVRC TCISISNQPV NPRSLBKLBI IPASQPCPRV 60 

BIZATMKKXG BKRCWPBSK AZKNLLKAVS XEKSKR8P 

Seq ID KO: 49 DMA sequence 

Nucleic Acid Acceeeion ft: XM_057014 

Coding sequence: 143.. 874 
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1 

1 

G6QAGGGAGA 
CGOGGCGGAG 
G6CTGCC0GG 
C060GGCCTC 
CCCCAAG6G0 
AATGT6CTTA 
CATTCOGGGT 
TCTGAGGGAA 
ATTQAHTTAT 
AAATA6T6CT 
CTGTCAQCGT 
AGCTATAATT 
aVCTTCTTC T 
CTGOGTTGGC 
TTCTCGCRTC 
TTTTTTTATT 
CATCTQAATG 
TTTAAATCTA 
TGG TTAGAAT 
GGTCTTTTOT 
TGTACAATTT 
CAACCTTAAA 




AAGCAAAAGO 
CAAG60CCA0 
ACACCT6GGA 
AGCTTTGAGG 
GGaVTAGATC 
CTAAGAGTTT 
TGOTATTTCA 
TATTTGGACC 
GTGQAAGGAC 
ACTT6TTCA0 
ATTATTCSAAO 
ATGCCTTGQA 
AAAAGCAAAG 
OCATTATTCA 
ACTTTCTTCA 

tttttcxx:tt 

GTAAATGTTA 
AAAAAAAAAA 



21 

1 

GGTGAAAOGC 
ACCAOGTTCC 
CCATGCX3ACC 
T6CTGCTGCA 
C6CAGCTC00 
CAGGAGTGCC 
TCCCAGGTCG 
AGTCCTGGAC 
TTQ6GAAAAT 
TGTTCAGTGG 
GATTCAATGG 
AAGGAAGCCC 
TTTGTGAAGG 
ATTACCCAAA 
AACTACCAAA 
ATGGTTCACT 
CTAAATATGT 
TTTTGCTTCA 
TAGTCACATT 
AGTAT AOCAT 
AGAATTTTTT 



31 
I 

OCATTGATGC 
TCTCCTCGGT 
C CAGGGCC CC 
GCTGCCCGOQ 
GraOAGGGAS 
TGOTOQAGAC 
GGATGGATTC 
ACGCAACTAC 
TGOQQAGTQT 
CTCACTTOQG 
AGCTGAATQT 
TGAAATGAAT 
AATTGGTGCT 
AG0AGAT6CT 
ATAAAT6CTT 
TAAATGACAT 
TTACAOACCA 
ATCAAAAGTG 
CTCTCAACCT 
TTTTAAAAAA 
TTATATCTGT 



41 
I 

AOCCTOOGGC 
CTCCTCOQCC 
GCGGCCTGCC 
COGTOGAOOG 
GTGGTGGACC 
GGGAGCCCTO 
AAAGGAGAAA 
AAGCAQTQTT 
ACATTTACAA 
CTAAAATGCA 
TCAGGACCTC 
TCAACAATTA 
GGATTAGTGO 
TCTACTGQAT 
TAATTTTCAT 
TTTAAATAAG 
AAGTQTGATT 
GTTTCAATAT 
ATAATTT6GA 
ATATAAAA6C 
TAAATAAAAA 



SI 
1 

GGCCTCGQAG 
TCCAQCTCCO 
CGO^GCXSGCT 
CCTCTGAGAT 
TOTATAATGG 
GGGCCAATGG 
AGGGGQAATQ 
CATGGAGTTC 
AGATGOGTTC 
GAAATOCATG 
TTCCCATTGA 
ATATTCATCG 
AT6TTGCTAT 
GGAATTCAGT 
TTQCTACCTC 
TTTATGTATA 
TCACACTGTT 
TTTTTTTAGT 
ATATTQTTQT 
TACCAATCTT 
TTATTTCCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



Seq ID NO: 50 Protein sequence 
Protein Accession XP_057014 

1 11 21 31 41 51 

111)11 

MRPQOPAASP QRLRGIiLLLL Z«LQLPAPSSA SEIPXGKQKA QUIQREWDL YNGMCLQGPA 60 

GVPGRDGSP6 AMOIPGTPGI P6RDGFKGBK GBCbRBSFEE SWTFNYKQCS W88UIYGIDL 120. 

GKIAECTFTR HRSNSALRVL FSGSUOiKCS NACGQRWYFT FMQABCSGPL PIBAIIYLDQ 18*0 

GSPEMN8TZK IHRTSSVB6L CBGIGAGLVD VAIHVGTCSD YPKGDASTGH M&VSRXIIBE 240 

I.PK 

Seq ID KOt 51 DNA sequence 

Hucleic Acid Accession #i 1IM_020974 

Coding sequence: 81.. 3080 

1 11 21 31 41 51 

I 1 I I I I 

GGCXrrCCQCa CACACCTCCC CG0GCX3GCC0 CCXSCXaVCOGC CCGCACTCOQ COGCCTCTGC 60 

CCGCAACCGC TOAGCCATCC ATGGGGGTCG CX3GGC0GCAA CCGTCCOGGG GOGGCCTGGG 120 

CGGTGCTGCT GCTGCTGCTG CTGCTQCCGC CACTGCTGCT GCTGGOQGGQ GCOGTCCGGC 180 

C0GGTC3SGGG C0GT6CCG0Q GGGCGGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 

AT6ACT6CCA TGCCGA06CC CTGTGTCAGA ACACACCCAC CTGCTACAAG TGCTCCTGCA 300 

AGCCTGGCTA CCAAGGGGAA GQCAGGCAGT GTGAGGACAT CQATGAATGT GGAAATGAGC 360 

TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGO CAATTATCGT TGCACTTGTT 420 

TTGATGGCTT CATGTTGGCT CATGACX3GTC ATAATTGTCT TGATGTGGAC GAGTGCXn^GG 480 

AGAACAATGG GGGCTGCCAO CATACCTGTO TCAAOQTCAT GGGGA6CTAT GAOTGCTGCT 540 

GCAAGGAGGG GTTTTTCXnXS AGTGACAATC AGCACACCT6 CATTCAGOSC TCGGAAQAGG 600 

GCCTGAGCTG CATGAATAAG GATCACGGCT OTAGTCACAT CTGCAAQGAQ GCCCCAAGG6 660 

GCAGCGTCGC CTGTGAGTGC AGGCXTGGTT TTGAGCTGGC CAAGAACCAG AGAGACT6CA 720 

TCTTGACCT6 TAAGCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 

GCXXAGAGTG CAOCTGCCAT CCACAGTACA AGAT6CACAC AGATGGGAG6 AGCTGCCTTG 840 

AGOaAGAGGA CACrGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTa GTGGATGGGG 900 

ATAAACGG6T 6AAACGGCG0 CTGCTCATGG AAACGTGTGC TGTCAACAAT GQAGGCTGTG 960 

ACaSCACCTG TAAGGATACT TOGACAGGTG TCCACTGCAG TTOTCCTQTT GGATTCACTC 1020 

TCCAGTTGGA TG0GAAGAC3V TGTAAAGATA TTGATGAGTG OCAGAOCCGC AATGGA66TT 1080 

GTGATCATTT CIGCAAAAAC ATGGTQGGCA GirTTGACTG OGGCIGCAAO AAAGGATTTA 1140 

AATTATTAAC AGATGAGAAG TCTTGOC A AG ATGTGGAT6A GTGCTCTTTG GATAGGACCT 1200 

GTGACCACAG CTGCATCAAC CACCCTGQC3V CATTTGCTTG TGCTTGCAAC COAOGOTACA 1260 

CCXrPGTATGG CTTCACCOVC TGTGGAGACA CC3UVTGAGTG CAGCATCAAC AACGGAGGCT 1320 

GTCA6CA06T CTQTGTGAAC ACA6TGGGGA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 

A6CTCCACTG GAATAAAAAA GACTGTGTOQ AA6TGAAG6G GCTCCTOCGC ACAAGIGTGT 1440 

CACCCOGTGT 6TCCCTGCAC TG060TAAGA GT6GTGGAGG AGACGGGT6C TTCCTCAGAT 1500 

GTCACTCTGG CATTCACCTC TCTTCAQATO TCACCACCAT CAGOACAAQT GTAACCTTTA 1560 

AGCTAAATQA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCOGAG GQTCTGCGAC 1620 

CAGCACTACC AGAGAAGCAC AGCTCAGTAA AA6AGAGCTT CO6CTA0GTA AACCTTACAT 1680 

GCAGCTCZGG CAAGCAAOTC 0CAGGA6CCC CT0600QAOC AAGCAOCOCT AA6GAAATQT 1740 

TTATCACtGT TGAGTTTGAG CTTGAAACTA AGCAAAAGQA GOTGACAGCT TCTTGTQACC 1800 

TGAGCTGCAT CQTAAAQCQA ACCX3AGAAGC GGCTCOOTAA AGCCATCCXa: ACGCTCAGAA 1860 

AGGCCX5TCCA CAGGQAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 

AAAAGCCTCC CAGAACATCT GAAOGCCAGG CAGAOTCCTG TGGAGTGG6C CAGGGTCATO 1980 

CAGAAAAOCA ATGT6TCAOT TOCAOGGCTG GGACCTATTA TGATGGA6CA CX3AOAA06CT 2040 

GCATTTTAT6 TCCAAATGGA ACCTTGCSVAA ATGAGQAAGG ACAAAT6ACT TQTGAACCAT 2100 

GCCCAAGACC AGQAAATTCT GQGGCCCTOA AGACXXX»GA AGCTTGGAAT ATGTCTGAAT 2160 

GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 

GTGCCCTGGO CAC6TTCCAG CCTGAAGCTG GTOGAACTTC CTGCTTCCCC TGTG6AGGAG 2280 

6CCTTGOC3^ CAAACATCA6 GGAGCTACTT GCTTTCA6GA CTGTGAAACC AGAOTTCAAT 2340 

OTTCACCIGG ACATTTCTAC AACAOCAGCA CTCACGOATG TATTOOTTQC CCAGTGQQAA 2400 

CATACCAQCC TGAATTTGGA AAAAATAATT GTGTTTCTT6 CCCAGGAAAT ACTAOSACrO 2460 

ACrrTQATGa CXCCACAAAC ATAACCCAOT GTAAAAAGAG AAOATOIGGA GGGGAGCTGO 2520 
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PCTAJS02/36810 



5 



GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCC3U3G CAATTACCCA GCCAACACOO 2580 

AGTGTACGTG GACCATCAAC CX»CCCCCCA AGCGC0GC3VT CCrGATCGTG GTCCCTGAGA 2640 

TCTTGCTGCC CATAGAGGAC GACTGTGGGG ACTATCT66T GAT60GQAAA ACCTCTTCAT 2700 

CCAATTCTOT GACAACATAT GAAACCTGCC AGACCTA0C3A ACGCCCCATC GCCTTCACCT 2760 

CCAOGTCAAA QAAGCT6TGG ATTCAGTTCA AGTCCAATQA AGGGAACAGC 6CTAGAGGGT 2820 

TCCAGGTCCC ATACGTQACA TATQATQAGG ACTACCAGGA ACTCATTQAA GACATAGTTC 2880 

GAGATGGCAG GCTCTATGCA TCTGAGAAOC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 

TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACA6CXX3VGG 3000 

AGTCCCX5AGA GATGTTTCCA AGATCGTTCA TCOGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 

10 TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCftCTCA ATACAAATGT TCTGCTATAG 3120 

GGTTGGTGGG ACAGAGCTGT CrTCCTTCTG CATGTCAGCA CAOTOGGOTA TTSCTGCCTC 3X80 

CarrATCAGT GACTCATTAG AGTTCAATTT TTATAQATAA TACAOATATT TTGGTAAATT 3240 

GAACTTOGTT TTTCrTTCCC AGCATCX5TG0 ATGTAGACTO AQAATG6CTT TOAGTGGCAT 3300 

CAGCTTCTCA CTGCTGTGGG CXX5ATGTCTT GQATAGATCA CGGGCTGGCT GAGCTGGACT 3360 

TTGGTCAGCC TAGGTGAGAC TCACCT6TCC TTCTGGGGTC TTACTCCTCC TGAAGGAGTC 3420 

TGTAGTG6AA AGGAGGCXAC AGAATAAGCT 6CTTATTC7G AAACTTCAGC TT0CTCTA6C 3480 

CCGGCCCTCT CTAAGGGAGC CCTCT6CACT CGTGTGCAGG CTCT6ACCAG GCA6AACAGG 3540 

CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CXACCTTGAG ACCTGGGA6G 3600 

ACTCAGTTTC TCXACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTQ 3660 

2U AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA OCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
ASCACTTCTG GAGACAT 
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Seq ID NOi 52 Protein sequence 
Protein Accession #: NP_066025 

1 11 21 31 41 51 

I I I I I I 

M6VAGRHRPG AAWAVLLLLL LLPPLLLIiAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 

LCQNTPTSYK CSOCPGYQGH GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 

HDOHNCLDVD ECLENNGGCQ BTCVNVMGSY ECCCKEGFFI« SDNQHTCIHR 8EEGLSCMNK 180 

DHGCSaiCKE APRGSVACBC RPGFELAKNQ RSCXLTGHH3 NGGGQEISCDD TA06PECSCH 240 

PQYXHHTDGR SGLEREDTVb EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 

STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 

SCQDVDBCSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 

TVGSYECQCH PGYKLKNNKK DCVEVKGLLP TSVSPRVSLH 06KSGGGDGC FI<RCHSGIHL 480 

SSDVTTIRTS VTFXLNEGXC SLKNABLFPE GIjRPALPBKH SSVKBSFRYV NLTCSSOKQV 540 

PGAFGRPSTP KENFITVBFB LEIMQKBVTA SCDLSCIVKR TEKRURKAIR TLRXAVRREQ 600 

FHLQLSGNHL DVAKKPPRTS BRQAB8CGVG QGBAENQCVS CRAGTVYDGA RERCILCPNG 660 

TFQMEEGOMT CEPCPRPGKS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 

PBA6RTSCFP CGGGLATKHQ GAT8FQDCBT RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 

KNNCVSCPOl TTTDFD6STN ITQCKNRRGQ OBLGDFTGYI E8FNYP(»IYP AKTECTWTIN 840 

PPPKSRXLIV VPBZFLPZED DCBDYLVKRK T8S8NSVTTY ETOQTYERPI AFTSRSKXLH 900 

IQFKSKEGIIS ARGFQVPyVT YDEDYQELZB DIVRDGRLYA SBIHQBILKD KKLZXALFDV 960 
LAKPQIIYFKY TAQBSREKFP RSFZRLLRSK VSRFLRPYK 

Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession «: NM_014211 

coding sequence! 157.. 1479 " 

1 11 21 31 41 51 

I I I I I I 

GG6ACAGG6C TGAGGATGA6 GAGAACCCTG GGGACCGAGA AGAC0GT6CC TTGCCGGGAA 60 

GTCCTGCCTG TAGGCCTGAA GGACTTGCCX: TAACAGA6CC TCAACAACTA CCTGGTGATT 120 

CCTACTTCAG CCCCTTGQTO TGA6CA6CTT CTCAACATGA ACTACAGCCT CCACTTGGCC 180 

TTCGTGTGTC T6AGTCTCTT CACTQAGAGG ATQTGCATCC AG6GGAGTCA GTTCAACQTC 240 

GAGGTC9G6CA 6AA8T6ACAA GCTTTGCCTG CCTG6CTTT6 AGAACCTCAC AGC3U3QATAT 300 

AACAAATTTC TCAGGCCCAA TTTTGGTGGA GSAACCCXtTAC AGATA606CT 6ACTCTQGAC 360 

ATTGCAAGTA TCTCTAGCAT TTCAQAGAGT AACATGGACT ACACAGCCAC CATATACCTC 420 

CQACAGCGCT GGATGGACCA GCGQCTQGTG TTTGAAGGCA ACAAGAGCTT C3VCTCTGGAT 480 

GCCCGCCTOQ TGQAGTTCCT CTGGGT6GCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540 

TTCCTCCATG AAGTCACTGT GOGAAACAGG CTCATCCX30C TCTTCTGCAA TGGCAGGGTC 600 

CTGTATGCXX: TCAGAATCAC GACAAGTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660 

ATGGACACAC AGACATGCAA GTTGCAQCTG GAAAGCTGGQ GCTATGATGG AAATGATGTG 720 

GAOTTCACCT GOCTGAQAOO GAACQACTCT GTGOGTGGAC TGGAACACCT GOGGCTTGCT 780 

CAGTACACCA TAGAGOGGTA TTTCACCTTA GTCACCAGAT CGCAGCAGGA GACAGGAAAT 840 

TACACTAGAT TGGTCTTACA GTTTGAGCTT CX^AGGAATO TTCTOTATTT CATTTTGGAA 900 

AGCTAC6TTC CTTOCACTTT OCTGGTGGTG TTGTOCTGGO TTTCATTTTO OATCTCTCTC 960 

GATTCAGTCC CTGCAAGAAC CTGCATTGGA GTGAOGAOCQ TGTTATCAAT GACCACACTG 1020 

ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGQC CAT06ATGT0 1080 

TAOCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTT6C TAGAATATGC AGTTGCTCAC 1140 

TACAGTTCCT TACAOCAGAT GGCAGCCAAA OATAGG6GQA CAACAAAOQA AOTAOAAGAA 1200 

GTCAOTATTA CTAATATCAT C3UU»6CTGC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1260 

GCCAGOITTG AAATTTCCAG C6ACAACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320 

GACAAGTTCA AGTTTGTCTT COGAGAAAAG ATGOGCAGGA TTGTTGATTA TTTCACAATT 1360 

CAAAACCCCA 6TAATGTTGA TCACTATTCC AAACTACTGT TTCCTTTGAT TTTTATGCTA 1440 

GOCAATGTAT TTTACTGG6C ATACTAGAIG TATTTTTGAG TCAAIGTTAA ATTTCTT6CA 1500 

TGCCATAGGT CTTCAACAOG ACAAGATAAT GATGTAAATG GrATTTTAGG CCAAGTGTGC 1560 

ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TGAOTCTTTC TGCTCAAAGA 1620 

ATOAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG QATCTTGTAA 1680 

TAGAAACATC AGTCCATTCC TCTTTCATCT TAATCAAGGA CATTCCCATG GAGCCCAA6A 1740 

TTACAAATOT ACTCAGGGCT GTTTATTOGG TGQCTOCCTO OTTTGCATTT ACCTCATATA 1800 

AAGAATGGGA AG6A6ACCAT TGGGTAAOCC TCAAGTGTOl GAAOrrGTTT CTAAAGTAAC 1860 

TATACATGTT TTTTACTAAA TCTCTGCAaT GCTTATAAAA TACATTGTTG CCTATTTAGO 1920 

GAGTAACATT TrCTAGTTTT ■ itli'TTCTG G T TAAAATBAAA XATGGGCTTA TGTCAATTCA 1980 

967 
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5 
10 
15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TTGGAAGTCft 
ATACCJICAJ^C 
ACrrgAAGAAA 
GGATTOCOCA 
TTTATTATTA 
GGAAATGGAA 
A6GCTTGCAG 
AAAOTACTGO 
GAGCAACACT 
TGCTCTAATG 
GGCTTAGCTT 
TGAGTGGCTT 
AACAATGA6A 
CCAGCCAGAO 
AGT6TGCCA0 
TTAATTTCTT 
ACCAAG6CTC 
CTTTTCCCAT 
ACAAAATATT 
TATGA6CCAA 
AACCCCACTT 
AATATACCAT 



ATGCACTAAC 
AGAATTATCC 
AAATTAOTAO 
TACTQGAAGG 
TACACACATC 
GATTTTTTTG 
AATTGAGTCC 
GTTGACTCAG 
CTCCCAGTGG 
ATCAGQAATG 
AAOTAAACTT 
ATCCOSCATG 
CACGTTACAG 
ATGCACATTC 
GGTAAAGGCT 
AGCTTCCTGT 
TAAAAOATGA 
TATACTTCTC 
TCTAACGGGA 
TCATATTT6T 
AA6CATT6TT 
ATTAGCTACC 



TCAATACCAA 
CCAATTTOCA 
ATCAACAATC 
ACTCTQAGGC 
CATCCTAAAC 
TAACTTGTTC 
ATTTTCTAGC 
AGAGTOGCTG 
CAGATCCCCT 
ATGCTTATTA 
GGCTTTGCTC 
AGCAGGAGCG 
AACCTATGTT 
CTGGGCCAGT 
TCCAGTTCAG 
TCTAATAAAT 
TTTCCCTTCT 
ACAATTCAQT 
ATGGGTGGGA 
GATTTTTTAA 
TTTATATAAA 
CACC 



GATGAGTTTT 
ATAAGTOCTA 
TAA ACAAATC 
TTTATTOCOC 
TATACTAAAO 
TAGAAQTCTT 
T6CCTTTATT 
TCATTCTGTC 
GTATCATTCC 
GAAAACAAAC 
AGATCCCTGA 
TGCTQQCCCT 
CAGGTTOCGG 
CTCAGCCAAC 
CXrrCAGTTAT 
GCAOGGCTTT 
GTAACTCCCT 
TTCTAT6AGT 
GTGCTGOTGA 
AAAAAGTTTA 
AACAATGATA 



TAAATAATGA 
TCATTQAAAA 
CCi'OOaTTCT 
CACTATBCAT 
C CV ri'TT'C C C 
AATATGGGCT 
CACATAGTGA 
ATTGCTGCTA 
AAGAGQAGCA 
TGCTTGACCC 
TCCTTCCAGC 
GAGTACTGAA 
GTGAGCTGCX: 
AGTACCAAAA 
TTTAGACAAT 
ACCTTTCCTG 
AGAGCCACAG 
TTGATCAOCT 
AAAGAGATGA 
AAAGGAAATA 
AAGATGTQAA 



ATATTATTTA 
TTCAAATATA 
AA6ATACAAT 
ATCTTATCAT 
ATGCATGGAT 
GTTGCCATGA 
TGGGGTACTA 
CTCTAACACT 
TTCATCCCTT 
AGGAACAAGT 
TGGTCTGCTC 
CTTTCTGAGT 
CTCTCCAAAT 
GTGATTTTTG 
CTOGCCATCT 
TCAGAAATAA 
GTTCTCATTC 
GATTTTTTTA 
AATGTGGTTG 
TCTGTTCTGA 
CTGTGAAATA 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2620 
2880 
2940 
3000 
3060 
3120 
3180 
3240 



Seq ID NO: 54 Protein eectuence 
Protein Accession #: NP_0S5026 



MNYSLHLAFV 
VQIALTLDIA 
yXVESXKSFIi 
WGYDGNDVBF 
NVLYPILETY 
CFIKAIDVYL 
SFKRKISPAS 
LFPL2FMLAN 



11 

I 

CLSLFTERMC 
8ZSSISESNM 
HBVTV(mRI*Z 
TWLRGNDSVR 
VPSTFLWLS 
GICPSFVPGA 
lEISSDNVDY 
VPYHAVYMYF 



21 
I 

IQGSQFNVEV 
DYTATIYLRQ 
RZiPSNGTVLY 
GXfHLRLAQY 
WVSFWISLDS 
LLEYAVAHYS 
SDLTMKTSDK 



31 
I 

GRSDKLSLPG 
RHMDQRLVFE 
ALRITTTVAC 
TIERYFTLVT 
VPARTCIOVT 
SLQQMAAKDR 
FKFVFREKMG 



41 

I 

FENLTAGYNK 
GNKSPTLDAR 
NMDLSKYPMD 
RSQQETGHYT 
TVI»SMTTW4I 
GTTKEVEEVS 
RIVSYFTIQN 



51 
I 

FLRPNFGGBP 
LVEFLWVPDT 
TQTCKIiQLBS 
RLVLQPELRR 
GSRTSI4PNTN 
ITNIINSSIS 
PSNVDHYSKL 



Seq ID NO: 55 DKA Sequence 

Nucleic Acid Accession «: XM_084007 

Coding seqpience: 13 8.. 2405 

1 11 21 31 41 51 

11)111 
CTCGTGCCGA ATTCX3GCACG AGACCX30GTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 
CCAOTGGGCC CX3TGTGGAAC CAAAGCTOCXS 0QCGTGGCCX3 GGGOGTGGGA CAACGAGGCC 
G0GGAGAC6A AGGOGCAATQ QOQAGGAAGT TATCIGTAAT CTTQATCCTO ACCTTTGCCC 
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCOCOCAQ ACCACTGAOA 
AAATTAGTCC GAATTGGQAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACAOGGCAAT 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTOTCA GTTGAAGGGT 
TCAQAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 
ACCAOGAGCA TCACTCAGAC CACX3AGCATC ACTGAGACCA TGAGGGTCS^ TCAQACCATG 
A6CATCACTC AGACCAOGAG CATCACTCTG AOCATGATCA TCACTCCCAC CATAATCATG 
CTGCTTCTGG TAAAAATAAG C6AAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 
GTAAAGATCC TAGAAACAGC CAGGGQAAAG GAGCTCACOG ACCAGAACAT GCCAGTGGTA 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 
TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTOC AAGAOCTGGA AAACTCTTCC 
CCAAAGATOT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCG6 GTGAGCCGGC 
T6GCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGGCCOG AAAA6GCTTT ATGTATTCCA 
GAAACACAAA TGAAAATCCT CAGQAGTGTT TC31ATGCATC AAAGCTACTG ACATCTCATG 
GCAT6GGCAT CCAGGTTCCO CTGAATGCAA CAGA6TTCAA CTATCTCtGT GCA6CCATCA 
TCAACCAAAT TGATGCTAGA TCTTQTCTGA TTCATAGAAG TOAAAAOAAa QCTOAAATCC 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATA6CC ATTTCCATCA 
TCAGTTTCCT GTCTCTGCTG GQGGTTATCT TAOTQCCTCT CAT6AAT0GG GTOTTTTTCA 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CXX5TTGGGAC TTTGAGTGGT GATGCTTTTT 
TACACCTTCT TCCACATTCT CATGCAAG7C ACCACCATAG TCATAGGCAT OAAGAAOCAO 
CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCA AAAC ATAGAAGAAA 
GTGCCTATTT TGATTCCAOO TGGAAGGGTC TAACAGGTCT AGGAOGGCTG TATTTCATGT 
TTCTTGTTQA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 
AGAAGAAACC TGAAAATGAT GATGATGTGG A6ATTAAGAA GCAGTTGTCC AAGTATGAAT 
CTCAACTTTC AACAAATGAG GA6AAAGTA0 ATACAGATGA TOQAACTGAA GGCTATTTAC 
GAGCAGACTC ACAAGAGCCC TC0CACTTT8 ATTCTCAGCA GGCTGCAGTC TTGGAA6AAG 
AAGAGGTCAT GATA6CTCAT GCTCATCCAC AGOAASTCTA CAAT6AATAT GTACCCAGA8 
GQTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT OSGCCASTCA GACGATCTCA 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 
CTC ACAO TCA CAGCXrAGCGC TACTCTOGGG AaGAGCTOAA AGATQCGGGC GT06CCACTT 
TGGGCTOaAT G6TQATAATG GOTGATGGCC T6CACAATTT CAGCGATGGC CTAGCAATTG 
GTGCTGCTTT TACTQAAGGC TTATCAAGTG GTTTAAGTAC TTCTOTTQCT 0TGTTCT6TC 
ATGAGTTGCC TCATGAATTA GOTGACTTTG CTGTTCTACT AAAGGCTOGC ATGAC06TTA 
AQCAOGCTGT CCTTTATAAT GCATTGTCAG CCRTQCTOGC GTATCTTGQA ATGGCAACAQ 
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTO 6ATATTTGCA CTTACTQCTG 
QCTTATTCAT GTATCTTGCT CTGGTTQATA TGOTACCXGA AAT6CT6CAC AAXGATGCTA 
GTGACCATOG ATGTAOCOOC TGGGGQTATT TCTTTTTACA GAATGCTOGG ATQCTTTTGO 
GTTTT GG AAT TATQT TACTT ATTTCCATAT TTGAACATAA AATOGTGTTT 08TATAAATT 
TCTAGTTAAa GTTTAAATGC TAGA07AGCT XAAAAAGTTG TCATAGTTTC AOTAOQTCAT 



60 
120 
160 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
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AGGGAGftTGA 
TTGTATTQAA 
TATTCTATCT 
TAAACAAGAO 
TTTTCAAGAA 
TG7TTAQGAA 
A6CAAAGAAA 
AAAAATOVOV 
CAGAATTAGT 
CTAGTGAGCA 
AAATATATTT 

TATTGCCAA6 
CAAAATTATC 
TCATTTGATT 
GAQCAATTGT 
GATGTTTCTT 



GTTTGTATGC 
TATTGCTGTC 
TG6AGATAAA 
ATTTGGCATG 
CTAACACAGT 
TAAOAATGTG 
TAAAGGAOAA 
AAATTTGTTG 
ATAGAGTACA 
CTCTCATATA 
AAT5AATTCA 
TTATATACCA 
TTATATATCA 
AQAGTAGTAA 
CGATTCAGAA 
CTTTATATAC 
TTTTACACAA 



tGTACTATGC 
TC3TTACAAA6 
ATCTGTATGT 
ACATGTTCTG 
TATTCCTATA 
CATGAAOCCT 
AAGAGAAGAA 
TAAATTAC5AG 
TTCATTAAAC 
CTAATTAGTG 
AGCAATATAC 
GATGA8TACA 
CCAAAAGCTO 
AACTTTQATA 
AGTACTTTGA 
GGTACTOTAG 
TAAATTCCTT 



AGCGTTTAAA 
TCAOTTAAAQ 
GCAATTCACC 
TATGTTTCAG 
CTGGATTTTA 
AAAATACXAA 
TCTGAGAATT 
GGGAGAAATT 
ATTTTTGTCA 
TACATTTAAC 
ACTIGAGCAA 
GTGAGTAGTT 
TATGACTGGA 
TATATGAGGA 
TATCTCTCAG 
CCATACTAGG 
ATATCA6CTT 



OTTAOTQOOT 
OTA06TTTTA 
GGTATTAOCA 
GGAAAAATOT 
GGTCTCTGAA 
GAAAGCTTAT 
GGGQAG6CAT 
TAQAATTAAG 
QQATTATTTC 
TTTGTATAAT 
GAAATT6QAA 
TATGTATCAC 
TGTTCTGOrr 
TATTAAAACT 
TGCTTCAGTG 
CCIGTCXaTG 
0 



TTT8TGATTT 
ATATTTAAGT 
OTTTATTATG 
CTTTAATGCT 
GAACTGCTGG 
ACTQAATTTA 
AGATTCTTAT 
TATAAAAAGG 
CCX3TAAAAAC 
ACAGAAATCT 
TTTCAAAAT6 
CAGACTGGGT 
ACCTGGTTTA 
ACACTAAGTA 
CTATCATTOT 
6CATTCTCTA 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



Seq ID NO: 56 Protein sequence 
Protein Accession 8: XP 084007 



MARKLSVILX 
FYRYGENNSIt 



6VSA8EVTST 
NESVSEPRKG 
RSCLIHTSEK 
LVAIiAVGTLS 
TWKGLTALGG 
EEKVXXTDDRT 
HSHFHDTLGQ 
HGDGLHNFSD 
NALSAMLAYL 
RWGyPFLQNA 



11 

1 

LTFALSVTNP 
SVEGPRKLLQ 
HHNHAASGXN 
VYNTVSEGTH 
PMY8RNTNEN 
KAEIPPKTYS 
GDAFLHLLPH 
LYPMPLVEHV 
EGYLRADSQG 
SDDIilHHHHD 
GLAIGAAPTE 
GKATQZPXGH 
GMLLGFGIMIi 



21 
I 

LHELKAAAFP 
HIGIDKZKRI 
KRKALCPDHD 
FLBTIETPRF 
PQBCFNASKL 
LQIAWVGGFI 
SHASHHKSHS 
LTLIKQFKDK 
PSHFOSQQPA 
YHHILHHHHH 
GLSSGLSTSV 
YAENVSMWIF 
LISIFEHKZV 



31 



QTTEKISPNW 
HIKHDHDHHS 



41 

I 

ESGIMVDLAI 



GKLFPKDVSS 
LTSHGMGIQV 
AISIISFItSL 



VLEBEEVMIA 
QNHHPHSHSQ 
AVFCHEXiFHC 
ALTAGIiFMYV 
FRINF 



STPPSVTSKS 
PLKATEFNYL 
USVILVPLMN 
GPIiFSHLSSQ 
DDDVBXXXQIi 
BAHPOEVYNB 
RY8REELKDA 
IjGDFAVLLKA 
ALVDMVPBML 



51 
I 

STRQYHLQQL 
HSDHEHHSDK 
HASGRRNVKD 
RVSRIAORKT 
CPAIINQIDA 
RVFPKFLIiSF 
NIEESAYFDS 



YVFRGOQIKC 
GVATLAWMVZ 
QMTVKQAVLY 



Seq ID NOi 57 DMA sequence 

Nucleic Acid Accession tf: NM_015419.1 

Coding sequence: 1..8487 " 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



1 11 21 31 41 51 

I I i i I I 

ATGCCCAAGC GCX3CGCACTG GGGGGCXXTC TCCGTOGTGC TGATCCTGCT TTGGGGCCAT 
CCX3CX3AGTGG CGCTGGCCTG CCCGCATCCT TQTGCCTGCT ACX5TCCCCAG CXSAGGTCXZAC 
TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTOOCATTG CTAGACACGT GGAAA9AATC 
AATTTGGGGT TTAATAOCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTQACCAAG 
TTQGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCXTCCXSATGQ AGCTTTAAQA 
GACCTCAQCT CTCTTCAGGT TTTCAAGTTC AQCTACAACA AGCTGAGAGT GATCACAGGA 
CAGACOCTCC AGGGTCTCTC TAACTTAATQ AGGCTGCACA TTGACCACAA CAAGATOSAG 
TTTATCXaCC CTCAAGCTTT CAAOQQCTTA ACGTCTCTGA G6CTACTCCA TTTGQAAGGA 
AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCAOGT TCACATTTTT GQATTATTTC 
AGACTCTCCA CXATAAGGCA CCTCTACTTA GCAGAGAACA TGOTTAOAAC TCTTCCTGCC 
AGCATGCTTC GGAACATGCC GCTTCT GG AG AATCTTTACT TGCAGGGAAA TC06TGGACC 
TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCXAGAGG AATTCTGAAG 
TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTCCTT CAQTCCAAAG 
AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTGTGAA GCCTTCAATA 
GAGTCCXXrrC TGAOACAGAA CAOGAGCAGG AGTATTGAGQ AGGAOCAAGA ACAGGAAGA6 
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CGGAGTGGAG CATCTCTTTG 
AATATGACCG ACGAGCACX3G GAAGATGGTG AACTTG6TCT GTGACATCAA GAAACCAATG 
GATGTGTACA AGATTCACTT GAACCAAAC6 GATCCTOCAG ATATTGACAT AAATGCAACA 
GTTGCCTTGG ACTTTGAGTG TGCAATGACC CGAGAAAACT ATQAAAAGCT ATGGAAATTG 
ATAQCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAO AGCTCATGCT CAGCAAAGAC 
CCCA6A6TCA 6CTACCAGTA CAGGCAGGAT OCTQATGAGG AAGCTCTTTA CTACACA6GT 
GTGAGAGCCC AGATTCTTGC AGAACCAQAA TGGGTCATGC AGCCATCCAT AGATATCCAO 
CTGAACOGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 
CAAACAATAT CCACCAAAGA TACAAGOCAG GCTOGGGGCA GAAGCTGGGT AATGATTGAG 
OCTAOTGGAG CT6IGCAAAG AGATCAGACT 6TCCTGGAAG 66GGTCCAT0 CCAGTTGAGC 
TGCAAGGTOA AA6CTTCTGA QAOTCCATCT ATCTTCTGGG TGCTTCCAQA TGGCTCCATC 
CPGAAAGOGC CCATG6ATGA CCCAGACA6C AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 
AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC TC3\AGTGAGG 
GATGAAATGG A0C6CATGGT ATATAGGGTA CTT6TGCAGT CTCCCTCCAC TGAGCCAGCC 
GAGAAAGACA CAGTGACAAT TGQCAAGAAC OCAGGGGAGT GG6TGACATT GCCTTGCAAT 
GCITTAGCAA TAOCGQAAGC CCACCTTAGC TOGATTCTTC CAAACAGAAG GATAATTAAT 
GATTTGGCTA ACACATCACA T6TATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 
GTCCAA6TCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGQG6CAGAC 
CATTTTACX3G TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTT6CCATC CAAAAGAGGC 
A6AGQGCCAG OTQCAAAGGC U'Cl'TTOCAGA GTCAGAGAAO ACATOQTGGA GGAT6AAGGG 
G6CT060GCA TGGGAGATGA AGAGAACACT TCAAOSA6AC TTCTGCATCC AAAGGACCAA 
GAGGTGTTCC TCAAAACAAA GGATGAT6CC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 
AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 
GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCX3 
GAOGGCTOaG CTQATATTTT AOCCAAAOTC OGTGGQAAAA ATCTCOCTAA GGGCACAOAA 
GTACCCCOGT TOATTAAAAC GACAAOTOCT OCATCCTTGA GOCTAGAAGT CACACCACCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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TTTCCTGCTO m 'Cl'CCCCX: CTCAGCATCT OCTOTGCAGJ^ GAOTAACCAG TGCTGAAGAA 2520 
TCCrCAGCAO ATGTACCTCT ACTTGGTGAA QAAGA6CAGG TTTTGGGTAC avTITCCrCA 2S80 
GCCAGCATGG GOCTAGAACA CAACCACAAT OGAGITATTC TTGTTQAACC TGAAGTAACA 2640 
AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTQAGA AGACTQAGGA GATAACTTCC 2700 
J ACTQAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTOAGCC TTATGAACCA 2760 
TCTCCTACTC TCCACACATT AGACACAOTC TATQAAAA6C CCACCCATGA AGAGAOSGCA 2820 
ACAGAGGGTT GGTCT GC AGC AGATGTTCQA TGGTCACCAO A6CCCACATC CAGTGAGTAT 2880 
GAGCCTCCAT TQQATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTOACCCA 2940 
GATTTGQAQA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000 
10 CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCA6TA CATCACAGTT ATTTGAGGAT 3060 
TCTACTATAO GGOAACCAGO TOTCCCAGGC CAATCAGATC TACAAOGACT GACAGACAAC 3120 
ATCCACCTTO TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAA6GGTAT6 3180 
AAAOAGATGT CTGAQACACT ACAGGGAGGA AATATGCTAO AGGGAGACXX: C3VCACACTCC 3240 
AGAAGTTCTQ AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCtGA CTCCACACTG 3300 
15 GGTATAATGA GCA8TATGTC TCCASTTAAQ AA6CCT6C3GG AAACX:ACAGT TGGTACCCTC 3360 
CTAGACAAAO ACACCACAAC AOTAACAACA ACACCAAOGC AAAAAGTTGC TC06TCATCC 3420 
ACGATGAGCA CTCACCCTTC TOCAAGGAGA CCCAACXX3GA GAAG6A6ATT AOGCCCCAAC 3480 
AAATTC06CC ACCGGCACAA GCAAACCCCA CCCACAACTT TT6CCCCATC AGAGACTTTT 3540 
TCTACrCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 
20 GTTGCTACAG CTTGGGTGOA TAACACAGTT AATACCCCCA AACAQTTOGA AATGGAGAAQ 3660 
AATQCAGAAC CCACATCCAA GGGAACACCA GGGAGAAAAC A0G6GAA6A6 GCCAAACAAA 3720 
CATOQATATA CCCCTTCTAC AGTGAGCTCA AGA6CGTC0G GATCCAA6CC CA6CCCTTCT 3780 
CCA6AAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAQ ATTAOVTGAC AACCACCAGA 3900 
25 AAAATATATT CATCTTACCC TAAAGTCCAA GAQACACTTC CAQTCACATA TAAAOXACA 3960 
TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTQACAAACA TAAAAGTGAC 4020 
ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTGTOQCTC CTTGQTCTCC 4060 
ACTATGGGAO AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4X40 
AATCCCTCAA GOACXWCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 
30 G6GGAAAATC TTACAGACXX: TCCCCTTCTT AAAGAGCTT6 AGGATGTGGA TTTCACTTCC 4260 
OAQ ri TTTQT CCTCrTTQAC AGTCTCCACA CX:aTTTCACC A0GAAGAA6C TGGTTCTTCC 4320 
ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4360 
GATCAAGATC ATCTTQAAAC CACT6TGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 
CACACCCCTA CTQCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 
35 ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 
QCATCTAOAO ATTCCAAGGA AAAT6TTTTC TT6AATTATG T6QGQAATCC AQAAACAGAA 4620 
GCAACGCCAG TCAACAATGA A0GAACACA6 CATAT6TCA6 G6CCAAAT6A ATTATCAACA 4680 
CCCTCTTCCO ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 
TTTGGTAQTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 
40 GCTTCTCATC AACTAACCAO AGTCCCTGCX: AAACCXJVTCC TACCAACA6C AACAGTGAGG 4860 
CTACGTGAAA TQTCCACACA AAOOGCTTCC A6ATACTTTQ TAACTTCCCA GTCACCTCGT 4920 
CACTGGAGCA ACAAACCX3GA AATAACTACA TATCCTTCTG GGGCTTT6CC AGA6AACAAA 4980 
CAOTTTACAA CTCCAAQATT ATCAAOTACA ACAATTCCTC TCCCATTQCA CATGTCCAAA 5040 
CCCAGCATTC CTAGTAAGTT TACTXSACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5X00 
45 GTGTTTGGAA ATAACAACAT COCTGAGGCA AGAAAiCCCAO TTGGAAAGCC TCCCAGTCCA 5160 
AGAATTCCTC ATTATTOCAA TQGAAGACTC CCTTTCTTTA CCAACAA6AC TC T llVrm ' 5220 
CX2ACAGTTGG GAGTCAOCOO GAGACCCCAG ATAGCCACTT CTGCTQCCCC AGTAATQAGA 5280 
GAGAGAAAAO TTATTCXMO TTCCTAC3VAC AGOATACATT CCCATAGCAC CTTCCATCTG 5340 
GACTTTGGCC CTCOQGCACC TCCGTTGTTG CACACTCCGC AGACCAOGGQ ATCACCCTCA 5400 
50 ACTAA CTTAC AQA ATATC CC TATGOTCTCT TCCACCCAGA GTT CTATCTC CTTTATAACA 5460 
TCTTCTGTCC AGT0CTGA6Q AAGCTTCCAC CAGAGCA6CT CAAAGTTCTT T6CAG6AGGA 5520 
OCTCCTGCAT CCAAATTCT6 GTCTCTT6GG 6AAAAG0CCC AAATCGTCAC CAAGTCCCCA 5580 
CA6ACTGTGT CCGTCACOGC TGAGACAGAC ACTGTGTTCC CCT6TGAG6C AACAGGAAAA 5640 
CCAAAGCCTT TOGTTACTTG GACAAAGGTT TCCACAGGAO CFCTTATGAC TCGGAATACC 5700 
55 AGGATACAAC GGTTTGAOGT TCTCAAGAAC GGTACCTTAG TGATAOGGAA 6GTTCAA6TA 5760 
GAAGATGQAG GCCA6TATAT GTGCACOGCC A6CAACCT6C A0G6CCTGGA CAOQATGGTG 5820 
GTCTTGCTTT CGGTCAGOGT GCAGCAACCT CAAATCCTAO CCTCCCACTA CCAGGACX3TC 5880 
ACTGTCTACC TGGGAGACAC CATTGCAATQ GAGT G TCTGG CCAAAGGGAC CCCAGCCCCC 5940 
CAAATTTCCT QOATCTTCCC TQACAQGAGQ QTOTGOCAAA CTQTGTCCCC OGTOQAQAGC 6000 
60 CGCATCAOCC TGCACGAAAA COGGACCCTT TOCATCAAGG AGGCXSTCCTT CTCAGACAGA 6060 
G60GTCTATA AGTOCGTGGC CAGCAATGCA OGOGGGGGGG ACAGOCIOGC GATGCGCCTG 6120 
CAG6TGG00G CACTGCCCCC G6TTATCCAC CA6GA6AAGC TG6A6AACAT CT06CT6CCC 6180 
CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTQ CX3CCCCTGCC CAGOSTGCGC 6240 
TGGGTGCTG6 6GGA0GGTAC CCAGATC06C CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 
55 GTTTTCCCCA AGQGQACGCr CTACATCGGC AACCTCGC6C CXAACGACAO G6G60SCTAT 6360 
' GAGTGOSTGG CCOCCMXXT GOTAGQCTCC G0G0QCA6GA 0QG1X3CAGCT GAACXfTOCAO 6420 
06TGCAGCAG CCAAG6G6GG CATCACGGGC ACCTCCCCX3C GGA6GA0GGA CXSTCAGGTAC 6480 
GGAGGAACCC TCAAGCTGOA CTQCAGCX5CC TCGGGGGACC CCTGGCCOOQ CATCCTCTOQ 6540 
A66CTGC0GT CCAAGAGGAT GATOGACGOG CTCTTCA6TT TTGATAGCAG AATCAAGGTG 6600 
70 TTTGCCAATG GGACCCTQGT GGTGAAATCA GT6A0GGACA AAGATGCOGG AGATTACCTG 6660 
TGCOTAGCTC GAAATAAGGT TaOTGATOAC TAOOTGGTOC TCAAAOTGGA TGIGGTGATG 6720 
AAACCGGCCA AQATTGAACA CAAGGAGGAG AACGACCACA AAGTCnCTA 066GGGTGAC 6780 
CTaAAAQTGQ ACTGTGTOGC CACCGGQCTT CCCAATCCOO AGATCTCCTG GAGCCTCCCA 6840 
QAOGGGAGTC TGGTQAACTC CTTCATGCAG TCGGATGACA GCX5GTGGA0G CACCAAGCGC 6900 
75 TATGTOQTCT TCAACAATGG GACACTCTAC TTTAACQAAG TGGGGATGAG GGA0GAAG6A 6960 
GACTACACXTT GCTTTQCTGA AAATCAGGTC GGGAAOQACG AGATGAOAGT CAGAOTCAAO 7020 
GTGGTGACAG CGCCCGCCAC CATCOGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 
GGAGAOGTGQ TCACTGTAGC CTOTGAGQCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 
TTGTCXCCAA CCAACAAGGT GATCCXX31CC TOCTCTGAGA AGTATCAGAT ATACCAAQAT 7200 
oO QGCACTCTCC TTATTCAOAA AGCCCAGOGT TCTGACAGOG GCAACXACAC CTOCCTGGTC 7260 
AGGAACAGOG OQGGAGAGGA TAGGAA6AG0 GTGTGGATTC ACQTCAAOQT CCAGCCACCC 7320 
AAOATCAAOO GTAACCCCAA COCCATCAGC ACO0T6O6GG A6ATAGCAGC OOOOGOCAGT 7380 
OQGAAACTQA TTGACTGCAA AOCTGAAOGC ATOCOCACCC 0GAGGGT6TT ATOGGCTTTT 7440 
CCOQAGGaTa TQGTTCT6GC AGCTCCATAC TATQG^CC GGATCACXOT CCATGGGAAC 7500 
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GGTTOCCTGG ACKTCA0GA6 TTTGAOQAAG AGOSACTCCG T0CA6CTGGT ATGCATG6CA 7S60 

0GCAACX3AG0 GAGGGQAOGC 6AGGTTGATC GTOCAGCTCA CTGTOCTGGA GCCCATGGAO 7620 

AAACCCATCT TCCACGACCC GATCAGOGAG AAGATCACGQ CCATOGOGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCOGC OOGGACCCCG ACACCCAGCC TOGTGTGGGT CCTTCCCAAT 7740 

GGCAC0GA7C T6CAGAGTGG ACAOCA6CTG CAGG6CTTCT ACCACAAGGC TGACGGCATG 7800 

CTACACATTA GGG6TCTCTC CTGGGTGGAC GCTOGGGCCT AGOOCTGCXn' 66CC06CAAT 7B60 

GCCGCT6G0C ACAOGGAGAG GCTGGTCTOC CTGAAGGTGG GACTGAAQCC AGAAGCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

CXrrCCCGGGG CTGGGCAGGG ACGTTTCTCC TOQAOOCTCC CCAATGGCAT 6CATCTGGAG 8040 

GGCTCTCAAA CCCTGGGACG 06TTTCTCTT CTGGACAATG GCACCCTCAC GGTTGGTGAG 8100 

GCCTOGGTGT TTOACAGGGG TACCTATGTA 1GCAGGAT6G A6AGGGAGTA OGGOOCTTaS 8160 

6TCACCAGCA TCCCC3GTGAT TGTGATOGCC TATCCTCCCC GGATCACCAO OGAGGCCACC 8220 

COGGTCATCT ACACCOQGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

CCCAAAOCTO ACATCAOQTO GGAOTTACOQ GATAAGTGGC ATCTQAAGGC AGGGGTTCAG 8340 

GCTOGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCX: 8400 

ACACAOAOAG ATGCCGGCTT GTACAAGTOC ATOGC AAAAA ACATTCTOGG GAOIGACTGC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAGGAACTO 8520 

ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8580 

GTCACAGTGC ATGGTQGCCT CTGGTQQOTT TCAAGTTGAG OTTQATCTTQ ATCTACAATT 8640 

GTTGGGAAAA GGAAGCAATG CAGACAOGAG AAGGAGOGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTQTOTT TACATCATOC CAGaGGCTTC ATTCAGGGTO TCTOTGCTCT QACTGCAATT 8760 

TTTCTT C TTT T6CAAATGCC ACTCGACT6C CTTCATAAGC GTGCATAG6A TATCTGAG6A 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGAOGCA 8880 

TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAA6TCA 8940 

TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT 6GCXAATAA6 GATTTAGAAC 9000 

CAGAGTGACT GATATATATA TATATATTTT AATTCAOAOT TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCCTTTCAA ATCAOACGAT GAGACTAGAA GGAGAAATAC TW C VG^t C l ' V 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AQACTTGGAT ACATTACAGC AGACATGGAA 9240 

ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CCTTCTCCAG 6AACCCTCCA GTGGOGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAOGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGOTCAAT 9600 
ATACTGTACA TTTGATAATA AAATAATATT CTOCCAAAAA AAAAA 

6eq ID NOt 58 Protein sequence 
Protein Accession tfi NPJ}56234.1 

1 11 21 31 41 51 

I I I 1 I I 

MPKRAHWGAL SWLILLWGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 60 

NLGFNSIQAL SBTSFAGLTK LELLMIHC^ IPSIPOGAIiR DLSSIiQVFKF SYMKLRVITG 120 

QTLQGliSNLM RLKIDBNKIE FIHPQAFMGL TSLRVLHLBG NIiLKQLKPST FSTFTFLDYP 180 

RLSTIRHLYL AENMVRTLPA SMLRMMPLLB KLYLQGNPWT OXXMRHFLB WDAKSRGILK 240 

CKKDKAYSGQ QLCAMCFSPK RLYKHEXKKL KDMTCbKPSI BSPLRQMRSR SIEEEQEQEB 300 

DGGSQLILEK FQLPQWSISL NMTDEHGNMV NliVCDIKKPM 0VYK1HLNQT DPPDIDINAT 360 

VALDFECPMT RBNYBKLWKIi lAYYSBVPVK UIRBLMLSKD PRVSYQYRQD ADEEALYYTG 420 

VRAQILAEPE NVMQPSIDIQ UOtRQSTAXK VLLSYYTOYS QTISTKDTRQ ARGRSWVMIE 480 

PSGAVQRDQT VLE6GP0QLS CNVKASBSPS IFHVIiPDGSI LKAPHDDPDS XP8ILSSGHL 540 

GIiYQCIAQVR DEMDRMWRV LVQSPSTQPA BKDTVTIGKN PGESVTLPO* 600 

WILPNRRIIN DLANTSHVYM LPN6TLSIPK VCJVSDSGYYR CVAVNQQGAD 660 

KGSGLPSKRG RRPOAKALSR VREDIVBDEG GSGKGDEENT SRRLLHPKDQ 720 

INGDXKAXKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQnn> 780 

RGKNZiPXGTB VPPLZKTTSP P8X.8LBVTPP FPAVSPPSAS PVQTVTSAEB 840 

EEHVIiGriSS ASM6LBHNHN GVZLVEPBVT STPLEEWDD IjGEKTBBITS 900 

PTLZSEPYEP SPTLHTLDTV YEKPTHBETA TEGWSAADVG 8SPEPTSSEY 960 

EfiEPMQYFDP DLBTKSQPDE DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 1020 

QSHIiQGIiTDN IKLVXSSLST QDTLLIXK6M KEMSQTLQGG NMLB6DPTKS 1080 

KSnUDSTI) 6IMSSMSPVX KPAETTVGTL LDKDTTTVTT TPRQKVAP88 1140 

PNGRRRZiRFN XFRRRHXQTP PTTFAPSBTF STQPTQAPDI KZSSQVESSL 1200 

MTPKQLEKEK NAEPTSKGTP RRKHGKRPNX HRYTPSTVSS RASGSKP8PS 1260 

PSSETILLPR TVSIiKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 1320 

ATNVDKHKSD ILVTGESITK AIPTSRSLVS TMQEFKEBSS FVGFP6TPTH 1380 

LQTDIFVTTS GENLTDPPLL KELEDVDFTS EFLSSLTVST PFBQEEAG88 1440 

ASSQAETTTL DQOHLETTVA lUiSETRPON HTFTAARHXB PASSSPSTZL 1500 

PALP8PRISQ ASRDSKENVF LNYVGITPETE ATPVNMBC3TQ HMSGPNELST 1560 

STKLELEKQV FGSRSZjPRGP DSQRQDGRVH ASHQLTRVPA KPZLPTATVR 1620 

RYFVTSQSPR HWTNKPEZTT YPSGALPENK QFTTPRLSST TZPLPLHMSK 1680 

RTDQFNGYSK VPGNMNIPEA RNPVGKPPSP RXPHYSMGRL PFFTNRTL8F 1740 

ZPTSPAFVMR BRKVZPGSYH RIRSHSTFBL OFGPPAPPLL HTPQTTGSPS 1800 

8TQSSISPIT SSVQSSGSPH QSS8KPFAGG PPASKFMSLG EKPQILTKSP 1860 

TVPPCEATaK PKPFVTWTKV STQALMTPNT RIQRFEVLKN GTLVIRKVQV 1920 

SNIiHGLDRMV VLLSVTVQQP QILASHYQDV TVYLGDTIAM ECLAKGTPAP 1980 

VWQTVSPVES RZTLHEKRTL SZKEASF8DR GVYKCVASNA AGADSLAZRX* 2040 

QEKLENZSLP PQLSIHIHCT AKAAPLPSVR NVLGDGTQZR PSQPLHGNIiF 2100 

NIAPiCDSaRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 2160 

SGDPWPRIIiW KLFSKRNIDA LPSFDSRIKV FANG7LWK9 VTDXDAGDYL 2220 

YWXiKVDWM KPAKIEBXBB NDHKVPYGGD UCVDCVATGL PKPHISWSLP 2280 

8D0SGGRTKR YWFMNOTLY RIBVGMRBBQ DYTCFABNGV GKDEMRVRVK 2340 

KTYliAVOVPY ODfWTVACBA KGEFMPKVTW LSPnnCVZPT SSEKYQIYQD 2400 

SOSGNYTCLV RNSAGEDRXT VHZHVNVQPP KXMGHFIIPZT TVREZAA66S 2460 

ZPTPRVLKAF PBOWLPAFY YGimiTVKGM GSLDXRSLRK SDSVQLVCMA 2520 



AltAZPEAHLS 
HFTVQITVTK 
EVFLKTKDDA 
ERWADILAKV 
SSADVPLUSE 
TE6DLKGTAA 
BPPIiDAVSLA 
STZGEPGVPG 



TMSTKPSRRR 
VPTAWVDNTV 
PENKHRNIVT 
SDGKEZKDDV 
HPSRTAQFGR 
TTLSSZKVEV 
MSLGQTTTTK 
PSSDRDAFNti 
LPBMSTQSAS 
PSZPSKFTDR 
PQLGVTRRPQ 
TNLQHIPMVS 
OTVSVTAETD 
QDRGQYMCTA 
QZSWZFPDRR 
RVAALPPVIH 
VPPNGTLYIR 
GGTLKLDCSA 
CVARMKVGDD 
DGSIiVNSFKQ 
WTAPATIRN 
GTLLZQKAQR 
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RNBGGBARLI VQLTVLBPMB KPIFBDPI8B 
OTDLQSGQQL QRFYHKADGM LHISGLSSVD 
KQYHNLVSII NGETLKLPCT PPGAGQGRFS 
ASVPDRGTYV CRMBTBYGPS VTSIPVIVIA 
PKADITWELP DKSHLKAGVQ ARIfYGNRPLH 
KTTYIHVP 



KZTAHAGRTX SUIC8AAGTP TPSLVWVLFN 2580 

AGAYRCVARN AAGBTBRLVS LBCVGLKPEAN 2640 

WTLPNOMHLB GPQTLGRVSL USNGTIiTVRB 2700 

YPPRITSKPT PVIYTRPGNT VKLNCMAMGI 2760 

PQGSXiTIQHA TQRSAGPYXC HAKNZLGSDS 2820 



Seq ID NO: 59 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..5001 



5 

10 

1 11 21 31 41 51 

I I I I I i 

ATGCCAGGCA CAAAACTAAC CCGAACAGGC GCCCCAGCAG ACTAOIGAGT GATATTGAAG 60 
15 ACCTCTCAA6 AGGA06AATT GGATGTACCT GA08ACATCA GOGTCOGGGT TATGTCATCT 120 
CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA A6TTGTT6CA 180 
TCAAGACA6T ACACOGTGOS CTATGGAGAO AAGGGGaAAT TG6CGAG3T0 GGATTATAAG 240 
CAGATCGCTA ACAGQCGTOT QCTGATTOAG AACCTGATTC CAGACACTGT GTATGAATTT 300 
GCAGTCOGTA TTTCAGAGGG TGAAAGAGAT QGCAAATGGA GTACOTCAOT CTTCCAAAGA 360 
20 ACACCAGAAT CTGCCXXTTAC CACAGCTCCT GAAAACTTGA AOGTCTGGCC AGTCAATGGC 420 
AAACCTACAG TTGTCGCT6C ATCTTGGGAT 60QCTACCAG AGACT6AGGQ GAAAGTOAAA 460 
OTCTGTCTGC TGGACACAGG ACTGTTTTCA O'm'CCl'UCi' TCCAACXatC TGCCAAATCA 540 
TTTCAGAATA CATTCTTTCA TACGCCCCX30 CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 
CCTATCCTGQ AGACACTACT TCTGCCCTGQ TGGATGGTCT GCAQCCTGGG GAACGCTATC 660 
25 TTTTCAAAAT CCQGGCCACA AACAGGAOAG GCCTGGGACC TCACTCCAAA GCCTTCATTQ 720 
TCGCTATGGC AACAAGAATG CAGCTGTACC CAGAAGOATT TCASTTGTCT AGCTTACCTO 780 
ATCGATATGC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGG6AG TQTTTTTGGA 840 
CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGCGGCTT TTCCTTCATT 900 
ATGTGCTATG AAQACCCANN TOTTTCTTCT TTGACAGOCA ATTCTTTAAA ATCTGTTGCA 960 
30 GCCAGTAAGG CGGATGTTCA GCAGAACACG QAGGACAATG GGAAACCCGA AAAACCTGAG 1020 
CCTTCCTCAC CTTCTCCCAO AGCTCCAGCT TCCTCCCAAC ACGCCTCTGT GCCT6CTTCT 1080 
CCCCAAGGGA 6AAATGCCAA GGACCTTCTT CTTGACTTGA A6AACAAAAT ATTGGCTAAT 1140 
GGTGGG6CX3C CCCGAAAACC CCAGCTT06C GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 
TCGACAGAAA TCACTGQGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCX: 1260 
35 TCAGACACCC AAGACCAQAA ACGGACCCTO AGGCCGCCAA GTAQACAOGG CCACTCGGTG 1320 
OTTGCTCCCG GCAGGACTGC AGTGAGGGCC COGATGCCAG CX3CTGCCCCG AAGGGAAGGC 1380 
GTAGATAAGC CTGG C T' 1"1T C CCTGGCCACG CAGCCCCGCC CAGGGGCXJCC CCCCTCGGCT 1440 
TCGGCCTCTC CTGCXZCACCA CX3CGTCCACC CAGGGCACXT CTCATCX3TCC TTCCCTGCCT 1500 
QCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAOACGAAG ATGAGCX3CGC TGTGGGCTCC 1560 
40 CTCCACCCCA AGGGCGCCTT CGCCCAGCCC OGGCCAGCCC TGTCCCCCAO CCGCCAGTCC 1620 
COSTCCAGCG TTCTCCGCGA CAOAAGCTCT GTGCACCCCG GOGCAAAGCC AQCCTOGCCX3 1680 
GCX3CGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAOTGCOTC AOCCCCACCC 1740 
TCAAGACTTT CTCCACCCCA TQGGGGATCA TCTOGGCTGC TGCCCACCCA GCCACACCTG 1800 
AGCTCTCCAC TTTCCAAGGQ CQGQAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 
45 GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACOCAa 1920 
OTCTCTQAOO QAQCGGAOGC TTCTGATGGT GAAAGCCACG GTGAOGGCGA TAGGGAAGAC 1980 
GGCGGAAGGC AGGCGGAGGC CACGGCCCAG AGGCTGGGGG CCG6GCCTGC CTCTGGACAC 2040 
TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTOCRAG CAGGTTCAGC 2100 
ATTGGGCGGG GACCTCGGCT GCAQCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCOSA 2160 
50 6CCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 
GGAGACGAGG AG6ATGAGAA GCCGCTTCCT GCCACOGTTG TCAATGACCA CGTGCCTTCC 2280 
TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAOAGAGGG 2340 
GCCAGCCTGC ATOGGAAGOA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 
CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATQTTCA ACAQAGCACA 2460 
55 GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGOG 2520 
TCCCCTGCrC GTCCTCCCGC AGCAOGGTCA CAGCAGCATC CCAGTGTTCC CAQAAGQATQ 2580 
ACACCCQQCC GGQCCCCAGA ACAGCAGCCC CCTCCTCCCG TOGCCAOGTC CCAGCACCAC 2640 
CCGGGACCCC AGAGCAGAGA CGOGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 
CAGGCOaGGC 0GCCC06CCC CAOQTOGCAG OGCCGCTCCC ACTCCTCCTC GOACCCTTAC 2760 
60 ACGGC6A0CT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAQ 2820 
GGCAGCTACG ACGACQACAO CACAGAAOTC OAOGCCCAGG AT3TGCGG0C CCCOGCGCAC 2880 
GC0GCX3CG0G CCAAGQAQGC AGCTQCGTCC CTTCCCAAGC ACCAGCAGQT GOAOTCrCCC 2940 
ACAGGCGCAG GGGCAGOTGQ CGACCACAGG TCCCAGOGCO GAGATOOSGC CTCCCCOGCC 3000 
^ AGGCCCAGCC QACCCGGCGG CCCCCAGTCC CGOSCCOGGG TCCCC3M«3W3 GGCAGC3GC0G 3060 

65 6GGAAGT0GQ AGCCTCCTTC CAAGOGGCCC CrGTCCrCCA AGTCCCAGCA GTOGGTCTCA 3120 
6COGAGGACO AGGAGGAGGA GGAC6CGGGG TTTTTTAAAG GCGGGAAA6A AGACCTTCTG 3180 
TCTTCCTCTO TGCCAAAGTG GCCCTCTTCC TCCACTCXXaV GGGGCGQCAA AGACGCCGAT 3240 
GGGAGCCTCG CCAAGGAAGA 6A0GGAGCCT GCCATC3GCQC TTGCCCCTCa CGGAGGGA6C 3300 
CTCGCTCCTG TOAAOCOACC TCTCCC C CCA CCTCC3«»CA GCTCCCCCAG GGCCTCCCAC 3360 
70 GTCCCTTOCC GACCGCCGCC TCGCAGCGCT GCCACOGTGA GCCCCGTOGC GGGCACCCAC 3420 
CXCTGGCOGC GGTACACCAC 0060GCCCCV CCTGGCCACT TCTCCACCAC CCCQATGCTO 3480 
TCCTTGOGCC AGAGGAT6AT GCATGCCAGA TTCCGTAACC CTCTCTCCOQ ACAGCCTGCC 3540 
AGACCCTCTT ACAQACAAGO TTATAATGQC AGACX»AATa TAGAAG6GAA AGTCCTTCCT 3600 
^ GGTAOTAATQ GAAAACOGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660 
75 GTTGTGGACC TTGATCGTGO QTTAGTATTG AAT6CAGAAG GAAGGTACCT CCAAQATTCA 3720 
CATOGAAATC CTCTTOGOAT TAAACTAGOA GGAGATGQTC GAACCATTGT AGATCTGQAA 3780 
GGGACCCCOS TGQTGAGTCC TGACGGCCTC CCACTCTTTO GGCAGGGGOG ACATGGCACA 3840 
CCTCTGGCCA ATOCCCAAGA TAAOCCAATT TTGAQTCTTG GAGGAAAOCC GCTGGTGGGC 3900 
TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 
80 ACGACGCCCC T6CCTACCAC TACAACCCOG AGQCCCACCA CTGCCACCAC CATGCAGCCC 4020 
ACCACTACTA CGAOGCCCCT GCCTACCACT ACACXS3AGGC CCACCACTGC CACCACCOGC 4080 
G6CA0GACCA CCAGGOOTCC AACAACCACA 6TC0GAACCA CTACG0G6AC AACCACCACC 4140 
ACCAOCCCCA AACCCACCAC TCCCATCCCC ACCTGT C OCC CrGGOAOCTT GGAAOGGCAC 4200 
GAOQATQATG OGAACCrOAT AATGAGCTCC AATGOOATCC CA£2AaTQCTA OOCTOAAGAA 4260 
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5 

10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



GATGAGTTCT 
TATGATGAAG 
ACTQCTACCA 
GAATTTGATC 
QACCCATCAG 
GATGAAATOV 
AACATCACO; 
GCCACCCCAO 
ATCAGGAACA 
AASCCCAACA 
ATCAGCCXTT 
OCAOGOGGTQ 
GOCATGGACQ 
GTAATTCACT 
TTGGAGACAO 
6AAGAACAG0 
A6TATGC3TCA 
TGGOCTOOTA 
CT6CAAGCTT 
CAGCCA6CAT 
CATTCTGGTC 
TTCAQTTTTQ 
CCAGAGACAT 
AATTOSGACA 
T6CTACAT6C 
TCAOGTCCCT 
ATATCCTACT 
TTATTTOTAA 
TOTCTGCAAO 



CAGGCTTCSGA 
ATTATGAATT 
CACOGAGGGT 
TGGCTGGAAG 
CCCOGTGCTC 
TCCCCAATGA 

GAGATTTG6T 
AGTTTTCCAC 
OGAGGTATTA 
OGGTCTCATT 
AGCTATCTGG 
GCAATATGTG 
GAGGTATAAA 
CTGQGQAAOA 
GCCTCAGTCC 
GGA6CCTGTC 
CGAGTGTGGG 
GCCCTGCCCA 
GCTCAGCCCC 
ATCTCAOTCT 
CrOTTAACTT 
CAGAAACCAG 
CTTCAGTATT 
TTTCTGTTTT 
TTGTAT6CAG 
TGAAATTTAC 
ATTCTCAATT 
AATATTQATT 



OACTGACACT 
TGAGACGTCA 
QATCCCAOAO 
GAAA06ATTT 
TCTGACTQAT 
CCTGAAGAAG 
GGAAGGTT6C 
CACAGGTTAT 
TCAA6CTTCA 
TTTTAAAGTG 
TGTCACOGAA 
ATCCCATTG6 
AAGCGCACXrr 
ATCTACCTCA 
GGTGAAGACC 
TATGTAGAAG 
AGGTTTGGGA 
GTCTCCATCC 
6C0CCACCAA 
GCT6CCCTA0 
GGAACrCAGT 
T6CTTCTCTA 
CAACTOATTC 
TCCAGGAATA 
TCTCATTTTG 
TAGAAAGGAA 
TCTATGGACr 
TTGATATATA 
AAAATTQCTA 



GCAOTACCTA 
AGGCCACCAA 
GAAGOOGCCA 

GCACTGGATC 
AGTGATCTGC 
CACrCATTTG 
TTGOTTTACA 
TCAGTAACTC 
CAAGCACAAA 
TCAGATAATC 
CT1TCAAACA 
GGTATOGAAA 
GTGACAACCT 
ATTOCCAATT 
CCCTCGCTAC 
ACATOGGCTT 
CTGGAAA6TG 
CTAAGTOQCA 
GT6CCA0GAA 
CCCACTTCTT 

crrrmrm 

AGTQTQATTT 

gcatatgcac 
gatttctcx:a 

TTATTAAAAA 
TACCCACTGC 
TATQTATATA 
AATTTGTACT 



CGOAAOAGGC 
CCACCACT6A 
TCAGTTCCTT 
ACGTGAOGTA 
ACTTCCAAGT 
CTCCCCAGCA 
TCATTGTGGA 
GTGCATCCTA 
ACTTGCCCAT 
ATCCTCATGG 
CTCTGCITGT 
TQATCCCAGC 
GTTCGTGGGA 
GAAAGATACA 
TGTGQATTCA 
TATTCAAG6C 
G6GAA0CCCC 
GTAATCACAO 
CTAGGGGCTQ 
GGTCACAOAT 
GGCCTG6ACA 
TTTGTTTGTA 
CCCAGACTTT 
GCTGTTCTTG 
AAACTAACTG 
CACCACCAAA 
TA6AATAAAT 
TGCATATACA 
TOTTCACCAA 



CTAOGTTATA 
GCCTTCGACC 
TCCTGAAGAA 
CCTAAATAAA 
GQAaUBCCTG 
TGCrCCCC G C 
TTGGQACAAA 
TGAAGATTTC 
TGAGAACXTTA 
CTAOQGACCT 
TGTGAGGCCC 
TACAOOGACT 
GTrOTTCTTT 
TTCTACAGCA 
CACCTTGATQ 
TACTATOGOC 
TACTACTATO 
GACCGTCATO 
TQAQCAAAGA 
GGACACTGGC 
AT6AACAGGA 
ATA6CACATC 
TTAGGCATGA 
CTTCATGGAA 
AATTTAAGCT 
QAAAATAAAT 
GTATGAAATC 
TATCCACACT 
AAAAAAAAAA 



4320 
43B0 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5260 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 



Seq ID NO: 60 Protein eequence 
Protein Accession #i Eos sequence 



1 
I 

MPGTKLTRTG 
SRQYTVRYRE 
TPBSAPTTAP 
FQNTFPHTPR 
SItOQQECSCT 
MCYEDPVSSL 
QGRNAKDLLL 
DTQDQKRTLR 
ASPAHHASTQ 
SSVLRDRSSV 
SPLSKGGKDG 
GRQABATAQT 
HPRVP8HSDS 
SLHRKEPIPB 
PAXtPPAARSQ 
AGRPRPTSQG 
ABAKEAAASL 
KSBPPSKRPL 
SLAKEEREPA 
ttPRYTTRAPP 
SNGKPNGQRI 
TPWSPDGLP 
TPLPTTTTPR 
TPKPTTPIPT 
DEDYEFETSR 
PSAPCSLTDA 
TPGDLVTGYL 
SP8V8FVTBS 



11 
I 

APADYRVILK 
XGBLARWDYK 
ENLNWfPVNG 
LSNHLEQSPS 
QXDFSOAYL 
TGNSLK8VAA 
OLKNKILANO 
PPSRHGH8W 
GTSHRPSLPA 
BPGAKPASPA 
EDAPAIIMSHA 
LRARPASQHF 
HPKLS8GIHG 
NPKSTQADTH 
QHPSVPRRMT 
RSHSSSDFYT 
PKHQQVESPT 
SSKSQQSVSA 
lALAPRQGSL 
GHFSTTFMLS 
INGPQGTKHV 
LFGQ6RH6TP 
PTTATTMQPT 
CPPGTLERHD 
PPTTTBPSTT 
LDHFQVDSLD 
VYSASYEDFZ 
CNPLLWRPP 



21 
I 

TSQEDELOVP 
QIANRRVLIB 
KPTWAASWD 
PILBTUiLPH 
ZDIQTKQVNK 
SXADVQQNTB 
GAPRKFQLRA 
APGRTAVRAR 
SLNDHDLVDS 
RRTPHSGAAB 



HLLRHKPFAA 
DEEDBKPLPA 
PQGXySSIiAS 
P6RAPBQQPP 
ASSRGMLPTA 
GAGAG6DHRG 
EDEEEBDAGF 
APVKRPLPPP 
UtQRIWKARF 
VDLDRQLVLN 
LANAQDKPIL 
TTTTPI»PTTT 
DDGNLIMS8N 
ATTPBVIPEB 
BZIPHDLKXS 
RNKFSTQASS 
GQBLSOSBSL 



31 
I 

DDZSVRVMSS 
NLIPDTVYBP 
ALPBTEGKVK 
HMVCSLGKAX 
DPQLB6SVFG 
ONGKPBKPEP 
XKAB6LDLQS 
MPALPRREGV 
DEDERAVGSIi 
EDSSA8APPS 
SSHLSSRTQV 
NGR8PSRPSI 
TWNDHVPSS 
KAQDVQQSTD 
PPVATSQRHP 
IiQNQDEDAQO 
QRGRAASPAR 
FKGGKEDUiS 
PGS6PRASHV 
RMPLSRQPAR 
AEGRYIiQDSH 
8LGGKPLVGL 
PRPTTATTRR 
GIPEOrAEBO 
GAI8SPP6BB 
DLPPQHAPRN 
VTHLPIBHiK 
SNHZPATRTA 



41 
I 

QSVLVSHVDP 
AVRXSQGERD 
VCLLDTGLPS 
PSK5GFQT6E 
PCPLFYFLTF 
8SPSFRAPAS 
TBITGEEBIiG 
DKPGFSLATQ 
KPKGAFAQPR 



51 
I 

VLEKQKKWA 
GKWST9VFQR 
V88PQPSAK8 
AWDLTPKPSL 
HLDZ66F8FZ 
SQHPSVPASP 
SREDSPMSPS 
PRPGAPPSAS 
PALSPSRQSP 
RLLPTQPHLS 



SBQABASDGB 
GRGPRLQPSS 
SRQPISRGWE 
ADTBGKSPXA 
GPQSRDAI^S 
SYDDDSTEVB 
PSRPOGPQSR 
SSVPKWPSSS 
PSRPPPR8AA 
P8YRQGYNGR 
GNPLRZKLG6 
BVIKKTTHPP 
TTTRRPTTTV 
EPSGLETDTA 
FDLAiSRKRFV 
ZTWAVE6CB 
PNTRyyFKVQ 
KDGNM 



spqstvpsra 
dlrrspqrga 
qpgstdrhas 
psqprlsltq 
aqevrapaha 
arvpsraapg 
tpr0grz3ad0 
tvspvaothp 

PHVEQKVXiFG 
DGRTIVDLEQ 
TTTOQPTTTT 
RTTTRTTTTT 
VPTEEAYVIY 
APYVTYUIKD 
8FVZVDHDKA 
AOHPHOyaPZ 



8eq ID NO: 61 DMA sequence 

Nucleic Acid Accession #t NM_022743 

Coding sequence : 12 8 133 7 



1 
I 

GTGGATTTTA 
TCTCTQCTGC 
AAM3CTGATG 
AAAAGCTTGG 
TGCTCCAQAC 
TTCAGAATCA 
TGAAGATAAG 
AGAAATACAO 
AGTGATCTGC 
ATATC0CA6T 
TOGGCOCCAC 
CT6CTACCTG 
CTGCTTTOAA 



11 
I 

GAGATACCTC 
AACACCCCTA 
OGATGCTCTC 
CCAGACCACA 
TCCGTTCGAC 
GAOAAGCTTT 
AAAGAGG6CC 
GATGCCTCTC 
AACTCTTTCA 
ATCTCTTTQC 
CTCTTACT6C 
GATATGCTGA 
TOTGACTOTT 



21 
I 

CCCTCCTTCT 
CTGTTCTCOG 
AGTOCCGCGT 
AGOQGOAATG 
TTCTTGGCAG 
ACTCATTTTA 
TCAGGCAACT 
AGCTGCCACC 
CCATCT6TAA 
TCAAT ^CAO 
QAGCAGTCOQ 
TGACCAGTGA 
TCOGTTQCCA 



31 
I 

GCrCAGCTGC 
TGTATTGOCT 
OGCCAAATAC 
CAAATGCCIT 
AGTTQTCTTC 
TGATCTGGAG 
CGTAATGACA 
TGCCTTTGAC 
TGOGGAGATG 
CTOTGAOCCC 
AGACATCGAO 
GGAGCGCCGG 
AACCCAGGAC 



41 
I 

CTT6CA6TAA 
TTTCTGGGCA 
TGTAGTGCTA 
AAAAGCTGCA 
AAACTTATGG 
TCAAATATTA 
TTTCAACATT 
CTTTTTGAAG 
CAGQAAGTTG 
AACT8TT0QA 
GTGOQAQAnO 
AAGOXGCTGA 
AAGGATQCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



51 
I 

TTAAACTCTT 
GCAGGAA6GA 
AGTGTCAGAA 
AAOCCAGATA 
ATGGAGCACC 
ACAAACTGAC 
TCATGAGAGA 
CCTTTGCAAA 
OTGTTGGCCT 
TTGTGTTCAA 
AOCTCAGCAT 
GGGAOCAOTA 
ATATQCTAAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
769 
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15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TGGTGATGAG 
GGCACJVCTGG 
TQAAOGOCTT 
CTGCATCAAC 
ATACAGGATT 
CAAACTGCAO 
TGATATTAT6 
TTTAQAAGAA 
CGGCGTGTGT 
GTGAACCTCT 
GGTTT6CAAA 
ATTTGGTTGA 



CAAGTATGGA 
AAGTGGGAGC 
CCCGATATCA 
CTOCGCCTGT 
TTTTTCCCAO 
CTACATCAAQ 
AGAGTGACAC 
TGCGACOCCA 
CTTTGTTGAA 
CTTATTG6AA 
CChCAAOAAT 
GGAT6CCAAA 



A6GAAGTTCA 
AGGTTCTCGC 
ACATCTACCA 
TGGAGGAAGC 
GAAQCCATCC 
GCATGTTTCC 
ATX^GCAGAiSA 
ACATCAGACC 
TGCCTTATTG 
ATTCTGTTCC 
CATTAGTTGT 
AAAAAAAAAA 



AGAATCCCTQ 
CATGTGCCAO 
6CTQAAGGTG 
CTTQTTCTAT 
OGTCAGAGGO 
CCAAGCAATO 
ACACAGCCT6 
ATCCTAAGGG 
AGGTCACACA 

AGAGAA6CAC 



AAAAAAATTG 
GOGATCATAA 
CTOGACTGCG 
GGTACTCGGA 
GTTCAAOTGA 
AAGAATCTGA 
ATT6AAGATT 
AACGCAGTCA 
CTCTATOCTT 
AGGTAAATAA 
6ATTATAATA 



AAGAACTQAA 
GCAGCAATTC 
CCATGGATGC 
CCATGOAOCC 
TQAAAQTTGG 
GACTGGCTTT 
TGATTCTACT 
GAGGGAAATA 
TQTTAGCTGT 
AGGCAGACAT 
AATTCAAAAC 



B40 
900 
960 
1020 
1080 
1140 
1300 
1260 
1320 
13B0 
1440 



Seq ID NO: 62 Protein sequence 
Protein Accession #i HP 073580 



1 11 

I I 
MRCSQCRVAK YCSAKCQKKA 
SEKLY8FYDL ESNINXLTED 
arSFTICNAB MQEVQVGLYP 
LDHLMTSBBR RKQLRDQYCF 
WKWEQVLAMC QAIZ8SNSBR 
IPFPGSHPVR 6VQVMKVGKL 
BCDANZRAS 



21 31 41 51 

III! 

WPDHKRECKC LKSCKPRYPP DSVRLLQRW FKLMDGAPSB 
KKEX3IiRQZ«VM TFQBFMREEI QDASQLPPAF DLFEAFAKVI 
SISLUmSCD PNC8IVFNGP HLIiLRAVRDZ EVGEBLTZCY 
ECDCFROQTQ DKZMIWXiTGD EQVMXEVQE8 UOCIEBLKAK 
LPDZMIYQUC VLDCAKDACZ NLGUiEBALP YOTRTNBPYR 
QLHQGMFFQA MKNLRLAFDX NRVTHGRms LIEDLIIiU.B 



60 
120 
IBO 
240 
300 
360 



Seq ID NOi 63 DHA sequence 

Nucleic Acid Accession «s lIM_003014.a 

Coding sequence: 238.. 648 ~ 

1 11 21 31 41 51 

I I I I I I 

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGOGCTOCTC GTGCCCTGTG TGCCAGACGG 
CGGAGCTCXX3 CGQCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 
AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGG06AA0 G6ACAG0QAA AGATGAGGGT 
GGCAGGAAOA GAAQOOGCTT TCTQTCT Q CC GGOGTOGCAO 0GCX3AGAG0Q CAGTG CCATO 
TTCCTCTCCA TCCTAGTGGC GCTGTGCCTO TGGCTGCACC 7X3606CT6GO CGTGOGOGGC 
GCGCCCTOOO AGGCGOTGCG CATCCCTATG TGCOGGCACA TGCCCTGOAA CATCAOQCGG 
ATGCCCAACC ACCTGCACCA CAGCACX3CAQ QAGAACGCCA TCCTGGCX31T CGAGCAGTAC 
GAGGAGCTGQ TG6AC0T0AA CTGCAGCGCC GTOCTGCGCT TCT TCrT CTG TQCC ATOTAC 
QCX3CCCATTT GCACCCTGQA 6TTCCT6GAC QACOCTATCA AGCOGTGCAA GT0GGTQT6C 
CAACG06CQC GCX3A0GACTG CGAGGCCCTC ATQAAGATGT ACAACCACAO CTGGCCOQAA 
AGCCTGQCCT GCOAOGAOCT GCCTGTCTAT GACOQTGGCG TGTGCATTTC GCCTGAAGCC 
ATCGTCACGQ ACCTCCOQGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 
CAGGAAAGGC CTCTTGATGT TGACTGTAAA 0GCCTAA6CC CXXSATOQOTG CAAGTGTAAA 
AAGGTGAA6C CAACTTTG6C AAGGTATCTC AGCAAAAACT ACA6CTATGT TATTCATGCC 
AAAATAAAA6 CTGTGCAGAG 0AGTG6CTGC AATGAGGTCA CAAOQGTGGT GGATGTAAAA 
GAOATCrrCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCOGCTCAT TACAAATTCT 
TCTTGCCAGT OTCCACACAT CCTGCCCCAT . CAAGATGTTC TCATCATGTG TTACGAGTGG 
CXSTTCAAQGA TGATGCTTCT TGAAAATTGC TTAOTTOAAA AATOGAGAGA TCAGCTTAGT 
AAAAQATOCA TACAGTGG6A AGAGA06CTG CAG6AACAGC GGAGAACAOT TCAGQACAAO 
AAGAAAACAO CCG6GCX3CAC CAGTCGTAGT AATCCCCCGA AACCAAAGGG AAAGCCTCCT 
GCTCCCAAAC CA6CCAGTCC CAAGAAGAAC ATTAAAACTA G6A6TGCCCA GAAGAGAACA 
AACCCGAAAA 6AGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC rTCCTTACAG 
GATGAGGCTQ GGCATTGCCT GGGACAGCCT AT6TAAGGCC ATGTGCCCCT TGCOC TAACA 
ACTCACTGCA 6TGCTCTTCA TAGACACATC TT6CA6CATT TTTCTTAA60 GTATGCTTGA 
GTTTTTCTTT GTAA6CCATC ACAAGCCATA GT8QTAG0TT TGCCCTTT06 TACAGAA6GT 
GAGTTAAAGC TGGTGQAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTQ TQTGCATACT 
CTAQAAGAGT AGGQAAAATA ATGCTTGTTA CAATTCGACC TAATATOTGC ATTGTAAAAT 
AAATGCCATA TTTCAAACAA AACA06TAAT TTTTTTACAQ TATGTTTTAT TACCTTTTQA 
TATCTGTT6T TGCAATGTTA GTGATOTTTT AAAATGTGAT OAAAAXATAA TOTTTTTAAQ 
AAGGAACA6T AGTGGAAT6A ATGTTAAAAG ATCTTTAT6T GTTTAT6GTC TGCAGAAGGA 
TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG AAAATTATAA 
TGTOTTTTTT TACCAATQAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA AAACAAAAAT 
AATAATAAAG AAAAATAAAT AAAAA66AGA G6CAGACAAT GTCTGGATTC CTGTTTTTTG 
GTTAOCTQAT TTGCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 
ACAG7GAGTT TGTCTGTAGC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 
ATTTTATACC CACAAQAOAG OTATGTCACT CATCTTACTT CCCAGGACAT CCACCCTGAG 
AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTGT TTTTCTTCAT 
TTAAATATTT TCTTTGCCTA AATACATGTG AGASGAGTTA AATATAAAT6 TAGAOAGAGG 
AAAGTTGAOT TCCACCTCTO AAATGAGAAT TACTTGACAG TTGGGATACT TTAATCAQAA 
AAAAAGAACT TATTT6CA6C ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGQAGGC 
ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTTTATA 
AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT OTTTGCTTAT 
GCACTTATAA AATGATTTGA ACAAATAAAA CIAOGAAOCT GTATACATGT GTTTCATAAC 
CTGOCTOCTT TGCTTG GO CC TTTATTGAGA TAAQTTTTCC TGTCAAGAAA GCAGAAACCA 
TCTCATTTCr AACAGCTOTG TTATATTCCA TAOTATGCAT TACTCAACAA ACT GT TQT O C 
TATT66ATAC TTAGOTOGTT TCrTGACTOA CAATACTGAA TAAACATCTC A006QAATTC 

Seq ID NOs 64 Protein sequence 
Protein Accession ii Np 003005.1 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



21 



31 



41 
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) I I t 1 I 

MFLSILVAU: ZAfLHLALGVR GAPCBAVRIP KCRKMFHNIT RMPKHLHHST QENAIIAIEQ 60 

YEELVDVNCS AVLRFFFCAM YAPICTLEPL HDPIRPCK8V CQRARDOCEP LMXHYNHSKP 120 

BSLACDEIiPV YDRGVCISPE AIVTDLPEDV KWIDITPDMM VQBRPLDVDC KRI»SPDRCKC 180 

KKVKPTUVTY LSKNYSYVIH AKIKAVQRSG CNBVTTWDV KBIFKSSSPI PRTQVPLITN 240 

SSOQCPHZLP HODVlflMCYE HRSRMNLLEN CLVBKHRDQL 8XRSIQWEER LQBQRRTVQO 300 
KKKTAGRTSR SNPPKPXGKP PAPXPASPKR NIKTRSAQKR TMPKRV 

Seq ID NO: 65 DNA sequence 
Kucleic Acid Accession fli BC010423 
Coding sequence: 248. .1780 

1 11 21 31 41 51 

I i I I I i 

CACA60GTG0 QAAOCAGCTC TGGOGQAOCT 06CSASCTCCC GATCACXSQCT TCTTGGGOOT 60 

AGCTAOGGCT GGQTCT S TAG AAOGGGOOCG O06CTG666C TGOOTOOCCT AGTG6AGACC 120 

CAAOTGCGAO AGOCAAOAAC TCTGCAGCTT CCiqOCTTCf GGGTCAOTTC CTTATTCAAO 180 

TCTOCAQCCO GCTCCCAGOO ACJATCTOGOT GGAACTTCAG AAAOGCTGGG CAGTCTGCCT 240 

TTCAACCATO CCCCTGTCCC TGGGAGCCGA GATGTGGGGG CCTQAGGCCT GGCT6CTGCT 300 

GCTGCTACTG CTGGCATCAT TTACAOGCOO GTGCCC060G GGT6AGCTGG A6ACCTCAGA 360 

CX3TGGTAACT GT G OTG C TGO G0CAG6ACGC AAAACTGCCC TGCTTCTAGC GA66GGACTC 420 

OGGOQAOCAA GTGOGGCAAO TGQCATG6GC TCGGOTOGAC 60GGGCX3AAG OOOCCCAGGA 400 

ACTAGOGCTA CTGCACTCCA AATACGGGCT TC3VTGTGA6C COGGCTTAOJ AGOGCOGOGT 540 

GGAGCAGCaS CCGCCCCCAC GCAACCXXXTT GQACX3GCTCA GTGCTCCTGC GCAACGCAOT 600 

GCAQ60GGAT GAGG606AGT A06AGT6C00 GGTCAGCACC TTCCCCGCCG GCAGCTTCCA 660 

G6C3QOGGCTG GGGCTCCX3AG TGCTGGTGGC TCCX:CTGCX:C TCACTGAATC CTOGTCCAGC 720 

ACTAGAAGAG GGCXZAGGGCC TQAOCCTQGC AGCCTCXirTGC ACAGCTGAGG GCAGCCXAGC 760 

CCCCAGCGTG ACCTGGGACA CX^GAGGTCAA AG6CACAAGG TCCAGCOGTT CCTTCAAGCA 840 

CTCCCQCTCT QCTGCOQTCA CCTCAOAOTT CCACTTGGTG CCTAQCCX3CA GCATGAATOG 900 

GCAGOCACTG ACTTGTGTGG TGTCCCATCr TG GCCTG CTC CAGGACCAAA GGATCACCCA 960 

CATCCTCCAC GTOTCCTTCC TTGCTGASGC CTCTQTQAGG OGGCTTOAAG ACCAAAATCT 1020 

GTGGCACATT GGCAGAGAAG GAGCTATGCT CAAOTGCCTG AGT8AAGGGC AGCCCCCTCC 1080 

CrCATACAAC TGQACACGGC TGGATQOGCC TCTGCCCAGT GGGGTACGAG TGGATGGGOA 1140 

CACTTTGGGC TTTCCCCCAC TGACCACTGA GCACAGCGGC ATCTACGTCT GCCATGTCAG 1200 

CAATQAGTTC TCCTCAAGG6 ATTCTCAGGT CACTGTGGAT GTTCTTGACC CCCAGGAAGA 1260 

CTCTGQGAAG CAGOIGGACC TAGTGTCA6C CTOGSIG GTO CltSgit SGO 'l' G TOATCGCCGC 1320 

ACTCTTGTTC TQCCTTCTGO TGGTGGTQGT GGTGCTCATG TCCCGATACX: ATCGGCGCAA 1380 

GGCCCAGCAG ATGACCCAGA AATATGAGGA GGAGCTGACX: CTGACCAGGG AGAACTCCAT 1440 

CCGGAGGCTG CATTCCCATC ACACX;GACCC CAGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500 

GAGAGCCOAG GOCCACCCTG ATAGTCTCAA GGACAACAOT AGCTGCTCTG TGATGAGTGA 1560 

AOAGCCOQAO GGCOOCAOTT ACTGCAOOCT GACCA0GGT6 AGGGAOATAO AAACACAGAC 1620 

TGAACTOCTG TCTCCAGGCT CTGOGOSGGC C6AGGAGGAG GAAGATCA06 ATGAAGGCAT 1680 

CAAACAGGCC ATGAACCATT TTGrTCAGQA GAATGGGACC CTACX3G6CCA AGCCCACGGG 1740 

CAATGGCATC TACATCAATG GGOGGGGACA CCTGGTCTGA CCCAGGCCTG CCTCCCTTCC 1800 

CTAGGCCTGG Cit X Tl t riXTi' TGACATGGGA GATTTTAGCT CATCTTGOGG GCCTCCTTAA 1860 

ACAOCCCCAT TTCTTGOGGA AGATGCTCCC CATCCCACTQ ACTGCTTGAC CTTTACCTCC 1920 

AACCCTTCTG TTCATOGGGA GGGCTGCACC AATTGAGTCT CTCCCACCAT GCATGCAGGT 1980 

CACTGTGTGT GTGCATGTOT GCCTGTQTGA OTGTTGACTO ACTOTOTGTO TGTGGAGGGG 2040 

TQACTGTCCG TGGAGGGGTG ACTGTGTCCG TGGTGTGTAT TATGCTGTCA TATCAGAGTC 2100 

AA0TGAACTG TGGTGTATGT GCCAOQQQAT TTGAGTGGTT GCX3TGGGCAA CACT6TCA6G 2160 

GTTTG600T6 TGTGTCATOT GGCTGTGTGT GACCTCT6CC TGAAAAAGCA GGTATTTTCT 2220 

CAGACCCXIAG AGCAGTATTA ATGATGCA6A GGTTGGAG6A QAGSU3GT6QA GACTGTGGCT 2280 

CAGACCCAG6 TGTGCGGGCA TAGCTGGAGC TGGAATCT6C CTC06GTGTG AGGGAACCTG 2340 

TCTOCTACCA CITCGGAGCC ATGGGGGCAA GTGTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400 

QAGGCTTGAA CTGTTACAGA A6CCCTCT6C CCTCTGGTGG CCTCTGGGCX! TGCTGCATGT 2460 

ACATATTTTC TGTAAATATA CATGG6CXX3G GAGCTTCTTG CAGGAATACT GCTCOQAATC 2520 

ACTTTTAATT TTTTTCTTTT V mrKV l Q CCCTTTO^T TAOTTQTATT TTTTA TTTAT 2580 

TTTTATTTTT ATTTTTTTTT AGAGTTTGAG TCCAGCCTGQ AOQATATAGC CAGACGCTQT 2640 
CXOTAAAAAA ACCAAAAOOC AAAAAAAAAA AAAAAAAAAA 

Seq ZD NOi 66 Protein sequence 
Protein Accession «: AAH10423 

1 11 21 31 41 51 

I I I I 1 I 

MPLSLGAEMN GPSAWLLLLL LLA9FTQRCP AOBLBTSDW TWLGQDAKb PCFYRGDSGB 60 

QVGQVAWASV DA6EGAQELA LLHSKYGLBV SPAYBGRVEQ PPPPRNPLDG 8VLLRNAVQA 120 

DEQBYECRVS TFPAGSFQAR LRLRVLVPPIj PSLNPQPALE BGQGLTLAAS CTAEGSPAPS 180 

VTWDTEVKGT T8SRSPKHSR SAAVTSBPHL VPSRSKNGQP ItTCWSHPGL LQDQRITHIL 240 

HVSPLABASV RGLEDQNLMH IGRBGAMLKC LSEGQPPPSY NHTRLDGPIiP SGVRVSGDTL 300 

QFPPXiTTBHS OIYVCHVSNB PSSRDSQVW DVIiDFQBDSO KQVDLVSASV VWGVXAAIiL 360 

FCLLWVWL MSRYHRRKAQ QMTQKYEEEL TLTREHSIRR LBSHHTDPRS QPEBSVGUIA 420 

EGHPD3LKDH SSCSVNSBBP E6RSYSTLTT VRBIETQTeL L3P0SGRABB EEDQDEQIKQ 480 
AMNHFVQQK3 TUtAKPTGNG ZYIKGRGHLV 

8eq ID NOi 67 PHA sequence 

Nucleic Acid Accession ft t KM_001203 

Coding sequence t 274.. 1782 

1 11 21 31 41 SI 

I I I I I I 

COGGQGGOGC GGAOTGGGGO GGGCCTGGCO GGAOGCXSGQC AGIGOGGAGA C0Q0GG08CT 60 

GAGGAOGOGG GA6C0GGGAO 0QCA0G0GC6 G66TGGAGTT CAGCCTACTC TTTCTTAGAT 120 

GTGAAAGGAA AGQAAQATGA TTTCATGCCT TGTTQATAAA aOTTCAOACT TCTGCTGATT 180 



975 



wo 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



CATAAOOITT 
TQCO^TAAGT 
AATGTGGGCA 
TTGOOTTGTA 
GAOGGATATT 
GGTTGCCTAG 
AOAAGATCAA 
CTGCCTCCAT 
ATATCTGTGA 
TATAAAAQAC 
ATTCCTCCTO 
TCA60CCTCC 
ATTGOAAAAG 
OTGAAAGTGT 
ACAGTGTTGA 
OGQTOCTGQA 
TATCTGAA6T 
AQTGGCTTAT 
CATOGAGATC 
GACCTG6GCC 
ACTCGAGTTQ 
AATCACTTCC 
GTTGCTAGGA 
CTAGTGCXXa^ 
06CCCCTGAT 
ATQACAGAAT 
ACACTTGCCA 
CATCTCTGCA 
TAAGCATCCA 
CTTTCAGGGA 
TCTGTTTGTA 



TGGCTCTGAO 
QAGAAGCAAA 
CCAAGAAAGA 
AATGCCACCA 
GTTTCA06AT 
GACTAGAAGG 
TTGAATGCTO 
TGAAAAACAG 
C TGTCTGTAG 
AAGAAACCAG 
GAGAATCCCT 
CTCTGCTOGT 
GTOGCTATGG 
TCTTCACCAC 
TGAGGCATGA 
CCatfSTTQTA 
CCACCACCCT 
6TCATTTACA 
TGAAAAGTAA 
TGGCTGTTAA 
GCACCAAA06 
AGTCTTACAT 
GATGTGTATC 
GTGACtXCTC 
TCCCAAACOQ 
GCTGGGCTCA 
AAATGTCAGA 
GAAA6CCAAC 
CAGTACAAGC 
G06ACCTGGG 
OGGGGAGAAA 



CTATQACAAO 
CTTOCTTGAT 
GGATGGTGAG 
CCATTGTCCA 
GATAGAAGAG 
CTCAGATTTT 
CACAGAAAG6 
AGATTTTGTT 
TTTGCTCTTG 
ACCTOGATAC 
GAGAGACTTA 
CCAAAGGACT 
GGAAGTTTGG 
AGAGGAAGCC 
AAACATTTTG 
CCTAATCACA 
AGAOQCTAAA 
CACAGAAATC 
AAACATTCTO 
ATTTATTAOT 
CTATATGCCT 
CATGGCTGAC 
AGGAOGTATA 
TTATGAGGAC 
GTGGAGCAGT 
CAATCCT6CA 
GTCCCAGGAC 
AGQTACTCTT 
CTTG/ACATC 
CAAAQACAGA 
COGTTGGGTA 



AGAGGAAAGA 
AACAT6CTTT 

AGTACA6CCC 
GAAGACTCAG 
GATGACTCTG 
CA0T0T03 C Q 
AAOGAATGTA 
GATGGACCTA 
GTCCTTATCA 
AGCATTGGGT 
ATTGA6CAGT 
ATAGCTAA6C 
ATGGGAAAGT 
AGCTGGTTCA 
GGTTTCATTG 
GACTATGATG 
TCAAT6CTQA 
TTTAOTACTC 
GTGAAGAAAA 
6ATACAAAT0 
CCAGAAGTGT 
ATGTATAGTT 
GTGGAAGAAT 
ATGAGGGAGA 
GATGA6TGTC 
TCAAQGCTGA 
ATTAAACTCT 
CTGTTTGT6G 
GTCCTGCTTC 
GAAGCTCCCA 
ACTTGTTCAA 



AAAAGTTAAA 
TGOGAAGTGC 
CCACCCCCOO 
TCAACAATAT 
GGTTGCCTGT 
ACACTOCCAT 
ATAAAQACCT 
TACAGCACAG 
TATTATTTTC 
TAGAACA66A 
CTCAGAOCTC 
AGATTCAOAT 
GGOGT0GO8A 
GAGAGACAGA 
CTGCAGATAT 
AAAATOGTTC 
AGTTA6CCTA 
AAGGCAAACC 

ATGGAAcrra 

AAQTT6ACAT 
TGGAOGAGAO 
TTGGCCTCAT 
ACXA6CTTCC 
TTGTGTGCAT 
TAA6GCAGAT 
CAGGGCTGOG 
GATAGGAOAG 
GCAGAGCAAA 
CCAGTGGGTT 
6AAG6AGAGA 
GATATGATGC 



CTTACAAGCC 
AGGAAAATTA 
TGCAAAGGTC 
TTGCAGCACA 
GGTCACrrCT 
TOCTCATCAA 
ACACCCTACA 
OQCTTTACTT 
TTACTTCOGG 
T6AAACTTAC 
AGGAAQTGQA 
GGTGAAACAG 
AAAGGTAGCT 
AATATATCAG 
CAAAGGGACA 
(XTTTTATGAT 
CTCTTCTGTC 
AOCAATTGCC 
CTGTATTGCT 
ACCACCTAAC 
CTTGAACAGA 
CCTTT6GGAG 
TTATCATGAC 
CAAGAAOTTA 
GGGAAAACTC 
GGTIAAGAAA 
QAAAAOTAAG 
AGACATCAAA 
CAGACCTCAC 
TTGATCCGTG 
AT 



Seq ID NOt 68 Protein aeouence 
Protein Accession ftt HP 001194 



1 
I 

MLLRSAGKLN 
DSGLPWTSG 
6PXHHRALLX 
BQSQSSOSGS 
WPRBTBIYQT 
MLKLAYSSVS 
TNEVDXPPNT 
EEYQIiPYBDL 
KLTAIAVKKT 



11 

I 

VGTKKEDGE8 
CLGLEGSDFQ 
STTVCSLLLV 
6LPLLVQRTI 
VLMRHEHXIJG 
GLCHUiTEXF 
RVGTKRYMPP 
VPSDPSYEDM 
LAKMSBSQDI 



21 
I 

TAPTPRPKVI* 
CRI>TPIPBQR 
LXXLFCyPRY 
AXQXQMVXQX 
FXAADXKOTG 
8TQGKPAXAH 
EVLDBSLNKN 
RBIVCXlOaiR 
KL 



31 
1 

RCKCHHHCPE 
RSXECCTERN 
KRQBTRPRYS 
6K0RYQBVWM 
SWTQLYLITD 
RDLKSKNXLV 
HFQSYIKADM 
P8FPNRHS8D 



41 

I 

DSVNNXCSTD 
EQna>LHPTL 
IGLEQOETYX 
GKMRGEKVAV 
YHEMG8LYDY 
KKMGTCCIAD 
YSFGLILWBV 
ECLRQM6KLM 



Seq ID HOt 69 DNA sequence 
ITuclelc Acid Accession 
Coding sequences 166.. 1737 



Eos sequence 



TTQGGGGTTT 
AGTGCAGTAG 
TOCTCCCACC 
TTAAAGCTTT 
G06GTAGCTG 
GGTGTCTTCT 
ATAATCTCAA 
AATGGTTT06 
G TGGCO GO CT 
GGTCTG6GAT 
AAAAGAOGTT 
TTOGCACCAG 
GT6GGCCTAC 
AGAGGACCAG 
CTTGAAAATG 
ACCTCACCTA 
CAOCAGOTCC 
TTCTCCATTT 
CTQGQATTCT 
CAGGACCGCG 

ggagctggtt 

GTCATCTTAT 
ATQTCATGCA 

ctgcttoctg 

ATCTTCATTC 
CAAAGTAAGA 
GTGTGCXrrOG 
QAAACAAAGG 
GAAATGGATC 
ACTTTCTTAC 
AO^CAAOGAG 



11 

I 

ATTCTCTTCC 
TGTGATCACT 
TCAGCGACCC 
GTTCCAAAGC 
TTTCATTTTT 
TTAATGACTT 
TCTOTGTGTT 
GACACCXSTCT 
CCTTCTCACA 
ACTGCTTTAG 
CCATAGTCAC 
CAATCATGGC 
TACAGTTAAA 
0GTCACXX3AA 
AGAAAACAOG 
AAAATGTGCC 
TGGTGAAGAC 
TGAAAQAGAA 
TTGCACCTTC 
CTGCTTTTTT 
TTGTCCTCAA 
TGACTGTGTC 
GCATATTTTT 
AGGATGATGT 
AGAGCATAGC 
TCTACAGCAO 
CCCTGGTGAO 
TAGTOAGCCA 
TTGCAAAAAA 
AACAACAGOC 
GCAGAGQAGC 



21 
1 

CTTCTAACTT 
TCTTACIOCC 
AA8TGGCTGG 
CAAT6T6TAT 
CTTOGTTOAA 
AATGGACA6T 
TGTCTTAACA 
GGTAGTGATG 
AGAGOTTTCT 
TTTTCTCCCA 
TGCAGTTGCT 
TCTGAAGGAG 
CATTGTCATC 
AATAGTCATC 
AACCTCAATA 
TACTCACACT 
CAGOXCAGG 
AAGTTTTATT 
CTTGTACATC 
ATTATCTA06 
CAGGGAGOCC 
TCTGTTT6CC 
TGGGTTTATQ 
CGTGGGCATT 
AGGACTGGGT 
GGCCTTCTAC 
AC06TGTAAG 
TOGTGQGAAG 
TGAGC ACAG A 
ACTOTGTTGG 
TAACCCCICT 



31 



GACAGGGTCT 
GCCTCAAOCT 
GACTACAGAT 
ACTGAAGTGC 
gtcttcacct 
tttaatgaat 
TTTTCAGCTC 
TTG60GG66C 
CATATGTACO 
ACTGTAACCA 
TCCACAGGAG 
OGCATTGGCT 
TTGGQAGCAC 
CAGGAAAATC 
GACTCCATTG 
AACCTGQAAC 
CCAAOOQAAA 
TGTTATGCAT 
ATTCCTCTGG 
ATGGCCATTG 
ATTCQTAAGA 
TTTACTTTTG 
OTTQOAACAA 
6AGAAGATGT 
GGAC06CC0C 
TCCTGCGCAG 
ATGGQACTGT 
ACTTTACAfla 
GTTCACOTGC 
CTGOAOAGGG 
ACTOCACTTT 



41 

I 

TGCTCTGTCA 
TCCAGCCTCA 
TAAGAATGAC 
CTGATG6A6G 
AOGGCATOVT 
CCAATAGCAO 
CCCTG6CCAC 
TACntSTCAG 
TC6CCAT0GG 
TCCTATCACA 
AATGTTTCGC 
6GAGATACAG 
TQCrCAGACC 
OGAAAGAAGC 
ACTCAGGAGT 
TGQAGCOGAA 

agaaa60ccc 
tatttqgtct 
gcattagtct 

CAGAAGTTTT 
TTTACATTGA 
CTACTGAATT 
TAGGAGGQAC 
CTTCIGCA6C 
TTGCAGQTTT 
CTGGCATGGC 
GCXIAGCATCA 
ACATACCTQA 
AAATGGA60C 
ATQGOQTGOO 
CAAAACXACA 



240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
lOBO 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 



51 
I 

GYCFTMIEED 
PPLKNRDFVD 
PPGESLRDLI 
KVFFTTEEA8 
LKSTTUDAK8 
LGLAVKFXSD 
ARRCVSGGrV 
TECWAHNPAS 



60 
120 
160 
240 
300 
360 
420 
460 



51 

I 

TTCAGGCAAO 
ACTCAAGCAA 
CCAAAATAAA 
ATG6GGCTGG 
CAAGACATTT 
GATCTCATGG 
AGTCCTGAGC 
CAC06GGAT6 
CATCATCTCT 
ATATTTTGGC 
TGTGTTTGCT 
OCTCCTCTTC 
GATCTTTATC 
GCAGTATATG 
AGAACTAACT 
GGCOGACATG 
GCTATTAGAC 
CTTTQGAACA 
6G6CATTGAC 
OOGAAGGATC 
GCTCATCTGC 
CTGGGGTCTA 
TCACATTCCA 
TG6GGTCTAC 
GTTGGTGQAC 
CCTGGCTGCT 
TCA CTCAO QT 
A6ACTTTCTG 
OOTATQACAC 
CCCAAOGGGG 
TTTTAAAGGQ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 



976 
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5 

10 

15 

20 

25 

30 
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AATGTGTATG TQAAGAGCAC TACCAACATC GCTTTTGTTr ' IXj ' lTri ' Grn ' TGTTTTAAGC 

T T' iiii ' irrr tgcttgtttt taaagccaaa acaaaaaaca accaagcact cttccatata 

TAAATCTGGC TGTATTCAGT AGCAATACAA GAGATATGTA OAAAGACTCT TTGOTTCACA 
TTOOQATATT AAAATAGTGA CATGAACTQO CAAAGTGCTT TTAAAAGCTT TCACGTGGGA 
TAAAT6ATTT TCTTTTTTrC TTTTCTTTCT TCCTATG6TC TTOTCIQAAT AAACTACTCT 
CCTGAATAAA ACAACATCCA ACCCAGGTCA TTOAAATGAA ATTGQOCAOT C 

Seq ID NO: 70 Protein sequence 
Protein Accession ft: Bos sequence 



1930 
1960 
2040 
2100 
2160 



1 
I 

MTQNKLKLCS 
SRISWIISIC 
IQIISGU3YC 
YSLLFVQLLQ 
GVELTTSPKN 
GLPATLQFFA 
IBLICVILLT 
AAGVYIPIQS 
HHESGETKW 



11 

I 

KANVYTBVPD 
VFVLTPSAPL 
PSPLPTVTIL 
UflVIFGALL 
VPTHTNLBLB 
PSLYIIPLGI 
VSLPAFTFAT 
lAGLAOPPLA 
SHRGKTIiQDI 



21 

I 

GGWGWAVAVS 
ATVLSNRFGH 
SQYFGKRRSI 
RPIPIROPAS 
PKADMQCfVLV 
SLGIOQDRAA 
EFWGLMSCSI 
GUiVDQSXIY 
PEDFXjEMDZA 



31 
I 

PFPVEVPTYG 
RLWMLGGLL 
VTAVASTQEC 
PKIVIQENRK 
KTSPRPSEKK 
FLLSTKAIAE 
FPGFMVGTIG 
SRAFYSCAAG 
KNEHRVBVQM 



41 

I 

IIKTFGVFFN 
V6TGMVAASF 
FAVFAFAPAI 

EAQYMLBKER 
APLLDFSILK 
VPGRIGAGFV 
QTHIPLLAED 
MALAAVCLAIi 
EPV 



51 



BQEVSHMYVA 
MALKERXGWR 
TRTSIDSIDS 
BKSFICyAl4P 
LNRBPIRKIV 
DWGIEKMSS 
VRFCKMGIiGQ 



Seq ID NO: 71 DNA sequence 

Nucleic Acid Accession #> NM_004694 

Coding sequence) 166.. 1737 



11 



21 31 

I I I 

ATTCTCTTCC CTTCTAACTT QACAGGGTCT 
AGT6CA6TAG TGTGATCACT TCTTACTGCC GCXTTCAAGCT 
TCCTCCCACC TCAGCCACCX: AAGTGGCTGG GACTACAGAT 
TTAAAGCTTT GTTCCaUUVGC CAATGTGTAT ACTGAAGTGC 
GOGGTAGCTG TTTCATTTTT CTTOQTTGAA GTCTTCACCT 
GGTGTCTTCT TTAATGACTT AATGQACAGT TTTAATGAAT 
ATAATCTCAA TCTGTeTGTT T6TCTTAACA TTTTCAGCTC 
AAT06TTTG6 GACACXX3TCT GGTASTQAXG TTGGGGGGGC 
OTOOCCGCCT CCTTCTCACA AGAGGTTT CT CATATGTACG 
GGTCTGGGAT ACTGCTTTAG TTTTCTCCCA ACTGTAACCA 
AAAAQAOGTT CCATAGTCAC TGCAQTTGCT TCCACAGGAG 
TTCGCACCAG CAATCATGGC TCTGAAGGAG OGCATTGGCT 
GTOGOCCTAC TACAGTTAAA CATTGTCATC TTOGQAGCAC 
AGAGOACCAG CGTCACOGAA AATAOTCATC CAGGAAAATC 
CTTGAAAATG AGAAAACACG AACCTCAATA GACTCCATTG 
ACCTCACCTA AAAATGTGCC TACTCACACT AACCTGGAAC 
CAGCAGOTCC T66TGAAGAC CAGCCCCAGG CXIAAGCaAAA 
TTCTCCATTT TGAAAGAOAA AAGTTTTATT TOTTATQCiAT 
CTGGGATTCT TTGCACCTTC CTTGTACATC ATTCCTCTOO 
CAGGACCGCG CmC r TTTTT ATTATCTAOQ ATGGCCATTO 
GGAQCTGOTT TTGTCCTCAA CAGGGAGCCC ATTCQTAAGA 
GTCATCTTAT TGACTGTGTC TCTGTTTGCC TTTACTTTTG 
ATGTCATOCA GCATATTTTT T6G6TTTATG GTTGGAACAA 
T6CTTGCTGA A6ATGATGTC 0TG66CATT0 CAGAAGATGT 
ATCTTCATTC AGAGCATAGC AOGACrGGCT GGACCGCCCC 
CAAAGTAAGA TCTACAGCAG OGCCTTCTAC TCCTGOSCAQ 
OTGTGCCTGG CCCrGGTGAG ACGQTGTAAG ATGGGACTGT 
GAAACAAAGO TAGTGAGCCA TOGTGGGAAG ACTTTACAGG 
GAAAT6GATC TTGCAAAAAA TGAGCACAGA GTTCA06TGC 
ACTTTCTTAC AACAACAOCC ACTGTOTTGG CTGGAGAGGO 
ACACAAGGAG QCAGAGGAGC TAACCCCTCT ACTCCACTTT 
AATGTGTATG TQAAGAGCAC TACCAACATC GCTTTTGTTT 

■riTiTrrrrr TGCTTGrrrr taaaggcaaa agaaaaaaca 

TAAATCrOGC TGTATTCAGT AGCAATACAA GAGATATGTA 
TTCCGATATT AAAATAGTGA CATQAACTGG CAAAGTGGTT 
TAAATGATTT TCTTTTTTTC TTTTCTTTCT TCCTATGOTC 
CCTGAATAAA ACAACATCCA A0CCA6GTCA TT6AAATGAA 

Seq ID NO: 72 Protein sequence 
Protein Accession #i NP 004685 



41 51 

I I 
TGCrCTGTCA TTCAGGCAAG 
TCCAGCCTCA ACTCAAGCAA 
TAAGAATGAC CCAAAATAAA 
CTGATGGAGG ATGGGGCTGG 
AOGGCATCAT CAAGACATTT 
CCAATAGCAG GATCTCATGG 
CCCTCGCCAC AGTCCTGAGC 
TACTTGTCAO CACOOGQATG 
TCGCCATOGG CATCATCTCT 
TCCTATCACA ATATTTTGGC 
AATGTTTOGC TGTQTTTGCT 
GGAGATACAQ CCTCCTCTTC 
TGCTCAGAOC CATCATTATC 
GGAAAGAA6C 6CAGTATATG 
ACTCAGGAGT AGAACTAACT 
TGGAGCCGAA GGCCGACATG 
AGAAAGCCCC 6CTATTAGAC 
TATTTGGTCT CTTTGCAACA 
OGATTAGTCT 6G6CATTGAC 
CAGAAGTTTT OGGAAGGATC 
TTTACATTGA GCTCATCTGC 
CTACTGAATT CTGGGGTCTA 
TAGGAG GACT CACATTGCAC 
CTTCTGOUjC tgggstctac 

TTGCAGGTTT GTTGGTGGAC 
CTGGCATGGC CCTGGCTGCT 
GCCAG09TCA TCACTCAGGT 
ACATAOCTGA AGACTTTCTG 
AAATGQAGGC 60TATGAGAC 
ATGGGGTGGG CCCAAOGGGG 
CAAAACTACA TTTTAAAGGG 

' iXirmirm ' tgttttaagc 

ACCAAGCACT CTTCCATATA 
GAAAGACTCT TTGGTTCACA 
TTAAAAGCTT TCACGTGGGA 
TTGTCTQAAT AAACTACTCT 
ATTGGCCAGT C 



1 
I 

NTQNKLKLCS 
SRISWIISIC 
lOIISGLGYC 
YSLLFVGLLQ 
GVELOTSPXN 
GLFATLGFPA 
tELICVILLT 
AAGVYIPIQS 
RHHSGETKW 



11 

I 

KANVYTEVPD 
VFVLTPSAPL 
PSPLPTVTIL 
UlIVIFGALL 
VPTHTNLBLB 
PSLYIIPLGI 
VSLPAFTFAT 
lAGLAGPPLA 
SHRGKTLQOI 



21 
I 

G6HGHAVAVS 
ATVLSNRFGH 
SQYFGKRRSI 
RPIIIRGPAS 
PKADMQQVLV 
SLGIDQDRAA 
EFWGLMSCSI 
GLLVDQSKIY 
PBDPLEMDLA 



31 
I 

PPFVEVFTYG 

RLWMLGGLL 
VTAVASTGBC 
PKIVIQENRK 
KTSPRPSBXX 
FLLSTMAIAB 
FFGFMVGTia 
SRAFYSCAAG 
KNEHRVHVQM 



41 
I 

IIKTFGVFFN 
VSTC^AASP 
FAVFAFAPAI 
EAQYMLEHEK 
APLLDFSILK 
VPGRIGAGFV 
GLTFHCLLKH 
MALAAVCLAL 
EPV 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



SQEVSHMYVA 
MALKERIGWR 
TRTSIDSIDS 
EXSFICYALF 
UIREPIRKIY 
MSWALQKMSS 
VRPOmGLOO 



Seq ID NOi 73 DNA sequence 

Nucleic Acid Accession ft: NM_002184.1 

Coding sequence: 256.. 3012 



60 
120 
180 
240 
300 
360 
420 
480 
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1 11 21 31 41 51 

1 I I i I.I 

GAOCAGCCAA AA0GCCXX3CG GAGTG6G6CT G6GCCGCCGC O606CAGCTG AACCGGGQOC 60 

C60GGCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCQCC AAOGGGTTGO TGCGCTGTGa 120 

AGAGGOGGAO GGTCGAGGCG GCGCG6CCTQ AGTGAAACCC AAT6GAAAAA GCATGACATT 180 

TAGAAGTAGA AQACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 

AAATATC06C GCAAGATGTT GACGTTGCAG ACTTGGGTAQ TQCAAGCCTT GTTTATTTTC 300 

CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAQ TCCTGAATCT 360 

CCAGTTGTAC AACTTCATTC TAATTTCACT QCAOTTTOTO TGCTAAACGA AAAATGTATG 420 

GATTATTTTC ATGTAAATGC TAATTACATT GTCTGQAAAA CAAAOCATTT TACTATTCCT 480 

AAGGAOCAAT ATACTATCAT AAACAGAACA GCATCCA3TG TCACCTTTAC AGATATAGCT 540 

TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 600 

TATGGAATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT OAOTTGCATT 660 

GT6AACGAOG GGAAGAAAAT GAGGTGTGAO TGQGATQGTO GAAGOGAAAC ACACTT6QA0 720 

ACAAACTTCA CTTTAAAATC TQAATGGGCA ACACACAA6T TTGCTGATTG CAAAGCAAAA 780 

CXSTGACACCC CCAOCTCATQ CACTGTTGAT TATTCTACTG TGTATTTTOT CAACATTQAA 840 

GTCIt3G0TA0 AAGCAOAOAA TQCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 

CCTGTATATA AAGTGAAGCC CAATCOGCCA CATAATTTAT CA6TQATCAA CTCAGAGGAA 960 

CTGTCTAOTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 

AAATATAACA TTCAATATAG GACCAAAGAT QCCTCAACTT GGA6CCA6AT TCGTGCTOAA 1080 

GACACAGCAT CCACCCGATC TTCATTCACT QTCCAAOAOC TTAAACCTTT TACAOAATAT 1140 

GTGTTTAOGA TTC6CTGTAT GAAGGAAGAT GOTAAGGGAT ACT06AGT6A CT6GAGTQAA 1200 

GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACaVAGTTT CTGGTATAAA 1260 

ATAGATCXaVT CCCATACTCA AGGCTACAOA ACTGTACAAC TOGTGTGGAA GACATTGCCT 1320 

CCTTTTQAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 

CATTTACAAA ATTACACAQT TAATOCCAGA AAACTGACAO TAAATCTCAC AAATGATCGC 1440 

TATCTAOCAA CCCTAACACST AAGAAATCTT GTTG6CMAT CAGATGCAGC TGTTTTAACT 1500 

ATCCCTGCCT GTGACTTTCA AOCTACTCAC CXTPQTAATGQ ATCTTAAAGC ATTCCCCAAA 1560 

OATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGQAAT CTGTAAAGAA ATATATACTT 1620 

GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAO ACTGGCAACA AGAAOATGGT 1680 

AG0GT6CATC GCACCTATTT AAOAGGGAAC TTAGCA6AGA GCAAATGCTA TTTQATAACA 1740 

GTTACTCCAG TATATGCTGA T6GACCAGGA AOCCCTQAAT CCATAAAGGC ATACCTTAAA 1800 

CAAGCTCCAC CTTCCAAAGG ACCTACTGTT OGGACAAAAA AA6TAGGGAA AAAOGAAGCT 1860 

GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAQAATG GATTTATCAG AAATTATACT 1920 

ATATTTTATA QAACCATCAT TGGAAATGAA ACTGCTGTOA AT6TGGATTC TTCCCACACA 1980 

GAATATACAT TGTCCTCTTT 6ACTAGT6AC ACATTGTACA TGGTACGAAT G6CAGCATAC 2040 

ACAGATGAA6 0T6GGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA OTTTGCTCAA 2100 

GGAGAAATTQ AAGCXaVTAOT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 

GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 

CCAGATCCTT CAAAQAQTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 

TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAO TGTTGTGQAA 2340 

ATAGAA6CAA ATOACAAAAA OCCTTTTCCA GAAOATCIGA AATCATTGGA CCTGTTCAAA 2400 

AAG6AAAAAA TTAATACTGA AGGACACAGC AGTGQTATTG GGGGGTCTTC ATGCATGTCA 2460 

TCTTCTAGGC CAAGCATTTC TAGCAGTGAT QAAAATQAAT CTTCACAAAA C3VCTT0GAGC 2520 

ACTGTCCAGT ATTCTACCGT GGTACACAQT OGCTACAGAC ACCAAGTTCC GTCAGTCCAA 2580 

GTCTTCTCAA GATCOGAGTC TACCCAGCCC TTGTTAGATT CAGAGQAGGQ QGCA8AA6AT 2640 

CTAGAATTA6 TAGATCATGT AGATGQCGGT GATGGTATTT TGCCCAOGCA ACAGTACTTC 2700 

AAACAGAACT GCAGTCAGCA TQAATCCAGT CCAGATATTT CACATTTTGA AAG6TCAAA0 2760 

CAAGTTTCAT CAGTCAATGA GGAAQATTTT GTTAOACTTA AACAGCAGAT TTCAOATCAT 2820 

ATTTCACAAT CCTGTGGATC TGQGCAAATG AAAATOTTTC AGGAAGTTTC TQCAGCAOAT 2880 

GCTTTTGGTC CAGGTACTGA GOQACAAGTA OAAAOATTTQ AAACAGTTGO CATOGAOOCr 2940 

GGGACTGATO AAGGCATGCC TAAAAGTTAC TTACCACAOA CTGTAOGGCA AGGC06CTAC 3000 

ATGCCTCAGT GAAGGACTAQ TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAOTAAAO 3060 
CTAAAATGAT TTXATCTGTG AATTC 



Seq ID NO I 74 Protein sequence 
Probeln Accession #i NP_002175.1 



1 
I 

MLTLQTHWQ 
NANYIVHKIM 
ISGLPPEKPK 
SCTVDYSTVY 
KLTWTNPSIK 
GMKED6KGYW 
GKILDYBVTL 
FQATHFVMDL 
YLRGNLABSK 
QLFVDVQMQF 
KD6PEPTFTT 
SHIAQHSPRT 
TEGHSS6IGG 
ESTQPLLDSE 
NEEDPVRLKQ 
MPKBYLPOTV 



11 
I 

ALPIPLTTES 
HFTIFKEQYT 
NLSCIVNBGK 
FVNIBVWVEA 
SVZILKyNZQ 
8DW8EEASGI 
TRHKSHLC9IY 
KAPPKDNMLW 
CYLITVTPVY 
IRNVTIFYRT 
PKPAQ6BZEA 
PPRBNFNSKD 
SSCMSSSRP8 
ERPEDLQLVD 
QI8DHISQSC 
ROGGYMPQ 



21 
1 

TGELLOPOGY 
IIHRTASSVT 
KMRCEWDQGR 
QIALGKVTSD 
YRTKDASTWS 
TYGDRPSKAP 
TVNATKLTVN 
VEWTTPRESV 
ADGPGSPESI 
ZZGNETAVNV 
IWPVCLAFL 
QNYSDCNFTD 
ISSSOEtTBSS 
HVDGGDGILP 
GSGQMKMFQB 



31 
I 

ISPESPWQl. 
PTDIASUJIQ 
ETHLBTMFTL 
HINFDPVYKV 
QIFPEDflAST 
SFWYKZOPSH 
LINDRYIiATL 
KKYIIiEMCVL 
KAYLKQAFPS 
DSSKTEYTL9 
LTTLLGVLFC 
VSWEZEAND 
QNTSSTVQYS 
RQQYFKQNCS 
VSAADAFGPG 



41 
I 

H8NFTAVCVL 
LTCNIIiTFGQ 
KSEWATHKFA 
KPNPPHNLSV 
RSSFTVQDLK 
TQOySTVQLV 
TVRNLVGKSD 
SDKAPCZTDH 
KQPTVRTKKV 
8LTSDTLYMV 
FNKRDLXXKH 
KKPFPEDUCS 
TWHSGYRHQ 
QHBSSPDISH 
TEGC2VERFET 



51 
I 

KEKCMDYFBV 
LBQNVYOXTl 
DOCAKRDTPT 
INSEEI*SSIL 
PPTBYVPRZR 
NXTLPPFEAN 
AAVLTZPACD 
QQEDGTVHRT 
GKNEAVLEWD 
RMAAYTDBGO 
ZWPNVPDPSK 
LDliFKKEKIN 
VPSVQVPSRS 
PERSKQVSSV 
VGMEAATDEG 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 



Seq ID NO: 75 DMA sequence 

Nucleic Acid Accession #: MMj022131 

Coding sequence: 11.. 2878 ~ 

1 11 21 31 41 51 

I I I I I I 

TGCIGOGAGQ ATGCTGCCTG GGGGGCTGT6 CTG0GT6C0G CTCCTGCrGG 

978 
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GGGGAOOOOC A G0 6 G CQGTG OOGGGGACAQ CCGGCRGCGC OGCCTCCTOQ OGGCTAAAGT 120 

CRATAAGCAC AAGCCATGGA TCGAGACTTC ATATC3VTGGA OTCATAACTG AGAACAATGA 180 

CACAOTCATT TTGQACCCAC C31CTGGTAGC CCTGOATAAA GATQCACOGG TTCCTTTTGC 240 

AGGGGAAATC TGTGCGTTCA AGATCCATGO CCAGGAGCTO CCCTTTGAQG CTGTGGTGCT 300 

5 CAAC3Utf3ACA TCAGGAGAGO GCOSGCTCOO TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 

OAAOQAGTAC ACATTCATCA T0CAG6CCTA TGACTGTOOT GCTGGGCCCC AOGAGACAGC 420 

CT6GAAAAA0 TCACACAAGO C09T6GTCCA TATACAGOTO AAGGATGTCA ACGAGTTTGC 480 

TCCCACCTTC AAAOAGCCAG CCTACAAGGC TGTT0TGA06 GAGGGCAAGA TCTATGACAG 540 

_ CATTCTGCAG GTGGAGGCCA TTGAOGAGGA CTGCTCCCCA CAGTACAGCC AOATCTGCAA 600 

10 CTATGAAATC GTCACCACA6 ATGTGCCTTT TGCCATOQAC AGAAATGGCA ACATCAGGAA 660 

CACTGAGAAG CT6AGCTATG ACAAACAACA OCAGTATGAG ATCCTGGTGA COGCC TAOB A 720 

CTGTGGACAG AAGCCO G CTG CTCAGGACAC CCTGGTOCAG GTGGATGTGA AGCCAGTTTG 780 

CAAGCCTGOC TGGCAA6ACT GGACCAAGAG GATTGAGTAC CAGCCTG6CT CC6GGAGCAT 840 

GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGOGATGGA GCCQTOTCTT CCCTCCAGAT 900 

IS OQTCACAQAO CTQCAGACTA ATTACATTGG GAAGGGTtGT GACCGGGAGA CCTACTCTGA 960 

GAAATCOCTT CAGAAGTTAT GTG6AGCCTC CTCTGGCATC ATTQACCTCT T6CCATCGCC 1020 

TAGaSCTGGC ACCAACTGQA CTGCAGGACT GCTGGTGGAC AGCACnOAGA TOATCTTCAA 1080 

GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAOA ACCTGACCGA 1140 

TCAGTTCACC ATCACCATGT GGATOAAACA OGGCCCCAGC CCTGGTOTaA GAGCCGAQAA 1200 

20 GGAAACGATC CTCTGCAACT CAGACAAAAC 0GAAAT6AAC 06GCATCACT ATX^CCCTGTA 1260 

TGTGCACAAC TGCOGCCTOG ' Ifn ' TCA ' Cl ' l G0GGAA6GAC TTOGACCAGG CTOACACCTT 1330 

TGGCCCOGOG GAGTTCCACT GGAAGCTGGA TCA6ATTTGT GACAAAGAGT GGCACTACTA 1380 

TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 

ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCXZATGC AACTCACAGT ISOO 

25 CX30CX3CTT6T TGGCAAGGAG GAGAAGTCAC CAAAGCACAG TTTGCTCA6T TCTTTCATGG 1S60 

AAGCCTGGGC AGTCTCACXSV TCOQCOCTGQ CAAAATGGAA AGCCAOAAGG TQATCTCCTG 1620 

CCT6CAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTQ GAAAGCXmO 6CGAAGGAAT 1680 

AAAGTATCAC TTCAACCCCT OGCAGTCCAT CCTGGTGATG GAAGGTGACO ACATTGGGAA 1740 

CATTAACOGT GCTCTCCAQA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAAOGGCGGQ IB 00 

30 TGTGOSGOGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACO TATGCATCAO 1860 

TATCGCTGAO GTAQATQCCT ATGTGATGGT CCTGCAGGCC ATGQAGCCCC GGATCACGCT 1920 

C06GG6CACA 6ACCACTTCT GGAGACCT6C TGCGGAGTTT GAAAGTGCXS^ GGGQAGTGAC 1980 

CCTCTTCCCT QATATCAAGA TTOTQAGCAC CTTC6CCAAA ACOGAAGCGC COGGGGACGT 2040 

GAAAACCACA QACCXXSVAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 

35 TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG 6AGTGCTTGG AGCTCAACCA 2160 

CAGTGAGCTC CACCAACGAC ACCTG6ATGC CACXAATTCT ACTGCAGGCT ACTCCATCTA 3220 

CGGTGTGG6C TCCATGAGCC GCTAT6AGCA GOTGCTACAT CACATCCGCT ACOGCAACTG 2280 

GCX3TCCGGCT TCCCTTGAGG CCOGGC3GTTT COGQATTAAO TGCTCAQAAC TCAATGGGOG 2340 

CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAQACC AAGTCTCAQA 2400 

40 TAAGQAGCAT GTCAATCATC TQATTOTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 

TOAOTCCOGO AGTAGCATCC AGCACAGTTC AGTGQTCCCA AGCATTGCCA CAGTGOrCAT 3520 

CATC3VTCT0C GT6TGCAT6C 'nX i 'm'mxyi' OGTGQCCATG GGTGTGTACC OQGTCOGGAT 3580 

03CCCACCAG CACTTCATCC AGGAQACTGA GGCTGCCAAG GAATCTGAGA TGGACTGGGA 2640 

OGATTCTGOG CTGACTATCA CAGTCAACCC CATGGAQAAA CATGAAGGAC CAQGGCATGO 2700 

45 GGAAGATGAO ACTGAG6GAG AAGAGGAGQA AGAAGCGQAG GAAQAAATGA GCTCCAGCAG 2760 

TOGCTCTGAC GACAGOGAAS AGGAG6AGGA GGAGGAAG6G ATGGGCAGA6 6CAGACATGG 3630 

GCAGAATGGA GCCAGGCAAG GCCAGCTGGA GT6G6AT6AC TCCAGCXTTCC CCTACTAGTG 2880 

CCCAGGGOTC TGCTGCCTGQ CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 

CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCyVGCT 3000 

50 TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTOT TTCATCCATG GGQAAGTTCC 3060 

AAGAAGCCCA GCATG6CCAT CAGTGAGGAC TTCAGGGXAO ACTTTGTGCT OTAGOCTCCA 3120 

CTTCTGCCCT AAGTTCCCCA GCATCCXGAC TACCTOTCTO C31GAGTTT6C CTTTGTTTTT 3180 

TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT C3VCTCACCTC CCCAGGCTCA' GAGTCCCCAA 3240 

GGCCCTGGGG TTCCAACTCA CTGTGCOTCT CCTCXACACA GACCAGTAGG TTCTCCTATG 3300 

55 CTGACTCCAO GTT6CTTCAT ACAAGGAGGQ TGGTTGAACT TCACACAC3GT AA6GTCTTA0 3360 

TGCTTAACAa TTTAAAGGAA AGTCCTTGTT GAG6GAGAAC TAAOTTTACA GGGAAAGGTA 3430 

CACACATTCr CTCTCTCTCT CTCTCTCTOT CTATCTAGTT OCCCAGCTTO GAGAGCCTTT 3480 

CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAOAATAGG 3540 

TCCTTGGCCA CAAGCSUSGGT CTGATCCCCC ATCAGAGCTA TCTQAGCCTG CCTGTCTGGG 3600 

60 CACCTGCTGC AAOCATGCAO CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 

CCAGGAGGCA TTGCACACAa GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 

GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TA6AGCCAGA 3780 

AAGCAGATGG AAATGCTAAT GAGGTCAAAC OTAQGCTTCA TGGTGGGTOG AGTGGGGGTQ 3840 

GCTGGGCTCC CCCAGGACAG AGGGGACXXTT QAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 

65 TATGGTCCCT TATCTCCTAT CTTCCXrrcrr QA6AAAATAC AOQCTTTCTO CATGTATTAG 3960 

AAA0QCAO2A 6CTCGA0CAA GTCTACAATO AAAGTTTOAA ATTTAACT6C AAOQAATTAG 4030 

AAOCATATTT 6CAATCATTG C»GCTTCTTC TTTCTTCTOC TCATAAAA6G AGGAACACTT 4080 

TAGATAGAGQ GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 

TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAQAOAA TTGTTTCTAT AGTAACTGGT 4200 
70 CTGTGATCTT TTOTQGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260 
AOGAAGCTCT QAGAACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TOWTGIGTA 4320 
TGCTCTGTCC OCATOCTTCA CTCCTCCTCA AGCICACAOC AATTGGTTTO GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAO TGGCATTTAT GTTAAAAAAA A 

75 Seq ID NOs 76 Protein sequence 
Protein Accession #i NP_071414 

1 11 21 31 41 51 

on 1 I 1 ' ' • 

OU MLPGRLCWVP LLLALOVGSQ SOGQGDSRQR RliLAAKVmoi KPWZETSYB6 VXTENNX>TVI 60 
IiOPPLVALDK DAPVPPAGBX CAFKIHGQBL PFBAWUIKT SOBGRLRARS PXDCBI^KEY 130 
TFIZQAYDOQ AOPaBTAHKK SHKAWHIQV KDVNEPAPTF KBPAYKAWT EGKIVDSILQ 180 
VEAIDEDC8P QYSQXCNYBI VTTOVPPAID RNOIIRIITBK LSyDRQBQyB ZLVTAYDOGQ 340 
KPAAQUTLVQ VDVKPVCKPG WQDWTKRIBY QPGSGSMPtiP PSIHLBTCOG AV68LQIVTB 300 



979 
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10 



LQTNYIGKGC DRETYSERSL QKLCGASSGI lOLLPSPSAA TNWTAGLLVD SS94IFKFDG 360 

RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEKN KHHYALYVHN 420 

CRLVPLUIKD FDQADTPRPA BPHHKLDQIC DKEHHYYVIN VBPPWTLYM DGATYBPYLV 480 

TNDWPIHPSH lAMQLTVGAC WQGGHVTKPQ PAQPFHOSLA SLTIRPGKME SQKVISCtiQA 540 

CKBGLDINSL ESLGQGIKYH PNPSQSILVM EGDDIGNINR AZiQlCVSYINS RQFPTAOVKR 600 

LKVSSKVQCF GEDVCISIPE VDAYVMVLOA IBPRITLRGT DHFWRPAAQF ESARGVTIaFP 660 

DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGQDLDPRQ ECLEUmSEL 720 

HQRHLDATMS TAGYSIYGVQ SMSRYEOVLH HIRYRNWRPA SI.EARRPRIX CSEXJ^GRYTS 780 

NBFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSWP SIATWIIIS 840 

VCMLVFWAM GVYRVRIABQ HFIQBTBAAK BSEMDWDDSA LTITVKPMEK HEGPGHQEDE 900 
TEGBEEBEAB EEMSSSSGSD DSBEEBEBBG KGRGRBGQUO ARQAQLSHDD STLFY 



Seq ID NO: 77 DMA sequence 
Nucleic Acid Accession tfs Eos sequence 
13 coding sequence: 4B2..3007 

1 11 21 31 41 51 

I I I I I t 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AQQAAGGAQA ACATTGCTGC AGCTTGGATC 60 

20 TACAACCTAA GAAAGCAAGA GTGATCAATC TGAGCTCTGT TAAACATCTT GTTTACTTAC 120 

T6CATTCAGC AGCTT6CAAA TGGTTAACTA TAT6CAAAAA ASTCAGCATA GCTGTGAAOT 180 

ATGCCGTGAA TTTTAATTGA GGGAAAAAGO GACAATTQCT TCAGGATGCT CTAGTATGCA 240 

CTCTGCTTGA AATATTTTCA ATGAAAT6CT GAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 

CTTTATGAAQ CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAOTGGTT 360 

Z5 GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 

ATTACAOATA AOOTCAOQAA GAOTCTCTGC TTTACAGAAT GGQATTTCAT CACAT8ACAA 480 

CAT6AAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT 6CCTGTATAT CTTTACACTC 540 

CCAAACTCCA GTGCTCTCAT CX^AGAGGCTC TTGTGATTCT CTTTGCAATT GTOAQGAAAA 600 

AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGQTAT CTQAAATAAG 660 

30 TOTGCCACCA TCA06ACX:TT TCCAACTAAG CTTATTAAAT AAGGGCTTQA OSATGCTTGA 720 

CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CAOCTTGGAT TTAACAATAT 780 

TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCXC CTQAAACAAC TTCATATCAA 840 

TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 

CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTQCCTTTA GCAAGCTCAA 960 

35 CAGACTCAAA GXGTTAATTT TAAATGACAA TOCTATTGAQ AOTCTTCCTC CAAACATCTT 1020 

GGGATTTGTT CCTTTAACCC ATCTAGATCT TCQTGGAAAT CAATTACAAA CAT1GCCTTA 1080 

ixj'rmGYrrr ctogaacaca ttggcx^qaat attggatctt caqttoqagg acaacaaatg ii4o 

GGCCrercAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 

TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAOTA TACTCAGTAG 1260 

40 ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCA6TGTAT 6AAGAACATQ AOQATCCTTC 1320 

AGGATCATTA CATCTGGCA6 CftACATCTTC AATAAATQAT AGTOQCftTGT CAACTAAGAC 1380 

CACGTCCATT CTAAAACTAC CCACCAAAOC ACCAGQTTTO ATACCTTATA TTACMVA6CC 1440 

ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 

ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAQ ATCTQAQACC 1560 

45 TCCTCCGCAA AATCCTAOAA AGCTCATTCT AGCX3GGAAAT ATTATTCACA GTTTAATGAA 1620 

GTCTGATCTA GTGGAATATT TCACT7TGGA AATGCTTCAC TTGGGAAACA ATGGTATTGA 1680 

AGTTCTTGAA GAAC3GATOQT TTATGAACCT AAOGAGATTA CAAAAACTCT ATCTAAATGG 1740 

TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 

ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCXTTAA 1860 

50 ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 

AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAO TTTAOCCATC TAOCTGTAAO 1980 

TAATATTTT6 GATGATCTTG ATTTACTAAC GCAGATTGAC CTTGAGGATA ACCCCTG66A 2040 

CTGCTCCT6T GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGGA AGAACACAGT 2100 

GACAGATOAC ATCCTCTQCA CTTCCCXrCXSa QCATCTCQAC AAAAAQQAAT TOAAAGCCCT 2160 

55 AAATAGTGAA ATTGTCTGTC CAG6TTTAGT AAATAACCCA TCCAT6GCAA CACAGACTAG 2220 

TTAOCTTATG QTCAOCACTC CTQCAAOVAC AACAAATA08 GCTGATACTA TTTXAOGATC 2280 

TCTTAGGQAC 6CTGTGCCAC TGTCTSTTCT AATATTGG6A CTTCTGATTA TGTTCATCAC 2340 

TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTQTTCTT CACOQCAGGA GAAQATACAA 2400 

AAAGAAACAA GTAGATGAGC AAATGAGAGA CAAC3VGTCCT GTGCATCTTC AGTACAGCAT 2460 

OO GTATGGCXaVT AAAACCACTC ATCACACTAC TGAAAGACX» TCTGCCTCAC TCTATGAACA 2520 

GCACATGGTO AGCCOCATGG TTCATGTCrA TAGAAGTOCA TCCTTTGGTC CAAA6CATCT 2580 

GQAAGAGGAA GAA6AGAGGA ATGAGAAAGA AGGAAGTQAT 6CAAAACATC TGCAAABAAO 2640 

TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATOAAAT ACAAAACCAC 2700 

GAACCAATCA ACAQAATTTT TATCCTTCCA AQATGCCAGC TCATTGTACA GAAACATTTT 2760 

65 AGAAAAAGAA AGGGAACTTC AOCAACTOQQ AATGACAGAA TACCTAAGGA AAAACATTGC 2820 

TCAGCTCCAG CCTGATATOO A66CACATTA TCCTGGAGCC GA06AAGAGC TGAAGTTAAT 2880 

GGAAACATTA ATGTACTCAC GTCCAAGGAA G0TATTAGT6 6AACA0ACAA AAAAT6AGTA 2940 

TTTTQAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 

AACATAGATG GAGAGTTTQA GGGCTTTCGC AGAAATGCTG T6ATTCTGTT TTAAGTCCAT 3060 

70 ACCTTGTAAA TAAGTGCCTT ACGTGAGTOT GTGATCAATC AGAACCTAAO CACAGCAOTA 3120 

AACTATGGGG AAAAAAAAAG AAG AAGAA AA GAAACTCAGO G ATCACIGG Q AQAAGCCATQ 3180 
GCATTATCTT CAGGCAATTT AGTCTOTCCC AAATAAAATC AATGCTTGCA T6TAAATC 

Seq ID NO* 78 Protein sequence 
75 Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I 1 i I I I 

0^ MKLHXULFYS SLLACISl>HS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA KGIKMVSEIS 60 

oO VPPSRPFQLS LUINGLTMLH INDPSGLTNA I8ZHLGFNNZ ADIEIGAFNG U3LLKQUIIN 120 

BNSLEIUCBD TFKGI*BNLEF LQASNNFITV IBPSAFSXZJI RLKVLILNDN AIBSLPPNIP 180 

RPVPLTHLDL HGKQUTrUBY VGPLEKIGRI LDLQLSDNKN AOrCDLLQIiR TWLENMPPQS 240 

ZIGDWOfSF PFFKGSIX«8R LKKESICPTP FVYSBHEDPS OSLRLAATSS XNDSRHSTKT 300 

T8ILKLPTKA PGLZPYZTICP STQLPGPYCP ZPCNCKVL8P SGIiLlHOQER NZBSLSDLRP 360 

980. 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



PPQMPRKLIL 
NHLTKLSKGM 
GVPLTKVNLK 
TDDILCTSPG 
LTDAVPLSVL 

LLEQENBSPL 
QLQPDMBAHY 
T 



AGNIIKSLMK 
FLOLHNLBYL 
TNQFTHLPVS 
HLDKKBLKAL 
IU3LLIKFIT 
ERPSASLYEQ 
TOSNMimCTT 
PGAHEELKLM 



SDLVBYFTLB 
YLSYNAIKEI 
NILDDLDLLT 
NSBILCPQLV 
IVFCAAGIW 
HMVSPMVHVY 
NQ9TEFLSFQ 
BTLMYSRPRK 



MLHLGmiBIB 
LPGTFNPMPK 
QIDLEDNPWD 
NNPSKPTQTS 
LVLHRRRRYR 
RSPSFGPKHL 
DASSLYRNIL 
VLVBQTKNBY 



VLEBGSFKML 
LKVLYLNNNL 
CSCDLVGIiQO 
YI^IVTTPATT 
KKQVDECyiRD 



EKERBLQQLQ 
FBLKAMLKAB 



TRIiQKLYLNG 420 

LQVLPPHIFS 480 

WIQKLSKNTV S40 

TNTADTILRS 600 

KSPVHLQYSM 660 

GSDAXEbQRS 720 

ITEYLRKNIA 780 

PDYIiEVLBQQ 840 



Seq ID NOt 79 DMA aequence 

Nucleic Acid Accession ft: im_016640.2 

Coding sequence! 39.. 1358 " 



1 

I 

GCTTAAGTTG 
GGCCTTTGCT 
CTACAGAAAC 
TGGCCTCCAT 
C6ACGGTGCA 
TTATGAAGTA 
ACTTCACCAA 
AGCCOGAACC 
ACTGCCT6CT 
A0AQCQA6GT 
TCAQCXX»CA 
TTTACTGGGT 
ACTTGCGATA 
CAGAGTTTQT 
CAQACAAACT 
CTGCAGATCC 
GGGAAA6GCT 
CTATTGCAAG 
AAGCAGATOT 
CCTTTTTCTO 
CTCGTAAAAA 
ATGATGTGAA 
GACCAAAAGA 
TGTGGGAATA 
ATTAAATACA 



11 
I 

ACCTCTGGGT 
ACOCQQTCCG 
QACXTCCCAA 
GACAGCGGAC 
GQCTGOOQAO 
CATGGTTTAC 
GACCXrrOTTC 
OGAACCTGAA 
GCAGG AGCAC 
CATATCTTTO 
CAACCCGGGC 
GCOTG6TGAA 
CCAQATAGAT 
GCCATTGGAT 
TCCATTATTC 
TT G CTGT T AC 
TTTGAQACAA 
CCTTTTTGCT 
TACTGGACCT 
CTACCAGCTA 
TATATGTTOQ 
AGOTTTTAAT 
AGAAAAATCA 
TTTAAATTTT 
TTGATTTTTG 



21 
I 

C06GAATC6C 
AQGCTTTCAT 
0ACGT06CGG 
AGCAAAGCTG 
TGS0TA6A0G 

CTGTCGGGTC 
CXrrGCGCTGG 
TTCTACCTGC 

cccrrocTGO 

CTGOCGGCTG 
QAAATTATTC 
GATAAACCAA 
TATTCTOTTC 
AAA0GS6GAGT 
GOTCACACCC 
AACTGTGCTG 
TGGACTGGAG 
TTTGTCTCOC 
AATACTTTGG 
GGTACACAAA 
GATGATGTTC 
CAGCTGTTGG 
ACTGAAG6AA 



31 
I 

GGGCAAAGAT 
TGCACACCGC 
06ACCCCC6T 
CA06GCTG0G 
AGAA6CTGCG 
TOGOQCTGAA 
TGCOOCCGCC 
ACCTOGGGGC 
GG06GAGGCG 
ATCAGCTGGT 
C0GCCCF06A 
CTCGTGGTCA 
ACAACCAGAT 
CTATAGAAAT 
ATGAAAAGCA 
AOTTTCATCT 
ATCAGATAGA 
CACAAGCTAT 
AGGCTGTQAT 
CACTGACXAC 
GTAAGCCTCT 
TACTTCAGAT 
AAAACTGAAA 
CAATAATGAT 
AAAAAAAAAA 



41 

i 

G6G6G0GGCC 
GGCTAATGCC 
OGCGOGGTAC 
GGGGATOSAG 
AATOCTGAGC 
TGGGQACGGC 
CCCAG0G6AG 
GCTGCGTGCX3 
6CGCGTGCAC 
GTCAAOCCTC 
TTATAQATGC 
TCGAAGAGGT 
TCQAATATCC 
CCCCACTATA 
CATA7TTGTT 
GTTACCT6AC 
AOTTGTTTTT 
GTATCAAQGA 
CACA6ATGGA 
ACAAOCTGAT 
TTATGAAACA 
AGTTCACTTT 
AAGCATATTT 
GAGATTTGTA 
AA 




CCAGTTCATT 
OGAATTGATG 
AAGCAACTCO 
AAATGTAAAC 
GGCTCAAAAA 
AAATTAAGAA 
AGAGCTAATQ 
TTCTGQAGTO 
AAATACTTTT 
CAAAATAACC 
ATTGAGGATA 
CTACTGAATA 
GATTGAGAAC 
ACT6TCAACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
lOBO 
1140 
1200 
1260 
1320 
1380 
1440 



Geq ID NOi 80 Protein sequence 
Protein Accession #< KP 057724.1 



MAAARCWRPL 
RRIERHQATV 
PPAEPEPEPB 
VSTLVGLLSP 
IRI8KQLAEF 
LLPDXLRRER 
ITDGKYFSPP 
XVRPLLNRPK 



11 
! 

LRGPRLSLHT 
HAABSVDEKL 
PBPBPALDLA 
HMPALAAAAL 
VPLDYSVPIB 
LLRQNCADQX 
CYQIjNTLALT 
EEKSQLLEN 



21 

i 

AANAAATATE 
RILTimQFMK 
ALRAVACDCL 
DYRCPYHFYW 
XPTIKCXUWK 
EWFRANAZA 
TQADQNNPRK 



31 
I 

TTSQDVAATP 
YMVYPQTPAL 
LQEHFYIiRRR 
VRQBEIIPRa 
LPLFKRQYEN 
SXiFAWTOAQA 
NICWGTQSKP 



41 

I 

VARYPPIVAS 
NADRWYQYFT 
RRVHRYBESE 
HRRGRIDDLR 
HIFVOSKTAD 
MYOGFWSEAD 
LYETIEDNDV 



51 

I 

MTADSKAARL 
KXVFLSGIiPP 
VISLFFLDQL 
YQXDDKPMMQ 
PCCY6HTQFK 
VTRPFVSC2AV 



6eq ID HOi 61 DMA sequence 

Nucleic Acid Accession #i fgenesh predicted 

coding sequence: 1..2070 



1 

I 

AT0AGC6GTG 
CGGC6C0GGC 
CTCAGQCAGA 
CAGCAGCAGC 
GAAAACAAOG 
ACACTGCOGC 
GGGGGAACAC 
CCTGTATGOC 
AOCOGTGGCT 
GG0CCT6AG0 
CCAAGTAGAG 
CCTCAGATTG 
AT6CTGGGGG 
GCA GCAACC A 
OCACTTCCCC 
GCTCACTTCC 
TGGA6CCAGC 
GACATGGAOA 
CTQTTCTGGG 
GACAOGACAC 
CCCTOCTGCT 



11 
I 

OGGGGGTGGC 
GCCAGCGCCC 
GC6AGC06CA 
ACTOGGAGAT 
GT6AGC0GGC 
TCCOGCAGCA 
AGQAC6GGQA 
AACCCAST6G 
GQAGGATGTT 
TCATTGCAGG 
CTGAAATGGG 
CTGCTGTGGC 
CCCAGGGGAT 
TGGGQACAAA 
ATCCTQACAfi 
CATTATCTTT 
CTGGGAACAT 
AGGGGGTTGA 
CAAAaTGTGG 
GOGAAGAOGC 
TTCCAGATGG 
GGGTCTGCAT 



21 
1 

GGCTGGGACG 
CTCTGTGGGC 
GAAAGGQAAC 
GCTGGGCAAG 
GCQGGQCGCT 
CAGAAACACA 
OCCCCTCCAG 
GTACA6GTTC 
ATGCAGCCAA 
OGGGCAOGTG 
AAGGAACCCC 
CAGGCCGAGG 
ATGGACACAC 
GOGAGGAAGC 
OGQCCCCCAC 
GGGGCTGGGG 
0GCA6CTGGG 
GGGAGGGCCC 
CCCAAOTGGO 
CATGCTTTCC 
CCOCTCAGaA 
CAAaSGAOTG 



31 

I 

C6GCCCCCCA 
GTCCAGTCCT* 
CTGGACCTGG 
CTCCATGAGG 
AGGCGGGCCC 
GCCATCAACT 
ACTGTCCTTG 
TGGGG6ACCT 
GCACAG CAOO 
GCCACAGGGT 
TGGGACAGCC 
ATTTCCAQCC 
TCCATCCAGG 
AGAQTOCXGT 
CCAQ0CCAG6 
CTGACATCAG 
GCAGTGCCTA 
TTCCCTAGCC 
CAGCCCCAGC 
CTGGGGACCT 
AACCACCTTT 
T6GGTA6AGC 



41 

I 

GCTCGC06AC 
TGAGGCGOCA 
AGAAAAGCCT 
AGATOSAGCA 
TGGCTCGCCA 
CCAGCACACG 
CCCACCTGGC 
GGACAGATGC 
TQCTQCTCTC 
OCTCCCCAGA 
CCTGCCCTGC 
CTATGOCTCT 
GATCCCTTCC 
TTCCTTCCCA 
ATCCTGGGCT 
GAGGACATCT 
GGOCTCTCCC 
GCTGTGGCAA 
CCTGCAGTGC 
GCTGTTCCAT 
CCAG6GCCTC 
OGGGAGGAGC 



60 
120 
180 
240 
300 
360 
420 



51 
I 

CCCGGGCTCT 
GAGCCCGCAG 
GCAGTTCCTG 
TCTQAAOCXSG 
GGCACACTCA 
CCTGGGCTCA 
TGCACTGGCC 
CGCTACCTCT 
GGGAAGCCCA 
CCrCCCTCCT 
TAGATCTTTQ 
GAGTCCTCAC 
T6CCATCTGG 
CTTGTCCAAO 
GTGGTCTCAA 
GACTGGTGGA 
TTCCCAGOGA 
CTCCAGTGA6 
TQOQGAOGCT 
GTGTCCCAAG 
TGCTCCCTTG 
CA6CCCTQCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



981 



wo 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



AGGCTGA AGG 
OGOQOTAGGG 
TCTGTCAAGT 
TCCTTCAACA 
CCCCTACTTC 
GAGAAAGCAO 
AGGCAGATQG 
ACCACACTTA 
ACCCAAGAGC 
COSGAGGAAG 
AAGA6CCTCT 
CTGAAGCAGA 
AAAOGGCGCC 



AOGGCTCCTC 
006ACACTOT 
CCATCTCTAA 
AGCAAGATTC 
ACAACA6CAA 
AG6CCTCTAA 
GGG0GG6GGC 
GGCAOTGOGA 
TGCGGCACCT 
CTAGCTTTCC 
CCAAGAAATG 
CCCC6AAGAA 
TGCATOGCTC 



AOGGACACAC 
OCGCTCTCCT 
TTCAGCCAAC 
AAAAGCTGAC 
6CTGGACAAA 
TGCAGGAGCT 
ACACCCCCCA 
AGTGCTCATC 
CAAGTCCCTC 
CAGGGACCAA 
CCFQAGCCCA 
CAACTTTGCC 
AGT6CTTTGA 



AOGCCAGGAG 
GOUSACAGCC 
TCTCAAGGCA 
0TCTCCCA6A 
GTTCCTGGGG 
GCCTQTATGG 
ATGAT0CT6C 
OGCGAGCTGT 
CTGGAAGGGA 
GAAGCCAC36C 
CCTGTGGOGO 
QAGAG6CAGA 



GCAAGOGTGO 
TCTCCATGTC 
AGGCCAGGCC 
AGGOGGACCT 
TACAAGGGCA 
OGAACAGCCA 
CCCTTCCCCT 
GGAATACCAA 
GCCAGA66CC 
ATTTCCCCAA 
A00QT6CCAT 
A6A6GCTGCA 



GOGTCTTGGG 
AAGCTTCCAO 
CCAOCCXXJGC 
GGAAGAGGAG 
G6CCAGAAAG 
GCACCAGGGC 
GCGAAAGCCC 
CCTCCTGCAG 
CCAGGCAGCC 
GGTCTGCACC 
CCXGGOOGCA 
GGCAAT6CAG 



1380 
1440 

ISOO 
1S60 
1620 
1680 
1740 
1800 
1B60 
1920 
1960 
2040 



Seq ID KO: 82 Protein sequence 
Protein Accession #i FGENSSH predicted 



1 
I 

MSGAGVAAGT 
QQQ^SEHbAK 
GGTQD6EPLQ 
GPEVIAGRQV 
MLGAQGIHTH 
AHFPLSLGIiQ 
LFVIAKCGPSR 
GARWVCINQV 
SVKSISNSAN 
EKABASNAGA 
TQELRHLKSL 
LKQTPXMMFA 



11 
I 

RPPSSPTPGS 
LHSBIEHLKR 
TVLAHLAALA 
ATGCSPDLPP 
SIQGSLPAIW 
LTSGCSILTGG 
QPQPCSAGDA 
WVEPG6PSPA 
SQGKARPQPG 
ACMGNSQHQG 
LEGSQRPQAA 
ERQKRLQAMQ 



21 
I 

RRRRQRPSVG 
B17KGEPARGP 
PVCQPSGYRF 
PSRAEMGRNP 
AATMGTKGGS 
HSQPGNIAAG 
DRTREBAMLS 



SFNKQDSKAD 
RQMGAGAHPP 
PEEASFPRDQ 
KRRLHRfiVli 



31 
I 

VQSLRPQSPQ 
RPAIiPPQAHS 
WGTWTDAATS 
WD9PCPARSL 
RVLPPCHLSK 
AVPRALPSQG 
LGTGCSKCFK 
RPGGKRGRLA 
VSQKADLEEE 
MILPLPLRKP 
BATHPPKVST 



41 
I 

LRQSDPQKRN 
TLPLPQHRNT 
SRGWTMIfCSQ 
PQIAAVARPR 
ALPHPDSGPK 
DMSKGVE6GP 



GGSADTVRSP 
PLLHKSKIiDK 
TTLRQCEVLI 
KSLSKKCLSP 



Seq ID NO: 83 DMA sequence 

Nucleic Acid Accession #: NM_005264.1 

coding sequence : 5S7 . . 1954 



51 

i 

LDLBKSLQFXi 
AINS8TRLGS 
AQHVLLSGSP 
XSSPMALSPU 
PAQDPGLHSQ 
FPSRGGNSSB 
NRLSRASAPL 
ADSLSMSSFQ 
VPGVQGQARK 
RELWNTNIiLQ 
PVAERAILPA 



51 



1 11 21 31 41 

I i 1 I I I 

GAATTCCGGC CA6AAGAAAT CTGGCCTCGG AACACQCCAT TCTCCGOOCC GCTTCCAATA 
ACCACTAACA TCCCTAACGA GCATCCGAGC 06AOGGCTCT GCTGGGAAAT OGTCCTGGCC 
CAACTCGGCC CTTOSAGCTC TCOAAGATTA CCGCATCTAT TTTTTTTTTC TTmPTTTCT 
TTTCCTAGOG CAGATAAAOT GAGCCCQGAA AGGGAAGGAG GGGGGGGGGA CACCATTGCC 
CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG COGCAGCTGG ACGCGGTCGG 
TTQAGTCCAG GTTGGGTOGO ACCTGAACCC CTAAAAGOGO AACCGCCTCC CGCCCTCGCC 
ATCCCGOAGC TGA6TGGCCG GOG6CX3GTGQ CTGCTGCCAG ACCOGGAGTT TCCTCTTTCA 
CTGGATGGAG CTGAACTTTG GGCGGGCAGA GCAGCACAGC TGTCOGGGQA TCGCTGCACG 
CTOAGCTCCC TOSGCAAGAC CCAGCGGCGG CTCOGGATTT TTTTGGGGGG 6CGGGGACCA 
GCCCOGCGCC GGCACCATGT TCCTGGOQAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 
GCTCCTGTCG GCCGAAGTGA GCGQCGGAGA CCOCCTGGAT TQCQTQAAAO CCAGTGATCA 
GTGCCTGAA6 GAGCAGAGCT GCAGCACCAA GTACCGCA06 CTAAGGCAGT GOGTGGCGGG 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCX3CAGCGC 
CATGGAGGCC CTGAAGCAGA AGTCGCTCTA CAACTGCC6C TGCAAGCGGG GTATGAA6AA 
GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATOTACCAG AGCCTGCAGG GAAATGATCT 
GCTGQAGGAT TCCCCATATG AACCAGTTAA CAGCAQATTG TCAGATATAT TCCGGGTGGT 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 
GGATOCAGCG AAOGCCTGCA ACCTCGAOGA CATTTGCAAG AAGTACAGGT CGGOGTACAT 
CACCCCGTGC ACCACCA60G TGTCCAAGGA TGTCTGCAAC OGCCGOUVGT GCCAGAAGGC 
CCTCOGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 
CTGCCGGGAC ATCGCCTGCA CAGAGCGQAG GOGACAGACC ATCGTGCCTG TGTGCTCCTA 
TGAAGAGAGG GAGAAGCCCA ACIGITTGAA TTTGCA6GAC TCCTGCAAaA COAATTACAT 
CTGCAGATCT CGCCTTGOGG ATTTTTTTAC CAACTGCCAG CCAGAOTCAA GGTCTGTCAG 
CAGCTGTCTA AAGGAAAACT AOGCTGACTG CCTCCTCGCC TACTOGGGGC TTATTGGCAC 
AOTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAOT GTGGCCCCAT GGTGTGACTG 
CAGGAACAGT GGGAAOGACC TAGAAGAGTO CTTGAAATTT TTGAATTTCT TCAAOGACAA 
TACATOTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCQTGTGGCA 
GCCAGCCTTC CCAGTACAGA CCACCACTOC CACTACCACC ACTGCCCTCC GGGTTAAOAA 
CAAGCCCCTG GG6CCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TQCCACCGTG 
TGCAAATTTA CAGGCACAGA A6CTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 
TTCCAATGGT AATTATGAAA AAGAAGGTCT GGGTGCTTCC A6CCACATAA CCACAAAATC 
AATOGCTGCT CCTCCAAGCT O TO Q TCTGAG CCCACTGCTG QTCCTGOTGG TAACOGCTCT 
GTCCACCCTA TTATCTTTAA CAOAAACATC ATAGCT6CAT TAAAAAAATA CAATATGGAC 
ATGTAAAAAG ACAAAAACCA AOTTATCTGT TTCCTOTTCT CTTGTATAGC TGAAATTCCA 
GTTTAGQACC TCAOTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 
AAGAAAGCTT CTTGT6ATCC TTOGQGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTOTAA CTTGGGCTCT 
AAAGCAAACT GGGGCTGTGT TTTCGATGAT QATGATCATC ATGATCATQA TGATTTTAAC 
AOTTTTACTT CTG6CCTTTC CTAGCTAOAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 
TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 
CTTTTTTGCC ACAAAQAAGA TTCTTACCAA GAGTGGGCTT TGTGOAAACA GCTGGTACTG 
ATGTTCAOCr TTATATAT6T ACTAGCATTT TCCAOGCTGA TGTTrATGTA CTGTAAACAO 
TTCT G CACTC TTOTACAAAA GAAAAAACCA OOOSOAATTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 

300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



Seq ID NO: 84 Protein sequence 
Protein Accession tfi NM_00S264.X 



982 



wo 03/042661 



PCTAJS02/36810 



1 11 21 31 41 51 

1 I I 1 t I 

. MFLATLYPAIi PLU>LX«LSAS VSGGDRLDCV XA8DQCLXE0 SCSTKYRTLR QCVAQKETNF 60 

5 SLASGLEARD ECRSAMSALX QKSLYKCXCK RGMXKEKNCL RXYWSMYQSL QGNDLLEDSP 130 

YEPVNSRLSD IPRWPPISD VFQQVEHIPK GNNCU)AAKA QILDDICKKY RSAVITPCTT 180 

SVSNDVC2JRR KCHKALRQPP DKVPAKHSYG MLFCSCRDIA CTBRRRQTIV PVCSYBEREK 240 

PKCLNLQDSC XTNYICRSRL ADFFTNCQPB 8RSVSSCLKE KYAOCLLAYS GLIGTVMTPN 300 

YIDSSSLSVA PWCDCSNSGN OLEECLKPUl PPKCMTCUCN AIQAPGNQSD VTVWQPAPPV 360 

10 QTTTATTTTA LRVKNKPLGP AGSBfBIPTH VLPPCANLQA QKLKSNVS6N THLCISNQNY 430 
EKBQU3ASSH ITTKSMAAPP SGGLSPXiLVL WTALSTLLS LTBTS 

Seq ID NOi 65 DMA eequence 
Nucleic Acid Accession 8: XM_027172.l 
15 Coding sequence! 143.. 1405 

1 11 21 31 41 51 

I I i i I ) 

GQTGGAGACA CX33CCTCAGG GCTCGGTGCA CAGTGGACAT TTGGGGAGOG TTGTGGGTGA 60 

20 CCCCCACACA GGCACTGGGA ATGCAGGGGA GAGGGGGCCA AGGGQGAAAG GGGCCAGAGT 120 

gTTGGCTTTG GATTCAQGAO GGATGGATTC CAGTCCTAGC TTOCCACTTA TTAGQACTCC 180 

TGAGAGCAGC CTOCATGAGG CCCTGGACCA GTGCATGACC QCCCTGCACX TCTTCCTCAC 240 

CAACCAGTTC TCAGAAGCAC TCAGCTACCT CAAGCCCAGA ACCAAGGAAA GCATGTACXa 300 

CTCACTQACA TATGCCACCA TCCTGGAGAT GCAGOCCATG ATGACXnTTG ACCCTCAGQA 360 

25 CATCCTOCTT GCCG6CAACA TQAT0AA6GA GOCACAQATG CTGTGTCAGA GGCACCXX3A0 420 

GAAOTCTTCT GTAACAGATT CCTTCAGCAG CCTGGTQAAC CGCCCXACGC TGGGCCAATT 480 

CACTGAA6AG GAAATCCACG CTGAGGTCTG CTATGCAGAG TGCCTGCTGC A6C6AGCAGC 540 

CCTOACCTTC CTGCAGGGTT CCTCACACGG AGGGGCAGTC AQ6CCCAGAG CCTTGCATGA 600 

TCCCTCTCAC GCCTQCAGCT GCCCACCTQG GCCAGGCCGT CAGCaTCTTT TCCTCCTGCA 660 

30 GQA06AQAAC ATGGTOAGCT TCATCAAAGG OGGCATCAAA GTT06AAACA GCTACCAGAC 720 

CTACAAG6A0 CTG6ACA6CC TTGTTCAGTC CTCACAATAC TGCAAGGOTQ AGAACCACCC 780 

GCACTTTOAA GGAGQA6TQA AGCTTGGTGT AGGGGCCTTC AACCTGACAC TGTCCATGCT 840 

TCCTACTAGG ATCCTGAGGC TGTTGGAOTT TGTGGGQTTT TCAGGAAACA AGGACTATGG 900 

GCTGCTGCAG CTGGAGGAGG GAQCX3TCAQG GCACAGCTTC CGCTCTGTGC TCTGTGTCAT 960 

35 GCTCCTGCTQ TGCTACCACA CXTTTCCTCAC CTTOGTGCTC GGTACTGGGA ACGTCAACAT 1020 

CGAGGAGGCC GAGAAGCTCT TGAAOCOCTA CCTOAAOOOO TACCCTAAGG QTGCCATCTT 1080 

CCTGTTCTTr GCAGOGAGGA TTOAAGTCAT TAAAGGCAAC ATTGATGCAG TQAGTQATGG 1140 

GGGTCCGGGC CGGGGCTGGG GATCCCTCGG GGTCTCOCAO ACXIAGCAGGA AGTCAGGCAC 1200 

ATGTGACATA CTCAGGGACA GOATAGACTG GGGGCGGGGQ GGGGGCCAAQ AGAGAACCAA 1260 

40 CXaCAGAGCA QQGOCAGGAG AGGCCCTTCT GGCAGAGCAG CCTGGGAAGA CAAQQGAGGA 1320 

GGAGGCATTT GTGOTGCCTQ OQATTTIGAC TGGGAOATAT AGtSACTGCAG CATTGCAGTG 1380 

OAGGGAGOTG GAGGGAGGTG CTTGAGGGAG GCAGAGGTTA G6AAA6CCCA TCTOTT TAGG 1440 

GCATGAOGAT TAGGCTGGAG TCTGGTACCT CCCCTCCATT ATAGCTCTCT CCTOCTCTTT 1500 

CATTTTGTTA CTAAAAACCA GAGTCCTAGG CGGGGGCTGT ATTTGAGCCC AAOTTCATGT 1560 

45 AAGACTTAGG AGGTAAAACC AGGACTGGAQ G0CA6ATCTC CTGGCTOCTG GGGCCCXACC 1620 

TGAGCCTAGC ACAGGGCTGG ACCACTATOC GCTOaAOGAO T C COGOTCTG CTGTGGTQTT 1680 

GGGAGOTTC96 GAGGATGCAG AGGGGTTGOG GCTGG G TGGG CACCGGTCAG GCT GACC AGA 1740 

AGGTGCCTGC AGGCCATCCG GCGTrTCGAG GAOTGCTGTG AGGCGCAOCA GCACTGGAAG 1800 

CAGTTCCACC ACATOTGCTA CTGGGAGCTG ATGTGGTGCT TCAGCTACRA GGGCCAGTGG 1860 

50 AAGATQTCCT ACTTCTACGC OGACXnXSCTC AOCAAGGAGA ACTGCTGOIC CAAQQTGGGC 1920 

T0AT3CCA0G TOTTAOGGGC ATTG G GTGAC CAGGGCTGAC TGTGTGCCTC CAGACCAOSG 1980 

GCXAAATCCC TAACT6AACA CAGATGTCTC AGCTGOAATC TAAACATAAC CTTAAATTCT 2040 

AATAGGACTC AGGCTTGGAA GQAACTAAAG ACCACAAQAG AAACTTCTGA ACCACAATGT 2100 

GTCACAAAGA GATTTCTAGT CACAAAGGAC AGAAACATGG CTCCCTCTGT CCAGTAGAAC 2160 

55 TGTTTCT6TG GTAGAAATQT TCTACTTATG CACTGCCCAA ATATGQTAGT CACCAGCCAC 2220 

ATGT6CACAA TGAGCACATG AAATGTGCCT AQTGCAACTG GGAAACTGAT TQTTTTCAST 2280 

TTTATTTAAT TTTAATTAAC TAAATOTTAA ATTTAAATAG GCATGTAGGG CTTGTGGCCA 2340 

CTATATTGGA CTATQCAGOT CCAAAACACA AAAGGCTCAT ATAACTGAAC ATTCrTGGCA 2400 

CATCCGACTT CAGGTAGGGC TGGATCCAGO AATTCAAATG ATGTOOTCTG GCTTGGTCTT 2460 

60 TCCATTTGTO GCTGtGCTCT CTCCTATGAC ATCTTTGTTT CTGCTGCATC TTTGCAGGAA 2520 

GGTTCTCTCC ATGTGACAGG CAAG6TGGCC AGGG6CIGCT TCTACTGATA TCCTGCOCTT 2580 

GQTTTCAACC .AGAGTCCCAC AOTTTGAGCX: TCACTGGTCT GACTTGTCAC CTGCCAATGC 2640 

CTGGAACAGA GGGGTGGGAA QATTCXTTCAT GGGCTQAGAO TAGGAGAGGG GTGQTTCCAC 2700 

AGAA6AAAAT GATGCACCCA GAAATAQTGG AGGGATTAAC AAGATGCCAT ACAGGCAAAA 2760 

65 CAAA6CCAAA CAGATGCCCO CCTACCAAAG ATGAAATTTA TCATGGTAAG TATTGAAATA 2820 

AGTGTTAGCr TGTACCATCA TAOTAATQAT AOTGCAGAAA TTG6AACCAA GAGTCTTACA 2880 

ACCACCTA6C TCAGCAAAOQ TCTAATCTGT TATTTGTAAA TACACA0GAC ATGTGTCTTC 2940 

ATGGCTTCAT CCCCTGCCCA TAGCAGACAT TGCTAATCAA TCCTCTGCCA TGAGCCTAGC 3000 

TGTGACCTTA GATACCTTCC CTGCAAAGCT CCAGGCAGCC GTTAGAACTG ACCCACATTG 3060 

70 GCACCTOTTA G6AGACCTGT CATCTGCATC TGGTCCCTCC CCCTCCTCTG CATGGAGGTC 3120 

TCTTCGOTAO CCCT G CCT OQ GGACCAAGCT QAAOGGOAAT TCTC TOOOT C TGQGQCA flGT 3180 ^ 

GGGGCTGG6Q AAGGQAGCA6 6CTTCTACTG A6CTCTCAAC ATGTCTGGCA CCTGTCATGT 3340 

CCTCCACACC ACACCCAGTG AGGAGAAGTT ATTATCTCCA TTGGGCACAT GAGGGOCACT 3300 

GATGCTAAQC GAGQACTGTG ACTTTCCCAC CCACATQGCT GTTGGTGACA GACACOSGAT 3360 

75 TTGAGGACAO TCCTGTQTGA CTCAGAGGCC TCTGCTTCTC TCTTGCTCCT AGAGTCCTCC 3420 

TGAGOAOTOQ G Qg CTT G CCC T8A0C0CACC CT O CTOTTQA AGGTOCTTCC TCAGGCCCAO 3480 

CTCCCATOGC OCCCACACCC CCTCCTCATC ACCTCCTACT CCCAAAAAGG ACAAAOCCTC 3540 

AGGGAACCTT ' rm ' Ci ' l i ' i i TTAGAGACAG GGTCTTGCTA TGTTGGTCAG OCTAGTCTTG 3600 

AACTCCTGGG CCCAAGCAAT CTTCCCACCT CTGCCTCCTA AAGTGCTGGQ ATTCTGGGCC 3660 

80 TTAGGGAACC TTTTTGAACT GAAAGTGACT CTOGAAOCCT TCCTGTAGAO TGAGGTGGGT 3720 

agCTGGaOCT AACCATATGG QGAAGOAGAO AOCTGGTG G Q GGCACACAGC TQCTATATAG 3780 

A0GAACA8AA GGT6GGCCAG GCCTCOCAOC TGrCAOOAAA TCTGOCTCAG TCCCCAQACC 3840 

QCrGTOTOGC TTTOGGOTGO CCAGTOCCTC TCTCTCTQCA ACACTCTCCT ^TQAGGACT 3900 

TCTOTGAAAA ATGGGGOTTG TAACCCTCTG AGTTCTGGG6 CCCCAOCCAT TTCTAATCTC 3960 
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CAGAACrrCQ CCACCACAGA CTCCACCAAC TTCTAGTCCT GGGGCCTGGG CCTCTGGCCA 4020 

TTGCCATAGG CACX3VCCTGC TCTGTGCAGG CAGCGCCCCC CTCTGCCAGG ATCCTCOGAG 4080 

GTCAGCTGCT GGGTCTGACC OGCAGACCCT GGCTOAGCGA CGCyiTGAACG GA CTATGC AG 4140 

ACACAGGCTT GCCTGTCAGC AGATGGGGGA CCCCTGCCAQ AGTCAGCAOC GGOCCCCATA 4200 

5 AQCCTGCCAC GCTTGCATTT ATTTAGTACA GATGTAATGA CAAAQGCCTA AAGCAAACTC 4260 

CATTTOTGGO TAATTAACAT TQTCGCCCCC CCAGAAAGAC CAGTCCTCOO CATGATGATT 4320 

AAAGGCCAG6 TTCOGAGGCC TAAGTAAACC AACTTATCTA GATCAATTCC CTTACTTCTT 4380 

GTTATCTACr CTGAGAQAAT TCAGCTGCCT TCAQCCAAAT CCTTTCCOGA AGCTTTTGCA 4440 

AAACCTCCGA GCCTTCCAAG GTTTGCTTCT TTCTGTAATT TTTCTCACCA CCCTQACCTA 4500 

10 TCTCCTGCAG TCAGCCCTGT GGAGGCCTTT GTGTTTCCXX: CAQT GCTG GC AGCCTAGAGG 4560 

CTGAOATGOC CA6AAACAAO GTGGTGACA6 TGGCGTGCTC AG6GCTTGGG AAAOCCAAGG 4620 

A6CTAAA66C ATGCCCAG6C AACXIAAAGAG GACAGGAAGG CTTCTGAGQA GAGACCTCTG 4680 

AGGTGGGTCT TGGAQAQGAA GQACTTAQGQ AGGCAGAGTQ GAGGAAGTGA GAGGACACCC 4740 

CAAGCCAAQA GGGCGGCAGQ ACCAAAGGCT CAGAAGCCAG GGOGCTGCAG AGGGQCTGTG 4800 

15 TGCCACAGGG TGAAQAQTTT GTGTGQCAOA AGGGCAOOGQ GCTTGCATCA GGGGTGACAG 4860 

cixiCi' C T r rr gtgccagcat agcccctgta catccctgga gagctggggc gtccacaact 4920 

CTAAGTCACA GCCCCC3VTCC TAACCCTGGT GGTaCAQTGA GGGTQAGCTG TCTGTGGGCA 4980 

GGAQGGAAGA CTCTTGGAGA TGAGCCTGGT GAAGGGATAA TGGCATCCCG GGCCGAGGAG 5040 

CAGCACAGGC AGAGGCCTGG G6AGAGTTTA AGGAGTGTAG GGQAGQAAAT GGCAQAAGAT 5100 

20 GA6CCAGAAA AAGAAAGGTT AGGGCAGGTC CTGGAGGACA TQAGTGGCTO TTTGGGCTTT 5160 

ATCXAGCAGT GQGGGAOOCT TGGCAGGCTT GTGGCTTAQA TAOGTGCm AGAAAGGCCA 5220 

CCAQCAOTTG CTGGGCCACC C CSCi'GqCTQ U H I X Oi'lWL V TAAOGCAGGA AATACAAGCA 5280 

TGAGCAGGAA AAQACCCCCT CAAGGCTCAC GTCCTAGrTGG GGAGACAAGA AACACAGATG 5340 

GGCAATATAA CACGATGTCT GGTTCCAGTA AGTGCAGTGA AGAACAAGCG AGGCTGGATG 5400 

25 CAGGGAGTQA TGGGAGGGGC TTTGTAAGGG GAGGTCGGGG GAAGCCTGTC TCAGAGGACA 5460 

CGAGAATOGA G0GCM3GAGC AGCA0GTG6C AGTChCATGO GA6GC06TTA GGGGA6AGGG 5520 

AGCTGGOCAG GGCACAOCAO GGCAGGAOTQ TGTTT6ATGT GTCCTGG6AA COGCCCTQAQ 5560 

6C0GTCX3TGT GGCTGGAGTG CTGCAGQTGT CAACdAAATT GTAGQAGATG TCTCCTGAOT 5640 

GTGATGGAAT ATAACCAGAT TTCXyVGAAGG AACTGACATG ATCTGACTTA AAAAGGTCAG 5700 

30 TGTOCGAAAT GGCTTGCA06 GGACAGGAGT GGGAGCAGGG AGATAGGA6A CAATGTGTAC 5760 

CAGGACAGCA GAAAQACATC CCG6GTAGCC TOGAACA0G6 AGA066TGTG GAGATGGTX^G 5820 

CAGTCCGATA ATGAGAOCCG TAGG6CAAGG CCAGCAGGAT CCrAGAGTGA GA06GGA0GT 5880 

AAAGTCACCG GGACTTGGTG TCTCCAOGTC AGGGGCAOGO GAAAGGGAGA G6ACAAGGGT 5940 

GACCCGGGAG GTTAAAQATG GGACOGGGGC CAGAOGCAGT GGCTCATGCC TGTAA TCCT A 6000 

35 GCACTTTGGG AGGCTGAGGC GG606GATAG CTTGAGGTCA GOAGTTTaAA ACCAQCCTGG 6060 

CCAACATGGT GAAACCC06T CTCTACTAAA ATATACAAAA ATTAG0CTG6 06TGGTGGTG 6120 

CATGGCTGCA OTCCCAGCTA TTCAGGAG6C TGAG6CAACA AGAAT06CTT GAACCTQGQA 6180 

GGOGGAGGTT GCAOTQAGCC GAGAT06GGC CATAOCACTC CAGCCTTA6C CTGGGGQACA 6240 
OAOCQAOACC AGATC 

40 

8eq ZD NOt 86 Protein sequence 
Protein Accession #i XP_027172.l 

^^1 11 21 31 41 51 

45 1 1 I i I I 

MDSSPSLPLZ RTPBSSLHBA LDQCKTAIOL FLTNQFSEAL SYLKPRTKES HXHSLTYATI 60 

LEMQAMMTFD PQDIX«LAGNM MKEAQMLOQR HRRKSSVTDS FSSIiVNRPTL GQFTEEBZHA 120 

BVCYAECLLQ RAALTPLQQS 9HQGAVRPRA LHDPSHACSC PPGPGRQHLP LLQDKNMVSP 180 

IKGGIKVRNS YQTYKELDSL VQSSQYCKGB NHPHFEGGVK LGVGAFNLTL SMLPTRILRL 240 

50 LEFVGFSGNK DYGLLQLEEO ASGHSPRSVL CVMIjLLCYHT PLTFVLGTGM VHIKBAEKLL 300 

KPYLNRYPKS AIFLFPAGRZ EVIKGNISAV SDG6PGSGHG SU^VSQTSRK SGTCDILRDR 360 
IDW6RGGGQE RTNQRAGAGB ALIAEQPGKT REBEAFWPO ILTGRYRTAA I^HREVEGGA 



- Seq ID NO: 87 DNA sequence 
55 Nucleic Add Accession tft AB007921 
Coding sequences 143.. 1363 

1 11 21 31 41 51 

iCA I 1 J I 1 ' 

OU GGTGGAGACA CCGCCTCAGO GCT0G6T6CA CAGTGGACAT TT6GGGAGCX3 TTGT6GGTQA 60 

CCCCCACACA G6CACTG6GA AXGCAGGGGA GAGGOOQCSCA AGGOGGAAAO GGGCCAOAOT 120 

UTiqGC TTTG GATTCAGGAG GQATOGATTC CAGTCCTAGC TT6CCACTTA rTAGQACTCC 180 

tGAGAGCAGC CTCCATGAGG CCCTGGACCA GTGCATGACC GCCCTGOACC TCTTCCTCAC 240 

CAACCAGTTC TCAOAAGCAC TCAGCTACCT CAAGCCCAGA ACCAAGGAAA GCATGTACCA 300 

65 CTCACTGACA TATGCCACCA TCCTGGAGAT GCAGGCCATG ATGACCTTTG ACC CTCAG QA 360 

CATCCT6CTT GOOQGCAAGA TGATQAAOGA GGGACAGAT6 CTOTGTCAGA GGCACOGGAO 420 

GAAGTCTTCT OTAACAGATT CCTTCAGCAG CCTGGTGAAC G6CCCCAGGC TGGGGCAATT 480 

CACTGAAGAG QAAATCCAOQ CTGAGGTCTG CTATGCAOAO TGCCTGCTGC AGOGAGCAGC 540 

CCTQACCTTC CTGCAQGGTT CCTCACAOGG AGGGGCAGTC AGGCCCAGAG CCTTGCATGA 600 

70 TCCCTCTCAC GCCTGCAGCT GCaaUXTGO GCCAGGCG6T CAOCATCTTT TCCTCCTGCA 660 

GGAOGAGAAC ATGOTOAGCT TCATCSVAAGQ C3G6CATCAAA GnOSAAACA 6CTACCAGAC 720 

CIACAA6GAG CTCGACAGCC TT0TTCA6TC CTCACAATAC TGCAAGGGTO AGAACCACCC 780 

GCACTTTGAA GGAGQAGTQA AOCTTOGTGT AGGGGCCTTC AACCTQACAC TGTCCATGCT 840 

TCCTACTAGG ATCCTGAGGC TGTTGQAGTT TGTGGGQTTT TCAGGAAACA AGGACTATGG 900 

75 GCTGCPGCAG CTGGAGGAGG GAGOGTCAGG GCACAGCTTC CGCTCTOTGC TCT6TGTCAT 960 

6CTCCTGCTG TGCTAOCACA CCTT C CTCAC CTTCGTGCTC GGTACTGOGA AOCTCAACAT 1020 

OBAGGAGGCC GAGAAGCTCT TGAAGCCCTA CCTQAACCGG TAOCCTAAOG GTGOCATCTT 1080 

CCTCTTCTTT GCAGGQAGOA TTQAAGTCAT TAAAGQCAAC ATTGATGCAG TGAGTGATGG 1140 

GGQTCCGOOC CGGGGCTGGG GATCCCTCGG GGTCTCCCAG AOCAGCAGGA AOTCAGGCAC 1200 

80 AT6TGACATA CTCAGGGACA GGATAGACTO GGQGOGGGGG GGGGGGCX3UI GAGAGAACCA 1260 

AOCAOAOAGC AGGG6CAGQA OAGQOCCTTC TG6CAGAGCA GCCIGGOAAO ACAAGGGAGG 1320 

AGGAGGCATT TGTQQTGCCT Q6GATTTTGA CIGGOAGATA TAGGACTQCA GCATT6CAGT 1380 

GGAGOOAGGT G0AGG6AGGT GCTTGAOGaA GGCAGAGOTT AGGAAAGCCC A TCTG TTTAQ 1440 

GGCATGAOGA TTAOGCrOGA GTCTGGTACC TCCOCTOCAT TATAOCTCTC TCCTGCTCtT 1500 
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TCATTTTGTT ACTAAAAAOC A6AGTGCTA0 GOGG G O G CT O TATTTGAGGC CAACGTCATG 1560 
TAAGACTTAG GAGGTAAAAC CAGOACTGGA GGCCAGATCT CCtGGCTCCT GGGGCCCCAC 1620 
CTGAGCCTAG CACAGGGCTG GACCACTATG CCCTGGAGGA OTCCCQGTCT GCTGTGGTX3T 16 BO 
TGGGAGGTTC GGAGGATGCA GAGQGGTTGG GGCTGGGTGG GCACCCX5TCA GQCTQACCAG 1740 
5 AAGGIGGCTQ CAQGCCATCC OGOQTTTOGA GGAGTGCTGT GA66CCCAGC AGCACTGGAA 1800 
GCAGTTCCAC CACATGTGCT ACTGGGAGCT GATGTGGTGC TTCAOCTACA AGG6CCAGIG 1860 
GAAGATGTCC TACTTCTAG6 C06ACCTCCT CAGCMOQAG AACT G CTGGT GCAAGGTGGG 1920 
CTGATGCCAC GTGTTAGGGQ CATTGGGTGA CCAGGGCTGA CTGTGTGCCT CCAQACCACG 1980 
GGCCAAATCC CTAACTGAAC ACAGATGTCT CAOCTGGAAT CTAAACATAA CCTTAAATTC 2040 
10 TAATAGGACT CA66CTTGGA AGGAACTAAA GACCACAAGA GAAACTTCTG AACCACAATG 2100 
TGTCACAAAG A6ATTTCTAG TCACAAAGOA CAQAAACATG GCTCCCTCTG TCCAGTAGAA 2160 
CmT ' IlCm T GOTAGAAATG TTCTACTTAT GCACTGOCCA AATATGQTAO TC ACCAG CCA 2220 
CATGTGCACA CTGAGCACAT GAAATGTGCC TAOTGCAACT GGQAAACTGA TTGTTTTCAQ 2280 
TTTTATTTAA TTTTAATTAA CTAAATGTTA AATTTAAATA GCCATGTAQO GCTT GTGG CX: 2340 
15 ACTATATTGG ACTATGCAGO TCCAAAACAC AAAAG6CTCA TATAACTQAA CATTCTTGGC 2400 
AGATGOGACT TCAGGTAGGG CTGGATGCAG GAATTCAAAT GATGTCGTCT G^TTGGTCT 2460 
TTCCATTTGT GQCTGTGCTC TCTCCTATGA CATCTTTGTT TCTGCTGCAT CmUOGGA 2520 
AGGTTCTCTC CATGTGACAG GCAAGGTGGC CAOQGGCTGC TTCTACTCAT ATCCTCCCCT 2580 
TGGTTTCAAC CAOAGTCCCA CAGTTTGAGC CTCACTGGTC TGACTTGTCA CCTQCCAATG 2640 
20 CCTGGAACAG AGGGGTGGGA AGATTCCTCA TGGOCTQAGA OTAGGAGAGG GGTGGTTCCA 2700 
CAGAAGAAAA TGATQCACCC AGAAATAGTG GAGOGATTAA CAAGATGCCA TACA6GCAAA 2760 
ACAAAGCCAA ACAGATGCCC GCCTACCAAA GATOAAATTT ATGATGQTAA GTATTGAAAT 2820 
AAGTGTTAGC TTGTACCATC ATAGTAATGA TAOTGCAQAA ATTGGAACCA AGAGTCTTAC 2880 
AACCACCTAG CTCAGCAAAC GTCTAATCTQ TTATTTGTAA ATACACAGOA CATQTGTCTT 2940 
25 CATGGCTTCA TCCCCTGCCC ATGGCAGACA TTGCTAATCA ATCCTCTGCC ATGAGCCTAG 3000 
CTGTGACCTT AGATACCTTC CCTOCAAAGC TCCAG6CAGC CGTTAGAACT GAOCCACATT 3060 
GQCACCrGTT AGGAGACCTG TCATCTGCAT CTGOTCXXTTC CCCCTCCTCT GCATGGAGGT 3120 
CTCTTCCGTA GCCCTGCCTG GQGACCaUlGC TGAAGGGGAA TTCTCTGGGT CTGGGGCAGG 3180 
TGGGGCTQOG OAAGGGAGCA GGCTTCTACT GAGCTCTCAA CATGTCrGGC ACCTGTCATG 3240 
30 TCCTCCACAC CACACCCAOT GAG6AGAAGT TATTATCTCC ATTGGGCACA TGAGGGCCAC 3300 
TGATGCTAA6 CGAGGACTOT GACTTTCCCA CCCACATGGC TGTTGGTGAC AGACACOGGA 3360 
TTT6AGGACA OTCCTGTOTO ACTCAGAGGC CTCTGCTTCT CTCTTOCTCC TAQAGTCCTC 3420 
CTGAGGAOTC GGOQCTTGCC CTGAGCCCAC CCTGCTGTTG AAGGTGCTTC CTCAGGCCCA 3480 
GCTCCCATGG CCCCCACACC CCCTCCTCAT CACCTCCTAC TCCCAAAAAG GACAAAGCCT 3540 
35 CAGGGAACCT m T i ' CTi 'rT TTTAGAGACX5 GGGTCTTGCT ATGTTGGTCA GGCTGGTCTT 3600 
QAACTCCTGG GGCCAGGCAA TCTTCCCGCC TCTGCCTCCT AAAGTQCTGG 6ATTCTGGGC 3660 
CTTAGGGAAC CTTTTTGAAC TGAAAGT6AC GCTCGAA6CC TTCCTGTAGA GTQAOGTGGG 3720 
TGGCTGGGGC TAACCATATO GGGAAGGAGA GACCTGGTGG GGQCACACAG CTQCTATATA 3780 
GAGGAACAGA AGGTGGGCCA GGCCTCCCAG CTOTCAGGAA ATCTGGCTCA GTCXCCAQAC 3840 
40 CGCTGTGTGG CTTTGGGGTG GCCA6TCCCT CTCTCTCTGC AACACTCTCC TCATOAGGAC 3900 
TTCTCTGAAA AATGGGGGTT GTAACCCTCT GAOTTCTGGG GOOCOUXCA TTTCTAATCT 3960 
CCAGAACTTG GCCACCACAQ ACTCCACCAA CTTCTAGTCC TGGGGCCTGG GCCTGTGGGC 4020 
ATTGCCATAG 6CACCACCTG CTCTGTQCAG GCAGCGCCCC CCTCTGOCAQ GATCCTCCGA 4080 
GGTCAGCTGC TGGGTCTGAC CCGCAGACCC TGCCTGAGCXS ACGGATGAAC GGAGTATQCA 4140 
45 GACACAGGCT TGCTTQTCAG CAQATGGGGG ACCCCTGCCA 6AGTCA6CA6 CG6CCCCCAT 4200 
AAGCCTGCCA CGCTTGCATT TATTTAGTAC AGATGTAATG ACAAAGGCCT AAAGCAAACT 4260 
CCATTTGTGG GTAATTAACA TTQTCGCCCC CXrCAGAAAGA 6CAGTCCTCC GCAT6ATGAT 4320 
TAAAGGCCAG GTTCOGAGGC CTAAGTAAAC CAACTTATCT AGATCAATTC CCT TACTT CT 4380 
TGTTATCTAC TCTQAGAGAA TTCAGCTGCC TTCAGCX3UVA TCCTTTCCCG AAGCTTTTGC 4440 
50 AAAACCTCCG AQCCTTCCAA GGTTTGCTTC TTTCTGTAAT TTTTCTCACC ACCCT6ACCT 4500 
ATCTCCTGCA GTCAGCCCTO TGGA6GCCTT 'At3'r G TTT 0 0C CCAGTGCTGG CAGC CTAG AG 4560 
GCTOAGATGG CXSW3AAACAA GGTGGTGACA GTGGCGTGCT CAGGGCTTGG GAAACCCAAG 4620 
GAGCTAAAOG CATGCCCAGQ CAACX^UUVGA GGACAGGAAG GCTTCTGAGG AGAGACCTCT 4680 
GAGGTGGGTC TTGGAQAQGA AGGACTTAGG GAGGCAGAGT GGAGGAAGTG AGAGGACACC 4740 
55 CCAAGCCAAQ AGGG0G6CAG QACCAAAGGC TCAGAAOCCA GGGOGCTGCA GAGGGGCTGT 4800 
GTGCCACAGG GTGAAGAGTT TQTGTGGCAG AAGGOCAGGG GGCTTGCATC AQGGGTGACA 4860 
GCTGCTCTTT TGTCCCAOCA TAGCCCCTGT ACATCOCTGO AQAQCTQQGG CG TCCA CAAC 4920 
TCTAAGTCAC AGCCCCCATC CTAACCCTGG TGQTGCAGTG AGGGTGAGCT GTCTGTGGGC 4980 
AGGAGGGAAG ACTCTTGGAG ATGAGCCTGQ TQAAGGGATA ATGGCATCCC GQGCCGAGGA 5040 
60 GCAGCACAGG CAGAGGCCTG GGGA6A0TTT AAGGAGTGTA GGGGAGGAAA TGGC AGAAGA 5100 
TGAGCXZAGAA AAAGAAAGGT TAGGGCAGGT CCTGGAGGAC ATQAGTGGCT QTTTGGGCTT 5160 
TATCCAGCAO TG6gGQAGCC TTGGCAGGCT TGTGQCTTAG ATAGGTGCTT TAGAAA6CCC 5220 
ACCAGCAGTT GCTGGGCCAC CXXGCTGGCT GGGTCCXXjTT CTAAGGCAGG AAATACAAGA 5280 
ATGAGCAGGA AAAGACCCCC TCAAGGCTCA OGTCCTAGTQ GGGAGACAAG AAACACAGAT 5340 
65 GGGCAATATA ACACGATGTC TGGTTCCAGT AAGTGCAGTG AAGAACAAGC GAGGCTGGAT 5400 
GCAGGGAGTG ATGGGAGGGG CTTTGTAAGG GGAGGTOGGO GGAAGO CTGT CTCAGAOGAC 5460 
ACCA6AATGG AOCGCAGGAO CAGCACGTQG CAGTCACATG GCAG(XCGTT AGGGCA6AGG 5520 
GAGCXGGGCA GGGCACAGCA GGGCAGGAGT GTGTTTGATG TGTCCTQGOA ACOQCCCTGA 5580 
G G OCOTCGTG TGGCTQGAGT GCTGCA6GTG TCAAGGAAAT TGTAGGAGAT GTCTCCTGAG 5640 
70 TOTGATGGAA TATAACCAGA TTTCCAGAAQ GAACTGACAT GATCTGACTT AAAA AGOTC A 5700 
OTOTGCGAAA TGGCTTGCAG GGGACAGGAG TGGQAGCAGG GAQATAQGAG ACAATGTGXA 5760 
CCAGGACAGC AGAAAGACAT CCOSGGTAGC CTGGAACAGG GAQACGGTGT OGAGATGGTO 5820 
GCAOTCOGAT AATGA6AGCC GTAGGGCAAG GCCAQCAGGA TCCTAGAGTQ AGAOGGGAGG 5880 
TAAAGTCACC GGGACTTGGT GTCTCCACX3T CAGGGGCAGG GQAAAGGGAO AGGACAAOGG 5940 
75 TGACCCGGOA GGTTAAAGAT GGQACOGGGG CCAGACX3CAG 1X3GC TCATG C CTGTAATCCT 6000 
AGCACTTTGG GAGQCTQAGG CXMGOGGATG GCTTGAOGTC AGQASTTTaA A ACOG GCCTG 6060 
6CCAACATGG TGAAACCCCG TCTCTACTAA AATATACAAA AATTAGOCTQ GCGTOQTGGT 6120 
GCATOCCTGC A6TCCCAGCT ATTCAGGAGG CTGAGQOGAC AAOAATCGCT TGAACCTGGG 6180 
_ _ AGGOGGAGGT TGCAOTGAGC CQAOATCGOO CCATAGCACT CCAGCCTTAO CCTOGGCOAC 6240 
80 AGA6C6AGAC CACATC 



8eq ID KOi 88 Protein sequence 
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VETPPQQSVH SQHIiGSWGD PHTGTGMAGE ROPRGXGARV UOiDSGGMDS SP3 LP1#I RTP 60 

5 BSSUiBALDO CMTALDLPLT NQFSKALSYL KFRTKESMYH SLTYATILQ4 QAMPfTFDPQD 120 

ILUAGmWKE AQhJLCQRHRR KSSVTDSPSS LVNRPTLGQF TBEEIHAEVC YAECLLQRAA 180 

LTPLQGSSHO GAVRPRALHD PSHACSCPPG PGRQHLFLLQ DENMVSPIKG GIKVRNSYQT 240 

YKELDSLVQS SQYCKGENHP HFEGOVKLGV GAFNLTLSML PTRILRWiBF VGFSGNKDYO 300 

UiQLBBGASO HSFRSVLCVM LLLCYHTPLT PVLGTGNVMI EBAEKLLKPY LNRYFKGAIF 360 

10 LFFAGRIEVZ RGNIDAVSDG GPGRGWGSLO VSQTSRKSGT CDILRDRIDW GROGGPRQIQ 420 
PESRGRRGPS GRAAHEDKGG GGZOGAHDFD WEI 

8eq ID NOt 89 DMA sequence 
Nucleic Acid Accession #; AF007170 
15 Coding sequence! 73.. 1725 

1 11 ai 31 41 51 

1 I I I I I 

AAGGAGGOGG CCTCCGGGAA AAGOSACOGC AGGACTCCTG AGAGCAGCCT CC3VTGAGQCC 60 

20 CTGGACCAGT GCAT6ACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 

AGCTACCTCA AGGCCAOAAC CAAGGAAA6C ATGTACCACT CACTGACATA TGCCACXyVTC 180 

CTGGA6ATGC AGQCCATQAT GACCTTTGAC CCTCAGQACA TCCTGCTTQC OSGCAACATG 240 

ATOAAGGAGG CACAOATGCT GTGTCAOAQG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 

TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCRATTCA CTGAAQAAGA AATCCAOGCT 360 

25 GAGGTCTGCT ATGCAGAGTG CCTGCTOCAG CX3AGCAQCCC TGACXTTTCCT GCAGGACGAG 420 

AACAT6GTGA GCTTCATCAA AGGG6GCATC AAAGTTGGAA ACAGCTACCA GACCTACAAG 480 

GAGCTGQACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTQAQAACCA CCCGCACTTT 540 

GAAGGAGGAG TGAA6CTTG0 TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 

AGGATCCTGA GGCTGTTGGA GTTTGTGGGO TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 

30 CAGCTGGAGG AGOGAOOGTC AGGGCACAGC TTCOGCTCTG T6CTCT0T0T CATGCTCCTG 720 

CTGTGCTACC ACACCTTCCT CAGCTTOTTG CTOGGTACTG GGAAOGTCAA CATCGAGGAG 780 

GC06AGAAGC TCTTGAAGCC CTACXTOUVC OGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 

TTTGCAGGGA GGATTOAAGT CATTAAAGGC AACATTGATG CAGCCATCOG GCGTTTCGAG 900 

GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 

35 ATQTQGTGCT TCACCTACAA GGGCCAQTGG AAGATGTOCT ACTTCTAOQC OGACCTGCTC 1020 

AGCAAG6AGA ACTGCTGGTC CAAG6GCACC TACATTTACA TQAAGGCGOC CTACCTCS^GC 1080 

ATQTTT6GGA AGGAOGACCA CAAGGOOTTC GGGGA00AC3Q AAGTGGAATT ATTTO GAGCT 1140 

GTGCCAGGCC T6AAGCTCAA GATTGCTGGG AAATCTCTAC CCACA6AGAA GTTTGCCATC 1200 

CX3GAAGTCCC GQCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTQCC TGCTCTGGAA 1260 

40 ATGATGTACA TCTGGAACGG CTACGCOGTQ ATTGGGAAGC AGCCXSAAACT CACOGATGGG 1320 

ATACTT6A6A TTATCACTAA GGCTGAAGAa ATGCTGGAGA AAGGCOCAGA GAACGAGTAC 1380 

TCAOTOQATG AC6AGT6CTT GGTGAAATTO TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 

CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 

TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGQ CCCTGCTGCT TATGGAGCAA 1560 

45 GACAGAAAOG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA C3UU3AATTAC 1620 

TCCATGGAGT CAAGGACACA CTTTCQAATC CAOOCAOCCA CACTCCAAGC CAAGTCTTCC 1680 

CTAGAGAACA GCAGCA6ATC CATQGTCTCA TCA O TGTCCT TGTAGCTTTG TGCAGCAGTT 1740 

COGGGCTGGA AGACAGAGAC AGCTGGACAQ AGCTCCTGAA AACATTTCAA AATACCXXXTT 1800 

CCCCCTQCCC TGCCCTGCCT TTQGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 

50 TGTATCCGTG CAGAAQCCGA GCTGGCATTT TCACCAGTGT AQCCAAGGGC C TTTGCC AAG 1920 

GGGAOAGCAO OIGGAOCCCT CTGCCT G CCC TATCACACAT AOGGGTACTT GCTTTTCACT 1980 

GT6ATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGA AATA CA TTGATGGGAT 2040 

CACA6TTGGC TTTAAAAACC AAC3VACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 

TATC31TCTGG AGGTAAATCT CTTTATATQA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 

55 TCAGCAAOTT CTCAGCTTGT GTQAOGQAAO GTCCTTCAGA GGACXTOAQG AATGCCTGGG 2220 

AGAGGCTAAO CCTCAG6CTT CAATGCTTCT G6GGTTGGGC ATGAGQATGT ACACAGACAC 2280 

CXavCTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 

AAGCAOOTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTQ TAGAAAAATC AATCTCTACC 2400 

AGTAGAAAAT GCCAGGGCTT QATGGAAGAG CTGTGTAGCC CTTTCTATQC CAAAGCCAOG 2460 

60 AAATTTGGGG GGCAGGAQGA GGTTCTC3VQA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 

CTGAAACCAC 1GGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTGA CT0CAAG6TA 2580 

CATTTGCTTA CTOACAOCAT TTTTOTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 

65 Seq ID N0» 90 Protein sequence 
Protein Accession AAC39582.1 

1 11 21 31 41 51 

ni^ \ I 1 • I ' 

70 KBAASGKSDR RTPBSSLHEA LDQCMTALDL FLTNQF8EAL SYLKFRTKES MYHSLTYATI 60 

IiEMQAKMTFD P^ILLAGMM MKBAQMLGC2R BRRKSSVTD3 FSSLVNRPTL GQFTBEBXBA 120 

BVCYAECLLQ RAALTFLQDB MMVSFIKOQl KVRNSYQTYR BLDSLVQ8SQ YCKGENBPBF 180 

EGGVKLGVGA FKLTL8MLPT RZLRLLEFVG FSGNKDYGLL QLEEGASGHS FRSVLCVMLL 240 

liCYHTFLTFV LGTGNVNIEB AEKLLKPYLN RYPKGAIPLF FAGRIEVIKG NIDAAIRRFE 300 

75 EGCBAQQBWK QFHHMCYWEL MWCFTYKGQH KMSYFYADLL SXENCWSKAT YIYKKAAYLS 360 

HFQKBDHKPP GDDEVELFRA VPGLKLKIAO KSLPTEKFAI RKSRRYFSSN PZSLFVPALB 420 

MMYINNQYAV lOKQPKLTDG ILEIITKABB MLBRGPSIEY SVDDECLVKL LRGLCLKYLO 480 

RVQBAEBNFR SISANEKKIK YCHYLIPMAL LBLALLLMBQ DRMEBAZRLL BSAKONYXBiy 540 

SMBSRTHFRI QAATLQAKSS LBNSSRSKVS 8VSL 



80 
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Nucleic Acid Accession tf> Bos sequence 

Cbdlng sequence i 31.. 906 
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1 11 21 31 41 51 

I i 1 1 i I 

CGGGTOGACC CAQ6C6TCCQ GGGAGftAAGG ATGGC06GCC TG6O66C3G06 G TTGGT CCTO 60 

CTA6CTGGGG CAG00GO6CT G0CX3AGGQQC TCOCASG6GQ AGOQTQAGCC GGTGTACGGC 120 

GACT606TAC TGCAGTG06A AGAGOVGAilC TGCTCTGOGG 6C6CTCTGAA TCACITCOUC 180 

TCCCGCCAGC CAATCTACAT GAGTCTAGCA GGCTGQACCT GTCGGGA06A CTOTAAGTAT 240 

GAGTGTATGT GGGTCACaJT TGGGCTCTAC CTCCAOTAAG GTCACAAAGT GCCTCAGTTC 300 

CATGGCAAGT GOCCCTTCTC CXS3GTTCCTG TTCTTTCAAG AGCCGGCATC GGCCGTGG CC 360 

TCGTTTCTCA ATGGCXTGGC CAGCCTGGTG ATGCTCTGOC GCTACGGCAC CTTOCjTGCCA 420 

GCCTCCTCCC CCATGTACCA CACCTGTQTG GCCTT060CT GGGTGTCCCT CAATGCATGG 480 

TTCTGGTCCA CAGTYTTCCA CACCAGGGAC ACTGACCTCA CAGAQAAAAT GGACTACTTC 540 • 

TGTGOCTCCA CTGTCATCCT ACACTCAATC TACCTGTGCT GCOTCAQCCT CATCOGCTTC 600 

GACTATGGCT ACAACXTTGGT GGCCAACGTG GCTATTGGCX: TGGTCAAOQT GGTGTGGTGG 660 

CTGGCCTGGT GCCTGTGGAA CCAGCGGCGG CTGCCTCAOG TOCGCAAOTO CGT G gT G GT Q 720 

GTCTTGCTGC TGCAGGGGCT GTCCCTGCTC GAGCTGCTTO ACTTCCCACC GCT CTTCTGG 780 

GTCCTGGATG CCXZATGCCAT CTGGCACATC AQCACCATCC CTGTCCACGT CCTCTTTTTC 840 

A6CTTTCTGG AAGATGACAG CCTGTACCTG CTGAAGGAAT CAGAGGACAA GTTCAAGCTO 900 

GACTGAAGAC CTTGGAGCQA GTCTGCCCCA GTGGGGATCC TGCCCCOQCC CTGCTG GCCT 960 

CCCTTCTCCC CTCAACCCTT GAGATGATTT TCTCTTTTCA ACTTCTTGAA CTTGGA CATO 1020 

AAG6ATQT6G GCCCAGAATC ATGTGGCCAG CCCACCCCCT GTOK3CCCTC ACX»,GCCTTG 1080 

GAGTCTGTTC TAGG6AAGGC CTCCCAGCAT CTGGGACT08 AQAGTGGGCA GCCCCTCTAC 1140 

CTCCTGGAGC TGAACTGGGO TGGAACTGAQ TGTGCTCTTA GCTCTACXGQ QAGGACAGCT 1200 

GCCTGTTTCC TCCCCATCAQ CCTCCTCCCC ACATCCCCAG CTGCCTGGCT GGGT CCTOAA 1260 

GCCCTCTGTC TACCTGOGAG ACCAGGOACC ACAGGCCTTA GGGATACAGQ GGGTCCOCTT 1320 

CTGTTACCAC CCCCCACCCT CCTCCAGGAC ACCACTAGGT GGT6CXGGAT GCTTGTTCTT 1380 

TG6CCAGCCA AGGTTCACX3G OGATTCTCCC CATGGOATCT TGAGGGACCA AGCTGCTGGG 1440 

ATTGGGRAGQ AGTTTCACCC TGACCRTTGC CCTAGCCAGG TTCCCAGQAa GCC TCACC AT 1500 

ACTCCCTTTC AGGGCCAGGG CTOCAGCAAQ CCCAGGGCAA QGATCCT6TG CTQCTGTCTG 1560 

GTTGAOAaCC TGCXyvCCGTG TGTOGGGAGT GTGGGCCAGQ CT6AGTGCAT AGGTOAGAGG 1620 

GCCGT6AGCA TGGGCCTGG6 TGTGTGT6AG CTCAGGCACT AGGTGCGCAG TGTGGAGAOS 1680 

GGTOTTOTCS GGGAAGAOGT GTGGCTTCAA AGTGTTGTGT GTGCAGGGGG TKGGTGTGTT 1740 

AA6GGTG0GT TA06GGAACQ TGTGTGCGC6 TGCTGGTGGG CATGTQAOAT GAGTGACTQC 1800 

CGGTGAATGT GTCX31CAQTT QAGAGGTTGQ AQCAGGATGA GGQAATCCTQ TCACCATCAA 1860 

TAATCACTTG TGGAGCGCXA CTTGGCCCAA GA0GCC3VCCT GGGCGGACAG CAGGAGCTCT 1920 

CCATGGCCAG GCTGCCTQTQ TGCATGTTCC CTGTCTGGTO CCCCTTTGCC OSCCTCCTGC 1980 

AAACCTCACA GGGTCGCCAC ACAACAGTGC CCTCCAGAAG CAGCCCCTOQ GAGGCA6AGG 2040 

AAG6AAAATG GGGATG6CT0 GG6CTCTCTC CATCCTCCTT TTC TCCTT GC CTTCGCATGG 2100 

CTGGCCTTCC CCTCCAAAAC CTCCATTCCC CTGCTGCCAG CCCCTTTGCC ATAGCCTGAT 2160 

TTTGGGGAGG AGGAAGGGGC GATTTGAGGQ AGAAGGGQAQ AAAGCTTATG GCTGGGTCTG 2220 

GTTTCTTCCC TTCCCAGAGG GTCTTACTOT TCCAGGGTGG CCCCAGGGCA GGCAGGGGCC 2280 

ACACTATQOC TGCGCCCTGG TAAAGGTGAC CCCT6CCATT TACCAGCAGC CCTGGCATGT 2340 

TCCTGCCCCA CAGGAATAQA ATGGAOGGAO CTCCAGAAAC TTTCCATCCC AAAGGCAGTC 2400 

TCCGTGQTTG AAGCAGACTG GATTTTTGCT CTGCCCCTGA CCCCTTGTCC CTCTTTGAGG 2460 

GAGGGGAQCT ATGCTAGGAC TCCAACCTCA GGQACTCGGG TQGCCTGOGC TAGCTTCTTT 2520 

T6ATACTGAA AACTTTTAAG GTGGQAGGGT GGCAAGQQAT GTGCTTAATA AATCAATTCC 2580 
AAGCCTCAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

8eq ID NOt 92 Protein sequence 
Protein Accession #i Bos sequence 

I 11 21 . 31 41 51 

I I I I I I 

MAGLAARLVL LAGAAALASG SQGDRBPVYR DCVLQCEBQM CSGGALNHFR SRQPIYMSLA 60 

GWTCRDDCKY ECMWVTVGLY liQBGHKVPQP HGKWPPSRPL PFQBPASAVA SFLNGLASLV 120 

MIiCRYRTFVP ASSPMYHTCV AFAMVSLNAW FWSTVFHTRD TDLTEKMDYF CASTVILHSI 180 

YLCCVRTVGL QHPAWSAFR ALLLLHXiTVH VSYLSLIRFD YGYNLVANVA IGLVNWWHL 240 

AMCLWNQRRli PHVRKCWW liLLQGLSLLB LLDFPPLFNV LDAKAIHHX8 TIPVKVLFFS 300 
FLBDDSLYLL KESEDKFKLD 

Seq ID NO: 93 DHA sequence 

Nucleic Acid Accession ft; NM_033419.1 

Coding sequence: .18. .980 

1 11 21 31 41 51 

111''' 

0GA6CCA6GG AGAAAGGATG GCCGGCX:TG6 CG6C60GGTT G6TCCTGCTA GC TGGG GCAG 60 

CGGOGCTGGC GAOCGGCTCC CA0G6GGACC GTGAGCCGGT GTACOQGQAC TGCGTACTGC 120 

AGTGCQAAGA GCAGAACTGC TCTOGGGGCO CTCTGAATCA CTTCOGCTCC CGCC AGCC AA 180 

TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACGACTG TAAGTATGAG TGTATGTQGG 240 

TCACC6TTGQ GCTCTACCTC CAGGAAGGTC ACAAAGT6CC TCAGTTCCAT GGCAAOTGGC 300 

OCTTrCTCCOG OTTCCTOTTC TTTCAAOAGC GGQCATCG6C OQTGQCCTOG TTrCXCAATO 360 

6CCTGGCCAG CCTGGTQATO CTCTGCCGCT ACOGCACCTT CQTGCCA6CC TCCTOCCCCA 420 

TGTACCACAC CTQTGTGGCC TTCQCCTGGG TGTCCCTCAA TQCATGGTTC TGGTCCACAG 480 

TTTTCCACAC CAGG6ACACT GACCTCACAO AGAAAATGGA CTACTTCTGT GCCTCCACTG 540 

TCATCCTACA CTCAATCTAC CTOTGCTGOG TCAOGACOGT GGGGCTGCAG CACCCAGCTG 600 

TGGTCAGTGC CTTCOG G OCT CPCCTGCT6C TCAT6CTQAC CSTGCAOSTC TC CTAOCTO A 660 

GCCTCATCCG CTTOGACTAT GGCTACAACC TGOTGOCCAA OCSTGGCTATT GGCCTGGTCA 720 

ACGTGGTGTO GTCGCTGQCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CAOGTGOGCA 780 

AOTGCGTGGT GGTGGTCTTQ CTGCTGCAGO GQCTGTCCCT GCTOGAGCTG CTTGACTTCC 840 

CACOQCTCTT CTGQGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 

ACOTCCTCTT TTTCAGCTTT CTGQAAGATO ACAGCCT6TA CCTO CTOA AO QAATCftaWO 960 

ACAACnrCAA GCTGOACTGA AGACCTTOQA GCQA6TCTGC COCASTOGGG ATOCT COOOC 1020 

OOOC C T G CTG GOCTOCCTTC TCCCCTCAAC CCTTGAGATQ ATTTTCTCTT TTGA^TTCT 1080 

T6AACTTGGA CATQAA6GAT GT6G6CCCAO AATCATOTGO CCAGCCGACC CCCTGTTGGC 1140 
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CCTCACCAGC CTTGGAGTCT OTTCTACOCSA AGGCCTCCCA GCATCTGGGA CTCX5ACSAGTG 1200 

GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260 

COGGQAGGAC AGCTGCCTGT TTCCTCCCCA TCAOCCTCCT CCCCACATCC CCAGCTGCCT 1320 

6GCTGGGTCC TGAAGCCCTC T6TCTACCTG GGAOACCAGO 6ACCACAGGC CTTAGGQATA 1360 

CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTC C A GGACACCACT AGGTGOTGCT 1440 

QGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGOGATTC TCCCCATGGQ ATCTTQAOGO 1500 

ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560 

GGAGGCCTCA CCATACTCCC TTTCAGGGCX: AGGGCTOCAG CAAGCCCAGG 6CAAGGATCC 1620 

TGTOCTGCTG TCTGGTTGAG AGCCTGOCAC CGTGTGTOBO GAQTGTGOGC CAGGCTGAGT 1680 

GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTQTG TGAGCTCAGG CCTAGGTOOG 1740 

CAGTGTGGAO AOGQGTGTTG TOGGGGAAGA GGTGTGGCTT CAAAGTGTGT OTOTGCAGGG IBOO 

G G TG G GTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCOC GTGCTGQTGG GCATGTGAGA 1860 

TGAGTGACTG C0G0TGAAT6 TGTCCACAGT TGAGAGGTTG GAGCAGGATG AGGGAATCCT 1920 

OTCACCATCA ATAATCACTT GTGGAGOGCC AOCTCTGCCC AAGQOQCCAC CTGGGOQGAC 1980 

AGCCAGGAGC TCTCCATGGC CAGGCTGCCT OTOTGCATGT TCCCTGTCTG GTGCCCCTTT 2040 

Q0C08CCTCC TGCAAACCTC ACAiGGGTCCC CACACAACAQ TGCCCTCCAG AAGCAGCCCC 2100 

TCGQAGGCAO AGGAAGGAAA ATGGGGATGG CTOOGGCTCT CTCCATCCTC CTTTTC TCCT 2160 

T0CCTT06CA TGGCTGGCCT TCCCCTCXAA AACCTCCATT CCCCTGCTGC C3^GCCCCTTT 2220 

GCCATA6CCT GATTTTGGGG AGGAGGAAGQ GGCGATTTQA GGQAQAAGGG GAGAAAGCTT 2280 

ATGGCTGGGT c m UTiT C r r CCCTTCCCAG AGGOTCTTAC TGTTCCAGGQ TGGCCCCAQG 2340 

GCAGGCAG6G GCCACACTAT GCCT6G6CX:C TGGTAAAGGT GACCCCTGCC AT TTAC CAGC 2400 

AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AOAATGGAGQ GAGCTCCAGA AACTTTCCAT 2460 

CCCAAAGGCA QTCTCXX3TGG TTGAAGCAQA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 

TCCCTCTTTG AGGGAGGGGA GCTATGCTAG QACTCCAACC TCAGGGACTC GGOTGGCCTG 2580 

CGCTA6CTTC TTTTGATACT QAAAACTTTT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 

Seq ZD NO: 94 Protein sequence 
Protein Accession #t MP_219487.1 

1 11 21 31 41 51 

I I I I I I 

MKDVGPESCG QPTPCWPSPA L6SVLGXASQ HLQLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACPLPISLL PTSPAAWLGP EALCLPGRPG TTGLRnTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQG9S KPRARILCCC 180 
LVBSXiPPCVG SVGQAECIGD RAVSH8U3VC ELRPRCAVHR RVL8GKRGQP KVCVCRGWVC 

Seq ID NO: 95 DMA sequence 

Nucleic Acid Accession St XM_090469 

Coding sequence t 

I 11 21 31 41 51 

] I I I I I 

ATQGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAQ GAACGTOGAA AQATAGCTCC 60 
TGCCGCGTGC TGGACRGGAC QCAGGTGAAC AAGCCCTGCC TGCTGQAGGC TGCCAAGCAC 120 
AGCCCACACC CCTTQCATGO ACASTCGGTC AACAGCCACC CATCAGGT6C TCATCAGAAA 180 
CCACCTOAAG TGAAAATOTT TGGAGCCAGT CAAGOTTTGC TOACAATGQA AACAAACCAO 240 
TCCCTGGCAC AAGGCACAGG CTQCTC3«»rG TGCTGGGAGG TQAAOGGCCC AGGGTTGGAA 300 
ATGTCTCTTC CTGGAGTGCT CAGTGCTGAT GCTGGCCAGG TGGAGCACAG AAGAOUVATG 360 
AACTC06CAG ATACTGACTG GGOAACTGAG GAGQOACCAO GAAGTCTAQA TT TCGC CGTG 420 
0CT60CCACC AGGAA6ACAC TTTCTTTTTO AAAOACATCA AOCAGACAAQ TA08TTCAG0 480 
CAATCRGTTC AGCAGCAGAA TTOCATTTAC AOCCCCAGAO AAAAAOOCTG TGGGAATQTC S40 
AGGGCACCTT GOQOGCCACX: AAGGCGAGAG GCCCOGCTTQ GGCTCAGCAO GCCCTGGCQG 600 
CTCAGCCGCA GCCCGGCGCC GTCCCCAOGT ACACCCATQG OGCXXTTCCC GACCTCTQAC 660 
CXSCOAGCTAO ACGC6CCAG0 CCCGCCCCCC GGGCTGOGGA GCT06GCAGC AGCCCCTCAC 720 
TG C CT GO COG CGGCTCCAGA AGCCCaOAAA CTCTTCTTCC TGCIOCOOCr TTATCCAGAT 780 
GGCAGGOCAC CACCTAAflOA CATCRGCAO ACSCTCCHGC ACAA06CACC TOGGAAOAGA 840 
CT6CATCAAG AATCACCAGG TTCATTTCCA ATAGGTTTAT GTAACAOAAC AOCTG AACCA 900 
TGTATCAAAO GGGCTCTGGG TGTO Q CTGCT GAAOCAOCAT TTCACTTACA GTTTTCCAGT 960 
OAATGA 

Seq ID NO: 96 Protein sequence 
Protein Accession #: XP_090469 

1 11 21 31 41 51 

I 1 I I I I 

KGFGDQGTVB 6SL6TSKDSS CRVUSRtQVM KPCLLEAAKH SFHPIiHQQSV NSHPSGAHQK 60 

PPEVKMFGAS QGLLTMETNQ SIAQGTGCSV CWEVNOPOLE MSLPGVLSAD AGQVEHRRQM 120 

NSADTDWQTB BOPGSLDFAV AAHQBDTFFI* KDIKHTSTPR QSVQQQNCIY SPREXPOGNV 180 

RAFCAPPRRE APLALSRPNR LSRSPAFSPR TPMAPFPTSD RBXiDAPGPPP GXiRSSAAAPH 240 

CLPAAPBAQK LFFLLPLYPD GSPPPKDIIiQ TLQKKAPGXR LRQESPGSFP IGLOIRTABP 300 
CIXGALGVAA BAAFBLQFSS E 

Seq ID NO: 9*7 DNA sequence 

Nucleic Acid Accession #i NM_003474.2 

Ccxling sequence: 37.. 3036 

1 11 21 31 41 51 

I 1 I 1 I I 

CACTAAGSCT CTTCCTAGTC CCC6GQCCAA CT066ACA6T TTGCTCATTT ATT6CAACGG 60 

TCAAGOCTGO CITGT G CCAG AAOGGCXXXSC G06CGA06CA OOCACACACA GGGGGQGAAA 120 

CrmTTAAA AATQAAAGGC TA6AAGAGCT CAGG00GG6C G0GGGC06TG CXK36AGGGCT 180 

COSGAGCIGA CT06CCX3AGO GAGGAAATCC CTG0GGT06C GAC36CC06GC CCC6CT0GGC 240 

OGCCGGGTGa GATGGTGCAG OGCTGGCCGC CG060CGOA6 A6CT6CT6CA CTQAAGGCCG 300 
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G06AOC3ATGG CAG0G06CCC QCTQCCCQTO TCCCCCGCCC GCXSCCCTCCT GCTCGCCCTG 360 ' 

GCOGGTGCTC TGCTOGCGCC CTGCGAGGCC OGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 

GCTGATGAAG TTGTCAGTGC CTCTGTTOGO AGTQGGGACC TCTGGATOCC AGT6AAGAGC 480 

TTCGACTCCA AGAATCATCC AGAAGTGCTO AATATrOSAC TACAAOGQOA AAOCAAAQAA 540 

5 CTGATCATAA ATCTGGAAAQ AAATGAAGGT CTCATTGCCA GCAOTTTCAC GGAAACCCAC 600 

TATCTOCAAG AOSOTACTGA TGTCTCCCTC GCTCX3AAATT ACACGGTAAT TCTGGGTCAC 660 

TGTTACTACC ATGQACATGT ACGGGGATAT TCTGATTCAG OVGTCAGTCT CAGCAC3GTGT 720 

TCTGGTCTCA GGGGACTTAT TQTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 

AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAQAAGC TGAAAAGCGT COGGGQATCA 840 

10 TGTGGATCAC ATCACAACAC ACCAAACXTTC 6CTQCAAAGA ATGTGTTTCC ACCACCCTCT 900 

CAGACATGGO CAAGAAGQCA TAAAA6AGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCT6 960 

OTGATOntJG CAGACAACG6 AGAGTTTCAG AGGCAAGGAA AAGATCT60A AAAAOTTAAG 1020 

CAGOGATTAA TAOAQATTGC TAATCAOGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 

ATCGTGTTOQ TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1140 

15 CCATTCACCA GCCTCCATGA ATTTCTOOAC TGGAGGAAOA TGAAGCTTCT ACCTCGCAAA 1200 

TCCCATGACA ATG0QCA6CT TQTCA6TGGG GTTrATTTCC AAGGGACCAC CATCGGCATG 1260 

GCCCCftATGA TQAGCAT6TG CAOGOCAGAC CAGTCTG6G0 GAATTGTGAT GGACCATTCA 1320 

GACAATCCCC TTGGT6CAGC OGTGACCCTG 6CACAT6AGC TGGGCCACAA TTTCGGGATG 13 BO 

AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGO OGGTTGAOAA AGGAGGCTGC 1440 

20 ATCATGAAOO CTTCCACX3GG GTACXXaTTT CCCATGGTGT TCA6CAGTTG CAGCAGGAAG 1500 

6ACTTGGASA GCAOCCTGGA GAAAOGAATG GGGGTGT6CC TGTTTAACCT GC06GAA6TC 1560 

AGGGA UTL T f TOGOOQGCCA GAAOTOTGOG AACAGATTTG TGGAAiQAAOG AGAGGAGTGT 1620 

QACTOTGGGG AGCCAGAGGA ATGTATGAAT OGCTGCTGCA ATGCCACCAC CTGTACCCTQ 1680 

AAGCCGGACX3 CTGTGTGOGC ACATGGGCTG TGCTOTGAAQ ACTGCC3«3CT QAAGCX^t^ 1740 

25 GOAACAQOGT 6CAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGQCC IB 00 

AGCCCTCACT GCCCAGCCAA GGTGTACCTG CACGATG6GC ACrCATOTCA OGATOTGGAC 1860 

GGCTACTOCT ACAATQGCAT CTGOCAOACT CACGAOCAGC AOTGTOrCAC ACTCTGGGGA 1920 

CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AOaTOATCXrr 1980 

TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AAT0C3GAGAT GAGAGATGCT 2040 

30 AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCC3VGC0GQC CAGTCATTGG TACCAATGCX 2100 

OTTTCCATAO AAACAAACAT CCCCCTGCAQ CAA6QAGGCC GQATTCTOTQ CCGGGGGACC 2160 

CAOSTGTACT TOSGCOATGA CAT6C0G6AC CCAGGGCTTG TGCTTGCA6Q CACAAA6T0T 2220 

GCAGATGQAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 

aAOTGTGCAA TGCAGTGCCA OQGCAQAQOG GTGTGCAACA ACAGGAAQAA CTGCCACTGC 2340 

35 GAGGCCCACT GGGCACXnXX: CTTCTGTQAC AAGTTTGGCT TTGGAG6AAG CACAGACAGC 2400 

GGCOCCATCC GGCAAGCAGA TAACCAAOOT TTAACCATAO GAATTCTOaT QAOCATCCTG 3460 

' imirrivim ctgcoggatt tqtggtttat ctcaaaagqa aqaccttgat aosactgcto 2S20 

TTTACAAATA AGAAGACCAC CATTOAAAAA CTAAGGTGTG TGOGCCCTTC COGGCCACCC 2580 

CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG QAAAAGGCCT GATGAGGAAQ 2640 

40 CCQCCAGATT CCTACCCACC OAAGOACAAT CCCAGGAQAT TGCTGCAGT6 TCAQAATGTT 2700 

GACATCAGCA GACCCCTCAA OGGCCTQAAT QTCCCTCAGC CCCAGTCAAC TCAGGGAGTG 3760 

CTTCCTCCCC TCCAC06GGC CCCyVOOTQCA CCTAGOGTCC CTOCCAGACC CCTGCCAGOC 2830 

AAGCCTGCAC TTAGGCAGGC CCAGGGOACC TGTAAGCCAA ACCCOXTCA GAAGCCTCTG 2880 

CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTC3VTQ CCTTGGCCAG GACCCCAGGA 2940 

45 CAATGGGAGA CTGGGCTCCG CX^XSGCACCC CTCAGACCTO CTCCACAATA TCCACACCAA 3000 

QTOOOC3U3AT CCACCCACAC CQCCTATATT AAGTQAGAAO COGACACCTT TTTT CAACAG 3060 

T6AAQACAGA AGTTTGCACT ATCTTTCAGC TCCAOTTGGA GTTTTTTOTA CCAACTTTTA 3120 

GOATTTTTTT TAATOTTTAA AACATCATTA CTATAAGAAC TTTQAGCTAC TGCOGTCAGT 3180 

GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAAT TTAT 3240 

50 GCAGAAT6TT GATTACAGTG CA6TG0GCTG TAOTAGGCAT TTTTACCATC ACTGAG TTTT 3300 

CCATGGCAGG AA06CTTGTT GTQCTTTTAG TATTTTAGTO AACTTGAAAT ATOCTGCTTQ 3360 

ATGGGATTCT GGACAG6ATG ' iVmOC TTT CTGATCAAGQ CCTTATTQGA AAOCAOTCCC 3420 

CCAACTACCC CCAGCTGTGC TTATGQTACC AGATGCAGCT CAAGAGATCC C3UW3TAGAAT 3480 

CTCAGTTGAT TTTCTGQATT CCCCATCTCA GGCCAGAGCC AAGGGOCTTC AGGTCCAGQC 3540 

55 TGT6TT7G3C TTTCAGGGAO OCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 

ACACCTGGGA 6AAATCTQGC TTCTGGGCAO GAAGCTTTGG TGAGAACCTO GGTTGCAOAC 3660 

A6GAATCTTA AG6T0TAGCC ACACCAG6AT A6AGACTG0A ACACTAGACA AGCCAGAACT 3720 

TQACCCTGAG CTQACCAGCC GTQAGCATGT TTGQAAGGGG TCTGTAGTGT CACTCAAGQC 3780 

■ GGTGCTTGAT AOAAATGCCA AGCACTTCTT TTTCTOGCTG TCCTTTCTAG AGCACTGCCA 3840 

60 CCA6TAGGTT ATTTAGCTTG GGAAAGGTGO TGTTTCTGTA AGAAACCTAC TG CCCAG GCA 3900 

CTGCAAAC06 CCACCTCCCT ATACTGCTTG OAGCTGAGCA AATCACCACA AACTO TAATA 3960 

CAATQATOCr GTATTCA6AC AOATGAGGAC TTTCCATGGO ACCACAACTA TTTTCAGATG 4020 

TGAACCATTA ACCAGATCTA GTCAATC3U«3 TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 

AATTAGGCAQ ACTCTTTATG CTTOCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

65 TATAGTTCAT GTCTQCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCT ATGGGQ 4200 

CATCCTCTTT TTCCAAGTTQ GCT6CAG6AA TCTTTAAAAG AtGCTTTTAA GAGAGTCTGA 4260 

ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACaGAATQTO ATAA GGAA AT 4320 

CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATQTGGC TTGGOCAQCA TCCCCTTGAA 4380 

CTCTTCACTC TTCAAATOCC TGACTAGGQA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

70 AATGGCAT6A GAAATACAAA AATACTCA6A TAAQQTAAAA TGCCAT6ATG CCTCTQ TCTT 4500 

CTGGACTGQT TTTCACATTA GAAOACAATT GACAACAGTT ACATAATTCA CTCTQAGTOT 4560 

TTTATGAQAA AOCCTTCTTT TGGGGTCAAC AOTTTTOCTA TGCTTTGAAA CAGAAAAATA 4620 

TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 

• TaXCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTG ACAC 4740 

75 AAACACACAC AAAAGQGAAC CCAGCTCTAA TACATTCCAA CTCQTATAGC ATOCATCTGT 4800 

TTATTCTATA OTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTOGAAAAT AATACTGCTO 4860 

AGATACATAC AQAATTACTG TAACTGATTA CACTTGGTAA TTGXACTAAA GOCAAACATA 4920 

TATATACTAT TAAAAAGGTT TACAGAATTT TATOGTGCAT TA09TGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAQAT CTGGCATGTT AGCOCTTCCT CCAATTATAA GAGGATATQA 5040 
oO ACCAAAAAAA AAAAAAAAAA AA 

Seq ZD HOi 98 Protein aequence 
Protein Accession 0: NP 003465 
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1 11 21 31 41 51 

1 I I I I i 

MAARPLPVSP ARALLLALAG AltLAPCEARG VSLHNB6HAD BWSASVUSG DLWIPVKSFD 60 

SXMKPBVLNI RLQRESXEIiI XMLESNBOIiZ ASSFTBTHYL QDGTDVSIAR HYTVXXiGHCy 120 

YHGHVRGYSD SAV8LSTCS6 LRGIiZVFEME SYVLBPMKSA TNRYKLFPAK KLKSVRGSOG 180 

SHHNTPNLAA KNVFPPPSQT WARRHKRETl. KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 

LIEIANHVDK FYRPLNIRIV LVGVBVWNDM DKCSVSQDPP TSLHEPLDWR KMKLLPRKSH 300 

DNAQLVSGVY FQOTTIGMAP IMSMCTASQS GGIVKDKSQN PLGAAVTIAH SLGHNFGMNH 360 

STLDRGCSGQ NAVBXGaCIM NASTGYPFFM VF8SCSRKDL BTSLBKGM6V CLFNLPBVRE 420 

SF6GQK06NR FVEEGBBCDC 6BPBECKNRC C33ATTCTLXF DAVCAHGLCC BDCQLKPAOT 480 

ACRDSSNSCD LPEFCTOASP RCPANVYLHD GHSC3QDVDGY CYNQICQTHE QQCVTLWGPG 540 

AKPAPGICFE RVNSAC5DPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 

IBTNIPLQQG GRILCRGTHV YLCa)DMPDPO LVIiAGTKCAD GKICLNRQOQ NXSVFGVHCC 660 

AMQGHGRGVC MNRKNCHCBA HWAPPFOIKF GFGGSTDSGP ZRQADHQGLT IGIXiVTXLCL 720 

LAAGFWYLK RKTLIRLLFT NKXTTIBKLR CVRPSRPPRG FQP0QAKL6H XiGKGLMRXPP 780 

DSYPPKDNPR RLLQCQNVDI SRPLMGUTVP QPQSTQRVLP PLKRAPRAPS VPARPLPAKP 840 

ALRQAQGTCK PNPPQKPliPA DPLARTTRLT HALARTPGQH BTGLRLAPLR PAPQVPHQfVP 900 
RSTHTAYIK 

8eq ZD 170: 99 IMA sequence 

Nucleic Acid Accession #t im_003714 

Coding sequence: 135.. 1043 

1 11 21 31 41 51 

111)11 

GAGGAGQAGG QAATU^GGCGA GCAAAAAGGA AGAGTGGGAG GAGGAGGGGA AGOGGCGAAG 60 

GAGGAAGAGG AGGAGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGAGGT 120 

TAATACCAAQ AACCATQTGT GCOGAGCGGC TGGOGCAGTT CATGACCCTO GCTTTGGTGT 180 

TGGGCACCTT TGACC0GGG8 OGGGGGACGQ AOBCCAGCAA CCCACCCGAG GGTCCCCAAG 240 

ACAG6AGCTC CCAGCAGAAA G6C0GCCTGT CCCTGCAGAA TACA6CGGAG ATCCAGCACT 300 

GTTTGGTCAA CGCTGGCGAT GTGGGGTGTG GCGTGTTTGA ATGTTTCOAG AACAACTCTT 360 

GTGAGATTCG GGGCTTACAT GGGATTTOCA TQACTTTTCT GCACAACGCT GGAAAATTTG 420 

ATQCCCAGGG CAAGTCATTC ATCAAAGAC6 CCTTGAAATG TAAGGCCCAC GCTCTGOGGC 480 

ACAGGTTCGG CTGCATAAGC GGGAAOTOCC OGGCCATCAO QGAAATGSTG TCCCAGTT6C 540 

AGCGGGAATG CTACCTCAAO CACOACCTGT GOGCGGCTGC CCAGGAQAAC ACCCGGGTGA 600 

TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACX5A ACCCTACGTG GACCTCGTGA 660 

ACTTGCTGCT QACCTGTGGQ GAGGAGGTGA AGQAGGCCAT CACXXACAGC GTGCAGGTTC 720 

AGTGTGAGCA GAACTGGGGA AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCX3GCCATCC 780 

A6AAGCCTCC CACGGCGCCC CCCGAGCGCC AGCCCCAGGT GGACAGAACC AAGCTCTCCA 840 

GGGCCCACCA CGGGGAAGCA GGACATCACC TCCCAGAGCC CAGCAGTAGQ GAGACTGGCC 900 

GAGGTGCCAA GGGTGAGOGA GGTAGCAAGA GCCACCCAAA CGCCCATGCC CGAGGCAGAG 960 

TCGGGGGCCT TGGGGCTCAQ GGACCTTCCG GAAGCAGCGA GTGGGAAGAC GAACAGTCTG 1020 

AGTATTCTGA TATCOGGAGG TGAAATGAAA GGCCTGGCCA OGAAATCTTT CXn'CCACXSCC 1080 

GTOTVTTTTC TTATCTATGO ACATTCCAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140 

AOGCAGGATT CTGTGGGGAC TGTGGACTTC ATCGAGGTGT GTGTTCGCGG AAOSQACAGG 1200 

TGAQAT6GAG ACCCCTGGGG CCGTGGGGTC TCAGGGGTGC CTGGTGAATT CTGCACTTAC 1260 

ACGTACTCAA GGOAGCGCGC CCGOGTTATC CTCGTACCTT TGTCTTCTTT CCATCTGTGG 1320 

AGTCAGTGGG TGTCGGCCX3C TCTGTTGTGG GGGAGGTGAA CCAGGGAGGG GCAGGGCAAG 1380 

GCAGGGCCCC CAGAGCTGGG CCACACAGTG GGTGCTGGGC CTCGCCC06A AGCTTCTGGT 1440 

GCAGCA6CCT CTGGTGCTGT CTCOGCGGAA GTCAGGGCGG CTGGATTCCA GQACAGGAGT 1500 

GAATGTAAAA ATAAATATOQ CTTAQAATGC AGGAGAAGGG TGGAGASGAG GCAGGGGCCX3 1560 

AGGGGGTGCT TGQTGCCAAA CTGAAATTCA GTTTCTTGTG TGGGGCXTTG CGGTTCAGAG 1620 

CTCrrGGOGA GGGTQGAG6Q AGGAGTGTCA TTTCTATGTG TAATTTCTOA QCCATTGTAC 1680 

TGTCTGGGCT GGGGGOGACA CTGTCCAAGG GAGTGGCCCC TATGAGTTTA TATTTTAACC 1740 

ACTGCTTCAA ATCTOSATTT CACTTTTTTT ATTTATCCAQ TTATATCTAC ATATCTGTCA 1800 

TCTAAATAAA TG6CTTTCAA AC3UUW3CAAC TGGGTCATTA AAACCAGCTC AAAGGGGGTT 1860 

TAAAAAAAAA AAAACCA6CC CATCCTTTGA GGCTGATTTT TCTTTTTTTT AAGTTCTATT 1920 

TTAAAAQCTA TCAAACAGOG ACATAOCCAT ACATCTGACT GCCTGACATO GACTCCTGCC 1980 

CACTTGGGGG AAACCTTATA CCXMAGGAA AATACACACC TGGGGAGTAC ATTTGACAAA 2040 

TTTCCCTTAG GATTTCGTTA TCTCACCTTG ACCCTCAGCC AAGATTGGTA AAGCTG06TC 2100 

CTGGCGATTC CAGGAGACCC AGCTGGAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160 

GAAAGAAGAG AATGAAGACT ACTTAGTAAT TCCCATCAG6 AAATGCTQAC CTTTTACATA 2220 

AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCC3M3ATCC TAATTG6AAA 2280 

TTTAGCAATA AGGAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAOAaA GAGAGAOAOA 2340 
GGGAGAGGAA OAAAAGAGAG AGAQAAAAGA GCCTOGTGCC 

Seg ID MOi 100 Protein sequence 
Protein Accession #: NP_003705 

1 11 21 31 41 51 

I I I I I I 

MCABRLGQFM TIALVZiATFD PAROTDATNP PB6PQDRSSQ QKGRLSLQNT ABIQHCLVNA 60 

GDVGCGVFEC FENNSCEIRG UIGICI4TFLH NAGKFDAQGK SFIKDALKCK AHALRHRFGC 120 

ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLUIEP YVDLVNLLLT 180 

CGBEVKBAIT BSVQVQCEQN HGSLCSILSF CT8AIQKPPT APPERQPQVD RTKI^SRAHHG 240 

BAGKKLPBPS SRETGRGAKG ERGSK8HPKA KARGRVGGLO AQGPSQSSEW EDBQSEY8DI 300 
RR 

Seq ID NOt 101 DMA sequence 
Nucleic Acid Accession #: 1IM_005940 
Coding sequence t 23.. 14 8 9 

1 11 21 31 41 51 

I I I 1 I i 
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AAOCCCAGCA GCCOCGGGGC GGATGGCTCC GGCOGCCTGG CTCOGCAGCO CXSGCGGOGCXS 60 

OGCCCTCCTO (XCCOGATGC TGCTCCTGCT GCTCCAGCCG C0GCX3GCTGC TGGCCCGC50C 120 

TCTGCOGCOG GAOGTGCACC ACCTCCATGC OOAGAOGAGQ GGGCCACAGC CCTGGCATGC 180 

AGOOCTGCCC iMSTAGCCOGG CAGCTGGCCC TGCCAOQCAO GAAGCCGCCC OGCCTGCCAO 240 

CA6CCTCAGG CCTCCCCGCT tfltSGOOT GC X: CQACCCATCT GATGGGCTGA GTGCCOGCAA 300 

CCGACAGAAG AGGTTCGTGC ITi Cl tS G O GS GOGCTGGGAG AAGACG6ACC TCACCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA QOACCAGGTO OGGCAGAOGA TGGCAGAGGC 420 

CCTAAAGGTA TGQAGCGATQ TQACGCCACT CACCTTTACT 6AGGT6CA0Q AGGGCCGTGC 480 

TGACATCATO ATOQACTTCO CCAGGTACTG GCATGGGGAC GACCTGCCGT TTQATGGGCC 540 

TGGGGGCATC CT GG CCCATG CCTTCTTCCC CAAGACTCAC 06AGAAGGGG ATGTCCACTT 600 

OGACTATGAT QAGACCTGGA CTATCGGGGA TGAOCAGGGC ACAGACCTGC TOCAGOTGGC 660 

AGCCCATGAA TTTGGCCACQ TGCTGGGGCT GCAGCACACA ACAGCA6CCA AGGCCCTGAT 720 

GTCOGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGOGOGT 780 

TCAACACCTA TATGGCCAGC CCTGOCCCAC TQTCACCTCC AGGACOOCAO CCCTGGQCCC 840 

OCAGGCTGGO ATAGACACCA ATGAaATTGC AG0GCTGGA6 CCAGAGGGCX! 06CCAGAT6C 900 

CTGTGAGGCC TCCTTTGACG OGGTCTCCAC CATCOGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTQ TGGOGCCTCX: GTGGGGGCCA GCTGCAGCXX: GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTX3CC TTCGAGGATO CCCAGGOCCA 1080 

CATTTGGTTC TTCCAAGGTG GTCAGTACTG GGIGTACiaAC QGIGAAAAOC CAGTCCTGGG 1140 

CCCCOCACCC CTCACCX3AQC TGGGCCTOGT 6AGGTTCC0G GTCCATGCTG CCTTGGTCTG 1200 

GGGTCCCGAG AAQAACAAGA TCTACTTCTT COSAGGCAGG GACTACTGGC GTTTCCACCC 1260 

CAGCACXXGG CGTQTAGACA GTCXXXTTGCC CCQCAGGGCC ACTQACTGGA GAGGG6TGCC 1320 

CTCTGAGATC QACGCTGCCT TCCAGGATGC TGATGGCTAT GOCTACTTCC TGOGGGGCCG 1380 

CCTCTACTGG AAGTTTGACC CTGTGAAGOT GAAGGCTCTO QAAGGCTTCC CCOGTCTCOT 1440 

GGGTCCTGAC TTCTTTGGCT QTGCC6AGCC TGCCAACACT TTCCTCTGAC CATGGCTTGO 1500 

ATGCCCTCAG GGGTCCTGAC CCCTGCCAGG CCAOGAATAT CAGGCTAOAG ACCCATGGCC 1560 

ATCTTTQTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGQATGG 1620 

GGTGGGGTAC AACCACCATG ACAACTGCCG GQAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 

AGOGACTGTC TCAGACTGGG CAGGGAGGCT TTGQCATGAC TTAAGAG6AA GGGCAGTCTT 1740 

GGQACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT OTCCCTCAOO 1800 

QTAGCACCAT GGCAGGACTG GGGGAACrGG AGTGTCCTTQ CTGTATCCCT GTTGTGAGGT 1860 

TCCTTCCAGG GQCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTQQC 1920 

TGAGCAACTG GGCTGTAGGG CAGGGCCACT TCCTQAGGTC AGOTCTTGOr AGGTGCCTGC 1980 

ATCT6TCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 

GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTOGAGGCTG 2100 

CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCOCTTTTCQ CA GCACTGCT 2160 

ATCCTCCAAA GCCATTCSTAA ATGTGTGTAC AGTGTGTATA AAOCTTCTTC TTCTTTTTTT 2220 
TTTTTAAACT GAGQATTGTC ATTAAACACA GTTGTTTTCT 

Seq ID 190: 102 Protein eequeoce 
Protein Accession ftx NP_005931 

1 11 21 31 41 51 

I I I I I I 

MAPAAWLRSA AARALLPPMl* UiliLQPPPI^ ARALPPDVHH LHAERRGPQP WHAALPSSPA ■ 60 
PAPATQEAPR PASSLRPPRG OVPDFSDGIiS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 120 
liVQBQVRQTM ABALKVHSDV TPLTFTBVHB GRADIMIDPA RYWHGDDLPF DGPGGILAHA 180 
FFPKTHREGD VHPDYDBTWT IGDDQGTDLL QVAAHEFQHV LGLQHTTAAK AI/4SAFYTFR 240 
YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN EIAPLEPDAP PDACEASPDA 300 
VSTIRGELFP FKAGFVWRLR GGQIiQPGYPA liASRHWQGLP SFVDAAPEDA QGHIWFFQGA 360 
OyHVyCGBRP VLQPAPLTEL 6LVRFPVHAA LWOPBKNKI YFFRGRDYWR FHPSTRRVDS 420 
FVPRRATOWR GVP8EZDAAF QDADOyAYFXi RGRLYHKFDP VKVKALSGFP RLVGPPPFGC 480 
AEPANTPL 

Seq ID NO: 103 DMA sequence 

KUeleic Acid Accession fti 2m_033l5l.2 

Coding sequence > 3 5 1 .. 44 99 

1 11 21 31 41 51 

I I I I I 1 

ACT6GGATAA AGCAAGAAGA CTGATTTTAT GAOCAGGGGT TTQATACATC AAAGG AGATT 60 

G0CCAG6ATC AAGGGT6CG6 TGrr G GG GG T GGOTTGGGGA GG6T0QTTAG AGAAGGTTTC 120 

ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATOTTTAAA AAOAGGOATC AAGCACAGGC 180 

TAAGGAGAGG AAAGAGCAGQ CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 

GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC QAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 

CTTTTCATAT TCTCTGATTC TGGGAAACQA AOAATTGGCA G6AACT6AAA ATGACTAGGA 360 

AGAGQACATA CTGGGTG C CC AACTCTTCTQ GTG6CCTC6T GAATGOTGGC ATCGACATAG 420 

GCQATGACAT GQTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 

GTCAGCAAQA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGQGGGAAGT 540 

ATGATGCTGC CTT6AQAACC ATGATTCCCT TC06TCCCAA GCOGAOGTTT CCTGCCCCCC 600 

AGCCOCTGOA CAATQCTGGC CTGTTCTCCT AOCTCACOOT GTCATGGCTC ACC COGCT CA 660 

TGATCCAAA6 CTTAOGGAGT CGCTTAQATG A6AACACCAT CCCTCCACTO TCAQTCCATG 720 

ATGCCTCAGA CAAAAATGTC CAAAGOCTTC ACC6CCTTTG GGAAGAAQAA GTCTCAAGGC 780 

GAGGGATTGA AAAAQCTTCA GTGCTTCTGG T6ATGCTGAG GTTCCAGAGA ACAAOOTTGA 840 

TTTTCGAT6C ACTTCTGGQC ATCTGCTTCT GCATTGCCAG TGTACTOGGG CCAATATTGA 900 

TTATACCAAA GATCCTQQAA TATTCAGAAG AGCAQTTGG6 GAATQ TTOT C CATGGAGTGG 960 

GACTCTGCTT TGOCCTTTTT CTCTOCQAAT GTGTGAAGTC T CTQAG TTTC TCCTC CAGT T 1020 

GQATCATCAA CCAACGCACA QCCATCAGGT TCCGAGCAGC TGTTTOCTCC TTTQCCTTTG 1080 

AQAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1140 

TCACCGOTGA TOTAAACTAC CTGTTTOAAO GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 

CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGQA TACACTGCAT 1260 

TTATIGCCAT CTTATGCTAT CTCCTGOTTT TCCCACTGGC GGIATTCATO ACAAGAATGG 1320 

CTGTGAA06C TCAGCATCAC ACATCT8AO0 TCAG06ACCA GGGCATGOaT GTGACCAGTG 1380 

AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG QQAGAAACGA TTTGCAAAAA 1440 
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TCATTGAAOA CCTAAGAAGG AAGGAAAGGA AACTATTGOA GAAGTGOGGO CTTGTCCAGA 1500 
QCCTCACAAG TATAACCTTG TTCATCATCC CCACAGTQGC CACAGCGGTC TGQGTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG OGTCAATGOC CTTCAGCAT6 CTGGCCTCCT 1620 
TGAATCTCCT TOGGCTGTCA tfii aT TCTTO TOCCTATTGC AGTCAAAGGT CTCRCB AATT 1680 
5 CCAAQTCTQC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GOAGAGCCCT GTTTTCTATO 1740 
TCCAQACATT ACAAGACCCC AGCAAAGCTC TCGTCrTTGA GGAGGCCACC TTGTCATGGC IBOO 
AACAOACCTO TCXXGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860 
CTQAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GOAAGAAGGG AACAGCCTGQ 1920 
GCCCAQAGTT GCACAAOATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCQ 1980 
10 GCAACACXK3G GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGOACaAQ ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGOTG CAGGGAAGCC TCGCCTATQT CCCCCAGCAQ GCCTGGATCG 2100 
TCAGOGGGAA CATCAGGGAO AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGQACCTGQA ACTTCTGCCC TTTGGAGACA 2220 
TGACAGAGAT TGGAGAGOGG GGCCTCAACC TCTCTGGGGG GCA0AAACA6 AGQATCA6CC 2280 
15 TGGCC0GCX3C CGTCTATTCC GACOGTCAGA TCTAOCTGCT GGACGAGCGC CTOTCTGCTG 2340 
TGGACGCCCA OGTGGGGAAO CACATTTTTG AGGAGTGCAT TAAGAAGACA CTCAGGGGGA 2400 
AOAOGOICGT CCTGGTGACC CACCAGCTGC AGTACTTAQA ATTTTGTGGC CAOATCATTT 2460 
TGTTGGAAAA TGGQAAAATC TGTGAAAATG QAACTCACAO TGAGTTAATG CAGAAA AAGG 2S20 
QGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 
20 ACACAOCAAA GATAGC3VGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGOCC ACCTCCCTGO 2640 
AAGAGTCTCT CAAOGOAAAT QCTGTGCOGO AQCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAOGCTC CTTQAGTTGQ AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCQT CTTCTTAACG ATCTTCAGCT 2820 
TCTGQTGGCT QAGCTACTGG TTGGAGCAGG GCTOGGGGAC CAATAGCAGC CGAGAGAOCA 2880 
25 ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TOGTGTACGG GCTCAAOOCC CTGCTCCTCA TCT G TGTGGG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CAOGAGGAAG GCATCCACX3G CCCTOCACAA CAAGCTCTTC AACAAGGTTT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGO CCGGCTTTTQ AACTGCTTCG 3120 
CAGGGGACTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAOCAQ TTCCTGGTCC 3180 
30 TGTCCTTAAT GGTGATCGCC GTCCTOTTGA TTGTCAGTGT 6CTGTCTCCA TATATCCTGT 3240 
TAA1GGGAGC CATAATCATG GTTATTTOCT TCATTTATTA TATQATOTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG QAGAACTATA GCOGGTCTCC TTTATTCTCC CACATCCTC3V 3360 
ATTCTCTGCA AGGCXTTGAGC TCCATCCATG TCTATG6AAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAQ GCT6ACTGAT GCGCAGAATA ACTACCTOCT GTTGTTTCTA TCTTCCACAC 3480 
35 GATGGATGGC ATTGAGQCTG OAGATCATGA CCAACCTTGT GACCTTGOCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGO^TTTCC TCGACCCXXTT ACTOCTTTAA AGTCATGQCT OTCAACATCG 3600 
TGCTGCAGCT GGGGTOCAGC TTCCAGQCCA CTGCOOQGAT TOGCTTGGAa ACAGAGGCAC 3660 
AGTTCACGQC TGTAGAGAGG ATACTGCAGT ACATGAAGAT GTCTGTCTCa GAAGCTCCTT 3720 
TACACATGGA AGGCACAAOT TGTCCCCAGQ GGTQGCXACA GCATGGGGAA ATCATATTTC 3780 
40 AGGATTATCA CATSAAATAC AQAGACAACA CACXX^COGT GCTTCAOQGC ATCAACCTQA 3840 
CCATCC3SCGG CCAOQAAOTG GTGGQCATOG TGGGAAGGAC GGOCTCiaGa AA6TCCTCCT 3900 
TGG6CATGGC TCTCTTCCGC CTGOTGGAGC CCATGGCA6G COGGATTCTC ATTGACGGCX3 3960 
TGGACATTTG CAGCATCGGC CTGGAGGACT TGOGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
ATCCAGTGCT GCTCTCAGQA ACCATCAGAT TCAACCTAGA TCCCTTTGAC OQTCACACTG 4080 
45 ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCX: 4140 
CCAAAAAGCr GCATACAGAT GTGGTGGAAA AC6GTGGAAA CTTCTCTGTG OGGGAOAOGC 4200 
AGCT 6 CTCTQ CATTGCCAGG 6CTGT0CTTC GCAACTCCAA GATCATCCTT ATGGATGAAO 4260 
CCACAGCCTC CATTGACATG 6AGACAQACA CCCTQATCCA 0CX3CACAATC COTQAAGCCT 4320 
TCCAGOGCTG CACCGTGCTC GTCATTGCCC ACXXTIGTCAC CACTGTGCTG AACTGTGACC 43 BO 
50 ACATCCTGGT TATGGGCAAT GGGAA6GTGG TAGAATTTGA TOQGCOGGAG QTACTGOGGA 4440 
AGAAGCCT6G GTCATTQTTC GCAOCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAO 4500 
GAGATGTGGA GACTTCATGG AG6CTGGCAG CTQAGCTCRO A06TTGACAC AGGTGCAGCT 4560 
TOGAGGCCCA CAGTCTGCX3A CCTTCTTGTT TGQAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAQ GGGOQGTGGQ GATTGCTGGA TGCAAAOXT GGAATAGQCT ACTTGATGGC 4680 
55 TCTCAA6ACC TTAGAACCCC AGAACCATCT AAGACATOGG ATTCAGrTQAT CATCTGGTTC 4740 
TOCTTTTAAC TTACATGCTG AATAATTTTA TAATAACGTA AAA6CTTATA GTTTTCTGAT 4800 
CTGTOTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAOGAAAAC 4860 
TC 

60 Seq ID NOt 104 Protein sequence 
Protein Accession «: HP_149163.2 

1 11 21 31 41 51 

,c I I I I ' I 

05 MTRKRTYWVP NSSGGI^VNRG IDIGDDMVSO IiIYKTYTIjQD GPHSQQERHP BAPGRAAVPP 60 

NGKXDAALRT MZPFRPKPRF PAPQPLDHAG LFSYLTVSHL TPLMXQSLRS RLDSITIPPL 120 

SVHDA8DKNV QRLHRLWEEB VSRRGZEKAS VLLVMLRPQR TRLZPDALLG ICFCIASVLG 180 

PILIIPKILB YSEEQLGNW HGVGLCPALP LSECVKSLSP fiSSWIINQRT AIRFRAAVSS 240 

PAFEKLIQPK SVIHITSGBA ISPPTGDVNY LFBGVCYGPI* VLITCASLVI CSISSYPIIG 300 

70 YTAFIAIIiCy tIjVPPLAVPM TRMAVKAQHH TSEVSDQRIR VTSEVLTCIK LIKMYTWEKP 360 

PAKIXEDXiRR KBRKLLBKOG LVQSLT8ZTL PXIPTVATAV HVLZBTSIiKL KLTA SHAFS M 420 

LASLNUiRLS VPFVPZAVK8 IiTNSKSAVMR FKKPFLQB9P VPYVQTLQDP SKALVPEEAT 480 

LSWQQTCPGI VNGALELERN GHASBGMTRP RDALOPBEBG NSLGPBLHKI NLWSKGMMIi 540 

GVCXariGSGK SSLLSAIIjEE MHLLEGSVGV QGSLAYVPQQ AWIVSGNIRE NILMGGAYDK 600 

75 ARYLQVLHCC 8IKRDLELLP PGDNTBIGBR GIiHLSGGQKQ RI8LARAVYS DRQIYLLDOP 660 

LSAVDABVGK HIPEECXKKT LRGKTWLVT BQLQYLBFGG QIXLUENGKl CBNQTHSELM 720 

QKKGKYAQLI QKMHXEATSD MLQDTAKZAB KPKySSQALA TSIfEBSLNGN AVPEHQLTQE 780 

EEMBBGSLSW RVYHHYIQAA GGYMVSCUP PPWLIVPLT IPSPWHLSYW LEQGSGTNSS 840 

^ RESNGTMADL GNIADNPQ1.S PYQLVYQLNA LLLICVGVCS SQIPTKVTRK ASTAI^NKLF 900 

80 MKVFRCPKSP FDTIPIGRLL NCPAGDLBQL DQLLPIPSEQ PLVLSU4VIA VLLIVSVLSP 960 

YZXilMGAZZM VZCPZYYMMP KXAZGVPKRI. BIYSRSPLPS BZIiNSLOOIiS SZHVYGKTBD 1020 

PZSQFKRIfTD AQlQiyLUjFL SSTRHMALRL BIHTNLVTXA VALFVAPGZS STPYSFKVKA 10 80 

VNXVLQLAS8 PQATARZGXjE TBAQPTAVER XLOYNKMCVS BAPLHMBOTS CPQOMPQHQB 1140 

IZFQDYHMKy RDNTPTVLHO XllZiTZRGHEV V0ZV6RTG86 XSSLGMAXiPR LVEPMAGRZIi 1200 
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IDGVDICSIG LEDLRSKLSV IPQDPVLLSQ TIRFNLDPPD RHTDQQIWDA LERTPLTKAI 1260 
SKFPKKLHTD WQJGGKPSV GERQLLCIAR AVLRMSKIIL IDBATASIDM BTDTLIQRTI 1320 
REAFQGCTVL VIAHRVTTVL HCDBZLVMO^ GXWEPDRFB VLRXKPGSLP AAU1ATATSS 1380 
LR 

Seq ID NOt 105 DNA sequence 

Nucleic Acid Accession 8; Bos sequence 

Coding sequence: 1..4043 



1 11 21 31 41 51 

I I I I I i 

ATGACTAGGA AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCX3T GAATCGTGGC 
ATCGACATAG GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT 
GOCCCCTGGA GTCAGCAAGA GAGAAATCCT GAGOCTCCAO GGAGGGCAGC TGTCCCACOG 
TGGGGGAAGT ATGATGCTGC CTTGAOAACC ATGATTCOCT TCCGTCCCAA GOCGAGGTTT 
CCTGCCCCCC AGCCCCTGQA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC 
ACGCOQCTCA TGATCCAAAG CTTACGOAGT COCTTAGATO AGAACACCAT CCCTCCACTG 
TCAOTCCATG ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTPQ GGAAGAAQAA 
GTCTCAAGGC GAGGGATTGA AAAAGCTTCA GTGCTTCTGO TGATGCTGAG GTTCCAGAGA 
ACAAGGTTGA TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCC3VQ TGTACTGGGG 
CCAATATTGA TTATAGCAAA GATC CTGGAA TATTCAGAAO AOCAQ TTGGG GAAT O TOT C 
CATOGAGTGG GACTCTGCTT T6CCCTTTTT CTCTCCGAAT OTOTGAAGTC TCTQAGTTTC 
TCCTCCAGTT GGATCATCAA CCAAOGCaca GCCATCAGGT TCOQAGCAGC TOTTTCCTCC 

r r m cc m t j agaagctcat ccaatttaag tctgtaatac acatcacctc aggagaggcc 
atcagcttct tcaccggtga totaaactac ctgtttgaag gggtgtgcta tggaccccta 
gtactgatca cctgcgcatc octqotcatc tgcagcattt cttcctactt cattattgga 
tacact6cat ttattgccat cttatgctat ctcctggttt tcccactogc ggtattcatg 
acaagaatgg ctgtgaaggc tcagcatcac acatctgagg tcagcgacca gcgcatcggt 
gtgaccagtg aagttctcac ttgcattaag ctgattaaaa tgtacacatg ggagaaacxza 
ttt6caaaaa tcattgaagg tatggaaagt ctgactttct gctccaaacc tggtqatggc 
atoocxtptca gcatgctgoc ctccttgaat ctccttoggc tqtcrgtgrr ctttgtgcxrr 
att6cagtca aag6tctcac gaattccaag tct6cagtga tqaggttcaa aaagtttttc 

CTCCA6GAGA GCCCTGTTTT CTATGTCCAG ACATTACAAG ACCCCAGCAA AGCTCTGOTC 
TTTGAGQAGG CCACCTTGTC ATGGCAACAG ACCTGTCCCG GGATCGTCAA TGGGGCACTG 
GAGCTGGAGA 6GAA0GGGCA TGCTTCTGAQ GGGATGACCA GGCCTAGAGA T6CCCTCGGG 
CCAGAGGAAG AAGG6AACAQ CCTGG6CX:CA GAGTT6CACA AGATCAACCT GQTGGTGXCC 
AAGGGGATGA TGTTAGGGGT CTGCGGCAAC AGOQGGAGTG GTAAOAGCAO OCTGTTOTCA 
GCX:ATCCrGG AGGAGATGCA CTTGCTCGAG GQCTCGGTGG GGGTGCAGGG AAGCCTGGCX: 
TATQTCCCCC AGCAGGCCTG GATCGTC3VGC GGGAACATCA GGQAGAACAT CCTCATQGQA 
GGCX3CATATQ ACAAGGCCOG ATACCTCCAG GTGCTCCACT GCTGCTCCCT GAATOGGGAC 
CTGGAACTTC TGCCCTTTOG AGACAT6ACA 6A6ATTGGAG AGG6GGGCCT CAACCTCTCT 
GGG6GGCAGA AACAGAQGAT CAGCCTGGCC CXSCGCOGTCT ATTCGGACCG TCAG ATCTAC 
CrGCTGGAOO ACCCCCTGTC TGCTGTGGAC GCCCACGTGG GGAAGCACAT TTTTGAGGAO 
TGCATTAAGA AGACACTCAQ GGGGAAGACG GTC0TCXnX»3 TGACCCACCA GCTGCAQTAC 
TTAGAATTTT GTGGCCAGAT CATTTTOTTO GAAAATGGGA AAATCTGTGA AAATGGAACT 
CACAGTGAGT TAATGCA6AA AAAGGGGAAA TATGCCCAAC TTATCCA6AA 6ATGCACAA6 
GAAGCCACrr CGGACATGTT 6CAGGACACA GCAAA6ATAO CAGASAAGCC AAAGGTAGAA 
AGXCAGGCTC TGGCCACCTC CCTQGAAGAG TCTCTCAACG GAAATGCTGT GCCGGAGCAT 
CAGCTCACAC AGGAGQAGGA GATGQAAGAA GGCTCCTTGA GTTGGAGGGT CTACCACCAC 
TACATCCAGO CAGCTGGAGG TTACATGGTC TCTTGCATAA TTTTCTTCTT G6TG6TGCTG 
ATCGTCTTCT TAAOSATCTT CAGCTTCTOG TGGCTGAGCT ACTGGTTOGA OCAGGGCT08 
GQGACCAATA GCAOGOaAGA 6A0CAATGGA ACCATGGCAO ACCTOGGCAA CATT6CAGAC 
AATCCTCAAC TGTCCTTCTA CCAGCTGGTG TAOGGGCTCA ACGCCCTGCT CCTCATCTGT 
GTGQGGGTCT GCTCCTCAGO GATTTTCACC AAAGTCAOGA GGAAGGCATC CACGGCCCTG 
CAC3UVCAAGC TCTTCAACAA GGTTTTCCGC TGCCCCATGA GTTTCTTTGA CACCATCCCA 
ATA6GCGG6C TTTTGAACTO CTTGGCAGGG GACTTGQAAC AGCTGGACX31 GCTCTTGOCC 
ATCTTTTCAO AGCAGTTCCT GGTCCTGTCC TTAATGOTGA TCGCCX3TCCT GTTGATTGTC 
AOTGTGCTOT CTCCATATAT CCTGTTAATG GGAGCCATAA TCAT6GTTAT ITGCTTCATT 
TATTATATGA TGTTCAAGAA GGCCATCGQT OTOTrCAAGA GACTGGAGAA CTATAGCOGG 
TCTCCTTTAT TCTCCCACAT CCTCAATTCT CTGCAAGGCC TGAGCTCCAT CCATOTCTAT 
GGAAAAACT9 AAOACTTCAT CAGCCASTTT AAGAGGCTQA CTGATGCGCA GAATAACTAC 

CXtjC'm'nXJT ttctatcttc cacacjgatgg atggcattga ogctggagat catgaccaac 

CTTGTGACCT TGGCTGTTGC C C TGTTOGTG OCTTTTGGCA TTTCCTCCAC CCCCTACTCC 
TTTAAAGTCA TGGCTGTCAA CATCX3TGCTG CAQCTGGCGT CCAGCTTCCA GGCCACTGCC 
OGGATTGGCT TGGAGACAGA GGCACAGTTC AOSGCTGTAG AOAGGATACT GCAOTACATG 
AAGAIOTGTO TCTCGQAAOC TCCTTTACAC ATGGAAGGCA CAAOTTGTCC CCAG660T6G 
CCACA6CAT6 GGGAAATCAT ATTTCAGGAT TATCACATGA AATACAGA6A CAACACACCC 
ACOGTOCTTC ACX3GCATCAA CCTGACX31TC CQCGGCCACG AAGTGGTGGG CATOGTGGGA 
AGGACGGGCT CTGGGAAGTC CTCCTTOGGC ATGGCTCTCT TCCGCCTGGT GGAGCCCATG 
GCAGGCGGGA TTCTCATTGA GGGGGTGGAC ATTTGCAGCA TC6GCCTGGA OGACTTGOGG 
TCCAASCTCT CAOTQATCCC TCAAGATCCA QTGCTGCTCT CAG6AACCAT CAGATTCS^ 
CTAGATCCCT TTGACOGTCA CACTGACCAO CAGATCTOGO AT6CCTTGGA GAQGACATrC 
CTQACCAAGG CCATCTCAAA GTTCCCCAAA AAGCTGCATA CAGATGTGGT GGAAAAOQGT 
GGAAACTTCT CTOTGGGGQA QAGGCAGCTG CTCTGCATTG CCAGGGCTGT GCTTCGCAAC 
TCCAAGATCA TCCTTATCQA IQAAGCCACA GCCTCCATTG ACATQGA6AC AGACACCCTG 
ATCCAOCQCA CAATCOQTGA AGCCTTCCAG Q GC TGCACCQ TOCTOQTCAT TGCCCACOGT 
GTCACCACra TGCTGAACTG TGACCACATC CTGGTTATGG GCAATGOGAA GOTGOTAGAA 
TTTGATOQGC OGQAOGTACT GCSQAAGAAO CCTGQOTCAT TGTTOGCAGC CCTCATGGCC 
ACAGCCACTT CTTCACTGAG ATAA 



Seq ID KOi 106 Protein secruence 
Protein Accession fti Bos sequence 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
XS60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2S80 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3160 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 

4020 
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1 11 21 31 41 51 

llTRKRTYWVP KSSGGLVNRG IDIGDDMVSQ LiyimrTLQD GFWSQQBRNP EAPGRAAVPP 60 

KGXynAALRT MIPFRPKPRP PAPQPLDKAG LFSYLTVSML TPLMIQSLRS RLDENTIPPL' 120 

SVHDA8DKHV QRLHRLIfEEE V8SRQXBKAS VLLVMLRFQR TRLIFDAI*LG ICPCIASVWS 180 

PILIIPKILE YSBBQW3NW HGVGLCPALF LSBCVKSLSP SSSWIIIfQRT AIRFRAAVSS 240 

FAPEKLIQFK SVIHITSGEA ISFFTGDVNY LPEGVCYGPL VLITCASLVI CSISSYFIIG 300 

YTAFIAIIiCY LLVFPLAVPM TRMAVKAQHH TSBVSDQRIR VTSEVLTCIK LIKMYT WEaCP 360 

PARZZEGMES LTFCSKP(TO MAFSMLASLN LLRLSVFPVP ZAVXGI*TN8X 8AVMRFKKFF 420 

LQBSPVFYVQ TLQDPSKALV FEBATLSHQQ TCPGIVNGAL ELBRNGHASE QfTRPRDALG 480 

PBBE6NSLGP ELHKIMLWS KGMMLGVOGN TQSGKSSLLS AILBEMHLLE GSVGVQGSLA 540 

YVPQQAWIVS GNIRQJILMG GAYDKARYIiQ VLHCCSLNRD LELLPFGDMT BIGERGUJLS 600 

GGQKQRISLA RAVYSDRQIY LU)DPLSAVD AHVGKHIFSE CIKKTLRGKT WLVTHQLQY 660 

lAPOGQIILL ENGKICENOT HSBLMQKKGK YAQLZQXMHK BATSDMLQDT AKIAEKPKVB 720 

SQALATSLBE SLN6KAVPSH QLTQBEEMEB GSLSHRVYHR YIQAA6GYMV SCIIFFFWL 780 

IVFLTIPSFW WLSYWLEQGS GTMSSRESNQ TMADLGNIAD NPQLSPYQLV YQLNALUilC 840 

VGVCSSGIFT KVTRKASTAL HNKLPNKVFR CPMSFFDTIP IGRLUJCFAG DLBQLDQLLP 900 

IFSBQFLVLS LMVIAVLLIV SVLSPYILLM QAIIMVICPI YYMMFKKAIO VFKRLEMYSR 960 

SFI^SKIZMS liQGEiSSIHVy GXTEDFZ8QF KRLTDAQMNY LLLF LSST RW MALRLEIMTN 1030 

LVTLAVALFV AFGISSTPYS FKVMAVtflVL QLAS8FQATA RIGLETBAQF TAVBRILQYM 1080 

KMCVSEAPLH MBGTSCPQGW PQHGBIIPQD YHMKYRDNTP TVLHGINLTI RGHBWOIVG 1140 

RTGSGKSSLG MALPRLVBPM AGRILIDGVD ICSIGLEDLR SKLSVIPQDP VLLSGTIRFN 1200 

IDPFDRHTDQ QIHDALERTP LTKAISKFPK KLHTDWENO GNFSVGERQL LClAHAVIiRN 1260 

SKZIItZDEAT ASIDMETDTX* XQRTIRBAFQ GCTVLVZARR VTTVLMCDHI LVHGNGKWB 1320 
FDRPBVLRXK PGSLFAALMA TATSSLR 

Seq ID NO: 107 DMA sequence 
Nucleic Acid Accession ttx NM_024022 
coding sequence t 202 . . 1563 

1 11 21 31 41 SI 

I I i I ] I 

ACCGGGCACC GQA0GGCTC30 OGTACTTTOG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GQAAAGGGCT OTSTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC ITOCOTGGTO 120 

CCATCTACAT TTTTGGGACT 0G6QAATTAT OAOGTAGAGG TGGAG GCGG A GCCGOATGTC 180 

AOAGGTCCTG AAATA6TCAC CATGGGGGAA AATGATCCX5C CTGCTOTTGA AGCCCCCTTC 240 

TCATTCOGAT OGCTTTTTGQ CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAOATGCA 300 

GATGCTGTTQ CTGCACAGAT CCTGTCACrG CTGCCATTGA AGTTTTTTCC AATCATOGTC 360 

ATTGGGATCA TTGC3VTTGAT ATTAGCACIG GGCATTGQTC TGGGCATOCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATOO AGCTGATAGC TOGATGTGAC 480 

GGAOTCTCGG ATTGCAAAGA CGGGGAGGAC GAQTACCGCT GTGTCCX3Q0T 6GGTGQTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTOCT TCOTGGAAQA CCATGTGCTC CGATGACTGG • 600 

AAGGGTCACT ACGCAAATGT TOCCTGTGCC CAACTGGGTT TCCC AAOCTA TGTaAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGO CAOTTCOGGO AOGAGmOT 6TCCAT0QAT 720 

CACCTCTTGC CAGATGACAA GGTQACTGCA TTAC31CCACT CAOTATATGT GAGGGA66QA 780 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGQTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAOTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTOQATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT OVTGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTQGACAA TCXZAOCCCCA TCCCACTTGG T6QAGAAQAT TGTCTACXaiC 1080 

AGCAAGTACA AQCCAAAGAG GCTGGGGAAT GACATOQCXX: TTATGAAGCT GGCCGGGCCA 1140 

CTCAOGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCXXa^ ACTCTGAAQA GAACTTCCCC 1200 

GATGGAAAAQ TGTGCTGGAC OTCAGGATGG GGGGCGACAO AGQATGOAOa TGACXJCCTCC 1260 

CCIGTCCTGA ACCACGCGGC OGTCCCTTTO ATTTOCAACA AGAT CTQCft A OCACAG GGAC 1320 

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGO GCTACCTQAC GGGTGGCGTG 1380 

GACAGCTGCC AGGGGGACAG CX3GGGGGCCC CTGGT6TGTC AAGAQAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACXSIGCTT TGGCATCGGC TGCGCAGAGQ TGAACAAGCC TGGGGTQTAC 1500 

ACXXX3TGTCA CCTCCTTCCT GGACTGGATC CA03AGCAGA TGGA6AGAQA C CTAAAA ACC 1560 

TOAAGAGGAA GGGGACAAGT AGCCACCTGA 0TTCCTGAG6 TGATQAAGAC AGCCO aATCC 1620 

TCOCCTGGAC TCCOGTGTAG QAACCTGCAC ACGAGC3«3AC ACOCTTGQAG CTCTGAGrTC 1680 

0G6CACCAGT AGCAGGCCOG AAAGAGGCAC CCTTCX3VTCT GATTCCAGCA CAACCTTCAA 1740 

GCTOCTTTTT Q TTTTTT GT T TTTTTGAG6T GGAGTCTOGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCX3A AATCCCTGCT CACTOCAOCC TCXX3CTTCCC TQ6TTCAASC GATT CTCTTG 1860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAO GTGCCCGCCA COVCAOOCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTOOCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTQCTGGGAT TACAGQCATG GGCCACCACG 2040 

CCTAGCCTCA OGCTCCTTTC TGATCTTCAC TAAQAACAAA AG AAGCA GCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGQ 6TCTTQCAAA ATTCCTGAOO 2160 

AGATAAGCAG TTATGTGACC TCACX3TGCAA AGCCACCAAC AGCCACTCAO AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACQTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AAOOCAOCCT TTCXACTTCC AAGACTTATT TTCACATGTG GGQAGGTTAA TCTAGGAATO 2340 

AGTCCTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGOTGCTTGA OGTATTATTG 2400 

TOCTTTGATT CCIUU^TAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA ^AAAAAAAAAA 2460 
AAAAA 

Seq ZD MO: 108 Protein sequence 
Protein Accession «t NP_076927 

1 11 21 31 41 51 

I I I I ( t 

MGBNDPPAVB APF8PRSLFG U3DLXISPVA PDADAVAAQI LSLLPIiKFFP IIVIGXIALX 60 

LALAIGLGIH PDCSGKYRCR SSFKCIELIA RCDGVSDCKD GBDBYRCVRV GGQNAVLQVF 120 

TAA8HKTMCS DDWKGHYANV ACAQU3FP6Y VS8DNLRVSS LBGQFREBFV SIDHIiLPDDK 180 

VTALHHSVYV REGCASGHW TliQCTACGHR RGYSSRIVGG NM8LLSQWPW QASLQPQGYH 240 
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LCGGSVXTPL WIITAAHCVY DLYLPKSWTI QVGLVSUiDN PAPSHLVBKI VYKSKYXPKR 300 

LGNDIALMKL AGPLTFNEMI QPVCLPNSBE KFPDGKVCWT SGWOATEDGG DASPVLNHAA 360 

VPLXSNKICN BRDVYGGII8 PSMLCAGYLT GGVDSGQ6DS GGPLVOQERR LVfXLVGATSP 420 
^ GIGCABVNKP GVYTRVTSFL DHIKEQMERD I*RT 

Seq ID KOt 109 DHA sequence 

NUdelc Acid Accession #t NM_000493.3 

Coding sequence: 97.. 2139 

10 1 11 21 31 41 51 

i 1 I I I i 

CACCTTCTGC ACTGCTCATC TGGGCAGAGG AAGCTTCAGA AAGCTGCCAA GGCACCATCT 60 

CCAGGAACTC CCAGCAOOCA GAATCCATCT QAOAATATGC TGCCACAAAT ACCCTTTTTO 120 

CTGCTAGTAT CCTTGAACTT GGTTCATGGA GTGTTTTAOG CTGAAOQATA CCAAATGCCC 180 

15 ACAGGCATAA AAGGCCCACT ACCCAACACC AAOACACAGT TCTTCATTCC CTACACCATA 240 

AAGAGTAAAG CSTATAGCAOT AAOAOQAGAG CAAGGTACTC CTGGTCCACC AGGCC CTQCT 300 

GQACCTOGAQ G6CACCCAGG TCCTTCTGGA CCACCAGOAA AACCAGGCTA OGGAAGTCCT 360 

GQACTCCAAO GAGAGCCAGG GTTOCCAGGA CCACCGGGAC CATCAOCTGT AGGGAAACCA 420 

GGTGTGCCAO QACTCCCAGG AAAA0CA6GA GAGA6AGGAC CATATQQACC AAAAGGAGAT 480 

20 GTTGGACCAG CTGGCCTACC AGGACCCC6G GQCCCACCAG GACCACCTGG AATCCCTGGA 540 

CGGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGG GACCCACAGG AGCCCCAGGA 600 

CCCAGGGGCT TTCCTX36AGA AAAGGOTGCA CCAGGAQTCC CTGOTATQAA TGGACAGAAA 660 

GGGGAAATGG GATATGGTGC TCCTGGTCGT CCAGGTGAGA GGGGTCTTCC AGGCCCTCAO 720 

GOTCCCACAG GACCATCTGG CCCTCCTGGA GTGGGAAAAA GAGQTQAAAA TG6GGTTCCA 780 

25 GGACaGCCAQ GCATCAAAGQ TQATAGAQQT TTTC0Q6GAG AAATGGGAOC AATTGQCCCA 840 

COWSOTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAGCT 900 

GCTGGAGCCC CAGGCCAGCC AGOGATTCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960 

ATAGCTOGGC CCCCAGGGCC TCCTGGCTTT GGOAAAOCAG GCTTGCCAQG CCTOAAGGQA 1020 

GAAAGAGGAC CTGCTGGCCT TCCTGaGGGT CCAGQTGCCA AAGGG6AACA AQQGOCAGCA 1080 

30 G6TCTTCCTG GGAAGCCAQG TCTGACTGGA CCCCCTGGQA ATATGG6ACC CCAAGGACCA 1140 

AAAGGCATCC CGGGTAGCCA TGGTCTCCCA GGCCCTAAAO GTGAGACAGG GCCAGCTGOG 1200 

CCTGCAGGAT AOCCTGGGGC TAAGGGTQAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 1260 

GGGTACCCAG GAAAACCAGG TCTCGATQGT CCTAAGGGTA AOCCAGGQTT ACCAGGTCCA 1320 

AAAGGTQATC CTGQAGTTGQ AGGACCTCCT GQTCTCCCAG GOCCTGTGGG CCCAGCAGGA 1380 

35 GCAAAGGGAA TGCCOGGACA CAATGGAGAG GCTGGCCCAA GAG6TGCCCC TGGAATACCA 1440 

GGTACTAGA6 GCCCTATTCG GCCACCAQGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT 1500 

CCAGOAAGTC CCX3GTCCTCC TG6CCCAGCT GGCATAGCAA CTAAGGGCCT CAATG6ACCC 1560 

ACCGGQCCAC CAGQGCCTCC AGGTCCAAGA GGCCACTCTG GAQAGCCTGG TCTTCCAGGG 1620 

CCCCCTGGGC CTCCAGGCCC AOCAGQTCAA GCAGTCATGC CTGAGGGTTT TATAAA06CA 1680 

40 GGCCMMSGC CCAGTCTTTC TGG6ACCCCT CTTGTTAQTG CCAACCAQGO GGTAACAG6A 1740 

ATCCCIOTGT CTGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGGAACTCCC 1800 

ATACCATTTO ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTGGAATC 1860 

TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 1920 

CATGTTTGGG TAGOCCTGTA TAAQAATGOC ACCCCTGTAA TGTACACCTA TGATGAATAC 1980 

45 ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC 2040 

CAGGTGTG6C TCCAGCTTCC CAATGCOGAlS TCAAATGGCC TATACTCCTC TGAGTATGTC 2100 

CACTCCTCTT TCTCAGGATT 0CTAGTG6CT CCAATGTGAQ TACACCCCAC AQAGCTAATC 2160 

TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA ATGCATATGG 2220 

AGGTAOOCTQ AAAAGAATGT AATTTTTATT TTCTGAAATA CAQATTTGAG CTATCA QACC 2280 

50 AACAAACCTT CCCCCTOAAA AGTGAGCAGC AACGTAAAAA OGTATGTGAA QCCTCTCTTG 2340 

AATTTCTAOT TAOCAATCTT AAOaCTCTTT AAGOTTTTCT CCAATATTAA AAAATATCAC 2400 

CAAAQAAGTC CTOCTATGTT AAAAACAAAC AACAAAAAAC AAAQCAACAA AAAAAAAAAT 2460 

TAAAAAAAAA AACAGAAATA GAOCTCTAAG TTATGTGAAA TTTGATTTGA GAAACTOGGC 2520 

ATTTCCTTTT TAAAAAAGCC TQTTTCTAAC TATQAATATG AOAACTTCTA GGAAACATCC 2580 

55 AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTC3UUVTT TA AAAGA CAC 2640 

TGIAICCXXn AAAATATTTC TGATGGTGOV CTACTCTQAG GCXW3TATGG CCCCTTTCAT 2700 

CAATATCTAT TCAAATATAC AGGT6CATAT ATACTTGTTA AAOCTCTTAT ATAAAA AAGC 2760 

CCCAAAATAT TGAAGTTCAT CTQAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 3820 

CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAQCA TAAC TTGQA A AC AGGTA TCT 3880 

60 GACCTATTCT TATTTAflTTA ACACaU«3TGT GATTAATTTG ATTTCTTTAA TTCCTT ATTQ 2940 

AATCTTATQT GATATGATTT TCTGGATTTA CAGAACATTA GCACATQTAC CTTGTGCCTC 3000 

CCATTCAAGT GAAGTTATAA TTTACACTOA GGGTTTCAAA ATTOGACTAG AAGTGGAGAT 3060 

ATATTATTTA TTTATOCACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAOCTG 3120 

TGCCTCACTT ATTAAAGCAC AAAATGTTTT ACCTACTCCT TATTTACGAC ACAATAAAAT 3180 

65 AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 
TTCTTTCAAO GCTTTTCATT G6ACACAATA AAATAACATC AATAO 



70 



Seq ID NOt 110 Protein sequence 
Protein Acceasion #i 1IP_000484.2 



1 11 31 31 41 51 

JLpqipplll VSLNLVHGVP YAERYQMPTG IKGPLPNTKT QPPIPYTIKS KGIAVRGEQQ 60 

TPGPPGPAGP RGHPGPSGPP GKPGYQSPGL QGEPGLPGPP GPSAVGKPGV PQLPGKPGER 120 

75 GPYOPHGDVO PAGLPGPRGP PGPPOIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 

VP(91NGQKGB KOyGAPGRPO BSGIiPGPQGP TQPSGPPGVQ XR8E3K3VPGQ PGIKGDRGFP 240 

GEM8PXGPPG PQ6PPGBRGP EQIGKPGAAO APGQPGZPGT KSLPGAPGIA GPPGPPGFGK 300 

PGLPGLKQBR GPAGLPGGPQ AKGEQGPAGI. PQKPGLTGPP ONMGPQGPKG IPGSHGLPGP 360 

KOETQPAGPA GYPGAKGERG SPGSDGKPGY PQKPGliDGPK GNPGLPGPKG DPGVGGPPQIi 420 

80 POPVGPAGAK GMPGHNGEAO PRGAPGIPGT RGPIGPPGIP GFPGSKQDPG SPGPPGPAGI 480 

ATKQLNGPTO PPGPPGPRGK SGBPGLPGPP GPPGPPGQAV MPEQPIKAGQ RPSLSGTPLV 540 

SAMQQVT6MP VSAPTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTOIFT CQIPGIYYFS 600 

YBVHVKGTBV WVGLYKNGTP VKYTYDBYTK OYIiDQASGSA IIDLTENDQV IfLQLPNAESN 660 
GLYSSBYVBS SFSGFXiVAFM 
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Seq ID NO: 111 DMA sequence 
Nucleic Acid Accession Ot nm_000949 
Coding sequence t 285.. 2153 

1 11. 21 31 41 51 

11111) 

GGAGGCTGAA ATCCCCAGAC QCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 

CTCCCTCTTT CTGGATTTTA CCGACCGTTC G06AAACAGC TTTCCACACA ATGGAGCTTC 120 

ATGTCCTCGT 6CAGGAAQTA CTCATCGACT GATGTG6CAG ACTTT6CT0C CTQACAAAAC 180 

TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 

GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAO QAAAATGTGG 300 

CATCTGCAAC OGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 

TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTOSTTCTCC CAATAAQCSAA ACATTCACCT 420 

GCTGGT06AG GGCTGG6ACA GATGGA6GAC TTCCTACCAA TTATTCACT6 ACTTACCACA 460 

GGGAAGGAGA QACACTCATG CATGAATOTC CAGACTACAT AACXXR3TGGC CCCAACTCCT S40 

GCCACTTTGQ CAAGCAGTAC ACCTCCATGT GGAGOACATA CATCATGATG GTCAATGCCA 600 

CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 

AGCCAOACOC T CC TTTG G AG CTGGCraTGO AA6TAAAACA GCCAOAAGAC AQAAA ACCCT 720 

ACCTGTGGAT TAAATGGTCT CCACCTACOC TGATTGACTT AAAAACTGGT TG6TTCAGGC 780 

TCCTGTATGA AATTCX3ATTA AAACCOGAGA AAGCAGCTGA GTGGGAQATC CATTTTQCTO B40 

GGCAGCAAAC AGAQTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 

TTCGCTGCAA ACCAOACXSVT GGATACTGGA GT6CATGGAG TCCAGCGACC TTC31TTCAGA 960 

TACCTAGTGA CTTCAOa^TQ AATQATACAA COGTGTGGAT CTCTGTOGCr OTOCTTTCTO 1020 

CTQTCATCra TTTQATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGT6 A0CT 1080 

GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TQATGCTCAT CTGTTGGAGA 1140 

AGGGCAAGTC TGAAGAACTA CTGAQTQCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 

ACTATGAOGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGA6GAC CAGC ATCTAA 1260 

TGTCAOTGCA TTCAAAAOAA CACCCAAOTC AAOQTATGAA ACCCACATAC CT QQATO CTQ 1320 

ACACTGACTC AGGCCGOGGO AGCTGTGACA GCCCTTCCCT TTTGTCTOAA AAGTOTGAGO 1380 

AACCCCAGGC CAATCCCTCC ACATTCTATO ATCCTGAGGT C3VTTQAGAAG CCAQAGAATC 1440 

CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 

TTCATGCTCO TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAG CCCAG C CAGCACAACC 1560 

CCAGATCCTC TTACCACAAT ATTACTGATG TGTOTGAGCT GGCTGTGGGC CCTGCAG6TG 1620 

CACOGGCCAC TCTGTTOAAT GAAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1680 

AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAOGQAGGT AOAAAGCTTC CATTCTQAGA 1740 

CTGACCAGQA TAOGCCCTGG CTGCTGCCCC AGQAGAAAAC CCCCTTTGGC TCCGCTAAAC 1800 

CCTTGGATTA TGTG6AGATT CACAAGQTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1860 

AACAGAGA6A GAACAGOQGC AAGGCCAAGA A6CCGGG6AC TCCTGAGAAC AATAAGGAGT 1920 

ATGCC3JU3GT GTCXXSGGGTC ATGGATAACA ACATOCTGGT OTTGGTGCCA GATCCAOITG 1980 

CTAAAAACGT GGCTTGCTTT GAAGAATCAQ CCAAAGAGGC CCCACCATCA CTTGAACAQA 2040 

ATCAAGCTGA QAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTQC AGGCTC CAGC 2100 

TGGGTGGTTT 6GATTACCTG 6ATCCC0CAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 

ACTAAtGGAA TGATIGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTAGGTGA 2220 

AATGCTCAAG AATGTAGTCA GACTGACACT ACTAAAGCTC CCAGCTOCTT TCA TGCTC CA 2280 

TTTTTAACXy^ CTTGCCTCTT TCTCCAGCAG CTOATTCCAO AACAAATCAT TATGTTTCXT 2340 

AACTGTGATT TOTAQATTTA CTTTTTGCTG TTAGTTATAA AACTATQTOT TCAATGAAAT 2400 

AAAAGCACAC TQCTTAGTAT TCTTQAGGGA CAATGCCAAT AGGTATATCC TCTGQAAAAG 2460 

GCTTTCATQA TTTGOCATGG GAC3U3AGGGA AATGAAATTG TCAAAATTOT TTACCATAGA 2520 

AAGATGACAA AAGAAAATTT TCCACATAGO AAAATGOCAT QAAAATT6CT TTTQAAAAAC 2580 

AACTGCATAA CCTTTACACT CCTCGTOCAT TTTATTW3GA TTACCCAAAT ATAACCATTT 2640 

AAAGAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 

Seq ID MOt 112 Protein sequence 
Protein Accession #: np__000940 

1 11 21 31 41 51 

I 1 I I i t 

MKENVASATV PTLLLFLSTC LLKGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTHY 60 

SLTYHREGET LMHECPDYIT GGPNSOIFGK QYTSMWRTTI MMVNATNQMG 8SFSDELYVD 120 

VTYIVQPDPP LBLAVEVKQP EDRKPYU»IK WSPPTLIDLK TGWPTLLYEI RUCPEKAAEW 180 

BIHFAGQQTB FKXLSIjBPGQ KYLVOVROCP DHGYWSAHSP ATPIQIPSDF TMNDTTVWIS 240 

VAVLSAVICL IIVHAVALKO Y8NVTCXFFP VPGPKIKOFD AHLLSXGK8E ELLSKUBCQiD 300 

FPPTSDYEDL LVEYLBVDDS EDQHLMSVH8 KEHPSQGKKP TYIiDPDTDSG RGSCDSPSLL 360 

SBKCEEPQAN PSTPYDPBVI EKPEMPBTTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420 

PSQHNPRSSY H^aTDVCBLA VGPAGAPATL UJBAGKDALK SSQTIKSREE GKATQQRBVE 480 

SFHSETDQDT PWLLPQKKTP FGSAKPU3W EIHKVNKDGA LSLLPXQREM SGKPKKPGTP 540 

EinnCEYAKVS.QVFfDNNILVIi WDPHAXNVA CFEB8AKSAP PSLEQNQAER ALANFTATSS 600 
KCRLQLGGLD YIiDFACFTHS FH 

Seq ID NOt 113 DMA sequence 
Nucleic Acid Accession #i XM_062811 
Coding sequence I 1..88B 

1 11 21 31 41 51 

1 I I 1 1 i 

ATGTGGGGOG CTCGCCGCTC GTCC6TCTCC TCATCCTGGA AOGCOOCTTC GCTCCTGCAG 60 
CTGCTGCTGG CTGCGCTQCT GGOGOCGGGG GCGAGG6CCA GCOGGQAGTA CT6CCA0G6C 120 
TGGCTGGACG OGCAGGGOGT CTGGCGCATC GGCITCCAOT GTCCOGAOCO CTT06ACGGC 180 
GGCGAOGCCA CCATCTOCTG C6GCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCQAG 240 
QCGC6CCTGO ACCAGGGCGG CTGOGACAAT GACCGCCAGC AGGGCGCTGG CXSAGCCTGGC 300 
CGGGOGGACA AAGAOGGCCC OGACGGCTOO GCAOTGCCCA TCTAOGTQCC QTTCCTCATT 360 
GTTtSGCTCOG TGTrrGTCGC CTTTATCATC TTGGGGTGOC TG6TQGCAGC CTGTTOCTOC 420 
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AGATGTCTCC 
ATGGAGACCA 
TCXAGOVCAG 
AGGTCACAOA 
CCCACGAATT 
CAGTATCTGC 
QCTGTGCCAC 
CCTOVCACCA 



66GCTAAGCA 
TCCCCATGAT 
CTQGCAGTTC 
CCAACTGTTQ 
TCTCTGTGCT 
ATCCCCCATA 
CTTTCATGGA 
ACAGTGAACA 



GGATCCCCAG 
CCCCAOTGCC 
CAGCTCCA6C 
CTTGOOGQAA 
GAACTGTCA6 
CGTGGGGTAC 
CGGCXrrGCAG 
GAAGATGTAC 



CAQAGCOQAO 
AGCACCTCCC 
6CCAACTCA0 
G06AGCATGA 
CAGGCCACCC 
ACGGTGCAGC 

cctggctaca 
cx:ago6gtga 



CCCCAGG6GG 
OQGGSTOGTC 
GOGCOOQGOC 
ACAAGQTGTA 
agattgtgcc 
acxsactctgt 
G6CAGATTCA 
CTGTATAA 



TAACOGCTPG 
CrCACGCCAO 
GCCCCCAACA 
TGTCAACATG 
ACATCAAOGG 
OOCCATGACA 
O 'ftXXX.'Cn'C 



480 
540 
600 
660 
720 
780 
840 



Seq ID KOi 114 Protein sequence 
Protein Accession XP_062811 

1 11 21 31 41 SI 

I t I I I ) 

KHGARR8SV8 SSWHAASLLQ LLLAAIiLAAG ARAS6EYCH0 HUOAQGVWRZ GFQCPBRFDG 60 

6DATICC0SC ALRYCCSSAB ARLDQGOCDN DRQQGAGBPC RADKDOPDGS AVPIYVPPLI 120 

VGSVFVAPH LGSLVAACCC RCLRPKQDPQ QSRAPGGNRL METIPMIPSA STSRQSSSRQ 180 

SSTAASSSSS ANS6ARAPPT R8QTKOCLPB G7MKNVYVMM PTOFSVUTCQ QATQIVPHQG 240 
QYLBPPyVQY TVQHDSVPMT AVPPFMDGLQ PGYRQIQSPF FHTNSSQKMY PAVTV 

Seq ID NOi 115 DNA sequence 

Nucleic Acid Accession #: NM_0132S7 

Coding sequences 223.. 1512 

1 11 21 31 41 51 

1111)1 

GGTCTGCTCr. TGAGGGATTA AATGCAAAGA GATCACACX3V TGGACTACAA GGAAAGCTGC 60 

CXMUSTOTAA 6CATTCGCAG CTCOGATGAA CACAGA6AGA AAAAGAAGAG GTTTACTGTT 120 

TATAAAGTTC TG6TTTCAGT GGGAA6AAGT GAATGOTTTa TGTTCAGGAG ATATQCAGAG 160 

TITGATAAAC TTTATAACAC TTTAAAAAAA CAGTTTCCTO CTATGGCCCT GAAGATTCCT 240 

GCCAAGAGAA TATTTGGTGA TAATTTTQAT CCAGATTTTA TTAAACAAAQ ACGAGCAGGA 300 

CTAAAOGAAT TCATTCAGAA CCTAGTTAGQ TATCCAGAAC TTTATAACCA TCCAGATGTC 360 

AGA6CATTCC TTCAAATGGA CAGTCCAAAA CACCAGTCAG ATGCATCTGA AGATGAGGAT 420 

GAAAGAAGTT CTCAGAAGCT ACACTCTAGC TCACAGAACA TCAACCTGGG ACCXSTCTGGA 480 

AATCCTCATG CCAAACCAAC TGACTTTGAT TTCTTAAAAG TTATTGGAAA A66CA6CTTT 540 

GGCAAGGTTC TTCTT6CAAA ACGGAAACTQ GATGGAAAAT TTTATGCTGT CAAAGTGTTA 600 

CAGAAAAAAA TAGTTCTCAA CAGAAAAGAG CAAAAACATA TTATGGCTGA AOGTAATGTG 660 

CTCTTGAAAA ATGTGAAACA TCCGTTTTTG GTTGGATTGC ATTATTCCTT CCAAACAACT 720 

GAAAAGCTTT ATTTTGTTCT GGATTTTQTT AATGQAGGGG AOCTTTTTTT CCACTTACAA 780 

AGAGAAOQGT C C TTTCCTGA GCACAGAGCT A6GTTTTA06 CTGCTGAAAT TGCTAGTGCA 840 

TTGGGTTACT TACATTCCAT CAAAATA6TA TACAQAGACT TGAAACCAGA AAATATTCTT 900 

TTGGATTCAG TAOGACATGT TGTCTTAACA GATTTTGGGC TTTGTAAAGA AGGAATTGCT 960 

A7TTCTGACA CCACTACCAC ATTTTGTGGG ACACXAGAGT ATCTTGCACC TGAAGTAATT 1020 

AGAAAACAGC OCTATGACAA TACTOTAOAT TGGTGOTGCC TTGGGGCTOT TCTGTATGAA 1060 

ATQCT6TATG GATTGCCTCC TTTTTATTGC OGAGATGTTG CTQAAAT6TA TGACAATATC 1140 

CTTCACAAAC CCCTAAQTTT GAGGCCAGGA OTGAGTCTTA CAGCCTGGTC CATTCTGGAA 1200 

QAACTCXTTAG AAAAAGAGAG OCAAAATOGA CTTGGTGCCA AGGAAGACTT TCTTGAAATT 1260 

CA6AATCATC CTTTTTTTGA ATCACTCA6C TGG6CIGACC TTGTACAAAA GAAGATTCCA 1320 

CCACCATTTA ATGCTAATGT G6CTGGAGCA 0AT6ATATCA GAAACTTTGA CACAGCATTT 1380 

ACAGAAGAAA CAGTTCCATA TTCTGTGTGT GTATCTTCTO ACTATTCTAT AGTQAATGCC 1440 

AGTGTATTGG AGGCAGATQA TGCATTCGTT GGTTTCTCTT ATGCACCTCC TTCAGAAGAC 1500 

TTATTTTTGT GAGCAGTTTG CCATTCAGAA ACCATTGAGC AAAATAAGTC TATAGATGGQ 1560 

ACT6AAACTT CTATTTGTGT GAATATATTC AAATATGTAT AACTAOTQGC TCATTTTTAT 1620 

ATOTAATOAT GAAAACTATO AAAAAATOTA TTTTCTTCTA TGTGGAAOAA AAATAGGGCA 1680 

TTTCAAAGAG CTGTTTTGAT TAAAATTTAT ATTCTTQTTT AATAAOCTTA TTTTTAAACA 1740 

ATTTAAAAGC TATTATTCTT AGCATTAACC TATTTTTAAA GAAACCTTTT TTGCTATTGA 1800 

CTGTTTTTTC CCTCTAAGTT TACACTAACA TCTACCCAAG ATAGACTGTT TTTTAACAGT 1860 

CAATTTCAOT TCAGCTAACSl TATATTAATA CCTTTQTAAC TCTTTGCTAT GGCTTTTGTT 1920 

ATCACAOCAA AACTATSCAA TTGOTACATG GTTGTTTAAO AAISAAAOOGT ATTTTTGCAT 1980 

6ATAAATCAC TGTrTOAAAT ATTTGGTTCA TGOTATGATC 6AAATQTAAA AGCATAATTA 2040 

ACACATTGGC TGCTAGTTAA CAATTG6AAT AACTTTATTC TGCAGATCAT TTAAGAAGTA 2100 

ACAGGCCGGG CGCXXTPGGCT CAOGCCTGTA ATCCCAGCAC TTTGGGAGGC TGAGQCGGGC 2160 

AQATGACCTG AGGTCAGGAG TTGGAGACCA GCCTGACCAA CATG6ACAAA CCCOGTCTCT 2220 

ACTAAAAATA CAAAATT6GC AGG6TGTGGT GGCACATOCC TATAAtCCCA QCTACTTGGQ 2280 

AGQCTAAGGC AGOAQAATOG CTTGAACCXDG GGAGQOGQAG OTTGCAGTGA GCOGAGATOO 2340 
CACCATTGCA CTCCTGCCTG GGCAACAAGA GT6AAACTCC ATCTCCAAAA A 

Seq 10 NO: 116 Protein sequence 
Protein Accession ft: KP_037389 



HALKIPAKRI 



YAVKVLQKKI 
LPFHLQRERS 
CXBGIAISDT 
EKYDITILUKP 
VQKKIPPPFN 
APPSEDLFL 



11 
I 

FQDNPDPDFI 
QKLHSTSQNI 
VIJJRKBQKHI 
PPEHRARFYA 
TTTPCGTPEY 
LSLRPGVSLT 
PNVAGPDDIR 



21 
I 

KQRRAGLNBF 
MLGPS^IPHA 
MAERMVIiLXN 
AEIASALGYL 
LAPEVIRXQP 
AHSILBELLB 
NFDTAFTEBT 



31 
I 

ZQNLVRYPEL 
KPTDPDFLKV 
VKHPFLVGUi 
H8IKZVYRDL 
YDNTVDWWCL 
KDRQNRLGAR 

vpysvcvsso 



41 
1 

YNHPDVRAPL 
IGX6SPGKVL 
YSFQTTEKLY 
KPEIIILLDSV 
GAVLYS4LYG 
BDFLEIQNHP 
YSXVKASVIiB 



51 
1 

QMD8PXHQSD 
LAXRKUXIKF 
FVLDPVNGGB 
GHWLTDFGL 
LPPPYCRDVA 
FFESLSHADL 
ADOAFVOFSY 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO; 117 DNA sequence 

Nucleic Acid Accession fi: NM 004004.1 
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Coding sequences 1..681 

1 11 21 31 41 51 

lilt I I 

ATGGATTGG6 GCACGCTGOV GAOGATCCTG GGGGGTOTOA ACAAACACTC CACCAGCATT 60 

GC5AAAGATCT GGCTCACXXTT CCTCTTCATT TTTCGCATTA TGATCCTOGT TGTGGCTGCA 120 

AAGOAGGTGT GGGGAGATGA GCAGGCCGAC TTTGTCTGCA ACACCCTGCA GCCAGQCTGC 180 

AAGAAOGTGT GCTACX3ATCA CTACTTCCCC ATCTCCCACA TCOCXXTTATG GGCCCT6CA0 240 

CTOATCTTCO TOTCCAGCCC AgC G CTCCTA GTG6CCATGC AOGTGGOCTA C0Q6AGACAT 300 

GAGAAGAAGA G6AAGTTCAT CAAGOGGGAG ATAAAGAGTG AATTTAAGGA CATGQAGGAG 360 

ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC AAGCAGCATC 420 

TTCTTCOGGG TCATCTTCGA AGCCGCCTTC ATGTACCTCT TCTATGTCAT GTACGAOGGC 480 

TTCTCCATGC AGOGGCTGGT GAAGTGCAAC GCCrGGCXTTT OTCCCAACAC TGTGGACTGC 540 

TTTGTOTCCC GGCCXA0Q6A QAAQACTGTC TTCACAGTGT TCATGATTGC AGTGTCTGGA 600 

ATTTGCATCC TGCT6AATQT CACTGAATTG TOTTATTTQC TAATTAGATA TTGITCrGGQ 660 
AAGTCAAAAA AGCCAGTTTA A 

8eq ID UOt 118 Protein eequence 
Protein Accession #t NP_003995.1 

1 11 21 31 41 SI 

I I i I I 1 

HDHGTLQT2L G6VNKHST8I GKINLTVLFI FRIMILWAA KEVWGDEQAD FVCNTLQPGC 60 
XNVCYDHyFP ZSHIRLWALQ LXFVSSPALL VAMRVAYRSH EKXRKFIK6B IKSBFKDXSB 120 
IKTQKVR1B6 SUWTYTSSI PPRVIPEAAP MYVFYVMYDO PSMQRLVKCM AWPCPMTVDC 180 
FVSRFTBICrV FTVfMIAVSQ ICXLLNVTEL CYLLZRYCSO KSKKFV 



Seq ID NO: 119 PNA eequence 

Nucleic Acid Accession «i XM_061091.1 

Cbding sequences 1..2461 

1 11 21 31 41 SI 

I 1 I 1 I i 

AT6CCAAATA CTTCAGGAAC AACXAGGATT QAAATrTGGC TTCTCCAAGA GC06CCCGGG 60 

CACOGAGCGC TGGTCGCCGC TCTCCTTCOO GTGAGTCCCA GCCCOaAGTT GGCTCTGGOG 120 

CCCGGGTACC OGCCAQTGCC 6GCTGCCX3AT QACX:GATTCA CGCTCCOGAT GATTGGAGGT 180 

CAQATOCATG GT6AGAAGGT AGATCTCTGG AGCCTTGGTG TTCTTTGCTA TGAATTTTTA 240 

GTTGGGAAGC CTCCTTTTGA G6CAAAG6AA GTCCATGTAA GCAAAOAAAC CATCG6GAAG 300 

ATTTCAGCTO CCAGCAAAAT OATGTGGTGC TaSGCTGCAG TGGACATCAT GTTTCTGTTA 360 

GATGGGTCTA ACAGCGTC3GG GAAAGGGAGC TTTQAAAGGT CCAAGCACTT TGGCATCACA 420 

GTCTGTQACQ GTCTGGACAT CAQCCCCGAG AGGOTCAGAG TGGGAGCATT CCAGTTCAQT 480 

TCCACTCCTC ATCTGGAATT COCCTTGGAT TCATTTTC3UI CCCAACAGGA AOTQAAGGCA 540 

AGAATCAAGA GGATGGTTTT CAAAGGAGGO OQCAOQQAOA OQGAACTTGC TCTGAAATAC 600 

CTTCT6CACA GAGGGTTGCC TGGAGGCAOA AATGCTTCTG TGCCCCAGAT CCTCATCATC 660 

GTCACTGATG GGAAGTCCCA GGGG6ATGTG GCACTGCCAT CCAAGOWSCT GAAGGAAAGO 720 

QGTGTCACTG TGTTTGCTQT GGGGGTCAGG TTTCCCAGQT GGGAGGAGCT GCATGCACTG 780 

GCCA6CGAGC CTAGAG66CA GCAGGTGCTG TTGGCTGAGC AGGTGGAGQA T6CCACCAAC 840 

G6CCTCTTCA GCACCCTCAa CAGCTOGGOC ATCT6CTCCA GCiGGCAOQGC A6CTGGGA6C 900 

CCCGAGCTTG TCTTCATGQA GCGGTTAATG G6CATCTCTC TGATAGOOCC CTGT6ACTGG 960 

CAGCCCTGCC AQAATGGAGG CACATGTGTT CCAGAAQQAC TGQAOGGCTA CCAGTQCCTC 1020 

TGCCCGCTGG CCTTTOGAGG GQAGQCTAAC TGTGCCCTGA AGCTGAGCCT GGAATGCAGG 1080 

GTCGACCTCC TCTTCCTGCT GGACAGCTCT GCGGGCACCA CTCTGOACGG CTTCCTGC66 1140 

GCCAAAGTCT TC8TQAAGGQ GTTTGTGCGO GCOOTGCTGA GGQAGGACTC TCOGGCCCGA 1200 

GTGGGTGTGQ CCAGATACAG CAGGGAGCTG CTGGTGGOGG TGCCTGTGGO GGAGTACOUS 1260 

GATGTGCCTG ACCTGQTCTG GAGCCTCGAT GQCATTCCCT TCCGTGGTGG CCCCACXXH^ 1320 

ACGGGCAGTQ CCTTGCXSGCA GGOGGCAGAC CX3TGGCTTCG GGAGOGCCAC CAGGACAGGC 1380 

CAGGACCGGC CACGTAGAGT aC3TGGTTTTG CTCACTGAGT CACACTCCGA GGATGAGGTT 1440 

G03GGCCCAG G6CX3TCAOGC AAGGGOQ06A GA6CTGCTCC T6CT0GGTGT AGGCAGTGA6 1500 

GCGGTGCGGG CAGAGCTGGA 6GAGATCACA GGCA60CCAA AGCAT6TGAT GGTCTACTOQ 1560 

GATCCTCAGG ATCTGTTCAA CCAAATCCCT GAGCTGCAGG GGAAGCTGTG CAGCCGGCAQ 1620 

CGGCCAGGGT GCOGGACACA AGCCCTGGAC CTOGTCTTCA TGTTGGACAC CTCTGC CTCA 1680 

GTAGGGCCCX3 AGAATTTTGC TCAGATGCAO AGCTTTOTGA GAAGCTOTGC CCTCCAGTTT 1740 

GAG6TGAACC CTGAOGTGAC ACAGGTOQGC CfG G TG G TGT ATGGCASOCA GGTGCA8ACT 1600 

GCCTTCGGGC TGGACACCAA ACCCACCCGG GCTGGOATGC TGOGGGCCAT TA6CCAGGCC 1860 

CCCTACCTAG GTGGGGTCGG CTCAGCOSGC ACCX5CCCTGC TGCACATCTA TGACAAAGTG 1920 

ATGACCGTCC AGAGGGGTGC CCGGCCTGGT GTCCCCAAAG CTGTGQTGQT OCTCACAGGC 1980 

GGGAGAGGOQ CAGAGGATGC AGCCGTTCCT GCCCAGAA6C TGAGGAACAA TOGCATCTCT 2040 

GTCTTGGTCO TGGGOGTGGO GCCTGTCCTA AGTGAOGGTC TGCBGAaGCT TGCAGGTOCC 2100 

OGGGATTCCC TGATCCACGT GGCAGCTTAC GC0GACCT6C GGTACCACCA GGAGGTGCTC 2160 

ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG CCAGTCAACC TCTQCAAACC CAGCCGGT6C 2220 

ATGAATGAGG GCAGCTGOGT CXTTGCAGAAT GGGAGCTACC GCTGCAAGTG TCGGGATGGC 2280 

TGGGAGGGCC C0CACTGCX3A OAACX^TGAG TGGAGCTCTT GCTCPOTATG TGTGA6CCAG 2340 

GQAT6GAT7C TTGAQAOGCC CCTGAGGCAC ATG6CTCC0G TGCAGGAGQG CAOCAGCGGT 2400 

ACCCCTCCCA GCa^ACTACAO AGAAGGCCTG GGCACTGAAA TGOTGCCTAC CTTCTGGAAT 2460 
GTCTQTGCXX; CAGOTCCTTA 6 

Seq ZD NO: 120 Protein sequence 
Protein Accession #t XP_061091.1 

1 11 21 31 41 51 

111)11 

MPNTSGTTRl BZHLLQEPP6 HRALVAALLP V8PSPBIALA PGYPPVPAAD DRFTLPMIGO 60 

QMH6EKVDLW SLGVLCYBFL VGKPPFBANB VHVSKBTIGK ZSAASKMMWC SAAVDiMFLIi 120 
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DOSNSVGRGS FERSKHFAIT VC3DGLDISPB RVRVGAPQPS STPHLBPPLD SPSTQQBVKA 180 

RIKRMVPKGG RTETELALKY LLURGLPGGR MASVPQILII VTDGKSQGDV ALPSKQLKER 240 

6VTVFAVGVR FPRWEEUIAL ASBFRGQHVL LABQVCDATN GLPSTLSSSA ICSSATPAGS 300 

PBLVFMERLH GISLIGPCTO QPCQHGGTCV PBGIiDGYQCL CPIAFGGBAN CALKLSLECR 360 

S VDLLPLU)S8 AGTTU)6FLR ARVPVXRFVR AVLSBDSRAR VGVATYSRBL LVAVPVOBYQ 420 

DVPDLVHSLD GIPPRGGPTL TGSALRQAAB RGFGSATRTG QDRPRRVWL LTESHSEDBV 480 

AOPARHARAR BLLLLGVGSB AVRAELEEIT GSPKHVMVYS DPQDIiFNQIP ELQGKLCSRQ 540 

RPGCRTQAIiD LVFKLDTSAS VGPENPAQMQ SPVRSCAIjQP EVNPDVTQVG LWYGSQVQT 600 

^ AFGLDTKPTR AAMLRAZSQA PYLGGV6SAG TAIjLHIYDKV MTVQRGARPG VPKAWVLTO 660 

10 ORGABDAAVP AQXUWKGXS VLWGVGPVL SBGUUILAGP RDSLIBVAAY ADUiYHODVL 720 

XEHLCGEAKQ PVNLC3CPSPC HNEQSCVliQN GSYRCKOW MEGPHCBNRE WSSCSVCVSQ 7 BO 
GHZLBTPLRH NAPVQBOSSR TPPSHYRBGL GTEHVPTFHN VCAFGP 



Seq ID NO: 121 DNA eequence 
IS Nucleic Acid Accession ff: Sos sequence 
Coding sequence I 1..2424 
I 

1 11 21 31 41 51 

^^1 I I I I 1 

20 ATGCCCCCTT TCCI G TT G CT 6QAGGCC0TC TGTGTTTTCC TOTTTTCCAG AOTQCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TOGQQAAGAT TTCAGCTGCX: 120 

AGCAAAATGA TQTGGTGCTC GGCTGCAGTQ GACATCATGT TTCTGTTAGA TOGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGOTCC AAOCACTTTG CCATCACAGT CTGTGACGGT 240 

CIGQACATCA GCGCOGAOAO GGTCAGAGTG G6AGCATTCC AGTTCAGTTC CACTCCTCAT 300 

25 CTGOAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGO 360 

ATGGTTTTCA AAGGAGGGCG CACGGAOACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAQAAA TGCTTCTGTG CCCCAOATCC TCATCATCGT CACTGATGGO 480 

AAGTCCX3U3G GG6ATGTGGC ACTGCXATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 

' m taCTGTGG OaOTCAOGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAQGQAGOCT 600 

30 AGAGGGCAGC A06T0CTGTT GOCTGAGCAO OTGGAGGATO CCACCAAOGG CCTCTTCAGC 660 

ACCCTCAGCA GCTCQGCCAT CTGCTCCAGC GCX3V0GCCAG ACTGCAGGGT C3AGQCTCAC 720 

CCCT6TGAGC ACAGQACGCT GGAGATGGTC CGGQAOTTOO CTGQCAATGC CCCATGCTGG 780 

AGAGGATGGC GGOGGACCCT TGOGOTGCTG GCTQCACACT GTCCCTTCTA CAGCTGGAAO 840 

AGAGTGTTCC TAACCCACCC T6CCACCXGC TACAG6ACCA CCTGCCCAOQ OCCCTOTQAC 900 

35 TCGCAGCCCT GCCAGAATGG AGGCACATGT OTTCCAGAAG GACTGGAOGG CTACCAGTGC 960 

CTCT6CC0GC TGGCCTTTGO AGGGGAGGCT AACTGTGCCC TGAAGCTQAQ CCTGGA ATGC 1020 

AGGGTOGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CX3GCTTCCTG 1080 

COGGOCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCOGTGC TGAGCGAGGA CTCTC3QGGCC 1140 

CGAQTGGGTQ TG6CCACATA CAGCAGGQA6 CT G CTGQTGG CX3QT00CTqT GGGQQAGTAC 1200 

40 CAGQATGTGC CTGACCTGGT CTGGA6CCTC GATGGCATTC CCTTCOGTGG TGGCCCCACC 1260 

CTGAOGGGCA GTGCCTTGCG GCAGGCX^GCA GAGCXSTGGCT TCGOOAGCSGC CACCAGGACA 1320 

GGCCAGGACX: GGCCACGTAQ AGTGGTGOTT TTQCTCACTG AGTCACACTC C6AGGATGAG 1380 

GTT6CGGGCC CAOCGOGTCA CX3CAAGGGCG CXSAGAGCT6C TCCTGCPQGG TGTACGCAGT 1440 

GAGGCCGTGC GGGCAGAGCT GQAGGAGATC ACAGGGAGCC CAAAOCATGT OATGGTCTAC ISOD 

45 TOGGATCCTC AGGATCTGTT CAACCAAATC CCTOAGCTGC AGGGGAAGCT GTGCAGCCGG 1560 

CAGOGGCCAQ GGT6CCGGAC ACAAGCCCTG GACCT06TCT TCATGTTGGA CACCTCTGCC 1620 

TCAGTAGQGC CCGAGAATTT TGCTCAQATG CAQAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680 

TTT6AGQTGA ACCCTGACGT GACAOW^TC GGCCTGGTGG TGTATGGCAG CCAGGTGCAQ 1740 

ACTGCXTTTCO GGCTGGACAC CAAACCCACC CQGGCTGOaA TQCIGG666C CATTAGCCAG 1800 

50 GCCCCCTACC TAGQTGGGGT GGGCTCAGCC GGCACC6CCC TGCTGCACAT CTATGACAAA 1860 

GTGATGACCXS TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAQA AGCTGAGQAA CAATGGCATC 1980 

TCTOTCTTGG TCGTGGGOGT GGGQCCTGTC CTAAGTGAGG GT CTGO QQAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TAOQCGQACC TGOGGTACCA CCAGGACQTG 2100 

55 CTCATT6AGT OGCTGTGTGG AGAAGCCAAQ CAGOCAGTCA ACCTCTQCAA ACCCAGCCOG 2160 

TGCATGAATG AGGGCAGCTG OGTCCTGCAQ AATOOGAOCT ACCQCTGCAA GTGTCGGGAT 2220 

GGCTGGQAGG GCCCCCACTG OGAGAACCGT GAGTGGAGCT CTTGCTCTQT ATGTGTGAGC 2280 

CAQGGATGGA TTCTTGAGAC GCCCCTGAGQ CACATGGCTC CCXSTGCAGGA GGGCAGCAGC 2340 

CX3TA0CCCTC CCAGC3UVCTA CAGAGAAGQC CTGGQCACTO AAATGGTGCC TAOCTTCTGG 2400 
60 AATGTCTG T G CCCCAGGTCC TTAG 



Seq ID NOs 122 Protein sequence 
Protein Accession #t Eos sequence 

65 1 11 21 31 41 51 

I i I I ) I 

MPPPLLLEAV CVFLPSRVPP SULQBVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSPERS KHFAITVCDG LDISPBRVRV GAPQFSSTPH LBPPLDSFST QQBVKARIKR 120 

MVPKQGRTBT ELAUCyLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

70 FAVGVRFPRW EELHALASEP RGQHVUiAEQ VEDATNGLPS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTIiSIV RSFAGNAPCH RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPC3) 300 

SQPOQNGGTC VPE6UX3YQC LCPLAFGGBA NCALKLSI^ RVDUjFLIjDS SAGTTLDGFL 360 

RAKVFVKRPV RAVLSBDSRA RVGVATYSRB LLVAVPVGBY QDVPDLVWSIi DGIPFRGGPT 420 

LTOSALRQAA BRGFG9ATRT GQDRPRHVW LLTBSHSBDE VAGPARHARA RELL Ii L OVGS 480 

75 EAVRAELEEI TGSPKHVMVY 6DPQDLFNQI PBLQGKLCSR QRPGCRTQAI. DLVFMLDTSA 540 

8VGPENFAQM QSFVXtSCAIiQ FEVNPDVTQV GLWYGSQVQ TAPGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIVDK VMTVQRGARP GVPKAVWLT GGRGABDAAV PAQKLRNNGI 660 

SVIjWGVGPV LSBGI*RRIiAO PRDSLIHVAA YADIiRYHQDV LIEWLCGEAK QPVNLCKPSP 720 

04NEGSCVLQ NGSYRCKCRD GWBGPHCBNR EHSSCSVCVS QGHILBTPLR HMAPVQBGSS 780 

80 RTPPSNYREG LGTEMVPTFW NVCAPGP 



Seq ID KOt 123 PMA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence! 8 9.. 23 56 
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1 11 31 31 41 51 

I I I 1 I I 

GCCCCCTGGC 0CGA6CCGC6 C C XXSGGTCTq TGAGTAGA6C G6COC3G6GaV CCGAGCGCT6 
GT06C06CTC TCC r fC CO n' ftTATGAAOVT GCCCCCTTTC ClOTT GC r GQ AAGCCGTCTO 
TGTTTltX'lt; TTTTCCAGAG TGCCCCCATC TCTCCCTCTC CA6QAAGTCC ATGTAAGCAA 
AGAAACCATC GGQAAGATTT CAGCTGCCAG CAAAATOATO TGGTGCTOGQ CTGCAGTGGA 
CATCATGTTT CTGTTAQAT6 GGTCTAACAG CGTCGGGAAA GGGAGCTTTG AAAGGTCCAA 
GC^CTTTGCC ATCACAGTCT GTGA0G6TCT GGACATCAGC OOOGAGAGGO TCA0AGTG60 
AGCATTCCAO TTCAGTTCCA CTGCTGATCT GGAATTCOCC TTG6ATTCAT TTTGAACCCA 
ACAG6AAGTG AAG6CAAGAA TCAAGAGGAT GGTTTTCAAA GGAGGGOSCA OGGAQACGGA 
ACTTGCTCTG AAATACCTTC TGCACAOAGG GTTGCCTGGA GGCA6AAATG CTTCTQTGCC 
CCAGATCCTC ATGATCOTCA CTGATGGQAA GTCCCAGGGG GAT6TGGCAC TGGCATCCAA 
6CASCTGAAG GAAA6GG6TG TGACIGTGTT T6CIGTGGGG GTCAGGTTTC CCAGGTGGGA 
GGAOCTGCAT GCACTGGCCA GC6A6CCTAG AGG6CA6CAC GTGCTGTTG6 CTGA6CAGOT 
GGAGGAT6CC ACCAACGGCX: TCTTCAGCAC CCTCAGCAGC TOGGCCATCT GCTCCAGC6C 
CACX3CCAGAC TGCAGGGTOG AGGCTCAOCC CTGTQAGCAC AGQACGCTGG AGATGGTCOO 
GGAGTTCGCT GGCAAT6CCC CATGCTGQAO AGGATGGCGO CGGACCCTTG OGGTGCTGGC 
TGCACACTGT CCCTTCTAGA GGTGQAAGAO AOTOTTCCTA ACCCACCCTG CCACCTQCTA 
CAGGACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC CAGAATGGAG GCACATGTGT 
TCCAGAAGGA CTGGACGGCT ACCAGTGCCT CTQCCOGCTG GCCTTTGGAG 6GGAGGCTAA 
CTGTGCCCTG AAGCTGAGCC TGGAATGCAG GGTCQACCTC CTCTTCCTGC TGGACAGCTC 
TGCGGQCACC ACTCTGGACG GCTTCCTGCG GGCCAAAGTC TTCXJTGAAGC GGTTTQTaCG 
GGC0GT6CTG AG06AGGACT CTOGGGCCCG AGTGGGTGTG GCCACATACA GCAGGGAGCT 
GCTGGTGGCG GTGCCTGTGG GGGAGTACCA G6ATGTGCCT GACCTGGTCT 6GAGCCT0GA 
TGGCATTCCC TTCCGTGQTQ QCCCCACCCT GACGGGCAGT GCCTTCOGGC AGGC3GGCAGA 
GCGTGGCTTC GGQAGGGCCA CCAGGACAGO CCAGGACCGO CCACGTAGAG TGGTGGTTTT 
GCTCACTGAG TCACACTCCG AGGATGAGGT TGC?GGGCCCA GG6CQTCA0G CAAGGGOGCG 
AGAGCTGCTC CT6CTGGGTG TAGGCA6TGA GGCCGTGGGG GCAGAGCTGG AGGAGATCAC 
AGGCAGCCCA AAGCATGT6A TGGTCTACTC GGATCCTCAQ QATCTGTTCA ACXIAAATCCC 
TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GOGGCCAGGG TQ<XGGACAC AAGCCCTGGA 
CCTCGTCTTC ATGTTGGACA CXTTCTGOCTC AGTAOGGCCC GAGAATTTTQ CTGAGATQCA 
OAGCTTTGTG AGAAOCTGTG CCCTCCA6TT TGAGGTGAAC CXTGACGTGA CACAGGTCGG 
CCTGGTGGTG TATGGCAGCC AGGTGCAGAC TGCCTTCXK3Q CTGQACACCA AACCCACCCO 
GQCTGCGATG CTOOaGGCCA TTAGCCAGGC CCCCTACCTA GGTGGGGTGG GCTC3VGCCGG 
CACaSCCCTG CTGCACATCT ATGACAAAGT GATQACCGTC CAGAGGGGTG CCCGGCCTGG 
TGTCCCGAAA GCTGTGGTGG TGCTCACAGO CGGGAGAGGC GCAGAGGATG CAGCCGTTCX: 
TGCCCAGAAG CTOAGGAACA ATGGCATCTC TGTCTTGGTC GTGGQOGTGG GGCCTGTCCT 
AAGTQAGGGT CTGCGGAGGC TT6CAGGTCC CXX3GQATTCX: CTGATCCACX} TGGCAGCTTA 
CGCOQACCTG CGGTACCACC AGGACGTGCT CATTGAGTGG CTGTGTGGAG AAGCX:AAGCA 
GCCAOTCAAC CTCTOCAAAC OCAGCCO6T0 CAT6AA TQAG QGCAQCTOGO TCCTGCAGAA 
TGGGAGCTAC OQCTGCAAGT GTGGGGATGG CTGG6AGG6C CCCCACT6C8 AGAACOGATT 
CTTGAGAOGC CCCTGAOGCA CATGGCTCCC GTGCAG6AGG GCAGCAGCX:0 TACCCCTCCC 
AGCAACTACA GAGAAGQCCT GQGCACTGAA ATGGTGCCTA CCTTCTGGAA TOTCTGTGCC 
CCAGGTCCTT AGAATGTCTG CTTCCOSCCXS TQGCCAGQAC CACTATTCTC ACTGAGGGAQ 
QAG6A TQT0C CAACT GCAGC CATGCTQCTT AQAOACAAOA AAQ CftGCIOA TQTCACCCAC 
AAAGQATOTT GTTGAAAAGT TTT6ATGTGT AAGTAAATAC CCACTTTCT6 TACCT6CT6T 
6CCTTGTTQA GGCTATGTCA TCTGCCACCT TTCCCTTGA6 GATAAACAAG GQGTCCTGAA 
GACTTAAATT TAGCGGCCTG ACGTTCCTTT GCACACAATC AATGCTOGCC AQAATGTTGT 
TGACACAGTA ATGCCCAGCA GAGGCCTTTA CTAGAGCATC CTTTGGAOGG 



Seq ID NOi 124 Protein aequence 
Protein Accession #t Eos sequence 



1 
I 

MPPFIiUjBAV 
8VGKGSFBR8 
MVFKGGRTBT 
PAVGVRFPRW 
PCSHRTLEMV 
SQP0(9IGGTC 
RAKVPVKRFV 
LTGSALRQAA 
EAVRAELEEI 
SVOPBNFAQM 
APYLGGVGSA 
SVbWGVGFV 
CMNBGSCVIiQ 



11 
I 

CVFLFSRVPP 
KHFAITVCDG 
BLALKYIiLHR 
EBLBAIiASEF 
REFAG2IAPCW 
VPBCniDQYQC 
RAVLSEDSRA 
ER0F6SATRT 
T6SPKHVMVY 
QSFVRSCShLQ 
GTALUHYDX 
LSBGLRRLAO 
MGSYRCKCRD 



21 
I 

SLPIiQEVKVS 
IiDISPERVRV 
GIfPGGRKASV 
RGQHVLLAEQ 
ROSRRTI4AVI. 
LCPLAFGGBA 
RVGVATYSRB 
GQDRPRRVW 
SOPQDLFNQI 
PEVNPDVTQV 
VMTVQRQARP 
PRD8LIHVAA 



31 
I 

KETXGKISAA 
GAFQPSSTPH 
PQILIIVTXX3 
VEDATN6LP8 
AAHCPFYSWK 
NCALXLSZiBC 
LLVAVPVGEY 
LLTE8HSEDE 
PELQGKLCSR 

GVPKAVWZiT 
YADbRYHQDV 
FLRRP 



41 

I 

SKMNWCSAAV 
LBFPLDSFST 
KSQGDVALP8 
TLSS8AIC88 
KVFLTKPATC 
RVDLLFLU^ 
QDVFDLVWSL 
VAGPARHARA 
QRPGGRTQAL 
TAFQUDTXPT 
G6RGAEDAAV 
LXBWLCOEAK 



8eq ZD NOt 125 DMA sequence 

nucleic Acid Accession «t NM_031943.1 

Coding sequence $ 145.. 1260 



1 
I 

CCOGAGCCCC 
TGCTOCTCCT 
COGATCTGGG 
GTAAAGAAGA 
TCCTCTQATQ 
TCAGTTC3QGG 
G0G AT6AA GT 
CAGCXtTfCAG 



11 
I 

0CCCCTC06G 
GCTGTGGGAC 
CACC0GCC3VC 
ACTTAAA6AA 
ACAGTTOTGA 
AAQGCTGTAG 
TTCCAGCX3CX3 
AQAATTCIGT 



21 
I 

GCCOGGOTOO 
06CTGACCGC 
CA6CATGGAC 
ATTCAGATAT 
CAGCTTTGCT 
QhCXX3QC»6C 
GAGTACCAG6 
GACTGATTCC 



31 
I 

GOGOGGCCAO 
GOGGCTGCTC 
GCTOGCOGOG 
GTGAAGTTGA 
TCTOATAATT 
CAGTGCAGGC 
GGAOCAACCA 
AACTOOGATT 



41 
I 

0CT60CAG0C 
OGCTCTCCCC 
TGCGGCAQAA 
TTTCCATQGA 
TTGCAAACAC 
ACTCTGGACC 
ACAAAAAAGC 
CAGAAOATGA 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
7B0 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2S80 
3640 
2700 
3760 



51 
I 

DXNFLU3GSN 
QQEVKARIKR 
KQLKERGVTV 
ATPDCRVBAH 
YRTTCPOPCD 
SAGTTLD6FI* 
DGIPFRGGPT 
RELLLLGVGS 
DLVFMLDTSA 
RAAMI1RAI8Q 
PAQXLRMNOI 
QPVHLCKPSP 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



51 
I 

GOGCIGCTGC 
GCTCCAAGCG 
AGATGTCAGA 
AACCTOQTCA 
OAGGCTGCAO 
TCTCA6GGTG 
AOAOTCCCGC 
AAGTGGAATG 



60 
120 
180 
240 
300 
360 
430 
480 



1000 
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35 



60 



70 



Seq ZD MOi 126 Protein sequence 
Protein Accession it NP_114148.1 



Seq ID KOt 128 Protein sequence 
Protein Accession ftt AAL09357.1 



Seq ID KOt 129 DWA sequence 

Huclelc Acid Accession fit nm 004952.1 



AATTTmtSQ AGAAAAGQGC TTTAAATATA AAGCAAAACA AAQCAATQCT TCCAAAACTC 540 

ATGTCTSAAT TAGAAAGCTT CCCTGGCTOG TTCOGTOGAA GACATCCiOCT CCCAGGCTCC 600 

CACTCACAAT CAAGGAOACC OOOAAGGOGT ACATTCCCGG G l' GVi ' UC r rC CAGGAGAAAC 660 

« CXTTGAACGOA GAGCTCXJTCC TCTTACCAGG TCAAOGTCCX: GGATCCTOGG GTCCCTTOAC 720 

GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTOOTGAG AAAGAGGAAG 780 

ACOQTGGATG GCTACATGAA T6AA0ATGAC CTGCCCAGAA GCCGTCGCTC CAGATCATCC 840 

GTQACCCTTC OOCATATAAT TCGCCrAGT G GAAGAAATTA CAGAGGAGGA GTTGGAGAAC 900 

OTCTGCAGCA ATTCTCGAQA GAAQATATAT AACOGTTCAC TGGGCTCTAC TTGTCATCAA 960 

- ^ TGCOGTCAOA A OACTAT TGA TACCAAAACA AACTGCAGAA ACCCAGACTG CTGGGGOOrr 1020 

IV CQAGGCCAGT TCTGTGGCCC CTGCCTTCGA AACCGTTATG OTGAAGftGGT GAGGGATGCT 1080 

CTGCTGGATC 0GAACPG6CA TTGCCOGCCT TOTCGAGGAA TCTGCaACTG CAGTTTCTGC 1140 

CBGC AGOSAG ATGGAOQGTG TGOGACTGGG GICCTTOTGr ATTTAGCCAA ATATCATGGC 1200 

TTTG06AATG TQCATGCXTA CTTGAAAAGC CTGAAACAG6 AATTTOAAAT GCAAOCATAA 1260 

1 ^ ^^^"^^gy ^ ATTTGCT6CC TGCCTTCTAC TTCTCAAATC TTTCTTGTAA AAGTTTCCAA 1320 

ID TTTTTTCACT GAAACCTCAO TTAAAAATCT TGATGATCAG CCTGTTTCAT AAGAAACTCC 1380 

ftAT CAAGTTA ATCTTAG CAG ACaiGTOTTT CTGGAGCATC ACAGAAGGTA TATTOCTAGT 1440 

TACACTTTGC CCTCCTGCRO TTTCTTCTCT GCTCCX»ACX: CCCATCTCAT AGCATCCCCC 1500 

TCTATTTCCA ATGCTCCTCT CCAACOSCTT AGTTTCTGAA TTTCTTTTAA ATTACAOTTT 1560 

« TATGAAAOCA TATTTTATTT ACTTGOTGTT GAAATAOCCC TCATAAAACC TAAGCACTTO 1620 

ZU GAAACACAAT AATAQTATTA ACTAACTAGA TCTATTGAAT TTCAOAGAAO AGCCTTCTAA 1680 

CTTGTTTACA CAAAAAOGAO TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740 

TCAAG6CACA AAAGTCTTAA AACX^TQTGG AAAAATTAGG TAATTATTOC AGATTQATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA QACTTAATTT 1860 

CTCCTAATTT CTTCTOCCCO AAGGGTAAGT GGTGCOTCCA GCTTACAGGA TCATAATTCA 1920 

ZO AAGOTTGGTQ GGCAATGTAA TACrTAATTA AAATAATQAT GGAAGAGCTA TCT6GAGATT 1980 

ATGAGTAAQC TGATTTOAAT TTTCAOTATA AAACTTTAOT ATAATTGTAQ TTT6CAAAGT 2040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAAC TAGTCT GTGGTTCTTT aCAGTTTCTT GTAAATTTAT AAACCAGGCA 2160 

CAAG GTTCA A GTTTAQATTT TAAGCACTTT TATAACAATG ATAAOTGOCT TTTTGOAGAT 2220 

JU GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAOTTTCTQ GGCAOOTTTC 2280 

CTGTGTCAGT ATTCCCCCTC CTCnTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340 

AAOTOTTTOT ATGT CCAAT T TACTTGCATA TOTAAACXJVT TGCTGTGCCA TTCAATGTTT 2400 

QAT6CATAAT TGOACCTTGA ATCGATAAGT GTAAATACAO* CTTTTQATCT GTAATGCTTT 2460 
TATACAAAAQ TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 



40 t r r r ? r 

MDARRVPQKD LRVKKNLKKF RYVKLISMET SSSSDDSCDS FASDMPANTR LQSVREGCRT 60 

RSQCRHSGPL RVAMKPPARS TRGATMKKAE SRQPSENSVT DSNSDSEDES GMNPLEKRAL 120 

NIKQNKAMIA KLMSBUISFP GSPRGRHPLP GSDSQSRRPR RRTPPGVASR RNPERRARPL 180 

. TRSRSRILQS LDALPMEEEE EBDKYMLVRK RKTVD6YMMB DDLPRSRRSR SSVTLPHIIR 240 

n PVEEITEEEL ENVCSNSREK ITOR8L0STC HQCRQKTIDT KTNCRHPDCW GVHGQFCGPC 300 

LRNRYGEEVR DALUJPNWHC PPCRGICNCS FCHQRDGRCA TOVLWIiAKY HOPGNVHAyi. 360 
X8LKQBFEMQ A 

Seq ID NOi 127 DMA sequence 
J\) Nucleic Acid Accession #: AF305616.1 
Coding sequence t 1 . . 863 

^ 11 21 31 41 SI 

' I I 1 I I 

J J ATGCACOGCT TGATGGGGOT CftAGAGCACC GCOGCCGCOQ CC0CC3GG0CA GCCCAATCTC 60 

TCXTTGCACGT GCAACTGCAA ACXSCTCTTTG TTCCAGAGCA TGGAGATCAC GGAGCTGGAO 120 

TTTGTTCAGA TCATCATCAT CGTGGTX3GTG ATGATGGTQA TGGTGGTGGT GATCA06TX3C 180 

CTOCTOAGCC ACTACAAGCT GTCTGCAOGO TCCTTCATCA GCOGGCAGAQ CCAGGGGCOO 240 

AGGAGAGAAO ATGCCCIGTC CTCAGAAGGA TGCCTGTGGC OCTOGQAGAa CACAGTGTCA 300 

GGCAAOQGAA TCCCROAGCC GCAOOTCTAC 6CCC0GCCTC 6GCCCAC0QA CCGCCTOGCC 360 

GTX3C0GCCCT TOGCCCAGOG GGAGCGCTTC C3VC0GCTTCC AGCCCACCTA TCCGTACCTG 420 

CA6CACQAGA TCQACCTGCC ACCCACXrATC TCGCTGTCAG ACGGGGAGGA GCCCCCACCC 460 

TACCAGGGCC CCTQCACCCT CCAGCTTOGQ QACCCOOAGC AGCAGCTGGA ACTGAACCGG 540 

^_ GAGTCGGTGC GOGCACCCCC AAACAOAACC ATCTTCGACA GTGACCrSAT GGATAGXGCC 600 

AGGCTOGOOO OCCCCTGCCC CCCOlSCaGT AACTCGGGCA TCA0C3GCXaC GTGCTAOOOC 660 

■ AGCGG0GG6C GCATGQAGOQ QCCGCCGCCC ACCTACAGCQ AGOTCATOSG CCACTACXXTQ 720 

GGGTCCTCXrT TCXAGCACCA GCAGAGCAGT GQOCOGCCCT CCTTGCTGGA GOGQACCCGG 780 

CTCCACCACA CACACATCGC GCCCCTAQAG AQCGCAGCCA TCTGGAGCAA AGAGAAG6AT 840 
AAACAOAAAG GACACCCTCT CTAO 



ne ] 11 21 31 41 51 

MHRLMGVNST AAAAAGQPNV SCTCNCKRSIj FQSMEITELE FVQIIIIVW MMVMVWITC 60 

LLSHYKLSAR SFISRHSQGR RRBDAI»SSEG CLWPSBSTVS GNGIPEPQVY APPRPTDRLA 120 

VPPPAQRBRF HRFQPTYPYL QHBIDLPPTI SI#SD6BBPPP YQOPCTLQLR DPBCXJLBLNR 180 

jj* BSVRAPPNRT ZFDSDLMDSA RLGGPCPPSS NS0I8ATCYG SGGRME6PPP TYSBVZGBYP 240 

OU GSSFQHQQSS GPPSLliBOTR LRBTBIAPLE 6AAIW8XBKD KQKOHPL 



1001 



wo 03/042661 



PCT/US02/36810 



Coding sequence s 1 . . 716 

} 11 21 31 41 51 

s i I i I I I 

J ATGGCX5GCGG CTCCGCTGCT GCTGCTGCTO CTGCTCGTC5C COGTGCOGCT GCTGCCGCTO 60 

CTGGCCCAAQ GGCC06GJVG6 GGCGCTGGOA AACXX3GCAT0 GGQTGTACTG GAACAGCTCC 120 

AAOCAGCACC TGCGGOQAGA 6GGCTACACC GT6CAGGTGA AOGTQAAOQA CTATCTGOAT 180 

ATTTACTGCC OGCACTACAA CA6CT0GGG0 GTGQQCCCOG GOGOOQCSAOC OGOGCCGGQA 240 

GGOGGGGCAQ AGCAGTACGT GCTGTACATG GTGAGCCGCA AOCSGCTACOJ CACCTGCAAC 300 

IV GCCAGCCAGG GCTTCAAGCG CTGGGAGTGC AACCGGCOGC ACGCCCOSCA CAGCCCCATC 360 

AAOTTCTCGO AQAAGTTCCA GCGCTACAGC GCXTTTCTCTC TGGGCTAC5GA GTTCCACGCC 420 

G8CCA0QA0T ACTACTACAT CTGCA08C0C ACTCACAACC TGCACTG6AA QTOTCXQAOG 480 

ATGAAOOTOT TOQTCTGCTO OGCCTCCACA TOQCACTCCG GOQAGAAGCC GGTCCCCACT 540 

- - CTCCCCCAGT TCACCAT6GG CCCCAATGTG AAGATCAACG TOCTGGAAGA CTTTGAGOQA 600 

ID GA GAAC CCTC AGGTGCCCAA GCTTGAOAAG AGCATCAGOG GQACCAGCCC CRAAOGGOAA 660 
CACCTGCCCC TGGCCGTGGG CATCGCCTTC TTCCTCATQA O G H'C n XJU C CTCCTAO 



20 



65 



Seq ZD tK>t 130 Protein eeguence 
Protein Accession ft: MP_004943.i 



1 11 21 31 41 51 

I I I I 1 I 

MAAAPLLLLL LLVPVPLLPL LAQGPGGALO NRHAVYHMSS NQHLRRBGYT VQVNVNDYLD 60 

lYCPHYNSSG VGPGAGPOPG GQAEQYVLYM VSRNGYRTCN ASQGFKRWEC NRPHAPHSPI 120 

ZD KPSEKFQRYS APSLQYBPHA GHBYYYISTP THNLHWKOiR MKVFVCCAST SHSGEKPVPT 180 
I*PQPTM6PNV KINVLEDPEQ ENPQVPiCLBK SISGTSPKRB HLPLAVQIAF PLMTPLAS 

Seq ID NO: 131 DNA sequence 
„ Nucleic Acid Accession #: NM_01244S.1 
OV Coding sequence: 276.. 1271 "~ 

1 11 21 31 41 51 

I I I I - I I 

„ OCAOQAGGGA AQAGGGTGAT COGACCCGGG GAA6GTCGCT GGGCAGGGOO AGTTGGGAAA 60 

JO GCX3GCAGCCC CC6CCGCCCC CGCAGCCCXTT TCTCCTCCTT TCTCCCACGT CCTATCTGCC 120 

TCTOGCTGGA GGCCAGGCCG TGCAOCATCG AAQACAGGAG GAACTGGAGC CTCATTGGCC 180 

GGCCOQGGGC GCC8GCCTCG QGCTTAAATA GGAGCTCOGG GCTCTGGCTG GGACCOGACC 240 

GCTGCOGGCC QOSCTCCOGC TQCTCCTGCC GGGTGATGGA AAACCCCAGC C0GQC0GXXX3 300 

CCCTQGGCAA 6GCCCTCTGC GCTCTCCTCC TGGCCACTCT OQQCGCCGCC GGCCAGOCTC 360 

*fV TTGGGGGAGA GTCCATCTGT TCCGCCAGAG CCCCGQCX3VA ATACAGCATC ACCTTCACGG 420 

GCAAOTGGAQ CCAGACGGCC TTCCCCAAGC AQTACCCCCT GTTCCGCCCC CCTGOGCAGT 480 

GGTCTTOGCT GCTOOOGGCC GCGCATAGCT COGACTACAG CATGTGGAGG AAGAACCAGT 540 

AOGTCAGTAA CGGGCTGC6C GACTTTGC6G AGCGCGGOGA GGCCTGGGCG CTGATGAASG 600 

- _ AGATOQAGOC GOCGGGGGAG GCXSCTGCAGA GOGTGCAOGC OGTOTTTTCQ GOOCCCQCOG 660 

TCCCCAGOGG CACCGGGCAG ACGTOGQCGG AGCTGGAQGT GCA6GGCAGG CACTOOCIOO 720 

TCTOGTTTGT GGTGCGCATC GTGCCCAQCC CCGACTGOTT CGTGGGOGTG GACAGCCTGG 780 

ACCTGTGCGA COGGGACCGT TGGCGGGAAC AGGOGGOGCT GGACCTGTAC CCCTACGACG B40 

COGGGACGGA CAGCGGCTTC ACCTTCTCCT CCCCCAACTT OGCCACCATC CCGCAOGACA 900 

CGGTGACOGA GATAACGTCC TCCTCTCCCA GCCACCOGOC CAACTCCTTC TACTACOOGC 960 

GGCTOAAGGC CCTGCCTCCC ATOOCCAQGa TGACACTGGT OOGGCTGOGA CAOAGCOCCA 1020 

GGGCCTTCAT CCCTCCCGCC CCAGTCCTGC CCAGCAGGGA CAAT6AGATT GTAGACAGCG 1080 

CCTCRGTTCC AGAAACGCCG CTGOACTGCG AGGTCTOCCT GTGGTOOTCC TGOGOACTGT 1140 

GCGGAOGCCA CTGTGGQAGQ CTOGGQACCA AGAOCAGGAC TOGCTAOSTC OGGGTCCAGC 1200 

- - CCGCX3UVCAA CGGGAGCCOC TGCCCOGAGC TOQAAGAAGA GGCTGAOTQC GTGCCTQATA 1260 

ACTQCGTCTA AQACCAGAGC CCOGCRQCCC CTGGGGCOCC OGGAGOCATG GGGTOTOGGQ 1320 

GGCTCCTCTG CAGQCTCATG CTGCAGGOGG CCGAGGCACA GGGGGTTTOO CGCTOCTCCT 1380 

GACCGOGOTG AGGCOGCGCC GACCATCTCT GCACTGAAGG GCCCTCTGGT GGCCGOCACO 1440 

GGCATTOGGA AACAGCCTCC TCCTTTCCCA ACCTTGCTTC TTAGGGGCCC COGTGTCCCQ 1500 

TCTGCTCTCA GCCTCCTCCT CCTGCAGQAT AAAGTCATCC CGAA6GCTGC AGCTACTCTA 1560 

OXJ AATTATGOTC TCCTTATAAG TTATTGCT6C TCCAOGAGAT TCTCCTTCAT 0GTCCA0G6G 1620 

CCTGCCTCCC ACGTGGTTQC AGATACCTCA GACCTGGTGC TCTAGGCTGT GCTCAGCCCA 1680 

CTCTCCOOftQ GGCGCATCCA AGCGGOGGCC ACTTQAGAAQ TGAATAAATG GGOCGGTTTC 1740 

GGAAGCGTCA QTGTTTCCAT OTTATGGATC TCTCTGOGTT TGAATRAAGA CTATCTCIGT 1800 
TGCTCAC 



Seq ID NO: 132 Protein sequence 
Protein Accession «s NP 036577.1 



'70 } 2^ 31 *1 51 

70 I I I I 1 I 

MENPSPAAAL GXALCALLLA TLGAAGQPLO GBSICSARAP AKySITPTGK WSQTAPPKQY 60 

PLPRPPAQWS SUiOAAHSSD YSMWRKNQYV SNGLRDFAER GEAWALMKEI EAAGEALQSV 120 

HAVFSAPAVP SGTGQTSABL BVQRRHSLVS PWRIVPSPD WFVGVDSIiDL CDGDRWREQA 180 

« AliDLYPYDAG TOSGPTPSSP NPATIPQDTV TBZT8SSPSH PAN8PYYPHL KALPPIARVT 240 

fj LVRLRQ8PRA FIPPAPVLPS RDHEIVDSAS VPBTPIJXZV SLHSSWGtiCO GHOGRLGTKS 300 
RTRYVRVQPA NNGSPCPELB EEAECVPONC V 

Seq ID NO: 133 DMA sequence 
NUdeic Acid Accession #: NM_019e94 
OU coding sequence: 1..1314 

1 11 21 31 41 51 

I I I I I I 

ATGTTACAGG ATCCTOACAO TGATCAACCT CTGAACAOCC TOQATGTCAA ACCCCTGCGC 60 
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AAAGCCOGTA TCCCX:ATGGA GACCTTCAGA AAGGTOQGGA TCCCCATCAT OVTAGCACTA 120 

CT6AGCCT6G 06AGTATCAT CATTOTGGTT GTCCTCATCA AG6TGATTCT GGATAAATAC 180 

TACTTCCTCT OCGGQCAGCC TCTCCACTTC ATCCXX^GGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTCTC CCTTGGGGQA GGAOGAGGAQ CACTGTQTCA A(3AGCTTCCC CGAAQOOCCT 300 

GCAGTGGCAG TOCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTCGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTOGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG T6GAGATTGG CCCAGACCAG 480 

GATCTGGAT6 TT6TT6AAAT CACA6AAAAC AGCCAGGAGC TTOGCATGOG GAACTCAAOT S40 

GG6CCCTGTC TCTCAG6CTC CCTGGTCTCC CTOCACTQTC TTGGCTGTG6 GAAGAGCCTQ 600 

AA6ACCCCCC GT6TG6T60O TGG06AGGA0 GCCTCTOTGG A TTCTT GGC C TTG6CAGGTC 660 

AGCATCCAGT AOGACAAACA GCAOOTCrGT GGAGGGAGCA TCCTGGACCC CCACTGGOTC 720 

CTCACGGCAG CCCACTGCTT CAGOAAACAT ACaSATOTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTG6GCAG CTTOCCATCC CTGGCTGTGG CCAAQATCAT CATCATTGAA 840 

TTCAACCCCA TGTACOCCA^ AOACAATGAC ATOGCCCTCA TQAAOCTGCA GTTCCCACTC 900 

actttctcao gcacagtcao gccovtctgt ctgcccttct ttgatgagga gctcactcca 960 

GCX:aCCCCAC TCTGGATCAT TGGATGQGOC TTTAOQAAGC AOAATOGAQG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GOSTACCAOQ GOGAAGTCAC C6AQAAGATO ATGTGTGCAO GCATCCOQGA A6GGGGTGTG 1140 

GACAC CTGCX: AOGOTQACAS TG GTOQTOC C CTOATQTACC AATCTOAGCA GTGQCATOIO 1200 

GTOGQCATCG TTAGCTGGGO CTATGGCTQC GGGG60C0GA GCACC0CA6G AGTATACAGC 1260 
AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA A06CTGAGCT GTAA 

Seq ID NO: 134 Protein sequence 
Protein Accession #: HP 063947.1 



1 11 21 31 41 51 

I I i I i I 

MLQDPOSDQP LNSLDVKPUl KPRIFMETFR KVGJPIZIAL LSLASXIIW VLIKVIiOKY 60 

YFLOGQPIiHP IPRRQLCDQB UXTLGEDBB HCVK8FPBGP AVAVRLSXDR STI^VLDSAT 120 

30 GNWFSACFDK FTEALABTAC RQMGYSSKPT FRAVEIGPDQ DIiDWSITEN SQBLRNRKSS IBO 

GPCL86SLVS LHCLACX3C8L KTPRWGGEE ASVDSHPHQV SIQYDKQHVC GGSIIJ3PHNV 240 

LTAAHCFRKH TDVFNWKVRA GSDXliGSFPS LAVAXIIIIE FNPMYPKDND lALMXLQPPL 300 

TF8GTVRPIC LPFFSEELTP ATPLWZZGWG FTKQliGGKMS DIIiLQASVQV IDSTRCNADD 360 

AYQGEVTBXM MCAOIPBQGV OTOQGDSGGP LMyQSDQmiV VGIVSHGYGC GGPSTPGWr 420 



Seq ID NO: 135 DNA sequence 
Nucleic Acid Accession #t KM__003045 
Coding sequencei 14B..2037 " 

1 11 21 31 41 51 

1 I I I I i 

CQATCCTGCC GGAGCCCCGC CGCCGCOGGC TTGGATTCTG AAACCTTOCT TGTATCCCTC 60 

CTOAQ ACATC TTTQCTGGAA QATOaAGQCT GTCCTCTGGT GAGAACtOTGO TGAGGCTTCC 130 

OGTCATATTC CAGCTCTGAA CAGCAACATO GGGTGCAAAG TCCT3CTCAA CATTGG6CAG 180 

CAGATGCTGC GGCGGAAGGT GGTGGACTGT AGCCG6GAGG AGA0GC3GGCT GTCTCGCTGC 240 

CTGAACACTT TTGATCTGGT GOCCCTOGOO GTOaQCAGCA CACTGQGTOC TGGTQTCTAC 300 

GTOCTGGCTG GAGCTGTY3GC COGTGAGAAT GCAGGCCCTG CCATTGTCAT CTCCTTCCTG 360 

ATCGCT606C TGQCCTCAGT GCTGGCTGQC CTGIGCTATO QCQAaTTTOG TOCTOGGGTC 420 

COCAAGACGG GCTCAGCTTA CCTCTACAGC TATGTCACOO TTGGAGAGCT CTGGGCCTTC 480 

ATCACCGGCT GGAACTTAAT CCTCTCCTAC ATCATGGGTA CTTCAAGCOT AGOGAGGGCC 540 

TGGAGOGCCA CCTTCGACX3A GCTGATAGGC AGACCCATCG GGGAGTTCTC ACGQACACAC 600 

ATGACTCTGA AC36CCCC060 0QT6CTG6CT 6AAAACCCC6 ACATATT06C AGTOATCATA 660 

ATTCTGATCT TGACAGQACT TTTAACTCTT GOTGTQAAAO AGTC9GGCCAT GGTCAACAAA 720 

ATATTCACTT OTATTAAGQT CCTGGTCCTG GGCTTGATAA TGGTGTCAGG ATTT6TGAAA 780 

GGATCX3GTTA AAAACTGGCA GCTCA06GAG GAGGATTTTG GGAACACATC AGGCXX3TCTC 840 

TGTTTGAACA ATGACACAAA AGAAGGQAAG CCCGGTGTTG GT6GATTCAT GCCCTTOGGG 900 

TTCTCTGGTG TCCTGTOGGG GGCAOCQACT TOCTTCTATG CCTTOGTGGG CTTTGACTGC 960 

AT06CCAOCA CAGGTGAA6A GGTGAAGAAC CCACAOAAGG CCATCCCCXaT OGGGATOGTO 1020 

G06TCCCTCT TOATCTGCTT CAT06CX:TAC V riX3U GU TUT OGGCTGCCCT CAGGCTCATG 1080 

ATGCXXn'ACT TCTGCCTGGA CAATAACA6C CXXXTTGCCCG AOSCCTTTAA GCACGTQGGC 1140 

TGGGAAGGTG CCAAGTACOC AGTG6CCGTG GGCTCCCTCT GC6CTCTTTC CGCCAGTCTT 1200 

CTAGGTTCCA TGTTTCCCAT GCCTOQGGTT ATCTAT6CCA TGOCTGAGGA TGGACTGGTA 1260 

TTTAAATTCT TAGGCAAC6T CftATQATAGG A OCAAAA CAC OUVTAAT06C CACATT AGOC 1320 

TCGGGTGCOG TTGCTGCTGT GATQGOCTTC CTCTTT6ACC TGAAGGACTT GGTGGACCTC 1380 

ATGTCCATTG GCACTCTCCT GGCTTACTOG TTGGTGGCTQ CCT G TGTGTT GGTCTTACX30 144 0 

TACCAGCCAO AGCA6CCTAA CCTOGTATAC CAGATGGCCA GTACTTOCGA OGAGTTAGAT 1500 

CCAGCAGACC AAAAT6AATT G6CAAGCACC AAT6ATTCCC AGCTGGG6TT TTTACCAGAO 1560 

GC3VQAGATGT TCTCTTTGAA AACCATAGTC TCACCCAAAA ACATGQAOCC TTCCAAAATC 1620 

TCTGGGCTAA TTGTGAACAT TTCAACCAGC CTTATAGCTG TTCTCATGAT CACCTTCTQC 1660 

ATTOTGACCG TGCTTGGAAG GQAGGCTCTC ACCAAAGGGQ CGCTGTOGGC AGTCTTTCTG 1740 

CTOGCAGGGT CTGCCCTCCT CTGTGCOGTQ GTCACGGGCG TCATCTGQAO GC3^CCCGAG 1800 

AGCAAGACCA A6CTCTCATT TAAGGTTCCC TTCCTGCCAG TGCTCCCCAT CCTGAGCATC 1860 

TTOOTQAACQ TCTATCTCAT QATGCA GCTO GAOCft OGGCA OCTOGG TCOQ GTTTGCTGTG 1920 

T0GATGCT6A TAG6CTTCAT CATCTACTTT GGCTATGGCC TGTGGCACAO OGAGGAGGOG 1980 

TCCCT6QATQ COGACXSU^GC AAGQACTCXT GAOGGCAACT T6GACCAGTG CAAGTCACGC 2040 

ACAGCCOCGC CCCCOGGAGQ TGQCAGCaVGC CCCOAOGGAC GCOCXXAOAO QACCXJGQAGG 2100 
CACCCCAOCC TCCCCACCAG TQCAACAGAA ACXaCCTGOO TCCACACCCT CACTGCA 



80 Seq ID MOt 136 Protein sequence 

Protein Accession fft NM 00304S 



11 21 31 41 51 
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KGCKVLLNIG QQKLRRXWD CSREBTRLSR CLNTFDLVAL GVGSTLGAGV WLAGAVARE 60 

NAGPAIVISP LIAALASVIA GliCyGEFQAR VPXTOSftYLY SYVTVOELWA PITCWNLILS 120 

YIIGTSSVAR AWSATPDBLI GRPIGBPSRT HMTtHAPGVL AEHPOXPAVI IILILTGLLT 180 

LGVKESAMVN KIPTCINVLV U3PIMVSGFV KGSVKNWQLT BEDFGNTSGR LCLNNDTKEG 240 

KPGVGGFMPP GPSGVLSGAA TCFYAPVGFD CIATTGBBVK NPQKAIPVQI VASLLICPIA 300 

YPGVSAALTL MMPYPCLDNM SPLPDAPiCHV GWEGAKYAVA VGSLCALSAS LLOSMPPMPR 360 

VZYAMABDOL LFXPLANVMD RTKTPIIATti ASQAVAAVMA PLFDLKDLVD LMSIGTLLAY 420 

SLVAACVLVL RYQPEQFNLV YQMASTSDEL DPADQHBIAS 1HDSQU5FI,P EAEMPSLKTI 480 

LSPKUMBPSK ISGLIVNIST SLIAVLIITP CIVTVLGREA LTKGALWAVP LLAGSALLCA 540 

WTGVIMRQP ESKTKLSPKV PFLPVLPILS IPVNVYLMMQ LDQGTWVRPA VHMLIGPIIY 600 
POYGLWHSEE ASLDADQART PDQNLDQCK 

Seq ID NOi 137 DNA sequence 

Nucleic Acid Accession Ut NM_032044.1 

Coding sequences 182.. 658 

t r r V ? V 

AAGATATAAA AGCTCCAGAA ACGTTGACTG GGACCACTGG AQACACTGAA GAAGGCAGGG 60 

GCCCTTAGAQ TCTTGGTTGC CAAACAGATT TGCAGATCAA GQAGAACCCA GGAOTTTCAA 120 

AGAAGCGCTA GTAAGGTCTC TOAOATCCTT 6CACTA6CTA CATCCTCAOG GTAGQAGGAA 180 

GATGGCTTCC AOAAOCATGC GGCTGCTCCT ATTQCTGAGC TGCCTGGCX:A AAACAGGAGT 240 

CCTGGGTGAT ATCATCATQA GACCCAGCXQ TGCTCCTGGA TGGTTTTACC ACAAGTCCAA 300 

TTGCTATGGT TACTTCAGGA AGCTQAGGAA CTGOTCTGAT GCCGAGCTCQ AOTGTCAGTC 360 

TTACGGAAAC GGAGCCCACC TOGCATCTAT CCTGAGTTTA AAGGAAGCCA OCACCATAGC 420 

AQAGTACATA AGTGGCTATC AGAGAAOCCA GCCGATATGO ATTGGCCTGC ACGACCCACA 480 

GAAGAGOCAO CAGTGGCAOT OGATTGATGG GGCCATGTAT CTOTACAGAT CCTGGTCTGG 540 

CAAOTCCATQ GGTGGGAACA AGCACTOTOC TQAGATQAOC TCCAATAACA ACTTTTTAAC 600 

TTGGAGCAOC AACGAATGCA ACAAGOGCCA ACACTTCCTG TGCAAGTACC OACCATAGAQ 660 

CAAGAATCAA GATTCTGCTA ACTCCTGCAC AGCCCCGTCC TCTTCCTTTC T6CTAGCC7G 720 

GCTAAATCTO CTCATTAT TT CAGAGGGGAA ACCTAGCAAA CTAAGAGTGA TAAGGGCCCT 780 

ACTACACTGG CTTTTTTAGO CTTAGAGACA QAAACTTTAO CATTGGCCXai GTAQTGGCTT 840 

CTAflCTCTAA ATGTTTGCCC CXSCCATCCCT TTCCACAGTA TCCTTCTTCC CTCCTCCCCT 900 

OTCTCTGGCT GTCTCGAGCA GTCTAGAAGA GTGCATCTCC AGCCTATOAA ACAGCXGQOT 960 

CTTTGGCCAT AAGAAGTAAA GATTTGAAQA CAOAAGGAAO AAACTCAGGA GTAAGCTTCT 1020 

AGACCCCTTC A6CTTCTACA CCCTTCTGCC CTCTCTCCAT TQCCTCCACC CCACCCCAGC 1080 

C ACTCAA CTC CTGCTTGTTT TTCCTTTGQC CATAGGAAGQ TTTACCAOTA GAATCCTTGC 1140 
TAGGTTGATa TGGGCCATAC ATTCCTTTAA TAAACCATTQ TOTACATAAO AAAAAAAAAA 

Seq ID NOi 138 Protein sequence 
Protein Accession #i NP_114433.1 

1 21 31 41 51 

I I 1 I I I 

MASRSMRLLL LLSCLAKTGV U3DIIMRPSC APGWPYHKSH CYGYPRKLRH WSDAELBCQS 60 

YCaJGAHLASI LSLKBASTIA BYISQYQRSQ PIWIOLHDPQ KRQQWQHIDO AMYLYRSWSG 120 
KSMG6NKHCA BMSSNNNPLT H8SNEQIKRQ HFLCXYRP 

Seq ID NO: 139 DNA sequence 

Nucleic Acid Accession #: XN_051860.2 

Coding sequence: 52.. 3042 " 

1 11 21 31 41 51 

I I I I I I 

GCTCACCCA6 GAAAAATATG CAATOGTCCC ATTGATATAC A06CCACTAC AATOGATCGA 60 

GTT»AOCT» GCACCGAGGT TGTCTACAAA AAAOGCCAGG ATTATAGOTT TGCTlGCTAC 120 

OACOOGGGCA GAOCCTGCCG GAQCTACCGT QTACGGTTCC TCTOTGGGAA GCCTGTCAGG 180 

CCCAAACrCA CAGTCACCAT TQACACCAAT GTGAACAGCA CCATTCTGAA CTTQGAaaAT 240 

AATGTACAGT CATOGAAACC TGGAQATACC CTGOTCATTQ CCAOTACTGA TTACTCCATO 30O 

TACCAGGCAG AAQAGTTCCA GQTGCTTCCC TGCRGATCCT GC9QCCCCCAA CCAGOTCAAA 360 

GTGG CAGOQA AACCAATQTA CCTGCACATC QGGGAGGAQA TAGAOGGCGT GGACATGOGQ 420 

GCGQAGOTTQ GGCTTCTQAG COGGAACATC ATAGTGATGQ GGGAGATGGA OGACAAATGC 480 

T ACCCCTA CA GAAACCACAT CTGCAATTTC TTTOACTTCG ATACCrTTGG GGGCCACATC 540 

AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTOGAGGOCA CGGAGCrOAA GCATATOQ6A 600 

CAGCAGCTGO TGGGTCAGTA CCCGATTCAC TTCCACCTOO OOGOTOATOT AQACGAAAGG 660 

GGAGGTTATO ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTOGC 720 

TQ08TCACAG TCCATGGCTC CAATOGCTTG TTQATCAAGQ ACGTTGTGGG CTATAACTCT 780 

TTGGGCCACr GCTTCTTCAC GGAAGATGGG COQGAGGAAC GCAACACTTT TGACCACTOT 840 

CTTGGCCTCC TTGrCAAGTC TGOAACCCTC CTCCCCTOGO ACOGTGACAG CAAGA7GTCC 900 

AAQATGATOl CAGGAGACTC CTACCCAGGO TACRTCCCCA AGCCCAG6CA AGACTGCAAT 960 

GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCATCAACTO TGCOOCTCCA 1020 

GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCOTO 1080 

GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACOGA 1140 

GCACATTCCA ACTACCGG6C T06CATOATC ATAOACAACSG GAGTCAAAAC CACOGAGGCC 1200 

TCTGCCAAOG ACAAOCGGCC GTTCCTCTCA ATCATCTCTO CCAOATACAO CCCTCACCAO 1260 

GACGCOGACC CGCTGAAGCC COQGGAGCCX3 GCCATCATCA GACACTTCAT TGCCTACAAG 1320 

AACCAGGACC ACGGGGCCTG GCTGCGCGGC OGOQATGTGT GQCTQGACAO CTQC06GTTT 1380 

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGQTGGAA CCTTCCOGTA TGACQAOGGC 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTOTTT OTTGGCQAOA OTQGCAAOGT GGGGAGGGAA 1500 

ATGATGGACA ATAGG ATCTG GGGCCCTGGC 6GCTTG6ACC ATAGOGGAAG GACCCTCCCT 1560 

ATAGQCCAGA ATTTTCCAAT TAGAOGAATT CAOTTATATO ATGOOOCCAT CAACATCCAA 1620 

AACTGCACTT TCOOAAAGTT TOTGGCCCTG GAGGGCCGGC ACACCAGOQC CCTGGCCTrC 1680 

CG CCTXS AATA ATGCCTOGCA GAGCTCCCCC CATAACAAOO TGACOOGCAT TGCCTTTGAG 1740 

QA00TTO06A TTACTTCCAG AGTGTTCTTC GGAGAGCCTG OGOCCTGGTT CAACCA6CTG 1800 
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QACATGGATG OOGATAAGAC ATCTGTGTTC CATGAO^rOQ AOGGCTCOGT GTCOGAGTAC IB 60 

CCTGGCTCCr ACXrrCACXSAA GAATQACAAC TGGCTOGTCC GOCACCXaGA CTGCATCAAT 1920 

GTTCCOQACT GC3AGAGGGGC OVTTTGCAGT GGGTGCTATO CACAGATGTA CATTCAAGCC 1980 

TA CRAOAC CA GTAACCTGG6 AATGAAGATC ATCAAOAATG ACTTGOCCAO C CACOCT C T T 2040 

J TACCTGGAGQ GGG06CTCAC CAG6AGCACC CATTACCACC AATACCAACC GGTTGTCACC 2100 

CTGCAGAAGG GCTACACCAT GCACTGGGAC CAGACGGCCC CCGCOSAACT GGCCATCTGO 2160 

CTCATCAACT TCAACAAGGG COACTGOATC CGAGTGGGGC TCTSCTACCC GCGAGGCACC 2220 

ACATTCTCCA TCC TCTOGG A TGTTCACAAT 06CCT6CTGA AGCAAAOGTC CAAGACXSOGC 2260 

^ _ GTCTTOGTQA GQAOCTTGCA GATGQACAAA OTGOAGGAGA GCTAOCCTGG CAGGAGCCAC 2340 

lU TACTACTGGO AOQAGGACTC A066CTGTTG TTCCTQAAGC TGAAAGCTCA GAACGAGAGA 2400 

6AGAAGTTT0 CTTTCTGCTC CATGAAAGGC TGT6AGAGGA TAAAOATTAA AGCTCTQATT 2460 

CCAAAOAACXS CAGGGGTCAG TGACTGCACA GCX3VCAGCTT ACCCCAAGTT CACXGAOAGa 2520 

GCTOTOGTAG ACGT6CCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

1 C C A TT TC W 'OG AGGTGAAGAT GGAGAGTTOC AA GCAQC ACT TCTTOCAOCT CTGOAAOQAC 2640 

15 TTOGCTTACA TTGAAOTGGA TGGGAAQAAO TAC0CCA6TT 0G6AGGATG0 CATCCAGOTG 2700 

CSTGOTGATTG ACGGGAACCA AGGOCXSCGTO GTQAaCCACA GGAGCTTGA6 GAACTOCATT 2760 

CTGCAAGGCA TACCATG6CA GCTTTTCAAC TATGTGGCQA CCATCCCTGA CAATTCCATA 2820 

6T6CTTATGG CATCAAAGGG AAGATA06TC TCCAGAaOCC CATGQACCAO AGTGCTGQAA 2880 

AAOCTTGGGa CAOACAGQGG TCTCAAOTTO AAAaAOOVAA TGQCATTOQT T6GCTTCAAA 2940 

20 06CAGCTTCC GGCC CA TCTG GGTGACACT6 GACACTGAGQ ATCACAAAGC CAAAATCTTC 3000 

CAAGTT0T6C CCATCCCTGT GGTGAA6AA6 AAGAAGTTGT GA6GACAGCT QCOQCCOGGT 3060 

GCCACCTCGT GGTAGACTAT GACXMTGACT CTTGGCAGCA GACCAQTGGO GGATGGCTGQ 3120 

GTGCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCOGTO TTTCAGCXCT 6AT6GGCCAA 3180 

GGQAAGGCTA TCAGAGAOCC TGGTGCTGCC ACCTGCOOCT ACTCAAGT6T CTACCTGGAG 3240 

25 CCCCTGGG6C GGTGCTGGCC AAT6CTGQAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 

TT CT CrCCT A TCTQTOCCTC TTCAQTQOGO OTTTGOOGAC CATATCAGGA GACCTGGGTT 3360 

GTGCTGACAG CAAAGATCCA CTTTGGCaGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 

GGGCTGOTCA TTCACAGATC CXXIATGGTCT TCAOCAGACA AGTGAGGQTO OTAAATQTAO 3480 

GAGAAA6AGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAQAGCC AAGCTCACAG 3540 

30 GATTAGOAGC TGGGOTAGAA CTGGCTATCC TTGGGOAAGA GGCAAGCCCT GCCTCTGGCC 3600 

GTGTCCACCT TTCAGGAGAC TTTQAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 

AGGCCCTTTT AGTTCTGAGA TTCCAOAAAT CTGCTGCATT TCACATGOTA QCTQGMiCCC 3720 

AACAQTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGOGC ACACGGQATO 3780 

GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAQ 3840 

35 OTCCATGTGC ACT6CAATGC CAGGTGGAGA AATCACAGAO AGGTAAAATQ GAGGCCAGTO 3900 

CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTT6CTTACA 6GAATGAAGG CTGGGGGCAT 3960 

TTTGCTGGGG GGAGATCAGG CAGCCTCTGG AATGGCTCAG GOATTCAGCC CTCXXTTOCCXJ 4020 

CTGCCTGCTG AAGCTGGT6A CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATOA GCACAQAGGA 4140 

40 ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGO TQAAGTCCAC AGAAGTGAGC 4200 

TOCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT QAGCACAGGQ GQCCTCCAGQ 4260 

AGACCCTAGA TGTGCTOGTA CTCCC706GC CTGGQATTTC AOAGCTGGAA ATATAOAAAA 4320 

TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA QATGGQAAAQ 4380 

AACJCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTOCTQ OCACACCACA 4440 

45 TT6CCTCAAC AACOQGCGCC AGAOTGCCCA GGCACTCCTO AGGTAGCTTC TGOAAATGGG 4500 

GACAAGTGOC CTGGAAGQAA AGQAAATOAC TAGAOTAQAA TQACAGCIAG CAGATCTCTT 4560 

CCCtCCTGCT CCGAOOQCAC AC3M0C0QC OCTCCC C TT Q GTGTTGGOGG T O CCTOT GG C 4620 

CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 

^ TGGTGCTACC TGQCTCTCCT GTCTCTQCAG CTCTACAGGT GAGGCCCAGC AGAGG6A6TA 4740 

50 GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCT ATTG 4800 

GGTCCACOOC A6TCCCTTTC AOCTQCTOCT TAATQCCCTG CTCICTGOCT GOCOCACCTT 4860 

ATAOAOAGCC CAAAGA6CTC CTOTAAOAGG GAGAACTCTA TCTGTGGTTT ATAATCTT6C 4920 

ACXSAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGOCX: 4980 

CAACCACAAA CXCTTTCCTT CAAAGAGGGC CTQCXTrGGCT CCCTCCACCC AACTOCACCC 5040 

55 ATGAGACTO} GTCCAAGAGT CCATTCCCCA 6GTGGGAGCC AACTGTCAiQG GAGGTCTTTC 5100 

OCACXAAACA TCTTTCAOCT GCTOGGAaaT GAOCATAGGO CICTGCTTTT AAAGATATGG 5160 

CTGCTTCAAA G6CCAGA6TC ACA06AAGGA CTTCTTCCAg GGAGATTAOT GGTCATGOAO 5220 

AQQAOAOTTA AAATGACCTC ATGXCCTTCT TOTCCAGGGT TTT O TTGAOT TTTCACTCTT 5280 

^ CTAATGCAAO GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGQ 534 0 

OO AAT6TTGAAT GTCTTTGGCT CAOTTCATTT AAAAAA6ATA TCTATTTGAA AGTTCTGAGA 5400 

OTTGTACATA TGTTTCACAO TACAG6ATCT GTACATAAAA GTTTCTTTOC TAAACCATTC 5460 

ACCAAGAGCC AATATCIAGG CATTTTCTTG GTAGCAC3\AA TTTTCTTATT GCTTAGAAAA 5520 
TT Q T C CTCCT TGTTATTTCT GTTTQTAAGA CTTAAGTQAO TTAGGTCTTT AAGGAAAGCA 5580 
ACQCTCCTCT QAAATGCTTQ TCTTTTTTCT GTTGCOQAAA TAGCTGGTCC TTTTTOGGGA 5640 
65 QTTA6ATCTA TAQAGTQTTT GTATOTAAAC ATTTCTTQTA GGGATCACCA TQAACAAAGA 5700 

TATATTTTCT ATTTATTTAT TATATGT6CA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 
AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 

70 Seq ID NOt 140 Protein aequeoce 
Protein Accession #i XF_051860.2 

1 11 21 31 41 51 

I I I 1 II 

/D KDGVNLSTEV VYKKGQDYRP ACYDRGRACR SYRVRFLCGK PVRPKLTVTX OTNVNSTIIjN 60 

LBDHVQSHKP GDTLVXASTD ySMYQASEFQ VLPCRSCAPN QVKVAGRPMY IBIGEBIDGV 120 

DKRABVG&L9 RNIZVMGEMB DKOrPYRHUZ GNPFDPDTFQ GBIXFALOFK AAHIiBGTELK 180 

HMQQQLVGQY PIHFHLAGDV DERGGYDPPT YIPDLSZHHT P6RCVTVRGS KGLLIKDWG 240 

YNSLGHCPPT EDGPBERNTF OHCIiGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 300 

80 DOIAVSTPWM AKPHHNLIKC AAAGSEBTGF WFIFHHVPTG PSVGMYSFGY SEHIPLGKFY 360 

MNRAKSNYRA GMIIDNGVXT TBASAKDKRP PLSZZSARYS PHQCADPXiKP RBPAIIRBFZ 420 

AYKHQDHGAW LRGGOVHZJ^S CRPADNGIGL TLASOGTFPY DDGSKQBXKN 8Z*FVQBSQNV 480 

GTBKMDNRZW GPGOLDKSQIt TZfPIGQNFPI RGZQLYDGPZ NZQNCTPRXP VALBGRBTSA 540 

XiAFIUjlQIAHQ 8CPKIQIVTQI APBDVPZTSR VPFGEPOPITP KQU3MDGDKT 8VPHDVD0SV 600 
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SBYPGSYLTK NDNWLVRHPD CINVPDHRQA ICSGCyAQMY IQAYKTSMLR MKIIKNDPPS 660 

HPLYLEGALT RSTHYQQYQP WTLQKOYTI HWDQTAPABL AIKLIMPNKO DWIRVGLCYP 720 

ROTTPSILSD VHNRLLKQTS KTGVFVRTIiQ MDKVBQSVPO RSHVYHDEDS GU^KLKAQ 780 

NBREKFAPCS MKCCERIKIK ALIPKMAGVS DCTATAYPKP TBRAWDVPM PKKLPGSQLK 840 

TKDHFLEVKM BSSKQHPPHL WNDPAYIBVD GKKYPSSBDG IQNAWIDGNQ GRWSHTSPR 900 

NSIWIPWQ LFNYVATIPD KSIVLKASKG RWSROPWTR VLEKLGADRG LKLKEQMAFV 960 
GPXGSFRPIH VTIiDTmHXA KIFQWPIFV VKXKXL 

Seq ID NO: 141 DMA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence! 261.. 2861 



GAGCTAGOGC TCAAGCAGAG CCCAGCGOGO TGCTATOQGA CAGAGCCTGO CGAOOSOUIG 60 

CGGC3QCGGGG AGCCAGOGGO GCTGAQ0G03 GOCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTAOCACT COGCTTGCOC A0GG0C06GG AGCT0GCX3GC GCCTGGOGGT CAGOSACCAG 160 

AOGTCOGGGG COSCTGCGCT CCTGGCXMGC GAGOOGTGAC ACTGTCTOGG CTACAGACCC 240 

AGAGOQAGCA CACTGCCA6G ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCRAGGCCA 300 

TGCTQACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGQ G6CCACATCC ACAGTOOCTC 360 

CTGOTTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAQACC 420 

ACCATQTGCA TATGGGCCAG GGCAAGAGAC TQCTGCTCAC CTCTTCTGCC AOOOTCTATT 460 

CCftTCC ACAT CTCAGAQGGA OGCAAGCTGG TCATTAAAQA CCACQACGAQ CCSSATTGTTT 540 

TOTGAACCOG 6CACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AgTOC C C r CT 600 

GCCX nTTCC A GGGCAATTTC ACCATCATTT TGTATGGAAG G6CTGATGAA OOTATTCAaC 660 

OGGATCCTTA CTATGGTCTO AAQTACATTG GGGTTGGTAA AGQAGGCQCT CTTGAGTTGC 720 

AtGOAGAOAA AAA OCTCTCC TGQACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGO 780 

CAGAAQGAGG CTATTTTTTT OAAACGAGCT GGGGCCACCG TGGAGTTATT GTTCATOTCA 840 

TC3GACCCCAA ATCAGGCACA GTCATCCATT CTOACCGGTT TOACACCTAT AGATCCAAQA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTQA ACGCGGTGCC COATGGCAGG ATCCTTTCTG 960 

rroCaOTQAA TGATGAAGGT TCTOQAAATC TGQATGACAT GGCCAGGAAO GOQATGACCA 1020 

AATTGGQAAG C3ttACACTTC CTQCMXTTQ GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TOAAAGGAAA TCCATCATCT TCAOTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCT GCTGC CXX^GGTATTC AAATTGTTCC AQACAGAGCA TGGC6AATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGQTT CAAGAaSTGG AGTGGACGQA GTGGTTOGAT CATOATAAAG 1260 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACXTTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCA ATCG TCGCATTOAT ATACAGGCCA CTACamTGOA TGGAGTTAAC CTCAGCACXX3 1380 

AGGTTGTCTA CAAAAAAGGC CAGQATTATA GGTTTGCTTG CTAOGACOSG GGCAGAGCXTT 1440 

GCX»GAGCTA CCOTGTAOGG TTCCTCTGTO GQAAGCCTGT GAGGCX3CAAA CTCACAOTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GQATAATGTA CAGTCATGGA 1560 

AACCTQGAOA T AOOCTG GTC ATTGCX»OTA CTQATTACTC CATGTACCAG GCAGAAGAGT 1620 

TOCAOQTGCT TCOCTOCAGA TCCTGQGCCC CCAACCAGGT CAAA0T6GCA GGGAAACCAA 1680 

TGTACCTOCA CATOGGGGAG GAGATAQAOO OOQTGOACAT GOGGGCJGGAG GTTCGQCTTC 1740 

TGAGCCGGAA CATCATAGTQ ATGGGGQAGA TQGAGOACAA ATGCTAOCCC TACAGAAAOC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGOGGGCCA CATCAAOTTT OCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGQACAGC?«3 CTGGIGGGTC 1920 

AGTACCOGAT TCACTTCCAC CTQGCOGGTG ATGTAGACGA AAGOGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TOGCTGOGTC ACaGTCCATG 2040 

QCrCCAATGQ CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTOGGC CACTOCTTCT 2100 

TCACGQAAGA TGGGCCGGAQ GAACX5CAACA CTTTTGACCA CT OTCr i GGC CTCCTTGTCA 2160 

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGQ 2220 

ACTCCTACCC AGGGT ACATC CCCAAGCCCA GGCAAGACT6 CAATQCTGTG TCCACXTTTCT 2280 

GG ATGG CCAA TCCCAAC3^C AACCTCATCA ACTOTOCOGC TGCAGGATCT GAGGAAACTO 2340 

GATTTTGGTT TATTTTTCAC CAOQTACCAA CGOGOCOCTC OOtOaOAATO TACTCCCOUS 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA OOSAGCACAT TCCAACTACC 2460 

GG GCTGG CAT GATCATAOAC AACGGAGTCA AAACCRCCGA GOCCTCTGCC AAGGACAAGC 2520 

GOCOOTTOCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGAOGCC GACCOGCTGA 2580 

ASCO0OGG6A GCOGGCCATC ATCAGACACT TCATTGCXTTA CAAGAACCAG GACX3VCGGGG 2640 

CCTGGCTGOG OGGCGGGGAT OTGTGQCTQO ACAGCTGCCA TTTCAOAGGO OAGGCTCAGG 2700 

AAOGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT 6CTGG6GGGA OATOAGGCAQ 2760 

OCTCTGGAAT GGCT CAGGGA TTCAGOCCTC CCTGCC3GCTG CCTQCTQAAG CTGGTGACTA 2820 

CX3GGGTO0CC CT TTGC TCAC GTCTCTCTGG CCCACTCATG ATGGAGAAOT GTGGTCAGAG 2880 

GQQAGC3UVTG GGCTTTGCTQ CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTGA AGTCCAC3USA AGTQAGCTCC TGCCTTAGGO CCTCATTTOC 3000 

TCTTCA TCCA GGGAACTOAG CACAGGGGGC CTOCAGGAGA OCCTAOATGT OCTOSTACTC 3060 

CCTCJGGCCTQ GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120 

TRACRGATGG GQAAAOTGAG CCCCCAAGAT GQGAAAQAAC CACAOVOCTA AGGQAGGGCC 3180 

TGGGGAGCCC CACCCTAGCC CTTGCTGOCA CACCACATTO CCTCAACftAC OGGCCCCAQA 3240 

GTGCCCAGGC ACTCCTQAGG TAGCTTCT6G AAATQGGGAC AA0TCXX5CTC GAAGaAAAOG 3300 

AAATGACTAG AGTAGAATGA CAGCTAGCAO ATCTCTTCCC TCCTGCTCCC AGOGCACACA 3360 

AACCCOCCCT CCCCTTGGTQ TTCGCXIGrCC CTGTQGCCTT CACTTTGTTC ACTACCTCTC 3420 

A6CCCAGCCT GGGTGCACAO TAGCTGCAAC TCCXX:ATTGG TGCTACCTCG CTCTCCTGTC 3460 

TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAOTAGGG CTCGCCATGT TXCTGCJXGAO 3540 

CCAATTTGGC TQATCTTGGO TGTCTGAACA GCTATTGOGT OCACOCCAflT OCCTTTCAGC 3600 

TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA OAQAOCCCAA AOAGCTCCTG 3660 

TAAGAGGQAO AACTCTATCT GTX3GTTTATA ATCTTGCAOG AGGCACCAGA OTCTCCCTGG 3720 

CTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780 

AGAGGGCCTO CCTGGCrCCC TCCACXX»AC TGCACCCATG AQACTOGGTC CAAGAOTCCA 3840 

TTCCCCAGGT GGQAGCCAAC TGTCAGGGAfl G I CTTTCOCA CCAAACATCT TTCAGCTOCT 3900 

GGQAGOTGAC CATAGQGCTC TGCmTAAA GATATG6CT6 CTTCAAAGQC CAGA6TCACA 3960 

gOftftCGAC TT CTT CCAGGQA GATTAGTGGT GATGGAGAGO AQAOTTAAAA TGACCTCATG 4020 

TCCITCTTGT COVOMTTTT OTTOAOTTTT CACTCTTCTA ATGCAAGGOT CTCACACTGT 4080 

GAAOtawrrTA GOATGTGATC ACTTTCAGGT GGCCAOGAAT GTTGAATGTC TTTQGCTCAO 4140 

TTCATTTAAA AAAGATATCT ATTTQAAAGT TCTCAGAGTT GTACATATGT TTCACAGTAC 4200 
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PCTAJS02/36810 



AGG ATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 4260 

TTTCTTOGTA GCACAAATTT TCT TATT GCf TAOAAAATTO ItX TC L T i m ' TATTTCTGTT 4320 

TGTAA GACTT AAOTOAOTTA GGTCTTTAAG GAAAGCAA06 CTCCTCTSAA ATGCTTOTCT 4380 

5TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTaSGQAGTT AGATGTATAG AGTGTTTGTA 4440 

TGTAAACATT TCTTGTAGGC ATCACCATQA ACAAAQATAT ATTTTCTATT TATTTATTAT 4500 

ATGTGCACrr CAAGAAOTCA CTGTCACAGA AATAAAOAAT TGTCTTAAAT GTCATQATTO 4560 

GAGATGTCCT TTGCATTGCT TGGAAGGQGT OTACCTAOAG CCMGGAAAT TOGCTCTGGT 4620 

TTGCSAAAAAT TTTGCTGTTA TTATAQTAAA CATAC3kAA6S ATGTCAAAAA AAAAAAAAAA 4680 
AAAAAAAAAA AAAAAAAAAA AA 



10 
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Seq ID NO: 142 Protein sequence 
Protein Accession (ft Bos sequence 



,C I " 21 31 41 51 

HGAAGRQDFL FKAMLTISHIi TLTCFPGATS TVAASCPDQS PELQPWNPGH DQDHHVHIGQ 60 

GKTXtLLTSSA TVYSXBISEG GKLVZKDHDB PIVLRTKBIli ZDNGGELHAO 8ALCPFQ6NF 120 

TIILYGRADB GIQPDPYYGL KYIGVGKGGA LELHGQXKLS MTFLNKTLHP GGKAEGGYFP 180 

ERSWGHRGVI VHVrDPKSGT VIHSDRPDTY R8KKBSBRLV QYUIAVPDGR ILSVAVNDBG 240 

ZU SRNLDDMARK AMTKLG9KHP LHLGFWIPMS PLTVKGNPSS SVEDHIEYHG HRGSAAARVP 300 

KLFQTEHGBY PNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 360 

ZQATTMDOVN LSTEWYKKG QDYRFACyDR GRACRSYRVR PL06RPVRPK LTVTIDTNVN 420 

STZUniEDHV QSWKPGDTLV lASTDYSHYQ ABEPQVLPCR SCAPNQVKVA CKPHYIiHIGB 480 

EIDOVDMRAB VGLLSRMIIV MGEMEDKCYP YRNHIOIFPD PDTFGGHIKF ALOFKAAHLB 540 

ZD GTEIiKH>«QQ LVG QYPI HFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGIiLI 600 

KDWGYNSLG HCPFTEDGPE ERNTPDHCLO LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 

PKPRQDCNAV STPWMAIIPNN NLXNCAAAGS EBTOFHFIFH HVPTGPSVGM YSPGYSBHIP 720 
USKFYNNRAH SNYRA01IID MGVKTTBASA XDRRPFLSII SARYSPHQDA DPLXPREPAI 



Seq ID NO: 143 MM DMA sequence 
Nucleic Acid Accession #t MN_006017 
Coding sequence: 38.. 2635 ^ 



780 



ntHFIAYKNQ OHGAHliRGaD WILDSCHFRO BAQEGFLLTG MKAGGIIiLGG DEAASGNAQG 840 
J\) FSPPCRCLLK LVTTGSPPAH VSLAH8 



1 11 21 31 41 51 

I I 1 I I I 

CCAAG TTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCXTTCQTAC TCGGCTCCCT 60 

GTTGCTGCTG GGGCTGTGOG GGAACTCCTT TTCAGQAGGG CAGCCTTCAT CCACAGATQC 120 

4U TCCTAAGOCT TGGAATTATG AATTGCCTGC AACAAATTAT QAGACCCAAG ACTOCCATAA 180 

AGCTG GACCC ATTGGCATTC TCTTTGAACT AGTGCATATC TTTCTCTATG TGGTACAGCX; 240 

GOSTGATTTC OCAGAAGATA CTTTGAGAAA ATTCTTACAO AAGGCATATG AATCCAAAAT 300 

TGATTATQAC AAGCCAGAAA CTGTAATCTT AGGTCTAAAO ATTGTCTACT ATGAAGCAGG 360 

AC GAT TATTCTA TGCTGTQTCC TGGGGCTQCT OTTTATTATT CTGATGCCTC TGOTOGQOT A 420 

*¥J TTTCTTTTOT ATGTQTCGTT GCTGTAACAA ATGTOOTGGA GAAATGCACC AGCOACAGAA 480 

GGAAAATGG6 CCCTTCCTGA GGAAATGCTT TGCAATCTCC CTGTTGGTGA TTTGTATAAT 540 

AATAAOCATT GGCATCTTCT ATGGTTTTOT GGCAAATCAC CAGQTAAGAA CXXXMATCAA 600 

AAG6AGTCX36 AAACTGGCAO ATAGCAATTT CAAGGACTTG CGAACTCTCT TGAATGAAAC 660 

TCCAGAQCAA ATCAAATATA TATTGOCC7C31 OTACAACACT ACCAAGGACA AG6CGTTCAC 720 

DV AGATCTQAAC AQTATCAATT CAGT6CTAGG AGGOGGAATT CTTGACOGAC TGAGACOCAA 780 

CATCATCCCT GTTCTTQATG AGATTAAQTC CATGGCAACA GCGATCAAGG AGACCAAAGA 840 

GGOQTTGQAa AACATGAACA GCACCTTGAA OAGCTTGCAC CAACAAAGTA CACAGCTTAG 900 

CAGCAGTCTQ ACCAGCOTGA AAACTAGCCT GCX3GTCATCT CTCAATGACC CTCTGTGCTT 960 

G0T0C31TCCA TCAAGTGAAA CCTGCAAC3W3 CATCAGATTG TCTCTAAGCC AGCTGAATAO 1020 

00 CAACCCTGAA CTQAGGCAGC TTCCACCOGT GGATGCAGAA CTTQACAAOG TTAATAA06T 1080 

TCTTAGGACA GATTTGGATG GCXTTGOTCCA ACAGGOCTAT CAATCCCTTA ATGATATACC 1140 

TGACAGAGTA CAACX3CCAAA CCACGACTGT CX3TAGCAG0T ATCAAAAGOG TCTTGAATTC 1200 

CATTG GTTCA GATATCGACA ATGTAACTCA GCGTCITCCT ATTCAGGATA TACTCTCAOC 1260 

^_ ATTCTCTGIT TATGTTAATA ACACTGAAAG TTACATOCAC AGAAATTTAC CTACATTGGA 1320 

AGACTATGAT TCATACTGGT QGCTGGOTGG CCTGGTOITC TGCTCTCT6C T6ACCCTCAT 1380 

CXJTGATTTTT TACTACCTGG GCTTACTGTG TGGCGTGTGC GGCTATGACA GGCATGCCAC 1440 

CCCXSACCAOC OGAGGCT QTG TCTCCAAOVC OGGAGGOGTC TTCCTCATGG TTGQAGTTGG 1500 

ATTAA OTTTC CTCTTTTGCT GGATATTGAT GATCATT6T0 GTTCTTACCT TTGTCTTTGG 1560 

TGCAAATGTG GAAAAACTGA TCTGTGAACC TTACAOGAGC AAGGAATTAT TOOOG G T T TT 1620 

00 GGATACACOC TACTTACTAA ATGAAGACTG GOAATACTAT CTCTCTGGGA A6CTATTTAA 1680 

TAAATCAAAA ATGAAGCTCA CTTTTGAACA AGTTTACAGT GACT6CAAAA AAAATA6AGO 1740 

CACTTAC3GGC ACTCTTCACC TGCAGAACSUS CTTCAATATC AGTGAACATC TCAACATTAA 1800 

TGAGCATACT GGAAGCATAA GCAGTGAATT GGAAAOTCTG AAGQTAAATC TTAATATCTT 1860 

TCTGTTGGGT GCAGCAGGAA GAAAAAAOCT TCAGGATTTT OCTGCTTGTG GAATAOACAG 1920 

/U AATQAATTAT GACAGCTACT TGGCTCAQAC TGGTAAATCC CCCX3CAGGAG TGAATCTTTT 1980 

ATCATTTGCA TATGATCTAG AAGCAAAAGC AAACAGTTTG CCCCCAGGAA ATTTQAG6AA 2040 

CTCCCTGAAA AGAGATGCAC AAACTATTAA AACAATTCAC CAGCAA06AG TCCTTCCTAT 2100 

AGAACAATCA CTGAGCACTC TATACCAAAG OGTCAAGATA CTTCAACGCA CAGGGAATGG 2160 

ATTGTTGGAG AGAGTAACTA GGATTCTAOC TTCTCTGGAT TTTOCTGAOA ACTTCATCAC 2220 

/O AA ACAATA CT TCCTCTGTTA TTATTGAGGA AACTAAOAAO TAT6GGA6AA CAATAATAG6 2280 

ATATTTTGAA CATTATCTGC AGTGGATOGA GTTCTCTATC AGTGAGAAAO TGGCATOGTG 2340 

CAAATCTGTG GC CACCGCTC TAGATACTGC TOTTQATGTC TTTCTGTGTA GCTACATTAT 2400 

OGACCCCTTO AATTTGTTTT GOTTTGGCAT AGGAAAAGCT ACTGTATTTT TACTTC0G6C 2460 

g-. TCTAATTTTT GOGGTAAAAC TQQCTAAGTA CTATCXm3QA ATGGATTOGG AGGAGGTOTA 2520 

OU CGATQATGTT QAAACTATAC CCATGAAAAA TATGGAAAAT GGTAATAATG OTTATCATAA 2S80 

A GATCAT GTA TAIGGXATTC ACAATCCTOT TATGACAAGC CCATCACAAC ATTGATAGCT 2640 

GMGTTGAAA CTGCTTGAGC ATCAGGATAC TCAAAGTOGA AAGOATCACA GATTTTTGGT 2700 

AGTTT CIGGG TCTACAAGGA CTTTCCAAAT CCAGGAGCAA CGCCAGTGOC AACXTTACTGA 2760 

CTCAGGOGGO CACCAAGQCA AOSGCAOCAT TGOTCTCTGO GTAGTGCTTT AAGAATGAAC 2820 
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A CAATCAOOT TATAOTCCAT G6TCCATCAC TATTCAAGGA TGACTCOCTC CCTTCCTQTC 2880 

TAl'l'lTl'GT'f TTTTACTTTT TTACACTGAG TTTCTATTTA GACACTACAA CATATCGGGT 2940 

OTTTGTTOCC ATTGOATGCA TTTCTATCAA AACTCTATCA AATOTQATGQ CTAQATTCTA 3000 

ACATATT6CC ATGTGTQGAO TGTGCTGAAC ACACACCAGT TTACAGGAAA GATGCATTTT 3060 

GTGTACAOTA AACGGTGTAT ATACCTTTTG TTACCACAGA GTTTTTTAAA CAAATQAOTA 3120 

TTATAGQACT TTCTTCTAAA TGA6CTAAAT AAGTCACCAT TGACTTCTTG GTGCTQTTGA 3180 

AAATAATGCA TTTTCACTAA AAGT6TGTGA AACCTACAGC ATATTCTTCA CX5CAQAGATT 3240 

TTCATCTATT ATACTTTATC AAA6ATTGGC CATGTTCOVC TTGGAAATQO CATGCAAAAQ 3300 

CCATCATAGA GAAACCTGCG TAACTCCATC TGACAAATTC AAAAGAQAGA GAGAGATCTT 3360 

GAGAGAGAAA TGCTGTTCGT TCAAAAGTGG AGTTQTTTTA ACAOATQCCA ATTACGGTOT 3420 

ACAOTTTAAC AGAGTTTTCT GTTGCATTAG GATAAACATT AATTGGAGTG CAGCTAACAT 3480 

GABTATCATC AGACTAGTAT CAAGTGTTCT AAAATOAAAT ATQAGAAGAT CCTGTCACAA 3540 

TTCTTAGATC TGGTGTCCAQ C3VTGGATGAA ACCTTTGAGT TTGGTCXXTA AATTTOCATQ 3600 

AAAGCACAAG GTAAATATTC ATTTGCTTCA GGAQTTTCAT GTTGGATCTG TCATTATCAA 3660 

AAGTGATCAG CRATGAAGAA CT6GT0GGAC AAAATTTAAC OTTQATQTAA T66AATTCCA 3720 

GAT6TAG6CA TTCCCCCCAQ OTCTTTTCAT GTGCAGATTG CAGTTCTGAT TCATTTGAAT 3780 
AAAAAGGAAC TTGG 



8eq ID NO: 144 NP Protein sequence 
ZU Protein Accession #t NP 006008.1 



1 11 21 31 41 51 

I I I I I I 

MALVLGSLLL LGL0SHSP66 GQPSSTDAPR AWNYELPATN YBTQDSHKAO PIOXLFELVH 60 

25 ZFLYWQPHD FPEDTLRKPL QKAYBSKIDY DKPETVILGL XIWyEAOZI LGCVLGLLFX 120 

ILMPLVGYPP CMCRCOnCCG GBMHQRQKEN GPPLRKCPAI SLLVICIIIS IGIFYGFVAN 180 

HQVRTRIKRS RKIADSNFKD LRTLLNBTPB QIKYILAQYN TTKDKAPTDL N8INSVLGGG 240 

ILDRLRFNII PVLDEIKSMA TAIKETKEAL OmNSTLKSL HQQSTQLSSS LTSVKT8LRS 300 

SUlDPLCZiVH PSSETQtSIR LSItSQLNSKP BLSQLPPVDA EZiDNVMNVIiR IDUXSLVQQG 360 

30 YQSLNDIPDR VQRQTTTWA OIKRVUJSIG SOIDNVTQRL PZQDILSAFS VYVNHTESYI 420 

HRNLPTLBEY DSYWWLGGLV ICSLLTLIVI FYYLGLLOGV OGYDRHATPT TROCVSNTOO 480 

VPLMVGVQLS PLKMILMII WLTFVPGAN VEKLICEPYT SKELPRVUJT PYLLMEDWEY 540 

YL8GKLFNKS KMXLTPEQVY SDCKKNRGTY GTLHljQMSFN ISEHLNINEH TGSZSSELBS 600 

LKVm^NIFIiL GAAQRKNI^D FAACX3IDRMK YDSYLAQTGK 8PAGVNLLSF AYDIiBAKANS 660 

LPPGNLRNSL KKDAQTZKTI HQQRVLPIBQ SLSTLYQSVK ZLQRTGNQLL ERVTRIIA8L 720 

DFAQNFITNN TSSVIIEETK KYGRTIZ6YF BBYLQWZEP8 Z8EKVASCKP VATALOTAVD 780 

VPLCSYIIDP LNLPHFGIGK AWFLLPALZ PAVKZAKYYR RMDSEDVYDD VETZPMXNMB 840 
NGNNGYKKDH VYGIHNFVMT SPSQH 



Seq ZD NOt 145 DMA sequence 

Nucleic Acid Accession 9« Bos sequence 

Coding sequence: 38.. 2605 



1 11 21 31 41 51 

45 I I I I I I 

CC3UW3TTCTA CCTCATGTTT GGA6GATCTT GCTAGCTATG 6CCCTCGTAC TCXSGCTCCCT 60 

GTTGCTGCTG GGGCTGTGCG GOAACTCCTT TTCAGGAGQG CAGCXITTCAT CCACAGATGC 120 

TCCTAAGGCT TGGAATTATG AATTOCCTGC AACAAATTAT GAGACCCAAG ACTCCCATAA 180 

A0CTG6AGCC ATTGGCATTC TCTTTGAACT AGTGGATATC TTTCTd'ATO TGGTACAGCC 240 

GCX3TGATTTC OCAGAAGATA CTTTGA6AAA ATTCTTACAG AAOGCATATO AATCCAAAAT 300 

TGAT TATGAC ATTGTCTACT ATGAAGCAGO GATTATTCTA TGCTGTGTCC TGGGGCTGCT 360 

GTTTATTATT CTGATGCCTC TGGTGGGGTA TTTCTTTTGT ATOTOTOOTT GCTGTAACAA 420 

A7GTG0TGGA GAAATGCACC AQC6ACAGAA GGAAAAT6GG CCCTTCCT6A GGAAATGCTT 480 

«- TGCAATCTOC CTGTTG6TGA TTTGTATAAT AATAAOCATT GGCATCTTCT AT66TTTT6T 540 

3^ GGCAAATCAC CA6GTAAGAA 0CC3GGATCAA AAGGAGTC6G AAACTGGOUS ATA6CAATTT 600 

CAAGGACTT6 CX5AACTCTCT TOAATGAAAC TCCAGAGC3\A ATCAAATATA TATTGGCCCA 660 

GTACAACACT ACCAAGGACA AGGCXTTTCAC AQATCTGAAC AGTATCAATT CAGTGCTAGG 720 

AG6C3G6AATT CTTGACOGAC TOAOACCCAA CATCATCCCT GTTCTTGATG AGATTAAGTC 760 

GATGGCAACA GGQATCAAGO AGAGCAAAOA GGOGTXGOAG AACATGAACA GC3^CCTTGAA 840 

OAOCTTGCAC CAAOUUkOTA CAaVGCTTAO CAGCAGTCTG ACCAGOGTGA AAACTAGCCT 900 

G06GTCATCT CTCAATGACC CTCTGTGCTT GGTGCATCCA TCAAGTGAAA CCTGCAACAG 960 

CATCA6ATT0 TCTCTAAGCC AGCTGAATAG CAACCCTGAA CTGAG6CAGC TTCCACCXXJT 1020 

GGATGCAGAA CTTGAC3^CX3 TTAATAACX3T TCTTAGGACA GATTTGGATG GCCTGGTCCA 1080 

ACAGGQCTAT CAATCGCTTA ATGATATAOC TGACAOAOTA CAA06CCAAA CCAOGACTGT 1140 

OGTAGCAOGT ATCAAAAGGG TCTTGAATTC CATTGGTTCA 6ATATCQACA ATGTAACTCA 1200 

GCGTCTTCCT ATTC3M3GATA TACTCTCAGC ATTCTCTGTT TATGTTAATA ACACTGAAAG 1260 

TTACATCCAC AGAAATTTAC CTACATTQGA AGAOTATGAT TCATACTGGT GGCTGGGTQO 1320 

CCTGGTCATC TGCTCTCTGC TGACCCTCAT CGTOATTTTT TACTACCTOG OCTTACTQTO 1380 

TGGOOTQTGC QGCIATQACA GGCATQCOC CCGGACCACC C8AG6CIGTG TCTCCAACAC 1440 

OGGAGGGGTC TTCCTCATGG TTOQAOTrGQ ATTAAGTTTC CTCTTTT6CT GGATATT6AT ISOO 

GATCATTOTG GTTCTTACCT TTGTCTTTGQ TGCAAATGTG QAAAAACTGA TCTOTGAACC 1560 

TTACAOGAGC AAGGAATTAT TCCGGGTTTT GGATACACCX: TACTTACTAA ATGAAGACTG 1620 

GGAATACTAT CrCTCTGGGA AGCTATTTAA TAAATCAAAA ATGAAGCTCA CTTTTQAACA 1680 

AGTTTACACrr GACTGCAAAA AAAATAGAOG CACTTACXSGC ACTCTTCACC T6CAGAACA0 1740 

CTTCAATATC A6TGAACATC TCAACATTAA TGAGCATACT GGAAGOO'AA GCAOTGAATT 1800 

GGAAAGTCTG AAGGTAAATC TTAATATCTT TCTGTTGGGT GCAGCAGQAA GAAAAAACCT 1860 

TCAGGATTTT GCTGCTTGTG GAATAGACAG AATQAATTAT QACAGCTACT TGGCTCAGAC 1920 

TGGTAAATCC CCCGCAGGAG TGAATCTTTT ATCATTTGCA TATGATCTAG AAGCAAAAGC 1980 

AAACAOTTTG CCCCCAG6AA ATTTGAGGAA CTOCCTGAAA AGAGATGCAC AAACTATTAA 2040 

AACAATTCAC CAGCAAOQAO TCCTTCCTAT AGAACAATCA CTGAOCACTC TATACXAAAG 2100 

COTCAAGATA CTTCAACGCA CAGGGAATGG ATTGTTGGAG AGAGTAACTA GGATTCTAGC 2160 

TTCTCTGGAT TTTGCTCAGA ACTTCATCAC AAACAATACT TCCTCTGTTA TTATTGAGGA 2220 

AACTAAGAAG TATGGGAGAA CAATAATAOG ATATTTTGAA CATTATCTGC AGTGGATOGA 2280 

OTTCTCTATC A0T6AGAAA6 TGOCATOOTO CAAACCTGTG QCCAOOQCTC TAGATACT6C 2340 
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TGTTOATGTC TTTCTGTGTA GCXACATTAT OQACCCCTTG AATTTGTTTT GGTTTGGCAT 2400 

AGC3AAAA6CT ACTSTATTTT TACTTG006C TCTAATTTTT GCGGTAAAAC TGGCTAAGTA 2460 

CTATOGTOSA ATGGATT066 AGGAOGTGTA OOATGATOTT OAAACTATAC CCATGAAAAA 2520 

TATGGAAAAT GGTAATAATG GTTATCATAA AGATCATGTA TATGGTATTC ACAATCCTQT 2580 

J TAT6ACAAGC CCATCACAAC ATTGATAGCT GATOTTGAAA CTGCrTQAGC ATCAGGATAC 2640 

TOVAAGTGGA AAGGATCACA GATTTTTGGT AGTTTCTQGG TCTACAAGQA CTTTCCAAAT 2700 

CCW56AGCAA OQCCAQTGQC AACX3TAGTGA CTCAGGOGGG CACCAAGGCA ACGGCACCAT 2760 

TGQTCTCTGG GTAGTGCTTT AAQAATQAAC ACAATCACGT TATAOTOCAT GGTCCATCAC 2820 

TATTCAAOOA TGACTCCCTC CCTTCCTGTC TATTTTTGTT TTTTACTTTT TTACACTGAQ 28B0 

iU TTTCTATTTA GACACTACAA CATATGGGGT QTTTGTTCCC ATTGQATGCA TTTCTATCAA 2940 

AACTCTATCA AATGTGATGG CTAGATTCTA ACATATTQCC ATGTQTGGAG TOTGCTGAAC 3000 

ACACACCAOT TTACAGGA AA QATGCATTTT GTGTACAGTA AACGGTGTAT ATACCTTTTG 3060 

TTACCACAGA GTTTTTTAAA CAAATGAOTA TTATAC3QACT TTCTTCTAAA T6A6CTAAAT 3120 

AAGTCACCAT TGACTTCTTQ QTGCTGTTGA AAATAATCCA TTTTCACTAA AAGTGTGTGA 3180 

ID AACCTACAGC ATATTCITCA OQCAGAGATT TTCATCTATT ATACTTTATC AAAGATTGGC 3240 

CATGTTCCAC TTGGAAATGG CATGCAAAAG CCATCATAGA GAAACCTQCO TAACTCCATC 3300 

TGACAAATTC AAAAGAGAGA OAQAGATCTT GAGAGAGAAA TGCTGTTCGT TCAAAAGTGG 3360 

AOTTGTTTTA ACAQATGCCA ATTACX3GTGT ACAGTTTAAC AaAGTTTTCT GTTGCATTAG 3420 

GATAAACATT AATTGQAGTG CAGCTAACAT GAGTATCATC AGACTAGTAT CAAOTGTTCT 3480 

Z\) AAAATGAAAT ATQAGAAGAT CCTGTCACAA TTCTTAOATC TGGTGTCCA6 CATGQATGAA 3540 

ACCTTTGAGT TTGOTCCCTA AATTTGCATG AAAGCACAAG GTAAATATTC ATTTGCTTCA 3600 

GGAGTTTCAT GTTGGATCTG TCATTATCAA AAGTGATCAQ CAATGAAQAA CTGGTC6GAC 3660 

AAAATTTAAC GTTGATGTAA TGGAATTCCA GATGTAGGCA TTCCCCCCAG GTCTTTTCAT 3720 
GTGCAGATTG CAGTTCTGAT TCATTTGAAT AAAAAG6AAC TTGG 



25 



50 



Seq ZD HO: 146 Protein sequence 
Protein Accession #t 



OA ^ 11 . 21 31 41 51 

30 I I I I I I 

MALVLGSLLL LGLCGKSPSG GOPSSTDAPK AWNYELPATN YETQDSHKAG PIGIIiPELVH 60 

IFIiYWQPRD FPEDTLRKPL QKAYESKIDY DIVYYBAQII LCCVLGLLPI ILMPLVGYPF 120 

CKCKCCNKCG GEMHQRQKEN GPPLRKCFAI SLLVICIIIS IGIFYGFVAN HQVRTRIKRS 180 

RKLADSNPKD LRTLUIETPE QIKYILAQYN TTKDKAPTOL NSZNSVLGGO ILDRLRPNIX 240 

jD PVLDEIKSMA TAIKETKBAIi ENMNSTLKSL HQQSTQIiSSS LT8VKT8UIS SLNDPLCLVH 300 

PSS ETQJS XR LSLSQLMSNP ELRQLPPVDA BLDNVNNVLR TDLDGLVQQG YQSLNDIPDR 360 

VQRQTTTWA GIKRVLNSIG SDIDKVTQRL PIQDILSAFS VYVNNTESYI HRNLPTLEBY 420 

DSYWHUSGLV XCSLLTLIVI FYYIiGLLCGV OGYDRHATPT TRGCVSNTGG VFLMVGVQL8 480 

FIJCHILMII WLTFVPGAN VBKLICEPYT SKBLPRVLDT PYLtMEDWBY YLSGKIJWKS 540 

4U KMKLTFEQVY SDCKKNRGTY GTLHLQNSFN ISERUIINEH TGSXSSEIfS UCVNUaFLL 600 

GAAGRKNLQD P7UVCX3IDRMN YDSYLAQTGK SPAGVNIiLSP AYDLEAKANS LPPCaiLRNSL 660 

KRDAQTIKTI HQQRVLPIEQ SLSTLYQSVK XLQRTGMGLL BRVTRILASL DFAQNPITNN 720 

TSSVXXEBTK KYGRTIIGYP EHYIjQWXEPS ISEKVASCKP VATALDTAVD VFLCSYIIDP 780 

LNLPHFaiOK ATVFLLPALI FAVKXiAKyyR RMDSEDVYDD VBTIPMXNKB HGNMQYHXDH 840 

4!) VYGXHNPVHT SPSQB 



Seq XD NOs 147 DMA aequence 

Nucleic Acid Accession #: BC012 089.1 

Coding sequence I 23 6.. 2 806 



1 11 21 31 41 51 

I I 1 i I I 

ATGCTCTCAG CTCTCCCGCC GCGGGATGGT OCXTTTGAQTO AATGAOCCCC TTGGAOAACA 60 

TTCTTCXX3CA TCCCTOGCCT GAAOOCAGCX: TCAGACAGAA AACTGAAGAT TCAGCAGATC 120 

J J CAGTGCTTCC TGCTCCTCTT CXGOOCS^GGA AGAOQCTTGC CTTOCCCAAO QCTTCCAGAA 180 

GCTCTGAGGC AGGAGGCACC AAGTTCTACC TCATGTTTGG AGQATCTTGC TAGCTATGOC 240 

CCTCGTACTC GGCTCXrCTGT TGCTGCTGGG OCTGTGCGGQ AACTCCTTTT CAGGAGGGCA 300 

GCCTTCATCC ACAGATGCTC CTAAGGCTT6 GAATTATGAA TTGCCTGCAA CAAATTATGA 360 

GACCCAAGAC TCCCATAAAO CTGGAOCCAT TGGCATTCTC TTTGAACTAG TGCATATCTT 420 

OU TCTCTATQTO GTACA600GC GTGATTTOCX; AGAA6ATACT TXOAGAAAAT TCTXACAGAA 480 

GGCaTATQAA .TCCAAAATTG ATTATGACAA 6ATTGTCTAC TATGAAGCAG GQATTATTCT 540 

ATGCTGTGTC CTGGGGCTGC TGTTTATTAT TCTGATGCCT CTGGTGOGQT ATTTCTTTTG 600 

TATGTGTCX3T TGCTGTAACA AATGTGGTGG AGAAATGCAC CA6G6ACAGA AG6AAAATGG 660 

GCCCTTCCTG AGGAAATGCT TTGCAATCTC CCTGTTGGTG ATTTOTATAA TAATAAGCAT 720 

Oj TGGCATCTTC TATGGTTTTQ TGGCAAATCA CCAGGTAAGA ACCCGGATCA AAAGGAOTOO 780 

GAAACTGGCA QATAGCaATT TCAAGGACTT GOQAACTCTC rPOAATGAAA CTCCAGAGCA 840 

AATCAAATAT ATATTGGCCC AGTACAACAC TACX»AGGAC AAGG06TTCA CAQATCTQAA 900 

CRGTATCAAT TCAGTGCTAO QAGGCOGAAT TCTTGACCGA CTQAGACCCA ACATCATCCC 960 

TGTTCTTGAT GAGATTAAGT CCATG6CAAC A0G6ATCAAG GAGACCAAAQ AOOGGTTGGA 1020 

iK) GAACATGAAC AOCAOCTTGA A6A6CTTGCA CCAACAAAGT ACAGAGCTTA GCAGCAGTCT 1080 

QACXAGCGTG AAAACTAGOC T606GTCATC TCTCAATGAC CCTCTOTOCT TGGTGCATCC 1140 

ATCAAGTGAA ACCTGCAACA QCATCAOATT QTCTCTAAGC CAGCTGAATA GCAACCCTGA 1200 

ACTGAGGCAG CTTCCACCOG TGGATGCAGA ACTTGACAAC GTTAATAA06 TTCTTAGGAC 1260 

A6ATT7GQAT GGCCXOQTOC AACAGGGCTA TCAATCCCTT AAT6ATATAC CTOACAGAGT 1320 

fj ACAACGCCAA ACCAOQACTG TOGTASCAGG TATGAAAAGG GTCTTGAATT CCATTGGTTC 1380 

AGATATOGAC AATGTAACTC AGOGTCTTCC TATTCAGQAT ATACTCTCAG CATTCTCTGT 1440 

TTATGTTAAT AACACTGAAA GTTACATCCA CAGAAATTTA CCTACATTGG AAQAGXATGA 1500 

TTCATACTGG TGGCTGGGTG GCCTGGT,CAT CTGCTCTCTG CTGACCCTCA TOGTGATTTT 1560 

TTACTACCTQ OOCTTACTGT GTGGOGTGTG 0G6CTATGAC AGGCATQCCA CCOGGACGAC 1620 

OU CCGAGGCTGT GTCTCCAACA OOGGAGGOQT CTTOCTCATG OTTGQftGTTQ 6ATTAA6TTT 1680 

CCTCTTTTQC TOaATATTQA TQATCATTGT GGTTCTTACC TTTGTCTTTG OTGCAAATGT 1740 

GOAAAAACTG ATCTGTGAAC CTTACAOGAG CAAGGAATTA TTCOGGGTTT TGGATACACC 1800 

CTACTTACTA AATGAA6ACT GGGAATACTA TCTCTCTGGG AAGCTATTTA ATAAATCAAA 1660 

AAT6AA6CTC ACTTTTGAAC AAGTTTACAG TGACTGCAAA AAAAATAGAG GCACXTACGG 1920 
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CACTCTTCAC CTGCAGAACA GCTTCAATAT CAGTGAACAT CTCAACATTA ATGAQCATAC 1980 

TGGAAGCATA AGCAGTGAAT TGGAAAGTCT GAAGGTAAAT CTTAATATCT Tl'CTG'rT GGG 2040 

T6CAGCAGGA AQAAAAAACC TTCAGQATTT TGCtGCTTOT GGAATAGACA GAATGAATTA 2100 

TQACA6CTAC TT6GCTCAGA CTOQTA AATC COOOQCAGGA GTGAATCTTT TATCATTT6C 2160 

D ATAT6ATCTA GAA6CAAAAG CAAACAGTTT GCCCCCAG6A AATTTGA66A ACTCCCTGAA 2220 

AAGAGATGCA CAAACTATTA AAACAATTCA CCAGCAACGA QTCCTTCCTA TAGAACAATC 2280 

ACTGAGCACT CTATACCAAA GCGTCAAGAT ACTTCAAOGC ACAGGGAATO GATTCTTGGA 2340 

GAO AGTAA CT AGGATTCTA6 CTTCTCTGGA TTTTGCTCAG AACTTCATCA CAAACAATAC 2400 

TTCCTCTQTT ATTATTQAGO AAACTAAQAA GTATGGGAGA ACAATAATAO GATATTTTGA 24fi0 

iU AGATTATCTY3 CAGTGQATOB A6TTCTCTAT CAOTGAGAAA GTGOCATOOT GCAAACCTGT 2520 

GGCCACGGCT CTAGATACTG CTGTTGATGT CTTTCTGT6T AGCTACATTA TCGACCCCTT 2580 

GAATTTGTTT TGOTTTQGCA TAGQAAAAGC TACTOTATTT TTACTTCOGG CTCTAATTTT 2640 

TG06GTAAAA CTGGCTAAGT ACTAT06T0G AATGGATTCO GAGGA06TGT A08AT6ATGT 2700 

TGAAACTATA CCCATQAAAA ATATGGAAAA TOGTAATAAT GOTTATCATA AAGA7CATGT 2760 

ATAT6GTATT CACAATCCT6 TTAT6ACAAG CCCATCACAA CATTGATAGC TQATGTTGAA 2820 

ACTGCTTGAO CATCAGOATA CTCAAAQTGQ AAAG6ATCAC AGATTTTTGG TAGTTTCTGQ 2880 

GTCTACAAGG ACTTTCCAAA TCCAGGAGCA ACG(XAQTQQ CAACQTAOTG ACTCAGGOGG 2940 

GCACCAAGGC AACGGCACCA TTGGTCTCTG GGTA6T6CTT TAAGAATGAA CACAATCAGG 3000 

TTATRG TCCA 7GGTCCATCA CTATTCAAGO ATGACTCCCT CCCTTCCTqT CTATTTTTGT 3060 

TTTTTACTTT TTTACACT6A GTTTCTATTT AGACACTACA ACATATGGGG TOTT TO TTCC 3120 

CATTGQATGC ATTTCTATCA AAACTCTATC AAATOTGATG GCTAGATTCT AACATATTGC 3180 

CATGTGTGGA GTGTGCTGAA CACACACCAG TTTACRGGAA AGATGCATTT TOTOTACAGT 3240 

AAAOGGTOTA TATACCTTTT GTTACCACAG AGTTTTTTAA ACAAATGAGT ATTATAGGAC 3300 

TTTCTTCTAA ATGA6CTAAA TAAGTCACCA TTQACTTCTT 68TGCIGTT0 AAAATAATCC 3360 

ATTTTGACTA AAAGTGrTGTG AAAOCIACAO CATATTCTTC AGGCAGAGAT TTTCATCTAT 3420 

TATACTTTAT CAAAQATTGG CCATOTTCCA CTTGGAAATQ GCATGCAAAA GCCATCATAG 3480 

AOAAACCTGC GTAACTCCAT CTGACAAATT CAAAAGAGAO AGAGAGATCT TGAGAGAGAA 3540 

ATGCTGTTOQ TTCAAAAGTO GAOTrGTTTT AACAGATGGC AATTAOGGTG TACAGTT7AA 3600 

GAQAGTTTTC TGTT6CATTA G6ATAAACAT TAATTGGAGT GCAGCTAACA TGAGTATCAT 3660 

CA6ACTA0TA TCAAGTGTTC TAAAATGAAA TATGA6AAGA TCCTGTCACA ATTCTTAGAT 3720 

CTGGTGTCCA GCAT6GATGA AACCTTTGAG TTTGGTCCCT AAATTTGCAT GAAAGCACAA 3780 

GGTAAATATT CATTTGCTTC AGGAQTTTCA TGTTGGATCT GTCATTATCA AAAGTGATCA 3840 

GCAATGAAGA ACTGGTCGGA CAAAATTTAA CGXTGATGTA ATGGAATTCC AGATGTAGGC 3900 

ATTCCOXCA GGTCTTTTCA TOTGCAGATT GCAGTTCTOA TTCATTTGAA TAAAAAGQAA 3960 
CTTGSAAAAA AAAAAAAAAA AAA 
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Seq ID NO: 148 Protein sequence 
Protein Accession #t AAH12089,1 

1 11 21 31 41 51 

11)111 

KALVLGSIiLL LGLCGNSFSQ GQPSSTDAPK AVTHYELPATN YBTQDSHKAG PIGILFELVH 60 

IPLYWQPRD FPBDTIiRKFL QKAYESKIDY DKIVYYBAGI ILCCVLGLLF IILMPLVGYP 120 

FCKCRCOIXC GGEHKQRQKB NGPFUIRCFA ZSUiVICIII SIGIFYGFVA NHQVRTRIKR 180 

SRKLAD8NFK DLRTLLNETP EQIKYILAQY NTTKDKAFTD LNSINSVLGG GILDRLRFNZ 240 

IPVLDEIKSM ATAIKETKEA LENMNSTLKS LHQQSTQLSS 6LTSVKTSLR SSUJDPLCLV 300 

HPSSETCNSI RLSLSQLNSN PELRQLPPVD AELDNVNNVL RTDLDGLVQQ GYQSIiNDIPD 360 

RVQRQTTTW AGZKRVLNSI GSDIDNVTQR LPIQDIL8AF SVYVNNTESY IKRKLPTLEE 420 

YDSYVMLGOL VZCSZiLTIiZV IFmAUiCQ VGGYDRHATP TTRGCVSNTG GVFLMVGVGL 480 

SFZiFCHZLMZ XWLTFVFGA NVEKLICBPY TSKELFRVIJ} TPYLtiNEDWE YYLSGXLFNR 540 

SKMKLTPEQV YSIX3CKNR0T YGTIiHIiQNSF NISEHLNINE HTGSISSBLB SLKVNLMIPL 600 

LGAAGRKNLQ DFAACGIDRM NYDSYIAQTG KSPAGVKLLS FAYDLEAKAN SLPPGNLRNS 660 

ZiKRDAQTZKT ZHQQRVLFZE QSLSTLYQSV KXLQRTGKGL LERVTRZLAS LDFAQNFZTN 720 

NTSSVXXEBT KXYGRTXZGY FEHYLQHZBP SZSEKVASCK PVATALDTAV DVFLCSYZID 780 

PLNLFHF6XG KATVFIiLPAL ZFAVKLAKYY RBMD8EDVYD DVBTIPKKNM EKGNN6YHKD 840 
HVYGIHNPVM TSPSQH 

Seq ZD NO: 149 DMA sequence 

Nbcleic Acid Accession Si NM_033049.1 

Coding sequencei 28.. 1566 " 

1 11 21 31 41 51 

I I I I I I 

0CAGQG8TCC GAGCAAfiAAC AOCTAAAATO AAAGCCATCA TTCATCTTAC TCTTCTTGCT 60 

CTCCTTTCTO TAAACACAOC CACCAACCAA GGCAACTCAO CTGATGCTGT AACAACXIACA 120 

GAAACTGOGA CTAGTGGTCC TACAGTAGCT GCAGCTGATA CCACTGAAAC TAATTTCCCT 180 

GAAACTQCTA GCACCACAOC AAATACACCT TCTTTCCCAA CAGCTACTTC ACCTGCTCCC 240 

CCC3VTAATTA GTACACATAG TTCCTCCACA ATTCCTACAC CTGCTCOCCC CATAATTAGT 300 

ACAGATAOTT CCTCCACAAT TCCTATACCT ACTGCTGCAO ACAGTGAGTC AACXACAAAT 360 

GTAAATTCAT TAGCTACCTC TGACATAATC ACOQCTTCAT CTCCAAATGA TGGATTAATC 420 

ACAATGGTTC CTTCTGAAAC ACAAA6TAAC AATGAAATGT CCCCCACCAC AGAAGACAAT 480 

CAATCATCAG GGCCTCCCAC TGGCACCGCT TTATTGGAGA CCAGCACCCT AAACAGCACA 540 

GGTCCCAGCA ATCCTTGCCA AGATGATCCC TGTGCAQATA ATTCGTTATG TQTTAAGCTG 600 

CATAATACAA GTmTGCCT OTGTTTAGAA G6GTATTACT ACAACTCTTC TACATGTAAG 660 

AAAGGAAAGG TATTCCCTGO GAAGATTTCA GTGACAOTAT CAGAAACATT TGACCCAGAA 720 

GAGAAACATT CCATGGCCTA TCAAGACTTG CATAGTQAAA TTACTAGCTT GTTTAAAQAT 780 

GTATTTGGCA CATCTGTTTA TGGACAGACT GTAAT7CTTA CTGTAAGCAC ATCTCTGTCA 840 

CCAAGATCTG AAATGOGTGC TGATGACAAG TTTGTTAATG TAACAATAGT AACAATTTTG 900 

GCAOAAACCA GAAOTQACAA TGAOAAGACT GTGACTGA0A AAATTAATAA AGCAATTAGA 960 

AGTAGCrCAA GCAACTTTCT AAACTATGAT TTGACCCTTC 60TX3T6ATTA TTATGGCT6T 1020 

AACCAGACrO CGQATGACTO CCTCAATGGT TTAGCATOOG ATTOCAAATC TGACCTGCAA 1080 

AGGCXn'AACC CACAGAGCCC TTTCIX 3 C G TT GCTTCCAGTC TCAAGTGTCC TGATQCCTGC 1140 

AACXKZACAGC ACAAGCAATG CTTAATAAAG AAGAGTGGTG GGGCCOCTGA G TGT G OGTGC 1200 

GTGCC0G6CT ACCAG6AAGA TOCTAATGQO. AACTGCCAAA A6TGT0CATT TOGCTACAGT 1260 
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GGACTCX5ACT QTAAGOACAA ATTTCAGCTG ATCCTCACTA TTGTGGGCAC CATCGCTGGC 1320 

ATTGTCATTC TCAGCATGAT AATTGCATTQ ATTGTCACAO CAAGATCAAA TAACAAAACQ 1380 

AAGCATATTG AAGAAGAGAA CTTOATTGAC GAAGACXTPC AAAATCTAAA ACTGCXXSTCO 1440 

ACAGGCTTCA CXIAATCTTGQ AGCAOAAGQO AGOQTCTTTC CTAAGGTCAG GATAAGGGCC 1500 

TCCAGAGACA GCCAGATGCA AAATGCCTAT TCAAGACACA GCAGCATQCC CCOCCCTGAC 1560 

TATTA GAATC ATAAOAATGT GGAACCOGCC ATGGCCCCCA ACCAATOTAC AAGCTATTAT 1620 

TTAGA GTCrrr TAGAAAGACT GATGGAGAAG TGAGCACCAG TAAAGATCTQ GCCTCCGOGG 16 BO 

'i i'i'n' Cirrcc atctgacatc tgccswscctc tctgaatgga aottgtgaat gtttgcaaco 1740 

AATCC AGCTC ACTTGCTAAA TAAOAATCTA TGACATTAAA TGTAGTAGAT 6CTATTAGGG IBOO 

CTTCTCAGAO AGGTGGTTTT CTTGAATCAG TACRAAOTAC ItJAOACaU^TG GTTAGCGTTO 1860 

TTTTCTTAAT TCTTTTCCTG OTAGGGCAAC AAGAACCATT TCCAATCTAG AGGAAAGCTC 1920 

CCCAGCATTO CTTGCTCCTO GGCAAACATT GCTCTTGAGT TAAGTGACCT AATTCCCCTG 1980 

GQAGACA7AC GCATCAACTG TGGA0GTCCX3 A6GGGATGAG AAGGGATACC CACCACXTTTT 2040 

CAAGGGTCAC AA6CTCACTC TCTGAGAAOT CAGAATAQGG ACACT6CTTC TATCCCTCCA 2100 

ATGGAGAGAT TCTGGCAACC TTTGAACAGC CCAGAGCTTG CAACCTAGCX: TCACCCAAOA 2160 

AGACTG GAAA GAQACATATC TCTCAOCTTT TTCAGQAGGC GTGCCTGGGA ATCCAGGAAC 2220 

TTTTTGATGC TAATTAGAAQ GCCTGGACTA AAAATOTCCA CTATGGGGTG CACTCTACAQ 2280 

TTTTT6AAAT GCTAGGAG6C AGAA6GGGCA 6AGAGTAAAA AACATGACCT GGTAGAAGGA 2340 

AGAGAGGCAA AGGAAACTGO GTOGOQAGGA TCAATTAOAO AGOAGQCACC TGG6AT0CAC 2400 

CTTCTTCCTT AGGTCCCCTC CTCCATCAGC AAAGGAGCAC TTCTCTAATC ATGCCCTCCC 2460 

GAA6ACTGGC TOGGAGAAGO TTTAAAAACA AAAAATCCAG GAGTAAGAGC CTTA6GTCA6 2520 

TTTGAAATTG GAGACAAACT GTCTGQCAAA GGGTGCQAQA GGGAGCTTGT GCTCAQQAGT 2 5 80 

GCAGCOGTCC AGCCTOGGGO TGTA6GTTTC TGAGGTGTGC CATTGGGGCC TCAGCCTTCT 2640 

CTOQTGACAO AGGCTCAGCT GTG6CCAGCA ACACACAACC ACACACACAC AACCACACAC 2700 

A CAAATO GGG GCAA CCACAT CCAGTACAAa CTTTTACAAA TGTTATTAGT GTCCrTTTTT 2760 

ATTTCTAATO CCTTGTCCTC TTAAAAGTTA TTTTATTTOT TATTATTAIT TQTTCTTQAC 2820 

TGTTAATTGT 6AATG6TAAT GCAATAAAGT GCCTTTGTTA 6ATGGTGAAA AAAAAAAAAA 2880 
AAAAAAA 

8eq ID NO I 150 Protein aequance 
Protein Accession #t NP_149038.1 

1 11 21 31 41 51 

I I I I I 1 

MKAXXHLTLL AZiLSVNTATN QGNSADAVTr TETATSQPTV AAADTTBTNF PETASTTANT 60 

PSFPTATSPA PPIZSTHSSS TIPTPAPPII STHS8STIPI PTAADSBSTT NVNSLATSDI 120 

ITASSPNDGL ITMVPSETQ3 NNEMSPTTED NQSSGPPTGT ALLETSTLNS TGPSNPOQDD 180 

PCADNSLCVK LHNT6FCLCL EGYYYNSSTC KKGKVFP6KI SVTVSETFDP EEKHSMAYQD 240 

L HSEIT SLPK CVFOTSVYGQ TVILTVSTSL SPRSQIRADD KFVNVTIVTI LABTTSDNEK 300 

TVTEKIHKAI RSSSSNFLNY DLTLRCDYYG CNQTADDCLil GLACDCKSDL QRPNPQSPFC 360 

VASSLKCPDA CNAQHKQCLI KXSG6APECA CVPGYQEDAN GNOQKCAFGY SGLDCKDXFQ 420 

LILTIVGTIA GIVILSMIIA LIVTARSMNK TKHIESENIiZ DEDFQNUCLR STGFTNLQAE 480 
GSVFPKVRIT ASRDSQMQNP YSRHSSMPRP DY 

8eq ZD NOt 151 DMA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence i 87 . . 914 

1 11 21 31 41 51 

I I I I I I 

AGTAOGCXSOa GCCGOGTTTT aCCTCCXX3W3 CAGCTCTGGG CTCTTCTCAG CTGCGCGAGC 60 

AGCTGCTCCA ATGCCCXIGGA GTGGCCATGG GCGCCCOSCA CP03TGGGAC CAGCTQCAGG 120 

CTQGTAGCTC GGAGGTGGAC TGGTGOGAGG ACAACTACAC CATOQTOOC r GCTAT06CCG 180 

AGTTCTACAA CA06ATCAQC AATGTCTTAT TTTTCATTTT ACGQCCCATC TOCATGZ6CT 240 

TOTTTOOTCA OTATGCAACA TOCTTCAACA OTGOCATCTA CTTAATCTG6 ACTCTTTTGO 300 

TTGTAGTGGG AATTGGATCC GTCTACTTCC ATOCAACCCT TAGTTTCTXa GGTCAGATGC 360 

TTQATGAACT TGCAGTCCTT TQGGTTCTGA TGTGTGCTTT GGCCATGTGQ TTCCCCAQAA 420 

GGTATCTACC AAAOATCTTT OGGAATQACC GGGOTAGGTT CAAGGTGGTG GTCAGTOTCX: 480 

TQT CTGOG GT TAOGAOGTGC CT66CATTT0 TCAAGCCT6C CATCAACAAC ATCTCTCTQA 540 

T8AC0CTGGG AOTTCCTTGC ACTGCACT6C TCATOSCAGA GCTAAAGAGG TGT6ACAACA 600 

TGOGTGTQTT TAAGCTGGGC CTCTTCTOGG GCXn^TGGTG GACCCTGQOC CrGTTCTGCT 660 

OGATCAGTGA OOGAGCTTTC TQOQAGCTGC TGTCATCCTT CAACTTCCCC TACCTGCACT 720 

GCATGTGGCA CATCCTCATC TGCCTTGCTG CCTAGCTGOa CTGTGTATGC TTTOCCT A CT 780 

TTOATGCTGC CTCAOAGATT CCTGAGCAAO GCOCIGTCAT CAAGTTCTGG OCCAATOAQA 840 

AATGGGCCTT CATTGGTGTC CCCTATGTOT CCCTCCTGTG TGCCAACAAQ AAATCATCAG 900 

TCAAGA TCAC GTQATGGCAA GATG6TGGCT GGCTTCTCTG CTTATCGCXX: CTCATOCAGT 960 

GGGCTTCCTT TQCTAQGAAQ ACAGCCAAGG GAGTTCGAAT AGTTGGGOTG TGGGCTATCT 1020 

TTTCAAAAAT CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTQTA TGTAGQOATT 1080 

TAAA CTTTQ T C ATATQOTA C AAATATTCCC TGCCOCCCTO CAGTTTOCCA VTLMiVmV 1140 

A GTATGTTAA TATTTTTGTO CCATACTGGT TTTAAACTTT CAT6TTGTCA CATCTGTTAA 1200 

TCTTTTCTTT AGGATTTCTO QATTTTGTGT AATTTTTAAA AAGGTCCCXTT CCTCCTCCCT 1260 

AATGTGTCTG TQGACXACCT G6ATTCCACT GTACAAGGOa AAAAGTGTCT ATTCCTTTCC 1320 

CAAAGATGGA AAA1GGAGG0 CTTAGGOACA CTAGATGCAT CTTTCTCAflC ATCACTTCCA 1380 

GATQCAOTaA CTTGTTGG6C TQOQTGCTTA ATGGCCATGG CAGAGCA6TC 0CTTGG0G6A 1440 

TOCAOCCCTO TACAATGCAT CTCTTCCTGO AOAAAOCTGa CCT6CTCCA0 ACCCCACCAT 1500 

TCCCAOOOQC CCTT6GAGTG GACTCTACTG ATQACAGACA GACCCTCTGA QAGACAAGAC 1560 

CCTCTQACTC TGTGATGQAA GATGCCAOAG ATTTTCCTTT OGGGTAATTQ TCCTTAAACA 1620 

AAACCAAACA GATGAAACAC ACACAGGACT TGTGGCTAAA AAGQCrAGTT TTTCACTTGC 1680 

ATTTCTCAAC TAACGCAGGT TTTACATGCA TCTGTGAATC CrTTTACTAC TA C C TC TGTG 1740 

QASAOATOQA GAGACTTCAG ATAAAC3GTGA AOCTAATGAO TAAAAGOCTC TCTOCCAAAA 1800 

CCTAC ACTOC ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTTTAAATA CTGAGCACAA 1860 

TTTTTAAATA CTGACATCAC TTCXTrCTTCC CCCTCCCACC CCAGCTCAGC AGCCTCAAAT 1920 

CTACAOAGAA GAAOAATTAT OGCATGAACA TTCCCACAGA CCXaCCATCT TTAAGACTTG 1980 

AOCTCTGTAA 6TTTACCAAA GGGCTCCTCA CAATTGTOGT OGGOaTTCTG GTTCAAAATT 2040 
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TGQAOCAAAC ATGAAGTTTT TGC3AAACGTT TTCTCATTTQ AAGCCTCCAG TATOCTGTAC 2100 

TATTCTGGAA ATTACCTTCA AGAGTCTCAC TTCTTGrrTC TCTTOTOTTT TCTOTGGGCA 2160 

TCATGTTCTT CAOSCTTGCA GTAQAAGGTO CTTTCTOQGT TTCCCAGAGT ATCCAAOGGC 2220 

TCACCTTTCT CAAGTGCTGO CAGTAGCTAT GCACTCAOGG QCTGGTTTQO aTOGGTGGTG 2380 

CAGCAGOQCA AATCTGTTGC Ci TC T G AATT T TTCTCACCT AATGTGACAC TGGCTACAAT 2340 

GAATCTTCTC TTCATOSGGC TGAATGAAAG ATTCAAQAAC CATCTTCAAG OTGCATGOTG 2400 

GOAATTATCA ACCTCAGGGA TACTCATTTT AACTCAQOCG TOTCCTGCTT TGTAACATTC 2460 

CATrGTTGCG AOAGGGCAOO ACAGGTGTOT TCTTCTGTGG GCAGGAGTCA TGTCACTGTC 2520 

CTACATATGT AAGAGTTGGG AAGGTQACX3A TTTTTGACAC ATCCAGGAAC TCTTA CTCT A 2580 

GTTA6AATTT 6TACCAGATC CAAGGTOAAA ACCCXAATAA OCAACTGAAT TTAGAGTTTA 2640 

AAAATGAATQ ACTTTATGCT ACATCTGTGG TTATCAAATT ATATA6GTTO TTOAGAAOCA 2700 

GAAOGCTGTT TGTAGTAAGA AATCTTTGTX3 QAACCCCAQT GTGTQAAQTA AATTGTATGT 2760 

TATTAAATTT ATTTAAGGTT AAATTTATGG CATTTACTTA ATAATATATG AGGTGGTGAA 2620 

AATGCAAATT AACAAATTGG TAATTTCCAA GGTAGAAAAA TTAGOTGTTO AATGAATGTA 2880 

TATGTTG6TT TATTATAGTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGQ 2940 

OTCTTQCTCT ATTAOOCAGQ CTGGAGT6CA GTGGTGCAQT CATGACTCAC TOTAGCCTCT 3000 

GTCTCCCAGG CTGAAGTGAT OCTCTCACCT CAGCCTCCCC AGGAGCTGGG AGTACTGGTG 3060 

OQTGCCTCCA CTCGCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCATTTT 3120 

QCCCAGGCTQ GTCTCAAACT TCTGGGCTCA AGAQATTCGC COGCCTCGGC CTCOCAAAGT 3180 

GCTGGQAATA GGCATQA6CC QTCACGCCTG GCCTAAAAAA TATTTTTAAA ATQATCTTTG 3240 

AATTAAATAT TCGTAGAATT TCTAATGTAT CTCTTTOAQA CCTAGGAOOT TQATOQAACA 3300 

GAACTOCT G T TAA8TCCTTT GOOTTTCAAO TCTA6AATTT TTTAAA6GCA AATATCAGCT 3360 

CATTCTTATT TTAGATTGAC CTTATCAGGC ATGGATTCTG GTCTCATCTA CTTTATGGTA 3420 

TAAATGCTCC AAGGTAGGGG GTTTGQTATA TATTTTAAGC CXX3GCTTTTT TTTTTTTTTT 3480 

riTrrrr TT T tttttaatgt gagaagcaga atgtgcttct agaaactggt tttaaagaqa 3540 

TGAGCTQAOA AA6AAATGTG GAATGGAGTA TATTT8A60A GQACAAAACA TAACTTCACT 3600 

TTtGAACAGA AATCACTCTA 6CTTGCCA6C ATGGGATGTA AACCAAGAQA OTAOAAATAT 3660 

ACCCATCTTA TTTTAAGTTG GGTTTATGGC ATCGCTCATA TATQTAAAAG CACTACAAAC 3720 

TCTTTAAAGA AAATTGGGAA ACTACAGAOA AGTCAAAGAA AAAAAAAAGT AACCCATATT 3780 

TCTATTGCCC AGGTATAATC CTTGTTAATA TTTTGQTTTG OTCTCCTCTT TTTTTCCCCC 3840 

AATATAGTTO TAAATAAATG ATGTCTTTCA GAGTTOACAT TTATCCTGTA GCTTGAATGG 3900 

CATGTAAATO CCAGTTGTAT ATTTTTTCAT GAAGTGTAGG TTTGGAATAC ACTAGAOTTA 3960 

GCTATATGCT TGAATGCTGA TCACTGGATT CTGAGACTGA CTACTQAOTC TACCTTTTTA 4020 

ATCAAGCCTA ACATGAATGG GCTCC3UVAAA GTAATGAATG TAATTGTACT TTTTGATGTG 4080 

CCTCTGCACT TGGCTTGGTG AGTCATCATA AATAQCTGTT AAATATGTGA CTTTACAGAT 4140 

TTTGATATGT TCAOATTGTA AAAAATGAAT AGTTTATTTC ATTAATTGAT GGGCAGTCAA 4200 
GAATCTCOCT CC 

Seq ID NOi 152 Protein sequence 
Protein Accession #i Eos sequence 

1 11 21 31 41 51 

I i I ) I I 

MGAPHWWDQI* QAGSSEVDWC EDNYTIVPAI AEFYNTISNV I^ILPPICM CLFRQYATCF 60 

NSGIYLIWTL LVWOIQSVY FHATLSFLGQ MLDBIiAVLWV LMCALAMWFP RRYLPKIFRN 120 

DRGRFXVWS VL8AVTTCIA FVKPAZ1INX6 LMTLGVFCTA LblAELXRCD NHRVFKLGLF 180 

S6LHNTLALP CHZ8DRAPCE LLSSPNFPYL HCMMHILXCL AAYIjGCVCFA YFDAASBIPB 240 
QOPVIKPWPN BKWAFZGVPy VSLLCAMKKS SVKIT 

Seq ID KOt 153 DMA sequence 

Kuclelc Acid Accession fft IIM_001432.1 

coding sequence i 167.. 676 

1 11 21 31 41 51 

I i I 1 1 I 

TCACTTGCCT GATATTTCCA GTOTCAGAGG GAC3^CAGGCA AOSTG GGQTC CCTTCTAGGC 60 

T6ACAGCC6C TCTCC3M5CCA CT6CGG08A6 CX300TCTGCT 0C06CCCTGC CCGTGCACTC 120 

TOCOCAGCOG CCCTCOGCCA AGCCCCAGCG OXGCTCCCA TOGCCQATGA OOO aSGGG AG 180 

GAQGATGGAQ ATGCTCTQTG COGGCAGGGT CCCTGCGCTG CTGCTCTGCC TQGGTTTCCA 240 

TCTTCTACAO 0C3\GTCCTCA GTACAACTGT QATTCCATCA TOTATCXrCAQ GAGAOTCCAG 300 

tGATAACTGC ACAGCTTTAO TTCAGAGAOA AGACAATCCA CaTGTGGCrC AAGTGTCAAT 360 

AACAAAOTOT AGCTCTGACA TOAATGGCTA TTGTTTGCAT GaACAOTOCA TCTATCTGOT 420 

GGACATGAOT CAAAACTACT GCAGGTGTGA AGTQGGTTAT ACTGGTGTCC GATGTGAACA 480 

CTTCTTTTTA ACCX5TCCACC AACCTTTAAG CAAAGAGTAT GTGGCTTTGA COGTGATTCT 540 

TATTATTTTG TTTCTTATCA CAGT00TCX3G TTCCaVCATAT TATTTCTGCA GATGGTACAG 600 

AAATCOAAAA AOTAAAOAAC CAAAOAAGQA ATATGAGAGA 6TTACCTCA0 GGQATCCAGA 660 

GTT0C06CAA 6TCTGAAT06 06CCATCAAA CTTATGGGCA GGQATAACAO TOTOCCTGQT 720 

TAATATTAAT ATTCCATTTT ATTAATAATA TTTATCTTGG GTCAAGTGTT AGGTCAATAA 780 

CACTOTATTT TAATGTACTT GAAAAATOTT TTTATTTTTG TTTTATTTIT GACAGACTAT 840 

TTGCTAATOT ATAATGTGCA QAAAATATTT AATATCAAAA QAAAATTGAT ATTTTTATAC 900 

AAOTAATTTC CTGAGCTAAA TGCTTCATTO AAAGCTTCAA AOTTTATATG OCTGOIGCRC 960 

AGTGCTTAGA AGTAAOCAAT TCCCAGGTCA TAGCTCAAGA ATTCTTAGa^ A ATOA CAGAT 1020 

TTCTGTAAGC CTATATATAT AQTCAAATCX3 ATTTAGTAAG TATGITTTTT ATGTTCCTCA 1080 

AATCAGTGAT AATTGGTTTG ACTGTACCAT GGTTTGATAT GTAGTT0GC3^ CCATGOTATC 1140 

ATATATTAAA ACAATAATGC AATTAQAATT TGGOAOAAGC AAATATAGGT CCTGTQTTAA 1200 

ACACTACACA TTTGAAACAA GCTAACCCTO GGGAOTCTAT GGTCTCTICA CTCAGGTCTC 1260 

A6CTATAATT CTOTTATATG AGGGGCStfSTG GACAOTTCCC TAT60CAACT OlOGACTCCT 1320 

ACAGGTACTA GTCACTCATC TACCAGATTC TGCCTATGTA AAATGAATTG AAAAACAATT 1380 

TTCTGTAATC TTTTATTTAA GTAQTGGGCA TTTCATAGCT TCACAATGTT CCTTTTTTGT 1440 

ATATTACAAC ATTTATOTGA QOTAATTATT 6CTC3UICAGA CAATTA GAAA AAAG TCCAC A 1500 

CTTGAAOCCT AA A TTT G T G C TTTTTAAGAA TATTTTTAGA CTATTTCTTT TTAT AOGOG C 1560 

TTTQCTQAAT TCTAACATTA AATCACAGOC C AAAA TTTGA TGGACTAATT AT TATTT TAA 1630 

AATATATQAA GACAATAATT CTACATGTIG TCTTAA6ATG GAAATACAGT TATTTCATCT 1680 

TTTATTCAAG GAAOTTTTAA CTTTAATACA GCTCAGTAAA T QQLTi ' CHC TAOAATGTAA 1740 
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PCTAJS02/36810 



10 



20 



AGTTATGTAT TTAAAGTTGT ATCTTQAOVC AOGAAATGGG AAAAAACTTA AAAATTAATA IBOO 

TtWTOTATTT TTCCAAATGA AAAATCTCAA TTQAAAGCTT TTAAAATGTA QAAACTTAAA 1660 

CACACC TTCC TOTGGAGGCT OAOATOAAAA CTAGGOCTCA TTTTCCTGAC ATTTGTTTAT 1920 

TTTTTOQAAO AGACAAAGAT TTCTTC1X3CA CTCIGAGCOC ATAGGTCTCA QAGAOTTAAT 1980 

AGGAOTATTT TIGGGCTATT GCATAAGGAO CCACTGCTGC CAC3CACTTTT OGATTTTATG 2040 

GGAGQCTCCT TCATCX3AATG CTAAACCTTT OAGTAOAOTC TCCCTGGATC ACATACCAGG 21O0 

TCAGGGACOA TCTOTTCTTC CTCTAOGTTT ATCCTGGCAT GTGCTAGGGT AAAOGAAGGC 2160 

ATAATAAGCC ATGGCTGACC TCTGGAGCAC CAGGTGCCAG GACTTGTCTC CATGTGTATC 2220 

CATGGATTAT ATACCCTGGT GCAATCACAC OACTGTCATC TAAAOTCCTG G CCCTG GO CC 2280 

TTACTATTAO QAAAATAAAC AOACAAAAAC AAGTAAATAT ATATGGTCCT ATACATATTG 2340 

TATATATATT CATATACAAA CATGTATGTA TACATGACCT TAATGQATCA TAGAATTOCA 2400 

GTCATTTGGT GCTCTGCTAA CCATTTATAT AAAACTTAAA AACAAGAGAA AAGAAAAATC 2460 

AATTAGATCT AAACAGTTAT TTCTGTTTCC TATTTAATAT AQCTGAAOTC AAAATATGTA 2520 

^ AOAAGACATT TTAAATACTC TACTTACAGT TX3 G 0CCTCTQ TGOTTA6TTC CACATCTGTO 2580 

1^ GATTCAACCA ACCAA6GAC0 GAAAAT6CTT AAAAAATAAT AGAAGAACM CAAAAAATAC 2640 

ATTATAACaU^ CTATTTACTT TT l " mTm ' CTTTTTGAGA TGGAGTCTOO CTCTOTTOCC 2700 

CAGGTTGGAO TQCAGTGGCA CX3ATCTCX3GC TCACTGCAAC CTCACCTCCC GGGTTCAAGA 2760 

GAT CCTCCTG C CTCAGC CTC CTQAGCAGCT GQGACTACAG 60QCATQCCA CCATGGGCAO 2820 

CTAATTTTTG TATTTTTAGT A6A6GG6GGG TTTCACCATG TTQG0C3V06A TG6TCTCAAT 2660 

CTCCTAACCT TGAGATCCAC CCTCCACAGC CTCCCAAACT GCTQGGATTA CAGGOGIGAG 2940 

CCACOGCAGG TAGCATTTAC ATTAG6TATT ACAAGTAATG TAAAGATOAT TTAAGTATAC 3000 

AGGAGGATGT GAATAGGTTA TATGCAAGCA CTATGCCCTT TTATATAAGT GACTTOAACa 3060 

TCT6TGCCCG ATTTTAGTAT GTGCAGGGGG GCGATCTGGG AATCA8TCCC CT6TQGATAC 3120 

CAAGGTACAA CTGTA TTTAT TAAG6CITAC TAOATGTGAG GAGAGTCT6A ATATTTTCAO 3180 

ZD TGATCTTGGC TGTTTCAAAA AAATCTATTG ACTTTTCAAT AAATCAOCTO CAATCCATTT 3240 

ATTTCATTTA CAAAAGATTT ATT6TAAGCC TCTCAATCTT GGTTTTTCAO TTGATCTTAA 3300 

GCATGTCAAT TCATATVAAAC AAGTCATTTT TOTATTTTTC ATCTTTAAQA ATOCTTAAAA 3360 

AAGCTAATCC CTAAAATAGT TAGATCTTTO TAAATGCATA TTAAATAATA AAGTATGACC 3420 

^ . CACATTACTT rrTATOGGTO AAAATAAGAC AAAAATAATA OTTTTAGTQA OGATGOTGCT 3480 

j\) GAGTAAACAT AAAAACTGAT TTGCTCTCAG CTGATGTGTC CTGTACACAG TGGGAAGATT 3540 

TTAGTTCACA CTTAGTCTAA CTCCCCCATT TTACAGATTT CTCACTATAT ATATTTCTAG 3600 

AAGGGGCTAT GCATATTCAA TGTATTGAGA ACCAAAQCAA CCACAAATQC ATAAATGCAT 3660 

AATTTATGGT CTTCAACCAA GGCCACATAA TAACCCA6TT AACTTACTCT TTAACCAGGA 3720 

ATATTAAGTT CTATAACTAG TACTCAAGOT TTAACCTTAA AATTAAGATT TCCTTAAGCT 3780 

TAACCTTAAA ATTGATATTA TATTAAACAT ACATAATACA ATOTAACTCC ACTGTTCTCC 3840 

TGAATATTTT TTGCTCTAAT CTCTCTGCCO AAAOTCAAAG TOATGOGAGA ATTGGTATAC 3900 

TGGTATGACT ACGTCTTAAG TCAGATTTTT ATTTATGAGT CTTTGAGACT AAATTCAATC 3960 

ACCACCAGGT ATCAAATCAA CTTTTATGCA GCAAATATAT GATTCTAGT6 TCTGACTTTT 4020 

GTTAAATTCA GTAAT6CAGT TTTTAAAAAC CTGTATCTGA CCCACTTTGT AATTTTTGCT 4080 

4U OCAATATCCA TTCTGTAGAC TTTTGAAAAA AAA8TTTTTA ATTTGATGOC CAATATATTC 4140 

TGACCGTTAA AAAATTCTTG TTCATATGGG AGAA6GGGGA GTAATQACTT GTACAAACAG 4200 

TATTTCTGGT QTATATTTTA ATGTTTTTAA AAAOAGTAAT TTCATTTAAA TATCTGTTAT 4260 

TCAA ATTTGA TG ATGTTAA A TOTAATATAA TGTATTTTCT TTTTATTTTO CACTCTGTAA 4320 

TTOCACTTTT TAAGTTTQAA GAGGCATTTT GGTAAACX3GT TTTTATTAAA GATGCTATGG 4380 

*fJ AACATAAAGT TGTATTGCAT GCAATTTAAA GTAACTTATT 7GACTATGAA TATTATOSGA 4440 

TTACTGAATT GTATCAATTT GTTTGTGTTC AATATCAGCT TTGATAATTO TGTACCTTAA 4500 

QATATTGAAG GAGAAAATAG ATAATTTACA AGATATTATT AATTTTTATT TATTTTTCTT 4560 

GGGAATT6AA AAAAATTGAA ATAAATAAAA ATGGATTGAA CATCTTGCAT TCAAAATCTT 4620 
CAGTQAC 
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60 



8eq ID NOs 154 Protein sequence 
Protein Acceesion «: NP_001423.1 

1 11 21 31 41 51 

I I I I I i 

MTAGRRMEML CAGRVPAIiLL CLGFHLLQAV L8TTVIP8CI PGBSSDKCTA LVQTEDNFRV 60 
AQV8ITKCS8 DMNGYCLHGQ CJYhVDHSQtl YCRCEVOYTG VRCEHFFLTV HQPLSICByVA 120 
LTVILIIIiFL ITWGSryYF CRWYIOIfiKSK EPKKEYERVT 8GDFBLPQV 

Seq ID NOi 155 DMA aequence 

Nucleic Acid Accession fii im_013282.2 

Coding sequence: 85.. 2466 



1 11 21 31 41 51 

65 I I 1 1 I I 

CGACTCCTTA 6AGCATGGCA TGGCTCAGAG GTGCTGGTAA AACTQATGGG GGTrTTTGCT 60 

GTCCCTCCCC TCAGCGCC6A CACCATGTGG ATCCAGGTTC GGACCATGGA 0GGGA0QC3VG 120 

ACCCACACG6 TGGACTCQCT GTCCAGGCTQ ACCAAGGTGG AGGAGCTOAO GCGGAAGATC 180 

CAGQAGCTOT TGCACGTGGA 6CC3U3GCCIG CA6AGGCTQT TCTACAOGGG CAAACAGATG 240 

/O GAGGAOOGCC ATAC C CTCTT CGACTACGAG GTCCGCCTQA ATGACACC3VT CCAGCTCCTG 300 

GTCCX3CCAGA GCCTCXnXKrT CCXXCACAQC ACCAAQGAGC GGGACTCOQA GCTCTCOGAC 360 

ACCGACTCCG GCTGCTGCCT GGGCCAGAGT GAGTCAGACA AQTCCTCCAC CXACS300GAG 420 

GCXK3C0GC0G AGACTOACAG CAG6CCA0CC GATGAGGACA TGTGGGATGA GACGGAATTG 480 

_ GOGCTOTACA AGGTCAATQA GTACGTCGAT GCTCGGGACA CXIAACATGGG GOCOTGOTTT 540 

/j 6AG600CAGG TOOTCSVOGOT GAOOOSGAAG GCCCCCTCCC GGOAOGAGCC CTGCAOCTCC 600 

ACX3TCCAGGC CXSGCGCTGQA GGAGQAOOTC ATTTACCAOG TOAAATAOQA CQACTACXX3G 660 

GAGAACGQCO TGGTCCAOAT GAACTCCAGG GAC3QTCC3GAG CGCGCXKXXG CACCATCATC 720 

AA6TGGCAGG ACCTGGAGGT GGGGCAGGTG GTCATQCTCA ACTACAACCC OGACAACGCC 780 

AAGGAGGOGO GCTTCIGGTA 0GA0GCGGA6 ATCTCCAGGA AGCGOGAGAC CAGGACXX3CG 840 

OU OGQGAACTCT ACX3CCAAC8T OOrGCTGGGO GATGATTCTC TGAAOQACTG TCGGATCATC 900 

TTOSTGGAOO AAOTCTTCAA GATTGAGCX5G COOOGTGAAG GGAGCCCCAT GOTTQACAAC 960 

CCCATGAGAC GGAAGAGCGQ OCX3GTOCTGC AAGCACTGCA AGGACX3A0GT GAACAGACTC 1020 

T6C0GGGTCT GOGCCTGCCA CCTGTGOOGG GGCOGGCAGG ACCC09ACAA GCA6CTCAT0 1080 

TG0QATGA6T GOQACATGOC CTTCOICATC TACT O CCT G Q AOOO G COC C T CAGCAOTGTT 1140 
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CCCAGOQAGG 
00GG6AGAGC 
TCACAG06GG 
GTCCCGTCCA 
C6AQTCCAGG 
AGCAA08A0Q 
GGGAATTTTT 
GOGGAACAGT 
TTTGCTCCCA 
GTOVGOGTXX; 
AACCGCTACX3 
TTTCTCGTGT 
GAGGGQAAGG 
GAAGCCCTGG 
CAG6AGGGGG 
GGAGGTGGCC 
COCTACAGTC 
CTGTGQAATQ 
TTGTTCCTGA 
OGGCCCATCA 
0GGGCACA106 
CAGGTGAACC 
GQQTQATCTC 
GA.TCX3GCACT 
CCTAAAAAOG 
GAATTTATGT 
CATAAAAGCX: 
ACTAOGTGGT 
CAACTCTTTA 
A6CAAGCATC 
TGGCCOGTGO 
AAAGAGGAAA 
TGCTTAGCGT 
CAGGCAGAAA 
TGTCCGACGA 
GATTOGTTCC 
0TC3UVCCAGA 
GGAAC06TTT 
CTTACAAOAO 
TTTTTTTTGT 
TTGTTTTTGT 
TTGCAGCCTA 
GAGCAATGTT 
AGGQAAOAAT 
TTAGATTCTC 



ACGAGTGGTA 
GGCTGAGAGA 
ACTGGGGCAA 
ACCACTACGG 
TCAGOSAGTC 
GAGCGTACTC 
TCACATACAC 
CTTGTGATCA 
TCAATQACCA 
T606CAATGT 
ATGGCATCTA 
GGCGCTACCT 
ACXXGATCAA 
C CAAOQBAGA 
GCTTOGGQTC 
CGAGCAGGGC 
TCAOGGCCCA 
AGGTCCTG6C 
GTAAAGTG6A 
OGACOGTOTO 
TGTTCAGCTG 
AGCCTCTGCA 
CAAGCACTTC 
QATTTTGTTC 
TITQTCTTCC 
ATTCT6GCTA 
TGCAATTTCT 
GTGGAGGCTG 
AG AAGG OCaAC 
TTCCTGAC3VG 
CAGCCCX3TGG 
CATCTOQGGC 
CT6AGATC06 
TGGCCTCAAG 
A0GCG6CCAC 
TTCTTTCTAA 
TTCTAGAAAC 
GAGCCTTATA 
GGTTTTTTTT 
AGTTACTGTA 
ATTTTTTTTC 
TACCTCAATA 
ATTTTTAAAG 
6A0ACAATTT 
AGAATAAATG 



CTGCCCTGAG 
GAGCAAGAAG 
GGGCATGGCC 
ACCCAT(XCG 
GGGTGTCCAT 
CCTAGTCCTG 
GGGTAGTGGT 
GAAACTCACC 
A6AA66Q6CC 
CAAGGGTG6C 
GAAGGTTGTG 
TCTGCGGAGG 
GAAGCTGGGG 



GGCCAGGAOG 
CGGOTCCCOO 
GCAGAGCAGC 
GTCACTCAAG 
GGA6A0GTTC 
GCAGCACAAC 
CCCTGCCTGC 
GACGGTCCTC 
T06ACA660G 
TTAGTGGGCT 
TTTTTTTTTA 
AAAGTTGGAC 
CQACAAAACA 
TTGATGTTTC 
AGGATCAGTC 
C A TTTTGTC3V 
CATGGCGTGG 
CTA6TTCAAA 
06TGAAAAGT 
GG6ACTCT6C 
G6AC6GACGC 
AGAOSACAGT 
TGOGGTCATC 
GATCATTTAC 
TAATTTTTTT 
TATGTACCAA 
TTTTGAAAGG 
AAACAGGGAT 
GGTTTTTTTC 
TarGTAQOCT 
TTTTTCACAG 



TGCCGGAATG 
AAGGCGAAGA 
TGTGTGGGCC 
GGC3ATCCCCG 
C3GGCCCCACX3 
GCGGQGGGCT 
GGTCGAQATC 
AACACCAACA 
GAG6CCAA0Q 
AAOAATAGCA 
AAATACXX36C 
GAOGATGATG 
CTGACXATGC 
GAGAACA6CA 
GGGAAG66CA 
OGCOGGACAT' 
CrCATCAGAG 
GAC0G6CCG0 
CAGT6TATCT 
GTGTGCAAGO 
CGCTAOGACC 
AACCAGCTCT 
TTTTGCTGAA 
TAACTTAAAC 
TTTTTATTTT 
TTCTCAGTAT 
ACACAAGATT 
TGGTGTCAAG 
CTTCTCTAGO 
TCTAAA6TCC 
CTCAGCTGTC 
CCTTTQCCTC 
OCTCTGCCCA 
TCCA0GT6GG 
CAGCACAOGA 
CTTTGTTGTT 
CAGTTCTTCC 
ATTCAATT7T 
TTCTCTTAAT 
GAAAGATATA 
GTTTGTTAAT 
ATTTTAAATC 
ACCTCCTTAT 
TTTTCTAAAG 
ATTGAAAAAA 



ATGCCAQCGA 
TGGCCTCGGC 
GCACCAAGGA 
TGGGCACCAT 
TGGCTGGCAT 
ATGAGGATQA 
TTTCOGGCAA 
0GG06CTGGC 
ACIGGGQQTC 
AOTAGBCOCC 
COGAOAAGGG 
AGCCTGGCCC 
AGTATGCAGA 
AGA0GGA08A 
AGTG6AAG0G 
CXAAGAAAAC 
AGGACAAGAG 
C6AG0G6CAG 
GCTGTCAOQA 
ACTGGCTGGA 
TGGGCCGCAG 
TCCCCGGCTA 
AAOGTQTOGG 
AGGTAOTGTT 
TCAAATCTAT 
TGTGTTTAGT 
TTTTAAAGAT 
TTCTCA6AAG 
GTTCTGGCCC 
AGT6ACATGG 
TGTTQAA6TT 
AAAGCCATCC 
CGAGAGCAGG 
GCCAOGCXnG 
AGTCAOGTGC 
AGCACTGAAT 
TGACACXCGA 
TTTAACTCAO 
6 AACA CATTT 
AOGTTACKSGT 
TTTTCTAATT 
ACATACCTGC 
TCTTAGATTA 
TCCAGTACTT 



GOTGGTACTG 
CACATOGTCC 
ATGTACCATC 
GTGGCX3GTTC 
ACAGGGCOGG 
OGTGGACCAT 
CAAGAGGACC 
TCTCAACTGC 
GGGGAA6CG6 
06CTGAGG6C 
GAAGTCCGGQ 
TTGGA0GAA6 
AGGCTACCTG 
GGAGGAGCAG 
GAAGTOOGCA 
CAA6GTGGAG 
CAACGCCAAG 
CCOGTTCCAG 
GCTGGTOTTC 
CAGATOCTTT 
CTATGCCAT6 
CGGCAATGGC 
AGGGCTOGTT 
TOCTCCQTTC 
AGATTTTCAO 
TCTTTGAAAA 
GGAATCAGAA 
TTGCTGCCAC 
CCAAGGTCAG 

rroccoGTGO 

GTTGCAAGGA 
CCCACCAGAC 
GAGTTGGGGC 
TGACT OACQC 
AAGTGCCTTT 
TATTGAAAAT 
TGGGTGCTTG 
CAAGTGAGAA 
TCTAAATGAA 
TTQOTTOTTT 

ttacx:aaaot 
agacaaactg 

TTAATGTATT 
TGTCCASATT 



Seq ID NO; 156 Protein sequence 
Protein Accession #> NP 037414.2 



1 
I 

MHIQVRTMDG 
YEVRUaxriQ 



OVIYHVinDD 
AEI8RRRETR 
SCKHCKDDVN 
PECRNDASEV 
IPOIPVGTMH 
SGGRDLSGNR 
GGKKSKYAPA 
LGLTMQYPBG 
SPRRTSKKTK 
TFQCIOCQBL 



11 

I 

RQTHTVDSLS 
XiLVRQSLVLP 
ELGLYKVHEy 
YPENGWQMN 
TARELYANW 
RLCRVCACHL 
VLAGERLRBS 
RFRVQVSESG 
RTAEQSC33QR 
EGNRYDGIYK 
YLEALANRER 
VEPYSLTAQQ 
VPRPITTVCQ 



21 

I 

RLTKVEELRR 
HSTKBRDSEL 
VDARDIMMGA 
SRDVRARART 
LGDDSLNDCR 
GGGRQDPDKQ 
KKKAKMASAT 
VHRPHVAGIH 

WKYWPEKGK 



31 
I 

KIQBLPHVBP 
SDTDSGCCLG 
WPEAQWKVT 
ZIKNQOLEVG 
IIFVDBVPKI 
mCDEC3DMAF 



41 

I 

GXiQRLFYRGK 



SSLIRBDKSN 
HNVCKDCUDR 



GRSKDGAYSL 
NCPAPINDQB 

SGFLVWRYIX 
EQQEGGFASP 
AKLHNEVLAS 
SPRAQVPSCP 



QWMLNYNPD 

ERPGEGSPMV 
HIYCLDPPLS 
MACVGRTKEC 
VLAOGYEroV 
GABAXDWR8G 
RRDDDEPGPH 
RTQKQKWKRK 
LXDRPASGSP 
ACRYDLGRSY 



Seq ZD NOt 157 DNA sequence 

Nucleic Acid Accession NM_000756.1 

Coding sequence! 186.. 776 



AGAAACTCAG 
GAGAGTGGGA 
TCTCXOCAGA 
CTAACAT6G6 
GCC06CCATG 
AGCACCCTCA 
CQCAGGCTOG 
TCAACSiAGAO 
6CAQC00CCC 
TGCTGCTGCC 
ATGCCCTCGQ 
TCTCCCTGGA 
ASTTA6CACA 



11 
I 

AGACCAAGTC 
ACAGTAAAGA 
GAGG006CAG 
GCTGCOGCTG 
CAGGGCGCTC 
GCCCTTGGAT 
GCGGGTCCTG 
CCG6GC0GCT 
TTOGCCGGAA 
TCGG06CTCG 
CGGCCACCAG 
TCTCACCTTC 
GCAAGCTCAC 



21 
I 

CATTGAGAGA 
GAAAGGAAGA 
CAOOOQGCTC 
CTTQTGTCOG 
CTGAGCCGCG 
TTCTTCCAGC 
CTCOGCATGG 
CCCCTTTOGC 
CAG60QACC0 
CTCX3ACAGCC 
GAGGCACOGG 
CACCTGCTCC 
AGCAACAGGA 



31 
1 

CTGAGGGGAA 
CAAGCTCCAG 
AOCTGOQAAO 
OGGGAGTCCT 

CGCCGCCX3CA 
GAGAGGAOTA 
C06CCTCCTC 
CCAACTTTTT 

coGOGGcrcr 

AOAGAGAAAO 
GGGAAGTCTT 
AACTCATGGA 



41 

I 

AGAGAGGAGA 
AGAAA6GCCC 
CXSGCTGOQAA 
GCTOGTGGCT 
GGGAGCTOQO 
QTCOGAGCAG 
CTTCCTGOQC 
GCT CCT O QOC 
CCGOQTOTTO 
CGCGGAGCX«: 
GCGGTCCGAG 
GGAAATGGCC 
GATTATIGGG 



1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



51 
I 

QMEDGHTliFD 
QEAAAETDSR 
S8TSRPAItEB 



DNPMRRKSGP 
SVPSBDBWYC 
TIVPSimYGP 
DBGNFFTYTG 
KFVRWRMVK 
TKEGKDRIRK 
8AGGGPSRAG 
FQLFLSKVEE 
AMQVNQPXiQT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



51 
I 

GAAAGAAAAA 
GGGAQACGTC 
GOQAOTGCCC 
CTCCT6CCCT 
CAGGCGCCGC 
CXXJCAGCAGC 
CTGGGGAACC 
GGAGGCA6G0 
CTGCAGCAGC 
GGCGCTAGGA 
GAGCCTCCCA 
AGGGC06AGC 
AAATAAAACG 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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PCTAJS02/36810 



OTOOBTTTCG CCAAAAAQAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 840 

TCTGTACCAT AGCGCT6CTC TTATSOCATT TQTTTATTTT TATATACCTT GAAACATAGA 900 

GGOAGAOAGO GAGAGAGCCT ATACCCCTTA CTTAGCATGC ACAAAOTOTA TTCACXOTCA 960 

^ SSfSff!!!^ AATQTTATTC QTTTTGTCTA CGTTTAOrTT C0GTTTCCA6 GTCTTTATAG 1020 

J TQQT6TTTTA AAGAOAATGT AGACCTGTGA GAAAAOOTTT TOTTTGAAAA AGCAGACAQA 1080 

AGTCACTCAA TTCSTTTTTCT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTCGQTCGG 1140 

TAAG ACTAA A TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAAOCTTT CAGTAATAAA 1200 

ACATCTTTCC AOTCCTTOOT CAOTTTOGTT CTGTAAGAOA ATGTTQAATA CTTATATTTT 1360 



10 



55 
60 



TAATAAAAGT TGCAAAGGT 

Seq ID NOi ISB Protein sequence 
Protein Accession ftt HP_000747.1 



1C ^ 11 21 31 41 SI 

MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPLDPFQPP PQSEQPQQPQ 60 

ARPVLUtMOB BYPLRLGHUI KSPAAPL8PA 8SLLAG6SQS RPSPEQATAN PFRVLLQQLL 120 

LPRRSLDSPA ALAERGARNA U36HQBAPER BRRSBBPPZS LDLTPHLLRB VLEMARAEQL ISO 
2Q AQQAHSNRKL MBIIOK 

Seq ID NOt 159 DNA sequence 

Nucleic Acid Accession «: MM 001200.1 

Coding sequence t 325..1S14 " 

25 I 11 21 31 41 51 

1 I I 1 I t 

G6GGACTTCT TGAACTT6CA GGGAGAATAA CTTG06CACC CCACTTTOCG COOGTGCCTT 60 

TGCCCCAOOO QAOCCTGCTT 06CCATCTCC 6A0CCCCACC GCCCCTCCAC TCCTCGGCCT 120 

TGCCCX5ACAC TQAGAOGCTQ TTCCCA60GT GAAAAGAGAQ ACTOCGCGQC OGGC3VCCCGQ 180 

GAGAATCAGG AG GCRA AGAA AAGGAACGGA CATTCGGTCC TTGOGCCAGG TCCTTTGACC 240 

AGAGTTTTTC CATGTOQACG CTCTTTCAAT GGAOSTGTCC COGCOTGCTT CTTAQACGQA 300 

CTOCGGTCTC CTAAAQGTCQ ACCATOGTOO OOSGQACCCXS CTOTCTTCTA GCGTTGCTGC 360 

TTCCCCAOGT CCTCCTGGGC GG060GGCT0 GCCTCGTTCC GGAGCTGGGC CGCA6GAAGT 420 

^ TCXS0GGCX3GC GTCGTOGGGC CGCCCCTCAT CCCAGCXXTC TGAOQAGGTC CTOAGCGAQT 480 

DD TCQAGOTGOG GCTGCTCAGC ATGTTOGGCC TGAAACAGAO ACCCACCCCC AGCAGGGACG 540 

CCQTGGTGCC CCCCT ACATQ CTAGACCTGT ATCGCAGGCA CTCAGGTCAO CXX3GGCTCAC 600 

COGCOCCAGA CCACCQO TTG 6AGAGGGCA0 OCAGCOGAGC CAACACTGTO OGCAGCTTCC 660 

ACCAT GAAGA ATCTTTGGAA GAACTAOCAG AAAOGAGTGG GAAAACAACC CGGAGATTCT 720 

TCTTTAATTT AAGTTCTATC CCCACGGAGQ AGTTTATCAC CTCAGCAGAG CTTCAGGTTT 780 

*f\) TCCGAQAACA QATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC CGAATTAATA 840 

J^ATOAAAT CATAAAACCT GCAACAGCX31 ACTCX3AAATT CCXXX3TGACC AGACTTTTGG 900 

960 



ACACC AQOTT OQTGAATCAO AATGCAAGCA OGTGGGAAAO TTTTQATGTC ACCCCCOCTG 

TGATGCGQTG (aCTGCACAO GQACA06C3CA ACCATGGATT OOTGGTGQAA GTGGCCCACT 1020 

- TGGAGGAGAA ACAAOGTGTC TCX^AQAGAC ATGTTAGGAT AAGCAGGTCT TTQCACCAAG 1080 

*fj ATGAACACAG CTQQTCACAG ATAAGQCC3VT TGCTAOTAAC TTTTGGCCAT GATGQAAAAG 1140 

GGCATC3CTCT CX^CAAAAGA GAAAAACGTC AAGCCAAACA CAAACSIGOGO AAACGCCTTA 1200 

AOTCOJGCTO TAAGAGACAC CCTTTGTACG TQQACTTCAG TGACGTGGGO TGGAATGACT 1260 

GGATTOTGGC TCCCCCGQGG TATCAOGCCT TTTACTGCCA OGQAGAATGC CCTTTTCCTC 1320 

TGGCTQATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GAOGTTGQTC AACTCTQTTA 1380 

DV ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC CGACAGAACT CAQTGCTATC TCGATCCTQT 1440 

ACCT TGAOGA GAATGAAAAG GrTOTATTAA AGAACTATCA GGACaTGGTT GTGQAGGOTT 1500 
GTGGOTGTGS CTAGTAGAGC AAAATTAAAT ACATAAATAT ATATATA 



Seq ID N0< 160 Protein sequence 
Protein Accession ft: NP_001191.1 

] " 21 31 41 51 

I I I I I I 

MVAGTRCLIA LLLPQVLL6G AAGLVPBLGR RKPAAASSQR P8SQPSDEVL SEPELRLLSM 60 
PGIiKQRPTPS RDAWPPTfML DLYRRHSGQP GSPAPDHRLE RAASRANTVR SFHHEE8LEB 120 
LPBTSQKTTR RFPPNLSSIP TEBPITSABL QVPRBQMQDA ZiGMNSSFHHR IMIYBIIKPA 180 
TAMSKFPVTR LLDT 



Seq ID MOj 161 DNA sequence 
OD Nucleic Acid Accession #t NM 001999.2 
Coding sequence i 1,.8736 ~ 

1 11 21 31 41 51 

70 i ' I I I I 

/V ATGGGOAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGOCTGGG CTGTGTGGTG 60 

CTCTGGGCGC AGGGCACGGC OG6CC3U3CCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 

CAGCOGCOGC CQCAACAG6T TOGGTOOGCT ACAOCAGGCT CTQAAGGOGQ GTTTCTAGCO 180 

COOGAGTATC GOOAGGAGGG TOOOOCAGTG 6CCAGCO0CG TCCGCOGOCG AGGACAGCAO 240 

QACOTGCTCC GAGGGCCCAA CGTGTG0G6C TCCAGATTCC ACTCCTACTG CTGCCCTGGA 300 

/j TGGAA GACOC TCCCTGGAGQ AAACCAOTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGQA 360 

GATG GATTTT GTTCCOGTCC TAACATOTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 420 

TGTQGATCAA AATCAATTCA GCAGXGCAOT GTGAGATOCA TGAATOOTGO GACCTGTGCA 480 

GATQACCACT 6CCAGTGCCA GAAAOGATAT ATTOGAACTT ATTGTG6ACA ACCTOTCTOT 540 

GAAAAT6GAT GTCAGAATGO TGGACOTTGC ATOGCCCAAC COTGTGCTTG TGTTTATGGG 600 

OU TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGQCC CGTGTTTCAC TCAGQTCAAC 660 

AACCAGATOT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTOTGCC 720 

AOaCTGGAC GGGCGTOGOG OCATCCCTGT GAGATGTGTC CA6CCCA6CC TCA0C0CT6C 780 

GGAC06GOTT TCATCCCCAA CATOOQCACT 6QAGCTTGCC AAGATGTTGA TGAAT60CAQ 840 

OCTATCCCAG GGATATCCCA AGGAOQAAAC TOTATCAATA CAQTGOGCTC TTTTGAATGC 900 

1015 
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AGATGCCCTQ CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAO 960 
T6CAGCATCA TTCCTGGGA7 ATGTGAAACT GGTGAATGTT CXSUICAOCXTT GGGAAGCTAT 1020 
I ' i I' lViVlT J GTCCACGTGG ATATGTAACC TCAACAOATQ GCTCTOGATO CATCGATCAG 1080 
AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGC06CT GTGCACAAGA GCTCCCGGGG 1140 
AGAATQAOQA AAATCCAGTG CTGCTGTGAG OCTGGCOGCT GCTGGGGCAT OGGAACCATT 1200 
CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GQATGGACTT 1260 
CCAATGOGAG GAATTCCAGG GA6TGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 
TTTGCCCCAA GTG6CAATG6 CAAT6GCTAT GGCCCAGGAG GGACAG6CTT CATCCCCATC 13 BO 
CCIGGA06CA ATOGCTTTTC TCCT GG C G TT GGGOGAOCGO GTGTGGG6QC 06G6GGACAG 1440 
G6ACCTATCA TGACTOGACT AACftATTCTG AACCAGACAA TAGATATCT6 TAAGCATCAT 1500 
GCTAA CCm ' GTTTAAAT6G A06CTGTATA CCAACTGTCT CAAGCTAC06 AT6TGAATGC 1560 
AACATGGGTT ATAAGCAGQA TGCAAATGGA GATTQTATAG ATGTTGAT6A ATGCACATCA 1620 
AATCCCT6CA CTAAT6GAGA TTGTGTTAAC ACACCT66TT CCTATTATTG TAAATGTCAT 1680 
GCTGGATTGC AGAGGACTGC TAOCAAGCAA GCATGCATTO ATATTQATGA GTGCATCCAG 1740 
AATGG G GTT C TTTQTAAAAA CGGTOGATOC 6TGAACTCAG ATGGAAGTTT CCAGTOCATT 1800 
TGCAATGCCO GCTTTGAATT AACTACAGAT GGAAAAAACT QTOTTGATCA T6ATGAATGT 1860 
AC3VACTACCA ACATGTOTTT GAATGQAATQ TQCATCAATG AAGATGGCAG CTTCAAGTGC 1920 
ATCTGCAAAC CAGGATTTGT CXTGGCTCCA AATGGOCGTT ACTGTACTGA TGTTGATGAA 1980 
TGCCAGAOOC CAGGAATCTO CATQAATGGO CACTOCATGA AGAOTOAAOG QTCCTTCOGC 2040 
TQTGACTaTC OCCCA66CCT G6CTGTGG6C ATGQATGGAC GTGTGTGTGT T6ATACTCAC 3100 
ATGCGCAGTA CCT6CTATGQ AGGAATCAAG AAAGQAGTGT GTOTOOOTCC TTTCCCXX3QT 2160 
OCAGTGACCA AOTCCQAATG CTGCTGTGCC AATCCAGACT ATGQTTTTGG AGAACCCTGC 2220 
CA6CCATGCC CTGCAAAAAA TTCAGCTGAA TTCCAOGGCC TTTGTAGTAG TGGAGTAGGT 2280 
ATCACTGTGG ATGGAAOAOA XATCAATGAA TOTGCTrrGO ATCCT6ATAT AT6T6CCAAT 2340 
GGG A TTT G TG AAAACTTACX3 TGGTA6TTAC 0GTT6TAATT GCAACAGTGG CTATGAACCA 2400 
GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 
GATAAOGGAT TGTGCXXyiAA CAC6CCAG6A AGTTACAQCT GTACGTGCCC ACCAGGQTAT 2520 
GTGTTCAGGA CT6AGACAGA GACCTGTGAA GATATAAAT6 AATGTQAAAG CAACCCAT6T 2580 
OTCAATGGGG CCTGCAOAAA CAACCTTG6A TCTTTCAATT QT6AATQTTC 6CC30GQCAOC 2640 
AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTOA AOGGGACCTO TTGGCTCAAC 2700 
ATCCAGGACA GCCGCTOTQA GGTGAATATT AATGGAGCX3V CTCTGAAATC TGAATGCTGT 2760 
GCCACCCTCQ QAGCCGCCTG GGGQAQCCCC TGTGAGCGGT GTQAACTAGA TACAGCTTGC 2820 
CCAAGAGGGC rTGCCAGGAT TAAAGGTGTT AGOTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 
TTCCCTGGGO TTTGTCXZAAA TGGAGGCTGT GTCAACAGTA AGGGATCTTT TCATTGGGAO 2940 
T6CCCTGAAG GCCTTA06TT GGATGGQACT GGCCGT6TAT OTTTGOATAT T06CATGGAG 3000 
CAGTOTTACT TGAAOTGGGA T6AAGATGAA TGCATCCACC CCOTTCCTGG AAAGTTCOGC 3060 
ATGGATGCCT GCTGCTGTGC TGTCGGGGCQ OCTTGGGGCA CCGAGTGTGA GQAGTQCCCX: 3120 
AAACCTGGCA GCAAGGAATA CGAGACACTG TGCCCCCX3CG GGGCTGGCTT TGCTAACCGA 3180 
GGG6AT6TTC TTACTGGGOQ OOCATTTTAC AAAGACATCA ATGAATGGAA AGCATTTOCT 3240 
GGGATGT6CA CTTATQQGAA GTGCAGAAAT AGAATC6GAA GCTTCAAATG 0C6TT6CAAT 3300 
AOrGGCTTTG CTCTAGACAT GQAGGAAAQA AACTGCACGG ACATCQACGA GTGCAGGATT 3360 
TCTCCTGACC TCTGTGGCAG TGGAATCTGC OTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 
TGCTTOGAAG GCTATGAAAO TGGCTTCATG ATGATGAAGA ACTGCATGGA CATT6AGGGA 3480 
TOTGAACGTA ACCCTCTCCT TTGTAGGGGT GOCACCTGTO TQAACACTOA GG6CA0CTTT 3540 
CAGTGTGACT 6CCCACTGGG ACAOOAOCTG TCACCATCOC 0T0A6GACT0 TGTGGATATT 3600 
AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGOAA AATGTGTGAA CATGATTGGA 3660 
ACCTATCAGT GCTCTTGCAA TCCTGOATAT CAGGCTAOGC CAGACCGCX». GGGCTGTACA 3720 
GATATTOATG AATGTATGAT AATGAAC30GA GGCTGTGACA CCCAGTGCAC AAATTCAGAO 3780 
GGAAGCTA06 AATGCAGCT6 CAGTGA6GGT TATGCCCTGA TGCXAGATGG GA6ATOQTGT 3840 
GCAGACArrO ATGAATOTGA AAACAATCCT GATKTCTGT6 ATGGGGGCCA GTGTACCAAC 3900 
ATTCCTGGAQ AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GQACATQAAA 3960 
ACATGCATTG ATGTCAATGA ATGTQACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 
GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 
ACCACAGQAT GTAGAGATGT GQATGAaiGT GAAATTQGTO CTCATAACTQ OQACATGGAT 4140 
GCCTCAT6TC TGAATATCCC AGGAAGCTTC AAOTQTAGCT GCAGAGAAGG CTGGATTGGA 4200 
AACXMCATCA AGTGTATTGA TCTGGAOGAA TGTTCTAATO GAACCCACCA GTGTAGC31TC 4260 
AATGCrCAGT GTGTAAATAC CCCGGGCTCA TACC5GCTGTG CCTGCTCCXSA AGGTTTCACT 4320 
GQTGATGGCT TTACCTGCTC AGATGTTGAT GAGTOTGCAG AAAACATAAA CCTCTGTGAG 4380 
AAOSGACAST GCCTTAATGT OCOGGGTaCA TATOGCTOCJG AGTOTGAGAT OGGCTTCACT 4440 
CCAGOCTCAG ACAGCAGATC CTGGCAAGAT ATEGATGAAT GCTGCTTCCA AAACATTTQT 4500 
GTCTCTGQAA CATGTAATAA CCTGCCTGQA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 
GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTOTGCAGA TCCTATAAAC 4620 
TGTGTCAATG GCCTATGTGT CAACAOGOCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 
TTTCAGTTQA AGCCAACTG6 TGTGGGTTQT GTT6ACAACC GTGTGGGCAA CTGCTACCTO 4740 
AAGTTTG6AC CTOGAGGAGA T6GQAGTCTG TCTTGCAACA CCGAGATOGG GGTGGGCGTC 4800 
AQTOGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG TQAGACATGC 4860 
CCCCCTGTC3V ATA6CACTGA ATATTACACC CTQTQTCCOO GAGGTQAAGG CTTCAGACCT 4920 
AACCCCATCA CAATCATTTT AOAAGACATT GAOQAATGCC AGGAGTTACC AGGTCTCTGC 4980 
CAG66TGGAA ACTGCATCAA CACTTTT6GG AQCfT C C A OT GTOAOTGCCC ACAA66CTAC 5040 
TAGCTCA6GG AG6ATACCC0 CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 
GTGTGTGGGC CTGGGACCTG CTATAACACC CTG06AAATT ACACCTGCAT TTGCCCACCT 5160 
GAGTACATGG AGGTCAATGG AGGCCACSU^C TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 
CGAAGCTATA ATGGAACXIAC TTGTGAGAAT GAGTTGCCTT TCAATQTGAC AAAAA66ATG 5280 
TGCTGCTGCA CATATAATGT GGGCAAA6CT GGQAACAAAC CrTGTGAACC AT60CCAACT 5340 
CCAGGAACAO CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTT6ACATT 5400 
CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 
GGTGTGTGCA TTAACCAGAT TGGCAGTTTC OGCTOTGAAT OCCCTACAGQ ATTCAGTTAC 5520 
AATGACCTGC TGTT G GTTTG TGAA6ATATA GATGAGTGCA GCAATQGTGA TAATCTCTGC 5580 
CAGCQGAATG CAGAGTGCAT CAATAOTCCT GOTAQTTACC GCTOTGAATG TGCCGCGGGT 5640 
TTCAAACTTT CACOCAATGQ GGC C TGTGT A GAT06CAATG AATGTTTAGA AATTCCTAAC 5700 
GTTTGCAGTC ATGGCTTGTG TOTTGATCTG CAAGGAAGTT ACCAGTOCAT CTGCCACAAT 5760 
GOCTTTAAGG CTTCTCAGGA CCAOACCATG TCCATGGATG TTGATOAGTG CGAGCGQCAC 5820 
CCATGTGGAA ATGGAACTTG TAAAAACACC GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880 
GOGTTTGAAC TCACTCATAA TAATGATTGC CTOGACATAG ATGAGT6CAG TTCCTTTTTT 5940 
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GQTCA6GTQT GCAOAAATOO AOtJTTGTTTT AATQAAATTG GTTCtTTCAA GTCTCTATGT 6000 

AA0GAA66TT ATCAACTTAC CCCAGAT6GC AAAAACTGTA TAGACACTAA TGAGTGTGTC 6060 

GCCCTTCCOQ GCTCTT6CTC TCCTGOTACC TSTCAOAATT TGGACGGATC CTTCAGATGC 6120 

ATCT GTCCOC CAGGGTATGA AGTAAAAAGC GAGAACTGCA rTGATATAAA TQAATQTGAT 6180 

OAAGATCCCA ACATTTGTCT TTTTG6TTCC TGTACTAATA CTCCAGGGGO CTTOU A OTGC 6240 

CTCTGOCCCC CTGGCTTTGT ACTATCTGAT AATGGAOGGA GATGCTTTGA TACTOGCCAQ 6300 

AGCTlVlWr TCACAAATTT TOAAAATGQA AAGTGTTCTG TACCCAAAGC TTTCAACACC 6360 

ACARAftGCAA AAT6CT0CT0 T AGTAA GATG CCAGGAGAGG GCT GG GGGGA CCCCTGT6A6 6420 

CTGTGCCOCA AAGAC6ATGA AGTT6CATTT CAGGATTTGT GTOCATATGQ CCATGQAACT 64 BO 

6TCCCIA0TC TTCAT6ATAC AOOTGAAGAT GTCAATQAOT QTCTTGAQAO CCCAGQCATT 6540 

TGTTCAAAT6 GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 6600 

TACAACCTTG ACTACACTGG AGTACGCIGT GTGGATACTO ATGASTGTTC AATCGGCAAT 6660 

C06TGTGGAA ATQQTACATG CACCAATGTT ATTGG6A3TT TTQAATGCAA TTGCAATGAA 6720 

GGCTTTGAGC CAG66CCCAT GATGAATT6T GAAGATATCA AGGAATGT6C CCftGAACCGA 6760 

Cl^WlXfra CTTTAOGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 

GGCTAT6CCC TCAGGGAAGA TCAAAAQATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 

TTACA0C3ACT GTGAATCTAG GGGCATQATO TOTAAQAATC TAATOGGCAC CTTCATGTGC 6960 

ATCTGCCXrrC CTGGAATGGC COGAAGGCCC GATGGAQAAG GCTGTGTAGA TGAAAATGAA 7020 

TGCA66ACXA AGCX9VGGAAT CTOTGAAAAT 6GAG8TTGTG TTAACATTAT TGGAAGCTAT 7080 

AGATOTGAGT GTAATGAAOO ATTCCAGTCA AGTTCTTCAQ GCACTGAATG GCTT6ACAAT 7140 

OSACAGGGTC TCTGCTTTOC AGAGGTACTG CASACAATAT GTCAAATGGC ATCCAGTAGT 7200 

C6CAATCTCG TCACTAAGTC AGAAT6CTGC TQTGATGGTO GGCGAGGCTQ GGGCCACCAG 7260 

T6CX3AGCTTT GCXXa^CTTCC TGGAACTGCC CAGTAC3«AA AGATATGTCC TCATGGCCCA 7320 

Q QATATAC AA CTGATGGAAO AOATATTGAT GAAIOTAAGO TAATO CCAAA GCTCTGCACC 7380 

AATGQTCAGT 6CATCAATAC CATGGGCTCA TTCOGATGCT TCTGCAAGGT TQ6CTACACC 7440 

ACAGACATCA GTG6AACCTC TTOTATAGAC CTTGATOAAT OCTCCCAOTC CCOaAAACCA 7500 

TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7S60 

6TCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGGAGCAT 7620 

AACTQCCAOT TCCTCTGTGT CAACACOCTO GG6GGGTTTA OCTOTAAATO TCCAOCTGOT 7680 

TTCACACA6C ATCACACTGC TTGTATCQAC AACAAC6AAT GTGGGTCTCA ACCTTTGCTT 7740 

TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 

GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATQ TGATGGGAAC 7860 

CACAGGTGCC AACACGGCTG CXAGAAC31TC CTGGOTGGCT ACAGATGTG6 CTGCCGCCAA 7920 

GOCTACATCC AGCACTACCA GTGGAATCAO T6T6T0GATG AGAATGAAT6 CTCCAATCOC 7980 

AATGCCTGTO GCTCTGCTTC CTGCTACAAC ACCCTQ6GGA OTTACAAGTG OGCXnOCCCC 8040 

TOGGOGTTCT CCTTOOACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 

TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACAOGGAGG GGGGCTACCT CTGTGGCTGC 8160 

GCCXXTTOGGT ATTACAGAGT GGGACAAGGC CACTGT6TCT CAGGAATGGG ATTTAACAAQ 8220 

0G6CAGTACC TGTCACTOGA TACAGAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 8280 

T6CTA0QAGT GCAAAATCAA 066CTATCCT AAGAAAOACA 6CA0GCA6AA GAOAAGTATT 8340 

CATGAACCTG ATCOCACTGC TGTTGAACAQ ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 

CCOrrCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 8460 

AGGCCOGCCA TCCAGCCCCT CAACAACCAC ATCGGTTATG TCATCTCTCA AGGGAAC6AT 8520 

GACAOOGTCT TC06CATCCA CCAAAG6AAT GGGCTCAOCT ACTTGCACAC GGOCAAOAAG 8580 

AAGCTCATQC CGOGCACATA CACACTG6AA ATCACTA6CA TOCCTCTCTA CAAGAAGAAG 8640 

GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGA6 8700 

GCTCTCAQAA TGAGGCTGCA GATTCAOCTC TATTAACCGT TCACAGACTT GOGOCXSVGGC 8760 

TCAAATCCTA GCACAGCCAG TCTOCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8820 

AGGAAAAATA ATAATAACTC VlMm\:m CCTCCCTGTC TTAGACTTTa AATGTTOAOC 8880 

CTCACAGGQA GQGATAATTT AGACTCT6GT ATOGCXAAAO ATTTGAGCTC AAAGGCAACC 8940 

GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 

TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT rmrr rXTl' 9060 

OCXGTTAGCA QTCTGTAACA CTTTGGGTAT TTTGCTATA6 TTGCTAATTA AAAAAATATA 9120 

GATQTTTATT TATTTTTAAT 6CAGTAATAT ATGQAOAAAT QAACAAACTA TOTAAACAAA 9180 

AAGGGAAACT CACrraTTTT TCTTTAGATT TATAAATTTG AGCT A TTTTT TTTA0AGGT6 9240 

CTTTTTAAAA ATCCAATAGA TACAAGAGAT OTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9300 

CTQATACACA CCTGATOGAT TTTAAAGAAA GCCACACAGA GCTGAATOGG GCAQTGCTAA 9360 

TGAATAATTT AAAAGACATG AATGTCATTA GATCGTTTAT AAOQTAGATC 6AAGCCAAAG 9420 

CAOCTCATTT GTOACAACAT TTCATATCAC CAOACACACC AGQCAACAGA AGTTaAAGCA 9480 

CAACCACICT A6CAAAATAC CTT6ACTGCT TGTGAGAOCA TTAGCATTGC AGGCCAAACC 9540 

GTACTGTATT TCXTTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACXTTCATT 9600 

CTTACCCAGG GTGCGCTGCX} TCCTCATGGT ACTGTAGGCA GCTGAAGAAC OGCOQTTCCC 9660 

TTG AAAGQQA ACAC CTGGC A TTCIGTGGT6 TTTCXST6CTQ TCTTAAATAA TGGTGCATTT 9720 

ATTATGTTCA AOTTATTTCA GGATTGCCAT AT6T6CAAAC AAATCATGCA ATQCAGGCAA 9780 

GQAATATATG TTGTTGTTGT TGTTTTAAAC CCATTTTTTT TTTAQAATTT TCATTAATAC 9840 

TGTAGTTATA CACCATATGC CTCATTTTAT CATAGCCTAT TGTGTATGAA AQATOTTTGT 9900 

ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA GATATTGTAC CAGGATCTGC 9960 

TATTAAGAGC A0GTATCX3VT TATTCTTCTC AACCCAAGAA CCTQTTTCCT GGACCAlCTGA 10020 

OCAAAGCTCA TATGTGAAAT GGCCAAAGCA CATGCAGGCT CCTGGTTGTT CCICTGAAAC 10080 

CIGT6CTGAC CAAAGATTAO TAACCA6TTA TACCCAGTAT TTTQAOGTTT 7ATTQTTTTT 10140 
TTAATAACTA AAAAAAAACT CGTGCC 

8eq ID NOs 162 Proteiji sequence 
Protein Accession «i HP_001990.1 

1 11 21 31 41 51 

I I 11 i I 

MQRBRRICLQ LYPLWLGCW LWAQGTAGQP QPPPFKPPRP QPPPQQVR8A TAG8EGGPLA 60 
PEXBEBGAAV ASBVRRRGQQ DVLRGPNVGG SRFHSYCCPa WKTU^SGNQC IVPICRNSCG 120 
DGFCSRPNMC TCSS6QIS8T CGSKSIQQCS VRCMNGGTCA DDHCQOOKGY IGTYOQQPVC 180 
EKGOQNGGRC lAQPCACVYG PTQPQCBRDy RTGPCPTQVN NQMCQGQLTG IVCTKTLCCA 240 
TTGRAWGHPC EMCPAQPQPC BRGFIFNIRT GACQDVDECQ AIPGICQGGN CIinVOSPBC 300 
RCPAGHKQSB TTQXCEDIDB CSIIPGICBT GECSinvGSY FCVCPRGYVT STDGSRCIDQ 360 
RTGMCFSGIiV MaRCAQBLFG RMnO«2CCCB PGRCHGIGTX PBACPVHGSE BYRRLCMD6X« 420 



1017 
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PKGGIP6SA6 SRFGGTGQIG PAPSGHGNGY GPGGTGPIPI PGOIGFSPGV 6GA6VGA00Q 480 

GPIITGLTXL NQTIDICKHH ANLCUIGRCI PTVSSYRCEC MMGYKQDANG DCIDVDBCTS 540 

IIPCTNGDCVN TPGSYYCKGK AOFQRTPTKQ ACIOIDECIQ NGVIiCKNGRC VK3DGSFQCI 600 

CKAGFBLTID GKMCVDHDBC TrTNMCU7GN CZNBDGSFKC ZCKPGFVIAP HGRYCTDVDB 660 

C3QTPGX01NO BCIHSBQSFR CDCPPGUIVG KD6RVCVDTH MRSTCyCGIK K6VCVRPFPG 720 

AVTKSECCCA NPDYGFGEPC QPCPAKKSAE FBGI^CSSGVO ZTVDGRDINB CALDPDICAN 780 

GICENLR6SY RCNQISGYEP DASGRNCIDI DSCLVNRLLC DNGLCRNTFG SYSCTCPPGY 840 

VPRTETETCE DINECESNPC VNGACRNNU3 SFNCBCSPGS KLSSTGl«ICI DSLK6TCWI4N 900 

XQDSRCBVHI NGATLKSECC ATU3AAMGSP CBRGBUMTAC PROLARIKGV TCEDVNECSV 960 

PPGVCPZiGRC VKSKG5FRCS CPB6LTLDGT GRVCLDIRME QCYLKHDSDE CXHPVP6KFR 1020 

HDACCCAVGA AWGTBCBBCP KPGTKEYETL CPRGAOFANR GDVLTGRPFY KDINECKAFP 1080 

GMCTYGKCRN TIGSPKCRC3f SGFALDMBER NCTDIDECRI SPDLGOSQIC VNTPGSPECE 1140 

CFBQyESGFM MMXNCMDIDQ CERNFLLCRO GTCVNTBOSP QCDCPLGHEL SPSREDCVDI 1200 

NECSLSDNLC RNGKCVNMIG TYQCSQIPGY QATP0RQ6CT DXDECHZMNO OOITQCTHSE 1260 

OSYBCSCSEG YALMPXX3RSC ADIDECENNP DICDGGQCIN XPGEYRCLCY D6FMASMDMK 1320 

TCIDVMECDL NSMICMPGEC ENTKGSFICH 0QL6Y6VKKG TTGCTDVDEC BIGAHNC3)MH 1380 

ASCLNIPQSP KCSCRBGWIG NGIKCIDLDB CSNGTHQCSI HAQCVNTPGS YRCACSEGFT 1440 

GDGFTCSDVD ECAEHINLCE NGQCLMVPGA YRCBCEMQFT PA5DSR5GQD IDECSFQNXC 1500 

VSGTOantPG MFKCICSOOy BWRTGGHCT DXOBCADPIH CVNGLCVNTF GRYBCNCPPD 1560 

FQU7PT6V6C VDNRVGNCYL KFOPRGDGSL SCNTBXGVaV SRSSCCCSLO KAHGNFCETC 1620 

PPVKSTEYYT LCPGGEGFRP NPITIILEDl DEC3QBLPQLC QGQNCINTFQ SFQCECPQGY 1680 

YLSEDTRICE DIDECFAHPG VCOPQTCYNT LGNYTCICPP BYMQVNGGHN CMDMRKSFCY 1740 

RSYNGTTCEI7 BLPFNVTKRM CCCTYNVGKA GNKPCBPCPT PGTADFKTXC GNXPGFTFDI 1800 

RTOKAVDXDE CKBXPGICAN GVCZNQXGSF RCECPTGFSY NDLLLVCBDI DECSNGDKLC 1860 

QRHADCIKSP OSYRCBCAAO FXLSPNGACV DSNECXEIPN VC8HGI1CVDL QGSYQCXCHN 1920 

GFKASQDQTM CMDVDECERH PCGNGTCKNT VGSYNCLCYP GFELTHNNDC IiDlDECSSFF 1980 

GQVCRNGRCP NEIQSPKCLC NEGYBLTPDG KNCIDTNECV ALPGSCSPGT CQNLE6SFRC 2040 

XCPPGYEVKS ENCXDIHECD EDPNlCSiFCS CZNTPGGFQC LCPPGFVLSD KGRRCFDTRQ 2100 

SFCFTNFEHQ XCSVPKAFWT TKAKCCCSKM FQE6HGDPCB I/^PKDDEVAF QDLGPY(B2(IT 2160 

VP8LHDTRED VNECLE8FGI CSNGQCINTD G8FRCBCPM6 YNLDYTGVRC VDTDECSX6H 2220 

PCGNGTCTNV XGSFECNCNE QFEPGPMMNC EDINECAQNP IiIXALRCKNT FGSYECTCPX 2280 

GYALREDQKM CKDLDECAEG LHDCBSRGMM CKNLIGTFMC ICPPGWARRP DGEOCVDENE 2340 

CRTKPGICEN GRCVNXIGSY RCEO^FQS SSSGTECLDM RQGLCFAEVL QTXCQMA88S 2400 

RNLVTXSECX; CDGGRGNOHQ CBLCPLPOTA QYKKICPHGP OYTTOGRDIO BCKVMPllIiCr 2460 

NGQCXNTM6S FRCPCKVGYT TDX8GTSCIO LDBC8QSFKP CNYXCKKTBO SYQCSCPRGY 2520 

VLQEDGKTCK DLDECQTKCJH NCQFLCVNTL GGFTCKCPPG FTQHHTACID NNECX3SQPLL 2580 

CGGKQICQNT PGSFSCECQR GFSLDATGLN CEDVDECDGN HRCQHGCQNI LGGYRCXSCPQ 2640 

GYXQHYQHNQ CVDBNECSNP NACX3SASCYN TL6SYKCACP SGFSFDQFSS ACHDVNECSS 2700 

SKNPCNYGCS NTBGGYLOQC PPGYYRVGQG KCVSGMGFNK GQYIfSLOTEV DBBNALSPEA 2760 

CYECKXMGYP XKDSRQKRSI HEPDPTAVBQ Z8LESVIX1IDS FVMMKFNLSR LQSKEHIItBL 2820 

RPAIQPXiNMSI XRYVISQGND SSVFRXHQRN OLSYLKTAKK KXjMPGTYTIiE XTSXPLYKKK 2880 
ELKXLEBStiE DDYLL6BLGE ALRMRLQIQL Y 

8eq ZD KOt 163 DHA aeguence 

Kuoleie Acid Accession Si NM_oi3372.i 

Coding sequence 1 63.. 617 " 

1 11 21 31 41 ' 51 

I I i I I I 

GCGGCCX3CAC TCAGCGCCAC GC6TCGAAA0 CX3CAGGCGCC GAGGACC06C OGCACTGACA 60 

GTATGAGCCX5 CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCrCTTGGGG ACXXTTGCTGC 120 

CGOCTOCTQA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCX3C CCCAGCAGCC TGGCTCCAGG AACGGQOGGC 240 

GGGGCCAAGG 6CXX3GOCACT 6CCAT6CCC0 GGQAGGAGGT 6CTGGAGTCC AGCCAA6A00 300 

CCCT6CATGT 6AC6GAGC6C AAATACCTGA AGGGAGACTG GTGCAAAACC CAGCOGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTOGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAOTGCAA CTCTTTCTAC ATCCCCAGGC ACATCOGGAA GGAGGAAGGT TCCTTTCAaT 480 

CCTGCTCCTT CTGOUVGCCC AAQAAATTCA CTACCATGAT GGTCACACTC AACTGCCXTTG 540 

AACTACAGCC ACCTACCAAO AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGQAAGTCCC AOACCTAAAA CAACCAQATT CTTACTTGGC TTAAACCTAO A66CCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGQAGCCTG CTTGTGCQTA GTTOQTGTGC ATGAUTGTGG 780 

ATGGGTGCCT GTGGGTGTTT TTAQACACCA 6AGAAAACAC AGTCTCTGCT A0AGA6CACT 840 

COCTATTTTG TAAACATATC TOCTTTAATO GGGATGTACC AGAAACCCAC CTCACCCCGG 900 

CTCACATCTA AAGGGGC3GGG GC0GT6GTCT GGTTCTOACT TTGTGTTTTT GTGCCCTCCT 960 

GGGGACCAGA ATCTCCTTTC GGAATOAATG TTCATQGAAG AGGCTCCTCT GAGGGCAAQA 1020 

QACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTOCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200 

TTTTQTGAAa ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGQTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGGAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAQAAAGTC 1360 

TTTTCCTAGT ATTTAACAGA ACCCAAGTQA ACAGAGQAGA AATQAGATTQ CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CXTTAACACCA AACTGAAAAC ATAAATACTG 1500 

ACCACTCCTA TGTTOSGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTO OAAAAGAGAG GTAGTTTA6A ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAOA GQCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AQAGCCACTA ACTTOATTQA TAAAGATCCT 1740 

GCCTCTGCIO AOTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTO 1800 

TTTTAGCAAG ARATATTKTG GGGOTCTTTT TGTTTTAACT ATTGTCAGGA GATTGG6CTA 1860 

RAGAQAAGAC GACGAGAGTA AOGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 1920 

GOTGTTAATA CCTGQTAGAA ATCTAAGGGA TATGACCTCX: CTTrCTTTAT GTGCTCACTQ 1980 

AGGATCT6AG GGGACCCTGT TAGQAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTO 2040 

CTACTGGTTO GATQGACATA ACTATTGTAA CTATTCAGTA TTTACT08TA GGCACTGTCX: 2100 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



TCTOATTAAA 
AOGGTGGGTG 
TTTTATATAC 
AGTCCTATGT 
TCTGGCATTC 
TCCA6TGCTC 
ACACCCAAAA 
CTAGGOBAAT 
CCAAATCTTT 
ACACAAATGC 
ATGTAAAACC 

AGCACTAATC 
CAAGTAAAGA 
TAAAAGCATA 
GACTA6TACA 
TTTTATTGGA 
TTATG6CAA0 
TGAATTTTAT 
GTCTGTAAGT 
TGCSAOOAQAO 
ATTTAATGTA 
TTCTCCTTTA 
TTAGAGTCTT 
QAOTCAOTGC 
AAGCAATATT 
AC3QAATA6CA 
AATOAAAAGT 
AGTTCTATTG 
AAATCATTTA 
ATGAAAGGGG 
ACTAGAATTT 
TAAATTAAAC 



CTTGQCCTAC 
AACTTTATT6 
AAACTCCCTQ 
AATAT6GAAA 
AGAGAACCCT 
TCCCATCTAA 
TGTTGGOTCT 
TTGTCCAAAC 
GTATTGTCCA 
TAAGGCAQAA 
ACACCAGGGA 
AACTCT6CGA 
TTCTTTTAGG 
OAATTTCCTC 
TCACTAQCXA 
AATGTGGTGT 
GTCACTOATQ 
ATATTTGTGG 
GATGTACACT 

GATAATTTCC 
ATTATTACTT 
TATOACTTTC 
TTATCTGGTC 
CTGAATCTTT 
AAQAAAOACT 
QATAATGATO 
GOATAAACAG 
ACATTCCTCA 
AAAACGGCAA 
A6TTGATAGT 
AATTTTCACC 
CIATTCTTTC 



T6GCAATY3GC 
TACTTTGGAT 
AATACTCTTT 
ACAAACACTG 
TGCAACTCGA 
CAACTAAACA 
QATTTTCAAA 
ACATAGTOTO 
CATTCTCCAA 
TTTTGAGGGT 
G6AAAAATQA 
CA AGAATG CA 
A6CTTSTAGC 
AACACTAACr 
AAGAGGGAAA 
GTCTTCCAAC 
ATOTAATQAT 
TCTTGATCAT 
TTGTGCTTGQ 
TACTQTAGOT 
ACTGTGTGGA 
CAAATGCTTT 
TCTGAGTTGG 
AGGGQAAACA 
ATTTTTTAAA 
TTAAATGTTA 
ACTAGTTa\C 
AACATTTATA 
AGATATTTAA 
AGAATTATAT 
CTCATAAAAC 
CCAATAATOT 
AAAAAAAAA 



TACTTAGQAT 
TTGQTTAACC 
TT6CCTT6TA 
CAQACTT6A0 
GAAGCTSTTT 
GGAGC CATTT 
CTTTTAAACT 
TOTGTTTTGT 
CAATAAAGCA 
GGGAGAGAAG 
CATTCA6AAC 
ATTTGGTTAA 
ACAGTCTTGC 
TCACTGGGAT 
TATCTGTTCT 
TTTCATTQAA 
ATATTTTTTC 
ACCTATTAAA 
CATTAAAAGA 
CTTCAAAGTT 
ATGTGAATAG 
(K3TCACTGTG 
GCAAAGAAGA 
AAATCTTGAC 
TTQAATOTTC 
TTTTGGAAGA 
ACATAAAGTC 
AGTGATCAGT 
TATCAACTGC 
AGACTATGAG 
TAAT7TGGCT 
TCTATATAGC 



TGATCTAAGG 
TQTTTTCTTC 
TCTTCTOVGC 
ATTCAOTTQC 
TTATTTCGTT 
CAAGGOGGGA 
CACTACTGAT 
ATACACTQTA 
CAGAGTGGAT 
AAAAGGGAAA 
CAGCAAACAC 
TGGACAT6AC 
ACATAA6TGC 
AATCAGCAGC 
TCTTACTGTG 
AATGCICATAT 
ATTATTATA6 
ATAATGOCAA 
AAAAAACACA 
AAQAGTGTAA 
TTAAATGAAA 
ATTTCAA6CA 
AGCTGACACA 
CCAGCTQAAC 
CTTAAAGGTT 
CTTAOGATGC 
CTTTTAAGGA 
TAATGCCTAA 
ATTATGTATT 
GTACCTTQCT 
TCAAOTTTCA 
CTTTGCTAAA 



GCCAAAGTGC 
AAGOCTQAQQ 
CTCCTAGCCA 
CQATCAAGGC 
TTTGTrrTGA 
GATATTTTAA 
GATTCTCAOG 
TGACCCCACC 
TTAATTAAGC 
GAAGCTQAAA 
TGAATTTCTC 
TTAAGTTG6C 
AGATTTGGCT 
GTAACTACOC 
CCTATATTAA 
CTATACCATA 
TAGAATATTT 
ACACCAAATA 
CATCCTGGAA 
GTGAAAAATC 
AGTTATQQTT 
T GIl T ' l ' CnT 
COQTATGTTG 
ATGTCTTCCT 
AACATTTCTA 
ATGTATACAA 
GAAAATCTAA 
GAOTGAAAOT 
ATQTCTGCTT 
6TGTAGGAG0 
TGAATCTGTA 
GAOCAACTAA 



Seq ID NO: 164 Protein sequence 
Protein Accession Ui NP 037504.1 



1 11 21 31 41 51 

I 1 I 1 t I 

MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSHNRGR 
GQGR6TAMPG EBVLB8SQEA LHVTERKyiiK RDWCKTQPLK QTIHEEGC3IS RTIINRFCYG 
QGNSPyZPRH IRXBBOSFQS CSFCKPKKFT TMMVTWCPB liQPPTICKKRV IUVXQCRCZS 
IDLD 

Seq ID NO I 165 DNA sequence 

Nucleic Acid Accession fti CAT cluster 



51 



1 11 21 31 41 

i I I I I , 

GAATTGCATC GGACAGAQCT TCGCCATGGC CGAGTTGCQC GTGGTTGTGG CaCTAACACT 
OCTACQTTTC 0GGCT6AG0G TGGACC6AAC G06CAAGGT0 OG0066AAGC 0SGA6CTCAT 
ACT6CX3CAGG OAGAACSGaaC TCTGGCTCAA Q 



Seq ID NO I 166 DNA secruence 

Nucleic Acid Accession 0t FGBIGSH predicted 

CSodlng sequences 1..1650 



1 
I 

A7G0CACCTC 
GCCATCACAG 
GCGGTGTCCA 
CTGAGGCTCT 
CCCCGGOGCA 
CAAGATGAGA 
0TCCTQCXX3C 
GCTG0CAT06 
QOGCTGCTGC 
TTCCACTTTQ 
CATQCTAAAT 
ATCAGGCTCA 
CAAGAGAAGA 
CGCCAGTATC 
AGGTTCCGGC 
06GCGGGCAC 
TT(X3ACTTTA 
GAGGATATCC 
GGGATCrCTT 
GAAGAQATTC 
ACrCAOCTGC 
ACTCTTOTCT 
AAAGQAATCA 
GACTCCAAS6 



11 
I 

TGCCATCCAG 
ACOQOCTGCr 
CCCTTCTCCT 
GCAGGAGCTT 
ACTGGCTGCT 
AGAAGGTACT 
TOTT GQ TTCT 
CCCCCAA06A 
TCAGCAAAGG 
ACATCCTGAA 
GGOGGCATCT 
TQACCCTGGA 
TGAGTGAT7A 
GCTTGCACCA 
AGGCCTGTQA 
TGCGTCAGCA 
TTQATGT6CT 
GAGCGGAAGC 
GGATGCTGTT 
AGGAAGTCAT 
CCT7TACAAC 
CTOKXAATO 
TCTGCTTGGT 
TOTACAACCC 



21 
I 

AATGTCTOGG 
GQVOCrCXTG 
CCTCCl-QCrC 
CTACATCACC 
GGGCCACCTG 
GgACAACATG 
GGTGGACCCT 
T6ACCTCTTC 
TGACAAGTGG 
GCCTTACATG 
GQCAGAGGGC 
CAGTCTTCAG 
TATCTCCX5CT 
CTACCTCGAC 
CATGQTGCAC 
GGGGGCCGA6 
GCTCCTGGGC 
AGACACCTTC 
CAATTTGGCA 
GAAAOGCOGG 
TATGTGCA1T 
CAOOOAOOAC 
CAGCATCTAT 
CTACOQCTTT 



31 

I 

GGGCCCTCCT 
GGGCTGGAGA 
TTCTTCCTGT 
TGCCGCOGGC 
GGCATGTACC 
CACCATGTAC 
GATTACATCA 
TATGGCTTCC 
AGCCGOCACC 
AAGATCTTCA 
TCAGCGGTCT 
AAATGTGTCT 
ATCATTGAAC 
TTCATTTACT 
CACTTCACCA 
GCCTGGCTTA 
AGGGATGAAG 
ATGTTTGAGG 
AA67ATC0GG 
GAGCTGGAGG 
AAGGAGAGCC 
ATCAAGCTCC 
GGAACCCACC 
GACC066ACA 



41 

I 

TGGTGACAGG 
AGAOGGOGTT 
TCOGCCTGCT 
TGOQCTGCTT 
TTCCAAATGA 
TCTTGGTATG 
AACCC C TTT T 
TAAAACCTTG 
GT06CCTGCT 
ACCAGAGCOC 
CCCTTGATAT 
TCAGCTACAA 
TGAGOGCTCT 
ACOOCTOGQC 
CTQAAGTCAT 
AGOCCAAGCA 
ATGGAAAGQA 
GTCAOGACAC 
AATAGCAGGA 
AGCTGGAGTG 
TG060CAGTA 
CAGATGGGCG 
ACAACOCCAC 
ACCCACASCA 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
25B0 
2640 
2700 
2760 
2620- 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3760 
3840 
3900 
3960 
4020 



60 
120 
180 



60 
120 



51 
1 

CAGGATGCTG 
COGCATATAC 
GCTOCGGTTC 
CCOCCAGCCT 
GGCXSGGCCTT 
GATGGGACCT 
GG6AGGCTCA 
GCTAGGGGAT 
GACACCOGCC 
TQACATTATQ 
GTTTGAGCAT 
CAGCAACTGC 
GTCTGTCCGG 
GQATGGQOGG 
CCAOGAAOGG 
GGGGAAGACC 
ACTGTCAOAC 
AACATCCAOT 
GAAATGCQGA 
GGAC9GATCTG 
CCCACCTGTC 
CATCATCCCC 
AGTOTGGCCT 
GCGCTCTCCA 



60 
120 
160 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
700 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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5 



CTGGOCTATO TGCCCTTCTC TGCAQGACCC AGGAATTGCA TCG6ACAGAG CTTOOCCATG ISOO 

GCOGAGTTGC GCGTGGTTGT GGCACTAACA CTGCTACXSTT TOOGCCTGAG CGTGOACCXSA 1560 

ACGCGOVAGQ TGCGGOGQAA GCCX;GAGCTC ATACTGOGCA CX3GAGAA060 GCTCTQ6CTC 1620 
AAaGTGOAGC CGCTGCCTCC GCGGGCCTGA 

Seq ID NOi 167 Protein sequence 
Protein Accession 8t FGQ«BSH predicted 

in ^ }^ 2^ 3^ *1 51 

10 I ! 1 I 1 I 

MPPLPSRMSR GPSLVTGRML PITDRI*LHLL GLBKTAFRIY AVSTLLLPLL PFLPRLLLRP 60 

LRLCRSFYIT CRRI)RCPPQP PRRMHLLGHL GMYliPHBAGL QDBXKVLCim HHVLLVWMGP 120 

VLPLLVLVHP DYIKPLLOAS AAIAPKDDLP YGPtKPHWSD GUU^SKGDRW aRHRRliLTPA 180 

PHPDILKPYM KIFNQSADIM HAKWHHLAEG SAVSLDMFBH ISLMTLDSLQ KCVFSYNSNC 240 

10 QEKMSDYISA IIELSALSVR RQYRLHHYLD PIYYRSADGR RFRQACDMVH HFTTEVIQER 300 

RRALRQQGA8 AWLRARQGXT IDFZDVItLLA fiDEDGXEIiSD EDIRAEADTP MFB6HDTTSS 360 

GISWMI*FNLA KYPBYQBKCR EBXQEVMKGR BLEELSNDDL TQLPFTTHCI KESLRQYPFV 420 

TLVSRQCTED IKLPD6RIIP KOZICLVSXY GTBHNPTVirP DSKVYNPYRF DPDNPQQRSP 480 

LAYVPPSAGP RNCI6QSFAM ABLRVWALT LLRFRLSVUR TRKVRRKPEL lUlTElIGLWL 540 
KVEPLPPRA 



20 
25 



Beq ID NOi 168 DNA gecruence 
Nucleic Acid Accession «: AK0S8068.1 
Coding sequence I 252.. 1772 

1 11 21 31 41 51 

1 I I I I I 

AGGAAACCAA GGCAAGCTCC CXXTOTCAAA GCSVCCirOGC 0C3VTAA6AA0 AAAAGGQGGA 60 

GCCCCAGATQ TGATQAGOOC TTCCAGGCTT CAGGCTCAGA AGGCOCCCCC AGCTCTCCTG 120 

JU TAACTCAGAG GCXy^GTGTQA TGGGAGTTCC TCCACTCAGC ACACTTCCCC TGTAAACACX5 180 

CCTGTGGTGG GCAAAAGGGC TTTGQAACGG TTGCTTOTCT TTTCTCTCCT GOGTAATTTC 240 

CACTTTCATT CATQATAATQ TOOAACAOOC ACAAA6CT0Q GCTGGJAAOGC C9GG0TCACIG 300 

GCTCAACCAA CCGGTGGCGT TTGCCCAAAC AGCCTTTCTC T6GGGACCTG CTCTCACTTT 360 

^ CXXaGATGTG CAAGGCTCTG AGCATAGACT TTGAGGAAGC TTTGAGGAAC CCAGACAGGT 420 

TATGCATTTC ACAAATCCAQ AAGTTTTTCT TTQAGAATrT CAAGAACAAG GACATCCAAA 480 

GTG6GGAAGC AGATOTGATT CT0GAGT6CC TGGGCTTCAA ATGQGAGCTC CATCAGCCCC 540 

GGCTTTTT^ GTCTGAGAOC TTOGCCAAGC TCTACCTGAA AGCGCT06C6 CAG6GCACXA 600 

CACACCCOCT GAGGGAGCTG GAGQAGCTTC TGC3GAGCr<a ATCACCTAAG AAGACX3UU« 660 

AAAAATCCCC TGCAAAGAGG ATCATCATTT CCTTGAAGAT CAATGACCCA CTGGTCACTA 720 

4U AAGTCX3CCTT CGCCACGGCC CTGAAGAACC TCTACATGAQ TGAGGTGGAG ATTAACTTGG 780 

AAGACXTTACT GGGAGTGCTG GCTTCC6CCC ACATCCTCCA GTTCAGTGGC CTGTTTCAAA 840 

GGTG0GT6QA TGTGAT6ATA GGCAGACTGA AGGCAAOCAC CATCAAGAAA TTCTACX3A06 900 

C060CT6CAA GTACAAGGAA GAGCAGCTCA CCACOGGCTG CX3AQAAGT00 CTGQAAATGA 960 

. ACTTGGTTCC TCTAGGGGGG ACOC3USATCC ACCTCCACAA AATCCCACAQ GACCTGCTCC 1020 

ACAAAQTGCT GAAGTCCCCX: AGGTTATTTA CCTTTAGTGA ATTCCATCTT CTGAAAACAA 1080 

TGCTTTTGTG GGTCTTCTTG CAACTGAACT ACAAGATTCA GGCAATTCCG ACTTATCAAA 1140 

COGTGATGAC ATTTTTTAAO AGCTTTCCTG AGAACTGTTG CTTTCTGOAC CGGGACATAO 1200 

GAOGGAGCTT GAGGCOGCTC TTCCTCTGCT TGCGTCTGC3V CQGCATCACC AAAGGCAAGQ 1260 

ATCTGGAGGT GCTGOGGCAC CTTAACTTCT TCCCAGAGTC ATGGCTCGAC CAGGTTACAG 1320 

D\) TCAACCATTA CCACXSCACTG GAGAATGGGG GCOACATGQT CCACXTTGAAA GATCTTAACA 1380 

COCAGGCTGT GAGATTTGGG CTGCTCTTTA ACCAGGAGAA TAC3\ACTTAT TOGAAAAOGA 1440 

TTGCTCTATA TGGATTCTTC TTTAAGATAA AGGGACTGAA ACATGATACT AOCTCTTATA 1500 

GTTTTTACAT GCAGAfiAATA AAOCACAC3M3 ACCTGGAATC TCCCTCT60S OTCTACGAGC 1560 

ACAACCACX3T CAGCCTGOGA GCX3GCA0GCC TGGTGAAGTA TGAQATCAGA GCAGAGGCCC 1620 

DO TGGTTGACGG CAAGTGQCSIG GAGTTCAGGA CAAACCAGAT CAAGCAQAAG TTTQGGTTGA 1660 

CCAOCTCATC CTGCAAAA6C CATACCTTGA AAATCCAAAC TGTGGGCATC CCAATCTATG 1740 
TAAOTTTTOC ATTCATCTTC CCAGCATCTT GACAGTTTCX: AGAAOAATCT ATQGGATTTT 
GGCCCCACT6 GTCTGCATAA AA6AAAATAA AAtGACATAA AAGGGAGC 

60 Seq ID NO! 169 Protein sequence 
Protein Accession «t BAB71658.1 



65 



75 



1800 



1 11 21 31 41 51 

I i I I I t 

MIMSNTHKAR LERKVTGSTN RWRLPKQPPS QDIiLSLSQMC KALSIDPEEA LRNPDRLCIS 60 

QIQKFPPEMP KNKDIQSGBA DVILECLGFK WBLHQPRIiPQ SETIAKLYUC ALAQGTTHPL 120 

RELBELLRAQ SPKKTKEKSP AKRZIISLKI NDPIiVTKVAF ATAUCHLYIIS EVBINLEDLL 180 

GVLASAHILQ FSGLFQRCVD VMIARLKPST IKKPYBAGOC YKEBOLTTGC EKWLEMNLVP 240 

ICGTQIHLHK IPQDLLHKVI* KSPRLPTPSB PHLLKTMLLW VFLQUWRIQ AIPTYBTVMT 300 

/V PPKSPPENCC FLDRDIGRSL RPLPLCLRLH GITKGKDLEV LRHLMPPPBS WLDQVTVNHY 360 

HALBNGGDMV HLKDLNTQAV RF6LLFNQEH TTYSKTIALY GPFPKIKGLK HDTTSYSFYM 420 

QRZKHTDXiBS PSAVYBHMHV SUtAARLVKY EIRAEALVDG KHQBFRTMQI XQXFGLTTSS 480 
CKSHTLKIQT VOIPIYVSFA PIPPAS 



Seq ID NO: 170 DNA Bequence 

Nucleic Acid Accession |}: NM_,007000.l 

Coding sequence! 1...777 " 



OA ^ 11 21 31 4X 51 

ATGGCGTCTG OGGCAGCAGC GGAGGCCGAO AAGGGATCTC CAOTTGTGGT GGGCCTGCTA 60 

GTTGTGGGCA ATATCATTAT TCTGCTOTCA OGCCTGTCCC TOTTTGCTGA OACCATATGG 120 

GTQACAGG06 ACCAG TACOS TGTATACCCA CTGATOGGAO TCTOlGOCAA GQATQAOGTC 180 

TTOQCTGGTG CCTOQATTGC CATCTTCT6C OOCTTCTCCT TCTTCATQGT AGCCAGTTTT 240 

1020 
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OOTGTGGGTG COSCACTCTG CXX3CCGCCG0 TCCATOOTCC TCACGTACCT GOTGCTCATC 300 

CTCATOCTCT ACATCTTCGA OTOOGCCTCC TGCATCAOOT CCTAOMXCA OOCTOACTAC 360 

ATGGTGTCCA ACCCATCCCT GATCACCAAG CAGATQCTGA CCTTCTACRO COOGGACACC 430 

- OACCAGGGCC AGGAGCTGAC COGCCTCTOG GACCGCGTCA TGATTQAGCA AGAATOCTGT 480 

^ GGCACATCTG GTCCCATGGA CTGGGTGAAC TTCAOTrCAG CCTTCCGGGC GGCCACTCCG 540 

GAGGTGGTCJT TCCCCTGGCC CCCACTGTGC TGTOGCOOGA CX3GQAAACTT CATCCCCCTC 600 

AAOQAGGAOG GCTGCCGCCT GG6GCACAT0 GACTACXTTGr TCACX»AGGO CTOCTTOOAA 660 

CAC ATOGGC C ACGCCATOGA CAGCTACAOO TOOQOTATCT OQlGO'mU O OTTTQCCATC 720 
CTGATQTGGA CGCTCCOSGT CATGCTGATA GCCATGTATT TCTACACCAT QCTCIGA 



25 
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Seq ID NDi 171 Protein sequence 
Protein Accession ffi Np_00893i.i 



8eg ID NO: 172 DHA sequence 

Nucleic Acid Accession NM_006760.l 

Coding sequence t 3 9 . . 5 93 



ic I 11 21 31 41 51 

1^ 1 I I I I j 

MASAAAAEAB KGSPVWGLL WONIIILLS GLSLPABTIW VTADQYRVYP LMGVSGKDDV 60 
PAGAWIAIPC GPSFFMVASP GVGAALCRItR SMVLTYbVLM LIVYIPBCAS CITSYTHRDY 120 
MVSNPSLITK QMLTFYSADT DQGQBLTRIJH DHVMIEQBCC OTSaPWWVN PTSAFRAATP 180 

- EWPPWPPLC CRRTGNPIPL KBEQCRLGBH DYLPTKGCPB HIGHAIDSYT WGISWTOPAI 240 

JLV UWTLPVHIiI AMYFyTrai 



1 11 21 31 41 51 

' i I 1 I I 

GAAAGCCTGC CAGCACCTAT TCCACCTCCC AOCCCaCCAT GGCACCCCTO CTOCCCATCC 60 

GGACCTTGCC CTTGATCCTO ATTCTGCTGG CTCTQCTGTC CCCAGGGGCT GCAGACTTCA 120 

JU ACATCTCAAG CXrTCTCTGGT CTOCTGTCCC CGQOGCTAAC GGAOAGCXTG CTCGTTQCCT 180 

TGCCCCCCTO TCACCTCACA GGAQGCAATG (XACACTGAT GGTCCGGAGA GCCAATOACA 240 

OCAAAGTGQT OAOGTCCAOC TTTGTQOXGC CTCOSTCCOO TGGGOQCAGG GAACTOGTCA 300 

OTGTGGTGGA CACSTGGTGCT GGCTTCACAG TCACTCGGCT CAOTGCATAC CAGGTOACAA 360 

- - ACCTCX3TGCC AGGAACCAAA TTCTACATTT CCTACCTAGT OAAQAAQGGG ACAGCCACTO 420 

J J AQTCCA6CAG AOAGATCCCA ATGTCCACAC TCCCTCGAAG GAACATOGAA TCCATTCGGC 480 

TGGGTATGGC COGCACAGGG GGCATGGTGG TCATCACX3QT GCTGCTCTCT OTOGCCATOT 540 

TCCTGCTGGT GCTGGGCTTC ATCATTGCCC TGGCACTCGQ CTOCCGCaAG TAAGGAGGTC 600 

TGCCOGGAGC AGCAGCTTCT CCAGGAAGCC CAGOOCACCA TCCA6CTC3CC CAGCCCACCT 660 

GCTCCCAGGC CCCAGGCCTG TGGCTCCCTT GGTGCCCT03 CCTCCTCXrTC CPGCCCTCCT 720 

*fU CTCCCCTAGA GCCCTCTCCT CCCTCTQTCC CTCTCCTTGC CXXX»GTGCX: TCACCTTCCA 780 

ACACTCCATT ATTCCTCTCA CCCCACTCCT GTCAOAGTrO ACTTTCCTCC CATTTTACCA 640 
CTTTAAACAC CCCCATAACA ATTOCXXXAT CCTTCAGTOA ACTAAOTCCC TATAATAAAG 
GCTGAGGCTQ CATCTGCCAA AAAAAAAAAA AA 

45 Seq ID NO: 173 Protein sequence 
Protein Accession Ui NP 006751.1 



900 



j 11 21 31 41 51 

I i I I I I 

MAPLLPIRTL PLILILLALL SPGAADPMIS SLSGLI^SPAL TBSLLVALPP OaTOGNATL 60 

MVRRANDSKV VTSSFWPPC RGRHavSW DSGAOPTVTR LSAYQVTOLV PGTKPYI6YL 120 

VKKGTATESS REIPMSTLPR SNMESI6LGN ARTGOHVVIT VLL6VAMPLL VMJPIIALAL 180 
GSSK 

55 Seq ID NOt 174 DMA seque nce 

Nucleic Acid Accession #: Bos sequence 
Coding sequence i I.. 273 3 

^ " 21 31 41 SI 

AT6AAAOTTG OAOTQCTGIXS GCTCATTTCT TTCTTCACCT TCACTGAOGG CCACGGTGGC 60 

TTCCTGGOQA AAAATOATOO CATCAAAACA AAAAAAGAAC TCATTGTGAA TAAGAAAAAA 120 

C31TCTAG6CC CAGTC GAAGA ATATC AOCTG CTGCTTCAGO TGACCTATAG AQATTCCAAG 180 

GAGAAAAGAG ATTT6AGAAA TTTTCTGAAG CTCTTGAAGC CTCCATTATT ATGGTCAC3VT 240 

GGGCTA AT TA GAATTATCAO AGCAAAGGCT ACCACAGACT GCAACAGCCT GAATGGAGTC 300 

CTGCAGTGTA CCTGTGAAGA C3W3CTACACC TQOTTTCCTC CCTCATGCXTT TGATCCCCAG 360 

AACTOCTACC TTCACA0C5GC TGQA6CACTC CCAAGCTGTQ AATGTCATCT CAACAACCTC 420 

AGraGAOTO TCAATTTCTG TGAGAGAACA AAGATTTGGG GCACTTTCAA AATTAATQAA 480 

AGGTTTACAA ATGACCTTTT GAATTCATCT TCPGCTATAT ACTCCAAATA TGCAAATCQA 540 

fV ArPGAAATTC AACTTAAAAA AGCATATGAA AQAATTCAAG GTTTTGAOrC GGTTCAGGTC 600 

ACCCAATTTC OAAATGGAAO CATCGTTGCT GOOTATQAAG TTGTTGGCTC CAGCAGTGCA 660 

TGTCAGCCAT TQAACAT QTT GCOGAGAAGO CTAAGACAGC CCTTCACAAG 720 

CTOTTTCCAT TAGAAGAOSG CTCTTTCAQA GTGTTOGGAA AAGCCCAGTG TAATGACATT 780 

arCTTTGGAT TTGGGTCCAA GQATGATGAA TATACCCTGC CCTGCAQCAO TCGCTACAGO 840 

GGAAACATCA CAGCCAAGTO TGAGTCCTCT GGOTGGCAGG TCATCAGGGA OACTTOTOTO 900 

CTCTCTCTGC TTGAAGAACr GAACAAQAAT TTCawSTATGA TTOTAOOCAA TGCCACTGAG 960 

GCAOCTOTOT CATCCTTOGT GCAAAATCTT TCTGTCATCA TTOGGCAAAA CCCATCAACC 1020 

ACAOTGGGGA ATCTGQCTTC GGTGGTGTCO ATTCTGAOCA ATATTTCATC TCTGTCACTC 1080 

GCCAOCCATT TCAGGGTGTC CAATTCAACA ATGGAGGATO TCATCAGTAT AGCTQACAAT 1140 

OV ATCCTTAATT CAOCCTCAGT AACCAACTGQ ACAGTCTTAC TGOGGQAAGA AAAGTATGCX: 1200 

AOCTCAOGGT TA CTAGAG AC ATTAGAAAAC ATCAGCACTC TGOTGCCTCC GACA6CTCTT 1260 

OCTCTGAATT TTTCTGGGAA ATTCATT6AC T0GAAAG6QA TTOCAOTGAA CAAAAQCCAA 1320 

CTCAAAAGGQ GTTAC AGCTA TCAQATTAAA ATGTGTGCCC AAAATACATC TATTCCCATC 1380 

AGAGGCOQTO TGTTAATTGG GTCAGAOCAA TTOCAGAGAT CCCTTCCAQA AACTATTATC 1440 

1021 
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AGCATGGCCT OOTTQACTCT GGGGAACATT CTACCOOTTT CCAAAAATQO AAATGCTCAG 1500 

GTCAATG GAC CTGTQATATC CAOGOTTATT CAAAACTATT CCATAAATOA AGrTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCArr GTGTGTTTTO GGATTTCAGT 1620 

- CATTTGCAGT GGAACGATOC AGGCTOCCAC CTAGTGAATO AAACTCAAQA CATCGTOACG 1680 

J TGCCAATOTA CTCACrrGAC CTCCTTCTCC ATATTCATGT CACCTTTTOT CCCCTCTACA 1740 

ATCTTCCCCG TTGTAAAATG C3ATCACCTAT GTGGC5ACPGG GTATCTCCAT TCGAA«3TCTC 1800 

ATTTTATGCC TGATOITCGA GGCTTTGTTT TG8AAGCAGA TTAAAAAAAG OCAAAOCTCT 1860 

CACACACGTC CTATTTGCAT GGTGAACATA GCOCTOTCCC TCTTGATTGC T0ATX3TCTGG 1920 

TTTM- ^"i-it 3 C5TGCCACAGT OGACACCACO GTGAACCCTT CTCGAGTCTQ CACAGCTGCT 1980 

lU GTGTTCTTTA CACACTTCTT CTACCTCTCT TTGTTCTTCT GGATGCTCAT QCTTQGCATC 2040 

^C T GGCTT ACCGQATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT QATCATGGCT 2100 

GTTGGATTTT OCXrCQGOTTA TGGGTGCCCT CTCATTATAT CTGTCATXAC CATTGCTOTC 2160 

AOQCAACCTA QCAAT A CCTA CAAAA6GAAA GATGTGTOTT OOCTTAACTG OTCCAATGGA 2220 

- AGCAAACCAC TCCTGGCTTT TGTTOTCCCT GCACTGGCTA TTGTG GC rGT GAACTTCOTT 2280 

ID GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGOCOGACTG TTGGGGAAAG ACTGAGTCX3Q 2340 

GATOACAAGG CCACCATCAT CCQ0GT0G6G AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400 

GGGCTCACCT 0GG6CTTTGG AATAGGAACA ATAOTGGACA GCCaOAATCT GGCITGGCAT 2460 

GTTATTTTTG CTTTACTCAA TOCATTCCAG GGATTTTTTA TCTTATOCTT TGGAATACTC 2520 

TPGGACAGTA AOCTGCGACA ACTTCTOTTC AAC3UM3TTGT CTGCCTTAAG TTCTTGGAAG 2580 

ZU CAAACAQAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTAT6CA TTTTCTCATA CrGGAOATTC CTCOSACAAC 2700 
ATCATGCTAA CTCAOTTTOT CTCAAATGAA TAA 

Seq ID NOt 175 Protein sequence 
jij Protein AccesBion #i Bos sequence 

} " 21 31 41 51 

1111(1 

^ MKVOVUWiIS FPTFTDGHGG FLQKNDGIKT KKELIVNKKK HLGPVBBYQL LLQVTYRDSK 60 

D\J EKRDLRNPLK LLKPPLLHSH GLIRIIRAKA TTDOISLNaV LQCTCEDSYT WFPPSCLDPQ 120 

NCmiTAGAL PSCECHLNNI, SQSVMFCERT KIWOTPKINB RPINDLLNSS eAIYSKYANG ISO 

IBIQLKKAYE RZQGFESVQV TQPRHOSIVA GYBWGSSSA SBLLSAZEHV AEKAKTALKK 240 

LFPLBDOSPR VFOKAQCaiDI VPGP6SKDDB YTLPCSSGYR 6M1TAKCES8 6WQVIRBTCV 300 

IiSLLEKLNKN PSMIVGNATB AAVSSFVQNL SVIIRQNPST TVGNLASW3 ILSNTSSLSL 360 

JD ASHPRVSNST MEDVISIADM ILNSASVTNW TVliLREEKYA 8SRLLETLEN ISTLVPFTAL 420 

PLNFSRKFID WKGIPVNKSQ UTOGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 400 

SMASLTLGMl LPVSKNGNAQ YNQPVISTVI QNYSIMEVFI. PPSKIESNLS QPHCVFWDPS 540 

HLQWNDAGCH LVNETQDIVT OQCTHLTSPS ILMSPPVPST IFPWKMITY VGIiOISIGSL 600 

Af\ II'CLIIEALF WKQIKKSQTS HTRRICMVNI AIiSLLIADVW PIVGATVDTT VNPSGVCTAA 660 

**\J VPFTHPPYLS LPFWMLMLGI LIAYRIILVP HHMAQHIJWA VQFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCSfLNWSNG SKPLLAPWP ALAIVAVNPV WLLVLTKLW RPTVGERLSR 780 

DDXATIXRVO KSLLIIiTPLL QLTWOFGIGT IVDSQNLAWH VIFALUIAF^ GPFILCFGIL 840 

LDSKLRQLLP NKLSALSSHK QTBKQNSSDL SAKPKPSKPP NPLQNKGHYA PSHTGDSSDN 900 
IMLTQPVSNE 



45 
50 



Seq ID KO: 176 DNA sequence 
MUcleic Acid Accession At AB035089. 
Coding sequence t 9845.. 10219 



] 11 21 31 41 51 

i I I I I I 

GGGCATOCAG CCATOGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAQAAGG 120 

CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GA-TCAGCAAG GGQAC3VGGGT TAOATTTCCTr 180 

JO TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG QCAGTGTTQG CAGAGTAGGA 240 

AGGAGGAAOG GAGGCAAGAQ ATAATATCAT TTTCTCTGT6 CTCCAACTGT ACTTACATAT 300 

GAGACTATTT OCCTCTCTGC TTTTCAAACC TTACTGGAOT TGTTTTCCCT CATGAAAACC 360 

AAGAAAGQAA AOCTAOTTAQ TCTTOrrCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 

GTAGACAGAA TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTO ATGCTTOAAA 460 

OU AATCTCCTCC ACTAACCAGT TTCCCTATAG ATT6CCACAA GCACATAATA AGAAACAATA 540 

AATAAAAT6T TCTCTTQACT TTGTTACTTA AC3UVTGCTGA GAAAACTTTA CAOCCTTCAT 600 

AAGGAAGTGA GGTCCAGQAA AATCTAGOAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 

GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 

GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAOAGA CATAAGTTCO 780 

OJ CAAGTGTTCA TATGCAAAAA CTTCTTQGAA TTTCTGAGTT CTCTOTGGCA ATATATQACA 840 

TCAGGATATG TCCAGTCTCA CACACCAGQA TATGTCCTTT CTAGCCTGTC TATCACAT6C 900 

TAGGAGAACT ATTTAGQAAC AGAAAAAAAT GOCTGAAATG ATTTCTCATT TGAACTCATC 960 

CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 

TTCAACX:rrC AGGGCAAACC TCXXITGCCTC AGACJGTTTAG OCATAOTCTS AAATTCICTT 1080 

fV CCATAGATTG GTCCCCTGTA ACCXXX3GTTT OTCTCAGCTT OTT A ltXI G T mTriVlTC 1140 

gggC ATTC C CaUSGATGAGC TTGTTGCTTC TGTCCTATGA GACATTA6AT TCCTTTTCTT 1200 

TGOTACOOGA OTAAATCCAT CXTTACTCCAA TAGAGGAAGG TCCATTTTTO TCTTATAGCG 1260 

CTGGATCCAG ACT CAGCTG A GAAQACCATT ATTCATTTTT GGAATTCTTT ATCTCAOATA 1320 

TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCA7CAA0Q GCXXATTAGT 1380 

fO CTATTCCCOQ ACTTCAATCA GGQAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 

ATATATATTG ATACAGGAGA CCTAAGAAQA GCATGTCTTG GGGGTTQAGO AAACAGGCAG 1500 

CnGAGAAATT TOCAQATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG IS 60 

CCTATOTOTT TCTGGCACCT TGTPGTAGAT AAATCTCCCT TX3ACTTTGTG ATGTQCTGAQ 1620 

Sn AAAACAAACT CACGGCTGOT OTTAAAAAGG GCCCATGACA ATACXAAOTO TTGGGGAGAA 1680 

OU TGTGGAGAAA TCAQAACTCT ATTCACGGTC GGTTGQAATO CACACTTOTG CAGAATTCTA 1740 

ySQAG AAGAQ T CTGGCA TTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCttGOGA 1800 

TTTCATTCAT AG6TTTATAC TCAAAAGAAA TOAAGAAATA TGCCATGCAA AAAAATGTAC 1860 

ATQAAAGGTC ACAACATCAT TATTCATAAT AQTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 

ATCAACTTAT GATTAAAOAA AATCTGOTCT ATTCATAGAA TGGAATATTA TTC3GACCACA 1980 
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AAAAGQAATO ATCTACTGAT CCATQCAATG ATCTGGACAA ACCATGAAAA TA ACACTAO A 
TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTCQA 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 
AGAATOAAGT ACAAGATTTC TTTTGQAGOT AGTGAAATTG TTGTGGAATG AQATCATGAT 
5 GATCATAGCA CAACTTTGTO AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGQ 
TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACA6CCCC CCACTCTGGT 
TGTCAQGGAG ATATTOGATT AAATGGCCTT GGACAACAAC CCXTTCTCCCT GGCCACAQAC 
ATTCTTCAGA TTACAAGATA TTCCAGQQGA AACACTGGAA TGAGTCTGAA G CCAGGTQCT 
AAACASAAGQ ACCATTGAGA AATGTTGTGA TCCTGACAGO TCAAGCAATT TATTTTTOGG 
10 C n - CA T T l' i r AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TQTTTCAATT 
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 
AAACATACCT GAGACTGQCA AGAAAAAGAG GTIIAATTOO OCTTAGAGTT CCACGTGATT 
GGGGAGOCCT CA6AATCACA GTAGGAGGCA AAAOTTATTC TTACATGOTO OCTGCAAGAG 
AAGATGAGGA AGAAGCSVAAA GAAGAAACCC CTQATAAACC CATCQGATCT CCTGAGGCTT 
15 ATTAACTATC ATOAQAATAG CACAAQAAAG ACCQGCCCCC ATGATTCAAT TAC CTCTAC C 
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 
OGQAACACAO CCAAACCATA TCACTCA6CA AGOCAGATAA CXTTCrCACT GAQCCTATGC 
AACAOAAAAC CATCTG6GAT GGTTOTAAGO 6GCACAQGAA GTOACTOGTA GGATCACTGC 
CAAAGCTGAG CACTCAGQAO AAGGCAATAG AATCCTATTC TCCATAQTAT GCTATAAGAT 
20 ACTGAAOTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 
TACAGAAAAA TTACTAAGQA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 
GAAATGTAAG CTTTTTAQTT CTTTGGTATT OGAAOTATGC CTAAAAGACA ATGCAAAATC 
CAAOAAAAGA ATCOTGGGGT ' mWn ' l ' Q T TrGGTmOT TTTTGrTTTA CAGCTGGAGT 
AQAATACAAA GGGAIGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCX 
25 CATTQQCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTQ CCGACTTGCA 
TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTAC3UIT0 GTAGAACTGT AATCCTGTOC 
TTTOTCATAA ATTTTCATAT TCATAAAOCT GAGTGTTAGC CCGCrTOTOA AATCTQAAGT 
TGAGTAACTT CAAATACTAA CCACA6AGG0 AAAGGCAOCA A6A00AGAGG CATAAM^A 
GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 
30 GQAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTT6AGAM 
AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCT GATT ATATTTTACT 
TATTCTGCCA GAGCAAAATT AAAATAOCTA TTTCATCTGA TTTGTCCITT ATCTAAATTG 
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GQAACACAGA GGGAGAOTGC CTTGCAGCCA 
C3AQAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TQGTTGGATG TGATCCACAG 
35 AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACC AGCAAT 
GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAQ TCTTTAAAAA OSTAGTTTTG 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATQC ACATTGTAGA AW3JOTGAAG 
06TTAAAAAT AAGAGAAAAA CTTTAAATOT CAAAATCTCA CAACCCAGAT ATATCATTTC 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAOTCATTCT GAC^GAATC 
40 TOAT6CTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATQAATTCAC TCAGTGAAGC 
CAACACCAAG TTCATGTTCG ATCTOTTCCA ACAGTTCAGA AAA TCAAAA G AGAACAACAT 
CTTCTATTCC CCTATC3VOCA TCAOlTCaflC ATTAGGGATO QTC CTCTTAQ GAGCCTAAGA 
CAAC^CTGCA CAACAAATTA GCAAGGTAGC TATCAGC31TC ATTACOrTOT CCTGTTGCAG 
TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTQATGGGT 
45 AGCACAGOGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAOATCTT 
TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTOATOAGT 
CACAGAGAAC ACCACAGAAA AAGCTGCAAC AtATCATOTO AOTCftCAGAG »Crara?II 
CAOCTTTAGA TCCCTGAACA QGTCATAQTT TAAACCTGGA ACTTCACAAA AACTAAGWU^ 
aOQCCaGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 
50 TTCATOGCAC ATAATTATTA TTCCTCATTT CTOCGTTACT AAAAGACAGT ^GCACTCTA 
CXTOIGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCROCTT TGCTCTTCAC 
AAATGATGAA TAAGAGCAGG ACACAACTGC TOQGAOTOOC AGTGACCTCA TCOCAGAAAA 
CTAAGGGTAA GAAAAAATCT QACTCAATAC ATGCAAATAC ATGCAAATOT TTACAACAOT 
QOCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 
55 TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCfl^A 
ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 
OCTTAATAQT GCAAAATGCT TTGCTGGAAO GTAOAAAOTT CT AGATTTAA ACAOGCTTAG 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GOGTrrTTTT CCCCATTCTC 
TGAGCTTTCT TGTGTTCATC T6AATTQAAC TAAAGACTTA GAOTTACCCA TGTAAAGTCC 
60 TTAGCCATGG ACCTGGCATA CACTCXTCTT ACGTGCAGAG AATQACCATC ATGAGGAAAG 
AGCCACAGAT CAGTCAATGT GTCCTACAAQ ATAATAGCAC CAACAGOTAT AACAO GGCTT 
CCTGGCATAA TCTATTTAAA ATATCC3ACC TTCAACATAO TOBTATCCTT GATOACTOTT 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCXOAGAOT TTAACTGGGA AGCTAAACCT 
AAOCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GOSCTGCATC TTTAGTTCAQ 
65 AAGAGAAAAG ATTOCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 
GGTGGATrrr GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 
GTCCrCAATG AGACTACCAG CATTTAGGGA C TOATCTAA C AGACTTAGCA JGOSTTTAGT 
ATTTACATTO ATACAOCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG ™»»?GTC 
AGQAAATOTT CATCAOCAGT TTCAAAAOCT TCTGACTGAA TTCAACAAAT CCACTGATGC 
70 ATATGAGCTO AAGATOGCCA ACAAGCTCTT CGQAGAAAAG ACGTATCAAT TTTTA^GT 
AATTTCACXrr GGCCTACCCa^ CATTTCATTT GCATCCTGAT GT^^ 
AAATGGAAGA AAGCAAGGCA QATGAGCCTG GCCGACCCaO GXGGAGAGCA TTTACTCAOA 
QTGCATTAGC TCXaiTTTCCA CAACTCTCCC CCACTGGAGT CTCCMACC CCAACGATAC 
„^ ATCACTOAAG TGIGGATTTA GGGATAATCT TGTGATAAAA GAOGAOTTTO TGTAATAQAG 
75 TGAGTAAGAG TAATAAOTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 
CGATACATCT TGGTGGGAAA TGTATQACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 
6TGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATCCACT; GGATGAWJGA 
SaAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAG^agCTAT 
OAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAOCCTCTT CTGCTTCTGC 
80 CTTTTOCCAA GAtGATATTG ACCTTCAGTG ACCrCTTTCT TGTGCCAGCC CACATTCCCC 
TTTTGCATTG CXTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 
ATTCAGGOTC TGQATTCAGC TTACTGTTCT TACAAATAAO TAACTTTGGT AWATATAOT 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 
CATCAAGAAA TTTTAOC»GA CCAGTGTGGA AT€TAC10AT TTTGCAAATG CTCCAGAAGA 
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AAGTCGAAAG AAGATTAACT CCTGGGTQQA AAOTCAAACG AATGGTAGGA GAGCCACCCA 7080 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
OAACAGGTQT GGGGATTOAO ATGGGTTTGC AGOOAGGGCT QAAGAC3QGCA CTCCAGATGA 7200 
AQG AJ TTGT C CAAATGAATA TGAAGAGAGC CTA0GG6AGC CAA00AG6AA ATCACAGGAA 7260 
5 GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TOGCCC TGCA T GACAA TAGC 7320 
TTTOrGGATC CCCTOTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 73 BO 
AQACTCAAAT TTTTATGATQ AATATTTAAT AGAAAACATT AGAAAGCGTC TCTO GTCT CC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG 6TAAATCACC CTTTTGTCTC TGAGCTGOAO 7500 
CT6CCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTA TCAC AGC TCTCTTCOCC 7560 
10 ATCAAGGGCA AAGAGCTTGA CAAAOTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCrCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAOQAA QATACCCCXX3 7680 
6AAGTAGTGT CTGACAGCAC AGGACATGOO TTTCATATTA CRGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC COGTAGTTAA AAAAAAATAA 7800 
AAOTAGGAAG AGGAG6GAG0 GAGAAAGOAA AGACSUSITGT TGOAAGAOTA GACAAAATCA 7860 
15 GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACaCAOAA OSTGA ACTC C 7920 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATAOQACT GGAGCCTTGA 7980 
AGAAAGCTGC ATCTGGTGAA CCACTOGTCA GTQAQTCTAA CAATTCAAAG ATCAAAGTCA 8040 
QTOAGTCTCA AGCA6GQATT TGGGTCAATA ATTAAOSATC AGTCAOGAAC AT TTQCAAAQ 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAQACT CTTTTATTCT TTCOCTTGCA 8160 
20 GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATAOGAC ACTGOTTCTT 8220 
GTOAAOGCAA TCTATTrCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAQ TGTAATATGT 8340 
TAATACATGO AATGTTAAAC ATTTCTGATG GAAT6TAACA TGATAAOTAA AAAATAAAAA 8400 
TTGTTCATCT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATAGCT 8460 
25 GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAO AAQAAGGATG 6520 
TGTTGCTATC TCAATAATAT TATCTTTTTT OTCTTGTQTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTQATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAOGTCCT OGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTQTG CTGCTGCCAA ATGAAAT06A TGGTCTGCAG AAOOTAAGAA 8760 
30 CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAOATAC CAAOTTTAAA 6820 
CAAGGTAAAA GCTTATGACC GAOTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACICACC TTCATATTAC AAATATTTGA GCATAGGGCX: TG ACAC AAAC TQAAAGCTTA 8940 
Um ' l ' l ' G'i TT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAQC TTTAG GGTAC 9000 
ATGTCCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGQ TGTGCTOCSVC 9060 
35 CO^TTAACTC ATCATTTAGC GTTAOGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120 
CACCOCACAA GAGTCXTTCAQ AQTOTGATGT TACXTTTCCTG TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATCATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTAO^TGCAT TGGATATCAA TTTGATQATA TTATTATGCA GC3VATTTAAA CTTGACTGGG 9300 
AQAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAOAAAA GGQAATACAA 9360 
40 ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATQTATACA 9420 
QAAAAATATO ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTOG C3VCAAQAAAT 9480 
AGTOATACaT ACAGTTCATT GTGTACAAAA TAATOTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT OTACACATAT ACATATACGT AAAAACATGA TTCTQTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
45 TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 
CC3UAATAAC OAATCTCCAC ATAOTCAATT CATTGTTAAG GTGTATTAGA QATOGAC3U3T 9780 
TAGTCATATC AGTTTCTTTT TTCCATTTOT ATAGCTTOAA 6AGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGQACAAGTT TGCAGAATAT GAGAGAGACA TQTGTCGATT TACACTTACX: 9900 
TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGQACAOG TTOAGAACCA TGGGAAT6GT 9960 
50 GAATATCTTC AATGGGGATG CA6ACCTCTC AGGCAT6ACC TGGAGCCACG GTCTCTCAGT 10020 
ATCTAAA6TC CTACACAAOG CXrTTTGTGGA GGTCACTQftO GAGG6A0TQG AAGCTGCAGC 10080 
TGCCACOGCT GTAGTAGTAO TCQAATTATC ATCTCCTTCA ACTAATQAAQ AGTTCTGTTG 10140 
TAATC3VCCCT TTCCTATTCT TCATAAGGCA AAATAAGAOC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTC3V TCCCXZATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTC ACCTA 10260 
55 GAGGTOTTCT QOTAAACXOA TTGCTG6CAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
CTATCTCATC TTQATGATGA TAGTCATCAT CAACJAATTTA ATGATTAAAA TAGCATGCCT 10380 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATCTACTT TTOCTTCCAG AAAAATTTCC 10440 
CTTCAQGAAA AATQTCCAAG ATAAGATGAA TCATrTAATA COGTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACX^^CC AAATCATACT TTCTCTTCAA 10560 
60 ATTTA6C3UIC CTAGAAACAC ACS^TTTCTTT QAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTT G T G ATTC TATAAAAC3« ATCATCAAXA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTOTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGQAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATQA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAQ GAGGAGCCTT GTAAACCACA TAQGAACTTC 10860 
65 CTATTTTATO CTAAAGGGAT AA6AAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 
OGGGCATAAA ATTATGATAT OCACATCTAG AAAATACATC TCTGGCTAOO CTQATATCAA 10980 
TGGATOCQAG GAAAOAAC3W5 TOTOGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AQAAAGAGCC TGGGGGAATT ATTTAG6AAA T AATAGTTA C 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGOA TGQAAGAATA TCTTCTTTCT 11160 
70 GAGAGAAAAA AA6ACTTTGG GTTTAAATTT GTACTTOATO AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT rTGCCTGOCA GQCaCTTGAA GATATTAGTC TAAATCTCAG AAACAfl AATA 11280 
TGATCTGAAO CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAOTCA TTOGQATAAA 11340 
TATGGTAGTT GTAGCTAAAA QCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAOAAQAAT CTAGAATTQA CCTTGAAGTA TATCAGCATG TQTAAAGATC AGOAATTGAT 11460 
75 CATTTTTATT TTCCAGAAAG TAOCTTTTCT TAOGOTTCCA TATTTACTCC CATAGATTCT 11520 
TOCC 



Seq ID NO* 177 Protein eequence 
Protein Accession #i BAB21525.1 

1 11 21 31 41 SI 

KNSLSBANTK PMFDLPQQPR KSKENMIPY8 PISITSALGM VLLGAKDNTA QQI8KVLHFD 60 
QVTENTTEXA ATyHVDRSO? VBHQFQKUiT BPMKSTDAYB LKXAHKLFGE KTYOFIAEYL 120 
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DAIKKFYCTS VBSTDFANAP EESRKKINSW VESQTNBKIK NLFFDGTIGN DTTL\a<VNAI 180 

YFKOQWEliKP KKENTKBEKP WFNRNTYKSV QMMRQYNSFM FALLEDVQAK VLEIPYKQXD 240 

LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQMMRBTCV DLHLPRFRMB ESYDUOmiR 300 

TMQMVMIFMQ DADLSOMTHS HGLSVSKVUl KAPV6VTBGG VBMAATAW WBLSSPS1N 360 
EEPCCNHFPL FFIRQNKTNS ZLPYGRFSSP 

Seq ID NO: 178 DNA sequence 

Nucleic Acid Accession Ui NM_001910.1 

Coding sequence: 50.. 12 40 

1 11 21 31 41 51 

I I I I i I 

GQAOAGAAGA AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GGACTCACAA T6AAAAC6CT 60 

CXriTCTTTTG CTO C TGg TO TCCTG6AGCT OGOAGAaGCC CAAOOATOCC TTCACAGGQT 120 

GCCCCTCAG6 AGGCATCOST CCCTCAAGAA GAAGCTTGCGG GCAOOGAGCC AGCTCTCTGA 180 

GTTCTGGAAA TCCCATAATT TOaACATOAT CCAGTTCACC OAGTCCTOCT CAATQGACCA 240* 

GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACTTCGGCA CTATCTCCAT 300 

TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTOQACACT GGCTCCTGGA ACCTCT6GGT 360 

C C CCTCTGTG TACTGCACTA GOCCAGCCTO CAAGACGCAC AGCAGQTTC3C AGCCTTCCCA 420 

GTCCA6CACA TACABCCA6C CAGGTCAATC TTT C TCC A TT CAiSTATGGAA COGGGAGCTT 480 

GTCCGGGATC ATTGGAGCCX3 ACXIAAOTCTC TGTGGAAGQA CTAACCGTGO TTGGCCAGeA 540 

GTTTGGAGAA AGTGTCACAG AGCCAQGCCA GACCTTTGTG GATOCAQAGT TTGATGQAAT 600 

TCTGGGCCTG GGATACCCCT CGTTGGCTOT GGGAGGAGTG ACTCCAGTAT TTGACAACAT 660 

GA TGGCTC AG AAGCTGGTGG ACTT6CG6AT mmtWrC TACATGAGCA GTAAOCCAGA 720 

A6GT66TG00 G0QAG06AGC TQATTTTTGG AGGCTAOGAC CACTCCCATT TCTCTGGGAO 780 

CCTGAATTGG GTCCCAGTCA CCAAGCAAGC TTACTGOCAO ATTGCACTGG ATAACATCCA 840 

GOTGGGAGGC ACTGTTATGT TCTGCTCCX3A GOGCTGCCAO GCCATTGTGO ACACAGGGAC 900 

TTOCCT^TC ACTGGCCCTT CCGAC3UV6AT TAA6CAGCT0 CA AAAOQ CCA TTGGGGC3W3C 960 

CCCOGTGGAT GQAGAATATG CTGTGGASTG TGCCAACCTT AAOGTCATGC CXjQATGTCAC 1020 

CTTCACCATT AAOGGAGTCC CCTATACCCT OUSCCCAACT GCCTACACCC TACTGQACTT 1060 

aSTGGATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 1140 

TGGGCCCCTC TGGATCCTGG GGQATGTCTT CATTCGACAG TTTTACTCAG TCTTTGACOG 1200 

TGGGAATAAC CGTGTGGGAC TGGCCGCAGC AGTCCCXn'AA GGAGG6GCCT TGTGTCTGTG 1260 

CCT GCCTGTC TGACAOACCT TOA ATATOTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 1320 

GTTATTTTCC A6AGAATGTA 6CTGTTTCCA GGGTT6CAAC TTGAATTAAO ACCAAACA6A 1380 

ACATQAGAAT ACACACACAC ACACACATAT ACACACACAC ACACTTCACA CATACACACC 1440 

ACTCCCACCA CCGTCATOAT GGAGGAATTA OQTTATACAT TCATATTTTG TATTGATTTT 1500 

T6ATTATGAA AATCAAAAAT TTTCACATTT GATTAT6AAA ATCTGCAAAC ATATGCACAA 1560 

QCAGAOATGA TGG TATAATA AATOCCTTTG CAACTCCACT CAG^CZaAC AACCCATOCA 1620 

CACACGGCCA GGCXTOITFA TCTACACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 1680 

OTACCTGGAT CATTCrOAAO CAAATTCOGA GCATTACATC ATTTTQTCCA TAAATATTTC 1740 

TAACATCCTT AAATATAC3VA TCGGAATTCA AGCATCTCCC ATTGTCCCAC AAATQTTTGG 1800 

CrOTTTTTGT AGTTGGATTG TTTGTATTA6 GATTCAAGCA AGGCCCATAT ATTGCATTTA 1860 

TTT6AAATGT CTGTAAGTCT CTTTCCATCT ACAiQAGTTTA GCACATTTQA ACGTTGCT6G 1920 

TTGAAATCCC GAGGTGTCAT TTGACATGGT TCTCTGAACT TATCTTTCCT ATAAAATGOT 1980 

AGTTAGATCT GQAGGTCTGA TTTTGTGGCA AAAATACTTC CTAGGTQGTG CTGGGTACTT 2040 

CTTGTTGCAT CCTGTCAGGA GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTTAAQA 2100 
CTOCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC A6CATTTT 

Seq ID NO: 179 Protein sequence 
Protein Accession #x KP_001901.1 

1 11 21 31 41 51 

I I I I I I 

MKTZiLLLLLV LLELGEAQGS LHRVPLRRHP SLKKKIiRARS QLSEFHRSHK liDHIQFTBSC 60 

SMDQSAXBPL ZNYLDMEYFG TISIGSPPQN FTVIFDTGSS KLWVPSVYCT 8FACKTHSRF 120 

QPSQSSTYSQ PGQSF8IQYG TGSLSGIIGA DQVSVB6LTV VGQQFGESVT EPGQTFVDAE 180 

FDGILGLGYP SLAV6GVTPV FDNMMAQNLV DLPMFSVYMS SNPEGGAOSE IiIFQGYDHSH 240 

FSGSUmVPV TKQAYWQIAL DNIQVGGTVM FCSEGOQAIV DTGT8LITGP SDKIKQLQNA 300 

IGAAPVDOBY AVECANLNVM PDVTFTIN6V PYTIiSPTAYT LLDPVDGMQP CSS6FQGLDI 360 
HPPAGPIAfIL GDVFIRQFYS VPSaONlIRVG LAPAVP 

Seq ID NO: 180 DNA sequence 

Nucleic Acid Accession fix NM_0180S8.1 

Coding sequence i 3 1 9 157 5 

1 11 21 31 41 51 

I t t I t I 

TACGCGCTGC GGGAOOGGCA GGG6AA0QCC ATCGGGGTCA CAGCCTGCtSA CATCGAOGOG 60 

GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTOSGG GGTGGCCAOQ 120 

TACACCGACA AGTTGTTCAA GTTC06CAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 180 

GTCAACGTGG CCOGTGGTGT GGCCAGCCTC WX S C CGG AC GCTCTOTGGC CTOTGTGGAC 240 

AGAAAGGGCT CTGGAOGCTA CTCTATCTAC ATTGCCAATT AOGCCTACGG TAATGTGGGC 300 

CCTGATGCCC TCATTGAAAT GGACCCTGAG GCCAGTGACC TCTCCCGGGG CATTCTGGCG 360 

CTCAGAGATG TGGCTGCTGA GGCTGGGGTC A6CAAATATA CAGGQGGCCQ AGGCGTCAGC 420 

GTGGGCCCCA TCCTCACCAG CAOTGCCTCG GATATCTTCT GCGACAATQA QAATGGGCCT 480 

AACTTCCTTT TCCACAACOG GGGCGATGGC ACCTTTGTGG ACGCTQOOGC CAGTGCTGGT 540 

GTGGACQACC CCCACCAGCA TGGGCQAGGT GT05CCCTGG CTGACTTCAA COGTGATGGC 600 

AAAGTGGACA TCGTCTATGG CAACTQGAAT GGCOCCCACC GCCTCTATCT OCAAATQAOC 660 

ACCCATGQGA AGGTCOSCTT COGGGACATC GCCTCACCCA AGTTCTCCAT GCCCTCCCCT 720 

GTC06CACGG TCATCACCGC CGACTTTGAC AATGACCAGQ AGCTGGAGAT CTTCTTCAAC 780 

AACATTOCCT ACCGCAGCTC CTCAGCCAAC OGCCTCTTCC GOGTCATCOG TAGAGAGCAC 840 

GGAGACCCCC TCATOQAGGA GCTCAATCCC GGCQAGOCCT TGGA6CCTGA GGGOCGGGGC 900 

ACAGGGGGTG TGGTGAC06A CTTOQAGGGA GA0GGGA16C TGQACCTCAT CTT6TCCCAT 960 
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GGAGAGTCCA TGGCTCAGCC GCTGTCCX3TC TTCCGGG6CA ATCACGGCTT CAACAACAAC 1020 

TGGCTGCGAQ TGGTGCCACG CACCCGGGTT GGGOCCTTTG CX»GGGQAGC TAAGGTCGTG 1080 

CTCTACACCA AGAAGAGTGG G6CXXACCTG AGGATCATCG AOGGGGQCTC AG6CTACCT0 1140 

TGTGAGATGG AGCCOGTGGC ACACTTTGOC CTGGGGAAGQ ATQAAGOCAG CAGTGTGGAG 1200 

GTCaVGCSTGGC CAOATGGCAA QATGGTGAGC CGGAACGTGG CCAGOGGGGA 6ATGAACTCA 1260 

GTGCTGQAGA TCCTCTACCC COXJGATGAG GACACACTTC AOGACCCAGC CCCACTGGAG 1320 

ACACCAATQA ATGCATCCAG TTCCCATTCQ TGTGCCCTCG AQACAAGCCC GTATGTGTCA 1380 

ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG CAGTOGGGGC TAOGAGCCCA 1440 

ACGAGGATGG CACAGCCTGC GTGGGOACTC TCGGCCAGTC ACCGGGCCCC C6CCCCACCA 1500 

CCCCCACOGC TQCTGCTGCC ACTGCOGCTG CTGCTGCCGC TGCTGGAGCT GCCACT GC T G 1560 

CACaSGTCCT OGTAGATGGA GATCTCAATC TQGGGTC6GT GGTTAAGGAG AGCTGGGAGC 1620 

CCAGCTOCTO A6CA6G6GTG GGACATGAAC CAGG6GAT6G AGTCCAGCAG GGGAGTGGQA 1680 

AAQTGQGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 

CXTAAQCCCA TCCATGCACA TTACTTAGCT AACAATTAGQ 6AGACTCGTA AGGCCAGGCC 1800 

CTGTGCTGGG CACATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTQA TGQCQCTTAC 1860 

ATTCCA6TG6 GTCTAATQAC CATATCTTAO GACACAGATG TGCCCAGGGA GGTGGTGTCA 1920 

CTOCACAGGA AGTATGAGGA CTTTAGTGTC CTOAGTTCAA ATCCTGATTC AGGAACTCAC 1980 

AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CGCATCTGCA 204 0 

AAATGGGGAT TAAOAATAGA ATCTTGGGGT TAGTGTGGAQ ATTAGATTAA ATGTATGTAA 2100 

GACACTT6GC ACAAAACCTG GCACATA6TA AAGGCTCAAT AAAAACAAGT 6CCTCTCACT 2160 
GGGCTTTOTC AACACGTG 

Seq ID KOt 181 Protein sequence 
Protein Accession 9: 1IP_060528.1 

1 11 21 31 41 51 

I I I I I I 

MDPBASDLSR GILALRDVAA EAOVSKYrGG RGVSV6PILS &SASDIPCD17 ENGFNPLFHN 60 

RGDGTFVDAA ASAGVDDPHQ HGRGVALADP HRDGKVDIVY GNWNGPHRLY LQMSTHGKVR 120 

FRDIASPKPS HPSPVRTVIT ADFDNDQBLE IFFNNIAYRS SSANRLPRVZ RREHGDPLIB 180 

EXiMPGDALEP EGRGTGGWT DFDGDGKLDL ZLSHGBSMAQ PLSVFRGNQG FNNNHXiRWP 240 

RTRVOhPARG AKVVLYTXKS OAHLRIXDGG SGYLCEMEPV AHFOLGKDEA 88VBVTHPDG 300 

KMVSRNVASO EMNSVLEIXiY PRDEDTLQDP APIjETPMNAS SSBSCAIjETS PYVSTPMEAT 360 
GAGPTRSAVO ATSPTRMAQP AHGLSASHRA PAPPPPPIiUi PLPLLXiPUiE LPLUIRSS 

Seq ID NO; 182 DMA sequence 
Nucleic Acid Accession ff: AJ279016 
Coding sequence : 1 . . 1962 

1 11 21 31 41 51 

I I I I I I 

ATQTCCAGQA TGTTACOnT CCl'GC l ' G CTQ CTCTQGTTTC TOCCCATCAC TQAGGGGTCC 60 

CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TOACTATGAC 120 

AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATQTGGACCA TGATGGGGAC 180 

TTTGAGATOG TCGTGGCX3GG GTACAATGGA CCCAACCTGG TTCTOAAGTA TGACOGGGCC 240 

CA0AA60G6C TGGTGAACAT OQOSOTCGAT 6AG0GCAOCT CAOCCTACTA 0606CTQ0GQ 300 

GAOOGGCASa GGAACGCCAT CGGGGTCACA 6CCTG0QACA TOGAOOGGGA OGGCOGGGAG 360 

GAGATCTACT TCCTCAACAC CAATAATGCC TTCTOGGGGG TGGCCAOGTA CACCQACAAG 420 

TTGTTCAAGT TCCX5CAATAA CCXOTGGGAA QACATCCTGA GCX3ATGAGGT CAACX3TG6CC 480 

CGTGGTGTGG CCAGCCTCTT T6CGGGA06C TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 540 

GGACXSCTACr CTATCTACAT TOOCAATTAC GOCTACGGTA ATGTG6GCCC TGAT6CCCTC 600 

ATT6AAATGG ACCCTGAGGC CA6TGA0CTC TC0066GGCA TTCTG6CGCT CnaAGATGTG 660 

GCTGCTGAGG CTGGGGTCAO CAAATATACA GGGGGCCX5AQ GCX3TCRGCGT GGGCOCCATC 720 

CTCAGCAGCA GTGCCTCX3GA TATCTTCT6C GACAATQAGA ATGGGCCTAA CTTCCTTTTC 780 

CACAACX36GG G06A1X3GCAC CTTTGTGGAC GCTGGGGCCA GTGCTGGTGT GGA06ACCCC 840 

CAOCAOCATO OOOGAGGTGT 0G0CCIG6CT GACTTCAAGC GTGATGGCAA AGTOGACATC 900 

GTCTATGGCA ACTGGAATGO COOCCACCGC CTCTATCTGC AAATOAOCAC CCATGGGAAG 960 

GTCOGCTTCC GGGACATCGC CTCACCX3Wtf3 TTCTCCATGC CCTCCCCTQT COGCACX56TC 1020 

ATCACOQCCG ACTTTGACAA TQACCAGGAG CTGGAGATCT TCTTCAACAA CATTQCCTAC 1080 

GQCAGCTCCT CAGCCAACGQ CCTGTTCGGC QTCATCOGTA GAGAGCAOGG AGACCCCGTC 1140 

ATGQAGGAGC TCAATCCG6G 06A060CTTO GAGCCTQAGO GGCGGGGCAC AGGGGGTGTG 1300 

GTGACOGACT T06AOGGAGA GOGGATGCTG GACCTCATCT TGTCCCATGO AGAGTGCATG 1260 

GCTCAGC03C TGa'CCUTCIT COGGGGOUVT CA6GGCTTCA ACAACAACIG GCTGOGAGTG 1320 

GTGCCACX3CA CCCGGTTTGQ GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 1380 

AAGAGTGGGG CCCAGCTGAG GATCATOGAC GGGGGCTCA6 GCTACCTGTG TGAGATGGAG 1440 

CC08TGGCAC ACTTTSOOCT GGGQAAGGAT GAAGCOIGCA 0T8TGGAGGT GAOGTGGCCA ISOO 

GATGGCAAGA T66TGAGGCX3 GAA0GT6GCC AGOGGGGAGA T6AACTCAQT GCTGGAGATC 1560 

CTCTACCCCC GGGATGAGGA CAC3VCTTCAG GACCCAGCCC CACTGGAOTO TGGCCAAGGA 1620 

TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACXAATO AATGCATCCA GTTCCCATTC 1680 

GT6TGCCCTC GAQACAAGCC OOTATGTGTC AACACXTATO GAAGCTACAG GTQCCX36ACX: 1740 

AACAAGAAGT GCAGTOGGGG CTA06AGCCC AAGGAGQATQ OCACAGOCTG GOTOGGGACT 1800 

CTOGGCCAGT CACOGGGCCC CCGOCCCACC ACCCCCACOG CTGCTGCTGC CACTGCOGCT 1860 

GCTGCTGCOG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TOTrAGATGO AGATCTCAAT 1920 

CTGGGGTOGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAOCAGGGC?r GGGACATGAA 1980 

CCAGOGGATG GAGTCCAGCA GGG6AGTGGG AAAGTGGGCT TGTGCTGCTO CCTAGACAGT 2040 

AGGGATGTAA AGGCCTGGGA GCTA6ACCCT GGCCAAGCCC ATCCATGGAC ATTACTTA6C 3100 

TAACAATTAO GGAQACTCGT AAGGCCAGGC OCTGT G CTGG GCACATAGCT GT6ATCACAG 3160 

CAGACAGGOT CGCTGCCCTG ATGGCJGCTTA CATTCCAGTO GGTCTAATQA C CATA TCTTA 2220 

GQACACAGAT GTGCCCAOGQ AGGTGGTOTC ACTGCACAGO AAGTATQAGG ACTTTAGTGT 2280 

CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAA6CTATG TQACCITACA CCAGTCACTT 2340 

AACTTQTTAO OCATCCATTA TG6CATCTGC AAAATGQGGA TTAAGAATA6 AATCTTGGGG 2400 

TTAGXGTGGA GATTAGATTA AAT6TATGTA AGACACTTGO CACAAAACCT GGCACATAGT 2460 
AAAGQCTCAA TAAAAACAA6 TQOCTCTCAC TGGGCTTTGT CAACAGG 
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8eq ID KOt 183 Protein eeouence 
Protein Accession S i CAC08451 



5 

10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



MSRMLPFLLL 
FEIWAGYKQ 
BIYFLtmiNA 
GRYSIYIAMY 
LSSSASDIFC 
VYGKHNGPHR 
RSSSANRLFR 
AQPLSVFRGN 
PVAHFGL6KD 



liQQSFGPRPT 



11 
1 

LWFLPITBGS 
PNLVLXYDRA 
FSGVATYTOX 
AYGNVGPDAL 
DNENGPHFLP 
LYLQM8THQK 
VIRREHGDPL 
QGFMNMWLRV 
BASSVEVTHP 
mMECIQFPF 
TPTAAAATAA 



31 
I 

QRABPMPTAV 
QKRLVNIAVD 
LFKFRNNRWB 
IEMDP8AS0L 
HNR6DGTFVD 
VRFROIASPK 
IEEU7FGDAL 
VPRTRFGAFA 
DGKMVSmiVA 
VCPRDKPVCV 
AAAAAGAATA 



31 
I 

TOSVLPPDYD 
ERSSPYYAIiR 
DZLSDBVKVA 
SR0ILAUU3V 
AAASAOVDDP 
PSMPSPVRTV 
BPE6RGTG6V 
R6AKWLYTK 
86BMKSVLBX 
NTYGSYRCRT 
AFVI.VXX3DLN 



41 

I 

SNPTQZ^GV 
DRQGHAIGVT 
RGVASLFAGR 
AAEAGVSKYT 
RQRQRGVALA 
ITADFD17DQB 
VTDFDGDGML 
KSGAHLRIXD 
LYPRDBDTXjQ 
NKKC8R6YEP 
LGSWKBSCB 



SI 
I 

AVTDVDHDGD 
ACDID6DGRB 
SVACVDRKGS 
GGRGVSVQPI 
DPMRDOKVDI 
LGIFFNNIAY 
DLILSHOBSM 
GGSGYLCEME 
DPAPIiBOGQO 
NEDQTACVGT 
PSC 



60 
130 
180 
240 
300 
360 
420 
480 
540 
600 



8eq ID NOt 184 DNA aeguence 
Nucleic Acid Accession ftt F6ENESHH 
Coding sequence: 1..4794 



1 11 21 31 41 51 

I I I I I I 

ATQQOGTGTC 0G6QAG6ACT CCCAGCCGQT TOCTCTGOTT GQATGOGACT GGGTGGGCCC 60 

AGCGGCTCCT CCCCAGCATC CXXTTCCCCAT TCCTCCTCCA G6TACAATG0 ACCCRACCTG 120 

GTTCTGAAGT ATGACCX3GGC CCAGAAOOGQ CTGQTQAACA •TOGOGGTCOA TOAGCGCAGC ISO 

TCACCCTACT AOGCXSCTGOO GGACCGGCAG GGGAACGCCA TOSGGGTCAC AGCCTGOGAC 240 

ATGGACGGGG AOGGCOGGGA GGAGATCTAC TTCCTCAACA 0CAATAAT6C CTTCTOGOGC 300 

CACAGCAGCT CAG03CAGGT CCCTTCTG G G CTCCACAOAA ACA660CI6T GCTOAAGCCT 360 

CCACCTACAA CXXCTGCAOG CCTC C TG G GT CTGCXTTCCAC TCAGOSSAAO GGACTTTTCC 420 

TCCTCCCTQG QTCAGGCTTC TCOGGACAGC AGGCAGGGAG AGAGGGTGCC GGTTCCCTGC 480 

TGTCGGGGTG GACTGAGACC TACCCATGAA CCAGAACCAT TTCTTCTGAG ACCCAAATCA 540 

GGGGTGGCCA OGTACACCOA CAAGTTGTTC AAGTTCOGCA ATAACQSGTO GGAAGACATC 600 

CTGA6C6AT6 AGGTCAACX3T GGCCCGTGGT GTGGCCAGCC TCTTTGCXBG ACXSCTCTOTG 660 

GCCTGTGTGG ACAQAAAGGG CTCTGGACGC TACTCTATCT ACATTGOCAA TTAC3G0CTAC 720 

GQTAATGTGQ GCCCTGATGC CCTCATTGAA ATGGACCCTG AGGCCAGTGA CCTCTCCCGG 7B0 

GGCATTCTGO CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAOAAGGC 840 

TTCTCCCACA CTGCCTCTCC AAQCATTGGT GAGATATCTQ 6CAGAAC0GA 6GAGCGGGAA 900 

GGAOGAGACC CAGAGGAGGC AGATGAGGAG CACAGTGGGG ATGGAAGCAC CAOOCAACTO 960 

TGCOGGCTQG GCTGGAAGQA OQGGCAGTTC AAGQAAGAAa CAGCAGCTTT GGTG6AGGAA 1020 

CA6A0G6AGG CTGGGGCAGC TGGCGTGCCC AGAGGACXnt3 TTCGAACAGC TCTGCAGACT 1080 

TCCAAAA6CC ATTTGGCTGA CAAGAACCTA TTTGGCCXAC CATGTTACTA TTCTOTCTGC 1140 

GCX3CCTTCTC CAGCCCACCC TTTCCCTGCC OGCCAAGOCC CXXIAACACTA CCCTGTAGCC 1200 

CCCCTTGTCA CTCAGCTAAT QACACATGGA CGTCTGGCTG GAAAACXAGC COGGAQTGTC 1260 

CCXrCACCCCC GAGCCCCAGO AATGGACCCC AAATQTAAGG GCCGCCATGC TOAGCCXXSGC 1320 

CTGATGGCTG A06CTTTGG6 OGOGTGGCCA GCGCTCAGCA CCACTGTGGT GCCAGGGGGC 1380 

CT6AGAAGCT GGGAGGAAAG CAGGCAGAAG GGGCAGGCCA TOTCCAQATG TGCACTCAGG 1440 

GAGCTGGGAQ GTCCCTGGAQ CXaUUSCCACA CAOCACCTGC CTGCTAQAQA GCTGTATGAC 1500 

CTGGGAGAAC CTCCCATTTT ACAAAGAACA GACGGAGATC GAGGGAQGAQ AAGGGACTCG 1560 

OCCAAGGTCA CACAGOAOTQ CCATCTAGTG GCCACCATGC CA6CTCT0GG GGGACTOGAG 1620 

GGOCCOQGOA G6GT6GGCAA GGGAGAGATT GG6AGAGAGA CTGGGGGAGT AGGAAQACCA 1680 

CTCTCOCATC CCCTGGTCCC CAACTTCCCC AGCTGCTTGA GGCCTCTTGA AGCCGGGACA 1740 

GT6C066GAG CTGCCCTGCC TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCAAGGCC 1800 

CTG6CX3TGGA ACCAGATQGA AAAAGAGGAG GGGAAGATTC ATGGAGACCA TQAGCCCAGA 1860 

TTTAGGCTCa 6GAAAGCA0G GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 1920 

CTGC3UnnrGC GCTGAGGCCT CAGAGGCAGC CCTQTCCTCC AGGTGGQCCT GGGGCTTGCT 1980 

TCTGCCACTC ACTGTSGOTC OATOTCTTTT CTAGOGGGCC GAGGCGTCAG OGTGGGCCCC 2040 

ATCCTCAGCA GCAGTGCCTC GGATATCTTC TGC3QACAATQ AQAATGGGCC TAACTTCCTT 2100 

TTCCACAACC GGGGCGATGG CACCTTTpTG GACGCTGOOO CCAGTGCTGA ACGTCX3TTTA 2160 

GCCTTCATOG TTCAOCTCAA ATATCACCTC TGCAGA6ATT TTCCTCaiCTC CCT0TQCX3VC 2220 

CTASCAOAAA CTOOTO C TTC CTOCTOCTGC TCCCCGTGGC ATGCACSTCT TCTTCAGGCT 2280 

COICATTGCC ATCATOOTTT GTCTAT6A6C TTTACAAGGA CC3GGGTCACG GTTCTATTCA 2340 

TTCTTGAOOC AAGGCrTGGC CTCCAGTGCC CACCGQAQOA CACTCAGCCT CCAGGGTTCT 2400 

CAGGGGGCCC CACCCTGCCT TCTGGCAAGA GCTCCCTGTG TCCTrGGGGTC TCTGATCCCC 2460 

ACTGCCTATT ACATTGTCCT GTGGTCTGCC ATCCXyVGAGA GCCTGATGAC CCACAGCTAT 2520 

TTGTCCr CT G AAAGAGTC3UI OgiXSGOTGTQ GAOGACXXXX ACX:A6CATGG GOGAGGTGTC 2580 

GOOCTGGCTG ACTTCAACOG TGATGGCAAA GTGGACATCO TCTATGGCAA CTGOAATGGC 2640 

CC0CACC6CC TCTATCTGCA AATGAOCACC CATGGQAAGG TCCGCTTCOQ GGACATCGCC 2700 

TCACCCAAGT TCTCCATGCC CTCCCCTGTC OGCACGGTCA TCACOGCOGA CTTTQACAAT 2760 

GAGCAGGAGC TGGAGATCTT CTTCAACAAC ATTOOCTACC OGAGCTCCTC AGCCAACOGC 2820 

CTCTTG08AT GCT C C A TCC T 6GCTCX3TG6C TCTTCATCCT TGACAGCTGG TGGGAGGAAC 2880 

GQTCAGGGAa AAGQTTTAAO AATCAGAAGG GGAGGGTTOC CAOGGCCAGG GGGTCAGGCC 2940 

AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 3000 

AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC OGGCCTCTGC TATTGCAGGG 3060 

AAAGGGAAGQ QAAATGTGGC CCAAAGTGTG CCCAOAACCC AAGCX3CCACA AGATACAAAG 3120 

CCACACTAGC ACAAAAAGG8 GCTACAG6GT CCAATCACTA OCAGGAAAAG GGGCTACGGG 3180 

OTCCAATCAC TACCAGOAAA AGGGGCTACXI GGQTCCAATC ACTACCAGQA AAAGGGGCTA 3240 

CQGGGTCCAA TCACTACCAG GAAAAGG6GC TAOGGGGTCC AATCACTACX: AGGAAAAGGG 3300 

QCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 3360 

A6GGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CA6GGTCCAA TCACTACCAG 3420 

AGAAAG66GC TAOGGGCTCC AATCACTACC AGGAAAAGGG GCTAGOGGGT CCAATCACTA 3480 

OCAOGAAAAO GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTAGG GGGTCCAATC 3540 

ACTACCAGQA AAAGGGGCTA CGG6CTCCAA TCACTAOCAG GAAAAGGG6C TAGGGOGTCC 3600 

AATCACTACC AGGAAAAGGG GCTACAGQGrr CCAATCACTA CCAGGAAAAG GGGCTACAGG 3660 
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GTGCAATCAC 
OGGGOTCCAA 
GCTATGGGGT 
AGGGGCTATG 
AGAGAOCAOQ 
GGCCGGGGCA 
TTGTCCCATO 
AACAACAACT 
AAGGTCGTGC 
GGCTACCTGT 
AGTQTGGAGQ 
ATGAACTCAO 
CCACTGGAGT 
GAATGCATCC 
GGAAGCTACA 
GQCAGAGGCT 
CCCAAAAAGO 
CCGOaTTOCC 
CAGAAAQCTC 



TACCACAGAA 
TCACTACCAG 
CCAATCACTA 
GGGTCCAATC 
GAQACCCCCT 
CAGGGGGTGT 
OAGAGTCCAT 
GGCTGCGAGT 
TCTACACCAA 
GTGA6ATGGA 
TGAG6TGGCC 
TOCTGGAGAT 
OTGGCCAAGG 
AGTTCCCATT 
GGTGCGGGAC 
6GG1GGGTAC 
AGCT6CAACT 
GGCTGCTCCT 
CAGQTATTCC 



A06GGCTACG 
GAAAA6GGGC 
CCAGGAAAAG 
ACTACCACAO 
CATC3GAGGAG 
G6TGAC06AC 
GGCTCAGC06 
GGTGCCAOGC 
OAAGAGTGGO 
GCCOGTGGCA 
AGAT6GCAAG 
CCTCTACCCC 
ATTCTCCCAG 
OOTGTQCCCT 
CAACAAGAAG 
TGAOCTAGGC 
TTCCCAAG6C 
CAAAAGAGCT 
AQAAOCOCAA 



GGCTCCAATC 
TACGGGCTCC 
GGGCTAGGQG 
AAAG6GGCTA 
CTCAATCCOG 
TTOGAOGOAO 



ACCOGGTTTG 
GCCCACCTGA 
CACTTTGGCC 
ATOOTQAGCC 
06GGATGAGG 
CAGGAAAATG 
CX3AGACAAGC 
TGCAGTGGGO 
TCTAGGCATA 
ATCTGCACCC 
CAOCTCCAGQ 
GT6TATGAAC 



ACTACCAGGA 
AATCACTACC 
CTGCAATCAC 
OOGGGTCCAA 
GOGAOGCCTT 
ACGGQATGCT 
TCOQGGGCAA 
GQGCCTTTQC 
GGATCATCGA 
TGGGGAAGGA 
GGAA0GTG6C 
ACACACTTCA 
GCCATTOCAT 
CCQTATGTGT 
GCTACGAGCC 
CAATGACX3T0 
COSTCTGGTC 
CTGCTCCCAO 
AAGATCAGGA 



AAAGG6GCTA 
AG6AAAA6AG 
TACCAGGAAA 
CGTCATOCGT 
GGAGGCT6A0 
GGACCTCATC 
TCAGGGCTTC 
CAGGGQAGCr 
CGGGGGCTCA 
TGAA6CCAQC 
CAGCGGGGAG 
GGACCCAGCC 
GGACACCAAT 
CAACACCTAT 
CAACGAGGAT 
GAAACCAAGG 
CTTTTTCCTG 
CACCCTTCTC 
ATAA 



3720 
3780 
3B40 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4S00 
4S60 
4 620 
4680 
4740 



Seq ZD NOi 185 Protein aequence 
Protein Accession ffs 



1 11 

I I 
MACPQQbPAR CSGHMQUGP 
SFYYAUtDRQ GHAIGVTAO) 
PPTTPAGLLG LPPLSGRDFS 
GVATYTDKLP KPRNNRWEDI 
GNVGPDALIB KDPBASDLSR 



SRSHLAinCNL 
PHPRAPGMDP 
ELGGPWSQAT 
6PGRVAKREI 
LAMMQMBXEB 
SATHOGSKSF 
APIVHLKYHL 
PLTQGLASSA 
I«SSBRVNVGV 
8PKF6MP8PV 
GQGEOItRZRR 
KGKGNVAQSV 
RQPITTRKRG 
RK0LRAPIT7 



PGPPCYYSVC 

KCKGRHAEPG 
QHLPARELYD 
QRETGAVGRP 



AMOSNHYQEK 
GRGTGGWTD 
KWLVTKKSG 
MK8VLBZt.YP 
GSYRCRTNKK 
PGCRLLLKRA 



I166R6VSVOP 
CRDFPHSLCH 
HRRTLSLQGS 
DDPHQHGRGV 
RTVITADFDN 
GGFP GP OOQA 
PRTQAPQDTK 
YGVQSLPGKG 
RKROYGVOSL 
PITTHXROyR 
QLRAPZTTRK 
PDQDGMLDLI 
AHLRIIDGGS 
RDEDTLQDPA 



QLQAAPSTLL 



IDGDOREEiy 
SSLGQASPDS 
L8DEVNVARG 
GIIiALHDVAA 
CRLQWXDGQF 
APSPAHPFPA 
LMAEAL6AWP 
LGBPPILQRT 
LSHPLVPNFP 
PRIAXARBAE 
IL8SSASDZF 
LABTGP8SSC 
QGAPPCLLAR 
ALADFNRDGK 
DQBLBZPEMN 
KVNTGPU4KK 
PHYHKKGLQG 
ATGSNHYQEK 
PGKGAT6SNH 
VQSLP0K6AT 
RQYGVQ8LPQ 
LSHGE6MAQP 
GYLCEMEPVA 
PLBCGQGF8Q 
GTACVQTBZjO 
QKAPGZPEAQ 



31 
I 

SSSRYMGPNL 
FliNTOHAFSO 
RQGERVPVPC 
VASLFAGRSV 
BAGVSKYTBG 
XEBAAALVEB 
RQAPQHYPVA 
ALSTTWPGG 
DGDPGRRROS 
SCLRPLBA6T 
FPPGSSEEPL 



41 

I 

VLKXDRAQKR 



C3RGGLRPTHB 
ACVDRKGSGR 
FSHTASPSZG 
QREAGAAOVP 
PltVTQUITHO 
liRSWEBSRQK 
PKVTQECHLV 
VPGAALPGNP 
I^FPSGXiROS 



CPHHARLLQA 
APCVLGSIilP 
VDIVYGNVIKO 
ZAYRSSSANR 
QKGRKDEDWA 
PZTTRKRGYG 
aiiQGPITTRK 
YQEKGLRGPZ 



KGATGSNVZR 
LSVFRGNQGF 
BFGLGKDEAS 
QEHGHQ4DT1I 
SRRTHTHKFR 
VYBQDQB 



TAYYZVtWSA 
PHRLYIiQMST 
LFRC8ZI1ARO 
R60SNAGQSL 
VQSLPGKGAT 
RGYGLQSLPG 
TTRKRGYGLQ 
RGPZTTRKRO 
RKHGUPLIBB 
NNNHLRWPR 
SVEVTWPDGK 
ECIQPPFVCP 
PKREXiQZfSQG 



51 
I 

LVNIAVDERS 
IjHRNRPVLKP 
PEPPLLRPKS 
Y8IYIANYAY 
EISGRTEERE 
RGRVRTALQT 
RLAOKLARSV 
GQAMSRCALR 
ATMPALGGLE 
GNWVLDMAKA 
PVLQVGLGXA 
DAAA8AERRL 
PTRTG3RFY8 
ZPBSLMTHSY 
H6KVRFRD1A 



AKBPA8AZAG 

GSNEYQEKOL 
KGATGSNHYH 
SLPGKGATGS 
Y6I1Q8I1PGKB 
LNPGDALEPE 
TRFGAFARGA 
MVSRNVA86B 
RDKPVCVNTY 
ICTPVH8FFL 



Seq ID NO: 186 DNA sequence 

Nucleic Acid Accession «: MM_000584a 

Coding eequencei 75.. 374 " 



AG CAGAG CAC 
C3VCTGT0TOT 
CTGCACCTCT 
GCATAAAGAC 
AGAGTGQACC 
TCTGTCTGQA 
CTGAGAATTC 
GAAACTTCAA 
CCSVOATOCAA 
CATTGTAOCA 
CAAAAAACAA 
TACAAATAGC 
GAATGGGTTT 
TTTGCCATAA 
CCTAGTCTGC 
TTTCTAAGTG 
TGGAA6CACT 
CTATTTATTA 
AGAA6AT6AA 
QATATTAAAT 
AC3UVTT6GGT 
AGTAGATTAT 
AG TC TTOTC A 
AACTATTAAA 
TGTTTATTAT 



11 
I 

ACAAOCTTCT 
AAACATGACT 
GT6TGAA6GT 
ATACTCCAAA 
ACACTGCGOC 
CC9CCAAG6AA 
ATAAAAAAAT 
GCAAATCTAC 
TACAAGATTC 
TGAAATATCX: 
CAAATAATTT 
AAAATTGAGC 
GCTAGAATGT 
AGTCAAATTT 
TAGCCAGGAT 
GAAAAAGTAT 
TTAAGTTTTT 
TTTATGTATT 
TCATTGATTQ 
OATGTTTTAT 
ACCC3VBTTAA 
TQTTTATCTG 
TTGCCAOCTG 
ACAGCCAAAA 
GTACAAATAO 



21 
I 

AGGACAAOAG 
TCCAAGCTGO 
6CAGTTTT6C 
CCTTTCCAOC 
AACACAGAAA 
AACTGGOTGC 
TCATTCTCTG 
TTCAACACTT 
CTGGTTAAAT 
AOAACATACT 
TTAAATATAA 
GAAGGGOCAA 
GATATTTGAA 
AGCTGGAAAT 
CCACAAGTCC 
TAGCCACCAT 
TCATCATAAC 
TATTTAAOCA 
AATAQTTATA 
TAGATAAATT 
ATTTTCATTT 
AAAGTTTrAA 
TGTTGGTAGT 
CTCCACAGTC 
ATTCTTATAA 



31 
I 

CCAOGAAGAA 
C0QT06CTCT 
CAAOGAGTGC 
CCAAATTTAT 
TTATT6TAAA 
agagqgttgt 
tggtatccaa 
CATQTAtxOT 
ttgaatttca 
TATATGTAAA 
GGATTTTCCT 
GAGAATATCC 
GCATCACATA 
OCTGQATTTT 
TTQTTCCACT 
CTTACCTCAC 
ATAAATTATT 
TCAAATATTT 
AAGATOTTAT 
TCAATCAGGG 
CAOATAAACA 
TTO AACTAA C 
GCTGTQTTGA 
AATATTAOTA 
TATTATnAA 



41 
I 

ACCAGOSGAA 
CTTGGCAGCC 
TAAAGAACTT 
CAAAGAACTG 
GCTTTCTQAT 
G6AGAAGTTT 
QAATCAGTCA 
gTGGQTCTGT 
GTAAACAATG 
GTATTATTTA 
AGATATTGCA 
OAACTTTAAT 
AAAAT6ATG6 
TTTCrOTTAA 
GTQCCTTGGT 
AGTGATGTTG 
TTCAAOrGTA 
GTGCAAGAAT 
AGTAAATTTA 
TTTTTAGATT 
ACAAATAATT 
AATCXITAGTT 
ATTAOGGAAT 
ATTTCTTGCT 
ATGACTGCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
730 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



51 
1 

GGAACCATCT 
TTOCTQATTT 
AGATGTCAGT 
AGAGT6ATTG 
GGAAGAGAGC 
TTGAAOAGGG 
AGATGCCAGT 
TGTAGG6TTG 
AATAGTTTTT 
TTTGAATCTA 
OQGGAGAATA 
TTCAGQAATT 
GACAATAAAT 
ATCTGGCAAC 
TTCTCCTTTA 
TGAGGACATG 
ACTTATTAAC 
TTGGAAAAAT 
TTTTATTTTA 
AAACAAAGAA 
TTTTAGTATA 
TGATACTCCC 
AATGAOTTAO 
GGTTGAAACr 
TTTTAAATAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
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AAGGCTTTAT ATTTTTAACT TTAAGATQTT TTTATCSTGCT CTCCftAATTT TTTTTACTGT 1S60 
TTCTGATT6T ATGOAAATAT AAAAGTAAAT ATGAAACATT TAAAATATAA TTTGTTGTCA 1620 
AAOTAAAAAA AAAAAAAAA 

5 Seq ID NOt 187 Protein sequence 
Protein Accession fti 1«P_000575.1 

1 11 21 31 41 SI 

ml • I ' ' ' 

lU MTSKLAVALL AAPLISAALC EGAVLPRSAK ELRCQCIKTY SKPFHPKPIK ELRVIBSGPH 60 
CAMTBZIVKL SDGREtCLDP KEHWVQRWB KFLKRAENS 

Seq ID NO: 188 DNA sequence 
Nucleic Acid Accession Ut NM_003661.1 
15 Coding sequence > I.. 1152 

1 11 21 31 41 51 

I i I I ) I 

ATOAOTOCAC TTTTCCTTGO TGTGGGAGTG AGGGCAGAGG AAGCTG6AGC GAGGGTGCAA 60 

20 CAAAAOGTTC CAAGTGGGAC AGATACTGGA QATCCTCAAA GTAAOCCCCT CGOTGACTGG 120 

GCTGCTGGCA GCATGOACCC AOAGAQCAGT ATCTTTATTG A8GAT6GCAT TAAGTATTTC 180 

AAGOAAAAAG TQAGCACACA GAATCTGCTA CTCGTGCTGA CI6ATAATGA GGCCTGOAAC 240 

GGATTCX3TGQ CTGCTGCTGA ACT6CCCAG0 AATGAG6CA0 ATOAGCTCOO TAAAGCTCTQ 300 

OACAACCTTO CAAGACAAAT GATCATGAAA GACAAAAACT GGCACGATAA AGGCCAOCAQ 360 

25 TACAGAAACT GGTTTCTGAA AGAGTTTGCT OG6TTGAAAA GTGAQCTTGA GGATAACATA 420 

AOAAOOCTOC GTOOCCTTGC AGATGG6GTT CAGAAGOTGC ACMAGQCAC GACCATOOCC 480 

A A TGTGGTGT CTGGCTCTCT CAGCATTTCC TCTG0CATC3C TOACCCTCOT OGQCATQGGT 540 

CTGGCACCCT TCACAGAGGG AGGCAGOCTT GTACTCTTGG AACCTGQGAT GGA6TTGGGA 600 

ATCACAGCOG CTTTGACCGG GATTACCAGC AGTACCATGG ACTACGQAAA GAAOTGGTGG 660 

30 ACACAAGCCC AAGCCCAC6A CCTGGTCATC AAAAGCCTTG ACAAATTGAA GGAGGT6AGG 720 

GAOrrTTTGG GTGAGAACAT ATGCAACTTT CTTTGCTTAG CTGGCAATAC TTACXAACTC 7B0 

ACAC6AGGCA TTGG6AAG6A CATCCGTGCC CTCAGACQAG CCAOAOGCAA TCTTCAGTCA 840 

GTACC6CATG CCTCAGCCTC ACGCCCCCX5G GTCACTGAGC CAATCTCAGC TGAAAG06GT 900 

GAACAGQTGQ AGAGGGTTAA TGAACCCAGC ATCCTGGAAA TGAGCAGAGG AGTCAAGCTC 960 

35 ACGGATGTGG CCOCTGTAAG LTTCmCi ' T GTGCTGGATG TAGTCTACCT CGTGTACGAA 1020 

TCAAAOCACT TACATGAGOG GGGAAAGTCA GAGACAGCrO AQOAGCTGAA QAAQOTGGCT 1080 

CAG6AQCTGG AOQAOAAGCT AAACATTCTC AAGAATAATT AXAAGATTCT GCAGGGGGAC 1140 
C3^AGAACTGT GA 

40 Seq ID NO J 189 Protein sequence 
Protein Accession #s I9P_003 652.1 

1 11 21 31 41 51 

>ic i ■ 1 • I ' ' 

4j MSALFLGVGV RAEEAOARVQ QNVPSGTDTG DFQSXPXjGDW AAGTMDPESS IFIEDAIKYF 60 

XEKV8TQNUL XjLLTDHEAMH GFVAAAEliPR NBAOEUUCAL SNLARQMIKK DXNNKDXGQQ 120 

YRNWFZiKBFP RLXSELEDNI RRIiRAlADGV QKVHKSTTIA MWSOSLSIS S6ILTLVGHG 180 

IAPPTEGQSL VLLSP04ELQ ITAALTGITS STMDYGKKWW TQAQAHDI.VI KSLDKLKBVR 240 

BFLGENISNP LSLAGNTYQL TROIGia>IRA IJlRARANIiQS VPHASASRPR VTEPISAESG 300 

50 BQVGRVKEPS IX<EMSRGVKL TDVAFV8FFL VLDWYLVYE SKHLHEGAKS ETAEBX«KKVA 360 

QELBEKLNIL KHNyXILQAD QEb 

Seq ID NOt 190 DNA sequence 
Nucleic Acid Accession #i NM_014452.1 
55 Coding sequences 1..1968 

1 11 21 31 41 51 

I I I i I I 

ATGGGQACCT CTCOGAGCAG CAGCACCGCC CTOGCCTCCT GCAGCCGCAT CGCCCGCCQA 60 

60 GCCACAGCCA OGATGATOGC GGGCTCCCTT CTCCTOCTTG GATTCCTTAG CACCACCACA 120 

GCrCAGCCAa AACAOAAOGC CTG8AA7CTC ATTGGCACAT ACCGCCATQT TGACGGTGOC 180 

ACCG6CCAGQ TGCTAACCTG TQACAAGTGT CCAGCAGGAA CCTA TGTCT C TQAGCATTQT 240 

ACCAACACAA GCCTGCGOGT CTGCAGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 300 

AATGQCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT QATTGAGAAA 360 

65 TTACCTTGTG CTQCCTTOAC TGACOGAGAA T6CACTTGCC CACCTGGCAT GTTCCAOTCT 420 

AAOGCTACCT GTGCCCCOCA TAO GG TOTGT OCTGTGGGTT GGGGTGTQCa OAAGAAAGGG 480 

ACA6AGACTG AGGATGTGCG GTQTAA0CA6 TGTOCTOGGG GTACCTTCTC AGATGTGCCT 540 

TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GQTGGTGATC 600 

AAGCOGGGGA CCAAGGAGAC AGACAAOGTC TGTGGCACAC TCCCGTCCTT CTCCAOCTCC 660 

70 ACCTCACCTT OCCCTGGCAC AGCCATCTTT CCAOGCCCTG AGCACATGGA AACCCATGAA 720 

GTCXX.TTCCr OGACTTATOT TOCCAAAQQC ATOAACTCAA GAOAATOCAA CTCTTCTGCC 780 

TCTGTTAOAC CAAAOGTACT 6A0TAGCATC CAOGAAGQGA CAGT0CCT6A CAACACAAGC 840 

TCAGCAAGGG GGAAGGAAQA CGTGAACAAO ACCCTCCCAA ACCTTCAGGT AGTCAACCAC 900 

CAGCAAGGCC CGCACCACAG ACACATCCTG AAGCTGCTOC OOTCCATGGA GQCCACTGGG 960 

75 G6C6AGAAGT CCA6CACGCC CATCAAGGGC CCCAAGAG66 GACATCCTAG ACAGAACCTA 1020 

CACAAGCATT TT6ACATCAA TGAGCATTT6 CCCTGGATOA Tl X S l tK- TnT CXTOCT6CT0 1080 

OTGCTTGTGG TGATTGT6GT OTGCAGTATC C3GGAAAAGCT CGA6GACTCT QAAAAA6GG6 1140 

COCOGGCAGO ATCCCA6TGC CATTGT G GAA AAG6CAGGGC TGAAGAAATC CATGACTCCA 1200 

ACCCAGAACX GGQAOAAATG QATCTACTAC TGCAATGGCC ATGQTATCGA TATCCTQAAQ 1260 

80 CTT6TA6CAG CCCAAGTGGG AAGCCAGTOQ AAAGATATCT ATGAGrTTCT TTGCAATOCC 1320 

AGTQAflAGOG AGGTTQCTQC TTTCTCCAAT GGGTACACAG CGQACCAOGA 6GQGGCCTAC 1380 

QCA6CTCTGC AGCACTGQAC CATCOGGGGC CCCGAG6CCA GCCTOSOCCA GCTAATTAGC 1440 

GCCCTGOQCC A6CAC08QAQ AAAOGATQTT GTOGAGAAGA TTCGTGGGCT 6ATGGAA6AC 1500 

ACCACCCAGC TGGAAACTGA CAAACTAGCT CTGOCQATGA GCCCCAQGCC GCTTAOOOOG 1560 

1029 
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AGCGCCATCC CCAGCCCCAA CGCGMKACTT GAGAATTC06 CTCTCCTGAC GGTGGAGCCT 1620 

TCCCCACAGQ ACAAGAACAA GGGCTTCTTC GTGGATGAGT OGGAGCCCCT TCTCCGCTGT 1680 

GACTCTACAT CCAGC6QCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACXaAAGAA 1740 

AAGAAGOACA CAGTGTTGCG GCAGGTACGC CTGOACXXXTT GTQACTTGCA OOCIATCTTT 1800 

GATC5ACATGC TCCACTTTCT AAATCCTGAG QAGCTGCGGO TQATTQAAOA QATTCCCCAG 1860 

GCTGAGGACA AACTAGACOO GCTATTOQAA ATTATTG6AG TCAAGAGCCA GQAAGOCAGC 1920 
CACACCCTCC TGGACrCTGT TTATAaOCAT CrTCCTOAOC TGCTOTAO 

Seq ID NO; 191 Protein sequence 
protein Accession «i NP_055267.l 

1 11 21 31 41 51 

1 I I I I ) 

MGTSPSSSTA lASCSRIARR ATATMIAGSI> LLLOFLSTTT AQPBQKASNL I6TYRHVDRA 60 

T6QVLTCDKC PAGTYVSEHC TNTSLRVCSS CPVGTFTRHE NGIBKCHDCS QPCPWPMIEK 120 

XiPCAALTDRB CTCPPGMFQS NATCAFBTVC PVGHGVRKRG TBTEDVROCQ CARGTFSDVP 180 

SSVMKCKAVT DCLSC^WX KPGTKBTDNV CQTIiPSPSSS TSPSPGTAIP PRPEHMETHE 240 

VPSSTYVPKG MNSTESNSSA SVRPKVLSSI QE5GTVPDNT8 SARGKEDVNK TLPNLQWNH 300 

QQGPHHRHIIi KLLPSMBATG GEKSSTPIKG PKRGHPRQNL HKHPDINEHlf PWMIVLPLLL 360 

VLWXWCSI RKS8RTLXXG PRQDP8AIVE KAGLRKSMTP TQKRBKWIYY QIGKGIDILK 420 

LVAAQVGSQH KDIYQFLOIA SBRBVAAFSN GYTADHERAY AAIK2HWTIRG PBASLAQLIS 480 

ALRQHRRNDV VEXIRGLMED TTQLBTDKIiA UMSPSPL8P SPIPSPNAKL ENSALLTVBP 540 

SPQDKZnOGFF VDESEPU.RC DSTSSGSSAL SRNGSFITXE KKDTVUIQVR LDPCSLQPXP 600 
DDMUiFUYPE EIiRVIEEIPQ AEDKU>RLFE IIGVKSQEAS QTLUJSVYSH LPDLL 

Seq ID NOi 192 DKA sequence 

Nucleic Acid Accession «: XM_044533 

Coding sequence t 238.. 2751 

1 II 21 31 41 51 

I I I I I 1 

GCTCTGCCCA AGCCGAGGCT GOGGGGCCXSO OGCOQGOGOO AGQACTGOGG T6CCCCGCGQ 60 

AGGGGCTGAG TTTGCXywSGG CCCACTTGAC CCTQTTTCCC ACCTCCOSCC CXXICAGGTCC 120 

GQAGGCGGOG GCCCCCGGGQ CGACTCGGGQ GCGGACCGOG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACXTTGA GCCCGAQCCQ CGGGACACCQ TOQCTCCTGC TCTCCGAATG 240 

CTGOGCAOOO GOATGGGCCT QAGQAGCTGG CT0GCXX3GCC CATGGGGOGC GCT6C0GCCT 300 

C6GCCAC06C TGCTGCTGCT CCTGCTOCTO CTGCTCCTOC TGCA0C06CC OCCTCOQACC 360 

TGGGG6CTCA GCCCCCOQAT CAGCCTGCCT CrGGGCTCTG AAGAGCJGGCX: ATTCCTCAGA 420 

TTOGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TQAOCAGGGA TGOCAaOACC 480 

CTGTACGTGG GTGCTOQAaA OOCCCTCTTT GCACTCAGTA QCAACCTCAG CTTCCTGCCA 540 

QOGOGGQAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAO AGAAGAAACA GCAGTGCAGC 600 

TTGAAQGGCA AG6ACCCACA GOGOGACTGrT CAAAACTACA TCAAGATCCT CCTGCG6CTC 660 

AGCGOCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTQ TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC OAQAAGGGGA ATGTCCTCCT GGAAOATGGC 780 

AAGGGCCX3TT OTCCCTTCGA CCCOAATTTC AAOTGCACTQ CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC OGGCCATCTC 0C0GW3CCAA 900 

AGCCTTCGCC C3CACCAAGAC OGAGACCTCC CTCAACTOOC T6CAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT AC3VTTCCTGA GAQCCTGGGC AGCTTGCAAG GCX5ATGATQA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AQAACACXy^T TGTGTCCCGC 1080 

ATT0CCCX3CA TCTGCAAG6G 03ATQAGG6T GGAGAGOGGG TGCTACAGCA GOGC TG GACC 1140 

TCCTTCCTCA AG6CCCAGCT G CTOT G CTCA CQGOCX38AOS ATGGCTTCCC CTTCAACGTO 1200 

CTOCAGGATG TCTTCACGCT QAOCXXXAGC CCOCAGGACT 6G0GTGACAC CXTOPTCTAT 1260 

GGGGTCTTCA CTTCCCAGTQ GCACAGGGGA ACTACAGAAG QCTCTGCCGT CTOTOTCrTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAQC GGCCTCTACA AGGAGGTGAA CCX3TGAGACA 1380 

CAGCAGTGGT ACACOSTGAC CCACCCQGTG CCCACACCCC GGCCTGGAGC GTGC ATCACC 1440 

AACAGTGGCC GGGAAAGGAA GATCAACTCA TOOCTGCAGC TGCCAGACGG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560 

CCCCAGGCrC 6CTACCAGCX5 C GT GG CTSTA CAC0GCS3TCC CTCGCXTTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGOTGACGGC OGQCTCCACA AGGCAGTGAG CGTGGGCCCX: 1680 

COOOTGCACA TCATTGAGGA GCTGCAGATC TTCTCATOGG GACAQCCOOT GOIGAATCTO 1740 

CTOCTGOACA OCCACAGGQO GCT6CTOTAT GG6G0CTCAC ACTOGGGOGT AGTCCAG6T6 1800 

CCCATG6CCA ACTGCAGCCT 6TACAGGAGC TGTQGG6ACT GCCTCCTCGC COGGGACCCC 1860 

TACT6TGCTT GGAGOGGCTC CAGCT3CAAG CACX3TCAGCC TCTACCAGCC TCAGCT66CC 1920 

ACCAGGCOOT GGATCCAGGA CATCGAGGGA GCCAGOGCCA AQGACCTTTG CAGOOCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCftACA GGOGAGAAGC CATOTGAGCA AOTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTG Q OCTGC GC6CTCCTCT CCAACCTGGC GAGCG8ACTC 2100 

TGOCTACGCA ACGGGGCCCC OGTCAATGCX: TCGGCCTCCT GOCAOESTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAQC CAGCTACT6C CCAGAGGTGG TGQAGQAOGG GGTGGCAGAC 2260 

GAAACAGATO AGG0T6GCAG TGrACCOGTC ATTATCA6CA CATOQOOTGT GA6TGCACCA 2340 

OCTGGTG G CA AGGCCAGCTG QGO^SQCMAC AGGTCCTACr GGAAGGAOTT CCTCqT OATO 2400 

TGGA06CTCT TTGT6CTG6C OGT GC TOCTC CCAOTTTTAT TCTTpCTCTA CO6QCAO00Q 2460 

AACAGCATGA AAGTCTTCCT QAAOCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTQC 2520 

CCXGTGOTGC TGCCCCCTGA GACCOGCCCA CTCAAOGGCC TAGGQCCCCC TAGCACCCCG 2 580 

CTOGATCACC GAGGGTACCA OTCCCTGTCA GACAGCCCCC CGGGGTCXOT AGTC TTCACT 2640 

GAGTCAQAGA AGAGGCXIACT CAGCATCCAA OACftGGTTOa TGQAGGTATC GOCAGTGTGC 2700 

CCCOGGCCCC GGGTCCGCCT TGGCTOSOAG ATCOOTGACT CTOT SG TGTG AOAOCTGACT 2760 

TCCAOAGGAC GCTGCCCTGQ CTTCAGGGGC TGTGAATGCT OGGAQAGGGT CAACT GGAC C 2820 

T0C3CCTC0GC TCTGCTCTTC GTGGAACACX3 ACOCSTGGTGC CCOGCCCTTG GGAGCCTTGG 2880 

GGOCAGCTGG CCTGCTGCTC TGCAGTCAAO TAOOGAAGCT CCTACGACCC AGACAGGCAA 2940 

ACAGGOOTGO OCCCAQAOGT CCTGGGCAAA TATGOGGGCC TGGCTA6GTT GGTGOAACAO 3000 

TGCrCCTTAT GTAAACTGAO GCCTTTGTTT AAAAAACAAT TOCAAATGTO AAACTAGAAT 3060 

QAOAGGGAAG AGATAGCATG GCATGCAGCA CACAOGGCTG CTGCAOTTCA TGGCCTOCCA 3120 

GGQQTOCTGG GGATGCATOC AAAGTGCnTO TCTQAGACAG AGTTGGAAAC CCTCACGAAC 3180 
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■iXSGCX.-lX.Tl'C 
CAGGACCAGC 
CTGCOGTOGT 
OOTCCTGGGC 
GAGAGGACAG 
CAGOaAAGAO 
CCACCATATC 
GTCCrCTGCC 
TCAAOICTGC 
ATGCACTTTA 



ACCTTCCACA 
TTGGGCTOOG 
CCCACCACC7 
TOOQACCCAA 
GQOQAGCTCA 
ACTBTOCCCT 
CACCCTCGCT 
CAGTCCCOV6 
CCA6CACAG6 
TGTCATTTTT 



TTATCC06CT 
TGOGTTCTGC 
CAGGGACCAG 
CTGCTQGACC 
6GAGAGATTT 
GCCTTCCTCC 
CCATCTTTGA 
TTCACCCTCC 
66CCCT0AA7 
TAATAAAGTC 



GCCACC3GGCT 
CTTGCCAOTC 
AGGGCTAGGT 
TTfOCAGCCT 
OGTGACAIITG 
GTTGTTGGGT 
ACTCAAACAC 
ATCGCTCACC 
TTATGTGOTT 
TGAAGAATTA 



GCCCTOTCTC 
AGOOOAGQAT 
TGGGACT6G6 
GTATCAG8CT 
TAC6CCTTTC 
GAGAACCCQT 
GAGGAACTAA 
TTCCTCCACT 
TTTATACATT 
CTGTTT 



ACTGCAGATT 
GTAGTTOTTG 
GCCC TCAOCA 
GTGGGCACAC 
CCTCAGAATT 
GTGOCCCTTC 
CTGCAOCCTG 
CTAAG06ATA 
TTTTAATAAO 



3240 
3300 
3360 
3430 
3480 
3S40 
3600 
3660 
3720 



Seq ID MO: 193 Protein sequence 
Protein Accession «: XP 044533.3 



1 

I 

MLRTAMGLKS 
RFBAEHISNY 
8FKGXDPQRD 
GKGRCPFDPN 
VASAYIPESL 
T8FLKAQLLC 
FTMKDVQRVP 
NFLKDHPl^ 
PRVHZIBBLQ 
PYCAWSGSSC 
PQPNTVNTLA 
B6FQQLVASY 
HCTLFVLAVL 
PLDHRQYQSIf 



11 

1 

WLAAPHQALP 
TAUjLSROGR 
CQNYXKXXiLP 
FX8TALVVXX3 
GSLQODDDKI 
SRPDDGPPPN 
SGIiYKBVNRE 
GQVRSRMLLL 
IP8SGQPVQH 
KBVSLYQPQL 
CPLLSNLATR 
CPBWBDGVA 
LPVLFLLYRB 
SDSPPGSRVF 



21 
I 

PRPPLLLLLL 
TLYVGARSAIi 
LSGSKliFTOQ 
ELYT6TV88F 
YPPPSE3X3QB 
VLQDVPTIiSP 
TQQWYTVTHP 
QP QARYQ RVA 
LLLDTHRGIiL 
ATRPWIQDIE 
LHLRNGAPVN 
DQTDEGGSVP 
RNSMKVFLKQ 
TESBKRPLSI 



31 
I 

LLTjLTjQPPPP 
FAIiSSNLSFL 
TAAPSPMCry 
QQ3DPAX6RS 
PEFPENTIVS 
BPQDWRDTLP 
VPTPRPGACI 
VHRVPQLHHT 
YAASHS6WQ 
GASAKDLCSA 
ASASCHVLPT 
VIISTSRV8A 
GECASVHPKT 
QDSFVEVSPV 



41 

1 

TWALSFRISIj 
PGGEYOBXiLH 
INM BIFTLAR 
QSIJlPTKTtiS 
RIARICKGDE 
YGVPTSQWHR 
TNSARBRRIN 
YDVLFLGTGD 
VFHANCSLYR 
S9WSPSFVP 
GDLUiVGTQQ 
PA6GKASWGA 
CPWLPPBTR 
CPRPRVRL6S 



51 
I 

PLGSBERPPL 
GADAEKKQQC 
DBXQNVLLED 
SLNWiODPAF 
GGERVLQQRW 
GTTEGSAVCV 
SSLQLPDRVL 
GRLHXAV8VQ 
SCGDCLLARD 
TGEKPCEQVQ 
LGEFQCHSLE 
DRSYHKEFLV 
PUIGL6PFST 
EIRDSW 



Seq ID NO I 194 DNA sequence 

Nucleic Acid Accession «t MM_022ai9.1 

Coding sequence i 1..635 



ATGGCAOATG 
TTCTCTGOGT 
AGCCTGG6TA 
GCTCACGGCA 
ATCCTOTCCT 
GATGAGGTGG 
06CTGTCACC 
TGCA6TGACC 
GTTCTGTGCC 
TGCCAGGGCC 
A6TCACCAAT 



11 

I 

GGGCAAA6GC 
GGAGOQGCOC 
TGAAGAAOTT 
GCCTGCTCAA 
TCGTGQGCTA 
ACTGGTGCTG 
CCTATQTGGA 
TCAACAAGAC 
TCATGAACCA 
CCAOGCCCAA 
CCCCAGCGCC 



21 
I 

CAACCCCAAA 
A0SCTTG8G6 
CITCACOSTO 
CCTQAAGGCC 
OGGTTGCTAC 
CCACGCCCAC 
CCACTAVGAT 
AGAOTOTGAC 
GACGTACGGA 
CTGCAQCATC 
CCCCX3CCCCT 



31 

1 

GGGTTCAAAA 
GGCTCCTGTC 
GOCATGCTTG 
ATGGTGGAGG 
TOTGGGCTGQ 
QACTGCTGCT 
CACACCATGG 
AAGCAGACAT 
GAGGAGTACC 
TATGAACCQC 
CCCTA6 



41 

I 

AGAAGGTGCT 
CITCAAGAAC 
CT6GCAG0GT 
CCGTCACAGG 
GGGGCCGTGG 
ACCAGGAACT 
AOAACAAGAG 
QCATQTGTGA 
GTGGCTTCCT 
GCCCTGA8QA 



SI 
I 

GGATAGATGC 
CTCCAGGTCT 
TCTGTCCACA 
GAGGAGCGCX: 
COVGCCCAAG 
CTTTGACCAA 
TQAGATAGTC 
CAAOAACATO 
CAATGTCTAC 
GGTCACCTGC 



Seq ID NOi 195 Protein sequence 
Protein Accession #s IIP_073730 

1 11 21 31 41 51 

1 I I I I I 

MADGAKANPK GFKKRVtDRC FSGWRGPRFG A8CPSRTSRS SLOSKKPFTV AZLAGSVLST 
AHGSLLNLKA KVBAVTGRSA ILSFVOYGCY 0QI/3GRGQPK DBVDHCC3IAH DCCYQELFDO 
GCHPYVDHYD HTIBNNTBIV CSDLNKTECD KQT04CDXNM VLOilNQfrYR EByRGFIMVY 
OQGPTPNCSI YEPPPBEVTC SHQSPAPPAP P 

Seq ZD MOi 196 DMA sequence 

Nucleic Acid Accession Its XM_028196.1 

Coding sequence i 13 15.. 1791 " 



GGCATTGATG 
GTaTOTOrGT 
AGACTGGGGG 
ATGGCCTGGO 
CTCTTGTGGO 

GOGTCACXXJT 
CCCA6CCAGG 
CGCCCGCCCC 
GAGTGCAGGC 
CTGCCCCTAC 
CAGGAATGCA 
CTCCC6CCAT 
ATGCTCCACC 
A6AGGCAAGG 
AGGCAGGOCC 
TGGAGGACGC 
AG6CTGCCCA 
6CTGQGCCTG 



11 
I 

CTGTGTGOGC 
GTCTGGAGTC 
GCTGCAGAGG 
CTGGGCCCCT 
CAAAOCAGGG 
GAGOSGGTGQ 
AGOCCCCATG 
TGTGCTCCCT 
ACCTTCCTTC 
A6CGGCTGA0 
CTQCCACCAC 
CCTTTAGCCC 
GACCCTGCAO 
TGCATGQCTG 
AAGTGAT660 
CAGOGTCCAQ 
ATCCTACRGT 
GQCCCTGCCT 
GQCAGCTAAG 



21 
I 

GTGCGTGTGT 
AT6GCAG6GT 
TGAGGGTATC 
GAGGGCAGGC 
GAGOCGCCAA 
AGTCAGGGAA 
TAGCACCCTG 
CCCCAGGCCX: 
CCACCCACAT 
TCA6TGTGT6 
CTCACCTTCA 
AGQCCTGCTC 
ACCCCTCTGO 
QCAAACCATO 
ACCGCAGAGA 
GGCCCAOGAO 
GGGGGCAAGQ 
GCTTGGCTGG 
CTGGAOCTTT 




TGGCCTCAAC 
TGACTTGGAC 
TGTGGAGGAA 
GA6CTGAGCT 
GTTCCCCTGC 
TAGGCAGGCX3 
GCOSAAGG6T 
TG6AATGTTC 
TCCTCAGGCG 
AOTGAGCTCC 
GCTTOCAAGT 
GTOGGOOCCA 
TOAGACCCCC 
AGA0AAGCAG 
GTGCTCTGAG 
GGCTGGGGGC 
G6CCAGGGTC 



41 

I 

GTGTGTGTGT 
CTGTCTCCTT 
AGCTGCTTAT 
ATG6CAAGA6 
CAGA6TCTCC 
GGGGAGTCAC 
CTGTAGGTGA 
G6TACAGGGG 
GGCCAGGCAG 
1GGC06CTCC 
CTGOGGCCCT 
GCCGACAGCC 
T CCTGGGGG C 
GCTOTGOTQC 
AGGGATGAGA 
GGAGGGAGAO 
GTCCGGTGAA 
T6CTGGGAGG 
CAGA6CCTCC 




60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 



TGGGACCOCC 
AGCTTCCTGG 
GGCAGGGACT 
TGGCTGGGAG 
CTCCCTTCAO 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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CTTCCTGCTQ 
ACCCAGCACT 
TCCXy«5CTGA 
CACCCACCAG 
ATTCCAGACC 
GATAACCCAG 
CCAGGGCCCC 
TGCCTCCTCT 
GA6TCX30TGQ 
TCCTTGCTCA 
QGOGAGTTCA 
T6GCAGGGAA 
GG6CCCGAGG 
TTTGGOTGGQ 
CTG6CCTGGA 
CTCTGATQAO 
CCTGATGTGG 
CTCTCCCTOa 
OGOTTCTOOQ 
T66GQAGCTG 
GT660CCTQA 
GTCA6CGTCT 
T6CCCCGT0T 
q OCTO GAOGQ 
GCTGGGTGGO 
AGGG6GAGCT 
GQCATQA6CC 
TGGAlQCACtO 
GCCACAGCCC 
CTOATGOGCA 
TGAAGCCCCT 
TACX3TGCCCA 
GGACTTGCAG 
AQAAAOAGAQ 
CTGGT6CGCT 
CT6COQCTCC 
CCCCTGCAQC 



CACAGAACCC 
GGCTGCTGCT 
CCCAGCCTCC 
TCTCTCCCAO 
TTGTOGCCG G 
CCAAGTGGGG 
GCTGGGCTCT 

OTCTOGGCAG 
CTCAGTCCAO 

GCCCCAGGGA 
AGTGGGTGAA 
GAGGCACAGC 
TTTGQGCX3GT 
G6G6GGG6GT 
6CATGATGTC 
ATGGCCT6GA 
AOTTOQACTT 
AGTTTGOQGA 
ACAGG6CAOG 
GGCA6GCAGC 
CCACCCAGGC 
TT6ACXSA0AC 
CTGGATGG6C 
CCTGAGCTAO 
GCCAGG66CC 
CCTGGGTGAG 
GTACCTOCTG 
AAG6CAOAGC 
GCATTTTOQO 
CTTGCTGCCC 
GCTCAGGCCG 
AGCCCTACX3T 
CCACCAAAAA 
TCAQCCAGGT 
GAACTGAGCC 
ACTGQGCAGA 
CAGCCAGGGA 
T6CXXXACTC 
AGGOOGCCCT 



TCGCOOCTGO 
AGTCAOATGG 
TGGGCCX3CTT 
TGC0C06QGC 
GACCCCCTGT 
GCTGCAGCTG 
CATTQCOGGC 
CTGCTGCTSC 
T60C0G0GGC 
A6AG0GCTTG 
TGGTTTAACC 
CAGAGOrrGAO 
GGGTTCTTQA 
CTGACAGAGC 
CTTGACCCAT 
AGCACCACCT 
GTCCAGCCCXI 
T6GAAGCCAG 
AAOGGTGAOQ 
GGCCCTTGGC 
06ACCTGAGG 
CX3GACACAGA 
CTGCTGCTTC 
CT GGGC TGQQ 
GGCAGCAGGG 
ACCCTGCAGG 
CTCOGTCTGC 
G6CC0GCGGQ 
TG6CAG0GAC 
GG6TCTQAGC 
ACAGCCXXSAG 
GCTQACCX3TG 
GAAGGTCCAG 
GGGCACG60G 
GGAQAATGIQ 
OGTAGGCAAG 
CATGCTGGCC 
GGTGGACCGC 
CTGAATGCAC 
GCA6QACCAC 



OCACCCGOTG 
GGTAGCGGGC 
CTTCCAAACC 
OCAGCTGGCA 
GAGTT6T6GG 
TCCACAGATG 
GOCCTTGCCG 

06ccgcx:aca 

AOCACCACCA 
AAATGCA6GC 

CCCACAGAGG 
AAGGA66CCA 
GGAAGGCAGG 
GAAGO06A0G 
GTCAT6CAA0 
GCXXCTTGTC 
GGGGATGCTC 
GAG6T6AAGG 
GGGGAAGGGC 
TGA6GCCACC 
CCTGGGGGCA 
CATGAGACAA 
CAGGTGAGTC 
TOOGGCTGGO 
CCTGGCTCAC 
TQCAGCTTTT 
CACTGGGCAC 
CTGCCACTCA 
CCTGOCCTAT 
CCCAACT060 
CAGGT06GGG 
GTGOTGCTGG 
CTCATGCTQA 
GCCCCCTACT 
GACCTGGTGG 
GTGCACCTGG 
CACGCCOGGC 
ATGCTGGCCC 
CAOVrGCCTC 
T6CAATAAAC 



CTGCCTCCTT 
AGGGGCOGGA 
A6CAGG0TAG 
GCACAGCTAT 
ATTOCXAAGA 
GACTCA6CCT 
CGQGOQTCCT 
G6AAGAAGCC 
GCCACCTG6T 
TOCAGAGOCC 
CAGGGGGTTG 
TGCAACA6GG 
6GGTA0CCCA 
AnTGTGGCT 
GGCTGOOOGG 
CCAACTCACT 
AQCAATGGGG 
GCCCOGCTGC 
AGACCGCAT6 
CCGCTGGCTC 
CCGTGGACCC 
AAGTGCACCO 
AGGGATGGTC 
CAGCTGQGTO 
6CG6CTG0CT 
CAACTTCAAG 
CGTGGATCTG 
GGTGAOGTGC 
G6GCCATO0O 
CCAGAATCAC 
AQCTGTGCTT 
AGGCTCGAGG 
ACCAGAGGAA 
TCAAT6AGGC 
TGGCTGTCTO 
GTGCCCGGGC 
GGCCCATTGC 
TGCAGCCCOO 
TQTCTCOOOO 
GCCTTCTCCT 



GCCCTGGCAO 
GGGGCCACXrC 
A AAOAT GGGG 
ACCT6G6CTT 
GGGGTGT06G 
GGCCTCTACC 
CCTOGTCTCC 
CA6GGACAA0 
GA6GAGCGGC 
AGGGGAGCtSA 
AGGACCTTCC 
GCTQCCCCAT 
GATGCCACGT 
GTT6GGTG6C 
GAGCCCAGGO 
CCAGGTGCAA 
GTGCCTGCAG 
GCAGGACCAG 
C0CTGGGT66 
CGAGATCAGG 
CTATGCC06G 
AGGCAOGCTC 
GGCIGGGTG6 
QQ0CTGG6CA 
CA0ATCC06C 

CAGCATGTTC 
TGGTCACCAG 
AAAOACAGGC 
CCTCCCGGGC 
CTCrCTCCGG 
CCT6CX3TCCA 
OTX3QAA6AAG 
CTTCACCTTC 
GQAC06CAGC 
CTOQQGGCAG 
CCAGCGGCAC 
CVlTOGCCrO 
CTQAGCC7CA0 
GCC 



Seq ID NO: 197 Protein sequence 
Protein Accession ft: XPJ)28196.1 

31 



11 



TTCTCCOGCA 
CTCCCCCCCT 
TTCIGTTTCT 
TCTCCCCCTC 
CrCTGCCCCC 
GCGAAAAOTA 
CAGAC6AATT 
ACATTOQGCC 
CCAAOGCCOG 
TTCCAGACAC 
60CTT06CCT 
CTGGTGGACA 
A00CQT0T6A 
CTGGCOCTCC 
CCTCCGACCG 
QACACCTGGA 
CX5C0GGGGTC 
CCCCTGATTG 
AGCAATCGGA 
CTCCTCCTGA 
CCCCCTGGGG 
CCTGGGCCCC 
GATTGGCTTT 



ACCTTCCCTT 
CAOGOCOGOC 
CTCOGTGCTG 
OCCCTCTCTT 
CTCTATCCTT 
CAACATCTGG 

CTCcccxxxx: 

CCCGG6ACTC 
CTGTTOGGTT 
CAATGGGAAT 
C0T6CTGCAT 
CCCTCCAGTT 
GCGGTGGCAG 
TGGAGAOSTA 
TGCTTCCX3GA 
AGCAOTCCAC 
A06TGCTCGC 
CTCTACCCAC 
AGTGAGCAAA 
CCACGGACGT 
CTTCTCCTGA 
CTCCATGGGG 
AAACAOCCTT 



21 

I 

OGCTCCCTCC 
CTCTOGCCTT 
TTCTCTCCXn 
GGGCCGCCOC 
GATACAACAG 
CCCGCCCCAG 
CCCCAAAAAA 
GGCCAQAOOG 
TQCGACAOQC 
CCCAATGGGG 
TQCTGCTTAC 
GGTCTGTGGG 
OCGTGGCATC 
CTOTGCTACC 
CAACTTCCCC 
CCA60GCCTG 
CAAGGAGCTC 
CCAAGACCCC 
ACTGCCX3CAA 
TTCCATCAGG 
CCCAOTCCCC 
CTGAGGAAGC 
CACATACCCT 



31 
I 

OQTCCCCCCC 
OQCOGAACCA 

cror oo QCCT 

GTTTCACX3TT 
CTGACCTCAT 
CCOGAAGACA 
AAAA6CCATC 
G06CTGGCA6 
AGCAGGGAGG 
AAGTCXSATGC 
CXSCXXXSUTTG 
GACXXKXK3CT 
GTTGAGGAGT 
CC0GCCAA6T 
AGATACXXXK 
CGCAGGGGCC 
GAGGOQTTCA 
GCCCAG6GGG 
GTCTGCAOCC 
TTCCATCCX» 
GTGCCCOGCC 
ACAGCAGCAT 
CCCCCC 



41 

I 

AQCTCCTAGC 
AAGT6GATTA 
GCXXSCCTCT 
CACTCT6TCT 
TTCCCQATAC 
GCCCGTOCTC 
CCCCGGCTCr 
AjQQAGTGTCC 
TGGG0GGCA6 
TGOTQCTTCT 
AGACCCTGTG 
TCTACTTCAG 
GCTGTTTGOG 
GOGAGAGGGA 
TGGGGAM3TT 
TGCCTGCCCT 
GGGAGQCCAA 
GC6CCCCCCC 
OGGOGCCAOC 
AAAATCTCTC 
TCCCCGAAAC 
CTTCAAACAT 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



1 11 21 31 41 51 

I I I I I I 

KGHPPV8P8A PAPAGTTAIP OLIPDIjVAGT PCELWDSQEG OGDNPAKWGL QLSTDALSLA 
8TPGPRWALI AGALAAGVLL VSCLLCAACC CCRRERKKPR DKESVGLGSA RGTTTTHIiVR 
SGSUiTQSRE GUCSRLQSFO QRQEPSPRDG LTFTBAGR 

Seq ID NOs 198 DMA sequence 

Nucleic Acid Accession #> NM_000612.2 

coding sequence I 5 53.. 1095 ~ 



60 
120 



51 
I 

CTCCX3ACTCC 
ATTACAOGCT 
OGCTQTCCTC 
CTCCCACTAT 
CTTTTCCCCC 
CCTGGACAAT 
GCCC06T0GC 
QGCAGGAGGG 
06TGGC0GGC 
CACCTTCTTG 
CGQCX3GGQAG 
CA6GCC06CA 
CA6CT6TQAC 
06TGTOGACC 
CrrCCAATAT 
OCTGCGTGCC 
AOGTCACOQT 
A6A6ATGGCC 
ATCCTGCAGC 
QOTTCCACGT 
AGGCTACTCT 
OTACAAAATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



8eq ID NO: 199 Protein sequence 
Protein Accession 0: NP_000603.1 

1 11 21 31 



41 



51 

1032 



wo 03/042661 



PCTAJS02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



) I I I 1 I 

MGIPMGKSML VLLTPIAFAS CCIAAYRPSB TliCGaELVDT LQFVOSDRGF YFSRPASRVS 
RRSRGIVBEC CFRSCDUUiL ETYCATPAKS ERDVSTPPTV LPDNFPRYPV GKFFQYDTHK 
OSTQRUtKGL PALtRARRGB VLMCBLBAFR SAKRHRPLIA LPTQDPAHGG APPBIASNRK 

Seq ID mOt 200 DHA sequence 

Nucleic Acid Accession «: AK057131.I 

Coding sequence t 61 . . 1146 



60 
120 



1 

1 

AGTCTGGGOG 
ATGOCTCAGC 
AOOQTCTOCA 
TCTGAOTTCA 
ATTCCAACAG 
AAATACATTC 
GTGGAA6A0T 
CTGCXX3ATGG 
TTCTCTGQOT 
TCTGTGATGG 
AGATCTTTCT 
ATCTCAGCXX: 
TTCATCTTGO 
GAGAAOGQTT 
CCTCCAAAGG 
CCAATOCCTA 
6AGGTTTATG 
CATATTCAAA 
CTTCCGGATT 
GOCTGATAAC 
TTCTGATGAC 
6GAAGTCAAT 
AAAlCCTCACX: 
ACGCATCCAT 
CATCTTTGAG 
ACTGTGAGGT 
AAGG66GAAA 
ATCTOTCTTG 
AGAAATCTG6 
GCCCTCTTGT 
ACTTAACAAA 
TTA0G6CACT 
TTCTTOCCCC 
CTCTC TCCro 
CCAAACT9T0 
C 



11 

I 

TTTAGGTCAG 

GATGQAT6GG 
GCAAAGA6GA 
TGACTTTTAG 
ATAAAGTCTT 
ATAQCCACCT 
CTCACTTTGA 
TGGAGOQTQA 
CATTACTGCA 
TCATTTCTCT 
T6CTACA0TC 
ATGATTTCAT 
CCATGlkACCT 
AQACAAGCAT 
TCATATTTGG 
GAGAGAAATC 
GCAOOGOTCA 
CAOCCTGGAC 
AGAGTCCAGT 
AAGGCCTTGG 
ATTACTGCCC 
ACTQGCATCT 
GGAGTCAACG 
TTGATTCA6A 
CAAGGGGCCT 
GCTAGT6TTG 
CTCACTCTTA 
CTTCTCOCTT 
CCCTTATCTG 
TGTAOATTTG 
CTTGCTGGGA 
TTTTATTCCC 
CATCTCTCCC 
ATACTACCAT 



31 
I 

AACTACCCCG 

GOCGAGGAGC 
AGG0GTG606 
CTCTGAOAAG 
TCCTACAGTG 
GTTCACTATC 
TGTGQTOGCT 
TG60GTCATC 
6GCCTTGGAG 
GGGCCATGAT 
AAGGGQOQTC 
TCCIAACTTC 
CATOCTGCAA 
TGGCATCCTT 
AA6CGGGACA 
CCTTAACCAA 
AGTTCAATaT 
AGACAOTCCA 
TCCATGTGTT 
GCTACCAQCT 
CAGTTACTTC 
ACAGGTTCTA 
AGAAAATCTC 
ATGCTGACAC 
GCTGGGTTAG 
ATGAAACTTT 
AACTCTOCCA 
CCT0006ACA 
TCTTATGCTG 
AGAAATCTCA 
TGCTTTCTCC 
CTCTCTTCTT 
CTTACTCTTC 
AATTOTCACC 



31 
I 

GTAGCCTGAC 
CTGGTGGCTA 
6GGGAGCATC 
ATGAAAGAG6 
TCCAATACTA 
GTCAGCACCA 
C AAOQCT CGG 
GGCCCTGAAG 
TATGGTT6G0 
CTCCTGCTGA 
GAGGAGTCAT 
CAGCTAGCCT 
AAOAAQCCCA 
GTAAACATGA 
GCAGCTGCTG 
0TG6TGACTG 
TGCAATAATC 
CAT0CXXX3CA 
AOAOGTCCTA 
GAGTGCCTTT 
GCTCOGCCAG 
TATTGGCAAC 
CCCCATCTAC 
AGTCCAAGCC 
AGACCAGGAG 
GCATGCCC3GA 
T6ATCAAAAC 
AQAA CAAGGC 
TCTGCATCCC 
GTTATTTCAC 
ACCAGTTGTT 
AGAGCTTATA 
6GACTCATGA 
AATTTATTCr 
ATAATCAGTC 



41 

I 

AGCAGGAGCT 
TGCTGCTCCT 
AAA6GG0STC 
06CTGAAAGG 
CAGCCX^TGGC 
GCTTTATCCA 
ACCCCAGCTT 
AAG6CTG66A 
GCACACTG6A 
TCAGGAAGTA 
CAGGGACAGG 
TCATTGTGGA 
T0GCCTT6AT 
CTTCAGGCCA 
TCAGCCGATT 
TATTGCAQCA 
AGGACCACCA 
GTGGGCCAGQ 
GAACTCA06A 
GACCCCCTCC 
ACCGTACAGT 
ACA6ACA0CC 
ATACAQCCIG 
TATGAGACCC 
CCAGTTTCTC 
CCXrCGGGACA 
CACATTOTAA 
CXSGGXTTAAGG 
TT6ATCCACT 
TGCTTCACCT 
ACCTGATAGG 
TATTTCTTCT 
QCTGTCTCTT 
ACTTCTG6AC 
AAATAAAGTG 



51 

i 

CGAGAGAAGC 
AGmTCCCT 
OOQAATGCCT 
7GCCATCCAG 
TGAOTTCGGA 
GCATGAAGTC 
QCAGCCCTAC 
GGT6C000CA 
CGACAAGAAC 
CATCCCCCOA 
GGCTCAQAGG 

cGAGGaaaoc 

T6CA0TCTCA 
CTCTTCAOCT 
GGAGCAGACA 
ACTGQCAAAT 
CGGCACTCAC 
CX3U3MITCAA 
AOAACATTGT 
COGTCAGCCC 
COGTCTTCCC 
GATTCTTTAC 
AAGACTTCAA 
AAGTGAAATT 
ACCTGCACAA 
GGACTAACCC 
AACATTGCCC 
TAAAOTCAOC 
GGCATTTGCT 
TCCAGGCTTG 
AGTCTTTAAT 
TACTAGAACT 
CATCTCTCCT 
CTGGACTTAC 
ATCTQTOCAT 



Seq ID NOt 201 Protein sequence 
Protein Accession #t BAB71366.1 



51 



1 11 21 31 41 

I i I 1 I I 

MAQRCVCVLA LVAMI*LLVFP TVSRSMGPR8 GBHQRASRIP SQPSKBBRVA MKBALKQAIQ 
1PTVTFS3EK SNTTAIAEFG KYIHKVPPTV VSTSPIQHBV VEEYSHLFTI QGSDPSLQPY 
LLMAHFDWP APEEGNBVPP FSGLERDGVI YGWGTLDDKK SVMALLQALE U.LIRKYIFR 
RSFFISLGHD EBSSGT6AQR ISALLQSRGV QLAFIVDSG6 FILDDFIPNF X3CPIAI.IAVS 
BKGSMNLMLQ VMHT8GHGSA PPKBT8IGIL AAAV8RI*BQT FMPZIFOSGT WTVLQQLAN 
EVY6BKSUIQ OQIQDRHGTH HIQ8R6QVQC RPPSGPGHSQ LPDSFH7DSP R6PRTRBEBC 
G 

Seq ID NO: 202 DNA sequence 

Nucleic Acid Accession #t NM_004217.1 

Coding sequence: 58.. 1093 



GGGOQGGAGA 
GCCCAGAAGG 
AGCACCCTGC 
ATGAGCOGCT 
AGTGGGACAC 
CCTCTGGGCA 
ATGOTGGOGC 
CTGOGCAGAG 
AACTATTTTT 
CTCTACAAGG 
GAGGAOTTGG 
AAOCCAGAAA 
TCTGTGCATO 
CCA6AGATGA 
CTTTGCTATG 
TAT06CCQCA 
6A0CTCATCT 



11 

1 

GTAGOVGTGC 
AGAACTCCTA 
CCCAGOGAGT 
CCAATGTCCA 
CCGACATCTT 
AAGGCAAGTT 
TCAAGOTCCT 
AGATCQAAAT 
ATGACCGGAG 
AOCTGCAGAA 
CAGATGCTCT 
ATCTGCTCTT 
OGCCCTCCCT 
TTGAGGGGCO 
AGCTGCTGGT 
TOGTCAAGGT 
CGAAACTGCT 



31 
I 

CTTGOAOOCC 
CCCCTGGCCC 
CCTCCQGAAA 
GCCCACAGCT 
AACGCGGCAC 
TGGAAACGTG 
CTTCAAGTCC 
CCAGGCCCAC 
QAGGATCTAC 
GAGCTGCACA 
AATGTACTQC 
AGGGCTCAAG 
GAGGAGGAAG 
CATGCACAAT 
GGGGAACCCA 
GGAOCTAAAG 
CAGGCATAAC 



31 
I 

AGCTCTCCTC 
TACGGCOGAC 
GA6CCT0TCA 
GCCOCIGGCC 
TTCACAATTO 
TACTTGGCTC 
CAQATAGAGA 
CTGCACCATC 
TTGATTCTAG 
TTTGACGAGC 
CATGGGAAGA 
GGAGAGCTGA 
ACAATGTGTG 
GAGAAGGTQO 
CCCTTTQAGA 
TTCCCOGCTT 
CCCTOGGAAC 



41 
I 

OOCCTTTCrC 
AGACGQCTCC 
GCCO^TCTGC 
AQAA6GTGAT 
ATOACTTTGA 
GGGAGAAGAA 
AGGAGGGOGT 
CCAACATOCT 
AGTATGCGCC 
AG0QAACA6C 
AGGTGATTCA 
AGATTGCTCA 
GCACCCTOGA 
ATCTGTGOTO 
GTGCATCftCA 
CT6TGCCCAC 
G6CTGCCCCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
3100 



60 
120 
180 
240 
300 
360 



51 

I 

TCTAAGGATG 

ATCTGQCCTG 

ACTTQTOCTC 

GGAGAATAGC 

GATTGGGOGT 

AAGCCATTTC 

6GAGCATCAG 

GCGTCTCTAC 

C0GCGGG6AG 

CA06ATCATG 

CAGAGACATA 

CTTOGGCTGO , 

CTACCTGCCC 

CATTGGAQTG 

CAAOGAGACC 

GGGASCCCAG 

GGCOCAGGTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



1033 



wo 03/042661 



PCTAJS02/36810 



5 
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15 
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25 

30 

35 

40 
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80 



TCAGCOaVCC CTTGC30TC0G GQCCAACTCT CGGAGGOTCC TaCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CG Utf lXSOGrG TGTTTGTATQ TCTGTGTATG 1140 

TATAGGQGAA AGAAOGQATC CCTAACTGTT CCCTTATCTQ TTTTCTACCT CCTCCTTTOT 1200 
TTAAT3UMGG CTGAAGCTTT TTGT 

Seq ID KO: 203 Protein eequence 
Protein Accession #i NP_00420a 

1 11 21 31 41 51 

I I I I. 1 I 

MAQKENSYPW PYGRQTAPSG LSTLPQRVLR KBPVTPSALV U1SRSNVQPT AAPGQKVMEN 60 
SSGTFDILTR HFTIDDPEIG RPLGKGKFGN VYLAREKKSH FIVALKVLPK SQIEKEGVEH 120 
QLRREIBIQA RLHHPNILRL VNYPYPRRRI YLILEYAPRO EZ^YKBLQKSC TFDBQRTATI 180 
MBBLADALMY CHGXKVXHRD ZXPQIUIiLGL KGBLKIADPO HSVRAP8LRR KTMOOTLDYL 240 
PPETIIBQRMH NEKVDUICIG VLOrEUiVGN PPPBSASHNB TYRRIVKVDL KFPASVPTGA 300 
QDLISKLLRH NPSERLPLAQ V6AHPHVRAK SRRVLPPSAL QSVA 

Seq ID NO: 204 DMA aeciuence 
nucleic Acid Accession «t AK055663 
Coding sequence : 38.* 1423 



11 



21 



31 41 51 

I 1 I I I I 

AOAAOGGCTT CCGOGQaaAO CTGTGCAGCT CCTTATCATO GGGACAATTC ATCTCTTTOO 
AAAACCACAA A6ATCCTTTT TTGGCAAGTT GTTA06G6AA TTTAGACTTG TAGCAGCTOA 
C0C3AAGGTCC TGGAAGATAC TGCTCTTTGQ TGTAATAAAC TTGATATGTA CTGGCTTCCT 
GCTTATGTGG TGCAGTTCTA CTAATAGTAT AGCTTTAACT GCCTATACTT ACCTGACCAT 
TTTTGATCTT TTTAOTTTAA TGACATGTTT AATAA6TTAC TGGGTAACAT TGA6GAAACC 
TA6CCCTGTC TATTCATTTO GGTTTGAAAO ATTAGAAOTC CTQGCTGTAT TTGCCTCCAC 
AOTCTTGGCA CAGTTGGGAG CTCTCTTTAT ATTAAAAGAA A6T6CA6AAC GCTTTTT G GA 
ACA6CCCGAG ATACACAOGG GAAOATTATT AOTTGGTACT TTTGTGGCTC TTTQTTTCAA 
CCTGTTCACQ ATQCTTTCTA TTCGGAATAA ACCTTTTGCT TATGTCTCAQ AAGCTGCTAO 
TAOGAOCTGG CTTCAAQAGC ATGTT6CAGA TCTTAGT06A AGCTTQiaTO GAATTATTCC 
GGGACTTAGC AGTATCTTCC TTOOCOGAAT GAATCCATTT QTTTTGATTa ATCTTQCXGQ 
A6CATTTGCT CTTTGTATTA CATATATGCT CATTGAAATT AATAATTATT TTQCOGTAGA 
CACTGCCTCT GCTATAGCTA TTGCCTTGAT GACATTTGGC ACTATGTATC CCATGAGTGT 
GTACAGTGGG AAAGTCTTAC TCCAGACAAC ACC3VCCCCAT GTTATTGGTC AGTTGGACAA 
ACTCATCAGA GAGGTATCTA CCTTAGAT6G AOTTTTAGAA GTCOGAAATO AAGATTTTTG 
GACCGTAG6T TTTGGCTCAT TGGCTG6ATC AGTGCATGTA AOAATTCGAC GAGATGCCAA 
TGAAC3UWVTG GTTCTTGCTC ATGTGACCAA CaVGGCTGTAC ACTCTAGTOT CTACTCTAAC 
TGTTCAAATT TTCAAGQATG ACTGGATTAG GOOTOCCTTA TTGTCTQGaC CTGTTGCAGC 
CAATGTCCTA AACTTTTCAG ATCATCAOGT AATCCCAATG CCTCTTTTAA AGGGTACTGA 
TGATTTGAAC CCAGTTACAT CAACTCCAGC TAAACCTAOT AOTCCACCTC CAGAATTTTC 
ATTTAACACT CCTGGGAAAA ATGTGAACCC AGTT A TT C TT CTAAACACAC AAACAAGGOC 
TTATGGTTTT GGTCTCAATC ATGQACACAC ACCTTACAGC AGCATGCTTA ATCAAGGACT 
TGQAGTTCCA GGAATTGGAG CAACTCAAGG ATTGAGGACT GGTTTTACAA ATATACCAAO 
TAQATATGGA ACTAATAATA GAATTGGACA ACCAAGACCA TGATAGACTC TAACTTATTT 
TTATAAGGAA TATTGACTCC TTGGCTTCCA ATTTATTTAO TAATCX3UVCT TTGCATTGAC 
TGTTTAATCA TTTACTCTAA ATGTTAGATA ATAGTAGTCT TGTTCACATT TCAT6AAACC 
TATGAAACTA TATTTriGTA AAATGTATTT GTGACAGTGA AATCCTOGTA AATGT7AAA6 
GCTTTAAATA GGCTTCCTTT AGAAAATGTG TTTCTTTAAA TTTGGATTTT GGTATCTTTG 
OTTTTGTAGT TOACTGCAGT GTGATGTGAC CTTACCTTTA TAAQAGCCAC TTGATGGAGT 
AGATCTGTCA CATTACTAAG ATACGATATT TCTTTTTTTT TCOGAGACGG AGTCTTGCTC 
TQCCACTGTG CCOGGCCAAT ACATTATTAT TAACTTAAGG CTGTACTTTA TTAAGGCTTC 
CTTAGTTTTT GTTTTGTTTT GT TTTTT G AG ATGGAGTCTC ACrCTGTCGC CCAGOCTGGA 
ATGCAGTGQC ATGATCTCAG CTCACTX3CAA (XTCTGCCTC CTGAGTTCAA ATQATTCTCX: 
TGCCTCAGCC TCCCX3AGTAG CTGGQATTAC AGGCACCTGC CACCAOSCCC AGCTAATTTT 
TGTATTTTTA GTAAAGACGG GGGATTTCAC CATGTTGGCC AGGCTGGTCT TGAACTCCTG 
ACCTCATGAT CCACCCACCT TAGCXTTCSCCA AAGTGCTGGG ATTAGGTGTG AGCCACCGCA 
CCTGGCCGAT ATTTTCTTTA ATGAAATTTA TAAATATGCT TCTTGAATAA TACACATTTT 
GGQAAAGGGA AAAATGTCTG TTCAAAAAGT AAAGGTCTCT TTTATAGCTT TTCCAAACTT 
AATTGCTAAA TTTTTCTTTG AGGTTCTCCT GAATTATGTC TTACAAACTA AAAGC3UVAAA 
TTTTTAGCAG AAATTTTG6A ATACATTCTA TCTA6CACAA TTTGAATTTT TAATTATCAA 
GATTTTTSTT AAA G TTTCT C TCCTTTAAAA ATTTTASTAC ATTTGTAAAT 



8eq ID HO: 205 Protein sequence 
Protein Accession «t BAB70980.1 



1 

I 

HQTIHLFRKP 
TAYTYLTIPD 



RSLOGIIPGL 
OTMYPMSVYS 
VRIRRDANEQ 
HPUiKGTDOL 
8SMLKQGLGV 



11 
I 

QRSFPQRUjR 
IiFSIMTCLIS 
BIHTGRU.VO 
SSIFLPRMNP 
GKVLLQTTPP 
MVLAKVTNIUj 
HPVTSTPAKP 
PGIGATQGLR 



21 
I 

BFRIiVAADRR 
YWVTLRKPSP 
TFVALCFNLF 
FVLIDLAGAP 
HVIGQLDKLI 
YTLVSTLTVQ 
SSPPPBFSFN 
TGPTNIPSRY 



31 

I 

SWKILLPOVI 
VYSFOFERLE 
TMLSIRNKPP 
ALCITYMLIE 
REVSTLDGVL 
IFKDDWIRPA 
TPGKNVNPVI 
GTNNRIGQPR 



41 

I 

KLICTGFLLM 
VLAVPASTVL 
AYVSEAASTS 
INNYFAVDTA 
EVRNBHFWTX* 
LLSGPVAAMV 
LUiTQTRPYO 
P 



Seq ID KO: 206 DMA sequence 

Nucleic Acid Accession fft NM_016361.1 

Coding sequence: 397.. 1662 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
.1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



51 
I 

WCSSTNSXAL 
AQLGALFILK 
WLQBHVAOLS 
8AIAIALMTP 
GFGSLAGSVH 
UTPSDKHVIP 
FGUIHCSITPY 



60 
120 
180 
240 
300 
360 
420 



11 



21 



31 



51 
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GGAACTCAGQ QCOGGCTCCT GTTCCTTOVA GAGTQCTGGA OGCCAAACTT QftAATACAAG 60 

TTTAATGTTC CTCGTCGGGC AAAAQATAAG GATOOQATCT CCCCOGGCCC GGTGTGCAGC 120 

AGGAGOGACC AACCCOQACC CGGGTTAAAA CTCCCAGGGA CTCTTCGCTG CTGOCACCTC 180 

TTGTTCTCTC CCCO G TTCCC ACTOGGGGTC TCCCTCAGGO CCGOGAOTCA CAGOG GTOC C 240 

TGCTTGCTGA AGOQCTGOAT GTACGCATCC GCAGGTTCOC GOGGACTTGG GGaOQCCCGC 300 

TGAGCCCCGQ CGCCOGCAQA AGACTTGTGT TTGCCTCCTO CAGCCTCAAC CCOGAGGCAG 360 

CSAGGGCCTA CCACCATGAT CACTGGTGTG TTCAGCATGC GCTTGTGGAC CCCAGT6GGC 420 

GTCCTGACXrr CGCTGGCGTA CTGCCTGCAC CAGCGGCGGG TGGCCCTGGC CQAQCTCCAO 480 

GAGGCOGATG GCCAGTGTCC GGTOGACCGC AGCCTOCTGA AGTTGAAAAT GGTGCAGGTC 540 

GTCTTTOGAC A0GGGGCTC6 GAGTCCTCTC AAGC06CTCC CGCTGGAGGA GCAGGTAOAO 600 

TG6AACCCCC AGCTATTAGA GGTCCCACCC CAAACTCAGT TTGATTACAC AGTCACCAAT 660 

CTAGCTGGTG OTCCGAAACC ATATTCTCCT TAOGACTCTC AATACXaVTGA GACC AC CCTO 720 

AAGGGGGGCA TGTTTGCTGG GCAGCTGACC AAGOTGGGCA TGCAGCAAAT 0TTT6CCTT0 780 

GGAGAGAGAC TQAQOAAGAA CTATQTGGAA GACATTCCCT TTCTTTCACC AACCTTCAAC 840 

0CACAG6AGQ TCTTTATTCG TTCCACTAAC ATTTTTCGGA ATCTGOAGTC CACCCGTTGT 900 

TTGCTGGCTG GGCTTTTCCA GTOTCAGAAA GAAGGACCC3V TCATCATCCA CACTGATQAA 960 

GCA6ATTCA0 AAOTCrrGTA TCCCAACTAC CAAAGCTGCT GGAGCCTOAG QCACAQAACC 1020 

AGACGCCGGA GGCAGACTGC CTCTTTACAG CCAGGAATCT CAOAGGATTT GAAAAAGGTG 1080 

AAGQACA6GA TGGGCATTOA CAGTAGTGAT AAAGTGGACT TCTTCATGCT CCTGQACAAC 1140 

OTO G CTGCOG AGCAGGCAGA CAACCTCCCA AGGTGCOCCA TGCTGAAQAO ATTT6C3^CGG 1200 

AT0AT06AAC AQAGAOCTGT GOACACATCC TTGTACATAC TGCCCAAGOA AGACAGGQAA 1260 

AGTCTTCAGA TGGCAOTAGG CXXATTCCTC CACATCCTAO AOAGCAACCT QCTGAAAGCC 1320 

ATGGACrcra CCACTGCCCC COACAAGATC AOAAAGCTQT ATCrCTATGC GGCTC ATGAT 1380 

GTQACCTTCA TACCGCTCTT AAT6ACCCTG GGGATTTTTQ ACCACAAATO GCCACOOTTT 1440 

GCTQTTGACC TGACCATGQA AGTTXA0C3U> CACCTG6AAT CTAAOQAGTG GTTTGTGCAG 1500 

CTCTATTACC AOGGQAAGGA GCA O GTOCOG A6AGGTT6CC CTGATGGGCr CTOCCOGCTG 1560 

GACATGTTCT TGAATGCCAT GTCAQTTTAT ACCTTAAGCC CAGAAAAATA CCATGCACTC 1620 

TQCTCTCAAA CTCAGGTGAT GGAAGTTGGA AATGAAGAGT AACTGATTTA TAAAAOCAQG 1680 
ATGTGTTGAT TTTAAAATAA AGTGCCTTTA TAGAAAAAAA AAAAAAAAAA A 

Seq ID NOi 207 Protein sequence 
Protein Accession #*. NP_057445.1 

1 11 21 31 41 SI 

I I I I I i 

MRLHTFVGVL TSLAYCLHQR RVALAELQBA DGQCPVSRSL LKLKMVQWP RBGASSPLKP 60 
LPLEEQVEWN PQLLEVPPQT QFDYTVTKIiA GOPKPYSPYD SQYHETTIiKa QMPAOQLTKV 120 
GMQQMFAIiGE RLRKNYVEDI PPLSPTFNPQ EVPIRSTNIP RNLESTRCLL AGLPQCQKEG 180 
PIIIHTDEAD SBVLYPNYQS CWSLRQRTRG RRQTASLQPO ISEDLKKVKD RMGIDSSDKV 240 
DFPItiLONVA ABQAHNLPSC PMLKRFARMI BQRAVDTSIiY ZIiPKEDRBSL QMAVGPFLHI 300 
IiBSNXiLKAMD SATAPDKIRK LYLYAAHDVT PXPIiLKTX/31 FDHXKPPFAV DLTMBLYQHL 360 
BSXBWPVQIiY YH0KBQVPR6 CPDGI<CPIJ)M FUtAHSVYTlj SPBKVEALCS QTQVMBVGNE 420 
B 

8eq ID NOt 208 DMA sequence 

Hucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

I 1 t I I I 

TTTGAOGGGG TGGTGOaGGO AGTTTAATTC ATAAAOAAGC CTCCTOATCA GAAAGGGOCC 60 

TAACAGCCTG CCCTTGGAGA GAAGTCCTTC CTTQAGGATA AGQCCTCCCA GGGGAOGAGO 120 

T6CTGGGGGC CAGTGTTAGG CTTCAQGCCA TCCCTGGAGG CCAGTCCTGT GCTCAGCAAG 180 

TAGTGGCAGA GCCTGGAGTG ATGAGTGGGA TGGCCTTCTC AGGTACAGQA CTOTGCTGCT 240 

TCTGGCTGCT CTTGCaiTTTO CATTTGCCAC TCAGAACTGC C60GATCCCA GCAATGGCCA 300 

GGAGCCCTCC OGAGATCAGT CC6CTCAGCT GCAGQTTTTT GCAGTCATAO TAOAAOGQAT 360 

06TCTTTATT GGCAAATGGO TCATTOGCTT OCAAOGCAOT GAOOGCSUICT GTGTGACtCT 420 

GCAQGTTGCT CACTQCTCCT TCACCAGTGT CCTGOGAGGT CACCTTGGOG AQGQGTCACC 480 
TGA6CTGGCA OCGCAO 

Seq ID NO: 209 DMA sequence 

Nucleic Acid Accession #t FQENBSH predicted 

Coding sequence* 1..564 

1 11 21 31 41 51 

i I 1 I I i 

ATGGAGCCCT GGGOOTGGCT GCAGGGTTTA AAGAGCCQAC CCACX5TQCCC AGCAOCCTCC 60 

TCAGATCOGT TCTCTGCGCT GCCAGCTCAO GACACTGQTQ AAGGAGCAGT GAGGAACCTQ 120 

CAGAGTCACA CAOTTGOCCT GACTGCCTTG GAAGCCAATG ACCCATTTGC CAATAAAGAC 180 

GATCCCTTCT ACTATGACT6 GAAAAACCTO CAGCT0A0CX3 OACTOATCTO CGGAOGGCTC 240 

CTGGCCATTO CTOQGATOOC QGCAflTTCTO AGT06CAAAT GCAAATGGAA GAG CAGOCRQ 300 

AAGCAGCACA GTCCTGTACC TGAGAAGGCC ATOCCACTCA TCAC TCCAG G CRQAT TTCTC 360 

ACCTTGGCCA AATCAAATAA ACCTTTATCT CXSUVGCACCT TTGTCTTGGT OTTTGG^TC 420 

AGCTACACAT CAGTCTTCOG AGTGCCTCTT TCTGOQTCOC TQTACCCTGC CATTCCTGGT 4 BO 

. GATGCTGCTG CCCTCACATC AGGCCATCCA AGCATGCAGA ACATAACCAT GCAGAACACT 540 
GOAAGGAAGO GCT6TACCTA ATQA 

Seq ID NOi 210 Protein sequence 
Protein Accession #i FGENESH predicted 

I 11 21 31 41 51 

I I I 1 I I 

KBPWANLQGL XSRPTCPAAS 8DPFSALPAQ DTGBQAVRNL QSKTVGIjTAL BANDPPANXD 60 

DPFYYDinCNL QLSQLI06GL XAXAOIAAVL SGKCKCKSSQ KQH6PVPBRA XPIalTPGRFL 120 
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PCTAJS02/36810 



TIAKSNKPLS P6TPVLVFGI SYTSVFRVPL SASLYPAIPG DAAALTSGHP SMQNXSMQNT 180 
GTKGCT 

. Seq ID NO: 211 DNA sequence 
J Kucleic Acid Accession ft: FGQIBSH predicted 
Coding sequence: 1..318 



10 
15 
20 
25 
30 
35 



75 



»1 21 31 

I I I 



6eq ID KO: 212 Protein eequence 
Protein Accession #: P06NESH predicted 

1 11 21 31 

I 1 I I 



HPSKQNISMQ imSTKGCT 



41 

1 


51 




CCCXSGCTGCC 


1 

CCGTCTGGGA 


60 


GTGCCTGCTG 


CGCTGG6CCC 


120 


TCCACCCCAG 


GGCCTGCTCC 


160 


GGGTCTTCAT 


TCTCACCCTG 


240 


ACAACTGTCT 


CACCCC3U3CA 


300 


6CCTCACTCT 


TOQOOCCCAT 


360 


CCCTTCACGT 


CCAGTTATCC 


420 


ATCAOGACAT 


AGOGCAGCAC 


480 


QTCACACAGT 


TGGCCTGACT 


540 


CCTTCTACTA 


TGACTGSAAA 


60O 


CCATT6CT60 


GATOGOSGCA 


660 


AGCACAGTCC 


TGTACCT6AG 


720 


TGGCX»AATC 


AAATAAACXrr 


780 


ACACATCAGT 


CTTCOGAGTQ 


840 


CTGCTGCOCT 


CACATGAS6C 


900 


OGAAGGGCTG 


TACCTAA 




41 

1 


51 




RPPVPOQALR 


1 

PPTFHPRACS 


60 


GNSP6SRAPC 


GVAGLTLRAH 


120 


INDTGB6AVR 


NliQSHTVQIiT 


180 


VLSQKCKCKS 


SQKQBSPVPE 


240 


PLSASLYPAI 


FQ}AAAIiTSG 


300 



.-^ Seq ID NO: 213 DNA sequence 
4U Nucleic Acid Accession fti FQENBSH predicted 
Coding sequence: 1..175B 

1 11 21 31 41 51 

^ I I I I I 

**D ATGATGGGGT CTCATOTTCC CXaGGCTGOT CTTGAACTCC TGGGCTOQAG TOACCCTCCT 60 

GCCTTGGCCT CCGAAAGTGC TGGGATTACA GGACTGTTAT TACAGGAATC CATAACACTG 120 

GAGOATGTGO CXGTGGACTT CACTTGQQAQ GAGTGGCAAC TCCTGGGOGC TOCTCAGAAG 180 

GACCTGTACC GOQATGTGAT GTTQGAGAAC TACAGCAACC TGGTGGCAGT GGGGTATCAA 240 

GCCAGCAAAC OGGATGCACT CTTCAAGTTQ OAACAAGOAO AACAAGTOTG GACAATTOAA 300 

JU GATGGAATCX: ACAGTOOAGC CTGTTCAGOT TCTCCRAAGa TCCCXSTTCTC CATTTTCTCA 360 

TCTGTGCCCr TCACTCTTCA AAATTGCCTT CATTCTAACA TATGQAAAGT TGATCaTOTG 420 

CTGGAGCGCT TGCAGAGTGA AAGCCTGGTG AACAOAAGQA AACX:aTCTCA TGAACATGAT 480 

GCATTTQAAA ATATTGTTCA TTGCAOCAAA AGTCAGTTTC TOTTAGOQCA AAATCATCAT 540 

ATATTTGACT TAOGTGGAAA AAGrTCGAAA TOC»ATTTAA CTTTACTTAA CCAGAGCAAA 600 

J J GGCTATGAAA TAAAGAACTC TGTTCAGTTT ACTGGAAATG GGGACTOCTT TCTTCATGCT 660 

AACX31TGAAC QACTTCATAC TQCRATTAAA TTCCCTGCAA GTCAAAAACT CATCAGCACT 720 

AAGTCCCAAT TCATCAOTCC CAAGCATCAG AAAACAC6AA AATTAGAGAA QCATCATCTG 780 

TGCAOTGAAT GTGGQAAAGC CTTCATCAAG AAOTCTTGGC TAACTGATCA CCAGGTAATG 840 

CATACAGGAO AOAAACCCCA CAQATCTAOT CTATGrOAOA AftGCCTTCTC CAGAAAOTTC 900 

OU ATGCTTACTO AACATCAGOQ AACTCATAC3V GGAGAAAAAC CTTATGAATO CCXTTGAAIGT 960 

GGCyWUUSCCT TTCTCAAGAA ATCAOGGCTC AACATACATC AGAAAACACA TACCGGAGAG 1020 

AAACCCTATA TATGCAGTGA AT6TGQAAAA GGCTTCATCC AGAAAGQAAA TCTCATTGTA 1080 

CACrCAGOSAA TTCATACAGG TGA6AAACCT TATATATGCA ATQAAT6TGQ AAAAGGCTTC 1140 

ATTCAOAAGA G6T0TCTCAT AGCACATCAO AQATTTCACA CAGGAAAOAC GOCCTTTOTO 1200 

00 TGCA6T0AAT OTGGAAAATC CTOTTCTCAO AAATCRG6TC TCATTAftACA TCAAAOAAIT 1260 

CACACAGGAG AGAAACCCTT TGAATOTAGT GAATOTGGGA AAGCCTTTAG CACAAAGCAA 1320 

AAGCTC3VTTQ TCCATCAAAG GACTCATACA GOAGAGAGAC CCTATGGCTG TAACGAGTGT 1380 

GGGAAAGCGT TTGCOTATAT GrTOGTOTCTG GTTAAGCATA AGAQAATACA CACAAGGGAO 1440 

AAACA AOAGQ CAGCCAAOGT OQAAAATOCT CCTQCSUSAaA GGCACAGCTC ATTACACSWX: 1500 

/l; AOTCATGTCA TGCAOGA0AA AAACTCTGCT AACJGGGOCOA CTACACAAGT GCCrTCTGTG 1560 

GCCCXrrCAQA CATCATTAAA CATCAGCXK3C CTCCTCGCAA ACAGGAAOGT AGTCCTTOTG 1620 

GG ACAGCCAQ TGGTCAGATG TX3CAGCCTCA GGAQATAACA GAGGATTTGC ACAGGACAGA 1680 

AACCITOTQA AT6CAGTGAA TGXG6TTQTO CCTTC06TQA TCAATTATGT CTTATTTTAT 1740 
6TTAGAGAAA ACCCATAG 



Seq ID NOt 214 Protein sequence 
Protein Accession «t FGENESH predicted 



fin } \^ 2^ *1 " 51 

80 1 I I I I 1 

MMGSHVAQAG LBLLGSSDPP ALASBSAGIT GLLLQESITL EDVAVDPTWB EWQLIX3AAQK 60 

DLYRDVMLBN YSNLVAVGYQ ASKPDALFKL BQGEQLWTIE DGIHSQACSQ SPKVPPSIFS 120 

SVPPTLQNCL HSNIWKVDHV LERLQSESLV NRRKPCHBHD APENIVHCSK SQFLLOQNHD 180 

IFDLRGKSLK SHLTLYNQSK OYBIKNSVBP TQNGDSFUIA NHBRIBTAIK FPASQKLIST 240 
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KSQPISPKHQ KTRKLEKHHV CSBCCSKAPIK KSWLTDHQVM KTQBKPHRCS XXIEKAPSRKP 300 

MLTEMQRTHT GEKPYECPEC GKAPLKKSRL NIHQKTHTQB K2YICSE0GK GPIQKGNLIV 360 

HQRIHTOBKP YIOJECGKGF IQKTCLIAHQ RFHTQICTPPV C8EG6KSCSQ KSGLIKKQRI 420 

HTGBKPFECS ECXSKAFSTKQ KLIVHQRTHT GERPYGC3JEC GKAPAYMSCL VIOfKRIHTRB 480 

KQBAAKVEHF FAERHSSUiT SDVMQEKNSA NGATTQVPSV APQTSUHSG LLANRMWLV 540 
GQPWRCAAS GDHRGFAQDR NLVNAVNWV PSVZNYVLFY VTQIP 

8eg ID NO; 215 DKA sequence 

Nucleic Acid Accession NM_032190.1 

Oodins sequence: 502.. 13 32 

1 11 21 31 41 51 

I i I I I I 

GATTCOGTGT TCTTGGCCAT GTTAGCCATA ATATCCTGTG CAGTATGTTT TTCCTGTGCA 60 

GAGGCAAAAA CATATTGGGC ATATGTTCOC AAGCCCCCAG CAGTATGACC CATACTTTGG 120 

A6TGACACTC CTCCTAAGAT TTATCA7GAT TAAGGAGGAT GQGCTCCAGG ACCCCTAACT 160 

CCACCTGACA TAGAACAOTT A6ACTCTCAO AATAATGTCA TTAATTATAC OGCTCCATTG 240 

GAA6GACTTC CTfT G TGTGT CACCACAAAG ACATCACTCA GCXATAGCTG TCTTACAGTT 300 

CAAGCTCACA CATGGTTGAO TCACTATGGG AAAATCATOT ACTTATTAAQ TCTTGOTTAT 360 

ATTAATGTAA COGGTGTGCT AACCAACCAT TCCT6GCCCA ATOGCCTTCA TTOTGCTOAC 420 

TAT7U»GAAT GGATTCCCTT CAATAGTTCC TACOXCCTC CATAGACCCA GTGTCTTGGC 480 

CCACTGGCTA GAAAACAATC TATGTTAACT GGAGACATTG TGGATTG6GG AGCTAAAGGC 540 

CAATTAGATG GAAAAOAAOA AAATCAGAAA TGGTG6CACA AACTTTGCTG GCATTGGTGG 600 

CAAGCTTTTA ATGCTTCTTC TTTATATAAC ACTGGGATCC AATCCCAGTC GGCCGCCCAG 660 

ATTGCTTGGC ATGQAGCAGG CTTTAGCCOO CCXCTTCCTC AGTGGCATTA TCTAGGGAGG 720 

AAAGGACXAA TTCAAAAGAT GATATGGAA6 GCAGCATTCC CATTTATGAA TG GCAAC ATC 780 

TGGGTTGCCA TAATACTATC CAATAATAGC AATAGTAAGC AACACAOTCT TAATQTTACA 840 

TTTGTAAAQA ATATCACCAC TCAATTTACA GTTT6TGTTT TTAATCCTTA TGTGTTTTTG 900 

GCAGCTAAGA AGGACCAGCT CCAGGTAAAC AATACCCAAT TGACCTGTAA ATCTTQCCAG 960 

TTATATCACT GCATTAATCA TAGCACATTQ CAAACACATA ATATCTCTAC TTTGATGATT 1020 

TTAGGTTGCA TOCCTGGGCT ATGGATTCCT GTTAATCTGT dQAGCCATQ OgCTG OCACA 1080 

A TTGCTTTAC ATTTTGTGAA ACTTCTTCTA ACTCAQTTTA CTCATTQTGT COGTAGAGOC 1140 

TTAGGCATOA TAATTTTTGC TATTGTTTAC TTGGTCACAC TAATAATTTC TGTTGTGATG 1200 

TCCTCTGTAG CTTTGCATAG TTCTATTCAA ACAOCTCAGT ATGTGQAGAA CTGGACACGC 1260 

ACAGTCAACC AAGGGTGGCT ACTTGAGAAT AAAATTAACA CTGAGTTACA AACTGAAGTO 1320 

GCAGTGTTAT AATCCAGQAT TCTATGGTTA GQGQAACAAO TACAAAGCTT OGAATTOGAG 1380 

CAGTAATTOT GTTOTCATTT TAATCACACT CATATTTGTO TAACCAACTT AGAATATAAC 1440 

CAAAGTGAGT ATCCATGGOA TCTTGTGAAA GCCCATTTQC AGGGAGCTTT CACATCCX3AC 1500 

ATCACCTTTG ATATTGOTQA ATTACAAAAC AAAATTCTTG ATTTAAATAA ACAAATTCCA 1560 

GA0TTTCA6C CTTCTTTAGA AGACTGGACT GAATTCCAGC AAGGCCTGGA QAGCGTCAAC 1620 

CCTTGGACCT ATCTAAAGCA CCACATTAAC ATCTTATATA TAGTTCTTGG AATAATGTTO 1680 

TTTTOTCTCT GTCTTCTGTT CATAGTCT6T AAAATOGGAT GGACTOCCAA TOQGAQAATO X740 

AAAGCTACCC AGCCTGGCCT TACATTCTTT CACTTAATAC ATAAACAAGA AGGGGGAAAT 1800 

QTTGGGAGCC AAAAAGGCCA AAGGGATOGT OACCAACTCA QCATTCCACT GGAGGCTACA 1860 

TGATCAAACA GCAAACTGTT TATCATGAAT ACAGAATGTG GGCAAACTC3Q CTTCTQTGCC 1920 

TGCXrCAOAAO OTTTOCTGAG GGCCA3GGCT COCTGGCCCC OGCTCCTTGA GQTTATCTAC 1980 

T6G6ACATCT A0A6GCTATT GTTOGAGQAA TGCAGTCTTG CAA6CCTACT CrGGAGGSAO 2040 

CAGCTQACCT CTTCTTCCAC AOCOCTTCfC ACTATCTCTT TT6CCTAATA AATATG6AG0 2100 

GCTGTGTAAA GCTCAGG6CC CTTOTOCACT AGAGOCAAGQ TGTCCCCTGA OCCXTCTTCC 2160 
AAACAT 

Seq ID NOx 216 Protein sequence 
Protein Accession ffi NPJIX5566.1 

1 11 21 31 41 51 

I I I ! I 1 

KLTGDIVDHQ PKGQLDGKEB NQKSWHKLCH HWWQAFHASS LYMTGXQ8Q8 AAQIAHHGAG 60 

FSPPLPQWHY LGRKGPIQKM IWKAAFPFMM GNTWVAIILS NNSNSKQHSI. NVTPVKNITT 120 

QFTVCVFNPY VFLAAKKDQI* QVNNTQLTCK SCQLYHCINH 8TLQTHNIST LMILGCIPGI* 180 

NIFVNL8BPW AATIALHFVX LLLTQFTHCV RRGLGMIIFA ZVYLVTLZIS WHSSVALHS 240 
SIQITAQYVEN HTRTVNQOHL LENKINTEIQ TBVAVIi 

Seq ID NOt 217 DNA sequence 

Nucleic Acid Accession #t fgenBSB predicted 

Coding sequences 1..1S66 

1 11 21 31 41 51 
1111 i t 

ATGGTGAACC CCAAATCCAC TTCCTCCCTC TTCAGQTTAT GTTTTTTGCT CCTGAGGAOT 60 

CAGAACCTGT GGGTT6AAGA 6CAAATTCAA TGCAAAAACA TATTGOGCAT ATGTTCCCAA 120 

TCCCCCAGCA 0TATG6CCTA TACTTTGOAG CTCACTCCTC CTGAGATTTA TCA06ATCAO 180 

GGAGAGTGGG CTCCAGQACC CCTAACTCCX: OGTGACATAO AAAAOTTAGA CTCTCAQAAC 240 

AATOTCATTA ATTATACCAC TTOVCTGOAA GQACTCOCTT TGTTTATCAC CACAAAGACQ 300 

TCGCTCAGCC ATAGCTGTCT TGCAATTCAA GCTCAAACAT GGTTGAGTCA CTATGQAAAA 360 

ATTATGTACT TATTAGQTCT TGGTTCTATT AATGTAACTG GTGTGCTAAC CA ATCATTC C 420 

CAGTCCAGTC ACOCTAATTG T6CTQATTAT ACAGAATG6A TTCCATTCAA TAOTTCCTAC 480 

CCCacrCTGT OGACCCAOTG TCTTGATCCA CTOGCTAOTA AACAATATAT GTCAACTGAA 540 

GACACTGTGG ATTGGGAACC TAAAGQTCAA TTAGATGQAA AAGGTGAAAG TCAGAAATCA 600 

TGGOVCAAAC TTCACTGGCA TTGGOGGCAA GCTTTTAATG C nt Tl CUI ATACAACAGC 660 

AGAATCCAAT CCCAGTCTGC TGCTCAGATT GCTTGGCATQ GAGCAGGCTT TAGCCCACCT 720 

CTTCCTCAGT TGCATTATCT GGGQAGQAAA GOACCAATTC AAGAAACTAT ATG6AAGGCA 780 

GCACTtXCAT TTATGAATGO CAACATCTGO ATTGGAACAC TOTCTAATAA TAGCA ATAGT 840 

AA6CAACACA GTCTTAATGT TGCATTTGTA AAOAATATCA CCACTCAOTT TACA6TTTGT 900 

GTTTTTAATC CTTAT6CCTT TTTGOGAGCT AAOAAGAAGC AGCTTCAGGT GQAGAACTGa 960 
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ACACQCACAG CTGACCAAGC QAGGCrACTT CAGAATAAAA TTAACACTQA GTTACAAACT 1020 

GAAGTGGCAA TGTTGAAATC CATGGTTCTO TGGTTAGGAG AACAGGTACA AAGCTTGCAQ 1080 

TTGCAGCAQC AATTGCGTCA TCATTTTAAT CACATTCATA TTTGOGTAAC TA ACTC AGAA 1140 

TATAACCAAA GTGAGTATCC GT6GGACCTT GTGAAAGCCC ATTTGCAAGQ AGCTTTCACA 1300 

TCCAACATCA CCTTTGATAT TGGTQAATTA CAAAACAAAA TTATTGATTT AAATAGQCAA 1260 

ACTCAAOAAT TTCAGCCTTC TTTAGAAGAC TGGACOGAAT TCCAGGAAGO CCTGOAQAGC 1320 

CTCAACCCTT GGACCTATCT AAGGCACCAC ATTAACATCT TATATGTAGT TCTrCGAATA 1380 

ATGTTGTTTT GTCTCTGTCT TCGGTTTATA GTCTGTAAAA TOSGATGGAC CACCAATTGG 1440 

AGAATGAGAQ CCTCCCAGCC CAGCCTTACA TTCTTTCAAT TAATACATAA ACAQAAAGGO ISOO 

GGATATQCAG GGAGCCAAAG GCCTOTGGGA OQTOACCAAC TCAGCATTCT GCTGGAGGCT 1S60 
ATATGA 

Seq ID NO; 218 Protein sequence 
Protein Accession #i FGSHBSH predicted 

1 11 21 31 41 51 

I I 1 I I i 

MVNPKST8SL FRLCPLLLRS QNLHVEBQIQ CKNILGICSQ SPSSMAYTLB LTPPBIYHDQ 60 

GEWAPGPLTP RDIEKIJ3SQN NVINYTTPLE GLPLFITTKT SLSHSCLAIQ AQTWLSHYGK 120 

IMYLDGIiGSI NVTGVLTNH8 QSSHPNCADY TEWIPFN8SY PTLHTQCLDP LASKQYMSTB 160 

DTVDHSPKSQ LDGKGBSQKS NHKLHHRHRQ AEKASSLYNS RIQSQSAAQI ANK6AGF8PP 240 

LPQLHYLGRK GPZQBTIWKA ALPFMNGMZW IGTLSNNSKS KQHSU4VAPV KKITTQPTVC 300 

VFNPYAFLAA KKNQLQVENH TRTADC2ARLL QNKINTELQT EVAMLKSMVL WLGBQVQSLQ 360 

liQQQLKHHPN HIHICVTNSE YNQSEYPWDL VKAHLQGAPT SNITFDIGEL (^fKIIDLNRQ 420 

TQBFQPSIjED HTEFQEGLES LNPHTYIiRHH ZNIXiYWLOI MLFCLCLRFX VCKIGHrmW 480 
RMRASQPSLT FFQLIHKQXG 6YA0SQRFVG RDQIiSILLEA Z 

Seq ZD NO: 219 DNA sequence 

Kucleic Acid Accession #t FGENBSB predicted 

Coding sequence! '1..900 

1 11 21 31 41 51 

I I i I ) ) 

ATGCOGCCGC GGGAGCTQAG CGAGGCOGAO CCQCCCCCX5C TCCX3G0CCCC QACCCCTCCC 60 

CGGCGOOGGC 6TAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTG6T GGGG6ACGGC 120 

GCGGTGGGCA AGA6CAGGCT CAT06TCAGC TACAOCTGCA ATGGOTACOC 060606CTAC 180 

066CCCACTO CXSCTGGACAC CTTCTCTGGT AG6TAGGTTC AATOGCOOST 6CX3GC0606T 240 

GGCrGCX360G GGGCTGTGCA CCGGGGAGCT GGGGOSGGOQ TCTG6G0QGG AGGGGGCA6A 300 

GGACCCCGGG GA6GAQACTG GAGGAGGCCC 0GA0QTGGCX3 CTGGTGOGGC GCAGGACGCT 360 

CTTCCTAACT CAGGCTCTCC CXGCCCCGCC CXTOCa«5TGC AAGTCCTGQT GGATGGAGCT 420 

CCGGT0CX3CA TTGAGCTCTQ GGACACAOOQ GOAaUSGAGO ATTTTQACOB ACTTOGTTCC 460 

CTTTGCTACC CGQATACOQA TGTCTTCCTG GOQTGCTTCA 6C0TGGTGCA GCCXAGCTCC 540 

TTTCAAAACA TCACAGAGAA ATX3GCTGCCC GAQATCCGCA OGCACAACCC CCAGQOGCCT 600 

GTGCTGCrGG TGOGCACCCA GGCCGACCTO AGGGACGATG TCAACGTACT AATTCAGCTG 660 

GACCAGGGGG GCCGGGAGGG CCXXC3TGCCC CAACXXXAGQ CTCAGGGTCT GGCCaAGAAO 720 

ATCGOAGCCT GCT6CTACCT TGAOTQGTCA 6GCTTGA06C AGAAGAACTT GAAGGAAGTA 780 

TTT6ACTGGG CTATTCTCA6 TGCCATTGAO CACAAAQCCC GQCIGGAGAA GAAACTGAAT 840 
GCCAAAGGTG T6CGCACX:CT CTCC0GCT6C OGCTGaAAGA AOTTCTTCTG CTTOGTTTGA 

Seq ZD NOi 220 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

1 1 I I 1 I 

HPPREliSEAE PPPLRAPTPP PRRRSAPPBL GZKCVLVGDG AVGKSSLZV8 YTCNGYPARY 60 

RPTAIiDTPSQ TYVQSPVRPR G0QGAVHR6A GAGVSAGGRR 6PRG6DWSRP RGGAGAAQDA 120 

IiPNSGSPRPA PAVQVLVDGA PVRZELNDTA GQEDFDRLRS IiCYPDTDVPL ACF8WQP88 180 

FQNZTSKWLP BZRTHNPQAP VLLVGTQADL RDDVNVblQL DQOQREGPVP QFOAQQLABK 240 
ZRACCYXiECS ALTQKMLKEV FDSAILSAZE KKARIiBKKIdT AK9VRTLSRC SWXKFFCFV 

Seq ZD NOi 221 DMA sequence 

Nucleic Acid Accession «t XM_063832.2 

Coding sequence: 1..711 

1 11 21 31 41 51 

I I I I I I 

ATGCOGCCGC GGGAGCTGAQ C6AGGCCGAG CGBCCGC06C TCOGGGCGOC GACCOCTCCC 60 

CCGCGGOGGC GTAG060GCC CCCAGAGCTG GGCATCAAGT GCGT6CTG6T GGGCGACGGC 120 

6CCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCTGCA ATGGGTACCC 0GC6CGCTAC 180 

CG6CGCACT6 CGCTGGACAC CTTCTCTOTG CAAGTCCTGG TGGATOGAGC TC0GGTG06C 240 

ATTGAOCTCT GOGACACAGC GGGACAGGAG GATTTTGACC GACTTCQTTC CCTTTGCTAC 300 

C06GATACCG ATOTCTTCCT GGCGTGCrTC A00GTG6TGC A6C0CA0CTC CTTTCAAAAC 360 

ATCACAGAGA AATGGCTGCC OGAGATCCGC AOGCACAACC CCCAGGCGCC TGTGCTGCTO 420 

GTGGGCACCC AGGCCGACCT GAGGGACGAT GTCAAOOTAC TAATTGAGCT GGACCAGGGO 480 

GGCOGQGAGG GCCCCGTGCC CCAAOCCCA6 GCTCAGGGTC TOGCOQAGAA GATCO GAGCC 540 

TGCIGCTACC TT6AGTGCTC AGCCTTGAOG CAQAAQAACT TGAAOQAAOT ATTTGACTOO 600 

GCTATTCTCA GTGCCATTQA GCACAAAGCC OGGCTGGAGA AGAAACTGAA TGCCAAAOGT 660 
GTQCQCACCC TCTCCCGCTO CCGCTGGAAG AAGTTCTTCT GCTTCOITTO A 

8eq ZD NO: 222 Protein sequence 
Protein Accession ft: XP_063832.1 

1 11 21 31 41 51 

t I 1 I I I 
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MPPRELSBAB PPPLRAPTPP PRRRSAPPBL GIKCVLVGDG AVGKSSLIVS YTQJOYPARY 60 

RPTALETTPSV QVLVDOAPVR lELWDTAGQE DFDKLRSLCY PDTOVPIACF SWQPSSFQN 120 

ITBKHLPBIR THNPQAPVLL VGTQADIiRDD VNVLIQLDQO GREQFVPQPQ AQGLABKIRA 180 
CCYLBCSALT QKNLKSVFDS AXLSAIEHRA RLEKKUOXG VRTLSRCRWK KFFCFV 

Seq ID KOx 223 DNA sequence 

Nucleic Acid Accession ft: Bos sequence 

Coding sequence: 1..1161 



I 11 21 31 41 51 

! I I I I ) 

ATGAATOGGC ACCATCTGCA GOATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

TTCOGAGATG ACTTCATTGC CAAGGTGTTG COGCCGGTGT TGGGGCTGGA GTTTATCTTT 120 

^ . 6GGCTTCT06 GCAATGGCCT TGCCCTGTCG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180 

15 TCCAGCCX3GA TTTTCCT6TT CAACCTGGCA CrTAGCTGACT TTCTACTSAT CATCT G CCTG 240 

CCQTT05TGA TGQACTACTA TGTGCGGCX3T TCAGACTGGA AGTTTGGGGA CATCCCTTGC 300 

CGGCTGGTGC TCTTCATGTT TGCCATOAAC OGCCAGGOCA GCATCATCTT CCTCACGQTG 360 

GTGGOGGTAG ACAOOTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

AATTGGACAO CAQCCATCAT CTCTTQCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480 

2U CACCTCCTGA A6AAGAAGTT QCTGATCCAG AATGGCCCTG CAAATOTGTQ CATCAGCTTC 540 

AGCATCTGCX: ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGOCC 600 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCT6C3GGCA GAQACAAAT6 660 

GACOGGCATO CCAAGATCAA GAGAGCCATC ACCTTCATCA TGOTGGTGGC CATCGTCTTT 720 

^_ 6TGATCTGCT TCCTTCCC A G OGTGGTTQTG CG3ATC0GCA TCTTCTGOCT CCTGCACACT 780 

25 TCGGGCAOOC AGAATTGTGA AGTGTACCGC TCGGTGQACC TGQaJTTCTT TATCACTCTC 840 

AGCTTCACCT ACAT6AAC3^ CATGCTGQAC CCOOTGOTGT ACTACTTCTC CAGCCCATCC 900 

TTTCCCaUVCT TCTTCTCCAC TTTGATCAAC OGCTGCCTCC AGAGGAAQAT GACAGCJTGAO 960 

CCA8ATAATA ACCGCAGCAC GA006TGQAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTCCA6AGG OGTTAATGGC CAACTCOGGT GAGCCATGGA GCCCCTCTTA TCTQOQCCCA 1080 

ACCTCAAATA ACCATTCX:aA 6AAGGGACAT TGTCACGAAG AACCAQCATC TCTGGAGAAA 1140 
CAGTTGGGAT GTTGCATCGA G 



Seq ID NOt 224 Protein sequence 
Protein Accession #i Eos sequence 



1 11 21 31 41 51 

I I ) I I I 

MNmmiiQDHF LBIDKKNCCV FRDDFIAKVL PPVDGLEFIP OLLGNGLALH IFCFHLKSWK 60 

. SSRIPLPNIA VADFLLIICL PPVMDYYVRR SDWKPCTIPC RLVLPMPAMN RQGSIIPLTV 120 

4U VAVDRYPRW HPHHALMKIS NWTAAIISCL LWGITVGLTV HLUOOCLLIQ NQPANVCISF 180 

8ICHTFRWHB AMFLLEFLLP LGZXLPCSAR IXWSUtQRQM DRBAKZKRAI TPIMWAIVP 240 

VICFIiPSVW RIRIFWLLHT SOTQNCBVYH SVDLAPPITL SPTYMNSMLD PWYYPSSPS 300 

FPNFFSTLIN RCLQRKMTGB FDNMRSTSVB LTGDPNKTRO APBALNANSQ BPWSPSYLGP 360 
TSNNBSKKGH CHQEPASIiEK QLGCCIB 



Seq ID NO I 225 DMA sequence 

Nucleic Acid Accession fts Eos sequence 

Coding sequence t 1 . . 1092 

1 11 31 31 41 51 

I I I I I I 

ATGAATOGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTBTG 60 

TTCCGAGATG ACTTCATTGT CAAGGTGTTG COGCCGGTGT TGGGGCTGGA GTTTATCTTC 120 

GGGCTTCTGG GCAATGGCCT TGCCCTOTGO ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180 

TCCAGCOGGA TTTTCCTOTT CAACCTGGCA GTGGCTGACT TTCTACTGAT CATCTGCCTG 240 

CCCTTCCTGA TQQACAACTA TQTGAGaOOT TGGGACTGGA AGTTTGGQQA CATCCCTTGC 300 

OGGCTGATGC TCTTCATGTT GGCTATGAAC OGCCAGGGCA GCATCATCTT CCTGAOQGTG 360 

GTGGOGQTAO ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

AATOGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTATTGG CCTGACAGTC 480 

CACCTCCTGA AQAAGAAGAT GCOSATCCAG AATGG06GTG CAAATTTGTG CAGCAGCCTC 540 

AGCATCTGCC ATACCTTOCA GTGGCACGAA QGCATGTTOC TCCTGGAOTT CnWltJax: 600 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCI600GCA GAGACAAATG 660 

GACOGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGOTGGTGGC CATCGTCTTT 720 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG OGGATCCGCA TCTTCTGGCT CCTGCACACT 780 

TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGOGTTCTT TATCACTCTC 840 

AGCTTCACCT ACATGAACAO CATGCTGGAC CCOQTOGTOT ACTACTTCTC CAGCCCATCC 900 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC OGCTGCCTCC AGAGGAAQAT 6ACAGGTGAO 960 

GCAGATAATA ACCGCAGCAC 6A60GT0GA0 CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTOCAGAOG OGTTAATGGC CAACTCOGGT GAGOCATGGA GOCCCTCTTA TCTQGGCCCA 1080 



Seq ID NOi 236 Protein sequence 

Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

I I 1 I 1 I 

MNRHHLQDHP LBIDKKNCCV FRDDFXVKVL PPVLGLBFIP GItLGllGLALIf IFCPHLKSHK 60 
SSRIFLFNLA VADFLLIICL PFLMDNYVRR WDWKFGDIPC RIMLFMLAMN RQG9XIFLTV 120 
VAVDRYFRW HPHHALNKIS NRTAAIISCL LWGITIGLTV HLLKKKMPIQ NGGANLCS8P 180 
SICHTFQWHB AMFLLEFFLP LGIILFCSAR IIWSLRQRQH DRRAKIKRAI TFIMWAIVF 240 
VICFLPSWV RIRXFHUiHT SGTQHCBVyR SVDLAFFXTL SFTYNKSMLD PWYVPSSPS 300 
PPNFF8TLXN RCbQRKKTGB PDNNRSTSVE LTGDFNKTRG APBALKANSG EPHSPSYLGP 360 
TSP 
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Se<2 ID N0> 227 DMA aequenee 

Nucleic Acid Accession 9: NM_006018 

Coding sequence! 61.. 1224 

1 11 21 31 41 51 

I I I I t I 

CGCCACTTTG CTG6AGCATT CACTAG60QA GG08CTCCAT CX3CACTCACT AGCOGCACTC 60 

ATGAATCGGC ACCATCTGCA GOATCACTTT CTGGAAATAG ACAAGAAQAA CTQCTGTGTG 120 

TTC0GAC3AT0 ACTTCATTGC CAAGGTGTTO OCGCCGGTQT T6GGGCTGGA GTTTATCTTT 160 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ArrrTCTGTT TCCAOCTCAA GTCCTGGAAA 240 

TCCftOOOQGA rrrfC C l t a T CAAOCTGGCA OTAGCTGACT TTCTACTGAT CATCTGCCTQ 300 

qCGTTGBTGA TGGACTACTA TGTGCG G CGT TCA6ACTG6A ACTTT66G6A CATCCCTTOC 360 

CGGCTGQTOC TCTTCATGTT TGCCATQAAC 06CCAGGGCA GCATCATCTT CCTCACGGTG 420 

GTGGOGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 480 

AATTGOACAO CAGCCATCAT CTCTT6CCTT CTBT6GG6CA TCACTOTT66 CCTAACAGTC 540 

CACCTGCT6A A6A AGAA0TT 6CTQAXCCA0 AATO QOCCIQ CAAATSTSTO GATCAOCTTC 600 

AGCATGTOCC ATACCTTGC6 GTGGCAOGAA 6CTATGTTCC TCCTGQAGTT CCTCCTGCCC 660 

CTGGQCATCA TCCTGTTC7G CTCAGCCAQA ATTATCT6QA GGCTGOGOCA 6AGACAAATG 720 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCRTCA TGGTGGTGGC CATCGTCTTT 780 

GTCATCTOCT TCCTTCCCAO CGTOOTTaTO OQGATCOGCA TCTTCTGOCT CCTQCACACT 840 

TC6GGC»CX(C A0AAT7GTGA AGTQTACOBC TOOGTGQACC TGGOGTTCTT TATCACTCTC 900 

AOCTTCACCT ACAT6AACA0 CATGCTGGAC CCC G TOOTGT ACTACTTCTC CAGCCCATCC 960 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCXTTCC AGAGGAAQAT GACAGGTQAG 1020 

CCAGATAATA ACC6CAGCAC GAG06TCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1080 

GCTCCAGAG6 CGTTAATGGC CAACTC066T GA6CCATGGA 6CCCCTCTTA TCTGGGCCCA 1140 

ACCTCAAATA ACCATTGCAA GAAGGQAGAT TGTCAGCAAG AAOCAGCATC TCTGGA6AAA 1200 

CAQTTGGGCr GTTGCATCXSA GTAATGTCAC TGOACTGGGC CTAAGOTTTC CTGOAACTTC 1260 

CAGATTCAGA QAATCTGATT TAGGGAAACT GTGGCAQATQ AOTGGGAQAC TGGTTQCAAG 1320 

GTGTGACCAC AGQAATCCTG GAGGAACAGA GAGTAAAGCT TCTAGGCATC TGAAACTTGC 1380 

TTCATCTCTG A0GCT06CAG GACTGAAGAT GGGCAAATTG TAG6C6TTTC TGCTQAGCAG 1440 

AGTTQQAGOC AGAGATCTAC TTGTOACTTG TTG6CCTTCT TCCCACATCT GCCTCAOACT 1500 

GGGGGGGGCT CAGCTCX:TC0 GGTGATATCT AGCCTGCTT6 TGAGCTCTAG CAGGGATAAG 1560 

GAGAGCTQAG ATTGOAGGGA ATTGTOTTGC TCCTGGAGGA AGCX:CAGGCA TCATTAAACA 1620 

AGCCAGTAGG TCACCTGGCT TCCGTGGACC AATTCATCTT TCAGACAAGC TTTAGAQAAA 1680 

TGGACTCAGG GAAGAGACTC ACATGCTTTG GTTAGTATCT GTGTTTCCGG TGGGTQTAAT 1740 

AGGGGATTAO CCCCA6AAGG GACT6AGCTA AACAQTGTTA TTATOGGAAA GOAAATGQCA 1800 

TT GC TGCrrr CAACCAGOGA CTAATGCAAT 0C3VTTCCTCT CTTGTTTATA 6TAATCTAA0 1860 

GGTTGAGCAG TTAAAAOGGC TTCAGGATAG AAAGCTOTTT CCCAGCTGTT TOOTTTTACC 1920 

ATTAAAAGGQ AAACGTGCCT CTGCCCCACG GGTAGAGGGG GTGCACQTTC CTCCTGQTTC 1980 

CTTOQCTTQT GTTTCTGTAC TTACCAAAAA TCTACCACTT CAATAAATTT TGATAGGAGA 2040 
CAAAAAAAAA A 

Seq ID HOt 228 Protein sequence 
Protein Accession #i NP_006009.1 

1 11 21 31 41 51 

I I I I I I 

MNIUmLQDKF LEIDKKNCCV PRDDFIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLK6WR 60 

SSRIFLPNLA VADFLLIICL PFVMDYYVRR SDWNFGDIPC RLVLFMFAMN RQGSIIPLTV 120 

VAVDRYFRW HPHHALNKZS NWTAAIISCL LHGITVGLTV HLLKKKLLIQ KGPANVCISF 180 

SIGHTFRNHB AMFUUBFUiP XiGXIIiFCSAR IIHSLRQRQM OKHAKIXRAX TFXHWAIVF 240 

VICFLPSWV RIRIFWUar SGTQNCBVyR SVDLAFFITL SFTyMHSMLD PWYYFSSPS 300 

FFNFFSTLIN RCLQRKMTGE FDNKRSTSVB LTGDPNKTRG APBAUfANSG EPHSPSYLOP 360 
TSNNHSKKGR CRQEPASLBK QLGCCIE 

Seq ID NO: 229 UNA sequence 

Nucleic Acid Accession «s NM_014398.1 

Coding sequence i 64 . . 1314 

1 11 21 31 41 51 

I I t I i I 

GQCACOGATT 06GGGCCT6C COGOACTTGG G0GCAGGCT6 CAGAACCTCG CCCAGGGCCC 60 

ACCATGCCCC GGCAGCTCAG CGOGGCGGCC GCXSCTCTTCX3 CGTCCCTGGC OGTAATTTTG 120 

CAOQATGGCA GTCAAATGAG AGCAAAAGCA TTTCCAGAAA CCAGAGATTA TTCTCAACCT 180 

ACTGCAGC3U3 CAACAOTACA GGACATAAAA AAACCXGTCC A6CAACCASC TAAlGCAAOCA 240 

CCTCAOC»AA CTTTAGCAGC AA6ATTCATG GATGGTCATA TCACCTTTGA AACAGGGGCC 300 

ACASTAAAAA TTCCAACAAC TACCCCAQCA ACTACSUUUUl ACACTGCAAC CACCAGCCCA • 360 

ATTACCTACA CCCTGGTCAC AACCCAGGCC ACACCCAACA ACTCACACAC AGCTCCTCCA 420 

GTTACTQAAO TTACAGTCGG CXXTTAOCTTA GCCCCTTATT CAC7GCCA0C CACCATCACX 480 

CCA0CA6CTC ATACAGCIGG AACCAOTTCA TCAACOQTCA 6CCACACAAC TGGGAACACC 540 

ACTGAAOCGA GTAAOCAOAC CftCCC r rC UA GCAACTTTAT OQAIAGCACT GCACAAAA6C 600 

ACAACCGGTC A6AA6CCTGA TCAACCCACC CATGCCCCAG GAAC3U^C0GC AGCTGCCCAC 660 

AATACCACCC GCACAGCTGC ACCTGCCTCC AOGGTTCCTG GGCCCACCCT TQCACCTCAG 720 

CCATCOTCAG TCAAGACTGG AATTTATCAG GTTCTAAAOG OAAGCAOACT CTGTATAAAA 780 

GCAGAGATGG GGATACAGCT GATTGTTCAA GACAAGGAGT 06GTTTTTTC ACCTOGGAGA 840 

TACTTCAAGA TOGAOCCCAA 0QCAA06CAA GOCTCrGGOA ACTGTGQCAC CCGAAAATOC 900 

AACCTTCTOT TGAATTTTCA GGGCGGATTT 6TGAATCTCA CATTTAOCAA GGATGAAGAA 960 

TCATATTATA TCAOTQAAOT GOGAGCCTAT TTQACOGTCT CAOATCCAGA GACAOTTTAC 1020 

CAAGGAATCA AACATQOGGT GGTGATGTTC CAGAC3U3CAG TCX5GQCATTC CTTCAAOTGC 1080 

GTGAOTGAAC AGAGCCTCCA CTTGTCAGCC CACCTGCAGG TGAAAACAAC GGATGTCCAA 1140 

CTTCAA6CCT TTOATTTTGA AGATGACCAC TTTGQAAATQ TGGATGAGT6 CTOCTCTGAC 1200 

TACACAATTG 1QCTT0CTOT GATTGGGGCC ATO Q T OO TIO GTCTCXGCCT TATGGOTATG 1260 

GGTOTCTATA AAATGOGCCT AAOQTGTCAA TCATCTGGAT ACCM2A0AAT CTAATTGITO 1320 

CCGGOQGGOA AT6AAAATAA TQGAATTTAO AGAACTCTTT CATCOCTTCC AGGATGOATQ 1380 

1040 



wo 03/042661 



PCT/US02/36810 



TTGGQAAATT CGCTCAGAOT GT G G G TCCTT OVAACAATOT AAAGCACCAT CTTC TATTCA 1440 

AATGAAOTGA GTCATGTGTQ ATTTAAGTTC AGGCAGCACA TCAATTTCTA AATACTTTTT ISOO 

GTTTATTTTA TGAAAGATAT AGTGAGCTGT TTATTTTCTA GTTTCCTTTA GAATATTTTA 1560 

GCCACTCAAA GTCAACATTT GAOATATGTT GAATTAACAT AATATATGTA AAGTAGAATA 1620 

5 AGCCTTCAAA TTATAAACCA AGGGTCAATT GTAACTAATA CTACTQTGTG TG CATTG AAG 1680 

ATTTTATTTT ACXXriTGATC TTAACAAA6C CTTTGCTTTG TTATCAAATG GACTTTCAGT 1740 

GCTTTTACTA TC imt i TT'l'T ATGGTTTCAT GTAACATACA TATTCCTGGT GTAGCACTTA 1800 

ACTCCTTTTC CACTTTAAAT ' n Xjrr mtJ 'A' TTrTTGAGAC GGAGTTTCAC TCTTGTCACC 18S0 

CAGGCTGGAG TACaCTGGCA CX5ATCTCGGC TTATGGCAAC CTCCGCCTCX: CGG6TTCAAG X920 

10 TGATTCTCCT GCTTCAGCTT CCOGAGTAGC TGGGATTACA GGCACACACT ACCACX3CCTG 1980 

GCTAATTTTT GTATTTTTAT TATA6AGGGG TTTCACCATO TTGGCCA6AC T0GTCTT6AA 2040 

CTCTTGACCT CAGGTGATCC ACXX3VCCTCA GOCTCCCAAA GTGCTGGGAT TACA6GCAT0 2100 

AGCCATTGCG CCOGGCCTTA AATGTTTTTT TTAATCATCA AAAAGAACAA C ATATCTCA G 2160 

GTTGTCTAAG TGTTTTTATG TAAAACCAAC AAAAAGAACA AATCAGCTTA TATTTTTTAT 2220 

15 CTTGAT6ACT CCTGCTCCAG AATTGCTAGA CTAAGAATTA GGTGGCTACA GATGGTAGAA 2280 

CTAAACAATA AOCAAGAGAC AATAATAATG GOCCTTAATT ATTAACAAAG TGCCAGAQTC 2340 

TAGGCTAAGC ACTTTATCTA TATCTCATTT CATTCTCACA ACTTATAA6T GAATGAGTAA 2400 

ACTGAGACTT AAGGGAACTQ AATCACTTAA ATGTCACCTO GCTAACTGAT GGCAGAGCCA 2460 

GAGCTTGAAT TCATGTTGQT CT6ACATCAA GQTCTTTGQT CTTCTCCCTA CACCAAGTTA 2520 

20 CCTACAAOAA CAATGACACC ACACTCTGCC TQAAGGCTCA CACCTCATAC CaUSCATACGC 2580 

TCACCTTACA GGGAAATGGG TTTAT0CA6G ATCATGAGAC ATTAjQGGTAO ATGAAASGAG 2640 

AGCTTTGCAO ATAACAAAAT AOCCTATCCT TAATAAATCC TCCACTCTCT GQAAGGA6AC 2700 

TGAGGGGCTT TGTAAAACAT TAGTCAGTTQ CTCATTTTTA TGGGATTGCT TAGCTGGGCT 2760 

GTAAAGATGA AGGCATCAAA TAAACTCAAA GTATTTTTAA ATTTTTTTGA TAATA6AGAA 2820 

25 ACTTCGCTAA CCAACTGTTC TTTCTTGAOT GTAT AGCCC C ATCTTGTGGT AA CTTGC TQC 2880 

TTCTGCACTT CATATCCATA TTTCCTATTO TTCACTTIAT TCTGTAQAGC A6CCTGCCAA 2940 

OAATTTTATT TCTOCTGTTT TTTTTGCTGC TAAAGAAAGG AACTAAGTCA GGATOTTAAC 3000 

AGAAAA6TCC ACATAACCCT AGAATTCTTA OTCAAGGAAT AATTCAAGTC ASCCTAGAGA 3060 

CCATGTTOAC TTTCCTCATO TGTTTCCTTA TGACTCAGTA AGTTGGCAAG GTCCTGACTT 3120 

30 TAGTCTTAAT AAAACATTGA ATTGTAQTAA AGGTTTTTOC AATAAAAACT TACTTTGG 

Seq ID NO: 230 Protein sequence 
Protein Acceeslon «t NP_055213.1 

35 1 11 21 31 41 51 

I I ) ) I I 

MPRQLSAAAA LPASLAVILH DGSQMRAKAP PBTRDYSQPT AAATVQDIKK PVQQPAROAP 60 

KQTLAARFHD GHITFQTAAT VKIPTTTPAT TKNTATTSPI TYTLVTTQAT PNNSHTAPPV 120 

TEVTVGPSLA PYSLPPTITP PAHTAGTSSS TVSHTTGNTT QPSNQTTIjPA TLSIALHKST 180 

40 TGQKPDQPTH APGTTAAAHN TTRTAAPAST VPGPTLAPQP SSVKTGIYQV IWGSRLCIKA 240 

EMGIQLIVQD KESVFSPRRY FNIDPNATQA SGNCX5TRKSN LLUIFQGGFV ULTFTKDBBS 300 

YYISBV6AYL TVSDPETVYQ QIKHAWMFQ TAVKISPKCV SEQSIiQliSAH LQVKTTDVQI* 360 
QAFDPBODHP GNVDBCSSDY TIVLPVI6AI WGLCLMGMQ VYXIRIiRGQS SGYQRI 

45 Seq ID NO: 231 DNA flcquence 

Nucleic Acid Accession tft NM_O0S409.3 
Coding sequence : 94 . . 3 7 6 

1 11 21 31 41 51 

11)111 

TTCXTTTTCAT GTTCAQCATT TCTACTCCTT CCAAOAAGAG CAGCAAAGCT GAAGTAGCAG 60 

CAACAGCACC A6CAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGIGC TACASTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

T G TCTTTGCA TAGG0CCTQ6 G6TAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCXTTCC 240 

ATAATGTACC CAAOTAACAA CTOTGACAAA ATAGAAGTQA TTATTACCCT GAAAGAAAAT 300 

AAAQGACAAC GATGCCTAAA TCCCAAATOG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

6AAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGQAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACr GTOACXACXO AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT AT6GTTTTST GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGOT 540 

GGOTQAAAGO ACCAAAAACA QAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAO GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAQAAAG6CT 660 

GGTTACCATC G6AGTTTAC3V AAGTGCTTTC ACXSTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

GAGAACATTT CTOTCTCrAG AAGTTATCTG TCTOTATTOA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCXCTTAT AAGTCRAAAG 900 

CATCTATGTG TOGTAAAQCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATAT6TAG GQAAACATTC TTATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTGATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATA06CTA 1080 

TGGGATACTG 0CAACA8TOC ACATATTTCA TAACCAAATT AGCAOCACOG GTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATT AGGA TAT GTG TGTTT AC 1200 

TGTACTTTTT OTTTTGATCX: OTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAA6AAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTOTA ATTCTOTCTC TTAGAAAAAT ACATA ATCT A ATCAATTTCT 1380 

TPGTTCATGC CTATATACTO TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACXaCSWl CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID MOt 232 Protein eequence 
Protein Accession 0i 1IP_005400.1 

80 1 11 21 31 41 51 

I 1 I I I I 

MSVKGMAIAL AVILCATWQ GPPMPKRGRC LCIGP6VKAV KVADIBKASI MYPSNNCDKI 60 
BVIZTLKENK GQRCLNPXSK QARLIIKKVB RKNF 



1041 



wo 03/042661 



PCT/US02/36810 



8eg ID NO: 233 DMA sequence 

Nucleic Acid Accession «t NM_000577.1 

Coding sequence: 41.. 520 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



QQCAOGAGGQ 
CGGACCCTCT 
GAAGAOCTTC 
CAATTTAGAA 
CCATGGAOGG 
GGAGGCAGTT 
CATCCGCTCA 
CCTCTGCACA 
OGTCATGGTC 
TCCCATTCTT 
GCTATGGGGG 
CCTGGTCACA 
GTCTTTCTAA 
TCTGCATTCA 
CTTCCTCCCT 
ACCAAGT<3GC 
TTTAAGGGTT 
QAAGGAGAGC 
ATTOCTGOIT 

ATTTTTTTTT 
CCCAGCCTOC 
TCCTATCAGA 
TCTTCCTCro 
ACTTGTATGA 
GAGCAOGAAA 
CTCTT6GCAG 
CTGTQACTTC 
AAAAAAAAAA 



11 

I 

GAAGACCTCX: 
GGGAGAAAAT 
TATCTGAGGA 
GAAAAGATAG 
AAGATGT6CC 
AACATCACTG 
GACA6TGGCC 
GOQATGQAAG 
ACCAAATTCT 
GCAT6GCAAG 
CACTGAGGAC 
GGACTCTGCX: 
TGTX3TGAATC 
GGATCAAACC 
CATTCCACCT 
TCCCACACCC 
TGTGGAAAAT 
CCTTCATTTG 
TTQTGAAATQ 
ATQTCCCAAC 
TCCTTTTAAA 
AAGCTCCATC 
AGTTTCTCAG 
CTGAAG6AAT 
AA6ATGGCT6 
CATGACT06T 
GTACTCAGOQ 
AGCrCTOTTT 
AAAAAAAAAA 



31 

I 

TGTCCTATCA 
CCAGCAAGAT 
A CAACCAA CT 
ATOTGGTACC 
TGTCCTGTGT 
ACCTQAGOQA 
CCACCACCAO 
CTQACCAOCC 
ACTTCCAGGA 
GACTOCAGGG 
CAGCCATTGA 
TCCTCTTCAA 
AGAGC3VCAGC 
COQACCACCT 
TCCCATGCCC 
TGTTTTACAR 
GAAAATTAGG 
GAGATTATGT 
ATGOTGAAAO 
TT6TAAAAAT 
ACACTTCCAT 
TCCACTCCAQ 
CTCCCAAGOC 
AAATTGCTCC 
TGCCTCT6GC 
ATATGTCTCA 
AATQAATGCT 
TACAATAAAA 
AAAAAAAAAA 



31 

I 

GOCCCTCCCC 
GCAAGOCTTC 
AOTTQCCQGA 
CATTGAGCCT 
CAAGTCTGGT 
GAACAOAAAO 
TTTTGAGTCT 
COTCAOCCTC 
GGACGAGTAG 
ACTGCCAGTC 
GGGGTGGACC 
CTGACXIAGCX: 
AGCCCCTGCA 
GCCCAACCTQ 
TGQATCCATC 
AAAAGAAAAG 
ATTTCATGAT 
TCTTTOGGGG 
TAAGTGQTAG 
TAAAAGTTAT 
AATCTGGACT 
ATTTTTTACA 
TCTQAGCAAA 
TTGACATT6T 
TGTCTOOCCC 
6GTC0CTGCA 
GTATATGTTG 
TCTTGAAAAT 
AAAAAA 



41 

I 

ATGGCTTTAG 
AGAATCTGGG 
TACTTO CAAO 
CAT6CTCT0T 

GATGAGACCA 
CAGGACAAGC 
GCCGCCTGCC 
ACCAATATGC 
TACTGCCCAG 
CCCCTGCCCC 
CTCAGAAGGC 
TCCATGCTGC 
CAAAGCCCTT 
CTCTCCTCTT 
AGGCCACTTG 
ACCAGTCCAT 
TTTTTTTTTT 
AGAGGCTGAG 

cmrcccTT 

GOTACTATGT 
CCTCTGTCCA 
GCTGCCTGCA 
TGTGGCTCCT 
AGAGCTTCTG 
ACCAGGCTGG 
GGGCCAAGCA 
GGTGQU^AGT 
6CCTAAAAAA 




CTQACQAAGG 
GCCTGCCTGT 
AGGGCTCCCG 
GTCACAACAA 
CTCCAGAATG 
CCATOTCGCC 
GCCACTGCCT 
ATGACCCCCA 
GAGGGAGGTT 
CAGTCCCCX3T 
GACTTAAAAT 
CTTTTTCTTC 
TAGCCCCATA 
GGCACTGCTG 
GTACTTTACC 
GGGGGTTCTT 
GCACTTGGAG 
GAGCTCTGCA 
CCTAGCCTCG 
TCCCTACTTC 
AAAAAAAAAA 



60 
120 
160 
240 

300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



Seq ZD NO: 234 Protein sequence 
Protein Accession #t NP_000568.1 

1 11 31 31 41 51 

1 I I I 1 ) 

MAt^nCRPS GRKSSKMQAP RZWDVNQKTP YIiRNNQLVAO YLQGPNVNLB BKIDWPXBP 
HAIiFI/SIKGG KMCLSCVKSO OETRLQLBAV NXTDLSEHRK QDKRPAFXRS DSGPTTSFBS 
AAGPGHFLCT AMBADQPVSL TNMPDB6VMV TXPYPQEDB 

Seq ZD NOi 335 DMA sequence 

Nucleic Acid Accession fti NM_001840.1 

Coding sequences 149.. 1567 



GGGOACTAOG 
TCCCGAGGAC 
6AGCTCAGCC 
CACCTTCOQC 
AGACATCAAA 
TTCCTTTAGG 
CrCAGCAGAC 
GAATGAGGAQ 
GAACCCCAGC 
CCTQGAGAAC 
TTOCTACCAC 
CTACAGCTTC 
CAAACTGGGT 
CATGGACAGO 
CAA06GC6TG 

TGATGAAACC 
GTATGOGTAC 
TGOCAGCCAO 
GCCftG ACCAA 
GTTQATCATC 
GAACAAGCTC 
OGTGAACCCC 
QTTTCCCTCT 
CCTGCACAAA 
GA6CA06GTC 
TCTGTGAGCC 
ATCTAGAAGA 
AAGGTGATT6 
TOC3TA66AAA 



11 
I 

GAGAGCTCTQ 
CAGGGGATGC 
TAATCAAAQA 
ACCATCACCA 
GQTQACATGS 

CAGGTQAACA 
AACATCCAGT 
CAGCA6CTGG 
CTCCTGOTGC 
TTCATCG6CA 
ATTGACTTCC 
GGGGTCACGG 
TACATATCCA 
GTGGOGTTTT 
TGGAACTGCO 
TACCTGATGT 
ATGTATATTC 
AAOAGCATCA 
GCCG6CATG0 
TGCTG6Q0CC 
ATTAAGACGG 
ATCATCTATG 
T6TGAJVGGCA 
GAOSCAAACA 
AAGATT6CCA 
TGATGCCTCC 
GTCTATTGTC 
CCACAT6TCA 
OCTTT 



31 
I 

CAGGGAGCOO 
6AAGGGATT6 
CT6AGGTTAT 
CTGACCTCCT 
CATCCAAATT 
TCOUU3A6AA 
TTACAGAATT 
GTGGGGAGAA 
GCATTGCAGT 
TQTGGGTCAT 
GCCTG6CG6T 
A0GT6TTCCA 
CCTCCTTCAC 
TTCACAGG^ 
OCCTOATGTG 
AGAAACTGCA 
TCTGGATOQG 
TCIGQAAGGC 
TCATCCACAC 
ACATTAOOTT 
CTCTGCTTGC 
TGTTTGCATT 
CTCTGAGOAG 
CTGCGCAGCC 
AT8CAGCCA0 
AG6TAACCAT 
CTGGCAGCAC 
TCCTTGGTTA 
CTTATTTGCT 



31 



AGGCCCCCGC 
CCCCCTGTGG 
GAAGTCGATC 
GTACGTGGGC 
AG8GIACTTC 
GATQACTGOG 
TTACAACAAG 
CTTCATGQAC 

OCTCCACTCC 
G6CAGACCTC 
COGCAAAGAT 
TGCCTCOGTG 
CCTQGCCTAT 
QACCATA6CC 
ATCIOTTTGC 
GGTCACCAGC 
TCACAGGCAC 
GTCTGAGQAT 
AOCC AAOACC 
AATCATGGTQ 
CTGCAGTATG 
TAAGGACCTG 
TCTGGATAAC 
TGTTCACAOG 
QTCTGTGTCC 
AGGAAAAGAA 
TATTTTTTTA 
TAGTTTCCGT 



41 

I 

CCGGGCCAAG 
GTCACTTTCT 
CTAGATGGCC 
TCAAATGACA 
CCACAOAAAT 
GGAGACAACC 
TCTCTCTCGT 
ATAGAGTQTT 
ACGCT6G6CA 
C8CAGCCTCC 
CIOGGGAGT6 
AGCOGCAACG 
GGCAGCCTGT 
AAQAOQATTQ 
ATTGTQATCG 
TCAGACATTT 
GTACTGCTTC 
GCCGTCGGCA 
GGGAAGGTAC 
CrGOTCCTGA 
TATGATOTCr 
CTCTGCCTGC 
CGACACGCrr 
AGCATGGOOG 
GOOGCAGAAA 
ACAGACAOGT 
Trri ' lT ' lTiT 
ACTTTACCAT 
TTGG6CTAAT 



60 
120 



51 
I 

GGA6CTTCTG 
CAGTCATTTT 
TTGCAGATAC 
TTCAGTAOGA 
TCCCITTAAC 
CCCAGCTA6T 
CCTTCAAGGA 
TCATGQTCCT 
CCTTCACGGT 
OCTGCAGGCC 
TCATTTTT6T 
TGTTTCTGTT 
TCCrCACAGC 
TCAGCAOGCC 
OCOTGCT OC C 
TCGCACACAT 
TOTTCATOOT 
TGATTCAGCG 
AGGTQACCOG 
TOCTGGTOCT 
TTGGGAAGAT 
TGAACTCCAC 
TCOGGAGCAT 
ACTOGGACTO 
GCTGCATGAA 
CTGCGGAGGC 
TAAGCTCAAA 
GCTCAATGAA 
CTTCCGGG6T 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1300 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NOt 336 Protein sequence 
Protein Accession ii NP 001831.1 



1042 



wo 03/042661 



1 11 21 31 41 51 

1 I I I I I 

MXSXLOGLAD TTFRTITTOL LYVQSin>ZQY EDIKGDHASK L6YFPQKFPL TSFRGSPFQB fiO 

KMTAGDNPQL VPADQVNITE PYKKSLSSFK QIEENIQaSB NFMDIECFKV LNPSQQLAIA 120 

VLSLTWrrPT VLENLLVLCV ILHSRSLRCR PSYHFIGSLA VADLLGSVIP VYSPIDFHVP 180 

HRKDSRNVPL PKLGGVTASF TASVGSLFLT AHJRYISIHR PLAYKRIVTR PKAWAFCLM 240 

WTIAIVIAVL PLLGWNCEKL QSVCSDIPPH IDETVLMPWI GVTSVLLLFI VYAYMYILWK 300 

AHSHAVRMIQ RGTQKSIIIH TSEDGKVQVT RPDQARMDIR LAKTLVLILV VLIICWGPLL 360 

AIMVYDVFGK MNKLIRTVPA FCSMLCZ»L!T8 TVNPIIYALR 8KDLRHAFRS MFPSCEGTAQ 420 
PLDNSK6DSD CLRKHA29HAA 8VHRAABSCZ KSTVXIAKVT NSVS1DTSAB AL 



Seq ID NO: 237 DNA sequence 

Nucleic Acid Accession i: NMj016083.2 

Coding sequence; 64..14B2 ~ 



1 11 21 31 41 51 

) I i i I I 

GATTGCCCCC TGTGGGTCAC TTTCTCAGTC ATTTTGAGCT CAGCCTAATC AAAOACTOAa 60 

GTTATGAAGT CQATCCTAGA TGOCCTTGCA GATACCACCT TCOGCACCAT CACCACTGAC 120 

CTCXrrGTAOG T66GCTCAAA T6ACATTCAS TAOGAAOACA TCAAAGGTGA CATGOCATCC 180 

AAATTAQGQT ACTTCCCACA GAAATTCCCT TTAACTTCCT TTAGGG6AA6 TCCCTTCCAA 240 

6AGAAGATGA CT6CGGGAGA CAACCCCCAO CTAGTCCCAG CAOACCAGGT GAACATTACA 300 

GAATTTTACA ACAAGTCTCT CTOGTCCTTC AAGGAGAATG AGQAQAACAT CCAGTGTGGG 360 

GAGAACTTCA TGGACATAGA GTGTTTCATO OTCCTGAACC OCAOCCAOCA GCTQGCCATT 420 

GCAGTCCTGT G0CTCA08CT GGGCACCTTC ACGOTCCTGO AGAAOCTCCT GGTGCTQTGC 480 

GTCATCXrrCC ACTCC06CA0 O CTCOgCTGC AGGCCTTCCT ACCACTTCAT GQGCAGCCTG 540 

GCGOTOGCAG ACCTCCTG6G GAGTQTCATT TTTGTCTACA GCTTCATTGA CTTCCAOGTQ 600 

TTCCACC3GCA AAGATAGCCG CAACOTQTTT CTGTTCAAAC TGGOTGGGGT CAOGGCXTTCC 660 

CTCACTGCCT CGGTGGGCA6 CXnXSTTCCTC ACAGCCATOG ACA6GTACGT ATCXIATTCAC 720 

AGGCCCCTGG GCTATAAGAO GATTOTCAOC A6GCCCAAG0 G08TG6T6GC tfrmXiCCm 780 

ATGTG6ACCA TAGCCATT6C OATOGC O QTG CTGCCTCTCC TGGGCTGQAA CTOGQAOAAA 840 

CTGCAATCTG TTTGCTCAGA CaTTTTCCCA CACATTGATG AAACCTACCT GATGTTCTGG 900 

ATOGGGGTCA CCAGCGTACT GCTTCTGTTC ATCGTGTATQ OGTACATGTA TATTCTCTGQ 960 

AAGGCTCACA GCCA0QCXX3T COGCATGATT CAGCOTGGCA CCCAGAA6AG CATCATCATC 1020 

CACAOQTCTO AGGATGGQAA GGTACAOOTO ACOOGGCCAG AOCAAGCCOO CATGGAGATT 1080 

AGGTTAGCCA AOACCCTGQT CCTGATCCTG GTGGTGTTGA TCATCTQCTO 66GC0CTTT0 1140 

CTTGCAATCA TGGTGTATGA TGTCTTTOGG AAGATGAACA AOCTCATTAA QAOGGTGTTT 1200 

GCATTCTGCA GTATGCTCTG CCTGCTQAAC TCCACXX3TGA ACCCCATCAT CTATGCTCTO 1260 

AGGA6TAAGG ACCTGGGACA CGCTTTCCGQ AGCATGTTTC CCrCTTGTGA AGGCACT6CG 1320 

CAGGCTCTGG ATAACAGCAT GGG06ACTOO QACTGCCTGC ACAAACAOGC AAACAATGCA 1380 

GCCAGTQTTC ACAGGGCOOC A0AAA6CTGC ATCAA6AGCA GGOTCAAOAT TGCCAAGGTA 1440 

ACCATGTCTG TGTCCACAGA CAOGTCTGCC GAGGCTCTGT GAGCCTGATO CCTCCCTGGC 1500 

AGCACAGGAA AAQAATTTTT TTTTTTAAGC TCAAAATCTA GAAQAQTCTA TTGTCTCCTT 1560 

GGTTATATTT TTTTAACTTT ACCATOCTCA ATGAAAASGT GATTOTCACC ATGATCACTT 1620 

ATCAOTTTGC TAATGTTTCC ATAGTTTAGG TACTCAAACT GCATTCTGCA GGGOTTTACA 1680 

GTGAASAAAO C C TGTTGTT T AAGTQACTQA ACQATCCTTC AAAGTCTGAA TQAAATAGGA 1740 

GGGAAAGCTT TGGCTACACA ATTGGAAGTC TAAGAAOCCA TGGAAAAATO CCATCAAATO 1800 

AATAATGCCT TTOTAACCAC AACTTTCACT ATAATOTGAA ATQTAACTGT CCGTAGTATC 1860 

AGAGA TGTCC ATTTTTACAA GTTATA6TAC TAGAOATATT TTGTAAAATG TATTATOTCC 1920 

TGTQAGATQT OTATCAOIGT TTAXGTQCrA TTAATATTTO TTTAOTTCAO CAAAACTGAA 1980 

AGGTA6ACTT TTATGAGAAC AATGGACAAO CAGTGGATAC GTGTCAATOT GTGCACTTTT 2040 

TTTCTATATT ATTGCCCATG ATATAACTTT AGAAATAAAC CTTAATATTT CTTCAAATAT 2100 

CTCTATTTAA TTTTGACACT GAAATAACCG TAAAGGTTTA TTTTTCTQTT ACCTCAACAA 2160 

GAAGAATTTG AAQACTTCAA AATATTGA6C AGAATTCATT CATACTTAAA AATTTATTAG 2220 

CCCTGCATTT TCATAGGAAG ACACATTATC TTCTGGACrA TAOCIGTTCT AATGGATTAT 3280 

AATC AGAAT G OAAGAGAGAA AGCATATTGA CTTTTTTTGA GC6ACATCTC TGACrTTCTT 2340 

TAGTCTTTAG CTATTACTGG ATCTCTTAAG ACAOCATGTG TTAATCTTAA TGTATATCGT 2400 

TATCACTGTG CAGTTGCTGT TTACTTGAAT AGTATTGTGT TOCTATATTC CAGQTTTAAQ 2460 

TAGATTTCAT GCCTOGQTGG CCAAACAACA OTCTTCATTT TTTTTAATTG AAAAGAAGTA 2520 

GTGTCTGOAT CAGTAAAATT ATACTGTGTO TSAOTGTGAA TATAAATGTG TQT A TOTOTO 3580 

TTTCTQ TCCQ TAACTGTTAC AGTAATGTCA TAAAGTGAGA AAACTGTQAC CAAGTATAAA 2640 

CTTTTACCAC TTGCTGCACT CTTOCACATG GATTCAGTTT CTAAAATTGA OTTCTTCCTG 2700 

TAATCTTOTT GATAAAAATA CTGACTCCAA CCATTCAAAA ATTTCACCCC ATCCCTCCTT 2760 

AAGA6ATTGG ATCAAGTATT ACTAAATTQA CCTTTAGGTA TTACACAAGA CCAGTQCTTA 2820 

GCAAAAAATA ATGACAGGCA TGGAAGGAAO OGATGTATTT GTAOTGTTAT T6CCAGGAAA 2680 

GGAGAGTACT T T GQl"ITCrU AGCACCGAAT ATTGAOCAAT ATGTCAGTCA CTAAAAGGAA 2940 

GACAOTTCTA CAGAAAAACA AATGGTAACA TTTTTCAATA GOGTGTGTAG ATAGTATQCA 3000 

CTATATACAT CACGTTAAAG TAGQACTATC ACACCCAGCC CATGTGGCTA AAAAAGCTGA 3060 

ATCAGACA6T GGATGAGACA CACAAG6GCA 6TGAAGAACC GATACACTTG GCATTOACGT 3120 

CTAGCTATOC TGTATCTOTG CTTTGCCCAC AT800CTTGG TGAGAGCTGA GCACCCAGCT 3180 

CTOTCTTGGT AGOTTTGGGC TAAGGAACAA ATCTCTCCTT TOCTOGTGGT TAGCAAOATA 3240 

CACTCAAGCA TGAAGATAAA CACA6CTGCT TTCTTCTTAC ACCCC60TCT CATGCTCCTT 3300 

AATGGCGCCA TGGGTGCTTO TTGOQCCTTT TTCCAQTAAQ QAATGATATT GCTGAAQAAT 3360 

CTACTTAACC CTGACAAATT TTAATTATAA TCTCTTCTTA TACAGATAAA ACATGACTCC 3420 

TACAAOGCCC CAAG6TTTAC ATAOTCIGAA GTGAAOTAai GAOCTGGCAT CT A TCTOGTG 3480 

ATTTCTAGCT CTCGAGATAC CCAA6CAGCC TGATGGOGCA G'iTCC C Cn'C TTAOGGTTCA 3540 

CGCTCTAAGG CAGQATOTGG CTTATGAGAT ACTTTGCATT GTCTOTCTOC ACACCTTQAA 3600 

TCTGCCTGCT GGCTCCCTTA CTTTACCTCT CTGTCATGTG CAGATGAAGO CTCAGGQTOC 3660 

TAGAGQATTA GTAAGATCTC TTTCTAAAGA CAGGAOAGAT TATTTACAA6 AAGAACTCAC 3720 

CAGGGTTTA6 TTT6CATTTA AGAATTGCCA GTCTTTTQTC CTGCATCATC TTQAACATTA 3780 

ATGCACATGT TTCAGAG CTC ACCAGGGAGT ACCAATGCTC TTTTCACAGC TA1GAAGAGC 3840 

TAGAGAAATT CTTGTTATG6 TAGAAAAATT TCAOGOTTCA TTTTTGAAAC TOCATTTGTO 3900 

0GTAT6CAGT QTAGATTTTA TAgTGTGTT G TGCTTTCAAG ATCTAAATOl TATATAATAA 3960 



1043 



wo 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



ATTAAGGGAC AATOOCXKrTG ACAGCACTAA 
TCT6TGAAAT ATCATCAOGT AT6TTATACA 
AGATTCACTT TGACACTTTT CATATCATTT 
AATGAATATA CTCATTAGAA TTACCATTTG 
TAGATCCATT AATTTAAATQ ATTTAAATTT 
OTATCTTACT TTCCTCTGAO GATGATGTAC 
ATGTTCAGAA AG6GCCAAAT TCCCAACCTG 
AATCAGTCCT ACAATOTTTC ATTTGCACAA 
TTATTTTGTA TGAAATATAT QTOAAAGGAT 
CCTACACTTG AGATQATTTC CAAOCCTCTC 
AAATGCCAAG GACCTTTT6C TGCCTAAATC 
ATGAGAATTA GGCCCTGGAT GAOTAG06CT 
CATTTCATGT CCATAAAAGA GACCACCCAT 
TAACTOTATA TTTGTATGAT ATTTTAAAAT 
ATTAATTGTC TTGCACTGGC CTTCTGATGA 
AAAAACATTC TATCTACTGA TTTGGGCTQA 
AT G TTT G AGC AglG G CXTf A C AAATCAOTAA 
CGTGGCATCT TAAAAGCTAT CTTCATGTAA 
AGTTCCCAAG AAAGQAAACC TTTTCTTGTA 
AGGGGGAAAC ATGACTCTTT ATTGTCTGTA 
GAATATTGTA TTGTTAOATG TTTGTTOAGC 
TTCGTTAGTA ATCTGTATAA TACTTTGTAT 
AGCTTCAGTC ATTAAATTAC TATAGCAAAO 
TTAAOCCCAC AGTATATTTT ATTTCAATGT 
ACATCTCATC AT6TCTATTG TCCAAAGTTA 
ATGAACBUiCA TA 



ACTT0GT6CT 
ACCTTCATTT 
CTTAACCCAA 
TTAATATCAC 
AA6TAAGTTT 
TTGCCCT6AC 
CTCATTTTTT 
GTAGG6CTGC 
ATGAATCTGA 
TGGCACTTTG 
CACTCTGCAO 
ATAGTTACT6 
ATCATGCACA 
CTCCTAAATG 
AATGTTAACA 
ATGTATGTAA 
TTTTOGOOTG 
ATTGACTGTA 
TCTGGATTCA 
AATCTAACAT 
TQ6TAACATC 
ACAAGTACTO 
TAOTACTTCT 
AATTAAACTG 
CCTGQAATCA 



TATTGATATT 
AAAAAGGTTT 
GTGACXSAAAA 
TCATTAATTA 
TATAAGGTCT 
CAT6CATTTT 
TTTTTATCAO 
CTCCAAGAGO 
GAGATGCTGT 
AGTTAAGTCT 
OAAATAOGOC 
TCCTGTTGAT 
CAATTAGATT 
CTGGGCAATG 
ATGCCTATTG 
ATAGGTTTCT 
GGA6A6TTTC 
CTA06CCTAC 
AATTTATTTC 
TATTACTTTT 
GTTGCAACCA 
GTAAQATTGT 
TCTGTAATAT 
TTAACTTATT 
AATAAAAATT 



CTAAGAAATA 
AAAACTAGTT 
CATTGTCCCC 
ACCOCATAAT 
GACATCAOAG 
ACCATCACAC 
A6TCATGAT6 
AACCTCTGAT 
AGACATCTOT 
ATCTGGTATT 
CAACCACCAG 
TAATTTCTGC 
TCTCACACTC 
GCTATTAACA 
TAATATAQAA 
AAAAAGTCAG 
TTTACATTGC 
TGGGGATCAG 
CAATGTTTCA 
CCTCTTAGAA 
CT6CAATATC 
TATTAAATOT 
TTACAATGTA 
CAAA6AGAAA 
CTAGATTACC 



4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
S2B0 
5340 
5400 
5460 



Seq ID NOt 238 Protein Bequence 
Protein Accession «t NP_0S7167.1 



1 
1 

MKSILDGLAD 
KMTAGDNPQL 
VLSLTt/jTFT 
HRKDSRKVFL 
HTIAIAZAVL 
AHSHAVRMIQ 
AXMVYDVFGK 
PLDNSKGDSD 



11 
I 

TTPRTITTDL 
VPADQVNITE 
VIiEKLLVLCV 
FKLGGVTASL 
PLLGWIiCBXL 
RGTQRSIXIH 
MNKLXRTVPA 
CliHfCHANNAA 



21 
I 

LYVGSNDIQY 
FYNKSLSSFK 
XLHSRSLRCR 
TASVGSLFLT 
QSVC8DXFPB 
TSEaXSKVQVT 
FCSMLCLLKS 
SVHBAAESCl 



3X 
I 

EDXKGDMASK 
ENEENXQC6E 
PSYHFIGSLA 
AXDRYV8IBR 
XDETYLMFWX 
RPDQARNDIR 
TVNPIIYALR 
KSTVKXAKVT 



41 
I 

LGYFPQKFPL 
NFMDIECFMV 
VADZiLGSVXF 
PXiAYKRIVTR 
GVTSVLLLFX 
LAKTLVLILV 
SKDLRHAFR8 
MSVSTDTSAE 



51 

1 

TSFRGSPFQB 
LNPSQQLAIA 
VYSFXDFHVF 
PKAWAFCLM 
VyAYMYXLNX 
VLXI CWGPL L 
HFPSCEGTAQ 
AL 



60 
120 

xeo 

240 
300 
360 
420 



Seq ID NOt 239 DNA secmence 

Hucleic Acid AccesBion fts !m_033181.1 

coding sequences 17.. 1252 



1 

I 

ATGAAOTCQA 
CTGTACGTGG 
CGAGTGTGGO 
GCTGGCCATT 
GGTOCrOTGC 
CGGCAGCCTG 
CTTCCACX3TG 
CACX3GCCTCC 
ATCCATTCAC 
GTTTTGCCTO 
CTGCGAGAAA 
GATGTTCTGQ 
TATTCTCTGG 
CATCATCATC 
CATGGACATT 
GGGCGCTCXO 
GAOGGTGTTT 
CTATGCTCTQ 
AGGCACTGOG 
AAACAATGCA 
TOOCAAGGTA 



11 
I 

TCCTAGATGG 
GCTCAAATGA 
GAGAACTTCA 
GCAGTCCTGT 
GTCATCCTCC 
GCGGTGGCAG 
TTCCACOGCA 
TTCACTGCCT 
AGGCCCCTGG 
ATGTGGACCA 
CTGCAATCTG 
ATCX3GGGTCA 
AAGGCTCACA 
CACAOGTCTO 
AGGTTAGCCA 
CTTGCAATCA 
OCATTCTGCA 
AGGAGTAAGG 
CAGCCrCTGG 
GCCAGT6TTC 
ACCATQTCTO 



21 

I 

CCTTGCAGAT 
CATTCAGTAC 
TGQACATAGA 
CCCTCAGGCT 
ACTCCCX3CAG 
ACCTOCTGGG 
AAGATA6C0Q 
CCGTGGGCAG 
CCTATAAGAO 
TAGCCATTGT 
TTTGCTCAQA 
CCAOOOTACT 
GCCAGGCOGT 
AGQAT6GGAA 
AGACCCTGGT 
TGGTGTATGA 
GTATGCTCTQ 
ACCTGOGACA 
ATAACAGCAT 
ACAGGGCXX3C 
TQTCCACA(B( 




GAGTGTCATT 
CAACGTGTTT 
CCTGTTCCTC 
QATTGTCACC 
GATOGCOGTG 
C3VTTTTCCCA 
GCTTCTGTTC 
CGGCATGATT 
6GTACAGGTG 
CCTGATCCTG 

CCTGCTGAAC 
CGCTTTCOGG 
GGGGGACTCG 
AGAAAQCTGC 
CACOTCTGCC 



41 

I 

GCACaVTCAC 
AAGGAGAATO 
GTCCTGAACC 
AOG QTCCTO G 
AGOCCTTCCT 
TTTGTCTACA 
CTQTTCAAAC 
ACAGCCATOG 
AOGOCCAAGG 
CTGCCTCTCC 
CACATTGATG 
ATCGTGTATG 
CAGOGTGGCA 
ACCOQGGCAG 
GTGGTGTTGA 
AAGATGAACA 
TCCACCJGTGA 
AGCATGTTTC 
GACTGCCTGC 
ATCAAG AGOl 
QAGGCTCTGT 



51 
I 

CACTGACCTC 
AGGAGAACAT 
CCAGCXIAGCA 
AOAACCTOCT 
ACCACTTCAT 
GCTTCATTGA 
TGGGTGGGGT 
ACAGGTACAT 
OOdTQGTGGC 
TGG6CTG6AA 
AAACCTACCT 
CGTACATGTA 
CCCAOAAOAO 
ACCAAGGOCO 
TCATCTGCTG 
AGCTCATTAA 
ACCCCATCAT 
OCTCTTGTQA 
ACAAACAGGC 
GGGTCAA6AT 
GA 



Seq ID NOx 240 Protein sequence 
Protein Accession fls NP_149421.1 



1 
I 

MALQXPPSAP 
LSLTLGTPTV 
RKDSRNVFItF 
TXAXVXAVI.P 
HSHAVRMXQR 
IMVYDVPGIM 
LONGMGDSDC 



11 
I 

SPLTSCTHAQ 
LENLLVLCVI 
lOiGGVTASFT 
LLGNNCEKLQ 
OTQK8ZIXHT 
NKLZKTVPAP 
LKXHANNAAS 



21 
I 

MTPSTKTSKB 
IiHSRSLRCRP 
ASVGSUTfTA 
SVC8DIFPHX 
SEDGKVQVTR 
C8KLCZ«IiHST 
VHRAASSCIK 



31 
1 

KEQIIQCGEd 
SYHFIGSX^AV 
XDSYX8IHRP 
DETYIXFWXO 
PDQARMDIRL 
VNPIIYALRS 
STVKIAKVTM 



4X 

I 

FKDIECFMVIj 
ADLLOSVIPV 
lAYKRIVTRP 
VTSVIiLLPIV 
AKTLVLILW 
KDUtHAFRSM 
GV8TDT8AEA 



60 
X20 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 



51 
1 

HPSQQLAXAV 
ySFZOFHVFB 
KAWAFCLKH 
YAYMYXLWICA 
LIICWGPLIA 
FPSCEOTAQP 
L 



60 
X20 
X80 
240 
300 
360 



Seq ID NOi 24X DMA sequence 

Nucleic Acid Accession 8i NM_003596.X 



1044 



wo 03/042661 



Coding sequences 82.. 1194 



] " 21 31 41 SI 

' I I I 1 I 

GTAGACTGTC CMGGCCTGA ACATTTTC08 AAAATCATTT TOAGCWVAAT ATCTOTTTAA 60 

TAACAAGATA ACCACATCAA GATGGTTGGA AAGCTQAAGC AGAACTTACT ATTGOCATGT 120 

CTGGTQATTA GTTCTGTGAC TGTGTTTTAC CTGGGCCAGC ATGCCATGGA ATCCCATCAC 180 

CGGATAGAGG AA06TAGCCA GCCAGTCAAA TTGGAQAGCA CAAGGACCAC TGTGAQAACT 240 

GGCCTGGACC TCAAAGCCAA CAAAAOCTTT QCCTATCACA AAGATATGCC TTTAATATTT 300 

ATTGGROOTO TGCCTC6QAG TOGAACCACA CTCATGAOGO CCATOCTGGA CGCACATCCT 360 

GACATTCGCT GTQGAGAOGA AACCAGGOTC ATTCCCCGAA TCCTGGCCCT GAAGCAGATO 420 

TGGTCACGGT CAAGTAAAQA GAAGATCOGC CTGGATGAGQ CTGOTGTTAC TGATGAAGTG 480 

CTGGA TTCTG CCATOCAAGC CTTCTTACTA GAAATTATCG TTAAOCATGG GGAOCCAGCC 540 

CCTTATTTAT GTAATAAAGA TCCTTrreCC CTGAAATCTT TAACTTACCT TTCTAGGTTA 600 

TTCCCCAATG CCAAATTTCT CCTGATGGTC CGA6ATG6CC GGOCATCAGT ACATTCAATG 660 

ATTTCTOGAA AAGTTACTAT AGCTOGATTT OATCTGAACA GCTATAGGGA CTQTTTOACA 720 

AAGTGGAATC GTGCTATAGA GACCATGTAT AACCAGTQTA TGGAaOTTGG TTATAAAAAG 780 

TGCATGTTGG TTCACTATGA ACAACTTGTC TTACATCCTG AAOGGTGGAT GAOAACACTC 840 

TTAAAGTTCC TCCAGATTCC ATGGAACCAC TCAGTATTGC ACCATQAAGA GATGATTGGQ 900 

AAA6CTG6GG OAGTSTCTCT GTCAAAAGTG GAGAQATCTA CAQACCAAOT AATCAAOCCA 960 

GTCAATQTAG QAGCTCTATC AAAATGGGTT GGGAAGATAC CGCCAQATGT TTTACAAGAC 1020 

ATGGCAGTGA TTGCTCCTAT GCTTGCCAAG CTTGGATATG ACCCATATGC CAACCCACCT 10 BO 

AACTACGQAA AACC TGATC C CAAAATTAIT QAAAACACTC GAAGGGTCTA TAAGGGAGAA 1140 

TTCCAACTAC CTOACTTTCT TAAAOAAAAA OCACAGACXO AGCAAQTOGA QTAGCAGAAC 1200 

CAQGAGGCTC TTCCATACAT GAGQAAAGAT TGCTQCCTTT TCAGCAQAAO G6AAATTCCT 1260 

AGGATTGOCT GTCCCCTGCC AAGCTTGGTG GAGOGTCTGC ACXnTTGOCTG OGCOGCCTGT 1320 

GCATTTGCCA 6TTTCCTCCC ACTGAQAGGA TGGAGGTGTC CGCACAGCTT TGGGCCTCGT 1380 

6AGGQATCT0 CCTCCTGAGC AAAGAGCTCT TGATCCCGAT TTCATX3CACA GCCCTGCAOT 1440 

AAGGAG OCCA G AAGGA ACAT OTGTTTCCTG TTAAAACTCC TCTTGTTCTC TTTTCTTACA 1500 

TTATGA OQTT TOTTTTCAAG OAGAGGOTTT AAAAATGGQA TCCTGTAAGC AGACTTGG6C 1560 

AGTCTCCTTT TQAAATAGGT TQTCTGTACA TGTTCTAATQ TTTTQTAGAA CAOOTOTGCC 1620 

TQTTTAAGTG TATTGATQTG AATAATATTA AATATCCTAA TTATTTAATT CATR3TATTG 1680 

TTTCTG AGAA GTTGGQAAAT TACCATTATA CATTTACAAC CTAATGACTT TT3TATTTTA 1740 
TTTTTCAAAA TAAAA6CTTT CAATGTGA 



Seq ID NO: 242 Protein secmence 
Protein Accession «> 2IP_0035e7.1 

41 51 

I I 

PVKLESTRTT VRTOLDLKAN 60 
TKVIPRILAL KQMH8RSSKB 120 
PPALKSLTYL SRLFPNAKPL 180 
TMYNQCMEVG YKKOCVHYB 240 
SKVERSTDQV IKPVNVGALS 300 
RZZBITRRVY KGEFQI.PDFL 360 



Seq ID NOt 243 DKA sequence 

Nucleic Acid Accession «: NM 001492.3 

Coding sequence: 1395.. 2513 " 



1 11 21 31 

I I I I 

MVGKLKQNLL LACLVISSVT VPYLGQHAME CHHRIEERSQ 
KTPAYHKDMP LIFIQGVPRS GTTLMRAMLD AHPDIRCGEE 
KIRU>BAGVT DEVLDSAMQA FLIiBIIVKHG EPAPYLQUCD 
LKVRDGStASV HSHISRXVTI AGPDLNSYRD CLTKWNRAIE 
QLVLRPERWM RTLLKFLQIP HNHSVLHHEB HIGKAGGVSL 
KHVGKIPPOV LQDHAVIAPM LAKLGYDPYA NPPNYGKPDP 
KBKPQTEQVB 



1 11 21 

i I I 

ACQCGGGGOG CG0GGCTC06 TGGGCTACOG 
CGA60GGGCG OTATGGCGGC GGOGOGGCCC 
CCXaAGCTAOG CGCAGCTAGT GCAGCGCGQC 
TGCACG6ACT 60GGCTQQGG GCTGGOGOGT 
COGCGOGAGC TGCTGCTGCT GGOGCTCGGC 
GCCACTGCGC GCCTCTTTCG GCCCCTGGOG 
GCCAAOATGC COGAGAGCGC TTGOAAQTTT 
GCCTACCTGC TQTTTGGCAC OGACTACCCC 
GACTGGA06C OGGGCATGGC AGTGCCAOGG 
AGCTTCTATG GCCACTGCAT CTAC6CTA0Q 
OTOOTCATGC TGCTCCACCA OGTOOTCACT 
CGGTACCACA ATOTGGQCAT CCTTOTGCTC 
GA GTTC ACCA AGCTCAACAT TTACTTCAAG 
GCCTTGOCAG CAGACTTGGG CTGCCTCAGC 
TACTGGTTCC CGCTCAAGQT CCTGTATGCC 
GACATCOCCr TCTACTTCTT CTTCAATGCO 
TACTGGTTCC TGTACATCOT GGOGTTTGCA 
CTGAAGGACC TGCGGGAGTA TQACACAGCC 
GAGAAGCCAC TGAGQAAOGG CCTGGTGAAG 
0CCC 3TGG AC CCGGCCCCAC CCOGAATACC 
TCCAO OOCC T CCAACTCTQC TCCTCTAGGG 
CTCATOCTGC CTCCATTTCC CGGCCAG6CC 
GQCCCOGCCC TCAGCCCACT GGTCCCGGGC 
OGOCTG GGAO GAAGATQCCA COGCCQCAGC 
TOCTGOCCCX GCTGCTGCCC TOGCTGCCCC 
COGCOGOCCT GCTCCAGGCT CTAGOACTGC 
^COGOTTCC CCCGGTCATO TGGOGCCTGT 
CTGGCTOGOG QOGGAOQTCC CCAQGGGTCA 
GGOTOGOCQG AAACATCGTG OGCCACATCC 
AGGCTOTCTC GGCCOOGGGO CATT6CCCTG 



31 41 51 

I I I 

06GGCGGGCG CAGGOQAOGO GCAOGGOGGG 60 

GCGGCGGGGC CGAOGGGGCC CGAOCCCATQ 120 

TGGGGCAGCG CGCTGGCGGC GGC6CGGGGC 180 

GGCGGCCTGG CTGAGCAG6C GCACCTGGOG 240 

GOGCTGGGCT GGACOGCGCT GCGCTCOGOG 300 

AAGOGGTOCT GCCTCCAOCC CAGAGATQCC 360 

CTCTTCTACC TGGGCAGCTG GAGCTACAGT 420 

TTCTTCCATO ACCCAOCATC TGTCITCTAC 480 

GACATTGCAG COGOCTACCT GCTCCAGGGA 540 

CTATACATGG ACACCTGGOQ CAAGGACTCO 600 

CTCATCCTCA TCGTCTCCTC CTAOGCCTTC 660 

TTCCTGCAOG ATATGAOTOA CGTGCAGCTT 720 

TCCOQCGGOG GCTCCTACOl TCQQCTGOIT 780 

TTOGQCTTCA GCTGGTTCTG GTTCOGCCTC 840 

ACCAGTCACT GCAGTCT6CG CACGGTGCCT 900 

CTCCTGCTGC TGCTCACCCT TATGAACCTC 960 

CCCAAGGTGT TQACAGGCCh GOTGCAGSAG 1020 

6AGGCCCAGA GCCTGAAGGC CAGCAAAGCC 1080 

GACAAGOGCr TCTGAACCCC TOGGCOCGGC 1140 

COGGCCACGC TCCCCCTCCT TGGCCOCCCC 1200 

COGCCOCCAC CTCCCCTGGG ACCCCGCCCC 1260 

CCCCAGGACC CCTGOCCCTC OGGGGACAOC 1320 

CQC0606GAC CCTGOGCACT CTCTGGTCAT 1380 

AAGGTCCCTG CGGCCACCAC CTCCTCCTCC 1440 

TGACC060GC CCCCGTGCCC CCAGGCCCAO 1500 

GCGATGAGCC CCAOGOTGCC CCCAGGCTOC 1560 

TTOGACOCOG GGACCCCCAG GAQAGCAOGT 1620 

CCCTGCAACC GTGCCAOGTQ OAGGAGCTQG 1680 

O0GACC60GO TGCGOOCAOC OGGGOCTO G G 1740 

AGTGGACAGT OGTCTTOOAC CrOTOG Q CTQ 1800 



1045 



wo 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



TGGAACXXX3C 
C66CAGCXXC 
C6GACCCGQO 
CGGA6CTGCT 
OGCTGGOGCr 
T6CTGGTGAC 
AACCCGTGTT 
TCX36CX3AGGT 
GCCAiGGGTCA 
ACCAOOCTGT 
GCTGOSTGCC 
TGOTGCTQCQ 
CGGGCAGGGA 



TQAGCGCCXX3 
GGAGGGCGGC 
GCCGGTGCTO 
GGGOGOOGCT 
ACGCCXXXX3G 
0CT06ACC0G 
OGG06GCX3GC 
GGGCTGGOVC 
GTGCX5CGCTG 
GCTOOQCGOG 
G60G06CCT6 
GCAOTATGAG 
CX3CGGGCCCA 



AGCCGGGCCC 
T6GGA<3CTGA 
CTCCGCCAGT 
TGGGCTCGCA 
GCCCCTGCOQ 
CGCCTGTGCC 
CCCGGGGGOO 
CGCTGGGTCA 
CCCGTCGCGC 
CTCATGCACG 
TGGGCCATCT 
GACATOGTGG 
ACAATAAATG 



GCCTGQAGCT 
GOGTGGOGCA 
TGGTGCCOGC 
ACGCCTCATQ 
CCTGGQGGCG 
ACCCCCTGGC 
CTTOTOOCGC 
TCX30GCXX3CQ 
TGTCGGGGTC 
0GGCC6CCCC 
COGTGCTCTT 
TGGAGQAGT6 
CCG0QT6GTC 



GGGTTTOGGG 
AGCGGGCCAG 
CCTGGOGCOQ 
GCGOCGCAGC 
CCTGGCCGAG 
COQGCOQCGQ 
60GGCGGCTG 
CGGCTTCCTO 
CGGGGGGCOG 
GGGAGCOGCC 
CTTTGACAAC 
C G GCTGCCGC 
TGCTC 



GCGGCGGCGG 
GGCGCGGGCG 
CCAOTGOGCO 
CTCXX3CCTGG 
GCCTOGCTGC 
OOCGACGCCO 
TA0GTQA6CT 
GCCAACTACT 
COGGOGCrCA 
GACCT6CCCT 
AGOGACAAOO 
TAACCCGGGG 



IB60 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



Seq ID NO: 244 Protein sequence 
Protein Accession «i NP 001483.2 



MFPPQQGPCG 
VHHRLFRRKD 
AGHCPEWTW 
VLLRQLVPAL 
DPRIiCHPLAR 
ALPVALSGSG 
YEDMVVDBCX3 



11 

I 

KHZiLLIiLALL 
PQETRSGSRR 
FDI*SAVEPAB 
GPFVRAEIiLG 
PRRDAEPVLG 
GPPAUIHAVL 
CR 



21 
1 

LPSLPLTRAP 
T8PGVTLQPC 
RPSRARLELR 
AAWASNASWP 
GGPGGACRAR 
RALMHAAAPG 



31 

1 

VPPGPAAALL 
HVBBLGVAGN 
FAAAAAAAPE 
RSIALALALR 
RLYVSFREVG 
AADLPCCVPA 



41 
I 

QALGXiRDEPQ 
IVRBXPI3RGA 
GGHELSVAQA 
PRAPAACARL 
WHRWVIAPRG 
RLSPISVLFF 



51 
I 

OAPRLRPVPP 
PTRA8EPV8A 
GQGA6ADPGP 
AEASLLLVTL 
FLANYCQGQC 
DNSDNWLRQ 



60 
120 
180 
240 
300 
360 



Seq ID NO I 245 DNA sequence 

Nucleic Acid Accession #i NM_031267.1 

Coding sequence : 17 . . 1125 



1 11 21 31 41 51 

t I I i I I 

ACGCGGGGCX3 CGCGGCTCXX3 TC00CTACX30 CGGQCGGGOG CAQGOGACGQ GCACX3G0GGG 60 

CXSAGCGGGCQ GTATGGOGGC GGCGGGGCCX: GCG6CGGGGC CGACGGQGCC CX3AGCCCATG 120 

COGAGCTAGO GGCAGCTAGT GCA6CX30GGC TQGG0GA60G CXXTrGGGQaC GGC60GGGGC 180 

T6CA0GQACT GCGGCTGOGG GCT6GOGC9GT 0GCG6CCTGG CTGAGCA06C 6CACCT6GCG 240 

CCGCCOGAGC TGCTGCTGCT GGOGCTOGGC OCQCTGGGCT GCMICCXSCGCT GOGCTCOGCG 300 

GCCACTGCGC GCCTCTTTCa OCCCCTGGCG AAGCGGTGCT GCXTTCCAQCC CAGAGATGCC 360 

GCCAAGATGC CCGAGAGOGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTQ QAGCTACAQT 420 

GGCTAGCTGC TGTTTGGCAC CXIACTACCOC TTCTTCCATG ACOCACCATC TGTCTTCTAC 480 

GACTGOACGC 0GGGCATG6C AGT6CCA0GG GAGATTGCAG COGGCTACCT GCTCCAGGOA 540 

AGCTTCTATO GCXaiCTCCAT CTAOGCTAOO CTATACATGG ACACCTGGCG CAAGGACTCG 600 

GTGGTCATGC TGCTCCACCA CX3TGGTCACT CTCATCCTCA TCGTCTCCTC CTACXSCCTTC 660 

CGGTACCACA ATGTGGGCAT C C TTGT G CTC TTCCIGCSlCa ATATCAGT6A CXSTGCAacrT 720 

GAiGTTCAGCA AGCTCAACAT TTACTTCAA6 TCC0GCG6G0 GCTGCTACCA TOSGCrGCAT 780 

GCCTTGGCAO CAGACTTGG6 CTGCCTGAGC TTGGGCTTCA GCTGGTTCTG GTTGOOCCTC 840 

TACTGOTTCC OQCTCAAGQT CCTOTATQCC ACCAGTCACT GCAGTCTGCG CAOGGTGCCT 900 

GACATCCCCT TCTACTTCTT CTTCAATGCG CTCXnGCTGC TGCTCACCCT TATQAACCTC 960 

TACTGGTTCC TGTAC3VT0GT GG0GTTT6CA GCCAAGGTOT TQACAGGCCA GGTGCAGGAO 1020 

CT8AAGGACC TGOOGGAGTA TGACACAGOC GA66C0CAGA GCCIGAAGCC CAfiCAAAGCC 1060 

GAGAAGCCAC TGAOGAAOGO CCT6GTGAA0 GACAAQCGCT TCTGAACCOC T0G6CCC06C 1140 

CCCOGTGGAC COGGCJCCCAC CCCGAATACX: CX»GCCAOGC TCCCOQTCCT TGGCOGCCCC 1200 

TCCACCCCCT CCAACTCTGC TCCTCTAOGO COGCCGCCAC CTCCCCTGGG ACCCCGCCCC 1260 

CTCATCCXGC CTCCATTTCC CG6CCA0GCC CCCCAGGACC CCTGCCXZCTC OGGGGACACC 1320 

QQCOCOGCCC TCAGCCCACT GG1CC06GGC OGCCX3C3GGAC OCTGCSCACT CTCTQOTCAT 1380 

CGGCT6GQAG GAAOATGCCA CX3GOOGCA6C AAGGTCCCTG CGGGCACCAC CTCCTCCTCC 1440 

TCCTGGCCCT GCTGCTGCCC TCX3CTGCCCC TQACCCGCGC CCOOOTOCCC CCAGQCOCAO IS 00 

COGCOGCCCr GCTCCAQQCT CTAGOACTGC G0GAT6AGCC CCAGGGTGCC CCCAGGCTCC 1560 

GGGCGGTTOC CCCGGTCATQ TGGGGCCTGT TTOGAGGCOG GGAGCCCCAG GAGACX»GQT 1620 

CIGGCrasaS GOGGACGTCC CCAGGGGTCA CCCIQXSACC GIGCCACXSTG GAGGAGCIGG 1680 

GGGTOSGOOG AAACATCOTG OGCCACATCC CG6ACGGGG0 TGOQCGCAOC 06GGCCT0QG 1740 

AGCCTGTCTC OGCCGOaQQG CATTGCCCTG AGTGGACAOT OGTCTTOQAC CTGTC3GGCT0 1800 

T6GAACCCGC TQA6C6CCC6 AGCCGGGCCC GCCTGGAGCT GOQTTTCGGG GCGGCGGCGG 1860 

CGGCAGCCCC GGAGGGCGGC TGGGAGCT6A 6CGTGG0GCA AGCGGGCCAG GGCGCGGGCG 1920 

OGGACCCCGG GCCGGTGCTO CTCCGCCAGT TGOTGCC O GC GCTGGGOCGG GCAaTGOGGG 1980 

0QGAGCT6CT GGGG6C0GCT TGGGCTGQCA AGGCCTCATO QC06CGCAGC CTCOSCCTGG 2040 

CGCTGGCGCT ACQCCCCCGG GCCCCTGCCG CCTGCGCGOG CCTGGCCGAO GCCTOGCTGC 2100 

TGCTGGTGAC CCTCGACCCG OGCCXOTGCC ACCCCCTGGC COSQCCGOGG OGOQACGCCG 2160 

AACOCQTOTT GGGCGGCGGC CC06GGGGCG CTTGT0606C QG6G0GGCTG TAOGTGAGCT 2220 

T O O GO S A GGT GGGCTGGCAC OOCTS GG TCA TC00SC0800 OGGCTTGCTG GCCAACTACT 3260 

GCCAGQGTCA GTGCGGGCTG CCCGTCGCGC TGTCGGGGTC 0GGGGG6CGG CCGGOGCTCA 2340 

ACCAOSCTOT QCTGOGCGOG CTCATGCACG OGGCOQCCCC GGGAGCOGCC GACCTGCCCT 2400 

GCTOOGTGCC CGCGCGCCTG TCGCCCATCT CCGTOCTCTT CTTTGACAAC AGCGACAACG 2460 
TGGTGCT6G0 GCAOTATGAG GACATGGT6G TGGAOGAGTG GGGCTGC06C TAACCCGGGG 

Seq ID KOt 246 Protein sequence 
Protein Accession #i HP_067090.1 

1 11 21 31 41 51 

I I i I 1 I 

HAAAGPAAGP TGPBPMPSYA QLVQRGW68A LAAAROCTDC GWGLASSOIA SOKLAPPBL 60 

LLIAU3AL6W TALRSAATAR LFRPLAKRCC LQPRDAAKNP ESANKFLPYL GSWSYSAYLL 130 

POTDYPPFBD PPSVFYDWTP GMAVPRDIAA AYLLQGSFYO H8IYATLYKD TWRRDSWMli 180 
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UmWTLIliI VSSYAFRYKN VGXLVLFLHD ISDVQliBFTK LNXYFKSR06 SYHRUIALAA 240 

DLGCLSFGFS WFWFRLYWFP UCVLYATSHC 8LRTVPDIPP YPPFNALLLL LTLMNLYWFL 300 
YIVAFAAKVL TGQVHEIjKDL REYDTABAQS UCPSKAEXPL RN6LVKDXRP 

8eq ID NOt 247 DMA sequence 

Nucleic Acid Accession «i NM_002081.1 

Cdding sequence i 222.. 1698 

I 11 21 31 41 SI 

I I I I 1 I 

66CTGCCCX»^ GGOAGGOTTC GGACCTC6CA CCCOGOGOGC CCOSOGCOGC OOCOSCOGCC 60 

GGCTTTTGTT GTCTCC G CCT CCT06GCOSC OGCOGCXTTCT GCSACOSOGAa C0GGQGG06C 120 

CGGGACCTTG GCTCTGCCCT TOGCGQOOQQ QAACTGOGCA GGACCCGGCC AGOATCOGAG 180 

AGAGGCGOGG GQGGGTGGCC GGGGGOOCOQ CCGGCCCCGC CATGGAGCTC OGQGCCOGAG 240 

6CTGGTG6CT GCTAIOTGCG GCGQCAGOQC TGGTCGCCTG CGCCCGOGGG GAGCG6GCX:A 300 

GOVAGAGCOG GAGCT60G6C GA60TC0GCC AGAT CTAO GQ AG CCaAGTCC TTCAGOCTGA 360 

GOBACGTGCC CCA6G0G6AG ATCTCGGOTG AGCACCTQGG 6ATCTQTCCC CAGG6CTACA 420 

CCTGCT6CAC CAGCX3AGATG GAGGAGAACXT TGGCCAACCG CAGCCATGCC GAGCTGGA6A 480 

CCGOGCTCCG GGACAGCAGC OGOGTCCTGC AGGCCATGCT TGCCACCX»0 CTOOOCAGCT 540 

T0QATQACX31 CTTCCAGCAC CT6CTGAA0G ACT0G6AG00 GA06CTGCAG GCCACCTTCC 600 

GOGGCSGCCTT GGGAOAOCTG TACAOSCAGA AGGG6AGGGC CTTCOGGQAC CTGTACrCAO 660 

AOCTGOGGCT OTACTAC06C GQTGCCAACC T6CA0CTGGA GGAGAOGCTG 6C0QAGTTCT 720 

GGGCCCOCCT GCTC6AGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTO CTGCCTGATG 780 

ACTACCTGGA CTGCCTGGGC AAGCAGQCCX3 AGGCOCTQOQ GCCCTTCGGG QAGGCCCCGA 840 

GA6AGCTGCG CCTGCGGGCC ACCOgTGC X rr TCGTGGCTGC TOGCTCCTTT GTGCAGGGCC 900 

TOGGOGTOOC CAGGQAOQTO GTCOGGAAAG TGGCTGAOGT CCCCCTGGGC O0G6AGTOCT 960 

06AGAGCTGT CATGAA8CTG GTCTACTGTG CTCACTGOCT GGGA6TCCCC 0G06CCAGGC 1020 

CCTGCCCTOA CTATTGCCX3A AATQTOCTCA AGGGCTOCCT TGCCAACCAG GCCGACCTGG 1080 

ACX5CCQAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 1140 

CATCGGGTGT GGA6A6TGTC AT0G6CAG00 TGCACAOQTQ GCTGGCGGAG GCCATCAA06 1200 

CCCTCCAGGA CAACAGGGAC AOSCTCACGG CCAAGGTCAT CCAGGGCTGC GGaAACCXX:A 1260 

AGGTCAACCC CCAGGGCCCT G66CCTGAGG AGAAOCGOOG COGGGGCAAG CT6GCCCCGC 1320 

GQGAGAGQCC ACCTTCAGGC ACGCTGOAOA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380 

GOGAOOTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAOATGOCCC 1440 

TGAGCACTGC CAGTGAT6AC OGCTGCTGGA A06GQATGGC CA6AGGCCX30 TACCTCCCXX3 1500 

AGGTCATGGG TGAOGGCCTG GCCAACCAGA TCAACAACCC COAGGTGGfta 6TGQACATCA 1560 

CCAAGCOGGA CAT6A0CATC CGGCAQCAGA TCATGGAGCT GAAGATCATG ACCAACOGGC 1620 

TGCOCAGCOC CTACAACGGC AACGAOGTGG ACTTCCAGQA OGCCAGTGAC QAGGGCAGGO 1680 

GCTCGGGCAG OGGTGATGGC TGTCT GG ATQ ACCTCTGOSQ CCGOAAGGTC AOCAGGAAGA 1740 

GCTCC3VGCTC COGGACGCCC TTGACCCATO CCCTCCCAGG CCTGTCAGAO CAQGAAGGAC 1600 

AQAAQACCTC GGCTGCCAGC TGCCCCX^GC CCCCGACCTT CCT0CT60CC CTOCTCCTCT 1860 

TCCTGQCCCT TACAGTAGCC AGGCXXXBOT GGCGGTAACT GCOCCAAGGC 0CCA66GACA 1920 

GAG6CCAAG6 ACTGACTTTG CCAAAAATAC AACACAGAOG ATATTTAATT CACCTCAOCC 1980 

TGGAOAGGCC TGGG6TGGGA CAGGGAGGGC 0G6CGGCTCT QAGCAGGGGC AGGC36CAGAG 2040 

GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGOTCAGCT GGGAGCCAOT GTGCCCAAAA GCCATGTATT TCAGGQACCT CAGGG6CACC 2160 

TCCOGCTGCC TAGCCCTCCC CXCAGCTCCC TGCACCGCCG CAQAAGCAGC CCCTCX3AGGC 2220 

CTAGAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCSVCAG CX3AGCCTOT0 CCTTCCTCCC 2280 

06CCT0CTCC CACTOGOACT CCCAGCAGAG CCCACCAGCX: AQCCCTGGCC CACXXXX3CAG 2340 

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTQ GGTGTOOGCC ATCCAGGGTC TGGCAGAOCC 2400 

TCTOAGATOA TGCATGATGC CXTTCCCCTCA GOGCAGGCTG CAQAGCCOGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGOOTCTGCA GGGTGACQCC TGAGACAGCA CCACTGCTQA 2520 

GGAGTCTGAG GACXOrOCTC CCAGAOACCC TGCAGTQAGG GGCCCTCGAT G06CAQAT6A 2580 

G6GGCCACTG ACCCACCTOC GCTTCTGCTO GAGGAGG6GA AGCTGGGCCC AAAGGCCCAG 2640 

GGAGGCAGOO TGGGCTCTGC CAATGTGQGC TGCGCCT08C ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCOGAGGGC TGAGGAGCAG CCAGQACCOG 2760 

CCTGCTCCX:a TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCAOGQ TQACAC3W3GT 2820 

CA0G6CTCAG AOTOACCCTC GGCTGTCACC TGCTCACftGG GATGCTGGTG GCTGGTGAGA 2880 

CCC06CACT0 CACAGQGGAA TGOCTAQOTC CCTTCCOQAC CCAGCCAGCT GCACTGCAGO 2940 

6CACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

'l UlXXTlVrr CATGGAGAGC TGTTaSCTCC TCCCAQATGG CTTOGGAGGC COGCAGQGCC 3060 

CACXTTTGOAC CCTGQTGACC TCCTOTCACT CACTOAGGCC ATCAGGGCCC TGCCCCAGGC 3120 

CT0GACG06C CCTCCTTCCC TCCTGTGCCC CAGCTGCCAO GTGGCCCTGa GGAGGGGTGG 3180 

WitKyiOTrO GGAAGOGGTC CTGC3U3GGGG AGGAGQACTT GGAGGGTCTG GGGGCAOCTO 3240 

TCCTGAACCO ACTGACCCTQ AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACOOTCC 3300 

OGCACAOTOO AOSQAGGTCC CCGGTTGCTG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC COCATAATAA 3420 

CCCSGCCMn GOCAGGOTGG GCIQGQQACT CTQGGACSlGT GATGCOQGGC 6CCAGGACAG 3480 

CAGCACTCCC GCTGCACACA GA06GCCTA6 GGGTGGGBCT CA6ACCCCAC CCTACGCTCA 3540 

TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC ASTGGCCAAQ CC TGCTOTG T 3600 

CCTTCCTCCA CAAGGTCCCC CCACC3GCTCA GTGTCAGCOa QTGA06TGTG TTCTTTTGAG 3660 
TCCTTGTATO AATAAAAGGC TGGAAACCTA AA 

Seq ID NO: 248 Protein sequence 
Protein Accession #t NF 002072.1 



1 11 21 31 41 51 

1 1 I I I I 

HEIAARGHWL LCAAAALVAC AR6DPASKSR SCGEVKQIYG AKGFSLSDVP QAEISGEHLR 60 

ICPQaYTCCT SEMEBNIjANR 8HAEX«BTALR DSSRVLQAML ATQLRSFDDH FQHLLHDSER 120 

TLQATFPGAF GELYTQNARA FRDLYSBLRL YYRGANLHLE BTLABFHARL LERLFKQLHP 180 

QLLLPDDYLD CLGXQABALR PFGBAPRBLR LRATRAFVAA RSFVQGLGVA SDWRKVAQV 240 

PLGPECSRAV MKLVYCABCb GVPGARPCPD YGRMVUQSCL ANOADLDAEH RHLUSMVLZ 300 

TDKFHGTSGV BSVIGSVH7W LABAINALQD NBDTIiTAlCVX QGOONPKVNP QQPOPEBXRR 360 
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ROXZAPRBRP PSGTI£XIiVS BAXAQLRDVQ DFMISXiPGTL CSBKMALSTA SDDRCHKGMfV 
RGRYLPEVHG XXSIANQZNNP EVBVDITKFD HTIRQQZMQL KZMTNRLR&A YNGNDVDFQD 
ASDDGS6SGS 

Seq ID NO: 249 DNA sequence 

Nucleic Acid Accession ffi NM_001493.3 

Coding sequence* 8.. 1864 



420 
480 



1 11 21 31 

I I ! I 

GAAGGGCATG GTCTCCCCAC GG ATOTOOQQ OCTCCTCTCC 
CATTTTOCTC CCCCAQACAC GGCCCOCTGQ CQTCTTGGAG 
GCCGGGTCCA GGCCCTGGGG CCCOGOGGTC OCCCTGCAGC 
CTTCTTCAOA GTCTGCCTGA AGCCTGGGCT CTCAOAGOAG 
CCTGGGOGOG 6CGCTGAGT6 CGCGCGGACC G6TCTACACC 
GCCTGATCTC CCACTGCXXXj AOGGGCTCTT GCAQGTGCCC 
CAO C TTCTCT TTCATCATOO AAACXrTGGAO AOAGQAGTTA 
06CCTGGAGC CTGCTGGCGC GCQTGGCTCG CAQGGG6C0C 
GGCCCGGGAC ATTCAGCGCX3 CAGOCGCCTQ GOAOCTGCGC 
OQAGC^CCT GCOGTOGGGA COGCGTGCAC GOGCCTCTGC 
6C38GTG06aT COGGGACXGC GGCCCTQCX3C ACC3QCT0QAG 
GGTGTGCCX3A OCAGGCTGCA GCCCTQAQCA TQQCTTCTGT 
ATGCCTAGAG GGCTGGAGTG GACCCCTCTG CAOGGTCCXTT 
CAGCCCCAGG GGCCCX3TCCT CTGCTACCAC CGGATGCCTT 
TGAOGGGAAC CCGTGTGCCA ATGOAGGCAG CTGTAGTGAG 
CACCTGCOCQ OGTGQOTTCT AOOGOCTGOO GTOTOAGQTQ 
TGGACCCT6C TTCAAGQGOG GCTTGTGTGT OGGGGOTGGA 
CTGCCACTGC CCACCTGOTT TCCAAGGCTC CAACTGTGAG 
CCTGCAGCCA TGCCGCAATG GCXX3ACTCTG CCTGGACCTG 
CTGCXX3CGCC GGCTTCGGGG GTCXTTOQCTO OQAGCACGAC 
06CCTG0GCT AACGG06GCA CGTGTGTGGA GGGGG6GGGC 
GCTGGGCTTC GGCGGC06C8 ACTGCGGOGA 6C6060GOAC 
TGCTCACGOC GGCGGCTGCT AOGCCCACTT CTCOGGCCTC 
CTACATGGGA GCGCGGTGTG AGTTCCCAGT GCACXX:CGAC 
GOCCCCOCCO 6GCCTCA6GC CCGGGGACCC TCAQCGCTAC 
ACTGCT0GT6 GCCG00GGCX3 TGGCOGOOGC 'I^^^CTCTTO 
TGGCCACTGC CA6GATGCXX3 GOTCTOOCTT GCT6GCTG60 
COCACTCCCG GATGCACTCA ACAACCTAAG GA06CA06AG 
CTO G TCCGTA GATTGGAATC GCCCTOAAiGA TGTAGACCCT 
TGCTGCTTCC ATCTACGCTC GGGAG6TAGC GAOQCCCCTT 
G080QCTG6G CAGAGGCAGC ACCT GCTTTT TCOCTACGCT 
ATOAATTGGO TAQAOTCTCT GQAAOQITTT AAOCCCATTT 

Seq ID N0» 250 Protein sequence 
Protein Accession #: NP 058637.1 



41 
I 

CAOACrGTOA 
CT6CAGATCC 
GCCCGGCTOC 
GCCX3C0GAQT 
GAGCAGCCOQ 
TTCCX3G0A0Q 
GGAGACCAQA 
TTGGCAGCOa 
TTCTCGTACC 
OGTCCGOGCA 
GAGGAATQT6 
GAACAGCCOG 
GTCTCCACCA 
GTCCCTGGGC 
ACACCCAGGT 
AGOGGGGTGA 
GACCCTGACT 
AAQAGGGTGG 
GGCXZAOGCCC 
CTGQAOGACT 
GCX3CAC0GCT 
CCGTQGQCOQ 
GTCT60GCTT 
GGCXSCAAGOG 
CTTTTQCCTC 
CTGGTCCACG 
ACCC06GAGC 

CAAGGGATTT 
TTCCCCC06C 
TGCTGGATTC 
TC31GTTCTAA 



51 

I 

TOCTAGOSCT 
ACTCTTTOGQ 
CCT6C0GCCT 
CCCOGTGCGC 
GAGOGCCCGC 
OCTGGCCTGO 
TTGGA66GCC 
GAGGCCC6TG 
G0GCGCGCT6 
GCX3CCCCCTC 
AGGOOCCQCT 
GTGAATGOCG 
GCAGCTGCCT 
CTGGGCOCTG 
CCTTTQAATG 
CATGTQCAGA 
CTGCCTACAT 
ACCGGTGCAQ 
TGCGCTGCOG 
GCQ CGGQ CCG 
GCTCCTG06C 
0Q0GCCCCT6 
G06CTCCCGG 
CCTTGCCCX3C 
08GCTCTGGG 
T6C3600GCCX3 
OGTCAOTCCA 
ATGGTCOGAG 
ATQTCATATC 
TACACACT60 
TGTCCXiTQAA 
CTTACTTTCA 



MVSPRMSGUi 
RVCLKPGLSB 
SFIZBTWRRE 
PAVGTACTRL 
EGWTGPLCTV 
PRGFYGLRCB 
PCRNGGLCLD 
FGGRDCRBRA 
PGLRPGDPQR 
PDAUmiiRTQ 
GQRQHLLFPy 



11 
I 

SQTVILALIF 
EAAESPCALG 
LQDQIGGPAH 
C31PRSAPSRC 
PVSTSSCLSP 
VSGVTCADOP 
LGKALROICR 
DPCAARPCAH 
YLLPPALGLL 



21 

1 

LPQTRPAGVF 
AALSARGPVY 
SLliARVAGKR 
GPGLRPCAPL 
ROPSSATTGC 
CPNOGLCVQG 
A6FAGPRCEH 
GGRCYAHPSG 
VAAGVAOAAL 
VDHNRFEDVD 



31 
I 

ELQIHSPGP6 
TBQPGAPAPD 
RIAAGGFHAR 
HDECEAPLVC 
LVPGPGPCDG 
ADPDSAYICH 
DIjDDCAGRAC 
LVCAGAPOYM 
UiVHVRRRGB 
POGXYVISAP 



41 

I 

PGPGAPRSPC 
LPLPDGIiLQV 
OIQRAGAWEL 
RAGC8PEKGF 
NPCAHGGSCS 
CPPGFQGSNC 
ANQQTCVEGG 



SQDAOSRLtiA 
8ZYARBVATP 



PSSILSVK 



Seq ID NOt 251 DKA sequence 

Nucleic Acid Accession At CAT cluster 



GAAATATAAC 
AGATTATTGA 
GCTTCAATTC 
TTAQAAAGTC 
A6TTTTACTA 
TTGTGCAAAC 
GGGT6CTATT 
GGTTTTCTTT 
GTTTACTQAA 
TTATGTAAAT 
TGCCATTGTC 
TAAAAATACA 



11 
I 

CATTGCAATT 
CAATCCCAAA 
CTTATATGAC 
TGAGAGACTT 
CTOTAGT6AC 
AACCTTTCAT 
TACCTAQAAC 
TTCAAGGAGG 
ACCACTCAAT 
ATOTTAAATA 
ATATCAAACT 
TTTCAATGTT 



21 
I 

AGAAAATATC 
TATACAATTT 
TTCAGACTGQ 
TATACATAAA 
CQTTQAGAAO 
T6TTAAGTGC 
CATTGTCTAC 
TTCAATTAAG 
GAAGTCAAAT 
TAACATCTAA 
TGCTTTGTAT 



31 
I 

CAAAATAGCC 
TTCTTTAAAA 
AGAAGCCTGT 
TTCTCAATTT 
ACCCTTOTTT 
CTGTATTCCT 
TACAATTAAC 
GCAATAAGAT 
TACTQAAGCT 
GGAAAATAAA 
CATACTAATG 
AAAAAAAA 



41 
I 

TGTTATCTTC 
GTAGTACAAT 
TAAACCACTQ 
GGCTGCTGTA 
ATTTACATTT 
TTCATTTACT 
ATTTACATTA 
GTTTGCTGGA 
TTTGCCTACA 
CAATATTATA 
TTACATAACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



51 
I 

SARLPCRLFF 
PFRDAWPGTF 
RFSYRARCEP 
CTQPGECRCL 
ETPRSFECTC 
EKRVDRC8LQ 
GAKRCSCAIiG 
DGA8ALPAAP 
GTPSPSVBAL 
LFPPXiXTGRA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



51 
I 

CAGGTGGCCT 
TTOCTTTGTA 
TTAGTTTCAG 
CAOGT6CCAG 
GAA6CACT0T 
TGATQTCCAO 
CAAA0T8T0T 
QAAACCTATT 
TCTTGOTCTT 
ATTATGTQTT 
TATOOATCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



Seq ID NOi 252 DNA sequence 

Nucleic Acid Accession ft Eos sequence 

1 11 21 31 41 51 

1 I I I I I 

AGGTACTGCC AGAAAOSATC A66ACCTGQA GTCT6GCAAG AOGAAGACAG AGGCCTOTOT 
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GGGAAGCGAG TTGTTATCTT TGGTTATCTA GCT6TATGA0 TGTATTGGTC TTC3VTAAAGC 
TAGATAACCO AAAGTAAAAA CTTCTTCAAO ATGGCSOGGGO AGOGTGTGAO AATGAAAGAC 
TACAGCaSAO AGACAOTAAA AACCAGAAAG GTCAGOAATA CTTATTGAAT CTAACTTTGT 

I i nmrnti vrrrm ' CLT tatqattaaa ggtgggatqa gagaaaatta aatqacacac 

ACATGCTAAA ATATCAAGGT TGCAGATATG TCTTGAGAGG G6TTGTTGCA GCT6CAAAG6 
AGAAGTGTAT AGTGATAATG AGTAAAQATG CATGTGCAQT TTGTTCTATT TTAAGGCAAA 
AGTTATATCA GGGATTTTTT TCTTAGAAAG GTGTTQCAQA GATGTCTGGT ACXTTACTTTA 
AAAATGATTC CATAATATGT AGACTTGGGC AGTTCCTTTG GGAGGCACCT CCXTTCTCAAA 
ATTTGAAGAT TGTGCTTGGA AATTACTTTA CAT6TATTTG GGCTOTATQT CAATTT6TAC 
GAAATTAGAC TTTCA6AAAA GTTTATACTG GAAGCTTAAT AATTTGTATC TACTGAG6AC 
TTAGAGCTAO CA66CAAAAT GAAAAAAAAA AAAAAAAARG CAAGGGCTGA TTTTTATTCT 
TCTATTCAAA ATACAAGGAC AGATGCTTCT CTGTTCCAAG AGGQTTTCCT TQAGGAAGCT 
ACTGAAOCAG AAAGACATGA TGGAGACGAG ATCGCCTCXC CTCTT G TCAA AQTGTTAAAA 
AAATGTTCTG TCTTACTCTG CGCCTAGCAT TGGAAATGAA AGTGACATTT ACGCCAC3UIC 
CCACX3TGTGC QCCTCCTCTC TTTTrGTTTA AGGATGATCA GGTCTATCCA GGAAACAGCT 
CTGGCATCCC AAACTGAAAT AATTAOQAOO TATATAGACC TGACAAAAAT GGAAAGGGGG 
TGGGGAATCT GAGGGTCTGT CTTGCCTAAT TGATTCCGCT AAAOOGAATG CAGGAGATGT 
GAACG6CAG0 AOSCrCCSAT TCCCAOGCTC GaGGGCAAGT GATAAAOOSQ GGCOGOGCAO 
CCTATGACAG ACAGCCCTGT TGGGGGGTGG GGGTATGAAA AAAACATCAA GTGCACACAC 
CATACTCATC TCCATCGCTT AAGAAAQTAA AGGCATTTCC CACCCACAGC CATCTGCAGC 
TTCCCAAATG GCAGCACCAA CTGGTCTGTA GCTQCTACAT AGTCTGCTTC TGTTAATTTT 
TTAACCACT6 TTTAATCTGG CCATAATTAA GTTTQGCTTT CTTCQTTQTT TQAQATTTTC 
AGAATTCAAG GCAAGCTAGT AGAAAGCAAT TCCAAQAAAG TCCCATGACT GCCTGCCCCT 
AATGTCAAAA TCTCAQTCCA TGAGATTATG GCXTTGTGAC CA<MTTTTG CrTT T G TTT T 
G6GTGGGCAA ATGTGTATAG AGATAAAATA CATATCTCTA TATAACAOTC 6TTATTATAA 
TTTCATGAGG CTTTTTACCT CTTAACATGA TACATCTAGG AACTTGGTCT AATTGTGCTA 
OTAGATATAC ACTAGAAOTA AAACTATAAA GTCATTAGCC T6TAGAAAGT TGATTATGAT 
AACAATATQA TAAAAAGTTT GTTTTGGAAT AQTCTCAGCT AGATGGGTCT AAATAGCCAT 
TTTAATQTAA TCTAAAAATA ACACTATQCC TAQCAGAAAC TTTGGCQCTT TQGAGGTCCC 
CATTGTCGCC TTTTCATAAA AGTCCTTAAG TTTTCCATAT GTCACCAAGC AAACATTTGT 
ATA06CATTT ACACAOAAAT ATAACCATTO CAATTAGAAA ATATOCAAAA TA0CCT6TTA 
TCTTCCACGT 6QCCTAGATT ATTGACAATC CCAAATATAC AATTTTTCTT TAAAA6TA6T 
ACAATTTCCT TTGTAGCTTC AATTCCTTAT ATGACTTCAG ACTGGAGAAO CCTGTTAAAC 
CACTGTTAGT TTCAGTTAGA AAGTCTQAQA QACTTTATAC ATAAATTCTC AATTTGGCTG 
CTGTACACGT 6CCAGAGTTT TACTACTGTA GTGACCX3TTG AGAAGACCCT TGTTTATTTA 
CSVTTTGAAOC ACTQTTTGT6 CAAACAACCT TTCATTGTTA AOTGCCTGTA TTCCTTTCAT 
TTACTTCATG TGCAGGG6T6 CTATTTACCT AGAACCATTG TCTACTACAA TTAACATTTA 
CATTACAAAG TGTGTGGTTT TCTTTTTCAA GGAQQTTCAA TTAAOGCAAT AAGATOTTTQ 
CTGGAGAAAC CTATTGTTTA CTGAAAGCAC TCAATGAAGT CAAATTACTG AAGCTTTTGC 
CTACATCTTG QTCTTTTAT6 TAAATATGTT AAATATAACA TCTAAGGAAA ATAAACAATA 
TTATAATTAT GTGTTTGCGA TTGTCATATC AAACTTGCTT TGTATCATAC TAATOTTACA 
TAACTTATCG ATCAATAAAA ATACATTTGA ATGTT 

Seq ID NOt 253 DKA sequence 

Nucleic Add AccesBion #: m_00X650.2 

Coding sequence! 40.1011 



1 
I 

GGG6CAGGGA 
AGGOGGXOQO 
G6G6TCTGGA 
TTTGTTCTCC 
GTCGACATGG 
TTTGGCCATA 
AGQAAOATCA 
ATTGGAGCA6 
ACCATGOTTC 
TTTCAATTGG 
TCAATAOCTT 
ACTGGT6CCA 
GAAAACCATT 
TAT6AGTATG 
AAAGCTGCCC 
GAGACX3GAT6 
GAGGAGAAGA 
CQCACTGAAA 
6AAACA6ATT 
GTCTAAACAA 
TCCAAATCTA 
TCTAOTTACC 
CCTGACAGAA 
AGTCAATTCT 



11 

1 

ATGAGAGCTQ 
GTAAGTGT06 
CTCAAGCTTT 
TCAGCCTGGG 
TTCTCATCTC 
TCAGOGGTGG 
GGATG8CCAA 
GAATCCTCTA 
ATGGAAATCr 
TGTrTACTAT 
TA6CAATTGG 
GCAT6AAT0C 
GGATATATTG 
TCTTCrOTCC 
AGCAAACAAA 
ACCT6ATTCT 
AGGGGAAAGA 
6CA0ACAAGA 
TGTTATAAAT 
TAAATATTTC 
AAAAAAGAAA 
TTTCATTAAC 
CTCAAAGACA 
TATTTGAATA 




120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



AOCCTTCAOC 
GAGTCAGGTA 
T6ACCGGG6A 
ACTACAAGAT 
ACCCATTAAG 
ATATTACTCA 
TATTGTGAAT 
ACCTATTTTA 
TTAAGTCTTG 
ATATTGGTAT 
C 



51 



60 
120 

leo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NOt 254 Protein sequence 
Protein Accession ttt NP_001641.1 

1 11 21 31 41 

I 1 I I i I ' 

MSORPTARRW GKOGPLCTRB NIMVAFKGVH TQAFWKAVTA EFLAMLIFVL LSLGSTINWG 
GTEKPLPVDN VLISLCFGLS lATNVQCFGH X8GGBZNPAV TVAMVCIRRI SZAKSVPYXA 
AQCLGAIIGA GILYLVTPPS WGGLGVTMV KGNLTAGHGL LVELZITFQL VPTZPASCDS 
XRTDVtGSIA LAXGF8VAXG HLPAXNYT6A 8MNPARSF0P AVXKGNHBIIB WXYWVGPIXO 
AVLAGGLYEY VPCFDVEPXR RPKEAP8KAA QQfTKGSYMEV EDNRSQVBTD DLILKPGWH 



60 
120 
180 
240 
300 
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VIDVDRGEBK KGKDQSGEVL SSV 

Seq ID NO I 255 DNA sequence 
Nucleic Acid Accession U26742.1 
Coding sequence: 325.. 144 9 





TTTCCCATTT 
TCACATTTCT 
OUVTAAA6CT 



Seq ID NO; 256 Protein sequence 
Protein Accession #i AAC50424.1 



ATTTGCTCTC 
ATATATTTTC 
ATTCXnTOGO 
CACAGAACAG 
CTTGGACAOG 
TGGACTAGCA 
GASTATGATQ 
CT6CTTCT6G 
CACGTCATGG 
TGCTTCCAGC 
CTT6GCTCAC 
GTTCTCCCAC 
TTTT6GTGQA 
TACTTTGAOC 
ATTTCCCTTT 
TGTTTAAAQC 
GGGTGCT8TT 



60 
120 

leo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



MIED8GKRGN 
ALRENALNHL 
PE6HGKIGVF 
VPEGPSFOYT 
VBCSYCHSBS 
NALSKSLSCA 
FZTRSSDOAF 



11 

I 

TMAERRQLPA 
DPNTELNVSR 
AVKMAIiATLC 
EQSARSCFSQ 
MMGPRYROQQ 
SSREPIiHPMF 
GGCV 



21 

I 

EMRAQDLDRI 
LEAVLSTIFY 
GGKXMDKIAy 
QKKVTUK3FL 
CTNYQI^QDC 
PDQPEKPUHi 



31 41 

I I 
RLSTYRTACK LRFVQXKCinj 
QLNKRMPTTH QIHVEQSISL 
IFSMI8DSS6 VMVYGRYDQF 
DTLMSDPPFQ CLVWbPUiHR 
FHR(^A06SH 6NQI^2MKByT 
AHIVDTWFPR FVTSMNDTLF 



51 
I 

HLVDIWNVIE 
LLNPLLAAFD 
LREVIiRLPTA 
LANVENVFHP 
8HK8PAXKLT 
SHSVPSSQSP 



Seq ZD NO: 257 DNA secwence 

Nucleic Acid Accession #i NM_004172.1 

Coding sequence* 179.. 1807 ~ 



G0G6ATT6TT 
CACCOCTTAC 
TGTGGGTGAT 
OACTAAAAOC 
AGTCC6TAAA 
TAAAAOTTAC 
TACAATCCTT 
CTCCTTTCCT 
CTCCAGTCTT 
QC6AGCTOTA 
TGTCATCATC 
T0TACGA6TG 
TCTGQTAOAA 
OGCCATCCAO 
GOAOACTCTT 
CAATGCCCTG 
GGAACAGGGO 
GCTTAGCAGTA 
TGTGGAGA10 
CATTGTTQ6C 
ACGGAAAAAC 
OACCTCTTCA 
CGTGaACAAQ 
6ACTGCCCTC 
GAACTTCGGA 
TGOAATTCCT 
CACTGACGAC 



AGTTCTGCCA 
OGOGTCACCA 
TATGAG6CTT 
CAAATTATTA 
CAGGOQGQCC 
ATCAGQCTCA 




AGAACAATGO 
ACATQGAtXSG 
ACTTTGAACT 
TT0QG0GA6C 
TOQGCCTGOC 
TOCGGAOCAC 



60 
120 
160 
240 
300 
360 



60 
' 120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



1050 
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CACCAAOGTA CIOGGAGACT CCCTQGGAGC TGGGATTGTG QAGCACTTGT CAOGACATGA 1680 

ACTGAAGAAC AOAGATGTTG AAATGGGTAA CTCAGTGATT GAAGAGAATG AAATGAAQAA 1740 

ACCATATCAA CTOATTGCAC AGQACAATGA AACTGAOAAA CCCATCGACA GTCAAACCAA 1600 

aATGTAOACT AACATAAAGA AACACTTTCT TGAGC3«XAG GTGTTAAAAA O GATTA TAAA 1860 

ATCTTTCCAT CTCATTACAG CTCATTOOCT CCAGCAAGCC CGTCA TCTTC CCTTTOCTCC 1920 

CTTCXGATAA GACTC6AAAA TASTOCTGCA AAACACAAGG GAGGATTTTO GGTGGCCAAA 1980 

GTGTACAATT TTCATCCCAC AATTGAAATT TTTAAATCAT TTCATGrTAG TCTTACCGAA 2040 

TAAGGTACCA AGATCACAAA TAGTGTTQAT CAGATCTTAC AAGTTTATGT GGCACACAAT 2100 

TCCTATAAAT GTGATTTTTT TATATAAGTT AAAGAGACAA ATAGTAGOCT AAAAACATTT 2160 

TAAAATCAAC TTTTGAAATT TAAAAATCTT TCAGAATACA ATTCAGTTTT AGTTTCAAAA 2220 

TGTTAACAAC TTGAATTACA ACOGGrTATC AGTTGGACAO TAAGATTTTA TCCCTTTCTC 2280 

TTCTGACTG6 TATACCTATT TCATTAGTAG CTAOOTOGAC ATATACATCT AGCACA6CT0 2340 

TGAQQACAOA CAGAAGGCAA AQTTTCCATG TGGCCTTGAG CAAGTCCCAT CTCACCTCTA 2400 

GQCCTCAGTO TCCTCATCTA TAAAATGAGG QACTTCCCTA GAAOTCTTCA TGOTCTCTTC 2460 

CAGCCCAGAC ATCCTGTQAT GTCATGAAAO CACCT6CCCT CTGTTTCCCC TCAGAACACC 2520 

CTGTACCATC CAT6QAGCAC QAG6CCTTCA GAAAAGACAC TTCAAT6GGA GTGAACATTT 2580 

CTAACTAAGO ACAGGAIGGG TGTGT6TGGT QGTCACCAGG TOCTGTGAOC AAAGT6CAGG 2640 

TTATGCAAGT OGCCAGGCAO GAGGCCATTC CAGGAGTGGQ ATTATTCATC AAACTCTTTO 2700 

CCCAGTTCAT CCCAATGGGQ GAAOTATTCC CTTCrTTCCT ACTCTGGGAA QAATOTCTCC 2760 

TQCCACTCCT CAACTGATGA TAGACTTCGA AAACASATGA GAAGACTAGC AGCTAGCAAG 2820 

GGTGCTTGTA GTCACACTGT GGAACACTAA AOAOCrAGGA AAOAGTTGAO CACAGGCAAC 2660 

ATTACAAAGA AAGGATTTQA AAACACCAAG AGTAC3U3GTC TTCTTT^AGG AAGAATAAAA 2940 

AAGAAGAGGT TCATTTTTCT GGCTTTTTTT TTCAOCTGAA ACACTTTTTC TCGAGTCCAA 3000 

AATCATTCCC CCCGTQAAQT CTGCTTACCA AAACATAAGA CX5ACTTATAT ATTTGAAAGA 3060 

AGTCAAATGA ATGAOCTCTC TAATAQAAOT CCATGAGTTG AOTOGGTATT TCTTA TTTQA 3120 

AAOTGTTTTT CTTTAATCAA AAOTCCTTAO AATGAGGGAA ACAAAATATT TATTTGTTTT 3180 

OGAATCCCAC TTATCAAATC ATTCAAAACT TTC3W3CTGGA U 'I' UUUtfrA ' rG CTTTTQTTTT 3240 

GTTTGTGTCC ATAAGAGAAA TGGTAGAAGA TQAATCAGTA TGAAGACACT GTCAATGAGG 3300 

TTATGAGAAA AAAACAGCAQ GGGCATTAGT TTCAGGCAAG GCAGCTCCCA GOTTTAGAGA 3360 

TTAATTTTTA CCCCCTAAGG AATATCCAQT CAAAGAOGCT GAGTGGGAGC TGTCAGGCAG 3420 

TAOCAGCTGT OTTTGAGTTT CTOGCTGAAA ATG6TGAAGA AT6GACTTAA TTATGCTAAC 3480 

AAACTGAAAA ATCTAGACAT AGATCCTCT6 ATATACAATT A6A0ATATTT TTATATAGAC 3540 

CCCAAOCATT CTGT6CATAA AAOTTAACAT TAG6CT6TGQ TQCA6TAACC ATTTAATGTC 3600 

GAGGCTCTAT TTCGGAAATA CACTACAAAT GTTAAAGTAC OTGGCTGTCC TC TTAAGAC A 3660 

CTAGTAQAOC AAA6ACTTAA TCATATCAAC TTAATTCTGT TACACAATAT GTGTTTTrTA 3720 

ATATACTAAC CATTTCTTAT GGAAAGGTCC TGTGGGGRQC OCATCCTCTC GCCAAOOCAT 3780 

CACAG6CTCT GCATACACAT OCACTCAGTG T06ACTGGQA AGGATTACTT TGTAGAT6TA 3840 
TTTTCAATAA AGAAAAAAAT AGTTTTACAT T 

Seq ID NO: 258 Protein eequence 
Protein Accession ftt 1IP_004163.1 

1 11 21 31 41 51 

I I i 1 I 1 

MTKSNGBBPK KGGRMERFQQ GVRKRTUAX KKVQNITKED VKSYIiFIQIAF VLLTVTAVIV 60 
GTILQFTLRF yRMSYREVKY FSFFQBLIiMR KLQMLVLPLI ISSLVTOIAA U38KASGKM6 120 
MRAWYYHTT TIIAWIGII IVIIIHPGKG TKENMHREGK IVRVTAADAP IDLIRHMPPP 180 
NLVEACPKQP KTNYEKRSFK VPIQANETLV GAVINNVSBA METLTRITBE LVPVPGSVN6 240 
VHAL0LWF8 MCFGFVIGNM KBQGQALREP PDSLNBAIMR LVAVIKKYAP VGILFLIAGK 300 
IVEMEDMGVI GGQIiAMYrVT VIVGLLIHAV IVIiPIiLYFLV TRKMPWVFIG GLUSALITAL 360 
GTSSSSATIiP ITFKCLEENII GVDKRVTRFV LPVGATIHMD GTALYBAIAA XFZAQVNNFB 420 
U7F6QXX7X8 XTATAA8X6A AOXFOAGLVT MVXVLTSVGL PTI3PITLXIA VOWFUSSLRT 480 
TTlilVIiGDSLO AOXVEHLSRH ELIQiRXlVBMQ NGVXBENEMK KPyQLXAQDtl BTBKPXDSET 540 
KM 

8eq ID MOt 259 DMA sequence 

Nucleic Acid Accession fit NN_021948.1 

Coding sequencei 48.. 2783 

1 11 21 31 41 51 

1 1 I I I I 

TOTGGCACTG CCTGCGTACC CAAOCCCAGC CTTGGGTAGC CTGCA GCATQ QCCCA GCTG T 60 

TCCT6CCCX:T GCTGGCAGCC CTGCTCCTGG CCCAGGCTCC TGCAGCTTTA GCAGATGTTC 120 

TGGAAGOAQA CAOCTCAGAQ GACOGCQCTT TTCG0GTGCX3 CATOGCGGGC GACGOGCCAC 180 

TGCAGGGOQT GCTCGGOGGC GCCCTCACCA TCCCTTGCCA CX5TCCACTAC CTGOGQCCAC 240 

06G0QAGC0Q CCGOGCTGIG CIGGGCTCTC G60GGGTCAA GTGQACTTTC CIGTCCOGGG 300 

GCCGGGAGGC AGAGGTGCTG GTGGOGOGGG GAGTGOGOGT CAAGGTGAAC GA66CCTACC 360 

GGTTCOGOGT GOCACTGCCT OOOTACOCAG OOTCGCTCAC OGAOGTCTCC CTGGOGCTGA 420 

OCGAGCTGCG CCCCAACGAC TCAGGTATCT ATCGCTGTGA GGTCXAGCAC GGCATCX5ATG 480 

ACAGCA6G6A CXK:TGTGaAO GTCAAGGTCA AAGGGGTOGI CTTTCTCTAC CQAOAGGGCT 540 

CIGOOOSCTA TOCTTTCTOC ■rm-CllSGGG OOCAGGAGGC CTOTGCOOGC ATTGOAGCOC 600 

ACATCX3GCAC CCCX36AGCAO CTCTAT6CXX3 Cfr A CLYl'GG OOGCTATGAO CAAT6TGATG 660 

CTGGCTGGCT GTCGGATCAQ ACOGTGAGGT ATCCCATCCA GACCCOVCGA QAOGCCTGTT 720 

ACGGAGAC31T GGATGGCTTC CCOGGGGTCC GQAACTATGG TGTGGTGGAC COQGATGACC 780 

TCTATGATGT GTACTGTTAT GCTGAAGACC TAAATGGWSA ATTGTTCCTQ GGTGACXX:TC 840 

CAGAGAAGCT QACATTGOAO GAAGCACGGG OGTACTQOCA G6AGCGGGGT GCAGAGATTO 900 

CCACCACX3G6 CCAACTGTAT GCAGCCTGGG ATGGTGGOCT GOACCACTGC AOCC CROG GT 960 

GGCTAGCTOA TGGCAGTGTG OGCTACCCCA TOGTCACAOC CAGCCAGG6C TOTGGTGGGG 1020 

GCTTGCCTGG TGTCAAGACT CTCTTCCTCT TCCCCAACCA GACTQGCTTC CCCAATAAGC 1080 

ACA6CCGCTT CAACGTCTAC TGCTTOOGAQ ACTOGGCCCA GCCTTCTGCC ATCCCTGAGO 1140 

CCTCCAACCC AGCCTCCAAC CCAGCCTCTQ ATGOACTAQA 6GCTAT0GTC ACAGTGACAG 1200 

A6ACCCTGGA GGAACTGCAO CT O OCTCAOQ AAGOCACAGA GAGTGAATOC OGTGGGGCCA 1260 

TCTACTCCAT CCCCATCATO GAGGAG6GAO GAGGTGGAAO CICCACXOCA GAAGAOCXJ^ 1320 

CAGAG6C00C TAGGA06CTC CTAOAATTTO AAACACAATC C3VT6GTAOOS COCACGGGQT 1380 



1051 
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TCTCACAAGA GGAAGGTAAG GCATTGGAGQ AAOAAGAGAA ATAT6AA0AT GAAGAAGAGA 1440 

MGAGQAGGA A^AGAAGAG GAGGAGGTGQ AGGATQAG6C TCTGTGCSGCA TGGCCCAGCQ 1500 

AGCTCAGCAG CCOGGGCCCT GAGGCCTCTC TCCCCACTQA GCCAGCAGOC CAGGA6QAGT 1560 

- CACTCTCCCA GGCGCCAGCA AGGGCAGTCC TOCAOCCTOO TCCATCACCA CrTOCTOATO 1620 

O GAOAGTCAQA AQCTTCCAGO CCTCCAAGGO TCCATOOACC ACCTACTOAO ACTCTGCCTA 1680 

CTCCCAGGGA GAGGAACCTA GCATCCCCAT CACCTTCCAC TCTGGTTGAG GCAAGAGAGG 1740 

TGGQGQAGGC AACTGGTGGT CCTGAGCTAT CTQGGOTCCX: TCGAGGAGAG AGCGAGGAGA 1800 

CAGGAACCTC CQAGGOTGCC CCTTCCCTGC TTCCAGCCAC ACGGGCCCCT GAGGGTACCA 1860 

. ^ GGQAQCTGGA GGCCCCCTCT GAAOAXAATT CTGQAAOAAC TGCCCCAGCA GGGACCTCAO 1920 

1 \J TGCAGGCCCA OCCAGTGCTG CCCACT6ACA GCGCCRGCOG AGOTQGAGTO GCCGTGGTCC 1980 

COGCATCAGG TGACTCTGTC CCCAGCCCCT GCCACAATG6 TGGQACATGC TTGQAGGAGG 2040 

AGGAAGGGGT COOCTGCCTA TGTCTGCCTG GCTATGGGGQ QQACCTOTOC GATGTTCGCC 2100 

TCOGCTTCPG CAACCCOQGC TGGGAOGCCT TCCAGGG06C CTGCTACAAG CACTTTTCCA 2160 

- - CACGAAOGAO CrGGGAGOAG GCAGA6ACCC AGTQOOBQAT OTACGOOQCQ C3VTCTGGCCA 2220 

1^ GCATCAGCAC ACCOOAGOAA CAGGACTTCA TCAACAACOG GTACCOOQAG TACCAGItSGA 2280 

TOGGACTCAA OGACAGGACC ATCOAAGGCG ACTTCTTGTO GTCGGATGGC 6TCCCCCTGC 2340 

TCTATGAGAA CTGGAACCCT GGGCAGCCTG ACAGCTACTT CCTGTCTOQA GAGAACTGOG 2400 

TQQTCATGGT GTGGCATGAT CAGGQACAAT GGA0TQAC5GT GCCCTOCAAC TACCACCTOT 2460 

on ^fgy"^ CAAOATGGOG CTGGTQTCCT GTGOQOOGOC ACOGQAOCTQ CCCCTGGCTC 2520 

AAGTGTTCGG COGCCCAOIG CTOGGCTATG AGGTG6ACAC TGTGCTTCGC TACOQOTOCC 2580 

GGGAAGQACT GGCCCAGCJGC AATCTGCCGC TGATCCX»ATO CX^AOAQAAC GGTCGTTGGG 2640 

AGGCCCCCCA QATCTCCTOT GTGCCCAGAA GACCTGCCOG AGCTCTGCAC CCAQAGGAGQ 2700 

ACCCAGAAGG AOGTCAGGGQ AGGCTACTGG GAOOCTGGAA OGCGCTQTTG ATCCCCCCTT 2760 

^_ OCAGOCCCAT GO CAGQT OCX; TAG6GGGCAA GGCCrtQAAC ACTGC080CC ACAGCACTGC 2820 
Z,J CCTOTCACCC AAATTTTCCC TCACACCCXG OQCTGACCAC AGGAAGTtSAC AACATGAC 

Seg ID NOi 260 Protein sequence 
Protein Accession #j NP_068767.1 

30 1 11 21 31 41 51 

) I i { ) i 

MAQLFLPLLA ALVLAQAPAA LADVLEGDSS EDRAPRVRIA GnAPLQOVLG GALTIPCHVH 60 

YLRPPPSRRA VIGSPRVKHT PLSRGREAEV LVARGVRVKV NBAYRPRVAL PAYPASIiTOV 120 

. - SLALSBLRPN OSGIYRCEVQ HGIDDSSDAV BVKVKGWFL YRBGSARYAP SPSGAQBACA 180 

JD RIGAHIATPE QLYAAYLGOY EQGDA6WLSD QTVRYPIQTP REACT6DMD0 FPGVHNYOW 240 

DPDDLYDVYC YAEDUIGBLP LGDPPEKLTL EEARAYCQER GABIATTGQL YAAWDGGLDH 300 

CSPGWLADGS VRYPIVTPSQ RCGGGLP6VK TLFLPPNQTO FPNKKSRPNV YCPRDSAQPS 360 

AIPEASNPAS KP ASDGL EAI VTVTETIiEEL QLPQBATESE SRGAIYSIPI MEDGGGGSST 420 

PEDPABAPRT UjEFBTQSHV PPTOFSEBEG KALEBBEKYB DEEEKEEBEE BEEVEDBALH 480 

*t\) ANPSELSSPG PEASLPTBPA AQBBSLSQAP ARAVLQPGAS PI.PDGESEAS RPPRVHGPPT 540 

ETLPTPRERN LASPSPSTLV EAREVGEATG GPELSOVPRO ESEETCSSEG APSLLPATRA 600 

PBGTRELEAP SEDNSGRTAP AGTSVQAQPV LPTDSASRGG VAWPASOTC VPSPCHNQGT 660 

CLBBEEGVRC LCLPGYGGDL CDVGLRFC3IP GWDAFQGACY KHFSTRRSWB EAETQCRMYG 720 

AfflASISTPB EQDPIMNRYR BVQWIGLNDR TIEGDPLHSD OVPIiLYBNWN PGQPDSYFLS 780 

OBICWMVWH DQC3QHSDVPC MYHLSYl'CKM GLVSGQPPPB LPLAQVPGRP RLRYEVDTVL 840 

RYRCREGLAQ RNLPLIROQB KGRHEAPQIS CVPRRPARAIi KPEmVBGRQ GRLLGRMKAL 900 
LIPPS8PMPG P 



45 



Seq ID NOt 261 DMA sequence 
JV nucleic Acid AccesBion #i l)M_004386.1 
Coding sequence i 2.. 3967 " 

} 11 21 31 41 51 

<^ I I I I I I 

GATGGGGGCC COOTTTQTCT GGGOCTTGGG CXnTTTOATO CTOCAGATGC TGCTCTTTGT 60 

OGCTGGGGAA CAGGGCACAC AGGATATCAC CGATGCCAGC GAAAGGGGGC TCCACATGCA 120 

GAAGCTQGGG TCTGGQTCAG TGCAQGCTGC GCTGGOGGAO CTGOTGGCCC TGCCCTGTCT 180 

CTTTA CXXTTG CAGCCAOGGC CAAGCGCAGC COGAGATQCC CCTOGGATAA AGTGOACCAA 240 

GGTOOQGACT 600TGGGGCC AGOGACAGGA CTTOOOCATC CTGOTGGCCA AGGACAATGT 300 

OU CGTGAGGGTG GCXaAAAOCT GOCAGGGAOG AGTGTCACTG CCTTCCTACC CCCGGCX3C00 360 

AGCCAAOGCC A06CTACTTC TGGGGCCACT GAOQGCCAOT GACTCTOGGC TGTACOGCTG 420 

CCAGGTGGTG AGGQGCATCO AGGATQAGCA GQACCTGGTG CX X TT GQ AGG TOACAGOTOT 480 

TOTGTTCCAC TACOGATCAG CCCXSGGACOO CTATOCACTO ACCTTCGCTG AGGCCCAGGA 540 

GGGCTGCOGT CTCAOCTCAO CCATCATTGC ASCCOCTGGQ CATCTACAGQ CTGCCTTTQA 600 

OJ GGATOGCTTT GACAACTOTG ATGCTOQCTO OCTCTCTOAC CGCACTGTTC GGTATCCTAT 660 

CACXX31GTCC CGTCXTTGOTT GCTATGGCX3A CCQTAGCAGC CTTCCAGGGQ TTOGQAGCTA 720 

TGGGAGGCGC AACCCACAGG AACTCTACQA TGTQTATTGC TTTGCCOGGG AGCTGOOGGG 780 

OQAGGTCTTC TAOGTQGGCC 06GOCX36COG CCTGACACTO GCOQGOOCGC GTGCACAGTG 840 

OOOC OQCCAO GGTGCOGGGC 1QG0CT080T GGGACAGCTG CACCTGGCCT OQCATGAQGO 900 

/U CCT6QACCAG TaCOACCaSG GCTGG CT GGC OQAOGOCAOC OTGOQCTACC OGATCCAGAC 960 

GCCGOGCCGG CGCTGCGGGG GCXXaCCCfCC GGGOGTGOGC ACCXjrCTACC QCTTCGCTAA 1020 

COOGACCGGC TTCCCCTCAC COGCCGAGCG CTTCGAOOCC TACTGCTTCC GAGCTCATCA 1080 

CCCCAOGTCA CAACATGGAO ACXTTAGAGAC CCXATCCTCT GGGQATQAGG GGGAGATTCT 1140 

QTCAGCAQAG GQGCCCCCAO TTAOAGAACT GQAGCCCACC CTGGAGQAGG AAGAGGTOGT 1200 

iO CACCCCTGAC TTCCAOGAGC CTCTGGTGTC CAGTGGGQAA QAAGAAACCC TGATTTTGQA 1260 

QGAGAAGCAQ GAOTCTCAAC AGACOCTCAO CCCTACXXCT GGGGACCCCA TQCTGQCCTC 1320 

ATGGCCCACT GGGGAAOTGT GGCTAAGCAC 6GTGGCCCCC AOCCCTAGCG ACATGGGGGC 1380 

AGGCACTGCA GCAAGTTCAC ACAOOGAGOT GGCCCCAACT GACCCTATGC CTAGGAQAAO 1440 

0G0G06CTTC AAAOaGTTGA ATOGOCXSCTA CTTOCAOCAG CAGGAACCGG AGC0C5GGGCT 1500 

Oi; GCAAGGGGGO ATCGAGGCCa GGGCCCA6CC CCCCACCTCA GAGGCTGCAO TGAACCAAAT 1560 

GGAOCCrCOG TTGGCCATGO CAGTCACAGA QATOTTGOGC AOTGGCCAGA GCCGQAGCCC 1620 

CTGGGCTGAT CTGACCAATG AGGTOGATAT GCCTGGAGCT GGTTCTGCTO GTGGCAAGAG 1680 

CTCCCCAGAQ CCCTGOCTGT GQOCCCCTAC CATOGTOCCA CCCAGCATCT CAGGCCACAG 1740 

CAOQGCXrCT GTCCTGGAGC TAGAGAAACC OQAOGGCTOC AGTOOCAGGC CR6CCACCCC 1800 
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AGACXriOTTT I W SWCXX T TGOAGGCCAC TGTCTCAGCT CCCAGOOCTO CCCCCTOGGA 1660 

GGOVTTGCCT 6TGGCCAOCT OCCCAGATCT CCCTATGAT6 GCCAT6CTGC GTGGTCCCAft 1920 

AGAOTGGATQ CTACCAOICC CCACCCCCAT CTCCACCGAG GCCAATAGAO TTQAG6CACA 1980 

TGGTGAGGCC ACX:GCCAC6G CTCCACXXTTC CCCTOCTGCA GAGACCAAQQ TGTATTCCCT 2040 

QCCT C TCTCT TTGACCCCAA CA6GACA6Q6 TGGAGA6GCC ATGCCCACAA CACCTGAGTC 2100 

GCCCAGOGCA GACTTCAGAO AAACTOGGGA GACCftSGCCT GCTGAGGTCA ACAAA6CT6A 2160 

GCACTCCAGC TCCAGCCCAT G6CCTTCTQT AAACAOGAAT 6TGGCTGTAG Wrim ' aX ' 2220 

CACTGAGACT GCCACTQAOC CAACGGGCCT CAGGGOTATC C06GGGTCTG AGTCTGGG6T 2260 

CTTCGACACA GCAGAAAGCC CCACTTCTGG CTTGCAGGCC ACTGTAGATG AGGTGCAGQA 2340 

CCCCT^CCC TCAGTGTACA GCAAAGG6CT GGATGCAAGT TCXXXATCTG CCCOCCTGGG 2400 

GAQCXrCTGQA GTCTTCTTGG TACCCAAAGT CACOCCAAAT TTG6AGCCTT GGGTT6CTAC 2460 

'AGATGAAGGA COCACTQTGA ATCCCATGGA TIXJCACAGTC AC6CCGGCCC CCAGT6ATGC 2520 

TAOTGGAATT TGGQAACCTG GATCCCAGQT GTTTGAAGAA GCCGAAAGCA CCACCTTGAG 2580 

CCCTCAGGTG GCCCTGQATA CAAGCATTGT GAOGCCCCTC ACGACCCTGG AGCAGGGGGA 2640 

CAA6GTTGGA GTTCCAGCCA TOTCTACACT QGQCTCCTCA AGCTCCCAAC CCCACCCA6A 2700 

GCCAGAGGAT CA0GTG6AGA CCCA6GGAAC ATCAOGAGCT TCAGT6CCTC GGCATCAGAG 2760 

CAGTCCCCTA GGGAMCOGO CTQTTCCTCC TGGGACACOS ACTGCA6CCA 0T0TG6606A 2820 

GTCTGCXITCA QTTTCCTCAG GGGAGCCTAC GGTACCGTGG GACCCCTCCA GCACXXTTGCT 2860 

GCCTGTCACC CTGQGCATAO AGQACTTOSA ACTGQAGGTC C7G6CAGGGA GCC06GGTGT 2940 

AGAOAGCTTC T6GGA6GAGG TGGCAAGTGG AGAGGAGCCA GCCCTGCCAG GGAGCOCTAT 3000 

6AATGCAGGT G0GGAG6AGG TGCACTCAGA TCCCIGTGAG AAC3UIOCCTT GTCTTCATGG 3060 

AGGGACATGT AATGCCAATG GCACCATGTA TOGCTGTAaC TGTGATCAGO GCTT03C0G6 3120 

GOAGAACTGT GAGATTGACA TTGATGACTG CCTCTQCAGC CCCTGTGAGA ATGGAGGCAC 3180 

CTGTATTGAT GAQGTCAATQ GCTTTQTCTG CCTTTGCCTC CCCAGCTATG GGGGCM3CTT 3240 

TTGTGAGAAA GACAC06A6G GCTGTGACOG 0G6CT6GCAT AAGTTGCAG6 GCCACTGTTA 3300 

COQCTATTTT OCCCAOC9GK3A GGQCATGGOA AGATG008AO AAGGACTQCC GCCX3C0GCTC 3360 

CGGCCACCTG ACCAGGGTCC ACTCAC06GA GGAACACAGC TTCATTAATA GCTTTGG G CA 3420 

TGAAAACACO TGGATOGQCC TGAAOQACAG GATCQTGGAG AGAGATTTCC AGTGQACGGA 3460 

CAACACOGGG CTGCAATTTO AGAACTGGCG AGAGAACCAG CCX3GACAATT TCTTCGCGGG 3540 

TGGGGAGGAC TGTGTGGT6A TGGTGGOGCA TGAAAOOGQG 06CTGGAACG ATGTCCCCTG 3600 

CAACTACAAC CTACCCTATG TCTGCA AQAA GGGCACAGTG CTCTGTGGTC CCCCTCCG6C 3660 

AGTGGAOAAT GCCTCACTCA TCGGT6CC0G CAAGGCCAAG AACAATGTCC ATQCCACTGT 3720 

AAGGTACCAG TGCAATGAAQ GATTTGCCCA GCACCAT3T0 GTCACCATTC GATGCCGQAG 3780 

CAATGGCAAG TGGGACAGGC CCCAAATTGT CTGCACCAAA CCCAGACGTT CACATCGGAT 3840 

6C6G6GACAC CACCACCACC ACCAACACCA CCACCAGCAT CACCAGCACA AATCCC6CAA 3900 

G6AG0GCAGA AAACACSUVGA AACAOCCAAC GGAGGACIGG GAGAAGGAOG AAG6GAATTT 3960 

TTGCIGAAGA ACC3USAAAAA AGAAAGCACA ACAOCTTTGC CATGCCTCCT CTGGA6CCTT 4020 ' 

06CCTGGGGA GACA6AACCC AGAGAGAAAC AAGAGAGTCC AGAAGTCCCT GAACCCCAAA 4080 

CTGTTCTOGC AAAAAAAATA TTCCTTTGAA CAAAGGTCTT CTTTTCCTTT TTTTACATAC 4140 

ACAAGATCTT CTTGGCAGGT GGAGCCAGGT GTCTGAAAAG TTCATTCTCG TCTGGCTGAA 4200 

CTCTGGQAGT GTOTGCCAQC TGAGGGAAGC ACAAGTAOCA AAGCTCA3TG GTCTQGTCTC 4260 

TTOTTT6CC31 GOCTQATTGA AGCAGGCCTT GATQAGGGTQ CATQAOTGTA TQTTTQCATT 4320 

CACATGAAGG AATTGCTTTT CACACCAGAA ATTCAQACTT AGTCAATGTT GGCTGAATTC 4380 

CTAAATCCAG GAAGAAGCCT GQACGTAGGG TCATTAGCTT TGGGAATAGA AGQCTACACA 4440 

GAAGCACACT GTTTTTGAAC TTGACAACAG CTCTOTCTi'T ACCCTGGACT TCAGCCCAAG 4S00 

TTCGGTCTTT GGTCTTGGTG GATAAAC3^ CAGTGTGGAG ATOOCAOSTA CT6»TTTXA 4560 

GGGATGTTTT TAGGACAACC TCCCTCCATG CCTTCAGAGT TAG6AOTGAG AATGATCAAA 4620 

GCAATAT6TA GGTGATQGAG GGAGAGTGTA TTGCTAACCC TTCX3VGGTCT AGTCCAGOGC 4680 

TQAQATTTGG TGGTTCTGCA TGTGTGATGA ATCTCTTTCA CACAAATAQA OQAQAGGATA 4740 

TTTAGGGCTA GATGAGCCCA GATTTCTTCC CCCTCCATCT CTCAGGGAQA CAAAGAACCT 4600 

CCTTCCTGQA OCAAGGAGGT GCTGCCAAGT TTTCTAGCCC AOTQCAiaiTA CCCAOTCCTT 4860 

AAGCAGACAT TGGTAGTGCC CCTGCCCTGG GTCCCACTCC TGCCGCACCC CACCCTTGTC 4920 

CCTGGCCATT GCCTGOTGGT CTAGAAACAC TTAAAACTTQ AAQTAQTGAC ACCTACCTQC 4980 

GGTCATATTG TAGAGAGATQ CTCAGTGTTA AAACTGAAAC ACACAAACAC ACACACACAC 5040 

ACATTTTTCT CTTGTAGATT TTAATTTTTT AAOTQOGAAA GAACTCAGCT TGCCTTCCTC 5100 

CCCCAAATGT GCAACCTGTA AAAGGTCTCT CCACACCAGG GGCCAGGATC CAGTTCCCTC 5160 

ATCTCTGGCA GGAAA6ATCC ACAGCTTTTC CTCCATGTCT 6TTACTCACT TTCA6CAGTC 5220 

06GGTAAAAT CTGTGGATCA GGGTTAAAAA AGCACCGTGG AGAATGGCCC TCTTCAGGAA 5260 

AGAAAAATAA GCAAATGAAT G6TCCACCTA GGGGTTCAGT AAAGAAAGAA ATGT6TTAAC 5340 

TGAGCCTGAA TCCCTTCTGG GAAGTAATAA TGACCATTGA CAACTAAGAA GTAGACACCA 5400 

TGCTAAAGAC TTACATACAA TCTCCTTQAA TCTTCTCAAT AQCCCATTGA CTTAGAAACT 5460 

GTTACTTTCC CATTTTACAC ACAGTGAAAC TGAGOCTCAG ATATAAAGGA AAGGTACTGG 5520 

CTTGAAGTCA CAACCAOGAC AGGAGTAAGG ATTTGGAATA AGGATTTGGT CCTGTTTTCT 5580 

GGACCAAATC CTTACTCTGG CTCTGCTTAC ACTTTCTCTC CATCACCAAA TCCTTACTCC 5640 

AAATCCAQAA GTCA6AGCCA ACTCCCATCT TGGTTCTGAC CCAAATCCTG CTCTGGACTC 5700 

TG6AGAGGA0 ATTGAAATAT AATTQCACCC TCATACACAT TTAGGAAATG GTTAAGAAGT 5760 

GTAAACTGAA CCCTTATCCT TGTCTTCAAT CTTCCTCCCT GTAGACATCT ATCTTATTAT 5820 

GGTTATTATT CAOAAAACCC AGGGATACAG QTTTGTCTTC TTACTTTGAT AACTCTTCTT 5860 

AGTTTAAAAT AATAATAATA ACACATCTTT GGTCATCTAT GTCACACAAA AATTTTCCTT 5940 
7GTTTGGGGG GGGCTGGGGA TGCAGTGTTT TTTGGGGGGT CTTGGTTTAT QCTCCCTGCC 6000 
CTT6AGCCCC TCAGCCGTTT QCCCTGCCCC CACCTCGQCT CCATGGTGGG AGGGGGCTCT 6060 
GGTCTTTTCT AAAGTGGGCG QTTTQTCTTT TGATCTTTCC CTTTTGGATG TGOGTGTGTG 6120 
TCTGCGTQTO CCATOTGCGT GGCAOGCATA TGAGTGTGTG TOCOTGTGAA OGOCTTTGGG 6180 
TCCTGCTGGT TTTGCTGTGA GCTGCAOTGT TCTGTGGQTC TGT6GTATCT GACACTGTGG 6240 
ACATTAATGT ACTTCTTGGA CATTTTAATA AATTTTTTAA CAGTTCAAAA AAAAAAAAAA 6300 
AAAAAAAAAA 

8eq ID NOi 262 Protein eequence 
Protein Accession «t NP 004377.1 



1 11 21 31 41 51 
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M6APFVHALG LLMLQHLLFV AOBQOTQDIT DASBUSLBMQ KL650SVQAA LABLVALPCL 
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FTLQFRPSAA 
ANATUjLGPL 
ACRLSSAIIA 
GIUmPQELYD 
LDQCDFGWIA 
-PTSOKODUST 
ERQESQQTLS 
GRFKGUnSRY 
WADLTMEVDM 
DLPWSPLEAT 
6BATATAPPS 
KSSSSPHPSV 
PWPSVYSKGL 
SGIWBPOSQV 
PEDQVBTQQT 
PVTLGZBDFB 
GTCNANOTMy 
CEKDTE6CDR 
ENTHIGUTDR 
NYNLPYVCKK 
NGXNDRPQXV 
C 



RSAPRIKHTK 
RASDSGLYRC 
APRHLQAAFE 
VYCPAREIiGG 
DOSVRYPIQT 
PSSGDBGBIL 
PTPGDPKLA8 
FQQQBPBPOL 
PGAOSAGGKS 
VSAPSPAPWB 
PAAETKVY8L 
NRNVAVGFVP 
DA8SPSAPIiG 
FEEAESTTLS 
SGASVPPKQS 
LBVLAGSPGV 
GCSGDQGFAO 
GWHKFQGHCT 
IVERDPQWTD 
GTVIjOGPPPA 
CTKPRRSHRM 



VRTASGQRQD 

DGPDHCOAGW 
EVFYVGPARR 
PRRROGGPAP 
SAE6PPVRBL 
WPTGBVWLST 
QGGNEASAQP 
SPBPWLWPPT 
AFPVATSPDXi 
PIiSLTPTGQG 
TETATBPTGL 
SPGVFLVPKV 
PaVAIJ>TSIV 
SPLQKPAVPP 
ESFNEBVASG 
ENCBIDZDDC 
RYFAHRRAKB 
NTGLQPENWR 
VENASIiIGAR 
R6HHHHHQHH 



LPZX*VAia3NV 
DLVPLEVTGV 
LSDRTVRYPI 
LTLAQARAQC 
GVRTVYRPAN 
BPTLBBEBW 
VAPSPSDMGA 
PTSEAAVNQM 
MVPPSIS6RS 
PMMAMXiROPK 
GEAMPTTPBS 



TPNLBPWVAT 
TPLTTLBQGD 
GTPTAASVGB 
EEPALPGTPM 
LCSPCENOGT 
DABKDCRRRS 
EKQPDNFFAG 
fCAKXmVHATV 



Seq ID NO J 263 DNA sequence 

Nucleic Acid Acceaaion #: Eos sequence 

Coding sequence t 7. .3065 



11 



VRVAKSKQGR 
VFHYRSARDR 
TQSRPGCYGD 
RRQGAAIiASV 
RTGFPSPAER 
TPDFQBPLVS 
GTAASSBTBV 
BPPLAMAVTE 
RAPVLELBKA 
EHMLPHPTPI 
PRADFRBTGE 
FDTAESPTSO 
DEGPTVNPMD 
KVGVPAMSTL 
SASVSSGBPT 
HA8AEBVHSD 
CXDBVNGFVC 
GBLTSVHSPB 
GEDCWMVAH 
RYQCNEGFAQ 
ERRKHXXHPT 



VSLPSYPRRR 
YALTFABAQE 
RSSLPGVRSY 
QQLHLAWHE6 
PDAYCFRAHH 
^EEETblLE 
APTDPMPRRR 
MLGSGQSRSP 
EGPSARPATP 
8TBANRVEAH 
TSPAQVNKAE 
LQATVDBVQD 
STVTPAPSDA 
GSSSSQPHPB 
VPHDPSSTLL 
PCBNNPCLHO 
LOiPSYGGSP 
SISFZNSFGB 
ESGRWNDVPC 
HHWTIRCRS 



120 
IBO - 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



21 31 41 51 

111)1 

OOOGOGATGG CCAGCACCAQ GAGTATCGAG CTGGAGCACT TTOAGGAAOG GGACAAAAGG 60 

COGCGGOOGG GOTOGOGGAa AQGGGCCXXX: AGCTCCTCCG GGGGCAGGAG CAGCTCQGGC 120 

CCCAAGGGGA ACGGGCTCAT COCCAGTCGG GCGCACAGTG GCCACTQCAG CrTCTACCGC 180 

A060GGAGCC TGCAGGCCCT CAQCTOOOAG AAOAAGGCXA AGAAGG06C6 CTTCTACC6G 240 

AACOGGGACC GCTACTTCAA GGGCCTGGTG TTTGCCATCT CCAGCGACCG CTTCCGGTCC 300 

TT06ATGCGC TCCTCATAGA GCTCACCOGC TCCCTOTCGG ACAAOOTQAA CCTGCCXX3«3 360 

GGTGTCGOCA CTATCTACAC CATOGAGGGC A6C0QQAAGG TCACCAGCCT G6A0GAGGTG 420 

CTGGAAGGTG AGAGTTAOGT GTGTGCATCC AATGAACCAT TTOQTAAAQT 06ATTACAGC 480 

AAAAATATTA ATCCAAACTO GTCTOT G AAC ATCAAQOGTQ GQACATCCCG A60GCTGGCT 540 

GCT6CCTCCT CTGTGAAAAQ TGAAGTAAAA GAAAGTAAAQ ATTTCATCAA ACCCAAGTTA 600 

GTGACTGTGA TTOGAAOTGO AGTGAAOCGT AGAAAAGCOO TGOGGATCCT TCTQAATAAA 660 

AAOACTGCTC ATTOCTTTGA ACAAGTCTTA ACAGATATCA 0CX3AA6CCA7 TAAAGNAGCC 720 

TCAGGAGTCG TCAA8AGGCT CTGCACCCTG GATGGAAAQC AGGTGAOAOT TACGTQTGTG 780 

CATCTOCCAO ACTTTTTTGG TGATGAOOAT GTTTTTATTQ CATGTOGACC AGAAAAATTT 840 

CGTTATGCCC AAGATGACTT TGTCCTGGAT CATAGTGAAT GTOGTGTCCT OAAGTCATCT 900 

TATTCTOQAT CCTCAGCTGT TAAGTATTCT QGATCCAAAA OCCCTGGGGC CTCTOGACGC 960 

AQCCAGATTT CTQCTCATGG CAGATCTTCT TCCAATGTAA ACGOTGGAGC TGAOCTTGAC 1020 

CGTTGCATAA GTCCTGAAGG TQTXSAATGGA AACAOATGCT CTGAATCATC AACTCTTCTT 1080 

GAOAAATACA AAATTGGAAA GGTCATTGGT GATGGCAATT TTGCAGTAGT CAAAGAGTGT 1140 

ATAGACAGGT CCACTGGAAA GGAGTTTGCC CTAAAGATTA TAGACAAAGC CAAATGTTGT 1200 

GGAAAGGAAC ACCTQATTGA GAATGAAQTG TCAATACTGC GCCQAGTGAA ACATCCCAAT 1260 

ATCATTATGC TG6TCGAGGA GATGGAAACA GCAACTGAGC TCTTTCXQQT GATGGAATTO 1320 

GTCAAAGGTG GAGATCTCTT TQATGCAATT ACTTOGTOGA CCAAQTACAC TGAGA6AGAT 1380 

GGCAGTGCCA TGQTQTAC3VA CTTAGCCAAT GCCXntSVGGT ATCTCCATGG CCTCAGCATC 1440 

GTGCACAGAG ACATCAAACC AGAOAATCTC TTGGTGTQTO AATATCCTGA TGGAACTAAG 1500 

TCTTTGAAAC TGGGAGACTT TGGGCTTGCG ACTGTGGTAG AAGGCOCTTT ATACACAGTC 1560 

TGtiGGCACAC CCACTTATGT GGCTCCARAA ATCATTOCTG AAA CTGGC TA TGGCCTGAAG 1620 

GTGGACATTT GGGCAOCTGG TOTGATCACA TACATACTTC TCTGTGGATT CCCAGCATTC 1680 

OGAAGTQAGA ACAATCTCCA GGAAGATCTC TTOSACCAGA TCTTGGCTGG GAAGCTGGAG 1740 

TTT0CX3GCCC CCTACTGGGA TAACATCAC3 GACTCTGCCA AGGAATTAAT CAGTCAAATG 1800 

CTTCAGQTAA ATGTTGAAGC TCGGTGTACC GOGGGACAAA TCCTGAOTCA CCCXrrGGGTO 1860 

TCAOATOATG CCTCCCAGGA GAATAACATO CAAGCTGAQO TGACAGGTAA ACTAA AACAG 1920 

CACTTTAATA ATGCX3CTCCC CAAACA6AAC AGCACTACCA C06GGGTCTC CX3TCATCATG 1980 

GTGAGTGQAA GGOGGCAGGT CTGGCCTQAC TGaSGAGCOG GCCTTQAAGT TTTTGAATTA 2040 

GGTAGCCGGG AOCTGCCCTC ACATGQAAGT TGGTGCCTTC COTAGTCCTA TTTCATATGA 2100 

AQATTGGCTT QGCATGTGGA GGGCACTCAT TCXX3CAACTC OCAGGCTTTO GGCACT6TGT 2160 

GGAGGGGCTT GTGTAQGGAC OVGCAGGCCT GGTGTGAGGG GT0CAS606T CAAG6AGCTC 3220 

CTGGCTGGGC GCTCTGGGCA GCTGCTTCCA CTCTTGTCTC TGCCTTCTCA TCTAGAGAGA 2280 

CTCCCAAOGC CIGQAQQQGT GTOTT G TGTT AOGAATTAAC TCCCTGOCTA CCCCAAGGCC 2340 

TCAGAAATAO ATTATTAGAG ATGTGAATTA TTCTTTGAGA CTTGGGATAA GAAACAGCCA 2400 

AAGCTAAACA TATTTCAGTT TTAAAAAATC AGTGTTTTAT AAAACACAGT TTGGG GCTTT 2460 

TAAAGGTACA TAATCAAGGA AAAAAATATA TATTCATTTT TCAGGGTTGG TAACATTTTA 2520 

TGAQAIGTCA GTGACAAOGA TGGCCTTATT TTTTTCAGCC TTTTCTTCTT OCAAAATGTT 2580 

TCTTAAG6CA ACICTCCTAA ATACATAAAC ACAACAAATT AAAATGAAAA GTGACATQAG 2640 

AGTAAATGAA TCAAAAGGAA AAAACATTGA ACCAGAGGTG AGG6CAGCAC ACCCQCAQCA 2700 

GCTQTCCAGG CCTGAGCCAA TQCAACCCTO GGC6GGAAGG CCAGCTCACC 0T6AGCAGGT 2760 

AQAAGCCAGC CAGCCACCCA GGCAGGGACC TTGGTTCTCC OCACACACTC CCAGGAGCAG 2820 

GGAACAGGGG TGGAGTGGCC TTTCCCAQAO CTGGAGTTGG CTOCAGCAGC TTTCGAATCA 2880 

QACCTGCCAA GOTGATGGGC GTCTGAGTTT CACATCTGGG CCCCCOQTOA CCCCACTGAO 2940 

TCCTQACAGC TAAGGATGGG CX3VCCTCCAC AGCTCOGTCA CTCQTACTTG GGACAGOCCT 3000 

CTCATCCTCT GGGAAOOTCC TCCTTGTTTC CTACCCaUlCT AGAAGGGAAA CAGTGGCATA 3060 

TTCTCATGGT ACATGGTTGT CTGAAAGCCT TACCTAGGAA OAGQCAOQQT CTAQATAGAA 3130 

OCTATAAGGA AGCCACACAC ATAACCCACA TCCCCACACC CCCAACATOC CXXACACTCC 3180 
CCACACCCCC CACAOCCCCC AOITOOOCAC CATAATTACC CCCACCICCA AATATCTCAT 
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Seq ID NOt 364 Protein aeguenee 
Protein Accession #: Bos sequence 



^1 11 31 31 41 51 

5 I I I I I I 

MASTRSXSLB HFBERDKRFR PGSRRGAPSS SGOSSSSGPK GMOLIPSPAR SAHCSPYRTR 60 

TX^ALSSEKK AXKARFYRNQ DRYPKGLVPA I8SDRFR8FD ALLZBLTRSL SDNVllIiPQOV 130 

RTIYTIDGSR KVTSUJELLE GESYVCASNB PFRKVDYTKN INPNWSVNIK GGTSRAIAAA 180 

SSVKSEVKES KDPIKPKLVT VIRSGVKPRK AVRILIjNKKT AHSFEQVLTD ITEAIKXASQ 240 

10 WKRUTTLDG KQVRVTCVHL PDFP6DDDVF lACGPBKPRY AQDDFVLDHS ECRVLKSSYS 300 

RBSAVmSGS KSPGPSRRSQ ISAHGRS9SN VNGQPELDRC ZSPE6VNGMR CSESSTIfLEK 360 

YKIGICVIGDG NPAWKECID RSTGKEFALK XIDKAKCOSK EBL1B1BV8I LRRVKKPNII 430 

MLVEEMBTAT ELFLVMELVK GGDLFDAITS STKYTERDQS AMVYNIiAMAL RYLHGLSIVH 480 

RDIKPENLLV CEYPDGTKSL KLGDPGLATV VEGPLYTVCG TPTYVAPXII AETQYGLKVD 540 

IS ZWAAGVITYI LLOGPPPFRS ENNLQEDIiFD QIIAGKIiBFP APYHDNZTDS AKELZSQMLQ 600 

VHVBARCTAG QZL8HPWV8D DASQENNMQA BVTQXXiKQHP lOlALPXQHST TT0VSVXMV8 660 
GRRQVWPDOS AGLEVPEL6S RELP8H6SHC LP 



Seq ZD NO: 265 DNIA eequence 
20 Nucleic Acid Accession 8$ AB020684.1 
coding sequence* 1..1744 

1 11 21 31 41 51 

1 1 I I I I 

25 CCCCCTTGTC ATTAATACAT TAAAAAQATT CAATCTTTAC CCTGAGGTAA 'rmXS GCCAG 60 

TTGGTACCJGG ATTTATACCA AAATAAT6GA CTTGATTGGT ATTCAAACCA AGATATGTTG 120 

GACGGTTACC AGA6GAGAA6 6ACTCA6TCC TATTGAAAGC TGTGAAaOAT TGGGAGATCC 180 

TGCTTGCTTT TATQTTGCTG TAATTTTTAT TTTAAATGGA CTAATGATGQ CATTATTCTT 240 

CATATATGGC ACATATTTAA GTGGCAGCCO ATTAGGAGGC CTGGTTACAG TQTTGTGCTT 300 

30 CTTTTTCAAT CATGQAGAGT GTACCCX5TGT AATGTGGACA CCACCTCTCC QTGAAAGCTT . 360 

CTCATATCCA TTTCTTGTTC TTCAGATGTT GCTA6TQACT CATATTCTCA GGGCT ACAAA 420 

ACTTTATAOA QQAAGCTTOA TTGCACTCTO CATTTOCAAT GTATTTTTCA TGCTTCCTTO 480 

GCAGTTTGCT CAGTTTGTAC TTCTTACTCA GATTGCATCA TTATTT6CA0 TATATGTTGT 540 

CGGC3TACATT GATATATGTA AATTACQOAA GATCATTTAT ATACACATGA TTTCTCTTGC 600 

35 ACTTTGTTTT GTTTTGATGT TTGGQAACTC AATGTTATTA ACTTCTTATT ATGCTTCTTC 660 

TTTGGTAATT ATTTGGGGTA TTCTGGCAAT GAAACXSVCAT TTCCTGAAAA TAAATGTATC 720 

TOAACTTAGT TTATQGGTTA TTCAAGQATG TTTTTGGTTA TTTGGAACtG TCATACTTAA 780 

ATACTTGACA TCTAAAATTT TTGGTATTQC AGATGAOSCT CATATTGGCA ACTTACTAAC 840 

ATCAAAATTC TTTAOTTATA AQQATTTTGA TACTTTATTG TATACXTTCTG CAGOG GAGTT 900 

40 TGACTTTATG GAAAAAGAGA CTCCACTQAG ATACACAAAG ACAT TATTOC TTCCAGir GT 960 

TCTTGTAGTG TTTGTTGCTA TTGTTAOAAA QATTATTAOT (3ATAT6TCX3G GTGTCTTAGC 1020 

TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATC3VT GGAGAOCTGG TTTACCATGC 1080 

ATTGCAATTQ TTAGCATATA CAGCCCTTGG TATTTTAATT ATQAQACTAA AACTCTTCTT 1140 

GACAGCACSIC ATGTQTOTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGGATGGCT 1200 

45 CTTTTGCAAA GTACATOCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 1260 

AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTaTAOGG GA6TTCAGCA ATTTGOCOCA 1320 

AGAAQAACTT ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGOGGGTSC 1380 

CATGCCCACG ATGOCAAOIO TTAAGCTCTC TGCACTTCXSG CCCATTGTGA ATCATCCACA 1440 

TTATQAAGAC OCAGOCTTGA OAGCCAOAAC AAAAATAGTA TACTCAATGT ATAQTCGGAA 1500 

50 AGCAGCGGAA GAAGTQAAGC GAGAACTGAT AAAGTTAAAA QTQAACTATT ACATTCTAGA 1560 

AGA6TCATG0 TOTOTAAOAA GATCGAAGCC TGGTTGCAGT ATGCCTQAAA TTTGGGATGT 1620 

AGAAGATCCr GCCAATGCTG GGAAAACTCC CTTATGTAAC CTCTTGOTGA AGGAT TCCAA 1680 

ACXTCACTTC ACGACTGTAT TCCAOAACAO TOTTTACAAA GTCCTAGAAG TTGT AAAAGA 1740 

AT6ACTGCTA CAT6ACCTGC TGOCTAOGQA OAACTACATC TGTAATGGTT TTAATGTTTT IBOO 

55 GCTAAGTCAT GTQTTQTTCA TATCCCAAAA ACTTTTATAG GTAACTGTTT TCAA ATAG AA 1860 

AACGTTTTAT TTGGTCAATT TGAATOTCAT TCTAATTATA AAAATGACTT ACACCT TTAT 1930 

C3UVTTQG1TA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTT6 1980 

TACTT6ACTT TAATATTTQT GCTAAAOTQA OC3UU«3CTAC CTGTATAAAG AAAACACAGT 2040 

GGGTTGT6AC AAGGATGACA TGAAAATACA GGACAATTCT GACAATQTAG GGGCTGATTT 2100 

60 TATAQTQTAA OAACTATTAA TGCCCCTTGC TTCTTTTTTC TGCCTCTTGC TCTTGTCTTT 2160 

TQQACATTTC AGTOATTGTA AGTTCTTOGQ TCATGTCAGC CCCTGTCATC A ACTTG AGTT 3220 

ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTATCTGTTT GTTTTACTTC 2280 

CTTGTGOGCT TTTTGTTCTC TGTTCTCTTG TTAATGAAGC TTTTCCTGCC CATTATTAAT 2340 

^_ CCAAACTCTT G6ACCTTGTG OTTAQGAAAT TCCCTTAACT TCCAGCX^^TA TGGCATTATC 2400 

65 GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT CTTC C0C3VTA TTTTCTGTCA 2460 

AATAAGTACT GTTTACTCAT TTAGTTGCTT ATCAAQTACT TATTCTTGOT T TTAA AAAAA 2520 

ATTAATGGTA ACTOTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT ATTT TAATA C 2580 

CTTTAAACCA CTTTAAAGTT TTTTCATGTT TAATTATAGT TTTAAQAAAA ACTATTTTGA 2640 

ACAACCOCAA ATATAGT6CA TCTAGAAACT AATOTATATT TGATTAOACA TCATTTATAG 2700 

70 TGQAACAGTA GACTOTAGTA CATGGTAATT TTTCTTTTAC TATTAAGATA CAATA AAACA 2760 

TGACTAATTT TQCTGTC3VAA AATGTAAAGA ATAATGATAA ATGGAQTTTT TTA TATTTT A 2820 

CTTTTAAOAT TGCCTGTCTT TAATAAGACA AAGCCTTAAG CCTTATGTTA TAATTTTGGT 2880 

TCTAAAAACC ATCATTTCAG TATAAGOAAT AAQTAXATTT O OTCCT CCIC TTTAQTTTTT 2940 

TTCTTCCTAT TTATTTTTAT TTTGAAAAAT TTCTACACCT TCTTTGAATT CCTTQTATGA 3000 

75 ATTTTTOTTT CTTAGAAGTT AATTTGTGTQ AAATGAGATT CTTCAAAACX5 ATGAAACXrTC 3060 

ATAGCTCTQA GAAAAGGrTT TAGGGTTTTA AATTCTAAGC AAAOOGTGAC TATGGCTGAC 3120 

AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT AACCTCATTO 3180 

TSGATPGTCC TTCAGAGCTT AGTGCTCAGG CATOGTTTCT GGTOCXXSVCT CCTGQAAQCC 3240 

GCTOTTCCCT TTCTACCTTC TTAOaGAOC CCAAGGOCAG QCCTGQTCCC GGGGAAGCAG 3300 

80 CAGCTTGCTG ACATAAGTCA GCTGCAAAGO CTGAGGAGTG TGCCCTCAQA GAAGCAOOGC 3360 

CCCCCAGTCT TOTGCCAGCG CCTAGAGCXXS CAGCTCCCAG GGATGCTCCT TCCXTT GOAGG 3420 

CAGCGCAGGA GAGGGACTCT GGCAGOGTTC TTCAGATTTG TGGCCACTQT TTCTCATTTO 3480 
CTG8TTQACT GTTTTTATTT CTTAGOCTTT TQCTAOTTTT AOAAAATAGG GAAGCA GCOC 3S40 
TTOATTTarO GATTAAAAGC AACATTTGAO OQATOATOCA CAACAGTGCA OOAAAATOOO 3600 

1055 



wo 03/042661 



PCT/US02/36810 



06GTGGAC;VC TT6AGGCTGA GGJITGGGAQT TGACAT6AGC AGGQAGAGGG AGGTGOGCGC 3660 

TGCTTAT CTG TGATTGTTGC TCACCTGAGT GTGGCTGATT GTGTACATCC AGCAGTTACA 3720 

ATTTTTAAAA ATTATACTTT TACATTTATT TTATATTTTT CTCACCCCCA GTAATTTCCT 3780 

. TCCAAAGAAG TTCACATGTA ATAAGTAGAA ATTCTQTATA GGAAAAAAGC ATTAAAAATA 3840 

^ CTATTATAAC TGCTTCATTT GCTGGQAACC ATTAAAAGTA ATATAAATTA GCTTTTTCCA 390O 

GAAGGATCCT TTTGTAGCAO TGTTTATGAA TGTAACCCCX: AGCAAAATAT GQCTATATAT 3960 

TAGGOGAGCC AGTTTGGAGC AGAGGCCTGA AGGTCCCTGC TATGCAGCOG TGGCCACAGC 4020 

TCGCAGCCCA AOCACTGTGG AGCATCCACA CCTTTQATGG OUVTGCAGAT TGOTAGCAGQ 4080 

TTCCATAGGC GTACAAAACA GTATTAAAGC TCAGTGTTTT GCATArTGTT AGCATTTACA 4140 

lU AATATTTTTO CTTTAGTATG AGGAAAGTAA GGATGGGCAA AGAAGOGATC AAAATAGCTA 4200 

TTGCTACAAC ATTTTCGAAA ACAAAGTTOa GGCZOTATTT CTTTAAAAAa ATAAGCCTCT 4260 

AAAAATGCTT GGCAAAAAAA ATATAGTGTT AAAATAOGCC AGTGATATTA ATGAGAAAAT 4320 

GAAAGTATGT ATCA08AATA AAGTGATATT GCATAGGAOT ATTOTATTTT TATGAATTTT 4380 
AT6CCAGTTG TTTACATGTA CTATATAT6T TAAATTAAAA AAAATGATGA 6AAATG 



15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



8eq ID NO: 266 Protein sequence 
Protein Accession #i BAA74900.1 

1 11 21 31 41 51 

i 1 I I I I 

PLVINTLKRP KLYPEVILAS HYRIYTKIMD LIOIQTKICN TVTRGEGLSP XE8CEGLGDP 60 

ACPyVAVIFI LNQU1MALPF XYGTYIiSGSR LQGLVTVLCF FFNHGECTRV MWTPPLRESF 120 

SyPPLVLQMIi LVTHILRATK LYRGSLIALC ISNVPFMLPW QFAQFVLIiTQ lASLPAVYW 180 

GYIDICKLRK IIYIHMISLA LCFVLMFGMS MLLTSYYASS LVIIWGILAM KPHFLKINVS 240 

ELSLWVIQGC PWLFGTVILK YLTSKIFQIA DDAHXGNLLT 8KFFSYKDPD TLLYTCAABF 300 

SFMEKETPIA YTKTLLLFW LWFVAIVRK II8I»<WGVLA KQQfTHVSXHQ FDHGELVYHA 360 

LQLLAYTALa ILIMRLKLFL TPHMCVMASIi ICSRQLFGWL FCKVHP6AIV FAXLAAMSXQ 420 

GSANLQTQWN IV6BFSNLPQ EBLIEWIKYS TKPDAVPAGA NPTMASVKLS ALRPIVNHFH 480 

YEDAGLRART KIVYSMYSRK AAEEVKRELI KLKVNYYILE ESHCVRRSKP GCSMPBIHDV 540 
BDPANAGKTP LQILLVKDSK PHPTTVPQHS VYKVLEWKE 

8eg ID NOt 267 DMA sequence 
Nucleic Acid Accession « i U26744 . 1 
Coding sequence: 59. .1600 

1 11 21 31 41 51 

I I I I I I 

CTTCAAAGAA TATAGATGGT TTTGAAAAGT TCATGCTGTC CCTTCATTGA ATTTTAGAAT 60 

GATTGAAGAT AGTGGGAAAA 6AGGAAATAC CATQGCAGAA AGAAQACAQC TGTTTGCAGA 120 

0ATQA6GGCT CAAQATCTGG ATOGCATCCG ACTCTCCACC TACAGAACAG CATGC3UVGCT 180 

TAOOTTTGrT CAOAAGAAAT GCAATTTGCA CCT QQ TgqAC ATATGQAATO TCATAGAAGC 240 

ATTGGGG6AA AATGCTCTGA ACAAGCTG6A CCCAAACACT GAACTCAAOG TGTCC06CTT 300 

AGAGGCTGTG CTCTCCACTA TTTTTTACCA GCTCAACAAA CGQATGCCAA CCACTCACCA 360 

AATCCATGTG GAGCAGTCCA TCAGCCTCCT CCTTAACTTC CTGCTT6CAG CGTTTGATCX: 420 

GGAA6GCCAT G6TAAAATTT CAGTATTTGC TGTCAAAAT6 GCTTTAGCCA CATTGTGTGG 480 

AGOGAAOATC ATGQACAAAT TAAOATATAT TTTCTCAATQ ATTTCTGACT CCAOTaOQQT 540 

GATOGTTTAT GGA06ATATG ACCAATTCCT TCGGGAA6TT CTCAAACTAC CCACXSGAAGT 600 

TTTGGAOGOT CCTTCATTTO GTTACACAaA ACAGTCAQCC AGATCCTGTT TCTCCCAACA 660 

GAAAAAAGTC ACGTTAAATG GTTTCTTGGA CAOGCTTATQ TCAGATCCTC CCCCQCAOTG 720 

TCT GQTCTG G TTGCCTCTTC TGCATGGACT AGCAAATGTG 6AAAATGTCT TGCATCOGGT 780 

TGAGTOTTCC TACTGCCACA GTGAOAGTAT GATGGGATTT GQCTAG06AT 60CAACA0T0 840 

TCACAATTAC CAGCTCTGTC AGGACTGCTT GTGGAGGGGA CATGCC6GTG GTTCTCATAG 900 

CAACCAGCAC CAAATGAAAG AGTACACX3TC ATGGAAATCA CCTGCTAAGA AGCTGACTAA 960 

TGCATTAAGC AAGTCCCTGA 6CTGTGCTTC CAGCCGTGAA CCTTTGCACC CCRTGTTCCC 1020 

AGATCAGCXIT GAGAAGCCAC TCAACTTGGC TCACATOGTT GATACTTGGC CTCCCAGACC 1080 

TGTAACCAGC ATGAAOSACA CCCTGTT C TC CaVCTCTGTT CCXTTCCTCAG GAAOTCCTTT 1140 

TATTACCAQQ A6CAT6CTTG A6AGTTCAAA C0GGCTT6AT GAAGAACACA 06CTAATT6C 1200 

CAGGTATGCG 6CAA66CTGO CAGCAGAGTC CTCTTCGTCT CAOCCACCTC AGCASAQAAG 1260 

TGCrCCTQAC ATCTCTTTCA CCATCGATGC GAATAAGCAQ CAAAGGCAGC TGATTGCTGA 1320 

GCTAGAAAAC AA6AACAGA6 AAATCTTACA GGAQATOCAG AGACTTCGGC TAQAGCATGA 1380 

ACAAGCTTCT CAGCCCAGGC CAGA6AAG6C ACA6CAAAAC CCX»CCCT6C TGGCAGAACT 1440 

CCQGCTCCTC AGACA6C36CA AA6ATGAGCT GGAACAGAGA ATGTCTGCTC TCCAQGA6AG 1500 

COGGAGAQAG CTAATGGTCC AGTTGGAGGG TCTCATGAAG CTACTAAAGG AAGAAGAACT 1S60 

GAAGCAGGGA GTAAOTTATG TCCCCTACTG CAGGTCXTAA CTAACA6TGQ AOQGGCCTQC 1620 

CGACCTGCGG TTTTCTCATT GCTTTTGCTC TAATGTAT6T TCAT6CTTGA 6TTTGGAAAG 1680 

AGAAAAAAGT CATACTAATT TQCTTCTTTT TCAATGTAGT 6CTT6AATT6 AGATATATAA 1740 

ATTTAGCATT TTTTATAACT ATCACTACTA TCCACATCAA AAGAAGAACT ATQACATCTT 1800 

TTAQAAAAGG GAAOGAArPG TCATTTATTO GAAACATTTT AGATCCCCAG AGGTATAAGT 1860 

TTCAAACCAG TCTTAGCTTT TCAAQTTOTT GATCAOACCC TTCTCTTAAC AGAGAGATAC 1920 

CACAGTCACT AGAGATACCC TGAGGTTCAT GTCATCCCAA AACCCACAGC ACTCAGAAGC 1980 

TAACCTCTAC ACCCACTCAC ACTGrOftOTA TTGASTtCGG TXTCATTTTA CTQAAAAOCT 2040 

GTQAAACCTC TTTTTATAAA AATCAG6CAA TTAAATCCCT TTTCATCACA CAATTATTGA 2100 

GCCTTOTTCC CCATGGCTCA CCAAAATOTG CTCAATTTTG TQAOAQAAAG ACTGTACTCC 2160 

ATAACTGACT ATTCACGTCC CATCTTTTTG GCTCTTCCCC AAAGCAGAAT CCTTACTGTT 2220 

GQTTGACAGT AATCTCTTTT TAAAAAGTAA CTCTCAGCTT TTTCCTTAGC ACCAGAGCCT 2260 

TTCGGCTCOB GOAOACQAGA GOGTCATTAC ATACTTTTTT rmiTCI ' GU AAATAGGGGC 2340 

ATTGTQACTT TATAGCCTAA ACTGGAGCTG TCTGAAOCTG TQGTCAGGCT CAAGAGCCAO 2400 
CAGGQGGAGC AGCAAACTC 

8eq ID NO: 268 Protein sequence 
Protein Accession tti AAC50426.1 

1 11 21 31 41 51 

1 I I i t I 
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MXBDSGKRQI TMAERRQIiFA B4RAQDU>RI RLSTYRTAOC UIFVQKKOJL HLVDIWNVIB 60 

ALRENALNML DPNTBLNVSR LEAVI,STIFY QLKKRKPTTH QIHVCQSZSL LLNFLLAAPD 120 

PEGHGKISVP AVKMALATLC QGKIMDKLRY IPSMISOSSO VMVYGRYDQP LRBVIJCLPTB 180 

VIiEGPSFGYT BQSARSCPSQ QKKVTliNGFL DTIXSDPPPQ CLVNLPZiUIR LANVEMVPHP 240 

VECSYCHSES MMGFRYRCQQ CHNYQLOQDC FWRGHAGQSH SNQKQMKEYT SWKSPAKKLT 300 

NALSKSLSCA SSRBPIjHPMF FDQPBXPUIL AHIVDTHPPR PVTSMNDTLP SHSVPSSGSP 360 

PXTR8MLB8S NRLDBBHRLI ARYAARLAAE SSSSQPPQQR SAPDI6FTID ANKQQRQLIA 420 

SEiEmOIRBXL QBZQRLRLEH BQASQPTPEK AQQNFTLLAB LRLIiRQRKDB LEQRMSAI^B 480 
SRRBUfVQbB GLfOOiUCBBB LKQCVSYVPY CRS 

Seq ID NOi 269 DMA sequence 

Nucleic Acid Accession ftt NM_001276.1 

Coding sequence 1 127.. 1278 

1 11 21 31 41 51 

I I 1 I I I 

AGTGGAGTGG GACAGGTATA TAAAGGAAOT ACAGS60CTG GGSAAGAGQC CCTQTCTAG6 60 

TAGCTGGCAC CAGGAGCOGT GGQCAAGGGA AGAGGCCACA CCCTGCCCTO CTCTGCTGCA 120 

GCCAGAATGG GTGTCAAGGC GTCTCAAACA GGCTTTGTGG TCCTGGTGCT GCTCCAGTGC 180 

TGCTCTQCAT ACAAACT6GT CTGCTACTAC ACCAGCTGGT CCCAGTACOQ GGAAGGOGAT 240 

QQGAOCTGCT TCCCAOATGC CCTTGACOQC TTGCTCTGTA CCCACATCAT CTACAGCTTT 300 

GCCAATATAA GCAAGQATCA CATCQACACC TGGGAGTGGA ATGATGT6AC 6CTCTAGGGC 360 

ATGCTCAACA CACTCAAGAA CAGGAACCCC AACCTGAAGA CTCTCrrOTC TGTOGOAGGA 420 

TQGAACTTTG GGTCTCAAAG ATTTTCCAAG ATAQCCTCCA ACACCCASAG TOGC CGGA CT 480 

TTCATCAAOT CA6TACGGCC ATTCCTOCOC ACCCATGGCT TTGATGCGCT GGACCTTGCC 540 

TGGCTCTAGC CTGGAOQQAO AGACAAACAO CATTTTACCA OOCTAATCAA GGAAATGAAO 600 

GC06AATTTA TAAAOOAAGC CCAOCCAGGG AAAAAGCAGC TCCTGCTCAG GGCAGCACTG 660 

TCrOCGGGGA AGGTCACCAT TQACAGCAGC TATOACATTG CCAAGATATC CCAACACCTG 720 

GATTTCATTA GCATCATGAC CTACGATTTT CATOGAGCCT GQCGTGGOAC CACAGOCXaVT 780 

CACAOTCCCC TGTTCOGAGG TCAGGAGGAT GCAAOTCCTG ACAGA TTCAQ CAACACTGAC 840 

TATGCTGTGG GGTACATOTT GAGGCTGGGO OCTCCTGCCA G7AAGCTQGT GATOGOCATC 900 

CCCACXTOXI OGAGGAGCTT CACTCTGQCT TCTTCTGAGA CTGOTOTTGO AOCCOCAATC 960 

TCAGGACCGG GAATTCCAGG COGGTTCACC AAGGAGQCAG GGACCCTTGC CTACTATGAG 1020 

ATCTGTOACT TCCTCOGOGG AGCCACAGTC CATAGAACCC TCGGCCAGCA GGTCCCCTAT 1080 

GCXAGCAAGG GCAACCAGTG GGTAGGATAC GAOOACCAGO AAAGC6TCAA AAGCAAGGTG 1140 

CAGTAGCTGA AGOATAGGCA OCTGGGAGGC GCCATGGTAT GGGCCX:TGGA GCTGGATOAC 1200 

TTCCAGGGCT CCTTCT6C3GO CCAG6ATCTG CGCTTCOCTC TCACCAATGC CATCAAGGAT 1260 

GCACTOGCTQ CAACX3TAGCC CTCTGTTCTO CACACAGCAC GGGGGCCAAQ GATGCCCCGT 1320 

CCCCCTCTGO CTCCAOCTGG COSGGAGCCT GATCACCTGC CCTGCTGAGT CCCAGGCTGA 1380 

GCCTCAGTCT CCCTCCCTTG GGGCCTATGC A6AQQTCCAC AACACACA6A TTTGAGCTCA 1440 

GCCCTGGTGQ GCAOAGAGGT AGGGATGGGG CT0TG6GGAT A GTQAG GCAT CQCAATQTAA 1500 

GACTCGGGAT TAGTACACAC TTOTTGATQA TTAATGGAAA TOTTTACASA TCCCCAAGGC 1560 

TGQCAAGGGA ATTTCTTCAA CTCCCTGCCC CCTAQCCCTC CTTATCAAAG GACACX3VTTT 1620 

TQGCAAGCTC TATCACCAAG GAGCCAAACA TCCTACAAGA CACAGTGACC ATACTAATTA 1680 

TACCCCCTGC AAAGCXaCCT TGAAACCTTC ACTTAGOAAC OTAATOGTGT CCCCT ATCC T 1740 

ACTTOCCCTT OCTAATTCCA CAGCTGCTCA ATAAAQTACA ABAOTTTAAC AOTOTGT TGQ 1800 

0GCTTT6CTT TGGTCTATCT TTGAOCOCCC ACTAQAOCCA CIGGACTCAC CTCCCCCATC 1860 

TCTTCIGGGT TGCTTCCTGT GAGCCTTGG6 ACCCCTGAGC TTGCASA6AT 6AAGGC03CC 1920 
ATGTT 

Seq ID NOt 270 Protein .secmence 
Protein Accession #t 2IP_001267.1 

1 11 21 31 41 51 

1 I I I 1 I 

MGVKASQT6F WLVUiQCCS AYKLVCyVTS HSOYRBODGS CPPDALDRFL CIHIIYSFAN 60 

ISNDHIDTWE HNDVTLYGML NTIfRKRNPIlL KTIiLSVGGHN FGSQRFSKXA 8NTQSRRTFI 120 

KSVPPFLRTH GFDGLDLAWL YPGRRDKQHP TTLIKEMKAB PIKEAQPGKK QLLLSAALSA 180 

QKVTIDSSTO lAKISQHLDF ISIMTYDPHQ AWRGTTGHHS PLFRGQEDAS PDRPSNTDYA 240 

VOYMLRLGAP ASKLVK6IPT FGRSFTLASS BTGVGAPISa POZPGRFTKE AGTLAYYBIC 300 

DPIiRGATVHR TLGQQVPYAT KSNQWVGYIK} QBSVRSKVQY LXDRQLA6AN VirAU}U»>FQ 360 
GSFCXSQDLRF PLTNAZRCAL AAT 

Seq ID NO I 271 DMA sequence 

Nucleic Acid Accession #: NM_006474.1 

Goding sequence i 181. .669 

1 11 21 31 41 51 

I i i i I I 

GCPGCCTAGG GTCTGGAAAG CTOGGGCACC CTCCCTCTOC GGGGCTCCTG CTCXXIA CCCX: 60 

TCOQGOOOCC CCACOGTCGC GCTCCTOCAG GCTGGGGCTG T660C6CGGT GCT1TTAATI 120 

TTCCCCCAQC TCAGAATCTT GCTG CT OGGC CCOCAOGAGA GCAACAACTC AAO GTOAA OB 180 

ATGTGGAAGO TGTCAGCTCT QCTCTTOGTT TTGGGAAGCG OGTOG CTCTG GGTCCTGOCA 240 

QAAGGAGCCA OCACAGGCCA GCCAGAAGAT GACACTGAGA CTACAGGTTT GGAAGGCQGC 300 

GTTGCCATGC CAGGTGCC6A AGATGATOXO OTGACTCCA6 GAACCAGOQA AGACCGCTAT 360 

AAQTCTQGCr TGACAACTCT GGTG6CAAGA AGTGTCAACA GTGTAA CABG CATTOGCATC 420 

GAGGATCTGC CAACTTCAGA AA6CACAGTC CAOGOQCAAG AACAAAGTCC AAGOGCCACA 480 

GCCTCAAAOO TGGCXIACCAG TCACTCCACG QAGAAAGTGG ATGGACACAC ACAGACAACA 540 

GTTGAGAAAO ATGOTTTGrC AACAGTGACC CTGOTTGGAA TCATAGTTGa GGTCTTACTA 600 

GCCATOGGTT TCATTG O TG Q AATCATOGTT 6T0GTTAT6C 6AAAAATGTC GGGAAOOTAC 660 

TOaCCCTAAA GAGCTGAAGG GTTAOOOOCT GCTIGCCAAC OTGCTTTAAA AAAAGACCGT 720 

TTCTOACTCT GTQGCCCTGT G0CT6AGCTC GTOOQOAGAA GATGACCCTO GGAACATTTO 780 

OOOGCCCATT CSUSATTCCAC GGTGACTTTC OGTTTGOCAA ATTAACOQAO GAAAGACCTT 840 
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TCACCAGATT TOQTTCTTAA ACTTT 

Seq lO NO: 272 Protein sequence 
Protein Accession tf: NP_006465.1 

1 11 21 31 41 51 

1 I I I I i 

MWKVSALLFV liGSASLWVLA EGASTGQPED DTBTTGLEGG VAMPGAEDDV VTPGTSgpRY 60 
KSGLTTLVAT SVNSVTOIRI BDIiPTSESTV HAQBQSPSAT ASMVATSHST ERVD6DTQTT 120 
VGKDGLSTVT LVGIIVGVLL AIGFIGQIIV WMRKMSGRY SP 

8eq ID NOi 273 DMA sequence 

Mucleic Acid Accession ft: CAT cluster 

1 11 21 31 41 51 

I I I I I I 

GCGGCOGCCA GCTTOCAAAG COGAAGTCTG GCO QO GCTCT TOQACTOGCT GCGOC^GGTC 60 

CCCGGGG6TG COGAGCOSGC G6GGGGTGAG GTG6CTGG6C CGGCX3GCGGG GCTAGGAQGT 120 

GCGGGCACTQ GGGGCGOGGG AGGGGACGTG GCAOGCCOOG CGGGGGCCAC G6CGATCCCA IBO 

GG6GCCAGGA AGGTCCOGCT 606GGCA06C AATCT6CCTC OGTCCTTCTT CACQGA6CCG 240 

TG00GG6CAG G0GGGGGC6G GTGTGGCCCa T OSGOG C O OQ ACGTOAGCTT GGGOSAGCTG 300 

GAQAAGG600 CGOAGGCaST OGAGTTCTTT GAGC TOC TG G GGCCOQACTA GGGGQCGGGC 360 

ACGGAG6CGG CAGTCTTGCT TGCOGCCGAG CCTCTCGACG TGTTCCCOGC CGGAOCCTCC 420 

GTACTGCX3QG QACCCCCGGA GCTGGAOCCC GGCXTTCTTTG AGCCX3CCGCC GGCAGTGGTG 480 

GGAAACCTAC TGTACCCCGA GCCCTGGAGC GTCCCX3GGCT GCTCCXX3GAC CAAAAAGA6C 540 

CCCTT8ACTO CCCTCC6CX30 OaQGTTGACC TTQAAOQAGC OCTTQAGCCC CCTGTAGCCC 600 

GCOGCTOOOA ATTTCTCC03 GCGGGGAOOA OGGGCGGGGC CATTTG6CTT CTTTOGCCCC 660 
CTTCTTTCCA GACTGCGCTT TGC 

Seq ID NOi 274 DMA sequence 

Nucleic Acid Accession #s Bos sequence 

1 11 21 31 41 51 

I 1 I I I I 

CAAAGAGGCC GGGGTCCAGC T0CX3Q GG GT C CCCGCAGTAC GGAGGCTCGG GOGGGGAACA 60 

OQTOQAGAGG CTCGGCGGCA AGCAAGACTG CCGGCTCOGT GC06GGGG06 TAGTGGGGCC 120 

CCAGCA6CTC AAAGAACTCC AG0GCCTCC6 CGCCCTTCTC CAOGTOGCCC AAGCTCACGT 180 

COSGCXrCGA OGGQCCACAC CCXSCCGCOGC CTGCCCGGGA CGGCTCOGTG AAGAAGQAC6 240 

GAGGCAGATT GOGTGCCCGC AGOGGGACCT TCCTGGCCCC TGGGATOOCC GTGQCCCCCG 300 

CBQOGCCTGC CACQTCCCCT COCXSOQCXXX: CAGTGCCOSC ACXITCCTAGC CCGGCOQCCG 360 

GGQCAGCCAC CTCACCCCCC GCOGGCTGOG CAG0CCCQG6 GAOOTGQOQC AGOGAOTCSA 420 
AGAGGGGGGC CAGACTTGQG CTTTGCAAGC TG0CX3GCG0C 

Seq ID NOi 275 DNA Bcquence 

Nucleic Acid Accession «s NM^OOlllB.l 

Coding sequence t 74 . . 1651 " 

1 11 21 31 41 51 

I i I I I I 

AGOXAGAGA CACATTGGGG CTQACCTGCC GCTGCTGTCA GTGGGAGGCC AGTG6TGCTG 60 

GCCAAGAAGT GTCATGGCTG GTGTCGTGCA CGTTTCCCTG GCTGCTCACT GCGGGGCCTG 120 

TCGGTGGG6C CGGGOCAOAC TCCGCAAAGG ACG06CM5CC TGCAAGTCCG CGGCCCAGAG IBO 

ACACATTGGG GCTQACCTGC CGCTGCTGTC AGTGGGAGGC CAGTQOTGCT GGCCAAGAAG 240 

TGTCATQGCT GGTGTOGTGC ACGTTTCCCT GGCTGCTCTC CTCCTGCTGC CTATGOCCCC 300 

T6CCATGCAT TCTGACT6CA 7CTTCAA6AA GGASCAA6CC AT8TGGCTG0 AGAAGATGCA 360 

GAGGGCCAAT GAGCTGATG6 GCTTCAATQA TTOCTCTOCA GOCTOTOCTG 6GATGTGGGA 420 

CAACATCACG TGTTGGAAaC C06CCCATGT GGGTGAGATG GTCCT G GTCA GCTGCOCTGBV 480 

GCrCTTCCGA ATCTTCAACC CAGACCAAGT CTGGGAGACC GAAACCATTG GAGAGTCTGA 540 

TTTTGGTQAC AGTAACTCCT TAOATCTCTC AGACATGGGA GTGGTGAGCC GGAACTGCAC 600 

GGAGGATGGC TGGTOGGAAC CCTTCCCTCA TTACTTTGAT GCCTGTQGGT TTGATGAATA 660 

TGAATCTGAG ACTGGGGAOC AOGATTATTA CTA0CT6TCA QT6AAGGGCC TCTACAC3GGT 720 

TGGCTAC3VGC ACATCCCTCG TCACCCTCAC CACrGCCATG OTCATCCTTT GT0GCTTC06 780 

GAAGCTGCAC TGCACACGCA ACTTCATCCA CATGAACCTG TTTGTGTOOT TCATGCT6AG 840 

GGOGATCTCC GTCTTCATCA AAGACTG6AT TCTGTATGCO QAGCAGGACA GCAACCACTG 900 

CTTCATCTCC ACTGTGGAAT GTAAGGCCX3T CATGGTTTTC TTCCACTACT GTGTTGTGTC 960 

CAACTACTTC TGGCTGTTCA TGGAGGGCCT GTACCTCTTC ACTCTGCTGG TGGAGACCTT 1020 

CTTCOCTGAA AGGAGATACT TCTACTGGTA CAOCATCATT GOCTGGGGGA CCCCAACT6T 1080 

GTGXGTOACA GTOTGOQCTA GGCTGAGACT CTACTTTGAT GACACAGGCT GCTGGGATAT 1140 

QAATGACAGC ACAGCTCTGT 6GTGGGTGAT CAAAGGCCCT 6TGGTTGGCT CTATCATGGT 1200 

TAACTTTGTO CTTTTTATTG 6CATTAT06T CATCCTTGTG CAOAAACTTC AGTCTCCASA 1260 

GATGGQAGOC AA7QAGTCCA GCATCTACTT G06ACIGGGC GGGTGCAOCC T6CTGCTCAT 1320 

GCCACTATTC GGAATCCACT ACACAGTATT TGCCTTCTCC OCAGAGAATQ TCAOCAAAAG 1380 

GGAAAGACTC QT6TTTQAGC TGGGGCT6GG CTCCTTCCAG GQCTTTGTGG TGGCTGTTCT 1440 

CTACTGTTTT CTQAATGGTG AGGTACAAGC GGAGATCAAO OGAAAATGGC GAAGCTGGAA 1500 

G6TGAAC06T TACTTCGCTG TGQACTTCAA GCACCGACAC COGTCTCTGG CCAGCAGTGG 1560 

GQTGAAT6GQ G6CACCC3U5C TCTCCATCCT GA6CAAGAGC AGCTCCCAAA TCGGCAXOTC 1620 
TGGCCTCCCT GCT6ACAATC TGGCCACCTO AGCCATGCIC GCCT 

Seq ID NO: 276 Protein sequence 
Protein Accession fti NP_001109.1 

1 11 21 31 41 . 51 

I I I I 1 I 

MAGWBVSLA AUGQACPWQR 6RLRKGRAAC KSAAQRKIGA DLPLLSVGGQ HCNPRSVHAG 60 
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WBV&LAALL LLPMAPAMHS DCIFKKBQAM CLEKIQRAKB LKGPMDSSPG CFGKNDNITC 120 

WKPAHVGEMV LVSCFELFRI PNFDQVHBTB TIGSSDFGDS NSLDLSDM6V VSRNCTEDGW 180 

SBPPPHYFDA OGFDEYESBT GDQDYYYI.SV KALYTVOYST SLVTLTTAMV ILCRFRKLKC 240 

TRNFIHMNLP VSFMLRAISV FIKDHILYAB QDSNHCPIST VECKAVKVPF HYCWSNYFW 300 

LPIBGLYLFT LbVETFFPBR RypywyTIIO HGTPTVCVTV MATLRLYFDD T6CHDMNDST 360 

ALWWVIKGPV VGSIHWFVL PIGIIVILVQ KLQSPDMGQ) ESSIYXiKLAR 8TLLLIPLF6 420 

IHYTVPAFSP ENVSKRERLV PBLGU5SFQG FWAVLYCFI* NGBVQABIKR KHRSWCVNRY 480 
PAVDFKHRHP SLASSGVNGG TQLSILSKSS SQIRMSGLPA DMLAT 

Seq ZD NO: 277 DNA eequence 

Hucleic Acid Accession #t MM_004000.1 

Ooding sequencei 36.. 1193 ~ 

1 11 21 31 41 51 

I 1 t I 1 i 

AOAAOAAGCT GGCXJ^GGAT ATGGGAGCAA CCACCATCGA CCAGAAOTCT CTCTGOGCAG 60 

GTQTAGTGGT CTTGCTGCTT CTCCAGGGAG GATCTGCCTA CAAACTGOTT TGCTACTTTA 120 

CCAACTGGTC OCAGGACOQG CAGGAACCAG 6AAAATTCAC CCCTGA6AAT ATTGACCCCT 180 

TCCTATOCTC TCATCTCATC TATTCATTGQ CCA6CAT0GA AAACAAGAAG GTTATCATCA 240 

AGGACAA6A0 TGAAOTQATG CTCTA0CA6A CCATCAACAG TCTCAAAACC AAGAATCKCA 300 

AACTGAAAAT TCTCTTGTCC ATTGGAGGGT ACCTGTTTGG TTCCAAAGGG TTCCACCCTA 360 

TGGTGGATTC TTCTACATCA OGCTTGGAAT TCATTAACTC CATAATCCTO TTTCTQAQQA 420 

ACCATAACTT TGATGGACTO GATGTAAGCT GGATCTACCC AGATCAGAAA GAAAACACTC 480 

ATTTCACTOT GCTGATTCAT GAOTTAOCAO AAGOCTTTCA GAAGGACTTC ACAAAATCCA 540 

CCAAGGAAAG GCITCTCTTG ACTGOBGGGO TATCTGCAGO GAGGCAAATG ATTGATAACA 600 

GCTATCAAGT TOAGAAACTQ GCAAAAQATC TGGATTTCAT CAACCTCCTG TCCTTTGACT 660 

TCCATGGQTC TTGGGAAAAG CCCXTTTATCA CTGGCCACAA CAGCCCTCTQ AGCAAGGGGT 720 

6GCAG6ACAO AOGGGCAAOC TCCXACTACA AT6T6GAATA TGCTGTGGGO TACTGGATAC 780 

ATAAGGGAAT GCCATCAOAQ AA6GTGGTCA TG6GCATCCC C31CATAT0GQ CACTGCTTCA 840 

CACTGGCCTC T6CAQAAACC AC081XSGG6G CCOCTGCCTC TGGCCCTGGA GCTGCTGGAC 900 

CCATCACAGA GTCTTCAGGC TTCCTGGCCT ATTATGACAT CTGCX»GTTC CTGAAAQGAG 960 

CCAAGATCAC G06CCTCCAG 6ATCAGCAGG TTCCCTAOGC AGTCAAGGGG AACCAGTGGG 1020 

TGG6CTATQA TGATGTGAAO AGTATGGAQA CCAAGGTTCA ' GTTCTTAAAG AATTTAAACC 1080 

TGG6AGGAGC CATGATCTGG TCTAT1GACA TGGATGACTT CACTGGCAAA TCCTGCAACC 1140 

AGGGCCCTTA CCCTCTTGTC CAAGCAOTCA AGAGAAGCCT TGGCTCCTTG TGAAGGATTA 1200 

ACTTACAGAG AAGCAGGCAA GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCCAOQAATT 1260 

CTCATQTG6G ATTCXXTCTTG CCAGGCTGGC CTTTGGATCT CTCTTCCAAQ CCnTCCTGA 1320 

CTTCCrCTTA GATCATAGAT TGGAC CTGGT rmnYi ' CC T6CAGCT6TT GACTTGTTGC 1380 
CCTGAAGTAC AATAAAAAAA ATTCATTTTQ CTCCAGTA 

Seg ID N0< 278 Protein sequence 
Protein Accession 9t NP_003991.1 

1 11 21 31 41 51 

I I 1 I I ) 

MDQKSLWAGV WLLLLQGGS AYKLVCYFTN WSQDRQEPGK FTPENIDPPL CSBIjIYSPAS 60 

lENNKVIIXD KSBVMLYQTI NSIiKTKNPKL KILLSIGGYL FGSKGFHPMV DSSTSRLEFI 120 

NSIILFLfiMH NFDGLDVSWI yPDQKEHTHF TVLIHELAEA FQKDFTK8TK ERLLLTAGV8 180 

AGRQMZONSY QVSRIiAKDU) FINLLSFDFH 6SWBKPLITG HNSPLSKGWQ DRGPSSYYNV 240 

EYAVGYHIHK GMPSBKWKG IPTY6HSFTL ASABTTVGAP ASGFOAAGPI TESSGFIiAYY 300 

EZOQPLKGAK ZTRLQDQOVP YAVKGNQWVG YSDVKSMETK VQPLKKUILG GAMZW8ZDMD 360 
DFT6KSQIQ6 PYPLVQAVKR SL6SL 

Seq ZD NOt 279 DNA Sequence 

Nucleic Acid Accession #t NM_015166.1 

Coding sequence i 116.. 1249 

1 11 21 31 41 51 

i I I I 1 I 

TGCTGGAAGT CCCTCACCCA GAGACCAGTG CTCCCAAGGG GAQAGCAQOS GGGGAGAXAA 60 

AGAACTGGTG ACACGTGGCT GTACATTCA6 CACAGCTGTG 6TGTCCCCAA GTGOCATQAC 120 

CCAGGAGCCA TTCAGAGAGG AGCTGGCCTA TGACCGQATG GCCACGCTGG AGGG6GOCC3G 180 

GC3UU3ACCCC GCCAGCTATG CXXXrAGACXSC GAAOCOaAGC GACCTGCAGC TGTCQAA6AG 240 

ACTGCCCCCC TGCTTCAGCC ACAAGAOGTQ GGT C TTCTCT OTGCTGATGG GGAGCTGCCT 300 

CCTG GTGACC TCX3GGGTTTT GGCZGTACCT GGGGAAOGTO TTCCCG6CT6 AGATGGATTA 360 

CTTG06CTGT 6CT6CAG6CT CXT6CAT0CC CT06GCAATT GTQA6CTTCA COGTCTOCAG 420 

GAGGAACX3CC AATGTQATTC CCAACTTTCA GATATTGTTT OTTTOCAOGT TTGCTGTGAC 480 

CACTACX5TQT TTAATTTG6T TTGGATGCAA ACTAGTCCTG AAC3CCATCAG CAATAAACAT 540 

CAACTTCAAC CTCATCCTQC TGCTCCTGCT GGAGCTGCTC ATGOOQOCCA CGGTGATCAT 600 

CGCTGCAOGQ TCCA6C3QAGG A6GACTGCAA GAAAAAGAAG G6CTCCATQT CTGACAGOGC 660 

CAACATTCTG QACGAASTGC CATT T OCT G C TGGGGTCCTG AAATCTTACT CAGTOGTGQA 720 

GGTAATOGGA OQC A TCTCTO C O QTOCTOGQ 0GQ6AT CATT GCCCTGAAGB TGGATGACTC 780 

AGTTTCAGGC OCACACCTCT CAGTGA(3GTT CTTTTGQATC CTAGTGGOCT GCTTTCCAAO 840 

TQCCATTGCX: AGTCATGTGG CAGCAQAGTG TCCCAGCAAO TGTCTGOTGO AGGTCCTGAT 900 

TGCCATAAGC AGCCTCAOGT CTCOGCTGCT GTTCACAaCC TCTOOATATC TGTCATTGAO 960 

CATCATGAGA ATGGTG6AGA TGTTTAAGGA TTA0C06CCA 6GGATAAAAC CATCCTACGA 1020 

TGTGCTGCTO CTGCTGCTGC TGCTAOTGCT CCTGCTGCAG GCOGOCCTCA ACACGGQCAC 1080 

CGCCATCCAG TG0GTG06CT TCAAGGTCAG TGCAA6GCTG CAGQQTOCAT CCTGGGACAC 1140 

CCAGAACGGC CaSCAGOAGC GCCTX3GCTG0 GGAGGTGGCC AGGAGCCCCC TGAAGGAGTT 1200 

CGACAAGGAG AAAGCCTGQA GAG00ST C X 3 T GGTGCAAATG 6CCCAGT6AC OOCCAGAOGC 1260 

GGAAACOQGO TGGCAGOOCC CAGCCIGGCC CCAAGCAIGO AAA06CACAA OOOCTAATOG 1320 

CCCTGAGCTA CT6CTTCTAA CACCTCTTTT OX Tl Xi ' mm AGGGCAAAOC AGGCTGCAGG 1380 

TGGGGTTTTC ACTTCCTAGG GTAGTTTAAT TTTAAAATAG GCGAATGTTG GCTAGTCTGT 1440 

GCCTCAGT6A GATCAGTCAG CTOCGAGTGG CTCCOGTGTC GTAACAGCAG GAGCATGGCC 1500 
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PCTAJS02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GCAACTTCCC 
CTOQCCCCAG 
CTGACCMSAC 
GCTTA6GGTC 
6(3T6CACCT0 
TTG60GOCTC 
GOQACACAGO 
TGCCCTGTCT 
OCCftGOTCCT 
OGGGCACTGT 
TCATGCCCAG 
(SGGCTGCATC 
GGGAAGCTGA 
GGTCAGAQCC 
TTGTCTTOAT 
GGGGTCTTAQ 
GGTOTGAOAA 
TCCTOQTCCC 
GGGCGACTGT 
TTCACRATGG 
CX3CCGTGGGG 
GTOGCCTGGG 
CGGCCTGCAG 
GAQTQACTCT 
CAACCATTGA 
CAGGCCCTCT 
CTTTGCCOVC 
CCGGIVTGCAO 
GCGGGGTCTG 
TCTTCTTGGG 

ATGGGOTTTA 
AAAOQCTGAC 



AGGCCGAGQA 
CTCCTCTTCX: 
TGTGGGTCCC 
TGTCC0 S G6T 
CCCCAGOCTC 
TGTCCTGQCC 
CTOCCGCTGC 
CCCTACTTAG 
GGGTGCTGTC 
CA.TGCCCAOC 
CTTGGAGTGC 
CAAGGCTTTG 

CAGGAATTCT 
CCTCACTGAQ 
GGTACAGCCC 
TAACACTGGC 
GAGGGCGCCT 
QAAACTGQAA 
TTTTGTTTCT 
TGTATTGACT 
GCCATCAAGA 
ACCCTGCCAG 
GQCTGCCAAG 
AGGGGGATGC 
CTGCAGTCCT 
AGCCCCAAAC 
ACCCCACCCT 
CTCCTGGGAA 
GGCAGACCAO 
CTTGCACTTT 
AAAQAACTGT 
TTGCAGACGT 



MSGGCCCCCG 
TGCCTCTCrC 
TGOQTCTGCr 
ACTCftGTCSUS 
AGGCTGOCCA 
CGGGAGCCGC 
TCTCCCAGGT 
ACCCAGTCAT 
ATGCCCAGCT 
TTGGAGTGCC 
CAGGTGGCCG 
TAAAOOGGCT 
CCTGTGTTTQ 
CACAGGTTCA 
AAGGCCCCAO 
AGGCQGTCAC 
TTTQGQTAGT 

TGCTGTTACT 
TTCCTTCTGA 
GGTCCGCCAT 
TCAAAGAACC 
GGGGTTTGAG 
ATACCTTCAT 
AGGCAAGACA 
CGGCAGCTCC 
AAGCGCCACC 
GTCTGCAGGC 
TGGGGTGG6A 
AGAGCTCAAG 
cttattttct 
CTCTTTCAGT 
G 



GCTOGOCCTC 
ATGGCTTGGG 
OOCCACTCTO 
CCCAGTGGGA 
GOGGCTCTTC 
AGGGGCCCCT 
GAAATCCACA 
TCTAGAGGGA 
TOGAGTGCCA 
ACAT6600QC 
CTGCTGTGAC 
GGACCAOGTC 
CTCCCATTAC 
CCCAQCGCCC 
GOCAaCQGTC 
TGCCCSVCCTG 
GCCATGGCCA 
AGCTQACGGC 
TTGATAATTA 
TCTGAGAAGA 
GGGCTGCTGG 
AGGAGGCCTG 
GACCCTCCCA 
GGTGTTCATG 
CCTTCCCAGC 
GGAAGGACAC 
GTGGGA6AGG 
CACCCCCACC 
GCCACAGGGA 
TTTCAGA6TC 
TATTTTAGAO 
CTACATTTTT 



TTQAGAGCCC 
CTGGAGTGGQ 
AC0GG6CTTC 
TCTTAOCCAC 
CTOGACAGTO 
CCTCCAGAGC 
CCAGTCCAOG 
TCCACCX3CCA 
CGTGGCCGCT 
TGCCCAC6TC 
AGGCAGTGTT 
TCCCTGGCCC 
TGAAAATGCA 
TCTACCTCCT 
TTCTCCATCT 
CCAGGCTGCA 
QQAGTGGQTT 
AGTAAATCCA 
CTTTCCAGCA 
CATGAACGTT 
AAAGGCCOGG 
GGAGATGCAO 
GGTTTCCCAC 
ACAAOTGGAA 
TGCTCCTAGA 
AGTCAGGGGC 
AGAGGCTGCT 
TCrcXGCAGC 
CQACCCGGGG 
AGAATTAGGC 
OQCTTAAAAA 
GTTTAATA08 



CACCCCTGAA 
CTCTCTQGAC 
CTCCCTCCAC 
TTCGCT6CAA 
A6A6CAGGGC 
CTGGGCGCAA 
CCGGGTCGCC 
CACTGGCCGO 
GCCCACGTCC 
GOC36GCACrQ 
CTTGGGGGTG 
CAGTGACOGG 
GQACAGATCA 
AGCAAGTACT 
GOQCTGTTTT 
GGGACAOTTG 
TCCCTGCOTC 
CAGTGAGTTG 
GGTGTTTTCC 
TTCTCTTCAC 
AGATGCATCT 
CTGGATGGGG 
TGOGGAACAG 
TCATTATTTT 
GGGGACAAGC 
CX3GGCAAACA 
GTCACTGGTA 
TTTGAGGCTG 
CGGGCTGATG 
ACTTGGAACO 
ATCGGGAAAA 
CTTGAGCAAT 



Seq ID VOi 260 Protein sequence 
Protein Accession #: HP 0SS9B1.1 



MTQBPFRGBL 
GUiVTSGFSL 
VTTTCLXNFG 
SAKILDEVPF 
PSAIASHVAA 
YDVLLIiLLLX^ 
EFDKEKAHRA 



11 

I 

AYDRMPTLGR 
YLGNVFPAEM 
CKLVUIFSAI 
PARVLKSYSV 
ECPSKCLVBV 
VLLLQAGIiNT 
VWQMAQ 



21 

I 

GRQDPASYAP 
DYLRCAAG8C 
NINFKLXLLL 
VEVIAOISAV 
LIAISSLTSP 
GTAIQCVRPK 



31 

1 

DAKPSDLQLS 
ZPSAXV8FTV 
LLEUiMAATV 
LGGZIALNVD 
LLFTASGYLS 
VSARLQGASW 



51 



Seq ID NO: 281 DNA eequence 

Nucleic Acid Accession #t NM_004518.1 

Coding sequence: 43.. 2577 



GCTGAGCCTG 
AA06G06GGG 
CXGGAOOOOG 
GCCCCCAAGC 
CCCCCCAAGC 
CGGCCGOGCQ 
CTCGTOCTGT 
TACATCCTQQ 
G CCGC AOGCT 
COGTTCTGTG 
TCCCAGGGCA 
GGQATGATCC 



6CACTCTG6T 
ACCTGGAACG 
CTQCCTGCAG 
AAGCACTTTO 
TA03CCACC31 
GTCAC06TGC 
CTCAAQAGTA 
CAGAAGGTCA 
AAGGQGTOCC 
GACAGCCOCA 
GCTTTCCXSCA 
GAGGACATTG 
GGCCTCAAAG 
TTCAAGQAGA 
CACCTGGACA 
OGGGGOCCAO 
GAG6ACC0CA 



11 

I 

AQCCCQACCC 
TATACCG060 
GOSOOOCOQA 
6O06CAGCAT 
GCAAOSCCTT 
GCTGGGOQTT 
CTGTGTTTTC 
AAATOGTGAC 
CCTGCTGCOG 
TQATTGACAT 
AOGTCTTTGC 
GCATGGACOQ 
AGGA6CTGQT 
TGTACTTGGC 
GGGGCCTGAT 
GCAGGCTCCT 
GCATCnGGQ 
A0AA6AGGGG 
ACCTCTOOGG 
CCATGTACAG 
AATCTGGACT 
GTTTGAAAOA 
06CAGG0CCA 
GCAAGQTOGC 
TCAA6GGTGC 
TGGATGACAA 
TCAGCATCAG 
GCCTGCX96CC 
TOCTGTCCGG 
CGATCAOGOA 
GCA7GATGGG 



21 

I 

GGGGCGCCTC 
CCCGAGCGGG 
CT0CACCGG9 
CCTCAGCAAA 
CTACC6CAAG 
CATCTACCAC 
CACXIATCAAQ 
TATOQIGOrq 
GTACG0T66C 
CATOGTGCTC 
CACATCTGOG 
GOQGGGA6GC 
CACTGOCTGG 
AGA6AAGGGG 
CA06CTGACC 
TGCGGCAACC 
GTCTGGGTTT 
GAACCOGGCA 
CACAGAGCT6 
ACTTATCCOC 
CGCTTTCAGG 
TOGTOTCTTC 
GACTGTGAGO 
CAAGAGCTGQ 
OOOGTCAOGG 
GAGCTGCCCC 
AGCOGTGTGT 
CTAOSAOQTG 
AATTAAQAGC 
CAAGGACCGC 
AO0GCT0GG6 



31 
I 

CCX5CCAGGCA 
GAGAAGAAGC 
GAGQGGGCSSC 
CCTGG06060 
CTGCAGAATT 
OCCTAOGTOT 
6AGTATGAOA 
TTTOOOQTGO 
TGGAGGGG6C 
ATOGCCTCCA 
CTCOGGAGCC 
ACCTGGAAGC 
TACATCX3GCT 
QAGAACGACC 
ACCATTGGCT 
TTCACCCTCA 
6CCCTGAAGG 
GCAGGOCTGA 
CACTCCAOQT 
COGCTQAACC 
AAGQACCCCC 
TCCAGOCCCC 
CX3STCACCCA 
AQCTTOGGGG 
CAGAACTCAG 
TGOQAGTTTG 
OTCAT006GT 
AtGGACOTCA 
CTGCAGTOCA 
ACCAAGGGOC 
AA0GTQGA6A 



41 

I 

CCATOGTGCA 
TGAAGGTGGG 
T6CI6AT06C 
GOQOOGCQGO 
TCCTCTACAA 
TCCTCCTGGT 
AGAGCTCX3GA 
ASTACTTOGT 
GGCTCAA6T7 
TTGOGGTGCT 
TGOGCTTCCT 
T6CTGGGCTC 
TOCmOTCX 
ACTTTBAGAC 
AOGGGGACAA 
TCGGTGTCTC 
TTCAGGA6CA 
T0CA0T06GC 
GQCAQTACTA 
AGCTGGAGCT 
OGCGGGAGCC 
GAGGCGTGGC 
GOGCOQAGCA 
ACOGCAGGCO 
AAGAAGCAAG 
TGACCX3AGGA 
TCCTGGTGTC 
T0QA6CAGTA 
GftOTOQACCA 
06GCX9GAGGC 
AGCAGGTCTT 



1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2560 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3160 
3240 
3300 
3360 
3420 



41 

I 1 
KIUiPPCFSHK TWVFSVLMGS 
SRSNANVIVN FQILFVSTPA 
IIAARSSEED CKKKXQSMSD 
DSVSGPHLSV TFPWILVACP 
PSIMRIVEMP KDYPPAIKPS 
DTQK6PQBRL AGEVARSPLK 



60 
120 
180 
240 
300 
360 



51 
I 

GAAGTCGCGC 

CTTOGTGGGG 

OGGCTGGGAG 

OGCOOGQAAG 

CGTGCTGGAG 

TTTCTCCTGC 

GGGGGCCCTC 

GOSQATCTGG 

TGCC0G6AAA 

GGC^GCOGGC 

GCAGATTCTG 

TGTGGTCTAT 

GATOCTGQCC 

CTA060GGAT 

GTACOOCCAO 

CTTCTTOGCG 

GCACAGGCAG 

CTGQAOATTC 

CGAGCXAAOG 

GCTQAG6AAC 

GTCTCX»AGC 

TGCCAAGGGG . 

GA6CXM0GA6 

6GCACX3CCAG 

CCTCCCOGGA 

CCTGACCCOG 

CAAGCGGAAG 

CTCAGCCXSGC 

6ATCGTGGGG 

GGAGCTGCXTC 

(ITCCATGQAS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 



1060 
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AAOAAGCTGG ACTTCCTGGT GRATATCTAC ATGCAOOGOA TGGGCATCCC CCC GACftG AG 1920 
ACOQAGGCCT ACTTTGGGOC CAAAGASOOG GAGCOGGOSC CGCOGTACCA CAOCCCGGAA X980 
GACAGCCGGG AGCATGTG6A CAGGC3V0GGC TGCATTQTCA AOATOrraCG CTCCAGCAGC 2040 
TCCAOJGGCC AGAAGAACTT CTCGGCGCCC C0GGC0G06C CCCCTGTCCA GTOT COGC CC 2100 
TCCACCTCCT Q0CA6CCACA GAGCXa^CCOG OGCCAGGGCC AOGGCACCTC CCCOGTGGGG 2160 
GACCA0G6CT CCCTGGTGOG CATCCOGCOG 0C60CTGCCC AOQAGCGGTC GCTGTCOGCC 2220 
TAQG G OGQGQ 6CAACXX30GC CAOCATGGAG TTCCT6CGGC AGGAGGACAC CCOGGGCTGC 2280 
AGGCCC C CCG AGGGOACCCT GOSGGACAGC GACA06TCCA TCTCCATXrC GTCCGTGGAC 2340 
CACGAGGAGC TGGAGCGTTC CTTCAGCGGC TTCAOCATCT CCCAG TCCA A GGAOAACCTG 2400 
GATGCTCTCA ACAOCTGCTA 0G066CCGTG GCGCCTTGTG CCAAAGTCAO 6CCCTACATT 2460 
GGGGAG6GAG AGTCAGACAC OaACTOCOAC CTCTGTAOCX: 06TG0GGGCC CCCGCCAOGC 2S20 
TOGGCCACOG GCX3A00GTCC CTTTOGTGAC GTGGGCTGGQ GCGG6CCCAO GAA GTGAGO C 2580 
GGCGCTGGGC CAGTGGACCX: GCC06C3GGCC CTCCTCAGCA CG6TGCCTCC QAGGTTTTGA 2640 
GGCGGGAACC CTCTGGGGCC CTTTTCTTAC AGTAACTGAG TGTGGCX3GGA AGG6TGGGCC 2700 
CTG6AGGGQC CCATGTGGGC TQAAGGATGG GGGCTCCTGO CaOTGACCTT TTAC A AAAG T 2760 
TATTTTOCAA CAGGGCACTC CCAGGCGCTG TOQCCATTOA QGT6CCTC0G CTGGGCTGTC 2820 
TCCTCACCCC TCOCTOTGCT GGAGCCTGTC CGAAAAAGGT OOCAACTGGG AOGCCTOGQA 2880 
AGCCACTGTC CAGGCTCCCA CTGCCTGTCT GCTCTGTTCC CAAAGGCAGC GTGTGTGGCC 2940 
TOGGGCCCTG CGGTGQCATG AAGCATCCCT TCTGGT G TGG GCATCOCTAC GTGTTTTOOT 3000 
GGCAQ CG - ri T CAOGGCGGTG CCCTTGCTGT CTOCCTTGGO CTGGCTOQAG CCTGGGGTCC 3060 
ATCTCCCTTT OC0OTCC08T CATG660CA0 OOAATGCATA GOGGGQCOCA CAGGCAGGGO 3120 
TATQAGTGCG TCCCACCCAA OGCAGCACCA GCCCC3GGCCA CCGCTGOCGG TGTCCCCAGT 3180 
TCCGTCTCAG CTACCTGGAC TCC3W3GACCC TOOAGAAGGG AGACCTGGCA GTGGAGGGAG 3240 
OCXOTGCTQT GTGTCCCCCT GCAGQTOTGA CCCCGCCTGC TCTTTCCTCC CCCGCCAGGT 3300 
GTGGCCCOGC CTGCTCTTTC CTCCCCCACC AQTATGGCCC CACCTGCTCT TTCXTTNCCCC 3360 
OCCAAGOTOT GOCCCCAOCT GT T CT TTOCT CCCCTGCOGA GOTOTOACCC CACCTGCTCT 3420 
TTCCTOCCTC CCAGTATGGC CCCACCTGCT CTTTCCTCCC OOGAGGTGAG 6CCCCGCCTG 3480 
CTCTTTCCTC CCATGGGAGC 06CTGAGG0G TGOQCACCTO GGCACAOGTT GGOOCTCTGC 3540 
AGOATGAGGA AGACAGGCCA ATCCCTTCCC TCCCAGAAGC TGGCCGCCCA GCAGGAGGGA 3600 
CTGAGGCCAG ACTCATGTCC AGCAAGGAAC GTGTG6TGTG TCCCCTGGGA AGTCTCTGGQ 3660 
CCCTGG6AAG AGGGAAGGTG CAO G TCCTGG GATGGTTGC6 GGGCCCTGTT TTGGGAGACA 3720 
AAGGGGTAGA GGGTCTGTCT TtSNGCCCCCC CAGACTCTAG CCCQAGCA6T GCAGCCACCT 3780 
ACTQCCCCAC CTCAGAGAAQ TGCA60GGGA AGOAGGCTGO AGGTGGTOCG GOQCTGCCTC 3840 
GGGTGTCTGC GTGAATGAGC 6TGGCCAAGG ACCAGTGCCA CCTCATGGCA AAGAGCTCCC 3900 
GCAGTGTTTO TTAGAGTGCA CATCCTACGT GCCCACTGGC ACACACAOGT GCTCACATAC 3960 
ANGTCNGMGT ACAGGNGTAC ACATGCAN6C TTGCACACAT GCACACAGAC CACATAGCAC 4020 
ACATGTGCAC lOACCACACC TGTATAOACC ATGCACAGTA CACATACGTG CATACACATO 4080 
CCTGCATACA GGCATACACA TGCACQCTTA CATGTACAOG TGCACAGATC ACACACATGC 4140 
ACACACGTQT AGCTCACACA CAQTATACAC ATACACAAGT GCACAQACCA CACACAGCAC 4200 
TAACACATGC ACACACAAAO TGCATAGGCC ACACAGCACA TGCACACAGG TOCACAGACC 4260 
ACACAGCACA CACAA6T6CA CAGAOCffCAC TGGACACATQ CACACACACA CGOG TG CATQ 4320 
CACACTCCTC GCACTTCCAO CCTTGGAOCC C'A"1' C 1GTCT C TGGTCTTTCT CTTTQACCCT 4360 
GCTGAGTQTA AGCTGCCTGG GGAGGGQCTA GAAOOAQTAA TTGTGQCTTT AGGGGTCGTQ 4440 
GTGATGCTGQ AATGTCAAQC GCCGTCOTGG GGTATCCGAC TGTCCGGGCT CCTGGTCCGC 4500 
AOTGOCAGAG OGCCAGGCAO AGCCAATCAO GGTCTOGTGC T6CCCTTC0C CCCCACAGCC 4560 
TGGCAGCCAT CCAQAGGAG8 GGCTCTACCA OATGCCAAOQ TGCC OOGGTO TCTQT ATGGG 4620 
TGTCCGGTTQ GGTCCTGTGT TTOGTCTGCC CT6GAGGTGG CTGGGCCCTC CTOQQ ATGG Q 4680 
TGGCTCAGCC TCGAATCCCA GGCCCCAOCC CAGGCAQGTO CTGCTQCCTG TTOTGGTTTC 4740 
CTGGCCCAGC TTCTCCTTCT CCCTCTGCAT AAAATCACAG TCCGTGAGTC TTCCA6CTGC 4800 
CACCACGGCT GGGACAOGCT GGGGGAGGGC TCCTCOCATG CCTCCTGCAC ACAGCOGTCT 4860 
GAGCAGGQCA GGTGCCAACA CCCCGCAGCO QAGACA06CT GOOCCTCAGC GATGOCOCTA 4920 
CCTTTTGGGG GCCCTCGTCI CAAGCCCCCC CTTGGAGGCT GAAATCACCC CAGGCACTGT 4980 
GAGGOCTTCr CCAGGGGGAC AOOCTTTGAG CTOTGGGTCT GATCACCCCA AGTCCCGCAC 5040 
ACGQAGGAGA GQCACAGCCA GGGO6TGTG0 TTTAATGTTT GCCCCTTCGG GGCTGQAGQT 5100 
CTCAOTGTTT CTA6ATTCCA GACOCTGCTG CCAOAOAGAC CTGCTGCC66 AGAGAAGGGG 5160 
AGGAGGACTC CAGCTGGGCT 0 (3 0 TCC0C CA CAGTCAGGGA GCCCCATAAA 6GACACCC0C 5220 
TTCTCTCTAG AAAGAGCTG6 GCTCTCA6CT ATTTCTAOTT GCTTCCCAGA AGCOGAGGAO S280 
CAOAAGGAGC TGTQAGAGCT TTGCAGAAAC GCCCTTGTOC COGCCCTCCT GAGCTATGAA 5340 
TQCOGTACAO AOCAGAGGCT GGGGCATTGG CAAGATCACA GGTTGATGCT GCA CAGC CCC 5400 
ATTGACACAA ACCCTCAAAO CA6A00TGAG AOGQAOGGTT CACAAAGCTT GGACCTGCOG 5460 
TGGAOQGTQC CCGGCAGACO TGGCQTGAQA GGQAGG6CTC ACOAQ QCTTO GA CCTGCIO T 5520 
GGAGGGTGCC CAGCAGACGT OGTOTaAOAO OAAOGGCTCA OOAGACTTGG ACCTGGTOGA SS80 
GGOTQCCCAG CAOACGTGGT GTGAGAGGGA OGGCTCACAG GGCTTGOACC GOAGAGAQAT 5640 
GGCTCATGAG ACTTGGACCT GCOGTGGAGO GTGOCCAGCA GACGTGGTAT GAGAGGQATG 5700 
GCTCAOGAGG CTTGGACCXQ GTGGAGGGTG CCOGGCAGAC GTOTGAGAGO GAOGOTTCAC 5760 
AAGOCTTGGA CCTGCCATGG AGGGTGCOCA GCA0AC8TGG TOTGAOAGGO AgOCTCAOS 5820 
AGGCTTGGAC CTGCOGTGGA GGGTGOCCAG CAGGGG6CT6 AGCTCXGAGO GGTGgSTGCT 5880 
CAGTGCAOGO GTGCCCCCAG TGTCCrCTGA TCCTGT006G TOOCTCCCCC AACCOCCACA 5940 
CCCATGCAGA ACTCCCAGGT CACATGCAOQ TATGTCCAGG GCATGGGGGT GGOGTGAAGA 6000 
GGCCTGOTCA GGGCCTTTAO GGGCTGCAGG AOGGAATQGC CACCTGGGGA G0CTOTGT6G 6060 
CTGTGCOGGG CAGCCATCCT GCATTCCCAC GCA6060QCA 6TCT0CA0CT OG600CCAGC 6120 
AAAGCGCTAA GCAOCCGGAO AGACAGCCAG GGC6GCTTCC TGAAGGATGT GGGATGGTGO 6180 
ACTCCGGGGT OGAGGGAATA OGCAGGTTCC TGTCCTCOOG 6AGACCTAGA QAAGCTGCAC 6240 
ACCCAGGAGC TTTCCATGAC CCGGGAGCAT GAOTGAATGG GGGOTTCCAQ TTTGCTGAAC 6300 
TTTGCTGTCT TGTAAGGGTG GGGGCTGACG GCGGACCCTG GQAGQAGGTQ ACACCGCAGG 6360 
GQGAGGTTGT GG6CAACGGT GGAGGAGOAG AOAOGGOAGG G GACCAr A'I'G (WATOGAGGG 6420 
GCCTCTTCAG AQTTTTAAAA GG0GTTT6TG GGGTGGAGTT GAGTGTGCTC TGGGCX TGQA 6480 
CACTTQCOGT GGTGCCCCTG GCTGGOCQAG GAGACTGGCT CTGGCCAGGO CCCCGTCCTG 6540 
AGAGOTCCTC AGCGTCTQAC TCTCGQCCAG GCGCCAGCAA GGAGGGGCGG OTCCCOGOGG 6600 
CTACCAGGCA GGCACGTGCA CATOGCCATC GCCACAOGOC AACTCOGCCT G GGTTTTA CA 6660 
AAGTGGTTGC CTEAATQCAT GTGGACAGQA ACTOCCTOAO OTOGCCOCAT GOOOOCTOGC 6720 
TOTGCCAOOT AOGGAGGCGC TGGACCCTGC GAACAOQTQQ QOGQGGCQAO OGGOOOiAOG 6780 
GAGGGGCTCC AGAOACAGQC GCAGGGCAOG AOGG GTCTCA CG6AGGGGTC TCQa^CTGAG 6840 
GCGOCCAOAO C TGGTGCTCC C6CTGGACQC CATCCCTCTO GC0Q6QATCC ACAGQGGCCA 6900 
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10 



C gTQ'roCXXXj CCKT6000GC GCCCCAOGCC ATT6CA0TCT TCCATCCTCT CGCOGTGAOG 6960 

Q TGQCTGCAG CTTCCOCATT TGCGCOQTTG CCTCTGGCTG TCTGCACTTT TGTTCATGCT 7020 

CCAAAGAACA TTTCATAATG CCTTCAGTAC CGAOSTACAC TTCTGAOCAT TTTOTATGTG 7080 

TCCTTGTGCC GTAGTGACCA GGCCTTTTTT TGOTGGATOT GTTACOCOGC ACACTT CAAT 7140 

CTCAACTTTG TGCA0CX3TCC ATTTTCTAGO GATAGA06CC CAGGGAATOA ACrCT AGTT T 7200 

TCTAACAGAT TAGCTGAGAT ATTAACTTAC TCACACGGAC AGGTTGATGC CAGAGCCGTA 7260 

A0AATG06CC A G T OO G O OTT T60C3GG6GAC TTOGGGTGTG GGGTCCTGOG GCOGOSATGG 7320 

OOGT OS AAGG TTCTQ6GGAT GOCTGCTGCC AGGGGGAOQA GTTGGGAaSC CAGGTGQACC 7380 
T6TQCACTCA 6TAAAA08CA QTGATTCAAA AAAAAAAAAA 

'seq ID NO I 282 Protein sewenca 
Protein Accession ff: NP_004509.1 

, ^ 1 11 21 31 41 51 

IS 1 I I I 1 I 

KVQKSRNOGV YPGPSGEKRL KVGPVOLDPO APDSTRDGAL MAGSEAPKR GSILSKPRAG 60 

GAGAGKFPKR NAFYRKLQNF UYNVLERFRG HAFIYBAYVF LLVFSCLVLS VPSTIKEVEK 120 

SSEGALYILB IVTIWFGVB YFVRIWAAGC CCRYRGWRGR LKFARKPFCV IDIMVLIASI 180 

AVLAAGSQON VPATSALRSL RPLQILRMIR MDRRGGTWKL LGSWYAHSK ELVTAWYIGF 240 

20 LCLILASPLV YLAEKGENDH FDTYADAIiWH GLITLTTIOY GDKyPQTHHG RliLAATPTLI 300 

GV6FPALPAG ZLGSQFAUCV QBQHRQKHFE KRBNPAAGLZ QSAWRFYATM LSRTDUiSTW 360 

QYYERTVTVP MYRLIPPUIQ IiELLRNLKSK SGLAFRKDPP PBPSPSQKVS LKDRVF8SPR 420 

GVAAKGKG8P QAQTVRRSPS ADQSLEDSPS KVPKSWSFGD RSRARQAFRI RSAASRQNSE 480 

EASLPGEDIV DDKSCPCBFV TEDLTPGLKV SIRAVCVMRF LVSKRKPKES LRPYDVMDVI 540 

25 EQY8AGHLDM LSRIK8LQSR VDQIVGRGPA ITDRDRTKGP AEAEXiPBDPS MMGRLGKVEK 600 

QVLSMEKXXiD FLVNZYKQRM OIPPTETEAY FGAREPEPAP PYHSPBDSRB HVDRHQCIVK 660 

XVR8SSSTGQ KNFSAPPAAP PVQCPPSTSH QPQSHPRQGH GTSFVGDHGS I.VRIPPPPAH 720 

ERSLSAYGOQ MRASMBFLRQ EDTP6CRPFE OTLRDSDTSI SIPSVSKE5L BRSPSQPSIS 780 

QSKENLDAUl 8CYAAVAPCA KVRFYIAE6B SDTDSDLCTP GGPPPRSAT6 EGPF6DVGWA 840 

30 GPRK 



35 



80 



8eq ZD NOi 2838 DNA sequence 
Nucleic Acid Accession #: AF152496. 
Coding sequence I 1..2391 



1 11 21 31 41 51 

t I I I I I 

ATGGAGGCX3G OAGGAOAGCQ ATTTCTTAGA CAAAGGCAAG TCTTGCTTCT CTTTQTTTTT 60 

CTGGGAGGGT CTCTGGCTGQ GTCCQAQTCA AGACGCTATT CTGTGGCTGA GGAAAAAGAG 120 

40 AA0G6CTTTT TAATAGCCAA CCTAGCAAA6 GATCTGGGAC TAAQGQTAGA GQAACTG6CC 180 

6GQAGGGG0G CCCAAOTTOT GTCCAAAOGO AACAAACAOC ATTTTCAGCP CSWCATCAG 240 

ACRGOTGATT TGCTOCTOAA TGAGAAATTO QACCXSGGAGG AGCTATGOOG CCOCACAGAA 300 

CCATGCATAC TACATTTTCA GATATTACTG CAAAACCCTT TGCAATTOGT TACAAAOGAG 360 

CTCOGTATCA TAGATGTAAA TQACX3\TTCT CCX^GTATTCT TTGAAAATGA AATGCA TCTG 420 

45 AAAATCCTAG AAAOGACTCT GCCAGGAACA GTAATTCCTT TGGGAAATQC TGAGGACTTQ 480 

GATGTOGQAA 0AAACA6CCT CCRAAACTAC ACTATCRCTC OQAATTCCCA CTTCCAOQTA 540 

CTCACTOGCA GTCGTAGGGA OGGAAGGAAQ TACCGOGAAC TAQTACTGGA TAAAGOGCTC 600 

GATCGGQAGG AGCAGCXX5GA ACTCAGCTTA ACGCTCACOO CGCTG6ACGG OGGCTCTCCC 660 

CCTCGGTCTG GOACAGCCCA GATAAACATC CAGGTCTTAO ATATAAACGA CAATGCACCA 720 

50 GAATTTGCAC AGCXSSCTCTA TGAGGTTGCA GTTCTA6AGA ATACOOCJOGT TAACTCTGTC 780 

ATTGTCACTG TCT06QCTTC T6ACTTA0AT ACAGGAAGTT TTOGGACAAT ATCATATGCA 840 

TTTTTTCATO CTTCT6AAOA AATTOGCAAA ACTTTTCAGC TAAATCCAAT TACTGGTGAT 900 

ATGCAACTGG TCAAATATTT GAATTTTGAA OOQATTAATA GTTATGAAGT OSAGA rOSAQ 960 

GCCAAQGATG GOGGAGGCCT ATCCGGAAAG TCTACAGTCA TAGTCCAGGT GGTTGATQTC 1020 

55 AACX3ACAACC CACOJGAACT GACCTTGTCT TCAOTAAACA GCCCTATTCC TGAGAACTCG 1080 

GGAOAGACTG TACTQGCTQ T TTTCAGTGTT TCTQATCTAO ACTCI66AGA CAACXSGAAGA 1140 

OTQATQTGTT CCATTQAGAA CAATCTOCXX: TTCTTOCTOA AACCATCTCT AGAOAATTTT 1200 

TACACCXTTAG TGTCAQAAGG OQOGCTGGAC AQAGAGACCA GATCCGAGTA CAACATTACC 1260 

^ _ ATCACTATCA CTGACCTGGG QACACCCAGQ CTGAAAACC31 AOTACAACAT AACOSTGCTG 1320 

60 GTCTCCGACX3 TCAATGACAA OQCCCXXXKX: TTCACCCAAA TCTCCTACAC CCTGTTOGTC 1380 

OGCSAOAACA AC3150COOGC CCTGCftCATC GGCAGT8TCA GCGCCACAaA GAOAQACTOl 1440 

6GC3VCCAA00 CCCABOTAAC CTACTOGCTG CTOCCGCCCC AGQACXJOGCA CCTGCCXCTC 1500 

TCTTCCCTGG TCTOCATCAA C6CGGACAAC GGCCACCTGT TTGCCCTCAO OTaSCTGGAC 1560 

TACGAGGCCC TQCAGGCGTT CGAGTTCOGC GTGGGOGCCA CAQACOGTGG CTCCCXX5GCT 1620 

65 TTGAGCA60G AGGOSCTGGT GOQCGTGCTG GTGCTGGACG CCAAOGACAA CTOGCCCTTC 1680 

GTGCTGTACC 06CT6C3U3AA OGOCTCGGOG CCCTGCACOG AGCIGGTGCC CGGGGC6GCT 1740 

GAGCaSGGCT ACCT6GTGAC CAAGQTGGTO GC0GT66AOG GC6ACTG6G0 CCA6AA0GCC 1800 

TGGCTOTCGT ACCAGCTGCT CAAG6CCAGG GAGCCOGGGC TOTTOGGOQT GTGGGOGCAC 1860 

„ AATGGOSAAG TGCGCACC6C CAGGCTGCTG AGCGAGC30CQ ACGCGGCCAA GCACAGGCTG 1920 

70 GTGGTCCTGO TCAAGGACAA TGGCGAGOCT CC3G0GCTCG0 CCAC06CCAC 6CTGCAT0T0 1980 

CTCCTGGTGG AOGGCTTCTC CCAGCCCTAC CTOCCTCTOC OGQAGQOGGC ACOQGCCXWJ 2040 

GCCCAGGCCG ACTTGCTCAC OOTCTACCTG 6TGGTGGCAT TGGCCTCGGT GTCTTC3GCTC 2100 

TTCCTCTTTT CGGTGCTCCT GTTCGTGGOG OTGOSGCrGT GCAGOAGGAG CA6GGCGGCC 2160 

TCGQTGGGTC GCTGCTOGGT GCCCGAGOGC CCCTTTCCAG GGCAGATGGT GGAOSTGAGC 2220 

75 OaCACCGGGA CCCTGTCCCA GAGCTACCA6 TACGAGGTGT GTCTGACTGG AGGCTCCGGO 2280 

ACAAATGAGT TCAAOTTCCT GAAGCCAATT ATGCCCAACT TOGTTGCTCA GGQTGCAGAO 2340 
AGGGTTAGCG AG6CAAATCC CA3TTTCA06 AA6AGCTTTG AATTCAGTTA A 



Seq ID N0» 284 Protein sequence 
Protein Accession «t AAD43757.1 

1 11 21 31 41 51 

11)111 

KBAGOERFLR QRQVLLLFVF LGGSLAGSES RRYSVAEBKE KOFLIANLAK DUJLRVBBLA 60 
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AKOAQWSKO NKQHFQLSHQ TGDLLI2IEKL DREBLC3GPTB PCILRFQILL QNPU}FVTNS 120 

LRIIDVNDHS FVFFENEMHL KILBSTLPGT VIPLQNAEDL DVGRNSIiQNy TXTFNSHFHV 180 

hTRSRSDGRK YPBLVLDIUUj DREBQPBLSL TLTAIiDGGSP PRSGTAQINI QVLDINDNAF 240 

BFAQPLYEVA VLENTPVNSV IVTVSASDLD TQSP3TISYA PPHASEBIRK TFQLNPITGD 300 

5 KQLVKYLNFB AZNSYBVDIB AKD66GLSQK STVIVQWDV NXU7PPBLTLS SVKSPIPQfS 360 

GETVIAVFSV SDLDS6DNGR VKCSIBKNLP PPLKPSVEKF YTLVSEGALD RETRSEYNIT 420 

ITITDLOm UCnCYNZTVL VSDVNI3NAPA PTQISYTLFV RENNSPALHI GSVSATDRDS 480 

GTNAQVTY8L LPPQDPHLPL SSLV8IMADN GHLPALRSLD YEALQAFEFR VGATDROSPA S40 

LSSEALVRVL VLDANDNSPF VLYPLQNGSA PCTELVPRAA EPGYLVTKW AVDGDSOQNA 600 

10 WLSYQLLKAT BPGLFGVWAH NOEVRTARLL SERDAAKHRL WLVKDNGEP PRSATATLHV 660 

LLVDQFSQpy LPLPBAAPAQ AQADLLTVYL WALASVSSL PLFSVLLPVA VRLCRR8RAA 730 

SVGRCSVPEO PPPGQMVOVS GTOTLSOSYQ YBVCLTGGSG TNBFKFLKPI IPNFVAQ6AE 780 
RVSEANPSPR KSPEP8 

IS Seq ID NOi 285 DMA aequence 

Nucleic Acid Accession «i NM_001794.2 
Cbding sequence i 15.. 2765 

1 11 21 31 41 51 

20 I. I i I II 

C6CGGGGGGG GAAGATGACC GGGGGC6C06 G0GT6CTCCT TCTGCTGCTC TCXKTCT0CX3 60 

GGQGGCTCOG GGCCCATAAT GAGGATCTTA CAACTAGAGA GACCTGCAAG GCT06GTTCT 120 

CTGAAOATGA TTACAOGOCA TTAATCTCCC AAAATATTCT AGAAGGGGAA AAQCTACTTC 160 

AAOTCAAQTT CAQCAGCTGT GTGGGGACCA AGGGGACACA ATATGAGACC AACAGCATGG 240 

25 ACTTCAAAGT TGGGGCAGAT GG6ACA6TCT TCGCCACGG6 GGAGCT6CAG GTCGCCTG06 300 

AOCAOGTGGC GTTCAOGGTG ACTGCATGOG ACAGCCSUSAC AGCAQAGAAA TGGGA0G0C30 360 

T0GT6066TT 6CTGGTGGCC CAGACCTCQT CCCCGCACTC T6GACACAAG COGCAGAAAG 420 

GAAAGAA60T CGTGGCTCTG GACCCCTCTC C6CCTCCGAA GGACACCCTG CT6CXX3TG6C 480 

CCCAGCACCA QAACGCCAAC GGGCTGAGGC GGCGCAAACG GGACTGGGTC ATCCCGCCCA 540 

30 TCAACX3TGCC OGAGAACTGG OGCGGGGCCT TCCCGCAGCA GCTC3ST6AGG ATCXX3GTC0Q 600 

ACAAAGACAA TGACATCCCC ATCOGGTACA GCATCACX3GG AGTGG6C6CC GACCAGCCCC 660 

CG^TOGAGGT CT7t3^CATT GACTCCATGT C06GCC66AT GTAOGTGACA AGGCCCATOG 720 

ACOGGGAGGA GCACQCCTCT TACCACCTCC GAGCCCAOGC TGTGGACATG AATGGCAACA 780 

AGGTGGAGAA CCCCATCGAC CTGTACATCT A0GTCATCX3A CATGAATGAC AACCGCCCTG 840 

35 AOrrCATCAA CCAGOTCTAC AAGGGCTC06 TGGACGAGGG CTCCAAOGCA GGCACCTA06 900 

TGA1GA003T CAGGGGCAAC GATGCTGAOG ACAGCACCAC 0GCCAACX3GG ATGGT6CGGT 960 

ACCGGATOQT GACCCAGACC CCACAGAOCC OGTCCCAOAA TATOTTCACC ATCAACA6CX3 1020 

AGACTGGAGA TATOGTCACA GTGG0SGCT6 GCCTGGACOG AGAGAAAGTT CA6CA6TACA 1080 

CAGTCATCX3T TCAGGCCACA GATATGGAAG GAAATCTCAA CTATGGCCTC TCAAACACAG 1140 

40 CCACAOCCAT CATCAOGGTO ACAGATGTQA ATQACAACCC GCCAGAATTT ACOGCCAGCA 1200 

OSTTTQCMiG GOAOGTCCXX GAAAACOGOS T0QA0AC03T 6GT0GCAAAC CTGAOGGTGA 1260 

TGGAOOSAGA TCAQCCOCAC TCTCCAAACT GGAATGCOGT TTACOSCATC ATCAGTGGGG 1320 

ATCCATCCGG GCACTTCAGC GTCOQCACAG ACCCCX3TAAC CAACGAGGGC ATG GTCA CCG 1380 

TGGTGAAGGC AGTCGACTAC GAGCTOIACA GAGCTTTCAT GCTQACAGTG ATGGTOTCCA 1440 

45 ACCAGGOGCC CCTGGCCAGC GGAATCCASA TGTCCTTCCA GTCCAGGGCA GGGGTGACCA 1500 

TCTCCATCAT G6ACATCAAC GA06CTCCCT ACTT C CCCTC AAACCACAAG CTGATC08CC 1560 

TGQAG6AGGG 0GT6CCCCCC GGCACOGTGC TGACCA06TT TTGA6CT0TQ QACCCTGACC 1620 

GGTTOVTGCA 6CAGGCTGTG AGATACTCAA AGCTGTCAGA CCCAGOGAGC TGGCTGCACA 1680 

^ TCAATGCCAC CAACX^GCCAG ATCAGCAOSO CGGCAGTOCT GGACOOTGAG TCCCTCTACA 1740 

50 CCAAAAACAA C6TCTA0GA0 GCCACCTTCC TGGCAGCTGA CAATGGGATA CCCCGGGCCA 1800 

GCXSGCACCJOQ QACXX7CCA0 ATCTATCTGA TTGACATCAA OSACAAOGCC CCTGAGCTGC 1860 

TGCCCAAGGA GG03GAGATC TGOGAGAAGC CCAACCTGAA OGCCATCAAC ATCA06G0GG 1920 

COGACGCTQA OOTOQACCCC AACATOGGCC OCTACGTCTT CGAGCTGCCC TTTGTCCCX30 1 980 

CGGCOGTGOG QAAGAACTaG ACCATCACCC GCCTGAACGO TGACTATGCC CAACTCAGCT 2040 

55 TGOGCATCCT GTACCTGGAG GCOGGGATGT AT6AOGTCCC CATCATCGTC ACAGACTCTG 2100 

GAAACCCTCC GCTGZGCAAC A08TCCATCA TCAAAGTCAA GOIGTGCCCA TOTQATGACA 2160 

A06GGGACTG CACCACCATT G6G6CAGTGG CAGO GG CTGG TCIGGGCACC OGTGCXZATOG 2220 

TGGCCATCCT CATCTGCATC CTCATCCTGC TGA0C3VTGGT CCTGCTGTTT GTCATGTGGA 2280 

TGAAGCGGCG AGAQAAGGAG OGCCACAOGA AGCAGCTGCT CATTGACCCC QAGOAOGACG 2340 

60 TCGGOSACAA CATCCTGAA6 TAT6AC6AGG AAGGC96GTGG OGAGGAGGAC CAGGACTA06 2400 

ACCTCAOOCA GCIGGAGCAO CCX3QAAGCCA TGGOOCAOOT G0CAA6CAAA GCCCCTGGOG 2460 

TGOGTGGCGT GGATGAGGGG COOGTGGGOG CTGAGCCCCA GTAOC06ATC AGGCGCATGG 2520 

TGCCXraVCCC AGGOQACATC GGTGACTTCA TCAATOAGGa ACTCOGCGCT GCTGACAACG 2580 

ACCCCACGGC ACCCCCCTAT GACTCCCTGC TX3QTCTTCGA CTACGAGGGO AGOGGCTCCA 2640 

65 COGCAGGCTC C0TCA6CTCC CTGAACTCAT CCAGTTCOGG GGAGCAAOAC TAOGATTACC 2700 

TCAAOQACTO GG6GCCCAGA TTCAAGAAGC TG606GACAT GTATGOAGGT GGTGAAGAGG 2760 

ATTGACTGAC CTGGGATCTT 06GAC06AAO TGA6AGC06T GCTCGGACGC GGGAOGAGCA 2820 

GGACTQAGCA QAGGCGGCCG GTCTTCXX3GA CTCCCTGOGG CTGTGTCCTT AGTGCTGTTA 2880 

GOAGGCCCCC CAATCCCCAC GTTGAGCTGT CTAGCATGAG CACCCACXXX CACAGCOCCC 2940 

70 TGCACXOGGC CXSClGCXrAO OVCCGOQCTO GCTGGCACTQ AAGGACAOCA AGA66CACTC 3000 

TCTCrrCACT T6AATTTCCT AOAACAGAAG CACTGTTTTT AAAAAAAAAA AAAAAAAAAG 3060 
AAO 

_ Seq ID NO: 286 Protein sequence 
75 Protein Accession «t NP_001785.2 

1 11 21 31 41 SI 

1 I I I 1 i 

MTAGAOVUiL LLSLSGALRA HNEDIiTTRET CKAGPSEDDY TALISQNILB GBKLUJVKPS 60 

oO SCVGTK6TQY BTNS^!DFKVG ADGTVFATRE LQVPSEQVAF TVTAWDSQTA BKHDAWRLL 120 

VAQTSSPHSG HXPQKGKKW AXiDFSPPPXD TLLPHPOHQH ANOLRRRRRD HVIPPINVPB 180 

M8RGPPPQQL VRIRSDXDND IPZRYSZTGV OADQPPMEVP SIDSMSGRMY VTRPMDRBER 240 

ASYKUUHAV DMMGNKVBNP IDLYIYVIDM NDMRPBPXNQ VYMGSVDEGS RPGTYVMTVT 300 

ANDADDSTTA NGNVRYRIVT QTPQSPSQNM FTINSET6DI VTVAAGLDRB KVQQYTVZVQ 360 
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15 



AT0MEGN12IY GLSNTATAJI TVTDVNDMPP EPTASTPAGB VPESKVBTW ANLTVMDRDQ 420 

PHSPNWNAVY RIISGDPSGH PSVRTDPVTN EGMVTWKAV DYBUIRAFML TVMVSNQAPL 480 

ASGIQMSPQS TAGVTISIMD INEAPYPPSM HKliIRLBBGV PPGTVLTTFS AVDPD RgKQQ 540 

AVRYSKLSDP ASWLHINATO OQITTAAVLD RBSLYTKNNV YBATFLAADN GIPPASGTGT 600 

M3IYLIDIND NAPELLPKEA QICKKPNUIA INITAAOADV DPMIGPYVPB LPPVPAAVRK 660 

NMTITRLNGD YAQLSLRILY IiEAGMYDVPI IVTOSGNPPL SNTSIIKVKV CPCDDMGDCT 720 

TIGAVAAAGL GTGAIVAILI CILILLTMVL LFVKWMKRRE KERHTKQLLI DPEDDVRDNI 7B0 

LKYDEBGGGB BDQDYDLSQL QQPEAMGHVP SKAPGVRRVD ERPVC3AEPQY PIRPMVPHPQ B40 

DIGDPINEGL RAADNDPTAP FYDSLLVFOY BSSGSTAOSV GSLN888SGD QDYDYLNDHG 900 
PRFKKLADMY GGGBED 

Seq ID NOt 287 DNA sequence 

NUclfllc Acid Accession ftt AF152495.1 

Ooding sequence* 1..2397 



1 11 21 31 41 SI 

111)11 

ATGGAGGCCG GAGAGGGGAA GGAGCGC3QTT CCGAAACAAA GGCAAQTCCT GATATTCTTT 60 

GTTTTGCTGG GCATAGCTCA GGCTAQTTaC CAGCCTAGGC ACTATTCAGT GGCOOAGGAA 120 

20 AOGGAGAOTG GCTCCTTTGT GGCCAATTTG TTAAAAGACC TGGGGCTGQA GATAGGAOAA 160 

CTTGCTQTGA GGGGGGCCAG GGTOGTTTCC AAA6GAAAAA AAATGCATTT GCAGTTOGAT 240 

AGGCAGACOO GGOATTT G TT GTT7VAATQAG AAATTGGACC GGGAGGAGCT GTGCGGCCCC 300 

ACAGAGCCCT GTGTCCTACC TTTCCAGCjrO TTACTAOAAA ATCCCTTQCA GTTTTTTCAG 360 

GOQOAGCTAC GOATTAGOaA OGTAAATGAT CATTCCTCAG TTTTCCTAGA CTIAAGAAATA 420 

25 CTTTTGAAAA TTCX»GAAAG TATCACTCCT GGAACTACTT TCTTAATAGA ACGTGC CCAO 480 

GACTTGGATG TAGQAACCAA CAOTCTCCAA AATTACACAA TCAGTCCCAA TTTCCACTTT 540 

CATCTTAATT TACAAGACAO TCTCGATGGC ATAATATTAC CACAGCTGGT GCTGAACAQA 600 

GCCCTGGATC GCGAGGAGCA OCCTQAQATC AOGTTAACCC TCACAGOGCT AGATGGCGGG 660 

AGTCCACCCA GGTCCGGCAC GGCCCTGGTA CGGATTGAAG TTGTGQACAT CAATOACAAC 720 

30 GTCCCAQAGT TTGCAAAGCT GCTCTATGAQ GTGCAGATCC CGGAGGACAQ CCCCX5TTGGA 780 

TOCCAGOTTG CCATCGTCTC TG0CA6GGAT TTAGACATTG GAACTAATGG AQAAATATCT 840 

TATGCATTTT CCCAAOCATC TGAAGAOITT OQCAAAAOGT TTCGATTAAG TGCAAAATCG 900 

GGAQAACTGC TTTTAAGACA QAAACTGGAT TTCGAATCCA TCCAGACATA CACAGTAAAT 960 

ATTCAGGCGA CAGATGGTGG GGGCCTATCT GGAACTTGTG TGOTATTTGT CCAAGTGATG 1020 

35 GATTTGAATG ACAATCCTCC OGAACTAACT ATGTOGACAC TTATCAATCA GATCCXAGAA 1080 

AACTTGCAOG ACACCCTCAT TGCTGTATTC AGCQTTTCAG ATCCTOACTC OGGAQACAAC 1140 

GGAA6GATG0 TCTGCTCC A T CCaAGATOAT O T CVriTT T TCTTQAAACC TTCTGTTGAO 1200 

AACTTTTACA CTCTGGTGAT AAGCACGGCC CTGQACCGGG AGACCAQATC OGAATACAAC 1260 

ATCACCATCA CXX?rCACCGA CTTCXSGGACA CCCAGGCTOA AAACOGAGCA CAACATAACC 1320 

40 OTGCTGQTCT CCGAOGTCAA TGACAAOGCC CCCGCCTTCA CCCAAACCTC CTACACCCTG 1380 

TTOQTGCGOG AGAACAACAG CCC08C0CTO CACATOGGGA G08TCAG06C CACAGACAGA 1440 

OACTOQGGGA CCAAOGCCCA GQTCACCTAC T O GCTQCTGC CGOCOCAGQA CCOGCACCTG 1500 

CXXXnXXJCCT CCCTGGTCTC CATCAACGCG GACAAOGGCC ACCTGTTCGC TCTCCAQTCXl 1560 

CTGQACTACG AGGCCCTQCA GGOGTTOQAG TTCCGCXJTGO GCGCCGCAGA CCGCGGCTCC 1620 

45 CCXSGOGTTGA GCAOOGAGGC GCTQGTOCGC GTGCTGGTGC TGGACGCCAA CGACA ACTO G 16B0 

CCCTTOGTGC TGTACCOGCT GCAQAAG66C TO00O6CCCT QCAGCX3AGCT GGT6CC006G 1740 

GOGGCCQAGC OOGGCTACCT GGTQACCAAO OTGGTGGOGO TGGAG66C6A CTOGGGGCAG 1800 

AACGCCTGGC TGTC6TACCA GCTGCTCAAG GCCACGGAGC COGGGCTQTT CGOCGTGTGG 1860 

GCGCACAATG GCGAGGTGOG CACCGCCAGQ CTGCTGAGGG AGCXSCXSAOSC TGCCAAGCAG 1920 

50 AOGCTGGTG G T6CTG6TCAA GGACAATGGC GAGCCTCCGC GCTOGGCCAC 06CCACGCTG 1980 

CAC3GT6CTCC TGGTG Q AOGG CTTCTCCCAG CGCTACCTGC TGCTCC0G6A GGOGGCACGG 2040 

0CCCA6GCCC AGGCOGACTT GCTCACCGTC TACCTGGTG6 TGGOQTT GG C CT0G6TGTCT 2100 

TOGCTCTTCC TCTTCTOGGT GCTCCTGTTC GTGOOGQTGC GGCTOTQC3U3 GAGGAGCAGG 2160 

_ _ GOGGCCrCGG TGOOTCGCTG CTCGQTGCCC GAGGGCCXX:r TTCCAGGGCA GATGGTGGAC 2220 

55 GTGAGCGGCA CCGGGACCCT 6TC0CAGAGC TACCAGTA09 AGGTGT6TCX GACTGGAGGC 2280 

TCGGGGACAA ATGAGITCAA GTTOCTGAAO CCAATTATGC O CAAC TTOGT TGCTCAGOGT 2340 
GGA0AGA6GG TTAGOQAGGC AAATCOCAGT TTCAGGAAGA 6CTTTQAATT CACTTAA 

Seq ID NOx 288 Protein sequence 
60 Protein Accession tti AAD43756.1 

1 11 21 31 41 51 

I I 1 I I I 

^ MEAGEGKERV PKQRQVLIPP VLLGIAQASC QPRHYSVAEB TESGSFVANL LKDLGLEIGE 60 

65 tiAVRGARWS KGKKMHLQFD RQTGDLLIiNE KLDREELGGP TBPCVLPFQV LLEMPI^PFQ 120 

AEUIIROVND HSFVFLDXEI LUCIPESITP GTTPIiIERAQ DbDVOTNSLQ NYTISPNFHF 180 

HUlLQDSIiDG IILPQLVUIR AU3REEQPEI RLTLTALDGQ 8PFRSGTALV RIEVVDINDN 240 

VPEFAKLLYE VQIPEDSPVG SQVAIVSARD tiDIGTNGEIS YAF8QASEDI RKTFRLSAKS 300 

GKUiLRQKU) PESIQTYTVN IQATDGGGIiS GTCWPVQVM DLNDNPPELT MSTLINQIPB 360 

70 MLQDTLIAVF SV8DPDSGDN GRMVCSIQDD LPPFXjKPSVB NFYTLVI8TA LDRETR8BYN 420 

ITITVTDFGT PRI.KTBBHIT VLVSDVNDNA PAFTQT8YTL FVREamSPAL BIQSVSATDR 460 

DSGTHAQVTY SLLPPQDPHL PLASLVSIKA DNOaLFAI^S UJYBALQAFE PRVGAADRGS 540 

PALSSEALVR VLVLDANDNS PFVLYPLQNG SAPCTELVPR AABPGYLVTK WAVDGDSGQ 600 

NAWLSYQLLK ATEPGLP6VW AHNGEVRTAR LLRERDAAKQ RLWLVKDNG EPPRSATATL 660 

75 HVUiVDGPSQ PYUiPEAAP AQAQADLLTV YLWALASVe SLPLPSVLLF VAVRLCRRSR 720 

AASVGRCSVP EGPFPGQMVD V8GTGTLSQS YQYEVCXTGO SOTNBFKFUC PIIPNFVAQO 780 
ABRVSBANPS PRKSFEFT 

Seq ID NO: 289 DNA sequence 
80 Nucleic Acid Accession «: KM_018674.l 
Ooding sequencet 390.. 2009 

1 11 21 31 41 51 

I t 1 I I I 
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OGQAGOVCAT OCTGAGOGC3A GCGGCTGGGG CTGCGCGGCG TGCGSGAOOV GGGCTOQCTC 
CCT06CTGAC T06CTCX3CTC 6CAGGGACAC AOSCAGGGOC TGACAGCTGT GCTGGT6CTG 
ATAAGGGAAO CCACAAGGAG A06AT0GAG6 AGAQA6ACAA GCGGCAGCAG AGGCAGCAGC 
C36CAQAGOCA QC3UXAGGQC TGQKAOCTQ CTGGQAOTGa GAOTGACrCC 0CCACCT0G6 
GCOCCCACCC TGTCCCT G TC CTCTTCCCGC T TGCCCTGftG TTTAGAAQAG CAGCOGCTGC 
CACCACTGCC ACTCGGGA6G GCACCAGGGC TGCTGGCTAG 6GAGGGACAG GGCAGGGAGG 
CTCTGGCCAG TCCCAGCAGC CGGGGACAGA TGCCGATCGA GATTGTQTQC AAAATCAAAT 
TTG CTQAGG A GGATGOSAAA CCC AAGGAG A A0GAG6CAGG GGATGAGCAG AGCCTCCTGQ 
QGQCTQTTGC GCCTGGAGCA GOCO0CO6AG A0CTG6CCAC CTTTGCCA6C ACCAGCAGCC 
TGCATG6ACT GOGCCGOGCC TGTGGCCCAG GCCCCCACGG ACTGOGCAGA ACCCTGTGGG 
CACTGGCCCT ACTCACCTOG CTGGCTGCCT TCCTGTACCA GGOGGCTGGC CTGGCCCGGG 
6CTACCTGAC CCGOCCTCAC CTGGTGGCAA TGGACCCCX5C TGCCCCAGCC CCAGTGGCXX3 
GCTTCCOQGC TGTCACCCTC TGGAATATCA AC06CTTC0G GCATTGGGCA CTCAGOGATO 
CCQACATCTT CCAlCCTQGCC AATCTGACAG 0GCT6C000C CAAAQACOQO GATQQOCACC 
GTG0GGCTG6 CCTG06GTAC CCAGAGCCTG ACATGGTAGA CATCCTCAAC 06CACTGGCC 
ACCAGCTOGC OGACATGCTT AAGAGCTGCA ACTTCAGTGG GCATCACTGC TOCGCCAGCA 
ACTTCTCTGT GGTCTATACT CQCTATGGGA AGTGTTACAC CTTCAAOGOG GACCCGOGGA 
0CT06CTGGC CAGCCGGGCA GGGGGCATGG GCAGIGGCXTT GGAGATCATO CrGGACATGC 
AGCAGGAOGA GTACCTGCCC ATCTGGAGGG AOACAAATGA QAO S T O BTTT GAOGCAOQTA 
TTCX»X3T0CA GATCCACAGC CAGGAGGAGC 06CCCTACAT CCAGCA6CTG GGGTTC3GGGG 
TOTCCCCAOQ CTTCCAGACC TTTGTOTCCT GCCAGGAACA GOGOCTGACC TACCTQCCCC 
AGCCCTGGGG CAACTGCCGC GCAGAGAGTG AGCTCAGGGA GCCTQAGCTT CAGGGCTACT 
CG6CCTACAG TGTGTCTGCC TGCCGGCTGC GCTGTGAAAA GQAOGCOTTO CTTGAGCGCT 
6CCACT6CGG GATG6TGCAC AT6CCAGGCA ATGAOACCAT CTBOCC AO CA A ATATC TACA 
TOGAGTGTGC AGACCACACA CTGGACTCCC TGGOTGGGGG CCCT6A66GC CGGTGCTTCT 

gccccacccc ctgcaacctg acacx3ctatg ggaaagagat ctccatggtc aggatcccca 
acaggggctc agcccggtac ctggcx3agga agtacaaccg caacxsagacx! tacataoogg 
aqaw:ttcct ggtcctaoat gtcttctttg aggccctgac ctctqaagcc atggagcagc 
gagcagccta tggcctgtca gccctgctgg gagacctcgq ggqacaqato ggcctgttca 
ttggggccaq catcctcacg ttgctggaga tcctcqacta catctatgao ototcctogg 
atcx3actgaa gogggtatgg aggcx3tccca agacccccct gcggacctcc actgggggca 
tctccacttt ggggcttcag gagctqaagq aacaqaqtcc ctgcccx3a0c ctgggccxjao 
cggagggtgg gggggtcagc a6tctgctcc ccaatcacxza ccacccccac ggtcccccag 
gaggtctctt tgaagatttt gcttgctagg acggtgctqt qactgaaagg acccaggaot 
ctgggacccc tcctggoatc cccagcacat tctcxttgctc ctgggagagg cctgggggog 
gtgctcacto gqaoggccag gactcagttc crgctctcat cctcccctgc cctgatqtca 

GCTGCTTTGC ACAAAGQTCC TTCTT6TCCA CACCCCTTAT CCCCAOGCTG GTGCCCCGGG 
AGGGCTGGAO ACCAGGCCAT GGGCCCTCAC GGAQAGGAAG GGAAGGAAGG AGAGGGAGGG 
GGAGGATAGA GCCCATCCCA GCOGGGGAGG GGaAG(XCTC TOTACATTTO TAAATATTTA 
GGGA^AGCCXS GGTGGGGGGA GGGGATACAG ATGTAGAAG6 TGGGTAGGGC TACAGGGGTG 
GGTGATTTAG GGACAGCXAG GGTCCCAGOC CCAATGTCAG CAGGATAGGG AQAGCCCCA6 
GACTCAGGAG TGCTOGGCTG GTCCTACTTC CTGCCCCTCT CCAGGCCCAG CTOCCCTCTT 
GGCAGGGGGA GAGGATGGCC CAGCAGGCCT GGCCCAGCTC CCAGTTCXXX: CTGCACCAGC 
COCACCCCTA QAGTCCCTTC TATAGGGAGO GGGCAGGAGA CCTTCCAQAC TTOQGCTGAQ 
CTTGGAG6GT GGGAAGGGAQ CCTTCTCAGT OCTCTCTCCC TCCAOTCTGA TTTTATAAAG 
TGCTGACCSAO 



8eq ID KOs 390 Protein sequence 
Pxotein Accession #t HP 061144.1 



MPXBIVCKIX 
NRFHHSAIiSD 



ETNETSPEAG 
ELRBPELQGY 
LGGGPE5GPCF 
SALTSEAMBQ 
KTPIATSTGG 



11 
I 

FAEEDAXPKE 
ALALLTSLAA 
ADIFHLANLT 
HFSWYIRYO 
IRVQXHSQEB 
SAYSVSACaiL 
CPTPCNLTRY 
RAAYGLSALL 
XSTLGLQBIiR 



21 

i 

KEA(a>BQSIili 
FLYQAAGIAR 
GLPPKDSDGH 

PPYIHQLGFO 
RCEKEAVLQR 
GKEISMVRIP 
GDLGGQKGZiF 
EQSPCPSLGR 



31 
I 

GAVAPGAAPR 
GYLTRPHLVA 
RAAGLRYPEP 
SSLPSRAGGK 
VSPOPQTPVS 
CBCRMVBMPG 
NRGSARYIiAR 
XGA9ILTLI.B 
ABGG6V68LL 



41 

I 

DXiATFASTST 
MDPAAPAPVA 
DMVDZLNRTG 
GSGLBXKU)! 
OQBQRLTYLP 
NBTICPPMIY 
KYNRNETYIR 
XU3YIYEVSH 



Seq ID NO; 291 DMA sequence 

Nucleic Acid Accession #x Bos sequence 

Coding sequence X 62.. 895 



CACTGCTCTG 
TAT6ATCATC 
CAA6GATGGA 
AGAAGCSIOGG 
TGAAGGCGGC 
TGTCTOTGCT 
GCCCAACTGT 
TGAAAGATGG 
AGATCCAAAO 
CTGCTACTGG 
TGACCTTGAA 
TGATGTCCAT 
TACAOQAAAT 
CCAAATCAAA 
TACTACTTCT 
AAAAAAAGOA 



11 
I 

AGAATTTGTO 
TTAATTTACT 
ATTTTTCATA 
TCTGGCAAAT 
CATCTOGCAA 
GCTGGATGGA 
GGATTTGGAA 
GATGCCTATT 
CAAATTTTTA 
CACATTAGAC 
GATGACCX^IG 
GGCTTTGTGG 
QTCKIGAOCT 
TAT G TTQCAA 
ACTGGAAATA 
TQATCAAAAC 



21 
I 

AGCAGCCCCT 
TATTTCTCIT 
ACTGCATAIG 
AGAAGCTCAC 
CTTACAAGCA 
TGGCTAAGGG 
AAACTGGCAT 
GCTACAACCC 
AATCTCCAGO 
TCAAGTATGG 
OTTGCTTGGC 
GAAGATACTQ 
TQAAQTTTCT 
TGGATCCIOT 
AAAACTTTTT 
ACACAGTGTT 



31 

I 

AAGAGGCTGT 
GCTATGGGAA 
GCTTQAACSaA 
CTAOSCAGAA 
GCTAQAGGCA 
CAGAOTTGGA 
TATTQATTAT 
ACAOOCAAAG 
CITCCCAAAT 
TCAGCGTATT 
TGATTATGTT 
TGGAGATGAG 
AAGTGATGCT 
ATCCAAATCC 
AGCTGGAAGA 
TATGTT6GAA 



41 

I 

TACTTCACTA 
GACACTCAAQ 
GCAOCOGGTO 
GCTAAGGOGG 
GCCAGAAAAA 
TACCCCATPG 
GGAATCCGTC 
GAGTOTGGTG 
GAGTAOSAAG 
CACCTQAGTT 
GAAATATATG 
CTTCCAQATG 
TCA6TGACA6 
AOTCAAGGAA 
TTTAGCCACT 
TCTTTTQQAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



51 
I 

LHGIiGRACGF 
6PPAVTLCNI 
HQLADMLKSC 
QQEEYLPINR 
QPWGNCRAES 
IBCADBTLDS 
ENFLVLDVFP 
DRIiKRVWRRP 
GGUnSDFAC 



60 
120 
180 
240 
300 
360 
420 
480 



51 
I 

CAACT6ACGA 
GATGGGGATT 
TGTACGACAG 
TGTGTQAATT 
TTGGATTTCA 
TGAAGCGAGG 
TCAATAGGAG 
GCGTCTTTA^ 
ATAACCAAAT 
TTTTAGATTT 
ACAGTTAOGA 
ACATCATCAG 
CTGGAGGTTT 
AAAATACAAG 
TATAAAAAAA 
CTCCTTTOAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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CTCACIGTTA TTATTAACHT TTATTTATTA TTTTTCTAAA TGTOAAAGO^ ATACATAATT 1020 
TAGG6AAAAT TG6AAAATAT AGGAAACTTT AAA0GA6AAA ATOAAACCTC TC ATAATC CC 1080 
ACrcCATAGA AATAACAA5C GTTAACATTT TCATATPTTT riVlTl' CAOT CATTTTTCTA 1140 
TTTGTGGTAT ATGTATATAT GTACCTATAT GTATTTGCAT TTGAAATTTT GQAATCC TGC 1200 
5 TCTATGTACA GTTTTGTATT ATACTTTTTA AATCTTOAAC TTTATAAACA TTTTCTGAAA 1260 
TCATTQATTA TTCTACAAAA ACA3QATTTT AAACA6CR3T AAAATATTCT ATBAT ATQAA 1320 
TGTTTTATGC ATTATTTAAG OCTGTCTCTA TTGTTGQAAT TTCAOQTCAT TTTCATAAAT 1380 
ATtOTTGCAA TAAATATCCT TQAACACACA AAAAAAAAAA AA 

10 Seq ID NOi 292 Protein sequence 

Protein Accession it Eos sequence 

1 11 21 31 41 51 

1 I I I I I 

O MIILIYIiFLL LWBDTQGWGF KDOIFUNSIW LBRAAGVYHR EARSGICTKLT YAEAXAVCBF 60 

EGGBLATYKQ LBAARKIQFH VCAAQWMAKG RVGYPIVKPG PNOGPGXTGX IDYGXRUIRS 120 

BRWDAYCYNP HAKECG6VFT DPKQIFKSPO PPNEYBDNQX CYNHXRIiKyG QRIRLSFLDF 180 

DIiBDDPOCLA DYVBIYDSYD DVBQFVaRYC GDBLPDDII8 TOIVMTLKFL 6DASVTA60F 240 
QIKYVAKOPV 8K88QGKNT8 TTSTGNKNFL ASRFSHL 

Seq ID HOi 293 DMA sequence 

Nucleic Acid Accession S« NM_007llS.l 

Coding eequence; 69.. 902 

25 1 11 21 31 41 51 

I I I I I I 

GAATTC3GCAC TOCTCTOAQA ATTTOTaAaC AGCCCCTAAC AGGCTGTTAC TTCACTACAA 60 

CTGACX^TAT GATCATCTTA ATTTACTTAT TTCTCTTGCT ATGGOAAQAC ACTCAAGGAT 120 

GGGGATTCAA GGATGGAATT TTTCATAACT CCATATGGCT TGAACGAGCA GCCGGTGTGT 180 

30 ACCACAGAOA AGCACGGTCT GGCAAATACA AGCTCACCTA CXSCAGAAGCT AAGGCGQTGT 240 

GTGAATTTGA A660GGCCAT CTGGCAACTT ACAAOCAGCT AGAGGCAGCC AOAAAAATTG 300 

6ATTTCATGT CTGTGCTGCT 6GATGGATG6 CTAA06GCAG AGTTGQATAC CCCATTGTGA 360 

AGCX3VGGGCC CAACTGATGA TTTGGAAAAA CTGGCATTAT TGATTATQOA ATCCGTCTCA 420 

ATAGOAQTGA AAGATGGGAT GCCTATTQCT ACAACCCACA CGCAAAGGAG TGTGGTGGCG 480 

35 TCTTTACAOA TCCAAAGCGA ATTTTTAAAT CTCCAGGCTT CCCAAATGAS TACGA AGATA 540 

ACCAAATCIO CTACTGQCAC ATTAGACTCA AQTATGQTCA GOGTATTCAC CTGAGTTTTT 600 

TAQATTTTGA CCTT6AA6AT GACCXSUSGTT 6CTTG6CTGA TTATQTTOAA ATATATQACA 660 

GTTAOSATGA TGTCCATGGC TTTGTGGGAA GATACTGTGG AGATGAGCTT CCAGATGACA 720 

TCATCAQTAC AGGAAATGTC ATGACCTTGA AGTTTCTAAG TGAT6CTTCA GTGACAGCTG 780 

40 GAGGTTTCCA AATCAAATAT OTT6CAATGG ATCCTGTATC CAAATCCAGT CAAGGAAAAA 840 

ATACAAOTAC TACTTCTAGT GQAAATAAAA ACTTTTTAGC TOGRAQ ATTT AGCCACTTAT 900 

AAAAAAAAAA AAGGATGATC AAAACACAC3i GTGTTTATOT TGQAATCTTT TGGAACTCCT 960 

TTGATCTCAC TOTTATTATT AACATTTATT TATTATTTTT CTAAATGTGA AAGAAATACA 1020 

TAATTTAGGG AAAATTGGAA AATATAGGAA ACTTTAAACG AGAAAATGAA ACOTCTCATA 1080 

45 ATCX:CACTGC ATAGAAATAA CAAGOGTTAA C3VTTTTCATA TTTTTTTCrT T CAOTC ATTT 1140 

TTGTATTTGT GOTATATGTA TATATGTAOC TATAT6TATT TOCA TTTOA A ATTTT GGAAT 1200 
CCTQCTCTAT GTACAGTTTT GTATTATACT TTTTAAATCT TGAACTTTAT OAACATTTTC 1260 
TOAAATCATT GATTATTCTA CAAAAACATG ATTTTAAACA GCTGTAAAAT ATT CTATGA T 1320 
ATGAATGTTT TATGCATTAT TTAAGCCTGT CTCTATTGTT GGAATTTCAa GTCATTTTCA 1380 

50 TAAATATTGT TGCAATAAAT ATCCTTCGGA ATTC 

Seq ID NO: 294 Protein sequence 
Protein Accession «t NPJIO 9046.1 

55 1 11 21 31 41 51 

I I I I I 1 

MIILXYIiPLI* LWEDTQGWOP KDOIFHNSIW LBRAAGVYHR BARSQKYKLT YAEAKAVCBP 60 
BGGHLATYKQ LBAARKIGFH VCAAGWMAKG RVGYPXVKPG FNXXFGKTGI IDYGIRLNRS 120 
- ERWDAYCYNP HAKECX30VFT DPKRXFKSPG PPNEYBDNQX CYWHIRLKYG QRIHLSFLDP 180 

DU DIiEDDPGCXiA DYVBXYDSYD DVHGFVGRYC GDBliPDDXXS TGN\7NTIiKFL SDASVTAGGP 240 
QIXYVAMDFV SKSSQaOiTS TT8TGNKKFL AGRFSHL 



^ Seq ID NO; 295 DNA sequence 

65 Nucleic Acid Accession #t NM_0012ie*2 
Coding sequence i 116.. 1180 

1 11 21 31 41 51 

«n 1 » ' • • ' 

/U GTACTOGCCA CGGCACCCAG GCTGOGOGCA OGCX5GTCXXX3 GTOTGCAQCT GGAOAGCXJAO 60 

OGGCCACOOG QAGCCCCCGG CACAGCCOSC GOXGCCCCXS CAGGAGCCOS CGAAOATOOC 120 

CCGGCGCAGC CTGCAOGOGG aSGCOGTGCT CCTGCTGGTG ATCTTAAAGG AACAGCXTTC 180 

CAGCCCGQCC CCAGTGAACG GTTCCAAGTG GACTTATTTT GGTCCTGATG GGGAGAATAG 240 

CTGQT(XAAO AAGTACCCGT CGTGTGGGGG CCXGCTGC3«3 TCCCCX31TAG ACCTGCACAG 300 

75 TQACATCCTC CAGTATGACQ CCAGCCTCAC GCCCCTCOAG TTCCAAGGCT ACAATCTGTC 360 

TGCCAAC3\A0 CAGTTTCTCX: TGACCAACAA TGGCCATTCA GTQAAGCTGA ACCTGCGCTC 420 

GGACATGCAC ATCCAGGGCC TCCAGTCTCQ CTACAOTGCC AOQCAQCTGC ACCTGCACTQ 480 

GGGGAACCOO AATGAOCCGC ACGGCTCTGA GCACACCGTC AGCGGACAGC ACTTOQCCGC 540 

CQAGCTGCAC ATTGTCCATT ATAACTCAGA CCTTTATCCT GACGCCAGCA CTQCCAQCAA 600 

80 CAAGTCAGAA GGCCTOGCTG TCCTGGCTGT TCTCATTGAO ATQGGCTCCT TCAA TCOGTC 660 

CTATGACAAG ATCTTCAGTC ACCTTCAACA TGTAAAflTAC AAAG6GCAG0 AAGCATiaiT 720 

CCOSGGATTC AACATTQAAQ AQCTGCTTCC GGAQAGOACC GCTGAA TATT ACOOCTAOOG 780 

GGGGTOXnXI ACCACACCCC CTTGCAAOCC CACTGTGCTC TGGACAGTTT TCOGAAAOCC 840 

OGIGCAAATT TCGCAGOAOC AGCTGCTGGC TTTGGAlQACA GCOCTOTACT GCACACACAT 900 
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5 

10 
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GGACGfVOCCt 
GAGGCTGGTA 
GGOCATCATC 
GTCCATTTGO 
CAAGCCAGCC 
ACATGCAGQA 
ACCTCAAGGT 
TGGGTOCCTC 
TGCACTCTCA 
GAAGTGGTCT 
AAAACACTTT 
GCTTCTCCAA 
CAACTTCTTT 
CTTTAATCTG 
AGAAAAAAGA 
CCTCT66ATC 
OGTGGGGATO 
TCTOGGAOAQ 
TGTAAGQACA 
ATGTTTTATA 
CTGTGTCCTG 
AATAGGCAGA 
CACTTCTGCA 
CT6AAGATGT 
ATCTGTQCAG 
ATACTCCATA 
TTCCTCCCGC 
ACCXXrTAGTT 
GACAGCATQA 
CCCTCTGACT 
AGTAAGCCAA 
AAAAAAAAAA 



TCCCCCAQAG 
TACACCTCCT 
CTCTCACTGO 
CTTTTCAGAA 
ACCAAGATGG 
AGGACCTT6C 
GTTCTCT6TA 
CTTOCCTTGG 
GACCAAGCA6 
QAATTCTGGA 
CATCCG006C 
6CTTTCAGAC 
CCTCTGGAGA 
CACCTTACAA 
AAACAAGAGA 
ACAGACACTQ 
AGAAGGGAGA 
TCTCACTTTG 
GAATAATACA 
CTCTACAGAT 
GAGGAGACAT 
AACAAAATGA 
GGATCCCTAC 
CAAATGGTGG 
GGGGCTGGCA 
TTCTTCCCOO 
AGOTGAGAAC 
GAGAGCACCT 
AATGTCCTCT 
CTAAGAATTC 
GGTCATGCCA 
AAAAA 



AAAT6ATCAA 
TCTCCCAAOT 
CCCTGGCTQG 
GGAAGAGTAT 
AGACTGAG6C 
TTTQGACCCT 
GCTCAATCTG 
GACCATGGCC 
CAGGAATTCA 
ATCACAAACC 
ITTGCCAGAG 
AAGAATGTGC 
CGGGGCATCT 
CTOGGGGACA 
TATACATTGT 
CACAGAUCIT 
QAGAGCTAGA 
GAATCAQAAT 
ATATTAAGTC 
TTTACAAATA 
QAGTTCOQAG 
CTGTAGAACT 
TXSCCAACCTG 
ITTAQATCAA 
TCATGTATTT 
AATCAGACAO 
CCTGTOGAGA 
TGCAQTTCCC 
TGAAGCATAG 
TCTCXTCTGO 
CTQCACTCTA 



CAACTTCX300 
GCAAGTCTGT 
CATTCTTGGC 
CAAAAAAQGT 
GCAOGCTTGA 
ACACACTT08 
CAAACATGCC 
ACCCCAGAGC 
AAGCTGCTTQ 
AAGCCATGCT 
CGTGCTTTCA 
ACTCrCTGCT 
CCCTCTGATT 
AATGGGGACA 
GATATATTAG 
AGGQAATGGC 
GG6ACAAAGA 
TGGAATCACA 
C3UVATCCAAC 
ATGAGGCTGT 
ATQAOOCAAT 
TATTCTCTGT 
GAATGQAGAC 
ATATATTTCA 
AGGGGCAAGT 
ACAGTTTCTG 
TGAGTC3VGT0 
0QAG7VACTTT 
CTTTTTAAAT 
AAT08CTTGA 
GCCTGQGTGA 



CAGGTCCAGA 
ACTGOGGCAG 
ATCTGTATTO 
GATAACAAGG 
GGTGCCOGGA 
GCTCTCTGGA 
AOGCCTCAGG 
CATCOQATOO 
CTGTAACPGT 
G6TGGGCCAT 
AflT OTCCTO O 
TAGGTrTTGC 
TCCTTCTGCT 
GGAAGGATCA 
GGACACTTTC 
AOGTTCAAGT 
6AATGAGAA0 
TTCTGTTTAT 
CTCCTGTCAG 
TCCTTGAAAA 
CTGCCRTGA 
AGGCCAAATT 
TTTTATCTAC 
AGCTATAAAA 
AATAATGGAA 
ACAG6GQCAA 
OCATGACTGA 
CTGATTCACA 
ATCTTTTTCC 
ACGCA6GAG6 
CAGAGGQAOA 



AGTT06ATGA 
GACTGAGTCT 
'l^GGTGGTGGT 
GAGTCATTTA 
GCTCCOGGGC 
CACTTGCXaAC 
GATCCTCT6C 
ATGGATGGQA 
GTGAGATTGT 
TAATGGTTGG 
AAAGTCTGCT 
TTGGGAAACT 
ATGACAAAAC 
AGTTGTAQAG 
ACAGTCCTGT 
TOCACTTCTT 
ACATGOATGA 
CAAGCCATAA 
TGGAGCAGTT 
TGTGTTGTTO 
ATCTOGAGGA 
TCATTTCA6C 
TTCTCTCTCT 
GCAGGAGGTT 
TGCTACTAAG 
CTCCtCCATT 
QAAG6AACG6 
GTCTCATTTT 
TTCTACTCCT 
CGGAGGTTGC 
CTCCATCTCA 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1B60 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



Seq ID NO: 296 Protein sequence 
Protein Accession #i NP_001209.1 



41 



51 



1 11 2X 31 , 

I 1 I 1 i I 

MPRRSUIAAA VLLbVILKEQ PSSPAPVNGS KWTYPGPDGE NSWSKKYPSC GGLLQSPIDI. 60 
HSDILQYDAS LTPLEFQGYN LSANKQPI»LT NNGHSVKLNL PSD MHIQ GLQ SRYSATQLHIi 120 
HWQNPNDPHG SBHTVSGQHP AAELHIVHYN SDLYPDASTA SNKSBSLAVL AVLIEMOSPM IBO 
PSYDKIFSHL QHVKYKGQBA FVPGFNIEEL LPERTABYYR TOGSLTTPPC NPTVLWTVPR 240 
NPVQISQEQL XALETALYCT HMDDPSPREM INNFRQVQKF DERLVVT8PS QVQVCTAAOL 300 
SLGIXL8LAL AGILGICIW WSXHLFRRK SIKRQDHMSV lYKPATKMBT BAHA 

Seq ID NO: 297 DNA sequence 

Nucleic Acid Accession #t NM_006632.1 

Ooding sequence t 377 . . 1582 



A060GTG0GC 
60GGC0GT0C 
GACAAQAA6A 
CTTAACAAGC 
GGG6TGCTTC 
CTAGQAOAGG 
CAAGAACGCA 
ATGTTCT6CT 
ACAAAATGTC 
CCAGCTCAAT 
CCCAAAGAOT 
GTGGGGCCCT 
GGGAT6CTTT 
TGTCTTCTAT 
TTATGATGAC 
CTCCTTGAAA 
ATCTCTACCC 
AATGGTTGTA 
TGGACTTCTA 
TCTGGCAGAT 
AATTTTAGGA 
CTATATCACA 
AGGOATTTAT 
ATCAA6AGGA 
TAGTCAGGAC 
GTTAGGACTA 
GA6AAAACTC 
TATTQOITAA 
OTGGCrCTTT 
ATGTAA6ATG 



XI 
I 

CCAGGGGTCC 
T6GAGCACCT 
CXX3QCATC31T 
TGCTGGGCX3A 
TGCCCCAGAA 
AGAAACACAA 
CAAOATATGC 
GGCTATGGAA 
ATCATQAACA 
GATTCCTCTG 
CTTCCTGCAA 
GCACAAGAAC 
ACTGCCATOC 
ATCTTTG6A0 
CCCTTTTCCT 
CAACAGGTOQ 
ATTTGGTCCA 
TACATACCAA 
TCTGCCCTTC 
TTCCTTCTAA 
AGTCrCCCCT 
GCAACTOCCT 
ATCAATGTCT 
TTTTOQAGCA 
CCTGAGTTTG 
CTCTTCTACC 
ACTOGTTTAT 
AATAGAAGGC 
TCAATTAT6A 
AATAAAAATT 



21 

I 

GCCCACGOGT 
6ACOGCC6AG 
CCTGCX5CCAC 
AGTCACCATC 
GACCAAGAGC 
TGGCXyvCCAA 
AAGTOGATGA 
TAGCOCTOGT 
TCACCATGGT 
AOGTGCTGCC 
AGTCCTCAAT 
GAAGCAGACT 
TCATAGGTGG 
GTGTTGGCTG 
ATCCATGQAT 
GGTCTTCTAA 
TATGTTTAOO 
CTTACATCAG 
CTTTTATTGT 
CCAAAAAGTT 
CTTCAGCACT 
TGCTGACX3CT 
TA6ATATTGC 
TAGCACCTGT 
GGTGGAGGAA 
TCATATTTGG 
GAAGnrTATCC 
TTOCQTQATG 
GATCAGTTCA 
CAAATAAAAT 



31 

I 

C0G6TC6GGG 
ATCCTGGAGC 
CTGTAGCTGG 
G0GCAGGGCX3 
CACCACAAGG 
GACAGA6TTG 
QACACTGATC 
CTTACATTTC 
AGCCATGQTC 
TGTTGACTCA 
ACTTGGGGGT 
CTGCAGCATT 
CTTCATTAGT 
TOTCPGCTGC 
AAGCACCTCA 
GCAOCCTCTT 
CTGTTTCAGC 
CTCTGTGTAC 
TGCCTGGGTC 
TAGACTCATC 
CATTGTOTCT 
CTCTTGCXjGA 
TCCAAGGTAT 
CATTGTACCC 
TGTCTTCTTC 
AGAAGCAGAT 
CACCTTGGAT 
AAAATACCAG 
TTATTTTATT 
OATAACTAAO 



41 
I 

CCA0A606CA 
TGQCTGGCAA 
CCATTOGCAA 
GTGTGCTQCC 
CCAAGG6TGA 
AGTOXACAO 
CCCAGGAAAG 
TGCAATTTCA 
AACAQCACAA 
TTTGGTGGCC 
CAGTTTGCAA 
GCTTTATCAG 
GAAACCCXTG 
CTTCTCrGGT 
6AAAAAGAAT 
CCCATCAAAO 
CATCAATGOT 
CAT6TTAACA 
ATAGGC3VTGG 
ACTGTGAGGA 
CTGCCTTACC 
TTAAGC3VCAT 
TCCAGTTTTC 
ACTGTCAOCG 
TTGCTGTTTG 
GTCCSUVGAAT 
G6AAAAGTCA 
TGAA AAGATT 
GAGACTTTTT 
AAAAAAAAAA 



51 
I 

GGTQTACCTO 
CCXXKSCCCGC 
OGGCGAGQAG 
CAACATTCAO 
AAAOCATTCA 
CAAOG GAQAG 
GTOCAAGTTT 
CAAOQATAGC 
GCCCTCAATC 
TAAOTAAAGC 
TTT66GAAAA 
GAATGTTACT 

GGTGGCCxrrr 

TTGTTGTQAT 
ACATCATATC 
CTATGCTCAG 
TAGTTAGCAC 
TCAGA6ACAA 
TGGGAGGCTA 
AAATTGCCAC 
TCAATTCG6G 
TGTGTCA6TC 
TCATGGGAGC 
GATTTCTTCT 
COSTTAACCT 
GGGCTAAAGA 
TTA0GCACCX3 
TTTTTTTCCT 
TTTGAGAGAA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NOi 298 Protein sequence 
Protein Accession 9: NP_006623.1 
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1 II 31 31 41 51 

I I I I I I 

MQVDBTLIPR KSFSLCSARY GIALVLRFClf FTTZAQNVIM NITMVAKVNS TSPQSQUIDS 60 

SEVLFVDSFG GLSKAPKSLP AKS8ZLG0QF AIHBKWQPPQ GR8RLC8IAL SGHLLGCFTA 120 

ILIGGPISET IjGWPPVPYIP GGVGCVOCLL WFWIYDDPP SfYPWISTSBK EYIISSLKQQ 180 

VGSSKQPLPI KAMLRSLPIW SICLGCFSHQ WLVSTMWYI PTYI8SVYHV NIRDNGLLSA 240 

LPPZVAWVIG MVCGYLADFL LTKKFRLITV RKIATILGSL PSSALIVSLP YLNSGYITAT 300 

ALI.TLSOGLS TLCQSGIYIN VIiDIAFRYSS FLMOASRGFS SXAFVIVPTV SGFIiLSQDPG 360 
FGWRNVFFLL FAVNLLGU.F YLIFGEADVQ EWAXERKLTR L 

Seq ID NO: 299 DMA sequence 

Nucleic Acid Accession NM_003058.1 

Coding sequence t 145.. 1812 ~ 

1 11 21 31 41 51 

I I I I I i 

GGCCCTGCXX: TGAAGGCTQG TCACTTGCAO AGOTAAACTC CCCTCTTTGA CTTCTGGCCA 60 

GGGTTPGTGC TGAGCTGGCT GCAGCCGCTC TCAGCCTCGC TCOSGGCAOQ TCGGQCAGCC 120 

T06GGCCCTC CTGCCTGCAG GATCATGCCC ACCACOGTGG ACGATGTCCT GGAGCATGGA 180 

GGGGAGTTTC ACTTTTTCCA QAAGCAAATQ TTTTTCCTCT TGQCTCTGCT CTCGGCTACC 240 

TTCGOGCCCA TCTACGTGQG CATCGTCTTC CTGGGCTTCA CCCCTGACXyv CCGCTGCCGG 300 

AOCCCCGGAO TGGCOGAGCT QAOTCTOGOC TQCGGCTGGA GTCCTGCAGA GGAACTGAAC 360 

TACAOGGTGC CGGGCCCAGG ACCTGCX3GGC GAAGCCTCCC CAAGACAGTG TAGGCX3CTAC 420 

GAGGTGQACT QGAACCAGAG CACCTTTGAC TGCXHtSGACC CCCTGGCCAG CCTGQACACC 480 

AACAGGAGCC GCCTGCCACT GGGCCCCTGC CX3GGA0GGCT GGGTGTACGA GACGCCTGGC 540 

TCX5TCCATCG TCACCOAQTT TAACCTGGTA TGTGOCAACT CCrGOATGTT GGACCTATTC 600 

CAGTCATCAG TGAATGTAGG ATTCTTTATT GGCTCTATGA GTATOGGCTA CATAGCAGAC 660 

AGGTTTGGCX: GTAAGCTCTG CCTCCTAACT ACAGTCCTCA TAAATGCTGC AGCT6GAGTT 720 

CTCATGGCCA TTTCCCCAAC CTATACGTQO ATGTTAATTT TTCQCTTAAT CCAA6GACTQ 780 

GTCAGCAAAG CAGGCTGGTT AATAG6CTAC ATCCTGATTA CAGAATTTGT TGGGOQGAQA 840 

TATOGGAQAA CAGT66G6AT TTTTTACCAA QTTGCCTATA CA6TTG6GCT CCTGGTGGTA 900 

GCTGGGGTGG CTTACGCACT TCCTCACTGG AGGTGGTTGC AQTrcaCAGT TGCTCTGCCC 960 

AACTTCTTCT TCTT6CTCTA TTACTGGTGC ATACCTGAGT CTCCCAGGTG GCTQATCTCC 1020 

CAQAATAAGA AT6CTGAAGC CAT6AGAATC ATTAAGCACA TOGCAAAGAA AAATGGAAAA 1080 

TCTCTACC06 CCTCCCTTCA OGGCCTGAGA CTT6AAGAGG AAACTGGCAA GAAATTGAAC 1140 

CCTTCATTTC TT6ACTTGGT CAOAACTCCT CAGATAA6GA AACATACTAT GATATTGATQ 1200 

TACAACTGGT TCACGAGCTC TGTGCTCTAC CAGGGCCTCA TCATOCACAT GGGCCTTGCA 1260 

GOTQACAATA TCTACCTGGA TTTCTTCTAC TCTGCCCTGQ TTOAATTCCC AQCTGCCTTC 1320 

ATGATCATCC TCACXIATOSA C06CAT0GGA CGCOGTTACC CTTGGGCTGC ATCAAATAT6 1380 

(3TTGCAQ6QQ CAGOCTGTCT GGOCTCAOTT TTTATAOCTO QTGATCTACA ATQGCTAAAA 1440 

ATTATTATCT CATGCTTOGO AAQAATG66G ATCACAATGG CCTATGAGAT AOTCTGCCTG 1500 

GTCAATGCTG AGCT6TACCC CACATTCATT AGGAATCTTG GOGTCCACAT CTGTTCCTCA 1560 

ATGTGTGACA TTGGTGQCAT CATCAOGCCA TTOCTGQTCT ACOGGCTCAC TAACATCTGG 1620 

CTTGAGCTCC CGCTGATGGT TTTCGGOGTA CTTGGCTTGG TTGCTGGAGG TCTGGTGCTG 1680 

rrOCTTCGAO AAACTAAAGQ GAAAGCTTTG CCTGAaAGCA TGQAOQAAGC CBAAAATATQ 1740 

CAAA6ACCAA GAAAAAATAA AGAAAAQATQ ATTTACCTGC AAGTTCAGAA ACTAOACATT 1800 

CCATTGAACT AAGAAGAGAG ACCXTTTGCTO CTGTCATQAC CTAGCTTTGA TGGCAGCAA6 1860 

ACCAAAAGTA GAAATCCXTG CACTCATCAC AAAGCCCATA CAACTCAACC AAACTTACCC 1920 

CTGAGCCCTA TCAACCTAGQ TCTACAGCCA QTG6AGTCTA TT6TACACT0 TGQAAAAATA 1980 

CCCATGGGAC CAQATCCTGC CAAATTCTTC CAQCTCACTT TATTCTCAQC ATTCCTAGGA 2040 

CATTGGACAT TGGTTTTCT6 6AG6GTTTTT TTTCOGATCT TTGTATTTTT TTAAATTTQA 2100 

TTCITTTCTT TQCAATGCTA GCAACCAQAA TACATAGGGG AACTGTQOGC TAQQCAAANA 2160 

AAATAGAAAA AOTGTGAAAA ACAGTAAAGT TGGGAGAGGA GCATCTArrT TCTTAAAGAA 2220 
ATAAAACACC NAAAACAAAA AAAAAAAAAA AAAAAAA 

Seq XD NOt 300 Protein sequence 
Protein Accession «t NP_003049.l 

1 11 21 31 41 51 

I I I I I I 

MFTTVDDVLB BGQBFHFFQR QMFFLLALLS ATFAPZYVOZ VFLSFTFDHR CRSPGVAELS 60 

LROGWSPAEE I2IYTVPGPGP AGEA8PRQCR RYEVDHNQST FDCVDPLASL DINRGRLFLO 120 

PCRDGWVYET FGSSIVTEFN LVCANSHMLD LFQSSVNVGF FIGSMSIGYI ADRFGRKLCL 180 

LTTVLIMAAA GVLMAISPTY TWMLIFRLIQ GLVSKAGWLI QYIIiITEFVG KRYRRTVOIF 240 

YQVAYTVGLL VLAOVAYALP HHRHLQFTVA LPNFPFIiLYY WCIPSSPRNL XSQNKNABAM 300 

RXXKHXAKKN 6KSLPASLQR LRLESBTOXK UTPSFWLVR TPQXRXHTMX U4VXmFTS9V 360 

LYQGLXKRMG LAGDNIYLDF PYSALVEFPA AFMZZLTIDR ZGRRYPUAA8 KMVAGAACLA 420 

SVFIPGDLQH LKIII8CL6R MQITMAYEXV CLVNAELYPT FIRNLGVHIC SSNCDIGGII 480 

TPFLVYRLTN IWLELPLKVF GVIiGLVAGGL VLLLPETKGK ALPETIEEAE NMQRPRXKKE 540 
KMXYLQVQKL DIPU9 

Seq ID N0» 301 DMA sequence 

Nucleic Acid Accession NN_012206.1 

CSoding sequence t 52.. 1131 ~ 

1 11 21 31 41 51 

I I I I I t 

GTTACCCAGC ATTGTGAGTG ACAGAGCCTG GATCT0AACX3 CTQATCCCAT AATQCATCCT 60 

CAAGTG6TCA TCTTAAGCCT CATCCTACAT CTGOCAGATT CTGTAGCTGO TTCTGTAAAQ 120 

GnOSTGQAS AGGCAGGTCC ATCTOTCACA CTAG0CT6OC ACTAGAOTGO AGCTOTCACA 180 

TCAATGTGCT GOAATAGAGG CTCATGTTCT CTATTCACAT GCCAAAATGO C A TTGT C T GO 240 

ACCAATGGAA COCAOGTCAC CTATOGQAAG 0ACACA06CT ATAAGCTATT GGGGGAOCTT 300 

TCAAGAAGGO ATGTCTCTTT GACCATAGAA AATACAOCTQ TQTCTQACAG TO G CQT A TAT 360 
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T G rf G COCr m TTGAGCACai TGOaTGOTTC AATGACATGA AAATCACOGT ATCATTGGAQ 420 

ATTOTCCCAC CCAAGGTCAC GACTACTCCA ATTGTCACAA CTCTTCCAAC CGTCAC3GACT 480 

GTTGQAAOGA GCACCACTGT TCCAACGACA AOQACTGTTC CAAOGACAAC TGTTCCAACA 54 0 

ACAATOAGCA TTCCAAGOAC AA0QACT6TT COQAOQACAA T6ACTGTTTC AAOQACAACO 600 

AGCX3TTCCAA CXSACAAGOAG CATTCCAACA ACAACSMTG TTCC A OT G AC AACAAOGQTC 660 

TCTACCTTTQ TTCCTCCAAT QCCTTTGCCC AGGCAGAACC ATGAACXAGT AGCCACTTCA 720 

CCATCTTCAC CTCACCCAGC AGAAACCCAC CCTAC3GACAC TGCAGGGAGC AATAAGGAGA 780 

GAACCCACCA GCTCACCATT GTACTCTTAC ACAACAGAT6 GGAAT6ACAC CGTGACAGA6 840 

TCTTCAOATO aOCTTTGGAA TAACAATCAA ACTCAACTCT TCCTAGAACA TAQTCTACTO 900 

ACGGCCAATA CCACTAAAGG AATCTATGCT GGAGTCTOTA TTTCT6TCTT GGT6CTTCTT 960 

GCTCTTTTOG OTQTCATCAT TGCCAAAAAG TATTTCTTCA AAAAGGAGGT TCAACAACTA 1020 

AGTGTTTCAT TTAGCAGCXn' TCAAATTAAA GCTTTQCAAA ATQCAOTTGA AAAGGAAGTC 1080 

CAAGCAQAAO ACAATATCTA CATTGA6AAT AGTCTTTATG CCAOGGACTA AGACCCAGTG 1140 

(jmci t:mti agagtttaog cccatcacto cagaagacto aacaootatc aocacatcao 1200 

ATGTCTTTTA GACTCCAAOA CAATTTTTCT OTTTCAGTTT CATCTGGCAT TCCAACATGT 1260 

CAOTOATACT QGGTAGAGTA ACTCTCCCAC TCCAAACTGT GTATAGTCAA CCTCATCATT 1320 

AATOTAGTCC TAATTTGTTT TGCTAAAACT GGCTCAATCC TTCTGATCAT TGCAGAGTTT 13 BO 
TCTCTCAAAC ATGAACACTT TAGAATTGTA TGTTCTCTTT AOACCCCATA AATCCTGTAT 

Seq ID NO: 302 Protein sequence 
Protein Accession #: NP_03633B.l 

1 11 21 31 41 51 

I 1 I I I I 

KHPQWIliSL ILHLAOSVAG SVKVGGEAGP SVTLPOIYSa AVTSMCHKRG SCSLFTCQNG 60 

IVWTNGTHVT YRKDTRYKLL GDLSRRDVSL TIENTAVSDS GVYCCRVEHR GWFNDMKITV 120 

SLEIVPPKVT TTPIVTTVPT VTTVRTSTTV PTTTTVPTTT VPTTMSIPTT TTVPTTMTV8 180 

TTTSVPTTTS IPTTTSVPVT TTVSTFVPPM PLPRQNHBPV ATSPSSPQPA BTHPTTLQGA 240 

IRREPTSSPL YSYTTDGNDT VTBSSDGLHN NNQTQLFLEH SLLTAmTXG ZYAGVCISVL 300 
VLLALLGVIX AKKYFFKKEV QQLSVSFSSL QIKALQNAVE XEVQAB]NIY XBNSLYATD 

Seq ID NOt 303 DMA sequence 

Nucleic Acid Accession #: NM_001044.1 ' 

Coding sequence: 129.. 1991 

I 11 21 31 41 51 

1111)1 

ACCQCTCCGG AGOGGGAGGG 6AGGCTTCGC GGAACGCTCT CGGCGCXSlGa ACTOGOffTGC 60 

AAAGCCCAGG CC0GGGGG6C CAGACCAAGA GGGAAQAAGC ACAGAATTCC TCAACTCCCA 120 

OrGTOCOCAT GAOTAAGAGC AAATGCTCOG TCGGACTCAT GTCTTCOQTG 6TGGCCC0GG 180 

CTAAGGA8CC CAATGCG6TG G6CC0QAAGG AGGTOGAGCT CATCCTTGTC AAGGAGCAGA 240 

ACQGAOTGCA GCTCAGCAGC TCCACCCTCA CCAACCCGCG 6CAGAGCCCC GTGGAGGCCC 300 

AGGATCGOGA GACXTTGOGGC AAGAAQATOG ACTTTCTCCT GTCCGTCATT GGCTTTGCTG 360 

TGGACCTGGC CAACGTCTGG CX3GTTCCCCT ACCTOTGCTA CAAAAATGOT GQCGGT6CCT 420 

TCCTGGTCXX: CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCX:ACTTTTC TACATQQAGC 480 

tGGCCCTGQO CCAGTTCAAC AGGGAAGGaO COGGTGOTGT CT66AAGATC TGCCCCATAC 540 

TQAAAGOTOT GQOCTTCAGG OTCATCCTCA TCTCACTQTA TGTGG6CTTC TTCTACAACG 600 

TCATCATCGC CTGGGCGCTO CACTATCTCT TCTCCTCCTT CACCACQGAG CTCCOCrGGA 660 

TCCACTGCAA CAACTCCTGG AACAQCCCCA ACTGCTOSGA TGCCCATCCX GGTGACTCCA 720 

GTG6AGACAG CTCGGGCCTC AACGACACTT TTGGGACCAC ACCTGCTGCC aAOTACTTTQ 780 

AAOGrTGQOOT GCTQCAOCTC CACCAOAOCC ATGGCATCX3A GGACCTG660 GCTC060GGT 640 

GGCAGCrCAC AOCCT QO CT G GT6CTGGTCA T 0 GT6CTQCT CTACTTCAGC CTCTG6AA0G 900 

OCGTGAAOAC CTCAGGOAAG GTGQTATGQA TCACAGCCAC CATGCCATAC GTGGTCCTCA 960 

CTGCCCTGCT CCTGOGT G GG GTCACCCTCC CTGGAGCCAT AOACOGCATC AGAGCATACC 1020 

TGAGCGTTGA CTTCTACGGO CTCTGCGAGQ 06TCTGTTTG GATT6AG60G GCCACCCAGG 1080 

TGT8CTTCTC CCTGG606T0 GGGTrOGGGO TGCTQATC3GC CTTCTOCAGC TACAACAAGT 1140 

TCACCAACAA CTQCTACAGG GAOSGGATTG TCACCACCTC CATCAACTCC CTGAOGAGCT 1200 

TCTCCrOOOa CTTOOTOGTC TTCTOCTTC C TX3GGGTACAT GGCACAGAAG CACAOTOTGC 1260 

CCATOGGGGA CGTGGCCAAG GAOGGGCXAG GGCTGATCTT CATCATCTAC CXX^GAAGCCA 1320 

TOOCCACOCT CCCrCTQTCC TCAGCCTGGG COGTGGTCTT CTTCATCATG CTGCTCACCC 1380 

TGGGTAT06A CAQOGCCATG OGTQGTATGO AgTCAGTOAT CACC3066CTC ATOGATGAGT 1440 

TCCA6CT6CT GCACA6ACAC OGTOAGCTCT TCAOGCTCTT CATOGTCCTG GOGACCTTCC 1500 

TCCTOTCOCT GTTCTGCGTC ACCSVAOGGTG GCATCTACGT CTTCAOOCTC CTGQACCATT 1560 

TTGCAQCCOQ CA03TCCATC CTCTTrGGAQ TGCTCATOGA AGCCATCGGA GTGGCCTQ6T 1620 

TCTATGQTGT T6GGCAGTTC AGOGAGGACA TCCA0CA6AT GACGGQ6CAG 0GGCCCA6CC 1680 

TGTACTQOOG GCTOTQCTGO AAGCTGQTCA GCXXCTGCTT TCTCCTGTTC GTGGT0GT6G 1740 

TCAGCATTGT GACCTTCAGA OCCCCCCACT ACGGAGCCTA CATCTTCXXX: GACTGGGCCA 1600 

ACGCX3CT6GG CTGGGTCATC GCCACATGCT GCATGGCCAT GGTGCCCATC TATGOSGCCT 1860 

ACAAOTTCTQ CAGCCTGCCT GGOTCCTTTC GAGAGAAACT GGCCTACGCC ATTGCACXXXS 1920 

AGAAGGACOG TGAGCTGGTG GACAGAGGGG AGGTG06CCA GTTGACX3CTC CX3CC31CTGGC 1980 

TCAAOGTGTA QAaOGAGCAO AjQAOGAAGAC COCAGGAAOT CATOCXGCAA TGGGAQA6AC 2040 

ACGAACftAAC CAAG6AAATC TAAGTTTOGA GAGAAAGGAO GOCAACTTCT ACTCTTCAAC ' 2100 

CTCTACTGAA AACACAAACA ACAAAOCAGA AGACTCCTCT CTTCTQACTG TTTACACCTT 2160 

TCOGTGCOGG GAGOGCACCT OGCCGTGTCT TGTGTTGCTQ TAATAA03AC GTAGATCTGT 2220 

OCAGOGAGGT GCACCCCX3TT GTTGTOCCXG CAOGGCAGAA AAAOGTCTAA CTTCATGCTG 22B0 

TCTQTOIQAO GCTGCCTOOC TCCCTGCTCC CTGCTCCOGG CTCTOAGGCT GCCCCAOQQO 2340 

GACTOTOTTC TCAG60GGGG ATCAOQATOC TTOTAOAOQC AOCTGCTSAG AATCCCOGTG 2400 

CTCACASTAO CTTCCTAGAC CATTTACTTT GCCCATATTA AAAAOCCAAG TGTCCTGCTT 2460 

GGTTTAQCTG TGCAGAAGGT GAAATGOAGG AAACCACAAA TTCATGCAAA GTCCTTTCCC 2520 
QATGCGTGGC TCCCASCAOA GOCCGTAAAT TQAG06TTCA GTTGACACAT TGCACACACA 2S80 

GTCTOTTCAa AGGCATTOGA GGATGGGGOT GCIGGTATOT CTCACCAOQA AATTCXGTTT 2640 
ATGrrCTTGC AGCAOAOAOA AATAAAACTC CTTGAAACCA 0CTCAG6CTA CTGCCACTCA 2700 
GGCAGOCT O T GGOTCCTTGT GGTQTAOGOA ACGQCCTGAO AaQAQGGTGT OCTATCCCOG 2760 
GA06CATOCA GGGCCCCCAC AGGAGOGTOT OCTATCOCOO OAOGCATGCA G6GCCCCCAC 2820 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



AGGAGCATGT 
AAOGCATGCA 
T66M3GGTGT 
GACX96GA06C 
CGOVOIGGAO 
CCCAGGACX3C 
TGAGCCGTGA 
CATCAATAAC 
ATCAAAACAA 
TTTACAAGAA 
CACACTGCCC 
TOjGTGGGCC 
tcccctgcaa 
GCCTGTGAAC 
GGGAGG6ACA 
GOGGCTTCOC 
6TTGTTGAAG 
A6CAACXCAG 
TAAGCACAAT 



CCTATCCCTG 
GGGCCCCCAC 
ACTACCCCAG 
ATGCAG GGCC 
OQTOTACTAC 
ATGCAGGGCC 
CCTCCAGGAA 
AACAQTTTTT 
ATTCAAGAAT 
TAATTAGCAA 
TCTGCCACTG 
TGCCTACGTO 
GTGGAOGTGG 
TGCCAGGCAO 
CAGAGGACGG 
CATTGCCTTC 

GTCTTGTCOG 
AAAAOACATC 



GACaCATGCA 
AGGAGCX3TGT 
GACGCAT6CA 
CCCACAGGAG 
CCGAGGATOC 
CCCATGCAGG 
GGGACCCCAC 
ATGTTTGOQA 
GCAGTATCCG 
TACTGAGTGA 
ACAG6AAAOT 
CTGCCGQAOG 
6CTCCAGGGA 
CTGCAGTTAG 
CTTCCCCATC 
TGGGGAGGGA 
GAG06GCTTC 
TGTCTGTTGA 
CACAATG6AA 



GGGCCCCCAC 
ACTACCCCAQ 
GGGCCCCCAC 
OGTGTACTAC 
ATGCAGGGCC 
CAGCCTGCAG 
TGQAATTTTA 
ATGGCTTTTT 
CGAGCCTGCT 
AGGATGTTGG 
GGATGCCATA 
GCAGGGGCOS 
CTGGAGTGTA 
CACAGAGGAT 
GCCTTCTGGC 
CACAGAGGAC 
CCCAT06CCT 
CCAATCTCTA 
AAAAAAAAAO 



AGGAGOOTGT 
GACGCATGCA 
AGGAGCGTGT 
CCCAGGAG6C 
CCCACAGGAG 
ACCAACACTC 
TTTCTCTCAG 
AAAATCATAT 
TGCTGATATT 
CCAAAA6CTG 
6TTTQAATTC 
TGCAGOGCCA 
ATGCTOGGTG 
GGCTTCCCCA 
OGCTGCAGTC 
AQTTTCCCCA 
TCTGGGGAGG 
TTCAGCATCO 
GAATTC 



ACTACCCCAQ 
GGGCCCCCAC 
CCTATOCCOO 
ATGCAOGGOC 
OGTGTACTAC 
TGCCTGGCCT 
GTGOGTGCCA 
TTACCTGTGA 
6CAGTTTTT0 
CTTTCCATOG 
ATGCCTCAAG 
GTCATGGCTQ 
GGAGCCGTCA 
TTQCCTTCTQ 
AGCACAGAGA 
TOGCCTTCTG 
GGCTCCGTGT 
TOTGGGTCCC 



Seq XO NOt 304 Protein sequence 
Protein Acceaelon St NP 001035.1 



MSXSKC8VQL 
BTWGKKZDPL 
GQFNREQAAG 
NNSWNSPNCS 
TACLVIiVIVL 
DFYRLCBASV 
GPWFSFXiOy 
DSAMGGMBSV 
GTSIIiFGVI«I 
VTFRPPHYGA 
RELVDRGEVR 



11 

I 

MSSWAPAKE 
L8VIGFAVDL 
VHKICPILKO 
DAHPGDSSGD 
LYFSLWKOVK 
WIDAATQVCP 
MAQXRSVPIO 
ITGLIDEFQL 
EAIGVAWFYO 
YZFFDWANAL 
QFTLRKWLKV 



21 

I 

PNAVGPKEVB 
ANVWRFPYLC 
VGFTVILI9L 
SSGU3DTFGT 
TSGKWWITA 
SLGVGFGVLI 
DVAKDGPGLZ 
LHRHRELFTL 
VGQFSDDIQQ 
GHVXATSSMA 



31 
I 

ZiILVKEQNGV 
YKNGGGAFLV 
YVGFPVNVII 
TPAAEVPERQ 
TMPYWLTAL 
APSSymCPTlI 
FIZYPBAZAT 
FXVLATFLLS 
MTGQRPSLYW 
MVPZYAAYKF 



41 

I 

QLTSSTLTNP 
PYLLPMVZAG 
AWALHYLPSS 
VLHLHQSHGZ 
LLRGVTLPGA 
MCYRDAIVTT 
LPLSSAHAW 
LFCVTNGGIY 
RLCWKLVSPC 
CSIiPGSFREX 



Seq ZD NO: 305 DNA sequence 

Nucleic Acid Accession #t NN_001216.1 

Coding sequence i 43.. 1422 



1 
I 

GCCCGTACAC 
AGCCCCTGGC 
CTGTCACTGC 
TCCCCCTTGG 
AGTGAAGAGG 
QAGQATCTAC 
TCCCTGAAOT 
AATAATGOCC 
CCGCCCTGQC 
CGCCCCCAGC 
CTCCOGCGGC 
GCTCCTGGGC 
CItSCACTGGQ 
CCTGCOQAGA 
GGGCGCCOGG 
AOTOCCTATG 
GAGGTCCCAG 
TATQAGGGOT 
CAQACAGTGA 
GGTOACTCTC 
GAGGCCTCCT 
AATTCCTGCC 
ACCAGCGTCG 
arQAGCXACC 
TGTGA6AAGC 
AT6CCACTTC 



11 

I 

ACCGTGTGCT 
TCCCTCTGTT 
TGCTTCTQAT 
GAG6AGQCTC 
ATTCACCCAG 
CTGGAGAGQA 
TAQACOATCT 
ACAOGOACAA 
CCOGGGTO T C 
TCQCOGCCTT 
TCCCAGAACT 
TAQA6ATGGC 
GGGCTQCAGG 
TCCACGTGGT 
6AGGCCTGGC 
AGCAOTTGCT 
GACTGGACAT 
CTCTGACTAC 
TGCTGAGTGC 
GGGTACAGCT 
TCCCTGCTGG 
TGGCTGCTG6 
O gri 'CC ATOl ' 
6CCCAGCAGA 
CAGGCAGA60 
CTTTTAACTQ 



21 

1 

GGQACACCCC 
GATCCOGGCC 
GCCZOTCCAT 
TTCXGOGGAA 
AGAQGAGQAT 
6GATCTACCT 
ACCTACTQTT 
AOAASGGGAT 
CCCAGCCT6C 
CTGCCCGGCC 
GC6CCTG06C 
TCTGGGTCCC 
TOSTOOOGGC 
TCACCTGAGC 
0GT6TTGGCC 
OTCTOQCTTG 
ATCTGCACTC 
AC06COCTGT 
TAAGCAGCTC 
GAACTTOCQA 
AGTGGACAGC 
TGACATCCTA 
6CAGATQAGA 
GGTAGCGBAG 
CATCTQAGQO 
0CAA6AAATT 



31 
1 

ACAGTCAGCC 
CCTGCTCCAQ 
OCCCAOAGQT 
GATQACCCAC 
CCACCCGGAG 
GAAGTTAAGC 
GAG6CTCCTG 
GAOCAGAOTC 
GO6GOCC0CT 

AACAATGGCC 
GGGOGGGAGT 
TCGOAGCACA 
ACCQGCTTTQ 
GCCTTTCTGG 
GAAGAAATOG 
CTGCCCTCTG 
GCCCAOGOXO 
CACACCCTCT 
QCGA0GCA6C 
AGTCCTCGGG 
6CCCTGGTTT 
A0GCA6CACA 
ACI6GAGCCT 
GGAOCCGGTA 
TTTTAAAATA 



41 

I 

GCATGGCTCC 
GCCTCACTGT 
TQCCCCGGAT 
TGGG06AG6A 
AGGAGGATCT 
CTAAATCAGA 
QAGATCCTCA 
ATTGOOQCTA 
TGCAOTOCCC 
TGGAACTCCT 
ACAGTGTGCA 
ACGGGGCTCT 
CTGTGQAAGG 
CCAOAOTTQA 
AGGAGGGOCC 
CTGAGQAAGG 
ACTTCAGGCG 
TCATCTGGAC 
CTGACACCCT 
CTTTQAATGG 
CTGCTGAGCC 
TTG6CCTCCT 
OAAGGGQAAC 
AGAG6CTGGA 
ACTGT CC TGT 
AATATTTATA 




2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3460 
3540 
3600 
3660 
3720 
3780 
3840 
3900 



51 
I 

RQSPVEAQDR 
MPLFYMELAL 
PTTELPWZHC 
DDLGPPRNQL 
ZDQZRAYLSV 
SINSLTSFSS 
FPZMLLTLOI 
VFTLLDHFAA 
FXiLFWWSI 
liAYAZAPEXD 



TGGAGGOQAC 
GOTGGATATC 
GGGCTTCCAG 
ACTGACCCTO 
GCAGCTGCAT 
CCACC6TTTC 
OQAQGCCTTO 
GGAA6AAAAC 
CrCAGAGACT 
CTACTTCCAA 
TOTQTTTAAC 
QTGGOQAOCT 
GOQAGTGATT 
AGTCCAGCTG 
TTTTGCTGTC 
C!AAAOGGOGT 
TCTTG6AGAA 
CCTOCTCATT 
AT 



8eq ZD NOi 306 Protein sequence 
Protein Accession #$ NP_001207.1 



1 
1 

MAPLCPSPWL 



DPQBPQNNAH 
BUiGFQLPPL 
VEGHRFPABZ 
BEGSETQVPG 
DTIiWGPGPSR 
GU.FAVTSVA 



11 
I 

PLLIPAPAPG 
SPREEDFPGE 
RDKEGDDQSB 
PSXiRLRNNGH 
HWHLSTAFA 
UDZSAUaPSD 
LQUIPRATQP 
FLVQNRRQHR 



21 
I 

LTVQLUiSLI. 
EDIiPGBEDLP 
WRYGGDPPWP 
SVQLTIiPPQL 
RVDEALGRPG 
F8RYFQYEGS 
UKSBVZBASP 
RGTXGGVSYR 



31 

1 

LLMPVKPQRL 
GEEDLPEVXP 
RVSPACAGR? 
EMALGPGREY 
GLAVLAAFLB 
LTTPPCAQOV 
PAGVDS8PRA 
PABVAETGA 



41 

I 

PRMQEDSPLG 



QSPVDIRPQL 
RALQLHLHWQ 
EGPEENSAYE 
ZWrVFNQTVN 
AEPVQIMSCL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



EDLPTVEAPO 
AAFCPALRPL 
AAGRPG9EUT 
QIjLSRLEEZA 
LSAKQLHTLS 
AA6DXZALVP 



60 
120 
180 
240 
300 
360 
420 
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Seq ID NO: 307 DKA sequence 

Nucleic Acid AccesBion «t NM_003039.1 

Coding sequence: 76.. 1581 " 

5 1 11 21 31 41 51 

I I 1 I I i 

CTTCTCTCrC CATTCAOTGC ACGOGWACT TTGGCTAAAA QGAGGTGWGC GGCACTCTGC 60 

CCTTCCAGAG CAAGCATGGA GCAACAGGAT CAGA6CAT6A AGGAAGGGAO 6CTGACQCTT 120 

GTGCTTGCCC TGGCAACCCT GATAGCTGCC TTTGOOTCAT CCTTCCAGTA TGGGTACAAC IBO 

10 GTGGCTGCTO TCAACTCCCC AGCACTGCTC ATGCAACAAT TTTACAATGA GACTTACTAT 240 

GGTAGGACG6 GTGAATTCAT G6AAGACTTC C0CTTQAO9T TGCTQTG G TC TOTAACOGTC 300 

TCCATGTTTC CATrroOAOO GTTTATOGGA TCCCTCCTOO TCOQCCCCTT GGTOAATAAA 360 

TTTGGCAGAA AAGGGGCCTT GCrGTTCAAC AACATATTTT CTATOGTGCC TGCGATCTTA 420 

ATGGGATGCA GCRGAGTOGC CACATCATTT GAGCTTATCA TTATTTCCAG ACTTTTGGTQ 480 

IS GGAATATGTG CAGGTGTATC TTCCAACGTG GTCXCCATGT ACTTAGGG6A GCTGOCOCCT 540 

AAAAACCTGC G060GGCTCT GGGG6T0(nG CCCC3U3CTCT TCATCACfGT TOGCATOCTT 600 

GTGGCCCAGA TCTTTCQTCT TOGGAATCTC CTTGOUU^CG TA0ATG6CTG GCOQATCCTO 660 

CTGGGGCTQA CCGQGGTCCC OG0600GCTO CAOCTCCTTC TOCTGCCCTT CTTCCCOQAa 720 

AGCCCCAGGT ACCTGCTGAT TCAGAAGAAA GA0GAAQC6G COQCCAAGAA AGCCCTAC31Q 780 

20 AC6CTGOGG0 GCTGGGACTC TGTO6ACAG0 GAGGrGGCCG AOATCCGGCA G6AG6ATGAQ 640 

6CAGAGAAGG COa QO aG C rr CATCTCOGTG CTGAAGCTGT TCGGOATGCG CTCGCTGGGC 900 

TGGCAGCTGC TGTCCATCAT G6TCCTCAT8 GGOGGCCAGC AGCT8T0GGG OGTCAACGCT 960 

ATCTACTACT ACQCOGACCA GATCTACCTG AG06CCGQCQ TGCCGGAGGA GCACX3TGCAG 1020 

TAOGTGAOGO CCGGCACCX5G GGCCQT6AAC GTGGTCATGA CCTTCTGCGC OGTGTTOOTG 1080 

25 GTGOAOCTCC TGGGTCGGAG GCTOCTGCTG CTGCTGGQCT TCTCCATCTQ CCTCATAGCC 1140 

T0CTGCX3TGC TCACTGCAGC TCTGGCACTG CAGGACACAG TGTCCTGQAT GCCATACATC 1200 

AGCATCGTCT GTGTCATCTC CTACOTCATA GQACAT6CCC TOGGGCGCAG TCOa.TACCC 1260 

GOGCTGCTCA TCACTGAGAT CTTCCT6CAS TCCTCTOsGC CATCTGCCTT CAT6GTGGG0 1320 

GGCAGTGTGC ACTQGCTCTC CAACTTCACC GTGGGCTTGA TCTTCCCGTT CATCCAGGAG 1380 

30 GGCCTCGGCC OQTACAGCTT CATTGTCTTC GCOGTQATCT GGCTCCTCAC CAGCATGTAC 1440 

ATCTTCTTQA TTGT O COGGA QACCAAOaCC AAQMX3TTCA TAGAOATCAA OCAOATTTTC 1500 

ACCAA6ATGA ATAAGGT6TC TGAAGTGTAC COGGAAAAGG AGGAACT6AA AGAGCTTCCA 1560 

CCTCTCACTT OGGAACAQTG ACTCTGGAGA GGAA6CCAGT GGAQCTGGTC T6CCAGGGGC 1620 

TTCXXavCTTT GGCTTATTTT TCTGACTTCT AGCTQTCTQT GAATATCCAG AAATAAAACA 1680 

35 ACTCTGATGT GGAATGCAGT CCTCATCTCC AGCCTCCXXZA CCCCAGTGGG AACTQTQCAA 1740 

AGGGCTGCCT TGCTQTTCTT GAAOCTQOGC TGTCTCTCTC CATGTTGQCC T6TCACCAGA 1600 

CCCGAGTCAA TTAAACAGCT GGTCCTCCAC Vl lXJCAX i U r r CAQCCTTOGT OTGQCTCCTG 1860 

GTAACGTGGC TCCACCTTOA TGGGTCAAOC TTTOTOTGGC TCCTGGTAAC ATAACAACAA 1920 

CAGTTACTAT A6TGGTGAGA TGGAAGGAAT CAAATTTTGC CAGAGAAACT AACTCGGTGG 1980 

40 CCCCAACAGG TCTTCCGGGG CCATGGOCAT TTOTTTAQAO CCAAATTCAT CCTCTTACCA 2040 

GATCCTTTTC CAOAAATACC TGTCTAGOAA G6TGTGATOT CAGAAACAAT GACATCCAGA 2100 

AAGCTGAGQA ACAGGTTCCT GTGGAGACAC TGAGTCAOAA TTCXTCATOC AA ATTATT TT 2160 
OTTAOTOGAA AATGGAATTG CTTCTOTGTA OTCS^TAAAA TGAACCTGAT CACTTTTC 

45 Seq ID NO: 308 Protein sequence 
Protein Accession ft: NP_OO3030.1 

1 11 21 31 41 51 

CA I I i ' ^ ' 

J\J MEQQDQSMKB GRLTI.VLAIA TLIAAPGSSP QYGYNVAAVN SPAI»IjMQQFY NETYYGRTGE 60 

FMEDPPLTLL WSVTVSMFPF GGFIGSLLVG PIjVNKFORKQ ALLPKNIFSX VPAILMGCSR 120 

VATSFELIII SRLLVGICAG VSSNWPMYL GEIAPKKLRG ALGWPQLFI TVOILVAQIP 180 

GURNLLANVD GWPILLGIiTG VPAALQUiLL PFPPBSPRYL LIQKKDEAAA KKALQTLRGH 240 

DSVDRBVAEI RQEDEAEKAA GPISVUCLFR MRSLRWQLLS IIVLMGGQQL SGVNArifYYA 300 

55 DQIYLSAGVP BEBVQYVTAG TGAVNWMTF CAVFWBLLG RRLLLLLGFS ICLIACCVIiT 360 

AALALQDTVS HMPYISIVCV ISYVIGHALG PSPIPALLXT BIFLQSSRF8 AFMVGGSVHH 420 

IiSNFTVGLZP PFXQEOIiGPy SFIVFAVICL LTTIYIFLIV PSTKARTFIB INQIFTKHNR 480 

VSBVyPEKBB UCBLPFVTSB Q 

60 Seq ID NO: 309 DNA sequence 

Nucleic Acid Accession #i KM_0012S2.1 
Coding sequence: 138*. 719 

1 11 21 31 41 51 

65 I 1 1 I 1 I 

GGCTGGTGCC CTGACAGGTT GAAGCAAOTA GAC6CCCAGG AGOCOCQGGA GG6GGCTGCA 60 

QTTTCCTTCC TTCCTTCTCQ GCAGC3CTCC QOGCCCCCAT CGCCCCTCCT GCGCTA60GG 120 

AGGTGATCGC CGCGGCGATG CCGGAGGAGG GTTCGGGCTG CTCGGTGCGG CGCAGGCCCT 180 

ATGGGTGGOT CCTGOGOGCT GCTTTGGTCC CATTGGTCGC OGGCTTGGTG ATCTGCCTCG 240 

70 TGQTGTGCAT CCAfiCGCTTC GCACAQGCTC AGCAGCAGCT GCOGCTOQAG TCACTTGGGT 3O0 

GGGAOGTAGC TGAGCTGCAG CTQAATCACA CAGGACCTCA GCAGGACOCC AGGCTATACT 360 

QGCAGGGGGG CCCA6CACTQ GGCCGCTCCT TCCTGCATGG ACCAGAGCTG GACAAGGGGC 420 

AGCTACGTAT CCATOGTGAT GGCATCTACA TGOTACACAT CCAGQTGACX} CTGGCCATCT 480 
GCTCCTCCAC GAOGQCCTCC AGGCACCACC CCACCACCCT GGCCGTGGGA ATCTGCTCTC 540 

75 COGCCTCCCQ TAGCATCAGC CTG C TGOGTC TCAOCTTCCA CCAAGQTTOT ACCATTOCCT 600 
CCCAOCGCCT GAOGCCCCTO GCCCQAQQGG ACACACTCTG CACCAACCTC ACTGGGACAC 660 
TTTT G CCTTC COGAAACACT GATGAGACCT TCTTTGGAGT GCAGTGGGTG 0GCCCCT6AC 720 
CACTGCltSCT OATTAGGGrTT TTTTAAATTT TATTTTATTT TATTTAAQTT CAAGAGAAAA 780 
AGTGTACACA CAGGGOCCAC C OGG GGTTGG GGTGGGAGTG TGQTGGGGGG TAGTGGTGGC 840 

80 AOGACAAGAG AAGGCATTGA GCTTTTTCTT TCATTTTCCX ATTAAAAA 

Seq ID NO: 310 Protein sequence 
Protein Accession #> NP_001243.l 
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5 



I 11 21 31 41 51 

1111)1 

MPBBQSGCSV RRRPYGCVLR AALV?LVAGL VICLWCIQR FAQAQQQLPL ESLGKDVAEL 60 
QLNHTGFQQD PiOiyWQGGPA LGRSFLHGPB LDKGQLRIRR DGIYMVHIQV TIAICS8TTA 120 
SRHHPTTLAV GXC8PASRSI SUiRLSFHQG CTIASQRLTP LAROmiCnT LTGTLLPSRN 160 
TDSTPFGVQW VRP 



Seq ID NOt 311 DMA sequence 
Nucleic Acid Accession fit Bos sequence 
Iv Coding sequence: 1..3978 



1 11 21 31 41 51 

I 1 I i 1 ! 

ATGGTGGGTQ AA6GACCCTA CCTTATCTCA GATCTGGACC AGOGAGGCOG GCGGAQATCC 60 

TTTGCAGAAA GATA7X3ACGC CAGCCTGAAG ACCATGATCC CAGTGCGACC CTOTGCAAGG 120 

TTAGCACCCA ACCCGOTGGA TGATGCOSGO CTACTCTCCT TCGCCACATT TTCCTGGCTC 180 

A0GC0G6TGA TGG7GAAAGG CTACCGGCAA A06CT6ACCQ TAGACACCCT GCCOCCATTG 240 

TOQACATATG ACTCATCTQA CACXyuVTGCC AAAAGATTTC GAGTCCTTTG GGATQAAGAG 300 

GTAGCAAGGG TGGGTCCTGA GAA6GCCTCT CTGAGCCAOQ TGGTGTGGAA ATTCCAOAGG 360 

ACACiGCGTOT TGATGOACAT GGTGGCCAAC ATCCTGTGCA TCA7CATGGC AGGCATAGGG 420 

C06ACA6TTC TC ATTC ROCA AATOCTGCftG CAGACTQASA GGACXTTCTGO GAAAOTCTGG 480 

GTTGaCATTG GACTGlt3CAT AGCOCTTTTT GOQVC06AGT TTACCAAAGT CTTCTTTTGG 540 

GCCCTTGCCT GGGCCATCAA CTACXX3CAC0 GOCATCOGOT TGAAGGTGGC GCTCTCCACC 600 

TTGOTTTTTG AAAACCTAGT GTCCTTCAAG ACATT6ACCC ACATCTCTGT TGGOQAGOTG 660 

CTCAATATAC TOTCAAGTGA TAQCTATTCT TTOTTTGAAG CTGCCTTGTT TTGTCCnTG 720 

CCAGCraCCA TCCCGATCCT AATGGTCTTT TGTGC3GGGGT ACGCCTTTTT GATTCTG600 7B0 

CrCACAOCTC TCATOGGOAT ATGAGTGTAT GFCATATTCA TACCC6TCCA 6ATGTTTATG B40 

GCCAAGGTCA ATTCAGCTTT CCGAAGGTCA GCAATTTTGG TGACAOACAA 60QAOTTCAO 900 

ACAATGAATG AOTTTCTGAC CTGCATCAGQ CTGATCAAAA TGTATGCCTG GGAGAAATCT 960 

TTTACCAACA CTATCCAAGA TATAAGAAGQ A0GGAAA6AA AATTACTGGA AAAAGCTG6A 1020 

TTTQTCCAAA 0TG6AAACTC TGCCCTG6CC CGCATCGT6T CCACCATAGC CATOGTGCTG lOBO 

ACATTATCCT GCCACATCCT CCTQAGAC6C AAACTCAC06 O^CCCSGTGGC ATTTAOTCTG 1140 

ATTGCCATOT TTAATGTAAT QAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 1200 

ATGGCTGAA6 CGAATGTCTC TCTAAGGAOA ATQAAGAAAA TTCTCATAGA TAAAAGCCCC 1260 

CCATCTTACA TCACCCAACC AGAAGACCCA GATACT6TCT TGCTTTTAGC AAATQCCACC 1320 

TTGACATGGO AGCATGAAOC CAGCAGGAAA AOTAOOCCAA AQAAATTGCA GAAOCAGAAA 1380 

AGGCATTTAT 6CAAGAAACA GAGGTCAGA6 GCATACA6T6 AQAaOAGTCC ACCAGGCAAO 1440 

GGA6CCACTQ GCCCAGAGGA GCAAAGTGAC AGCCTCAAAT CGGTTCTGCA CAGCATAAGC 1500 

TTTGTGGTGA GAAAGTTATG TOGTTATCCC GAAQCXXAGC TCCTGQCTTG GAGGTGGCCA 1560 

GCAQTGTTTG TTOGGA6AAT C3iTCAGAGGA TACAGGCCTC ATGGATTTTC T6CTAAAGAC 1620 

AAGQATGAAT CTA0AAG6CT TCTTACTTGO CGCGAAGAAO TOGATAQGAC TCAAAGGGCA 1680 

GOCAAATACC TG6GGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAOCTCC 1740 

CTCCTTGCAG CTCTCXTTAGO ACAGATGCAG CTGCAGAAAG GGGTOOrGGC AGTCAATGGA 1800 

ACTTTGGCCT AOGTTTCAOV GCAGGCATGG ATCTTTCATG QAAATOTGAG AGAAAACATA 1860 

CTCTTTGGAG AAAAQTATGA TC3VCCAAAG6 TATCAGCACA CAGTC0606T CTGTGGCCTC 1920 

CAGAAGBACC TOAGCAACCT CXXCTATGQA GACCTOACTG AOATTGOGQA GOQQGGCCTC 1980 

AAOCTCTCTO QQGGOCMMS GCAGAGQATT AGCCTGGOCX: 6CXSCTGTCTA CTCOGACOGT 2040 

CAGCrCTACC TGCTGGA08A CCCXXTGTCG GCCGTOGAOO CCCACGTGGG QAAGCACGTC 2100 

TTTOAGGAGT OCATTAAGAA GACGCTCAGO GQAAAGACAG TCX3TCCTGGT GACCCACCAG 2160 

CTACAGTTCT TAOAOTCTTa TGATGAAGTT ATTTTATTAO AAGATGGAGA GATTTGTGAA 2220 

AAGGOAACCC ACAAGGAGTT AATGGAGGAO AGASGGOGCT ATGCAAAACT OATTCACAAC 2280 

CTGGGAGGAT TGGAGTTCAA GGATOCTGAA CACCTTTACA ATGCAGCAAT G6TGGAA6CC 2340 

TTCAAGGAGA GCCCTGCTGA GAQAGAGGAA GATGCTGGTA TAATC3QGGTA CCTCCTTTCT 2400 

CTCTTCACTG TGTTCCTCTT CCTCCTOATG ATTGQC3^GCG CTGCCTTCAG CAACTGGTGQ 2460 

CTOQOTCTCT G6TTGGAGAA GG6CTCACX36 AT6ACCTGT6 GGCCCCAGGG CAACAGGACC 2520 

ATCTGTXSAOG TCXSGOQOGQT OCTGGCAGAC ATGG6TGAQC ATGTGtACCA GTG OG T G TAC 2580 

ACTOCAA6C31 TGGTGTTCAT GCT G G T G TTl' GGGGTGACXSV AAGGCTTGGT CTTCACCAAG 2640 

ACCACACTQA TGGCATCCTC CTCTCTGCAT GACAOGGTGT TTGATAAGAT CTTAAAGAGC 2700 

CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGOCTAA T6AACXX5TTT TTCCAAGGAT 2760 

ATGGA06AGC TQGATGTGAG GCT6CG0TTT CAC6CA6AGA ACTTTCT6CA GCAGTTTTrr 2620 

ATGGTGOIGT TTATTCTCGT GATCTTGOCT GCTGIGmC C TO CIG l 'CC r TTTAGTCOTG 2680 

GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTAGGCATTT TCCACAGAGG AGTCCAGGAO 2940 

CTCAAGAAGG TGGAGAATGT CAOCCGGTCA CCCrOGTTCA CCCACATCAC CTOCTCCATG 3000 

CAOGGCCTGG GCATCATTCA CGCCTATGGC AAGAAGGAGA 0CT6CATCAC CTATACTTCSV 3060 

TCCAAAG6GC TGTCATTGTC ATACATCATC CAGCT6A6CG GACTGCTCCA AGTGTGTGTG 3120 

09AACGQQAA CAjBAGAGGCA AGCCAAATTC AeXTTCOBTOO AGCTGCTCAlG OGAATACATT 3180 

T06ACCTGT6 TTCCTtSAATG CACTCATCCC CTCAAAGTGG GGACCTGTGC CAAGGACTGG 3240 

CCCAGCTGTG GGGAGATCAC CTTCAOAOAC TATCAGATGA GATACAGAQA CAACACCCCC 3300 

CTTGTTCTC6 ACAGCCTGAA CTTGAACATA CAAAGTGGGC AGACAiGTOGQ GATrGTTGGA 3360 

/U AQAACAGGTT CCGGAAAGTC AT0QTTAG6A ATOQCTTTGr TTO G TCTGGT GGAGCCAGCC 3420 

AOTGGCACAA TCTTTATTQA TGftGGTGGAT ATCTGGATTC TCA GC TT G QA AGACCTCAQA 3480 

ACXZAAGCTQA CXX5TGATCCC ACAGGATCCT aTCCTGTTTG TAOQTACSIST AAGGTACAAC 3540 

TTGGATCCCT TTGAQAGTCA CACCGATGAO ATGCTCTGGC AGOTTCTGOA GAGAACATTC 3600 

ATGA6AGACA CAATAATGAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660 

GAAAACTTCT CAGTAG6G6A ACGTCAGCTG CTTTGTGTGG CCOQAOCTCT 7CTC0QTAAT 3720 

TCAAAGATCA TTCTOCTTGA TGAA6CCACC GCCTCTATG6 ACTGCAAGAC TGACACCCTG 3780 

GTTCAOAACA CCATCAAAGA TGCCTTCAAO GQCTGCACTG TGCTOAOCAT 06CCCACGQC 3840 

CTCAAC3VCAO TTCTCAACTO CX3ATCA0GTC CTGQTT A TOG AAAATGG6AA GGTQATTGAG 3900 

TTTGACAAGC CT6AAGTCCT TGCAGAGAAO CCAOATTCTQ CATTTGOQAT GTTACTAGCA 3960 
GCAGAAOTGA GATTGTAG 

8eq ID NOi 312 Protein sequence 
Protein Accession «x Eos sequence 
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1 11 21 31 41 51 

111)11 

MV6G6PYLI8 DLDQRGRRRS FABRYDPSUC TMIPVRPCAR lAEKPVDDAG LLSFATFSWL 60 

TPVMVKGYSQ RLTVDTLPPL STYDSSDTNA KRFRVLWDEB VARVGFEKAS LSHWHKFQR 120 

TRVLMDIVAM ILCIIMAAIG PTVLIHQILQ QTERTSQKVW VOIGLCIAIiP ATBPTKVPPW 180 

ALM9AINYRT AZRUCVALST LVPENLVSFX TLTHX8VGEV LNILSSDSYS LFEAALFCPl. 240 

PATZPIUfVP GAAYAFFIU3 PTALtGISVY VIFZFVQMFM AKLNSAPRRS AILVTDKRVQ 300 

•TMNEFLTCIR LIKMYAWEKS PTNTIQDIRR RERKLLBKAG PVQSGNSAIA PIVSTIAIVL 360 

TLSCMILLRR KLTAPVAFSV lAMFNVMKFS lAILPPSlKA MAEANVSLRR MKKILIDKSP 420 

PSYITQPEDP DTVLLLANAT LTKBHBASRR STPKKLQNQK RELCKKQRSE AY8ERSPPAK 4 BO 

6AT6PEEQSD SLKSVLKSIS PWRiCLCRyP BAQLLAMRVfP AVFVGRIIRG YRFKOFSAKD 540 

KDESRRLLTW PQBVDRTQRA AKYLGKILOI CGNVGSGKSS LLAAIiLGQMQ LQKGWAVN6 600 

TLAYVSQQAW IPHGNVRENI IjPGEKYDHQR YQHTVRVCGL QKDLSNLPYQ DLTBIGKRGL 660 

NLSGOQRQRI SIARAVYSDR QLYLLDDPLS AVDAHVGKHV FBBCIKKTLR GKTWLVTHQ 720 

LQPLBSaSEV ZLLEDGBICE KOTHKELMEB RQRYAKLZBN IiRQLQFKDPB HLYNAAHVEA 780 

FKESPAEREB OlAOIZOYLLS LFTVFZiFXiLM IGSAAPSNHH liGLSniDROSR MTOQPQGNRT 840 

HCBVGAVLAD ZGQRVYQHVY TASMVFMLVF 0VTK3FVFTK TTLHA888LH DTVFOKZLKS 900 

PriSFFDTTPT GRLMNRPSKD MDELDVRLPP HAENFLQQFF MWPILVIIA AVFPAVLLW 960 

ASIjAVGPFIL LRIFHRGVQE LKKVENVSRS PWPTHITSSM QQLOIIHAYG KKESCITYT8 1020 

SKGIiSLSYII QLSGIiLQVCV RTGTETQAKP TSVELLREYI 8TCVPBCTHP LKVGTCPIOJW 1080 

PSOSBITFRD YQMRYRDNTP LVLDSLNLNZ QS6QTV0XV0 RTGSGXSSLO MALFRLVBPA 1140 

8GTZFXDEVD ZCZLSZ^DLR TXLTVZFQDP VLFVGTVRYN liDPFBSHTDE MLfiTQVLERTP 1200 

MRDTIKXLPB KLQAEVTBNG EaVFSVGSRQL LCVARALUtN SKZZLU>EAT ASMDSRTDTL 1260 

VQNTIKDAFK GC1VI.TZAHR LNTVLNCDHV LVMENGKVIB FDKFEVLAEK PDSAFAKLLA 1320 
AEVRL 

Seq ZD NO: 313 DNA sequence 
Nucleic Acid Accession #i Z31560 
Coding sequence: 1-966 

1 11 21 31 41 51 

I I I I 1 I 

CACAOCGCCC GCATGTACAA CATQATGGAG AC6QAGCTGA AQCOOCCGGG CCCGCAGCAA 60 

ACTTOGGGGG GCGGOCSGCGG CAACTCCACC GCGGOGGCGG COGGOQQCAA CCAQAAAAAC 120 

AGCC0G6ACC GCQTCAA6CQ GCCCATGAAT GCCTTCATGG TGTGGTCCCX3 OGGGCAGCGG 180 

OGCAAQATGG CCCAOGAGAA 00CCAA6ATG CACAACT0G6 AGATCAGCAA GOGCCTGGOC 240 

GCCGAGTGGA AACTTTT G TC GGAGAOGGAG AAG0QQCC6T TCATGGAOGA GGCIAAGOGG 300 

CTGCX3AGCX3C TGCACATGAA GGAGCACCCG GATTATAAAT ACOGGCCCCG QOGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTOCCOQGOQ GQCTOCTGGC CCCCX5G0GGC 420 

AATAGCATGO OGAGOGGGOT OGGGGTGGGC GC0GGCCTG6 GOQ0GGGO6T GAACCAGOGC 480 

AT6GACAGTT AGQ06CACAT GAAOSGCTGa AGCAAOQGCA GCTACAGCAT GATGCAGGAC 540 

CA6CTGG6CT ACC06CA8CA CXZOGGGCCTC AATG06CACX3 G0QCA80GCA GATGCAGGCC 600 

ATGCACOGCT AOGACGTGAG CGCCCTGCAO TACAACTCCA TGACCA6CTC GCAQACCTAC 660 

ATGAACGQCT OGCCCACCTA CAGCATGTCC TACTCX3CAGC AGGGCACCCC TGGCATGGCT 720 

CTTGOCTCCA TCGGTTCGQT GGTCAAGTCC GABGCCAGCT CCAGCCCCCC TGTGQTTACC 780 

TCTTOCrCCC ACrCCAGGGC GCCCTOCCAO OGGOGGGACC TCOGGGACAT GATCAGGATQ 840 

TATCTCCXXa GCGCOGAGGT GCXXK3AACCC GCOGCCCCCA GCAGACTTCR CATGTCCCAG 900 

CACTACCAGA GCX5GCCCGGT GCCCGGCAC30 GCCATTAACG GCACACTOCC CCTCTCACAC 960 

ATGTGAGGGC CJQOACAGCGA ACTQQAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGACSGGGT GCAAAAGAG6 AGAGTAAGAA ACAGCATGOA GAAAACCOGG TAOOCTCAAA 1080 
AAAAA 

8eq ZD NOi 314 Protein sequence 
Protein Accession it CAA83435 

1 11 21 31 41 51 

11)111 

HSARMYNMME TELKPPGFQQ TSGGGGGNST AAAAGGKQKN SPDRVXRFMN AFMVWSRGQR 60 

RKMAQQ7PXM ENSEISKRLG AEWKLIiSETE KRPFIDBAKR LRALHMKEHF OYKyRPHRKT 120 

RTIi»1KKD]CyT LPGGIiIiAPGG NSKASGV6VG AGLGAGVNQR MDSYABMNGH SNGSYSMMQD 180 

QLGYPQHPOI. NARGAAlQKQP MRRYDVSALQ YNSMTSSQTY MNOSPTYSMS YSQQGTPGMA 240 

LGSHGSWKS EASSSPFWT 8SSHSRAF0Q AGOLRDMZSH YLPGABVPBP AAFSRIiHMSQ 300 
HYQSGPVPGT AINGTLPLSH M 

Seq ID NO: 315 DNA sequence 
Nucleic Acid Accession ftt U91618 
Coding sequence i 29.. 541 

1 11 21 31 41 51 

I I t I I I 

CSGACTTGQC TTGTTAOAAO GCTQAAAOAT GATGQOVG6A ATGAAAATCC AGCTTGTATG 60 

CATGCTACTC CTGGCTTTCA GCTCCTGGAO TCTOTGCTCA GATTCAGAAO AGOAAATGAA 120 

AGGATTAOAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAOCACATGT 180 

TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 

A6CT0AGGAA ACA6QAQAA0 TTCATGAAQA GGAGCTTGTT GCftAGAAGGA AACTTCCTAC 300 

TGCTTTAOAT OOCTTTAOCI TOOAACCAAT 6TTQACAATA TAOCAGCTCC ACAAAATCTG 360 

TCACM3CAGG OCTTTTCRAC ACTGGGAOTT AATCCAGGAA GATATTCTTQ ATACT6GAAA 420 

TGACAAAAAT GGAAAGGAAG AAOTCATAAA GAGAAAAATT OCTTATATTC TGAAAOGGCA 480 

OCTQTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AQA GATT CTT ACTATTACTG 540 

AOAGAATAAA TCATTTATTT ACATGTOATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600 

ATTATATTTG TGTSAAAATO TGACAAACAC ACTTATCTOT CTCTTCTACA ATTGTGGTTT 660 

ATTQAATGTG TTTTTCTOCA CTAATAOAAA TTAORCTAAG TGTTTTCAAA TAAATCTAAA 720 
TCTTCAAAAA AAAAAAAAAA AAATGGG6CC GCAATT 



1073 



wo 03/042661 



8eq ID NOt 316 Protein sequence 
Protein Accession 6: AABS0564 

1 11 21 31 41 51 

i I I 1 I I 

KNAGMKIQLV OILLIAPSSW SLCSDSEBEH KALEADFLTN HHTSKISKAH VPSHKMTLLN 60 

VCSIiVMNLNS PABBTGEVKE EELVARRKLP TALDGFSLBA MLTZYQUIRX CHSIIAFQHHE 120 
liIQEDILDTO NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 

Seq ID NO: 317 DtiA sequence 

Nucleic Acid Accession ft: MM_006S36.2 

Coding sequencei 109.. 2940 

1 11 21 31 41 51 

j i t I I I 

ACCTAAAACC TTGCAA6TTC AG6AA6AAAC CKtCTGCKSC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTCIAOTGAAC TGCBIGGCTTC TCTACAACAT GACCCAAAGO 120 

AGCATTGCAS QTCCTATTTO CAACCTGAAG TTTGTGACTC TCCTGOTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGOAGC TGOAOTACAa CTTCAAQACA ATGGGTATAA TGGATTGCTC 240 

ATTCCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGQAAATO 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT 6CTACCAAOA GAAOAOTATT TTTCRGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAOAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTOQTATG GOGCACATGG AGATGATCCA 480 

TACACOCTAC AATACAGAGG GTGTGGAAAA GAG6GAAAAT ACATTCATTT CACACCTAAT S40 

TTOCTACTGA ATGATAACTT AACAGCTGGC TACGQATCAC GA06CX3GAOT tfi llWt .'CAT 600 

GAATGGQCCC ACCTCCGTTG GOGTGTGTTC OATQAGTATA ACAATQACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATGAG AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTQ CCCCCMOfA AACTGTATTA TTAQTAAGCT TTTTAAAGAA 780 

GGATOCS^CCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 640 

AGTTTATCTT CTGTGQTTGA ATTTTGTAAT GCAAOTACCC ACAACCAAGA AGCACC AAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCIGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTC6CTT 1020 

GTACAGGCTG GT6ACMAOT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GA TGGCA GAO 1080 

QCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTQAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCRGCTA 1200 

CACCAAATTA ACACCAATGA TGATCGAAAO TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

TGAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTQA GGTGGTTGAA 1320 

AAACTOAATO QAAAAOCTTA TGGCTCTGTG ATGATATTAG TGAOCAGCGO AGATGATAAG 1380 

CTTCTTGGCA ATTGCITACC CACTGTGCTC AGCAOTGGTT C3VACAATTCA CT CCATT GCC 1440 

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGQTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTGC 1560 

TCTGGAACTQ GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAQTACAGQ TGAAAATOTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGOATA ATACTOTGGG CAAC36ACACT 1680 

AT G TTTCT A G TTACGTGGCA G6CCAOTGGT CCTCCT6AGA TTATATTATT TGATCCTGAT 1740 

GGAOQAAAAT ACTACACAAA TAATTTTATC ACX»ATCTAA CTTTTOGGAC AGCTAGTCTT 1800 

TGGATTCCAQ GAACAGCTAA GCCTGGQCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CXXTOAAAOT GACAGTGACC TCTOGCGCCT OCAACTCAGC TGTQCCCCCA 1920 

GCO^CTGTGG AAGGCTTTQT GGAAAGAOAC AOOCTOCATT TTCCTCATCC T6TGATGATT 1980 

TATGGGAATO TOAAACAGOO ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTG6AGATCC TGTTACGCTQ AGACTCCTTG ATGATGQAGC AGGTGCTGAT 2100 

GTTATAAAAA ATOATGGAAT TTACTCGAGG TATTTTTTCT CCTTTQCTGC AAMGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220 

OCAGGGAGTC ATGCTATOTA TOTAGCAGOT TACACAGCAA AOGGTAATAT TCAGATGAAT 2280 

OCTCCAAOGA AATCAGTA06 CAGAAAT6AG GAGQAGOGAA AGTGGGGCTT TAGCCG AGTC 2340 

A6CTCAQGAG GCTCCTTTTC AGT6CTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAQ AAQAGGAATT GACCCTATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATC3^ GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AOTAAAAfiTC TACAGAATAT CCAAGATGAC TTTAACAATQ CTATT TTAGT AAATACATCA 2580 

AAGOGAAATC CTCA6CAAGC TGGGATCAGO GAGATATTTA 0C7TTCTCACC CXAGATTTCC 2640 

AOGAATQGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700 

GCAATAOGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCOOCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TAIATTGAAA 2820 

GQAGTTTTAA CAGCAATOGG TTTGATAGGA ATCATTT6CC TTATTATAGT TQTGACACAT 2880 

CATACTTTAA 6CAGGAAAAA GAGAGCAGAC AAOAAAGAGA ATGGAACAAA ATTATTATAA 2940 

ATAAATATOC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTAC AAAAA 3000 

CATACTAACA AAGTCAAATT AACATCRAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA OATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAO AT TAQA AAAC 3120 

CCTTAGACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAA6 3180 

GCAAA6GGAA GGGTAAAOTC GGACCAGTOT CAAGGAAAGT TTGTTTTATT GMG TGGAAA 3240 

AATAGCCCCA AGCAGAQAAA AGQAGGGTAO GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACnTQATTA ATTTTTCTTT TCTCCTTATC TGTGCRGTAC AGOTTGCTTG 3360 

TTTACATQAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATXJCSUVAO Cifi TlACCT 3420 

CTTGCTATTT TGTTATATAT ATTTGAGATG ACATCTCCCT GCTAATG CTC AGAGA TCTTT 3480 

TTTCACrOTA A6A06TAACC TTTAACAATA TQGGTATTAC CTTTGTCTCT TCATAGOGGT 3540 
TTTATGACAA AGGTCTATTO AATTTATTTO TWTGTAAGTT TCTACTGCCA TOAAGCAGC 3600 
TTTCTAAGTT TATTGOCTTO GGTTATTATO GAAT6ATA6T TATAOCCCCN TATAATGOCT 3660 
TACCTAGGAA A 

Seq ID KO: 318 Protein sequence 
Protein Accession fit NPj006527.1 

1 11 21 31 41 51 

I I I I I I 

MTQRSIAGPI CNUCFVTUfV ALS8BLPFU3 AGVQLQDNGY NGXiLIAZNPQ VPEKQHLI8N 60 
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PCTAJS02/36810 



ZKEMITBASF YI»FNATKRRV PPHNIKILIP ATWKANNN8K ZRQBSYSKAN VIVTDMYGAH 120 

GDDVrCUm GOGKBGKYIH PTPNFmiDll LTAOYGSROR VFVHEWAHLR WGVFDEYNND 180 

KPFYIKGQNQ IKVTTRCSSDI TGIPVCEKGP CPQEKCII8K LFKEGCTPIY NSTQNATASI 240 

MFMQSIiSSW EFCMASTHNQ ERPNLQNQMC SLRSAWDVIT DSADPHHSPP MNGTELPPPP 300 

5 TFSLVQAGDK WCLVLDVSS KMABADRLLQ LQQAAEPVU4 QIVBIHTFVQ lASFDSKGBI 360 

RAQU{QIHSN DDRKLLVSYI* FrTVSAKTDI SICS6LKKGF EWHKU36KA YGSVMZLVT8 420 

GDDKLLGNCL PTVLSSOSTI HSXALGSSAA RWLBELSRLT GGLKPFVPDI SNSNSMIOAP 480 

SRISSC5TGDI FQQHIQLEST GQIVKPHHQL KNTVTVDNTV GWnMPLVTW QASGPPEIIL 540 

PDPDGRKYyT NNPITNLTFR TASLWIPGTA KPGHWTYTIiN NTHHSLQALK VTVTSRASNS 600 

10 AVPPATVBAF VERDSUiFPH PVMIYANVKQ GFyPXLNATV TATVEPBXiSD PVTLRLLDDG 660 

AGADVIKNDO ZYSRYPFSPA ANGRYSXiKVH VNHSPSISTP AH8ZF6SHAN YVPGYTANGN 720 

IQMNAPRKSV GRNEEBRKWG FSRVSSGGSF SVL6VPA0PB PDVFPPCKXl DLEAVKyBBB 780 

LTLSWTAPGE DPDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIRBXFTFS 840 

PQISTNGPEH QPNGBTHBSH RIYVAIRAMD RNSLQSAVSN lAQAPLFXPP NSDPVPARDY 900 

IS Z«ZIiKGVLTAM GLXGIICLII WTHHTLSRK KRADKKEKGT KLL 

Seq ID NOi 319 PNA sequence 

Nucleic Acid Accession #t nm_000228.i 

Coding sequences 82.. 3 600 

20 

1 11 21 31 41 51 

I I I i I I 

QCTTTCAGGC GATCTGGAGA AAQAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60 

GGATCACCCC ATTGOCTGAA GATGAGACCA TTCTTCXrrCT TGTGTTTTGC CCTGCCTGGC 120 

25 CTCCTGCATG CCCAACAAGC CTGCTCCOGT GGGQCCTOCT ATCCACCTOT TGGG GACCT G ISO 

CTTGTTGGGA GGAOCGCSGTT TCTCOQAGCT TCATCTAOCT GTGQ ACTGAC CAAGC CTGAO 240 

ACCTACTGOl CCCiAGTATGQ OGAGTGGCaWS ATQAAAT6CT GCAAQTOTOA CTCCAGGCAO 300 

CCTCACAACT ACTACAGTCA CXS3A3TAGAG AATGTGQCTT CATCCTC06G CCCCRTGCGC 360 

TGGTGGCAOT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

30 TTCCAGCTTC AAGAAOTCAT GATGGAGTTC CAGGGGCCCA TGCCGGCOGG CATGCTGATT 480 

GAG06CTCCT CAGACTTOGO TAAGA0CTG6 C6A6T6TACC AGTACdXKK TGCOOACTGC 540 

ACCTCCaCCT TCCCTOGGGT CXX5CCAGGGT CGGCCTCAOA GCTGGCAGGA TCTTCGGTGC 600 

CS^GTCOCTGC CTCAGAGQCC TAATGCAOGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660 

AXGGATTTAG TOTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 720 

35 ATCACAAACT TGAGAGTCAA TTTCACCAGO CTGGOCCCTG TGCCCCAAAG GGGCTACCAC 780 

CCTCCCAGOQ CCTACTATGC TGTGTCCCAO CTCOGTCTGC AGGGOAQCTO CTTCTQTCAC 840 

GGCCATGCTG ATCGCTGOGC ACCCRAGCCT GGQGCCTCTG CAGGCOOCTC CACOGCTGTG 900 

CAGGTCCACQ ATGTCTOTGT CTGCCAGCAC AACACTGCOB GCCCAAATTG TGAGCGCTGT 960 

QCACCCTTCT ACAACAACCG GCCCTGGAGA CCXSGCGGAGG GCCAGGAOGC CCATGAATGC 1020 

40 CAAAGGTGCG ACTGCAATGG GCACTGAGAG ACATGTCACT TTQACCCCGC TGTQTTTQCC 1080 

0CCA6CCAGG GGGCATATGG A G GTGTgTGT QACAATTGGC GGGAOCACAC GGAAOGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACOQGC GCOOGGGAOC TTCCATTCAO 1200 

OAGACCTGCA TCTCCTQCQA GTGTGATCC3G GATGGGGCAG TGCCAGGGOC TCCCTGTGAC 1260 

CCAQTGACCG GGCAGTGTQT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320 

45 AAGCCQGGCT TCACTGGACT CACCTACGCX: AACCCGCAGG GCTGCXIAOCG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGC06 TGTOAOQAGO AGAOTOGOOG CTGCCTTTGT 1440 

CTGCCCAAOG TGGTGG6T0C CAAATGTOAC CAOTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AOTGGCCAGO GCTGT6AACC GTGTGCCTGC GACCOQCACA ACTCCCXTTCA GCXXACAGTG 1560 

GAACCAOTTC ACA6GGCAGT GCCCTOTOGO GAAGGCTTTG GTGGCCT6AT GTGCAGCX5CT 1620 

50 GCAGCCATCC GCCAGTGTCC AGACCX3GACC TATGGAGAOG TGGCXrACAOO ATQCOGftgCC 1680 

TGTGACTGTG ATTTCCGGGG AACaWSAGGGC CCX3GGCT60G ACAAG6CATC AG6G0GCTGC 1740 

CTCTGCCGCC CTCGCTTGAC aSGQCCCOSC TGTQACCAGT GCCA60GAGQ CTACTGCAAT 1800 

0GCTACC066 TCTG0GTG6C CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860 

GAGCAGGCCC TGCGCTTTGG TAGACTCOQC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 1920 

55 GGQCTOGAGQ ACXGTGGCCT GGCCTCXXX3G ATCXTTAGATG CAAAGAOTAA GATTGAGCRG 1980 

ATCCX3AGCAG TTCTCAGCAG CCCOGCAGTC ACAGAGCA66 AGGTQGCTCA GaTGaCGAOT 2040 

OCCATOCTCT CCCTCAGQOG AACTCTCCAG GGCCT6CA6C TGGATCTGCC CCTGGAGGAG 2100 

GAQAOGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTQACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAaaAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 2220 

OO GCCTTCCGGA TCCTGAGCAC AQCCTACGAG CAGTCAGCCC AGGCIGCTCA GCAGGTCTCC 2280 

GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCX3GA OAGAGGCAQA GAGGCTGOTG 2340 

CGGCABGCX3G GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTOQCCCT GftGGCTGGAG 2400 

ATGTCTTCXrr TGCXTTGACCT OACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGOCTTGCA CCCCAATATC ATGCCCTGGT QAGCTATGTC CCC3«0ACAA TGGCACaGCC 2S20 

65 TGTGGCTCX:C GCTGCAGGGG TOTCCTTCCC AGGGCOGGTG GGGCCTTCTT GATGGOGGGG 2580 

CAGGTGGCTG AGCA6CTGC6 GGGCTTCAAT GCCCAGCTCC AGOGQACCAG GCAGATGATT 2640 

AGGGCAGCCG AGGAATCTGC CTCACAOATT CAATCCAOTO CCCAGCGCTT GQA6ACCCAG 2700 

OTGAGOSCCA GCOQCTCCJCA GATGGAGGAA GATGTCAGAC GCACAOGGCT CCTAATCCAO 2760 

CAGGTCOGGG ACTTCCTAAC AGACCCOQAC ACTGATGCAQ CCACTATCCA GGAGGTCAGC 2820 

70 GAGGCOGTGC TGGCCXnXJTQ GCTGCCCACA GACTCAGCTA CT GTTCTGCA GAAOATOAAT 2880 

GAGATCCAGQ CCATTGCAGC CAGGCTCXXC AACQTGQACT TGGTGCXGTC CCAGACCSkAG 2940 

CAGGACATTQ 06GGTGCCCG COCGTTGCAQ GCTQAGGCTG AGGAAGOCAO GAGCGQA6CC 3000 

CATGCAGTGG AOGGCCAGGT GGAAOATGTG GTTGGGAACC TGCGQCAGGG 6ACAGTGGCA 3060 
CTGCAOGAAG CTCAGGACAC CATGCAAGGC ACCAGCOGCT CCCTTCGGCT TATCCAGGAC 3120 
75 AGGOTTGCTG AGGTTCAQCA GGTACTGCGG CCAGCAGAAA AGCTQGTQAC AAGCAT GACC 3180 
AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GOCAOCAAGC COGGCAGCAG 3240 
66GGCAGAGG CAGTCCAGGC CCAGCAGCTT GOGGAAGOTG GCAG06AGCA GGC ATTG AGT 3300 
QGCCAA6AS0 GATTTGAOAG AATAAAACAA AAGTATGCTG A6TTQAAGGA COGGTTGGGT 3360 
CAGA6TT0CA T G CT G GGTGA GCAGGGTGCC OGGATCCAGA GTOTGAAQAC AGAGGCAGAO 3420 
80 GAGCTOTTTG GGGAQACCAT GGAGATGATG GACAGGATQA AAGACATG6A .GTTGQAGCTO 3480 
CTGCGGGGCA GCCAGGCCAT CAT6CTG06C TOGGOGGAOC T6ACA0GACT GGAGAAGGGT 3540 
GTGGA6CAOA TCOGTOACCA CATCAATGGG CGOGTBCTCT ACTATQCCA C CTQCAAGtqA 3600 
TCCTACAGCT TCCAGCCCGT T6CCCCACTC ATCTGCCGCC TTTOCTTTTO GTTGGG6QCA 3660 
GATTGGGTIG GAATOCTTTC CATCTCCAGO AGACTTTCAT OCAGCCTAAA GTACAGCCIG 3730 
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GACCACOCCT GG T G TGT A GC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAiGCCAAT 3780 
GG6ACAGTTA CACTTGACAO ACAAAOATQO T6GAGATTG0 CATG CXZATTG AAA CTAAGAO 3840 
CTCTCAAGTC AAOOAAOCTG GGCT G G G CAQ TATCCCCO G C CTTTAGTTCT CCACTOOGQA 3900 
GGAATCCTGG AGCAAGCACA AAAACTTAAC AAAAOTGATO TAAAAATGAA AAGOCAAATA 3960 
AAAATCTTTG G 

Seq ID NOi 320 Protein sequence 
Protein Accession «: NP_0002l9.l 

1 11 31 31 41 SI 

I I I I I I 

MRPFFUjCFA LPGLLHAQQA CSROACYPFV GDLLVGRTRP LRAS8T0GLT KPBTYCTQYG 60 

EWQMKCCKCD SRQPHNYYSH RVENVA8SS6 PMRHHQSQND VNPVSLQU)L DRRFQLQEVM 120 

MEFQGPMPAG MLIERSSDFG KTWRVYQYLA ADCTSTPPRV RQGRPQSWQD VRCQSLPQRP 180 

NARIiNOGlCVQ UILMDLVSGI PATQSQKIQB VGBITNLRVN PTRIiAPVPQR GYKPPSAYYA 240 

VSQLRIiQGSC FCSGHADRCA FKPGASAOPS TAVQVHDVCV OQHNTAGPNC ERCAPPYNNR 300 

PWRPABGQDA KBCQRCDQIO HS£TCKFDPA VFAASQGAYG GVOJNCROHT EGXNCEROQL 360 

HYFRNRRPGA SIQBTCISCB aJPDOAVPGA PCaJPVTQQCV CKKHVQGBRC DI.CKPGPTGL 420 

TYANPQGCHR CDCNILGSRR DMPCDBBSGR CLCLPNWGP KCDQCAPYHH KLASGQGCBP 480 

CACDPKNSPQ PTVQPVKRAV PCREGFGGLM CSAAAIRQCP DRTYGDVATG CRAGDCSFRG 540 

TBOPGCDKAS GRCLCRPQlfT QPRCaX}GQRG YCNRYPVCVA CTPCPOTYDA DXiRBQALRFO 600 

RLRNATASLH SOPGLEDRGL ASRZLDAKSR lEQIRAVLSS PAVTEQBVAQ VASAXLSLRR 660 

TLQGLQLDLP LEEETLSLPR DLBSLDRSEN GLLTMYQRKR EQPBKISSAD PSGAPRMI.ST 720 

AYEQSAQAAQ QVSDSSRLLD QLRDSRREAE RLVRQA6GG0 GTGSPKLVAL RLEMSSLPDL 780 

TPTFNKLOGN SRQMACTPI8 CPGELCPQDN GTAOGSRCRO VLPRAG6AFL MAGQVAEQXiR 840 

GmAQliQRTR QMIRAAEBSA SQIQS8AQRL BTQV&A8R8Q MBEDVRRTRL XilQQVRDPLT 900 

DPDTDAATIQ BV8BAVLALH LPTDSATVLQ KMNEIQAZAA RIiPNVDLVLS QTKQDXARAR 960 

RLQAEAEEAR SRAHAVEGQV EDWGNIiRQG TVALQEAQDT KQGTSR8LRL IQDRVAEVQQ 1020 

VLRPABKLVT SMTKQLGDFW TRMEELRHQA RQQQAEAVQA QQLAEOASEQ ALSAQEGPER 1080 

IKQKYAELKD RLGQSSML6G QGARIQSVKT EABELFGBTM EMMDRMKOMB LELLRGSQAI 1140 
MLRSADIiTGL EXRVEQIRDH XHGRVLYYAT CK 

Seq 10 NO: 321 DNA sequence 

Kucleic Acid Accession ft: NM_001944.1 

Coding sequence: 84.. 3083 

1 11 21 31 41 51 

) 1 i I I I 

TTTTCTTAGA CATTAACTGC AGAOSGCTGG CAOQATAGAA QCAGOQGCTC ACTTGGACTT 60 

TTTCACCAGQ OAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

OCATCTTOQT GGTGGTCATA TT6GTTGATG GAGAATTOOQ AATAGAGACT AAAG3TCAAT 160 

AT6AT6AAQA AOAGATGACT ATGCAACAAO CTAAAA6AAG 6CAAAAA08T GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAO ATAACTCAAA AAGAAACCCA ATTGCCAAOA 300 

TTACTTCAOA TTACCRAGCA ACCCAOAAAA TCACCTACOG AATCTCTGGA GTGGGAATCG 360 

ATCAQCCGCC TPTTGaAATC TTTGTXGTTO ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTOGA C0GGGA6QAA ACTGCAAGCT TCGTOATCAC ATOTG6G0CT CTAAATGCCC 480 

AAGGACTAGA TOTAOAOAAA CCACTTATAC TAAC960TTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TOGGTGAAAT TGAAGAAAAT AOTGCCTCAA 600 

ACTCACTGGT QATOATACTA AATGCCACAG ATOCAQATQA ACCAAACCAC TTGAATTCTA 660 

AAATT6CCTT CAAAATTGTC TCTCAGGAAC CA8CAGGCAC ACCCATGTTC CTCCTAAOCA 720 

GAAACACJOa GGAAGTGOQT ACTTTGACCA ATTCTCTTGA COQAGAGCAA GCTAGGAQCT 780 

AT00TCTG6T TGTGUUSTGQT GCAGACAAAG ATGOAGAAGG ACTATCAACT CAATGTQAAT 840 

OTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCX»AT GTTTAGAOAC TCTCAGTATT 900 

CAGCAOGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TOGATTTCAA QTAACAGATT 960 

TG6ATGAA0A OTACACAGAT AATTGGCTTG CAOTATATTT CTTTAGCTCT G6QAATGAA6 1020 

OAAATTGOTT TQAAATACAA ACTGATGCTA GAACTAAT6A AG6CATCCX0 AAAGTGGTGA 1080 

AG6CTCTAGA TTATGAACAA CTACAAAGOG T6AAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAOTT ATCTCTOQAT ACOOAOTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TOCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT G6GAACATAT CAAG CCATO G 1320 

ATOAGQACAC TAACAAAGCT 6CCTCAAATG TCAAATATGT GATCGOAGGT AAGGATGSTO 1380 

QATACCTAAT GATTGATTCA AAAAClGCTG AAATCAAATT TOTCAAAAAT ATGAACOSAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTQAGOT TCTGGCCATA GATQAATACA 1500 

OGGGTAAAAC TTCTACAQGC AOGGTATATO TTAGAGTACC CQATTTCAAT GACAATTGTC 1560 

CAACAGCTQT CCTOOAAAAA GATOCAOTTT GCAGTTCTTC AOCTTCCGTG GTTGTCTCOG 1620 

CTAGAAGACT GAATAATAGA TACACTGGCC CCTATACATT TGdACTGGAA OATCAAOCTO 1680 

TAAAGTTGC3C TGC06TATG0 AGTATCACAA CCCTCAATGC TACCTOGGCC CTCCICA6AG 1740 

CCCAGOAACA QATACCTCCT GGAGTATACC ACATCTCCCT GOTACTTACA GACAGTCRGA 1800 

ACAATCX5GTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTOTCAGTOT GACAACAOaQ 1860 

OCATCTOTGQ AACTTCTTAC CCAACCAC3UV 6CCCTGGGAC CAGGTATGGC AGG0C3QCACT 1920 

CAGGGACGCT GG6G0CTGOC 6CCAT0G0OC IGCTGCTCCT TGOTCTCCTG CTGCTGCTOT 1980 

TGGCXXXrCCT TCTGCTGTTG ACCIGTOACT GT6G00CAGG TTCTACTGGG GGA6TQACAQ 2040 

GTGGTTTTAT CCX»GTTCCr GATGGCTCAG AAGQAACAAT TCATCAOTGG GGAATTOAAO 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

QAGCXX3ATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCA CAGO GG 2220 

TGGAAG6CAC TTCAGGAATG 6AAAT6ACCA CTAA6CTT0G AGCAOCCACT GAATCTQQAO 2260 

OTGCTOCAOG CTTTGCAACA GGGACAQTGT GAG6A0CTGC TTCAG6ATTC GGAGCAGGCA 2340 

CTG6A6TTGG CATCTOTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAOGAACCAA TAAGGACTAC OCTGATGGGG CGATAAGCAT QAATTTTCTG QACTCCTACT 2460 

TTTCTCAGAA A6CATTTGCC T6T6CXKAGG AAGAGGATGO CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TQATAATOAA GGCGCAGATG CCACTQGTTC TOCTGTGOGC TCOGTOGGTT 2580 

GTTGCAOTTT TATTGCTGAT GACCTGGAT6 ACAGCTTCIT GGACTCACTT GGACXXJUVAT 2640 

TTAAAAAACT TGCAQAQAXA AOGCTTGGTG TTOATGGTGA AGGCAAAGAA QTTCAGCCAC 2700 
CCTCTAAAGA CRGCXSOTTAT GGOATTQAAT CCTGTGGGCA TCCCATAGAA GTCCAGCAOA 2760 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CW3GATTTGT 
CTGGGTCTGT 
TAAOGGAGAC 
CACTTCrCAC 
CTGGCAACCT 
ATCCTTGCTC 
ATCTTTQGAC 
TG6CACTTAT 
TACCCCAAAA 
TCTTAAAGTT 



TAAGTGCCAQ 
CCAGCCAQCT 
TTACTOGGCT 
ACAAAATQTG 
AGCTGGCCCA 
(XXSTCTAATA 
TAAAOTATTC 
TAGCTTCTCT 
GCAATATGTT 
TTTCAAAACC 



ACTTTGTCAO 
GTTTCCATCC 

TCTGGTTCCC 
ATAGTGACAG 
ACX3CA6CTAC 
TGACCAGAAT 
AAAATA6CAT 
CATAAACTGA 
GTCACTCCTA 
CTAAAATCAT 



QAAGTCAAGO 
CTQACCCTCT 
TOQTGCAACC 
AAAGGGTGAT 
CSA66GTCACA 
GAGCTG6AAT 
AGCAAAGCTC 
TCACX5ATTAT 
ATTCTCAAGT 
ATTOGC 



AOCTTCTOCT 
GCAGCAT6GT 
TTCCACTQCA 
CTOTCCCATT 
TACTATGCrC 
ACCACACTQA 
ACTGTATTG6 
AAATTAAATG 
ACTATTCAAA 



r r oTOO GO CT 

AACTATTTAG 
GGCTTTQATC 
TCCAGTGTTC 
T6TACAGA0G 
CCAAATCT60 
GCTAATAATT 
TTTGQGTTCA 
TTGTACSTAAA 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 



Seq ID NO: 322 Protein sequence 
Protein Accession fii NP.001935.1 



MMGLFPRTTG 
GEDHSKRNPZ 
PSFLXTCRAL 
ATDAOEFNHL 
DKDGEGLSTO 
WLAVYFPTSG 
SRYRVQSTPV 
SNVKYVMORN 
VYVRVPDPND 
ITTIiNATSAL 
TT8PGTRY6R 
GSSGTIBQWQ 
KTTKLGAATB 
DGAISMKPLD 
LDDSFLDSLG 
LS0SQGASAI4 
VTBRVXCPZS 



11 
I 

ALAIPVWIL 
AKITSDYQAT 
HAQ6LDVBKP 
NSKIAFKIVS 
CECNIKVKDV 
NEGNHFEIQT 
TIQVINVREO 
DGOYLMIDSK 
NCFTAVLEKD 
LRAQEQIPPG 
PHSGRLOPAA 



SGGAAGFATG 
SYPSQKAPAC 
PKFKKLAEIS 
SASGSVQPAV 
5VP6HLAGPT 



2X 

I 

VHGSLRZErTK 
QKZTYRI6GV 
LILTVKXLDI 
QEPAGTPMFIi 
NDNFPMFRDS 
DPRTNEQILK 
lAPRPASKTP 
TAEZKFVKMM 
AVCSSSPSW 
VYHISLVLTD 
lOLLLLGLLL 
ITHICVPPVT 
TVSQAASGFG 
AEEDDGQEAN 
LGVDGSGKEV 
SIPDPLQKGM 
QLRGSHTMIiC 



31 
I 

GQYDEEEMTM 
GIDQPPFOIF 
NDNPPVFSQQ 
LSRNT6GVRT 
QYSARIEENI 
WXALDYEQli 
TVQKGISSKK 
NRDSTFIVNK 
VSARTLNNRY 
SQNNRCBMPR 
LLLAPLLLLT 
ANGADFMSSS 
AATGVGIC6S 
DCLLIYDNEG 
QPPSKDSGYQ 
YLVTBTYSAS 
TEDPCSRLI 



41 

1 

QQAKRRQKRE 
WDKMTGDIN 
IFM6EIEENS 
LTKSIiDRGQA 
LSSELLRFQV 
Q8VKLSIAVK 
LVDYILGTYQ 
TITAEVLAID 
TGPYTFALED 
SLTLBVGQCD 
CDGGAGST60 
EVCTNTYARG 
GQSGTMRTRH 
ADATGSFVGS 
lESCGHPIEV 
GSLVQPSTAG 



51 
I 

HVKFAKFCRB 
ITAIVDREBT 
ASNSLVMILN 
SSYRLWSGA 
TDLDBBYTDM 
NXAEFHQSVI 
AIDBDTNKAA 
EYTGKTSTOT 
QFVKLPAVHS 
NRGZCQTSYP 
VTQGPIPVPD 
TAVEGTSGME 
STGQTNKDYA 
VGCC8FIADD 
QQTOPVXOQrr 
PDPIiLTQNVI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



Seq ID NOs 323 DNA sequence 

Nucleic Acid Accession Ut XM_058069.2 

coding sequence i 1..1413 



1 

1 

ATGAAGTTTC 
AGCTCTAGAA 
TATG6CCTTG 
AAOGAAAAAA 
ACATCTACCC 
AGGGAAATGC 
TACACACCTO 
T6GA6TAATG 
GTOGr n TT G 
CTAGCCCATG 
GAArrCTGGA 
GGCCATTGCT 
AAATATGTTG 
CTGTATGGAG 
CTCTGTGACC 
TTCAAAOACA 
ATTTCTTCCT 
AGAAATCAAO 
QAGCXAAATT 
GATGCAQCTG 
TGGAGOTATG 
AACTTGCAAG 
TATTTCTTCC 
ACACTGAAAA 



11 

1 

TTCTAATACT 
GCXrrGGAAAA 
A6ATAAACAA 
TCCAAGAAAT 
TGGAQATGAT 
CAGGOGGGCC 
ACATGAACOG 
TTAOCCCCTT 
CC0GTG6AGC 
CTTTTGGACC 
CTACACATTC 
TAG6TCTTGG 
ACATCAACAC 
ACCCAAAAGA 
CCAATTTGA9 
GGTTCTTCTG 
TATG6CCAAC 

ATCCCAAGAO 
TTTTTAACCC 
ATGAAAGGAG 
GAATOGGGCC 
AAG6ATCTAA 
GCAATAOCTO 



21 

1 

GCTCCTGCAG 
AAATAATGTQ 
ACTTCCAGTG 
6CAGCACTTC 
GCACGCACXrr 
OOTATGGAGG 
TGAGGATGTT 
GAAATTCAGC 
TCATGGAGAC 
TGGATCTGGC 
AGGAGGCACA 
CCATTCTAGT 
ATTTGGCCrC 
GAACCAAG6C 
TTTTGATGCT 
GCTGAAGGTT 
CTTGCCATCT 
TAAA6ATQAC 
CATACATTCT 
AaSTTTTTAT 
ACAGATGATO 
TAAAATTGAT 
CGAATTTGAA 
tf m-GQTTGT 



31 

I 

GCCACTGCTT 
CTATTTGGTG 
ACAAAAATGA 
TTGGGTCTGA 
OQATGTQQAG 
AAACATTATA 
GACTA06CAA 
AAGATTAACA 
TTGCATGCTT 
ATTGGAGGGG 
AACTTQTTCC 
GATGCAAAGG 
TCTGCTGAJG 
TTGCCAAATC 
GTCACTACCG 
TCTGAGAGAC 
G6CATT6AAG 
AAATACTGOT 
TTTOOTTTTC 
AGQACCTACT 
OACCCTGGTT 
GCAOTCTTCT 
TATOACTTCC 
TGA 



41 

I 

CTGGAGCTCT 
AAAGATACTT 
AATATAGTGG 
AAGTGACOGG 
TCCCCGATGT 
TCACCTACAG 
TCOGGAAAGC 
CAGGCATGGC 
TTOATGGCAA 
ATGCACATTT 
TCACTGCTGT 
COGTAATGTT 
ACATACX3T6G 
CTGAGAATTC 
TGGGAAATAA 
CAAAGACCAG 
CTGCTTATGA 
TAAT TAOC AA 
CTAACTTTGT 
TCTTTOTAQA 
ATCCCAAACT 
ACrCTAAAAA 
TACTOCAACG 



51 

1 

TCCCCTOAAC 
AGAAAAATTT 
AAACTTAATG 
GCAACTGGAC 
CCATCATTTC 
AATCAATAAT 
TTTCCAAGTA 
TGACATTTTG 
AGGTGGAATC 
GGATGAGGAC 
TCAOGAGATT 
CCCXIACCTAC 
CATTCAGTCC 
AGAACCAGCT 
QATCTTTTTC 
TGTTAATTTA 
AATTGAAGCC 
TTTAAGACCA 
GAAAAAAATT 
TAACCAQTAT 
GATTACCAAO 
CAAATACTAC 
TATCACGAAA 



Seq ID NO: 324 Protein sequence 
Protein Accession #: P39900 



MKFUjILLLQ 
KERIQEMQHP 
YTPDMNREDV 
LAHAFGPGSG 
KYVDINTFRL 
FXDRPFHLKV 
EPNYPKSIHS 
ttPQQZGPKID 



11 

1 

ATASGALPUl 
LGLKVTGQLD 
DYAIRKAFQV 
IGGDAHFDED 
SADDIRGIQ8 
SERPKTSVNL 
FGFFNFVKKI 
AVFYSKNKYY 



21 

I 

SSTSLEKNNV 
TSTLEMMKAP 
HSNVTPliKPS 
BFWTTHSGGT 
LYCa}PKENQR 
ISSLWPTLPS 
DAAVFNPRFY 
YPFQGSNQFB 



31 
1 

LPGERYLEKF 
ROSVPDVaHF 
RIMTGMADXl. 
HLFLTAVHBI 
LPNFDNSBPA 
OIEAAYEIBA 
RTYFFVDNQY 
YDFL1X2RITX 



41 

i 

YGLEINKLPV 
REMFGGPVWR 
WFARGAHGD 
GHSLGLGHSS 
LCDPNLSFDA 
RNQVFLPKDD 
WRYDBRRQMM 
TLKSHSWF6C 



Seq XD NOi 325 DNA sequence 

Nucleic Acid Accession «: NM_024423.1 

Coding sequence: 64.. 2590 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



51 

I 

TKMKYSOTIiM 
KHYITYRINN 
FHAFDGKGGI 
DPKAVMFPTY 
VTTVGMKIPF 
KVWLISNIiRP 
DPGYPKLITK 



60 
120 
180 
240 
300 
360 
420 



11 



21 



31 



41 



51 
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I I I I I I 

GGCA66TCTC GCTCTCGGCA CCCTCOOGGC 6CC0GCGTTC TCCTGGCCCT GCCCGQCATC 
COGATGGCCG CCGCTGGGCC CCOGCGCTCC <5TG0GCS»AG COGTCTGCCT OCATCTGCTG 
CTGACCCTCO TGATCTTCAG TCGTGATGGT QAAGCCTGCA A AAAGG TGAT ACTTAATGTA 
5 CXTTCTAAAC TAGAGGCAtSA CAAAATAATT GGCAQAGTTA ATTTGOAAOA OTGCTTCAGG 
TCIGGAGACC TCATCOGGTC AAGTGATCCT GATTTCAOAG TTCTAAATGA TGGG TCAGTG 
TACACAGOCA GG6CTGTT6C GCTGTCTGAT AAGAAAAGAT CATTTACX^T ATGGCTTTCT 
GACAAAAGQA AACA6ACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 
TOSAAGACAA GACACACTAG AGAAACTGTT CTCAGGOOTG CCAAGAGGAG ATGGGCACCT 
10 ATTCCTTGCT CTATGGAAOA GAATTCCTTG GOCCCTTTCC CATTGTTTCT TCA ACAAO TT 
GAATCTGATG CAGCACASAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 
AAAGAACCTT TAAATTTQIT TTATATAGAA AGAGACACTG 6AAATCTATT TTGCACTOSG 
CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 
GOATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATQA AAAT GACA AC 
15 CACCCTGTTT TCACAGAAOC AATTTATAAT TTTGAAGTTT TGGAAAGTAO TAGACCTGGT 
ACTACROTQO GGGTGGTTTG TGCCACAGAC AQAOATQAAC COaA CACAAT GCAT ACGCGC 
CTGAAATACA GCATTTT6CA GCAGACAGCA AOGTCACCTG GGCTCTTTTC TGTGCATCCC 
AGCACAGGCX3 TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAQGTTOT AGACAAGTAC 
TCATTGATAA TGAAAGTACA AGACATGQAT GaCCAGTTTT TTGGATTOAT AGGCACATCA 
20 ACTTQTATCA TAACA6TAAC AGATTCAAAT GATAATGOVC CCACTTTCAO ACAAAATGCT 
TATOAAGCAT TTGTAGAOGA AAATGCATTC AATOTGGAAA TCTTAC GAAT ACXTATAQAA 
6ATAAGGATT TAATTAACAC T00C3\ATTG0 AGAOTCAATT TTACCATTTT A AAGGQAAAT 
QAAAATGQAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATOAAQGTGT TCTTTCTGTT 
GTAAAGCCAC TGAATTATGA AOAAAACOGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 
25 QAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAQCCT TOAACAGAGC CTTGGTTACA 
GTTCATGTOA GGQATCTGGA IGAGOGGGCT GAATGCACTC CIGCAGCCCA ATATGTGOQO 
ATTAAAOAAA ACTTAGCAGT 6GG6TCAAA0 ATCAAGGGCT ATAAGOCATA TGACCCGQAA 
AATAGAAATG GCAATGQTTT AAGGTACAAA AAATT6CATG ATCCTAAAGO T TGGA TCACC 
ATTQATGAAA TTTCAGOOTC AATCATAACT TCCAAAATCC TGGATAGGQA GGTTOAAACT 
30 CCCAAAAATQ AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGA TCATG T 
ACTGOAACAC TTGCTOTGAA CATTOAAOAT GTAAATGATA AT CCACC AOA AATAC TTCAA 
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACOG ACATTTTAGC TGTTGATCCT 
GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 
AGTAGACTOT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 
35 AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 
GCAACAAAAT TATTQAQAOT TAATCTOTGT GAATGTACTC ATCCRACTCA GTGTCX3T0GG 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGOCAA TO UT'XGCAAT ATTACT6QGT 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTQQ TGCAACTAAA 
GGGAAAOOTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 
40 CCrGGAOACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 
AGCCAAGGTT TTTOTGQTAC TATGGGATCA GQAATGAAAA ATGGAGQGCA GOAAACCATT 
OAAATGATGA AAGGAGGAAA CCA6ACCTTG GAATOCTOCC GGGGGGCTGO GCATCATCAT 
ACCCTGGACT CCTOCAGGGG AGOACACACG GAGGTGQACA ACTGCAGATA CACTTACTCG 
OAOTGGCACA GTTTTACTCA ACCCCGTCTC GGTQAAQAAT CCATTAGAGG ACACACTGGT 
45 TAAAAATTAA ACATAAAAQA AATTGCATCQ ATGTAATCAO AATGAAGACC GCATGCCATC 
CCAAOATTAT GTCCTCACTT ATAACTATGA GGGAAOAGQA TCTCCAOCTG OTTCTGTGGa 
CTGCTGCAGT GAAAAGCAGG AAGAAGATGQ CCTTQACTTT TTAAATAATT TGGAACCGAA 
ATTTATTACA TTAGCAOAAO CATGCACAAA GAGATAATGT CACAOTOCTA CAATTAGGTC 
TTTGTCAjGAC ATTCTO Q AGG TTTCCAAAAA TAATATTOTA AAGTTCAATT TCAACATGTA 
50 TGTATATOAT G AWITIT C TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 
AGCCAOTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATOT TAAA ACAQAC AACTQQTA AA 
TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACO 
OATATTTTAG TAATAAATAT OCPGQATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 
ATATCACATT ATTATGTATT CACTTTAAOT OATAGTTTAA AAAATAAACA AGAAATATTG 
55 AOTATCACTA T6TGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 
ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACIACCAAAT TCATTTGACT 
TTGGAGGCSVA AATGTGTTQA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 
GOAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATQAQ 
AACAAAGAGG AAAATOOTAA AAACTTGAAA TQAGGCTGGQ GTATAGTTTG TCCTACAATA 
60 GAAAAAAGAO AGAGCTTCCT AOGCCTGGGC TCTTAAATOC TOCATTATAA CTGAGTCTAT 
6AGGAAATAG TTCCTGTCXai ATTTQTGTAA TTTGTTTAAA ATTGTA AATA A ATTAA ACTT 
TTCTGOTTTC TGTGGGAA6G AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTT6CAGT 
CTGTTTCAA6 ATTTCTGCAT CCACAAGTTA OTAGCAAACT GGGG AATACT CGCTGCAQCT 
GUGQl'rCOCT OCTTTTTGGT AOCAAGGGTC CAGAGATGAG QTGTTTTTTT C3 GGGGA0CTA 
65 ATAACAAAAA CATTTTAAAA CTTACCTTTA CTQAAGTTAA ATCCTCTATT GCTGTTTCTA 
TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAAOCAT OTOCTOCTAG 
AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAOCACCCT GGGGAGATTG 
ATTOIOCTTA AACCTAASCC CCAC»AACTT GACACCTGAT CAGGTCTGOO AQCTACAAAA 
TTTCATTTTT CTCCTCACTO O XT I'C l ' A ' Cr GAGTGGCATT GGOCTGAATC AA6GAAAGCC 
70 AGGCCTTX3TG G6CCCCCTTC TTTOGGCTTT CTOCTAAAGC AACAC CTOCA GCAGAGAXTC 
CCTTAAGTGA CTCCAQGTTT TCCACCATCC TTCAGCGTQA ATTAArTTTT AATCAGTTTG 
CTTTCTCCAG AGAAATTTTA AAATAATAGA AQAAATAQAA ATTTraAATO TATAAAA6AA 
AAAQATCAAQ TTGTCATTTT AOAACAOAGG GAA CIT T G CG AGAAAQCAGC CCAA6TAGGT 
TATTTOTACA GTCAGAGOGC AACAGGAAGA TOCAGGCCTT CAAGGGCAAG GAGAGGOCAC 
75 AAGQAATATG GGTGGOAGTA AAAGCAACAT OGTCTGCTTC ATACTTTTTC CTAGGCTTGG 
CACTQCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGOI GAGGGATCAG 
CCAACCTCTT CTCTATGGCT CACCTTATTT OOAGTGAGAA ATCA AGGAGA CAGAGCIGAC 
TGCATGATOA GTCTQAAGGC ATTTGCAG6A TGAGCCTGAA CTGQTTOTGC AGAACAAACA 
AOGCaTTCAT GGGAATT6TT OTATrCCTTC TGCAGCCCTC CTT CTGGG CA CTAAG AAGGT 
80 CTATGAATTA AATQCCTATC TAAAATTCTG AT TTATTC CT ACATTTTCTG TTTTCTAATT 
TCACCCTAAA ATCTATGTGT TTTAOACTTA GACTTTTTAT TGCC CCCCOC i»wl i.i.i i,iT 
TTGAGACGGA GTCTOOCTCT 6A0GCACAGG CTGQAGTGCA gTGG CTCOG A TCTCTOCTCa 
CTQAAAOCTC 06CCTCCXX36 GTTCATGCXA n VltJCTQCC TCAGCC TCCT GAOTAGCTGO 
GACTACAG6C GCCCACCACC AOOCXXXSGCT AATTTTTTGT ATTTTTAATA OAGAOGOOOT 
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TTCACTOTGT TAGCCAGGAT GGTCTOOATC TCCTGACCTC GTGATC CGCC TGCCTOCXJCC S040 

TCCCAAAQTG CTGGGATTAC AOGCATGACX: CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTOGTCTTCT TTTAATGTAA TCATTTTQAA CATGTQTGAA AGTT6ATCAT AOQAATTQQA 5160 

TGAATCTTtSA AATACTCAAC CAAAAGAOU3 TC6AGAAGCC A60G66AGAA AGAACTCAGO 5220 

5 GCACAAAATA rPGCTCTGAG AATGCAATTC TCTGTAAaCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AaCCAGTTTT ATCTAACX3GC TACTGAAACA CCCACTGTOT TTTGCTCACT 5340 

CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CITIGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTAT6CCCGA ATTTGTAATT CTTTTCTCTC 5520 

10 AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAA6GIQA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGCTTTCATT TTTCCCCCAG TGAATOATTT AGAAimTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGQA AGCA6CTGTC TAAAATGCAO TGGGGTTTGT TTTGCAATGT 5760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAOTTACT TTGCTTTTAA AQAAACTTGO 5820 

IS CTGCTTAAAA TAAOCAAAAA TIQQATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCT6CTTAAA ATAAGCAAAA ATTGGATGCA TAAAOTAATA 5940 

TTTACAGATG TGGGSAGATO TAATAAAACA ATATTAACTT GQTTTCTTGT TTTTGCTGTA 6000 

TTTAGAOATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GQCTGQCATG 6060 

GAAATAGAAA TACTCAATTA ItaTCm^lT OTATTAATGG GGAATATTTT GQACAATGTT 6120 

20 TCATTATCAA ATTGTOQACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCQG 6240 

GTATTAAAAG TATTA6AAG0 T6QTTATAAT TGCAGAGTAT TCCATGAATA 6TACACT6AC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGG6AAA ACATGAGTTA AAAAGAAAAG 6360 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT QAACTTTAAT 6420 

25 GACAAOATGA TCCAACCATA AAGGTGCTCT GTGCTrCACA GTGAATCTTT TCOCCATGCA 6480 

GGAGTGTGGT CCCCTACAAA OGTTAAQACT GATCATTTCA AAAATCTATT AGCTATATCA 6540 

AAAGCCTTAC ATTTTAATAT AGGTTOAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT QrTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCX3GATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

30 GTTGAGAAGC ATGGACACTA GAGOCaCAAT GCTTGGATAT OAATCCTGGA TCTQTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TA6CTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GG0GTT6TTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAOTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 

35 CATATATATA ATCCOGAAAC ATG 

Seq ID NOi 326 Protein aequence 
Protein Accessicm #t NP_077741.1 

40 1 11 21 31 41 51 

I I I I I I 

NAAAGPRRSV RGAVCLHLLL TLVZFSRDGB ACKKVXLNVP 8KLBADKIZG RVMLEECFR8 60 

ADLZRS5DFD FRVLNDGSVY TARAVALSDR KRSFTim«SD KRKQTQKBVT VLLERQKKVS 120 

KTRHTRBTVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE 8DAAQNYTVP YSISGRGVDK 180 

45 EPLHIiFYIER ErTGNIiFCTRP VDREEYDVFD LIAYASTADG YSADLPI»PLP IRVEDKNDNH 240 

WPTEAIYNP EVIiBSSRPGT TVGWCATDR DBPDTKHTRL mrSILQQTPR 8P GLPS VHP8 300 

TGVZTTVSBy USREWDKYS LXMKVQDMDG QFFGLXGT8T CZXTVTD8liD KAPTFRQNAY 360 

BAFVEEKAFN VCXIiRXPXBD RDLXKTANWR VMPTXLKGNB N6HFKXSTDX ETNBGVLSW 420 

KPLNYBEMRQ VNLBIGVNNB APFARDIPRV TAUJRAIiVTV HVRDLDEOPE CTPAAQYVRI 480 

50 KBNLAVQSKX NOYKAYDPEN RNQNOLRYKK LHDPK6WITZ DEXSGSXIT8 KXLDREVETP 540 

KMELYNITVL AIDKDDRSCT GTLAVNIBDV NDNPPBILQB YWICKPKMG YTDILAVDPD 600 

EPVHQAPFYF SLFN7SPEXS RLWSLTKVMD TAARLSYQKN AGFQEYTZPX TVKDRA6QAA 660 

TKLLRVNIiCB CTHPTQCRAT SRSTGVZLGK WAZLAZLLGI ALLFSVLLTL VOSVFGATKG 720 

KRFPEDIiAQQ NLZISNTEAP GDDRVCSANG FMTQTTNNSS QQFCGTMGSG MKNGGQBTZE 780 

55 MMKGGKQTIiB SCRGA6HHHT LD8CRGGHTE VDNCRYTY8B HHSFTQPRLQ EE8XRGaTG 

8eg ZD MOt 327 DMA sequence 

KUcleic Acid Accession #*. 21M_001941.2 

Coding sequence* 64.. 2754 

oO 

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC 6CTCTCGGCA CCCTCCCGOC GOCX»0GTTC TCCTGGCCCT GCCCGGGATC 60 

COOATGGCCG CCQCTGGGCX: CXX3GCGCTCC GTGCGCGQAQ CCGTCTGCCT GCATCTQCTG 120 

65 CTGACOCTOG TGATCTTC3«3 TC3GTGATGGT GAAGOCTGCA AAAAGGTQAT ACTTAATGTA 180 

GCTTCrAAAC TAGAGGC3tf3A CAAAATAATT GGCAGA6TTA ATTTGGAAGA GTOCTTCAGG 240 

TCTGCAOACC TCATCOGGTC AAGTtSlTCCT GATTtGAGAG TTCTAAATGA TGGG TCAG TG 300 

TACAGAOCCA GGGCTGTTGC GCrGTCTGAT AA6AAAAQAT CATTTACCAT AT6GCTTTCT 360 

QACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

70 TCGAAGACAA GACACACTA6 AQAAACTGTT CTCAG60GTG CCAAOAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATQCAAGA GAATTCGTTG GGOOCTTTOC CATTOTTTCT TCAACAAQTT 540 

6AATCTGATG CA6CACAGAA CTATACT6TC TTCTACTCAA TAA0TGGA08 TQGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AQAGACACTG QAAATCTATT TTGCACTOGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGOGTC AACTGCAGAT 720 

75 GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATQACAAC 780 

CACCCTGTTT TCACAOAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAQACXIIGGT 840 

ACTACAGTGG GGGTGGTTIG TGCCACAGAC AGAGATGAAC CG6ACACAAT GCATA060GC 900 

CTOAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TQTGCATCCX: 960 

^ AGCACAGGCG TAATCACCAC AQTCTCTCAT TATTTGGACA GAGAGGTTOT AQACAAGTAC 1020 

80 TGATTGATAA TQAAAGTACA AOACATGGAT QGCCA6TTTT TTGGATTGAT AGGCACATCA 1080 

ACrrGTATCA TAACAOTAAC AGATTCAAAT GATAATOCAC CCACTTTCAG ACAAAATGCT 1140 

TATOAAOCAT TTGTAOAGGA AAATGCATTC AATGT6GAAA TCTTAGQAAT AGCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTQG AGAGTGAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGOTGT TCTTTCTGTT 1320 
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OTAAAGCCAC TGAATTATOft AGAAAAOOGT CAAOTGAACC TGGAAATTGO AGTAAACAAT 1380 
OAAGOOCCRT TTOCTAQAGA TATTCCCAOA GT6ACAGCCT TGAACAGAOC CTTGGTTACA 1440 
OTTCATOTOA GGQATCTGGA TOAGGQOCCT OAATGCACTC CTGCAGC3CCA ATATGTGOGG 1500 
ATTAAAOAAA ACTTA8CAGT GGGGTCAAAG ATCAAOGGCT ATAAGGCATA TQAC CCX3QAA 1560 
5 AATAQAAATQ GCAATGOTTT AAGGTAOUUV AAATTOCATQ ATCCTAAAGG TTGGATCACC 1620 
ATTQATGAAA TTTCAOOCnC AATCATAACT TCC3MATCC TGGATAGGGA GGTTOAAACT 1680 
CCCAAAAAT6 AGTTGTATAA TATTACAGTC CTGGCAATAO ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTGTGAA CATTOAAGAT QTAAATGATA ATCCACCAGA A ATAC TTCAA 1800 
^ GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACG6 ACATTTTAQC TGTTGATCCT 1860 
10 GAT6AACCTG TCCATGGAGC TCCATTrXAT TTCAOTTTGC CCAATACTTC TCCAGAAATC 1920 
AOTAGACTGT GGA6CCTCAC CAAAGTTAAT 6ATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 
AATGCTGGAT TTCAA6AATA TACC A TTCCT ATTACTGTAA AAGACAGGGC OGGCCAAGCT 2040 
GCAACAAAAT TATTQA6AGT TAATCTGTGT QAATGTACTC ATCCAACTCA GTGTC6TGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGQCAA TCCTTGCAAT ATTACTGGGT 2160 
15 ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG T6CAACTAAA 2220 
C3GGAAA0GTT TTGCT6AAGA TTTAGCACAO CAAAACTTAA TTATATCAAA CACAOAAGGA 2280 
CCTGGAGAC8 ATAGAGTGTQ CTCTGCGAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGG6CA GOAAACCATT 2400 
GAAATGATQA AAGGAGGAAA CCAGACCTTO GAATCCTOCC GGGGGGCTGG GCATCATCAT 2460 
20 ACCCTOGACT CCTGCAGGGG AGGACACAOG 6AG6TGGACA ACTGCAGATA CACTTACTOO 2520 
6AGTG6CACA GTTTTACTCA ACCOOGTCTC GOTQAAAAAT TGCATOOATO TAATCAQAAT 2580 
GAAOACGGCA TGCCATCCCA AOATTATGTC CTCACTTATA ACTAT6AGGG AAO AGGATC T 2640 
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AACCAGQAAO AAOATQOCCT T6ACTTTTTA 2700 
AATAATTTGQ AACCCAAATT TATTACATTA OCAQAAGCAT GCACAAAGAG ATAATGTCAC 2760 
25 AGT6CTACAA TTAGGTCTTT 6TCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 
TTGAATTTCA ACAIGTATOT ATATGATOAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 
CCAATTTATA TTTTTAAAGC CA0TTGTT6C TTATCTTTTC CAAAAAGTQA AAAATOTTAA 2940 
AACAGACAAC TGGTAAATCT CAAACTCCAO CACTGQAATT AAGGTCTCTA AAOCATCTGC 3000 
TCTTTTTTTT TTTTAOGGAT ATTTTAGTAA TAAATATGCT GQATAAATAT TAGTCCAACA 3060 
30 ATAGCTAAQT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AOTTTAAAAA 3120 
ATAAACAAOA AATATXGA6T ATCACTATGT GAAOAAAGTT TTGGAAAAGA AACAATGAAO 3180 
ACTGAATTAA ATTAAAAATG TTGCAQCTCA TAAAQAATTG 6GACTCACCC CTACT6CACT 3240 
ACCAAATTCA TTTGACTTTG GAGOCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 
TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATQCAAT 3360 
35 ATTTAAAATG AAAT6AGAAC AAA6A66AAA ATG6TAAAAA CT7GAAATGA GGCTGGGGTA 3420 
TA Q TTTGTCC TACAATAGAA AAAAGAQAGA GCTTGCTAOG CCTGG6CTCT TAAATGCTGC 3480 
ATTATAACTQ AOTCTATQAO GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 
GTAAATAAAT TAAACTTTTC TGGTTTCT6T GGGAAGGAAA TAGQGAATCC AATGGAACAO 3600 
TAGCTTTGCT TTGCAGTCTG TTTCAAQATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 
40 GAATACTOGC TGCAGCTGGG GTTCCCT6CT TTTTGGTAGC AAGGOTCCAG AGATGAGOTO 3720 
' mTm XS GQ GGA6CTAATA ACAAAAACAT TTTAAAACTT AUCm'A CTO AAGTTAAATC 3780 
CTCTATTOCr OTTTCTATTC TCTCTTATAO TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 
TAACCATGTC CTCCTAGAGT TTAQAGGCTA GAGGQAGCTG AGGOGAGQAT CTTACTGAAA 3900 
GCACCCTGGQ GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTQATCAG 3960 
45 GTCTOGQAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAa TGGCATTQGC 4020 
CT6AATCAAG GAAAGCCAGG GCTTQTCZGGC GOOCTTCTTT CGGCTTTCTG CT AAAG CAAC 4080 
ACCTCCAGCA GAQATTCCCT TAAGTGACTC CAGQTTTTCC ACCATCCTTC AGCXSTGAATT 4140 
AATTTTTAAT CAQTTTGCTT TCTCC3\GAGA AATTTTAAAA TAATAQAAGA AATAGAAATT 4200 
TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAQA ACAGAGGGAA CTTTGOQAOA 4260 
50 AA6CAGCCCA AGTAGGTTAT TTGTACAGTC A6AGG6CAAC AGGAAGATGC AGGCCTTCAA 4320 
GQQCAAGQAO AGQCCACAAa OAATATGGQT GGGAGTAAAA GGAACATCGT CTGCT TCATA 4380 
CTTTTTCCTA GGCTTG6CAC TGCCTTTTCC TTTCTCAGGC CAAT GGCA AC TGCCATTT6A 4440 
QTCC3GGT0AG G6ATCA6CCA ACCTCTTCTC TATGGCTCAC CTTATTTQGA GTGAGAAATC 4500 
AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TOCAGGATGA QCCTGAACTG 4560 
55 GTTQTGCAGA ACAAACAAGG CATTCATQGO AATTGTT6TA TTCCTTCTGC AGOjCnXTi' 4620 
CTGGGCACTA AQAAGGTCTA TGAATTAAAT GOCTATCTAA AATICTQAIT TATTC CTACA 4680 
TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATOTOTTTT AGACTTAGAC TTTTTATTQC 4740 
CCCCCCCCCC TT T TTTTTT G AQAOGQASTC TC3GCTCTGAC GCACAGGCTO GAOTGCAQTG 4800 
GCTCOGATCT CTGCTC31CTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTG CCTCA 4860 
60 GCCTCCTGAO TAQCTGGGAC TACAG606CC CACC3\CCACQ CCOGOCTAAT rm Wi'ATT 4920 
TTTAATAGAG AOGGGGTTTC A C IGT 0 TTA3 OCAGGATCKST CTCQATCTCX: TGACCTOGTG 4960 
ATCCGCCTQC CTC6GCCTGC CAAAOTQCTO G6ATTACAG6 CAT6ACGCAC COCTCCOGQC 5040 
CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 
TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCH AGAAGCCAGO 5160 
65 GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCIGAGAAT GGAATTCTCT GTAAGCCTA6 5220 
TTGCTGAAAT TTCCTOCI OT AACCAOAAGC CAGTTTTATC TAAOGGCTAC TGAAA CACC C 5280 
ACTGTGTTTT GCTCACTCCC TCACTCAOCG ATCAAAACCT OCTAOC TCCC CAAOACTTTA 5340 
CTAQTQCCGA TAAACTTTCT CAAAGAOCAA CCAQTATCAC TTCCCTGTTT ATAAAACCTC 5400 
TAACCATCTC TTTGTTCTTT QAACATGCTG AAAACCACCT GGTCTGCATO TATGCXZGQAA 5460 
70 TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG QATTCATITC TATATTTTCA 5520 
CATATGTAOT ATTATTATTT CCTTATATOT QTAAGGTGAA ATTTATOGIA TT TGAGTGT Q 5580 
CAAGAAAATA TATTTTTAAA QCTTTCATTT TTCCCCCAOT GAATOATTTA GAATTTTTTA 5640 
TOTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 
GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 
75 TOCTTTTAAA GAAACTTQQC TGCTTAAAAT AAGCAAAAAT TGQ ATGC ATA AAGTAATATT 5820 
TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTG6 TTTCTTGrTT TTQCT6TATT 5880 
TAGAOATTAA ATAATTCTAA GATOATCACT TTGCAAAATT ATGCTTATGQ CTG6CATGGA 5940 
AATAOAAATA CICAATTATG TCTTTGTTGT ATTAATGGGO AATATTTTOG ACAATGTTTC 6000 
ATTATCAAAT TGTOSACATC ATTAATATAT ATTOTAATOT TGGGAAOAGA TCACTATTTT 6060 
80 GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATGGGOT 6120 
ATTAAAAGTA TTAGAAOGTO GTTATAATTG GAGAOTATTC CATS AATA CT ACACTGACAC 6180 
A6GGGTTTTA CrTTGAGGAC CAOTGTAOTC AAGGGAAAAC ATGAGTTAAA A AGAA AAOCA 6240 
G6CAATATTG CAGTCTT6AT TCTOCCACTT ACAGGATAOA TAAT GCCIO A ACTTTAATQA 6300 
CAAQATGATC CAACCATAAA GCJTQCTCTGT GCTTCACAOT GAATCTTTTC GOCATGCAGQ 6360 

1080 
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AGTGTGCTCC CCTACAAAGG TTAAGACTBA TCIVTTTCAAft AMTCTATTAG CTATATCAAA 6420 

AGCCTTACAT TTTAATATAG GTTQAACCAA AATTTCRATT CCRGTOACTT CTATTGTAAC 6480 

CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TrraTTTOTA ATAGTAAAAT 6540 

AC06GATACA TTTCAOGTGT CCTTCAC5TAT TQATTTGGTT GAATATTGOQ TCATAATGGT 6600 

5 TGAQAAGCAT GGACACTAGA GCCA6AATGC TTC3QATATGA ATCCTGGATC TGTCACTTAC 6660 

TTCTGTGTGA CCTTTGAAAQ OCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

GAACAATGCC AOCCTCATGO G OTT G TT G AA TQATTAAATT AGTTAA TATA CCTAAAGTAC 6780 

ATAGAACACT GCCTGCACAT AGTAAAA8AA TTATAAGTGT GA6GTAGTTG QT AAAATT AT 6840 

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AOGGAAATAA AGTTTGTGCA 6900 
10 TATATATAAT CCCGAAACAT G 

Seq ID KO; 328 Protein sequence 
Protein Accession fti MP_001932.1 

15 1 11 21 31 *1 51 

I I I I I I 

MAAAGPRRSV RGAVCXjRLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIO RVNIjBBCFRS 60 

ADLIRSSDPD PRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKBVT VLLEKQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PPPLPLQQVE SDAAQMYTVP YSISGRGVDK 180 

20 EPIiNIiFYIER mGNXiFCTRP VDRBEYDVFD LIAYASTADG YSADLPLPLP XRVEDENDNH 240 

PVFTBAIYIIF EVLESSRPGT TVGWCATDR DEPDTMHTRL KYSILQQTFR SPOLFSVHPS 300 

TGVITTVSHY LDREWOKYS LIMKVQDMDO QPPGLIOTST CIITVTDSSD MAPTPRQMAY 360 

EAFVBQIAFN VEILRIPXH) KDLINTANWR VNPTILKGNE NQHFKISTDK ETNEGVLSW 420 

KPLNYEENRQ VKLEIGVNNB APFARDIPRV TALNRALVTV HVRDIiDEQPB CTPAAQ YVRI 480 

25 KENLAVGSKI NGYKAYDPEM RNGNGUIYKK LHDPKGWITI DEISGSIITS KXLDREVETP 540 

KNBLYNITVIi AIDKDDRSCT GTIiAVMIBDV HOMPPBIMJB YWZCKPXMQ YTDILAVDPD 600 

EPVHGAPFYP SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQBVTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTQVILGK KAXLAILLGI ALLFSVIiTL VCGVFGATKG 720 

KRPPEDLAQQ NLIISNTBAP GDDRVCSANG FMTQTTNNSS QGPCGTW3SO MKNGGQETIE 780 

30 MMKGGNQTLB SCRGAGKHHT LDSCSiGGHTE VDNCRYTYSB WKSPTQPRLO EKLKRQ7QNE 840 
DRMPSCS)YVL TYNYEGR6SP AGSVGCCSEK QEEDGLDFLN NLEPKPITLA EACTKR 



70 



8eq ID KOt 331 DNA sequence 

Nucleic Acid Accession ff: im_004363.1 

Coding sequence t 115.. 322 3 



Seq ID NOi 329 DNA sequence 
35 nucleic Acid Accession #< NM_016583.2 
Coding sequence t 72 . . 842 

1 XX 21 31 41 51 

^rx 1 I I I I I 

40 GGAGTOaOOG AGAGAOAGGA GACOVGGACA GCTGCXGASA CCTCTAAQAA OTCCAGATAC 60 

TAAGAGCAAA GATGTTTCAA ACTQGGGOCC TCATTOTCTT CT A0GG6 CTQ TTA6CGCAGA 120 

CCATGGCCCA GTTTGQAGGC CTGCX»GTGC CCCTGGACCA QACCCTGCCC TTGAATGTGA 180 

ATCCAGCCCT GCCCTTGAGT (XCACT^GGTC TTQCAGQAAG CTTGACAAAT GCCCTCAGCA 240 

ATGGGCTGCT GTCXGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

45 TGAAGCCTGQ AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAOTGAOQT 360 

CAGTGATTCC TGGCCTQAAC AACATCATTQ ACATAAAGGT CACTGACXXX: CAGCTGCTGQ 420 

AACTTGOCCT TGTGCAQAQC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTGAATACX3 CCCCTGOTOO GTGCAAOTCT GTTQAGGCTG GCTGTGAAGC 540 

TGQACATCAC TGCAGAAATC TTAQCTGTGA GAGATAAGCA GGAOAGGATC CACCTGGTCC 600 

50 TTCQTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTQ CTTGAT GGACTTGGCC 660 

CCCXCCCCAT TCAAGGTCTT CTGOACAGCC TCACAGOGAT CTT6AATAAA GTCCTGCCTG 720 

AOTTCQTTCA GGGCAACXSTQ TGCCXTTCTCG TCAATQAGGT TCTCAGAOGC TTGGACATCA 780 

CXXrrGGTGCA TGACATTQTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 

AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT QAGCTGCTTC CX3«3TQCTCA CAGATGGCTO 900 

55 GCCCATGTGC TGGAAGATGA CACAQTTGCC TTCTCTCCGA QGAACCTGCC CCCTCTOCTT 960 

TCCCAOCAGG CGTOTOTAAC ATCCCATGTQ CCTCACCTAA TAAAATOOCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 

8eq ID KOj 330 Protein sequence 
60 Protein Accession ftt NP_0S7667.1 

1 11 21 31 41 51 

1 I i I I ) 

MFQTCGLIVP YGLIiAQTMAQ PGGLPVPLDQ TLPI2JVMPAL PLSPTGLAGS LTMALSKGLL 60 
65 SGGLLGILBN LPLLDILKPG GGTSGQLLGG MiGiWTSVIP GliNNIIDIKV TDPQUiEUSL 120 
VQSPDGHRLY VTIPLGIKLQ VNTPLVOASL XiRLAVKLDIT ABXIAVRDKQ ERIRLVU3DC XBO 
THSPGSLQIS LLDGLGPLPX QOLLDSLTGI LNKVLPELVQ GNVCPLVNEV LROliDXTItVH 240 
DXVMMLZHOL QFVIKV 



75 X XX 21 3X 41 51 

I I I i I I 

CTCAGGOCAG AGGGAGGAAG GACAGCAGAC CA6ACAGTCA CAGCAGCCTT OACAAAAOGT 60 

TCCTGGAACT CAAGCTCTTC TCCACAGAGQ AOGACAGAGC AGACAGCAGA GACCATGGAG 120 

TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCXXrTGGC AQAGGCTCCT GCTCACAGCX: 180 

80 TCACTTCTAA CCTTCTGGAA CCOQGCCACC ACTOCCAAGC TCACTATTGA ATCCAC6C00 240 

TTCAATGTOa CAGAGGGOAA GQA6GTGCTT CIACTTOTCC ACAATCTGCC CCAGCATCTT 300 

TTTGGCTACA GCTGGTACAA AGGTGAAAOA GTGQATOGCA AOOOT CAAAT TATAGGATAT 360 

GTAATA06AA CICAACAAGC TACCCCA06G COOGCATACA GTOGTOGAGA 6ATAATATAC 420 

CCCAATQCAT CCCTQCTOAT OCAGAACATC ATCCAQAATG AACAOOATT CTACACCCTA 480 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CftOCSTCMTAA 
GAGCTGCCCA 
GTGGCCTTOV 
CAGAGCCTCC 
TTCAATQTCA 
GGCAGGCGCH 
TCraCTCTAA 
TCTAACCCAC 
GAGCTCTTTA 
AACTCAOACA 
CCCAAACCCT 
TTAACCTGTG 
CTCCOGGTCA 
GTCACAAGGA 
CACA6C6ACC 
TOVTACACCT 
CX310CTGCAC 
TTTATCTCCA 
GCCAOTGOCC 
CCCTCCATCT 
TGTGAACCTO 
GTCAGTCCCA 
AGAAATGACX3 
GACCCAGTCA 
TCGTCTTACC 
COQCAOTATT 
GCCAAAATCA 
GGCG6CAATA 
CTCTCAGCTG 
TAGCAGCCCT 
TAAAGCATTT 
AGACTCTGAC 
AAATACAAAA 
TGAGGCAGGA 
ACTGCACTCC 
TCTGACCTGT 
AACTTTAATG 
TAATTAATTT 
TTCCCAGATT 
AAATATACTT 
AGACTTGGGA 
TCAATAAAAA 



AGTCAGATCT 
AGCCCTCCAT 
CCTGTGAACC 
CX5GTCA0TCC 
CAAGAAATGA 
GTGATTCW3T 
ACACATCTTA 
CTGCAC3«5TA 
TCCC CAACAT 
CT06CCTCAA 
TCATCAGCAO 
AACCTGAOAT 
GTCCCAGGCT 
ATGATQTA6G 
CAGTCATCCT 
ATTACCGTCC 
AGTATTCTTG 
ACATCACTGA 
ACAOCAGGAC 
CCAGCAACAA 
AOGCTCAQAA 
GGCT6CAGCT 
CAAGAGCCTA 
CCCTGGATQT 
TTTCGGGAGC 
CTTGGGOTAT 
CGCX3VAATAA 
ATTCCATAGT 
GGGCCACTGT 
GQTGTAGTTT 
GCAACAGCTA 
CAGAGATCGA 
ATGAGCTGGG 
6AAT0GCTTQ 
AGTCTGGCAA 
ACTCTTGAAT 
AACTAACTQA 
CATGGGACTA 
TCAGGAAACT 
TTGTGAACAA 
AACTATTCAT 
TCTGCTCTTT 



TGTGAAT6AA 
CTCCA6CAAC 
TGAGACTCAG 
CAGGCTGCAG 
CACAGCAAGC 
CATCCTGAAT 
CAGATCAGGG 
CTCTTGGTTT 
CACTGTGAAT 
TAGGACCACA 
CAACAACTCC 
TCAGAACACA 
GCAGCTOTCC 
ACCCTATGAG 
GAAT6TCCTC 
AOGGQTOAAC 
GCTGATTGAT 
GAAGAACAGC 
TACAGTCAAG 
CTCCAAACCC 
CACAACCTAC 
OTCCAATGGC 
TGTATQTGGA 
CCTCTATGGG 
GAAGCrCAAC 
CAATGGGATA 
TAACGGQAGC 
CAAGA6CATC 
CGGCATCATG 
CTTCATTTCA 
CAGTCTAAAA 
GACCATCCTA 
CTTGGTGGCG 
AACCCGGGAG 
CA6AGCAAGA 
ACAAGTTTCT 
CAGCTTCATG 
AATGAACTAA 
TTTTTTCTTT 
AAATTGAGAC 
OAATATTTAT 
GTATAACAGA 



GAAOCAACTG GOCAGTTCCO 
AACTCCAAAC COGTGQACGA 
6ACX3CAACCT ACCTGTGGTQ 
CTGTCCAATG GCAACAGGAC 
TAC3U\ATGTG AAACCCAGAA 
OTCCTCTATO GCCOGGATGC 
GAAAATCTGA ACCTCTCCTQ 
OTCAATGGGA CTTTCCAGCA 
AATAGTGGAT CXTTATACGTO 
GTCACGACGA TCACAGTCTA 
AACXrCGTGG AGQATGAGGA 
ACCTACCT6T G0T66GTAAA 
AATQACAACA GGACCCTCAC 
TGTGGAATCX: AGAAOGAATT 
TATGGCCCAG ACGACXXX3^C 
CTCAGCCTCT CCTQCCATQC 
GOGAACATCC AGCAACACAC 
GOACTCTATA CCTGCCAGGC 
ACAATCACS^ TCTCTGOOQA 
GTGGAGGACA AGGATGCTGT 
CTOTG G TG Q Q TAAATGGTCA 
AACAGGACXX: TCACTCTATT 
ATCCAGAACT C3VBTGAGTGC 
COGGACACCC CCATCATTTC 
CTCTCCTGCC ACTOGGCCTC 
GOGCAGCAAC ACACACAAGT 
TATGCCT6TT TTGTCTCTAA 
ACAGTCrCTG CATCT6GAAC 
ATTGGAOTQC TGGTTGGGGT 
GQAAGACTGA CAGTT6TTTT 
TTGCTTCTTT ACCAAGGATA 
GCCAACATCG TGAAACCCCA 
CGCACCTOTA GTCCCAGTTA 
OTGGAGATTG CAOTGAGCCC 
CTCCATCrCA AAAAGAAAAG 
GATACCACTG CACIGTCT6A 
AAACTGTCCA CCAAGATCAA 
TGAGGATTGC TOATTCTTTA 
TAAGCTATCC ACTCTTACAG 
ATTTACATTT TCTCCCTATO 
ATTGTATGGT AATATAGTTA 
AAAA 



G6TATACCCQ 
CAAGGAT6CT 
GGTAAACAAT 
CCTCACTCTA 
CCCAGTGAGT 
CCOCACCATT 
CCAC6CAGCC 
ATCCACCCAA 
CCAAGCCCAT 
TGCA6AGCCA 
TGCTGTAGCC 
TAATCA6A6C 
TCTACTCA6T 
AAGTGTTGAC 
CATTTCOCCC 
AGCCTCTAAC 
ACAAGAGCTC 
CAATAACTCA 
6CTGCCCAAG 
GGCCTTCACC 
OAGCCTGCCA 
CAATGTCACA 
AAACOSCAGT 
CCCCCCAGAC 
TAACCCATCC 
TCTCTTTATC 
CTTG6CTACT 
TTCTCCTGGT 
TGCTCTGATA 
GCTTCTTCCT 
TTTACAGAAA 
TCTCTACTAA 
CTCGGGAGGC 
AGATCGCACX: 
AAAAGAAGAC 
GAATTTOCAA 
GCAQAOAAAA 
AATGTCTTGT 
CAATTIQATA 
TGGTOGCTCC 
TT6CACAAGT 



540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
IBOO 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



Seq ID NO: 332 Protein sequence 
Protein Accession tf: HP 0043S4.1 



MESPSAPPHR 
HLFGYSWYKQ 
TLHVIK8DLV 
IINQSLPVSPR 
TISPLNTSYR 
AmiSDTGI^ 
QSLPVSPRLQ 
SPSYTYYRPO 
NSASGHSRTT 
IiPVSPRLQLS 
FDSSYLSGAK 
ATGRNNSIVK 



11 

I 

WCIPHQRLLL 
ERVDGNRQII 
NEEATGQFRV 
LQLSNGNRTL 

TTVTTITVYA 
LSNDNRTLTL 
VNLSLSCHAA 
VKTITVSAEL 
NGNRTLTIiFV 
liNLSCHSASN 
SITVSASGTS 



21 
1 

TASIiLTFMNP 
GYVIGTQQAT 
YPGLPKPSXS 
TLFNVTRNDT 
AASNPPAQYS 
BPPKPPIT8N 
I.SVTRNDVGP 
SNPPAQYSWL 
PKPSISSKNS 
VTRHDARAYV 
PSPQYSHRIN 
PGLSAOATVG 



31 



PTTAKLTIES 
PGPAY80RBZ 



ASYKCETQNP 
WFVNGTPQQS 
KSNPVEDEDA 
YECGIQ NE^ 
IDGNIQQHTQ 
KFVBDKDAVA 
G6IQNSVSAN 
aiPQQHTOVIi 
INlGVItVGVA 



41 

I 

TPFNVAEGKE 
IYPNA8Lt*IQ 
DAVAFTCEFB 
VSARRSDSVI 
TQEliFIPNIT 
VALTCEPEIQ 
VDK8DPVIUI 
SIiPZSKITEK 
PTCEPEAQNT 
RSDPVTXiDVIt 
FIAKZTFIQ1N 
LZ 



51 

i 

VLLLVHNLPQ 
NIIQNDTGFY 
TQDATYZMWV 
LNVLYOFDAP 
VNNSGSYTCQ 
NTTYLWWVNN 
VLYGPOOPTZ 
NSOIiYTOQAN 
TYLHHVNGQS 

YGFiyrpixsp 

QTYACFVSML 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



Seq ID NO I 333 DHA sequence 

Nucleic Acid Accession ftt NM_006952.1 

Coding sequence I 11.. 7 93 



I 

AATOCXXSACA 
TG6AAAT6T0 
ATCTGAOCSUl 
GGCTQCCTGG 
TGTAGGCATC 
AGTATATGCC 
AOCCAACCTC 
TQATGACCAG 
CAATT6CTGT 
TGAQAATAAT 
AOAACCTCTC 
CTGCTAT6AA 
ATTT6CCATT 
AATTGAATAT 



11 
I 

ATGGC6AAAG 
ATTATTGGTT 
CACAGCCTCT 
ATCGGCATAT 
ATGAAOTCCA 
TTTGAAGTXaG 
TTOCTQAAGC 
TGGAAAAACA 
GGOGTAAATG 
GATGCTGACT 
AACCTGGAGG 
CTGATCTCTG 
CTCTGCIGGA 
TAAGAA 



21 

I 

ACAACTCAAC 
GTTGCX3GCAT 
ACCCACTGCT 
TTGTGGGCAT 
GCAGGAAAAT 
CATCTTGTAT 
AGATGCTAGA 
ATGGAGTCAC 
GTCXS^TCAGA 
ATCCCTGQCC 
CTTGTAAACT 
GTCCAATGAA 
CTTTTT GG GT 



31 
I 

TGTTCGTTQC 
TGCCCTGACT 
TGAAGGCACC 
CTGCCTCTTC 
TCTTCTGGCG 
CACAGCAGCA 
GAGGTACCAA 
CAAAACCTGG 
CTGGCAAAAA 
TC6TCAATGC 
AGGCX3TGCCT 
CCGACACGCC 
TCTGCTGGGT 



41 

I 

TTCCAGGGCX: 
GCGGAGTGCA 
QACAAG6ATG 
TOCCTGTCT O 
TATTTCATTC 
ACACAAOGAO 
AACAACAGCC 
GACA06CTCA 
TACACATCTG 
TGTGTTATGA 
GQTTTTTATC 
TOGGGGQTTG 
ACCATGTTCT 



51 
I 

tgctgatttt 
tcttctttgt 
acatctatgg 

TTCTAOGCAT 
TGATGTTTAT 
ACTTTTTCAC 
CTCCAAACAA 
TGCTCCAGGA 
CCTTCOGGAC 
ACAATCTTAA 
ACAATCAGGG 
CCTOOTTrGG 
ACTGGAGCAO 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Seq ID NOt 334 Protein secfuence 
Protein Accession #i NP_00a883.1 
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1 11 21 31 41 51 

I I I I i t 

MAKDNSTVRC FQQLLIPGMV IIGCCGIALT AECIPPVSDQ HSLYPLLEAT DNDD IYQA AW 60 
5 IGIFVGICLP CLSVLGIVGl MKSSRKILIA YPILMFIVYA FBVASCITAA TQRD FFTPM L 120 
FLKQMLBRYQ NNSFFNNDDQ HIQQiaVTKTH DRLMLQDHCC GV1R3FSDWQR YTSAFRTENN 180 
DAOYPNPRQC CVMNNLKEPL NLBACKLGVP GFYHHQGCirB LZSQPtQIRKA HGVAWF6PAI 240 
liCWTPWVLLG TMPYWSRIEY 

10 Seq ID NO: 335 DMA aequencB 

Nucleic Acid Accession «: NM_002638.1 
Coding sequences 120.. 473 

1 11 21 31 41 51 

15 1 I I 111 

CAATACAGCT AAGQAATTAT CCCTTOTAAA TACCACAGAC COGCGCrGQA GCCAGGGCAA 60 
GCTGGACTGC ATAAAGATTG OTATOQCCIT AQCTCTTAGC CAAACACCTT CCTQACACCA 120 
TGAGGGCCAG CAGCTTCTTQ ATOQTGGTGG TGTTCCTCAT CGCTGGGACG CT^TTCTAG 180 
AOGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTOTTCCAT 240 
20 TCAATGQACA A6AT0C0GTT AAAGQACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAO 300 
CGCAAOAGCC AOTCAAAGGT CCAOTCTCCA CTAAGCCTGG CTCCTGCCGC ATTATCTTOA 360 
TCCGGTGOGC CATGTTGAAT CCCCCTAACC OCTGCTTOAA AOATACTGAC TGCCCAGGAA 420 
TCAAGAAGTG CTGTGAAGGC TCTTOCGGGA TGGCCTGTTT 0GTTCCX3CAG TGAAGQGAGC 480 
OGGTCCTTGC TGCACXTTCTG CCGTCCCCAQ AGCTACAOGC CCCATCTGGT CCTAAGTCCC 540 
25 TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGAT6 CCCACXSGCTG 600 
GAQCTGCC TC TCTCATCCAC TTTCCAATAA A 



30 
35 



Seq ID NO I 336 Protein sequence 
Protein Accession #t MP_002629.1 

1 11 21 31 41 51 

I 1 1 t 1 I 

MRA88FLIW VFLXAGTLVL BAAVTGVFVK GQDTVXORVP FNGQDPVNSQ VSVRGQDKVK 60 
AQBPVKQPVS TXPGSCPIXL ZRCAMUIPPH RCLXDTDCP6 ZXKCCB68C0 MACFVPQ 

8eq ID NOt 337 DNA sequence 
Nucleic Acid Accession #i NM_001793.2 
coding sequence: 71.. 2560 

40 1 11 21 31 41 51 

I I I I 1 I 

AAAGGGGCAA GAGCTOAGCG GAACACCGGC COGCOOTOGC GQCAGCTGCT TCAOC CCTCT 60 

CTCTGCAGOC ATGGGGCTCC CTCGTGGACC TCTOGOGTCT CTCCTCCTTC TCCAGGTTTG 120 

CTOQCTGCAQ TGOGOOQCCT CCGAQCCGTO CCGGGCGGTC TTCAGGGAGG CTGAAGTQAC 180 

45 CTTGGAGGCQ GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGOO QAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCAQ CTCTGTTTAO CACTQATAAT GATGACTTCA CT QTOOGGAA 300 

TGOCGAQACA GTCCAGGAAA GAAGGTCACT GAAOGAAAGG AATCCATTCA AQATCTTCCC 360 

ATCCAAAOGT ATCTTACQAA GACACAAGAG AGATTOOGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGOTCCCT TCCCCCAQAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

50 AGACACCAAG ATTTTCTACA GCATCACGGQ GCOGGGGGCA OACAGC CCCC CTGAGGGTGT 540 

CTTCGCTOTA GAGAAGGAGA CAGGCIQ G TT GTTGTT6AAT AAGGCACTGG AOGOGQAOGA 600 

GATTCCCRAG TATGAGCTCT TTGGCCAOGC TOTGTCAGAO AATGGTGCCP C AOTGG AGGA 660 

OCCCATGAAC ATCTCCATCA TCQTOAOCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTAlGAGGQ AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

55 GACAGCCAOO GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTG6TTG CTTACTCCAT 840 

CCATAOCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC AO CATTC AOC OGAOCACAGO 900 

CACCATCAGC GTCATCTCCA GTGGCCTQGA CCGGGAAAAA CTOOCTGAST ACACACT6AC 960 

CATCCAGQCC ACAGACATQG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAOTQGA 1020 

GATCCTTGAT GOCAATGACA ATGCTCCCAT GTTTQACCCC CAQAAGTACG AGQCCCATGT 1080 

60 GCCTGAGAAT GCAGTGGGCC AT3AGGTGCA GAGGCTGACO GTCACTGATC TGQAOGCCCC 1140 
CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GOTGACQACG QOGAOCATTT . 1200 

TACCATCACC ACCCACCCTG AGAGCAACCA GGOCATCCTO ACAACCAOGA AGOTTTTGGA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCT6TA OGTTGAAGTG ACCAAOGAQG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATQAGGC 1380 

65 ACCIGTGTTT GTCCCACCCT CCAAAOTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCrcTGTGT GTCTACACTQ CAGAAGACCC TGACAAGGAO AATCAAAAGA T CAGCT ACOO 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AG0TCACA6C 1560 

T6TGGGCA0C CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GOTCTTQOOC ATGQACAATG GAAGCCCTCC CACCACTGGC ACGQGAACCC TT CTGCTA AC 1680 

70 ACTGATTQAT GTCAATGACC ATGQCCCAGT OCCTGAGCCC GOTCAGATCA CCATCTG^ 1740 

CCAAAGCCCT GTOOGCCAGG TGCTGAACAT CAOGGACAAQ GACCTGTCTC GCCACAOCTC 1800 

CCCTTTCCAG GCOCAGCTCA CAGATGACTC AGACATCTAC TGGAOGGaG AGGTCAAOGA 1860 

QGAA6GTGAC AGAOTGGTCT TGTCCCTGAA GAAOTTCCTG AAGCAGGATA CATATGACGT 1920 

OCACCTTTCT CTOTCTQACC ATGGCAACAA AGAGCAGCTG AOGGTGATCA GGGC CACTQT 1980 

75 OTOOGACTGC CATGGOCATG TCQAAACCTG CCCTGGACOC TGGAAGGGAG GTTTCATC CT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTOCT GTTCCTCCTG CTGGTGCTGC TTTTGTT GGT 2100 

GAGAAAGAAG OGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA 0006TGACAA 2160 

C8TCTTCTAC TATGGOGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCAGCQA GGTCTOQAGG CCAGGCCGGA GGTGGTTCTC OGCAATQACO T06CACCAAC 2280 

80 CATCATCCOG ACACCCATGT ACOQTCCTCG GCCAGCCAAC CCAGATOAAA TCGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCOCCBGOCT ACQAg^ CCCT 2400 

CTTCGTOTTC GACTATGAGG GCAGCGGCTC OQAO6C0G00 TC CCTO ftOCT CCCT CACCT C 2460 

CTCCGCCTCC OACCAAOACC AAGATTACGA TTATCTGAAC GAQ TOTOGC A GGOSCTTCAA 2520 

GAAGCTGGCA GACATGTAC6 GTGGCOGOQA OQAGQACTAO GOGQOCIGCC TGCAGQOCTG 2580 
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GGQAOCAAAC GTOVGGCCAC A8A6CATCTC CAAGGGOTCT CAlQTrCCCCC TTCAGCIGAO 2640 

GACTTCGGA6 CTT6TCAGQA AGTGGCOOTA GCAACrTGOC GQAOftCAGGC T ATGAC TCTG 2700 

ACOTTAGAGT GQ T T GCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AOCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGQCCAAOTT TCCAGAAGCC 2820 

5 TCTTACCTGC OGTAAAATGC TCAACCCTGT GTCCTOQGCC TGGGCCTCCT GTGACT GACC 2880 

TACAGTGGAC riTCItTlVI'O OAATQGAACC TTCTTAG60C TCCTGQTOCA ACTWATTTT 2940 

TTTTTTTAAT GCTATCTTCA AAAOOTTAQA GAAAGTTCTT CAAAAGTGCA 6CCCA6AGCT 3000 

GCTG G GCCC A CT0GCC6TCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CC CATT CGGA 3060 

TGGATCrCTO CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CXTTTATTTTT T ATTTTCC CT 3120 

10 GTrGOGTTGC TATAGAT6AA GGGTGAGGAC AATC6TGTAT ATOTACTAGA ACTTTTTTAT 3180 
TAAAGAAAGT TTTCCCAGAA AAAAA 



Seq ID HOt 338 Protein segruence 
Protein Accession ftt NP_0017B4.2 

1 11 21 31 41 51 

I I I I I I 

KGLPRGPIiAS LUiUQVCWLQ CAASBPCRAV PRBAEVTLEA GGAEQEPGQA LGKVPMGCPG 60 

QEPALPSTDN DDPTVRMGET VQERRSUCER NPUCIPPSKR ILRRHKRDWV VAPISVPQTG 120 

20 KGPPPQRIiNQ LRSNiCDItDTK IFYSZTGP6A DSPPB6VFAV EKETGWLLU« KPLDREBIAK 180 

YELF8KAVSB KGASVEDPMN ISIIVTXXSND HKPKFTQOTP RGSVLBSVXiP CPrSVMQVTAT 240 

DEDDAXyTYN GWAY8IHSQ BPKDPKDLMP TZKRSTGTZS VI8S6L0RBR VPEYTLTIQA 300 

TDMDGDGSTT TAVAWSILD ANDNAPMFDP QKYEAHVPEN AVOTBVQRLT VTDLDAPNSP 360 

AWRATYLIMO GDDGDHPTIT THPESNQGIL TTRKOLDFEA KNQHTLYVBV TNEAPPVLKL 420 

25 PTSTATIWH VEDVNEAPVP VPPSKWEVQ EGIPTGEPVC WTAEDPOKB MQKZSYRIIiR 480 

DPAGWIiAMDP OSGQVTAVGT XjDRBDEQFVR NNZYEVMVLA KDNGSPPTTG TGTLL LTLID 540 

VNDKGPVPEP RQITICNQSP VRQVI2IITDK DLSPHTSPPQ AQLTDDSDIY HTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCax: HGHVETCPGP WKGGPILPVL 660 

GAVIiALLPLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGBEGGGEED QDYDITQLHR 720 

30 GLEARPBWL RMDVAPTIIP TPMYRPRPAN PDBI6MPIIB NLKAANTDPT APPYDTLLVP 780 
DYBGSGSDAA SLSSLTSSAS DQDQDYDYLN EMGSRFKKLA DMYOGGEDD 



Seq ID MOt 339 DMA sequence 
Nucleic Acid Accession «: Boo sequence 
35 Coding sequence: 1..672 

I 11 21 31 41 51 

I I I I 1 t 

ATGAtSQCTCC AAAGACCCC3G ACAGQCCCCG GCGGGTGGGA GGCGCGCGCC CCGGGGCGGG 60 

40 CGGOGCrCCC CCTACCGGCC AGACCCGGGO AGAGGOQCGC GQAGGCTGCG AAGGTTCCAG 120 

AAGGGOQGGG AGGGGGCGCC GCGCOCTGAC CCTCCCTGGG CAC06CTGGG GAC ^Q QCO 180 

CTGCTCXKCr TGCTGCTGGT CGTGGCCCTA C030GGGT0T GQACAQACX5C CAACCTQACT 240 

G06AGACAAC GAGATCCAGA GGACTCCCAG OQAAOGGACG AGGGTGACAA TAGAGTGTGG 300 

TGTCATOTTT GTGAGAGAGA AAACACTTTC GAQTGCCAQA ACCCAAGGAG GTGCAAATGG 360 

45 AlOUSAGCCAT ACTGCGTTAT AGCGGCCGTG AAAATATTTC CAOGTTTTTT CATG GTTGCG 420 

AAGCAGTGCT COGCTGOTTG TGCAQCGATG GAGAGAOCCA AGGCAGAGQA OAAGOGGTTT 480 

CrOCTGOAAG AGCCCATGCC CTTCTTTTAC CTCAAGTGTT GTAAAATTOQ CCACTGCAAT 540 

TTAQAG666C CACCTATCAA CTCATCAlOTG TTCAAA6AAT ATGCTGGGAG CAT6GGTGAO 600 

AGCTGTGOTO GGCTOTGGCT GGCEATOCTC CTQCTGCTGG CCTCCATTGC AGCCGGCCTC 660 
50 AGCCTGTCTT GA 

Seq ID NO: 340 Protein sequence 
Protein Accession #i Bos sequence 

55 1 11 21 31 41 51 

11)111 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGBGAPRAD PPWAPZ^3TMA 60 

UiALIiUVVAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCEREOTF BOQNPRRCKW 120 

^ TBPYCVIAAV KIFPRFFKVA KQCSA6CAAM BRPKPEEKRP LLEBPMPPFY UCCCKIRYOI 180 

60 LEGPPINSSV FKEYAGSMQE SCGGLWLAIIi LLZiASZAAaL SIiS 



65 



Seq ID NO: 341 DNA sequence 

inicleic Acid Accession fft XM_035292.2 

Coding sequence i 53.. 1576 



1 11 21 31 41 51 

I I I I I 1 

GCTOGCTQGG G08CGGCTCC CQOOTGTCCC AGGCCOSGCC GGTGCGCAOA GCATGGOGGQ 60 

TCOQGGCCGG AACCGOCOCG CGCTAGCGGC GCCGGOGGCC GAGGAGAAGG AAGAGGOGOQ 120 

70 QQAGAAGATG CTGGCCGCCA AGAGOGOGGA CGGCTOGGCO CCGGCAGGOQ AG^CGAGGG 180 

OGTGACCCTG CAGCGGAACA TCAOGCTQCT CAAOGGCGTG GCCATCATOG TOSOO ACCAT 240 

TAT0GG C TC3S GGCATCTTCG TGA06CCCAC GGGOGTGCTC AAGGAGGCAO GCTGGCOGGG 300 

GCTGGGGCTG GTGGTGTGGQ COGOOTGOGO 03TCTTCTCC ATCGTGGGCG CGCTCTQCTA 360 

CGCGGAGCTC GGCAOCACCA TCTCCAAATC GGOCGGCGAC TACGCCTACA TGCTGGAGOT 420 

75 CTAOGGCTOG CTGCC06CCT TCCTCAAGCT CTaOATCQAG CTGCTCATCA TCOGGCCTTC 480 

ATCGCAGTAC ATOSTGGCCC TG3TCTTCQC CACCTACCTG CTCAA OCOG C TCXTCCCCAC 540 

CTGCCOGGTQ CCOGAGGAGG CAOCCAAGCT OGTGGCCTGC CTCTGCGTCC TGCTGCTCAC 600 

GGC0QT6AAC TGCTACAOOG TGAAOGCCGC CACCOGGGTC CAGGATGCCT TTGCCGCCGC 660 

CAAOCTCCTG GCCCTGGCCC TGATCATCCT GCTGOGCTTC OTCCAGATCO QAAAGGGTGA 720 

80 TGTGTCCAAT CTAGATCCCA ACTTCTCATT TGAAG6CACC AAACTGGATO TGGQGAACAT 780 

TGTGCTQGCA TTATACAGOG GCCTCTTTGC CTATGOAGaA TG6AATTACT TOAATTTOST 840 

CACAOAGGAA ATGATCAACC . CCTACAGAAA CCT6CCCCT0 GCCATCATCA TCTCCC IGCC 900 

CATOGTQAOQ CTQGTQTAOQ TQCTOACCAA CCTGOCCTAC TTCAGCACCC TOTCCACGQA 960 

GCAGATGCTG TCGTCC6AGG COQT06COQT GGACTTOQQO AACTATCMrC TGGOOGTCAT 1020 
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OTCCTGGATC ATCCCCGTCT TOOTGGGCCT GTCCTGCTTC GGCTCCGTCA ATOGGTCCCT 1080 

GTTCACATCC TCCAGGCTCT TCTTCGTGGG GTCCCBGOAA GGCCACCTGC CCTCCATCCT 1140 

CTCX31TGATC CRCCCACAOC TCXTTCACCCC CXnXSCCGTCC CTOaTGTTCA CGTS TOTGAT 1200 

GAOO C T G CTC TAOQCCTTCT CCAAGGACAT Ci ' lUimiTC ATCAACTTCT TCAGCTTCTT 1260 

CAACTGQCTC TGGQTGGCCC TGGCCATCAT CX3GCAT8ATC TGGCT60QCC AC ftQAA AOCC 1320 

TGAGCTTQAQ CGGCCCATCA AGGTGAACCT GGCCCTOCCT OTOTTCTTCA TCCTGQCCTO 1380 

CCTCTTCCTQ ATOGCCOTCT CCTTCTGGAA GACACCXXTTQ GAGTGTGGCA TCGGCTTCAC 1440 

CATCATCCTC A60GGGCTGC COGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 1500 

GTQGCTCCTC CA0G6CATCT TCTCCAGQAC 06T0CTGT6T CA6AA6CTCA TGCAOGTQGT 1560 
CCCCCAG6A6 ACATAGCCAG OAG6COC3AGT GGCTGCOGOA CGASCATGC 

Seq ID KOt 342 Protein sequence 
protein Accession 8: XP_035292.2 

1 11 21 31 41 51 

I 1 I I I I 

MAGAGPKRRA LAAPAAEEKE BAREKMLAAK SADGSAPAGE GBQVTLQRNI TLLKGVAIIV 60 
GTXIGS61FV TPTGVUCBAO SPGLALWWA ACXnTFSIVGA LCYAELGm SKSGGDYAYM 120 
LBVYGSLPAF LKLHXELLXZ RPSSOYXVAIt VFATYXiLKPL FPTCFVPEEA AXLVAOiCVL 180 
LLTAVMCYSV KAATRVQDAF AAAKLUALAL ZILLGFVQXG KGDV8NLDPN FSFBGTKUDV 240 
ONXVLALYSG LPAYGGWNYIi NPVTEEMINP YRNLPIAIII 9LPXVTLVYV LTNLAYPTTL 300 
STEQMIiSSEA VAVDFGNYHIi OVMSWIIPVP VGLSCPGSVN GSLPTSSRLP FVGSREGHLP 360 
SILSMIHPQL LTFVPSLVFT CVMTLLYAFS KDIFSVXNFF SFFNWLCVAL A1ZGMI1«LRH 420 
RKPELBRPIK VNIiAItPVPFX LACLFLIAV8 FHKTPVBOGI GFTXXLSQLP VYPFGVHHXN 480 
KPKWLLQGIF STTVLOQKLM QWPQET 

Seq ID NO: 343 DNA sequence 

Nucleic Acid Accession «t NM_O0S268.1 

Coding sequence x 168 . . 989 

1 11 21 31 41 51 

11)111 

TAAAAAGCAA AAGAATTOGC GGCCGC6TC0 ACACGGGCTT CCCCX3AAAAC CTTCCC06CT 60 

TCTG6ATATG AAATTCAA6C TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAOCXIAGQAO 120 

AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCX3TGG GTCCACXrATG AACT6GAGTA 180 

TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGQ CGCATCTGGC 240 

TGTCTCTGGT CTTCATCTTC GGOGTGCTGG TGTACCTOGT GAOGGCCOAQ CGTGTGTGQA 300 

6TGAT6ACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC OQGCTGCTCC AAC6TCTGGT 360 

TT0AT6AGTT CTTCCCTGTG TCCCATGIGC GGCTCTGOGC OCTGCAOCTT ATCCTGGIGA 420 

CATGCCCCTC ACTGCTC36TO GTCATGCACX3 TGGCCTACCG GGAGGTTCAO GA6AAGAGGC 480 

ACCX3AGAAGC CCATGC^AQ AACAGTGGGC GCCTCTACCT GAACCCXXX5C AAGAAGCGGG 540 

GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCXSAGCGTG GACATOGCCT 600 

TTCTCTATGT OTTCCACTCA TTCTACCCCA AATATATCXTT CCCTCCTGTG GTCAA6TGCC 660 

ACX3CAGATCC ATGTCCCAAT ATAGTGGACT OCTTCATCTC CAAGCCCTCA GAOAAGAACA 720 

TTTTCACCCT CTTCATGGTQ GCCACAGCT6 CCATCTOCAT CCTGCTCAAC CIt3QTGGAGC 780 

TCATCTACCT GGTGAQCAAO AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 840 

TGTGCACAGG TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900 

CGGOTGACCT CATCTTTCTG GGCTCAQACA GTCATCCTCC TCTCTTACCA GACQ GCCC CC 960 

GAGACCATGT GAAGAAAACC ATCTTGTGAG GGQCTQCCTO GACTGGTCTG GGAOGTTGGG 1020 

CXrrOGATGGG GAGGCTCTAG CATCTCTCAT AGGT6CAAGC TGAGA6TG6G GGAGCTAA6C 1080 

CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGA06CTC TGGGAGGCAa TTCCTA6TCC 1140 

TCAACTCCAQ CCACCTGCCC CAGCTCGACX3 GCACTGGGCC AOTTCOCOCT CTGCTCTGCA 1200 
GCTC3GGTTTC CTTTTCTAGA ATGGAAATAO TGAGGGCCAA TQC 

Seq ID NO: 344 Protein sequence 
Protein Accession fti NP_0052S9.1 

1 11 21 31 41 51 

i I I I I t 

MNWSIFBGLL 8GVNKY8TAF GRIWLSI*VFX PRVLVYLVTA BRVWSDDBKD FDCNTRQP6C 60 
SNVCPDBFFP VSHVRLWALQ LILVTCPSIiL WMHVAYREV QBRRHREABG ENSGRLYtHP 120 
GKKRGGIiWWT YVCSLVFKAS VDIAPLYVFH SFYPKYILPP WKCHADPCP NIVDCPISKP 180 
SEXNXFTI<FM VATAAICILI. NLVSIiIYLVS KRCHECIiAAR XAQAMCT6HH FHGTT6SCKQ 240 
SDUjSGDLIF LGSDSHPPLL FDRPHDHViaC TIL 

Seq ID NOi 345 DNA sequence 

Nucleic Acid Accession #t NM_002391.1 

Cbding sequence: 26.. 457 " 

1 11 21 31 41 51 

111)11 

0Q6GC0AAGC AGCGCGGGCA GOQAGATGCA GCACOGAGGC TTCCTCCTCC TCACCCTCCT 60 

QGCCCTGCTG GCX3CTCACCT CXX3CGQT0GC CAAAAAGAAA GATAAGGTGA AGAAGGGOQG 120 

CCCGGGGAGC GAGTGCGCTG AGTGG6CCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

OGGOGT GG GT TTCCGCGAGG 0CACCTGCX3G GGCCCAGACC CAGOGCATCC GGTGCAGGGT 240 

OCCCTGCAAC TGGAAGAAGG AGTTTGGAGC OQACTGCAAG TACAAOTTTG AGAACTGGGG 300 

TGOGTGIGAT 6G666CACAG GCACCAAAGT C06CCAAGGC AGCCTGAAGA AGGCGCGCTA 360 

CAATGCTCAG TQCCAGGAOA CCATCCGCGT CACCAA6CCC T6CACCC0CA AGACCAAAGG 420 

AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GOACTAGAOG CCAAGGCTGO ATGCCAAGGA 480 

GOCCCTGGTG TCACATQGGG CCTGGCCAOa CCCTCCCTCT CCCAG6CC0G AGATGTGACC 540 

CACCAGTGCC TTCTGTCTGC TOGTTAGCTT TAATCAATCA TGCCCTGCCT TGTOCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTOCCCAAA GTGGGGAGGQ ACAAGGGATT CTGGGAAGCT 660 

TOAQCCTCCC CCAAAGCAAT CTGAOTCOCA GA6CCCGCTT TTGTTCTTCC CCACAATTCC 720 

ATTACTAAGA AACACATCAA ATAAACTGAC rri TTOOOOC CAATAAAAGC TCTTCTTTTT 780 
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TAATAT 

Seq ID NOs 346 Protein sequence ^ 
Protein Accession 8i NP.002382.1 

1 11 21 31 41 SI 

I I I 1 1 I 

MQHROPLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AMOPCTPSSK DCGV6FREGT 60 
08AQTQRIRC RVPOW KKBP GAOCKYKFEN NGACDGGTGT KVRQGTXiKXA RYHAQOQETI 120 
RVTKPCTPRI KAXAKAKKSK GKD 

Seq ID NOi 347 DMA sequence 

Nucleic Acid Accession S: NM_006783.I 

Coding sequencei 1..786 

1 11 21 31 41 51 

I I 1 I I 1 

ATGGATTGGG GGAOGCTGCA CACTTTCATC GGOGOTGTCA ACAAACACTC CACCAGGATC 60 

GOOAAaQTOT GGATCAGAOT CATCTTTATT TTCCGAGTCA TGATCCTAGT OGTGOCTGCC 120 

CAGGAAGTGT GGGGTGA06A GCAAGAGQAC TT06TCTGCA ACACACTGCA ACGOGGATGC 180 

AAAAATGTGT GCTATGACXyi C T m ' l 'CCCQ GTGTCCCACA TCOSGCTGTG GGCCCTCCAG 240 

CTGATCTTOO TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATOTGGCCTA CTACAGGCAC 300 

GAAACCACTC GCAAGTTCAG GCX3AGGAGAG AAOAGGAATG ATTTCAAAGA CATAGAGGAC 360 

ATTAAAAAOC ACAAGGTTG6 GATAGAGGGO T08CIGXG6T GGACSTACAC CAGCAQCATC 420 

TTTTTCOQAA TCATCTTTGA AGCAOCCTTT ATGTATGTOT TTTACTTCCT TTACAATGGG 480 

TACCACCTGC CCTGGGTOTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 540 

TTTATTTCTA GGCCAACAGA GAAGACOGTO TTTACCATTT TTATGATTTC TGCGTCTGTG 600 

ATTTGCATGC TGCTTAAOGT GGCAGA6TTG TGCTACCTGC TQCTQAAAGT GTGTTTTAGG 660 

A6ATCAAAGA GA6CACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA OQ AGAGT RAG 720 

CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AGGTTTOCCA 780 
AGCTAA 

Seq ID MO I 348 Protein sequence 
Protein Accession #t NP_006774.1 

1 11 21 31 41 51 

I i i I I ) 

KDWGTLHTFI GGVNKH8TSX GRVWZTVXFI FRVMILWAA QEVHGDEQED WOmJQPGC 60 

HNVCVDHPFP VSHIRLWALQ LIFVSTPAXiL VANUVAYYRH BTTRKFRRGB KRKDFRDXBD 120 

IKKKKVRZEG SLHWTYTSSI FFRIIPBAAP MyVFYFLYNG YHLPtlVLKOG XDPCFm«VDC 180 

FX6RPTBKTV FTZFNX8ASV ICMLUNVABL CYLIiIiKVCFR R8XRAQTQKN HPNHALKB8K 240 
QNEKNELI8D SGQMAXTGFP S 

Seq ID NOt 349 DMA sequence 

Nucleic Acid Accession NM_002571.1 

Coding sequencei 99.. 587 

1 11 21 31 41 51 

1)11)1 

CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC A6CCAT0CT0 TGCCTCCTGC 60 

TCACCCT6QG GGTGGCCCTQ QTCTGTGQTG TCCOGGCCAT GGACATCCCX: C31GACCAAGC 120 

AGGACCTGaA OCTCCCAAAG TTG6CAGGGA CCTGGCACTC CATGGCCATG GCX3ACCAACA 180 

ACATCTCXXrr CATGGOSACA CTGAAGGCCC CTCTGAGOGT CCACATCACC TCACTGTTGC 240 

CX:ACCCCCaA GGACAACCTG GAQATCGTTC TGCACAGATG GGAGAACAAC AOCTOTGTTQ 300 

AGAA6AAGGT CCTTGGAGAG AAGACTGG6A ATCCAAAGAA OTTCAAGATC AACTATAC8G 360 

tggogaacga ggccaogctg ctoqatactg actaogacaa r n xxr r GrrT ctctgcctac 420 

AGQACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTOGCCAGA OTCCTQGTGG 480 

AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT S40 

GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCOGCCT 600 

CCAGGAAGAC CAGACTCCCA CCCTTCCACA CCTCCAGAQC AGTGGGACTT CCTCCTGCCC 660 

TTTCAAAGAA TAACCACAGC TCAGAAGACX3 ATGAGGTGGT CATCrSIGTC GCCATOCCCT 720 

TCCT6CTGCA CACCTGCACC ATTGCXyVTGa GGAOQCTGCT O CCTGGGQ G C AGAOTCTCTG 780 
GCAOAGGTTA TTAATAAACC CTTGGAGCAT G 

Seq ID NO: 350 Protein sequence 
Protein Accession «i MP_002562.1 

1 11 21 31 41 51 

I 1 ) ) 1 1 

MDXPQTKQDL BLPKLAGTWH SMAMATNNXS LMATLKAPLR VHXT8LLPTP EDMIfXVLKR 60 
WBNNSCVBKK VLGEKTGNPK KFKXNYTVAN BATLLDTDYD NFLFLCLQDT TTPIQSMMGQ 120 
YLARVLVEDD BIMQGFIRAF RPLPRBLHYL LDLKQKEBPC RF 

Seq ID NOt 351 DKA sequence 

Nucleic Acid Accession <}: NM_006500.1 

Coding sequencei 2 7.. 1967 

1 11 21 31 41 51 

11)1)) 

ACTTGOGTCT CGCCCTCCGG CCAAGCATGO GGCTTCCCAG GCTGOTCTGC GCCTTCTT6C 60 
TCGCQGCCTG CTGCTGCTGT CCTCGCGTOQ CGGGTGTGCC COGAGAGGCT GAGCAGCCTG 120 
OQCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAQCCCT TCTGAAGTGC GQCCTCTCCC 180 
AGTCCCAAGG CAACCTCAGC CATGTCGACT GQTTTTCTGT CCACAAGGAO AAGCGGACGC 240 
TCATCTTCOO TGTGCX^CCAG GGCCAGGOCC AGAGGGAACC TGGGGAGTAC GAGCAGCGGC 300 
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TCAGCCTCCA GGACAGA660 GCTACTCTGQ CCCTGACTCA AGTCAOOCCC CAAGACGAGC 360 
GCATCTTCTT GTGCCAGGGC AAGCOCCCTC QGTCCCAGGA GTACC3GCATC CAQCTCCGCG 420 
TCTACAAAOC TCOBGAGGAa CCAAAC3VTCC AGGTCAACCC CCTOgG CATC CCTCTGAACA 480 
OTAAOQAGGC TGAG6A0GTC GCTACXTQTO TAOGQAGGAA OGGGTACCCC ATTCCTCAAO 540 
5 TCATCtGGT A CAAOAATGQC C G QCCTCT Q A AGGACGAGAA GAACGGGOTC CACATTCAGT 600 
03T0CCA0AC TGTGGAGTCG AGTGGTTTOT ACACCTTOCA GAGTATTCTO AAGGCACAOC 660 
TGGTTAAAGA AGACAAAGAT OCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 
GGAACCACAT GAAGGAOTCC AGGGAAGTCA CCGTCOCTOT TTTCTACCCG ACA6AAAAAG 780 
^ ^ TGTOGCTGQA AGtGGftGCCC GTOGGAATCC TOAAGQAAGG GGACGG0GT6 GAAATCAQOT 840 
10 GTTTGGCTGA TG6CAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 
GGGAGGCAGA GGAAGAGACA ACXTVAOGACA ACX3GGGTCCT GGTGCTG6AG CCTGCCCGGA 960 
AGGAACACAG TOGGOSCTAT GAATGTCAGG CCTGGAACTT GGACACCATO ATA TOGC TGC 1020 
TGAOTQAACC ACAGGAACTA CTGGTGAACT ATGTGTCTQA CXSTCOQAGTQ AGTCOCGCAG 1080 
CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA OCCTGACCTQ 1GAGGCA6AG AGT ATOCAOQ 1140 
15 ACCTCGAGTT CCAGTGGCTG AGAGAAGAQA CAGACCAGGT GCTG6AAAG6 GG6CCTGTGC 1200 
TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TOGCTGCGTG GCGTCTOTGC 1260 
CCAGCATACC CGGCCTQAAC CXSCAOVCAGC TGGTCAAGCT GGCCATTTTT GGCCCCXXITT 1320 
GGATGQCATT CAAGGA6AGG AAGGTQTQGG TQAAAGAGAA TATGGT6TTG AATCTGTCTT 1380 
GTGAAGCGTC AGGGOVCCCC COGCCCACCA TCTOCTGGAA OBrCAA OGGC AOSGCAAGXG 1440 
20 AACAAOACCA AGATCCACA6 CGAGTCCTGA GCAOCXTOAA TGTCCTOGTG ACCCOGGAGC 1500 
TGTTGQAGAC AGGTGTTQAA TGCACX3QCCT CCAAOGACCT GGGCAAAAAC ACCAGCATCC 1560 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACX»OA CTCCAACACA ACCACTGGCC 1620 
TCMCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA QAGAGAAAGC 1680 
TCCCGGAGCC QGAGAGCCGG GGOGTGGTCA TC6TQGCTOT OATTOTOTGC A TOCT GOTCC 1740 
25 TQ6CGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG COGTOCAGGC 1800 
GCTCAGGGAA GCAGQAGATC ACGCTGCCCC CGTCTOGTAA GACCQAACTT QTAGTTGAAG 1860 
TTAAOTCAGA TAAGCTCCCA GAAGAQATOG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 
GGGCTCCGGG AGACCAGGOA GAQAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 
CAGCTCCCTT CCCTGCCTGG ACCATTCOCA GCTCCCTGCT CACTCTTCTC TCAOO CAAAG 2040 
30 CCTCCAAAGG GACTAGAOAO AAQCCTCCT6 CTCCCCTCAC CTGCACACCC CCTTTCAQAO 2100 
QGCCACTGOG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCX3CTTT TCAGGGACCA 2160 
OTCCACXaCC ATCTCCTCCA COTTGAGTGA AGCTCATCCC AA6CAAGGAG CCCCAGTCTC 2220 
CC6AGCGGQT AG6AQAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCrGRGCTGQ TTTCCTOOCC 2340 
35 CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAOTO AGOACACACC GGAGCC3U3GC 2400 
GCCTGCTCAT GTTSAAGTGC GCTGTTCACA CCCGCTCCGG AGAGC3VCCCC AGCGGCATCC 2460 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 
ACATTTTTTC TTrGGTCAQA AGCCAGQAAC TGOT6TCATT CCTTAAAAGA TACGTGCOGG 2580 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGGOQA 6GGGG6G0QA 2640 
40 TCACAAAGTC AGGAOGAGAC CATCCTGGCT AACAOGGTQA AACOCTGTCT CTACTAAAAA 2700 
TACAAAAAAA AATTAGCTAa GOGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCX3GAAGG 2760 
CTQAAGCAOO AGAATGGTAT GAATCCAGQA GGTOGAGCTT GCAQTQAGCC GAGACCGTGC 2820 
CACTGCACTC C»GCCTGQGC AACACAGCGA GACTCCGTCT OGAGGAAAAA AAAAOAAAAO 2880 
ACGCGTACCT GCX3GTGAGGA AGCTGGGCGC TGTTTTCQAG TTCAGGTOAA TXAGOCTCAA 2940 
45 TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTQATGGATC ACGTAAAACT GAAAG6CAGC 3000 
GGGGAGCAGA CAAA6AT6AG GTCTACACTG TCCTTCATGG GGATTAAAQC TATGGTTATA 3060 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG OCCAAATGAG 3120 
AGAAIGGTAC TTAGGQATGG AAAAOGGGGC CTGGCTAGAG CTT OGGGT GT GTGTGTCTGT 3180 
CTGTQTGTAT GCATACATAT GTGTGTATAT AT6GTTTTOT CaMSGTGTGTA AATTTGCAAA 3240 
50 TTGTTrcCTT TATATATOTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 
AAAGCTTAAT TGrOCCAOAA AATCATACAT TGCTTTTTTA TTCTACATGG 6TACCACAGG 3360 
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACOGTTTCC AGTTGGCAGC 3420 
AGAGATCAGQ GOTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA OAGCAGACAG 3480 
CTACCCTACT TTTCAGCAGC AAAACXJTCCX: GTATGACGCA GCAOGARGOa CCTGOCAGGC 3540 
55 TGTTAGCAGO AGCTATGTCC CTTCCTATCQ TTTCOGTCCA CTT 

8eq ID NOt 352 Protein sequence 
Protein Accession #: NPJ)06491.1 

60 1 II 21 31 41 51 

I I I I t I 

GLPRIjVCAFL LAACCCCPRV AGVPGBAEQP APBLVBVEVQ STALLKCGLS QSQOJLSHVD 60 

HPSVHKEKRT LIPRVRQGQQ QSBPGEyEQR LSLQDRGATL ALTQVTPQDE RIFIjGQGKRP 120 

RSQBYRIQLR VYKAPBBPNI QVNPLGIPVN SKEPEEVATC VGRNGYPIPQ VIWYKNGRPL ISO 

65 KEBKKRVHIQ SStyrVBSSGL YTLQSIWCAQ LVKBDKDAQF YCBOmiLPS GNHMKESREV 240 

- TVPVFYPTEK VWLEVEFVGM LXEGDRVEZR CLADGNPPPH FSISKQNPST REAEEETTtn> 300 

NSVLVIiBPAR KEHSGRYEOQ AHNUXCMISL IiSEPQBIiIiVN YVSDVRV8PA APERQEGSSL 360 

TLTCEAESSQ DIjBFQWLRBB TDQVLERGPV LQLHDIjKRBA GGGYRCVASV PSIPGLNRTQ 420 

LVKLAIPGPP WMAPKBRKVM VKENMVtNLS CEASGHPRPT ISWNVNQTAS EQDQDPQRVI. 480 

70 STLNVLVTPB LLETGVECTA SNDLGKNT8I LPLBLVMIiTT LTPDSNTTTG LSTSTASPHT 540 

SANSTSTERK LPBPBSR6W IVAVXVCILV IAVL6AVLYF LYKROKLPCR RSGXQBXTLP 600 
PSRXTELWB VIC8DKLPEB4 GXiLQGSSGDK RAPGDQOBRY IDLRH 

Seq ID NO: 353 DNA sequence 
75 Nucleic Acid Accession §« NM_003183.3 
Codii^ sequence: 165.. 263 9 

1 11 21 31 41 51 

80 TCaAOCCTGa CGGTAGAATC TTCCCAGTAG GOGGCGOGGG AGGGAAAAGA GG ATTG AGGG 60 

GCTAOGGOGO 60G6ATCC00 TCCTCXXCCXS ATGTGAGCAG TTTTCCGAAA CXXOOTCAGG 120 

GQAAGOCTGC CCAGAC2AGGT GGAGTOGGTA GCGGGGCCX3G GAACATGAGG C3 U3TCT CTCC 180 

TATTCCTtSAC CAOOGTGOTT CCTTTOGrGC TGGOGCOOCO ACCTCOGGAT GACCOSGGCT 240 

TOGQCCCCCA CCAGAGACTC GAGAAGCTTQ A TTCTTTGCT CTCAGACTAC GATATTCTCT 300 
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CTTTATCTAA TATCCAGCAG CATTCGGTAA GAAAAAGAGA TCTACAGACT TCAACACATO 360 
TAGAAACACT ACTAACTTTT TCAGCTTTGA AAAGGCATTT TAAATTATAC CTGACATCAA 420 
GTACTGAACQ TTITTCACAA AATTTCAAGa TOGTGGTGGT G QATOG TAAA AACGAAAGCG 4 BO 
AGTAC3VCTGC AAAATGGCAG GACTTCTTCA CTGGACACST GGTTGGTGAO CCT6ACTCTA 540 
GGGTTCTAGC CCACATAA6A GATGATGATG TTATAATCAQ AATCAACACA GATG GGOCO G 600 
AATATAACAT AGAGOCACTT TGQAGATTTG TTAATGATAC CAAAGACAAA AGAATGTTAG 660 
TTTATAAATC TGAAQATATC AAGAATGTTT CACGTTTGCA GTCTCCAAAA GTGTGTGGTT 720 
ATTTAAAAGT GGATAATGAA GAGTTGCTCC CAAAAGOGTT AOTAOACAOA GAAC CACCTG 780 
AAGAGCTTGT TCATCGAGTG AAAAGAAGAG CTGACCCAGA TCCCATGAAG AACAOGTGTA 840 
AATTATTQGT GOTAQCAGAT CATCGCTTCT ACAGATACAT GGGCAGAGQG GAAOAQRGTA 900 
CAACTACAAA TTACTTAATA GAGCTAATTG ACAGAGTTGA TGACATCTAT CGGAACACTT 960 
CATGG6ATAA HSCMMIl T n ' AAAGGCTATG GAATACAGAT AGAGCAQATT CGCATTCTCA 1020 
AGTCTCCACA AGAGGTAAAA CCTGGTGAAA AOCACTACAA CATGGCAAAA ACTTA CCCAA lOBO 
ATGAAGAAAA GGATGCTTGG GATGTQAAGA TGTTGCTAGA GCAATTTA6C TTTOATATAG 1140 
CTGAGGAAQC ATCTAAAGTT TGCTTGGCAC ACCTTTTCAC ATACCAAOAT T TTGATATG G 1200 
GAACTCRGO ATTAGCTTAT GTTGQCTCTC CCAGAGCAAA CAGCCATGGA GGTGTTTGTC 1260 
CAAAOOCITA TTATACCCCA OTT6QGAAGA AAAATATCTA TTTGAATAGT GGTTTGACGA 1320 
GCACAAAGAA TTATGGTAAA ACCATCCTTA CAAAGQAAGC TQACCTGGTT ACAA CTCATG 1380 
AATTGGGACA TAATTTTGGA OCAGAACATQ ATCCGGATGG TCTAGCAQAA TGTOCCCCGA 1440 
ATOAQGACCA GGGAGGGAAA TATGTCATGT ATCCCATAGC TGT6AGTG6C GATCACGAGA 1500 
ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAATCTA TAAGACCATT GAAAGTAAGG 1560 
CCC3UWA0TO TTTTCAAGAA OGCAGCAATA AAOTTTOTGG GAACTCXSAGG GTGGATGAAQ 1620 
GAGAAGAGTQ TGATCCTGGC ATCATGTATC TGAACAACGA C ACCTG CTGC AACAG CGACT 1680 
GCAOGTTOAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAQ TCCTTGCTGT AAAAA CTOT C 1740 
AGTTTGAGAC TGCCCAGAAG AAGTGCCAGQ AGGCGATTAA TGCTACTTGC AAAGGCOTOT 1800 
OCTACraCAC AOQTAATAGC AGTGA6TGCC OGCCTCCAGG AAATQCTQAA AATGACACTG 1860 
TTTGCTTGGA TCTTGGCAAG TOTAAGGATO GGAAATGCAT CCCTTTCTGC GAGAGGOAAC 1920 
AGCAGCTGGA QTCCTGTQCA TGTAATGAAA CTGACAACTC CXGCAAGOTO TGCTG CAGGG 1980 
ACCTTTCTGG CCGCTGTGTG CCCTATGTCQ ATOCTGAACA AAAGAACTTA TTTTTGAGGA 2040 
AAGGAAAGCC CTGTACAGTA GGATTTTGTG ACATGAATGG CAAATGTGAO A AACQAG TAC 2100 
AGGATGTAAT T6AACGATTT TOGGATTTCA TTGACCAGCT GAGCATCAAT ACTTTTGGAA 2160 
AGTTTTTAGC AQACAACATC OTTOGGTCTG TCCTGGTTTT CTCCTTGATA TTTTGGATTC 2220 
CTTTCAQCAT TCTIGTCCAT TGTGTGQATA AGAAATTGGA TAAACAGTAT GAATCTCTQT 2280 
CTCTGTTTCA CCCCAGTAAC GTCGAAATGC TGAGCAGCAT GGATTCTGCA TOGOTTOaCA 2340 
TTATCAAACC CTTTCXTTGCXS CCCCAGACTC CAGGCCGCCT 6CAGCCTGCC CCTGTQATCX: 2400 
CTTCOOCGCC AGCAGCTCCA AAACTGGACC AOCAORGAAT GGACACCRTC CAGGAAGACC 2460 
CCAGCACAGA CTCOCATATG GAC6AGGAT6 GGTTTGAGAA GGACCCXTTTC CCAAATAGCA 2520 
6CACAGCTGC CAAGTCATTT GAGGATCTCA OGGACCATCC GGTOGCCAGA AGTGAAAAGG 2580 
CTGCCTCCTT TAAACTGCAG OGTCAGAATC GTGTTAACAG CAAAGAAACA GAGTGCTAAT 2640 
TTAGTTCTCA GCTCTTCTGA CTTAAGTOTG CAAAATATTT TTATAGATTT GACXTTACAAA 2700 
TCAATCACAG CTTOTATTTT OTOAAGACTO 06AAGTGACT TAGCAGATGC TGGTCATGTG 2760 
TTTGAACTTC CTOCAGGTAA ACAUTILTIU TGTGOTTTGO OCCTTCTCCT TTTGAAAAGG 2820 
TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT TTTAGTTTTT AAAA TATCTT 2880 
TTGACCTGTG GTQCAAAAGC AGAAAATACA GCTGGATTGG GTTATQAATA TTTAcu-i iri i - 2940 
TGTAAATTAA TCTTTTATAT TGATAACAGC ACTGACTAGG GAA ATGATC A GTTTTTTTTT 3000 
ATACACTGTA AT6AAC06CT GAATATGAA6 CATTTGGCAT TTATTTGTOA GAAAAOTGGA 3060 
ATAGTTTTTT ' nTlTmiJ TTTTTTTTGC CTTCAACTAA AAACAAAGGA GATAA ATTTA 3120 
GTATACATTO TATCTAAATT GTGGGTCTAT TTCTAOTTAT TACCCAGAGT TTTTATGTAa 3180 
CAGGGAAAAT ATATATCTAA ATTTAQAAAT CATTTGGQTT AATATGGCTC TTCATAATTC 3240 
TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GQGCTATACA TGGTAGCCAQ 3300 
TTOAATTTAT 6QAATCTACG AACTGTTTAO G6C0CTGATT TOCTGGOCAO TTTTTCTGTA 3360 
rrrTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT AGAAAA TTCA 3420 
CTArroOCTG GOAOTOGTGO CTCATGCCTG TAATCCCftGC ACITOOAOAO 0CTQA6GTTQ 3480 
060CACTACA CT0CAGCCT6 GOTGACAGAO TQAOATCTGC CTC 

Seq ZD NO: 354 Protein sequence 
Protein Accession it NP_003174.2 

1 11 21 31 41 51 

I I I I I I 

MRQSLLFIiTS WPFVIiAPRP PDDFGFGPHQ RLEfCLDSLLS DYDZLSLSNI QQHSVRKRDL 60 

QTSTHVBTLL TPSALKRHFK LVI»TSSTE31P SQMPKWWD 0KNBSBY7AK WQDPPTGHW 120 

GBPDSRVLAH IRDDDVIIRI NTDGAEVNIB PLWRFVNDTK DKRMLVYKSE DIKNV8RLQS 180 

PKVCXSYLKVD NEBLLPKQLV DRBPPEEIiVH RVKRRADPDP MKNTCKLLW ADHRPYRYMO 240 

RGBBSTTTMY LIBLIDRVDD lYRNTSWDNA GPKOYGIQIB QXRILKSPQE VKPGEaCHVNM 300 

* AXSYPNBBKD AWDVKMLLBQ FSPDIAEEA8 KVCLAHLFTY Q DPPilQT LGL AYVG8PRAHS 360 

HGGVCPXAYY SPVGKKNIYL HSGLT8TKNV GKTILTKBAP LVTTHBLGHN POAEHDPDGIi 420 

AECAPNBDQG GKYVMYPIAV 8GDHENNKMP SMCSKQSIYK TIESKAQECP QBRSNKVCGN 480 

8RVDEGEECD PGIMYLNNDT CCNSDCTLKB GVQCSDRNSP CCKNOQFETA QKKCQBAINA 540 

TCKOVSYCTG KSSECPPP6N AHIDTVOiDI* GKCKDGKCIP FCBRBQQLES CAQIBTDNSC 600 

KVCCRDI.S6R CVPYVDAEQK NLPUtKGKPC TVGFOIMNGK CBKRVQDVXB RFNDFZDQLS 660 

INTFGKPLAD NIVGSVLVF8 LIFHXPFSXL VHCVDKKIiDK QYBSLSLFKP SHVBCi SSMD 720 

8ASVRIXKPF PAPQTPGRIiQ PAPVXPSAPA APKU3BQRMD TXQEDPSTDS HMDEDOFBXD 780 
PFPNS8TAAK 8FEDLTDHPV ARSBKAASFK XiQRQNRVNSK ETBC 

Seq XD NO I 355 UNA sequence 

Nucleic Add Accession it NM_02 1832.1 

Coding sequence i 164.. 2248 

1 11 21 31 41 51 

I I 1 I I I 

TGGAGCCTOG OGOTAOAATC TTCCXAOTAO 6CX560GOG6O AGQAAAAGA6 G ATTQ AGGGG 60 

CTAGGCOQGG OGGATCCCQT CCTCCCCOGA TGTGAOCAGT TTTCXXSAAAC CCOOTCAGGC 120 

GAAGOCroCC CAGAOAGGTO OAGTCGQTAG aaSGGCCGGO AACATOAOOC AGTCICTCCT 180 
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ATTCCTGACC AGCGTGOTTC CTTTOGrTGCT GGCGCCGCGA CCTCCGGATG ACCCQGGCTT 240 

OGGCCCCCAC CAGAGACTCG AGAAGCTTGA Trcm G CT C TCAOACTAOG ATATTCTCTC 300 

TTTATCTAAT ATCCAGCAOC ATTCGGTAAG AAAAAGAQAT CTACAQACTT CAACACATGT 360 

AOAAACACTA CTAACTTTTT CAGCTTTGAA AAGGCATTTT AAATTATAiX TCACATCAAG 420 

TACTGAAOQT mTCACAAA ATTTCAAGOT OOT GO T GG TG GATGOTAAAA ACGAAAGCOA 480 

QTACACTGTA AAATOGCAGG ACTTCTTCAC TGGACACGTG GTTGGTQAGC CTQACTCTAG 540 

GOTTCTAGCC CACATAAGAG ATQATOATGT TATAATCAQA ATCAACACAG ATG GGGCO GA 600 

ATATAACATA GAGCCACTTT GOAGATTTGT TAATGATACC AAAGACAAAA GAATGTTAGT 660 

TTATAAATCT OAAGATATCA AOAATC3TTTC AOCmTaCAQ TCTCCAAAAG TGTGTGGTTA 720 

TTTAAAAGTG- GATAATQAAG AOTTGCTOCC AAAA6GGTTA GTAGACAGAG AACCACCTGA 780 

AGAGCTTGTT CATCGAGTGA AAAGAAQAGC TGACCCAGAT CCCATQAAGA ACACGTGTAA 840 

ATTATTGGTG GTAGCAQATC ATOGCTTCTA CAGATACATG GGCAQAGGGG AAGAGAGTAC 900 

AACTACAAAT TACTTAATAG AGCTAATTQA CAGAGTTGAT GACATCTATC GGAACACTTC 960 

ATGG6ATAAT GCSUSGTTTTA AAG6CTATGG AATACAGATA GAGCAGATTC GCATTCTCAA 1020 

GTCTCCACAA QAGGTAAAAC CT66TGAAAA GCACTACAAC ATGGCAAAAA 6TTACCCAAA 1080 

TGAAGAAAAG GATGCTTGGG ATGTGAAGAT GTT6CTAQAG CAATTTASCT TTGATATAGC 1140 

TGAGOAAGCA TCTAAAGTTT GCTTQGCACA CCTTTTCACA TACCAAQATT TTQATATGGG 1200 

AACTCTTGGA TTAGCTTATG TTGGCTCTCC CAGAOCAAAC AOCCATGQAG GTGTTT GTCC 1260 

AAAGGCTTAT TATA6CCCAG TTGOQAAGAA AAATATCTAT TTGAATAGTG GTTTGAG6A0 1320 

CACAAA6AAT TATGOTAAAA CCATCCTTAC AftAGGAAGCT 6ACCTGGTTA CAACTCATOA 1380 

ATTOGGACAT AATTTTGGAG CAGAACATGA TCCGGATGGT CTAGCAGAAT GTGCCCOGAA 1440 

TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT QTQAGTGQCG ATCACGAGAA 1500 

CAATAAGATQ TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 

CCAGGAGTOT TTTGAAGAAC GCAGCAATAA AGTTTGTGGG AACTCQAGG6 TGGATQAAGO 1620 

AGAAGAGTGT GATCCTGGCA TCATQTATCT GAAC3VACX3AC ACCTGCTGCA ACAGOGACTQ 1680 

CACGTTGAAG 0AA6GTGTCC AOTGCAGTGA C3VGQAACAGT CCTTGCTGTA AAAACTGTCA 1740 

GTTTQAGACT GCCCAGAAGA AGTGCCaGGA GQCGATTAAT GCTACTTGCA AAGGCGTGTC 1900 

CTACTGCACA GGTAATA6CA GT0AGT6CCC GCCTCCAGQA AATGCTGAAG ATGACACTOT 1860 

TT0CTTG6AT CTTGOCAAGT GTAAGGATGO GAAATOCATC CCTTTCTGOO A6A0GQAACA 1920 

GCAGCTGGAO TCCTGTGCAT OTAATOAAAC TGACRACTCC TOCAAGGTGT GCTG CAQQQA 1980 

CCTTTCCGGC CX3CTGTGTGC CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGQAA 2040 

AGGAAAGCCC TGTACAGTAO GATTTTOTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 

GGATQTAATT GAACXATTTT GQGATTTCAT TGACCAGCTG AG CATC AATA CTTTTGGAAA 2160 

GTTTTTAGCA 6ACAACAT00 TTGGGTCIOT CCTGGTTTTC TOCTTGATAT TTTGGATTCC 2220 

TTTCAGCATT CTTOTCCATT OTOrGTAACG TCGAAATGCT GAGCAOCATG OATTCTGCAT 2280 

CGQTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCXXJCXTTG CAGCCTGCCC 2340 

CTGTGATCCC TTCGGCGCCA QCAGCTCCAA AACTQGACCA CCy«5AGAATG GACA CCATC C 2400 

AGGAAGACX:C CAGCACAOAC TCACATATGG ACGASGATGQ GTrrOAGAAG GACCCCTTCC 2460 

CAAATAGCAG CACAGCTGOC AAOTCATTTa AGOATCTCAC G6ACCATC06 GTCAOCAGAA 2520 

GTCAAAAGGC TOCCTCCTTT AAACTGCAGC GTCAGAATOQ TGTTGACAQC AAAGAAACAG 2580 

AGTGCTAATT TAGTTCTCAG CTCTrCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 

ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGATGCTG 2700 

GTCATOTGTT TGAACTTCCT GCAGGTAAAC AQTTCTTGTG TGQTT TGGCC CTT CTCCTTT 2760 

TGAAAAGQTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGOTT TTAOTTTTTA 2820 

AAATATCTTT TGACCTGTGG TGCAAAAGCA 6AAAATACA0 CTGGATTGG6 TTATOAGTAT 2880 

TTACGTTTTT GTAAATTAAT CTTTTATATT QATAACAGGC ACTGACTAGG GAAATGATCA 2940 

UTl-rmT TT ATACACTGTA ATGAACCX3CT GAATATGAAQ CATrTOOCAT TTATTTGTGA 3000 

QAAAAGTGGA ATAGTTTTTT rrrrr TT T TT TTTTTTTTGC CTTCAACTAA AAACAAA6GA 3060 

GATAAATTTA 6TATACATTQ TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACOCAGAGT 3120 

TTTTATOTAQ CAGGQAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 

TTCATAATTC TAAGACTAAT OCTCAaAACXI TAACCACTAC CTTACAGTGA GG GCTAT ACA 3240 

TGGTAGCCAQ TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAQ 3300 

TTTTTCTGTA TTTTATAAGT ATCTTCAT8T ATCCCTGTTA CTGATAGGGA TACMOTCTT 3360 

AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTGATGCCTO TAATOCCAOC ACTTGaAGAO 3420 
3421 GCTGAGGTTG CGCCACTACA CTCCAOCCTG GGTGACAaAO TOAQATCTGC CTC 

8eq ID NOi 356 Protein sequence 
Protein Accession if: IIPJ)68604.1 

1 11 21 31 41 51 

I I i I I I 

MRQSLLFLT8 WPFVLAPRP PDDPGFOPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL 60 

QTSTHVBTLIj TPSALKRHFK LYLTSSTERF SQMFKVWVD GKHBSBYTVK WQDFFTQIW 120 

GEPDSRVLAH IHDDDVIIRI NTDOAEYNIE PWIRPVMDTK DKRHLVYKSE DI1QIVSRIX2S 180 

PKVCXSYLKVD NBELLPKGLV DREPPBELVH RVKRRADPDP MKNTCKLliW ADHRPYRYMG 240 

ROBBSTTTNY LIELIDKVDD lYRNTSWDNA GPK6YGIQIB QIRILKSPQE VKPGKKHYNM 300 

AKSYPNEEKD AWDVKMLLBQ PSPDIAEEA8 KVCIAHLPTY QDFDMGTLQL AYVGSPRANS 360 

HOOVCPKAYY SPVQKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHEI/aJN PQABHDPDGL 420 

AEC3«*NEDQG GKYVMYPIAV SOTHENMKMF SNCSKQSIYK TIESKAQBCP QERSNKVOGN 480 

8KVDBGEECD POIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQPETA QKKOQBAINA 540 

TCKGVSyCTO H8SECPPPGN AEDDTVCIjDIi GKCKDGKCXP FCBR EQQLBS CACMETDNSC 600 

KVCCRDItSOR CVPyVDABQK HLFLRKGKPC TVGFCDMKGK CBKRVQDVIE RFWDFIDQLS 660 
INTFGKFLAD NIVGSVLVPS LIFWIPPSIL VHCV 

Seq ID NO: 357 DMA sequence 

Nucleic Acid Accession ft: 1IM_004994.1 

Coding sequence : 2 0 2 143 

1 11 21 31 41 51 

I 1 I I 1 I 

AOACAOCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTO GTCCTGGTGC TCCTGGTGCT 60 
GG6CTGCT6C TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTOCTCT TCCCTGGAGA 120 
CCTGAGAACC AATCTCACCQ ACAGGCAGCT GGCAGAGQAA TAC CTOTA CX! GCTATGGTTA 180 
CACTGOGGTG GC3W5AGAT6C GTGGAGAGTC GAAATCTCTQ OGGCCTGOGC TGCTGCTTCT 240 
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PCTAJS02/36810 



GCAGAAGCAA CTOTCCCTGC COGAGACOGQ 
GCCAACCCCA CGGTGCGCSGa TCCCAGACCT 
CAAGT66CAC CACCACAACA TGACCTATTG 
G60C3GTGATT OACGAOGCCT TTGCGOGOSC 
5 CACCTTCACT 060GTGTACA GC06CX3A06C 
GCA0G6A0AC GGGTATCCCT 70GA0G6GAA 
TGGCCCCGGC ATTCAGGGAG AOGCCCATTT 
GGGOGTOGTG GTTCCAACTC GGTTTGGAAA 
- CATCTTOQAG GGOCG C TCCT ACTCTQCCia 
10 CTGGTGCAGT ACCAOGGCCA ACTACQACAC 
GAGACTCTAC ACCOOGOAOG GCAATGCTOA 
CCAAGGCCAA TCCTACTCOG CCTGCACCAC 
CGCCACCACC GCCAACTAOG ACCGGGACAA 
CTC38ACJG6TO ATOGGGGGCA ACTGQ6GGGG 
IS GOGTAAGGAG TACTOQACCT 0TACCAGC6A 
TACCACCTOG AACTTTGACA GOGACAAQAA 
TTTGTTCCTC GTGGCXK5CGC ATQAGTTOGG 
GCCGGAGGOG CTCATGTACC CTATGTACC3G 
CGAOGTGAAT GGCATCG6GC ACCTCTATOO 
20 AACCACCACC ACAC03CAGC CCACX3GCTCC 
TGTCCACCCC TCAGAGCGCC CCAC3\GCTGa 
AGGTCCCCCC ACTGCTGGCC CTTCTACGGC 
TQCCTGCAAC GTGAACATCT TOGA06CCAT 
- _ CAAOGATGGG AAGTACTGGC aATTCTCTGA 
25 CCTTATOGCC GACAAGTGGC CCGCXSCTGCC 
GCTCTCCAAG AAGCTTTTCX TCTTCTCTGG 
GGTGCTGGGC CCQAGGCGTC TGGACAAGCT 
06GGGCCCTC CX3GAGTGGCA GGGGGAA6AT 
GTT0QACX3TQ AAGGOGCAGA TGGTGGATOC 
30 CCCGGGGGIG CCTTTGGACA GGCAaSAOOT 
CCAGGACOSC TTCTACTGGC GOGTGAGTTC 
GGGCTACGTG ACCTATGACA TCCTGCAGTG 
GCAGTGCCAT GTAAATCCCC ACZGGOACCA 
CAAACTQGTA TT C T G TTCTG 6AGQAAAGG0 
35 TCACCTTTGT TTTTTGTTGO AGTOTTTCTA 



TGAGCTGGAT AGCQCXAOSC TGAAGGCCAT 300 

GGGCAGATTC CAAAGCTTTO AGGGOGAOCT 360 

GATCCAAAAC TACT0GGAA6 ACTT6C0606 420 

CTTCGCACTQ TGGAGCGCQG TGACGCCGCT 480 

AGACATOGTC ATCXaVGTTTG GTGTCGOGGA 540 

GGACGGGCTC CTGGCACAOO CCTTTCCTCC 600 

OGACQATGAC GAGTTGTGGT CXX7FGGQCAA 660 

OGCAOATGGC GCGGCCTGCC ACTT CCCC TT 720 

CAGCACOQAC OGTOGCTCOG AOGGCTTGCC 780 

CGAGQACCGG TTTGGCTTCT GCCCCAQOQA 840 

TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 

GGACXX3TCGC TCCGAOGGCT ACCGCTGGTG 960 

6CTCTT0GGC TTCTGCCOQA CCCXSAGCTGA X020 

GGAGCTQTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGCC60G0A GATGGGCGCC TCTQGTGCGC 1140 

GTGGGGCTTC T6CCCGGACC AAGGATACAG 1200 

CCACGOQCTG GGCTTAGATC ATTCCTCAGT 1260 

CTTCACTQAO GGGCXXXTCCT TGCATAAGGA 1320 

TOCTCGCCCT GAACCTGAGC CAGG6CCTCC 1380 

CCCGAGGGTC TQOOCCACOQ QACCCCCCAC 1440 

CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

CACTACTGTG CCTTTGAGTC CX3GTGQA0GA 1560 

CG0QGA6ATT GGGAAGCA6C TGTATTTGTT 1620 

GGGCAGGGGG AGCG0GGC8C AGGGCCCCTT 1680 

COOCAAOCrO GACTCGGTCT TTGAGGAGCC 1740 

GOGCCAGGTG TGGGTGTACA CAGGCGCX3TC 1800 

GGGCCTGGGA GCXXavCGTGG CCCAGGTGAC 1860 

GCTGCTGTTC AGOGGGCGGC GOCTCTGQAG 1920 

G06QAGGGCC A608AGGTGG AO0GGAT6TT 1980 

CTTCCnSTAC 06A0AGAAAQ CCTATTTCTG 2040 

CCGQAGTGAG TTGAAOCAGG TGGACCAAGT 2100 

CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

ACCCTGGGGA AGGAGCCAGT TTGCCX3GATA 2220 

AGQAQTGGAO GTGOGCTGGG CCCTCTCTTC 2280 
ATAAACTTGO ATTCTCTAAC CTTT 



Seq ID NO: 358 Protein sequence 
Protein Acceaeion <fs NP_004985.1 

40 1 11 21 31 41 51 

MSLWQPLVLV LLVL6CCPAA PRQRQSTLVL PPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 60 

RGESKSLQPA LLLLQKQLSL PBTGKLDSAT LKAMRTPROG VPDLGRFQTF EGDLKWHHHN 120 

ITYWIQNYSE DliPRAVIDDA FARAFAIMSA VrPLTFTRVY SHDADXVXQF GVAEHGDGVP 180 

45 FDOKDOLIAH AFPPOPGIQG DAHPDDDBLW 8L6HSWVPT RF6MADQAAC HPPPIFBGRS 240 

YSACTTDGRS DGIiPWCSTTA NYDTDDRFQF CPSBRLYTRD GNADGKPOQF PPIFQGQSYS 300 

ACTTDGRSDG YRWCATTANY DRDKLFGPCP TRADSTVMGQ NSAGBLCVFP FTFLGKBYST 360 

CTSBGRGDGR LWCATTSNFD SDKKWOPCPD QGY8LFLVAA HEFGHALGLD HS6VPEALMY 420 

PMYRFTBGPP liHKDDVNGIR HLYGPRPBPE PRFPTTTTPQ PTAPPTVCPT GPFrVBPSER 480 

50 PTAGPTOPPS AGPTGPPTAG PSTATTVPLS PVDDACMVNI PDAIAEIGKQ LYliFKDGIWW 540 

RFSEGRGSRP QGPFI,IADKW PALPRKIiDSV FBEPIiSKKLF FFSGRQVWVY TGASVLOPRR 600 

LDKLGLOADV AQVTGALRSQ RGKMLLFSGR RLWRFDVKAQ MVDPRSASBV DRMFPGVPLO 660 
THDVFQYREK AYFOQDRFYW RVSSRSELNQ VDQVGYVTYD lUJCPED 

55 Seq ID NO I 359 DNA aeouenee 

Nucleic Acid Acceasion #i NM_000213.1 
coding sequence! 127..53BS 

1 11 21 31 41 51 

60 I I I I I I 

CX3CCCGCG0G CTGCAGCCXX ATCTCCTAGC GGCAGCCCAO GC6CGGAGGG AGOGAGTCCG 60 

CCCCX3AGGTA GGTCCAGGAC GGGOOCACAG CAGCAGCCGA GGCTGGCCGG GA GAQGQ AGQ 120 

AAOAGQATGG CAGGGCCACG CXXX3U3CCCA TGGGOCAGGC TGCTCXTGGC AGCCT TGATC 180 

AGCGTCAGCC TCTCTGQOAC CTTGQCAAAC OGCTGOttGA AOGCOOCMST OAAOAGCTQC 240 

65 ACGGAOTCTG TCCGTGTGGA TAAGGACTGC GCCTACTOCA CAGACOAOAT GTTCAGGOAC 300 

. CGGOGCTGCA ACACCCAGGC GGAGCTOCTQ GCOGCGGGCT GCCAGCGGGA QAGCATOOTG 360 

GTCATGGAGA GCAGCTTCCA AATCAC3«5AG GAGACCCAGA TTOACACCAC CCTGCG GCGC 420 

AGCCAGATGT CCCCCCAAGG CCTGOSGGTC CGTCTGCGGC COGGTGAGGA GOGGCATTTT 480 
GAGCTGGAGO TGTTTGAQCC ACTGGAGAGC CCC3GTGGACC TGTACATCCT CATGQACTTC 540 
70 TCCAACTCCA TGTCCQATGA TCTGGACAAC CTCAAGAAOA TGGGOOGAA CCTGOCTOGG 600 
6TCCTGAG0C AGCTCAOCA0 03ACTACACT ATTGGATTTG GC3UVGTTTGT GGACAAAGTC 660 
AGOQTCCOGC AGAOQGACAT GAGGCCTGAG AAGCTGAAGO AGCCCTGGCC CAACAGTQAC 720 
CCCCCCTTCT CCTTCAAGAA OGTCATCAGC CTGACAGAAO ATGTGGATGA OTTCO GQAAT 780 
AAACTCCAGG OAGAGCXSGAT CTCAGGCAAC CTGGATGCTC CTQAGOGOGO CTTCGATGCC 840 
75 ATOCTGCAGA CAGCTGTGTG CAOGAGGQAC ATTQGCTGGC GCCCX3GACAG CACCCACCTG 900 
GTG6TCTTCT CCACCGAGTC AOCCTTCCAC TATQAGGCTG ATGGCGCX»A CQTGCTGGCT 960 

GGCaTCATOA GCOSCAACGA TGAAOGGTGC CACCTQGACA CCACGGGCAC CTACACCCAG 1020 

TACAGGACAC AGGACTACCC aTCQGTGCCC ACXXTGGTGC GCCXGCTC6C CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAOCT ACTAOSAGAA QCTTC ACACC 1140 

80 TATTTCCXrrO TCTCCTCACT GGGGGTGCTG CAGGAGGACT OBTCCAACAT OGTGGAGCXO 1200 

CTOGAGGAGO CCTTCAATC6 GATCCGCTCC AACCTGGACA TCOGGGCCCT AGA CAGCCCC 1260 

C6AGGCCTTC GQACAQAGGT CACCTCCAAG ATGTTCXaQA AG AOGAG GAC TGGGTCCTTT 1320 

CACA7C0GGC OOGGGGAAGT GGGTATATAC CAGGTGCAGC TOOGGGCCCT TGAGCAC3GT0 1380 

GATGGGACXIC AOOTGTOCCA GCTQCCGGAO OACCaGAAQG GCAACATCXIA TCTGAAAOCT 1440 
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TCCnCfCCXa ACQ G CCTOUV GATGGA0GCX3 GGCATCATCT GTGATCTQTG CAOCTGCGAO 1500 
CTQCAAAAAQ AGGTOOOGTC A6CTCGCTGC AOCTTCAAOG GAOACTTOGT GTGOGGACAG 1560 
T6TGTGTGCA GCX^GGGCTQ GAOTGGCCAa ACCTGCAACT GCICCACCGG CTCTCTQAGT 1620 
GAOVrrCAGC CCTG C CT G C Q GGAGGG06AG GACAASOOGT GCTOOGOCOG TGGGGAOTOC 16B0 
5 CA0T6CGGQC ACTGTGTGT G CTACGGGGAA CGCX38CTAOO AOOCSTCRGTT CTGOGAOTAT 1740 
GACAACTTCC AGTQTCCCCO CACTTCCGGG TTCCTCTGCA ATQA005RGG ACGCTGCTCC 1800 
ATGGGCCAOT GTGTQTGTGA GCCTGGTTGG ACAGGCCCAA GCTCTGACTG TCCCCTCftGC 1860 
AATGCCACCT GCATOQACAG CAATOGGGGC ATCTGTAATG GAOGTGGCCA CTOTGAGTOT 1920 
GGCCXSCTGCC ACTGCCACCA 6CA6TCGCTC TACAGGGACA CCATCTGOGA GATCA ACTAC 1980 
10 T0G60QAT0C ACCCGGGCCT CTGCGAGGAC CTACOCTCCT GOGTGCAGTG CCA6GC6TG0 2040 
GGCACGGGCG AGAAOAAOGG G06CAOC5TGT GAGGAAT6CA ACTTC3VAGGT C3^AGATGGTG 2100 
GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGOGCrGCT OCTTCCGGOA CGAGQATGAC 2160 
OACTGCACCT ACAGCTACAC CATGGAAGOT GAOGGOQCCC CTGGGCCCAA CA6CACTGTC 2220 
CXGGTGCACA AGAAGAAGGA CTGOCCTCOG OOCTCCTTCT GOTGOCTCAT CCCCCTGCT C 2280 
15 CTCCTCCTCC TGCCGCTCCr OQCCCTOCTA CTGCTGCTAT GCTGGA AGTA CTG TGCC TGC 2340 
TGCAAGGCCT GCCTGGCACT TCTOCOGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 
GAA6ACCACT ACATGCTGOG GGAGAACCTQ ATGGOCTCTG ACCACTTGGA CAOGCCCATG 2460 
CTCCGCAGCG GGAACCTCAA GGGCCGTGAC G TGGTCCGCT GGAAGGTCAC CAACAACATQ 2520 
CAGCGGCXTTG GCTTTGCCHC TCATOCOGOC AGCATCAACC CCftCAGAOCT 6GTGCCCTAC 2580 
20 GGGCTGTCCT TGCGCCTGGC CU U CCTTT GC AOOGAGAACC TGCTOAAGCC TGACACTCGG 2640 
GAQTQCGCXX: AGCTGOSCCA GGAGGTGGAO GAGAACCTGA ACXSAGGTCTA CAGGCAGATC 2700 
TCCGGTGTAC ACAAGCTCCA OCAGACCAAG TTCOGGCAGC AGCCCAATGC CG0GAAAAA6 2760 
CAAGACCACA CCATTGTGGA CACAGTGCTO ATGGCXJCCXX: GCTCGGCCAA GCCGGCCCTG 2820 
CTOAAGCTTA CAGAGAAGCA GGTGQAAGAO AOOGCCTTCC AOBACCTCAA GOTGOOCCCC 2880 
25 GQCTACTACA CCCTCACTGC AGACCACGAC GCCC66GGCA TGQTGGAOTT CCA0QAG66C 2940 
GTGGAGCTGG TGGACGTACO GQTOCCCCTC TTTATCCGGC CTGAGGATQA CGAOGAGAAO 3000 
OVGCTGCTGG TOGAGGCCAT CGAOGTGCCC GOVGGCACTG CCACCCTOGG CCGCCGCCTG 3060 
GTAAACATCA CCATCATCAA GGAOCAAQCC AQAGACGTGG TGTCCTTTGA GCAGCCTQAO 3120 
TTCTCGGTCA GCC6CG0GGA CCAGOTGGOC CGCATCCCIO TOlTCOaGOG TGTCCrGQAC 3180 
30 GGCGGGAAGT CCCA6GTCTC CTACCQCACA CatfXJATOGCA COGCGCAGGG CAACCX3GGAC 3240 
TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 
GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 
TTCCACX3TCC AGCTCAGCAA CCCTAAOTTT GGGGCCCACC TGGGCCAGCC C CACTCCAC C 3420 
ACC31TCATCA TCAGGGACCC AGATGAACTG GACOQGAGCT TCACGAGTCA GATGTTGTCA 3480 
35 TCACAGCCAC CCCCTCAOSG OQACCTOGGC 6CCOCX3CAGA ACCCCRATGC TAAGGCOSCT 3540 
GGGTCCAGGA AGATCCATTT CAACT03CTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 
GTAAAGTACT GGATTCAGQG TGACTCCGAA TCCQAAGCCC ACCTGCTCGA CAGCAAGGTG 3660 
CCCTCAGTOG AGCTCAGCAA CCTGTACCCG TATT6C3SACT ATGAGATOAA GGTGTGCGCC 3720 
TAOQGGGCTC AGGGCGAC3GG ACCCTACAGC TCCCTGGTOT CCT6C06CAC CCACCAGQAA 3780 
40 GTGCCCAGCG AGCCAQGGCG TCTGGCCTTC AATGTOGTCT CCTCCACOOT OACCCAGCTG 3840 
AGCTOGGCTG ACCOGGCTGA GACCAAOGGT QAQATCACAG CCTACGAGGT CTGCTATGGC 3900 
CTGOTCAACG ATGACAACCG ACCTATTGGG CCCATGAAQA AAGTGCTGGT TOACAACCCT 3960 
AAGAACCGGA TGCTGCTTAT TGAGAACCTT GGGGAQTCCC AGCCCTACCG CTACACGGTG 4020 
AAGGCGCGCA ACGGGGCGQG CTGGGGGCCT GAQCGGQAOG CCATCATCAA CCTGOCCACC 4080 
45 CAGCGCAAOA GGCCCATGTC CATCCCCATC ATCCCTQACA TCCCTATOQT GOACGCCCAG 4140 
AOGGGOOAGG ACTACQACAG CTTCCTTATG TACAOOQATG ACGTTCTACG CTCTCCATOG 4200 
GGCAGCCAGA 6QCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTQAATGG COGGATQQAC 4260 
TTTGCCTTCC CGGQCAGCAC CAACTCCCTG CACAQGATGA CCACQACCAO TGCTOCTGCC 4320 
TATGGCACCC ACCTGAQCCC ACACGTGCCC CACCGCGTGC TAAGCACKTC CTCCACCCTC 4380 
50 ACAOGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTOGACCAC ACTGCOGAGG 4440 
GACTACTCCA CCCTCACCTC CQTCTCCTCC CACQACTCTC 6CCTGACTGC TGGTGTGCCC 4500 
GACAGGCCCA CCCGCCTGGT GTTCTCnGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 
CAGGA6CCGC GGTQCGAGOS GCOGCTGCAG GGCTACAGTG TGQAGTAC^ GCTGCTGAAC 4620 
GGOGGTQAGC TGCATCOGCT CAACATCCCC AACCCTGCCC AGACCTOGGT eOTGar GgAA 4680 
55 GAOCTCCT G C CCAACCACTC CTAOGTGTTC CGCGTGCGGG CCCA GABCC A GGAAG8CTGG 4740 
G6CCGAGAGC QTGAGGGTGT CATCACCATT GAATCCCAGG TGCACC08CA QAQCCCACTG 4800 
TOTOCCCTGC CAG6CTC0GC CTTCACTTTG AGCACTCCCA GTGCCCCAGO CCCGCTGQTG 4860 
TTCACTGCOC TGAGCCCAGA CTOGCTGCAG CTGAGCTGGG AG06GCCAGG GAGGCCCAAT 4920 
GGGGATATOO TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAOG GOCAGOCAOC 4980 
60 GCATTCCQQG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGO CCTCAOOOAG 5040 
AA08TOCCCT ACAAGTTCAA GGTGCAGGCC AGOACCACTO AGGGCTTCGQ GCCAGAG06C 5100 
GAGQGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCOGT 5160 
GCCGGOCTCT TCCAQCACCC GCTGCAAAGC GAOTACAQCA GCATCACCAC C ACCC ACACC 5220 
AGOGCCACCG AGCCCTTCCT AGTGGATGGG COGACCCTGG GGGCCCAGCA CCTOGAGGCA 5280 
65 OGOQOCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCOGQ ACACT GACCACCAGC 5340 
OQAACGCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCT6CCCCA 5400 
CCCCOGCCAT GTCCCACTAO GCOTCCTCCC GACTOCTCTC CCGGAGCCTC CTCAGCTACT 5460 
CCATCCrroC ACCCCTGGGG GCCCAQCCCA CCCOCATGCA CAGAGCAGGG GCTAOTTQTC 5520 
TCCTGGGAGQ CATGAAGGGG QCAAGGTCCO TCCTCTGTGG GCCCAAACCT ATTTQTAAC C 5580 
70 AAAGAGCTGG GAOCAGCACA AGGACCOVGC CTTTGTTCrG CACTTAATAA ATGGTTTTGC 5640 
TACTG 

8eq ID KO: 360 Protein sequence 
Protein Acceaslon #: NP_000204.l 

1 11 21 31 41 SI 

1111)1 

MAGPRPSPWA RLLIiAALISV SI»SGTIANRC KKAPVKSCTB CVRVPKDCAY CT DBIFR DRR 60 
OJTQAELLAA GCQRESIWM BSSFQITEET QIDTTLRRSQ MSPQGLRVRL RFGEERHFEL 120 
EVPEPLESPV DLYILMDPSN SMSD01J3NLK KKGQNLARVL SQLTSDTriG PGKPVDKVSV 180 
PQTDMRPBKL KEPHPNSDPP F8PKKV18LT EDVDKPRNKL QGBRXSOILD APEGGPDAIL 240 
QTAVCTRDIO WRPDfflHIiLV FSTBSAPHYB ADQANVLAOI MSRNDBRCHL DTTGTYTQYR 300 
TQDYPSVPTL VRLIAKHNII PXPAVTNYSY SYVBKLHTYP PVSSIjGVLQB DSSNIVELLE 360 
EAFNRIRSmi DIRALDSPRQ IiRTSVTSKMP QKTRTOSFHl RRGBVGIYOV QLRALEBVDO 420 
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THVOQLPBDQ KGNIHLKP8P 6DGLKMDAGI ICDVCTCBLQ KEVRSARC8F N6DFV06QCV 480 
CSBCtfSGQTC NCSTGSIjSDI QPCLREGBDK PCSGRGSOOC GHCVCYGEGR YBGQFCEYDN 540 
FQCPRTSOFI* QIORGRCSKG QCVCBFGHTO PSCDCPLSNA TCIDSNGGIC NGRGRCECGR 600 
CHCHQQSLYT DTICEIMYSA, IHPGLCBDLR SCVQCQAWGT GEKKGRTCBB QIPKVKKVDE 660 
S LKRAEEWVR CSFRDEDDDC TYSYTME6D6 AFGFNSTVLV HXKKDCPPGS FIWLXPIjLLL 720 
LI#PLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGPKED HYMLRENUtt SDHLDTPMLR 780 
SGaJLKGRDW RWKVTNNMQR PGPATHAASI NPTELVPYGL SLRLARLCTB NLLKPOTRBC 840 
AQLRQBVEEN LNBVYRQISG VHKLQQTKFR QQPNAGKKQD HTXVDTVLMA PRSAKPALLK 900 
LTEKQVBQRA FHDUCVAFGY YTLTADQDAR GMVBFQEGVB LVDVRVPLPX RPBDDDBKQL 960 

10 LVBAIDVPAO TATLGRRLVN IT2XKEQASD W8FEQPBFS VSRGDQVARZ PVZRRVLDGG 1020 

KSQVSYRTQD GTAQGNRDYI PVBGHLLFOP 6EAHKELQVK LLBLQBVDSL LRGRQVRRFH 1080 

VQLSNPKFGA HU3QPHSTTI IIRDPDBLDR SPTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140 

RKIHFNWLPP SGXPMGYRVK YHIQGDS6SE AHLLDSKVPS VBLTNLYFYC DYEKKVCAYG 1200 

AQGE6PYSSI* VSCRTUQEVP SEPGRLAFXVV V8STVTQL8W AEPAETNGEZ TAYEVCYOLV 1260 

15 NDDMRPIGPM laCVLVDNPKM RMLLIB«iRB SQPYRYTVKA RHGftGHGPER BAIINLATQP 1320 

KRPMSIPIIP DIPIVDAQSO BDYDSPLMYS DDVLRSPSGS QRPSVS0DTE HLVNQRMDPA 1380 

PPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440 

STLTSVSSHD SRLTAGVPOT P7RLVFSALG PTSLRVSHQE PRCBRPLQGY SVEYQUilIGO 1500 

^ . BliHRIiNIPNP AQTSVWEDL LPKHSYVFRV RAQSQEGH6R ERBGVITIES QVKPQSFLCP 1560 

20 LP6SAFTLST PSAPGPLVFT ALSPDSLQIiS WBRERRPMGD IVGYLVTCBN AQGGGPATAF 1620 

RVDGDSPCSR LTVPGIiSENV FYKFRVQART TEGFGPEREG IITIBSQD06 PFPQU3SRAG 1680 

LFQHPLQSEY SSITTTHTSA TEPFLVDGPT U3AQHLBAGG SLTRHVTQEP VSRTLTTSGT 1740 
LSTHMDQQFF QT 

25 Seq 10 NO I 361 OHA eequence 

Nucleic Acid Accession #t NM_013332.1 
Coding sequences 1..63 

1 11 21 31 41 51 

30 i I I I I I 

GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 60 

AGTAACOGAC TTTCCICCGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GATCCACTCC 120 

CX3QCTGTTCC CCCX3GAGGGT CCAGAGGCCT TTC3U3AAGGA 6AAGQCAGCT CTGTTTCTCT 180 

GCAGAGGAGT AOGOTOCTTT CAGCCATGAA GCATGTGTTO AACCTCTAOC TOTTAOOTOT 240 

35 GGTACTGACC CTACTCTCCA TCTTGQTTAO AGTOATGOAQ TCCCTAGAAG GCTTACTAGA 300 

GAGCCCATOG CCTGGGACCT CCTGQACCAC CAGAAGCOUV CTAGCCAACA CAGAGCCCAC 360 

CAAGGGCCTT CCAQACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 420 

ATATTTTCGA ACACTGACCT AOACATOTCC AGATGGQAST OCCATTCCTA GCAGACAAQC 480 

TGAGCACCGT TQTAACCAQA GAACTATTAC TAGQCCTTGA AGAACCTGTC TAACTGGATG 540 

40 CTCATTGCCT GG6CAAGGCC TGTTTAGQCC GGTTGOGGTG GCTCATGCCT GTAATCCTAG 600 

CACTTTGQQA GGCTGAGGTG GGTGGATCAC CTQAGGTCAQ GA3TTCGAGA CCAQCCTCGC 660 

CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTG6CAGA 720 

GGCCTGTAAT CCCAOTTCCT TGGQAGGCTG AGGGGGGAOA ATTGCT7GAA CCC0GGGA03 780 

GAGGTTGCAG TGAACOGAGA TOGCACTQCT GT ACCC AQCC TGGGO CACAO TGCAAGACTC 840 

45 CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CT6TTTAATG CACAGGTGTG AGT66ATTGC 900 

TTATOGCTAT GAGATAGGTT GATCTCGCCC TTACCCOGGG OTCTGQTGTA TGCTGTGCTT 960 

TCCTCAGCSVa TATGGCTCTG ACATCTCTTA QATOTCCCAA CT TCAG CTGT TGGGAQATGG 1020 

T6ATATTTTC AACCCTACTT CCTAAAC3^TC TGTCTGGGGT TCCTTTAGTC TTQAATGTCT 1080 

TATGCTCAAT TATTTGGTGT TQAQCCTCTC TTCCACAAQA QCICCTCCAT 6TTTGQATA0 1140 

50 CAGTTGAAGA GGTTGTOTGG QTGGGCT6TT GGGAGTGAGG AT6QAGTGTT GAOTQCOCAT 1200 

TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTa TGTATTGGTC TGAAGGGGGT 1260 

GGTGGQATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CCACCATGTG GCTTAAATGA 1320 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 

55 Seq 10 NOs 362 Protein aequence 
Protein Accession «i NP.037464.1 

1 11 21 31 41 51 

60 MKHVIiMLYLL GWLTLLSIP VRVMESLBOL LBSPSPQTSIf TTRSQLANTB PTKGLPDBPS 60 
RSM 

Seq ID NO: 363 DSJA sequence 
Nucleic Acid Accession #j NM_0239lS.l 
65 Coding sequence t 250.. 1326 

1 11 21 31 41 51 

) I I 1 I 1 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTOCC G AGCTT AGTT 60 

70 TCAAAGCTTA TTCTTAATTA QAQACAAGAA ACCTOTTTCA ACTTGAAQAC ACGGTATGAO 120 

OXOAATGGAC AGCCA6CCAC CACAATGAAA 6AAATGAAAC CAGGAATAAC CTATGCTGAA 180 

CCCAGGCCTC AATC8TCCCC AAOTGTTTCC TGACACGCAT CTTT6CTTAC AGTGCATCAC 240 

AACTGAAQAA TGGGGTTCAA CTTQACGCTT GCAAAATTAC CAAATAAOGA GCTGCACGGC 300 

CAAGAGAGTC ACAATTCAGG CAACAGGAQC OAOGGGCCAQ GAAAGAACAC CACOCTTCAC 360 

75 AATQAATTTO ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT OGCAAGCATC 420 

TTOCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 4 BO 

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA OGCTG ACATT TCCATTTOGA 540 

ATAGTCCATG ATGCAGGATT TGQACCTTGO TACTTCAAGT TTATTC TCTG CAGATACACT 600 

TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGQCT OATAAGCATT 660 

80 GATC6CTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATG T ACAG CATAAOCTTC 720 

ACGAAGGTTT TATCTOTTTG 'AtyiTTGGGT G ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 

ATCCTGACAA AT6GTCAGCC AACAGAQGAC AATATCCATG ACTGCTCAAA ACTTA AAAGT 840 

CCTTTOGGGG TCAAATGGCA TAOGGCAOTC ACCTATGTOA ACAGCTGCTT GTTTGTGGCC 900 

QTQCTOOTOA TTCTOATCGG ATOTTACATA GCCATATCCA GOTACATCCA CAAATCCAGC 960 
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AGGCAATTCA TAAGTCAGTC AAOCOOAAAG CQAAAACATA ACCAO AGCAT CAGGtfiwif X020 

G TG OC TG TO T TTTTTACCTG CTTTCTAOCA TATCACTTGT QGAGAA.TTCC TTTTACTTTT 1080 

AGTCACTTAG ACAGGCTTTT AGATOAATCT GCACAAAAAA TCCTATATTA CTGCAAA GAA 1140 

ATTACACTTT TCTTQTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200 

5 TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260 

ATCAGATCAC TGCAAAGTGT GAGAAGATOG 6AA6TTCX3CA TATATTATQA TTACACTQAT 1320 

GTGTAGGCCr TTTATTGTTT GTTGGAAT06 ATATGTAC3UV AOTGTAAATA AATGTTTCTT 1380 
TPCATTATCC TTAAAAAAAA AA 

10 Seq ID NO: 364 Protein sequence 
Protein Accession tf: NP_076404 

1 11 21 31 41 51 

,c \ I i ' ' ' 

15 MGPHLTLAKL PNNBLHGQES HNSGNRSDGP GKNTTLHNBP DTIVLPVLYL IIPVASILUJ 60 

GLAVWIPPHI RNKTSPIPYL KMlWADIiIM TLTPPPRIVH DAGPGPWYFK PILCRYTSVL 120 

FYANMYTSIV PliGLISIDRY LJCWKPPGDS RMYSITPTKV LSVCVWVIMA VLSLPNIILT 180 

NGQPTBDNIH DCSKLKSPLG VKMHTAVTYV NSCLPVAVLV ILIGCTIAIS RYIHKSSRQF 240 

^ - ISQ8SRXRXK HQSIRWVAV PPTCPLPYHI. CRZPFTPSHl* DRUAESAQK II.YYCKEXTL 300 
20 FLSAOIVaiD PZXYPFHCRS PSRRLPKKSN IRTRSBSIR6 LQSVRR8EVR IVyDYTDV 



25 



55 
60 



Seq ID NO: 365 DNA sequence 

Nucleic Acid Accession #: NM_00536S.l 

Coding sequences 1..948 



1 11 21 31 41 51 

I i ) I ) I 

ATGTCTCTCG A6CAGAGGA0 TCCGCACTGC AAGCCTGATG AAQACCTTGA AGCCCAAGGA 60 

GAGGACTTGG GCCTGATGGO TOCACAGGAA CCCACAGGGO AGGA66AGGA GACTACCTCC 120 

30 TCCTCTGACA OCAAGQAGGA OGAGGTGTCT GCTGCTGGGT CATCAAGTCC TCCCCAQAGT IBO 

CCTCy^GGGAG GCX3CTTCCTC CTCCArrXCC QTCTACTACA CTTTATGGAG CCAATTCGAT 240 

GAQGGCTCCA 0CA6TCAAQA AGAG6AAGAQ CCAAGCTCCT CGGTOGACCC AGCTCAGCTQ 300 

GAGT7CATGT TGCAAGAAGC ACTGAAATTG AAGGT66CT0 AQTTGGTTCA TTTCCTGCTC 360 

CACAAATATC GAGTCAAOGA GCOGOTCACA AAGGCAGAAA TGCTGGAGAG OGTCATCAAA 420 

35 AATTACAAGC GCTACTTTCC T6TGATCTTC GGCAAAGCCT CCGAGTTCAT GCAGGTOATC 480 

TTTGGCACTG ATGrGAAOGA GGTGGACCCC GCCGGCCACT CCTACATCCT TGTCACTGCT 540 

CTTGGCCTCT OGTGCGATAG CATGCTGGGT GATGGTCATA GCATCCCCAA GGCCGCCCTC 600 

CT6ATCATTQ TCCTGGGTOT GATCCTAACC AAA0AC3UVCT GOGCXXCTGA AGAGGTTATC 660 

TGGGAAOGOT TQA8IGTQAT G6GGQIGTAT OTTGGGAAGG AGCACATGTT CTACGGGGAO 720 

40 CCCAGGAAGC T6CTCACCCA AGATTOaOTO CAOGAAAACT AOTrGGAGTA CXX3GCAGGTG 780 

CCCGGCAGTO ATCCTGCX3CA CTAOJAGTTC CTGTGGGGTT CCAAGGCCCA OGCTGAAACC 840 

AGCTATGAGA AGGTCATAAA TTATTTGGTC ATGCTCAATG CAAGAOAGCC CATCTGCTAC 900 
CCATCCCTTT ATGAAGAGGT TTTGGGAQAG GAGCAAGAGG GAGTCTGA 

45 Seq ID NO: 366 Protein sequence 
Protein Accession «; NP_005356.l 

1 11 21 31 41 51 

50 MSIiBQRSPHC KPDEDLBAQG EDLGU4QAQB PTGEEEBTTS SSDSKBBEVS AAGSSSPPQS 60 

PQGGASSSIS VYYTLWSQFD EGSSSQEEBB PS8SVDPAQL BFNFQEALKL KVAELVHFLL 120 

HKYRVKEPVT KAEMIiBSVIK NYKRYFPVXF GKASEPMQVI PQTDVKEVDP AGHSYILVTA 180 

LGLSCDSMLG DCffiSMPKAAIi LIIVU3VXItT XDNCAPEEVI NEALSVT4SVY VGKEHMFYGB 240 

PRKLLTQDHV QENYLEYRQV PGGDPAHYEP LHOSKAHAET SYEKVINYLV MUIARBPXCY 300 
PSLYEEVLGE EQEGV 



Seq ID NO: 367 DNA sequence 
Nucleic Acid Accession <f: NM_014400 
Ooding sequence t 86.. 112 6 



11 21 31 41 51 

1 I I I I t 

GGTTACTCa^T CCTGGGCTCA GGTAAGAGGG CCOGAGCTOQ GAGGCGGCAC ACCCAGGGGO 60 

GAGGCCAAGQ GAOCAOGACO OAGOCATSQA CCOC360CAGO AAAGCAGOT6 OCC AOGC CAT 120 

65 GATCTGGACr GCA6GCT6GC T G CT G CTGCT GCTGCTTGGC GGAGQAOOSC A66CCCTGGA 180 

GTGCTACAGC TGOGTGCAGA AAGCAGATGA OGGATGCTCC 00GAAC3UVGA TGAAGACAGT 240 

QAAGTGOGGG COGGGOGTGG ACGTCTGCAC OGAOOCCOTQ GGGQOGGTGG AGACCATOCA 300 

GGGACAA7TC TG6CTGGCAG TGCSGGOTTQ GGOTTOGGGA CTCCCGGGCA AOAATOACCG 360 

GQGCCTOGAT CTTCAGOGGC TTCXt3G00TT CATOCAGCTG CA6CAAT00G CTCAGOATGG 420 

70 CIGCAACGCC AA6CTCAACC TCAGCTG60G GGCOCTOOAC CCGGCAGGTA ATGAGAGTOC 480 

ATACC06CCC AACGGCGTGG AGTGCTACAG CTGTGT6QGC CTGAGCOGGO AGGCX3TGCCA 540 

GOGTACATCG CCQCCGGTCX5 TGAGCTGCTA CAACXSCCAOC QATCAIGTCT ACAAGGGCTO 600 

CTTCGACGGC AAOSTCACCT TGAOGQCAGC TAATGTGACT GTGTCCTTGC CTGTCCX3GGG 660 

CTQTQTCCAG GATGAATTCT GCACTGGG6A TGGAGTAACh GGCCC3^GGGT TCACGCTCAO 720 

75 TOGCrCCTGT TGCCAGGGGT CCCGCTOTAA CTCT6ACCTC GGCAACAA6A CCTACTTCTC 780 

CCCTCOAATC CCAC<XCTTG TCOGGCPGCC CXXTCCAGAG CCCAOGACTG TGGCCTCAAC 840 

CACATCTOTC ACCACTTCTA CCTOSGCCCX: AGTQAGACCC ACATCCACCA CX^VAACCCAT 900 

GCCAGOQCCA AGCAGTCAGA CTC06AGACA GGGAGTAGAA CACGAGGGCT GCCGGGATGA 960 

QQAGCCCAGG T1GACT0GA6 G06C0GCTG6 CCAGCA6GAC GQCAGCAATT GAGGGCAOTA 1020 

80 TCCTGCAAAA GGGOGOCXXX: A6CAGCCCCA TAATAAAGGC TGIGTGGCTC CCACMCTGG 1080 

ATTGOCAGCC CTTCTGTTOO CC3GTGGCTGC TGGTGTCCTA CTGTaAOCTT CTCC ACCTGG 1140 

AAATTTCCCT CTCACCTACT TCTCIGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200 

CCCACCACTG GACTGGGCXG OCCCAGCCCC TGTTTTTOCA ACATTCCCCA GTATCCCCAG 1260 

CITCTGCIGC GCTGGTTTGC GGCTTTQGQA AATAAAATAC OBTTGTATAT ATTCTGGCAG 1320 
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70 
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80 



GGOTGTTCTA GCTTTTTGAO GACAGCTCCT OTATCCTTCT CATCCTTGTC TCTCOGCTTG 
TCCTCTTQTG ATQTTAGGAC AQAGTQAQAO AAGTCAGCTG TCAOGGGQAA OGTGAaAGAQ 
AGGATCCTAA 6CTTCCTACT CACTTTCTCC TAGCCAGCCT GQACrrTOGA GCOTGGOGTG 
GGTQGGACAA TGGCTCOCCA CTCTAAGCAC TSOCTCCCCT ACTCCCOGCA TCl TfOOQG A 
ATGGGTTGCC CATATGTCTT CCTTACTAQA CTGTGAGCTC CTGGAGGGCA G6GAC08TGC 
CTTATOTCTO TOTGTOATCft OTTTCTOGCA CATAAATGCC TCAATAAAQA TTTAATIACT 
TTQTATAGTG AAAAAAAA 

Seq XD NOt 368 Protein sequence 
Protein Acceosion fi t NP 0S521S 



1380 

1440 
1500 
1560 
1620 
1680 



1 11 21 31 41 51 

I I I I I I 

MDPARXA6AQ AMZWTA6WLL LLLLRGGAQA LECySCVQXA DDGCSPNRMK TVKCAPGVDV 
CTBAVGAVBT IHGQFSLAVX GGGSGZiPGXN DRGLDLHGLL AFIQXiQQCAQ DRCNAKUniT 
SRALDPAQIE SAYPPNGVEC YSCVGLSREA CQGTSPFWS CVNASDHVYK GCFDGHVTLT 
AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CKSDLRNKTY F8PRIPPLVR 
LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMFAPTSQTP RQGVEHEA8R DEBPRLTGGA 
AGRQDRSNSG QVPAKOGPOQ PHNK6CVAPT AGIAALLLAV AAGFVLL 

Seq ID NO: 369 DNA sequence 
Nucleic Acid Accession #) NM_005329.1 
Coding sequence: 1..1662 



60 
120 
180 
240 
300 



41 



51 



1 11 21 31 

I 1 I I I I 

ATGCCX3GTGC AGCTGACGAC AGCCCTGCGT GTGQTGGGCA CCAGCCTGTT TGCCCTGGCA 
GTGCTGGGTG GCATCCTGGC AGCCTATQTG AOGGGCTACC AGTT CRTCC A CACGGAAAAG 
CACTACCTGT CCTTCGQCCT GTACGGOQCC ATCCTGOGCC TGCA0CT6CT CATTCA<3AaC 
CTTTTTGCCT TCCTGGAGCA C0GGC6CAT0 OOAOOT G COG GCCAGGCTCT GAAGCTGCCC 
TCCCCXKX3QC GGGGCTOGGT GGCaCTGTGC ATTGCCGCAT ACCAGGAGOA CCCT6ACTAC 
TTGOGCAAGT QCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATQ 
GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTQG ACATCTTCCA CQAGGTGCTG 
OGGGQCACCQ AQCAGGCCGG CTTCTTTGTG TGGCGCAaCA ACTTCCAT6A OGCAGGGGAO 
GGTGA6ACGG AGGCCAGCCT GCAOGAOGGC ATGGACCGTG TGCX^GGATGT GOTGOGGGCC 
AGCACCTTCT CGTGCATCAT GCAGAAGTGQ GGAGGCAAGC GCQAOGTCAT GTACACGGCC 
TTCAAGGCCX: TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCX3ACTCTGA CACTGTGCTG 
GATCCAGCCT OCACXS^TOGA GATGCTTOGA GTCCTGGAGG AG6ATCCCCA AGTAGOGGGA 
GTCGGGGGAG ATGTCXIAGAT CCTCAACAAO TAOQACTCAT GGATTTGCTT CCT GAGCAQC 
GTGCGGTACT GGATGGCCTT CAAOGTGGAO 0GGG0CT6CC AGTCCTACTT TGGCTGTGTO 
CAGTGTATTA OTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 
GACTGGTACC ATCAGAAGTT CCTAGGCASC AAGTQCAGCT TC3GGGGAT6A COGGCACCTC 
ACCAACCGAG TCCTQAGCCT TGGCTACC3QA ACTAAGTATA CGGCGOSCTC GAAOTGCCTC 
ACAOAGACCC CCACTAAQTA CCTCOGGTGG CTCAACCAGC AAACC06CT6 GA0C3WGTCT 
TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATQACC 
TACXSAGTCAO TGGTCA06GG TTTCTTCCCC TTCTTCCTCA TTOCCACXOT TATACAQCTT 
TTCTACOGGG 6CC3GCATCTG GAACATTCTC CTCTTCCTGC TGAC6GT6CA GCTGGTGGGC 
ATTATCAAGO CCACCTAOOC CTGCTTCCTT CX3GGGCAATG CAGAGATQAT CTTCATGTCC 
CTCTACTCCC TCCTCTATAT GTCCAOCCTT CTGCOGGCCA AGATCTTTGC CATTGCTAOC 
ATCAACAAAT CTGGCTOQGO CACCTCTGGC OGAAAAACCA TTGTGGTGAA CTTCATTGGC 
CTCATTCCTG TGTCCATCTO GGTGGCAGTT CTCCTOOAGG 6GCTGGCCTA CACAGCTTAT 
TGOCAGQACC TOTTCAGTQA GAOU3AGCTA GCCTTGCTTG TCTCTGGGGG TATACTGTAT 
GGCTGCTACT GGQTGGCCCT OCTCATGCTA TATCTG6CC3V TCAT0QCC06 GOSATGTGGG 
AAGAAaCC3GG AGCASTACAO CTTOGCTTTT GCTGAOGrtGT 6A 



Seq ID NOt 370 Protein eequence 
Protein Accession #: HP 005320.1 



1 
I 

MPVQZfTTALR 
LFAFLBHRSM 
WDGNRQEDA 
STFSCIMQKW 
VGGDVQILNK 
DHYHQXFLGS 
YFREtniYNSL 
IIKATYACFL 
LIPVSIWVAV 
KKPEQYSLAF 



11 

I 

WGrrSLFALA 
RRA6QALKLP 
YMLDIFHEVL 
GGKRBVWYTA 
YDSWISFLSS 
KCSFa)DRHL 
MPHKHHIjWMT 
RGNAEMIFMS 
LLEGLAYTAY 
AEV 



21 

I 

VLGGILAAYV 
8PRRGSVALC 
GGTEQAGFFV 
FKAIiGDSVDY 
VRYWMAFNVE 
TNRVLSLGYR 
YESWTGFFP 
LYSliLYMSSL 
CSQCLFSBTBZi 



31 

I 

TGYQFIHTBK 
lAAYQEDPDY 
WRSNFHEAGE 
IQVCDSDTVL 
RACQSYFGCV 
TKYTARSKCL 
FFLIATVIQL 
LPAKIFAIAT 
AFLVSOAIIiY 



41 

I 

HYLSF6LYGA 
LRKCLRSAQR 
GETBASLQEG 
DPACTIEMLR 
QCISGPLGMY 
TETPTKYLRW 
FYRGRIMNIL 
XMXSGWGTSO 
qCYHVALLHXi 



Seq XD VOi 371 DMA sequence 

Nucleic Acid Accession Bos sequence 

Coding sequence: 148-7095 



CACACATAOQ 
CAAAAAAAAC 
CGOOQAQGGa 
CAQCTCCTCT 
CTTGTTGAAO 
AAATATOCAA 
CAA6TAAATG 



11 
I 

CACGCACQAT 
ATTTCCTTCG 
C0QCA6AC0Q 
GTOTTTGCOO 
AOATTGOCTG 
CATGTAATA6 
TGAATCTTAA 



21 

I 

CTCACTTOQA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCFATACA 
OCCAAAACAA 
GAAACTTAAA 



31 
I 

TCTATACACT 
CTCTCCACTC 
06AATCCTAA 
GCTAATGGAT 

ggagcactga 
tctcctatca 

TTTCAGGGTT 



41 

I 

GGAGGAT7AA 
TGAQAAGCAG 
AGOGTTTCCT 
ACTACAGACA 
ATCAAAAAAA 
ATATTGATGA 
GGGAXAAAAC 



60 
120 
180 
240 
300 
360 
420. 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



51 
I 

ILOLHLLXQS 
ISFPDLKWM 
KDRVRDWRA 
VLEEDPQVG6 
RNSLLQQFXiB 
LNQQTRWSKS 
LPLLTVQLVQ 
RXTXWNFXG 
YXAXXARROQ 



60 
120 
180 
240 
300 
360 
420 
480 
S40 



51 
I 

AACAAACAAA 
AGGAGCOGCA 
CX3CTT6CATT 
ACAGAGAAAA 
TTGOGGAAAG 
AGATCTTACA 
ATCATTGGAA 



60 
120 
180 
240 
300 
360 
420 
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AACACRTTCA TTCATAAOVC TGCSOAAAACA 0T6GAAATTA ATCTCR CTAA TGACTACCGT 480 
GTCAGOGGAO GAQTTTCAQA AATQGTGTTT AAAGCAAGCA AQATAACTTT TCACTGGGQA S40 
AAATGCAATA TQTCATCTGA TGQATCAOAG C»TA6TTTAG AAGGACA AAA ATTTCCACTT 600 
GAOMGCAAA TCTACTGCTT T6ATG0GGAC OBATTTTCAA GTTTTGAGGA AGCACTOlAA 660 
5 GGAAAACGQA AGTTAAQABC TTTATCCATT TrerTTGAGO TOGGACSMSA AGAAAATTTG 720 
GATTTCAAAG OGATTATTQA TGGAGTCGAA AGTGTTAGTC GTnTGGOAA GCA6GCT6CT 780 
TTAGATCCAT TCATACTQTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTAC ATTTAC 840 
AATO6CTCAT TGACATCTCC TCCCT6CACA GACACAGTTG ACTGGATTGT TTTTAAAOAT 900 
ACAGTTAGCA TCTCTGAAAG OCAGTTGQCT tfrimia TQ AAGTTCTTAC AATGCAACAA 960 
10 TCTOGTTATG TCATGCTGAT OGACTACTTA OUU^CAATT TTCGAGASCA ACAO TACAAO 1020 
TTCrCTAOAC AGGTGTTTTC CTCATACACT GGAAAGGAAQ AOATTCATGA AG^GTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTOAC CCAGAQAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAOAC CTC0AGT06T TTATGATACC ATGATTGAGA AGTTTGCAQT TTTGT ACCAO 1200 
CAGTTGOATG 6AQAGC5AOCA AACCAAGCAT OAATTTTTQA CAQATGGCTA TCAAGACTTO 1260 
15 GGT6CTATTC TCAATAATTT QCTACCCAAT ATGAGTTATG TTCTTCRBAT AOTAGCCATA 1320 
TOCACTAATG GCTTATATGG AAAATACAGC QACCAACTGA TTGTOGACAT GCCTACTOAT 1380 
AATCCTGAAC TrGATCTTTT CCCTQAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
QAAGAGGGAA AAGACATTGA AGAAGGC6CT ATTGTOAATC CTGGTAGAGA CA6TGCTACA IS 00 
AACCAAATCA GQAAAAAGGA ACCCCAGATT TCTACCACAA CACACTAC3UI TOGCATAGOG 1560 
20 ACGAAATACA ATOAAGCCAA OACTAACOGA TCCOCAACAA GAGGAAGTOA ATTCTCTGQA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT QACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTQTO 1740 
GAAGGTACTT CAGCCTCTTT AAATGATCGC TCTAAAACTG TTCTTAGATC TCX»CATAT6 1800 
AACTTOTCGG GGACTGCAOA ATCCTTAAAT ACAGITTCTA T AACABR ATA TGAOQAGOftO 1860 
25 AGrTTATIGA CCAGTTTCAA GCTTGATACT GaAESCTGAAO ATTCTTCWSO CTCCAQTOCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AAC^TATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAO AAATGCTTCC 2040 
QAAGATTCAA CTTCATCAGQ TTCAGAAQAA TCACTAAAGG AT CCTTC TAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AOACATAACA GCACAGCCOQ ATGTTOGAIC AOGCRGAGAG 2160 
30 AGCTTTCTCC AGACTAATTA CACTQAGATA CGTGTTGATQ AATCTGAGAA OACAACCAAG 2220 
TCCTrrrCTQ CAGGCCCAGT GATGTCACAG GGTCCCrcaQ TTACAGATCT GQAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGOTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACX3GTC AACGTGGTAT ACTCGCAQAC AACCCAACCO 2400 
GTATACAATG GTOAOACACC TCTTCAACCT TCCTACAGTA OTGAAGTCTT TCCTCTAGTC 2460 
35 ACCCCTTTGT TCCTTGACAA TCAGATCX:TC MCACShCCC CT6CTGCTTC AAGTAGTOAT 2S20 
TCGGCCTTGC ATGCTACX3CC TGTATTTCCC AGTGTCGATO TGTCATTTGA ATCCATCXnXS 2580 
TCTTCCTATO ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 
TTTCGCCATC TCCATACAQT TTCTCAAATC CTTCCACAAG TTACTTCAOC TACCGAGAGT 2700 
GATAAGQTGC CCTTQCATGC TTCTCTGCCA GTGGCTGGGO QTGATTTOCT ATTAGAGCCC 2760 
40 AGCCTTGCTC AGTATTCTGA TGTQCTOTCC ACTACTC!ATG CT6CTTCAGA GACGCTGOAA 2820 
TTTGGTAGTO AATCTGGTOT TCTTTATAAA AOGCTTATGT TTTCTCAAGT TGAACCACCC 2880 
AQCAGTOATO CCATGATGCA TGCACOTTCT TCAGGGCCTO AACCTTCTTA TGCCTTGrCT 2940 
GATAATOAGG 6CTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 
GATTCTGTGO GTGTAACTTA TCAGGGTTCC TTATTTAGCG QCCCTAGCCA TATACCAATA 3060 
45 CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACT6C AGOCTACTCA TGGCXTCTCT 3120 
GOTQATOGGG AATG3TCTO0 A6CCTCTTCT GATAGTGAAT TTCTTTTACC TGACAC3«3AT 3180 
OGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGQAAATGAG 3300 
ACTCAACTGC AAATTCCTTC TTTCAATGAO ATOQTTTACC CTTCTGAAAO CACAQTCATS 3360 
50 CCCAACATGT ATCATAATGX AAATAAQTTG AATGOQTCTT TACaUMSAAAC CTCTGTTTCC 3420 
ATTTCTAGCA CCAA60GCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 
QATCATQAOA TTAGTCAAOT TCCSVGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGftGCCAGCA 3600 
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA C TCRGC TCTT ATTTTATQRG 3660 
55 ACCrCAGCTT CTTTTAOTAC TQAAGTATTO CTACAACCTT CCTTTCAGGC TTCTOATGTT 3720 
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTOCCCAOTG ATCCAATATT GGTTGAAACC 3780 
GCCAAAGTrG ATAAAATPAO TTCTACaVATG TTGCATCTCA TTGTATCRAA TTCTGCTTCA 3840 
AGTOAAAACA TGCTGCACTC TACATCTGTA CCAOTTTTTG ATOTGTCGOC TACTTCTCAT 3900 
ATGCACTCTQ CTTCACTTCA AGGTTTGACC ATTTCCTATQ CAAGTGflGAA ATATOAACCA 3960 
60 GTTTTGTTAA AAAQTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATOATGAO 4020 
TTOrrCCAAA OGGCCAATTT GORGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 
TTTGCTACAC CTCTTTTATC AATTOATGAA CCATTAAATA CACTAATAAA T AAGC TTATA 4140 
CATTCOGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 
ATTCCAACAO TTGCTTCTQA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGQAAAT 4260 
65 GGGCa^TCTTQ CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTOTAfcC CTCAACaAAG 4320 
TTGCTCTTTC CTTCTAAOOC AACTTCTGAG CTGAGTCATA GTGCCRAATC TOATGCCGGT 4380 
TTAGTGGOTO GTOGTOAAOA TGOTGACACT GATGATQATO GTGATGATGA TQATGATGAC 4440 
AGAGGTAGTG ATGGCTTATC CATTCATAAG TQTATGTCAT GCTCATCCTA TAGAQAATCA 4500 
CAGGAAAAGG TAATGAATGA TTCAGACACC CACQAAAACA GTCTTATGGA TCAGAATAAT 4560 
70 CCRATCTCAT ACTCACTATC TGAGAATTCX GAAtaAAQATA ATAGAGTCAC AAOTGTATOC 4620 
TCAGACAOTC AAACTGOTAT GOACAOAAOT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 
TCCCAAAAOC ACAATGATGG AAAAGAS6AA AAT6ACATTC AGACTGGTAG TGCTCTGCTT 4740 
CCTCTCAGCX: CIGAATCTAA AGCATOGGCA GTTCTGACAA GTGATGAAGA AA QTQGAT CA 4800 
GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATQAGACTT CCACA3ATTT C3U3TTTT6CA 4860 
75 6ACACTAATG AAAAAGATGC TQATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 
GGATTCOCAC AOTCCCCAAC ATCATCTGTT ACTAGCXSAGA ACTC3«JAAGT CiTCCACGTT 4980 
TCRGAG6CAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGG GGTTG 5040 
GAATCOGAGA AGAAGGCAGT TATACCCCTT GTGATOGTGT CAGCCCTQAC TTTTAT CTGT 5100 
CTAOTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGQAAAT GCTTCCAGAC TGC ACACTTT 5160 
80 TACTTAGAGG ACAGTACATC CXXTTAOAGXT ATATCCACAC CTCXaVACACX: TATCTTTCCA 5220 
ATTTGAGATG ATGTOGGAGC AATTOCAATA AAGCACTTTC CAAA GCATGT TGCAGATTTA 5280 
CATGCAAGTA GTGGGTTTAC TOAAOAATTT GAGACACTGA AAQAGTTTTA CCAGGAAGTG 5340 
CAGABCTOTA CTGTTGACTT A66TATTACA GCAGACABCT CCAA CCACC C AGACAACAAG 5400 
CACAAGAATC GATACATAAA TATOGTTOCX: TAIGATCATA GCA6GGTTAA GCTAGCACAG S460 
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CTTGCTGAAA AGGATGGCAA ACTGACTGAT 
AACAGACCAA AACCTTATAT TGC TGCCC RA 
TGGAGAAT6A TATGGGAACA TAATGTGGAA 
AAAGG^AGGA GAAAATOTGA TCAQTACTQQ 
5 TTTCTOOTCA CTCASAAOAO TGTGCAAGTQ 
CTAAOAAACA CAAAAATAAA AAAGGOCTCC 
ACACAGTATC ACTACACGCA GTGGCCTOAC 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC 
CACT6CAGTG CTGGAGTTGO AM3AACAGGC 
10 CAGATTCAAC A06AAGGAAC TGTCAACATA 
AGAAATTATT T6GTACAAAC TGAGOAGCAA 
GCCATACTTA OTAAAGAAAC TOAGGTGCTG 
CTCCTCATTC CTGGACCA6C AGGCAAAACA 
CAGTCAAATA TACACCAGAG TOACTATTCT 
15 AATCQAACTT CTTCTATCAT OCCTOTGGAA 
GGAGAAGGCA CAGACTACAT CAATGCCTCC 
TTCATCATTA CCCAGCACCC TCTCCTTCAT 
GACCATAATG CCCAACTQGT GGTTATGATT 
TTTCTTTACT GGCCAAATAA AGATGAGCCT 
20 ATGGCTGAAG AACACAAATG TCTATCTAAT 
TTAQAAGCTA CACAGGATGA TTATGTACTT 
CCAAATCCAG ATAGCCCX^T TAGTAAAACT 
GCTGCCAATA GGGATGGGCC TATGATTGTT 
ACTTTCTOTQ CTCTGACAAC CCTTATGCAC 
25 TACCAGGTAG CCAAGATGAT CAATCTGATG 
TATCAGTTTC TCTACAAAGT GATCCTCAGC 
TOCACCTCTC TGQACAGTAA TGQTGCAOCA 
GAGTCTTTAG TTTAACACAO AAAGGGGTGG 
TTCCTAAAAT TAGGCAGGAA AATCAGTCTA 
30 GACAGTAACT TTCATGACAT AGGATTCTGC 
CXTTTTQCAA GACTTGTAAT TTACTTATTA 
TATTTCTAAO AATGGAATTG TGGTATTTTT 
TTTATAGAGG TTAGGAATTC CAAACTACAO 
GCTGTATTTG TAGCAATTAT CAGGTTTGCT 
35 AAATAAAACA CTCTTCCATA TQATATTCAA 
AOAAATAATC TGTTACTTAT T6TAAATACT 
TTTATAATTG TAGATTTTTA TATTTTACTA 
GTTTAGTTTA ATGAOQTAOT TCATTAGCTG 
TTGTGTTACC TAAGTCATTA ACTTTOTTTC 
40 GAAATACCTT a^TTTTGAAA GAAGTTTTTA 
AATGGTTTTT ATCCAAGQAA TTGCAAAAAT 
AAAAAAAAAA AAAAAAAAAA AAAA 



TATATCAATO CCAATTATOT TGATGGCTAC 5520 
GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 
GTTATTOTCA TGATAACAAA CCT06TGGAG 5640 
CCTGCOQATG GGAGTGAGOA GT AOQOGA AC 5700 
CITGCCT A TT ATACTGTOAG GAATTTTACT 5760 
CAGAAAGGAA GACCCAGTGG A OG TGT GG TC 5620 
ATGGGAGTAC CAGAGTACTC CXTTGCXIAGTG 5880 
AAGC6CCAT6 CAGTC3GG6CC TQ7TGT06TC 5940 
ACATATATTG TGCTAGAGAO TATOTTGCAO 6000 
TTTGGCTTCT TAAAACACAT COGTTCACAA 6060 
TATGTCTTCA TTCATGATAC ACTGGTrQAQ 6120 
GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 
AAGCTAGAGA AACAATTCCA GCTCCTQAGC 6240 
GCAOCGCTAA AGCAAT6CAA CAGGGAAAAO 6300 
AGATCAAOGG TTOGCATTTC ATCCCTQAGT 6360 
TATATCATGG GCTATTACCA QAGCAATGAA 6420 
ACCATCAAGG ATTTCTGOAO GATGATATGG 6480 
CCTGATGGCC AAAACATGGC AGAAGATGAA 6S40 
ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 
OAGGAAAAAC TTATAATTCA 6GACTTTATC 6660 
GAAGTGAGGC ACTTTCAGTO TCCTAAATGO 6720 
TTTGAACTTA TAAGTGTTAT AAAAGAAQAA 6780 
CATGATGAGC ATGGAGGA6T GACX9GCAGQA 6840 
CAACTASAAA AA6AAAATTC OGTGGATGTT 6900 
AGGCCAG6AG TCTTTGCTGA CATTGAGCAO 6960 
CTTGTGAGCA CAAGGCAGGA AOAGAATCCA 7020 
TTQCCTGATG GAAATATAQC TGAGAGCTTA 7080 
GGGGACTCAC ATCTGAGCAT TOTTTTCCTC 7140 
GTTCTGTTAT CTOTTGATTT CCCATCAGCT 7200 
CGCCAAATTT ATATCATTAA CAATGTOTGC 7260 
TGTTTGAACT AAAATGATTG AATTTTACAG 7320 
TTCTGTATTQ ATTTTAACAQ AAAATTTCAA 7380 
AAAATOTTTG TTTTTAGTOT CAAATTTTTA 7440 
AGAAATATAA CTTTTAATAC AGTAOCCTGT 7500 
CATTTTACAA CIGCAGTATT CACCTAAAOT 7560 
GCCXrTAOTGT CTCCATGGAC CAAATTTATA 7620 
CTGAGTCAAG TTTTCTAOTT CTOTGTAATT 7680 
GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 
AGCATGTAAT TTTAACTTTT GTGQAAAATA 7800 
TX3AGAATAAC AGCTTACGAA ACATTGTTCA 7860 
AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 



Seq ID NOj 372 Protein sequence: 
45 Protein Accession #: built from XP_031379 

1 11 21 31 41 51 

ilRIUCRFLAC IQUXiVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWO KKYPTQISPK 60 

50 QSPIMIDBDL TOVNVHLKKL KFQGHDKTSL ENTFIHNTGK TVEI NIiTOP Y RVSG6VSEMV 120 

FKASKITFHSI GKCaiMSSDQS EHSLEQQKFP LBMQIYCFDA DRFSSFBEAV KQKGKLRALS 180 

HiFBVGTEEM LDPKAIIDGV ESVSRFGKQA ALDPPILLMIi LPNSTDKYYI YNGSLTSPPC 240 

TDTVDHIVFK DTVSISBSQL AVPCBVLTOQ QSGYVMUOY LQNNPREQQY KFSRQVPSSY 300 

TGKEBIHEAV CSSBPENVQA DPENYTSLLV TMERPRWYD TWIEKPAVliY QQIJ3GBDQTK 360 

55 HEPLTDGYQD LGAILNNLLP NHSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPB LDLPPB 420 

IiIGTEEZZKB EEE6KD1SEO AIVIIPGRD8A TNQIRKKEPQ ISTTTHYNRI OTKYNBAKTN 480 

RSPTRGSEPS GKtBJVPNTSL MSTSQPVTKL ATEKDISLTS QTVTBLPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESI. NTVSXTBYEB ESLLTSPKLD TGAEDSSOSS PATSAIPPI8 600 

ENISQGYIFS SENPBTITYD VLIPBSAHKA SEDSTSSGSB ESLKDPSMEG NVWPPSSTDI 660 

60 TAQPDVGSGR ESFIiQTNYTE IRVDESBKTT KSF8AGPVM8 QQPSVTDLEM PHYSTFAYPP 720 

TBVTPHAFTP SSRQQDLVfiT VHWYSQTTQ PVYIIQBTPLQ PSYSSBVFPL VTPLLLDNQI 780 

lOTTPAASSS DSALHATPVP PSVBVSPESI LSSYDGAPLL PFSSASFSSE LFRHLHTV8Q 840 

ILPQVTSATE SDiCVPLHASL PVAGGDLLLB PSLAQYSDVL STTHAASETIi BPOSBSGVLY 900 

KTLMPSQVEP P8SI>A^^MHAR SSQPBPSYAL SDNBGSQHIP TVSYSSAIPV HDSVGVTYQG 960 

65 SLPSOPSHIP IPKSSLITPT ASLLQPTHAL SGDGEHSGAS SDSEPLLPDT DGIiTALNISS 1020 

PVSVABFTYT TSVFGDDHKA LSKSBIIYGW ETBLQIPSPN BMVYPSBSTV KPNMYDNVNK 1080 

LNASLQBTSV SISSTRGMFP GSLAHTTTICV FCBEXSQfVFE HNPSVOPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSP8TQLLPY ETSASPSTEV LLQPSFQASD VDTI*LKTVI*P 1200 

AVPSDPILVB TPKVDKISST MLHLIVSNSA SSEHMiaSTS VPVFDV8PTS HMHSASLQGl. 1260 

70 TISYASBKYE PVLLKSESSH QVVPSLY8ND BLPQTANLEI NQAHPPKGRH VFATPVLSID 1320 

EPUITLTinCL IH8DBZI1T8T KSSVTGKVFA 0IPTVA8DTP VSTOHSVPIG NOTVAITAVS 1380 

PHRDGSVTST KLXiFPSKATS ELSH8AKSDA 6LVGG6BDGD TDODGDDDDD DRGSDGLSIH 1440 

KCMSCSSYRB SQEKVMNDSO THENSIMXJN NPISYSLSEN SEEDNRVT8V SSDSQTGMDR 1500 

8PGKSP8ANG LSQKHNDGKB ENDIQTQSAL LPI*SPESKAW AVLTSDEESG SGQGTSDSLN 1560 

75 ENBT8TDF8F ADTNEKDADO ILAAGDSEIT PGFPQSPTSS VT8ENSBVPH VSEABASNSS 1620 

KBSRIGIiABG X^ESEXXAVIP LVXVSALTPI CLWL VGIIiI YHRKCPQTAH FYLEDSTSPR 1680 

VZSTPPTPZP PISDDVGAIP XKHFPKBVAD LRASSGFTBB PBTLKBFYQB VQSCTVDI^I 1740 

TADSSNHPDN KHKNRYINIV AYDHSRVKLA QIAEKDGKLT DYINANYVDG YNRPKAYXAA 1800 

QGPLKSTABD PHRMIWEHNV EVIVMITMLV BKGRRKOQY WPADGSEEYG NFLVTQKSVQ 1860 

80 VLAYYTVRNF TUOJTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920 

AKBHAVGPW VBCSAGVGRT GTYIVU>SML QQIQHB6TVN IFGFUCHIR8 QRMYLVQTEB 1980 

QYVPIHDTLV BAIZiSKETBV UlSBIHAYVN ALLIPGPAGK TKLEKQFQLL SQSKIQQSDY 2040 

8AALKQCNRE KNRT8SIZPV BRSRVGISSIt SGBGIDYINA 8YIMGYYQSN BPIZTQKPU* 2100 

HTIKDPHRMI WDHKAQLWM IPDGQNMAED EFVYWPBKPS PINCBSPKVT LMAEBHKOiS 2160 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
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60 
65 
70 
75 
80 



MEBKLIIQDF ILBATQDDYV LBVRHPQCPK HPNPDSPISK TFELISVIXE EAMIRDOPMI 
VHDEHGGVTA GTFCALTTLM HQLBKENSVD VyQVAKMlNL MRFGVFADIB QVOPLYKVIL 
SbVSTRQEEN PSTSLOSKGA ALPDGMIAB8 LESLV 

Seq ID KO: 373 DMA sequence 

Nucleic Acid Accession fl: built from NH_002851 

Coding sequences 148-4S18 

1 11 21 31 41 51 

I I 1 i t 1 

CACACATAOQ CACGCAOGAT CTCACrTOGA TCTATACACT GGAGGATTAA AACRAACAAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAOCAG AGGAGCOGCA 
CGGCGAGGGG COGCAGACOG TCTGGAAATO OQAATCCTAA AGCX3TTTCCT CGCTTGCATT 
CAGCTCCTCT GTGTTT G OQO CCIOGATTQO GCTAATQ6AT ACTACAOACA ACAOAOAAAA 
CTTGTTOAAO AGATTGGCTG GTCCTATACA GOAGCACTGA ATCAAAAAAA TTGGGGAAAO 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AC3ATCTTACA 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATT^AA 
AACACATTCA TTCATAACAC TOGGAAAAOl GTGGAAATTA ATCTCA CTAA TGA CTACOOT 
GTCAGCX3QA0 GAGTTTCAGA AATGGTQTTT AAAOCAAGCA A8ATAACTTT TCACIGOGGA 
AAATGCAATA TOTCATCTOA T0GATCA6AO CATAOTTTAG AAGGACAAAA ATTTCCACTT 
GAGATGCAAA TCTACTGCTT TGATGO G GAC CGATTTTCAA GTTTTGAGOA AGCAQTCAAA 
GGAAAA6GGA AGTTAAQAGC TTTATCCATT TTGTTT6AGG TTGGGAC AGA AGAAAATTTQ 
GATTTCAAAO CGATTATTGA TGGAGTCQAA AGTGTTA6TC GTTTTGOGAA GCAGG CTGC T 
TTAGATCCAT TCATACTGTT QAACCTTCTG CCAAACTCAA CTQACA AGTA TTACA TTTAC 
AATCGCTCAT TGACATCTCC TCCCT0C31CA GACACAGTTG ACTGGATTOT TTTTAAAOAT 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTT6TG AAGTTCTTAC AATGCAACAA 
TCTG G TT A TG TCATGCTQAT GGACTACTTA CAAAACAATT TTCGA6AGCA ACAGTACAAG 
TTCTCTAGAC AOGTGTTTTC CTCATACACT GGAAAGGAAQ AOATTCATQA AOCAQTTT GT 
AGTTCAGAAC CAGAAAATGT TCAGGCTQAC CCAOAQAATT A TACC ftGCCT TCTTOTTACA 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATT6AGA AGTTTGCAGT TTTGTA CCAQ 
CAOTTOOATG GAGAQGACOV AACC3UWK3^T GAATTTTTQA CAGATGGCTA TCAAGACTTG 
OGTGCTATTC TCAATAATTT QCTACCCAAT ATQAGTTATQ TTCTTCAGAT AQTAGCCATA 
T6CACTAATG OCTTATATGG AAAATACAGC GACCAACTGA TTGTCX3ACAT GCCTACT6AT 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTOGAACTG AAGAAATAAT CAAGOftGGAQ 
GAAGAGGQAA AAGACATTGA AQAAGGCGCT ATTOTGAATC CTQGTAGA6A CAOTOCTACA 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TOGCATAGGG 
ACGAAATACA ATGAAQCCAA GACTAACCGA TCOCCAACAA GAGGAAGTGA ATTCTCTGQA 
AAGGGTGATQ TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACX»GTCAC TAAATTAGCC 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTO AACTGCCACC TCACACiaiQ 
QAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTO TTCTTAGATC TCCACATATG 
AACTTGTCGO GGACTGCRGA ATCCTTAAAT ACAGTrTCTA TAACAGAATA TQAGGAGGAG 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG C TCCA GTCCC 
GCAACTTCTQ CTATCCXATT CATCTCTGAQ AACATATCCC AAG6GTATAT ATTTTCCTCC 
GAAAACCCAG A6ACAATAAC ATATGATGTC CTTATACCAG AA TCTGCTA G AAATGCTTCC 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 
OTQTGGTTTC CTAGCTCIAC AGACATAACA GCACAGCCOG ATGTTGGATC AGGCAGAGAG 

AocrrrcTCC aoactaatta cactgagata ogtgttgatg aatctgagaa gacaaccaag 

TCCTTTTCTG CAGGCX:CAGT GATGTCACAG GGTCOCTCAG TTACAG ATCT GGAAATGCCA 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATQCTTT TA COCC ATOC 
TCCAGACAAC AGGATTTGOT CTCCAGGGTC AACOTGGTAT ACTCGCAGAC AACCCAACOO 
GTATACAATG CAOAOQCCAa TAATAGTAQC CATGAGTCXC GTATTGGTCT AGCTQAGGGG 
TPGGAATCOG A6AAGAA06C AOTTATACOC CTT6TGAT0G TGTCAGCXICT GACTTTTATC 
TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 
TTTTACTTAG AGQACAGTAC ATCCCCTAGA OTTATATCCA CACCTCCAAC ACCTA'i'CTfT 
CCAATTTCAG ATGATGTC3GG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 
TTACATGCAA QTAGTQQOTT TACTQAAGAA TTTGAQACAC TGAAAGAGTT TTACCAOGAA 
GTGCAGAGCT OTACTOTTQA CTTAGGTATT ACAOCAQACA GCTCCAACCA CCCAGACAAC 
AAGCACAAGA ATOGATACAT AAATATCX3TT GCCTATGATC ATAGCaWSGGT TAAGCTAGCA 
CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATQCCAATTA TGTTGAT06C 
TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAG0CXX3VC TGAAATCCAC AQCTGAAGAT 
TTCTGGAGAA T6ATATG0GA ACATAATGTG GAAOTTATTG TCATQATAAC AAACCTCGTG 
GAGAAAGQAA GGAGAAAATO TGATCAGTAC TGGCCTGCOQ ATGGGAGTGA GQAQTAOQQG 
TkACTTTCTGG TCACTCAQAA GAGTGTGCAA GTGCTTGCCT ATTATACT3T QAGGAATTTT 
ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACOCAO TGGAOGTGTG 
GTCACACAOT ATCACTACAC GCAGTGGOCT GACATGGGAG TACC3V6AGTA CT CCCTGCC A 
GTGCIGACCT TTGTGAOAAA GGCAQOCtAT GCX3^CX3CC ATGC3VGTGGG GCCTGTTGTC 
GTCCACTGCA GTGCTGGAGT TGGAAOAACA 6GCACATATA TTOTGCTAGA CA OTATQTTG 
CAOCAGATTC AAC3V0GAAGO AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCOQTTCA 
CAAAGAAATT ATTTGQTACA AACTQAGGAG CAATATGTCT TCATTCATGA TACACTOCTT 
GAGGCCATAC TTAOTAAAGA AACT6AGGTQ CTGGACAGTC ATATTCATGC CTATGTTAAT 
GCACXCCTCA TTOCTOGACC AOCAGOCAAA ACAAAOCTAG AGAAACAATT CCAGCTCCTG 
AGCCAQTCAA ATATACAGCA GAGTGACTAT TCTOCAOCCC TAAAGCAATO CAACAGGGAA 
AAGAATCX3AA CTTCTTCTAT CATCCCTGTG QAAAGATCAA GGGTTGGCAT TTCATCCCTG 
AGTCGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAQAGCAAT 
GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 
TOGGACCATA ATQCOCAACT GOTGOTTATO ATTOCTGATG GCCAAAACAT GGCAGAAGAT 
GAATTTOTTT ACTGGCCftAA TAAAGATGAO CXTTATAAATT OTOAaAOCTT TAAGQT CACT 
CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACT TATA AT TCAGOACTTT 
ATCTTAQAAG CTACACAGGA TGATTATGTA C TTGAAG TGA GGCACTTTCA OTOTCCTAAA 
TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 
GAAGCTCCCA ATAOOOATOG OCCTATQATT 6TTCATGATG AGCATCQAOG AGTQAOGGCA 
GGAACTTTCT GTGCTCIGAC AACXXTTATO CACCAACTAO AAAAAGAAAA TTCCGTOOAT 
GTTTACCAOO TAOCCAAOAT GATCAATCTG ATQ AOGCCAO GAGTCTTTGC TGACATTGAO 
CAOTATCAOT TTCTCTACAA AOTGATOCTC AOCCTTOTOA OCACAAGOCA GQAAGA6AAT 



2220 
2280 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840' 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
ISOO 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
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CCATCCACCT CTCTGGACAQ TAATGOTOCA GCATTGCCTQ ATGGAAATAT AGCTQAGAOC 4500 

TTAQAQTCTT TAGTTTAACA CAGAAASGGG TGGGGGGACT CS^CATCIGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG QAAAATCAOT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCXOACAOTA ACTTTCATGA CATAQOATTC T6CCGCCAAA TTTATATCAT TAACAATOIO 4680 

TGCCTTTTTG CAAOACTTOT AATTTACTTA TTATCTTTQA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CyVATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTOTTTTTAG TGTCAAATTT 4860 

TTACCTGTAT TTGTAGCAAT TATC»GGTTT CCTAG AAATA TAACTTTTAA TACAGTAGCC 4920 

T6TAAATAAA ACACTCTTGC ATAT6ATATT CAACATTTTA CAACTGGAOT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACIOCXXITAG TGTCTCCATG OACCAAATTT S040 

ATATTTATAA TTOTAGATTT TTATATTTTA CTACTGAGTC AA6TTTTCTA GTTCTGTOTA 5100 

ATTQTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAOTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTT6TGGAAA 5220 

ATAGAAATAC CTTCATTTTO AAA6AASTTT TTATGA6AAT AACACCTTAC CAAACATTGT 5280 

TCAAATC6TT TTTATCCAAO GAATTGCAAA AATAAATATA AATATT6CGA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAA A A A A AAAAAAA 



Seq ID NO: 374 Protein sequence t 

Protein Acceesion ftt built Crom XP_031379 

1 11 21 31 41 SI 

I I I I I I 

MRILKRFIAC IQIiLCVCRLD HANGYYRQQR KLYBBIGWSY TI^UiQICNHG KKYPTQISPK 60 

QSPINIDEDL TQVNVNI*KEa> KFQOWDKTSL ENTPIHHTGK TVBIHLTODY RVSGOVSEMV 120 

FKASKITPHW GKCNMSSDGS EHSLBGQKPP LEMQIYCFDA DRPSSPEEAV KGRGKLRAL8 180 

ILFEVGTESN LDPKAIIDGV ESVSRPGKQA ALDPPILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTV9ISESQL AVFCBVLTMQ QSGYVMLMDY LQ!INPRBQQY KFSRQVPSSY 300 

TGfCEEIHEAV CSSEPENVQA DFENYTSZtLV TWBRPRWYD TNIGKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD UJAIUmLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDHPT DHFELDIiFPB 420 

LIGTEEIIKE BEEGKDIEBG AIVNPORDSA TNQIRKKEPQ I8TTTHYJJRI GTKYNEAKTN 4 BO 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTBI*PPHT VEGTSASLND 540 

6SKTVLRSPH MNLS0TAE8L NTVSITEYEE BSLLTSFKLD TGAEDSSGSS PATSAIPPI8 600 

EKISQGYIPS SB«PBTITYD VLIPSSARNA 8EDST8S6SB BSLKDPSME6 NVNFPSSTDI 660 

TAQPDVGSGR ESPLQTNYTB IRVDESEKTT KSF8AGPVMS QGPSVTDLEM PHYSTPAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNAEASNS SHESRIGLAB 6LBSEKKAVI 780 

PLVIVSAIiTP lOiWIiVGIL lYVmKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHPPKHVA DLHASSGFTE EFETIiKEFYQ EVQSCTVDLG ITADS8NRFD NKHKNRYINI 900 

VAYDHSRVKli AQLAEKDGKL TDYINANYVD GYNRPRAYIA AQGPLKSTAB DFNRKltfSHK 960 

VEVrVMITOL VEKGRRKCDQ YWPADQSBBY OIPLVTQKSV QVLAYYTVRM PTLRNTKIKK 1020 

GSQKGRPSGR WTQYHYTQW PDMOVPBYSI* PVLTPVRKAA YAKRHAVGPV WHCSAOVOR 1080 

TQTYIVLDSM LQQIQHBGTV NIFGPLKHIR 8QRNYLVQTB EQYVPIHDTIi VEAILSKETB 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGI8S IiSGEGTDYIN ASYIMGYYQS NEFIITQHPL IjHTIKDFWRM IWDHNAQLW 1260 

MIPDGQimAE DEFVYWFNKD EPIKCESFKV TLMAEEKKO* SKEBKLIIQD FXIjEATQDDY 1320 

VLEVRHFQCP KNPNPDSFI8 KTFSLZSVXK BBAAMRDGPM IVHDEHGGVT AGTFCALTTL 1380 

NRQIiBKEKSV XJVYOVAKMZlf U4RPGVFADI EQYQFLYKVX LSLV8TRQBE NP8T8LDSN6 1440 
AALPDOHAE 8LESI«V 

8eq ZD KO: 375 DWA sequence 

Nucleic Acid Accession fti EOS sequence 

Coding sequence a 148-4494 

1 11 21 31 41 51 

i I I I I 1 

CACACATAOO GA00CAC6AT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTGa CTCOCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

O0GO6AGQG0 COGCAGACCG TCTGQAAATG CGAATCCTAA AOCGTTTCCT OGCTTGCATT 180 

CAGCTCCTCT GT G TTTGCCG CCTGGATTGQ GCTAATGGAT ACTACAOACA ACAGAGAAAA 240 

CTTOTTOAAG AQATTQGCTG QTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGOaAAAO 300 

AAATATOCM CATGTAATAO CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAAT3 TQAATCTTAA GAAACTTAAA TTTCAGGGTT GGQATAAAAC ATCATTGOAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGQAAATTA ATCTCACTAA TGA CTAC OGT 480 

GTCACOGGAO GA3TTTCAQA AATGGTGTTT AAAGCAAGCA AQATAACTTT TCACTOQaOA 540 

AAATOCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAQ AAQQACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCAOAC OGATTTTCAA GTTTTGA66A AGCAGTCAAA 660 

OQAAAAGGGA AOTTAAOAOC TTTATCCATT TTGTTTOAGG TTOGQACAGA AGAAAATTTG 720 

CATTTCAAAG OGATTATTGA TOGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATOCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAOTTAOCA TCTCTQAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATO TCATGCTGAT GGACTACTTA CAAAACAATT TTOGAGAQCA ACAOTACAAG 1020 

TTCTCTAQAC AGGTGTTTTC CTCATACACT GGAAAGQAAG AGATTCATGA AGaVGTTTGT 1080 

AOTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAQAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCQAGTGGT TTATGATACC ATOATTGAGA AGTTTGCAGT TTTGTACCAQ 1200 

CAQTTGGATG GAGAOGACCA AA0CAA6CAT GAATTTTTQA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAOTTATQ TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATO GCTTATATOG AAAATACAGC QACCAACTGA TTGTOGACAT GCCTACT6AT 1380 

AATCCT6AAC TTQATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAOQAO 1440 

GAAGAG6GAA AAGACATTGA AOAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA OGAAAAAGGA ACCCCAOATT TCTACCACAA CACACTACAA TOGCATAOGO 1560 

AOOAAATACA ATOAAGCCAA QACrAAOCOA TCOCCAACAA GAGOAAGTGA ATTCTCTGGA 1630 

AAOGOTGATG TTCCCAATAC ATCTTTAAAT TCCRCTTCCC AACCAOTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT QACTTCTOVG ACTGTGACIG AACTQCCACC TCACACTGTG 1740 
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GAAGGTACTT CAGCCTCTTT AAATQATGQC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAQA ATCCTTAAAT ACAGTTTCTA TAACRQAATA TOAGG AGGRG 1860 
AGTTTATTGA CCAGTTTCAA GCTTQATACT GGAGCTQAAG ATTCTTCAOS C TCCRG TOOC 1920 
OCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAOGGTATAT ATTTTCCTCC 1980 
5 GAAAACCCAG AGACAATAAC ATATQATOTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCRGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGQAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA QCACAGCCCG ATGTTGGATC AGGCAGAGAO 2160 
AGCTTTCTCC AQACTAATTA CACTGAGATA CGTGTTGATQ AATCTGAGAA GACAAOCAAG 2220 
TCCTTTTCTO CAGGCCCAGT GATOTCACAQ GGTCCCTCAG TTACAQATCT GGAAATGCCA 2280 
10 CATTATTCTA CCTTTGCCTA CTTOCCAACT GAGOTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTQGT CTCCACGGTC AACOTGGTAT ACTOSCAQAC AACCCA ACOS 2400 
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTC30TA TTQOTCTAGC TGAGGGGTTG 2460 
GAATCCGAGA AGAAGQCAGT TATACCCCTT GTGATOOTGT CAGCCCTGAC TTTTATCTGT 2S20 
CTAG TOG TTC TTGTGGOTAT TCTCATCXAC TGGAGOAAAT OCTTCCAGAC TGCACACTTT 2580 
15 TACTTAGAGG ACAGTACATC COCTAQAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
ATTTCAQATO ATGTC6GAGC AATTCCAATA AAGCACTTTC CAAAGCATOT TGCAGATTTA 2700 
CATGCAAGTA GTGQGTTTAC TGAAQAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 
GOTATTACAG CAQACAGCTC CAAOCACCCA GACAACAAGC ACAAGAATCG ATACATAAA7 2820 
ATOGTTGCCT ATGATCATA6 CAGGGTTAAG CTAGCACAGC TTOCTQAAAA GGATGGCAAA 2880 
20 CTGACTGATT ATATCAATOC CAATTATCTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 
GCTGCCCAAO OCCCACTGAA ATCCACAGCT QAAGATTTCT GGAOAATGAT ATGGGAACAT 3000 
AATGTGQAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATOTGAT 3060 
CA6TACTGGC CTGCOQATGG OAGTGAGGAG TACGGGAACT TTCT6GTCAC TCAGAAQAOT 3120 
GTGCAAOTGC TTGOCTATTA TACrGTGAOO AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 
25 AAGGGCTCCC AOAAAGGAAO ACCCAGTG6A CGTGTGGTCA CACA GTAT CA CTACACGCAG 3240 
TGGCCT6ACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCrTTGT GAGAAAGGCA 3300 
GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCX5TCC ACTGCAGTGC TOaAGTTGGA 3360 
AOAACAGGCA CATATATTGT GCTAGACAGT ATGTTQCAGC AGATTCAACA OGAAGQAACT 3420 
GTCAACATAT TTGGCTTCTT AAAACACATC OSTTCACAAA GAAATTATTT GGTACAAACT 3480 
30 GAGGAOCAAT AT6TCTTCAT TCATGATACA CT6GTTGAGG CCAIACTTAG TAAAGAAACT 3540 
GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 
GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 
GACTATTCTO CA6CCCTAAA GCAATGCAAC AGGGAAAA6A ATCQAACTTC TTCTATCATC 3720 
CCTGTGGAAA GATCAAGOGT TGGCATTTCA TCCCTGAGTG OAGAAGGCAC AGACTACATC 3780 
35 AATGCCTCCT ATATCAT6GG CTATTACCAO AGCAATQAAT TCATCATTAC CCAGCACCCT 3840 
CTCCTTCATA CCATCAAGGA TTTCTGGAGO ATGATATGGG ACCATAATGC CCAACTOGTG 3900 
GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960 
GATGA6CCTA TAAATTOTGA GAGCTTTAAG GTCACTCTTA TGGCTQAAGA ACACAAATGT 4020 
, _ CTATCTAATG AGGAAAAACT TATAATTCAO GACTTTATCI TA6AA0CTAC ACAGQATGAT 4080 
40 TATOTACTTQ AAOTOAGGCA CTTTCRGT3T CCTAAATGOC CAAATCCAQA TAGCCCCATT 4140 
AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAO GGATGGQCCT 4200 
ATGATTGTTC ATGATGAGCA TGGAGGAGTQ AOGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 
CTTATGCACC AACTA6AAAA AGAAAATTCC GTGGATQTTT ACCAGGTAGC CAAGATGATC 4320 
AATCTQATQA 0Q0CA6QA6T CTTTGCTGAC ATT6AGCAGT ATraOTTTCT CTACAAAGTO 4380 
45 ATCCTCAGCC TTOTGAGCAC AAGGCAOQAA GAOAATOCAT CCA CCTC TCT GGACAGTAAT 4440 
GGTGCAGCAT TQCCTGATGO AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 
AAGGGGTGGG GGGACTCACA TCTGAQCATT OTTTTCCrCT TCCTAAAATT AGQCAGGAAA 4560 
ATCAGTCTAG TTCT6TTATC TGTTGATTTC CCATCACCTG ACRGTAA CTT T CATG ACATA 4620 
OG A TTCTOCC GCCAAATTTA TATOOTAAC AATGTQTGGC TTTTTGCftAO ACTTGTAATT 4680 
50 TACTTATTAT GTTTGAACTA AAATOATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 
GGTATTTTTT TCTGTATTGA TTTTAACaQA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 
AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTOTATTTGT AGCAATTATC 4860 
A6GTTTGCTA QAAATATAAC TTTTAATACA GTAGGCIGTA AATAAAACAC TCTTCCATAT 4920 
^ _ 6ATATTGAAC ATTTTACAAC TGCAGXATTC AOCTAAAOTA GAAATAATCT GTT ACTTAT T 4980 
55 OTAAATACTO CCCTAOTOTC TCCATGGACC AAATTTATAT TTAT AATTG T AOATTTTTAT 5040 
ATTTTACTAC TGA(?rCAAGT TTTCTAGTTC TGTGTAATT6 TTTAQTTTAA TGAOGTAGTT 5100 
CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT A AGTCA TTAA 5160 
CTTTOTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 
AAOTTTTTAT GAGAATAACA GCTTAOAAA CATTOTTCAA ATGGTTTTTA TCCAAOGAAT 5280 
60 TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 376 Protein sequence t 
Protein Accession #i BOS sequence 

65 

1 11 21 31 41 51 

I i I I I I 

MRILKRFLAC IQLLCVCaiLD WANOYYRQQR KLVBEIGWSY TGALNQKNWG KWrPTQISPK 60 

^ QSPIHIDEDL TQVMVllXiKKL XFQGHDKT6L ENTFXHNTBK TVBIKLraPY RVSGGV&EMV 120 

70 FKASXITFHW GKQ1M8SXXS6 BHSLEGQKFP LEMQIYCFDA DRFSSFEBAV XSK6RLRALS 180 

ILFEVGTBKM LDFKAZIDGV BSVSRFGKQA AlDPPILmi* LPHSTDlonri YNGSLTSPPC 240 

TDTVDWIVPK DTVSISESQL AVPCEVLTMQ QSGYVMIjMDY MJNNFREQQY KFSRQVPSSY 300 

TGKEEIHEAV CSSBPENVQA DPENYTSIiLV TWERPRWYD TMIBKFAVLY QQLDGEDQTK 360 

HBFLTDGYQD LGAI&NHLLP MMSYVXX^XVA XCINGLYGKY S PQLIVO MPT DNPBLDLPPB 420 

75 LIGTBBIIKE BEBQKDIBEG AIVMPGRDSA TNQIRKKEPQ lOTTTHYNRI GTKYHBAKTM 480 

R8PTRG8SF8 GKGDVPNTSL NSTSQFVTKL ATEKDISLTS QrVTBIiPPHT VEGTSASLND 540 

GSKTVI^SPH MNLSGTAESIi NTV8ITEYEB ESISL/TSFKJ^ TGABDSSGSS PAT8AIPFIS 600 

EmSQGYIFS SBNPGTITYD VLIPBSARNA 8EDSTS8G8B BSLXDPSKEG NVWFP88TDI 660 

TAQPOVGSQR BSFLQTMYTB ZRVDESEXTT KSFSAGPVMS QGPSVTOLSK PH Y8TPA YFP 720 

80 TBVTFHAFTP S9RQQDI«VST VNWYSOTTQ PVYMEASNSS HESRIGLAE6 LBSBKKAVIP 780 

LVIVSALTPI CLWLVGILl YWRKCPQTAH FYLBDSTSPR VI8TPPTPIP PISDOVGAIP 840 

IKHPPKHVAD LHASSGPTEB PBEVQSCTVD LOITADSSNH PDKKHKNRYI NIVAYDHSRV 900 

RLAQLABKDG KLTDYINANY VDGYNRPXAY IAAQGPI*KST AEDPWHMIHB HNVEVIVMIT 960 

NLVEKBRRXC DQYWPADOSB BYGMFIiVTQX SVQVLAYYTV RNFTLRIITKI KKGSQKGRPS 1020 
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GRWTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRBAVG PVWHCSAGV GRTQTYIVLD X080 
SMLQQIQHEO TVNIPOPIiKH IRSQRNYLVQ TEBQYVPIHD TLVBAILSKB TEVLDSHIHA 1140 
YVNALLIPGP AGKTKLEICQP QLLSQSNIQQ SDYSAALKQC NREKKRTSSI XPVBRSRVGI 1200 
SSIiSGBGTDY XNASyZHOyy QSNEPIITQH PLLHTIXDFN RMZNDHZlAQIi WMZPDGQNM 1260 
S AEDEFVYWPK fODSPXNCBSP RVTLMAEEHR CLSNBEKLII QDFZIiBATQD DYVLBVRHFQ 1320 
CPKWPNPDSP I8KTPBLZSV ZKEEJU^NROO FHXVHDBRGG VTAGTFCALT TLKHQbEKEN 1380 
SVDVYQVAKM INLMRPGVFA DZEQyQFLYK VXLSLVSTRQ BGNPSTSLDS HGAALPIXSNX 1440 
ABSLBSIiV 

10 

eeq ID N0» 377 DNR sequence 

Nucleic Acid Accession #t EOS sequence 

Coding sequence t 501-4514 

15 1 11 21 31 41 51 

I I I ' t I I 

CACACATACG CACGCACGAT CTCACTTCX3A TCTATACACT GQAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCO CTCCCCCTCC CTCTCCACTC TGAQAAGCAG AGOAGCOQCA 120 
OGGOGAOGOQ CO0CAOAGO3 TCTGGAAATG 06AATGCTAA AGGQTTTCCT OOCTTGCATT 180 
20 CAGCTCCTCT OTGTTTGCOG CCTGGATTGO OCTAATOOAT ACTACAGACA ACAGAOAAAA 340 
CTTGTTGAAG AOATTGGCTO GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 
AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA QATCTTACAC 360 
AAGTAAATGT GAATCTTAAO AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTG GAAA 420 
ACACATTCAT TCATAACACT GGGAAAACA6 TG8AAATTAA TCTCA CTAAT QACTACOGTO 460 
25 TCAGOGGAGG AGTTTOVGAA AT6GTQTTTA AAGCAAGCAA GATAACTTTT CACTOG QQAA 540 
AATOCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGOACAAAAA TTTCCACTTG 600 
AGATQCAAAT CTACTGCTTT QATaCGOACC QATTTTCAAG TTTTGAGGAA GCAGT CAAAG 660 
GAAAAGG6AA GTTAAGAGCT TTATCCATTT TGTTTOAGQT TGGGACAOAA GAAAATTTGG 720 
ATTTOUVAOC GATTATTGAT GGAOTOGAAA GtGTTASTOO TTTTGGGAAO CA6GCTGCTT 780 
30 TAGATCCATT CATACTOTTO AACCTTCT6C CAAACTCAAC TGACAAOTAT TACATTTACA 840 
ATGGCTCATT GACATCTCCT CCCTGCACAQ ACACAGTTGA CTQGATTGTT TTTAAAGATA 900 
CAGTTAGCAT CTCTOAAAGC CAQTTGQCTQ TTTTTTGTQA AGTTCTTACA ATGCAACAAT 960 
CTGOTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TGQAOAGCAA CAG TACAAGT 1020 
TCTCTAGACA GGT6TTTTCC TGATACACTG GAAAGGAAGA GATTCATGAA GCAOTTTGTA 1060 
35 GTTCAGAACC AGAAAATQTT CA86CTGACC CAGAGAATTA TACCA6CCTT CTTGTTACAT 1140 
GGGAAAGACC TCGAQTCX3TT TATOATACCA TGATTGAGAA GTTTGCAOTT TTGTA CCAG C 1200 
AGTTGGATGG AGAGQACCAA ACCAAGCATQ AATTTTTGAC AGATGGCTAT CAAQACTTGG 1260 
GTXSCTATTCT CAATAATTTG CTACCCAATA TGAOTTATGr TCTTCAGATA GTAGCCATAT 1320 
. _ GCACTAATGG CTTATATGOA AAATACAOOO AOCAACTGAT TGTOGACATG CCTACTGATA 1380 
40 ATCCTGAACT TGATCTTTTC CCTQAATTAA TTGGAACT6A AGAAATAATC AAGGAGGAGG 1440 
AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 
ACCAAATCAG GAAAAAGQAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 
CQAAATACAA TGAAGCX»AG ACTAACCGAT CCCCUkCPAQ AGGAAGTCAA TTCTCTGGAA 1620 
AGGGTGATOT TCCCAATACA TCTTTAAATT GGACTTCCCA ACCAGTCACT AAATTAGOCA 1660 
45 CAOAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 
AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 
ACTTGTCGGO GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT QAGGAGGAGA 1860 
GTTTATTGAC CAGTTTCAAG CTTGATACTG GAQCTGAAGA TTCTTCAGGC TCCAGTCCOG 1920 
CAACTTCTQC TATCCCATTC ATCTCTGAGA AGATATCCCA AGGGTATATA TTITCCTOC30 1980 
50 AAAACCCaGA GACAATAACA TATOATGTCC TTATACCAGA ATCTGCTAGA AATOCTTCOG 2040 
AAGATTCAAC TTCATCAGGT TCAQAAGAAT CACTAAAGGA TCCTTCTATO QAGGGAAATO 2100 
TGTGOTTTCX: TAGCTCTACA GACATAACAQ CACAGCCOGA TGTTGGATCA GGC3U3AGAGA 2160 
GCTTTCTCCA GACTAATTAC ACTGAOATAC GTGTTOATGA ATCTGAGAAQ ACAACCAAGT 2220 
CCTTTTCTGC AGGCCCAGTQ ATGTCACAGG GTCCCTCAGT TACA6 ATCTQ GAAATGCCAC 2280 
55 ATTATTCTAC CTTTGCXrrAC TTCCCAACTG AOOTAACACC TCATGCTTTT ACCCCATCCT 2340 
CCAOACAACA GGATTTGGTC TCCAOGGTCA ACGTGGTATA CT0GCA3ACA ACCCRACCQG 2400 
TATACAATGA GGCCAGTAAT AGTAOCCATG AOTCTCXTTAT TGGTCTAGCT GAGGGGTTGQ 2460 
AATCCX5AGAA 6AAGQCAGTT ATACCCCTTG TGATCGTGTC AGCCCTOACT TTTAT CTOTC 2520 
TAOTGOTTCr TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAOACT OC ACACY m' 2580 
60 ACTTAGAGGA CRGTACATOC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 
TTTCAGATGA TGTOGGAGCA ATTCXAATAA AGCACTTTCC AAAO CATO TT GCaOATTTAC 2700 
ATGCAAQTAQ TGGGTTTACT QAAGAATTTG A6ACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 
AQAGCTOTAC TOTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAA6C 2820 
ACAAQAATOQ ATAC3VTAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGOlCAaC 2660 
65 TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATOTT GATGGCTACA 2940 
ACA6ACCAAA AGCTTATATT GCT6CGCAA0 GCCCACPOAA ATCCACAGCT GAAGATTTCT 3000 
G6AGAATQAT ATGGQAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTOGTGGAGA 3060 
AA0GAAGGA6 AAAATGTGAT CAGTACTGGC CTGCOQATGG GAGTOAQGAG T AOGGQA ACT 3120 
TTCTGGTCAC TCAQAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAG6 AATTTTACTC 3160 
70 TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA OGTGTGGTCA 3240 
CACAGTATCA CTACAOGCAG TGOCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT OCTAGACAGT ATGTT GCAGC 3420 
__ AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC OGTTCACAAA 3460 
75 GAAATTATTT GGTACAAACT GAGGAGCAAT ATOTCTTCAT TCATGATACA CT60TTGAGG 3540 
OCATACTTAG TAAAGAAACT GAOOTGCTOQ ACA6TCATAT TCATGCCTAT GTTAATGCAC 3600 
TCCrCATTCC TGGACCAOCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCXTCTAAA GCAAT GCAAC AG GGAA AAGA 3720 
ATOGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TG6CATTTCA TC0CTGA8TG 3780 
80 GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGQG CTATTACCAG AGCAATGAAT 3840 
TCATGATTAC OCAGCACCXTT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATOGO 3900 
ACCATAATGC OCAACTGGTG GTTATGATTC CTGATG6CCA AAA CATGG CA GAAGATGAAT 3960 
TTGTTTACTO GOCAAATAAA GATOAGCCTA TAAATTOTGA GAGCTTTAAG GT CACT CTTA 4020 
TGGCTGAAGA ACACAAATGT CTATCTAA7G AGGAAAAACT TATAATTCAO GACTTTATCT 4080 
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TAGAAGCTAC ACAGQATGAT TATQTACrrO AAGTGAG6CA CTTTCAOTCT CCTAAATGGC 4140 

CAAATCCAGA TAGOCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAQAAG 4200 

CTGCCAATAG GGATG GG CCT ATOATTGTTC ATGATGAGCA TGQAGGAQTO ACGGCAGGAA 4260 

CTTTCTGTGC TCTOACAACC CTTATGCACC AACTAOAAAA AOAAA ATTCC GTGOATOITT 4320 

5 ACCAGGTAGC CAAGATGATC AATCTOATGA GGCCAGGAGT CTTTGCTOAC ATTGAGCAGT 4380 

ATCAOTTTCT CTAGAAAOTO ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 

OCaVOCTCTCT GQACAGTAAT GGTGCAQCAT TGCCTGATGG AAATATAGCT QAGAGCTTAG 4S0O 

AGTCriTAOT TTAACACAQA AAGGGQTCGO GGGACTCACA TCTQAGCATT GTTTTCCTCT 4S60 

TCCTAAAATT AGGCAGGAAA ATGAGTCTAQ TTCTGTTATC TGTTGATTTC CCATCACCTO 4620 

1 0 ACAGTAACTT TCATSACATA GGATTCTGCC GCCAAATTTA TATCATTAAC A ATOTG TGCC 4680 

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACrA AAATGATTOA ATTTTACAGT 4740 

ATTTCTAAGA ATGOAATTGT GQTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800 

TTATAQAGGT TAGGAATTCC AAACTACAGA AAATOTTTGT TTTTAGTGTC AAATTTTTAO 4860 

CTGTATTTGT AGCAATTATC AGGTTTGCTA QAAATATAAC TTTTAATACA GTAGOCTGTA 4920 

15 AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAOTA 4980 

GAAATAATCr GTTACITATT OTAAATACTQ CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 

TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTPVAGT TTTCTAGTTC TGTGTAATTG 5100 

TTTAGTTTAA TGAOOTAGTT CATTAGCTQO TCTTACTCTA CCAO TTTTCT QACATTGTAT 5160 

TGTOTTACCT AAGTCATTAA CTTTGTTTCA GCATQTAATT TTAACTTTTG TGGAAAATAG 5220 

20 AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACOUlA CATTOTTCAA 5280 

ATGOTTTTTA TCOIAGGAAT T6CAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID N0» 378 Protein sequence i 
25 Protein Accession #i BOS sequence 

1 11 21 31 41 51 

i I 1 I i I 

MVPKASKITP HWGKCNMSSD GSEH3LEGQK FPLEMQXYCP DADRFSSPEB AVKOKGKLRA 

30 LSILFEVGTB ENLDPKAIID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 
PCTDTVDMIV FKErrVSISBS QliAVPCEVLT MQQSGyVMLM DYLQNNFRBQ QYKPSRQVFS 
SYTGKEEIHE AVCSSEPSIV QADPBNYTBL LVTHERPRW YDTMIEKPAV LYQQLDGEDQ 
TKHEPLTDGY QDUSAILNML LPNMSYVLQI VAICTNGLYG KYS DQLIVD M PTDNPBLDLP 
PELIGTBEII KEEEEGKDIE EGAIVNPGRD SAINQIRKKE PQISTTTHYN RIGTKYNEAK 

35 TNRSPTRGSE F6GKGDVPNT SLMSTSQPVT KLATEKDISL TSQTVTELPP HTVBGTSASI. 

NDGSKTVIiRS PHMNIiSGTAE 8IMTV8ZTEY BBBSUiTSFK LDTGAEDSSG 8SPAT8AIPF 
ISailSQGYl FSSEMPBTIT TOVLIPBSAR NA8BDST8S0 SEE8UCDPSM BGNVWFPSST 
DITAQPDVGS GRESPLOTNY TBIRVDBSEK TTKSFSAGPV MSQGPSVTDL EMPHYSTPAY 
FPTBVTPHAF TPSSRQQDLV 8TVNWYSQT TQPVYNBASN SSHESRIQLA EQLBSEKKAV 

40 IPLVIVSAIiT PICLWLVGI LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 
IPIKHFPKHV ADLHASSGFT BBPBTLKBPY QBVQSCTVDL GITADSSMHP DNXHXNRyiN 
IVAVDHSRVK lAQLAEKDGK LTDYIKAMYV DGOmPKAYX AAQOPUtSTA EDPWRMIWER 
NVBVIVMITN LVEKGHRKCB QYWPADGSEB YGNPLVTQKS VQVLAYYTVR NFTLRNTKIK 
KGSQKGRPSQ RWTOYHYTO MPDMOVPEYS LPVLTPVRKA AYAKRHAVOP VWHCSAGVQ 

45 RTOTYIVIiDS MLQQIQHEGT VNIPGFLKHI RSQRNYLVQT BBQYVPIHDT LVEAILSKET 
BVU3SHIHAY VNALLXPOPA GKTKIiEKQPQ LLSQSNIQQS DYSAALKOOI .REKMRTSSII 
PVBRSRVOIS SL86BQTDYI HASYIMOYYQ SNEFIITQHP LLHTIKDPHR MIWDHHAQLV 
VMIPDGQNMA EDEPVYWPNK DBPINCESFK VTLMAEEHKC LSNEEKIiI IQ DPILEATQDD 
YVLBVRHF^C PKHPNPDSPI SKTFELISVI KBEAANRDGP MIVHDBHGGV TAGTFCALTT 

50 U4HQLEKENS VDVYQVAKMI NLMRPGVFAD lEQYQPLYKV ILSLVSTRQB ENPSTStDSN 
QAALPDQIZA BBLESIiV 

Seq ID NO: 379 DMA sequence 
Nucleic Acid Accession #i EOS sequence 
55 Coding sequences 148-4632 

1 11 21 31 41 51 

I I I ) t t 

CACACATAOQ CACGCACGAT CTCACTTOQA TCTATACACT GGAGGATTAA AACAAACAAA 

60 CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TG AGAAO CAG AGGAGCCGCA 120 

CGGOSAGGGG CCGCAGACCO TCTOQAAATG CGAATCCTAA AAOGTTTCCT OGCTTGCATT 180 

CAOCTCCTCT GTOTTTGCCQ CCTQGATTGO 6CTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTOTTCAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTOGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

65 CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTOGAA 420 

AACACATTCA TTCATAACAC T06GAAAACA GIGGAAATTA ATCTCR CTAA T6ACTAG0GT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTOTTT AAAGCAAGCA A6ATAACTTT TCACTGGGGA 540 

AAATOCAATA TQTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATQCAAA TCTACTGCTT TGATGCWJAC CGATTTTCAA GTTTTGAGGA AGCAQTCAAA 660 

70 GQAAAAGGQA AGTTAAGAQC TTTATCXaTT TTGTTTGAQO TTOGG ACAQA AGAAAATTTG 720 

QATTTCAAAG OGATTATTGA TGQAGTCXIAA ACTOTTAOTC GTTTTGGOAA GCAfiGCXOCT 780 

TTAGATCCAT •rCATACTGTT GAACCTTCTO CCAAACTCAA CTOACAAaTA TTOCATTTAC 840 

AATOGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATPGT TTTTAAAGAT 900 

ACAOTTAGCA TCTCTQAAAG CCAGTTGQCT GTTTTTTGTQ AAOTTCTTAC AATGCAACAA 960 

75 TCTGOTTATQ TCATGCTQAT GQACTACTTA CAAAACAATT TT0GA6AGCA ACA GTACAAG 1020 

TTCTCTAGAC AGGTOTTTTC CTCATACftCT GGAAAGGAAO AQATTCATOA AGCAfc^ ^LYXflT 1080 

AGTTCRGAAC CAGAAAATQT TCAGOCTQAC CCAGAQAATT A TACCAO CCT TCTTOTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATQATACC ATGATTQAGA AOTTTQCAGT TTTGTAOCAG 1200 

CWJrrGQATG GAQAGGACCA AACCAAOCAT GAATTTTTGA CAQATGCCTA TCAAGACTTG 1260 

80 GCTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATO TTCTTC AGAT AOTAGCCATA 1320 

TOCACTAATG GCTTATATGO AAAATACAOC GACGAACIGA TTOTOQACAT GOCTACIGAT 1380 

AATCCIGAAC TTQATCTTTT CCCTGAATTA ATTGGAACTO AAOAAATAAT CAAGGAGGAG 1440 

QAAGAGGGAA AAGACaTTGA AOAAGGOQCT ATTOXOAATC CTGOTRGAOA CACTGCTACA 1500 

AACCAAATCA GOAAAAAGGA ACCOCAQATT TCTACCAGAA CACACTACAA TGQCATAGGG 1560 
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AC3GAAATACA ATGAAGCCAA OACTAACCGA TCXXX»ACAA GAGGAAGTOA ATTCTCTOGA 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AAOCAGTCAC TAAATTAGOC 
ACA6AAAAAG ATATTTCCTT OACTTCTCAG ACTOTQACTa AACXGCCACC TCACACTGTQ 
QAACGTACTT CAGCCTCTTT AAATQATGOC TCTAAAACTQ TTCTTAGATC TCCACATATG 
5 AACTTCTCGO GQACT6CASA ATOCTTAAAT ACAGTTTCTA TAACAQAATA TGAGGAOQAG 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GOAGCTGAAQ ATTCTTCAQQ CTCCAG TCCC 
GCAACTTCTG CTATCCCATT CATCTCTOAO AACATATCCC AAGQOTATAT ATTTTCCTCC 
GAAAACCCAG AGACAATAAC ATATOATGTC CTTATAOCRQ AA TCTGCTA G AAATGCTTCC 
QAAGATTCAA CTTCATCAOO TTCAGAAGAA TCACTAAAGO ATCCTTCTAT GGAGQ6AAAT 
10 GTGTGGTrrC CTAGCTCTAC AGACATAACA GCACnQGCCO ATGTTGGATC ACSGCAGAGAO 
AOCTTTCTCC AGACTAATTA CACTGAGATA O QTGTTGATQ AATCTGAGAA GACAACCAAG 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAG ATCT GQAAATGCCA 
CATTATTCTA CCTTTCCCTA CTTCCCAACT GAOGTAACAC CTCATGCTTT TACCCCATCC 
TCCAGACAAC AGGATTTGGT CTOCAOGGTC AAOGTGGTAT ACTO^AGAC AAOOCAAOOQ 
15 GTATACAATQ AGGCCS^GTAA TAGTAOCCAT GAOTCTGGTA TTQGTCTRGC TGAw w™ 
GAATCG6AQA AGAAGQCAGT TATACCCCTT GTGATCGTGT CAGCCCTOAC TTTTATCTGT 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGQAGGAAAT GCTTCCAGAC TGCAOVCTTT 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATC CACAC CTCCAACACC TATCTTTCCA 
ATTTCAGATQ ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAA GCATG T TQCRGATTTA 
20 CATGCAAGTA OTGGGTTTAC TQAAGAATTT GAGACACTGA AAGAGTrrTA CCAGGAAQTO 
CAGA0CT6TA CTGTTQACTT AGGTATTACA OCAGACAOCT CCAACCACXX: AGACAACAAG 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 
CTTGCTGAAA AGGATGGCAA ACTQACTGAT TATATCAATG CCAATTATGT TGATOGCTAC 
AACAGACXAA AAGCTTATAT TGCTGCCCAA GGC0C31CTGA AATOraCAGC TQAAGATTTC 
25 TGGAGAATOA TATG6GAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTOGAG 
AAA06AAGGA GAAAAT6TGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GT AOGGQA AC 
TTTCTOGTCA CTCRSAAQAG TGTGCAAGTG CTTGCCTATT ATACTOTGAG GAATTTTACT 
CTAAOAAACA CAAAAATAAA AAAGGQCTCC CAGAAAGGAA GACCCAGTGG ACGTG TGGTC 
ACACAGTATC ACTACAOGCA GTGGCCTGAC ATGGGROTAC CAOAOTACTC CXnGCCMSTG 
30 CTGACCTTTG T6AGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 
CACTGOUSTG CTGGAaXTQO AAGAACAGGC ACSITATATTG TGCTAGACAG TATGTTGCAG 
CAOATTCAAC ACGAAGGAAC TGTCAACATA TTTGQCXTCT TAAAACACAT COGTTCACAA 
AGAAATTATT TGGTACAAAC TGAGQAQCAA TATGTCTTCA TTCATGATAC ACTOOTTGAG 
GCCATACTTA OTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATOCA 
35 CTCCTCATTC CTGGACCAGC AGGCAAAACA AAQCTAQAGA AACAATTCCA GGGTCTCACT 
CTGTCACOCA GGCTGQAOTG CAQAGGCACA ATCTGGGCTC ACTGCAACCT TCCTCTCCCT 
GGCTTAACTO ATCCTCCTAC CTCAGCCTCC CQAQIGGCTG GGACTATACT CCTGAGCCAG 
TCAAATATAC AGCAQAGTGA CTATTCTGCA GCCCTAAAGC AATOCAACRG G G iB y AOAAT 
CQAACTTCTT CTATCATCCC TGTGGAAAQA TCAAGGGTTG GCAXTTCATC CCTGAGTGQA 
40 GAAGGCACAQ ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATOAATTC 
ATCATTACCC AOCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGQAC 
CATAATCCCC AACTOGTGCT TATOATTCCT GATGGCCAAA ACATGGCAGA AGATOAATTT 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CAv.-i\.TlATG 
GCTGAAQAAC ACAAATGTCT ATCTAATQAQ GAAAAACTTA TAAT TCAGG A CTTTATCTTA 
45 GAAGCTACAC AGGATOATTA TQTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATG6CCA 
AATCCROATA GCCCCATTAG TAAAACTTTT QAACTTATAA GTGTTATAAA AGAAGAAGCT 
GCaUVTAGGG ATGGGCXrTAT OATTGTTCAT GATGA6CATG GAGGAGTGAC GGCAGGAACT 
TTCTGIOCTC TGACAACXXn" TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATQTTTAC 
CAGGTAGCCA AGATGATC3WV TCTGATGAGG CCAGGAGTCT TTOCTQACAT TGAGCAGTAT 
50 CAGTTTCTCT ACAAAQTOAT CX:TCAGCCTT GTGGGCACAA GGCAGGARGA GAATCCATCC 
ACCTCTCTOO ACAGTAATGS TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAO 
TCTTTAOTTT AACACAGAAA GGGGTGGGOQ GACTCACATC TGAGCATTGT TrTCCTCTTC 
CTAAAATTAO GCAGGAAAAT CASTCTAGTT CTOTTATCTG TTGATTTCCC ATCACCTOAC 
ACTAACTTTC ATGACATAGQ ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 
55 TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 
TTCTAAGAAT GGaATTGTGQ TATTTTTTTC IGTATTGATT TTAACAQAAA ATTTCAATTT 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATOTTTOTTT TTA6TGTCAA ATTTTTAGCT 
GTATTTGTAG CAATTATCAO GTTTGCTAGA AATATAACTT TTAATACAGT AGCXTOTAAA 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGXATTCAC CTAAAGTAGA 
60 AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 
ATAATTOTAQ ATTTTTATAT TTTACTACTG AaTCAAGTTT T CTAGTT CTG TGTAATTGTT 
TAGTTTAATG AOQTAOTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTQ 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATSTAATTTT AACTTTTGT6 GAAAATAGAA 

ATACXrrrCAT tttgaaagaa gtttttatqa gaataacacc ttaccaaaca ttottcaaat 

65 GGTTTTTAIC CAAGGAATTG CAAAAATAAA TATAAATATT OCCATTAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA A 
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Seq ID VOt 380 Protein aeouence i 
Protein Acceesion #t EOS eequence 

1 11 ai 31 

I I 1 I 

MRILKRFLAC IQLLCVOUiD WANGYYRQQR KIjVEBIGWSY 
QSPINIDEDL TQVNVNLKKL KPQGWDKTSL EMTPIHNTGK 
FKASKITFHW GKCNMSSDGS EHSLBGQKFP LEMQIYCFDA 
XLPBVGTBEN LDFKAXXDQV E9VSRFQXQA AUSPFlUiNL 
TDTVDWIVFK DTVSlSBSQL AVFCBVLTMQ QSGYVMLMDY 
TCKEBIHEAV CSSBPQIVQA DPENYTSLLV TWBRPRWYD 
HEPLTDGYQD LOAILNNLLP NMSYVLQIVA ICTlIGLYOKy 
LI6TEBIIKB BEEGKDZ£B9 AIVNPGBDSA TNQZRKKEPQ 
XtSPTBOSEFS GKODVPOTSL N8TSQPVTXL ATSXDISLTS 
GSKTVLR8PH MNLSGTAESL NTVSITEYEB BSLLTSFKU} 
BNISQOyXFS SBNPETXTyP VLXPB8ARMA SEDST8SGSB 
TAQPDV0S6R ESFLOnnTTB XRVDBSBKTT KSF8AGFVMS 



41 

I 

TGALKQKNWG 
TVEINIiTNDY 
DRFSSFBBAV 
LPNSTDKYYI 
LQNNPREQQY 
TMIBKPAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TQAEDSSGSS 
BSLKDPSMEG 
QGPSVTDLBM 



51 
I 

KKYPTCNSPK 
RVSGOV8EKV 
KGKOKLRALS 
YNQSLTSPPC 
KPeRQVPSSY 
QQUX3EDQTK 
DNPBIiDLFFB 
GnCyNBARTN 
VGGTSA8UID 
PATSAIPPIS 
NVWFPSSTDI 
PHYSTPAYFP 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



1102 



wo 03/042661 



PCT/US02/36810 



TEVTPHAPTP SSRQQDLVST VUWYSOTTQ PVYNEASNSS HBSRiaiABQ LBSBKKAVIP 780 

LVIVSALTPl CLWLVGILI YWRKCPQTAH PYLKDSTSPR VISTPPTPIP PISDDVGAIP 840 

IKHPPKHVAD LHASSGPTBB FETLKBFYQB VQSCTVDLOI TADSSSHPDN KHJCNRYINIV 900 

AYDHSRVKIA QLABKDGKLT DYmftlWVDO TORPKAYIAA QGPLKSTABD FWRMIWEHNV 960 

5 EVIVMITNLV BKORRKCDQY WPADGSBEYQ HFLVTQKSVQ VIAWTVRNF TIjRNTKIKKO 1020 

SQKGRPSGRV VTQYHYTQWP DKQVPBYSLP VLTFVRKAAY AKRHAVGPW VHC8AGVGRT 1080 

OTYIVLDSML QQIQHEGTVN JFGFLKHIRS QRNYLVQTEE QYVFIHDTLV BAILSKBTBV 1140 

liDSHIKAYVN ALLIPGPAGK TKLEKQPQGL TMPRLECSG TISAHCHLPL PGLTDPPTSA 1200 

SRVAGTIULS QSNIQQSDYS AALK0C3JREK NRTSSIIPVB RSRVCISSLS GBQTDYXNA8 12fi0 

10 YIMGYYQSNE FIITQHPLLH TIKDFHRMIW DHNAQLWMI PDGQNMAEDB FVYWPNKDSP 1320 

INCESFKVTL MAEBHKCLSM EEKLIIQDPI LEATQDDYVL BVRHFQCPKH PNPDSPI6KT 1380 

PELISVIKBE AANRDGPMIV HDEHGGVTAG TPCALTTLMH QLBKKNSVDV YQVAKMDILM 1440 
RP6VFADIBQ YQFLYKVZLS LVGTRQEQfP STSLDSNGAA LPDGNIAESti ESLV 

15 

8eq ID NOi 381 DNA eecnience 

Nucleic Acid Accession KH_0028S1.1 

Coding sequence i 148.. 7092 

20 1 11 21 Jl «1 Si 

111)11 

CACACATAG6 CAOGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGASAAOCAG AGGAGC CGCA 120 
CQGCX3AGGGG CXX5CAGACCG TCTGGAAATO C3GAATCCTAA AGOGTTTOCT C3GCTTGCATT 180 
25 CAGCTCCTCT QTGITTaCCG CCTGGATTGQ GCTAATGGAT ACTACaGACA AOVGAGAAAA 240 
CTTGTTGAAO AGATT6GCTG GTCCTATACA GGAGCACTQA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG OCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATQ TGAATCTTAA GAAACTTAAA TTTCAQOaTT GGGATAAAAC ATCATT GGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCA CTAA TQACTACOGT 480 
30 QTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
AAATOCAATA TGTCATCTOA TGQATCAGAQ CATAGTTTAG AAGQACAAAA ATTTCCACTT 600 
GA6ATOCAAA TCTACTGCTT TGATGC3GGAC CGATTTTCAA GTTTTGAGGA AGC3«3TCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGG ACAGA AOAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTOGAA AGTGTTAGTC OTTTTGGGAA GCAGGCTGCT 780 
35 TTAGATCCAT TCATACTGTT GAACCTTCTG CX»AACTCAA CTGACAAGTA TTAC ATTTAC 840 
AAT6GCTCAT TOACATCTCC TCCCTGCACA GAOVCAGTTG ACTOGATTGT TTTTAAAQAT 900 
ACAOTTAGCA TCTCTGAAAG CCAQTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
TCTCQTTATG TCATGCTGAT GGACTACTTA CAAAACRATT TTCGAOAGCA ACAG TAC3VAG 1020 
TTCTCTAGAC AGQTGTTTTC CTCATACACT GGAAAQGAAQ AOATTCATGA AGCAGTTTGT 1080 
40 AOTTCAGAAC CAGAAAATGT TCAGOCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TOGGAAAGAC CTC6AGT0QT TTATOATAOC ATQATTQAOA AOTTTQCAGT TTTGTACCAG 1200 
CAGTTGOATG GA0AG(5ACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTAT3 TTCTTCAGAT AOTAGCCATA 1320 
TGCACTAATQ GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT QCCTACTGAT 1380 
45 AATCCTGAAC TTQATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGQAGGAG 1440 
GAAOAGGGAA AAGACATTGA AQAAGGC3CT ATTOTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA G6AAAAAGGA ACXXXaOATT TCTACCRCAA CACACTACAA TCGCATAGGO 1560 
ACQAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA OAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACX3W3TCAC TAAATTAGCC 1680 
50 AC»fiAAAAA6 ATATTTCCTT GACTTCTCAG ACTGTQACTO AACTGCCACX: TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATOATGOC TCTAAAACTO TTCTTAGATC TCCACATATG 1800 
AACTTQTCGG G0ACTGCA6A ATCCTTAAAT ACAGTTTCTA TAACAOAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTQATACT GQAQCTGAAG ATTCTTCAGG CTCCAGTOCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTQAG AACATATCCC AAOOGTATAT ATTTTCCTCC 1980 
55 GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG .AAATGCTTCC 2040 
QAAGATTCAA CTTCATCAG6 TTCAGAAGAA TCACTAAAGO AT CCTT CTAT GGAGGQAAAT 2100 
(jTUTQgrrrC CTAGCTCTAC AGACATAACA GCACAGCCOQ ATOTTGGATC AGGCAGAGAG 2160 
AGCTTTCrOC AGACTAATTA CACTOAGATA CGTGTTGATG AATCTOAOAA OACAACCAAG 2220 
TCCTTTTCTO CAG6CCCAGT GATGTCACAG GQTCCCTCAG TTACAG ATCT GGAAATGCCA 2280 
60 CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATOCTTT TACCCCATCC 2340 
TCCAOACAAC AGQATTTGGT CTOCAGOGTC AAOQTGGTAT ACTOGCAGAC AACCCAACCG 2400 
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAOTC 2460 
ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTOCTGCTTC AAGTAGTGAT 2520 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCX»TQ TGTCATTTGA ATCCATCCTG 2580 
65 TCTTCCTATO ATGGTOCACX: TTTGCTTCCA TTTTCCTCTG CTTOCTTCAG TAQTGAATTG 2640 
TTTOGCCATC TGCATACAGT TTCTCAAATC CITCCACAAG TTAC TTCAG C TACOSAGAGT 2700 
QATAAGOTGC OCTTOCATGC TTCTCTGCCA GTGGCrGGQG OTQATTTOCT ATTAGAGCCC 2760 
AGCCTTGCTC AGTATTCTGA TGTOCTGTCC ACTACTCATG CTGCTTCAGA GACSGCTGGAA 2820 
TTTGGTAGTO AATCTGGTGT TCTTTATAAA AOGCTTATGT TTTCTCAAOT TGAACCACCC 2880 
70 AGCAGTGATO CXaTGATGCA TGCACXJTTCT TCAGGGOCTG AACCTTCTTA T GCCTTG TCT 2940 
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTQGAAT ACCTGTGCAT 3000 
6ATTCTGTGG OTOTAACTTA TCAGGOTTCC TTATTTAGOG GOOCTAGCCA TATACCAATA 3060 
CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCXXTCTCT 3120 
GOTGATGGGQ AATGGTCTGO AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 
75 GGGCTGACAG CCCTTAACAT TTCTTCACCT OTTTCTOTAG CTGAATTTAC ATATACAACA 3240 

'i t ri wurnti gtoatoataa taaggogctt tctaaaagto aaataatata tggaaatgag 3300 

ACTGAACTGC AAATTCCTTC TTTCAATCAO ATGOTTTACC CTTCTGAAAG CACA GTCAT G 3360 

CCXyUVCATGT ATGATAATOT AAATAAGTTG AATGOGTCTT TACAAGAAAC CTCTOTTTCC 3420 

ATTTCTAGCA OCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGimnT 3480 

80 6ATCATGAGA TTAGTCAAG7 TCGRGAAAAT AACTTTTCAG TTCAACCTAC ACATACT6TC 3540 

TCTCAAGCAT CT GG T G ACAC TTOGCTTAAA CCTGTQCTTA OTGCAAACTC A GAGC CAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATQTTA TCTCCTTCAA CTOG CTCTT ATTTT ATGAG 3660 

ACCTCAOCTT CTTTTA6TAC TQAAGTATTO CTACAAOCTT CCTTTCAOGC TTCTGA TGTT 3720 

QACAOCTTGC TTAAAACTGT TCTTGCAGCT GTGOCCAGTQ ATOCAATATT GGTTGAAACC 3780 



1103 



wo 03/042661 



PCTAJS02/36810 



CCCAAAGTTO ATAAAATTAG TTCTACAATO TTOCATCTCA TTOTATCAAA TTCTQCTTCA 3840 
AOTQAAAACA TGCTGCACTC TACATCTOTA CCAGTTTTTG ATOTGTCGCC TACTTCTCAT 3900 
ATCCACTCTO CTTCACTTCA AGGTTTGACX: ATTTCCTATG CAAGTGAGAA ATATQAACCA 3960 
OTTTTOTTAA AAAOTOAAAO TTCOCaCCAA OTGOTACCTT CTTrOTACWS TAATGATOAG 4020 
5 TTOTTCCAAA CGGCCRATTT OOAOATTAAC CftOGCCCATC CCCCAAAAGG AAGGCATGTA 4080 
TTTOCTACAC CTCTTTTATC AATTQATQAA CCATTAAATA CACTAATAAA T AAGC TTATA 4140 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTOGTAAOGT ATTTGCTGC3T 4200 
ATTCCAAGAQ TrGCTTCTOA TACATTTGTA TCTACTGATC ATTCTGTTOC TATAGQAAAT 4260 
GGOCATOTTO CCATTACAGC TGTTTCTCCC CACAQAOATG OTTCTGTAAC CTCRACAAAG 4320 
10 TTGCTOTTTC CTTCTAAGQC AACTTCTGAG CTGAOTCATA GTOCCAAATC TQATGCOBGT 4380 
TTAQTGCSGTO GTGGTGAAGA TCGTGACACT GATGATQATG GTGATGATGA TQATGACAQA 4440 
GATAGTCATG GCTTATCCAT TCATAAQTGT ATGTCATGCT CATCCTATAO AGAATCACAG 4500 
GAAAAGGTAA TGAATGATTC AGAOVCCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 4560 
ATCTCATACT CACTATCTSA CAATTCTOAA GAAGATAATA GAOTCACAAO TGTATCCTCA 4620 
15 GACAGTCAAA CT00TAT6GA CAfiAAGTCCT GGTAAATCAC CATCAGCAAA TCGGCTATCC 46B0 
CAAAAGCACA ATGATGGAAA AGAGGAAAAT OACATTCAOA CTGGTAGTGC TCTGCTTCCT 4740 
CTCAGCCCTG AATCTAAAQC ATGGGCAGTT CTGACAAGTO ATGAAGAAAG TG6AT CAGGG 4800 
CAAQGTACCT CAGATAGOCT TAATGAOAAT GAGACTTCCA CAGATTTCAG TTTTGCAQAC 4860 
ACTAATGAAA AAOATOCTOA TOOQATCCXG GCAfiCAGOTG ACTCA OAAAT AAC TCCTGGA 4920 
20 TTCCCACAOT CCCCAACATC ATCTQTTACT AQOOAGAACT CM3AAGT0TT OCACOTTTCA 4980 
QAGGCAGAGG CCAGTAATAO TAGCCATQAG TCTaSTATTO GTCTAGCTGIA GGGCTTGOAA 5040 
TCOGAGAAOA AGGCAGTTAT ACCCCTTGTG ATCGTOTCAG CCCTGACTTT TAT CTGTCT A 5100 
GTGGTTCTTO TGGGTATTCT CATCTACTGG AGGAAAT6CT TCCAGACTGC ACACTTTTAC 5160 
TTAGAGGACA OTACATCCCC TAGAGTTATA TCiCACACCTC CAACiA CCTAT CTTTCCAATT 5220 
25 TCASAX6ATG TCGGAGCAAT TCCAATAAAO CACTTTCCAA AG^fCTTGC AGATTTACAT 5280 
6CAAGTAGTQ OaTTTACTGA AOAATTTGAO ACACTGAAAO AGTTTTACCA GGAAGTGCAG 5340 
AQCTOTACTG TTOACTTAGG TATTACAQCA GACAGCTCCA ACCACCCAQA CAACAAGCAC 5400 
AAGAATCGAT ACATAAATAT OGTTGCCTAT GATCATAGCA GGOTTAAGCT AGCACAGCTT 5460 
QCTQAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTQA TGGCTACAAC 5S20 
30 AGACCAAAAQ CTTATATTGC TQCCCAAGGC CX3VCTGAAAT CCACAOCTGA AOATTTCTGG SS80 
AGAATOATAT GGGAACATAA TGTCGAAGTP ATTGTCATGA TAACAAACCT OGTGQAGAAA 5640 
GGAAGGAGAA AATGTQATCA QTACTGGCCT GCC6ATGGGA GTGAGQAGTA CGGGAACTTT 5700 
CTGGTCACTC AGAAGAGTCT GCAAGXGCTT GCCTATTATA CIGTGAGOAA TTTTACTCTA 5760 
AGAAACACAA AAATAAAAAA OOOCTCCCAG AAAGGAAGAC CCROTOaAOS TOTOOTCACA 5820 
35 CAGTATC3«rr ACACGCAGTO GCCTGACSVTO GGA0TACCA3 AGTACTCCCT GCCAGTGCTO 5880 
ACCTTT6TQA 6AAAGGCAGC CTATGCCAAO C0CX3VTGCAG TGGGGCCTGT TOTOGTCCAC 5940 
TGCAGTGCTG 6A0TTGGAAG AACAGGCACA TATATTGTGC TAQACAGTAT GTT6CAGCAG 6000 
ATTCAACAOG AAGQAACTGT CAACATATTT G6CTTCTTAA AACACATCCX3 TTCACAAA6A 6060 
AATTATTTOQ TACAAACTOA GOAGCAATAT OTCTTCATTC ATOATACACT OGTTGA06CC 6120 
40 ATACTTAQTA AAGAAACTGA GGTGCTGGAC A6TCATATTC ATOCCTATGT TAATOCACTC 6180 
CTCATTCCTG GACCAGCAOO CAAAACAAAG CTAQAGAAAC AATTCXaCCT CCTGAGCCAG 6240 
TCAAATATAC AGCAGAGTGA CTATTCTGCA QCCCTAAAGC AATGCAACAG GGAAAAGAAT 6300 
CGAACTTCTT CTATCATCCC TQTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 6360 
GAAGGCACAG ACTACATCaA TGCCTCCTAT ATCATGOGCT ATTACCAGAG CAATGAATTC 6420 
45 ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCT6GAGGAT GATATGGGAC 6460 
CATAATGCCC AACTGOTGGT TATGATTCCT GATGGCX3VAA ACATGGCAGA AGATGAATTT 6540 
GTTTACTOGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTC TTATQ 6600 
6CT6AAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAAT TCAGQA CTTTATCTTA 6660 
GAAGCTACAC AGQATGATTA TGTACTTOAA GT6AGGCACT TTCAGXeTOC TAAATGGCOi 6720 
50 AATCCAGATA GCCCCATTAQ TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 6780 
GCGAATAOGO ATGGGCCTAT GATTGTTCAT GATGAGCATG QAGGAGTGAC GGCAGGAACT 6840 
TTCIOTGCTC TX5ACAACCCT TATGCACCAA CTAQAAAAAQ AAAATTCCGT GQATGTTTAC 6900 
CAGGTAGCXaV AOATGATCAA TCTQATGA6G CCAGGAOTCT TTGCTGACAT TQAGCAOTAT 6960 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT aXOAGCACAA GGCAGGAAGA OAATCCATCC 7020 
55 ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCT6ATGGAA ATATAGCTOA 6AGCTTAQAG 7080 
TCTTTAGTTT AACACAGAAA GGGGTGGGOG QACTCACATC TGAGCATTGT TTTCCTCTTC 7140 
CTAAAATTAG GCAGQAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACC TGAC 7200 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TCTGTQCCTT 7260 
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATOATIOAAT TTTACAGTAT 7320 
60 TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA A TTTCAA TTT 7380 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTOTTT TTAGTGTCAA ATTTTTAOCT 7440 
OTArrrOTAO CAATTATCAG GrrrOCTAGA AATATAACTT TTAATACRGT AGCCTOTAAA 7500 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAA GTAGA 7560 
AATAATCTGT TACTTATTGT AAATACTGCC CTAOTQTCTC CATCGACCAA ATTT ATATTT 7620 
65 ATAATTGTAG ATTTTTATAT TTTACTACTO AGTCAAGTTT TCtAGTTCTO TGTAATTOTT 7680 
TAOTTTAATO ACGTAOTTCA TTAGCTGGTC TTACTCTACC AGTTTTCtaA CATTGTATTO 7740 
TOTTACCTAA GTCATTAACT TTGTTTCAOC ATGTAATTTT AACTTTTQTG GAAAATAGAA 7800 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACRCC TTACCaAACA TTGTTCAAAT 7860 
GOTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATP GCXATTAAAA AAAAAAAAAA 7920 
70 AAAAAAAAAA AAAAAAAAAA A 



8eq ID KOi 382 Protein sequence 
Protein Acce SB ion #: NP_002842.1 

75 1 11 21 

I I I 

KRIUCRPXAC IQLIjCVCRLD HAKGYYRQQR 
QSFXKIDEDli TQVNVNLKKL KFQGWDKTSL 
FKASKITPBW GKCNMSSDGS SISIjEOQKPP 
80 IIiFEVOrBEH LOFKAIIDGV ESV8RFGKQA 
IDTVDHIVFR DTVSISESQL AVFCBVLTKQ 
1GKBEIBBAV CSS6PENVQA DPEHYTSIiLV 
HEPLTDGYQD LGAIUQfLLP NMSYVIiQIVA 
LXGTBBIXXB BBE6KDZBBG AZVNPGRDSA 



31 41 51 

I 1 I 

KIiVBEIGWSY TGALNQKNWG KKYPTC2ISPK 60 

BNTPIHNTGK TVBINLTODY RVSGGVSEMV 120 

LEMQZYCFDA DaFSSFBBAV KGR0KLRAL8 180 

ALDPPlLUn* LPNSTDKYyi YNOSLTSPPC 240 

QSGYVMLHDY LQNNPRBQQY KPSRQVPSSY 300 

TWERPKWYD TMIEKPAVLY QQU3GBDQTK 360 

ICINGLYGKY SDQLIVDMPT DNPBIiDLPPB 420 

IKQZRKKBFQ ISTTTHYMRI GTKYNBAKHI 480 
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RSPTRGSBPS OKQDVPNTSL NSTSQPVTKL ATBKDISLTS QTVTBLPPHT VBGT8ASLND 540 

GSKTVLRSPH KNLSGTAESL NTVSITBYEE ESLLTSFKLD TGABDSSGSS PATSAIPPIS 600 

EMISQQYIPS SEMPETITYD VLIPESARNA SEDST8SQSB BSLKDPSMBG NWFPSSTDI 660 

TAQPDVGSGR BSPLQTNYTB IRVDESBKIT KSPSAOPVMS QGPSVTDLEM PHYSTFAYFP 720 

5 TBVTPHAPTP SSRQQDLV8T VHWYSQTTQ PVYNGBTPLQ PSYSSBVFPL VTPLIiLDHQI 780 

U7TTPAAS8S DSALHATPVF PSVDVSFBSI LSSYX)GAPI<L PFSSASFSSB LFRHLHTVSQ 040 

ILPQVTSATE SDKVPLHASL PVAGGOLULB PSLAQYSDVL 6TTHAASBTL BPaSBSCVLY 900 

XTLMPSQVBP PSSDAMKHAR SSGPEPSYAL SDNB6SQHIP TVSYSSAIPV KDSVCVTYQa 960 

^ SIiPSGPSRIP ZFKSSLXTPT ASLIiQPTiXAIi S6DGBWS6AS SDSEFLLPST DGLTAliNISS 1020 

10 PVaVABPTYT TSVFGDDNKA LSK8EXIYGN CTELQIPSFN BMVYPSBSTV MPMMYDNVNK 1060 

LNASLQBTSV SISSTKGMPP GSLAHTTTKV FDHEI9QVPB NNPSVQPTHT VSQAS6DTSL 1140 

KPVLSANSBP ASSDPASSEM LSPSTQLLPY ETSASPSTBV LLQPSPQASD VDTIiXiKTVLP 1200 

AVPSDPII.VB TPKVDRISST MLHLXVSNSA SSENMLHSTS VPVFDVSPTS WIHSASLQGb 1260 

^ - TISYASEKYB FVUjRSBSSH QWPSXiYSND BLFQTANLBI KQAHPPKGHB VPATPVLSZD 1330 

IS BPIiNTLINKL IRSDE1LT8T K8SVT6KVFA GIPTVASDTP VSTDB9VPIG NQRVAITAVB 1380 

PHBD6SVT8T KLLPPSKATS BLSBSAKSDA GLVGGGEDGD TDDDGDDDDD RDSDGLSIHK 1440 

CMSCSSYRES QBKVMNDSDT HENSLMDQNN PISYSLSEKS BEDNRVTSVS SDSQTGMDRS 1500 

PGKSPSAKQL SQKRNDGKEE NDIQTGSALL PUSPBSKAWA VLTSDEBBGS GQGTSOSLME 1560 

NBTSTDPSPA DTNEXDADGZ IiAAjSDSBZTP 6PPQSPTSSV TSENSBVFBV 8BABASNSSH 1620 

20 BSRZGLAEGL BSBKKAVXPL VXV8ALTFIC LWLVGZLIY WRRCFQTAHF YIjEDSTSPBV 1$B0 

ISTPPTPIFP I8DDV6AIPI KHPPXHVADL HASSGFTEBF BTLKEFYQEV QSCTVDI/JIT 1740 

ADSSNHPDNK HKNRYINIVA YDHSRVKIiAQ LREKDGKLTD YINANYVDOY NRPKAYIAAQ 1800 

OPLKSTABDP WRMIWEKNVE VIVMIINLVB KGRRKC3)QYH PADGSEEYQN FLVTQKSVQV 1860 

LAYYTVRNPT UaJTKIlOQGS QRQRPSGRW TQYHYTQWPD M0VPBY8LPV WPVRKAAYA 1920 

25 KRHAVGPVW HCSAOVGRTG TYIVbDSMLQ QXQHBGTVIII FGFLKHIRSQ lOIYLVQTBBQ 1980 

YVFIHDTLVB AILSKBTEVL DSHIBAYVNA LLIPGPAGKT KLEKQFQIiLS QSNIQQSDYS 2040 

AALKQCNREK NRTS8IXPVE RSRVOISSLS OEOTDYINAS YIMGYYQSKB PIITQHPIiLH 2100 

TZKDFWRMIH DHNAQLWMX PDGQNMAEDE FVYWPNKDEP XMCBSFKVTL MAEEHKCLSN 2160 

BBKLIZQDPI LBATQDDYVL EVRBFQCPKN PKPDSPZ8KT PELZSVIKBB AA13I1DGFMIV 2220 

30 KDEHG6VTAG TPCALTTLMK QLBXENSVDV YQVARNXMUI RPGVFADIEQ YQFLYKVZLS 3280 
LVSTRQBENP STSLDSNGAA LPOGNXAESIi BSLV 

8eq ID NO: 383 DNA sequcgice 
Nucleic Acid Accession #t NM_005688.1 
35 coding sequence! 126.. 4439 

1 11 21 31 41 SI 

I I I I t 1 

C06GGCAGGT 66CTCAT6CT GQG6AG0GT0 GTTQAGGG6C TG6G6GGGTT 6TCCTG6AGC 60 

40 AGGG6C0CA0 OAATFCTOAT GTGAAACTAA CAGTCTGTGA GCCCTOGAAC CTC06CTCAG 120 

AGAASATGAA GGATATOQAC ATAGOAAAAQ AGTATATCAT CCCX3VGTCCT GGGTATAQAA 180 

GTGTGAGGQA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAQTTCA 240 

08ASAACT00 ACOGTTGGAA T6CCAAGAT0 CCTTGOAAAC AOCAGCXXXIA GCOSAGGGCC 300 

TCTCXCTTGA TGCCTCCATG CATTCTGAaC TCAOAATGCT GGATGAGGAO OVTCCCAAGO 360 

45 GAAAGTACCA TCATGGCTTG AGT6CTCTGA AGGCCSITCGO GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT OTATOACTTT TTOGTOGCTT TCTTCTCTGQ 480 

CCOGTGTGGC CCACAAQAAG GGGGAOCTCT CAATGGAAGA OGTGTGGTCT CTGTCXS^AGC 540 

AOQAGTCTTC TGACGT6AAC TGCAGAA6AC TAGAQAOACT GTGGCAA6AA GA6CT6AATG 600 

_^ AAGTTGGQCX: AOACGCTOCT TCCCTQCQAA GGCrTTOTSTQ GATCTTCTGC GGCACGAOOC 660 

50 TCATCCTOTC CATOQTGTGC CTGATGATCA OGO^OCTOGC TQOCTTCAGT 0GACCA6CCT 720 

TCATGGTGAA ACACCTCTTO GAGTATACCC AGGCAACA6A GT CTAA CXTTG CAGTACAGCT 780 

TQTTQTTAGT GCTGGQCXrrC CTCCTQAOGG AAATOGTGOG GTCTTGOTCQ CTTQCACTGA 840 

CTTGGQCATT GAATTAOOCSA ACOGGTQTCC OCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAQAAQAT CCTTAAGTTA AAGAACATTA AAOAGAAATC CCIGG OTGAO CTCATCAAGA 960 

55 TTTGCTCC3A CmTGGGCAQ AGAATGTTTG AGGCAGCAGC OGTTSGCAGC CTGCTG6CTG 1020 

GAGGAOCOGT TGTTGCCATC TTAGQCATGA TTTATAATGT AATTATTCTO GGACCAACAO 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACOCAGC AATGATQTTT GCATCAOSGC 1140 

TC3^CAGCATA TTTCAGGAjQA AAATGCGTGG COGCCAOQGA TGAAOGTGTC CA GAAGAT GA 1200 

ATGAAlGTrCT TACTTACATT AAATTTATCA AAATGTAIGC CTQQCITCAAA OCATTTTCTC 1360 

60 A6AGTGTTGA AAAAATC06C 6A6QA66AGC GTOGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTO GCTCCCATTG TGGTGGTGAT TGCCAGOGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTOGATCTGA CAGCAGCACA GGCTTTCACA GTGGTQACAG 1440 

TCTTCAATTC CATOACTTTT OCTTTGAAAG TAACACOQTT TTCAGTAAAG TCCCTCTCAO 1500 

AAGCCTCRGT OQCTeTTOAC AGATTTAflOA tfrmf TTCT AATQQAAGAO GTTCACATOA 1560 

65 TAAAQAACAA ACXIAGCCAOT CCTCACATCA A6ATAGAGAT 6AAAAATGGC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAQT ATCCAGAACT 060CCAAGCT QACCCCCAAA AT6AAAAAAG 1680 

ACAAGAGG6C TTCCAGGGQC AAGAAAGAOA AGGTGAGGCA GCTGCAGCGC ACTGAQCATC 1740 

AGGOaOTQCT GGCAGAGCAG AAAGGCCACC TCCTCCTQ6A CAGTGAOQAG CGGGCCAGTC 1800 

COQAAGAGGA AGAAGGCAAO CACATCCACC TGGGOCACCT GOQCTTACAG AGGACACTGC 1860 

70 ACAGCATOGA TCTGGAOATC CAAOAGGGTA AACIGGTTGG AATCTGGGGC AGTGTOGGAA 1920 

GTCGAAAAAC CTCTCTCATT TCAGCCATTT TAGQCCAGAT GACX3CTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATOTGG CCCAGCAGGC CTGGATOCTC AATGCTACTC 2040 

TGA6AQACAA CATCCTGTTT GOGAAGGAAT AT6A7GAAQA AAGATA CAAC TCIGTGCTGA 2100 

ACA6CTGCTQ CCTQAGGCCT GAGCTOGCCA TTCTTCOCAG CA60QA 0CTG A aWAG ATTQ 2160 

75 GAGAG0QAG6 AOCCAACCTG AGOGGTGGGC AGOGCCAGAO 6ATCAG0CTT GCC0GG6CCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG AOGACCCOCT CAGTGCCTTA GATGCCCATG 2280 

TOQGCAACCA CATCTTCAAT AGTGCTATCC GGAAAOVTCT OVAGTC CAAG ACAOTTCrOT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTO ACT0T6ATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTUT A Tl' A C GGAAAGAGGC AOCCATGAGO AACT6A3GAA TTTAAATGCT GACTATGCTA 2460 

80 CCATTTTTAA TAACCTGTTO CTGGGAfiAGA CACCX30CAOT TGAOATCAAT TCAAAAAAGG 2520 

AAACCAGTGO TTCACAGAAO AAOTCACAAO ACAAGOQTCC TAAAACAGQA TCAGTAAAQA 2580 

AGGAAAAAGC AGTAAAGCCA QAGGAAGGGC AGCTTGTGCA GCTGGAAGAO AAAGG6CAGG 2640 

GTTCAOTCCC CTGGTCAtyrA TATGGTGTCT ACATCCAQOC TGCTG GGGG C CCCT TGGC AT 2700 

TCCTGGTTAT TATOOOCCTT TTCATGCTQA ATGTAGQCA6 CACOOCCTTC AGCACCTGOT 2760 
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OGTTQAGTTA CTGGATCAAG CAAOGAAGOG OSAACACCAC TGTQACTOGA GGGAAOGAGA 2820 
CCTOGGTGAG TGACAGCATC AAGOACAATC CTCATATGCA GTACTATGCC AGCAT CTAOQ 2680 
CCCrCTCCAT GGCACTCATQ CTCSATCCTOA AAQCCATTCG AGGAGTTGTC TTTOTCAAGG 2940 
OCAOGCT60G AGCTTCCTOC OGGCTGOOG AOCSAGCTTTT CCX3AAG GATC CTTCGAAGCC 3000 
5 CTATCAAGTT TTTTGACAG8 ACCX30CAC3U3 GGAOQATTCT CAACAGGTTT TC CAAA QACA 3060 
TGQATGAAGT TGACGTG0C3G CPQCaSTTCC AGGCOGAGAT GTTCATCCAO AACGTTATCC 3120 
TGGTGrrCTT CTGTGTGGGA ATQATOSCAO GAQTCTTCCC GTGGTTCCTT GTGGCAGTGQ 3180 • 
GGCCCCTTOT CATCCTCTTr TCAGTCCTGC ACATTOTCTC CAGGGTCCTG ATTCS^QAGC 3240 
TGAAGOCTCT GGACAATATC ACGCftGTCAC CrTTCCTCTC CCAC ATCftOG TCCAGCATAC 3300 
10 AGGGCCTTSC CACCATCCAC GCCTACAATA AAG6GCACGA GTTTCTGCAC AGATACCAGO 3360 
AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGOGATQ OGGTG GCTGG 3420 
CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CAOSGGGCTQ ATGATGGTTC 3480 
TTATGCAOQG QCAGATTOC CCA6CCTATG OGGGTCTOGC CATCTCTTAT QCTGTCCAGT 3540 
TAAOGOOGCT GTTCCAQTTT AOGGTCABAC TSOCATCTGA GACAOAAQCT OGATTCACCT 3600 
15 06GTGGAGAG GATCAATCAC TACATTAA6A CTCTGTCCTT GGAAGCAOCT GCCAGAATTA 3660 
AGAACAAQGC TCCCTCCCCT GACTGGCCCC AGQAGGGAGA GGTOACCTTT GAGAACQCAG 3720 
AGATGAGGTA COGAGAAAAC CTCCCTCTTQ TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 
CTAAAQAQAA 6ATTQGCATT GTGGGG06GA CAGGATCAGO GAAQTCCTOG CTGGGGATGG 3640 
_ ^ CCCTCTTGOG TCTGQTG6AG TTATCXGQAS OCTOCATCAA 6ATT6ATGGA 6TGA0AATCA 3900 
20 OTOATATTOO CCTTGCXX3AC CTCOGAAGCA AACTCTCTAT CATTCCTCAA QAOCOGOTOC 3960 
TSTTCAGTGG CACTGTCAQA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 
TTTGGGATGC CCTGOACJAaG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTQAAAC 4080 
TTQAATCTGA AGTGATGGAQ AATQGGGATA ACTTCTCAQT GGGGQAACGG CAGCTCTTGT 4140 
^ _ GCATAGCTAG AGCCCTQCtC OaOCACTGTA AOATTCTGAT TTTAGATGAA GCCAC AGCTO 4200 
25 CCATGGACAC AOAOACAGAC TT A TTGATTC AAGAGACCAT COQAOAAGCA TTT6CA0ACT 4260 
GTACCAT6CT GACCATT6CC CATOQCCTQC ACAOGOTTCT AGOCTCX5C5AT AGGATTATGG 4320 
TGCTOGCCCA QGQACAGGTQ GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 
GTTCCX3GATT CTAT6CCATG mXiCl'GClta CAGAOAACAA GGTCGCTGTC AAGGGCTGAC 4440 
TCCTOOCTGT TOAOOAAGTC TCTTTTCTTT AQAGCATTGC CATTCCCTQC CTGGGOOOGO 4500 
30 CCCCTCATC6 CGTCCTCCTA CC6AAA0CTT GCCTTTCTOO ATTTTATCTT TCGCACAGCA 4560 
GTTCOGGATT GQCTTGTGTG TTTCACTTTT AGGGAGAQTC ATATTTTGAT TATTGTATTT 4620 
ATTCCATATT CATGTAAACA AAATTTAQTT TTTGTTCTTA ATTQCACTCT AAAAGGTTCA 4680 
GGGAACCXSTT ATTATAATTG TATCAOAGGC CTATAAT6AA GCTTTATACG TGTAGCTATA 4740 
TCTATATATA ATTCTOTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT qTTTAT 'fTTA 4800 
35 TATTAAAATA A6CACT0TGC TAATAAOVQT GCATATTCXTT TTCTATCATT TTTGTACAGT 4660 
TTGCTGTACT AGAGATCTGG TTTTQCTATT AGACTGTAGG AA6AGTA6CA TTTCATTCTT 4920 
CTCTAGCTGQ TGGTTTCRCO GTGCCAGGTT TTCTGGGTGT CXaAAGGAAG ACGTGTGGCA 4980 
ATAGTGGGCC CTCOGACAOC CCCCTCTGCC 6CCT0CCCAC AGCCX3CTGCA GGGGTGGCTG 5040 
OAQAOOGQIO QGOOQCTGGA GACCSVTGCAO AG06G0GTGA GTTCTGAGSG CTCCT6CCTT 5100 
40 CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCA6C GG6GCGAA6C CCAGGCOCCT 5160 
TTTCACTCCC TCXaVTCAAQA ATGGGGATCA CAGAGACATT CCTCCGAOCC GGGGAGTTTC 5220 
TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACA6AQAG 5280 
TCCCACTGCC TCAGGTTCCT ATCGCTQGCC ACTGCACAGA GCTCTCCAGC TC CAAGA CCT 5340 
G'l " iX3GTTCC A AGCOCTGGAO CGAACTGCTG CTTTTTGAGQ TGQCACTTTT T CATTTGCCT 3400 
45 ATTCCCACAC CTCCACAGTT CAOTGQCS^ OCTCAGOATT TOOTGGGTCT GTTTTCCTTT 5460 
CTCACOGCAG TCGTOOCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTO CAACTTTAAG S520 
CAGCTCTTGC TAATCAGTGT CTCACACTGO CGTAOAAGTT TTTGTACTGT AAAGAGACCT 5580 
ACCTCAGGTT GCTGGTTOCT GTGTGQTTTG GTGT6TTCCC GCAAACCCCC TTTQT GCTOT 5640 
^ GGGGCTG6TA GCTCAG G T G Q GO G TG G TCAC TGCTOTCATC A0TTGAATG6 TCAOOGTTGC 5700 
50 ATGTOGTGAC CAACTAGACA TTCTOTCGOC TTAGCATGTT T6CTQAAGAC CTTGTGGAAS 5760 
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 384 Protein eequence 
55 Protein Accession #: ]IF_005679.1 

1 11 21 31 41 51 

MXDIDIGKEy IIPSPOyRSV KBRTSTSGIB B0REDSXFRR TRPLEGQOAL ETAAKABGLS 60 ' 

60 LDASMHSQLR ILDEEHPXGK YHHOLSAXiKP IRTTSKHQKP VDNAGLP8CM TPSHL88LAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCSUUiB RLWQBBLNEV GPDAA8LRRV VWIFCRTRLI 180 

LSIVCUIITQ LAGPSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTBI 'VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNZKB KSIiGELIMZC SSIDGQRMPBA AAVGSLLAGG 300 

PWAIIGMIY MVIILGPTGP LGSAVPI^iPY PAMMPA8RLT AYPRRKCVAA TDBRVQKMNB 360 

65 VLTYIKPIKM YAWVKAFSQS VQKIREBERR ILBKAGYPQG XTVGVAPIW VJASWTPSV 420 

HMTLGFDLTA AQAFTWTVP NSMTPALKVT PFSVKSLSEA SVAVDRFKSL FLMBBVHMIK 480 

NKFASPHIKI EMKNATIiAWD 8SHS8IQN8P KLTPKMKKDK RASR6KKEKV RQIiQRTEHQA 540 

VLAEQKGBLL XiDSDERPSPB EEEGKHZHLO HLRIiQRTLBS IDLEIQBGKL VOIOGSVGSO 600 

KTSLZSAZLG QKTLLBGSIA I8GTFAyVAQ QAHILKATLR SHXtiFQKByD EERYM8VU9S 660 

70 CX3JIPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARAMT SDRS1YIU5D PLSAUJAHVQ 720 

NHIPNSAIRK HIiKSKTVUV THQLQYLVDC DEV1FMKB6C ITERGTHEEL MNLNGDYATI 780 

FNNLIiLGETP PVEINSKKET 8GSQKKSQDK GPKTQSVKKE KAVKPBBGQL VQLBEKGQGS 840 

VPWSVYOVYI QAAGGPLAFl* VIMALFMLNV OSTAFSTHWL SyWIKQGSCW TTVTRGNBTS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKOT UtASSRIiHDB LFRRXLRSPK 960 

75 KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQMVILV PPCVGMIAGV PPWFLVAVGP 1020 

LVILPSVLHI VSRVLIRBLK RLDNITQSPP LSHITSSIQO LATIHAWKO QEPIiHRYQEL 1080 

LDDNQAPPFL FTCAMRWLAV RLDLISIALI TTTQLMIVLM HGQIPPAYAO LAISYAVQLT 1140 

QLFQFTVRIiA SBTEARFTSV ERUIHYZKTL SliEAPARIKN KAPSFDNPQE GEVTPBMAEM 1200 

RYRQILPLVL KKV8FTIXPK BKIGZVGRTQ S0KSSL6MAL FR1.VBLSG0C IKIDGVRZSD 1260 

80 XGLADLRSXL SIZPQBFVIiF 8GTVRSKU>P FMQYTBDQZW DALERTHKRB CIAQIiPUaiB 1320 

SEVMEHCaSNP SVGERQItLCI ARAZajRHCKX I«ZXJ>BATAAM DTBTDIiXilQB TIRBAFADCT 1380 
MLTZAHRLHT VLQSDRZNVL AQaQWEPDT PSVZiLSNDSS RPYAMPAAAE NRVAVKO 
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Seq ID NO) 385 DMA aequence 

nucleic Acid Accession #t NM_001327.1 

Coding sequence < 8 9 . . 6 3 1 

1 11 21 31 41 51 

I I 1 i I I 

AGCAGGGGGC GCTGTGTGTA COSAGAATAC GAGAATACCT OGTGGGCXXrr GACCTT CTCT 60 

CTGAGAGCOQ GGCAGAGGCT COGGAGCCAT GCAGGCOSAA GGCCGGQQCA CAGGGGGTTC 120 

GACGGGOGAT GCTGATGGCC CAGGAGGCCX: TGOCATTCCT GATGGCCCAO GGGGCAATGC ISO 

TGGCGGCCCA GGAGAGGOGG GTGCCAOQQO CGGCAGAGGT CCCCX3GQGCG CaOOOOCAaC 240 

AA8Q60CT0Q GGGCXGGGAG GA6GCGCCCC GCGGGGTCOO CATGGOGGCG CGGCTTCAGG 300 

GCTGAATGGA TGCTGOVGAT GOGGGGCCAG GGGOCCGGAG AGCOSCCTGC TTGAGTTCTA 360 

CCTOGCCATG CCTTTCX5C0A CACCCATGGA AGCAGAGCTO GCCCXSCAGOA GCCTOTCCCA 420 

GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCT6AA0 GAGTTCACTG TGTCC06CAA 480 

CATACT6ACT ATCC6ACTOA CTGCTGCAGA CCACCGCOU^ CTGCAGCTCT CCAT CAGCTC 540 

CIOTCTCCAO CAG C T T TCCC TQTTGATGTG GATCAOGCAQ TGCTTTCTGC CCGTGTTTrT 600 

G6CTCAGCGT CCCTCAOaOC AOACGCTGCTA AGCCCAGCCT GG06CCCCTT CCTAGGTCAT 660 

GCCTCCTCGC CTAlGGOAATG GTOCCAGCAC GAGTGGCCAO TTCATTCTGG GGGCCTGATT 720 
0TTTGTCX3CT OGAGGAGQAC GQCTTACATO rfT O n T C T G TAGAAAATAA AACTORGCTA 

Seq ID NOt 366 Protein sequence 
Protein Accession #: NP_001318.1 

1 11 21 31 41 SI 

MQA E Q R GTQG 8TCn3AD6FGG POXPDGPGGN AGGPGEAGAT G6RGPRGAGA ARASGPGGQA 60 
PRQPHG6AAS GIAIGOCROQA RGPESRLLEF YLAMPFATPM EABLARRSLA QDAPPLFVPQ 120 
VHiKEPTVSG NILTIRLTAA DHRQLQLSI9 SCLQQIiSLLM MITQCPLPVP LAQPPSGQRR 

Seq ID NO; 387 DMA sequence 

nucleic Acid Accession #t Eos sequence 

Ooding sequence! 52.. 459 

1 11 21 31 41 ' 51 

I I I I I I 

CCTOSTGGGC CCTOACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAOC C3^TGCAGGCC 60 

GAAGGCCA06 GCACAGGGGG TTOGAOGGGC GATGCTGATG GCCCAGQAGG CXXTGGCATT 120 

CCTGATQGCC CAGGGQQCAA TOCTGGOGGC CCAGGAGAGQ CGGGTGCXS^C GGQC0GCA6A 180 

GGTCCCCX3GQ GCOCAGGGGC AGCAAGGGCC TCGGGGCCQh GAGGAGGCGC CCCGCGGGGT 240 

CCGCATG6CG GTGCOGCTTC TGCOCAGGAT GGAAGGTGCX: CCTGOGGGGC CAGGAGGCOQ 300 

6AGA0006CC TOCTTCStfITT COGACTGACT GCT6CAGACC ACCGCCAACT GCAGCTCTCC 360 

ATCAGCTCCT GTCTCCAOCA 6CTTTCCCT0 rrOATGTGGA TCACX3CAGTQ CTTTCTGCCC 420 

GTGTTTTTGG CTCAGGCTCC CTC!AGGGCAG AGQCQCTAAG COCAGCCTGQ OGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAQC3VCGA G TGGCC RGTT CATTGTGGGG 540 

6CCTGATTGT TTGTOGCTGO A6GAG0A0GG CTTACATGTT TUTITCTGTA GAAAATAAAG 600 
CTGAGCTA 

Seq ID NO: 388 Protein sequence 
Protein Accession #t Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MQAEGOGTGG STGDADGPGG POIPDGPGGN AaSPOBAGAT OQROPRGAOA ARASGPRGGA 60 
PRGPHGGAAS AQDGRCPOGA RRPDSRUX2F RLTAADBRQti QIi8I68CIK3Q LSLLMWITQC 120 
PIiPVFLAQAP 8GQKR 

Seq ID NOi 389 DMA sequence 
Nucleic Acid Accession fti HM_005562.1 
Ooding sequence i 90.. 3671 

I 11 21 31 41 51 

11)111 

ACAQCG6A0C GCAGAGTGAG AACCACCAAC CGAGGCGCCXS GGCAGCX3ACC CCTGCftGOGQ 60 

AGACAGAGAC TQAGCGGCCC GGCACOSCCav TGCCTGCGCT CTGGCTQGGC TGCTGCCTCT 120 

GCTTCTCX3CT CCTCCTGCCC GCAGCCOGGG CCACCTCCAG GAGGGAAGTC TQTGATTGCA 180 

ATGGGAAOTC CAfSGCAOTGT ATCXTTGATC GQGAACTTCA CRGACAAACT GQTAATGGAT 240 

TCOQCTQCCT CAACTGCAAT QACAACACTO ATOOCATTCA CTOOGAGAAG TGCAAGAATG 300 

GCTTTTACOG GCACAQAOAA AGGQACOGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

CTCTTAGTGC TCGATQTGAC AACTCTGQAC GGTGCftGCTO TAAACCAGGT GTGACAGGAQ 420 

CCAGATGCGA COQATGTCTG CCAGGCrTCC ACATOCTCAC GGATGCGGGQ TGCACCCAAQ 480 

ACCAOftOACT OCTAGftCTCC AAGTGT6ACT GTOACCCRQC TGOCATOGCA OGGCCCTGTG 540 

ACGCOGGCOG CTGTGTCTQC AAGCCAGCTQ TTACTGGAOA AOSCTG TGAT AGGTGTOGAT 600 

CAGGTTACTA TAATCTOQAT GGOGGGAACC CTGAGGGCTO TAOCCAOTGT TTCTGCTATQ 660 

QOCATTCAGC CAGCTGCOGC AGCTCTGCAG AATACAGTQT CCATAAGATC ACCTCTACXTT 720 

TTCATCAAGA TGTTGATGGC TGQAAGGCTO TCCAAOGAAA TGGGTCTCCT 6CAAAGCTCC 780 

AATQOTCACA GGQCCATCAA 6AT0TGTTTA GCTCAGCOCA AOQACTAGAC CCTGTCTATT 840 

TTGTG6CTCC T60CAAATTT CTTOOGAATC AACROQTOftO CTATGGGCAA AGOCTGTCCT 900 

TTGACTACCX3 TGTGOACAGA GGAOGCAGAC ACCCATCTGC CCATGATGTG ATTCTCGAAG 960 

GTGCIXMTCT AOGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCRAGAC TTACACATTC AQOTTAAATQ A6CATCCAAG C3UITAATTGG AGCCCCCAGC 1080 

TOAGrrTACTT TGAGTATOQA AGGTTACTGC GGAATCTCAC AGCXXnrCCXSC ATCXMAGCTA 1140 

CATATOGAGA ATACAGTACT GOGTACATTO ACAATQTQAC CCTGATTTCA GCCCGCXXrTG 1200 

TCTCTOQAGC COCAOCACCC TG6GTT0AAC AOTOTATATG TCCrGTTGGG TAC AAGGGGC 1260 

AATTCTOCCA GOATTOTGCT TCXGOCTACA W3AGAGATTC AGCQASACTG GGOCCTTTTO 1320 
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GCACCTC5TAT TCCTTGTAAC TGTCRAGGGQ OAOOGOCCTG TGATCCAOAC ACAGGA GATT 1380 
GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTQA CTGCCC AATT GCSTTTCTACA 1440 
A0GAT0C6CA 06ACCCC06C AGCTQCAAGC CATGTCCXnXS TCAT AACGGG TTCAGCTGCT 1500 
CAGTGATGCC GGAGAGGGAG GAOGTQQTGT GCAATAACTO CUCTCCOGG Q GTGAOCGGTG 1560 
5 CC O GCTGTGA GCTCTQTGCT QATGOCTACT TTGGGGACGC CTTTGGTGAA OOGGCCCAO 1620 
TGAGGCCTTG TCACCOCTGT CAATGCAACA ACAATGTGGA CCCCAGTOCC TCTOGOAATT 16B0 
GTGACCGGCT GACAGGCAGQ TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTA CTGC G 1740 
AOCAGTGCAA AGCAGCCTAC TTOGGGQACC CATTGGCTCC CAACOCAGCA GACAAGTGTC 1800 
GAGCTTGCAA CTGTAACCCC ATGGOCTCAG AOCCTGXAOa ATOTOSAAGT GA TGGC AOCT 1860 
10 GTCTTTGCAA GCCAG6ATTT GGT GG CCCCA ACTOTOAGCA TOQAG CATTC A6CT0TOCAG 1920 
CTTGCTATAA TCAAGTGAAO ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 
AGGCCCT6AT TTCAAAGGCT CAGGGTGGTG ATGGAGTAQT ACCTGATACA GAGCTGQAAG 2040 
GCAGGATGCA GCAGGCT6AQ CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAO 2100 
AA6QTGCTAG CAGATOCCTT GGTCTCCAGT TGQCCAAGOT GAOQAOOCAA ^QAA CAGCT 2160 
15 ACXaVGAGCOQ CCTG6AT6AC CTGAAOATQA CTOTQOAAAG AOTTOGGGCT CTGG6AAGTC 2220 
AGTACCAGAA CCGAGTTCGQ GATACTCACA GQCTCATCAC TCAQATGCAG CTGAGCCTGG 2280 
CAGAAAGTGA AGCTTCCTTQ GGAAACACTA ACATTCCTQC CTCAGACCAC TA QgTGGG GC 2340 
CAAATGGCTT TAAAAGTCTQ GCTCAGQAG6 CCACAAGATT AGCAGAAAGC CA06TTQAOT 2400 
CAGCCAGTAA CATGGAGCAA CTGACAAOGG AAACTQAaOA CTATTOCAAA CAAGCGCTCT 2460 
20 CACTGGTGOO CAAGGCXXTO CATGAAGGAO TOGGAAGOGG AAGCGGTAGC COGQA OGGTO 2520 
CTCTGGTQCA A6GGCTTGTQ GAAAAATTGG AGAAAACCAA OTCCCTOGCC C3W3CAGTTGA 2580 
CAAGGGAGGC CACTCAAQCG GAAATTGAAG CAOATAGGTC TTATCAGCAC AGTCTCCX^CC 2640 
TCCTGGATTC AGTQTCTCXK3 CTTCAGGGAG TCAOTGATCA GTCCTTTCAG OTGGAAQAAG 2700 
CAAAGAGGAT CAAACAAAAA GOQQATTCAC TCTCAAOQCT GOTAAGCAOO CATATGOATG 2760 
25 AGTTCAAOOG TACACAAAAG AATCTG66AA ACTGGAAAGA AGAAOOlCAO CAOCTCTTAC 2820 
AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAOCPOCT TTCCCGTGCC AATCTTGCTA 2880 
AAAGCAGAQC ACAAGAAGCA CTGAGTATGG GCAATQCCAC TTTTTATGAA GTTGAGAGCA 2940 
TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGOTGGACAA CAGAAAAOCA GAAGCTGAAG 3000 
AAGCCATGAA GAGACTCTCC TACATCAGOC AOAAGGTTTC A6ATGCCAGT OACAAGACCC 3060 
30 AQCAAGCAOA AAGAGCCCTG GGQAGCGCTO CTGCTGATGC ACAOAGGGCA AAGAA TG6G G 3120 
CGGGGGAGGC CCTGGAAATC TCCAGTGAQA TTGAACAGQA GATTGGGAGT CTGAACTTGO 3180 
AAGCCAATOT GACAQCAGAT GGAGCCTTGO CCATGQAAAA GGGACTGGCC TCTCTGAAGA 3240 
GTGAGATGAO GGAAGTGQAA GGAQAGCTGG AAAQGAAQQA GCTGGAGTTT GACAC6AATA 3300 
TGGAT6CAGT AC3VSATGGTQ ATTACAGAAG CCCAGAAGOT TGATAC3CA6A 6CCAAGAAG0 3360 
35 CTQGGGTTAC AATCCAAGAC ACACTCAACA CATTA6A0QO CCTCCTGC3VT CIGATGGACC 3420 
AGCCTCTCAG TGTAGATGAA GAGGQGCTGO TCTTACTGGA GCAQAAGCTT TCCCGAGCCA 3480 
AOACCCAOAT CAACAOOCAA CTGOSGOCCA TGATGTCAGA GCTGOAAGAO AGGGCAOGTC 3540 
AGCAGAGGGG CCACCTCCAT TTGCTQGAQA CAAGCATAGA TGGGATTCTG GCTGATCTGA 3600 
AGAACTTQQA GAACATTAQG GACAACCTGC CCCCAGQCTG CTACAATACC CAGGCTCTTO 3660 
40 AGC3UICAGTG AAGCTGCCAT AAATATTTCT CAACTGAGOT TCTTGGGATA CAGATCTCAQ 3720 
GQCTOGGGAO CCATGTCATO TQAGTGOGTG GGATGGGGAC ATTTQAACAT GTT TAATGG G 3780 
TKVaCrCMSa TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGQTG GTTCTCn'AT 3840 
TGCACCATAC TCCTTGCTTC CTQATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 
ATOATCAAGO ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAO 3960 
45 ATGTTTGCCT CATAATAGTC GTAAGTGQAQ TCCTGQAATT TGGACAAGTO CTGTTGGGAT 4020 
ATAOTCAACr TATTCTTTQA GTAA1GTGAC TAAAGOAAAA AACTTTGACT TTGCCCAGGC 4080 
AXGAAATTCT TCCTAATGTC AQAACWSaGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 
ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGA6TTCCT CCTAC TTACA 4200 
ACXXaVGGGTQ TGAACATGTT CTCCATTTTC AAGCTGQAAG AAGTGAQCAG TOTTGGAGTG 4260 
50 AGGACCTOTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 
GTTCTGQACC TGGGCATGAC ATCCTTTCTT TTAATGATOC CATGGCAACT TAGAGATTGC 4380 
ATTTTTATTA AAGCATTTCC TACCA6CAAA GCAAATGTTG GGAAAOTATT TACTTTTTCG 4440 
0TTTCAAA6T GATAOAAAAG TGTGGCTTGO GCATTGAAAG AGGTAAAATT CTCTAOATTT 4500 
ATTAGTCXTTA ATTCAATCCT ACTTTTOGAA CACCARAAAT GATOOSCATC AATOTATTTT 4560 
55 ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 
CACACTTGAa CTCGGTCACA TOCATCOCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 
TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGrGTa CCAGGGGCTG 4740 
GTGGGACAGT GGT6ACATAG TCTCTGCCCT CATAGAOTTQ ATTGTCTAGT GAGGAAQACA 4800 
AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTOTT TGTCACAAGT GGTGTTTATT 4860 
60 GCAATAACCO CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTOC AAGACCCTCC 4920 
aVTGGGGQCA CTTGAOTTTT GGCAAOGCTO ACAOAGCTCT GGGTTGTGCA CATTTCTTTG 4980 
C3VTT0CAGCT GTCACTCTCT GCCTTTCTAC AACTGATTGC AACAGACTOT TGAGTTATOA 5040 
TAACACCAGT GGGAATTGCT GQAGGAACXa GAGQCACTTC CACCTTGGCT GGGAAGACTA 5100 
TGGTGCTGCC TT0CTTCT6T ATTTCCTTOQ ATTTTCCTGA AAGTGTTTTT AAATAAAOAA 5160 
05 CAATTGTTAG AT6CC 

Seq ID MO: 390 Protein aeqnence 
Protein Accession #s NP_005553.1 

70 1 11 21 31 41 51 

MPAIMLGCCL CPSIiLLPAAR AT8RRBVCDC KGKSRQCIFD REI£RQTGIIG FRCUTCITDNT 60 

XX3IHCEKCKN GFYHHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARODRCLPGF 120 

HMLTDAOCTQ DQRLLDSKCD CDPAGIAGPC DAQRCVCKPA VTQERCDRCR SGYYNUX3QN 180 

75 PBGCTQCPCY GHSASCaSSA BYSVHKITST PKQDVDGWKA VQRKGSPAKL QWSQRHQDVP 240 

8SAQRU>PVy FVAPAKPIX3N QQVSYGQSLS FDYRVDRGGR HPSAHDVZLB GAOUIITAPL 300 

MPU3ICTLPO0 LTKTYTPRLH EHP8NNWSPQ IfSYFEYRRLL RMLTALRXRA TYGBYSTGYI 360 

DNVTLISARP VSGAPAPWVE QCICPVCYKG QFOQDCASGY KRDSARIX3PP GTCI PCNOQ G 420 

6GACDPDTGD CYSGDQJPDI BCADCPIGPY NDPHDPRSCK PCPOINGPSC SVMPETEBW 480 

80 CNNCPPGVTG ARCBLCADOV PGDPPQBHQP VRPOQPOQCM NNVDPSAS6N OJRLTGRCIiK 540 

CIRiaTAGIYC DQCKAOYPGD PLAPNPADKC RACNCNPHBS BPVOCRSDGT CVCKPOFGGP 600 

HCEHGAPSCP ACVNQVKIQH DQFKQQU2RM BALISKAQGO D6WFDTELB GRMQOAEQAL 660 

QDZUtDAQIS BGASRSXiGLQ LAKVRSQQTS YQSRLDDLXM TVBRVRALGS QYm(VR]>TH 720 

RLITQKQI*SL AESEASLGNT NIPASDHYVO PHGFKSLAQB ATRLAB8WVB SASNMBQLTR 780 
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ETEDYSKQAL SLVRKALHEG VOSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATQABIE 840 

ADRSYQHSLR LLDSVSRLQQ VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EPKRTQKNl/J 900 

NHKBBAQQLL QNGKSGREKS DQLLSRANLA KSRAQBALSM GMATFYEVBS ZUQILRBFDL 960 
QVDNRKABAB BAMKRLSYZS QKV8DASDXT QQABRALGSA AADAQRAXNO AOEALBISSB 1020 
S ZEQBiaSLNL BANVTiOXSAL AMEK6IASLK SEMRBVGQBL ERKBLBPDTN MDAVQMVITE 1080 
AQKVDTRAIQI AGVTZQDTXM TLDGLIilIUCD QPLSVDBBQL VLLEQKLSRA KTQZMSQLRP 1140 
MMSBLBBRAR QQRGHLHLLB TSZDGILADV KNLBHXRIINL PPGOnrCQAL GQQ 

Seq ID NOs 391 PKA sequence 
10 Nucleic Acid Accession ft: AP101051.1 
Coding sequence: 221.856 

1 11 21 31 41 51 

I 1 I I I I 

iJ GAGCAACCTC AGCTTCTAGT ATCCAGACTC CA6C6C0GCC 00G66G6CG6 AOCCCAACCC 60 

OGACCCAQAG CTTCTCCAGC GGCX3GCGCAG OGAGCAGGGC TCCCOGCCTT AACTTCCTCC 120 

GGGGG6CCCA GCCACCTTOQ GGACTCOQGG TTGCCCACCT 6CAAACTCTC CX3CCTTCTGC 180 

ACCTGCCACC CCTQAGCCAO CGCX5GGCGCC CGAGCGAGTC ATGGCCAACG OOGQQC TOCA 240 

GCTGTTGGGC TTCATTCTCQ CCTTCCTGGG ATGQATOGGC GCCAT06TCA GGACTGOCCT 300 

20 GCCCCAGTOO AGGATTTACT CCTATGCCGQ CGACAACATC GTGACG6CCC AG6CCATGTA 360 

G6A06G6CTG TG6ATGTCCT GCGTGTCX3CA GAGCACOGGQ CAGATCCAGT GCAAAGTCTT 420 

T6ACTCCTT0 CTOAATCTGA OCAGCACATT GCAAGCAACC OGTGCCTTGA TGGTGGTTGQ 480 

OVTCCTCCTG GGAOTGATAG CAATCTTTGT GGCCACCGTT GGCATGAAGT GTATGAAGT6 540 

CTTGGAAGAC GATGAGGTGC AQAAQATGAG GATGGCTGTC ATTGGGGGTG OGATATTTCT 600 

25 TCTT6CRGGT CTGGCTATTT TAGTTGCCAC AGCATGGTAT GGCAATAGAA TC30TTCAAGA 660 

ATTCTATGAC OCTATGAGCC CAGTCAATGC CAGGTACGAA TTTGGTCAGG CTCTCTTCAC 720 

TGGCTGGGCT OCTGCTTCTC TCTGCCTTCT GGGAGGTGCC CTACTTTGCT GTTCCTQTCC 780 

CCGAAAAACA ACCTCTTACC CAACACCAAG GCCCTATCCA AAACCTGCAC CTTCCAGCGG 840 

GAAAGACTAC GTGTCACACA GAGGCAAAAG GAGAAAATCA TGTTGAAACA AAOCOAAAAT 900 

30 GGACATTGAG ATACTATCAT TAACATTAGG ACCTTAOAAT TTTGGGIATP GTAATCTOAA 960 

GTATGOTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT A CTCA GTGCT 1020 

AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGGAAQ ATTTTACX»T 1080 

TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GGGAAGGGGT GCTCXHTAAA 1140 

TATATATAGA TATGTATATA TACATGTTTT TCTATTAAAA ATAGACAGTA AAATACTATT 1200 

35 CTCATTATGT TGATACTAQC ATACTTAAAA TATCTCTAAA ATAGGTAAAT GTATTTAATT 1260 

CCATATTQAT GAAGATQTTT ATTGGTATAT TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320 

CAOTCAAATA TCATTTACTC TTCTTCATTA GCTTTQGOTG CCT TTGCC AC AAGACCTAGC 1380 

CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCGT GCCCTTTTCA TATACTTATT 1440 

TTATTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT AAGCCCTTAT TTGTTTTGTG 1500 

40 TTTCATTGGT CTCTATCTCC TGAATCTAAC ACATTTCATA GCCTACATTT TAGTTTCTAA 1560 

AOCXAAGAAG AATTTATTAC AAATCAQAAC TTTGQAGGCA AATCTTTCTG CATGACC3«A 1620 

OTGATAAATT CCTGTTOACC TTOCCACACA ATCOCTQTAC TCTGACCCAT AGCACTCTTG 1680 

TTTQCTTTGA AAATATTTOT CCAATTGAGT AGCTGCATGC TGTTCtXCCA 6QTGTT0TAA 1740 

CACAACTTTA TTGATTGAAT TTTTAAQCTA CTTATTCATA GTTTTATATC CCCCT AAACT 1800 

45 ACCTTTTTGT TCCCCATTCC TTAATT6TAT TGTTTTCX:CA AGTGTAATTA TCATQCX3TTT 1860 

TATATCTTOC TAATAAGGTG TQGTCTGTTT GTCIGAACAA AOTGCTAGAC T TTCTG GAGT 1920 

GATAATCTGO TOACAAATAT TCTCTCTGTA OCTGTAAGCA AGTCACTTAA TCTTTCTACC 1980 

TCTTTTTTCT ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAQ AGGTAGTGTG 2040 

AATATTAATT AOTTTATATT ACTCTCATTC TTTGAACATO AACTATGCCT ATGTAQTGTC 2100 

50 TTTATTTGCT CAGCXGGCTG AGACACTGAA GAAOTCACTG AACAAAAC5CT ACACACGTAC 2160 

CTTGAT8TGA TTCACTGCCT TOCTCTCTCT ACCA0TCTAT TTCCACTGAA CAAAACCTAC 2220 

ACACATACCT TCATGTQQTT CAGTGCCTTC CTCTCTCTAC GAOTCXATTT CCACTOAACA 2280 

AAACCTACXX: AC3VTACCTTC ATGTGGCTCA GTGCXmCCT CTCTCTACC3V GTCTATTTCC 2340 

ATTCTTTCAG CTGTGTCTGA CATGTTTGTO CTCTGTTCCA TTTTAACAAC TOCTCTTACT 2400 

55 TTTCCaGTCr 6TACA0AATO CTATTTCACT TGAGCAAGAT GAT6TATGGA AAGGGT6TTG 2460 

GGACTOOTOT CTG6AGACCT GOATTTGAGT GTTGGTGCTA TCAATCAOOQ TCrGT Q'fTTQ 2520 

AGCAAGQCAT TTG6CTGCTG TAAflCTTATT GCTTGATCTG TAAGOOGTGO TTTOTAATTC 2580 

CTGATCTTCC CACCTC3VCAG TGATGTTGTG GGGATCCAGT GAGATA6AAT ACATGTAAGT 2640 

GTGQTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

60 CGTTTTGGTG TTGCTTTTCA AAIGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 

GOCTTAACXSi GTCTCTCAA6 TGAT6AOACA GTQAAGTAAA ATTGAOIQCA CTAAAOQAAT 2620 

AAGATTCTGA OGAAOTCTTA TCTTCT0CSW3 TQAGTATGOC CCAAT6CTTT CTGTGGCTAA 2880 

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGOTA AATCCAACAO CAAG6GA6AT 2940 

rrTTQAATCA TAATAACTCA TAAOGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000 

65 TGTTAGCTGQ CAGCTGAC3GC TGCTAGGATA GTTAGTTTGG AAATGQTACT T CATAAT AAA 3060 

CTACACAAGO AAA6TCAGGC AGOQIGTCTT ATOAGGAATT GGAOC TAATA AATTTTAGTG 3120 

TGCCrrOC A A ACCXGAGAAT ATATOCTTTT G6AAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 

ATACATAGAT CTTCATGATQ TGTQAGTGTA ATTOCATQTG GATATCAGTT ACCAAACATT 3240 

„^ ACAAAAAAAT TTTATGGCCC AAAATGACCA AOQAAATTGT TACAATAGAA TTTATCCAAT 3300 

70 TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACA6A CCAATA GACA TTTTGGGOTT 3360 

TTATAATGGO AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTOTTTT TTAATTTAAA 3420 
AAAAGGAAAA AAAAAAAAAA AAA 

Seq ID NO: 392 Protein aequence 
75 Protein Accession #t AAD16433.1 

1 11 21 31 41 51 

i I t ) I I 

MANAGLQLLO PILAPIiGWIQ AIVSTALPQW RIYSYAQDNI VTAQAMYBGL WMSCVSQSTG 60 

80 QIQCKVFDSIi LNLSSTLQAT RAU4VVGILL GVIAIFVATV CTCCCM KCLB ) DBVQXMRMAV 120 

ZGGAXFUAG LAZLVATAWY GNRZVQBFYD PNTFYNARYB P0QALFT6HA AASLCLZiGGA 180 

LUxrscpRicr tsyptprpyp kpapssgxdy v 
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Seq ID NOi 393 DMA sequence 

Nucleic Acid Accession #: MM_006180.1 

Coding sequence t 352.. 2820 ~ 

1 11 21 31 41 51 

I I I I I i 

CCCCGATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAO TCCOQQAOOC GQOOGGTGGO 60 

TOOCOOqoO C GCCGGGCCAT GCAGGGACGG CCGOOQOGCSA GCTCOSAGCA GCGGTAGCGC 120 

CCCCC TOTAA A0CG6TT06C TATGC0666A OCACTQTQAA CCCT6G06CC TQGOQQAACA 180 

CTCTTOSCTC OGGACCAGCT CAGCCTCTGA TAA6CT6GAC T0G6CA06GC 06CAACAAGC 240 

ACOGAGGAGT TAAQAGAGCC 6CAAGCXK»0 QGAAG6CCTC CCOGCAOGGG TGGGGQAAAG 300 

CGGCOGGTGC AGCGOGGGGA CAGGCACTOO GGCTGGCaVCT 6GCTGCTAGG OATOTCGTCC 360 

TGGATAAGGT GGCATGGACC COCCATGGOQ CXSGCTCTGGG 6CTTCTGCTG GCTGGTTGTG 420 

6GCTTCTGGA G6GCO0CTTT COCCTOIOCC AOOTOCT O C A AATOCAOTOC CICTOGGATC 480 

TGGTGCAOOO ACCCTTCTCC TQGCATOQTG GCATTTOOQA GATTGGAGCX: TAACAGTGTA 540 

GATCCTGAGA ACATCACOQA AATTTTCATC GCAAACCAQA AAAGGTTAGA AATCATCAAC 600 

GAAOATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGQATTA 660 

AAATTTGT6G CTCATAAAGC ATTTCTGAAA AACA6CAACC T6CA6CACAT CAATTTTACC 720 

06AAACAAAC TGAOQAGTTT GTCTAGGAAA CATTT0C90TC ACCTT6ACTT GTCTGAACIG 780 

ATGCTOGTGG QCAATOCATT TACATGCTCC TGTQAGATTA TGTG6ATCAA GACTCTCCAA 840 

GA06CTAAAT CCAGTCCAGA CACTCAGQAT TTGTACTGCC TGAATGAAAG CAGOVAGAAT 900 

ATTCCCCTGG CAAACCTGCA QATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGOAGGA AQGAAAOTCT ATCACATTAT CXrTGTAGTGT GGCAGGTGAT 1020 

GGGGTTCCTA ATATGTATTG GGATOTTOaT AAGCTGGTTT CCAAACATAT GAATQAAACA 1080 

A6CCACACAC AGQGCTCCTT AAG6ATAACT AACATTTGAT CCX3ATGACAG TGGGAAGCAO 1140 

ATCTCTTGTG TG6066AAAA TCTTGTA6GA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TOAAAGGCAA CCCCAAACCA 600CTTCAGT GGTTCTATAA 0GGGGCAA7A 1320 

TTGAATQAGT GCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTG6ATAA TCCCACTCAC ATGAACAATQ QGGACTACAC TCTAATAGCC 1440 

AAQAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGG(XTGGA 1500 

ATTGACGATQ GTQCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GOGAATGACA TCGGGGACAC CACGAACAGA AGTAATQAAA TCCCTTGCAC AGAGGTCACT 1620 

GATAAAACOQ GTCOGGAACA TCTCTCGGTC TATGCTGTGG TGGXtSATTGC GTCTGT6GTG 1680 

GGATTTTGCC TTTTGGTAAT GCTQTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATQAAAGGCC CAGCCTCOGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TOGGAAGGTO GCCCAGATGC TGrCAyTTATT 1860 

GQAATGACCA AOATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCX3ACATA ACATTGTTCT GAAAAGGQAG 1980 

CTAGGOGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAAOCT CTGTCCTQAG 2040 

CAGGACAAQA TCTTGGTGGC AGTGAAGACC CT6AAGGATG CCAOTGACAA T6CA0GCAA0 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT OSTCftAOTTC 2160 

TATQGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT QAAGCATGOG 2220 

GACCTCAACA AGTTCCTCAG QQCACACGGC OCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGACGCA GT06CAGATG CTGCATATAG CCCA6CAGAT OGCCQOSQGC 2340 

AT6GTCTACC TGGCGTCCCA GCACTTCOTG GACGGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAQA ACTTGCTGGT QAAAATCGGO GACTTTGGGA TOTCCOOGGA OGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTOGCTGGAT 6CCTCCAQAG 2520 

AGCATCATGT ACAGGAAATT CACGAOSGAA AQOGAOQTCT GGAQCCTGGG GGTOGTGTTG 2580 

TGGGAOATTT TCACCTATGQ CAAACAGCCC TGGTACCAGC TGTCAAACAA TGACGTGATA 2640 

GAGTGTATCA CTCA6GGCC0 AGTCCTGCAG OGACCOOQCA OGTGGOOCCA GQAGGT6TAT 2700 

GA6CTGATGC TGGGGT6CT0 GGA006AGAG COCXaiCATGA GGAAGMCAT C3UV00GCATC 2760 

CATA CCCTC C TTCAGAACTT GGCX3VAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGAOGGGCTG AOAGQATGAA 2880 

CATCTTTTAA CT0003CTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT COGACaA GCT CT08AG6GAA GCAOIQTGTA CTTCTTCATC CATAGACACA 3000 

QTA TTGACTT CTTTTTSGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTQ OTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTOOTC TTCCCTGCTT CAOQATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCXXACCA CAACTAACAA 3240 

TGCCTTGTT6 TATTCCT6CC TTTGATOTGO ATGAAAAAAA GGGAAAACAA ATAXTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAOATAT OQAOAOTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAQAA 3420 

AACTTGTGTT CAATCTQTGA AGCCTTTATC TATGGGA6AT TAAAACXAGA GAGAAAGAAG 3480 

ATTTATTATG AACOGCAATA TGGGAGGAAC AAAGACAACX: ACTGGQATCA GCTGGTG T CA 3540 

OTCCCTACTT AGGAAATACT CA6CAACTGT TAOCTGGGAA GAATOTATTC GGCAOCTT OC 3600 

CCTGAGGACC TTTCTGASGA GTAAAAAOAC TACTOGCCTC TGTG0CATG6 ATQATTCTTT 3660 
TCCCATCACC AGAAATGATA G0GTGCA6TA GAGA8CAAAG ATGGCTT 

Seq ID NOj 394 Protein sequence 
Protein Acceasion #i NP_00617i.i 

1 11 21 31 41 51 

1)11)1 

HSSHIRWHGP AMARLW6FCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLBP 60 

NSVDPENITB IFIANQKRIiB IINBDDVEAY VGLRHLTIVD SGLKFVAHXA FltKNSNLQHI 120 

NFTRNKLT6L 8RKBFRHU)L SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLHBS 180 

8KKIPLANLQ IPNGGLP8AN LAAPNLTVBB GK8ITLSCSV A6DPVPMMYW DVGNLVSKHM 240 

KBT8RTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT PLESPTSDHH 300 

NCIPFTVKGN PKPALQWFYM GA2LNE8KYI CTKIHVTNHT EYHGOiOLDN PTBMNNGDYT 360 

LZAKKBYGKO EKQXSABFMO HPOIDDOANP NYFDVZYEDY GTAANDIGDT TNRSNBIP8T * 420 

OVTDKTGREB LSVYAWVIA SWQPCLLVH LFLLRLAROS KFGMKOPASV ZSNSDDSASP 480 

LRHXSNGSNT PSSSGQGFDA Vll&VnaPV lENPOYFOrr N8QLKPDTFV QiaKRRIIIVL 540 

XRBLC^GAFG KVPLABCYNL CPBQDKILVA VXTUCDASDN ARKDFHREAB LliTNIXSBEHl 600 
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VXFYOVCS^ DPLIMVFBYM KHGDUOCFLR AHGPDAVUW GGKPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HPVHRDIATR NCLVGENU<V RI6DFGMSRD VYSTDYYRVO GHTMLPIRNM 720 

PPBSINYRKF TTESDVWSXiO WliWBIFTYO KQPHYQLSKN EVIECITQGR VDQRPRTCPQ 780 
EVYBLML6CH QREPHMRXNX RGZBTLLQNXi ARASFVYLDI LG 

Seq ID HO: 395 DNA sequence 
Nucleic Acid Accession §> AF410899 
Coding sequence! 483.. 2999 

1 11 31 31 41 51 

I t 1 I I 1 

GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT OSOGCTCTAC GOGCTCAGTC CCOGGCGQTA 60 

0CAGQA6CCT GGACCCA6GC GCXX»300GCXS GaCGTGAOGC GCCG6AGCCC GGCCTCGAGG 120 

TGCATAC060 ACCCCCATTC GC ATCTA ACA AGQAATCTGC GGCOCAQAGA OTCCOGGAGQ 180 

OCX3CXX3GTOG OTGCCCGGOO 0600GGQCCA TGCA008A00 GCOQCOQOOG AOCTCCGAOC 240 

AGOGGTAGCX3 CCCCCCTGTA AAGCGGTTC3Q CTATGCXX3GG ACCACTQTGA ACCCTGCOGC 300 

CTGCCGGAAC ACTCTTCGCT COGGACCAGC TCAGCCTCTG ATAAGCTGGA CTOSGCAOGC 360 

CC6CAAC3UU3 CAC06AG6A0 TTAAGA0A6C 06CAAG06CA GGGAAGGCCT CCCCGCACXX3 420 

GTOGGGQAAA GCGGCXX36TG CAGOGGGGGG ACA6GCACTC GGGCTGGCAC TGGCT6CIAG 460 

QOATOTOaTC CTOOATAAOG TGQCATOGAC CCGCCATGGC GOGGCTCTGO G GC TTCT G CT 540 

GGCTGGTTGT GGGCTTCTGG AGGGCOSCTT TCGCCTOTCC CAC3GTCCTGC AAATGCAGTG 600 
CCTCTOQOAT CTGCTGCAGC GACCCTTCTC CTGGCATOGT GGCATTTCCG AGATTGQAQC 660 
CTAAGAGTGT A0ATCCT6AG AACATCACOQ AAATTTTCAT OGCAAACCAG AAAAGGTTAG 720 
AAATCATC3UV GGAAGATGAT GTTGAAOCTT ATGTGGGACT GAGAAATCTO ACAATTQTGG 760 
ATTCTGGATT AAAATTTGTG 6CTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 640 
TCAATTTTAC CCQAAACAAA CTOACGAGTT TGTCTAGGAA ACATTTCOGT CACCTTGACT 900 
TGTCTQAACT GATCCTGOTQ GGCAATCCAT TTACATGCTC CTQTGACATT ATGTGGATCA 960 

A6ACTCTCCA AGAOGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020 

GCAOCAAGAA TATTCGCCTG GCAAACCTGC AQATACCX^AA T AXg l - GG T rf G CCATCTGCAA 1080 

ATCTGGCCGC ACCTAACCTC ACTGTG6AGG AAGQAAAGTC TATCAOVTTA TCCTGTAGTO 1140 

TGGCAGOTGA TCOGGTTCCT AATATOTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200 

TGAATGAAAC AAGCCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 

GTGGGAA6CA GATCTCTTGT GTGG06GAAA ATCTTGTAGG AGAAGATCAA 6ATTCTGTCA 1320 

AGCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTOOA ATCTCCAACC TCA6ACCA0C 1380 

ACT66T6CAT TCCATTCACT GT6AAAG6CA ACCXX3VAACC AGOGCTTCAG TG6TTCTATA 1440 

AOQQGOCAAT ATTGAATGAG TCCAAATACA TCTOTACTAA AATACATGTT ACCAATCACA 1500 

CGGAGTACCA OGGCTGCCTC CAGCTGGATA ATCCCACTCA CATGAAC3VAT GGGGACTACA 1560 

CTCTAATA6C CAAGAATGAG TATGGGAA63 ATGAOAAACA GATTTCTGCT CACrTCATGG 1620 

QCT8Q0CTG0 AATTGAOOAT GGTGCAAAOC CAAATTATCC TGATGTAATT TATGAAGATT 1680 

ATOQAACTGC AGGQAATGAC ATCG6GGACA GCAOQAACAO AAOTAATOAA ATCGCTTCCA 1740 

CAGAOOTCAC TOATAAAACC GGTOGGGAAC ATCTCTOGGT CTATGCTGTG GTGGTGATTG 1800 

CGTCTQTGGT GGQATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 

CCAAGTTTGa CATGAAAGAT TTCTCATGGT TTGGATTTGG OAAAGTAAAA TCAAGACAAG 1920 

GTOTTGGCCC AOCCTCCGTT ATCAGCAATO AT8ATGACTC TGCCAGCOCA CTCCATGACA 1980 

TCTCCAATGG GAGTAACACT CCATCTTCTT CX30AAGGTG0 OCCAOATGCT OTCATTATTG 2040 

GAATGACCAA QATCXXTOTC ATTQAAAATC CCCAGTACTT TGGCATCACC AACAGTCAQC 2100 

TCAAGCCAQA CACATTTGTT CAQCACATCA AQOGACATAA CATTOTTCTG AAAAGGGAGC 2160 

TAGGCGAAGG AGCXTTTTOGA AAAGTGTTCC TAGCTQAATG CTATAACCTC TGTCCTGAGC 2220 

AOGACAAGAT CTTGGTGGCA GTGAAGACCC TGAAOGATGC CAGTGACAAT GCA06CAAGG 2280 

ACTTCCACOQ TQAGGCCGAG CTOCTGACCA ACCTCCAGCA TGAOCACATC GTCAAGTTCT 2340 

ATGGOGTCTG OGTGGAGGGC GACXXXXTTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 

ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCQT GCTGATGGCT QAGGOCAACC 2460 

0GCCCA06GA ACTGAC6CAG TOGCAGATGC TGCATATAGC CCAGCAGATC GCOQOGGGCA 2520 

TGGTCTAOCT GGOGTCCOUS CACTT06T6C ACCGOSATTT GOOCACCAGG AACTGCCTOG 2580 

TOGGGGAGAA CTTQCTX30TG AAAATOQGGG ACTTTGGGAT GTCC06GGAC GT6TACAGC3V 2640 

CTQACTACTA CAGGGTOGGT GGCCACACAA TGCTCCCCAT TCGCTGQATQ CCTCCAGAGA 2700 

6CATCATGTA CAGGAAATTC AOGAOGGAAA GOaACGTCrO GAQCCTGGGO GTOGTOTTGT 2760 

GGGAGATTTT CACCTATGGC AAACAGCOCT GGTACCAGCT OTCAAACAAT QAGGTOATAG 2820 

AGTGTATCAC TCAGGGCGGA 6TCCTGCAGC GACCCCGCAC OTGGOCCCAO QAGGTGTAIG 2860 

AGCTGATGCT GGGGTGCTGG CAGCGA6AGC OCCACATGAG GAAGAACATC AAGGGCATCX: 2940 

ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTCOGGTCTA CCTGGACATT CTAGGCTAGQ 3000 

GCCCTTTTCC CXW3AC0GAT CCTTCCCAAC OTACTCCTCA GACX^GGCTGA GAGGATGAAC 3060 
ATCTTTTAAC T6C0GCTGGA GGCCACC3U^G CTGCTCTCCr TCACTCTGAC AGTATTAACA 3120 

TCAAAGACTC CX3AGAAGCTC TOQAGGGAAG CAGTGT6TAC TTCTTCATCC ATAGACACAG 3180 
TATTGACTTC TTTTTGQCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240 

Tr cr r r rriT aaattttctt 'rrr ci TCTTT tttttogtct tccctqcttc acxjattctta 3300 

CCCTTTCTTT TQAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA QACAAAGGCC 3360 
TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAOTT TGCCCACCAC AACTAACAAT 3420 
GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAO GGAAAACAAA TATTTCACTT 3480 
AAACTTTGTC ACTTCT6CTG TACAGATATC GAQAGTTTCT ATGGATTCAC TTCTATTTAT 3540 
TTATTATTAT TACTGTrCTT ArrGTTTTTG GATOQCTTAA GCCTOTGTAT AAAAAAGAAA 3600 
ACTT6T0TTC AATCTGTGAA GCCTTTATCT ATCGGAOATT AAAACCAGAO AGAAAGAAQA 3660 
TTTATTATGA ACGGCAATAT GGGAGGAACA AAQACAAGCA CT6GGATCA0 CIGGTGTCAG 3720 
TCCCTACTTA GGAAATACTC AGCAACTGTT AGCTGGGAAQ AATGTATTOQ GCACCTTCCC 3780 
CTGAGGACCT TTCIGAGGAG TAAAAAGACT ACXGGCCTCT GTOCGATGOA TGATTCTTTT 3840 
CCCATCACCA GAAATQATAO GGTQCAGTAO AGAGCAAAOA TGGCT7C0GT GAGACACAAG 3900 
AT0G06CATA GTGTGCTOSO ACACAGTTTT GTCTTOGTAQ GTTGTGATGA TAGCACTGGT 3960 
TTGTTTCTCA AGOOCTATOC ACA6AACCTT TGTCAACTTG AGTTOAAAAG AGGTGGATTC 4020 
ATGTCCAGAG CTCATTTOGO GGTCAGGTCG GAAAGCC 



8eq ID HOt 396 Protein aequence 
Protein Acceasion fii AAL6796S.1 
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21 
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MSSHIRWHGP AMARLWQPCW LWGFWRAAP ACPTSCKCSA SRIMCSDPSP GIVAPPRLBP GO 

NSVDPBNXTB IPIANQKRLE IINEDDVBAY VGLRNLTIVD SQLKFVAHKA FLKMSNX^HI 120 

HFTSMKLTSL SUEHFRKLDL SBLILVGNPP TCSCSIMHIK TLQBMCSSK) TQDLYCUIBS 180 

SKNIPIANU2 XPNCGLPSAN LAAFNLTVEB GKSITLSCSV AGOPVPHMYH OVGNLVSXHM 240 

NETSHTQGSL RITNISSDOS GKQISCVAEM LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPPTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT BYHGCLQU)N PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMQ WPGIDDGAMP NYPHVIYEDY GTAAKDZGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAVWIA SWGPCLLVM LFLUOiAimS KFGMKDF8WF GPGKVKSRQG 460 

VGPASVISZOD DDSASFUIHl SNGSNTPSSS EGGPDAVIIO MTKXFVIBHP QYFGITNSQL 540 

KPDTFVQHIK RHNIVLKREI* GBGAFGKVFL AECYNLCPEQ DKILVAVKTL KDASDNARKD 600 

FHREAELLTN UJHEHIVKPY GVCVBGDPLI MVPEYMKHGD LNKPLRAHGP DAVU4ABGNP 660 

PTBLTOSQML HIAQQIAAGM VYLASQHFVH RDIiATRNCLV GENLLVKIGD PGMSRDVYST 720 

DYYRVGGHTM LPIRWMPPES IMYRKPTTBS DVWSLGWLW EIFTYOXQPIf yQLSNNBVIE 780 
CITQGRVLQR PRTCPQBVYE LMLGCHQRBP HMRKNIKGIH TLLQNXiAXAS PVYLDXU3 

Seq ID NO: 397 PWA sequence 
ITucleic Acid Accession #: AB052906 
Coding sequence i 74 . . 814 

1 11 21 31 41 51 

I I I 1 I i 

AAAACCTT6A GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAOTCTCTC CTCCCTAGCX3 60 

CTCTGGGTCC TTAATGGCAG CAQCCGCCGC TACCAAGATC CTTCTGTGCC TCCOGCTTCT 120 

GCTCCTGCTO TC06GCTGGT CCOGGGCTGO GCGAGCCGAC CCTCACTCTC TTTGCTATGA 160 

C3VrCAC0QTC ATCCCTAAOT TCAGACCTGG ACCAOGGTGO TGTGCXSGTTC AAGGCCAGGT 240 

GQAXQAAAAO ACTTTTCITC ACTATGACTG TGGCAACAAQ ACAGTCACAC CTGTCAGTCC 300 

CCTGGGGAAG AAACTAAATG TCACAACGGC CTGQAAAGCA CAGAACCCAG TACTGAGAGA 360 

GGTGGT6GAC ATACTTACAG AGCAACTG06 TGACATTCAG CTOGAOAATT ACACACX!CAA 420 

GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTQAGCAG AAAGCTGAAG GACACAGCAG 480 

TGGATCTTGO CMGTTCAiOTT TGGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540 

AATGTGGACA AGGGTTCATC CTGGAGCCAa AAAGATQAAA GAAAAGTGGG AQ AATQ ACAA 600 

GGTTGTGGCX: ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660 

CTTCTTGATG GGCATGGACA GCACCXrTGGA GCCAAGTGCA GGAQCACCAC TCGCCATGTC 720 

CTCAGGCACA ACCX»ACTCA GGGCCACAGC CACX31CCCTC ATCCTTTGCT GCCTCCTCAT 780 

CATOCTCGGC TGCTTGATGC TCC9CTGGCAT GTQAGGAGAG TOCTTTAGAG TGACAGGTTA 840 

AAOCTGATAC CAAAAGGCTC CTOTQAGCAC GGTCTTOATC AAACTCGCCX; TTCTOTCTGQ 900 

CCAGCTGCCC AOOACCTAOG GTOTATGTCC AGTGGCCTCC AGCAGATC3VT GATGACATCA 960 

TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020 

TACCTAACAT ATTATQCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080 

TTCIOOCTGA CTAAACAAGA TATATCATTT TCOTCTTCT CTTTTTGITT GGAAAATCAA 1140 

GTACTTCTTT GAATGATGAT CTCTTTCTTO CAAATGATAT TOICAGTAAA ATAATCAOGT 1200 

TAGACTTCAO ACCTCTGQGO ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260 

ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAOTAA TTTATTQTTC TGTACTQATA 1320 
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA AA 

Seq ID NO I 398 Protein sequence 
Protein Accession #t BAB61048.1 

1 11 21 31 41 51 

I I I I I I 

HAAAAATKIXi LCLPLLLLLS GWSRASRADP HSLCYDITVI PKPKPOPRHC AVQGQVOEKT 60 

FLHYDCGNKT VTPVSPLGKK XjMVTTAWKAQ NPVLRBWDI LTBQLRDIQL ENYTPKEPLT 120 

LQARMSCBQK ABGHSSGSWQ FSPDGQIPLL FDSEKRMWTT VHPGARKMKB KWENDKWAM 180 

8FHYFSMGDC IGHI^OFLMG MDSTXiBPSAG APLAMSSGTT QLRATATTXjI IiCCLltllLPC 240 
FZLPGX 

Seq ID NO I 399 PNA aeguence 

Nucleic Acid Accession «i |IM_001898.1 

Coding sequence* 57.. 482 

1 11 21 31 41 51 

1 I I I I I 

GGCTCTCACC CTCCTCTCCT GCAC5CTCCAG CTTTGTGCTC T6CCTCTGAQ GAGACCATGO 60 

CCCAGTATCT OAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120 

GCCCCAAGGA QOAGGATAGO ATAATC0CXX3 GTGGCAXCXA TAACGCA6AC CTGAATGATG ISO 

AGTGGGTACA OC XT m CCCTl' CACTTOGCCA TCA6CSAGTA TAACAAGGCC ACCAAAOATO 240 

ACTACTACAO AOGTCCGCTG CGGOTACTAA GAGCCAG6CA ACAGACOGTT GGOGGGGTGA 300 

ATTACTTCTT OGAOGTAGAG GTGGGCCGCA CCATATGTAC CAA GTCCCAO CCCAACTTQG 360 

ACACCTGTGC CTTCCATGAA CAGCCAQAAC TGCAGAAOAA ACAOTTGTGC TCTTTOGAGA 420 

TCTA08AA6T TCCCTGOGAG AACAGAA60T CCCTG6TGAA ATCCAGQTOT CAAO AATCCT 480 

AGGGATCT6T GCCAGGCCAT TCGCRCX3U3C CACCAOCCAC TCCCACCCCC T6TAGTGCTC 540 

CCACCCCTGG ACTGGTOGCC CCCUCCCTGC GGGAGGCCTC CCCATQTGCC TGOGCCAAGA 600 

QACAGACAOA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCQCTCT GCCCTCCCTC 660 

CrrCLViVIT GCTTCTAATA GCCCTGOTAC ATGGTACACA CCCCCCCACC TCCPGCAATT 720 
AAACAGTA6C ATCGCC 

Seq ID NO: 400 Protein sequence 
Protein Accession «i NP_001889.1 

1 11 21 31 41 51 

\ \ \ \ \ ^ 

KAQYLSTUiL XJiATLAVALA WSPKBBIHIZI PGGIYNADUI OBWVQRALHP AISEYNKATK 60 

DDYYRRPIiRV LRARQQTVGG VNYFFDVBVG RTZCTKSQPN LOTCAFHEQP BU)KXQLCSF 120 
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BIYEVPWENR RSLVKSROQB S 

Seq ID NOt 401 DMA sequence 

Nucleic Acid Accession fti NM_003976.2 

Coding sequences 299.961 

1 11 21 31 41 SI 

I I I I I I 

CTCTGAGCTT CTCTGA60CT TGTTTGCTCA TCTGGAAAAA GGGQATTAAA CGATTTACCT 60 

GATGQAGTTG TGAAAJQAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAO 120 

CTACTTCTGC TGGGTTGAGT CTAGCTGTGT A6GCCCCTTG TTCCTCACCT GGAQAAACTO 180 

GGGTGGCAGG CXX3GTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240 

CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTOT TGATAGAOAT 300 

GGAACTTGGA CTTGGAGGCC TCTCCACGCT OTCCCACTGC CCCTGGCCTA GGOGGCAGCC 360 

TGCXXrrOTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AQOGTOGCAO AGOCCTCCCT 420 

GGGCTCOGCG CCCOGCAGCC CTGCCCCXXX3 COAAOGCCCC CCGCCTQTCC TGGCOTCCCC 480 

OGCCGGCCAC CTGCCGGC3GQ GACGCACGGC CCGCTGGTGC AGTGGAAGAQ CCCG6CX3GCC 540 

GCXX5CC3GCAG CCTTCTOGGC CX3GCGCCCCC GCCGCCTGCA CCCCCATCTQ CTCTTCCCCG 600 

OGGGGGCCGC GCGGCGCGOG CTGGGGGCXrC QGGCAGCCGC 6CTCGGGCAQ 06GGGGCGCX3 660 

GGOCTGCCGC CTGCGCTOGC AGCTGGTGCC GGTG0QC3GCG CTCGGCCTGG GCCACCQCTC 720 

COACXSAGCTG QTGCGTTTCC GCTTCTGCAG CXSOCTCCTGC OGCCGCQOGC GCTCTCCACA 780 

CGACCTCAQC CTGOCCAGCC TACTGGGCX5C CGGGGCCCTG CGACCGCCCC CGGQCTCCCG 840 

GCCOSTCAGC CAGCCCTGCT GCGGACCX:AC GCGCTAOGAA OCGGTCTCCT TCATGGACGT 900 

CAACAGCACX; TGGAGAACCO TGQACCGCCT CTCCGCCACC QCCTGCGGCT GCCTGGGCTO 960 

AGGGCTOGCr CX3«5GGCTTT GCAGACTGGA CCCTTACCGQ TGGCTCTTCC TGCCrGGGAC 1020 

CCTCOGGCAjQ A0TCGCACTA GCCAGCGGCC TCAGCX»GGG ACX3AAGGCCT CAAAGCTQAO 1080 

AGGCCCCTAC G6GTG66T6A TGGATATCAT CCCGGAACAO GTQAAGGGAC AACTOACTAO 1140 

CAGCCXX3VQA GCCCTCACCC TGOGQATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200 

GQAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGOAC 1260 

CCCTCCTCTQ ATGAACACTA CAGTG6CTGA OGCATCAGCC CCX:GCCCAQG CCCTGTAGGG 1320 

ACA6CATTTG AAGGACAC31T ATTGCAGTrG CTTGGTTGAA AGTGOCrGTG CTGOAACIGG 1380 
CCTGTACTCA CTCATGGGAG CTGGGCCC 

Seq ID HO: 402 Protein aequence 
Protein Accession NP_003967.1 

1 11 21 31 41 51 

I I i I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS WSAPRSPAP RBGPPPVLAS 60 

PAGHLPGGRT ARNCSORARR PPPQPSRPAP PPPAPP8ALP RGGRAARAGG PGSRARAAGA 120 

RQCRLR8QI*V FVRAIXniGBR SDBLVRFRFC SGSCSIRARSP HDLSIASLLO AOALRPPPGS 160 
RPV8QPCCRP TRYEAVSFMD VNSTHRTVDR LSATAOSCLQ 

Seq ID NO: 403 DMA sequence 
Nucleic Acid Accession tf: NM_OS7091.1 
coding sequence: 783.. 1445 

1 11 21 31 41 SI 

I I i I I 1 

ACIGGCOGCT aAOAGAAGAA TCGGGTG6AG CAGAOAGCAG CTQCTGCAGG GCAGACA6CC 60 

GGAGCGCCAA ATCTGCACX3T ACCM3CAGTC AOCOGCCCCA CSCAOGGAOC GQCTTACOCC 120 

TOGCTCCCOO CCCTCACTCA CT TTCTCCOg CCCT€XK3CXX: OGCCTCCCAG CTCTCTACTT 180 

CXX:GTGTCTA CAAACTCAAC TCCOGGTTTC OBTGCCTCTC CACOGCTOGA GTTCTCTACT 240 

CTCCATATCC GAGGGGCCCC TCXX3W3CATC TACCCCCCTC CCAACCTOGG GGGACXTTAGC 300 

CAA6CTAGGG GGGACTGQAT COGA06GGTG GAGGAGCCA6 GTGAOCCCOO AAAGGTGGGG 360 

0G6GGCAGG6 GG6CTCGCAG CXXXACCCCO GOATCTGQTG AC96CTGGGGC TGGAATTTGA 420 

CACOGGACGG CTGCGGOGGC GGGCAGGAGO CTGCTGAGGG AT0GAGTTG8 OCCOGGCCGC 480 

CAGACAAGGC CCGGGGGCTC CGCCAOCAQC AG6TCCCT00 GGCCCCAGCC CTOGCTGCCA 540 

CCOQQGCCTG QAGCCCCACA CCCQAGGGTG CAGACTGGCT GCXaAGGCCA CACTTTTGGC 600 

TAAAA0AG6C ACTQCXa^T GTACAGTCCT GGGCATGCXSC TGTTTQA6CT TOGQGGGAGA 660 

GOGCAGGAGT aGTCCCOGGA. AAO O T G OCTA 6AAGAACAAG GTGCAGGAOC C08T6CT6CC 720 

TCAACAGOAO GGTGGG6QAA CAfiCTCAACA ATGGCTGATO GGOGCTCCTG OTOTTGATAO 780 

AGATGGAACr TGGACTTGGA OGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGOOTC GCAGAGGCCT 900 

CCXrrGGGCTC CX3GGCXXX36C AGCCCTQCCC CCOQCOAAGG CCCCCCOCXn* GTCCTGGCX3T 960 

COCCOQCOOQ CCAOCTGCOG GGGGGAOGGA CXSGOCOSCTG OTGCASTQQA AGAOCCOOQC 1020 

GGCOGCCGCC GCAGCCTTCT OG6CCCG0GC CCCOQCOOCC TGCACCCOCA TCTGCTCTTC 1080 

CCOGOQGOGG CCQCXSOGGCa OQGOCTGGGG QCCCGGGCAG CX3G0GCT0GG GCAGOGGGGG 1140 

OGCGGGGCTG CCGCCTGCGC TOOCAGCTGG tGCCGGTGOG OOOGCTOGGC CTGGQCCACC 1200 

6CTC0GA0GA OCTGGTGOQT TTOCJOCTTCT 0CA0C3GGCTC CTGCOGCCGC GOGCGCTCTC 1260 

CACAOGACCT CAGCCTGGGC AOOCTACXGO GGGCXSGG6C CCIGC9GAG08 CC0CCXSG6CT 1320 

CXXG6GCCST CAGCXMCCX: TGCPGOCXSAC CCAOOOGCTA COAAGOSGTC TCCTTCATGG 1380 

AC30TCAACAQ CACCTGGAGA ACCGTGGACC GCCTCTCOGC C31C0GCCTGC GGCTGCXrTGG 1440 

GCTGAGGGCT OGCTCCAGGG CTTTGCAGAC TGOACXXHTA CCGGTGGCTC TTCCTGCCTG 1500 

GOACCCTCCC GCAOAGTCCC ACTAflCCAGC GGCCTCAGCC AGGGAC3GAAG .GCCTCAAAGC 1560 

TQAGAOGCCC CTAGOGGTGG GT6ATG0ATA TCATCGCCXIA ACAOGTGAAG GGACAACTGA 1620 

CTA6CAGCCC CAGAGGCCTC ACCCIGCQGA TCCCAGCCTA AAAGAGACCA GAOAOCTCAG 1680 

CTATGGAGCC CTTOGQACCC ACTTCTCACA GACTCTGGCA CTGGCCAGOC CTCGAAOCTG 1740 

GGACCCCTCC TCTOATGAAC ACTACAGTGG CTGAGGCaTC AGCCCCCGCC CAGGCCCTGT 1800 

AGGGACA6CA TTTGAAGOAC ACATATTGCA GTTGCTTGGT TGAAAGTGGC TGTGCIGGAA 1860 
CTGGCCTGTA CTCACTCATG GGAGCTGOCC CC 

Seq ID NOt 404 Protein sequence 
Protein Accession «i NP_003 967.1 
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PCT/US02/36810 



X 11 21 31 41 51 

I I I I I I 

MBLGLGGLST LSHCPHPRRQ PALHPTXAAL AIibSSVAEAS LGSAPRSPAP RBGPPFVLA8 60 
PAGaU?G6RT ARHCSGRARR PPPQPSRPAP PPPAPPSALP RG6EUVARAG6 PGSRARAAGA 120 
RGCRLRSQLV PVRALSLGHR SDELVRPRFC SGSCSffiARSP BDXiSIASLX^ AGALRPPPGS 180 
RPVSQPCaiP TRYBAVSFMD VNSTWRTVDR I»SATA06CbO 



10 Seq ID NOi 405 DNA sequence 

Nucleic Acid Accession «t 1IM_057160.1 
Coding sequence t 1..714 " 

1 11 21 31 41 ' 51 

IS I I I i t I 

ATGCCCGGCC TOATCTCAGC COGAGGACAQ CCCCTCCTTG AGGTCXTTTCC TCCCCAAGCC 60 

CACCTGGGTQ CCCTCTTTCT CCCTGAGGCT CCACTTGOTC TCTCOGCQCA GCCTQCCCTO 120 

TOGCCCACXX: TGGCCGCTCT GGCTCTGCTQ AQCAGOITCO CAGAGOCCTC CXrTGGGCTCC IBO 

GCGCCCX3GCA GCOCTGCCCC C0GC3SRAGGC CCCOOOCCTO TCC TOQ CGTC CC0CXKXS5GC 240 

20 CACCTGCGGG GGGGACGCAC G6CCG6CTG6 T0CAGT6GAA GAGCC0G6O6 GC0GCC6CCG 300 

CAGCCTTCTC GGCCCGCGCC CCCQCCGCCT GCACCCCCAT CTGCTCTTCC CCQCGGGGGC 360 

CGCGCGGCX3C GGGCTGGGGG CCCGGOCAGC OGOGCTCGGG CAG0C3GGGGC GOGGG GCTGC 420 

CGCCTGCGCT CGCAGCTGGT GCOGGT G CGC G06CT06GOC T6G6CCA008 CTCCGAGQAO 480 

Cl- G G'IX j OJlT IX.'UGCnXrrO CAOOOGCTOC TGCOGOOQOG OGOGCTCTGC ACAOGAOCTC 540 

25 AGCCTOGCCA GCCTACT6GQ CGCOGGGOCC CTGOQACCGC CCCOQQGCTC OC3Q6CCOOTC 600 

AGCCAGCCCT GCTGCOGACC CACGCOCTAC GAAQCGGTCT CCTTCATGGA CX3TC3VACAGC 660 

ACXTTGOAGAA CXX3TGQACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTOAGGGCTC 720 

GCTCCAGGGC TTTGCAQACT GGACCCTTAC OGGTGOCTCT TCCTCCCPGG GACCCTCCOQ 780 

CAOAOTCCCA CTAOCCAGCG GCCTCAGCCA GGOACGAAGG CCTCAAAGCT GAjQAGGOOGC 640 

30 TACCGGTGGQ TGATGGATAT CATCCCCGAA CAGGTQAAOG GACAACTQAC TAGCA6CX3CC 900 

AGAGGCCTCA CCCTGOGQAT CCCAGCCTAA AAGACACCA6 AGACCTCAGC TATGQAGCCC 960 

TTOQGACCCA CTTCTCAC3W5 ACTCTGGCAC TGGCCAGGCC TCQAACXTrGG GACCCCTCCT 1020 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCOGOCC AGGCCCTGTA GGG ACAGC AT 1080 

TTOAAGOACA CATATTGCAG TTGCTTGGTT QAAAGTGOCT GT6CTG6AAC TGGCCTGTAC 1140 

35 TCACTCATGG GAGCTGGCCC C 

Seq ZD MOt 406 Protein aequence 
Protein Accession #: NP_47650l.l 

40 I 11 21 31 41 51 

I I I I 1 I 

KPGLISAROQ PIiLEVLPPQA HL6ALFLPEA PLQLSAQPAL WPTLAALALL SSVAEASI/3S 60 

APRSPAPREQ PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSAI«PRGG 120 

RAAJRAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDB LVRFRPCSGS CRRARSPHDL 180 
45 SLASLLOAGA LRPPPGSRPV SQPCXaiPTRY BAVSFMDVNS TWRTVDRLSA TA0GCU3 

8eq ID NOt 407 PNA sequence 

nucleic Acid Accession it KM_057090.1 

Coding sequence! 29.. 715 

1 11 21 31 41 51 

I I I I i I 

CT6ATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGOA CTTGQAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTOGT CTCTCX3G0GC AGCCTGCCCT 120 

55 OTGGCCCACC CTOQCOQCTC TGGCTCTGCT GAGCAGOGTC GCAGAGGCCT CCCIGGGCTC 180 

CGG6C00CGC AGCCCIGCOC CCOGOQAAGO CC0CC06CCT OTCCTGGCGT CCCCOGCCGG 240 

OCACCTGOOS GGGGGA06GA CGQCCCGCTO GTGCAGTGGA AGAGOXGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCOGCQC CCCCGCOGCC TGCACCCCCA TCTGCTCTTC CCCGOGGGGG 360 

CCGOGCG0CX5 CGGGCTGQGQ OCCCGGGCAO CCGCGCTCGG GCAGOGGGGG C60GGGGCTG 420 

60 COGCCTGOGC TCGCAGCTGG TGCCGGTG06 0GCGCTCG6C CTGQOCCACC OCTCCGAOGA 480 

GCTGGTGGQT TTOOSCTTCT GCAGGSGCTC CTGCOGCOGC GC6CGCTCTC CACACGACCT 540 

CAGCCTGGCC AGCCrACTGO OOQCCGGGGC CCTGCGACGO CCCCOGOOCT CCCGGCCOGT 600 

CAGCCAGCCC TGCTGCCQAC CCACGOGCTA CGAAGCGOTC TCCTTCATGG AOOTCAACAG 660 

CACCTGGAGA ACCGTGGACC GCCTCTCOGC CACOGCCTGC GGCTGCCTGG GCTQAGGGCT 720 

65 CGCTCGAGGO CTTTGCAGAC TGGACCCTTA COGGTOGCTC TTCCTGCCTG GQACCCTCCC 780 

GCAOAGTCCC ACTAGCCAGC GGOCTCAOCC AGGGACGAAG GGCTCAAAGC TGAGAGGCCC 840 

CTACOQOT G Q OTGATGGATA TCATCCCOGA ACAGGTGAAO GQAGAACTGA CTAQCAGCCC 900 

CAGAGCCCTC ACCCTGOGQA TCCCA6CCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTOGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTOGAACCTG GQACCCCTCC 1020 

70 TCT6ATQAAC ACTACAGTGO CTGAGGCATC A0CCCC06CC CAGGCCCTGT AGGGACA6CA 1080 

TTTGAA06AC ACATATTGCA GTrGCTTGGT TGAAAOTGCC TGTGCZGGAA CTGGCCTGTA 1140 
CTCACTCATG GGAGCTGGCC CC 

Seq ID MO J 408 Protein sequence 
75 Protein Accession #i NP_476431.1 

1 11 21 31 41 51 

KBLGLGGLST LSHCPWPRRQ APJjGLSAQPA LWPTLAALAL LSSVTIRASLG 8APRSPAPRB 60 
80 OPPPVliASPA GHLPGGRTAR WCSGRASRPP PQPSRPAPPP PAPPSAIiPRO GRAARAOOPO 120 
6RARAAQAR0 CRIASQLVPV RAL6U3HRSD BLVR7RPCS0 SCRRARSPHD XjSLASLUSAG 180 
AXiRPPPGSRP V8QPCCRPTR YEAVSFMDVN 8T1IRTVDRL8 ATACQCLO 
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Seq ID NO: 409 DMA sequence 

Nucleic Acid Accession 8: Bos sequence 

Coding sequence t 1 . . 

5 1 11 21 31 41 51 

I I I i I t 

ATGCCACTGA A6CATTATCT CCTTTTGCTQ OTGGGCTGCC AAGCCTGQOa TGCAGQGTTG 60 
GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGO CCTCCCAOGT GGAOTGCACC 120 
GGGGCACX3CA TTGTGGCGGT GCCCACX3CCT CTGCCCrGGA AOQOCATQAO CCTGCAQATC 180 
10 CTCAACACGC ACATCACTGA ACTCAATOAO TCCCCGTTCC TCRATATCTC AGCOCT CATC 240 
GCCCTOAGGA TTQAGAAGAA TGAGCTGTaS CGCATCACGC CTGGGGCCTT COGAAAOCTQ 300 
GGCTOGCTGC GCTATCTCAG CCTOGCCAAC AACAAGCTGC AGGTTCTGCX: CATCGGCCTC 360 
TTCCAOGGCC TGQACAGCCT TGAGTCTCTC CTTCTOTCCA QTAA CCAG CT GTTGCAGATC 420 
CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC 7GCA0TT6CA 0G6CAACCAC 480 
15 CTCQAATACA TCCCTGAOGO AGCCTTOSAC CACCTGGTAQ GACTCAOGAA GCTCAATCTG 540 
GGCAAGAATA 6CCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGQ CAATCTCCAG 600 
GTCCTCOGGC TQTAT6A0AA CAGGCTCACG GATATCCCCA TGGGCACTTT T OATG GGCTT 660 
GTTAACCTGC AGGAACTGGC TCTACA6CA0 AACCAQATTG GACTGCTCTC CCCTGG TCTC 720 
TTCCACAACA ACCACAACCT CCAOAOACTC TACCTGTCCA ACAACCACAT CTGCCA6CTG 780 
20 CCACCCAQCA TCTTCATGCA GCTGOCOCaG CTCAACCOTC TTACTCTCTT T GGQAATT CC 640 
CTGAAOGAGC TCTCTCTGGG GATCTTOGGG CCCATGCCCA ACCTGCGGGA GCi 'fTGGC TC 900 
TATGACAACC ACATCTCTTC TCTACCCGAC AAT6TCTTCA GCAACCTCCG CCAGTT6CAG 960 
OTCCTOATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC OGOGTQCCTT CAAOGGOCTA 1020 
AOGGAGCTTC GGQAGCTOTC CCTCCACACC AACXKACTGC AGGACCTQQA OGOaAATQTC 1080 
25 TTCCGCATGT TGGCCAACCT GCAGAAC3VTC TCCCTGCAQA ACAATOGCCT CRGACAGCTC 1140 
CCAOGOAATA TC T TOQCCAA OOTCAATOGC CTCATGGCCA TCCAGCTQCA GAACAACXAG 1200 
CTOQAGAACT TGCCCCTOOO CATCTTOQAT CACCXGOGGA AACTOTOTGA 0CT6CGGCTQ 1260 
TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCOGC TCCGCAACTG GCT CCTGCT C 132 0 
AACCAGCCTA GOTTAGGGAC GGACACTGTA CCrQTGTGTT TCAGCCCAOC CAATOTCO OA 1380 
30 GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGOGTCCA TGTCCCTGAG 1440 
GTGCCTAQTT AOGCA0AAAC ACCATGGTAC CCAOACACAC CX3W3TTACCC TGACACCACA 1500 
TCOGTCTCTT CTACCACTGA 6CTAACCA6C CCTQTGGAAG ACTACACTGA TCTGACTACC 1560 
ATTCAGGTCA CTGATGACOQ CAGCGTTTGG GGCATGACCC AGGCCC3VGAG OGG QCTGGC C 1620 
ATTGCOGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCXTrGGCTOC CTGOGTOGGC 1680 
35 TGTTGCTGCT GCAAGAAQAO GAGCCAAOCT OTCCTGATGC AGAT6AAGGC ACCCAATOAO 1740 
TGTTAAA6AG QCAGOCTQGA GCAGGGCTGG GGAATGATGG GACTGQAOGA CCTGGGAATT IBOO 
TCATCTTTCT GCCTCCACCC CTGGGTCCAT QGAGCTTTCX: CGTGATTGCT CTTTCTGGCC 1860 
CTAGATAAAG GTGTGCCTAC CTCTTCCTQA CTTGCCTGAT TCTCCCX3TAa AQAAG CAGGT 1920 
CXSTGCOOOAC CTTCCTACAA TCAGQAAGAT AGATCCAACT GGCX3VTGGCA AAAGCCCTGQ 1980 
40 GGATTTCCX5A TTCATACCCC TQGQCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCOC 2040 
ACCTOTGCTC CAAOAACAGC CTTCCCTGCQ CCCAGGCCCC CTCCX5Q0CCT CTGTAGACTC 2100 
AGTTAOTCCA CAGCCTGCTC ACTTOGTGGO AATAGTTCTC OGCTGAQATA GCCCCTCTTO 2160 
CCTAAGTATT ATGTAAGTTa ATTTCCCTTC TTTT6TTTCT CTTGTTTGTG CTATGGCTTG 2220 
ACCCAGCATQ TCCCCTCAAA TGAAAGTTCT CCCCTTQATT TTCTGCTCCT GAAGGCAGGG 2280 
45 TQAGTTCTCT CCTCAAAQAA GACTTCAAAC CATTTAACTQ GTTTCTTAAO AGCCGTCAAT 2340 
CAQCCTGOTT TTGGGQATGC TATGAAASAG AlQAAGGAAAA TCATOCCGCT CAGTTCCTGG 2400 
AGACAGAAGA GCOGTCATCA GT G TCTC A CT TGTGATTTTT ATCTGGAAAA QGAAQAAACA 2460 
CCCCAGCACA 6CAA0CTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520 
TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 
50 AAAATCAGCT TATTAATAOG OQATAGAGAA AGAAATCTGG TGCCTGGQGG TCOCTGIGTT 2640 
CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAQCATG TGAAGTGTAC STGCAGAA AA 2700 
6TGG6AACAT GATAGTOTAT GGCTTOGTCG ATTTTCACAA ACTGAACATA CCTGT GTAAT 2760 
CAGCATCTAO ACCCAGACOC AGAGCATCAC AAATATCCCX: CATCCTGGGC TTTTCCCAGA 2820 
GGAGATGGGO GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 
55 TCCCCXXACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAQG GGTGGAGGAG AATATGTGGG 2940 
TQTG6ACAG0 ATGGGAGACT QTGQCCTOAA CAOQAOAITT TATTATATCT GGAGACXX^XS 3000 
AGAGACCCTO AOACCTQQGO CACCATG6CT GGOCAGGTCA GAAOOVTCCT GACTGCAQAG 3060 
GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCX3GC TCAT OGGAGGCCCC 3120 
TCCX5CCTGGA GCCTTCTATQ GAOOTQATAT GCCTGTATCT OTTTTTAATT TTCATTCTTC 3180 
60 ACTTAGQGGA AGTQAAATOQ CTCAGAGATQ AQATCCTTTA ATTGAAAACO AAGTGTAACO 3240 
GAATCIAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CftTCACAQTC AGCTGGCAGC 3300 
TGAACTTCAG AATCTCACTT ACAGCAGGOO ACAOGGGOOT ACACCX3ATGG G TCACACTGG 3360 
GTCTGGGGGC TCCXH^GAGC TCCTCCTGOG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 
TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACQAAT ACCTGCCTTC TCTGGC TTTC 3480 
65 CTGCTATACA CATATTCACA TGGCX3CTCAA 0AAGTTA6GC TCATGGCAAC GTGTGTCTTT 3540 
CTCTGGACAA CT60CCCAGT TTACAGTGAA ATGQAGAATT TCAGGTCTCC AC6TCTGCCC 3600 
A0QAAA6AAC TTCAGCPGAC TCCA0GG6GA TCTOOAAATC CAOQACCAAT CCCX3ATCXXSC 3660 
TCTTATTAGC TCCCOGCTCC ACAA6ACACC TGTGCTTTGO AAATCCACCA CCAATCXXX» 3720 
^ TCGGCTCTTA TTAGCTCCCC GCTCCACAAQ ACACCTGTOA TCTGGAAATC TACCACCAAT 3780 
70 CCOGATGGGC TCTTATTAGC TCCCOGCTCC ACAAGACACC TGTGACATCC TCCAG«3CCA 3840 
aaOAgCAOQ TGCTGAOCAG TTTTCCCTTC CAOTTCCTGC ACAAAAAOTG TCCAGAGGGC 3900 
TGTTTOCAAA CACTAGTOCA CTTTOTAGCT TTTCACCCTC TCTCCCAGGG AATCTAGGAO 3960 
AGATGAGGCC OOTCAGAGTC AAGAGATGTC ATOCCCCCAG GGTCTCCAAG GCATTTCCAC 4020 
ACTATTGGTG GCACCTCGAG QACATGCACC AAGQCTTGCC AGAGCCAACA GQAAGTGAGC 4080 
75 CCAGAGCATG GCACATQAGC ATCACCCGCT GATGGTGGCC TGCigTOCCT GOTGCCAACA 4140 
GGGGOVTCCC GOCCCOTACC CCTCGAISACA GGAAOCATGO OTTTGCCCAC AGACCTGTCG 4200 

Gtfixacrccro tgaotgccct ccaqatgtct ttotocatao gcacaagtgg gccagggctg 42 eo 

GAGGGAGOTG GGAAACCTCA TCATCCGGTQ GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 

GGTATTCCTG GCAGTAGCCA TGACATTGOA GCACCTTCCT CrCCAGCCAG AGGCTGACCT 4380 

80 GAGGGCCACT OTCCTCAQAT GACACCACCC AGQAGCACCC TAGGTGAGGG GTQAGGGCCC 4440 

CCTTATOTGA ACCTCTTOOC TCTTCCTTTC TCOCATCAGA GTGGTTGGAT OGAGCCATTG 4500 

GCCTCCTTTT CTTCAGOGGG CCCTTCAAOC TCTCTGCACC ATQfTTGTCra GCT GAGOAG C 4560 

1ACXAGAAAA GCTGAGTOGA GTCTCCTTTC CAACAGGATO ATOCATTTGC TCAAT TCTCA 4620 

GOGCTGOAAT GAGCOGGCTG GTCCCCCAGA AAOCTGGAGT GGGGTACAOA GTTCAGTTTT 4680 

1115 



wo 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CCTCTCTGTT 
GTGTTGGAOA 
TGCACAQATA 
GGTAG6AGTG 
GGTGTTCAiVT 
TTOTCTTGGG 
TTAGTCTTGQ 
GGAAAAAATA 
TG06CT6TAT 
AACTTTTOVT 
GAACTTCCAA 
AGTTGGT06A 
GCCCCCAGAT 
GGAAGGAAGC 
CTCCTTC06C 
CTTCATGCTG 
GGCCCAGTGC 
A6CCCTGGT0 
OGTGTACAGIL 



TACAGCTCCT 
AGAAACAACA 
CTCTTCAAGC 
OOQCCTCTAC 
AOQCTGGGAO 
CTTTCGTCAT 
TCATCAGAAC 
AACTCTTCCA 
GTATATTOTT 
GGACACAATT 
ACTCAGQAAG 
CAGATQTTAG 
CCCACAGTCA 
CATG6CTGT0 
CCCA6GTTTC 
CCTTCAAAGC 
TTGGOQATGC 
GGCAGGGTTO 
ATCAAGAATA 



TGACAGTCCC 
AAAGCCAATT 
ACTGGA06T0 
CCACTTGTGA 
TTTTATTTAT 
TAAACCAAAG 
CTCACTTGGT 
TCCCTTAAAO 

LTixxr i ix.y i' 

TCCACAAOCT 
TTTGCAQAOA 
ATOTATCCTA 
GAACTGAATC 
QTTCAiSAaAO 
TTCTTCTCTT 
TAOATCATGT 
ATTTACAGAT 
GGSGGTCT6T 
AATAATATAC 



AOGOCCATCT 
AGAACCACTA 
OATTCTCTCT 
T6GG0TACA0 
CTCTTGAAAC 
QAAATOGAAG 
ACCATATAGA 
AATAOAATAO 
ASAATTTAGA 
TTCAGATOCT 
GCAGACAQCT 
GCTTTTAQCC 
TGGGTTQTTG 
GGT0Q6CTG6 
AAGGAGAGAT 
TTGCCTTGCT 
TTCTAGGCCC 
CTTCTGCTGG 
ATGTAT 



GGAGTGGGAO 
TTTTTAAAAA 
CTAGCCCTOV 
AGGGACTTGC 
TTTOTACAAG 
OCATTCCCCT 
TCAAAAGCTT 
TTTGTCXICTC 
GATACAAQAO 
GATOTAOAOC 
AGAOATAACT 
ATAAACC31CT 
G6AAGCCAGC 
CAAGGCACTT 
TGTTCTCACC 
TAGAQAATTA 
TCAGGGTTTT 
ATGCTGCTTG 



CTGGGAGTTA 
GTGCTTACTG 
GCACCCCTQC 
TCTTCTQCAT 
AGCTCAT6GC 
GTTGCTCTCC 
TGTAACCACA 
TCATG GQAA T 
TTCTACTTAO 
TATTGGGAAA 
CGGGACCCAG 
CAAAGATTCA 
AGTGGCCTTG 
C06GGGAAAA 
AACCOQCTGC 
CTGCAAATCA 
GTAGAQTGTG 
TAATCCATTT 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 



Seq ID NOt 410 Protein eequence 
Protein Accession ft: BAB84587.1 



MPLKHYLLLIi 
UITHITEIjNE 
FQGLDSLESL 
GiCKSLTHISP 
FHMHHKLQRti 
YDHHISSLPO 
FSMLANLONI 
ymfPHRCDSD 
VPSYPBTPWy 
lAAIVIGIVA 



11 
I 

VQCQAWGAGL 
SPFLNI8ALI 
LLSSNQLLQI 
RVPQHL6^^JQ 
YLSNNHISQL 
NVFSNLRQLQ 

ILPLRNWLLL 
PDTPSYPDTT 
LACSIiAACVG 



21 

I 

AYHGCPSECT 
ALRISKNBIiS 
QPAHPSQCSN 
VLRLYEHRLT 
PPSIFMQLPQ 
VLIIiSRNQIS 
PGNIFANVNQ 
NQPRLGIDTV 
SVSSTTELTS 
CCCCKKRSOA 



31 
I 

CSBAfiQVECT 
RXTPGAFRNL 
LKE IiQLHCare 
DIFMQTFIXSi 
LNRLTXjPGlie 
FISPGAFNGli 
liMAIQLONNQ 
PVCPSPANVR 
FVEDYTDLTT 
VLMQMKAPNB 



41 

I 

GARIVAVPTP 
GSLRYL8LAN 
LEYIPDOAFD 
WLQBLALQQ 
LXBLSLGIPG 
TBLRBXiSLBT 
LENLPLOIFD 
6QSLIZINVN 
ZQVTDDR8VW 
C 



51 

I 

LPHNAMSLQI 
HKLQVLPIGL 
HIiVGLTKLNL 
NQIGIJiSPGL 
FKPinABLWL 
HALQDLDGNV 
HLGXIiCBLRL 
VAVPSVHVPB 
GNTOAQSQIiA 



Seq ID NO I 411 ia<A sequence 
nucleic Acid Accession #t XMJ>96151 
Coding sequences 1..447 



1 

I 

ATQATGCATT 
AGTGGGOTAC 
TCTGGAGTGG 
CTATTTTTAA 
TCTCTAGCCC 
CAGAGQCACr 
AACTTTGTAC 
AAGCCATTCC 



11 
I 

TGCTCAATTC 
AGAGTTCAGT 
QAGCTQGQAG 
AAA6TGCTTA 
TCAGCACCCC 
TGCTCTTCTG 
AAGAGCTCAT 
CCTGTTGCTC 



21 

I 

TCAGGGCTGG 
TTTCCTCTCT 
TCAGTGTTGG 
CTGTGCACAG 
TGCGGTAGGA 
CATGGTGTTC 
GGCTTGTCTT 
TCCTTAG 



31 
I 

AATGAGCOGG 
GTTTACAGCT 
AGAAQAAACA 
ATACTCTTCA 
GTGCCGCCTC 
AATAOOCTGQ 
GG6CTTT0GT 



41 
I 

CTSOTCCCCC 
CCTTGACAGT 
ACAAAAGCCA 
AGCACTGGAC 
TACCCACTTO 
GAOTTTTATT 
CA7TAAACCA 



60 
120 
180 
240 
300 
360 
420 
480 
S40 



51 
I 

AGAAAGCTCG 
CGCS^OOGCCA 
ATTAOAACCA 
GTGGATTCTC 
TGATGGG6TA 
TATCTCTTCA 
AAGGAAATGQ 



Seq ID NOi 412 Protein sequence 
Protein Accession #: XP_098151 

1 11 21 31 41 51 

I I I I I ) 

MHBLLNSQGW NBPAjSPFBSH 66VQSSVFLS VYSSLTVPRP 8GVGAGSQCN RRHNKSQLBP 
LPUCSAYCAQ ILFKmnmil* SLAIjSTPAVO VPPLPTCDGV QSHLLFCKVP NRIiGVIiFISS 
NFVQEUtAOi GLSSUSQRKW XPFPCXSP 

Seq ID NO: 413 DNA sequence 

MUcleic Acid Accession #: NM_002658.1 

Cfxling sequence I 77.. 1372 



GTCCCC6CAG 
CCCCGACCTC 
GAOOGACTCC 
TGQAGGAACA 
qAA ATTOQQ A 
TCACTTTTAC 
CTCTGCCACT 
CCTGGGGAAA 
GCAGGTOGGC 
AAA QOCCT CC 
COQCTTTAAO 
CATCTACAGG 
CCCTTGCTGG 
CAT06TCTAC 
GSTGOAAAAC 
CATrOOCTTG 
ACAGAGCATC 
CACTQOCTTT 



11 

I 

GGCX36T06GG 
GCCACCATQA 
AAAGGCAGCA 
TGTGTGTCCA 
GGGCA6CACT 
OOAOGAAAGO 
GTCCTTCAGC 
CATAATTACT 
CTAAAGCCGC 
TCTGCTCCAG 
ATTATTOGGO 
AGGCAGOGGG 
QTGATCAG03 
CTGGGTOGCT 
CTCATCCTAC 
CIGAAGATCC 
TGCCTGCOCT 
GOAAAAGAGA 



21 

I 

COCTCCTGCC 
OAGCOCTGCT 
ATGAACTTCA 
ACAAGTACTT 
OTGAAATAGA 
CCAOCACTGA 
AAAGGTACCA 
GCAGGAAGCC 
TTOTCCAAGA 
AAGAATTAAA 
QAOAATTCAC 
GGGGCTCTGT 
OCACACACTG 
CAAGGCTTAA 
ACAAGGACTA 
GTTCCAAGGA 
CXSATGTATAA 
ATTCTA0CX3A 



31 

I 

GCAOGCXSVCC 
GGCGCGCCT6 
TCAA6TTCCA 
CTCCAACATT 
TAAGTCAAAA 
CACCAT66GC 
TGCCCACAGA 
AOACAACCGG 
QTGCATGGTO 
ATTTCAGTGT 
CACCATCGAG 
CACCTA06TG 
CTTCATTGAT 
CTCCAACAOO 
CAGOGCTGAC 
GGGCAGGTGT 
CX2ATCCCCAG 
CTATCTCTAT 



41 

I 

gaggcx:gccg 

CTTCTCTGOO 
TaSAACrOTG 
CACTQGTGCA 
ACCTGCTATG 
CG6CCCTGCC 
TCTGATGCTC 
AGGGQACCCT 
CATGACTGCG 
GGCCAAAAGA 
AACCAGCCCT 
TGTGGAGGCA 
TACCCAAAQA 
CAAQGQGAGA 
AOGCTTGCTC 
0C0C AGCX3VT 
TTTGGCACAA 
GGQQAGCABC 



60 
120 
180 
240 
300 
360 
420 



60 
120 



51 

I 

CCGTCTA0CX5 
TCCTGGTOGT 
ACTGTCTAAA 
ACT6CCCAAA 
AGGGGAATGG 
TGCCCTGQAA 
TTCAGCTGGG 
GGTGCTATOT 
CAGATGGAAA 
CTCTGAGGCC 
GGTTTGG6GC 
GCCTCATCAO 
AGQA QQACTA 
TGAAGTTTGA 
ACCACAAGGA 
CCOGGACTAT 
GCTGTQAGAT 
TQAAAAT6AC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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TGTTGTGAAQ CTGATTTCOC ACCGGGAGTG "rcAGCAGCCC CACTACTAOO GCTCTGAAGrT 
CACCACCAAA ATGCTATOTG CTGCTQACCC CCAATGOAAA A CAGMT CCT GCCAGGGAGA 
CTCAGGGGGA CCCCTOQTCT GTTOCCTCCA AGGC06CATQ ACTTTGACTO aAATTGTGAO 
CIQOGGCOOT GQ A TOT Q CCC TQAAGGACAA QCCACSQOCITC TACAOQAGAG TCTCACACTT 
CTTACOCnSG ATCOGCAGTC ACACCAAGGA AGAOAATGGC CTGGCCCTCT OAGSGTCOCC 
AGOOAGOAAA CGGGCACCAC CCGCTTTCTT 6CTGGTTQTC ATTTTTOCAO TAGAGTCATC 
TCCATCRGCT GTAAOAAQAG ACTGQQAAGA TAGGCTCTGC ACAQATGGAT TTGCCTGTGG 
CACCACCAGO OTGAAOGACA ATAOCTTTAC CCTCACGGAT AGOCCTGGOT GCTGOCTGCC 
CA6ACGCTCT GGCCAGGATO GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGC AACT 
TGTCTTTrrC TGGACT6AAG CCTGCAGGAG TTAAAAAG6G CAGGGCATCT OCTGTGC ATO 
GGCrCGAAGG QAGACCXSVGC TCCCCOGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 
AATQAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGQ GAGCTTA6CC 
AATGTGOGAG CAOOGGTTTG GGGAGCA6AG ACACTAAOGA CTTCAGG6CA QGQCTCTQAT 
ATTGCATOAA TGTATCAGQA AATATATATG TGTGTGTATO TTTGCAOICT T6T7GTG1GQ 
GCIGTGAGtG TAAGTOTGAG TAAGAOCTGO TGTCTGATTG TTAAGTCTAA ATATTTCCTT 
AAACTOTGTG GACTGTGATQ CCACACAGAG TGGTCTTTCT GOAGAOOTTA TAGGTCACTC 

ctgggqcctc ttgggtcccc caootqaows tgcctgggaa tgtacttatt ctgcagcatq 
acx:totgaoc aqcactgtct caotttcact ttcacataga tctcoctttc ttgg ccagtt 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTGACTOGG TOGGGTGAGG AOCACTOCTT 
ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTO 

ATCAATAAAA TGTQATTTTT CTGA 

Seq ID NO I 414 Protein eeguence 
Pzotein Acceaeion #i 19P_002649.1 

1 11 21 31 41 51 

I I I 1 i I 

MRALLARLLL CVLWSDSKG SNELHQVPSN CDCLHGGTCV SNKYFSNIHW CNCPKKFGGQ 60 

HCBIDXSKTC YB6N0HFYR0 KASTDTMGRP CIiFHNSATVL QQTYHABR8D ALQIiGLQKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECKVHD CADGKKPSSP PKELKFQOGQ KTLRPRPKII 180 

GGEFTTIQJQ PWFAAIYRRH ROQSVTYVCG GSIiISPCWVI SATOCFIDYP KKEDYIVYLQ 240 

RSRIHSNTQG EMKPEVENLI LHKDYSADTL AHHNDIALLK IRSKEQRCAQ PSRTIQTICL 30 0 

PSMYNDPQFG TSCBITGPGK ENSTDYLYPE QUCMTWKLI SHREOQQPHY YGSEVTTKML 360 

CAADPQWKTD SC3QGDSGQPIi VCSXiQGRNTL TGIVSHGRGC ALKDKPGVYT RVSHFIiFHZR 420 
SHTKEENGLA L 

Seq ID NO: 415 DNA sequence 

Nucleic Acid Accesaion «i NM_024422.1 

Coding sequences 202.. 2907 

1 11 21 31 41 51 

I I I I I I 

OGCCAAAGGA AAAGCCCCTT GGATGAQAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCrCCGCXSC GCCCCACXrrC CTCOGCXTTOG CGCTCCTCCT QAGGAOGOGG CCCAGAGTGC 120 

GCTCaSGCCG OGGCCCTOSC CCCGCGGAOC CCTOCTACCC CGGCCCGAOQ CTCGGCCOSC 180 

OACCT6CC0C GAOCCCTCTC CATGGAGGCA GCCOGCCCCT CCGQCTCCTG GAACGGAGOC 240 

CTCIGCCGGC TGCTCCTGCT QACCCTOGCG ATCTTAATAT TTQCCAGTGA TGCCTQCAAA 300 

AATGTGACAT TACATOTTCC CTCCAAACTA GATGCCQAGA AACTTGTIGO TAGAOTTAAC 360 

CrOAAAQAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA aTQATGCTGA CTTCCAAATT 420 

TTGGAOQATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTOCTOGGA GAAGAGAAOT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAQAAAAGA AQAAAATATT TGTCTTTTTG 540 

6AGCATCAAA CAAAGGTCCT AAAQAAAAOA CATACTAAAO AAAAAGTTCT AA GGOSO GCC 600 

AAGAOAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACAOI GCOCAAAACT ATACCATATA CTATTOCATA 720 
AGAGGTCCTO GAGTTGACCA AGAACCTCGG AATTTATTTT ATOTCGAGAG AGACACTGGA 780 
AACTT6TATT GTACTCGTCC TGTAGATCX3T GAGCAGTATG AATCTTTTQA GATAATTGOC 840 
TTTGOVACAA CTOCAQATOO GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCA AAATA 900 
GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

QAAAATTQCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATQAOOCT 1020 

GACAOQATOC ACACAOGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAQQ03TG ATCACCACAA CATCATCTCA GCTAGA CftOA 1140 

6A0TTAATTQ ACAAOTACCA GTTGAAAATA AAAGTACAAG ACATGOATGO TCA GTATT TT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATO ATGTAAATGA CCAC TTOCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAOTTQA TOTGQAAATC 1320 

TTAOGAGTTA CTGTTGAGGA TAAGGACTTA GTOAATACTG CTAACTGGAO AOCTAATTAT 1380 

AOCATTTTAA AOGGCAATGA AAATGGCAAT TTTAAAATTQ TAACAQATGC CAAAAC CAAT 1440 

QAAGGA0TTC TTTOrGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG ISOO 

CAAATTGQTO TA6TTAATGA AOCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCAtG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATQ AGGQCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTGGCAT GAAAGAAAAT GCAOAAGTGQ GAAOUVCAAG CAAT66ATAT 1680 

AAA6CATATG ACCCAGAAAC AAGAAGTAGC A6TGGCATAA G6TATAAGAA ATTAACTQAT 1740 

CCAACAGGOT GGGTCACCAT TGATQAAAAT ACAGGATCAA TCAAAQTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAAT6GC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA OAACATGTAC GGGGACACTG GGCATTATAC TTCAAGAOGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAQTGATC ATCTGCAAAC CCAOCATGTC ATCTGCGQAO 1980 

ATTGTTQOQG TTGATCCTGA TGAGCCTATC CATGGCCXaC CCTTTOACTr TAOTCTGGAG 2040 

AGTTCTACTT CAGAAGTACA GAGAATOTGG AOACTOAAAG CAATTAATGA TACAGCAGCA 2100 

otfi v r n xxT atcaqaatga Tccr c c A 'rn' ggctcatatg tagtacctat aacagtgaqa 216O 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTOTXTTGA CTGCATTAOC 2220 

GAAAATGACT GCACACATOJ TGTAOATCCA AGQATTGGOO GTGGAGGAOT ACAACTTQ QA 2280 

AAGTGGQOCA TCCTTGCAAT ATTGITGGGC ATAGCATTOC TCTTTTOCAT CXnxnTTAOS 2340 

CTOQTCTGTQ 6GGCTTCTG0 GACX3TCTAAA CAACCAAAAG TAATTCCTGA TOATTTAGCC 2400* 

CAGCAGAACC TAATTGTATC AAACACAGAA OCTCCTGQAG ATGACAAAOT GTATTCTGCX3 2460 

AATGGCTTCA CAAGOCAAAC TOTQGQOQCT TCTGCTCAGa GAOTTTOTGG CACOHGOGA 2520 
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TCAGGAATCA AAAACGGAGO TCAGGAGACC ATOGAAATGG TOAAWSGAGQ ACACCAGACC 2S80 

TCGOAATCCT GCCGGGGGOC TQGCCACCAT CACACCCTGO ACTCCTGCAO GGOAOOACAC 2640 

ACQQAGGTGQ ACAACT6CA0 ATACACTTAC TOGQAOTGGC ACAOTTTTAC TCAGCCCCOT 2700 

CIT66TGAAA AAGTGTATCT OTGTAATCAA GATQAAAATC ACAAOCATGC CCAAGACTAT 2760 

5 OTGCTGACAT ATAACTATGA ASGAAGAGGA TOG O TO G CTO GOTCTQTAGG TTGrTGCAGT 2820 

QAACGACAAG AAGAAGATGG OCTTGAATTT TTGOATAATT TGGAGCCCAA ATTTAGGACA 2880 

CTAGCAOAAG CATGCATGAA GAGATGAGTG TGTTCTAATA AGTCrCTGAA AGCCAGTGGC 2940 

TTTATOACTT TTAAAAAAAA TTACAAACCA AOAATTTTTT AAAQCAGAAG ATGCTATTTG 3000 

TGOGGGTTTT TCTCTCATTA TTTGQATGQA ATCTCTTTOG T CRAATG CAC ATTTACAOAO 3060 

10 AGACACTATA AACAASTACA CAAATTTTTC AATTTTTACA TATTTTTAAA TTAC TTATCT 3120 

TCTATCCAAG QAGGTCTACA GAGAAATTAA AGTCTGCCTT ATTTGTTACA rrTGGGTATA 3180 

ATGACAACAO CCAATTTATA GTGCAATAAA ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240 

TATTTGAAGC ACAACCTAAT GGAAAATT6T AGAGAGCTTG CTTTAACATT ATCT CCAG TT 3300 

AATTAAGTGT TCATG10QTO CTTGGAAACT GTTGTTTTGC TGAACATCTA AAGTOTOTAG 3360 

15 ACTGCATTCT TGCTATTATT TTATTCTTGT AATGTGAOCT TTTaVCTOTG CRAAGGGAGA 3420 
TTTCTAGCCA GGCATTGACT ATTACAATTT CATT 



20 



Seq ID NOt 416 Protein sequence 
Protein Accession «t HP 077740.1 



1 11 21 31 41 51 

I I I ! I I 

MBAARP6GSW NGALGRLLLL TLAILZFASO ACKNVTLHVP SKLDABKLVO RVNLXECFTA 60 

ANUHSSDPD FQIIiEDOSVY TTNTILIiSSE XRSFTXLIjSH TEHQEKKiaF VFLEHQTXVIi 120 

25 KKRHTKEKVL RRAKRRMAPI PCSMLBNSLO PPPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180 

EPRNLFYVER DTGNLYCTRP VDREQYESFB IIAFATTPDG YTPBLPLPLI IKIEDENKIY 240 

PIPTEBTYTP TIPENCRVGT TVGQVC3VTDK DBPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

TGVITTTSSQ LDRELIDKYQ UaKVQDMDG QyFGLQTTST CIINIDIIVIID HLPTPTRTSY 360 

VT9VEBNTVD VBILRVTVED KDLVNTANWR ANYTIIiKailB KOIirPKIVTDA KIKEGVLCW 420 

30 KPLNYBBKQQ MILQIGWNB APFSREA8PR SAMSTATVTV NVBDQDBGPE CKPPIQTVRM 480 

KBMAEVQTTS NGYKAYDPBT RSSSGIRYKK LTDPTGWVTI DENTQSIKVP RSLDRBABTI 540 

KNQIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPPIPKK TVIICKPTMS SABIVAVDPD 600 

BPIHQPPPDF SLESSTSBVQ RMMRLKAXMD TAARLSYQND PPPGSYWPI TVRDRLOMSS 660 

VTSLDVTLCD CITBn}CTHR VDPRIGGGGV QLOKMAXXiAZ LLOZAUiFCI LFTZiVGGASG 720 

35 TSKQPKVIPD DLftQQlH.lVS MTBAPGnDKV YSAMOFTTQT VOASAQGVOO TVGSOIKNGO 780 

QGTIBNVKGG HQTSBSCRGA GRBBTI«DSCR GGUTBVDNCR YTYSEWRSFT QPRZiGEKVYL 840 

CKQDBNHKBA QDYVLTYHYE QRGSVAGSVO CCSBK(V3SDG tiBFLDNZiEPK FRTLAEACHK 900 
R 

40 Seq ZD NOi 417 DMA sequence 

Nucleic Acid AccesBion #s KM_004949.1 
Coding sequences 202.. 2745 

,^1 11 21 31 41 51 

45 I I 1 I I I 

CX^CCAAAGGA AAAGCCCCTT GGATGAQAGG CAOaOGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CrCTCCGCGC GCCCCACCTC CTCCGCXTTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCOGGCOQ CGGCCCTCX5C CCOGCGGAGC CCTCCTACCC 0GGCC0QACX3 CTOGGCCCGC 180 

GACXMCCCC GAGCCCTCTC CaTGOAOOCA OCOCGCXXXT OOGGCTCCTG GAAOQQAGCC 240 

50 CTCTGCCGGC TOCTCCTGCT GACCCTOGOG ATCTTAATAT TTQCCAGTGA TOCCXGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GAT6C0QAGA AACTTGTTGG TAGAGTTAAC 360 

CIGAAAOAOT OCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCXTO^ CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTOGGA GAA GAGAAG T 480 

TTTACCATAT TACTTTCCAA CACTQAGAAC CAAGAAAAGA AGAAAATATT TSTCrmTG 540 

55 GA6CATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAO AAAAAGTTCT AAGGC60G0C 600 

AAGAGAAGAT GOQCTCCAAT TCCTTQTTOG ATGCTAGAAA ACTOCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACAOQ GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AQAACCTOGO AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACTCGTCC TGTAQATOGT GAGCAGTATG AATCTTTT6A QATAATTGOC 840 

60 TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTGCAC TGOCCCTAAT AATCA AAATA 900 

QAGGATQAAA ATGATAACTA CCCAATTTTT ACA6AAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGOOA CAAOTGTOTO CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGG6C AGQTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA C3VTCATCTCA GCTAGA CAGA 1140 

65 GAGTTAATTG ACAAGTACCA OTTQAAAATA AAAOTACAAG ACATGGATGG TCA GTATT TT 1200 

GOTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATO AT3TAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA OTGQAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTAOaAGTTA CTGTTGAGGA TAAGGACTTA QTQAATACTO CTAACTGOAG AQCTAATTAT 1380 

^ ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTO TAACAGATGC CAAAAO CAAT 1440 

70 GAAGGAGTTC TTTGTQTAOT TAAOCCITrO AATTATGAAG AAAAGCAACA GATGATCTTO 1500 

CAAATTGGTG TAGTTAATGA AGCTCXATTT TCCAGAOAGO CTAQT OCAAG ATCAGCCATG 1560 

A6CACA6CAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTOGCAT GAAAGAAAAT GCAGAAGTGG QAACAACAAO CAATGGATAT 1680 

_ _ AAAGCATATO ACCCAGAAAC AAGAAGTA6C AGTGQCATAA GOTATAAGAA ATTAACTOAT 1740 

75 CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGO CAQAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGAOGT GAATQATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC C CACCA TGTC ATCTGOGGAG 1980 

ATTGrTGOGG TTGATCCTGA TGAGCCTATC CATGGOCCAC CCTTTGACTT TAOTCrGGAa 2040 

80 . AGTTCTACTT CAGAAGTACA GAQAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

O G l'Cm'CCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGXQAGA 2160 

QATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTOTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATOG TOTAGATCCA AGOATTGGCG GTOGAGQAGT ACAACTTGGA 2280 

AAGTOGQOCA TCCTTGCAAT ATTGTT0G6C ATAGCATTOC TCTTTTGCAT CCrGTTTAOG 2340 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



ctggtctgtg 
caocagAacc 
aatggcttch 
tcaggaatov 

T06GJUITCCT 
ACGGAG6TGG 
CTTQOTQAAO 
OTATCTGTGT 
CTATGAA6QA 
AGATGGGCfT 
CATGAAGAGA 
AAAAAATTAC 
TCATTATTTO 
AGTACACAAA 
TCTACAGAGA 
TTTATAGTGC 
CCTAAT6GAA 
OTGGTGCTTO 
ATTATTTTAT 
TTQACTATTA 



GOGCTTCTOG 
TAATTGTATC 
CAACCCAAAC 
AAAAOGGAGO 
GCCGGGGGGC 
ACAACTOCAG 
AATCCATTAG 
AATCAAGATG 
A6A0GATCGG 
GAATTTTTGG 
TC3AGTGTGTT 
AAACCAAGAA 
GATGGAATCr 
TTTTTCAATT 
AATTAAAGTC 
AATAAAATGT 
AATTGTAGAG 
GAAAC7GTTG 
TCTTGTAATG 
CAATTTCATT 



GACOTCTAAA 
AAACACAGAA 
T6TGGG06CT 
TCA6QAGACC 
TGGCCACCAT 
ATACACTTAC 
AGGACACACT 
AAAATCAC3UV 
TGGCTGGGTC 

ataatttgga 

CTAATAAOTC 
TTTTTTAAAO 
CITTGGTCAA 
TTTACATATT 
TGCCTTATTT 
AATTAATTCA 
ACCTTGCTTT 
TTTTOCTSAA 
TGAGCTTTTC 



CAACCAAAAG 
GCTCCTGGAG 
TCTGCTCAGG 
AT06AAATG6 
CACAGCCTGO 
T0GGAGTG6C 
CTGATTAAAA 
GCATGCCCAA 
TGTAGGTTGT 
GCCCAAATTT 
TCTGAAAGCC 
CAGAAGATGC 
ATQCACATTT 
TTTAAATTAC 
GTTACATTTO 
AGTCCTTATT 
AACATTATCT 
CATCTAAAGT 
ACTGTGCftAA 



TAATTCCTGA 
ATQACAAAST 
GAGTTTGTGO 
TGAAAGGAGO 
ACTCCTGCAO 
ACAGTTTTAC 
ATTAAACAAT 
GACTATGTCC 
TGCAGXGAAC 
AGGACACTAG 
AGTGGCTTTA 
TATTTGTGGG 
ACAGAGAGAC 
TTATCTTCTA 
GGTATAATGA 
ATAGACTATT 
CCAGTTAATT 
GTGTAOACTG 
GG6AGATTTC 



TGATTTAGCC 
GTATTCTGOG 
CACCGTGGGA 
ACACCAGACC 
GGGAGGACAC 
TCAGCCCCGT 
GAAA6AAAGT 
7GACATATAA 
aACAAOAAGA 
CAGAAGCATO 
TGACTTTTAA 
GGTTTTTCTC 

actataaaca 
tcx:aaggagq 
caacagccaa 

TGAAGCACAA 
AAOTOTTCAT 
CATTCTTGCT 
TAGCCAGGCA 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2680 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



Seq ID NO: 418 Protein eequence 
Protein Accession NP 004940.1 



MEAARP8GSW 
ANLIHSSDPD 
KKRHTKEKVL 
GPRMLFYVER 
PIFTBETYTP 
TGVITTTSSQ 
VTSVEENTVD 
KPLKYEEKQQ 
KENAEVQTTS 
KNGimXTVL 
BPIHGPPFDP 
VTSLDVTLCD 
TSKQPKVIPD 
QETIEMVKGQ 
GHTLIKN 



11 
I 

NGALCRLLLL 
FQIIiEDOSVy 
RRMCRRWAPI 
DTGNLYCTRP 
TIFENCRVGT 
XiDRELIDKYQ 
VBILRVTVED 
MILQIGWNE 
NQYKAYDPET 
hBDQSQRTCr 
SLBSST8EVQ 
CITENXXrrHR 
DLAQQNLIVS 



21 
I 

TIAILZFASD 
TTNTILLSSE 
PCSMLENSLG 
VDREQYESPE 
TVGQVCATDK 
LKZKVQDMDG 
KDLVNTANHR 
APFSREASPR 
RSSSGIRYKK 
GTLGIXLQDV 
RKNRLKAIND 
VDPRIGGGGV 
NTEAPGDDRV 
GHHHTIiDSCR 



31 
I 

ACaCKVTLKVP 
KR8FTILL8N 
PPPLFLQQVQ 
IIAFATTPD6 
DBFDTHRTRL 
QYPOLQTTST 
AKYTILKGNB 
SAMSTATVTV 
LTDPTGWVTI 
NDNSPPIFKK 
TAARLSYQND 
QLGKHAILAI 
YSANOPTTQT 
GGHTBVDNCR 



41 
I 

8KLDABKLVG 
TENQEKKKIF 
SDTAQNVTIY 
YTFELPIiPLI 
KYSIIGQVPP 
CIXNIDDVND 

NVEDQDEGFE 
DENTGSIKVF 
TVIXCKPTHS 
PPPGSYWPX 
LLGIALLPCI 
VGASAQGVCG 
YTYSEWHSPT 



51 

1 

RVNIiKECFTA 
VFIiEHQTKVL 
YSIRGFGVDQ 
IKIEDENZ3NY 
SPTLFSMHPT 
HLPTPTRTSY 
KTNBC3VLCW 
OIPPIQTVRM 
RSU}RBABTZ 
8ABXVAVDPD 
TVRDRLC^S 
LPTLVOGASG 
TVGSGIKNGG 
QPRLGEESIR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



Seq ID NO: 419 DNA secfuence 
Nucleic Acid Accession ft: 11M^002722.1 
Coding sequence: 14.. 301 



1 
I 

ACTCTGQACT 
GOGTGGCTCT 

ACATCAACAT 

TCTCGGAGTG 
AATGCCACCT 
ACCCTTGGCT 
AAGCC 




CTGGCCAAAG 



Seq ID NOt 420 Protein sequence 
Protein Accession fti NP_002713.1 

1 11 21 31 41 51 

I t I t I I 

MAAARLOiSL UiLSTCVALL LQPLIiGAQGA FLEPVYPGDN ATPBQMAQYA ADLRRYXMML 
TRPRYOKRBK EDTIiAFSBHa SPHAAVPREL SPU)L 

Seq ID MOi 421 DMA sequence 

SlUcleic Acid Accession ftt NM_032545.1 

Coding sequence t 4 6.. 71 8 



1 
I 

AAACTGATCT 
CCATGTCAGG 
CTATCAAAGA 
GCACGGACAO 
GQAOQGCTOQ 
OG080GGCCG 
CCCGGCCCAC 
CCTGGAGCAC 
CCTGCACTGC 
CCACQCTCAC 
ACTCCT6GAC 
CXSTO CTOCAO 
TTCTATGTTO 



11 
1 

TCAATGCACT 
CTTCTGTTTA 
QAGAAACATA 
TCACCGCTCA 
GGGCGGGAGG 
OOCTGCTGCA 
TTCACOGGCC 
GGAGCCTGGA 
CTCCCCCTCC 
GOGCOQAGOG 

G0GGAGCQ6C 
TAAATAATAO 



21 
I 

AAGAGAAGGA 
CGGTCAGTTT 
A06GCQGTAG 
ACTGGACCTC 
AGCCOCTCCC 
G6AA0GG0G6 
GCTACTGOGA 
CCCTCCGOGC 
AGAOGCCTGA 
CGQGG8GG8C 
GCCCG6ATGC 
GOOCCTQGQG 
ATGTGTTTAG 



31 
I 

GACTCTCAAA 
GGCATTACAG 
AGAGGAAGTC 
CA6TCATTTC 
CTACTCCOQG 
TACCTGOGTG 
GCATGACCAO 
CTGCCACCTC 
CCGCTGTOAC 
GCOCAGCCTG 
OOCC6G0CAC 
AAGOGCGGGA 
TTTAC08TAA 



41 
I 

OCAAAAATGA 
ATCATCRATT 
ACCAA6GTTG 
GGAOAGGrGA 
GCXTTOGGAG 
CTGG6CAGCT 
AGG06CAGT0 
TGCAGGTGCA 
COGAAAGACT 
CTACrCTTGC 
CCTGGGTCCC 
CTTGGGCATC 
GCTGAAOCAC 



60 



51 
I 

CCTG6AGGCA 
TGGGAAACAG 
CCACTCAGAA 
CTGGGAGOGC 
AGGGTGGGTC 
TCTGCSTGTG 
AAT60GGC6C 
TCTTOGGGGC 
TOCTGGCCTC 
1GCGCTGCXZC 

TOOTCocrrc 

GCCTTTAATT 
TOGOXGAATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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TTTTXATTOa OTAATAAATA TTTTCATGAA A6060CAAAA AAAAAAAAAA AAAAAAAAAA 840 
AAAAAA 



Seq ID NO: 422 Protein eequence 
5 Protein Acceaaion «: NP_115934.1 

1 11 21 31 41 51 

MTWRHHVRLL FTVSLALQII NLGNSYQRBK HNGC3RBEVTK VATQKHRQSP HTW TSSH PQE 60 

10 VTCSAEGMGP BBPLPYSRAP GEGASARPRC CRNGOTCVLG SPCVCPAHFT GRYCEHDQRR 120 

SB08ALBHGA WTLRACKLCR CIFGAMCLP LQTPDRCDPK DPLASHAHOP SAGOAPSIiM. 180 
UtPCALLHRL LRPDAPAKPR SLVPSVLQRB RRP0QRPGU3 HRXi 

Seq ID NO I 423 DNA sequence 
IS Nucleic Acid Accession #: NM_006533.l 
Coding sequence I 72.. 467 

I 11 21 31 41 51 

20 AOGGAOAGAG GQAGGGGAGG AAATTGGA6A CCCCAOCACC CCCm CTCA CTCTCTPGCT 60 

CACAGTCCAC 6AT6GCCGGO 'XXaXTGOTGT GCCTTGGTGT CATCATCTTG CTGTCTGCCT 120 

TCTCOGGACC TOOTGTCAGO GGTGOTCCTA TGOCCAAOCT GGCTOAOCQG AAGCTGTGTG 180 

CGGACCAGGA GTGCAGCCAC CCTATCTCCA TGGCTGTGGC CCTTCAGGAC TACATGGCCC 240 

COQACTGCCQ ATTCCTQACC ATTCACOGGG GCCAAGTGGT GTATGTCTTC TCCAAGCTGA 300 

25 AG0QCCX5TGG GCGGCTCTTC TGGGGAGQCA GOQTTCAOOO AGATTACTAT GGAOATCTGG 360 

CTGCTGGCCT QGGCTATTTC CCCAGTAOCA TTGTCCQAGA GSAOCAGACC CTGAAACCTG 420 

GCAAAOTOOA TOTOAAGACA GACAAATGGG ATTTCXACXO OCAQTGAGCT CAGCXTTACCG 480 
CTGGCCCT6C OOTTTCCCCT CCTTGGGTTT ATOCAAATAC AATCAGCCCA OTOCAAAC 

30 Seq ID NOs 424 Protein sequence 
Protein Accession #t NP_006S24.1 

1 11 21 31 41 51 

35 ilARSLVCLGV IILLSAFSGP GVRGOPMPKL ADRKLCADQE CSHPISMAVA LQDYMAPDCR 60 
PIiTXHRQQW YVFSKLKGR6 RLPWGGSVQ6 DVYGDLAARIi OYPPSSIVRB DOriOTGKVD 120 
VRTDXMDPYC Q 



Seq ID NO: 425 DNA sequence 
40 Nucleic Acid Accession «: NM_080B70.1 
Goding sequence: 3.. 710 

1 11 21 31 41 51 

45 ioATGACACA AGTCACAGAA AAOTCCACAG AACACOCAQA AAAQACCACG TCAACC3VCAG 60 

AGAAAACCAC AAQAACOCCA GAAAAGCCTA OQCTATACTC AGAGAAOACC ATATOCRCCA 120 

AAGGGAAAAA CACACCAGTC GCAGAAAAGC CTACAQAAAA CCTGGGOAAC ACCACACTGA 180 

CCACTOAOAC CATAAAA6CC CCAGTAAAGT CCACAOAAAA CCCAGAAAAA ACAOCAGCAG 240 

TCACAAAGAC TATAAAACCT TC3VGTCAAG0 TCACAGGAGA CAAATCTCTC ACTACTACCT 300 

50 CTTCTCATCT AAATAAAACT GAAGTTACTC ATCAGGTGCC CACTGGTTCT TTCACCCTCA 360 

TTACATCTA6 AACGAA6CTO AOTTCIATCA CATCAGAAGC CRCRGQAAAC GAGAGCCATC 420 

C31TACCTCAA TAAAGATGOC TCACAGAAAG GTATCCAGGC TGOAGAGATG GGA6A6AAT6 480 

ATTCATTCCX: TGCATGGGCC ATAGTTATTG TGGTCCT G GT GGCTGTGATT CTCCTCCTGG 540 

TGTTCCTTGQ CCTGATCTTC TTQGTCTOCT ATATGATGCG QACACGCCGC ACACTAAC CC 600 

55 AGAACACXX31 GTACAATGAT GCAGAGGATQ AGGOTGOCCC CAATTCCTAC C X3GGTC TACC 660 

TGATOGAOCA GCAGAATCTT GQCATOGGCC AGATOCCTTC CCCAGOSTGA TCTTGGAOTA 720 

GGOGCCCAGC CCTGGCTCTT CX3VTGCTCTG COOCTTTCCT GOATQAGOAA CCGGACTCAC 780 

AATTTCTATT TCOGGGACTA CAGQAAOGQC AGAOAATACT GACGGTTACC AOTATT AACC 840 

CTTCATCTQT TCTTGAAACT GGTTGGGGAA TGAGGTOATA AGCAATOAGQ GTGTAAGTTT 900 

60 AQGGGACAAA GAAGAAAGAA TGAATAATAC GAGCAGACAT TCrCTGTAGA AGGTAATGOT 960 

CIGAGAATGA AAAGGTOTTT GATGGACATO TTGTOG G G O C AGCAAIGGAO AA^CTQCAC 1020 

TGAGTCCTAA AOGAAGGACA GGACCCTTAT AGGCAATGOC C CftOAC TSAC TTCTGROTGO 1080 

GGTTTATGGG GAAAGGGAGO GACTGA6GQC AQAGTCTCTO GQTTTCAGGA CAGCATTATG 1140 

TTATTTCCAT TCACTATTAC TTAAGAGTTT OTOTGTAAAC AGGCTCATCT CTGAGTTCTC 1200 

65 AGGAOCCTTG CCCCCACCCC CATTTTTTTA ATGAAAAAAA AAAACAAAAA AAAOQGATCC 1260 

AAGAAOAAAA GAGAATTXAT TTCCTTCrCC ACTCTCTOCA T8CCCT6GA0 AAAAAAAAOT 1320 

CCAGAAGAAA TCATAAATAT CTCTCATCTA CATG O TTGCT TCC TCTT CCT C CCAAATC OC 1380 

TTAGTTTTCC TAAATOTCTA CAGTGGAOGC OCTGTTGGTT TGGCTTQCTG GOTTGTGGGT 1440 

GQACAC6CAA GQAGGGGATT TTTATTTGGC CAGCAOTCTC ACCCACTGAT CTCCACCCCA 1500 

70 GACCTTCCCT GATTGGTSTC TCAGCATTTA ITTTCCTGTC TCTTCCACCA AAAGCCAOCT 1560 

OTAQCTTTAT CTOGTAAAAO TTACCOVTCT TCXCTACTOT COCCATTCTC TCTOCTOCCA 1620 

CCTTCACXCC AOATTCAAOT m ' i X'lt.W OTAGGCATTT CATCTQTOTQ TCTTTTCTOG 1680 

ATTTTCTCTC TCTCTTCTTA TOGGCATTTC ACCTTATTAC TGATTGGGTA GAGG6GGAAA 1740 

AGGAGAATGA TGATOATAOT TTCCTTCTOT CTATT6ACCT TTTTTATAAT AAAGTATAAC 1800 
75 ATGTT 



Seq ID NO I 426 Protein Beouence 
Protein Accession #•. MP_543146.1 

80 1 11 21 31 41 51 

im)VTBKST8 HPBKTTSTTE I ri' K TPK KP T LYSEKTICTR OKNTPVPBKP T MgNTTL T 60 
TETIXAFVKS TBNPBKTAAV TKTIKPSVK7 TGDKSLTTTS SHLNKTBVTH QVPTOSFTLI 120 
TSRTKLSSZT SBATtSIBSRP YUIKDGSQXG IHAGQHOEND SPPAHAXVZV VLVAVZLLLV 180 
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PliGLZPLVSy MMRTRRTLTQ NTQYMOAEDB GGPNSYPWL KSCXSILGHGQ XPSPR 

Seq XD HOi 427 DNA sequence 
Nucleic Add Accession S: XM_069480.1 
S Coding sequence t 1..4383 

1 11 21 31 41 SI 

ATCGACACTO TGCTGGTGCT OCTCCTGGGC CTOCAGOCCT TGGCOGQACC ' CAGTCOGA AG 60 
10 CCCCaOAAGO ACTCTGTCTC AGACTGGGCC ArTGTOITCA TCACTCTCAC TTTGOTGGCA 120 
OCAATTGTGA GCCTAATGTA OQGTATCAAG AAGGCCTQCC AOTTCOGGAG GGAGATGAGT 180 
C TOG GGTGTO QCTGTG6CTC TfflGACCCCT TACAGCAGCC ACCATGAGGG GOAGGCTQCC 240 
AGCCAGCGCr ACTCTTGTCA AAT6AAAGCT TCTTGGGGGO CAGOTGCTAC TACATTCCAA 300 
GAATATCAGA AAACTGOGGA ACTCTCAACA TCOGATCACA TATTTCCCCT CACTOCAOGC 360 
15 CTTCTTTATA GTATCOCTTT TGATCACaTT GTTCTGCATT CAGQACAAAG ACXTCCAGAG 420 
CTCCCTAAAT CTACAOAAAT CCAT8AGCAA AAAC6CCACT GCAACACCAC ACGCCATTCT 460 
AAaCCAACXO ACAAGCCTAC AGGC3UVCTCC AAAACTATAQ ACCACAAAAO CTCTACAGAT S40 
AATCATGAGG CTCCTCCCAC TTCTOAAQAA AACTCCAGCA ACCAAGGGAA AGACCCAATG 600 
ATCCX3QAACC AQCGCTCTGT TGATCCTGCT OACTCCACTA CCACACATAA AGAATCCOCT 660 
20 GGAAAAAAAC ATATAACGCC AQCACXrCAAO AGCAAAATAA ACTGTC3CTAA GTCCACAACA 720 
GGCAAATCAA OGOTAACAAQ AAAATCAGAT AAAACTGOAA GACCTTTGGA AAAGTCCATG 780 
AOTACTTTGG ATAAGACAAQ TACC»CCTCA CATAAOACTA CAACTTCCTT CCACAACTCA 840 
GGCAATTCAC AGACX3VAGCA AAAAAGCACA TCTTTTCCAO AAAAAATCAC AGCAGCXTCA 900 
AAAACAACAT ACAAGACCAC AGGAACCCCA QAAQAGTCAQ AAAAAACTGA AQATTCCAOA 960 
25 ACAACAGTTG CCTCAGACAA GCTCCTGACA AAAACTACAA AAAACATACA AGAGACCATA 1020 
TCA6CCAATO AGCTCACACA ATCTCTAGCA OAGCCTACAG AACATGGAGQ AAGQACAGCC 1080 
AATGAGAACA ACACACCATC CXXftOCAGAO CXTTACAGAAA ATAGAGAAAQ GACAGCCAAT 1140 
6AGAACACCA CACTATCCCC AGCAGA6CCT ACAGAAAATA GAGAAAGGAC AGCCAATGAG 1200 
AACACCGCAC C3VTTCCCAGC AGGGCCTACA QAAAATAGAG AAATGACAQC CAATGAGAAT 1260 
30 ACCACACTAT TCCCAGCA6A GCCTACAGAA CATGGAOAAA GGACAGCCAA TGAGAACACC 1320 
ACACCATCCC CAGCAGAGCC TACAOAACAT OGAOAAAGGA CAGCCAATGA GAACACTACA 1380 
CCATCCCCRG CAGAGCCTAC AOAACATGGA QAAAGOACCX; OlTTTGCCAA TGACAAAACC 1440 
ACATCATCCT CAGCAGAGTC TACAGAACAT GOAGAAAGGA CCCCACTGGC CAAOGAGAAC ISOO 
ACCACACCAT CCCCAGCAGA GCCTACAGAA AATAGAOAAA GGACAGCCAA TGAGAACACC 1560 
35 ACACCATCCC CAOCAGGGCC TACAGAAAAC AGAGAAAOQA CA6CCAACGA GAAQACCACA 1620 
CTATCCCCAG TAGAGCCTAC AGAAAATAOA GAAACAACAG CCAATGAGAA GACCACACCA 1680 
TCCCCAGCAG AOOCTACAGA AAATOGACAA AGGACCCCAT TTQCCAATGA GAAAACCACA 1740 
TCATCCTCAG CAGAGCCTAC AGAACAOOGA GAAAGGACCC CACTGGCCAA TGAGAACACC 1800 
ACACCATCCC CAGCAGAGCC TACAGAAAAT AQAGAAAGGA CAGC CAATGA GAAGACCACA 1860 
40 CCATOCCCAG CAGAGCCTAC AGAAAATGGA GACAGGACTC CITTGGCCAA TGAGAAOACC 1920 
ACX3CCATCTC TAGCAGAGCC TACAGAAAAT GOACAAAGQA CCCCATTTOC CAATGA6AAG 1980 
ACCACATCAT CCTCAGCAGA GCCTACAGAA CAaaAAGAAA GGACTCX3VCT GGCCAATGAG 2040 
AACACCACAC CATCCCCGGC AGAGCCTACA GAAAATAGAG AAAGGACAQC CAATGAGAAC 2100 
ACCACACCAT CCCCAOCAGG GCCTACAGAA AATAGAOAAA TGACAGCCAA OGAGAAGACC 2160 
45 ACACTATTCC CAGCAGAGCC TACAGAAAAT AGAQAAAGGA CAGCCAATGA GAAGACCACA 2220 
TCATCCCCAG CaGAOCCTAC AGAAAATQOA CAAAOQACCC CATTTOOCAA TGAGAAAACC 2280 
ACATCATCCC CAGCAGAGCC TACAGAACAC GGAGAAAOGA COCCACTQGC CAATGAGAAC 2340 
ACCACACTAT CCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAACACC 2400 
ACACCATTCC CAGCAGAGCC TACAGAAAAT AGAQAAAGGA CAGCCAATGA GAAGACCACA 2460 
50 CCATCCCCAG CACAGCCTAC AGAAAATGGA GACAGGACTC CATTG GOCAA TGAGAAGACC 2520 
ACACCAICTC TAGCAGAGCC TACAGAAAAT GGAAAAAGGA CCCCATTTOC CAAIGAQAAG 2580 
ACCACATCAT CCTCAGCAGA 6CCTACRQAA CA06CA0AAA G6ACTCCACI GGCCAATGAG 2640 
AACACCACAT CATCCCCAOC AGAGCCTACA GAAAATAGAG AAAGGACAGC CAATGAGAAG 2700 
ACCACACAAT TCCCAGCAOA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAAGACC 2760 
55 ACACCATTCC CAGCAGAGCC TACAGAAAAT AGA6AATGGA CAGCCAATGA GAAGACCACA 2820 
CTATCCCCAG CAOAQCCTAC AGAACATGAA GAAATGACCC CATTGGCCAA TGAGAAGACC 2880 
ACACTATCCC CAGCAGAGCC TACAGAAAAT GGAGAAAGGA CCCCATTTAC CAATGAGAAG 2940 
ACCACACCAT CCTCAGCAGA GCCTACAGAA CATGGAGAAA GGACCCCACT GGCCAATGAG 3000 
ATCACCACAC CATCCCGAGC AGAGCCTACA GAACATGGAG AAAGQATAOC CAATGAGAAG 3060 
60 GCCACACCAT CCCCAGCAAA GCCTACAGAA CATGGAGAAA C6ACAGTCAA TGAGGACACC 3120 
ACACCATCCT CAGCAGAGCC TACAGAAAAT GOAGAAAGGA CCCCACTGGC CAAIOAGAAC 3180 
ACCACAACAT CCCCAACAQA GTCTACAOAA CATGGAGAAA GGACAGCCAA TGAGAAGACC 3240 
ACACCATCCC CAGCAGAGCC TACAGAACAT GOAGAAAGGA CACCATCAGC CAATGAGAAG 3300 
ACCATACCAT CTCCAGCAAA GCCTACAGAA CACGAAQAAA TOACCCCATC GGCCAATGAG 3360 
65 AACACCACAC CATCCCCAGT AAAGCCtACA GAACATGGAG AAAAGACTAC ATTGGCCAAT 3420 
6AGAAGATCA CACTATCCCC ACAAG06CCT ACAGAACATO GAGCAAAAAC TAOSTCGGCC 3480 
AATGAGAAGA TCACACCATC OCTAGCAAAO CCTACAGAAC ATOQAOAAAG GAOCACATCA 3540 
CCCAATX3ACA AGATCACCTC ATCTGCAGCA GAGTCTACAQ AACATAGAOA TAGGGCTACA 3600 
TCAGCCAATG TGATCACACC AGCCCCA6CA GAGCCTATAA AACATGCAAA AAGGACCACA 3660 
70 TTGGCCCATG AGAAGATGAC ACAAGTCACA QAAAAGTCCA CAGAACACCC AGAAAAQACC 3720 
ACGTCAACCA CAGAGAAAAC CACAAGAACC OCAQAAAAGC CTAC6CTATA CT CftGAGAAG 3780 
ACCATATOCA CCAAAGGGAA AAACACACCA GTCOCAGAAA AGCCTACRGA AAACCTGGGO 3840 
AACACCACAC TGACCACTGA GACCATAAAA GCCCCAGTAA AOTCCACAGA AAACCCAGAA 3900 
AAAACAGCAG CAGTCACAAA GACTATAAAA CCTTCAGTCA AOGTCACAGG AGACAAATCT 3960 
75 CrCACTACTA CCTCTTCTCA TCTAAATAAA ACTGAAGTTA CTCATCAGGT GCCCACTGGT 4020 
TCTTTCACCC TCATTACATC TAGAACQAAG CTGAOTTCTA TCACA TCWgL AGCCACAGGA 4080 
AAOGAGAGCC ATCCATACCT CAATAAAGAT GGCTCACAGA AAGGTATGGA CGCTO QACAQ 4140 
ATGGGAGAGA ATGATTCATT CCCTGCATGO GCCATAOTTA IWIGGTCCT GGTGOCTOTO 4200 
ATTCTCCTCC TGGTQTTCCT TGGCCTGATC TTCTTGOTCX CCTATATQAT GOGGA CACGC 4260 
80 CGCACACTAA CCCAGAACAC CCAOTACAAT GAT6CAGAGG ATGAGQOTGG COCCAAT TCC 4320 
TACCOOGT C T AOCTGATGGA 6CAGCAGAAT CrXOGCATaO QOCAGATOCC TTCCOCACGG 4380 
TOA 
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Seq ID NOi 428 Protein sequence 
Protein Accession fft XP_069480.1 

1 11 21 31 41 SI 

Si i I I I I 

MUTVLVLLLQ LQALAGPSPK PQKDSVSDWA IVLITLTLVA AIVSLMYGIK KACQFRREMS 60 

LGOGCGSVTP YSSHHBGEAA SQRYSOQMKA SKGAQATTPQ EYQKTGELST SDHIPPLTPG 120 

LVYSIPFDHI VUlSGQRPPE LPKSTBIHEQ KRH CNTTRH 9 KPTDKPTGNS KTIDHKSSTD 180 

NHEAPPTSEB NSSNQGKDPM IRNQRSVDPA DSTTTHKBSA GKKHITPAPK SKIHCRKSTT 240 

10 GKSTVTRKSD KTGRPLEKSM STLDKTSTSS HKTTTSPHNS GNSQTKQKST SPPBKITAAS 300 

KTTyKTTOTP BESEKTED8R TTVASDKI.LT KTTKNIQBTl SANELTQSLA BPTEHGGRTA 360 

MEMHTPSPAB PTENRHRTAN BNTTLSPABP TBNRERTANB NTAPPPAGPT EHREHTAMEM 420 

TTLPPAEPTB HGERTANENT TPSPAEPTEH GBRTANENTT PSPAEPTBHG ERTP FAMDKT 480 

TSSSAESTEH GERTPIANEN TTPSPAEPTB NRBRTANEHT TPSPA GPTEN RETTANBKTT 540 

15 LSPVEPTBNR BTTANBKTTP SPABPTBIGQ RTPFANBKTT SSSAEPTEHG ERTPLAHENT 600 

TPSPAEPTEH RBRTANEKTT PSPABPTEMG DRTPLANEKT TPSLAEPTEN GQRTPFAMEK 660 

TTSSSAEPTB HEBRTPIANE NTTPSPAEPT ENRERTANEN ITPSPAGPTB NREMTAHBKT 720 

TliFPABPTEN RERTANEKTT SSPABPTENG QRTPFANEKT TSSPAEPTBH GERTPLANEM 780 

TTLSPABPTE NRERTANBKT TPFPAEPTEN RERTANENTT PSPAQPTENG DRTPLANEKT 840 

20 TPSLAEPTEN GKRTPPANEK TTSSSAEPTB HAERTPLANB NTTSSPAEPT ENRBRTANBK 900 

TTQFPAEPTB NRBSTANEKT TPPPABPTBN REWTANENTT LSPAEPTEHB EMTPLANEKT 960 

TLSPAEPTHI OBRTPPTNEK TTPSSAEPTE HGBRTPLANE ITTPSRAEPT BHGERIANBK 1020 

ATPSPAKPTE HGBTTVNBDT TPSSAEPTEN GERTPLANEH TTT6PTESTE HGERTWIEKT 1080 

TPSPAEPTEH GERTPSANEK TIPSPAKPTB HEEMTPSANB NTTPSPVKPT EHQERTTLRH 1140 

25 EKITLSPEOP TEKGAKTTSA NEKITPSLAK PTEHOBRTTS PNDKITSSAA BSTBHRDRAT 1200 

SAHVITPAPA EPZXKAXRTT LAHHKMTQVT EKSTEHPEKT TSTTBKTTRT PEKPTLYSEK 1260 

TICTKGKNTP VPEKPTBNLG NTTLTTBTIK APVKSTBNPB KTAAVTKTIK PSVKVTGDKS 1320 

LTTTSSHLNK TBVTHQVPTG 8FTLIT8RTK LSSITSEATO NESHPYLNKD GSQKGIHAGQ 1380 

KGENDSFPAW AIVIWLVAV ILLLVFLGLI FLV8VMKRTR RTLTQNTQYH DABDBQOPMS 1440 
30 YPVYUfEQQN L6M6QIPSPR 



Seq ID NO: 429 DNA sequence 

Nucleic Acid Accession «: F6ENESH predicted 

Coding sequences 1.. 10674 

1 11 21 31 41 51 

ATCnCGCCrC GCCTCGCCTT TTGT T GCTGG GGTCTOOOaC TCGTTTCGGG CTGGGCGACC 60 

TTTCAGCAGA TGTCCCCGTC GOGCAATTTC AQCTTCCGCC TCTTCCCGOA GACCGC3GCCC 120 

40 GGGGCCXXrOG GGAGTATCCC CGCGCCGCCC GCTCXTrGGOG AOGAAGCGGC GGGGAGCAGA 180 

OnSGAOCXSQC TGGGCCAGGC OTTCCQOCOA OQCOTOCGQC TGCTQCGGGA GCTCAGCGAG 240 

C3GCCTGGAGC TTGTCTTCCT GGTOGATGAT TC3STCCAG00 TGGGOQAAGT CAACTTCOGC 300 

AGCGAGCTCA TCTTCGTCCG CAAGCTGCTG TCCGACTTCC CCX5TGGTGCC CACGGCCACG 360 

CGCGTGGCCA TCGTGACCTT CTCX3TCCAA0 AACTACGTGG TGCCGCOCGT CGATTACATC 420 

45 TCCACCC6CC GCGCGOOCCA GCACAAGTGC GCGCTGCTCC TCCAAOAQAT CXXTTGCCATC 480 

TCCTACCOAO GTGGCGGCAC CTACACCAAO GOOOCCTTCX: AOCAAGGCGC GCAAATTCTT 540 

CTTCATOCTA 6AGAAAACTC AACAAAAGTT GTATTTCTCA TCACTGATGG ATATTCCAAT 600 

GGGGGAGACC CTAGACCAAT TGCAGCGTCA CTGOQAQATT CAGGAGTG6A GATCTTCACT 660 

TTTGGCATAT GGCAAGGGAA CATTCOAGAG CTGAATGACA TGGCTTCCAC CCCAAAGGAG 720 

50 QAGCACTGTT ACCTGCTACA CAGTTTTGAA GAATTTQAGG CTTTAGCTCX3 CC3GGGCATTG 780 

CATGAAGATG TACCTTCTGO OAGTTTTATT CAAGATGATA TGGTOCACXG CTCATATCTT 840 

TCTQATGAAG GCAAGGACTO CTGTGACOQA ATQGGAAGCT GCAAATOTGO GACACACACA 900 

OGCCATTTTG AGTGCATCTG TGAAAAGGGG TATTACGGGA AAGGTCTGCA GTATGAATGC 960 

ACAGCTTGCC CATCGGGGAC ATACaAACCT GAAGGCTCAC CAGQAGGAAT CAGCAGTTGC 1020 

55 ATTCCATGTC CTGAT6AAAA TCACACCTCT CCACXTTGOUV GC ACATCCCC TQAAGACTGT 1080 

GTClGCAQAa AGQGATACAG GOCATCTGGC CAGAOCTOTO AACITOTOCA CTQCO CIGCC 1140 

CTGAAGCCTC COQAAAATGO TTACTTTATC C3VAAACACTT GCAAGAACCA CTTCAATGCA 1200 

GCCTGTGGGG TCCGATGTCA CCCTGGATTT OATCTTGTGG GAAGCAGCAT CATCTTATGT 1260 

CTACCCAATG OTTTGtGQTC CGGTTCAGAG AGCTACTGCA GAGTAAGAAC ATGTCCTCAT 1320 

60 CTCCGCCAGC OGAAACATGG CXaiCATCAGC TGTTCTACAA GGGAAAT6TT ATATA AGACA 1380 

ACATGTTTGO TTGCCTOTOA TBAAGGGTAC AflACTAGAAO GCAOTGATAA GCTTAC TTQT 1440 

CAAGGAAACA GCCAGTGGQA TGGGCCAGAA CCCCX3GTGT0 TGGAGCGCCA CTOTTCCACX: 1500 

TTTCAGATGC CCAAAQATGT CATCATATCC CCCCACAACT GTGGC3UV0CA GCCAGCCAAA 1560 

TTTGGGACGA TCTGCTATGT AAGTTGCCX5C CAAGGGTTCA TTTTATCTGG AGTCAAAGAA 1620 

65 ATCCTOAGAT GTACCACTTC TGGAAAATGG AATGTCGGAG TTCAGGCAGC TGXGTGTAAA 1680 

GACOTGGAGG CTOCTCAAAT CAACTGTGCT AAGGACATAG AGGCXAAGAC TCTGOAACAQ 1740 

CAAGATTCTO CCAATGTTAC CTGGCAOATT CCAACAGCIA AAGACAACTC TGGTOAAAAG 1800 

GTGTCAGTCC ACGTTCATCC AGCTTTCACC CCACCTTACC TTTTCXXaWT TGGAGATGTT 1860 

OCTATCOTAT ACACGGCAAC TGACCTATCC GGCAACCAGG CCAGCTGCAT TTTCCATATC 1920 

70 AAGGTTATTG ATGCAOAACC ACCTGTCATA GACTGQTGCA GATCTCCACX! TCCCGTCCAG 1980 

OTCTCOGAGA AGOTACATGC CGCAAOCTGO GATGAOCCTC AGTTCXCAQA CAACTCAGOO 2040 

GCTCAATTGG TC^TTACCaO AAOTCATACA CAAGQAGACC TTTTCCCTCA AOGGGAOACT 2100 

ATAGTACAOT ATACAGCCAC TGACCCCTCA GGCAATAACA GGACATGTGA TATC CATATT 2160 

GTCATAAAAO GTTCTCCCTG TGAAATKXA TTCACACCTG TAAATGGGGA TTTTATATGC 2220 

75 ACTCCAGATA ATACXGQAGT CAACTGTACA TTAACTTGCT TGGAGGGCTA TGATTTCACA 2280 

OAAGGGTCTA CTGACAAGTA TJ A TTGTGCT TATGAAOATO 600TCTOQAA AC CAACATAT 2340 

ACCACTOAAT GGCXAGACTG TGCCAAAAAA OGTTTTGCAA ACCAOGGGTT CAAOTOCTTT 2400 

GA6ATGTTCT ACAAAOCAGC TCGTTGTQAT OAOVCAQATC TGATGAAGAA GTTTTCTGAA 2460 

GCATTTGAGA CQACCCTGGO AAAAATGGTC CCATCATTTT GTAGTOATQC AGAGGACATT 2520 

80 GACTGCAGAC TOGAGGAGAA CCTGACCAAA AAATATTGCC TAGAATATAA TTATGACTAT 2580 

GAAAATG6CT TTOCAATTGG ACCAOGTGGC TGGGGTGCAG CTAATAOQCT GOATTACTCT 2640 

TAOGATOACT TCCTGGACAC TGTGCAAQAA ACAGCCACaUV OCATOGGCAA TQCC AAGTCC 2700 

TCACGQATTA AAAGAAGTGC OCCATTATCT GACTATAAAA TTAASTTAAT TTTTAACATC 2760 

ACAOCTAOro T60CATTAOC CGAT6AAAQA AATOATACCC TTQAATOGGA AAATCAGCAA 2820 
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OQACXCCTTC AGAOVTTGQA AACTATCACA AATAAACTGA AAAGOACTCT CAACAAAGAC 28 BO 
CCCATCTATT CCTTTCAOCr TGCATCAQAA ATACTTATAG CCGACAGCAA TTCATTAGAA 2940 
ACAAAAAAGQ CTTCCCCCrr CTGCAOACCA GGCT CAQTOC TaAOASG GOO TATGTGTGTC 3000 
AATT6C0CTT TGGOAACCTA TTATAATCTC GAACATTTCA CCTOTGAAAG CTGCOGOATC 3060 
5 GGATCCTATC AAQATQAAGA AGGGCAACTT GAGTGCAAGC TTTGOOCTC TGQGATCTAC 3120 
AGGOAATATA TCCATTCAAO AAACATCTCT GATTGTAAAG CTCAGTOTAA ACAAOGCACC 3180 
TACTCATACA GTOGACTTGA GACTTOTOAA T03TGTCCAC TGGGCACTTA TCAGCCAAAA 3240 
TTTGOTTCCC QGACCTGCCT CTOGTGTCCA GAAAACACCT CAACTGTOAA AA GAGG AGCC 3300 
GTGAACATTT CTGCATGTGQ AGTTCCTTGT CCAGAAGGAA AATTCTOGOS TT CIGGQTT A 3360 
10 ATGCCCTGTC ACCCATGTCC TOCTGACTAT TACCAACCTA ATGCAfiGGAA GGCCTTCTGC 3420 
CTG G OCIOTC CCTTTTATGG AACTACCCCA TTaSCTGGTT CCAGATCCAT CACAGA ATGT 3480 
tCAACTTCAO TTCTGAATAT TACTATTTTC GGTGQATTTG GGCATCTGGA GTTOTTAAAT 3540 
•IXri'CCrTC TO AGGTTTTOCA TGAATGCTTC TTTAACOCTT GCCACAATAO TGGAAC CTGC 3600 
CAaCAACTTG GGCGTGGTTA TQTTTGTCTC TGTCCACTTO GATATACAGG CTTAA AgTQT 3660 
15 GAAACAGACA TCGATGAGTG CAGCCX3VCT0 CCTTQCCTCA ACAATGGAGT TTGTAAAGAC 3720 
CTAOTTOaQQ AATTCATTTG TGAGTOCCCA TCAGGTTACA CAGGTCAGCQ GTGTQAAQAA 3780 
AATATAAATG AGTCTAGCTC CAGTCCTTGT TTAAATAAAO GAAT CTOTG T TGATGGTGTG 3840 
GCTGOCtATC GTT6CACATG TGTGAAAGGA TTTOTAGGCC TGCATTGTGA AACAGAAGTC 3900 
AATGAATGCC AGTCAAACCC ATGCTTAAAT AATGCAGTCT GTGAAGACCA G6TTQ QGGGA 3960 
20 TTCTTBTGCA AAT6CCCACC TGGATTTTTG GGTACCCGAT aTGGAAAOAA 0GT06ATGAG 4020 
TC TCTCftQTC AGCCAT6CAA AAATGGAGCT ACCTOTAAAG ACGGTGCCAA TAGCTTCAGA 4080 
TC C C ro TGT Q CAGCrOGCTT CACAGGATCA CACTGTGAAT TGAACATCAA TG AATG TCAG 4140 
TCTAATCX31T GTAGAAATCA GGCCACCTGT GTGOATOAAT TAAJOTCATA CAGTT G TAAA 4200 
TGTCAGCCAG GATTTTCAOG CAAAAGGTOT 6AAACAGAAC AGTCTACAGG CTTTAAOCTG 4260 
25 QATTTTOAAO TTTCTGGCAT CTATGGATAT GTCATGCTAO ATGOC3VT6CT COCATCTCTC 4320 
CATGCTCTAA CXITGTAOCTT CTGOATQAAA TCCTCTGACG ACATGAACTA TGGAACACCA 4380 
ATCTOCTATG CAGTTGATAA 0GGCAGC3GAC AATACCTTGC TCCTOACTGA TTATAACGGC 4440 
TGGGTTCrrr ATGTGAATGG CAGGGAAAAO ATAACAAACT GTCCCTCGGT GAATGATGGC 4500 
AGATGGCATC ATATTGCaU^T CACTTGGACA AGTGCCAATG GCATCTG6AA AGTCTATATC 4560 
30 OATOOOAAAT TATCTGAOGQ TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGOT 4620 
GGTGOGTTAG TTCTGGGOCA A0A6CAAGAC AAAAAAGQAG AGGQATTCAG CCCAGCTGAG 4680 
TCTTTTOTGO 6CTCCATAAO CCAGCTCAAC CTCTQG6ACT ATGTCCTGTC TCCACAGCAG 4740 
GTGAAGTCAC TGGCTACCTC CTGCXXAGAG GAACTCAGTA AAGGAAACGT OTTA GCATGQ 4800 
CCTGATTTCT TOTCAGGAAT TGTGGGGAAA GTGAAGATOG ATTCTAAGAG CATATTTTGT 4860 
35 TCTGATTOCC C3UX5CTTAGG AGGOTCAGTG CCTCATCTGA GAACTGCATC TGAAGATTTA 4920 
AA6CCAG0TT CCAAAGTCAA TCXGTTCTGT GATCCAGGCT TCCAGCTGOT OGGGAACXXrT 4980 
GTGC3W5TACT GTCTGAATCA AGGACA6TGG ACACAACCAC TTCCTCACTG TGAACX3CATT S040 
AGCTGTGGGQ TGCCACCTCC TTTGGAGAAT GGCTTCCATT CAGCOSATQA CTTCTATGCT 5100 
GQCAGCACAO TAACCTACCA GTGCAACAAT GGCTACTATC TATTGGGTGA CTCAAGGATO 5160 
40 TTCTOTACAQ ATAATGGQAO CTGGAAOSGC GTTTCACCAT CXTTGCCrrQA TGTCGAT6AG 5220 
TQTQCAGTTG GATGAOATTQ TAGTGAGCAT GCTTCTTGCC TGAAOGTAGA TCGATCCTAC 5280 
ATATGTICAT GTGTCCCACC GTACACAG6A GATQGQAAAA ACTGTGCAGA ACCTATAAAA 5340 
TGTAAGGCTC CAGQAAATCC GGAAAATGGC CACTCCTCAG GTGAGATTTA TACAGTAGGT 5400 
GCCOGAGTCA CATTTTCGTG TCAGQAAGGA TACCAOTTGA TGGGAGTAAC CAAAATCACA 5460 
45 TGTTTGQAGT CTGGAGAATG GAATCATCTA ATACCATATT GTAAAGCTGT TTCATOTQGT 5520 
AAACOQQCTA TTCCAGAAAA TOGTTGCATT GAGGAOTXAO CATTTACTTT TGQCAGCAAA 5580 
GTGACATATA GGTGTAATAA AGQATATACT CTQ G COGGIG ATAAAQAATC ATCCTGTCTT 5640 
GCTAACAGTT CTTGQAGTCA TTCCCXTTCCT GTGTGT6AAC CAQTQ AAGT G TTCTAGTCCG 5700 
QAAAATATAA ATAATGQAAA ATATATTTTG AOTGGQCTTA CCTACCTTTC TACTGCATCA 5760 
50 TATTCATOOO ATACAGGATA CA0CTTAC3W GGCCCTTCCA TTATTGAATG CACGGCTTCT 5820 
GOCATCTGOO ACA0AG06CC ACCXQOCTQT CACCTOOTCT TCTGTGGAGA ACCACCTGCC 5880 
ATCAAAGATO CTQTCATTAC OGGGAATAAC TTCACTTTCA GGAACACCGT CACTTACACT 5940 
IGCAAAGAAO GCTATACTCT TGCTGOTCTT GACACCATTQ AATGCCTGGC CGACGGCAAG 6000 
TGGAGTAGAA GTGACCAGCA GTOCCTGGCT GTCTCCTGTO ATGAGCCACC CATTGTGQAC 6060 
55 CACOCCTCrC CAOAGACTGC CCATCXMCTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 6120 
GATOGTTACA GCCTA6CAGA CAATTCCCA6 CTTCXCTGCA ATGCCCAGGQ GAAGTGGGTA 6180 
CCOCCSUSAAO CnCAAGACAT GCOCOGTTGT ATA6CICATT TCTGTGAAAA ACCT CCATOG 6240 
GrrrCCTATA GCATCTTGGA ATCTGTGAGC AAAGCAAAAT TTGCAGCTGG CTCAGTTGTO 6300 
AGCTTTAAAT OCATGGAAGG CTTTGTACTG AACACCTCAG CAAAGATTGA ATGTAT6A6A 6360 
60 GOTOGGGAQT GGAAGCCTTC 0CXX31TQTCC ATCCAGTOCA TCCCTGTGCX3 GTGTGGAGAO 6420 
CCAOCAAOCA TCATGAATOQ CTATGCAAGT GGATCAAACT ACAGTTTTGG AOCCATGGTG 6480 
OCTTACAOCT GCSVACAAiOGO GTTCTACATC AAAOQGQAAA AGA AQAgC AC CTOCQAAGCC 6540 
ACAGGGCAGT GGAGTAGTCC TATACOGAOG TGCCACXX3GG T ATCTT GTGG TGAACCACCT 6600 
AAGGTTGAGA ATQGCTTTCT GGAOCATACA ACTGGCAGGA TCTTTGAGAG TGAAGTGAGG 6660 
65 TATCAGTGTA ACCOKSGCTA TAAGTCAOTC GGAAGTCCTG TATTTGTCTG CX^AGCCAAT 6720 
CGCCACTQOC ACAGTOAATC CXCTCTGAIG TOTGTTOCTC TOGACTGTGO AAAACCTCCC 6780 
COOATCCAGA ATGGCTTCAT OAAAGGAOAA AACTTTOAAG TA6GGTCCAA GGTTCAGTTT 6840 
TTCTGTAATO AGGGTTATGA OCrTGTTGGT GACAGTTCTT OGACATGTCA GAAATCTGGC 6900 
AAATGGAATA AQAAGTCAAA TOCAAAGTGC ATGCCTGCCA AGTGCCCAQA GCCGCCCCTC 6960 
70 TTGGAAAACC A6CTAGTATT AAAGQAGTTG ACCAC06AGG TAGQAGTTGT GACATTTTCC 7020 
TGTAAAOAAO GGCATGTCCT QCAAGGOCOC TCT UTOCTO A AATOC TTGCC ATCCCAGO ^ 7080 
TG6AAT6ACT CTTTCCCTGT TTGTAAOATT OTTCTTTOTA OOOCACCTCC CCTAA TTTC C 7140 
TTTGQTGTCC CCATTCCTTC TTCTGCTCTT CATTTTGGAA GTACTGTCAA GTATTCTTGT 7200 
QTAOOTGOOT TTTTCCTAAQ AGGAAATTCT ACCACCCTCT GCCAACCTOA TGGCACCTGO 7260 
75 AGCTCTCCAC TQCCAGAATQ TQTTCCAGTA GAATGTCCCC AACCTOAGOA AATCCCCAAT 7320 
OGAATCaVTTO ATOTGCAAGO CCTTGCCTAT CrGAOCACAO CTCTCTATAC CTGCAAGCCA 7380 
GOCTTTGAAT TGGTGGGAAA TACTACCACC CTTTGTGGAO AAAATGGTCA CTGGCTTGQA 7440 
OQAAAACCAA CATOTAAAOC CATTGAGTGC CTQAAACCCA AGGAGATTTT GAATGGCAAA 7500 
TTCTCTTACA 006ACCTACA CTATGGACAG ACOGTTACCT ACTCTTGCAA CCGAGGCTTT .7560 
80 OGGCTOSAAO GTCCCAGTGC CTTOACCTGT TTAGAOACAQ GTGATTGGQA TGTAG ATGCC 7620 
CCATCTTOCA AtGCCATCCA CIGT6ATTCC CCACAAOCCA TTGAAAATGG TTTTG TAGAA 7680 
GGTOCAOATT ACAGCTATQG TGGCATAATC ATCTACAGTT OCTTCCCTGG OTTTCAGGTG 7740 
GCXaJTCATO CCATGCAOAC CTGTGAAGAO TCAOQATGQT CAAQ TTCC AT CXX»ACATGT 7800 
ATGCCAATAO ACTGICGCCI CCCTCCICAT ATAGATTTTG OAOACTGTAC TAAACTCAAA 7860 
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QATQACCAGG OATATTTTGA GCAAGAAGAC OACATGATGG AAGfTTCCATA TGTGACTCCT 7920 

CACCCTCCTT ATCATTTGGG AGCAOTGGCT AAAACCTGGO AAAATACRAA GQAGTCTCCT 7980 

OCTACACATT CATCAAACTT TCTGTATGGT ACCATGGTTT CATACACCTQ TAATCCACGA 8040 

TATGAACTTC T0GGC3AAGCC TGTGCIOATC TGCCAGOAAO AIGOAACTTG GAA TGGCAGT 8100 

GCACCATCCT GCATTTCAAT TGAATGttSAC TTGCCTACTO CTCCIOAAAA TGGCTTTTTG 8160 

CGTTTTACAG AOACTAGCAT GGGAAGTGCT GTGCAOTATA GCTOTAAACC TGGAC ACAT T 8220 

CTAGCAGGCr CTGACTTAAG GCTTTOTCTA GAGAATAGAA AQTGGAGTGQ T6CCTCCCCA 82 80 

OQCTGTQAAG CCATTTCATO CAAAAAGCX3V AATCCAGTCA TGAATGGATC CATC AAAGGA 8340 

AGCAACTACA CATACCTGAG CAOSTTGTAC TATGftaTGTO ACOOCSGOATA TGTGCTGAAT 8400 

GGCACT6AGA G6A6AACAT0 CCAGGATGAC AAAAACT66G ATGAGGATGA GCCCATTTGC 8460 

ATTCCTOTGG ACTGCAGTTC ACCCCCAGTC TCAGCCAATQ GCCAGOTGAG AGGAGACX5AG 8S20 

TACACATTCC AAAAAGAGAT TGAATACACT TGCAATGAAG GGTTCTTGCT TQAGGGAGCC 8580 

AGQAGTOGGG TTTOTCTTGC CAATGGAAGT T06AGTGGAG GCACTCCOGA CTGTGT6CCT 8640 

GTCAGATGTG CCACCCCGCC ACAACTGGCC AATOGGGTOA 0GGAA6GCCT GGACTATGGC 8700 

TTCATGAAGO AAOTAACATT CCACTGTCAC GA6GGCTACA TCTTGCAOGG TGCTCCAAAA 8760 

CTCACCTGTC AGTCAGATGG CAACTGGGAT GCAQAOATPC CTCTCTQTAA ACCAGTCAAC 8820 

TGTGGACCTC CTQAAOATCT TGCCCATGGT TTCCCTAATQ GrTTTTCCTT TATTCATGGG 8880 

GGCCATATAC AGTATCAGTO CTTTCCTGGT TATAA6CTCC ATGGAAATTC ATCA AGAAG Q 8940 

TGCCTCTCCA ATGGCTCCTO GA0TQGCA6C TCACCTTCCT GCCTGCCTTO CAOATGTTCC 9000 

ACACCAGTAA TTOAATATGG AACTGTCAAT GGQACAOATT TTGACTGTQO AAAGOCAOCC 9060 

OGGATTCAGT GCTTCAAAGG CTTCAAGCTC CTAOQACTTT CTQAAATCAC CTGTQAAGCC 9120 

GATGGCCAOT GGAGCTCTGG GTTCCCCCAC TOTGAACACA CTTCTTGTGG TTCTCTTCCA 9180 

ATGATACCAA ATGCGTTCAT CAGTGAQACC AGCTCTTGGA AGQAAAATGT GATAACTTAC 9240 

AGCTGCAGGT CTGGATATGT CATACAAGGC AGTTCAQATC TGATTTGTAC AGAGAAAGGO 9300 

OTATOGAQCC AGCCTTATCC AGTCTGTGAG CCCrTGTCCT 6TGQQTCCCC ACOGTCTGTC 9360 

GOCAATGCAG TGGOU^CTGQ AQAGGCACAC ACCTATGAAA GTGAAGTGAA ACTCAGATGT 9420 

CTQGAAGGTT ATAOGATGGA TACAGATACA GATACATTCA CCTOTCAQAA AGATGGTOGC 9480 

tGGTTCCCTG AGAGAATCTC CTGCAOTCCT AAAAAATGTC CTCTGC06GA AAACATAACA 9540 

CATATACTTG TACATGGGGA CGATTTCAGT GTGAATAGGC AAGTTTCTGT QTCATGTQCA 9600 

GAAGGGTATA CCTTTGAGGG AGTTAACATA TCAQTATGTC A6CTTGATGG AACCTGGGAO 9660 

CCAOCATTCT COGATQAATC TTGCAGTCCA GTTTCTTQTG GGAAACCTGA AAGTCCAGAA 9720 

CATGQATTTG TGGTTGGCAG TAAATACACC TTTGAAAQGA CAATTATTTA TCAGTGTGAG 9780 

CCTGQCTATG AACTAGAGGO OAACAGQQAA CGTGTCTGCC AGGAGAACAG ACAGTGGAGT 9840 

GQAGQGGTGG CAATATGCAA AGAGACCAGG TGTGAAACTC CACTTQAATT TCTCAATG6G 9900 

AAAGCTGACA TTGAAAACAG GAC6ACT0QA CCCAACGTGG TATATTCCTG CAACAGAGGC 9960 

TACA6TCTTG AAOGGCXATC TGAGGCACAC TGCAC3VGAAA ATGGAACCTG GAGCCACCCA 10020 

GTCCCTCTCT GCAAACX3VAA TCCATGCCCT OTTCCTTTTG TGATTCCCGA GAAT GCTCTG 10080 

CTGTCTGAAA AOGAGTTTTA TGTTGATCA6 AATGTGTCCA TCAAATGTAQ GQAAOGTTTT 10140 

CTOCTGCAGG GCCAOGGCAT CATTACCTGC AACCCCGAOG AQACXSTGOAC ACAGACAAGC 10200 

GCX3UUITGTO AAAAAATGTC ATGTGOTCCA CCAGCTCAOG TAGAAAATGC AATTGCTOGA 10260 

GGCJOTACATT ATCAATATG6 AGACATGATC ACCTACTCAT GTTACAOTGG ATACATGTTQ 10320 

GAG6GTTTCC TGA6GAGT6T TTGTTTAOAA AATGGAACAT GGACATCACC TCCTATTTGC 10380 

AGAGCTGTCT QTOQATTTCC ATGTCAGAAT GGGGQCATCT GCCAAOGCCC AAATGCTTGT 10440 

TCCTGTCCAG AGGGCTGGAT GGGGCGCCTC TGTGAAQAAC CAATCTGCAT TCTTCCCTGT 10500 
CTGAACGGAG GTCGCTGTGT GGCCCCTTAC CAOTGTGACT GCCOGCCTGG CTQGACGGGG 10560 
TCTOQCTOTC ATACAGCTGT TTGCCAGTCT G0CT6CTTAA A TQQTQ GAAA ATGTGTAA6A 10620 
CCAAACCGAT GTCACTGTCT TTCTTCTTGG AOGOGACATA ACTGTTCXAG GTAA 

Seq ID NO: 430 Protein aequence 
Protein Accession #: FGSNSSK predicted 



1 11 21 31 41 SI 

I 1 i i i I 

MWPHIiAPCCW GliALVSGWAT FQQMSP8RNP SPRLPPBTAP GAPOSIPAPP APGDEAAGSR 60 

VERLQQAFRR RVXUjLRELSB RLELVFLVDD SSSVGEVNPR SBLMFVRKIiL SDFPWPTAT 120 

RVAIVTF8SK NYVVPRVDYI 8TRRARQHKG AIiLLQBIPAX SYKGGGTrrX GAFQQAAQIL 180 

XiBAREarSTKV VFLXTDGYSN OGDPRPIAAS XiRDSGVETFT FGIHQGMIRB liNDMASTPKS 240 

EHCYLLBSFE BFEAI«ARRAL HEDLP8GSPI QDDMVRCSYL CDB6KDCCDR MGSCKCGTHT 300 

GHPECICEKO YYGKGIiQYEC TACPSGTYKP EGSPGGISSC IPCPDENHTS PPGSTSPEDC 360 

VCREGYRASG QTCBLVHCPA LKPPBNGYPI ONTOINHFIIA ACX3VRCBPGP DIjVGSSIILC 420 

LPNGUHSGSB SYCRVRTCPH LRQPKHQHIS CSTRENLYKT TOiVACDEGY RIiEGSDKLTC 480 

QGNSQWOGPE PRCVERHC8T FQMPKEVIZS PHNCGKQPAK FGTXCYV8CR QGFXLSGVKE 540 

MLRCTTSGKM NVGVQAAVCK DVEAPQXNCP KDIEAXTLEQ QDSAMVTWQX PTAKDNSQEK 600 

VSVHVHPAPT PPYLPPIQDV AIVYTATDLS GMQASCIPHX KVIDAEPPVX DWCRSPPPVQ 660 

VSBKVHAA6W DEPQFSDNSG ABLVITRSKT QGDLFPQGET IVQYTATOPS GNNRTC3>IHI 720 

VIXGSPCBXP FTFVN6DFXC TPDNTGVNCT LTCLBSSDFT EGSTDKYYCA YED6VWKPTY 780 

TTBWFDCAXX RFAMHGFKSF EMFYKAARCS DTDUQCKFSE ATBTTXiGKMV P8FCSDAEDI 840 

DCRLEENLTK KYCLEYNYDY ESGFAIGPGG IfGAANRLDYS YDDPIiDTVQB TAT8IQNAKS 900 

aHIKRSAPLS DYKXKLZFKl TASVPLPOER NDTZiEWEKQQ RWrLETIT MKLKRTUIXD 960 

PMY8PQLASB XLXADSHSIjB TKXASPFCRP GSVZiRGRMCV NGPXXaTYYNL BHPTGESCRl 1020 

GSYQDEEQQXi BCKLCPSGMY TBYXKSRIIXS DCKAQGKQGT YSYSGLBTCB 8CPLQTYQPK 1080 

POSRSOiSCP ENTSTVKRGA VNISA06VPC PEGKFSRS6L KPCBPCPRDY YQPNAGKAFC 1140 

LACPFYGTTP FAGSRSITEC STSVLMITIP GGFGHLELLN CPSBVFHBCP FKPCHII8GTC 1200 

QQLGRGYVCL CPLGYTQLKC BTD1DBC8PL PCmNGVCKD LVGEPICECP SGYTGQRCEE 1260 

NIMBCSSSPC LNKGXCVDOV AOYRCTCVKQ FVGLHCETEV NEOQSMPCLN NAVCEDQVGG 1320 

FXfCKCPPGFli GTRCOXNVDB CLSQPCKIIGA TCXDGANSFR CLCAAGFTGS aCBUTINECQ 1380 

SMPCRNQATC VDBUISYSCK 0QP0F8GKRC BTBQSTGFNL DFBV6GXYGY VMLDGMLPSL 1440 

HALTCTFWMK SSDDMNYGTP ISYAVDMGSD NTLLLTDYNG WVbYVNGREK Z TKCP SVNDG 1500 

RWHHIAITWT SANGZWKVYl DGKLSDGOAQ L8VGLPIPGG GAIjVU^QEQD KKGBGP8PAE 1560 

SFVQSI8QUI LWDYVLSPQQ VKBLATSCPB ELSKGNVLAW PDPLSGIVGK VKXDSKSIFC 1620 

SDCPRXiGGSV PHIiRTASEDZ* KPGSXVNIiFC DPGFQLVQ2IP VQYOjNQGQW TQPLPHCERI 1680 

SOQVPPPIiEN OFHSADDFYA aSTVTYQQlII GYYLLGDSRM FCTDZIGSKNQ VSPSCLDVDB 1740 

CAVGSDC8EH ASCU9VDGSY ICSCVPPYTG'DGKHCABPXK C3CAPGNPBI0 BSSGBIYTVG 1800 

AaVTFSOQEO YQLMSVTKXT CbBSQEWNBL XPYCKAVSOQ KPAIPBNGCX BBLAFTPGSX 1860 
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VTYROIKGYT lAODKBSSCL AHSSHSHSPP VCEPVKCSSP ENINNGKYIL SGLTYLSTAS 
YSCDTOYSLQ GP8IIBCTAS GIHDRAPPAC HLVFCGBPPA IKDAVITOIK PTFRNTVTVT 
CKBGVTLAGL DTIEOADGK WSRSDQQCLA VSCDBPPIVD HASPETABRI* FGDIAFYYCS 
DGYSLADNSQ LLQ7A0GKMV PPBGQDMPRC lAHFCBKPPS VSYSILSSVa KAKFAAGSW 
SPKCWBOFVL NTSAKIBCMR GGQWNPSPMS IQCIP VROG B PPSIMMOYAS GSMYSFOAMV 
iVySQIRaFYX XGBXKSTCBA TGQWSSPIPT CHPVSOGBPP KVENOPLEHT TGRIPESEVR 
YOOTPOyKSV G8PVFV0QAN RHNH5ESPLM CVPLDOGKPP PIQNGPMKOB NFEnraSKVQP 
PC3IEGYBLV0 DSSWTOQKSO KWNKKSNPKC MPAKCPBPPL I*nJQLVljKBL TTEVGWTPS 
CKEGHVLQGP SVLKOiPSOQ HNDSPPVCKI VLCTPPPWS FGVPIPSSAL HFQSTVKYSC 
VGGFFLRGtlS TTLOQPDGTW 8SPLPBCVFV ECPQPEBIPN GUDVQGljAY LSTAIiYTCKP 
GFELVGNTTT LOGBNGKWIiQ GKPTdCAIEC LKPKBIl^GK PSYTDLHYGQ TVTY8CNRGP 
RiaGPSALTC IiBTGDNDVDA PSC3IAIHCDS PQPIQIQFVB GADYSYGAII lYSCFPOFQV 
AGHAMQTCBB SGWS9SIPTC MPIDOGLPPH IDFGDCTKLK DDQGYFBQBD DMMBVPYVTP 
HPPYHIJaAVA KTWENTKEfiP ATHSSNFLYG TMVSYTCNPQ YEIjLGNPVLI CQBDGTWNGS 
APSCISIBCD LPTAPENGFL RFTBTSM6SA VQYSCKPCTI LAGSDLRIiCI* QTOKWSGASP 
RCEAISCKKP NPVMHQSXKG SNYTYLSTLY YBCDFGYVLN GTERRTOQDD KMWDEDEPIC 
ZPVDC88PPV 8AKGQVRGDB YTFQKBIBYT OIEGPLLEOA RSRVCLANGS WSQATPDCVP 
VRCATPPQLA MOVTEOLDYG FMKEVTPHC3f BGYILHGAPK LTCQSDGNWD ABIPLCKPVN 
CGPPEDUAHG FPNOFSFIHG GHIQYQCFPO YKXiHONSSRR CLSNGSWSGS SFSCLPCRCS 
TPVIBYGTVN GTDPDOGKAA RIQCPKGPKL LGLSHITCKA D6QWSS0PPH CEHTSCGSLP 
MIPKAPISET S8HXBNVZTY SCRSOYVIQQ SSDLICTSXO VWSQPYPVCE PLSCGSPPSV 
AMAVAT6BAH TYBSBVKIiRC LBOYTMDTOT DTFTOQKDGR MPPBRISCSP KKCPLPEHIT 
HILVHGDDFS VNRQVSVSCA EGYTFBGVNI SVCQLDGTWE PPPSDESCSP VSCGKPESPE 
HGFWGSKYT FBSTIIYQCE PGYELBGNRE RVCQENRQWS GGVAICKBTR CBTPLEFLNG 
KADIEKRTTG PNWYSCNRa YSLEGPSBAH CTENGTWSHP VPLCKPNPCP VPFVIPEMAL 
LSERBPyVDQ NVSIKOIBGF IiLQGHOXZTC WP D B llfT Q TS AKCEKI8CGP PAHVEtlAIAH 
GVBYQYGDMI TYSCYSGYMIi BGFLRSVCLB NGTHTSPPXC RAVCRFPCQll GGICQRFNAC 
SCPEOWMGRL CBBPICXI«PC UIGGRCVAFV QCDCPFGHTG 8RCHTAV0QS POiHGGKCVR 
FNRCHC3*SSH TGHNC8R 

8eq ID NOt 431 DWA eequence 
Nucleic Acid Accesaion #i FGENBSH predicted 
Coding sequence! 1..390 

1 11 21 31 41 51 

11)111 

ATQAGGTTCA OTGTCTCAGO CATGAGGACC QACTACCCCA GGAGTGT6CT GGCTCCTGCT 60 

TATGTGTCAG TCTGTCTCCT OCTCTTGTQT CCAAGG6AAG TCATOGCTCC CGCTGGCTCA 120 

GAACCATGGC TQTOCCAGCC GGCACCCAGG TGTGGAQACA AGATCTACAA CCCCTTGGAG 180 

CAGTGCXGTT ACAATGAOGC CAT03T0TCC CTGAGOSAQA 0CC3GCCAATG TX3GTCCCCCC 240 

TGCACCTTCT GGOCCIGCTT TGAGCTCTGC TGTCTTGATT OCmOQCCT CACAA AQGAT 300 

TTTOTTGT Q A AGCTGAA6GT TCAGG0T6T0 AATTGCCAGT GCCACTCATC TGCCATCTOC 360 
AGTAAATGTO AAAGAGGCOG QATATGTTAQ 

8eq ID NO: 432 Protein Bcquence 
Protein Accession #i FGENBSH predicted 

1 11 21 31 41 51 

MRF8VS(a4RT DYFRSVIAFA YVSVCLIiZ<LC PREVIAPAGS EPHLOQPAPR OGDKIYMPLB 60 
QOCniDAIVS IiSBTRQOGPP CTFWPCPELC CLDSPGLIMD FWKLIWQGV MSOOISSPIS 120 
SKCERGRXC 

Seq ID NO: 433 DNA sequence 

Nucleic Acid Accession #: MM_007231.1 

Coding sequence I 89.. 2017 

I 11 21 31 41 51 

i I i 1 1 I 

TAGGAACAGG GGAGAOTGCA CCTGCTACCA GTCAAGCTCA GCCAGACTGC AAGAG GAGGC 60 

QAGGOGOAGC CAGCG6AGGG MTOMOCKr GGACAAGTTO AAATGGOGQA OTTTCTTCAA 120 

GTGCAGGGAG AAGQAOAAAO TQTCGGCTTC ATCAGAGAAT TTCCATGTTO QTOAAAAtOA 180 

TCAGAATCAQ GACOGTGGTA ACTGGTCCAA AAAATOGGAT TATCTTCTAT CTATGATTGO 240 

ATACGCAGTG GGATTAGGAA ATGTGTGGAQ ATTTCCATAT CTOACCTACA QCAATGGT GG 300 

AGGGGCCTTC TTGATAOCTT ATGCAATTAT OTTAGCATTG GCltjO T TTAC wm\»i ~xCrT 360 

TCIG6ASTQT TCACTGOQAC AATTTGCTAQ CTTAOGTGCA STT TCAQTTT QQAOGA.TTCT 420 

'iVCAV TO TTT CAAGGTGTOG GAATTACRAT GOTCCTGATC TCCATTTTTG TOACAATCTA 480 

TTACAATGTC ATAATTGCCT ATAOTCTTTA CTACATGTTT GCTTCTTTTC AAAGTOAACT 540 

ACCATGGAAA AA T T OT TCTT OGTGGTCAOA TAAAAACTOT AGCAOATCAC CAATAOTAAC 600 

TCACTGTAAT GTGAGTACAG TGAATAAAGG AATACAAGAG ATCATOCAAA TGAATAAAAO 660 

CT6GQTAQAG ATCAACAATT TTACCTGCAT CAA06GCAGT GAAAmATC AGCCASG6CA 720 

GCTTCCCAGT GAACAATATT GGAATAAAOT 6GGQCT0CAA OGGTOUVGTG GAATG AATGA 780 

GACTGGAGTA ATTQTTTGQT ATTTAGGACT TTOTCTTCTT CTGGC TTGO C TC3VTA GTTGG 840 

AOCAGCACTA TTTAAAGCSAA TCAAATGGTC TGQCAAGOTO GTATATTTTA CAGCTCTTTT 900 

OCCCTATGTG GTCCTACTCA TCCTGTTAGT AOSAGGTGCA ACTCTGGAGG GTGCTTCAAA 960 

AG6CATTTCA TACTATATTG GAGO0CA6TC AAATTTTACA AAACTTAAQQ AAOCTGAG GT 1020 

ATGGAAAGAT GCTGCCACTC AGATATTTTA CTOCCrTTCA GTGGCTTGGQ GTQGCTTOOT 1080 

TOCTCTATCA TCTTACAATA AGfTTCAAAAA CAACTGCTTC TCT QATG CCA TTGTQOrTTQ 1140 

TTTGACAAAC TGTCTCACTA GCGTGTTTGC TGOATTTGCT ATTi'lilCTA TAT TGGGAC A 1200 

CATGGCCCAT ATATCTGQAA AGGAAOTTTC TCAAOTTGTA AAATCAOOTT TTGATTTGGC 1260 

ATTCATTGCC TATCXAGAGO CTCTAGOOOV ACTOCCAOOT QGTGCAlTrT GGTCCATATT 1320 

ATTTTTTTTC ATGCTTTTAA CTTTGGQTCr OS A TTCTC A G TTTGCTTOOA TTOAAAOOAT 13 BO 

CACAACAACA ATTCAAOATT TATTTCCCAA AOTQATGAAO AAAATQAOOO TTCCCAtAAC 1440 

TTT OGG CTGC ' IQCnwm TGTTTCTCCT TQGTCIOOtC TGTGTOACTC AGGCIGGAAT 1500 
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TTACICGOTT CATCTOATTO ACCACTTCTO TGCTGQAT60 GGCATTTTAA TTGCAGCTAT X560 
ACTGGRGCTA GTTGQAATCA TCTGGATTTA TOGAGGGAAC AOATTCATTG AGGATACAGA 1620 
AATCATGATT GGAGCAAAGA GGTGGATATT CTOOCTATGG TOGAOAGCTT GCTGGTrTOT 16 BO 
AATTAOGCCT ATCCTTTTGA TTGCAATATT TATCTGGTCA TTGGTGCAAT TTCATAGACC 1740 
TAATTATGOC GCAATTCXAT ACCCTGACTG GGGAOTTOCT TTAGGCTGGT GTATGATTGT 1800 
TTTCTOCATT ATTTGGATAC CAATTATGGC TATCATAAAA ATAATTCAOO CTAA AGGAAA 1860 
CATCTTTCAA OOCCTTATAA GTT6CTGCA0 AC3CAGCTTCT AACT SGGGT C CATACCTGQA 1920 
ACAACATCGT GGGGAAAGAT ATAAAGACAT GGTAGATCCT AAAAAAGAGG CTQACCATOA 1980 
AATACCTACT GTTAGTGGCA OCAGAAAACC GQAATQAGAT CTCATTQAAA AAAATATATG 2040 
ATTGTATAAT OTQATTTTTT TTAGAATAGG GGOAAOCTTA TTTAT TTGTG TGTTAACTOA 2100 
ATAGGAAAAT GTACATACTA TOTTCATGAT AGTGTQATTT TTTTCAGATT TAAOC A OGAA 2160 
TGCAATATAA AAAT6TGAAT CTCTTAATTC TCA6CCATGT GCTTATTATA TTTCTTTTTA 2220 
GATTGTCTAT CTGTATAACA CACACACACA CACCTAAGAG TCTCTATTTC ACAATTATAT 2280 
TTTTCTAAAT AQTATATGCA TTTTTAATAC ATTGGAQGCT TTATTTTGAA CTAATTTCTT 2340 
AQAGAATAGT TATATTTTCT ATTACACAAG TTTAAAAATA TTATTAACTT QTATTTTCTT 2400 
AATATACAAT CTATCTTTTC CAGAAATATO AGTGGGAAAT AAATCAGCAC ATTTGAAAGA 2460 
AAGTGTTAAA ACTGAAGGCC TCACTTAATT AOAAAOGTGA TAAATATATG GACAAATGGA 2520 
CTATACATAC TATAAGAGGA CTGTAGTTTA ATACTTTTTA CCCAAATATG TTTAAAAACA 2S80 
T06TGCATTT QTTACAGCTC ATGTTTTCTA TATGAACTTA GTCATTAATG TTCTTTATAA 2640 
AAAQTGAAAT AAGATGGAAA AATTAGGATC CTACA6CCAG TACCTGATAA ATCTAG AAAA 2700 
TTOAGTTTTG AGTACCTCTT TTCCCATATA CAATCTTOCT TCCTT AflGTA ATTTG6AAGA 2760 
AAACTATOAC CCATTTAATT TCTATTGTGT TTCACCAAAT TCAGTGTTGT TCATTATACC 2820 
TCTCTGAAAT ATAGGTTTAA TTTCAAATAG AATATGOACT TAAATGTTAA TG AGAAAC TG 2880 
GCTTTAATCA ATTCTAOCAT TTTATTACTO TAATACAGGQ CTQATAGAGT GATTTTGTCT 2940 
TATATGAGTC AOTTACTACT TACAOGTGAT AACTTGCATA CTATTGGAAO ATAAAOTTGT 3000 
CRAACTTGTC AAGAATGAGA AAAGCCAAAT TAGAAAATCC TATGTOCTAG TTTCCTTACC 3060 
AAGOATAATT AAATATATCA CTAAGAOCTT TATATATTGA TTATATATTG T TQACAA CTG 3120 
GTTTAAQCAT CATAGCCTAT GATGATAAAC ACTGCCTATA TATGTAAATA GCTTTTCATC 3180 
AATTCTTAAA TTTCTTAACC TAGGCTTCAG GQAGCATATG AAACCAAAAT TATATGOAAC 3240 
ATTTTCTGTG TGTACATGTA CATGCATTTT TCTAGGGAGA QAGTCOGTAG GTTTAT CAG A 3300 
ATATCAAGGA AAACTGTGAC CCAAAGAAQT TTAAOAATCA CATACAGT6C TGCTGaCTTT 3360 
TTGTGCTTGG CAAATOAGTG ACAATAQAAQ AAATAATTTT TCTTACACAT TTTAAAAOQT 3420 
TTTCTCTTOC TTOTGATTGA AGATGAAAGG AOTAAGAAAT TAAGGCATTT GTTTAATTTA 3480 
TACTGGTAAC TTATTTAGGG GGGAGGQQAC ATGAAGGTAG GTAAATAGGT AGQCCTCTAA 3540 
TTGAACCACC TCTCTAAGTT ATGTACXrTAT ATATAAGCTG AAATTGTGTT TGACATTCTQ 3600 
AGGQTTTTCT TTTTCTTTTT CCTTTTTTTT rmTTT GG T GGGGGGCTGG OGGTCAGAGT 3660 
CTTCTTCIGT TGCXnX3GGCT GQAGTGCAGT G6CATGATCT CAGCTCACTO CAAC!CTCTGC 3720 
CTTCIGOATT CAAQTQATTC TCCTGCCTCA GCCTCTTGAG TAGCTGGGAC TACAGOTQCC 3780 
OOCCAOCACA CCAGCTAATT TTTGTATTTT TAGTAGAGGC GAAGTTTCCC CATCTTGGCC 3840 
AGGCTGGTCT TGAACTCXXS3 ACCTCAACTG ATCTGTCTAC C TCGGCCTCC TAAAGTGCTG 3900 
AGATTACAGG TCTGAGCCAC CGTGCCCGGC CCATTCTAAG GGrPTTCTTT GA AGACA OGT 3960 
C3UUV.TGCTGT TAGTAAGTTT CAGGAGATTG TTAATTCCTC AOTTATACCA GATTTTATAA 4020 
AATATTTOAO AATAGAT6GC TAACAAOAGG TTAGAAATAC TTTTCCTTAA TTTTAATCCA 4080 
CAOTATGTTA CATGCATTCT ACCACIACAT TTTGGTGCTA TTTAAGGTGT GCAATTTTCT 4140 
ATAGGTGACT TTTQCAATTC AGGQAAGATT TGGGCATATT AAATGAAAGA ATAXCTAATT 4200 
GGGGGAGGTG TGAAGGGAAA GAAATTCTTT TCAAAAGCTG ACCACAAAGA GTAOTTAAAA 4260 
GTTTTTOTCA CTATCTTCAC AAGTGTGTAA AGCACAGATT TCAACAGAGT GCTTG6CATA 4320 
TTGTAGGGTG CTCAATGOTQ GTTTTTATTA TTATTACTCA GATTCCACAQ TGGCAAGAAA 4380 
CATCATTCTA CATAATOGAA AACATTTACA TCAAATCCCA CTTACTTTAA TGCGAACTTQ 4440 
GAGATAATTT AIGGTATTGT ATTOTAAACC ATTAATOAAA ACTTTTTCAC AGTTGAGTCA 4500 
AATTAAAATC ACTATATCTC 



Seq ID NOi 434 Protein sequence 
Protein Aceesaion ffi NP_009162.1 

1 11 21 31 41 51 

ilDKLKCPSFF KCREKEKVSA 8SENFHVGEM DBKQDRGNWS KKSDYLLSMI GYAVGLGNVW 60 

RPPYLTYSNG GGAFLIPYAI MLALAGIiPLF FliECSLGQFA SLOPVSVWRI LPLFQ6VGIT 120 

MVLISIPVTI YYNVIIAYSL YYMPASPQSE IJ>WKNCSSW8 DKNCSRSPIV THCNVSTVNK 180 

GIQBIIQMNK SWVDHJNFTC INGSBIYQPG QLPSBQYWNK VM^RSSGMN BTGVIVWYLA 240 

liCLLLAWLIV GAALPKQIKS SGKWYPTAL PPYWLLILL VRGATLBGAS KOISYYIGAQ 300 

SHFTKLKBAB VWKDAATQIP YSLSVAHGGL VALSSYNKPK NNCPSDAIW CLTNCLTSVP 360 

AGFAXFSIUS HMAHISGKBV SQWKSGFBL APIAYPBALA QLPGGPFWSI LFFFMLLTLG 420 

LDSQFASIET ITTTIQDLFP KVMKKMRVPl TLGCCLVLFIi LGLVCVTQAG lYMVHIilDHP 480 

CAOWGILIAA ILBLVGIIWI YGCaiRPIEDT EMMIGAKRWl FWLWWRACWP VITPILLIAI 540 

FIWSLVQFHR PNYGAIPYPD WGVAMWOG VFCIIWIPIM AIIKIIQAKG NIFQRLISCC 600 
RPASHHQPYIi EQHR6ERYKD MVDPKXBADH EZPTVSGSRK PE 



Seq ID NOj 435 DNA Bequence 
Nucleic Acid Accession ft: H1872e.l 
Coding sequence* 51.. 1085 

1 11 21 31 41 51 

GQAGCTCAAG CTCCTCTACA AAGAGGTGGA CA6AGAAGAC AGCAGAGACC ATGGGACCOC 60 

CCTCA6CCCC TCCCTGCAGA TT6CATGTCC CCTGGAAGGA OGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGQAACCCA CCCAOCftCTO CCAA6CTCAC TATTGAATCC ACOCCATTCA 180 

ATGTCQCAOA 0GG0AAG6AO 6TTCTTCTAC TOGCCTACAA CCTGCCCCAG AATCQTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAQAGTGG ATGGCAACAG TCTAATTGTA GOATA TGTAA 300 

TAGGAACTCA ACAAQCTACC CX»GGGCCCG CATACAGTGG TCOAGAOACA A TATAC COCA 360 

ATOCATCCCT GCTCATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT AC CCTAC AAQ 420 

TCATAAAGTC AGATCTTOrG AATQAAGAAO CAACCX5GACA OTTCCATOTA TACC OOGAQC 480 

TGCCCAAGCC CTCCATCICC AOCAACaU^CT OCAAGCCGGT OOAGGACAAO GATQCTOTGO 540 

1126 



wo 03/042661 
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10 

15 

20 

25 

30 

35 

40 
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60 
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70 

75 

80 




> CCCAGCCATQ 
' GTGCAGTTTC 
, AGTTGTAGAA 
' CTTTATTCTA 
• TACCCTCCTA 
: TTTAAATGTC 
' ACAAAACTCA 
; CAAATGGTGG 
I 0TQAGCX3CAT 
AAGATAGATC 
i TTCCAGTCTA 
, AGTAACCTGA 
\ TTCCAATTT6 
' ACTTQTA6AQ 
. ACT 



GTTCAOAACA 
CTGCAGCrCT 
GGATCCTATO 
CTGAATOTOC 
CCAGGGQAAA 
TGOTTTATCA 
GTGAATAATA 
ACCACAGTCA 
GTCQ6CATGA 
TTOQATATTT 
ATCCCATTTT 
ATGCTGGAQA 
TGCCACTCAO 
GACXSACTTCA 
TAAGGCrCTT 
ATQATOCTGT 
AGATCTATCT 
TGACTOACAT 
CACSAGTTGGA 
CAATGOCAAA 
TGACACTTGT 
ATTAACAAAT 
TTTTA8TTQ0 
ATAGTCATAC 
TGCATGCAOC 
GAGAAATGTO 
TAACTGATAA 
TGAGCCA6TG 
CAATTAAAAA 
CTTGAGTTAG 
ACTAATCTGA 
ACAAAACCCA 
TGGTGCTGCT 



CAACCTACCT 
CCAATGOCAA 
AATOTQAAAT 
TCTATGGCGC 
ATCTGAACCT 
ATGGGACGTT 
goggatccta 
OQATGATCAC 
OGATTGGAGT 
CAGGAAOACT 
ATCCCATGGA 
TGGACAACTC 
AGACTTCACC 
CACTATGGAC 
ACCCCCTTTT 
CATTAGTATT 
TGTCAATCCC 
TAGCAO CATC 
CTTCTAGACT 
TAATAOAATT 
TGTTGAACAT 
GTGCTGCTTO 
TTTGTATCTT 
TAGTAGTCAT 
CAGGCATCAA 
TCATCAGGAG 
TAGCACTAAT 
GTGCTAAATG 
AAATTAAAAC 
CATAATACAG 
TGTTAACCAA 
CTGTTCTTGT 
TTAATTCATA 



OTGGTGGGTA 
CATGACCCTC 
ACAQAACCCA 
AGATGTOCOC 
CTOCTGCC A C 
CCAGCAATCC 
TATGTGCCAA 
AGTCTCTGGA 
GCTGGCCAGO 
GGCA6ATTGG 
ACCACTAAAA 
AATGAAAATT 
TAACTAGAGA 
AQCITTTOOC 
A A TTT G TCCT 
TCACAAGAAQ 
AACGTTTTAC 
TTTAACACAG 
CAOCTOTTCT 
GCtCCCTACC 
GGCTAAATAC 
GTTAAAATGG 
GCCTAAGGTG 
ACTC C CTG Q T 
ATAGT6AATG 
AACATCATAA 
GCTTTAAGAT 
CTACATACTC 
CAATTTAAAA 
AAGTCCCCTC 
TOTATTTATT 
ATTGTATTGC 
AATCACAAAT 



AATGGTCAQA 
ACTCTACTGA 
G06AGTGCCA 
ACCATTTCCC 
GCAGCCTCTA 
ACACAAGAGC 
GCXXATAACT 
AGTGCTCCTG 
GTQGCTCTGA 
ACCAGACOCT 
ACAAGGTCTQ 
TAAAGGGAAA 
CAGTCAAACT 
AA6ATQTCAA 
TGCTTATGCC 
TAGCTTCAGA 
ATAAAATAAG 
COGTGTOTTC 
GACTOOCTGT 
AGCT6AACAG 
AATGGGTATC 
CTACACTCAT 
06TAGTCCAA 
GTAGTGTATT 
GTCTCTCXTT 
CCCATGAAQG 
TTGGTCACAC 
CAACTGAAAT 
AAAAAAAA6A 
TACTTTAACT 
TCTQTGOTTC 
CCAGQGGGAO 
AAAAGCCAAT 



600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



Seq ZD NO: 436 Protein aequence 
Protein Accession fit AAA59907.1 

1 11 21 31 41 51 

11)111 
KGPPSAPPCR LHVPWKEVLL TASLLTFWNP PTTAKLTIES TPPNVABGKB VLUAHNLPQ 
NRIOySWYKO BRVDGNSLIV OWIGTQQAT POPAYSGRET lYPNASLLIQ NVTQNDTGPY 
TLQVIKSDLV NEEATGQPHV YPBLPKPSIS SNNSNPVBDK DAVAPTCBPE VQNTTYLWWV 
NGQSI*PVSPR LQLSNtaiMTL TI*LSVKRNDA GSYBCBIQNP ASAMRSDPVT IiNVLVOPDVP 
TISPSKANYR PGERLNLSCH AASKPPAQYS WPINGTFQQS TQELPIPHIT VHNSGSYMOQ 
AHNSATGLNR TTVTMXTVSG SAFVLSAVAT VGITIGVLAR VALI 

Seq ID NO: 437 DNA sequence 
Nucleic Acid Accession #: M1B72B.1 
coding sequence: 1355.. 1657 



60 
120 
180 
240 
300 



1 

I 

GGAOCTGAAG 
CCTCAGCCCC 
TTCTAAOCTT 
ATGTOGCAGA 
GTTACAGCTG 
TAGOAACTCA 
ATGCATCCCT 
TCATAAAOTC 
TGCCCAAGCC 
CCTTCACCTG 
GCCTCCCGGT 
GCGTCAAAAG 
ACOGCAGTGA 
CCTCAAAGGC 
ACCCACCTGC 
TCTTTATCCC 
CAGCCACTGG 
T0CTCTCA6C 
TATAGCAGCC 
GAATTCTTCT 
CTCTGCTCCT 
ACCCrCAOQC 
6CMACCATG 
AACAAGACTC 
TGOCTCTTTC 
GOGTAACTTA 
AGATCCTTTA 
AAATGTACAO 
TTTAATTCAA 
GGASGAOTCT 
GCIQAOACTA 



11 



CTCCTCTACA 
TCCCTGCAGA 
CTOGAACOCA 
GGGGAAG6AG 
GTACAAA6GC 
ACAAGCTACC 
GCTG ATCCAQ 
AGATCrTGTO 
CTCCATCTCC 
TGAACCTQAG 
CAQTCCCAGQ 
GAACQATOCA 
CCCAGTOVCC 
CAATTACCXST 
ACA6TACTCT 
CAACATCACT 
CCTCAATAGG 
TGTGGCOICC 
CIGGTOTATT 
AGCTCCTCCA 
GAAGCCCTAT 
CTGAGGTGTG 
GTGAGAAATT 
CTCATCKT6A 
GCTTGGCAGG 
ACAGAGTOTC 
GTGCACCCAG 
TOOTCCXTTT 
CCCAGOmG 

GTGCAGrrrc 

AjGTTGTAOAA 



21 
I 

AAGAGGTGGA 
TTGCATOTCC 
CCCACCACTG 
GTTCTTCTAC 
GAAA6AGTGG 
CCAGGGCCCO 
AACGTCACCC 
AATGAAGAA3 
AGCAACAACT 
GTTCAGAACA 
CTGCAGCTGT 
GGATCCTATG 
CTQAATOTCC 
GCAOGGGAAA 
TGOTTTATCA 
GTGAATAATA 
ACCACAGTCA 
GTOGGCATCA 
TTCGATATTT 
ATCCCATTTT 
ATGCTGGAGA 
TGCCACTCAG 
GACGACTTCA 
TAAG6CTCTT 
ATGATGCTGT 
AGATCTATCT 
TGACTOACAT 
GAOAOTTGQA 
CAATGCCAAA 
TQACACTTGT 
ATTAACAAAT 



31 
I 

CAGA6AAGAC 
CCTGGAA6GA 
CCAAGCTCAC 
TOGCCCACAA 
ATGGCAACAQ 
CATACAGTGG 
AGAATGACAC 
CAACCGGACA 
CCAACCCCGT 
CAACCTACCT 
CCAATGGCAA 
AAT6TGAAAT 
TCTATGGCCC 
ATCTGAACCT 
ATGGGACGTT 
GOGGATCCTA 
OGATGATCAC 
CQATTGGAGT 
CAGGAAGACT 
ATCCCATGGA 
TGGACAACTC 
AGACTTCACC 
CACTATGGAC 
ACCCCCTTTT 
CATTAGTATT 
TGTCAATCCC 
TAOCAGCATC 
CTTCTAGACT 
TAATAGAATT 
TGTTGA ACAT 
OTGCTGCTTG 



41 



AGCAGAOACC 
GGTCCTGCTC 
TATTGAATCC 
CCTGCCCCAG 
TCTAATTGTA 
TOQAaAGAGA 
AGGATTCTAT 
GTTCCATGTA 
GGAGGACAAG 
GT6GTGGQTA 
CATGACCCTC 
ACAGAACCCA 
AGATGTCCCC 
CTCCTGCCAC 
CCA6CAATCC 
TATGTGCCAA 
AOTCTCrGGA 
GCTGQCCAGG 
GGCAGATTGQ 
ACCACTAAAA 
AATGAAAATT 
TAACTAGAGA 
AQCTTTTCCC 
AATTTGTCCT 
TCACAAGAAG 
AACGTTTTAC 
TTTAACACAG 
CACCTOTTCT 
GCTCCCTACC 
GGCTAAATAC 
GTTAAAATGG 



51 
I 

ATOGQAGCCC 
ACAGCCTCAC 
ACGCCATTCA 
AATCGTATTG 
G6ATATGTAA 
ATATACCCCA 
ACCCTACAAG 
TACCCG6AGC 
GATGCTGTGG 
AATGGTCAGA 
ACTCTACTGA 
GOQAOTGCCA 
ACCATTTCCC 
GCAGCCTCTA 
ACACAAGAGC 
GCCCATAACr 
AGTGCTCCTG 
GTGGCTCTGA 
ACCAGACCCT 
ACAAGOTCIG 
TAAAGGGAAA 
CAGTCAAACT 
AAGATGTCAA 
TGCTTATGCC 
TAGCTTCAGA 
ATAAAATAAG 
O. U i'G'iGVI'C 
CACTCCCTGT 
AGCTGAACAG 
AATGGGTATC 
CTACACTCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



1127 
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5 
10 



25 



75 



CTQACTCA.TT CTTTATTCTA TTTTAOTTGO TTTQTATCTT GCCTAAGGTG OGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAJ3TAGTCAT ACTCCCTGGT GTAGTGTATT 19B0 

CTCTAAAAGC TTTAAATOTC TOCATOCAGC CAOOCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGOAATT ACAAAACTCA GAOAAATGTG TCATCAGGAQ AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGO TAACTOATAA TAOCACTAAT GCTTTAAOAT TTGGTCACAC 2160 

TCTC AOCTAQ GTGA60GCAT TGAGCCA6TG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

OTTAAGGAAO AAQATAQATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACA66A6A TTCCAGTCTA CTTOAGTTAO CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

T TTAC AAAAA AOTAACCTGA ACTAATCTGA TGTTAACCAA TQTATTTATT TCTOTG G TTC 2400 

it^mvyn'G ttccaattto acaaaacxxsv cTGTTcrroT attgtattoc cavasoGQAa 3460 

CTATCACTGT ACTT6TAGAG TGGTGCT6CT TTAATTCATA AATGACMAT AAAAGCCAAT 2520 
TAGCTCTAXA ACT 



^ ^ 8eq ID NOt 438 Protein sequence 
Id Protein Accession tf: AAA59908.1 



i IX 21 31 41 51 

1 I 1 I 1 I 

- - MDSFSQDVKT RLLZMIRUiP PFlOiSUiMPA 8FANQDDAVI SZSQEVASEQ NliTBOQIYUV 
2U NFMVLRKZRD PLVHPVTDZS SIFNTAVCSN VQHSFSBLDF 



Seq ID NO: 439 DNA sequence 
Nucleic Acid Accession #: Mie72B.l 
Coding sequencei 2370.. 2501 



1 11 21 31 41 51 

I 1 I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AQCAQAGACC ATOGGACCCC 60 

CCTCAGCCCC TCCCTQCAGA TTGCATOTCC CCrOGAAGGA GGTCCT6CTC ACAGCCTCAC 120 

DV TTCTAACCTT CXGGAACCCA CCCACCACTG CCAAGCTttC TATTOAATCC AOGCCATTCA IBO 

ATQTCGCAGA GGGGAAGGAO GTTCTTCTAC TCGCCdCAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATQTAA 300 

TAGGAACTCA ACAAQCTACC CCA6GG0C06 CATACAGTGG TCGAGAGACA ATATACCCCA 360 

^_ ATGCATCCCT GCTGATCCAQ AACGTCAOCC AOAATGACAC AGGATTCTAT AOCCTACAAO 420 

J J TCATAAAGTC AGATCTTGTQ AATGAAGAAG CAAC06GACA GTTCCATGTA TACCOGQAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCC6T GGAGGACAAG GATGCTGTGQ 540 

CCTTCACCTG TGAACCTGAQ OTTCAGAACA CAACCTACCT GTGOTGGGTA AATGGTCAGA 600 

GCCTCC5C6GT CAGTCCCAGO CT6CAQCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAO GAACQAT6CA GGATCCTAIG AATGTGAAAT ACAOAACCCA GOQAGTGCCA 720 

4U ACCGCAOTOA OCCAOTCACC CTC3AATBTCC TCTATGGCCC AQATOTCCCC ACCATTTCCC 780 

CCTCAAAGQC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTOC ACAGTACTCT TGGTTTATCA ATGGGAOQTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAOO CACT OO CCTCAATAOG ACCACAGTCA OGATGATGAC AOTCTCrGGA AQT G CT C CTO 1020 

'ID TCCTCTCAGC TGT60CX!ACC OTOGGCATCA 0GATTG6A0T GCTGGOCAGa GTGQCTCTQA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCr AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAA6CCCTAT ATGCTGGAOA TGGACAACTC AAT6AAAATT TAAAGGGAAA 1260 

AOCCTCAGOC CTaAOOTOTO TGCCACTCAO AGACTTGACC TAACTAOAOA CAGTCAAACT 1320 

DU GGAAACCATG GTGA6AAATT GAGOACTTCA CACTAT6GAC A6CTTTTCCC AAGATGTCAA 1380 

AACAA GACT C CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTQTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGa ATGATGCTGT CATTAGTATT TCACAAGAAQ TAGCTTCAGA 1500 

GQGT AACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AAOGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GT6CA00CA0 TOACTGACAT TAGCAGCATC TTTAACACAO 008TGTGTTC 1620 

DD AAATGTAGAG TGGTOCTTTT CAGAGTTQGA CTTCTAOACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCRA CCCA6GCATG CAATGCCAAA TAATAGAATT OCTCCCTACC AGCTGAACAQ 1740 

GGAGGAGTCT GTGCAGTTTC TOACACTTOT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCiaAGACTA AGTT6TA6AA ATTAACAAAT GmCT U Cim GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG Vmri A TLT A ' GCCTAAGGTG GGTAGTCCAA 1920 

OU CTCTTOOTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCOCTCGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATOTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAQ AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

^- TCTCAOCTAG GTGAOOGCAT TGAGCCAOTO GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

OD GTTAAGGAAO AA6ATAQATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTOAOTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

T TTAC AAAAA AOTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTOTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGOAG 2460 

^/s CTATCACTGT ACTTOTAOAG TG6T6CT6CT TTAATTCATA AATGACAAAT AAAAGCCAAT 2520 
/U TAGCTCTATA ACT 



Seq ID NO: 440 Protein sequence 
Protein Accession #t AAA59909.1 

1 11 21 31 41 51 

I I I I 1 I 

MLTNVPXSW LFPCSNIiTKP TVLVLYCPGG AZTVLVEWOC FN8 



80 Seq ZD NOi 441 DNA sequence 

Nucleic Acid Accession «: NM_002381.2 
Coding sequencei 64.. 1524 ~ 

1 11 21 31 



1128 



wo 03/042661 



AMTOOBAGC CTCGCGTQGO CTCCTGQCCC CaSAOGOACA CCACCAGGCC CACXSOAGCCC 60 

ACCATGCGGC GCCOGGCCCC CX5CGCQCCCC CTCCCXSGQAC TCCTCCTGCT GCTCTQOCCO 120 

CTGCTGCTGC TOCCCTCOGC OGCCCCCGAC CCCGTGGCCC GCCOGGGCTT CCGGAGGCTG 180 

QAGACCCGAQ GTCOrGGGGG CAGCCCTGGA CGCCGCCCCT CTCCTGC3GQC TCCOQAOGGC 240 

G0GCCCX3CTT CXX3GGACCAG OOAGCCTGGC 0G06CCCGCG GTGCAGOTQT TTGCAAGAGC 300 

AGAOCCTTGQ ACCTGOTCnT TATCATTGAT AGTTCTOGTA 60QTA06GCC CCTGGAATTC 360 

ACC AAAGTG A AAACmTGT CTCCG6GATA ATOGACACTC TGGACATTGG GCCAGCOOAC 420 

AOQOSGGTGG CAGTGGTGAA CTATGCTAGC ACTQTGAAiQA TOQAUITUUV ACTCXZAOGGC 480 

TACACA6ATA AGCAGTCCCT GAAOCAGGCT OTOGGTGQAA TCACACOCTT OTCMCMSGC 540 

ACCATOTCAG QCCTAGOCAT CCAC3ACAGC» ATGGAOGAAO CCTTCACAGT GGAGGCAGGG 600 

GCTCX3A6A6C CCTCTTCTAA CATCCCTAAG GTGGCCATCA TTOTTACAGA TGG6AGGCCC 660 

CAGGACCAGQ TGAATGAAGT GGOOGCTOGG GCCCAAGCAT CTGGTATTGA GCTCTATGCT 720 

GTGGGOGTGG ACCGGGCAGA CATGGCGTCC CTCAAOATQA TGOCCAOTQA GGCCCTAGAG 780 

OAQCATGTTT TCTACX3TGGA GACXTTATCGG CTCATTGAGA AACTTrCCTC TAQATTCX3VG 840 

GAAACCTTCT GTGOGCTGGA CCCCTOTQTQ CTTGGAACAC ACXJiGTGCCA GCAOGTCTGC 900 

ATCAGTGATG GGGAAGGCAA GCACCACTGT GAGTGTAQCC AAOOATACAC CTTGAATGCC 960 

GACAAGAAAA OOtXSTTCAGC TCTTQATAGG TGTGCTCTTA ACACCCAOQO ATGTGAGCAC 1020 

ATCTSixSTGA ATGACAGAAO TOGCTCTTAT CATTOTGAGT GCTATQAAOG TTATACCTTG 1080 

AATQAAGACA GGAAAACTTO TTCAGCTCAA GATAAATGTG CTTTOOGTAC COVTOG G TOT 1140 

CAGCACATTT GTGTGAATOA CAGAACAGGG TCCCATCATT GTGAATGCTA TGAGGGCTAC 1200 

ACTCTGAATQ CAGATAAAAA AACATGTTCA GTCOGTQACA AOTGTGCCCT AGGCTCTCAT 1260 

G0TTGCCA6C ACATTTGTGT GAGTQATGGG GCCGGATCCT ACCACTQTGA TTGCTATCCT 1320 

GGCTACACCT TAAATOAGGA CAAGAAAACA TGTTCAQCCA CTQAGQAAGC AOOAAOACTT 1380 

OTTTCCACTQ AAGATGCTTG TGOATGTGAA GGTACACT6G CATTGC3U!!6A CAAGGTCAGC 1440 

TCGTATCTTC AAAGACTGAA CACTAAACTT QATGACATTT TGGAGAAGTT GAAAATAAAT ISOO 

GAATATQQAC AAATACATCO TTAAATTGCT CCAATTTCTC ACCTGAAAAT GTGGACAGCT 1560 

TGGTGTACTT AATACTCATO CATTCTTTTG CACAOCXGTT ATTGCCAATG TTCCTGCTAA 1620 

TAATTT6CGA TTATCTGTAT TAATGCTTGA ATATTACTGG ATAAATTGTA TGAAGATCTT 1680 

CTGC AGAATC AOCATGATTT TTCCAAGQAA ATACATATGC AGATACTTAT TAAGAGCAAA 1740 

CTTTAGTGTC TCTAAGTTAT GACTGTGAAA TGATT66TA0 6AAATAGAAT QAAAAGTTTA 1800 

GTOTTTCTTT ATCTACTAAT TGASCCATTT AATrTTTAAA TGTTTATATT AGATAACCAT 1860 

ATTCACAATG GAAACTTTAG GTCTAGTTTC TTTTaATAOT ATTTATAATA TAAATCAATC 1920 

TTATTACTGA GAGTGCAAAT TOTACAAGGT ATTTACACAT ACAACTTCAT ATAACTGAOA 1980 

TGAATGTAAT TTTGAACTGT TTAACACTTT TTGTTTTTTG CTTATTTTOT TGGAGTATTA 2040 

TTQAAGATGT GATCAATAGA TTGTAATACA CATATCTAAA AATAGTTAAC ACAGATCAAG 2100 

TGAACATTAC ATTGCCATTT TTAATTCATT CTX3GTCTTTG AAAGAAATGT ACTACTAAAO 2160 

A6CACTA6TT GTGAATTTAG GGTGTTAAAC TTTTTACCAA GTACAAAAAT CCCAAATTCA 2220 

CTTTATTATT TTGCTTCAOO ATCCAAOTGA CAAAOTTATA TATTTATAAA ATTGCTATAA 2280 

ATCGAC!AAAA TCTAATGTTG TCTTTTTAAT OTTAOTGATC GAOCTGCCTC AGCCTCXXAA 2340 

AGTGCTGGGA TTACAGGCTT QAAAGTCTAA CTTTTTTTTA CTTATATATT TGATACATAT 2400 

AATTCTTTTQ QCTTTGAAAC TTGCAACTTT QAGAACAAAA CRGTCCTTTA AATTTTGCAC 2460 

TGCTCAATTC TGTTTTTCX3T TTGCATTGTC TTTAATATAA TAAAAGTTAT TACCTTTACA 2520 

TATTATCATO TCTATTTTTG ATOACTCATC AATTTTGTCT ATTAAAGATA TTTCTTTAAA 2580 
TTAAAAAAAA AAAAAAAAA 

Seq ID NOi 442 Protein sequence 
Protein Accession #t 1IP_002372.1 

1 11 21 31 41 51 

I I I I t I 

MFRPAPARRIi PGLLLIXHPL LLLPSAAPOP VARPQFRRIiE TRGPGGSFGR RP8PAAPIX3A 60 
PASOTSEPQR ARQAQVCKSR PLDLVPIIDS SRSVRPLEFT KVKTPVSRIl DTLDIGPADT 120 
aVAWNYAGT VKIBFQLQAY IDKQSXiKQAV GKZTPLSTOT HSGLAXOTAM OBAFTVEAGA 180 
REPSSNIPKV AIIVTD6RPQ DQVNBVAARA OASGIBLYAV GVDRADMA8I< KMMASEPLBE 240 
HVPYVBTYGV lEKLSSRFQE TPCALDPCVL OTHQOQHVCI SDGBQIMHCB CSOGYTLHAD 300 
KKTCSALDRC AIiKTRGCEHI CVNDRSG8YB CBCYE6YTU7 EDRKTC8AQD KCALQTHGOQ 360 
HICVNDRTGS HHCECYEGYT UtADKKTCSV ROKCALGSHG OQHICVSDGA ASYHCDCYPG 420 
YTLMEDKKTC SATBEARRLV STEDA06CBA TLAFQDKVSS YLQRUITKU) DILBKLKIMB 480 
YGQIRR 

Seq IB NO: 443 DMA secfuence 
Nucleic Acid Accession #t MM_016639.1 
Coding eequencei 40.. 429 "* 

1 11 21 31 

i i t I 

GOGGCGGGOG CAGACAG06G 0GG606CAGG AOGTGCACTA 
OGGTTQCTGC GGCTOCTGQT GCIGGG6CTC TGGCTGGG8T 
6A6CAAG0GC CA6GGAC0GC CCCCIGCTCC OGOOGCAGCT 
. AAGTGCATGG ACTGGQCGTC TT6CAGG6CG CGACG6CACA 
GCTGCAGCAC CTCCTGCCCC CTTCCGGCTG CTTTGGCCCA 
CTGACCTTOQ T6CTG6GGCT GCTTTCTGGC TTTTTQQTCT 
GAOAAGTTCA CCACCCCCAT AGAG6AGAOC GGCSQGAGAGG 
ATCCAOTGAC AATGT6CCCC CTGCCAGCGQ GGGCTOQCCC 
TTCTAOAGCC AOTCTCTGCC TCCCAGAOQC GGCGGOAGCC 
GGGGTQGGGO GCX3GTGAATC ACCTCTGAGQ CCIGGOCCCA 
AGGTGTCTGG TTGCCCT6CC TCTGGCTCCA GAACAGAAAG 
ACAAAACA6C TGACACTQAC TAAG6AACTG CAGCATTTGC 
OCTTCCTTAO GACCTG G G G G CCAGGCT6AC TTGGG G GGCA 
TCACTCAGAT GTCCTGAAAT TCXyiCCAOGO GGQTCACCCT 
TTAACACTAQ GGGCTGQCCC ACTAGQAGGO CTGGCCCTAA 
CCCAAA60G0 GGAGGAGATA TTTATTTTGG GGAGAGTTTG 



41 51 

I I 

TGQCTCGGGG CT08CTG0GC 60 

TGCTGOGCTC OGrTGGOC^GGO 120 

GCrGGAOOGC GGACCTGGAC ISO 

60GACTTCTG CCTGGGCTGC 240 

TCCTTGGGGG OGCTCTGAGC 300 

GGAGAOGATQ CCX^CAG GAGA 360 

GCTGCGCAGC TGTGGOGCTG 420 

ACTCATCATT CATTCATCCA 480 

AAGCTCCTCC AACXACAAGG 540 

GGGTTCAGGQ GAACCTTCCA 600 

GGA6CCTCAC GCTGGCTCAC 660 

ACAGGGGAGG GGGGT6C0CT 720 

GACTTQACAC TAGQCXXXAC 780 

6GGGGGTTAG GGACCTATTT 840 

QATACAGACC CCOCCAACTC 900 

6AGGGGAGGG AQAATTTATT 960 



1129 



wo 03/042661 



AATAAAAOAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 

Seg ID NO: 444 Protein eequenee 
Protein AccesBion fti HP 057723.1 



1 11 21 31 41 51 

I I I I i I 

MARGSLRRLL KLLVIiGLWLA LU18VAGEQA PGTAPCSRGS SHSADLDKCM DCASCStARPH 
SDFCLGCAAA PPAPFRLUIP ILG6ALSLTP VbQLbSOPLV HRRC3UUIBKF TTPXBBTGSB 
GCPAVAUQ 



60 
120 



8eq ID NOt 445 DWA sequence 
Kuclelc Acid Accession «t AP322916.1 
Coding sequences 50.. 4300 



1 11 21 31 41 51 

I I 1 i I I 

GCACTCCGCA GCCTTTAAGO TTOGCGOGGG GGCCAGGCAA 6AC3TTAGGCA T6AA6AG0CT 60 
CAA6TCC06C CTOAOQAGGC AGGACX3TGCC OgOCOC O QOS TGGTCTGGGG COGCOQGOQC 120 
CAGOGOGCAT GCAGCAGATT GGAATAAATA TGATOACOQA TTGAT6AAA0 CAGCA6AAAG 180 
GOGGGATGTA GAAAAAGTGA CCTCAATCCT TGCTAAAAAG 6GGGTCAATC CAGOCAAACT 240 
AOATOTGOAA GGCAGATCTG TCTTCCATGT TGTGACCTCA AAGGGGAATC TTGAGTGTTT 300 
GAATGCCATC CITATACATO GAGTTGATAT TACAACCAGT GACACTGCAG G6AGAAATGC 360 
TCTTCACCTG OCTGCTAAOT ATGGACATOC ATTGTGOCTA CSUUUU^CTTC TACAQTACAA 420 
TTGTGCCACT GAGCAT6CAG ACCTGCAGGG AA6AACTGCA CTTCAAAAAA AA0CAATG6C 460 
AGATTGTCCT TCTAGCATAC A6CTQCTTTQ TGACCATGGG GCCTCTGTGA ATGCCAAAQA 540 
TGTAAAAGGQ CGOACACCAC TT G TTCTGGC TACTCAGATQ AGTAGGCCAA CAATATGTCA 600 
ACTGCTQATA GATAOAGQAO CXK3ATGTTAA TTCCAGAGAC AAACAAAACA GAACTGCOCT 660 
CATGCTAGGT TGOGAATATO GTTGCAGAQA TGCAGTAGAA GTCTTAATTA AAAATGGT3C 720 
TGATATAAGC TT6CTGGAT0 06CTTG0CCA TGATAiSTTCT TACTATGCAA GAATTGGTQA 7fi0 
CAATCTGGAC ATTCTAACXn' TGTTGAAGAC TGCATOGGAA AATACCAACA AAGGGAGAGA 640 
ACTTTGQAAG AAAGGACOIT CTTTQCAACA GCOAAATTTQ ACACACATQC AAQATGAAGT 900 
AAATGTGAAO TCACATCAOA GGGAGCATCA AAATATTCAG GATTTGQAGA TT6AAAATX3A 960 
AGATTTGAAA GAQAGGTTGA GAAAAATTCA GCAAGAACAA AGAATACTTT TGGATAAAOT 1020 
CAATGOTTTA CAGTTACAOC TQAATGAQOA AOTTATOGTr 6CT6AT6ATC TGGAAAGCGA 1080 
QAQAGAAAAO CTOAAGTCCC TTTTGGCAGC TAAAGAAAAG CAACATGAAG AAAGCTTAAQ 1140 
GACTATTGAG GCTCTGAAAA ATAGATTTAA ATATTTTGAG AOTGATCATT TAGGATCAGG 1200 
AAGTCATTTC AGTAACOQAA AAGAAGATAT GCTTCTTAAA CAAGOTCAGA TGTATATGGC 1260 
AGACTCACAG TGTACTTCCC CAGGTATACC AOCCCATATG CAAAOCAOAT CTATSTTAAO 1320 
ACCTCTGGAA CTATCTTTAC CXaOTCAAAC GTCATACTCT GAAAATGAAA TTTTAAA6AA 1380 
AGAOTTAGAA GCAATGCGAA CTTTCTGTGA GTCAGCAAAA CAAGACGGAC TGAAGCTOCA 1440 
AAATQAACTG GCACACAAAO TGGCAGAATG CAAAGCTTTA GCATTAGAAT GTGAAAGGGT 1500 
CAAGQAGGAT TCAGATGAAC AGATAAAGCA ATTAGAAGAT GCATTAAAAG ATGTGCAGAA 1560 
GAGGATQTAT GAGTCAQAAQ GTAAAGTTAA ACAAATOCAO ACCCATTTTC TTGCCCTTAA 1620 
AGAACACTTA ACAAGT6AAG CAGCCTCAGG GAATCACAOA CTAACCGA6G AACTGAAGGA 1680 
TCAGTTGAAA 6ACTTGAAAG TAAAATATGA AGGTGCTTCA GCAGAA6TG6 GGAAATTAAQ 1740 
AAACCAAATC AAAGAAAATQ ASATGATAGT AGAAOAOTTT AAGAQGGATG AAGGCAA6CT 1800 
0ATAQAA6AA AATAAGCGAT TACAGAAGGA ACTTAGTATG TGTGAAAT6G AGOGAGAGAA 1860 
GAAAGGAAGA AAGGTCACAG AOATGGAAGG CGAGGCAAAA GAATTGTCAG OGAAGTTGGC 1920 
CCTTTCCATT CCAGCTGAAA AATTTGAAAA CATQAAGAGC TCATTATCAA ATGAAGTGAA 1980 
TGAGAAAOCA AAAAAATTAO TAGAAATQGA AAOAOAACAT GAAAAATCAC TTAGTGAAAT 2040 
TAGACAOTTA AAGACSAOAAC TT8AGAATGT TAAG6CCAAG CTTGCTCAGC A06TCAAACC 2100 
AGAGGAACAT GAACAGGTTA AGAGCA6ATT A6AACAGAAA TCAGGAQAAC TT GGGA AQAA 2160 
QATCACTGAO TTAACATTOA AAAATCAGAC ACTACAAAAG GAAATTGAAA AAGTTTATTT 2220 
GGATAATAAG CTCXTTCAAGG AGCAA6CACA TAACTTAACA ATTGAAATGA AAAATC3VTTA 2280 
' iWl ' CVrriA AAAOTAAGTO AAOACATGAA AAAOTCACAT GATGOUVTTA TTGATGATCT 2340 
TAATAGAAAO CTTTTA&ATG TAACAOVAAA ATATACAGAA AAGAAGTTGG AAATGGAGAA 2400 
ATTGCTACTG 6AAAATGACA GCTTAAGTAA G6ATGTAA0C OGCCTAGAAA CTGTGTTTGT 2460 
ACCTCXnXJAG AAACATQAAA AAQAGATAAT AGCTCTQAAA TCCAATATTG TTGAACTTAA 2520 
GAAACAGCTG TCTGAACTTA AQAAAAAATQ TOGTaAAOAC CAGGAGAAAA TACACGCTCT 2580 
CACATCXGAA AAGACTAACT TOAAGAAOAT GATGAGIAAT CA6IATGTGC CAGTTAAAAC 2640 
CCATGAA6A0 GTTAAAATGA CACT6AATGA CACXTTTAGOC AAAACTAACA QAGAATTATT 2700 
AGATGTGAAG AAAAAATTTG AAGATATAAA TCAGGAATTT GTAAAAATAA AAGATAAGAA 2760 
TGAAATATTA AAAAGAAACC TGGAAAACAC TGAGAACCAA ATAAAAGCT6 AGTACATCAO 2820 
CCTGGCAGAG CACGAQGCAA AQATGAGCTC GCTAAGTCAG AGCAIGAQAA AGGTGCAGGA 2880 
TAOTAATQCT GAAATCTTGG CCAACTACAG AAAAGGCCAA GAAGMaATTG TGACACIGCA 2940 
TGCOGAAATT AAA6CCCA6A AGAAGOAGCT 06ACACAATA CAAfiAATOCA TTAAGGTAAA 3000 
ATATGCCXX31 ATTGTCAGCT TTGAGGAGTG OGAQAGAAAA TTTAAA6CAA CAGAGAAAGA 3060 
ACTAAAAGAC CAGTTATCAG AGCAGACACA AAAGTATAGT GTCAGTGAAO AAGAAGTCAA 3120 
QAAAAACAAG CAAGAGAATG ACAAOTTAAA GAAGGAGATT TTTACCCTTC ABAAAGATTT 3180 
GAGAOATAAG AGAOTTCTCA TTGAOAAGTC TCATQAAAT6 GAAAGAOCAT TAAQCA6AAA 3240 
AACASAOGAG CTAAACAAAC AGTTAAAASA CTTOTCACAO AAATACAOGO AAGTAAAGAA 3300 
TGTGAAAGAG AA6CTAGTAG AAGAAAATGC CAAACAGACT TCTGAGATAC TTGCAGTGCA 3360 
AAATCTTTTG CAAAAACAAC ATGTTCCATT GGAACAGOTT GAGGCTCTQA AAAAATCTCT 3420 
TAAT6GCACA ATTGAAAATC TAAAGGAAGA ACTGAAGAGT ATGCAAAGGT GTTACX3AGAA 3480 
A0AGGAGCA6 ACAGTGAGGA AACTGCATCA ATTGTTQGAG AATGAAAAGA ACTCTTCTOT 3540 
AOCCCTGGCA GAGCATTTGC A6ATTAAAGA AGCATTTGAG AAAQAAGTTG GAATCATAAA 3600 
AGCCAGCTTO AGAGAAAAGG AA6AAGAAAG CCAAAACAAA ATGGAAGAAG TCTCCAAACT 3660 
TCAGTCGGAQ GTTCAGAATA CTAAACAAGC ATTAAAAAAA TTAQAGACTA OAGAGGTAGT 3720 
TGACTTOTCT AAATATAAAQ CAACAAAAAO TGATTTGGAG ACACAGATTT CTAOC TTAAA 3780 
TGAAAAATTO GCCAATCTGA ATAOAAAOTA TaAGQAAOTA TGlXaGGAAO TTTTGCATGC 3840 
CAAAAAOAAO GAAATATCTG CAAAA0AT6A GAAGGAATTA CTGCATTTCA CCATTGA6CA 3900 
AGAAATTAAG GATCAGAAiGG AA0GAT8TGA TAACTCCTTA ACAACAATCA CAOAGTTACA 3960 
AAGAAOAATA GAAGAATCTO CTAAACAAAT A6AAGCAAAA OATAATAAGA TAACTGAACT 4020 



1130 



wo 03/042661 



PCT/US02/36810 



GCTTAATGAT 
AACTGGGMVC 
ATCPCTGCSW3 
TOGGACACAC 
6CT0CAGATC 
TATCIGTTTT 
CTTACTGTGC 



GTGQAAAQAT 
CCCACCAAGA 
CAACAGCTGG 
CTTCTTAGTG 
ATACAAATGC 
ATCTTGCTG6 
TGOTATAATT 



TAAAACAGGC 
GQCAGAGCCA 
CCX3ATQCTQA 
CTGGACAGGO 
GGCAQGOQCT 
TGCTQAACAT 
AAAATAAAAT 



ACTCAATGGC 
GCTGATTGAC 
CAGACAGCAC 
TCACATGQAT 
TGTGTGCTAa 
TCr r T OTC CA 
ATATTTTOTT 



CTTTCCCAAC 
ACTCTGCAGC 
CAAGAAGTAA 
GAAGATGTTC 
0G6TTAGCAC 
ACTCCATGGT 
CPG GQ TOT 



TCACCTACAC 40 BO 

AOCAAOTGAA 4140 

TTGC3VATTTA 4200 

AGQAG6CTCT 4260 

TQACTQCCAG 4320 

CTTTCTGGGC 4380 
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Seq ID NOt 446 Protein sequence 
Protein Acceselon 8i AAG49577.1 



1 
I 

KKSLKSRLRR 
PGKU3VEQR5 
IK)YNCPTEHA 
TIOQLLIDRQ 
RZGDNLDILT 
IBNEDliKERIi 
BSLRTIEALR 
SHXAPItELSL 



ELKDQLKDLK 
EREKKGRKVT 
LSBIROIiKRB 
XVYLDinailtK 
EMEKLLLBND 
IHALTSENIN 
KDXNEILKRN 
VTLHABIKAQ 
BEVXKNKQEH 
EVKMVKEKLV 
CYBKBQQTVT 
VSKIiQSBVQN 
VLHAXKKBIS 
ITELLNDVER 
lAIYRTRLLS 



11 

I 

QDVPGPASSG 
VPHWTSKGN 
DLQGRTALQK 
ApVNSRDKQN 
LLKTA8BNTN 
RKIQQEQRIL 
NRFKYFBSDH 
PSQfTSySBIB 
QIKQLEDALK 
VKYECSASAEV 
EMBGQAKELS 
LSNVKAKLAQ 
BQAHMLTXEM 



21 

I 

AAAASAHAAD 
LECLMAILIH 
KAHADCPSSZ 
RTALMLGCBY 
KBRELWKKGP 
U7KVNGLQLQ 



31 

1 

GVDITTSDTA 
QLLCDBGASV 
OCRDAVEVLI 
8LQQRHLTHM 
LNEBVMVADO 
KEOMLLKQGQ 



LKKMMSNQYV 
LENTQNQIKA 
KKELDTIQEC 
DKLKXBXPTL 
EENAKQTSEI 
KIiHQLLENQK 
TKQALKKLBT 
AKDEKEliLHF 
LKQALKOLSQ 
AAQ6HMDEDV 



ILKKELBAMR 
DVQKRMYBSB 
GKLRNQIXQN 
AKLALSIPAE 
HVXPEBHEQV 
KMHYVPLiCVS 
TVFVPPBXBB 
FVKTHBBVKH 
EYISLABREA 
IKVKYAPIVS 
QKDLRDXTVb 
LAVQMUiQKQ 
NSSVPLAEHL 
REWDLSKYK 
SZEQBIKDQK 
ItTXT^SIPTK 
QEALLQIIQM 



(3KVKK2HQTHF 
EMIVEBFKRD 
KFENMKSSLS 
KSRLBQKSGB 
EDHKXCSHDAZ 
KEXZALKSMZ 
TLNDTLAKTN 
KMSSLSQSMR 



HVPLBQVEAL 
QIKBAFBKEV 
ATKSDLETQI 
ERCDKSLTTZ 
RQSQLZDTIiQ 
RQQLVC 



41 

I 

AAERGDVHKV 
GRNALKLAAK 
NAKDVXGRTP 
KNGADISLIiD 
QDBVNVKSHQ 
liESEREKLKS 
HYMADSQCTS 
LKLQNBXiABK 
LALKEHLTSE 
EQKLIEENKR 
NBVNEKAKKL 
liGKKZTBLTL 
iroLNRKLLD 
VBLKKQLSEL 
REIiLDVKKKF 
KVQDSKAEIL 
TEKEZiKDQIiS 
LSRKTDELNK 
KKSLNGTZEN 
OXZXASIiREK 
SSINEKLANL 
TELQRRZQBS 
BQVKSLBQQL 



51 

I 

TSILAKKGVN 
YGHALCLQKL 
LVLATQMSRP 
AL6HDSSYYA 
REHQNIQDLE 
LXiAAKEKQHE 
FGZPABKQSR 
VABCKALALB 
AASGMHRLTB 
LQKEL8MCEM 
VEMBREUEKS 
KHQTLQKEZE 
VTQKYTBKKL 
KKKOGEDQER 
EDINQEFVKZ 
ANYRKGQBBI 
BQTQKY8VSB 
QLKDLSQKyr 
tiKEELKSMQR 



NRKYEEVCEB 
AKQZEAKDNK 
ADADRQHQBV 



Seq ID HOi 447 DNA sequence 
Nucleic Acid Accession MM_003020.1 
Coding sequencer 29.. 664 ~ 



1 
I 

06CTCCTOG6 
TGGCCTACTG 
CGCTGAC06G 
GOGCATT6CC 
CCAGAGCATT 
CCCCAACATC 
GGGGTAOCCA 
AAACAOOCCT 
TCCGGAACAT 
GAA66GAGGA 
GGATAATGTT 
OTAAAOAGAA 
T6CACGTGTA 
TACAAAGCAG 
AAATTAGAAT 
TAAAAATTAA 
TQCAGTTTAA 
TTTGATTTTG 
OCTOTACTCA 
GCAAGCArrO 



11 

I 

OCTGCCCCTC 
TTTTGGCTGO 
GTCTCAGAAG 
AGGCCCCGAO 
GAAGGTGGAO 
QTGGCAOAOT 
6ACCCTCCAA 
GACACTGCAO 
GACTATCCAG 
6AGA6AC6AA 
GTTGCAAAGA 
6AT0CTAGAC 
AATGOAGTCC 
CTGTATGTAG 
AAGAGCTTTT 
AATGTGAATG 
AATGOTQTCT 
ATTATGTAOT 
AGAOGAGGAO 
GC 



21 

1 

GGTTGACAAT 
CATCTOGATG 
CAGATATCCA 
TGGAATATCC 
CTCATXyuUSG 
TQACTGGAGA 
ATOCCTGTCC 
AOTTCAOTCQ 
GCTTGGGCAA 
AGCGGA6GAG 
AGTCTGTCCC 
GAAAACOCAC 
CTGTGAATGA 
ATAGTGTATT 
TTQTTTCTTG 
TCAACAATAA 
GAOOTTQTAC 
TCATCCAGCC 
CTGACACATT 



31 
I 

GGTCTCCAGG 
6ACTCCAGCA 
GAGQCTGCTT 
AGCTCACCAG 
ACTTCAGCAT 
CAACATTCCT 
TGTTQGAAAA 
AGAGTTCCAO 
GTGGAACAAG 
TGTCAATCCA 
CCATTTTTCA 
ATTACCTGTT 
C!A6CATGTTT 
GTCTTCACAC 
GQTTTTTAAA 
AAAGCAAGAC 
TATTTTQQCC 
CTTGGGCATT 
TCACTTGQCT 



41 

I 

ATGGTCTCTA 
TTTGCTTACA 
CATGGTGTTA 
GCCATGAATC 
TTGGGTCCTT 
AAQQACTTTA 
ACAGATQATG 
TTGCACCAOC 
7AACTCCTTT 
TATCTACAAO 
GATGAG6ATA 
AGQOCTCAGC 
CTTACATAOA 
CGATGATTCT 
ATOTGAATCT 
TATQAAAGGC 
AAOTCTOTAO 
GTTATACAOC 
GCXiTCTTAAT 



51 

I 

CCATGCTATC 
Q0CCC06GAC 
T6GAGCAATT 
TTGTGGGCCC 
TTGGCAACAT 
OT GAGGAT CA 
GATGTCTAGA 
ATCTCTTTSA 
AOGAOAAGAT 
GACAOAaACT 
AGGATCCAGA 
ATGGCTTATO 
TAATTATGQA 
GCTTTTTGCT 
GCAATQATCA 
TCAGATTTCT 
AAAOCTOTGA 
AGTAAAQAAa 
AAACATGAAT 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NOi 448 Protein sequence 
Protein Accession fti NP_003011.1 

1 11 21 31 41 51 

1 1 I I I I 

HVSRMV81ML SGLLPIfZASG NTPAFAYSPR TPDSV8BADI QRLI£6VNBQ XiSZARPRVEy 
PAHQAMNLVG PQSZBGGAHB GLQHLGPFG!! ZPNZVABLTO CHZPRDPSED QGYPOPPHPC 
FVGKnSDGO. ENTPDTASFS RBFQUiQHLF DPBBDYPGLO XMNXXU.YBK NKGQERRXRR 
SVHPyLQGQR U»IWAKK6V PHPSDEDXDP B 

8eq ID NOi 449 DWA sequence 
Nucleic Acid Accession ft: NM_003816.1 
Coding sequence; 79..2S38 ' 

1 11 21 31 41 51 

I I I I I 1 

OGGCAGGGTT GGAAAATGAT GGAAQAaGGG GAGQTGGAGO OGAGOQAOTG CTOAGAOOAA 
OCTG OGGAAT 0GGC06AGAT GG6GTCT0GC GOQGQCTTTC CCTOSGGOAC CCTT0GT6TC 
O^TGGTTQC TGTT6CTXOQ CCTQCaSGOBC OCAOTGCTOG GTQCaSGGGOG 6CCAGGCTTT 

1131 



60 
120 
180 



60 
120 
180 
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CAACAOiVCCr CACATCITTC TTCTTATGAA ATTATAACTC CTTOOAGATT AACTAGAGAA 240 

AGAAGAGAAO CCCCTAOGGC CTATTCAAAA CAAGTATCTT ATOTTAITCA GGCTGAAGGA 300 

AAAGAGCATA TTATTCACTT GQAAAGGAAC AAAGACCTTT TGCCTGAAQA TTTTOTGOTT 360 

TATACTTACA ACAAGGAAGO GACTTTAATC ACTGACCATC CCAATATACA OAATCATTGT 420 

CATTATCGGG GCTATGTGGA GGOAGTTCAT AATTCATCCA TTGCTCTTAC OGACTGTTTT 4 BO 

OO ACTCRO AO CATTGCT6CA TTTAGAQAAT GOBAOTTATO G6ATTGAACC CCTGCA6AAC 540 

AGCTCTCATT TTGAGCACAT CATTTAT06A ATGGATGATO TCTACAAAGA GGCTCT6AAA 600 

T6TG6AGTTT CCAACAAGGA TATAGAGAAA GAAACTGCAA AGQATGAAGA GGAAOAGCX:! 660 

CCCAGCATGA CTCAOCTACT TCGAAGAA6A AGAGCTGTCT TGCCACAGAC CCGGTATGTG 720 

GAGCTGTTCA TTGT O GTAGA CAAGGAAAGG TATGACATGA TGGGAAGAAA TCAGACT6CT 780 

OTGAGAGAAG AGATQATTCT CCTG6CAAAC TACTTQQATA OTAT GTAT AT TATQTTAAAT 640 

ATTOGAATTQ T6CTAQTT00 ACTGGAOATT T06A0CAATG GAAACCTGAT CAACATAGTT 900 

GGGGGTGCTG QTGATGTQCT GG66AACTTC GTGCASTGGC GGQAAAAQTT TCTTATGAOV 960 

OGTCGGAGAC ATQACAOTOC ACAGCTA6TT CTAAAOAAAG GTTTTGGTGG AACTGCA6GA 1020 

ATGGCATTTG TGGGAACAST GTGTTCAAGG AGCCA06CA6 60GGGATTAA TGTGTTT6GA 1080 

CAAATCACTG TGOAOACATT TGCTTCCATT GTTGCTCATa AATTGGGTCA TAATCTTG6A 1140 

AT6AATCACG ATQATGQGAG AGATT6TTCC TGTGGAGCAA AGA6GTGCAT CATGAATTCA 1200 

GGAGCATOGG GTTCCAQAAA CTTTAGCAOT TGCA(3TGCA0 AGGACTTTGA GAA6TTAACT 1260 

TTAAATAAAG QAOQAAACTQ CCTTCTTAAT ATTCCAAAGC CTGATGAAGC CTATAGTGCT 1320 

CCCTCCTGTG GTAATAAGTT GOTQ6AC6CT GGGGAAQAQT OTQACTGTGO TACTCCAAAQ 1360 

GAATOTGAAT TGGACGCTTQ CTGGSAA66A AGTACCTGTA AGCTTAAATC ATTTGCTGAO 1440 

TGTGCATAT6 GTGACTGTTQ TAAAGACTGT GGGTTCCTTC CAGGAGGTAC TrTATQCQQA 1500 

GGAAAAACCA OTOAaTGTGA TOTTCCAGAG TACTGCAATO OTTCTTCTCA GTTCTGTCAG 1560 

CCRGATGTrr TTATTCAGAA TGGATATCCT TGCCAQAATA ACAAAGCCTA TTGCTACAAC 1620 

GGCATGTGCX: AOTATTATGA TGCTCAATGT CAAOTCATCT TTGGCTCAAA AGCCAAGGCT 16 BO 

GCCCOCAAAG ATTGTTTCAT TGAAGT6AAT TCTAAAGGTG ACAaATTTGG CAATTGTGGT 1740 

TTCTCTGGCA ATGAATACAA GAA6TGT6CC ACTGGGAATQ CTTTQTOTGG AAAGCTTCSIO 1800 

TOTOAQAATG TACAAGAGAT ACCTGTATTT GGAATTQTGC CTGCTATTAT TCAAACGCCT 1860 

AGTOGAGGCA CCAAATGTTG GGGTGTGGAT TTCCAGCTAG GATCAGATQT TCCAGATCCT 1920 

GGQATG6TTA AOQAAGGCAC AAAATGTGGT GCTGGAAAGA TCTGTAGAAA CTTCCAGTGT 1980 

GTAGATGCTT CTGTTCTQAA TTATGACTGT QATGTTCASA AAAAOTOTCA TGQACATGOG 2040 

GTATGTAATA GCAATAAGAA TTGTCACTGT GAAAATGGCT GGGCTGCCCC AAATTGT6AG 2100 

ACTAAAGGAT ACGGAGGAAG TGT66ACAGT GGACCTACAT ACAATGAAAT GAATACT6CA 2160 

TTSAGGGACG GACTTCTGGT CTTCTTCTTC CTAATTGTTC CCCTTATTGT CTOTGCTATT 2220 

TTTATCTTCA TCAAGAGGQA TCAACTGTGG AGAAGCTACT TCAGAAAGAA QA6ATCACAA 2280 

ACATATGAGT CAGATGGCAA AAATGAAGCA AACCCTTCTA GACAGGOGQQ GAQTOTTGCT 3340 

OQACSVTGTTT CTOCAGTQAC AOCTCCCAGA GAAGTTCCTA TATATGCAAA CAGATTTGCA 2400 

GTACXaACCT ATGCAGCCAA GCAACCTCAG CAGTTCCCAT CAAGGCCACC TCCACCACAA 2460 

COGAAAGTAT CATCTCAGGQ AAACTTAATT CCTGCCOOTC CTOCTCCTGC ACCTCCTTTA 2520 

TATAOTTCCC TCACTTGATT TTTTTAACCT TCTTTTTGCA AATGTCTTCA GGGAACTGAG 2580 

CTAATACTTT TTTTTTTTCT TGATGTTTTC TTQAAAAOCC TTTCTOTTGC AACTATQAAT 2640 

GAAAACAAAA CACCACAAAA CAGACTTCAC TAACACAGAA AAACAGAAAC TGAGTGTGAO 2700 

AGTTOTGAAA TACAAGQAAA TGCAGTAAAG CCAGGGAATT TACAATAACA TTTCaTTTTC 2760 

CATC3VTTGAA TAAGTCTTAT TCAGTCATCG GTGAGGTTAA TGCACTAATC ATGGATTTTT 2820 

TGAACATGTT ATTGC3VGTGA TTCTCAAATT AACTGTATTG GTGTAAQATT TTTGTCATTA 2880 

AQTGTTTAAG TGTTATTCTG AATTTTCTAC CTTAGTTATC ATTAATQTAO TTCCTCATTG 2940 

AACATGTQAT AATCTAATAC CTGTGAAAAC TGACTAATCA GCTGCCAATA ATATCTAATA 3000 

TTTTTCATCA TGCACGAATT AATAATCATC ATACTCTAQA ATCTTOTCTG TCACTCACTA 3060 

CATGAATAAO CAAATATTGT CTTCAAAAGA ATGCACAAGA ACCACAATTA AGATGTCATA 3120 

TTATTTTGAA AOTACAAAAT ATACTAAAAG AGTGTGTGTG TATTCAOGCA GTTACTOQCI 3180 

TCCATTTTTA TGACCTTTCA ACTATAGGTA ATAACTCTTA GAGAAATTAA TTTAATATTA 3240 

GAATTTCTAT TATGAATCAT OTGAAAGCAT GACATTOQTT CACAATAGCA CTATTTTAAA 3300 

TAAATTATAA GCTTTAAGOT AOGAAOTATT TAATAGATCT AATCAAATAT GTTGATTCAT 3360 

GGCTATAATA AAGCAGGAGC AATTATAAAA TCTTCAATCA ATTGAACTTT TACAAAACCA 3420 

CTTGAQAATT TCATGAGCAC TTTAAAATCT GAACTTTCAA AGCrXGCTAT TAAATCATTT 3480 

AOAATOTTTA CATTTACTAA GGTGTGCTGG GTCATGTAAA ATATTAGACA CTAATATTTT 3540 

CATAGAAATT AGGCTGQAGA AAGAAGGAAQ AAATGGTTTT CTTAAATACC TACAAAAAAG 3600 

TTACTGTGOT ATCTATQAGT TATCATCTTA OCTGTGTTAA AAAXGAATTT TTACTATGGC 3660 

ASATATGGTA TGGATCX5TAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCATAAT 3720 

AAAGTTTAAT AATAGGTTTA TTAACT6AAT TTCATTAlSTT TTTTAAAAGT GTTTTT6GTT 3780 

T6TGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 3840 
AAAAAAAAAA AAAAAAAAAA AAAAA 



Seq ID NO I 450 Protein sequence 
Protein Accession «: MP 003807.1 



MGSGARFPSG 
PYSKQV8YVI 
EGVHNSSIAL 
DIEKETAKDE 
LLANYXiDSMy 
AQLVLKKGFG 
RDCSOQAKSC 
LVDAOEECDC 
DVPEYCNGSS 
IBVNSKGDRF 
HGVDPQLGSD 
HCHCEKGWAP 
DQIiHRSYFRX 
KQPQQFPSRP 



11 

I 

TLRVRWLLLl. 
QAEGKEHIIH 
SDCFGXjRGLL 
EEBFPSMTQL 
IMLKIRIVLV 
QTAC2KAFVGT 
IMNSGASGSR 
OTPKECBZ£}P 
QFOQPDVFIQ 
GNOGFSGNEY 
VPDPGMVNBQ 
miCETKGYGG 
KBSQTYBSDO 
PPPQPKV6SQ 



21 
I 

GLVGPVLGAA 
LERNKDLLPE 
HLENASYGIE 
LRRRRAVLPQ 
GLEIWTNGtTL 
VCSRSHAGGI 
NFSSC8AEDF 
CCSGSTdOiK 
KGYPCQNNKA 
KKCATGNALC 
TKCGAGKICR 
SVDSGPTYNE 
KNQANPSRQP 
GNLIPARPAP 



31 
I 

RPGPQQTSHIj 
DFWYTVNKB 
PLQNS8HFBH 
TRYVELFIW 
INIVGGAGDV 
NVPGQITVET 
EKLTLNKGGN 
SFABCAYGDC 
yCYKGMOQYY 
GKLQCBNVQB 
NFQCVDA8VL 
MNTALRDGLL 
GSVPRHVSPV 
APPLY88LT 



41 

I 

8SYBIITPWR 
GTLITDHPNI 
IIYRKDDVYX 
DKERYDMMGR 
I/aiFVQWREK 
FASIVAHELG 
CLUnPKPDE 
CKDCRFIiPGG 
DAQOQVIFGS 
IPVPGIVPAI 
HYDCDVQKKC 
VPPPLIVPLI 
TPPREVPXYA 



51 

I 

ItTRERREAPR 
QNHCBYRGYV 
BPLKOQVSNK 
KQTAVRBEMI 
FLlTRRRHDS 
HNLQQIHDDG 
AYSAPSOBNK 
TZjCRGKTSBC 
KAKAAPKDCP 
ZQTPSRGTKC 
HGHGVCNSmC 
VCAZFIPIKR 
NRPAVPTYAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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Seq ID NOi 451 mA sequence 

Nucleic Acid Accession fii NM_016650.1 

Coding sequence; 196.. 789 ~ 

Si 11 21 31 41 51 

I I I I I i 

GtSTTTCAATA TATOCAGATO TCTCGATATA GGAATQAAAT TAOGTCTTTO GAACAACTTA 60 

AATAAGTCAA ATATACTTGG AGCTTTAAAA ATTAAJVAGGA GAGAQATTOO AGCACCTTTT 120 

^ CTGCTG CCAT GACAACCAT8 CAAGGAATGG AACAG60CAT G CC A G QQ TTG GCCTGGTGTQ 180 

iU COCCAGCTGO GAAACATGGC TGTCATACSVT TGACftTCTGT GQAAAGGATT 6CAAGAGAAG 240 

TTCTTGAAGG GAGAACCCAA AGTCCTT6GG GTTGT GC AGA TTCTGACTGC CCTGATGAGC 300 

CTTAOCATGG GAATAACAAT GATGTGTATG GCATCTAATA CTTATGGAAG TAACCCTATT 360 

TCCGTGCATA TCGGGTACAC AATTTOGGGG TCAGTAATGT TTATTATTTC A6GATCCTT6 420 

TCAATTGCAG CAOQAATTAG AACTACftAAA GOCCTG G TCC GAGGTAGTCT AGGAATGAAT 4B0 

i J ATCACCAGCT CTGTACTGGC TGCATCAGGG ATCTTAATCA ACACATTTAG iSriXSGiXi ' l ' TV 540 

TATTCATTCC ATCACCCTTA CTGTAACTAC TATGGCAACT CAAATAATTG TCATG6GACT 600 

ATGTCCATCT TAATGGGTCT GGATGQCATO OTGCTCCTCT TAAQTGTGCT GGAATTCTGC 660 

ATTGCTGTGT CCCTCTCTGC CTTTGGATGT AAAGTGCTCT OTTGTACCCC TGGTGGGGTT 720 

OTO TTAATT C TGCCATCACA TTCTCACATQ GCAGAAAGAG CATCTCCCAC ACCACTTAAT 780 

GAG6TTTGA6 GCCAACAAAA GATCAACA6A CAAATQCTCC AGAAATCTAT GCTGACTGTO 840 

ACACAAGAGC CTCACATGAG AAATTACCAQ TATCCAACTT OCSATACTGAT AGACGTGTTQ 900 

ATATTATTAT TATATGTAAT CCAATTATQA ACTGTOTGTG TATAGAGAGA TAATAAATTC 960 

AAAATTATGT TCTCATTTTT TTCCCTGGAA CTCAATAACT CACTTCACTO QCTCTTTATC 1020 
GA6AGTACTA GGAGTTAAAT TAATAAATAA TGCATTTAAT GAQ6CCACAG GAAAAA 



20 
25 



35 
40 
45 
50 
55 
60 
65 
70 



80 



Seq ID MOt 452 Protein eequence 
Protein Acceaslon «t NP_0S7734.1 



1 11 21 31 41 51 

30 I I I 1 I I 

HAVIRSHLWK GLQEKFLKGE PKVLGWQIL TALMSLSMOI TMMOfASNTY GSNPISVHIG 60 

VTIWGSVMFI ISGSLSIAAG IRTTKQLVRG SIXSHMITSSV lAASGILINT FSLAFYSFHH 120 

PYOnnrGNSN NCHGTMSILM GLDGHVLLLS VLEFaAVSI* SAFGCKVLCC TPGGWIiILP 180 
SH8HHABTA5 PTPUIBV 



Seq ID NO: 453 DNA sequence 

Nucleic Acid Accession NH_002091.1 

Coding sequence i 56.. 503 

1 11 21 31 41 51 

I I I I I I 

AGTCTCTGCT CTTCCCAQCC TCTCOSGCGC GCTCCAAGGG CTTCCCGTOQ GGACCATGCG 60 

OGGCAGTGAG CTCCCGCTGG TCCTGCTGQC GCIGGTCCTC TGCCTAOCGC CCCGGGGGCG 120 

AGGOQTCOCO CTGCCTGCGG GOGGAGGGAC CGTGCTGAOC AAQATGTACC GGCGCGGCAA 180 

CCACTGGGOa OTGGGGCACT TAATGGG6AA AAAGAGCACA GGGQAGTCTT CTTCTGTTTC 240 

TGAGA6AGGG AGCCTGAAGC AGCAGCTGAG AGAGTACATC AGGTGGGAAG AAGCTGCAAG 300 

GAATTT6CTO GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA 360 

GGCCTTGGGC AATCA6CAGC CTTCGTG66A TTCAQAGGAT AGCAGCAACT TCAAAGATGT 420 

AG8ITCAAAA GGCAAAGTTG GTAGACTCTC TGCTCCAGGT TCTCAAOQTG AAGGAAGGAA 480 

COCCCAGCTG AACCAGCAAT GATAATGATG GCCTCTCTCA AAAGAGAAAA ACAAAACCCC 540 

TAAGAGACTG A6TTCTGCAA GCATCAGTTC TACGGATOVT CAACAAGATT T0CTTGT6CA 600 

AAATATTTGA CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTQATTTT CAA6CAGCAT 660 

CTTCTGGTTT AAACTTGTTT GCTGTGAACA ATTGTOGAAA AGAOTCTTCC AATTAATGCT 720 

TTTTTATATC TAGGCTACCT OTTOQTTAGA TTCAAG60CC GQAGCTGTTA CCATTCACAA 780 
TAAAAGCTTA AACACAT 

Seq ID NO I 454 Protein sequence 
Protein Accession #i NP_002 062.1 

1 11 21 31 41 51 

I I I I I I 

MRGSELPLVL LALVLCLAPR 6RAVFLPAGG GTVLTKMYPR GNHWAVGHLM GKXSTGBSSS 60 
VSERGSLKQQ LREYIRHEEA ARNUjGLIEA KENRMHQPPQ PKAIiGNQQPS WDSEDSSNFK 120 
DVQSKGICV6R LSAPGSQREG RNPQIUQQ 

Seq ID NOt 455 DNA sequence 

Nucleic Acid Accession #i NM_016S22.1 

Coding sequence: 265.. 1299 " 



1 11 21 31 41 51 

I I 1 I I I 

GOGGAAGCAG CQAGGAGGGA GCCCGCTTTG GCCGTCC7CC GTGGAAC06G TTTTCCGAGG 60 

CTGG CRAAAG CC6AGGCTGG ATTTGGG6GA G6AATATTAG ACTOGGAGQA OTCTOCGOGC 120 

TTTTCTCCTC CCCGOGCCTC C0GGT06CCG CGGGTTCACC GCTCAGTCCC CGOGCTCQCT 180 

/j CCGCACCCCA CCCACTTCCT GTGCTOGCCC GGGGOOCGTG TGCCGTGCGG CTGCCQGAGT 240 

TCGGGGAAGT TGTGGCTGTC GAGAATGGGG GTCTGTGGGT ACCTGTTCCT GCCCTGGAAG 300 

TGCCTCGTGG TCGTGTCTCT CAGGCTGCTG TTCCTTGTAC CCACAGGAGT GCCCOTOOGC 360 

AGCQGAGATG CCACCTTCCC CAAAGCTATG GACAA0GTX3A 06GTC0GGCA GGGGGAGA6C 420 

6CCACCCTCA QGrGCACTAT TGACAACCGG GTCACCOQGG TGGCCTGGCT AAACCGCAGC 480 

ACCATCCTCT ATGCTGGGAA TGACAAGTOQ TGCCTGGATC CTOQOQTQQT CCTTCTGAGC 540 

AACACCCAAA COCAGTACAG CATCGAGATC CAGAACGTGG ATGTGTATGA CGAG6GCCCT 600 

TACACCTGCT OGGTGCAGAC AGACAACCAC CCAAAGACCT CTAGGGTCCA CCTCATTOTG 660 

CAA6TATCTC CCAAAATTGT AGAGATTTCT TCA6ATATCT GCATTAATGA AGGGAACAAT 720 

ATTAGCCTCA CCTGCATAGC AACTGGTAGA OCAGAGCCTA OGOITACTTG OAGACACATC 780 
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TCrCCCAAAG OQGTTGGCTT Ttnt^AGTGAA QAOOAATACT TGGAAATTCA G60CATOVCC 840 

CX3QGAACAQT CAGQGQACTA OC3AOTOCAGT GCCTCCAATG ACGTGQCOGC GCCOGTGGT A 900 

OGQAGAGTAA AGGTCACOGT GAACTATCCA CCATACATTT CA6AAGCCAA GGGTACAGOT 960 

QTOXOOTGQ GACAAAAGGG GACACTGCAG TGTQAAGCCT CAGCAGTCCC CTCAGCAGAA 1020 

TTCCAGTGGT ACAAGGATGA CAAAAiQACTO ATTGAAGGAA AGAAAGGG6T OAAAGTrGGAA 1080 

AAGAGAOCTT TCCTCTCAAA ACTCATCTTC TTCAATQTCT CT6AACATGA CTATGQ6AAC 1140 

TACACTTGOS TGGCCTCCAA CAAGCTGOGC CACACCAATG CCA6CATCAT GCTATTTGGT 1200 

CCAGGOGCCG TCAGCGAOGT GAGCAACGGC ACG7CX3AGGA GQ6CAGQCTG OGTCTGGCTG 1360 

CTGCCTCTTC TGGTCTTGCA CCTGCTTCTC AAATTTTGAT GTQAGTGCCA CTTCCCCACC 1320 

CGGGAAAGGC T6CGGCCACC ACCACCACCA ACACAACAOC AATGGCAACA C06ACAGCAA 1380 

CCAATCAGAT ATATACAAAT GAAATTAGAA OAAACACAGC CTCATGSQAC AiGAAATTTQA 1440 

6GGAGG0GAA CAAAOAATAC TTTGGGGGGA AAAGAGTTTT AAAAAAGAAA TT6AAAATT0 1500 

CCTTGCAGAT ATTTAGGTAC AATGGAGTTT TCTTTTCCCA AACXXSGAAGA ACACAGCACA 1560 

CCCX5GCTTGG ACCCACTGCA A0CT6CAT0G TGCAACCTCT TTGGTGCCAO TGTGGGCAAG 1620 

GGCTCAGCCT CTCT6CCCAC AGACTGCCCC CACX3TGGAAC ATTCTGGAGC T6GCCATCCC 1680 

AAATTCAATC AOTGCATAOA GAOBAACAGA ATaAGAOCTT COG G CCCAAG CXSTGGOGCTT 1740 

CCGGCCCAA6 CX3TG00GCTG OOGGCACTTT GOTAGACrGT GCCACCAOGO CS'fGTGll'GT 1800 
GAAA06TGAA ATAAAAAGAQ CAAAAAAAAA AAAAAAAAA 

Seq ID N0> 456 Protein sequence 
Protein Accession fit NP_057606.1 

1 11 21 31 41 51 

I I I I I I 

MGVCXSYLFLP WKCLVWSLR LLFLVPTGVP VRS6DATFPK AMDKVTVRQ6 ESATLRCTID 60 

xneVTBVAHUI RSTIIiYAOlD KWCLDPRWL LSNTQTQySZ BXQNVDVYDE GFYTCSVOID 120 

KHPKTSRVHL IV0V8PKIVB I88DISZNEG NNZSLTCIAT GRPBPTVTHR BI8PKAV0FV 160 

SEDEYLEIQG ITRGQSGDYB CSASNOVAAP WRRVKVTVN YPPYISBAXG TGVPVGQKGT 240 

LQCEASAVP8 ABFQWYKDDR RLIEGKKGVK VENRPFLSKL ZPFNV8EKDy GHYTCVASNK 300 
LGimiASIMli FGPQAVSEVS NGTSRRAQCV WLLPLLVLHL LLKF 

Seq ID NOt 457 DNA sequence 

Nucleic Acid Accession #; im_012261.1 

Coding sequence I 203.. 1045 

1 11 21 31 41 51 

I I I I I I 

QATTT Q CTCT GCCAGCAGCT GTCXSGTGCOG CGCTOQACAC OGAGTCCTAG CTAGG06CTC 60 

ACAGAATACG OGCTCCCTCC CTCCCCCTTC TCTQTCCCCC GOCTCTOGCT CACCCCGOCC 120 

CACTCCAGOG 6CGACTTTGA GGGATTCCCT CTCTGGC6GC CTCT G CAGCA GCACAGC066 180 

CCTCATTOGG OGCACTGOQA GTATG6ATCT GCAAGQAAGA GGGOTGCCCA GCATCQACAO 240 

ACTTCGAGTT CTCCTGATGT TOTTCCATAC AATGGCTCAA ATCATGGCAG AACAA6AAGT 300 

GGAAAATCTC TCAGGCCTTT CCACTAACCC TGAAAAAQAT ATATTTGTGG TGOGGGAAAA 360 

TGGGA0GACX3 TOTCTGATOa CAQAOTTTGC AGCCAAATTT ATTGTACCTT ATGATGTGTG 420 

GQCCAGCAAC TAOQTAGATC T6ATCACAGA ACAGGCOQAT ATO OCATTGA CC06GGGAGC 480 

TGAQ0T6AA0 GGC0GCTGT6 6GCACAGCCA OTOGGAaCTG CAAGTQTTCT GGGTGGATOO 540 

OOCATATGCA CTCAAAATGC TCTTTGTAAA GGAAA6CCAC AACATGTCXaV AGGGACCTGA 600 

GGCX3ACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTAOGAC TCXnOGGAGA AAACCCACTT 660 

CAAAGAOGCA GTCAGTGCTO GQAAOCACAC AGCCAACTOQ CACCACCTCT CTGCCTTGGT 720 

CACCCC06CT GGGAAGTCCT ATGAGTGTCA A6CTCAACAA ACCATTTCAC TGGCCTCTAG 780 

TOATCGGCAG AAGAGQOTCA CCATGATCCT GTCTGOGOTC CACATCCAAC CTTTTGACAT 840 

TATCTCA6AT TT TG T C TTCA GTGAA6AGCA TAAATGCCCA 6TGGATGAGC GGGAGCAACT 900 

GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTOOTCATCA T6GTAACACT 960 

CGCXSATTTAC CACGTCCACC ACAAAATOAC TQCCAACCAG GTOCAQATCC CTCGGGACAG 1020 

ATCCCAGTAT AAGCACATGO GCTAGAGQCC 0TTA6GCA60 CACXXXXTTAT TCCTGCTCCC 1080 

CCAACIQQAT CAGGTAOAAC AACAAAAGCA CTTTT OCAT C TTGT ACAOBA GATACA OCAA 1140 

CATAGCTAO^ ATCAAACAGG CCTGGGTATC T6AGGCTTGC TTG6CTTGTG TCCATGCTTA 1200 

AACCCAOGQA AGGGG6AGAC TCTTTCOQAT TTGTAGGGTG AAATGQCAAT TATTCTCTCC 1260 

ATGCTGGGGA GGAGGGGAGG AGGGTCTCAQ ACAGCTTTCG TQCTCATGGT GGCTTGGCTT 1320 

TGACTCTCCA AAGA6CAATA AATGCCACTT GOAOCTGTAT CTGGCOCCAA A6TTTAGGGA 1380 

TTGAAAACAT GCTTCTTTGA GGAGGAAACC CCTTTAGGTT CAGAA6AATA TGGGGTGCTT 1440 

TGCTCCCTTG GACACAGCT6 GCTTATCCTA TAGAGTTGTC AATGCACACA GAATACAACC 1500 

TCATGCTCCC TOCAGCAAGA CXXCTGAAAG TGATTCATGC TTCTGGCTGG CATTCTGCAT 1560 

GTTTAGTGAT TGTCTTGGGA ATGTTTCACT GCTACCCX3CA TCCAQCQACT GCAGCACCAG 1620 

AAAACX3ACTA ATGTAACTAT GCAGAOTTGT TTGGACTTCT TCCTGTGCCA GGTCCAAGTC 1680 

0GGGGACC1G AAGAATCAAT CTOTOTGAGT CTGTTTrTCA AAATGAAATA AAACACACTA 1740 
TTCTCT6GC 

Seq ID NO: 458 Protein sequence 
Protein Accession #i 27P_036393.1 

1 11 21 31 41 51 

I I ) I I I 

MDLQGRGVPS IDRLRVLLML FHTMAQIMAB QBVENLSGLS TNPBKDIPW REKGTTCLMA 60 

BFAAKFXVPY DVHASNYVDL ITBQADIALT RGAEVKGRCX3 HSQSELQVPW VDRAYALKML 120 

PVKESHNMSK GPEATWRLSK VQPVYDSSEK THFKDAV8A0 KHTANSHHLS ALVTPAGKSY 180 

BGQAQQTISL ASSDPQKTVT MILSAVHIQP PDZISDFVF8 BEBKCPVDBR EQIiEBTlPLI 240 
LGLILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYXHMO 

Seq ID NO 3 459 PWA sequence 

Nucleic Acid Accession tft HN_001169.1 

Coding sequence: 85.. 870 

1 11 21 31 41 51 
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I I I I i 1 

TAGOAOATAA GAGTATCTTG CACAGCAGGT GC3\GGTTTCC CAOCAGCTCA GGCAAGAGTC 60 

CGATQTTTCT GCCATCIGAT CCTGATGTCT GQAOAOATAG COVTGTGTGA GCCTQAATTT 120 

GGCAATGAOV AG0CCACG6A GCXmGOGTG GGTG G CAGGT GG0QA6TQTC CTGGTACGAA IBO 

5 CGOTTTGTGC AGCCATGTCT GGTOSAACTG CTGGGCTCTO CTCTCTTCAT CTTCATOGOO 240 

T0CCT0TCX3G TCATTGAGAA TGOOACGGAC ACTGGGCTGC TGCAGCX3GGC CCTGGCCCAC 300 

GGGCTGGCTT TGGGGCTCGT GATT6CCA00 CTGGGGRATA TCAGTGOTGO ACACTTCAAC 360 

CCTGOGGTGT CCCTGGCAGC CATGCTGATC GOAGGCXTTCA ACCTGGTOAT GCTCCTCCCG 420 

TACTGGGTCT CACAGCTGCT CGOGGGGATO CTCGGGGCTG CCTTGGCCAA GGTGGTGAGT 480 

10 CCTGAGGAGA GGTTCTG6AA TGCATCTGGG 60G6CCTTT6 TQACAOTOCA GQA6CA6GGG 540 

CAGGTGGCAO GG606TTGGT GGCAOAGATC ATCCTGA06A 06CTGCTG6C CXTOGCTOTA 600 

TOCATGG G TG CCATCAAT6A GAAGACAAAG GGCC C TCTGG CCCGGTTCTC CAT06GCTTT 660 

GCCGTCACCG TGQATATCCT QGCTQGGGGC CC rG TOTCT G 6AGGCTGCAT GAATCCCGCC 720 

CGTGCTTTTG OACCTOCGGT GQTOGOCAAC CACTGGAACT TCCACTGGAT CTACTGGCTG 780 

15 GGOCCACTCC TGGCTQGCCT GCTTGTTGGA CTOCTCATTA GGTGCTTCAT TOGAGATOGG B40 

AAOACXX33CC TCATCCTGAA GOCTOGXnGA GCAGAGCTOG TGGGATTCXrT GCTQCTCCAO 900 

GTGTCCTCAG CTCACCTGTC CCAGACTGAG GAaUSGGGAO TTOCTGCATT TCCTOCCAGG 960 

GCAGAGGCCC AGAGGAGCGA CCCCCTGCTT CCaCTGCTTG GGCCTGCTTT CTCAGATAGA 1020 

CTGACTGCTG AGGAGGCTCT AGGTTCTTGG AATTCCTTTG TGCTCATCAG AGACCCCAGC 1080 

20 CTGGGGAACA OGCTGCXZOGC ACTGGCCAOA GAGCAOTGCA AACACCACAA CACGAGCGTG 1140 

TTTCTTGAOA GQAATGTCCC OQAGTTGGAC AAGQAG6CTG TTTCT G CACA TCAGCTCATT 1200 

TCCOGCAfXC CATTTCTTGC TTOATTGCTT TOTTGGOOaC CPOGOCACTT CCTT6CTTCT 1260 
CAAGCTOACA ATTCrCACTT TQCAATAAAT AGTCCAOTGT TTGCTTCAT 

25 

Seg ZD MOt 460 Protein sequence 
Protein Accession #t NP_001160.1 

1 11 21 31 41 SI 

30 I I I I I I 

MSGEIAHCBP BFGNORARBP SVGGRWKVSM YERFVQPCLV SLLOSALFIF IQCLSVXENO 60 

TDTGLLQFAL AKOLALGLVX ATLGKISGGH FNPAVSLAAH LIG6LNLVML LPYHVSQIiLQ 120 

GMLGAALAKV VSPBERFHNA SQAAFVTVQE QGQyAGALVA EIZLTTUAb AVCMOAINEK 180 

TKGPLAPFSI GFAVTVDIIiA QGFVSGGOIN PARAFGPAW ANSWNFHWXY WliGPLLAGZiL 240 
35 VGLLIRCPIG DGKTRLHiKA R 



Seq ID MOt 461 DKA sequence 
Nucleic Acid Accession #t NH_003226.1 
40 Coding sequence: 2.. 226 

1 11 21 31 41 51 

I I I I I I 

GATGCTGGGG CTGGTCCTGG CCTTGCTGTC CTCCAGCTCT GCTQAGGAGT ACGTGGGCCT 60 

45 GTCTGCAAAC CAQTGTGCX6 TGCOGGCXaA GGACAGGGTO GACTGOGGCT ACCCCCATGT 120 

CACCCCCAAG OAOTOCAACA AOOCSGGGCTQ CTQCITTQAC T0CAG6ATCC CTGGAGTGOC 180 

TTGGTGTTTC AAGCCCCTX3A CTAOGAAGAC AGAATOCACC TTCTGAGGCA CCTOCAGCTQ 240 

CCCCTGGOAT GCAGGCTGA6 CACCCTTQCC CGGCTGTQAT TGCTGOCAOS CACTGTTCAT 300 

CTCAQTTTTT CTGTCCCTTr GCTCCOGOCA AGCTTTCTGC TGAAAOTTCA TATCTGQAGC 360 
50 CTGATGTCTT AAOOAATAAA GGTCCCATGC TCC»CCOO 

8eq ID N0< 462 Protein sequence 
Protein Accession #i 2IP_003217.1 

55 1 11 21 31 41 51 

I I I I I t 

MLQLVIALI.S SSSAEBYVGL SANQCAVPAK DRVDOGYPUV TPKBCHNSGC CFDSRZP6VP 60 

WCFKPI/TRKT ECTF 

60 Seq ID NO: 463 DMA sequence 

Nucleic Acid Accession #f NM_002993.1 
Coding sequence i 64.. 408 

,^1 11 21 31 41 51 

65 t I t I I I 

GGGAOQAOCC A0TCTCG6CQ CCTCCACCCA GCTCAOOAAC CG6CGAACCC TCTCTTGACC 60 

ACTATQAOCC TCCCGTOCAO COGCGOGGCC CGTGTCCCGO GTCCTTOOGQ CTCCTTGTGC 120 

GCGCTGCTOG CGCTGCTGCT CCTGCTOAOQ OCJGCOGGGGC CCCTOOCCAa OOCTGOTCCT 180 

GTCTCTGCEO TGCTGACAGA GCTQCOTTGC ACTTOTTTAC GOGTTACGCT GAGAGTAAAt 240 

70 COCAAAAOQA TTGGTAAACT GCAGGT6TTC COOGCAGGGC CGCAGT6CTC CAAGQTQGAA 300 

GTQOIAGCCT CCCIGAAOAA OBGOAAGCAA tfm t l T CT G Q ACCCGGAAGC OCCTTTTCTA 360 

AAGAAAGTCA TCCAGAAAAT TTTOGACAGT GGAAACAAGA AAAACTGAGT AACAAAAAAG 420 

ACCATGCATC ATAAAATTGC CCAGTCTTCA GOOGAGCAGT TTTCTSGAQA TCCCTG GACC 480 

CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT 540 

75 GAAGAGTGTG G6GGAAAGCC TAC6CTTCTC CCTBAAGTTT ACAOCTCAGC TAAT OAAGTA 600 

CXAATATAOT ATTTGCACTA TTTACTOTTA TTTTA0CT6A TAAOTTATTG AACCCTTTG6 660 

CAATT6ACCA TATTGTGAGC AAAGAATCAC TGQTTATTAG TCTTTCAATG AATATTOAAT 720 

TGAAGATAAC TATTGTATTT CTATCATACA TTCCTTAAAG TCTTACCGAA AAGGCTGTGG 780 

ATTTCOTATG GAAATAATGT TTTATTAGTQ TGCTQTTGAG GQAGGTATCC TQTTGTTCTT 840 

80 ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT riT Ci'l^GOGQ AATAT GTTAC 900 

TCTTTACCCr AOQATQCTAT nAAOTTClTA CTGTATTAGA ACACTG0GTX3 TGTCATACCG 960 

TTATCTGTCC AGAATATATT TOCTTATTCA QAATTtCTAA AAATTTAAGT TCTGTAAG60 1020 

CTAATATATT CTCTTCCTAT GGmTAGAT OTTTGATOTC TTCTTAGTAT GGCATAATGT 1080 

CATOATTTAC TCATTAAACT TTGATTTTGrT ATGCTATTTT TTCACTATAO OATOACTATA 1140 
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ATTCTGGTCft CTAAATATAC ACTTTAOATA GATOAAOAAG CCCAAAAACA GATAAATTCC 
TGATTQCTAA TTTACATAGA AATGTATTCT CrrUaT ' lVl ' l TAA ATAA AAQ CAAAATTAAC 
AATGATCTGT GCTCTGCAAA G TTTT Q AAAA TATATTTGAA CAATTTQAAT ATAAATTCAT 
CATTTAGTCC TCAAAATATA TACAGCATTO CTAAGATTTT CAGATATCTA TTOTOGATCT 
TTTAAAGGTT TTGACCATTT TGTTATGAGO AATTATACAT OTATCAOVrT CA CTATA TTA 
AAATTGCACT TTTATTTTTT CCTOT Q TGTC ATGrTTGGTTT TTOGTACTTO TATTOTCATT 
TGGAGAAACA ATAAAAGATT TCTAAAOCAA AAAAAAAAAA AAAAAAA 

Seq ID NOt 464 Protein sequence 
Protein Accession tt: NP_0029e4.1 

X 11 21 31 41 51 

11)111 
MSLPSSRAAR VPQPSGSLCA LLALT.TiT.T.TP PGPLASAGPV SAVLTBLRCT CLRVTLRVNP 60 
KTIGKLQVPP AGPQCSKVBV VASLKNGKQV CLDPBAPPIiK KVIQKIIjDSG NKKN 

Seq ID NOs 465 DNA Bequence 
Kucleic Acid Accession ii: NM_002038.2 
Coding sequence: 108.. 500 

1 11 21 31 41 51 

I I I 1 1 I 

GAACCGTTTA CTOGCTQCTG TGCCCATCTA TCAQCAGGCT COSGGCTGAA GATTGCTTCT 60 

CTTCTCTCCT CCAAGGTCTA GTGAOGGAGC COGOQOGCGO OQCCACCATQ CGQCRGAAGQ 120 

0GGTATCX5CT TTTCTTOTGC TACCTOCTGC TCTTCACTTG CAGTGGGGTO QAGG CAGG TA 180 

AGAAAAAOTG CT0GSASA6C TCGGACAGGG GCTGGGGGTT CTGGAAGGGC CTQACCTTCA 240 

TGGCCGTGGG AGGAGGACrC GCAGTCX3G00 GOCTGCGOQC 6CTGGGCTTC ACGGQGQC06 300 

OCATOGCGGC CAACTOGGTG GCTGCCTCX3C TGATGAGCTG GTCTGCGATC CTQAATQGGG 360 

GOGGCGTGCC CGCCGGGGGG CTAGTGGCCA CGCTQCAGAG CCTCGGGGCT GGTGGCAGCA 420 

GOQTOOTCAT AGGTAATATT GGTGCCCTGA TGGGCTACGC CACCCACAAG TATCTOGATA 480 

GTOAGQAGaA TGAGGAQTAO CX»GCAGCTC CCAOAACCTC TTCTTC C TTC TTQG CCTAAC 540 

TCTTCCAQTT AGOATCTAOA hCSTKiCCn TTTTTTTTTT ITPTTTTTTT TrTOAGATOG 600 

GTTCTCACTA TATTGTCCAG GCTAGAGTGC AGTGGCTATT CACAGATGCG AACATAOTAC 660 

ACTGCAGCCT CCAACTCCTA OCCTCAAGTG ATCCTCCTGT CTCAACCTCC CAAGTAGQAT 720 

TACAAGCATO CGCCGACQAT GCCCAGAATC CaOAACTTTG TCTATCACTC TCCCX»ACAA 780 
CCTAGA7GT0 AAAACAGAAT AAACTTCACC CAQAAAA 

Seq ID NOt 466 Protein sequence 
Protein Accession 8t NP_002029.3 

1 11 31 31 41 51 

1 1 1 I I I 

MRQKAVSLFL CYLUiFTCSG VEAGKRRCSB SSDSQSOFHX ALTFMAVG6G LAVAGLPAL6 60 

FTGAGIAANS VAASLMSHSA XUIOGGVPAO GLVATIiQSLG AGGSSWIGN ZGALMGYATH 120 

KYLDSEEDBB 

Seq ID NO: 467 DMA sequence 

Kucleic Acid Accession #t IIM_003469.2 

Coding sequence: 92.. 1945 

1 11 21 31 41 51 

11)111 

GAAACGGCCC GAOAAGCTOQ CCOGQAGAAC GGGGAGGAAT ATGCTGTGGA GCTCC TCTG C 60 

CATATAAACA AAAAQAGQAA ATCTTTCAAA CATG6CTQAA GCAAAGACCC ACTGGCTTGG 120 

AGCAGCCCTG TCTCTTATCC CTTTAATTTT CCTCATCTCT GGGGCTGAAG CAGCTT CATT 180 

TCAQAOAAAC CAGCTGCTTC ASAAAGAACC AGACCTCAGO TTGOAAAATO TCCAAAAOTT 240 

TCCCAGTCCT GAAATQATCA GGGCTTTGGA OTACATAGAA AACCTCOGAC AACAAGCTCA 300 

TAAGGAAQAA AGCAQCCCAO ATTATAATCC CTAOCAAGGT GTCTCTGTCC CCCTTCAGCA 360 

AAAAQAAAAT GGOGATGAAA GCCACTTGCC CGAGAGGOAT TCACTQAOTG AAGAAGACTG 420 

GATGAGAATA ATACTCOAAO CTTTGAQACA 6GCTGA AAAT GAGCCTOHGT CTGCACCAAA 480 

AGAAAATAAO CCCTATQCCT T6AATTCAGA AAAOAACTTT CCAATOGACA TQAGiaATGA 540 

TTATOAGACA CAGCAOTGGC CAOAAASAAA GCTTAA6CAC AT6CAATXCC CTCCTATOTA 600 

TGAAQAGAAT TCCAGGGATA ACCCCTTTAA AC6CACRAAT GAAATAGTaG AOGAACAATA 660 

TACTCCTCAA AOCCTTGCTA GATTGOAATC TOTCTTCCAA GAGCTGGGGA AACTGACAGG 720 

ACCAAACAAC CAGAAA06TG AGAGGAT6GA TGAGGAOCAA AAACTTTATA OGGATGATOA 780 

AGATGATATC TACAAOOCTA ATAACATTQC CTATGAAOAT OrGGTOGGGG GAGAAOACTG 840 

GAACCCAOTA GAGGAOAAAA TAGAGAaTCA AACCCAQQAA GAGOTGAGAG ACAGCAAAGA 900 

GAATATAGGA AAAAATGAAC AAATCAACOA TGAGATGAAA CGCTCAOGOC AGCTTOGCAT 960 

CCAGGAAOAA GATCTTCQQA AAGAQAOTAA AQACCAACTC TCAGATGATG TCTCCAAAQT 1020 

AATTGOCTAT TTGAAAAGQT TAGTAAATGC TGCAQGAAGT GGGAi gTT AC AGAATGGGCA 1080 

AAATGGGQAA AGOGCCACCA GGCTTTTTGA QAAAGCTCTT GATTCTGAGT CTATTTATCA 1140 

QCTGATTGAA ATCTCAAGGA ATTTACAGAT ACCGOCAGAA GACTTAATTO AG ATGCTC AA 1200 

AACTGGGGAG AAGCOGAATO GATCAQTGGA ACOOOAQCGQ QAGCTTGAOC TTCCTGTTGA 1260 

CCTAGATGAC ATCTCAGAaO CTGACTTAGA CCATCCAGAC CTGTTCCAAA ATAGGATGCT 1320 

CTCCAAGAGT GGCTACCCTA AAACACCTGG TCQTGCTGGO ACTGAG6CCC TACCAGAOGG 1380 

GCTCAGTGTT GAGGATATTT TAAATCTTTT AGOGATGGAO A6TGCAGCAA ATCA6AAAAC 1440 

OTCGTATTTT CCCAATCCAT ATAACCAGGA GAAAGTTCTO CCAAGGCTCC CTTATGGTGC 1500 

TGGAAGATCT AGATCGAACC AGCTTCCCAA AOCTGCCTGG ATTCCACATQ TTGAAAACAG 1560 

ACAOATGaCA TATQAAAACC TGAACQACAA GGATCAAGAA TTAGQTGAOT ACTTGOCCAG 1620 

GATGCTAGTT AAATACOCTG AGATCATTAA TTCAAACCAA OTGAAOOQAO TTCCTGGTCA 1680 

AGGCrCATCT QAAGATGACC TQCAGGAAGA GGAACAAATT GA6CAG60CA TCAAAGA8CA 1740 

TTTGAATCAA GGCAGCTCTC AOQAOACTGA CAAOCTGGCC OC GGTOAO CA AAAGOTTCOC 1800 

lOrGGGGOOC OOGAAOAATO ATGATACCCC AAATAGGCAO TACTGGGATG AAGATCTGTT 1860 

AATGAAAinO CTGGAATACC TCAATCAAGA AAAOGCAOAA AAGGQAAGGG AOCATATTGC 1920 
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TAAGAGAGCa ATGGAAAATA TGTAAGCTGC TTTCATTAAT TACCCTACTT TCATTCCTCC 1980 

CACCCCAAGC AAATCCCAAC ATTTCTCTTC AGTGTGTTGA CTTCTATCCT GTTAACACTQ 2040 

TAATATCTTT AAATGATGTA CAOGCAGATO AAACCAQGTC ACTGGOOAGT CT6CTTCATT 2100 

TCCTCTGAOC TGTTATCTTG TGTATGGATA TQTQTAAATG TTATGACTCC TTQATAAAAA 2160 

5 ATTTATTATG TCCATTATTC AAGAAAGATA TCTATGACTG TGTTTAATAO TATATCTAAT 2220 

GGCTGTG6CA TTGTTQATGC TCACATATQA TAAAAAAOTQ TCCtATAATT CTATTGAAAO 2280 

TTTTTAATAT TTATTQAATT ATTTTGTTAC T6TCIGTA0C OTTTTGTGGA GTACTGGAOC 2340 
AAAAAAATAA AGCATTATAA ATATA 

10 Seq ID NO: 468 Protein sequence 
Protein Accession fis NP_003460.l 

1 11 21 31 41 51 

-^111 ill 

15 MABAKTRWIiG AALSLIPIiIF LI6GABAA8F QRNQLLQKEP DLRLSIVQKF PSPEMIRALE 60 

YIENLRQQAH KEESSPDYNP YQGVSVPLQQ XENODBSHLP ESOSLSfiEDN MaiZLEAX«RQ 120 

AENEPQSAPK ENXPYAUN8E XNFPMDMSDD YETQQWPERK LKKMQFPPMY EBNSRDNPFK 180 

RTNBIVBEQY TPQSLATLES VPQELGKLTQ PNMQXRBRMD BEQKXiYTDDE DDIYKANNIA 240 

YBDWGOBDW NPVEEKIESQ TQBBVRDSKB HIGKNEQIND EMKRSGQLQI QEBDLRKESK 300 

20 DQLSDDVSKV lAYLKRLVNA AGSGRLQNGQ NGBRATRLFB KPLDSQSIYQ LIEISRNLQI 360 

PPEDLIEMLK TQBKPHOSVE PBRBLDLPVD LDDZSBADIiD HPDliFQN3tML 8K8GYPRTPG 420 

RAGTBALPDO LSVEDXLNLL GME8AANQKT 8YPPKPYKQB KVLPBLPYOA GR8RSNQLPK 480 

AAWIPHVB7R QMAYENLNDK DQEUSEVXAR MLVRYPBZIN 8MQVKRVPGQ GSSEDOLQBB 540 

EQIEQAIKEH LNQGSSQBTD XLAPV8KRPP V6PPKMDDTP NROYHDEDLL MKVLBYX^B 600 
KAEKGREHIA XRAHENM 



25 
30 



55 



6eq ZD NOt 469 DNA sequence 
Nucleic Acid Accession tt: MM_00639e.l 
Coding sequence > 19.. 516 



1 11 21 31 41 51 

I I I I I 1 

GGCCCCTTGT CTGCAGA6AT GGCTCCCAAT GCTTCCTQCC TCIXSTGTGCA TGTCCQTTCC 60 

GAGGAATGGG ATTTAATGAC CTTTGATGCC AACCXATATG ACAGCQTOAA AAAAATCAAA 120 

35 GAACATGTCC GGTCTAAGAC CAAGQTTCCT GTGCAGGACC AGGTTCTTTT GCTGGGCTCC 180 

AAGATCTTAA AGCCAOQGAG AAOOCTCTCA TCTTATGGCA TTQACAAAGA GAAGACXrATC 240 

CACCTTACOC TGAAAGTGGT GAAOGCCAOT GATGAG6AGC TGCCCTTGTT TCTTGTGGA6 300 

TCAGGTGATG AGGCAAAGAG GCACCTCCTC CAGGTGCGAA GGTCCAGCTC AGTQGCACAA 360 

GTQAAAOCAA TGATCGAGAC TAAQACS3GGT ATAATCCCTG AGACCCAQAT TQTGACTTOC 420 

40 AATGGAAAGA GACTG6AAGA TGGGAAOATG ATGGCAGATT ACGGCATCAG AAAGGGCAAC 480 

TTACTCTTCC T06CATCTTA TTGTATTGGA GGOTGAGCAC OCTGGGQATQ QaOTOTTOQC 540 

AGGGGTCAAA AAGCTTATTT CTTTTAATCT CTTACTCAAC GAACACATCT TCTGATGATT 600 

TCCCAAAATT AATGAGAATQ AGATGAGTAG AGTAAGATTT G6GTGGGATG GGTAOGATGA 660 

AGTATATTGC CCAACTCTAT GTTTCTTTOA TTCTAACACA ATTAATTAAG TGACATGATT 720 
45 TTTACTAATO TATTACTGAO ACTAGTAAAT AAATTTTTAA GGCAAAATAG AGCATTC 

Seq ID VOi 470 Protein sequence 
Protein Accession #s 1IP_006389.1 

50 1 11 21 31 41 51 

I ' I I 1 I t 

MAPMASCIiCV HVRSEENDLN TFDANPYDSV KKZKEBVRSK TKVFVQPQVZi ULGSKIXiRFR 60 
RSLSSYGXDK EKTXHLTLKV VXFSDEBLPIt FLVESGDBAK RBLliQVRRSS SVAQfVKAraE 120 
TKTGIZPETQ IVTCNGKRIiB DOKimADYOI RXGNLLFLAS YCIGO 



Seq ZD NO: 471 DKA sequence 

Nucleic Acid Accession #t XM_094741.1 

Coding sequence t 1 . . 948 



60 1 11 21 31 41 51 

I t I 1 I I 

ATGAAGGCCA ACTACAGCGC AGAGGAaGOC TTTCTCCTGC TGGGTTTCTC COACTGGCCT 60 

TCCCTGCAGC CGGTCCTCTT OGCCCTTGTC CTCCTGTGCT ACCTCCTQAC CTTQACGGGC 120 

AACTOGGCGC TGGTGCTGCT GGCGGTGCGC GACCCGOQCC TGCACAOGCC CATOTACTAC 180 

65 TTCCTCTGCC ACCTGGCCTT GQTAOACQCO GGCTTCACTA CTAGC6TGGT GCCGCOGCTO 240 

CTGGCCAACC TGCGCGGACC AGC6CTCTG6 CTGOCSOGCA GCC ACTOC ftC OGCCCAGCTO 300 

TGOGCATOGC TGGCTCTGGG TTCGGCCGAA TGOOTCCTCC TGGCGGTGAT G6CTCTG6AC 360 

CGCGCGGCOG CAGTGTGCCG CCCGCTGCGC TATGCGGGGC TCGTCTCCCC GCGCCTATGT 420 

OGCAOGCTGG CCAGOGCCTC CTQGCTAAGC GGCCTCACCA ACTCGOTTGC GCAAACCGCG 480 

70 CTCCTGGCTG AGCGGCCGCT GTGCGCGCCC GGCCTGCTG6 ACCACTTCAT CTGTG AGCT G 540 

CCGGCGTTGC TCAAGCTGGC CTGCGQAGQC 6A0G6AGACA CTACOOAGAA OCAOATSTTC 600 

GCCQCCCGCG TGGTCATCCT GCTGCTOCCG TTTGCCGTCA TCCTG6CCTC CTAOQGTGCC 660 

GTGGCCCOAG CTGTCTGTTG CATGCGOTTC AGOGGAGGCC GGAGGAGGQC GOTOGGCACG 720 

TGTGG6TCCC ACCTGACAGC CGTCTGCCrG TTCTACGGCT OGGCC ATCTA CACCTACCTO 780 

75 CAGCCOGCGC AGC6CTACAA CCAGGCAOGG G6a\AGTTGG TATOOCTCTT CTACACCGTG 640 

GTCACACCTG CTCTCAACCC GCTCATCTAC ACGCTCAGQA ATAAOAAAGT GAAGGGG6CA 900 
G06AGQAGGC TOCTOOGGAG TCTGGGGAGA GGGCAGGCTO GGCAOTGA 

OA ^'^'^ Protein sequence 

80 Protein Accession #t XP_094741.1 

1 -11 21 31 41 51 

I 1 i I 1 I 

MKAKY8ASBR PLIiUSPSDWP SbQPVLFALV LLCYLLTLTO N5ALVLLAVR DPRLHTPMYY 60 
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FLCHLALVDA GPTTSWPPL LANLRGPALW LPRSHCTAQL CASLALQSAB CVLLAVMALD 120 

RAAAVCRPUl YAGLVSPRliC RTLASASNI*S GLTNSVAQTA LLAERPI/CAP RLLDHFICEL 180 

PALLKIAGGO DGDTTQIQMP AARWIIiLLP PAVILASYGA VARAVCCMRF 5G6RRRAV0T 240 

CGSBLTAVCL FYGSAXYTYL QPAQRYIIQAR GKFV8LPYTV VTPALNPLZY TLRNKKVRGA 300 
5 ARRLLRSU3R 6QAGQ 

Seq ID NO; 473 DNA sequence 
Nucleic Acid Accession #t NM 001062.1 

10 ■ 

1 11 21 31 41 51 

I I I I I I 

GCTCTCATTA CCTTCTGCCC ATCACTTAAT AAATAOCCAS CCAATTCATC AACATTCTGO 
TACACTGTTO GAOAaATGAa ACft&TCACAC CAGCTGCCOC TAGTGGGGCT CTTACTCTTT 

15 TCTTTTATTC CAAGCCAACT ATGOGAGATT TGTGAGGTAA OTSAAGAAAA CTACATCCX3C 
CTAAAACCTC TQTTGAATAC AATGATCCAQ TCAAACTATA ACAGGGGAAC CAGaSCTGTC 
AATGTTGTGT TGTCCCTCAA ACTTGTTQGA ATOCAGATCC AAACCCTOAT GCAAAAQATG 
ATCCAACAAA TCAAATACAA TGTGAAAAGC AGATTGTCAG ATOTAAGCTC GGGAGAGCTT 
GCCTTOATTA TACTGGCTTT GGGAGTATGT CGTAAOGCTQ AGOAAAACTT AATATATGAT 

20 TACCAOCTGA CTGACAAGCT A6AAAATAAA TTCCAAGCAa AAATTGAAAA TATGGAAOCA 
CACAATGGCA CTCCCCTGAC TAACTACTAC CAGCTCAGCC TGGACGTTTT GGCCTTGTOT 
CTQTTCAATQ GQAACTACTC AACCGCCGAA GTTGTCAACC ACTTCACTCC TGAAAATAAA 
AACTATTATT TTGGTAGCCA GTTCTCAaTA GATACTGGTG CAATGGCTGT CCTGGCTCTG 
ACCTGTGTGA A6AAGAGTCT AATAAAT6GG CAGATCAAAO GAOATOAAGG CAOTTTAAAG 

25 AACATCAGTA TTTATACAAA GTCACTGGTA GAAAAGATTC TOTCTGAOAA AAAAGAAAAT 
GGTCTCATTG GAAACACATT TAGCACAGGA GAAGCCATGC AGGCOCTCTT TGTATCATCA 
GACTATTATA ATQAAAATGA CTGGAATTGC CAACAAACTC TGAATACAGT GCTCACGGAA 
ATTTCTCAAG GAGCATTCAG TAATCCAAAC GCTGCAGGCC AGGTCITAGC TGCCCTGATG 
GGAAAGACCT TCTTGQATAT TAACAAAQAC TCTTCTT60G TCTCTOCTTC AGGTAACTTC 

30 AACATCTCCG CTGATOAOCC TATAACTGTG ACACCTOCTG ACTOICS^ATC ATATATCTCC 
GTCAATTACT CTGTGAGAAT CAATGAAACA TATTTCACCA ATQTCACTGT GCTAAATGOT 
TCTGTCTTCC TCAGTGTGAT GGAQAAAQCC CAGAAAATGA ATQATACTAT ATTTGGTTTC 
ACAATGQAGG AGCGCTCATG GGGGCCXTTAT ATCACCTGTA TTCAGGGCCT ATGTQCCAAC 
AATAATGACA GAACCTACTQ GGAACTTCTO AaTGOAOQCO AACCACTGAG CCAAGGAGCT 

35 GGTAOTTACG TTGTCCGCAA TGGAGAAAAC TTGGAGGTTC GCTGQAGCRA ATACTAATAA 
GCCCAAACTT TCCTCAGCTG CATAAAATCC ATTT6CAGTG GAOTTCCATO TTTATTOTCC 
TTATGOCTTC TTCTTCATTT ATCCCAGTAC GAGCAGGAGA GTTAATAACC TOOCCTTCTC 
TCTCTACATG TTCAATAAAA GTTGTTGAAA GATTAAC 

40 Seq ID NOt 474 Protein sequence 
Protein Accession «i 1IP_001053.1 

1 11 21 31 41 51 

AC \ I i i J ' 

45 KRQSHQLPLV GLLLFSPIP8 QLCBICEV6E ENYIRLKPUi NTMIQSNYNR GTSAVNWLS 

LKIiVGIQIQT LMQKMZQQXK YNVXSRIiSOV SSGBLALIIL AliGVCRNAEE NLIYDYHLTD 

KLQIKFQABI ENMEAHNGTP LTNYYOLStiD VLAIiCLPNQN YSTABWNBF TPKtnCNYYFQ 

8QPSVDTGAM AVLALTCVKK SItlNGQIKAD BOSLKNISIY TKSLVEKILS EKKENGLIGN 

^ TPSTQEAMQA LFVSSDYYNB NDWNOQQTLN TVLTBI8QQA FSNPKAAAQV IJ»ALMGKTFL 

50 DINXDS8CVS A8GNFHISAD EPITVTPPOS QSYISVNYSV RIHBTYFTNV TVIiMGSVFLS 

VKEKAQKMND TZFGFTMBBR SHGFYXTCIQ GLCAMNNDRT YMBLLS6GBP LSQGAGSYW 

RNGEHLEVRN SKY 

Seq ID NO: 475 DMA sequence 
55 Nucleic Acid Accession «» NM_0048S2.1 
coding sequence I 69. .1546 

1 11 21 31 41 51 

,n 1 ' » i • ' 

OU GCCOOCOOCC GCCCOGGGCX: CTOATGGACT GAATGAAGGC TGCCTACACC GCCTATOQAT 60 

GCCTCACCAA AGACCTAOAA CGTGOGCCAT GAACXX3GGAG CTGACAATGG AAAGTCTOGG 120 

CACTTTQCAC GO00C3GOGCG GGG60GGCAO T G G OGG G G GC 6G0GGCGGGG GGGGCGG06G 180 

OGGOGGCGGG GGCCOGGGCC ATGAGC3VGGA GCTGCTGGCC AGCCCCAGCC CCCACCAG6C 240 

- GCS3C00CGGC CCGOGTGGCT CGCTGOGGOO CXXTTCOGCOO OCTCCAACOG CX3CACCAGGA 300 

65 GCTGGGCACX3 G0GGCAGCXK3 OSGCAGOGGC GGOGTCGCGC TCGGCCATGG TCACCAGCAT 360 

GGOCTOGATC CTGOAGGGGO GG6ACTAGC0 OOOCXSAOCTC TOCATCCOGC TGCACCA06C 420 

CAT6AGCAT0 TCCIGOQACT OGTCTCOQOC TGOCATGGGC ATQA6CAACA CCTACACCAC 460 

GCTGACACOG CTCCAGCOGC TGCXavCCCAT CTCCACOGTG TCTGACAAGT TCCACCACCC 540 

TCACCCGCAC CACC3VTC00C ACXyvCCACCA CCACCACCAC CACCAGOGOC T6TCCGGCAA 600 

70 CGTCAGOGGC AGCTTCACCC TCATGCGCX3A CGAG060GGG CTCCCGGCCA TQAACAACCT 660 

CTACAGTCCC TACAAGGAGA TGCCOGGCAT QAOCGAGAGC CTGTGOOOQC TGGCOOCCAC 720 

GCCX3CTGGGC AAGOGGCTAO GOGGCCTCCA CAAOGC96CAG CAGAGTCTGC OCAACTAOGG 760 

TC06C0GGGC CAOGACAAAA TGCTCAGCCC CAACTTOGAC GOGCACCACA CTGCCATGCT 840 

GACCCGOGGT GAOCAACACC TGTOCCX3CGG CCTQGQC3«X: CCACCTGCGQ CCATOATGTC 900 

75 GCACCTGAAC GGCCT6CACC ACXrOGGGCCA CACTCAGTCT CAOQGGCOGG TGCTGGCACC 960 

CAGTOGGGAO 06GCCACCCT OOTCCTCATC GGGCTCGCAO GTGGCCAOQT GQGGCCA6CT 1020 

G^AAOAAATC AACACCAAAG AGGTGGOCCA GOGCATGAGA GCQGAGCTGA AGGGCTACAG 1060 

TATCCOOCAG GOQATCTTTG OGCAGAGGGT GCTGTOCOOO TCTCAGGGGA CTCTCTCCGA 1140 

CCTOCTCOGG AATCCAAAAC CX3TGGAGTAA ACTCAAATCT GGCAGGGAGA CCTTCCGCAG 1200 

80 GATGTGGAAG TGGCTTCAGG AGCCOGAGTT CCAGOGCATO TOOGCCTTAC GCCTGGCAGC 1260 

OTGOVAAOOC AAAGAGCAAG AAOCAAACAA AOACAGGAAC AATTCCCAGA AGAAOTCCOG 1320 

CaGGTGTTC ACTGACCTCC AA0QCG6AAC ACTCTTOGCC ATCTTCAAGG AGAACAAACG 1380 

OCOQTCAAAG GAGATGaUIA TCACCATTTC CCA0CA8CTG GGCCTGOACC TCACAAC06T 1440 

CAGGMCTTC TTCATGAA06 CG0C3GGQOC0 CAGGCTGGAG AAOTOOCAAG AOOATCTGAG 1500 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



60 
120 
180 
240 
300 
360 
420 
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10 
15 



CACAGGGGGC TCCTCGTCCA CCTCCAGCAC GTOTACCAAA GCATGWIGGA AGGACTCTCA 1S60 

CTTGG6CAOV AGTCACCTCC AAATQAGQAC AACAGATACC AAAAGAAAAC AAAGQAAAAA 1620 
GACACXX5GAT TCCTAGCTOO QGCCCTTCAC TGGTG 

Seq ID HO: 476 Protein sequence 
protein Accession 0t NP_004B43.1 

1 11 21 31 41 51 

I I i i I I 

MUPBLTMESL GTLHGARGGO 8GGGGGGGGQ GGGGGPGHBQ BLLASPSPHH ARRGPRGSLR 60 

GPPPPPTAHQ ELGTAAAAAA AASRSAMVTS MASILDGGDY RPELSIPLHH AMSMSODSSP 120 

PGKGMSNTyr TLTPLQPLPP ISTVSDKFBH PHPHHHPHHH HHHHHQRLSG NVSGSFTLMR 180 

DSRGLPAMNN ItYSPYKBMPG HSQSX^SPLAA TPLGNGIiGGIi HNAQQSLPNY GPPQHDKMLS 240 

PNFDAHHTAN IiTRGSQHLSR GLGTPPAAKN SHUIGLBHPO BTQSBGPVLA PSRBRPPS88 300 

S6SQVAT8GQ LEBIKTXBVA QRITASLXRY 8IPQAIFAQR VLCRSOQIXfi DLLRNPKPWS 360 

KLKSGRETPR RMHKNLQBPE FQRMSALRLA ACKRKEQBFN RDRNNSQKKS RI.VFTDLQBR 420 

TIiFAIFKEHK RPSXEMQITI SQQLGIiBLrr V8HPFNNARR RSLEKHQDDl* STOGSSSTSS 480 
TCTKA 

20 Seq ZD NOt 477 DNA sequence 

Nucleic Acid Accession tft NM_013271.1 
Coding sequencet 27.. 809 * 

1 XI 21 31 41 51 

25 1 I I I I I 

TCGGGAGCCA GGCTOGCTGQ GGCAGCATGG CGGGGTCOCC 6CTGCTCTGG OGGCCGGGGG 60 

CCG6GGGCGT CGGCCTTTTG GTGCTGCTQC TGCTCGGCCT OTTTCGGCCG CCCC006CGC 120 

TCTG0G06CG OCCGGTAAAG GAACOCOGOG GCCTAAGOGC AGOTrCTCCG CCGTTG6CTG 180 

AGACTGGOGC TCCTGGCOGC TTC0G6GG0T CAGTGCCCXX3 A6GTGA6G00 GOGGGGGOGG 240 

30 TGCAGQAGCT GGOGCGGGCG CTGGOGGATC TGCTGGAGGC OGAACGTCAO GAGGGGGG6C 300 

GGGCCX3AGGC GCAGQAGGCT 6AGGATCAGC AGGCX3CGGGT CCTGOCGCAG CT0CT0CX3CG 360 

TCTGGGQCGC CCCCCGCAAC TCTGATCCGG CTCTGGGCCT GQACQAOQAC CCCXSAOGCGC 420 

CTGCAG06CA GCTCGCTC6C GCTCTGCTGC 600CCOGCCT TGACCCTGCC GCCCTAGC3U3 480 

GCCAGCTTGT OCC06CX3CCC 6TCCCCX3CC6 CX36CX3CTCCG ACXXXGGOCC CGGGTCTAOG 540 

35 AGGAOGGOCC GOCGGGGCOG GATGCT6AGG AGGCAGGOQA OQAQACAOCC GAC6TGGACC 600 

CGGAGCTGTT GAGGTACTTG CTGGGACGGA TTCTTG06GG AAOOQOGGAC TCC6AGGGGG 660 

TGGCAGCCCC GCGCCGCCTC CGCCGTGCCG CCGACCACX3A TGTGGGCTCT GAGCTGCCCC 720 

CTGAGGOOGT GCTGGGGGOG CTGCT606TG TGAAAOGCCT AQAGACCGOO GOGOCCCAGG 780 

_ TGCCT6CAGG CXgCCTCTl'G OCACCCTGAG CACTG0CXX3G ATCOOGIGCA CCCTGG6A0C 640 

40 GAGAAOIGCC GOOGOCATCC OGCCAOCAGG ACTTCTCCGC GCCAGCAGOT CCA6AGCAAC 900 

TTACOCCGGC GAGCCAGCCX: TCTCACCOGA GGATCGCTAC CCOCTGGOCC ACAATAACAT 960 
GATCTGAGC 

Mm ID NO: 478 Protein sequence 

45 Protein Accession NP_037403.1 

1 11 21 31 41 51 

I I I I I I 

MAOSPLIinOP RAGGVGLLVL LLIiGIiFRPPP ALCARPVKEP RGLSAASPPL AET6APRRFR 60 

50 RSVPRGBAAG AVQELARALA HLI£A£RQBR ARAEAQEAED QQARVLAQLL RVHGAPRNSD 120 

FALGLDDDPD APAAQLARAIi LRARLDFAAl* AAQLVPAFVP AAALRPRPFV YDD6PA6PDA 180 

EEAGDBTPDV DPELLRYLZX3 RILAGSADSB GVAAPRRLRR AADHDVG3EL PPBGVIiGAUj 240 
RVKRLETPAP QVPARRLLPP 

55 Seq ZD NOt 479 DNA sequence 

Nucleic Acid Accession #i NMJ)02214 
Coding sequence! 681.. 2990 ~ 

1 XI 21 31 4X 51 

60 I I I I I 1 

CCCAGAGCCO CCTCCCCCTG TTGCTGGCAT CCCGAGCTTC CTOCCTTQCC AGCXa«X5AC0 60 

CTQCCGACTT GTCTTTGCCC GCTGCTCCX3C AGAOGGGGCT GCAAAGCTOC AACTAATGGT 120 

GTTGGCCTCC CTQCCCACCT GTGGAAGCAA CTGCXSCTGAT TGAT60GCCA CAGACTTTTT 180 

^_ TCGCCTCQAC CTCGCCGGCG TACCCTCCCA CAGATCCAGC ATCACCCAGT GAATGTACAT 240 

05 TAGGOTOGTT TCCCCCCCAO CTTCGGGCTT TGTTTGGGTT TGATTGTGTT TGGCTCTTOQ 300 

CTAAGCTQAT TTATSCAGCA GAAGCCCCAC OGGCTGGAGA GAAACAAAAG CTCTTT T CTT 360 

TGTCCCQGAO CAGGCTGOG6 AGCCCTTGCA GAGCCCTCTC TCCAGTGGCC GCCGGGCXXZT 420 

TGGCCGTOGA AOQAGOTGCT TCTC0G66AG ACOGCGQGAC CC6CCGT60C GAGCCGGGAO 480 

_ GGCGGTAGGG GCCCT6A6AT GCCGAGCGGT GCCOGGGCCC GCTTACCTGC ACC6CTTGCT 540 

70 C0QAGC060G GGGTCOQCCT 6CTA66CCTO CQGAAAA06T CCTAGOQACA CT06GC060G 600 

GGCCCCGAGG TOGCCOGGGA GGCC6A6CCC GOOTCCGGAA GGCAGCCAGG 06GCGGGC0C 660 

GGGOOOGGCT GTTTTQCATT ATGTGOGGCT OSGCCCTGGC TTTTTTTACC GCTGCATTTO 720 

TCTGCCTQCA AAAOGACCGQ CGAGGTCCOG CCTCGTTCCT CTGGGCAGCC TGGGTGTTTT 780 

CACTTGTTCT TGQACTGGGC CAAOGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 

75 CCTGTOCCAG QTGCCTTOOG CTGGGTCCAO AATGTGGATO OTGTGTTCAA GAGGATTTCA 900 

TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960 

GCTCAOTTGA TTCAATAGAA TACCCATCTO TOCATGTTAT AATACCCACT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GOGTCCAGOA GCCQAAGCTA 1080 

ATTTTATGCT QAAAQTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTQTTG 1140 

OO ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCOTTGOA AACGATTTAT 1200 

CTASAAAAAT GGCATTTTTC TCC06TGACT TTOGTCTTGG ATTTGGCTCA TAOGTTGATA 1260 

AAACAGTTTC ACCATACATT AGCATCCACC COGAAAGGAT TCATAATCAA TGCAGTGACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 

TCACTGAGTT TQAGAAA6CA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAO 1440 
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AAOQAGGnr TGACaCCATQ CTTCAGOCAO CTQTCTGTOA AAGTCATATC GGATOG06AA ISOO 

AAGAGGCTAA AAG A TTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 

OCAAATTGGC AGGCATAGTG GTGCCCAATQ A06QAAACT0 TCATCTGAAA AACAAOGTCT 1620 

AOGTCAAATC QACAACCATG GAACACCCCT CACTAGOCCA ACTTTCAGAG AAATTAATAG 1680 

5 ACAACAACAT TAATGTCATC TTTGCAGTTC AAGQAAAACA ATTTCATTGO TATAAGGATC 1740 

TTCTACCCCT CTT6CCAG0C ACCATTGCTQ 6TGAAATAGA ATCAAAGQCT QCAAACCTCA 1800 

ATAATTTGGT AGT60AA6CC TATCAGAAGC TCATTTCAGA ACTOAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCOCCATCTG TCCAOATGGG TCCAQAAAQC 1920 

CACGCATGGA AGGATGCAGA AACX3TGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

10 TTACAAT6AA AAAATGTQAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAA7GA AAGOQCTAAA ATTCATATAC ACAGAAACTO CAGCTGTC3^ TGTGAGGACA 2100 

AGAGJU36ACC TAAAGQAAAG TGT6TAGA313 AAACTTTTCT AGATTCGAAG TGTTTCCAGT 2160 

GTaATGAGAA TAAATOTCAT TTT6ATGAA0 ATCAOTTTTC TTCTQAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTOTTTGC AGTGGTOGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

15 ACAAAATTAA GCTTGQAAAA QTGTATGQAA AATACTOTGA AAAGGATOAC TTTTCTTGTC 2340 

CATATCACCA TGOAAATCTG TGT O CT G G G C ATOOAGAGTG TGAAGCAG6C AGATGCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATOSATGCC AOT8CCCTTC AGCAGCAOCX: CAOCACTGTG 2460 

TCAATTCAAA GGGCCAAOTG TGCAGT6GAA GAGGCACGTG TOTGTOTGGA AGGT6TGAGT 2520 

QCACCX3ATCC CAGGAGCATC GGCXX3CTTCT GTQAACACTG CCCCACCTGT TATACAGCCT 2580 

20 6CAAGQAAAA CTGGAATTOT ATOCAATGCC TTCACCX:TCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTAT6TC GACCAAACTT 3700 

CAGAATGTTT CTGCAGCCCA AGCTACTTQA GAATATTTTT CATCATTTTC ATAOTTACAT 2760 

TCTTGATTGG GTTOCTTAAA GTCCTQATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2620 

ATAAAATTAA QTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

25 TSCAAAGTGT TT6CACAAGA GCAGTCACCT ACGQACX3TGA GAAGCCTQAA GAAATAAAAA 2940 

TGOATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAA AGAT TT 3000 

TTAAACACTT AA1GGGAAAC TQGAATTGTT AATAATTGCT CCTAAAQATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCAOGG TCATGCCAGT TGCTGGTTGT ACACTOGAAC 3120 

GAAQACTOAC AAGTATCCTC ATCATGATQT QACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

30 AAAATGTGTC TTACTACTGT TTGAGACTAG TGTOGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAO ATCAGCATAG AATQTAOATC CTCT OAAGAQ CACTG ATTAC ACTT TACAG G 3300 

TACCTGTTAT CCCTA06CTT CCCAGAGAGA AGAATGCTGT GA6AGAGTTT A0CATTGT6T 3360 

CACTACAAGG GTACAOTAAT CCCTGCACTG GACATGTGAO GAAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTQCCAAA CACTTCAACA OTTGOTGOTT GAATAGACAA GAACAGCTAG 3480 

35 ATGAATAAAT GATTOGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GrGTOTAOTT TTATOCATGT GTGTTTATOG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TOCAGCATTC TCTCCTCTTT GCCTTTATGT UTW ' m ' lt.T 3660 

TTTTTAGAGG ATAAGTTTAT GTATGTCACA OATQACTGGA TTAATCAAGT OCTAAOTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA 6AATACTAGT TTTAAAAGCT 3780 

40 GAAT6TTAA 

Seq ID NO I 480 Protein eequence 
Protein Acceseion #t 1IP_00220S 

45 1 H 21 31 41 51 

I I I I I I 

MCGSAIAFFT AAFVCLQNDR RGPA8FLHAA HVFSItVZiGLG QGEDNRCASS NAASCARCLA 60 
LGPSCGWCVQ EDPXSGGSRS ERGDIV8NLI 8KGCSVDSIB YPSVHVIIPT EKBZNTQVTP 120 
GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVeASMH NMIEKLNSVG NDLSRKMAFF 180 

50 SRDPRLGFGS WDKIVSPYI SIHFERIKNQ CSDYNLDCMP PHOYIHVLSI^ TENITBPBKA 240 
VBRQKISGNI DTPBGOFDAM LQAAVCBSHZ GWRKEAKRLL LVMIDQTSHL AU)8XLAaZV 300 
VPNDGHCHLK NNVYVRSTTH EHPSLQQLSE XLZONNINVI PAVQGKQPKH YXDUiPIiLPa 360 
TIAGBZESKA AKLNNLWBA YQKLISEVKV QVBNOVQGIY FNITAXCFDG SRKPGKEGCR 420 
HVTSNDEVLF HVTVTMKKCD VTGGKNYAII KPIGPNETAK IHIHRNCSOQ CEDNRGPNSK 480 

55 CVDBTPU3SK CFQCDEtncca FDBDQFSSES CKSBRDQPVC S6RGVCVCGK CSCHKIKLGK 540 
VYGKYCEKDO F8CPYRRGNL CAGHGECBAO R0QCP8GWE0 DR0QCP8AAA QHCVNSKGQV 600 
CSORGTCVGO RCECTDPRSI GRPCBHCPTC YTACKENHNC MQOiHPBKLS QAIIiDQCECTS 660 
CALMBQQBYV DQTSECFSSP SYLRIPPZIF ZVTFLZGLLK VI.IIRQVILQ UH8NKIKS8S 720 
SYRVSASKKD KLILQSVCTR AVTYIUtEKPB BIXMDXSKUI AKBTPR07F 

60 

Seq ZD NOi 481 DMA sequence 

Nucleic Acid Accession NM_003318.1 

Coding sequence i 1..2574 

65 X 11 21 31 41 51 

1 I 1 I I I 

ATGGAAT009 A6GATTTAA0 TGGCAGAGAA TTGACAATTG ATTCCATAAT GAACAAAGTQ 60 

AGAGACATTA AAAATAAG7T TAAAAAT6AA GACCTTACTG ATQAACTAAG CTTGAATAAA 120 

ATTTCTGCXG ATACTACAGA TAACTCGGGA ACTGTTAACC AAATTATGAT GATGGCAAAC 180 

70 AA00CAGAG6 ACT6GTT6AG TTTGTTGCTC AAACTAGA6A AAAACAGTGT TCCGCTAAGT 240 

OATGCTCTTT TAAATAAATT GA7TGGT09T TACAOTCAAO GAA7TQAAGC GCTT C OOOC A 300 

GATAAATATG GCCAAAATGA GAGTTTTGCT AOAATTGAAG TGAGATTTCC TGAATTAAAA 360 

GCTATTCAAG AGCCAGATQA TGCA06TGAC TACTTTCAAA TGGCCAGAGC AAACTGCAAG 420 

AAATTTGCTT TTGTTCATAT ATCTTTTGC3V CAATTTGAAC TGTCACAAGG TAATOTCAAA 480 

75 AAAAGTAAAC AACTTCTTCA AAAAGCTGTA GAACGTGGAO CAGTAOCACT AGAAATGCTO 540 

GAAATTQCCC TOOOOAATTT AAACCTCCAA AAAAA6C3U3C TGCTTTaGA GQAGQAAAAG 600 

AA6AATTTAT CAGCATCTAC GOTATTAACT GCCCAAGAAT CATTTTCOGG TTCACTTGGG 660 

CATTTACAGA ATAGOAACAA CAGTTGTGAT TCCAGAGGAC AGACTACTAA AGCCAGGTTT 720 

TTATATGGAG AGAACATGCC ACCACAAGAT GCAGAAATAQ GTTACOGGAA TTCATTGAGA 780 

80 CAAACTAACA AAACTAAACA GTCATOOCCA TTTQGAAGAa TCXXAGTTAA CCTTCTAAAT 840 

AOOCCAGATT GTGATQTGAA GACAGATCAT TCAGTTGTAC CTTGTTTTAT GAAAASACAA 900 

ACCTCTAGAT CAGAATGCOG AGATTTQQTT GTQC C TGQAT CTAAACCAAO TG6AAATGAT 960 

TCCTGTGAAT TAAOAAATTT AAAGTCTOTT CAAAATAOTC ATTTCAAGQA ACCtCT GG TG 1020 

TCAQATGAAA AGA6TTCTGA ACTTATTATT ACTQATTCAA TAACGCTGAA GAATAAAAOQ 1080 



1140 



wo 03/042661 



PCTAJS02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 



OAATOU^GTC 
GAGAGTAACC 
QCTGCiVrCTT 
AAACATACCA 
AOVTCTAAAT 
GATTACATOA 
TTGTCAACAC 
ACrCCACTTC 
AA AGGAAGA A 
CAG6TX3TTAA 
GATAACCAAA 
CACAGTGATA 
GTAATC30A6T 
CCATGOGAAC 
CATG6CATT6 
AAQCTAATTG 
GATTCTCAGG 
TCCAGAGAGA 
GGATGTATTT 
ATTTCTAAAT 
GAGAAAGATC 
TCCATTCCTQ 
ATQQCCAAGO 
TCTCCTAACT 
AOTCATAATT 



TTCTAGCTAA 
AGAAACAGTG 
CAAATGACTG 
CTTTTGAGCA 
GGTTT6ACCC 
GCTGTTTTAG 
CTTATGGCCA 
AAAATTTACA 
TTTATTCCAT 
AT6AAAA6AA 
CTCTTGATAG 
AGATCATCCX5 
GTGGAAATAT 
GCAAGAGTTA 
TTCACAGTGA 
ATTTTGGGAT 
TT6GCACAGT 
ATGGGAAATC 
TGTACTATAT 

tacatgcx:at 
ttcaagatgt 

A6CTCCTGOC 
GAACXZACTGA 
CCATTTTGAA 
CTTCATCCTC 



ATTAGAAGAA 
GCAATCTAAO 
GCAGATTCOQ 
AOCTGTCTTT 
AAAATCTATT 
AACTCCAGTT 
ACCTGCCTGT 
GGTTTTAGCA 
TTTAAA6CAG 
ACAOATATAT 
TTACCGGAAC 
ACTTTATGAT 
TGATCTTAAT 
CTGGAAAAAT 
TCTTAAACCA 
TGCAAACCAA 
TAATTATATG 
TAAGTCAAAG 
GACTTACGGG 
AATTQATCCT 
GTTAAAGTGT 
TCATCCCTAT 
AGAAATGAAA 
AGCTGCTAAA 
CAAGACTTTT 



ACTAAAGAGT 
A6AAAGTCAG 
GAQTTAGCCC 
TGAOTTTCAA 
TGTAAQACAC 
GTAAA6AATG 
TTCXaVGCAGC 
TCTTCTTCAO 
ATAGQAAGTO 
6CTATAAAAT 
GAAATAGCTT 
TATGAAATCA 
AGTTG6CTTA 
AT6TTAGAGG 
GCTAACTTTC 
ATGCAACCAG 
CCACCAOAAG 
ATAAGCCCCA 
AAAACACCAT 
AATCATGAAA 
TGTTTAAAAA 
GTTCAAATTC 
TAT6TTCTGG 
ACTTTATATQ 
GAAAAAAAAA 



ATCAAGAACC 
AGTGTATTAA 
QAAAAiGTTAA 
AACAGTCACC 
CAAGCAOCAA 
ACTTTCCACXr 
AACAGCATGA 
CAAATGAATG 
GAGG7TCAAG 
ATGTGAACTT 
ATTTGAATAA 
OGGACCAGTA 
AAAAGAAAAA 
CAGTTCACAC 
TOATAGTTGA 
ATACAACAAO 
CAATCAAA6A 
AAAGTGATGT 
TTCAGCAQAT 
TTGAATTTCC 
GGGACCCAAA 
AAACTCATCC 
GCCAACTTGT 
AACACTATAG 
GGGGAAAAAA 



AGAGGTTCCA 
CCAQAATCCT 
TACAGAGCAG 
AC CAATAT CA 
TACCTTGGAT 
T G CTTGTC A G 
AATACTTQCC 
CATTTOQGTT 
CAAQGTATTT 
AGAAGAAGCSV 
ACTACAACAA 
CATCTACATG 
ATCCATTGAT 
AATCCATCAA 
TGGAATGCTA 
TaTTGTTAAA 
TATQTCTTCC 
TTGGTCCTTA 
AATTAATCAO 
CX3ATATTCCA 
ACAGAGGATA 
AGTTAACCAA 
TG6TCTGAAT 
TGGTGaTGAA 
ATGA 



Seq ID NO I 462 Protein sequence 
Protein Accession ft: NP 003309.1 



1 
I 

MBSEDLSGRB 
NPEDWIiSLLL 
AIQEPDDARD 
EIALHMLNLQ 
X*YGENMPPQD 
TSRSECRDLV 
ESSIiLAKLEE 
KHTTPEQPVF 
LSTPYGQPAC 
QVLNBKKQIY 
VMECXSNIDLN 
KLIDFGIANQ 
QCILYYMITG 
8IPELLAHPY 



11 
I 

LTIDSIMNKV 
XLEKKSVPL5 
YFQMARANCK 
KKQLLSEEEK 
AEIGYRNSIiR 
VPGSKPSGND 
TKEYQEPEVP 
SVSKQSPPIS 
FQQQQHQILA 
AIKYVZILEEA 
8WLKKKKSID 
MQPDTTSWK 
KTPPQQIINQ 
VQIQTHPVNQ 



21 

I 

RDIKNKFKNE 
DALLKKLIGR 
KFAFVHZSFA 
KKLSASTVLT 
QTNKTKQSCP 
SCBLRNLKSV 
ESNQKQWQSK 
TSXWFDPK8I 
TPXiQNLQVIiA 
DNQTLDSYRN 
PWERKSYHIQT 
DSQVGTVNYM 
ISKLHAIIDP 
HAR6TTEEMK 



31 

! 

DLTDBIfSLNK 
YSQAXEALPP 
QFEI.SQGNVK 
AQESFSGSLG 
FGRVPVNLLN 
QKSHFKEPLV 
RKSECINQNP 
CKTPSSNTLD 
SSSANSCISV 
BIAYLNKLQQ 
MLEAVHTIHQ 
PPESAIKDMSS 
NHEIBFPDIP 
YVLGQLVGLN 



41 
I 

I8ADTTDNSG 
DKYGQNESFA 
KSKQLLQKAV 
HLQNRNNSCD 
SPDCDVKTDD 
SDEKSSELIZ 
AASSNHWQIP 
DYMSCFRTPV 
KORIYSILKQ 
HSDKIIRLYD 
HGIVHSDLKP 



1140 
1200 
1260 
1320 
13B0 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 



EKDLQDVLKC 
8PN8ILKAAK 



51 
I 

TVNQIMMMAN 
RIQVRPAELK 
BRGAVPLEML 
SRGQTTKARF 
SWPCFMKRQ 
TDSITLKNKT 
ELARKVNTEQ 
VKNDFPPACQ 
IGSGGSSKVP 
YBITDQYIYM 
ANFLIVDGML 
ISPKSDVWSL 
CLKRDPKQRI 
TLYEHYSGGB 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 



50 
55 
60 
65 
70 
75 
80 



Seq ID NO 3 483 DNA sequence 

Nucleic Acid Accession #: MM_003667.1 

Coding sequence t 1 . . 2 651 



ATGGACACCT 
GGGGGCAQCT 
GAGCCCGACG 
CCTTCCAACC 
CTGCTCCOGA 
GCTCTGACAT 
CTGCAGAATA 
CRATCCCTGC 
CTQCATTCCC 
GCTTTTAGAA 
ATACCAQACT 
A6AATCCACT 
TTAAATTACA 
GAACTACATT 
CCTGAACTAA 
ACTGGAACTO 
CAAACOGTCT 
QAAOATTTAC 
GAAATCTAOG 
TTGGCTT^3A 
ATAAAGCTGG 
TTAACTCACT 
TTTCCAQAAC 
TGTOAGAATG 
QACCTTCATA 
TTCCTGCTTG 
CCAGGCCCCT 
TGGACCATAG 
TCCOCTCTGT 
CTCAOGGGAO 
GCAC3QACATQ 



11 

1 

CCCGGCTCGG 
CTCCCAGGTC 
GCAGGATGTT 
TCAGOQTCTT 
ATCCCCTGCC 
ACATTCCCAA 
A7CAGCTAA0 
GTCTGGATGC 
TGAGGCACCT 
GTTTATCGGC 
ATGCCT7TGG 
CCCT6GGAAA 
ATAACCTTOA 
TCTAT6ACAA 
GAACACTGAC 
CAAACCTGGA 
GCAATCAGT7 
CXZAGTTTTTC 
AAATTAAAGT 
ACAAAATTGC 
ACCTATCGTC 
TAAAATTAAC 
TCAAGGTTAT 
CCTATAAGAT 
AGAAAGATGC 
ACTTTGAGGA 
TCAAACCCTG 
CAGTTCTGGC 
ACATTTCCCC 
TCTCCAOTQC 
GXGCXrrGGTG 



21 

I 

TGTGCTCCTG 
TGGTGTGTTG 
GCTCAGGGTQ 
CACCTCCTAC 
CAGTCTCCGC 
GGGAGCATTC 
ACACGTACCC 
TAACCACATC 
GTGGCTGGAT 
ATTGCAAGCC 
AAACCTCTCC 
GAAATGCTTT 
TQAATTCCCC 
TCCCATCCAA 
TCIGAATQQT 
GAGTCTGACT 
ACCTAATCTC 
AOTCTQCCAA 
TGAGACTTTC 
TATTATTCAC 
CAACCTCCTG 
AGGAAATCAT 
AOAAATQCCT 
TTCTAATCAA 
TGGAATOTTT 
AGACCTGAAA 
TGAACACCTG 
ACTTACTTQT 
CATTAAACTQ 
0BT6CTG6CT 
G6AGAATGQ0 



31 

I 

TCCTTGCCTG 
CTGAGGGGCT 
GACTGCTCCG 
CTAGACCTCA 
TTCCTGGAGG 
ACTGGCCTTT 
ACAGAAGCTC 
AGCTATGTQC 
GACAATGCGT 
ATGACCTTGG 
AGCTTGGTAG 
QATGGGCTCC 
ACTGCAATTA 
TTTGTT6GGA 
OCCTCACAAA 
7TAACTGGAG 
CAA GTGCT AG 
AAGCTTCAGA 
CAGCAGTTGC 
CCCAATGCAT 
TOGTCTTTTC 
GCCTTACAOA 
TATGCTTACX: 
TGGAATAAAQ 
CAGGCTCAAG 
GCCCTTCATT 
CTTGATGGCT 
AATGCTTTOG 
TTAATTGGGO 
G6TGTGGAT0 
GTTG6TTGCC 



41 

I 

TGCTGCTGCA 
GCCCCACACA 
ACCTGGGGCT 
GTATGAACAA 
AGTTACGTCT 
ACAGTCTTAA 
TQCAGAATTT 
CCCCAAGCTG 
TAACAGAAAT 
CCCTGAACAA 
TTCTACATCT 
ACAGCCTAGA 
6GACACTCTC 
GATCTQCTTT 
TAACTQAATT 
CACAGATCTC 
ATCTGTCTTA 
AAATTGACCT 
TTAOCCTC03 
TTTCCACTTT 
CTATAACTGG 
GCTTSATATC 
AGTGCT6TGC 
GTGACAACA6 
ATGAACX3TGA 
CAGTGCA6TO 
GGCTQATGAG 
TGACTTCAAC 
TCATCGCAGC 
OGTTCACTTT 
ATQTCATTOQ 



51 

I 

GCTQGOQACC 
CTGTCATTGC 
CTCX3GAGCTG 
CATCAGTCAG 
T6CGGGAAAC 
AOTTCTTATG 
GOGAAGCCTT 
TTTCAGTGQC 
CCCCGTCCAG 
AATACACCAC 
CCATAACAAT 
GACTTTAGAT 
CAACCTTAAA 
TCAACATTTA 
TCCTGATTTA 
ATCTCTTCCT 
CAACCTATTA 
AAGACATAAT 
ATGGCTGAAT 
GCCATCCCTA 
GTTACATGGT 
ATCTGAAAAC 
ATTTGGAGTG 
CABTATGGAC 
CCTTQAAGAT 
TTCACCTTCX: 
AATTGGAOTG 
AGTTTTCAQA 
AOTGAACATG 
T06CAGCTTT 

v m ' v i xnvc 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
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5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



ATTTTTQCTT 
TCTQTQAAAT 
ATTTTOCTCT 
AAGTATQGGQ 
TACATGGTOO 
ACCWVQCTCT 
GTAAAACACA 
TTGTCCTTCT 
CTTCTOGXOO 
CCTCACTTTA 
AAACACCCAA 
ACTCAAGCCT 
GT6CCATCAC 
CCATGTCTTA 



CAGAATCATC 
ATTCTGCAAA 
GTGGCCrrGCT 
CCrCCCCTCT 
CTCrCATCTT 
ACTGCAATTT 
TTGCCCTGTT 
CCTCTTTAAT 
TAOTCCCACT 
AGGAOGATCT 
GCTTGATGTC 
TGGTAACCTT 
CAGCTTATCC 
A 



' m ' i"i"nu :r G 

ATTTQAAAOG 
GGCCTTGACC 
CTGCCT6CCT 
GCTCAATTCC 
GOACAAOGGA 
GCTCTTCACC 
AAACXTTACA 
TCCTGCATGT 
GGTGAGOCTG 
AATTAACTCT 
TACCAGCTCC 
AOTQACTOAO 



CTTACTCTGO 
AAAGCTCCAT 
ATGGCCX3CA0 
TTGCCTTTTG 
CTTT3CTTCC 
GACCTGGAGA 
AACTGCATCC 
TTTATCAGTC 
CTCMTOOOC 
AGAAAOCAAA 
GATGATGTCG 
AGCATCACTT 
AiQCTGCCATC 



CAOCCCTGGA 
TTTCTAGCCT 
TT C CCC TO CT 
GGGAGOOCAO 
TCATGATQAC 
ATATTTGGGA 
TAAACTQCCC 
CTGAAGTAAT 
TTCTCTACAT 
CCTAOOTCTG 
AAAAACAGTC 
ATGACCTGCC 
TTTCCTCTGT 



GCGTGG6TTC 
GAAAGTAATC 
GGGTG6CAGC 
CACCATGGGC 
CATTGCCTAC 
CTGCTCTATG 
TOTOOCTTTC 
TAAGTTTATC 
CTTGTTCAAT 
GACftAGATCA 
CTGTGACTCA 
TCCCAGTTCC 
GGCATTTGTC 



1920 
1980 
2040 

aioo 

2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 



Seq ID NO: 484 Protein sequence 
Protein Accession #i NP^OOB 658.1 



I 
1 

MDTSRLGVLL 
PSNLSVFTSY 
IiQNNQIiRHVP 
AFRSLBALQA 
UIYNNLDEFP 
TGTANLBSLT 
EIYBIKVDTP 
LTHLKLTGNH 
DUQCKDAOMP 
WTIAVLALTC 
AHHGAWWEmS 
ILLCALLAXiT 
TKLYCNLDKG 
LIiVWPLPAC 
TQALVTPTSS 



11 

I 

SLPVLLQLAT 
LDLSMHNISQ 
TEALQMLRSL 
MTIiAUIiaHH 
TAIRTLSNLK 
LTGAQI68LP 
QQLLSIiRSLN 
ALQSLXSSEN 
QAQDBRDLED 
NALVTSTVFR 
VGCHVIGFLS 
MAAVPLLGG9 
DLENIWDC8M 
UIPLLYILFN 
SITYDLPP8S 



21 

I 

CGSSPRSGVL 
LLPNPIiPSZiR 
QSLRUMMHI 
ZFDYAFGNLS 
BLHFyxaiPIQ 
QTVCKQLPNL 
liAimKIAIIH 
FPELKVIEMP 
FLLDFBEDLK 
SPLYX6PIKL 
IFASESSVFL 
KYQASPlfCLP 
VKHIAXiUiFT 
PRFKEDLVSIi 
VP8PAYPVTB 



31 
I 

LRGCPTHOIC 
FLEELRXAOI 
SyVPP8CF8G 
8LWLHLHNN 
FVGRSAFQHL 
QVIiDLSYNLI* 
FNAPSTLPSL 
YAYQCCAPSV 
ALHSVOCSPS 
LIGVZAAVNM 
LTLAALERQP 
LPFGEPSTMG 
NCIU7CFVAF 
RKQTYVWTRS 
SCHLSSVAFV 



41 

I 

EPDGRMlJiRV 
ALTYIPKOAF 
LHSLRHLHIiD 
RIHSLGKKCF 
PELRTLTIjNG 
EDLPSFSVCQ 
IKLDLSSKLL 
CBOVYKISNQ 
POPFKPCBBL 
LTGVS8AVLA 
SVKYSAKFET 
YMVALILLNS 
LSPSSLINLT 
KRPSLMSmS 
PCL 



Seq ID NOt 485 DNA sequence 

Nucleic Acid Accession NM_005756.1 

Coding sequence! 73.. 3117 



11 



41 



51 
I 

DCSDUSLSEL 
TGLYSLKVLM 
DNALTBIPVQ 
DOLHSLBTI^ 
ASQITEFPDL 
KLQKIDLRBN 
SSFPITGLHQ 
WNK6DNSSMD 
LDGWLXRZOV 
GVDAFTFOSF 
KAPPSSLKVI 
LCFLMMTIAY 
FISPBVIKPZ 
DDVEKQSCD8 



51 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
760 
840 



21 31 

i I I I I ! 

AGCCAGCCOG AGGAOGCGAG CGGCAGGTGT GCRCAOAGOT TCTCCACTTT GTTTTCTGAA 
CTOOOQGTCA GGATGGTTTT CTCTGTCAGG CAOTGTGGCC ATGTTGGCAG AACTGAAGAA 
GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGT06T TCTGGTAACA 
TCCCroOAAG AAQATACTGA TAATTCCAGT TTGTCACCAC CACCTGCTAA ATTATCTGTT 
OrcAGTTTTG CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TOTTACTTTA 
AGCTTACTCC CTTCRAAOGA AACAQAAAAA ACTAAAATCA CTATAGTAAA AAOCTTCAAT 
GCTTCA66CG TCAAACCCXA GAGAAATATC TGCAATTTQT CATCTATTTQ CAATGACTCA 
GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATQATAAAQ AAAGCACTOT TCOCCAGAAT 
CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAAOGCTCA 
QAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAQCA 
GA6GCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 
TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTOGAC CAATGGAACA CTGCTGCTQT 
TCTOTCAGQA TACXICTGCCC TTCCTCCOCA QAAGAGTTGG 6AAAGCTTCA GTGTGACCTG 
CAOOATCCCA TTGTCTGTCT TGCTGACCAT OCAOGTGGCC CACCATTTTC TTCCA GCCAA 
TCCATGOCAO TGGTGCCT06 GGGGACTGTO CTTTCCCAGQ TCCCCAAAGC TACCTCTTTT 
OCTQAGCCTC CAOATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAOATT 
CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCGCTGGCAT TOACATOCCC 
CCACAOTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC A TGTCTCCGO CACCCCACCX 
CCTGTGAAAQ CCTCATTTTC CTCTCCCACC GTOTCTGCCC CTGOQAATOT CAACACTACC 
A6CGCACCTC CT0TGCAX3AC AGACAT06TC AACACCAGCA GTATTTCTGA TCTTGAGAAC 
CAAGTGTTGC AGATGGAQAA G6CTCTGTCC TTGG6CAGCC TGGAGCCTAA CCTCGCAGGA 
GAAATQATCA ACCAAQTCAO CAGACTCCTT CATTCCCOGC CTGACATGCT GGOCCCTCTG 
GCTCAAAGAT TGCTGAAAGT AGTG6ATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 
ACTATAAGTC TAACCTCCCC TTCTTTQQCT CrGGCTQTOA TCAGAGTQAA TGCCAQTAGT 
TTCAACACAA CTACCTTTGT GGCCOUUSAC CCTGCAAATC TTCAGGTTTC TCTGQAAACC 
CAAGCTCCTG AGAACAQTAT TGGCACAATT ACTCTTCCTT CATCOCTGAT GAATAATTTA 
CCAQCTCATQ ACATGGAGCT AGCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 
TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCTAOGTCAT ATCATOQAGT 
GTT6CAAACC TGAGOSTCAG GAACTTOACA AOAAAOaTGA CAGTCACATT AAAGCACATC 
AACCCXIA6CC A0GAT6AGTT AACAGTGAGA TGTGTATTTT 0G6ACTTGG0 CAGAAATOGT 
GGCAGAGGAG GCTGGTCAGA CAATGGCT6C TGTGTCAAA6 ACAOGAGATT GAATGAAACC 
ATCTOTACCT GTAGCXyCTCT AACAAGCTTC GGCX3TTCTGC TGGACCTATC TAGGACATCT 
GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATrGGTTO TOOGCTTTCA 
TCAATTTTTC TQTCAOTQAC TCTPOTAACC TACATAGCTT TTGAAAAQAT COGQAGGGAT 
TAOCCTTCCA AAATOCTCAT CCA6CTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 
CTCCTGGACT OGTGGATTGC TCTGTATAAG ATGCAAGQOC TCTGCATCTC AGTGGCTGTA 
TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGOATGG GCCTAGAAGC ATTCCATATQ 
TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCOQAA AATACATCCT TAAATTCTGC 
ATTGTOQOTT GGGGOQTACC AOCrGTGOTT GTGACCATCA TCCTGACTAT ATCCOCAGAT 
AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACOOGATQA CTTCTOCTGQ 
ATCAACAACA ATOCAOTATT CTACATTAOQ GTX3GTGGGAT ATTTCTGTQT GATATTTTTG 
CTGAAOGTCA GCAT6TTCAT TGTGGTCCTG OTTCAOCTCr QTOSAATTAA AAAOAAGAAG 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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5 

10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



CAACTGGGAO CCCAGCXSAAA AACXyVOTATT CAAGACCTCA GQAGTATC3QC TCGCXTTTACA 2 580 

TTTTTACTGG GAATAACTTG GGQCTTTGCC TTCTTTGCCT GGGGACCAGT TAAOCTTQACC 2640 

TTCATGTATC TCTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 

TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TG6AG6C6GT ATCTTTGTTQ TGOAAAGTTA 2760 

CGGCTG6CTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2B20 

GTAAACCAAO QA6T6TCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTGCACTAAC 2B80 

TCCACCACAC T6CTAQTGAA TAATGATTCC TCAGTACACQ CAAOOOGQAA TGGAAATGCT 2940 

TCTACAGAGA GGAATGGGGT CTCTTTTAOT GTTCAGAATG GAGATGTGTG CCTTCAOGAT 3000 

TTCACTGGAA AACA6CACAT GTTTAACGAO AAG6AA6ATT CCTGCAATGQ GAAAGGCOGT 3060 

AT6GCTCTCA GAAG6ACTTC AAA6CGGGGA AGCTTACACT TTATTGAQCA AATOTGATTC 3120 

CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATQTCCAATT TTACCTTTTA 3180 

CACAATGT6A GATGTATGAA AATCAACTCA TTTTATTCTC G6CAACATCT GGA6AAGCAT 3240 

AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300 

TTTAQACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GT7TTAAACA 3360 

ATACACTAAO AATGACTCCT ATAAAOAAAA CAAAAAAAGO TAGT6AACTT TCAGCTACCT 3420 

TTTAAAGAGG CTAAGTTATC TTTQATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTQ 3480 

TTGGTGAGTT TAGTTGTGCA TGCCTTTOTT GTATATAAQC TAAATTCTAG TGACCC3VTGT 3540 

GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600 

TTGTAGAATC ATGGTTGTTT TGTCTCACGT GATAATTCAQ AAAATCCTTG CTCX3TTCCGC 3660 

AAATCGTAAA 0CTCCTT7T6 GAQATQATAT AOGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 

TCAAOAAATA ATGATCXX3\G CCAOACTGAQ AAAAT6TAAG CAGACAGT6C CACAGTTAGC 3760 

TCATACAGTO CCTTTGAGCA AOTTAGaAAA AGATGCCCCC ACTG66CA6A CACA6CCCTA 3840 

TGGGTCATGG TTTGACAAAC AGAGTGAGAO ACCATATTTT AGCCCCACTC ACCCTCTTGG 3900 

GTGCACGACC TGTACAGCCA AACACAGO^T CX3UITATQAA TACCCATCCC CTGACCGCAT 3960 

CCCCA6TAGT CSUSATTATAG AATCTGOVCC AAG A T G TTT A 6CTTTATACC TTGGCCACAG 4020 

AGAGG6ATQA ACrOTCATGC AGACCATGTG TCAGGAAAAT TOTGAAOQTA OATGAGGTAC 4080 

ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGOAGAOTAG ACTA6GATTC 4140 

CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCOGTTCT TTAAAAGOCA 4200 

ACTGCATGGT ACATTGTTGA TTGTTATOAC TGGTACACTC TGGCCCAOCC AGAGCTATAA 4260 

TTGTTTTTTA AATGT6TCTT GAAGAAT6CA CAGTGACAAG GGGAGTAGCT ATTG66AACA 4320 

GGGAACTGTC CTACACTGCT ATTQTTOCTA CATGTATOQA GCCTTGATTG CTCCTAGTTA 4380 

TATACAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCA6TGCCTC AAGTACATCC 4440 

TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 45 00 

ATTTCAAGTG AATGTTGGAT CTCAQACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560 

CTGACTTGTC TTTGCAATAT TTCTTTTCTQ ATTTATTTAA TTTTCTTGTA TTTATATOTT 4620 
AAAATCAAAA ATGTTAAAAT CAATOAAATA AATTTGCAGT TAAGA 

Seq ID NO I 486 Protein sequence 
Protein Accession «: NP 005747.1 



1 
I 

MVPSVRQCGH 
PSSNEVBTTS 
QNQKITNGTL 
NAGAVIAALB 
8Q8ZPVVPRA 
MPPQSSTZSS 
ENQVLQMEKA 
NTTZSIiTSPS 
NLPAHDNEZiA 



T8VLPAQMMA 
VFI.I.DSWXAL 
FCZVGWGVPA 
FLU9V8MFIV 
VTFHniFAIF 
QTVNQ6VSSS 
KDFTGKI^MF 



11 
I 

VGRTSEVLLT 
LNDVTLSLLP 
TGVLSLSELK 
RVKIRPKBHC 
TVL8QVPKAT 
PMPQTHVSGT 
LSLGSLEPNL 
LALAVZRVNA 
&RVQF1IFFET 
VRCVFWDLGR 
LTPITYIGOG 
YKMQGLCISV 
WVTIILTIS 
VLVQLCRXKX 
MTLQGFFXFI 



21 
I 

PKIFLVIIdi 
SNETQVKPQR 
RSELNKTIiQT 
CCSVRZPCP8 

SFAEPpmrsp 

PPPVKASFSS 
AGEMINQVSR 
SSFNTTTFVA 
PALFQDPGIf 



LSSIFLSVTL 
AVPLHYFLLV 
PDNYGUSSYG 
KKQLGAQRKT 
FYCVAKENVR 
TNSTTIiLVNN 
GRKAliRRTSK 



31 
I 

HWLVTSZjEB 
NICNIiSSiai 
LSETYFIHCA 
SPEBLBKLQC 
VTHNVPSPZG 
PTVSAPANVN 
UJISPPDMLA 
ODPANLOVSL 
NI.SLXSYVZS 
GCSVKDRRLN 
VryiAPBKIR 
SFTWIGLEAF 
KFPNQSPDDF 
SIQDLR8IA0 
KQNSRYZ^CGQ 
DCSVHASCaiG 
nSSIiHFZEQM 



41 

I 

DTDNSSLSPP 
DSAFFRGEIM 
TABAQSTLNC 
DLQDPZVCLA 
BZQPLSFQPS 
TTSAPPVQTD 
PLAQRIiLKW 
ETQAPE2ISIG 
SSVANUTVRN 
ETICTC8BLT 
RDYPSKIMQ 
HMYI*ALVKVP 
CWZNNNAVFY 

KLXOAENSDW 



51 
I 

PAKLSWSFA 
FQYDKESTVP 
TFTIKLNNTM 
DBPRGPPF8S 
APZASSPAZD 
IWT8SISDL 
DDZGLQLNF8 
TZTLPSSLMN 
LTKNVTVTUC 
9FGVLLDLSR 
LCAALLLLNL 
NTYIRKYILK 
ZTWGYFCVZ 
FAFFAHGPVN 
8XTATMGLRK 
FSVQNGDVOi 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



8eq ID NOt 487 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

coding sequence t 1 . . 2 904 



1 
I 

ATG6TTTTCT 
TTCAAGATAT 
GATACTGATA 
AA06AAACAG 
G0CXM3AGAA 
GAGATCATGT 
GGCACCTTAA 
ATAAAACTQA 
ATTCGAGCAA 
GAGTTGOAAA 
GGTGGCCCAC 
TCCCAGGTCC 
AATGTTCCCT 
GCTTCCAGCC 
CAAACCCATG 
TCTGCCOCTO 
ACCA6CAGTA 
GGCA6CCTGG 



11 
I 

CTQTCAGOCA 
TCCTTGTCAT 
ATTCCAGTTT 
AAAAAACTAA 
ATATCTGCAA 
TTCAATATGA 
CTGQAGTCCT 
ATAATACAA7 
TGGAACACTG 
AGCTTCAOTG 
CATTTTCTTC 
CCAAAOCTAC 
CTCCAATAGG 
CTOCCATTQA 
TCTC06QCAC 
GGAATQTCAA 
TTTCTGATCT 
AOCCXAACCT 




T6ACCTGCAS 
CAGCCAATCX: 
CTCTTTTGCT 
GGAGATTCAA 
CATGCCCOCA 
CCCACCTCCT 
CACTACCAGC 
TOAGAACCAA 
CSGCAGGAGAA 



31 

I 

GTTGGCAGAA 
CATGTCQTTC 
CCTQATGTTA 
GTAAAAACCT 
ATTTGCAATG 
ACTOTTOOCX: 
GAATTAAACA 
GCTGTAATAG 
GTCAGGATAC 
QATCCCATTG 
ATOCCAOTGa 
GAGCCTCCAG 
CCCCTTTCAC 
CAOTCTGAAA 
GTGAAAGCCT 
OCAOCTOCTQ 
GTOnGCAGA 
ATQAltAACC 



41 
I 

CT6AAGAA6T 
TGGTAACATC 
CTTTAAGCTT 
TCAATGCTTC 
ACTCAI3CATT 
AGAATCAAGA 
CATTAAATTG 
CTGCTTTCGA 
CCTGCGCTTC 
TCTGTCTTGC 
T6CCTGGGGC 
ATTATTCACC 
CCCAGCCT7C 
C6ATCTCTTC 
aVTTTTGCTC 
T0C3UaACA0A 
TOQAOAAQGC 
AAOTCAGCAO 



51 
I 

TTTACTGAGQ 
OCTGGAAGAA 
ACTCCCTTCA 
AGQO QTCAAA 
TTTTAGAGGT 
TATAAOQAAT 
TACATTCACA 
AAGAGTAAAG 
CTOCCGAOAA 
TGACCATCCA 
CACTQTGCTT 
TGTGACOCAC 
AGCTCTXATA 
CCCIATGCCC 
TCGCAGOGTO 
CATOBTCAAC 
TCTGTOCTTG 
ACTCCTTCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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ACATGCT60C CGCTCTOGCT CAAAQATTQC TGAAAGTAGT GQATGACATT 1140 

GGCCTACAGC TGAACTTTTC AAACAOGACT ATAAGTCTAA CCTCCCCTTC TTTGGCTCTG 1200 

GCTOTQATCA GAGTGAATGC GAOTAOTTTC AACACAACTA CCTTTGTGGC CCAAGACCCT 1260 

. GCAAATCTTC AGGTTTCTCT G6AAAG0CAA GCT0CTGA6A ACAGTATTGG CACAATTACT 1320 

5 CTTCCTTCAT CX3CTGATGAA TAATTTACCA GCTCATGACA TGGAGCTAGC TTCGAGGCTT 1380 

CAerrCAATT TTTTTOAAAC ACCTGCTTTQ TTTCAGGATC CTTCCCTGGA GAACCTCTCT 1440 

CTGATCAGCT AOGTCATATC ATCGAGTOTT GCAAACCTQA CCGTCAtSOAA CTTGACAAOA 1500 

AAOOTGACAO TCACATTAAA GCACATCAAC CCQAGCCAGO ATGA6TTAAC AGTQAGATGT 1560 

GTAT T TT GG Q ACTTGGGCAG AAATGQTGGC AGAGGAGGCT GGTCAGACftA TGGCTGCTCT 1620 

10 GTCAAAGACA GGAGATTGAA TGAAACCATC T6TACCTGTA 6CCATCTAAC AAGCTTC6GC 1660 

GTTCTQCTGG ACCTATCTAG GACATCTOTG CTGCCTGCTC AAATQATGQC TCTGAOGTTC 1740 

ATTACATATA TiXXmiriXS G GCTTTCATCA ATTTTTCTGT CAGTGACTCT TQTAACCTAC 1800 

ATAGCTTTTO AAAAOATCOO GAGGGATTAC CCTTCCAAAA TCCTCATCXSl GCTGTGTGCT 1860 

GCTCTGCTTC TGCTQAACCT GGTCTTCCTC CTGGACTOGT GGATTGCTCT GTATAAGATG 1920 

13 CAAGGCCrCT GCATCTCAGT GGCTOTATTT CTTCATTATT TTCTCTrGOT CTCATTCACA 1980 

TGGATGGGCC TAGAAGCATT CCATATGTAC CTOQCCCTTG TCAAAGTATT TAATACTTAC 2040 

ATCCGAAAAT ACATCCTTAA ATTCTGCATT OTOGGTTGaO GGOTACCAGC TOTGOTTGTG 2100 

ACC3VTCATCC TGACTATATC CCCAQATAAC TATGGGCTTG GATCCTATGG GAAATTCCCC 2160 

AATGGTTCAC CGGATGACTT CTGCTGGATC AACAACAATG CAGTATTCTA CATTAOCGTG 2220 

ZU GIGOOATATT TCTGTGTQAT ATTTTTGCTG AAOGTCAGCA TGTTCATTGT GGTCCTG6TT 2260 

CAOCTCTGTC OAATTAAAAA GAAGAAOCAA CTOGGAGOOC AGGGAAAAAC CAGTATTCAA 2340 

6ACCTCAGGA GTAT06CT6G CCTTACATTT TTACTQGGAA TAACTTGGG6 CTTTGCCTTC 2400 

•mt3CClX3GG GACCAGTTAA CGTGACCrTC ATGTATCTGT TTGCCATCTT TAATACCTTA 2460 

CAAGGATTTT TCATATTCAT CrTTTACTGT GTGGCCAAAG AAAATGTCAQ QAAQCAATG6 2520 

2, J AGGOGGTATC TTTQTTQTGO AAAGTTAOSQ CTOQCTGAAA ATTCIGACTG GAGTAAAACT 2560 

GCTACTAATG GTTTAAAGAA GCAOACTOTA AA0CAAG6AG T6TGCAGCTC TTCAAATTCC 2640 

TTACAQTCAA GCAGTAACTC CACTAACTCC ACCACACTGC TA6TGAATAA TGATTGCTCA 2700 

GTACAC6CAA GCX»»3AAT60 AAAT6CTTCT ACAGAGAGGA AT66GGTCTC TTTTAGTGTT 2760 

CAGAATOQAG ATGTGTGCCT TCACGATTTC ACTOGAAAAC AOCACATGTT TAACGAGAAQ 2820 

30 GAA6ATTCCT GCAATGGGAA AGGCCGTATO GCTCTCAGAA G6ACTTCAAA GC6GGGAA0C 2680 
TTACACTTTA TTGA6CAAAT GTGA 



35 



55 
60 
65 
70 
75 
80 



Seq ID NOi 468 Protein sequence 
Protein Accession ff; Eos sequence 



1 11 21 31 41 51 

I I I I I I 

KVPSVRQGGH VGRTBEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP PDVTLSLLPS 60 

NBTEKTKITI VKTFNASQVK PQRNICKLSS ICNDSAPFRG EIMFQYDKES TVPQNQHITN 120 

40 GTLT6VLSLS ELNTIiNGTFT IKLNNTKNAC AVIAALERVK IRPMEHCCCS VRIPCPSSPB 180 

ELEKLQCDLQ DPIVCLADHP RGPPF88SQ8 ZPWPRATVL SQVPKAT8FA SPPDYSPVTH 240 

NVPSPIQBIQ PLSPQPSAPI ASSPA333MPP Q8ETZ8SPMP QTHVSGTPPP VKASP8SPTV 300 

SAPANVNTTS APPVQTDIVN TSSZSDLENQ VliQMBXALSXi GSLBPNLAOB MIMQVSRLLH 360 

SPPDMLAPLA QRLUCWDDI GLQLNFSNTT ISLTSPSLAL AVIRVNASSP NTTTFVAQDP 420 

45 AMLQVSLBTQ APQI8XGTIT LP8SLM13NLP ARDMELASRV QFNFFETPAL FQDPSLENLS 480 

XiISYVXSSSV AMLTVRMLTR HVTVTLKBQI PSQDBLTVRC VFHDLGRMGQ RGGWSDNGC8 540 

VKDRRU7BTX CTCSHLT8FG VUiDLSRTSV LPAQMMALTP imGOGIiSS ZFLSVTLVTY 600 

lAFEKIRRDY PSKILIQLCA ALLLLNLVFL LDSWIALYKM QGLCISVAVF LHYFLLVSFT 660 

^ WMGLBAFHMY LALVKVFNTY IRKYILKFCI VGWGVPAWV TIILTISPDN YGLGSYGKFP 720 

50 NGSPDDFCHI NMNAVFYITV VOYFCVIFLL NVSMFZWLV QLCRXXKKKQ LGAQRKTSXQ 780 

DLRSZAGLTF LLGXTHGFAF FAHGPVNVTF MYLFAXPNTIi QGFPIFXFYC VAKENVRKQH 840 

RSyLCGGKLR LAEHSDH8KT ATNGLKKQTV NQ6VSSSSNS LQSSSNSTNS TTLLVHNDCS 900 

VBASGN(aiA8 TBRNGVSFSV QNGDVCUIDP TGKQHHFIIEK BDSCNGR6RH ALRRTSKRGS 960 
IfiFIBQH 



Seq ID MO: 489 DNA sequence 

Nucleic Acid Accession #t Eos sequence 

coding sequence: 1..2eil 

1 11 21 31 41 51 

I I I I I I 

ATGGTTTTCT CTGTCAOGCA OTGTGGCCAT OTTGGCAQAA CTOAAGAAGT TTTACTGAOG 60 

TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTOSTTC TGGTAACATC CCTGGAAGAA 120 

GATACTGATA ATTCCAGTTT GTCACCACCA CCTaAGGTTQ AAACAACAAG CCTCAATGAT 180 

GTTACTTTAA GCTTACTCCC TTCAAACGAA ACAGGOSTCA AACCCCAQAO AAATATCTGC 240 

AATTTGTCAT CTATTTGCAA TGACTCAGCA TTTTTTAGAG OTQAQATCAT QTTTCAATAT 300 

QATAAAOAAA GCACTSTTCC CCAGAATCAA CATATAAC6A ATGGCACCTT AACTGGAGTC 360 

CTGTCTCTAA GTGAATTAAA ACGCTCAGA6 CTCAACAAAA CCCTGCAAAC CCTAAGTGAO 420 

ACTTACrTTA TAATGTOTGC TACAGCAQAG GCCCAAAGCA CATTAAATTG TACATTCACA 480 

ATAAAACTGA ATAATACAAT 6AATGCATQT GCXGTAATAG CTGCTTTGGA AAGAGTAAAG 540 

ArraOACCSUl TGOAACACTO Clt a Cl' O TTCT aTCAQOATAC CCTO OCCITC CTCCCCA6AA 600 

GA8TTGGAAA AGCTTCAGT6 TGACCTGCAO QATCCCATTO TCTGTCTTGC TGACCATOCA 660 

OGTGGCCCAC CATTTTCTTC CA6CCAATCC ATCCCAGTX5G TGCCTOGGGC CACTGTGCTT 720 

TCCCAGGTCC CCAAAGCTAC CTCTTTTGCT GAGCCTCCAQ ATTATTCACC TGTGACCCAC 780 

AATGTTCGCT CTCCAATAGG GGAGATTGAA CCOCXTTCAC CCCA6CCTTC AGCTCCCATA 840 

QCTTOCAOCC CTGCCATTGA CATGCCCCCA CAGTCTGAAA GGATCTCTTC CCCTATGCOC 900 

CAAACCCATG TCTCOGGCAC CX3CACCTCCT GTGAAAGCCT CATTTTGCTC TCCCACC3GTG 960 

TCTGCCCCTG OQAATGTCAA CACTACCAGC GCAOCTCCTO TCCAQACAGA CATCGTCAAC 1020 

ACCAGCAOTA TTTCTGATCT TGAGAACCAA GTGTTGCAGA TGQAGAAGGC TCTGTCCTTG 1060 

GOCAGOCTQG A6CCTAACCT OGCAGQAGAA ATGATOUICC AAGTCAGCAG ACTCCTTCAT 1140 

TCOOOOOCTQ ACATGCTG6C CCCTCTGGCT CAAAQATTQC TGAAAGTAGT GQATQACATT 1200 

GGGCXAGAQC TOAACTTTTC AAACAOOACT AXAAOTCTAA CCTCCCCTTC TWm QC X C iq 1360 

GCXQTGATCA GAGTQAATQC GAOTAOTTTC AACACAACTA CCrTTGTGOC GCAAOACCCT 1320 

GCAAATCTTC AOOTTTCTCT GGAAAGCCAA GCTCCTQAOA ACAOTATTQG CACAATTACT 1380 

1144 
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PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CTTCCTTCAT 
CAGTTCAATT 
CTtSVTCAGCT 
AA OOTSACAQ 
GTATTTTGGG 
GTCAAAGACA 
GTTCTGCTGG 
ATTACATATA 

GCTCTGCTTC 
CAAGGCCTCT 
TGOATGQGCC 
ATCC6AAAAT 
ACCATCATCC 
AATGGTTCAC 
GTGGGATATT 
CAGCTCTGTC 
GACCTCAGGA 
TTTGCCTGGG 
CAAGGATTTT 
AGGCOGTATC 
GAGAGGAATG 
GGAAAACAGC 
CTCA6AM3GA 



CGCTGATGAA 
TTTTTGAAAC 
ACGTCATATC 
TCACATTAAA 
ACTTGGGCAG 
G6AGATT6AA 
ACXTTATCTAG 
TTGGTTGTGG 
AAAAGATCCO 
TGCTGAACCT 
GCATCTCAGT 
TAGAAGCATT 
ACATCCTTAA 
T6ACTATATC 
0CX3ATQACTT 
TCTGTGTQAT 
GAATTAAAAA 
GTATCGCTGO 
GACCAGTTAA 
TCATATTCAT 
TTTGTTGTGO 
GGGTCTCTTT 
ACATGTTTAA 
CTTCAAAGGS 



TAATTTACCA 
ACCTGCTTTG 
ATOGAGTGTT 
GCa CATCA AC 
AAATGGTQGC 
TCAAACCATC 
GACATCTGTG 
GCTTTCATCA 
GAGGGATTAC 
GGTCTTCCTC 
6GCTGTATTT 
CXyVTATGTAC 
ATTCT6CATT 
GOCAGATAAC 
CTGCTGGATC 
ATTTTTGCTQ 
GAAGAA6CAA 
CCTTACATTT 
CX5TGACCTTC 
CTTTTACTGT 
AAAGTTA06G 
TAGTGTTCAG 
aSAGAAGQAA 
66GAAGCTTA 



GCTCATGACA 
TTTCAGGATC 
GCAAAOCTGA 
C0QA6CCAGQ 
AGAGG AQGC T 
TGTACCTGTA 
CTGCCTQCTC 
ATTTTTCTOT 
CCTTCCAAAA 
CTGGACTOGT 
CTTCATTATT 
CTGQCCCTTG 
GTOGOTTGGG 
TATG6GCTTG 
AACAACAATO 
AACGTCAGCA 
CTGGQAGCCC 
TTACTGGGAA 
ATGTATCTGT 
QTQGCCAAAG 
CTGGCTGAAA 
AATGGAGATG 
GATTCCTGCA 
CACFTTATTG 



TGGAGCTAGC 
CTTCCCrGGA 
COGTCAGGAA 
ATQA6TTAAC 
GGTCAGACAA 
6CCATCTAAC 
AAATQATGQC 
CAGT6ACTCT 
TCCTGATTCA 
GGATTOCTCT 
TTCTCTTGGT 
TCAAAGTATT 
GG6TACCAGC 
GATCCTATGO 
CAQTATTCTA 
TGTTCATTOT 
AGC GAAA AAC 
TAACTTGGGG 
TTGOCATCTT 
AAAAT0TCA8 
ATTCTGGAAA 
TGTOCCrTCA 
ATGGGAAAGG 
AGGAAATGTG 



TTCCAGGGTT 
GAACCTCTCT 
CTTGACAAGA 
AGT6A0ATGT 
TGGCTGCTCT 
AAGCTTOQQC 
TCTGACOTTC 
TGTAACCTAC 
GCTGTGTGCT 
GTATAAQATG 
CTCATTCACA 
TAATACTTAC 
TGTGGTT6T6 
QAAATTCOCC 
GATTAOGGTG 
GGTCCTGGTT 
CAGTATTCAA 
CTTTGCCTTC 
TAATACCtTA 
GAAGCAATG6 
TGCTTCTACA 
CQATTTCACT 
COQTATGGCT 
A 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2S60 
2640 
2700 
2760 



Seq ID HO: 490 Protein secmence 
Protein Accession #t Eos sequence 



MVF8VRQCGR 
VTLSLLPSNB 
I.STtSKTtKRSB 
IRPMEHCCCS 
SQVPKATSFA 
QTRVS6TPPP 
GSUSPNLAGB 
AVIRVNASSP 
QFNFFETPAL 
VFWDIiGRNGG 
ITYXGOGLSS 
QGIiCISVAVP 
TIILTI8PDN 
QLCRIKKKKQ 
QQPPIPIFYC 
GRQHMFNEKE 



11 

1 

VGRTEEVLLT 
TGVKPQRNIC 
UJKTLQTLSE 
VRIPCPSSPE 
EPPDYSPVTH 
VKASFS6PTV 
NINQVSRIiliH 
NTTTFVAQDP 
FQDPSLENIiS 
RG6WSDNGCS 
ZFZiSVTLVTy 
IjRYFIjLVSFT 
YGLG5YGKFP 
LGAQRKTSIQ 
VAXENVRKQN 
DSCaVGKGRMA 



21 
I 

FKIFLVIICL 
NLSSICNDSA 
TYPIMCATAE 
EL6KLQa)IiQ 
NVPSPIGEIQ 
SAPANVNTTS 
SPPDMLAPIiA 
ANLQV8LETQ 
LISYVISSSV 
VKDRRLNETX 
lAFEKIRRDY 
MMGLBAFBMy 
NOSPDDFCWZ 
DLRSIAGLTP 
RRYLCCX5KLR 
LRRTSKROSL 



31 
I 

HWLVTSLEE 
FFRGEIMFQY 
AQSTUICTFT 
DPIVCLADHP 
PLSPQP8API 
APPVQTDIVN 
QRLLKWDDI 
APQ7SIGTIT 
ANLTVRNLTR 
CTCSHLTSPG 
PSKILIQLCA 
LALVKVFNTY 
NNNAVFYITV 
LLGITWGFAF 
LAEMSGNAST 
HFIEQM 



Seq ID NOi 491 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence t 1 . . 3045 



41 
I 

DTDNSSliSPP 
DKESTVPQNQ 
IKLNNTMNAC 
RGPPFSSSQS 
ASSPAIDNPP 
TSSISDLQIQ 
GLQLNF8NTT 
LPSSLKNNLP 
NVTVTLKHIN 
VLLDLSRTSV 
ALLLLNLVFL 
IRKYILKFCI 
VGYFCVIPLI* 
FAWGPVNVTP 
ERNGVSFSVQ 



51 
I 

PEVETTSLND 
HITNGTLT6V 
AVIAALERVK 
IPWPRATVL 
QSETISSPMP 
VLQMEKALSL 
I8LT8PSLAL 
AHDMEIiASRV 
PSQDBLTVRC 
LPAQMMALTP 
LDSWIALYKM 
VGWGVPAVW 
NVSMFIWLV 
MYLFAIFNTL 
NGDVCLBDPT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



1 11 21 31 41 51 

I I I I I I 

ATGGTTTTCT CTGTCAGGCA GTGT6GCCAT GTTGGCAGAA CTQAAGAAGT TTTACTGACG 60 

TTCAAGATAT TCCTTGTCAT CATTTQTCTT CATGTCXITTC TOGTAACATC CC TGQAAG AA 120 

GATACTGATA ATTCCAGTTT GTGACCACCA OCIGCTAAAT TATCTGTTGT CAOrmGCC 180 

CCCTCCTCCA ATGAGGTTGA AACAACAAGC CTCAATGATQ TTACTTTAAG CTTACTOCCT 240 

TCAAAOQAAA CAGAAAAAAC TAAAATCACT ATAGTAAAAA CCTTCAATGC TTCAGGOGTC 300 

AAACCCXaGA 6AAATATCTG CAATTTGTCA TCTATTTGCA ATGACTCAGC ATTTTTTAGA 360 

G6TQAGATCA TQTTTCAATA TGATAAA6AA A6CACTGTTC CCCA6AATCA ACATATAAGG 420 

AATGGCAOCT TAACTGGAGT GCTGTCTCrA AGTGAATTAA AAOGCICAOA GCICAACAAA 480 

ACCCTGCAAA OCCTAAGTQA GACTTACTTT ATAATGTGTG CTACM3CAGA GGCCCAAAGC 540 

ACATTAAATT OTACATTCAC AATAAAACTG AATAATACAA TQAATGCATQ TGCTGCAATA 600 

GCCGCTTTGG AAAGAGTAAA GATTOOAOCA ATGOAACACT GCTGCTGTTC TGTCAGGATA 660 

CCCTGCCCTT CCTCCCCAGA AGAQTTGGGA AAGCTTCAGT GTGACCTGCA GGATCCCATT 720 

GTCTGTCTTG CTGACCATCC A0STG6CCCA CCATTTTCTT OCA GCCA ATC CATOOCAGTG 780 

QT Q CCTOGQG CCACTOTOCT TTOCCRQOTC CCCAAAGCTA OCTCTTTTGC 1QAGCCTOA 840 

GATTATTCAC CTGTGACCCA CAATGTTCCC TCTCCAATAG GGGA6ATTCA ACCCCTTTCA 900 

CCXXAGCCTT CAGCTCCCAT AGCTTOCS^ CCTOCCATTQ ACATGCCCCC ACAQTCTQAA 960 

AOGATCTCTT CCCCTATGCC CCAAAGCCAT GTCTOOGGCA CCCCACCTCC TCTGAAAOCC 1020 

TCATTTTCCT CtOCCACGOT GTCTGCCCCT OCQAATGTGA ACACTACCAG OOC ACCTO CT 1080 

OTCCAGACAO ACATOOTCAA CACCAOCRGT ATTTCTOATC TTGAOAAOCA AGTGTTGC3U3 1140 

ATGGAOAAGG CTCTOTCCTT GGGCAGCCIG GAGCCTAACC TOQCAGGAGA AATQATCAAC 1200 

CAAGTCAQCA OACTCCTTCA TTCCCOGCCT GACATGCTGG CCCCTCTGGC TCAAAGATTG 1260 

CTGAAAGTAG TGGATGACAT TGGCXrXACAG CTGAACTTTT CAAACACQAC TATAAGTCTA 1320 

AOCTCCCCTT CTTTGGCTCT GGCTGTQATC AGAGTGAATG CCA6TA0TTT CAAC ftCAA CT 1380 

ACC T TT G TG G COCAAGACCC TGCAAATCTT CAGGTTTCTC TGGAAACCCA AOCTCCTGAG 1440 

AACAGTATTG GCACAATTAC TCTTCCTTCA TOGCTGATOA ATAATT TACC AGCTCA TGAC 1500 

ATGGAOCTAG CTTCCAGGGT TCAGTTCAAT TTTTTTGAAA CACCTGCTTT GTTTCAGGAT 1560 

CCTTCCCTQQ AOAACCTCTC TCTGATCAGC TAOQTCATAT CAT0QAGT6T TGCAAACCTG 1620 

AC06TCAO0A ACTTGACAAO AAAOOTQACA OTCACATTAA AGCACATGAA CCCGAGCCA6 1680 

GATGAOITAA CAGTOAGATO TGTATTTTGO GACirGOaCA QAAATGOrGG GAQAGGAQGC 1740 

TGOTCAOACA ATGGCTGCTC TQTCAAAGAC AOGAQATTGA AT8AAA0CAT CTGTACCTGT 1800 

AGCXaXCTAA CAAGCTTCOG CXS ' n ' Cl ' GC l' G GACCTATCTA GGACATCTOT GCTOCCTOCT 1860 

1145 
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CAAATGATGG 
TCAGTGACTC 
ATCCTCATCC 
TGOATTOCTC 
TTTCTCTTGO 
GTCAAAGTAT 
GGGOTACCAa 
GGATCCTAT6 
GCAGTATTCT 
ATGTTCATTO 
CAGCGAAAAA 
ATAACTTGGG 
TTTGCCATCT 
GAAAATGTCA 
AATTCT6ACT 
GTGTCCAGCT 
CTAQTQAATA 
AATGGGGTCT 
GAGGACA3GT 
AGGACnCAA 



CTCTCAOGTT 
TTGTAACCTA 
AGCTGTGTGC 
TGTATAAGAT 
TCTCATTCAC 
TTAATACTTA 
CTOTGGTTGT 
GQAAATTCCC 
ACATTAOGQT 
TOGTCCT GG T 
CCAGTATTCA 
OCTTTGCCTT 
TTAATACCFT 
GOAAGCAATO 
GGAGTAAAAC 
CTTCAAATTC 
ATQATTGCTC 
CrTTTAGTGT 
TTAACOAOAA 
AGOGGGGAAO 



CATTACATAT 
CATAOCTTTT 
TGCTCTGCTT 
GCAAOGCCTC 
ATGGATC3GGC 
CATCOQAAAA 
GACCATCATC 
CAATGG7TCA 
GGTGOGATAT 
TCAGCTCTGT 
AQACCTCAGG 
CTTTGCCTGG 
ACAA6QATT7 
GAG60G0TAT 
T6CTACTAAT 
CTTACAOTCA 
AGTACAOGCA 
TCAOAATOOA 
GGAA6ATTGC 
CTTACACTT7 



ATTGGTTGTG 
GAAAAGATCC 
CTGCTGAACC 
TGCATCTCAO 
CTAGAAQCAT 
TACATCCTTA 
CTGACTATAT 
CaSGATGACT 
TTCTGTGTGA 
06AATTAAAA 
AOTATCGCTG 
GGACCAGTTA 
TTC3VTATTCA 
CTTTOTTGTG 
GGTTTAAAGA 
AGCAGTAACT 
AGCXSGGAATO 
GATGTOTGOC 
TGCAATGGGA 
ATTGA6GAAA 



GGCTTTCATC 
G6AGGGATTA 
TGGTCTTCCT 
TGGCTGTATT 
TCCaXATOTA 
AATTCTGCAT 
CCCCAGATAA 
TCTGCTGGAT 
TATTTTTGCT 
AGAAGAAGCA 
GCCTTACATT 
ACXSTGACCTT 
TCTTTTACTG 
OAAAGTTAOG 
AGCAOACTGT 
CCACTAACTC 
GAAATGCTTC 
TTCAG6ATTT 
AAGGC3G6TAT 
TOTOA 



AATTTTTCXX5 
CCCTTOCAAA 
GCTGGACT08 
TCTTCATTAT 
CCTGGCCCTT 
TGTCGGTTGG 
CTATGGQCTT 
CAACAACAAT 
GAACGTCAGC 
ACTGGQAGCC 
TTTACTGG6A 
CATGTATCTG 
TGTGGCCAAA 
6CTGGCT6AA 
AAACCAAGGA 
CACCACACTG 
TACAGA6A03 
CACTGGAAAA 
GGCTCTCM3A 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2S20 
2580 
2640 
2700 
2760 
2820 
2680 
2940 
3000 



Seq ID NO J 492 Protein sequence 
Protein AcceBSlon #i Eos sequence 



MVFSVRQCGH 
PSSNBVETTS 
GBIMFQYDKB 
TLNCTFTIKL 
VCLAOHPR6P 
PQPSAPIA88 
VQTDIVNT8S 
LKWDDIGLQ 
NSZGTITXiPS 
TVRNLTRNVT 
SHLTSFGVLL 
ILIQLCAALL 
VKVPNTSriRK 
AVFVITWGY 
ITWGPAFFAW 
NSDHSKTAH7 
NOVSFSVQNQ 



11 

1 

VGRTEEVIiLT 
LNDVTIiSIiLP 
8TVPQNQHZT 
NNTKMAGAAI 
PPS88QSZPV 
PAIOMPPQSB 
ISDLENQVIiQ 
LNFSNTTISb 
SLKNMLPAHD 
VTLXRINP8Q 
DliSRTSVLPA 
LLNLVFUiDS 
YZLKFCZVGH 
FCVZPLLNVS 
GPVNVTFMYL 
OLKKQTVNQO 
DVCLHDFTGK 



21 
I 

PKIFLVIICL 
SNETBKTKIT 
NGTLTGVLSIi 
AALERVKIRP 
VPRATVLSQV 
TISSPMPQTH 
KEKAIiSLGSL 
TSPSXiALAVI 
MELASRVQPN 
DELTVRCVPW 
QMMALTFITY 
WIALYKMQGL 
GVPAVWTII 
MFIWLVQLC 
FAIFNTLQGF 
VSSSSKSLQS 
QHMFKEKEDS 



31 
I 

HWLVTSIjEE 
IVKTFMASGV 
SELKRSELNK 
MEHCCCSVRI 
PKATSFAEPP 
VSOTPPPVKA 
EPNLAOEMIN 
RVNASSRITT 
FFETPALFQD 
DLGRNGGRGO 
lOCGLSSIFL 

cisvAVFunr 

LTISFDNYGL 
RZKKKXQIiGA 
FIFIFYCVAK 
SSNSTNSTTL 
CNGKGRMALR 



41 

I 

DTDNSSLSPP 

KPQRNianjS 

TLOTLSETYF 
PCP68PBBU3 
DYSPVTHNVP 
8FSSPTV6AP 
QVSRLLH8PP 
TFVAQOPANL 
PSLQILSLXS 
WSDNGCSVKD 
SVTLVTYIAF 
FLLV8FTHMa 
aSYOKPPNGS 
QRRTSZQDLR 
ENVRKQWRRY 
LVNNDCSVHA 
RTSKRGSLHP 



51 
I 

PAKL8W8FA 
SXCSmSAFFR 
ZNCATAEAQ8 
KLQCDIiC^PX 
SPIGEXQPLS 
ANVNTTSAPP 
DMLAPLAQRL 
QVSLETQAP6 
YVZSSSVAML 
RRI.NBTZCTC 
BKIRRDYPSK 
LBAFHMYLAL 
PDDFCWIKMN 
SZAGLTFLLO 
LCCGKZiRXAB 
SGNGNASTER 
ZEQM 



60 
120 
180 
340 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



Seq ID NOi 493 DNA sequence 
Nucleic Acid Accession ftt mjil5501 
Coding sequence! 241.. 1902 

1 11 21 31 41 51 

i I I I I I 

COGCAGAGGA GCCTOGGCCA GGCTAQCCAG GOOGOCCCCA GCCCCTCCCC AGGCOGCGAG 
COCCCCTGCC GCOOTGCCTG GCCTCCCCTC CCAGACTGCA GGQACAGCAC CCGGTAACTG 
08AGTGGAGC GGAGGACCOQ AGOGGCTGAO QAOAQAGGAO OCGGOGGCTT AGCT6CTA0G 
GGGTCGGOCC GGOGCCCTCC OQAGGG GG QC TCAGGAG6AG GAAG6AGGAC CCG TGOGAGA 
ATGOCTCTGC CCTGGAGCCT TGCXSCTCCOG CTOCTGCTCT OCTG60TGOC AG6TGGTTTC 
GGGAAOGOGO CCAGTGCAAQ 6CATCA0QGG TTOTTAOCAT OGGCAOGTCA GCCTGGGGTC 
TGTCACTATG GAACTAAACT GGCCTGCTQC TAaSGCTGOA GAAGAAACAG CAAGGOAOTC 
TGTGAAGCTA CATGG6AACC TGGATGTAAG TTTGGTGAGT GC9GTGGGACC AAACA AATGC 
AGATGCTTTC CA6GATACAC OGGGAAAACC TGCAGTCAAG ATGTGAATQA QTQ TQQAATq 
AAACCCCGGC GATGCCAACA GAaATGTGTO AATACACAOQ GAAGCTACAA GTGCTTTT6C 
CTCAGTGGCC. ACATGCTCAT 6CCAGATGCT AOGTGTGTGA ACTCTAGGAC ATGTGCCATG 
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAOAAGGGC CACAGTGCCT GTGTCCATCC 
TCA66ACTCC GCCTQ6CCCC AAATGGAAGA GACTGTCTAO ATATTGATGA ATGT6CCTCT 
GGTAAAOTCA TCTGTCCCTA CAATOGAAOA TGTGTSAACA CATTTGQAAO CTACTACT6C 
AAATGTCACA TTQGTT T OOA ACTGCAATAT ATCSUnOGAC GATATGACT6 TATAQATATA 
AATGAATGTA CTATGGATAG CCaVTACX?rGC AGCCACCATQ CCAATTGCTT OUITAOCCAA 
GGGTCCTTCA AGTGTAAATQ CAAQGAGGOA TATAAAGGCA ATGGACTTCG GTOTTCTGCT 

atccctgaaa attctgtqaa ggaagtcctc agagcacctg 6taccatcaa aqacalsaatc 
aagaagttgc ttgctcacaa aaacagcato aaaaaoaaoo caaaaattaa aaatgttaoc 
ccagaacxx:a ccaggactcc tacccctaao gtoaacttgc agoccitcaa ctatgaagag 
atagtttcca qaggcggoaa ctctcatgga ggtaaaaaaq ggaatgaaga gaaaatgaaa 

GAGGGGCTTG AGGATGAGAA AAOAGAAQAG AAAOCXXrTOA AQAATGACAT AGAGGAGCX3A 
AGCCTGCGAO QAOATQTGTT TTTCCCTAAO GTQAATGAAG GAGGTGAATT 06GCCTGATT 
CTGGTCCAAA GGAAAGOGCT AACTTCCAAA CTGQAACATA AAOATTTAAA TAT CTOGG TT 
GACTGCAGCT TCAATCATG6 GATCT6T6AC TGGAAACAG6 ATAGAGAAGA TGATTTTGAC 
TGGAATCCTG CTGATOGAGA TAATGCTATT GGCXTCTATA TG6CA0TTCC G0CCTTG6CA 
GGTCACAAQA AAOACATTGa CCmTTGAAA CTTCTCCTAC CTGACCTGCA ACXXXaAAGC 
AACrrCTGTT TGCTCTTTGA TTACGGGCTO GCOGGAGACA AAGTGGGGAA ACTTC6A6T0 
TTTGTGAAAA ACAOTAACAA TOCOCTQGCA IGGGAGAAGA CCAOQAGTGA GGATG AAAAQ 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATO CTACCAAAAO CATCATTTTT 
GAAGCAOAAC GTG6CAAGGG CAAAAOOQGC GAAATOQCAO TOGATGGCGT CTTGCTTGTT 
TCAOGCTTAT GTCCAQATAQ CCTTTTATCT GTGQATGACT GAATGTTACT ATCTTTATAT 
TTGACTTTOT ATGTCAGTTC CCTGGTrTTT TTOATATTGC ATCATAQOAC CTCTGG CATT 
TTAGAATTAC TAGCTGAAAA ATT6TAATGT ACCAACAQAA ATATTATTOT AAGATGGCTT 

1146 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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TCTTOTATAA GATATGCCAA TATTTQCTTT AAATATCATA TCACTOTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAQTTTAT CTCCCCTCCT 2160 

CAOTATATCT GATTTOTATA AGTAAGTT6A TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACA6AGA AATGTTTAAC TGTTTGACTC TTATQATACT TCTT6GAAAC 23B0 

TATOACATCA AAOATAGACT TTTGGGTAAO TG6CTTAGCT OGGTCrTTCA TAGCCAAACT 3340 
TGTATATTTA AATTCTTTGr AATAATAATA TGCAAATCAT CAAAAAAAAA AAAAAAAA 



_ Seq ID NOt 494 Protein Bequence 
10 Proteia Accessioa ftt NP_0S6322 

1 11 21 31 41 51 

I i I 1 I I 

, _ NPZiFHSLALP LLLSHVAGOF GNAASARHBO LIiASARQPGV CKYGTKXACC YQHRSNSKGV 60 

IS CEATCBPGCK FGECVOFNKC RCFFGYTOKT CSQDVNBGGM KPRPOQHRCV NTRGSYKCPC 120 

ltS6HMU4PDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGUILAPNGR DCLDZDECAS 180 

GKVICPYNRR CVNTPGSYYC KCHIGFBLQY ISGRYDCIDI NBCTHDSHTC SHHANCFNTQ 240 

GSFKCKCKQG YKGMGLRC8A IPGNSVKEVL RAPGTZKDRI KKLLAKKNSM KKXAKIKNVT 300 

PBPTRTPTPK VNLQPFNYEB ZVSROGNSKa GKKGtlEBKMR B6LEDBKRBB KALKNDIEER 360 

20 SIiRGDVFFPK VNEAGEFdil LVQRKALT8K LEHXDUnSV DC8FNKGICD HKQDREDDFD 420 

WNFADRSMAI GFYMAVPALA GHKXDIGRLR LLLFDLQPQS NFCUiPDYRL AQDKVGKLRV 480 

FVKNSroiALA WEKTTSEDBK NKTGKIQLYQ GTIIATXSIIP BAERGKGKIG EIAVDGVLLV 540 
SGLCPDSLLS VDD 

25 Seq ID NOt 49S DNA sequence 

Nucleic Acid Accession fit NM_003506.1 
Coding sequencei 259.. 2379 

I 11 21 31 41 51 

30 I I I I I I 

GCAGCTCCAG TCCCGGACX3C AACCCCGGAG CCOTCTCAOO TCCCTGGGGQ GAACGGTGGG 60 

TTAQACOGGG AOQGGAAGGG ACAGOQGCCT TCGACCGCXTC CCCGAGTAAT TQACCC AGGA 120 

CTCATTTTCA GGAAASCCTG AAAATQAOTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 

ATCTTTGQAT GGGQATCTTC TGAGGAT6CA AAOAGTGATT CAT0CAA60C ATOTGOTAAA 340 

35 ATCAQGAATT TGAAQAAAAT GGAGATOTTT ACATTTTTGT TGACX3TGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACAOTCT CTTCACCTGT GAACCAATTA CTGTTCCCAO ATGTATGAAA 360 

ATGGCCTACA ACATGAOGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 

GCGQTGGAAA TGGAOCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 460 

ACTTTCCTCr 6CAAAGCATT T6TACCAACC TGCATAGAAC AAATTCATGT GQTTCCACCT 540 

40 TGTGGTAAAC TTTGTQAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CAGTTTTGGG 600 

ATCCGATGGC CTGAGQAOCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTOTTCCT 660 

GTAACTTTTG ATCCACACAC AGAATTTCTT GQTCCTCAGA AQAAAACAGA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAO 780 

TTTCTGGGAA TTGSACCAGTG TGOGCCTCCA TGCCCCAACA TGTATTTTAA AAGTOATGAQ 840 

45 CTAGAGTTTG Ca^AAAAGTTT TATTQGAACA OTTTCAATAT TTrGTCTTTG TGCAACTCTG 900 

TTCACATTCC TTACTTrrTT AATTQATGTT AOAAGATTCA GAXACCCAQA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020 

GGOQATAGCA CA60CTGCAA TAAGGCAGAT GAGAA6CTA0 AACTTGQTOA CACTGTTGTC 1080 

CTAGGCrCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 

50 6CTGGCACTG TGTOGTGGGT QATTCTTACC ATTACTTGGT TCTTAQCTGC AGOAAGAAAA 1200 

TGGAOTTGTG AAGCCATCGA GCAAAAAGCA GTGTQGTTTC ATGCTGTTGC ATOGGQAACA 1260 

CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 

GGAGTTTGCT TTGTTGGCCT TTATQACCTG GATGCTTCTC GCTACTTTGT ACTCTT6CCA 1380 

CTGTGCCTTT QTGTGTTTGT TGGGCTCTCT CTTCTTTTAO CTGGCATTAT TTGCTTAAAT 1440 

55 CATGTTCX3AC AAGTCATACA ACATGATGGC OGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTOGAATTQ GAGTCTTCAG aSGCTTGTAT CTTGTGCCAT TAOTGACACT TCTCGGATGT 1560 

TA06TCTAT0 AOCAASTGAA CAGGATTACX: TGGGAQATAA CTTGGGTCTC TQATCATTGT 1620 

CGTC3U6TACC ATATCCCATG TCCTTATCAO GCAAAA6CAA AAGCTOGACC AQAATTGGCT 1680 

TTATTTATGA TAAAATACXTP GATGACATTA ATTOTTGGCA TCTCTGCTOT CTTCTGGGTT 1740 

60 GGAAGCAAAA AGACATGCAC AGAAT6GGCT GGGTTTTTTA AA06AAATG0 GAAGAGAGAT 1800 

OCAATCAGTQ AAAGTOGAAG AGTACTACAO GAATCATOTO AGTTTTTCTT AAAGCACAAT 1860 

TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAQGTCATT 1920 

TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TQCAGTAGCA 1980 

^ ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAOAA 3040 

65 ACATCAATGA GAGAGGTGAA A6CX3GACGGA GCTAGCACXX: CCAGGTTAAG AGAACA6GAC 2100 

TQTGGTGAAC CT0CCT06CC AGCAOCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160 

GQGAAOGGCC AGGCAGGCAG T6TATCTGAA AGTGOGCGGA GTGAAGGAAG GATTAGTCCA 2220 

AAQAGTGATA TTACTGACAC TGGCCT O GCA CAGAQCAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAGAACCAA CCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCAC CXIAGT TTCAGGAGTG 2340 

70 AGAAAAQAGC AGGGA6GTGG TTGTCATTCA GATACrTGAA GAAGATTTTC 'I'CI'CGI TACT 2400 

GAGAAOCAAA TTTGTQTTAC ACTQaAAOTQ AOCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

TAOOTTCTTC TTTTeCACPT AAAtSTTGCAT TGOCTACTGT TATACTGGAA AAAATAGAQT 2520 

TCAAQAATAA TATOACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 

AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

75 CCTTTTCTAT TTATGAAGAT TCTACTCTTG QTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

TTTACCITTT TOATATAAAA TCAAGATATT TCTTTGCTGA AGTAT TTAAA TCTTATCCTT 2760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATOTAT TTGAA CTTTT TTGAAATCCT 2820 

ATTCAAOTAT TTTTATCATG CTATTGTX3AT ATTTTAGCAC TTTGOTAGCT TTTAC3VCTGA 2680 

ATTTCTAAGA AAATTGTAAA ATAOTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 

80 GTCTTATAAT AGGAATTTAA CTTTAAAAAC CXaVCTTATTG ATAOCTTACC ATCTAAAATG 3000 

TGTGATTTTT ATAQTCTOGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060 

GGiaCTTACT CftAAQAOTOT OCACTATTOA TTOTATTATO CTGCTCACTQ ATCCTTCTGC 3120 

ATATTTAAAA TAAAATGTCC TAAAGGGtTA OTAOACAAAA TOTTAOTCTT TTGTAT ATTA 3180 

OGCCAAQTGC AATTGACTTC CCTTTTTTAA TOTITCATOA CCACCCATTG ATTOTATTAT 3240 
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AACCACTTAC AOTTGCTTAT ATTTTTrGTT TTAACTTTTG TTTCTTAACA TTTAOAATAT 3300 
TACATTTTOT ATTATACftGT AOCTTTCTCR GACATTTTOT AO 

Seq ID MO: 496 Protein eequence 
Protein Accession «t NP_003497.1 

1 11 21 31 41 51 

I I i I ! I 

MEMFTPLLTC IFLPLLRGHS LPTCBPITVP RCMWAYNMT FPPNU1GHYD QSIAAVEMEH 60 

PLPLANLECS PNIETPIiOCA FVPTCIEQIH WPPCRKLCB RVnTSDOCKIiI DTFGIRWPEE 120 

LECDRLQYCD ETVFVTFDPH TEFI<GPQKKT BQVQRDI6FW CPRRLRTSGG QOYXPX^IOQ 160 

CAPPCPNMYF KSDELBPAKS PIGTVSIPCL CATLPTPLTP LIDVRRPRYP ERPIIYY8VC 240 

YSIVSLMYPI OPLLGDSTAC NKADBKLBLG DTVVLGS<»nC ACTVLFMLLY PPTMAGTVWH 300 

VZLTZnfFLA AGRKWSCBAX EQKAVWFHAV AWGTPGPLTV MLIiALKKVEQ DNISGVCFVG 360 

XiYDLDASRyP VLLFLCLCVP VGLSLLLA0I XSLNHVHQVI QHDGRNQBKL KKFHIRIGVP 420 

SGLYLVPLVT I*LGCmBQ!V NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY 480 

LMTLIVQISA VFWVGSKKTC TEWAQPFRRN RKRDPISESR RVLQBSCBPP LKHNSKVKHK 540 

KKHYKPSSHK LKVISKSM6T STGATANHGT 8AVAITSHDY LGQETLTBIQ TSPFTSMREV 600 

KADGASTPRL RBQD06EPAS PAASISRI*SG EQVD6KGQAG SV8SSAR8BQ RISPKSDITD 660 
TGLAQSNZtLQ VPSSSEPSSIi RGSTSIfLVHF VS6VRKBQG6 GCHSDT 

Seq ID NOt 497 DNA sequence 
Nucleic Acid Accession NM_005046 
Coding sequencet 16.. 777 

1 11 21 31 41 51 

I I 1 I I I 

GGATTTCOGG GCTCCATGGC AAGATCCCTT CTCCTGCCCC TGCAQATCCT ACTGCTATCC 60 
TTAGCCTTGO AAACTGCAGQ AGAAOAAOCC CAGGG1QACA AGATTATT6A TGGGGCCCCA 120 
TOTGCAAOAG GCTCCCACCC ATGQCAGOTO GCCCTGCTCA GTGGCAATCA QCTCCACTOC 160 
GGAGGOGTCC TG6TCAATGA 6C0CTGGGTG CTCACTGCOj CCCACTGCAA GATGAATGAG 240 
TACACCGTOC ACXriGGGCAQ TGATACGCTG GGOGACAGGA QAGCTCAGAG GATCAAGGCC 300 
T06AA0TCAT TOOGCCACCC CGGCTACTCC ACACAQACCC ATGTTAATQA CCTCATGCTC 360 
GTGAAGCTCA ATAQCX31GGC CAG6CTGTCA TCCATGGTGA AGAAAGTCAO GCTGCOCTCX: 420 
OGCTGOGAAC CCCCTGGAAC CACCTGTACT GTCTCCGGCT GGG6CACTAC CAGGAGCCXIA 480 
GATGTGACCT TTCCCTCTOA CCTCATOTGC OTOGATGTCA AGCTCATCTC CCCCCAGGAC 540 
TGCACGAAGG TTTACAAGGA CTTACTGQAA AATTCCATGC TGTGCGCTGO CaTCCCOGAC 600 
TCCAA6AAAA A06CCTGCAA TGGTGACTCA GGGGGACOQT TGGTGT6CAQ AGGTACCCTG 660 
CAAGQTCTGO TGTCCTGGGG AACTTTCCCT TGOGGCCSU^C CCAATGACCC A66AGTCTAC 720 
ACTCAAQTGT GCAAGTTCAC CAAGTGGATA AATGACAOCA TGAAAAAGCA TOGCTAACXtC 760 
CACACTGAGT TAATTAACTG TOTGCTTCCA ACAOAAAATG CACAGGAGTG AGGACX5C0GA 840 
TGACCTATGA AGTCAAATTT GACTTTACCT TTCCTCAAAG ATATATTTAA AOCTCATGCC 900 
CTGTTGATAA AOCAATCAAA TTGGTAAAGA CCTAAAACCA AAACAAATAA AGAAACACAA 960 
AAOCCTCAA 

Seq ID MOt 498 Protein sequence 
Protein Accession ftt 1IP^005037 

1 11 21 31 41 51 

I I I I I I 

HARSLLLPLQ ILLLSLALBT AGBBAQGDKI IDGAPCARG8 RPWQVALLSG NQLHOOGVLV 60 

NERMVLTAAH CKMNEYTVHL GSDTLGDRRA QRIKASKSFR HPGYSTQTHV NDLMLVKUIS 120 

QARLSSMVXK VRLPSRCBPP OTTCTVSGWG TTrSPDVTFP SDLMCVDVKL IfiPQDCTKVY 160 

KDLLarSHLC A6IPD8KKKA QIODSGGPLV CROTLQGLVS W6TFP0GQPN DPGVnQVCK 240 
PTKHINDTHX KHR 

Seq ID NO I 499 DMA sequence 
Nucleic Acid Accession tfi NM_007196 
Coding sequence I 162.. 962 

1 11 21 31 41 51 

I I I I I i 

GTTCCCAQAA GCICCCCAGG CTCTAGTGCA GGAGGA8AAG GAGGAGGAQC AGQAGGTGQA 60 

GATTOCCAOT TAAAAGQCTC CAOAATCGTO TACCAOOCAO AGAACTQAAO TACTGGGGCC 120 

TC C TCCACTG GGTCCGAATC AGTAGGTGAC CCCGCCOCTG OATTCTOGAA OACCTCAOCA 160 

TGOGACGCCC COGACCTCGT GOGGCCRAQA OQTG Q ATGTT CCTGCTCTTG CTGQGGG6AG 240 

CCTOGGCAGG ACACTCCAGQ GCACAGQAGG ACAAGGTGCT GGGGGOTCAT QAGT GCCAAC 300 

CCCATTOGCA GCCTTGGCAG GOGGCCTTBT TCCAQGGCCA OCAACTACTC T6TGG0GGTG 360 

TCCTT0TAG6 TGGCAACTGG GTCCTTACAG CTGCCCACra TAAAAAAC06 AAATACACAO 420 

TAOGCCTGGG AGACCACAGC CTACAGAATA AA6ATGGCCC AGAGCAAGAA ATACCTGTGO 460 

TTCAGTCCAT COCACACCCC TGCTACAACA GCAGCGATGT GGAGQACCAC AACCATGATC 540 

TGATGCTTCT TCAACTQOGT GACCAGGCAT CCCTGGGGTC CAAAGTGAAO CCCA TCAG CC 600 

TG6CAGATCA TTGCACCCAG CCrGGCCAOA AOTGCACOGT CTCAGGCTGQ GGCACTGTCA 660 

CCAGTCCCCG AOAOAATTTT CCTGACACTC TCAACTGTGC AGAAOTAAAA ATCTTXOCCC 720 

AGAAGAAOTG TGAGGATGCT TACCCGGGGC AOATCACAOA TQGCATGQTC TGT6CAGGCA 780 

GCAGCAAAGG GGCTGACAOO TGCCAGGGCQ ATTCTGGAGG CCCCCTGGTO TGTGATGOTG 840 

CACrOCAGGG CATCACATCC TGGGGCTCAQ ACCCCTOTGG QAOQTCCGAC AAACCTGGOQ 900 

TCTATAOCAA CATCTGCCGC TACCTOGACT GGATCAA6AA GATCATAGGC AGCAAGGGCT 960 
OATTCTAGGA TAAGCACTAO ATCTOCCTTA ATAAACTCAC AACTCTC 

Seq ID NOi 500 Protein sequence 
Protein Accession fti NP_009127 

1 U 21 31 41 51 
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I I I I I I 

MGRPRPRAAK TWMPLLLLGG AHAGHSRAQE DKVLGGHECQ PHSQPWQAAL FQQQQLLaSQ 60 

VliVGGNWVZiT AAHCKKFKYT VRIXsDHSIiQN KD6PEQEIFV VQSIPHPCm SSDVEDHNHD 120 

LMLLQLRDQA SIX3SKVKPI8 LADRCTQF6Q KCTVSGHQTV TSPRSNFPDT LNCABVRIPP 160 

5 QKKCEDAYPG QITDGMVCAO SSKOADTOQO DSGOPLVCDG AX^ZTSHQS DP06RSDXPG 240 
WTNICSYU) WIKKIIGSKG 

Seq ID NOt 501 DMA eequence 
Nucleic Acid Accession « t 11M_006103 
10 Coding sequences 29*. 406 

1 11 21 31 41 SI 

I I I 1 I I 

CACCTOCAOC C0GCC06GGC ATAOCACCAT GCCTGCTTC5T OOCCTAGGCC OQCTAGCCGC 60 

15 OGCCCrCCTC CTCAGCCTGC TGCTGTTCGG CTTCACCCTA GTCTCAGGCA CAGQAGCAGA 120 

GAAGACTGOC OTGTGCCCOO AGCTCCAGGC TGACCAGAAC TGCAC3GCAA0 AGTaOGTCTC 180 

GGACAGCXSAA TGCGCCGACA ACCTCAAGTG CTGCAGCGCG GGCTOTGCCA CCTTCTGCCT 240 

TCTCTGCCCA AATOATAAGG AGGGTTCCTG CXXXCAGGTG AACATTAACT TTCCCCAGCT 300 

^. CXSGCCTCItSr GGGQACCAGT GCCAGQTGGA CAGGCAGIOT CCTOGCGAGA TQAAATGCTG 360 

20 COGCAATGOC TGTGGGAAGO TGTCCIOTGT CS^CTCCCAAT TTCTGA GOTC CAGCCACCAC 420 

CAGGCTGAGC AGTGAGQAGA GAAAGTTTCT GCCTGGCCCT GCATCTGOTT CCAGCCCACC 460 

TGCCCTCCCC TTTTTCGGGA CTCTGTATTC CCTCTTGGQC TGACCACAOC TTCTCCCTTT 540 
CCCAACCAAT AAAOTAACCA CTTTCAGGAA AAAAAAAAAA AAAA 

25 Seq ID NOt 502 Protein sequence 
Protein Accession #•. NP_006094 

1 11 21 31 41 51 

I I I > I ' 

30 MPACRLQPIA AALLLSLLLP GFTLVSGTGA EKTGVCPBLQ ADQNCTQBCV 8DSGCADNI1K 60 

CC8AGCATPC LLCPNDKEGS CFQVNZNFPQ LGIiCRDQOQV DSQCFGQMKC CRliGOGKVSC 120 

VTFNF 

Seq ID NOt 503 DNA sequence 
35 Nucleic Acid Accession « I NM,002407 
Coding sequence t 65 . . 3 52 

1 11 21 31 41 51 

Ai^ \ I I I I I 

40 CCTCCACAGC AACTTCCTTG ATCCCT6CCA OGCACQACTO AACACAGACA OCAGCOeCCT 60 

CGCCATGAAG CTQCTOATGG TCCTCATQCT GGCQGCCCTC CTCCTGCACT GCTATGCAGA 120 

TTCTGGCTGC AAACTCCTGG AGGACATGGT TGAAAAGACC ATCAATTCCG ACATATCTAT ISO 

ACCTGAATAC AAAGAGCTTC TTCAAGAGTT CATAQACAGT OATQCCGCTG CAGAGGCTAT 240 

OOaOAAATTC AAOCAOTOTT TCCTCAACCA GTCACATAQA ACTCTGAAAA ACTTTGGACT 300 

45 GATGATCCAT ACAGTGTAOG ACAGCATTTG QTOTAATATO AAGAGTAATT AACTTTACOC 360 

AAGGOGTTTG GCTCAGAGGG CTACAGACTA TGGCCAGAAC TCATCTOTTQ ATTGCTAGAA 420 

ACCACTTTTC TTTCTTGTGT TGrCTTTTTA TGTGGAAACT GCTAGACAAC TGTTQAAACC 460 

TCAAATTCAT TTCCATTTCA ATAACTAACT GCAAATC 

50 Seq ID NO: 504 Protein sequence 
Protein Accession #1 NP_002398 

1 11 21 31 41 51 

..I I I I I I 

MKLU4VLMLA ALLUICYADS GCKIiLBDIiIVB KTINSDISZP BYKEXiLQBPI DSDAAAEAMG 60 
KFKQCFIjMQS BRTIiKNFGLM MHTVYDSItfC NMKSN 

Seq ID NO: 505 DNA sequence 
Nucleic Acid Accession ftt NM_014791.1 
OO Coding sequence: 171.. 2 12 6 

1 11 21 31 41 51 

I I I I I 1 

TT6GOQGGC6 GAAOGGOCCA CAACCCGGOQ ATOQAAAAGA TTCTTAGGAA CGCOSTACCA 60 

65 GCGGCGTCTC TCAGQACAGC AGGCCCCTGT CCTTCTOTCG GGCGCCBCTC AOCCQTGCCC 120 

TCCGCCCCTC AGGTTCTTTT TCTAATTCCA AATAAACTTG CAAGAGOACT AT GAAAG ATT 180 

ATGATQAACT TCTCAAATAT TATGAATTAC ATGAAACTAT TQGGACAGGT GGCTTTGCAA 240 

AGGTCAAACT TGCCTGCCAT ATCCTTACTQ GAGAGATGGX AGCTATAAAA ATCATGGATA 300 

AAAACACACT AGGGAGTOAT TTGCCCCGGA TCAAAAOOOA GATTQAGOCC TTGAAG AAOC 360 

70 TQAGACATCA GCATATATGT CAACTCTACC ATGTGCTAGA GACAQOCAAC AAAATATTCA 420 

TGGTTCTTQA GTACTGCCCT GGAGGAQAGC TOTTTGACTA TATAATTTCC CAGGATCGCC 480 

TGTCAGAAGA GGAGACCCGG GTTGTCTTCC GTCAGATAGT ATCTGCrGTT GCTTATGTGC 540 

ACAGCCAGGO CTATGCTCAC AGGGACCTCA AGCCAGAAAA TTTGCTGTTT QAT6AATATC 600 

ATAAATTAAA GCTGATTGAC TTTGGTCTCT GTOCAAAACC CAAGGGTAAC AAGGATTAOC 660 

75 ATCTACAGAC ATGCTGTGGG AGTCTGGCTT ATGCAGCACC TGAGTTAATA CAAGGCAAAT 720 

CATATCTTGQ ATCAGAGGCA GATGTTTGGA GCATGGGCAT ACTGTTATAT GTTCTTATOT 780 

GTOGATTTCT ACCATTTGAT GATGATAATG TAATGGCTTT ATACAAGAAG ATTATGAOAQ 840 

GAAAATAT6A TGTTCCCAAQ TGGCTCTCTC CCAGTAGCAT TCTGCTTCTT C AACA AATGC 900 

TXSCAGGTGGA CCCAAAQAAA OOOATTTCTA TGAAAAATCT ATTGAAOCAT CCCTGGATCA 960 

80 TGCAAGATTA CAACTATCCT GTTGAGTGQC AAAOCAAQAA TCCTTTTATT CACCTOQATG 1020 

ATGATTG03T AACAGAACTT TCT6TACATC ACAGAAACAA CAGQCAAACA ATGGAGGATT 1080 

TAATTTCACT GTGGCAGTAT GATCACCTCA CGGCTACCTA TCTTCTGCTT CTAGCCAAGA 1140 

AGGCTCGGGO AAAAOCAGTT CGTTTAAGOC m ' Ci ' l ' Ci ' l T CTCCTOTGGA CAAQCCAGTG 1200 

CTACCCCATT CACAQACATC AA6TCAAATA ATTGGAGTCT GGAAOATGTG ACCGCAAGTO 1260 
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ATAAAAATTA TGTGGCX3GC3A TTAATAGACT ATGATTGGTG TQAAGATGAT TTATCAACAG 1320 

OTGCTGCTAC TCCCCGAACA TCACAGTTTA CCAAGTACTG GACAQAATCA AATGGGGTOO 1380 

AATCTAAATC ATTAACTCCA GCCTTATGCA GAACACCTGC AAATAAATTA AAGAACAAAG 1440 

AAAATOTATA TACTCCTAAO TCTGCTGTAA AQAATGAAGA QTACTTTATO TTTCCT6AGC 1500 

CAAAOACTCC AGTTAATAAG AACCA6CATA AGAGAfiAAAT ACTCACTACO CCAAATCGTT 1560 

ACACTACACC CTCAAAAGCT AGAAACCAGT GCCTGAAAGA AACTCCAATT AAAATACCAQ 1620 

TAAATTCAAC AGGAACAGAC AAGTTAATGA CAGGTGTCAT TAGCCCTOAG AGQCGQTOCC 1680 

6CTCAGTGGA ATT6QATCTC AACCAAGCAC ATATGQAGOA GACTCCAAAA AGAAAGGGAG 1740 

CCAAAGTGTT TGGGAOOCTT GAAAG6GGQT TGGATAAGGT TATCACTGTG CTCACCA6GA 1800 

6CAAAAGGAA GGGTTCTGCC AGA6AGG66C CCAGAAGACT AMOCTTGAC TATAATQTGA 1860 

CTACAACTAG ATTAGTGAAT CX»GATCAAC TOTTGAATQA AATAATGTCT ATTCTTCCAA 1920 

AQAAGCATOT TOACTTrGTA CAAAAGGGTT ATACACTGAA QTGTCAAACA CAGTCAQATT 1980 

TTGGGAAAGT GACAATGCAA TTTQAATTAQ AAOTGTGCCA QCTTCAAAAA CCC3GATGTGQ 2040 

TGGGXATCAQ GAGOCAGOGO CTTAAGGGOQ ATGCCTGOGT TTACAAAAQA TTAGT GQAAG 2100 

ACATCCTATC TA6CTGCAAG GTATAATTGA TGGATTCTTC CATCCTOCOQ OATGAOrGTG 2160 

GGTGTGATAC AOOCTACATA AAGACTGTTA TGATOSCTTr GATTTTAAAQ TTCATTGSAA 2220 

CTACCAACTT 6TTTCTAAAQ AGCTATCTTA AQACCAATAT CiXri T lt S fTT TTAA ACAAAA 2280 

GATATTATTT TGTGTATGAA TCTAAATCAA GCCCATCTGT CATTATGTTA CTGTCTTTTT 2340 

TAATCATGT9 OTTTTOTATA TTAATAATTG TTQACTTTCT TAGATTCACT TCCATATOTO 2400 

AATGTAAGCT CTTAACTATO TCTCTTTOTA ATOTOTAATT TCmCTOAA ATAAAACCAT 2460 
7TGTGAATAT 

8eq ID NO: 506 Protein sequence 
Protein Accession #: NP_055606.1 

1 11 21 31 41 51 

I I I I I I ' 

HKDYDBLLICy YELHBTIQTG OFAKVKIiACH ILTGGMVAIK IMDKNTLGSD LPRIXTBIEA 60 

laOILRHQBIC QLYRVLETAN KIFKVLBVCP GGBLFDYIIS QDRIiSBBETR WFRQIVSAV 120 

AYVHSQOYAH RDLKPENLLP DBYHKLKLID PGLCAKPKGM KDYHLQTCOG SliAYAAPELI 180 

QGKSyiiGSEA DVWSMQILLY VU1CGFLPFD DDNVMALYKX IMRGKYDVPK WLSPSSILLIi 240 

QQMLQVDPKK RISMKNLLNH PWIMQDYNYP VEWQSKNPPI HLDDDCVTEXi SVHHRNNRQT 300 

MEDLISI«WQY DHLTATYLLL LAKKARGKPV RLRL8SP8CG QASATPFTDI KSNNWSLSDV 360 

TASDKMYVAG LIDYDWCBDD LSTGAATPRT 8QPTXYWTBS MGVBSKSLTP ALCRTPANXL 420 

KNKENVYTPK SAVKNSBYFM FPEPKTPVMK NQHKRBILTT PNRYTTPSKA RMQCLKETPI 480 

KIPVNSTGTD KLMTGVISPE RRCRSVELDL NQAHMEBTPK RKGAKVPGSL ERGLDKVZTV 540 

LTRSKRKGSA RDGPRRIiKLH YNVTTTRLVN PDQLLNBIMS ILPKKHVDFV QKGYTLKOQT 600 
QSDFGKVTMQ PELEVCXIIOK PDWGIRRQR LK6DAHVYKR LVEDXI*SSCK V 

Seq ID NO: . 507 PMA sequence 
Nucleic Acid Accession ffi NH_000S83 
Coding sequence : 88 . . 990 

1 11 31 31 41 51 

I I I I I I 

GCAGAQCACA GCATOGTCGG GACCAGACTC GTCTCAGGCC AOTTGCAQCC TTCTCAGCCA 60 

AAOGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAQ TGATTTGCTT TTGCCTCCTA 120 

GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAOCAG 180 

CTTTACAACA AATACCX^GA TGCTGXGQGC ACATGGCTAA ACCCIQACCC ATCTCAGAAG 240 

CAGAATCTCC TAGCCCCACA GACGCTTCCA AGTAAGTCGA ACGAAAGCCA TGACCACATG 300 

GATGATATGG ATGATGAAGA TGATGATGAC CATaTGOACA GCCAGQACTC CATTQACTCQ 360 

AAOQACTCTG ATQATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 

TCTGATGAAT CTGATQAACT GGTCACTGAT TTTCCCACX30 ACCTGCCAGC AAOO GAAS TT 480 

TTCACTCCaG TTGTOCCCAC AOTAOACACA lATOATGGCSC OAQOTGATAG TGrOGTTTAT 540 

GGACTGAGGT CAAAATCTAA QAAGTTTOGC AOACCTGACA TCCAOTAOCX: TGATOCTACA 600 

6A0GAGGACA TCACCTCACA CATGQAAAOC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 

CCCX3TTGCCC AGGACXTTGAA CGCX3CCTTCT GATTGGGACA GCCGTGCSGAA GGAGAGrTAT 720 

GAAACXSAGTC AGCTGGATGA CCAOAGrGCT GAAACCCACA GCCAC3U^CA GTCCAGATTA 780 

TATAAGOGGA AAGCCAATGA T6AGAGCAAT GAGCATTCCQ AT6TQATTGA TAGTCftGGAA 840 

CTTTCCAAAG TCAGOCGTGA ATTCCACAGC CKCQMiTm ACAOOCATGA AGATATGCTG 900 

6TTGTAQACC CCAAAAGTAA GOAAGAAGAT AAACACCTGA AATTTOtSTAT TTCTCATGAA 960 

TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020 

ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAAT0AAA6 AGAACATGAA ATG CTTC TTT 1080 

CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAOTCTGGA AATAACTAAT 0TOTTT6ATA 1140 

ATTAGTTTAG TTTOTGGCTT CATQQAAACT CCCTGTAAAC TAAAAOCTTC A6GGTTATGT 1200 

CTATGTTCAT TCTATAGAAG AAATGC3UUVC TATCACTOTA TTTTAATATT TGTTATTCTC 1260 

TCATQAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGA AT 1320 

ATAACATTTT ATOTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTOTTOTGAT 1380 

TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TQglGOTG TC 1440 

AATTGCTTAT TTGTTTTCCC ACGGTTGTCX: AOCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 

Seq IS NO: 508 Protein sequence 
Protein Accession fti NP_000573 

1 11 21 31 41 51 

I 1 I I I I 

MRIAVICPCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWmPDPSQ KQNUiAPQTIi 60 
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVXmT DDSHQSDESH HSDBSDBLVT 120 
DFPTDLPATB VFTPWPTVD TYDGRGDSW YGLRSKSKKF RRFDXQYPDA TDEDITSHME 180 
SEEUIGAYKA ZPVAQDU7AP 8DUDSRGKDS YETSQXiDDQ9 ABTHSBKQSR LYKRXANDES 240 
HEKSDVID8Q BLSKVSRBPH 8HEFH8KSDH LVVDPK6KBB DKSLKFRI8H BLDSASSBVN 
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Seq ID NOi 509 DMA sequence 
Nucleic Acid Accession 6: AB051390.1 
Coding sequencet 34.. 2457 

5 1 11 21 31 41 51 

I I i I I I 

AGO0GCO60Q GCACAAAGTT GG0GQC060G AAGATGAGGC TGTCCCCGGC GCCCCTGAAG 60 
CTGAGCCGGA CTCCGGCACT GCTGGCCCTG GQGCTGCCCC TGGCCGCGGC GCTOGCCTTC 120 
^ ^ TCCGACOAaA CXTCTGOACAA AGTGCCCAAG TCAGAGGGCT ACTGCAGCCG TATCCTGCGC 180 
10 GCCCAGGGCA CGCGGCX5CGA GGGCTACACC GAGTTCAGCC TCCGCGTGGA GGGOGACXXC 240 
QACTTCTACA AGCGG6GAAC CAGCTAC06C GTAACACTTT CAGCTGCTCC TCCCTCCTAC 300 
TTCAOAaaAT TCACATTAAT TGCCCTC3VGA GAGAACAGA6 AGGGTGATAA GGAAGAAQAC 360 
CATGCTGGGA CCTTCCAGAT CATAGAOGAA GAAGAAACTC AGTTTATQAG CAATTGCCCT 420 
GTTOCAGTCA CXQAAAQCAC TCCAOGGAGG AGGACCOGGA TCCAGGTGTT TTGGATAGCA 480 
15 CCACCAQOGQ QAACaGGCTG CGTGATTCTG AAGGCCAGCA TOGTACAAAA ACG CATTA TT 540 
TATTTTCAAG ATQAGGOCTC TCTGACCAAG AAACTTTOTG AACAAGATTC CACATTTGAT 600 
GQOOTOACTO ACAAACCCAT CTTAGACT6C TGT6CCTGC0 GAACT6CCAA GTACAGACTC 660 
AOVTTTTATG GGAATTGQTC OGAGAAQACA CACCCAAAGQ ATTACCCTC3Q TCGGGCCAAC 720 
CACTQGTCTQ OQATCATCOG AGGATCCCAC TCCAAGAATT ATGTACTOTQ GGAATATGQA 780 
20 GGATAT6CCA GCX3AA66GGT CAAACAAGTT GGA6AATTGG GCTCACXXXTT GAAAATGGAG 640 
GAAGAAATTC GACAACAQAO TQATGAGOTC CTCAOCGTCA TCAAAOCCAA AGCCCAATGG 900 
0C3\GCCTGGC AGCCTCTCAA CJOTOAQAGCA GCACCTTCAa CTOAATTTTC CGTGGACAGA 960 
AOGOGCCAtT TAATQTCCTT CCTQACCATG ATGGGCCCTA QTCCCGACTG GAACOTAGGC 1020 
TTATCTGCAG AAGATCTGTG CACCAAGGAA T6TGGCTGGG TCCAGAAGGT GQTGCAAGAC 1080 
25 CTGATTCCCT GGGACGCTGQ CACCGACAGC GGGGTGACCT ATGAGTCACC CAACA AACC C 1140 
ACCATTGCCC AOGAGAAAAT COOGCCCCTO ACCAGCCTQG AOCATCCICA GAGTCCTTTC 1200 
TATGACCCAG AGGGTGGOTC CATCACTCAA GTAGCCAOAQ TTGTCATGQA GAGAATOQCA 1260 
CGGAAGQGTO AACAATGCAA TATTOTACXTT QACAATGTCG ATGATATTGT AGCTQACCTO 1320 
GCrCCAGAAG AGAAAGATGA AGATGACACC CCTQAAACCT GCATCTACTC CAACTGOTCC 1380 
30 CCATGGTCCG CCTGCA6CTC CTCCACCTOT QACAAAGGCA AGAQGATQCG ACAQOGCATG 1440 
CTGAAAGCAC AGCTG6ACCT CAQCGTCGCX: TGCCCTGACA CCX»GGACTT GCAOCGCTGC 1500 
ATGGGCCCTG GCTGCAGTGA CGAAGACQOC TCCACCTGCA CCATQTC08A OTGGATCACC 1560 
TGGTOGCCCT GCAGCATCTC CTGCGGCATG GGCATGAGGT CCCX3GGAGAG GTATGTGAAG 1620 
CAGTTCCCGG AGGAOGGCTC CX5TGTGCACG CTGCCCACTG AGGAAAOGOA GAAGTGCACX3 1680 
35 GTCAAOGAGO AGT6CTCTCC CAGCAGCTOC CTGATGAC06 AGTGGGGGGA GTGGGACGAG 1740 
TGCAQOSCCA CCTGGGGCAT GGGCAT6AA0 AA6GQ6CACC GCATGATCAA GATGAACCCC 1800 
GCAQATGGCT CCATGTGCAA AGOOGAGACA TCACAGGCAO AGAAGTOCAT GATGCCAGAG 1860 
TGCCACACCA TCCCATGCTT GCTGTCCCCA TGGTCOSAGT GGAGTQACTG CAGCGTGACC 1920 
TGCGGQAAGG GCATGOGAAC CCGACAGG6G ATGCTCAAGT CTCrGGCAGA ACTTGGAGAC 1980 
40 TGCAATGAGO ATCTGGAGCA GGTGGAGAAQ TGCATGCTCC CTGAATGCCC CATTGACTGT 2040 
GAGCTCAOCO AGTGGTCCCA GTGGTCX^GAA TGTAACAAOT CATGTGGQAA AGGCCAOGTG 2100 
ATTCX3AACCC GGATGATCCA AArTGGAGCCT CA6TTTGGA0 GTQCACCCTO CCCAGAGACT 2160 
GTGCAGCGAA AAAAGTGCCG CATCOSAAAA TGCCTTOGAA ATCCATCCAT CCAAAAGCTA 2220 
CGCTGGAGGQ AGGCCCGAGA GAGCCGGCGG AGTGAGCAOC TQAAGOAAGA GTCTGAAGGG 2280 
45 GAGCAGTTCC CAGGTTGTAG GATGCGCCCA TGGACOOCCT GGTCAOAATG CACCAAACTG 2340 
TGGGGAGGTG GAATTCAGGA ACGTTACATG ACTGTAAAGA AGAGATTCAA AAGCTGCCAO 2400 
TTTACCAGCT GCAAAGACAA GAAOGAQATC AGAGCATGCA ATOTTCATCC TTGTTAGCAA 2460 
GGGTACGAGT TCCCCAGGGC TGCACTCTAG ATTCCAQAGT CACCAATGGC TGGATTATTT 2520 
GCTTGTTTAA GACAATTTAA ATTGTGTACQ CTAGTTTTCA TTTTTGCAGT GTGGTTCGCC 2580 
50 CAGTAGTCTT GTGGATGCCA GAGACATCCT TTCTGAATAC TTCTTGATGG GTACAG GCTG 2640 
AGTGGGQG6C CCTGAGCTOC AGGCAGCCTC TTCCTGCAGA 6GA0TA0TGT CAOCCA CCTT 2700 
QTACTAAOCT GAAACATOTC OCTCTQGAOC TTCCACCTGG CCAGGQAOGA OGGAGACTTT 2760 
GACCTACTCC ACATGGAGAO GCAACCATGT CTGGAAGTGA CTATGCCIGA OTCCCAGGGT 2820 
GOaGCAGGTA GQAAACATTC ACAGATGAAG ACAGCAGATT CCCCACATTC TC ATCTTTGG 2880 
55 CCTGTTCAAT GAAACCATTG TTTGCCCATC TCTTCTTAGT GGAACTTTAG GTCTCTTTTC 2940 
AAGTCTCCTC AGTCATGAAT AGTTCCIGGQ GAAAAAGAOA OCTGQTAGAC TTGAAGAGGA 3000 
6CATTGATGT TGGGTGOCTT TTOTTCTTTC ACTGAGAAAT TOSGAATACA TTTGTCTCAC 3060 
CCCTGATATT GGTTCCTGAT GCCCCCCCAA CAAAAATAAA TAAATAAATT ATGGCTGCTT 3120 
TATTTAAATA TAAOGTAGCT AGTTTTTACA CCTGAOATAA ATAATAAGCT TAGAGTGTAT 3180 
60 TTTTCCCj^ CTTTTGGGGG TTCAGAGGAO TATGTACAAT TCTTCTaGGA AGCCAGCCTT 3240 
CTQAACTTTT TGGTACTAAA TCCTTATTOG AACCAAQACA AAOGAAGCAA AATTGGTCTC 3300 
TTTA6A6ACC AATTTGCCTA AATTTTAAAA TCTTCCTACA CACATCTA6A OGTTCAAGTT 3360 
TGCAAATCAG TTTTTAGCAA GAAAACATTT TTGCTATACA AACATTTT6C TAAGTCTGCC 3420 
CAAAGCCCCX: CCAATGCATT CCTTCAACAA AATACAATCT CTGTACTTTA AAGTTATTTT 3480 
65 AGTCATQAAA TTTTATATGC AGAGAGAAAA AGTrACXJOAG ACAGAAAACA AA TCTA AGGG 3540 
AAAGGAATAT TATGGGATTA AGCIGAGCAA GCAATTCTGG TGGAAAGTCA AACCTGTCAG 3600 
TGCTCCACAC CAGGGCTGTG GTOCTGCCAG ACATGCATAO GAATGOCCAC AGOTTTACAC 3660 
TGCCTTCCCA QCAATTATAA GCACACCA6A TTCAGGGAGA CTGACCACCA AGGGATAGTG 3720 
TAAAAGQACA TTTTCTCAGT TGGGTCCATC AGCAOTTTTT CTTCCTGCAT TTATTGTTGA 3780 
70 AAACTATTGT TTCATTTCTT CTTTTATAGG CCTTATTACT OCTTAATCCA AATGXGTACC 3840 
ATTGGTGAGA CACATACAAT GCTCTGAATA CACTAOGAAT TTGTATTAAA CACATCA6AA 3900 
TATTTCCAAA TACAACATAQ TATAGTCCTG AATATGTACT TTTAACACAA GAGAGACTAT 3960 
TCAATAAAAA CTCACTGGOT CTTTCATGTC TTTAAGCTAA GTAAGTGTTC AGAAG GTTCT 4020 
TTTTTATATT GTCCTCCACC TCCATCATTT TCAATAAAAC ATAGGGCTTT TGCTCCCTTG 4080 
75 TTCrTGGAGQ GACCATTATT ACATCTCTGA ACTACXnTTG TATCCAACAT GTTTTAAATC 4140 
CTTAAATGAA TTGCTTTCTC CCAAAAAAAG CACAATATAA AGAA ACACAA GAT TTAAT TA 4200 
TTPTTCTACT TGGOGOGAAA AAAGTCCTCA TGTAGAAGCA CCCACTTTTO CAATOTT6TT 4260 
CTAAGCTATC TATCTAACTC TCAGCCCATG ATAAAOTTCC TTAAGCTGGT QATT CCTA AT 4320 
CAAGGACAAQ CCACCCTAOT GTCTCATGTT TGTATTTGGT CCCAOTTGGG TACATTTTAA 4380 
80 AATCCTGATT TTGGAGACTT AAAACCAGGT TAATGGCTAA QAATGGGTAA CA TGACTC TT 4440 
GTTGGATTGT TATTTTTTGT TTGCAATGGG GAATTTATAA 6AA0CATCAA GTCTCTTrCT 4500 
TACOUWGTC TTGTTAGGTQ GTTTATAGTT CTTTTG6CTA ACAAATCATT TTGGAAATAA 4560 
AGATTTTTTA CTACAAAAAT G 
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Seq ID NOt SlO Protein sequence 
Protein Accession «t BAB18461.1 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



1 
I 

MRLSPAPUOf 
FSUIVBGDPD 
BTQFMSNCPV 
liCBQOSTPDQ 
KNYVLWEYGG 
PSABFSVDRT 
VTYESPNKPT 
NVDDIVADLA 
PDTQDFQPCM 
PTBBTBKCTV 
QAEKCMMPCC 
KLPECPIDCE 
LRMPSZQKLR 
VKKRFKSSQP 



IX 

I 

SRTPALLAIA 
PYKPGTSYRV 
AVTESTPRRR 
VTDKPIUXr 
YASBGVXQVA 
RHLNSFLTMM 
IPQBKIRPLT 
PBBXDEDDTP 



MBBCSPSSCL 
HTIPCLLSPH 
LTEWSQWSEC 
HREARESRR8 
T&CKDKKBIR 



21 
I 

LPLAAALAF8 
TLiSAAPPSYP 
TRIQVFWIAP 
AOGTAKYRLT 
BLGSPVKHBB 
6PSPDHHVGL 
SU3HPQSPFY 
BTCIYSNWSP 
TCTMSEWITW 
MTSNtSGNDEC 
SEHSDCSVTC 
NKSCX5KGHVI 



31 
I 

DETLDKVPKS 
RGFTLIALRE 
PAOTGCVIUC 
FY6KWSEKTH 
BIRQQSDBVL 
SAEDLCTKEC 
DPBGGSZTQV 
HSACSSSTCD 
SPGSISGQffi 
SATOOtGNKK 
GK04RTRQRM 
RTRNIQMEPQ 
QFPGCRMRPH 



41 

I 

BGYCSRILRA 
NREGDKEBDH 
ASIVQKRIIY 
PXDYFRRANB 
TVIKAKAQWP 
GHVQKWQDL 
ARWISRIAR 
KGKRMRQRML 
MR8RSRYVKQ 
RKRNintNPA 
LK8LAELGDC 
FGOAPCPBTV 
TAWSECTKLC 



SI 
I 

QGTRREGYTB 
AGTFQIIDBB 
FQDBGSLTKK 
HSAXZGGSHS 
AWQPUIVRAA 
IPNOAGTDSG 
KOEQQIZVPD 
KAQLDLSVPC 
FPEDGSVCTL 
DGSMCRAET8 
NBDLEQVEKC 
QRKKCRIRKC 
GGGIQBRYMT 



ACaiVHPC 



Seq ID NO: Sll DNA eequence 

Nucleic Acid Accession ft: MM_00310B.l 

Coding sequence I 76.. 1401 



51 



60 
120 
IBO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



1 11 21 31 41 

I I I i I I 

GGG0T6GGAG G6GGAGGGGG AGCTCC6CAC GA6ACCCA6C G6CCCQGQTT QGAGCGTCCA 
GOOCIGCAAC GOATCATGGT GCAGCAG60Q GAQACCTIGG AAGOGGAGAO CAAGCT6CGC 
GGG6AG606C TGGACACX3GA GGAGG6C3GAA TTCAT6GCTT GCAGCXXX3OT G0CCCT6QAC 
GAGAGCGACX: CAQACTGGTG CAAGACX3GCX3 TCQGGCCACA TCAA0t3GGCX: GATGAACGCG 
TTCATOGTAT QGTCCAAQAT CGAAOGCAGG AAGATCATGQ AGCAGTCTCC GQACATGCAC 
AAOOOOGAGA TCTCCAAGAQ QCTGG6CAAG OGCTGOAAAA TGCTGAAGOA CAGC6AGAAG 
ATCCOCTTCA TCOOOGAOGC OGAOCS O CTG 03GCTCAAGC ACATQGCGGA CTACXXOGAC 
TACAAGTACC G6CCCC3GGAA AAAGCCCAAA AT6GACCCCT OSGGCAAGCC CA60GCCA0C 
CAGAGCCCAG AGAAQAOCQC G0CCX3G0GGC GGOQGCGGGA GCX5CGGG0GG AGGCGCGQGC 
GGTGCCAAGA CCTCCAAGGQ CTCCAGCAAG AAATGOGQCA AGCTCAAGGC CCCOGOGGCC 
GG66G0QCCA AGGGGG30SC GQ6CAAGG06 GCCCAOTCOQ G6GACTA06G GGG06066GC 
GAOQACTAOQ TGCTGGGCAa OCTGOGOSXG AGOQGCTCOG GCXX30GG0GG GGCOGGCAAG 
AC366TCAAOT GOGTCmCT GGAT6AG6AC GAOQAOQAGG AO6AO6A00A GGAC6AGCTG 
CAGCTGCAGA TCAAACAGGA GC0GGACGA6 GAG6A0GAGG AACCACOGCA CCAGCAGCTC 
CTQCAGCCGC CQGGGCAGCA GCOGTCGCAG CTGCTGAGAC GCTACAAOOT CQCCAAAOTG 
CXXXKXA6CC CTAOSCTGAa CAOCTCGOGG GAGTCCCCOG AGGGAGOGAO CCTCTAOGAC 
GAGGTGC6GG C06G0G0GAC CTOQGGOGGC GGGG60GGCA GCGGCCTCTA CTACAGCTTC 
AAGAACATCA CCAAGCAGCA CGCGCCX3GCG CTOQ06CM3C C0G06GTGTC GCCOSCGTCC 
TCGCaCTOQQ TOTCCACCTC CTCGTCCAGC AGCAGOGGCA GCAGCAOCGG CAGCAGCGGC 
GAGGAOGCOQ AOGACCTGAT GTTOGACCTQ AGCTTGAATT TCTCTCAAAG OGOOCACAGC 
GCCA8CGAGC AGCAGCTGGG GG60G6CGCX3 G0GGC0G6GA ACCTOTCCCT GT0GCTGGT6 
GATAAGGATT TGGATT06TT CAGOGAGGGC AGCCTOGGCT CCCACTTCQA GTTCCGCGAC 
TACTGCAGGC CGGAGCTGAG CGAGATGATC GOGGGGGACT GGCT6QA6GC GAACTTCTCC 
GACCTOGTGT TCACATATTG AAAGGOGCXX: GCTGCTCGCT CTTTCTCTOQ GAGGQTGCAG 
AGCTGGQTTC CTTGGGAGGA AGTTGTAGTG OTGATOATGA TGATOATGAT AATGATGATG 
ATQATGGTGG TGTTGATGGT GGOGGTGGTA G6GTGGAGGG GAGAGAAQAA GATGCTGATG 
ATATTGATAA GATGTCGTGA G6CAAAGAAA TTGGAAAACA TQATGAAAAT TTTGOTGGAa 
TTAAAGTQAA ATGAGTAOTT TTTAAACATT TTTCCTGTCC TTTTTTTGTC CCCCCTCOCT 
TCCTTTATCG TGTCTCAAGQ TAGTTGCATA CCTAGTCTGO AGTTGTQATT ATTTTCCCAA 
AAAATGTGTT TTTGTAATTA CTATTTCTTT TTCCTGAAAT TCGTGATTGC AACAAAGGCA 
GA0GGG60GG CGOOGCGGAG GGGAG6TA0G ACCC6CTCC0 GAAG60GCTG TTT6AAQCTT 
GTOGGTC T TT GAAGTCTG6A AGAOQTCrOC AOAOGACSOCT TTTGGCAaCA CAACT GTTAC 
TCTAGGGAGT TGGTGGA6AT ATTTTTTTTT CTTAAGAGAA CTTAAAGAAC TGGTGATTTT 
TTTTTAACAA AAAAAGGG 

Seq ID NOt 512 Protein sequence 
Protein Accession «i NP_003099.1 



1 

I 

MVQQAESLEA 
KIERRKIMEQ 
RKKPKMDPSA 
GAGKAAQS(3) 



A7SGAGGGSR 
U1FDLSLNFS 
LSENIAGDHL 



11 
I 

BSNLFREAIiD 
SPDMHNAEIS 
KPSASQ8PEK 
YGGAGDDYVL 
PHQQIiLQPPG 
LYYSPKNITK 
QSAHSASEQQ 
BANPSDLVFT 



21 
I 

TEEGBFMACS 
KRLGRRWKML 
SAAG6GGG8A 
GSI^RVSGSGG 
QQPSQLLRRY 
QHPPPLAQPA 
LGGGAAAGMI* 
Y 



31 
I 

PVALDBSDPD 
KDSEKIPFXR 
GGGA66AXTS 
GGAOKTVKCV 
NVAKVPASPT 
LSPASSRSVS 
8LSLVDKDLD 



41 

I 

HCKTASGHZK 
BAERLRLKHM 
KGSSKKCGKL 
FliDEDDDDDD 
LSSSAESPBG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1360 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 



51 
I 

RPMMAFMVHS 
ADYPDYKYRP 
KAPAAAGAKA 

DDDELQLQIK 
ASLYDEVRAG 



SFGEaSXXISH FEFPDYCTPB 



60 
120 
180 
240 
300 
360 
430 



Seq ID NO I 513 DNA sequence 

Nucleic Acid Accession 8 s CAT Cluster 

1 11 21 31 41 51 

11)111 
GGTCQACCTA AATCTGATAA CTGGCTTATT ATOTAATTTA TTGQTGTTAT TATAGTAGAQ 
ATTGGTAATC TACAOTAAGA TTTTCAGTTA GGATTTGAGA TTATGATAAT AACTAATAGA 
ATATTTCTAA ATTGQAATTA OAAGATTGTT OTATGAGAGA GAGTCAOQAC TTQCCATTTG 



60 
120 
180 



1152 



wo 03/042661 



0CAAACATC3V AAGTCATTGT TTGGTGTGTA ATAGTACAAA ATCATCTTCC TTAACAQAGA 240 

AAGGATATCT GTTGCTCCOG AATGAAACAA TTTTTCTGAA ATAOAGGGCC CAGAATTG6T 300 

CTCTGAC3UIT TAATAAAOAC ATCAAAGATA GCAAAATGAT TTTTATATCT TAGGGCCAAT 360 

ACTACCAATT TAATAATTAA AACAAGTTCT QOTCSAGCTCT GAACTTGQCA GAATTGQTGG 420 

CAACATAQAC TTTGGATTTT CCAAATTCCC CACATAAAAC AAAGGGGATC AACTAGATAG 480 
AAAAACCAGA AACCTTT6GA AATATCTGTT TAAAAAAAAA AAAAAGTC6A 0GCX30GCC 

Seq ID KO: 514 PMA sequence 

Nucleic Acid Accession ftt CAT cluster 



1 11 21 31 41 51 

I I I I I I 

GGAGOCACAa T6AAAGTCAA GAATOTOUST GATTOCACAT TTAATATCTA CATTTTTGCA 60 

OGOCAOTTAC TCTTTTGTAO TATAACATTO A6CTQATA6C AGATAGTGTA GACAAGT6AA 120 

TACAGGATTC TCTGGGTTGT ATTCCCAGAA GTCTGQAGGT CATTTCX3ATA TTTOTOGOCC 180 

CTTGGCTTCA CTCTGACTTQ TOTOACACAT AAAAATTGTG ATGAAATQTC CTATAGATGT 240 

CCTGCAGGTC TTAAAAGAAC CTTTCCAAAC TAT6AAACAG CCCAGCA6CA CTQAOTTAOA 300 

GGXAAATTCT GAAOOCrrOA ACACTAAAAC TATTCTAACT QCACATAOAA TTGQCAAOTA 360 

GCATTCTATO TCTATQAACA OTATOTCTTT TCTATATAAC AOAGAAAATC TTTTTAAOCA 420 

AACTACTCAQ TTTAAAACCT AATTCTTCTC ATAATCTCAQ TACTTTTGAA TOAAGACATA 480 

TCAATOCAAC AGTACACTCT TATTCAQOCA TTTGAAAQAA AGAATTCGAQ ATCTAGTTTG 540 

TATC310ATAT TATAAATTAQ TATGGTTTAO TCTTTGTCAT GAAATTCTAC TTAATTTTTO 600 

GACTATA6OT TTAAGAATOT AAOCAGAAGT TCTOCACXAA TCAOAATAAG CTACATTATO 660 

CTTQA0T6AC AACTACTGTA AT6ACAAAAT ATCAGTG6CT TAATACAATQ GTTTTTCTCT 720 

CATACTTGTT CATAAAQAGT CAGCAAGOAC CCTGCTCATT ATGGTCCCTC AGGGACCCAG 780 

G6TTQTTGGA AQCTCCACCA TTTTAGATAQ CTCCCTTCAA AGTCAGCCAT CTTTGGCAGT 840 

CCATGTCGCC CAACAGGCT6 GCAAAATTTG GCTCTGQATG 6CTTCAAGGA TTOAGCATCG 900 

GGCAGTTTAA AT6CTTTCAA CATGOAAAGT GGACACOQGC CACTCOCACT CACATOCCTT 960 

GG6CCA6AAC TAG6TCACTQ GGCCOGGAOC TAACTTOGQA OGGTTGGGGA ATTGTAATTC 1020 

CrCCATGTAC CCAAAGTGQA GAQAAOCCAO ATACTGAGAA ACATCAATAA T66CTAACAO 1080 

AAATCCATTC TACCATTCCC TTrGCCTAAA GTQAAAAQAT GAGTACTTTC ATCAATTTOT 1140 

AAACTGTACT TTTGAAOTAA ATCCTGQTAG CTT6CA1QG0 GOCTGGATTT CCAGAAAGCC 1200 

ATATGTAATT TGGGAATGAC ATTCACTTAA GCTCATAGAA TATCATTATT TGATGTAAAA 1260 

TGGCCTCATT TGCAATACAG GACCAAAATG CACTAA0C31C AAAACCCCCC TCCCCAOGGG 1320 

GCCOGGGQTC CCTATTCCOC TCCATCCCTT TAAATGAGGC ATTCTATQAT TTGGAATGGA 1380 

AGCCCAGTTQ TAGTCGTAAG AATTTTACTT AATTCAAGAA TTATTCTCAC TGAATATGTG 1440 

CCAGTTCTGA AAGGAATGCA AAOTCAAATT TTGCATCTTC TTTGCTCAAG GGCCTTTAGA 1500 

TOTAACAACA CAGACATGAT ACAAGGCTGA CAATGACATT AT6ATTTAAA TATGTTAAAC 1560 

AACTTATTAA ATTGTQAATC AACAAAAAAT TATGTTCTTT ATTTTATGGT TTT6CATAGT 1620 

CCTGACTCaC TOCCTACATA OOOCTCTTOT TCCTCAGTTC TTATOOCTGA TTTCTTACAG 1680 

GATGGCXTTAA GACAGCTGTA GATGTTTTTA TTTAGCAAAA AAAAAAAAAA AAAAGTGQAC 1740 
GCX36C06CX3A ATTTAGTAG 

8eq ZD NOt 515 DNA sequence 
Nucleic Acid Accession NM_012427 
Coding sequence! 43.. 924 

1 11 21 31 41 51 

I I I I I I 

CTTGTGGTTC CTCTCTACTT GGG6AAATCA GGTGCAGOGG CCATGGCTAC AGCAAQACCC 60 

CCCTGGATQT GGGTGCTCTQ TGCTCTQATC ACAQCXTTTGC TTCTGGGGGT CACAGAGCAT 120 

GTTCTCGCCa ACaATGATGT TTCCTGTQAC CACCCCTCTA ACACCGTGCC CTCTGGQAGC 180 

AACCAGGAGC TGGGAGCTGG GOOCGGGtSAA GAOGOCCGGT OGGATOACAG CAGGAGCGGC 240 

ATCATCAATO GATCCiGACTG OGATATGCAC ACGCAGCX3QT GGGAGGOCX3C GCTGTTGCTA 300 

AGGCCCAACC AGCTCTACTG CGQGGOGGTG TTGGTOCATC CACAGTOGCT GCTCACQGCC 360 

GCCCACTGCA GGAAOAAAGT TTTCAGAOTC OGTCTCGGCC ACTACTCCCT GTCACCAGTT 420 

TATQAATCTG GGCAGCA6AT GTTCCAGGGG GTCAAATCCA TCCCCCACCC TGGCTACTCC 480 

GACOCIOGCC ACTCTAAOQA GCTCATGCTC ATGAAACTOA ACA6AAGAAT T06TGCX3U:T 540 

AAAGATGTCA GAGCCATCAA 06TCTCCTCT CATTGTOCCT CTGCTGGGAC AAAGTGCTTO 600 

GTGTCTGGCT GGGGGACAAC C3UW3AGCOCC CAAGTOCACT TCCCTAAGGT CCTCCAGTGC 660 

TTGAATATCA GCGTGCTAAG TCAGAAAAGQ TGOGAGGATG CTTACCCGAQ ACAGATAGAT 720 

GACACCATOT TCIGCGCOGG TQACAAAGCA GGXAGAGACT CCTGCCAGGG TGATTCTGGG 760 

GGG0CT9TOG TCTGCAATGG CTGOCTGCAG GGACTGGTOr CCrGGGOAOA TTACCCTTGT 840 

6CC0GGC0CA ACAQACOGGG TGTCTACA08 AACCTCTGCA AGTTCACCAA OTGOATCCAG 900 

GAAACCATOC AGGCCAACTC CTGAGTCATC CCAGGACTC!A GCACACOGGC AT CCCC ACCT 960 

GCTGCAGGGA CAGCXXTTGAC ACTCCTTTCA GACCCTCATT CCTTCCCAGA GATQTTGAOA 1020 

ATGTTCATCT CTOCAGCCCC TOACXXCATO TCTCCXGQAC TCAGGGTCTG CTTCCCCCAC 1080 

ATTGGGCTGA COBT O TCTCT CTAOTTGAAC OCTGQGAACA ATTTOCAAAA CTOTCCASOT 1140 

GG66GGTTGC GTCTCAATCT CCCTGGGGCA CTTTCATCCT CAAOGTCAGG GCGCATOOCT 1200 
TCTCTGCAGC TCTGAOCCAA ATTTAOTCCC AQAAATAAAC TGAGAAGTGG AAAAAAAAAA 

Seq ID BO: 516 Protein sequence 
Protein Accession tfi NP_036559 

1 11 21 31 41 51 

i i I I I 1 

MATARPPWMW VLCALITALL LQVTBUVUW MDVSCDHPSN TVPSGSKQDL GAGAGBDARS 60 
DDSSSRIING SDCDMHTQPW QAALLLRFNQ LYGGAVLVHP QffLLTAAHCR XKVFRVRLGH 120 
YSLSPVYBSO QQMFQGVKSZ PHPGYSHPGH SNDLMLIKLH RRIRPTKDVR PINV8SHCPS 180 
AGTKCLVS6W GTTICSPQVHP PKVLQO^S VLSQXRCBDA YPRQIDDTMF GAGDKAGRDS 240 
OQGDSGGPW OIGSLQGLVS NGDYPCARPII RPOVYTNI^ FTKHIQBTZQ AN8 



1153 
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PCTAJS02/36810 



Seq ID NO: 517 DMA sequence 
Nucleic Acid Accession #: KM_001719 
Coding sequence I 123.. 1418 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



1 

I 

GGGCGO^GCO 
CTGCCACCTO 
OGATGGAOGT 

GCTTCATCCA 
CCATTTTGGG 
CCATGTTGAT 
GCCAGGGCTT 
GCCTGQVAGA 
TGGAACATGA 
TTTCCAAQAT 
ACATCOQOQA 
AGCACTTG60 
AGGAGG6CZO 
GGCACAACCT 
AGTTGG06GG 
TCTTCAA60C 
GCCAGAAOCO 
AGAACAGCAO 
GAGACCTGGG 
AGGGGQAOTG 
AGAOGCTGOT 
AGCTCAATGC 
ACAQAAACAT 
TTGGGGCCAA 
CTaCCTTTTQ 
AAACATGAGC 
TOCTACAAGC 
GCOGGGGCAG 
TTATGAGCGC 
GGGCACATTO 
CAATAAAACG 



11 
I 

GGGCCOQTCT 
GGGOGGTGOG 
GC6CTCACT0 
GCTGOGCTCC 
CCGGCGCCTC 
CTTGCCCCAC 
GCTGGACCTQ 
CTCCTACCCC 
TAGCCATTTC 
CAASGAATTC 
CCCAGAAGGG 
AOQCTTGQAC 
CAGGGAATOQ 
GCTGGTGTTT 
GGGCCTGCA6 
CCTGATTGGG 
OVGGQAGOTC 
CTOCAAGAOQ 
CAGOQACCAO 
CTGGCAGGAC 
TGCCTTCCCT 
CCACTTCATC 
CATCTCCGTC 
GGTG6TC06G 
GTTTTTCTGQ 
TGAQACCTTC 
AGCATAT6GC 
TOTOGAGGCA 
GTCATTGGCT 
CTACCAGCCA 
GTQTCTGTQC 
AATGAATG 



21 

I 

GCAGCAAGTG 
GGOCOG6AGC 
CQAOCIGCGG 
GCCCT0GCC6 
06CAGCCAGG 
CX5CCOGOGCC 
TACAACX3CCA 
TACAAGGCOS 
CTCAO0GACX3 
TTCCACCCAC 
GAAGCTGTCA 
AATGAGACGT 
GATCTCTTOC 
GACAT CACAO 
CTCTCGGTOG 
OGGCACGGGC 
CACTTCGGCA 
GOCAAGAAOC 
AGGCAGGCGT 
TGGATCATCG 
CTGAACTCCT 
AACCCXX3AAA 
CTCTACTTOO 
GCCTGTGGCT 
ATCCTCXyVTT 
CCCTCXXTTAT 
TTTTGATCAG 
AAACCTAGCA 
GGGAAGTCTC 
GGCCAGCCAG 
QAAAGGAAAA 



31 
I 

ACOGACGGCC 
COGGAOCCOO 
OGCOQCACAO 
ACTTCAQGCT 
AG0GGCG6QA 
0GC31CCTCCA 
TQGOGGTGQA 
TCTTCAGTAC 
COGACATGGT 
GCTACCACCA 
CGGCAGCCGA 
TGOQGATCAO 
T6CT0GACAG 
CCACCAGCAA 
AGAOQCTGGA 
CCCAQAACAA 
aCATCCX3QTC 
AGGAAGCGCT 
GTAA6AAGCA 
OGCCTGAAGG 
ACATGAAOQC 
G6GTGCCCAA 
ATGACAGCTC 
GCCACTA6CT 
OCTCGCCTTG 
CCCCAACTTT 
TTTTTCAGTG 
GGAAAAAAAA 
A6CCAT6CAC 
COGTOGGAGG 
TT6A0C0GGA 



41 

1 

GGGACGGCCG 
GGTAGCGOGT 
CTTG6TG6C6 
GGACAAGGA6 
GATGCAGC6C 
GGGCAAGCAC 
GGAGGOOSGC 
OCAGGGOCCC 
CATQAGCTTC 
TCaAQAGTTC 
ATTCOGQATC 
CGTTTATCAG 
OOQTACCCTC 
CCACTGGOTO 
TGGGCAGAGC 
GCAGCCCTTC 
CACGGG6AGC 
G06GATGG0C 
CQA6CTGTAT 
CTACGCOGCC 
CAOCAACXAC 
GGCCT6CTGT 
CAAGQTCATC 
CCTCOQAGAA 
GCCAQGAACC 
AAAGGTGTGA 
GCAGCATCCA 
ACAAOGCATA 
GGACTOQTTT 
AAGGGQGGGT 
AGTTCCTGTA 



51 

1 

CCTGCCCOCT 
AGAGCOGGCO 
CTCTGGGCAC 
GTGCACTCX3A 
GAGATCCTCT 
AACTOSQCAC 
GGGCCCGGOG 
CC T CTGOCCA 
OTCAACCTOG 
CGGTTTGATC 
TACAAGGACT 
6TGCTCCAGG 
TQ660CT0Q0 
QTCAATCOGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACAGGGCA 
AA08IG0CAG 
GTCAGCTTCC 
TACTACTOTG 
GCCATCGTGC 
GCGCCCAOGC 
CIGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAGTATTAGG 
ATGAACAAQA 
AAGAAAAAT6 
CCAGAGGTAA 
G6CAAG0GGT 
ATAAATQTCA 



8eq ID MOi 518 Protein sequence 
Protein Accession #t NP 001710 



1 
I 

MHVRSLRAAA 
IlfGLPHRPRP 
LQDSHPLTDA 
XRERFDNETF 
BNUGLQLSVB 
QliRSKTPlQIQ 
GECAFPWSy 
RNMWRACXX: 



11 
I 

FHSFVALWAP 
HLQGKHN8AP 
DMVMSPVNLV 
RISVYQVIiQE 
TLDGQ8X1IPK 
BALRHAMVAB 
MNAINHAIVQ 
B 



21 

I 

LFLLRSALAD 
MFMLDLYNAM 
EHDKEFFHPR 
HXiGRESDLFL 
tASLIGRBGP 
NSSSDQRQAC 
TLVHPINPET 



31 
I 

PSLDNBVHSS 
AVEEGGGPGG 
YHHRBPRPDL 
LDSRTLWASB 
QMKQPFHVAF 
KKRSEtYVSFR 
VPKPOCAPTQ 



41 
I 

FIHRRIiRSQB 
QGFSYPYKAV 
SKIPBGEAVT 
E6HLVFDXTA 
FKATBVHFRS 
DLGWQDIHIIA 
LMAXaVLyPD 



Seq ID KOi 519 DMA sequence 

Nucleic Acid Accession fti Bos sequence 

Coding eequencei 264.. 782 



1 
I 

CCCTGCTCCA 
TCATGGCG6G 
TTCCTCAGAC 
TGC0G6TGGC 
AACTTATCAO 
TGATOGOGGT 
TCAGCA60CA 
CCAGGOCCAG 
CAGAGAQAGA 
ATAGCTTGGA 
ACACACAAGT 
AGAACATAAA 
TCTGGTATTT 
GAATTCCAAA 
GQAAAT6CCA 
CAAAAAGAAT 
GCTGGTCTGT 
AAAACAGGCC 
TAAAOTATGA 
AAATTTTAAA 
TGACTTGCCT 
AAAATAGCAA 



11 
I 

GTCACACCOG 
ACTAATTTTC 
TGAQAACTGT 
ACTACTGAGA 
CAAGGAOCTC 
GQTGTCCCTC 
CATGOCXXAC 
CCCTGGCCAC 
CATCXXaUVTG 
TAO CTCCT GC 
OQTCTTTTCT 
GGAAATCACA 
TGTCAACCCT 
TATTTTTAAT 
TTTTTCCCCC 
GTGGAGAAGA 
ACCCAAAAAA 
AATGCCXX33G 
GAACTATGQA 
AAGGTTGAAT 
TTGAGCTCTT 
CCACCACCA 



21 
I 

6AA6CT6ACT 
CTTAAAATTT 
TTCCAGTATA 
GAC6AGGTGC 
ATCATGCTGA 
TTCCTGCAGO 
AAGAGTGAAC 
CATCATCCAC 
TCTGATTCCC 
A6TT0GCCTC 
GACCCTGGAO 
GATTATGTCA 
GCTCTGTCTG 
GGGGTCCAGT 
TTAAACAAGG 
AAACTGATAA 
GCTGTTOGTT 
CAAGAAAGGT 
GTCCATCAGC 
CAGCTGrrGT 
TAATTATTGG 



31 
I 

GGTCCAOGCA 
AGACTTGCAC 
TACATCAAGT 
CAGGGTG6TT 
CAGAAQTCAT 
CCTGCTTCCT 
AGATACTGAA 
CTGCTGTCAA 
TTTACAGGCA 
CTGCCTQOCA 
AACTAAAAAA 
ATGTCAATCC 
AGCCAGCGGA 
TCTCTATGGA 
CAT6GG6CTC 
ATACACAGAG 
CCTCAAAAAC 
TGAGATCAGA 
AGA0ATA6TA 
AGA6TTCTAT 
CAATAAACAA 



41 
I 

CACCTGAAGC 
AGTAAGGACT 
CACTGAGATC 
CCIGAAAGT6 
OGAGOTCTGO 
CAC06GCATC 
AGOGGCCAGT 
AGAGATGAAG 
TGACAGGGAC 
G6CXACAGAG 
TGACTCCCC5Q 
AQAAAGACAC 
ATATGATCAA 
TTCTTACATT 
ACAAGTCTAT 
GTCCTGAAGA 
AAAAACAAGQ 
TGTTAGGAAQ 
GTQAAGTCTC 
TT6GCAATGT 
CTTCTTTAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 

xoeo 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



51 
1 

RREMQREILS 
P8TQGPPI1AS 
AABFRIYKDY 
TSHBWWNFR 
IRSTGSKQRS 
PEOYAAYYCB 
DSGNVXLKXy 



60 
120 
180 
240 
300 
360 
420 



51 

I 

ATGAGGAAAC 
TCAACTGACC 
TCCAGCACCC 
CCTGAGCXXC 
GAXOGCTTAO 
AACTACCTGC 
CTCCAGGTTC 
GAGACTCAGA 
ACACCCTCAG 
GATOTGQATT 
CTGGACTAT6 
AAGCCCAGTT 
OTGGCCATGT 
TAATTTGTAG 
GGAGACAGGC 
CCCATGGACT 
CTTGGCTGGG 
AACTTTCAGQ 
TCCCCAGGGA 
GATGGTTAAA 
AGTTTTAAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



Seq ID MO; 520 Protein sequence 
Protein Accession 9) Eos sequence 
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1 II 21 31 41 51 

I I I I I I 

HLTBVMBVHH GLVZAWSLP LQACPL.TAIII YLLSRHMAHX 8BQILRMSL QVPRPSP6HB 60 

BPPAVKSQQB TQTESDIPMS DSLYRHDSDT PSOSLDSSCS 8PPA0QATGD VDYTQWPSD 120 
PGBLKNDSPL DYKNIKBITD YVNVNFBItHR P8FWYFVNPA LSEPABYDQfV AM 

Seq ID NO: 521 DMA sequence 

itucleic Acid Accession #i Bos sequence 

Coding sequences 107.. 328 

1 11 21 31 41 51 

I I I I I I 

CT6CTCTGTC TGAGOCAGOQ GAATATQATC AAGTG6CCAT GTQAATTCCA AATATTTTTA 60 

ATGOGGTCO^ GTTCTCTAT6 GATTCTTAOV TTTAATTTGT A06GAAATGC CATTTTTCCC 120 

CCTTAAACAA GGGATGGG6C TCACAAGTCT ATG6AGACAG GCCAAAAAGA ATQTGOAQAA 180 

GAAAACT6AT AAATACACAQ AGOTCCTCAA GACCCATGGA CTCCTGGTCT GTACCCAAAA 240 

AAGCTGTTOS TTCCTCAAAA ACAAAAACAA 6GCTTGGCT6 GGAAAACASG CCAATGCCCC 300 

GGCAAGAAAG QTTQAGATCA GATGTTA8GA AOAACTTTCA GGTAAACrTAT OAOAACTATO 360 

6A0TCCATCA GCAGAOATAa TAGTGAAGTC TCTOCOC A GG GAAAATTTTA AAAA66TTGA 420 

ATCAGCTQTT GTAaAOTTCT ATTTGGCAAT CTCATQGTTA AATQACTTCC CTTTGAGCTC 460 

TTTAATTATT G6CAATAAAC AACTTCTTTA AAAQTTTTAA ATAAAATA6C AACCACCACC 540 
A 

8eq XD HOt 522 Protein sequence 
Protein Accession #s Bos sequence 

1 11 21 31 41 51 

I I I I I . I 

MPFFPIiKQGH GLTSLHRQAK mVEKKTDKir TSVUCIHOU* VCTQKSCSFL XNXNXAHLQK 60 

QANAPARKVB IRC 

Seq ID NOt 523 DWA sequence 

Nucleic Acid Accession (ft Eos sequence 

Coding sequences 211..1B95 

1 11 21 31 41 51 

i ! I 1 I I 

GGATCTGAGO GGCGCCCAGT CACTTCCTCC AGQTTCTG6T GCTGGGCGG6 AGGAGCGGAT 60 

OGOOCTTGGQ AGQCAOCCTQ CTCTCCAGTC CCTATCCACC CACAQGTTTT TTGG6T06QA 120 

QAGQAATTAT CTOATAAAAT TCCTGGGTTA ATATTTTTAA AAAOGGAGAQ TTTTTAAAAA 180 

TQATTTTTTT CCCTOQAAAA TOACCTTTTT ATQCTTCQAA GCAOTTTGTC AACCAGCATA 240 

GTGCTTTTTC TTTTCTCTTC TTTTTCTACG ATAAATGAAA GCATTTCTTC AAGAAAAAGG 300 

CACAGGTTCC TTQAACAGCT GGATTCTGAT GGCACCATTA CTATAQAGQA GCAGATTGTC 360 

CTTGTGCTGA AAGCGAAAGT ACAATGTQAA CTCAACATCA CAGCTCAACT CCAGOAGGOA 420 

OAAGGTAATT GTTTCCCTGA ATGGGATGGA CTCATTTGTT GGCCCAGA6G AACAGTGGGG 480 

AAAATATOGG CTOTTCCATG CCCTCCTTAT ATTTATGACT TCAACCATAA AGQAGTTGCT 540 

TTCCGACACT GTAACCCCAA TGGAACATGG GATTTTATGC ACAGCTTAAA TAAAACATGQ 600 

GCCAATTATT CAQACTGCCr TOGCTTTCTQ CAGCCAGATA TCAGCATAGQ AAAOCAAOAA 660 

TTCTTTQAAC GCCTCTATGT AATGTATACC GTTGGCTACT CCATCTCTTT TGQTTCCTTQ 720 

GCTOTOGCTA TTCTCATCAT TGGTTACTTC AGAGGATTGC ATTGCACTAG GAACTATATC 780 

CACAT6CACT TATTTGTGTC TTTCATQCTQ AQAGCTAGAA GCATCTTTQT CAAAQACAGA 840 

GTAGTCCATG CTCACATA6G AGTAAAGQAG CTGQAGTCCC TAATAATGCA GGATGACOCA 900 

CAAAATTCCA TTGAGGCAAC TTCTOTGQAC AAATCACAAT ATATOSGGTG CAAGATT6CT 960 

GTTGTGAT6T TTATTTACTT CCTGGCTACA AATTATTATT GGATCCTGGT GGAAGGTCTC 1020 

TACCTGCATA ATCTCATCTT TGTGGCTTTC TTTTOGGACA CCAAATACCT OTGGGOCITC 1080 

ATCTTSATAO UC f GUOUCriT TOCAGCAGCA TTTGTTGCA6 CAT066CT6T G6CA06AGCA 1140 

ACTCTG6CTG ATOGQAGGTO CTGGGAACTT AGTGCTGGAG ACATCAAGT6 GATTTATCAA 1200 

GCACCGATCT TAGCAQCTAT TGGGCTGAAT TTTATTCTGT TTCTOAATAC GGTTAGAGTT 1260 

CTAGCTACCA AAATCTGGGA GACCAATGCA GTTGGGCATO ACACAAGGAA GCAATACAGG 1320 

AAACTGQOCA AATOQACACT O OT O Cr G GTC CTAOTCTTTG GAGTGCATTA CATOGTGTTC 1380 

GTAT6CCT6C CTCACTCCTT CACT6GOCTC GGGTGG6AGA TCCGCAT6CA CTGTGAGCTC 1440 

TTCTTCAACT CCTTTCAGGG TTTCTTTOTO TCTATCATCT ACTGCTACTG CAATGGAGAG 1500 

GTTCAGGCAQ AGGTGAAflAA GATGTGGAGT OGGTGGAATC TCTCOGTGGA CTGGAAAAGG 1560 

ACACOQCCAT GTGGCAGCCG CAGAT60QGC TCAGTGCTCA CCACOGTGAC GCACAOCACC 1620 

A6CAGCCAGT CACAGOIGGC GGCCAGCACA OGCAttSGTGC TTA7CTCTGG CAAAGCTGCC 1680 

AAQATCQGCA GCAQACAOCC TGACAGOCAC ATCACTTTAC CTGGCTATGT CTQQAGTAAC 1740 

TCAGAGCAGG ACTGCCTGCC ACACTCTTTC CAOGAGGAGA OCAAGGAAGA TASTGGGA6G 1800 

CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA TGGAATCTAA CCCAOACACT 1860 
GAAGGAtGCC AAOGAGAAAC TQAOQATGTT CTCTGA 

Seq ID NOi 524 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

111)11 

MLRSSIiSTSI VLFLF8SFST INESZSSSKR BRFLEQLDSD GTITIEEQIV LVLKAKVQCE 60 

LKITAQLQBG EGMCPPEWDG LICWPRGTVG KISAVPCPPY lYDFKHKOVA PRHCNPNGTW 120 

DFMHSUIKTW ANYSDCLRPL QPDISIGKQE FFBRLYVMYT VGYSISFOSL AVAILIIGYP 180 

RRLRCTRMYI HMRLFV8FML RAT8IFVXDR WHAHIGVKB LE8LIMQDDP QK8XfiAT6VD 240 

KSQYIGCXIA WNFIYFXAT UnWlVmSL YUSOiIFVAF P8DTKyLMGF ILIGK6FPAA 300 

FVAAMAVARA TLADARCHBL SASDXXNXyQ APZLAAIGUI PILPIMTVRV UVTKIWETNA 360 

VGUDTRROfYK KLAKSTI.VI.V LVPOVHYXVP VCLPHSPTGL GWBXRKHCEL PFNSPQGPPV 420 

SIXYOrOIGB VQAEVRKMWS RKNLSVDWKR TPPOOSRRGQ SVL7TVTHST 8SQSQVAAST 480 
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RMVLI86KAA KXASRQPDSH ITLP6YVHSN SEQOCLPHSP REBTKBDSGR QGDDIUfEXP 540 
SRPHBSNPDT E60QGBTBDV L 

Seq ID NOt S2S DIOl eequence 
Nucleic Acid Accession #t KM_00504B 
Coding sequence i 143.. 1795 " 

1 11 21 31 41 51 

I I I t I I 

GQCOGGTGGC CX3QGGCG06A CCACGCOUSC TG06GGT0GT TACTGGCCAC AAGTTT6CTC 60 

TGGGCCA6CC AAOTTGGCAA CTTGGAAGCT TCTCCGGGQC TCTGGAGGAO GGTOCCTGCT 120 

TCXTCCTACA GCOOTTCOGG GCATG0CG60 GCTGGGGGCXI TCGCTCCACG TCTGGGGTTG IBO 

GCTAATGCTC GGCAGCT6CC TCCTG6CCAG A6CCCAGC1G GATTCTGATG GCAGCATTAC 240 

TATAGAOGAO CAGATTGTCC TTGTGCTQAA A8GQAAAGTA CAATGIGAAC TCAACATCAC 300 

AGCTCAACTC CAOQAGGGAG AAGGTAATTG TTTCCCT G AA TGG6ATGGAC TCATTT6TTG 360 

GCCCAGAGGA ACAGTGGGGA AAATATCGQC TQTTCCATGC OCTCCTTATA TTTATOACTT 420 

CAACCATAAA GQAGTTOCTT TCC6ACACTQ TAACCCCAAT GQAACATGGG ATTTTATGCA 480 

CAGCTTAAAT AAAACATGG6 CCAATTATTC AGACTGCCTT 08CTTTCT6C AOCCAGATAT 540 

CA6CATA0QA AA6CAAGAAT TCTTTQAAGO CCTCTATOTA ATOXATAOOG TTGGCTACTC 600 

CATCTCTTTT G6TTCCTTGG CT6TGGCTAT TCTCATCATT GGTTACTTCA GAGGATT8CA 660 

TTGCACTAQQ AACTATATCC ACATGCACTT ATTTGrOTCT TTCATGCTOA GAGCTACAAO 720 

CATCTTTGTC AAAGACAGAG TA6TCCATGC TCACATAGGA GTAAAGQAGC TGGAQTCCCT 780 

AATAATGCAO GATGACCXAC AAAATTCCAT TGAGGCAACT TCTGTGGACA AATCACAATA 840 

TATOGGGTGC AAGATTGCTG TTOTOATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 900 

GATCCTGGTO GAAOOTCTCT ACCTGCATAA TCTCATCTTT GTGGCrrTCT TTTOGGACAC 960 

CAAATACCTO TGOOaCTTCA TCTTOATAGQ CrGaOGGTTT CCAGCAGCAT TTGTTGCAGC 1020 

ATGGGCT6TO GCACOAGCAA CTCTGGCTGA TGCGAGGTGC TGGGAACTTA GTGCTGQAOA 1080 

CATCAASTGG ATTTATCAAO CACOGATCTT AGCAOCTATT OGQCTOAATT TTATTCTGTT 1140 

TCTGAATAOO GTTAGAGTTC TAGCTAOCAA AATCTGGGAG ACCAAT6CAG TTGQ GCRT OA 1200 

CAGAAGGAAG CAATACAGGA AACTGGCCAA ATOGACACTG OTCCTGGTCC TAOTCTTTGG 1260 

AGTGCATTAC ATCGTGTTCG TATGCCTGCC TCACTCCTTC ACTGGGCTOG GGTGGGAGAT 1320 

CCGCATCCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 13 BO 

CTGCTACTGC AATGOAOAGG TTCA6GCAGA OGTQAAGAAG ATGTGGA6TC GGTGGAATCT 1440 

CTCOaTGGAC TGGAAAAGGA CACOGCCATG TOGCAGGOGC AGATGGQGCT CAGTGCTCAC 1500 

CACOGTGACG CACAGCACCA 6CA6CCA0TC ACAGGTG60G GCCAGCACAC GCATGOTGCT 1560 

TATCTCTGGC AAAGCTGCCA AGATCGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 1620 

TGGCTATGTC TGGAGTAACT C7VGAGCA6QA CTGCCTGCCA CACTCTTTCC ACOAGQAQAC 1680 

CAAG6AAGAT A6TG66A06C AGGGA6ATGA TATTCTAATO GAGAAGCCTT CCAG6CCTAT 1740 

GQAATCTAAC CCAGACACTG AAOGATGCCA AGGAGAAACT GAGGATOTTC TCTOAATGQA 1800 

CATTTGTGGC TGACTTTCAT GGGCTGGTCX: AATOGCTGGT TGTGTGAGAG GGCTTG6CTG 1860 

ATACTCCTAT GCTTGAGTTC AAAGGCTGAA AATTCAGTTA AGGTGTTACT TAATAATAGT 1920 

TTTTAGGCTC CATGAATTGO CTCCTGTAAA TACTAAOGAC ATOAAAATGC AAGTGTCAAT 1980 

GGAOTAOTTT ATTACCTTCT ATTGGCATCA AGTTTTCXrrC TAAATTAAT6 TATGGTATTT 2040 

GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAOATT CAATTGCTTG 2100 

GCTGTAGCTT TCTCTCATAT ATATCACCGT AAATATAATO AAOATCTTTT AOTGTGTATC 2160 

ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220 

ATTTATTTTO CCTGTQCATA GOAGCAATTA GGATCTAAAA AAATATATGO QAAGATAAAA 2280 

GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTrGGGIGG ACATTGATAA AATAATGCAT 2340 

TTATAACAAT TACATaiGTT TTTGGGAACA AGQAAAATTT CTCAAAAAAO AATATTTCAC 2400 

ACATCCCTTC TTTTGAATGG eXTiCT iTOr G ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460 

TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAOCTTQ TGTCTGCAAA 2520 

TTOATTTTGT TTGTAATGTA TTTT3ATAGC AAATCATGCT GCATCTATAT CTTTTTCTTO 2580 

TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TXAAAAATTT GTTTTAAAAA 2640 
T 

Seq ZD NOi 526 Protein sequence 
Protein Accession #i NP_005039 

1 11 21 31 41 51 

I I I I t I 

MAGLQASLHV WGWLMLGSCL LARAQLDSZX} TITIBEQIVL VLKAKVQCEL NITAQLQBGB 60 

GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAP RHOIPNGTWD FMHSLNKTWA 120 

NYSDOiRFIiQ PDISIGXQBP FERLYVMYTV GYSXSFGSIiA VAILZIQYPR RLHCTSNYIH 180 

MRLPVSFMLR AT8IFVKDRV VKAHXGVKBL ESLIKQmPQ MSIEATSVDR SQYIGCKXAV 240 

VMPIYFLATN YYWILVBGLY UlNLIFVAPP SDTKYWfGPI LIGWGPPAAP VAAWAVARAT 300 

LADARCWBLS AGDIKMIYQA PILAAIGLMP ILFLNTVRVL ATKIWETKAV GHDTRKQYRK 360 

LAKSTLVLVL VFGVHYIVFV CLPHSFTGLQ WEIRMHCELP FNSFQGFPVS IIYCYCNGBV 420 

QAEVKKMWSR TOUiSVDWKRT PP00SRR06S VliTTVTHSTS SQSQVAASTR KVLISGKAAX 480 

lASRQPDSHX TLP6YVWSH8 EQDCLPHSFH BETKED86RQ GDDILMBKPS RFHBSNPDTB 540 
GGQ6BTEDVL 

Seq ID NOs 527 DNA sequence 
Nucleic Acid Accession ft: XH_036683 
Coding sequence i 3 8 3 6 55 

1 11 21 31 41 SI 

I I t 1 i I 

GCTTTGCCCA GTAGTTGGAA AGTGAACTCQ ACTOGTQATO GTTCTCCTOT CACTTTQGTT 60 

GATAGCAGCC GCTCTGGTAG AGGTTAGGAC TTCAGCTGAT OGACAAOCTO GTAATGAA6A 120 

AATGGTGCAA ATAGATTTAC CAATAAAQAQ ATATA0A6AG TATQAGCTGG TGACTCCAGT 180 

CAGCaCAAAT CTAGAAGQAC GCTATCTCTC CCATACTCTT TCTQCGAGTC ACAAAAAQAG 240 

GTCAGCGAGQ GACGTGTCTT CCAACCCTQA GCAGTTGTTC TTTAACATCA CGQCATTTGG 300 

AAAAGATTTT CATCTGGGAC TAAAGCCCAA CACTCAACTA GTAGCTCCTG GGGCTOTTGT 360 

GQAOTOGCAT GAGACATCTC IX S GIGOCT OG QAATATAAOC GATCCCATTA ACAACCATCA 420 



1156 



wo 03/042661 
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ACCAGGAAOT GCTAOOTATA GAATCCGGAA AACAGAGCCT TTGOVGACTA ACrGTGCTTA 4 BO 
TGTTGOTGAC ATOGTGGACA TTCCAGGAAC CTCTOTTGCC ATCAGCAACT GTGATGGTCT 540 
G6CTGQAAT0 ATAAAAAGTG ATAATGAA6A GTATTTCATT GAACCCTTGG AAAGAGGTAA 600 
ACAQATG6A0 GAAGAAAAAG GAAG6ATTCA TGTTOTCTAC AAQAOATCAG CTGTAGAACA 660 
5 OGCTCCCATA GACATGTCCA AAOACTTCCA CTAGAOA6A6 TOGGACCTGG AAGGCCTTGA 720 
TGATCTAGGT ACTCTTTATG GCAACATCCA CX»GCA6CTG AATGAAACAA TGAGACGCCG 780 
CAOACRCGCG GGAGAAAACX3 ATTACAATAT CQAGGTACTQ CTGGGAGTGG ATGACTCTGT 840 
GGTCOOTTTC CATGGCAAAO A6CA0GTCCA AAACTACCTC CTGACCCTAA TGAACATTGT 900 
GAATQAAATT TACXATOATG AGTCCCT0G6 AGTQCATATA AATOTSOTOC TGG1X30QCAT 960 
10 GATAATGCTG G6ATATQCAA AOTCCATCAG CCTChTAOAA AGGX36AAAOC CATOCAGAAG 1020 
CTTGGAGAAT GTGTGTCX5CT GGGOOTCXXa ACAGCAAAGA TCTGATCTCA ACCACTCTGA 10 BO 
ACACCATGAC CATGCAATTT TTTTAACCAQ GCAAQACTTT GGACCTOCTG OAATGCAAGO 114 0 
ATATGCTCCA 6TCACC66CA TGTGTCATCC A6TGAGAAGT TOTACCCTGA ATCATGAGGA 1200 
TOGTTTTTCA TCTGCTTTT6 TAOTAQCCCA TGAAAG6GGC CATOTOTTGO GAATGOAGGA 1260 
15 TGATGOACRA GGCAACA6GT GTGGTGATC3A OACTGCTATG GGAAGTGTCA TGGCTCCCTT 1320 
GGTACAAGCA GCATTCCATC GTTACCACTG GTCCCQATGC AGTGGTCAAG AACTOAAAAG 1380 
ATATATCCAT TCCTATQACT GTCTCCTTGA TGACOCTTTT GATCATGATT GGCCTAAACT 1440 
CCCAGAACTT CCTGGAATCA ATTATTCTAT GGATQAQCAA TGTOGTTTTO ArrriXiUT UT 1500 
TGGCTATAAA ATGTtSGAGGG GGTTGOQAAC CTTTQAGOCA TQTAAACAOC TOTSaTCTAG 1560 
20 CCATCCTQAT AATOOCTACT TTTGTAAGAC 7AAAAAGGGA OCTCCACTTG ATGGQACTQA 1620 
ATGTGCTGCT GGAAAATGGT GCTATAAGGG TCATTGCATG TGGAAGAATG CTAATCAGCA 1680 
AAAACAAQAT GGCAATTGGG GGTCATOOAC TAAATTTGGC TCCTGTTCTC GGACATGTGG 1740 
AACTG6T6TT CGTTTGAGAA CSVCGCCAQTG CAATAATCCC ATGCCCATCA ATGGTGGTCA 1800 
QGATTOTGCT GOTOTTAATT TTQAOTAOCA GCmOTAAC ACAGAAOAAT 6CCAAAAACA 1860 
25 CTTTGAGGAC TTCAGAGCAC A6CAGTGTCA GCAGC6AAAC TGGCACTTTQ AATACCAGAA 1920 
TACCAAACAC CACTGGTTGC CATATGAACA TGCT6ACCCC AAGAAAAOAT GCCACCTTTA 1980 
CTOTCAGTCC AAGOAGACTG GAGATGTTGC TTACATGAAA CAACTGGTGC ATGATGGAAC 2040 
GCACTGTTCT TACAAAGATC CATATAGCAT ATGTOTGOGA GGAGAGTGTO TGAAAGTGGG 2100 
CTGTQATAAA GAAATTGGTT CTAATAAGGT TGAG6ATAAG TGTGGTGTCT GTGGAGGAGA 2160 
30 TAATTCCCAC TQCCXSAACOS TGAAG6GGAC ATTTACCAGA ACTCCC3VGGA AQCTTGGGTA 2220 
CCTTAAGATG TTTOATATAC CCCCTGGGGC TAGACATOTG TTAATCCAAG AAGA0GAG6C 2280 
TTCTCCTCAT ATTCTTGCTA TTAAGAACCA GGCTACAGGC CATTATATTT TAAATGGCAA 2340 
AGGGQAGGAA GCCAA0T06C GGACCTTCAT AGATCTTGGT GT6GAGTG6G ATTATAACAT 2400 
TOAAGATOAC ATTGAAAGTC TTCRCACCGA TGGACCTTTA CATGATCCTG TTATTGTTTT 2460 
35 GATTATACCT CAA6AAAATG ATACCOGCTC TA6CCTGACA TATAAGTACA TCATCCATQA 2520 
AQACTCTOTA CCTACAATCA ACAGCAACAA TGTCATCCAG GAA6AATTAQ ATACTTTTGA 2580 
GTGGGCTTTG AAGAGCTGGT CTCAGTGTTC CAAACCCTGT GQTGGAOGTT TCCAGTACAC 2640 
TAAATATGGA T6C0GTAGQA AAAGTGATAA TAAAATGGTC CAT06CAGCT TCTGTGAGGC 2700 
CAACAAAAAG COSAAACCTA TTAQACGAAT GT6CAATATT CAAGA6TGTA CACATGCACT 2760 
40 CTGGQTAGCA GAAGAATGGG AAGACTGGAC CAAAACCTGT GGAAGTTCTQ GCTATCAGCT 2820 
TOSCACrOTA CGCTGCCTTC AGCXyVCTCXn* TGATGGCACC AACC3GCTCTG TGCACAGCAA 2880 
ATACTOCATG GGTGACCGTC CCGAQAGCCG CCGGCCCTGT AACAGAGTGC CCTGCCCTGC 2940 
ACAOTGQAAA ACAGGACCCT GGAOTQAGTG TTCAGTGACC TGCX3GT6AAG GAA06GAGGT 3000 
GAGGCAGQTC CTCTGCAG06 CTG60QACCA CTGTGATGGT OAAAAGGCTO AOTOaOTCAO 3060 
45 AGCCTGTCAA CTGCCTCCTT GTAATGATQA ACCATGTTTO GGAOACAAOT CCATATTCTQ 3120 
TCAAATGOAA aTaTTGQC3VC GATACTGCTC CATACCAGGT TATAACAAOT TATGTTGTGA 3180 
GTCCTQCAGC AAGOQCAGTA GCRCCCXGCC ACCACCATAC CTTCTAGAAG CTGCTGAAAC 3240 
TCATGATOAT OTCATCTCTA ACCCTAGT6A CCTCXXTTAGA TCTCTAGTGA TGCCTACATC 3300 
TTTGGTTCCT TATCATTCAG AGACOOCTGC AAAOAAGAtG TCTTTQAOTA QCATCTCTTC 3360 
50 AOTGOGAGGT CCAAATGCAT ATGCTtSCTTT CAGGCCAAAC AGTAAACCTG ATQGTOCTAA 3420 
TTTAOGCCAG AGGAGTGCTC AGCAAGCAGG AAGTAAGACT GTQAGACTGG TCACC30TACX: 3480 
ATCCTCCCCA CCCACCAAGA GQQTCCAOCT CAOTTCAGCT TCACAAATGQ CTGCTGCTTC 3540 
CTTCTTTGCA GCCAGTQATT CAATAG6T6C TTCTTGTCAG 6CAAGAACCT CAAAGAAAGA 3600 
TOGAAAGATC ATTGACAACA GAGGTGCQAC AAOATCATGC AOCTTAGAAA GATGAGAAAO 3660 
55 TGAACCAAAA AGGCTAGAAA CCAGAGGAAA ACCIGGACAA CCTCTCTCTT CCCATGGTGC 3720 
ATATGCTTGT TTAAAGTGGA AATCTCTATA QATOGTCAGC TCATTTTATC TOTAATTGQA 3780 
AGAACAOAAA GTGCTGQCTC ACTTTCTAGT TGCTTTCATC CTCCTTTTGT TCTGCATTGA 3840 
CTCATTTACC AGAATTCATT GGAA6AAATC ACCAAAGATT ATTACAAAAQ AAAAATAT6T 3900 
^ TQCTAAOATT UTtfr i W r Ui CTCTCTOAAG CAOAAAAGGG ACTGQAAOCA ATTGTGCATA 3960 

60 TCAGCTGACT TTTT G ' m ' G l' TTTAGAAAAO TTACAGTAAA AATTAAAAAG AGATACCAAT 4020 
GGTITACACr TTAACAAGAA ATTTTGGATA TGGAACAAAQ AATTCTTAGA CTTGTATTCC 4080 
TATTTATCTA TATTAGAAAT ATTGTATGAG CAAATTTGCA GCTGTTGTGT AAATACTGTA 4140 
TATTGCAAAA ATGAGTATTA TTTTAAOAGA TGTOTTCTCA AATOATT QTT TACTA TATTA 4200 
CATTTCTGGA TGTTCTftGGT GOCTGTOGTT GAOTATTQOC TTGTTTGACA TTCTATAGGT 4260 
65 TAATTTTCAA AGCAQAGTAT TACAAAAGAO AAGTTAjQAAT TACAGCTACT OACAATATAA 4320 
A03GTTTTQT TQAATCAACA ATX3TGATACG TAAATTATAG AAAAA6AAAA GAAACACAAA 4380 
AGCTATAGAT ATACAQATAT CAGCTTACCT ATTGCCTTCT ATACTTATAA TTTAAAGGAT 4440 
TGOT Q TCTTA QTACACTTGT GGTCACAQOO ATCAAOGAAT AOTAAATAAT GAACTCGTGC 4500 
AAGACAAAAC TGAAACCCTC TTTCCAOQAC CTCAfiTAGGC AOCBTTOAOa TOl ' C C n 'TgT 4560 
70 TTTTOTGTGT GTGTGTTCTT TTTTAATTTT CGGATTGTTQ ACA6ATACAA ACAGTTATAC 4620 
TCAATOTACT OTAATAATOG CAAAGGAAAA A6TTTTGGGA TAACTTATTT GTATGTTGGT 4680 
AGCTQAGAAA AATATCATCA GTCTAGAATT GATATTTGAG TATAGTA6AG CTTTOGGQCT 4740 
TTQAAGGCAQ GTTCAAGAAA GCATATGTOO ATG8TTGAGA TATTTATTTT CCATATGGTT 4800 
CAT6TTCAAA TGTTCACAAC CACAATGCAT CTGACTGCAA TAATOT6CTA ATAATTTA1G 4060 
75 TCAGTAOTCA CCTTGCTCAC AGCAAAGCCA GAAATGCTCT CTCCAGGGAG TA6AT6TAAA 4920 
OTACTTOTAC ATAGAATTCA QAACTGAAGA TAlTrATTAA AAGTTGATTT TTTTTTCTTG 4980 
ATAGTATTTT TATGTACTAA ATATTTACAC TAATATCAAT TACATATTTT GQTAAACTAG 5040 
AGAQACATAA TTAGAGATOC ATGCTTTGTT CTQTGCATAO AGACCTTTAA GGAAACTACT 5100 
0^ ACA6CCAACT CAAAAGCTAA AACTGAACAA ATTTGATGTT A1GCAAACAT CTTGGATTTT 5160 
80 TAOTAGTTGA TATTAAGTT6 ATXSACTTGTT TCCCTTCAAG GAAACATTAA ATTGTATGQA 5220 
CTCAGCTAGC TGTTCAATGA AATTGTGAAT TAGAAACATT TTTAAAAOTT TTTGAAAQAO 5280 
ATAAGTGCAT CATGAATTAC ATGTACATGA GACGAQATAG TGATATCAGC ATAATGATTT 5340 
TGAOGTCAGT ACCTGAGCTG TCTAAAAATA TATTATACAA ACTAAAATGT AGATGAATTA 5400 
AOCTCTCAAA 0CACA0AAT6 TGCAAQAACT TTTGCATTTT AATG8TTGTA AACTAACAGC 5460 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TTAAACTATT 
OTCAAATTAQ 
CCTAAAAATT 
OCACAGGOQA 
ATACTTCTGA 
TACACTACTG 
AT 



OACTCTATAC 
GCAAATTCCA 
COCATAGAAT 
AGATAAAGAT 
OCTTTGGGCT 
ATGCTGTTTG 



CTCTAAAGAA 
GATAOTAAAA 
AAAATTCTCT 
GGTCACACAA 
TTCTTTTCTA 
TT6TACTQA0 



TTGCTGCTAC 
CAATCCCTAA 
CTAGTTTACT 
ACAQTTTCCA 
CTAAGCTAAA 
AGCAOGTACX: 



TTT6T6CAA8 
QGCTTAAGTC 
TGTGTGTGCA 
TAAAESVT6TA 
AATrCCTTTT 
AATAAAAA70 



AACTTT6AAG 

TACATCTCAT 
CATATTCATT 
TATCAAAOTG 
TTAACAAAAT 



5520 
5580 
5640 
5700 
5760 
5620 



Seq ID NOi 528 Protein sequence 
Protein Accession «i XP_036683 



1 
I 

KVIiLSLKLIA 
LSASHKKRSA 
TDPXKNKQPG 
ZBPLBRGRQM 
UHETMRRRRH 
INWLVRMIM 
FaPAGMQOYA 
MGSVMAPLVQ 
QCRFDFGVGY 
MWKNANQQKQ 
NTEECQKHFE 
KQLVHDGTHC 
RTPRKLOYLK 
GVEWDYNIED 
QBBLDTFGHA 
IQECTHPLWV 
QIRVPCPAQW 
LGDKSIFCQM 
RSIiVMPTSLV 
TVRLVTVPSS 
STLER 



11 
I 

AALVEVRTSA 
RDV6SNPEQL 
SATYRIRKTE 
EEEKGRIHW 

AqoroYKZEV 

L6YAK5ISLI 
PVTGMCHPVR 
AAFHRYHWSR 
KMCTAPRTFD 
DCaiWGSWTKF 
DFRAQQCQQR 
SYKDPYSICV 
MFDZPPGARH 
DIBSLHTDGP 
UCSWSQCSKP 
AEEWKHCTKT 
KTGPWSECSV 
EVLARYC8IP 
PYHSETPAKK 
PPTKRVHIiSS 



21 
I 

DGOAGNEEMV 
PPNITAFGKD 
PLQINCAYVQ 
yiORSAVBQAP 
LLGVDDSWR 
ERGNPSRSLE 
SCTIiNHEDGP 
CSGQELKRYI 
PCKQLWCSHP 
GSCSRTCGTG 
NSHPBYQNTK 
RQBCVKVGCD 
VLIQEDBA8P 
UIDPVIVLII 
CX3GGFQYTKY 
CGSSGYQLRT 
TOGEGTEVRQ 
GYKKLCCE8C 
MSLSSISSVG 
ASQMAAASFF 



31 
I 

QIDLPIKRYR 
FHLRLKPNTQ 
DIVDIPOTSV 
ZDMSRDFRYR 
FHGKEHVQNY 
NVCRWASQQQ 
S8APWAHET 
HSYDCLLDDP 
DNPYFCKTKK 
VRFRTRQCNN 
HHWLPYEHPD 
KEIGSNKVED 
HXLAIKNQAT 
FQENDTRSSL 
GCRRKSmnCM 
VRCLQPLLDG 
VI^CRAGDUCD 
SKRSSTX«PPP 
GPNAYAAFRP 
AASDSIGASS 



41 

I 

BYELVTPVST 
LVAPGAWEW 
AZSNCDGIiAG 
BSDLBGLDDL 
LLTLKUIVNE 
RSDLNBSEHH 
GHVLGMEHDG 
PDHDWPKLPE 
GPPLDGTECA 
PMPIKGGQDC 
PKKRCHLYOQ 
KCX3V0GGDNS 
GHYILNGKGB 
TYKYIIHEDS 
VHRSFCEANK 
TNRSVHSKYC 
GBKPESVRAC 
YLLEAAETHD 
NSKFZXSANLR 
QARTSXXDGK 



51 
I 

NLB6RYLSHT 
HETSLVPGNI 
MIKSDN6BYF 
GTVYGNZBQQ 
lYHDBSLGVH 
DHAIPLTRQD 
QGNRCGDETA 
LPGINY8MDB 
AGKWCYKGHC 
PGVNFBYQIiC 
SKBTGDVAYM 
HCRTVK6TFT 
EAKSRTFZDL 
VPTINSNNVI 
KFKPIRRMCN 
MGDRPESRRP 
QLPPCMDEPC 
DVZ8NPSDLP 
QRSAQQAOSK 
ZXDNRRPTRS 



Seq ID NO: 529 DNA sequence 
Nucleic Acid Accession «t NM_002774 
coding sequences 24 6.. 980 



AOGCGGAGAA 
AGCCGGGGGC 
GCTGGCTOQC 
QQGAQCTGTT 
OGGCCATGAA 
AGAATAAGTT 
TCTACACCTC 
CAQCTOOCCA 
AAAG6GAGAO 
ATGCCGCCAG 
CTGAACTCAT 
ACATCCTOGQ 
AC ATOCAOCT 
ACATGTTQTO 
OTCCQCTGGT 
GATCAAAGGA 
AAACCATTCA 
TGGCTGGTTC 
TGACGCTGAT 
CA6CTCCATC 
TACCCCCACC 
TTCCACACGT 
CTOCTTACTG 
AAGATGAAGA 
TTTCAQUZCT 
TATTTT 



11 

I 

AGCCCGATTG 
AGGG60GGG6 
TCTCTOCTOG 
CCTTCOCCOQ 
GAAGCTGATG 
GGTGCATSGC 
G6GCCACTTG 
CTGCAAAAAA 
TTCCCAGGAG 
CCATGACCAG 
CCAGCCCCTT 
CZGGGGCAAG 
GGTOTCCOQT 
TGCTGGGQAT 
ATQTGGAQAC 
GAAGCCAGGA 
GGGCAAOTGA 
CAGAAOGTCT 
6CTTAATAAA 
CTT6CATCAC 
ACTAAGAGAA 
TTQATTTCTT 
TOOGIOTCCX: 
TAAOQATGAT 
ATGACATACA 



21 
I 

TT0CT6GGCC 
GCCAGTGTGG 
CMSAGACAGAO 
ACTCAA8AAT 
GT66TGCTQA 
GGACCCTQCG 
CTCTGTGGTG 
GOGAATCTTC 
CAQAGTTCTG 
GACATCAT6C 
CCCCTGGAGA 
ACA6CAGATG 
QAjGQAOTQTO 
GAGAAGTAOO 
CACCTCOQAO 
GTCTACACCA 
CCCTGACATQ 
CTCACCTAGA 
CSOCAGOGACG 
TGG6GAGGAC 
TACAGGAAAIV 
CCTGCAGAOG 
CTTQG6ATQT 
ACAOTCTOCA 
TOGOATAGCA 



31 
I 

CTTTCCCCAT 
TOACACACGC 
OTOQOCAaGC 
CCGG0GAG6C 
GTCTGATTGC 
ACAA6ACATC 
GGGTCCTTAT 
AGGTCITOCT 
TTCTCCGGGC 
70TTG0GCCT 
GGGACTGCTC 
QTQATTTCCC 
AGCATOCCIA 
GGAAGQATTC 
GOCTTQTGTC 
AOGTCTGCAG 
TOACATCTAC 
CCTTOOCTOC 
TQAGGQTCCT 
GTGATGAGTG 
TCCCTTCTAG 
CCCAOCXnCG 
AOCTTTCTTC 
TCAGGCA6TG 
CCTGGG006C 



41 

I 

0606CCTGGG 
TGTAGCTGTC 
AGCACACAOA 
COGGAGGCCT 
TGCAGOCT6G 
TCACCCCTAC 
CCATCCACTO 
GGQGAAQCAT 
TGTGATCCAC 

ggcao6cxx:a 

AGCCftACACC 
TGACACCATC 
ooctggccag 
CTGCXZAGGGT 

atggggtaac 
atacaogaac 

CTOCOQACCT 
CCTOCrCT O C 

qattctcoct 

aggacttggg 
gcatctcctc 
txstctggaat 

ACTGCAOATT 
6CXGTTGQAA 
GATGCACTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



51 
I 

cctgctcccc 

TCCCCGGCTG 
GGGACCTAOQ 
GCAGCAGGAG 
GCAGAGGAGC 

caagctgccc 
tgggtcctca 

AAGCTT06GC 
CCTGACTATO 
GCCAAACTCr 
ACCAGCTQCC 
GAQTGTGCAT 
ATCACCCAQA 
GATTCTGGG6 
ATCCCCTGTG 
TGGATCCAAA 
ACCAOCCCAC 
TGGCCAGCTC 
GGTTTTACCC 
TCCTOGGTCT 
TCCCCAACCC 
CCCAGCTCCG 
TCTCACCZGT 
AGATTTAAlGA 
ATAAAGAATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 



Seq ZD NOi 530 Protein sequence 
Protein Accession #t NP^002765 

1 11 21 31 41 51 

I I I I I I 

MKKLHWLSL XAAANAEBQN KLVEG6PCDK TSHFYQAALY TSGHLLCGGV LIHPLWVIiTA 
AHCKKPNLQV FLGKHNLRQR ESSQEQSSW RAVIHPDYDA A8HDQDINLL RIARPAKLSB 
LIQPLPLERD C8ANTTSCHZ LGHGKTADQD FPDTZQCAYZ HLVSRBEGEH AYPGQZTQMM 
EX^GDEICYOK DSGQODSOQP LVCTOHLRGL VSNGNZPCOS KBKPGVYIW CBTOmZQICr 
ZQAK 

Seq ZD KO: 531 DWA sequence 
Nucleic Acid Accession ii NM_012152 
coding sequence: 43.. 1104 



60 
120 
180 
240 



11 



21 



31 



41 
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I I 1 I I 1 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60 

GACAAGCAOV TGOACTTTTT TTATAATAGG AGCAACACTQ ATACTGTOGA TGACTGQACA 120 

. OQAACAAAGC TTGTSATTGT Tl ' mmiq iT QaOAOOTTTT TCTOCCTGTT TATTTTTTTT 180 

5 TCTAATTCTC TGGTCATCGC G8CAGTGATC AAAAACAGAA AATTTCATTT CXXXTTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCOGAT TTCITOGCTG GAATT6CCTA TGTATTCCTQ 300 

AT3TTTAACA CAGGCCCAGT TTCAAAAACT TTaACTGTCA ACCGCTGGTT TCTCCXTTCAQ 360 

GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420 

^ AGGCAGATGT CAATCAXGAO GAT60GGQTC CATAOCAACC TQACCAAAAA GAGGGTGACA 480 

10 CTGCTCATTT TGCTTGTCTG GGOOVTOQCC ATTTTTA7GG GGGCGQTCOC CACACTGG6C 540 

TGGAATTQCC TCT6CAACAT CTCTGCCTGC TCTTCCCTGO CCCCCATTTA CAGCAGGAGT 600 

TACCTTOTTT TCTGQACAGT OTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 660 

CTGC6QATCT ACX3TQTACX3T CAAOAGGAAA ACCAAC30TCT TGTCTCOGCA TACAAGTGOG 720 

TCCATCAGOC OCX3GGA66AC AGCCATQAAG CTAATQAAGA CGGTOATQAC TGTCmGGG 780 

IS 606TTTGTGQ TATGCTGGAC CCGGGGGCT6 GTGGTTCT6C TCCTC6AG66 CCTBAACT6C 840 

AGGCAGTQTQ GC3GTGCAGCA TGTQAAAAGG TGOTTCCTQC TGCTGGOQCT GCTCAACTCC 900 

GTCOTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 

ATQATCT6CT OCTTCTCTGA GGAGAACCXA GAOAGGGBTC CCTCTOOCAT CCCCTCCACA 1020 

OTOCTCAOCA GGAOTOACSIC ASOCAGCCAO TACATAGAiOO ATAOTATTAG CCAAOGTGCA IQBO 

20 GTCT6CAATA AAA6CACTTC CTAAACTCTO GATGCCTCTC G6CCCACCCA GGTQATGACT 1140 
GTCTTAGG 



25 



8eq ID NOt 532 Protein sequence 
Protein Accession «i NP 036284 



1 11 21 31 41 51 

I I 1 I I i 

KNECHYDKHM DFPmRSNTD TVDOMTOTXL VIVLCVQTPF CLFXFFSNSL VIAAVIKNRK 60 

PEFPFYYIiLA MXAAADPFAO lAYVFLHFNT GPVSXTLTVN RWFLRQGIiLD SSXiTASXiTHIi 120 

30 LVXAVERHHS ZMRMRVHSNIt TKKKVTLLXL LVHAIAXFMO AVPTLGNNCL OfXSACSSLA 180 

PIYSRSYLVF WTVSNLMAPL IMVWYLRIY VYVKRKTNVL SPHTSOSISR RRTPMKLHKT 240 

VMTVLGAFW CHTPGLWLL LDGLNCRQOG VQHVRRWFLL LALLNSWNP XIYSYKDEDM 300 
YGTMKKMICX: FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCMR STS 

35 Seq ID KO: 533 DNA sequence 

HUcleic Acid Accession #: NN_002821 
Coding sequence 1 150.. 33 62 

1 11 21 31 41 51 

40 I 1 I I I I 

AACTCCCGCC T0GGQACX5CC TCGGGGTOGG GCTCCGGCTG CGGCTGCTGC TGCGGCGCCC 60 

GCGCTCOGOT GCGTCOQCCT CCTOTGCCOG CCG0C3GAGCA GTCTGOCGCC CGCCGTGOGC 120 

CCTCAGCTOC TTTTCCTGAQ CGCX3C06CGA TGGGAOCTGC GOGGGGATCC OGGGCXaGAC 180 

CCGGGGGGTT O CC ICTGCTC AGGOTGCrQC TGCIGOOO C T OCTGGQGGGT ACOCAGACAa 240 

45 CCATTGTCTT CATCAA6CAG CGGTCCTCCC AGOATGCiACT GCAGGGGOGC C9G06CX3CTGC 300 

TTCGCTGTGA GGTTGAGGCT CCGGGCCCGG TACATOTGTA CTGGCTOCTC GATGGGGCXX 360 

CTGTOCAGGA CAC6GAGCGG CGTTTOOCCC AGGGCAGCAG CXTTGAGCTTT GCAOCTGTGO 420 

AC066CTGCA OGACTCTGGC ACCTTCCAGT GTGTGGCTGG GGATGATGTC ACTGGA6AA0 480 

_^ AAGCCOGCAG IGCCAAGGOC TOCTTCAACA TCAAATQGAT TOAOQCRGGT CCTGTOOTOC 540 

50 TGAAGCATCC AGCCTOGGAA GCTGAGATCC A6CCACAGAC CCAG6TCACA CTT O G TT G CC 600 

ACATTQATGQ GCACCCTCGQ CCCACCTACC AATGOTTCCQ AGAT6GGACC CTCCTTTCTG 660 

ATGOTCAOAG CAACCACACA GTCAGCAGCA AGGAGOSGAA CCTGACGCTC CGGCCAGCTG 720 

GTCCTQAGCA TAGTGGGCTG TATTCCTGCT GOGCCCACAG TGCTTTTGGC CAGGCTT6CA 780 

GCAGCCAGAA CTTCACCTEG AGCATTGCTO ATQAAAGCTT T6CCA0GQTO GT6CTGGCAC 840 

55 CCCAGGA06T GGTAGTAGCX3 A6GTATGAG6 AGGCCATGTT GCATTQGCAO TTCTCAOCCC 900 

AGCCACCCCC GAGCCTGCAO TOGCTCTTTO AGGATGAGAC TCCCATCACT AAC06CAGTC 960 

GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACX3G GTCTCTGCTQ CTGACCCAGG 1020 

TCCX3G0CA0G CAATGCAGGG ATCTAGOQCT GCATT660CA GGGQCAGAQQ GGCCCACCCA 1080 

TCATCCTGGA AGCCACACTT CACCTAOCAG AGATtGAAGA CATGCOGCTA TTTGAGCCAC 1140 

60 GGGTGTTTAC AGCTGGCAGC GAGGA60GTG TGACCTGCCT TCCCCCCAA6 GGTCTGCCAO 1200 

AGCCCAGOOT GTGGTGOaAO CACaCXSGOAQ TCCGGCTGCC CACCCATGGC AGGGTCTACC 1260 

AGAAGGGCCA OSAGCTGQTG TTGGCCAATA TTGCTQAAAO TGATOCTGGT OTCTACACCT 1320 

GCCAOGCGGC CAACCT6GCT GGTCAGOGGA GACAGQATGT CAACATCACT GTGGCGACTG 1380 

TGCCCTCCTG GCTGAAGAAO CCCXAAGACA GCCM3CXGGA GOAGGGCAAA CCXX3QCTACT 1440 

65 TGGATTGCCT GACCCAGGCC ACACCAAAAC CTACAGTT8T CTGGTACAGA AACCAQAT6C 1500 

TCATCTCAGA GGACTCACX3G TTOaAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGOG 1560 

TGGAGGTGTA TGATGGGACA TOGTACCXTrT CrTATGAGCAG CACCCCAGCC GGCAGCATOG 1620 

AGG06CAAGC CCGTQTCCAA GTGCTGQAAA AGCTCAAGTT GACAOCACCA CCCCAGCCAC 1680 

AGCAOTGCAT GGAOTTTGAC AAGGAGGCCA OGGTGCCCTG TTGAOCCACA GGGGGA6AGA 1740 

70 AGCCCACTAT TAAGTGGGAA CXX3GCA0ATG GGAGCAGCCT CCCAGAGTGG 6TGACAGACA 1800 

ACGCTGGGAC CCTGCATTTT GCOCGGGTGA CTOSAGATGA OGCTGGCAAC TACACTTGCA 1860 

TTGCCTCCAA CGGGCCGCAO GGCXaWIATTC GTGCCX3VTGT CCA6CTCACT GTQQCAGTTT 1920 

TTATCACCTT CAAAGTGQAA CCAGAGCGTA CXSACTGTGTA CCAGGGCCAC ACAGCCCTAC 1980 

TGCAGTGOSA GGCCCAGGGO GACCCCAAGC OGCTGATTCA GTGGAAAGOC AAGQA006CA 2040 

75 TCCTGGACCC CACCAAGCTO GOACCCAGGA TGCACATCTT CCA6AATGGC TCCCTGGTGA 2100 

TCCATGAC3GT GGCCCCTGAO QACTCAGGCC GCTACACCTG CATTGCAGQC AACAGCTGCA 2160 

ACATCAAGCA CACGQAGGCC CCCCTCTATG TCGTGGACAA GCCTOTGCOG GAGGAGTCGG 2220 

AGGGCCCTGG CAGCOCTCOC CCCTACAAGA TGATCCAGAC CATTGGGTTO TOGGTGGGTG 2280 

CCGCTGTGQC CTACATCATT GCG0TGCT6G OCCTCATOTT CTACIGCAAG AAG00CT8CA 2340 

oO AAGCCAAGCG GCTGCAGAAO CAGCXXX3A0G GCX3AGGAGCC AGAQATGGAA TGGCTCAAOG 2400 

QAGGGCCTTT GCAGAACGGO CAGCCCTGAO CAGAGATCCA AGAAQAAGTO GCCTTGACCA 2460 

GCrrGGGCTC OGGCCCCGOO GCCACCAACA AAOGCCACAO CACAAGTGAT AAGATGCACT 2520 

T0CCAC60TC TAGGCTGCAG CCCATCACCA OGCTGGGGAA 6A6T6AGTTT GGGGAGGTGT 2580 

TGCTOOCAAA Q6CTCA6GGC TTGOAGQAOO GA6TG6CAGA QAOCCTGGTA CTTGTGAAGA 2640 
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GCXTTGCAGAC GAAGGATGAG CAOCAGCAGC TGGACTTCCG GA6G6AGTTG GAGATOTTTG 2700 

C3GAAGCTGAA CX3V0GCCAAC GTGGTGCGGC TCCTGGGGCT OTGCCGGGAG GCTOAGCCCC 2760 

ACTACftTGQT OCTGGAATAT 6TGGATCTQG GAGACCTCAA GCAOTTCCTG AGGATTTCCA 2820 

A6AGCAAQQA TQAAAAATTG AAGTCACAGC GCCTCAGCAC CAAGCAOAAG GTGGOCCTAT 2880 

5 GCACCCftQGT A6GGCTQGGC AT6QAGCACC TGTCCAACAA O OaClTrUW CATAAGGACT 2940 

TGGCTGCQCG TAACTGCCTG GTCAGT6CCC AGAGACAA6T GAAGGTGTCT G C CCTGGGCC 3000 

TCAGCAAGQA TQTGTACAAC AGTGAGTACT ACCACTTCCX3 CCAOQCCTGG GT G COGCTOC 3060 

GCTGGATGTC CCCGQAOOCC ATCCT6QAGG GTGACTTCTC TACCAAGTCT 6ATGTCT6GQ 3120 

^ CCTTOaaiUT GCTQATQTGG GAAOTOTTTA CACATGGAGA GATGCCCCAT GGTG6GCAG6 3100 

10 CAOATGATQA AOTACTGGCA QATTTQCAGQ CTOGQAAGGC TAOACTTCCT CAOCCOGAOG 3240 

GCTGCCCTTC CAAACTCTAT OGGCTGATGC AGCGCTGCTQ GGCCCTCAGC CCCRAGQACC 3300 

GGCCCTCCTT CAGTGA6ATT GCCAG06CCC TGGGAGACA6 CACCGTGGAC AGCAAGCCGT 3360 

GA66AGGGA6 CCGQCTCAOG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

GM3CAT6AT6 GGCAAOATCC CIGTOCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

15 TTOCTGAGOT CTOAGCAGOO CCUG G OCm' CCTCCTCTTC CTCAOCCTCA TCCTTTGGGA 3540 

GOCTGACTTG GACCCAAACT GGGC6ACTA6 GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT ATCAOOGACA GTQTGGOTGC CACAGGTAAC CCCAATTTCT GOCCTTCAAC 3660 

TTCTOCCCTT GACOGGGTCC AACTCTGCCA CTCATCT6CC AACTTTGCCT GGG6AGGGCT 3720 

AG0CTT6GGA TGAGCTQ6GT TTGTG6GGA0 TTCCTTAATA TTCTCAAGTT CTGGOCACAC 3780 

20 AGGGTTAATG AGTCTCTTGC CCACTGGTOC ACTT6GGGGT CTAOACCAGG ATTATA6AG0 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCAOGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATQAAG6AQ TTTTCAGGAQ 3960 

CTTTTGACAC TATATAAACC 6CCCTTTTT0 TATGCACCAG GG60GGCTTT TATATGTAAT 4020 

TGCAOCX3TG6 OGTOQGTGGO CATOGOAGQT AOGGOTGGGC CCTOQAOATO AGQAGGQTGO 4080 

25 GCCATCCTTA CCXXaCACTT TT A TTGTTQT C G TTTTTT Q T T TO TTTTOTT TTTTl ' U ' r ' m ' 4140 
TQTTTTTOTT TTTACACTOQ CTGCTCTCAA TAAATAAOCC TTTTTTA 
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Seq ID NOt 534 Protein sequence 
Protein Accession 4s NP_002812 



1 11 21 31 41 51 

I 1 I 1 I I 

MGAARG8PAR PRSIiPLLSVL LLPLLGGTQT AIVFIKQPSS C2DALQ68RAL LRCEVEAP6P 60 

VHVYWLU9GA FVODTERRFA QGSSLSPAAV DRLQDSGTFQ CVARDDVTGE EARSAHASEN 120 

35 ZKWIBA6PW LKHPASEAEI QPQTQVTLRC BIDGHPRPTY QWFRDGTPLS DGQSNBTVSS 180 

KESNLTLRPA GPEHSQLYSC CAHSAFGQAC SSQNFTLSIA DBSFARWLA PQDWVARYB 240 

EAMFHCQFSA QPPPSLQWLF EDETPITORS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 300 

CXGQGQR6PP IIIiBATIiHXiA BIEDMPLPEP RVFTAGSBBR VTCLPPKGLP EPSVHWBHAO 360 

VRZ^PTHGRVY QKGBELVIiAM lABSDAGVyT CRAANIiAGQR RQ0VNITVAT VPSHLKKPQD 420 

40 SQLBBQKPOY LDCLTQATPK PTWHYRNQM LX9EDSRFBV FXX9GTLRXNS VEVYDOTWYR 480 

CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQOIEFDKEA TVPCSATGRB KFTIKWERAD 540 

GSSLPBWVTD NAGTLHFARV TRDDAGNYTC lASNGPQGQI RAHVQLTVAV PITFKVEPBR 600 

TTVYQGHTAL LQCBAQGDPK PLIQWKOKDR ILDPTKLGPR MHIFQHGSIiV IHDVAPEDSQ 660 

RYTCZAGNSC NIXHTBAPLY WDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYXXAVL 720 

4S GLMFYCKKRC KAKRLQKQPE GEEPEMECLK GGPI/QNGQPS AEIQEEVALT SLGSGPAATN 780 

KRHSTSDKMH FPRS8LQPXT TLGKSEFGEV FLAKAQGLEE GVAETLVLVK SLQTKDEQQQ 840 

LDFRRELEMF GKLNHANWR LLGLCRBAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 900 

PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALO LSKDVYNSBY 960 

^ YHFRQAWVFIi RMMSPEAIIiE GDFSTKSDVW AFGVLMWBVF THGEHPHGGQ ADDEVLADLQ 1020 

50 AGKARIiPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA liGDSTVDSKP 



Seq ID MO: 535 PNA sequence 
Nucleic Acid Accession «: NM_013952 
Coding seqpieneet 161.. 1357 ~ 



1 11 21 31 41 51 

) I 1 1 i I 

TTCAGAAGGA GGAGAGACAC OGGGCCXAGG GCACCCTCGC GGGOSGGCGG ACCCAAGCAG 60 

TGA6GGCCTG CAGCCGGCCG GCCAGGGCAG CGOCAGGOGC GGCCCGQACX: TACGQGAGQA 120 

OO AGCCCCGAGC CCTCGGCGGG CTGCX5AGCGA CTCCCOGGOS ATGCCTCACA ACTCCATCAG 180 

ATCrOGCCAT 6QAGGGCTGA ACCAQCTOGO AGGGGCCTTT 6TGAATGGCA GACCTCTGCC 240 

QGAAGTGGTC CGCCAGCGCA TCGTAOACCT GGCCCACCAG GGTGTAAGGC CCTGOGACAT 300 

CTCTOGCCAG CTCOGCGTCA GCXATGGCTG OGTCAGCAAG ATCCTTGGCA GGTACTACGA 360 

- GACTGGCAGC ATCOSGCCTQ GAGTGATAGO GGGCTCCAAQ COCAAGGTGG CCACOCCX»A 420 

65 GGTGGTGGAQ AAGATTGGGG ACTACAAAOG CXaVGAACCCT ACCATGTTTG CCTGGGAGAT 480 

CCGAGACCGG CTCCTGGCTG AGGGCGTCTG TOACAATGAC ACTOTGOXA GTGTCAGCTC 540 

CATTAATAGA ATCATCCGGA CCAAAGTGCA GCRACCATTC AACCTCCCTA TGGACAGCTG 600 

CQTOQCXACC AAGTCCCTGA GTCCOGGACA CACOCTGATC CCCAOCTCAO CTGTAACTCX: 660 

CCOGGAGTCA CCCCAGTCGG ATTCCCrGGO CTCCACCTAC TCCATCAATG GGCTCCTGGG 720 

70 CAT06CTCAG CXntSGCAGGO ACAAGAGQAA AATGGATGAC AGTGATCAGG ATAGCTGCOG 780 

ACTAAGCATT GACTCAC9U3A 6CA6CASCAQ 0G6ACCCCGA AAGCACCTTC GOVCGGATGC 840 

CTTCAGCCAG CACGACCTCG A6G06CTGQA GTGCCCATTT GAGCGGCAGC ACTACCCAGA 900 

GGCCTATGCC TCCOCCAOCC ACACCAAAGG CX3AGCAGGGC CTCTACCCGC TGCCCTTGCT 960 

CAACAtSCACC CTGGAOGAOQ GGAAGGCCAC CCTGACCXXTT TCCAACACQC CACTGGGGCG 1020 

CAACCTCTCG ACTCACCAGA CCTACXXXX3T GOTGaCAGCT COQCCCTTTT GOATCTOCAG 1080 

CAAGTOGGCT CX3GGGGTCCC GCCCTTCAAT GCCTTTCCCC ATGCTGCCTC OTrGTACGGG 1140 

CAOTTCAOGO GCCAGGCCCT CCTGTCAGGG GQAOAGATGG TGGGGCXCAC GCTGCCCGGA 1200 

TACCCACCCC ACATCCCCAC CAGOSGACAG GGCAGCTATG CCTCCTCTGC CATOGCAGGC 1260 

ATGGTGGCAG GAAGTQAATA CTCT6GCAAT GCCTATGGCC ACACCCCCTA CTCCTCCTAC 1320 

AGCX3AQGCCT GGGQCTTCCC CAACTCCASC TTGCTGAGTT CCCCATATTA TTACAQTTCC 1380 

ACATGAAGGC GQAGTGCACC 0CCCACC3^ GCCACGGCCT TTGACCATCT GTAQTTGGCA 1440 
TGGGGAOIGT O 
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Seq ID NDi 536 Protein sequence 
PKotein Accession fti NP_039246 

I II 21 31 41 51 

I 1 I I I i 

MPHNSIRSGH GGLHQLGGAF VKGRFLPBW RQRIVDIiAKQ GVRPCDI8RQ LRVSHOCVSX 60 
IIXatYYETCS IRPGVIGGSK PKVATPKWB KIGDYKRQNP TMFAWBIRDR LLAEGVCDND 120 
TVPSVSSIMR XIRTRVQQPF NLPMDSCVAT KSl«SP(3HTLX PSSAVTPPBS PQSDSL6STY 180 
SXNGLLGZAQ PG$DKRKMDD 8OQD6CRI18Z DSQSS8S0PR XKUtTDAFSQ HHLBPLECP? 240 
ERQHyPEAYA 8PSHTKQBQ0 LYPLPLUIST LDXX3KA.TLTP SNTPbQRNbS THQTYPWAA 300 
PPPHICSKSA PGSRPSNPPP MLPPCTGSSft ARPSSQGERN WO PR CPP TOP T8PPADRAAM 360 
PPLPSQAHtgO BVNTLANPMA TPPTPPTARP GASPTPAC 



^ . Seq ZD NOi 537 DWA sequence 
15 Hucleic Acid Accession «i 2m_003466.l 



Coding sequence : 11 .. 13 63 

1 11 21 31 41 51 

1 1 I I I I 

GAATTOQGCQ ATGCCTCACA ACTCCATCAO ATCTQQOCAT GGAGGGCTGA ACCAGCTGG6 60 

AG G GG C CTTT OTGAATGGCA GACXTTCTGCC GGAAGTGGTC CGCCAGCGCA TCGTAGACXT 120 

GOCCCACCAO G0TGTAAG6C QCIGCQUCAT CTCTOSCCAO CTCCGCX?TCA GCX:aTGGTTO IBO 

CGTCAGC3U\G ATCCTTGGCA GGTACTACGA GACTG6CA6C ATCOQGCCTG GAQTGATAGO 240 

GG6CT0CAA0 CCCAA(X3TBO GCAOOOOCAA GGTGCnOGAG AAGATTOGQa ACTACAAACO 300 

OCAGAAOCCT ACCATQTTTO CCTGOGAGAT CCGAGACOGO CTCCTGGCTG AGGOCGTCTQ 360 

TGACAATGAC ACTGTGCCCA GTGTCAGCTC CATTAATAGA ATCATCC66A CCAAAGTGCA 420 

GCAACCATTC AACCTCCCTA TGGACAGCTG OQTGGCCACC AAGTCCCTGA GTCCCGGACA 480 

CA06CTGATC CCXIAGCTCAG CTGTAACTGC CC0G6AGTCA CCCCAGTOGG ATTCCCTGOG 540 

CTCCACCTAC TCCATCAATO GGCTCCTQGG CATOGCTCAG CCTGGCAGCG ACAA6AGGAA 600 

AATGGATGAC AGTGATCA6Q ATA6CTOC06 ACTAAGCATT GACTCACAQA GCAGCAGCAO 660 

CGQACCCCQA AAGCACCTTC GCACGGATGC CTTCAGCCAG CACCACCTCO AGCCQCTCOA 720 

GTGCCCATTT GAGCGGCAGC ACTACCCAGA GGCCTATGCC TCCCCCAGCC ACACCAAAG6 760 

CGAGCAGGQC CTCTACCC6C TGCCCTTGCT CAACAGCACC CTGGACXSACX3 GGAAG6CCAC 840 

CCTOACCXXrr TCCAACAOQC CACTQGOQCG CAACCTCTCG ACTCACC3W3A CCTACCCCGT 900 

GGTGGCAQAT CCTCACTCAC CCTTCGCCAT AAAGCAGGAA ACCCCCGAGQ TGTCCAGTTC 960 

TAQCTCCACC CCTTCCTCTT TATCTAOCTC CX3CCTTTTTG 6ATCTQCAGC AAOTCGGCTC 1020 

COGGOTCCCO CCCTTCAATG CCTTTCCCCA TGCTGCCTCC GTGTACGGGC AOTTCAOGGG 1080 

CCAGGCCCTC CTCTCAGGGC GAOAGATGGT GGGGCCCAGG CTGCC0G6AT ACCCACCCCA 1140 

CATCCCCACC AGCGGACAGG 6CA6CTATGC CTCCTCXGCC ATCGCAGGCA TGGTGGCAGG 1200 

AAGT6AATAC TCT6GCAATG GCTATGGCCA CACCCCCTAC TCCTCCTACA GOQAOGCCTO 1260 

606CTT0C0C AACTCCAOCT TGCIOAOTTC COCATATTAT TACAGTTCCA CATCAAGQGC 1320 
GAGTGCACOG CCCACCACTG GCAQ3Q0CTT TGACCATCTTG TA6TTGAAGC TT 

Seq ZD NOt 538 Protein seguendb 
Protein Accession NP_0034S7 

1 11 21 31 41 51 

I I I I i I 

MPHNSIRSGH GGLKQLGGAF VNGRPLPEW RQRIVDLAHQ GVRPCDISRQ XiRVSHGCVSK 60 

ILGRYYETQS IRPGVIGGSK PKVATPKWE KIGDYKRQNP XMFAWEIRDR LLAEGVCDND 120 

TVPSVSSIMR IIRTKVQQPP NLPKOSCVAT KSLSPGHTLI PSSAVTPPE8 PQSDSLGSTY 180 

SZNGLLOZAQ PGSDKRKMDD SDQDSCRLSZ DSQSSSS6PR RBLR113AFSQ HHLBPLECPF 240 

ERQHYPBAYA 8PSHTK6EQG LYPLPLLNST LDDGKATLTP SNTPLGRNLS THQTYPWAD 300 

PHSPPAIKQE TPEVSSSSST PSSLSSSAFL DLQQVGSGVP PFNAFPHAAS VYGQPTGQAL 360 

LSGREMVGPT LPGYPFHIPT SGQGSYASSA XAGKVAGSEY 8GNAYGHTPY SSY8EAHRFP 420 
HSSLLSSPYY YSSTBRPSAP PTTATAPDHL 

Seq ZD NOs 539 DMA secfuence 
Nucleic Acid Accession #« NM_006799 
Coding sequence: 19.. 963 " 

1 11 21 31 41 51 

I } I I I i 

GCGGGGGGAO AGGAG6CCAT GGG0GG606C G6GG06CTGC TGCTGGOQCT 6CTGCT6GCT 60 

GGGGCTGOAC TCAGGAAGCC GGAGTOGCAO 6AGG0GG0GC GGTTATCAGG ACCAT006GC 120 

OGACGGGTCA TCACGTCGCG CATCGTGQGT OQAQAOQAGG CC3QAACrCG0 GOGTTGGCC6 160 

TGGCAGGGGA GCCTQOGCCr GTGGGATTOC CACGTATGC3G GAOTGAGCCT GCTCAGOCAC 240 

C6CTG0GCAC TCAGGGOGGC GCACTGCTTT QAAACCTATA OTQACCTTAO TGATCCCTCC 300 

GGGTGGATGG TCCAGTTTGG CCAGCTQACT TCCATGCCAT CCTTCXGGAG CCT6CAGGCC 360 

TACTACACCC GTTACTTOGT ATOQAATATC TATCTGA60C CT06CTAGCT GGG6AATTCA 420 

COCI ^TQftC A TTOCCTTGOT mGCZOTCT OCAO CTOTCA CCIACACTAA ACACATC CAO 480 

CCGATCTGTC TCCAGOCCTC CACATTTGAG TTTQAGAACC GGACAGACTG CrGGOTGACT 540 

QOCTGGGGGT ACATCAAAGA GQATQAOGCA CTGCCATCTC CCCACACCCT CCAGGAAOTT 600 

CAGGTOQCCA TCATAAACAA CTCTATGTGC AACCACCTCT TCCTCAAGTA CAGTTTCOGC 660 

AAGGACATCT TTGGAGACAT GQTTTGTGCT GGCAATGCCC AAGGCQQGAA GQATQCCTGC 720 

TT OQGTO ACT CAGGrGOACC CnU UC d 'tfl AACAAOAATO GACTGTGQIA TCAGATTGGA 780 

GTCGT6A0CT GGGGAGTGGG CTOTGGTCBO CGCAATCGGC COSGTGTCTA CACCAATATC 640 

AGCCACCACT TTGA6TGGAT CCAGAAGCTG ATGGCCCAGA GTGGCATGTC CCAGCCAGAC 900 

CCCTCCTGGC CGCTACTCTT TTTCCCTCTT CTCTGGGCTC TCCCACTCCT QGGQCOGGTC 960 

TOAGCCTACC TGAGCCCATG CAGCCTGQGO CCACTGCCAA GTCAGQCCCT GQTTCTCTTC 1020 
I GTCIT O TTT GOTAATAAAC ACATTOCAGT TGATGCCTTG CAGGGCATTC TTCAAAA 

Seq ZD MOt 540 Protein sequence 
Protein Accession #: IfP 006790 



1161 



wo 03/042661 



PCT/US02/36810 



1 11 21 31 41 51 

I I 1 I 1 I 

MQARQALUA LLXARAOLRK PBSQBAAPIiS 6P0GRRVXT8 RIVG6EDAEL GRWPHQ6SLR 60 

J LHDSBVC6VS LLSRRHALTA ABCFETySDL SDPSGHIWQF GQLT8KPSFH SLQAYYTRYP 120 

VSNIYLSPRY LGNSPYDZAL VKLSAPVTYT KHIQPICLQA STFEFENRTD CWTGWGYIK 160 

8DEALP8PHT LQEVQVAIXK KSMQIHLPLK YSFRKDIFGD MVCAGNAQGO K0ACP6DSG0 240 

PXiACSIKNGLH YQIOWSWOV GOGRPNRPGV YTNISHHFBW IQKLHAQSGM 8QPDPSWPLL 300 
PPPLLMALPL LGPV 
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Seq ID NO: 541 DHA sequence 
Nucleic Acid Accession «i NM_014344 
coding sequence: 131.. 1444 



1 11 21 31 41 51 

111)11 

G0GGCO6OQA TGGGGCOQAA GCX3CCOQAAO CCCOGGAGCC CACAAACT6C C6GGCCCGCC 60 

TCOCOQCOQQ SACCGGGOTQ CCTGGGGTCQ GCTTGAAGCG G0G606GCGC ACOGGCACAG 120 

COSOGGGAGC ATG6GCAGQA GQATG0GGG6 CGCCGCCGCC ACC3SOGSG66C TCTQGCTGCT ISO 

20 QGGQCTGGGC TCGCTGCTGO CGCTGTGGGO AOGGCTCCTO COOCOQOGGA COQAGCTQCC 240 

CGCCTCCCGG COGCCCGAAG ACCGACTCCC AC3GG0GCCCG GCCOGGAGCG GC6GCCC0GC 300 

GCCOGOSCCT CX3CTTCCCTC TGCCCCCGCC CCTOGCGTGG GACGCCCGOG OCXSQCKXCT 360 

QAAAACTTTC CX3QG00CT0C TCACCCTGOC OGCOGQCGCG GA066CC0GC CCCGGCAGTC 420 

^ O066AGOCSAO CCCAGGTGGC AGQTOTCAGC CAGGGAGCCC OQGGOGGAGQ AQASOQCOOC 480 

25 GGTGCAC6GG 6GGGTCTTCT GGAGCOGCGG CCTOGAGGAG CAGOTGCCOC CGGGCTTTTC 540 

GGA66CCCA6 G0GG0660GT G6CTGGAG6C GGCT0G06GC GGCXX3GATGQ TGGCCCT06A 600 

GOQOGGGGOT TG0GG600CA GCTCCAACCG ACTGGCCCOT TTTGCOGAOG GCACCCGCX3C 660 

CT60QT6CGC TACGGCATCA hCCCOGPJ3Ch GATTCAG6GC GAOGCCCTGT CTTACTATCT 720 

in qgggOQOCTQ C TOGOC CTCC A 005C CA0OT GCGQCOQCTQ GGACTGGCTC GOCSTaOAOGC 780 

3U TOOGGQOGOQ CAGTGGGCOC AG6T6CAGGA GQAGCTG03C GCTGOGCACT G6AC0SAGGG 840 

CAGOOTOGTO AQCCTOACAC GCT6QCTGCC CAACCTCAOG GAOGTGGTQG TQCCCGCGCC 900 

CTGGOQCTCO GAGGACQOCC QTCTGOQCCC CCTCCGGGAT 6CCGGGGGTG AGCTGGCCAA 960 

CCTGAGCCAO G0GGA6CTGG TGGACCTAGT ACAATGGACC GACTTAATCC TTTT08ACTA 1020 

GCTGACOGOC AACTTOGACC QQCTOaTAAO CAACCTCTTC AGCCTGCAOT OPqACCCGOQ 1080 

OD 06TCAT6CAG GGT6CCACCA GCAACCTGCA CG6CGGTCC6 G6CX3GG60GC TQGTCTTTCT 1140 

QGACAATGAG GOGGGCTTGG TGCAOGQCTA COQGGTAGCA 66CATGTGGG ACAAGTATAA 1200 

06AGCOGCTG TTGCAGTCAO TGTGOGTGTT CX:GCGAGCGG ACCGCGCX3GC GOGTCCTGGA 1260 

GCIOCAGCGC GGACAG6A0S CGGG6GGGG8 GCXGCTGCGC CTCTAC066C OCCAGQAOCC 1320 

TOQCTTCCOC GAOCTGQOOO COCTTG CAQA CCCCCAC96CT CAGCTGCTAC A OOGCO G C CT 1360 

40 G6ACTTCCTC GCOUIGCACA TTTTGCACIG TAAGGCCAAG TA0G6C06CC GGTCTGGQAC 1440 

TTAGTGTCAC CX3GQAGGAAA AGAGAOAGAT CTGGGGCTGG GGTATGGATQ ATGGGGGGAA 1500 

GGGOGGTCGC CTCTGCCACT GTCAGGGACX: AGCGGGCCAA OGCCX3VCCCX3 CAAAGGT6TC 1560 

TAAAAACTTC AGCTTTTCAC CCACCTGCCC CTTTCITTC31 ATC0C3\0GCT GTTTCCTTTC 1620 

. AAA6TTCTGG GAGGACGAAC TCACCX3AGGC GAGAAGTGTA AGATTCTCTC CACCCA6CTT 1680 

4S ATAAAAGGAT TCTTTACTGT G0CAGCACX3G GGATTGGATC GBAAGAAACT GOCrACTGGG 1740 

QTTTGGCCCC CGAGTGGCCG TCCCTGTGGG AGATQCACCC CATTCTTGOG CXXXCCTCAT 1800 

TCCCTTTCCG AAAAAGGAAA ACTTGCGTTT GAGCCGTTGA GCTAATTCTG CAATTTTCTA 1860 

CCAAACAOAO 06CTGGTG6C CC06GAGCAG GGCTGTQACA TTGGCTG6TG GAQCCCCTTC 1920 

CTOTGTTCTC CCTTTGTTCC AGC3GC0G0GA TGGTGAGATC ACTGTTCCAA GCAOGGGGAC 1980 

50 GGCTCGOQAT AGQACAAAGA GA6CAG6ACC TCCAGACTCT GGGGAGCCCT GCAQACCTTG 2040 

ACAATTTGCC TGACTCATTC CTGACCTCTT GTCATTTTGO CCTGAAGGCT ACAAATTCAG 2100 

GGTCAGCTGT ATGCACTAAG TCAAATAATG AATTTCTTCC TCCCTCTOGC AACCGACCAA 2160 

AATTTTGACA AOGATGATGT TCACCAOAAO GAAAAAAAAA TCA6TTTTAT GCACTTTAT7 2220 

_ - TTGTTTTGAT TTTCATTTTT TATTAAGAAA AAATTTTATT TTACAGAATT TAGCTTCTCT 2260 

55 GTATATATGT GCATAAAGTG TGGTGTAAA7 ATACTAAACA AACTTATATT TCAATTkAAAO 2340 
GGAGTTTAAA ATTTAAAAAA AAAAAAA 



6eq ZD NOt 542 Protein sequence 
Protein Accession #i NP_05S159 

1 11 21 31 41 51 

I I I I I I 

MGRRMRGAAA TAGLHLLALG SLLALWGGLL PPRTELPASR PPEDRLPRRP ARS66PAPAP 60 

RFPIiPPPLAW DARGGSIiKTP SALLTLAAQA DGPPRQSRSE PRHHV8ARQP RPBBSAAVHQ 120 

GVFHSRGLEE QVPPGPSEAQ AAAWIiBAARG ARMVALERGG CGRSSMRLAR FADGTRACVR 180 

YGINPEQIQO EALSYYLARL LGlJQRHVPPL ALARVEARGA QWAQVQEELR AAHWTE6SW 240 

SLTRHLPNLT DWVPAPWRS EDGRLRPLRD AGOELAHLSQ AELVDLVQWT DLILFDYLTA 300 

NFDRIiVSZOiF SLQWDPRVKQ RATSNUIRGP GGALVFIjDNB AGbVHGYRVA QWDKYNSPL 360 

LQSVCVFRER TARRVLBLRR OQDAAARbLR XiYRRBBPRFP BLAALADPBA QLLQRRUDFL 420 
AKHILHCKAK YGRR6GT 

6eq ID NO: 543 DNA sequence 

Nucleic Acid Accession tfi XM_007652.4 

Coding sequence I 1..1290 



1 11 21 31 41 51 

1 I i I I 1 

ATG6CCGGCT CTGG06CGTG GAAGCOCCTC AAATCTATOC TAAGOAAOGA TGATGOGCOG 60 

CTGTTTTTAA ATOACACCAQ CGCCTTTGAC TTCTOGGATG AGGOSGOGQA CGAGGGGCTT 120 

80 TCT0G6TTCA ACAAACTTOQ AGTTGTGGTO GCCGATGAOO GTTCGGAAGC CC0G6AAA0G 180 

CCTGTTAACO GGGCX3CACCC GACXOTCCAO GCCGAOGATQ ATTCCTTACT GGACCAAGAC 240 

TTACCTTTGA CCAACAGTCA GCTGAQTTTG AAGGTGQACT CCTGTQACAA CTGCAGCAAA 300 

CAGAGA6AGA TACTGAAGCA aAGAAAGGTG AAAGCCAGGT TGACCATT6C TGCCGTTCTO 360 

TACTTGCTTT TCATGATrOG AGAACTT6TA 06TGGATACA TTOCAAATAG OCTAGCAATC 420 

1162 



wo 03/042661 



ATQACAOATO CACTTCATAT GFTAACrCSAC CTAAGOGCCA TCATACTCAC CCTGCTTGCT 480 

TTQTGGCTAT CATCAAAATC ACCAACCAAA AGATTCACCT TTGGATTTCA TOOCTTAGAG 540 

GTTTTGTCAG CTATGATTAG 'iVr GC l VlTO GTGTATATAC TTATGGQATT CCTCTTATAT 600 

GAAGCTGT6C AAAGAACTAT CCATATGAAC TATGAAATAA ATQGAGATAT AA7GCTCATC 660 

AC06CAGCTG TTGGAGTTGC AGTTAAT6TA ATAATGGGOT ITCTGTTGAA CCAGTCTGGT 720 

CACXSSTCACT CCCATTCCCA CTCCCTGCCr TCAAATTCCC CTACCAfiAGG TT CTGGG TGT 780 

GAAOGTAACC ATGGGCAGGA TAGCCTGGCA GTQAGAGCTa CATTTGTACA TQCTTTGGGA 840 

OATTTGGTAC AGAGTCrTGO TOTGCTAATA GCTGCATACA TCATACGATT CAAGCCAGAA 900 

TACAA6ATT0 CT6ATCCCAT CTGTACATAC QTATTTTCAT TACTTOTGGC TTTTACAACA 960 

TTTC6AATCA TATGG6ATAC AGTA6TTATA ATACTAQAAG QTGTGOCAAO CCATTTQAAT 1020 

GTA6ACTATA TCAAAOAAGC CTTGATQAAA ATA6AAGATG TATATTCAGT C6AAGATTTA 1080 

AATATCTGGT CTCTCACTTC AGGAAAATCT ACTQCCATAG TTCACATACA GCTAATTCXTT 1140 

GQAAOTTCAT CTAAATOGQA GGAAGTACAG TCCaUUVGCAA ACCATTTATT ATTGAACACA 1200 

TTTGGCATGT ATAGATGTAC TATTCAGCTT CAGAOTTACA GGCAAGAAGT GGACAGAACT 1260 
TGT6CAAATT GTCAGAOTTC TAGTCCCTGA 

Seq ID KOt 544 Protein sequence 
Protein AcceoBion 6: XP_007652.1 

1 11 21 31 41 51 

I I I I I 1 

MAGSGAWKRL KSMLRXDDAP LPLNDTSAFO PSDEAGDBGIi SRFinCLRWV ADDGSEAPER 60 

PVMGAHPTLQ ADDDSIiLDQD LPLTNSQLSL KVDSC3JNCSK QREILKQRKV KARLTIAAVL 120 

YLLFMIGELV GGYIANSLAI MTDALHMLTD LSAIILTUA LWLSSKSPTK RFTFGFHRLB IBO 

VLSAMISVUi VYILMGFLLY EAVQRTIHMN YEINGDIMLI TAAVGVAVNV IMGFLU^SG 240 

HRHSBSBSLP 8N8PTRGS6C ERNHGQDSLA VRAAFVHALG DIiVQSVGVt.1 AAYIIRFKPE 300 

yXIADPICTY VFSLLVAFTT FRIIWDTWX ILBGVPSHLN VDYIKEALMK lEDVYSVEDL 360 

HXWSLT8GK8 TAIVHIQLXP 6SSSKWEEVQ SXANHLLLNT FGHYRCnQL QSYRQBVDRT 420 
CAKCQSS9P 

Seq ID NOi 545 PMA sequence 
Nucleic Acid Accession «: AB037765.1 
Coding sequence i 1..2478 

1 11 21 31 41 51 

I I I I i i 

ATGTTTTCOG GCTTCAATGT CTTTAGAGTT 6GGATCTCTT TTQTCATAAT GTGCATTTTT 60 

TACATGCCAA CaGTAAACTC TTTACCAGAA CTQAOTCCTC AGAAATATTT TAGTACATTG 120 

CAACCAGGAA AAGCCTCTTT AGCTTATTTT TGTCAAGCTG ATTCCCCAAQ AACATCTQTA 180 

TTTCTTGAAO AACTGAATGA GGCTGTTAGA CCTCTQCAGG ACTATGQAAT TTCAGTTGCX; 240 

AAGOTTAATT GTGTCAAAGA AGAAATATCA AOATACTOTG GAAAAOAAAA GOATTTGATG 300 

AAAGCATATT TATTCAA6GG CAACATATTG CTCAGAOAAT TCCCTACTGA CACCTT6TTT 360 

OATGTGAATG CCATTGTCOC CCATGTTCTC TTTQCTCTTC TTTTTAOTGA AGTOAAATAT 420 

ATTACCAACC TGGAAQACCT TCAGAACATA GAAAATGCTC TGAAAGGAAA AGCAAATATT 480 

ATATTCTCAT ATGTAAGAGC CATTGGAATA CCAGAGCACA GAGCAGTCAT GGAAGCGGCT 540 

TTTGTGTAT6 GGACTACATA OCAATTTQTC TTAACCAC3U3 AAATTGCOCT TTTGGAAAGT 600 

ATTGGCTCTO AGQATQTGGA ATAT6CACAT CTGTACTTTT TTCATTOTAA AC1A0TCTTQ 660 

GACTTGACCC AGCAATGTAG AAGAACACTA ATGOAACAQC CATTGACTAC ACTGAACATT 720 

CAOCTGTTTA TTAAGACAAT GAAAGCACCT CTGTTGACTG AAGTTGCTGA AGATCCTCAA 780 

CAAGTTTCAA CTGTCCATCT CCAACTGGGC TTACCACTGO TTTTTATTGT TAGCCAACAG 040 

GCTACTTATG AAGCIGATAG AAGAACIGCA GAATGG6TT0 CTTGGGOTCT TCTG6GAAAA 900 

GCAGGAGTTC TACTCTTGTT AA6GGACTCT TTGGAASTOA ACATTCCTCA AGATGCTAAT 960 

GTGGTCTTCA AAAGA6CAQA AQAGGGAGTT CCAGTGGAAT TTTTGQTATT ACATGATGTT 1020 

GATTTAATAA TATCTCATGT GGAAAATAAT ATGCACATTG AGGAAATACA AQAAGATGAA 1080 

GACAATGACA T6GAA60TCC AGATATAGAT 6TTCAGGATG ATGAAGTGGC AGAAACTGTT 1140 

TTCAGAGATA OGAAOAGAAA ATTAGCTTTG GAACTTACAG TGGAACTAAC AGAAGAAACA 1200 

TTTAATGCAA CAGTQATGGC TTCTGACAaC ATACTACTCT TCTATQCTGO TTQGCAAGCA 1260 

GTATCCATGG CATTTTTGCA ATCCTATATT GATGTGGCAO TTAAACTGAA AG6CACATCT 1320 

ACTATGCTTC TTACTAGAAT AAACTOTGCA GATTGGTCTG ATQTATGTAC TAAGCAAAAT 1380 

GTTACTGAAT TTCCTATCAT AAA6ATGTAC AAGAAA6GGQ AGAAGOCAGT ATCTTATGCT 1440 

OGAATGTTAO GAAGGGAAGA TCTCCTAAAA TTTATCCAGC TCAACAGOAT TTCATATOCA 1500 

GTGAATATAA CATOGATCCA AGAAQCAGAA GAATATTTAA GTGGGQAATT ATATAAAGAC 1560 

CTCATCrrGT ATTCTAGTGT GTCAGTATTQ GGACTATTTA GTCCAACCAT GAAAACAGCA 1620 

AAAGAAGATT TTAGTGAAGC AGGAAACTAC CTAAAAGQAT ATGTTATCAC TGGAATTTAT 1680 

TCTGAAGAAG ATGTTTT6CT ACTGTCAACC AAATATGCTG CAAGTCTTCX: AGCCCTGCTO 1740 

CTTGCCAGAC ACACAOAAOG GAAAATAQAG AGCATCCCAC TAGCTAGCAC ACATGCACAA 1800 

GACATAGTTC AAATAATAAC AGAT6CACTA CTGGAAATGT TTCCGGAAAT CACTGTGGAA 1860 

AATCTTCCCA GTTATTTCAG ACTTCAGAAA CCATTATTGA TTTTGTTCAG TOATGGCACT 1920 

GTAAATCCTC AGTATAAAAA AGCAATATTG ACACTGGTAA A6CAGAAATA CTTGGATTCA 1980 

TTTACTCCAT GCTGQTTAAA TCTAAAGAAT ACTCCAGTGG 6GAGAGGAAT CTTQAGGGCA 2040 

TATTTTGATC CTCTGOCTOC CCTTCCTCTT CTTGTTTTaO TGAATCT6CA TTCAGGTGGC 2100 

CAAGTATTTO CATTTCCTTC AGACCAG6CT ATAATTGAAO AAAACCITOT ATTGTGGCT6 2160 

AAOAAATTAO AAGCAGGACT AGAAAATCAT ATCACAATTT TACCTGCTCA AGAATGGAAA 2220 

CCTCCTCTTC CAGCTTATGA TTTTCTAAGT ATQATAQATG CCGCAACATC TCAACGTGGC 2280 

ACTAOGAAAG TTCCCAAGTG TATGAAAGAA ACAGAT6TGC AGGAGAATGA TAAGGAAGAA 2340 

GATOAAGATA AATOGGCAGT OGAAAAGAA COOATTGAAA CTCTQAQAAT AAAGCATTGG 2400 
AATAGAAGTA ATTGOTTTAA AGAAGCAGAA AAATCATTTA GAGGTGATAA AGAGTTAGGA 2460 
TGCTCAAAAG TGAACTAA 

Seq ID NO: 546 Protein sequence 
Protein Accession fft BAA925Q2.1 

1 11 21 31 41 51 

I I 1 I I I 
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MPSGFNVPRV GISPVIMCIP YMPTVNSLPB LSPQKYFSTL QPGKASLAYP OQAOSPRTSV 60 

FLEBXjNEAVR PIjQDYGISVA KVNCVKBBIS RYCGKEKDLM KAYLPKGNIL LREPPTDTLP 120 

DVNAIVARVL FALLPSEVKY ITNLBDLQNX ENALKOKANX XFSYVRAIGI PEHRAVMBAA IBO 

FVYGTTYQFV I,TTEIALLES IGSEDVEYAB LYPFHCXLVL DLTQQOtRTL HBQPLTTUII 240 

HLFIXIMKAP LLTEVASSPQ QVSTVHLQIiG LPIiVFIVSQQ ATYBADRSTA EffVAHRLLGK 300 

AGVLLLLRDS LEVKIPQDAM WFXRAEBGV PVEPLVLRDV DLIISRVENN MHIEBIQEDB 360 

DNDKEGPDID VQDDEVAETV FRDRKRKLPL ELTVBLTEET FMATVHASDS XVIiFYAGWQA 420 

VSMAFIiQSyX DVAVKLKGTS TMLLTRIKCA DWSDVCTKQN VTEFPIIKMY RRGBNPVSYA 4 BO 

^ GMLGTEDUJC PXQLNRXSYP VNXTSXQBAB BYLSGELYKD LILYSSVSVL GLFSPTMKTA 540 

iU KEDPSBAGNY LKOYVITGIY SEEDVLLLST KYAASLPALL LARHTfiSKXB SXPLASTKAQ 600 

DIVQIITDAL LEMPPEITVE NLPSYPRLQK PLLIliPSDGT VNPQYKKAlIi TLVKQKYLDS 660 

FTPCWUJLKN TPVGRGILRA YPDPLPPLPL LVLVNLHSGG QVPAFPSDQA IIBENLVLWL 720 

KKLEAGLBKH XTXLPAQEWK PPIiPAYDFLS NXDAA7SQR0 TRKVPKa4XE TDVQENDXBQ 780 
BEDKSAVRKB PXBTLRIKRW NRSNHFXEAB KSFRRDKBL6 C8KVN 

Seq ID KO: 547 DNA sequence 
Nucleic Acid Accession #t MM_033102.i 
Coding seqpience; 1..1662 

20 1 11 21 31 41 51 

I t i I I I 

ATGOTOMA GQCTGTGGGT GAGCCGCXTTG CTGOGGCACC GGAAAGCCCA GCTCTTGCTG 60 

CTCAACCTGC TAACCTTTGG CCTGGAGGTG TGTTTGGC06 CAGGCATCAC CTATGTGCOG 120 

^- CXTTCTGCTOC TGOMOrGGG GGTAQAGGAO AAiGTTCATOA OCATOGTrQCT OGQCATTGQT 180 

Z.) CCA6TGCTGG 60CTGGTCTQ T6TCCC36CTC CTAG6CTCA0 OCAGTGACCA CTG605TGGA 240 

06CTATGGCC GCOGCOBOCC CTTCATCTGG GCACTGTCCT TGOGCATCCT GCTGAGCCTC 300 

TTTCrCATCC CAAGGGCOGQ CTGGCTAGCA GGGCTGCTOT GCCCGGATCC CAGGCCCCTG 360 

QAGCTGGCAC TGCTCATCCT GG6GGTGGGG CTGCTGGACT TCIGTGGCCA GGTGTGCTTC 420 

ACrCCACTGG AGGCCCTGCT CTCTGACCTC TTCOGGGACC GG6ACCACT6 TCX30CAGGCC 480 

TACTCIGTCT ATGCCTTCAT GATCAGTCTT GGGGGCTGCC TG6GCTACCT CCTGCCTGCC 540 

ATTQACTGOa ACACCAOTGC CCTGGCXXXC TACCTGGGCA CCCAGQAGGA GTGCCTCTTT 600 

GGCCTGCTCA CCCTCATCTT CCTCACCTGC GTAGCAGCCA CACTGCTGGT GQCTGAQGAG 660 

O agCGC TGG GCCCCAC06A GGCA6CAGAA G66CTGTCG0 CCCCCTOCTT GTOGCCXXZAC 720 

TGCTGTCCAT GC06GGCCO6 CTTOGCTTTC GOGAACCTGG G0G00CT6CT TCCCCGGCCG 760 

CACCAOCTGT GCTGCOGCAT GCCXX3GCACC CTGGGCCXSGC TCTT0STG6C TGAGCTGTGC 840 

AOCTOQATGG CACTCATGAC CTTCACGCT6 TTTTACA06G ATTTOGTGGG OGAtSGGGCTG 900 

TACCAG6GCG TGCCCA6AGC TGAGCOGG6C ACXX3A6GCCC GGAGACACTA TGATGAAGGC 960 

GTTOQQATGG GCAGCCTGGQ GCTQTTOCTG CAGTGG6CCA TCTCGCIGOT CTTCTCTCTG 1020 

GTCATGQAOC GGCTGGT6CA GOGATTOSGC ACTOQAGCAG TCTATTTGGC CAGTGTGGCA 1080 

6CTTTOCCTQ TGGOCGCOGG TCCX:ACAT6C CTGTCCC3^ GTGTG6G06T GGTGACAGCT 1140 

TCAGCCX3CCC TCACCGGGTT CACCTTCTCA GCCCTGCAGA TCCTGCXTTA CACACTGGCC 1200 

TCCCTCTACC ACCGGGAGAA GCAGGTGTTC CTGCXX3UVAT ACOGAGQGGA CACTGGAGGT 1260 

GCTAGCAGTQ AGGACAGCXTT QATGACCAGC TTCCTGCCAG GCCCTAAGCX: TG6AGCTCCC 1320 

TTOCCTAATQ GACAOGTGOG T6CTGGA06C AGTGGCCTX3C TCCCACCTCC ACXTGOGCTC 1360 

TGCXSGGGCXTT CT6CCTGTGA TQTCTCaJTA CGTOTGGTQO TGGGTGAGCC CACCX3AGQCC 1440 

AGGGTGGTTC CGGGCCGGGG CATCTGCCTG GACCTCGCCA TCCTGQATAQ TGCCTTCCTG 1500 

CTGTCCCAGG TGGCXXXIATC CCTGTTTATG GGCTCCATTG TCCAGCTCAG CCAGTCTGTC 1560 

ACTGCCTATA TGGTGTCTOC 06CAGGCCT0 GGTCTGOTOG OCATTTACTT TGCTACACAG 1620 
GTASTATTTO ACAAGA60GA CTTGGCOUU^ TACTCAGOST GA 
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8eq ID NOi 548 Protein sequence 
Protein Accession #t NP 149093.1 



- - 1 11 21 31 41 51 

55 I I I I 1 I 

MVQRLHVSRL LRHRKAQLLL VmiLTFGLEV CLAAOITYVP PLLLEVGVEB KFMTMVLGIG 60 

PVZiGLVCVPL LGSASDHWRQ RYGRRRPFIW ALSLOILbSL FLIPRAGWLA GLLCPDPRFL 120 

EIiALLXLGVG LU>P06QVCF TPLEALLSDL PRDPDHCRQA Y8VYAFMI8L GGCLGYLLPA 180 

XDHDTSALAP YLGIQEBCLP GLLTI«XFLTC VAATLLVAEE AALGPTEPAB GLSAPSliSPH 240 

CCPCRARLAF RNL6ALLPRL HQIiCCRMPRT LRRLFVAELC SWMAUfTFTL FYTDFVGBGIi 300 

YQGVPRAEPO TBARRHYDEG VRM0SL6LFL QCAI8LVFSL V>Q3RLVQRFG TRAVYLASVA 360 

APPVAAGATC LSHSVAWTA SAALTGFTPS ALQILPYTLA SLYHRBKQVP LPKYRGDTGG 420 

ASSEDSLMTS FLPGPKPOAP FPHGHVGASG SGLLPPPPAL OSASAGSVSV KWVGEPTEA 480 

RWFGRQICL DIAXLDSAFZ. LSQVAPSIiFM GSZVQLSQSV TAXMVSAAGI. GLVAIYFATQ 540 
WFDKSDLAK YSA 



Seq ID NO: 549 TOA sequence 

Nucleic Acid Accession fft Eos sequence 

Coding sequence : 1 . . 13 89 

1 11 21 31 41 51 

I I I I i i 

ATGGGC7ACC AOAGGCAQGA GCCTGTCATC CCGCCGCAGA GA6ATTTAGA TGACAGAQAA 60 

AC0CTT6TTT CTGAACATGA GTATAAAGAG AAAACCTGTC AGTCTGCTGC TCTTTTTAAT 120 

GTTOTCAACT COATTATAGG ATCTGGTATA ATAGGATTOC CTTATTCAAT GAAGCAAGCT 180 

GQG IT TC CTT TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACX3GA CTTri - CXXTT 240 

GTTTTATTGA TAAAAGGAGG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTCAAT 300 

AAAACTTTCG GCTTTCCAGO GTATCTGCTC CTCTCTGTTC TTCA6TTTTT GTATCCTTTT 360 

ATAGCRATGA TAAGTTACAA TATAATA6CT OQAGATACTT TGAGCAAAGT TTTTCAAAQA 420 

ATCCC3U3GAG TTGATCXTTGA AAACX3TQTTT ATTGGTCGCC ACTTCATTAT TGOACTTTCC 480 

ACAGTTACCT TTACTCTGCC TTTATCCTTG TACCQAAATA TAGCAAAGCT TGQAAAGGTC 540 

TCCCTCATCT CTACAGQTTT AACAACTCTO ATTCTTGQAA TTOTAATGGC AAOGGCAATT 600 

TCACTGGGTC CACACATACC AAAAACAOAA GAOQ CTTGG Q TATTTGCAAA GCCCAATGCC 660 

ATTCAAGOSO TOGGGGTTAT GTCTTTTQCA TTTATTTGCC AOCATAACIC CnCTTAOTT 720 
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TACAGTTCTC 
GTGATTTCTG 
TTCAOCCAAG 
AGATTTTGTT 
GAGQTAATTG 
ACAGTGATGO 
GTTCTAGAAC 
TQTTATCXXjA 
atgcttccca 
CAAGACTGCA 
AATACCTCAG 
TTTCAATQA 



TAQAAGAACC 
TATTTATCTG 
GGQACTTATT 
ATGGT6TCAC 
CCAATGTGTT 
TCATCACTQT 
TOXATGGTGT 
AACTGTCTGA 
TTGOTQCTGT 
CCCATGGGCA 
AGTCTCATGT 



CACAGTA6CT 
TATATTCTTT 
TGAAAATTAC 
TGTCATTTTG 
TTTTGGTOOG 
AGCCAOGCTT 
GCTCTGTGCA 
AOAACCAAGG 
GGTOATGOTT 
GQAAATGTTC 
TCAGCAOAGA 



AAGTGGTCCC 
6CTACATGTQ 
TGCAOAAATO 
ACATACCCTA 
AATCTTTCAT 
GTGTCATTGC 
ACTCCCCTCA 
ACACACTCOG 
TTTGGATTOG 
TACTGCTTTC 

ACACAAcrrr 



OCCTTATCCA 
GATACTTGAC 
ATGACCTGGT 
TGGAATGCTT 
OGGTTTTCCA 
TGATTQATTG 
TTTTTATCAT 
ATAAGATTAT 
TCATGGCTAT 
CTGACAATTT 
CTACTTTAAA 



TATGTCCATC 
ATTTACTOQC 
AACATTTGGA 
TGTQACAAGA 
CATTQTTGTA 
CCTOGGGATA 
TCCATCAGCC 
GTCTTGTXyrC 
TACAAATACT 
CTCTCTCACA 
TATTAGTATC 



780 
B40 
900 

960 
1020 
lOBO 
X140 
1200 
1260 
1320 
1380 



Seq ZD NOt 550 Protein Bequenco 
Probein Accession #t Bos sequence 



MGyQRQEPVZ 
OFPLGILLLF 
IAMI8YKI2A 
SLISTGLTTL 
YSSLEEPTVA 
RFCYGVTVIL 
VLELNGVLCA 
QDCTHGQEMF 



11 
I 

PPQRDLDDRB 
nVSYVTDFSL 
GDTLSKVFQR 
ILQIVMARAI 
KWSRLIBMSI 
TYPHBCPVTR 
TPLIFZXPSA 
YCPPDNPSLT 



21 
I 

TLVSEHSYKB 
VLLIKGGALS 
IPGVDPENVP 
SIX3PHIPKTB 
VISVFICIFP 
EVIA29VFFGG 
CYUCLSEEFR 
NTSESHVQQT 



31 
I 

KTOQSAAIfFN 
OTDTyQSLVH 
lORHFIIGLS 
DAWVFAKPMA 
ATCXSYLTFTG 
NLSSVFHIW 
THSDKIMSCV 
TQIiSn^ISI 



41 

I 

WNSZIGSGZ 
KTFGFPGYUi 
TVTPTLPLSL 
IQAVGVMSPA 
FTQGDLPBHY 
TVMVITVATL 
MLPIGAWMV 
PQ 



51 
I 

ZGLPYSKKQA 
LSVLQFLYPF 
YRNIAKLGKV 
PICHHNSFLV 
CRNDDLVTFG 
VSUiZDCLGI 
FGFVMAZTNT 



Seq ZD HO: 551 DNA sequence 

Nucleic Acid Accession fti Eos sequence 

Coding sequence t 1..1284 



ATGOGCTACC 
AAGCAA6CT0 
TTTTCCCTTG 
TTGGTCAATA 
TATCCTTTTA 
TTTCAAAOAA 
QGACTTTCCA 
GOAAAGOTCT 
AGG6CAATTT 
CGCAATGCXA 
TTCTTAGTTT 
ATGTCCATOO 
TTTACTGGCT 
ACATTTGGAA 
OTGACAAGAa 
ATTOTTQTAA 
CTOQGQATAO 
CCATCAGCCT 
TCTTGTGTCA 
ACAAATACTC 
TCZCTCACAA 
ATTAGTATCT 



11 
I 

AaAOQCAGGA 
GOTTTCCTTT 
TTTTATTQAT 
AAACTTTCGQ 
TAOCAATQAT 
TCCCAGGA6T 
GAOTTACCTT 
CCCTCATCTC 
CACTGGGTCC 
TTCAAGCGOT 
AGAGTTCTCr 
TQATTTCTOT 
TCACCCAA6G 
GATTTTGTTA 
AGQTAATT6C 
GAOTGATGGT 
TTCTAGAACT 
GTTATCTQAA 
TGCTTCCCAT 

aagactgcac 
atacctcaga 
ttcaactcga 



21 
I 

OCCTQTCATC 
GGQAATATTO 
AAAAGQAGGG 
CTTTCCAGGG 
AAOTTACAAT 
TGATGCTGAA 
TACTCTGCCT 
TACAGQTTTA 
ACACATACCA 
CXXX9QTTATQ 
AGAAGA ACCC 
ATTTATCT6T 
GGACTTATTT 
TGGTGTCACT 
CAATGTGTTT 
C31TCACT6TA 
CAATGGTQTG 
ACTGTCTQAA 
Tt^TOCTGTO 
CCATG6GCA6 
OTCTCATGTT 
OTAA 



31 



41 



CCOCCQCAQA 
CTTTTATTCT 
GCCCTCTCTG 
TATCTGCTCC 
ATAATAGCT6 
AAOQTGTTTA 
TTATCCTTGT 
ACAACTCTGA 
AAAACAGAAG 
TCTTTTQCAT 
ACAGTAGCTA 
ATATTCTTTQ 
GAAAATTACT 
GTCATTTTGA 
TTTQ9TGGGA 
GCCAOGCTTO 
CTCTQTQGUV 
GAACCAAGGA 
GTGATGGTTT 
GAAATGTTCT 
CAGCAGACAA 



QA GGATT GCC 
G66TTTCATA 
GAACAGATAC 
TCTCTGTTCT 
GA6ATACTTT 
TIGGTGQCCA 
ACOGAAATAT 
TTCTTGGAAT 
ACGCTTGGGT 
TTATTTGCCA 
AOrGGTCCOO 
CTAGATGTGQ 
GCAGAAATX3A 
CATACCCTAT 
ATCTTTCATC 
T6TCATTGCT 
CTCCCCTCAT 
CACACTCOQA 
TTGGATTOGT 
ACTGCTTTCC 
CAGAACTTTC 



51 
1 

TTATTCAATG 
TGTTACAGAC 
CTACCAGTCT 
TCAGTTTTTQ 
GAGCAAAGTT 
CTTGATTATT 
AOCAAAGCTT 
TGTAATG6CA 
ATTTGCAAAG 
CCATAACTCC 
CCTTATCCAT 
ATACTTGACA 
TGACCTGGTA 
GGAATGCTTT 
GGTTTTCCAC 
GAXTGATTGC 
TTTTATCATT 
TAAGATTATO 
CATGGCTATT 
TGACAATTTC 
7ACTTTAAAT 



Seq ZD NO: 552 Protein sequence 
Protein Accession Bos sequence 



1 
I 

MGYQRQEPVZ 
LVNKTFGFPG 
GXiSTVTFTLP 
PNAZQAVGVM 
FTGPTQCTLP 
ZWTVKVITV 
8CVMLPZGAV 
X8ZFQLB 



11 
I 

PPQRGLPYSM 
YLLLSVLQFL 
LSLYRNZAKZi 
SFAPZCKHNS 
ENYCRNDDLV 
ATLVSLLIDC 
VMVFQFVMAZ 



21 

1 

KQAGFPLGZL 
YPPZAMZSYN 
GKVSLISTGL 
FIiVYSSLEEP 
TPGRPCYGVT 
LQIVLBLNGV 
TNTQDCTHGQ 



31 

I 

LLFWVSYVTD 
ZZAGDTLSKV 
TTLZLGZVMA 
TVAKHSRI.ZB 
VZLTYPMECP 
LaVTPLIPIZ 
EMFYCFPDNF 



41 

i 

FSLVLLZKGG 
FQRZPG\a)PE 
RAZSLGFHZP 
MSZVZSVFZC 
VTREVIANVF 
PSACYLKLSE 
SLTNTSBSHV 



Seq ZD NO: 553 DNA sequence 
Nucleic Acid Accession #i Bo{ 
Coding sequence i i . . 1203 



ATGGQCTACC 
AAAGGAQaGO 
TTTCCAGGGT 
AGTTACAATA 
GATCCTQAAA 
ACTCTGCCTT 
ACAOGTTTAA 
CACATACCAA 



11 

I 

AGAGGCAGGA 

ATCTGCTCCr 
TAATAGCTOG 

ACGTGTTTAT 
TATCCTTOTA 
CAACTCTGAT 



21 

1 

GCCTGTCATC 
AACAOATACC 
CTCTQTTCTT 
AOATACTTTO 
TGGTCGCCAC 
CCGAAATATA 
TCTTGGAATT 
C6CTTGGGTA 



31 

I 

CCOCOGCAGT 
TACCAOTCTT 
CAOTTTTTOT 
AGCAAAGTTT 
TTCATTATTG 
GCAAAGCTTO 
GTAAT6GCAA 
TTTGCAAAGC 



41 

1 

TTTCCCTTGT 
TGGTCAATAA 
ATCCTTTTAT 
TTCAAASAAT 
GACTTTCCAC 
GAAAGGTCTC 
GGGCAATTTC 
CCAATGCCAT 



60 
120 
180 
240 
300 
360 
420 



60 
120 

leo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



51 
I 

ALSGTDTYQ8 
NVFZGRHFZZ 
XTEDAHVFAX 
XFFATOGYLT 
FGGNIiSSVFH 
EPRTHSDKIH 
QQTTQLSTIiN 



60 
120 
180 
240 
300 
360 
420 



51 

I 

TTTATTQATA 
AACTTTOGGC 
AGCAATGATA 
CCCAGGAGTT 
AGTTACCTTT 
CCTCATCTCr 
ACTGGGTCCA 
TCAAOCGGTC 



60 
120 
180 
240 
300 
360 
420 
480 
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GGGGTTATGT CTTTTGCATT TATTTOCCAC CATAACTCCT TCTTAGTTTA CAGTTCTCTA 540 

GAAGAACCCA CAGTAGCTAA GTGGTCCCGC CTTATCCATA TGTCCATCGT QATTTCTQTA 600 

TTTATCTGTA TATTCTTTGC TACATGTQGA TACTTOACAT TTACTGGCTT CACCCAAGGQ 660 

GACTTAT TTG AAAATTACTG CA6AAATGAT GACCTGCTAA CATTTGGAAG ATTTTGTTAT 720 

GQTGTCACTO TCATITTQAC ATA0UCTAT6 GAATGCTTTO TGACAAQAGA 6GTAATT6CC 780 

AATGTGTTTT TT G GTQQQAA TCTTTCATCG GTTTTCCACA TTGTT6TAAC AGTGATGGTC 840 

ATCACTQTAG CCACGCTTCJT OTCATTGCTG ATTOATTOCC TCGGGATAGT TCTAQAACTC 900 

AAT6GTGTGC TCTOTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCXQ TTATCTQAAA 960 
CTGTCTGAAO AACCAAGGAC ACACTCCQAT AAGATTATGT CTTOTCSTCAT GCTTCCCATT 1020 
GGTOCTQTO Q TOAIGGTTTT TGGATTOSTC ATGGCTATTA CAAATACTCA AGACTGCACC 1080 
CATGG6CAQG AAAT6TTCTA ClUCm ' CCl ' GACAATTTCT CTCTCAOVAA TAGCTCAGAO 1140 
TCTCATGTTC AGCA6ACAAC ACAACTTTCT ACTTTAAATA TTAOTATCTT TGAACTGQAO 1200 
TAA 

15 Seq ID NOx 554 Protein sequence 
Protein Accession #: Bos sequence 

1 11 21 31 41 51 

ZU MOYQKQBFVI PPQFSIiVLLX KOQALSGTDT YQSLVNKTFG FPGYLLLSVL QPLYPFZAMI 60 

8YNIIAGDTL 8ICVFQRIP6V DPENVFIGKH PIIGLSTVTP TLPLSLYRNI AKLGKV8LIS 120 

TOLTTLILQI VMARAI8LGP HIPKTEDAWV PAKPNAIQAV GVMSPAPICH HNSPLVYSSL 180 

EEPTVAXWSR LIHMSIVISV FICIFFATCX3 YLTFTGFTQG DLPENYCRMD DLVTFGRFCY 240 

OVTVILTYPM BCFVTREVIA MVPFGONLSS VFHXWTVMV ITVATLV8LL IDCU3ZVLEL 300 

25 NGVIiCATPLI FIZPSACYLK LSEBFRTH6D KZMSCVMLPI OAWMVFOFV MAITNTQDCT 360 
HGQEMFYCFP DNPSL1MT8B SHVQQTTQLS TU7ZSIFQLB 

Seq ID NOi 555 DN A sequence 
Nucleic Acid Accession fts Eos sequence 
30 Coding sequence: 1..1140 

1 11 21 31 41 51 

I I I 1 I t 

ATGQQCTACC AGAGOCAGGA OCCTGTCATC COOCCGCAGG TCAATAAAAC TTTGOQCTTT 60 

35 CCAGGGTATC TGCTOCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATOATAAGT 120 

TACAATATAA TAGCTGGAGA 7ACTTTGAGG AAA6TTTTTC AAAGAATCCC AGGAOTTGAT 180 

CCTGAAAACX3 TGTTTATTGG TCGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 240 

CrOCCTTTAT CCTTGTACCX3 AAATATAGCA AA0CTT6GAA AGGTCTCCCT CATCTCTACA 300 

GOTTTAACAA CTCTQATTCT TGGAATTGTA ATGGCAAG60 CAATTTCACT GGGTCSCACAC 360 

40 ATAGCAAAAA CAOAAGAOGC TTOGGTATrT GCAAAOCXXA ATGCCATTCA AGCXIGTOQQO 420 

GTTATGTCTT TTGCATTTAT TTOCCACCAT AACTOCTTCT TAGTTTACAO TTCTCTAGAA 480 

GAACCCACAG TAGCTAAOTG GTCCOGCCTT ATCCATATGT CCATOGTGAT TTCTGTATTT 540 

ATCTGTATAT TCTTTGCTAC ATGTG6ATAC TT6ACATTTA CTOaCTTCAC CXAAGGGGAC 600 

TTATTTGAAA ATTACT6CAG AAATGATGAC CTGGTAACAT TTGGAAGATT TTQTTATOGT 660 

45 GTGACTOTCA TTTTGACATA CCCTATG6AA T6CTTTGTGA GAAQAGAQGT AATT6CCAAT 720 

GTGlTrm ' G GTGGGAATCT TTCATCGGTT TTCCACATTG TTGTAACAOT OATGGTCATC 780 

ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTGCCTCG GQATAGTTCT AGAACTCAAT 840 

GOTQTOCTCT QTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900 

. - TCTGAAGAAC CAAGGACACA CTCCSGATAAO ATTATOTCTT GIGTCAT6CT TOOCATTGGT 960 

50 GCTGTGGTQA TGGTTTTTGG ATTOGTCATO OCTATTAGAA ATACTCAAGA CTGCACCCAT 1020 

GG6CAGGAAA TGTTCTACTG CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080 
aUTGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA ACTOGAOTAA 

Seq ID NOi 556 Protein sequence 
55 Protein Accession 9t Eos sequence 

1 11 21 31 41 51 

I I I I I I 

McnrOROBPVz ppovmcrpoF POTLLLSVLQ FLYPFIAMZS TOIIAGDTIA KVFQRZPOVD 60 

PENVPI6RHF IIGLSTVTFT ItPLSLYRNIA KLGKVSLIST GLTTLILGIV MARAXSLGPR 120 

IPKTEDAWVP AKPNAIQAVQ VKSFAFICHH NSPLVY8SLB EPTVAKHSRL IHMSIVISVF 180 

ICIPFATCGY LTPTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN 240 

VPFGGNLSSV FHIWTVMVI TVATLVSLLI DCLGIVLELN GVLCATPLIF IIPSACYLKIi 300 

SEEPRTHSDK XMSCVMLPI6 AWHVFGFVM AITNTQDCTH GQEMFYCPPD NFSLTNTSBS 360 
HVQQTTQLST UnSIFQLB 

Seq ID NO; 557 DNA sequence 
Nucleic Acid Accession #t XM_0S7168.1 
Coding sequence i 769.. 4269 " 

1 11 21 31 41 51 

I I t I 1 I 

ATGG6ATT6C CTCTCCCTCT CACTCTGGGC TTCTGTCCCA CTCTTATCTT AGVGTCAOTC 60 

CTCOOCCAAG TCTGTQTCCC TCTCTCTCCC CTAAATCTCT GGCCCCTCCT TTCTQAGTTC 120 

CTGCCCTTCC CCCAATTCTT TGGTTTTTQC ATOCCCCTCT GCCCCTTOCC TCAGTCAAGT 180 

CTCCCCTGTG TOTCTCTCTC CCCCOGQCCC GGACCTCTGC ACCCCCCAGG TCX5CT0TCCC 240 

TCTGTCCCCT TATCGCGQCC TGGGACCOGC CCTCTCCCCG CCTCCOGCTT TGQOOTCTCC 300 

AAGACTCCCC 6CCCCCCAGA GCTCGCCCCO CGCCAGGCTA GGCTG6AAAG TGGAGGATCC 360 

Wm U. 'l'CJ OGGCOQOTCT G6AAGCAGAG COGGOSGAGG GA6C6G0GG0 GCO CT G G G CT 420 

GGAGGAGGTT GOGGOQGCOQ OGGCAGCATG GTGGTGCOGG AGAAGGAOGA OAQCTGGATC 480 

CXXAAGATCT TCAAGAAGAA GACCTGCACG ACX3TTCATAG TTGACTCCAC AOATCCGQGQ 540 

GATGGGGTCT CCCTCTATTQ CCCAG8CTQT TCTCAAACTC CTGGGCTCAA GCAGTCCTTC 600 

tGCCTTQACC TCCCAAAGTX3 CTTGGATTGT 6CCCAGC0GT CCTTGAAGTT TTGCTCA6AA 660 

GAGCAAACTT TCTGGGAAGT AGCTGCAGGT GTTQQAABTA OCT6CAGGGG AACTAGGGGA 720 
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TTCAAGGATG OAGCTQAAAT GOOTGAAOGG ACAAAGTOGG TAAACTGAAT GGAOGATGCC 780 
TTCGGGGCAG CCGTGGTGAC OGTGTGGGAC AGCGATGCAC ACACCACOGA OAAGCCCACC 840 
GATGCCTAOQ GAOAGCrGGA CTTCACGOGG GCCGO(XGCA AGCAGAGCAA TTTCCTCOGG 900 
CTCTCTGACC GAAOGGATCC AGCTGCAGTT TATAGTCTGG TCAGA06CAC ATGGG6CTTC 960 
CGT6CCC0QA AOCTGGTGGT GTCAGTQCTG G6GGGAT06G GGGGOOCGST CCTCCAGACC 1020 
TGGCTGCAGG ACCTGCTGOG TCGTGGGCTO GTGCGGGCTG CXTAGAGCAC AGQAGCCTOG 1080 
ATTGTCACTG GGGGTCTCCA CAOGGGCATC GQCCGOCATG TTGOTGTGGC TGTACGGGAC 1140 
CATCAGATGG CCAOCACTOa QOGCACCAAG G TGQTGGCCft T G GGTGTGGC CCCCTGGGOT 1200 
GTOGTCOOGA ATAGAGACAC OCTCATCAAC CCCAAGQGCT OGTTCOCTGC GAGGTACX3GG 1260 
TQGOGCGGTO AGCCXSGAGOA 0GG6GTGCAG TTTCCCCTG6 ACTACAACTA CTOGGCCTTC 1320 
TTCCTGGTGG ACGAOGGCAC ACAOGGCTGC CTGGGGGGOG AGAACOGCTT COGCTTGOGC 1380 
CTGGAGTCCT ACATCTCACA GCAGAAGAOG GGCGTGGGAG GGACTGQAAT TQACATCCCT 1440 
6TCCTGCT0C TCCTGATTGA TGK3TQATGA0 AAGATGTTGA OGOQAATAGA GAACGCCACC IS 00 
CAOGCTCAGC TCGCATQTCT CCTOQTGQGT G6CTCAGGGG GAGCTGGGQA CTGOCTGGCQ 1560 
GAGACCCTGG AAOACACTCT OGOCOCAGGO AGTGGGGOAG CCAGGCAAQG OQAAOCCOQA 1620 
GATOGAATCfV GGCGTTTCTT TCCCAAA6GG GACCTTGAGG TCCTGCM3GC CCA(X3T6GAG 1680 
AGGATTATGA CCCaGAAGGA GCTCCTQACA GTCTATTCTT CTGAGGATGG QTCTGAGOAA 1740 
TTCGAGACCA TAGTTTTGAA GGCCCTTGTG AAG6CCTGTG GGAGCTCGGA Q6CCTCA0CC 1800 
TACCTGGATG AGCT G Q Q TTT GGCTQTQ6CT TGQAACG60Q TGQACATTGC GCAOAGTCAA 1860 
CTCTTTOGGO GGGACATCCA ATGGOOGTOC TTGCATCTOG AAG C TTCCCT CAtOGAOGCC 1920 
CTGCTGAATG ACGGGCCTGA GTTOGTGCGC TTGCTCATTT CCCA06GCCT CAGCCTGGGC 1980 
CACTTCCTQA CCCt3GATQ0O CCTGGCXXIAA CTCTACAGCG CGGCX3CCCTC CAACT03CTC 2040 
ATC0GCAAC2C TTTTGGACCA GGCGTCCCAC A6CGCAGGCA CCAAAGCCCC A6CGCTAAAA 2100 
66GGQAGCTG CGQAOCTCOQ GGGCCCTGAC GTOQGGCATQ TGCTQAQQAT GCTQCTOGQO 2160 
AAGATGT6CXS 0GC06AG0TA CCCCTC0GG6 GG06CCTGGG ACCCTCACCC AGGCGA6G6C 2220 
TTCGGGGAGA GCATQTATCT GCTCTCGGAC AAGGCCACCT CGCOGCTCTC GCTGGATGCT 2280 
GGCCTCGGGC AGGCCCCCTG GAGCGACCTG CTTCTTTGGG CACTGTTGCT GAACAGGGCA 2340 
CAGAT66CGA TGTACTTCTO G6AGATGGGT TGCAATGCAG TTTCCTCAGC TCTTGGGGCC 2400 
TOi Tltac roC TCOQQQTQAT GQCAOSCCTG GAQCCTQAOQ CrGAGQAGGC AOCACGOAGQ 2460 
AAA6ACCTGG CGTTCAAGTT TGAGGGGATG GG06TTGACC TCTTTG60QA GT6CTATGGC 2520 
AGCAGTGAGG TGAGGOCTOC COSCCTCCTC CTCXXTTCGCT 6CCCX3CTCTG OGGGQATOCC 2580 
ACTTGCCTCC AGCTGGCCAT GCAAGCTGAC GCCOSTGCCT TCTTTGCCCA GGATGGGGTA 2640 
CAOTCrCIGC TGACACAQAA GTGGTX3GGGA GATATOGGCA GCACTACACC CATCTGGGCC 2700 
CTGQTTCTCO CC TTCTTTTq OOCTCCACTC ATCTACAOCC GOCTCATCAC CTTCA6QAAA 2760 
TCAGAAGA6G AGCCCACACG GOAGSAOCTA GAGTTTQAGA T6GATAGTGT CATTAATGGG 2820 
GAAGGGCCTQ TCGGOACGGC GOACCCAOCC OAOAAGAOGC CGCTGGGGGT CCCGCGCCAG 2880 
TCGGGCCX?rC CX3GGTTGCTG CGGGGGCOGC TGCGGGGGGC GCOGGTGCCT ACGCCGCTGG 2940 
TTCCACTTCT GQGGCGCGCC GGTQAGCATC TTCATGQGCA AOntSOTCAa CTACCTGCTO 3000 
TTOCTGCTGC TTTTCT OG OG GGT G CTGCTC OTGGATTTCC AGOOQOOGCC GOOCGGCTCC 3060 
CTGQAGCIQC TOCTCTATTT CTGQGCTTTC AiCGCTGCTGT G06AGGAACT GOGCCAGQGC 3120 
CTGAGC3GQA0 GCGGGQGCAO CCTCX3CCAQC OOGGGCCCCG GGCCTGGCCA TGCCTCACTG 3180 
AGCCAGOSCC TGCGCCTCTA CCTOGCOQAC AQCTGGAACC AGTGCGACCT AGTGGCTCTC 3240 
ACCTGCTTGC TCCTGGGCGT GGGCTGCOQO CTQACCCGOQ GTTTGTACCA CCTGG6C0GC 3300 
ACTGTCCTCT GCATa3ACTT CATGOTTTTC AGG6TG0GGC TGCTTCACAT CTTCAOGGTC 3360 
AACAAACAGC TGGGGOCCAA GATCX3TCATC 0T6A6CAAGA TaATQAAGGA CGTGTTCTTC 3420 
TTCCTCTTCT TCCTCX3GCGT GTGGCTGGTA GCCTATGGCG TGGOCACGGA GGOGCTCCTO 3480 
AGGCCACXX» ACAGTGACTT CCCAAGTATC CTGCGCCGCG TCTTCTACCa TCCCTACCTG 3540 
GAOATCTTOQ GGCAGATTCX: CCAG6AG6AC ATSQAGGTGG CCCTCATGGA GCACAGCAAC 3600 
TGCrCSSTOQO AGCCGGGCTT CTQG6CACAC CCTCCTGaOG GCXAGGOGOG CAGCTGOSTC 3660 
TCGGAOTATG CCAACTQQCT GGTGGTGCTG CTCCTGGTCA TCTTCCT6CT G8TGGCCAAC 3720 
ATCCTGCTGG TCAACTTOCT C31TTGCCATG TTCRGTTACA CATTOGGCAA AGTACAGGGC 3780 
AACAGCX5ATC TCTACTQGAA GGCGCAGCGT TACCSGCCTCA TCOGGOAATT CCACTCTCXaG 3840 
CCOGOGCIGG CCCOGCCCTT TAT06TCKTC TCCXIACTTGC GCCTCCTOCT CA6GCAATTG 3900 
TGCAOGOGAC CC0GGA6OGC CCSU3CCQT0C TGCGGGOOOC TGGAGCATTT CCGGGTTTAC 3960 
CTTTCTAAGG AAGCCGAGCG GAAOCTGCTA ACGTGG6AAT OQGTGCATAA 6GA6AACTTT 4020 
CTOCTGGC3VC QOJCTAGGGA CAAGCGGGAG AGCGACTCOQ AGOSTCTGAA GCX3C3VCX3TCC 4080 
CAGAAGGTGQ ACTTGGCACr OAAACAGCTG GGACACATCC QOOAGTACaA ACAOCGCCTG 4140 
AAAGTGCTGG AGOGGGAGGT CCAGCAGT6T AG0CX3C0TCX: TGGG6TGGGT GGCG6AG6CC 4200 
CTGAOGCXXrT CTOCCTTGCT GCGCCGAGGT OOGCCOOCAC OOOCTOAGCT GCCTGGGTCC 4260 
AAAGACTGAO COCTGCTGGC GGACTTCAAG GAGAAGCCCC CACAGG6GAT TTTGCTCCTA 4320 
GAGTAAGGCT CATCTGGGCC TOGGCCCCOG CACCTGGTGG CCrTGTCCTT GAGGTGAGCC 4380 
CCATGTCCAT CTGGGCCACT GTCAGGACCA OCTTTGGGAG TGTCATCCTT ACAAACCACA 4440 
6CATGCC0GG CTCCTCCCAG AACCAGTCCC A6CCTGGGA0 GATCAAGGCC TGGATCCGGQ 4500 
GCCGTTATCC ATCTOGAGOC TGCAGG6TCC TTGGGGTAAC AGGOACCACA 6ACC0CTCAC 4560 
CACTOUAGA TTCCTCACAC TGGGGAAATA AAGCCATTTC AQAGQAAAAA AAAAAAAAAA 4620 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NOt 558 Protein sequenea 
Protein Accession #: XP 057188.1 



1 
I 

MEDAFGAAW 
TWGPRAPNLV 
AVRDHQMAST 
YSAPPLVDDO 
ENATQAQLPC 
AQVERimRK 
AQSBLFROdl 
SNSLIRNLLD 
PGQGPGBSMY 
ALGACLLLRV 
WGDATCLQLA 
TFRKSBEBPT 



11 

I 

TVWDSDAHTT 
VSVLGGSGQP 
GGTKWAMGV 
TRGCLGGENR 
LLVAGSGGAA 
EUiTVYSSED 
QWRSPHLEAS 
QASHSAOTKA 
LLSDKATSPIi 
MARZXPDAEB 
MQADARAFPA 
REEXaBFCMDS 



21 

I 

EKPTDAYGEL 
VLQTWIiQDLL 
APWGWRNRD 
PRLRIiESYIS 
DCLABTLEDT 
GSEEFGTIVL 
LMDALLNDRP 
PALKOGAAEL 
8U3AGLGQAP 
AARRKDXiAFK 
QDGVQSLLTQ 
VINQEGPVGT 



31 

1 

DFTGAGRKHS 
RRGLVRAAQS 
TLINPKGSFP 
QQKTGVGGTG 
LAPGSGGARQ 
KALVKACGSS 
EPVRLLISHG 
RPPDVGHVLR 
WSDLLLWALL 
PEGMGVDLFG 
KWWGDMASTT 
ADPABKTPLG 



41 
1 

NFLRliSDRTD 
TGAHIVTCGL 
ARYRHRGDPE 
IDIPVliLLLI 
6EARDRIRRF 
EA8AYLDBLR 
LSUaPLTPM 
MLLGKKCAPR 
LNRAQMAMYP 
EOfRSSBVRA 
PIKALVUVPP 
VPRQSORPGC 



51 
I 

PAAVYSLVTR 
HTGIGRBV6V 
DOVQPPLDYM 
DGOEKMLTRI 
FPXGDLEVLQ 
LAVAWNRVDI 
RZAQLYSAAP 
YPSOQAMDPH 
WaCSNAVSS 
ARLZiZiRRCPL 
CPPLIYTRLI 
CGGROGGRRC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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10 



VBRHmmtOA PVTIFM6NW SYLLFLLLFS RVLLVDFQPA PPGSLKLLLY FWAFTLLCBB 780 

LRQGIiSGGGG SIASGCraPO KASLSQRLRL YLADSHNQCO LVM.TCPU.0 VGaU.TP6X.y 640 

BbGRTVIiCZD FMVPTVRLLH ZFTVNRQLGP KIVIVSKMMK DVFPPLFPLG VNLVAYGVAT 900 

EGLLRPRDSD FPSILRRVFY RPYLQZFQQZ PQEDMDVALM EKSNCSSEPO FWAHPPQAQA 960 

GTCVSQYANW LWLLLVIFL LVANILLVNL LZAMFSYTFO KVQGNSDLYW KAQRYRLIRE X020 

FHSRPAIAPP PZVZSHLRLL LRQLCRRPRS PQPSSPALEH FRVYXiSKEAB RKLLTWBSVH 1080 

KBNFLLARAR DXRBSDSBRL KRTSQKVDIA ZJQQLGBZRBY GQRLKVLERB VQQCSSVLGW 1140 
VABALSRSAL LPPG6PPPPD LFGSKD 

Seq ID NO: 559 DNR sequence 

Nucleic Acid Accession S: NM_006853.1 

coding sequence X 26.. 874 ^ 



^_ 1 11 21 31 41 51 

15 I I I I I I 

A6QAATCTQC GCTCGOGTTC CGCAOATGCA GAGCTTTGAGG TGGCTGCGQG ACTGGAAGTC 60 

AT0QG6CAGA GGTCTCACAG CAQCCAAGGA ACCTGGGGCC CGCTCCTCCC C3CCTCCAGGC 120 

CATQAGGATT CTGCAGTTAA TCCT G CTTGC TCTGGCAACA GGGCTTGTAO GGGGAGAGAC 180 

^ CAGGATCATC AAGGGGTT06 AGTGCAAGCC TCACTCCCAG CCCTGGCAGG CAOCCCTGTT 240 

20 CGAGAAGACG CGGCTACTCT GT6GGGCGAC GCTCATCGCC CCCAGATGGC TCXTPGACAGC 300 

AGCCCACTGC CTCAAGCCCC GCTACATAGT TCACCTGGGG CA6CACAACC TCCAGAAGGA 360 

GGAGG6CTGT GAQCAQACCC GGACAGCCAC TGAGTCCTTC CX«CACCCCG GCTTCAACAA 420 

CAGCCTCCCC AACAAAGACC ACOGCAATGA CATCATGCTO GTGAAGATGG CATCXiCCAGT 480 

CTCCATCACC TGGGCTGTGC GACCCCTCAC CCTCTCCTCA OGCTGTOTCA CT6CTGGCAC 540 

25 CA6CTGCCTC ATTTCOGGCT GGGGCAGCAC GTCCAOCCCC CAGTTACGCC TGCCTCACAC 600 

CTTGCGATGC GCCAACATCA CCATCATTQA GCACCAGAAO T0T6AQAACQ CCTACCCOGG 660 

CAACATCACA GACACCA7GG TGTGTGGGAG CGTGCAGGAA 6GGGGCAAGG ACTCCT6CCA 720 

GGGTGACTCC OGGGGCCCTC TGGTCTGTAA COVSTCTCTT CAAGGCATTA TCTCCTOGGG 780 

CXaiGGATCOG TGTGCQATCA CXXX3AAAGCC TGGTGTCTAC ACGAAAQTCT GCAAATATGT 840 

30 GOACTGGATC CAGGAGACGA TGAAGAACAA TTAQACTGGA GCCACCCACC ACAGCCCATC 900 

ACCCTCCATT TCCACTTGGT GTTTGGTTCC TGTTCACTCT GTTAATAAOA AACCCTAAGC 960 

CAAGACCCTC TACGAACATT CTTTGGGCCT CCTGGACTAC AGGAGATGCT GTCACTTAAT 1020 

AATCAACXTO GGGTTOGAAA TCAGTGAGAC CTGGATTCAA ATTCTGCCTT GAAATATTGT 1080 

QACTC TGGGA ATOACAACAC ClWriWlT C TCTOTTOT A TGOCCAGCCC CAAAGACAGC 1140 

33 TCCTGGCCAT ATATCAAOOT TTCAATAAAT ATTTGCTAAA TGAOTQ 

Seq ZD NO: 560 Protein sequence 
Protein Accession #x NP^006844.1 

40 1 11 21 31 41 51 

I I 1 I I I 

KRILQLILLA LATGLVGGET RIIKGFECKP HSQPWQAALP BKTRLLCGAT LIAPRWLLTA 60 

AHCLKPRYZV HLGQRtll^KB EGCEQTRTAT ESFPHPGFMN SLFNKDHRMD ZMLVKMASPV 120 

. 8ZTNAVRPLT LSSRCVTAQT SCLZ8GH08T SSPQIiRZiPHT ZiRCANZTIZB HQKCENAYPG 180 

4j KXTDTKVCA8 VQEGGKDSGQ 6DSGGPLVCK QSLQ0ZZ8HG QDPCAITRKP GVYTKVCKXV 240 
DHXQETKKNN 

Seq ID KOt 561 DMA sequence 
nucleic Acid Acceaaion «t Ay046419.1 
DU Coding sequence t 1..1743 

1 U 21 31 41 51 

I I 'I I I I 

AT6TTTACCT TCCTGTCATC TGTCACTSCT GCTGTCaUTra OGCTOCTQQT GOGTTATGAA 60 

33 CTTGOGATCA TCTCTGGGGC 7CTTCTTCAG ATCAAAACCT TATTAGCOCT GA6CTGGCAT 120 

GAGCAGGAAA TGGTTGTGAG CTCCCTCGTC ATTGGAGCCC TCCTTGCCTC ACTCAC06GA 180 

GGGGTCCTGA TAGACAGATA TGGAAGAAGG ACAGCAATCA TCTTGTCATC CTGCCTQCTT 240 

G6ACTGGGAA GCTTAOTCTT 6ATCCTCAGT TTATCCTACA GQGTTCTTAT AGTGGGAOGC 300 

ATT60CATA0 GGQTTTOCAT CTCCCTCTCT TCCAT TGCCA CmSflWWh CATOGCAGAG 360 

OO ATTGCTOCTC AACACAGAAO AGGCCTTCTT OTQTGACTQA ATGAGCTGAT GATTOTCATC 420 

GGCATTCTTT CTGCCTATAT TTCAAATTAC GCATTTGCCA ATGTTTTCX3V TGGCTGGAAG 480 

TACATGTTTG GTCTTGTGAT TCCCTTGGGA GTTTTGCAAG CAATTGCAAT GTATTTTCTT 540 

CCTCCAAGCC CTCX3GTTTCT GGTGATGAAA GGACAAOAGG GAGCTGCTAG CAAGGTTCTT 600 

GQAAOOTTAA GAGCACTCTC AGATACAACT GAGOAACTCA CTGTQATCAA ATCCTCCCK3 660 

03 AAAGAT6AAT ATCAOTAGAG TTmGOGAT t ' lWrVO l in CAAAAGACAA CATGOGGACC 720 

OGAATAATGA TAGGACTAAC ACTA6TATTT TTTGTACAAA TCACTGGCXrA ACCAAACATA 780 

TTGTTCTATG CATCAACTGT TTTGAAGTCA GTTGGATTTC AAAOCAATGA GGCAGCTAGC 840 

CTOGCCTCCA CTGGGGTTGG AGTOGTCAAG GTCATTAGCA CCATCCCTGC CACTCTTCTT 900 

OTAGACGATG TCXSGCAOCAA AAGATTGCTC TGCATTGGCr CCTCTGTGAT GGCAGCTTOS 960 

/U TTGGT6ACCA TGGGCAT06T AAATCTCAAC ATCCACATGA ACTTCAGOCA TATCTGCAGA 1020 

AGCXaVCAATT CTATCAACCA GTCCTTGGAT GAGTCTX3TGA TTTAT6QACC AGQAAACCTG 1080 

TCAACX:AACA ACAATACTCT CAGAGAOCAC TTCAAAGGGA TTTCTTCCCA TAGCA6AA6C 1140 

TCACTCATGC CXXTTGAGAAA TGATGTGGAT AAGAGAGGGG AGA0GACX:TC AGCATCCTTG 1200 

CTAAATGCTO GAT7AAGCCA CACTGAATAC CAGATAGTCA GAGACCCTGG GGAGGTOOCA 1260 

/3 GCTTTTTTGA AATGGCTGTC CTTAGCCAGC ri ' GCri 'GTTT AT G TT G CT G C TTTTTCAATT 1320 

OGTCTAGGAC CAATOCCCTG GCTGGTGCTC AGC3GAQATCT TTCCTGOTGG GATCAGAGGA 1380 

CGAGCCATGG CTTTAACTTC TAGCATGAAC TGGGGCATCA ATCTCCTCAT CTCGCTGACA 1440 

TTTTTGACTG TAACTGATCT TATTGGCCTG CCATGG6T6T GCITTATATA TACAATCATG 1500 

AOTCTAGCAT COCTOCrTTT TSTTGTTATG TrTATACCIO AOACAAAGGO ATGCTCTTTO 1560 

OU GAACAAATAT GAATG6A6CT A6CAAAAGTG AACTATGTGA AAAACAACAT TTGTTTTATG 1620 

AOTCATCACC AAOAAOAATT AGT6CCAAAA CAOOCTCAAA AAAQA AAACC CCAGGA6CAG 1680 

CTCTTGOAGT GTAACAAGCT GTGTGGTAQG GGCX3VAT0CA GQCAGCTTTC TCCAGAGACC 1740 
TAA 
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Seq ID NOi S63 Protein sequence 
Pzotein Accession 8: AIU.Oa32T.l 
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1 
I 

MFTFLSSVTA 
GVLIDRYQRR 
lAPQHRRGLL 
PPSPRFLVNK 
RZMIGLTLVF 
VDHVGSKTPL 
STNNNTLRDH 
AFLKHLSLAS 
FLTVTDLIGL 



I 

AVSOLLVGYE 
TAIIIiSSCLL 
VSLiNHLMIVI 
GQEQAASKVL 
FVQITGQFNX 
CIGSSVMAAS 
PKGISSHSRS 
LLVYVAAFSZ 
PHVCFZyriM 
QFQKRKFQEQ 



21 
I 

U3IISGALLQ 
GliGSLVLILS 
GILSAYISNY 
GRLRALSDTT 
LFYASTVLKS 
LVTMQIVNLN 
SLMPLRNDVD 
GL6PMPHLVL 
SIASLLFWM 
LLEanCLGOR 



31 
I 

IKTLLALSCS! 
LSYTVLIVGR 
AFANVFHGHK 
BBLTVIKS8L 
VGFQSNEAAS 
IHMNPTHICR 
KRGBTTSASL 
SBIFPGGZRG 
PXPBTRQCSIi 
GQSRQLSPET 



41 

I 

BQEHWSSLV 
lAIGVSISLS 
YMFGLVIPLG 
KDBYQYSFWD 
LASTGVGWK 
SHKSINQSLD 
LHAOLSHTEY 
RAMALTSSMK 
BQZSMBIiAKV 



51 
I 

IGALLASLTG 
SIATCVYIAB 
VLQAIAMYFL 
LFRSKDNMRT 
VXSTZPATLL 
BSVZYGPGNL 
QIVTDPGDVP 
WGZNZJiISLT 
NYVKNNZCFM 



Seq ID NO 2 563 pah Beguence 

Nucleic Acid Accession #t XN_059466.1 

Coding sequence t 1..894 



ATGQAGCGQC 
CTGCTOGTCA 
GA6A6CTG06 
CCX5CTGTCX3C 
GGCCCGGGGC 
GCCGAGTGCG 
CTGGGCATCQ 
GCCATCAAGT 
AAGACXIATAC 
CTCGQCATQG 
TGG6AG6AGA 
TQCACCATTT 
AAGGTAATTT 
OCCTGOTOGiA 
ATTAGCOGGA 



11 
I 

GGGCGCTCGT 
CGGCCATCTT 
A6CXXA6CCG 
ACCTQCOOCT 
GCGCCX3ACCC 
GCCGGCCCCT 
ACOGGGACAT 
ACCACTTTTC 
AGCAAGATGA 
CCX3TAQCCQT 
GCTTGACCCA 
CCCTCT6TAC 
ATAGCXrrOCC 
QTTTAGOCTT 
CCAAGATTGC 



21 
I 

CAOGGGGCTC 
CACGQACCAC 
CGOGGGOQCC 
GGGGGACT06 
CQAGTCCTGG 

CTTCGCxaicx: 

CX3ACACCCTC 
TCAGCCCATC 
GTGGCACCTG 
CCTTCTCTGC 
GCACGTGGCT 
TTATGCOGCC 
TGCTGATGTG 
TATTQTQGCA 
ACAGCXAAAG 




CGCTCQCTCC 
TA CTOG GGCC 
ATCCTGAAAG 
GQCTTG06AA 
CTTCATTTAA 
QGCTGCATTG 
GGACTCCTGT 
AGTATCTCGT 
GAACATG6TT 
6CTGGA6GTC 
TCTG6CAQAQ 



41 

I 

TCAGCCTGTG 
CGGACCCCOG 
ACCAGAAGAA 
GOCGCCOGCT 
TGGGGCTCGG 
TCTGQAGGAA 
GTATTGOGCA 
ACATTCCTTT 
GAAGAATCAC 
TGGCCACAGT 
TGCTCArOAC 
ATGATTTGAA 
ACAGCTGGTC 
TCTGCATCX3C 
ACTCCA0G6T 



51 
I 

CTCCCTGGGG 
GGGCCACAAO 
OOGCCTGATG 
GCrCCOGGGC 
OGGGCTGGAC 
GTGCTACTTC 
G06ATGCA0G 
TAATTTAAOC 
TGCTGGCTTC 
CAGTTTCTTC 
AGGGATATTT 
COGGCTCCCA 
CATCTTTTOC 
TTATCOGTTT 
AT6A 



Seq ID NOx 564 Protein sequence 
Protein Accession #t XP_059466.l 

1 11 21 31 41 51 

I I 1 I I I 

MSPRALVTAL SLSLSLCSLG LLVTAIFTDH WYEIDPRRHK ESCSBR8RAGA DPPDQXNRliM 
PLSRLPLRDS PPLGRRLLPG GPGRADPESW R8LLGLGGLD AECSRPLFAT YSGIMRKCYF 
LGIDRDIDTL ILKGIAQRCT AIKYHFSQPI RLRMIPFNLT KTIQQDEWHL LHLRRITAGP 
LGMAVAVLliC GCIVATVSFF WBESLTQHVA GLLFLMTQIF CTISLCTYAA SISYDLNRLP 
KLIY8LPADV EHQYSHSZFC AWCSIjGFIVA AGQLCZAYPF ISRTKIAQLK SGRDSTV 

Seq ID NOt 565 DWA sequence 

NUcXeic Acid Accession #: Eos sequence 

coding sequence : 1 . . 3315 



ATGTCCTTTC 
ACCCGOACCC 
TTGGTGAATT 
TCCAAGOCCA 
ACCCAGATCA 
GACGCCTTTG 
TCCTGCGACA 
ACACCCAACC 
A1QCQCAA0A 
A086GA6QCA 
ATCAGCA6GA 
TCCAACCGG6 
CTTATGGAT6 
CTGCTCGTGG 
CTAGAGAAGT 
ATTGTQTGTT 
AAAAATAAAA 
A6CCTGGTG6 
TTTTTAOOCC 
CTCAAA6AAA 
GATGAAATTG 
CAAGACAAGG 
TTAGCCAATG 
GTCATGTTTA 
GGCTTGAACC 
TTCAGCACQC 
CTCAOGTTTG 
AATGGC06GG 
CTGCAAGCTC 



11 

I 

GGGCAGCCAG 
TGTACTCCAQ 
TTATTCAAGC 
GQQAOAAiaT 
ACCAAAGTGA 
GGGATATTCA 
CGGAOGCGGA 
TGGTCATTTC 
TCTTCAG003 
CCCATTATGG 
GTTCAGAGGA 
ACACCCTCAT 
ACTTCACAAG 
ACAATGGCTG 
ATATCTCTGA 
TT6CCCAAGQ 
TTCCTTGTGT 
AGGTGQAGGA 
GCACGGXGTC 
TTCTOSAATG 
TGAOCAATGC 
ATAACTGQAA 
ATGAGATTTT 
CGGCTCTCAT 
TAOGGAAOTT 
TTGTGTACOG 
TCTOGAAACT 
AOGAGATGGA 
TCTTCATCTG 



21 
I 

GCTCAGCATG 
CGCGTCrOGG 
AAATTTTAAG 
GTGCAAGTGT 
GAAATGGAAC 
GTTTGAGACA 
AATCCTTTAC 
TGTGAGGGGG 
GCTCATCTAC 
CCTGATGAAO 
GAATATTGTG 
CAGGAATTGC 
AGATCCACTG 
TCATGGACAT 
GCGCACTATT 
AGGTGGAAAA 
GGTGGTGGAA 
TGCCCTGACA 
C06GCXGCCT 
TTCTCACCTA 
CATCTCCTAC 
TGGGCAGCTQ 
CACCAATGAC 
AAAGGACAGA 
TCTCACCCAT 
GAATCTGCAG 
GOTTGCGAAC 
CATAOAACTC 
GQCCATTCTT 




60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
130 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



CTGAACTGTT 
ATTCCTATAA 
6CTTCCGGAA 
CrCCTATTAC 
AS6AACTCTC 



51 
I 

TCTGGACAGC 
TGAAAGCGAC 
TACCAAAQAT 
GATGOAAGGC 
ATTTCCTAGC 
TATAOGTCTG 
GCACCTGAAA 
GAAGCG6G0C 
TTG6ATTCTC 
AOATAACACC 
GGGCATGGTC 
AOCCCAGTAC 
CACACATTT6 
COCaGAATGAG 
CAAGATCCCC 
TACCTCCATC 
TGTGATCGCT 
GCTGGTGGGC 
GATCAAATGG 
AGAAGCTGGG 
CACCAGTGAG 
CCAGCTGQAC 
CCTTCAA6AA 
TCTGGAGAAT 
CTCCAACCAC 
TGATGCCCTC 
GGAAGACAGA 
TCGGCACCCC 
CAAAOTCATT 



60 
120 
IBO 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 



1169 



wo 03/042661 



PCT/US02/36810 



TGG6AGCAGA CCAGG6GCTG CACTCTGGOV GCCCTGGGAO CCAGCAA6CT TCT6AAGACT IBOO 

CTGOCCAAAG TGAAQAAGGA CATCAATGCT GCTGGOGAGT COGAGGAGCT GGCTAATGAG 1860 

TAOSAGACCC GGGCTGTTGiV GCrGTrOVCT GAGT6TTACA 6CAGCQATGA AGACTrGGCSV 1920 

GAACAGCTGC TGCTCTATTC CTGTQAAGCT TGGGGTGGAA OCAACTGTCT GGAGCT6606 1980 

5 GTCCWMSGCCA CRGACCW5CA TTTCATOGCC CM3CCTGGC3G TCCAGAATTT TCTTTCTAM 2040 

CAATGGTATO GAGAGATTTC CC6AQACACC AAGAACTGQA AGATTATCCT GTGTCTGTTT 2100 

ATTATACCCT TGGTGGGCTG TGGCTTTGTA TCATTTAGGA AQAAACXTGT CGACAAGCAC 2160 

AAGAAGCTGC TPrGGTACTA TGTGGO GT TC TTCACCTOCC CCTrOGTGGT CTTCTCCTGG 2220 

A A TOT GG TCr TCTACAT08C C rA 'C C TCC T Q CWrrr G CCT AOGTOCTOCT CATGOATTTC 2280 

10 CATTCGGTOC CACACCCCCC 0GA6CTG0TC CTCTACTOGC imJlt, T nt»T CCTCTTCTOT 2340 

GATGAAGTGA GACAGTGGTA C3GTAAATGGG GTGAATTATT TTACTGACCT 6TGGAATGTG 2400 

ATGGACyVCOC TGGGGCTTTT TTACTTCATA GCAGGAATTG TATTTOOGCT CCACTCTTCT 2460 

AATAAAAGCT CTTTQTATTC TGGAOSAGTC ATTTrCTGTC TGGACTACAT TATTTTCACT 2520 

CTAAOATTGA TCCACATTTT TACTGTAftOC AOAAACTTAS GACCCAAOAT TATAATGCTG 2580 

15 CAGAGGATGC TGAT06ATGT GTTCTTCTTC CTCrrCCTCT TT GO SGTGTG 6ATG6TG60C 2640 

TTTGGCXnXSO CCA06CAAGG GATCCTTAGO CAGAATGAOC AGCGCTGGAG GTGGATATTC 2700 

CGTTC6QTCA TCTACGAGCC CTACCTGGCC ATGTTCGGCC AGGTGCCCAQ TGACGTGGAT 2760 

6GTACCA0GT ATGACTTTGC CCACTGCACC TTCACTGGGA ATGAGTCCAA GCCACTGTGT 2820 

GT66AGCTGG A7QAGCACAA CCTOOCXXMG TTCXXXGA6T GGATCACCAT COCGCXGGTG 2680 

20 TGCATCTACA T6TTATCCAC C3UVCATCCTG CTGGTCAACC T6CTGGT0GC CATGTTTGGC 2940 

TACACQGTGG GCACCGTCCA GGAQAACAAT QACCAGGTCT GGAAGTTCCA GAGQTACTTC 3000 

CTOGTGCAGG AGTACTGCAG CCGCCTCAAT ATCCCCTTCC CCTTCATCGT CTTCGCTTAC 3060 

TTCTACATGG TGGTGAA6AA GTGCTTCAA6 TGTTGCTGCA AGQAGAAAAA CATG6AGTCT 3120 

TCTGTCT6CT GTTTCAAAAA TOAAGACAAT 6ABACTCTGG CATOGQAGGG TGTCAT6AA0 3180 

25 GAAAACTACC TTGTCAAGAT CAACACAAAA OGCAAOSACA GCTCAOAaaA AATGAGGCAT 3240 

OOATTTAGAC AACTGOATAC AAAGCTTAAT GATCTCAAG6 OTCTTCIOAA AGAaATTGCT 3300 
AATAAAATCA AAT6A 

t%g\ 8^ ZI> NOt 566 Protein eequence 
30 Protein AcceBsion ttt Eos Beguence 

1 11 21 31 41 51 

I I 1 I I I 

MSFRAARLSM RNIIRMOTLOS TRTLYSSASR STDLSYSESD LVNPIQAXIFK XRECVFFTKD 60 

35 SKATENVCKC GYAQSQKMBG TQINQSEKHN YKKHTKEFPT DAFGDIQFET L6KKGKYIRL 120 

SCDTDABIIiy BLLTQHWHLK TPNLVISVT6 GAKNFALKFR KRKIFSRLIY ZAQSKGAWIL 160 

IGGTHYGIiMK YIGEWRDNT AIOIAAHGMV SMKDTLIRKC DAKGYFLAQY 240 

LMDDFTRDPXi YILDNNHTHL UiVDNGCHGH PTVBAKLRNQ LEKYISERTI QDSNYGGKIP 300 

IVCFAQGG6K BTUCAINTSZ KNXZPCVWB GSGQIADVZA SLVEVEDALT SSAVKEKLVR 360 

40 FLPRIV8RU> EEETESWZKH LKBILEC8HL LTVIKMEBAG DEIVSNAISY ALYKAFSTSE 420 

QDKDNWNGQL KLLLEWNQLD LAHDBIFTND RRWESADLQB VMFTALIKDR PKFVRLPLEM 480 

GLNLRKFIiTH DVIjTELFSNH PSTLVYRNLQ ZAKNSYNDAL LTFVHKLVAN FRRGFRKEOR 540 

N6RDEMDIEL HDV6PZTRHP LQALFIWAZL QNXKELSKVZ HEQTRGGTIiA ALOASKLLKT 600 

. LAKVKNDINA AOBSEEIiANE YGTRAVBLPT ECYSSDEDIA EQLLVYSCEA WOGSNCCiELA 660 

45 VEATDQHPIA QPGVQNPLSK QWYGBISRDT KNWKIILCLP ZZPLVGCX5PV SFRKKPVDKH 720 

KKLliWYYVAF FTSPFWFSW NWFYIAFLL LPAYVLUOP HSVPHPPELV LYSLVFVLFC 780 

DBVRQWYVNG VNYFTDLWNV MDTL6LPYF1 AGZVFRLHSS NKSSLYSGRV IPCLDYIIFT 840 

LRLZHZFTVS RNLGPKZIML QRMLZDVFFF LFLFAVHMVA FGVARQGZLR QNEQRWRWZF 900 

RSVIYEPYUV MFGQVPSDVD GTTYDFAHCT FTGETESKPLC VBIjDEHNLPR FPEHZTZPLV 960 

50 CZYMLSTNZL LVOTJiVAMFG YTVGTVQBNN DQVWKFQRYP LVQEYCSRLN ZPFFFIVFAY 1020 

FYMWKKCFK COCKERNMBS SVCCFKNEIH} BTLAHEGVMK ENYXiViaNTK ANDT8EEMRH 1080 
RFRQLDTKZ^ DLXGLLKEZA NKZR 

. - Seq ZD NO: 567 PKA sequence 
55 Nucleic Acid Accession Ifi NM_006911.1 
Coding sequence: 1..558 ~ 



1 11 21 31 41 51 

60 1 I I I I 1 

ATGCCTCGCC T G TTCTT G TT CCACCTGCTA GAATTCTGTT TACTACTGAA COWITTTTCC 60 

AGAGCAGTOG CGGCCAAATG GAAGGAOGAT OTTATTAAAT TATGCXSGCCG C3GAATTAGTT 120 

GGCXSOGCAGA TTGCCATTTG GGGCATGAQC ACCTGGAGCA AAAGGTCTCT GAGGCAGGAA 180 

GATGCTCCrC AGACACCTAQ ACCROTGGCA GAAATTGTAC CATCCITCAT CSkACAAAGAT 240 

65 ACAQAAACTA TAATTATCAT OTTGOJVATTC ATTGCTAATT TGCCACOGQA GCTGAAGGCA 300 

GCCCTATCTG AGAGGCAACC ATCATTACCA GAGCTACAGC AGTATGTACC TGCATTAAAG 360 

QATTCCAATC TTAOCTTTGA AOAATTTAAG AAACTTATTC GCAATAGGCA AAGTGAAGCX: 420 

GCAGACA6CA ATCCTTCAGA ATTAAAATAC TTAGGCTTGG ATACTCATTC TCAAAAAAAG 480 

AGAOSACCCT AOGTGGCACT GVTTQAQAAA T O I TOO C T AA TT OaTfOTA C CAAAAOOTCT 540 
CTTGCTAAAT ATTOCTGA 



70 
75 
80 



Seq ID NOt 568 Protein sequence 
Protein AccesBion <|t NP_008842.1 

1 11 21 31 41 51 

I 1 I I . t I 

MPRIjFLFHLL BFCLLLNQFS RAVAAKWKDD VIKLGGRELV RAQIAIOGM8 TWSKRSLSQB 60 
DAPQTPRPVA BZVPSFINKD TBTIIIHLEP lANLPPELKA ALSBRQPSLP BLQQYVPALK 120 
DSNLSFEEFK KLZRNRQSBA ADSNPSELKY L6LDTBSQKK RRPYVALPBK CCXiZGCTKRS 180 
LAKYC 

Seq ID NO: 569 DNA sequence 

Nucleic Acid Accession ftt XH 036453.1 



1170 



wo 03/042661 



Oodlog sequences X..3978 

1 12 21 31 41 51 

i I I I I ) 

ATGCTGCCCG TOTACCAGOA GQTGAAGCCC AACCOSCTGC AGGACGOGAA CCTCTGCT C A 60 

OGCGTGTTCT TCTGGTGGCT CAATCCCTTG TTTAAAATTG GCC3VTAAACX; GAQATTAQAG 120 

GAAQATGATA TOTATTCAOT GCTGCCAGAA GACCGCTCAC AGCACCTTGG AGAGQAGTTG 180 

CAAGGGTTCT GGGATAAA6A AGTTTTAAQA GCTGAGAATQ AOOCACAOAA 6CCTTCTTTA 240 

ACAA6A6CAA TCATAAAGT6 TTACTGGAAA TCTTATTTAG TTTTGGGAAT TTTTAOGTTA 300 

ATTGAGOAAA GTGCCAAAGT AATCCA6CCC ATATTTTTGG GAAAAATTAT TAATTATTTT 360 

GAAAATTATG ATCCCATGGA TTCTGTGGCT TTGAACACAG C3GTACGCCTA TOCCAOGOTG 420 

CTQACTTTTT GCAOGCTCAT TTTGGCTATA CTGCATCACT TATATTTTTA TCAOGTTCAG 480 

TGTGCT6GGA TGAGGTTACG AOTAGCCATG TGCCATATOA TTTATOSQAA GQCACTTOOT 540 

CTTAOTAACA TGGCC ATGGG GAAGACAACC ACA6G0CA6A TA6TCAATCT GCTGTOCftAT 600 

GATGTGAACA AGTTTGATCA GGTQACAOTG TTCTTACACT TCCTGTGGGC AGGACCACTG 660 

CAGGCGATCG CAGTQACTGC CCTACTCTGG ATGGAGATAG GAATATCOTQ CCTTGCTGOG 720 

ATGGCAQTTC TAATCATTCT CCTGCCCTTG CAAAGCTGTT TTGGGAAGTT GTTCTCATCA 780 

CTGAGGAGTA AAACTGCAAC TTTCAOGGAT GCCAGGATCA GGACCATQAA TOAAOTTATA 840 

ACTOCTATAA GGATAATAAA AATGTAG6CC TGGGAAAAGT aVTTTTCAAA TCTTATTAOC 900 

AATTT8A6AA AGAAGOAGAT TTGCAAGATT CTQAQAAOTT CCTOCCTCAa GGGGATQAAT 960 

TTGQCTTCGT TTTTCAGTQC AAOCAAAATC ATOGTGTTTO TQACCTTCAC CACCTAOSTG 1020 

CTCCT06GCA GTQTGATCAC AGCXSU3G00C QTGTTOGTGG CAGTGA06CT GTATGGGGCT 1080 

GTGCGGCTGA CGGTTACCCT CTTCTTCCCC TCAGCCATT6 A6A6G0T6TC AQAGGCAATC 1140 

OTCAGCATCC GAAGAATCCA OACCTTmO CTACTTQATG AGA7ATCACA GGSGAAOOBT 1200 

CA6C1GG0QT CA6ATGGTAA AAAGATGGT6 CATGTGCAaG ATTTTACTQC TTTTTOOQAT 1260 

AAQGCATCAG AOACCCCAAC TCTACAAGGC CTTTCCTTTA CTGTCAGACC TGGCX3AATTG 1320 

TTAGCTGTGG TCGQCCCOGT GGGAGCAGGG AAGTCATCAC TGTTAAGTQC OGTGCTCGGO 1380 

OAATTGGCCC CAAOTCACX3G GCTGGTCAQC GT6CATGGAA GAATT6CCTA TOTGTCTCAG 1440 

CAGGCCTGGG TGTTCTOQGO AACTCTOAGQ AOTAATATTT TATTTGGOAA GAAATAOQAA 1500 

AAG6AA0QAT ATGAAAAAOT CATAAA06CT TGTGCTCTGA AAAAGGATTT ACA6CT0TTG 1560 

GAGGATGGTG ATCTGACTGT GATAGGAGAT OQGGGAACCA OGCTGAGTGG AGGGCAGAAA 1620 

GCACGGGTAA ACXHTGCAAG AGCAGTGTAT CAAGATGCTG ACATCTATCT CCTGGAC30AT 1680 

OCTCTCAOTG CS^QTAGATGC G6AAGTTAGC AGACACTTGT T0GAACT6TG TATTTGTCAA 1740 

ATTTTGCATO AGAAGATCAC AATTTTAGIO ACTCATCAGT TGCAGTAGCT CftAAOCTGCA 1600 

AGTCAQATTC TGATATTQAA AGATGGTAAA ATG0T6GAQA AGGG6ACTTA CACTGAGTTC 1860 

CTAAAATCTO GTATAGATTT TGGCTCCCTT TTAAAOAAGG ATAAT6AGGA AAGTGAACAA 1920 

CCTCCAOTTC CAGGAACTCC CACACTAAGG AATOQTACCT TCTCAGAGTC TTCXK3TTTGG 1980 

TCTCAACAAT CTTCTAQACC CTCCTTGAAA GATQOTGCTC TGGAGA6CCA AGATACAGAG 2040 

AATGTCCCAO TTACACTATC AGAGGAGAAC GOTTCTGAAG GAAAAGTTOQ TTTTCAGQOC 2X00 

TATAAOAATT ACTTCAGAOC TGGTGCTCAC TGGATTGTCT TCATTTTGCT TATTCTCCTA 2160 

AACACTGCAG CTCAGGTTGC CTATGTGCTT CAAGATTGGT GGCTTTCATA CTGOGCAAAC 2220 

AAACAAAGTA TGCTAAATGT CACTGTAAAT GGAGGAGGAA ATGTAACCGA 6AAGCTAGAT 2260 

CTTAACTGGT ACTTAGGAAT TTATTCAGOT TTAACTGTAG CTACCGTTCT TTTTGGCATA 2340 

GCAAQATCTC TATTGGTATT CTAGGTCCTT GTTAACTCTT CACAAACTTT 6CACAACAAA 2400 

ATGTTTGAGT CAATTCTGAA AGCTCOQOTA TTATTCTTTO ATAOAAATCC AATAGGAAGA 2460 

ATTTTAAATC GTTTCTCCAA AGACATTGGA CACTTGGATG ATTTOCTGCC QCTGACGTTT 2520 

TTAGATTTCA TCCAQACATT 6CTACAAGTG -GTTGGTGTGG TCTCTGTGGC TGTGGCCGTQ 2580 

ATTCCTTGOA TCGCAATACC CTTGGTTOCC CTTGGAATCA TTTTCATTTT TCTTCGOOGA 2640 

TAl'lT rntJU AAAOGTCAAG AQAT6TGAAG G6CCTQGAAT C7ACAACT0G GAOTOCAGTG 2700 

TTTTOOCACT TGTCATCTTC TCTCCA6GGG CTCTGGAOCA TCOGGGCATA CAAAGCAGAA 2760 

GAGAGOTGTC AGGAACTGTT TGATGCACAC CAGGATTTAC ATTCAGAGGC TTGCTTCTTQ 2820 

rrrn tiACAA cotccogctq ottogoogtc cgtctggats ccatctgtgc catgtttgtc 2880 

ATGAT06TTG CCTTT666TC CCTGATTCTG GCAAAAACTC TGGATGCCGG GCAG6TTG6T 2940 

TT0GCACT6T CCTATGCCCT CA06CZGATG GGGATGTTTC AGTGGTGIGT TOQACAAA6T 3000 

GCTGAAQTT6 AGAATATGAT GATCTCA6TA GAAAGGGTCA TTGAATACAC A6ACCTT8AA 3060 

AAAGAA8CAC CTTGGGAATA TCAGAAAOGC CCACCACCAG CCT GGC CCCA T6AAGGAGTG 3120 

ATAATCTTTG ACAATGTGAA CTTCATOTAC A6TCCAGQT0 G6CCTCXGGT ACTGAAGCAT 3180 

CTGACAGCAC TCATTAAATC ACAAGAAAAO GTT6GCATTG TG6GAAGAAC GGGAGCTGGA 3240 

AAAAGTTGOC TCATCTCAOC CCTTTTTAGA TTOTCAOAAC COSAAGGTAA AATTTOOATT 3300 

QATAAGATCT TGACAACTGA AATTGGACTT CA06ATTTAA GGAAGAAAAT GTCAATCATA 3360 

CCTCAGGAAC CT Q TTTTOT r CACTGGAACA ATGAGGAAAA ACCTGGATCC CTTTAATGAQ 3420 

CACACGGATG AGGAACTGTO GAATGCCTTA CAAGA6GTAC AACTTAAAGA AACCATTQAA 3480 

GATCTTCXTtO QTAAAATGGA TACTQAATTA GCAGAATCAG GATGCAATTT TAGT6TTQGA 3540 

GAAAQACAAC TGGIGTGOCT TCCXAGGGGA ATTCTCAGGA AAAATCAGAT ATTGATTATT 3600 

GATGAAGCGA OGGCAAATGT GCSATCCAAGA ACTGATGAGT TAATACAAAA AAAAATC06G 3660 

GAGAAATTTG OXACTOCAC CGTGCTAACC ATTGCACACA GATTGAACAC CATTATTOAC 3720 

AGOGACAAGA TAATGOTTTT AOATTCAGGA AGACTGAAAG AATATX^TGA GCCGTATGTT 3780 

TTGCT6CAAA ATAAAGAGAG CCTATTTTAC AAOATG6TQC AACAACTQGG CAAGGCAOAA 3840 

GC08CTGCCC TCACIQAAAC AGCAAAACAG GTATACTTCA AAAGAAATTA TCCACATATT 3900 

GGTCACACTO ACCACATGGT TACAAACACT TCCAATGQAC AGGGCTOGAC CTTAACTATT 3960 
TTOQAGACAO CACTGTGA 



Seq ID HO: 570 Protein sequence 
Protein Accession «t XP_036453.1 

1 11 21 31 41 51 

I I I I I I 

KLPVYQBVXP NPLQDANLCS RVFFNWUIPL FKIGBKRRLB BDDMYSVLPB DRSQKLGEBL 60 
Q6FWDKEVLR AENDAQKPSL TRAXZKOrWK SYLVLGIPTL XBB8AKVZ0P XFI/SKIIliyP 120 
ENVDPMDSVA LNTAYAYATV LTPCTXiIIiAI LHHLYPYHVQ CAGMHIiRVAM CHMIYRKALR 180 
I.SNMAKGKTT TGQIVNLLSN DVNKFDQVTV PIiHPLMAGPL QAIAVTALLW MEIGISCIAG 240 
MAVLIILLPL QSCPGKLFSS LRSKTATPTD ARZRTMNEVI TGIRIIKMYA WCXSFSNLXT 300 
KUUCKBX8XI LRSSCLRGMN LASFPSASXX XVFVTPmV LLGSVITASR VFVAVTLYOA 360 



1171 



wo 03/042661 



PCT/US02/36810 
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VRLTVTLFFP 
KASETPTLQO 
QPNVFSGTIjR 
ARVNIARAVY 
SQILIIiKDGK 
SQQSSRPSLK 
NTAAQVAYVL 
ARSLLVFYVI. 
LDFZQTXJbQfV 
P8HLSSSLQG 
IIVAFOSLIL 
KBAPHEYQKR 
XSSLZSAI«FR 



DSVTANVDPR 
liLQNKBSLFY 
PETAL 



8AIBRV8EAI 
LSPTVRPGEL 
SNILFGXKyE 
QDADIYUiDD 
MVQKGTYTEF 
DGALESQDTE 
QDWHLSyUAN 
VNSSOTLHNK 
VOWSVAVAV 
LNTIRAYKAB 
AKTLDAGQVa 
PPPANPHEGV 
LSEPBQKXKI 
QBVQXiKBTIB 
TDBLXQKXXR 
KMVQQLGKAB 



VSIRRIQTPL 
LAWOPVOAO 
KERYEKVIKA 
PLSAVDABVS 
UCSGIDFGSL 
NVPVTLSEaf 
KQSMLHVTVN 
r4PBSILKAPV 
XPWIAIPLVP 
BRCQELFDAH 
LALSYALTLM 
IIFDNVNFMY 
DKILTTBIOL 
XHiPGKMDTEL 
EKFAHCTVLT 
AAALTBTAKQ 



LLDEISQRNR 
KSSLLSAVL6 
CAIiSCKDUQIiL 
RKLPBLCIOQ 
liKKDNBESEQ 
RSEGKVGFQA 
GOGNVTBKLD 
LFPORKPIOR 
LGIIFXFLRR 
QDLHSBAVfFL 
GMFQWCVRQS 
SPGGPLVLKB 
KDUfUCXMSII 
AE80SNF8VG 
ZAHRIMTZZD 
VyPKRHYFRX 



QLPSDGKKMV 
ELAPSHGLV8 
BDCDLTVIGD 
ILBBfaTZLV 
PPVPGTPTLR 
YKNYPRABAH 
UIWYLOiySO 
ILMRFSKDI6 
YFLET8RDVK 
FLTTSRNPAV 
AEVENMMISV 
LTALIKSQEK 
PQEPVZtFTGT 
QRQLVOiARA 
SDKIMVLDSO 
GHTOHKVniT 



RVQDPTAFHD 
VHGRIAYVSQ 
ROTTlfSGGQK 
THQLQYLKAA 
NRTFSBSSVW 
wrVPIPLILL 
LTVATVLPQI 
BLDDLLPLTF 
RLESTTRSFV 
RLDAZCAMFV 
ERVIEYTDLE 
VGZVGRTQA6 
MRKHLDPFMB 
ILRKNQILXI 
RXJOSyDBPYV 
SNGQPSTLTX 



420 
480 
540 
600 
660 
720 
760 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID KOt 571 DNA aequence 
Nucleic Acid Accession fit AF07120 
Coding sequence i 11 6.. 4 093 



11 



21 



31 41 51 

1 I 1 ) i I 

GGACAGGCGT GGOGQCCGOA OCCCCAQCAT CCCTGCTTGA G6TCCAG6A0 CGGA6CCGQC 60 

GGCCACCGCC GCCTQATCAG CGCGACCCXX3 GCCCGOGCCC GCCCOGCCCG GCAAQATGCT 120 

GCCCGTGTAC CAGQAGGT6A AGCCCAACCC GCTGCAGQAC GCGAACATCT GCTCAOGCGT 180 

GTTCTTCTGG TGGCTCAATC CCTTGTTTAA AATTGGCCAT AAAOGGAGAT TAQAQQAA6A 240 

TGATATGTAT TCAGTGCTGC CAGAAGACOG CTCACAGCAC CTTGGAGAOO AOTTGCAAGG 300 

GTTCTGGQAT AAAQAAGTTT TAAGAGCTGA GAATGAOGCA CAQAAGCCTT CTTTAACAAG 360 

AGCAATCATA AAGTGTTACT GGAAATCTTA TTTAGTTTTG GQAATTTTTA CGTTAATTQA 420 

OGAAAGTGCC AAAQTAATCC AGCCCATATT TTTGGGAAAA ATTATTAATT ATTTTQAAAA 480 

TTATGATCCC ATGGATTCTG TGGCTTTGAA CACA0CX3TAC GCCTATGCCA CGQTGCTGAC 540 

TTTTTGCACX3 CTCATTTTGG CTATACTGCA TCACTTATAT TTTTATCAOG TTCA6TGTGC 600 

TGGGATQAGG TTACGAGTAG CCATGTGCCA TATGATTTAT CGGAA6GCAC TTC36TCTTAG 660 

TAACATGGCC ATGGGQAAGA CAACXavCAGG CCAGATAQTC AATCTGCTGT CCAATGATGT 720 

GAACAAGTTT GATCAGGTGA CAGTCTTCTT ACACTTCCTQ TGGGCRGGAC CACTGCAGGC 780 

GATOGCAGTG ACTGCCCTAC TCTGGATGGA GATAGQAATA TCX5TGCCTTG CTGGGATGGC 840 

AGTTCTAATC ATTCTCCTGC CCTTGCAAAG CTGTTTTGGG AAGTTGTTCT CATGACTGAG 900 

GAGTAAAACT GCAACTTTCA CGGATGCCAG GATC3VGGACC ATGAATGAAG TTATAACTGG 960 

TATAAGGATA ATAAAAATOT ACGCCTGGGA AAAGTCATTT TCAAATCTTA TTACCAATTT 1020 

6A0AAAGAAG GAOATTTCCA AGATTCTGAG AAGTTCCTGC CTCAGGGGGA TGAATTTGOC 1080 

TTOaTTTTTC AGTGCAAGCA AAATCATCQT GTTTGTGACC TTCACCACCT AOGTGCTCCT 1140 

CGGCAGTGTG ATCACAGCCA GCCGCGTGTT CGTGGCAGTQ ACGCTGTATO GGGCTGTGGG 1200 

GCTGAOSGTT ACCCTCTTCT TCCCCTCAGC CATTGAQAGG GTQTCAGAGG CAATCGTCAG 1260 

CATCCX3AASA ATCCAOAOCT TTTTOCTACT TQATGAGATA TCACAGCGCA ACCG7CAGCT 1320 

GCCGTCAGAT GGTAAAAAGA TGGT6CATQT GCftGGATTTT ACTGCTTTTT QGGATAAGGC 1380 

ATCAGAQACC CCAACTCTAC AAGQCCTTTC CTTTACTGTC AGACCTGGCG AATTGTTAGC 1440 

TGTGGTCX3GC CCCGTGGGAG CAGGGAAGTC ATCACTGTTA AGTGCCGTGC TOGGGGAATT 1500 

GGCCCCAA6T CACQG6CTGG TCA6CX3TGCA TGGAAGAATT GCCTAT6TGT CTCAGCAGCC 1560 

CTGGGTGTTC T066GAACTC IGAGGAGTAA TATTTTATTT GGGAAGAAAT AT6AAAAGQA 1620 

AC8ATATGAA AAAGTCATAA AGGC7TGT6C TCrOAAAAAG GATTTACAGC TQTTGOAGGA 1680 

TGGTGATCTO ACT6T6ATAG GAGATCX3GGG AACCA06CTG AGTGGAGGGC AGAAAGCACG 1740 

GGTAAACXXr GCAAGAGCAQ TGTATCAAGA TGCXGACATC TATCTCCTGG ACOATCCTCT 1800 

CA6TGCA0TA QATGGGGAAS TTAGCAGACA CTTGTT06AA CTGT6TATTT GTCAAATTTT 1860 

GCAT6AGAA0 ATCACAATTT TAOTQACTCA TGAOTTQCMS TACCTCAAAO CTGCAAGTCA 1920 

GATTCTGATA TTGAAAOATQ QTAAAATGOT GCAGAAGGOO ACTTACACTG AGTTCCTAAA 1980 

ATCTGOTATA GATTTTGGCT CCCTTTTAAA GAAGGATAAT GAGGAAAGTG AACAACCTCC 2040 

AGTTCCAGGA ACTCCCACAC TAAOGAATOG TACCTTCTCA GAOTCTTCGG TTTGGTCTCA 2100 

ACAATCTTCT A6ACCCTCCT T0AAAGAT6G TGCTCTG6AG AGCCAAGATA CAGAGAATGT 2160 

COCAGTTACA CTATCAGAGO ASAA008TTC TQAAGGAAAA QTTQGTTTTC AGGCCTATAA 2220 

GAATTACTTC AGAGCTGOTG CTCACTGGAT TGTCTTCATT TTCCTTATTC TCCTAAACAC 2280 

TGCAGCTCAQ GTT6CCTATG TGCTTCAAGA TTGGTGGCTT TCATACTGGO CAAACAAACA 2340 

AAGTATOCTA AATGTCACTG TAAATGGAGG AGGAAATGTA ACCX5AGAAGC TAGATCTTAA 2400 

CTGGTACTTA GGAATTTATT CAGGTTTAAC TGTAGCTACC GTTCTTTTTG GCATAGCAAQ 2460 

ATCTCTATTO GTATTCTAG6 TCCTTGTTAA CTCTTCACAA ACTTTGCACA ACAAAATGTT 2520 

TGAGTCakATT CIGAAAGCTC OQGTATTATT CTTTGATAGA AATCCAATAO QAAGAATTTT 2580 

AAATCGTTTC TCCAAAGACA TTGQACACTT GGATGATTTG CTGCOGCTGA CGTTTTTAQA 2640 

TTTCATCCAG ACATTGCTAC AAGTGGTTGG TGTGGTCTCT GTGGCTGTGG COOTQATTCX: 2700 

TTGGATCGCA ATACCCTTGG TTCCCCTTGO AATCATTTTC ATTTTTCTTC GQCOATATTT 2760 

TTTGQAAAOO TOUUZAGATQ TOAAGGQOCT GGAATCIACH ACTCX3GAGTC CAGTGTTTTC 2820 

CaVCTTGTCA TCTTCTCTCC AG G GGCT C TG GACGAT0060 GCATAGAAAG CAOAAaAOAG 2880 

GTGTCAGGAA CTGTTTGATG CACACCAOOA TTTACATTCA GAGGCTTGGT TCTTGTTTrT 2940 

GACAACGTCC CGCTGGTTCG CCGTCOGTCT GGATGCXZATC TGTGCCATGT TTGTCATCAT 3000 

OGTTGCCrrr GGGTCCCTGA TTCTGGCAAA AACTCTGQAT GCCX3G6CAGG TTGGTTTGGC 3060 

ACTGTCCTAT GCCCTCAOGC TCATG6GGAT amCAOTOG TGTGTTGGAC AAAGTGCTGA 3120 

AGTTGAOAAT ATQATGATCT CftGTAGAAAG OOTCATTQAA TACACAGACC TTGAAAAAGA 3180 

AGCACCTTGG GAATATCAOA AAOGCCCACC ACCAGCCTGG CCCCATGAAG GAGTX»^TAAT 3240 

CTTTGACAAT GTGAACTTCA TGTACAGTCC AGOTGGGCCT CTGQTACTGA AGCATCTGAC 3300 

AGCACTCATT AAATCACAAG AAAAGGTTGG CATTGTGGGA AGAACOCGAG CT6GAAAAAG 3360 

TTCCCTCATC TCftGCC C TTT TTAGATTGTC AGAACC06AA GOTAAAATTT GGATTGATAA 3420 

QATCrTGACA ACTGAAATTG 6ACTTCA0QA TTTAACQAAG AAAATGTCAA TCATACCTCA 3480 

GGAACCTOTT TT G TTCACTG GAACAATGAG GAAAAACCIG GATCOCTTTA AGGAGCACAC 3540 

GGATGAOGAA CTGTOOAATG GCTTACAAQA GOTACAACTT AAAGAAAGCA TTGAAGATCT 3600 
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TOCTGQTAAA 
ACAACTGOTG 

PTFtGOCCkC 
OUU3ATAATG 
OCAAAATAAA 
TGCCCTCACT 
OICT6ACCAC 
GACAGQ^CTG 
TTTGGACTAT 
CAAOATGCTA 



AT6GATACTG 
TGCCTTGCCA 
AATGT6CSATC 
TGCACGGTGC 
GTTTTA6ATT 
GAGAGCCTAT 
GAAACAGCAA 
ATGGTTACAA 
TGAATCCAAC 
GTAAACCACA 
GTTCATTTGA 



aattaocaga 
gggcaattct 
caagaactga 
taaocatt(k: 
cagoaagact 
tttacaagat 

AACAGGTATA 
ACACTTCCAA 
CAAAATOTGA 
TTGTACTTTT 
ATATTTCTCC 



ATCAGGATCC 
CAGOAAAAAT 
TGAGTTAATA 
ACACAGATTO 
GAAAGAATAT 
GGTGCAACAA 
CTTCAAAAGA 
TGGACAGCCC 
AGTC O QTTCC 
TTTTACTTTO 
C 



AATTTTA6TG 
CAQATATTQA 
CAAAAAAAAA 
AACACC ATTA 
GATGAGCOGT 
CTGOQCAAGG 
AATTATCCAC 
T06ACCTTAA 
OAAGQGATTT 
6CAACAAATA 



TT8GACAAAO 
TTATTGATGA 
TCOQGGAGAA 
TTGACAGGGA 
ATGTTTTGCT 
CAGAAGCXX3C 
ATATTGGTCA 
CTATTTT06A 
TCCACTAGTT 
TTTATACATA 



3660 
3720 
37B0 
3840 
3900 
3960 
4020 
4080 
4140 
4300 



Seg 10 KO: 572 Protein sequence 
Protein Accession «< AAC27076.1 



KLPVYQBVKP 
QGFWDKEVLR 
EHYDPHDSVA 
LSNMAMGKTT 
MAVLIILLPL 
NIiRKKEISKI 
VRLTVTLFFP 
KASBTPTX^QO 
QPHVP8GTLR 
ARVNLARAVY 
8QILXLXSQX 
SQQSSRPSLK 
NTAAQVAYVL 
ARSLLVFYVL 
LDFIQTLLQV 
FSHIiSSSU20 
XXVAFGSLZL 
KEAPWByQKR 
KSSLISALFR 
KTDEELHNAL 
DSATANVDPR 
UiQNKESIiFy 
FETAL 



H 
I 

HPLQDAKICS 
ASIDAQKPSL 
LNTAYAYATV 
TGQIVNLL8H 
QSCFQKLF8S 
LRSSCLRGMN 
8AXERVSEAX 
LSFTVRP6EL 
SNILFCKKYE 
QDADZYLIiDD 
MVQKGTYTBP 
DGUVLB8QDTB 
ODHHIiSYHAN 
VNSSQTLHNK 
VGWSVAVAV 
LWTIRAYKAE 
AKTLDAGQVG 
PPPAWFHEGV 
LSEPEQKIWI 
QEVQLKETIE 
TDBLIQRRIR 
XMVQQLSKAE 



21 
I 

RVFFWWLNPL 
TRAZIKCyWK 
LTFCTLILAX 
DVMKPDQVTV 
LRSKTATFTD 
LASFFSASKZ 
VSXRRlQTFIi 
LAWGPVGAG 
KERYEKVIKA 
PL8AVDAEV8 
LKS6XDFQSL 
NVFVTXiSEBN 
KQSMLNVTVN 
MFBSILKAFV 
IPHIAIPLVP 
EROQELFDAH 
XiAZiSyALTLM 
XXFDNVNFMY 
SKXLTTEIGL 
DIiPGKMDTEL 
EKFAHCTVLT 
AAALTBTAKQ 



31 
I 

FKXGHKRRLE 
SYIiVUSXFTL 
LRHLYFYHVQ 
FLHFLHAOPL 
ARIRTMMEVI 
IVPVTFTnfV 
UiDBXSQRHR 
KS8LLSAVM 
CALKKDIiQLL 
RHLFEIiCZCQ 



R8BGKV6FQA 
GGGNVTBKLD 
LFFDRNPZGR 
LGZZFZFLRR 
QDLHSEAWFL 
GMFQWCVRQS 
SPQOPLVIiKR 
HDLRKKMSZI 
AESQSKFSVG 
ZAHRLNTXZD 
VYFKRNYFHX 



41 

! 

EDSMYSVIiPE 
IBE8AKVIQP 
CAGMRZJCVAM 
QAXAVTALLH 
TGZRXIKNYA 
LLOSVXTASR 
QLP8DGXXMV 
BLAPSHQLVS 
EDGDLTVIGD 
XLHSKITZLV 
PPVPGTPTLR 
YXNYPBAOAH 
XdlWYLOZYSG 
ZLNRFSKDZG 
YPLBT8R0VK 
FLTTSRMFAV 
AEVENMMZSV 
LTALXK8QBK 
PQEPVIiFTGT 
QRQLVCIiARA 
8DKZMVLDSG 



51 

I 

ORSQHUaEEL 
XFLGKXXNYF 
CBMXYRKALR 
MEXGZSCLAG 
WEKSFSNLIT 
VFVAVTLYGA 
HVQDFTAFWD 
VHGRXAYVSQ 
RGTTLSGGQX 
THQLQYLKAA 
NRTFSB8SVW 
HXVFXFLXLL 
LTVATVXiFGX 
HXiDDXiLPLTF 
RLESTTRSPV 
RLDAICAMFV 
ERVXEV7DL8 
VGXVGRTQAG 
MRKNLDPFKE 
ZLRKUQZLZZ 
RLKSYDEPYV 
SNQOPSTLTX 



Seq ZD NO: 573 DNA sequence 

Nucleic Acid Accession Eos sequence 

Coding sequence i 1 . . 13 65 



ATGGAATCAA 
6GCATAAATG 
TTTGCCAAAT 
AGAAATCCTA 
GATGCTCTCA 
CTGTGGGACC 
AGQATAAACC 
TTQATTGTCA 
GCCA6CCGGC 
CTTOCCCOCC 
ATTGAAAATT 
AOCTTGGCCA 
AGAAACCAAC 
ATAGTTGCCA 
CAACTTTATT 
TGTAOAAAAC 
CTCTGCTTAC 
GTTCATGCAA 
ATCTCCTTTQ 
TCAGTGAGCA 
OTOQCTCTGC 
GAAGAGTACT 
ATTGTAATTC 



11 
1 

TCTCTATGAT 
GTATCAAAGA 
CCTTGACCAT 
AGTTTGCTTC 
CAAAAACAAA 
TGAGACATCT 
AGTACCCAQA 
AAGGATTTAA 
AQQTTTATAT 
AGTTGAATTT 
TACCCCTACX3 
CATTTTTTTT 
AGAGTGACTT 
TTACTTTGCT 
AOGGCACCAA 
AGCTTQGATT 
OQATGAGAAG 
ATATTGAAAA 
GCATAATGAG 
ATGCTTTAAA 
TCATAAGTAC 
ACAQATTTTA 
TGQATCTTTT 



21 
I 

GGGAA6CCCT 
TGCAAGOAAO 
TOGACTTATT 
TGAATTTTTT 
TATAATATTT 
6CTT6TGG6T 
ATGCAATQCT 
TGTTGTCTCA 
ATGCAGCAAC 
CATTCXX31TT 
ACTCTTTACT 
CCXTTATTCX: 
TTACAAAATT 
CTCCCTAGTA 
GTATAGGAQA 
ACTAAGTTTT 
GTCAGAOAGA 
CTCTTGOAAT 
CCTTGGCTTA 
CTGGAGAGAA 
TTTCCATGTT 
TACACCAOCA 
GCAGCTTT6C 



31 

I 

AAGAGCCTTA 
GTCACTGTAG 
AGATGCX36CT 
CCTCATGTGG 
GTTGCTATAC 
AAAATCCTGA 
6AATATTTQG 
GCTTOGGCAC 
AATATTCAAG 
GACTTGGGAT 
CTCTGQAOAG 
TTTGTGAGAO 
CCTATAQAGA 
TACCTOGCAG 
TTTCCACCTT 
TTCTTOGCTA 
TATTTGTTTC 
GAGGAAGAAG 
CTTTCCCTCC 
TTCAOTTTTA 
TTAATTTATG 
AACTTTQTTC 
AGATAGOCAG 



41 

1 

6TGAAACTTG 
OTGIGATTGG 
ATCATGrOOT 
TAGATGTCAC 
ACAQAGAACA 
TTOATQTGAG 
CTTtZATTATT 
TTGAGTTAGO 
0G06ACAACA 
CCTTATCATC 
GG0CA6TGGT 
ATGTQATTCA 
TTOTQAATAA 
GTCTTCTGGC 
GGTTGGAAAC 
TGGTOCATQT 
TCAACATOQC 
TTTGGAQAAT 
TGQCAOTCAC 
TTCAGTCTAC 
GATGGAAACX3 

r niei'CiTCr 

ACIGA 



60 
120 

leo 

240 
300 
360 
430 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1030 
1080 
1140 
1200 
1260 
1320 



51 
I 

TTTACCTAAT 
AAGTGGAGAT 
CATAGGAAGT 
TCATCATGAA 
TTATACCTCC 
CAATAACATG 
CCCAGATTCT 
ACCTAA66AT 
GGTTATTGAA 
AGCCAOAGAO 
GGTAGCTATA 
TCCATATGCT 
AACCTTACCT 
AGCTGCTTAT 
CTGGTTACAG 
T6CCTACAGC 
TTATCAGCAG 
TGAAATGTAT 
TTCTATCCCT 
ACTTGGATAT 
AGCTTTTQAO 
TTT G OOCTCA 



Seq ZD KOt 574 Protein sequence 
Protein Accession fft Eos sequence 



1 
I 

MESZSMMGSP 
RHPKPASEFF 
RXKQYPESHA 
LARQLKPZPZ 
RNQQSOFYKZ 
CRKQLGLLSF 
ISFOXMSLGXi 
BBYYREYTPP 



11 
1 

KSLSETCLPN 
PHWDVTHHB 
EYliASIiPPOS 
DLGSLSSARB 
PZBZVNKTLP 
PPAMVHVAYS 
LSliLAVTSXP 
NFVLALVLPS 



21 
I 

GINGZKDARK 
DALTKTNZZP 
LIVKQFNW8 
XBIZiPLRliFT 
ZVAXTIjLSLV 
Z«CLPKRR8BR 
SVSNALNWRB 
ZVILDLLQLC 



31 
I 

VTVGVZGSGD 
VAZHRBHYT8 
AHALQLGPXD 
LHRGPWVAI 
YXiAQIjLAAAY 
YLFliNMAYQQ 
F8FIQSTL0Y 
RYFD 



41 

I 

FAKSLTZRLZ 
LNDLRHIiLVQ 
ASRQVYZCSN 
SZATFFFLY8 
QLYYGTKYRR 
VHANZENSWN 
VALLZSTFHV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
730 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



51 
I 

ROGYHWZGS 
KZLZDVSNim 
NZQARQQfVZB 
FVRDVIHPYA 
FPPWLETWLQ 
EEEVWRZEMY 
liZYGWKRAFB 



60 
120 
180 
240 
300 
360 
420 
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Seq ID NO: S7S DWA sequence 

Nucleic Acid Accession «i NH_001873.X 

Coding sequence t 3.. 1721 



1 11 21 31 41 SI 

I i I I I I 

AAATGOOQTG CCOQTCTCTC OOCCGGCOOC CTGCCTOQCA GTGGTTTCTC CTGCAGCTCC 60 

CCTGGGCTCC GCGGCCAGTA GTGCAGCCCG TGGAGCOCOG GCTTTOCOOO T C rOCT C TG G 120 

GTGGCCGCAa TGC6CGG6CT QACACTCATT CAGCOQGGGA AGQTQAQGOS AGTA6A6GCT 180 

G6TGCGGAAC TTGCCGCCCC CAGCAGOGCC GGCGGGCTAA GCCCAGGGCC GGGCAGACAA 240 

AAGAGGCOGC CCGCGTAGGA AGGCACGGCC GGOSGOGGCG GAGCX3CAGCG ATGGCCGGGC 300 

GAGGGGGCAO 060GCTGCTG GCTCTGTGCG GGGCACTGGC TGCCTGCGGG TGGCTCCTGQ 360 

GCGC0GAA6C CCAGGAGCCC GGGGCGCCCG CGGOSGGCAT GAGGCGGCGC OGGOGQCTOC 420 

AGCAAOAOOA COQCATCTCX: TTCGAGTACC ACOGCTACCC CGAGCTGCGC GAGGOGCTOS 480 

TGTCC G TGT G GCTGCAGTGC ACCGCCATCA GCAGGATTTA CACGGTGGGG CXSCAGCTTOG 540 

AGGGCOGGGA GCTCCTOGTC ATCGAGCTGT CCQACAACCC TGQOGTCCAT GAGCCTGGTG 600 

AGCCTQAATT TAAATACATT GGGAATATQC ATGGGAATGA GGCTGTTGGA OSAGAACTGC 660 

TCATTTTCTT GGCCCAGTAC CTATGCAACG AATACX3U3AA GOOGAAOQAG ACAATTGTCA 720 

ACCTGATCCA CAGTACCCX3C ATTCACATCA TGCCTTCCCT GAACXXaOAT GGCTTTQAGA 7B0 

AGGCAGCGTC TCAGCCTGGT GAACTCAAGG ACTGGTTTGT GGQTOGAASC AATGCCCAOO 840 

GAATAGATCT QAACCOQAAC TTTOCAGACC TGQATAGGAT AGTGTAOGTG AATGAGAAAG 900 

AAGGTGGTCC AAATAATCAT CTGTTGAAAA ATATGAAGAA AATTOTQQAT CAAAACAC3UV 960 

AGCTTQCTCX: TGAGACXyVAO GCTQTCATTC ATTGGATTAT OQATATTCCT TTTGTGCTTT 1020 

CTGCXaU^TCT CCATGGAGGA GACXTTTGrGG CCAATTATCC ATATGATGAQ ACGCGGAGTG lOBO 

GTAGTGCTCA CGAATAC3W3C TCCTCCCCAG ATGAOQCCAT TTTCCAAAOC TTGGCCOGGQ 1140 

CaTACTCTTC TTTCAACCCG GCCATGTCTQ ACCCCAATCG GCCACCATGT OGCAAGAATG 1200 

ATGATGACAO CA6CTTTGTA GATX3GAACCA CCAACG6TGG TGCTTGGTAC AGCX3TACCT0 1260 

GAGOQATQCA AOACTTCAAT TACCTTAGCA GCAACTGTTT TGAGATCACC GTGGAGCTTA 1320 

6CTGTGAGAA GTTCCCACCT GAAGAGACTC TGAAGACCTA CTG6GAGGAT AACAAAAACT 1380 

OCCTCATTAO CTACCTTGAO CAOATACACC GAGGAGTTAA AGGATTTOTC CGAGACCTTC 1440 

AA06TAAC0C AATTGCGAAT GCCACCATCT CC0TGGAAG6 AATAOACCAC GATGTTACAT 1500 

COGCAAAGGA TCGTOATTAC TGGAGATTGC TTATACCTGQ AAACTATAAA CTTACAGCCT 1560 

CAGCTCCAGO CTATCTGGCA' ATAACAAAGA AAGTGGCAOT TCCTTACAGC CCTGCTGCTQ 1620 

GGGTTGATTT TGAACTGGAO TCATTTTCT6 AAAGQAAAGA AOAGGAGAAO 0AAQAATT6A 1680 

TOGAATQOTO OAAAATGATQ TCAGAAACIT TAAATTTTTA AAAAGGCTTC TAGTTAGCTG 1740 

CTTTAAATCT ATCTATATAA TOTAOTATGA TGTAATGTGO TCTTTTTTTT AQATTTTGTG 1800 

CAGTTAATAC TTAACATTGA TTTATTTTTT AATCATTTAA ATATTAATCA ACTTTCCTTA 1860 

AAATAAATAG CCTCTTAGGT AAAAATATAA GAACTTQATA TATTTCATTC TCTTATATAG 1920 

TATTCATTTT CXTTACCTATA TTACACAAAA AA6TATAGAA AAGATTTAAO TAATTTTGCC 1980 

ATCCTAOGCT TAAATGCAAT ATTGCTGOTA TTATrTACAA T6CAGAATTT TTT6AGTAAT 2040 

TCTAGCTTTC AAAAATTAGT GAAOTTCTTT TACTOTAATT GGTGACAATO TCACATAATG 2100 

AATGCTATTG AAAAGGTTAA CAGATACAOC TCGGROTTGT GAGCACTCTA CTGCAAGACT 2160 

TAAATAGTTC AGTATAAATT GTCGTTTTTT TCTTGTGCTG ACTAACTATA AGCAT6ATCT 2220 

TGTTAATGGA TTTTTQATGO 6AAGAAAAGQ TACATGTTTA CAAAGAGGTT TTATGAAAAG 2280 

AATAAAAATT GACTTCTTGC TTGTACATAT AGGAGCAATA CTATTATATT ATGTA0TCG6 2340 

TTAACACTAC TTAAAAGTTT AGG6TTTTCT CTTOQTTQTA QAOTGGCCCA GAATTOCATT 2400 
CTGAAT6AAT AAAGGTTAAA AAAAAATCCC C3M3TGAAAAA AAA 



Seq ID KOt 576 Protein sequence 
Protein Accession «i MP 001864.1 



1 
I 

MAGRGGSALL 
EALVSVHLQC 
RELLIFIiAQY 
NAQGIDLNRN 
FVLSAMLHGO 



NKNSLI8YLE 
LTASAPGYLA 



11 
I 

AI.CX3ALAA0G 
TAISRIYTVO 
LCSTEyQKGNB 
FPDIiDRIVYV 
DLVAMYPYDE 
DGT71IGGAWY 
QIHRGVKGFV 
ITKKVAVPYS 



21 

I 

WLLGAEAQEP 
RSFEGRELLV 
TZVNLZKSTR 



31 

I 

6APAAGMRRS 



TRSGSAHEYS 
SVPGGMQDFN 
RDKX3NPIAH 
PAAOVDFELE 



IHIMP8UIPD 
LLXMMKKIVD 
88PDDAIFQS 
YLSSNCFBXT 
ATI8VBQIDH 



41 

1 

RRLQQEDOIS 
EPOEPBPKrX 
6FEKAASQPG 
QNTKXiAPBTK 
LARAYSSFIfP 
VBL5CEKFPP 
DVTSAKDGDy 
EBLHEWWKKH 



51 
I 

FEYHRYPSLR 
GNMB6NEKVQ 
ELKDWPVGR8 
AVIHWIMDIP 
AMSDPNRPPC 
EETIiKTYWED 

moiLiPGtmc 

SETLNF 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 577 DWA aequence 

Nucleic Acid Accession Eos sequence 

Coding sequence: 1..933 



ATGT6CAGGA 
TTOQACAAGA 
TTCOCCTGTG 
QACTGTCGC3G 
QCCCGCTACC 
AATAACIGTC 
GG6CAGGTOT 
ATCATG6GCA 
CACCAGOGGA 
CT6CTGTCCC 
AATAATGGCA 
CCACCCTCCT 
COGCCCTACT 
OGOTCGQGQA 
GACACCAOCC 



11 

I 

ATGGAOQQTG 
GTQATGAGAA 
CCAGOGGCAT 
ATGGCA6CGA 
ACTGCAAGAA 
AAGACAACAO 
TT8TQACTTC 
OCTCOGTCAT 
AGOGGAACAA 
GCCTGQTGGT 
TCCAGTATGT 
ACTGOQAGGC 
CTTCTGACAC 
GTGCCAACAG 
ACAGCCOQGO 



21 

I 

GATCCOGGGC 
GGAGT6000C 
GCATTGCATC 
TGAAGAGAAC 
OGGCCrCTGT 
TSATGAGGAA 
A0A6AACCAA 
' mT BT G CTQ 
CCTCATGAOQ 
OCTOGATCAC 
GGCXIAGCCAG 
CTTGCTQ6AC 
GGAATCTCTG 
TQCCAGCTCC 
GCAGCCTGOC 



31 
I 

OCCT0GCA6T 
AAGGCTA AGT 
ATTGOTOGCT 
TGCACAGCAA 
ATTGACAAGA 
AGCTGTGAAA 
CTTGTGTATT 
GTOGTOGOOC 
CTGCCOQTGC 
CCCCACCACT 
GOGGAGCAGA 
CAGAQGOCTO 
AACCAAGCOQ 
CAOQCAGCCA 
CCCCAGGAGG 



41 

I 

GTGA06GGCT 
CGAAATGTGG 
TCOQOTGCAA 
ACCCTCTGCT 
GCTTCATCTG 
GTTCTCAAQA 
ACCCCAGCAT 
T0CTG6CACT 
AOOGGCTGCA 
GCAACOTCAC 
ATGCX3TCGGA 
OOTGGTATGA 
AOCT GO OCCC 
GCAGCCTCCT 
OCACTQCTGA 



SI 
I 

GCCTGACTGC 
CCCXIACCTTC 
TOGGTTTQAG 
TTGCTCCACC 
OQATGQACAG 
ACOOGGCAGT 
CACCTATGCC 
GGTCTT6CAC 
GCACCCTGTG 
CTACAAOOTC 
AGTAGGCTCC 
CCTTCCTOCA 
CTACOSCTCC 
GAGCGTOQAA 
GCCCAOGGAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



1174 
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TCTGAiGCCav GCCAGGGCAC TQAAGAAGTA TAA 



Seq ID NOs 578 Protein secwence 
Protein Accession tti Eos sequeiice 



1 II 21 31 41 51 

I I 1 I I I 

KCSNGRCIPQ AWQCD6LPDC FDKSDBKBCP KAKSKOGPTP PPCASGIHCI ZGRPRCHGFB 60 

DCPD68DEEN CTAIIPLI.CST ARYHCaCNGIiC ZDKSFXCDGO NNGC^SOBB SCBSSQEPGS 130 

GQVPVTSEKQ LWYPSXTYA ZZGSSVXPVlf WALLAbVLR HORXRIOILKT LPVBSLQHFV 180 

LL6RLWL0H PHKCNVTyNV NKOZQWASQ ABQNA8BVG8 PP8YSEALLD QRPAWYDLPP 240 

PPY8SDTBSL NQADIiPPYHS RSGSANSASS QAASSLLSVB DTSHSPGQPO PQEQTABPRD 300 

SEPSQOTBBV 



Seq ID NO: 579 DliA sequence 
Nucleic Acid Accession #: AF179374.1 
Coding sequence t 1 . . 1125 



1 11 31 31 41 51 

I I I I I i 

ATGGTGCTGT GGGA6TCCCC G06GCAGTGC AGCAGCTGQA CACTTTGCGA GGGCTTTTGC 60 

TGGCTGCTGC TGCTGCCCGT CATGCTACTC ATC6TA6CCC GCCOGGTGAA GCTOGCTGCT 120 

TTCCCTACCT CCTTAAGTGA CT6CCAAA0G CXX^OGGCT GGAATTGCTC TGGTTATGAT 180 

QACAQ A GAAA ATOATCTCTT CCTCTGTGAC A0C31ACACCT OTAAATTTGA T6G6QAA1X3T 240 

TTAAGAATTG GAGACACfCST GACTTGCGTC TGTCAGTTCA AGTGCAACAA T6ACTATGTG 300 

CCTGTOTGTG OCTCCAATGG GGAGAGCTAC CAGAATOAGT GTTACCTOCa ACAGGCT6CA 360 

T6CAAACAGC AGAGTGAGAT ACrrGTGGTO TCAGAAGGAT CATGTGCCAC A6ATGCA0GA 420 

TCAGGATCTG GAGATGGAQT GCATGAAGGC TCTG6A6AAA GTAOTCAAAA G6AGACATCC 480 

ACCTOTG AXA TTTG CCAGTT TGQTGGAGAA TOTQAOGAAG AT6GGGAG6A TGTCTGGTGT 540 

GTGTGTAATA TTGACTGTTC TCAAAOCAAC TTCAATCCCC TCT606CTTC TGATGGOAAA 600 

TCTTATGATA ATGCATGCXZA AATCAAAGAA GCAT0GT6TC AGAAACAGGA GAAAATTGAA 660 

GTCATGTCTT TGGGTCX3ATG TCAAGATAAC ACAACTACAA CTACTAAGTC TGAAGATGGG 720 

CATTAT6CAA GAACAGATTA TQCAGAGAAT GCTAAGAAAT TAOAAGAAAG TGCCAGAGAA • 780 

CACCACRTAC CTTGTCCGQA ACATTACAAT GGCTTCTGCA TGCATGGGAA GTGTQAGCAT 840 

TCTATCAATA TGCAGGA6CC ATCTTGCAGG TGTGATGCTG GTTATACTGG ACAACACTGT 900 

QAAAAAAAOQ ACTAC3U3TOT TCTATACGTT GTTCCCGGTC CTGTACGATT TCAGTATGTC 960 

TTAAT06CAG CTQTGATTG6 AACAATTCAG ATTGCTOTCA TCTGTGTGGT GQTCCTCTGC 1020 

ATCACAAGGA AATGCXICCAG AA0CAACA6A ATTCACAQAC AGAA6CAAAA TACAGG6CAC 1080 
TACAGTTCAO ACAATACAAC AAGAGOGTCC AOQAGGTTAA TCTGA 

8eq ZD NO: 580 Protein sequence 
Protein Accession «i NP_057276.2 

1 11 21 31 41 51 

I I I I I I 

MVLWE8PRQC SSWTLCBGPC ULLLLPVMLL IVARPVKIAA PPTSLSDCQT PTGVINCSGYD 60 

DRKNDLFLCD TNTCKFDGEC LRZGDTVTCV CQFKQ4NDW PVCGSKQESY QNEOrLRQAA 120 

CKQQSEILW 8EGSCATDAG SGSGDGVHE6 8GETSQXETS TCDICQF6AE CDEDAEDVWC 180 

VCNIDC8QTN FMPLCASDGK 8YX7HAGQIKE ASOQKQEKIB VMSLGRCQDN TTTTTKSEDG 240 

HYARTDYABN ANKLEBSARB HHIPCPEHYN GFCMHGKCEH 8IMMQBPSCR (SAOYTGQHC 300 

EKKDYSVLYV VPGPVRFQYV LIAAVZGTIQ ZAVICVWLC ZTRKCPRSHR IHRQKQNTGH 360 
YSSDMTTRAS TRLZ 



Seq ID NOi 581 DNA sequence 
Nucleic Acid Accession #t 878203.1 
Coding sequence I 1..2190 



1 11 21 31 41 51 

I I 1 I I I 

ATGA A TCCTT TCCAGAAAAA TGAGTCCAAG 6AAACTCTTT TTTCAGCTOT CTCCATTGAA 60 

OAGGTACCAC CTOGACCACC TAaCCCTCGA AAOAAOCCAT CTCOGAOVAT CTGTGGCTCC 120 

AACTATCCAC TGAGCATTGC CTTCATTGIG 6TGAATGAAT TCTGCGAGOG CTTTTCCTAT 180 

TATOOAATOA AAGCTOTGCT OATCCIOTAT TTCCTOTATT TCCTGCACTG GAATGAAGAT 240 

ACCTCCACAT CTATATACCA TGCCTTCAGC AGCCTCTGTT ATTTTACTCC CATCCTGGQA 300 

GCAGCCATTQ CTGACI06TG GTTGGQAAAA TTCAAQACAA 7CATCTATCT CTCCTTGOTG 360 

TATGTGCTTG GCCATGTGAT CAAGTGCTTG GGTGCCTTAC CAA7ACTGGG AGQACAAGTG 430 

GTACACACAO TCCTATCATT GATCGGCCTO AGTCTAATAO CTTTGGaOAC AGGAGGCATC 480 

AAACCCTGTG TGGCA6CTTT TGGTGGAGAC CAGTTTGAAQ AAAAACAT6C AGA66AACGG 540 

ACTAOATACT TCTCAGTCTT CTACCTGTCC ATCAATGCAO G6AGCTTGAT TTCTACATTT 600 

ATCACACCCA TGCTGAGAGO AOATGTGCAA TGTTrTGGAG AAGACTGCTA TGCATTG6CT 660 

TTTG6A6TTC CAG6ACIGCT CATGGTAATT GCACTTGTTG TGTTTGCAAT GGGAAGCAAA 730 

ATATAGAATA AACCACCCOC TGAAGGAAAC ATAGTGGCTC AAOTTTTCAA AT3TATCTGG 780 

TTTGCTATTT CCAATCOTTT CAAGAACOGT TCTGGAGACA TTCCAAAGOG ACAGCACTGG 840 

CTAQACTGGQ CAGCTOAGAA ATATCCAAAG CAGCTCATTA TOGATGTAAA GGCACTQACC 900 

AGGGTACTAT TCCTTTATAT CCCATTGCCC ATGTTCTGGG CTCTTTTGGA TCAGCAGGGT 960 

TCAC6ATGGA CTTTGCAAGC CATCAGGATG AATAGGAATT 'IXJ OGG lTi rr TGTGCTTCAO 1020 

COQQACCAGA TOCAGOTTCT AAATOOCTTT CTGOTTCTTA TCTTCATCCC GTTGTTTGAC 1080 

TTTGTCATTT ATCGTCTGGT CTCCAAGTGT GGAATTAACT TCTCATOVCT TAGQAAAATG 1140 

GCTGTTGGTA TGATCCTAGC GTGCCTGGCA TTTGCAGTTQ CGGCAGCTGT AGAGATAAAA 1200 

ATAAATOAAA TGGCCCCAGC CCAQTCAGGT CCCCAGGAflQ TTTTCCTACA AGTCTTGAAT 1260 

CTGGCAGATQ A1GAG6TQAA GOTGACAGTO OTGQGAAATG AAAACAATTC TCT6TTGATA 1320 

6AGTCCATCA AATCCTTTCA GAAAACACCA CACTATTCCA AACTGCACCT GAAAACAAAA 13 80 

A6GCAGQATT TTCACTTCCA CCTGAAATAT CACAATTTGT CTCTCTACAC TGAGCATTCT 1440 

GTGCAGGAOA AOAACTGOTA CAGTCTTGnC A7T00TGAA0 ATGG6AACA0 TATCTCCAGC 1500 



1175 



wo 03/042661 



PCT/US02/36810 



ATGA TGGTAA AQGATACAQA AA6CAAAACA ACCAATGGGA TGACAACCGT GAGGTTTGTT 1560 

AACACTTTGC ATAAAGATOT CAACATCTCC CTGAaTACAO ATACXTCTCT CAATGTTGGT 1620 

GAAOACTATO GTGTGTCTGC TTATAGAACT GTGCRAAQAQ GAGAATACCC TGCAGTGCAC 1680 

TGTASA ACAG AAQATAAOAA CTTTTCTCTG AATTTOGQTC TTCTAGACTT TGGTGCAGCA 1740 

J TATCTGTTTG TTATTACTAA TAACACCAAT CAGOGTCTTC AGGCCTGGAA GATTGAAGAC 1800 

ATTCCAGCCA ACAAAATOTC CATTOOOTGG CAGCTACCAC AATATGCCCT GGTTACAGCT 1860 

GGOOAGGTja TGTTCTCTQT CACAGGTCTT GAGTTTTCTT ATTCTCAGGC TCCCTCTAGC 1920 

ATGAAATCT G TGCTCCAGGC AGCTTGGCTA TTGACAATTG CAGTTGGQAA TATCATCGTG 1980 

- ^ CTTCTTOTGQ CACAGTTCAG TGGCCTGGTA CAGTGGGCCX3 AATTCATTTT GTTTTCCTGC 2040 

lU crarrGCTGO tgatctgcct gatcttctcc atcatgggct actactatgt tcctotaaao 2100 

ACAGAGOATA T6CGGGGTCC AGCA6ATAAG CACATTCCTC ACATCCAGGG 6AACATGATC 2160 
AAACTAGAGA CCAAGAAGAC AAAACTCTGA 

^ . Seq ID NOt 582 Protein gequence 
IJ Protein Accession #i AAB34388.1 

I 11 21 31 41 51 

t I I I I I 

MNPPQKNBSK BTLP8PV81E BVPPRPPSPP KKPSPTICSS NYPLSZAPIV VNEFCERFSY 60 

ZU YQUCAVLILY PLYPLHWNBD TST8IYHAPS SLCYPTPILG AAIADSWWK FKTIIYLSLV 120 

YVU3HVIKSL GALPXLGGQV VHTVL8LI0L SLIALGTGOI KPCVAAFGGD QPBEKHABER 180 

TRYPSVFYLS INAGSLISTF ITPMLOIGDVQ CPGEDCYAIiA FGVPGLLMVI ALWPAMGSK 240 

lYNKPPPBGM IVAQVPKCIW FAISNRPKNR SQDIPKRQHW LDWAAEKYPK QLIHDVKALT 300 

RVIiPIiYIPLP MFWALLDQQG SRWTLQAIRM NRZMFFVU) PDQMQVI^PF LVLXFIPLPD 360 

Zj FVIYRLVSKC GINF8SLRKM AVGMILACLA FAVAAAVEZK ZNEKAPAQSG PQBVPLQVLN 420 

LADDBVKVTV VGNENNSLLl ESIKSPQKTP HY8KLHLKTK SQDPHPHLKY HNLSLYTEHS 480 

VQBKNWYSLV IREDQJSISS MMVKDTBSKT TNGMTTVRFV NTLHKDVNI8 LSTDTSLNVG 540 

BDYGVSAYRT VQRGBYPAVH CRTBDKNPSL NLGLLDFGAA YLFVITONTN QOLQAWKIED 600 

IPANKMSIAH QLPQYALVTA OBVMP8VT6I* BPSYSQAPSS MXSVXiQAAWL LTIAVGNZXV 660 

J\) LWAQPSGLV QWAEPILPSC IjLLVICLXFS IMQYYyVPVX TEONRGPADK aiPHZQQimZ 720 
KLETKKTKL 

Seq 10 NOt 583 DHA Bequence 

Nucleic Acid Accession «: NM_032642.1 

CSoding sequence I 184.. 1263 ~ 

1 11 21 31 41 51 

I I I t I I 

GACCATTAGC AGGCACCX31G 6CCTGTCTTT GGCTCOOAAA GGGTGOOCCC CAAT6TAGGC 60 

4U TAOTTTGAAC CTAGQAACTG CAGGACCAGA GAGATTCCAC IGGAGGCTGA TG6AO0OOTO 120 

ACAGAGGGAA CCCTACTCTG QAAACTQTCA GTCCCAGGGC ACTGGGGAGG GCT6AGGCCG 180 

ACCATGCCCA GCCTGCTGCT GCTOTTCACXS GCTGCTCTGC TGTCCAQCTG GGCTCAGCTT 240 

CTOACAGACG CCAACTCCTG GTGGTCATTA GCTTTGAACC CGGTGCAGAG ACCCGAGATG 300 

TTT ATCA TCG GTGCCCAGOC OSIQTGCAaT CAGCTTCCOQ GGCTCTGCGC TGGCCAGAOQ 360 

4D AAGCTGTGCC AATTOTACX3V 6QAGCACATQ GCCTACATAO GGGA8GGACC CAAGACTQ6C 420 

ATCAAGGAAT GCX»GCACCA OTTCOGGCAG CX3QCX3GTGGA ATTGCAGCAC AGOGGACAAC 480 

GCATCTOTCT TTGGGAGAGT CATGCAGATA GGCAGCOGAG AGACOQCCTT CACCCAOQCG 540 

GTOAOOGCCG OGGOOGTGGT CAAOQOCATC AGOOGGGCCT GCOGOOAGGG 06AGCTCTCC 600 

Cn ACCTGOGGCT GCAGCOXSAC OGOGOGGCCC AA0SACCT6C CCOSGQACTG GCTGTQGQGC 660 

3U GGCTGTGGGG ACAAOGTOQA GTAOOGCXAC 06CTT0GCCA AGGAGTTTGT GGATGCCOGG 720 

QAGCaAGAGA AGAACTTTGC CAAAGGATCA GAOQAGCAOQ QCOGGGTQCT CAT6AACCT0 780 

CAAAACAACG AQGCCGOTOO CAGGGCTGTG TATAAGATGQ CAGAOGTAGC CTGCAAATGC 840 

CAOGGCGTCT CGGGGTCCTG CAGCCTCAAO ACCTGCTGGC TOCAGCTGGC aSAOTTCOSC 900 

AA6GT0GG0G ACOQGCTGAA GQAGAAGTAC GACAGGGOGa CCGGGATGGO OOTCAOCOOC 960 

AAGGGCOGGC T6GAGCTGOT CAAC31GC0GC TTCAOCCAGC CCACCCOGGA GGAOCTGGTC 1020 

TATOTGGACC CCAGCCCOGA CTACTGCCTG CX3CAACGAGA GCAOGGGCTC CCTGQGCACO 1080 

CAGGGCOGCC TCTQCAACAA GACCT06GAQ GQCATGGATG GCTGTGAGCT CATGTGCTGC 1140 

GGaOOTCGCr ACAACCAGTT CAAGA6CX3TG CAGGTGGAGC GCTOCCACTO CAAGTTCCAC 1200 

TGGTGCTGCT TCGTCAOOTQ TAAOAAOTGC AOGGAOATOS TGOACCAOTA CATCTGTAAA 1260 

OU TAGCCCGGAG GGCCTGCTCC CX360CCC!OCC TCCACTCTGC CTCACAAAOQ TCTATATTAT 1320 

ATAAATCTAT ATAAATCtAT TTTATATTTQ TATAA6TAAA TGGGTGGGTG CTATACAATG 1380 

GAAAGATGAA AATGQAAAGG AAGAGCTTAT TTAAGAGACX3 CTGOAGATCT CrOAGGAGTG 1440 

GACTTTGCTQ GTTCTCTCCr CTTGGTGGGT GGGAGACAGG GCTTTTTCTC TCCCTCTGGC 1500 

^. GAGGACTCTC AOG ATGTA GG GACTTOGAAA TATTTACTOT CTGTCCACC31 COGCCTGGAG 1560 

Oj GAGGGAGOTT QTGOTTGGAT GGAQGA6ATQ ATCTTGTCTG 6AAGTCTAGA OTCTTT6TT0 1620 

6TTAGAGGAC TGCCTGTGAT CCTGGCCACT AGGCCAAGAG GCCCTATGAA GGTGGGGGGA 1680 

ACTCAGCTTC AACCTCGATQ TCTTCAGGGT CTTGTCC3VGA ATGTAGATGQ GTTCCGTAAG 1740 

AG6CCTGGTG CTCTCTTACT CTTTCATCCA 05TGCACTTG T606GCATCT GCA6TTTACA 1800 

GQAAOQGCTC CTTCOCTAAA ATGAGAAGTC OUVQGTCATC TCTGGCGCAO TQAOCACAGA 1860 

/U GAOATCTGCA CCTCCOGQAC TTCAGGCCTG 0CTTTCCA6C GAGAATTCTT CATCCTCCAC 1920 

GGTT CACTAG CTCCTACCTG AAGAOGAAAO GGGGCCATTT OACCXOACAT GTCAGGAAAQ 1980 

CC CTAA ACTO AATGTTTGOS CCTGGGCTGC AGAAGCCAGG GTGCATQACC AGGCTGCGTO 2040 

GAOGTTATAC TGTCTTCCCC CACCCCCGOO 6AGGGQAAGC TTGAGCTGCT GCTGTCACTC 2100 

^. CTCCftCOOAa GGAG6CCTCA CAAACCAGAO OAOGCTGCAA OGGGTCAGOC TGGCQGQGCC 2160 

/ J GGOGTGCTCA TCATCTCTGC CCCAQOTOTA OGGTTTCTCT CTGACATTAA ATOCCCTTCA 2220 
TGGAAAAAAA AAAAAGAAAA AAAAAAAAAA AA 



80 



Seq ID NO I 584 Protein Bequence 
Protein Accession ftt 1IP_116031.1 

I 11 ' 21 31 41 51 

I I I ! I i 

HPSLLLLFTA ALLSSHAQI«L TDANSWWSLA LNPVQRPSMP IIGAQPVCSQ X«PGl«SPGQRK 60 

UX2LYQBHKA YIOEGAKTGX KEGQHQFRQR RNNCSXADNA 8VFGRVHQIG SRETAFTOAV 120 

1176 



wo 03/042661 



PCT/US02/36810 



SAAGWNAIS RACRBGBLST OGCSRTARPK DLPRDHLMGG OaDNVBYGYR PAKBPVDARB 180 

RBKNFAKBSB BQ6RVLNHLQ NNEAGRRAVY KMADVACKCH 6VS0SCSLKT CWIiQLAEPRK 240 

VGDRLKEKn) SAAAMRVTRK 6RLBLVMSRP IQFTPEDLW VDPSPOYCLR NESTCSIiGTQ 300 
^ GRU2JKTSEG MDGCELMCCG RGYNQFKSVQ VERCaOCFHW OCFVRCKKCT BIVDQYICK 

Seq ID MO} 585 DKA sequence 

Nucleic Acid Accession fts Bos sequence 

Coding sequencei 1..1479 

10 1 11 31 31 41 51 

I I I I I I 

A10GCTTTGA ACTCAGGGTC ACCACCAGCT ATTGGACCTT ACTATGAAAA CCATGGATAC 60 

CAACCGGAAA ACCCCTATOC OGCACAGCCC ACTGTGGTCC CCACTGTCTA CX3AGGTCCAT 120 

1 C COOGCTCAGT ACTACCCX5TC CCCCGTCCCC CAGTACXSCCC OQAG GG TCCT QAOGCAGGCT 180 

O TCCAACCCCa TC3GTCTGCAC GCAGCCCAAA TCCCCATCOO GQACAGTGTG CACCTCAAAO 240 

ACTAAGAAAG CACTOTGCAT CACCTTQACC CTGGGGACCT TCCTOOTGGG AGCTGCGCTG 300 

GOOOCTGGOC TACTCTOGAA OTTCATGGGC AGCAAGTGCT CCAACTCIGG OATAQAGTaC 360 

GACTCCTCAO GTACCTGCAT CAACCCCTCT AACTGGTGTG ATX30C0TGTC ACACTGCCCC 420 

^ GGOGGGGAGG A0GAGAAT06 GTGTGTTOGC CTCTACGGAC CAAACTTCAT CCTTCAG6T0 480 

ZU TACTCATCTC AGAGQAAGTC CTGGCACCCT QTGTGCCAAG AOQACTGGAA OOAGARCTAC 540 

GQTO GGGCGQ CCTGCAGGGA CATGGGCTAT AAOAATAATT TTTACTCTAO CCAAGGAATA 600 

GT08AT6ACA OOOGATOCAC CAGCTTTATG AAACTGAACA CAAOTGOCGG CAATGTCQAT 660 

ATCTATAAAA AACTGTACCA CAGTGATGCX: TOTTCTTCAA AAGCAGTGGT TTCTTTACGC 720 

^ TQTATAGCCT 0C30GGGTCAA CTTQAACTCA AGCCQCCAGA GC3VGQATCGT GGGCQQOraG 780 

/.J A6CXS0QCTCC CGGGGGCXTTQ GCCCTGOCAO QTC3VSCCTGC AOGTCCAOAA CXSTCCAOSTG 840 

TGOGQAG OCT CCATCATCAC CCCC6AGTGG ATCGTGACAG CCX5CCCACT0 CGTGGAAAAA 900 

CCTCTTAACA ATCCATGGCA TTGOAOGGCA TTTGCX3GGGA TTTTGAGACA ATCTTTCATQ 960 

TTCTATGGAG COGGATACCA AOTAGAAAAA QTGATTTCTC ATCCAAATTA TOACTCCAAG 1020 

ACCAAGAACA ATGACATTQC GCTOATOAAG CTOCAGAAGC CTCTQACTTT CAAOQACCTA 1080 

jy) GTGAAACCAG TGTGTCTGCC CAACCCAGGC ATOATGCTGC AGCCAOAACA GCTCTGCTGG 1140 

AT TTCO GGGT GGOGGGCCAC OGAGGAGAAA GGGAAGACCT CAGAAGTGCT GAACGCTGCX: 1200 

AAGGTGCTTC TCATTGAGAC ACAGAOATGC AACAGCAGAT ATOTCTATQA CAACCTGATC 1260 

ACACCAGCCA TQATCTGTGC OGGCTTCCTG CAGG6GAACX3 TOGATTCTTG CCAGGGTGAC 1320 

AGTGGAGGGC CTCTGGTCAC TTCOAAGAAC AATATCTGGT GQCTQATAGG GGATACAAGC 1380 

TGGGGTTCTG GCTGTGCCAA AGCTTACAGA CCAGGAGTGT ACGGQAATQT GATGQTATTC 1440 
AC6QACTGGA TTTATOQACA AATGAGGGCA GAOGGCTAA 



40 
45 
50 



Seq ID KO: 586 Protein sequeoce 
Protein Accession ftt Eos sequence 

1 11 21 31 41 51 

I i I I I I 

MAUISGSPPA IGPYYENHQY QPEMPYPAQP TWPTVYBVH PAQYYPSPVP QYAPRVLTQA 60 

SNPWCTQPR SP86TVCT8R TKKALCITLT LOTFLVOAAL AAGLLHKFMO 8KCSNSQISC 120 

DSSGTCINPS NWCDGVSHCP GGEDQIRCVR LYGPNFILQV YSSQRKSHKP VC9QDDNMENY 180 

GRAACRDMGY KNNPYSSQGI VDDSGSTSPM KLNTSACaJVD lYKKLYHSDA CSSKAWSLR 240 

CIACGVNIiNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 300 

PLNNPWHVTTA FAGILRQSPM PYOAOYQVBK VISHPNYDSK TKNNDIALMK LQKPLTPNDI, 360 

VKPVCLPNP6 MMLQPEQZiCW I8GHGATEBK GKTSEVUIAA KVUiIBTQRC ITSRYVYDNLI 420 

TPAMICAGFIj QGNVDSOQGD SG6PLVTSKH NIHWLiaDTS NGSGCAXAYR FGVYGNVHVP 480 
TDWIYRQMRA DG 



Seq ID NOt 587 DHA sequence 
Nucleic Acid Accession #i MM 005656.1 
DD Coding sequence: 57.. 1535 

1 11 21 31 41 51 

''"III 

Af\ GTCATATTGA ACATTCCAOA TACCtATCAT TACTCGATGC TOTTOATAAC AGCAAOATGO 60 

OU CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAAOCAT GGATACCAAC 120 

OGGAAAACCC CTATCCOGCA CAGCCCACTG TGGTCCCCAC TGTCTAOGAG GT6CATCCQ0 180 

CTCAGTACTA CCCGTCCCXJC GTGCCCCAGT AOGCCCOQAG OOTCCTGAOG CAGGCTTCCA 240 

ACCCCOTCGT CTOCACGCAO CCCAAATOCC CS^TCOGOQAC A6TGTGCACC TCAAAGACTA 300 

^ AGAAAGCACT GTGCATCAOC TTQACCCTQG G6ACCTTCCT CGTGQGAQCr GGSCTGGCCG 360 

OD CTGGCCTACT CTGQAAGTTC ATGGGCAGCA AtSTGCTCCAA CTCTOGOATA QAGTX30QACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GQTGTGATGG 05TGTCACAC TGCCCCGGCG 480 

GGOAGGACGA QAATOGGTGT GTTCGCCTCT AOGGACCAAA CTTGATCCTT CSVGATQTACT 540 

CATCTCAGAO GAAGTCCTGG CACXX^XSTCT GCCAAGACGA CTGGAAOSAG AACTACGGGC 600 

GGG06GCCTG CAG66ACATG GGCTATAAGA ATAAITTTTA CICTAGCCAA OQAATAGTGQ 660 

/U ATGACAGOQG ATCCACCA8C TTTATQAAAC TGAACACAAO TGC0G6CAAT OTOQATATCT 720 

ATAAAAAACT GTACCACAGT GATGCCTOTT CTTCAAAAOC AOTGGTTTCT TTAOGCTGTT 780 

TAGCCTGOGG GGTCAACTTG AACTCAA6CC GCOVGAGCAG GATCJOTGGGC GOTOAGAGOG 840 

CGCTCXCGGG GGCCTGGCCC TOOCAGOTCA 6CCTGCA0QT CCAGAACGTC CACGTGTGCG 900 

GAOGCTCCAT CATCACCCCC 6AGTG6AT00 TGACAGCGOC CCACTOOGTQ GAAAAAOCTC 960 

/D TTAACAATCC ATGGCATTCG AOQGCATTTG OOGGGATTTT GAGAOiATCT TTCATGTTCT 1020 

ATGGAGCOGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATQAC TCCAAGACCA 1080 

AGAAC3UVTQA CATTGOGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAOTQA 1140 

AACCAQTGTG TCTGCCCAAC CCAGGCATQA TGCTOCAGCC AGAACAOCTC T6CTGGATTT 1200 

CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AOTOCTOAAC aCTGOCAAOQ 1260 

OU TGCTTCTGAT TGAGACACAG AGAT6CAACA OCAQATATOT CTATOACAAC CTO A TCACAC 1320 

CAGCCATGAT CTGTGCXX5GC TTCCTGCAGG GGAAOGTCGA TTCTTOCCAG GGTGACAGTG 1380 

GAQ GQCC TCr GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGOAT ACAAGCTGGQ 1440 

GTTCTGGCTG T6CCAAAGCT TACAGACCAG OAGTGTAOGG QAATGTGATG OTATTCAOGG 1500 

ACTG6ATTTA TOGACAAAIG AAOGCAAAC3Q GCXAATCCAC ATGOrCTTCG TCCTPGAOGT 1560 

1177 
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06TTTTACAA 
ATGATTCAGA 
TGOCATACTG 
OOOCaW BGGQ 
0TTGQJV8GCT 
TGACCATGGA 
GGAAAGGQAO 
TCCCCAGCCT 
GATGC3TG6CC 
AAG6GGAACA 
GAGGGAAGCA 
CATTGGGTGG 
TCCTAGCACC 
ATQT0CS6CCT 
ATGCTOVGTT 
CTGAGTTCAA 



GAAAACAAT6 
QOTCACTTCA 
TGCAOGCTGC 
TGATGGOCGG 
GCC0CCATT6 
GCTGTCACTT 
ACAGCCAGGT 
ACTTCACAAQ 
AGAAATAAAO 
GAAACATTTT 
ATTGAAAAGG 

CTGGAGAGTG 
CrrCAGQCCT 
TAAGGTACAC 
AGCCATCTT 



GGGCTGGTTT 
TTTTTATTAA 
AGTGGCTCCC 
CTGGTTGTGQ 
AOATCTTCCT 
CTCAGCTGCT 
GGCACCTGCA 
GGGATTTTGC 
GGAGCAOOCC 
TGTTCTTATG 
AACTTGCOCT 
AGGGAGACTC 
AATGCCCCTT 
GATAOTCATT 
TGTTTOCATG 



ACAGTGAACT 
CTGCCCAGCC 
QCACTGGOGQ 
GCT6AGTCCT 
OQATQACTTG 
GCGGCT60CC 
TGATGGGTTC 
TTCATGGOTG 
GG6TQAGAAT 
GAGCACTGCT 
AGCCTTCCTC 
GGTCCCTGGC 
G6AAATTGA0 
TTATGTTTGT 



TGCATQATTT 

TGCTCTCCCT 
TCAATTGTGG 
TTCCAGQG6C 
AGATGAAAAA 
TCTGGGGCCA 
TTAGAGCCTT 
GTGA0C3TGGT 
ATAGACAGTG 
GGTQCAGOTC 
CTCATCCTCC 
AGGQOGCQ^^ 
OTCCATGGQG 
ACAOITTQAT 



ACTCTTAGAO 
TGGCACrcrc 
AACCCCTTOT 
AAGGAAGAOG 
CAATTTTGGA 
G6AGAGACAT 
CTTGGTAOTG 
AGCAOCCCTO 
AGTCACTTGT 
GGCTTGOTGC 
TCCACCTGCA 
CTGACCCTGC 
GTT7G6CACC 
G AAATCAA GG 
GGTG6TQA0C 



Seq ID NO: 5BB Protein sequence 
Protein Accession #: MP 005647.1 



1 
I 

MALNSGSPPA 
SNPWCTQPK 
DSSGTCINPS 
OSAACRDKGY 
CLACGVNLNS 
PLNNPHHWTA 
VKFVCLPNPG 
TPAMZCAGFL 
TDWIYRQMKA 



11 

1 

IGPYYEHHGY 
SPSGTVCTSK 
NHCDGVSHCP 
KNNFYSSQGI 
SRQSRIVGGE 
FAOILRQSFM 
KMLQPEQLCW 
QGNVDSCQOD 
KG 



21 

1 

QPENPYPAQP 
TKKALCZTLT 
GGBDEHRCVR 
VDDSGSTSFN 
8ALPGAWPWQ 
FYQAOYQVQK 
ISGWGATEEK 
SGGPZiVTSNN 



31 
1 

TWPTVYEVH 
LGTFXiVGAAL 
LYGFNFILQM 
RLNT8AGHVD 
VSIiHVQNVHV 
VISBPNYDSK 
GKTSEVLNAA 
NZWWLIGDTS 



41 

1 

PAQYYPSPVP 
AAGLLWKFMG 
YSSQRKSMHP 
lYKKLYHSDA 
OSGSIITPEW 
TKNNDIALMK 
KVLLZETORC 
HGSGCAKAYR 



51 
I 

QYAPRVLTQA 



VCQDDHNENY 
C8SRAWSLR 
IVTAAHCVEK 
LQKPLTFNDL 
NSRYVYDNIiI 
POVYGHVMVP 



1620 
16B0 
1740 
1800 
1B60 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



Seq ID NOx 589 DNA sequence 
Nucleic Acid Accession «i 1IM_001935.1 
Coding sequence t 1..2301 ~ 

1 11 21 31 41 51 

I 1 Ij I I I 

ATQAAGACAC CGTGQAAGAT TCITCTGGQA CTGCTGGGTQ CTGCTGOGCT TGTCACX31TC 
ATCAC06TX3C 0CGTG6TTCT GCTGAACAAA GGCACAGATG ATGCTACAGC TGACAGT06C 
AAAACTTACA CTCTAACTGA TTACTTAAAA AATACTTATA QACTGAAOTT ATACTCCTTA 
AQATGQATTT CAQATCATOA ATATCTCTAC AAACAAGAAA ATAATATCTT GGTATTCAAT 
GCTGAATATG GAAACA6CTC AGTTTTCTTG GAGAACAGTA CATTIGATGA GTTTGQACAT 
TCTATCAATG ATTATTCAAT ATCTCCTGAT GGGCAGTTTA TTCTCTTAGA ATACAACTAC 
GTGAA6CAAT GGAGGCATTC CTACACAGCT TCATATGACA TTTATGATTT AAATAAAAGG 
CAGCTGATTA CAGAAGAGAO OATTCCAAAC AACACACAGT GGOTCACATG GTCACCA6TG 
GGTCATAAAT TOQCATATQT TTGGAACAAT QACATTTATG TTAAAATTGA ACCAAATTTA 
CCAAGTTACA GAATCACATG OACGGGGAAA GAAGATATAA TATATAATGG AATAACTGAC 
7GG0TTTATG AAGAGGAAGT CTTCAGTGCC TACTCT6CTC TGTGGT6GTC TCCAAACGGC 
ACTTTTTTAO CATATGCCCA ATTTAACGAC ACAGAAGTCC CACTTATTOA ATACTOCTTC 
TACTCTGATO AGTCACTGC3V GTACCCAAAG ACTGTACGGG TTCCATATCX: AAAG6CAGGA 
GCTGTGAATC CAACTGTAAA Gl 'lt.T n ' G ' n GTAAATACAG ACTCrCTCAO CTCACTCACC 
AATOCAACTT TCATACAAAT CACTQCTCCT GCTTCTATGT TGATAGGGGA TCACTACTTG 
TGTGATtTTGA CATGGGCAAC ACAAGAAAGA ATTTCTTTGC AOTGGCTCAG GAGGATTCAO 
AACTATTGGQ TCATGGATAT TTGT6ACTAT GATGAATCCA GTGGAAGAT6 6AACT6CTTA 
GTGGCACXMC AACACATTGA AATGAGTACT ACTGGCTGaG TTGGAAGATT TAGGCCTTCA 
GAACCTCATT TTACCCTTGA TGGTAATAGC TTCTACAAGA TCATCAGCAA TQAAGAAaGT 
TACAGACACA TTT6CTATTT CCAAATAGAT AAAAAAGACT GCACATTTAT TACAAAAGGC 
ACCTGGGAAG TCATCGGGAT AGAAGCTCTA ACCAGTGATT ATCTATACTA CATTAGTAAT 
GAATATAAAG GAATGCCAGG AGGAAGGAAT CTTTATAAAA TCCAACTTAG TGACTATACA 
AAAGTQACAT GCXrrCAGTTG TGAGCTGAAT COQGAAAQGT GTCAOTACTA TTCTGTGTCA 
TTCAGTAAAO AGGOOAAGTA TTATCAOCTG AGATGTTCCX3 GTCCT6GTCT GCCCCTCTAT 
ACTCTTACACA GGVGOGTGAA TGATAAAGGG CTGAGA6TCC TGGAAGAOUV TTCAGCTTT6 
QATAAAAT6C T6CAGAATGT OCAOATGCCC TCCAAAAAAC TQGACTTCAT TATTTTGAAT 
GAAAC3UUUVT TTTGGTATCA QATQATCTTG CXTTCCTCATT TTGATAAATC CAAGAAATAT 
CCTCTACTAT TAGATGTGTA TGCAGGCCCA TGTAOTCAAA AACCAGACSIC TGTCTTCAGA 
CTGAACTGGO CCACTTACCT TOOUIGCACA GAAAACATTA TAGTAGCTAO CTTTGATGGC 
AGAGGAAGTG GTTACCAAGG AGATAAGATG ATGCATGCAA TCAACAGAAG ACTGGGAACA 
TTTGAAGTTQ AAQATCAAAT TQAAOCAGCC AGACAATTTT CAAAAATG06 ATTTGTGGAC 
AACAAACGAA TT6GAATTTG GGGCTG6TCA TATGGAGGGT ACX3TAACCTC AATGGTCCTG 
GGATCGGGAA GTGG06TGTT CAAGTQTOGA ATAGCCX3TGQ CGCCTGTATC CCGGTGGGAO 
TACTATGACT CAQTOTACAC AGAAOGTTAC ATGGGTCTCC CAACTCCAGA AGACAACCTT 
GACCATTACA GAAATTCAAC AGTCATGAGC AGAGCTGAAA ATTTTAAACA AGTTGAGTAC 
CTCCTTATTC ATQQAACA6C AGATGATAAC GTTCACTTTC AGCA6TCAGC TCAGATCTCC 
AAAGCCCTGG T0QAT6TTGG AGTGGATTTC CAGGCAATGT GGTATACT6A TQAAGACCAT 
GGAATAGCTA GCAGCAGAQC ACACCAACAT ATATATACCC ACATGAGCX31 CTTCATAAAA 
CAATGTTTCT CTTTACCTTA O 

Seq ID NO: 590 Protein sequence 
Protein Accession ft: NP 001926.1 



60 
120 
180 
240 
300 
360 
420 
480 



60 
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160 
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1111)1 
MKTPmaiiLG LLGAAALVTI ZTVFWUiNK GTDDATADSR KTYTLTDYLK NTYRLKLYSIf 
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RWISDHBYLY KQENNILVFN ABYCNSSVFL ENSTFDEFGH SINDY8I8PD GQPILLBYNY 120 

VRQKRBSYTA SYDIYDLNKR QLITEERIPN NTQWVTWSPV GHKLAWWNN DIYVKIBFNL 180 

PSYRITWTGK EDIIYNGITD WVYEEEVFSA YSALUMSPNG TFLAYAQPND TBVPLIEYSP 240 

YSDBSLQYPK TVRVPYPKAG AVNPTVKFFV VNTDSLSSVT 2IATSIQITAP ASMLIGCIRYXi 300 

5 CDVTIfATQER IStiQWLRRIQ SJYSVMDICDY DB8S6RWNCL VARQHIENST TQNVGRFRPS 360 

EPHPTUXarS PYKIISNBEG YRHICYPQID KKDCTPITKG TWEVIGIEAL TSDYLYYISN 420 

EYXGMPGGRN ItYKIQLSDYT KVTCLSCSLN PBR0QYYSV8 PSKEAKYYQL RCSGPQLPLY 480 

TLHSSVNDKQ LRVLBDKSAL DKMLQKVQMP SXKU>PXIW BTKFWYQMXL PPHPDKSKKY 540 

PLUiDVYAQP CSQKADTVPR LNWATYLAST EHIXVASPDQ RGSGVOSOXX MHAZKRRIX;! 600 

10 PSVEOQIEAA RQPSKKGFVD NKRIAIKGN8 YGOYVTSMVL 6S6SGVFKGQ lAVA PVSRWB 660 

YYDSVYTBRY l«a.PTPEDllL DHYRNSTVHS RAENFROVBY LLZBOTADDN VHFQQSAQI8 720 
KALVDVGVDF QAKHYTOEDH GXASSTABQR ZYTBMSHFXK QCFSLP 

Seq ID NO: 591 DMA sequence 
15 Nucleic Acid Accession tf: KM_016077.1 
Coding sequence t 128 . . 667 

1 11 21 31 41 51 

«^ I I I I I i 

ZU TC6CTTTGTG ATTCTTQATC CX3GAACTTTG TCACCCAGGA ACCCCGGAAO AGGTAGCTCA 60 

GGGGATAGAA AOGTGTTOOC TTGCXXAGAA QAAGGGAAGG OSCXSAGTGAG GAAAGGAGGT 120 

ACTGTAGATQ OCCTCCAAAT CCTTGGTTAT GOAATATTTQ GCTCATCCCA GTACACTCQO 180 

CTTGGCTGTT GQASTTGCTT QTGGCATGTa CCTGQGCTOO AOCCTTOGAG TATGCTTTOO 240 

GATGCTCCCC AAAAGCSUVOA CGAGCAAGAC ACACACAQAT ACTGAAAGTO AAOCRAGCAT 300 

25 CrrGGGAGAC AGCGGGGAOT ACAAGATGAT TCTTGTGOTT OSAAATGACT TAAAGATGGG 360 

AAAAQQQAAA GTGGCTGCCC AOTGCTCTCA TGCTGCTGTT TCAGCCTACA A6CAGATTCA 420 

AAGAAGAAAT CCTGAAATGC TOVAACAATQ GGAATACTGT QOCCAGCCCA AGGTGGTGGT 480 

CSUUVSCTCCX GATOAAGAAA CCCTOATTGC ATTATTGGCC CATGCAAAAA TGCTGGOACT 540 

GACTGTAAGT TTAATTCAAG ATGCTGGAOG TACTCAOATT GCACCAGGCT CTCAAACXGT 600 

30 CCTAGG6ATT GGGCCAGGAC CAGCAGACCT AATTGACAAA GTCACTGGTC ACCTAAAACT 660 

TTACTAGGT6 GACITTGATA T6ACAACAAC OCCTCCATCA CAA0T8TTT0 AAGGCTOTCA 720 

GATTCTAACA ACAAAAGCTO AATTTCTTCA CCCAACTTAA ATGTTCTTGA GATGAAAATA 780 
AAACCTATTC CCATGTTCTA AAAAAA 

35 Seq ID NO: 592 Protein sequence 
Protein Accession fti NP_057161.1 

1 11 21 31 41 51 

Af^ ^ 1 I ' > ' 

4U MPSKSI.VKEY LAHPSTLGLA VGVA0GMCL6 WSLRVCFGMIi PKSKTSKrHT DTESEASIU3 60 

DSGEYKNXLV VRMDLKMGXO KVAAQCSHAA VSAYXQZQRR NPEMLKONEY OGQPKWVKA 120 

PDBBTLIAUi AHAKMLQLTV SLIQDAQRTQ XAP6SQTVLG IGPOPADLID KVTGKLKLY 



Seq ID NOs 593 DNA sequence 
45 Nucleic Acid Accession ft: FGENGSK predicted 
Coding sequence i 1..1896 

1 11 21 31 41 51 

CA I I 1 > ' • 

50 ATGCGCGCCO TQCCGCTGCC OGCCCOGCTC CTGCCGCTGC TGCTOCTOGC GCTCCT06CC 60 

GCrCCCGCOG CGGGOGGCAG GAGAOCOOAG T0C6TCTCGG GQCCGIOGOC C6AACC0QA0 120 

CG0GAGTG6C GGCCACOGCC CGGGCOOGGG CCC66GAACA GCACCOGGTT TG6GTCTGG0 180 

GOGGCGGGOQ GCAG0GGCA6 CTCCAGCTCC AACAOCAGTO GOGACGCCTT GGTGACCCGC 240 

ATTTCCATCC TCCTCOGOGA CCTACCCACC CTCAAGGCAQ CCGTGATCGT GGOQTTOGCC 300 

55 TTTACCACCC TCCTCATOGC CTGCCTOCTO CTGOGOOTCT TCAGGTOGGG AAAGAGGTTA 360 

AAGAAGACAC GCAAGTATOA TATCATCACC ACTCCASCAG AGGGAGTGGA AATGGCGCCA 420 

CTAAATGAAO AGGATGATGA AGATGAGGAC TOCACAOTAT TCOACATCAA ATAC AGAOTG 480 

TOCTTGGOGO CTGCACTGAa ACGTCAGCTO CCAG66TGCC AGAOQCTACT GACAOTTCCT 540 

GTGCCCCCAC OCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAOTGG AAOGCCTGAT 600 

60 GGTGGAATCA GACCTQGTAA AACCTGTTTC CCAGCCTGGT OGCATOCTGT GGAAAGTTGG 660 

TCAGCTGCAA CCTGGGGIGT GAAGGACTOG ACCTGGAAGC C C TCTi' QO aT OQGAOGTOTT 720 

GAAACCAAAA GGAAOGTTAT GTATAAAACC CCAGCIOCAT 0GTG08TGTC AGGCATCTGC 780 

TCAGACTGTC ACTGGCAAGC TCGTTTCCAC 6TCACCACAA TGGAGTTOCT TCTGCCACCC 840 

^ TTTGG6CATC CCTTTAAAQT GCCCCCTACT TCTACTCCOC ATGGTTTTC6 ACAACTGCAG 900 

65 CTQAATCTCA TGOAAAAGCT GGATTCCTCT 6CCTTA06CA 6AAACAGCGG GGCTCCATCT 960 

GCCA6GTGCT TGCCACIGGT CCTQGCAGAA ATGGGGGCTQ CTOAAA6TGA CCTTCCAAAT 1020 

CCTTGGTGGC ACTTCAGCOC CACAGGCTCT CCAATAAAAA COCTTTACAC ACRAACCATO 1080 

AGTACCTTGG GCTTGGATGT TTTCTGTGGT GCCGGCCAGC GGGOCACCTT TTGTGAAGAC 1140 

AGAGCAGTGA CTAAGGTTCT CCAGGGTAOC TCTTTCTCCA AACAGCTGCG CTQGARGCCA 1200 

70 6C0CTAGAGA tfitl AGG 'm'CC OCATCATCTC AGGCTTCTCA GAGA6TGTCC TCOOCTQAGC 1260 

ACCGATCCTQ TCAGGTTGGC TGGTTCAOAT GGOCGGGGAC AAGCCAGCCT GAOQGGQAOG 1320 

AGGGTGTTTC G6C3TC0GC6 GCAGTCTCT6 CATGGG66A6 GGTCAGCG66 TAOCGCAACT 1380 

TGCCTTTTGG TTTTGAAGAT TCTGTTGAOG CGCCATCCTC ACCrTGAOCT CXTCTACAAA 1440 

— - ATCTGTCTCC CCTGCTGTGC OGTGGAACAC CTAOGGGAAG CCAAGAGAAG CTCAGTGACT 1500 

75 GTCCTTGCGT CATTTGAGCA GA6CCCACAA AAGGCA6CT0 CIGC0CA06G GOAGCCrGTC 1560 

AAAC6AGGGC CCAOTGOGCA ATTQACCAGA CACACATGCC CTGGCIGGOG QATCACACAT 1620 

G0GAACCT6C AOACAATTCC A6ATA0CCAA G6CCAGGAAG GCCCAOGTGA OQATOT CACT 1680 

CACCCTGGAG GAGACTTGGA TGG G OTGGCA AATTTCTATT TGOAGGAAGA GGGTTTCCAO 1740 

GATGGCAGAT GCCAGAAGAT GGTCCTGATG TCTGAGGAAQ GGCCACCTAO TTTQACAGGA 1800 

80 TGTGAGAGGC TGACAGOTTC CCATCACTTC TCCAGCCATT CCAAGTCTTG CTCCTTCCTT I860 

TCCOCOOQAC AGCCCCrGTT TCTOTCCAGG OCCTGA 



Seq ID NO: 594 Protein sequence 
Protein Accession fts FGENESH predicted 
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1 11 31 31 41 51 

I i I I I 1 

HRAVPLPAPL LPLLUALLA APAARASRAB SVSAFWPBPE RBSRPPPGPG PGNTTRFGSO 60 

AAGG SQSSSS KSSGDALVTR ISILUtDLPT UCAAVXVAFA FTTItblACLb LRVFRSQKRI* 130 

KKTRKYDIIT TPAERVEMAP LNEEDDKDBD STVFDIKYRV SLPAALRHQIj PQCQTLLTVP IBO 

VPPPPILDID LPARCSGRPD G0IRP6KTCP PAWWHPVESH SAATWGVKDW TWKPSCVGGV 240 

ETKTNVMYKT PAPSCVSGIC SDCHWQARPH VTTMBLLLPP FGHPPKVPPT STPHGFRQLQ 300 

LNLMEKLDSS ALRRNTRAPS ARCLPLVIAB MAAAB8DLPN FHWHP8AT6S PXKn.YrQ3M 360 

STbOLOVFOQ AOQRGTFCED RAVTKVUXSS SPSKQLRHRP ALBSQPPHEL RLXiRECPPItS 420 

THPVRLARSD ARGQASbTGR RVPRRPRQSL HGGGSAGTAT OiLVLKILLR RHPHUJLPYK 480 

ICLPCCAVEH LRBAKR8SVT VLASPBQSPQ KAAAAHGBPV KR6PSGQLTR HTCPGWGITH 540 

ANLQTIPDTQ GQEQPREDVT HPGGDLD6VA NFYLSEEGFQ DGROQKMVLM SEEGPPSLTG 600 
CBRLTGSHHF 8SBSR8NSFL SPRQPIiFLSR P 

Seq ID N0» 595 DWA sequence 

Nucleic Acid Accession «t NM_021614.1 

Coding sequence! 1..1740 " 



1 11 21 31 41 51 

I I t ) I i 

ATGA6CA6CT GCAOOTACAA CGGGGGOOTC ATGCX3QCCGC TCAGC7WVCTT GAGCGCQTCC 60 
0GCX3GGAACC TGCAOGAGAT G6ACTCAGAG G0GCA6CCCC TGCAGCCCCC 0GC6TCT0TC 120 
GGAQGAGOTG GOGGOGOBTC CTCOCOQTCT GCASCOSCTG C0GG0GCX3GC G6CT0TTT06 180 
TCCTCAGCCC COGAGATOGT GGTGTCTAAG CCOQAGCACft ACAACTCCAA CAACCIGGOO 240 
CTCTATGGAA CCGGOGGOGO AGGCAGCACT GGAGGAGGCQ 60GGCGGTGG OGGGAGOGGG 300 
CACX5GCAGCA GCAGTGGCAC CAAGTCCAGC AAAAAGAAAA ACCAGAACAT CGGCTACAAG 360 
CTGGGCCACC GGC6CGCCCT GTTOGAAAAG CGCAAGG6GC TCAGC6ACTA 0606CTCATC 420 
TTGQ6CAT6T TOOGCATCGT GGTCATGGTC AT0QAGACCX3 A0CTGT06TG GGGCGCCTAC 4 BO 
GACAA6G0GT CGCTGTATTC CTTA6CTCT0 AAATGCCTTA TCAQTCTCTC CAOGATCATC 540 
CTG C TOGGTC TGATCATCOT GTACCACGCC AGGGAAATAC AOTTQTTCAT GGTGGACAAT 600 
GGAGCAGATG ACTGGAGAAT AGCCATGACT TATGAGCGTA TTTTCTTCAT CTGCTTGGAA 660 
ATACTGGT6T GTGCTATTCA TCCCATACCT GGGAATTATA CATTCACATG GA06GCCCGG 720 
CTTQCCTTCT CCTATOCCCC ATCCACAACC ACOGCTQATG TOOATATTAT TTTA TCTATA 780 
35 CCAATGITCT TAAOACtCTA TCTGATT6CC AGA6TCATGC TTTTACATAG CAAACTTTTC 840 
ACTGATGCCT CCTCTAGAAG CATT6GAGCA CTTAATAAGA TAAACTTCAA TACAOGTTTT 900 
GTTATGAAGA CTTTAATGAC TATATGCCCa GOAACTGTAC TCTTGGTTTT TAGTATCTCA 960 

TTATG6ATAA TTGCOQCATG GACTGTGOQA OCTTGTGAAA GQT ACC31TQ A TCAACAGGAT 1030 

GTTACTAGCA ACTTCCTTGQ A60QATGTGG TTGATATCAA TAACTTTTCT CTCCATT6GT 1080 

40 TATGGTGACA TQGTACCTAA CACATACTQT GGAAAAOGAO TCTGCTTACT TACTGGAATT 1140 

ATGGGTGCTG GTTGCACAGC CCTGGTGGTA GCTGTAGTGQ CAAGGAAGCT AGAACTTACC 1200 

AAAGCAGAAA AACA06TGCA CAATTTCATG AT6GATACTC AGCTQACTAA AAGAGTAAAA 1260 

AATGCAGCTG CCAATOTACT CA6GQAAACA T6GCTAATTT ACAAAAATAC AAAfiCTAOTQ 1320 

AAAAAGATAG ATCATGCAAA AGTAAGAAAA CATCAAGGAA AATTOCTGCA AGCTATTCAT 1380 

CAATTAA6AA GTQTAAAAAT GQAGCAGAOa AAACTOAATO ACCAAGCAAA CACTTTGQTG 1440 

6ACTTGGCAA AGACCCAGAA CATCATGTAT GATATGATTT CTGACTTAAA CGAAAGQA6T 1500 

GAAGACTTCG AGAAGAGGAT TGTTACCCTG GAAACAAAAC TAGAGACTTT 6ATTGGTA6C 1560 

ATCCACGCCC TCCCTGGGCr CATAAGCCAG ACCATC3VGGC AGCAGCAGA6 A6ATTTCATT 1620 

OAaOCTCAaA TGGAGAGCTA 08ACAA6CAC QTCACTTACA ATGCTGAGOS CTC0CX3GTOC 1680 
TCGTCCA6GA GGCGGCQGTC CTCTTCCACA GCACCACXAA CTTCATCAQA GAOTAGCTAG 



Seq ID NO: 596 Protein sequence 
Protein Accession #t NP 067627.1 



1 11 21 31 41 51 

t I I I I I 

MS8CRYN6GV MRPLSNliSAS RRNLHEMDSB AQPLQPPASV GQG0QAS8PS AAAAAAAAVS 60 

SSAPB1W8K PBHMNSNMLA LYGTGGGGST GGGGGGGGSO HGSSS6TKSS KKKNQNIGYK 120 

LGRRRAUPEK RXRLSDYALI F(a4FOIWMV IBTEI.SWGAY DKASLYSLAL KCLZSLSTXI 180 

OU LMLXIVYHA REIQLFMVDN GADDWRIAMT YBRZFFICLE ZLVCAIRPXP GNYTFTOTAR 240 

LAPSYAPSTT TADVDIIL81 PMFLRLYLIA RVMLLHSKLP TDASSRSIGA LNKIHPNTRF 300 

VMKTUWICP GTVIiLVFSIS LWIXAAWTVR ACBRYHDQQD VT8NFU3AMW LISITPLSIG 360 

YGDMVFNTYC OKSVCLLTGI MGA6CTALW AWARKLELT KAEKHVHNFM MDTQLTKRVK 420 

NAAANVXiRET HLXYXHTXLV KKIimiCVRX HQRXPLQAXR QLRSVN4BQR KLNCQANTLV 480 

05 DLAKTQNXMY SHISDLNERS BDPBKRIVTL BTKLBTLX6S ZHAIiPGLXSQ TIRQQQRDPI 540 
BAQMESYDKH VTYNAERSRS SSRRRRSSST APPTSSBSS 

Seq ID NO: 597 DNA sequence 
„ Nucleic Acid Accession NM_016029.1 
tV coding sequence! 228.. 1097 " 

1 11 21 31 41 51 

I I I 1 I I 

CT606ATCCC GCAGGGCAGC GAOGOGACTC TGGTGCGGGC OGTCTTCTTC CCCCCGAGCT 60 

GGG06TG06C GGCOGCAATG AACTGGGAGC TGCTGCTGTQ GCTGCTOOTO CTGTQCQOGC 120 

TGCTCCTGCT CTTGQTGCAG CTGCTGCGCT TCCTGAGGGC TGACGGCGAC CTGACGCTAC 180 

TATGGGCCGA GTG6CAGGGA CGAGGCCCAG AATGaOAGCr GACTGATATG GTGGTGTGGG 240 

TGACTGGAGC CTC6AGTGGA ATTGGTGA60 AGCTGGCTTA CCAGTTGTCT AAACTAGGAG 300 

rnVfUnW GCTQTCAGCC AGAAGAGTGC ATGAGCTOGA AAGGGTGAAA AGAAQATGCC 360 

TAQAGAATGG CAATTTAAAA GAAAAAQATA TACTrOTTTT GCCCCTTGAC CTOACCOACA 420 

CTGGTTCCCA TQAAGOGGCT ACCAAAGCTG TTCTCCAGGA GTTTGOTAGA ATOGACATTC 480 

TGQTCAACAA TGGTGGAATG TCCCAGGOTT CTCTGTGCAT GGATACCAGC TTGGAT6TCT 540 

ACAOAAAGCT AATAGAGCTT AACTACTTAG GQA08GTGTC CTTQACAAAA TGTGrrCTGC 600 

CTGACATGAT C8A6AGGAA0 CAAG6AAAQA TTGTTACTOT GAATAOCATC CTGGGTATCA 660 
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10 
15 
20 



TATC TQTA CC TCTTTCCATT GGATACTGTG CTAGCRAGCA TGCTCTCOSG GGTTTTTTTA 720 

ATGGCCrrOO AACAGAACTT GCCACATACC CAOOTATAAT AOTTTCTAAC ATTTGCCCAG 780 

OACCTOTGCA ATCAAATAtT GTGGAGAATT CCCTAGCTGG AGAAOTCACA AAGACTATAG 840 

GCAATAATGG AQACCAOTCC CACAAGATGA CAACCAGTOG TTGTGT GO GG CTGAT6TTAA 900 

TCAGCATGGC CAATQATTTG AAAGAAGTTT G6ATCTCAGA ACAACCTTTC T TCTIA OTAA 960 

CATATTTGTG GCAATACATO CCAACCTGGG CCTGGTGGAT AACCAACAAG ATGGGGAAGA 1020 

AAAGGATTGA GAACTTTAAO AQTGGTGTGG ATGCAGACTC TTCTTATTTT AAAATCTTTA 1080 

AGACAAAACA TGACTGAAAA GA6CACCTGT A LTriTC AAO CCACTGGAGG GAGAAATGGA 1140 

A AACATG AAA ACA6CAATCT TCTTATGCTT CTOAATAATC AAAOACXAAT TTGTGATTTT 1200 

ACTTTTTAAT AQATATGACT TTGCTTCCAA CAtOGAATGA AATAAAAAAT AAATAATAAA 1260 
AGATTGCCAT GAATCTTGCA AA 

Seq ZD MOi 598 Protein sequence 
Protein Accession «t NP_0S7113.1 

1 11 21 31 41 51 

i I I I I I 

MN WELL LWLL VLCALLLLLV QLLRFLRADO DLTLLMAEHQ 6RRPEHBLTD MWHVTGA88 60 

GIGBBIiAYQL SXLQVSLVLS ARRVHELBRV KRRCIiBIGNIi KBKDZLVLPZi DLTDTQSBEA 120 

ATKAVLQBFO RIDILVNNGO MSQRSZjCMDT SLDVYRKLIB LNYLGrVSLT KCVLPBMIER 180 

KCX3KIVTVNS ILailSVPLS IGYCASKHAL RGFPNOLRTB LATYPGIZVS HICPGPVQSN 240 

IVENSLAGBV TKTIGNNGDQ SHKMTT8RCV RLMLISMAND LXBVHXSBQP FUiVTYLNQY 300 
MPTWAWWITN KMGKKRIENF KSGVDADSSY FKIFKTKHD 

25 Seq ID NOt 599 DMA sequence 

Nucleic Acid Accession Si NM_000793.2 
Coding sequence t 401.. 1222 ~ 

^ 3^ 31 41 51 

30 I I I I I I 

GCCTGCAGAG AGAGGCACTT TGCACCACAG ACAGATAGCA AGAA6GGAAA GACAQAGAGT 60 

GAGAAAAAA6 AGGAOTCAGT CGCTCGTGGG GAA6GGAGAG AGTGAGACTG GQAGAAA6AG 120 

AAGCACAGAA AGTGTGTGTA AAAGGQAGTA AA6AAA6AAA AAAAAAAAAC TACCCTTAAA 180 

GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAA6AAAGA AACA66CTAC GTTTAAAQAO 240 

CATAGAGACA ATGAAAG6CT AAAQAAAATT TTAAAATCTC TOCCACAGTC TCATAG6T6C 300 

TTGQAAATGA AAGTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGA6G GTGAAGGGGA 360 

ACCA GAGCGC ACAAGGGAAC TGACTCAGGA GGGAGAGAAG ATGG6CATCC TCAGOGTAGA 420 

CTTGCTGATC ACACTGCAAA TTCTGCCAOT TTTTTTCTCC AACTGCCTCT TCCTGGCTCT 480 

CTATQACTOG GTCATTCTGC TCAAGCAOQT GGT6CTGCTG TTQAGCOGCT CCAAGTOCAC 540 

T0QGG6AGAG TGG00G06CA TGCTGACCTC AOAGGGACTO CQCT GC XyfCr GGAAGAGCTT 600 

CCrCCTCGAT GCCTACAAAC AGGTGAAATT GGGTGAGGAT QCCCCCAATT CCAGTGTGGT 660 

GCATGTCTCC AGTACAGAAG GAGGTOACAA CAGTGGCAAT GGTACCCAGG AGAAGATAGC 720 

TGAGGG AGCC ACATGCCACC TTCTTGACTT TGCCAGCCCT GAG0GCXX3VC TAGTGGTCAA 780 

CTTT6QCTCA GOCACTTQAC CTCCTTTCAC GAGGCASCTO OCAGOCTTCC OCAAACTGGT 840 

0GAA6AGTTC TOCTCAOTGO CTGACTTCCT OCTQOICTAC ATTGATQAOG CTCATCCATC 900 

AGATOaCTGG OOIATACOGG GGGACTCCTC ' i ' rm ' l ' Crm ' GAGOTQAAGA AGCACCAGAA 960 

CCAGGAAGAT CGATGTGCAG CAGCCCAGCA OCTTCTaOAG CGTTTCTCCT TGCCGCCCCA 1020 

QTGOOGAOTT GTGGCT6ACC GCATGGACAA TAAOGCCAAC ATAGCTTAGO GGGTA6CCTT 1080 

TOAAO GTGTG TGGATTGTOC AGAOACAGAA AATTOCTTAT CiaGGAaQAA AGGGCCCCTT 1140 

CTCCTACAAC CTTCAAGAA6 TOCGGCATTG GCTG6AQAAO AATTTCAGCA AOAGATGAAA 1200 

GAAAACTAGA TTAGCTGGTT AAAQGTATGA TTATAAQAGA GCTTATTGTT TTAAAAAGTT 1260 

ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA ACAGAGCCCT ATTGGCTTAC 1320 

TOAAAOACAO GAGTTTATCT ATCGGAAGAA CAT6AATCTC TAACA6CTCC ATACTTCTTT 1380 

C ACTAC TCAA AT6GCATTGG QCrQAGTAAO 7AA0CATATC ACCTCTCTTC TTAGTAAAAA 1440 

GCCCTATGTO AAAAQATCCC AAGATG6AGA OQAAGAAAGG CTAATTCAGC A T G TGTTCAT 1500 

TCTGCATTGA GAAGGAACTG ATACATCTGA TOCATGCTTT GAGACCAGAA QAAAA6ACTT 1560 

ACCTQAATAA TTACTACATT AGGGAAGCTA CTGTCTACGT TAAGATAAAQ GGTATTQCCT 1620 

TGGCTCTATT TQGCATGGAT GOAGOOCAGT TGGAAAATTC CCAAATATTA CAACAAGTGC 1680 

TTOAA OOOUS 6CCATGTGGT TAOAOOTTOG TGTTAAOGTT AOACCTTATQ TTAGAaTGAT 1740 

TTCTGATOTT CCAOCTTCTA GCXaTOTAOT GCTCTOVGTC TTCATAOOCC AGAAATTAIT 1800 

GGTATATTTO TAGATACOGA GAATGATCCC TCAGTCTQAG AGOTTAGAAT GATCATCTOT 1860 

AATCTGAGGO TTAATTTCTA GGCAGCTGGA GAGAGTGGTA AAAAAGAAAT GAAATTGACA 1920 

AGCTA GGAAA GAGGAGGCAG AAAGATTTG6 AAAATTCACA QAOTTTCACC CTTAA6CTGT 1980 

AQAGAGTGGO TCACATTTGT TAGCCAGGGA AACATAGAAA CATACACAAG 60CAGAAAAA 2040 

GAAGAAGGAO CTCAACTAAA AGTOGCATAO AGAATACACA TATAAAAACA ATATATTTGT 2100 

CATATGCTCC TA GAGAG GAG AAAGGGQT6A TTOAAAGAAA AAAAAATACT TAAATATTTG 2160 

TAAT TGTQAG GGGTTTCTTT TGGAAATAAT ^-ACTTTTOAA CCATGTATGT GGTATGTATA 2220 

TTTTCAGTGG GTTAATTATA CCCCATGATA OCTATTAAAG GAAAACCAOT 6GGTCTGGT0 2280 

OTGCTGOTCT TTTCCTCCCC ATTCCTACAA TTTCXATGT6 OOCCAAGTCA TTCCTAATCT 2340 

T6GTCTCTAT AGCAGTOTTC TCTCT6AATG CTGAGCTOAA GAAATTATAC GTACATACAC 2400 

ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCT6CTGCG GGAGGTAGGT 2460 

ACCATGGCCA TTCAGCACAQ CXTTTOATTTC CTCCCAAAGT AGGTGAGCXA TAGTGAAGAA 2520 

TAGGT6CAAA CAAACAAGCT TACTTCCATT GCAAAATAGA AGAAOAGGAA GTTAGAGATA 2580 

ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC CCGC3GGTATA TCATGGAATT 2640 

TCCATTGACA TTTGAATTTO GACTTGGATC TTCCCTTGGT CCCATTAGCT GAGGrTTAGT 2700 

A ATCTAAAG T CCCTA TAGTA TATGATTATA ATQCTATTTT AAAAAATATA TATATAAAAT 2760 

ATTTTTTrCT TTTTAAAATA GACACTATAG TTTTACCCAT AAGTAATATT TAAAGATTAT 2820 

AGCTCCCAAA AGAATGGACX: AACXJICTTTC GTATCATAAT TTCTTTTrGG TAAATATQAG 2880 

ACTATTATGA AATCATAGTA TATGATTOTA TTTAAAGOIA CAATC3UU\G6 ATCTTTTOTC 2940 

CATTCCATTA ATAACTQAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTT6AA 3000 

AATACATTCT TAAACTAQTT OTCTGAAATQ AGAAAAGAGT GAGAACTAOQ TOTGCAAGAA 3060 

CCAAACGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTC30TG TCACCA6CTG 3120 

OTATATTGTQ TAAATATTAA AGCTCCATTG GGACTOATTT TTCATGGCAA CATCAGCTTT 3180 

CrAATGTTCT AAATTCTATA AAAACCACCC ACAAA6AAAC AAAGCAAATT TCATTATCTA 3240 
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ATGAGTTQCT GGAAAATCAT ATTGAGAATA ATTATTTCAO ATTCCTCAOT TQTTAACTTC 3300 
TACATTCAAQ GGCTTATCTC TQCCCCCATT GATTTTTAAC CTCAAAATGG TGTGAQATTT 3360 
ACTGTGGAAC CCTAAA6CAG TAAAATAAAA AACCT6GTTG CA6CACATTC AGACTOTIGT 3420 
CCTTAAAATT CCOLVrrm CTCTATQTAC QAtAAAQTAA CA0TA3GTCA GATAAGOOGG 3480 
TGGGGGGATG AGATTAGGCT GAGGCAGTGC TAOTCAACTG GGG6AAAAG6 ATQATQQAAA 3540 
AATCACGCAQ TTGTGCTATA TTTTTAAAGA AGGAQGTOGT TTATGTGTGC AGACAATTCT 3600 
CCCTGAGGTT AGCCCAATGO AGAAATGAAG CAGAGGAAGG AAACATAGAA AGACATGGGC 3660 
TATCAGGGAG GAAGATGTTC AATAGAACAT GCAAGAATTT CTGGAAGAAA GOCTOTGGAA 3720 
GQOCCAATOG AOAAAATOAA TGGACAAAGC TCAGGAATCC CTAOGCTATG TAGAATOTTC 3780 
TTGGTGTTAT CAGGGTTAAO CCCTQTAATT ATGTAACCTA TTTATOGCAA CAT6AATTTT 3840 
TATGATTTCT TGTGATGTAT TCTTTTATGA AATTAACAAO AACTCATTAT TTTGAGGTAG 3900 
AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATTGCCAA TACTCATGTQ 3960 
CGTTTCATGT GTTTTATAAO GTTTGTTCCT TTGAAGAATT GTAGTTCTTA GTCCCACAGG 4020 
GAAATOrOTA TCTATTTATA TATCATAGTA TAAATCTATO ATATATTTAT ATCATATATA 4060 
AAA6TCTGA0 TTC T CTTTCT TAGTCCCTAA TCATOTTTCT CXXATAGGCT GTGTTTACAT 4140 
GGAGCTATCG GTTTAGCCTT TTAA6CTTCA TTAGCTTGTC TATTATTQAA ATAOTTTCCA 4200 
AGAAATTTTA GATATTATCA TAACATCTGG GTCTACTCAA ACACTTATTG TTTQAAAGAC 4260 
TTATGTCTTG GACCTATCAA AAACTGACTT TATTTATTOC TTAGTGAAAA TACTAOTGGQ 4320 
ATCAACAATG ATTTTCTTGA ATGOGCATGA ATGGAGATGC OOGCACAQTA ATGTAGAAAT 4380 
GTTTCATACA GCTATTAAAA TOTAACTGAC CTCCTTAGAG GCAQATTAGT AACTOTTCCT 4440 
ACTTTGTATA GCTAAGTGAC AOTCACTTAA CTTACATGAC TTTCTTTTTT CACATTGGGT 4500 
CTCTGGTCCT GTGTCTTCAC CTCATTTATA GCACGTCTCC TTGATTTTT6 GTAGTATCAA 4560 
CTTCCCAGTG ATCTGTTCAG TTAAGTrCTT CTCCOQTTAA CCAGGAAOTG CTTATTCTCT 4620 
CATCACA6TG GGAAGAATAG CCTATTGTCT TTCATTTT6C CTGAGTGTAT TTTACTATTT 4680 
GGGCTCTGAA ATAAAAATTA T6AAATATG0 TQAGGTCACA TGTTGGTOCT GCCTTGCTGC 4740 
ATAAAATTCT AGGAGGGCAG GTTAGGAGAC AGTTATGTAT GGCCTTTOGG GAAAATTCAA 4800 
A6GGTGGGAT TACAAGGGTG TTCCTCAGGC ATGCCCCTAT GGGCCCXATG TOGAAOCAAO 4660 
AAGAATTGAC TGATTTACAQ GACTTCTCTT TATGTCAATC TTAA6AGGAT GGATGAATCT 4920 
GGACATTTGT TCCACCOQAC CTCTGACTGA TGGTTTGGAA AATAACTTTA ATTAOQATCA 4980 
TATGACCATT QAAAAAGQAA AAATGTAOAC TCTGACTTCC GTCCCACT6A A6GATTAATG 5040 
AAAACCTTTA CTAGCATTTA GAGCTTTTCA GAACATCCCC ACTGTCATGT GTCTCAGCAG 5100 
TGGAGACTGC AAGTAAGGCT TTTAATTTTA GGAGOTTTTT TTTTTTTTTT TTTTTTCCCC 5160 
TAAATGGTAT GGCCAAAAQT CAGAGTTAAA ATATATATAQ TTAGATTCCA ACTTCXTTCCT 5220 
TCACTCTAAA AATAGAATCC AAACCCACTC TTCATATATG CTTCXAGAAT GGGGCTTAAG 5280 
TACCAATCTC TGCTTTGCAA TGGGCACAAT CTTGGTCATG TCCTGAGGCT CTCTAAGAAA 5340 
AGAGAGGATC TAGGATGGGA GAGCTAGAAA GTTGCTAACT GGGAAGAACA AGGCCCTGAG 5400 
GGOTTGOTCT ACCAATCTGG GAAGATTTGA AAACAAACTT CTCGCAACTQ AAGGAAGGCT 5460 
GAAGGCTGCT GCAAGTCATT GAGTGACTTT AGGATGAGCA AAACATTGGG CCACTTCCTA 5520 
ATGCXXTTATG TGTATAGTAC CAGAAGCAAG GTCTCAGACT TAACAGACCC AGCTCTGTTC 5580 
CAAGGTGAGT CTGAACCAAT AGAAAGCAAA CATGTGCAGA TATCCAAACA AGACTGCTCA 5640 
T6CAAGTGGG GGCTGGCTAC CCGTCTTAGG CAGCAACAGC AGAGCTCCAQ GGA6CTTATT 5700 
CAATATTTAC TGAOACTTCO AAGACCCAGC AGATGTTTAA TGAAGTCACT ATTTTGGCTC 5760 
AAACCCTCCA CTTCTCCCCC TCCCCTCAAA AAGCCAACAG GTAAACACAT AAATGAAAGA 5820 
AACCCACAGA AGGGGATGGG AAATAAAGAA AAT7CTCTCA AGACT7CXCC AGGCCCATGT 5880 
CACTGGTCAG OGTGQTTTTT ATGTGTATTA GGATTGGGGG ATGTGAAGAA ATAAGTATCC 5940 
AGTACTTTAT AAGCAAA6CA ATTAAATGAT ATTGGGGTAG 6GAATGTTG0 CXJ^GTTrTGT 6000 
TTAGTTTT O C CATCACATTG TCACCCAOAC CTCACCTAGC CCCAAOTAAT OGQGCGCCCC 6060 
GAAGAGGGAG ACAGAGATGT GCCAGAGTT6 ACCXIAGTGTG CX3GATGATAA CTACTGAOSA 6120 
AAGAGTCATC GACCTCAGTT AGTGGTTGGA TGTAGTCACA TTAGTTTGCC TCTCCCCATC 6180 
TTTGTCTCCC TGGCAAGGAG AATATGCXXIG ACATQATGCT AAGAGGCCTG GGTAAAT6TG 6240 
OTQAGAATGC AOQOQTOCAT ATQCXACACA TATOTOCTTC TCAGTTGCA6 AAAATGAACT 6300 
GCTTTG06A0 ATTATCA6TA 6AAAGAGT6T TATCATATTG GT6CTGAGTG CTATGTGTGC 6360 
TTATACAATT TGTTCnGXA TTTTAATAAA CTTTGAATAA AAGAATAAAA AAAAAAAAAA 6420 
AAAAAAAAA 

Seq ID NOi 600 Protein eeguence 
Protein Accession 8s MP_000784.2 

1 11 21 31 41 51 

i I I I I I 

MGXLSVDLI.I TLQILPVFFS NCLPLALYDS VILLKHWLL LSRSKSTRGE HRBMLTSEGL 
RCVWKSFLLD AYKOVKLGEaD APNSSWRVS STEGODNSGH GTQEKIAEQA TCBLLDFASP 
BRPLWNFGS ATUPPPTSQL PAFRKLVEEF SSVADFLLVY IBEAHPSDGN AIPGD8SI1SF 
EVKKHQNQED RCAAAQQLLE RF8LPPQCRV VADSHDMNAN ZAYGVAFERV CIVQRQICIAY 
LGGKOPFSYN LQEVRHHLEK MFSKRUKKTR LAG 

Seq ZD MOt 601 DMA Bequenea 
nucleic Acid Accession tti NM_005233.1 
CSoding sequence 1 101 . . 3052 



60 
120 
180 
240 



1 
I 

CCATGGATGG 
ATGGAGATAT 
CCTCCTCCTT 
CAATGAAGTC 
TCCATCACAT 
TTACCAGGTG 
CCCCAGGAAC 
TAGCATTCCA 
TGATOATQAT 
A6CTGATGAA 
QATTAQAGAA 
TGCTTGTOTT 



11 
I 

TAACTTCTCC 
GCTCCTCTCA 
CTCAGCTGCT 
AATCTACTGG 
GGGTQG6AA0 
TGCAATGTCA 
TCAGCTCAOA 
TTGGTTTTAO 
CAT6G00TGA 
AGTTTCACTC 
OTAGGTCCTG 
GCCTTGGT O T 



21 
I 

AGCAATCAGA 
CTGCCCTCTG 
CTGTTCTOGA 
ATTCAA AAAC 
AQATCAGTGG 
TGGACCACAG 
AGATTTATOT 
GAACTTGCAA 
AATTTOQAGA 
AAATG6ATCT 
TCAACAAGAA 
CTCJrGAGAGT 



31 

I 

GCGCTCCCCC 
CACCAOCAAC 
CAGCTTCGGG 
AATTGAAGGG 
TOTGGATGAA 
TCAAAACAAT 
GGAGCTCAAG 
GGAGACATTC 
GGATCAOTTT 
TGGGGACGGT 
GGGATTTTAT 
ATACTTCAAA 



41 
I 

TCACATCAGT 
ATGGATTGTC 
GAACT6ATTC 
QA6CXGGGCT 
CATTACACAC 
TGGCTGAGAA 
TTCACTCTAC 
AAOCTGTACr 
ACAAA6AT7G 
ATTCTGAAGC 
TTG6CATTTC 
AAQtGCOCAT 



51 
I 

GGCATGCTTC 
AGCTCTCCAT 
OGCAQCCTTC 
OQATCTCTTA 
CCATCAGGAC 
CAAACTGGGT 
GAGACTGCAA 
ACATGGAGTC 
ACACCATTGC 
TCA ACACT GA 
AAGATGTTGG 
TTACAGTGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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QAATCTGOCT 
AGQGTCTTOT 
AGGCG ftATGQ 
TTTTATGTQC 
TGCTAAOTGC 
GAATAATTAC 
TTCACCAAGA 
GCCCCTGGAC 
GTGOWVTATA 
TGGACTCACC 
TGAGATTGAT 
GOTCAGCATC 
GACCTCCAGA 
ATTGGACTAC 
GAGGOCAAOA 
CCAAATCOGA 
AACTAQTCCA 
TTCA6CGGCA 
CTQTGOCTAT 
TTTAAAACTT 
AGCTGTTCAT 
TGQAGCAGGT 
6ATTTCAQTG 
CCTQaOAQAA 
AGTT8TTAGC 
GGATAGTTTC 
TCX3AGGGATA 
0GCT6CTG8O 
TT0G08TGTC 
AATCAG6TG0 
ATGGAGTTAT 
QATGTCCAAT 
aOACIQCCCA 
CAGACCCAAG 
CCTOAAOATC 
TGTQQATATC 
CTGCAAGGAA 
CAGAGATGAC 
TAGCATTAAA 
GQAAQTGCTT 
CAATAATTCT 



ATGTTTCCAQ 
GTCAACAATT 
CTTGTACCCA 
CAAGCTTGTC 
C06CCTCACA 
TTCOGGGCAO 
AATGTTATCT 
ACAGGAG6CC 
AAAOU3TGTG 
AACACCAOGG 
GC06TTAATQ 
ACAACTAATC 
AATAGCATCT 
GAGGTGAAAT 
GGCACAAAT6 
OCOOGAACAG 
OACTCTTTCT 
OTAGCAATTA 
AAGTGAAAAC 
CCAGGTCTCA 
GAGTTTGCCA 
GAATTTGGAG 
GCCATTAAAA 
OCAAQCATTA 
AAAA6TAAGC 
CTAGGTAAAC 
GCATCTGQCA 
AACATCTTGA 
CTQQAGGA.TG 
ACATCACX3U3 
GGGATTGTTC 
CAGQATGTAA 
GCTGOCTTGT 
TTTGAGCAGA 
ATCACCAQTQ 
TCTACCTTCC 
ATCTTCACGG 
ATGAAAAAGG 
GCTCTAOAAA 
CTGQAOGGAA 
GGAGATACTG 



ACAOGGTACC 
CTAAGGAGQA 
TTG6CAA0TG 
GACCAGGTTT 
OTTCTACTCA 
ACAAAGACCC 
CTAATATAAA 
GGAAAGATGT 
A6CCATQCAG 
TGACAGTQAC 
GGGTGTCAQA 
AGGCTGCTCC 
CTTTGTCCTG 
ACTATGAAAA 
TTACCATCAO 
CCGCTGGATA 
CCATCTCTOG 
TTCTCCTCAC 
ATOGGGCAOA 
GGACTTATGT 
AGGAATTGGA 
AGGTGTGCAG 
CCCTGAAAGT 
TGGGACAOTT 
CAGTTATGAT 
AOGATGCCCA 
TGAAGTACCT 
TCAACAGTAA 
ACCCAOAAGC 
AAGCTATAGC 
TCTGGGAGGT 
TTAAAGCTGT 
ATCAGC TGAT 
TTGTTAGTAT 
CAGCCGCAAG 
GCACAACAGG 
GCGTGGAGTA 
TTGGTGTCAC 
CX3CAATCAAA 
GTGGTGGCTG 
GTGQAAGTT 



CATGGACTCC 
AGATGCTCCA 
TTCCTGCAAT 
CTACRA OG» 
GGAAGATGGT 
TCCATCCATO 
OQAOACCTCA 
TACCTTCAAC 
OCCAAATQTC 
AGACCTTCTG 
GCTOAGCTCC 
ATCACCTOTC 
GCAAGAACCT 
GCAGGAACAA 
TA6CCTCAA6 
TGGQACGAAC 
TGAAAOTAGC 
TGT7GTCATC 
TGAAAAAAGA 
TGACCC31CAT 
TGCCACCAAC 
TGGTOGCTTA 
TGGCTACACA 
TGACCACCOC 
TGTCACAGAA 
GTTTACTGTC 
GTCAQACATG 
CTTGOTGTGT 
TOCTTATACA 
CTACOGCAAG 
GATGTCTTAT 
AGATGAGGGC 
QCTOOACTGC 
TCTGQACAAO 
GCCATGAAAC 
TGACTGGCTT 
CAOTTCTTGT 
0GTGGTTG6G 
GAATGGCOCA 
TGQAAOOOGT 



CA6TCCCTGG 
AG6AT0TACT 
GCTGGCTATG 
TTGGATGGTA 
TCftATGAACT 
GCTTGTACCC 
GTTATCCTGO 
ATCATATGTA 
OGGTTOCTOC 
GO^CATACTA 
CCACCAAOAC 
CTGAOGATTA 
GAACATCCTA 
GAAACAAGTT 
CCTGACACTA 
AGCCGCAAGT 
CAAGTGGTCA 
TATGTTTTQA 
CTTCAXmO 
AATATGAA6 
ATATCCATTG 
AAACTTCCTT 
GAAAAGCASA 
AATATCATTC 
TACATGGAGA 
ATTCAGCTAG 
GGCTATGTTC 
AAGGTTTCTG 
ACAAGAGGAO 
TTGAOGTCAO 
GGAGA6AGAC 
TATCGACTGC 
TGGCAGAAAG 
CZTATCOGGA 
CTTCTTCTGG 
AATGGTGTCC 
GACACAATAQ 
CCACAGAAGA 
GTTCCCGTGT 
CAA6TCATCC 



TGGAGGTTAG 
GCAGTACAGA 
AAGAAAGAOG 
AT ATCAAOTQ 
GCAGGT01T3A 
GACCTCCATC 
ACTGGAGTTO 
AAAAATGTGG 
CTOQACAGrr 
ACTACACCTT 
AGTTTGCTGC 
AGAAAGATCG 
ATGGGATCAT 
ATACCATTCT 
TATACGTATT 
TTQAGTTTaA 
TGATCGCCAT 
TTGGGAGGTT 
QCAATOGGCA 
ACCCTAOCCA 
ATAAAGTT67 
CAAAAAAA6A 
GGAGAGACTT 
GACTG8AAG6 
ATGGTTOCTT 
TOGGGATGCT 
ACOGAGACCT 
ATTTOGGACT 
GGAAGATCCC 
CCAGOGATQT 
CATACTGGGA 
CACCCCCCAT 
ACAGGAACAA 
ATCCOGGCAG 
ACCAAAGCAA 
GGACAGCACA 
CCAAGATTTC 
AGATCATCAG 
AAAGCAOGAC 
TGCA6ACAGA 



Seq ID NO: 602 Protein sequence 
Protein Accession «t MP 00S224.1 



1 
I 

MDGQLSILLL 
BYTPIRTYQV 
NLYYMESCOD 
LAFQDVGACV 
RMYCSTBGEW 
SMHCRCEKNY 
ZICKK06WNI 
PPRQPAAV8I 
BTSYTILRAR 
QWMIAISAA 
TYEDPTQAVH 
EKQRRDPIiGB 
IQLVGMLRGI 
TRGOKIPIRW 
YRLPPPMDCP 
LLLDQSNVDI 
PQKKZZ6SXK 



11 
I 

LSCeVLDSFG 
QJVMDHSQNK 
HGVKFREHQP 
ALVSVRVYPK 
LVPIGKCSCN 
FRADKDPPSM 
KQCEPCSPNV 
TniQAAPSPV 
GTNVTISSLK 
VAIILLTWI 
BFAXELDATN 
ASZMGQFDHP 
ASGMKYLSDM 
TSPBAIAYRK 
AALYQLMLDC 
STFRTTGPWb 
ALBTQ8XN0P 



21 

1 

ELIPQPSNEV 
WLRTNWVPRN 
TKIDTZAADE 
KCPFTVKNLA 
AOYEEROFMC 
ACTRPPSSPR 
RFLPRQFGLT 
LTIKKDRTSR 
PDTIYVPQIR 
YVLIGRFC3GY 
ZSIOKWGAG 
KIZRIiEGWT 
GYVHRDZiAAR 
PTSASDVWSY 
HQKDRMNRPK 
NOVRTAKCKB 
VPV 



31 
1 

NLU)SKTIQG 
SAQKIYVELK 
SFTQMDIiGDR 
MPPDTVPMDS 
QACRPGFYKA 
NVZSNINETS 
NTTVTVTDLL 
K8Z8LSWQEP 
ARTAAGYGTN 
KSKH6ADBKR 
EFGSVCSGRL 
KSKPVMZVTB 
NZLZMSNLVC 
GZVLHEVMSY 
FEQZVSZLDK 
XFXOVEYSSC 



41 

I 

ELGHI8YPSH 
FTLRDCNSIP 
ILKLNTEIRE 
QSLVEVRGSC 
UXSNMKCAKC 
VILDWSNPLD 
ABTNYTFBID 
EHPKQIIIiDY 
SRKFEFBTSP 
LRFGKGHLRL 
XLPSKKBZSV 
YMBKGSLDSF 
KVSDFGLSRV 

LIRNPGSLKZ 
DTZAKZSTDD 



Seq ZD NOt 603 DNA sequence 

Nucleic Acid Accession NM_005737.1 

Coding sequence: 122.. 647 



1 

I 

GCCAGGCGTC 
TGGAGCCTCA 
CATGCAQTGC 
GTQTGGTGCA 
GAAGATCTTC 
CATOOCAGCC 
T6AGAGCAA6 
GGTTGCAGCT 
GCT6GTAGTG 
GAACACCACC 
CTCACCCTAC 
CAACACAGCX: 
CTTCAATCAG 



11 
1 

CCTCTGCCTG 
GCAGTTCCCT 
TTCAGCTTCA 
GCCCTQTTGG 
GQGCCACTGT 
GGOOTTOTQG 



CCTGCCATCA 
ATGAAAGGGC 
TTCAAAGAOA 
AATGAAACCT 
CTTTTGTATG 



21 
I 

CCCACTCAGT 
CTTTCAGAAC 
TTAAGACCAT 
CAGTGGGCAT 
GGTCCAGTGC 
TCTTTGCTCT 
TGAOGTTCTT 
CCTTQQTGTA 
AGAAAGATTA 
TCAAGTGCTG 
ACAGTGCCTT 
GCACCAAGCA 
ACAT0CX3AAC 



31 

I 

GGCAACACCC 
TCACTGCCAA 
GATGATCCTC 
CTGGGTGTCA 
CATGCAGTTT 
TQGTTTOCrrG 
CTTCATCCTC 
CACCACAATO 
TGGTTCCCAG 
TGGCTTCACC 
TCCCCCATTC 
AAAGGCTCAC 
TAATGCIVGTC 



41 

I 

GGGAGCTGTT 
GAGCC CTGA A 
TTCAATTTGC 
AT06ATGGGG 
GTCAAOGTGG 
GGCTGCTATG 
CrOCTCATCT 
GCTGAQCACT 
GAAGACTTCA 
AACTATACGG 
TGTTGCAATG 
GACCAAAAAG 
ACOGTOOQTO 



780 
640 
900 
960 
1020 

loeo 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1930 
1980 
2040 
2100 
2160 
3220 
2260 
2340 
2400 
3460 
2530 
2580 
2640 
2700 
2760 
2620 
2880 
2940 
3000 
3060 
3120 



51 

I 

GWBEIS6VDB 
LVW5TCKBTP 
VGPVNKKGFY 
VNNSKEEDPP 
PPHSSTQEOG 
T6GRXDVTFM 
AVNGVSELSS 
EVKYYEKQEQ 
DSFSISGESS 
PGLRTYVDPH 
AIKTLKVGYT 
LRKHDAQFTV 
LEDDPEAAYT 
QDVZKAVDBG 
ITSAAARPSN 
MKKVQVTWG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
730 
780 
640 
900 
960 



51 

I 

TroTccrrxG 

CAGGAGCCAC 
TCATCTTTCT 
CATCCXTTCT 
GCTACTTCCT 
GT6CTAA6AC 
TCATTGCTGA 
TCCTGAOGTT 
CTCAAGTGTG 
ATTTTGAGGA 
ACAACGTCAC 
TAGAGGGTTG 
OTGTGGCAOC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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TGQAATTGGO 
ACAATAAGTC 
CCCTGGCAAG 
AATGGACCTG 
ATGCCTGACT 
6TAGCCR0TT 
TAOTGGTQAT 
AAGTQAAATC 
TOTTACAATG 



GGCCTCGAGC 
CACTTCTGCC 
CAGCAGTGAT 
CCCTTTCTGC 
TTCCTTCCAT 
CTGTT6CCCA 
CCCAGTGCTC 
AGCAGAGCCT 
TTAAAAAA 



TGGCTGOCAT 
TCTGCCACTA 
TGGGGGAGGG 
TCCAGAC1T9 
T GO TGGGTUQ 
TTCCCCCAOT 
TACTGG6GGA 
CTGGGTGQAT 



GATTGTGTCC 
CTGCTGCCAC 
GACAGQATCT 
GG6CTAQATA 
AT6GGTGGG6 
CTATTAAACC 
TGAGAGAAAG 
OTGTAQAAGG 



ATOTATCTGT 
ATGGGAACTG 
AACAATGTCA 
GGGACCACTC 
GGCATTCCAO 
CTTQATATOC 
GCATTTTATA 
CACTTCAAAA 



ACTC^AATCT 
TGAAGAGGCA 

CTTTTAG6GQ 
AGCGTCTAAa 

CCOCTAOGCC 
QCCTGGGCRT 
TGCATAAACX: 



Seq ZD NO) 604 Protein aeouence 
Protein Accession «: NP O0S718.1 



51 



1 11 21 31 41 

I 1 I I I I 

MQCP8PIKTM MILPHLLIFL OQAALLAVOI WVSIDGASFL KIPGPZ<SSSA MQFVNVC3YPL 
lAAOVWFAL GPLGCYGAKT ESKCALVTFP PILLLIPIAB VAAAWAIiVY TTMAEHPLTL 
LWPAIKRDY GSQEDPTQVW NTTMK6LKCC GFTMYTDPED SPYPKENSAP PPPCCNDNVT 
NTANETCTEQ KAKDQKVBGC PNQLLyOIRT NAVTVGGVAA GIGGLBLAAN IVSMYLYCSIL 
0 

Seq ID NO J 605 DNA eequence 

Nucleic Acid Accession «: NM_000729.2 

Coding sequence: ~" 



1 

I 

GGCTCAGCTG 
AGCCATGAAC 
GACGCAGCCX5 
GCCC06TAGG 
CXSCCCTGCTG 
CGTTAAQAAC 
CTGQATGGAT 
GCCGCCATCA 
TCACACTCAT 
TCAATGTGAA 
TOTGCAAACT 
TATGCTATTA 



11 

I 

CCGGGCTGCT 
AOCGOGQTQT 
GTGCCTCCCG 
CAGCTGAGGO 
GCAAOATACA 
CT6CA6AACC 
TTTGGCOOTC 
OOCCAAOGGA 
AACTCATTGT 
AATTGTGTCT 
GAA6ACAAAA 
AAOTGATTTC 



21 
I 

CGOGTTGGAA 
GCCTOTGOGT 
CAGATCCOGC 
TATOGCAGAG 
TCCAGCAGGC 
TGGACCGCAG 
GGAGTQOC33A 
AfiCAACCTCC 
CTGTGGAGTT 
GTAAGATTGT 

ATTCTGOC 



31 
I 

AOGCCAAGCC 
QCTGATGGCX3 
GGGCTCOGGO 
AACGGATGGC 
COGGAAAGCT 
CCACAGGATA 
GGAGTATQAO 
CAACCCAGAG 
TGACATTGAA 
CCAGTGCAAC 
CATGTGTGAC 



41 

I 

AGCTGCCGTC 
GTACTGGOGG 
CTGCA6CGGG 
QAGTCCCGAG 
CCrrCTGGAC 
AGTGAC06GG 
TACCCCTCCT 
GAGGCAGAAT 
TOTATCTATT 
CACACACGCT 
TCCTGTTCTG 



Seq ID NOi 606 Protein sequence 
Protein Accession #t llPj000720.1 



41 



51 



1 
I 

AGCACTCTCC 
ATAACCTCGG 
CAGGGCTGCT 
CTCTTTGCTC 
TCTTTGTGGT 
6GTTGGTTTC 
GCTGCAGTGA 
TGATTCTCTC 
CCATGGAGAA 
GCATTCTTOT 
A TCAC CACGG 
GOOTTCTCTA 
GGTGGGGAGG 
COQAGATA6G 
TGAGGGQATT 
GATGCTCOCT 
GTTCTTAQTG 
TGGGTCATGC 
TGOCAAGAGC 
TCTGACCCAA 
GTGAOCCATT 
GGAGGAGGAA 
CTTTTCACAA 
GTCXXTCATO 
GCTCTTCTGG 
AACTATQQGA 
TTCCCCTCAA 
TCATTGTAGC 
ATCAATATTG 
OAGXGATCAC 



11 
I 

AGCCTCTCAC 
GAGGOGGGTC 
GCCAGCACCT 
ACAAiGTTACC 
GCACATOGCT 
CAATACGGTA 
CAGCCTGTCA 
TATCATCTTC 
GGGAAACCG6 
QOQOGT G TCC 
CTATTOCXAC 
TCTOOTOCTQ 
AG6AAG0CGT 
GGAGGGGGGA 
CTCTACT8CC 
T6ATGG6GTC 
ACACACACTO 
ACTGAGGTCC 
AGATACTQTC 
AGCAAAACAT 
GTCCCTCTTT 
AGAGAGTTTG 
TGGAAAACCT 
QAGACCTCAT 
AGTTTCrCTA 
TCCAAGGGGC 
TGCAGTCAGT 
AOUVTCCTAT 
A0QATCA6G6 
CTCTT6G6GA 



21 
I 

CGCAAAATTA 
CTTCCCCTCA 
GCCACTCAGA 
AAAAAAAAAA 
ACTGTTATTA 
GATGCATCAG 
TATGCCAGTG 
TOTQTCATTO 
TTCTTCCTCT 
ATCTACACTA 
ATCCTGGGCT 
AGAAAGAAAT 
TGAA7CTG0G 
GGGGGAAGCA 
AA8CCCCTGC 
CAGAGA6CCT 
TCTGGGQCCC 
ACAGACCTAC 
TTTGTGCTGA 
CACATTGCAO 
GGAACA6ATA 
TAAGGTCATG 
GGGGGATG6T 
G0CAT6GTCT 
AAG TCAC TAG 
AOTCTTOCAA 
GTTTCTTTTA 
TACTCTTCCT 
CTCCTAGGCr 
0CCT6CCTAT 




CAGGGGCCAC 
OTCATTATGC 
GGATCTGCTT 
AAGGCCGGAC 
AGGGAAGTGG 
AAGGGGGGAO 
CCTGGGGAGA 
CCCTGCAGCC 
CATCAGCTGC 
TGCACTQAGT 
ATAGGCTAAG 
TCrSAAGTGC 
TTTAGCTCTG 
CTGGTGGGTT 
CAGAGCCCAG 
TTGCTAG60C 
TQAACAATTC 
CAGT6CCAT0 
AGTATACAAC 
CTAACATTTT 
CAGPIGGTAGC 
COCACTTCAC 



41 
I 

TACAOCAGCA 
ATACTTCCAG 
CX3CTGGGACC 
GTTGGTAITG 

GAAAAACTGT 
CAAGACAGTG 
CTTOGTGTTC 
CACACTGGTO 
GAATOQTGAT 
CTGCTTTCAGC 
GAGTTCATGO 
AGGTTGCTGT 
GTCAAATCOC 
AAGTAOTTGG 
ACCAGACTTO 
CACAACACCA 
TAAAATAGOO 
OCZGGAAGCC 
CTACTGGGGO 
TGGAATTCAG 
AGCTAAA(XA 
TOGAGACCrC 
TCTTGCXGAA 
GGVGGTAAAA 
TTAGGGTTAT 
AGGAGAQAGA 
TGAGGAAGTT 
TCTGGCTTAG 
AGQTQA6GCA 



B40 
900 
960 
1020 
1080 
1140 
1200 
1260 



51 

I 

CTAATCCAAA 
CTGG06CCCT 
CA6AGGAGGC 
OGCACCTGGG 
GAATGTCCAT 
ACTACATGGG 
AOAGGACGCA 
AAGACAACSU^ 
TATTAAQTTC 
CACCAGAAOT 
AAAATGTT6T 



1 11 31 31 

t I I I I I 

MNSGVOiCVIi MAVXAAGAUr QPVPPADPAO SOLQRABBAP RRQLRVSQRT DGESRAHLGA 
LLARYIQQAR KAP8GRMSZV KIII.QNLDP6K RI8DRDYMGH MDPGRRSA^ YBYPS 

Seq ID KOt 607 DMA sequence 

Nucleic Acid Accession tf: NM_001423.1 

Coding sequence: 2 19.. 6 92 " 



51 
I 

GAGGAAACTT 
AAGAGCGGAC 
CTTCAGAACT 
CTG6CTGGTA 
6CCAATGTCT 
ACCAACATTA 
CAGGCCTTCA 
CAGCTCTTCA 
TGCTGGCTGT 
GGAACGCAGT 
TTCATCATCG 
GGATCTGGG6 
ACAGGAAAAA 
AAACCATTAC 
CTAOTACTTT 
GCCTCCAGCT 
GCCCCACTTC 
QTACAAQTTC 
ATCCTGCCCT 

TGACAAAATG 
AGAAGGAGAC 
ACACACGGCT 
AGCCAAGGCA 
GTACCACACA 
6TTTTTAGGA 
TGGACATGGC 
TTGTCTAATT 
ACACCACCTG 
TGGCAATTCT 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



1184 



wo 03/042661 



PCT/US02/36810 



GGAAOCTCsAT TAAAAC/^CAC ATAAAOCAAA ACCAAACAAC AGGCCCTTGG GTGAAAGOTG 1860 

.CTATATAATT GTCAAGTATT AAGCCTACOG TATTTCAGCC ATQATAAGAA CAGAGTGCCT 1920 

GCATTCOCAO 6AAAATA0QA AAATGCCATG AGATAAATAA AAATATAOGT GATGGGCAGA 1980 

' lt. ' nili.rn ' AAAATAAAAA AGCAAAAACT CTTGTGGTAC CTAGTCAGJVT GGTAGAOGAO 204C 

5 C T0TCIt3CTa COQCAOGAGC ACCTCTATAC AGGACTTAGA AlGTAGTATGT TATTCCTGGT 3100 

TAAGCAGOCA TT6CTTTGCC CTGGAQCAOC TATTTTAAGC CATCTCAOAT TCTGTCTAAA 2160 

GOOOTTTTTT GGGAAGAOGT TTTCTTTATC GCCCTGAGAA C3ATCTACCCC AGGGAGAATC 2320 

TGAGACATCT TGCCTACTTT TCTTTATTAG CTTTCTOCTC ATCCATTTCr TTTATACXTT 2280 

TCCTTTTTOO GQAGTTQTTA TGCCATQATT TTTGOTATTT ATGTAAAAGO ATTATTACTA 2340 

10 ATTCIATTTC TCTATG T TT A TTCTAOTTAA GGAAATGTTQ AGGGCAAGCC ACCA AATTAC 2400 

CTAGGCTGAG GTTAGAGAGA TTGGCCAGCA AAAACTGTOa GAAGATQAAC TTTGTCATTA 2460 

TQATTTCATT ATCACATGAT TATAOAAGGC TGTCTTAGTG CAAAAAACAT ACTTACATTT 2520 

CAGACATATC CAAAGGGAAT ACTCACATTT T67TAAGAAG TTGAACTATG ACTOQ AQTA A 2580 

ACCATGTATT CCCTTATCTT TTACTTTTTT TCTOTOACAT TTATGTCTCA TOTAATTTGC 2640 

15 ATTACTCTGG TGGATTGTTC TAGTACTGTA TTGGGCTTCT TGOTTAATAQ ATTATTICAT 2700 

ATACTATAAT TCTAAATATT TTGATACAAA TQTTTATAAC TCTAOGGATA TAAAAACAGA 2760 
TTCTGATTCC CTTCAAAAAA AAAAAA 

Seq ID NO: 608 Protein aequence 
20 Protein Accession «t NP_001414.1 

1 11 21 31 41 51 

I I I I I I 

MLVLLAGIFV VHIATVIMLF VSTIANVWLV 6NTVDASVGL HKNCTNISCS DSIjSYASEDA 60 
25 LKTVQAFMII* SIIFCVIALL VFVFQLPTME KGNRFFLSGA TTLVCHLCIL VOVSIYTSBy 120 
ANRDGTQYHR GYSYILOWIC FCF8PII6VL YLVLRKK 

Seq ID NO I 609 DNA sequence 
Nucleic Acid Accession Ut NM_004961.2 
30 Coding sequence: 55.. 1575 

1 11 21 31 41 51 

1 I I I I I 

GCCAGAQCGT GAGC0GCX3AC CTCCGCGCAG GTGGTCGCGC CGGTCTCCQC GOAAATGrTQ 60 

35 TCCAAAtnrC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTOGAG GGTCGAGGGA 120 

CCTCAGACTG AATCAAAGAA TGAAGCCTCT TCCCX3TQAT0 TTGTCTATGQ CCCCCAGCCC 180 

CAGGCTCTGG AAAATCAGCT CCTCTCTGAO QAAACAAAOT CAACTGAGAC TGAGACTGGG 240 

AGCAQA6TTG GCAAACTGCC AGAAGCCTCT CGCATCCTQA ACACTATCCT QAOTAATTAT 300 

QACCACAAAC TGCGCCCTGQ GATTGGAQAQ AAQCCCACIQ TGGTCACTGT TGAGATCGCX: 360 

40 QTCAACAGCC TTGGTCCTCT CTCTATCCTA GACATG6AAT AOVCCATTQA CATCATCTTC 420 

TCCCAGACCT GGTAOQAOGA AGQGCTCTGT TACAAC6ACA CCTTTGAGTC TCTTGTTCTG 480 

AATGQCAATG TGGTGAGCCA GCTATGGATC C0G6ACACCT TTTTTAGGAA TTCTAAGAGG S40 

ACCCACGAGC ATGAGATCAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGATOQCAAG 600 

GT G TTQT A CA CAATTAGGAT QACCATTGAT QCCOGATGCT CACTCCACAT GCTCAGATTT 660 

45 CCAATGGATT CTCACTCTTG CCCTCTATCT TTCTCTAGCT TTTCCTATCC TGAGAATGAG 720 

ATOATCTACA AGTGGQAAAA TTTCAAGCTT QAAATCAATG AGAAGAACTC CTGG AAQCTC 780 

TTCCAOmG ATTTTACAGG AGTGAGCAAC AAAACTGAAA TAATGACAAC C CCAOTTGOT 840 

GACTTCATGG TCATGAOGAT TTTCTTCAAT GTQAGCAGGC GGTTTOOCTA TQTTGCCnT 900 

CAAAACTATG TCCCTTCTTC OGTQACCAOG ATGCTCTCCT GGGTTTCCTT TTGGATCAAG 960 

50 ACA0A6TCTG CTCCAGCCOQ OACCTCTCTA GGGATCACCT CTGTTCTGAC CATQACCACG 1020 

TTG6GCACCT TTTCTOGTAA QAATTTCCOG OOTOTCTCCT ATATCACAGC CTT6GATTTC 1080 

TATAT06CCA TCTGCTTCGT CTTCTGCTTC TGCGCTCTOT TGGAGTTTGC TGTGCTCAAC 1140 

TTCCTGATCT ACAACCAGAC AAAAGCCCAT OCTrCTCCTA AACTCOGCCA TCCTCGTATC 1200 

AATAOCOGTO CCCATGCCCG TACCXX3TGCA CGTTCCOQAG CCTGTGCCOQ CCAACATCAQ 1260 

55 GAAGCTTTTG TGTGCCAGAT TGTCACCACT QAGGQAAOTG ATGGAGAGaA GOCCCOGTCT 1320 

TQCXCAGCXX: AOCAGCOCCC TAQGCCAGQT AOCCCTGAGG GTGCGCGCAG CCTC TOCT CC 1380 

AA6CTGQCCT GCTGTQAGTG OTGCAAGGGT TTTAAOAAOT ACTTCXGGAT OGTGOOOSAT. 1440 

TGTGA6GGCA OTACCTGGCA GCAGGGCOGC CTCTGCATCC ATGTCTACOQ CCTG6ATAAC 1500 

TACTOGAGAG TTGTTTTCCC AGTGACTTTC TTCTTCTTCA ATGTGCTCTA CTGGCTTQTT 1560 

60 TQOCTTAACT TGTAGGTACC AQCTGGTACC CTOTGGGGCA ACCTCTCXaVG TTCCCCAGGA 1620 

GOTCCAAOCC CCTTGCCAAG GGAGTTQGGa GAAAGGAOGA 6CA6GA8CAG QAOOQACTAO 1660 

AGTTTTTCCT GCGCCATTGC CCAAACAGAA GCTTOCAQAO GGrtTOXCTf TGCTGCCCCT 1740 

CTCOCCTACC TOGCCCAT^C ACTGAGTCTT CTCAGCAQAC CATTTCAAAT TATTAATAAA 1800 

TGGGOCACCT CCCTCTTCTT CAAGGAGCAT COGTGATGCT CRGTGTTCAA AACCACAGCC 1860 

65 ACTTAGTGAT CAOCTCCCTA AAAOCATGCC TAAGTACAGG OGGATTAGCT ATCTT CCAAC 1920 

AATGCTGACC ACX»GACAAT TACTGCATTT TTCCAGAAOC OCACIATTGC CTTTGTAGIG 1980 

CTTT06QCCC AGTTCTGGCC TCAOOCTCAA AGTGCACOQA CTAGTTGCTT GOCTATACCT 2040 

GGCACCTCAT TAAGATGCTG GGCAGCAGTA TAACAGGAGO AAGAGATCCC TCTCCrTTGG 2100 

TCAOATTATT ATQTTCTCAQ TTCTCTCTCC CTGCTACCCC TTTCXCTGCA GATAGATAGA 2160 

70 CACTGGCATT ATCXrCTTTAG 6AAGAGGGGG GGGCAGCAAG AGAGCCTATT TGGGACAGCA 2220 

TTOCTCTCTC T CT GCTGCI G TGACATCTGC CICTCCTTOC TGOCTCCATC TTTCGTCTOC 2260 

ACTACCAATT CAATGCCCTT CATCCAATGG QTATCTATTT TTGTGTGT G A T TATAO TAAC 2340 

TACTCCCTGC TTTATATGCC ACCCTCTTCC TTCTCTTTQA CCCCTGTGAC TCTTTCXGTA 2400 

ACTTTCCCAO TOACTTCCCC TAGCCCTGAC CCAGGCACTA OOCCrrGGTQ ACrTCCTGGG 2460 

75 GCCAA6AAAC TAAG6AAACT OQGCTTTGCA ACAOGCATTA CT060CATTG ATTGGTGCCC 2520 

ACCC31SG6CA CACTGTOQQA OTTCTATCAC TTQCTTQACC CCTQQAOOCA TAAACCAGTC 2580 

CACTOTTATA CCOCGGGCAC TCTAACCATC ACAATCAATC AATCAAATTC CCTTAAATTT 2640 

GTATGGCACT GGAACTTTGG CAAAGCACTT TTGACAAGTT GTGTCTGATT 0GA6CTTCAT 2700 

OATAQCCTTG TGACATCTTT AGQGCAQGAT TCTTATCCXX: ATTTTQCAOA TGAAAACCCT 2760 

80 GAGTCACAGA TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA GCOCACTOTC 2820 

ACCTTCTAGA OCACATGATA GGGCTAQACA GCTCAGTTGA OCATGATTCT CTTCTGTCAC 2880 

CTCT G CTG G C ACACCAGTGG CAAGGGCCAG AATGGCQACC TCTCTTTAGC TCAATTTCIG 2940 

OQCCTGAGGT GCTCAOACTG CCCCXIAAGAT CAAATCTCTC CTGGCTGTAO TAAOCCAGTG 3000 

GAATGAATTT G6ACATG00C CAATGCTTCT ATATGCTAAG 7GAAATCXGT GTCTGTAATT 3060 

118S 



wo 03/042661 



TGTTGGG6GG TGfflVTACGQT GGG G TCTCCA ItTlACTm 'T GTOVOCATCA TCIGAAATGO 3120 
OGAAATATGT AAATAAATAT ATCAGCAAAG CAAAAAGAAA AAAAAAAA 

Seq n> NOt 610 Protein eeguence 
Protein Accession 8f NP.004953.1 

1 11 ai 31 41 51 

I I I 1 I I 

MLSKVLPVLb GILLILQSRV E6PQTBSKKB ASSRSWyQP QPQPLBNQLL SBBTKSTBTE 60 

TGSRVGKLPE ASRILNTILS NYOHKIiRPOI GEKPTWTVB lAVNSLGPIiS ILDKBYTIDZ 120 

IPSQTWYDER LCYNDTFBSL VLNCSMWSQL WIPDTFFRMS KRTHEHBITM PNQMVRIYKD 180 

GKVLYTIRMT IDAGCSLHML RPPMDSHSCP liSFSSFSYPB NEMIYKWEMP KLBINBKNSW 240 

KLFQFDPTGV SNKTBIZTTP VGDFMVMTIP FNVSRRFGYV AFQNYVPSSV TTMLSWSFH 300 

IKTESAPART SLGITSVLTN TTLGTFSRKN FPRVSYITAL DFYXAICFVP CFC3UiLBPAV 360 

UTFLIYNQTR AHASPKLRHP RXN8RASIART RARSRACARQ HQBAFVOQZV 1TB0SD0BER 420 

PSCSAQQPP8 PGSPB6PR8L CSKLACCBHC KRFKKyPOIV PDCBG8THQQ GRLCIHVYRL 480 
DNYSRWFPV TPFFEMVLYW LVCLNL 

Seq ZD NOi 611 PWA sequence 
Nucleic Acid Accession #t NM_03l9B4.l 
Ooding sequence t 572 . . 1753 "* 

1 11 21 31 41 51 

I 1 I I I I 

6CCA6AG0GT GAGC0606AC CTCCX3CGCAG GTG6T0Q0GC GGQTCTOOQC GGAAATGTTQ 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTQATCC TCCAOTOSAQ AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAO CTGATQAGTT GTCAAAAAAT GACCACAGCX3 180 

GTGTAAAGAA A6CCAAATCA A6GACG0GAA TGT GAGCRG G ACCT CAGAAG CCCTCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GQACTTCIAA CACCATCXSGO TGaAGGQACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCX3TGATGTT GTCTATG6CC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAOACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480 

CCACAAACZC C3GCXXTGQCA TTOOAOAOAA 6CCCACTGT0 GTCACTGTTQ A6ATCTC06T 540 

CAACAOOCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

OCAQACCTGG TAG6ACXUUVC GCCTCTGTTA CAA06ACACC TTTGAGTCTC TTGTTCTGAA 660 

TOQCAATGTG GTGAGCCAGC TATGGATCCC GGACACCTTT TTTAGQAATT CTAAGAGOAC 720 

CCAOQAGCAT GAGATCACCA TGCCCAACCA GATGGTCCGC ATCTACAAGG AT6GCAAGGT 760 

GXTOTACACA ATTAGGATGA GCATTGATGC 06GATGCTCA CTCCACATGC TCAGATTTCC 840 

AATGQATTCT CACTCTTGCC CTCTATCTTT CTCTAGCTTT TCCTATCCTG AGAATGA6AT 900 

GATCTACAAG TGGGAAAATT TCAAGCTTGA AATCAATGAG AAGAACTCCT G6AAGCTCTT 960 

CCAGTTGGAT TTTACAGGAG TGAGCAACAA AACTGAAATA ATCACAACCC CAGTTGGTQA 1020 

CTTCATGGTC ATGACGATTT TCTTCAATGT GAGCAGGCGG TTTGGCTATG TTGCCTTTCA 1080 

AAACTATOTC CCTTCTTCOG TGACCACGAT GCrCTCCTGG GTTTCCTTTT GGATCAAQAC 1140 

AGAQTCTGCT CCAGCCXIX^GA CXTTCTCTAGG GATCACCTCT GTTCTGACCA TGACCAOSTT 1200 

GGGCACCTTT TCTOGTAAGA ATTTCCOGOQ TGTCTCCTAT ATCACAGCCT TGGATTTCTA 1260 

TATOGCCATC TGCTTCGTCT TCTGCTTCTG OOCTCTGTTG GAGTTTOCTO TGCTCAACTT 1320 

CCTOATCTAC AACCAQACAA AAGCCCATQC TTCTCCTAAA CTCCQCCATC CTCX5TATCAA 1380 

TAGCCGTGCC CATGCCCOTA CCXOTOCACO TTCCX3QA0CC TOTGCCCGCC AACATCAGGA 1440 

AGCTTTTGTG TGCCftGATTG TCACCACTQA GGGAAGTGAT GGAGAGGAGC GCCCGTCTTO 1500 

CTCAGCCCAG CASCCCCCTA GCCCAGGTAG CCCTGAGGGT CCC06CAGCC TCTGCTOCAA 1560 

GCTGGCCTGC TOTGAGTGGT GCAAGCGTTT TAAGAAGTAC TTCTGCATGG TCCCCGATTG 1620 

TOAGGGCAQT AOCTGGCAGC AGGCCCGCCT CT6CATCCAT GTCTACSSGCC TGGATAACTA 1680 

CTCGAGAGTT GTTTTCCCAG TGACTTTCTT CTTCTTCAAT GTGCTCTACr GGCTTGTTTG 1740 

CCTTAACTTG TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG 1800 

TCCAAGCCCC TT6CCAAGGG AGTTGGGGGA AAGCAOCAGC AGCAGCAGGA GCGACTAGAG 1860 

TTTTTCCTGC CCCATTCCCC AAACAQAAGC TTGCAGAGGG TTTGTCTTTQ CTGCCCCTCT 1920 

CCCCTACCTQ GCCCATTCAC TOAOTTTTCT CAGCAQACCA TTTCAAATTA TTAATAAATO 1980 

GGCCACCTCC CTCTTCTTCA AGGAGCATCC GTGATGCTCA GT6TTCAAAA CCACAGCCAC 2040 

TTA0TGATCA GCTCCCTAAA AOCATGCCTA AOTACAGQCO GATTAGCTAT CTTOCAACAA 2100 

TGCTGACCAC C3Ma\CAATTA CTGCATTTTT CCAGAAGCCC ACTATTGCCT TTGCAQTGCT 2160 

rrOGGCCCAG TTCT6GCCTC AGOCTCAAAG TGCAC06ACT AGTTGCITGC CTATACXTIGG 2320 

CACCTCATTA AGATGCTGGG CAGCAGTATA ACAGGAGGAA GAGATCCCTC TCCITTGGTC 2280 

AGATTATTAT GTTCTCAGTT CTCTCTCCCT GCTACCCCTT TCTCTGCAGA TAGATAGACA 2340 

CTGGCATTAT CCCTTTAGGA AGA06GGGGG GCAGCAAGAG AGCCTATTTG GGACAGCATT 2400 

CCTCTCTCTC TGCTGCTGTG ACATCTCCCT CTCCTTGCTG GCTCCATCTT TOGTCTGCAC 2460 

TACCAATTCA ATGCCCTTCA TOCAATGGGT ATCTATTTTT GTGTOTGATT ATAOTAACTA 2520 

CTCCCTGCTT TATATGCCAC CCTCTTCCTT CTCTTTQACC CCTGTGACTC TTTCTGTAAC 2580 

TTTCCCAOTG ACTTCCCCTA GCCCTGACCC AGOCACTAOO CCTTQGTGAC TTCCTGGGGC 2640 

CAAGAAACTA AGGAAACTCG GCTTTGCAAC AGGCATTACT 06CCATTGAT TGGTGCCCAC 2700 

CCAGGGCACA CTGTCGGAGT TCTATCACTT 6CTTGACCCC TGGAGCCATA AACGAGTCCA 2760 

CTGTTATACC CG GGGC ACTC TAACCATCAC AATCAATCAA TGAAATTOCC TTAAATTTOT 3620 

ATGGCACTGG AACTTTGGCA AAGCACTTTT GAGAAGTTGT GTCTQATTOG AGCTTCATQA 2880 

TAGCCTTGTG ACATCTTTAG GGCAGGATTC TTATCCCCAT TTTGCA6ATG AAAACCCTQA 2940 

GTCACAGATT TCTGTGGGAC TGTGGATCTC ACTGGAAGCT ATCCAAGAGC CC3VCTGTCAC 3000 

CTTCTAGACC ACATGATAQG GCTAGACACC TCAOTTCACC ATGATTCTCT TCT6TCACCT 3060 

CTGCTGGCAC ACCAGTGGCA A6GCCCAGAA IG6C6A0CTC TCTTTAGCTC AATTTCTQGO 3120 

CCTGAGGTGC TCAGACTQCC COCAAQATCA AATCTCTCCT 6GCTGTA6TA ACCCAGTGQA 3180 

ATGAATTTGG ACATGCCCCA ATGCTTCTAT ATOCTAAGTO AAATCTGTOT CTOTAATTTO 3240 

TTGGGGGQTG GATAGGGTGG GGTCTCCATC TACtTTTTGT CACCATCATC 1GAAATGGG6 3300 
AAATATGTAA ATAAATATAT CAGCAAAGC 

Seq ID NO: 612 Protein sequence 
Protein Accession 0i NP_068ei9.1 



1186 



wo 03/042661 



1 11 21 31 41 51 

I I I t I ) 

MEYTIDIIFS QTWYDBRIiCy NDTFBSLVLN GNWSQLWIP DTPFRNSKRT HEHEITIIFKQ 60 

KVRIYKZX3ICV LYTZRMTZDA GC8XiH14LRFP MDSHSCPLSP SSPSYPQICM lYKHENFXLB 120 

INEKNSWKLF QLDPTOVSNK TBIITTPVGD FHVKTIPPDV SRRPQYVAFQ NYVPSSVTTM 180 

LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTPSRKNFPR VSYITAIiDFy lAICPVFCPC 240 

AIiL EPAV IJJP LIYNQTKAHA SPKURHPRIN SRAHARTRAR SRACARQHQB APVCQIVTTB 300 

GSDG&ERPSC SAQQPPSPGS PEGPRSIiCSK lACCEWCKRF XKYFCSfVPDC EGSTHQQARL 360 
CIHVYRliDNY SRWFPVTPP FFHVLYHLVC UIL 

Seq ID NO I 613 DMA sequence 
Nucleic Acid Acceesion 0: NM_0219B7.1 
Coding sequence; 572.. 1657 " 

1 11 21 31 41 51 

I i 1 I I I 

GCCAGAGOOT GAQCXXSCGAC CTCXX3CX5CAO GTGGTCGOGC CXK3TCTCCGC GGAAATOTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTOOAO AACATOTATA 120 

GMSAOAAQTO CTCAAATCAT AAQTGTACAG CTOATGAGTr 6TCAAAAAAT GACXACAOOQ 180 

GT6TAAAGAA AGCCAAATCA A66AGCGQAA TQTGAGCAGO ACCTCAGAAG CL ' CCL ' mm ' 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GQACTTCTAA CACCATOGGG TCOAGGGACC 300 

TCAGACTGAA TCAAAGAATQ AAQCCTCTTC COGTQATGTT GTCTATGGCC CCCAGCCCCA 360 

6CCTCTGGAA AATCA6CTCC 7CTCTGAGGA AACAAAGTCA ACTQAQACIO AGACTGGQAG 420 

CAOAOrrGGC AAACTGCCAO AAOCCTCTCG CATGCTQAAC ACTATCCIGA GTAATTATGA 480 

CCACAAACTG GGCCCTGGCA TTGGAQAiQAA GCOCACTGTQ GTCACTOTTO AG A T C TCOGT 540 

CaACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG AATTCTAAGA GGACCCACGA GCATGAGATC ACX^^TGCCCA ACCAGATGGT 660 

CCGCATCTAC AAGQAT66CA AGGTGTTGTA CACAATTAGG ATQACCATT6 ATGCCX3GAT0 720 

CTCAC TCCAC ATGCTCAGAT TTOCAATGQA TTCTCACTCT TGGCCTCTAT CTTTCTCTA6 780 

CTTTTCCTAT CCTGA6AATG AGATQATCTA CAA0TG6GAA AATTTCAAGC TT6AAATCAA 840 

TGAGAAGAAC TCCTGGAAOC TCTTCCAGTT TGATTTTACA GGAGTGAGCA ACAAAACTQA 900 

AATAATCACA ACCCCAGTTG GTGACTTCAT GGTCATGAOG ATTTTCTTCA ATGTOAGCAO 960 

GOGGTTTGOC TATGTTGCCT TTCAAAACTA TCTCCCTTCT ' TCOGTGACCA CGAT6CTCTC 1020 

CTGGGTTTCC TTTTGGATCA A0ACA6AGTC IGCTCCAOGC OGOAOCTCTC TAOGQATCAC 1080 

CTCT G TTCTO ACCATGACCA 03TTGG0CAC CTTTTCTOOT AAGAATTTCC CX30QT6TCTC 1140 

CTATATCACA GCCTTGGATT TCTATATCX3C CATCTGCTTC GTCTTCTOCT TCTGCGCTCT 1200 

GTTGQAGTTT GCTQTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC ATGCTTCTCC 1260 

TAAACTCOGC CATCCTOGTA TCAATA GCOG TQCCCAT6CC CGTACCCGTG GAOJTTCCCG 1320 

AGOCTQTGCC OQCCAACATC AG6AAGCT7T TGrOTQCGAG ATTQTCACCA CTQAGGGAAa 1380 

TGATGGAGAG GAGOOCCCXST CITGCTCAGC CCA6CAGCCC 0CTA6CCCA6 6TAGCCCTGA 1440 

GGGTCCCCX3C AGCCTCTGCT CCAAGCTGGC CTGCTGTGAG TGQTGCAAGC OTTTTAAGAA 1500 

GTACTTCTGC ATGGTCOCOG ATTGT6AGGG CAGTACCTGG CAGCAGGGCC GCCTCTGCAT 1560 

CC ATGTC TAC C3GCCTGGATA ACTACTC36AG AOTTGTm'C CCAGTGACTT TCTTCTTCTT 1620 

CAATQTGCTC TACTGGCTTG TTTGGCTTAA CTTGTAGGTA CCASCTGGTA 0CCTGT66G0 1680 

CAAOCTCTCC AGTTCCGCAG GAGGTCCAAO CCCCTTGCCA AGGGAQTTGG GG6AAAGCA9 1740 

CAGCAGCAGC AOSAGCGACT AGAGTTTTTC CTGCCCCATT CCCCAAACAG AAGCTTGCAG 1800 

AGGGTTTQTC TTTGCTGCCC CTCTCOXrPA CCTGGCCCAT TCACTGAGTT TTCTCRGCAG 1860 

ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAA6GA6C ATCGGTGATG 1920 

CTCAOTOTTC AAAAOCACAO CCACTTAGTO ATCAGCTCCC TAAAACCATO GCTAAGTACA 1980 

G6G06ATTA0 CTATCTTCCA ACAATGCTGA OCACCAGACA ATTACTGCAT TTTTCCAGAA 2040 

GCCCACTATT GCCTTTGCAG TGCTTTCX3GC CCAGTTCTGG CCTCAGCCTC AAAGTGCACC 2100 

GACTAGTTGC TTQCCTATAC CTGQCACCTC ATTAAGATGC TX3GGCRGCAG TATAACAGGA 2160 

G GAAG AGATC CCTCTCCTTT GGTCAGATTA TTATGTTCTC AGTTCTCTCT CCCTOCTACC 2220 

CCTTTCTCTQ CAOATAOATA GAGACTGGCA TTATCCCTTT AGGAAGAGGO GGGGGCA6CA 2280 

AGAGAGGCTA TTTGGGACAG CATTCCTCTC TCTCTGCTGC T0TGAC9V7CT CCCTCTOCTT 2340 

GCTGGCTCCA TCTTTOOTCT GCACTACCAA TTCAATGCCC TTCATCC3VAT GGGTATCTAT 2400 

TTTTGTGTGT GATTATAGTA ACTACTCCCT GCTTTATATQ CCACCXTCTT CCTTCTCTTT 2460 

GACCCCTOTG ACTCTTTCTG TAACTTTCCC A6TGACTTCC CCTA6CCCT0 ACCAGGCACr 2520 

AGQCCTTOer GACTTCCTGG GGCCAAGAAA CTAAGGAAAC TGG6CTTT6C AACAGGCATT 2580 

ACTOGCCATT GATTGGTGCC CACCXaGGGC ACACTGTCGG AGTTCTATCA CTTGCTTQAC 2640 

CCCTGGACCC ATAAACCAGT CCACTGTTAT ACCCGGGGCA CTCTAACCAT CACAATCAAT 2700 

CAATCAAATT CCCTTAAATT TGTATGGCAC TGQAACTTTO GCAAAGCACT TTTGACAAGT 2760 

T GTGTC TQAT TGGAOCTTCA TGATAGCCTT GTGACATCTT TAGGGCAGGA TTCTTATCCC 2820 

C3VTTTTGCAG ATGAAAACCC TGAGTCACAO ATTTCTQTGO GAGTGTGQAT CTCACTGGAA 2680 

GCTATCCAAO AOCCC3VCTOT CACCTTCTAG ACCACATGAT AGGGCTAGAC AGCTCAOTTC 2940 

ACCATGATTC TCTTCTGTCA CCTCTGCTQO CACACCAGTG QCAAGGCCCA GAATGGOQAC 3000 

CTCTCTTTAG CTCAATTTCT GGGCCTGAGG TGCTCAGACT GCCXX!CAAQA TCyU\ATCTCT 3060 

CCTGGCTGTA GTAACCCAOT GGAATGAATT TGGACATGCC CCAATGCTTC TATATGCTAA 3120 

GTGAAATCTO TGTCIGTAAT TTGTTGGGGG OTGGATAGGG TGGGGTCTCC ATCTACTTTT 3180 
TGTCACCATC ATCTGAAATQ GGGAAATAT6 TAAATAAATA TATCAGCAAA GC 

Seq ID NO: 614 Protein sequence 
Protein Accession #: NF_068822.1 

1 11 21 31 41 51 

I I I I i I 

MEYTIDIIFS QTWNSKRTKE HBITMPNQMV RIYKDGKVLY TIRMTIDAGC 8LHMLRFPMD 60 

SHSCPLSFSS FSYPQIEMIY KHENFKLBZN BKNSHKLFQF DFTGVSNfCTE IITTPVGDFM 120 

VNTIFFHVSR RFGYVAFQNY VP8SVTTMLS NVSFHIKTES APARTSLGIT SVLTMTTIiGT 180 

F8RXNFPRV5 YITALDFYIA ICPVFCFCAL LBFAVU7FLI YKQTKAHASP KLRHPRIN8R 240 

AHARTRARSR ACARQHQBAF V0QXVTTBG8 DGEERP8C8A QQPP8FGSPB GPRSLCSKIiA 300 

CCEHCKRFKK YFCMVPDCBG STWQQGRLCX HVYRLDNYSR WFPVTPPPF NVLYWLVCLN 360 
Ii 
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Seq ID NO: 615 DHA sequence 

Nucleic Acid Accession S: N>l_021990.1 

Coding sequence i 1309.. 3490 
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1 11 

i I 

GGCAGAGGGT GAGC06CGAC 
TCCAAAGTTC TTCCAOTCCT 
CAGAGAAGTG CTOVAATGAT 
OTOTAAAGAA AGCCAAATCA 
CACTGCCTCC CAOCAAAGGC 
CCTTGGCAQA TGGCCTTTAA 
TTTTCTTGGC TQTQGTGCAT 
TCCTGGATGG CTQT C T G TGG 
OCTGCTCTTT AOCCTCCTTC 
CACAACCAAC AAAACOSCAA 
CTCTGGCTTT TCCTCTCAGC 
TCA OGCTGAC TAGA06CCAA 
AAATGOCCrC TTCATTTGAC 
TtTTTTCTTA AATAAAAGAO 
CAGCAAGGTT TTAAAGAAAT 
CCAOTCACAA TTTAACTAGG 
GAGGCACTAG CTGGG6CCCC 
TCTAQGQTOG AGGQACCTCA 
TATGGCCCCC AGCCCCAGCC 
GAQACTGAGA CTGGGA6CAG 
ATCCTQAGTA ATTATGACCA 
ACTGrXGAOA TCTCCGTCAA 
ATT6ACATCA TCTTCTCCCA 
GAGTCTCTTG TTCTGAAT60 
AGGAATTCTA AGAGGACCCA 
TACAAGGATG GCAAGGTGTT 
CACATGCTCA GATTTCCAAT 
TATCCTQAGA ATOAGATQAT 
AACTCCTGGA AGCTCTTCCA 
ACAACCCCAG TTOQTOACTT 
GGCTATGTTG CCTTTCAAAA 
TCCTTTT6GA TCAAGACAQA 
CTQAOCATGA GCAGQTTOGQ 
ACAOCCrrGG ATTTCTATAT 
TTTGCTGTGC TCAACTTCCT 
CX5CCATCCTC GTATCAATAG 
GCCOGCXIAAC ATGAGQAA6C 
GA06AGCGCC OGTCTTGCTC 
OGCAGCCTCT GCTCCAAGCT 
TGCATGOTCC CCGATTGTGA 
TACCGCCTGQ ATAACTACTC 
CTCTACTGQC TTQTTTGCCT 
TCCAGTrCCC CAGGAG6TCC 
AGCAGQAGOG ACTAGAOTTT 
GTCTTTGCTG CCCCTCTCXC 
CAAATTATTA ATAAATGGGC 
TTCAAA AOCA CAGCCACTTA 
TAGCTATCTT CCAACAATGC 
ATTGCCmO CAOTGCTTTC 
TGCTTGGCrrA TACCTGGCAC 
ATCCCTCTCC TTTGGTCAGA 
CT6CA6ATAG ATAGACACTG 
CTATTTGGGA CAQCATTCCT 
GCATCTTTOG TCTGCACTAC 
TGTGATTATA 6TAACTACTC 
GTGACTCTTT CTGTAACTTT 
GOTGACTTCC TGGG6CCAAG 
ATT6ATTGGT GCCCACCCAO 
CCCATAAACC AOTCCACTGT 
ATTCCCTTAA ATTTGTATGG 
GATTGGAGCT TCATGATAGC 
CAGATGAAAA CCCTGAGTCA 
AAGAGCCCAC TGTCACCITC 
TTCTCTTCTG TCACXTTCTGC 
TAGCTCAATT TCTGGGCCTG 
GTAQTAACCC AGTGGAATGA 
C TQTGTClX a r AATTTOTTGQ 
ATCftTCTGAA ATGOGGAAAT 



31 31 41 51 

I I I I 

CrcOSOGCAG GTGGTOGCGC OGGTCTCOGC GGAAATGTTG 60 

CCTA6GCATC TTATTQATCC TCCAGTOGAG AACAT6TATA 130 

AAGTGTACAG CTGAT6AGTT GTCAAAAAAT GACCACAGCX3 180 

AGQACCC6AA T6TGA6CAGG ACCTCAGAAO CCCCCTTTGT 340 

AGCACTATCC GGACTTCTAA CACCATCGOT GAGTTTCATA 300 

CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT 360 

GQCTOrGGAG CTCAGGGTGG ACTCCTGTTO GGCAGCCAGT 420 

GT0QA6GACT OCIGCCTTTC CTGTTTAGAC ACCCACAAAG 480 

CCTTGATCCC CTTCCCCTGC CCCCAGT6CA AOQAGTATTA 540 

AATATTCCCA CAATTTTCTO OTCCTCTCTG GOAQAGGCOG 600 

CCTGGCCCTC TGCCTGCTCC TCACTCCTGG TTGG TG CT G G 660 

06CGACCAAC ACTAGGCAAA OQOGGCCAGC OCTGAQACAT 720 

GTGTAACATT CTTTTAAAAT ClAGGTCTTG GTTTTOTTGA 780 

TGATGATAAA A0AGGGACAQ CATAGAAAGT CCCCAAA6AG 840 

TCACAABCCT AATCTGTCAC TGTCTTATAA TTTOCTATTA 900 

TTTTGTGTTG AAAACTTGTT TTGGTTTGCT TCTGTCCCAA 960 

TACAGAGTGC AGGGCAGAGC TTCATTTTTC OTTTQAATGr 1020 

GACTGAATCA AAGAATQAAG GCTCTTCC06 TGATGTTGTC 1080 

TCTGGAAAAT CAGCTCCTCT CTQAGGAAAC AAAGTCAACT 1140 

AOTTGGCAAA CTGCCA6AAG CCTCT06CAT CCTGAAOVCT 1200 

CAAACTGOQC CCTGGCATTG GAGAGAAGCC CACTGTGGTC 1260 

CAGCCTTOOT CCTCTCTCTA TCCTAGACAT GGAATACACC 1320 

GACCTG6TAC GAOOAAOOOC TCTOTTACAA OGACACCTTT 1380 

CAAT6TGGTG A6CXA6CTAT GGATCCCGGA CACCTTTTTT 1440 

CGAGCATGAG ATCACCATGC CCAACCAGAT GGTCOSCATC 1500 

GTACACAATT AGGATGACX:A TTGATGCCGG ATGCTCACTC 1560 

G6ATTCTCAC TCTTGCCCTC TATCTTTCTC TAGCTTTTCC 1620 

CTACAAOTGO GAAAATTTCA AGCTTGAAAT CAATGAGAAO 1680 

GTTTGATTTT ACAGGAGTGA GCAACAAAAC TGAAATAATC 1740 

CATGGTCATG ACGATTTTCT TCAATGTGAG CAGGCQGTTT 1800 

CTATQTCCCT TCTTCOGTGA CCAOQATGCT CTCCTQGGTT 1860 

OTCTGCTCCA GCCGGGACCT CTCTAGGGAT CACCTCTGTT 1920 

CAO cnv itri' ootaaoaatt TCCO G Ogixyr ctcctatatc i980 

OGCCATCTGC TTCX5TCTTCT GCTTCTGOGC TCTGTTGGAQ 2040 

GATCTACAAC CAQACAAAAG CCCATGCTTC TCCTAAACTC 2100 

CCGTGCCCAT GCCCGTACCC GT6CA0GTTC CCGAGCCTGT 2160 

TTTTGTGTGC CAGATTGTCA CCACTGAGGG AAGTGATGGA 2220 

A6CCCAGCAG CCCCCTAGCC CAGGTAGCXX! TGAGGGTCCC 2280 

GGCCTGCTGT GAGTGGTGCA AGOGTTTTAA GAAGTACTTC 2340 

GOQCAOTACC TGGCAGCAGG GCCQCCTCTO CATCCATGTC 2400 

QAQAGTTGTT TTCCCAGTGA CTTTCTTCTT CTTCAATGTG 2460 

TAACTTGTAa GTACGAGCTG GTACCCTGTG GGQCAACCTC 2520 

AAGCCCXTTTG CCAAGGGAGT TGGGGGAAAG CA6CAGCAGC 2580 

TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640 

CTACCTGGCC CATTCACTGA GTTTTCTCAO CAOACCATTT 2700 

CACCTCCCTC TTCTTCAAGG AGCATCOGTG ATGCTCAGTG 2760 

GTGATCA6CT CCCTAAAACC ATGCXTTAAGT ACAGGOOGAT 2820 

TGACCACCAG ACAATTACTG CATTTTTCCA GAAGCCCACT 2880 

GGCCCAGTTC TGGCXirrCAGC CTCAAAGTGC ACOGACTAGT 2940 

CTCATTAA6A TGCTGGGCAG CAGTATAACA GGAGGAAGAG 3000 

TTATTATOTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 3060 

GCATTATCCC TTTAGGAAGA GGGG6GGGCA GCAAGAGAGC 3120 

CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 3180 

CAATTCAATG CCCTTCATCC AATGGOTATC TATTTTTOTG 3240 

CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 3300 

CCCAGTGACT TCCCCTAGCC CTQACCAGGC ACTAG6CCTT 3360 

AAACTAAGGA AACTCGGCTT TGCAACAGGC ATTACTOGCC 3420 

GGCACACTOT GGGAGTTCTA TCACTTOCTT QACCCCTGGA 3480 

TATACCOGGG GCACTCTAAC CATCACAATC AATCAATCAA 3540 

CACTOGAACT TTGGCAAAGC ACTTTTGACA AOTTGTGTCT 360O 

CTTGTGACAT CTTTAGGGCA GGATTCTTAT CCCXIATTTTG 3660 

CAGATTTCTG TGGGACTGTG GATCTCACTG GAAGCTATCC 3720 

TAGACCACAT GATAGGGCTA GACAGCTCAG TTCACCATGA 3780 

TGGCACACCA GTGGCAAGGC CCAGAATGGC GACCTCTCTT 3840 

AGGTGCTCAG ACTGCCCCCA AGATCAAATC TCTCCTrGGCr 3900 

ATTTGGACAT GCCCCAATGC TTCTATATGC TAAGTOAAAT 3960 

OGGGTGOATA GGGTGGGGTC TCCATCTACT TTTT6TCACC 4020 
ATGTAAATAA ATATATCAGC AAAGC 



Seq ZD MO: 616 Protein sequence 
Protein Accession 9t HP_066830.1 

1 11 21 31 41 51 

) I I I I I 

MBYTIDIIPS QTWYDERLCY NDTPG8LVU7 GNWSQLHIP DTPPRN9KRT HEKEITMPNQ 
MVRIYKDGKV LYTIRMTIDA OCSLHMLRPP MDSHSCPLSP SSPSYPKNQ1 lYKWQTFKLE 
INBKNSWiaP QPDFTGVSNK TBIITTPVOD FWVMTIPPNV SRHPffifVAPQ NYVPSSVTTM 
LSWV8PHZKT BSAPART8LG ZTSVLTMTTL OTPSfiXMPPR VSYITALOPY ZAXCPVPCPC 



60 
120 
180 
340 
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A LLBFAV UJP LIYNQTKAHA SPKLRHPRIN SRAHAHTRAR SRACARQHQB AFVOQXVTTB 300 

GSDQBBRPSC SAQQPPSPGS PBGPRSLCSK lACCBMCKRF KECYFCKVPDC EGSTHQQ6RL 360 

czHVYRunnr srwppvtfp ppnvlywlvc uil 

5 Seq ID NO; 617 DNA secmence 

Nucleic Acid Accession #: NM_004864.1 
Coding sequence i 26 . . 952 " 

CQGAACGAGG GCAACCTGCA CAGCXIATGCC OGGGCAAOAA CTCAGGAOGG TGAATGGCTC 60 

TCAOATGCTC CTGGTGTTOC TGGTOCTCTC GTGGCTGCCX5 CATGGGGGCX3 CCCTOTCTCT 120 

0800GA6GCX3 AGCOGOOCAA GTTTCCXX5G0 ACXXTTCAGAG TTGCACTCCQ AAGACTCCAO ISO 

1 ^ ATTCCGAGAG TTGOGQAAAC GCTACGAGGA CCTGCTAACC AOGCTGOQGa OCAACCAGAO 240 

I J CTGGGAAQAT TCQAACACCG ACCTOGTCCC GGCCCCTGCA QTCCGGATAC TCAOGCCAOA 300 

AGTGOGGCTG GOATCGGGGO GCCACXTTGCA CCTG O QTATC TCTCGGGCCG CCCTTCCOGA 360 

OG GGCT CCCC GAGGCCTCXX: GCCTTCACOG GGCTCTQTTC OGGCTGTCCC OGAOGOCOTC 420 

AAOOTOGTQG GAOGTGACAC GACCGCT0C3G GOGTCAQCTC AOCCTTGCAA GACCCC3UU3C 480 

GCCXX3CGCTG CACCTGOGAC TOTOOCOQCC GCOGTOQCAQ TCGGAOCAAC TGCTGGCAGA 540 

ZU ATCTTCGTCC GCACGGCCCC AGCTOOAOTT GCaCTTGOOG COQCRAGCOG OCAGGGGGCG 600 

CGQCAGAGCG CXntK3GCGCA AOGGQQACGA CTGTCOGCTC GGGCXXXKJOC GTTOCTGCCG 660 

TCTQCACACG GTCOGCGCGT CGCTGQAAGA CCTGGGCTGG GCOSATTGGQ TGCTGTCGCC 720 

AOGGGAGGTG CAAOTGACCA TGTGCATCX3G CGCGTGCCOG AOCCAGTTOC GGGOGGCAAA 780 

- - CATGCACGCG CAGATCAAQA OOAOCCTOCA OOGGCTQAAO CCOQACAOGO A6CCA0CGCC 840 

Zj CTCCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCrCATT CAAAAOACCG ACACOGGGGT 900 

QTCGCTCCAG ACCTATQATQ ACTTGTTAGC C3VAAQACTGC CACTGCATAT GAGCAOTCCT 960 

GGTCCTTCCA CTGTGCACCT GCGCGGCX5GA GGCGACCTCA QTTGTCCTGC CXrTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAOAC ACCOOATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTQ 1080 

an TTATTTATTA TTAATTTATT GGG6TGACCT TCTTGQGaAC TO3G0GGCTG GTCTGATGGA 1140 

JU ACTGTGTATT TATTTAAAAC TCTGOTQATA AAAATAAAGC T0TCT6AACT GTTAAAAAAA 1200 



35 



45 



AAAA 

Seq ID NO: 618 Protein sequence 
Protein Accession HP 004855.1 



1 • 11 21 31 41 51 

I I I I I I 

MPGQBLRTVN GSQHLLVLLV LSKLPHG6AI* SLAEASRASF PGPSELHSED SRFRELRKRY 60 

EDLLTRLRAH QSHEDSNTDL VPAPAVRILT PEVRLQSG6H UQiRZSRAAL PBOLPEASRL 120 

4U HRALPRLSPT ASR8NDVTRP LRRQIiSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 180 

ELHLRPQAAR 6RRRARARMQ DDCPLGPGRC C3UOTVRA6L EDLGWADWVL 8PREVQVTNC 240 

IGACPSQPRA ANMHAQIKTS IiHRLKPDTBP APOCVPASYN PHVLZQKTDT CVSLOfTYCDL 300 
LAKDCHCZ 



Seq ID nOi 619 DHA sequence 

Nucleic Acid Accession ftt NM_003979.2 

Coding sequencet 254.. 1357 " 



<n } 11 21 31 41 51 

50 I I I 1 I I 

ATAACAGCAT 6AAGTGCCOT GGAACTGOAA TAGOCGTGTC CTCTCCCTCX3 ACCCTCCCCC 60 

TCCTTGTCCC TCTGCTCACC CCTCGCTCX5T TCCCTCCCTC OGOCGAGGQC CGCCTTTATA 120 

ACAACTGCTC A GAGTG OGAQ GGOGGGATAG CTGTCCAAGG TCTCCCCCAQ CACTOAGGAQ 180 

- _ CT060CTGCT GOCCTCTTGC G0G0G6GAA6 CAGCACCAAO TTCACG6CCA AGGCCTT66C 240 

DD ACTAGGGTCC AGAAT GGCTA CAACRiOTCCC TQATGGTTGC OQCAATGGCC T6AAATCCAA 300 

GTACTACAOA CTTTGTGATA AGGCTGAAGC TTGGGGCATC GTCCTAQAAA OSGTGGCCAC 360 

AGCOGGGGTT GTGACCTCQG TGGCCTTCAT GCTCACTCTC C03ATCCT00 TCTGCAAGGT 420 

GCAGGA CTCC AACAGQQGAA AAAT6CTGCC TACTCAQTTT CTCTTCCTCC TGG G TGT GT T 480 

OOGCATCTTT GOGCTCAOCT TOGCCTTCAT CATOSGACra GAOQGQAGCA CAGOOOCCAC 540 

OU AOOCrPCTTC CTCTTTCCGA TCCTCTTTTC CATCTQCTTC TOCTGCCTCC TCOCTCATGC 600 

TGTCAGTCTG ACCAAGCTOQ TCOSGGGGAG QAAGCCCCTT TOCCTGTTGG TOATTCTGGG 660 

TCTGGCCGTG GGCTTCAGCC TAGTCCAGGA TGTTATOGCT ATT6AATATA TTGTCCTGAC 720 

CATGAAT AGQ ACCAAOGTCA ATGTCTTTTC TGAGCTTTCC GCTCCTCGTC GCAATOAAGA 780 

^- CTTtOTCCTC CT6CTCACCT AC3GTCCTCIT CTTQATGOCa CTGACCTTCC TCATGTCCTC 840 

OD CITCACCTTC TOTOGTTCCT TCAC3GGGCT0 GAAGAGACAT OGGCCCCACA TCTACCTCAC 900 
GATGCTCCTC TCCATTGCCA TCTGGOTOOC CTGOATCACC CTGCTCATGC TTCCTOACTT 



960 



TGACCGCAGG TGGQATGACA CCATCCTCAG CTCCGCCTTQ GCTGCCAATG GCTGGGTGTT 1020 

CCTOTTGQCT TATGTTAOTC COaAGTTTTG GCTGCTCSkCA AAGCAAOQAA ACCXX3VTGGA 1080 

~ TTATOCTOTT GAGGATGCTT TCTGTAAACC TGAACTGGIG AAGAAGAGCT ATQGTGTG G A 1140 

fV GAACAGAGCC TACTCTCAAO AGGAAATCAC TCAAG GT TTT GAAGAGACAG OGQACACGCT 1200 

CTATGCCCCC TATTCCACAC ATTTTCAOCT 0CA6AACCAQ OCTCCCCAAA AGQAATTCTC 1260 

CATCCCACGG GCCCACGCIT GGCCGAGCCC TTACAAAGAC TATQAAGTAA AOAAAGAGGG 1320 

CAGCTAACTC T6TCCTGAAG AGTQGGACAA ATGCAGCOGO 60Q6CAGATC TA0G6GGAGC 1360 

TCA AAGGG AT 6TGG6CXSAAA TCTTGAGTCT TCTGAQAAAA CTGTACAAGA CACTA06GGA 1440 

/J AGAOTTTGCC TCCCTCCCAO GCTCAAOCAC AATTCTTCCA TGCTG GGG CT GATQTOGQCT 1500 

AOTAAGACTC CAGTTCTTAG AGGOOCTQTA GTATTTTTTT IT r mmVi ' CATCCTTTCG 1560 

ATACTl'C m' TAAGTQGGAG TCTCA6GCAA CTCAAGTTTA GACCCTTACT ClllimTTT 1620 

GTTTTTT6AA ACAGGATCTT GCTCTGTCAC CCAGGCTPOA 0T6CAGTGGT GOQATCACAa 1680 

CCCAGTGCAO CCTOSAOCAC CTOTGCTCAA GCAATCCTCC CATCTCCATC TCOCAAAGTG 1740 

OU CTGGGATQAC AGG06TGAGC CACAGCTCCC A0CCTA66CC CTTAATCTTG CT6TTATTTT 1800 

CCATGGACTA AAGOTCTGGT CATCTGAGCT CAOGCTGGCT CACACAGCTC TAGGOGCCTG 1860 

CTCCTCTAAC TCACAGTGGG TTTTOTQAGG CTCrOTGGCC CAGAGCAQAC CTQCATATCT 1920 

GAGCAAAAAT AGCAAAAGCC TCTCTCAGCC CACTGGCCTG AATCTACACT GGAAGCCAAC 1980 

TT6CTG6CAC CCCOQCTCCC CAAOCCTTCT TQCCT6GGTA OGAGAGGCTA AAGATCACCC 2040 

1189 
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TAAATTTACT 
ATTCACAGGT 
TAATCTCCCC 
CTCCTTGTCA 
CTCATCTTGC 
GTGG6CA,TGG 
GCAATAAAGA 



CATCTCTCTA 
CACCCCTCTC 
CTAOGCTCTG 
GCAGAATTTO 
ACOCCA ACCT 
TCTCCTAATO 
TGTGGTG6GC 



GTGCTGCCTC 
TTCTTGCACT 
CCAGGAATTC 
TAGATCATTC 
CTGTAAATAO 
GAGGAGTGTT 
ACTCinCAT 



ACATTQGGCC 
GTCCCCAAAC 
TTTCAQACCT 
TCACTTCAAA 
ATTTAGOGCA 
CATT6TATAA 
GQTGOltnCA 



TCAGCAGCTC CCCAGCACCA 
TTQCTGTCAA TTCCGAGATC 
CACTAGCACA AGCCOSGTTG 
TTCCTQQGOC IGATACTTCT 
TTTACXIGCTO CATTCTSTAA 
TAAOTTATTC ACCTGAGTAT 
GCSUUUUUUUl AAAAAA 



Seq ZD NOt 620 Protein sequence 
Protein Accession «t NP_00a970.I 



KATTVPDGC31 
RXUCMLPTQFL 
KIiVRGRKPLS 
LTYVLPUIAL 
DDTZLSSAIA 
SQEBXTQ6PB 



11 
1 

NGUCSKVyRL 
PLLGVLGZFG 
LLVILGLAVG 
TPIitSSPTFC 
ANGWVFLLAY 
BIODTIiYAPy 



21 
I 

GDKABANOIV 
LTFAFZZ6IS 
FSLVQDVZAZ 
GSFTGHKRBG 
VSPEFHLLTK 
8THFQLQKQP 



31 
I 

LETVATAGW 
GSTGPTRPPZi 
BYZVLTMNRT 
AHZYLTMUiS 
QRNPMDYPVE 
PQKBF8ZPRA 



41 
I 

TSVAFMLTLP 
FGZXiPSZCFS 
HVNVFSELSA 
lAZHVAWITL 
DAFCKPQLVK 
HAMFSFYKDY 



51 
I 

IhVCXVQDSS 
CLZiAHAVSLT 
PRUNED PVUi 
LMLFDFDRRW 
K8YGVENRAY 
EVKKBGS 



Seq ZD NOt 621 DMA sequence 

Nucleic Acid Accession NM_002423.2 

Coding 8eq:uencet 48*. 651 "" 



1 

1 

ACCAAATCAA 
TGCTGTGTGC 
6AG6CATGAG 
ATQACTCAGA 
TCTTTGGCCT 
CCAGATGTGQ 
CTTOCAAAGT 
TGGAT06ATT 
GGAAAGTT6T 
ACTOCTACCC 
GTCT06GAGG 
GGATTAACTT 
GCTCTQATOC 
AACTTTGCCA 
GAAAGAAATA 
TGTTGCACAA 
CTTTTTTOXT 
AT6GTGTGAC 
AT6TTACATA 



11 

1 

CCATAGGTCC 

7GAGCTACA0 
AACAAAAAAT 
ACCTATAACr 
AOTGCCAGAT 
GGTCACCTAC 
A0T6TCAAAO 
ATGG6GAACT 
ATTT6ATGGG 
AC3ATGCTCAC 
CCTGTATGCT 
TAATGCAGTO 
GGATGATATT 
GAAACTTCAO 
TCAGAATTGA 
GCAGTTGOTT 
TGT8TCTTAT 
GACAAATAAA 



21 

I 

AAQAACAATT 
CTGCCTGGCA 
TG6GAACAG6 
GCCAACAGTT 
GGAATGTTAA 
GTTGCAQAAT 
AGGATCX3TAT 
GCTTTAAACA 
GCTGACATCA 
CCAGGAAACA 
TTOQATGAGG 
GCAACTCATG 
ATGTATCCAA 
AAAGQCATTC 
GCAOAACATC 
TAAGCACTGT 
TTTGAATQTC 
TCCATCTATG 
TAAAATGTTT 



31 
1 

GTCTCTGGAC 
GCCTGGCCCT 
CTCAGGACTA 
TAGAAQCCAA 
ACTCCCX5CX3T 
ACTCACTATT 
CATATACTOS 
TGTGGGGCAA 
TGATTGGCTT 
OGCTGGCTCA 
ATQAAOSCTG 
AACTTGGCCA 
CCTATGQAAA 
AGAAACTATA 
CATTCATTCA 
TCCTCCACTC 
TTTCACTCCT 
AGCTTTOTCA 
ATTOCATGGT 



41 

I 

GGCAQCTATO 
OCOOCXGCCT 
TCTCAAGAOA 
ACTCAAGGAO 
CATAGAAATA 
TCCAAATAQC 
AGACTTACGG 
AGAQATCCCC 
TGC6CGAGGA 
T6CCTTTGC0 
GAC36GATGGT 
TTCTTTGGGT 
TOOAOATOCC 
TG6AAAGAGA 
TTCATTOQAT 
CATTTAGCAA 
TTTATTQGTT 
GTGGSOOTAO 
AAATTTA 



Seq ZD NO J 622 Protein sequence 
Protein Accession #< NP_002414.1 

11 



51 



2100 
2160 
2220 
2280 
2340 
2400 



51 
1 

C6ACTCACCG 
CA5QAGGCXX3 
TTTTATCTCT 
ATGCAAAAAT 
ATGCA0AA6C 
CCAAAATQGA 
CATATTACAG 
CTGCATTTCA 
GCTCATOQ6G 
CCTGGGACAG 
AGCAGTCTAG 
ATGGGACATT 
CAAAATTTTA 
AGTAATTCAA 
TGTATATCAT 
TTATGTCACC 
AAACTCCTTT 
ATGTCAATAA 



1 11 21 31 41 

I I I I I I 

KRLTVLCAVC UiPQSLALPL PQEAG6MSEL QWEQAQDYLK RFYLYDSBTK NANSLEAKLK 
EMQXPF6LPZ TGMX^SRVZE ZMQKPRCGVP DVAEYSLFFN SPKNT8KWT YRXVSYTRDL 
PHXTVIHILVS KAIMKHGREX PLHFRXWWG TADXMZGFAR GAH6DSYPFD GPGNTLAHAF 
APGT6L6GDA HFDEDERW1D GSSLOZHPLY AATHBLGHSL GK6B88DPNA VHYPTYQNGD 
PQNFKLSQDD ZKGZQKLYGK RSHSRXK 

Seq ZD NO I 623 DMA sequence 

nucleic Acid Accession fti >IM_031457.1 

Coding sequence! 2 04.. 956 ~ 



AAACA08AAA 
CAGGAAATAG 
ATTCTCTCAG 
AG6T<%CTGC 
CT6T6TTGGT 
TQCCCC TGTA 
TGOTGTCGAA 
GGGCCATCCA 
TTCTCGTAGG 
TGTGGTTTAT 
QCCTGCTGTC 
TCATACTCTT 
CTTACGCCTQ 
TCCTGGAGTT 
AATCAA6CAA 
CAGAACC6GT 
ACAGATTCTG 
CCATAACCCA 
TCTCACCTTC 
ACAGAGATTT 
TATGTGG6CA 



11 
I 

TAAATAOGAA 
AGGACTTOG6 
GAAAAAAAAC 
TGGCAGAGCA 
OGTGGCACCC 
TCCAAACAGC 
TGTGAATGGG 
GATCATCATT 
GGAATACCTQ 
CATTTCAGGA 
TGGCAGTTTG 
CATCACAOAT 
GOOTGTGAAC 
TGGCATOGCA 
TGTGAGTGTC 
OACCTCACCA 
OAAGCATCTT 
OOTOOTTCCT 
ATTCTTCAAT 
TAAACAGATG 
TCCA6CCTCT 




GGCTTQAACA 
CTAAGTATTC 
CCrOGAATGO 
TQOOCATCTT 
ATCTATCCAA 
CCAAGTTATT 
TCACTGGGAC 
GTTCTGACAG 
TCAGTCTAGG 
TTAACCAA6A 
GGGOCCTTQG 



31 
I 

CTCTAAGCAO 
CCCTACCACC 
CAGCAAA6AA 
CGATGACTTC 
ATCCTGTGAC 
TCCACCTAGT 
AGAAAGCTCr 
ACATCGGOCT 
TCTAOGGAGG 
TQGCAQCAGA 
TOGTCAGTGC 
CCCACCCATA 
CQATTTCTGG 
CCCACTTTGG 
ACATCTATGC 
CCAGTQAGAT 
CAAAAGAAGT 
CTOAGGAAAC 
AAACCATGCT 
GGGACTCCCX 
CACACACACA 



41 

I 

CATGTAACCT 
CAACTGGOCC 
AAGGAATAGO 
AGCAGTTCC6 
CCCAGGAA7T 
TCCTGGGAAC 
GAAAGAAGGC 
GGGCTCCATC 
CTTTCCCTTC 
AAATCAGCCA 
AATCTGCTCT 
TGCCTACCCC 
OGTGCTGCTG 
CTGCCAGTTG 
AGCAAACCCA 
CCAAGCAAAT 
CCTCCTCCCT 
GTCTCTCCCA 
GTTTCTCTAT 
AQGQCACA7G 
TTCGTQTGCT 



60 
120 
180 
240 
300 



60 
120 
ISO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 
240 



51 

! 

G60CTGCATC 
CAGTACATTC 
ATCAAGAGAT 
GTGGCCAATT 
ATGTCTCAOG 
CCACCTAGTT 
AAAACCTTGG 
ATGG06ACGQ 
TGG6GAGGCT 
TATTCTTATT 
GCAGTT6GA0 
GACTATTATC 
GTCTTCTGCC 
GTCTOCTGTC 
GTGATCACCC 
AAGTAAGGCT 
TTCTGGGCTT 
CPGTTTGTAC 
CAAGAAGAAO 
GATCAGCACA 
CTGCTGCATQ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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TGAGCTTGTG GGTTAGAGGA ACWUITATCT AGACATTCAA TCTTCACTCT TTCAATTSTG 1320 
CATTCKTTTA ATAAATAGAT ACT6AGCATT CAAAAAAAAA AAAAAAAAA 

_ Seq ID NO: 624 Protein sequence 
D Protein Accession S: NP_113645.1 

1 11 21 31 41 51 

I 1 I I I I 

- - KNSMTSAVPV ANSVLWAPB NOYPVTPQIM SHVPLYPNSQ PQVHLVPGNP PSLVSNVNGQ 60 
XU PVQKALKEGK TLQAIQIIZG LABIOUSSIN ATVLVOBYLS Z8FY06FPFW GGLWPIZSGS 120 

LSVAAEKQPY SYCLLSGSLG LNIVSAIC8A VGVZLPZTOL 8ZPKPYAYPD YVPYAWCVHP 180 
GMAISGVLLV FCUiBFGIAC ASSHFGOQLV OOQSSNVSVZ YPNZYAANPV ZTPEPVT8PP 240 
SYSSBIQANK 

15 Seq ID NOt 625 DWA sequence 

Nucleic Acid Accession «t NM_D05221.3 
Coding sequence i 1 . . B70 " 

^rt I 11 21 31 41 51 

20 I I I I I I 

ATGACAQQAO TGTTTGACAQ AAGGGTCCCC AGCATCCGAT COGGOSACTT CCAAGCTCCG 60 

TTCCAGAOGT CCGCAGCTAT GCACCATCOG TCTCAGGAAT OQCCAACTTT GCCCQAGTCT 120 

TCAGCTACOG ATTCTGACTA CTACAQCCCT A0GGGGG6AG CCC06CA0QG CTACTGCTCT 180 

- . CCTACCTOSG CTTCCTATGG GAAAGCTCTC AACCOCTACC AGTATGAQTA TCACGGOGTG 240 
ZD AAOSQCTGOO CGGQGAOCTA CCCAGCCAAA GCTTAT6C00 ACTATAGCTA OGCTAGCTCC 300 

TACCaCCAGT ACGOaSGCQC CTAC3UICCX3C GTCXXaAOOQ CCACCAACX31 OCCAGAGAAA 360 

GAAGTGACCG AGCCG6AGGT GAGAATGGTG AAT6GCAAAC CAAAOAAAGT TCOTAAACXX: 420 

AGGACTATTT ATTCCAGCTT TCAOCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 480 

TAGCTGGCCT TGCOGGAAOG GGCCX3AGCIG GC060CTCGC TGGGftTTQAC ACAAACACAO 540 

GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATQAA AAA0QGG6AG 600 

ATG0C0CX3GG AGCACAGTCC CAGCTCCAGC GACCCAATGG CX3TGTAACTC GCXX3CAGTCT 660 

CCAGOGQTOT GGGAGCCCCA GGQCTOGTCC CXSCTCXSCTCA GCCACCACCC TCATGCCCAC 720 

CCTCOJACCT CCAACCACTTC CCCAG03TCC AOCTACCTGO AGAACTCTGC ATCCTGGTAC 780 

ACAAG TGCAO CXAGCTCAAT CAATTOCC!AC CTGCGGC06C OGGGCTCCTT ACAGCAOCGO 840 
CTG60GCTG0 CCTCCGGGAC AGTCTATTAG 



30 
35 
40 
45 



55 
60 
65 



75 
80 



Seq ID NO I 626 Protein sequence 
Protein Accession #s NP_O0S212.1 

1 11 21 31 41 51 

I ) I i I I 

MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTIjPES SATDSDYYSP TGGAPHGYCS 60 
PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGQAYNR VPSATMQPBK 120 
EVTEPEVRMV NGKPKKVRKP RTIYSSFQLA ALQRRFQKTO YLALPERAEL AASU3LTQTQ 180 
VRIWF(»9KRS KIKKIMKNQB NPPEHSP8SS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 240 
PPTSNQSPAS SYLEN8A8WY T8AAS8IM8H LPPPGSLQHP tiALASGTLY 



Seq IS NOi 627 DNA sequence 
Nucleic Acid Accession #t NM_014420 
50 coding sequence: 116.. 792 ~ 



1 11 31 31 41 51 

I I I 1 I I 

GCACGAGAGA OGAOQTGCTG AGCT6CCA0C TTAGTGGAAG CTCTGCTCTO GGTGGAGAGC 60 

AGCCTCGCTT TG6TGAOQCA CAGTGCTGGG AGCCTCCAGG AGCCC06GGA TTGAAGGATQ 120 

GTGGOGGCOQ TCCTGCTGOa GCTGAGCTGG CTCTOCTCTC CCCTGGQAGC TCTGGTGCT6 180 

GACTTCAACA ACATCAGQAG CTCTGCTGAC CTOCATGGGG CCCGOAAGGG CTCACAOTOC 240 

CTOTCTOACA CGGACTGCAA TACCAGAAAa TTCTGOCTCC AGCCC0GC3A TGAGAAGCCO 300 

TTCTGTGCTA CATGTOGTGG GTTGCGGAGO AGOTGCCRGC GAGATGOCAT GTQCT6CCCT 360 

OGGACACTCT OTOTQAAOGA TGTTTGTACT ACGATGGAAQ ATGCAACCCC AATATTAGAA 420 

AGGCAGCTTG ATGAGCAAGA TGGCACACAT GCAGAA6GAA CAACTOGGCA GCCABTOCAO 480 

GAAAACCAAC CCAAAAGGAA GCCAAGTATT AAGAAATCAC AAGGCAGGAA GGGACAAGAO 540 

GGAGAAAGTT GTCTGAOAAC TTTTGACTGT GGCCCTGGAC TTTGCTOTGC TOGTCATTTT 600 

TGGACGAAAA TTTGTAAGCC AGTOCTTTTQ GAGGGACAGG TCTGCTCCAG AAGAGGGCAT 660 

AAAGACACTG CTCAA6CTCC AOAAATCTTC CAGCaTTOOG ACTGTGGCCC TGGACTACTG 720 

TOT06AA6CC AATTGACCAG CAATOSGCAG CAIGCT08AT TAA6AQTAT0 CCAAAAAATA 780 

GAAAAGCTAT AAATATTTCA AAATAAAGAA GAATCCAOIT T6CAAAAAAA AAAAAAAAAA 840 
A 



70 Seq ID NO: 628 Protein sequence 
Protein AccesGion 9i NP 055235 



1 11 21 31 41 51 

I I I t I I 

KVAAVLLGLS WLCSPLGALV LDFNNIRSSA OLHGARRGSQ CbSDTDQrTR KFCLQPRDBR 60 
PFCATCRGLR RROQRDAMCC PGTLCVNDVC rTMEDATPIL BRQU3BQDGT BABOTTaHPV 130 
QENQPKRKFS IKKSQGRKGQ EQBSCLRTFD 09PGLCCARH FNTKICRPVL LBGQVC8RR0 180 
HKDTAQAPBI FQRCDCGPGL LCRSQLTSNR QHARLRVCQK ZEKL 

Seq ID NOt 629 DNA sequence 

Nucleic Acid Accession fti NM_002448.1 

Coding sequeneei 241.. 1134 

1 11 21 31 41 51 

1191 
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1 I 1 I I i 

GOGOGAGTGC TCCGOGGAAC TCTGCCTGOQ CGQCCSOCAGC GAC0GGA06C CAGGCCCAQC 60 

ACGCOGQAOC TGGCCT6CT0 GGGAGGGGCO GGAGG060QC G0GGC3AGGQT COQCCCGGCC X30 

AGGCCCCGGG CCCTCGCAGA GGCXX3GCOSC GCTCCCAQCC CGCXTOGGAGC CCATGCCOGG 180 

5 0G6CTQGCCA OTGCTGCGOC AGAAGGGGGO GCCOGQCTCT GCATGQCCCC GGCTQCTGAC 240 

ATGACTTCTT TGCCACTCGQ TGTCAAAGTG GAGGACTCOQ OCTTCGGCAA GCOGGOGGGG 300 

GGAGGCSGCGQ GCCAGGC3CCC CAOCX5CG6CC G0GGCCACX3G CAOCCGCCAT GGGOGCGGAC 360 

GAGGAGGGGO CCAA6CCCAA AGTGTCCCCT TCGCTCCTGC CCTTCAGOGT GQAOGOGCTC 420 

ATGGCOGACC ACAGGAAGCC GGGGGCCAAG GAGAGCQCCC TGGCX5CCCTC CX5AGGG0GT0 480 

10 CAGGOGGaSO GTGGCTOGGC GCAGCCACTG GGCGTCCOGC OGGGGTC36CT GGGAGCOXO 540 

GAOGGOGOCT CTTOGC06CG 6006CTG0GC CATTTCTOQG T6GGGGQACT CCTCAAGCTG 600 

CCAOAAfiATG CX3CI0GTCAA AGCOQAGAGC GCOGAGAAGC CCGAGAGGAC CCOGTGGATO 660 

CA6AQCCCCC GCTTCTCCCC GCOSCCGGCC AGGOGGCTGA GCCCCXX»QC CTGCAOXTC 720 

CGCAAACACA AGAOGAACOG TAAGCOGOGO A00CC3CTTCA CCACOGCX^CA GCTGCTGGOG 780 

IS CTGGAGOQCA AGTTGC6CCA GAAGCA6TAC CTGTCCATCX3 CGQAG06GQC GGAGTTCTCC 840 

AGCT06CTCA GGCTGACTGA 6A06CAGGTG AAQATATOGT TGCAOAAOCO C0QCX3CCAAG 900 

GCAAAOAGAC TACAAGAQGC AOAGCTGOAO AAGCTGAAGA TGGCOGOCAA GCCCATGCTG 960 

CCACOGQCTG CCTTCGGCCT CTCCTTCCCT CTOGGOGGOC COGCAGCTGT* AGOOGCCGCXS 1020 

GCGGQTGCCT CGCTCTACGG TGCCTCTGGC CCCTTCCAGC GCGCOGCGCT GCCTGTGGCX3 1080 

20 CCOOTGGGAC TCTACAOGGC CCATGTGGGC TACAGCATCT ACCACCTGAC ATAGAGGGTC 1140 

CCAGGTCCCC ACCTGTGGGC CAGCOQATTC CTGCAOCOCr GGTOCT6TAC CCC 0BAC3 GTQ 1200 

CTCCCCTGCr COGCACOQCC AGCCGCCTTC CCTTTAACOC TCACACTGCT CCAGTTTCAC 1260 

CTCTTTOCTC CCTGAGTTCA CTCTCCGAAG TCTGATCCCT GCCAAAAAGT GGCTGQAAGA 1320 

OTCCCTTAQT ACTCTTCTAG CATTTAGATC TACACTCTOG AGTTAAAGAT GGGGAAACTG 1380 

25 AGGGCAGAGA GGTTAACAGA TTTATCTAGG GTGCCCA6CA GAATTQACAO TTGAACAGAO 1440 

CTAQAGGCCA TGrCTOCTQC ATAGCTTTTC CCIOTCCTGA CACCAGOCAA GAAAAGOGCA 1500 

QAGAAATCGG T6TCTQAGGA TTTTGGAAAT GAGAACAATC TCAAAAAAAA AAAAAAAAAA 1560 

AAAAAAAAAA GAAAAGAGAA AAAAAAGACT AGCCAGCCAG GAAOATGAAT CCTAGCTTCT 1620 

TCCATTGOAA AATTTAAGAC AAGTTCAACA ACAAAACATT TGCTCIGGGG GGCAGGGAAA 1680 

30 ACACAGAT6T GTTGCAAAGG TAGGTTGAAO GGA 

Seq ID KOi 630 Protein sequence 
Protein Accession «: 2iP_o 0243 9.1 

35 1 11 21 31 41 51 

I i i i t I 

MTStiPIXSVKV EDSAF6KPAO GGAGQAPSAA AATAAAMGAD EB6AKPKVSP SLLPFSVEAL 60 

MADHRKPGAK ESALAPSBGV QAAGGSAQPL GVPPGSLGAP DAPSSPRPLG HPSVGGLLKL 120 

PEDALVKAES PEKPERTPWM QSPRPSPPPA RRLSPPACTL RKHKTNRKPR TPPTTAQLLA 180 

40 LERKFRQKQY liSIAERAEPS SSLSLTETOV KIWFQKRRAK AKRXiQBAELE KLKMAAKPMZ* 240 
PPAAFOLSFP LGQPAAVAAA AGA8LYGA80 PFQRAALPVA PVGLYTARVQ ySKYHLT 



Seq ID NO: 631 DNA sequence 
Kucleic Acid Accession #i NM_002557.l 
45 Coding sequence « 13.. 204 9 

1 11 21 31 41 51 

I i i ) ) I 

CAGACCATTG AQATGTGGAA GCTGTTGCTG TGGQTTGGGC TGOTTCTTGT GCTGAAACAC 60 

50 CAaSATGGTG CTOCCCATAA ACTCOTGTGT TATTTCACX3^ ACTGGGCACA CAOTOGGCCA 120 

GGCCCTGGCT OSATCTTGCX: CCATGACCTO OAOCCCTTTC TCTGC3\CCCA CCTGATATTT IBO 

GCCTTTGCCT CAATGAACAA CAATCAGATT QTT6CTAAGG ATCTCCAGGA TGAGAAAATT 240 

CTCTAOCCAG AGTTCAACAA ACTAAAGGAG AGGAACAGAG AGCTQAAAAC ACTACTGTCX: 300 

ATOGGCGGGT GGAACTTTOG CACCTCAAGA TTCACCACTA TGTTGTCC3VC ATTTGCCAAC 360 

55 GOTGAAAAGT TTATTGCTTC AGTTATATCC CTTCTGAGGA CACATGACTT TGATG OTCTT 420 

GAOCTTTTCT TCTTAXATCC TGGACTAAGA GGCAOOCGCA T6C3VIt3AO06 GTGGACTTTT 480 

CTCTTCTTAA TTGAAGA6CT OCTGTTTGOC TTCOGGAAGG AGGCACTGCT CACCAT6CXSC S40 

COGAGGCTGC TGCTGTCXGC TOCTGTTTCT GGGGTCCCAC ACATOGTCCA AACATCCTAT 600 

- QATOTGOGCT TTCTAGGAAQ ACTCCTGGAT TTCATCAATG TCTTGTCTTA TGACTTACAT 660 

60 QGAAGTTG6G AAAGGTTCAC AGGACATAAT AGCCCCXTOT TCTCTCTGCX: TGAAGACCCC 720 

AAATCTTOSQ CATATOCTAT OAATT A TTG O AGAAAOCTTG GGGCACCCTC AOAQAAGCTC 780 

ATCATGGOOA TCCCOVGCTA TGGACX?TAGC rrrOGCCTCC TCAAAQCTTC TAAOAATGGO 840 

TTGCAGGCCA QA6CX3ATGGG ACCAGCATCT OCAGGGAAGT ACACCAAGCA AGAAGGCTTC 900 

^ TTGGCTTATT TTGAGATTTG TrCCTTTGTC TGGGGAGOGA AQAAOCACTG GATTQATTAC 960 

65 C3Un'ATGTCC GSTATGOCAA CAAGGGGAAA QAGZGGGTTG 6CTATGACAA TGCCATCAGC 1020 

TTC3U3TTACA AG6CATGGTT TATAAGGOGA OAGCATTTTG 6GG0GQCCAT OGTaTQGACA 1080 

TT6QACATGG ATGAGGTCAO GGGCA06TTC TGTGGCACrG GCCCTTTCCC CCTTGTCTAC 1140 

GTATTQAATO ATATCCTGGT QCGGGCTGAG TTCAGTTCAA CTTCTTTACC ACAATTTTOG 1200 

CTGTCATCTO CTGTQAATTC TTCAAGCACT GACCCTGAAA GGCTGGCTGT QACCAOGGCA 1260 

70 TGGACCACTG ATAOTAAGAT TTTGOOCCCA GGAGQAGAGG CTGGGGTCAC TGAGATCCAC 1320 

GGAAAGTGTG AAAATATGAC TA13UV0CCCT AGAGGTACAA CrOTGAOOOC TACAAAGGAA 1380 

ACTGTATCCC TTGGAAAGCA CACTOTAOCT CTAGQAGAGA AOACTQAQAT CACIGGGGCA 1440 

ATQACCATGA CTTCTGTGGG TCATCRGTCC ATGACCCCTG GAGAGAAGGC CCTGACCCCT 1500 

GTGGGTCATC AATCTGTGAC CACTGGACAG AAOACCCTQA CCTCTGTGGG TTATCAGTCT 1560 

75 GTOACCCCTO GGGAAAA6AC CCTGACCCCT GTGGGTCATC AGTCTGTGAC CCCTGTGAGT 1620 

CATCAGTCTG TGA6CCCT6G AGGAACGACT ATGACCCCTG TCCATTTTCA OACTGAGACC 1680 

CTTAGACAGA ATACAGTGGC CCCTAGAAGG AAGGCTGTGG CCCGTGAAAA GGTGACT6TC 1740 

CCCTCCAGAA ACATATCAGT CACCCCTQAA GGGCAGACTA TGCCTTTAAG AGGGGAGAAT 1800 

TTQACTTCTG AGQTGGQCAC TCACCCCAGG ATGGGTAACT TGGGTCTTCA GATGGAAGCT 1860 

80 GAAAACAGGA TGATGCTGTC CTCCAGCCCC GTCATCCAGC TCC06GAACA AACTOCTCTA 1920 

* QCTTTTGACA ACC6CTTTOT TCOCATCTAT GOAAACCATT CCTCTGTCAA CTCAOTAACC 1980 

CCTCAAACAA GTCCTGTTTC TCTAAAAAAA GAAATCCCAO AAAAC TCTGC TGTGGAT6AA 2040 

GAAGOCTAAG CCCCTCTGGT GTCAGAAACC AGOGAAAACC CTTGTCTTTT CTTCTAAOTO 2100 

ACATGTTGQA AGCCTTCTCA TCCGGGGGCA AAGCAGGCAT CAAAACCA6A ATAGQCCAAT 2160 
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5 
10 
15 
20 
25 



35 
40 
45 
50 
55 



CTCTTTTCCA TTAAATAAAC TGTAAACACA AGAACOOW 

Seq ZD NOj 632 Protein sequence 
Protein Accession ft; llP_00254d.l 

1 II 21 31 41 51 

I I i I I } 

MWKLLLWVQL VLVLIOiHDGA AHKLVCYFTN WAHSRP6PAS ILPHDLDPPL CTHLI PAPAS 60 

MNNMQIVAXD LQDEKILYPB FMKLKBRIiRB UCTLLSIGGN NFOTSRPTTM LSTPANRBKP 120 

lASVXSLLRT HDFDGU>LFF LYPGLRGSFN HDRHTFLPLI BELLFAFRKB ALLTMRPRLL 180 

LSAAVSGVPH IVQTSYDVRP LORLLDPINV LSYDLHGSWE RPTGHNSPLP SLPBDPKSSA 240 

YAMNYWRKI/S APSEKLIMGI PTYGRTPRLL KASKNGLQAR AIGPASPQKY TKQEGFLAYF 300 

EICSFVWGAK KHHIDYQYVP YAKKOKEHVO YDNAISFSYK AHFIRREKFG GAMVWTLDMD 360 

DVRGTPCX3T6 PPPLVYVUID ILVRABPSST SLPQFHLSSA VNSSSTDFBR LAVTTAWTTD 420 

SKIZiPPGGEA GVTBIHGKCS NMTITPR6TT VTPTKBTVSL 6KHTVAZ/3EK TBITGAMTMT 480 

SVCHQSMTPG EKALTPVGHQ SVTTGQKTLT SVOYQSVTPQ EKTLTPVGHQ SVTPVSHQSV 540 

SPGGTTMTPV HFQTETLRQN TVAPRRKAVA REKVTVPSRN ISVTPBGQTM PIiRGKNLTSE 600 

VGTHPRMGNL GLQMEAEHRM MLSSSPVIQL PBQTPXAFDN RFVPIY6NHS SVNSVTPQT8 660 
PI«SLXXBIPE NSAVDEEA 

Seq ID NOt 633 DHA sequence 

Nucleic Acid Accession #: NM_003885.1 

Coding sequence: 98.. 1021 "* 



11 31 31 41 51 

I ) I I t ! 

AAACTCAGAA TTTTOGOGGO CTC3GGTGAGC OOTTTTATCC CTCCGGCOC3G CAGGCTGGGC 60 

GCAOGGGGCXS AGCCCCCGCC OGOCX5CX3CAG CA6CACCATQ G6CACG0T0C TGTCCCTOTC 120 

TCCCAGCTAC COQAAGGCCA OQCTOTnOA GQATGOOSOO 6CCAC06T0G GCCACTATAC 180 

DK) GGCOSTACAG AACAGCftAGA A0QCCAA6GA OVAGAACCTO AAGCGCCACT CXATCATCTC 240 

CGTGCTGCCT TGGAAGAQAA TOGTGGGCGT GTOGGCCAAO AAGAAGAACT CCAAGAAG6T 300 

GCAQCCTAAC AGCAGCTACC AQAACAACAT CAOGCACCTC AACAATGAGA ACCTGAAQAA 360 

GTCX3CTQT00 TQCX3CCAACC TOTCXaCATT OGCCX»GCCC CXACCX3GCCC AGCXrOCCTGC 420 

ACCCCOGGGC AOCCAGCTCT C U WritAJCA OACOGGGQGC TCCTCCTCAG TCAAGAAAGC 480 

CCCTCACCCT GCCGTCACCT CCGCAGGGAC GCCCAAAC6G GTCATCX3TCC AGG06TCCAC 540 

CAGTGAQCTG CTTCOCTGCC TGGGTQAGTT TCTCT6CCGC CGGT6CTACC GCCTGAAGCA 600 

CCTGTCCCCC ACGGACCCCG TGCTCTGGCT GCGCAGOOTO GACCGCTCQC T6CTTCTGCA 660 

GGGCTQGCAG GACCAGGGCT TCATCACOCC 6GCCAA0GT0 OTCTTCCTCT ACATGCTCTO 720 

GAGGGATGRT ATCTCCTCOG AGOTOGQCTC GOATCACQAa CTCX»GGCCQ TCCT6CTQAC 780 

ATGCCTGTAC CTCTCCTACT OCTACATGGG CAAOSAGATC TCCTACCCGC TCAAOCCCTT 840 

CXrrGGTGGAG AOCTGCAAGG AGGCCTTTTG GGACCX3TTGC CTCTCTGTCA TCAACCTCAT 900 

GAGCTCAAAG ATGCTGC3VGA TAAATGCOGA CXXACACTAC TTCACACAGQ TCTTCTCOOA 960 

CCTGAAGAAC OAQAGOGGCC A6QAGGACAA GAAGCGGCTC CTCCTAGGCC TGGATC6GTG 1020 

AGCAC TOTAG CCTGGOTCAT GGCTCAAGQA TTCAATGCAT TTTTAAGAAT TTATTATTAA 1080 
ATCA6TTTTG TGl'ACAG 



Seq ID NO I 634 Protein sequence 
Protein Accession #t NP_003876.1 

1 11 21 31 41 51 

) I I I I I 

M3TVL8LSPS YRKATLFEDG AATVGHYTAV QNSKKAKDKN LKRHSIISVL PWKRIVAVSA 60 
KKKNSKKVQP NSSYQNNITH UTNENXiKKSL 8CANL9TFAQ PPPAQFPAPP ASQLSGSQTG 120 
GSSSVKKAFH PAVTSAGTPK RVIVQAST8E LLRCLGBFLC RRCmLKHLS PTDPVLWLRS 180 
VDRSLLLQGH QDQ6FZTPAN WFLYMLCRD VX8SEVGSDH BLQAVLLTCX YLSYSYMGNE 240 
XSYPliKPFLV ESCKEAFHDR CLSVXMLMSS KMLQZNADPH YFTQVFSDUC NBSGQEDKKR 300 
XiLLGUR 
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nsf. UnlqwEospnbeseiidenaivnuirtMr 

ExAoen: Exeiniibr Amnion number, Genbadtamsslannvnitar 

IMgsnalD; IMgena number 
(Mgene IWe: Unlgene gene OOe 

SeqlDNo: Sequence identificalkmnunitKfllnUiiglnlbiinalkin In TeUe79A to s 





ExAccn 


UnigenelD 


424212 


Kli4.005814 


Hs.143131 


424503 


NiyL00220S 


Hs.149609 


416007 


M13509 


Hs^169 


416007 


M13509 


H5.83169 


418738 


AW388633 


Hs.6602 


443846 


A(085198 


Hs,164226 


40S95^ 


AW103384 


H5.727 


422867 


L32137 


Hs.1584 


444381 


BE387335 


HSJ283713 


421562 


A1910275 


Hs^0470 


411789 


AF245505 


Hs.72157 


452281 


T93500 


Hsj28792 


428698 


AA852773 


Hs.334638 


421552 


AF026692 


H3.1O570O 


425247 


NII4_005940 


Ks.155324 


432201 


AI538813 


Ks.298241 


447377 


X77343 


Hs.334334 


446921 


AB012113 


Ks.16530 


418868 


AU076801 


Hs.89436 


432179 
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SeqlDNo.C157&C372 
SeqIONaC158&C373 
SeqiDNo.C159&C374 
SeqiDNaC160&C375 
SeqU)NaC161&C376 
SeqtDNaC162&C377 
SeqIONaC163&C378 
SeqtONa C164&C379 
SeqlONo.C165&C380 
SeqDNocC166&C381 
SeqlDNo.C167&C382 
SeqtONaC168&C383 
SeqtONo.C169&C384 
Seq(OKo.C170&C385 
8eqlONaC171&C386 
SeqIONo.C172&C387 
8eqtDNo.C173&C38& 
SeqlDNaC174&C389 
SeqlDNo.C175&C390 
SeqtONaC176&C391 
SeqlDNo.C177&C392 
SeqlONo.C178&C393 
SeqlONaC179&C394 
SeqlONaC180&C3S5 
SeqlDNaC181&C3% 
SeqlONo;ci62&C397 
SeqlONo.C1B3&C398 
SeqlONo.C184&C399 
SeqlDNo.C185&C400 
SeqlONo.C186&C401 
SeqlDNo.C167&C402 
SeqlONo.C188&G403 
SeqlDNo.C189&C404 
$eqlONaC190&C405 
SeqlONaC191&C406 
SeqlDNo.C192&C407 
SeqlDNaG193&G408 
SeqIDNo.C194&C409 
SeqlDNo.C195&G410 
SeqiDNo.C196&C411 
SeqlDNo.C197&C412 
SeqlDNaC198&C413 
SeqlONo.a99&C414 
SeqlDNo.C200&C415 
SeqlDNo.C201&C416 
SeqlDNo.C202&C417 
SeqlDNo.C203&C418 
8eqlDNo.C204&C419 
SeqlDNo.C205&C420 
SeqlDNaC206&C421 
SeqlDNaC207&G422 
SeqlDNaC208&C423 
SeqlDNaC209&C424 
SeqiONaC210&C4^ 
SeqiONaC211&C426 
8eqlDNo.C212&C427 
SeqlDNaC213&G428 
SeqlONaC214&C429 
SeqlONaC215&C430 
SeqlONo.C216&C431 
SeqlDKo.C432 &0433 
SeqtONo.C434&C435 



Pkey: IWque Eos probesel identifier number 
CAT number Gene duster number 
Accesston: Genbank accesston numbers 

Pkey CAT Number Accesston 

414991 1785136.1 D78831 C17898D78863 



TABl£79C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Uentifier(GI) numbers. Dunham I. el ai' refers to OiepubRcatkmenStled The DMA 

sequence of human chromosome 22.* Dunham L et a!., Nature (1999) 402:489495. 
Strand: Indicates DNA strand fhxnvfMdieacons were predicted. 
NLposiBon: Indicates nudeoUde positions of predidedexons. 

Pkey Ref Strand NLposfBon 
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404682 9797231 Minus 40977-41150 

404287 2326514 Rus 53134-53281 

404287 2326514 Phis 53134^1 

404287 2326514 Plus 53134^1 
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Table 8a 

Seq ID NOt Cl DMA sequence 
Nucleic Acid Accession it HM_005814 
5 Coding sequence: 3 45.. 1304 

1 11 21 31 41 SI 

I I I I I I 

CTACCCCTTT GT6AGCA6TC TAGGACTTTQ TACA0CT6TT AAGTAGGGAO AAGGCAGGOG 60 

lU AiGGTGGCTGO TTTAAGGGGA ACTTGAGGGA AGTAOGGAAO ACTCCTCTTQ GGACXnTTGG 120 

AGTAGGTOAC ACATGAGOCC AGCCCCAOCT CACCTTGCCAA TCCA6CT6AG GAGCTCACCT 180 

GCCAATCCAG CTGAGGCTGG GCAQAGGTGG GTOAOAAGAO GGAAAATT6C AGGGACCTCC 240 

AGTTGG6CCA GGCCAGAA6C TGCTGTAGCT TTAACCA6AC AGCTCAGACC TGTCTGGAGO 300 

CTG CCAQ TQA CAGGTTAaOT TTAGGGCAGA GAAGAAGCAA GACCATGGTG GGGAAGATGT 360 

GGCCTGTGTT GT6GACACTC TGTGCAGTCA GGGTQACCGT CGATGCCATC TCTGTGGAAA 420 

CTCCGCAGOA OGTTCTTCGG GCTTCGCAGG GAAAGAGTGT CACCCTGCCC TGCACCTACC 480 

ACACTTCCAC CTCCAGTCQA GAjGGGACTTA TTCAATQGQA TAAGCTCCTC CTCACTCATA 540 

OGGAAAOaQT GQTO^TCTGG CX3GTTTTCAA ACAAAAACTA CATCCATGGT GAGCTTTATA 600 

. _ AGAATOQOOT CAGCMATCC AACAATGCTG AGCAOTCOQA TGCCTCCA7C ACCATTGATC 660 

20 AOCTOACCAT GGCTQACAAC GGCACCTAOG AGTQTTCTGT CTCX3CTGATG TCAGACCTGG 720 

AGGGCAACAC CAAGTCAOST GTCCGCCTGT TGGTCCTCGT GCCACCCTCC AAACCAGAAT 780 

GCGGCATCGA GGGAGAGACC ATAATTGGGA ACAACATCCA GCT6ACCT0C CAATCAAAGG 840 

AGGGCTCACC AACCCXTTCAG TACAGCTGQA AGAGGTACAA CATCCTGAAT CAGGAGCAGC 900 

CCCTGOCCCA GCCA6CCTCA QGTCAGCCTG TCTCCCTGAA GAATATCTCC ACAGACACAT 960 

CGGGTTACTA CATCT6TACC TCCAGCAATG AGGAGGGGAC GCAGTTCT6C AACATCACG6 1020 

TGGCCGTCAG ATCTCCCTCC ATGAACGTGO CCX^rGTATOT GG6CAT060Q GTG6GCX3TG6 1080 

TTQCAGCCCT CATTATC3VTT GGCATCATCA TCTACTGCTG CTGCTGCCGA GGGAAGGACQ 1140 

ACAACACT6A AGACAAGGAG GATGCAAGGC CGAACCGGGA AGCCTATGAG GAGCCACCAG 1200 

^ A6CAGCTAAG AGAACTTTCC AGAGA6A6GG AGGAGGAGGA TGACTACAGG CAAGAAGAGC 1260 

3U AGAGGAGCAC TGGGCGTGAA TCXXXXSGACC ACCTOGACCA GTGACAGGCC AGCAGCAGAG 1320 

GGCGQCGGAG GAAGGGTTAG GGGTTO^TTC TCCCGCTTCC TGGCCTCCCT TCTCCTTTCT 1360 

AAGCCCTQTT CTCCTGTCCC TCCATCCCA6 ACATTQATGG G6ACATTTCT TCCCCAGTGT 1440 

CAGCTGTGGG GAACATGGCT GGCCTGGTAA GGGGQTCCCT GTGCT6ATCC TOCTGACCTC 1500 

ACTGTCCTGT GAAGTAACCC CTCCTGGCTG TGACACCTGG TGCGGGCCTG GCCCTCACTC 1560 

AAGACCAGGC TGCAGCCTCC ACTTCCCTCG TAGTTGGCAO GAGCTCCXGG AAGCACAGCG 1620 

CTGAGCATGG GGCGCTCCCA CTCA6AACTC TCCAGGQAGG OQATOCX^aC CTTGGGGGGT 1680 

6GG0GCTGTC CTGCTCACCT GTGTGCCCAg CAOCTGQAOQ GGCACCAGGT G6A GGG TTTG 1740 

CACTCCACAC ATCTTTCTTG AATGAATGAA AOAATAAOTG AGTATOCTTG GGCCCTOCAT 1800 

TGGCCTGGCC TCCAGCTCCC ACTCCCTTTC CAACCTCACT TCCOGTAQCT GCC3\GTATGT 1860 

4U TCCAAACXXrr CCTGGGAAGG CCACCTCCXSA CTCCTQCTCC ACAOOCXCTO GGGAQCTTTT 1920 

6CCCACACAC TTTCCATCTC TGOCTGTCAA TATOGTAOCT GTCCCTCC A Q GCOCATCTCA 1980 

AATCACAAGQ ATTTCTCTAA CCCTATCCTA ATT8TCCAGA TAOGTGGAAA CAATCCTGTT 2040 

ACTCTGTCXX: ACGTCCAATC ATGGGCCACA AGGCACAGTC TrCTGAGCGA GTGCrCTCAC 2100 

TQTATTAGAG OGCCAGCTCX: TTGGGGCAGG GCCTGGGCCT CATGGCTTTT GCTTTCCCTG 2160 

AAGCCCTAGT AGCTGGCGCC CATCCTAGTO GGCACTTAAG CTTAATTGGG GAAACT6CTT 2220 

TGATTGGTTQ TGCCTTGCCT TCTCTGOTCT CCTTGAGATG ATOSTAGACA CAGGGATQAT 2280 

TCCCAOCCAA ACCCAOQTAT TCATTCAGTO AOTTAAACAC QAATTOATTT AAAGT6AACA 2340 

CACACAAGGG AGCTTGCrTTO CAGATCGTCT GAOTTCTTGT GTCCTGOTAA TTCOTCTCCA 2400 

GGCCAGAATA ATTGQCATGT CTCCTCAACC CACATGGGGT TCCTGGTTGT TCCTQCATCC 2460 

OGATACCTCA GCCCTG6CCC TGCCCAGCCC ATTTGG6CTC TGGTTTTCTG GTGGGOCTOT 2520 

CCTOCXGCCC TOCCACAGGC TCCTTCTGTT TQTOSAGCAT TTCTTCTACT CTTGAGAGCT 2580 

CAG6CAG0GT TAGG6CTGCT TAGGTCTCAT GOAOCajSTGG CTGSTCTCAC CCAACTGCAG 2640 

TTTACTATTG CTATCTTTTC TGGATGATCA GAAAAATAAT TGCATAAATC TATTGTCTAC 2700 

TTGCX3ATTTT TTAAAAAATO TATATTTTTA TATATATTOT TAAATCCTTT GCTTGATTCC 2760 

AAATGCTTTC AGTAATAATA AAATT6TGGG TGG 2793 
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Seq ID NOt C2 DNA Sequence 

Nucleic Acid Accession #t Bos sequence 

Goding sequence i 1..3150 



1 11 21 31 41 51 

I ) I I i I 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CAOGCCOTQC AGCTGOGCTG GGGCCXICCGG 60 

CQCXX3ACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120 

GGCTTCAACT TAGACGCGGA GQCCCCAOCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180 

6GATTCTCAG TGGAGTTTTA CGGGC0G66A ACAGA0G66G TCAG1X3TGCT GQTGGGAaCA 240 

CCCAASGCTA ATACX»GCCA GCCAGGAOTG CTGCA0G6TG GTGCT6TCTA OCTCTGTC C f 300 

TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGO CTCTCGGCTC 360 

CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGOCTG TGGAGTACAA GTCCTTGCAG 420 

TGGTTGGGGG CAACAGTTGG AGOCCATGOC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480 

AGCTOOGOGA CAOAGAAGGA GCX»CTGAGC GAOOOOOT G O GCAOCTGCTA GCTCTCCACA 540 

GATAACTTCA CXXX3AATTCT GGAOTATGCA COCTGC06CT CAGATTTCAG CTG66CA6CA 600 

GGACAGG6TT ACTGCCAAGO ACGCTTCAQT GCOGAGTTCA CCAAQACTGG Ca?rGTGQTT 660 

TTAGGTGOAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGO TTCAGGQGCA GCTGCAGACT 780 

OSCCI^GOCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840 

TTCAGTGOrG ATGACACA6A AGACTTTQTT OCTGOTOTGC CCAAAGGGAA CCTCACTTAC 900 

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960 

on CAGATGGCCT CCTACTTTGO CTATGCAGTG GCCGCCACAQ AOGTCAATGG GGACX3GGCT6 1020 

OU GATOACrraC TGGTGGGGGC ACCCCTGCTC ATGGATOGGA CCCCTGACGO GC3GGCCTCAG 1080 

GAG6T6GGCA 0G6TCTA09T CTACCTGCAG CACCCAGGOQ GCATAOAGGC CAOQCCCAOC 1140 

CTTACCCTCA CTGGOCATGA TQAGTTTGGC OGATTTGGCA GCTGCTTQAC CCCCCTGGQQ 1200 

GACCTGGAGC AGQA TOQC TA CAATGATGTQ GCCATOGGGG CTCOCTTTGG TG0G6AGACC 1260 

CAGCAGGOAG TAGTOTTTGT ATTTOCTOOO GGOCCAGGAa OQCTOGGCTC TAAGOCTTOC 1320 
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10 



15 



20 



CAGGTTCTGC AGCCCCTGTG GGCAOCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1300 

OGAGGAOGCC GAGACCTGGA TGGC3UVTGQA TATCCTGATC TGATTGTQGO GTCCTTTCGT 1440 

GTGGACAAGG CTGTGGTATA CAGOGQCOK: CXXATOOrPOT CCGCTAGTGC CTCCCTCACC 1500 

^ ATCTTCCCOG CCATGTTCAA CCCAQAGGAO CGQAGCTGCA GCTTAOAGGG CSAACCCTOTO 1560 

D GCCTGCATCA ACCTTAOCTT CTGCCTCAAT GCTTCTOQAA AACACGTTGC TGACTCCATT 1620 

GGTTTCACAG TGGAACTTCA GCTGQACTQO CAOAAGCAQA AGGGAGGGOT A0G60QQGCA 1680 

CTGTTCCTGG CCTCCAQQCA GGCAACCCTG ACCCAGACCC TGCTCATOCA GAAT6GGQCT 1740 

CGAGAGGATT GCAQAGAGAT OAAQATCTAC CTCAGQAAOG AQTCAOAATT TCGAGACAAA 1800 

CTCTOGCOQA TTCACATCaSC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 

CA OGGC CTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920 

ATCTTGCTGG ACTGTG6AGA AGACAACATC TQTOTG O CT G ACCTGCM3CT GOAAOIGTTT 1980 

GGGGAGCAGA ACCATGTGTA CCTGGGTQAC AAOAATOOCC T6AACCTCAC TTTCCAT6CC 2040 

CAGAATGTGG GTGAGGGTGG COCCTATOAO OCTOAOCTTC 6GGTCAC0GC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAQACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160 

TTTGCOGTGA ACCAGAGCOG CCTGCTGGTG T6TGAGC1X3G GCAACCCGAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCOQGGACAC TAAOAAAACC 2280 

ATCCAGTTTG ACTTCCAQAT CCTCAGCAAO AATCTCAACA ACTOGCAAAO 0GA0GT66TT 2340 

TCCTTTCGGC TCTCOGTGGA GGCTCAGGCC CAGGTCACCC TGAAOQOTOT CTCCAAQCCT 2400 

GAGGCAGTOC TATTCCCAOT AACOGACTGO CATCCCCGAG ACCAGCCTCA GAAGGAGOAG 2460 

GACCTGGQAC CTGCTGTCCA CCATGTCTAT QAGCTCATCA ACCAAG6CCC CAGCTCCATT 2520 

A6CCAGGGT6 TGCT6GAACT CAGCTQTCCC CAGGCTCTG6 AAGGTCA6CA GCTCCTATAT 2580 

GT GACC AGAG TTACGGGACT GAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GA6TTGGATC COGAGGGTTC CCTGCACCAC CA6CAAAAAC GGQAAGCTCC AAGCCGCAGC 2700 

- TCTGCTTCCT OGGGACCTCA GATCCTQAAA TOCCX^GOAGG CrGAGTGTTT CAGGCTGCGC 2760 

ZD TGTGAOCTGG GGCCCCTGGA CCAACAAGAG A6CCAAAGTC TGCAGTTGCA TTTC06A0TC 2820 

T0GGGCAA6A CTTTCTTQCA OGGGGAGCAC CAOCCATTTA GCCT6CAQTG TGA66CT6TG 2880 

TACAAA6CCC TGAAGAT6CC CTACOGAATC CT6CCT0GGC AGCTGCCCCA AAAAGAGCOT 2940 

CAGGTG6CCA CAGCTGTGCA AT6QACCAAG GCAGAAGGCA GCTATGGC6T CCCACTGT60 3000 

ATCATCATCC TAGCCATCCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

3U TACAAOCTTO GATTCTTCAA ACXK^TCCCIC CCATATGGCA CCGGCATGGA AAAAGCTCAG 3120 

CTCAAGGCTC CAGCCACCTC TGATGCCTGA 3150 

6eq ID MO: C3 DNA Sequence 
Nucleic Acx.d Acceaeion #: NM_002421.2 
3«) Coding sequence: 1..1410 

1 11 21 31 41 51 

I i I I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGGGACTC TAGAAACACA AGAGCAAGAT GTGGACXTAG TCGAGAAATA CCTQGAAAAA 120 

TACTACAACC TGAAOAATQA TGGGAG8CAA OTTGAAAAGC GGAGAAATAG TGGCOCAGTQ 180 

GTTGAAAAAT TGAAGCAAAT GCAOOAATTC TTTOGGCTGA AAOTOACTGO GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 

GTCCTCACTO AGGGQAACCC TOGCTOGGAO CAAACACATC TQACCTACAG GATTGAAAAT 360 

TACAGGCCAO ATTTGCCAAG AGCAGATX3T6 GACCATGCCA TTGAGAAAOC CTTCCAACTC 420 

TGGAGTAATQ TCACACCTCT GACATTCACC AAGOTCTCTG AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATaSGOAC AACTCTCCTT TTGATQQAGC TGOAOQAAAT 540 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTQGAGGGQ ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAQAGAGTAC AACTTACATC GTGTTGOOOC TCATGAACTC 660 

G6CCATTCTC TTGOACTCTC CCATTCTACT 6ATAT0GGGG CTTTGAT6TA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCIAOCTCAG OATOACATTa AIOGCATGCA AGCXS^TATAT 780 

GGAOGTTGCX: AAAATCCTGT CCA6CCCATC GGCCCACAAA CGCCAAAAGC ATGTGACAGT 840 

AAOCTAACCT TTGATOCTAT AACTACX3ATT OGGGQAGAAG TQATGTTCTT TAAAGACAGA 900 

TTCTACATGC GCACAAATCC CTTCTACCXX3 GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TG GCCACAA C TGCCAAATQG GCTTGAA6CT GCTTAOQAAT TTGCCXIACAG AGATGAAGTC 1020 

CGGTTTTTCA AAOGGAATAA GTACT06GCT GTTCAGGG3U: AQAATOTGCT ACAOGGATAC 1080 

CCCA AGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTQ TGAAGCATAT C6ATGCTGCT 1140 

CTTTCT6AGG AAAACACTGO AAAAACCTAC TTCTTTGTTO CTAACAAATA CTGQAOOTAT 1200 

GATQAATATA AACOATCTAT GQATCCAGGT TATCXXMAA TGATAGCACA TGACTTTCCT 1260 

OU GGAATTGGCC ACAAAQTTGA TGCAGTTTTC ATGAAAOATO OATTTTTCTA TTTCTTTCAT 1320 

GGAACAAQAC AATACAAATT TGATCCTAAA AG6AAGAGAA TTTTGACTCT CC3U3AAAGCT 1380 

AATAGCTGGT TCAACTGCAG GAAAAATTAO 1410 

^ Seq ID NO: C4 DNA Sequence 

00 Nucleic Acid Accession #i Eos sequence 
Coding sequence i 1..1410 
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1 11 21 31 41 51 

i I I I I I 

ATGCACAGCT TTCCTCCACT GCrGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGGGACTC TAGAAACACA ASACCAAGAT OTGGACTTAO TCGAGAAATA CCTQGAAAAA 120 

TACTACAACC TGAAGAATGA TGG6A66CAA OTTGAAAAGC GGAGAAATAG TGGCCCAGTO 180 

GTTGAAAAAT TGAAGCAAAT GCAGOAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 3O0 

GTCCTCACTG AGGGGAACCC TOGCTGGQAO CAAACACATC TQACCTAOVG GATTGAAAAT 360 

TACA0GGCA6 ATTTGCCAAG AGCAGATGTO GACCATGCCA TTOAQAAAOC CTTCCAACTC 420 

TGGAOTAATG TCACACCTCT GACATTCACC AAGOTCTCTG AOGOTCAAGC AGACATCATO 480 

ATATCTTTTG TCAGGGGAGA TCATOGGGAC AACTCTCCTT TTGATGQACC TGGAGQAAAT 540 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

GAAAG OT^ CCAACAATTT CAGAGAGTAC AAC TTACA TC GTGTTO OGGC TCATGCCCTC 660 

GQCCATTCTC TTGGACTCTC CCATTCTACT OATATGGGGO CTTT6AT6TA CCCTAOCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG OATOACATTG ATOQCATOCA AGOOVTATAT 780 

GGAOGTTCCC AAAATCCTOT CCAGCOCATC GGCCCACAAA CCCCAAAA6C ATGTQACAQT 840 

AAGCTAACCT TTGATGCTAT AACTAGQATT OOGOGAGAAO TGATOTTCTT TAAAGACAGA 900 
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TTCTACATGC GCKCAAATCC CTTCTACC06 GAAOTTGACC TCAMTTCRT TTCTGTTTTC 960 

TQ6CCACAAC TGCCAAATGG GCTTGAAGCT GCTTACX3AAT TTGCCGACAG AOATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA 6TACTQGGCT GTTCAGGGAC AGAATGT6CT ACACGGATAC 1080 

CCCyUVGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTQAQO AAAACACT60 AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGOAGGTAT 1300 

GAT6AATATA AAOQATCTAT GGATCCAGGT TATGOCAAAA TGATAGCACA TGACTTTCCT 1260 

GGAArraOCX: ACAAAOTTGA TGCA O TTTTC ATGAAAOATG G A TTTTTCTA TTTCTTTCAT 1320 

GQAACAAGAC AATACAAATT TGATGCTAAA A08AA0AGAA TTTTGACTCT CX3U»AAGCT 1380 

AATAGCTGGT TCAACT6CA0 GAAAAATTAG 1410 

Seq ID NOi CS DNA Sequence 

Nucleic Acid Accesaion tft 1IMJ>14331.2 

Coding sequence i 1..1506 

1 11 31 31 41 51 

I I I I I I 

ATGGTCAGAA AGCCTGTTGT GTCCAGCATG TCCAAA8GAQ OTTACCTGCA GGGAAATOTT 60 

AAOGGGAGGC TGCCTTCOCT GGGCAACAAG GAGCXavCCTG GGCAGGAGAA AGTGCAGCTQ 120 

AAGAGGAAAG TCACTTTACT GAGGGOAGTC TCCATTATCA TTGQCACCAT CATTGGAQCA 180 

GGAATCTTCA TCTCTCXTAA GGGOGTGCTC CA6AACACGG GCAG06TGGG GATGTCTCTG 240 

ACCATCTQGA CG G T G TOTGG G G TCCTgTC A CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

6GAAGAACTA TAAAOAAATC TGGAGGTCAT TACACATATA TTTT6GAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTAOOAGT CTQGOTGOAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG AOGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTQ OGATCAAGCT CATTACA6CT GTGG6CATAA CTGTAG1X3AT G6TCCTAAAT 540 

AGCATQAGTG TCA6CTG6AG OGCCOGQATC CAGATT7TCT TAACCTTTT6 CAA6CTCACA 600 

6CAATTCTGA TAATTATAGT CCCTG6A6TT AT6GAGCTAA TTAAAGOTCA AA0GCA6AAC 660 

TTTAAAGAOG CGTTTTCAGG AAGA6ATTCA AGTATTACGC GGTTGCCA.CT GGCTTTTTAT 720 

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAOAA 780 

AACCCTQAAA AAACCATTCC CCTTGCAATA TQTATATCCA TQGCCATTGT CACCATTGGC 840 

TAT6TGCTGA CAAATGTGGC CTACTTTAOO ACGATTAATG CTGAGGAQCT GCTGCTTTCA 900 

AATQCAGTGG CAGTGACCTT TTCTGAGOQG CTACTQGQAA ATTTCTCATT AGCAGTTCOG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAAC6GTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCXATQ TTGOGTCTCG AGAGGQTCAC CTTCCAQAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCrCTGGAO ACCTOQACAO TCTTTTGAAT TTCXTCAGTT TTGCCAGGTG 6CTTTTTATT 1200 

GG6CT66CAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATOOTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCOOACCC ATTTAGTACA GOGATTGGCT TCGTCATCAC TCTQACTGOA 1380 

GTGCCTGGGT ATTATCTCTT TATTATATGG GACAAGAAAC OCAGGTGGTT TAGAATAATG 1440 

TOVGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAO TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA QATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATA6GGA 1560 

TTTTTACTTC ATTTTCTQAA AGTCTAOAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTTIAGAA ATTTAGTTAT 1680 

AACTGTATGT A6TTATA6AA AGTGAATAT6 CAGTTATTCT ATGAGTCX3CA CAATTCTT6A 1740 

GTCTCIGATA CCTACCTATT GOGGTTAGGA GAAAAGACTA GACAATTACT ATQTQ9TCAT 1600 

TCTCTACAAC ATATGTTAGC A0G6CAAAGA ACCTTCAAAT TQAAQACTGA GATTTTTCTO 1860 

TATATATGOQ TTTTQTAAAG ATGGTTTTAC AOlCTACAGA TOTCTATACT GTQAAAAQTO 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT 6ATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCGCCT TAGATACCAA TTTAOATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGOGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAQACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGQA AAQTGTATAA CACATCTTAG 2280 

AGCAAOAGTT AGTTTGGTAT TAAATCCTCA TTA6AACAAC CACCTGTTTC ACTAATAACT 2340 

TAOCCCTOAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAQAAAT AACCAACAAA GAAGATGTTC AAAATAATAO TCCCATATCX GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGA7GA 2520 
TGAATTTTOA CAOTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTQT TAAAATATCT 2580 
CrrCAGATCA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 
AAGAAATGTC GeTOTAAATA AGATTTACAA CTQATOTTTC TAGAAAATTT CCACTTCTAT 2700 
ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 
CTQATAAQAA QAAAATTQAA ATQAGAATCT GTGGATAAOT GTTlXSTGrrC AGAAGATGTT 2820 
GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 
GCACTTTGGG AGGCTGAGGG GGT6GATCAC CTGAGGTCX3G GAGTTCTAGA CCAGCCTGAC 2940 
CAACAT6GAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGG6CAT GGTGGCACAT 3000 
GCTGGTAATC TC3^CTATTG AGGAGGCTGA GGCAGGAGAA TTGCTT6AAC C06GGAGGCG 3060 
6A0GTTGCAG TGAGCCAAGA TT6CACX3VCT 6TACTCCA6C CTGGOIGACA AAGTCAOACT 3120 
CCATCTCCAA AAAAAAAAAA AAAA 3144 



Seq ID NO: C6 DNA Sequence 

Nucleic Acid Accession #i NN.003246.1 

Coding sequence ■ 113.. 3634 

1 11 31 31 41 51 

I I I I I t 

GGAOGCACAG GCATTCCCCG CGCCCCTCCA GCCCTCGCCG CCCTOGCCAC CGCTCCOGGC 60 

CGCCGCGCTC OGGTACACAC AGGATCCCTG CTOGGCACCA ACAOCTCCAC CATGG6GCTG 120 

GCCTGGGGAC TAGGOGTCCT GTTCCTGATG CATGTGTGTG OCACCAACCG CATTCCAGAG 160 

TCTG6CGGAG ACAACAGCOT GTTT6ACATC TTTGAACTCA C0GGGGCCX3C CCGCAAGGGQ 240 

TCTGOGOGCC 6ACT6GTGAA GGGCCCCGAC CCTTCCAGCC CAGCTTTCCG CATOGAQGAT 300 

GOCAACCIGA TCCCXXCTOT GCCTGATGAC AAGTTCCAAO ACCT G GTGGA TGCT G TGOGQ 360 

6CAGAAAAGG G Tr TOClCCT TCTGGCATCC CTQAG6CAGA TGAAGAAGAC CCGGOGCAOO 420 



1200 



wo 03/042661 
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20 



TGGAQOGGAA AGACCACTCT GGCCAGQTCT TCRGCX3TGGT GTCCAATGGC 480 

AAGGC3GGGCA CCCTOOACCT CAGCCTOACC GTCCAAOOAA AGCAGCACGT GGTGTCTGTG 540 

GAAOAAGCTG TCCTGGCAAC OGGCCAGTGG AAGAGCATCA CCCTGTTTGT GCAGGAAGAC 600 

AGQQO CCAGC TGTACATOQA CTGTOAAAAG ATGGAGAATQ CTGAGTTGGA CBTCCCCATC 660 

J CAAAGOGTCT TCACCAGAGA CCTGGGCAGC ATCGCCftGAC TCOGCATCGC AAAGGGGGGC 720 

GTCAATGACA ATTTCCAGGG GGTGCTGCAG AATGTOAOOT I ' miClimU AACCACACCA 780 

GAAGACATCC TCAGGAACAA AGGCTGCTCC AGCTCTACCA G TO T CV l tg r CACCCTTOAC 640 

AACAACGTGG TGAATGGTTC CAGCCCTGCC ATCOGCACTA ACTACATTGG CCACAAGACA 900 

AAOQACTTGC AAGCCATCTG CGGCATCTCC TGTGATGAGC TGTCCAGCAT GGTCCTGGAA 960 

lU CTCAGGGGCC 7GCGCACX:AT TGTOACCA06 CTGCAQOACA GCATCOGCAA AGTGACTGAA 1020 

QAGAACa^AAG AGTTGGCCAA TGAGCTQAGG CGGCCTCCCC TAT6CTATCA CAAOQQAGTT 1080 

CAGTACAGAA ATAAGGAGQA ATGGACTQTT 6ATAGCTGCA CTGAGTGTCA CXGTCA6AAC 1140 

TCAGTTACCA TCTGCAAAAA GGTGTCCTGC CCCATCATGC CCTQCTCCAA TGCXACAOTT 1200 

CXTTGATGGAG AATOCTGTCC TCOCTGTTGG CCCAGCGACT CTGCGGAOQA TGGCTGGTCT 1260 

1j CCATGGTCCG AGTGGACCTC CTGTTCTAOG AGCTGTGGCA ATGGAATTCA GCAGCOOGGC 1320 

C6CTCCTG0Q ATAGCCTCAA CAAGOGATGT GAGGGCTCCT 0Q6TGCA0AC A06GACCTGC 1380 

CACATTCAGG AGTQTGACAA AA6ATTTAAA CAG6ATG0T0 GCIGOAOCCA CTG6TCCC0G 1440 

TGGTCATCTT GTTCTGTGAC ATGTGGTGAT GGTGTQATCA CAAGGATCOS GCTCTGCAAC 1500 

TCTCCCAGCC CXXyiQATGAA TGGGAAACCC TGTGAAOGCG AAGCJGOSQQA QACCAAAGCC 1560 

TGCAAGAAA G ACQCC TGCCC CATCAATGGA GGCTGGGGTC CTTGGTCACC ATGGQACATC 1620 

TGTTCTGTCA CCT6TGGA0G AOGGGTACAG AAAGOTAOTC OTCTCTGCAA CAACCCOGCA 1680 

CCCCAOTTTG OAGQCAAGGA CTGOCSTTQGT GATGTAACAG AAAACXAGAT CTGCAACAA6 1740 

CAGGACTGTC CAATTQATG6 ATGCCTOTCC AATCCCTGCT TTGCOGQCJOT 6AAGTGTACT 1800 

AGCTACCCTG ATGGCAGCTQ GAAATOTGGT GCTTGTCCCC CTGGTTACAG TGGAAATGQC 1860 

ATCCAGTGCA GAGATGTTGA TGAGTGCAAA GAAGTGCCTG ATGCCTGCTT CAACCACAAT 1920 

GGA0A6CAGC QQTGTGAGAA CACX3GAC0CC GGCTACAACT GCCTGCCCTO CXXXXXACGC 1980 

TTCACOGGCT CACAGCCCTT GX36CCAO06T GTC6AACAT6 CCAGGOCGAA CAAACAGGTG 2040 

TGCAAGCCCC GTAACCCCTG CAOaOATGGG ACCCAOGACT GCAACAAGAA CGCCAAGTGC 2100 

AACTACCTGG GCCACTATAG CGACCCCATG TACCGCTQOS AGTGCAAGCC TGGCTACQCT 2160 

iU GGCAATGGCA TCATCTQCGQ GQAGGACACA GACCXGGATO GCTGGCCCAA TGAGAACCTG 2220 

OTGTGOGTGG GC3MTG0QAC TTACGACTGC AAAAAGGATA ATTGCCCCAA CCTTCCCAAC 2280 

TCAGG6CAG0 AAGACTATGA CAAGGATOQA ATTG6T6ATG CCTGTGATGA TGAC6ATGAC 2340 

AATGATAAAA TTCCA6ATGA CAGGGACAAC TGTCCATTCC ATTACAACCC AGCTCAOTAT 2400 

GACTATGACA GAOATGATGr GGOAGACCGC TGTGACAACT GTCCCTACAA CCACAACCCA 2460 

GATCAGGCAG ACACA6ACAA CAATGGGGAA GGAGACGCCT GTGCTGCAGA CATTQATGGA 2520 

GAOGGTATCC TCAATGAAGO GGACAACTGC CAGTA06TCT ACAATGT6GA CCAGA6AGAC 2580 

ACIGATATGG ATGGGOTTGG AGATCAGT6T GACAATTGCX: OCTTGGAACA CAAT0CX3QAT 2640 

CAGCTGGACT CTGACTCAOA COGCATTGGA GATACCTGTG ACAACAATCA GGATATTGAT 2700 

GAAGATGGCC ACCAGAACAA TCTGGACAAC TGTCCCTATG TGCCCAATGC CAACCAGGCT 2760 

4U GACCATGACA AAGATGGGAA GGOAGATGCC TGTGACCACO ATGATGACAA OGATGGCATT 2820 

CCTGATQACA AGGACAACTG CAOACTGGTG COCMTOCOG ACCAGAAOGA CTCTOACX3GC 2880 

QATGGTOGAO GTQATQCCTG CAAAGATQAT TTTOACCATG ACAGTGTGCC AGACATCGAT 2940 

GACATCTGTC CTGAQAATGT TGACATCAGT GAQACCGATT TCCXSCCGATT CCAQATGATT 3000 

CCTCTGGACC CCAAAGGQAC ATCXXAAAAT GACCCTAACT GGGTTGTACG CCATCAGGGT 3060 

AAAGAACTCQ TCCAGACTGT CAACTGTGAT CXTOQACTCO CTQTAGGTTA TaATGAGTTT 3120 

AATGCTGTGO ACTTCAGTGG CACCTTCTTC ATCAACAC06 AAAGG6A0GA TQACTAT6CT 3180 

GGATTTGTCT TTGGCTACCA GTCCAGCAGC OGCTTTTATG TTGTQATGT6 GAAGCAAGTC 3240 

ACCXaOTCCT ACTGGGACAC CAACCCCACX3 AGGGCTCAG6 GATACTOGGG CCTTTCTGTG 3300 

AAAGTTGTAA ACTCCACCAC AGGGCCTGGC GAOCACCTGC GGAAOGCCCT GTGGCACACA 3360 

G6AAACACCC CT6GCCA66T 6CGCACCCTG TGGCATQACC CTGGTCACAT AGGCTGGAAA 3420 

GATTTCACaS CCTAGAGATG GOGTCTCAGC CACAOQOCAA AGAOSGOrrT CATTAGAGTG 3480 

GTGATGTATG AAGGQAAGAA AATCATGQCT GACTCAGGAC CCATCTATGA TAAAACCTAT 3540 

GCTGGTGGTA GACTAGGGTT GTTTGTCTTC TCTCAAGAAA TGGTGTTCTT CTCTGACCTG 3600 

AAATACGAAT GTAGAGATCC CTAATCATCA AATTGTTQAT TGAAAGACTG ATCATAAACC 3660 

AAT6CTGGTA TTGCACCTTC TGGAACTATG GGCTTQAOAA AACCCCCAGO ATCACTTCTC 3720 

CTTGGCTTCX: TTCTTTTCTO TOCTTGGATC AOTGTGGACT OCTAGAAOST GOSAOCTGCC 3780 

TCAAGAAAAT GCAGTTTTCA AAAACAGACT CATCASCATT CnOCCTCCAA TGAATAAOAC 3640 

ATCTTCCAAG CATATAAACA ATTGCTTTGO TTTCCrrTTO AAAAAGCATC TACTT GC T T C 3900 

AGTTGGGAAG GTGOCCATTC CACTCTGCCT TTGTCACSW3A OCAGGGTGCT ATTGT6AGGC 3960 

OU CATCTCT 3967 



35 
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Seg ID NOi C7 DNA Sequence 

Hudelc Acid Accession ftt MM_002192 

Coding sequences 86.. 13 66 ~ 

1 11 21 31 41 51 

I I I I I I 

TCCACACACA CAAAAAACCT GOGOGTGAGG GGGGAGGAAA AGCAGGGCCT TTAAAAAGGC 60 

AA TCAC AACA ACTTTTGCTG CXaGQATGCC CTTGCTTTGG CTGAGAGGAT TTCTGTT6GC 120 

AAGTTGCTQO ATTATA0T6A GGAGTTCCCC CACCCCAGGA TC06AGGGGC ACAGCGOGGC 180 

CC0C3SACIGT C06TCCTGT6 OGCTGQCOGC OCTCOCAAAG GATOTAOCCA ACTCTCAGCX: 240 

AGAGATGGTG GAGGCCOTCA AGAAGCACAT TTTAAACATG CIGCACTTQA AGAAGAGACC 300 

CGATCTCACC CAGCOGGTAC CCAAGGCX3GC GCTTCTGAAC GCXSATCAGAA AGCTTCATQT 360 

GGGCAAAGTC GGGGAGAACX3 GGTATGTGGA GATAGAGGAT GAGATTGGAA GGAGGGCAGA 420 

AATGAATGAA CTTATGGAGC AGACCT06GA GATCATCACG TTTGCCXaAaT CAGGAACAGC 480 

CAOGAAGAGG CTGCACTTOG AQATTTGCAA G6AAG6CA6T GAOCTGTCAG TGGTGGAGC6 540 

TGCAGAAGTC TG6CTCTTCC TAAAAGTCCC CAAGGCCAAC AGQACCAGGA CCAAAGTCAC 600 

CATCOGCCTC TTCCAGCAGC AGAAGCACOC GCAGGGCAGC TTGGACACAG GGGAAGAGGC 660 

OGAGGAAGTG GGCTTAAAGQ GGGAGAGGAQ TQAACTGTT6 CTCTCPQAAA AAGTAGTAGA 720 

CXSCTOGGAAG AGCACCTGGC ATGTCTTCCC TOTCTCCAOC ACCATCCAGC GOTTGCTGGA 780 

CCA6GGCAAG AOCTCOCTG G AOGTTGGGAT T0CCTGTGA6 CAGTGCCAaO AGAGTOGOGC 840 

CAGCTTGGTT CTCCTGGGCA A8AAGAAGAA GAAAGAAaAO GAGGG6GAAG GGAAAAAGAA 900 

GGGCGGAGGT GAAG6TGGGG CAGGAGCAGA TGAGGAAAA6 GAGCAOTOQC ACAGACCTTT 960 

CCTCATGCTQ CAOGCGOGGC AGTCTGAAGA CCACCCTCAT GBOOGGGGTC GG06GGGCTT 1020 

1201 



wo 03/042661 



PCTAJS02/36810 



GGAGTGTGAT OGCAAOGTCA ACATCTGCTG TAAGAAACAG TTCTTTGTCA GTTTCAAGGA 1080 

OVTOGGCTGO AATGACTCSGA TCATTGCTCC CTCTGQCTAT CATGCCAACT ACTGCXSAGGG 1140 

TGAQTCCCOQ AGCCATATAO CAGGCACGTC CGGGTCCTCA CTGTCCTTCC ACTCAACAGT 1200 

. CATCAACXAC TACCOCATGC GOOQOCATAa COCCTTTOCC AACCICAAAT OGTGCTOTOT 1260 

J GCXXIACCAAO CTGAGAOCCA TGrCCATOTT OTACTATQAT GATGGTCAAA ACaXCATCAA 1320 

AAAGGACATT CAQAACATQA TCGTGGAGGA OTOTGGGTGC TCATAGAGTT GCCCAQCCCA 1380 

QQQGGAA AGG QAGCAAGAGT TGTCXyVQAOA AGACAQTGGC AAAATGAAGA AATTTTTAAG 1440 

GTTTCTOACT TAACCAGAAA AATAGAAATT AAAAACAAAA CAAAACAAAA AAAAAAACAA 1500 

1 n AASTAAATTA AAAACAAAOC TGATSAAACA GATGAAAC3VG ATGAAGGAAG IS 60 

L\J ATGTOGAAAT CTTAGCCTGC CTTAGCCAGG OCTCAOAOAT GAAGCAGTGA AQAGACAGAT 1620 



15 



20 



65 
70 



Seg ID NOt C9 DNA Sequence 

Nucleic Acid Accession #: XN 057014 

Coding sequences 143.. 874 ~ 



T^AGCGAA AGGOAGAATG GTGTACCCTT TATTTCTTCT GAAATCACAC TGATQACATC 1680 

AGTTQTTTAA ACGGGGTATT OTCCTTTCCC CCCTTGAGGT TCCCTTGTGA 6CTTGAATCA 1740 

AOCAATCTQA TCTGCAGTAG TGTGGACTAG AACAACCCAA ATAGCATCTA QAAAGCCATG 1800 

A6TTTGAAAG GGCCCATCAC AGGCACTTTC CTAGCCTAAT i840 

Seq ID NOi C8 DNA Sequence 

Nucleic Acid Accession tf: NM_000095.1 

Coding sequence; 26.. 2299 ~~ 



1 11 21 31 41 51 

i i 1 I i I 

CRGC ACCCAQ CTCCCCOCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTOCTCAC 60 

« - OCTOGCTGCC CTCGGCGGGT CC6GACM3GO CCAOAGCCCO TTGGGCTCAQ ACCTGGGCCC 120 

ZD GCAGATGCTT OGGQAACTGC AGGAAACCAA O0OG60GCTG CAGGACOTGC GGGACTGQCT 160 

GC3GGCAGCAQ GTCAGGGAGA TCACGTTCCT GAAAAACACQ GTGATGGAGT GTGACGCGTG 240 

OGGQATGCAO CAGTCAOTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCQC 300 

GCCCGGCTTC TGCTTCCCOG GCQTGGCCTO CATCCAGACG GAGAGCGGCG GCCGCTGCXJO 360 

CCCCTGCCCC GOSGGC TTCA CGGQCAAOGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

CQCCCACCCC TGCTTCCCCC QAGTCCXJCTG TATCAACACC AGCCOGGQGT TCCGCTGOGA 480 

GGCTTGCCCG CCGGGGTACA GCGGCCXX^VC OCACCAGGQC GTGGGGCTGO CTTTCGCCAA 540 

GGCCAACAAG CAQGTTTGCA CGGACATCAA CGAQTGTGAQ ACOGGGCAAC ATAACTGCGT 600 

CCCCAACTCC GTGTGCATCA ACACCOGGGG CTCCTTCCAG TGCGGCCOGT GCCAGCCCX3G 660 

CTTCOTOGGC OACCAGOOGT CCGGCTGCCR GCX5CGQCGCA CAQCQCTTCT GCCCCGACGQ 720 

jO CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

OTGCXSTGrOT OOCGTTGGCT GGGCCX5GCAA CGGGATCCTC TQTGGTCGOQ ACACTGACCT 840 

AOACGGCTTC CCGGACGAGA AGCT00GCTX3 CCOGGAGCOG CaCTGCCGTA AGOACAACTO 900 

OOTGACTOTG OCCAACTCAQ GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTC 960 

OQATOOQGAT OCOGAOOQGO ACGGGGTCCC CAATGAAAAG GACAACTGCC OGCTGGTCCG 1020 

W QAACCCAOAC CRG06CAACA CGGACGAGOA CAAGTQGGQC GATGOGTGCQ ACAACTGCOO 1080 

GTCCCAQAAG AAOGACGACC AAAAGGACAC AGACCAGQAC GGCCGG0GC8 ATGCGTGOGA 1140 

CQACGACATC GACGGOGACC GOATCOGGAA CCAGGOCOAC AACTOCCCTA GGGTACCCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAQGGGAT GCCTGTGACA ACTGTCCCCA 1260 

. 6AAGAGCAAC CCGGATCAGG CGQATGTGGA CCAOIACTTT GTGGGAGATG CTTOTQACAG 1320 

OGATCAAGAC a«3GATGGAG ACG0ACATC3V GGACTCTOGQ GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC CAGGAQQACT CAGACCAOJA TQG0CA6GGT GATGCCT0C3 AOQAOQAGGA 1440 

OQACAATGAC GGAGTCCCTO ACAGT0GG(5A CAACTGC06C CTGGTGCCTA ACCCCGGCCA 1500 

GGAGGACGCG aACAGGGACG GOGTGGGOGA CGTGTGCCAG QACX5ACTTTG ATGCAGACAA 1560 

GOTGGTAGAC AAGATCGACG TGTGTCCGQA GAACX3CTGAA QTCAOOCTCA CCOACTTCAG 1620 

DV GQCCTTCCAO AC3W3TCGTGC TGQACCCGGA GGGTGACXSCG CAGATTQACC CCAACTGGGT 1680 

G6TGCTCAAC CAGGGAAGGQ AOATOGTGCA GACAATQAAC AGGGACCCAO GCCIGOCTOT 1740 

GOOTTACACr GCCTTCAATG GCOTOOACTT COAGGOCACG TTCCATQTGA ACACQGTCAC 1800 

GGATQACGAC TATGCGGQCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 

- _ CATCTGGAAG C3VGATGGAGC AAACGTATTO GCAQGOGAAC CCXrrTCCGTG CTCTGQCCGA 1920 

DD GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCOGGGGAAC AGCTGCGQAA 1980 

OGCTCTGTGG CATACAGGAG ACACAGAOTC OCAOSTGGGG CTGCTGTG G A AOGACCGQGO 2040 

AAACQTGGGT TGGAAGGACA AQAAGTCCPA TOGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGGCTACATC AGGGTGCGAT TCTATGAGGO CCCTQAGCTG GTGGCOGACA GCAACOTGGT 2160 

,^ CTTGGACACA ACCATGCGGG GTGGCCX3CCT GGGGOTCTTC TGCTTCTOCC AGGAGAAOVT 2220 

OV CATCTGOGCX: AACCTGCGTT ACCX3CTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 

TCAGCTGCGa CRAGCCTAGG GACCAQGOTO AOGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GG6GGCTCT0 CACCCAGGCC AAOGGGTGGC CGTCCTGAGO GGGAAOTGAO 2400 

AAGGQCTCAG AGAGGACAAA ATAAAOTOTG TQTGCAGGO 2439 



1 11 21 31 41 51 

i I I I I I 

GGGAGGGAGA GAGGCGOGOG GGTGAAAGGC GCATTOATGC AGCCTGCGGC GGCCTCGGAG 60 

CGCGGCGGAO CCAGACOCTG ACCAOGTTCC TCTCCTCGGT CTCCTCOQCC TCCAGCTCCQ 120 

fO CGCTGCCCGG CAGCCGGGAG CCATGCX3ACC CCAGGGCCCC GCCGCCTCOC CGCAGCGGCT 180 

CCX3C3GGCCTC CTGCTGCTCC TQCTGCTGCA GCTGCCCGOG 0C8T0SA00S OCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGO CGCAGCTOCG 6CAQAGGGAG GTGGTGGACC TGTATAATGO 300 

AATGTGCTTA CAAGGGCCAG CAGGAGTOCC TGOTOGAGAC GGGAGCCXTtO GGGCCAATGG 360 

p-. CATTCOGGQT AC ACCT OGGA TCCCAGGTOG GGATGGATTC AAAGGAGAAA AGGGQGAATG 420 

OU TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGT6TT CATGGAGTTC 460 

ATTQAATTAT GGCATAGATC TTGGGAAAAT TGC6GAGTGT ACATTTACAA AGATGOGTTC 540 

AAATAGTGCT CTAA6AGTTT TGTTCAGTGO CTGACTTCGO CTAAAAT6CA GAAATOCATG 600 

CTGTCROCGT TGGTATTTCA CATTCAATGG A6CTQAATGT TCAG6A0CTC TTCCCATTGA 660 
AGCTATAATT TATTTGGAOC AAGGAAGOOC TQAAATGAAT TCAACAATTA ATATTCATGG 



720 
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TGTACAATTT 
CAACCTTAAA 



OTGGAAGQAC 
ACTTGTTCA6 
ATTATTGAAG 
ATGCCTTGGA 
AAAAGCAAAG 
GCATTATTGA 
ACTTTCTTCA 
TTTTTCTCTT 
QTAAATQTTA 
AAAAAAAAAA 



TTTGTSAAOO 
ATTACCCAAA 
AACTACCAAA 
ATGGTTCACT 
CTAAATATQT 
TTTTGCTTCA 
TAGTCACATT 
AOTATAGCAT 
AQAATTTTTT 
AAAA 



AATTGQTGCT 
AGQAGATGCr 
ATAAATGCTT 
TAAATGACAT 
TTACAOACCA 
ATCAAAA0T6 
CTCTCAACCT 
TTTTAAAAAA 
TTATATCTGT 



Q6ATTAGXGG 
TCTACTGGAT 
TAATTTTCAT 
TTTAAATAAQ 
AAGT6TGATT 
GTTTCAATAT 
ATAATTTGGA 
ATATAAAAOC 
TAAATAAAAA 



Seq ID NOt CIO DNA Sequence 
Nucleic Add Accession #i NN_003225 
Coding sequence: 41.*. 295 ~ 



1 

I 

ATCCCTGACT 
GGTOATCTQC 
GACAGAGA08 
GCCCTGGCAG 
OT6CTTCTAT 
TCTGCAGGGA 
GAGCTOOOCT 
CACAACACAG 
GATCQATATT 



11 

I 

CGGG6TCGCC 
GCCCTGGTCC 
TGTACAGTGG 
TGTGCAAATA 
CCTAATACCA 
TCTGCCTGCA 
GCCACCrCCA 
ATT6ACTGCT 
AAAAAAAAAA 



21 
I 

TTTGGAGCAG 
TGOTGTCCAT 
CCCCCC6TGA 
AGGGCTGCTG 
TGOACGTCCC 
TCCTGACGGG 
CCQQACACCT 
CT6ACTTTGA 
AAAAAAAAAA 



31 
I 

AGAGGAGGCA 
QCTGGCCCTC 
AAGACAGAAT 
TTTCX3ACGAC 
T0CA6AAGAG 
GTGCCGTCCC 
CAGACACQCT 
CTACTCAAAA 
AAAAAAAAAA 



41 

I 

ATGGCCACCA 
GGCACCCTGG 
TQTGGTTTTC 
ACG6TTCGT6 
GAGTGTQAAT 
CAGCACGGTG 
TCTGCAGCTG 
TTG6CCTAAA 
AAAAAAAAAA 



ATOTTGCTAT 
G6AATTCAGT 
TTGCTACCTC 
TTTATGTATA 
TCACACTGTT 
TTTTTTTAQT 
ATATTGTTGT 
TACCAATCTT 
TTATTTCCAA 



SI 
I 

T6GAGAACAA 
CCGAGGCCCA 
CTGGTGTCAC 
GGGTCCCCTG 
TTTAGACACT 
ATTAGTCCCA 
TGCCTCGGCT 
AATTAAAAGA 
AAAAAAAAAA 



Seq ID NO: Oil DUA Sequence 

Nucleic Acid Accession it MM_015419.1 

Coding sequence : 1 . . 8487 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1284 



1 11 21 31 41 51 

I 1 I I I I 

ATGCCCAAGC GCXSCGCACTG GGGGGCCCTC TCCGTGGT6C TGATCCTGCT TTGGGGCCAT 
CCQCGAGTGG OGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG 06AGGTCCAC 
TGCAGQTTCC GATCCCTQGC TTC08TGCCC QCTG6CATTG CTAQACAOQT GGAAAGAATC 
AATTT666GT TTAATAGCAT ACAG6CCCTG TCAGAAACCT CATTTGCAGG ACT6AGCAAG 
TTGQAGCTAC TTATGATTCA CGGCAATQAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 
CAGAGCCXCC AG6GTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAO 
TTTATCCACC CTCAAGCTTT CAAOGGCTTA AOGTCICTGA GGCTACTOCA TTTGGAAOGA 
AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCAOBT TCACATTTTT G6ATTATTTC 
AGACTCTCCA CCATAAQGCA CCTCTACTTA GCAGA6AACA TGGTTAQAAC TCTTCCTGCC 
AGCATGCTTC GGAACATGCC GCTTCTGQAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 
TGOGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 
TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAOTCCAAAG 
AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 
GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTQAGG AGGAGCAAGA ACAGGAAGAG 
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 
AATATGACCX3 ACGAGCACGQ GAACATGQTG AACTTGGTCT QTGACATCAA GAAACCAATG 
GATGTGTACA AGATTCACTT GAACCAAACX3 QATCXTTCCAG ATATTQACAT AAATGCAACA 
GTTOCCTTGG ACTTTGAQTG TCCAATGACC OOAQAAAACT ATGAAAAGCT ATG6AAATTG 
ATAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 
CCCAGAQTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACACGT 
GT6AQAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AQATATCCAG 
CTQAACCGAC GTCAGAGTAC GGCCAAQAAG GTGCTACTTT CCTACTACAC CCAQTATTCT 
CAAACAATAT CCACCAAAGA TACAAGGCAO GCTCGOGGCA GAAGCTGGGT AATGATTGAQ 
CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG 6GGGTCCATG CCA6TTGAGC 
TGCAACGTGA AAQCTTCTGA GAOTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 
CTGAAAGCGC CCATGGATGA CCCAGACA6C AA(?rTCTCCA TTCTCA6CAG TGGCTGGCTG 
AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC A6TGCATTGC TCAAGTGA6G 
GATGAAATGG A0C6CATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 
GAGAAAOACA CAOTGACAAT TGGCAAGAAC CCAGGGGAGT OGGTGACATT GCCTTGCAAT 
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 
GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAQ 
GTOCAAGTCA GTGATAGTGG TTAC7ACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 
CATTTTACGG TGGQAATCAC AGTGACCAAO AAAGGGTCTG QCTTGCCATC CAAAAGAGGC 
AGACGCCCA6 GTGCAAAGGC TCTTTCCA6A QTCAOAGAAO ACATOGTGGA GGATGAA6GG 
GGCTCGG6CA TGG6AGATGA A6A6AACACT TCAAGGAGAC TTCT6CATCC AAAGGACCAA 
GAOGTQTTGC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAA6GG 
AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAOACCAA TGTTGCAGAA 
GGTGQCAGAO TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCC6 
GAGCGCIGGG CIOATATTTT AGGCAAA8TC CQTGGGAAAA ATCTCOCTAA GGGCACAOAA 
GTACCCCOGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 
TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGT6CAGA CAGTAACCAG TGCTGAAGAA 
TCCTCA6CAQ ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 
GCCAGCATOG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 
AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA A6ACTGAGGA QATAACTTCC 
ACTGAAGOAG ACCTGAAfiOO OACAfiCAGCC CCTACACTTA TATCTGAGCX: TTATGAACCA 
TCTCCTACTC TGCACACATT AGACACAQTC TATGAAAAGC CCACCCATGA AGAGACGGCA 
ACAGAGGGTT GGTCTGCAGC AGATGTTGQA TCGTCACCAG AGCCCACATC CAGTGAGTAT 
GAGCCTCCAT TGOATOCTGT CTCCTTQGCT GAOTCTGAGC CCATGCAATA CTTTGACCCA 
OATTTX^GAGA CTAAGTCACA ACCA6ATGAG OATAAGATGA AAGAAQACAC CTTTGCACAC 
CmCTCCAA OCCCCACCAT CTGGOTTAAT GACTCCAOTA CATCACAOTT ATTTGAGGAT 
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TCTACTATAG OQQAACCAGG TGTCCCAGGC CAATCACATC TACAAC3GACT GACAGACAAC 3120 

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTOATTAA AAAQGGTATG 3180 

AAAQAGATOT CTCAGACACT ACAGGGAGGA AATATOCTAG AGGGA GACCC CACACACTCC 3240 

AGAAGTTCTO AGAGTGAGGG CC3U^GAGC AAATCCATCA CTTTOCCTGA CTCCACACTO 3300 

5 G6TATAATQA GCAGTATGTC TCCAGTTAAO AA0CCT6CQG AAACCACAGT TCGTACXXTTC 3360 

CTAGAOUUVO ACAGGACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 

ACCATQAGCA CTCAOCCTTC TOQAAQGAQA CCCAAOC50QA GAAGGAOATT ACGCCCCAAC 3480 

AAATTCOGCX: ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA QTCAAOTGGA QAGTTCTCTG 3600 

10 GTTCCTACAG CTTGGGTGGA TAACAC»GTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3660 

AATGCAGAAC CCACATGCAA GGQAACAGCA 06GAGAAAAC ACGGGAAGAG 6CCAAACAAA 3720 

CATGGATATA CCCCTTCTAC AGTQAOCTCA AOAOCOTCOJ GATCCAAGCC CAGCCCTTCT 3780 

CCAGAAAATA AACATAGAAA CATTOTTACT CCCAGTTa\a AAACTATACT TTTGCCTAQA 3B40 

ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATQAC AACCACCAGA 3900 

15 AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960 

TCAGATGGAA AAGAAATTAA GGATGATOTT GCCACAAATQ TTGACAAACA TAAAAOTOAC 4020 

ATTTTAOTCA CTGGTGAATC AATTACTAAT GCX3VTACCAA CTTCTCGCTC CTTGOTCTCC 4080 

ACTATGGGAO AATTTAAGGA AGAATCCTGT CCTGTAGGCT TTCCAG6AAC TCCAACCTGG 4140 

AATCCCTCAA GGAOGGCCCA GCCTGGGAGG CTACAQACAO ACATACCTOT TACCACTTCT 4200 

20 GGGOAAAATC TTACAQACCC TCCCCTTCTT AAAGAGCTTO AQGATGTGGA TTTCACTTCC 4260 

GAGTTTTTGT CCTCTTTGAC A6TCTCCACA CCATTTCACC AGGAAQAAGC TGGTTCTTCC 4320 

AGAACTCTCT CAA6CATAAA AGTGGAGOTG GCTTCAAGTC AGGCACAAAC CACCACCCTT 4380 

GATCAA6ATC ATCTTGAAAC CACTGTG6CT ATTCTCCTTT CTGAAACTAO ACCACAGAAT 4440 

CACACCXXTTA CTGCT6CC03 OATOAAGGAO CCAGCATCCT CX3TCXX:CATC CACAATTCTC 4500 

25 ATGTCTTTGG GACAAAOCAC CACCACTAAG CCA6CACTTC CCAGTCGAAG AATATCTCAA 4560 

GCATCTAGAQ ATTCCAAGGA AAATGTTTTC TTGAATTATQ TGQGOAATCC AGAAACAGAA 4620 

GCAACCCCAO TCAACAATOA AG6AACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 46B0 

CXXrrCTTOCG ACCGGGATGC ATTTAACTTG TCTACAAAOC TGGAATTGGA AAAGCAAGTA 4740 

TTTGGTAaTA GGAGTCTACC ACOTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 

30 GCTTCTCATC AACTAACCAG AGTGCCTGCC AAACCCATGC TACCAACAGC AACA0TGA60 4860 

CTACCTGAAA TGTCCAGACA AAOOOCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 

CACTGGACCA ACAAACCGGA AATAACTACA TATGCTTCTG OGGCTTTGCC AGAGAACAAA 4980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTOCAAA 5040 

CCCAGCATTC CTAGTAAGTT TACTQACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 

35 GTGTTTGGAA ATAACAACAT CCCTQAGGCA A6AAACCCA0 TTGGAAAOCC TCCCAGTCCA 5160 

AOAATTCCTC ATTATTCCAA TQGAAOACTC CCTTTCTTTA CCAACS^AGAC TCTTTCTTTT 5220 

CCACAQTTG6 GAGTCACCOG GAGACCCCAG ATACCCACTT CTCCT60CGC AGTAATGAOA 5280 

GAOAQAAAAG TTATTOCAGQ TTCCTACSUVC AGGATACATT CCCATAGCAC CTTCCATCTG 5340 

GACTTTGGCC CTCCGGCACC TCCGT TG TTG CACACTCCGC AGACCAOGGQ ATCACCCTCA 5400 

40 ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCASA GTTCTATCTC CTTTATAACA 5460 

TCTTCTGTCC A6TCCTC3^ AAOCTTOCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGQA 5520 

CCTGCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAOTCCCCA 5580 

CAGACTOTGr CCGTCACOGC TGAOACAOAC ACTGTGTTCC CCTGTGAGGC AACAGQAAAA 5640 

CCAAAGCXTTT TC6TTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCOQAATACC 5700 

45 AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACX:TTAQ TQATAOGGAA GGTTCAAGTA 5760 

CAAGATCGAG G0CA6TATAT GTGCAG060C AGCAACCTGC AGGGCCTGGA CAGGATGGTG 5620 

GTCTtGCTTT 0G6TCACC6T GCAGCAACCT CAAATCCTAO OCTOCCACTA CGAGQAOQTC 5880 

ACTGTCTACC TOOOAGACAC CATTGCAATO GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940 

CAAATTTCCT GGATCTTCCC TGACAGOAGG GTGTGGCAAA CTGT6TCCCC CGTGGAQAGC 6000 

50 CGCATCACCC TGCAC6AAAA CCGGACCCTT TCCATCAAGG KQGOSTCCrr CTCAGACAGA 6060 

G60GTCTATA AGTGCSTG G C CAGCAATOCA GCOOGGGCOG ACAQGCTGQC CATOGQOCTO 6120 

GA08TG6060 CACTGCCCCC GGTTATGCAC CAOQAGAAGC TG6A6AACAT CTGGCT6GCC 6180 

OOGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCX:CCrGCC CAGOSTGOBC 6240 

TGGGTQCTCG GGGAOGGTAC CCAGATCOQC CCCTOQCAGT TCCTCCAOOG GAACTTOTTT 6300 

55 GTTTTCCCCA A0GG6A06CT CTACATCOGC AACCT06G6C CCAAGGACA6 CGGG06CTAT 6360 

GAGTGGSTGQ COGCCAAOCT GGTAGGCTGC G08CX3CAG6A OGGTGCAOCT OAAGGTGCAO 6420 

OSTGCAGCAG CCAA080G0Q CATCAOOGGC ACCTCC006C GGAGGAOGQA OGTCAGGTAC 6480 

GGAOGAACCC TCAAGCTGGA CT6CAGOGCC TCGGGGGACC CCTGGCCGOO CATCXTTCTCG 6540 

^ AQGCTGCOST CCAAGAGOAT GATOSAOQOO CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 

60 TTTGCCAATG GGACCCTGGT GGTQAAATCA 6TGA0GGACA AAQATGCGGG AGATTACCTO 6660 

TCGGTAGCIC GAAATAAGQT TGOIGATGAC TAOGIGOTQC TGAAAGTGGA TQTOGTGATG 6720 

AAACCG6CCA A6ATTGAACA CAAGGAGGAO AAOGACCACA AAGTCTTCTA GGGGGGTGAC 6780 

CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCC3UITCCCG AGATCTCCTO GAGCCTCCCA 6840 

GACGGGAGTC TGGTGAACTC CTTCATGCAG TOGGATGACA GOGGTGQACG CACCAAGCGC 6900 

65 TATGTCGTCT TCAACAATGG GACACTCTAC TTTAA06AAG TGGGGATQAG GQAOGAAGGA 6960 

OACTACACCT GCTTTGCTGA AAATCAGGTC GOGAAOGAOG AOATGAGAGT CAGAOTCAAO 7020 

GTGGTGACAG 06CC0GCCAC CATCOGGAAC AAGACTTACT T6GGGGTTCA 6GTGCCCTAT 7080 

GGAGACX3TGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 

TTGTCCCCAA CCAACAROGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 

70 GGCACrCTCC TTATTCAGAA AGCCCAGOGT TCTOACAOOG GCAACTACAC CTGCCTGGTC 7260 

A0GAACA60G CGGGAGAGGA TAGGAAGACG QTGTGGATTC AOGICAAOGT CCAGOCACCC 7320 

AAGATCAAOG GTAACCCCAA COCCATCACC AC06T00666 A6ATA6CAQC OQGGGQCAGT 7380 

CGQAAACTGA TTGACTGCAA AGCTGAAQGC ATCCCCACCC 06AGGGTGTT ATGGGCTTTT 7440 

CCCGAGGGTG TGGTTCTGCC A6CTCCATAC TATGGAAACC GGATCACTOT CCATGGCAAC 7500 

75 QOTTCCCTGQ ACATCAGGAG TTTGAGQAAG AG0GACTC06 TCC3U3CIGGT ATGCATGGCA 7560 

G6CAA06A0G GAGGGGAGGC GAGGTTQATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 

AAACCCATCT TOCAOGACCC GATCAGOGAG AAGATCACGO CCATOGOGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCCX3C GGGOACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 

GGCACOSATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGAOGGCATG 7800 

80 CTACaCATTA GOGGTCTCTC CTCGGTGGAC GCTQGGGCCT ACOGCTGOGT GGCC06CAAT 7860 

GCOOCIGGCC ACA06GAGAG GCTGGTCTCX: CTQAAGGTGO GACTOAAaCC AOAAOCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATO AATGOTQAGA CCCTGAAOCT CCCCTGCAOC 7980 

GCTCCaSGGG CTGGGCAGGO A06TTTCTGC TGGAOGCTCC GCAATGQCA7 GCATCTGOAO 8040 

GGCOCCCAAA CCCTGG6AGG O Sl Ti' C r CA T CTaOACAATO GCACOCTCAC GGTTOGTQAO 8100 
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GCCrCGGTGT TTGACAGGGG TAOCTATGTA TQCMSOKSGO AGA0G6AGTA OQGCCCTTOG BI60 

GTCACCAGCA TCCCOOTGAT TGTGATGGOC TATCCTCCCC GGATCACCAO 0GA6CGCACC 6320 

COtSGTCATCT ACACXXGGCC OGGGAACACr GTGAAACTGA ACTGCATGGC TATGGGQATT 8280 

CCCAAAGCTG ACATCAOGTG GQAGTTACOG GATAAGTOGC ATCTGAAGGC AaSGGTTCAQ 8340 

GCTOGTCTQT ATGGAAACAO ATTTCTTCAC CCCCAQ6QAT CACTGACCAT CCAGCAT6CC 8400 

ACACAGAGA6 ATCC C GGCTT CTACAAGTGC ATG6CAAAAA ACATTCTG66 CAGTGACTGC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAG6AACIX3 8520 

ACAACAAAGC GGGGTTTGTA AGGQAAGCCA GGTTGGGGAA TAOQAQCTCT TAAATAATGT 85B0 

GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TCAA(3TTGAG GTTGATCTTG ATCTACAATT 8640 

GTTGOGAAAA GGAAGCAAT6 CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTGTGTT TACATCATGC CAGGGOCTTC ATT C AG GG T O TCTGT G CTCT GACTGCAATT 8760 

TTTCTTCTTT T6CAAATGCC ACTOGACTGC CTTCATAAGC GTCCATAGGA TATCTGAG6A 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCX3VT TGAAGACX3CA 8880 

TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAQACTT TGTTCCAGAT TGACAAGTCA 8940 

TCTTTCA6TT ATTTCCTCTG TCACTTCAAA ACTOCAGCTT 6CXOUVTAAG GATTTAGAAC 9000 

CAQAOTGACT 6ATATATATA TATATATTTT AATTCAGA6T TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCCTTTCAA ATCAQAOGAT GAGACTAOAA GGAQAAATAC 'm'C l tJT C n' 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGQAA 9240 

ATATAATTTT AAAAAATTTC TCTCXaACCT CCTTCAAATT CAOTCACCAC TGTTATATTA 9300 

CCTTCTCCAO GAACCCTCCA GTGGGGAAGG CTGCQATATT AOATTTOCTT OTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGQAGAG 6AAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAQTCTTCC COQAAAAGCC CAGAAACTTC TCTGCAOTAT 9480 

CTGGCTTGTC CATCTQOTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 

ATACIGTACA TTTGATAATA AAATAATATT CTCCX3VAAAA AAAAA 9645 



Seq ID NO: C12 DNA Sequence 
30 Nucleic Acid Accession #: AK001903 
Coding sequence i none 

1 11 21 31 41 51 

1 I I 1 1 I 

TATCATGCAT GTGGGAAGGT GGGTGTGGTG AGAAAAGTTT TAAGGCAAGA GTAGATGGCC €0 

ATOTTCAACT TTACAAAATT TCTTGOAAAA CTGGCAQTAT TTTGAACTGC ATCTTCTTTO 120 

GTACG6GAAC CT6CAGAAAC A6TGTQAGAA ATTAAGTOCT GGTTCACTGC GCAGTAGCAA 180 

AGATG6TCAA GGCCATGGAA AAAQCAGAAA TTTACCAAGA AAGCTGATAC CCATGTATAG 240 

TTCCCACTCA TCTCAAATAC ATCT6CTATC TTTTTAAGCT AAGTCCTAGA CATATCQGGG 300 

40 ATA ACATG GG OGTTGATTAO TGACCACAGT TATCAGAAGC AGAGAAATGT AATTCCATAT 360 

TTTATTTGAA ACTTATTCCA TATTTTAATT GQATATTGAO TGATTGGGTT ATCAAACACC 420 

CACAAACTTT AATTTTGTTA AATTTATATG GCTTTGAAAT AGAAGTATAA GTTQCTACCA 480 

TTTTTTGATA ACATTGAAAG ATAGTATTTT ACCATCTTTA ATCATCTTGG AAAATACAAG 540 

TCCTGTQAAC AACCACTCTT TCACCTAGCA GCATGAGGCC AAAAGTAAAG GCTTTAAATT 600 

45 ATAACATATG GGATTCTTAG TAGTATGTTT TTTTCTTGAA ACTCAGTGQC TCTATCTAAC 660 

CTTACTATCT CCTCACTCTT TCTCTAAGAC TAAACTCTAG GCTCTTAAAA ATCTGCCCAC 720 

ACCAATCTTA GAAGCTCTGA AAAGAATTTG TCTTTAAATA TCTTTTAATA GTAACATGTA 780 

TTTTATGGAC CAAATTGACA TTTTCGACTA TTTTTTCCyuV AAAAGTCAGG TGAATTTCAG 840 

CACACTQAGT TGGGAATTTC TTATCCCAGA AGACCAACCA ATTTCATATT TATTTAAGAT 900 

5U TGATTCCATA CTCCX3TTTTC AAGGAGAATC CCTQCAGTCT CCTTAAAGGT AGAACAAATA 960 

CTTTCTATTT TTTTTTTCAC CATTGTGGGA TTGGACTTTA AQAGGTGACT CTAAAAAAAC 1020 

AGAGAACAAA TATGTCTCAG TTGTATTAAO CAOGGACCCA TATTATCATA TTCACTTAAA 1080 

AAAATQATTT CCTGTGCACC TTTTGGCAAC TTCTCTTTTC AATGTAGGGA AAAACTTAQT 1140 

CACCCTQAAA ACCCACAAAA TAAATAAAAC TTGTAGAT6T GGGCAGAAGG TTTGGGGGTQ 1200 

55 GACATPGTAT GTGTTTAAAT TAAACCCTGT ATCACTGAGA AGCTGTTGTA TGOGTCAGAQ 1260 

AAAAT6AAT0 CTTAQAAGCT GTTCACATCT TCAAGAGCAG AAGCAAACCA CATGTCTCAG 1320 

CTATATTATT ATTTATTTTT TATQCATAAA OTGAATCATT TCTTCTGTAT TAATTTCCAA 1380 

AGGGTTTTAC CCTCTATTTA AATGCTTTGA AAAACAGTGC ATTGACAATG GOTTGATATT 1440 

TTTCTTTAAA AGAAAAATAT AATTATGAAA GCCAAGATAA TCTGAAGCCT Q TT Tl ' Arm ' 1500 

OU AAAACTTTTT ATGTTCTGTG OTTGATGTTG ' iTrGTO - GlT TGTTTCTATT TTGTTGGnT 1560 

TTTACTTTGT TTTTTGTTTT GTTTTQTTTT GTTTTGCATA CTACATGCAG TTCTTTAACC 1620 

AATGTCTGTT TGGCTAATGT AATTAAAGTT GTTAATTTAT ATGAOTGCAT TTCAACTATG 1680 

TCAATGGTTT CTTAATATTT ATTGTGTAQA AGTACTGGTA ATTTTTTTAT TTACAATATG 1740 

- TTTAAAGAGA TAACAGTTTG ATATGTTTTC ATGTGTTTAT AGCAGAAGTT ATTTATTTCT 1800 

05 ATGGCATTCC AGCGGATATT TTGGTGTTTG OGAGGCATGC AGTCAATATT TTGTACaGTT 1860 

AGTGGACAGT ATTCAOCAAC GCCTGATAGC TTCITT66CC TTATGTTAAA TAAAAAGACC 1920 

TGTTTQGGAT GT 1932 

70 Seq ID NO: C13 Protein Sequence 

Nucleic Acid Accession #i Bos sequence 
Coding sequence t i . . 5001 

1 11 21 31 41 51 

75 I I 1 I 1 I 

ATGCCAGGCA CAAAACTAAC CCGAACA6GC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60 

ACCTCTCAAQ AGOACGAATT GQATGTACCT GAOGACATCA GCGTCCGGGT TATGTCATCT 120 

CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTOGAAA AACAGAAGAA AGTTGTTGCA 180 

TCAAGACAGT ACACOGTGCG CTAT06AGA0 AAGGGG6AAT TGGCCAGGTG GGATTATAAG 240 

CAGATC6CTA ACAGGGGTGT GCTQATTGA6 AACCTGATTC CAGACACIGT GTATGAATTT 300 

GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GOCAAATGGA GTACGTCAGT CTTCCAAAGA 360 

ACAGCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 420 

AAACCTACAO TTGTCGCTGC ATCTTGG6AT GOGCTACCAG AGACTGAGGG GAAAGTGAAA 480 

GTCTGTCTGC TGOACACAQO ACTGTTTTCA GTTTCCTCCT TOCAACCATC TGCCAAATCA 540 
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TTTCAGAATA CATTCTTTCA TACGCCCOGQ CTCTCAAACX: ATTTGGAGCA AAQT O CCfC A . 600 

CCTATCCTGQ AQACACTACT TCTGCCCTGO TGQATGGTCT GCAGCCTQGQ GAA06CTATC 660 

TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TOGCTATGCC AACAAQAATO CAGCTGTACC CAQAAC3GATT TCAGTTGTCT AGCTTACCTG 780 

ATCG ATATCC AAACCRAACA AGTTAATAAA QATCCACAAC TGQAAOOGAG TGTrTTrGOA 640 

CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGOGQCTT TTCCTTCATT 900 

ATGT6CTATQ AAGACCCANN TGTTTCTTCT TTOACAGOCA ATTCTTTAAA ATCTGTT6CA 960 

GCCAGTAAGG CGGATGTTCA GCAGAACAOG GAGGACAATO GGAAACCCGA AAAACCTGAG 1020 

CCTTCCTCAC CTTCTCCCAO AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 

CCCCAAGGGA GAAATGOCAA GGACCTTCTT CrrQACTTQA AGAAC3UUUIT ATTGGCTAAT 1140 

GGTQGGGCX3C CCGOAAAACC OCA6CTT08C GGCAAQAAOG CAGAGGA6CT GGATCTTCAO 1200 

TOGACAGAAA TCACTGQQ6A GQAGGUIOCTQ GQllX X ' CW U AGQACT06CC CATGTCACCC 1360 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCOGCCAA GTAGACACGG CCACTCGOTG 1320 

GTTGCTCCCG GCAQ GACT GC AGTGAGGGCC CGGATGCCAG CXSCTGCCXXG AACGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CXITOGCCAOQ CAGCCCCGCC CAGGGGCQCC CCCXTTCX^GCT 1440 

TOSQCCrCTC CTGCCCACCA GGOGTCCAGC CAGGGCACCT CTCATOBTCC TTCCCTGGCT 1500 

GCCA6CTTGA ATGACAA08A CTTGGTGOAC TCAGAOGAAG ATQAGGGOGC TGT6G6CTCC 1560 

CTCCACCCCA AGGGOGCCTT OGCCCAGCCC CGGCJCAGCCC TGTCCCOCAG COGCCAGTCC 1620 

CXCTCCAGCG TTCTCOQCXSA CAGAAGCTCT GTGCACCCCG GOGCAAAGCC AGCCTOGCCG 1680 

GOGCGGAGQA CCOCCCATTC AGGGGCOSCA GAGGAAOATT CCAGTGCCTC A6CCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGG6GATCA TCTGGGGTGC T6C0CACGCA GCCACAGCra 1800 

AGCTCTCCAC TTTCCAAGGG 0G6GAAG6AT GGTGA6GACG CCCCAGCCAC CAACTCCAAT 1860 

GCOCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTOOTC CAGGA06CAG 1920 

GTCTCTGTIGG GAGCGGAG6C TTCTGATGOT QAAAGCCAOG GTOACGGCQA TAQGQAAGAC 1980 

GQ06GAAGGC AGGCGGAGGC CA06GCCCAG ACGCTGOGGG CCOOGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCA6ACACAA A0CCTTT6CT GCCAACQGQA G0TCTCCAA6 CAGOTTCAGC 2100 

ATTGG6C6GG GACCT06GCT GCAGCCCTCC AGCTCCCOVC AGTGGACTGT GCCCTCCOQA 3160 

GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACOCTA AGCTTAGCTC AGGTATCCAT 2220 

GGAGACGAGG AGGATGAGAA GCOGCTTCCT GCCACCGTTG TCAATQACCA OGTGCCTTCC 22 BO 

TCCTCCAGGC AGCCCATCTC C0G66GCTGG 6AGGACTTAA GGAGAAGCGC GCAGA6AGG6 2340 

6CCAGGCTGC AT0GGAAG6A AOCCATGCCA GA6AA0C0CA AATCCACAGG GaCAGATACA 2400 

CATCCTCAGO 6CAAGTACTC CTGGCT66GC TCCAAG6CTC AGGATGTTCA ACAOAGCACA 2460 

GACGOGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GOTCCACAGA CCGCCAGGOG 2520 

TCCCCTGCTC GTCCTCCCGC AGCAOGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATQ 25 80 

ACACGOGGCC GGGCCCCAGA ACA6CAGCCC CCTCCTCC06 TCGCCACOTC CCAGCACCAC 2640 

CCGGGACCCC AGAGCAQAGA 060GGQTC8G TCACCTTCCC AGCCCAGOCT CTCACTGACC 2700 

GAGGCCGOGC QQCC C C3C0C CACGT08CAG G0C06CTCCC ACTOCTCCTC OGACOCTTAC 3760 

AOGGCGAGCr CCAGA6GGAT GCTCCCGACO GCCCTCCAGA ACCAGGACGA GOATGCCCAG 2820 

G0CAGCTA06 A06ACQACA6 CACAGAAGTC GAGGCCCAGG ATGTGOGGGC CCCCGCGCAC 2 880 

GCCGCGOGOO CCAAGGAGGC AGCTQOQTGC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 

ACAGG06CA6 GGGCAGQTGG C6ACCACAG0 TCCCAGG6G8 GACATGCGGC CTCCCC08CC 3000 

AGGCCCAGGC GACCOGOCOO CCCCCAGTCC 0QG6CCCGGG TCCCCAGCAG GGCAGOGCCG 3060 

GGOAAGTCGG AGCCTCCTTC CAAGCGGCCC CPGTCCTCCA AQTCCCAGCA GTCGGTCTCA 3120 

GCCQAGGACQ AOQAaOAGGA GGACGCGGGG TTTTTTAAAG GOGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTQ GCCCTCTTCC TCCACTCCCA GGGGCG6CAA AOACGCOGAT 3240 

GGQAGCCTCQ GCAAGGAAGA GAGGGAGCCT GCCATOGCGC TT6CC0CTGG OGGAGGGAGC 3300 

CTGGCTCCTO TGAA6C6ACC TCTCCCCCCA CCTCCftGGCA GCTCCCCCAG G6CCTGCCAC 3360 

GTCCCTTCCC GACOGCCGOC TOQCAGOGCT GCCACCGTOA GCCCCQTOGC GG6CACCCAC 3420 

CCCTGGCCGC OOTACACCAC GCGCGCCCCV CCTGQCCACT TCTCCACCAC CCOGATGCTG 3480 

TCCTTG06CC AGAGGATGAT GCATGCCAGA TTCC8TAACC CTCTCTCCOQ ACAGCCTGGC 3540 

AGACCCTCTT ACAOACAAOG TTATAAT6GC AGACCAAATG TA6AAGG6AA AGTCCTTGCT 3600 

GGTAGTAATO GAAAACOGAA TGGAGAGAGA ATTATGAATQ GCCCTCAAGG AACAAAGT6G 3660 

QTTGTOQACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAA6GTACCT CCAAGATTCA 3720 

CATGGAAATC CTCTTCGOAT TAAACTAGGA GGAOATGGTC GAACCATTGT AGATCTGGAA 3780 

OQGACCCCCG TGGTGAGTCC TGA0G6CCTC CCACTCTTTG GGCAGGGGC6 ACATG6CAGA 3640 

CCTCTGGCCA ATQCCCAA6A TAAGCCAATT TTGAGTCTTO 6AGGAAA6CC GC1G6TGGGC 3900 

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATOCAOCC CACCACTACT 3960 

AOGAOGCCCC TOCCTACCAC TACAACCCOG AGGCOCACCA CT6CCA0CAC CATGCAGCCC 4020 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACOQAQGC CCACCACTGC CACCACCCQC 4080 

OQCAOQACCA CCAGG06TCC AACAACCACA GTC06AACCA CTACGOQGAC AACCACCACC 4140 

ACCAOCOCCA AACCCACCAC TCCCATCGCC AOCTSTOOCC CTGOQAOCTT GGAACGGCAC 4200 

OACGATGATO GCAACCTQAT AATGA6CTCC AATGG6ATCC CAGAGTGCTA OGCTGAAGAA 4260 

GATGAGTTCT CAGGCTTGGA GACT6ACACT GCAGTAOCTA CGGAAGAG6C CTAOGTTATA 4320 

TATGATGAAG ATTATGAATT TGAGA06TCA AGGCCACCAA CCACCACTGA GOCTTCGACC 4380 

ACTGCTACCA CACOQAGGGT GATGCCAGAO GAAGGGGGCA TCAGTTCCTT TCCTGAAGAA 4440 

GAATTIGATC TG6CTG6AAG GAAAOGATTT G Tr G CTCClT ACGTGAGGTA CCIAAATAAA 4500 

GACCCATCAO CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 

GATGAAATCA TCCCCAATGA CCTGAA6AAG AGTQATCTQC CTCCCCAGCA TGCTCCCCGC 4620 

AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTG6A TTGGGACAAA 4680 

6CCACCCCAG GA0ATTTG6T CACAGGTTAT TTGGTTTACA GXGCATCCTA TGAAGATTTC 4740 

ATCAGGAACA AlOTTTTCCAC TCAAGCTrCA TCAOTAACTC ACTTQCCCAT TOAGAAOCTA 4800 

AA6CCCAACA CGAGGTATTA TTTTAAAOTG GAA6CACAAA ATCCTCATGG CTACGGACCT 4860 

ATCAGCCCTT OGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TQTGAGGCCC 4920 

CCAGGGGGTG AGCTATCTGG ATCCCATTCG CTTTCAAACA TGATCCCAGC TACAOGGACT 4980 

GCCATGGACG GCAATATGTG AAGOQCAOGT GGTATCOAAA OTTCGTGGGA UTltnTCYri ' 5040 

GTAATTCACT GAGGTATAAA ATCTACCTCA 6TGACAACCT GAAAGATACA TTCTACAGCA 5100 

TTGQAGACAG CTGGGQAAGA GOTQAAGACC ATT6CCAATT TGT6GATTCA CAC C TT G ATG 5160 

GAAQAACAQQ GCCTCAGTCC TATQTAQAAG CCCTCCCTAC TATTCAAGOC TACTATC6CC 5220 

AGTATCOTCA GGAGCCTOTC AGGTTTGGGA ACATOGGCTT CGGAACCCCC TACTACTATG 5280 

TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTOaAAAOTQ QTAATGACAG GACCGTCATQ 5340 

CT6CAA0CTT GCCCTGCCCA GCOOCACCAA CTAAOTCGCA CTAGGGGCTG TGAGCAAAGA 5400 

CAGCCA6CAT GCTCAGCOCC GCTGCCCTAO OTOOCnOGAA OSTGACAGAT GGACACTGGC 5460 

CATTCIGGTC ATCTCAGTCT GQAACTCAOT CCCACTTCTT 060CTG6ACA ATOAACAOQA 5520 

TTCAGTTTTG CTGTTAACTT T6CTTCTCTA ClVrmTlti TTTGTTT8TA ATAGCACATC 5560 
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CCAGAGACAT CACAAACCAG CAACTGATTC AGTGTGATTT CCCAGACTTT TTAQGCATCA 5640 

AATTCXK3ACA CTTCAG TATT TCCAGGAATA GCATATGCAC GCTOTTCTTQ CTTCATOGAA 5700 

TGCTACATGC TTTClXmTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 5760 

TCAGQTCCCT TTGTATOCAS TAGAAAOGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 5820 

ATATCCTACT TOAAATTTAC TCTATGGACT TACCCACTQC TAGAATAAAT GTATCAAATC 5880 

TTATTTGTAA ATTCTCAATT TTOATATATA TATGTATATA TGCA7ATACA TATCCACACT 5940 

TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTOTACT TGTTCAOCaA AAAAAAAAAA 6000 
AAAAAAA 



Seq ID NO: CI 4 DMA Sequence 
Nucleic Acid Accession «: NM_003014 
Coding sequence* 238.. 1278 ~ 



Seq ID NOt 015 DNA Sequence 
Nucleic Acid Accession #: MM_005940 
Coding sequence: 23.. 1489 " 



6007 



1 11 21 31 41 51 

1 ) I I I I 

GGCGGGTTCG CGCCCCGAAG GCTQAGAGCT GGCGCTGCTC 6TGCCCTC3T8 TGCCAGAOGG 60 

CGGAQCTCCG CGGCCGGACC CCGCG6CCCC GCTTTGCTGC 0GACT6QAGT TTOOOGGAAG 120 

AAACTCTCCT GCGCCCCAGA AQATTTCTTC CTC6G0GAAO GGACAGCGAA AGATGAGGGT 180 

GGCAGGAAGA GAAGGCQCTT TCTGTCTGCC GGGGTCGCAQ CGCGAQAGQG CAGTGCCATG 240 

ZU TTCCTCTCCA TCCTA0TG6C OCTGTGCCTO TGGCTGCACC TGGCGCTGGG CGTGCGCOOC 300 

GOQCCCTGCG AGGCGGT6CG CATCCCTATQ TGCCQGCACA TG0CCTG6AA CATCA06CG0 360 

ATGCCCAACC ACCTGCACCA CAOCAOGCAG GAQAAOGCCA TCCTGGCCAT COAOCAGTAC 420 

GAGGAGCTGG TGGACGTGAA CTGCAGOGCC OTGCTGCGCT TCTTCTTCTG TGCCATGTAC 480 

GCQCCCATTT GCACCCTQGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 540 

ZD CAACGCGOGC GOGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCOGAA 600 

A GCCTG GCCT OOQACGAGCT GCCTGTCTAT GA006TG6C6 TGTGCATTTC GCCIOAAGCC 660 

ATCGTCA066 ACCTCCCX3GA 6GATGTTAA0 TG6ATAGACA TCACACCAGA CATOATCOTA 720 

CAGGAAAGGC CTCTTGATGT TGACTGTAAA OGCCTAAQCC CCQATCGGTG CAAQTGTAAA 780 

AA6GTGAAGC CAACTTTGGC AACQTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840 

DU AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATOTAAAA 900 

GAGATCTTCA AOTOC TCATC ACCCATCCCT OQAACTCAAO TCCOGCTCAT TACAAATTCT 960 

TCTTGCCAGT GTCCACACAT CCT6CCCCAT CAAOATOTTC TCATCATGTO TTACGAGT6G 1020 

GOTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATOGAGAGA TCAGCTTAGT 1080 

^ AAAAGATOCA TACAGTGGGA AQAOAGGCTG CAGGAACAGC GGAGAACAGT TCAGQACAAG 1140 

J J AAGAAAACAG COGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200 

GCTCCCAAAC CAGCCAiGTCC CAAOAAGAAC ATTAAAACTA GGAGT6C0CA GAAGAGAACA 1260 

AACCCGAAAA GAGTGTGAGC TAACTA6TTT CCAAAGOQQA GACTTCGBAC TTCCTTAGAG 1320 

GATGAGGCTG QGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 1380 

A(\ ACTCACTGCA QTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 1440 

4U 6TTTTTCTTT QTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT 1500 

6AGTTAAAGC TGOTGOAAAA GGCTTATTOC ATTGCATTCA GAGTAACCTG TOTGCATACT 1560 

CTAOAAGRGT AGOGAAAATA ATOCTTGTTA CAATTOQACC TAATAT0T6C ATTGTAAAAT 1620 

AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT TACCTTTTQA 1680 

TATCTGTTGT TGCAATGTTA GTQATGTTTT AAAATGTQAT GAAAATATAA TGTTTTTAAQ 1740 

AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT QTTTATGGTC TGCAGAAGGA 1800 

TTm'Gl'GAT GAAAQGGQAT TTTTTGAAAA ATTAGAGAA6 TAGCATATGG AAAATTATAA 1860 

TGTGTTTTTT TACCAATGAC TTCAGTrPCT GTTTTTAGCT AOAAACTTAA AAACAAAAAT 1920 

AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTTTTTTa 1980 

GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 2040 

jU A CAGTG AGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 2100 

ATTTTATACC CACAAGAGAG GTAT6TCACT CATCTTACTT OCCAOOACAT CCACCCTOAG 2160 

AATAATTTOA CAAGCTTAAA AAT6GCCTTC ATGT6AGTGC CAAATTTTOT TTTTCTTCAT 2220 

TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG TACAGAGAGG 2280 

AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAQ TTGGGATACT TTAATCAGAA 2340 

AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 2400 

ATTTATTTTA AAAAACAATT TTATTGGOCT TTTGCTAACA CAOTAAGCAT GIATTTTATA 2460 

AGGCATTCAA TAAATOCACA ACX3GCCAAAG GAAATAAAAT CCT A TCTAAT OCTACTCTCC 2520 

ACTACACAGA GGTAATCACT ATTAOTATTT TGOGATATTA TTCTCCAGGT GTTTGCTTAT 2580 

GCACTTATAA AATGATTTGA ACAAATAAAA CTAGQAACCT GTATACATGT GTTTCATAAC 2640 

OU CT6CCTCCTT TGCTTGGCCC TTTATTGAOA TAAGTTTTCC T6TCAAGAAA GCAGAAACCA 2700 

TCTCATTTCT AACAGCTGTG TTATATTGCA TAGTATGCAT TACTCAACAA ACT G TTQTOC 2760 

TATTGGATAC TTAGGTGOTT TCTTCACTGA GAATACTGAA TAAACATCTC ACOGGAATTC 2820 



1 11 21 31 41 51 

7n I i I I I I 

/U AAOCOCAOCA GCCCCGGGGC GGATGGCTCC GGCOGCCTGO CTCCQCAGOO CGGCCGCGOG 60 

0 6CCCT0 CTG CCCCC6AT6C TGCTGCTGCT GCTGCA6GGG CG6COQCT6C TGGCCOGGQC 120 

TCTOCOGCOG GAGGTCCACC ACCTCCATGC 06AGAGQAGG GGGCCACAGC CCTGGCATGC 180 

AGCCCTGCCC AGTAGCCOGG CACCTGCCCC TGCCAOGCAG GAAGCCCCCC GGCCT6CCAG 240 

CAGCCTCAGG CCTCCCCGCT OTGGOGTGCC CGACCCATCT GATGGQCTGA GTGCCCGCAA 300 

/J CCGACAQAAQ AGQTTCQTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360 

GATCCTTOGG TTCCCATGGC AGTTGGT6CA GGAGCA66TG GGGGAOAaSA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCQATG TOACGOCACT CACCTTTACT GAGGTOCACG AGGGCOGTGC 480 

TGACATCATG ATCGACTTOG CCAGGTACTG GCATGOGGAC GACCTGCCGT TTGATQGGCC 540 

TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAQACTCAC CGAGAAGGGG ATGTCCACTT 600 

OU OGACTATGAT GAGACCTGGA CTATOJGGGA TGACCAGGGC ACAOACCTGC TGCAOOTGGC 660 

AGCOC ATOA A TTTGGCCACG TGCTGOGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720 

GTCOOCCTTC TACACCTTTC GCTACCCACT GAGTCTCAOC CCAOATQACT GC3U3GGOGOT 780 

TCAACAOCTA TATGGCCAOC CCTOGCCCAC TOTCAOCTCC AGGACCOCAa CCCTGGGCCC 840 

CCAG6CTGG0 ATAGACACCA ATGAGATTOC ACCGCTGGAO CCAOACSGCCC COCCAGATGC 900 
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CTG TGAGGO C TCCTTTGAOS OGGTCTCCAC CATCC3GAGGC 6AGCTCTTTT TCTTCAAAGC 960 

GGOCTTTQTG T6G06CCTCC GTGGGGGCCA 6CT6CAGCCC GGCTACCO^O CATTGQCCTC 1020 

TCGCCACTGQ CAOGOACTGC CCAGCCCTGT GGAOGCTGCC TTCGWSGATG CCCAOGGCCA 1080 

CATTTGGTTC TTCX:AAGGTG CTCAGTACTG GGTCTACQAC GGTGAAAAGC CAGTCCTGGO 1140 

CCCCQCACXX: CTCACOSAGC TGGGCCTGGT GAGGTTCCCX3 GTCCATGCTG CCTTGGTCTG 1200 

OGGTCCOSAG AAGAACAAOA TCTACTTCTT OCGAOGCAGO QACTACTGGC GTTTCCAGCC 1260 

CAGCACCOOG G6TGTAGACA GTC C OGTGCC C0GCAGG6CC ACT6ACT6GA QAGGGGTQCG 1320 

CTCTGA QATC QAOGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGC3GCGGCCG 1380 

CCTCTACTGG AAGTTTGACC CrGTCSAAGOT GAAGaCTCTO QAAGGCTTCX: CCCGTCTCGT 1440 

GGOTCCTGAC TTCTTTGGCT GTGCOCSAGCC TGCCAACACT TTCCTCTGAC CATCGCTTGO 1500 

AT GOOCTCAG GGGTGCTGAC COCTOOCAGO CXAOQAATAT CAGQCTAGAG ACCCATCaCC 1560 

ATCTTTGTGO CTGTGGGCAC CAGGCATGGG ACTQAGCCCA TX5TCTCCT0C AGGGGGATGG 1620 

GGTGGGGTAC AACCACCATG ACAACTGCOS GOAGGGCXSVC GCAGGTCGTG GTCACCTQCC 1680 

AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATQAC TTAAOAGOAA GGGCAGTCTT 1740 

1 J 6QGACCCGCT AT6CAGQTCC TGGCAAACXrT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1800 

6TAGCACCAT G6CAGGACTG GGGGAACT6G AGTGTCCITG CTGTATOCCT 6TTGTGAGGT 1860 

TCCTTCCAGO GGCTGGCACT 6AA6CAAG6G T6CTGG060C CCATGGCCTT GAGCCCTGGC 1920 

TGAGCAACTG GGCTGTAGGQ CAGGGCCACT TCCTQAGGTC AGGTCTTGGT AGGTGCCTGC 1980 

ATCTGTCTGC CTTCTGGCTQ ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 

GTTCACAGTC AAATGGGQA6 G6GTATTCTT CATGCAGGAG ACCCCA66CC CTGGAGGCT6 2100 

CAACATAGCT CAATCCTGTC CCAGGCOGGA TCCTCCIQAA QCOCTTTTOO CAOCACTGCT 2160 

ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AOTGTGTATA AAOCTTCTTC TTCTTTTTTT 2220 

TTTTTAAACT GAGGATTGTC ATTAAACACA G TT GT TT T C l ' 2260 

25 Seq ID NO: CI 6 DNA Sequence 

Nucleic Acid Accession fit KM_024022 
Coding Beguencei 202.. 1563 ~ 

OA ^ 21 31 41 51 

30 I 1 I I I 1 

A00QG6CACC GQA06GCTCQ GGTACTTTCS TTCTTAATTA OGTCATGCOC GTGTGAGCCA 60 

GQAAAGGGCT GTOTTTATOO GAAGCCAQTA ACACTOTGGC CTACTATCTC TTCCGTGQTO 120 

CXyVTCTACAT TTTTGGGACT OGGGAATTAT GAGGTAGAGG TGGAGGOSGA GCCXWATGTC 180 

AGAGGTCCTQ AAATAGTCAC CATGGGGQAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CXXMTTTTGG CCTTGATGAT TTGAAAATAA 6TCCTQTTGC ACCAQATGCA 300 

GATOCTCTTa CTQCACAOAT CCTGICACTG CTGOCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTQGTC TGGGCATCCA CTTCGACTGC 420 

TCAOGGAAGT ACAGATGTOQ CTCATCCTTT AAGTGTATCX3 AGCTQATAGC TCGATGTQAC 480 

GGAGTCTCGG ATTGCAAAQA OGGGGAQQAC GAOTACOGCT GTGTCOaGGT GGGTGGTCAG 540 

AA TQCCG TGC TCCAGGT6TT CACAGCTGCT TOGTGGAAGA CCATOTGCTC OGATGACTGG 600 

AAGGGICACT AOSCAAATOT T6CCTGTGGC CAACTGGGTT TCCCAAGCTA TGTGAQTTCA 660 

OATAAOCTCA GAGTGAGCTC GCTGGACGGG CAGTTCOGGG AGGAOTTTOT GTCCATCX5AT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACXyvCT CAGTATATGT GAGGGAGGGA 7B0 

TOTGCCTCTQ GCCACGTGGT TACCTTGCAG TGCACAGCCT OTQGTCATAG AAGGGGCTAC 840 

AGCTC3V00CA TCX3TGGGTGG AAACATGTCC TTGCTCTOGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAOTTCC AGGGCTACCA CCTGTGCGQO GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACT GCT0C 3VC ACTQT GTTTA TGACTTGTAC CTCCCCAAOT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGQACAA TCX3W3CCCCA TCCCACTTGG TXMAGAAGAT TGTCTAOCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATOQCCC TTATGAAGCT GGC066GCCA 1140 

CTCAOGTTCA ATGAAATGAT CCAGCCTGTG TGCCT6CCCA ACTCTGAAGA GAACTTCCCC 1200 

GATOGAAAAG TGTGCTGGAC GTCAGGATGG GGGOCC3VCAG AGGATGGAGG TGACGCXTTCC 1260 

CCTOTCCTGA ACCAC3G0GGC OSTCCCTTTQ ATTTOCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTAOGGTG GCATCATCTC CXXCTCCATG CTCTOOQCGG GCTACCTGAC GGGTGGOQTO 1380 

GACAGCTGGC AGGGGGACAG OGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGQAAO 1440 

J J TTAGTGGGAG CGACCAGCTT TOGCATCGGC TQOQC3U3AGG TGAACAAGCC TGGGGTGTAC ISOO 

ACCCOTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TOaAGAQAGA CCTAAAAACC 1560 

TQAAGAGGAA GOGGACAAQT AGCCAOCTQA GTTCCTGAGG TGATGAAGAC AGCC06ATCC 1620 

TCCCXTPGGAC TCCCGTGTAG GAACCTGCAC AOQAGCAQAC ACXXTTOOAG CTCTGAGTTC 1680 

CGGCACCAOT AGCAGGCCOG AAAGAGGCAC CCTTCCATCT GATTCCAGCA GAACCTTOVA 1740 

OU GCTGCTTTTT GTTTTTTGTT. TTrrrOAGGT GQAQTCTOGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGG06A AATCCCTGCT CACTGCAGCC TCCOCTTCCC TGGTTCAAGC GATTCTCTPG 1860 

CCTCAGCTTC CCCAGTAQCT GGGACXaCAO GTGCCOGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCAOCATG TTGGCCAGGC TGCTCTCAAA COCClOAOCT 1980 

^ CAAATGATGT GCCTGCTTCA GCCTCCCACA OTGCTGGGAT TACAGGCATG GGCCACCAOS 2040 

OD CCTAGCCTCA CGCTCCTTTC TQATCTTCAC TAAGAACAAA AGAAGCAOCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGOTTTT CTCTCCAGGQ GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCAOGTGCAA AQCCACCAAC AGCCACTCAG AAAAGAOSCA 2220 

CCAGCCCAGA AGTGCAGAAC TXSCAGTCACT GCACX^TTTTC ATCTCTAG6G ACCAGAACCA 2280 

AAC CCACC CT TTCTACTTCC AAGACTTATT TTCACATQTO GGGAOGTTAA TCTAGGAATQ 2340 

/U A CTCGT TTAA GGCCTATTTT CATGATTTCT TT6TA0CATT TGOTGCTTGA OGTATTATTO 2400 

TCCTTTGATT OCAAATAATA TGTTrCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 

75 
80 



35 
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Seq ID NO: CI 7 DNA Sequence 
Nucleic Acid Accession |t MM_003220 
Coding sequences 63.. 1376 " 

1 U 21 31 41 51 

i 1 1 I 1 I 

GAATTCOGGC TCTCTGGGTG AGAGACOGAG AGGGGCATAT COGTTCACGC CGATCCATGA 60 

AAATGCTTTG GAAATTQACG GATAATATCA AGTACGAGQA CTOOGAGGAC CGTCACGACG 120 

GCACCA6CAA OGGGACGGCA OGGTTGCCCC AGCTGGGCAC IXSTAGGTCAA TCTCCCTACA 180 

G6A606CGCC GCCGCTGrOC CACACCCCCA ATGCOCSACTT GCAGOCOCCA TACTTCCCCC 240 

CACOCTAGCA GCCTATCTAC OCCCAGTCGC AAQATCCTTA CTOCCACQTC AACGACCCCT 300 
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ACA60CTGAA OOOOCTGO^C GCCCAOCOSC AGCXSQGAGCA CCCAGGCT6G CCGGSOCAGA 360 

GGCAGAGCCA GGAGTCTCSGG CTCCTGCACA OGCACCGOGG GCTGCCTCAC CAGCTOTCJGO 420 

GCCTGGATCC T0GCAGGC5AC TACAGGOGGC AOSAGGACCT CCTGCAOGGC CCACACGCGC 480 

TCAGCTCAGG ACTOGGAOAC CTCTCGATCC ACTCCTTACC TCACGCCATC QAGGAGC3TCX: 540 

06CATGTAGA AGACCGOGGT ATTAACATCC CAGATCAAAC TGTAATTAAQ AAAG6CCCCG 600 

TGTCCCIGTC CAAOTGCAAC AgCAATGGOO TCTCC3QCCAT CCCTATTAAC AAGGACAACC 660 

TCTTCGGOGG OGTGGTGAAC CCOUIOGAAG TCTTCIGTTC AGTTCCGGGT CGCCTCTOJC 720 

TCCTCAGCTC CACCTGOAAG TACAAGGTCA OGGTGGCGGA AGTGCAGCGG CGGCTCTCAC 7 80 

CACCOGAGTG TCTCAACX3CG TCGCTGCTGG QOGGAGTGCT COGGAGGGCG AAGTCTAAAA 840 

ATX9GAGGAAG ATCTTTAA6A GAAAAACTCG ACAAAATAOG ATTAAATCTG CCTGCAGGGA 900 

GAOOIAAAGC TOCCAAOGTT ACGCTGCTCA CATCACTA6T AGACGQAOAA GCTOTGCACC 960 

TAGCaVGGGA CTTTGGGTAC GTGTGCGAAA CCGAATTTCC T6GCAAAGCA GTA6CTGAAT 102O 

TTCTCAACOG ACAACATTCC GATCCCAATG AGCAAGTGAC AAOAAAAAAC ATGCTCCTGG 1080 

CTACAAAACA GATATGCAAA QAGTTCACOG ACCTGCTGGC TCAGGACCGA TCTCCCCTGG 1140 

G6AACTGA00 GCCCAACCCC ATCCTGGAGC COGGCATCCA GAGCTGCTTG ACCCACTTCA 1200 

ACCTCATCTC CCAC66CTTC GGCAGCCC09 GOGTOTaTGC 06G0GTCA0G QCOCTaCAOA 1260 

ACTATCrCAC GGAG6CCCTC AAGGCCATGG ACAAAAT6TA CCTCAGCAAC AACCCCAACA 1320 

GCCACAOGQA CAACAAOGCC AAAAGCAGTG ACAAAGAGGA GAAGCACAGA AA6TGAGGCT 1380 

CTCCTCOCGC CCCGCCCCTC CCACGCCTCA CCAGOCCCCC GCGCXSCCCAC CXTTCCGGCGG 1440 

2U GTGACAGCTC 0GGGATCA6C AACCCTTCCT OCTGCTGCTA CTGCTGCT6C TGCTGCCX3CC 1500 

GCOGCOQOOQ OOSCTGCCCT TGGOTCCOCC OBAOTCTOO Q OQA C T G COCT CTCX3ACTGTC 1560 

AGT6GQ6CA0 CCTCTCOGAC TCT6CACC00 CCTCGACCTC CCCACCC3GCT CXXACACCCC 1620 

TGTGCCCCXX3 GAATTC 1636 
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Seq ID NO: CI 8 DNA Sequence 
Nucleic Acid Accession #a NM_002988 
Coding sequence t 7X..340 



1 11 21 31 41 51 

30 I I I I I I 

COGQCAOQAG AGGAGTTGT G AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60 

GCCCAGCATC ATGAAGGGCC TT6CAGCTGC C C TC CTTGTC CTOGTCTOCA CCATGOCCCT 120 

CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TQCCTOGTCT ATACCTCCTG 180 

GCAQATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 

AGGTOTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300 

GOTCCAOAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 

GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATOGCCCTQ 420 

CCAC CCTGGA GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 

CTTTAATCTT AQTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT OTTGAGCTGC 540 

ATTATGAAAT TAQTATTTTC TCTGACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 600 

TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC A0T8TGATTA GCTTTCTCAG 660 

CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT QAATGGTTTT GCTCAGCACC 720 

ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AAOCAGCATT CTCACTGTGA 780 

AAAAAAAAAA AAAAAAAAAA AAA 803 



Seq ID NO J CI 9 DNA Sequence 
Nucleic Acid Accession #i NM_004063 
Coding sequence! 121.. 2619 ' 



1 11 21 31 41 51 

I I I I I I 

AGGGAGTGTT CCCGGGQQAQ ATACTCCAGT CGTAGCAAGA 6TCTCQACCA CTGAATGGAA 60 

GAAAA6GACT TTTAACCACC ATTTTGTGAC TTACAGAAAG GAATrTOAAT AAAGAAAACT 120 

ATGATACTTC AGGCCCATCT TCACTCCCTG TGTCTTCTTA TGCTTTATTT GGCAACTGGA 180 

TATOOCCAAG AGGGGAAGTT TAQTGGACCC CTQAAACCCA TGACATTTTC TATTTATGAA 240 

GGCCAAGAAC CGAGTCAAAT TATATTCCAG TTTAAGGCCA ATCCTCCTGC TGTQACTTTT 300 

QAACTAACTG GGGA6ACAGA CAACATATTT GTGATA6AAC GG6AGGGACT TCTGTATTAC 360 

AACAGAGCCT TGGACAGGGA AACAAGATCT ACTCACAATC TOCAGOTTQC AGCCCTQQAC 420 

GCTAATGOAA TTATAGTGGA GGGTCCAGTC OCTATCACCA TA6AAG1X3AA GGACATCAAC 480 

GACAATC6AC CCAOQTTTCT CCAGTCAAA6 TA06AAGGCT CAOTAAGQCA GAACTCTCGC 540 

CCAGGAAAGC CCTTCTTGTA TGTCAATGCC ACAGACCTGG ATGATCCOOC CACTCCCAAT 600 

GGCCAQCTTT ATTACCAOAT TXTTCATCCAG CTTCCCATGA TCAACAATGT CATGTACTTT 660 

CAGATCAACA ACAAAACGGG AGCCATCTCT CTTAOCOGAG AOGGATCTCA GGAATTGAAT 720 

CCTGCTAAGA ATCCTTCCTA TAATCTGGTO ATCTGAGTOA AGGACATGGG AGGCCAGAOT 780 

GA6AATTCCT TCAGTGATAC CACATCTOTO QATATCATAG TGACA6AGAA TATTTGGAAA 840 

6CACCAAAAC CrGTGQAGAT GGTQGAAAAC TCAACTGATC CTCACCOCAT CAAAATCACT 900 

CAGGTGCGGT G6AATQATCC 06GT0CACAA TATTCCTTAO TTGACAAA6A 6AAGCTGCCA 960 

AOATTCCCAT TTTCAATTGA GCAGGAAGGA GATATTTAOQ TGACTCAGOC CTTGGACCQA 1020 

GAAGAAAAGQ AT6CATAT6T TTTTTATOCA QTTGCAAAGQ AT6AGTACGG AAAACCACTT 1060 

/O TCATATCCGC TGGAAATTCA TGTAAAAGTT AAAGATATTA ATGATAATOC ACCTACATGT 1140 

CCGTCACCAG TAAOCGTATT TGAGGTCCAG GAGAATGAAC QACTGGGTAA CAGTATCGGG 1200 

ACCCTTACTG CACATQACAO GGAT6AAQAA AATACTGOCA ACAGTTTTCT AAACTACAGG 1260 

ATTQTGGAGC AAACTCCCAA ACTTOCCATG GATGGACTCT TOCTAATCCA AAOCTATGCT 1320 

GGAAT6TTAC AOTTAGCTAA ACAGTCCTTG AA6AA6CAA0 ATACTCCTCA GTACAACTTA 1380 

AOQATAGAGG TGTCTOACAA AGATTTCAAG ACCCTTTGTT TTGTGCAAAT CAAOGTTATT 1440 

GATATCAATG ATCAGATCCC CATCTTTGAA AAATCAGATT ATGGAAACCT GACTCTTGCT 1500 

GAAGACACAA ACATTGGQTC CACCATCTTA ACCATCCAGO CCACTGATGC TGATGAOCCA 1560 

TTTACTG6GA GTTCXAAAAT TCTGTATCAT AICATAAAGQ QAGACABTGA GGGAC3GGCTG 1620 

GGGGTTGACA CAGATCCCCA TACCAACACC GGATATGTCA TAATTAAAAA GCCTCTTQAT 1680 

TTTGAAACAG CAGCTGTTTC CAAC AT TGTG TTCAAAGCAG AAAATCCTGA GCCTCTAGTO 1740 

TTTOGTGTGA AGTACAATGC AAOTTCTTTT GCCAAGTTCA COCTTATTGT GACAGATGTG 1800 

AA7GAA6CAC CTCAATTTTC CCAACAOGTA TTCCAAGOGA AA6TCAGTGA GGATGTAGCT 1860 

ATAGGCACTA AAGTGGGCAA TGTGACTGOC AAGGATCCAO AAG6TCT6GA CATAAGCTAT 1920 

TCACTGAGGG GAQACACAAO AGGTTGQCTT AAAATTQACC AGGTGACTGO TGAGATCTTT 1960 
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AGTGTQOCTC CATTGGACA6 A6AAGC0GGA AGTCCATATC GGQTACAAGT GGT60CCACA 2040 

GAAfiTAGGGG GglCnwrT GAGCTCTOTO TCROAGTTCC ACCTGATOCT TATCGATGTX5 2100 

AATGACAACC CTCCCAGGCT AGCCAAGQAC TACACGGGCT Isyri ' ClT C lti CCATCCCCTC 2160 

AGTGC ACCTG GAAGTCTCAT TTTCGAGGCT ACTGATGATG ATCAGCACTT ATTTC GG GGT 2220 

D CCCCATrTTA CATTTTCCCT OGaCAGTGGA AGCTTACAAA AOGACTOOOA AGTTTCCAAA 2280 

ATCAATGOTA CTCATGCCGG ACTGTCTACC AOGGACACAG AGTTTGAGQA GAQGGAGTAT 2340 

GTOGTCTTGA TC06CATCAA TQATGGGGGT CGOOCACOCT TGGAAGGCAT TGrPTCTTTA 2400 

CCAGTTACAT TCTGCAGTTQ TGT GG AAGGA AGTTGTTTCC GGCCAGCAGQ TCACCAGACT 2460 

GGGATACCCA CTGTGGOCAT GGCAGTTGGT ATACTGCTGA CCACCCTTCT GOTQATTGGT 2520 

lU ATAATTTTA6 CAGTTCTGTT TATCC6CATA AAGAAGGA7A AAGGCAAAGA TAATGTTGAA 2580 

AQTQ CTCAAG CATCTGAAlOT CAAACCTCTG AGAA6CT6AA TTTGAAAAGG AATQTTT6AA 2640 

TTTATATAGC AAGTOC TATT TCAGCAACAA CXtATCTCATC CTATTACTTT TCATCTAACG 2700 

TOCATTA TAA riTm'AAAC AGATATTCCC TCTTGTCCTT TAATATTTGC TAAATATTTC 2760 

^ - TTTTTTGAGG TGGAGTCTTO CTCTGTOGCC CAGGCTGGAG TACAGTGGTG TGATCCCAGC 2820 

TCACTGCAAC CTCOGCCTCC T6GGTTCACA TGATTCTCCT 6CCTCAGCTT CCTAAGTAGC 2880 

TGOTTTTACA OGCACCCAGC ACCATGCCCA GCTAATTTTT GTATTTTTAA TAGAGACGGG 2940 

GTTTOGCCAT TTGGCCAGGC TGGTCTTOAA CTCCTGAOGT CAAOTGATCT OCCTGCCTTC 3000 

OTCTCCCAAT ACAGGCATGA ACCACTGCAC CCACCTACTT AGATATTTCA TOTQCTATAG 3060 

ACATTAGAgA GATTTTTCAT TTTTCCATGA CATTTTTCCT CTCTGCAAAT GGCTTAGCTA 3120 

/U CTTGTGTTTT TCCCTTTTGG GGCAAGACAG ACTCATTAAA TATTCTGTAC ATTTTTTCTT 3180 

TATCAAOOAO ATATATCAOT GTTOICTCAT AGAACTGCCT GGATTCCAIT TATGTTTTTT 3240 

CTGATTCCAT CCT0T6TCCC CTTCATCCTT GACTCCTTTO GTATTTCACT OAATTTCAAA 3300 

CATTTGTCAG AGAAGAAAAA OGTGAGGACT CAGGAAAAAT AAATAAATAA AAGAAOWSCC 3360 

TTTTCCCTTA GTATTAACAG AAATGTTTCT QTQTCATTAA CCATCTTTAA TCAATGTGAC 3420 

ZD ATOTTGCTCr TTQGCTGAAA TTCTTCAACT TGGAAAT6AC ACAGACCCAC AGAAGGTGTT 3480 

CAAA CACAA C CTACTCTGCA AACCTTGGTA AAOGAAGCSU} TCAGCTGGCC AGATTTCCTC 3540 

ACTACCT6CC ATGCATACAT GCTGC6CAT0 TTTTCTTCAT TOOTATOTTA GTAAAGTTTT 3600 

GGTTATTATA TATTTAACAT GTGGAAGAAA AGAAGACATO AAAAGAGTGG TGACAAATCA 3660 

A6AATAAACA CTGGTTOTAG TCAGTTTTOT TTGTTAA 3697 
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Seq ZD NO: C20 DMA Sequence 
Nucleic Acid Acceoeion #s NM_004443 
Coding sequence: 28.. 3024 " 



1 11 21 31 41 51 

I ) I i I I 

GQCTCGGCTC CTAGAGCTGC CACGGCCATO GCCAGAGCCX: GCCOGCCGCC GCOGCCGTOS 60 

CCGCCGCCGO GQCTTCTGCC GCTGCTCCCT CCGCTGCTGC TOCTGCCGCT GCTGCTGCTG 120 

CXXXS CCGG CT GCOXSGCXJCT GGAAGAGACC CTCATGGACA CAAAATGGGT AACATCTGAG 180 

TTGGOGTQGA CATCTGATCC A6AAAGTGGG TGGGAAGAGG TGAGTGGCTA CQATOAGGCC 240 

ATGAATCCCA TCCS3CACATA CCAGGTGTGT AATGTGCGCQ AGTCAAGCCA GAACAACTGG 300 

CTTCGCAGGO GGTTCATCTG GCGGCGGGAT GTGCAGOSaa TCTAOGTGQA GCTCAAGTTC 360 

ACTGTGCOTQ ACTGCAACAG CATCCCCAAC ATCCCCGGCT CXTTGCAAGGA GACCTTCAAC 420 

CTCTTCTACT ACGAGGCTGA CAGOQATGTG GCCTCAGCCT CCTCCCCCTT CTGGATGGAO 480 

AA CCOCTA OQ TGAAAGTGGA CACCATTGCA CCCGATGAGA GCTTCTOGCG GCTGGATGCC 540 

GGCOSTOTCA ACACCAAGGT GCX3CA0CTTT GGGCCACTTT CCAAGGCTGG CTTCTACCTQ 600 

GCCTTCCAGG ACCAGGGOGC CTQCATGTCG CTCATCTCCG TGCGOQCCTT CTACAAGAAO 660 

TGTGCATCCA CCACCQCAGG CTTOGCACTC TTCCCX3GAGA CCCTCACTGO GGCGGAGCCC 720 

ACCTC6CTGG TCATTGCTOC TGGCACCTGC ATCCCTAAOG COGTGOAGGT QTOGQTGCCA 780 

CTCAA6CXCT ACTGCAACGG CGATGGGGAQ TGGATGGTGC CTGTGGGTGC CTGCACCTGT 840 

GCCACCGGCC ATGAGCXaVGC TGCCSVAQOAQ TCOCAGTGCC OCCCCTGTCC CCCTGGGAGC 900 

TACaUVGQCQA AQCAGGGAGA GGGGCCCTGC CTCCCATGTC CCGCCAAC3U5 CG8TACCACC 960 

TCCCCAGCOG CCAGXa^TCTG CACCTQCCAC AATAACTTCT AGCOXGCAGA CT066ACTCT 1020 

J J QOGGACAGTG CCTGTACCAC CGTGCCATCT CCACCCOGAG GTOTQATCTC CaUVTGTGAAT 1080 

GAAACCTCAC TOATCCTOGA OTGOAGrGAG CCCOGGGACC TGGQTGGCOQ GGATGACCTC 1140 

CTGTACAATG TCATCTGCAA GAAGTGCCAT GGGGCTGGAQ GGGCCTCAGC CTGCTCAC36C 1200 

TGTQATGACA ACX3TG6AGTT TQT6CCT0GQ CA6CTGG6CC TQA0GGA0C3G 006GGTCCAC 1260 

ATCAGCCATC TGCTGGCCCA CACGOGCTAC ACCTTTG2VGG TOCAOQCOOT CAACG6TGTC 1320 

TOGGGCAAGA GCCCTCTGCC GCXrTOGTTAT GCGGCOGTGA ATATCACCAC AAACCAGGCT 1380 

GCCCOSTCTG AAGTGCCCAC ACTACGCCTQ CACAGCAGCT CAGGCAGCAG CCTCACCCTA 1440 

TCCTGGGCAC CCCCAQAGCG GCCCAACGQA QTCATCCTGG ACTAOGAGAT aAAOTACTTT 1500 

GAGAAGAGC6 AGGGCATCGC CTCCACAGT6 ACCAQGCA6A TOAACTCGGT GCAGCT08AC 1560 

GGGCTTCGGC CTGACGCCCG CTATGTGGTC CAGGTCXSJrG CCCGCACAOT AGCTQGCTAT 1620 

GGGCAGTACA GCCGCXXnXSC OQAGTTTGAG ACCACAA6TG AGAGAGGCTC TGGGGCCC3VG 1680 

CAGCTCCAOG AGCAGCTTCC CCTCATOGTG GGCTCCGCTA CAGCTQQGCT TOTCTTCGTQ 1740 

GTGGCTGTCO TGGTCATCX5C TATCGTCTGC CTCAGGAAGC AGCGACAOGG CTCTGATTCG 1800 

GAGTACAOGO AGAAGCTGCA GCAGTACATT GCTCCTGOAA TOAAOOTTTA TATTQAOCCT 1860 

TTTACCTACO AGGACCCTAA TGAOGCTGrT OGGGASTTTQ OCAAGGASAT OGAOSTOTCC 1920 

TGOGTCAAOA TOGAGGAGQT GATOGOAGCT GGGQAATTTO GGGAAGTGTG COOTGGTOGA 1980 

CTGAAACAGC CTG6C0GCOG AGAGGTGTTT GTGGCCATCA AGACGCTGAA GGTGGGCTAC 2040 

ACCGAGAGGC AGCGGCGGGA CTTCCTAAOC GAGGCCTC5CA TCATGGGTCA GTTTGATCAC 2100 

CCCAATATAA TCCGGCTOGA GGGG6TQGTC ACCAAAAGTC OGCCAOTTAT GATGCTCACT 2160 

GAQTTCATOa AAAACTGOGC CCTGGACTGC TTCCTCCGGC TCAACQATGG GCAGTTCACG 2220 

GTCATCCAGC TGGTGGGCAT GTTGCXWGGC ATT6CTGC00 OCATGAAQTA CCTGTCOGAG 2280 

ATGAACTATG TGCACXSCQA CCTGGCTGCT CX3CAAC3^TCC TTGTCAACAG CAACCTGGTC 2340 

TGCAAAGTCT CAGACTTTGG CCTCTCXXXX: TTCCTGGAGO ATGACCCCTC CXSATCCTAOC 2400 

TACACCAGTT CCCTGGGGGG GAAGATCGCC ATOOGCTGGA CTGCCCaUSA GGCCATA60C 2460 

TATCGGAAGT TCACTTCTGC TAGTGATOTC TGGAGCTAOG GA A TTGT CA T GTGGGAGGTC 2520 

ATQAGCTATG GAGAGCX3ACC CTACTGGGAC ATGAGCAACC ACQATGTCAT CAATGCOGTO 2580 

QAGCAOGATT ACOGGCTGCC ACCACCCATG QACTGTCCCA CAGCACTGCA CCAGCTCATG 2640 

CTGGACTGCT GGGTGGGGGA COGGAACCTC AGGCCCAAAT TCTGCCAQAT TGTCAATACC 2700 

CTGGACAAGC TCATGCXSCAA TGCTGCCAGC CTCAAGGTCA TTX3CCAGGGC TCA6TCXG6C 2760 

AT6TCACAGC CCXnXXriGGA COGCAOGGTC OCAQATTACA CAACCTTCAC GACAGTTGOT 2820 
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GATTGGCTGG ATGCCATCAA GAT6GGGCGG TACAAGQAGA GCTT06TCAG TGOQGOGTTT 3880 

GCATCTTTTG ACCTGGTGGC CCAOATGAOG GCAGAAOACC TGCTCOGTAT T6GG6TCACC 3940 

CTGGCOGGCC ACCAGAAQAA GATCCTGAGC AGTATCCAGG ACATOCOGCT GCAQATCAAC 3000 

- CAGACGCTGC CTX3TGCAGGT CTGACACOGG CTCCCA0G6Q GACCCTQAGG ACOGTGCAGG 3060 

J GATGCCAAGC AGCCGGCTGG ACTTTOGGAC TCTTGGACTT TTGGATOCCT GGCCTTAGGC 3120 

TQTGGCCCAG AAGCTGQAAG TTTGGGAAAO GCCCAA6CTG G6ACTTCTCC AG6CCT6TQT 3180 

TCCCTCCCCA GGAAGTOCaC CCCAAACCTC TTCATATTGA AGA1GGATTA GGAGAGGGGG 3240 

TGATGACCCC TCCCCAAGCC CCTCAGGGCC CAGACCTTCC TGCTCTCCAO CAGGGGATCC 3300 

- CCACAACCrC ACACTTGTCT GTTCTTCAGT GCTGGAGGTC CTGGCAGGGT CAGGCTGGGG 3360 

lU TAAGCCGGGG TTCCACAGGG CCCAGCCCTG GCAGGGOTCT GGCCCCCCAG GTAGGCGGAG 3420 

AGCAGTCCCT CCCTCAGGAA CTGGAGGAGG GGACT0CAG6 AATGGGGAAA TGTtZACACCA 3480 

CCATCCTGAA GCCAOCTTOC ACCTCCAGTT TGCACAGGGA TTTOTCCTGG G6GCTCA6GG 3540 

CCCTGTCCCC ACCCCCGCCC TTGCTGCTGT CATAAAAGGG CAGGCAGGGQ CAGGCT6AQQ 3600 

AGTTGCCCTT T6CCXXCCAQ AGACTGACTC TCAGAGCCAG AGATGGGATG TQTGAGTGTQ 3660 

ij T6TGTGTGT6 TGTGOCXX5CX3 OGCX5CG0GTG TGTQTGT6C3V CGCACTG6CC TGCAOUSAGA 3720 

0CATG6GT6A GCXSTOTAAAA GCTTGGCCCT GTGOCCTACA ATGGG6CCAG CTGGGCCGAC 3780 

AGCAGAATAA AGGCAATAAO ATQAA 3805 

20 Seq ID NO I C21 DMA Sequence 

Nucleic Add Accession NN_0D1804 
Coding sequence 1 82.. 879 

^ " 21 31 41 51 

25 I 1 1 I I I 

A0GTGAG06G TTGCT0GT08 TGG8GO0GQC GGOCAOGGGC GGCTCCAGGG OCCAGCATGC 60 

GCGGGQGACC CGGCGGCCAC CATGTATGTG GGCTAT G TGC TGGACAAGGA TTC0GCC8T0 120 

TACCCC3GGCC CAGCCAGGCC AGCCAGCCTC GQCCTGGGCC CGGCAAACTA CGGCCCCCCQ 180 

GCCXXXK3CCC CX5606CCCCC GCAGTACXTCC GACTTCTCCA QCTACTCTCA CGTGGAGCCG 240 

JU GCCCXW300C CCCOQAGGQC CTOGGGGGCG CCCTTCCCTG 0GCX:C3\AGGA CX^ACTGGGCC 300 

GCGGCCTAC8 GCCOOGGCCC O60GGCCCCT GCOGCCAGCX: CAGCTT06CT GGCATTCGGG 360 

CCCCCTCCRa ACTTTAGCCC GGTGCOQGOG CCCCCTGGGC COGGCCCGGG CCTCCIGGCO 420 

CA6CCCCT0G GGGGCCCGGG CACAC06TCC TOGCCCGQAO OGCAGAGGCC GAOGCCCTAC 480 

QAGTGGATGC GGCGCAGCGT GGCGGCXX^GA GGCGGCX3GTG GC3VGCGGTAA GACTCSGGACC 540 

J J AAGGACAAGT ACCGCQTQGT CTACACCQAC CACCAAC3GCC TGGA6CTGGA GAAGGAGTTT 600 

CATTACA6CC GTTACATCAC AATCOGGGGG AAATCAGAGC TQGCT6CCAA TCT60GGCTC 660 

ACTGAACX3GC AGOTGAAGAT CTGGTTOCAA AACC6G0GGG CAAAGGAGCO CAAAGTGAAC 720 

AAGAAGAAAC AGCAGCAGCA ACAGCCCCCA CAGC06CX3GA TGGCCCACGA CATCACGGCC 780 

ACXXXAGCCG GQCCATCXXrX GGGGGGCCPO TGTCCCAGCA ACACCAGCCT CCTGOCCACC 840 

4U TCCTCTCCAA T6CCTGTGAA AGAG6AGTTT CIOCCATAGC CCCATGCCCA GCCTGTGGGC 900 

OGGGOGACCT GGOGACTOGG GTGCTGG6A6 TGTGGCTCCT GTGG6CCCA0 QAOGTCTQQT 960 

006 AGTC TCA GCCCTGACCT TCIGG6ACAT GGTGGACAGT CACCTATCCA CCCTCTOCAT 1020 

CCCCTXQQCC aVTTGTGTGC AGTAAGCCTQ TTGGATAAAG ACXTTTCCAQC TCCTGTGTTC 1080 

. TAGACCTCTO GGGGATAAQG GAGTCCAGGG TGGATGATCT CAATCTCCCG TGGGCATCTC 1140 

43 AAGCCCCAAA TGGT TGGGGO AGGGGCCTAG ACAAGQCTCC AGGCCCCACC TCCTCCTCCA 1200 

TAOGTTCAGA GGT0CAGCT6 GAGGCCTQTG TGGGGACCAC ACTGATCCTG QAQAAAAOGG 1360 

ATG6AGCTGA AAAAGATGGA ATGCTTGCAO AGCATGACCT GAGOAGGGAG GAACGTGGTC 1320 

AACTCAC31CC TGCCTCTTCT GCAGCCTCAC CTCTACCTGC CXXCATCATA AGOGCACTGA 1380 

GCCCTTCCCA GGCTGGATAC TAAGCACAAA GCCCATAGCA CTGGGCTCTG ATGGCTGCTC 1440 

DU CACTGGGTTA CAGAATCACA GCXCTCATGA TCATTCTCAO TGAGGGCTCT GGATTOAQAG 1500 

G6AGGCCCTQ GQA66AGAGA AGG6GGCA0A 6TCTTCCCTA GCAG6TTTCT ACACC0C08C 1560 

CAGGCTGCCG ATCAGGGCCC AGGQAGCCOC CAOAGGACTT TATTCGOACC AAGCAGAGCT 1620 

CACAO CTGGA CAGGTGTTGT ATATAGA6T6 GAATCTCTT6 GATGCAGCTT CAA6AATAAA 1680 

TTTTTCTTCT CTTTTCAAA 1699 
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Seq ID KO: C22 CNA Sequence 
Nucleic Acid Accession #t IIM_021978 
Coding sequence I 36.. 2603 ~ 



60 1 11 21 31 41 51 

I I I I i I 

6A0SCCTGTG AOAOOOSCQA GGG6CCT0GG GGACCATGGG GAGGGATCQG GCCOGCAAGO 60 

GC6GAGGGGG COOGAAGGAC TTOOGOGOGG GACTCAAGTA CAACTOCCGG CAOGAGAAAG 120 

TGAATGGCTT GGAOGAAGGC GTGGAGTTCC TQCCAGTCAA CAACGTCAAO AAGGTGGAAA 180 

03 AGCATGGCCC GGGGOOCTGG GTGGTGCTOG CA0CC0T6CT GATCGQCCTC CTCTTGGTCT 240 

TOCTGGGGAT OGGCTTCCTO GVG1GGCATT TGCAOTACCG G6AGSTQCGT OTOCAOAAGO 300 

TCTTCAATGO CTAC ATQAGQ ATCACAAATQ AGWVTTTT6T GGATGCCTAC 6AGAACTCCA 360 

ACTCCACTGA GTTTOTAAGC CTGGCCAGCA AGOTGAAGGA OQCGCTGAAG CTGCTGTACA 420 

GCGGAGTCCC ATTCCTGGGC COCTACCACA AGQAQTOGGC TGTGAOGGCC TTCAGCGAOG 480 

/U GCAGOGTCAT CGCCTACTAC TGQTCTGAGT TCAGCATCCC GCAGCACCT6 6TGGAGGAGG 540 

C0QA6OGCGT CATGGCCOAO GAGG6CGTAG TGAT6CTGCC CCCGCaGaCO OGCTCCCTGA 600 

AGTCCTTTGT GGTCACCTCA GTGOTGGCTT TCOCCAGGGA CTCCAAAACA GTACAGAGGA 660 

CCCAGQACAA CAGCTGCAGC TTTGGCCTGC A06CCCG0GG TOTGGAOCTO ATGCGCTTCA 720 

CCACQCCCGG CTTCCCTGAC AGCCCCTACC COOCTCATGC CCGCTGCCAQ TGGGCCCTGC 780 

/j GQGGGGAOOC CGACTCAGTG CTGAGCCTCA CCTTCOGCAQ CrTTQACCTT GCGTCCTGCG 840 

AOGAGG0OG6 CAGGGACCTG GrGAOGGTGT ACAACACOCT GAGCCCCATG GAG0CCCAC6 900 

CCCTGGTGCA GTTOTGTGOC ACCTACCCTC CCTOCTACAA CCTGACCTTC CACTCCTCCC 960 

AQAACOTCCT GCTCATCACA CTGATAACCA ACACTGAOCO GOGGCATCCC GGCTTTGAGG 1020 

CCACCTTCTT CCAGCTGCCT AGGATGAGCA GCTCTGGAGG CCQCTTACGT AAAGCCCAGG 1080 

OU GGACATTCAA CAGCCCCTAC TA0CCAG6CC ACTACCCAOC CAACATT6AC TGCACATGGA 1140 

ACATTQAGOT GCCCAACAAC CAOCATOTGA AGOTGCOCIT CAAATTCTTC TACCT6CTQ0 1200 

AGCC06G08T GCCTGCGOOC ACCT QO CC C A AGGACTACQT GOAOATCAAT GGGGAOAAAT 1260 

ACTGCOGAGA fiAGGTCCCAO TTCGTOGTCA CCAGCAACAa CAACAAGATC ACAOTTOQCT 1320 

TCCACTCAQA TCAGTCCTAC ACOOACACOG GCTTCZTAOC TQAATACCTC T0CTAC6ACT 1380 
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OCAGTQAOCC ATCCCOCX3CX3 CAOTTCAGGT GCOGCAOGGG GOQOTGTATC G8GAAGQAGC 1440 

T60QCT6TGA TGGCTGGGCC GACTGCACCG ACCACAG06A TGAGCTCAAC TGCAGTTQ06 1500 

A0QCC3GGCCA CCAGTTCACO TGCAAQAACA AGTTCTGCAA GCCCCTCTTC TGGGTCTGOS 1560 

ACRG TGTGAA CGACTGOGGA GACAACAGCG AOGAGCAGGG GTGCAGTTGT CCGGCCCAOA 1620 

CCTTCAGGTG TTCCAATGGG AAGTGCCTCT GGAAAA6CCA GChGTGCAAT GGGAAGGAOQ 1680 

ACT6TGGG6A OGGGTCOOAC QAQ OC CTCCT GCCCCAAGQT OAAOCSTOOTC ACTTQTACXA 1740 

AACACACCTA CCGCT6CCTC AATG6GCTCT GCTTGAGCAA GGGCAAGCCT GAGTGT6A00 1800 

GGAAGGAGGA CTGTAGCGAC GGCTCAQATG AGAAGGACTQ CGACTGTOGO CTGCGGTCAT 1860 

TCAC3QAGACA GGCTCGTGTT GTTGGGGGCA CGGATGCOGA TGAGGGCQAQ TGGCCCTGGC 1920 

A GGTAAGCC T GCATGCTCTG GGCCAGGGCC ACATCTGOQG TGCTTCCCTC ATCTCTCCCA 1980 

ACTGGCVGGT CTCTGCCGCA CACTGCTACA TOGATGACAO AGGATTCAGQ TACTCAGACC 2040 

CCA0GCA6TG GAOGGCCTTC CTGG6CTTGC AOSACCWSAG CCAGCGCAGC GCCXTTGGGG 2100 

TGCAG6AG0G CAOQCTCAAG CX3CATCATCT COCACCCCTT CTTCAATGAC TTCACXTTTOG 2160 

ACTATGAOVT CGCGCTGCTG GAGCTGGAGA AACCGGCAGA GTACAGCTCC ATGGTQCX3GC 2220 

I J CCATCTGCCT GCCGQAOGCC TCCCATGTCT TCCCTGCOGG CAAGGCCATC TGGGTCAOGG 2280 

GCTGGGGACA CACCCAGTAT GGAQGCACTO GGOOQCTGAT CCTGCAAAAG GOTGAQATCC 2340 

GCGTCATCAA CCAGACCACC TGC6AGAACC TCCTGCX36CA 6CAGATCAC6 CCGCGCATGA 2400 

TGTGCGTGGG CTTCCTCAQC GG0GG06TGG ACTCCTGCCA GGGTGATTCC GGGGGACCCC 2460 

TGTCCAGOGT GGAGQCGGAT GGGOGGATCT TCCAGGCOGO TGTGGTGAGC TGGGGAGACG 2520 

GCTGOGCTCA GAGGAACAAG CCAGaC3GTGT ACACAAGGCT CGCTCTGTTT CGGGACTGGA 2560 

TCAAAGAOAA CACTGGGGTA TAGGGGCGG6 GGGCACCXIAA AT6TGTACAC CTGCX^GGGCC 2640 

AC0CATCX3TC CACCCCAGTG TGCACGCCTQ CAGGCTGGAG ACTGGACCQC TGACTGCACC 2700 

AGCGCCCCCA GAACATACAC TGTGAACTCA ATCTCCAGGG CTCCAAATCT GCCTAQAAAA 2760 

CCTCTCQCTT CCTCAGCCTC CAAAGTGGAG CTGGGAGGTA OAAGGGGAGG ACACTGGTGO 2820 

TTCTACTGAC CCAACTOGGG GCAAAGGTTr GAAGACACAG CCTCCCCC36C CAGCCCCAAG 2680 

CTGGGCGGAG GC6GQTTTGT * GTATATCTGC CTCCGCTGTC TGTAAGGAGC AGCGGGAACG 2940 

GAGCTTCGOA GOCTCCTCAG TGAAQGTGGT GGGGCTGCCG GATCTGQGCT GTGGGGCCCT 3000 

TGGGCCAC6C TCTT GAGGAA 6CCCAGGCTC GGAOGACCCT GGAAAACAGA 08GGTCTGA6 3060 

ACTGAAAATG GTTTACCAGC TCCCAGGTOA CTTCABTGTG TGTATT6TGT AAATGAGTAA 3120 

AACATTTTAT TTCTTTTTAA AAAAAAAAA 3X49 



20 



25 



30 



35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



8eq ID MO: C33 DNA Sequence 

Nucleic Acid Acceesion it Sob sequence 

Coding sequence : 1 . . 2268 

1 11 21 31 41 SI 

i 1 I I I I 

ATOCCCCCTT TCCTGTTGCT GGAAGCCGTC TGTGTTTTCC TGTTTTCCAQ AGTGCCCCOV 60 

TCTCTCCCTC TCCAQGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 

AG0GTC3GGGA AAGOGAGCTT TGAAAGGTCC AAQCACTTTG CCATCACAGT CTGTGAOGGT 240 

CTGGACATCA GCCCOOAGAG GGTCAGAGTG 6GAQCATTCC AGTTCAGTTC CACTCCTCAT 300 

CTG GAATT CC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 

AT GG'm TCA AAGGAGGGOG CACGGAGACG GAACTTGCTC TGAAATACXTT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGQ 480 

AAGTOCCAGG GGGATGTGGC ACTOCCATCC AAGCAGCTQA AGGAAAGGGG TGTCACTGTG 540 

TTTGCTGTGO 6GGTCAGGTT TCCCAGOTGG GAGGAGCTGC ATGCACTGGC CAGOGAGCCT 600 

AGAGGGC3U3C ACQTQCTGTT GGCTQAQCAG GTGGAGGATG CXS^CCAAGGG CCTCTTCAGC 660 

ACCCTCAGCA GCTCGQCCAT CTGCTCCAGC GCCAOGCCAG ACTGCAGQGT CX3AGGCTCAC 720 

CCCTGT3A6C ACAGQAOGCT GGAQATGGTC CGGGAGTTOG CT6GCAAT6C CXX3^TGCTGG 780 

AGAGGATCX3C GGCGGACCCT TGOGGTGCTQ GCTGCACACT GTCCCTTCTA CAQCTGGAAG 840 

AGAGTOTTCC TAACCCACCX: TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 

TCX3CAGCCCT GCCAGAATGG AGGCACATGT OTTGCAQAAG GACXGGAOSG CTACCAGTGC 960 

CTCTGCCCGC TGGCCTTTGQ AG6GGAGGCT AACTGTGCCC TGAAGCTGA6 CCTGGAATGC 1020 

AGGGTCGACC TCCTCTTCCT GCTGQACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 1080 

GGGGCXAAAG TCTTCGTGAA GCGGTTTGTQ CX3GGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140 

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCXrrGT GGGGGAOTAC 1200 

CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCOGTGG TQQOCOCACC 1260 

CTGAOGGGCA GTGCCTTGCG GGAGGCX^GCA GAGOGTGGCT TCX3GGAGCGC CA0CAG6ACA 1320 

GGCC AGGACX: GGCCACGTAG AGTGGTGGTT TTGCTCACTG AQTCACACTC OQAGGATGAG 1380 

GTTGOGGGCC CAGOGOGTCA OGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440 

GAGGCaSTGC GGGCAGAGCT GOAGGAGATC ACAQGCAGCC CAAAGCATGT GATGGTCTAC ISOO 

TCX3GATCCTC AGGATCTGTT CAACCAAATC CCTGAGCT6C AGGGGAAGCT GT6CASCGGG 1560 

CAGOGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTOGTCT TCATGTTGGA GAGCTCT6GC 1620 

TCRG TAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTOCAG 1680 

TTTGAGGTGA ACCCTGAaST GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740 

ACTGCCTTCG GGCTGQACAC CAAACCCACC CGGGCTGOQA TGCTGCJGGGC CATTAGCCSW3 1800 

GCCCCCTACC TAGGTGQGGT GGGCTCAGCC GGCACCX3C0C TGCTOCRCAT CTATGACAAA 1860 

GTGATGACCX3 TCCAGAGGGG TGCCOGGCCT GGTGTCCCCA AAGCTOTGGT GCTOCTCRCA 1920 

GGOQGGAGAG GCGCAGAGQA TGCAGCCGTT CCTGCCXAGA AGCTGAGGAA CAATGGCATC 1980 

TCT6TCTTG0 TOOTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGOGGAO GCTTGCAGGT 2040 

CCCCGGGATT CCCTQATCCA CGTGGCAGCT TAOGCOGACC TGCGGTACCA CCAGGAO0TX3 2100 

CTGATTQAGT GGCTGTGTGG AGAAGCCAAG OSGCCAGTCA ACCTCTGOUV ACGCAGC006 2160 

TGCAT6AATG AGGGCAGCTG CGTGCTGGAG AATGGGA6CT ACOGCTQCAA GTOT O GGGAT 2220 

GGCTGOQAGG GCCCCCACTG CGAGAACCOA TTCTTGAGAC GCCCCTGA 2268 

Seq ID NO: C24 DNA Sequence 

Nucleic Acid Accession 8i Eos sequence 

Coding sequencet X..2424 

1 11 21 31 41 51 

ATOCCCCCTT TCCTGTTGCT OOAGGCOOTC TOTOTTTTOC TOTTTTOCAG AOTGCCCCCA 60 

TCTCTCCCTC TCCAQaAAOT CCATGTAAGC AAAGAAACCA TOGGGAAOAT TTCAGCTGCC 120 
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AGCAAAATGA 
AGCX3TOC3GQA 
CTGGACATCA 
CTOGAATTCC 
ATOGTTTTCA 
GGGTTGCCTG 
AAGTCCCAGG 
TTTGCTOTGO 
ASAGGGCAOC 
ACCCTCAGCA 
CCCXX3TGAGC 
AGAGGATCGC 
AGAGTGTTCC 
TCGOWSCCCT 
CTCTGCCCGC 
AGGGTCX3ACC 
CGGGCCPMG 
CGAGTGGGTG 
CAGGATGTOC 
CTGACGGGCA 
G6CCAGGACC 
GTTGCGGGCC 
6AGGCCGTGC 
TOGGATCCTC 
CAGCGGCCAG 
TCAGTAGGGC 
TTTGAGGTGA 
ACTQCCTTCG 
GCCCCCTACC 
GTQATQACCX3 
GGC6GGAGAG 
TCTGTCTTGG 
CCC066GATT 
CTCATTGAGT 
TGCATGAATG 
GGCTGGQAGG 
CAQGQAT6GA 
CX3TACCCCTC 
AATGTCTGTG 



TGTGGTGCTC 
AA8GGAGCTT 
GCCCO BABAQ 
CCTTGGATTC 
AAGGAGGGCO 
GAGGCAGAAA 
GQQATGTGGC 
GGGTCAGGTT 
AOGTGCTGTT 
CCTCGGCCAT 
ACAGGAOGCT 
GGCQQACCCT 
TAACCCACCC 
GCXAGAATGG 
TGGCCTTTGG 
TCCTCTTCCT 
TCTTCGTGAA 
TGGCCACATA 
CTGACCTGGT 
GTGCCTT60G 
GGCCACGTAG 
CAGCQCGTCA 
GGGGAGAGCT 
AGGATCTGTT 
GGTGCCGGAC 
COGAGAATTT 
ACCCTGACXST 
GGCTGGACAC 
TAGGTGG6GT 
TCCAGAGGGG 
GCGCAGAGGA 
TCGTGGGCGT 
CCCTOATCCA 
GGCTGTGTGG 
AGGGCAOCTG 
GCCCCCACTG 
TTCTTOAGAC 
CXA6CAACTA 
OCCCAGGTCC 



GGCTGCAGTO 
TGAAAGGTCC 
GGTCAGAGT6 
ATTTTCAACC 
CACG6AGACG 
TGCTTCTGTG 
ACTGCCATCC 
TCCCAGGTQG 
G6CTGAGCAG 
CTGCTCCAGC 
GGAGATGGTC 
TGOGGTGCT6 
TGCCACCTGC 
AGGCACATGT 
AGGGQA6GCT 
GCTGGACA6C 
GCGOTTTGTG 
CAGCA6GQA0 
CTGGAGCCTC 
GCAGG066CA 
AGTGC3TGGTT 
OGCAAGGGOG 
GGAGGAGATC 
CAACCAAATC 
ACAAGCCCTO 
TGCTCAGATG 
GACACAGGTC 
CAAACGCACC 
GGGCTCAOCC 
TGCCCGGCCT 
TGCAOCOOTT 
GGGGCCTGTC 
CGTGGCAGCT 
AGAAGCCAAG 
OGTCCTGCAG 
CGAGAACCX3T 
GCCOCTGAGO 
CAGAGAAGGC 
TTAG 



GACATCATQT 
AAGCACTTTG 
GGAQCATTCC 
CAACAGGAAG 
GAACTT6CTC 
CCCCAGATCC 
AAGCAGCTGA 
GAGGAOCTGC 
GTGGAGGATG 
GCCA OGCCAG 
CGGGAGTTCG 
GCTGCACACT 
TACAGQACCA 
GTTCCAGAAG 
AACTGTGGGC 
TCTGOGGGCA 
COGGOCQTGC 
CTGCTGGTGG 
GATGGCATTC 
GAGCGTGGCr 
TTQCTCACTQ 
CGAGAGCTGC 
ACAGGCAGCC 
CCTGAGCTGC 
GACCTCGTCT 
CAGAGCTTTG 
0GCCTGGTG6 
C6GGCTG0GA 
GGCACXX3CCC 
6GTGTCCCCA 
CCTGCCCAOA 
CTAA6T6AG0 
TACGCOGACC 
CAGCCAOTCA 
AATGGGAGCT 
GAGTGGAGCT 
CACATGGCTC 
CTGGOCACTG 



Seq ID NOt C25 X3NA Sequence 

Nucleic Acid Accession 9i XM_0973B6.3 

Coding sequences 142.. 795 ~ 



I 



CTOGCAGAAC 
GGGACCCCCG 
CTACCTCAGG 
TCC06TGCAC 
GCTGGACCCC 
GGGTOGAG6G 
GTGTCTGTGG 
TTGAGCXTTGC 
OQAGGCTGGG 
GGRAA TGQGA 
COGrGTCTGC 
TCTCCTQCCA 
CAGCACCCCC 
CTGGATGGGT 
C31CAGTCAAG 
GGAACCTTCC 
AG6GGGGGCA 
TCTTATCATG 
CCTTGGCAAA 
GTTTGTCAGA 
AACTTAAAAA 
GTCTOGCGCT 
GTATTAAAAG 



11 

I 

CACCT06ACT 
TCCCAOGCCT 
TCTGACTTTT 
CAGGGTCTGC 
TGGGQOCGOC 
CTGGGTOOTC 
6CGATC0GGC 
TGTTAACTCC 
TGGOGGCCCA 
GOGGOTTGCA 
A0A6GCTCCT 
CGGCCTGTC3G 
CCTTATGCAG 
TCTAOTTCAC 
GCCAAOGGGQ 
GTGGGACCOG 
OCTGGGGTCG 
TTTTACrCAC 
TGTAAACTCA 
06GGGTGG6A 
AATTTTTAAA 
GGCa3CX36CA 
GAAAAAA 



21 
I 

CTGTCCGTGT 
GGGCCCCGCG 
GATGCCAAAA 
AGCAGCCACT 
GTGCACCTGC 
TOCCTGGGGC 
CTCXrCGQCGG 
OATGATTGTA 
GGAGAAG6AA 
GAGGGTCTAT 
CCCT8CCTCA 

ACTGGGAGGG 
TT6GGAC0GT 
GCTGCGCCrO 
TAAGTGGCTG 
TTGTTAAGGG 
CTTGTCCCTT 
GCCTTTC9VTT 
ACCACGCCTG 
AAACATAAAA 
GGGTGGCCTG 



31 
I 

C TOTC CCCCG 
COQGGGGAAG 
TCTGAGCCCC 
GGGGCCTGGC 
CCAGCTOGGA 
TGC G TOTGTG 
TGGGTGQACC 
GGGACAGGCG 
OTCOCTQAAO 
GGGGOCCGGT 
GQTGGCCCGQ 
AAATTOGGTG 
GGTCX3GGCAG 
GGGGCCTGGC 
CTCTGAOATG 
TCCA6AAA0G 
TCACGCATCT 
TTTTCCCCAA 
CATGA06T0T 
AAACrCAGGT 
CTACTCTCTA 
TAACAATTTC 



rrCTGTTASA 
CCATCACAQT 
AOTTCAGTTC 
TGAAGGCAAO 
TGAAATACCT 
TCATCATOGT 
AGGAAAGOGG 
AT6CACTGGC 
CCACCAACGG 
ACTGCAGGGT 
CTG6CAAT6C 
GTOCCTTCTA 
CCTGCCCAGG 
GACTGGACGG 
TGAAQCTGAG 
CCACTCTOGA 
TGA6CGA06A 
0G6TGCCTGT 
CCTTCCGTGG 
T0GGQA6CGC 
AGTCACACTC 
TCCTGCTGGG 
CAAAGOVTGT 
A6GGGAAGCT 
TCATGTT6QA 
TGAGAAGCT6 
TGTATGOCAG 
TGCTGG6GGC 
T6CTGCACAT 
AAGCTGTGGT 
AGCTGAGGAA 
GTCTGOGQAO 
TGCG6TACCA 
ACCTCTGCAA 
ACCGCTGCAA 
CTT6CTCTGT 
CCGT6CAGGA 
AAATGGTGCC 



41 
I 

GCCTCCA6G6 
CGCCTGCTGC 
TGGGGTGCCT 
TGCCTGCTGC 
GCCTGGQGAG 
TGTCGGGAAT 
TGGATTCTAA 
GGGTGGGTGQ 
QCAQTGGCCA 
CCTGGATACT 
TTC AACCC CA 
TGGCTTTCTG 
TCCCCTCAGC 
T606TACTGA 
TTGG6AGAAA 
C6GGAGGGT0 
GTACAGTTGA 
TTGTQCTTTT 
GAAATTTCAG 
AATAGQAGGA 
OCTCTGGCTG 
AGTTTTGGCA 



TQQGTCTAAC 
CTGTGA0G6T 
CACTCCTCAT 
AATCAAGAGG 
TCTGCACAGA 
CACTGATGGG 
TGTCACTGTC 
CAGCGA6CCT 
CCTCTTCAGC 
CQAGGCTCAC 
CCCAT6CTGG 
CA6C1G6AAG 
CCCCTGTGAC 
CTACCAGTGC 
CCTG6AAT6C 
GGGCTTCCTG 
CTCT06GGCC 
GGGGGAGTAC 
TGGCCCCACC 
CACCAQGACA 
CGAGGATGAG 
TGTAGGCAGT 
GATGGTCTAC 
GTGCAGCOGQ 
CACCTCTGCC 
TGCCCTCCAG 
CCAGGT6CAG 
CATTAGCCAG 
CTATGACAAA 
GGTGCTCACA 
CAATGGCATC 
GCTT0CAQ6T 
CCAGGACGTQ 
ACCCAGCCCG 
GTGT0GG6AT 
ATGTGTGA6C 
GGGCAGCA6C 
TACCTTCTGG 



51 
I 

CTCCTCTCCC 
CTATCTCTGT 
CTCCCCCQCC 
ATCTGGCGGC 
QTOO OGTQCA 
CCTGCGTGTC 
CTCAOAGGAC 
GGGGTGGG06 
TOCTGQCOGT 
CGGCAGGAAO 
GCOGTGCCCA 
GGOTGCAQCT 
CACGAGGACC 
GTGGGTGCCC 
G80QGCTTCT 
GGCACGGGGC 
ATTTCCTTTC 
GCATTTTTTT 
TTTCTCTGGA 
AAAAAAAAAA 
GGCCCAGCCT 
GAACATTCAG 



Seq ZD NOt C26 DNA Sequence 

Nucleic Acid Accession St Eos sequence 

Coding sequence 1 95.. 2128 



GGGGTAOTTT 
CTCCTGGAAA 
TGCTTCCTGO 
CACACTGAGA 
6ATCAATATA 
GCTTCTOAAA 
CTTCACCCCT 
60GAGTCA0C 



11 

1 

GTAGGOAGGC 
GGA6AGACAC 
aAGCTTGTGG 
GTATCTGGAG 
TGCCAAGACr 
ACOCACTGQA 
CATOAT AAAA 
TXCTCAGCTO 



21 

I 

AGCTCTCCAC 
CAGCATTTGC 
TCOGOGTTOA 
ACCTTCATGT 
GGTCAOACTT 
OCCTQQACAA 
TGCT6C6CCT 
TGOTTTTTAA 



31 
I 

GTGCGOGACT 
CACAATGCTG 
CCATCCCAAT 
TGGA6GAGTG 
TQCTCTTTGO 
ATATGQGOTC 
TOOTCT OC C B 
AGCTOTOVOT 



41 
I 

GOGAGGCTGG 
TCATCCACTG 
GAAGAGCAGC 
ATGCTCAAGT 
TGG6AACAGA 
CAGGCAGATO 
AATTTQAAOA 
QATATCTGOl 



51 
I 

A0GCTAC6GG 
ACTTTACATT 
AGAAAGACGT 
TAGTAGAACA 
AGCATTGCTG 

TQOTQAQQTT 
AAATOCTQAA 



lao 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1600 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2424 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1337 



60 

120 

180 

240 

300 

360 

420 

480 
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TATTAGAAGA TCRGAAQAGC TTTCCTTGTT AAAGCOGTCT GGTGACTATT TTAAGAAGAA 540 

GAAGAAAAAA GACAAAAATA ATAAGGAACC CATAATTGAA GATATTCTAA ACCTOOAGAO 600 

TTCTCCAACA GCTTCAGGTT CATCAGXAAO TCCTGGTTTA TACAGTAAAA CCATGACCCC 660 

5 TATATA TQAC CCCATCAATG GAACACCAGC ATCATCCACC ATGACTTGOT TCAGTGACAG 720 

CCCirrOACG GAACAAAACT OCAGCATCCT OGCATTCAGC CAACXXXXCC AOTCCCCAQA 780 

AGCACTTGCG GATATGTACC AGCCTCGGTC TCTGOTTGAT AAAQGCAAGC TCAATGCAOG 840 

TTGQCTAQAC TCCTCACGCT CCCTTATGGA ACAAGGCATC CAAGAGGATQ A6CAGCTGCT 900 

CTTACGATTT AAATATTATT CTTTCTTOGA CTTGAATCCT AAATATGATG CTGTCCGAAT 960 

- ^ AAAOCAACTC TATGAGCAAG CCAGGTGGGC CATTCTCTTA GAAGAAATTG ATTGCACAGA 1020 
iV GSAAGAAATO TTGATCTTTG CAGCTCTACA GTACCACATT AGCAAACTGT CGTTGTCTGC 1080 

TGAAA CACAG GATTTTGCAG GCGAGTCCGA GGTTGATQAA ATAGAAOGGG CGCTTTCTAA 1140 

TTTGQAAGTA ACXXTTAGAAG GTGGAAAAGC GGACAGCCTT TTOQAOGACA TTACXOATAT 1200 

CCCTAAACTT GCAGATAATC TCAAATTATT TAGOCCCAAG AAGTTACTAC CAAAA6CTTT 1260 

- . CaW«»ATAT TGGTTTATCr TTAAAGACAC ATCCATAGCA TACTTTAAAA ATAAGQAACT 1320 
ID TGAACAAGGA OAACCACTAG AAAAACTAAA TCTTAOAGGC TGCOAAGTTG TGCCCGATGT 1380 

AAATGTAGCA GGAAGAAAAT TTGGAATCAA OTTACTAATC CCltrrTUOUG ATGGTATGAA 1440 

TGAAATGTAT TTGAGATGTG ACCATGAGAA TOUITACSCC CAAT6GATGG CTGCCTOCAT 1500 

G TTGG CATCg AAGGGCAAAA CCATGGCAGA CA3CTCCTAC CAGCCAOAGG TCCTCAACAT 1560 

CCTTTCATTT CTGAGGATQA AAAACAGGAA CTCTGCATCT CAGGTGGCTT CCAGTCTCOA 1620 

ZU AAACATGGAT ATGAACCCAG AATOTTTTGT OTCACCACGG TGTGCAAAAA GACACAAATC 1680 

CAAACAGCTQ GCX3GCG06GA TCCIGGAGGC GCACCAGAAC GTOGOCXAGA TGCCCCTGGT 1740 

CGAAGCCAAG CTGOGGTTCA TCCAGG0GT6 OCAGTCACTG CCTGA6TTT0 GCCTCAGCTA 1600 

CTACCrrGTC AQATTTAAAO GAAGCAAAAA AGATQACATT CTOQQAQTTT CATATAACAG 1860 

_ _ GTTGATTAAA ATTGAT6CAG CCACaSQGAT TCCAGTGACA AOVTGGAGAT TCACAAATAT 1920 

ZD CAAA CAGTGG AATGTAAACT GGGAAACCCO GCAGGTGOTC ATCGAGTTTG AGCAAAACGT 1980 

CTTTACTGCT TTCACCTGCC TGAGTGCAQA TTGCAAGATT GTGCACGAGT ACATTGGCOG 2040 

CTAC3VTTTTC TT6TCCACCC GCTCCaAGGA CCAGAATGAA ACACTCGATQ AGGACTTGTT 2100 

CCAC3VAATTG ACCGGOGGTC AGGATTGAAA CAAQCA0GC3G TGCTOGGCTC ACACCAACAA 2160 

GQCAAGCCAA AGGCGCCCCT CCCCAQAGGG ATCCCTAACG TGCCCAGCAT GTAGATTCTG 2220 

D\) GACTAACAGA CAACATACAT TCACCGCTGG TCACXXaOAT CCTCATTCAA ACCCACTGCT 2280 

GGCACATCCC TTTCCTTACT TTGCXX:TGT0 CTACCA6CCA 0GGAAG6AGC CTCTCTTGTT 2340 

TTTTCTATAA AATGGGTAGO CAGC3AGAAAA GCAGGTOCXrC TAAGATTQCT CTAAGGCCCA 2400 

GCATGTGGTT ACAGTTCTCT GACTTGCAGA ACCT6CCAGG TGTATGGCTA CAAGTTATCC 2460 

TCGTGCTGAT CTGTCTCATT ACTAAGTCAA TGQAQAAGAC AGAAAGGTAA AAATCACGTG 2520 

JO TAOCAA GAAC AACTCTTATT TCACAAACTC AGGTATQAAA CGAAAOGCCT QTCCTTCATG 2580 

GAACTGCTTT TAGCTCCTGT CTTTTCAAAA TQGCAGAGGG AGTTOCTACA CAC3VCTTTTT 2640 

CXXTOOAOGC CAAG6TCTAG G6GTAGAAA0 GGGAGGGGTG GGGCTACCAO GTAGCAOTTG 2700 

ACAACCCAAG GTCAGAGQAG TGGCCCTCAG TGTCATCTGT CCACAGTGAT ACCTGCCAAQ 2760 

ATOACCACTG ACCCACATCT GGTCTTAGTC ATTOQTCTCC TCAGATTTCT GGGGCCACXn* 2820 

4U GCAAGCCCCA TTCCATTOCT ACAGATCTCT CAGCCACCTG TAAOTCCTTT GTGAAGATOT 2880 

GGGTGACACA gGGGACAGG AAAACCCATT TCTGAACCCA GATCCATGTC TCCACTGCTT 2940 

CTACTCTOOQ TTGGGATTCA G6AAGACAG6 CACAOTCCTC TCTGTTCATA OAAACACCTQ 3000 

CCAGTGTCAA GGATTCCAQT CAGOTGTCTA TCCCAACTGG TCAGGGAGAG AAGGGCAGAC 3060 

CCATTCTCAA AGACCACCAT GTCCAAGGTC TGACAGCTCC CCACTGGCTG CCCCCACAGQ 3120 

HJ GQCTTTAOGC TGGTCTGQGT CATGGGGAAG CGTCXXrrCTT ATCGCTGGTC TOTGTTCTCC 3180 

rrGGATTTGOT ATCTATGTTG GTAOOACTCC TGGCCTTTTA TCTAAAOGAC TTTQ6CTTTT 3240 

GTAAATCACA AGCCAATAAT AOACTTTTTT CTCOOCCTCT GTTTTTTQCT GT O TC A TCTC 3300 

TGCCTTGAGA CTGCCTTGAG ACAGTGCTTG CCTTGAGAGA GTGAGCCAAT TAACAGCTGC 3360 

_^ CrOAATTGTC ATTTTCCATT TTGQTTTGTT AGAGGTGGGA GGGGTGGGTT TTGAGAAGGT 3420 

jU CAAAAGCAAT ACCAGAAGTA AAGGGAAATA TCAGACAATA TTTTATTATT TTTTCATAGA 3480 

TGTTCTGOCA CACAAAGAAC TTGG60TGTA AGGATAAGGC AAAAGCTCCA ATCCCATTTT 3540 

TCAGTTCTCC TAGQATGCA C COCTCAtSGGA GCCTGGCCA6 AGTTCOGAGG CCCGTGAGCO 3600 

TCAGCTGTTG CTTTATTTTC CATCAAAGCX: CTCTGAQAAO TGAGACCTCA GCAATTCOGG 3660 

GAGCCACATA GAGACAGACT TGGCAAGGQA CCCCCTGGTT CTGAGCCAGT AGCTGCCATC 3720 

J J TGGAAATTCX: TCTTTTAGCX: TCTCCTTAOA GOTQAATOTO AATGAAGCCT CCX31GGCACC 3780 

OGCTQAATTT CTGAGGCXHT GCTTAAAGCT CAGAAGT66T TTAGGCATTT GOAAAATCTG 3840 

QTTCACATCA TAAAGAACTT GATTTGAAAT GTTTrCTATA GRAACAAGTG CTAAGTGTAC 3900 

COTATTATAC TTGATGTTGG TCATTTCTCA GTCCTATTTC TCAGTTCTAT TATTTTAGAA 3960 

CCTAOTCAGT TCTTT AAGAT TATAACTGGT CCTACATTAA AATAATGCTT CTOQATGrCA 4020 

OU GATTTTACCT GTTTGCTGCT GAGAACRTCT CTGCCTAATT TACCAAAGCC AGACCTTCAG 4080 

TT CAACATGC TTCCrTAOCT TTTCATA GTT GTCTGACATT TGCATQAAAA CAAAGOAACC 4140 

AACTTTQTTT TAAOCAAACT TTGTTTGGTT ACAOTTTTCA GQGGAGCGTT T C TTCC A TGA 4200 

CACACAGCAA CATCCCAAAO AAATAAACAA GTGTGACAAA AAAAAAAAAA AACAAACCTA 4260 

^ AATGCTACTG TTCCAAAGAG CAACTTGATG QTTTTTTTTA ATACT6AGTG CAAAAGGTCA 4320 

OD CCCAAATTCC TATGATGAAA TTTTAAATTA ATGGGCACCT TTCAACATCA TTTQCTTCCT 4380 

TATCTACAGT T6ATTCAGAA ATCIQCATTT TTTATTCTTT TATATQACTT TTAAGTAAAA 4440 

GATTTATATG GATTTGAAAA AAA7MAAAA A 4471 

Seq ID NOs C27 Protein Sequence 
/U Protein Accession #t NP_005161,1 

1 11 21 31 41 51 

I I 1 I I 1 

MDGGTLPRSA PPAPPVPVGC AARRRPASPE LLRCSRRRRP ATABTGGGAA AVARRNBRER 60 

/O NRVKZiVNU3F QALRQHVPEO GASXKLSKVE TLRSAVEYIR ALQRLLAERD AVRNAIAQGL 120 

RPQAVXtPSAP RGPPGTTFVA A8PSRA88SP ORGOSSBPGS PRSAYSSDDS GCBGAIiSPAB 180 

RBUiDPSSWL GGY I93 

Seq ID KO: C28 DNA Sequence 
oU Nucleic Acid Accession #t NM^017763 
Coding sequencei 169.. 2520 

} 11 21 31 41 51 

I 1 I I I I 
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AAAAAAAAAA, AACTTTAGAO AAAGGAAGOG CCAAAACTAC GACTTGGCTT TCT8AAA0QG 
AAGCHTAAAT GTTCTTTTCC TCOITTTGTC TCGATCTGAO AACCTGCATT TOtSTATTAOC 
TAGT6GAAGC AGTATGTATG GTTGAAOTGC ATTGCTOCAO CTG6TAGCAT GAGTGGTG G C 
CACCAGCTGC AGCTGGCTGC CCTCTGGCCC TGGCTGCTGA TGGCTACCCT GCAGGCAGGC 
TTTOGAOGCA CMSGhCTOGT ACTGGCAGCA GC3GGTGGAGT CTGAAAGATC AGCAGAACAG 
AAAOCTQTTA TCAGAOTGAT CCOCTTOAAA ATOOACCCCA CA06AAAACT GAATCTCACT 
TTGGAACQTG T6TTTGCTG6 TGTTGCTGAA ATAACTCCAG CAGAAG6AAA ATTAATGCAG 
TCCCACCCAC TGTACCTGTG CAAT6CCAGT OATQAOOACA ATCTGGAQCC TGGATTCATC 
AGCATCOTCA AGCTGGA6AG TCCTCGAOGG GCCCCCCGCC CCTGCCTGTC ACTGGCTACC 
AA6GCT0GGA TGGCX3GGT6A GOSAGGAGCC AGTGCrGTCC TCTTTGACAT CACXGAG6AT 
00A6CT6CTQ CTQA6CAOCT GCAGCAGOOS CT6Q GQ C T GA CCT060CAGT G6TGTTGATC 
TGGOGTAATO AOGCTGAGAA GCTGATGQAG TTTGTGTACA AGAACCAAAA GGCGCATGTG 
AGGATTGAGC TGAAGGAGCC CCCGGCCTGG CCAOATTATG ATQTGTGGAT CCTAATGACA 
GTGGTGGGCA CCATCTTTGT QATCATCCTG GCTTOGGTGC TGOGCATCGG GTGCOGCCCC 
GGGCACAGCA GGCOQGATCC 6CTTCAGCA0 AQAACAOCCT GQGCCATCA6 CX»OCTGGGC 
ACGAGOAGOT ACCAGQCCAO CTGCAGGCAG OCCOG G GGTG AOTGGCCA6A CTCAGGQAGC 
AGCTGCAGCT CAGCCCCTGT GTGT6CCATC TQTCTOQAGQ AGTTCTCT6A GGGGCAGGAO 
CTA06GGTCA TTTCCTQCCT CCATGAGTTC CATCGTAACT GTGTGGACCC CTGGTTACAT 
CAGCATOGGA CTTGCCCCCT CTGCGTGTTC AACATCACAO AGGGAGATTC ATTTTCCCAG 
TCC CTGG GAC CCTCTOGATC TTACCAAGAA CCAGGTCGAA OACTCCACCT CATTOGCXaG 
CATCCCGGCC ATGCCCACTA CCACCTCCCT GCTGCCTACC TOTTGQGCCC TTCCOGGAGT 
GCAGTG6CTC G6CCCCCACG ACCTGGTCCC TTCCTGCCAT CCCAQGAGCC AGGCATGGQC 
CCTCX5GCATC ACCGCTTCCC CAGAGCT6CA CATCCCCGGO CTCCAGGAGA GCAGCAGC3GC 
CTGGCAGGAG CCCAGCACCC CTATGCACAA GGCTGGGGAA TGA6CCACCT CCAATCCACC 
TCACAQCAGC CTGCTGCTTG CCCAGTGGCC CTAC6CCGGG CCAGGCCCCC TGACAGCAGT 
GOATCTGGAG AAA6CTATT6 CACAGAAGBC AGT6GGTACC TGGCAGATGG GCCAQCCAGT 
QACTCCAGCT CAGGGCCCTG TCATGQCTCT TCCAOTGACT CTGTGGTCAA CT6CACGGAC 
ATCAGCCTAC AGGGGGTCCA TGGCAGCAGT TCTACTTTCT GCAGCTCCXTT AAGCAGTGAC 
TTTGACCCXX: TAGTGTACTG CAGCCCTAAA GGGGATCCCC AQCQAGTGGA CATGCAGCCT 
AGTGTGACCT CTCGGCCTOQ TTCCTTGGAC TG66TGGTGC CCACAGGGGA AACCCAGGTT 
TCCAGCCATG TCCACTACXIA CCGCCACOQO CAOCAOCACT ACAAAAAGOG QTTCCAOTGa 
CAT66CAGGA AGCXTTGGCCC A6AAAC0GGA GTCCCCCAGT CCAGGCCTCC TATTCCTOGG 
ACACAGCCCC AGCCAGAGCC ACCTTCTCCT GATCAGCAAG TCaCCGQATC CAACTCAQCA 
GCCCCTTOGG GGOGGCTCTC TAACCCACAG TGCCCCAGGG CCCTCCCTGA GCCAGCCCCT 
GGCCCAGTTG ACGCCTCCAG CATCTGCCCC AGTACX^GCA GTCTGTTCAA CTTGCAAAAA 
TCCA6CCTCT CTGCCOGACA COCACAQAGG AAAAGGOOGG GOGGTCCCTC CGAGCXX3^CC 

cctggctctc ggccccagga tgcaactgtg cacccagctt gccagatttt tccccattac 
acccccagtg tggcatatcc ttggtcccca gaggcacacc ccttgatctg tgoacctcca 
ggcxrrggaca agaggctgct accagaaacc cx»ggcccct gttactcaaa ttcacagcxav 
cri'gux ju 'x- l 'ut gcctgactcc tcgccagccx: ctggaaccac atccacctgg ggagqggcct 
tctgaatgga gttctgacac 06caga6ggc aggccatgcc cttatcoqca ctgccaggtg 
CTGTCXjGCCC agcctggctc agaggaggaa ctcgaggagc tgtgtgaaca ggctgtgtga 

6ATGTTCAGG GCTAGCTCCA ACCAAGAGTO TGCTCCAOAT aTOTTTOGOC CCTACCTOGC 
ACASAGTCCT GCTCCTG6GA AAG6AAAG6A CXSVCAGCAAA CACCATTCTT TTTGCOGTAC 
TTCCTAGAAG CACTGGAAGA GGACTGOTQA TGGTGGAGGG TGAGAGGGTG COGTTTCCTG 
CTCCAGCTCC AGACCTTGTC TGCAGAAAAC ATCTGCAGTG CAGCAAATCC ATGTCCAGCC 
AGGCAACCAG CTGCTGCCTG TGGCGTGTGT GGGCTGGATC CCTTGAAGGC TGAGrTTXTG 
AOGGCAGAAA GCTAGCTATG GGTAOCCASG TGTTACAAA6 QTQCTGCTCC TTCTCCAACC 
CCTACXT6GT TTGCCTCACC 0CAA6CCTCA TGTTCATACC AGCCAGTGGG TTCAGCAGAA 
CGCATGACAC CTTATCACCT CCCTCCTTGG GTOAGCTCTO AACACCAQCT TTGGCCCCTC 
CACAGTAAGO CTGCTACATC AGGGGCAACC CTGGCTCTAT CATTTTCCTT TTTTGCCAAA 
AGGACCAGTA GCATAGGTGA GCCCTGAGCA CTAAAAQGAG GGGTCCCTGA A6CTTTCCCA 
CTATAOTGTG QAGTTCTGTC 0CI6A66ZGG GTACAQCAGC CTTGGTTCCT CTGGGGGTTG 
AGAATAAGAA TAGTGGGGAO GGAAAAACTC CTCCTTGAA6 ATTTCCT6TC TCAGAGTCXTC 
AQAGAGGTAO AAAGGAGGAA TTTCTGCTGG ACTTTATCTG GGCAGAGGAA 6GATGGAATG 
AAGGTAGAAA AGGCAGAATT ACAGCTGAGC GGGGACAACA AAGA6TTCTT CTCTGGGAAA 
AGTTTTOTCT TA6AGCAAGG ATGGAAAATG GOQACAACAA AGQAAAAGCA AAGTGTGACC 
CTTGOOTTTG GACA9CCCAQ AiGGCCCAGCT COOCAGTATA AGOCATACAG GCCA6GGACC 
GACAGGAGAG TGGATTA6A0 CACAAGTCTO GOCTCACTGA GTGGAC3U\6A GCIGATaOGC 
CTCATCAGGG TGACATTC31C CCCAGOGOVS CCTGACCACT CTTGGCCCCT CAGGCATTAT 
CCCATTTGGA ATGTGAATGT GGT6GCAAAG TGGGCAGAGG ACCCCACCTG GGAACCTTTT 
TCCCTCAGTT AGTGGGQAOA CTA6CACCTA GGTACCCACA TGGOTATTTA TATCTGAACC 
AGACAGAOGC TTGAATCAG6 CACTATQTTA AOAAATATAT TTATriGCTA ATATATTTAT 
CCACAAAAAA AAAAAAAAAA AA 



Seq ID NO: C29 Protein Sequence 
Protein AcceBsion St NP_004260.2 

1 11 21 31 41 51 

I I I I 1 I 

MNSSAHYnVN FSQAISQDVN LHEAILLCPN NTFRRDPTAR TSQSQEPFLQ LNSHTTNPBQ 
TLPG1NLTSF LSPVDNHMIttl LTSQDLLYDL DINXFDB2NL MSXATBDHFD PIOV8QLFDB 
PDSDSGIiSLD SSHHUTSVIK SNSSHSVOJB GAXGYCTDHB 6SSRHDLBQA VGGYYPBPSK 
LCHLDQSDSD PHQ^LTFQHV PHNHTYHLOP TAPESTSBPP PWPGKSQKIR SRYLEDTDRN 
LSRDBQHAKA LHIPPSVDBI VGMPVDSPNS MLSRYYLTDL QVSLIRDIRR RGKNKVAAQN 
CRXRKLDIIL NLEDDVCNLQ AKKETLXRBQ AQQIXAINIM KQXUIDLYHD IFSRLRDDQG 
RPVNPNHYAL QCTKDGSZLX VPKBLVASQI XXBTQXGKRK 
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I I I I I I 

GCCCOOQGAA GCGCAGCCAT GGCTCT G CGG AG6CT0GGGG CCGOSCTGCT GCTGCTGCOG 60 

CT6CTCGC00 COGTGGAAQA AACXXTTAATG GACTCCACTA CAGCGACTGC TGAGCTGGGC 120 

TGGATGGTGC ATCCTCCATC AGGGTGGGAA GAGGTGAGTQ GCTACGATGA GAACATGAAC 180 

AOGAT CCGC A 0GTACCAG6T GT6CAAGGTG TTTGAGTCAA 6CCAGAACAA CTGGCTAOGQ 240 

ACCAAGTTTA TCCG6O6C0G TG60GCCCAC OGGATOCHOG TGQASATQAA GmTOGOTO 300 

OGTGACTGCA GCAG CATC CC CAGOGTGCCT GGCTCCTGCA AGGAOACCTT CAACCTCTAT 360 

TACTATGAGG CTGACTTTGA CTOGQCCACC AA6ACCTTCC CCAACTCGAT GGAGAATCCA 420 

TGGGTGAAGG TGGATACXaVT TQCAGCCXSAC GACAGCTTCT CCCA6GT0GA CCTG GG TGOC 480 

lU OG CGTCA TQA AAATCAACAC OQAGGTGOGG A6CTT006AC C r GTGTOCO G CAGCG6CTTC 540 

TACCIGGCCT TCCA66ACTA TGGGGGCTGC ATGTCOCTCA TGGOGGTQGQ TGTCTTCTAC 600 

OSCAAGTOCC CCCGCATCAT CCAGAATGGC GCCaTCTTCC AGGAAA0CX:T GTC6GGGGCT 660 

GAGAGCACAT CGCTGGTGGC TQCCCGGGGC AGCTGCATOG CCAATQOGGA AGAGOTGGAT 720 

GTACCCATCA AGCTCTACTQ TAACXX3GGAC GGOGAGTGGC TGGTGCCCAT CGGGCX3CTGC 780 

ATGTGCAAAQ CAGGCTTOQA GGCOOTTQAG AATG6CAC0Q TCTGCOGAGG TT6TCCATCT 840 

GGGACTTTCA AG6CCAACCA AGGGQATGAO 60CT6TACCC ACTQTCCCAT CAACAGCCGQ 900 

ACCACTTCTO AA6GGGCCAC CAACTGTOTC TGCOGGKATG GCTACTACAQ AGCAGACCTG 960 

GACCCCCTGG ACATGCCCTG CACAACCATC CCCTCCGOOC CCCAQOCTOT GATTTCCAGT 1020 

GTCAATGAGA CCTCCCTCAT GCTGGAGTGG ACCCCTCCCC GOQACTCOGG AGGCCGAGAG 1080 

2U GACCTOQTCT ACAACATCAT CTGCAAGAGC TGTQQCTOGG GCOGGGGTGC CTGCACCCGC 1140 

TG06G0QACA ATGTACAlGTA CGCACCA06C CAGCTA06CC TGAC06A6CC AOOCATTTAC 1200 

ATCA6TGACC TOCTGGCCCA CACCCAGTAC ACCTTC6AGA TCCAGGCTGT GAACGG06TT 1260 

ACTGACCAGA GCCCXTTCTrC GCCTCAGTTC GCCTCTGTGA ACATCACCAC CAACCAGGCA 1320 

QCTCCATCGG CAQTGTCCAT CATGCATCAG GTGAGCCGCA CCGTGGACAG CATTACCCTG 1380 

2 J TCGTG6TCCC AGCCAGACCA GCCCAATOGC OTGATCCTGG ACTATaAGCT GCAGTACTAT 1440 

GAOAAGQACC TCAOTGAGTA CAACGCCACA GCCATAftAAA 6CCCX:aCCAA CACX3GTCACC IS 00 

GTGCAGGGCC TCAAA6CCGG OGCCATCTAT OTCTTCCAGG TGCX3GGCA0Q CAC06TGGCA 1560 

6GCTA0GGGC GCTACAG06G CAAGATGTAC TTCCAGACCA TGACAGAAGC CGAGTACCAG 1620 

ACAAQCATCC AQQAGAAGTT OCCACTCATC ATCGGCTCCT CGGCCGCTGG CCTGGTCTTC 1680 

CTCATTQCTG TGQTTGTCAT CGCCATCGTO T6TAACAGAA GACGGGGGTT TGAGCX3TGCT 1740 

GACTGGQAGT ACAOOGACAA OCTGCAACAC TACACCA6TG GCCACATGAC CCCA6GCATG 1800 

AAGATCTACA TCGATCCTTT CACCTAC6A0 GACCCCAACG AGGCAGTQOQ GGAOTTTGCC 1860 

AAGGAAATTG ACATCTCCTG TGTCAAAATT GAGCAG6TGA TCGGAGCAOG GGAGTTTGGC 1920 

GAGGTCTGCA QTGGCCACCT GAAGCTGCCA G6CAA6AGAG AGATCTTTGT GGCCATCAAG 1980 

AC36CTCAAGT CGGGCTACAC GQAQAAGCAG CGCCGGGACT TCCTQAGCXSA AGCCTCCATC 2040 

ATGGGCCAOT TCQACCATCC CAACGTCATC CACCTGGAGG GTGTCGTGAC CAAGAGCACA 2100 

CCTGTGATGA TCATCACCGA GTTCATGGAG AATGGCTCCC TGGACTCCTT TCTCOGGCAA 2160 

AAC3QATGGGC AGTTCACAGT CATCCAGCTG GTGGGCATGC TTCGGGGCAT CGCAGCTGGC 2220 

ATOAAGTACC TGQCAGACAT GAACTATGTT CACCGT6ACC TQQCTGCXXX3 CAACATCCTC 2280 

QTCAACAGCA ACCTGGTCTG CAAGGTGTCX3 GACTTTGGGC TCTCACGCTT TCTAGAGGAC 2340 

OATACCTCAG ACCCCACCTA CACCAGTGCC CTGGGOGGAA AOATCX;CXyVT CCGCTGGACA 2400 

GCCG06GAAG CCATCCAGTA CCGGAAGTTC ACCTCGGCCA GTGATGTGTG GAOCTACGGC 2460 

ATTGTCATOT GGGAGGT6AT GTCCTATGGO GAGC6GCCCT ACTGGGACAT GACCAAOCAG 2520 

GATGTAATCA ATGCCATTGA GCAGGACTAT CX3GCTGCCAC CGCCCATGQA CTGCCOOAGC 2580 

4^ GCCCTOCACC AACTCATGCT GGACTGTTGQ CAQAAGQACC GCAACCACCG GCCCAAGTTC 2640 

GGCCAAATTG TCAACAC3GCT AGACAAGATG ATCCQCAATC CCAACAOCCT CAAAGCCATG 2700 

GCX3CCCCTCT CCTCTGGCAT CAACCTGCCG CTGCTGGACC GCAOGATCCC OGACTACACC 2760 

AGCTTTAACA CGGT6GACQA QTGGCTGGAG GCCATCAAOA TGGGGGAOTA CAAGGAGA6C 3820 

TTCGCCAATG CCGGCTTCAC CTCCTTTGAC GTCGTGTCTC AGATQATGAT GQAGGACATT 2860 

CTCOGGGTTO GGCTCACTTT GQCTGGCCAC CAGAAAAAAA TCCTGAACAG TATCCAGGTG 2940 

ATGCGGGOGC AGATGAACX3V GATTCAGTCT GTGGAGGTTT OACATTCACC TOCCTOGGCT 3000 

CACCTCTTCC TCX^AGCCCC GCCCCCTCTG CCCCACGT6C CGGCCCTCCT GGTGCTCTAT 3060 

CCACTGCAGG GCCAGCCACT C?GCCAGGAGQ CCAOGGGCCA GGGOAAGAAC CAAOCQOTQC 3120 

. . CAGCX:AOaAG ACGTCACCAA GAAAACATGC AACICAAACG AOGQAAAAAA AAAG66AAT0 3180 

J J GGAAAAAAGA AAACAQATCC TGGGAGGGGG CX3GQAAATAC AA6GAATATT TTTTAAAGAG 3240 

GATTCTCATA AGGAAAGCAA TGACTGTTCT TGOGGGGGAT AAAAAA06GC TTGGGA6ATT 3300 

CATGCGATQT GTCCAATOGO AGACAAAAGC AOTTTCTCTC CAACTCCCTC TGG6AAGGTG 3360 

ACCTGGCCAG AGCCAAGAAA CACTTTCAGA AAAACAAATG TQAAOQaaAO AGACAGGGaC 3420 

CGCCCrPGGC TCCTGTCCCT GCTOCTCCTC TAGGOCTCAC TCAACAACCA AGOGCTTGQA 3480 

GGACGGGACA GATGGAC3«5A CAGCCACCCT GAGAACOCXTT CTGGGAAAAT CTATTCCTGC 3540 

CACCACTGGG CAAACAGAAG AATTTTTCTG TCTTTGGAGA GTATTTTAGA AACTCCAATG 3600 

AAAGACACTG TTtCTCCTGT TGGCTCACAO GOCTOAAAGG GGCTTTTGTC CTCCTGGGTC 3660 

AGGGAGAACG GGGGGACCCC AOAAAGGTCA GCCTTOCTGA GGATGGGCAA CXXXAGGTCT 3720 

GCAGCTCCAG GTACATATCA C3GC6CACAGC CXGOCRGOCT GGCCCTCCTO OTGCCXACTC 3780 

CCGCCAQCCC CTGCCTCGAQ GACTGATACT GCAGTGACTG COGTCAGCTC <3QACTGCCGC 3840 

TGAGAAGGGT TGATCCTGCA TCTGOOTTTO TTTACAGCAA TTCCTGGACT OGGGGGTATT 3900 

TTGGTCACAG GGTGGTTTTO OTTTAGGGGO ' i " i"m ' mm '' i ' GGGTTGTTTT TTOTTTTTTa 3960 

GTTTTTTTTA ATOACAATGA AOTGACACTT TGACATTTCC TAGCTTTTQA G6ACTTGATC 4020 

_^ CTTCTCCAGG AAGAAOGTGC TTTCTGCTTA CTQACTTAGQ CAATACACCA AGOGGQANAT 4080 

/U TTTATATOCA CATTTCTGGA TTTTTTTATA OGGTTTTCAT TGACACTCTT CCCTCCTCCC 4140 

ACCTGCCACC AGGCCTCACC AAAGCXX31CT GCCATGGGGC CATCTGGGCC ATTCAGAQAC 4200 

TG6AGT0AGA TnGGGlCTG GAGGGGQAGG CGCCAAGGTG GAGGAGCTTC CCACTCCAGO 4260 

ACTGTTGATG AAAGGGAGAG ATTQAGGA6G AAOIGGGCTC TGAGGCTGCA GGGCT6QAA6 4320 

TC CTTG CCCA CTTCCCACTC TOCTGOOCCA ATCTATCTAG TACTTCGCA6 GCAAATAGGC 4380 

CCCTTTGAGQ CTCCTGAQTG CXXTTCAGATG GTCAAAACCC AGTTTTCCCT CTGGGAGCCT 4440 

AAACCAGGCT GCAT066AGG CCAGGACCCQ GATCATTCAC TOTGATACCC TQCCCTCCAQ 4500 

A6GGTX3CGCT CAOAGACACO GGCAAGCATO CCTCTTCCCT TCCCTGGAGA GAAAGTGTGT 4560 

GATTTCTCTC CCACXTTCCTT COCCCXSkOCA GACCTTTQCT GGGCCTAAAG GTCTTGGCCA 4620 

T0GGGACX3CC CTCAGTCTAO GGATCTGGCC AC3\QACT0CC TCCTGTGAAC CAAGAGAOAC 4680 

ACCCAAGCAG AGCAATCAGT TAOTGAATrQ GAATTCCCX3V AGTCTTTGCT ATTGTGAATA 4740 

OTGCTGCAAT AAACATAOGT GTGCATOTGT CTTTATAOTA GAAT6ATCTA TAATCCTCTO 4800 

GGTAT6TACC CAQTAATGGG ATTGCTGGGT CAAATGGTTT TTCTGGTTCT AGATCCrTGA 4860 

QGAATTGCCA CACTGTCTTT CACAATGGTT OAACTAATTT ACACTGCTAC CAACAOTGTA 4920 

AAAGTCTTOC ■A H m' C TOC A CATOCTCTOC AGGATCTOTT OTTTOCTGAC TTTTTAATOA 4980 
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10 



15 
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TTGOCATTCT AACTG8TQT0 AOATCOTATC TGATTXirOOG TTTGATTTGC ATTTCTCTAA 5040 

T6ACCAGTGA AGATGAGCTT TTTTTCATAT OTTTGTTQGC GACAT6TTT0 TTGTTTCTTT 5100 

TGAOAAGTGT CTGTTCATAA CCTTCACCAA TTTTTGATGA GGTTGTrTGT TCTTTTCTTQ S160 

TAAACTTAAT GAAATAAAGC ATGAAGACAA GATTAGAAQA AAAAGAATGA AAAGGAACAA 5220 

ACAAAGCGTC CAA6AAATAT GGGACTAT6T 6ACAAGAACA AACTTAOGTT TGACT GG T G T 5280 

GCTGAAWVTG AOVGOQAOAA TQMUVCQIAG TTGQAAAACA CTCTTOIGGA TATTATCCAO 5340 

6AQAACTTCT CCAACCTAGC AA8ACA6ACC AAOGTTCAAA TTCAGGAAAT ACAGAGAACA 5400 

CCCAAAGATA TTTCTOGAOA AQAQCAACCC GAAQACACAT AATTGTCAGA TTCACCAAGG 5460 

TTGAAAT6AA GGAAAAAATG CTAAGGOCAO CCAGAQAGAA AGGTCA6GTT ACTCACAAAA 5520 

GQAAOCCCAT CAGACTAACA GCAGATCTCT CTGCAGAAAC CCTACAACCT AGAA6A6AGT S5B0 

GGGGACCAAT ATTCAACATT CACAAAOAAA A6AATTTTCA ATCCAGAATT TCATATCCAG 5640 

CCAA ACTA AQ CTTCATAAGC AAAGQAiGAAA TAAAATCCTT TACAGACAAO CAAATCCTG6 5700 

GAGATTTTGT CACCACCAGQ CCTGCCTTAC AAGACATCCT OAJUXSAAGCA CTAAATATGG 5760 

AAAAGAAAAA CTGGTGCCAO CCACTGCAAA AAATACCAAA TTGTAGAGAC CATTGACACT 5820 

ATGAAGAAAC GGT6TCAACT AATGG6CAAA ATAACCAGCT AGTATCATAA TGACAGGATC 5880 

AOATTCACAC ATAACAATAT TAACCTTATA TCTAAATGGO CTAAATCCCC CAATTAAAAG 5940 

AOOCAGACTO 6CAAATT0GT TAAAGAGTCA AGACTCATTG OTGT GC CGTA TTCAOGAGAC 6000 

CCATCTCAOQ TGCAAAGACA CACATAGQGT CAGAGTAAAA GGGATACAGG GGAATTC 6057 

20 Seq ID NO: C31 DNA Sequence 

Nucleic Acid Accesaicm fii NN_031942.1 
Coding sequences 145.. 1260 ~ 

^^1 11 21 31 41 51 

25 I I I I I I 

aXGAGOCGC GCCCCTCG6G GCCCX3GGTC8 6CX3C6CCCAG CCTGCCAGCC GCGCTOCTQG 60 

T6CTCCTCCT GCTGTGOGAC 0GCT6ACCGC GOGGCTGCTC CGCTCTCCCC GCTCXSUVGOG 120 

CCGATCTGGO CACXX3GCCAC CAGCATGGAC GCTCGCCGCG TGCOQCAOAA AGATCTCAOA 180 

GTAAAGAAGA ACTTAAAGAA ATTCAGATAT GTGAAGTTGA TTTCCATGGA AACCTOGTCA 240 

TCCTCTGATG ACAGTTGTGA CAGCTTTGCT TCTGATAATT TtGCAAACAC GAGGCTGCAG 300 

TCAGTTOGGG AAOGCTGTAO GACGOGCAGC CAGTGCA6GC ACTCTGGACC TCTCAGGGTG 360 

6CGATGAAGT TTCCAGOGCG GAGTACCAGG GGAGCAACCA ACAAAAAAGC AOAGTCCCGC 420 

CAGCCCTCAG AGAATTCTGT GACTGATTCC AACTCCGATT CAGAAGATGA AAGTGGAATG 480 

AATTTTTTGG AGAAAA66GC TTTAAATATA AAGCAAAACA AAGCAATGCT T6CAAAACTC 540 

35 ATGTCTGAAT TAGAAAGCTT CCCTGGCTCG TTCCX3TGGAA GACATCCCCT CCX^GGCTCC 600 

GACTCACAAT CAAGGAGACC GCGAAGGCGT ACATTCCXXSG GTGTTGCTTC CAGGAGAAAC 660 

CCTGAACX5GA QAGCTCGTCC TCTTACCAGG TCAAGGTCCC GGATCCTOGG GTCCXmXSAC 720 

GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAO AAAGAGGAAG 780 

ACXX3TGGATG GCTACATGAA TGAAQATGAC CTOCCXSVOAA GCCXTTCGCTC CAGATCATCC 840 

GTGACCCTTC CGCATATAAT TCX5CCCAGTG GAAGAAATTA CAGAGGAGQA OTTGGAGAAC 900 

GTCTGCAGCA ATTCTCGAOA QAAQATATAT AACOGTTCAC TGGGCTCTAC TTGTCATCAA 960 

TGCOGTCAGA AGACTATTGA TACCAAAACA AACTQCAGAA ACCC^QACTG CTGGGGCGTT 1020 

GGAGGCCAGT TCT6TGGCXX CTGCCTTCGA AACXX3TTAT6 GTGAAGAGGT CS^GGGATGCT 1080 

CTGCTGGATC CGAACTGGCA TTGCCCGCCT TQTOGAGGAA TCTGCAACTG CAOTTTCTGC 1140 

OGGCAGCQAG ATGGAOGGTG TGCGACTGGG GTCCTTGTQT ATTTAGCCAA ATATCATG6C 1200 

TTTGGGAATG TGCATOCCTA CTTGAAAAGC CTGAAACAGG AATTTGAAAT GCAAGCATAA 1260 

TATCTGGAAA ATTTGCTGCC TGCCTTCTAC TTCTCAAATC TTTCTTGTAA AAGTTTCCAA 1320 

TTTTTTCACT GAAACCTQAO TTAAAAATCT TOATQATCAG OCT G TTTCAT AAGAAACTCC 1380 

AATCAAGTTA ATCTTA6CAO ACAT6TGTTT CTGOAGCATC ACAQAAGGTA TATT6CTAGT 1440 

TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 1500 

TCTATTTCXa\ ATGCTCCTCT 0CAACCX5CTT AOTTTCTGAA TTTCTTTTAA ATTACA6TTT 1560 

TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTQ 1620 

GAA ACAC AAT AATAGTATTA ACTAACTAOA TCIATTGAAT TTCAGAGAAG AGGCTTCTAA 1680 

CTTGTTTACA CAAAAAOSAG TATGATTTAG CS^CTCATACT AGTTGAAATT TTTAATA6AA 1740 

TCAAGGCACA AAAGTCTTAA AACGATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTO TCACAGTTGA GACTTAATTT 1860 

CTCCTAATTT CTTCTGCCOO AAGGGTAA6T GGTGOSTCCA GCTTACAOGA TCATAATTCA 1920 

AAGGTTGOTO 6GCAATGTAA TACTXAATTA AAATAATOAT GQAAGA6CTA TCTGGAQATT 1980 

ATGAGTAAGC TQATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAO TTTGOUUUST 2040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCA G TTTCTT GTAAATTTAT AAACCAGOCA 2160 

CAAGGTTCAA GTTTAQATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 2220 

GTAACTTTTA GCAGTTTGTT AACCT6ACAT CTCTGCCAGT CTAOTTTCTG OGCAGOTtTC 2280 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTOGTAGA GGIGGAATCT 2340 

AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTOTGCCA TTCAATOTTT 2400 

QATGCATAAT TGGACXTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCWT 2460 

TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 2510 
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Coding eequencet 276.. 1271 

1 11 21 31 41 51- 
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GCAOGAGGGA AGA6GGTGAT CCGACCCGGG GAAGGTCQCT GGGCAGGGOG AGTrCGGAAA 60 

GCGGCAGCCC CCGCCOCCCC CGCAGCCCCT TCTCCTCCTr TCTCCCAOGT CCTATCTGCC 120 

TCrOSCTGGA GGGCAGGCOG TGCAGCATCG AAGACAGGAG GAACTGGAGC CTCATTGGCC 180 

GGCCCX3GGGC GCOGGCCTOG QGCXTAAATA GGAGCTGOGQ GCTCTGGCTG 66ACC0GACC 240 

GCTGCCGGCC GCGCTCCCGC TOCTCCTGCC G GG TQATGGA AAACCCCAGC CCGGOOGCCG 300 

CCCTGGGCAA GGCCCTCTGC GCTCTCCTCC TGGCCACTCT OGGOGCCGCC GQCCAGCCTC 360 

TTGGGGGAQA GTCCATCTGT TCCGCCAQAQ CCCOGGCCAA ATACAGCATC ACCTTCACGG 420 

GCAAGTG6AG CCAGAOGGCC TTCCCCAAGC AOTACCCCCT GTTCCGCCCC CCFGCGCAGT 480 

GGTCTT06CT GCTGGGGGCC G06CATAGCT CCGACTACAG CATGTGGAGO AAOAACCAGT 540 

AGGTCAOTAA CGGGCIGCSC GACTTTGOQG AG06CG6CGA G6CCTGGGG0 CTOATGAAGG 600 
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AGATCGAGGC GGCGGGGGAG GCX5CTGCAGA 60GT6CAO3C G GTG ' m TC G G0C3CC0GCCG 660 

TCCCCAGCGG CACCGGGCAG AGGTOGGGGG AGCTGGAGGT GCAGG6CAG0 CACTCX3CTGG 720 

TCTOGTTTGT GGTGCG CATC GTGCCCAGCC COGACTGGTT CX3T GGG CG TG GACAGCCTGG 780 

5ACCTGTGCGA OGGGGACCGT TGGOGGGAAC AGGOGGCXSCT GOACCTOTAC CCXTTACXSAOG 840 

CCGGGACGGA CAQCGGCTTC ACCTTCTCCT CCCCCAACTT CGCGACCATC COGCAGGACA 900 

CGGTGACCGA GATAAOGTCC TOCTCTCCCA G0CACCC6GC CAACTCCTTC TACTACCGGC 960 

GGCTGAAGGC CCTGCCTCCC ATGGCCAGG6 T6ACACTGGT GOQ Q CTGOGA CAGAGCCCCA 1020 

GGGCCTTCAT CCCTCCCQCC CCAGTCCTGC CCAGCAGGGA CAATGAGATT GTAGACAGCG 1080 

^ CCTCAGTTCC AGAAACGCCX3 CTGGACTGCG AGGTCTCCCT GTGGTOC?rCC TGGGGACTGT 1140 

lU GCGQAGGCXa^ CTGTGGGAGG CTOGGGACCA AGAGCAG6AC TCGCTACGTC CGGGTCCAGC 1200 

COGCCAACAA 0GGGA6CCCC TGCOCOQAGC T0GAA6AAGA GGCTQAGTGC GTOCCTGATA 1260 

ACTGOGTCTA AGACCAGAGC COOGCAGOOC CTGG G GCOX' OQGAOCCATO GGGTGTCGGG 1320 

GGCrCCTGTQ CAGGCTCATG CTGCAGGOGG COGAGGCACA GGGOGTTTCG OGCTGCTCCT 1380 

GAC OGOGGT G AGGCOGCGCX: GACCATCTCT OCACTGAAGO GCCCTCTGGT GGCCX3GCACG 144 0 

ID GGGATTGGGA AACAGCCTCC TCCTTTCOCA ACCTTGCTTC TTAG G G G OCC CCGl'Cn' C ' C XXJ 1500 

TCTQCTCTCA GCCTCCTOCT CCTOCAGQAT AAAOTCATCC CCAAGGCTOC AQCTACTCTA 1560 

AATTATGGTC TCCTTATAAG TTATTGCTGC TCCACGAGAT TOTCCTTCAT OOTCCAGOGQ 1620 

CCTGGCTCCC AOGTGGTTGC AGATACCTCA GACCTGGTGC TCTAGGCTGT GCTGAGCCCA 1680 

CTCrCCCGAG GGOGCATCCA AGCGGGGGCC ACTTGAGAAG TGAATAAATG GGQOGGTTTC 1740 

ZV GGAAGCGTCA GTGTTTCCAT GTTATGGATC TCTCTGOGTT TGAATAAAQA CTATCTCTGT 1800 

TQCTCAC 1907 

Seq ID NO: C33 DNA Sequence 
Nucleic Acid Accession 8: Eos sequence 
ZD Coding sequence: 1..1314 

1 11 21 31 41 51 

I 1 I 1 I I 

ATGTT ACAG Q ATCCTQACAG TGATCAACCT CTQAACAGCC TCQATGTCAA ACCCCTGOJC 60 

J\) AAACC COGTA TCCCCATGGA GACCTTCAGA AAOOTOGGGA TCCCCATCAT CATAGCACTA 120 

CTGA6CCTG0 OGAGTATCAT CATTGTGGTT GTCCTCATCA AGOTGATTCT GGATAAATAC 180 

TACTTCCTCT G OGGG CAGCC TCTCCACTTC ATCCCQAGGA AGCAGCTGTO TGAOGGAGAG 240 

CTGGACTGTC CCTTQGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CX3AAGGGCCT 300 

GCAGTGGCAG TCOGCCTCTC CAAGGACOGA TCCAC3VCTGC AGGTGCTGQA CTCGGCXACA 360 

J J OGGAACTGGT TCTCTGCCTG TTTOGACAAC TTCACAGAAG CTCTCX3CTGA GACAGCCTGT 420 

AGGCA6ATGG GCTACAQCAG CAAAGCCACT TTCAGAGCTG TGGAGATTGG CCX3VBACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TT06CATG0G GAACTCAAGT 540 

GQGCCCTQTC TCTCAGGCTC CCTGGTCTCC CTGGACTGTC TTGCCTOTGQ GAAGAQCCTG 600 

AAGACCCCCC GTGTGGTGGG TOGGGAGGAG GCCTCTOTGG ATTCTTGGCC TTGGCAGGTC 660 

4U AOCATCCAGT ACGACAAACA GCAC6TCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGOCAG CC CACTG CTT CAOOAAACAT ACOGATGTGT TCAACTGGAA GGTGOGGGCA 780 

OGCTCAQACA AACT6GGCAG CTTCCCATCC CTGGCTGTQG CCAAOATCAT CATCATTGAA 840 

T TCAA CCCCA TGTACCOCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTQCCCTTCT TTGATGAGGA GCTCACTCCA 960 

4D GCCACCCCAC TCT6GATCAT TGGATGGGGC TTTAOGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AQTCCAGGTC ATTGACAGCA CACGGT6CAA TGCAGACGAT 1080 

OGGTACCAGG GGGAAGTCAC CGA6AAGATG ATGTGTGCAG 6CATCC0GGA AGGGGGTGTG 1140 

GACACCTGCC AGGOTGACAO TOOTGGGCCC CTGATGTACC AATCT6ACCA GTGGCATGTQ 1200 

GTGGGCATCO TTAGCTGGGG CTATGGCTGC GGGGGCCCQA 6CACCCCAGG AGTATACACC 1260 

DU AAGOTCTCAO CCTATCTCAA CTG6ATCTAC AATGTCTGQA AGGCTGAGCT OTAA 1314 



55 



Seq ZD NOs C34 DHA Sequence 

Nucleic Acid Accession #t 1IH_003045.1 

Coding sequencei 148.. 2037 " 



1 11 2X 31 41 51 

I I I I I I ' 

OGATCCTGCC GGAGCCCOGC OGCCGCCGGC TTGGATTCTC AAACCTTCCT TGTATCCCTC 60 

CTGAGACATC TTTOCTGCAA GATOGAGGCT GTCCTCTGGT GAGAAGGTGG TOAGGCTTCC 120 

OU COTCATATTC CAGCTCTGAA CAGCAACATG GGGTGCAAAO TCCTGCTCAA CATTGGGCAG 180 

CAGAT6CT6C GG0G6AAGGT 0GTG6ACTGT AGCOGGGAGO AGAOGCGGCT GTCTCGCXGC 240 

CTGAACACTT TTGATCTOGT 0GCCCT06GG GTGGQCAGCA CACTGGGTGC TGGT6TCTAC 300 

OTCCTGGCTG GAGCTGTGGC COGTQAGAAT GCAGGCCCTO CCATTGTCAT CTCCTTCCTG 360 

^ ATOGCTGOQC TGGCCTCAGT GCTGGCTGGC CTGTGCTATG GCGAGTTTGG TGCTCGGGTC 420 

OD CCCAAQAC56G GCTCAGCTTA CCTCTACAGC TATGTCACOO TTOGAGAGCT CTGGGCCTTC 480 

ATCACOGGCT GQAA CTTAAT CCTCTCCTAC ATCATOGOTA CTTCAAGOGT AQCGAGGGCC 540 

TQOAGCGOCA CCTTOGAOGA GCTGATAGGC A0ACCCAT06 OOQAOTTCTC ACGGACACAC 600 

ATGACTCTGA ACGCCCCCGQ CGTGCTGGCT GAAAACCCOG ACATATTOGC AOTGATCATA 660 

ATTCTCATCT TGACAGGACT TTTAACTCTT GGTGTGAAAG AGTOGGCCAT GGTCAACAAA 720 

/V ATATTCACTT GTATTAAOGT CCTGGTCCTQ GGCTTCATAA TGGTGTCAGG ATTTGTGAAA 780 

GGATCGGTTA AAAACTGGGA 0CTCAC6GAG GAGGATTTTG GGAACACATC AGGCCGTCTC 840 

TOi A i^AACA ATGACACAAA AGAAGGGAAO CCOGGTOTTG OTGGATTCAT GCCCTTOGGG 900 

TTCTCTGOTQ TCCTGTOGGG GGCAGOGACT TGCTTCTATQ CCTTCGTGGG CTTTQACTGC 960 

ATCGCCACCA CAGGTGAAGA GQTQAAGAAC CCACAGAAGG CCATCCCCGT GGGGATOGTG 1020 

/D GCO TCCC TCT TGATCTGCTT CATOGCCTAC TTTGGGGTGT CG6CTGCCCT CA06CTCATG 1080 

ATGCC CTACT TCTGCCIGGA CAATAACAGC CCCCTGCCOG AOGCCTTTAA GCAOGTGGGC 1140 

TQGQAAG8TO CCAA6TA06C AGTGG0CGT6 GGCTCCCTCT G03CTCTTTC OGCCAGTCTT 1200 

CTAGGTTCCA TGTTTCCCAT GCCTCGGGTT ATCTATGCCA TGGCTGAGGA TGGACTGCTA 1260 

TTTAAATTCT TAGCCAACGT CAATGATAGG ACCAAAACAC CAATAATOGC CACATTAGCC 1320 

OU T0GGGT600G TTGCTGCTGT GAT6GCCTTC CTCTTTGACC TCAAQGACTT GGTGGACCTC 1380 

ATGTCCATTO GACTCTCCT GGCTTACTOQ TTGGTGOCTO CCTGTQTGTT GQTCTTAOGO 1440 

TACCAGCCftO AGCA6CCTAA CCTGGTATAC CA6ATG6CCA GTACTTCOGA OQAGTTAGAT 1500 

OCAGCAGACC AAA ATGA ATT GGCAAGCACC AATQATTCCC AGCTGGQGTT TTTAGCAGAG 1560 

GCAOAGATGT TCTCTTTGAA AACCATACTC TCACCCAAAA ACATGQAGCC TTCGAAAATC 1620 

1218 
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5 

10 

15 

20 

25 
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TCTGGGCTAA 
ATTGTGACOG 
CTCX5CAGGGT 
AGCAAGACCA 
TTCQTGAAOG 
TGGATGCraV 
TCCCZGCATO 
ACAQCCC06C 
CACCCCACCC 



TTGTGAAOlT 
T6CTTGGAAG 
CTGCCCTCCT 
AGCTCTCATT 
TCTATCTCAT 
TAQGCTTOVT 
CCQAOCAA GC 
GCCOOQGAGO 
TCOGCACCAO 



TTCAACCftGC 
GGAGGCTCrC 
CTOTGCCOT6 
TAAGGTTCCC 
GATGCft OCTO 
CATCTACTTT 
AAGGACTCCT 
TQ0CAGCA6C 
T6CAACAGAA 



CTTATAGCTG 
ACOVAAGGGO 
GTCAOOGGCG 
TTCCTGCCAG 
GACCAGGGCA 
OGCTATGGCC 
GAGG6CAACT 
CCGQAG6QAC 
ACCACCT600 



TTCTCATGAT 
OGCTGTGOGC 
TCATCTGQAG 
TGCTCCCCAT 
CCtGGGTCOQ 
TGTGGCACAG 
TGGACCAOTG 
OOCCCC RGAq 
TGCACACCCr 



CACCTTCTGC 
AGTCTTTCTG 
GCAGCCOQAG 
CCTGAGOVTC 
GTTTGCTGTG 
0GAGGAGG06 
CA AGTO AOOC 

CACTGCA 



8eq 10 NOi C35 DMA Sequence 

Nucleic Acid Accession <}i NM_002776.1 

Coding sequence i 82.. 912 



1 
1 

ACCAGOGGCA 
G60GACTCCC 
GCCCGOQCrC 
GCGCTQCTCC 
OGCOGCrCGC 
GTCCTGGTGO 
GCTCGAGTAO 
CGCTCTQTTO 
GATGAGCA06 
CX3GGCCCTGC 
TGGGGCACCA 
ACTATCCTQA 
ATATGTGCTG 
GTCT6TGACG 
CAGCATCCAG 
C6CTCCAACT 
CAGATGCCCA 
TGTCTGCACT 
CATTCCCCCA 
CAAAGGTTTA 
CT6GGGTCAC 
AGTQCCCTCT 
TCrrAOACAT 
OTCATGTAAG 
AAAAAAAAAA 



11 

1 

GACCACA6GC 
AGATCCTGGC 
TGGCGAAGCT 
CCCAAAAOQA 
AGCCCTGGCA 
ACCAGA6TTG 
GGGATGATCA 
TCCATCCCAA 
ATCTCATGTT 
AGCTTCCCTA 
0G6C06CC0S 
GCCCTAAAGA 
GACTGGACCG 
AGACCCTCCA 
CTGTCTACAC 
GATGC31GATG 
GAGGCTCCAT 
GTTCAAACCT 
CCTATCCCCA 
TTCCAGAGAA 
CCAACCTGAC 
CTGAAGCTCA 
GTTG7GAGGA 
GCTTAACACA 



21 
I 

AGG6CAGAGG 
CATGAGAGCT 
GCTGCCGCTG 
CAOGOQCTTG 
GGTCT06CTC 
GOTGCTGACO 
CCTGCTX3CTT 
GTACCACCAG 
GCTAAAGCTG 
COGCTQTGCT 
GAGAGTGAAO 
GTGTGAGGTC 
GGGCCAGGAC 
AGGCATCCTC 
CCAGATCT6C 
CTAC3GCTCCA 
CQTCXa^TCCT 
CTGCCXSCCCT 
TTCTCT6CCT 
6CCAG6AAGC 
TTCCTCTGCC 
GTTTCCTCAT 
GACTATGATA 
GTGQGT6GTG 



31 
I 

CAOQTCTGGG 
CCGCACCTCC 
CTGATGQCGC 
GACCCOGAAG 
TTGAACGGCC 
QCG006CACT 
CTTCAGGGCG 
06CTCA6GCC 
GCX aGGC CCXS 
CAGCC0G6AG 
TACAACAAGG 
TTCTACCCTG 
CCTTGCCAQA 
TCGTGGGOTG 
AAATACATGT 
GCTGATCCAG 
CTTCCTCCCC 
CCACACCTCT 
GTACTGAAGC 
OQGTCATCAC 
ACTCCCOGCT 
CTGCAAAATG 
TAACATGTGT 
AGTTCTGACT 



41 

I 

TCCCCTCCCT 
ACCTCTCCGC 
AACrCTGGGC 
0CTATGG06C 
TCTOGTTCCA 
GC9GGAAACAA 
AGCA6CTC06 
CCATCCTOCC 
TAGTGC0GG6 
ACCAOTQCCA 
GCCTGA0CT6 
GCGTGGTCS^C 
GTGACTCTGQ 
TTTACCCCTO 
CCTGGATCAA 
ATGTTAT6CT 
AGTCGGCT6A 
AAACATCTCC 
TQAAATGCAG 
CCAGCCTCTQ 
GTGTGACTTT 
GGAACAATGA 
ATGTAAATCT 
AAAGGTTACC 




CTCCAGCATC 
CAACAACATQ 
AGGOXCCTG 

TAAAQTCATA 
CCTGCTGATC 
ACTCtCCCCT 
CCTCTCACCT 
GAAGTGGTGG 
AGAGCASTTA 
GGGCAAGCCA 
08TGCCTACC 
TCATGTGATT 
TGTTGTOGTG 



Seq ID NOt C36 DHA Sequence 
Nucleic Acid Acceesion #i XH_09S088 
Coding sequences 1..4074 " 



AXGACCCGGG 
ACGGCOGGCC 
.GACTGGCA6C 
GAGCTGCCAA 
GCTGTAOGCT 
TGCGCTGGAA 
GGATTCAGGG 
AOCAOOQGCQ 
GTGCTGCTAC 
CAACAAGTGC 
GCCGTGGGGA 
TGGCTGGGGG 
GGQGCCOCTT 
CTCATGCTGA 
ATCAC6CAGC 
AGCCAGCAGG 
CGGCTTCOOG 
CTGC06CTGT 
GGGQTX3CIGC 
CACCAAAGGC 
ACCTGOGGGA 
GQAQGGCCCT 
GTGGCACCTG 
GGAGACATCr 
TTCTCTCTGG 
ATGTTAGAGA 
ATOGCCTGTC 
AAACACCCAG 
GATTCACTCC 
TTAAAAAAAT 
CTTAACCAAT 
GTCTTCASTA 
TTGAAAT6TA 
ATAATTCATA 
TGCTCAA6CG 
GAATGTQ6CA 



11 
I 

CCGCGACAGC 
TGCOGOGOQC 
GOGGCTGCGT 
G0QGG6TGCT 
CCOGCOGAGG 
OGGAOACCCG 
GAT6OC06GC 
T0GACT60QO 
AGGGTTTGGA 
AGGAGCGCCA 
GCCCTOGCCC 
AGCTGCTGGC 

AGGA6CAGAA 
TGGAQCAGAA 
ATGGGGGCCT 
TGCTGACATQ 
CAGCGGACAG 
TTTGGGTCCC 
CAGCAGAGGG 
CGCTGCCCCA 
GTCCCCAGGA 
CAGTGGCCAA 
OTCOCCTCTG 
CGGAGTGGCA 
ACTACAGAAA 
TGGAGCAAAA 
TTACATGTTC 
AAAAAGTAAT 
GTTGTAAAAG 
QTTTGTCAAA 
AATTTTCAAA 
AAAAATATGG 
CTAA06A6AA 
GTACTACACA 
AAGCCTTTAO 



21 

I 

CGAGCCGGQC 
CTTCCTQCAG 
GCACCTGCGA 
GGAGGGGCTO 
CGGOGCAGTQ 
GGGCGGAOGC 
GGACCCCTGT 
CCT6CTGAAG 
GATGAT6G00 
GTGCCGCCTG 
ACTGGQAC6G 
TGAGGCCTGT 
6ACGTCCACC 
CCGACTCCTC 
GT0GGC6CTC 
GTCCCCGGCT 
GGCTGGGGCT 
TGGGGGACCC 
ATCTCCAGGG 
CGCGTG6CAG 
CTTTGAGTCC 
GGGGGATCGC 
AOTGACACCC 
TCCCAAGGGA 
ATGCCTQOAT 
CCTGTTCTCC 
TAAAGAGCCC 
TCATTTCAAC 
ACCAAGAACA 
AGTAGATGAG 
TACCCAAAAC 
TTCCAATAGA 
CAAATCATTT 
GTCCTACAAA 
TAAAAGAATT 
GTGGCCCTCA 



31 



0GC6T6TCTC 
AGCCTGCGCA 
GA6ATCCAGT 
AGCCAQCGGC 
CCCCGCGGGG 
AGOTOCAGOQ 
GCCOGAGGGG 
CAGATGAAGO 
CAGGGC0009 
GGCCAGAGCA 
CTACrQCCCA 
GCOGGTOGGG 
TCOTGCCOGG 
ACCCAGGAGG 
ATTAAOCAGC 
GGCCCCCACA 
CTCTTGAGTC 
CTCCACGAGT 
AAAAGAACTG 
CTOGGATOCQ 
CACAAAACAA 
AGCTGGAGCC 
AAOOGTGAGG 
CTGTTGACAT 
CATGCTCAGC 
CTGGGTATGA 
CAGAATATAA 
CAAGACCTTC 
TATGGAAAAT 
TGTQAGQTGC 
AAAATATTTC 
CACAATGCAA 
TGCATGTTTT 
TGTOAAGAAT 
CTTACTOGAG 
AACCTTACTA 



41 

I 

CCGCCTOGCC 
CCCTGTTA6A 
CCCTCTGGGT 
GC66GCCQCA 
CCCGAGCGGT 
GCCTGCAGGG 
AACACOGGAG 
AGCT6GAGCA 
ATTGGTACGA 
GAGCCAGCGC 
AGGTACAGGA 
CCCTGCCCAC 
GCTGGCA6CA 
TGACCGAQAA 
TATTTGAGQC 
TTGAGCCCTT 
CGCATAGTCC 
TACCAOATAC 
CTGACOCCAG 
GOOCTGAGGC 
CCTGTGAGCC 
ACCTGGGCGC 
AGGCTGCAGG 
TCAGAGACAT 
AGAATTTATA 
CTGTCTCTAA 
AGAGAAATGA 
AGCCAGAGCA 
GTGGACATGA 
ACAAAQGAGO 
AGACTCATAA 
GA7ATACTGG 
CACACCTAAA 
GTGGCAAATC 
AGAAACCCTA 
GACATAAOAa 



1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2157 



51 

1 

CGCGOGCAGC 
CATCCT6GAC 
C6AAG0GCGG 
GC06GGA6CA 
TCCAGAGCGC 
CCTTGGAGGC 
GCACACCATC 
GGAGAAGGAG 
GCAOCAGCTG 
CGACrTTGGG 
GGTGGCCOQG 
ATOCTCCTCC 
6CA0ATCATC 
GAGTGAGCGC 
CCGCGCCCTG 
GACTCGGTTT 
GCAQCTACTA 
CTGGTTTCCA 
GCTQCACTTC 
TGOQCCCGAO 
TGACTCCCTC 
AGCGTTTGAT 
GAGTCGGCAC 
AGCTATAGAA 
TAGAGATGTG 
GCCAGACTTG 
GATGGCAGCC 
GAGCATAAAA 
GAATTTACAA 
TTATAATGAC 
ATGTGTCAAA 
AAAGAAACAT 
TCAACATCAO 
CTTTAACCAC 
GAGATGTGAG 
AATTCAGACT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1060 

1140 

1200 

1260 

1320 

1380 

1440 

1454 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

ISOO 

1560 

1620 

1660 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 



1219 



wo 03/042661 



PCT/US02/36810 



GOAGAQAAAC CCTACGCATO TGAACSAATC5T GGCCAAQCCT TTACGOGCTC CTCAACACTT 2220 

AQAGAATTCA TACTGGAGAG AGACCCTACA AATCTCAAGA ATGTGGCAAA 2280 

S^^™ TATCCTCAOC CCTCATTTAC CACAAGAGAA TTCATACIGG AGAGAAACCC 2340 

5 It^SJJ^ CAAACCCTTT AACTCCTCCT CGACTCTTAA GACACATAAG 2400 

J ATAATTCATA CTGGAOAGAA ACCCTACACA TGTXSAAQAAT OTCGCAGAAC CTTTAACTOC 2460 

TCCTCAACTG TAAAGGCACA TAAGAOAATT CATACTGGAG AGAAACCATA CAAATCTQAA 2S20 

GAATGTCACA AACCTTTTAA GTGGCATTCA AOTCTTQCTA AAC3VTAAGAT AATTCACACT 2580 

GGAGAGAAAC CCTAC»AAT6 CAOTOACAGC AAAGCCTTAG CCAAATCATC AGAAGTGCAA 2640 

^ AAGGTCTACT CTQGAQATGG GGAAAATGQA ATCOGTOTAC ATAAGAAAAA GGAGACACAG 2700 

lU GGCTGGCTTG TGAGAAACAA GAACGAAAAT A6AACAGGGC TOTTCCAGAT CCGGGCTCCC 2760 

GTQAGACCTA ACAGGGACCC TTCATGGGGA CAGCAAGAAG GTTCACTGAC TQACCCAATT 2820 

CAGAGGAAGG AGQAACCTGA CCTTCAAAAT CACTATOACC ATCAGAATGC CTTAGAAOAT 2880 

CAAAGAAATA CTGGAGTGGG TCGACTGITG ACATTCAGAG ATGTAGTCAT AGAATTCTCT 2940 

1 S ^I^SJST ?*C«^lf6CCT GGATCAOSCT CAOCAQAATT TATATAGAGA IGTCATCTTA 3000 

S2SSfSI*°^ GAAACCTX30T CTCCCTGG6T ATTGCTGTCT CTAAGCCAGA CTTCATCACC 3060 

TGTCTGGAGC AAAATAAAGA GCCTTOGAAT ATAAAGAGAA ATGAGATGGT AACCAAACAC 3120 

CXyVGACCTTC CX3CCAQAGCT AGGCATAAAA GATTCACTCC AAAAAGTAAT ACCAAGAAGA 3180 

TATGGAAAAA GTGGACATGA CAATTTACAA OTAAAAACAT GTAAAAGCAT QOQTGAGTGT 3240 

OAGOTGCAAA AAOGAGOTTO TAATGAAGTT AACCAATOTT TGTCAACTAC CX^AAACAAA 3300 

ATATTTCAGA CTCATAAATG TQTCAAAGTC TTCGGCAAAT TTTCAAATTC CAATAGACAT 3360 

AAGAC3UU3AC ATACTGGAAA GAAACATTTC AAATGTAAAA AATATCGCAA ATCATTTTGC 3420 

ATGQTTTCAC AACTACATCA ACATCAOATA ATTCATACTA GGGAGAATTC CTACCAATGT 3480 

GAAGAATGCX3 GCAAACCCTT CAACT6CTCT TCAACCCTTT CTAAACATAA AAGAATTCAT 3540 

^- ACTGGAQAGA AACCCTAGAO ATGTGAGGAA TGTGGCAAAO CTTTTACCTG GTCCTCAACC 3600 

CTTACTAAAC ATAGGAGAAT TCATACTGQA GAAAAACCCT ACACATOTGA A6AATGTGGC 3660 

CAAGCCTTTA GCCOCTCCTC AACACTTGCT AACCACAAOA GAATTC3VTAC TCGAGAQAAA 3720 

CCATACACAT GTGAAGAATX3 TGGCAAAGCC TTTAGCTTAT CCTCATCCCT CACTTACCAC 3780 

AAGAGAATTC ATACTGGAGA GAAACCCTAC ACATGTQAAG AATQTGGCAA A6CCTTTAAC 3840 

TGCTCCTCAA CCCTTAAGAA ACATAAOATA ATTCATACTO GA6AQAAACC CTACAAATGT 3900 

OKI AAAGAATGTG GQAAAGCCTT TGCCTTCTCC TCAACTCTTA ATACTCATAA QAGGATTCAT 3960 

ACTGGAGAGG AACCCTACAA ATQTQAAGAA TGTQAC3VAAG CTTTTAAGTO GTCCTCAAOT 4020 

CTT6CTAATC ATAAGAGTAT GCATACTOGA QAOAAACOCT ACAAATOTOA ATAA 4074 

Seq ID NO: C37 DKA Sequence 
Nucleic Acid Accession «t m 032044 
Coding sequence i 182 . . 658 " 

1 11 21 31 41 51 

I I I ) i i 

^^TATAAA AGCTOCAGAA ACXSTTGACTG GGACCACTGG AQACACTGAA GAAGGCAGGG 60 

GCCXrXTAGAG TCTT60TTGC CAAACAGATT TGCAGATCAA GGAGAACCCA OaAGTTTCAA 120 

AGAAGCGCTA GTAAOOTCTC TGAGATCCTT GCACTAQCTA CA-TCCTCAOG GTAGOAGOAA 180 

GAT^CTTCC AGAAGCATGC GGCTOCTCCT -ATTGCTQAGC TGCCTGGCCA AAACAGGAGT 240 

CCTGGOTGAT ATCATCATGA GACCCAGCTG TQCTCCTGGA TGGTTTTACC ACAAGTCCAA 300 

TTGCTATQGT TACTTCAGGA AGCTGAGGAA CTGGTCTGAT GCX3GAGCT0G AGTGTCAQTC 360 

TTACX3GAAAC GGAGCCCACC TGGCATCTAT CCTGAGTTTA AAGGAAGCCA GCACC3WAGC 420 

AGAGTACATA AGTGGCTATC AGAGAA6CCA QCOGATATGO ATTCGCCTGC AOSACCCACA 480 

<5AAGAGGCAG CAGTGGCAGT QGATTGATGQ GGCCATGTAT CTOTACAGAT CCTGGTCTGG 540 

25?^'^ GOTGQQAACA AGCACTGTGC TOAGATQAGC TCCAATAACA ACTTTTTAAC 600 

JVr TTGGAOCAGC AAOGAATOCA ACAAGCGCCA ACACTTCCTG TGCAAGTACC GACCATAGAG 660 

CAAGAATCAA GATTCTGCTA ACTCCTGCAC AGCCCCOTCC TCTTCCTTTC TOCTAGCCTC 720 

GCTAAATCTG CTCATTA TTT CAGAGGGGAA ACCTAQCAAA CTAAGAGTOA TAAGGGCCCT 780 

ACTACACTGG CTTTTTTA GG CTTAGAQACA QAAACTTTAG CATTGGCX3CA QTAGTGGCTT 840 

^^2^^^ ATOTTTOCXX: CGCCATCCCT TTCCACAGTA TCCTTCTTCC CTCCTCCCCT 900 

GTCTCTGQCT (JTCTC6AGC3V GTCTAGAAQA GTGCATCTCC AQCCTATGAA ACAGCTGGGT 960 

CnrrGGCCAT AAGAAGTAAA GATTTGAAGA CAGAAOOAAO AAACTCAGGA GXAAGCTTCT 1020 

AGACCCCTTC AGCTTCTACA CCCTTCTQCC CTCTCTCXaT TOCCTGCACC CCACCCCAGC 1080 

C31CTCAACTC CTGCTTQTTT TTCCTTTGGC CATAOGAAGG TTTACCAOIA GAATCCTTGC 1140 

TAOOTTGATO TGGGCCaTAC ATTCCTTTAA TAAACCATTO TSTACATAAQ AAAAAAAAAA 1200 

Seq ID NOt C38 DMA Sequence 

Nucleic Acid Accession Ui Eos sequence 

Cbding sequences 52.. 3 042 

65 1 11 21 31 41 51 

^ ' i I I I 

GCTCA CgAG QAAAAATATG CAATCGTCCC ATTGATATAC AOGOCSUrTAC AATGGATGGA 60 

GTTAACXntA GCACC6AGGT TQTCTACAAA AAAGGCXaCO ATTATAGGTT TGCTTGCTAC 120 

~ GACOGGGOCA GAGCCTGCCG GAGCTACXWT GTACGOTTCC TCTGIGGQAA GCCTCTCAGG 180 

/V CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCIGAA CTTGGAGGAT 240 

AATGTACAGT CATGGAAACC TGGAGATACC CT6GTCATT6 CCAOTACTQA TTACTCCATC 300 

TACCAGQCAG AAQAGTTCCA GGTGCTTCCC TOCAGATCCT G0G0C0CC3kA CCAOOTCAAA 360 

GTQGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGQCGT GGACATCCGG 420 

GCGGAOTTTO GGCTTCTGAG CCGGAACATC ATAGTGATGO GGGAQATGGA GGACAAATGC 480 

rt2SlEI^°^ GAAACCACAT CTGCAATTTC TTTGACTTCO ATACCTTTGG GGGCGACATC 540 

AAGTTTOCTC TGGGATTTAA GGCAOCACAC TTOGAOGGCA CJCaABCTGAA GC3VTATGGGA 600 

CAG^GCTGG TGGGTCAGTA CCCGATTCAC TTCCACXaGG COGGTGATCT AQAOSAAAGG 660 

GGA^TTATO ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC 720 

RO TCCATOGCrC CAATGGCTTG TTGATCAAGG ACGTrGTOGQ CTATAACTCT 780 

JI2j?^^ GCTTCTTCAC.GGAAGATGGG CCGGAGQAAC OCAACacrrr TGACCACT6T 840 

CTTGGCCTCC TTOTCAAGTC TGGAACCCTC CTCCCCTOGQ AOaSTOACaMJ CaWOATOTCC 900 

AAQATCATCA CAGGAGACTC CTACCCAGGG TACATCCCC31 AOCOCRGGCA A6ACT0CAAT 960 

GCTOTOTCCA CCTTCTGGAT GGCOUVTCCC AACAACAACC TCRTCAACTG TGCCGCTGCA 1020 

OOATCTSAOG AAACTGGATT TTOOTTTATT TTTCACCAOO TACCAAOGOQ CCCCTCCtflXI 1080 

1220 



40 
45 
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5 
10 
15 
20 



30 



GGAATCTACT CCCCAGGTTA TTCRGAOCAC ATTCCACTGO GAAAATTCTA TAACAACC^GA 1140 

GCACArrcCA ACTACOGGGC TG6CATX3ATC ATAGACAA06 GAGTCAAAAC CAC0GAG6CC 1200 

TCTGCCAAGG ACAAGOGGCC GTTCCTCTCA ATCATCTCTG (3CMSATACA0 CCCTCACCAG 1260 

GAOGCOGACC CGCTQAAGCC CXX3GGAGC0G GCCATCATCA GACACTTCAT TGCCTACAAG 1320 

AA0OU3GACC A0G6G6CCTO GCTGOGOGGC GGGGATGTGT G6CTGGACAG C lXXVatflTi ' 1380 

GCTGACAATG GCATIOGCCT QACOCTGGCC AGTG6TGGAA CCTTCC06TA TGAGQA0Q6C 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTT6G06AGA GT6GCAA0GT G6GGA06GAA 1500 

ATGATGGACA ATAGGATCTG GG G CC C i TO C GGCTTGOACC ATAGOGGAAG GACCCTCCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 

AACTGCACTT TCC6AAAGTT TGT6GCCCIG GAG6GCCGGC ACACCA6C6C CCTGGCCTTC 1680 

CGCCTGAATA ATGCCT66CA GAGCrPQCCCC CATAACAAOQ TGACGGGCAT TGOCTTTOAG 1740 

GACSGTTCCGA TTACTTCCAG AGTOTTCTTC GGAGAGCCTO GGCCCTGGTT CAAOCAGCTG 1800 

GACATGGATG GGGATAAGAC ATC T GT G TTC CATQAOGTCO A0GGCTCC3GT GTOOGAGTAC 1860 

CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGQTCC GGCACCCAGA CTGC31TCAAT 1920 

GTTCCCGACT GGAGAGQGGC CATTTGCAGT G6GTGCTATG CACAGATGTA CATTCAAGCC 1980 

TACAAGACCA GTAACCTG06 AATQAAGATC ATCAAGAATQ ACTTOCCGAG CGAOCCTCTT 2040 

TACCTOQAGO GGGOOCTCAC CAOGAGCAOC CATTACCA6C AATACCAACC G6TTGTCACC 2100 

CTGCAGAAGG GCTACACCAT CCACTOGQAC CAOAOSQCCC CCGCXX3AACT CQCCATCTGG 2160 

CTCATCAACT TCAACAAGGG OGACTGGATC OGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 

ACATTCTCCA TCCTCTOGGA IGTTCACAAT 06CCXGCTGA AGCAAAOGTC CAAGACX3GGC 2280 

6TCTT0STGA GGACCTT6CA GATGGACAAA QTGQAGCAGA GCTAOCCTGG CAOQAGCCAC 2340 

TACTACTGGG ACGAGQACTC AG6GCTGTTG TTCCT6AA6C TGAAAGCTCA GAACX3A6A6A 2400 

GAGAAGTTTG CTTTCTGCTC CATGAAAGOC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460 

^. CCAAAGAACO CAGGOGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACOGAGAGG 2520 

25 GCTGTOSTAG AC6TGCCQAT GCCXAAGAAG CTCTTTGOTT CTCA6CTGAA AACAAAGGAC 2580 

CATTTCTTGO AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAA08AC 2640 

TTCGCTTACA TTQAAGTGOA TOGOAAGAAG TACCCX91GTT OQQAGQATGQ CATCCAGGTG 2700 

6TGGTGATTG AOGGGAACCA AG6GGGCGT0 GTGAGCCACA OGAGCTTCAG GAACTCCATT 2760 

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCXSA CCATCCCTGA CAATTCCATA 2820 

GTGCTTATGG CATCAAAGGG AAGATAOGTC TCCAGAGGCC CATGGACCAG AGTGCTGQAA 2880 

AAOCTTOGOG CAQACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTOGT TG6CTTCAAA 2940 

GGCAGCTTOC GGCCCATCTG GQT6ACACT0 GAGACTQAGO ATCACAAAGC CAAAATCTTC 3000 

CAAQTTGTX3C CCATCCCTGT GGTGAAQAAG AAGAAGTTGT GAGGACAGCT GCCGCCCX36T 3060 

GCCACCT(X!T GGTAGACTAT GACX3QTQACT CTTGQCAGCA GACCAGTGGG GQATGGCTGG 3120 

3j GTCCCCCAGC CCCTGC!CAGC AGCTGCCTGG GAAGGCCX3TG TTTCAGCCXTT QATGGGCCAA 3180 

OGQAAGGCTA TGAOAOAGOC T6GTGCT0CC ACCT6CXXCT ACTCAAGTQT CTACCTGGAG 3240 

CCCCTG0G6C GGTGCTGGGC AATGCTGGAA ACATTCACTT TCCTGCAOCC TCTTG6GTGC 3300 

TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA QACCTGGGTT 3360 

GTGCTGACAG CAAAQATCCA CTTTGOCAGG AGCCCTGACC CAGCTAQQAQ QTAGTCTGGA 3420 

GGQCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 

GAGAAAGAGC CTTGOCCTTA AGGAAATCTT TACTCCTGTA ASCAAGAGCC AACCTCACAG 3540 

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 

GTQTCCACCT TTCAGOAGAC TTTGAGTGGC AGGTTTGGAC TTC3GACTAGA TGACTCTCAA 3660 

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 

AACAGTTCAT 66ATATCCAC TGATATCXAT GATGCTGGGT GCCCXAGCGC ACAOGGGATQ 3780 

GAOAOGTGAO AACTAATQCC TAOCTTGAGG GGTCTOCAOT CCAGTAGGGC AGOCAGTCAG 3840 

GTCOVTGTGC ACTGCAATGC CAGGTOGAGA AATCACAGAG AG6TAAAATG 6A6GCCA0TG 3900 

CCATTTCAGA GGQGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960 

TTTGCTGGGG GGAQATGAGG CAGCCTCTGG AATGGCTCAG GOATTCAGCC CTCCCTGCCQ 4020 

OU CX0CCTGCT6 AAOCTGGTGA CTAGGGGGTC GCCCTTTGCT CAOGTCTCTC TGGCCCACTC 4080 

A3X3ATOQAOA A6TGTGGTCA GAGGGOAGCA ATGGGCTTTG CT8CTTATGA QCACAGAOGA 4140 

ATTGAOTCCC CAGGCAGCCC TGCCTCTQAC TCCAA6AGGG TQAAGTCCAC AGAAGT6AGC 4200 

TCCTGCXTTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT QAOCACAQGO GOCCTCCAGG 4260 

AGACCCTAGA TGT6CTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 

TATCTA6CCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT 0A6CCCCCAA GAT6GGAAAG 4380 

AACCACACAG CTAAGGGAGG GCCT6GGGAG CCGCACCCTA GCCCTT6CT6 CCACACXACA 4440 

TTGCCTCAAC AACCGQCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500 

GACAAGTCCC CTOGAAGGAA AGGAAATGAC TAQAGTAQAA TQACAGCTAO CAGATCTCTT 4560 

CCCTC CTGC T CCCAGCX3CAC ACAAACCCX3C CCTCCCCTTG GTGTTGGCGG TCCCTQTGGC 4620 

CTTCACTTTG TTCACTACCT QTCAGCCCAG CCTGGGTQCA CAGTAGCTGC AACTCCCCAT 4680 

TOGTGCTACC TOGCTCTCCT 6TCTCTGCA6 CTCTACAGGT GAGGCCCAGC AGAGG6AGTA 4740 

GGGCTCX^CCA TGTTTCTGGT GAGCCSU^TTT GGCTGATCTT GOOTQTCTGA ACAGCTATTG 4800 

GGTCC3VCCCX: AGTCCCTTTC AGCTGCTGCT TAATQCXXTTG CTCTCTCCCT GGCCCACXTTT 4860 

ATAGAGAGCC CAAAGAGCTC CTGTAAGAQG GAGAACTCTA TCTGTGGTTT ATAATCITGC 4920 

ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 

CAACCACAAA CTCTTTCCTT C3UU«SAGGGC CTGCCTOGCT CCCTCCACCC AACTQCACCC 5040 

ATGAGACTCG GTCCAAGAGT CCATTCOCCA GGTGOGAQCC AACTGTCAGQ GAGGTCTTTC 5100 

CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCT6CTTTT AAAQATATGG 5160 

„ CTGCTTCAAA GGCCAGAGTC ACAGGAAGQA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 

/U AGGA6AGTTA AAATGACCTC ATGrCXTTTCT TGTCCAOGGT TTTGTTQAGT TTTCACTCrT 5280 

CTAAT6CAAG GGTCTCACAC TOTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 

AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AQTTCTCAaA 5400 

GTTGTACa^TA TGTTTCACAQ TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 

ACCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520 

TTGTCCTCCr TGTTAT TTCT G TTTGTAA QA CTTAAGTGAG TTAGQTCTTT AAGGAAAGCA 5580 

AOGCTCCTCT GAAATGCTTQ TCTTTTTTCT GTTGCOSAAA TAGCTGGTCX: TTTTTOGGGA 5640 

GTTAQAT6TA TAGA6TGTTT GTATGTAAAC ATTTCTTGTA GGCATCAGCA TGAACAAAGA 5700 

TAtATTTTCT ATTTATTTAT TATAT6TGCA CTTCAAGAAG TCACT6TCA6 AGAAATAAAG 5760 

AAITGTCTTA AATOTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 5808 
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Seq ID HOt C39 DNA Sequence 
Nucleic Acid Acceesion »t MM_014373 
Coding sequence: 322.133S " 
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1 
I 

GTGGCCTCGA 
GCCrCXXKSAG 
ATGTCTCCQA 
AAATAACKTA 
6A>CCCACTGG 
TCTTGAAATT 
CAGTTAOQTC 
GGGAAAATAT 
TTTATOGAAT 
TCCATTATAT 
ATCTQCCTAT 
CTGACAGCTT 
T6TCAAAAAT 
GTTTTGGGAO 
TGTCCTTTCT 
TTTGTAGCTT 
ACTTCCTATA 
QTGAGATCTA 
CCATTTGTAC 
GAGATGAATA 
TTTAATTC3TC 
TGGAAGTGCT 
TCAATAATQA 
TATGAACAGA 
CCCTGACTGA 
GTTTTATTTT 
TTACAAATAT 
GATGTTTQTQ 
ACTGACAACT 
AAAAAAAAA 



11 
I 

GGTGGTGGCA 
QCAGGTTCX5G 
GCTTACTCAC 
ATTGAAG6CA 
AGAGGACAGA 
TAACTAAAAA 
AAACAAACCA 
TATTAAATAT 
ATTTTT6CAT 
TGTATTTCAQ 
TTACTCAAAT 
GTATAQATTA 
TATTTTATTT 
ACCCAGCCAT 
ATGTCAGCAT 
TCATAACCTG 
TGAATGAAAC 
AAAAAATATT 
TACTTGAG6T 
TTCCCTGGTT 
ACAAQCTTAA 
GCTTCATTCC 
TTTGTTAATA 
AAGAACTCAG 
TAGCATTTCA 
AAAAACAAAA 
TACTTT6TTA 
TTACTCAAAA 
TTAAGATATC 



21 

I 

GGGCCGCCCC 
CTGGAAGGAA 
ATAGCATATT 
GTAAAAGTGA 
AAATGAAGCA 
TATGACTQCT 
GCCCCTAGAC 
CCTTACACTA 
TTCACTA6CA 
GQATTTTGTA 
TATTTCCTTT 
TTGCCTGAAT 
CTTTACAGTA 
CTACCAAAGC 
TCAGAGTTAC 
TTGGGAAGAA 
TATCTTATAT 
CTTATCCAAG 
AATCATTGTT 
ATACTTTGTC 
TTTAAAAGAC 
ACTTACAATT 
TTATTAATTA 
GACATATTAA 
QAATGTGTCT 
TAATTCCAAG 
TTAACACAAA 
AAACTACIGG 
AACCTAAACA 



31 
I 

CTGCAGTC06 
CCGCTCTOGC 
GGTATATCAA 
AATTAAATAG 
GTOTTTTATC 
CrCTCTTCAO 
GTTAACTATC 
G6AATGAGAA 
TTOGTTGATC 
CTTTTAAGCA 
ACTTATGGCT 
TTCTCTAAAA 
ATTTTAATTT 
CTQAAGGCAC 
TGGCTGTCAT 
GTTACTACTT 
TTTCCTTTTT 
CTCATTGTCT 
TTACTTAAAG 
AATAGTTTTC 
ATTGGATTAC 
CCTAATCTTQ 
AAAGTTACAG 
AAAATAAACT 
TTTGAAGGGC 
AAGTTTTTAT 
AA6TGATAA6 
ATQCAAACTG 
TTTTTATTAA 



41 
I 

GAGACXSAACX3 
TTCXSTCCTAC 
AATGAAATGC 
GAA6ATCATC 
ATGTGTATTT 
AGAACTGCTC 
TGCTATTCTT 
GAAAAAACAC 
TTTTACTTTT 
TTAGGTTCAC 
TTTTGCATTA 
CAACCAAGCT 
GGATTTCAGT 
AGAATGCTTA 
TTTTCATGGT 
TGGTACAGGC 
CATCCCACTC 
GTTTTCTCAG 
TTCAGATTCC 
TCATTQCTAC 
CTTTGGATCC 
AGCAAATTGA 
CTGTCATAAG 
GAACTAAAAC 
TATACCAGTT 
AQTTATTCAQ 
AGTTAACATT 
TTATGTAAAT 
ATOTTCAAAT 



Seq ID NO: C40 DNA Sequence 
Nucleic Acid Accession «t BC012089 
Godlng sequence i 1..2571 



1 
I 

ATGGCCCTOG 
GGGCAGCCTT 
TATGAGACCC 
ATCTTTCTCT 
GAGAAGQCAT 
ATTCTATGCT 
TTTTGTATGT 
AATGG6CCCT 
AGCATTGGCA 
AGTOSGAAAC 
GA6CAAATGA 
C7QAACAGTA 
ATCCCTGTTC 
TTGGAOAACSl 
AGTCTGACCA 
CATCCATCAA 
CCTGAACTGA 
AGGACAGATT 
AGAOTACAAC 
GQTTCAGATA 
TCTGTTTATG 
TATGATTCAT 
ATTTTTTACT 
ACCACCOGAG 
AGTTTCCTCT 
AATGTGGAAA 
ACACCCTACT 
TCAAAAATGA 
TACGGCACTC 
CATACTGGAA 
TT6G6TGCA6 
AATTATGACA 
TTT6CATATG 
CTQAAAA6AG 
CAATCACrOA 
TTGGAGAGAG 
AATACTTCCT 
TTTGAACATT 
CCT6TGGGCA 
CCCTTGAATT 
ATTTTTGOOO 
GATGTTGAAA 
CATGTATATO 



11 
I 

TACT0Q6CTC 
CATCCACAGA 
AAGACTCCCA 
ATGTGGTACA 
ATGAATCCAA 
6TGTCCTGGG 
GTCGTTGCTG 
TCCTGAGGAA 
TCTTCTATG6 
TGGCAGATAG 
AATATATATT 
TCAATTCAGT 
TTGATGAGAT 
T6AACAGCAC 
60GTGAAAAC 
GTGAAACCTG 
GGCAGCTTCC 
TGQATGGCCT 
GCCAAAOCAC 
TCQACAATQT 
TTAATAACAC 
ACTGGTGGCT 
ACCTGGGCTT 
GCT0T6TCTC 
TTT6CTG6AT 
AACT6ATCTG 
TACTAAATGA 
AGCTCACTTT 
TTCACCT6CA 
GCATAAGCAG 
CAGGAAGAAA 
GCTACTTGGC 
ATCTAGAAQC 
AIGCACAAAC 
OCACTCTATA 
TAACTAGGAT 
CTQTTATTAT 
ATCT6CAGTG 
CXX3CTCTAQA 

T GiT r ix;Gi " r 

TAAAACTGGC 
CTATACCCAT 
GTATTCACAA 



21 

I 

CCTGTT6CTG 
TGCTCCTAAG 
TAAAGCTGGA 
GCOQOGTGAT 
AATTGATTAT 
GCTGCTOTTT 
TAACAAATGT 
ATGCTTTGCA 
TTTTGTGGCA 
CAATTTCAA6 
GGCCX3M3TAC 
GCIAGGAGGC 
TAAGTCCATG 
CTTGAAGAGC 
TAQCCTGCGG 
CAACAGCATC 
ACCCGTGGAT 
GG TCCA ACAO 
GACTGTOQTA 
AACTCAGOGT 
TGAAAGTTAC 
GGGTGGCCTG 
ACTGTGTGGC 
GAACACOOGA 
ATTGAT6ATC 
TGAACCTTAC 
AGACTGGGAA 
TGAACAAGTT 
GAACAGCTTC 
TGAATTGGAA 
AAAOCTTCAG 
TCAGACTGGT 
AAAA6CAAAC 
TATTAAAACA 
CCAAAG08TC 
TCTAGCTTCT 
TGAGGAAACT 
GATCXSAQTTC 
TACTGCTGTT 
TGGCATAGGA 
TAAGTACTAT 
GAAAAATATG 
TCCT6TTATG 



31 
I 

CTGGGGCT6T 
GCTTGGAATT 
CCCATTGGCA 
TTCCCAGAAO 
GACAAQATTO 
ATTATTCTGA 
GGTGGAGAAA 
ATCTCCCTQT 
AATCACCAGO 
GACTTGCX3AA 
AACACTACC31 
GGAATTCTTG 
GCAACAGGGA 
TTGCACCAAC 
TCATCTCTCA 
AOATTGTCTC 
GGAGAACTTG 
GGCTATCAAT 
GCAGGTATCA 
CTTCCTATTC 
ATCCACAGAA 
GTCATCTQCT 
GTGTGCGGCT 
GGCGTCXTCC 
ATTCPGGTTC 
ACX3AGCAAGG 
TACTATCTCT 
TACAGTGACT 
AATATCAGTG 
AGTCTGAAG6 
GATTTTGCTG 
AAATCCCCCG 
AGTTTGCCCC 
ATTCACCAQC 
AA6ATACTTC 
CTGGATTTTO 

aaqaagtatg 
tctatcagtq 
gatotctttc 

AAAGCTACTG 
cgtcgaatgg 
GAAAATGGTA 
ACAAGCCCAT 




TGCACCAGCG 
TGGTGATTTG 
TAAGAACC06 
CTCTCTTGAA 
AGGACAAGGC 
ACOGACTGAG 
TCAAGGAGAC 
AAAOTACACA 
ATGACCCTCT 
TAAGCCAGCT 
ACAACGTTAA 
CCCTTAATGA 
AAAGGGTCTT 
AGGATATACT 
ATTTAOCTAC 
CTCTGCTGAC 
ATGACA06CA 
TCATOGTTGG 
TTACCTTTQT 
AATTATTCCJS 
CTGGGAAGCT 
GCAAAAAAAA 
AACATCTCAA 
TAAATCTTAA 
CTTCTGGAAT 
CAGGAGTGAA 
CAGGAAATTT 
A AOGA G TCCT 
AA06CACAG6 
CTCAGAACTT 
GGAGAACAAT 
AGAAAGT6GC 
TGTGTAGCTA 
TATTTTTACT 
ATTCGQAGGA 
ATAATGGTTA 
CACAACATTG 



51 
I 

CAO6GAC0GG 
ACTTGCGCAA 
AAGGAACCAA 
AGTCAAGGAA 
CA6CAGGTCT 
TTTTCAGTAC 
GATCATACTT 
CTGTCAAAAT 
GOTAAACATT 
TAAATACCAC 
TCCAGTTTTC 
TTCATTTAAG 
CCTTGCTTAT 
TTCTOGTCAC 
GATGATTTTA 
TATCAGGATA 
CAGTTATACT 
TACCTGGTTA 
AGCATATATT 
AGTGTATTGG 
ATTTGTCAAC 
AAAGCCTATA 
ATCATAATTT 
AACTTTTGCC 
ATTAAATAGT 
GGACACTATA 
TGGCTATACT 
CTGAGATTTC 
GTAAGCAAGA 



51 

I 

CTTTTCAGGA 
TGCAACAAAT 
ACTAGTGCAT 
AAAAXTCTTA 
A6CAGGGATT 
GGGGTATTTC 
ACAGAAGGAA 
TATAATAATA 
GATGAAAAGG 
TGAAACTCCA 
GTTCACAGAT 
ACCCAACATC 
CAAAGAGGCG 
GCTTAGCAGC 
6TGCTTGGTG 
GAATAGCAAC 
TAAOGTTCTT 
TATACXTOVC 
GAATTOCATT 
CTCAGCATTC 
ATTGGAAGAG 
CCTCATGGTQ 
TGCCACOGCG 
AGTTGGATTA 
CTTTGGTGCA 
GGTTTTGGAT 
ATTTAATAAA 
TAGAGGCACT 
CATXAA7QA0 
TATCTTTCTG 
AGACAGAATG 
TCTTTTATCA 
GAGGAACTCC 
TCCTATAGAA 
GAATGGATTQ 
CATCACAAAC 
AATAGGATAT 
ATGGTGCAAA 
CATTATCGAC 
TCG6GCTCTA 
OGTGTAOGAT 
TCATAAAGAT 
A 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

16B0 

1740 

1749 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1060 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1660 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2571 



Seq ID NO: C41 DMA Sequence 
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Nucleic Acid Accession 9: MM_033049 
Coding sequence: 2 8.. 1566 ~ 
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GQVOQCGTCC 
CTCCTTTCTG 
GAAACTGCGA 
GAAACTOCTA 
CCCATAATTA 
AGACATAGTT 
GTAAATTCAT 
ACAATGGTTC 
CAATCATCAG 
GOTGCCAGCA 
CATAATACAA 
AAAGGAAAGG 
GAGAAACATT 
GTATTTGGCA 
CCAAGATCTO 
GCAGAAACCA 
AGTAGCTCAA 
AACCAGACTG 
AGGCCTAACX: 
AAGQCACAGC 
GTGCCCGGCT 
GOACTCGACT 
ATTGTCATTC 
AAGCATATTG 
ACAGGCTTCA 
TCCAGAGACA 
TATTA6AATC 
TTAGAGTGTT 
TTTTTCTTCC 
AATCCAGCTC 
CTTGTCAGAG 
TTTTCTTAAT 
CCCAGCATTO 
GQAGACATAC 
CAAGGGTCAC 
ATGGAGAGAT 
AGACTGGAAA 
TTTTTGiATGC 
TTTTTGAAAT 
AGAGAGGCAA 




XI 
1 

GAGCAAGAAC 
TAAACACAGC 
CTAGTOGTCC 
GCACX3VCAGC 
GTACACATAG 
CCTCCACAAT 
TA6CTACCTC 
CTTCTGAAAC 
GGCCTCCCAC 
ATCCTTGCCA 
GTTTTTGCCT 
TATTCCCTGG 
CCATGGCCTA 
CATCTGTTTA 
AAAT6CGTGC 
CAAGTGACAA 
QCAACTTTCT 

cggatgactg 
cacagagccc 
acaaocaatq 
acx:aggaaga 
gtaaggacaa 

TCAGCATGAT 
AAGAAGAGAA 
CX»ATCTTGG 
GCCAGATGC3V 
ATAA6AAT6T 
TAGAAAGACT 
ATCT6ACATC 
ACTTGCTAAA 
AGGTGGTTTT 
TCTTTTCCTG 
CTTGCTCCTG 
GCATCAACTO 
AAGCrCACTC 
TCTGGCAACC 
GAGACATATC 
TAATTAQAAO 
GCTAGGAGGC 
AGGAAACTGG 
AGGTCCCCTC 
TGGGAGAAGG 
GAOACAAJICT 
AGCCTCGGGQ 
AGGCTCAGCT 
GCAACCACAT 
CCTTGTCCTC 
QAATGQTAAT 



21 
I 

AGCTAAAATO 
CACCAACCAA 
TACAGTAGCT 
AAATACACCT 
TTCCTCCACA 
TCCTATACCT 
TGACATAATC 
ACAAAGTAAC 
TGGCACOGCT 
AGATGATCCC 
6TGTTTAQAA 
6AAGATTTCA 
TCAAGACTTO 
TGGACAGACT 
TGATGACAAG 
TGAGAAGACT 
AAACTATQAT 
CCTCAATGGT 
TTTCTGCQTT 
CTTAATAAAG 
TGCTAATGGG 
ATTTCAGCTG 
AATTGCATTG 
CTTQATTGAC 
AGCAGAAGGG 
AAATCCCTAT 
GQAACCOGCC 
GAT6GAGAAG 
T6CCAGCCTC 
TAAGAATCTA 
CTTCAATCAG 
GTAGGGCAAC 
GGCAAACATT 
TGOAGQTCCG 
TCTGACAAGT 
TTTGAACAGC 
TCTCAGCTTT 
GCCItSGACTA 
AOAAGGGQCA 
GTGGGGAGGA 
CTCCATCAQC 
TTTAAAAACA 
GTCTGGGAAA 
TGTAaGTTTC 
GTGGCCACCA 
CCA6TACAA0 
TTAAAAGTTA 
GCAATAAAOT 



31 
I 

AAAGCCATCA 
GGCAACrCAG 
GCAGCTGATA 
TCTTTCCCAA 
ATTCCTACAC 
ACTGCTGCAG 
AC06CTTCAT 
AATGAAATGT 
TTATT6GA6A 
TGTGCAGATA 
GGGTATTACT 
GTGACAGTAT 
CATAGTGAAA 
GTAATTCTTA 
TTTGTTAATG 
GTQACTGAGA 
TTGACCCTTC 
TTAGCATGCO 
GCTTCCAGTC 
AAGAGTGGTG 
AACTGCCAAA 
ATCCTCACTA 
ATTGTCACAG 
GAAGACTTTC 
AGCGTCTTTC 
TCAAGACACA 
ATGGCCCCCA 
TGAO CACCAG 
TCTGAATGGA 
TGACATTAAA 
TACAAAGTAC 
AAOAACCATT 
GCTCTTGAGT 
AGGGQAIOAQ 
CAGAATAGQG 
CCAGAGCTTQ 
TTCAGGAGGC 
AAAATOTCCA 
GAQAGTAAAA 
TCAATTAGAO 
AAAGGAGCAC 
AAAAATGCAG 
GGGT606AGA 
TGAOGTOTGC 
ACACACAACC 
CTTTTACAAA 
TTTTATTTGT 
GCCTTTOTTA 



41 

I 

TTGATCTTAC 
CTGATGCTGT 
CCACTGAAAC 
CAGCTACTTC 
CTGCTCCCCC 
ACAGTGAGTC 
CTCCftAATGA 
CCCCCACCAC 
CCAGCACCCT 
ATTOGTTATG 
ACAACTCTTC 
GAGAAACATT 
TTACTAGCTT 
CTQTAAGCAC 
TAACAATAGT 
AAATTAATAA 
GGTGTGATTA 
ATTGCAAATC 
TCAAQTGTCC 
GGGCCCCTGA 
AGTGTOCATT 
TTCTGGGCAC 
CAAGATCAAA 
AAAATCTAAA 
CTAAGOTCAG 
GCAGCATGCC 
ACCAATGTAC 
T AAAGAT CTG 
AGTTGTQAAT 
TGTAGTAGAT 
TGAGACAATG 
TCCAATCTAG 
TAAGTGAGCT 
AAGGGATACC 
ACACTGCTTC 
CAACCTAGCC 
GT6CCTGGGA 
CTATGGGGT6 
AACATGACCT 
AGGAGGCACC 
TTCTCTAATC 
QAGTAAGAGC 
GGGAOCTTGT 
CATTOGGGCC 
ACACACACAC 
TGTTATTAGT 
TATTATTATT 
GATGGTGAAA 



51 

J 

TCTTCITGCT 
AACAACCACA 
TAATTTOCCT 
ACCTGCTCCX: 
CATAATTA6T 
AACCACAAAT 
TQGATTAATC 
AGAAGACAAT 
AAACAGCACA 
TGTTAAGCTG 
TACATGTAAG 
TGAQXAGAA 
GTTTAAAGAT 
ATCTCTGTCA 
AACAATTTTQ 
AGCAATTAGA 
TTATGGCTGT 
TGACCTGCAA 
TGATGCXTTGC 
GTGTGCX5TGC 
TGGCTACAGT 
CATCGCTGGC 
TAACAAAACG 
ACTGCGGTCG 
GATAACGGCC 
CCGCCCTGAC 
AAGCTATTAT 
GCCTCOQQGG 
6TTTGCAA00 
GCTATTAGCQ 
GTTAGGGTTG 
AGGAAAGCTC 
AATTCCCCTO 
CACCACCTTT 
TATCCCTCCA 
TCACCCAAGA 
ATOCAGGAAC 
CACTCTAOUS 
GGTAOAAGGA 
TGGGATCCAC 
ATGCCCTCCC 
CTTAGQTCAG 
GCTCAGGAGT 
TCASCCTTCT 
AACXyVCACAC 
GTCCTTTTTT 
T6TTCTTGAC 
AAAAAAAAAA 



Seg ID KOt C42 DNA Sequence 

Nucleic Acid Accession #i NM_001432.1 

Coding sequence 1 167.. 676 



TCACTTGCCT 
TGAC RflCOGC 
TCOGCAaCCG 
GAGGATGGAG 
TCTTCTACAG 
TGATAACTGC 
AAC A AAflXCiX 
GGACATSAGT 
CTTC^TTTA 
TATTATTTTG 
AAATOQAAAA 
GTTGCOGCAA 
GAATGGOGCC 
CATTTTATTA 
GTACTTGAAA 
TGTGCAGAAA 
6CTAAATGCT 
AGCAATTCCC 
ATATATAGTC 
GGTTTGACTG 
TAATGCAATT 
AAACAAGCTA 
TATATGAGG6 



11 
1 

GATATTTCCA 
TCTCCAGCCA 
COCTCCGCCA 
ATGCTCTGTG 
GCAGTCCTCA 
ACA0CTTTA6 
AGCTCTGACA 
CAAAACTACT 
ACOGTCQICC 
TTTCTTATCA 
AGTAAAGAAC 
GTCTGAATAT 
ATCAAACTTA 
ATAATATTTA 
AATGTTTTTA 
ATATTTAATA 
TCATTOAAAG 
AGGTCATAGC 
AAATCGATTT 
TACCATGGTT 
AGAATTTGGG 
ACCCTGGGGA 
GCAGTGGACA 



21 

I 

GT6TCAGAGG 
CTGCOGOGAO 
AGCCGCAGOG 
OOGGCAGGOT 
GTACAACTGT 
TTCAQACAGA 
TSAATGGCTA 
GCAGOTGTGA 
AAGCTTTAAG 
CA6TCGT0GG 
CAAAGAAGQA 
GAGAGAGTTA 
TGOGCAGGGA 
TGTTGGGTCA 

Trmxa ' rm ' 

TCAAAAGAAA 
CTTCAAAGTT 
TCAAQAATTG 
AGTAAGTATG 
TGATATOTAG 
AGAAGCAAAT 
OTCTATGOTC 
GTTCGCTATQ 



31 
I 

GACACAGCCA 
CCCQTCZGCT 
0C06CTCCCA 
OCCTGOGCTG 
GATTCCATCA 
AGACAATCCA 
TTGTTTGCAT 
AGTGGGTTAT 
CAAAGAGTAT 
TTCCACATAT 
ATATGAGA6A 
CCTCAGGG6A 
TAACAGTGTG 
AGTGTTAGGT 
ATTTTTGACA 
ATTGATATTT 
TATAT6CCT6 
TTAGCAAATO 
TTTTTTATGT 
TTGGCACCAT 
ATAGGTCCTO 
TCTTCACTCA 
CCAACTCAOO 



41 
I 

ACGTGGGGTC 
CCCQ0CCT6C 
T0GC0QAT6A 
CTOCTCTGCC 
TGTATCCCAG 
06TQTGGCTC 
CWfAdAfTTUCA 
ACTGGTQTCC 
GTGGCTTTQA 
TATTTCTGCA 
GTTACCrCAG 
TCCAOAOTTa 
CCTGQTTAAT 
CAATAACACT 
GACTATTTGC 
TTATACAAOT 
GXGCACAGTG 
ACA6ATTTCT 
TCCTCAAATC 
GGTATCATAT 
TGTTAAACAC 
GGTCTCA6CT 
ACTCCTACA6 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1660 

1920 

1960 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2887 



51 
I 

CCTTCTAGGC 

CCSIGCACrC 

CCGOGGGGAG 

TGGGTTTCCA 

GAGAGTCCAG 

AAGTGTCAAT 

TCTATCTGGT 

GATGTGAACA 

OCGTQATTCT 

GATGGTACAG 

GGGATCCAGA 

COQCAAOTCT 

ATTAATATTC 

GTATTTTAAT 

TAATGTATAA 

AATTTCCTGA 

CTTAGAASTA 

GTAAGCCTAT 

AGTGATAATT 

ATTAAAACAA 

TACACATTTO 

ATAATTCTGT * 

GTACTAGTCA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

760 

640 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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60 



Seq ID NO I C43 DNA Sequence 

KUclelc Acid Accession #t AF011468.1 

Coding sequence: 257.. 1468 



CTCATCTACC AGATTCTQCC TATGTAAAAT GAATTQAAAA ACAATTTTCT QTAATCTTTT 1440 

ATTTAAGTAG TGGGCATTTC ATAGCTTCAC AATOTPCCTT TTTTOTATAT TACAACATTT 1500 

ATGTG AGGTA ATTATTOCTC AACAQACAAT TAOAAAAAAG TCCACACTTG AAGCCTAAAT 1560 

5TTGTGCTTTT TAAGAATATT TTTAGACTAT TTCTTTTTAT ACGGC3CTTTG CTGAATTCTA 1620 

ACATTAAATC ACAGCXX3U\A ATTTGATGGA CTAATTATTA TTTTAAAATA TATGAAQACA 1680 

ATAA TT CTAC ATGTTGTCTT AAGATGGAAA TACAGTTATT TCATCTTTTA TTCAAOQAAO 1740 

T TTTAA CTTT AATACAGCTC AGTAAATGGC TTCTTCTAISA ATGTAAAGTT ATGTATTTAA 1600 

AGTTGTATCT TOACACAGGA AATGGGAAAA AACTTAAAAA TTAATATGGT GTATTTTTCC 1860 

- - AAATGAAAAA TCTCAATTGA AAGCTTTTAA AATGTAGAAA CTTAAACACA CCTTCCTGTO 1920 

lU GAGGCTGAGA TQAAAACTAG G6CTCATTTT CCTGACATTT GTTTATTTTT T6GAAGAGAC 1980 

AAA&ATTTCT TCTGCACTCT GA60CCATA0 GTCTCAOAGA GTTAATAGOA GTATTTTTGG 2040 

GCTATTGGAT AA GGAG OCAC TGCT6CCACC ACTTTTGGAT TTTATGGQAG GCTCCTTCAT 2100 

CGAATGCTAA ACCTTTQAGT AGAOTCTCCC TGQATCACAT ACCAGGTCAO GGAGGATCTG 2160 

TTCTTCCTCT AOGTTTATCC TGGCATGTGC TAGGQTAAAC GAAGGCATAA TAAGCCATGG 2220 

1 J CTGACCTCTG GAGCACGAGO TGGCAGGACT TGTCTCCATG IGTATCCATG CATTATATAC 2280 

CCTGGTGGAA TCACACGACT GTCATCTAAA GTCCTGGCCC TGGGCCTTAC TATTAOQAAA 2340 

ATAAACAOAC AAAAACAAQT AAATATATAT GGTCCTATAC ATATTOTATA TATATTCATA 2400 

TACAAACATG TATGTATACA TGACCTTAAT GGATCATA6A ATTGCAOTCA rrPGGTOCTC 2460 

TGCTAACCAT TTATATAAAA CTTAAAAACA AGAGAAAAGA AAAATCAATT AGATCTAAAC 2520 

ZU AGTTATTTCT GTTTCCTATT TAATATAGCT GAAGTCAAAA TATGTAAGAA CACATTTTAA 25 BO 

ATACTCTACT TACAGTTG6C CCTCTGTGGT TAGTTCCACA TCTGTGGATT CAACCAAOCA 2640 

AGG AOGGAAA ATGCTTAAAA AATAATACAA CAACAACAAA AAATACATTA TAACAACTAT 2700 

TTACTTTTTT TTTTTTCTTT TTGAGATGGA OTCTCQCTCT OTTGCCCAGG TTGGAGTGCA 2760 

^ GT60CACX3AT CTCGGpTCAC TGCAACXTTCA CCTCCCGGGT TCAAGAQATC CTCCTGCCTC 2820 

Zj AGCCTCCTGA GCAGCTGGGA CTACAGGCGC ATGCCACCAT GCCCAGCTAA TTTTTGTATT 2880 

TTTAOTAQAO GOGOGGTTTC ACCATGTTGG CCAG6ATGGT CTCAATCTCC TAACCTTTQAO 2940 

ATOCACCCTC CACAGCCTCC CAAACTOCTG G6ATTACAG0 C3GTGAGCCAC CGCAGGTAGC 3000 

ATTTACATTA GGTATTACAA GTAATGTAAA GATGATTTAA GTATACAGGA OQATGTGAAT 3060 

AGGTTATATG CAAGCACTAT GCCXnTTTAT ATAAGTGACT TGAACATCTG TGCCCGATTT 3120 

J\) TAGTATGT6C AGGGGG600A TCTGGGAATC AGTCCCCTGT GQATACCAAG GTACAACTGT 3180 

ATTTATTAAC OCTTACTAGA TOTGAGGAQA OTCTGAATAT TTTCAOTGAT CTTGGCTGTT 3240 

TCAAAAAAAT CTATTGACTT TTCAATAAAT CAOCTQCAAT CCATTTATTT CATTTACAAA 3300 

AGATTTATTQ TAAGCCTCTC AATCTTGGTT TTTCAGTTGA TCTTAAGCAT GTCAATTCAT 3360 

AAAAACAAGT CATTTTTGTA TTTTTCATCT TTAAGAATGC TTAAAAAAGC TAATCCCTAA 3420 

J J AATAGTTA6A TCTTTGTAAA TGCATATTAA ATAATAAAGT ATQACCCACA TTACTTTTTA 3480 

TGGGT GAAA A TAAGACAAAA ATAATAGTTT TAGT6AGGAT GGTGCTGAGT AAACATAAAA 3540 

ACTQATTPGC TCTCAGCTGA TGTGTCCTGT ACACAGTGOG AAGATTTTAO TTCACACTTA 3600 

GTCTAACTCC CCCATTTTAC AGATTTCTCA CTATATATAT TTCTAGAAGG GGCTATGCAT 3660 

. - ATTCAATGTA TTQAGAACCA AAGCAACCAC AAATGCATAA ATGCATAATT TATGGTCTTC 3720 

4U AACCAAGGCC ACATAATAAC CCAGTTAACT TACTCTTTAA CCAGGAATAT TAAGTTCTAT 3780 

AACTAGTACT CAAGGTTTAA CCTTAAAATT AAGATTTCCT TAACXriTAAC CTTAAAATTa 3840 

ATATTATATT AAACATACAT AATACAATGT AACTCCACTG TTCTCCTOAA TATTTTTTGC 3900 

TCTAATCTCT CTGCCGA AAG TCAAAGTGAT GGGAGAATTG GTATACTGGT ATGACTACGT 3960 

CTTAAGTCAG ATTTTTATTT ATGAGTCTTT GAGACTAAAT TCAATCACCA CCAGGTATCA 4020 

4D AATC AACTTT TATGCAGCAA ATATATGATT CTAGTGTCTG ACTTTTGTTA AATTCAGTAA 4080 

TQCAGTTTTT AAAAACCTGT ATCTGAOCCA CTTTGTAATT TTTQCTGOUV TATCCATTCT 4140 

OTAGACTTTT GAAAAAAAA6 TTTTTAATTT GAT6CCCAAT ATATTCTGAC C!GTTAAAAAA 4200 

TTCTTGTTta TATGGQAGAA GGGGGAGTAA TGACTTGTAC AAACaWJTATT TCTGGTGTAT 4260 

ATTTTAATGT TTTTAAAAAG AGTAATTTCA TTTAAATATC TGTTATTCAA ATTTGATGAT 4320 

GTTOAATGTA AT ATAAT QTA TTTTCTTTTT ATTTTGCACT CTGTAATTGC ACTTTTTAAG 4360 

TTTGAAOAGC CATTTTGGTA AAOaCTTTTT ATTAAAOKIO CTATOGAACA TAAAGTTGTA 4440 

TT GCATGCRA TTTAAAGTAA CTTATTTOAC TATGAATATT ATCX3GATTAC TGAATTGTAT 4500 

CaATTTGTTT GTGTTCAATA TCAGCTTTGA TAATTQTGTA CCTTAAGATA TTGAAGGAGA 4560 

AAATAGATAA TTTACAAGAT ATTATTAATT TTTATTTATT TTTCTTGGGA ATTGAAAAAA 4620 

J J ATT6AAATAA ATAAAAATGC ATTGAACATC TTGCATTCAA AATCTTCACT GAC 4673 



1 11 21 31 41 51 

I I 1(11 

GGAAGACTTG GGTCCTTGGQ TCXKSWJGTGG GAGCCX3A0G0 GTGGGTAGAC aSTGGGGGAT 60 

Oj ATCTCAGTGG CGGACGAGGA OGGOGGGGAC AAGGGGOGGC TGGTCXKSAGT GGOQGAGOGT 120 

CAA GTCOCC r GTCJGOTTCCT COGTCCCTGA QTGTCCITGG CGCTGCCTTG TGCCXJGCCCA 180 

G0GCCTTT6C ATCCGCTCCT GGGCACOGAG GGQCCCTGTA GQATACTGCT TQTTACTTAT 240 

TACAQCTAGA GQCATCATGQ ACCGATCTAA AGAAAACTGC ATTTCAGGAC CTGTTAAGGC 300 

TACAGCTCCA GTTGGAGGTC CAAAAOGTQT TCTOQTGACT C3U3CAAATTC CTTGTCAGAA 360 

i\) TCC ATTACCT 6TAAATAGT6 GCCAGGCTCA GCGGGTCTrG TGTCCTTCAA ATTCTTCCCA 420 

G0G06TTCCT TTCCAAGC31C AAAAGCTTGT CTCCAGTCAC AAGCCGGTTC AGAATCAGAA 480 

GCAGAAGCAA TTGCAGQCAA CCAGTGTACC TCATCCTGTC TCCAQQCCAC TGAATAACAC 540 

CCAAAAQAGC AAGCAQCCCC TGCCATCGGC ACCTGAAAAT AATCCTGAGG AGGAACTGGC 600 

ATCAAAACAG AAAAATGAAG AATCAAAAAA GAGGCAGTGO GCTTTGGAAG ACTTTGAAAT 660 

tJ TGG TOQCC CT CTGGGTAAAG QAAAGTTTGG TAATGTTTAT TTGGC3VAGAG AAAAGCAAAO 720 

CAAOTTTATT CTOGCTCTTA AAGTGTTATT TAAAGCTCAG CTGOAQAAAG CCX5GAGTGGA 780 

G CATC AGCTC AGAAGASAAG TAGAAATACA GTCCCACCTT OGGCATCCTA ATATTCTTAG 840 

ACTGTATGOT TATTTCCATG ATGCTACCAG AGTCTACCTA ATTCTGGAAT ATGCACCACT 900 

g-. TGGAACAGTT TATAGAGAAC TTCAGAAACT TTCAAAGTTT GATGAGCAQA GAACTGCTAC 960 

OU TTATATAACA 6AATT06CAA ATGCCCTOTC TTACTGTCAT TOGAAGAGAG TTATTCATAG 1020 

A GAOCT TAAQ CCAGAOAACT TACTrCTTGG ATCAGCTGGA OAGCTTAAAA TTGCAQATTT 1080 

TQGGTGOTCA 6TACATGCTC CATCTTCCAG GAGGACCACT CTCTGTGQCA CCCTGGACTA 1140 

CCTG OCCOCT GAAATGATTO AAOOTOGGAT GCATQATQAQ AAG0TG6ATC TCTGGAGCCT 1200 

TQGAGTTCTT TGCTATQAAT TTTTAOTTOG GAAGCCTCCT TTTGAGGCAA ACACATAGCA 1260 
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ACAOAOCTAC AAAAGAATAT CAGGGGTTGA ATTCACATPC CCTGACTTTG TAACAGAGGG 1320 

AGCCAOGGAC CTCATTTCAA GACTGTTCAA GOVTAATOCC JV6CCAQAGGC CAATGCTCAO 1380 

AGAAOTACTT GAACACCCCT GGATCACAGC AAATTCATCA AAACCATCAA ATTQCCAAAA 1440 

CAAAGAATCA GCTAGCAAAC AOTCTTAGQA ATCGTGCAGG GGGAGAAATC CTTGAGCCAQ 1500 

0 GGCTGCXaiTA TAACXTTGACA GGAACATCCT ACTBAAGTTT ATTTTACCAT TQACTGCTGC 1560 

CCTCAATCTA GAACGCTACA CAAGAAATAT Umi lTA CT O^GCAGGTGT GCCTTAACCT 1620 

OCCTATICAO AAAGCTCCAC ATCAATAAAC ATQACACTCT GAAGTGAAAG TAGCCAOSAS 1680 

AATTGTGCTA CTTATACTGO TTCATAATCT GGAGQCAAQG TTOGACTQCA GCCGCCXXCT 1740 

CAGCCTCTGC TAGGCATGGT GTCTTCACAG GAGGCAAATC CAGAGCCTGG CTGTGGGGAA 1800 

lU AGT GACCA CT CTGCCCTGAC CCOGATCAGT TAAGOAGCTG TGCAATAACC TTCCTAGTAC 1860 

CTGA OTGAOT GTGTAACTTA TTGGGTTG6C GAAGCCTGGT AAAGCTGTTG GAATGAGTAT 1920 

GTQATTCm TTAAGTAT6A AAATAAAOAT ATATGTACAO ACTTGTATTT TTTCTCTGGT 1980 

GGCATTCCTT TAGGAATGCT GTGTGTCTGT COGGCACCCC GGTAGGCXTTG ATTGGOTTTC 2040 

TAGTCCTCCr TAACCACTTA TCTCCCATAT GAQAQTGTGA AAAATAGGAA CACGT6CTCT 2100 

ID ACCTC CATTT AGGGATTTGC TTGGGATACA GAAQAOOCCA TGTGTCTCAO AGCTGTTAAG 2160 

GGCTTATTTT TTTAAAACAT TGGAGTCATA GOKTGTQTQT AAACTTTAAA TATGCAAATA 2220 

AATAAGTATC TATGTCTAAA AAAAAAAAAA AAA 2253 



20 



8eq ID NO: C44 DNA Sequence 
Nucleic Acid Accession ft: MM_013372 
Coding sequencer 63.. 617 ~ 



1 11 21 31 41 SI 

or I I I I 1 I 

ZrJ GCQGCCGCAC TCAGOGCCAC GCGTCGAAAG C3GCAGGCCCC GAGGACCCGC COCACTGACA 60 

GTAT6AGCGG CACAGCCTAC ACGGTGGGAO CXXTTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGG6AAAAAG AAAGGGTCCC AAGGT6CCAT CCCCCCGCCfi OACAAGGCCX: 180 

AGCACAATGA CTCAGAGCAQ ACTCAGTOGC CCCAGCAGCC TGGCTCCAG6 AACX3GGGGGC 240 

^ GGGGCCJ^AGG GCGGGGCACT GCCATGCCOG GGQAQGAGGT GCTGOAOTCC AGCCAAGAGG 300 

J\) CCCTGCATGT OACGGAGOGC AAATACCTGA AGC3QAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACOGC TTCTGTTACG 420 

GCCAOTGCAA CTCTTTCTAC ATCCCCA6GC ACATCOGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAOAAATTCA CTAOCATGAT GQTCACACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACAOGTGT GAAOCAOTGT OOTTGCATAT 600 

JJ CCATCGATTT GGATTAAGCC AAATCCAGGT GCAOCCAGCA TGTCCTAGGA ATGCAGCCCX! €60 

AGGAAGTCCC AGACXTTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTC CTGG CAGGAGCCTG CTTGTGCGTA GTTOGTGTGC ATGAGTGTGG 780 

ATGGGTGCCr GTGGGTGTTT TTAGACACCA GAGAAAACAC AOTCTCT GC T AGASAGCACT 840 

^ CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTAOC AOAAACCCAC CTCACXXCOO 900 

4U CTCACATCTA AAGGGGCGGG GCOGTGGTCT GGTTCTGACT TT C TO Trm ' GIGCXXTTCCT 960 

GGQQA CCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AQQCTCCTCT QAGGGCAAGA 1020 

GACC TGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAOTTO ATAOTGACTA TOTCAGTCTA 1140 

TGCXaAGGTT CCTAAATTAA TTCACTTAAC CATGAtGCWV ATOTTTTTCA 1200 

TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGQAGTQAQA AAGGGAGGGT GGAGGGTGAG GCCAAATCAQ OTCCAGCAAA AGTCAGTAGG 1320 

GACATT GCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAQTC 1380 

TTTTC CTAGT ATTTAACAGA ACCCAAGTGA ACAGAGaAGA AATGAGATTO OCAQAAAGTG 1440 

ATTAACTTTG GCOGTTGCAA TCT6CTCAAA CCTAACACCA AACTOAAAAC ATAAATACTO 1500 

DU ACCACrCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TC rG CTTTGT 1560 

CCCTCAGQTO GAAAAGAGAG GTAOTTTAQA ACTCTCTGCA TAGGGOTQGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAO TCAGCTCATT 1680 

TCCATTOCAC TATTTCCCAT AATGCTTCTG A6AG0CACTA ACTTGATTGA TAAA6ATCCT 1740 

- « GCCTCTGCTG AGTGTACCTG ACAOTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTQ GGGGTCTTTT TQTTTTAACT ATTGTCAGGA QATTGGQCTA 1860 

R AGAGA AGAC GACQAGAOTA AGGAAATA/yi GGGRATTOCC TCTGGCTAGA GAQTAAGTTA 1920 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGAGCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGG ATCTGAG OGGACOCIGT TAGQAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

^« CTACrGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACPGTCC 2100 

OU TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TOATCTAAGG GCX^AAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGQAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTO AATACTCTTT TTGCCTTOTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACXG CAGACTTQAG ATTCAGTT6C GGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTOM. GAAGCTGTTT TTATTTOGTT TTTGTTTTGA 2400 

00 TCCAGT6CTC TCCCATCTAA CAACTAAACA GQAGCCATTT CAAGGOGGGA GATATTTTAA 2460 

ACAOCCAAAA TGTTGGQTCT GATTTTC3UUI CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGGC GAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCXXACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTOCAA GAATAAAGOl CAGAGTGGAT TTAATTAAGC 2640 

ACACAAAT6C TAAGGCAQAA TTTTX3AGGGT GGGAGAGAAG AAAAGGGAAA QAAGCTGAAA 2700 

fV AT GTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGrrOTTTT AACTCXGCCA CAAGAATGCA ATTTCGTTAA TGGAQATGAC TTAAGTTGGC 2820 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC AOIGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAA6TAAA6A 6AATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTOTG OCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACXaXA 3060 

TTTTATTCGA GTCACTGATG ATGTAATQAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAO ATATTTOrGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT QATGTACA CT TTGTGCTTGO CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

„ OTCTGTAAOT T3ITTTTTGT TACTOTAGOT CTTCAAAGTT AAGAOTGTAA GTGAAAAATC 3300 

OV TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATOTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTA ATQTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TQTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTQAOTTOG GCAAAGAAGA AGCTQACSUav CCGTATGTTO 3480 

TTAGAGTCTT TTATCT GGTC A GGGGAA ACA AAATCTTQAC CCAGCTQAAC ATGTCTTOCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTOAATGTTC CTTAAAGOTT AACATTTCTA 3600 
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AAGCAATIVTT AAGAAAGACT TTAAATGTTA TTTIGGAAGA CTTAGGAT6C ATOTATACAA 3660 

ACGAATAGCA QATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAQT TAATGCCTAA QAGTGAAAGT 3780 

AGTTCTATTQ ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

J AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAQ GTACCTTQCT QTGTAGGAGG 3900 

ATBAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAQAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 

TAAATTAAAC CTATTCTTTC AAAAAAAAA 4049 

10 Seq ID not C45 DNA Sequence 

Kucleic Acid Accession ft : Eos sequence 
Coding sequence : 200 . . 2932 

^- 1 11 21 31 41 SI 

15 1 I I I I 1 

ATTGCTGATG GATCAGTGAG CCTGTGTTCA TGCCAGTGAG CTGCTGTGGC TCAGATACTG 60 

ATACTTTCTT TGCAAACAGC ATAAGAAQTQ ATTGA6CCAC AAGTATACTG AAGGAAGGGC 120 

TCGCTCGAGT TCTGGTGTQA AQA6ATAAAT CACCAGTCAC AOACTATGCA CCCX3ACT6CT 180 

GCTGTTCAGT CCAGGGAARA TGAAAGTTGO AOTOCTGTGO C i WrntTr TCTTCACCTT 240 

2U CACTGAOGGC CAOGGTGGCT TCXTTGGGGAA AAATQATGGC ATCAAAACAA AAAAAQAACT 300 

CATTGTGAAT AAQAAAAAAC ATCTAGGCCC AGTCGAAGAA TATCAGCTGC TGCTTCAGGT 360 

GACCTATAQA OATTCCAAGG AGAAAAGAGA TTTGAGAAAT TTTCTGAAGC TCTTGAAGCC 420 . 

TCCATTATTA TOGTCACATO G6CTAATTA0 AATTATCAGA GCAAAGGCTA CCACAGACTQ 480 

CAACAGCXnO AATOQAGTCC TGCAOTQTAC CTOTGAAGAC AGCTAGACCT 6GTTTCCTCC 54 0 

25 CTCATGCCTT GATCCCCAGA ACTGCTACCT TCAC3VG6GCT GQAQCACTCC CAA6CTGTGA 600 

ATGTCATCTC AACAACCTCA GCCAGAGTGT CAATTTCTrGT GAGAGAACAA AGATTTGGGG 660 

CACTTTCAAA ATTAATGAAA GGTTTACAAA TQAOCTTTTO AATTCATCTT CTGCTATATA 720 

CTCCAAATAT 6CAAATGGAA TTGAAATTCA ACTTAAAAAA GGATATGAAA GAATTCAAGG 780 

rrrTGAGTCQ OTTCAGGTCA CCCAATTTCG AAATGGAAGC AT0GTTGCT6 G6TATGAA8T 840 

30 TGTTGGCTCC AGCAGTGCAT CTGAACTGCT GTCAGCCATT 6AACATGTTG CCGAGAAGGC 900 

TAAGACAGCC CTTCACAAGC TGTTTCXATT AGAAGACGGC TCTTTCAGAG TGTTCGGAAA 960 

AG&CAGTGT AATGACATTG TCTTTGGATT TGGGTCXAAG GATGATGAAT ATACCCT6CC 1020 

CTGCAGCAGT 6GCTACAG6Q GAAACATCAC AGCCAAGTGT GftGXCCTCTa GGTGGCAGGT 1080 

^- CATCAGGGAG ACTTGTGTGC TCTCTCTGCT TGAAGAACTG AACAAGAATT TCAOTATGAT 1140 

35 TGTAGGCAAT GCCACTGAGG CAGCTGTGTC ATCCTTCGTG CAAAATCTTT CTGTCATCAT 1200 

TCGGCAAAAC CCATCAACCA CAGTGGGGAA TCTGQCTTCX3 OTGOrGTOSA TTCTOAGCAA 1260 

TATTTCRTCT CTGTCACTGG CXAGCCATTT CAGGGTGTCC AATTCAACAA TGGAGGAT67 1320 

CATCAOTATA GCTQAGAATA TCCTTAATTC ASCCTCAGTA A0C3VACTQQA CAGTCTTACT 1380 

. GCXSGGAAOAA AAGTATGCCA GCTCAOG G TT ACTAGAGACA TTAGAAAACA TGAGCACTCT 1440 

40 GGTGCCTCCG ACAGCTCTTC CTCTGAATTT TTCTCOGAAA TTCATTGACT GGAAAQQQAT 1500 

TCCAGTGAAC AAAAGCCAAC TCAAAAGOGO TTACAGCTAT CAOATTAAAA TGTOTCXXX^ 1560 

AAATACATCT ATTCCCATCA GAGGCOGTGT GTTAATTGGG TGA6ACCAAT TCCAGAGATC 1620 

CCTTCCAGAA ACTATTATCA GCATGGGCTC QTTGACTCTO GGQAACATTC TACCOGT CT C 1680 

CAAAAATG6A AATGCTCA60 TCAATGQACC TGTGATATCC AOQGTTATTC AAAACTATTC 1740 

45 CATAAATGAA QTTTTCCTAT TTTTTTCCAA GATAGAGTCA AACCTGAGCC AGCCTCATTG 1800 

TGTGTTTTGG GATTTCAGTC ATTTGCAGTG GAACGATGCA GGCTGCX3VCC TAGTOAATGA 1860 

AACTCAAGAC ATOGTGAC36T GCCAATGTAC TCACTTGACC TCCTTCTCCA TATTGATGTC 1920 

ACCrTTTGTC CCCTCTACAA TCTTC0C06T TGTAAAATG6 A TCAOCTATO Tt3GGACTGGG 1980 

TATCTCCATT GGAAGTCTCA TTTTATGCCT GATCATOGAG 6CTTTGTTTT G6AAGCAGAT 2040 

50 TAAAAAAAGC CAAACCTCTC ACACAOGTOG TATTT6CATG 6TGAACATAG CCCTGTCCCT 2100 

CTTGATTGCT GATGTCTGGT TTATTGTTGG TGCX3VCAGTG GACACCACGG TGAACCXnTTC 2160 

TOQAGTCTGC ACAGCTGCTG TGTTCTTTAC ACACTTCTTC TACCTCTCTT TGTTCTTCTQ 2220 

GATGCTCATG CTTGGCATCC TOCTGGCTTA COGQATCATC CTCGTGTTOC ATCACATGQC 2280 

CCA6CATTTG AT6ATGGCT6 TTGGATTTTG CCTGGGTTAT GGGTGCCCTC TCATTATATC 2340 

55 T6TCATTACC ATTGCTGTCA OGCAACCTAG CAATACCTAC AAAAGGAAAG ATGTOTGTTQ 2400 

GCTTAACTGG TCCAATGGAA GCAAACCACT CCTGGCTTTT GTTGTCCCTQ CACTGGCTAT 2460 

TGTGGCTGTG AACTTCGTTG TGOTGCTGCT AGTTCTCAC3V AAGCTCTGGA GGCOQACTGT 2520 

TG6GGAAAGA CTGAGTOGGG ATGACAAGGC CACCATCATC GGGGTGGGGA AOAGCCTCCT 2580 

CATTCTGACC CCTCTGCTAG G6CTCACCTG G66CTTTGGA ATAGGAACAA TAGTGGACAG 2640 

OO CCA6AATCTG GCTTGGCATG TTATTTTTGC TTTACTCAAT OCATTCCAGQ GATTTTTTAT 2700 

CTTATGCTTT GOAATACTCT TGGACAGTAA GCTGCGACAA CTTCTGTTCA ACAAGTT6TC 2760 

TGCCTTAAGT TCTTGGAAGC AAACAGAAAA GCAAAACTCA TGAGATTTAT CTGCCAAACC 2820 

CAAATTCTGA AAGCCTTTCA ACGCACT6CA AAACAAAGGC CATTATQCAT TTTCTCATAC 2880 

TGGAGATTCC TCOGACAACA TCATGCTAAC TCAGTTTGTC TCAAATGAAT AAQGGAAOOA 2940 

05 ATCATAAAAT CAAGAAAAAA TTTCCAGAAC AACT1GACAT TTAGAGACAA ATGTCAAT6A 3000 

AGAAATTATQ CTCAOTATTC GATOGGGTTT TCIQATTTAG GOaTCiaQGA ATAAAACAAG 3060 

AATGTCTCAO TGGCTTC31 3078 



70 
75 
80 



6eq ID NOt C46 DNA Sequence 

Nucleic Acid Accession «t NM_000584.1 

Coding sequences 75.. 3 74 " 

1 11 21 31 41 51' 

I I i I t I 

AGCAGAGCAC ACAAQCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA 6GAACCATCT 60 

CACTGTGTOT AAACATGACT TCCAAGCTGQ CCGTGGCTCT CTTGGCAQCC TTCCTOATTT 120 

CTGCAGCTCT GT6T6AAGGT GCAGTTrrGC CAAGGAGTGC TAAAGAACTT AGATGTCAGT 180 

GCATAAAGAC ATACTCCAAA OCTTTOCACC CCAAATTTAT CAAAGAACTG AGAOTGATTG 240 

AGAGTGGACC ACACTGGGCC AACACAGAAA TTATrGTAAA QCTTTCTGAT GGAAGAGA6C 300 

TCTQTCTGGA CCCCAAGGAA AACTGG G TGC AGAGGGTTGT GGAGAAGTTT TTGAA6AGGG 360 

CXGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 420 

GAAACTTCAA OCAAATCTAC TTCAACACTT CATGTATTGT GTGGGTCTGT TGTAGGGTTG 480 

CCAGATGCAA TACAAOATTC CTQGTTAAAT TTGAATTTCA GTAAACAATG AATAOTTTTT 540 

CATTOTAOCft TGAAATATCC AGAACATACT TATATGTAAA OTATTATTTA TTTQAATCTA 600 
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CAAAAAACAA 
TACAAATAGC 
GAATGGGTTT 
TTTGCCATAA 
CCTAGTCTQC 
TTTCTAAGTG 
TGGAAGCACT 
CTATTTATTA 
AGAAGATOAA 
OATATTAAAT 
i^AATTGGGT 
AGTACATTAT 
AGTCTTGTCA 
AACTATTAAA 
TGTTTATTAT 
AAGOCTTTAT 
TTCTQATTOT 
AAGTAAAAAA 



CAAATAATTT 
AAAATTBAGC 
GCTAOAATGT 
AGTCAAATTT 
TAGGCAGOAT 
GAAAAAOTAT 
TTAAGTTTTT 
TTTATGTATT 
TCATTGATTG 
GATGTTTTAT 
AGCCA6TTAA 
TGTTTATCTG 
TTGCCAGCTG 
ACAGCCAAAA 
GTACAA ATAG 
ATTTTTAACT 
ATGGAAATAT 
AAAAAAAAA 



TTAAATATAA 
CAAG66CCAA 
GATATTTGAA 
AGCTGGAAAT 
CCACAAGTCC 
TACCCACCAT 
TCATCATAAC 
TATTTAAGCA 
AATAGTTATA 
TAGATAAATT 
ATTTTCATTT 
AAAGTTTTAA 
TGTTGQTAGT 
CTCCACAGTC 
ATTCTTATAA 
TTAA6ATGTT 
AAAA6TAAAT 



QQATTTTOCT 
6AGAATATCC 
GCATCACATA 
CCTGGATTTT 
TTGTTCCACT 
CTTACCTCAC 
ATAAATTATT 
TCAAATATTT 
AAGATOTTAT 
TCAATCAGGG 
CAGATAAACA 
TTGAACTAAC 
0CTGT6TTGA 
AATATTAGTA 
TATTATTTAA 
TTTATGTGCT 
ATGAAACATT 



AGATATTGCA 
GAACTTTAAT 
AAAATGATGO 
TTTCTGTTAA 
GTGCCTTGGT 
AQTQATOTTG 
TTCAAGTOTA 
GTGCAAGAAT 
AGTAAATTTA 
TTTTTAGATT 
ACAAATAATT 
AATCCTAGTT 
ATTACGGAAT 
ATTTCTTGCT 
ATGACTGCAT 
CTOCAAATTT 
TAAAATATAA 



CGGGAGAATA 
TTCAQQAATT 

GACAATAAAT 
ATCTGGCAAC 
TTCTCCTTTA 
TGAS6ACATG 
ACTTATTAAC 
TTGGAAAAAT 
TTTTATTTTA 
AAACAAAGAA 
TTTTAGTATA 
TGATACTCCC 
AATGAGTTAG 
GGTTGAAACT 
TTTTAAATAC 
TTTTTACTGT 
TTTGTTGTCA 



Seq ID NO I C47 DKA Sequence 

Nucleic Acid Accession 8t NM_005603. 

Coding sequence i 1..37S6 



ATOAGTACAG 
GTOQTTCCCT 
GAACCAQAAC 
QAATGTACAT 
AACACAAAAT 
AAGTACAAGG 
AATTTATATT 
TGGTACACCA 
6TGGA0QAT6 
ATTAAOGATG 
OGTCTGAAAA 
AACASCCTCT 
ATGTCACTTG 
OGTTTTATTG 
TGGAGAAACA 
AGQAACACCX3 
AAGAA TAGTO 
GTTTACAOGA 
TATTGGGAAG 
CCCTCCTACC 
CCCATCTCTC 
TGGGACCTGC 
CTCAATGAAC 
CAAAATATCA 
GATGCCTCTC 
GCTGATGGQA 
6A60CA6AAQ 
AGGACTGATG 
GCTGCCAGGA 
GAACTGGGCA 
AAQOQAATGT 
6CTGACACTG 
GATGGCCTOG 
ATTGAAGAAA 
ACCAACOGQG 
CTGG6AGCTA 
CTTQCAAAAO 
AATATAGGAT 
ATTAATTCTC 
AAGTTTGCAC 
ATCACTGGTT 
CTQAAOCTQA 
AGGCTAGAAG 
AQCXKaOTCA 
AAQAGQTACA 
ATCAAAACTG 
TOSAGTGACT 
OGATGGTCTT 
TTTACTTTGQ 
GAGGATT6GT 
66GCTGCTGG 
GTGGGACAAA 
GTCCTAACAT 
CA6GATGGAG 
QTAATAACAG 
TTTTCAATTT 
G6AATACATG 
AGACAOOCAT 



11 

1 

AAAGAGACTC 
ACAGTGATGA 
AAAACCGAGT 
GGCAAGTCAA 
TCTTGTGTAT 
CATTTACCTT 
TCCT6GCTCT 
CACTAGTOCC 
TGGCTCGCCA 
GCAGGTTCAA 
AAAATGATTT 
GCTATGTGGA 
AAATCACAGA 
AATGT6AAGA 
CAAGTTTTCC 
ATTTCTGCCA 
GGAAAACCAO 

CACAGGTGGG 
GTGGATTCCT 
TCTATGTCAG 
AAATGTACTA 
A6CT0QGGCA 
TGACCTTTAA 
AACACAACCA 
AGCT7GCATT 
TA06ACAGTT 
GTCAGCrCAA 
ACTTTGGCTT 
CTGAAAGGAC 
CTATCATTGT 
TTATTTATGA 
ATATCTTTOC 
AAGAATTTAC 
ACXSAAGCTCT 
CAOCTATTGA 
CT8AGATTAA 
TTGCTTGTaA 
TTCTTCATGC 
CTCCTGTGCA 
CTTG6TTGAA 
AOTTCCCAAG 
CTAAGAAAGA 
TCTGCTGCCQ 
AQAAAGCCAT 
CCXACATTGG 
ATTCCTTTGC 
ACATAAGGAT 
TTCATTTCTG 
TCATCACCCT 
AGCAGGATGT 
GAGACTTACT 
GGAT6ATCCT 
AGGCACCTTC 
TCAATTTCCA 
TTGGAAGCAT 
TTCTCTTTCC 
ACMTtGGTr 



21 

I 

AGAAACGACA 
TGAAACAGAA 
CAACAGGGAA 
AGCAAAOGAT 
TAAGQAGAGT 
TATACCAATG 
TCTTATCTTA 
CCT6CTTOTQ 
TAAAATGGAT 
AGTTGCTAAG 
TGTTCCAGCT 
AACAGCAGAA 
CCAOTACCTC 
ACCCAATAAC 
TTTOGATGCT 
CGGCTTAGTC 
ATTTA AAA6A 
TCTTATTCTG 
CAATTCCTCT 
CATTTTCTQO 
GGTGGAAGTG 
TGCTGAQAAO 
GATCCATTAT 
AAAGTGCTGT 
CAACAAAATA 
TTATGACCAC 

CTACCAGGCA 
TGCCTTCCTC 
TTACAATGTT 
AA6AACCCCA 
A08QTTACAT 
AAATGAAACT 
AGAATGGAAT 
GGATAAA6TA 
AGACAAGCIA 
GATCTG G GT G 
ACTTCTGACT 
AAGGATGGAA 
GGAATCTTTT 
TGAAATTCTT 
AACAGAAGAA 
GCAGOGGCRQ 
OGTCACCJOCC 
CACGCTBGCC 
OGTTGGAATA 
TCAGTT006A 
GTGCAAGTTC 
GTACTCCTTC 
CTACAACGTG 
GAOTGACAAA 
ATTCAACTAT 
CTTCTTCATA 
CGACTACX3VG 
GATT6GCTT0 
TGCACTTTAT 
ATCTGCATTT 
AACTATCATC 



31 

1 

TTTGACGAGG 
GATGAACTTG 
GCAGAGGAGA 
CGCAAGTACC 
AAATATGCX3A 
AATCTOTTTG 
CftG GCRG TTC 
GTGCTGGGOG 
AA6GAAATCA 
TGGAAAGAAA 
GACATTCTCC 
CTGGAOGGAG 
CAAA6AGAAG 
CGACTAGATA 
GATAAAATTT 
ATTTTTGCAG 
ACTAAAATTG 
CTTTCTGCtG 
TGGTAOCTCT 
GGCTATATCA 
ATTOGTCTTO 
GACACACCCX3 
ATCTTCTCTO 
ATCAACX3G6C 
GAGCAAGTTQ 
TATCTTATTG 
CTOOCAGTTT 
GCCTCTC0CX3 
GCCAGGACCC 
CTTGCCATTT 
GAAGGCAATA 
OGAATOAATC 
CTTAGAAGCC 
AAAAAGTTTA 
TATGAGGAQA 
CAGGATGGAO 
CTTACTGGAG 
GAAQACACCA 
AACCAGAGGA 
TTTCCACCCG 
CTCGAQAAAA 
GAAAOAOGGA 
AAAAACTTTO 
AAGCAGAAGQ 
ATCGGAGATG 
AGTGGACAAG 
TATCTGCAGA 
CTAOGATACT 
TTCAATGGCT 
CTGTACACCA 
CT6AGCCTCC 
AAGAGATTCT 
OCTCTTGGAO 
TCTTTTGCCG 
GATACTTCTT 
TTTGGCATCA 
CAATTTACAO 
CTrOACrGTTG 



41 

1 

ATTCTCAGCC 
ATGACCAGGG 
ACGGGGAGCC 
AGGAACAACr 
ATAATGCAAT 
AGCAGTTTAA 
CTCAAATCTC 
TCACTGCAAT 
ACAATAGGAC 
TTCAAGTTGG 
TGCT6TCTAG 
AAACCAATTT 
ATACATTGGC 
AGTTTACAGG 
TGTTAOGTGG 
GTGCTQACAC 
ATTACTTGAT 
OTCTTGCCAT 
ATGAT6GAGA 
TTGTTCTCAA 
GACftGAGTCA 
CAAAA6CTAG 
ATAAGACGGG 
AGATATATGG 
ATTTTAGCTG 
A6CAAATCCA 
GCCACACA6T 
AT6AA6GTGC 
AGAACACCAT 
TGGACTTCAA 
TCAAGCTTTA 
CTACTAAGCA 
TATGCCTTTG 
TGGCTGCCAG 
TTGAAAAAGA 
TTCCAGAAAC 
ACAAAAAGGA 
CCATCTGCTA 
ATAQAGGTGG 
GTGGAAACXX3 
AGACCAAGAG 
TGGQGACCCA 
TGGACCTGGC 
CCATQGTGQT 
GGGCCAATGA 
AAG6AATGCA 
GOCTACTGCT 
TCXTTTACAA 
ACTCTGOGCA 
GCCTGCOCGT 
GATTGCCT6G 
TTGTAAGCTT 
CTTATCTGCA 
TCACCATTGC 
ATTOGACTTT 
TGTTTGACTT 
GCACAGC TTC 
CTGTOTOCTT 



CTTAATTCTC 
CATTTCAAAA 
AACTGCTGAA 
TGGGGAG6AT 
OGTCTACGCA 
TGCCTTAATC 
AAATAAGATT 
AAOTAAAAGG 
CTG06AGTGC 
GGACCTGGTG 
C6TGAACATG 
AGCTGTCATG 
GGTGCATGGC 
AAACTTTGCC 
GACTGCATAC 
GCTCCTCATG 
GTTATACATA 
GTTGCATGGG 
AACC6TAGGG 
CTCTGCTCTT 
TGTGAATGCT 
TGATAGTGCT 
aaaggctctg 
ACTAOCCXjTC 



660 

720 

780 

640 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

ISOO 

1S60 

1620 

1639 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1660 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 
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GTTGOCUTC GMTCCIIGTC AATGAOCATC TGGCOnCnS AAABIGMAA GNTCCA6AA6 3540 

CMTCOCAASC GOTTGAAOGC 0GAGGA6CAC T66CAG0QAC OOCRGCAGQT GTTCOOOOOa 3600 

GGCGTCTCAA OOCGQCGCTC GQCXTTAOQCC TTCTCGO^ ASOGGOQCTA COOGQACCTC 36S0 

. ATCTCCTCCG GGCGCAGCAT CCGCAAGAAG CGCTCGCOOC TTOATaCCAT COTGGOOGAT 3720 

0 GGCAC06CG0 AGTACAG606 CAOOQGOGAC A6CTGA 37S6 
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8eq ZD NO I C48 OKA Sequence 
Nucleic Acid Accession fit XH_044533 
Ceding sequence! 238.. 2751 

1 11 21 31 41 51 

I I I I I I 

GCTCTGCCCA AGCCGAGGCT OCGGGGCCGG OSCCGGCGGG AGGACTGCGG TGCCC0GCX3G 60 

AGGGGCTQAG TTTGCCAGGG CCCACTTGAC CCTGTTTCXX: ACCTCXX3GCC CCCCAGGTCC 120 

66A6GCGG6G GCCCCC6GGG CGACTOGGGG GOG6ACCX3CX3 GG606GAGCT GCCGCCCGTG 160 

AGTCOQQCOG A6CCACCTGA GOCGGAGCOQ G6G6ACACCG TOGCTCCTGC TCTCC6AATG 240 

CTGCGCACOQ OQATGGGCCT GAGGA6CTGG CTOGCOSCCC CAT66G60GC GCTQCCGCCT 300 

CGGCCACCGC TGCTQCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCX5ACC 360 

TGGGCGCTCR GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGOGGCC ATTCCTCAGA 420 

TTOSAAGCTG AACAOVTCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CT6TAGGTGG GTGCTOGAGA OGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAQT ACCAGGAGCT GCTTTGGGGT GCAGAC6CAG AGAAGAAACA 6CAGT0CAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTQT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

A6CGGCAQTC ACCTGTTCAC CTGTGGCACA GCA6CCTTCA GCCCCAT6T0 TACCTACATC 720 

AACATGGA6A ACTTCACCCT GOCAAGGGAC QAGAAGGGGA ATGTCCTCCT GGAAGATG6C 780 

AAGGGG06TT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TQATOGCGAG 840 

CTCTACaCTQ GAACAGTCAG CAGCTTCCAA GGGAATGACX: CGGCCATCTC GCGGAGCCAA 900 

AGCCTTOSCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TQCAAGACCC AGCTTTTGTQ 960 

QCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATQA CAAGATCTAC 1020 

TTTTTCTTCA GOGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGT6TCCCGC 1080 

ATTGCCCGCA TCT6CAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTOTGCTCA CGGCCOGAOQ ATGGCTTCCC CTTCAAOOTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT OQCaTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGC06T CTGTGTCTTC 1320 

ACAATGAAG6 ATGTGCAGAG AGTCTTCAGC GGCXTTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCXTTGOAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCA6AC0G GGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTGCGAA GCXX3CATGCT GCTGCXGCAO 1560 

CCCCAGGCTC GCTACX3U3CX3 CGTGGCTGTA CACCGGGTCC CTGGCCTGCA GCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGQC CGGCTCXACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 

CTCCTGGACA GCCACAGGGO 6CTGCTGTAT GC6GCCTGAC ACXCX3GG06T AGTCCAGGTG 1800 

CCGATGOCCA ACT6CAGCCT 6TACAGGAGC TGTGGGQACT GCCTCCTOQC C06G6ACCCC 1860 

TACTGTGCTT 06A6G66CTC CAGCTQCAAG CACX3TCAGCC TCTACCA6CC TCA6CTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCOCXaV AGQACCTTTG CA0C6CGTCT 1980 

TCGGrrGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CC3GCTCCTCT CCAACCTGGC GACCCX3ACTC 2100 

TGQCTAOGCA AOBGGGC OCC 0GTCAAT6CC TCG6CCTCCT 6CCAC6T6CT ACCCACTGG6 2160 

GACCTGCT6C TGCPTOGGCAC GCAACAGCTG GGGGAGTTOC AGT6CTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CA6CTACTGC CCAGAGGTG6 TGGAGGAOGG 6GTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACXXIGTC ATTATCAGCA CATOGCGTGT GAGTGCACCA 2340 

GCTGOTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAQTT CCTGGTGATG 2400 

TGCAOSCTCT TTCJTGCXOQC OGT G CTGCTC CCAGTTTTAT TCTTGCTCTA COOGGACOGO 2460 

AACAGCATGA AAOTCTTCCT GAA0CAGGG9 GAATGTOCCA GCGT6CACCC CAAGACCTGC 2520 

CCTGTGOTGC TGCCXXOTQA GACCOGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCOS 2580 

CTCGATCACC GAGGGTACX31 GTCCCTGTCA GACAGCCOCC C3GGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAQA AGAGGCCACT CA6CAT0CAA GACAGCTTCG TGGAGGTATC CCCAOTGTOC 2700 

CCC0Q6CCCC GOGTCOQGCT TGGCT CGGAG ATCCGTQACT CTGTGGTGIG A6AGCTGACT 2760 

TCCAGAG6AC GCT6CCCTGG CTTC3U36GGC T6T6AATGCT 0G6AGA0GGT CAACTGGACC 2820 

TCCCCTCOGC TCTGCTCTTC GTGGAACAOG ACCX5TGGT6C CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAA6CT CCTACCACCC AGACACCCAA 2940 

ACAGC06TGG CCCCAGAOGT CXZTGGCCAAA TATGGGGGCC TGCCTAG6TT GGTGGAACAG 3000 

TQCTOCTTAT GTAAACTQAG CCCmGTTT AAAAAACAAT T0CAAATGT6 AAACTAGAAT 3060 

GAGAGGGAAO AGATAGCATG GCAT6CAGCA CACA0G6CIG CTGCAOTTCA TOGCCTCCCA 3120 

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCOVCOGGCT GCCCTGTCTC ACTQCAQATT 3240 

plGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCX!AGTC AGCOGAGGAT GTAGTTGTTG 3300 

CTGCOGTCGT CCGACCACCT CAOGGACCAG AGGGCTAGGT TGQCftCTGOG GCCCTCACXIA 3360 

60TCCT06GC T0G6ACCCAA CTCCTG6AGC TTTOCAGCCT GTATCA66CT GTGGCCACAC 3420 

6AGAGGACAG 06CGAGCTCA GQAGAGATTT OOTQACAATG TAOGCCTTTC CCTCAGAATT 3480 

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC G'l"iH' n \ j C W GAGAAOCCOT GTGCCCCTTC 3540 

CCACCAXATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTQ 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAOGGATA 3660 

TCAACACTGC CCAGCACAGG G6CCCTGAAT TTATQT6GTT TTTATACATT TTTTAATAAG 3720 

ATGCACTTTA TQTCATTTTT TAATAAAGTC T6AAQAATTA CTGTTT 3766 

Seq ID NOt C49 DMA Sequence 

Nucleic Acid Accession «i NM_007019.1 

Coding sequencei 41.. 580 

1 11 21 31 41 51 

I I I I I I 

G6CA0GAGGG AGTTCCTGTC TCTCTGCCAA CGCCQGCCGG ATGGCTTCCC AAAAC060QA 60 
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CCCAGCCGCXr ACTAGCGTCC3 CGGCOOCCOO TAAAGGAGCT GAGCOSAGOG GGGGCGC06C 120 

CCGGGGTCCX5 GTGGGCAAAA GQCTACAGCA GGAGCTGATQ ACCCTCRTGA TGTCTGGOSA 180 

TAAAGGOATT TCTCCCTTCC CTGAATCAGA CAACCTTTTC AAATGGGTAG GGACCATCCA 240 

TGGAOCAGCT GGAACAGTAT ATQAAGACCT GAGOTATAAO CTCTOQCTAO AQTTGCCCAG 300 

TGOCTAOCXrr TACAATGOGC CCACAGTGAA GTTCCTCACG CCCTGCTATC ACCCCAAOGT 360 

GGACACCCAO OGTAACATAT GCXTTGGACAT CCTQAAGQAA AAGTGGTCTQ CCCTGTATGA 420 

TGTCAGGACC ATTCTGCTCT CCATCCAGAG CCTTCTAGGA GAACCCAACA TTGATAGTCC 480 

CTTGAACACA CATGCT6CG6 AGCTCTGGAA AAACCCCACA GCTTTTAAGA AGTACXTGCA 540 

AGAAACCTAC TCAAAOCAGQ TCACCAGCOk GGAGCCCTGA CCCA0GCT6C CCAGCXTTOTC 600 

CTTGTGTOGT CTTTTTAATT TTTCCTTAGA TGGTCTGTCC TTTTTGTGAT TTCTGTATAG 660 

GACTCTTTAT CTTGAQCTOT GGTATTTTT6 ' mWITm ' QTCTTTTAAA TTAAGCCTOG 720 

GTT6AGCCCT T6TATATTAA ATAAAT6CAT TTTTGTCCTT TTTTAAAAAA AAAAAAAAAA 780 

AAA 783 

Seq ID NOs CSO DMA Sequence 

Nucleic Acid Accession it iiiM_oi4584.1 

Coding sequencet 227.. 1633 

1 11 21 31 41 51 

I I I I I I 

GCACGAGCCC CQGGCTGCCG GCGCQGGCX5C CGCGGCACGT CCACAGGCTG GGTCGCGAGG 60 

TGGCGATCGC TGAGAGGCAG GAGGGCCGA6 GC6GGCCTGG GAGGCGGCCC GGAGGTGGGG 120 

CX3CCX3CTGGG GCCGGCCCGC ACGGGCTTCA TCTGAGGGCG CAOQGCCCX3C 6ACCGAGCGT 180 

GCGQACTGGC CTCCCAAGCX3 TGGGGCGACA AGCTGCCGGA GCTGCAATGG GCCOCGGCTG . 240 

GGGATTCTTG TTTGGCCTCC TGGGCGCOGT GTG6CTGCTC AGCTCGOGCC ACG6A6AGGA 300 

GCAGCCCCCG GAGACAGCGG CACAGAGGTG CTTCTGCCAG GTTAOTGQTT ACTT6GATGA 360 

TTOTACCTGT GATGTTGAAA CCATTGATAG ATTTAATAAC TACAGGCTTT TCCCAAGACT 420 

ACAAAAACTT CTTGAAAQTG ACTACTTTAQ OTATTACAAG GTAAACCTGA AGAGQCCGTG 480 

TCCTTTCTGG AATGACATCA GCCAGTGTGG AAGAAQGGAC TGTGCTGTC31 AACCATGTCA 540 

ATCTOATGAA GTTCCTQATQ GAATTAAATC TGOSAGCTAC AAGTATTCTG AAGAAGCCAA 600 

TAATCTCATT GAAGAATGTO AACAAGCTGA A06ACTTGGA GCAGTGQATG AATCTCTGAG 660 

TGAGGAAACA CAGAAGGCTQ TTCTTCAGTG GACCAAGCaT GATGATTCTT CAGATAACTT 720 

CTGTGAAGCT GATQACATTC AGTCCCCTQA AGCTGAATAT GTAOATTTGC TTCTTAATCC 780 

TGAGOGCTAC ACTGGTTACA AGGGACCAGA TGCTTGGAAA ATATGGAATG TCATCTACGA 840 

AGAAAACTGT TTTAAGCCAC AGACAATTAA AAGACCTTTA AATCCTTTGG CTTCTGGTCA 900 

AGGGACAAGT GAAGAGAACA CTTTTTACAa TTGGCTAGAA GGTCTCTQTG TAGAAAAAAG 960 

AGCATTCTAC AGACTTATAT CTGGCCTACA TGCAAGCATT AATGTQCATT TGAGTGCAAG 1020 

ATATCTTTTA CAAGAGACCT GGTTAGAAAA GAAATGGGGA CACAACATTA CAGAATTTCA 1080 

ACAGCGATTT GATGGAATTT TGACTQAAGG AGAAGGTCCA AQAAGGCTTA AGAACTTQTA 1140 

TTTTCTCTAC TTAATAGAAC TAAOOOCTTT ATCXJUUM3T0 TTACCATTCT TOGAGCGCXX: 1200 

AGATTTTCAA CTCTTTACTG GAAATAAAAT TCAOGATGAO GAAAACAAAA T6TTACTTCT 1260 

GGAAATACTT CATGAAATGA AOTCATTTCC TTTGCATTTT GATGAGAATT CATTTTTTGC 1320 

TGGGGATAAA AAAGAAGCAC ACAAACTAAA GGAGGACTTT CGACTGCATT TTAGAAATAT 1380 

TTCAAGAATT ATGGATTQTG TTGGTTGTTT TAAATGTCGT CTGTGGGGAA AGCTTCAGAC 1440 

TCAGG G TrrG GGCACTGCTC TQAAGATCTT ATTTTCTGAG AAATTGATAG CAAATATGCC 1500 

AOAAAGTGSA CCTA8TTATG AATTCCATCT AACCAGACAA GAAATAGTAT CATTATTCAA 1560 

GGCATTTGOA A6AATTTCTA CAAGTSTGAA AGAATTAGAA AACTTCAGGA ACTTGTTACA 1620 
GAATATTCAT TAAAGAAAAC AA6CTGATAT GTGCCTGTTT CTGGACAATG GAGGCXSAAAG 1680 
AGTGGAATTT CATTCAAAGG CATAATAGCA ATGACAGTCT TAAGCCAAAC ATTTTATATA 1740 
AAGTT6CTTT TGTAAAGGAG AATTATATTG TTTTAAGTAA ACACATTTTT AAAAATT6TO 1800 
TTAAOTCTAT GTATAATACT ACTGTOAGTA AAAGTAATAC TTTAATAATG TGGTACAAAT 1860 
TTTAAAGTTT AATATTQAAT AAAAGGAGQA TTATCAAATT CATATATGAT AAAAGT8AAT 1920 
GTTCTAAGTC TCTCRAACTA GCX3TTTTATG TAATAATATG TAATATAAAT AAAACTATGQ 1980 
TAAATGTGAC AAGCATTTAA TAGGAAAATG CTAAGGAGGC CTCATAAATG ACCCATAATT 2040 
ACCAAG6TAG AATTTTTCA6 TACATTTAGG GTTGCTGGAT TTAGCAAATA AAAATAAAGA 2100 
TTGCGCAGTT AGATTTGAAT TTCAGATAAA GAATTAGTTT TTTAATATTT TACATG6AAT 2160 
ATTTGGAAAA TACTTATACT AAAAAATTAT TT6TTTGAAA TTCACATTTA ACTGGGAGTC 2220 
TTGTATTTTA TCTGGCAATC CTAAAATACa TTGGTATGAA ACAAATCACT TTTAGAAOTA 2280 
TATTGCTATT TTGATTGOaT TGTTTTTGTG TGTAGAAACG TACAATAACA ACTCAAAGGC 2340 
ACAGQAGATT TCTAAACATT GTGAAAA6TT GAATAGATTA TATATTTATT CTCATAATAC 2400 
TTTCACTAAT ACTAAATAAA ATTT80QGAA CACTTTTTAT TTTTATATAA TTTCCAATTT 2460 
ACAGAAAAGT TTCAAAAATA 8TACAAAGAG CTCTCTTACC CAGATTCACT AATTGTTCAT 2520 
ACOTQCTTTA TCTTTCATGC TTTCTCTGTA CAC31CACACA CACACACAAA TTTTTCCTCA 2580 
ATCATTTGAA AGTCAOTTAT AGGCATCATG CCCCTTAAAC CCTAAATACT TCAGTGTGTA 2640 
ATACTGAATA ATTACTAAAA AT6ATTTTCT CAGAAAAAAA AACTCCCACA ATTCTGGAAC 2700 
TA TAATACTG TAAGCCTTAO AATAAATAAT ACTTTCAAGT TCCAATCTAA A6TTCTTT7T 2760 
GAGTTTTGTT 6CtXX3TTTTA TGCTTGATGT GTATAGTAAT AGGGTAGGCT ATTTATTTTA 2820 
TTAAAATTTT TTTTAGAGAC AAQGTTTTGC TO'i'O'n'G C CC AAGCTGGAAC TTGAAOGACT 2880 
GGGCTGAAGT GATCTTCCCA CCTCAGCCTC CCAAGTAGCT GGGAATACAG GTQTCTQCCA 2940 
CCATACCCAO TTTCATTTTT GTTTTTTATA CCOGAAGTTC ATTTCCTTTG TCTCCCTAAA 3000 
ACTGAACTQT AATTTTGGQA GGTTTTCATT AOIGQAAGCT CTTCATTTAT AAAGCTATTT 3060 
OAAGGGGTTT AGQAATTTAT ATCACATG6T AATT6TAGAO AAAAAGAAGC TATATACCTC 3120 
AAAATOGTGC CCTCTTTACA TATGTCTTAT CAGGTATAAC ATGTTG^UUVT GTCACATTAG 3180 
TAGTAAAGTG GGGTTTATTT ATATAGTGGT TAAGAAATGT CAGTTTACAC TGCTGTATAC 3240 
TTCTTCTTCT GTGTCCCTAA GGGCTGGTAC AGTGCCAAGC ACATACTTGG TATCCAATAA 3300 
ATATTTGTEG 6ATQAAAAAA AAAAAAAAAA AAAA 3334 



Seq 10 NO: C51 DMA Sequence 
Nucleic Acid Accession S: NM_002886. 
Coding sequence: 3 7.. 72 3 



11 21 31 



I 

1229 



wo 03/042661 



PCTAJS02/36810 



10 
15 
20 
25 
30 
35 
40 
45 



55 
60 
65 
70 
75 
80 



CCAOGTCCGG GGTGCCGAGC CAACTTTCCT GCX5TCCATGC AGCCCGGGCG GCAACXX5CTG 60 

CCCOCTCCCT GGTCOGGGCC CAGGOGCCCG CGCCCCACCG CCCOQCTGCT CQOGCTGCTG 120 

CTGTTGCTCG CCCCGGTG6C GGCGCCOGGG GGGTCCC3GG6 GCCCOQAOQft OCCTGG6CAG 180 

CCTGAOQMti CTG6GGTGCC GGGCAGGCTC CTGCAGCftQA AGGCGOSGGC GGOGCTTCftC 240 

TTCTTCAACT TC06GTC06G CTCX3CCCAGC G0SCTGC6AG TGCTG G CO O A GGTGCA6QAG 300 

GGCCGOGOGT GGATTAATCC AAAAGAGGGA TGTAAAGTTC ACGTGGTCTT CAGCACAGAG 360 

CGCTACAAOC CAGAGTCTTT ACTTCAGOAA GGTGAGGGAC GTTTGGGGAA ATGTTCTGCT 420 

OGAGTGTTTT TCAAGAATCA GAAACCCAGA CCAACCATCA ATGTAACTTG TAC31C GG CTC 480 

ATCGAOAAAA A6AAAAGACA ACAAGAGGAT TACCT6CTTT ACAAGCAAAT GAAGCAACTO 540 

AAAAACCOCT TGGAAATAOT CA6CATACCT GATAATCAT6 GACATATTGA TCCCTCTCTG 600 

AGACTCATCT GGGATTTGGC TTTCCTTGGA AGCTCTTACG TQATGTGGGA AATGACAACA 660 

CAGQTGTCAC ACTACTACTT GGCACAOCTC ACTAOTGTGA GGCAGTOOGT AAGAAAAACC 720 

TGAAAATTAA CTTGTGCCAC AAGAGTTACA ATCAAAGTGQ TCTCCTTAGA CTGAATTCAT 780 

6TGAACTTCT AATTTCATAT GAAGAGTTGT AATCACATTT ATTTCAATAA ATATGTGAGT 840 

TCCTGC 846 

Seq ID NO: C52 DMA Sequence 

Nucleic Acid Accession ft: NM_005409.a 

Coding seguencet 94.. 3 78 

1 11 21 31 41 51 

I I 1 i I i 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGA6 CA6CAAAGCT GAAGTAGCAG 60 

CAACA6CACC AGC3U3CAACA QCAAAAAACA AACATOAGTG TGAAG66CAT GGCTATAGGC 120 

TTGGCTGTGA TATTGTGT6C TACAOTTGTT CAAGGCTTCC CCATGTTCAA AAOAGGAOQC 180 

T6TCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACXZCT GAAAGAAAAT 300 

AAAGGACAAC 6AT6CCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATtSAAGTC CTQOAAAAGa GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTQACTACT6 AAAT6ACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT ATGOTTTTOT QACTTTCAAC TTTTGTACAG TTATGTQAAQ GATQAAAQGT 540 

GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT 6AATGAC3VAT CAGAATTCCA 600 

CTGCCCAAAG QAQ TCCAG CA ATTAA ATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTAGCATC G6AGTTTACA AAGTGCTTTC AOOTTCTTAC TTGTTOTATT ATACA7TCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGT6ACTAT 780 

GAGAACATTT CTGTCTCTJU5 AAGTTATCTO TCTOTATTGA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAQ 900 

CATCTATOTG TOOTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTA6CACAT CAATAT6TA6 GQAAACATTC TTATGCATCA TTTGQTTTGT 1020 

TTTATAACXA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATAOGCTA 1080 

TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT A6CAGCACCX5 QTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT O^TTGAGATG TTTTQAAGCA ATTAGGATAT GTGTGTTTAC 1200 

TGTACTTTTT OTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TT6TTCAT6C CTATATACTG TAAAATTTAO GTATACTCAA QACTAOTTTA AAGAATCAAA 1440 

GTCATTTTTT TCTCTAATAA ACTACOUaA CCT TTCTTT T TTAAAAAAAA AAA 1493 



cr\ 8^ ^* C53 JXm Sequence 
5U Nucleic Acid Accession tfi F6BNESU predicted 
Coding sequence! 1..609 



1 11 21 31 41 51 

I I i I I I 

ATGCTGCGGC AGGTGCTTCG CAGAGGGCTC CAGTCX?rTCT GCCACAGGCT GGGTTraTQC 60 

GTGAGCCGGC ACCCGGTCTT TTTCCTCACC GTGCCCGCAG TCCTGACAAT CACCTTOQGC 120 

CTCAGC6CGC TCAACCGCTT CCAGCOOGAG GGCGACCTGG AGCGCCTGGT CGCTCXXZAGC 180 

CACAGCCTGG CCAAGATC6A GCGCAGCCTG GCCAGCAGCC TTTTCCCCCT GGACCAGTCC 240 

AAAAGCCAGC TCTATTCGGA CTTACACACC CCTGGGAGGT AT6GCAGGGT GATCCTCCTC 300 

TCCCCAACC6 GGGACAATAT TTTGCTCCAO GCTGAGGGGA TCCTGCAGAC CCACCGAGCC 360 

GTOCTGGAAA TGAAGGTGAA CCACAAGGGC TATAATTATA CTTTTTCCCA TCT G TGTGTG 420 

TTGAQAAATC AGGATAAGAA ATGCOTGCTO GATQATATTA TTTCAGTOCT AGAGGATCTC 480 

AGGCAGGCTG GOSTCTGCAA 7AAGACAACA GGCAQGGTGC AAQTGAGGTA TCCCAACACT 54 0 

AAATTAAAGG TATGCTCCTT CT6CATGCTT CTQGCAATTA AAOAOaCAOC ACTTCATTTC 600 

TTGCCCTAA 609 

Seq ID KO: 054 DMA Sequence 

Nucleic Acid Accession fit NM_002438.1 

Coding sequence: 104.. 4474 " 

1 11 21 31 41 51 

I I I I I I 

GGQAACTTGG ATTA6QTGQA GAGGCAGTTG GGGGGCCTOG TTGTTTTGCG TCTTAGTTCC 60 

GCCCTCCTGT CCATCAOGAG AAGGAAAGGA TAAACCXTTOG GCCATGAGaC TACCCCTGCT 120 

CCTGGTTTTT GCCTCTQTCA TTCCGGGTGC TGTTCTCCTA CTGGACACCA GGCAATTTTT 180 

AATCTATAAT GAAGATCACA AGCGCTGCGT GGATGCAGTG AQTCCCAGTG CCGTCCAAAC 240 

CGCAGCTTGC AACC3VGGATG CCGAATCACA GAAATTCCQA TGGGTGTCCG AATCTCAGAT 300 

TATGAOTOTT GCATTTAAAT TATGCCTGGG AGTGCCATCA AAAACAGACT GGGTTGCTAT 360 

CACTCTCTAT GCCTGTGACT CAAAAAGT6A ATTTCAGAAA TOGGAOTGCA AAAATQACAC 420 

ACTTTTOGGO ATCAAAGGAG AAQATTTATT TTTTAACTAC GGCAACAGAC AAGAAAAGAA 480 

TATTAT6CTC TACAAGGGAT CX30GTTTATG 0AGCAG0T60 AAGATCTAT6 GAACCACAGA 540 

CAATCT6TGC TCCAGAGGTT ATGAAGCCAT GTATAC6CTA CTAGGCAATG CCAATGGAGC 600 

AAGCTGTGCA TTCC06TTCA AGTTTQAAAA CAAGTGGTAC 6CAGATTGCA OGAGTGCTGG 660 

6CG6TCGGAT GGATGGCTCT GGTGGGGAAC CACTACTGAC TATGACACAG ACAAGCTATT 720 

1230 



wo 03/042661 



PCT/US02/36810 
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TGGATATTGT CCATTGAAAT TTGAGGGCAG TGAAAGCTTA TGGAATAAW3 ACCOGCTGAC 780 

CAQCX5TTTCC TACCAGATAA ACTCCAAATC CGCTTTAACG TGGCACCAAG OGAOSAAAAG 640 

CTGCCAACAA CAGAACGCTG AGCTCCTGAG CATCACAGAG ATACATQAGC AAACATACCT 900 

6ACAGGATTA ACCAGTTCCT TGACCTCAGG ACTCTGOATT GGACTTAACA 0TCTGA6CTT 960 

CAACAGCGOT TGGCAGTGGA GTQACCGCAG TCCTTTCCGA TATTTGAACT GGTTACCAGG 1020 

AAGTCCATCA GCTGAACCTG GARAAAGCTG TQTGTCACTA AATCCTGGAA AAAATGCTAA 1080 

ATGGGAAAAT CTGGAATGTG TTCAGAAACT GGGCTATATT TGCAAAAAGG GCAACACCAC 1140 

TTTAAATTCT TTTOTTATTC CCTOUSAAAG TGATGTGCCT ACTCACTGTC CTAGTCAGTG 1200 

GTGGCCGTAT GCXX3GTCACT GTTACAAGAT TCACAGAGAT GAGAAAAAAA TCCAGAGGGA 1260 

TGCTCTQACC ACCTGCAGGA AGGAAGGOGG TGACCTCACA AGTATCCACA CCATCGAGGA 1320 

ATTGGACTTT ATTATCTCCC AQCTAGGATA TGAQCCAAAT GAGQAATTGT GOATCGGCTT 1380 

AAATGACATT AAGATTCAAA TGTACTTTGA GTGQAGTQAT 6GGA0CCCTG TAACGTTTAC 1440 

CAAATGGCTT CGTGGAQAAC CAAGCCATGA AAACAACAGA CAGGAGGATT GTGTGGT6AT 1500 

GAAAGGCAAG QATGGGTACT GGGCAGATC6 GGGCTOTGAG TGGCTCTCTTO GCTACATCTG 1560 

1^ CAAOATGAAA TCAC6AAGCC AAGGTCCAGA AATAGTGGAA GTOQAAAAAG OCTGCAGGAA 1620 

AGGCTGGAAA AAACATCACT TTTACTGCTA TATGATTGQA CATAOQCTTT OU^TTTOC 1680 

AGAAGCAAAC CAAACCTGTA ATAATGAGAA TGCTTATTTA ACAACTATTO AAOACAGATA 1740 

T GAAC AAGCC TTCCTGACTA QTTTCGTTGG CTTAAGGCCT GAAAAATATT TCTGGACAGG 1800 

_ _ ACTTTC3VGAT ATACAAACCA AAGGGACTTT TCAGTGGACC ATOGAGGAAG AGGTTOGGTT 1860 

ZU CACCCACTGO AATTCAGATA TGCCAGGGGG AAAGCCAGGG TGT G TT G CC A TGAGAACOGG 1920 

GATT6CAGGG GGCTTATGGG ATGTTTTGAA ATGTGATQAA AAQQCAAAAT TTGTOT O CAA 1980 

GCACTGGGCA GAA60AGTAA CCCACOCACC GAA6CCCAGG A0QACTCCC6 AACXXAAATG 2040 

TCCGGAG6AT TGGGGOGCCA GCAGTAQAAC AAGCTTGTGT TTCAAGCT6T ATGCAAAAGG 2100 

AAAACATGAG AAGAAAAOGT GGTTTGAATC TOGAGATTTT TGTCGAGCTC TGGGTGGAGA 2160 

CTTAGCTAGC ATCAATAACA AAQAG6AACA 6CAAACAATA TGGOGATTAA TAACA6CTAG 2220 

TG6AAGCTAC CACAAACTGT TTTGGTT0Q6 ATTGACATAT GGAAGGCCTT CAQAAGGTrr 3280 

TACTTGGAGT QATGGTTCTC CTGTTTCATA TGAAAACTGG OCTTATGGAG AACCTAATAA 2340 

TTATCRAAAT GTTGAATACT GTGGTGAGCT GAAAGGTGAC CCTACTATGT CTTGGAATGA 2400 

TATTAATTGT GAACACCTTA ACAACTGQAT TTGCCAGATA CAAAAAGGAC AAACACCAAA 2460 

ACCTGAGCCA ACACCAGCTC CTCAAQACAA TCCACCAGTT ACTGAAQATO GGTGGGTTAT 2520 

TTACAAAOAC TACCAGTATT ATTTCA6CAA AOAGAAGGAA ACCATGGACA AT60GGQAGC 2580 

GTTTTGCAAG AGGAATTTTG GT6ATCTTGT TTCTATTCAA AGTGAAAGTG AAAAGAAGTT 2640 

TCTATGGAAA TATGTAAACA GAAATGATGC ACAGTCTGCA TATTTTAtTG QTTTATTGAT 2700 

CAGCTTGGAT AAAAAGTTTG CTTGGATGGA TGGAAGCAAA GTGGATTACG TGTCTTGGGC 2760 

CAC AGGTGA A CCCAATTTTG CAAATGAAGA TGAAAACTGT GTGACCATGT ATTCAAATTC 2820 

AGGGTTTTGQ AATGACATTA ACTGTGGCTA TCCAAAOQCC TTCATTTGCC AGCGAGATAA 2880 

CAGTAGTATC AATGCTACCA CA6TTATGCC TACCATGCCC TOGGTCCCAT CAGGG7GCAA 2940 

GGAAGGTTGG AATTTCTACA GCAACAAGTG TTTCAAAATC TTTGGATTTA TGGAAGAAGA 3000 

AAGAAAAAAT TGQCAAGAGG CACGAAAAGC TTGTATAGGC TTTGGAGGGA ATCTGGTCTC 3060 

CATACAAAAT GAAAAAGAGC AAGCATTTCT TACCTATCAC ATGAAG6ACT CCACTTTCAG 3120 

TGCCTGGACT QGGCIOAATQ ATGTCAATTC AGAACACAOO TTCCTTT GG A 0GGATG6ACG 3180 

AGGAOTCCAT TACACAAACT GGGGGAAAGO TTACCCTGGT GGAAOAAGAA GCAGTCTTTC 3240 

TTATGAAGAT GCTGACTGTG TTGTTATTAT TGGAGGT6CA TCAAATQAAG CAGGAAAATG 3300 

GATGGATGAT ACCPGCX3ACA GTAAACGAGQ CTACATATGC CAGACACGAT CCX3ACCCTTC 3360 

CTTGACTAAT CCTCCAGCAA CGATTCAAAC AGATGGCTTT GTTAAATATQ GCAAAAGCAG 3420 

CTATTCACTC ATOAGACAAA AATTTCAATG GCATGAAGGG GAGACATACT GCAAGCTTCA 3480 

CAATTCCCTT ATAGCCAGCA TTCTGGATCC CTACAGTAAT GCATTTGOGT GGCTGCAGAT 3540 

GGAAACATCT AATGAAOGTG TGTGQATCX5C CCTGAACAGT AACTTGACTG ATAATCAATA 3600 

CACTTGGACT GATAAGTGGA GGGTGAQGTA CACTAACTGG GCTGCTGATG AGCCCAAATT 3660 

GA AATC AGCA TGTGTTTATC TGGATCTTGA TGGCTACTGG AAGACAGCAC ATTGCAATGA 3720 

AAGTTTTTAC TTTCTCTGTA AAAOATCAOA TGAAATCCXTT 6CTACTQAAC CCCCACAACT 3780 

GCCTGGCA6A TGCTOSGAGT CAGATCACAC AGCATGGATT CCTTTCCATG GTCACTGTTA 3840 

CTATATTGAO TCXTTCATATA CAAGAAACIQ GGGCCAAGCT TCTCTGGAAT GTCTTOGAAT 3900 

GGGTTCCTOT CTGGTTTCCA TTGAAAGTGC TGCAGAATCC AGTTTTCTGT CATATOGGGT 3960 

TGAGCCACTT AAAAGTAAAA CGAATTTTT6 GATAGGATTG TTCA6AAATG TTGAAGGGAC 4020 

G TGGC IGTGG ATAAATAACA OTCOGGTCTC CTTTQTGAAC TGQAAC3U3U3 6AQATCCCTC 4060 

TOOTGAAOGG AATGATTGTG TAGCTTTACA TGCGTCTTCT GGGTTTTGGA 6TAATATTCA 4140 

CTGTTCTTCC TACAAAGQAT ATATTTGTAA AAGACCAAAA ATTATTGATG CTAAACCTAC 4200 

^ TCATGAATTA CTTACAACAA AA6CTGACAC AAGGAAGATG GACCCTTCTA AACOGTCTTC 4260 

OU CAACGTGGCC GGAGTAGTCA TCATTOT6AT CCTCCTGATT TTAA0GG6TG CTGGCCTTGC 4320 

OQCCTATTTC TTTTATAAGA AAA6A08T6T GCACCTAGCT CAAGAGGG09 CCTTTGAAAA 4380 

CACTCTGTAT TTTAACAGTC A6TCAAG0CC AGGAACTAGT 6ATATGAAA6 AT CT OGrPGGG 4440 

CAATATTGAA CAGAATGAAC ACTCOGTCAT CTAGTACCTC AATGCGATTC TGAGATATTT 4500 

GAATTTCATA AAATTGTAAC TGAAATTTAA AATTTTTAGT TCAATGTGAT TGTTTTCTTT 4560 

AAA ATGAGTA CTGAATTQTA CTOGTCTOTC CTTTTTTCCT TTGCCTAATT GAAGAAATAA 4620 

TTGCTTQTTT TCTAGGCTGG CAAGATATTT TOITAAAAGA GGGATAACAA TGCTQATTAC 4680 

TACCTTTTAA AATATTTTA6 ATAAATGCAC A6CACCACAG CAGCACATCT AAGCATTAGT 4740 

GATGGGTAGC TGATGTCAGC TTCATGTGGA TTTTAAGCAC TCTAGAAACA ATGAAQCTTC 4800 

TTGOCATATT TTAAGGAGCT CCCAAAATOT GTTACCTATT AAATTGTAAC TCAGOUWSTA 4B60 

GAAGAGCATT TGAAAAOTCA GGTACAAATT TOCTCAAGTG GCATAAAAAT GTA6TCAGTT 4920 

TTCTCTTTTA CCAGTTTTTA TTTCCACTCC AATTATTTAO AACTTTATTT GTACATGTGC 4980 

AGAAGAATAA GOCAGCTQAG AATCTTGTTT CGCCCAAGAO AGTTTTACAO OCrGAGTGTT 5040 

GCAAATGTGT TCITTGTCCT GTTATATGTA TATCAG6AAT ACAAGOATGT OAAATAAAAC 5100 

TOTAAATTTG CATAACTGGA TGTACTTAQA TAATGTGAAA TAAACATTAA A6ACAAGGTC 5160 

TATTTTTAAT AAAAAAAAAA AAAAA 5185 
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Seq ID MO; CSS DHA Sequence 

Nucleic Acid Accession ft: NM_024574.2 

Coding sequence} 424.. 2130 

1 11 21 31 41 51 

111))) 

AGTGOQACTA 60GGCQ6C0G GCBAA6C0CG CGCCAGCCCC GCGGOTCGCT G006CATCCT 60 

CCCGCCGCCC AGCCCC06CA OGCTCOCT G C AGTTTAAAAG GACCTCCCGC C6CTTCTCGG 120 
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CGCTGCCCCG GGATTCCCCA GCCC0GCGC6 GCTCCCTACT CCACTTCGCA GCAACTTOGG 180 

OCSACXrOCGOG COGCCCGCCC TCGCCXX5CCT TTOAAOTTra CTGTGCCGAC CGCAAAGTTG 240 

GGACACTTCA GCOGATTQAA TTTTTCTCTT TTATCTGCCT COGTCCOCQC CCTGCAGGCT 300 

TCTCGTTCCr GGATATTGGT GCTTAGCATC TTGGCAGGGT C06GGQA0ST OGACTATTTC 360 

GCACAGCACA CCA0G6GGAG GGATTTTTTT CTATTTTCCC TA08AAAAAC AGATCTTTIT 420 

AAGGATGGTG CTGCTCCACT GGT G CCTGCT GTGGCTCCTG TTTCCACTCA QCTCAAGQAC 480 

CCAGAAGTTA CCCACXXXK5Q ATQAQOAACT TTTTCAGATG CAGATCCGGG ACAAGQCATT 540 

TTTTCATGAT TCGTCAGTAA TTCX»GATGa AGCTGAAATT AGCAQTTATC TCTTTAQAGA 600 

TACA OCTAAA AOOTATTTCT TTGTGGTTGA AGAAGACAAT ACTCCATTAT CAGTCACAGT 660 

GAaSCCCTGT GATG06CCTT TGGAGTGGAA GCTGAGCCTC CAGGAGCTGC CAOAGGACAG 720 

GAGOGGGGAA GGCTCAGGTG ATCTGGAACC TCTTGAGCAG CAGAAOCAGC AOATCATTAA 780 

TGAQOAAGGC ACTQAGTTAT TCTCCTACAA AGGCAATGAT OTTQAGTATT TTATATOGTC 840 

TAGTTCCCXA TCCQGTTTAT ATCAOTTGQA TCTTCTTTCA ACAGAGAAAQ ACACACATTT 900 

_ CAAAGTATAT GCCACCACAA CTCCAGAATC TGATCAGCCA TACCCTGAGT TACXTCTATQA 960 

1 J CCCAA6AGTA GAT6TGACCT CACTGOGOaS CACCAOGGTC ACTTTGGCCT GGAAACCAAO 1020 

CCCCACTGCC TCTTTGCTGA AACAACCCAT TCAGTACTGT OTOOTCATCA ACAAAQAOCA lOBO 

CAATTTCAAA AGTCTCTGTG CAGTOGAAGC AAAACTGAGT GCAQATGATG CTTTTATQAT 1140 

GGCACOGAAA CCTGGTCTGG ACTTCAGCCC CTTTGACTTT QCCCACTTTG GATTTCCTTC 1200 

TGATAATTCA GQTAAAGAAC 6CAGTTTCCA GGCAAAGCCT TCTCCAAAAC TGGGGCGTCA 1260 

TGTCTACTCC AGGCCCAAGG TTOATATTCA GAAAATCTGC ATAGGAAACA AGAACATCTT 1320 

CACXXSTCTCT GATCTQAAAC CCGACACGCA GTACTACTTT GAOGTATTTG TGOTCAACAT 1380 

CAACAGCAAC ATGAOCACCG CTTATOTAGG TACCTTTGCC AGGAOCAAGG AAGAAGCCAA 1440 

ACAGAAGACA GTCGAGCTCA AAGATGGGAA GATAACAGAT GTATTTGrTA AAAGGAAGGG 1500 

AGCAAAGTTT CTAOGGTTTG CTCCAGTCTC TTCTCACCAA AAAQTCACCT TCTTTATTCA 1560 

CTCTTGTCTO GATGCTGTCC AAATCCAAQT QAGAAGAGAT GGQAAACTTC TTCTGTCTCA 1620 

QAATGTGGAA GGCATTCAGC AGTTTCAGCT TAGAGGAAAA CCTAAAGCTA AATAOCTGQT 1680 

TOGACTGAAA GQAAACAAOA AAGGAGCATC TATQTTQAAA ATTCTAGCTA CCACAAGGCC 1740 

TACTAAGCAG TCATTTCCCT CTCTTCCTGA AGACACAAGA ATCAAAGCCT TTGACAAGCT 1800 

CCGTACCTGT TCCTCGGCCA CCOTGGCTTG GCTAGQCACT CAGGAAAQGA ACAAGTTTTG 1860 

JU CATCTACAAA AAAQAAGTGG ATGATAACTA CAATGAAGAC CAGAAGAAAA GAGAGCAAAA 1920 

CCAATGTCTA GOACCAGATA TAAGGAAGAA GTCAGAAAAQ GTCCTCTGTA AATATTTCCA 1980 

CAGTCAAAAC CTQCAGAAAG CAGTGACCAC AGAAACAATT AAAGGTCTTC AQCCTGGCAA 3040 

ATCTTACCTG CTGGATGTTT ATGTCATAGG ACATGGGGGG CACTCTGTAA AGTATCAGAG 2100 

TAAGGTTGTO AAAACTAQAA AGTTCTGTTA GTTACCTTCT TATAGAGATA TATTATGTAG 2160 

J J AACTCCAGGA GGGACATTAA ATCACTTTAA GTATAAACXG ACTACTCCCA CAGTTOAGAG 2220 

AAGTTGTGAC CTGTACTTGT ACTATGGAAO GAAGGATATC AAG6T6TQTA TATTGATGTT 2280 

TATATAAGTA ACTCTTGAAG GAGACTTGTT CTA6CGTGCC CCATG6TACC TA G TGTOTQT 2340 

CTGATGCCX30 TTGGTGTCAA AGATAGAGGQ CTTCTTGAAG GAACTTGCCA TTCCTTGCTT 2400 

TGACCACTGC ATGAACTGCT TCTAAATTAT TTTATTACCT AAAAATTTAA AATATGCCAT 2460 

4U TCATTGCACA CACCCACAAA TGCAAATCAT TCCTCTCTAT AGATQCTAGG ATATATATAA 2520 

ATTATTTTAT AAATTCTTGT TTTAAATQTC AOTGnTTCTA TGATTGTAAA CTATTAAATT 2580 

CTTTTCCTA7 TAAAGTACAG ATCTAATCTA AGTATTATTA AOTTGATAGC CCTCTAOTCA 2640 

GTTATATTGC TATTQTAAAT TCTTGTTTGT TCAGTAAAAT GTTTAAATAC TATATOTATC 2700 

TCATGTACAA AGTTGACATA CATTATATTC ATGTACATAA AATTAAAGAG ATTAOATTAT 2760 

ATACTOTTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2808 



20 
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Seq ID NO: C56 DNA Sequence « 
Nucleic Acid Accession #: BC034229.1 
0\J coding sequence i 373. » 1422 
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1 11 31 31 41 SI 

I I I I I I 

ATCCGGCGQT GGTGACGCXSA GAGGCTGGGG TCTCCAGGAC CAACTCCTCT TCATCTTCGT 60 

C TTCCTC AGC CTGCTCAATG TGAAOCCCCT GATCAIGATT CACTTCCaCT TAATAAATAA 120 

AGTCTTTACA AATCAGAATA ACTTTTAGAC AATATTAAGG TOGTAATCAT GAACAQAAAA 160 

GATTTIOXAQ TTCTTCCATG GGGAAAACCT GGAAATTCT6 TAAAGCTAAA ATATA6CAAT 240 

GTAATAATTA AAACAAAAGT CTAAGATTTG AAGAGATAAT TTGCTTCAGG ATTTTGATGG 300 

AAGGCAAATO CTAACTTTAA AAACCAGATT TCGGAGAAGT ACAAAAGAAA TAGAAATGCT 360 

CAAGAACTGC GAATGGAGAA AGTAC3VGTTA GAGTTTGAGA ACCAAQAGAT GGAGAAGAAA 420 

CTGCAAGAAT TGOGATCCAC AAOAAACAAA GAAAAG6AAG ATAGAQAGTC AA60QAOTAT 480 

TACTOGAAAT CTGGAAAAGT GGGCAAATTG OTCAATCAAT CATATATQAT GTCACAAAAT 540 

AAAGGAAATG TTGTTAAGTT TTCTGCTGGA AAAGTGAAAT TAAAATTOCT QAAGGAACAO 600 

ATTCAAGAGC CAQTOAAACC AACAOTTAAT TATAAAATGG CAAATTCTTC AGAATGTGAA 660 

AAACCCAAGA TAAATGGGAA AQTTTGTG6A CAGTOTOAQA ACAAAGCT6C TCTACTGGTA 720 

TGCCTT6AAT GTGQAGAAGA TTATTGTTCA GGATGCTTTG CTAATGTTCA CCAGAAA0G6 780 

GCACTAAAGC TCCACAGAAC AACTCTTTTG CAOGCAAOAT CTCAAATATT ATTCAAT6TA 640 

TTGGATGTTG CCCATCAGTT TATAAAGGAT GTTAATCCAG ATGAACCCAA AGAGGAGAAT 900 

AATTCTACAA AGGAAACCAG TAAAATTCAA CATAAACXK» AATCTGTACT TCTCX»GAGG 960 

AGCAGCTCTG AGGTAGAAAT TACAAGGATG AAAAGAGCAC AAOSTACAAA AGCAAGAAAG 1020 

AGTCTGTTQT OTOAAGGGTC ATTOGATQAA GAAGCTTCTQ CACAGTCCTT TCAGGAAOTG 1080 

TTAAOTCAAT 6GAGAAC0GG AAATCATQAT GACAACAAGA AACAGAATTT ACATGCAGCA 1140 

GTAAAAGACT CATTGGAAGA ATGOQAAGTA CAGACTAATC TGAAAATTTG GAGAGAACCA 1200 

CTTAATATTG AACTTAAAGA AGACATTCTA TCCTATATGG AAAAATTATG GCTTAAAAAA 1260 

CACAGGAGAA CTCCACAA6A GCAACTTTTT AAATGCTACC AGATACGTTC CCACATCCAC 1320 

ATGAAACCAC TQGTQATGCA CAOTQTTCTC AAAATQAAAA C3GATGAAGAT A0TGATGGT6 1360 

AGGAGACCAA AOTACAACAC ACAGCTCTTT TATTGC3CAGT AGAAACATTA AACATAGAQA 1440 

GACCTGAACX: ATCTCTGAAG ATAQTOGAAC TGGATGATAC TTATGAAGAG GAATTTGAAG 1500 

AAG CAGAAA A TATTGTGCCT TACAAAGTTA AATTAGCTGA TQCAOACAGT CAAaSAAGTT 1560 

GT GCTTTT CA TGATTGTCAG AAGAATAGCT TTCCATATGA AAATGGCATC CATCAACATC 1620 

ATGTTTTOCSA TAAOGGAAAG AGAGACTTCT TAAATCTTTQ TCTGftQAAAC AGCTCTACTT 1680 

ATTATAAAOA TAATTCAAAA GGAGAAACTT CAAACACAGA TTTTQACAAC ATOQTGGATC 1740 

CTGATGTGTA TTCTTCTQAC ATTGAAAAAA TTQAGQAAAG GACCTCCnT GAAAOAAATT 1800 

TAAAGGAOAA AAATATAGGT TTAGAAAGTA ATCAAAAOTC TQATQATTCC TOTOTATCAC 1860 

1232 



wo 03/042661 



PCT/US02/36810 



TXGAAAGCAA GGACACTTTO CTAC3GTAGAG ATTTAGAAAA AGCTCCCATT GAGGAGAAAT 1920 

I^JflStJ?^ ^1^^^ TCCTTGOAAT TCAGCRATCT GTATAAGAGQ CCAAGCTTTG 1980 

JJSJ^IS?^^ TCATCACTGT TGTTACAAGA AATAGCCTGC AGAAGTAAGC 3040 

^ CTATAACAAA ACAATATCRA GQACTTGAGA GATTCTTTAT TTTTGATACA AATGAAAGAC 2100 

J TCAACTTACT TCCTTCTCAT CGTTTAGAAT GCRACAATTC CAOTACTACG ATTACACTTG 2160 

CAGQTCAGAA ATC ACAGAG A CCTTCAACAG CAAATTTTCC ACTTTCCAAC TCTGTTAAAG 2220 

AAAGCICCAO TTGCCTrrCA TCCTCTCATC CTCGATCAAG AAGTGCAGCT QCTCAATCAT 2280 

^ISUS^^ TGCTTCPGAA ATTTCAGAAA TTGAATATAT TGATATTACT 6ACCAGAATO 2340 

1 0 ^^JJ^^ AGATGACACT ACTGATCAAC ATACTTTAQA CAATTTGGAA AAAGAATTAC 2400 

AAGTGCTQAG ATCTCTTGCA GATACTTCAO AAAAGCTTTA CAGCTTAACC TCAGAAGAOT 2460 

TCCCAGATTT CAGCAGCCAA TCACTGAATA TAAGTCA6AT TTCCACAGAT TTCCTTAAGA 2520 

CCTCACATOT QftGGGGTCCC TGTOGAGTTG AGGAATTGAG CTGTTCTOGA AQAGATACCA 2580 

AAATTCRGTC TTTGCTOTCA CTTTCTGAGA GC3W3TACAGA T6AGGAGQAG GAAGATTTTC 2640 

1 S Tgy^Q^ ACATGTCATC ACACTACCX5T GGTCAAAGAO TACTTAAAGA TTATTTOTTC 2700 

1 J ATTACTGTTT CCATTTTGTA CCCAGAGTAA AGCAAACRAC TGAGAAAAGT AACCAAOTOA 2760 

TTACCTATCC AAGTGCTGGA GATTTTGATT ACTAATCTCT TTQATGTTTC AAGGCTACAA 2820 

ACTAATAAAA GTAAAATTAT AAGTTCAAAA AAATTTTTAA AAAAAAAAAT AAAAAA 2876 

20 Seq ID NO: CS7 DNA Sequence 

Nucleic Acid Accesalon Us NM_0246B7.l 
Coding sequence: 138.. 1706 



25 



] 11 21 31 41 51 

I I i I'll 

AAAAACATGA TGACAACAAG AAACA6AATT TACAT6CAGC AGTAAAAQAC TCATTGGAAG 60 

AATGOGAAGT ACAOACTAAT CTGAAAATTT GGAGAGAACC ACTTAATATT GAACTTAAAG 120 

AAOAC ATTCT ATCCTATATG QAAAAATTAT GGCTTAAAAA ACACAGGAQA ACTCCACAAG 180 

A6CAACTTTT TAAAATGCTA TCAGATACGT TCCCACATOC ACATGAAACC ACTGGTCAIO 240 

CACAGTGTTC TCAAAATGAA AACX3ATGAAG ATAGTGATG6 TOAGGAQACC AAAGTACAAC 300 

ACACA6CTCT TTTATTGCCA 6TAGAAACAT TAAACATAGA GAGACCTGAA CCATCTCTGA 360 

AOATAGTCGA ACTGQATGAT ACTTAT6AAG AGGAATTTQA AQAAGCAGAA AATATTGTGC 420 

CTTACAAAGT TAAATTAGCT 6ATQCA6ACA GTC3VACGAAG TT6TGCTTTT CATQATTGTC 480 

AGAAGAATAG CTTTCCATAT GAAAATGGCA TCCATCAACA TCATSTTTTC GATAAOOOAA 540 

J J AGAOAGACTT CTTAAATCTT TGTCTGAGAA ACAGCTCTAC TTATTATAAA GATAATTCAA 600 

AAGGA6AAAC TTCAAACACA GATTTTGACA ACATCGTGGA TCCTGATQTG TATTCTTCTG 660 

ACMTGAAAA AATTGAGQAA AGCACXTTCCT TTGAAAGAAA TTTAAAGGAG AAAAATATAG 720 

GTTTAGAAAG TAATCAAAAG TCTGATGATT CCTGTGTATC ACTTGAAAGC AAGGACACTT 780 

TGCTAGGTAG AGATTTAGAA AAAGCTCCCA TTGROCaGAA ATTATCTCAA 6ACATCAAAG 840 

HV AATCCTTGGA ATT0A6CAAT CTOTATAAGA GGCCAAGCTT TCAAGAATCA AAAACTACAA 900 

AGTCATCACT GTTGTTACAA GAAATAGCCT OCAGAAGTAA GCCTATAACA AAACAATATC 960 

AA GGACT TGA GAGATTCTTT ATTTTTGATA CAAATGAAAG ACTCAACTTA CTTCCTTCTC 1020 

ATOGTTTAGA ATGCAAC3UIT TCCAGTACTA GGATTACACT TGCAGAAGAC AGAGAATCGA 1080 

. - TTCCAGACCA TAOCTTAAOT GAATATSCXG ATAATGCAAT mi Vr iUitfr GTTCTGCftGG 1140 

GTGCTCAOAG TCCATCATCA AGTAOAAAAC AGCAAAAGAT GGGTCAGAAA TCACAGAGAC 1200 

CTTCAACAGC AAATTTTCCA CTTTCCAACT CTOTTAAAGA AAGCTCCAGT TGCCTTTCAT 1260 

CCTCTCATCC TCGATCAAGA AGTGCAGCTG CTCAATCATC ATCTAGAOCT GCTTCTGAAA 1320 

TTTCAGAAAT TQ AATAT ATT GATATTACTQ ACC3VGAAT6A GCTTTCCTTA GATGACACTA 1380 

CTGATCAACA TACTTTAOAC AATTTGGAAA AAOAATTACR AOTOCTGAGA TCTCTTGCAG 1440 

ATACTTCAGA AAAGCTTTAC AGCTTAACCT CAGAA6AGTT CCCROATTTC AGCAGCCAAT 1500 

CACTGAATAT AAGTC3U5ATT TCCACAGATT TCCTTAAGAC CTCACATGTG AGGGGTCXXT 1560 

GTGGAGTTGA GGAATTGAGC TQTTCTGGAA GAGATACCAA AATTCAGTCT TTGCTCTCAC 1620 

TTTCTGAGAQ CAGTACAGAT GAGGAGGAGG AAQATTTTCT CAACAAGCAA CATGTCATCA 1680 

CACTACCGTQ GTCAAAGAGT ACTTAAAGAT TATTTOTTCA TTACTOTTTC CATTTTGTAC 1740 

CCAGAGTAAA GCAAACAACT GAGAAAAGTA ACCAAOTGAT TACCTATCCA AGTGCTGGAQ 1800 

ATTTTG ATTA CTAATCTCTT T^TOTTTCA AGGCTACaAA CTAATAAAAG TAAAATTATA 1860 
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AGTTCAAAAA AAAAAAAAAA AAAA 

Seq ID NO I C58 DNA Sequence 

Nucleic Acid Accession #3 NM_00S40B.l 

Coding sequence: 76.. 372 ~ 



Seq ID NO: CS9 DNA Sequence 

Nucleic Acid Accession #: AK097746.1 

Coding sequence t 185 . . 2324 



] " 21 31 41 51 

AAAAGGCCGQ CGGAACAGCC AGAGGAGCAG AGAGGCAAAQ AAACATTCTC AAATCTCCAA 60 

. CTCTTAACCT TCAACATGAA AGTCTCTGCA GTGCTTCTGT GCCTGCTGCT CATGACAGCA 120 

GCTTTCaACC CCCAGGQACT TGCTCAGCCA QATGCACTCA ACGTCCCATC TACTT6CTQC 180 

TTCACATTTA 6CAGTAAGAA GATCTCCTTG CAGAGGCTGA AGAOCTATOT GATCACCACC 240 

AGCAGGTQTC CCCAOAAGGC TOtolTCTTC AGAACCRAAC TGQGCAAG6A GATCTGTGCT 300 

QACCCAAAGQ AGAAGTGOGT CCAGAATTAT ATGAAACACC TQGGCOGGAA AGCTCACACC 360 

CTGAAGACTT GAACTCTGCT ACCCCTACTG AAATCAAGCT GQAGTACGTG AAATQACTTT 420 

TCCATTCTCC TCTGGCCTCC TCTTCTATGC TTTGGAATAC TTCTACCATA ATTTTCAAAT 480 

AGGATGCATT CGGTrrrGTO ATTCAAAATG TACTATGTOT TAAGTAATAT TGGCTATTAT 540 

^ TTGACTTGTT GCTOGTTTQQ AOTTTATTTG AOTATTOCTG ATCTTTTCTA AAGCAAGGCC 600 

/J TTGAGCAAOT AGOrTQCTGT CTCTAAGCCC CCTTCCCTTC CACTATOAGC TGCTGGCAGT 660 

GGGTTTGTAT TOOGTTCCCA GGGGTTGAGA GCATGCCTGT GGGAGTCAtXS GACATOAAGG 720 

TGTAGGAAGO AGAGCTCTTT GTGAATOTGA OQTGTTQCTA AATATGTTAT 780 

TGTGGAAAGA TGAATGCAAT AGTAGGACTO CTGACATTTT GCAGAAAATA CATTTTATTT 840 
AAAATCTCCA AAAAAAAAAA 
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1 1 I I I i 

CTTTCAT6AC AQTAAOUUVT CCAAGArTTT GGAAAAGOOC CTAOSATATT TAAAT6AGCA. 60 

CTTC3VCATAC AACTTATATT OTAATATATO OCGATCACTA TTTGAGAAGO ACAAGCTGTT 120 

ATTTTCCTTT TTATTATGTG CCAATCTTCT TCTGGCAAOa AAAGAGATTO AATACCAGQA 180 

ACTGATGTTT CTTTTAACTG GAGOAOTAAG TCTTAAAAGT GCTGAGAAAA ATCCTGATCC 240 

AACTTGGCTA CAGGACAAAA GCTGGGAGGA AATCTGTCGG GCAAGTGAAT TTCCTGCCTT 300 

CAGAGGACTC AGGCAACATT TTT6TGAACA TATATATGAA TGGC6A6AAA TCTATGACAG 360 

TAAAGAGCCA CATAATGCTA AATTTCCA6C ACXIAATGGAT AAGAACCTAA ATGAACTACA 420 

GAAAATAATA ATTCTTGGGT GTTTAAGACC TCATAAGATA ACCCCAGCTA TAACAAACTA 460 

TGTAACTGAC AAACTAGGGA AAAAGTTTGT AGAOCCTCCA CCATTTGATT TGACAAA6AG 540 

TTACTTGGAT TCAAATTGCA CCATTCCCTT AATTTTTGTT CTATCTCCAG GAGCAGATCC 600 

TATGGCCAGC CTGCTGAAAT TTGCAAATGA TAAATCTATG TCTGGAAATA AOTTTCAAGC 660 

TATTTCACTG G6ACAGGGAC AAGGACCGAT T6CAGCAAAA ATGATTAAAG CA6CAATTGA 720 

AOAAGGAACT IGGGTGT6CC TACAGAATTG CGATCTTGCA 6T6TCCT6GA TGCCCATGTT 780 

GGAAAAAATA T6TGAAGATT TTACCTCTGA AACCTOTAAC TCATCCTTTA GGCTTTGGCT 840 

GACAAGCTAT CCATCTTCAA AATTCCCAGT AACAATTCTA CAGAATGGAO TAAAAATGAC 900 

TAATQAACCT CCCACGGGTC TTOGGCXQAA TCTCCTTCAA TCATATCTCA CTQATCCAGT 960 

TTCTGATCCT GAGTTTTTCA AGG6ATGCCX3 TGGAAAGGAA CTGTTATTTA TCAATGAATA 1020 

TGATACAATT CCATTTGAAG CTATATCTTA CCTOACTGGO QAaTOTAATT ATGGAGGAAO 1080 

AGT6ACAGAC GATTGGQACA GAGGTCTTCT ATTAACCATG CTGGCT6ACT TTTATAATCT 1140 

GTACATAGTT GAAAACCCTC ATTATAAGTT TTCTCOCAaT GGAAACTATT TTQCACCTOC 1200 

TAAAGGCACT TATGAGGACT ACATTGAATT CATTAAGAAA CTTCCATTTA CTCAACACXX: 1260 

TGA6ATATTT GGATTACATG AAAACGTTGA CATCTCCAAG GATCT7CAAC AAACAAAAAC 1320 

CCTCTTTGAO TCCTTGCTCC TGAOCCA003 AGGCTCCAAA CAOACAGGAO CCTCAG6AAG 1360 

TACIGATCAG ATTCTQTTAQ AAATTAGCAA AGATATCCTC AACAAGCTCC CTAGTGATTT 1440 

CGACATTQAA ATGGCACTAC GQAAOTATCC TOTGAGATAT GAAGAAAGCA TGAATACTGT 1500 

GTTAGTACAA GAAATGGAAA GATTTAACAA TTTAATTATA ACTATAOGTA ACACTCTACG 1560 

GGACCTTGAA AAAOCTATTA AGGGTGTGGT TGTQATGGAT TCTGCATTG6 AGGCACTCTC 1620 

CA6TAGCTTA CTTGTTGGAA AGGTTGCAOA AATATGGGCC AAAOGTTCAT AOCCAAGCCT 1680 

TAAGCCCCTG GQAAGTTACA TCACAOATTT CCTA6CC06G TTGAACTTTT TACAGQACTG 1740 

GTATAATTCA GGAAAACXTTT GTQTGTTTTQ OCTOTCAGGT TTCTTTTTCA CTCAGGCCTT 1800 

TTTAACTGGA GCTATGCAGA ATTATGCCAG AAAATATACC ACCCCTATTG ATTTGCTAGG 1860 

ATATGAATTT OAGOTTATCC CATCTQATAC ATGT6ACACA TCACCAGAAG ATGGTGTTTA 1920 

TATCCAOGQA CTGTATCTCG ATGGCGCAOQ CTGGGACCGA GAAAGTGGAT T6CTTGCTGA 1980 

ACAATATCCC AAACTTCTGT TTGACCT6AT 6CCCATCATA T6GATAAAAC CAACTCAAAA 2040 

ATCTOGGATT ATAAAGTCX5G ATGCCTATGT CTGTCCCCTC TACAAGACAA GTGAACGTAA 2100 

AGGAACTCTT TCCACTACGG GACATTCTAC TAACTTTGTC ATTGCAATGT TGTTAAAAAC 2160 

AGACCAACCT ACTCGGCACT GGATCAA60Q G6GQOTTGCT TTGCmGTC AGTTGGATGA 2220 

CTAAATTGGA CAAATTTATA AAACATCCAA AAOTTT 22S6 

Seq ID NO: C60 DNA Sequence 
Nucleic Acid Accession #t J02761.1 
Coding sequence} 14.. 1159 



GAATTCGGGT 
6CTCT6TGGC 
TGAGTTCTG6 
ACAGQAAGTC 
CCACATCCTT 
GGAGCAGGAG 
TGAOGACTAC 
CTGTATGCAC 
GTCAGACCCC 
OGTCCTCCCT 
CTCOQAGCAG 
GGGGATCCAA 
GQTQOTACCT 
CCTGCTCGAC 
GTGCTCCATG 
CTCTGAGTGC 
CATACCACAG 
GCAATTTGTG 
CXaVCACCACC 
CCACAGCCCC 
GAOGGGCTCT 
CCAGGCTGAA 
CACTTCCACT 
ACCACAGTCT 
GCATCTGGA6 
AAAAACCAGC 
CCCXaUTGCAC 
AAAAT6GGAA 
TCAAAAAATC 
GQGATGOCGA 
TTGTTAAGAT 
GATTGAAAGT 
GGGTTGTGCA 
AQTCCTATTC 




CTGGGCCTGT 
CTGCCCAAAC 

CAATTCCCCA 
GCCAT6ATTC 
CTGGTGGOGG 
ACGCTGCTGG 
GATQAGAGCG 
CACCTCTGCA 
GCAATGCTCC 
GAGCAGCACA 
TGCCAGGCCC 
GACCTTTGAT 
GGGACCATGO 
AAGAAGCCTC 
GGTGGACCAC 
GACACCAGAG 
GAGCTCTOCA 
CACTGCTTTA 
TCAAAQATTG 
CATACAGAAC 
AGCAATTCCC 
CCGGGCAAGC 
ATAAAAAGGG 
CCTAGGGTGA 
AGCTAGGGAC 
TCTTTTAAAA 



21 

I 

AGTCACACCT 

craccTGOAC 

T6GAGCAAGC 
TGGGAGCOGA 
CCAAGGAGGC 
TCCCCTTGAA 
TCATGGACTA 
GCAAATCOCO 
CTCTG06G6A 
GGGCCCTCCA 
TTCCTCTCCC 
CCAAGGGTGC 
GOGGCATCTQ 
GCCGCATGCT 
CTGGCCCAAG 
TGTCCGTGAC 
AGGCCTGTGT 
CX3CCCCAGCT 
TCGGGGTGTG 
GAGAACTCAG 
TGACCAGGCT 
AGCTCCCACA 
GGGCCCCCAG 
CCCACTTCCA 
GCCTCCACAC 
CAGGACAGGG 
GATTTTACAG 
TCTAAAAGAT 
CAGGGTAGTC 
TTTCTTCCTC 
GTTTCTTTTT 
TTCTATTTCT 
CCATTCCTGT 
AAAAAAAAAA 



31 
I 

GCTGCAGTGG 
CaCCTCATCC 
ATTGCA6TGC 
TGACCTATQC 
CATTTTCCAG 
GCTGCTCATG 
CTTCCAGAAC 
GCAaOCAQAO 
CCCTCT6CCA 
GGCGAGGCCT 
CTATTQCTGG 
GCTAGCTGTG 
CCAGTGCCTG 
GCCCCAGCTG 
GTCGCCQACA 
CACCCAGGCC 
TGGCTCCTGQ 
6CTQACCCTO 
TGGGACCATG 
CTOTCCAGCT 
CTTCCCCTQC 
COGCCCTCCT 
CCCTGTGTOG 
TCCTCTCTGG 
CTACCACQAC 
GGTTGAAGCr 
CTACTTGCAA 
AGACATCAGA 
AAGGGTGGAC 
GAGATGCTCT 
GTCTTTCTGT 
QCTGTQATTT 
GTAATACAAT 
AAAAAAAAGG 



41 

I 

CTGCTGCTGC 
TTGGCCTGTG 
AGAO OOCTAG 
CAAGAGT6TG 
GACACGATGA 
CCCCAGTGCA 
CAGACTGACT 
GCAGAOCAGG 
GACCCTCTGC 
GGGCTCTGACA 
CTCTGCAGGG 
GCAGTGGCCC 
GCTGAOOSCT 
GTCTGCGGCC 
GGAGAATGGC 
GGGAACAGCA 
CTGGACAGGG 
GTGCCGAGGG 
TGCA6CCCTC 
GCAAAGGAAA 
TCCCTGGCCC 
CACCTOCCTT 
GCCTTGTCTG 
TGTGAGGCAC 
CTCCCAGGGC 
GAGCCCCGCC 
TTCAAAATTC 
AATTGTTAAG 
ACTGCACXfCT 
GCTGCTTGAQ 
AAGGTGGACT 
ATCTGCTGAA 
GTCTGCACCA 
GAATTC 



51 

I 

T6CIGCCCAC 
OCCAGGGCCC 
GGCATTQCCT 
A66ACAT0GT 
GGAAG7TCCT 
ACCAAGTGCT 
CAAACGGCAT 
AGGCAGGQAT 
TGGACAAGCT 
CACAGGATCT 
CTCTGATCAA 
AG6TGTGC06 
ACTCGOTCAT 
TCGTCCTOO O 
TGC0G06AGA 
GCGAGCAGGC 
AAAAGTGGAA 
GCTGGGATGC 
TCOVGTGTAT 
AGCCAAGTGA 
TOGCCAGCTG 
CCTOGGCAGT 
TCTCAGCTCA 
AGCGAGGGCA 
TGGGCTCAGG 
TCACAOCCAC 
AGAAGAATAA 
TTAAGCTTTT 
CT6GCATGAT 
AGCTATTGCT 
TCCAGATTTT 
AGCTCAGCTG 
AT6CTAATAA 



60 

120 

160 

240 

300 

360 

420 

460 

540 

600 

660 

720 

760 

840 
900 
960 
1020 

1060 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1960 
2026 
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Coding sequeneet 19. .SS2 
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1 

I 

GG6GTCTGGQ 
CTGCAGGGCT 
GGTQACATAT 
TGCCTGCGTC 
AAC6T60GGA 
AGCZX;CAGCA 
TTOCTGGCGG 
ACCAAGAAGA 
QTGOAGGGAO 
QAGGA66ATT 




GTCCGTGTQA 
CGCXX3TCCAC 
GCACOQAGGO 
AGGGGAGTCC 



31 
I 

GTTGGGCTGT 
AAATGGAATC 
GAGCTGTGG6 
CTACC36CAAT 
GTGGGCGCTG 
CTGGTGGGCr 
CATGTCCAAG 
GGGCAGGGTG 
OGAAGGGACO 
COGGGGGACT 



31 
I 

GGGATCCCAQ 
CAGGGATTCT 
GGCCAGGCGG 
GGGGTCTGCT 
GTCTGGACGT 
AAGCGCCGGG 
TCOQTCTOGC 
CCAGTOGCCC 
GAGGAGGGTG 
GCTCAATACA 



41 

I 

OGCTGGQCCT 
TCTACCCATG 
CCATOGATAG 
ACCACCA600 
6OVG0GGCCT 
AOGTQCTGCA 
TGCTCTCCAA 
TOTCCAAAGA 
AGGAQACAGA 
GATA06GTGG 



51 
1 

GCTCCTGCTG 
GAGCTGTGAG 
CCCCAACCTC 
TCCAGACXMA 
CCTCCTCCTG 
TATGCCCGGT 
6CACCGAGGG 
GTCCAGGGAT 
GGGOGAGGAA 
ACQ 



Seq ID KO: C62 DMA Sequence 

Nucleic Acid Accesaion nm_054023.2 

Coding Bequencei 98.. 379 ~ 



GGGGACACTT 
ATTTTTAAAC 
GCTGGTGACC 
GCTTCCTGTT 
ATTAAAGCTT 
GAA6TGTGTA 
GCTATCACAC 
TGCTCCTATC 
0TAGT6ACCC 
AAAAAAAAAA 




31 
I 

AACTGTCATG 
TGCTACTGCC 
TCTCGACAAC 
TTCTOTTOAG 
TTCTGAAGCT 
AGAGOGGAGG 
CTACCAATTA 
AATGAATACT 



41 
I 

QATTTTGCTA 
AAGCTGGTAA 
TTCCTCATCA 
ATTCTTCCCT 
CACCTTOTGO 
GTGAAGAAAC 
TGGATGGGGA 
TAGATCAAAT 
AAAAAAAAAA 



51 
I 

GATTTTTCTG 
CTATCTTCCT 
ACAAAGTGCC 
TTATGGATCC 
A6GGGCTAAG 
TGCTGGAGGC 
TGGAAGATGA 
GCCCTAAAAT 
AAAAAAAAAA 



Seq ID NOt C63 DNA Sequence 

Nucleic Acid Accession ff: PGQIESH predicted 

Coding sequence x 1 • . 2 6 74 



60 
120 

leo 

240 
300 
360 
420 
480 
540 
593 



60 

120 

180 

240 

300 

360 

420 

480 

540 

550 



1 11 21 31 41 51 

I I I I I I 

ATQCCCCTGT CCTATGCCTA TAAAAACGCT GAGACXXTTAG CAGGCAGACA CAC3WM3CAGC 
TGGATGTCX3A GA6GAGCATA TCA6CGGAG0 AACACAOGGG CAGCTGQAGS TCCAGAGGAA 
T0CACT6ACA GAAACTGGCA T6CTGGCA0A ACAOGTGGAA TrrGGCTGGG GCAOTTGOAG 
GAGAQATGTT CAGATGTGTT CGGAGTTTCT TTCTTCTGGT GGGTTOGTGG TCTCGCTGGC 
TCAGQAQCGA AGCTGCAGAC CTTCAOGCCA GCCCAGGAAQ GGGCTCCCAC AQTGCAGCGG 
CAGGCTGAAO CGCTCCTCAA GTGCOGCCAG AGTGGGCGTC CAGGCAGAGG AGGCGCCQAG 
AOGGAGGQAO GGAGOGATGC CAOCATGCTG TCACCTCTCA GTGCTGCCAT G08AAACTAC 
CCAAOOTGCT CTACCATCCC TCCAAGAAGA TCCTACTCTC CAACGQAAAT T6CTCACAA0 
AGTTACTCCT GCAGCCTTCC AGACATGAAA ATCTCCATGG CAQAATCTGG CCCCTCCTTG 
6ATAGCCTTG ACATTCTGOA GGATGGCXJAG TCTGGGTCAC CATTTCTTGT GACTCATTTG 
TACTTTCTGG GGGTTGTCAC CACTGGQATO GAACAACTAG ATTTTGAAAC AGGACCAAAC 
ATATTTGATT T6CAGATTTA TOTGAAOGAT GAGGTTGGTG TCACA6A0CT GCAAGTCCTG 
ACTGTCCAGS TAACA6ATGT GAAOQAGGCA OCTCAGTTTC AAG6CAACTT GGCAGAA6AT 
CATCTCCGTO CAQACCAGCC ACATTTCAAT GCTCATAGTC ACAOGTACGT GAGGGTAGTG 
GCTACTGCAT TGGCCAGGCA CAGGCTTAGA TCTAGCATTG GTTCCCOCTT CCTGGGCACC 
TTCTGTQTTO TGGTGGGCAT GCAGTATTTC CTOATTTCTC CCCCAAAflAG CTTCAGAATG 
TCT6CTAAT6 6CACCCTCTT CTCCAOiACA 6AATTG6ACT TTQAAGCAGG ACACAGAAGT 
TTCCATCTCA TGOTGOAGGT (S^GGGACAGT GGAGGCCTCA AAGCCTOCAC AGAGCTCCAG 
QTQAACATOQ TGAACCTCAA CGAOGAAGTC CCTOSCTTTA CCAGCCOGAC ACGAGTGTAC 
ACAGTCCTGG AGGAACTGAG TCCAGGAACC ATOOTGGCCA ATATCACAGC GGAGGATCCT 
GATGATGAAG GTTTTCCCAO CCAOCTCCTC TACAGCATTA CCACTGrTAO CAAATATTTC 
ATGATAAATC AGTTOACTGO TACAATCCAA GTG6CCCAAA G6ATAGA000 AOATGCAGQT 
GAATTGAGAC AAAATCCCAC CATTTCOCTG GAAGTTCTAO TGAA60ACA6 ACCATAT6GG 
GGTCAGGAGA ATCGCATCCA GATAACCTTC ATTGTGGAAG ACGTCAACGA CAATCCTGCC 
ACATQCCAAA AGTTCACCTT CAGATCCAQT CTCCACCCTG CTCTGTGCTC CAAGACGCTG 
ACCTGGATOa ATACCGTATT AGACTGTTTT CATGCTGCTG ATAAA GATAT ACCTG TGACT 
GG6GGATTTA OAAAGAAAG AGGTTTAATT GOACTTACAQ TTOCACATGG CTOGGGAAGC 
CTCACAATCA T6GCA0AAG6 CAAG6AGGA0 CAAGTCACAT CTTACATGGA T6GCAGCAGG 
CAAAGAGATA GAGCTTGTGT AGGGAAACTC CTCCTTATAA AGCCATCAGA TCTCATGAGA 
CTTAGTCACT ATCACGAGAA CAACTCAGOA AAGACTTGCC CCCATQATTC CATTTCCTCC 
TACCAGGTCC CTCCCACAAC ATGTAGGAAT TCAAGAATCC AG6CCACCAA CAAOGAAGAC 
ACAAGCTCTG TCACTGTTAC TGTGAACATC CTTQAAQAAA AT6ATGAAAA QCCAATTTGT 
ACTCCAAACT CTTATTTCCT QGCCCTCCCA GTGGATCTGA AAGTTGGCAC AAATATTCAO 
AATTTCaVAGC TGACATGTAC OGAOCTT6AT TCCAGCCCCA GATCTTTCOG TTATTCCATT 
GGCCCAGGTA ACX3TCAACAA TCATTTCACC TTCTCTCCCA ATGCTGGTTC CAATGTCACA 
C6CCTGCTGC TTACATCTCG CTTTGACTAT GCTGGTGGGT TT6ATAAGAT CTGGGACTAC 
AAGCTACTTQ TCTACQTAAC TGATGACAAC TTGATOTCTO ACAGGAAGAA AG06GAGGCT 
CTTGTT6A6A CAGGAACAGT GACACTGA6T ATTAAAGTCA TTCCCCACGC AACCACTATC 
ATCACCACGA CCCCCAGGCC CAGGGTCACC TATCAGGTCC TQAOQAAAAA OGTTTACTCT 
CCATCTGCAT G0TACX3TGCC GTTTGTCATC ACTTTGGGCT CCATATTGCT TCTGGGTCTC 
CTCGTGTACC TGGTCGTCCT ATTGGCCAAA GCCATCCACA QACACTGCCC CTGGAAGACT 
GGGAAGAACA AGQAACCTCT 6ACAAAGAAA GOAOAAACOA A6ACTGCA6A 6A6AGACGTC 
6TGGTGQAAA CTATCCAOAT GAACACTATC TTTOATGGAG AAGCCATAOA TCCAGAGCCT 
GAGCAAGCTT CACTCGAGCT CTATGCCCTO CTOCCCAGCT GCTGC6AGCC TAGTCCAGTA 
ACCCTAAGAA AGGTCCAGGT 8TGTGGGGAG AGTGAAGAGA COGGTCAOTO T T OOOOCCAC 
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ATCACACTTC CCGGCAAOAT TCCAGTOGAT GACCCAAGGA AACAGGAAAC AGGCCTGCAG 2760 
GGTOATTTCG AGGTCTGGAC TCTATGCCCC GCTCSTGAAGG TGGTTQTAGO CAGCCCTCAA 2820 
GCT6AAGGGT GCATTGGATT GGCTCTCAGT CTGAAAAA6T ACAGTTCT6A TTAA 2874 

Seq ID NO I C64 DMA Sequexice 

Hudelc Acid Accesalon ttt XM_168S7l.l 

Coding sequence: 155.. 988 ~ * 



1 
I 

TACACAGTCC 
CCTGATGATG 
TTCATGATAA 
66TGAATTGA 
0GGG6TCAGG 
6CCACATOCC 
TTGCTTCTTG 
TTCAACTTCA 
GGCTCTG6GA 
6CCGGCAATA 
AATAACOTCT 
AGGCCATCCT 
CCTGAGGAAA 
CTCCATATTG 
CAGACACTGC 
GAAGACTGCA 
AGAAGCCATA 
AAAGTGGAAA 
TGCXXXy^CGC 
AGAAGATGAO 
GGGTGGCAAG 
CAAACCACAC 
TGCCTCACCC 
TGAGGGGATT 
GAAAG6GGTT 
ATTTCAGAAC 
TGTGTTTCTT 
TAGGAATTCC 



AAGTCACCTC 
AAGAGTTGGT 

TCAGTTCTCT 
CTACACTTGG 
ACTACAGGCA 
ATCACCTAAO 
TAAAAAATAG 
AGGCTGAGGT 
GTGCCACTGC 
AAGTGAATTA 





CCCTGCAAGA 
GAGAGAGACG 
GATCCAGTGA 
GATCCACTAA 
AGAGTCACT6 
CTGAOTGGCA 
CTGGGCAACC 
CCAOGAAAGT 
TAAATTCTAT 
CAGACATCCA 
TGATCACATA 
TACCTGTGCT 
CTTTGCATTG 
TGACATAAAT 
TTTAACATCA 
AAATCTTCTT 
CATCIGGAAA 
TCAGCATCCT 
TCATGGTTAT 
OTGQCAGCTQ 
GGGCATGATG 
QTCAGGASTT 
AAAAATTAOC 
AGGGAGAACT 
ACTCTAGCCT 
CAACACT 



TGGAOTGGGG 
GATTQGTGAT 
GATAATCCAG 
CCTAACAAGC 
TGTGTCAGAA 
CTCCATCTGC 
TCCTCX3TQTA 
CTGGGAAGAA 
TCGTGGTGGA 
CCGGGGAAAC 
CCX3UUITGCX: 
CTGGGGAAGG 
AAGCOTGGGC 
CAAAGAACAG 
AAACGGGGTC 
GGGGATGGTG 
GGGTCAAACA 
GTTGCGTQTT 
TCTGATAAGC 
ACTGCTAGOA 
AGTGAA6ACT 
CCCAGATTTC 
CAAAAGAAGC 
GAAGAAAACT 
GCATGTGAOA 
GTCTGGTTTC 
CTTCAGATTT 
6CTCATGCCT 
CAAGATCAGC 
TTG G TGTGCT 
GCTTAAACCT 
G6QT0ACAAA 



31 
I 

ACCATCGTGG 
CTCTACAGCA 
CAAGTGGCCC 
CT6GAAGTTC 
TTCATTGTGG 
ATGGTGCCX3G 
GATGATGACA 
AGCGGCAGCA 
CTAGACTACG 
GTGCAGGATG 
CXyVGAAAATG 
AGAAGGCCCX; 
ATGGTACGTG 
CCTGGTCXSTC 
CAAGGAACXn: 
AACTATCX31G 
ATAT6AATTC 
AAAATGGAAA 
GATGGGGTCA 
TGAGGATGCT 
AAATCCAGCC 
TAAGGAGG6G 
TGGGCATGGT 
TGGGATGTTT 
CTQAAATGAT 
AAGACTGTTA 
AGCTCTATTC 
ATCCTTACAT 
TTCAGTTATA 
AGAACAGTGA 
CAGTAGGCAC 
TTCATCCAOG 
ATTCTATGAT 
TTTACTTTTA 
GTAATGCCAG 
CTG6CCTAGA 
GGTG GGCAC A 
GA0AG6TGGA 
GCAA6ACTCC 



41 

I 

CCAATATCAC 
TTACCACTGT 
AAA6GATAGA 
TA6TGAAGGA 
AAGACGTCAA 
AAAGAACAGC 
GTGAGGCACC 
GATTTTTACA 
AAAATCCAAG 
TGGCCCCCCC 
AGTTTCCTCr 
CCCAGGGTCA 
COSTTTGTCA 
CTATTGGCXA 
CTQACAAAGA 
ATOAACACTA 
AACTCAAAAA 
GAGTGCA6CC 
CTGAGAAGTG 
GGTCTGGGTT 
rrCATGAACA 
CCTGTCAATC 
GTAGGGGGGA 
GACAAATTTT 
ACAGGAACAT 
ACTTTGGGGT 
TGTTCACCAT 
CTGGTTTCCA 
AATATGCCAT 
AAAAAACAGA 
CTTCTTTTGT 
TTGTCCTGTC 
TATATCACAA 
AAAAATATAC 
CACTTTGCCA 
TOGCAAAACC 
TGTAATCCCA 
GGTTACA6T0 
ATCTGAGAAA 



51 

I 

AG0GGAG6AT 
TAGCAAATAT 
CCGAQATGCA 
CAGACCATAT 
CX3ACAATCCT 
CAAGGGGACG 
AAACAACAGA 
GGATCCAGCT 
TAACCTAOCA 
TTACTATAAA 
CATTTTTGAT 
CCTATCAGGT 
TCACTTTGGG 
AAGCCATCCA 
AAGGAGAAAC 
TCTTTGATGG 
CTGGAGCCAO 
ACXIAGGQAGC 
CCAACTGGGA 
CCAGAAATGA 
GGGCTTACCC 
ACTGAGATGC 
AAATGTGGGC 
TAAACAAATA 
TTTCTATCAG 
GTGGAATTGT 
AGAAAGTTTG 
CCTTATTTTC 
ACACCTTTGT 
TGAGTAAGTT 
TTTTTCTTGT 
TAGCAGTAGT 
TTTATCTATT 
TTAAAAGTGA 
AGGTQOGCAG 
CTGTCTCTAC 
GGTACTTGGQ 
AOTTGAGATT 
AAAAAAATAA 



Seq ID NO: C6S DNA Sequence 

Nucleic Acid Accession «t NN_005366.3 

Coding sequencet 123.,119B ~ 



G6CACGAGGC 
ACACAAGGCA 
GATGGG06AT 
GGTAG6CAAG 
TGCTQAGTCT 
CT6CCAGAAT 
QCAOATCATC 
GCGCATGCAG 
TGOCTCrrAC 
TGOAAGGATT 
CACCACCATG 
CACCCTGCAT 
GCCCACAGAG 
CCTTAGGCro 
AC0GGG6CA0 
CTGCACACCA 
CAATCCCTTC 
AGTAOGAGQT 
GOCTQAOGTO 
CAAGCGACGT 
CrCCTTTATO 
GTCCCTTCTG 
TCTCAATGAC 
A6GT8ACAAC 
TTCAGATTAC 
OQAAGGGATA 
GGGTTCTCTA 



11 
I 

CATTTTCAAA 
GCAAGCACTG 
TGGAGCTTCX: 
GTCTGGCTCA 
TCCTGGGGGG 
GTCTGCTACG 
TTCQTCTCCA 
GAGAAGCGCA 
GAGTAGCGGG 
6CCCTCCAGG 
GAGGTGGGCT 
GTCTQCCGCA 
AAGAAT6TCT 
6CTGAACTCT 
CACATGOCTA 
CCCCCC6ACT 
AGCAATAATA 
CAGGAGCAGA 
CCCAATGGAG 
CTTAGTAAGG 
GQAGGATCAO 
AACTGATGCT 
GTTGCTC3VTT 
CCACCCAQAC 
TCATGAAACA 
GCCAQAGQGA 
AGTTCCTAGC 



21 

I 

CAOTCCCTCC 
GGAGACGAAA 
TGGGAAATTT 
CTGTCCTCTT 
ATGA6CAGGC 
ACCA6GCTTT 
CGCCCTCTCT 
AGCTACGGGA 
TGQGAGAGAA 
GCACTCTGCT 
TCATTOTGGG 
GQAOTCCCTG 
TCATTGTCTT 
ACCACCT6GG 
A6TGCCAGCT 
TTAATCAGTG 
T6GCCTCCCA 
CTCCTOGGGA 
TCTCACCAGG 
CCAGCAGCAA 
GACCAGGTGG 
TTCTCA.CTGT 
AATTCTAGAA 
TGCAOTTCCC 
GGGTAGAGGO 
T AOAAT GACT 
TCCTTGACCT 



31 

I 

TGGGAGAACA 
GTTTTGGCAT 
CCTGGAGQAA 
CATATTCCGT 
T6ATTTC0GG 
CCCCATCTCC 
GGTGTACATG 
GGCCGAGAQG 
66CAQAACTG 
CAACACCTAT 
CCAGTACTTC 
TCCCCACCOG 
TAT6CTGGCT 
CIGGAAGAAO 
TTCT6GCCCC 
CCTGGAGAAT 
ACAAAACACA 
AGGTTTCATC 
TCAC06CCTT 
G6CAAGGTCA 
GAACAAAGGA 
CATCACT6CT 
ACTATAACCA 
TCCCCACOCT 
AAAGAACG6A 
CTCTCTCTAC 
GATCAOCCTC 



41 

I 

CAGACAGGCA 
CTGTTCCCTG 
GTACACAA6C 
AXGCTOGTGC 
TGTGATA08A 
CACATTCGCT 
GGCCACGCCA 
6CCAAAGAGG 
TGCTGCTGGG 
OTGTGCAGCA 
ATCTACGGAA 
GTCAACTGTT 
GTGGCTOCAC 
ATCAOACAGC 
TCT0TQ6GCA 
GGOCCTGGGG 
GACAACCTGG 
CAGGTTGGTT 
CGCCATG6CT 
QAT6ACCTAT 
GGCTC3VGAGA 
TGQCTOCTTT 
GGGCTCTGGG 
CTACOCAOTA 
AGCATOGCAA 
ATACCAOCAO 
CCTCCTOCAA 



51 
I 

GAGQATTACA 
GCTGTGCCAA 
ACTCGACCGT 
T6GGCACA6C 
TTCAGCCT08 
ACTGGGTGCT 
TGCACACTGT 
TCCGGOOCTC 
AGGAA GGGAA 
TCCTGATCOG 
TCTTCCTGAC 
AOGTATCCCXS 
TOTCCCTCCT 
QATTTGTCAA 
TAOTCCAOAG 
GAAAATTCTT 
TCACCGAGCA 
ATGGCCAGAA 
ATCATAGTGA 
CAOTOTaACC 
GGAAAOACGT 
GAGCCCCGGG 
ATAGTAAGAG 
TAOQAAOCCT 
AAGCTGGCCT 
CATAOCAAAT 
GOAAGAGCTC 



60 
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60 
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AAAGTTCCOV 
CTGTTGTCTC 
CTTCTOCTCC 
CTTTACAAAT 
GCCCGACCTT 
GGTGTCAOCC 
TCTGTCACCC 

6CT6TTGTCC 
CCAOGCCTTO 



GCCAATAGAC 
CATGGACCAT 
CTCTACTCAG 
GOCTTGGTAT 
GGTGATGTTQ 
CTGCTA6GGA 
TGCAGCTCCT 
TGTTCACCCA 
TCTCTCXSAAC 
TTAAAAAAAA 



AGCATGAATC 
TCCTCGGAGT 
CCTTAAAAAO 
GTACCTCTGG 
CAACCTTTCC 
GTCACTGTAC 
TTACAGTTCA 
GCCATTCCCT 
CCTGACCAGA 
AAAAAAAAAA 



AAGGAACTTQ 
AGTGGTGAGA 
GGCTTCTTGG 
CAAATGCCCT 
TTCTGCTAG6 
ACACAAACTC 
ATCCAATGAT 
GAAG6CCTTA 
TCATCAGCCA 



CATTATATGT 
TGGCCTTQGG 
AACTTTACCA 
GGTAT6TACC 
GTQTACACCT 
TACTGGAATT 
AGAAACCATC 
CCAACAG6AA 
CTGAGGCCAS 



GCTCTTQAAT 
TTGCCCTTGG 
GCAGCCTGAQ 
TCTGG»AAT 
AGCCTGTGCA 
CCTGCCAACA 
CCTTCCCTTT 
TATCCAAGAA 
T6GAATTTCC 



Seq ID KOt C66 DNA Sequence 

Nucleic Acid Accession D: im_014459.2 

Coding sequence t 738.. 3407 ~ 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2190 



1 11 21 31 41 SI 

I I 1 i I I 

GTAQATGC3VG TCCX5CCGCCG CXX3CTGCCTC AGCCAGCAAT GCAAGATTAO ATCTCTAAAT 60 

GCAGCAAAAC ACTGCCTGAA AACAGACOGG CCC006CAGC AA6CAGACAT TTCAOGOTGC 120 

GCTGGGGAAG CTTCAAAATA TATCTGTGAC TCTGTCTTOQ TTGCTCTTCA TCCXX3KTCAA 180 

TTTCATCACX3 GGAGGCGAGC AGCAAGTAAG AATTTC3VCTT TCGQATCTGC CTAGAGACAC 240 

ACCTCCCTGC TCCCTCCCCC ACTCGATGTG AAQAQTATTC CGGAGTCTCC GGGOGGGAGT 300 

AGATTTGCAO CACCCTAGCX3 GGAGCX3AGQA AAACCTACTG ATTCTTTAGC TCATTATCAT 360 

CTCTCCCAGA OGAQATTTCC TTCTTATOGC CTGCCTCATC GCTCAAGTTT GAGCCTCCCX3 420 

AAGTCCGGGC GGGAGAGACG AAACCCCTGG CTCACCCCCA GCCGCAGGAA GCCAOCGCCT 480 

TGCTCCAAGC CCCTGCAGCT CTGCTGCACC GCAGCTTCTC ACCCAGTGCG GATGCTGTAQ 540 

ATCAACAGGT TCAGGGAACT TGAGCAGAAT AAGGAGAGAC CACCGGGTGC CGCAGCTOGG 600 

GTGCAGAGGG AAAAAAGGAC CCATAGACTT GTX^GCTCGCQ TCGOGCGCGC A0GCT606CC 660 

AGGGCCCCAG 0CTGGCX3GGC ACTCGCTCTC TGGCTCCTCC AGTCCGATTG CTCCTGOCCC 720 

CACCTTACAG GTCTGGGATG TACCTTTCCA TCTGTTGCTG CTTTCTTCTA TGGGCCCCTG 780 

CCCTCACTCT CAAQAACCTC AACTACTCCO TGCCGGAGGA GCAAGGGGCC GGCACGGTGA 840 

TCGGGAACAT CGGCAGGGAT GCTCGACTGC AGCCTGGGCT TCC36CCTGCA GAGCGCX3GCG 900 

GCGGAGGGOG CAGCAAGTCG GGTAGCTACC GGGTGCTGGA GAACTCCGCA CCGCACCTGC 960 

TGGAOGTGGA CGCAGACAGC GGGCTCCTCT ACACCAAGCA GCXSCATOSAC CX3CX3AGTCCC 1020 

TGTGCCGCCA CAATGCCAAG TGCCAGCTGT CCCT0SA6GT GTTOGCCAAC GACAAGGAQA 1080 

TCTGCATQAT CAAGGTAGAG ATCCAGGACA TCAACGACAA C6CGCCCTCC TTCTCCTOGG 1140 

ACCAGATG8A AATGGACATC TC6QAGAACG CTGCTCCGGG CACGCGCTTC CXXXITCACCA 1200 

GOGCACATGA CCCCGACGCX: GGOQAGAATG GGCTC06CAC CTACCTGCTC ACGCGCGACG 1260 

ATCACGGCCT CTTTGGACTa GAOGTTAAGT CCCGCGGOGA CX3GCACCAAQ TTCCCAQAAC 1320 

TGGTCATCCA GAAGGCTCTG GACCQCGAGC AACA6AATCA CCATACGCTC GTGCTGACTO 1380 

CCCTGGAOGG TGGC6AGCCT CCAOGTTCCG CCACOGTACA GATCAACGTG AAGGTGATTG 1440 

ACTCCAAGGA CAAC3U3CCCG 6TCTT0QA6G OGCCATCCTA CTT6GTG0AA CTGGCCQAQA 1500 

ACX5CTCCX3CT GGOTACAGTG QTCATOGATC TQAACGCCAC CGACX3CCQAT GAAGGTCCCA 1560 

ATGGTGAAGT GCTCTACTCT TTCAGCAGCT ACGTGCCTGA CCGCGTGCXSa QAGCTCTTCT 1620 

CCATOGACCC CAAGACCGGC CTAATC0GT6 TGAAGGGCAA TCTGGACTAT GAGQAAAACG 1680 

GGATGCTGGA GATTGACGTG CAGGCCGGAG ACCTGGGGCC TAACCCTATC CCAGGCCACT 1740 

GCAAAGTCAC GGTCAAGCTC ATGGACCXXA AOSACAATOC OCCGTCCATC GGTTTOGTCT 1800 

COGTGCGCCA GGGGG06CTG AGCGAGGCOG CCCCTCCCGG C3VC0GTCATC GCCCTGGT6C 1860 

G66TCACTGA CCGGGACTCT GGCAAGAAOG GACAGCTGCA 6TGTCGGGTC CTAGOOGOAG 1920 

GAGGGACGGG CGGCGGOSGG GGCCTGGGOQ GGCCCGGGGG TTCC6TCCCC TTCAAGCTTG 1980 

AGGAGAACTA OGACAACTTC TACAOGGTGO TGACT6AC06 GCG6CIGGAC GOOGAOACAC 2040 

AAGAOSAGTA CAAGGTGACC ATOOTGGOGC GOGAOGGGGG CTCT O CTCCC CTCAACTCCA 2100 

CCAAGTGQTT CGCX3ATCAAG ATTCTAGACX3 AGAACGACAA CCQGCCTGG6 TTCACCAAAG 2160 

GGCTCTA03T GCTTCAGGTG CACGA6AACA ACATCCGQGG AOAGTACCTG GGCTCTGTGC 2220 

TCGCXX3U3GA TCCCQACCTG GGCC3W3AACX3 GCACOGTATC CTACTCTATC CTGCCCTOGC 2280 

ACATOQGGOA CQTGTCTATC TACAOCTATO TGTCTOTGAA TCOCAOGAAC GGGGCCATCT 2340 

ACOCCXTTGOQ CTCCTTTAAC TTOGAGCAGA CCAAG6CTTT TGAOTTCAAO GT6CTTGCTA 2400 

AGGACTOSGG GGOBCCCX3CX3 CACTTGGAGA GCAAC36CCAC GGTGAGGGTG ACAOTGCTAO 2460 

ACGTGAATQA CAACGCGCCA GTGATOQTGC TCCCCAOGCT GCAGAACGAC ACCX3CXX3AGC 2520 

TGCAGGTGCC GCGCAAOGCT GGCXrTGGGCT ATCTGGTGAG CACTGTGGGC GCCCTAGACA 2580 

OGQACTTCXSa GGA8A60GG6 CXSTCTCACCT AOGAGATOGT GQAGGGCAAC GAOQACCACC 2640 

TGTTTGAOAT GGACCGOTCC AGOGGCXjAQA TC06CACGCT 6CAGCCTTTC T66GAGGA06 2700 

TGACGCCOGT GGTGGAGCTG GTGGTGAAGO TGACGGACX:A CGGCAAGCCT ACCCTGTCCXS 2760 

CAGTGGCCAA GCTCATCATC CX5CTCGGTGA GOGQATCCCT TCCCGAGGGG GTACCAOGGG 2820 

TGAATGGCGA GCAGCACCAC TGGGACATGT C6CTGC06CT CAT0GTX3ACT CTGAQCACTA 2880 

TCTCCATCAT CCTCCTAOCXS GCCATGATCA GCAT00CCX3T CMOTQCMO OOOGAGAACA 2940 

A6GAGATC00 CACTTACAAC T60CGCATOG COQAGTACAG CCACCCX3CAG CTGGSTGGGG 3000 

GCAAGGGCAA GAAGAAGAAO ATCAACAAAA ATQATATCAT GCTGGTGCAG AGCGAAOTGG 3060 

AGGAGAGGAA OGCCATGAAC GTCATGAACG TGGTGAGCAG CCCCTCCCTG GCCACCTCCC 3120 

CXATGTACTT OGACTACCAG ACCCX3CCTGC GOCTCAGCTC GCCCOGGTGG GAGGTGATGT 3180 

ATCTCAAACC GGCCTCCAAC AACCTGACTG TCOCTCAGGO GGAGGOGGGC T60CACACCA 3240 

GCTTCACX^GG ACAAGGGACT AATGGAAGCG AGACCCCTGC CACTOQGATG TCCATAATTC 3300 

AGACAGACAA TTTTCCOGCA GA6CCCAATT ACATGGGCAO CAGGCA6CAG TTTGTTCAAA 3360 

GTATTTCAGT AGCTCCACGT TTAAGGACCC AGAAAGAGCC AGCCTGAGAG ACAGTGGGCA 3420 

CGGGGACAGT GATCAGGCTG ACAGTQACCA AGACACTAAC AAAGGCTCCT GCTGTGACAT 3480 

6TCTGTTAGG OAGGCACTGA AGAT6AAAAC TACTTCAACT AAAAGOCAAC CACTTGAACA 3540 

AGAACCAQAA GAOTGTGTTA ATTGCAGAGA TQAATGCOQA U'X' Q CnWfC ATTCTGACAG 3600 

GTGCTGGATG CCACAGTTCC CTGCAGCCAA TCAGGCTGAA AATGCAGATT ACCGCACAAA 3660 

TCTCTTTGTA CCTAC3VGTTG AAGCTAATGT TGAGACTGAG ACTTAOGAAA CTGTQAATCC 3720 

CACXGGGAAA AA6ACTTTTT GTACATTTGG AAAAGACAAG CQAGAGCACA CTATTCTCAT 3780 

T6GCAACXITT AAACCTTATT TAAAAGCCAA ACXSTGCOCTO A6CCCTCTGC TOCAAGAGOT 3840 

CCCCTCAGCA TCAAOCAGCC CAACCAAGGC GTGCATOIAG CCnOCACCT CAACAAAAGG 3900 

CTCXXXGGAT GGCTGTGAAG C3UUUICGAG6 A6CCCTGGCT 6AAG€»AGCA GTCAGTACTT 3960 

GCCCACTGAC AOTCAATATC TGTCACCTAO TAAOCAACCA AGAQACCCTC CCTTCATGGC 4020 
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TTCCGATCRG ATGGCAAGGO TCTTTGCAGA TGTGCMTCC AQAGOCAGCC GGQATTCCAO 4080 

TGAGATOGGT GCTQTTCTTG AGCAGCTTGA CCACCCCAAC AGGGMCIGG 6CAGAGAGTC 4140 

TGTGGATGCA GAGGAAQTTG TOAGAGAAAT TGATAAGCTT TTGCAAOACT GCCGGGGAAA 4200 

CGACCCPGTG GCTCTGAGAA AGTGAAAAAA GAAAAAAAAA AAGGCATTGG CATTTTCTTG 4260 

J TCTCTTCTGT TGATTTAAAA ATGATGCCTC CTGOTQATAA CCCATTTTAC AGGGATQAAG 4320 

AAAGACCAAT GCTGCTTTAA GGCTTTTAGT GAACftTCTGA AGTGCCC21CA AGTATGTTCT 4380 

TTCCACTGCT GATTTCTTTT TCAGAOATAA CAATGGTTTC GTTTTQAGCA AACTTOTATT 4440 

AGGACAGAAT TAATGATGCT TAAAGAGAAA AGAAAAAAAG AOAGAAGAAA AAGGAGAGAT 4500 

^ GAAAAAGOAG GATQAGGAQA AGAATTACCT TTTGACAATC TGTTAGGAAG GTATGCAGTC 4560 

lU TG AGAA CTGA AGTATTTCTG ATCACTCTCA GACTOTCCTC CGTGATTTAT GCT6ACTTAA 4620 

CTGTTTACCT ATAAACCCCA TACAAAGCAG 6GTCATAATT TGTGATCTGT GGTGGATTTC 4680 

TAGCAGTCAT CACAGOCTTC TACTGAAAOT GCTOAAAAGA CCTTGCAGTA GTCCAAGCTA 4740 

CACCAAACAT TAACACATAT TTGTGGTAAA CATTTCTGTA TAAAGTTACC TGACACACAT 4800 

ATAAACACAA GGAACATTCC ATATCATTAQ TCX3AAAACAA AAACAAAAAA AAAACCTTTG 4860 

GTCATTTGTA AGACATCTCA TGTCATATAA AAGTTAAATO TAAAAAGATA CAGTCCATTT 4920 

TGTCCTGCAC ACA0GTA6AC TAATTCAOGT GAAAAAAAAA AAAAAA 4966 
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Seq ID NOt C67 Z7KA Sequence 

Nucleic Acid Accession 8: NM_00S601.2 

Coding sequence t 101.. 598 

1 11 21 31 41 51 

I I I I I I 

CCCAGGAGTC TGGGTGCACA GCCTCCTTCT CTCTGAQATr CAAGAGTCTG ATCAOCAGCC 60 

TCTTCCrCCT CCaCGACCCA QAAGCCCTGA GCTTATCCCC ATG6A6CTCT GCCGGTCCCT 120 

GGOCCTGCTG GGGG6CTCCC TOOaCCTGAT GTTCTGGCTG ATTGCTTT6A GCACGGATTT 180 

CT60TTTGA0 GCTGTGGGTC CCACCCACTC AGCTCACTOG GGCCTCTGGC CAACAGGOCA 240 

TGGGGACATC ATATCAGOCT ACATCCACX^ GAOQCAGACC TTCAGCATTA TGGCTGTTCT 300 

OTGGGCCCTG GTGTCCGTGA GCTTCCTGGT CCTGTCCTGC TTCCCCTCAC TQTTCCCCCC 360 

AGGCCA06GC COGCTTGTCT CAACCACOGC AGCCTTTQCT GCAGCCATCT CCaTGGTGGT 420 

GGCCAT66G6 GTQTACACCA GGQAG06GIG G6ACCAG0CT OCACACCCCC AGATCCAGAC 480 

CTTCTTCTCC TGOTCCTTCT ACCTGGGCTG GGTCTCAGCT ATCCTCTTGC TCTGTACAG6 540 

TGCCCTGAGC CTGGGTGCTC ACTGTGGOGG TCCCCGTCCT G6CTATQAAA CCTTGTGAGC 600 

AGAAGGO^AQ AGOGGCAAGA TGAGTTTTGA GC6TTGTATT CCAAAGGCCT CATCTGGAGC 660 

CTCGGGAAA6 TCTGGTCCTA CATTTGCCC3S CCCTTCCAGC CCTTCCCCA6 OCCCTCCTCT 720 

TGTTTCTTCA TTCATTCAAC AAAATTT6GC T6QAAAAAAA AAAAAAAAAA AAAAAAAAAA 780 

AAA 783 



Seq 10 MOt C68 DNA Sequence 
Nucleic Acid Accession ff: MM_00€433.2 
Coding sequence: 129.. 566 " 

^-1 11 21 31 41 51 

45 I I I I I I 

GTATCTGTGG TAAACOCAGT GACAOGGGGG AGATGACATA CAAAAAGGGC AOGACCTGAG 60 

AAAGATTAAG CTGCAGGCTC CCTGCCCATA AAACAGGGTG TGAAAGGCAT CTCAGCGGCT 120 

GCCCCACCAT GGCTACCTGO GCCCTCCTGC TCCTTGCAGC CATGCTCCTG GGCAACCCAG 180 

GTCTGGTCTT CTCTCGTCTO AGCCCTQAGT ACTAOQACCT GGCAAGAGCC CACCTGOGTG 240 

jU ATGAGGAGAA ATCCTGCCOQ TGCCTGGCCC AGGAGGGCCC CCAGGGTGAC CraTTGACCA 300 

AAACACAGGA GCTGGGCOGT GACTACAGGA OCTGTCT G AC GATA6TGCAA AAACTGAAQA 360 

AGATGGTGGA TAAGCCCACC CAGAGAAGTG TTTCCAATGC TGCGACCCGO GTGTOTAGGA 420 

OGGGGAGOTC ACGATGOCGC GACGTCTGCA GAAATTTCAT GAGGAGGTAT CAGTCTAGAG 480 

TTACCC AGGG CCTCGTGGCC GGAGAAACTO CCCAGCAQAT CTQTQAGQAC CTCAGGTTGT 540 

6TATACCTTC TACAGGTCCC CTCTGAGCCC TCTCACCTTG TCCTGTGQAA GAAGCACAGG 600 

CTCCTGTCCT CAGATCCOGG QAAOCTCAGC AACCTCT6CC GGCTCCTOGC TTCCTCGATC 660 

CAOAATCCAC TCTCCAGTCT O CCTCCCCTG ACTCCCTCTG CTGTOCTCCC CTCTCACGAQ 720 

AATAAAGTGT CAAGCAAG 736 



Seq ID NO I C69 DNA Sequence 

Nucleic Acid Accession fti NM_00298S.2 

Coding sequence : 69 . . 344 



1 11 21 31 41 51 

I I I I I i 

GCTGCAGAGG ATTCCTGCAO AGGATCAAGA CAGCAOGTGG ACCTOGCACA GCCTCTCCCA 60 

CAGGTACCAT GAAGGTCTCC G0G6CAGCCC TCGCTGTCAT CCTCATTGCT ACTGCCCTCT 120 

GCGCTCCTGC ATCXOCCTCC CCATATTCCT CGGACACCAC ACCCTGCTGC TTTGCCTACA 180 

/U TTGCCOGOCC ACTGCCC06T GCCCACATCA AGGAGTATTT CTACACCAOT GGCAAGTGCT 240 

CCAACCCAQC AQTCOT C TTT GTCACCGGAA AGAACCGGCA AGTGTGTGCC AACCCAGAGA 300 

AGAAATGGGT TCOOQAGTAC ATCAACTCTT TQGAGATGAO CTAGGATGGA GAGTCCTTQA 360 

ACCTXSAACTT ACACAAATTT GCCTGTTTCT GCTTGCTCTT GTCCTAGCTT GGGAGGCTTC 420 

CCCTCACTAT CCTACCCCAC CCQCTCCTTG AAGGGCCCAG ATTCTACCAC ACAGCAGCAO 480 

/O TTACAAAAAC CTTCCCCAGG CTGGAOGTGG TGGCTCAOGC CTGTAATCCC AGCACTTTGG 540 

GAGGCCAAGO TGGGTGGATC ACTTGAGGTC AGGA6TTC3A QAOCA6CCT0 GCCAACAT6A 600 

TQAAACGCCA TCTCTAGTAA AAATACAAAA AATTAOCOGG GCGT6GTAGC 6GG0GCCTGT 660 

AGTCCCAGCT ACTCGGGAGG CTOAGGCAGG AGAATGGOGT GAACCCGGGA GGOGGAGCTT 720 

GCAGTGAGCC GAQATCGCGC CACTGCACTC CAGCCTGGGC GACAGAGCGA GACTCOGTCT 780 

GAAAAAAAAA AAAAAAAAAA AAAATACAAA AATTAGCCOG 0CGT6GTGGC CCAaSCCTGT 840 

AATCCCAGCr ACT066GAO0 CTAAGGCACG AAAATTGTTT GAAOCCAGGA OGTGQAOGCT 900 

GCAOTQAGCr aAOATTGrQC CACTTGACTC GAQCCTGOOT GAGAAAGTGA GACTCCGTCA 960 

GAACAACAAC AACAAAAA6C TTCCCCAACT AAAQCCTAGA AGAGCTTCTO AGOCGCrGCT 1020 

TTGTCAAAAG 6AAOTCTCTA GGTTGTGAGC TCTGGCTTTG CCTTG6CTTT GGCAGGGCTC 1080 
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TOTGACCAGO AAGGAAGTCA QGATGOCTCT AGAGGCAAGQ A0GGGAO6AA CACTGCACTC 1140 

TTAA6CTTCC GCOGTCTCAA CCOCTCACAG GA6CTTACTG GCAAACATGA AAAATOSGCT 1200 

TACCATTAAA GTTCTCAATO CAAOCATAAA AAAAAAA 1237 

Seq ID NO I C70 ONA Sequence 

Nucleic Acid Accession ft* NN_033154.2 

Coding sequence: 1381.. 1722 ~ 

1 11 21 31 41 SI 

I 1 I I I I 

AGTQTGGTTT TAOTTTTTCC TAAGAAGTGG 0GTG6TTTGG G6CTTTATAT CX3GG6A60AG 60 

CATATGTACX5 CAAATCCTGG GGCGTTTGCA AACCCGGATC OGGGGCGTCT GGCCCCATGC 120 

CCGGCC3GGGC GTTTGAGGGC TACTGCCACG CAGOGTTTCT GGAGCCTGCC GGCTGGTGCC 180 

CTGGTGGCCT TTATCTCT6T CCCCCrnxyr CCTCTTTATC TCAGGCTCTC CAGGAGGCCG 240 

GGGGGCCCAC TCOGCCTATC GCTCCCCTOG GCTACGCTGC CACTCCAATQ CGCG6CAGGT 300 

0606AGCTGC T G TTCTTTOQ AAGGC6C0G6 AGAACCAGGG GCGTCCOGCG CCAGCTCTGA 360 

CTOGGAGCAG OGCCGAGCAC TGACGCTCCC GCOCTTGGGC AAGGAOGCCA GTGCX5<X06C 420 

6CGCGTCCCT CTGCGCX3QCA GCCCGTCGCX3 GGCCCTCAAG GQQAAQCCCA GQCCAGOATQ 480 

GCCCOGGGTC 60QCGGTGGC 06GGCTCCTG TTGCTGGCGG CCGCCGGCCT OGGAGGAGTG 540 

GOSGAGGGGC CAGQOCTAGC CTTCAGCGAG GATGT6CTGA GOGTGTTCGG C6CGAATCTG 600 

AGCCTGTG6G 0GG0GCA6CT GCAQCACrTG CTGGAGCAGA TGGGAGC06C CTCCOSOGTG 660 

GGCOTCCCGG AGCCTGGCCA GCTGCACTTC AACCAGTGTT TAACTGCTGA AGAGATCTTT 720 

TCCXrrrCATG GCTTTTCAAA TQCTACCCAA ATAACCAGCT CCAAATTCTC TGTCATCTGT 780 

CCA6CAGTCT TACAGCAATT GAACTTTCAC CCATGTGAGG ATOGGCCCAA GCACAAAACA 840 

AGACCAAGTC ATTCAOAAGT TTGGGGATAT GGATTCCTST CAGTOAOQAT TATTAATCTO 900 

GCATCTCTCC TCGGATTGAT TTTGACTCCA CTGATAAAGA AATCTTATTT CCCAAA6ATT 960 

TTGACCTTTT TTGTGGGGCT GQCTATTGOG ACTCTTTTTT CAAATGCAAT TTTCCAACTT 1020 

ATTCCAGAGG CATTTGGATT TGATCCCAAA GTCX5ACAGTT ATGTTGAGAA GGCAGTTGCT 1080 

GTGTTTGGTQ GATTTTACXT ACTTTTCTTT TTTGAAAGAA TGCTAAAGAT GTTATTAAAG 1140 

ACATAT6GTC AGAATGGTCA TACCCACTTT GQAAATGATA.ACTTTGGTCC TCAA6AAAAA 1200 

ACTCATCAAC CTAAAGCATT ACCTGCCATC AATGGTGTGA CATGCTAT6C AAATCCTGCT 1260 

GTCACAGAAG CTAATGGACA TATCCATTTT GATAATGTCA GTQTOaTATC TCTACAGQAT 1320 

GGAAAAAAAQ AGCCAAGTTC ATGTACCTGT TTQAAGGGGC CCAAACTGTC AGAAATAGGQ 1380 

ACGATTGCCT GQATGATAAC 6CTCTG0GAT GGCCTCCACA ATTTCATCGA TGGCCTGGOO 1440 

ATTGG06CTT OCTGCACCTT GTCTCTCCTT CAGQGACTCA OTACTTCCAT A6CAATCCTA 1500 

TGTGAGGAGT TTCCCCA06A 6TTAGGAQAC TTTGTGATCC TACTCAATGC AG6GATGA6C 1560 

ACTCGACAAG CCTTGCTATT CAACTTCCTT TCTGCATGTT CCTGCTATGT TGGGCTAGCT 1620 

TTTGGCATTT TGGTGGGCAA CAATTTCGCT CCAAATATTA TATTTGCACT TGCTGQAGGC 1680 

AT6TTCCTCT ATATTTCTCT GGCAGATATG TTTCCAGAGA TGAATGATAT GCTGAGAQAA 1740 

AAGGTAACTG 6AAGAAAAAC OGATTTCACC TTCTTCATGA TTCA6AAT6C TGGAATGTTA 1800 

ACT6GATTCA CAGCCATTCT ACICATTACC TTOTATGCAO GAQAAATOQA ATTGGAOTAA 1860 

TAQAAAATGG AAQATGGTGT TGTTAATAAA GGCATTTAAT AGATAAAAAC ATCTCCAAAA 1920 

AGGATTTTGA AOCTGATCCT ATTTAGTTAA AAAGATAATT TTOCTTTCAA CTOTAGGTCC 1980 

AGAAAACTAA TTATTGGCAT CAOTCTGTGA AATAGTCCAT TATTTGTTGT TAAAAATGCT 2040 

TCAAAAGGTT TTCA6TGTCA GTCTQA6ATG CCTGGTATAT AGGAGCCTTT GG6AAATACT 2100 

TATTTTTCAO TATTCXaVTGC ATATTAGATA TCACCATOAA GCAAOAQACA TGCATTCTAT 2160 

AATCaTGTAG ACACTCAGAC TCAGGGGAAA ATACAAOTTA TATCCTGAAA GCCTTTAAAA 2220 

CTCTATGGTA GQATCAAAGA TTCAAATGGT TTCAGAGAGG TTTTATTTCA ATTAATTTGT 2280 

TCTAGT6CTT TCAAGAGCAA GTACATCAAA ATGTAGAAGG TAAAATGTAT 6CAACACTAA 2340 

TATAAATTAT TCCAAGTCTT TAAGGAGCCA AAGAAAAAAA AGATTTCTCA CAGCTTTTTG 2400 

TTCTGTTTTG TATTTCAATT AGGAACTTGC AGTATTATTT TGAAAACCAT TCTAAAATAA 2460 

TAGGAGTTAG GAAATAAATA AAOTTTTGCT AGCCCTGCTA AGTTCAGGCT TAGAGGCTTA 2520 

TOGCTAAGTN TAAACTTCAC CAGATTCCAC GAAAAGCTGQ ATAGCTTTTT TTCTGACTTA 2580 

TGTTOTGGTT GCACCCCTCA CAAATGGCAG AACAGTATGT AAAGCTGGTA ACACCTCGGT 2640 

TTCAGTGCAC CATOTOTTTG CTTTOTGAAO GTGAAGAATA TGTTGGTTTA QAGAAAQAAA 2700 
TTGGATGTAA TTTTATGCAA TTTACTTTTA AAGACAAACA TAACTATTTA GCAGAGAATA 2760 
TTTTAATAAA TGCAAAACAA CAGCTGGACT GCTGTACATC AAGGACAGAT TAACTGQAAA 2820 
ACATATGTTC CTTATGTGTG ATTGAGAGCC ATTCAGAAAA GACTTCCTTT GTGTTCAGCC 2880 
TATACTTTTC CATATOOTAT ACCTTGAAAA AAATTAGCAC ACCATGGTTA TTTTTCTACC 2940 
TTTTATAAAA GACAGAGCCT GTTTACTCAT TTAGAAGATA GAGAAAATTG GTCTAAAATT 3000 
GAACATCCTA GATTCACACT CCCAAGTCAC TTAAGGTGAT TTGATGGTGA GGAAAATGAT 3060 
TGACAAAGCC CAACAATQAT CTCAGOAATT ACATTTTCCA ACAGACCAAA AAATQTTTTC 3120 
ATOTAGCAGC AATGCAGATT TGGTGAATAT TTAATATATA TTTTAGTATG TATTTCACTT 3180 
TATGACTGAC AATTAAAAAA TATTGTTTGG CXAAATAGTA AACACCCTTT TGAAACCATG 3240 
AAAAAA 3246 



rjfx Seq ID NO: C7l ONA Sequence 

/U Nucleic Acid Accession #: NM 004184.2 



Oodlng sequence i 188 . . 1603 

1 11 21 31 41 SI 

I 1 I t I I 

CQAAAAAAGA GGGGAAGAGT ATTAAAGACC ATTTCTGGCT GGOCAGQGCA CTCTCAGCAO 60 

CTCAACTGCC CAGCGTGACC AGTGGCCACC TCTGCAGTGT CTTCCACAAC CTGGTCTTQA 120 

CTCQTCTGCT GAACAAATCC TCTGACCTCA GGCCGGCTGT GAACXSTAaTT CCTQAQAOAT 180 

AGCAAACATG CCCAACAGTG AGCCXX3CATC TCTGCTGGAG CTGTTCAACA GCATCGCCAC 240 

ACAAGGGGAG CTCGTAAGGT CCCTCAAAGC GGGAAATGOG TCAAAGGATO AAATTGATTC 300 

TGCAGTAAAG ATOTTGQTGT CATTAAAAAT GAGCTACAAA GCTGCOGOGG GGGAGGATTA 360 

CAA6GCTGAC TGTCCTCCAG GGAACCCAGC ACCTACCAGT AATCATQGCC CAGATGCCAC 420 

AGAAOCTQAA GAGGATTTTG TGGACCCATG GACAGTACAO ACAAGCAGTG CSU^UUSGCAT 480 

AGACTAOQAT AA6CTCATTQ TTGGGTTTGO AAOTAGTAAA ATTGACAAAG A6CTAATAAA 540 

COOAATAGAG AGAGCCACOO GCCAAAOACC ACACCACTTC CTGCGCAGAG GCATCTTCTT 600 
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CTCACACAGA GATATGAATC AGQTTCTTGA TGCCTATGAA AATAAQAAGC CATTTTATCT 660 

GTACACX3GGC OGGGOCCCCT CTTCTGAAGC AATGCATGTA GGTCACCTCA TTCCATTTAT 720 

TTTCACAAAG TGGCTCCAGG ATGTATTTAA OGTOCCCTTG GTCATCCAGA TGAOGOATGA 780 

SGBAOAAOTAT CTBTGGAAGG ACCTGACCCT GCSAGCAGGCC TATG6GSATG CTGTT6AGAA 840 

TGCCAAGOAC ATCATCOCCT OTGGCTTTGA CATCAACAAO ACTTTCATAT TCTCTGACCT 900 

OQACTACATG GGC5ATGAGCT CAG G TTT C TA CAAAAATGTG GTGAAGATTC AAAAQCATGT 960 

TA CCTTC AAC CAAGTGAAAQ GCATTTTC3C3G CTTCACTGAC AGC3GACTC3CA TTGGGAAGAT 1020 

CAGTTTTCCT GCCATCCAGG CTOCTCCCTC CTTCAGCAAC TCATTCCX3VC AGATCTTCCG 1080 

- - AGACAGGAGO GATATCCAGT GCCTTATCCC ATGTGCCATT GACCAGGATC CTTACTTTAG 1140 

W AATQACAAQG GAOOTCGCXX: CCAGQATCGQ CTATOCTAAA CCA6CCCTGT TGCACTCCAC 1200 

CTTCTTCCCA GCCCTGCAGG GCGCCCAGAC CAAAATGAGT GCCAGCGACC CAAACTCCTC 1260 

CATCTTCCTC AC0GACACX3G CCAAGCAGAT CAAAACCAAG GTCAATAAGC ATGCX?rTTTC 1320 

TGGAGGGA GA GACAOCATQG AGGAGCACAG GCAiGTTrGGa GGCAACTGTG ATOTGQAOGT 1380 

- GTCTTTCATG TAOCTOACCT TCTTCCTCGA GGACGACX^AC AAGCTOOAGC AGATCAGGAA 1440 

iD GOATTACACC AGOGGAGGCA TGCTCACOGG TGAGCTCAAG AAGGCACTCA TAGAGGTTCT 1500 

GCAGCCCTTG ATCGCAGAGC ACCAQGCCCG GCGCAAGGAG GTCAOGGATG AGATAGTGAA 1560 

A6AGTTCATG ACTCCCCGQA AGCTGTCXnT CGACTTTCAG TAGCACTOGT TTTACATATG 1620 

CTT ATAA AAG AAGT6A7GTA TCAGTAATGT ATCAATAATC CCAGCCCAQT CAAAGCACCG 1680 

_ CCAOCTGTAG GCTTCTOTCr CATGGTAATT ACTGGGCCTG GCCTCTGTAA GCCTGTGTAT 1740 

ZU OTTATCAATA CTGTTTCTTC CTGTQAGTTC CATTATTTCT ATCTCTTATG GGCAAAGCAT 1800 

TGTGGGTAAT TGGTGCTGGC TAACATTGCA TGGTCGGATA QAGAAGTCCA GCTGTGAGTC 1860 

TCTCCCCAAA GCAGCCCCAC AGTGGAGCCT TCGGCTGGAA GTCCATGGGC CACCCTGTTC 1920 

TTOTCCATGG AGGACTTCCG AGGGTTCCAA GTATACTCTT AAGACCCACT CTGTTTAAAA 1980 

^WlTAIATTC TATGTATGCG TATATGGAAT TGAAATGTCA TTATTGTAAC CTAGAAAGTG 2040 

CTTTGAAATA TTGAT6TGGG GAGGTTTATT GAGCACAAGA TGTATTTCAG CCCAT6CCCC 2100 

CTCCCAAAAA GAAATTGATA AGTAAAAGCT TCGTTATACA TTTGACTAAG AAATCACCCA 2160 

GCTTTAAAGC TGCTTTTAAC AATGAAGATT GAACAGAGTT CAGCAATTTT GATTAAATTA 2220 

AGACTTGGGG OTGAAACTTT CXSVGTTTACT GAACTCCAGA CCATGCATGT AGTCCACTCC 2280 

in CTOGCTTCCC TTGGCACACC AQTGTTCTCC TGCCAAATGA CXXTAGACCC 2340 

3U TCTOTCCTQC AGAGTCAGGG TQGCTTTTCC CCTQACTGTG TCCGATGCCA AGGAGTCCTG 2400 

GCCTCOGCAG ATGCTTCATT TTGACCCTTQ GCTGCAGTGG AAGTCAGCAC AOAGCAGTGC 2460 

CCTGGCTGTG TCCTGGACGG GTGGACTTAG CTAGGGAGAA AGTCX3AGGCA GCAGCCCTCG 2520 

AGGCCCTCAC AGATGTCTAG GCAGGCCTCA TTTCATCACO CAGCATOTGC AGGCCTGGAA 2580 

^ _ GAGCAAAGCC AAATCTCAGG GAAGTCXnTG GTTGATGTAT CTGGGTCTCC TCTGGAGCAC 2640 

J J TCTGCCCTCC TGTCACCCAG TAGAQTAAAT AAACTTCCTT GGCTCX^'AAA AAA 2693 
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Seq ID NO: C72 DNA Sequence 

Nucleic Acid Accession #t NM_00493B.l 

Coding sequence: 337.. 4632 



i 11 21 31 41 SI 

) 1 ) I I I 

OSGAGGACAG CCGGACCGAQ CCAACX3CCGG GQACTTTGTT CXXTTCCACGG AGGGGACTCG 60 

GCAACTCGCA GCGQCAGGGT CTGGGGCC6G CGCCTGGGAG GGATCTGCX3C CCCCCACTCA 120 

CTCCCTAGCT GTGTTCCCGC CGCCGCCCCG GCTA6TCTCC GGCGCTGGCG CCTATGOTOG 180 

GCCTCCGACA GCGCTCCG6A GGGACCGGGG GAGCTCCCAG 6C8CC0G6GA CTGGAGACTG 240 

ATGCATGAGG G GCCT ACGGA G6CX3CA6GAO OGGTG G TGAT GGTCTGG6AA GGGGAGCTGA 300 

AGTCCCCTGQ GCTTTGGTGA GGCX3TGACAG TTTATCATGA COGTG T TCAG GCAGGAAAAC 360 

GTGGATGATT ACTAOGACAC CGGCGAGGAA CTTGGCAGTG GACA6TTTGC GGTTGTGAAG 420 

AAATGCCGTG AGAAAAGTAC CGGCCTCCAG TATGCCGCCA AATTCATCAA GAAAAGGAGG 460 

ACTAA6TCCA GCCX3GCGGG6 T0TGAGC06C GAGGACATOG A60GGGAGGT CAGCATCCTG 540 

AAOGAGATCC AGCACCCCAA TGTCSVTCaCC CTOCAOQAGQ TCTATQAQAA CAAGAOGGAC 600 

GTCA TCCTG A TCTTGGAACT CGTTGCAGGT GGCGAGCTGT TTGACTTCTT AGCTGAAAAG 660 

GAATCTTTAA CTGAAGAQGA AGCAACTQAA TTTCTCAAAC AAATTCTTAA TGGTGTTTAC 720 

TACCTQCACT CCCTTCAAAT OGCCCACTTT GATCTTAAGC CTGAGAACAT AATGCTTTTG 780 

GATA6AAATG TCCXrCAAACC TC3GQATCAAO ATCATTGACT ITGGGTTGGC CCATAAAATT 840 

GACTTTGGAA ATGAATTTAA AAACATATTT QGGACTCCAG AOTTTOTCGC TCCT6AGATA 900 

GTCAACTATG AACCTCTTOG TCTTGAGGCA GATATGTGGA GTATOGGGGT AATAACCTAT 960 

ATCCrCCTAA GTCGGGCXTTC CCCATTTCTT GGAGACACTA AGCAAQAAAC GTTAGCAAAT 1020 

GTATC06CTG TCAACTAOOA ATTTQAGGAT GAATACTTCA GTAATACCAG TGCCCTAGCC 1080 

A AAGATT TCA TAAGAAGACT TCTGGTCAAG GATCGAAAGA AOAOAATGAC AATTCAAGAT 1140 

AGTTTGCAGC ATCCCTGGAT CAAGOCTAAA GATACACAAC AGGCACTTAG TAGAAAAGCA 1200 

TCAGC AGTA A ACATGGAGAA ATTCAA6AAG TTTQCAGCCC GGAAAAAATO GAAACAATCC 1260 

GT TOGCT TGA TATCACTGTG CCAAAGATTA TCCAQGTCAT TCCTGTCCAG AAGTAACATG 1320 

AGTGTTGCCA GAA6CGATGA TACTCTGGAT GAGGAAGACT CCTTTGTGAT GAAAGCCATC 1380 

ATCCAT6CCA TCAAOGATGA CAAT6TG0CA GGCCTGCAGC AOCTTCTG GO CTCATTATCC 1440 

AACTATGATG TTAACCAACC CAAC3UVGCAC GGOACACCTC CaVTTACTCAT TGCT G CTGGC 1500 

TGTGGGAATA TTCAAATACT ACAGTTGCTC ATTAAAAGAG GCTOGAGAAT CGATGTCCAG 1560 

GATAAGGGOQ GGTCCAATGC OGTCTACTGG GCTGCTCGGC ATGGCCACGT CGATACCTTG 1620 

AAATTTCTCA GTGAGAACAA ATGCCCTTTQ GATGTGAAAG ACAAGTCTGG AOAGATGGCC 1680 

CTCCACOTGQ CAGCTCX3CTA TGGCCATGCT 6A08IG6CTC AAGTTACTTO TQC3V6CTT06 1740 

6CTCAAATCC CAATATCCAG GAGAAAOQAA GAA6AAACCC CCCTGCACTG TGCTGCTTGG 1800 

CACGGCTATT ACTCTGTGGC CAAAGCCCTT TGTGAAGCCO GCTGTAACX3T GAACATCAAG 1860 

AACXX5AGAAG 6AGAGACGCC CCTCCTGACA GCCTCTGCCA GGGGCTACCA CGACATCGTQ 1920 

/j GAGTGTCTGG CCGAACATGQ AGCCGACCTT AATGCTTQCG ACAAGGACGG ACACATTGCC 1980 

CTTCATCTGG CTGTAAGACG GTGTCAGAT6 GAGGTAATGA AGACTCTCCT CAGCCAAGGG 2040 

TGTTTCGTCG ATTATCAAGA CAG6CAC0GC AATACTCOCC TCCATGTGGC ATOTAAAQAT 2100 

GGCAACATGC CTATOGTGGT GGCCCTCTGT GAAGCAAACT GCAATTTGGA CATCTCCAAC 2160 

AA6TATGGGC GAACGCCTCT GCACCTTGOG GCCAACAAOG GAATCCTAQA CGTGGTCCGG 2220 

TATCTCTGTC TGATGGGAGC CAGCGTTGAG GCX»:TGACCA CGGACGGAAA GAOGGCAGAA 2280 

GATCTTGCTA GATCGGAACA GCAOGAGCAC OTAGCAGGTC TCXTTTGCAAG ACTTCGAAAG 2340 

GATACGCACC GAGGACTCTT CATCCAGCAG CTCCGACCCA CACAGAACCT GCAGCCAAGA 2400 

ATTAAGCTCA AGCTGTTTGG CCACTOGGGA TC0Q6GAAAA CCACCCTTGT AGAATCTCTC 3460 

AAOTGTGGGC TGCTQAGGAG CTTTTTCAGA AGGC3nXXX3C CXAGACTGTC TTCCACCAAC 2520 
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TCCAGCAGGT TCCCACCTTC ACXXCTGGCT TCTAAGCCCA CAGTCTCAGT GAGCATCAAC 2580 

AACXnXTTACC CAGGCTGCXyv GAAOGTGAGT GTGAGGAGCC GCAGCATGAT GTTCGAGCCG 2640 

GOTCTTACCA AAGGGATGCT GGAGOTGTTT GTGGCCCCGA CCCACCACCC GCACTGCTC3G 2700 

GCOGATGAOC AGTCCACCAA GGCCATOGAC ATCCAOAAGG CTTATTTGAA TGGAGTTGGC 2760 

D GATTTCAGOG TGTGGGAGTT CTCTGGAAAT OCTGTGTATT TCTGCTGTTA TGACTATTTT 2820 

GCTGCAAATQ ATCCCACGTC AATCCATGTT GTTGTCTTTA QTCTAGAAQA GCCCTATGAG 3B80 

ATCCAGCTGA ACCCAGTGAT TTTCTGQCTC AGTTTCCTGA AOTCCCTTQT CCCAGTTGAA 2940 

6AACCCATA0 CCTTCGGTGG CAAOCTGAAG AACCCACTCC AAGTTGTCCT GGTGGCCACC 3000 

CAOQ CTGAC A TCAT6AATGT TCCTGGACOO GCTGGAG6CG AGTTTGGATA TQACAAAQAC 3060 

lU ACATOGTTGC TGAAAGAGAT TAGGAACAGG TTTGGAAATG ATCTTCACAT TTCAAATAAG 3120 

CTGTTTGTTC TGGATGCTQG GGCTTCTGGG TCAAAGQACA TQAAGGTACT TOGAAATCAT 3180 

CTGCAAGAAA TACGAAGCCA GATTGTTTCG GTCTGTCCTC CCATGACTCA CCTGTGTGAG 3240 

AAAATCATCT CCACQCTOCC TTCCPGGAGG AAGCTCAATG GACCCAACCA GCTGATGTCG 3300 

^ CTGCAGCAGT TTGTGTACX3A CGTGCACGAC CAQCTOAACC CCCTGGCCAG CGAGGAGGAC 3360 

kJ CTCAQGCGCA TTGCTCAGCA GCTCCACAGC ACAGQCGAGA TCAACATCAT 6CAAAGTGAA 3420 

ACAGTTCAGG AOGTGCTGCT CCTGGACCCC CGCTOOCTCT QCACAAAOGT CCTGOOGAAG 3480 

TTGCTGTCCG TOQAGACCCC ACGG6CGCTG CACCACTACC GGGGCOGCTA CACCGT6GAG 3540 

GACATCCAGC GCCTGGTGCX; CXaACAGCGAC 6TGGAGGAGC TGCTGCAGAT CCTCGATGCC 3600 

ATQGACATCT GOGCCCGGGA CCTGAGCAGC GGGACCATGG TGGACGTCCC AGCCCTGATC 3660 

AAGACAGACA ACCTGCACCG CTCCTGGGCT GATGAGGAGQ ACGAGGTGAT GGTGTATGGT 3720 

GGCGTGCGCA TCGTGCCCX3T GQAACACCTC ACCCCCTTCC CATGTGGCAT CTTTCACAAO 3780 

GT(XPS3QTGA ACCTOTGCCO OTGGATCCAC CAGCAAAGCA CAGAGGCOGA 06C0GACATC 3840 

CX3CXTCTGGG TGAATGGCTO CAAGCTGGCX: AACOGTGGGG CCOAQCTGCT GGTGCTGCTG 3900 

OTCAACCACQ GCCAGGGCAT TGAG6TCCAG GTCOGTGGCC TGGAGACGGA GAAGATCAAO 3960 

TGCTGCCTGC TGCTGGACTC GGTGTGCAGC ACCATTQAGA ACX5TCATGGC C:ACX:aC6CT6 4020 

CCAGGGCTCC TGACOGTGAA GCATTACCTG AGCCOCCAGC AGCXXSCGGQA GCACCATGAG 4080 

CCCGTCATGA TCTACCAOCC ACGGGACTTC TTCOGQGCAC AQACTCTGAA GGAAACCTCA 4140 

CTGACCAACA CCATGGGGGG GTACAAGGAA AGCTTCAGCA GCATCATQTG CTTCGOGTOT 4200 

CACGAOGTCT ACTCACAGGC CAQCCTOGQC ATGGACATCX ATGCATCAGA CCTGAACCTC 4260 

CTCACTOGGA GGAAACTGAG TOGCCTGCTQ GACC06CXXG ACCCCCTGQG GAAGGACTGO 4320 

TGCXnrCTCG CCATGAACTT AGGCCTCCCT GACCTGGTGG CAAA6TACAA CAOCAATAAC 4380 

GGGGCTCCCA AGGATTTCCT CCCCAGCCCC CTCCAOOCCC TCCTGOGOQA AT60ACCACC 4440 

TACCCTQAQA GCACAQTGGG CACXZCTCATG TCCAAACTGA GGGAGCTGGG TCX3CCGGGAT 4500 

GCXX3CAGACC TTTTGCTQAA GGCATCCTCT QTGTTCAAAA TCAACCTGGA TGGCAATGGC 4560 

CAGGAGGCCT ATGCCTCGAG CTGCAACAGC GGCACCTCTT ACAATTCC3VT TAGCTCTGTT 4620 

GTATCCCQGT GAGGOCAGCC TCTGGCTTGG ACAGGGTCTG TTT6GACIGC AGAAC^CAAGG 4680 

GGGTGATGTA GCCCATCCTT CCCTTTGGAG AT6CTGAGG6 TGTTTCTTCC TOCACCCAOV 4740 

GCCy«3GGGQA TGCCACTCCT CCCTCC3GGCT TGACCTGTTr CTCTGCOGCT ACCTCCCTCC 4800 

CCGTCTCATT CCGTTGTCTG TGGATGOTCA TTGCAGTTTA AGAQCAGAAC AQATCTTTTA 4860 

CTTTGGCCXSC TTGAAAAGCT AGTQTACCTC CTCTCAGTGT TTTGGACTCC ATCTCTCATC 4920 

C TCCAG TACC TTGCTTCTTA CTGATAATTT TGCTG6AATT CCTAACTTTT CAATGACATT 4980 

TTTTTTAACT ATCATATTQA TTGTCCTTTA AAAAAGAAAA GTGCATATTT ATCCAAAAT6 5040 

TGTATTTCTT ATACGCTTTT CTGTGTTATA CCATTTCCTC AGCTTATCTC TTTTATATTT 5100 

GTAGGAGAAA CTCCCATGTA TGGAATCCCA CTGTATGATT TATAAACAGA C^ATATGTGA 5160 

GTGCCTTTTG CAGAAGAGGG TGTGTTTGAA ATCATCGGAG TCAGCCAGGA GCTGTCACCA 5220 

AGG AAACG CT ACCTCTCTOT CCCTTGCTOT ATGCTOATCA TOOCXaGAGG TGCTTCACCC 5280 

TQAQTTTTGT TTTGTATTGT TTTCTGACAG T rm 'C l ' G 'iT TTGTTTGGCA AGGAAAGGGG 5340 

AGAAGGGAAT CCTCCTCCAG GGTQATTTTA TGATCAGTGT TGTTGCTCTA GGAAGACATT 5400 

TTTCCGTTTG CTTTTGTTCC AATGTCAATG TGAACGTCCA CATGAAACCT ACACACTGTC 5460 

ATGCTTCATC ATTCCCTCTC ATCTCA6GTA GAAGGTTGAC ACAGTT6TA0 G6TTACAGA6 5520 

AGCTATGTAA GAATTCAOAA GACCCCTGAC TCATCATTTO TGGCAGTCGC TTATAATTGG 5580 

TGCATAGCAG ATGGTTTOCA CATTTAGATC CT6GTTTCAT AACTTCCTGT ACTTGAAGTC 5640 

TAAAAGCAGA AAATAAAGGA AGCAAGTTTT CTTCCATGAT TTTAAATTOT GATOQAGTrr 5700 

. TAAATTQATA GGAGOOAACA TGTCCTAATT CTTCTGTCCT GAGAAGCATG TAATGTTAAT 5760 

GTTATA TCAT ATGTATATAT ATATATGCAC TATGTATATA CATATATATT AATACTGGTA 5820 

TTTTTACTTA ATCTATAAAA TGTOGTTAAA AAGTTGTTTG ' ri ' lTmtTi ' TTTTTATAAA 5880 

TAAACTGTTG CTOGTTAAAA AAAAAAAAAA 5910 
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Seq ID NO I C73 DNA Sequence 

Nucleic Acid Accession #: NM_002081.1 

Coding sequence I 222*. 1898 

1 11 21 31 41 51 

i I I I I I 

GQCTQCCOGA GCGAGCGTTC GGACCTCGCA CCCOGOGCGC CCCGOGCCGC OGCCGCCGCC 60 

GGCTTTTGTT GTCTCCGCCT CGTOGGCOK: CX3CG60CrCT GGACCQCGAG CX3GCGOGOBC 120 

CGGGAGCTTG GCTCT6CCCT TOGCX3GGCG6 OAACTGGGCA GGACCCGGCC AGGATCCGAQ 180 

AGAGGOGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCC3GC CATGGAGCTC GGGGCCOGAG 240 

GCTGQTGGCT GCTATGTGCG GCOGCAGCQC TOaTCOCCTG OGCCCGCGGQ GACCCX3GCCA 300 

GCAAGAGCOG GAGCTGCGGC GAGGTC06CC AGATCTAOGG AGCCAAGGGC TTCAGCCTGA 360 

GGGAOQTGCX: CCAGG0G6A0 ATCTOGGGTG AGCACCTGOO QAJCTOTCCC GAGQGCTACA 420 

CCTGCT3CAC CAGOGAGATG GAGQAGAACC TGGCCAAOOG CAGCCATGCC GAGCTGOAGA 480 

CCGOGCTCCG GGACAGCAGC CGCOTCCTGC AQGCCATGCT TGCCACCCAG CTGOGCAGCT 540 

TOGATQACCA CTTCCAGCAC CTGCTGAACX3 ACTCGGAGOG GACGCTGCAG GCCACCTTCC 600 

CCGGCX5CCTT CXSQAGAGCTG TACAOSCAGA AOGCGAGGGC CTTCOGGQAC CTGTACTCAG 660 

A6CTGG6GCT GTACTACC6C GGTGCXIAACC TQCACCTGOA GGA0A08CT0 GCOQAGTTCT 720 

6GGCCC3GCCT OCTOGAGCGC CTCTTCAAOC AGCTGCACCC CCA6CTGCT0 CTGCCTGATG 780 

ACTACCTGGA CT6CCTGGGC AAQCAGOCCO AOGCXSCTGOO GCXXTTTCXSGO GAGGCCCCGA 640 

GAGAGCTOOQ CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TOGCTCCTTT QTGCAGGQCC 900 

TGGGCX3TGGC CAGCGACGTO GTCCGGAAAG TGOCTCAGQT CCCCCTGGGC CCGGAGTGCT 960 

CGAGAGCTQT CATGAAGCTO GTCTACTGTG CTCACTOCCT GGGAQTCCCC QGC6CCAGGC 1020 

CXTTGCCCTGA CTATT6CGGA AATGTGCTCA AGGGCTGCCT TGCCAACCAO GCOSACCTGO 1080 

ACXKX3GAGTG GAGGAACCTC CTGGACTCCA TGGTGCT C AT CAOCGACAAG TTCTOQGGTA 1140 

CATCOGGTGT GGAGAGTGTC ATOOGCAGOO TQCACAOOTG 0CTQ6GGGAG GCCATCAACX3 1200 
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CCCTCCAGGA CAACAG6GAC AC6CTCA0Q6 GCAAGGTOIT CCAGOGCTGC 06GAACCCCA 1360 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGC AGAAGCX^GCXS CCGGGGCAAG CTG G CCCCGC 1320 

GGGAGAGGCX: ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 1380 

GOGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

TGAGCACTGC CAGTGATGAC CGCTGCTGGA AOGOGATGGC CA6AGGC0GG TACCTOCCOG 1500 

AGGTCATGG6 TGACGGCCTG 6CCAACCAGA TCAACAACCC OOAGGTGGAG GTGGACATCA 1560 

CCAAGCOGGA CATGACCATC CGGCAGCAGA TCATGCA6CT GAAGATCATG ACCAAC06GC 1620 

TGOGCAGOGC CTACAACG6C AACGAOGTGG ACTTCCAQGA 0GCCAGT6AC GACGGCA60G 1680 

^ GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTG06Q CXX^GAAGGTC AGCAGGAAGA 1740 

lU 0CTCCA6CTC COGGACGCCC TTGACCCATG CCCTGCX3U3G CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCCdVGC CCOCGACCTT OCTCCTGCCC CTCCTCCTCT 1660 

TCCTGGCCCT TACAGTAGCC AGGCCCCQGT G60QGTAACT GCCCCAAGGC CCCAGGQACA 1920 

GAGQCCAAGG ACTGACTTTG CCAAAAATAC AACACAGAOG ATATTTAATT CACCTCAGCX: 1980 

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040 

GTCCCAGCCC CAGGCCTOGC CTCGCCTGCX: TTTCTGCCTT TTAATTTTGT ATGAGOTCCT 2100 

CAGOT CMSCT GGOAGCCAOT OTGCC^AAA GCCATGTATT TCA66QACCT CAGGGGCACC 2160 

TCCX3GCTGCC TAGCCCTCCC CCCAGCItXC T6CAC0QCGG CAQAAGCASC CCCTOOAGGC 2220 

CTACAGAGOA GQCCTCAAAG C3UVCCCGCTG GAGCCCACAO OOAOCCTOTG CCTTCCTCCC 2280 

CGCCrCCTCC CACTGGGACT CCCAGCAGAO CCCACCAQCC AGCCCTGGCC CACCCCCCAG 2340 

CCTCXS^QAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAOAGCC 2400 

TCTGAGATGA TGCATGAT6C CCTCCCCTCA 60GCAOGCT6 CAGAGCCOGG OCCCACCTCC 2460 

CTOOGOCCTT GAGGG6CCCC AGOGTCTGCA GGQTGAOGCC TQAGAGAGCA CCACTGCTGA 2520 

GGAGTCTGAG GACTGTCCTC CCACAGACCC TCCAQTGAGG GGCCCTCCAT GOGCAGATGA 2580 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640 

GGAGGCAGCG TGGOCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GGTCCAGGGC TGTTGGAGQA CCCG6AGGGC TGAGQAGCAG OCAGOACCGG 3760 

CCTGCTCGCA TCCTCACCCA OATCAGGAAC CA06QCCTCC CTGTTCAOGG TGACACAGGT 2830 

CAGGGCTCAO AGTGACCCTC GGCTGTCACC 7GCTCACAGG GATGCTGGTG GCTGGTGAGA 3880 

CCCCGCACTG CACACGGOAA TQCCTAGGTC CCTTCCCQAC CCAGCCAGCT GCACTGCAGG 2940 

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TQTTCGCTCC TCCCAOATGO CTTOGQAGGC CCGCAGG6CC 3060 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACT6AGGCC ATCAGGGCCC TGCCGCAG6C 3120 

CTGGAOGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GT6GCCCTGG GGAGGGGTGG 3180 

TGTGGTGTTG GQAAGQGOTC CTQCAGGGGG AGQAGGACTT GQAGGGTCTG GGGGCAGCTQ 3240 

TCCTGAACCG ACTGACCCTG AGGAGGCCX3C TTAGTGCTGC TTTGCTTTTC ATCACOGTCC 3300 

08CACAGT6G AGQGAOGTCC OCGi B TT GC TQ OTCAOOTGOC CATGGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAG6A6CC CCCAACACAG GCAAGTCCAC CXCATAATAA 3420 

CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCC6GGC GCCAGGACAG 3460 

CAGCACrcCC GCTGCACACA GACGGCCTAG GGGTGGCX3CT CAGACCCCAC CCTACX3CTCA 3540 

TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC T6GTCAGGGC AGTGGCCAAO CCTGCTGT6T 3600 

GCTTCCTCCA CAAOOTCCCC CCACGSCTCA GTQTCAaCQO GTGACXSTGT6 TTCTTTTGAG 3660 

TCCTT6TATG AATAAAAGGC T6GAAACCTA AA 3692 
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Seq ID NOi C74 DNA Sequence 

NUcleic Acid Accession #i BC03020S.1 

coding sequence t 4 S . . 8 7 8 



1 11 21 31 41 SI 

50 I I 1 I I I 

GTGAGC3VGCC CCTAACAGGC T6TTACTTCA CTACAACTGA 06ATATQATC ATCTTAATTT 60 

ACTTATTTCT CTTGCTATGG GAA6ACACTC AAGGATQ6GG ATTCAAQGAT GGAATTTTTC 120 

ATAACTCCAT ATGGCTTGAA CGAGCAGCCG GTGTGTACCA CAGAGAAGCA CXX3TCTGGCA 180 

AATACAAGCT CACCTACGCA GAAGCTAAGG CGGTGTGTGA ATTTGAAGGC GGCCATCTCG 240 

CAACTTACAA GCAOCTAGAG GCAGCCAGAA AAATTGGATT TCATGTCXGT GCTGCTGGAT 300 

GQATGGCXAA GGGGAGAGTT GGATACOCCA TTGTOAAGGC AGGGCCCAAC TGTGGATTTG 360 

GAAAAACTGG CATTATTOAT TATGGAATCC GTCTCAATAO QAGT6AAA6A TGGGAT6CCT 420 

ATTGCTACAA CCCACACX3CA AAGGAGTGTG GTGGCGTCTT TACAGATCCA AAGCAAATTT 480 

TTAAATCTCC AGGCTTCCCA AATGAOTACQ AAGATAACCA AATCTGCTAC TGGCACATTA 540 

GACTCAAGTA TGGTCAGOGT ATTCAGCTGA GTTTTTTAGA TTTTGACCTT GAAGATGACC 600 

CAOGTTGCTT GQCTGATTAT OTtGAAATAT ATQACAGTTA OQATGATGTC CATG6CTTT0 660 

TGGGAAGATA CTGTGGAGAT GAiSCrrCCAG AT6ACATCAT CAGTACAGGA AAT6TCATGA 720 

CCTTGAA6TT TCTAA6T6AT GCTTGAGTGA CAGCTGGAGG TTTCCAAATC AAATATQTTO 780 

CAATGGATCC TGTATCCAAA TCCAGTCAAG GAAAAAATAC AAGTACTACT TCTACTGGAA 840 

ATAAAAACrr rTTAGCTGGA AGATTTAGCC ACTTATAAAA AAAAAAAAAG GATGATCAAA 900 

ACACACAGTG TTTATGTT66 AATCTTTTOa AACrOCTTTO ATCTCACTGT TATTATTAAC 960 

ATTTATTTAT TATTTTTCTA AATGTGAAAG CAATACATAA TTTAOGGAAA ATTGGAAAAT 1020 

ATAGQAAACT TTAAACGAGA AAATQAAACC TCTCATAATC CCACTGCATA GAAATAACAA 1080 

GCGTTAACAT TTTCATATTT TTTTCTTTCA GTCATTTTTC TATTTGTGGT ATATGTATAT 1140 

ATGTACCTAT ATGTATTTOC ATTT6AAATT TTGGAATCCT GCTCTATGTA CAGTTTT6TA 1200 

TTATACTTTT TAAATCTTGA ACTTTATAAA CATTTTCTGA AATCATTGAT TATTCTACAA 1260 

AAACATGATT TTAAACAGCT GTAAAATATT CIATOATATG AATGTTTTAT GCATTATTTA 1320 

AGOCTGTCTC TATTGTTGGA ATTTCAOGTC ATTTTCATAA ATATT6TTGC AATAAATATC 1380 

GTTGAACACA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1430 



Seq ID NO: C7S Z2NA Sequence 

Nucleic Acid Accession «i NM_001982.1 

Coding sequence i 199.. 4227 

1 11 31 31 41 51 

I 1 I I I 1 

CTCTCACACA CACACACCCC TCCCCTGCCA TCCCTCOCGG GACTOCGQCT COGQCTCOQA 60 

TTGCAATTTG CAACCTGOGC TGCCGTOGCC 6CAGCAGCCA CCAATTGGOC A G OGGTTCAG 120 
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GTGGCTCTTO CCTCGATGTC CTAGCXTTAOG GGCCCCCGGG CCGGACTTGG CTGGGCTCCC 180 

TTCACCCTCT GOGGAGTCAT GAGGGCGAAC GACGCTCTOC A GGTG CTGGG CTTGCTTTTC 240 

AGCCTGGCCC GGGGCTCCGA GQTGGGCAAC TCTCAGGCAG 7GTGTCCTGG GACTCTQAAT 300 

G6CCTGAGTG TGAC0G60GA TGCTGAGAilC CAATACXAGA CACTGTACAA GCTCTAOSAa 360 

AGQTGTGAGG TGGTGATGGG GAACCTT6AO ATTGTGCTCA 06GGACACAA TGCCGkCCTC 420 

TCCTTCCTGC AGTGGATTCG AGAAOTGACA GGCTATGTCC TCQTGGCCAT GAATGAATTC 480 

TCTACTCTAC CATTGCCCAA CCTCOGG6TQ GTGOGACGGA CCCAQOTCTA CGATGGGAAG 540 

TTTGCCATCT T00TCAT6TT GAACTATAAC ACCAACTCCA GCCACGCTCT GOGCCAGCTC 600 

OGCTTGACTC AGCTCACCGA GATTCTGTCA GGGGGTGTTT ATATTGAGAA GAACOATAAG 660 

CTTTGTCACA TOOACACAAT TGACTGGAGG QACATCGTGA GGQACCGAGA TGCTGAGATA 720 

GTX5GTGAAGG ACAAT6GCAG AAGCTGTCCC CCCTGTCA7G AGGTTTGCAA GGGGOSATQC 780 

TGQGOTCCTG GATCAGAAGA CTGCCAGACA TPOACCAAGA CXaVTCTOTGC TCCTCAQTGT 840 

AATGGTCACT GCTTTGGGCC CAACCCCAAC CAGTGCTGCC ATQATGAGTG TGCOGGGGGC 900 

TGCTCAGGCX: CTC3«;GACAC AGACTGCTTT GCXnXSCCGGC ACTTCAATGA CAGTGGAGCC 960 

TGTGTACCTC GCTOTCCACA GCCTCTTGTC TACAACAAGC TAACTTTCCA GCTGGAACCC 1020 

AATCCCCACA GCAAGTATCA GTATGGAGGA GTTTGTGTAG CCAGCTGTCC CCATAACTTT 1080 

OTGGTGGATC AAACATCCTG TGTCAGGGCC TGTCCTCCTG ACAAGATGGA AGTAGATAAA 1140 

AATGGGCTCA AGATGTGTGA GCCTTGT6GG GGACTAT6TC CXIAAAGCCTG TGAGGGAACA 1200 

GGCTCTGGGA GCCGCTTCCA GACTGTQGAC TCX3AGCAACA TTGATGGATT TGTGAACTGC 1260 

ACCAAGATCC TGOGCAACCT GGACTTTCTG ATCACCX56CC TCAATGGAGA CCCCTGGCAC 1320 

AAGATCCCTG CCCTGOACCC AGAGAAGCTC AATOTCTTCC GQACAOTAOG GGAGATCACA 1380 

GGTrACCTGA ACATCCAOTC CTG0OC3GCCC CACATGCACA ACTTCAGTGT TTTTTCCAAT 1440 

TTGACAACCA TTGGAGGCAG AAGCCTCTAC AACCGGGGCT TCTCATTGTT OATCATOAAG 1500 

AACTTQAATG TCACATCTCT GQGCTTCCGA TCCCTGAAGG AAATTAGTGC TGGGOOTATC 1560 

TATATAAGTG CXJUVTAGGCA GCTCTGCTAC CACCACTCTT TGAACTGGAC CAAGGTGCTT 1620 

OGGOOQOCTA CG6AA0A008 ACTAQACATC AAflCATAATC GGCCGCX3CAG AGACTGCGTG 1680 

GCAGAGGGCA AAGTGTGTGA CCCACTOTGC TCCTCTGGGG QATGCTGG6G CCCAGGCCCT 1740 

GGTCAGTGCr TGTCCTGTOG AAATTATAGC CGAGGAGQTO TCTGTGTGAC CCACTGCAAC 1800 

TTTCTGAATO GGGAGCCTC6 AGAATTTGCC CATGAGGCOG AATGCTTCTC CTQCCACCCG 1860 

GAATGCCAAC CCATGGGGGG CACTGCCACA TGCAATGGCT OGOGCTCTGA TACTTGTGCT 1920 

CAATGTOCCC ATTTTGGAOA TGGGCCCCAC TQTOTGAGCA GCr6CXX:CCA TGGAGTCCTA 1980 

GGTGCCAAGG GCCCAATCTA CAAGTACCCA GATOTTCAGA ATGAATGTOG QCCCTGCCAT 2040 

GAGAACTGCA CCCAGGGGTG TAAAGGACCA GAGCTTCAAG ACTGTTTAGG ACAAACACTG 2100 

GTGCTGATCX3 GCAAAACCCA TCTGACAATG GCTTTGACAG TGATAGCAGG ATTGGTABTG 2160 

ATTTTCATGA TGCTGGGCGG CACTTTTCTC TACTGGCGTO GGCGCXX3GAT TCAQAATAAA 2220 

AGGGCTATGA GGOGATACTT GGAAGG0G6T GAGAGCATAG AGCCTCTGGA CCCCAGTGAG 2280 

AAGGCTAACA AAGTCTTGGC CAGAATCXTC AAAOAGACAG AGCTAAGOAA GCTTAAAGT6 2340 

CTTGaCTCXKS GTGTCTTTOO AACTGTGCAC AAAGGAGTGT GGATCCCTGA 6GGTGAATC3V 2400 

ATCAAQATTC CAGTCTGCAT TAAAGTCATT GAGOACAAGA GrGQACGGCA GAQTTTTCAA 2460 

GGTGTQACAG ATCATATGCT GGCCATTGGC AGCCTGGACC ATGCCCACAT TGTAAGGCTG 2520 

CTGOGACTAT GCCCAGG6TC ATCTCTGCAG CXTOTCACTC AATATTT6CC TCIGGGTTCT 2580 

CTGCTGGATC ATGTGAGACA ACAC0GG6G0 GCACTQGSGC CACAGCTGCT OCTCAACTGG 2640 

GGAGTACAAA TTGCCAAGGG AATGTACTAC CTTGAGGAAC ATGGTATGGT GCATAGAAAC 2700 

CTGGCTGCCC GAAACGTGCT ACTCAAGTCA CCCAOTCAGG TTCAGGTGQC AGATTTTGGT 2760 

GTGGCTQACC TGCTGCXTTCC TGATGATAAG CAGCTGCTAT ACAGTGAGGC CAAGACTCCA 2820 

ATTAAGT6GA TGGCCCTTGA 6AGTATCCAC TTT6GGAAAT ACAGACACCA GAOTOATGTG 3880 

TGGAGCTATG GTGTGACAGT TTGGGAOTTG ATGACCTTOG OGGCAGAGCX: CTATGCAGGG 2940 

CTACGATTGG CTGAAGTACC AGACCT6CTA GA6AAGGGGG AGOGGTTGGC ACAGCCCCAG 3000 

ATCTGCACAA TTGATGTCTA CATGGTGATG GTCAAQTGTT GGATGATTGA TOAQAACATT 3060 

CGCCCAACCT TTAAAGAACT AGCCAATGAG TTCACCAGGA TGGCCOGAGA CXX31CGA0GG 3120 

TATCTGGTCA TAAAGAQAGA GAGTGGGCCT GOAATAGCXrC CTQGGCCAQA GOOOCATGOT 3180 

CT6ACAAACA AGAAGCTA6A GGAAGTAGAG CTGQA6CCAG AACTAGACXZT AGAOCTAGAC 3340 

TTGGAAGCAG AGGAGGACAA CCTGGCAACC ACCACACTGO QCTCOGCCCT CAGCCTACCA 3300 

OTTGOAACAC TTAATCX3GCC AC30TGGOAGC CAGAGCCTTT TAAGTCCATC ATCTGQATAC 3360 

ATGOCCATGA ACCAGGGTAA TCTTGGGGGG TCTTGOCAGG AGTCTGCAOT TTCTGGQAGC 3420 

AOTGAAOGGT GGCXXX30TCC AGTCTCTCTA CSVGCCAAT6C CAOGGGGATG CCTGGC31TCA 3480 

GAGTCATCAG AGGGGCAT6T AACA6GCTCT 6A66CTGAGC TCCA0GA6AA AGTGTGAATG 3540 

TGTAGAAGCC GGAGCAGGA6 CCGGAGOCCA CGGCCAOGCG GAGATAGCX3C CTACCATTCC 3600 

CAGOGCCACA GTCTGCTGAC TCCTGTTACC CCACTCTCCC CACCCXX3QTT AQAGGAAGAG 3660 

GATOTCAACG GTTATGTCAT GCXaGATACA CACCTCAAAG GTACTCCCTC CTCCCGGGAA 3720 

GGCACOCTTT CTTCA6TGG0 TCTCAGTTCr GTCCTGGQTA CTGAAGAA6A AGATGAAQAT 3780 

GAGGAOTATO AATACATGAA CGGGAGGAGA AGGCACAGTC CAGCTGATOC OCCTAGGCCA 3840 

AGTTCCCTTO AGGAGCTGGG TTATGAGTAC ATGGATGTGG GGTCAGACCT CAGTGCCTCT 3900 

CTGGGCAGCA CACAGAGTTG CCCACTCCAC OCTGTACCCA TCATGCCCAC TQCAGOCACA 3960 

ACTCCAGATG AA6ACTATGA ATATATGAAT OQGCAACGAG ATGGAGGTGG TCCTGGGGGT 4020 

GATTATGCAG CCATGG6GGC CT6CCCAGCA TCTGAGCAA6 G6TATGAAGA GATGAGAGCT 4080 

TTTCAGGG6C CTGGACATCA GGCCCCCCAT GTCCATTATG CCOGCCTAAA AACTCTACGT 4140 

AGCTTAGAGG CTACAGACTC TGCCTTTGAT AACCCTQATT ACTGGCATAG CAGGCTTTTC 4200 

CCCAAGGCTA ATGCCCAQAG AACGTAACTC CTGCTCCCTQ TGGCACTCAQ GGAGCATTTA 4260 

ATGGCAQCTA GTGCCTTTAO AGGOTACCGT CTTCTCCCTA TTCCCTCTCT CTCCCAGGTC 4320 

CCAGOCCCTT TTCOCCAGTC CCAGACAATT CCATTCAATC TTTGGAGGCT TTTAAACATT 4380 

TTGACACAAA ATTCTTATGG TATOTAOCCA GCTGTOCACT TTCITCXCTT TOCCAACCCC 4440 

AGOAAAGGTT TTCXTTTATTT TGIXTP G CTTT CCCAGTCCX3^ TTCCTCAGCT TCTTCACAGG 4500 

CACTCCTGQA GATATGAAGO ATTACTCTCC ATATCCCTTC CTCTCAGGCT CTTGACTACT 4560 

TGGAACTAG6 CTCTTATGTG TGCCTTTGTT TCCCATCAGA CTGTCAAGAA GAGGAAAGGG 4620 

AGGAAACCTA GCAGAGGAAA GTGTAATTTT GGTTTAT6AC TCITAACCGC CTAGAAAGAC 4680 

AQAAGCrTAA AATCTGTGAA OAAAGAGGTT AGGAGTAOAT ATTQATTACT ATCATAATTC 4740 

AGCACTTAAC TAT6AGCCAG GCATCATACT AAACTTCACC TACATTATCT CACTTAGTCC 4800 
TTTATCATCC TTAAAACAAT TCTGTGACAT ACATATTATC TCATTTTACA CAAAGGGAAG 4860 
TCGGGCATGG TGGCTCATGC CTGTAATCTC AGCACTTTGG GAOGCTGAGG CAGAAGOATT 4920 
ACCTQAGOCA AGGAOTTTQA GACCAOCTTA 6CCAACATA0 TAAGACCCCC ATCTC 4975 



Seq ID MO: C76 USA Sequence 
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nucleic Acid Accession «■ NN_00121S.l 
Coding sequence I 43.. 1422 ~ 

1 11 21 31 41 SI 

I I I I I I 

GCCCGTACAC ACCGTGTQCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 

AGCCCCTQGC TCCCTCTGTT GATCCCX3GCC CCTQCTCCAG GCCTCACTGT GCAACTGCTG 120 ' 

CTGTCACTGC TGCTTCrOAT GCCT6TCCAT CCCCAGAGGT TGCCCOSQAT 6CAGGA6GAT 180 

TCCCCOrTGG GA06A0GCTC TTCT60G6AA GATQAOCCAC TGGGG6A06A G6ATCTGCCC 240 

AGTGAAQAOG ATTCACCCAa AGAGGAGGAT CCACCOGGAG AGGAGOATCT ACCT66AGA0 300 

GAGGATCTAC CTGGAQAGGA GQATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 

TCCCTGAAGT TAGAGQATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 

AATAATGCXC :ACAGGaACAA AGAAGGGGAT GACCAGAGTC AT7GQCGCTA TGGAGGCQAC 480 

COGGCXITGGC CCCX;GGTGTC OCCAGCCTGC GGGGGOCGCT TCCAOTCOCC GGTGGATATC 540 

CGCCCCCAGC TCGCCOCCTT CTGCCOQQCC CT6CX3CCCCC TGGAACTCCT GGGCTTCCAG 600 

CTCCOGCCGC TCCCAGAACT GOGCCTGCXSC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 

CCTCCXGGQC TAGA6ATGGC TCTGGGTCCC 6GGCGG6AGT ACCGGGCTCT GCAGCTGCAT 720 

CTGCACTGGG GGGCTGCAGG T06TCCX3GOC T06GA6C3VCA CTGTGGAAGG CCACOGTTTC 780 

CCTGOOSAGA TCCACGTGGT TCAGCTCAGC AGOGCXTTTG OCAGAGTTGA GSAG6CCTTG 840 

GGGOGCCOSG GAGGCCTGGC GGTGTTGGCC GCCTTTCTGO AGGA6GGCCC G6AAGAAAAC 900 

AGTGCCTATG AGCAGTTGCT GTCTCOCTTQ GAAGAAATCG CTGAGGAAGG CTCRGAGACT 960 

CAGGTCCCAQ QACTGGACAT ATCTGCACTC CTGCXCTCTG ACTTCAGCCQ CTACTTCCAA 1020 

TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 

CAOACAGTQA TGCTQAOTQC TAAOCAGCTC CACACCCTCT CTQACACGCT OTGGQQAGCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCGGA GCQACGCAGC CTTTGAATGG GCGAOTGATT 1200 

GAGGCCTCCT TCCCTGCTGQ AGTGGACAGC AGTCCTCGGQ CTOCTGAGCC AGTCCAOCTG 1260 

AATTCCTGCC TGGCTGCTGQ TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGrC 1320 

ACX»GGGTOO GGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380 

GT6AGCTA0C GCCCAGCAGA GGTAGCGGAG ACTGGAGCCT AGM3GCTGGA TCTTGGAiSAA 1440 

TGTGAGAAGC CAGCCAGAGG CATCTGAGGG G6AGC0GGTA ACTGTCCTGT CCTGCTCATT 1500 

ATGCCACTTC CTTTTAACTO CCAAGAAATT TTTTAAAATA AATATTTATA AT 1552 

Seq ID NO: C77 DNA Sequence 

Nucleic Acid Accession #: NM_004207.1 

Coding sequences 63.. 1460 

1 11 21 31 41 51 

I I i I I i 

GGCX3A6AGGC GGGCTGAGGC GGGCCAGCGG 06GCAGGTGA GGCGGAACCA ACCCTCCTGG 60 

CCATGGGAGG GGCCGTGOTG GAGGA6GGCC CCACAGGOGT CAAGGCCCCT GACGGCXX3CT 120 

GGGGCTGGGC OGTGCTCTTC GGCTGTTTCG TCATCACTGQ CTTCTCCTAC GCCTTCCCCA 180 

AGGC03TCAQ TGTCTTCTTC AAGGAGCTCA TACAGGAGTT TGGGATCGGC TACAGCGACA 240 

CAGCCTGGAT CTCCTCCATC CTGCTGGCXa^ TGCTCTAOGG QACAGGTCCJG CTCTGCAQTa 300 

TGTGOGTGAA CCGCTTTGGC TGCCGGCCCQ TCATGCTTGT 6GGGGGTCTC TTTGCX3TCX3C 360 

TGGGCATGGT GGCTGCGTCC TTTTGCCGGA GCATCATCCA GGTCTACXTC ACXACTGGGG 420 

TCATCACGGG GTTGGGTTTG GCACTCAACT TCCAGCCCTC GCTCATCATG CTGAACCGCT 480 

ACTTCAGCAA GCGGCXKXCC ATGGCCAACXS GGCTGGCGGC AGCAGGTAGC CCTGTCTTCC 540 

TGTGTGCXXrr GAGCCXX3CTG GGGCAGCTGC TGC3VGGACCG CTACGGCTGG CGGGGCGGCT 600 

TCCTCATCCT GGGCGGCCTG CTGCTCAACT GCTGCGTGTG TGCCGCACTC ATGAGQCXCC 660 

TGGTGGTCAC GGCCX^GCCQ GGCTCGGQQC CJOCCGOQACC CTCCCQQCGC CTGCTAGACC 720 

TQAGCX3TCTT CCGGGACCGC GGCTTTGTGC TTTACGCOGT GGCCGCCTCG GTCATX3GTGC 780 

TGGGGCTCTT CGTCCCGCCC QTGTTCGTGO TQAGCTACGC CAAGGACCTQ GGCGTGCXXX3 840 

ACACCAAGGC QGCCTTCCTG CTCACCATCC TGGGCTTCAT TGACATCTTC GCGCGGCCGG 900 

CGOOGGGCTT OGTGGOGGGG CTTGGGAAGG TGOGGCCCTA CTC06TCTAC CTCTTCAGCT 960 

TCTCCATGTT CTTCAACGGC CTCGGGGACC TGGOGGGCTC TA0660GGGC GACTACGGCG 1020 

6CCT0GTGGT CTTCTGCATC TTCTTTGGCA TCTCCTA06G CATGGTGGGG GCCCTGCAGT 1080 

TCGAGGTGCT CATGGCCATC GTGGGCACCC ACSWWSTTCTC CACTGCCATT GGCXTTGOTGC 1140 

TGCTGATGGA GGCGGTGGCC GTCCTCGTCG GGCCCCCTTC GGGAGGCAAA CTCCTGGATQ 1200 

CGACCCACGT CTACATGTAC OTOTTCATCC TGGCGQGGGC OaAGOTQCTC ACCTCCTCCC 1260 

TGATTTTGCT GCTGGGCAAC TTCTTCTGCA TTAGGAAGAA GCCCAAAGAG CCACAGCCTG 1320 

AGGTGGCGGC CGCGGAGGAG GAGAAGCTCC ACAAGCCTCC T6CAGACTCX3 GGG6TGGACT 1380 

TGOGGGAGGT GGAGCATTTC CTGAAGGCTG AGCCTGAGAA AAAOGGGGAG GTGGTTCACA 1440 

CCCCGGAAAC AAGTGTCTGA GTGGCTGGGC GGGGCGGGCA QGCACAGGGA GGAGOTACAG 1500 

AAGCCGGCAA CGCTTGCTAT TTATTTTAC3V AACTGGACTG GCTCAGGCAG GGCCACGGCT 1560 

GGGCTCCAOC TGCCGGCCCA GCX3GATCGTC GCCCGATCAG TGTTTT G AGG GGGAAGGTGG 1620 

CGGGGTGGGA ACCGTGTCAT TCCAGAGTGG ATCTGCX3GTG AAGCCAAGCC GGAAGGTTAC 1680 

AAGGCATCCT CACCAGGGGC CCCGCCTGCT 6CTCCCAGGT GGCCTG0G6C CACTGCTATG 1740 

CTCAAGGACC TGGAAACCCA TGCTT06AGA CAACGTGACT TTAATGGGAG GGTGGGTGGG 1800 

CGGCAGACAO GCTGGCAGGG CAGGTGCTGC GTGGGGCCCT CTCCAGCCCQ TCCTACCCTQ 1860 

GGCTCACATG GGGCCTGTQC CCACCCCTCT TGAGTGTCTT GGGGACAGCT CTTTCCACCC 1920 

CTGGAAGATG GAAATAAACC TGCGTGTGG6 TGGAGTGTTC T06TGCXXSAA TTCAAAAAGC 1980 

TT 1982 

Geq ID NO: C78 DNA Sequence 

Nucleic Acid Accession #s KM_000358.1 

Coding sequence* 48.. 2099 



ofx 3. 11 21 31 41 51 

80 I 1 I 1 1 I 

QCTTGCCOGT CGGTCGCTAG CTOGCTOGGT GOGCGTCGTC CCOCTCCATQ GOGCTCTTOG 60 

TGCGGCTGCT GGCTCTCGCC CTGGCTCTGG CCCTGGOCCC C6C0G0GACC CTGGCGGGTC 120 

CCGCCAAGTC GCCCTACCAG CTGGTGCTGC AGCACAGCAG GCTC06GGGC CGCCAGCAOG 180 

GCCOCAAOQT OTGTGCTGTG CAGAAG8TTA TTGGCACTAA TAGGAAOTAC TTCACCAACT 240 
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Seq 10 NOt C79 DNA Sequence 

Nucleic Acid Accession 4t llM_00653fi.2 

ceding sequence: 109.. 2940 " 



GCAAGCAGTG CTACCAAAOG AAAATCTGTG GCAAATCAAC AGTCATCAGC TACQAQTGCT 300 

G TCCTO GATA TGAAAAGGTC CCTGGGQAGA AOGGCTOTCC AGCAGCCCTA CCACTCTCAA 360 

ACCTTTACGA GACCCTGGGA GTCGTTGGAT CXZACX^^CCAC TCAGCTOTAC ACGGACCGCA 420 

5CGGAGAAGCT GAGGCCTGAO ATOGAGGGGC CCGGCAGCTT CACCATCTTC GCCCCTA6CA 4 BO 

ACGAGGCCTO GGCCTCCTTG CCAGCTGAAG TGCTGGACTC CCTGGTCAGC AATGTCAACA 540 

TTGAGCTGCT CAATGCCCTC CGCTACCATA TGGTGGGCAG GCGAGTCCTG ACTGATCAGC 600 

TGAAACAOQG CATQACCCTC ACCTCTATGT ACCAGAATTC CAACATCCAG ATCCACCACT 660 

ATCCTAATGO GATTGTAACT GTGAACTGTG CCCGGCTCCT GAAAGCCGAC C3VCCATGCAA 720 

1 n CCAACX5GGGT GGTGCACCTC ATOGATAAOG TCATCTCXavC CATCACCAAC AACATCCAGC 7B0 

lU AGATCATTGA GATOSAGGAC ACCTTTGAGA CCCTTCGG6C TGCTGTGGCT GCATCAGGGC B40 

TCAACACGAT GCTTGAAGOT AACGGCCAGT ACAC3GCTTTT GGCOCCQACC AATGAGQCCT 900 

TC6A6AAGAT CCCTAGTGAG ACTTTGAACC OTATCCTGGG CGACCCAGAA GCXXTTOAGAG 960 

ACCTGCTGAA CAACCACATC TTGAAGTCAG CTATGTGTGC TGAAGCCATC GTTGCGGGGC 1020 

^ TGTCTGTAGA GACCCTGGAG GGCACGACAC TGQAGGTGGQ CIGCAOCGGG GACATGCTGA 1080 

1 J CTATCAACGG QAAOGOQATC ATCTCCAATA AAOACATCCT AGCXIACCAAC GGGGTGATCC 1140 

ACTACATTGA TGAGCTACTC ATCCCAQACT CAGCCAAGAC ACTATTTGAA TTGGCTGCAa 1200 

AGTCTGATGT OTCCACAGCC ATTGACCTTT TCAGACAAGC CGGCCTCGQC AATCATCTCT 1260 

CTGGAAGTGA GCGGTTGACC CTCCTGGCTC CCCTGAATTC TGTATTCAAA GATGGAACCC X320 

CTCCAATTGA TGCCCATACA AQC3AATTT0C TTOOOAACCA CATAATTAAA GACCAGCTGG 1380 

CCTCTAAGTA TCTQTACCAT GGACAOACCC TOOAAACTCT 6GG0GGCAAA AAACTOAOAG 1440 

TTTTTOTTTA TCGTAATAGC CTCTQCATTO AGAACAGCTG CATOGOQGCC CACGACAAGA 1500 

GGGGGAGGTA CGGGACCXTG TTCACGATGG ACCGGGTGCT GACCCCCCCA ATGGGGACIQ 1560 

TCATGGATGT CCTGAAGGGA GACAATCGCT TTAGCATGCT GGTAGCTGCX: ATCCAGTCTO 1620 

CAGGACTGAC GGAGACCCTC AACOGGGAAO GAGTCTACAC AOTCTrTSCT CCCACAAATG 1680 

Z J AAGCCTTCCG AGCCXTTGCCA CCAAGAGAAC OQAGCAGACT CTTGGGAGAT GCCAAGGAAC 1740 

TTGCCAACAT CCTGAAATAC CACATTGGTG ATGAAATCCT GGTTAOaSGA GGCATCGGGG 1800 

CCCTGGTGOG GCTAAAGTCT CTCCAAGGTG ACAAGCTGGA AGTCAGCTTG AAAAACAATG 1860 

TGGTGAOTGT CAACAAGGAG CCTGTTGCCG AGCCTGACAT CATG6CCACA AATGGCG7GG 1920 

„ TCCATGTCAT CACXaATOTT CTGCAGOCTC CAGCCAAC^G ACCTCAGGAA AGAGGGGATO 1980 

0\J AACTTGCAGA CTCTGCGCTT GAGATCTTCA AACAAGCATC AGOGTrTTCC AGGGCTTCXr 2040 

AGAGGTCTOT GCGACTAGCC CCTGTCTATC AAAAGTTATT AGAQAG6AT0 AAGCATTAGC 2100 

TTGAAGCACT ACAGGAGGAA TGCACCACGO CAGCTCTCCG CCAATTTCTC TCAGATTTCX: 2160 

ACAGA6ACTG TTTGAATGTT TTCAAAACCA AGTATCACAC TTTAATGTAC ATGGGCOGCA 2220 

CCAT AATGAG ATGT6AGCCT TGTGCAT6TG GGGGAGGAG6 GAGAGAGATG TACTTTTTAA 2280 

DJ ATCATGTTCC CCCTAAACAT GGCTGTTAAC CCACTOCATG CAQAAACTTG GATGTOVCTG 2340 

CCTGACATTC ACTTCXaVGAG AGGACCTATC CCAAATGTGG AATTGACTGC CTATQCCAAG 2400 

TCXCTGGAAA AGGAGCTTCA GTATTGTGGG GCTCATAAAA CATGAATCAA GCAATCCAGC 2460 

CTCATGGGAA GTCCTGGCAC AGTTTTTGTA AAGCCCTTGC AOWSCTGGAQ AAATOQCATC 2520 

A(\ ATT ATAAOC T ATGAGTTQAA ATOTTCTGTC AAATGTOTCT CACATCTACA CQTGGCTT60 2580 

HU AGGCTTTTAT GGGGCCCTGT CCAGGTAGAA AAGAAATGGT ATOTAGASCT TAGATTTOCC 2640 

TATTGTGACA GAGCCATQOT GTGTTTGTAA TAATAAAACC AAA6AAACAT A 2691 



1 11 21 31 41 51 

«:0 I 1 I I I 1 

ACCTAAAACC TTGCAAOTTC AGGAAGAAAC CATCTGCATC C3VTATT6AAA ACCTGACACA 60 

ATGTATGCAO CAGGCTCAGT GTGAQTGAAC TQQAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG OTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAQACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCT CAGG T ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAO CTTCATTTTA CCTATTTAAT 0CTACC3UIGA GAAGAGTATT TTTC3M3AAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTQACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG 6TGTGGAAAA QA6GGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATOATAACTT AACAGCTGGC TACX3GATCAC GA6GCC98A0T OTTTOTOCAT 600 

OU OAATOOQCXX: ACCTCOGTTQ GGGTGTOTTC OATQAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGOCATTTTT 720 

GT0TGTX3AAA AAGGTCCTTG CCCCCAAGAA AACTOTATTA TTA6TAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AOTTTATCTT CTGTG G TTGA ATTTTGTAAT OCAAGTACCC ACAACCAAGA AGCAOCAAAC 900 

CTACAGAACC A OATQ TGCAG CCTCAGAAOT OCATGGGATO TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGQACT GAGCTTCCAC CTCCTCCCAC ATTCTCQCTT 1020 

GTACAGQCTG GTGACAAAGT GGTCTQTTTA GTGCTGGATG TGTCCAGCAA GATGGCAQAG 1080 

GCTGACAGAC TCCT TCAACT ACAACAAGCC GCAGAATTTT ATTTGAT6CA aATTGTTQAA 1140 

ATTCATACCT TCGIQGGCAT TCCCAOTTTC GACAGCAAAO GAGAOATCAG AGCOCAGCTA 1200 

/U CACCAAATTA ACA6C3KATGA TGATCXSAAAO TTGCTGGTTT CATATCTGCC C3VCCACTGTA 1260 

TCAGCTAAAA CAGACATCAQ CATTTQTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAA CTGA ATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TQACCAGCGQ AGATGATAAG 1380 

Ci'TCrmGCA ATTGCTTACC CACTGTGCTC AGCAOTGGTT CAACAATTCA CTCCATTGCC 1440 

«c CT GGGnxaT CTGCAGCCCC AAATCTGQAS GAATTATCAC GTCTTACAOG AGGTTTAAAG 1500 

iJ TTCJJL'AtJn'C CAGATATATC AAACTCCAAT AGCATGATT6 ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTO OAGACATTTT CCAGCAACAT ATTCAGCTTO AAA6TACAGQ TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAAOSACACT 1680 

ATGTTTCTAQ TTACGTGGCA GGCCAGTGGT CCTCCTGAOA TTATATTATT TCATCCTGAT 1740 

OA GQACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTGGGAC AGCTAGTCTT 1800 

OU TGGATTCCAG GAACAGCTAA OCCTQQGCAC TGGACTTACA OCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAQ CCCTGAAAGT GACAGTQACC TCTCGCOCCT CCAACTCAGC TGTQCCCCCA 1920 

GCCACTGTGG AAGCXTTTTOT GQAAAGAGAC A6CCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATO TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAOGCAGAOA CTQGAQATCC TGTTACaCTQ A6ACTCCTT6 ATGATG6A0C AGGTOCTOAT 2100 
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GTT ATAAAA A ATGATGOAAT TTACTCOAGO TATTTTTTCT CCTTTGCTGC AAATCOTAQA 2160 

TATASCTTGA AAGT0CATX5T CAATCACTCT CCCAGCATAA BCACCCCAGC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAQOT TACACACCAA ACGGTAATAT TCAGATGAAT 22 BO 

^ AATCAGTAGG CAGAAATCAO 6AG6AG06AA AGIGGQGCTT TAGOOGAGTC 2340 

J AGCTCAGOAG GCTCCTTTTC AGTCCrGGOA GTTCCAGCTC GCCCCCACCC TOATOIOTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT QACCCTATCT 2460 

TCOACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATOA AATAAGAATQ 252 0 

AGTAAAAGTC TACAQAATAT CCAAOATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2S80 

AA6CGAAATC CTCAGCAAGC TQGCfcTCAGG GAGATATTTA OtnrCrCA 3640 

IV ACGAATGGAC CTGAAC3VTCA QCCAAATGGA 6AAACACATG AAAOCCACAG AATTTATGTT 2700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAOTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCT CTGTTT A TTCCCXTCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

G6AGTTTTAA CAGCAATGGG TTTGATAGOA ATCATTTGCC TTATTATAGT TGTOACACAT 2880 

CATACTTTAA GCaCGAAAAA GAGAGCAGAC AAGAAM3AGA ATOOAACAAA ATTATTATAA 2940 

1 J ATAAATATCC AAAGT6TCTT CCTrCTTAOA TATAAGACCC ATGGCXTTCG ACTACAAAAA 3000 

GATACTAACA A AGTCA AATT AACSVTCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TQQTAQATCA ACAATTCTTT TTGGGQGTAG ATTAGAAAAC 3120 

CCTTACACTT TGQCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGOAA GGQTAAAGTC GOACCAOTGT CAAGGAAAGT TTOTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA A GCAO AGAAA AGGAGGGTAQ GTCTGCATTA TAACTGTCTG TOTGAAOCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TOTGCAGTAC AGGTrcCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATp" TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

^- TTTCACTOTA AGAOQTAACC TTTAACAATA TOGGTATTAC CTTT G TCTCf TCATACCGGT 3S40 

TTTATGACAA AGGTCTATTO AATTTATTTG INTQTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTICTAAGTT TATTOCCTTG OQTTATTATO GAATGATAGT TATAOCCCXW TATAATGCCT 3660 

TACCTAOGAA A ^^^^ 

Seq ID NO: C80 DNA Sequence 
0\) Nucleic Acid Accession fti Eos sequence 
Coding sequence i 1..1413 

1 11 21 31 41 51 

I t t I I I 

ATGAAOTTTC TTCTAATACT QCTOCTGCAG GCCACTGCTT CTOGASCTCT TCCCCTGAAC 60 

AGCTCT ACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120 

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTCQ AAACTTAATQ 180 

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGOTCTGA AAGTGACCGG GCAACTGGAC 240 

ACATCTAqCC TGGAGATOAT QCAOQCACCT COATOTGOAO TCCCCGATGT CCATCATTTC 300 

AGGGAAATOC CAGGGGGGCC CGTATGQAGG AAACATTATA TCACCTAC3W3 AATCAATAAT 360 

TACACACCTG ACATOAACCG TGAGGATGTT GACTAC6CAA TCmSAAAGC TTTCCAAGTA 420 

TGGAGTAATG TTACCCCCTT QAAATTCAGC AAGATTAACA CAGGCATGQC TGACATTTTG 480 

GTGGTT-i-iTG CCCGTGG AGC TCATGGAOAC TTCCATGCTT TTGATGGCAA AQGTGGAATC S40 

- - CTAGCCCATG CTTTTGGACC TGOATCTGGC ATTGQAGGGG ATGCACATTT CGATGAGGAC 600 

**J GAATTCTG6A CTACACATTC AGOAGGCACA AACTTGTTCC TCACTQCTGT TCAOSAGATT 660 

GGCCATTCCT TAGGTCTTGQ CCATTCTAGT GATCCAAAGQ CCGTAATGTT CCCCACCTAC 720 

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATQ ACATACGTGG CATTCAGTCC 780 

CTGTATGGAG ACC CAAAAG A GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840 

CTCTGTOACC CCAATTTQAO TTTTOATGCT GTCACTACCQ TGGOAAATAA GATCTTTTTC 900 

•>V TTCAAAGACA G0TTCTTCT6 GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TOTTAATTTA 960 

ATTTCTTCCT TATGG CCAAC CTT6CCATCT GGCATTGAAG CTCCTTATGA AATTGAAGCC 1020 

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAQACCA 1080 

GAGCCAAATT ATCCCA AGAO CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140 

GATGCAGCTO TTTTTAACOC AOQTTTTTAT AGGACCTACT TCTTTQTAGA TAACCAGTAT 1200 

TGOAGOTATO ATGAAAQGAO ACAGATQATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAO GAATOSGGCC TAAAATTQAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCC AAGQATCTAA CCAATTTGAA TATGACITCC TACTCCAACO TATCACX»AA 1380 

ACACTGAAAA GCAATAGCTG amOGTTGT T6A 1413 

60 Seq ZD NOi C81 DNA Sequence 

Nucleic Acid Accession it Eos sequence 
Coding sequence t 1..1413 

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60 

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120 

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180 

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTCGAC 240 

/V ACATCTACCC TGGAGATGAT GCAOaCACCT OGATGTGGAG TCCCCGATGT CCATCATTTC 300 

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360 

TACACACCTG ACATGAACCG TGAGGATGTT GACTAOGCAA TCCGGAAAGC TTTCCAAGTA 420 

TGGAGTAATG T TACCC CCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480 

OTGGTTTTTQ COOQTG GAOC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGQAATC 540 

iJ CTAGCCCATG CTTTTOGACC TGGATCTQ6C ATTGQAGGGG ATGCACATTT CGATGAGGAC 600 

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGCCATT 660 

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720 

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780 

80 ™TATQaAQ ACCCAAAAGA GAAC CAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840 

OU CTCTGTGACC CCAATTTQAG TTTTGATGCT GTCACTACCQ TGGGAAATAA GATCTTTTTC 900 

TTCAAAGACA GGTTCTTCTQ GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TOTTAATTTA 960 

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTOAAOCC 1020 

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 1080 

GAGCCAAATT ATCCCAAGAO CATACATTCT ITlGO T m ' C CTAACTTTGT GAAAAAAATT 1140 
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5 

10 
15 
20 
25 



35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



GATGCAGCTO TTTTtAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200 

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAO GAATGGGGCC TAAAATTGAT GCAOTCTTCTr ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCC AAGGATCTAA CCAATTTGAA TAT6ACTTCC TACTCCIUVGG TATGACCAAA 1380 

ACACTGAAAA 6CAATAGCTG OTrrGGTTGT TGA 1413 

Seq ID NO: C82 DNA Sequence 

Nucleic Acid Accession iiM_006952.i 

Coding sequence i 11.. 793 " 

1 11 21 31 41 51 

I I 1 I i I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAG66CC TGCTGATTTT 60 

TG6AAATGTG ATTATTGGTT GTTGOGGCAT TGCCCTGACT GGGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAAGGAT6 ACATCTATGG 160 

OQCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATOAAGTCCA GCAGGAAAAT TCTTCTGGCQ TATTTCATTC TQATOTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACA6CAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAA6C AQATGCTAGA GAGOTACCAA AACAACAGGC CTGCAAACAA 420 

TGATGACCAG TG6AAAAACA ATGGAGTCAC CAAAACCTGG GACAG6CTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCOGGAC 540 

TGAGAATAAT QATGCTGACT ATCCCTG6CC TCX3TCAATGC TQTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGGGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTQCTATGAA CTGATCTCTG GTCCAATOAA CCOACAOGCC TGGGGGOTTG CCTGGTTTGO 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 

AATT6AATAT TAAQAA 796 



Seq ZD NO: C83 DNA Sequence 
Nucleic Acid Accession #t MM.001793.2 
3U Coding sequences 71.. 2560 



1 11 21 31 41 51 

I 1 I I I 1 

AAAGGG6CAA GAGCTGA0C6 6AACACCQGC C0GC06TG6C G6CAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TGCAGGTTTG 120 

CTGGCTGCAG TGOGCGGCCT CCGAGCOGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 

CTTGGAGGGG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCA6 CTCT6TTTAG CACTGATAAT 6ATGACTTCA CTGTGCGGAA 300 

TOGCGAGACA GTGCAGGAAA GAAGGTCACT 6AA0GAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTAC6AA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

T6AAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

AGACACCAAG ATTTTCTACA GCATCACGGO GCCGGGGQCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCA06C T6TGTCA6A0 AATOGTGCCT CAGTGGAGGA 660 

CCCCATGAAC ATCTCCATCA TCQTQACOGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTA6AGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGO ACCCACACQA CCTCATGTTC AOCATTCACC GGAGCACAGG 900 

CACCATCAOC OTCATCTCCA GTGGCCTGOA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATQACA AIGCTCCCAT OTTTQACOCC CAGAAGTAOQ AGQCCCATGT 1080 

GCCTGAGAAT GCAGTGGGCC ATGAGQTGCA GAG6CTGAC6 GTCACTGATC TGGACGCCCC 1140 

CAACTCAGCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGAC6A06 G6GACCATTT 1200 

TACCATCAOC ACCCACCCTO AGAGCAACCA GGGCATCCTG ACIUVCCAOGA AGGGrTTQOA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCmiXST 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGOTCCAC GTGGAGGATG TGAATGAGGC 1380 

ACCTGTGTTT GTCCCACCCT CCAAAGTCGT T6AGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT OTCTACACTG CAGAAGACCC TGACAAGOAG AATCAAAAGA TCAGCTACCG 1500 

CATCCTGA6A GACCCAGCAO GGTGGCTAGC CATGGAGCCA GACASTGGGC AGQTCACAGC 1560 

TGT6GGCACC CTCOACCGTG AGGATQAGCA GTTT0T6A0G AACAACATCT ATGAAQTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

ACTGATTQAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC OGTCAGATCA CCATCTGCAA 1740 

CCAAAGCCCT GTGCGGCAGO TGCTGAACAT CA06GACAAG GACCTGTCTC CCCACACCTC 1800 

CCCTTTCCA6 GCGCAGCTCA CAGATGACTC AGACRTCTAC TGGACGGCAG AGGTC3VACGA 1860 

GGAAGGTGAC ACAaTGGTCT TGTCCCTGAA GAAGTTCCTG AAOCAGGATA CATATGACGT 1920 

GCACCTTTCr CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTQATCA GGGCCACTGT 1980 

GTGCGACTGC CATGaCCATQ TCGAAACCTG CCCTQGACCC TGGAAGOGAa GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCT6C TTTTGTTGGT 2100 

GAGAAAGAAG CGQAAGATCA AGGAGCCCCT CCTACTCCGA GAAOATGACA CCCGTQApUl 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGA6GAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCQA GGTCTGGAGG CCAGGC06GA GGTGGTTCTC CGCAAtGAOG TGGCACCAAC 2280 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAQATGAAA TOGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 

CTTGOTOTTC GACTATGAGG GCAGCGGCTC CQACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC 6ACCAAGACC AAGATTAOGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGQA GGACGACTAO GCGGCCTGCC TGCAGGGCTG 2580 

GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

0ACTTCGGA6 CTTGTCAGOA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

AGGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGOATGOAGG AATGTGGGCA QTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAOG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CQTAAAATGC TCAACCCTGT QTCCTGGGCC TGGQCCTQCT GTGACTGACC 2880 

TACAGTCGAC TTTCTCTCTG QAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 

TTTTTTTAAT OCTATCTTCA AAACGTTAGA QAAAGTTCTT CAAAAOTQCA GCCCAGAGCT 3000 

GCTOGOCCCA CTGGCCGTCC TOCATTTCTQ GTTTCCAGAC CCCAATGCCT CCCATTCGQA 3060 
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TGGATCTCTG OGTTTTTATA CTGAGTGTGC CTAGGTTGCC OCTTATTTTT TATTTTCCCT 3120 

GrrGOGTTGC TATAGATC5AA GGGTGAGGAC AATOHCTAT ATGTACTAGA ACTTTTTTAT 31B0 

TAAAGAAACT TTTCCCAGAA AAAAA 3205 

S Seq ID KOt C84 DNA Sequence 

Nucleic Acid Accession fit NM 00S629.1 
Coding sequencet 639.. 2546 

tn ^ " 21 31 41 51 

TAGTCGGAGC GAGGTOGCGA GTCGCTGAGC CCGCOGCGGC CCCGAOAGCQ GCTGC3«3CCG 60 

CCGCXIXSCOGQ GAAGGAQAGG GCGAGGCGCG CCCQAGCCGC CX3CCGCCGCC GCCACOSCCG 120 

CCGCOGCCAC CACCGCCACC GGAGTCG0G6 GCXAGCOGGG CA6CCTCCGC GGGCCCCGGC 180 

1 < CGGGGCGQGG GGCX30GGGCC ACRGGCCCCT GCTCGGGCOO TOOTTTGCAO ACCX3CGGGCG 240 

ID COGATOTOSC COQOGCCCOG TTAGGATGAG TCT0GG0TC6 GGOGAGGAGC CX5CCGCAGCC 300 

GCCX3CC0CCC GAGCCGCGGG CAGGAGCCTC GGQAQCOGCC GCCGCCGCCG CCGCCGCCCG 360 

GCCG GGCC CC GACGCCGCCC GCGCGCCCCC GGGCCCCCGA CACACATGAG ATTCTTCAGG 420 

CTCACTTTCA AGTGCTTCGT GGACTGCTTC TGACTGCQCC GCCOGCGCCC OGCACCCCGC 480 

OQTCOGCXXB OOGCCOOGTC CCGOSGGCOG GCCGCXXXXX: GGCXTCCGGC CX3GCCCGCGC 540 

ZU CCTOGGGGCC CTCCCCGGTG C06COGOTGC CCaXXXXTTG ACOGCCGCCC CCOSTGAGGC 600 

GCCGOGACXX: CGGCCXXX3CC GTGCGGCCCG CCGQOOCCAT GGCGAAGAAO AG0GCCGA6A 660 

ACGGCATCTA TAGCGTGTCC GGCX3ACGAGA AGAAGGQCCC CCTCATCGCG CCCGGGCCCG 720 

ACGGG GCCCC GGCCAAGGGC GAOGGCCCOG TGGOCCTOOQ OACACCCGGC GGCCGCCTGG 780 

rye CCOTGOCGOC GOGCQAGACC TGGAOQCGCC AGATGGACTT CATCATGTCX3 TGC3GTGGGCT 840 

ZD TOGC0GT6GG CTTGGGCAAC 6TGTGG0GCT TCCCCTACCT OTGCTACAAG AACX3GCGGAG 900 

GTGTQTTCCT TATTCCCTAC GTCCTGATCG CCCTGGTTGG AGGAATCCCC ATTTTCTTCT 960 

TAGAGATCTC GCTGGGCCAO TTCATGAAQG CCGQCAOCAT CAATGTCTGG AACATCTGTC 1020 

CCCTGTTCAA A GGCCTGGG C TAC30CCTCCA TGGTGATOGT CTTCTACTQC AACACCTACT 1080 

_ AGATCATGGT GCTGGGCTGG GOCTTCTATT ACCTOGTCAA GTCCTTTACC ACCACX3CTGC 1140 

J\) CCTGGGCCAC ATGTGOCCAC ACCTGGAACA CTCCCX3ACP0 COTGQAQATC TTCCGCCATG 1200 

AAGACTGTQC CAATGCCAGC CTOGCCAACC TCACCTGTGA CCAGCTTGCT GACC33CCGGT 1260 

CCCCTGTCAT CXyunrCTGG GAGAACAAAQ TCTTGAGGCT GTCTGGQGGA CTGGA6GTGC 1320 

CAGGGGCCCT CAACTGGGAG GTGACCCTTT GTCTGCTGGC CTGCTGGGTG CTGGTCTACT 1380 

- . TCrGTGTCTG GAAGG6GGTC AAATOCAOGQ GAAAGATOGT GTACTTCACT GCTACATTCC 1440 

J J CCTACX3TGGT CCTOGTOSTG CTGCTGGT6C GTGGAGTGCT GCTGCCTGGC OCCCTGGATG 1500 

GCATCATTTA CTATCTCAAQ CCTGACTGGT CAAAGCTGGG GTCCCCTCAG GTGTGGATAO 1560 

ATGCGGGGAC CCAGATTTTC TTTTCTTACG CXZATTGGCCT GOGOOCCCTC ACAGCCCTGG 1620 

GCAGCTACAA CCX3CTTCAAC AACAACTGCT ACAAGGACGC CATCATCCTG GCTCTCS^TCA 1680 

ACAOTGGGAC CAGCTTCTTT GCTG6CTTC3a TGGTCTTCTC CATCCTGGGC TTCATGGCTG 1740 

4U CAQAGCAGGG OSTGCACATC TCCAAGGTGO CAGAGTCAGG GCCGGQCCTG GCCTTCATCO 1800 

CCTACCCGCG GGCTX3TCACG CTGATGCCAO TGGCCCCACT CTGGGCTGCC CTGrrCTTCT 1860 

TCATQCTGTT GCTGCTTGGT CTCGACAGCC AGTTTGTAGG TGTGQAGGGC TTCATCACCG 1920 

GCCTCCTCGA CCTCCTCCOQ GCCTCCTACT ACTTCCGTTT CCAAAGGGAG ATCTCTGTGG 1980 

yfC C OCTCTG TTG TGCCCTCTGC TTTGTCATCO ATCTCTCCAT OGTOACTGAT GGCGQGATGT 2040 

4D AOOT CTTCCA GCTQTTTGAC TACTACTOGG CCAGCGGCAC CACXXTGCTC TGGCAGGCCT 2100 

TTTGGGAGTG CGTGGTGGTG GCCTGGQTGT ACGGAGCTQA COGCTTCATO GACGACATTQ 2160 

CCTOTATGAT CGGGTACCGA CCTTGCCCCT GGATGAAATG GTGCTGGTCC TTCTTCAOCC 2220 

CX3CTGQTCTG CATGGGCATC TTCATCTTCA ACGTTGTOTA CTAOQAGCCG CTGCrTCTACA 2280 

ACAACACCTA CGTGTACCCG TGGTGGGGTG AGGCCATGGO CTGGGCCTTC GCCCTGTCCT 2340 

DU CCATGCTGTG OQTGCOGCTG CACCTCCTGQ GCTGCCTCCT CAGQGCCAAG GGCACCATGG 2400 

CTGAGCXSCTG GCAGCACCTG ACCCAGCCCA TCTGGGGCCT CX3VCCACTTG GAGTACGQAO 2460 

CTCAGGACGC AGATGTCAGQ GGCCTGACCA CCCTQACCCC AGTX5TCXMAG AGCAGCAAGG 2520 

TOGTOGTGGT GGAGAGTGTC ATGTGACAAC TC3U3CTCACA TCACCAGCTC ACCTCTGGTA 2580 

CC GC CATAGCAG CCCCTGCTTC AGCCCCACOG CACCCCTCCA GGGGGCCTGC CTTTCCCTQA 2640 

DD CACTTTTGGG GTCTGCCTGG GGGAGGAGGG GAGAAAGCAC CATGAGTGCT CACTAAAACA 2700 

ACTTTTTCCA TTTTTAATAA AACGCCAAAA ATATCACAAC CCACXSUUWU^ TAQATGCCTC 2760 

TCCCCCTCCA GCCCTAGCCG AGCTGGTCCT AGGCCC060C TAGTGCCCCA CCOCCACCCA 2620 

CAGTOCTQCA CTCCTCCTGC CXXTTGCCACG CCCACCCCCT GCCCACCTCT CXAGGCTCTQ 2880 

CTCTGCAGCA CACCC6TGGQ TGACCCCTCA CCCCAGAAGC AGCAGTGGCA GCTTGGGAAA 2940 

OU TGTGAGGAAG GGAAGGAGGG AGAGACGGGA GGGAGGAGAG AGAGGAOAAO GGAGGCAGGG 3000 

GAGGGGCAGC AGAACCAAGG CAAATATTTC AGCTGGGCTA TACCOCTCTC CCCATCCCTG 3060 

TTATAGAAGC TTAGAGAGCC AGCCAGCAAT GGAACCTTCT QOTTCCTGOG CCAATC6CCA 3120 

CCAGTATCAA TTOTOTGAGC TTGGGTGCGA GTGCACGCGT GCGTGAGTAC GGAGAGTATA 3180 

TATAQ ATCTC TATCT CTTAG CAAAGGTOAA TGCC3VGATGT AAATGQOQCC TCTGGGCAAA 3240 

OJ GGAOOCTTOT ATTTTGCACA TTTTATAAAA ACTTQAGAGA ATGAGATTTC TGCTTQTATA 3300 

TTTCTAAAAA GAGGAAGGAG COCAAACCAT CCTCTCCTTA CCACTCCCAT GCCTGTGAGC 3360 

CCTACCTTAC CCCTCTGCCC CTAGCCAAGG AGTGTGAATT TATAGATCTA ACTTTCATAO 3420 

GCAAAACAAA AGCTTCXaAGC TOTTGCGTGT GTGAQTCTGT TGTGTGGATG TGCGTGTGTG 3480 

GTCCCCAOCC CCAGACTGQA TTGGAAAAGT GCATGGTC3GG GGCCTOGGGG CTGTCCCCAC 3540 

/U GCTGTCCCTT TGCCACAAGT CTGTGGGGCA AGAGGCTGCA ATATTCOGTC CTGGGTGTCT 3600 

GGGCTGCTAA CCTGGCCTGC TCAGQCTTCC GACOCTGTGC GGQQCACACC 00CA6GAA6G 3660 

GACCCTGGAC ACGGCTCCCA CGTCCAGGCT TAAGGTOGAT GCACTTCCOO CACCTCCAGT 3720 

CTTCTGTQTA GCAGCTTTAA CCCACOTTTO TCTGTCAOGT CCAGTCCOGA GAC3GGCTGAG 3780 

__ T6ACCCCAAG AAAGQCTTCC COGACACCCA GACAGAGGCT GCAGGGCTGG GGCTGGGTGA 3840 

iJ GGGTGGOGGG CCTOCGGGQA CATTCTACTO T6CTAAAAA6 CCACTGCA6A CATAGCAATA 3900 

AAAACATGTC ATTTTCC 3917 

Seq ID NOi CSS DNA Sequence 
Nucleic Acid Accession fit NM_006516.1 
oU Coding sequencet 180..1658 

1 11 21 31 41 51 

I I 1 i I I 

TAGTOQCGGG TCCCOGAGTG AGCA06CCA6 GGAGCAGQAO ACCAAAOOAC GGGQGTCGGA 60 
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GTCAGAGTC3G CAGTGGGAOT CCCCGQACC3G GAGCACX»GC CTGAGCGGGA GAGCX3CCGCT 120 

CGCACGCCOG TOGCCACCCG CGTACCCGGC GCAGCCAGAG CCACCAGCGC AGCGCTGCCA 180 

TGGAGCCCAG CAQCAAQAAG CTGACGOGTC GCCTCATGCT OGCTGTGGGA GGAGOVGTGC 240 

TtQGCTCCCT GCAGTTTGGC TACAAOVCTG GACPTOITCAA TGCCCCCCAG AAGGTGATOQ 300 

AGGAGTTCTA CAACCAGACA TGGGTCCAOC GCTAT6G6GA GAGCATCCTG CCCACCACGC 360 

TCACCACGCT CTGGTCCCTC TCAGTGGCCA TCTTTTCTGT TGGGGGCATG ATTGGCTCCT 420 

TCTCTQTGQG CCTTTTCGTT AACCGCTTPO GCOGGCQQAA TTCAATGCTG ATGATGAACC 480 

TGCTGGCCTT CGTGTCCGCC GTGCTCATGG GCTTCTCGAA ACTGGGCAAG TCCTTTGAGA 540 

TGCTGATCCT 0G6C06CTTC ATCATOGGTG TQTACT6G66 CCTGACCACA G6CTTCGTGC 600 

CCAT6TATGT GGGTQAAGTG TCACXX^ACAG CCTTTCC3T66 GGCCCIGGGC ACCCTGCACC 660 

AGCPGGGCAT CGTCGTOGGC ATCCTCATCG CCCAGGTGTT CGGCCTGGAC TCCATCATGG 720 

GCAACAAGGA CCTGTGGCCC CTGCTGCTGA GCATCATCTT CATCCCGGCC CTGCTGCAGT 780 

GCATCGTGCT GCCCTTCTGC OTGAGAGTC CCC6CTTCCT GCTCATCAAC CGCAACGAGG 840 

AOAACOQGGC GAAGAOTGTG CTAAAGAAGC TG06GGG6AC AGCTGAOGTG ACCCATGACX! 900 

T6CAG6AGAT GAAGOAAGAG AGT066CAGA TGATGOGGQA GAAGAAGGTC ACCATCCT6G 960 

AGCTGTTCCXS CTCCCCCGCC TACCGCX^^GC CCATCCTCAT CGCTGTGGTG CTGCAGCTGT 1020 

CXXaVGCAGCT GTCTGGCATC AACGCTGTCT TCTATTACTC CACGAGCATC TTCXSAGAAGG 1080 

OGGGGGTGCA GCAGCCTGTG TATGCCACCA TTGGCTCC3GG TATCGTCAAC ACGGCCTTCA 1140 

CTCITOOTGTC GCTOTTTGTO QTGQAOCQAO CAGGCOQQCXI QAOOCTGCAC CTCATAOGCC 1200 

T0GCT6GCAT 0GCGGGTT6T GCCATACTCA TGACCATCGC GCTAGCACTG CTGGAGCAGC 1260 

TACCCTGGAT GTCCTATCTG AGCATCGTGG CCATCTTTGG CTTTGTGGCC TTCTTTGAAG 1320 

TGGGTCCTGG CCCCATCCCA TGGTTCATCG TGGCTGAACT CTTCAGCCAG GGTCCACGTC 1380 

CAGCT6CCAT TGCCGTPGCA GGCTTCTCCA ACTGGACCTC AAATTTCATT GTGGGCATGT 1440 

GCTTGCAGTA TGTGGAGCAA CTaTOTGaTC CCTAOOTCTT C31TCATCTTC ACTGTQCTCC ISOO 

TGGTTCTGTT CTTCATCTTC ACCTRCTTCA AAGTTCCTGA OACTAAAOOC 0GQACCTTCX3 1S60 

ATGAGAT03C TTCXX3GCTTC OGGCAGGGGG GAGCCAGCCA AAGTGATAAG ACACCCQAGQ 1620 

AGCTGTTCC3V TCCCCTGGGG GCTGATTCCC AAGTGTGAGT CXSCOXAGAT CACCAGCCCG 1680 

GCCTGCTCXX AGCA6CCCTA AGGATCTCTC AGOAGCACAO GCA6CI06AT GAGACTTCCA 1740 

AA0CTGACA6 ATGTCA6CXX3 AGCGGOGCCT GGG6CTCCTT TCTCCAOCCA GGAATOATGT 1800 

CCAQAAGAAT ATTCA6QACT TAAOGGCTCC AGGATTTTAA CAAAAGCAAQ ACTGTTGCTC 1860 

AAATCTATTC AGACAAGCAA CAGGTTTTAT AATTTTTTTA TTACTGATTT TGTTATTTTT 1920 

ATATCAGCCT GAGTCTCCTG TGCCCACATC CCAGGCTTCA CCCTGAATGG TTCCATGCCT 1980 

GAGGGTGGAG ACTAA6CCCT GTGGAGACAC TTGCCTTCTT CACCCAGCTA ATCTGTAGGG 2040 

CTGGACCTAT GTCCTAAGGA CACACTAATC GAACTATGAA CTAGAAAOCT TCTATCXXAG 2100 

6A0GTGGCTA T0GCCACCXX3 TTCT6CTGQC eTGGATCTCC CCACXCTAGG GOTCAGGCTC 2160 

CATTAGGATT TGCCCCTTCC CATCTCTTCC TACCCRACCA CTCAAATTAA TCTTTCTTTA 2220 

CCTGAGACCA GTTGGGAGCA CTGGAGTGCA GGGAGGAGAG GGGAAGGGCC AGTCTGGGCT 2280 

GCXX3GGTTCT AGTCTCCTTT 6CACTGAGGG CCACACTATT ACCATGAGAA 6AGGGCCTGT 2340 

6GQA6CCTGC AAACTCACTG CTCAAGAAGA CATG6AGACT CCTGCCCTGT TGIGTATAGA 2400 

4U T6CAAGATAT TTATATATAT TTTTGGTTGT CAATATTAAA TACAGACACT AAOTTATAGT 2460 

ATATCTGGAC AAGCCAACTT GTAAATACAC CACCTCACTC CTGTTACTTA CCTAAACAGA 2520 

TATAAATGGC TGGTTTTTAG AAACATGGTT TTGAAATGCT TGTGQATTQA QGQTAGGAGG 2580 

TTTGGATGGG AGTGAGACAG AAGTAAGTG6 GGTTGCAACC ACTGCAACGG CTTAGACTTC 2640 

QACTGAGQAT CCAGTCCCTT ACAOSTACCT CTCATCAGT6 TCCTCTTGCT CAAAAATCTO 2700 

TTTGATOCCT 6TTACGCAGA QAATATATAC ATTCTTTATC TTGACATTCA A6GCATTTCT 2760 

ATCACATATT TGATAGTTGG TGTTCAAAAA AACACTAGTT TT6TGCCAGC C6T6AT6CTC 2820 

AG6CTTQAAA T06CATTATT TTQAATOTQA AGGGAA 2656 
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cf\ ^ CBfi Sequence 

DU Nucleic Acid Accession #: XM_035292.2 
coding sequence* 53.. 1576 

1 11 21 31 41 51 

I 1 1 I I I 

J J GCTCGCTGGG C0G0G6CTCC CG66TGTCCC AGGCCOQGCC GGTGCOCAGA 6CATG6066G 60 

TGCGGGCCOG AAG06QC6CG CX3CTAGCX3GC GCXS60G6CC GAQGAGAAGG AAGAGGCGCG 120 

GGAGAAGATQ CTGGCC6CCA AGAGCGCGGA 0GGGT0G600 CCGGCAGGOG AGGGCX3AGGG 180 

OGTGACCCTG CAGOGGAACA TCAOGCTGCT CAAOGGOGTG OCCATCATOG TGGGGACCAT 240 

TATC3GGCTC0 GOCATCTTCQ TGAOSCCCAC GGGOGTGCTC AAG6AGGCAG GCTOQCOGOG 300 

GCTOOOaCTG CTOQTOTOGO COQCX3TQ00O CQTCTTCTCC ATCXSTGQOCM C3GCTCTGCrA 360 

CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGOGGCQAC TACGCCTAC3V TGCTGGAGQT 420 

CTACGGCTCG CTGCCOGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCOGGCCTTC 480 

ATOOC3M5TAC ATOGTGGCCC TGGTCTTCGC CACCTACXTTG CTCAAGCOGC TCTTCCCCAC 540 

CT QCCX3 BGTG CCX:GAGGAGG CAGCX»AGCT OGTGGGCTGC CTCTGGGTGC T6CTGCTCAC 600 

GGC0GT6AAC TGCTACROCO TGAAGOCCGC CACCOGGGTC CAlGGATGCCT TTGCCTCXX3C 660 

CAAGCTCCTG GCCCTGGCCC TGATC3VTCCT GCTGGGCTTC GTCCAGATOG GAAAGGGTQA 720 

TGTGTCCAAT CTAQATCCCA ACTTCTCATT TGAAGGCACC AAACTGGATQ TGGGGAACAT 780 

TGTGCTX3GCA TTATACAQCX3 GCCTCTTTGC CTATQGAGGA T6GAATTACT TGAATTTOGT 840 

„ CAC ^AG GAA ATGATGAACC CCTACAGAAA CCTGCOCCTO GCCATCATCA TCTCCCTGCC 900 

/U CATOGTGAOG CTGGTGTAOG TOCTOACCAA CCTGGCCTAC TTCACCACCC TGTCCACOGA 960 

GCAOATGCTG TC3GTCCGAGG CCGTGGCOST GGACTTOGGG AACTATCACC TGGGCGTCAT 1020 

GTCCTGGATC ATCCCCGTCT TOGTQOGCCT GTCCTGCTTC GOCTCCQTCA ATGGOTCCCr 1080 

GTTCACATCC TCCAGGCTCT TCTTOGTGGG GTCCOXXSAA GGCCACCTGC CCTCX3VTCCT 1140 

CTCCAT6ATC CACCCACAGC TCCTCACOCC 08TGCCGTCC CTCOTGTTCA COTG TG TGAT 1200 

GACGCT6CTC TACGCCTTCT CCAAGGACAT CTTCTCOGTC ATCAACTTCT TCAOCTTCTT 1260 

CAACTGGCTC TOCOTGGCCC TGGCCATCAT GGGCATGATC TGGCTGOGCC ACAGAAAGCC 1320 

TGAGCTTGAG OGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGOCCTG 1380 

CCTCTTCCTO ATOGCCGTCT CCTTCT G GAA GACACCCGTG GAGTGTGGCA TOGGCTTCAC 1^40 

CATCATCCTC AGCGGGCXGC CCGTCTACTT CTTOGGQOTC TGGTQGAAAA ACAAGOCCAA 1500 

GTG6CTCCTC CAGGGCATCT TCTCCAOQAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 1560 

CCCCCAGGAG ACATAGCCAG GAGGCCGAGT G6CT0CCGGA OGAGCATGC 1609 



60 



65 



75 
80 



Seq ID NO: C87 DHA Sequence 

Nucleic Acid Accession 8t m 005268.1 
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coding sequence I 168.. 989 

^ 11 21 31 41 51 

^ i. ' I • I i 

J TAAAAAGCWl AAQAATTOGC GGCCQOOTOG ACAOGGGCTT CCCCGAAAAC CTTCCCCX3CT 60 

TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCT66 GAGCCAGGAO 120 

AGCOCTGAGG AGTAGTCACT CAGTAGCAGC TQACGCGTGG GTCCACCATG AACTGGAGTA 160 

TCTTTOAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AOC CiT T OG G CGCATCTGGC 240 

- TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GAOGGCOGAQ 06T6TGTGGA 300 

lU GTGATGACCA CAAGGACTTC GACTGCAATA CTOGCCAGCC CX3GCTGCTCC AAOGTCTGCT 360 

TTGATOAGTT CTTCCCTGTG TCCCATGTGC GCCTCTG6GC CCTGCAGCTT A TCCTGOTOA 420 

CATGCCCXrrC ACTGCTCGTG GTCATGCAOS TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 460 

ACCGAGAA6C CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAOOGGG 540 

- - OTGGOCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCXSAGCXTPG GACATOGCCT 600 

ID TTCTCTATOT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGXG GTCAAGT6CC 660 

AOGC AGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCOCTCA GAGAAGAACA 720 

TTTTCACCCT CTTCATGGTQ GCCACAGCTG CCATCTOCAT CCTGCTCAAC CTCGTGQAGC 780 

TCATCTACXrr GGTGAGCAAQ AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 640 

TGTGCAC3«3a TCATCACXXX: CACX5GTACCA CCTCTTCCIG CAAACAAGAC GACCTCCTTT 900 

ZU CGGGTGACXrr CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA QACOGCCCCX: 960 

GAGACCATQT GAAQAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GGAOGTTGOG 1020 

CCTGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAOTGGG OGAOCTAAGC 1080 

CATGAGGTAG GGGCAGGCyW^ GAGAGAGGAT TCAGAOGCTC TGGGAGGCAO TTCCIAOTCC 1140 

_ TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200 

GCTCGGTTTC CTTTTCTAGA ATGQAAATAG TGAGGGCCAA TGC 1243 
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Seq ID NO: CSS DNA Sequence 
KUclelc Acid Accession km_005130 
coding sequence: 98.. 602 " 



1 II 21 31 41 51 

I i I I I I 

CTCTACCTGA CACAGCTGCA GCCTGCAATT CACTCCCACT GCCTOGQATT 6CACTGGATC 60 

CGTGTGCTCa GAACAAGGTG AACGCCCAGC TGCAGCCATG AAGATCTGTA GCCTCACCCT 120 

DJ GCTCTCCTTC CTCCTACTGG CTGCTCAGQT GCTCCTG6TG GAGGGGAAAA AAAAAGTGAA 180 

GAATGGACTT CACAGCAAAG TGGTCTCASA ACAAAA66AC ACXCrGGOCA ACAOCCAGAT 240 

TAAGCAGAAA AGCAGGCCCO GGAACAAAGG CAAGTTT6TC ACCAAAGACC AAGOCAACTG 300 

CAGATGGGCT GCTACTGAGC AGGAGGAGGG CATCTCTCTC AAGGTTGAGT GCACTCAATT 360 

GGACCATGAA TTTTCCTGTG TCTTTGCTGG CAATCCAACC TCATGCCTAA AGCTCAAGGA 420 

TGAGA6AGTC TATT6GAAAC AAGTIGOCGG GAATCTGCXSC TCACAGAAAG ACATCTGTAG 480 

ATATTCCAAG ACAGCTGT6A AAACCAGAGT GT6CAGAAAG GATTTTCCA6 AATCCA6TCT 540 

TAAGCTAGTC AGCTCCACTC TATTTGOQAA CACAAAGCCC AGGAAGGAGA AAACAGAGAT 600 

GTCCCCCAGG GAGCACATCA AGGGCAAAGA GACCACCCCC TCTAGCCTAG CAGTGACXXA 660 

GACC3VTGGCC ACCAAAGCTC CCGAGTQTGT GGAGGACCCA GATATGGCAA ACCAGAGGAA 720 

GACTGCCCTO GAGTTCTGTO GAGAGACTT6 GAGCTCTCTC TGCACATTCT TCCTCAGCAT 780 

AGTGCAGGAC AGGTCATGCT AATGAGGTCA AAAGA6AA0G GGTTGCTTTA A6AGAT6TCA 840 

TGTCX3TAAGT CCCTCTGTAT ACTTTAAAGC TCTCTACSVGT CCCCC3CAAAA TATGAACTTT 900 

TGTGCTTAGT GAGTGCAAOG AAATATTTAA ACAAGTTTTG TATTTTTTGC TTTTGTGTTT 960 

TGGAATTTGC CTTATTTTTC TTGGATQCQA TOTTCAGAGG CTGTTTOCTG CAGCATGTAT 1020 

TTCCATGGCC CACACAGCTA TGTGTTTGAG CAGOGAAGAG TCTTTGAGCT QAATGAGCCA 1080 

GAGTGATAAT TTCS^OTGCAA 06AACITTCT GCTGAATTAA TGGTAATAAA ACTCTGG6TG 1140 

TTTTTCAAAA AAAAAAAAAA AAA 1163 

Seq ID NO J C89 DMA Sequence 
Dj Nucleic Acid Accession #t BC022542 
Coding sequence : 274 . . 927 



1 11 21 31 41 51 

i 1 I 1 I I 

ACTTGGTCCC AGCOGATAAA TCTGGGGCAG CGCGG6GTAG GA6CTGC60G CGGCCAGGCC 60 

CCTTCCTGCO TCCGCACCTG GCCCCGOGCG CCCCTCTCQO OCOTCCQGCT TCCGGCOTCC 120 

TGGCGGCTCO GGTGGCGGOO GTT0QGG0G6 CG6CCTG6CT GCTCCT0GG6 GOGGCGAOGO 180 

GGCTCACGOG CGGGCCCGCC ACGGCCTTCA CCGCCGCGOG CTCTGAOQCC GGCATAAGGG 240 

CCATGTGTTC TGAAATTATT TTGAGGCAAG AAGTTTTGAA AGATGGTTTC CACAGAGACC 300 

TTTTAATCAA AGTGAA3TTT GGGGAAAGCA TTGAGGACTT GCACACGTGC CGTCTCTTAA 360 

TTAAACAGGA CATTCCTGCA GGACTTTATG TG6ATC0GTA TGAGTIGGCT TCATTACGAG 420 

AGAGAAACAT AACAGAGGCA GTGATQGTTT GAGAAAATTT TGATATAGAG GCCCCTAACT 460 

A TTTG TCCAA GGAGTCTGAA GTTCTCATTT ATGCCAGACG AGATTCACAG TGCATTQACT 540 

6TTTTCAAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCaCAGT GAAGATGGAG 600 

AAGCCTCGAT TGTGGTCAAT AACCCAGATT TOTTGATOTT TTGTGACCAA GAGTTCCCQA 660 

TTTTGAAATG CTGGGCrCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AAT6AGGATA 720 

TATGCCAATO GAACAAGATG AAOTATAAAT CAGTATATAA GAATGTGATT CTAOUVGTTC 780 

CAGTGGGACT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATTACAATCC 840 

TGTGCTCTAC ATTQATCCTT GTAGCAGrTT TCAAATATGG CCATTTTTCC CTATAAGTTT 900 

TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTO A06AGAGGTG 960 

TTCTTCTAQA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGOCAAAA TTATCTTTAC 1020 

TAGAGGAAAT TTGGGATCAT TCTCAGCTAA TTCCAAAATG TAGTGCTCTA TTGCATOGAT 1080 

CCTTGGTAAT CCTCAAGCAT CAGAT6CCAT AAGGGQAAAC TTAATTCTGC TAAATTAATG 1140 

TTTATTTTGT GAGAAGTGAC TTTA TCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 1200 

OTAGAG ACAA ATTATTCTCA TTTTGCAAGT ACTTTCAATT TAAGCTACAA ATTQAGAAAA 1260 

CCOT TATAAA TAAGAATAAA ATA60CCA00 CACA0T6GCT CACAOCTGTA ATCCCA6CAC 1320 

TTT GGQA QGC CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGOTQAA 1380 

ACCCTGTCTC TACTAAAAAT ACAAAAGTTA GCTQGGOCTG GTGGTG6GCA TCTGTAOTCC 1440 

CAGCTAATTO GAAGGGTGAG GOQGQAG6AT C6CTTQAACC TGG6AGG0Q0 AOOTTCCAQA 1500 
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GAGCCAAGAT 
GGAAAAACAA 
ATGTCATGAG 
CTAAGAAATT 
AOQCACTCOV 
GGACTTGATG 
TTGCCATTTT 



CJGCACCACTG 
AAAAGAAGAA 
ACTATTAAAG 
AATATTAATA 
TTCTCCTTTT 
AAACTOAGTA 
AAATAAAGTT 



CACTACAGCC 
TAAAATAATT 
ATGXGCCAGA 
TAAAAATTAT 
ACATTTTATC 
CTAA6ATTT0 
GTACAT6AAC 



TGGOOGACAG 
TGGATQAAAA 
GTTTCAATGA 
TGATAATCTT 
ATGTTTCTTT 
GTACAGAGTA 
AAAAAAAAAA 



AAOGAGACCC 
TCATGTTTAT 
AAATGATTAA 
AAATTATTOA 
TGAATATATO 
TGTCAOQAAO 



TGTCTCCAAA 
TTAAATACSTA 
AGTAGGACAG 
TTATTGCTTA 
AATTQ6CAAA 
ACAACTCAGA 



Seq ID NOt C90 DNA Sequence 
Nucleic Acid Accession NM_004994 
Coding seqvencei 20.. 2143 



AGACACXrrCT 
GGGCTGCTGC 
CCTGAGAACC 
CACT0GGGT6 
CCAGAA6CAA 
G06AACCCCA 
CAAGTGGCAC 
GGCG6TGATT 
CACCTTCACT 
GCACGGAGAC 
TG0CCCCX3QC 
GGGCXSTCGTO 
CATCTTCXSAQ 
CTGGTGCAGT 
GAOACTCTAC 
CCAAGGCCAA 
OQOCACCACC 
CT08ACQGTO 
GGGTAAGGAG 
TACCACCTOG 
TTTGTTCCTC 
GCGOGAGGGQ 
C6A0GTGAAT 
AACCACCACC 
TGTCCACXXX: 
AGGTCCCXXX: 
TGCCTGCAAC 
CAAGGATGGO 
CCTTATOGCC 
GCTCTCCAAO 
GGTGCTGGGC 
OGGGGGCCTC 
GTTC6A06TG 
CCCOGGOaTG 
CCAGGACXX3C 
GG6CTAC6T0 
GCftGTGCCAT 
CAAACTGGTA 
TCACCTTTGT 



II 

! 

GCCCTCACCA 
TTTGCTGCCC 
AATCTCACCQ 
GCAGAGATQC 
CTGTCCCTGC 
CGGTGCGGGG 
CACCACAACA 
GACGAOGCCT 
CX3CGT0TACA 

gggtatccct 
attcagggag 
gttccaactc 



ACCACGGCCA 
ACCGGGGACG 
TCCTACTCOG 
GCCAACTAGG 
ATGGGGGGCA 
TACTOGACCT 
AACTTTGACA 
GTGGCXjGOGC 
CTCATGTACC 
GGCATCCX3GC 
ACACCGCAGC 
TCAGAGCX3CC 
ACTGCTGGCC 
8TGAACATCT 
AAGTACTGGC 
GACAAGTGGC 
AAGCTTTTCT 
OOGAGGCGTC 
0GGA6TGGCA 
AA66CGCAGA 
CCTTTGGACA 
TTCTACTGGC 
ACCTATGACA 
GTAAATCCCC 
TTCTOTTCTG 
TTTTTOTTGO 



21 
I 

TGAGCCTCTG 
CCAGACAGCO 
ACAGGCAGCT 
GrGOAGAOTC 
CCXsAGACOGG 
TCCCAGACCT 
TCACCTATTQ 

QCCGGOAOOC 
TCGACG6GAA 
ACGCCCATTT 
GGTTTGGAAA 
ACTCTGCCTO 
ACTACGACAC 
GCAATGCTGA 
CCTGCACCAC 
ACCGGGACAA 
ACT06G0GGG 
GTACCA6O0A 
GCGACAAGAA 
ATGAGTTCGG 
CTAT6TACGG 
ACCTCTATGG 
CCACGGCTCC 
CCACAGCTGG 
CTTCTA0G6C 

ToaA oaccAT 

GATTCTCT6A 
CCG06CTGCC 
TCTTCTCTGG 
TGGACAAGCT 
GGGGGAAGAT 
TGGTGGATGC 
CXSCACGACOT 
G0GT6AGTTC 
TCCT6CAGT6 
ACTG6GACCA 
GAfiOAAAGGO 
AGTG7TTCTA 



31 
I 

GCAGCCCCTO 
CCAGTCCACC 
6GCAGA6GAA 
QAAATCTCTG 
TGAGCTGGAT 
GG6CAGATTC 
OATOCAAAAC 
CTTGGCACTO 
AGACATCGTC 
GGAOGGGCTC 
CGACGATGAC 
CGCAGATG6C 
CACCACG6AC 
CGAOGACCGG 
TGGGAAACCC 
GGAOGGTGOC 
GCTCTTOGGC 
GGA6CTGTGC 
GGGCCGCGGA 
GTGGGGCTTC 
CCACGCGCTG 
CTTCACTGAG 
TCCTC6CCCT 
CCCGACX3GTC 
CCCCACAGGT 
CACTACTGTG 
06GQGAGATT 
GG6CA06GG6 
COSCAA6CTG 
606CCAGGTG 
6G0GCTGGGA 
GCTOCTGTTC 
C0GGA6CGCC 
CTTCCAGTAC 
CCGGAGTGAG 
CCCTGAGGAC 
ACCCTGGGGA 
AGGAGTGGAO 
ATAAACTTGG 



41 

I 

GTCCTG6TGC 
CTTQTOCTCT 
TA0CT6TACC 
GGQCCTGCGC 
AGCGCCAOGC 
CAAACCTTTG 
TACTGQGAAQ 
TQQAGGOGGG 
ATCCAGTTTG 
CTGGCACACG 
GAGTT6TGGT 
GOGGGCTGCC 
GOr OG CT O CO 

TGCCAGTTTC 
TCGGAOGGCT 
TTCTGGOGGA 
GTCTTOCCCT 
6ATGGGCGCC 
TGCXXX5GACC 
GGCTTAGATC 
GG6CCXCCCT 
GAACCTGAGC 
TGCCCCACCG 
CCCCCCTCAG 
CCTTTGAGTC 
GGGAACCAGC 
AGCCGGCOGC 
GACTCGGTCT 
TGGGTGTACA 
GCX30A0GTGG 
AOGOGGOGGC 
AGOGAGGTGG 
CQAGAGAAAG 
TTGAACCAGG 
TAGG6CTGCC 
AGGAGCCA6T 
GTGG6CT6G0 
ATTCTCTAAC 



51 

1 

TCCTGGTGCT 
TCCCTGGAQA 
GCTATGGTTA 
TOCTGCTTCT 
TGAAGGCCAT 
AGGGCGACCT 
ACTT0CC606 
TQA06CGGCT 
GTGT0G0G6A 
CCTTTCCTCC 
CCCTGGGCAA 
ACTTOCCCTT 
ACG6CTTGCC 
GCCCCAGOGA 
CATTCATCTT 
ACCGCTGGTQ 
OCGGAGCTQA 
TCACTTTOCT 
TCTGGTGOOC 
AAGGATACAQ 
ATTCCTCAGT 
TGCftTAAGGA 
CAC6GCCT0C 
GACCCCCCAC 
CTGGCCCCAC 
OGGTGGACGA 
TGTATTTGTT 
AGG6CCCCTT 
TTGAG6AGCC 
CAGGCGCGTC 
CCCAGGTGAC 
GCCTCTGGAG 
ACOGGATGTT 
CXH'ATTTCTG 
TGCACXAAGT 
GTCCTGCTTT 
TTGCXXSGATA 
CCCTCTCTTC 
CTTT 



Seq ID 2^0: C91 Sequence 

Nucleic Acid Accession A: NM_000213 

Goding sequence i 188.5656 



G06CTGGCGB 
T06CC06OQC 
GCCCC6A6GT 
GAAGAGGATG 
CAGOGTCAGC 
CACGGAOTGT 
006GOGCTGC 
GGTCATGGAG 
CA6CCAGATG 
TGAGCTG6AO 
CTCCAACTCC 
GGTCCTGAGC 
CAGCG-rcCCG 
CCCCCCCTTC 
TAAACTGCAG 
CATOCT6CA0 
GCrGGTCTTC 
TGOCATCATG 
GTACAGQACA 
CATCATCCCC 
CTATTTOOCT 
GCTOQAGGAG 
COGAGGCCTT 
TCACATGOQO 




AACACCCAGG 
AGCAQCTTCC 
TCCCCCCAAG 
GTGTTTGAGC 
ATGTCCQATG 
CAGCrCACCA 
CAGAOGGACA 
TCCTTCAAQA 
GGAOAGOGGA 
ACAGCTGTGT 
TCCACOGAGT 
AGCXX3CAACG 
CAGGACTACC 
ATCTTTGCTG 
GTCTCCTCAC 
GCCTTCAATC 
CGQACAGAOG 
OS0G6GGAAG 



21 
I 

ACOCCGCAAC 
CATCrCCTAG 
OGGGGGCACA 
GCCCCAQCCC 
CCTTGGCAAA 
ATAAGOACTG 
CGGAGCT6CT 
AAATCACAGA 
GCCTGOGGGT 
CACTGGAGAO 
ATCZGGACAA 
6GGACTACAC 
TQAGGCCTGA 
ACGTCATCAG 
TCTCAGGCAA 
GCA0GAGG6A 
CAGCCTTCTCA 
ATGAAOGGTG 
OGTCGGTGCC 
TCACCAACTA 
TGGGGOTGCT 
GGATOOOCrC 
TCAOCTCCRA 
TGGGTATATA 



31 
1 

OGGCAGCCCA 
GCAGCAGCCXS 
ATGQGCCAGO 
COGCTGCAAG 
GQCCTACTGC 
G6CC3QOGG6C 
GGAGAGCCAG 
CCGTCTG066 
CCCCDGTOGAC 
GCTCAAOAAO 
TATTOGATTT 
GAAGCTGAA6 
CCTGACAGAA 
CCTGGATGCr 
GATTGGCTGG 
CTATQAOGCT 
CCACCTGGAC 
CACCCTGGTG 
CTCCTATAGC 
6CAGGA0GAC 
CAACCTGGAC 
GATGTTCCAG 
CCAGGT6CAG 



41 
I 

GGCCCTOGGA 
GG0GCX3GAGG 
AGGCTGGCCG 
CTGCTCCTGQ 
AAGGCCCCAG 
ACAGACGAGA 
T60CAG0GGG 
ATTGACACCA 
CCOQGTGAGG 
CTGTACATCC 
ATGGGGGUSA 
GGCAAGTTTO 
GAGCCCTGGC 
GATGTGGATG 
CCTGAGG60G 
GGCCOGGACA 
GATGGOOCCA 
ACCACGGCCA 
CGCCTGCTCO 
TACTACGAGA 
TOOTOCAACA 
ATCOGGGCCC 
AAGAGQAGGA 
CIGCGGGCCC 



1560 
1620 
1680 
1740 
1800 
1860 
1906 



51 
I 

CA6TGCCTGC 
6A60GAGTCC 
GGAG AGGG AG 
CAOCCTTGAT 
TGAA6AGCTG 
TGTTCAQQGA 
AGAGCATCGT 
CCCT60GGCG 
AGOGGCATTT 
TCATGGACTT 
ACCTGGCTOO 
TGGACAAAGT 
CCAACAOTQA 
AGTTCOGGAA 
GCTTOGATGC 
GCACCCACCT 
ACGTGCTGGC 
CCTACACCCA 
CCAAGCACAA 
A GCTT CACAC 
TOQTGGAGCT 
TAGACAGCCC 
CTGGGTCCTT 

ttcAgcaogt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

760 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1360 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2334 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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GGATGGGAOG CACGTGTGCC AGCTSCC G GA GGACCAGAAG GGCAACATCC ATCTGAAACC 1500 

TTCCTTCTCC GAOOGCCTCA AGATGGACGC GGGCATCATC TGTGATGTQT QCACCTQOGA 1560 

GCTGCAAAAA GAGGTGOOGT CAG C TOGCTG CAGCTTCAAC GGA6ACTT0Q T6T6CGGACA 1620 

GT G TGTGT G C AGOGAGGGCT GGAGTC G CCA GAOrTGCAAC TQCTCCACOQ GCTCTCTQAG 1680 

TGACMTCAO 0CCTGCCT6C GGGAOGOOQA. G6ACAM3COQ TCCTCCGGCC GTGG6GA6T0 1740 

CCAGTGOGGG CACTGTGTGT GCTACGGCGA AGGCCGCTAC GAGGGTCAGT TCTGCXMTA 1800 

TGACAACTTC CAQTOTCCCC GCACTTCCX30 GTTCCTGTGC AATGACCX3AG GACGCTGCTC 1860 

CAT6GGCCAG TGTGTGTGTG AGCCTGGTTG GACA66CCCA AGCTGTGACT GTGCCCTCAG 1920 

CAATGCCAOC TGOkTCXSACA GCAATGGCGG CATCI6TAAT GQA09TQGCC ACTQTGAGTO 1980 

TGGCOGCTGC CACT6CCACC A0CAQTCX3CT CTACACGGAC ACCATCT6CG AGATCAACTA 2040 

CTCGGOGATC CACCCGGGCC TCTGCGAGGA CCTA06CTCC TGOGTGCAGT GCCAGGOGTG 2100 

GGGCACCX36C GAGAAGAAGG GGCGCACQTG TGAGGAATGC AACTTCAAGQ TCAAGATGGT 2160 

GGAOGAGCTT AAGAGAGCCG AGOAGGTGGT GGTG06CT6C TCCTTCCX3GG ACGAGGAT6A 2220 

CX3ACT6CACC TACA6CTACA CCAT6GAAGG TGAGG60GCC CXirZGQGOCCA ACAGCACTGT 2260 

CCTGOTGCAC AAGAAGAA6G ACTGCCCTCC GGGCTCCTTC TGGT6GCTCA TCCCCCTGCT 2340 

CCTCCTCCTC CTGCCX5CTCC TGGCCCTGCT ACTGCTGCTA TGCTGQAAGT ACTGTGCCTG 2400 

CTGCAAGGCC TGCCTGGCAC TTCTCCCGTG CTGCAACCQA GGTCACATGG TOGGCTTTAA 2460 

GGAAGACCAC TACATGCTGC GGGAGAACCT GATGGCCTCT GACCACTTGG ACAGGCCCAT 2520 

GCTOOQCAOC GGGAAOCTCA AGGGCCGTGA 06TQ8TGCGC TQGAAGQTCA CCAACAACAT 2560 

GCAGGGGCCT 6GCTTT0CCA CTCATGC06C CAGCATCAAC CCCACAGAGC TGGTGCCCTA 2640 

OGGGCTGTCC TTGOOCCTGG CCXXSCCTTTG CACC6AGAAC CTGCTQAAGC CTGACACTOO 2700 

GGAGTGCGCC CAGCTGCGCC AGGAGGTGGA GGAGAACCTG AACGAGGTCT ACAGGCAGAT 2760 

CTC0G6TGTA CACAAGCTCC A6CAGACCAA GTTC0G6CAG CAGCCCAATG CX^GGGAAAAA 2820 

GGAAGACCAC ACCArrGTQG ACACAGTGCT 0ATGGG6CCC 00CTCG6CCA AGCOGGGCCT 2880 

GCTGAAGCTT ACAGAQAAGC AGGTGGAACA GAGG6CCTTC CA06ACCTCA AGGTOGCCCC 2940 

CGGCTACTAC ACCCTCACTG CAQACCAGGA CGCCOGGGGC ATGGTGGAGT TCCAGGAGQG 3000 

CGTGGAGCTG GTGGACGTAC GGOTGCCCCT CTTTATCXXX3 CCTGAGGATG ACGACGAGAA 3060 

GCAGCTGCTO GTGGAGGCCA TOGACGTGCC CGCAGGCACT GCCACCCTOQ OCOGCOGCCT 3120 

GQTAAACATC ACCATCATCA AG6AGCAA6C CAGAGAOGTG GTGTCCTTTG AGCAGCCTGA 3180 

GTTCTOGGTC AGCCGCGGGG ACCAGGTGOC CCOCATCCCT GTCATC06GC QTGTCCTGQA 3240 

CGGCGGGAAG TCCCAGGTCT CCTACCGCAC ACAGGATGGC ACCGCX3CAGG GCAACCGGGA 3300 

CTACATCCCC GTGGAGGGTG AGCTGCTGTT CCAGCCTGGG GAGGCCTGGA AAGAGCTGCA 3360 

GGTGAAGCTC CTGGAGCTGC AAGAA6TTGA CTCCCTCCTG OGGGGCCGCC AGGTCCGCCG 3420 

TTTGCACGTC CAGCTCAGCA ACCCTAAGTT TGGG6CCCAC CTG06CCA6C CGCACTCCAC 3480 

CACCATCATC ATCAGGGACC CAGATGAACT GGACCX3GAGC TTCACGAOTC AGATOTTOTC 3540 

ATC3VCAGCCA CCCCCTCACO GCGACCTGGG CGCCCCGCAG AAOXCAATG CTAAGGCCGC 3600 

TGGQTCCAGG AAQATCCATT TCAACTGOCr QCCCCCTTCT GGCAAGCCAA TOGGOTACAG 3660 

GGTAAAGTAC T06ATTCAGG G06ACTCGGA ATCCQAAGCC CACCTGCTCG ACAGCAAGGT 3720 

GCCCTCA6TG GAGCTCACCA ACCTGTACCC GTATTGC6AC TAT6AGATGA AQOTOTGGQC 3780 

CTAOGGGGCT CAGG6CGAG0 6ACCCTACAQ CTCCCTGGT6 TCCTGCOGCA CCCACCAG6A 3840 

AGTQCCCAGC GAGCCAGGGC GTCTGGCCTT CAATOTCGTC TCCTCCACGG TGACCCAGCT 3900 

GAGCTGGGCT GAGCCGGCTG AOACCAAOQQ TGAGATCACA GCCTACGAGG TCTGCTATOa 3960 

CCTG6TCAAC GATQACAACC GACCTATTGG GCCCATGAAG AAAGTGCTGG TTGACAACCC 4020 

TAAGAACCGG ATGCTGCTTA TTGAOAACCT TCQGQAOTCC CAGCCCTACC GCTACAOGST 4080 

GAAOGCGCGC AACGGGGCCG GCTGGGGGCC TGAGCGGGA6 GCCATCATCA ACCTGGCCAC 4140 

CCAGCCCAAG AGGCCCATGT CCATCCCCAT CATCOCTGAC ATCCCTATCQ TGGACGCCCA 4200 

GAGOGGGGAG GACTACGACA GCTTCCTTAT QTACAGCGAT QAa3TTCTAC GCTCTCCATC 4260 
GGGCAGCCAG AGGCCCAGOG TCTOCGATGA CACTGGCTGC GGCTGGAAGT TCGAGCCCCT 4320 
GCTGGGGQAQ GAGCTGQACC TQCGGOQCQT CAOTrGGCGG CTGCCCCCX5Q AGCTCATCCC 4380 
GCGCCTGTCG GCCAGCAGOG GGOGCTCCTC CGACGCOGAG 6CCCCCACGG CCCCCCGGAC 4440 
GACGGCGGOG CGGGCGGGAA GGGCGGCAGC COTGCCCOGC AGXGCGAGAC COGGGCCCCC 4500 
CGGAGAGCAC CTGGTGAATG GCCGGATGGA CTTTGCCTTC CCGGGCAQCA CCAACTCCCT 4560 
GCACAGGATG ACCACX3ACCA GTGCTGCTGC CTAIGGCACC CACCTGAGCC CACACGTGCC 4620 
CCACCGCGTG CTAAGCACAT CCTCCACCCT CACACGGGAC TACAACTCAC TGACCCGCTC 4680 
AGAACACTCA CACTCGACCA CACTGCCCAG GGACTACTCC ACCCTCACCT CCGTCTCCTC 4740 
CCAOGACTCT 06CCTGACTG CTGGTOTQCC OGACACGCCC ACCOGCCTGG TGTTCTCTGC 4800 
CCTGGGGCCC ACATCTCTCA GAGTGAGCTG GCAGGAGCCG OGGTGCQAGC GGCCGCTGCA 4860 
GGGCTACAGT GTGGAGTACC AGCTGCTQAA OGGGQGTGAG CTGCATOGGC TCAACATCCC 4920 
CAACCCTGCC CAGACCTCGG TGGTGGTGGA AGACCTCCTG CCCAACCACT CCTACGTGTT 4980 
CCGCGTGCGG GCCCAOAGCC AGQAAGGCTG GGGCCGAGAO CGTQAGGGTG TCATCACCAT 5040 
TGAATCCCAG GTGCACCCGC AGAGCCCACT GTGTCCCCTG CCAGGCTCCG CCTTCACTTT 5100 
GAGCACTCCC AOTGCCCCAG GCCCGCTGGT GTTCACTGCC CTGAGCCCAQ ACTCGCTGCA 5160 
GCTQAGCTGG GAGCGGCCAC 6GAGGCCCAA TGGGGATATC GTCGGCTACC TGGTGACCTQ 5220 
TQAGATGGCC CAA6GAGGAG GGCCAGCCAC CGCATTCCGG 6TGGATGGAG ACAGCCCCGA 5280 
GAGCCGGCTG ACOGTGCCGG GCCTCAGCGA GAA0GT6CCC TACAAGTTCA AGGTGCAGGC 5340 
CAGGACCACT GAGGGCTTCQ GGCCA6AGCG CGAGGGCATC ATCACCATAG AGTCCCAGGA 5400 
TGGAGGACCC TTCCCGCAGC TGGGCAGCCG TGCOGXSGCTC TTCCAGCACC CGCTGCAAAG 5460 
CGAGTACAGC AGCATCACCA CCACCCACAC CAGOGCCACC GAGCCCTTCC TAGTGGATGG 5520 
GCTGACCCTG GGQOCCCAGC ACCTGGAGGC AGGOGGCTCC CTCACCOGGC ATGTGACCCA 5580 
GGAGTTTGTG AGCCGGACAC TGACCACCAG CGGAACCCTT A6CACCCACA TGGACCAACA 5640 
G7TCTTCCAA ACITQAGCQC ACCCTGCOCC ACCCCOGCCA TGTCCCACTA GGOGTCCTCC 5700 
CGACTCCTCT OCCGGAOCCT CCTCACCTAC TCCATCCTTG CACCCCTGGO GGCCCAGCCC 5760 
ACCCGCATGC ACAGAGCAGG GGCTAGGTGT CTCCTGGGAG GCATGAAGOG 6GCAAGGTCC 5820 
GTCCTCTGTG GGCCCAAACC TATTTGTAAC CAAAOAQCTG GGAGCAGCAC AAQQACCCAO 5880 
CCTTTGTTCT GCACTTAATA AATGGTTTTG CTACTGCTAA AAAAAAAAAA AAAAAAAAAA 5940 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 5994 



80 



Seq ID NO: C92 DNA Sequence 
Nucleic Acid Accession »t NM_02391S 
Coding sequence: 250.. 1326 ~ 



1 11 21 31 41 51 

I I I I ] I 

GGCAOQAGGG TTTOGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 
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TCAAAGCTTA 
GTGAATGGAC 
CCCAG6CCTC 
AACTGAAGAA 
CAAOAGAGTC 
AATGAATTTG 
TTGCTGAATQ 
TTCTATCTCA 
ATAGTCCATG 
TCAOTTTTGT 
GATOGCTATC 
ACQAAGGTTT 
ATCCTGACAA 
CCmGGGGG 
GTGCTGGTGA 
AGGCAATTCA 
GTGQCTGTGT 
AGTCACTTAG 
ATTACACTTT 
TGTAGGTCAT 
ATCAGATCAC 
GTGTAGGCCT 
TTCATTATCC 



TTCTTAATTA 
AGCCAGCCAC 
AATOGTCCCC 
TGGGGTTCAA 
ACAATTCAGG 
ACACAATTGT 
QTTTAGCAGT 
AAAACATAGT 
ATGCAGGATT 
TTTATGCAAA 
TGAAGGTGGT 
TATCTGTTTG 
ATGGTCAGCC 
TCAAATGGCA 
TTCTOATCGG 
TAAGTCAGTC 
TTTTTACCTQ 
ACAGGCTTTT 
TCTTGTCTGC 
TTTCAAGAAG 
TGCAAAGTGT 
TTTATTGTTT 
TTAAAAAAAA 



GAGACAAGAA 
CACAATGAAA 
AAGTQTTTCC 
CTTGAOGCTT 
CAACAGGA6C 
CTTGC06GTG 
GTG6ATCTTC 
GOTTGCAOIC 
TGGACCTTGG 
CATGTATACT 
CAAGCCATTT 
TGTTTGGGTG 
AACAGAGQAC 
TAOGGCAGTC 
ATGTTACATA 
AAGCCGAAAG 
CTTTCTACCA 
AGATGAATCT 
GTGTAATGTT 
GCTGTTCAAA 
GAGAAGATOG 
OTTGGAATOO 
AA 



ACCTGTTTCA 
GAAATGAAAC 
TGACAC6CAT 
GCAAAATTAC 
GACG66CCA6 
CTTTATCTCA 
TTCCACATTA 
CTCATAATGA 
TACTTCAAGT 
TCCATCGTGT 
GGGGACTCTC 
ATCATGGCTQ 
AATATCCATG 
ACCTATGTGA 
GCCATATCXIA 
GGAAAACATA 
TATCACTTGT 
GCACAAAAAA 
T6CCTGGATC 
AAATCAAATA 
GAAGTTCGCA 
ATATGTACAA 



ACTTGAAGAC 
CAGGAATAAC 
CTTTGCTTAC 
CAAATAAOGA 
GAAAGAACAC 
TTATATTTGT 
GGAATAAAAC 
OGCTGACATT 
TTATTCTCTG 
TCCTTGGGCT 
GGATGTACAG 
TTTTGTCTTT 
ACTGCTCAAA 
ACAGCTGCTT 
GOTACATCCA 
ACCAGAGCAT 
GCAGAATTCC 
TCCTATATTA 
CAATAATTTA 
TCAGAACCAG 
TATATTAT6A 
AGTGTAAATA 



ACCXTTATGAQ 
CTATQCTGAA 
AGTOCATCAC 
GCTGCACGGC 
CACCCTTCAC 
GGCAAGCATC 
CAGCTTCATA 
TCCATTTOGA 
CAGATACACT 
GATAAGCATT 
CATAACCTTC 
GCCAAACATC 
ACTTAAAAGT 
GTTTGTGGCC 
CAAATCCAGC 
CAGGGTTQTT 
TTTTACTTTT 
CTGCAAA6AA 
CTTTTTCATG 
GAGTQAAAGC 
TTACACTGAT 
AATGTTTCTT 



Seq ID KO: C93 DKA Sequence 

Kucleic Acid Accession tit NM_020789.1 

Coding sequence! 208.. 3699 

1 11 21 31 41 51 

1)11 I 1 

G6CACGAGGG TGGAGC0GA6 OGOTGCGGAO CAQATCIGGT GGTTCTCOGO AGAOCAGCTT 
CCTT6GGTGT TACATGAGCC AA6CCCTCAC TGTACAGAAG AGTGAGAGCT GAAACCTGTT 
CCCTGAGCTQ ATCAOAAGQA CATCCCTTGO CCCCTCXATC TGGGCTCCTG TGGATAGGAG 
GGGCTGGGTG AGCAGGCCAG CTGGGCTATG GTGTGGTGCC TCGGCCTGGC CX3TCCTCAGC 
CT6GTCATCA GCCAGGGGGC TOAOG6TCX3A GGGAAGGCTG AGGTGGTATC GGTGGTGGGC 
C6GGCTGAGG AGAGTGTGGT GCTGGGCIOT GACCTGCTGC CCCXX36CXX30 COGOCXXXCC 
CTGCAT6TCA TCGAGTG6CT GG6CTTTGGA TTCCTGCTTC CCATCTTCAT CCAGTTCX3GC 
CTCTACTCTC CCOGAATTGA CCCTGATTAC GTCGGAGGAa TCOQGCTGCA GAAGGGGGCC 
TCrCTCCAGA TTGAGGGTCT CCGGGTGGAA GACCAGGGCT GGTACGAGTG C0GCX3TGTTC 
TTCCTGGACC AGCACATCCX: TGAAGAOQAT TTTQCTAAOQ GCTCCTGQGT GCATCTGACA 
GTCAATTCAC CCGCTCAATT CCAGGAGACA CCTCCTGCTQ TGTTG6AAGT GCAGQAACXO 
GAGOCTGTGA CCCTGCGTTG T G T G GCCCgT G6CAGCCCCC TGCCTCATGT GAG8TGGAAG 
CTCCX3A6GAA AGGACCTTGG CCAGGGCCAG GGCCAOGTGC AAGTGCAGAA CQGGACGCTG 
CXSGATCCGCC GGGTAGAGCG AGGCAGCTCT GGGGTCTACA CXTTGCCAAGC CTCCAGCACT 
GAGGGCAGCG CCACCCACGC GACCCAGCTO CTAGTGCTAG GACCCCCAGT CATCGTGGTO 
CCCOCCAAGA ACA6CACAGT CAATGCXTTCC CAGGAIGTTT CATTGGCCTG OCATGCTGAO 
GCATAOCCTO CTAACCTCAC CTACAGCI60 TTCCAGGACA ACATCAATGT CTTCCACATT 
AGCCGCCTGC AGCCCCGGGT GCAGATCCTG GTGGACX3GGA GCXT6CGGCT GCTG6CCACC 
CAGCCTOATG ATQCCX3GCTG CTACACCTGT GTGCCCAGCA ATGGCCTCCT GCATCCACCC 
TCAGCCTCTG CCTACCTCAC TGTGCTCT6C ATQCOGGGGG T6ATC0GCTG CC0QGTTCG7 
GCXAAGCGCC CACT6CTCTT TGTGAGCTQG ACCAAGGATG GAAAGGCXCT GCAGCTGGAC 
AAGTTCCCTG 6CTGGTCCCA GGGCACAOAA GGCTCACTGA TCATOSCXICT GGGGAAGGAG 
GATGCCCTGQ GAGAATACTC CTGCACCCXXr TACAACAGTC TTGOTACOGC CXWGCCCTCT 
CCTGTGACCC GCGTGCTGCT CAAGGCTCXX CCAGCTTTTA TAQAGCGGCC CAAGGAAGAA 
TATTTCCAAG AAGTAGGGCG GGAGCTGCTC ATCCCCT6CT COGGCCAAGG GGACOCTCCT 
CCT Q TTgrCT CTTGGACCAA GGTGGGCOQO GGQCTGCAAO GCCAGGCCCA G6TGGACAGC 
AACAGCAGCC TCATCCT600 ACCATT6AGC AAG6AG6CCC ACOGOCACTO GGAATGCAGT 
GCCAGCAATG CTGTGGCCCO AGTGGCCACC TCCACGAACG TCTAOGTGCT GGGCACTAGC 
CCTCATGTTG TCACCAATGT GTCCGTOGTa GCTTTQCCCA AGGGTGCCAA TGTCTCCTOG 
GAGCCTGGCT TTGATGGTGG TTATCTGCAG AGATTCAGTG TCTGGTACAC CCCACTGGCC 
AAGOGTCCTO AC06AATQCA CCATGAdGO GTGTCX:ttGO CAGT8CCT6T GGGGGCTGCT 
CACCTCCTAO T60CAGG6CT 6CAGCCCCAC ACCCA6TACC AGTTCAGOQT GCTAGCTCAG 
AACAAGCTGO OGAOTGGTCC CTTCAOOGAA ATOOTCTTOT CTGCTCOQGA AGGGCTTCCT 
ACCAOGCCAG CTGCACCCGG GCTTCCCCCA ACAGAGATAC CGCCTCCCXn' GTCCCCTCXX5 
06GG0TCTGG TGGCAGTGAG GACACCCOGG GGGGTACTCC TGCATT6GGA TCCCCCA6AG 
CTX^GTCCCTA A6AGACTGGA TGGCTAOGTC TTG6AAGGCC GGCAAGGCTC CGAGGGCTGG 
GAGGTGCT6G ACCOGGCrGT 6GCAGGCACA GAAACAOAGC TGCTGGTGCC AGGCCTCATC 
AAGGATGTTC TCTACGAGTT CCGCCTCGTG GCCTTGGOGG GCAGCTTG6T CAGOGACCCC 
AGCAACACGG CCAAOQTCTC C3^CTTCCGQT CTGQAGGTCT ACCCTTCOC3Q CAOGCAGCTG 
C0G60CCTCC TGCCTCAGCC OGTGCXGGCX: GGCGTGOTGG GOGGAGTCTQ CTTTCTGGGA 
OTQOOCGTCC TTGTGAGCAT CCIOGCOGQC T6CCTCCTQA ACGGG06CAG G6CT6CCCGC 
OGCOGCOGCA AG0GCCTCCX3 CCAAGATCCA CCTCTTATCT TCTCTCOQAC CSGGGAAiSTCA 
GCTGCACCCT CTGCTCTGGG CTCAGGCAGT CCTGACAGOG TGGOGAAGCT GAAGCTCCAO 
GGATCCXXAG TCCCCAGCCT GOGCCAGAGT CTGCTCTGGG GGGATCCTGC OGGAACTCCC 
AGCCCCCACC CGGATCCTCC ATCTAGCCGG GGACCCTTAC CTCTGGAGCC CATTTGCOGG 
G0CXCAQAG6 6G0GCTTTGT GATGGGGGCC ACTOT G OCQQ CC0CCCA6QA AAGGTCA66C 
06GGA0CAGG CAGAACCTCG GACTCCAGCC CAG06TCTGG CCG6GTCCTT TGACTGTAGC 
A0CA6CAGCC CCAGTGGGGC ACCCCAGCCC CTCTGCATTG AAGACATCAG CCXrTGTGGCA 
CCCCCTCCAG CAGCCCCACC CAGTCCCTTQ CCAGGTCCTG GACCCCTGCT CCAGTACCT6 
AOCCTGCCCT TCTTCGGAGA GATGAATGTG GATGGG6ACT 6GGCCCG6CT T6AGGAGCCC 
AGCCCTGCTG CACCOCCAOA TTACATGOAT ACGC33606CT GTOOCACCTC ATCTTTGCTT 
CGTTCTOSUS AAACCCCTGC TGTATCCOCX: AGGGAATCAC TTOCIGGGGC TGTGGTAGGG 
GCTGGGGCCA CT6CAGAGCC CCCTTACACA GCUfl G G C TG ACTGGACACT GAGGQAGOGQ 
CTGCTGCCAG GCCTTCTCCC TGCTGCCCCT OGAGGCAOCC TCAOCAGCCA GAGCAG06G6 
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OQAGGCAGOO CTTCGTTCCT CCGGCCCCCC TCCACAGCCC CCTCTGCAGG AGGCAGCTAC 3300 

CTCAGCCCTG CTCCAGQAGA CACCAGCAOC TOGGCCAOTG GCCCTGAGAO ATG6CCCGGA 3360 

ACGGAGCAT6 TGGTGACAGT CAGCAAGAGG AGGAACACAT CTGTGGA06A GAACTATGAG 3420 

. TGGGACTCAa AATTCOCTGQ GGACATGQAA TTGCrGQAGA CTTTGCACCT GGGCTTGGCC 3480 

D AGCTCCOGGC TCAGACCTGA AGCTGAGACA GAGCTAGGTG TQAAGACTCC A6AGQAGGGC 3540 

TGCCTCCTGA ACACTGCCCA TGTTACTGGC CCTGAGGCCC GCTGTGCTGC CCTTOGGGAG 3600 

GAATTCCTGG CCTTCCGCOG CXX5CCX5AQAT GCTACTAGGO CTOGGCTACC AGCCTATOGA 3660 

CAGCCAGTCC CCCACCCOGA ACAG6CCACT CTGCTGTGAA CATCCCTAAT GTGAGGCTGT 3720 

. - GAAAAG6CAT ATOQACCTOC AAAGGAOGCC OCCAACCAOA CAOACCIAGT TTCAAACGAG 3780 

iU G6CACTGCCC CTGCCTGCCC CTTTGGTGCC CAGGCACAGA OCXriGATAGT GGGTTTGGGT 3840 

CACCTTGGTA TGGAATGTAT GTGCTGACCC CCTAGGTGAG TCTGGGGATT GGAACAGGGA 3900 

TCTTAGGTCT GCCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCTCTOT GT G TQTO'i d ' 3960 

GTGT6TGAAG TTTTTTACAG GTGAATAAAC AAAOTTTGAA AGAAAAAAAA AAAAAAAAAA 4020 

AAAA 4024 
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Seq ID NO: C94 DNA Sequence 
Nucleic Acid Accession #: NN_006875 
Coding sequencet 166.. 1190 



Seq ID NO: C95 DNA Sequence 

Nucleic Acid Accession #i NM^OOasiO.l 

Coding sequence : 92 . . 1774 



1 11 21 31 41 51 

11)111 

GAATTCGGCA CGAGOGCGCG GCGAATCTCA ACGCTGCGCC GTCTGCGGGC GCTTCCGGGC 60 

CAccswrnrc tctqctttcx: accctggcgc cccccagccc tggctcccca gctgcgctgc 120 

CC0G06GGTC CACX3CCCTGC GGGCTTAGOG GGTTCAOTGG GCTCAATCTO 06CAG0GCCA 180 

CCrCCATGTT GACCAAGCCT CTACAGGG6C CTCC06CX3CC CCCOGGGACC CCCAC6CCGC 240 

CGCCAGGAGG CAAGGATCGG GAAGCGTTCQ AQGCOSAOTA TOGACTOGOC CCCCTCCTGG 300 

OTAAGGGGGG CTTTGGCACC GTCTTCGCAG GACACOQCXTT CACAGATOGA CTCCAGGTGG 360 

CCATCAAAGT GATTCCCCGG AATOGTGTGC TGGGCTGGTC CCCCTTQTCA GACTCAOTCA 420 

CATQCOCACT 06AAGTCGCA CT6CTATGGA AAGTG0GT6C AGGTGGTOOS CACXX^XXSCO 480 

TGATCOSCCT GCTTGACTGG TTTQAQACAC AGGAAOGCTT CATGCTGOTC CTGGAGCGGC 540 

CTTTGCCCQC CCAGGATCTC TTTGACTATA TCACAGAGAA GGGCCCACTO GGTGAAGGCC 600 

CAAGCCGCTG CTTCTTTGGC CAAQTAGTGG CAGCCATCCA GCACTGCCAT TCCCGTGGAG 660 

TTGTCCATCG TGACATCAAG GATGAGAACA TCCTGATA6A CCTAOGCOGT GGCTGTOCCA 720 

^_ AACTCATTQA TTTTGOTTCT OOTGCCCTGC TTCATGAT6A ACCCTAGACT GACTTT6ATG 780 

Dt> GGACAAGOGT GTACAGCCCC CCAGAGTG6A TCTCTOQACA CCAGTACCAT GCACTCCOOO 840 

CCACTCTCTG GTCACTGGGC ATCCTCCTCT ATGACATGGT GTGTGGGGAC ATTCCCTTTG 900 

AGAGGGACCA GGAGATTCTG GAAGCTQAGC TCCACTTCCC AGCCCATCTC TCCCCAGACT 960 

GCTGTGCCCT AATCCGCCGG TGCCTGGCCC CCAAACCTTC TTCCCGACCC TCACTGGAAG 1020 

AGATCCTGCT G6ACCCCTOO ATOCAAAGAC CAGCCQAGQA T6TTACCCCT CAACCXXTTCC 1080 

AAA6GAGGCC CTGCCCCTTT GGCCTGGTCC TTGCTACCCT AAGCCTGGCC TGGCCTGGCC 1140 

TGGCCCCCAA TGGTCAGAAG AGCXSVTCCCA TGGCCATGTC ACAGGGATAG ATGGACATTT 1200 

GTTGACTTGG TTTTACAGGT CATTACCAGT CATTAAAGTC CAGTATTACT AAGGTAAGGQ 1260 

ATTGAGQATC AOGGGTTAGA AGACATAAAC CAAGTTTGCC CAGTTCCCTT CCCAATCCTA 1320 

CAAA6GAGCC TTOCTCCCAO AACCTGTGGT OCCTQATTTT GGAGGQGQAA C T TC TOCVr 13B0 

45 CTCATTTTGC TAAGQAAGTT TATTTTGGTG AAGTTGTTCX: CATTTTGAGC CCCGOOACTC 1440 

TTATTTTGAT GATGTOTCAC CXXACATTGG CACCTCCTAC TACCACCACA CAAACTTAGT 1500 

TCATATGCTT TTACTTGGGC AAGGGTGCTT TCCTTCCAAT ACCCCAGTAG CTTTTATTTT 1560 

A6TAAAGGQA CCCTTTCCCC TAGCCTAGGO TCCCATATTG GGTCAAGCTG CTTACCTGCC 1620 

TCAGCCCAGG ArTTTTTATT TTGGGG6AGG TAATGCCCTO TTGTTACCCC AAGGCTTCTT 1680 

TTTTTTTTTT TTTTTTTTTO GGT6AGG0QA CCCTACTTTO TTATCCCAAG TGCTCTrATT 1740 

CTGGTGAGAA GAACCTTAAT TCCATAATTT GOGAAQQAAT GGAAGATGGA CACCACCX3GA IB 00 

CACCACCAGA CAATAGGATG GQATGGATGG TTTTTTGGGG GATGGGCTAG GGQAAATAAG 1860 

GCTTGCTGTT TGTTTTCCTG GG60GCTCCC TCCAATTTTG CAOATTTTTG CAACXTCCTC 1920 

CTGAGG0QG6 ATTGTCCAAT TACTAAAATC TAAATAATCA G8TATTGTGG GGAOGGQAGT 1980 

TCGAAGTGTO CCCTCCTTTT TTTTCCTGCC TGGATTATTT AAAAAGOCAT GIGTGGAAAG 2040 

CCACTATTTA ATAAAAGTAA TAGAATCAOA AAAAAAAAAA AAAAAAAA 2088 



1 11 21 31 41 51 

I I I ^ 1 I I 

CAOATGOCAO AAOAACACTG TTQCTCTTGO TGOAOGaQCC GAOAGOAATT CAGAGTTAAA 60 

CXTTTGAGTCC CTGCGTCCGT GAQAATTCAO CATOQAAT6T CTCPACTATT TCCTGGGATT 120 

TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCOSCCAAA CQATTTCATG ATGTGCTGGG 180 

CAATGAAAGA CCTTCTGCTT ACATGAGGQA GCACAATCAA TTAAATGGCT GGTCTTCTGA 240 

TGAAAATGAC TGGAAT6AAA AACTCTACCC AGTGTGGAA6 06GGGAGACA TGAGGTGGAA 300 

AAACTCCTGG AAGGGAGGGC GTOTGCAGOC OGTGCTOACC AOTGACTCAC CAOCCCTCGT 360 

/U GG6CTCAAAT ATAACATTT6 OGGTGAACCT GATATTCCCT AGATGCGAAA AGGAAGATGC 420 

CAATGGCAAC ATAOTCTATO AQAAGAACTG CAGAAATCAG GCTGGTTTAT CTGCTGATCC 480 

ATATGTTTAC AACTGGACAG CATGGTCAGA GCACAGTGAC GGCOAAAATO GCACCX3GCCA 540 

AAGCCATCAT AAC36TCTTGC CT6ATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG 600 

ATGGAATTTC ATCTACBTCT TCCACACACT TGOTCAGTAT TTCCAQAAAT TGGGA06ATG 660 

TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 720 

GACTGTCTAC AGAACACATG QACGGGCATA T6TTCCCATC GCACAAGTQA AAGATGTGTA 780 

CGTGGTAACA GATCAGATTC CTQTQTTTGT GACTATGTTC CAGAAGAACQ ATOOAAATTC 840 

ATCCGAOQAA ACCTTCCTCA AAGATCTOCC CATTATQTTT QAT6TCCTGA TTCATQATCC 900 

TA6CCACTTC CTCAATTATT CTACCATTAA CTACAAOTOO AGCTTOSGGO ATAATACTGG 960 

CCTOTTTGTT TCCACCAATC ATACTOTGAA TCACAOOTAT OTOCTCAATG GAACCTTCAG 1020 

CCTTAACCTC ACTGTGAAAQ CTGCAGCACC AGGACCTTGT CXSCX! A CXX3C CACCACCACC 1080 

CAOACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140 

TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGCC CACTTTCAAO CCACCATCAC 1200 

AATTOTAGAG GGAATCTTA6 AG6TTAACAT C3VTCCAGATO ACAQAOGTCC TGATGCCGGT 1260 

1254 



wo 03/042661 



PCT/US02/36810 



5 
10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



GCXyVTGGCCT 
GGAGGTCTGT 
CCCTGTGGAT 
GACGTACTGT 
GATTTCTGTT 
CrCOGTTGGC 
CAAGGAATAC 
TOTCTTTCTC 
ACTCAAAAAC 
TTTTCAGTGC 
TTATTGTTAA 
TTTAQAGATG 
AAA6CAACTT 
TAACTAGTA6 
AQCCTAACCC 
ATQCATAAAG 
TTCAATACAC 
GTGCACACTT 
TGACAAOCTA 
QGACATTTAG 
ATTTCCAAAT 
A0ATGA60TC 
ACTAAAACCA 
AAGTQTGGGA 



OAAAGCTCXX 
ACCATCATTT 
GTGGATGAGA 
GTQAACCTCA 
CCTGACAGAG 
TGCTTGGCCA 
AACCCAATAG 
AACCGTGCAA 
CAAGAATTTA 
CATTGATGTQ 
ATAGATATTG 
GGGAGAGGGA 
AGCAAG6CTT 
QATAOAAACA 
AGGTTAACra 
CCAATGTAGT 
ACTCATGAAC 
GCTAGACTCA 
CrTTGCTTGG 
TTAOTGCTTT 
TTTTGTATAG 

TCTACTATAT 
AGAGACAAAA 



TAATAGACTT 
CTQACCCCAC 
TGTGTCTGCr 
OCCTGGGOGA 

AcccAGCxrrc 

TATTTGTCAC 
AAAATAGTCC 
AAGCOGTOTT 
AAGGAGTTTC 
AGAT6TGCTG 
TGQTTTGQOQ 
TTATACTGCA 
CTTTTCATTA 
CTGTOTC C OO 
CAAGAAGAGG 
CCAGTTTCTA 
TCCTGATGGA 
GAAAAAATAC 
CTQAGT6AA0 
TTATATACCA 
TCGCTGCACA 
CATGOCAACT 
GTTAGACATG 
AAAAAAAAA 



TGTCGTGACC 
CTGCGAGATC 
GACTGTQAGA 
T QACAC AAGC 
GCCTTTAAGG 
TGTGATCTCC 
TGGGAATGTG 
CTTCCOGGGA 
TTAAATTTCO 
GAGTG6CTAT 
AAGTTGAATT 
GGCAGCTTCA 
TTTTTTATQT 
AaAOTAAGGA 
CGGGATACTT 
AQATCATQTT 
ACAATAACAG 
TACTCTCATA 
GAATGATATT 
GGCATQATGC 
TATTTQAAAT 
TGATCAGTAA 
ACATTCTTTT 



TGCCAAGG6A 
ACXXAGAACA 
OGAA CCTTCA 
CTGGCTCTGA 
ATGGCAAACA 
CTCTTGOTOT 
GTCAGAAGCA 
AACCAOGAAA 
ACCTTOTTTC 
TAACCTTTTT 
TTTTATAGQT 
GCCATGTTGT 
TTCACTTATA 
GAGAAGCTAC 
TCAGCTTTCC 
CCAAGCTAAC 
GCCCAAGCCT 
AATGGGTGGO 
CATATATTCA 
TGAGTQACAC 
CATATATTAA 
GGATTTCACC 
TCTCTCCTTC 



GCATTCCCAC 
CAGTCTGCAO 
A TGGGTCTGG 
OGAGCACCCT 
GTGCCCTGAT 
ACAAAAAACA 
AAGGCCTGAG 
AGGATC06CT 
TGAAOCTCAC 
TTCCTAAAGA 
TAAATGTCAT 
GAAACTGATA 
AAGTCTTA6G 
TATTGATTAG 
ATGTAACTGT 
TQAATCXXyiC 
GTGGTATGAT 
AQTATTTTGG 
TTTATTCCAT 
TCTTGTGTAT 
GACTTTCCAA 
TCTGTTTGTA 
CTGAAAAATA 



Seq ID NO: G96 DNA Sequence 

Nucleic Acid Accession fft Eos sequence 

Coding sequences 1.4247 



1 11 21 31 41 

11)11 
ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG 
TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG 
ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA 
CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT 
AAATTTCAGG GTTQGQATAA AACATCATTG GAAAACACAT TCATTCATAA 
ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC 
TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATQTCATC 
GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG 
GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAQ GGAAGTTAAG 
ATTTTGTTTG AGQTTGGQAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT 
GAAAGTGTTA GTCGTTTTGG QAAGCAGGCT GCTTTAGATC CATTCATACT 
CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT CATTGACATC 
ACAQACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA 
GCTGTTTTTT OTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT 
TTACAAAAGA ATTTTGGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT 
ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA 
GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA QACCTOQAGT 
ACCATQATTG AGAAGTTTGC AOTTTTGTAC CAGCAGTTGG ATGQAGAGGA 
CATQAATTTT TGACAGATGO CTATCAAGAC TTGGGTGCTA TTCTCAATAA 
AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA 
AGCGACCAAC TGATT6TCGA CATGCCTACT GATAATCGTG AACTTGATCT 
TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAQAGG GAAAAGACAT 
GCTATTGTQA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA 
ATTTCTACXIA CAACACACTA CAATOGCATA GGGAC3GAAAT ACAAT6AA6C 
CGATCCCCAA CAAGAGGAAG TGAATTCTCT GGAAAGGGTQ ATGTTCCCAA 
AATTCCACTT CXXSACCAOT CACTAAATTA GCCACAGAAA AAGATATTTC 
CAGACXGTQA CTGAACTGCC ACCTCACACT GTGQAAGGTA CTTCAGCCTC 
GGCTCTAAAA CTGTTCTTAO ATCTCCACAT ATQAACTTOT OGGGGACTGC 
AATACAGTTT CTATAAC3W5A ATATGAGGAG GAGAGTTTAT TGACCAGTTT 
ACTGGAOCTG AAGATTCTTC AGGCTCCAQT CCX36CAACTT CTGCTATCCC 
GAGAACATAT CCCAAGGGTA TATATTTTCC TCOSAAAACC CAGAGACAAT 
GTCCTTATAC CAGAATCTGC TAGAAATGCT TCXmAGATT CAACTTCATC 
GAATCAGTAA AGGATCCTTC TATGGAGGGA AATQTGTGGT TTCCTAGCTC 
ACAGCACAGC CCX3ATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA 
ATACGTGTTG ATGAATCTOA GAAGACAACC AAGTCCTTTT CTGCAGOCCC 
CAGGGTCCCT CAQTTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC 
ACTGAGGTAA CACCTCATGC TTTTACCCX» TCCTCCAQAC AACAGGATTT 
GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG 
CATGAGTCTC GTATTGGTCT AGCTGAGGGG TTGGAATGCG AGAAGAAGGC 
CTTGTGATCX3 TGTCAQCCCT GACTTTTATC TGTCTAGTGG TTCTTOTGGG 
TACTGGAGGA AAT6CTTCCA GACT6GACAC TTTTACrTAG AGGACAGTAC 
GTTATATCCA CACCTCCAAC ACCTATCTTT CCAATTTCAQ ATQATOTOGQ 
ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA GTAGTGGGTT 
TTTGAGGAAG TGCAQAOCTG TACTOTTOAC TTAGQTATTA CAOCAQACAG 
CCAGACAACA AGCACAAGAA TCGATACATA AATATCOTTG CCTATGATCA 
AAGCTAGCAC AGCTTGCTGA AAAGGATGGC AAACTGACTG ATTATATCAA 
GTTGATGGCT ACAACAOAOC AAAAGCTTAT ATTGCTGCTC AAGQCCCACT 
GCTGAAQATT TCTGGAGAAT GATATGGGAA CATAATGTGG AAGTTATTGT 
AACCTOGTGG AGAAAGGAAG GAQAAAATGT GATCAGTACT GGCCTGCCGA 
GAGTACGGGA ACTTTCTGGT CACTCAGAAG AGTGTGCAAG TGCTTGCCTA 
AGGAATTTTA CTCTAAGAAA CACAAAAATA AAAAAGGGCT COCAGAAAGG 
GGAOGTGTGO TCACACAGTA TCACTACAOG CAGTGGCCTG ACATGGGAGT 
T C CCTGCCAG TGCIOAOCTT TOTGAGAAAO GGAGCCTATQ GCAAOGOCCA 



1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
ISOO 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2669 



51 
I 

CCGCCTGGAT 

CTGGTCCTAT 

TAGCCCAAAA 

TAAGAAACTT 

CACTGGGAAA 

AGAAATGGTG 

TGATGGATCA 

CTTTGATGCA 

AGCTTTATCC 

TGATGGAGTC 

GTTGAACCTT 

TCCTCXXTTGC 

AAGCCAGTTG 

GATGGACTAC 

TTCCTCATAC 

TGTTCAGGCT 

OGTTTATGAT 

CCAAACCAAG 

TTTGCTACCC 

TGGAAAATAC 

TTTCCCTGAA 

TGAAGAAGGC 

GGAACCCCAG 

CAAQACTAAC 

TACATCTTTA 

CTTGACTTCT 

TTTAAATGAT 

AGAATCCTTA 

CAA6CTTGAT 

ATTCATCTCT 

AACATATOAT 

AGGTTCAGAA 

TACAGACATA 

TTACACTGAG 

AGTQATGTCA 

CTACTTCCCA 

GGTCTCCACG 

TAATAGTAGC 

AGTTATACCC 

TATTCTCATC 

ATCCCCTAOA 

AGCAATTCXa^ 

TACTGAAGAA 

CTCCAACCAC 

TAGCAGGGTT 

TGCCAATTAT 

GAAATCCACA 

CATGATAACA 

TGGGAGTGAG 

TTATACTGTG 

AAGACCCAGT 

ACCAGAGTAC 

TSCAGTGGGG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

3100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 
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CCTGTTQTOJ TCCACTGCAG TX3CTGGAGTT GGAAOAACAG QCACATATAT TGTGCTAGAC 3240 

AGTATGTTGC AGCAGATTCA ACAOGAAGGA ACTCJTCAACA TATTTGGCTT CTTAAAACAC 3300 

ATCCGTTCAC AAAGAAATTA TTTG6TACAA ACTGAGGAGC AATATOTCTT CATTCATGAT 3360 

ACACTGOTTG AGQCCATACT TAGTAAAGAA ACT6A0GTGC TGGACAOTGA TUTTCATGCC 3420 

TAT6TTAATG GACTCCTCAT TCCTG6ACCA GCA06CAAAA CAAAGCTAGA GAAACAATTC 3480 

CAGCTCCTGA GCXIAGTCAAA TATACAGCAG AGTGACTATT CTGCAQCCCT AAAGCAATGC 3S40 

AACAGGGAAA AGAATCGAAC TTCTTCTATC ATCCCTGTGG AAAGATCAAG GGTTGGCATT 3600 

TCATCCCTGA GTGGAGAAOG CACAGACTAC ATCAATGCCT CCTATATCAT GGGCTATTAC 3660 

CAOAOCAATG AATTCATCAT TACXXAGCAC C C l 'CT C Cri' C ATACCATCAA GGATTTCTGG 3720 

AG6ATGATAT GGGACCATAA T6CCCAACTG GTCGTTAT6A TTCCTGATGG OCAAAACATG 3780 

GCAOAAOATG AATTT6TTTA CTGGCCAAAT AAAGATQAGC CTATAAATTG TGAGAGCTTT 3840 

AAGGTCACTC TTATGGCTGA AGAACACAAA TGTCTATCTA ATGAGGAAAA ACTTATAATT 3900 

CAGGACTTTA TCTTAGAAGC TACACAGGAT GATTATGTAC TTGAAGTGAG GCACTTTCAO 3960 

TGTCCTAAAT G6CCAAATCC AGATAQCXXX: ATTAGTAAAA CTTTTGAACT TATAAGTQTT 4020 

ATAAAAGAAG AAGCTGCCAA TAG6GATGG6 CCTATGATTG TTCATGATGA GCATGGAGGA 4080 

QTGACGGCAG GAACTTTCTG TGCTCTGACA ACCCTTATGC ACCAACTAGA AAAAGAAAAT 4140 

TCCGTGGATG TTTACCAGGT AGCCAAGATG ATCAATCTGA TGAGGCCAGG AGTCTTTGCT 4200 

GACATTGAGC AGTATCAGTT TCTCTACAAA 6TGATCCTCA GCCTTOTQAG CACAAGGCAG 4260 

GAAGAGAATC CATCCACCTC TCTGGACAGT AAIGGTOCAG CATTGCCTGA TGGAAATATA 4330 

GCTQAGAOCT TAGAOTCTTT AGTTTAA 4347 

Seq ID NO: C97 DNA Sequence 
Nucleic Acid Accession ff: XM_031379 
Coding sequence i 148.. 7095 



1 11 21 31 41 51 

1 I I 1 I I 

CACACATACX3 CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTOG CTCCCCCTCC CTCTCCACTC TGAGAAOCAQ AGGAGCCGCA 120 

CGGCGAGGGG CCX3CAGACCX5 TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCXX3 CXrTGGATTGG GCTAATGGAT ACTACAQACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAQ 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTQATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGOTT GQGATAAAAC ATCATTGGAA 420 

35 AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TQACTACCGT 460 

6TCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAQ C31TAGTTT7VG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGQAC CGATTTTCAA GTTTTQAGGA AGCAGTCAAA 660 

GGAAAAQGOA AQTTAAGAGC TTTATCCATT TTQTTTGAGG TTGOQACAGA AOAAAATTTG 720 

40 GATTTCAAAG CQATTATTGA TGGAGJPCQAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAQATCCAT TCATACTOTT GAACCTTCTG CCAAACTCAA CTOACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA QACACAGTTG ACTGGATTGT TTTTAAAOAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAQTTCTTAC AATGCAAC3UI 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

45 TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AOTTCAGAAC CA6AAAATGT TCAG6CTGAC CCAGA6AATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CT06AGT06T TTATGATACC AT6ATT6AGA AGTTTGCAGT TTTGTACCAO 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATQGCTA TCAAGACTTG 1260 

^ GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

5U T6CACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTOQACAT GCCTACTGAT 1380 

AATCXTTGAAC TT6ATCTTTT GCCTGAATTA ATTGGAACTG AAGAAATAAT CAAG6AGQAG 1440 

GAAGAGGGAA AAGACATTGA AGAA66CGCT ATT6T6AATC CTGGTAGAGA CA6T6CTACA 1500 

AACCAAATCA GGAAAAAGGA ACXXX»GATT TCTACCAC3A CACACTACAA TCX3CATAGGG 1560 

ACGAAATACA ATGAAGCCRA GACTAACOGA TCCCXZAACAA GAGGAAGT6A ATTCTCTGGA 1620 

55 AA6GGTGATQ TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT 6ACTTCTCAG ACTGTQACT6 AACTGCXACC TCACACTQTG 1740 

GAAGGTACTT CAGCCTCTTT AAAT6AT6GC TCTAAAACXG TTCTTAGATC TOOVCATATG 1800 

AACTTGTGGG GGACTGCAGA ATCX7TAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGQ CTCCAQTCCC 1920 

OO GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAa AATCXGCXA6 AAATGC TT OC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATGCTTCTAT G6A0G6AAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCOG ATGTTQGATC AGGCAGA6AG 2160 

^ AGCTTTCTCC AQACTAATTA CACTGAGATA COTGTTGATG AATCTGAGAA GACAACCAAO 2220 

55 TCCrrrrCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAO TTAGAGATCT GGAAAXOOCa 2280 

CATTATTCTA CCTTTQOCTA CTTCCCAACT OA OGTAA CAC CTGATGCTTT TACCGCATCC 2340 

TCCAOACAAC ACK3ATTTGGT CTCCAOGGTC AAOGTGOTAT ACTCGCAGAC AACCCAACGG 2400 

QTATACAATG GTGAQACACC TCTTCAACCT TCCTACAGTA OTGAAGTCTT TCCTCTAOTC 2460 

^ ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

7U TOGGCCTTGC ATGCTAGGCC TGTATTTCCC AGTGTGGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATQCTGCAGC TTT6CTTCCA Ym iXTXC lM CTTCCTTGAO TAC3IQAATTG 2640 

TTTGGCCATC T6CATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACOQAGAGT 2700 

GATAAGOTQC (XTTGCATGC TTCTCTGCXA GTGQCrGGGO GTGATTTQCT ATTAQAGCCC 2760 

„ AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GAOGCTGGAA 2620 

/5 TTTGGTAOTO AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTOATO CCATGATQCA TGCAOGTTCT TCAOaGCCTG AACCTTCTTA TQCCTTQTCT 2940 

GATAATGA6G GCTCCCAACA CATCTTCACT GTTTCTTACA 6TTCTGCAAT ACCT G T G CAT 3000 

GATTCTGTGG GTQTAACTTA TCAGGGTTCC TTATTTAGOS GCCXrTAGCCA TATACCAATA 3060 

CXTTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTC3V TGCCCTCTCT 3120 

OU GGTGATGGGG AATGCTCrTGO AGCCTCTTCT GATAGTGAAT TTCTTTTAOC TGACACAGAT 3180 

GGGCTGACAO CCCTTAACAT TTCTTCACXTT GTrTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTO GTGATOATAA TAAOGOOCTT TCTAAAAGTG AAATAATATA TGGAAAT6A0 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATG8TTTACC CTTCTGAAAG CACASTCATQ 3360 

CCCAACATGT ATGATAATGT AAATAAOrrO AATGGGTCTT TACAAGAAAC CAVimri-O : 3420 
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ATTTCTAGCA CCAAGGC5CAT GTTTCCAGGO 'i^ClTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGOTGACAC TTOOCTTAAA CCTGTGCTTA 6TGCAAACTC AGAGCCAGCA 3600 

TCCTCXOACC CTGCTTCrAa TGAAATGTTA TCTCCTTCAA CTCAGC TCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAOTATTG CTACAACCTT CCTTTCA6QC TTCTGATGTT 3730 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACX: 37 BO 

CCCAAAGTTQ ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGT6T06CC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTT6ACC ATTTCCTATG CAAGTGAGAA ATATGAACXA 3960 

CTTTTGTTAA AAAGTGAAAG TTCCXyvCCAA GTGGTACCTT CTTTGTACAG TAATOATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTQCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA C3VCTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAArTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATQTTG CCATTACAGC ' I WrVClXXX: C3«AGAGATG GTTCTGTAAC CTCAACAAAO 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCOGGT 4380 

TTAGTGGGTQ GTGGTOAAQA TGQTQACACT GATGATGATG GTGATGATGA TGATQATGAC 4440 

AQAGaTAOTQ ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAOACACC CAGOAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTGAC AAGT6TATCC 4620 

TCAOACAGTC AAACTGGTAT GGACAGAAGT CXrTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AAT6ACATTC AGACTGGTAO TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTQACAA GT6ATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTC3U3ATAG CCTTAATQAO AATOAGACTT CCACAQATTT CAGTTTTOCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

G6ATTCCCAC AGTOCXX»AC ATCATCTGTTi ACTA006AGA ACTCAOAAGT GTTCXIAOaTT 4980 

TCAQAGGCAG AGGCCAGTAA TAGTA6CCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTO 5040 

GAATCCGAQA AGAAGOCAGT TATACCCCTT GTQATOOTOT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGOAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAOTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATQTOGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCA6ATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GA6ACACTGA AAGAGTTTTA OCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAOCT CCAACCACCC AGACAACAAG* 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGQGTTAA GCTA6CACAQ 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TQAAGATTTC 5580 

T0GA6AATGA TATGGQAACA TAATOTGGAA GTTATTGTCA T6ATAACAAA CCTGGTGGAG 5640 

AAAGGAAGGA 6AAAATGTGA TCAGTACTGG CCTGCXX3ATG GGAGTGAGGA GTAC6GGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA 6ACCCAGTGG A0QT6TGGTC 5820 

ACACAOTATC ACTACA06CA GTGGCCTGAC ATGGGA6TAC CAGAGTACTC GCIGCCAGTG 5880 

CTGACCTTTG TGAGAAAOGC AGCCTATGCC AAG06CCATG CAGTGGGGCC TGTTGTOGTC 5940 

CACT6CAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAQATTCAAC ACGAAGQAAC TGTCAACATA TTrGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AQAAATTATT TGGTACAAAC TGAGQAGCAA TATGTCTTCA TTCATGATAC ACT OG TT Q AG 6120 

GCCATACTTA OTAAAGAAAC T6AGGTGCTG GACAGTCATA TTCAT6CCTA TGTTAAT6CA 6180 

CTCCTCATTC CTGQACCAGC AGGCAAAACA AAGCTAGAOA AACAATTCCA GCTCCTGAGC 6240 

CA6TCAAATA TACA6CAGAG TGACTATTCT 6CAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATOGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCX: TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT AOCATCAAGO ATTTCTGGAO GATGATATG6 6480 

GACCATAATO CCCAACTGGT GGTTATGATT GCTOATGGCC AAAACATG6C AOAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTO AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGrG TCCTAAATGG 6720 

CCAAATCCAQ ATAGCCCCAT TAOTAAAACT TrTQAACTTA TAAGTGTTAT AAAAOAAQAA 6780 

6CT0CCAATA GGQATGGGCC TATGATTGTT CATGAT6AGC ATGGAGGAGT GACGGCAG6A 6640 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTA6AAA AAGAAAATTC OQTGQATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCPGA CATTGAGCAG 6960 

TATCAGTTTC TCTAGAAAGT OATCCTCAGC CTTGTOAGCA CAAGGCA6GA AGAGAATCCA 7020 

TCCACCTCTC TGOACAlGTAA TG6TGCAGGA TTGCCTOATO GAAATATAGC TGAGAGCTTA 7080 

GAOTCTTTAG TTTAACACAG AAAGGGGTGG GGG QACTC AC AT CTGAG CAT TGTTTTCCTC 7140 

TTCX:TAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTQATTT CCCATCACCT 7200 

GACAOTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTO AATTTTACAG 7320 

TATTTCTAAG AATG6AATTG TGGTATTTTT TTCT6TATTG ATTTTAACA6 AAAATTTC!AA 7380 

TTTATAOAaQ TTAGGAATTC CAAACTACA6 AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAOTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTO TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCXAGTT CTQTGTAATT 7680 

6TTTAGTTTA ATGA06TAGT TCATTAOCTG GTCTTACTCT ACCAGTTTTC T6ACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

QAAATACCTT CATTTTGAAA GAAGTTTTTA TGAQAATAAC ACCTTACCAA ACATTOTTCA 7850 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 

AAAAAAAAAA AAAAAAAAAA AAAA 7944 



Seq ID NO: C98 DMA Sequence 
^ Nucleic Acid Accession S; NM_002851 
oO Coding sequence: 77.. 4518 ~ 

1 11 21 31 41 SI 

I I I I I 1 

CACACATACQ CA06CAOQAT CTCACTTOGA TCTATACACI G6AGGATTAA AACAAACAAA 60 
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CAAAAAAAAC ATTTCCTTCO CTCCCCCTCC CTCTCCACTC TOAGAAGCAG AGGAGCCGCA 120 
CGGOGAGGGQ CGGCAGACCG TCTC5GAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT G TQTTT G COG CCTGGATTGG GCTAATGGAT ACTACA(2ACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCXrTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
5 AAATATCCAA CATGTAATAQ CCCAAAACAA TCTCCTATCA ATATTOATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA 6AAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATT GGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACOGT 480 
GTCA606GA0 GAGTTTCAGA AATGGTGTTT AAAGCAAOCA AGATAACTTT TCACTGGOGA 540 
AAATGCAATA TGTCATCTGA TGOATCAQAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
10 GAGATGCAAA TCTACT6CTT TGATGOGGAC OGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAOO TTGGGAC AGA AGAAAATTTG 720 
GATTTGAAAO GGATTATTQA TG6A0T0GAA AGTGTTAGTC GrTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTO CCAAACTCAA CTQACAAGTA TTAC ATTTAC 840 
AATGGCTCAT TGACATCTCC TCXTCTGCACA GACACAGTTG ACTGQATTGT TTTTAAAQAT 900 
15 ACAGTTAGCa TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAOTTCTTAC AATGCAACAA 960 
TCTCGTTATO TCATGCTGAT GGACTACTTA CAAAACAATT TTCOAGAGCA ACA GTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAG6AA0 AOATTCATaA AGCAGTTTGT 1080 
AGTTCAGAAC CA6AAAATGT TCAiSGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGQAAAGAC CTOGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
20 CAGTTGGATG GAGA06ACCA AACCAAGCAT GAATTrTTQA CAGATGGCTA TCAAGACTTQ 1260 
GGTGCTATTC TCAATAATTT GCTAGCCAAT ATGAfflTATG TTCTTCAGAT AGTA&CCATA 1320 
TGCACTAATG GCTTATATGG AAAATACA6C GACCAACTGA TTQTCGACAT GCCTACTGAT 13 BO 
AATCCTGAAC TTGATCTTTT CCCTQAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGOOAA AAGACATTGA AQAAGGCX3CT ATTGTGAATC CTGQTAGAGA CAGTGCTACA 1500 
25 AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TQG^TAGGG 1560 
AOSAAATACA ATSAAOCCAA 6ACTAACGGA TCCCCAACAA GAGGAACSTGA ATTCTCTGGA 1620 
AAGG6TGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTA6CC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTOTG 1740 
GAAGGTACTT CA6CCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
30 AACTTGTOGG GGACTGCAGA ATOCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAO 1860 
AGTTTATTGA CCAGTTTCAA OCTTGATACT COAOCTGAAO ATTCTTCAGG CTCCAGTOCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTOAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTOCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAQG TTCAGAAQAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
35 GTGTGGTTTC CTAGCTCTAC AGAC3VTAACA GCACAGCCOG ATGTTGGATC AGGCAGAQAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTQTTGATG AATCTQA6AA GACAACCAAG 2220 
TCCTTTTCTO CAGOCCCAGT GATOTCACAG GGTCCCTCAO TTACAG ATCT GOAAATOCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACX3GTC AACGTGGTAT ACTCGCAGAC AACCCAACOG 2400 
40 GTATACAATG CAGAGGCCAG TAATAGTA6C CATGAGTCTC GTATTGOTCT AG CTGAGG GG 2460 
TTGGAATCCG AGAAQAAGGC AGTTATACCC CTTOTGATOO TGTCAGCCCT GACTTTTATC 2S20 
TGTCTAGTGG TTCTTOTGGQ TATTCTCATC TACXGGAS6A AAT6CTTCCA QACTGC ACAC 2580 
TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACXTTCCAAC ACCTA TCTTT 2640 
CCAATTTCAG ATGATGTCGG AQCAATTCCA ATAAAGCACT TTCCAAAGCA TQTTGCAGAT 2700 
45 TTACATQCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAQTT TTACCAGGAA 2760 
GTCCAGAGCT GTACTaTTOA CTTAGGTATT ACAGCAGACA OCTCCAACCA CCCAOACAAC 2820 
AAQCACAAGA ATCGATACAT AAATATCGTT GCXTTATGATC ATAGCAGGOT TAAO CTAGCA 2880 
CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TQTTGATGGC 2940 
TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AQCTGAAGAT 3000 
50 TTCTGGAGAA TGATATGGGA ACATAATGTG 6AAGTTATTG TCATGATAAC AAACCTCGTG 3060 
GAGAAAGGAA GGAGAAAATO T8ATCAGTAC T O GOCT G COG ATOQGA OTGA GGAGTAOOOO 3120 
AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTOCTTOCCT ATTATACTOT GAGGAATTTT 3180 
ACTCTAAQAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG QAAGACXXaO TGGACGTOTG 3240 
QTCACACAGT ATCACTACAC QCAGTGGCCT GACATGGGAG TACCAGAOTA CTCCCTGCCA 3300 
55 GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCC3U^G0GCC ATGCAG TGGG GCCT GTTOTC 3360 
GTCCACTGCA GTGCTGGAGT TGGAAGAACA GOCACATATA TTGTGCrAGA GAGT^GTTG 3420 
CAGCAGATTC AACAOGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA ^TCCGTTCA 3480 
CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TOVTTCATGA TACACTGGTT 3540 
GAGGCCATAC TTAGTAAAGA AACTGAGGTQ CTGQACAGTC ATATTCATGC CTATGTTAAT 3600 
60 GCACTCCTCA TTCCTGGACC A6CAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTO 3660 
AGCCAGTCAA ATATACAGCA GAGTOACTAT TCTGCAGOCC TAAA6CAATG CAACA ^GA A 3720 
AAQAATCGAA CTTCTTCTAT CATCCCTGTG 6AAAQATC3VA GGGrTGGCAT TTCATCCCTG 3780 
AGTOGAGAAG GCACAGACTA CATC3VATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 
GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGQATTTCTG GAGGATGATA 3900 
65 TGGGACCATA ATOCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT G6CAGAAGAT 3960 
GAATTTOTTT ACTGGCCAAA TAAAOATOAa CCTATAAATT GTGAGAGCTT TAAGGT CACT 4020 
CTTATOGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACT TATA AT TCAGGACTTT 4080 
ATCTTAGAAQ CTACACAGGA TQATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 
TGGCCAAATC C3W3ATAGCCX: CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 
70 GAAGCTGCCA ATAGGGATGG GCCTATGATT OTTCATGATQ AGCATGGAGG AGTGACGGCA 4260 
GGAACTTTCT GTGCTCTGAC AACCCTTATO CAOCAACIAG AAAAAGAAAA TTCOSTGGAT 4320 
GTTTACCAGO TAQCCAAGAT GATCAATCTG ATOAGGCCAO QAGTCTTTGC TOACATTGAG 4380 
CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 
CCATCCACCr CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGQAAATAT AG CTGAGAQC 4500 
75 TTAGAGTCTT TAGTTTAACA CAGAAAQGOG TGGGGGGACT CAC ATCTGA G GATTGTTTTC 4560 
CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTQT TATCTGTTGA TTTCCCATCA 4620 
CCTQACAGTA ACTTTC31TGA CATAGGATTC TGCCX3CCAAA TTTATATCAT TAACAATGTG 4680 
TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATOTTTGA ACTA AAATG A TTGAATTTTA 4740 
_^ CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTQATTTTAA CAGAAAATTT 4800 
80 CAATTTATAG AGGTTAGGAA TTCCAAACTA CAQAAAATGT TTGTTTTTAG TGTCAAATTT 4860 
TTAGCTGTAT TTGTAOCAAT TATCAGGTTT GCTAGAAAXA TAACTTTTAA TAGAOTAGCC 4920 
TGTAAATAAA ACACTCTTOC ATATQATATT CAACATTTTA CAACTGCAGT ATTCAC3CTAA 4980 
AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGOCCTAO T gTCTCCATO OACWATTT 5040 
ATATTTATAA TTOTAOATTT TTATATTTTA CTACTOAGTC AAGTTTTCTA 0TTCTGT6TA 5100 
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ATTCrrTTAOT TTAATGAOrr AGTTCATTAG CTGGTCTTAC TCTACCAOTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAOAAGrTT TTATGAOAAT AACACCTTAC CAAACATTOT 5260 

TCAAATGGTT TTTATCCAAa GAATTGCAAA AATAAATATA AATATT6CCA TTAAAAAAAA 5340 

AAAAAAAAAA AAAAAAAAAA AAAAAAA 5367 

Seq ID NO I C99 DNA Sequence 

Nucleic Acid Accession 8: Eos sequence 

coding sequence t 501..4S14 

1 11 21 31 41 51 

I 1 I 1 I 1 

CACACATAOG CACGCACX5AT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTT06 CTCCCOCTOC CTCTCCACTC TGAOAAGCAG AGGAGCOGCA 120 

15 0G6CGAGGG0 CCGCAGACOG TCTG6AAAT6 OQAATCCTAA AOO GT TTCCT GGCTTGCATT 180 

CAGCTCCTCT GTGTTT G CCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTOTTGAAG AQATTGGCTG GTCCTATACA GGAGGACTGA ATCAAAAAAT TGGGOAAAGA 300 

AATATCCAAC ATGTAATAGC GCAAAACAAT CTCXn>ATCAA TATTGATGAA GATCTTACAC 360 

_ AA6TAAATGT GAATCTTAAO AAACTTAAAT TTCAGGGTTG GQATAAAACA TCATTQQAAA 420 

ZU ACACATTCAT TCATAACACT GOGAAAACAG TGGAAATTAA TCTCACTAAT 6ACTACCGTG 460 

TCA6C6GAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 

AATQCAATAT GTCATCTQAT GGATCAQAGC ATAGTTTAGA AGQACAAAAA TTTCCACTTG 600 

AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 

GAAAA6GGAA GTTAAQAGCT TTATCCATTT TGTTTGAGGT TGGGA CAGAA GAAAATTTGO 720 

25 ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCXa TTTTGGGAAG CAGGCTGCTT 780 

TAGATCCATT CATACTGTTO AACCTTCT6C CAAACTC3VAC TGACAAGTAT TACATTTACA 840 

ATGGCTCATT GACATCTCCT CCCTGCACAQ ACACAGTTQA CTGGATTGTT TTTAAAGATA 900 

CAGTTAOCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 

. - CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TGGAGAGCAA CA6TACAAGT 1020 

30 TCTCTAGACA GGTGTTTTCC TCATACACIG 6AAAGGAAGA GATTCATGAA GCA6TTTGTA 1080 

GTTCAGAACC AGAAAATGTT CAGGCT6ACC CA6AGAATTA TACCAGCCTT CTTGTTACAT 1140 

GGGAAAGACC TCGAGTCGTT TATGATACXSV TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 

AGTTGGATQG AQAGQACCAA ACCAAGCATO AATTTTTGAC AGATGGCTAT CAAOACTTGO 1260 

6TGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCA6ATA GTAGCCATAT 1320 

35 GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 

ATCCTGAACT TGATCTTTTC CCTGAATTAA ITGGAACTGA AGAAATAATC AAGGAGGAGG 1440 

AAGAGGGAAA AGACATTGAA GAAGGC^GCTA TTGTGAATCC TGGTAGA6AC AGTGCTACAA 1500 

ACCAAATCAG GAAAAAGGAA CCCCA6ATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 

OQAAATACAA TGAAGCCAAG ACTAACOGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 

40 AGG6TGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCA6TCACT AAATTAOCCA 1680 

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGT6ACTGA ACTGCCACCT CACACTGTGG 1740 

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 

45 CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCQ 1980 

AAAACCCAGA GACAATAACA TATQATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCOG 2040 

AAGATTCAAC TTCATCAGGT TCAOAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAAT6 2100 

TGTGOTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 

- GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 

50 CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCA6T TACAGATCTG GAAATGCCAC 2280 

ATTATTCTAC CTTTGCXITAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 

CCAQACAACA GGATTTGQTC TCCACXSOTCA ACGTGOTATA CTCGCAOACA ACCCAACCX3Q 2400 

TATACAATGA GGCCAGTAAT AOTAGOCATG AGTCTCX3TAT TGGTCTAGCT GAGGGGTTGG 2460 

AATCOGAGAA GAAGGCAQTT ATACCCCTTG TOATCGTOTC AGCCCTQACT TTTATCTGTC 2520 

55 TAGTGGTTCT TGTGGGTATT CTCATCTACT G6AGGAAATG CTTCCAGACT GCACACTTTT 2580 

ACTTAGAaGA CAOTACATCC CCTA6AGTTA TATGCACAGC TCCAACACCT ATCTTTCCAA 2640 

TTTCAGATGA TGTCGGAGGA ATTCCAATAA A6CACTTTCC AAAGCAT6TT GCAGATTTAC 2700 

ATGCAAGTAG TGGGTTTACT GAAGAATTTG A6ACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 

AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 

00 ACAAGAATG6 ATACATAAAT ATGGTTGOCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 

TTGCT6AAAA GGATGGCAAA CTQACIQATT ATATCAATGC C3iATTATQTT GATOOCTACA 2940 

ACAQACCAAA AGCTTATATT GCTGCCCAAG GCX7CACTGAA ATCGACAGCt GAAGATTTCT 3000 

GGAGAATGAT ATGGGAACAT AATGTGGAA6 TTATTGTCAT GATAACAAAC CTCGTGGAQA 3060 

- AAG6AAGGAG AAAATGTGAT CAGTACTGGC CT6CCQATGG GAGTGAG6AG TACGGGAACT 3120 

05 TTCrOGTCAC TCAOAAGAGT GTGCAAGTGC TTGOCTATTA TACTGTGAGG AATTTTACTC 3180 
TAAGAAACAC AAAAATAAAA AAG06CTCCC AGAAAOQAAG ACCCACrTGGA OGTGTGOTCA 3240 
CACAGTATCA CTACAOGCAG TGGOCTGACA TGGQAGTACC AGAGTACTCC CT6CCAGTGC 3300 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGOGCCATGC AGTGGGGCCT GTTGTOGTCC 3360 
ACTGCAGTGC TGGAGTTGGA A6AACAGGCA CATATATTGT GCTAGACAQT ATGTTGCAGC 3420 
70 AGATTCAACSl CXSAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC 06TTCACAAA 3480 
GAAATTATT7 GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CIG6TTGA0G 3540 
CCATACTTAG TAAAGAAACT GAG G T GC T G G ACAGTCATAT TCAIGCCTAT GTTAATGCAC 3600 
TCCTCATrcC TGGACCAGCA GGCAAAACAA AGCTAGAOAA ACAATTCCAG CTCCTGAGCC 3660 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 
75 ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 
6AGAAGGCAC AGACTACATC AATGCCTCCT ATATCAIGGO CTATTACCA6 AGCAATGAAT 3840 
TCATCATTAC CCA6CACXX?r CTCCTTCATA CCATCAAGQA TTTCTGGAGG ATOATATGGG 3900 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 
TTGTTTACTG GCCAAATAAA GATQAGCCTA TAAATTQTGA GAGCTTTAAG GTCACTCTTA 4020 
oO TOOCTGAAGA ACACAAATGT CTATCTAATO AGGAAAAACT TATAATTCAG QACTTTATCT 4080 
TAGAAOCTAC ACAGOATOAT TATGTACITO AAQT6AGGCA CTTTCAGTGT CCTAAATQOC 4140 
CAAATCCAGA TA6CCCCATT AGTAAAACTT TTOAACTTAT AAGT6TTATA AAAGAAGAAO 4200 
CTOGCAATAG GGATG6GCCT ATGATTGTTC ATGATGAGCA TGGAGGAOTG AOSGCAGGAA 4260 
CTTTCTGTGC TCTGACAACC CTTATQCACC AACTAGAAAA ACAAAATTCC QTGGATGTTT 4320 



1259 



wo 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



ACCAGGTAGC 
ATCAGTTTCT 
CCACCTCTCT 
AOTCTTTAGT 
TCCTAAAATT 
ACAGTAACTT 
TTTTTGCAAG 
ATTTCTAAGA 
TTATAGAGGT 
CTGTATTTGT 
AATAAAACAC 
GAAATAATCr 
TTATAATTGT 
TTTAGTTTAA 
TGTGTTACCT 
AAATACCTTC 
ATGGTTTTTA 
AAAAAAAAAA 



CAAGATCATC 
CTACAAAGTG 
GGACAGTAAT 
TTAACACAGI^ 
AGGCAOGAAA 
TCATGACATA 
ACTTGTAATT 
ATGGAATTGT 
TAGGAATTCC 
AGCAATTATC 
TCTTOCATAT 
GTTACTTATT 
AGATTTTTAT 
TGACGTAGTT 
AA6TCATTAA 
ATTTTGAAAG 
TCCAAGGAAT 
AAAAAAAAAA 



AATCTGATGA 
ATCCTCAGCC 
GGTGCAGCAT 
AAGGGGTG G G 
ATCAGTCTAG 
GGATTCTGCC 
TACTTATTAT 
6GTATTTTTT 
AAACTACAGA 
AGGTTTGCTA 
GATATTCAAC 
GTAAATACTG 
ATTTTACTAC 
CATTAGCTGG 
CTTTGTTTCA 
AAGTTTTTAT 
TGCAAAAATA 
AAA 



GGCCAGOAQT 
TTGTGAGCAC 
TGCCTGATGG 
GGGACTCA Ca 
TTCTGTTATC 
GCCAAATTTA 
GTTTGAACTA 
TCTGTATTGA 
AAATGTTTGT 
QAAATATAAC 
ATTTTACAAC 
CXXTAGTGTC 
TGA6TCAAGT 
TCTTACTCTA 
GCATGTAATT 
GAGAATAACA 
AATATAAATA 



CTTTGCTGAC 
AAGGCAGGAA 
AAATATAGCT 
TCT6A6CATT 
TGTTGATTTC 
TATCATTAAC 
AAATGATTGA 
TTTTAACAQA 
TTTTAGTGTC 

ttttaataca 
tgcagtattc 
tcx:atggacc 

TTTCTAGTTC 
CCAGTTTTCT 
TTAACTTTTQ 
CCTTAOCAAA 
TTGCCATTAA 



ATTGAGCAGT 
GAGAATCCAT 
GAGAGCTTAG 
GTTTTCCTCT 
CCATCA^rrG 
AAT6TGTGCC 
ATTTTACAGT 
AAATTTCAAT 
AAATTTTTAG 
GTAGCCTGTA 
ACXTTAAAGTA 
AAATTTATAT 
TGTGTAATTG 
GACATTGTAT 
TGGAAAATAG 
CATTGTTCAA 
AAAAAAAAAA 



Seq ID NO: ClOO DNA Sequence 

Nucleic Acid Accession #i Eos sequence 

Coding sequence* 148.. 4362 



4380 
4440 
4500 
4560 
4620 
46B0 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5363 



1 11 31 31 41 51 

I I I 11 I 

CACACATACG CAC6CAGQAT CTCACTTCX3A TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

OGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT OGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGQ GCTAATGGAT ACTACAGACA ACAGAQAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAQ 300 

AAATATCCAA CATGTAATAO CGCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTG6AA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTOGAAAT^A ATCTCACTAA TOACTACCGT 480 

GTCAGCGGAG GAGTTTCA6A AATGGTGTTT AAAGCAAGCA AQATAACTTT TCACTGG6GA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

OAGATGGAAA TCTACTOCTT TQATGGGGAC GQATTTTCAA GTTTTCSAOGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAOC TTTATGCATT TTGTTTGAG6 TTGGGACA6A AGAAAATTTQ 720 

GATTTGAAAG CGATTATTGA TGGAGTOGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTQ CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTQ ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAO CCAGTTGGCT GTTTTTTTOT GAAGTTCTTA CAATGCAACA 960 

ATCTGGTTAT GTCATGCIGA TGQACTACTT ACAAAACAA7 TTTOSASAGC AACAGTACAA 1020 

GTTCTCTAGA CAGGTGTTTT GCTCATACAC TGGAAAGGAA GAGATTCATG AAGCAGTTTG 1080 

TAGTTCAGAA CCAGAAAATG TTCAGGCTGA CCCAGAGAAT TATACCAGCC TTCTTGTTAC 1140 

ATGGGAAAGA CCTOQAGTCX^ TTTATGATAC CATGATT6AG AAGTTTGCAG TTTTGTACCA 1200 

6CAGTTGGAT GGAGAOGACC AAAGCAAGCA TGAATTTTTO ACAGATOGCT ATCAAGACTT 1260 

G6GTGCTATT CTCAATAATT T6CTACCCAA TATGAGTTAT GTTCTTCAGA TAQTAGCCAT 1320 

ATGCACTAAT GGCTTATATG QAAAATACAG CGACCAACTG ATTGTCGACA TGCCTACTGA 1380 

TAATCCTGAA CTTGATCTTT TCCCTGAATT AATTGGAACT GAAGAAATAA TCAAGGAGGA 1440 

GGAA6AGGGA AAAGACATTO AAGAAGGCGC TATTGTGAAT CCTGGTAGAG ACAGTGCTAC 1500 

AAACGAAATC AGGAAAAAGG AAGCCCAGAT TTCTACCACA ACACACTACA ATG6CATAG6 1560 

GA06AAATAC AAT6AAGCCA AGACTAACCG ATCCCCAACA A6AG6AAGTG AATTCTCTGG 1620 

AAAGGGTGAT GTTCCCAATA CATCTTTAAA TTCCACTTCC CAACCAGTCA CTAAATTAGC 1680 

CACAGAAAAA OATATTTCCT TGACTTCTCA GACTGTGACT GAACTGCCAC CTCACACTGT 1740 

GGAAGGTACT TCAGCCTCTT TAAATGATGG CTCTAAAACT GTTCTTAGAT CTOCACATAT 1800 

OA ACTTG TOG OGGACTGCAG AATGCTTAAA TACA6TTTCT ATAACAGAAT ATGAGGAG6A 1860 

GAGTTTATTG ACCAGTTTCA AGCTTGATAC TGGAGCTGAA GATTCTTCAG GCT0CA6TCC 1920 

OGCAACTTCT GCTATCCCAT TCATCTCTGA GAACATATCC CAAQGGTATA TATTTTCCTC 1980 

CGAAAACCCA GAGACAATAA CATATGATGT CCTTATACCA GAATCTOCTA QAAATGCTTC 2040 

06AAGATTCA ACTTCATCAG 6TTCAGAAGA ATCACTAAAG GATCCTTCTA TGGAGGGAAA 2100 

w iiyiwrrr cctaoctcta cagacataac agcagagcgc gatqttggat caggcagaga 2160 

GAGCTTTCTC CAGACTAATT ACACTGAGAT AOQTGTTGAT GAATCTGAGA AGACAACGAA 2220 

GTCCTTTTCT GCAGGCCCAO TGATGTCACA GGGTCCCTCA GTTAGAGATC TGGAAATGCC 2280 

ACATTATTCT ACCTTTGCCT ACTTCCCAAC TGAGGTAACA CCTCATGCTT TTACCCCATC 2340 

CTCCAGACAA CAGGATTTGG TCTCCACGGT CAAOGTGGTA TACTOGCAGA CAACCCAACC 2400 

GGTATACAAT GAOGCCAOTA ATAGTA GCCA TGA6TCTG6T ATTGGTCTAG CTGAGG GQTT 2460 

GGAATCOGAG AAGAAGGCAG TTATACCCCT TGTGATG6T6 TCAGCGCTGA CTTTTATCTG 2 520 

TCTAGTGGTT CTTGTGGGTA TTCTCATCTA CTGGAGGAAA TGCTTCCAGA CTGCACACTT 2580 

TTACTTAGAG GACAGTACAT CCCCTAGAGT TATATCCACA CCTCCAACAC CTATCTTTCC 2640 

AATTTCAGAT QATGTCGGAG CAATTCCAAT AAAGCACTTT CCAAAGCATG TTGCAGATTT 2700 

ACAT6CAAGT AGTGG6TTTA CTGAA6AATT TGAGACACTG AAAGAOTTTT ACCAG6AAGT 2760 

GCAQAGCTGT ACTGTTGACT TAGGTATTAC AGCAGACAGC TCCAACCACC CAGACAACAA 2820 

GCACAAGAAT CGATACATAA ATATOGTTGC CTATGATCAT AGCAGGGTTA AGCTAGCACA 2880 

GCTTGCTGAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 

AACAOACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGAXTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCQTGGAG 3060 

AAAGGAAOGA GAAAATGT6A TCAGTACTGO CCTGC06AT6 GGA6TGAGGA GTACGGQAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACT6TGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG AOGTOTGGTC 3240 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CIGACCTTTO TGAGAAAGGC AG0CTAT6CC AAGCGCCATO CAGTG6GGCC T G TTOTOGTC 3360 

CACTGCAGT6 CTGGAGTTOO AAGAACAGGC ACATATATT6 TGCTA6ACA0 TATGTTGCAO 3420 

CAGATTCAAC A06AAGGAAC T6TCAACATA TTTG6CTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGSTACAAAC TGAGOAGCAA TATGTCTTCA TrCATOATAC ACTGGTTQAG 3540 
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GCXaVTACTTA GTAAAGAAAC TGAGQTGCTG QACAQTCATA TTCATGCCTA TGTTAATGCA '3600 

CTCCTCATTC CTGGACC3VGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTG6GCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACIG ATOCTCCTAC CTCAGCCTCC OGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCA6A6TGA CTATTCT6CA GCCCTAAAGC AATGCAACAO GGAAAAOAAT 3640 

CGAACTTCTT CTATCATCCC TGT6GAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCAT6GGCT ATTACCAQAG CAATGAATTC 3960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 40 80 

GTTTACTG6C CAAATAAAGA TGAGCCTATA AATTGTGAQA QCTTTAAGOT CACTCTTATQ 4140 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGA6GCACT TTCAGT6TCC TAAATGGCCA 4260 

AATCCAGATA GCCCCATTAG TAAAACTTTT QAACTTATAA QTGTTATAAA AGAA6AAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATO 6AGGAGTGAC GGCAGGAACT 4380 

TTCTQTGCTC TQACAACCCT TATGCACCAA CTAGAAAAAO AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCXIA AGATGATCAA TCTGATQAGG CCAGGAQTCT TTGCTGACAT TGAGCA6TAT 4500 

CAGTTTCTCT ACAAAGTOAT CCTCAGCCTT 6TGGGCACAA GQCAGQAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAOCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAO GCAGGAAAAT CA6TCTAGTT CTGTTATCTG TT6ATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGO ATTCT6CCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TT8TAATTTA CTTATTATGT TTGAACTAAA ATOATTQAAT TTTACAGTAT AB'SO 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTQATT TTAACAQAAA ATTTCAATTT 4920 

ATAGAG6TTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAOTOTCAA ATTTTTAOCT 4980 

GTATTTGTAG CAATTATCAG GTTT6CTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCGATAT6A TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTA6A 5100 

AATAA7CTGT TACTTATTOT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 

ATAATTGTAQ ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 

TAGTTTAATQ ACGTAQTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 

TGTTACCTAA GTCATTAACT TTGTTTCMC ATGTAATTTT AACTTTTGTX5 GAAAATAGAA 5340 

ATACCTTCAT TTTQAAAGAA GTTTTTATGA QAATAAGACC TTACCAAACA TTGTTCAAAT 54 00 

GOTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT 6CCATTAAAA AAAAAAAAAA 5460 

AAAAAAAAAA AAAAAAAAAA A 5481 

Seq ID NO: CI 01 DNA Sequence 

Nucleic Acid Accession ft; Eos sequence 

Coding sequence t 1 . . 3 3 4 0 

1 11 21 31 41 SI 

I I I I I I 

ATGCGAATCC TAAAGC6TTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTT6 CCGCCTGGAT 60 

TGGGCTAATG GATACTACAG ACAACA6AQA AAACTTGTTG AAGAGATTGO CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAQAAATATC CAACATGTAA TA6CCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 

AAATTTCAGG GTTGGGATAA AACATCATTG QAAAACACAT TCATTCATAA CACTGGGAAA 300 

ACAGTGGAAA TTAATCTCAC TAATGACTAC GGTGTCAGGG GAGGAGTTTC AGAAATGGTG 360 

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAAT6CA ATATGTCATC T6ATGGATCA 420 

GAOCATAGTT TAGAAGGACA AAAATTTCCA CTTGAOATOC AAATCTACTG CTTTGATGCG 480 

GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAQGAAAAG GGAAGTTAAG AGCTTTATCC 540 

ATTrrOTTTG AGGTTQGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TQATGGAGTC 600 

GAAAGT6TTA GTOGTTXTGG GAAGCAGGCT OCTTTAGATC CATTCATACT GTTGAACCTT 660 

CTGC^AAACT CAACTGACAA OTATTACATT TACAATGGCT CATTGACATC TCCTCCCT6C 720 

ACAGACACA6 TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780 

OCTGTTTTTT GTGAAQTTCT TACAATGCAA CAATCTGOTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAG6TGTT TTCCTCATAC 900 

ACTGGAAAGO AA6A6ATTCA TGAAGCAOTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCr 960 

GACCCA6A6A ATTATACCAG CCn ' Ci ' l 'Crr T ACATGGGAAA GACCTC6AGT OGTTTATGAT 1020 

ACCATGATTG AGAA6TTT6C AGTTTTGTAC CAGCAGTTGG ATGGAGA6GA CCAAACCAAO 1080 

CATGAATTTT TGACAGATGG CTATCAAQAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT AT G TTCTTCA QATA6TAGCC ATATGCACTA ATG6CTTATA TGGAAAATAC 1200 

A0G6ACCAAC TGATTGTGQA CAT6GCTACT OATAATCCTQ AGGGCAQTAA TAQTAOCCAT 1260 

GAGTCTCGTA TTGGTCTAGC TGAGGGGTTO GAATCCGAGA AGAAGGa\GT TATACCCCTT 1320 

OTGATCGTGT CAGCCCTQAC TTTTATCTGT CTAGT6GTTC TTGTGGGTAT TCTCATCTAC 1380 

TGGAGGAAAT GCTTCCAGAC TGCACACTTT TACTTAGAGG ACAGTACATC CCCTAGAGTT 1440 

ATATCCACAC CTCCAACACC TATGTTTCCA ATTTCAGATG ATGTCGGAGC AATTCCAATA 1500 

AAGCACTT7C CAAACCATGT TGCA6ATTTA CATGCAAOTA GTGGGTTTAC TOAAGAATTT 1560 

GAGACACTGA AAGAGTTTTA CCAGGAA6TG CAGAGCT6TA CTGTTGACTT A6GTATTACA 1620 

GCAGACAGCT CCAACCACCC AGACAACAAG CACAAGAATC GATACATAAA TATCGTTGCC 1680 

TATGATCATA GCAGGGTTAA GCTAGCACAQ CTTGCTGAAA AGGATGGCAA ACTGACTGAT 1740 

TATATCAATG CCAATTATGT TOATGGCTAC AACA6ACCAA AAGCTTATAT TGCTGCCCAA 1800 

GGCCCACTGA AATCCACAGC TGAAGAtTTC TGGAGAAT6A TAT6GGAACA TAATGTG6AA 1860 

GTTATTGTCA TGATAACAAA CCTCGTGGAa AAAGGAAGGA GAAAATGTGA TCAGTACTGG 1920 

CCTGCCGATG GGAGTGAGGA GTAOGGGAAC TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG 1980 

CTTGCCTATT ATACTGTGAG GAATTTTACT CTAAGAAACA CAAAAATAAA AAAGGGCTCC 2040 

CAGAAAGGAA GACCCAGTGO ACGTGTGGTC ACACAGTATC ACTACACGCA GTGGCCT6AC 2100 

ATGGGAGTAC CA6AGTACTC CCTGCCAGTQ CTGACCTTTO TGAGAAAG6C AGGCTAT6CC 2160 

AAG06CCATG CAOTGGGGCC TGTTGT08TC CACTGCAGTQ CrOGAOTTGO AAOAACAGGC 2220 

ACATATATTG TGCTAGACAG TATGTTGCAG CAGATTCAAC AOGAAGGAAC T6TCAACATA 2280 

TTTGGCTTCT TAAAACACAT CCGTTCACAA AGAAATTATT TGOTACAAAC TQAQQAGCAA 2340 

TATGTCTTCA TTCATGATAC ACTGGTT6AG GOCATACTTA GTAAAGAAAC TGAGGT0CTX3 2400 

GACAGTCATA TTCATGCCTA TQTTAATGCA CTCCTCATTC CTGGAOCAQC AGGCAAAACA 2460 

AAGCTAGAGA AACAATTCCA GCTCCT8AGC CAOTCAAATA TACAGCAGAO TOACTATTCT 2520 

GCA6GCCTAA AGCAAT3CAA CAGGOAAAAO AATOGAACTT CTTCTATCAT COCTGTG G AA 2580 

AGATCAAGGG TTGGCATTTC ATCOCTGAGT 0GA6AAGGCA CAGACTACAT CAATGCCTCC 2640 
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TATATCATGQ GCTATTAOCA GAOCAATGAA TTCATCATTA CTCAGCACCC TCTCCTTCAT 2700 

ACCATCAAGG ATTTCTGGAG GATGATATGG GACCATAATG CCCAACP6GT GGTTATGATT 2760 

CCTGATGGCC AAAACATG6C AGAA0AT6AA TTT6TTTACT GGCCAAATAA AGATGAGCCT 2B20 

ATAAATTGTG AGAGCTTTAA GGTCACTCTT ATG6CTGAA0 AACACAAATG TCTATCTAAT 2880 

GAGOAAAAAC TTATAATTCA G6ACTTTATC TTAGAAGCTA CACAGGATGA TTATGTACTT 3940 

GAAGTGAG6C ACTTTCAOTG TCCTAAATGG CCAAATCCAG ATAGCCGCAT TAGTAAAACT 3000 

TTTGAACTTA TAAGTGTTAT AAAAGAAOAA GCTGCCAATA GGGATGGGCC TATGATTGTT 3060 

CATGATGAOC ATGGAGGAOT QACGOCAGGA ACTTTCTOTQ CTCTGACAAC CXrTTATGCAC 3120 

CAACTAGAAA AAGAAAATTC 06TGGATGTT TACCAGGTAG GGAAQATGAT CAATCTGATG 3180 

AGGCCAGGA6 TCTTTGCTGA CATTGAGCAG TATCA6TTTC TCTACAAAGT GATOCTCAGC 3240 

CTTGTGAGCA CAAGGCAGGA AGA6AATCCA TCTACCTCTC TGQACA6TAA TGGTGCAGCA 3300 

TTG C CTQATG 6AAATATAGC TGAGA6CTTA GAOTCTTTAG 3340 

Seq ZD NO: C102 DMA Sequence 

Nucleic Acid Accession Eos sequence 

Coding sequence: 1..4480 

1 11 ai 31 41 51 

I I 1 I I I 

ATOOOAATCC TAAAGOGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 60 

TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTO AAGAGATTQ6 CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAA6TAA ATQTGAATCT TAAGAAACTT 240 

AAATTTCAGO GTTGG6ATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 300 

ACAOTGGAAA TTAATCTCAC TAAT6ACTAC OGTGTCAGCQ GAGGAGTTTC AGAAATGGTG 360 

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420 

GAQCATAGTT TAGAAGGAC3V AAAATTTCCA CTTQAGATGC AAATCTACTG CTTTGATGCA 480 

GACCGATTTT CAAGTTTT6A GGAAGCAGTC AAAGGAAAAG GGAA6TTAAG AGCTTTATCC 540 

ATTTTGTTTO AGGTT0G6AC AGAAGAAAAT TTGGATTrCA AAGOGATTAT TGATGOAOTC 600 

GAAAGTGTTA GTOGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 660 

CTOCXJU\ACT CAACTQACAA GTATTACATT TACAATGGTT CATTGACATC TCCTCCCTGC 720 

ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAQCCAGTTG 780 

GCTGTTTTTT GTGAAGTTCT TACAA7GCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTGGAGA GCAACAGTAC AAGTTCTCTA GACAGOTGTT TTCCTCATAC 900 

ACTGGAAAGO AA6AGATTCA TGAAGCAGTT TGTAOTTCAG AACCAGAAAA TGTTCAGGCT 960 

GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT OGTTTATGAT 1020 

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCRAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA 6ATAGTAGCC ATATGCACTA A1QGCTTATA TGQAAAATAC 1200 

AGCGACCAAC TGATTOTOQA CATGCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 1260 

TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAAGACAT TGAAGAAG6C 1320 

GCTATTGTGA ATCCTQGTAG AGACAGTQCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG 1380 

ATTTCTACCA CAACACACTA CAATOGCATA GGGA06AAAT ACAATGAAGC CAAGACTAAC 1440 

OQATCCCCAA CAAGAGGAAG TGAATTCTCT GGAAAOG6TQ ATGTTCCCAA TACATCTTTA 1500 

AATTCCACTT CCCAACCAGT CACTAAATTA GCCACAGAAA AA6ATATTTC CTTGACTTCT 1560 
CAGACTGTGA CTGAACTGCC ACCTCAC3VCT GTGGAAGGTA CTTCAGCCTC TTTAAATGAT 1620 
GGCTCTAAAA CTGTTCTTAG ATCTCCACAT ATGAACTTGT CX3GGGACTGC AGAATCCTTA 1680 
AATACAOTTT CTATAACAGA ATATGAGGAG GAGAGTTTAT TGACCA6TTT CAAOCTTGAT 1740 
ACTGGAGCTG AAGATTCTTC AGGCTCCAOT OCGGCAACTT CTGCTATCGC ATTCATCTCT 1800 
GAGAACATAT CCCAAGGGTA TATATTTTCC TCGGAAAACC CA6AGACAAT AACATAT6AT 1860 
GTCCTTATAC CAGAATCTGC TAGAAATGCT TCOQAAGATT CAACTTCATC AGGTTCAGAA 1920 
GAATCACTAA AGGATCX:TTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC TACAGACATA 1980 
ACAGCACAGC COGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA TTACACTGAG 2040 
ATACGTGTTG ATGAATCTGA GAAGACAACC AA9T0CTTTT CTGCAGGCCC A6TGATGTCA 2100 
CAGGGTCCCT CAGTTACAGA TCTGGAAATG CCACATTATT CTAOCTTTGC CTACTTCCCA 2160 
ACTGAGGTAA CACCTCATGC TTTTATCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220 
GTCAAOGTGG TATACT06CA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAQTAGC 2280 
CATGA6TCTC GTATTGGTCT AGCTGAGGGG TTGGAATCOG AQAAGAAGGC AGTTATACCC 2340 
CTTOTGATCG TGTCAGCCCT GACTTTTATC TGTCTAGTGO TTCTTGTGGQ TATTCTCATC 2400 
TACTGGAGGA AATGCTTCCA GACTGCACAC TTTTACTTAO AGGACAGTAC ATCCCCTAOA 2460 
GTTATATCCA CACCTOCAAC ACCTATCTTT CXauVTTTCAG AT6ATGT0GG AGCAATTCCA 2520 
ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA QTAGTGGGTT TACTGAAGAA 2580 
TTTGAGACAC TGAAAGAOTT TTACCAOGAA GTGCAGAGCT GTACTGTTGA CTTA6GTATT 2640 
ACAGCAGACA GCrCCAACCA CCCAGACAAC AAGCACAAQA ATOIATACAT AAATATC6TT 2700 
GCCCATGATC ATAGCAGGGT TAAGCTAGCA CAGCTTGCTG AAAAGGATGG CAAACTGACT 2760 
GATTATATCA ATGCCAATTA TGTTGATGGC TACAACAGAC CAAAAGCTTA TATTGCTGCC 2820 
CAAGGCCCAC TGAAATCCAC AQCTGAAGAT TTCTGGAGAA TGATATGGGA ACATAATGTG 2880 
GAAGTTATTG TCATGATAAC AAACCTOGTG GAGAAAGGAA GGAGAAAATG TGATCAGTAC 2940 
TGGCCTGCCG ATGGGAGTGA GGAGTACX3GG AACTTTCTGO TCACTCAGAA GAGIGTGCAA 3000 
GTGCTTCCCT ATTATACTOT GAGQAATTTT ACTCTAAGAA ACACAAAAAT AAAAAAGGGC 3060 
TCCCAGAAAG GAAGACCCAG TGGACGTGTG GTCACACAGT ATCACTACAC GGAOrGGCXT 3120 
GACATGGGAQ TACCAQAGTA CTCCCTGCCA GTGCTGACCT TTGTGAGAAA GGCAGCCTAT 3180 
GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC GTCCACTGCA GTGCTGGAGT TGGAAGAACA 3240 
GGCACATATA TTGTGCTAGA CAGTATGTTG CAGCA6ATTC AACAOGAAGG AACTGTCAAC 3300 
ATATTTGOCT TCTTAAAACA CATCCGTTCA CAAAGAAATT ATTTGGTACA AACTQA66AG 3360 
CAATATGTCT TCATTCATGA TAC3VCTGGTT GAGGCCATAC TTAGTAAAGA AACTGAGGTG 3420 
CTGGACAGTC ATATTCATGC CTATGTTAAT GCACTCCTCA TTCCTGGACC AGCAGGCAAA 3480 
ACAAAGCTAG AGAAACAATT CCAGGGTCTC ACrCTGTCAC CCAGGCTGGA OTGCAGAGGC 3540 
ACAATCTCGG CTCACTQCAA CCTTCCTCTC CCTGGCTTAA CIGATCCTCC TACCTCA6CC 3600 
TCCCGAGTGG CTAGGACTAT ACTCCTQAGC OVGTCAAATA TACACCAGAG TGACTATTCT 3660 
GCAGCCCTAA AGCAATGCAA CAGGGAAAAG AATOGAACTT CTTCTATCAT CCCTGTGGAA 3720 
AGATCAAGGG TTGGCATTTC ATCCCTGAGT GGAQAAGGCA CAGACTACAT CAATGCCTCC 3780 
TATATCATGG GCTATTACCA GA6CAATGAA TTCATCATTA CCCAQCACCC TCTCCTTCAT 3840 
ACCATCAAGG ATTTCTGGAG GATGATATGG GACCATAATG CCCAACTGGT GGTTATGATT 3900 
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CCTGATGGCC AAAACATGGC AQAAGATGAA TTTGTTTACT GGCCAAATAA AGATC3AGCCT 3960 

ATAAATTGTG AGAGCTTTAA GGTCACTCTT ATGGCTGAAQ AACACAAATQ TCTATCTAAT 4020 

QAGGAAAAAC TTATAATTCA GGACTTTATC TTA6AAGCTA CACAGC3ATQA TTATOTACTT 40 BO 

GAAGTGAGGC ACTTTCAGTG TCCTAAATQG OCAAATCCAG ATAGCCCCAT TAQTA AAACT 4140 

5 TTTGAACTTA TAAGTOTTAT AAAAGAAGAA GCTGCGAATA GGQATGOOCC TATCATTGTT 4200 

CAT6ATGAGC ATGGAGOAGT GAOGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC 4260 

CAACTAGAAA AAGAAAATTC OGTGQATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG 4320 

AGGCCAGGAG TCTTT G CTGA CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC 4380 

CTTOTGAGCA CAAGGCAGGA AGAGAATCCA TCCACCTCTC TGGACAGTAA TGGTGCAGCA 4440 

10 TTQCCTGATG GAAATATAOC TGAGAGCTTA GAGTCTTTAG 4480 
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Seq ID NOt C103 OKA Sequence 

Nucleic Acid Accession it Eos sequence 

coding sequence 1 1..4220 



11 



1 

I 

ATGCGAATCC 
TGGGCTAATG 
ACAGGAGCAC 
CAATCTCCTA 
AAATTTCAOO 
ACAGTGGAAA 
TTTAAAGCAA 
GAOCATAGTT 
GAC06ATTTT 
ATTTTGTTTO 
GAAAGTGTTA 
CTGCCAAACT 
ACAGACACAG 
OCTGTTTTTT 
TTACAAAACA 
ACTGGAAAGG 
GACCXHAGAGA 
ACCATGATT6 
CATGAATTTT 
AATATGAGTT 
AGCGACCAAC 
TTAATTGGAA 
GCTATTGT6A 
ATTTCTACCA 
CGATCCCCAA 
AATTCCACTT 
CAGACTGTGA 
GGCTCTAAAA 
AATACAGTTT 
ACTGGAGCXO 
GAGAACATAT 
GTCCTTATAC 
GAATCACTAA 
ACAGCACAOC 
ATAOSTOTTG 
CAGGGTCCCT 
ACTGAGGTAA 
GTCAACX3TGG 
CATGAOTCTC 
CTTGTGATCG 
TACTGGAGGA 
GTTATATCCA 
ATAAAGCACT 
TTTOAOACAC 
ACAGCAGACA 
GCCTATGATC 
GATTATATCA 
CAAGGCCCAC 
GAAGTTATTG 




21 31 

I I 

TAAAGCX3TTT CCTCGCTTGC ATTCAGCTCC 
GATACTACAG ACAACAGAGA AAACTTGTTQ 
TGAATCAAAA AAATTGGGGA AA6AAATATC 
TCAATATTGA TGAAGATCTT ACACAAGTAA 
6TTGGGATAA AACATCATTG GAAAACACAT 
TTAATCTCAC TAATGACTAC 0GTQTCAGCX3 
GCAAQATAAC TTTTCACTGG GOAAAATQCA 
TAGAAGGACA AAAATTTCCA CTTGAOATGC 
CAAGTTTTQA 6GAAGCAGTC AAAGGAAAA6 
AGGTTGGOAC AGAAQAAAAT TTGQATTTCA 
GTCGTTTTGG GAAGCAGGCT GCTTTAGATC 
CAACTGACAA GTATTACATT TACAATGQCT 
TTQACTGOAT TGTTTTTAAA 6ATACAGTTA 
GTOAAGrrCT TACAAT6CAA GAATCIOOTT 
ATTTTGQAQA QCAACAGTAC AAGTTCTCTA 
AAGAGATTCA TGAAGCAGTT TGTAGTTCAQ 
ATTATACCAQ CCTTCTTGTT ACATGGOAAA 
AGAAGTTTQC AGTTTTGTAC CAGCAGTTGG 
TGACAGATGG CTATCAAGAC TTGGQTGCTA 
ATGTTCTTCA GATAOTAGOC ATATGCACTA 
TGATTGTOGA CATGCCTACT GATAATCCTG 
CTGAAGAAAT AATCAAGGAO GA6GAAGAGG 
ATCCTGGTAO AGACAGTGCT ACAAACCAAA 
CAACACACTA GAATGGCATA GG6AGQAAAT 
CAAGAGGAAG TGAATTCTCT GGAAAG6GT6 
CCCAACXywrr CACTAAATTA OCXaVCAOAAA 
CTGAACTQCC ACCTCACACT GTGGAAGGTA 
CTGTTCTTA6 ATCTCCACAT ATGAACTTGT 
CTATAACA6A ATATGAGQAO GAOAGTTTAT 
AAQATTCTTC AGGCTCCAGT CCCGCAACTT 
CCCAAGGGTA TATATTTTCC TCX3GAAAACC 
CAGAATCTGC TAGAAATGCT TCCGAAGATT 
AOGATCCTTC TATGGAGGGA AATGTGTGGT 
COGATQTTQG ATCAGGCAGA GAGAGCTTTC 
ATGAATCTGA GAAGACAACC AAGTCCTTTT 
CAGTTACAGA TCTGQAAATG CCACATTATT 
CACCTCATGC TTTTACCCCA TCCTCCAGAC 
TATACTCGCA GACAACCCAA C0G6TATACA 
GTATTGOTCT AGCTGAGGGG TTG6AATCG0 
TGTCAGCCCT GACTTTTATC TOTCTAGTOQ 
AATGCTTCCA GACTGCACAC TTTTACrTAG 
CACCTCCAAC ACCTATCTTT CCAATTTCAG 
TTCCAAAGCA TGTTGCA6AT TTACAT6CAA 
TGAAA6AGTT TTACGAGOAA GTGCAGAGCT 
GCTOCAACCA CCCAGACAAC AAGCACAA6A 
ATAGCAGGGT TAAGCTAGCA CAGCTTGCTG 
ATQCCAATTA TGTTGATGGC TACAACAGAC 
TGAAATCCAC AGCTGAAGAT TTCTG6AGAA 
TCATGATAAC AAAGCTOQTG QAOAAAGGAA 
ATGGGAGTGA 0GAGTAG6G0 AACTTTCTGG 
ATTATACTGT GAGGAATTTT ACTCTAAGAA 
GAAGACCCAQ TGGACX?rGTG GTCACACAGT 
TACCAGAGTA CTCCCTGCXZA GTGCTQAGCT 
ATGCAOT6GG GCCTOTT G TC OTCCACTQCA 
TTGTOCTAOA CAGTATGTTG CAGCAGATTC 
TCTTAAAACA CATCCGTTCA CAAAGAAATT 
TCATTCATGA TACACTGGTT QAGGCCATAC 
ATATTGATGC CTATGTTAAT GCACTCCTCA 
AGAAACAATT CCAOCTCCTG AGCCA6TCAA 
TAAAGCAATG CAACA6G6AA AAGAATCGAA 
GOGTTOOCAT TTCATCCCTO A6T06AGAAG 
TGGQCTATTA CCAGAGCAAT GAATTCATCA 
AGGATTTCTG GAGQATGATA TGGGACCATA 
GOCAAAACAT GGCAGAAGAT GAATTTGTrT 
GTGAGAGCTT TAAOGTCACT CT TATG GCTG 
AACTTATAAT TCAGGACTTT ATCTTAGAAG 
ACTrrCTGTG CTCTGACAAC CCTTATGCAC 



41 
I 

TCTGTGTTTG 
AAGAGATTGG 
CAACATGTAA 
ATGT6AATCT 
TCATTCATAA 
GAGGAGTTTC 
ATATQTCATC 
AAATCTACTG 
GQAAGTTAAG 
AAG06ATTAT 
CATTCATACT 
CATTGACATC 
GCATCTCT6A 
ATGTCATGCT 
QACAGGTGTT 
AACCAGAAAA 
GACCTCGAGT 
ATGGAGAGGA 
TTCTCAATAA 
ATG6CTTATA 
AACTTGATCT 
GAAAAGACAT 
TGAGGAAAAA 
ACAATGAA6C 
ATGTTGCCAA 
AAGATATTTC 
CTTCAGCCTC 
CGGGGACTGC 
TGACCAOTTT 
CTGCTATCCC 
CAGAGACAAT 
CAACTTCATC 
TTCCTAGCTC 
TGCA6ACTAA 
CTGCAGGCCC 
CTACCTTTGC 
AACAGOATTT 
ATGAGGCCA6 
AOAAGAAOGC 
TTCTTOTGGG 
AGGACAOTAC 
ATGATGTOGG 
GTAGTGGGTT 
GTACTQTTGA 
ATC6ATACAT 
AAAAGGATGG 
CAAAAGCTTA 
TGATATGGGA 
GGAGAAAATO 
TCACTCAOAA 
ACACAAAAAT 
ATCACTACAC 
TTGTGAQAAA 
GTGCTGGAGT 
AACACGAAGG 
ATTTGGTACA 
TTA6TAAAGA 
TTCCTGGACC 
ATATACAGCA 
CTTCTTCTAT 
GCACAGACTA 
TTACCCAGCA 
ATGCCCAACT 
ACT6GCCAAA 
AAGAACAGAA 
CTACACAGQC 
CAACTAGAAA 



SI 
I 

COGCCTGGAT 
CTGQTCCTAT 
TAGCCCAAAA 
TAA6AAACTT 
CACTGGGAAA 
AQAAATGGTG 
TGATGGATCA 
CTTT6ATGCG 
AGCTTTATCC 
TGATGGAGTC 
GTTGAACCTT 
TCCTCCCTGC 
AAGCCAGTTG 
GATGGACTAC 
TTCCTCATAC 
TGTTCAGGCT 
CGTTTATGAT 
CCAAACCAAG 
TTTGCTACCC 
TGOAAAATAC 
TTTCCCTGAA 
TGAAGAAGGC 
GGAACCCCAG 
CAAGACTAAC 
TACATCTTTA 
CTTGACTTCT 
TTTAAATQAT 
AOAATCCTTA 
CAAOCTTGAT 
ATTCATCTCT 
AACATATGAT 
AGGTTCAGAA 
TACAGACATA 
TTACACTGAO 
AGTGATGTCA 
CTACTTCCCA 
GQTCTCCAOG 
TAATAGTAGC 
AQTTATACCC 
TATTCTCATC 
ATCCCCXAGA 
AGCAATTCCA 
TACTGAAOAA 
CTTAGGTATT 
AAATAT08TT 
CAAACT6ACT 
TATTGCTGCC 
ACATAATGTG 
TGATCAOTAC 
GAGTGTGCAA 
AAAAAAGGGC 
GCAGTGGCCT 
GGCAGCCTAT 
TG6AAGAACA 
AACTGTCAAC 
AACTGA6GAG 
AACTGAGGTG 
AGCAGGCAAA 
GAGTGACTAT 
CATCCCTOTG 
CATCAATQCC 




60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

3340 

2400 

2460 

2520 

2580 

2640 

3700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3340 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 
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5 



OOTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG A6GCCAGGAQ TCTTTGCTGA 4080 

CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAQCA CAAGGCAGGA 4140 

AGAGAATCCA TCCACCTCTC TGGACAGTAA TGGTGCA6CA TTGCCTGATG GAAATATAGC 4200 

TGAGAGCTTA GAGTCTTTAG 4220 

Seq ID NO! C104 DNA Sequence 

Nucleic Acid Accession XM_0029X4.6 

Coding sequence: 1..4314 " 

10 1 11 21 31 41 SI 

I I I I I I 

ATGAAGGATA TCGACATAGG AAAAGAGTAT ATCATCCCCA GTCCTGGGTA TAGAAGTGTG 60 

AGG6A6AGAA CCAGCACTTC TGGGAOGCAC AGAGACGGTG AAGATTCCAA GTTCAGGAGA 120 

1 C ACTO GAOOGT TGGAATQCCA AGATGOCTTG GAAACAGCAG OOCQAGOC36A GGGCCTCTCT IBO 

IJ CTTGATGCXnr CCATGCATTC TCA6CTCAGA ATCCTGGATG AGGA6CATCC CAAGGGAAAQ 240 

TACCATCATG GCTTGAGTGC TCTGAAGCCC ATCCGGACTA CTTCCAi^ACA CCAGCACCCA 300 

GTGGACAATG CTGGGCTTTT TTCCTGTATG ACTTTTTCGT GGCTTTCTTC TCTGOCCCGT 360 

GTGGCCCACA AGAAG66G6A GCTCTCAATG GAAGAC6TGT GGTCTCTGTC CAAGCAOSAG 420 

TCTTCIGAOG TGAACTGCAO AAOACTAGAO AOACTGTGGC AA6AAGA6CT GAATGAAGTT 480 

ZU GgCCAGACG CTGCTTCCCT GOGAAGGGTT GTGTGGATCT TCTGCCGCAC CAGOCTCATC 540 

CTGTCCATOO TGTGCCTGAT GATCACGCAG CTGGCTGGCT TCAGTGGACC AGCCTTCATG 600 

GTGAAACACC TCTTGGAGTA TACCCAGGCA ACAGAGTCTA ACCTGCAGTA CAGCTTGTTG 660 

TTAGTGCTGQ GCCTCCTCCT 6A0GGAAATC GTGCGGTCTT GGTCGCTTGC ACTGACTTGO 720 

GCATTGAATT ACCX3AAC0GG I 'G'l'CU U Cn'O GGOGGGGCCA TCCTAACCAT GGCATTTAAG 780 

AAOATCCTTA AGTTAAAGAA CATTAAAQAG AAATCCCT6G 6TGAGCTCAT CAACATTTGC 840 

TCCAACX3ATG GGCAGAGAAT GTTTGAGGCA OCAGCCQTTG 6CAGCCTGCT GGCTGGAGGA 900 

CCCGTTGTTG CCATCTTAGG CATGATTTAT AATGTAATTA TTCTGGGACC AACAGGCTTC 960 

CTGGGATCAG CTGTTTTTAT CCTCTTTTAC CCAGCUUITOA TOTTTGCATC ACGGCTCACA 1020 

GCATATTTCA GQAQAAAAT6 CX3T66CCGCC A06GAT6AAC GTOTCCAGAA GATGAATGAA 1080 

GTTCTTACTT ACATTAAATT TATCAAAATG TATGCCTGGG TCAAAGCATT TTCTCAGAGT 1140 

GTTCAAAAAA TCCGCX5AGGA GGAGCXyiOGG ATATTGGAAA AAGCTGGGTA CTTCCAGAGC 1200 

ATCACTGTGG GTGTGGCTCX: CATTGTGGTG GTQATTGCCA GOGTGGTGAC CTTCTCTGTT 1260 

CATATGACCC TGGGCTTCGA TCTGACAGCA GCACAGGCTT TCACAGTGGT GACAGTCTTC 1320 

AATTCCATGA CTTTT6CTTT GAAAGTAACA CGQTTTTCAG TAAAGTCCCT CTCA6AAGCC 1380 

TCAGTGGCTG TT6ACAGATT TAAQAGTTTG TTTCTAATGG AAGAGQTTCA CATGATAAAG 1440 

AACAAACCAG CCAGTCCTCA CATCAAGATA GAGATGAAAA ATGCCACCTT GGCATGGGAC 1500 

TCCTCCCACT CXZAGTATCCA GAACTCGCCC AAQCTGACCC CCAAAATGAA AAAAGACAAG 1560 

AGGGCTTCGA GGGGCAAGAA AGAGAAGGTG AGGCAGCTGC AGCGCACTGA GCATCAGGCG 1620 

GTGCTGGCAG AGCAGAAAGG CCACCTCCTC CTGOACAGTG ACGAGOGGCC CAGTCCCGAA 1680 

GAGGAAGAAG 6CAAGCACAT CCACCTGGGC CACCTGCGCT TACAGAGGAC ACTQCACAGC 1740 

ATCGATCTGQ AGATCCAAGA GGGTAAACTG GrPGGAATCT GCX5QCAGTGT GGGAAGTGGA 1800 

AAAACCTCTC TCATTTCAGC CATTTTAGGC CAQATQACGC TTCTAGAGGG CAGCATTGCA 1860 

ATCAGTGGAA CCTTCGCTTA TGTGGCCCAG CAGGCCTGGA TCCTCAATGC TACTCTGAGA 1920 

GACAACATCC TGTTTGGGAA GGAATATGAT GAAQAAAGAT ACAACTCTGT GCTGAACAGC 1980 

TGCTGCCTGA GGCCTGACCT GQCCATTCTT CCCAGCAGCQ ACCTGAOGGA GATTGGAQAG 2040 

CX3A0QAQCCA ACCTGAGCGG TGGGCAGCGC CAGAGGATCA GCCTTGCCCQ GGCCTTGTAT 2100 

AGTGACAGGA GCATCTACAT CCTGQACQAC CCCCTCAGTQ CCTTAGATGC CCATGTGGGC 2160 

AACCACATCT TCAATAGTGC TATCCXSGAAA CATCTCAAGT CCAAQACAGT TCTGTTTGTT 2220 

ACCCACCAGT TACAGTACCT GGTT6ACTGT QATGAAGTGA TCTTCATGAA AGAGGGCTGT 2280 

ATTACXSGAAA GAGGCACCCA TGAGGAACTG ATQAATTTAA ATGGTGACTA TGCTACCATT 2340 

TTTAATAACC TGTTGCTGGG AGAGACACCG CCAGTTQAGA TCAATTCAAA AAAGQAAAOC 2400 

AGTQGrrCAC AGAAGAAGTC ACAAGACAAG GGTCCTAAAA CAGGATCAGT AAAGAAGGAA 2460 

AAAGCA6TAA AGCCAGAGOA AGGGCAGCTT GTGCAGCTGG AAGAAAAAGG GCAGGGTTCA 2520 

GTGCCCTGGT CAGTATATGG TGTCTACATC CAGGCTGCTG GGGGCCCCTT GGCATTCCTQ 2580 

GTTATTATGG CCCTTTTCAT GCTGAATGTA GGCAGCACCG CCTTCAGCAC CTGGTGGTTG 2640 

AGTTACTGGA TCAAGCAAGG AAGCGGGAAC ACCAC7GTGA CT06AGGGAA OQAGAGCTGG 2700 

GTGAGTGACA GCATGAAGGA CAATCCTCAT ATGCAGTACT ATGCCAGCAT CTAOGCCCTC 2760 

TCCATGGCAG TCATGCTGAT CCTGAAAGCC ATTCGAGGAG TTGTCTTTGT CAAGGGCACG 2820 

CTG06AGCTT CCTCCOGGCT GCATGACGAG CTTTTCOGAA GGATCCTTCX3 AAGCCCTATG 2880 

AAQTTTTTTG ACACGACCCC CACAGGGAGG ATTCTCAACA GGTTTTCCAA AGACATGGAT 2940 

GAAOTTGACG TGCGGCTGCC GTTCCAGGCX: GAGATGTTCA TCCAGAACGT TATCCTQOTG 3000 

TTCTTCTGTG TGGGAATGAT CGCAGQAQTC TTCCCGTGGT TCCTTGTGGC AGTGGGGCCC 3060 

CTTGTCATCC TCTTTTCAGT CCTGCACATT GTCTCCAGGG TCCTGATTCG GaAGCTOAAG 3120 

CX3TCTGGACA ATATCACGCA GTCACCTTTC CTCTCCCACA TCACGTCCAG CATACAGGGC 3180 

CTTGCCACCA TCCACXSCXTTA CAATAAAGGG C3WK1AGTTTC TGCACAGATA CCAGGAGCTG 3240 

CTGQATQACA ACCAAGCTCC TTTTTTTTTG TTTACGTGTG OGATGOtSGTO GCTGGCIQTG 3300 

CGGCTGQACC TCATCAGCAT CGCCCTCATC ACCACCACGG GGCTGATOAT C6TTCTTATG 3360 

CACOGQCAGA TTCCCCXAGC CTATGCGGGT CTCGCCATCT CTTATGCTGT CCAGTTAAOG 3420 

GGGCTGTTCC AGTTTAOGGT CAGACTGGCA TCIGAGACAG AAGCTCQATT CACCTCGGTG 3480 

CAGAGGATCA ATCACTACAT TAA6ACTCTG TCCTTGGAAG CAGCT6CCA6 AATTAAGAAC 3540 

AAGGCTCCCT CCCCTGACTG GCCCCAGGAO GGAGAGGTGA CCTTTQAGAA CGCAGAGATG 3600 

A6GTAC0GAG AAAACCTCCC TCTOGTCCTA AAGAAAGTAT CCTTCAOGAT CAAACCTAAA 3660 

GAGAAGATTG QGATTGTGGG GCGGACAGGA TCAGGGAAGT CCTCGCTGGG GATGGCCCTC 3720 

TTCCGTCTGG TGQAaTTATC TGGAGGCTGC ATCAAGATTG ATQGAGTGAG AATCAGTGAT 3780 

ATT6GCCTT0 CCGACCTCOG AAGCAAACTC TCTATCATTC CTCAAQAGCC GQTGCIGTTC 3840 

AGTGGCACTG TCAOATCAAA TTTGGACCCC TTaOVCCAGT ACACTGAAGA CCAQATTTGG 3900 

QATQCCCrGG AGAGGACACA CATGAAAGAA TGTATTGCTC AGCTACCICT GAAACTTGAA 3960 

TCTGAAGTGA TGGAGAATGO G6ATAACTTC TCAGTGGGGG AAOGGCAGCT CTTGTGCATA 4020 

GCTAGA60CC TGCTCCGCCA CTGTAAGATT CTGATTTTAG ATGAAGCCAC AGCTGCCATG 4080 

GACACAOAGA CAGACTTATT GATTCAAGAO ACCATCCQAG AAGCATTTGC AGACTGTACC 4140 

ATGCTGACCA TT6CCCATCG CCTGCACAOG GTTCTAGGCT CCGATAGGAT TATGGTGCTG 4200 

GCCCAGGQAC AGGTGGTGGA GTTTQACACC CCATCGGTCC TTCrGTCCAA CGACAGTTCC 4260 

CGATTCTATG CCATGTTTGC TGCTGCA6AO AACAAGGTCG CTGTCAAGGG CTAG 4314 
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CCGGGCAGGT GGCTCATGCT CGGQAGCGTG GTTGAGCX5GC TGGOGCGGTT GTCCTGGAGC 60 

AGGGGOGCAG GAATTCTGAT GTGAAACTAA CAGTCTGT6A GCCCTGGAAC CTCGGCTCAa 120 

AGAAGATX^ GGATATGGAC ATAGGAAAAO AGTATATCAT OOOCAGTCCT G06TATA6AA 180 

GTGTGAGGGA GAQAACCA6C ACTTCT06GA OGCACAGAGA CCGTGAAGAT TCCAA6TTCA 240 

GGAGAACTOG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCOSA GCCGAGGGCC 300 

TCTCTCTTGA TOCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGO 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AOCCCATCCO GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT OTATQACTTT TT0GTG6CTT TCTTCTCXQG 480 

CCC6TGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAQA OGTQTaaTCT CTGTCCAAOC 540 

AOGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGA(3AQACT GTGGCAAGAA GAGCT6AAT0 600 

AAGTTGQGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCXrrOTC CATCGTGTGC CTGATGATCA 06CAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATOQTGAA ACACXTTCTTG GAGTATACCC AG6CAACA6A GTCTAACXTTO CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTQOG GTCTTGOTCXS CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCOGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CXTTGOOTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATQTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCOST TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GOAOCAACAO 1080 

6CTTCCTG6G ATCAGCTGTT TTTATCCTCT TTTACCCAQC AATGATGTTT GCATCACGGC 1140 

TCACAQCATA TTTCAGGAGA AAATGCX3TGG COGCCACGQA TGAAOGTGTC CAGAAGATGA 1200 

AT6AAGTTCT TACTTACATT AAATTTATCA AAATOTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGT6TTCA AAAAATCCGC GAGGAGQAGC GTCGQATATT GGAAAAA6CC GGGTA CTTC C 1320 

A6GGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGOTGAT TGCCAGOGTG GTGACCTTCT 1380 

CTGTTCATAT GACCXTTGOGC TTOQATCTGA CAGGAGCACA GGCTTTCACA 6T6GTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACOSTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AQATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CQCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAQAGOGC TTCCAGQGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGOGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGAOGAG OGGCCCAGTC 1800 

CXmAGAOGA AGAAGGCAAG CACATCCACC TGGGCCACCT GC6CTTACAG AGGACACTGC 1860 

ACAGCATOGA TCTGGAQATC CAAGAGGGTA AACTGGTrOG AATCTQOGGC AGTGTXjGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GAOGCTTCTA GAGG6CAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGA6ACAA CATOCTOTTT GGGAAGGAAT ATGATQAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTOAOOCCT GACCTGGCCA TTCTTCXX3VG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGOGGTGGGC AOCGCCAGAG GATCAGCCTT GCCXX5GGCCT 2220 

TGTATAGTGA CAGGAQCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAQTTCTGT 2340 

TTGTTACCCA GCA6TTACA0 TACCTGOTTG ACTQTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAOAGGC ACCCATGAGG AACTQATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGA6A CACCGCCAGT TOAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAQTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

OTTCAOTOCC CTGQTCAOTA TATGOTGrCT ACATCTAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTT A T TATG6C0CTT TTCATGCTGA ATGTAGGCAG CACOGCCTTC AGCACCTGGT 2760 

GGTTGA6TTA CTOGATCAAG CAAGGAA0C3G GGAACACCAC TGTGACrOGA GGGAAOGAGA 2820 

CCTOGOTGAG TGACAGCATO AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCX3 AGGAGTTGTC TTTGTCAAGG 2940 

GCAOGCTGCX} AOCTTCCTCC CX3GCTGCATQ ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTT6ACAG6 AC OC CCACRG GGAGOATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACXSTGCGG CTGCOGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTr OTGQCAGTGG 3180 

GGCGCCTTGT CATCCTCTTT TCA6TCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGOAGC 3240 

TQAA80STCT 06ACAATATC A06CAGTCAC CTTTCCTCTC CCACATCAOG TCCAOCATAC 3300 

AG60CCTT0C CAGCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCRGQ 3360 

AGCTGCTGQA TGACAACCTU^ GCTCCTTTTT TTTTGTTTAC GTGTGCGATQ OGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCX3CCC TCATCACCAC CACX3GGGCTG ATGATCX3TTC 3480 

TTATGCACGG GCAGATTCXX: CCAGCCTATG OGGGTCTOGC CATCTCTTAT GCTGTCXAQT 3 540 

TAA06GG6CT GTTCCRGTTT AOGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 
GGQTOGAGAQ GATCAATCAC TACATTAAGA CTCrGTCCTT GGAAGCACCT GCCAGAATTA 3660 
AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAOQGAGA GGTGACCTTT GAGAACOCAG 3720 
AGATGAGGTA C06AGAAAAC CTCCCXCTTG TCCTAAAOAA AGTATCCTTC ACGATCAAAC 3780 
CTAAAGAGAA 6ATTGGCATT GTGGGGG6GA CAGGATCAGG GAAOTCCTCG CTGGGGATGG 3840 
CCCTCTTCGS TCTGGTGGAO TTATCTQQAO GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 
GTOATATTOG CCTTOCCGAC CTOOGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 
TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT QAAGACCAGA 4020 
TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 
TTGAATCTGA AGTQATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACQG CAGCTCTTGT 4140 
GCATAOCTAO AOOCCTOCTC OSOCACTGTA AGATTCTQAT TTTAGATGAA GCCACAGCTG 4200 
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT COGAQAAGCA TTTGCAGACT 4260 
QTACCATGCT GACCATTGCC CATCGC CT GC ACACXWTTCT A06CTCCGAT AGGATTATGG 4320 
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAAOIACA 4380 
GTTCCOGATT CTATGCCATG TTTGCTQCTO CAGAGAACAA GGTOGCTGTC AAGGGCTGAC 4440 
TCCTCOCTOT TGA06AAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGO 4500 
CCCCTCATCG CGTOCTCCTA CCQAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACACCA 4560 
O T TCCGQATT QGCTTGTOTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 
ATTCCATATT CATGTAAACA AAATTTAOTT TTTGTTCrPA ATTQCACTCT AAAAGGTTCA 4680 
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GGGAACOGTT 
TCTATATATA 
TATTAAAATA 
TTGCTGTACT 
CTCTAGCTGG 
ATAGTGGGCC 
GAGACGGGT6 

TTTCACTCCC 
TTTCCTQCCT 
TCCCACTGCC 
GTTGGTTCCA 
ATTCCCACAC 
CTCACCGCAG 
CA0CTCTT6C 
ACCTCAGGTT 
GGGGCTOGTA 
ATGTOGTGAC 
GAAAAATCTG 
AAAAAAAAAA 



ATTATAATTG 
ATTCTGTACA 
AGCACTGTGC 
AGAGATCTGG 
TGQTTTCACG 
CTCCGACAGC 
GGCGGCTGGA 
GTCACTTACT 
TCCATCAAGA 
TCTTCTTTTT 
TCAGGTTCCT 
AGCCCTGGAG 
CTCCACAGTT 
TCGTCGCACA 
TAATCAGTGT 
GCTGGTTGCT 
GCTCAGGTGG 
CAACTAGACA 
AAAATGTGAA 
AAAAAAAA 



TATCAGAGGC 
TAGCCTATAT 
TAATAACAGT 
TTTTGCTATT 

gtgcx:aggtt 
cccctctgcc 
gaccatgcag 
gtttctqtca 
atggggatca 

GCTQTTGTTT 
ATGGCTGGCC 
CCAACTGCTG 
CAGTGGCAGG 
GTCTCTCTCT 
CTCACACTGO 

GCGTGGTCAC 
TT CTGTCGCC 
TAAAATTATT 



CTATAATOAA 
TTACAGTQAA 
GCATATTCCT 
AGACTGTAGG 
TTCTGGGTGT 
GCCTCCCCAC 
AGCGCCXSTGA 
GGAGAGCAGC 
CAOAGACATT 
CTAAACAA6A 
ACTGCACAGA 
CTTTTTGAGG 
GCTCAGGATT 
CTCrCTCCCC 
OOTAOAAGTT 
GTOTGTTCCC 
TGCTGTCATC 
TTAGCATGTT 
TTG6ATTTTG 



GCTTTATAOG 
AAT6TAAGCT 
TTCTATCATT 
AAGAGTAGCA 
CCAAAGGAAG 
AGCCGCTCCA 
GTTCTCAGGG 
GGG6CGAAGC 
GCTCGQAGCC 
ATCAGTCTAT 
GCrCTCCAGC 
TGGCACTTTT 
TOGTGGGTCT 
TCAAAGTCTG 
TTTGTACTGT 
GCAAACCCCC 
AGTTOAATGG 
TGCTGAACAC 



TQTAGCTATA 
GTTTATTTTA 
TTTGTACAQT 
TTTCATTCTT 
AOGTGTGOCA 
GGGGTGGCTG 
CTCXTTGCXTTT 
CCA66CCCCT 
GGGGAGTTX'C 
CCACA6A0AG 
TCCAAGACCT 
TCATTTGCCT 
GTTTTCCTTT 
CAACTTTAAG 
AAAOAQAOCT 
TTTGTGCTGT 
TCAGOGTTGC 
CTTGTGGAAG 
AAAAAAAAAA 



Seq ID NO: C106 DNA Sequence 
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ACAGCGGAGC 6CAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGGGG 60 

A6ACA0AGAC T6A6G6GCCC G6CAC06CCA T6CCT60GCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTGGCT CCTCCTGCCC GCAGCGGGGG CCACCTGCAG 6AGGGAAGTC TGTGATT6CA 180 

ATGGGAAOTC CAOGCAOTOT ATCTTT6ATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACGG GCACAGAGAA AGQGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

CTCTTAGTGC TCGATOTGAC AACTCTGGAC GGTGCAGCTG TAAACCAG6T GTGACAGGAG 420 

CCAGATGCGA C06ATGTCT6 CCAGGCTTCC ACATGCTCAC GQATG0GGG6 TGCACCCAAG 480 

ACCAGA6ACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGQTGTCGAT 600 

CAGGTTACTA TAATGTOGAT GGGGGGAACC CTGAGGGCTG TACXXA0T8T TTCTGCTATG 660 

GGCATTCA6C CAGCT6C06C A6CTCT6CA0 AATACA6TGT CCATAAGATG ACXTTCTACCT 720 

TTCATCAAQA TGTTQATGGC TG6AAG6CT6 TCCAACGAAA T6GGTCTGCT GGAAAGCFCC 780 

AATG6TCACA GCGCCATCAA GAT6TGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAQ CTATGGGGAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTQ ATTCTGGAAG 960 

GTGCTG6TCT AGGGATCACA GGTCCCTTGA TGGCACTTGG CAAGACACTQ CCTTGTQ6GC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCC3VA6 CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCOGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCXTCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACaAGGGGC 1260 

AATTCTGCCA OGATTOTOCT TCTGGCTACA AOAOAGATTC AGOQAGACTG GG6CCTTTTG 1320 

GCACCTGTAT TCCTT6TAAC TGTCAA6G66 GAGGGGCCTG T6ATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT OOTTTCTACA 1440 

ACGATCCGCA OOACCCCOGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACX36AO GAGGTGGTGT GCAATAACTG CCCTCCGGGG GTCACCGGTG 1560 

CC06CTGTQA GCTCXOTQCT GATGGCTACT TT6GGGACCC CTTTGGTGAA GAT66CGCAG 1620 

TGAGGCCrre TCA6CCCTGT CAATGCAACA ACAATGTGGA CCGCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGO TGTTTGAAGT GTATCCACAA CACAGCCX^GC ATCTACTGCO 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTOTAACCCC ATGGGCTCAO AGCCTGTAGG A1X3TOGAA6T GATGGCACCT 1860 

GTGTTTGCAA GCCAOGATTT GGTGGCCCCA ACTGT8A6CA TGQAGCATTC AGCTGTOCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCftGGTT CAGAGAATG6 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGG7G ATGGAGTAGT ACCTGATACA GAGCTGGAAO 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTGTCCAGT TGGCCAAGGT GAGGAGCCAA 6AGAACAGCT 2160 

ACCAGA6C08 CCTGGATGAC CTCAAGATGA CTGTGGAAAa AGTTGQG6CT CTGGGAAGTC 2220 

AGTACCAGAA CCC3AQTT06G QATACTCACA GGCTGATCAC TCAGATGCAQ CTGAOCCTGO 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACXAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTQ GCTCAGQAGO CCACAAGATT AGCAGAAAGC CACX3TTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAA6GG AAACTGAGGA CTATTCCAAA CAA6CCCTCT 2460 

CACIGGTGOQ CAAGGCCCTG CATGAAGOAO TGQGAAGGQa AAGOGGTAGC CGGQAOSaTQ 2520 

CT6TGGTGCA AGGGCTTOTO GAAAAATT60 AGAAAACCAA GTCCCTGOCC CAGCAGTT6A 2580 

CAAGGGAGGC CACTCAAGOS GAAATTGAA6 CAGATAGGTC TTATCAGCAC ACTCTCCGCC 2640 

TCCTGGATTC A6TGTCTCGG CTTCAGGGAG TCAGTQATCA OTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCX3GATTCAC TCTCAAOGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGOG TACACAAAA6 AATCTGGGAA ACTGGAAAGA AOAAGCACAG CAGCTCTTAC 2820 

AGAAT66AAA AAGTGGGAGA GAGAAATCAG ATCA0CT6CT T T OCOGTGCC AATCTT6CTA 2880 

AAAGCA6AGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTA7GAA GTTGA6AGCA 2940 

TCCTTAAAAA CCTCAGAQAO TTTGACCTGC AGGTGOACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATQAA GAGACTCTCG TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

AOCAAGCAGA AAOAGCCCTQ GQGAGOGCTG GTGCIQATOC ACA0AGG6CA AAOAATGGQO 3120 

COGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACftGGA GATTGGQAGT CTGAACTTGO 3180 

AAGGCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTQGGC TCTCTGAAGA 3240 

OTGAGATGAG GGAAOTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTO ATTACAGAAO CCCAGAAGQT TGATACCAGA GCCAAGAACG 3360 

CTGQGQTTAC AATCCAA6AC ACACTCAACA CATTAOACGG CX:TCCTGCAT CTQATGQACC 3420 
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AGCCTCrCAG TGTAOATGAA GAGGGGCTGG TCTTACTGQA GCAGAAGCTT TCCCGAGCCA 34 BO 

AGACCCAGAT CAACAGCCAA CTGOSGCCCA TGATGTCAQA GCTQQAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

AGAACTTGOA GAACATTAGG GACAACCTGC CCCCAOGCTQ CTACAATACC CAGOCTCTTG 3660 

AGCAACAGTG AAGCTGCCAT AAATATTTCP CAACTGAGGT TCTTGGGATA CAOATCTCAO 3720 

GGCTCX5GGAG CCATGTCATG TGAGTGGGTG GGATGGGQAC ATTTQAACAT GTTTAATGGG 37 BO 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTT6CTTC CTGATGCTGQ GCAATGAG6C AGATAGCACT GGGTGTGAGA 3900 

ATGATCAAOO ATCT6GACCC CAAAGAATAO ACT6GATGGA AAGACAAACT GCACftOGCAO 3960 

ATGTTT6CCT CATAATAGTC GTAAGPGGAG TCXTTCGAATT TGGACAAGTG CTGTTG6GAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGQAAAA AACTTTGACT TTGCOCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGOCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

AOCCAGGOTG TGAACATGTT CTGCATTTTC AAGCTGGAAG AAGTGAGCAG TQTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC ATTCAGA6CT ATGGT6CTTG CTGGT GC CrO CCACCTTCAA 4320 

GTTCTGGACC TOGQCATQAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTOG 4440 

GTTTCAAAGT GATAOAAAAG TCSTGGCTTGG GCATTQAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTA6TCCTA ATTCAATCCT ACTTTTGGAA CACCAAAAAT GATGOGCATC AA-TOTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAOAQAA TCTTCCTACT 4620 

CACACTTCAO CTGGOTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGT6ACATAS TCTCTQCCCT GATAQAOTTQ ATTGTCTAGT GAGGAAGACA 4600 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTQTT TGTCACAAGT 6GTGTTTATT 4860 

ZD GCAATAACC33 CTTGOTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 

CATGGGGGCA CTTGAQTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT OTCACTCTGT QCX:TTTCTAC AACTGATTOC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATT6CT GGAGGAACCA GAGGCACTTC CACCTTOGCT GGGAAGACTA 5100 

TOGTSCTQCC TT6CTTCTGT ATTTCCTT G G ATTTTCCTOA AAGTGTTTTT AAATAAAGAA 5160 

CAATTGTTAQ ATGCC 5175 



5 

10 
15 
20 



30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



Seq ID MO: C107 DMA Sequence 
Nucleic Acid Accession «s NM_021101 
Coding sequences 2 21.. 85 6 

1 11 21 31 41 51 

I I I I I I 

OAATTOGGCA GGAGGCCTCX3 TGC0GGG6AG CAACCTCAGC TTCTAGTATC CAGACTCCAO 60 

06C00CCCO3 G6C60G6ACC OCAAOOOCQA CCCAGAOCTT CTCCAGOGGC OGGOCAGCGA 120 

GCAGGGCTCC CXX3CCTTAAC TTCCTCOGCG GGGCCCA6CC AOCTTOGGGA 6TCGGGGTTG 180 

CCCACCTGCA AACTCTCXX3C CTTCTGCACC TGCCACCCCT QAGCCAGOGC GGGCGCCC3QA 240 

GCGAGTCATQ GCCAACGCGG GGCTGCAGCT GTTGGGCTTC ATTCTCX3CCT TCCTOGGATG 300 

OATCX3GCX3CC ATOGTCAGCA CTGCCCTGCC CCAGTGGAGG ATTTACTCCT ATGCCG6CGA 360 

CAACATOGTO ACOGCCXAOO CCAT0TA06A GGG6CTGTGG ATOTOCTGOG TGTCGCAOAG 420 

CACOGGQCAG ATCCAGTGCA AAGTCTTTGA CTCCTTGCT6 AATCTGAGCA GCACATTGCA 480 

AGCAACCCOT GCCTTGATGG TGGTTGGCAT CCTCCTGOQA GTGATAGCAA TCTTTGTGGC 540 

CACCXTTTGGC ATGAAGTGTA TGAAGTGCT7 G6AA0ACGAT GAGGT6CAGA AGATQAGGAT 600 

GGCTGTCATT GGGGGCGCX3A TATTTCTTCT TOCAGQTCTQ OCTATTTTAG TTGCCACAGC 660 

ATGGTATGGC AATAGAATOG TTCAAGAATT CTAT6ACCCT A7GACCGCA6 TCAATGGCAG 720 

GTA06AATTT GGTCAGGCTC TCTTCACTGG CTGGGCTGCT GCTTCTCTCT 6CCTTCTGGG 780 

AGGTGCCCTA CTTTGCTQTT CCTGTCCCCX! AAAAACAACC TCTTACCC3UV CACCAAGGCC 840 

CTATCCAAAA CCTOCACCTT CCAGCGGQAA AGACTAOGTG TGACACAGAG GCAAAAGGAG 900- 

AAAATCATGT TQAAACAAAC GQAAAATGGA CATTGAGATA CTATCATTAA CATTAGGACX: 960 

TTAGAATTTT GGGTATTGTA ATCTAAAGTA TGTTATTACA AAACAAACAA ACAAACAAAA 1020 

AACCCATGTG TTAAAATACT CAGT6CTAAA CATGQCTTAA TCTTATTTTA TCTTCTTTCC 1080 

TCAATATAGG AGGGAAGATT TTTCCATTTG TATTACTGCT TCCCATTGAO TAATCATACT 1140 

CAAATGGGGQ AAGGG6TGCT CCTTAAATAT ATATAGATAT GTATATATAC ATGTTTTTCT 1200 

ATTAAAAATA GCCAGTAAAA AAAAAAAAAA AAAAAAA 1237 

Seq ZD NO: C108 DMA Sequence 
Huclelc Acid Accession #: AF508964.1 
Coding sequences 98.. 1531 

1 11 21 31 41 51 

I I I i I I 

CA0A6CCGCA A006CAGGGA AGGCCTCCCC GCACGGTGGG GGAAA60GGC 0GGT6CAGCG 60 

CGGGGACAGG CACTCGGGCT GGCACTGGCT GCTAGGGATG T OG TCCTGGA TAAGGTGGCA 120 

TG QACC OQCC ATGOCOOGQC TCTGGGOCTT CTGCTGGCTG GTTGTGGGCT TCTGGAGGGC 180 

OOCTTTOQCC TGTCCCAOGT CCTGCAAATG CAGTGCCTCT C6GATCTGGT GCAGG6ACCC 240 

TTCTCCTGGC ATOGTGGCAT TTCOGAGATT GGAGCCTAAC AGTGTAGATC CTGAGAACAT 300 

CACCGAAATT TTCATCGCAA ACCAGAAAAG GTTAOAAATC ATCAAOJAAG ATGATGTTQA 360 

AGCTTATGTG GGACTOAGAA ATCTGACAAT TGTGGATTCT GGATTAAAAT TTGTGGCTCA 420 

TAAAGCATTT CTQAAAAACA OCAACCT6CA GCACATCAAT TTTACCCGAA ACAAACTGAC 460 

GAGTTTOTCT AGGAAACATT TCOGTCACCT TGACTTGTCT GAACTQATCC TG6TGGGCAA 540 

TCCATTTACA TGCTCCTOTG ACATTATQTO GATCAAGACT CTCCAAQAGG CTAAATCCAG 600 

TCCAGACACT CAGGATTTGT ACTGCCTGAA TQAAAGCAGC AAOAATATTC CCCTGGCAAA 660 

CCTGCAGATA CCCAATTGTG GTTTOCCATC TGCAAATCTG GCCGCACCTA ACCTCACPGT 720 

GGAGGAAGGA AAGTCTATCA CATTATCCTG TAGTGTGGCA GGT6ATCCG0 TTCCTAATAT 780 

GTATTOQQAT GTTGOTAACC TGGTTTCCAA ACATATGAAT GAAACAA6CC ACACACAGG6 840 

CTCCTTAAG6 ATAACTAACA TTTCATCCGA TGAGAGTGGG AAGCAGATCT CTT6TGT0QC 900 

GGAAAATCTT GTAGGAQAAG ATCAA6ATTC TGTCAACCTC ACTGTGCATT TTGCACCAAC 960 

TATCACATTT CTCGAATCTC CAACCTCAGA CCACCACTGG TOCATTCCAT TCACTQTQAA 1020 

AGGCAACCCC AAACCAGCGC TTCAGTGGTT CTATAACGGO GCAATATTGA ATQAOTCCAA 1080 

ATACATCTOT ACTAAAATAC ATGTTACCAA TCACAOQQAG TACCACGGCT 6CCTCCAGCT 1140 
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GGATAATCCC ACTCACATGA ACAATGGGCSA CTACACTCTA ATAGCCAA6A ATGAGTATGO 1200 

GAAGGATGAG AAACAGATTT CTGCTCACTT CATGOQCTGG CCTGGAATTG ACGATGGTGC 1260 

AAACCCAAAT TATCCTOATG TAATTTATGA AGATTATGGA ACTGCAGG6A ATGACAT06G 1320 

G6ACACCACX3 AACAGAAGTA ATGAAATCOC TTCCACAGAC OTCACTGATA AAAC0GGTG6 1380 

GGAACATCTC TOGOTCTATG C ltSTG QTG G T GATPGOGTCT GTGGTGGGAT TTT GCCTTT T 1440 

GGTAATGCTG TTTCTOCTTA AGTTGGCAAO ACACTCCAAG TTTGGCATGA AAGQTTTTGT 1500 

TTTGTTTCAT AAGATCCCAC TGGATGGGTA GCTGAAATAA AGGAAAAGAC AGAGAAAGGG 1560 

GCTGTGGIGC TTGTTGGTTG AT6CTGCCAT GTAAGCTGGA CTCCTGGGAC TGCTGTTGGC 1620 

TTATCCCGGG AAGTGCTGCT TATCTGG6GT TTTCTGOTAG ATGTGOGOGQ T G TTTGGAGG 1680 

CIGTACTATA T6AA60CT6C ATATACTQTG AGCTGTGATT GGGGAACACC AATGCAGAGG 1740 

TAACTCTCAG GCAGCTAAGC AGCACCTCAA GAAAACATGT TAAATTAATG CTTCTCTTCT 1800 

TACAGTAGTT CAAATACAAA ACTGAAATQA AATCCCATTG GATTGTACTT CTCTTCTGAA 1860 

AAGTGTGCTT TTTGACXXTTA CTGGACATTT ATTGACTTAA TTGCTTCTGT TTATTAAAAT 1920 

TGACCT6CAA AGTTAAAAAA AAATTAAAtST TGAGAACAGG TATAAGTGCA CACTQAATAG 1980 

TCTAATCTAC ATOTAACACA TATTTTA6TA T6ATTTTCTA TACTCTAATC AGCACTGAAT 2040 

TCAGAGGGTT TGACTTTTTC ATCTATAACA CAGTGACTAA AAGAGTTAAQ GQTATATATA 2100 

CCATCACTTT GGGACTTGGT AGTATTATTA AAAGGTTATT TCCTTCACTG TCAATAAAAG 2160 

TCCAAATGTT TAGCTTAGGT CT6AGAGTCA AACAATGTTA AG6ATT6TCT TAAAGTTCCT 2220 

TAGCCAGCAA AACAAAACAA AACAAAACAA ACAAATGAAA AA06TTTAAA AAGAAGAAGA 2280 

AGAAAAAAAA CAAGAACAAG CAGCAACAGC I XJ T m XSTT G GGGCTATA6A TTTAAGTTAG 2340 

GCATAGTCAA TTTCAGAATA ACTAAGAQTG GAATATATGC ATATGGTGAA ATTATAACCT 2400 

TGCCCTTTTT TATTTGCCCT CTGCGATCCA CCTGGCTTTT TAGAAGTCTG CCQAGTGAGA 2460 

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTnCCTAC TCTGAAAAGG 2520 

GCrGGGAGCA GAATGOCTGO OCT6CTGTGA GCAGGAGAG6 A6ATTCTAAG AA6GATA6TC 2580 

CCGCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGG6 TAGGAAAGCT TOTCCTGACC 2640 

CCA6CAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 2700 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TOACCCATGG 2760 

ACACCTGTTT CATTCACTTT AQCATCACAG TGACCTTTGT ATGCTCTGTT CA6TCTGTGT 2820 

CAQGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCOC OCACCCCACC CACCX9VCCCT 2880 

GTTCCTTTTT TATCAGGAGG ACTTCAQAGC CAGGCCT6CA GCATTTTGTT TGAAAACACA 2940 

ATCAGCTCTG ACAGTTAGAC ATGCACACAG ACGCCATAGC T6GATTGGAA ACATTGATGT 3000 

TTTAAAAATT TATTTTTTTT GGAAATAGTT GC3VCAAATQC TGCAATTTAG CTTTAAGGTT 3060 

CTATAGATTT TTAACTA6TC CAACACAGTC A6AAACATTG TTTTGAATCC TCT6TAAACC 3120 

AAGGCATTAA TCTTAATAAA CCAGGATCCA TTTAG6TACC ACTTGATATA AAAA6GATAT 3180 

CCATAATGAA TATTTTATAC TGCATCCTTT ACATTA6CCA CTAAATAOGT TATTGCTTGA 3240 

TGAAGACCTT TCACAGAATC CTATGGATTG CAGCATTTCA CTTGGCTACT TCATACCCAT 3300 

GCCTTAAAGA GGGGCAGTTT CTCAAAAGCA GAAACATGCC GCCAGTTCTC AAGTTTTCCT 3360 

CCTAACTCCA TTTGAATQTA AGGGCAGCTG GCCCCCAATG TG^GAGGTC CGAACATTTT 3420 

CTGAATTCCC ATTTTCTTGT TGGCGGCTAA ATGACAOTTT CTGTCATTAC TTAGATTOOG 3480 

ATCTTTCCCA AAGGTOTTOA TTTACAAAGA GGCCAGCTAA TAGCAGAAAT CATGACCXTTG 3540 

AAAGAGAQAT GAAATTCAAG CTGTGAQCCA GGCAGGAGCT CAGTATGGCA AAGGTTCTTG 3600 

AGAATCAGCC ATTTGGTACA AAAAAGATTT TTAAAGCTTT TATGTTATAC CATGGAGCCA 3660 

TAGAAAG6CT ATGGATTGTT TAAGAACTAT TTTAAAGTGT TCCAGACCCA AAAAGGAAAA 3720 

ATAAAAAAAA AGGAATATTT GTAOCCAACA OCTAGAAGOA TTGCAAGGTA GATTTTTGTT 3780 

TTAAAATGGA GAGAAGTGGA CAGATAAGGC CATTTAATAT ATCAAAGATC AGTT6ACATC 3840 

TCTAGGGAAT GATGAAACAG CAGGCTATTA GAAAATTATT TCATATAGTT CTOSTGTTCT 3900 

TTTCTTTTTT TTAATCCCTQ AAGGOAGATC AQTAACATAG CTTCTCTTTT CTGTACTCTA 3960 

GACCACCCCT TTTCATCATT TTGCTTTTTA T6TCTCCCAT AAGAAATGTO CTTTTTASAG 4020 

CTTCCTAATG CATGTGTTGC ATTATTGCAO CATTAGAAAA 0QA6AGQXAG CATTTTT6CT 4080 

GAAATCGGGC CTGTCACTCT CCAATAAAGG TTCTGGCACT TCAATX30CAG GCAGGTCTCC 4140 

TAAATGAACA GAATGATCTG TGTGAaCC3GA TGCCTGCCCT TCCAGAGGGQ CCACT6TCCC 4200 

CAGCCGCAGC CAACTGTGTC CCACAGGAAT GGGAGCCTAQ GTTTCCAAAT CTTGTGATTC 4260 

TTTAG6AGAA ACATGAAACC TGGATTT06T GTGAAATGTC CCXaATTGTTA AAAAOTTGQC 4320 

TGAATTATTT TTAAAACATT TTGTAAGCCA ACAAAAOTCT GTGGGCTOCC AGTTTATTAC 4380 

TTTTGTCTTA AAACATGATC ATTGTTCTCT CAGG6TATCC TTCTGTCTTC C0GTT6CAAA 4440 

TTCACTTTTC TTTCTTCCTQ ACATTGCCAT TQAGGGCTTT GTTACCACAA GCTAAGAAAC 4500 

TQAGTTTAAC AGCCCAGTTA TCTGCAACAT GTCAATTACC TTTGCTCCTC TCCTGTGATT 4560 

CCCACCATGC TGTGACCCTC AGCTGTCTCC CTTTGCTGGG AATTCT6CAC CAATOTCTTC 4620 

CCCTCAACCC ATTCCCTGGT TGGTCCTACT CCCGTGTGaC CAGAGAGATC CTAfiCAAATC 4680 

CTTCCTCCTA TTATATCTGA CACTAATTTC TTTTCAACAO CGCTC31TGTC TCTTGGCCCA 4740 

GTCAGGTGCT GCCAGGTTTA GATAGGAAAG TACATGTCCC ATTTTC3VTGG GTGCCCTTAA 4 BOO 

TGTGGTCXaC GTCCTATATC TTATTATATT TACTCATGGC TGAATGGGQG CCTCCAGAGA 4860 

CCCTCTCAGO CTGCTGAGCT AGACTAAGGA AT6CATCCAC TGTCATCACA TGAGACACTG 4920 

ACTCTGT6AC GACAAAAGTA CAAACAOTCT OAGGCTAAOA AAOOTTCATC TGACAACAGO 4980 

AAAAACAAAT CTCAACACAC ATTAGAOATA ATT6ATTCAG GG6TTTTCTC TCCCAGTCTC 5040 

CCAGCAGGGA CTGATTTCAT TTCTGACCCA CTAGGTTTTC TTTCCA6AAA TAGGTAGCAA 5100 

GGACAAQAAC TAAACAATCC CAGCCCCACC CAGCAACACA GAACACAGGA GTTTGCTTTT 5160 

GGCTTCTCAC TCTCCAAGTA ACCCTGAATT AGGCCCAGAA TGGCTGAG6C TTGGAGCATC 5220 

TCCTCAGACA GAGCAGAGGC GACACCTCTT CAG Q QQTCT G TOGAGTAAAT AGCT0QAA6A 5280 

GCTGAAGACA GAAAACCAGT TTGAOGCCAO GT60GA6AQA GAGCATAATG GAGGGAAGCC 5340 

CGCTTTCTCT CTCCTCTTCT TTTCTCTTTA TTTCTTTAGA GCACTTOACT TTTTTTTCTC 5400 

TCTCTCrCTA GTATTCTAAA CTGACCCCAT GACCAACTGA GAATTTATTT TTGTTTCATT 5460 

GGTTGTTTCA CAGAATTAGA ACACACACGA CTTTTTATTC CTCCATTGCA AAATGGAATC 5520 

AAGATACTAC ACAAOACCTG TGCTITCTTC CTTTGCATGA TTTACACCTC 0GGCT6TTTT 5580 

GGTGCTA6CT GTCTAGAACT TCTCTCTTGG TTTGAATCTG ATTCCTTCAC ACTAOICTAG 5640 

AAGTTTATTT CATCTTGTTT TOTCTAGACT CCAGATACAQ AGGGACAOCT GGACTGAGGA 5700 

CAAGCAATTC CATCTAGCAT AGGGTCTCTC AGGGTTGGTG CATCCAGCCA CATGGGCAGG 5760 

GCCAGTCACA TCTAGTCTAT 6TC0CCAGAG CCCTTGQAGT TGOQCAOCTT A6CT6ACTTG 5820 

ACTCCAAGGA AATTA8TACA GAAGTAACCA CTCTATTAAG TSTGTTCTGC TATGTTCACA 5880 

TGCCTGTAGT ACCTGCAAAC CATGCCAOGT TCATCTAAA6 ACATAGGGGA AGATTAAGGA 5940 

CTCTTTTGGA CAOACCATGA ATTGAATTTG CTGCCAGGTG CTGCCAGACT GAATTTGGCT 6000 

GACAGAACTC CCAOCCCAGO AAAGTTCCAT GACAATGACT GTCGCAGAAG GAAATTTCCC 6060 

ACTAAAGTCA OTCCATTTTC AAGTTTTGGT CTTCAiQAGAC AAAA6AAC0T CCCAGCCACC 6120 

TQATTTTGAT GQTGAGGTAA CTCTAAGTTQ AATTCAGGCT AGTGTT6CAG TATAGCTTTO 6180 
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5 
10 
15 



GCATGTTCAT GAGTGAGCAC CCAGAATGTG TT6AACCAAC CCCCACCOCT AACTACTGAC 6240 

TATGACTGCA GTOQGTTTTT ATGGGGAAAA AAAGTGTGAA AAGCAAAAAG AAAGGAACAG 6300 

AGATTTTTTA TCACCTTTAT TOTAAGACAG TCCATTTATG AATTGAGTAT AAACACATAC 6360 

AA AGTAA CAA OAQMrTOCTA AGAAAOQGAA ATCCTTGAGT TTCAOGCACT TCATCTTCAA 6420 

CCAT 'imcm TAATCCAGAO GCAGCCT6TG AATCATTCTC AT6C0CTGTT li ilUVn ' i ' 6480 

TTTTCCTATA ATGTTCTGGG TTTAAAAGCC ATCTTTTOCA CS^TTTTCTOT AAATAATGGA 6540 

TAATCATTTT AAAAAGTTTT ATTTTTAGTG CTGTTTTAAC AATGTAGATA GATCATAAAT 6600 

GTACTTGCTG AATTCAATCA TTTTTAACAA GCCAATAAAO TTTOATAATT CAAAAAAAAA 6660 

AAAAAAAAAA AAAAAAAAA 6679 

Seq ID NO: CIO 9 DNA Sequence 

Kucleic Acid Accession fit NM_006180.1 

Coding sequence i 353.. 2820 ~ 



L 11 21 31 41 51 

1)1111 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAQAG TCCOGGACGC CGCCGGTCGG 60 

TGCCOGGOGC GCCGGGCCAT 6CAGCGACGG CCGCC6C6GA GCTCOQAGCA GOGGTAGOQC 120 

CCCCC TGTAA AGCGGTTCQC TATGCCGGOA CCACTCTGAA CCCXQOOGOC TGOOQGAACA 180 

ZU CTCTTOSCTC CGGACCAGCT CAOCCTCTGA TAAGCTGGAC TOGGCACGCC CGCAACAAGC 240 

ACCGAGQAGT TAAGAGAGCC GCAAGCGCAG GGAAG6CCTC CCGGCA0GG6 TGGGGOAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAQGCACTCG GGCTGGCACT GGCTQCTAGG GATGTCQTCC 360 

TGQATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGQ GCTTCTGCTG GCTGGTTQTG 420 

^- GGCTTCTGGA OGGCCGCTTT C6CCTQTCCC ACOTCCTOCA AATGCAOTGC CTCTOSGATC 480 

ZD TGGTQCAGC6 ACCCTTCTCC TGGCATOGTG GCATTTCCGA 6ATTQ6A0CC TAACAQTGTA 540 

QATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

QAAGATQATO TTGAAGCTTA TGTGGQACTG AGAAATCTGA CAATTGTGQA TTCTGGATTA 660 

AAATTTGTGO CTCATAAAQC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

- _ OGAAACAAAC TGAOGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTQACTT OTCTQAACTQ 780 

ATCCIGGTGG OCAATCCATT TACAIGCTCC TGTGACATTA T0T6GATCAA GACTCTCCAA 840 

6AGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTOOCCGCA 960 

CCTAACCTCA CTGTGGAGQA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020 

^- OOGGTTCCTA ATATGTATTQ GGATGTTGOT AACCTGGTTT CCAAACATAT QAATGAAACA 1080 

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT COGATGACAO TGGGAAGCAG 1140 

ATCTCTT6TG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCr CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT OGTTCTATAA GGGGGCAATA 1320 

TTQAATQAGT CCAAATACAT CrGTACTAAA ATACATGTTA CCAATCACAC GQAOTACCAC 1380 

4U GGCTGCCrCC AGCTGGATAA TCCCACTCAC ATGAACAATG GQOACTACAC TCTAATAGCC 1440 

AAGAATOAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATQQG CTGQCCTGGA 1500 

ATTGAOGATG GTGCAAACCC AAATTATCCT GATQTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCX5AATGACA TCGGGGACAC CACGAACAGA AOTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

. _ GA TAAAAO OS GTOGQGAACA TCTCTOGGTC TATGCTG7GG TGGTGATTGC GTCTGTGGTG 1680 

Hj GGATTTTGCC TTTTGGTAAT GCTOTTTCTO CTTAAGTTQG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT QATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AQATCCCTGT CATTGAAAAT CCCCAGIACT TTGGCATCAC CAACAGTCAO 1920 

CTCAAGCCAG ACACATTTGT TCAOCACATC AAOCGACATA ACATTGTTCT GAAAAGGGAG I960 

DU CTAGGOGAAG GAGCCTTTG6 AAAAQTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AQTQAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

QACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATOAQCACAT OGTCAAGTTC 2160 

TATGGCGTCT 0C6TG6AGGG OGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATQGG 2220 

QACCTCAACA AGTTOCTCAG G6CACA0Q0C OCTGATOCOQ TGCT6ATQ6C TGAGGGCAAC 2280 

03 C06CCCACG0 AACTGAC6CA OTCQCAGATQ CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA OCACTTOGTG CACCGOGATT TGOCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CQTGTACAGC 2460 

ACTGACTACr ACAGGGTOGG TQGCCACACA ATGCTGCCCA TTCGCTQGAT GCCTCCAGAG 2520 

AOCATCATGT ACAGQAAATT CAGQACX3GAA AGOQACGTCT GGAGCCTG6Q GGTCGTGTTG 2580 

OU 1X3GGAGATTT TCACCTAT6G CAAACAQCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGG6CCG AGTCCTGCAG OGACCCCGCA OGTOCCCCCA GGAGGTGTAT 2700 

GAGCTQATGC TGGQGTGCTG GCAGCQAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CA TACCCTCC TTCAGAACTT G GCCAA GGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCGCAA GGTACTCCTC AGA06GGC70 AGAGGATGAA 2880 

OD CATCTTTTAA CTGCCGCTG6 AGGCCACCAA GCT6CTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CC6AGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTAT TGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTQ GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTOGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTQAATCAAT CTG0CTTCT6 CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

/v CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAOT TTGCCCACCA CAACTAACAA 3240 

TGCC TTGT TQ TATTCCTGCC TTT6ATGTGQ ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT OGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACT6TTCT TATTGTTTTT GGATGGCTTA AGCCTGTOTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTQA AQCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

/j ATTTATTATG AACC6CAATA TGGGAGGAAC AAAGACAACT ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAOCAACTGT TAGCTGGGAA GAATGTATTC GOCACCTTOC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TQTGCCATGO ATGATTCTTT 3660 

TCCCATCACC AGAAAT6ATA GCGTQCA6TA GAGAGCAAAG ATGGCTT 3707 

80 8eq ID NO: CllO DNA Sequence 

Nucleic Acid Accession 8 s Eos sequence 
Coding sequence: 93 9.. 2 3 72 

1 11 . 21 31 41 51 
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AAQAOGGATT CTCAGACAAG GCTTGCAAAT GCCCC6CAGC OVTCATTTAA CTGCACCCGC 60 

AGAATA6TTA OQOTTTGTCft COOQACCCTC CCX3QAT0GCC TAATTTGTCC CTAOTGAOAC 120 

. GCX3GAG6CrC TGCCCGCGCC TO6CTTCTTC GTAGCTGGAT GOIT A TOOTG CTCGGGGCAa 180 

J OQG66GC3QCA GGGCAGGOn' TGGOGCAOIC CCTA6CACAC ATGAACAOGC GCAAGAGCTG 240 

AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACCGOGATT GTAGAAGAGA 300 

CT6TGGT0TG AATTAGGQAC CGGGAGGCGT COAAOGGAGQ AA0G6TTCAT CTTAGAGACT 360 

AATTTTCTGG AGTTTCTOCC CCTGCTCTGC 6TCAGCCCTC AOGTCACTTC GCCA6CAGTA 420 

^ , GCAOAGGOGG 0GG000CG6C TCCX3GGAATT GGGTTGGAGC AGGABGCT06 CTGGCTGCTT 480 

lU \ GGCT06CGCT CTAOQOGCTC AGTCCGOOaC GGTAGCAGGA GCCTGGACCC AGGOGGCGGC 540 

GGOGGGCGTG AGGCX3CCXWA 6CXX3BGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT 600 

AACAAGGAAT CTGCGCCCCA GAGAGTCXXQ GGAGOGCCGC CGGTCGGTGC CCGGCGOSCC 660 

GGGCCATGCA GC GAQGG CCG C06G6GAGCT CCGABOIGCG GTAGCGCCCX: CCTOTAAAQC 720 

I g GGTXCXSCTAT GCGGGGGCCA CTGTGAACXC 7GCGGCCTGC G6GAAGACTC TTC6CTCGG6 780 

ID ACCAGCTCA6 CCTCTGATAA GCTQOACTOQ GCAGGCCGGC AACAAGCACC GAGGAGTTAA 840 

GAGAGCOGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGO GGGAAAGCGG CCGGTGCAGC 900 

GCOGGQACAO GCACTCGGGC TGQCACTGGC TQCTAGGGAT GTOGTCCTGG ATAAGGTGGC 960 

ATGGACCOGC CATGGOGOGG CTCTGGGGCT TCT6CTGGCT GGTTGTGGGC TTCT6GA6GG 1020 

COGCTTTOGC CTQTCCCSkCG T0CT6CAAAT GCMITGCCTC TOGGATCIGG TGCAOOGACC 1080 

ZU CTTCTCCTGG CATOQTGGCA TTTC0QA6AT TGGAGCCTAA CAGTGTA6AT CCTQAQAACA 1140 

TCACOSAAAT TTTCATOGCA AACCAGAAAA GGTT A GAAAT C3VTCAACGAA GATGATGTTG 1200 

AAGCTTATGT GGGACTGAGA AATCTGACAA TTGTGGATTC TGGATTAAAA TTTGTGGCTC 1260 

ATAAAGCATT TCT6AAAAAC AGCAACCT6C AGCACATCAA TTTTACCCGA AACAAACTGA 1320 

^. OGAGTTTOTC TASGAAACAT TTCOGTCAOC TTGACTTGTC TGAACTGATC CTGGTG6GCA 1380 

25 ATCCftTTTAC AT6CTCCTGT 6ACATTATGT G6ATCAAGAC TCTCCAAGAG 6CTAAATCCA 1440 

QTCCAGACAC TCAOGATTTQ TACTGCCTOA AT6AAAGCAQ CAAGAATATT CCCCTGGCAA 1500 

ACCTGCAGAT ACCCAATTGT GGTTTGCCAT CTGCAAATCT GGCCGCACCT AACCTCACTG 1560 

TGGAGGAAGG AAAGTCTATC ACATTATCCT GTAGTGTGGC AGGTGATCOS GTTCGTAATA 1620 

^ TGTATTGGGA TGTTG6TAAC CTG6TTTCCA AACATATGAA T6AAACAAGC CACACACAGG 1680 

30 GCTCCTTAAG GATAACTAAC ATTTCATCCG ATGACAGTGG GAAOCAGATC TCTTGT6TGG 1740 

CGGAAAATCT TGTAGGAGAA GATCAAGATT CTGTCAACCT CACTGTGCAT TTTQCACCAA 18 OO 

CTATCACATT TCTCGAATCT CCAACCTCAG ACCACCACTG GTGCATTCCA TTCACTGTGA 1860 

AAGOCAACCC CAAACCAGCG CTTCAGTGGT TCTATAAGGG GGCAATATTG AATGAGTCCA 1920 

AATACATCTG TACTAAAATA CAT6TTACCA ATCACACGGA GTACCAOGGC TGCCTCCAGC 1980 

35 TGGATAATCG CACTCACATG AACAAT6GGG ACTACACTCT AATAGCCAAO AATGAGTATG 2040 

GGAAGGATGA GAAACAGATT TCTGCTGACT TCATGGGCTG GCCTGGAATT GACGATGGTO 2100 

CAAACCCAAA TTATCCTGAT GTAATTTATQ AAGATTATGG AACTGCAGCG AATGACATCG 2160 

GGGACACCAC GAACAGAAGT AATGAAATCC CTTOCACAGA OHtlACTGAT AAAACCGGTC 2220 

. _ GGGAACATCT CTCX30TCTAT GCTGTGGT6G TGATTGCGTC TGTGGTGGGA TTTTGCCTTT 2280 

40 TGGTAATGCT QTTTCTGCTT AAQTTGGCAA GACACTCCAA GTTTGGCATQ AAAGGTTTTa 2340 

rrrrGTTTCA TAAQATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGQ 2400 

6GCTGTGGT0 CTTGTTOGTT OATGCTGCCA TQTAAGCTGG ACTCCTGGGA CTGCTGTTGO 2460 

CTTATCCOGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCX3 GTGTTTGGAO 2520 

GCTGTACTAT ATOAAGCCTQ CATATACTGT GAGCTGTGAT TGGGGAACAC CAATGCAQAQ 2580 

45 GTAACTCTCA GGCAGCTAAG CAGCACCTCA AGAAAACATO TTAAATTAAT GCTTCTCTTC 2640 

TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCX3VTT GGATTGTACT TCTCTTCIGA 2700 

AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 

TT6ACCTGCA AAGTTAAAAA AAAATTAAA6 TTGAGAACAG GTATAA6TGC ACACTGAATA 2820 

- GTCTAATCTA CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 

50 TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940 

AOCATCACTT TG66ACTTG6 TAGTATTATT AAAAGGTTAT TTCCTTCACT GTCAATAAAA 300O 

GTCCAAATQT TTAGCTTAGQ TCTOAGAGTC AAACAATOTT AAOGATTOTC TTAAAOTTCC 3060 

TTA6CCAGCA AAACAAAACA AAAC»AAACA AACAAAltSAA AAAOGTTTAA AAA0AA6AAG 3120 

AAGAAAAAAA ACAAGAACAA GCAGCAACAO CrOTTTTGTT GGGGCTATAO ATTTAAGTTA 3180 

55 GGCATA6TGA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 

TTGGCCTTTT TTATTTGCCC TCXGGGATOC ACXnXSCTTTT TAGAA8TCTQ OOGAGTGAGA 3300 

A06CCACA6T ATCTCATGCT 6TTT6CATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 
CCT6GGAGCA GAATGGCT6G CCTGCTGTGA GCAGGAOAGG AGATTCTAAG AA6GATAGTC 3420 
CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 3480 

OO CCAGCAQCAA AGAGGTQGCA 66T06CTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540 
TGCTQAiQAGG GCAGCCTTAlG AGCTOTGGAT TTCT6CATCC CCOCTGAGTC TGAOOCATGG 3600 

ACAOCTOTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATQCTGTGTT CAGTCTGT6T 3660 
CAGGCAGTAT GCTTGTOCTG AAGAGAGGTT TGGCTATCCC CACOCCACCC CACCCCACCC 3720 
TQTTCCTTTT TTATCAGGAG GACTTCAQAa CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780 
65 AATC AOCTCT GAGAGTTAQA CATGCACACA GACX3CCATAG CTGGATTGGA AACATTGAT6 3840 
TTTTAAAAAT TTATTTTTTT TGGAAATA6T 7GCACAAAT0 CTGC3UVTTTA OCTTTAAOOT 3900 
TCTATAOATT TTTAACTAOT OCAACACAOT CAGAAACATT OTTTTGAATC CTCTGTAAAC 3960 
CAAGGCATTA ATCTTAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 4020 
--^ TCXa^TAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATAOQ TTATTGCTTG 4080 
70 ATQAAOACCr TTCACAGAAT CCTATGGATT GCAGCATTTC ACTTGOCTAC TTCATACCCA 4140 
TGCCTTAAAG AGGGGCA6TT TCTGAAAAGG AGAAACATGC 06CCAGTTGT CAA6TTTTGC 4200 
TCCTAACTGC ATTTGAATGT AAGGGCAGCT GGCCCCCAAT 6T6G6GAGGT CGGAACATTT 4260 
TCTGAATTCC CATTTTCTTG TTCGCGGCTA AATGACAGTT TCTGTCATTA CTTAGATTCC 4320 
GATCTTTCCC AAAGGTGTTG ATTTACAAAG AGGCCAGCTA ATAGCAGAAA TC3VT6ACCCT 4380 
75 GAAAGAGAGA TGAAATTCAA GCTGTGAGCC AGGCAGGA6C TCA6TATGGC AAAGGTTCTT 4440 
GAGAATCAGC CATTTGGTAC AAAAAAOATT TTTAAAGCTT TTATGTTATA 0CATG6AGCC 4500 
ATAGAAAGGC TATGGATTGT TTAAGAACTA TTTTAAAGTG TTCCAGACCC AAAAAGGAAA 4560 
AATAAAAAAA AAGGAATATT TOTACCCAAC AGCTAGAAGG ATTGCAAGGT AGArr mtJT 4620 
TTTAAAATGG AOAGAAGTGG ACAQATAAGQ CCATTTAATA TATCAAAQAT CAGTT6ACAT 4680 
oO CTCCTAGGGA ATGATGAAAA CAGCAGGCTA T 4711 

Seq IZ> NOi Gill DNA Sequence 
Nucleic Acid Acceosion «: NM_130830.1 
Coding oequencei 1..1746 " 
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ATGCCACTGA AGCATTATCT CCTlTl ' GCi ' G GTCQGCTGOC AAOCCTGGGG TGCAGGGTTG 60 

GCCTACXyVTG GCTQCCCTAQ OGAGTGTAOC TOCTCCAGGG CCTCOCAGGT GGACTGCACC 120 

GGQGCACX3CA TTGTGGCGGT GCCCACCCCT CTCCCCTGGA ACGCCATQAG CCTGCAGATC 180 

CTCAACACX3C ACATCACTGA ACTCAATGAG TCCCXOTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGOA TTGAGAAQAA TOAGCTGTCO CGCATCAOSC CrGGGGCCTT CCGAAACCTX3 300 

G0CT06CTGC GCTATCTCAO CCTOGCCAAC AACAAGCTGC A GQ TTCT G CC CATOSGCCTC 360 

TTOCAOGGCC TGGACAOCCT TGAGTCTCTC CTTCTGTCCA GTAACCA6CT QTTGCAOATC 420 

CAGCOGGCCC ACTTCTCCCA GTGCA6CAAC CTCAAGGAGC TOCAGTTGCA OGGCAACCAC 480 

CTGGAATACA TCCCTGAOGG AGCCTTCOAC CACCTGGTAG GACTCAC3GAA GCTCAATCTG 540 

GGCAAGAATA QCXTTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 

GT0CTC08GC TGTATGA6AA CAGGCTCAOG GATATCCCCA TGGGCACTTT TOATGGGCTT 660 

GTTAACCTGC AOOAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGQTCTC 720 

TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 

CCACCCAOCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 640 

CTGAAGOAGC TCTCTCTGGG GATCTTOGGG CCCATGCCCA ACCTGOGGGA GCTTTGGCTC 900 

TAT6ACAACC ACRTCTCTTC TCTAGOOBAC AATGTCTTCA GCAACCTCOO CCAGTTOCAa 960 

GTCCTOATTC TTAGCOGCAA TCAOATCAGC TTCATCTCCC aJ Utf iXaLXriT CAAOGGGCTA 1020 

AOGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGQACCTGGA CGGGAATGTC 1080 

TTCCGCATGT TGGCXAACCT GCAGAACATC TCCCTGCAGA ACAATOGCCT CAGACAGCTC 1140 

CCAGGGAATA TCTTOGCCAA CGTCAATGGC CTCATGOCCA TOCAOCTGCA GAACAACCAG 1200 

CTGGAGAACT TGCGCCTOQG CATCITOGAT GACXntSGGGA AACTGTGTGA GCTGOGGCTG 1260 

TATQACAATC CCTGGAeGTG TGACTCftGAC ATCCTTCCOC T0C6CAACT0 GCTCXTTGCTC 1320 

AACCAGCCTA OOTTAGGGAC GGACACTGTA CCTGT6TGTT TCAGCCCA6C CAAT0TCXX3A 1380 

GGCCAQTCCC TCATTATCAT CAATGTCAAC OTTGCTGTTC CAAGCGTCCA TCTCCXTrOAG 1440 

G TGCC TAOTT ACCCAGAAAC ACCATGGTAC OCAGACACAC CGAOTTACCC TOACACCACA 1500 

TCGGTCTCTT CTACCACTGA GCTAAOCAOC CCTGTOaAAlQ ACTACACT6A TCTGACTACC 1560 

ATTCAGGTCA CTGATGACCG CAGCX3TTTGG GGCATGACXX: AGGCCCAOAO CGGGCTGGCC 1620 

ATTGCCGCCA TTGTAATTGG CATTOTOGCC CTGGCCTGCT CCCrOGCTGC CTGC6T0GGC 1680 

T6TTGCTGCT GCAAGAA6AG GAGCCAAGCT GTCCTGATGC AGATGAAGQC ACCCAATGAG 1740 

TGTTAA 1746 

35 Seq ZD NO I C112 UNA Sequence 

Nucleic Acid AccesBion #t NM_002658.1 
Ooding sequences 77.. 1372 " 

j^r, ^ ^1 21 31 41 51 

40 I I I I I I 

GTCCCCGCAG 0GCXX3TCGCG CCCTCCTGCC OCAGGCCACC GAGGCOGCOG CCGTCTAGCG 60 

CCCCOACCTC OCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAGOQACTCC AAAGGCAOCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 

TaaAGGAACA TOTGTCTCC A ACAAGTACTT CTCCAACATT CACTQGTGCA ACTGCCCAAA 240 

GAAA7TCGGA GGGCAOCACT GTGAAATAGA TAAOTCAAAA ACCTQCTATG A6GGGAATGG 300 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACXaTGGGC COGCCCTGCX: TGCCCTGOAA 360 

CTCT6CCACT OTCCTTCAQC AAAOGTACCA TGCCCACAGA TCTOATGCTC TTCAGCTGGG 420 

CXrrOGGGAAA CATAATTACT GCAGGAACCX: AGACAACOGO AGGGQACXXn* GGTGCTATGT 480 

GCAOOTGGGC CTAAAGCOSC TTGTCCAAGA GTGCAT6GTG CAT6ACT00G CAGATGGAAA 540 

AAAGCCCTCC TCTOCTCCAO AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

COGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCX5AG AACCAOCOCT GGTTTQC3GGC 660 

C ATCTA CAGG AGGCACCXKSG GGGQCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 

CGCTTGCTGG 6TGATCA606 CCACACACTO CTTCATTGAT TACCCAAAQA AGGAGGACTA 780 

CC CATCGTCTAC CTOGGTOQCT CAAGGCTTAA CTCCAACACG CAAGOGGAGA TGAAGTT7GA 840 

G6T6GAAAAC CTCATCCTAC ACAAG6ACTA CAGCXSCTGAC AOGCTTOCTC ACCACAACXSA 900 

CATTGCCTTG CTGAAGATCC OTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCOGQACTAT 960 

ACAQACCATC TGCCTQCCCT CGATGTATAA CGATCXXXywS TTTGGCACAA GCTGTGAGAT 1020 

CACTGGC TTT QGAAAAGAGA ATTGTACOGA CTATCTCTAT COGGAGCAGC T6AAAATQAC 1080 

TGTTQTGAAO CTGATTTOCC ACOGGOAGTQ TCAOCAGCCC CACTACTAOB GCTCT G AAGT 1140 

CACCACCAAA ATGCTATGTG CT6CI6ACCC CCAATG6AAA ACA6ATTCCT GCCAGGQAGA 1200 

CTCAGGGOGA CCCCTOQTCT GTTCCCTCCA AG6CX»CATG ACTTTGACTG GAATTGTQAG 1260 

CTGG GGCCGT GGATGTGCCC T6AAGGACAA GCCAGGCGTC TACACXSAGAO TCTCACACTT 1320 

CTTACCCT60 ATCGQCAGTC ACACCAAGGA AQAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

AGGQA6GAAA GGGGCACGAC CC6CTTTCTT GCTGOTTGTC ATTTTTGCAG TAOAGTCATC 1440 

TCCATCAGCT GTAAGAAQAG ACTG66AAGA TAGGCT C TGC ACAGATGGAT TTGCCTGTGG 1500 

CACCACCAGO GTGAACGACA ATAGCTTTAC CCTCAOOOAT AGGCCTGGOT GCTGGCTGCC 1560 

CAQACCCTCT GGCCAGGATG GAOGGOrGGT CCTGACTCAA CATGTTACTG ACXaVGC3UlCT 1620 

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTGQAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT T6TGAGGCGC ATGGTTGAOA 1740 

AATGAATAAT TTCCXyUlTTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAG6 GAGCTTAGCC 1800 

AATGT06GAG CAGOGGTTTG GGGAGCAGAG ACACTAAOGA CTTCAGGGCA GGGCTCTGAT 1860 

ATTCCATGAA TGTATCAGGA AATATATATQ TGTGTGTATQ TTTGCACACT TGTTGTGTGG 1920 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTQAITG TTAAGTCTAA ATATTTCCTT 1980 

AAACTOTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 

CTGGGGCCTC TTGGGTCCCC CACGT6ACAG TOCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAOTT 2160 

ATCCCTTCCT TTTAG CCTAG TTCATCCAAT CCTCACTGOa TGGGOTGAGG ACCACTCCTT 2220 

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTO 2280 

ATCAATAAAA TOTGATTTTT CTGA 2304 



45 
50 



60 
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TTTACMSATA AAACTGGTAC ACTCACAGAA AATOAGATCC AGTTTCOGQA 60 
ATCTTOVATT AATGGCATGA AATACCAAGA AATTAATGGT AGACnOTAC CCGAA^C 120 
:> AACACCAGAC TCTTCAGAAG GAAACTTATC TTATCTTAOT ACHTTATCCC ATctJ^ III 
CTTATCCCAT CTTACAACCA GTTCCTCTTT CAGAACCAOT CCTGAAA^ AaSJ^ III 

QJATCTTCAA ACTGACTCCA CTG6TQATG0 TCCCTGGCAA TCCAACCTGG CACCATCGCA 360 
GTTGGAOTAC TATGCATCTT CACCAQATGA AAAGQCTCTA CTA^ 420 

^ TTTATTOGCA ArrCTQAAGA AACTATCGAG GTTAAAACTC TloStllSJJ 480 

fSiSS^^S ii^CTGCTTC ATATTCTGGA ATTTGATTCA GATCOTAGOA GAATCAGTCT S40 
AATTCTTCAa GCACCTTCAO 6TSA0AAGTT ATTATTTGCT AAAGGAGCTO AOTCATCAAT 600 
SEIS?????^ GAGAAATAGA AAAAACCAGA ATTCATOTAQ ATCAATTTGC 660 

1 S "TGAAAOGG CTAAGAACTC TGTGTATAGC ATATAGAAAA TTTACATCAA AAOAQTATQA 720 

GGCAGCTGTT TTCGAGTTCA TAGAGAAA6A CCTGATATTA CTTGGAGCCA CAGCAGTAGA 840 

JSSSS^ SiJSilSS^ 

^SlS^i CTTACT6GGG ATAAACATQA AACAQCTGTT AGTOTGAQTT TATCATCTCG 960 

90 CCATTTTCAT AGAACCAT6A ACATCCTTGA ACTTATAAAC CAGAAATCAG ACAGCOAaTO 1020 

GCTGGTAGTQ QATOGQACCA GCCTATCTCT TGCACTCAGG GAGCATOAAA AACTATTTAT 1140 

GQAAGTTTGC AGAAATTQTT CAGCIGTATT ATCCTOTCGT ATGGcS tcS^S^ Htl 

i^SI^Tii AAATATCACC TOAGAAACCT ATAACATTOG CTGTTGGTGA 1260 

TGGTGCTAAT GACGTAAQCA TGATACAAGA AGOCCATGTT GQCATAGGAA TCATGQOTAA 1320 

GAAACAGTGA CTATGCAATA GCCAGATTTA AGTTCCrCTC 1380 

CWWLTTG^ TTTGTTCATO GTCATTTTTA TTATATTAGA ATAGCTACCC TTOXACAGTA 1440 

TTTTTTTTAT AAQAATQTGT GCTTTATCAC ACCCCAGTTT TTATATCAGT TCTACTOTTT 1500 

GTTTTCTCAG CAAACATTGT ATQACAGOGT OTACCTQACT TTATACAATA TTTOTTTTAC 1S60 

TTCCCTACCT ATTCTGATAT ATAOTCXTTT GQAACAGCAT GTA6ACCCTC ATOTGTTACA IMO 

TCTTTATTGG ACCATCCTOQ OCTTCAGTCA TGCCTTTATT TTCTTTTTTG QATCXTTATTT 1740 

T^??7^ AAAGATACAT CTCPGCTl'QU AAATGGCCAG ATGTTTGGAA ACTGGACATT 1800 

J^SiS^ ^rSnSf"^ TCATGGTTAT TACAGTCACA GTAAAGATGG CTCTGGAAAC 1860 

35 I^^T^I^ ACCATCTCGT TACCTGGGGA TCTATTATAT TTTATTTTGT 1920 

i^^IISE^ TTTTATGOAG GQATTCTCTG GCCATTTTrra GGCTCCCAGA ATATOTATTT 1980 

^g^gCT GT CAAGTGGTTC TGCTTGGTTT GCC3^TAATCC TCATGGTTGT 2040 

TTTCTTGATA TCATAAAGAA GGTCTTTGAC CX3ACACCTCC ACCCTACAAG 2100 

TJ^TOAAAAG OCACAGCTTA CTGAAACAAA TX3CAGGTATC AAOTGCTTGQ ACTCCATOTC 2160 

CTGTTTCCOQ GAAGQAQAAG CAGCOTGTOC ATCTCTTGGA AOAATOCTOa AAOGAOTTAT 2220 

40 AGGAAGATGT AGTCCAACXX ACATCAGCAG ATCATGGAGT GCATCQGWC ctScTATAC 2280 

CAACQJ^ AGCATCTTGA ^280 

TAOTACTTTG TOGOAOCCAG TTCACCTCCT TTCCTAAAAT TCAGTGTQAT CACCCTOTTA 2400 

^ISSSif^S J^"™^ ATTAATTTCC AAAATCTTTG TAOTAGTTCA TACCCACTCA 2460 

. GAGTTATAAT GGCAAACAAA CAGAAAGCAT TAGTACAAOC C0CTC0C3UU: ACCCTTAATT 2520 

^ CATGTTAAAA TTTGAGAATA AAGAOACATT TTTCAOTCT tStCTGGOT IIH 

TGTCCCTTOT GCTTATGOGA CTCCTAATCG CATTTCAGTC TCTTOCTGAG GCCATTA™ 2640 

JSJJSJ^— AAGAGAQAAA TCTTAGTAAA QAGTArrrTT TAOTATTAGC 2700 

I^JJJJIS ^"^ ^^ TTAAATCTGC TTCTGTAAAT TATGCTCAAA GTTTCCCITO 2760 

50 JS^f^I H^^^ GAGTTATATT TAAAOCTTTT CAW5GQAAAA GTTAATCTCA 2820 

J^^^^S^ ATTTTGGTCC CTCAGTCACC TGTOTTOTTA ATTCATTAAT GCATTCTQAG 2880 

SSSSS^ ^a^^^iSS^ AATCATTTCC AACCATTATT TACTCCAGTA TGGGGAGTAA 2940 

^JS^SSS SJISS^f^^i CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000 

CAAOGGTATA TCATATATAC AAATCAGGAA TCAGGTCOGT TCACCQAACT TCAAATTGAT 3060 

55 '^^^STGA CAGAGTATAA AGACOTATA HH 

' I^^IiT'' ""^TTTAAT QTCTTTATCA TTGGATCTTT TOCAT6CTTT AATCI^A 3180 

n^r^I^ j^^CTTTT TTTCTCTTTA CCTGAAGGCT CTGTOTATAG TATTTCATQA 3240 

5?JS22?ri SJST'^" ATATCAATAA AAAGTTTGGA C3W3TATTTAA ATATTGCAAA 3300 

r^^^:^^ TATACAAATC AGAATAGTAT GGGTAATTAA ATGAATACAA AAAOAAGAGC 3360 

60 AGCCGACTTA OACATGCTCT TCCCTTTCTA TAAOCTAGAT TTT^AATAA 3420 

OU AGGGTTTCAG TTAATAATCT TATTTTCAGO TTATOTCATC TAACTTATAO CAAJ^TCA 3480 

CAATACAGTG AGTTCTGCCA GTCTCCCAGT ACAAGGCATA TTTCAGGTCT GGCTGTCGAA 3540 

JflJSS^!^ ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAQAATGAT 3600 

TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3660 

65 JJ^m^IS^ OTAGTTCAGT ATTGTCATTT GTTTTTOTTT TATTOAAAAO TAATOTTOTC 3720 

TTAAGATTTA GAAOTGATTA rTAGCTTGAO AACTATTAOC CAGCTCTAAO CAAATAATQA 3780 

USSJiSiSil jmrAAGATAA TGGTTAAATG 0C5GTTTTACC AAGTTTTCCC T7X3AAAATX3T 3840 

f^J^^^^ T^J^^**^^ TTGTGCAGCC CTAAGCTTCC TTCCCATTTC ATQAATATAA 3900 

??S^Si ^^^"^ CAGGGQAAAG AATGGTAGAG ACAQAAATTA AOACTTTATC 3960 

70 CrrGTTTGCT TGTAAACTAT TATTTTCTTO CTAATOTAAC ATTTCTCTOT TCCAQTCATC 4020 

S^IiH^^ AGCTGGATTT AGQAAQATCT GTTATTCTGG AAGTACTAAA AAQAATAATA 4140 

CAAOOTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA GAGGAAATAA TGAAGATATA 4200 

CTCAGTAGAG TACTAGGTGG GAGQATATGG AAATTTGCTC ATAAAATCTC T^AAAACG 426^ 

75 J^V^'^^ AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATOACC QTGTTTAm 432^ 

«^CAOC T03CCTATAA TGGTGCTTAA AGTCTQATTG 4440 

ATGTAATTTT CTGTACTCAC CATTTGAAGT TAGTTAAGOA QAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTQAACTOTT ACTCCAAATC CACTCCGTTT 4S60 

80 TTAAAGCAAA ATTATCTTGT GATTTTAAGA AAA0A6TTTT CTATTTATTT AAGAAAOTAA 4620 
5U CAATGCAOTC TOCAAQCrTT CAGTAOTTTT CTA6TCCTAT ATTCATOCTO TaSaCTCTT 
ACTAOGTAAC CAGTAATCAC AAGGAAAGTO TCCCCTTTGC ATATTTCTTT AAAATTCTTT 
CTTTOOAAAG TATOATGTTG ATAAITAACT TACCCTTATC TGCCAAAACC AGAGCAAAAT 
GCTAAATAOO TTATTGCTAA TCAGTGOTCT CAAATCQATT TGCCTCCCTT TGCCTCGTCT 
OAGGGCTOTA AGCCIOAAGA TAOTOGCAAG CACCAAGTCA 6TTTCCAAAA TTOCCCCTCA 



4680 
4740 
4800 
4660 
4920 
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GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGOGTAGGCT CACCCTGGGC 4980 

GGAGCAAAGT ATGGGCCAGG GAGAACTACA GCTACGAAGA CCTOCTCTCQ AGTTGAGAAA 5040 

AGGGGAGAAT TTATGGTCTG AATTTTCTAA CTGTCCTCTT TCTTG6GTCT AAAGCTCATA 5100 

5ATACACAAAG GCTTCCAGAC CTGAOCCACA 0CCA06CCCT ATCCTQAACA GGAGACTAAA 5160 

OUZAGGCAAA TCAAOCCTAQ GAAATACTTO CATTCTGCCC TACGOTTAOT ACCRGGACTG 5220 

AGGTCATTTC TACTGGAAAA GATTGTGACSA TTQAACTTAT CTGATCGCTT GAGACTCCTA 5280 

ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAa AGCCAAAAOT TTTTAAAATA 5340 

TGCT ACTCT G AAAAATCTOG TGAAGGCTGT AGGAAAAGGG AGAATCTTCC ATGTTGGTGT 5400 

TTTTCCTGTA AAGATCAGTT TttGGOTATGA TATAAGCA60 TATTAATAAA AATAACACAC 5460 

lU CAAAGAGTTA OGTAAAACAT GTTTTATTAA TTTTGGTCCC CAC6TACAGA CATTTTATTT 5520 

CTATTTTGAA ATQAOTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580 

TGTGAAATAA CTTGAAT6TT GTTCCTATAA AAAATA6ATC ATAACTCATQ ATATGTTTGT 5640 

AATCA TGQTA ATTTA GATTT TTATGAGGAA TGAGTATCTO GAAATATTGT AGCAATACTT 5700 

^ - GGTTTAAAAT TTTG6ILCCTG AGACACTGTG GCTGTCTAAT OTAATCCTTT AAAAATTCTC 5760 

Ij TGCATTGTCA GTAAATGTAG TATATTATT3 TACAGCTACT CATAATTTTT TAAAOTTTAT 5820 

GAAGTTATAT TTATCAAATA AAAACTTTOC TATAT 5855 

8eq ID NO: C114 OKA Sequence 
^ Hucleic Acid Acceesion ft: XM_0B7461.1 

Z\) Coding sequences 236.. 1138 " 

1 11 21 31 41 51 

1 1 I I I i 

^_ C066CGC0GG GG6C3G60GGG GAACCXTAAA CGCAACOGGG TCT06AGGGA TCCXrOGCGCC 60 

ZD 6A6CCAGCCG COGTCACCGC CTCC6CGCCX3 CCXX:TQ0GGG CTTGGCAGGC GCCCQGOGOQ 120 

CCCGCACTQC QCCCX3GCCGC CGGCTCCCGC GGTCCCACCG TGAOCTCGCC GGCCOQTCGC 180 

CCX3CTCGCCA TOCAACCGCC GCCGQCCTOQ CGOSCOTAGG CGCCCGCCGC AGGCCAT6CT 240 

6CCCCTGCTC GCOGCGCTCC TGGCCGCOGC CTGCCCGCTG COSCCCOTCC GCGGOSGGGC 300 

GGOGGAOGOG CCOGGCCTCC TOGGGGTGCC CTCCAATGCT TCAGTCAACO CGTCCTCCGC 360 

G6CGAGCCCA TC6CCCCGCG GCTGCTGOCC TCXK30GGCCC CXZGGGCCCCC OGAGCGGCCG 420 

GGCCCGGAGG AGGCGGCGGC GGCGGCGGCG CCTGTGCAAC ATCAGGGT6C AGOGOCAOAT 480 

GCTGAGCTCG CTGCTCOTQC GCTGGGGCCG CXXX3CGGGGC TTCCAGTGCG ACCTACTOCT 540 

CTTCTCCACC AAOGOGCAOG GCCGCGCTTT CTTCGCCGCC GCCTTCCACC GCGTOGGGCC 600 

GCCOCTQCTC ATOGAOCACC TGGGGCTGGC GGOGGGOGGC GOQCAGCAGG ACCTGC2GCCT 660 

CTGCGTGGGC TGCGGCTGGG TGCGCGGTCG CCGCACOBGC OGCCTOCJQGC CCX3CCGCCGC 720 

CCCCAGCGCC GC06CCGCCA COGCCX3G6GC GCCCAC0606 CTGCCAGCCT ACCC080GGC 780 

CGAGCOGCCC GGGCCGCTGT GGCTGCAG60 OaAOCOGCTQ CATTTCTSCT 6CCTAGACTT 840 

CAGCCTOOAO GAGCTGCAGG GOGAGCCGGG CTGGOGGCTG AAC06TAAGC CCATTGAOTC 900 

CA0GCTG6TG GCCTGCTTCA TGACXXTTGCT CATOOTGOTG TGQAGOGTGO CCGCCCTCAT 960 

CTGGCOGGTO CCCATCATCG COGGCTTCCT GCCCAAOGGC ATGGAACAOC OCOGOACCaC 1020 

CGCCAQCACC ACCGCA6CCA CCCCCQC06C AGTGCCCGCA GGGAGCAOOQ CA6C060CGC 1080 

GGCOSCCGCC GCTGCC3GC0Q CCGCCGCGGC CGTCACTTCO GGGGTGQOQA CC3«0TQACC 1140 

CSGCTCOGCTC CTCCCTGTQT COGTCCTGTQ TCCGCG0GCX3 CGGGTGCCTT TCCXXSCCGGA 1200 

GACTCGGCCG GTGTGCTTOG TGCTQTAGTT ATOGTTAGTT CCTCTTCCCG AGATGGGGCC 1260 

GCXSAGAGAC CCCAGCGCCT TTGAAAAGCA AGGTTTGT6C TGCOCTTCCA OTTCCGAAAA 1320 

GCAGATGTTT AAGCCCTTGG ACTGAGGOTG GGATOGCAOC TGGGAAGAOG GAGAGGAGGG 1380 

AAATGQGGCC CTTTCCCXTTC TATTGCATCC OCCTOCCCXJA CTCCTTCOOC 6CACCCAGGT 1440 

GCX XTTAG ATT CATGG CAGAA AATGACCAAA TCCTGTGTAT TTGTTTTATA TATTTAATAA 1500 

CTGTTTTAAA TGAAAGTTTT AGTAAAAAAA ATACAAAACA AAAAGATTAA ATTGCTATTG 1560 

CTOTAGTAAa AGAAGCTCTT TGTATCTGAA CATAGTTGTA TTTGAAATTT 6TGGTTTTTT 1620 

AATTTATTTA AAATTGGGGQ GAGGQCATOO OAAGQATTTA ACACOQATAT ATTGTTACCS 1680 

CTGAAAATGA ACTTTATGAA CCTITTOCAA GTTGATCTAT CCAGTOAOGT OOOCrGGTOG 1740 

GCGTTTCTTC TTGTACTTAT GTGGTTTTTT QOCTTTTAAT ACAOACATTT TOCTOC 1796 



30 
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Seq ID NO: CI 15 DNA Sequence 
Nucleic Acid Accession #s XN_,051522.4 
Coding sequences 127.. 1215 "* 



1 11 21 31 41 51 

60 I I I I I I 

ACAGTTGTTG CAAAGTGCTC AGCACTAAG6 GAGCCAGGGC ACAGCACAGC CAOBAAGGGG 60 

AGOGAGCCCA GCCAGCCCAG CCAGCCCAGC CAGC006GA0 OTCATTTOAT TCCOOGCCTC 120 

AGAACGATGG ATCTGCATCT CTTCGACTAC TCAGA6CCAG GGAACTTCTC GGACATCAQC IBO 

TGGCCATGCA ACAGCAOCOA CTGCATCOTO OTGGACAOGG TGATSTQTCC CAACATGCCC 240 

AACAAAAGCG TCCTGCTCTA CACGCTCTCC TTCATTTACA TTTTCATCTT COTCATCGGC 300 

ATGATTGCCA ACTCCGTGGT GGTCTGGGTO AATATCCAGG GCAAGACCAC AGGCTATQAC 360 

ACGCACTGCT ACATCTTGAA CCTGGCCATr 6C0GACCT8T OGOTrGTCCT CACCATCOCA 420 

GTCTGGQTGG TCAGTCTCGT GCAGCACAAC CAGTGGCCCA TX3GG0QAGCT CACGTGCAAA 480 

GTCACACACC TCATCTTCTC CATCAACCTC TTOGGCAGCA TTTTCTTCCT CAOGTGCATG 540 

/U AGCGTGGACC GCTAGCTCTC CATCACCTAC TTCACCAACA COCOCAGCAG CAGQAAGAAG 600 

AT6GTA0GCC GTGTOSICIO CATOCTGQTa TQOCTGCTGQ OCITCTGCGT GTCTCTGCCT 660 

GACACCTACT ACCTQAAGAC O0TCA06TCT GOCTCCAACA ATOAGAOCTA CTGGCGGTCC 720 

TTCTA CCCC Q AGCACAGCAT CAAGGAGTGG CTQATOGGCA TGGAGCTGGT CTCOGTTGTC 780 

TTGGGCTTTQ CCGTTCCCTT CTCCATTATC GCTXTTCTTCT ACTTCCTGCT GGCCAGAGCC 840 

ATCTCGGOOT CCAGTGACCA GGAGAAGCAC AGCAGCCGGA AGATCATCTT CTCCTAOGTG 900 

GTGGTCTTCC TTOTCTOCTQ GCTGCCCTAC CAGOTOGOGG TQCTGCXXiGA CATCTTCTCC 960 

ATCCT6CACT ACATCCCTTT CACCTGCCGG CTGGAGCAOO CCCTCTTCAC 6GCCCTGCAT 1020 

GTCACACAGT GCCTGTOGCT GGTGCACTGC TG03TCAACC CTGTCCTCTA GAGCTTCATC 1080 

AATOOCAACT ACAGGTAOQA GCTGATGAAG GCCTTCATCT TCAAGTACTC GGCCAAAACA 1140 

OOGCTCACCA AGCTCATCGA TQCCTCCAGA GTCTCAGA6A OQGAOTACTC TGCCTTQQAG 1200 

CAGAGCACCA AATGATCT6C CCTGGAGAGG CTCTGGGAOG GGTTTACTTG TTTTTGAACA 1260 

6GGTGATGGG CCCTATGGTT TTCTAGA6CA AAGGUUVOTA GCTTOGGGTC TTGAT6CTTQ 1320 

AGTAGAGTGA AGAGGGGAGC AOGTGCCCCC TQCATCCATT CTCTCnTCr CTTGATGAOG 1380 

CAOCTGTCAT TTGaCTOTOC OTGCTGACAG TTTTGCAACA GGGAOAacra TGTOGCACAG 1440 
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CAGTGCTOTG 
TTTCTOTQTT 
TCTCACTATT 
T QOGAGG OVT 
AOTTTTGACT 
ATATAAATAT 
ATCTGTQTGQ 
TTQTGACATT 
TOCAAATCTO 
ATTTTTTTTT 



CGTCAGAGCC 
TCCTC5AATTT 
GGTGTACCTT 
AGTGCTGACA 
AAGGATGACA 
ATAAATATAT 
TQTTTTGT A C 
AATAGTATTQ 
CACACACAAC 
AATAAAGATT 



AGCTGAGGAC 
TTTATATQQT 
ATAAATOTAT 
TATATTCAQA 
CTAATTGTTA 
GCCAGTCTTO 
CGGCACGOGA 
TAAAGTTACA 
QAACAGTTGC 



AG6CTTGCCT 
GATTTGTATT 
TTGAAAGTTA 
GTQTTOTAGT 
GCTGTTTTGA 
GCT6AAATGT 
TATGGAACGA 
TTTTAAAATA 
ATTTGAQAGA 



GQACTTCTGT 
TAAATTTTAA 
AATATATTrr 
TTTAAGGTTA 
AATTATATAT 
TTTATTTAOC 
AAACTGCTTT 
AACAAAAAAC 
GTTCTCTCAA 



AAGATAGGAT 
GACTTTATTT 
AAATATT6TT 
GOQTGACTTC 
ATATAAATAT 
ATAGTTTTAT 
GTAATGCAGT 
T6TTCTGGAC 
TTTGTAAOTT 



Seq ID NO: CI 16 DNA Sequence 

Nucleic Acid Accession 4)t NM_000350.1 

Coding sequence I 82.. 6 903 ^ 



1500 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2010 



1 11 31 31 41 51 

I I I i I I 

CTGQCTCTTA AOQGOQTrTA TGTOCTTTGC TGTCTGAG6G GCCTCAQCTC TGACCAATCT 60 

GGTCTTCQTQ TGOTCATTAG CATGGGCTTC GTGAGACAGA TACAGCTTTT GCTCTGQAAG 120 

AACTGGACOC TGCGGAAAAG GCAAAAGATT CGCTTTGTGG T6GAACTCGT GTGGCCTTTA 180 

TCTTTATTTC TGOTCTTQAT CTGQTTAAGQ AATGCCAACC OGCTCTACAG CCATCATQAA 240 

T6CCATTTCC CCAACAAGGC GATGOCCTCA GCAGGAATGC TGCOGTGGCT CCAGGGGATC 300 

TTCTGCAATG TGAACAATCC CTGTTTTCAA AGCCCCACCC CAGGAQAATC TCCTGGAATT 360 

GTGTCAAACT ATAACAACTC CATCTTG6CA A6GGTATATC GAGATTTTCA AGAACTCCTC 420 

ATGAATGCAC CAGAGAGCCA GCACCTT66C CGTATTTGGA CAGA6CTACA CATCTTGTCC 480 

CAATTCATGG ACACCCTCC6 6ACTCAGCGG 6AQAGAATTG CAGGAAGAGG AATAOGAATA 540 

AGGGATATCT TGAAAGATGA A6AAACACTO ACACTATTTC TCATTAAAAA CATCGGCCTG 600 

TCTGACTCAO TGGTCTACCT TCTGATCAAC TCTCAAOTCC OTCCAGAGCA 0TTC6CTCAT 660 

GGAGTCCCGG ACCTGGCGCT GAAGGACATC GCCTGCAGCG AGGCCCTCCT GGAGCGCTTC 720 

ATCATCTTCA GCCAGAGACO CG6G6CAAAG ACGGTGCGCT ATGCCCT6TG CTCCCTCTCC 760 

CAG6GCAC0C TACAGTGGAT AGAAGACACT CTGTATGCCA A06TG6ACTT CTTCAAGCTC 840 

TTC0GTGT6C TTCCCACACT CCTA6ACAGC CGTTCTCAAO GTATCAATCT GAGATCTTG6 900 

GGAGGAATAT TATCTGATAT GTCACCAAGA ATTCAAGAGT TTATCCATCG 6CCGAGTATG 960 

CAGGACTT6C TGTGGGTGAC CAGGCCCCTC ATGCAGAAT6 GTGGTCCAGA GACCTTTACA 1020 

AAGCTQATQO OCATCCTGTC TGACCTCCTO TGT66CTACC CCGAGGGAGG TGGCTCTCGO 1080 

6TGCTCT0CT TGAACTOGTA TQAAOACAAT AACTATAA06 CCTTTCTGGG GATTGACTCC 1140 

ACAAGGAAGG ATCCTATCTA TTCTTATGAC A6AAGAACAA CATCCTTTTG TAATGCATTO 1200 

ATCCAGAGCC TGGAGTCAAA TCCTTTAACC AAAATOGCTT GGAGGG066C AAA6CCTTTG 1260 

CTGATGGGAA AAATCCTGTA CACTCCTGAT TCACCT6CAG CACGAAGOAT ACTGAAGAAT 1320 

GCCAACTCAA CTTTTGAAGA ACTGGAACAC GTTAGGAAGT TGGTCAAAGC CTGGGAAGAA 1380 

GTAGG6C0CC AGATCTGGTA CTTCTTT6AC AACA6CACAC AGATGAACAT GATCAGAGAT 1440 

ACCCTGQGGA ACCCAACAGT AAAAaACTTT TTGAATAGGC AGCTTGGTGA AGAAGGTATT 1500 

ACTGCTGAAG CCATCCTAAA CTTCCTCTAC AA6GGCCCTC GGGAAAGCCA GGCTGACGAC 1560 

AT6GCCAACT TCGACTGQAG GOACATATTT AACATCACTO AT06CACCCT CCOCCTGGTC 1620 

AATCAATACC TGGAGTGCTT GGTCCTGGAT AAGTTTGAAA GCTACAATGA TGAAACTGAG 1680 

CTCACCCAAC 6TGCCCTCTC TCTACTGOAG GAAAACATGT TCTGOSCGGG AGTOGTATTC 1740 

CCTGACATOT ATCCCTGQAC CAGCTCTCTA CCACCCCAOQ TGAAGTATAA GATCGGAATO 1800 

GACATAGACQ TGGTGGAGAA AACCAATAAG ATTAAAGACA GGTATTGGGA TTCTGGTCCC 1860 

AGAGCTQATC CCGTOOAAOA TTTCCQGTAC ATCTGGGGCG GGTTTGCCTA TCTGCAGOAC 1920 

ATGGTTGAAC AGGGGATCAC AAGQAGCCAG GT6CAGGCGG AGGCTCCAGT TGGAATCTAC 1980 

CTOCAGCAGA T6CCCTACCC CTGCTTGGTQ GACGATTCIT TGATGATCAT CCI6AAC06C 2040 

TGTTTCCCTA TCTTCATGGT GCTG6CATGG ATCTACTCTG TCTCCATGAC T6TGAAGAGC 2100 

ATCGTCrrOQ AOAAGGAGTT 6CGACTQAAG GAGACCTTGA AAAATCAGGG TGTCTCCAAT 2160 

GCAGTGATTT GGTGTACCTG GTTCCTGGAC AGCTTCTCCA TCATGTCGAT GAGCATCTTC 2220 

CTC CTGAOQA TATTCATCAT GCATGGAAQA ATCCTACATT ACAGC6ACCC ATTCATCCTC 2280 

TTCCTGTTCT TGTTGGCTTT CTOCACTGCC ACCATCATGC T6T6CTTTCT GCTCAGCACC 2340 

TTCTTCTGCA AGGCCAOTCT GGCA6GAGCC TGTAGTGGT6 TC AT C TA TTT CAGCCTCTAC 2400 

CTGCCACACA TCCTQTGCTT CGCCTGQCAG GACCGCATGA CCOCTQAGCT GAAGAAGGCT 2460 

GTGAGCTTAC TGTCTCCGGT GGCATTTGGA TTTGGCACTG AGTACCTGGT TCGCTTTGAA 2520 

GAGCAAGGOC TGGGGCTGGA GTGGAGCAAC ATCGGGAACA GTCCCAOGGA AGGGGAOGAA 2580 

TTCAGCTTCC TGCTGTOCAT GCAGAT6ATG CrCCTTGATG CTGGGTGCTA TGGCTTACTC 2640 

GCTTGGTACC TTGATCAGGT GTTTCCAGGA QACTATOGAA CCCCACTTCC TTGGTACTTT 2700 

CTTCTACAAG AGTCGTATTO GCTTAGCGGT GAA6GGT6TT CAACCA6AGA AGAAAGA6CC 2760 

CTGGAAAAGA CCGAGCCCCT AACAGAGGAA A06GAGGATC CAOAGCACCC AGAA6GAATA 2820 

CAOQACTCCT TCTTTGAAOQ TGAGCATCCA GGGTGGGTTC CTGGGGTAT6 C6TGAAGAAT 2880 

CTQGTAAAQA TTTTTGA6CC CTGTGGCOGG CCAGCTGTGG ACCOTCTSAA CATCACCTTC 3940 

TACQAGAACC AGATCACCOC ATTCCTOGGC CACAATGQAG CTGOGAAAAC CACCACCTTG 3000 

TCCATCCTGA CGGGTCTGTT GCCACCAACC TCTGGQACTQ TGCTCGTTGG GGGAAGGGAC 3060 

ATTGAAACCA GCCTGGATGC AOTCCGGCAO AGCCTTGGCA TGTGTCCACA GCACAACATC 3120 

CTGTTGCACC ACCTGAOGGT G6CTQAGCAC ATGCT6TTCT ATGCCCAGCT GAAAGGAAAG 3180 

TCCCAGGAGO AGGCCCAGCT GGAGATGGAA GCCATOTTGO AGQACACAOO GCTCCAOCAC 3240 

AAGOOGAATG AAGAOGCTCA GGAOCTATCA GGTGGCAT6C AGAGAAAGCT GTOGGTTGCC 3300 

ATTGCCTTTG TGGGAGATGC CAAGGTGGTG ATTCTGGACO AACCCACCTC TGGGGTGGAC 3360 

CCTTACTOQA GACGCTCAAT CTGOGATCTG CTCCTGAAGT ATCOCTCAGQ CAGAACCATC 3420 

ATCATGCCCA CTCACCACAT GGAGQA6GCC GACCACCAAG GGGACC6CAT TGCCATCATT 3480 

OCCCAGGQAA OGCTCTACTG CTCAOGCACC CCACTCTTCC TGAAGAACTO CTTTQQCACA 3540 

GGCTTGTACT TAACCTTGGT GOGCAAQATG AAAAACATCC AGAGCO^AAG GAAAGGCAGT 3600 

QAGGGGACCT GCAQCTGCTC GTCTAAGGGT TTCTCCACCA OOTOTCCAGC CCACGTOOAT 3660 

GACCTAACTC CAQAACAAGT CCTGGATGGO GATGTAAATG AGCTGATGGA TGTAGTTCTC 3720 

CACCATGTTC CAGAG6CAAA GCTGOTQGAG TGCATTGOTC AAGAACTTAT CTTCCTTCTT 3780 

CCAAATAAQA ACTTCAAGCA CAGAGCATAT 60CASOCTTT TCAGAGAGCT GGAGGAGACO 3840 

CrSGCTGAOC TTGGTCTCAG CAGTTTTGGA ATTTCTGACA CTCCGCTGQA AGAGATTTTT 3900 

CTOAAGOTCA CGGAGQATTC TGATTCAOGA OC f CT G TTTQ OG G GTO G OBC TCAGCA6AAA 3960 

AGAGAAAACQ TCAACCCCGO ACACCCCTGC TTGGQTGCCA QAQAOAAOGC TGOACAOACA 4020 
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SSSS^SS^SI ?=WITGTCTG CTCXCCAGGG GCX3CCGGCTG CTCACCCAQA GGGCCAGCCT 4080 

CATCTCCAG6 C0CTGCT0C3T CAAGAQATTC CAACAC^OOl TCCGCAGCCA CAAQGACTTC 4200 

SSSS^ IS^"^ GGCTACCTTT GlGUmUi CTCTGATCCT tSSS 4260 

SSS?^^'' cccoiLTnxi AcccrrcAcc cctooatata togooS II20 

TACACXnrCT TCAOCATGGA TGAACCAGGC AGTOAGCAGT TCACGGTACT TCCAGAOT^ llla 

CCCTGTGGCA ACTCAACACC CTGGAAGACT CCTTCTX5TCT CCCCaAACAT oStcaqctq 4^»a 
TTCCAQAAGC AGAAATGGAC ACAGGTCAAC CCTT^^ 

GAGAAGCTCA CCATGCTCCC AGAGTOCCCC GAGGGTCCCG GGC^CCTOTC lltl 

till 

Jl^^SJiiS TCTTATAAQA AGCAGCTTAA AQAGCAAATT CTGGQTCAAT 4740 

GAACAGAGGT ATCGAGOAAT TTCCATTGaJK QQAAAGCTCC CAGTOGTCCC CATCaSgGG 4800 

GAAGCJ^ TTGGGTTTTT AAOCQACCTT GGCOSGATCA TO^toS 4^60 

^ISS??^ JiSSS^ '^^^^ ^"^A 

^SJJ^ TAACAAAGGC TCGCATGCCC TCGTCAGCTT TCtSaTGTG 49^ 

SS^ISSf^S «5a=*°ccrG cx:taaggaca ggagccccqa ggagtatgqa so'o 

"*«5«3UICC cctgaacctq accaaqgagc AGCTCTCAGA OATTACACTQ SlOO 

CTGACaCTT CAGTGGATGC TGTOGTTGCC ATCTGC6TGA TTTTCTCCAT OTCCTTOTTC lllo 

CCAGCCAGCT TTOTCCTTTA TTTGATCCAG OAGCGGOTGA ACAAATCCAA qScctSaG 5220 

TTTATCAGTG GAGTGAGCCC CACCACCTAC TGGGTGACCA ACTTCCTCTO GGACATCATC S280 

^^r^? T^OTGCTGG 6CTGGTGGTG GGCATCTTCA TCGGGTTTCA OAaScC i^O 

SSSJ^ CAOAAAACCT TCCTGCCCTT GTGGCACTQC TCCTGCTCTA tS^QQ lllo 

GTCATTCCCA TQATCTACCC AGCATCCTTC CXGTTTGATO TCCCCAOCAC AQOCrATGM S460 

OCTTTATCTT GTGCTAATCT OTTCATCGGC ATCAACAOCA OTCC^TTOC cScAtcS^ SsIS 

JI^^S 22^^™ CCTGGGCCGG GGCCTCAITO ACCTTCCACT OAGCCAGGCT S640 

GTGACAGATQ TCTATGCCCX5 GTTTGGTGAG GAGCACTCTG CAAATCOQTT CCACTGOTAC 570D 

CTOATTGQ^ nil 

CTOCTGOTCC AGCGCCACTT CTTCCTCTCC CAATGOATTG COGAGCCCAC TAAGQAGCCC SB20 

in^JJ^J? il^I^™ ^^^^ GAAAGACAAA GAATTATTAC TGGTGGAAAT 5880 

JSJiS^^ ACATGAACTA ACCAAGATTT ATCTCGGCAC CTCCAGCCCA 5940 

GCAGTGGACA GQCTGTGTGT CGGAGTTCGC CCXGGAGAGT GCTTTOGCCT CCTGGGAGTG 6000 

AATGGTGCCG GCAAAACAAC CACATTCAAG ATOCTCACTG GGGACaOCAC StoACCtS loll 
GG^J^ CaTTAGCAGG CAAOAGTATT 

rl^ZnZ ^^SiSII ^"^^TC GATCAGCTGC TCACAGGACO AQAACATCTT 6180 

liSIIT?^ CCCGGCTTCX3 AGQTGTACCA GCAGAAGAAA TCGAAAAGGT TGCAAACTGQ 6240 

OACIGTCTAC GCCQACTCCC TGGCTOGCAC GxSS^ 63^0 

OTCAACAAGC GGAAACTCTC CACAOCCATC GCACTCATTO GCTGCOCACC OCTOGTCCTG 6360 

^^l^J!:'^'^^^ lllo 

SSJS 1^^^^ GAGGGCTCTG GTCCTCACAT CCCACAGCAT OOAAGAATGT 64M 

GAGGCACTOT GTACCOGGCT GGCCATCATQ GTAAAGGGOG CCTTTCGATG TATGGGCACC 6S40 

ATTCAGCATC TCAAOTCCAA ATTTGGAGAT GGCTATATOG TCACA^ GAtSaS^ sToo 

CCGAAGOACXS ACCTGCITOC TGACCTOAAC CCT0TO6R0C AOTTCTToS OGGQaS 666^ 

CCAGGCAGTC TGCAGAGOGA GAGGCACTAC AACATGCTCC AOTTcS CTotS 6720 

tccctggcga nil 

TACTCAGTCA CACAGACCAC ACTGOACCAG GTGTTTCrrAA ATTTTOCTAA ACAGC^ACT 6840 

TGATCTTTCA CACOOCTOST TCCT6CA0CC AGAAAOGAAC TCTOGOCAGC TGGAGGCGCA 6960 

^^l^.^l^'^'''^''^'^'^'^^ 'III 

IJi^^ ""^^^ GAGCATCCAQ OGAATTCAQA AAGAGGTOT TCAGAAGGAA 7080 

ACCGAAACTO ACTTGCTCAC CTGGAACACC TGATGGTGAA ACCAAACAAA TA^AaScc 7140 

TTCTCCaOAC CCCAGAACTA GAAACCCCGG GCCATCCCAC TAGCAGCTTT GOCXTTCCATA 7200 

sss^ j;^^ ^^'^^^ 'III 

OTGATnTCA TGGAAAAATA AAATGCaAAT GCACTCATCA CAAAAAAAAA AAAAAAAA 7318 

Seq ID MO I C117 DNA Sequence 

Nucleic Acid Accession #i NM 006671.2 

Coding sequences 138.. 1820 " 



I 

G6CA0GAGGC 
QCCCAQACCC 
AGCAGGCCAG 
GGGGGAATGO 
TCTTCTTGAO 
AGCTCCTGAT 
COGOACrTGC 
ACTACCTOTG 
ACCCAGGCAO 
CAGCOQATQC 
CATTCAAACA 
AGGAGGCXrCC 
TGCAOAACTT 
GCACCA6GQA 
TGCTGGGCCG 
AOTOGGTCAT 
TCATTQCX3GG 
TCTACTCAQT 
TCTACTTCTT 
TQCTCATCX3C 
T6CTGGAGAA 
CCATCAACAT 



11 
I 

TQGTGTTTAG 
TOTGCCWCOG 
QCTCACCATG 
ACTCCTCATC 
GACCCGGCX3C 
GAGGATGCTG 
CrOCCTGGAT 
GACCACCTTC 
06CGQCCCAG 
CCTQTTGGAC 
GTACCGCACC 
TCCTOGGOGG 
OGCCCTGGAC 
TGGCATGAAT 
CATGGOTQAC 
GAAQAT06T0 
TAAOATCCTG 
CACXXSTGOTG 
CATCACCAAO 
GCTGGCCACC 
CAACCACATC 
GGA0G6CACT 



21 
I 

CAACTCCGAC 
QCOGGGCTCT 
GTGCOGCATG 
CTGTCrrGTGC 
CTCTCACCAC 
AAGATGATGA 
GCCA AOAC CT 
ATGGCTOTCA 
AAGGAGACCA 
CTCATCOGGA 
AAGACCACCC 
ATCCTCATCT 
CTGACCCOGC 
GTGCrOGGCA 
AGOGGGGCCC 
G0SGT6GCTG 
GA0ATGGACX3 
TG06GGCIGG 
AAGAATCCCA 
TCCTCCAGCT 
GACOQOCGGA 
606CTCTA0G 



31 
I 

CACCTGOCTG 
CATGOQTGGA 
OCaTCTTGGC 
TGTCTOTCAT 
AGGAAATTAG 
TCCTGCCACT 
CTAGC08CCT 
T0GTGG6CAT 
OGGAGCAGAG 
ACATGTTCCC 
CAGTTGTCAA 
ACQ6GGTCCA 
OQCCOGAGGT 
TOGTCTTCTT 
CCCTGQTCAG 
TQTGGTATTT 
ACCCCAGOGC 
TGCTCCAOSG 
TOGTCTTCAT 
CAOCCACACT 
TOGCTOGCTT 
AGGCTGTGGC 



41 

I 

CTGAGGGQCT 
ATGGTGCTOT 
A0GG6GGAGG 

CGTGGGCTGC 
TTACTTCCAG 
OQTGGTCTCC 
QGG08TCCTC 
CTTCATGGTC 
TGGGAAGOCC 
AGCCAACCTA 
GTCCCOCAAG 
GGAGOAGAAT 
CX3TTTACAAG 
CTCTGCCACC 
CTTCTGCCAQ 

O0TOGGCAA6 
GCTCTTTATC 
CCOCOGCATC 
GCCCATCACX: 
COTGCTGCCC 
CGCCATCTTC 



51 
I 

AGAGCCCTCA 
GCOCCTTGCC 
GA06T6TGCA 
CTCCTOGGCT 
TTCCCTGGAQ 
AGCTTGATGT 
ACCGTOGOGT 
TCCATCATCC 
ATCATOAGCT 
GTAGAAOCCA 
0TX3GCACCAG 
GGCTCCCATG 
TCAGAGCOGG 
ATGGGCATCA 
TGOrrCAATQ 
ATTCTOTTCC 
AAGCTGGGCr 
CTGCCCCTGC 
CTGCAGGCTC 
TTCAAOTGCC 
GT6G0T6CCA 
ATC0CCCAG6 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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TCAACAACTA COAGCTGGAC TTTGGCCAOA TOVTCACOVT CAGTATCACA GCCACTGCAQ 1380 

CCAGCATTGG GGCAGCTGOC ATCXCCOVGQ CCGGCCTOGT CACCATGGTC ATOQTGCTCA 1440 

CCTCCGTCGO ACTGCCCACC GATOACATCA CCCTCATCAT TGCOOTTGAC TGGGCTCTOa 1500 

ACOCSTTTOaS CACCATGATT AA00T6CTGG GTGATGGGCT GGCA6GGGG0 ATCATGGCGC 1560 

5 ATATATGTOS GAAGGATTTT GCC0G6GACA CAGGCAC06A GAAACT6CT0 CCCTGOCSAGA 1620 

CCAAGCCAOT GAGCCTCCAG GAGATOGTGG CAGCCCAGCA GAATGGCTGT GTGAAGAGTG 1680 

TAGCOGAGGC CTCOGAGCTC ACCCTGGGCC CCACCTOCCC CCACCACGTC CCCGTTCAAQ 1740 

TGGAGCGGGA TGAGGAGCTG CCCGCTGGGA 6TCTGAACCA CTGCACCATC CAGATCAGOS 1800 

AGCTGGAGAC CAATGTCTGA GCCTGCGGAG CTGCAGGGGC AGGOGAGGCC TCCAG6GQCA 1860 

10 GGGTCCTGAG GCAGGAACTC GACTCTCCAA CCCTCCTGAO CAOCOGGCAO OGGCCAGGAT 1920 

CACACATTCT TCTCACCCTT GAGAGGCTGG AATTAACCCC GCTTGACGGA AAATGTATCT 1980 

CAGA6AAGGG AAAGGCTGCA TGGGGGAGCC CCATCCAGGG AGTGATGGGC COGOGATTGC 2040 

CTGAGGCCCC GCTGTGACAO TTTCCCCXWT GTGAGCCCGG TGAGGGCX3GC AGGCAGGGGT 2100 

TATCCX3GCXX: C3VCTTTCTGG ATGACAGACT TGAGGCTCTG AGAGCTGAAA ACACTTGTCC 2160 

15 AAGGTCTCAC GTTAAQGTCA AGACACTAAC TCAAATCTTT CAAGCCCCGC CTCTCCTCTT 2220 

GQAOOACAGG GCA6CCTGCA GCTGTGTGCA GQCCCAGGCC CCACCCCATA ACAGGTGGCC 2280 

TCAOCCACAC AGTTCTCCCC AAGGGGAGCA GCCCAGGGCC AAGCCCOGCT GCCTTCCCCA 2340 

GGCCACAGTG CGTCCAGTCT CXJTGTCCTGC CAOGTGTCTT TTGCAAAGCT CCTTGGATGT 2400 

GGAGACAQAT GTCTTTACTA GAGCTGAAAQ GCCOCCTTGA CACATCCAGO CCAACCTCXX: 2460 

20 AT66AATAGG TAGGCAAGCC AGOACTCCGG GAAGGAGGTG CAGCCAGGAT GCTCTGGTGG 2520 

AGCTGCCQAT 0066CCCTGG TOTCAOAACT COCCAAAGGC CT6TGCGTCC AAGTGQAGTC 2580 

AOOTTTTCTA 'n' O CrrrCro TQTTTCCAAA TTCAOTOTTA ACTAAATAAA GOTATTTTGT 2640 

TTTTCAAAAA AAAAAAAAAA AAA 2663 

25 Seq ID NO: C118 DMA Sequence 

Nucleic Acid Acceaaion #t NM_005689 
coding sequence: 27 8.. 2 606 



1 11 21 31 41 51 

I I I I ) I 

GGQOCTGCAO TTGGGAaAAO GGTCCOQGGC CCAGAGCXAG C60GGCCGTG CTGAGACX3GC 60 

OTACGT G CCC TG06TC3AGT0 0GT660Q6GG GOGOGTGCGC TAGGGGAGTO GGCGGTGAGG 120 

CCTGGTCCAC GTGCGTCCCT TCCOGGGACC CCOOCAGCTT GGCX3CCCAGC GGCTACGTGA 180 

GCCAAGGCAC CCGGATOTCC GCGCCCCTCT CCGAGTGACA AGTCCOGGCC TCCGGTCCC36 240 

CAGTGCCCX5C AGCXTTOGGCC GGOSTCCAOO CATTGCCATG OTGACTOTGO GCAACTACTG 300 

OQA06CCX3AA QQGCOGOTGO GT0C9G0GCT0 GAIGCAOaAT GQOCTGAGTC CCTGCTTCTT 360 

CTTCAOGCTC OTGCXCT O GA G6GGQATGQC TCZAGG6ACT CTGGCCTTGG T6CTG6CTCT 420 

TCCCTGCAGA CaCGGGOAOC GGCCCGCTOO TGCTGATTOG CTGTCTTGGO GGGCCX3GCCC 480 

TCGCATCTCT CCCTACGTGC TGCAGCTGCT TCTGGCCACA CTTCAGGOGG OGCTGCCCCT 540 

GGCCJGGCCTG GCTGGCOGGG TGG6CACTQC CXX3QG6G6CC CCACTGCCAA QCTATCTACT 600 

TCTGGCCTOC OTGCTGOAiOA GTCTGGCCGO C30CCTGT6 G C CTSTGGCIGC TTGTGGTGGA 660 

GCGGAGCCAG GCACG6CAQC GTCT66CAAT GGQCATCTGO ATGAAOTTCA GQCACAGOCC 720 

TCGTCTCCXO CTCCTCTGGA CTGTGGCOTT TOCAGCTGAG AACTTGGCCC TGGT6TCTTG 780 

GAACAGCCCA CAGTGGTGGT GGGCAAGGGC AGACTTGGGC CAACAGGTTC AGTTTAGCCT 840 

GTGGGTGCTG OGGTATGTGO TCTCXGGAG6 6CTGTTTGTC CTGGGTCTCT OGGCCCCTGG 900 

ACTTOOTCCC CAGTOCTATA CATTQCAG6T TCATGAAQAO GACCAA6AT6 TGOAAAGGAQ 960 

OCAGGTTCGG TCACCAGGCC AACAGTCTAC CTGGOS^QAT TTTG6CA6GA AOCTCOGOCT 1020 

CCTGA6TGGC TACCTGTQGC CT06AGGGAG TCCAOCTCTG CAGCTGGTGG TGCTOVTCTG 1080 

CCTGGGGCTC ATGGGTTTGG AACGGGCACT CAATQTGTTG OTQCCTATAT TCTATAGGAA 1140 

CATTGTOAAC TTGCTOACTG AGAAGQCACC TTGGAACTCT CTGGCCTGGA CTGTTACCAQ 1200 

TTAGQTCTTG CTCAAQTTCC TCGAGGGQSQ TGGGACTGGC AGTAGAQQCT TCX7IGASCAA 1260 

CCTGOSCACC TTCCTGTGGA TCOGQQTGCA GGAOTTCAOG TCTCOGCGOG TG6A0CTGCT 1320 

CATCTTCTCC C31CCTGCACG AGCTCTCACT GOGCTGGCAC CTGG6GCGCC GCACAGGGGA 1380 

GGTGCTGOSG ATCGCGGATC GGGGCACATC CAGTOTCACA GGGCTGCTCA GCTACCTGGT 1440 

GTTCAATGTC ATCCCCA06C TGGCCGACAT CATGATTGGC ATCATCTACT TCAGCATGTT 1500 

CTTCAAOGOC TGGTTTG6CC TCATT6TGTT CCTGTGCATQ A O TCTTTACC TCAOCCIGAC 1560 

CATTaiaOTC ACTGAOTGOA GAACCAAGTT TOSTGQTGGT ATGAACACAC A0QAGAACX3C 1620 

TACCOGGGCA 06AGCAGTGG ACTCTCTGCT AAACTTCQAQ AOOOTGAAGT ATTACAACGC 1680 

CGAGAGTTAC GAAGTGGAAC GCTATCGAGA GGCX^TCATC AAATATCAGG GTTTGGAGTG 1740 

60 GAAGTCGAGC GCTTCACrGG TTTTACTAAA TCAGACCCAG AACCTGGTGA TTGGQCT06G 1800 

GCTCCTOOCC GGCTGCCTQC TTTQOGCATA CTTTQTCACT GAOCAOAAGC TACAGGTTGQ 1860 

GGACTATOTG CTCTTTGOCA OCTACATTAT CCA6CTGTAC ATOCCCCTCA ATTGGTTTOQ 1920 

CACCTACTAC AGGATGATCC AGACCAACTT CATTGACATG GAGAACATGT TTGACTTGCT 1980 

^ GAAAGAGGAG ACAGAAGTGA AGGACCTTCC TGGAGCAGGG CCCCTTOGCT TTCAGAAGGG 2040 

65 COGTATTGAG TTTGAGAACG TGCACTTCAG CTATGCGQAT GGGCGGGAGA CTCTGCAGGA 2100 

cv m i v rr ru actgtgatgc ctggacagac actx go ocix s gtggggccat ctggggcago 216o 

GAAGAGCACA ATTTTGC6CC TGCrGTTTCXS CTTCTACX3AC ATCAGCTCTG GCIGCATCCXS 2220 

AATAGATGQO CAGQACATTT CACAGGTGAC CCAG6CCTCT CTCOGGTCTC ACATTGGAGT 2280 

TGTGCCCCAA GACACTGTCC TCTTTAATGA CACCATOGCC GACAATATCC GTTAaSGCCG 2340 

70 TGTCACAGCT GG6AAIX3AT0 AGGTGGAGGC TGCTGCTCAG QCIY3CAGGCA TCCATGATGC 2400 

CATTATG6CT TTCOCTGAAG 6GTACAGQAC ACAGGTGGGC aAGGGQQQAC TGAAOCrTGAO 2460 

0660GGGGAG AAGCAGGGGO TOGOCATTGC CG6CACCATC CTCAAOGCTC OQGGCATCAT 2520 

TCTGCTGQAT GAGGCAAOGT CAQ06CTGGA TACATCTAAT GAGAGGGCCA TCCAGGCTTC 2580 

TCTGGCCAAA GTCTGTCCCA ACCXK:ACCAC CATCGTAGTG GCACACAGGC TCTCAACTGT 2640 

75 GGTCAATGCT GACCAGATCC TOGTCATCAA GGATGGCTGC ATCCrrGGAGA GGGGACGACA 2700 

OGAGGCTCTG TTQTCC0QA6 GTGGGGTGTA T6CTGACAT6 TGOCAQCTQC AGCAGGGACA 2760 

GGAAGAAACC TCTGAAGACA CTAAGOCTCA GACCATGGAA OGGT OACA AA AGTT TGGC CA 2820 
CTTCCCTCTC AAAGACTAAC CCAGAAGGGA ATAAGATGTO TCTCCTTTCC CTGGCTTATT 2880 
TCATCCTGGT CTTGGGGTAT GGTGCTAGCr ATOQTAAGGG AAAGGQACCT TTCC3QAAAAA 2940 

80 CATCTTTTGG GGAAAXAAAA ATGTGGACTO TGAAAAAAAA AAAAAAAAAA AAA 2993 
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Seq ID NO: C119 DHA Sequence 
Nucleic Acid Acceaaion tti NN_000676 
Coding aequence: 333.. 1331 
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Seq ZD NO: C120 DNA Sequence 
Nucleic Acid Accession #: NM_052932 
Coding sequence: 217.. 786 " 



} 11 21 31 41 51 

I I I I I I 

GGGCAATTTO TTAOTTATOC GCOGCCROOV AOAOGGGGCA OGGOGCCTOG ACC66AGGQ0 60 

J CCCCG060GG G0606AACTT TGGGCT0GG6 OQAOrGGOTG QT6CTC0QCC CAGCC06A6A 120 

CGGGCGGGCG C3GCGGGCCAA TGGGTGCOGC CTCTTGGCOG CGGGOOQCCC OOACCOGTGG 180 

GTCCOGGCCA CXyVGCGCaX AGCCCOQAGG CTCAGAAOOG GCAGGCGQAO GCGCGGTCCG 240 

GGCGCTATGQ CCATGCCGGG CGGGTCTCAC GOGGCTGCCC CTCGCCOGGC GCGCCTTOSQ 300 

^ . TAGGGGG06C CCGGGGCCCA GCTGOCOOGQ CCATGCTGCT GGAGACACAG GACGOGCTOT 360 

lU AOG TGGOGCr GGAGCTGGTC ATOGCOGOGC TTT06GTGGC GGGCAACX3TG CTGGTGTGCG 420 

COGCGGTGGG CACGGCGAAC ACTCTGCAGA OGCCCACCAA CTACTTCCTO OTGTCCCTGG 480 

CTGCGGCCGA CGTGGCXGTO GGGCTCTTOG CCATCCCCTT TGCCATCACC ATCAGCCTGG 540 

GCTTCTGCAC TGACTTCTAC GGCTGCCTCT TCCTOGCCTQ CTl'OO'ltl C m GTGCTCACGC 600 

^ AGAGCTOCAf CTTCAOCCTT CTGGO0GTG6 CAGTG6ACAO ATACCTGGCC ATCTOTGTCC 660 

ID OGCTCAGGTA TAAAAGTTT6 GTCAOGGGGA C3C06AGCRAO AOOQGTCATT GCTGTCCTCT 720 

GGGTCCTTGC CTTTGGCATC GGATTGACTC CATTCCTX»Q GTGGAACAGT AAAGACAGTQ 780 

CCACCAACAA CTGCACAGAA CCCTGGGATG GAACCAOGAA TGAAAGCTGC TGCCTTGTGA 840 

AGTGTCTCTT TGAGAATGTG GTCCCCAT6A GCTACATGGT ATATTTCAAT TTCTTTGGGT 900 

GTGTTCTGCX: CCCACTGCTT ATAATGCTGG TOATCTACAT TAAGATCTPC CTGGT6GCCT 960 

/U GCAGGCAGCT TCAGCGCACT GA6CTGATGG ACCACTCGAG GACCACCCTC CA6CGGGAGA 1020 

TCCATGCAGC CAAGTCACTO QCCATGATTG TGGGGATTTT TGCCCTGTGC TGOTTACCTG 1080 

TGCATGCTGT TAACTGTGTC ACTCTTTTCC AGCCAGCTCA GGGTAAAAAT AAGCCCAAGT 1140 

GGGCAATGAA TATGGCCATT CTTCT0TC3VC ATGCCauVTTC AGTTGTCAAT CCCATTGTCT 1200 

ATGCTTAGOQ GAAC06AGAC TTCC6CIACA CTTTTCACAA AATTATCTCC A667ATCTTC 1260 

/>0 TCT6CCAAGC AGATGTCAAG AGTGGGAATG GTCAGGCTGG GGTACAGCCT GCTCTOOGTG 1320 

TGGGCCTATG ATCTAGGCTC TOGCCTCTTC CAGGAGAAGA TAC3VAATCCA CAAGAAACAA 1380 

AGAGGACACQ GCTGQTTTTC ATTGTGAAAG ATAGCTACAC CTCACAAGGA AATGGACTGC 1440 

CTCTCTTGAG CACTTCCCTG GAGCTACCAC GTATCTAGCT AATATGTATG TGTCAGTAGT 1500 

^. AGGGTGGAAO QATTGACAAA TATATTTAT6 ATCTATTGAG CTGCTTTTAC TOTGTGGATT 1560 

J\) ATG CCAA CRG CTTGAATGOA TTCTAACAGA CTCTTTTGTT TTTAAAAGTC TOCCTTOTTT 1620 

ATGQTGQAAA ATTACTGAAA CTATTTTACT GTQAAACAGT OXGAACTATT ATAAT6CAAA 1680 

TACTTTTTAA CTTAGAGGCA ATGGAAAAAT AAAAGTTGAC TGTACTAAAA ATG 1733 



1 11 21 31 41 51 

I I I I I I 

CCCRGCCCGG CCCCGCXIGCC CCGGCTGOGC AOGCGAOGCC CCCTCCAGGC CCC36CTCCTG 60 

CGCXXTATTT GGTCATTOSG GGGQCAAGOS GCGGGAGGGG AAACGTQCXSC GGCCGAAGGG 120 

QAAGCGGAGC CGQCGCCGGC TGCGCAOAGa AQCCQCTCTC GCCGCCGCC3V CCTCGGCTGG 180 

GAGCCXaOGA GGCTGCCGCA TCXOTCCCTC GGAACAATOG QACTOOGCGC GCGAGGTGCT 240 

TGGGCCG06C TGCTOCTGGG GAGGCTGCAS GT6CTAG0GC TGCTGGGGGC 06CCCATQAA 300 

AOOQCAGCCA 1GGC66AGAC TCTCCAACAT GTGCCTTCTG ACCATACAAA TGAAACTTCC 360 

AACAGTACTQ TQAAACCACC AACTTCAGTT GCCTCAGACT CCAGTAATAC AAOOOTCACC 420 

A(XATGAAAC CTACAGCGGC ATCTAATApA ACAACACCAG GQATGGTCTC AACAAATATO 480 

ACTTCTACCA CCTTAAAGTC TACACCCAAA ACAACAAGTG TTTCACAGAA CACATCTCAG 540 

ATATCAACAT CCACAATGAC GGTAACCOVC AATAGTTGAG TGACATCTGC TGCTTCaTCA 600 

QT AACAAT CA CAACAACTAT GCATTCTGAA GCAAAGAAAO GATCAAAATT TGATACTQGG 660 

AGCTTTGTTG GTOGTATTOT ATTAAOQCTO GGAOTTTTAT CTATTCTTTA CATTGOATGC 720 

AAAATGTATT ACTCAAGAAO AGGCATTCGG TATOQAACCA TAGATGAACA TGATGCXATC 780 

ATTTAAGG AA ATCCATOGAC CAAGGATG6A ATACAGATTG ATGCTGCCCT ATCAATTAAT 840 

TTTQOTTTAT TAATAGTTrA AAACAATATT CTCTTTTTQA AAATAGTATA AACAGGCCAT 900 

6CATATAAT0 TACAGTOTAT TAOQTAAATA TGTAAAGATT CTTCAAGGTA ACAAOGGTTT 960 

GGGTTTTGAA ATAAAC3VTCT GGATCTTATA GACOGTTCAT ACAATGGTTT TAGCAAGTTC 1020 

ATAOTAAGAC AAAC3VAQTCC TATCTTTTTT TTTTTGGCTG GGGTOGGGGC ATTGGTCACA 1080 

TATGAC CAOT AATT6AAAGA CGTCATGACT GAAAGACAGA ATGCXZATCTG GGGATACAAA 1140 

TAA6AA8TTT GTCACAGCAC TCAGQATTTT GGGTATCTTT TGTAGCTCAC ATAAAGAACT 1200 

TCAGTGCTTT TCAGAGCTX30 ATATATCTTA ATTACTAATG CCACACAGAA ATTATACAAT 1260 

C3«ACTAGAT CTGAAOCATA ATTTAA6AAA AACATCAACA TTTTTTOTGC TTTAAACTGT 1320 

AGTAGTTGGT CTAGAAACAA AATACTCCAA 6AAAAAGAAA ATTTTCAAAT AAAACCCAAA 1380 

ATAATAGCTT TGCT TA GCCC T6TTAGGGAT OCATTGGA6C ATTAAGGAGC ACATATTTTT 1440 

ATTAACTTCT TTTGAGCTTT CAATGTT6AT GTAATTTTTO TTCTCTGTGT AATTTAG3TA 1500 

AAGTGCAGTG TTTAACATAA TAATGTTTTA AAQACTTAGT TOTCftGTATT AAATAATCCT 1560 

GGCATTATAG GGAAAAAACC TCCTAGAAQT TAGATTATTT GCTACTGTGA GAATATTGTC 1620 

ACCACTGGAA GTT ACTTT AQ TTCATTTAAT TTTAATTTTA TATTTTGTGA ATATTTTAAG 1680 

AACTGTAGAG CTGCTTTCAA TATCTAQAAA TTTTTAATTG AQTOTAAACA CACCTAACTT 1740 

TAAGAAAAAG AAOCGCTTGT ATOATTTTCA AAAGAACATT TAGAATTCTA TAGAGTCAAA 1800 

ACTATAGCXn* AATGCTGTGT TTATTAAGCC AGGGATTGTG QGACTTCCCC CAGGCAACTA 1860 

AACCTGCAGG A TQAAAA TQC TATATTTTCT TTCATGCACT GTOGATATTA CTCAQATTTQ 1920 

G GGAAAT GAC ATTTTTATAC TAAAACAAAC ACX»AAATAT TTTAQAATAA ATTCTTAGAA 1980 

AGTTTTGAGA GGAATTTTTA GAGAGGACAT TTCCTCCTTC CTXSATTTGGA TATTOXTCA 2040 

AATOOCTCCT CTTACTCCAT GCTGAAGGAG AAGTACTCTC AGATOCyVTTA TGTTAATGQA 2100 

GA6AAAAAGC ACAGTATTGT AGAGACACCA ATATTAQCTA ATGTATTTTG GAGTOTTTTC 2160 

CATTTTACAG TTTATATTCC AGCACTCAAA ACTCAGGGTC AAGTTTTAAC AAAAOAOGTA 2220 

TQTAGTCACA GTAAATACTA AGATGGCATT TCTATCTCAG AGGGCCAAAG TGAATCACAC 2280 

CAGTTTCTGA AGGTOCTAAA AATAGCTCAG ATGTCCTAAT GAACATGCAC CTACATTTAA 2340 

OA TAOGA GTACA ATAAAACTGT TGTCAGCTTT TXJTTTTACAG AQAAOGCTAG ATATTAAGAA 2400 

OU TTTTOAAATO GATCATTTCT ACTTGCTGTG CATTTTAACC AATAATCTGA TGAATATAGA 2460 

AAAAA ATQA T CCAAAATATG GATATGATTO GATGTATGTA ACACATACAT GGAGTATGGA 2520 

GGAAATTTTC TGAAAAATAC ATTTAGATTA GTTTAGTTTG AAGOAGAGGT OGOCTGATGQ 2580 

CTQAQTTGTA TGTTACTAAC TT6QCCCTGA CrGGTTGTGC AACCATTGCT TCATTTCTTT 2640 

GCAAAATGTA OTTAAGATAT ACTTTATTCT AATGAAGGOC TTTTAAATTT GTOCACTGCA 2700 
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TTCTTGGTAT TTCACTACTT CAAGTCAGTC AGAACTTOST AGAC06ACCT GAAOTTTCTT 2760 

TTTOAATACT TGTTTCTTTA GCACTTTGAA GATAOAAAAA CCACTTTTTA AGTACTAAGT 2B20 

CftT»TTTGC CTTGAAAGTT TCCTCTGCAT TGGGTTTQAA GTAGTTTAGT TATGTCTTTT 2B80 

TCrcrOTATg TAAGTAGTAT AATTTGTTAC TTTGAAATAC COQTACXTTG AAT0TA8GTT 2940 

TmPTGTTGT TGTTATCTAT AAAAATTGAG GGAAATGGTT AT6CAAAAAA ATATTTTGCT 3000 

TTOOACCATA TTTCTTAAGC ATAAAAAAAT GCTCAGTTTT GCTTGCATTC CTTGAGAATG 3060 

TATTTATCTG AAGATCAAAA CaUUVCAATCC AGATOTATAA GTACTAGGCA GAAGCCAATT 3120 

TTAAAATTTC CTTGAATAAT CCATGAAAG6 AATAATTCAA ATACAOATAA ACAOAOTTGG 3180 

CAGTA TATTA TAGTGATAAT TTTGTATTTT CAAHAAAAMV AAAGTTAAAC TCTTCTTTTC 3240 

TTTTTATTAT AATGACCAOC TTTTGGTATT TCATTGTTAC CAAGTTCTAT TTTTAGATAA 3300 

AATTGTTCTC CTTCTAAAAA AAAAAAAAAA AAAAAAAA 3336 

Seq ID NOt C121 DNA Sequence 
^ . Nucleic Acid Accession « t KN_004195 
ID Coding sequence t 1..726 ~ 

1 11 31 31 41 51 

I I I I t I 

ATGGCACAGC A06GGGOQAT GGGCGCGTTT CX3GGCCCTGT GOSGCCTGGC GCTGCTGTGC 60 

ZU G06CTCAGCC TGGGTOVGOG CCCCACOGGG GGTCCCGGQT GCX36CCCTQO QOGCCTCCTQ 120 

CTTGGGAC3GG OAACGGACGC GCGCTGCTGC ' CGGGTTCACA CGACGCXKTTG CT6COGCX3AT 160 

TACCCX3GGCG AGQAQTGCTQ TTCCGAGTGQ GACTGCATQT GTGTCCRQCC TGAATTCCAC 240 

TOCGGAGACC CTTGCTOCAC GACCTGCCGG CACCACCCTT GTCCCCCAGG CCAGGGGGTA 300 

CAGTCCCAOQ GGAAATTCAa TTTT6GCTTC CAOTOTATOO ACTOTGCCTC 6GGGACCTTC 360 

TCCOGGGGCC ACGAAGQCCA CTGCAAACCT TGGACAGACT GCACCCAGTT C GG GTTTCTC 420 

ACTGTGTTCC CTGGGAACAA GACCCACAAC GCTGTGTQCO TCCCAGGGTC CCOGCOGGCA 460 

QAGCCGCTTG GGTGGCTGAC CGTCGTCCTC CTGGCCGTQO CCGCCTGCGT CCTCCTCCTG 540 

ACCTOGGCCC AGCTTQOACT GCACATCTGG CAOCTGAGGA GTCAGTGCAT GTGGCCC06A 600 

^ - GA6ACCCAGC TGCTOCTGGA GGTGC06C00 TCGACCGAAG ACGCCAGAAG CTQCCA9T7C 660 

3U CCCGAGGAAO A6C6GGGGQA QCGATCGGCA GAGGA6AAGG GGCG6CTGGG AGACCT6TGG 720 

GTGTGA 725 

Seq ID NO: C122 DMA Sequence 
^- Nucleic Acid Acceeaion #: AX091896.1 
5 J Coding sequence: 2 8.. 1572 
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1 11 31 31 41 SI 

I I I I I I 

AGATCOGOGA GCCCGTCAGC CTGCGCCATG GGCTGCGAOG GCXX3CGTGTC GGGGCTGCTC 60 

OGCOGCAACC TGCAGCCCAC GCTCACCTAC TGGAGOQTCT TCTTCAGCTT CGGCCTGTGC 120 

ATOGCCTTCC TGGGGCCCAC GCT6CTGGAC CTGOGCTGTC AGA06CACAG CTCX3CTQ0CC 160 

CAGATCTCCT GGGTCTTCTT CTOGCAGCAG CTCTGCCTCC TOCTGGGCAG OGCCCTOGGG 240 

GGOGTCTTCA AAAGGACCCT GGCCCAQTCA CTATGGGCCC T6TTCACCTC CTCTCTGGCC 300 

ATCTOCCTGG TGTTTGCCGT CATCOCCTTC TGCCGCGACQ TGAAGGTGCT GGCCTCAGTC 360 

ATGGOGCTGG CGGGCTTGGC CATGGGCTGC ATCGACACOQ TQQCCAACAT GCAGCTGGTA 420 

AGGATGTACC AGAAGGACTC GGCCX3TCTTC CTCCAGGT6C TCCATTTCTT OQTGGGCTTT 460 

GGTGCTCTGC TGAGCCCCCT TATT6CTGAC CCTTTCXrrGT CTGAGGCCAA CitKJT' mCC r 540 

GCCAATAGCA OGGCCAACAC CACCTCCOGA GGCCACCTGT TCCATGTCTC CAGOGTGCTG 600 

GGCCAGCACC ACGTAGATGC CAAGCCTTGG TCCAACCAGA CQTTCCCAGG GCTGACTCCA 660 

AAGGAOGGGG CAGGGACCOG AGTGTCCTAT GCCTTCTGGA TCATOGCCCT CATCXJATCTT 720 

CCAGTGCCC3V TGGCTGTOCT GATGCTGCTO TCCAAGGAGC GGCTGCTGAC CIGCTGTCGC 780 

CAGAGGAGGC CCCTGCTTCT GTCTGCTGAT GAGCTTQCCT TtSGAOACACA GCCTCCXaAO 640 

AAGGAAGATG CCTCCTCACT GCCCCCAAAG TTTCAGTCAC ACCTAGGGCA TQAGGACCTO 900 

TTCAGCTGCT GCXAAAGGAA GAACCTCAQA GOAGCCCCTT ATTCCTTCTT TGCXIATCCAC 960 

D3 ATCACGGGOG CCXTTGOTACT GTTGATGAOS GATGGGTTGA OGGGTGCCTA TTCCGCCTTC 1020 

GTOTACAGCT ATGCTGTGGA GAAOCCCCTG TCTGTGGGAC AC3VAG8TGGC TGGCTACCTC 1080 

CCCAGCXTCr TCTGGGGCTT CATGACACTO GG0CG6CTCC TCTCCATTOC CATATCCTCA 1140 

AGAATGAAGC OGGC CACX!AT GGTTTTCATC AA08TGGTTQ GOQTGGTGGT GACGTTCCTG 1200 

GTGCTGCTTA TTTTCTCCTA CAAOGTCGTC TTCCTGTTCG TGGQOAOGGC AAOCCTGGGC 1260 

CTGTTTCTCA GCAGCACCTT CCCCAGCATG CTGGCCTACA CGGAGGACTC GCT6CAGTAC 1320 

AAAGGCTGTG CAACCACAGT GCTGGTGACA GGOGCAGGAO TTGGOQAGAT GGTGCIGCAQ 1380 

AT GCTGGT TG GTTOQATATT CCAGGCrCAG GGCA6CTATA tfmtX.mGV CTGTGGOGTO 1440 

ATCTTTGGTT GTCTGGCTTT TACCTTCTAT ATCTTOCTCC TOTTTTTCCA CAGGATGCAC 1500 

^ CCTGGACTCC CATCAGTTCC TACCCAAGAC AGATCAATTG GAATGGAAAA CTCTGAGTGC 1560 

TACCAGAGGT AAAACTGG6T GAAGAAGGCA AGAQAAOACT TTCAGGCTCT T6ATCACCAQ 1620 

CAOSACCATA CTGTTTCAGA AAGCTGGGTG GTGGTGGAGQ OGCTCTCTCA AT60CTATTC 1680 

AAGTCTTCTC CACTA AAAC T TGGTTGGGTA GAGGAAATTA AATT6AGTCC TGGTACCTGG 1740 

TCAAAATCAT TAGAAGTTTA CCTGGCTTCT CAAGTTATCT TCTTCCCTGO TTCAQACTOT 1800 

TGGTAAGAGC TGTCCAGATA CCXaOATGGG AAGGAAG6AG ACAGOCXSOGC GCTTCACTCC 1660 

/U ATTTGTCACC TCATGCATGG ACC3VTACTCT GGGrTTTGAGA TCATTCTTCA 7TQAAGTTTG 1920 

TAAAAATAGG TTOAAATTGT AAAGCTCCAT QATCACTGCT ATATGTAGAT ATATTTCAAT 1980 

TTAAGCAAAA CAAGCTGCAA GTTATTCCCT GGCAT6CTCA AAGGATTTTC OTGCTTTTCA 2040 

CTTAATAGTC CAAAGTCTCT TAAATTCCTG CTGCAGATAT CAATAGCTTA TCTATATTCT 2100 

CAAACACCAA AAOGAAAAGT TGAATCTTGC TCTCTTTGGT ATACTAATGT AGTGGTATGC 2160 

TAAGCTGGCT CATACCAACT TAGAAAAGCT GATTQTAAAA TTTTCATTTT GACAGCTGGT 2220 

TATTAAATGC AOCCATTATT AAAAATCAAA TCATACAAAC TTATAATTAA ATGAATTACA 2260 

TTTAAAACAA AGGTAATAAA TATTCAAAGC ATATCACTTC CT 2322 
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Seq ID NO: C123 DNA Sequence 

Nucleic Acid Accession #i NM_002203.2 

Coding sequence: 43.. 3566 " 

1 11 21 31 41 51 

t t 1 I I I 
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CTGCAAACCC AGCGCAACTA COGTCCCCOG GTCAGUVCCCA GGATGGGGCX; AGAAOSGACft 60 

GGGGCOGOGC CGCTGCOGCT GCTGCTGGTQ TTAGCQCTCA GTCAAGOCAT TTTAAATTOT 120 

TGTTTGGCCT ACAATOTTGG TCTCCCAGAA GCAAAAATAT TTTCOGGTCC TTCAAGTGAA IBO 

CA QTTTCGCT ATCCAGTGOl GCAGTTTATA AATCCAAAAS QCAACTGGTT ACTOGTTGQT 240 

J TCACCCTGGA OTGGCTTTCC TGAGAA006A ATGGGAGATG TGTATAAATG TCCTGTTQAC 300 

CTATCCACTG CCACATOTQA AAAACTAAAT TTGCAAACTT CAACAAGCAT TCCAAATGTT 360 

ACTQAQATGA AAACCAACAT GAGCCTOGGC TTGATCCTCA CCAGGAACAT QGGAACTGGA 420 

GGTTTTCTCA CATGTGOTCC TCTGTGGGCA CAGCAATGTX3 GGAATCAGTA TTACACSU^OS 480 

GGT GTGTO TT CTGACATOG TCCTOATTTT CAGCTCTCAG CCW3CTTCTC ACCTGCAACT 540 

lU CAGCCCTGCC CTTOCCTCAT AGATGTTGTO GTTGTGTGTG ATGAATCAAA TAQTATTTAT 600 

CCTTGGGATG CAGTAAAGAA TTTTTTOGAA AAATTTOTAC AAGGCCTTGA TATAGGCCCC 660 

ACAAAQACAC AGGTGGGGTT AATTCAGTAT GCCAATAATC CAAGAOTTGT OTTTAACTTQ 720 

AACACATATA AAACCAAA6A AGAAATQATT GTAGCAACAT CCCAGACATC CCAATATGGT 760 

^ . G6GGACCTCA CAAACACATT GGGAGGAATT CAATATGC3UI QAAAATATGC CTATTCAGCA 840 

Ij GCTTCTGGT6 GOC6ACX3AAO T6CTA06AAA GTAAT6GTAG TTGTAACTGA OGGTGAATCA 900 

CATGATGGTT CAATGTTGAA AGCTGTGATT GATCAATOCA ACCATGACAA TATACTGAGG 960 

TTTGGCATAG CAGTTCTTGG GTACTTAAAC AGAAACGCCC TTGATACTAA AAATTTAATA 1020 

AAAGAAATAA AAGCGATCGC TAOTATTCCA ACAGAAAOAT ACTTTTTCAA TGTOTCTGAT 1080 

on Q ftAOCAG CTC TACTAQAAAA GGCTGGGACA TTAGGAGAAC AAATTTTCAG CATTGAAGGT 1140 

2U ACTGTTCAA6 GAGGA6ACAA CTTTCAQATG GAAAT6TCAC AAGTGGGATT CAGTGCAGAT 1200 

TACrCTTCTC AAAATGATAT TCTGATGCTG GGTGCAOTGO GAGCTTTTGG CTGQAGTGGQ 1260 

ACXyVTTGTCC AGAAGACATC TCATQGCC3VT TTGATCTTTC CTAAACAAGC CTTTGACCAA 1320 

ATTCTGCAGG ACAGAAATCA CAGTTCATAT TTAGQTTACT CTGT6GCTGC AATTTCTACT 1380 

_ _ GGAGAAAOCA CTCACTTTGT TGCTOGTGCT CCT0G06CAA ATTATACOGG CCAQATAGTG 1440 

Zj CTATATAGTG TGAATQAOAA TQGCAATATC AOGGTTATTC AGGCTCACOG AGGTGACCAG 1500 

ATTGGCTCCT ATTTTGGTAG TGTGCTGTGT TCAGTTQATG TGQATAAAGA CACCATTACA 1560 

GACGTGCTCT TGGTAGGTGC ACCAATOTAC ATGAGTGACC tAAAGAAAGA GGAAGGAAGA 1620 

GrCTACCTGT TTACTATCAA AAAG06CATT TTGGGTCAGC ACCAATTTCT TOAAGGCCCC 1680 

6AGGGCATTQ AAAACACTCX3 ATTIG8TTCA GCAATTGCAO CTCTTTCAGA CATCAACATG 1740 

OATGOCTTTA ATGATGTQAT TGTTGGTTCA CCACTAOAAA ATCAQAATTC TGGAGCTOTA 1800 

TACATTTACA ATGGTCATCA GGGCACTATC CGCACAAAGT ATTCCCAGAA AATCTTGGGA 1860 

TCCGATGGAG CCTTTAGGAG CXIATCTCCAG TACTTTGGGA GGTCCTTGGA TGGCTATGGA 1920 

6ATTTAAATG 6GGATTGCAT CACCGATGTQ TCTATTGGTG CCTTTGGACA AGTGGTTCAA 1980 

-jc CTCTGG TCAC AAAGXATTOC TOATOTAGGT ATAQAAGCTT CRTTCACACC AGAAAAAATC 2040 

ACTTTGOTCA ACAAGAATGC TCAGATAATT CTCAAACTCT 6CTTCAGTGC AAAOTTCAGA 2100 

CCTA CTAAGC AAAACAATCA AGTGGCCATT GTATATAACA TCACACTTGA TGCAGATGGA 2160 

TTTTCATCCA GAGTAAOCTC CAGGGGGTTA TTTAAAQAAA ACAATGAAAO GTGCCTGCAG 2220 

AAGAATATGG TAGTAAATCA AGCACAQAGT T6CXX:0GAGC ACATCATTTA TATACAGGAG 2280 

CCCTCTGATG TTGTO^ACTC TTTGGATTTG 06TGIGGACA TCAGTCTGGA AAACCCTGGC 2340 

4U ACTAGCCCTQ CCCTTGAAGC CTATTCTGA6 ACTGCGAAGG TCTTCAGTAT TCCTTTCCAC 2400 

AAAQACTGTG GTGAGQATGG ACTTTGCATT TCTGATCTAG TCCTAGATGT CXXSACAAATA 2460 

CCAGCTGCrC AAGAACAACC CTTTATTGTX: AGCAACCAAA ACAAAAGGTT AACATTTTCA 2520 

GTAAC ACTGA AAAATA AAAG GQAAAGTQCA TACAACACTG GAATTGTTGT TGATTTTTCA 2580 

GAAAACtTGT TTTTTGCATC ATTCTCCCTA CXX3GTTGATG GGACAGAAGT AACATGCCAG 2640 

4 J GTGGCTGCAT CTCAQAAGTC TGrTGCCTGC GATGTAGGCT ACCCTGCTTT AAAGAOAGAA 2700 

CAACAGGTOA CTTTTACTAT TAACTTTGAC TTCAATCTTC AAAACCTTCA GAATCAGGCO 2760 

TCTCTCAGTT TCCAAGCCTT AAGTGAAAGC CAAGAAGAAA ACAAGGCTGA TAATTTGOTC 2820 

A ACCTCAA AA TTCCTCTCX:rr GTATQATGCT GAAATTCACT TAACAAGATC TACCAACATA 2880 

AATTTTTATO AAATCTCTTC GGATGGGAAT GTTCCTTCAA TOGTGCACAO TTTTGAAOAT 2940 

0\) GTTG GTCa VA AATTCATCTT CTCCCTGAAQ GTAACAACAG GAAGTGTTCC AGTAAGCATG 3000 

GCAACTGTAA TCATCCACAT CCCTCAGTAT ACCAAAGAAA AGAACCCACT GATGTAOCTA 3060 

ACTGGGGTXK: AAACAGACAA GGCTGGTGAC ATCAGTTGTA ATGCAGATAT CAATCCACra 3120 

AAAATAGQAC AAACATCTTC TTCTGTATCT TTCAAAAGTG AAAATTTCAG GCACACCAAA 3180 

GAATTQAACT GCAO AACT GC TTCCTGTAGT AATGTTACCT GCTGGTTGAA AGACGTTCAC 3240 

DD ATGAAAGGAS AATACTTTGT TAATGTGACT ACCAGAATTT GGAAOGGGAC TTTOGCATCA 3300 

TCAAOrrTCC AOACAGTACA GCTAACX3GCA GCTGCAGAAA TCAACACCTA TAACCCTOAO 3360 

ATATATGTGA TTGAAQATAA CACTOTTAOG ATTCCCCTGA TGATAATGAA ACCTGATGAO 3420 

AAAOCOQAAG TACCAACAGG AGTTATAATA GGAAGTATAA TTGCTGGAAT CCTTTTGCTG 3480 

TTA6CTCTG6 TTGCAATTTT ATGQAAGCTC GGCTTCTTCA AAAGAAAATA TGAAAAOATO 3540 

OU ACCAAAAATC CAGATGAGAT TGAT6AQACC ACAGAGCTCA GTAOCTGAAC CAQCAGACCT 3600 

ACCTGCAGTG GGAACOGGCA GC3VTCCCAGC CAGGGTTTGC TGT i 'ma i ' l Xt CATGGATTTC 3660 

TTTTTAAATC CCATATTTTT TTTATCATGT OJtAGGTAAA CTAACCTOOT ATTTTAAGAO 3720 

AAAACTGCAG GTCAGTTTGG ATGAAGAAAT TGTGGGGGGT GGGGGAGGTG OGGGOGGCAG 3780 

M AATAGGGAAA ATACCTATTT TATATQATGG GGQAAAAAAA GTAATCTTTA 3840 

OD AACTQGCTGG C0CA3AGTTT ACATTCTAAT TTGCATTGTG TCAGAAACAT GAAATGCTTC 3900 

CAA GCAT GAC AACTTTTAAA GAAAAATATG ATACTCTCAG ATTTTAAGGG GGAAAACTGT 3960 

TCTCTTTAAA ATATTTGTCT TTAAACAGCA ACTACAGAAG TGOAAOTGCT TGATATOTAA 4020 

GTACT TCCAC TTGTGTATAT TTTAATGAAT ATTGATGTTA ACAAGAGGGG AAAACAAAAC 4080 

„ ACA60TTTTT TCAATTTATG CPGCTCATCC AAAGTTGCCA CAGATGATAC TTCCAAGTOA 4140 

/U TAATTTTATT TATAAACTAG GTAAAATTTG TTGTTGGTTC CTTTTATACC ACGGCTGCCC 4200 

CTTCCACACC CCATCTTGCT CTAATGATCA AAACATGCTT GAATAACTQA GCTTAGAGTA 4260 

TACC TCCTAT ATGTCCATTT AAGTTAGGAG AOGGGGCGAT ATAGAOACTA A60CACAAAA 4320 

TTTTGTTTAA AACTCAQAAT ATAACATTTA TOTAAAATCC CATCTGCTAO AAGCCCATCC 4380 

— « TGTGCCAGAG GAAGGAAAAG GAGGAAATTT CCTTTCTCTT TTAG6AGGCA CAACAGTTCT 4440 

/D CTTCTAGGAT TTGTTTQQCT aACTGGCAQT AACCTAGTGA ATTTTTGAAA GATGAGTAAT 4500 

TTCTTTGGCA ACCTTCCTCC TCCCTTACTG AACCACTCTC CCACCT C CTB GTGGTACCAT 4S60 

TATTATAGAA GCCCTCTACA OCCTGACTTT CTCTCCAGCG GTCCAAA8TT ATOCCCTCCT 4620 

TTACCCCTCA TCCAAAGTTC CCACTCCTTC AGGACAGCTQ CTGTGCATTA GATATTAGGG 4680 

GGGAAAGTCA TCTGTTTAAT TTACACACTT GCATOAATTA CTGTATATAA ACTCCTTAAC 4740 

oU TTCAGGGAGC TATTTTCATT TAGTGCTAAA CAAGTAAGAA AAATAAGCTA GAGTGAATTT 4800 

CTAAATGTTG GAATOTTATG GGAT6TAAAC AATGTAAAGT AAAACACTCT CAGGATTTCA 4860 

CCAGAAGTTA CAGATGAGGC ACTGGAAACC ACCACCAAAT TAGCA6GTGC ACCTTCTGTG 4920 

GCTGTCTTGT TTCT6AAGTA CTTTTTCTTC CACAAGAGTG AATTTGAGCT A6GCAA6TTT 4980 

GTTCAAAAGG TAGATCCTGA GATGATTTGG TCAGA7T6GQ ATAA06C0CA GOVATGTtXA 5040 

1279 



wo 03/042661 



PCTAJS02/36810 



10 



15 



20 



25 



35 



40 
45 
50 
55 
60 
65 
70 
75 
80 



TTTTAACRAG CACCCCAGTC ACTAGGATGC AGATGQACXa CACTTTGAGA AACACCACCC 5100 

ATTTCTACTT TTTGCACCTT ATTTTCTCTG TTCCTGAGCC CCCACATTCT CTAGGAQAAA 5160 

CTT AGATT AA AATTCACAGA CACTACATAT CTAAAQCTTT GACAAOTCCT TGACCTCTAT 5220 

AAA CTTCAG A GTCCTCATTA TAAAATGGGA ASACTGAGCT GGAOTTCAQC AOTQATGCTT 53 BO 

TTTAGTTTTA AAAOTCTATG ATCTBATCTG GACTTOCTAT AATACAAATA CACAATCCTC 5340 

CAAGAATTTG ACTTGGAAAA G 5361 

Seq ID NO: C124 DNA Sequence 
Nucleic Acid Accession Ui im_031460 
Coding sequence i 103..1101 ~ 



X 11 21 31 41 SI 

I I I I I I 

AGCAG6GGTT TGCGAGA6GA GATACGAGCT GGAOGCCTGG CCCTTCCCTC CCACCGGGTC 60 

CTAGTCCACC 6CTCC06GC0 CCGGCTCCCC OCCTCTCCCQ CTATGTACCG ACOQCGAGCC 120 

CGGGOSGCTC CCGAGGGCAG GGTCCGGGGC TGOQOGGTGC CCAGCACCGT GCTCCTGCTG 180 

CTCGCCTACC TGGCTTACCT GGOGCTGGGC ACaJGOQTGT TCTGGACGCT GGAGGGCCGC 240 

GCG6O6CAG0 ACTCCAGCCG CAGCTTCCAG CGOGACAAGT GGQAGCTGTT GCAOAACTTC 300 

AOQTGTCTOG ACOGCCCGGC GCTGGACTCG CTOATCOGGG ATGTCQTCCA AGCATACAAA 360 

AA066AGCCA GCCTCXn^CAG CAACACCACC AGCATGGOQC OCTGGGAGCT GOrGGGCTCC 420 

TTCTTCTTTT CTGTGTCCAC CATCACCACX; ATTGGCTATG QCAACCTGAO CCCCAACAOQ 480 

ATGGCTGCCC GCCTCTTCTG CATCTTCTTT GCCCTTGTGO GQATCCCACT CAACCTOGTG 540 

GTGCTCAAOC GACTGGGGCA TCTCATGCAG CAGGGAGTAA ACCACTGQOC CA6CAGGCTG 600 

GGGGGCACCT 6GCAGGATCC TQACAAGOOO CG6TG6CTGG GGGGCTCT6G OGCCCTCCTC 660 

TCGGGCCTCC TGCTCTTCCT GCTGCTGCCA CCGCTGCTCT TCTCCCACAT GGAGGGCTGG 720 

AGCTACACAO AOGQCrTCrA CTTOGCCTTC ATCACCCTCA GCACXSTGGG CTTCX3G0GAC 780 

TACGTGATTG GAATGAACCC CTCCC3\GAG0 TACCCACTOT GOTACAMSAA CATGGTGTCC 840 

CTOTOOATCC TCTTTGGGAT GGCATGGCTG GCCTTGATCA TCAAACTCAT CCTCTCCCAG 900 

CTGGA0AC3GC CAGGGAGGGT ATOTTGCTGC TGCCACCACA GCTCTAAGOA AGACTTOVAG 960 

3U TCCCAAAGCT 6QAGACAGGG ACCTGACOGG GAGCCAGAOT CXXACTCCCC ACAGCAA6GA 1020 

TGCTATCCAO AGGGACCCAT OGOAATCATA CAGCATCTOQ AACCTTCTGC TCA06CT6CA 1080 

GGCTGTGGCA AGCSACAGCTA GTTATACTCC ATTCTTTGGT OSTCG TCC TC GOTAGCAAOA 1140 

CCCCTOATTT TAAGCTTTGC ACATGTCCAC CCSUUVCTAAA GACTACATTT TCCATCCACC 1200 

CTAGAGGCTG GGTGCAGCTA TATGATTAAT TCTGCCC3UVT AGGGTATACA GAGACATGTC 1260 

CTGOOTGACA TGOaATGTGA CTTTCQGGTQ TCGGGGCAGC AT6CCCTTCT CCCCCACTTC 1320 

CTTACTTTAG CGGGCT6CAA TGCCGCCGAT ATGATGGCTG GQAGCTCTGO CA6CCATAGB 1380 

GCACCAT6AA 6TAGCGGCAA TGTTTGAGGG GCACAATTAG ATAGGAAGAG TCTGGATCTC 1440 

TGAT6ATCAC AOAGCCATCC TAACAAAOGG AATATC3VCCQ ACCCTCCTTT ATGTGAGAGA 1500 

GAAATAAACA TGTAT6AAA 1519 



Seq ID NO I C125 DMA Sequence 
Nucleic Acid Accession #i NM_004154 
Coding sequence: 309. .1295 " 

1 11 21 31 41 51 

I I I I I I 

AAGGACAGAG QAG6G6CCCT TCCTGTCftGC TGGCTG OG AG CAGAGGTGGC TTT G T C TTTT 60 

CG6AAGAACT G6TTCTGTGQ AATTTGTGCT TATTTCCCAT CAAGGATCAA GGACCTGCTC 120 

TQGGaCTACC TCAGGGCCCC ACAGGATC3AG GGGCTGGTTT TCAGATGAGT TTTCTGCTTG 180 

CCTGTCATCT GGATA6TGTC TAAAAATTTG CAAACTGCCT TCTTGTCAGT GTCTTGCTCA 240 

TTCTTCATGA CACTOCTGAT ATGTCTCTCA GTTTOCrCAT CT6CT6CCTC TCCAGACTTC 300 

TGCCAGAACA TTGCAOGCQA CAGTTTCAGO GAGAOAACTG ACTG0CA6CA GG6GCTGCTC 360 

CACGAGTGGQ AATTTGCTCC AGCACTTCAC GGACTGCAAG CGAGGCACTT GCTAACTCTT 420 

GGATAACAAO ACCTCTGCCA GAAGAACCAT GGCTTTGGAA GGCGGAGTTC AG6CTGAGGA 480 

GATGGGTGCO OTCCTCAGTG AGCCCCTGCC TCCCTGAACA TAGGAAACCC ACCTGGOCAO 540 

CXATGOAATG GQACAATGGC ACAGGOC3tf30 CICTGG6CTT 6CCACCCACC AGCTGTGTCT 600 

AC0G06AGAA CTTCAAGCAA CT O CT GC TGC CACCTGTOTA TTCGOOGOTG CIGGOGGCTO 660 

GCCTGOCGCT OAACATCTGT OTCATTACCC AQATCTGCAC GTCCCGCCGG GCCCTQACCC 720 

GCACGGCCGT GTACACCCTA AACCTTGCTC TOGCTGACCT OCTATATGCC TGCTCCCTGC 780 

CCCTOCTCAT CTACAACTAT GCCCAAGGTG ATCACTG6CC CTTTGGOGAC TTCGOCT6CC 840 

6CCTG6TCCG Cl ' i 'C L T Cn ' C TAIGOCAAOC TGCAOOQCAO CATOCTCTTC CTCACCTGCA 900 

TCA6CTTCCA GOGCTACCTG QQCATCrOCC ACC06CTGGC CGCCtGGCAC AAAOOTOGQG 960 

GCCGC0GG6C TOCCTGGCTA G TQTGTQTAG CCQTGTGOCT GGCCGTGACA ACCCAGTGCC 1020 

TGCCCACAGC CATCTTOGCT GCCACAGGCA TCCAGCGTAA COQCACTGTC TQCTATGACC 1080 

TCAGCCCGCC TGCCCTGGCC ACCCACTATA TGCCCTATGG CATG6CTCTC ACTGTCATCG 1140 

GCTTCCIQCT OCCCTTTOCT GCCCTGCTGG CCTGCTACIG TCTOCI06CC TGCCGCCTOT 1200 

GCCGCCAGGA TGGCCCGGCA QAGCCTOrGG OCCAGGAGOG GOGTO G CAAG GGGGCCGGCA 1260 

TGGCOGTGGT GGTGGCTGCT GCCTTTGCCA TCAGCTTCCT GCCTTTTCAC ATCACCAAGA 1320 

CAGCCTACCT GGCAGTGCGC TOGACGCOGG GCGTCCCCTG CACTGTATTG GAGOCCTTTG 1380 

CAGCGGCCTA CAAAGGCACG CGGCOQTTTO CCAQTGCCAA CAGCGTQCTG GACCCCATGC 1440 

TCTTCTACTT CACCCAGAAG AAGTTCOGCC GGOGACCACA tOAGCTOCTA CAGAAACTGA 1500 

CAGCCAAATQ GCAGAGGCAG GGT O GCTGAO TCCT C CAGGT CCTGG6CA0C CTTCATATTT 1560 

GCCATTGTGT CCGGGGCACC AGGAGCCCCA CCAACCCCAA ACCATGCGGA GAATTAGA6T 1620 

TCAQCTCAGC TCGGCATGC5A QTTAAGATCC CTCACAGGAC CCAGAAGCTC ACCAAAAACT 1680 

ATTTCTTCAG CCCCTTCTCT G6CCCAGACC CTGTGGOCAT GGAGATGGAC AGACCTGGGC 1740 

CTGGCTCnG AGAG6TCGCA GTCAGCCATG GAQAGCTGGG GAAACCACAT TAAGGTGCTC 1800 

ACAAAAATAC AGTGTGAC6T GTACTGTCAA AA 1632 

Seq ID NO: C126 ONA Sequence 
Nucleic Acid Accession ft; NM_007197 
Coding sequence: 18.. 1763 

1 11 21 31 41 51 

I I i I I I 
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ACACX3TCCAA 
TGGGCTOGTQ 
ACCXTCATCCSA 
TGATGGGCO^ 
TGGAGTAOGO 
GCACOQAGCA 
TCAAGTGCTC 
6GAAACTCCC 
CGGACGAGCC 
ACAGCGCGCA 
OGGGCAAGTT 
TGGACGTGTA 
CGQTGCTGTG 
GCTTCCGCTA 
TGGGCTACCT 
GCCAGCTCTA 
TCCTCTACTA 
TCCTGGC06C 
ACCTGGCAQC 
TGGOGGGGGA 
CXX3GCTTCGT 
GGGGCTTOGT 
ACAAOCTGGA 
CCACCTGTGT 
TGGOGGCGCA 

ccGCxrrccAT 

GOATCACCAO 
GCAGCOGTAO 
GGATTTACAA 
CCCAGTCGCC 
TAAGATQTGO 
ATAAAAGCAA 
ACACTGAAAG 
AAGCTCCTCC 
CCCTCAGAAG 
GGCTTT6CTA 
OCCCrSGGTT 
GOGGCATCCC 
GAGTTGCTTA 
ATCAAACCTC 
AAATAAAAAO 
AAAAGAACTA 
AT6QAAACCA 
G0GAGAA66A 
AAT6ATTTCC 
TTCTCTTCAC 
TTTTTCATGC 



OGCCAGCATG 
CGCCGCCATC 
GATCC0GAT6 
OGAGAACCAO 
CTGCCAOGGC 
GGTCTCTACC 
CC06ATTATG 
CAACAAGAAC 
CAOOOGGGGC 
GGA6CACCXX3 
CCACCAOOTG 
CTGGAGCXX5C 
CTTCTTCTCC 
CCCOQAGCOC 
CATCCGCCTC 
TGTCATCCAG 
CTTCGGCATG 
CX3GCAAGAAG 
CTGGGCXyiTC 
CGAQCrCACC 
GCTCATTCCC 
GGCCCTGTTC 
GAAGCTCATG 
GATOGCCTGC 
GCACAAGTGC 

CGGGATGTGG 
OTTAAAGAAG 
AAAAOCCCAO 
CACCTG06TG 
TGCTTTTCTT 
AAOAGAAATA 
QAAAAATGTA 
AGTGAAGTAO 
AAAACTTTTO 
CCCATTTACA 
AACAAATQAO 
TGCAGAGGAA 
AAATAGACTC 
ACATTTGT6A 

agaaacaagt 
agaggcxx:ag 

A6CCAATGTT 
TCATTCAAAA 
ACAAACAQAT 
AOTGCCAGQA 
TCCACTATTT 



CAGOGCCOGG 
AGCTCCATGG 
TGCAAGGACA 
G6GQA0GCAQ 
CACCTCOGCT 
CCCATCCC06 
GAGCAGTTCA 
GACCCCAACT 
T OGGGCCTGT 
CTGAAGGACG 
GAGAAGAGCO 
GAGGACAAGC 
AG06CCTTCA 
CCCATCATCT 
TTCGCCGGCG 
GAGGGACTGG 
GCX^GCTOGC 
TGGGGCCACG 
C06GCGGTGA 
6GGGTCTGCT 
CTGGCCTGCT 
CACATCCGGA 
0TGCGTATCX3 
TACTTTTACG 
AAAATGAACA 
GAGATCTTCA 
ATTTGQACCT 
AAGAGCCGGA 
CATCCCCAGA 
TGAACA66GC 
GGTTGTGTTT 
CATAAAAAAG 
CTTAAAGGGT 
CCTCTTOTGT 
TTTAGAGCCC 
AATAAGAGQA 
CXZATCCCCAG 
AGACA6GACC 
TGGCCTTCAC 
ACTCAAACGA 
ATTTTGCTAT 
CCCrCAGAAA 
ATAGACXvTTT 
GTTACCCAAA 
CAGGAAGCAC 
AAaAGTGGTT 
TATTAAAAAT 



GCCCCCGCCT 
ACATGQAGCG 
TGGGCTACAA 
OCATCCAQIT 
TCrrCCAtiT G 
CCTGCCGQGT 
ACTTCAAGTG 
ACCTGTGCAT 
TCCCGCCGCT 
GGG6CCCCX3G 
OOTOGTGOOC 
GCTTOGCAGT 
COGTGCTCAC 
TCCTCTCCAT 
COGAGAGCAT 
AGAGCACOGG 
TGTGGTGGGT 
AGGCCATOGA 
AGACCATCCT 
AOGTGGGCAG 
ACCTGGTCAT 
GGGTGATGAA 
GGCTCTTCTC 
AACGCXnxaA 
ACXrAGACTAA 
TGGTGAAGAT 
CCAAGACTCT 
GAAAACCX3GC 
AAACTCACCA 
T6GAGGGAA6 
TTCTTTCTTC 
TGTTTACCCT 
TTTGTTTT6T 
AACTAATTTG 
TCCQTAAATA 
CAGATAACTG 
GGOCCACCCC 
COGGGCCCGC 
CAATAGTCTC 
TGT6CAATAC 
ATATAAAGAC 
CCCTTCAGTG 
GGACTGATTT 
GGGCTTATTG 
TAGGTTGGCA 
TCTGCGT6TG 
AAAATATQ'IT 



GTGGCTGGTC 
CCCX^OQAC 
CATGACTOGT 
OCAGQAOTTC 
CTOGCTGTAC 
CATGTGCGAG 
GCCCGACTCC 
GGAGGOGCCC 
GTTCOGGCCG 
GCG0G606GC 
GC06CTCTGC 
GGTCTGGCTG 
CTTCCTCATC 
GTGCTACTGC 
CGCCTGCXSAC 
CTGCACX5CTG 
GOTCCTCACG 
AGCCAACAGC 
GATCCTGGTC 
CATGGACGTC 
OGGCACGTOC 
GA0GG60GGC 
TGTGCTGTAC 
CATGQATTAC 
AACG CTGGAC 
CTTTATGCTG 
GCAGTCCTGG 
CAG0GT6ATC 
CGGGAAATAT 
GGCACAGGGG 
TTCTTCTTTT 
QAAATTCAGO 

TGGTAAAGTA 
TACATCTGTG 
CTTTGCAAAT 
CAGGAAGQCC 
CTCACACCCC 
TCTGCAAGAC 
ATTTTTTTCT 



CTACATTTTG 
GTGGAAAG6A 
ACTCTTTCTA 
GAGACACTTT 
TATATTTGTA 
CTTTAAAAAA 



CTGCAG6TGA 
GOCAAATGCC 
ATGCOCAACC 
GCaCOQCTGG 
GCGCOGATGT 
CAGGCOCGGC 
CTGGACTGCC 
AACAAOGGCT 
CAGCGOGOCC 
TGCGAC3\ACC 
ACGCCCGGOG 
GCCATCTGGG 
GACCCG6CCC 
GTCTACTOCG 
CGGGACAGCG 
GTCTTCCTGG 
CTCAOCTGGT 
AGCTACTTCC 
A7GCGCAGGG 
AAOG06CTCA 
TTCATCCTCT 
GAGAACACG6 
ACCGTGCCX3G 
TGGAAGATCC 
TGCCTGATGG 
CTGGTGGT6G 
CAGCA6GTGT 
ACCAGCGGTG 
GAGATCCCTQ 
OGCCCGGAGC 
TTTTrrTTTT 
ATGCTOTGAT 
CAGOGAAGGG 
GTTGATTCAG 
TATTTGAGTT 
TCAAGAGCCT 
ACAOTGCrGG 
AGTGGATTTG 
AGAAACCTCC 
CTTTCCTTGA 
ATCrCCTAAC 
TGGCTTTTTA 
G6GGGGAAGA 
TTGTTAAACA 
GTCTAGTGTA 
ATATATOATA 
A 



Seq ID MOt CI 2 7 DNA Sequence 

Nucleic Acid Accession #: NM_005761.1 

Coding sequence » 2 50.. 4956 



1 
I 

GCOAGGAGGA 
CTGCOGGAGC 
GGATGGGGCG 
CTCCGTTGCC 
CCCfiGCCCCA 
CCACTGCCCC 
CC06TGTG6C 
TTTGTG606A 
OGCCTGTACC 
OGGCCCCGGC 
GGGCTCGGGO 
CCOCTQGGCA 
COGCAGGGCT 
60GGTGGC00 
GCATCCGACC 
CAGGAGCTGG 
TTTCTCTGGA 
GCCAGOGGCT 
CAGGCATCCC 
AGCCTAGTGG 
CAGGAGCGGC 
GGGCX30GCCA 
GATCAAGCTG 
TCCGTTTATG 
CAGTTACTTA 
GAAATTAAAG 
TACATTTATC 
CATAAATCCT 
CTACAAAGGT 
GATATTTOGT 



11 

i 

AACGOTGCOG 
6AGCCTGC06 
GCOGGGGGAG 
GCX306CCTGA 
TGGAGGTCTC 
TGCTCGOCTA 
GGTCGGA6CA 
GOGGCAGCTG 
GGGACCAAGC 
CCGGGAGCAG 
GGCTGCTGCr 
AGCTOAGCOO 
0GACSGGCXX3G 
CCACCTACOT 
ACGACACGGC 
GGCX3CCTCAA 
ACG6CAGCAT 
GGCCCAGCAT 
TOQACTGCXSG 
AGGCCCTGGA 
GCTCCOCCAC 
AGAGGGTCAO 
AAAQAOTCCA 
GCACXX3TGGT 
AGGTTATTCT 
AAGAGACACC 
TAACAOCTGO 
GTTCGGAGTO 
GCACTITTCA 
CTGGAGCAAA 



21 

1 

GAGOGOOCAG 
CGOGOOGGOC 
CCGGAGCGGG 
GCCGCCX3T06 
CCGGA6GAAG 
TCTGCTGGCA 
AGCCATGGGA 
CCTGGACCAG 
66GCAACTGC 
CTTCAGCAAO 
CAC066CTGG 
CAACTCCCTG 
0GTG6TGTAC 
GCTGCCTGAG 
CATOGOGCTC 
GCTGT60GAO 
CTACTTCCCC 
GGG6GGCATC 
CCAOGGCCAC 
OGTCTGGGOG 
CACCAOGGCO 
CTGGQACTTC 
AOCAATOOCA 
AATGAACAGG 
TGGTGAGAAT 
TGTTTTCTAC 
OAAAOAOOTG 
TTTAACA6CC 
AGGAGATTGT 
AAA6T6CCCT 



31 
I 

GGCTTGCTGC 
TCCCOGCTCT 
OQCAGGAACC 
CCGCCGCGOG 
GCGCCX3CCGC 
CTGGOGGCTC 
OGCAT060GG 
CT6GACTACA 
ACA6AGCGGG 
CTGCTGCTGC 
ACCTTOGACC 
CCX3\A0Q6CA 



CGGGAGAOQG 
AAGGACACGG 
GGCG06GGCA 
TACTACOCCT 
GC6CAQAGCA 
CCOGAOGGCC 
GGAGTGTTCA 
CTCTQCCTCT 
AAGAOG6000 
TCATCTACCT 
ACTGTTTTAT 
TTGACTTCAA 
AAACTOGTTC 
AGGAGAATTC 
ACAQAOCCTC 
6TACATTCAG 
AAAATTCAGA 



41 

I 

CX3CCACCGCC 
CCTTCCTGGG 
GCGGCCGCCQ 
CCCTGCCOGG 
GCCCCC06CG 
CC6GCCX3GGG 
GGAGGCA6GA 
6CCT6GA6CA 
TCTOGCTGGC 
OCTACOGCGA 
GGGGCXKX^ 
COQAGGTGGT 
GGAACAACOG 
GGAGCOGCTG 
AGGGGOGO^ 
6CCTGCACTT 
ACAACTATAC 
C0QAG6TGCT 
GCOQCCTGCT 
GC6CGGC0GC 
TCAGAATGAG 
AOAGCCACTG 
TGATCCATTC 
TCTTGGGQAC 
ATTGTCCAGA 
CTQATCCTGT 
GT6TTGCAAA 
ACTGOOOTTG 
AGAACTTAGA 
TAATTGQAAO 



60 

120 

180 

240 

300 

360 

420 

4B0 . 

S40 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2S80 

2640 

2700 

2760 

2811 



51 
I 

GCTGCACAGG 
GGAGCTGCGG 
CGGCCOGGGT 
GGG06GCCCC 
CCC0GCA6CG 
CGCGGACGAG 
GGA0G6O6TG 
CAGCCTCTOG 
GCCCCOCGOG 
GGGGGOGGCC 
CGAGGTXK3GG 
GTCGTGCCAC 
CT6GTACCTO 
CAACCC0G08 
CCTGGCCAOG 
06TGGA06CC 
GAG0GG06CT 
GTTCCA8GGC 

TGGAGAGGGC 
TGAGATCCAG 
CAAAQAAGGO 
06A0CTQACA 
TGGAGATGGC 
GGTTATCTAT 
GAAGAATATC 
CTGCAATAAA 
GTQOCATTGG 
AAACTGOCTG 
CAGTAAA6AA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1360 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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wo 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



AAGJVCTAOVO TGACTAT6GT GQQAAGCTTC TCTGCAA6AC ACTGAAAOTO CATOGTQAAG 
AAT6TGGACT CTAGCAGG6A GCTCTGOCAG AATAAAAGTC A6CCCAACCG GACCT GCA OC 
TOTAGCATCC CAACCAGAGC AACCTACAAA GATGTTTCAG TTGTCAAOGT GATGTTCTCC 
TTOGGTTCTT GGAATTTATC AGACAGATTC AACTTTACCA ACTGCTC3VTC ATTAAAAGAA 
TGCXXIAGCAT 6CGTAGAAAC TQGCTGC3GC0 TCGTGTAAAA GT6CAA6AAG GT6TATGCAC 
CGCTTCACM3 CTTQOGACOC TTCTQATTAT GAGM3AAACC AGQAACAOXG TCCAGTGOCT 
GTCGAGAAGA CATCAGGAG6 AGGAAGACGC AAGGAQAACA AGG6GAACAG AACCAAOCAG 
OCTTTACAQO TCTTCTACAT TAAGTCGATT GAGOCACAGA AAGTATCGAC ATTAGGGAAA 
AGCAAOGTGA TAGTAACGGG AGCAAACTTT ACCCCGGCAT CX5AACATCAC AATGATCCTG 
AAAGGAACCA GTACCTGTGA TAAGGATGTO ATAGAGGTTA GCCATGTGCT AAATGACACC 
CACATGAAAT TCTCTCTTCC ATCAAGC0G6 AAA6AAATGA AGG A TGTGT G TATCCAGTTr 
QATGGTGGGA ACTOCTCTTC TGTGGGATOC TTATCCTACA TTGCTCTGCC ACATTGTTCC 
CTTATATTTC CTOCTACCAC CTGGATCAGT GGTGOTCAAA ATATAACCAT GATGGGCAGA 
AATTTTGATO TAATTGACAA CTTAATCATT TCACATQAAT TAAAAGGAAA CATAAATGTC 
TC7GAATATT GTQTGGOGAC TTACIGOGGQ TTTTTAGCCC CCAQTTTAAA GAOTTCAAAA 
6TG0GCAGGA ATGTCACTGT GAAGCTGAQA GTACAAGACA CCTACTT60A TTQTG6AACC 
CTGCAGTATC GGGASQACCC CAGATTCACG GQGTATCGGO TGGAATCOQA GQTGQACACA 
OAACTGOAAO TGAAAATTCA AAAAOAAAAT GACAACTTCA ATATTTCCAA AAAAGACATT 
GAAATTACTC TCTTCCATGQ GGAAAATGGG CAATTAAATT GCAGTTTTGA AAATATTACT 
AGAAATCAAQ ATCTTACCAC CATCCTTTGC AAAATTAAAO GCATCAAGAC TGCAAGCACC 
ATTOCCAACT CTTCTAAGAA AGTTCG6GTC AAGCTGGQAA ACCT0GA6CT CTAOGTGGAG 
CAOOAOTCAO TTCCTTCCAC ATOOTATTTT CTOATTOTGC TCOCTXSri'Crr GCTAGTGATT 
GTCATTTTTG CGGCOSTGGG GGTGACCAGG CACAAATOGA AGGAGCTGAO TOQCAAACAG 
AGTCAACAAC TAGAATTGCT GGAAAGC6A0 CTCCXS6AAAG AGATAGGTGA OGGCTTTGCT 
GAGCIGCAGA TGGATAAATT GGATGTGGTT GATAGTTTTG GAACTGTTCG CTTCCTTGAC 
TACAAACATT TT6CTCTGAG AACTTTCTTC CCTGAOTCAG GTGGCTTCAC CCACATCTTC 
ACTGAAOATA TGCATAACAG AGAOGOCAAC GACAAQAATG AAAGTCTCAC A6CTTTGGAT 
GCCCTAATCT GTAATAAAAG LTri ' LTl ' Urr ACTGTCATCC ACACCCTTGA AAAGCAGAAQ 
AACTTTTCTG TGAAGGACAG c ro 'l-ClXyrTf GCCTCXTTTCC TAACCATTGC ACTGCAAACC 
AAGCTGGTCT ACCT6ACCAG CATCCTAGAG 6TGCT6ACCA GG6ACTTGAT GGAACAGTGT 
AGTAAGAT6C A6CCGAAACT CAT6CT6A6A OGCAOGGAGT CC8TC0TGQA AAAACTOCTC 
ACAAACTGGA TGTCCGTCTG CCTTTCTGGA TTTCTCCX3GG AGACT6TGGG AGA6GCCTTC 
TATTTGCTGO TGAC3GACTCT GAACCAGAAA ATTAACAAGG GTCCCGTGQA TGTAATCACT 
TGCAAAOCX^: TGTACACACT TAATQAAGAC TGGCTGTTOT GQCAGGTTCC QGAATTCaOT 
AGTGTGGCAT TAAACGTOGT CTTTGAAAAA ATCCC6GAAA A0GA6A6TGC A6ATGTCTGT 
OKSAATATTT CAGTCAATCIT TCTCGACTGT GACAC CATTO OCCAAOGCAA AGAAAAOATT 
TTCCAAGCAT TCTTAAGCAA AAATG6CTCT OCTTATGGAC TTCAGCTTAA TGAAATT60T 
CTTQAGCTTC AAATGGGCAC ACX3ACAGAAA GAACTTCTGG ACATCGACAG TTCCTCa5TG 
ATTCTTGAAG ATGGAATCAC CAAGCTAAAC ACCATTGGCC ACTATGAQAT ATCAAATGGA 
TCX!ACTATAA AAGTCTTTAA GAAGATAGCA AATTTTACIT CAGATGTOQA GTACTCGGAT 
GAOCACTOCC ATTTGA'^rTT ACCAGATTOG GAAGCATTCC AAQATUTUCA AOQAAAGAOA 
CATG6AGGGA AGCACAAGTT CAAAGTAAAA GAAATGTATC TGACAAAGCT GCIQTOGACC 
AAGGTGGCAA TTCATTCTGT GCTT6AAAAA CTTTTTAGAA GCATTTGGAG TTTACCCAAC 
AGCAGAGCTC CATTTGCTAT AAAATACTTT TTTGACTTTT TGGACGCCCA GGCTGAAAAC 
AAAAAAATCA CAGATCCTGA GGTOSTACAT ATTTOGAAAA CAAACAGCCT TCCTCTTOGC 
TTCTQGGTAA ACATCCTGAA GAAOCCTCAO ' iTl ' U T LTrm ACATTAAGAA 6ACACCACAT 
ATAGACX36CT 6TTTGTCAGT 6ATTGCCX»G GCATTCATGQ ATGCATTTTC TCTCACAGAG 
CAGCAACTAG GQAAGOAAOC ACCAACTAAT AAGCTTCTCT ATGCCAAQOA TATCCCAACC 
TACAAAGAAG AAGTAAAATC TTATTACAAA GCAATCAGGG ATTTGCCTCC ATTGTCATCC 
TCAGAAATGG AAGAATTTTT AACTCAGGAA TCTAAGAAAC AT6AAAAT0A ATTTAATQAA 
GAAOTGGOCr T6ACAGAAAT TTACAAATAC ATCGTAAAAT ATTTT6ATGA GATTCTAAAT 
AAACTAGAAA GAGAAOGAGO GCTGGAAGAA GCTCAGAAAC AACTCTTGCA TGTAAAAGTC 
TTATTTGATQ AAAAGAAGAA ATGCAAGTGO ATGTAAGCAC TCTGGGGCXn' GGCTTAATCT 
GGCAAAGTTC TTCAGACSSAC TTGGGAGCAA AATGGCTGCT TGAGCTACTC TGTGTCGTTA 
ATTTGTTGTT TGCACATAGG TTCCACTTTG GGCACTGTCT TTTTAAGAGA OCAAGGCACA 
TGCACAGCTT TTAGAAAGCA A 

Seq ID NOt C128 DNA Sequence 

Nucleic Acid Accession U-- NM_002185.1 

Coding sequence: 23.. 1402 



1 

I 

CTCTCTCTCT 
TTTACTTCAA 
AfiAACTGGAT 
TTCACIGACC 
ATOTOGOGCC 
CATCGAGACA 
AAAGAGTCTA 
TGACCTGAGT 
ACACTTGCAA 
GGATGAAAAC 
AAAGCTCCAA 
TAAAGGCTTC 
TA6CTCAGG0 
OGCTCTQTTG 
GCrCAGTCTC 
TTTAAATQTO 
CATTCAAGCT 
AGAATCTGAG 
TGTAGTOGTC 
1GTCAGTGCA 
TGGCAAGAAT 



11 

I 

ATCTCTCTCA 
GTOGXTTCTG 
6ACTACTGAT 
TGT6CTTTTG 
CTOGTGGAGG 
AAOAAATTCT 
ACCTGCAAAA 
OTCATCTATC 
AAGAAGTATG 
AAATGGACGC 
COGGCAGCAA 
TGOAOTGAAT 
GAGATGGATC 
GTCATCTTGG 
CCCOATCATA 
AGTTTCAATC 
AGAGATGAAG 
AAQCAGA6GC 
ACTCCAGAAA 
TQTQA06GCC 
GG6CCTCAT0 



31 
I 

GAATGACAAT 
GA6AAAGTGG 
TCTCAT6CTA 
AGGACCCAGA 
TAAAGTGCCT 
TACTGATTGG 
AAATAGACCr 
GGQAAGGAGC 
TAAAAGTTTT 
ATQTGAATTT 
TGTATGAGAT 
GGAGTCCAAG 
CTATCTTACT 
CCTGTGT6TT 
AQAAGACTCr 
CTQAAAGTTT 
TGGAAGGTTT 
TTGGAGGGGA 
GCTTTOQAAG 
CTATTCTCTC 
TGTACCAGGA 



31 
I 

TCTAGGTACA 
CTATGCrCAA 
TAGCCAGTTG 
TGTCAACACC 
GAATTTGAGQ 
AAAGAGCAAT 
AACCACTATA 
CAATGACTTT 
AATGCATGAT 
ATCCAGCACA 
TAAAGTTCGA 
TTATTACTTC 
AACCATCAGC 
AT06AAAAAA. 
GGAACATCTT 
CXTTGOACTGC 
TCTQCAAGAT 
TSTQCAGAGC 
AGATTGATCC 
CTCTTCCAGG 
CCTOCTGCTT 



41 

I 

ACTTTTGGCA 
AATGGAGACT 
GAAGTGAATO 
ACCAATCTGG 
AAACTACAAO 
ATATGTGTGA 
GTTAAACCTG 
GTGGTGACAT 
GTAGCTTACC 
AAGCTGACAC 
TOCATCCCTG 
AGAACTCCAG 
ATTTTGAGTT 
AGGATTAAGC 
TGTAAQAAAC 
CAGATTCATA 
ACGTTTCCTC 
CGCAACTGGC 
CTCACATGCC 
TCCCTAGACT 
AOCCTTGOGA 



1860 

1930 

1980* 

2040 

2100 

3160 

3220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4660 

4920 

4980 

5040 

5100 

5131 



51 

I 

TGQTTTTTTC 
TGGAAGATGC 
GATOGCAGCA 
AATTT6AAAT 
AOATATATTT 
AGGTTGGAGA 
AGGCTCCTTT 
TTAATACATC 
GCCAGQAAAA 
TCCTGCAGAG 
ATCACTATTT 
AGATCAATAA 
TTTTCTCTGT 
CTATCGTATO 
CAAGAAAAAA 
GGGTGQATOA 
AOCAACTAGA 
CATCTGAOQA 
TGGCTGGGAA 
GC3U;GQAGAa 
CTACAAACAG 



60 

120 

ISO 

240 

300 

360 

430 

460 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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10 

15 

20 

25 

30 

35 
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80 



CAOGCTGCCC CCTCCATTTT CTCTCCAATC TGGAATCCTG ACATTGAACC CAOTTGCTCA 
GGGTCAGCCC ATTCTTACTT CCCTGGOATC AAATCAAGAA GAAGCATATG TCACCATGTC 
CA6CTTCTAC CAAAACCAGT GAAGTGTAAG AAACXX^VGAC TGAACTTACC GTQAGCGACA 
AAGATGATTT AAAAGGGAAO TCTAQAGTTC CTAOTCTCCC TCACAOCACA GAGAAGACAA 
AATTAGC3UVA ACXXX^lCTAC ACAGTCTGCA AGATTCTGAA ACATTGCTTT GACCACTCTT 
CCT6AGTTCA GTGGCACTCA ACATGAOTCA AGAGCATCCT GCTTCTAOCA TGTGGATTTG 
GTCACAAGGT TTAAGQTGAC OCAATGATTC AGCTATTT 

Seq ID NO I CI 2 9 DNA Sequence 

Nucleic Acid Acceesion #: NM_002722.1 

coding eequencet IS.. 302 



1320 
1380 
1440 
1500 
1560 
1620 
1658 



ACTCTGQACT 
60GTGQCTCT 
AOCCAGGGGA 
ACATCAACAT 
TCTCGOAGTO 
AAT6CCACCT 
AOCCTTOGCT 
AAOCC 



11 

I 

C06GATGGCT 
CTTACTACAG 
CAATGCCACA 
GCTGACCAGG 
0QGGTCCCCX3 
TCTGTCTCCT 
CTQGCCAAAO 



21 

I 

GCCGCACGCC 
CCACTGCrOG 
CCA6A6CAGA 
CCTAGGTATG 
CATGCTGCTG 
ACGACTCCAT 
CTTGCTCCCT 



31 
I 

TCTGCcrcrc 

GTGCCCAGGQ 
TGGCCCAGTA 
GGAAAAOACA 
TCCCCAGGGA 
GAGCAG06CC 
GCTCCCACAC 



41 

I 

CCTGCT6CTC 
AGCCCCACTQ 
TGCAGCTGAT 
CAAAQAGGAC 
GCTCAGCCCO 
AGCCCAGCTC 
AGGCTCAATA 



51 
I 

CTGTCCACCT 
GAGCCAGTOT 
CTCOOTAGAT 
ACQCTGGCCT 
CTGGACTTAT 
TCCCCTCTGC 
AAGCAAOTCA 



Seq ID NO: CI 30 DNA Sequence 

Nucleic Acid Accession 9i NM_032S45.1 

coding sequence 1 47.. 718 



AAACTGATCT 
CCATGTCAGG 
CTATCAAAGA 
GCACCGACAG 
CGAGGGCTGG 
CQCGOQGCOG 
GCCG6CCCAC 
OCTGGAGCAC 
CCTGCACTGC 
CCACGCTCAC 
ACTCCTGCAC 
CGTCCTCCAG 
TTCTATGTTG 
TTTTTATTGG 



11 
I 

TCAATGCACT 
CTTCTGTTTA 
QAOAAACATA 
TCACCGCTCA 
GGGCCGGA6G 
CGCTGCTGCA 
TTCACOQGCC 
GGAGCCTGQA 
CTCCCCCTCC 
GGGCCGAGCO 
C6CCTCCTGC 
CGOQAGGGGC 
TAAATAATAG 
6TAATAAATA 



21 
I 

AAGAGAAQGA 
CGGTCAOTTT 
ACGGCGOTAG 
ACTG6ACCTC 
AGCCGCTCCC 
G6AAGGG0GQ 
GCTACIGCGA 
OCCTC06CQC 
AGAOGOCTGA 
COOGOQaCGC 
GCCCX3GATGC 
OOCOCTQCGG 
A ' lVltl TTT A G 
7TTTCATGAA 



31 

1 

GACTCTCAAA 
GGCATTACAG 
AOAOOAAGTC 
CAQTCATTTC 
CTACTCCCGG 
TACCTGCGTG 
GCATOACCAG 
CTGCCAOCTC 
CCGCTGTGAC 
GCCCAGCCTG 
GCCCGCGCAC 
AAGGCOQGGA 
TTTACGGTAA 
AGG6GCAAAA 



41 
I 

CCAAAAATGA 
ATCATGAATT 
ACCAAOGTTG 
GGAGAGGTSA 
GCTTTOGGAG 
CTGGGCAGCT 
AGGCGCAGTG 
TOQUjGTGCA 
CCGAAAGACT 
CTACTCTTGC 
CCTCGGTCCC 
CTTOGGCATC 
QCIGAAOCAC 
AAAAAAAAAA 



51 

1 

CCTGGAGGCA 
TGGGAAACAG 
CCACTCAGAA 
CTGGGAGC6C 
AGGGTGCGTC 
TCTQCGTGTG 
AATGOGGGGC 
TCTTCGGGQC 
TCCTGGCCTC 
TOCCCTGCOC 

GCCTTTAATT 
TGGOTGAATA 
AAAAAAAAAA 



Seq ID NO I C131 DNA Sequence 

Nucleic Acid Accession tt: MM_006533.l 

Coding sequence: 72.. 467 



1 
I 

A6G6AGAGA0 
CACAOTCCAC 
TCrCCGGACC 
06GACCAGGA 
C0GACT6C0Q 
AGG6C06T6G 
CTGCTCGCCT 
GCAAAQTCGA 
CTQGCCCTQC 



11 
I 

GGAG6GGAGG 
GATGGGCOGQ 
TGGTGTCAGG 
GT6CAGCCAC 
ATTOCTGACC 
G0G6CTCTTC 
GGGCTATTTC 
TOTQAAQACA 
OOTTTCCCCT 



21 
I 

AAATT6GA6A 
TCCCTGGTGT 
GGTG6TCCTA 
OCTATCTCCA 
ATTCAOCQGQ 
TGGGGAG6CA 
CCCAGTAQCA 
QACAAATGGG 
CCTTOGQTTT 



31 
I 

CCCCA8CACC 
GCCTTGOTGT 
TGCCCAAGCT 
TGGCTGTGOC 
GCCAAGT6GT 
GG6TTCA0GG 
TTGTCCGAQA 
ATTTCTACTG 
ATOCAAATAC 



41 
I 

CCCTT6CTCA 
CATCATCTTO 
GGCTGACCGG 
CCTTCAOGAC 
GTATGTCTTC 
AGATTACTAT 
GGACCASACC 
CCAGTGAGCT 
AATCA6CCCA 



Seq ZD NOt 0132 DNA Sequence 
Nucleic Acid Accession ft: AB064272 
Coding sequence i 1 . . 708 



1 
I 

ATGACACAAG 
AAAACCACAA 
G66AAAAACA 
ACTQAaAGCA 
ACAAAGACTA 
TCTCATCTAA 
ACATCTAGAA 
TAGCTCAATA 
TCATTCCCTG 
TT0CTT6GCC 
AACAOCCAGT 
ATG6A6CAGC 



11 
I 

TCACAGAAAA 
GAACCCCAGA 
CACCAGTCCC 
TAAAAOGCCC 
TAAAACCTTC 
ATAAAACTGA 
COAAGCTGAG 
AAGATGGCTC 
CATQGOCCAT 
TQATCTTCTT 
ACAAT GATGC 
AGAATCTTGO 



21 
1 

GTCCACAGAA 
AAAGCCTAOQ 
AGAAAAGCCT 
AOTAAAOTCC 
AGTCSVAGGTC 
AGTTACTCAT 
TTCTATCACA 
ACAOAAAGGT 
AGTTATT8TG 
GGTCTCCTAT 
AGAGOATGAG 
CATGGGCCAS 



31 
1 

CACCCAGAAA 
CTATACTCAG 
ACA6AAAACC 
ACAGAAAACC 
ACAGGAOACA 
CAGGTGCCCA 
TCAGAAGCCA 
ATCCAGGCTG 
GTCCTGGTQQ 
ATGATOOGGA 
GGIGQCCCCA 

ATCccrrccc 



41 

I 

AGACCAGQTC 
AQAAGACCAT 
TGGGGAACAC 
CAGAAAAAAC 
AATCTCTCAC 

CAGGAAACQA 
GACAGATGGG 
CrOTGATTCT 
CACGCO SCAC 
ATTCCTACCC 
CACGGTGA 



60 

120 

ISO 

240 

300 

360 

420 

425 



51 
I 

CrCTCTTGCr 
CTGTCTGCCT 
AAGCTGTGTG 
TACATGGCCC 
TCCAAGCTGA 
GGAGATCTGG 
CTQAAACCTG 
CAGCCTACCG 
GTGCAAAC 



Seq ZD NOt C133 DNA Sequence 
Nucleic Acid Accession ft: NM_080B70. 
Coding sequence t 3 . . 710 ^ 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

846 



60 

120 

180 

240 

300 

360 

420 

480 

538 



51 
I • 

AACCACAGAG 
ATGCACCAAA 
CACACTGACC 
AOCAGCAGTC 
TACTACCTCT 
CACCCTCATT 
QAGCCATCCA 
AGAGAATGAT 
GCTCCTGGZO 
ACTAAC CCRG 
GGTCTACCXG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

708 
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1 
I 

AGATGACAOV 
AGAAAACCAC 
AAGGGAAAAA 
CCACTGAGAC 
TCACAAAGAC 
CTTCTGATCT 
TTACATCTAG 
CATACCTCAA 
ATTCATTCCC 
TGTTCCTTGG 
AGAACACCCA 
TGATGGAGCA 
G6C6CCCAOC 
AATTTCTATT 
CTTCATCTGT 
AGGGGACAAA 
CTQAGAATGA 
TQAOTCCTAA 
GGTTTATGGG 
TTATTTCCAT 
AOGAOOCTTG 
AAGAAGAAAA 
CCAGAA6AAA 
TTAOTTTTCC 
GOACAOGCAA 
GACC TTOO CT 
GTAGCTTTAT 
CCTTCACCCC 
ATTTTCTCTC 
' AGGAGAATGA 
AT6TT 



11 
I 

AGTCAOUSAA 
AAGAACGCCA 
CACACCAGTC 
CATAAAAGCC 
TATAAAACCT 
AAATAAAACT 
AA06AAOCTG 
TAAAGATGGC 
TGCATOGQCC 
CCTGATCTTC 
GTACAATGAT 
GCASAATCTT 
CCTGGCTCTT 
TCCGGGACTA 
TCTTQAAACT 
GAAOAAAGAA 
AAAG G TGTTT 
AGOAAGGACA 
GAAAGGGAGG 
TCACTATTAC 
CCCCCACCCC 
QAGAATTTAT 
TCATAAATAT 
TAAATQTCTA 
GGAGGGGATT 
QATTOQTGTC 
CTCOTAAAAO 
AGATTCAAGT 
TCTCTTCTTA 
TGATGATAGT 



21 

I 

AAGTOCftCAG 
QAAAAGCCTA 
CCAGAAAAGC 
CCAGTAAAGT 
TCAGTCAAGG 
GAAOTTACTC 
AGTTCTATCA 
TCACAGAAAG 
ATAOTTATTG 
TTGGTCTCCr 
OCAGAOQATG 
GGCAT06GCC 
CCATGCTCTG 
CAGGAAGGGC 
GGTTGGGGAA 
TGAATAATAC 
GATGGACATG 
GGAGCCTTAT 
GACTGAGG6C 
TTAAaAGTTT 
GATTTTTTTA 
TTCCTTCTCC 
CTCTCATCTA 
CAGTGGACGC 
TTTATTTGGC 
TCAOCATTTA 
TTAGCCATCT 
TTTCCTCCTT 
TGGCCATTTC 
TTCCTTCTGT 



31 

I 

AACACGCA6A 
CXXn-ATACTC 
CTACAGAAAA 
CCACAGAAAA 
TCACAGGAQA 
ATCAGCT60C 
CATCA6AAGC 
GTA-rcCAOGC 
TGOTCCTGOT 
ATATGATGCX3 
A0GGTG6C0C 
AGATCCCTTC 
CCCCTTTCCT 
AGAGAATACT 
T6AGGTGATA 
OA6C3U3ACAT 
TTGTGGGGGC 
AG6CAATQCC 
AGAGTCTCTG 
GTGTGTAAAC 
ATGAAAAAAA 
ACTCTCTCCA 
CATGGTTGCT 
CCTGTTGGTT 
CAGCAGTCTC 

rrrrccj G 'i'C 

TCTCTACTOT 
GTAGGCATTT 
ACCTTATTAC 
CTATTGACCT 



41 

I 

AAAGACCAGQ 
AGAaAAOACC 
CCTGGQQAAC 
CCCAOAAAAA 
CAAATCTCTC 
CACTGOTTCT 
CACA6GAAAC 
TGGACAGATG 
GGCTGTGATT 
GA CAOGCCG C 
GAATTCCTAC 
CCCAOGGTGA 
GGATGAGGAA 
GAOGGTTACC 
AGCAAG6AGG 
TCTCTQTAQA 
ACCAAT6CAG 
CCAQACIGAC 
GGTTTCAGGA 
AG6CTCATCT 



TGCCCTGGAG 
TCCTCTTCCT 
TGGCTTGCTQ 
ACCCACTQAT 
TCTTCCACCA 
CCCCATTCTC 
CATCTQTGTG 
TGATTGGGTA 
TTTTTATAAT 



51 

I 

TCAAOCACAG 
ATATQCACCA 
ACCACACTGA 
ACAGCAGCAQ 
ACTACTACCT 
TTCACCCTCA 
GAOAGCCATC 
GGAGAGAATG 
CTCCTCCTGO 
ACACTAACCC 
COGGTCTACC 
TCTTGGAGTA 
CC3GGACTCAC 
AGTATTAACC 
GTXrrAAGTTT 
AG6TAATG6T 
AACACTGCAC 
TT0TGAGTG3 
CAGCATTATG 
CTGAGTTCTC 
AAACGQATCC 
AAAAAAAAGT 
CCCAAATCCC 
GGTTGTGGGT 
CTCCACCCCA 
AAAGCCAGCT 
TCTCCTCCCA 
TGTTTTCTGG 
GAGGGGGAAA 
AAAGTATAAC 



Seq ID MO: C134 DMA Sequence 

Nucleic Acid Accession #i FGENBSH predicted 

Coding sequence: i.. 10674 



ATGTGGCCTC 
TTTCAGCAGA 
GGGGCCCCCG 
GTGGA6CGGC 
CGCCTGGAGC 
AOCQAGCTCA 
CGCGTGGCCA 
TCCACCOGCC 
TCCTACOGAG 
CTTCATGCTA 
GGGGGAGACC 
TTTGGCATAT 
QAQCACTGTT 
CATGAAGATC 
TGTGATGAAG 
GGCCATTTTG 
ACAGCTTGCC 
ATTCCATGTC 
GTCTGCAOAa 
CTGAAOCCTC 
GCCTGTGGGO 
CTACCCAATO 
CTCOGCCAGC 
ACATQTTTGG 
CAAGGAAACA 
TTTCAGATGC 
TTTGG6ACGA 
ATGCTGAGAT 
GACOTGQAGG 
CAAGATTCTG 
QTGTCAGTCC 
GCTATCX5TAT 
AAGGTTATTQ 
GTCTG6GAGA 
GCTGAATTQO 
ATAGTACAGT 
GTCATAAAAG 
ACrCCAGATA 
GAAGOGTCTA 
ACCACTQAAT 
GAGATGTTCT 
aCATTTQAGA 



11 
I 

GCCTGGCCTT 
TGTCCCCGTC 
GGAGTATCCC 
TGGQCCAGGC 
TTGTCTTCCT 
TGTTCGTCCG 
TCX5TGACCTT 
GOGGGCGCO^ 
GTGGCGGCAC 
GAGAAAACTC 
CTAGACCAAT 
GGCAAGGGAA 
ACCTQCTACA 
TACCTTCTGO 
6CAAGGACTQ 
A6TGCATCTG 
CATCGGGGAC 
CTGATGAAAA 
AGGGATACAO 
CCGAAAATGG 
TCXX3ATGTCA 
GTTTGTGaTC 
OQAAACATGG 
TTQCCTGTGA 
GCCAGTGGGA 
CCAAAGATGT 
TCTGCTATGT 
GTACCACTTC 
CrCCTCAAAT 
CCAATGTTAC 
ACQTTCATCC 
ACACGGCAAC 
ATGCAGAACC 
AGGTACATGC 
TCATTACCAG 
ATACAGCXWC 
GTTCTCOCTG 
ATACTGGAGT 
CTGACS^AGTA 
GGCCAOACTO 
ACAAAiGCAGC 
OQACCCTGGO 



21 
I 

GCGCAATTTC 
CGCGCCX5CCC 
QTTCCGGOGA 
GGTGGATGAT 
CAAGCTGCTQ 
CTOGTCCAAG 
GCACAAGTGC 
CTACACCAAG 
AACAAAAGTT 
T8CAGCGTCA 
CATTCGAGAG 
CAOTTTTGAA 
QAGTTTTATT 
CTOTGACCGA 
TQAAAAGGGG 
ATACAAACCT 
TCACACCTCT 
GGCATCTGGC 
TTACTTTATC 
CCCTGQATTT 
OQQTTCAGAO 
CCACATCAQC 
TQAAGGGTAC 
TGGGCCAOAA 
CATCATATCC 
AAGTTGCOGC 
TGGAAAAT6G 
CAACTQTCCT 
CTGGCAGATT 
AGCTTTCACC 
TQACCTATCX: 
ACCTGTCATA 
CXSCAAGCTGG 
AAGTCATACA 
TGACCCCTCA 
TQAAATTCCA 
CAACTOTACA 
TTATTGTGCT 
TGCCAAAAAA 
TOOTTGTGAT 
AAAAATGOTC 



31 

I 

GGTCTGGCX3C 
AGCTTCCQCC 
GCTCCTGGCG 
CGG0T60GGC 
TCGTCCAQCG 
TCOGACTTCC 
AACTA06TG0 
GCGCTGCTCC 
GGOaCCTTCC 
GTATTTCTCA 
CTOOQAOATT 
CTGAATGACA 
QAATTTGAGG 
CAAQATQATA 
ATGGGAAGCT 
TATTAOQGGA 
6AAGGCTCAC 
CCACCTGGAA 
CAGACCTGTG 
CAAAACACrr 
GATCTTGTGG 
AGCTACTGCA 
TGTTCTACAA 
AGACTAGAAO 
CCCOGGTGTS 
CCCCACAACT 
CAAOGOITCA 
AATGTOGGAG 
AAGGACATA6 
CCAACAGCTA 
CCACCTTACX: 
GOCAACCAGO 
GACTGGTGCA 
GATQAGCCTC 
CAAGGAGACC 
GGCAATAACA 
TTCACACCTG 
TTAACTTGCT 
TATGAAGATG 
OGmTGCAA 
GACACAOATC 
CCATCATTTT 



41 

I 

TOGTTTCGGG 
TCTTCCCCGA 
AC36AAGOGGC 
TGCTGOGGGA 
TGGGCQAAGT 
COGTGGTGCC 
TGCOGCGCGT 
TCCAAQAQAT 
AGCAAOCCSC 
TCACT6ATGG 
CAGGAGTGGA 
TGGCTTOCAC 
CTTTAGCTOQ 
TGQTCGACTQ 
GCAAATGTOS 
AAGGTCTGCA 
CAGGAGGAAT 
GCACATCCCC 
AACTTGTCCA 
GCAACAACCA 
GAAGCAGCAT 
GAGTAAGAAC 
GGGAAATGTT 
GCAGT6ATAA 
TQGAOOQCCA 
GTGGCAAGCA 
TTTTATCTGG 
TTCAGOCAGC 
AGGCTAAGAC 
AA6ACAACTC 
TTTTCCCAAT 
CCAGCTQCAT 
GATCTCCACC 
A6TTCTCAGA 
TTTTOCCTCA 
GGACATGTGA 
TAAATGGGGA 
TGGAG60CTA 
GOGTCTQGAA 
ACCAOQGGTT 
TQAT6AAGAA 
GTAOTGATOC 



51 
I 

CTOGGCXSACC 
GACCGCGCCC 
GGGGAGCAQA 
GCTCAGOSAG 
CAACTTO06C 
CACGGCCACG 
OGATTACATC 
CCCTGCCATC 
GCAAATTCTT 
ATATTCCAAT 
GATCTTCACT 
CXXZAAAOGAG 
C06GGCATTG 
CTCATATCTT 
6ACACACACA 
GTATGAATGC 
CaGCAGTTOC 
TGAAQACTGT 
CTGCCCTGCC 
CTTCAAT6CA 
CATCTTATGT 
ATGTCCTCAT 
ATATAAGACA 
GCTTACnGT 
CTOTTCCAOC 
GCCAGCCAAA 
AGTCAAAOAA 
TQTGTGTAAA 
TCTGGAACAG 
TGGTOAAAAO 
TGGAGATGTT 
TTTCCATATC 
TCCOGTCCAO 
CAACTCAGGG 
AGGGGAGACT 
TATCCATATT 
TTTTATATQC 
TGATTTCACA 
ACCAACATAT 
CAAGTCCTTT 
GTTTTCT6AA 
AGAGGACATT 



60 

120 

180 

240 

300 

360 

420 

480 

S40 

600 

660 

720 

780 

840 

900 

960 

1020 

lOBO 

1140 

1200 

1260 

1320 

1380 

1440 

ISOO 

1560 

1620 

1680 

1740 

1800 

1805 



60 

120 

180 

240 

300 

360 

420 

480 

S40 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 



1284 



wo 03/042661 



PCTAJS02/36810 



GACTGCAQAC TGGAGGAGAA CCTGACCAAA AAATATTGCC TAGAATATAA TTATOACTAT 2580 

GAAAATGGCT TTGCAATTGG ACCAGGTGGC TGGQGTGCAG CTAATAGGCT GGATTACTCT 2640 

TACGATGACT TCCTG6ACAC T6TGCAAGAA ACAGCCACAA GCATCGGCAA TGCCAAGTCC 2700 

. TCACGOATTA AAAOAAOTQC CCCATTATCT GACTATAAAA TTAAGTTAAT TTTTAACATC 2760 

5 ACAGCTAGT6 TGCCATTACX: C6AT6AAAGA AATQATACCC TT6AATGQGA AAATCAGCAA 2820 

CGACTCCTTC AQACATTGGA AACTATCACA AATAAACT6A AAAGQACTCT CAACAAAOAC 2680 

CCCATOTATT CCTTTCAGCT TGCATCAGAA ATACTTATAG CCGACAGCAA TTCATTAGAA 2940 

ACAAAAAAGG CTTCCCCCTT CTGCAGACCA GGCTCAGTGC TGAGAGGGOG TATGTGTGTC 3000 

AATTGCCCTT TGGGAACCTA TTATAATCTQ GAACATTTCA CCTGTGAAAG CT6CC3GGATC 3060 

10 GGATCCTATC AAGAT6AAGA AGGGCAACTT GAGT6CAA6C TTTGCCCCTC TGGGAT6TAC 3120 

A06GAATATA TCCATTCAAG AAACATCTCT GATTGTAAAG CTCAGTOTAA ACAA6GCACC 3180 

TACTCATACA GTGGACTTGA GACTTGTGAA TOSTGTCCAC TGGGCACTTA TCAGCCAAAA 3240 

TTTGGTTCCC GGAGCTGCCT CTCGTQTCCA QAAAACACCT CAACTGTQAA AAGAGGAGCC 3300 

GT6AACATTT CTGCATGTGG AGTTCCTTGT CCAGAAGGAA AATTCTOGCG TTCTGGGTTA 3360 

15 ATGCCCPGTC ACCCATGTCC TC3GTGACTAT TACCAACCTA ATGCAGGGAA GGCCTTCTGC 3420 

CTGGCCTGTC CCTTTTATGG AACTACCCCA TTCGCTCK3TT CCAGATCCAT CACAGAATQT 3480 

TCAACTTCAG TTCTGAATAT TACTATTTTC GGTGGATTTG GGCATCTGGA GTTGTTAAAT 3540 

TQTCCTTCTG AaGTTTTCCA TQAATGCTTC TTTAACCCTT GCCACAATAG TGGAACXTTGC 3600 

CAGCAACTTG GGCGTGGTTA TGTTTGTCTC TGTCCACTTG GATATACAGG CTTAAAGTGT 3660 

20 GAAACAGACA TOGATGAGTG CAGCCCACTG CCTTQCCTCA ACAATGGAOT TTGTAAAGAC 3720 

CTAGTTGGGG AATTCATTTG TGAGTGCCCA TCAGGTTACA CAGGTCAGCG GTGTGAAGAA 37 BO 

AATATAAATG AGTGTAGCTC CAGTCCTTGT TTAAATAAAG GAATCTGTGT TGATGGT6T6 3B40 

GCTGOCTATC OTTGCACATG TGTQAAAOaA TTTGTAGGCX: TGCATTGTGA AACAGAA6TC 3900 

AATGAATGCC AGTCAAACCC ATGCTTAAAT AATGCAGTCT GTGAAGACCA GGTTGGGGGA 3960 

25 TTCTTGTGCA AATGCCCACC TOQATTTTTG GGTACCCQAT QTGGAAAGAA CGTCGATGAQ 4020 

TGTCTCAGTC AGCCATQCAA AAATGGAGCT ACCTGTAAAG AOGGTGCCAA TAGCTTCAQA 4080 

TGCCTGTGTG CAGCTGGCTT CACAGGATCA CACTGTQAAT TGAACATCAA TGAATGTCAO 4140 

TCTAA'TCCAT GTA6AAATCA GGCCACCTGT GTGGATGAAT TAAATTCATA CAGTTGTAAA 4200 

TGTCAGCCAQ GATTTTCAGG CAAAAGQTQT OAAACAGAAC AGTCTACAGG CTTTAACCTG 4260 

30 GATTTTGAAG TTTCTGGCAT CTATGGATAT GTCATGCTAG ATGGCATGCT CCCATCTCTC 4320 

CATGCTCTAA CXDTGTACCTT CTGGATGAAA TCCTCTGACG ACATGAACTA TGGAAGAGC3V 4380 

ATCTCCTATG CAQTTOATAA CGQCAGCGAC AATACCTTGC TCCTGACTGA TTATAACGGC 4440 

TGGQTTCTTT ATGTGAATGG CAGGQAAAAG ATAACAAACT GTCCCTCGGT GAATGATGGC 4500 

AGATGGCATC ATATTGCAAT CACTTGGACA AGTGCCAATQ GCATCTGGAA AGTCTATATC 4560 

35 GATGGGAAAT TATCTGACGG TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGGT 4620 

GGTGOQTTAG TTGTG6GGCA AGAGCAAGAC AAAAAAGGAG AGGGATTCAO CCCAGCTGAQ 4680 

TCTTTTGTGO GCTCCATAAG CCAGCTCAAC CTCTGGGACT ATGTCCTGTC TCCACAGCAG 4740 

6TGAAGTCAC TGGCTACCTC CTGCCCAGAG GAACTCAGTA AAGQAAACGT GTTAQCATGG 4800 

. _ CCTQATTTCT TGTGAGOAAT TGTGGGGAAA GTGAAGATCG ATTCTAAGAQ CATATTTTGT 4860 

40 TCTGATTGCC CA06CTTAGG AGGGTCAGTG CXTTCATCTOA GAACTGCaTC TGAA6ATTTA 4920 

AftOCCAOGTT OCAAAOTCAA TCT0TTCT6T GATCCA06CT TGCAOCT<K}T OQGQAACCCT 4980 

GTGCAGTACT GTCTGAATCA AGGACAGTG6 ACACAACCAC TTCCTCACTQ TGAAOGCATT S040 

AGCTGTGGGG TGCCACCTCC TTTGGAGAAT GGCTTCCATT CAGCOQATGA CTTCTATGCT 5100 

GGCAGCACAG TAACCTACCA GTGCAACAAT G6CTACTATC TATTGGGTGA CTCAAGGATG 5160 

45 TTCTGTACAO ATAAT6GGAG CTGGAACGGC 6TTTCACCAT CCTGCCTTGA TGTCGATGAG 5220 

TGTGCAaTTO GATGAGATTG TAOTQASCAT GCTTCTTOCC TGAAC9GTAGA TGGATCXrTAC 5280 

ATATGITCAT GTGTCCCACC GTACACAGGA OATGGQAAAA ACTGT6CA6A ACCTATAAAA 5340 

TGTAAGGCTC CAG6AAATCC GGAAAATGGC CACTCCTCAG GrGAGATTTA TACAOTAGGT 5400 

GCCGGAGTCA CATTTTOGTG TCAGGAAGGA TACCAGTTGA TGGGAOTAAC CAAAATCACA 5460 

50 TGTTTGGAGT CTGGAQAATO GAATCATCTA ATACCATATT GTAAAGCTOT TTCATQTGGT 5520 

AAACOGGGTA TTOCAGAAAA TGGTTOCATT GAGGAGTTAO CATTTACTTT 1GGCAGCAAA 5580 

GT6ACATATA GGTGTAATAA AGGATATACT CT G GC OG GT Q ATAAAGAATC ATGCTGTCIT 5640 
GCTAACAGTT CTTGQAGTCA TTCCCCTCCT GTGTGTGAAC CAGTGAAGTG TTCTAGTCCX3 5700 
GAAAATATAA ATAATGGAAA ATATATTTTQ AGTGGGCTTA CCTACCTTTC TACTGCATCA 5760 
55 TATTCATGOG ATACAGGATA CAGCTTACAG GGCCCTTCXIA TTATTGAATG CA06GCTTCT 5820 
G6CATCTGGG ACAGAG06CC AOrTGCCTGT CACCTC36TCT TCTGTGGAGA ACXAOCIGCC 5880 
ATCAAA6ATG CTGTCATTAC GGGGAATAAC TTCAC7TTCA GGAACAOOGT CACTTACACT 5940 
TGCAAAGAAG GCTATACTCT TGCTGGTCTT GACACCATTG AATGCXrTGGC OGACX36CAAG 6000 
TGGAGTAGAA GTGAOCAGCA GTGCCTGGCT GTCrCCTGTG ATGA3CX3VCC CATTGTGGAC 6060 
60 CACGCCTCTC CAQAGACTGC CCATCG6CTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 6120 
GATGOTTACA GCCTAGCAGA CAATTCCCAG CTTCTGTGCA ATGCCX»GG6 CAAOIGGQTA 6180 
COCC CAQAAfl GTCAAGACAT G0 CC0 6 TT G T ATA8CTCATT TCTGT6AAAA ACCTCCATOG 6240 
GTTTCCTATA GCATCTTGGA ATCTGTGAGC AAA6€!AAAAT TTGCAGCTGO CTCAGTT6TG 6300 
^ AGCTTTAAAT GCATGOAAGG CTTTGTACTG AACACCTCAG CAAAGATTGA ATGTATGA6A 6360 
65 GGTGGGCAGT GGAACCCTTC CCCCATGTCX: ATCCAGTGCA TCCCTGTGOQ GTGTGGAGAG 6420 
CCA0CAA6CA TCATQAATQQ CXATGCAAST GQATGAAACT ACA0TTTTGO AGCCATQGTQ 6480 
GCTTACA6GT 6CAACAAGGO GTTCTACATC AAA6GGGAAA A6AAGAGCAC CIGCGAAOCC 6540 
ACAGGGCAGT GGAOTAGTCC TATAC06A0G TGCCACCCX3G TATCTTOTGG TOAACCACCT 6600 
^ AAGOTTGAQA ATGGCTTTCT GGAGC3VTACA ACTGGCAGGA TCTTTGAGAG TGAA6TGAG0 6660 
70 TATCAGTGTA ACXX3GGGCTA TAAGTCAGTC GGAAGTCCTG TATTTGTCrG CCAAGCCAAT 6720 
OSCCACTOGC ACS^GTQAATC CCXTTCTGATG TO TQTT CCTC TOOACTGTGG AAAACC TOCC 67B0 
COGATCCAQA ATOGCTTCAT 6AAAGGAGAA AACTTTGAAQ TAGGGTCOUl GOnCAGTTT 6840 
TTCTGTAATG AGGGTTATGA GCTTGTTGGT GACAGTTCTT GGACATGTCA GAAATCTGGC 6900 
^ AAATGGAATA AGAAGTCAAA TCCAAAGTGC ATGCCTGCCA AGTGCCCAGA GCOGCCXZCTC 6960 
75 TTG6AAAACC AGCTAGTATT AAAGGAGTTO ACCACOGAGG TAGGAGTTGT QACATTTTCC 7020 
TGTAAA6AA0 G6CAT0TCCT GCAAGGCGCC TCTGTCCTQA AATQCTTGCC ATCCCAGC3A 7080 
TGGAATGACT CTTTOOC T GT TTGTAAGATT OTTCTrrGTA CCCCAOCTCC CCTAATTTCC 7140 
TTTGOTGTCC CCATTCCTTC TTCTGCTCTT CATTTTGGAA GTACTGTCAA GTATTCTTOT 7200 
GTAGGTGGQT TTTTCCTAAG AGGAAATTCT ACCACCCTCT GCCAACCTGA TGGCACCTGG 7260 
oO AGCTCTCCAC TGCCAGAATG TOTTCCAGTA QAATGTCC CC AACCTGAGGA AATCCCCAAT 7320 
G6AATCATTG ATGTGCAAGG CCTTGCCTAT CTCAGCACAG CTCTCTATAC CTGCAAGCCA 7380 
GGCTTTGAAT TGGTQGGAAA TACTACCACC CTTTQTGGAG AAAATGGTCA GTGGCTTGGA 7440 
GGAAAAOCAA CATGTAAAGC CATT6AGTGC CTGAAACOCA AGGAGATTTT GAATGG CAAA 7500 
TTCTCTTACA 06GACCTACA CTATGGACAO AC06TTA0CT ACTCTTGCAA COQAGGCTTT 7560 

1285 
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15 

20 
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30 
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40 
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65 



CGGCTCGAAO GTCCCAGTGC CTTGACCTGT TTAGAGACAQ GTGATTGGGA TGTAGATGCC 
CCATCTTGCA ATGCCATCCA CTGTGATTCC CCACAACCCA TTQAAAATGG TTTTCTAGAA 
GGTGCAGATT ACAGCTATQQ TOCCATAATC ATCTACAGTT GCTTOCCTGG GTTTCAGGTG 
GCTGGTCATG CCATGCAGAC CTGTGAAGAa TCAGGATGGT CAAGTTCCAT CCCAACATOT 
ATGCCAATAO ACT6TGGCCT C CCTCCTCAT ATAGATTTTG GAGACTGTAC TAAACTCAAA 
GATGACCAGG GATATTTTGA GCAAGAAGAC GACATGATGG AAGTTCCATA TGTGACTCCT 

CACCCTCCrr atcatttggo agcaotggct aaaacctggg aaaatacaaa gqagtctcct 

GCTACACATT CATCAAACTT TCTGTATGGT ACCATGGTTT CATACACCTG TAATCCAGGA 
TATQAACTTC TGGGQAACCC TGTGCTGATC TGCCAGGAAG ATGGAACTTO GAATGQCAOT 
GCACCATCCT GCATTTCAAT TGAATGTGAC TTGCCTACTG CTCCTGAAAA TGGCTTTTTG 
OGTTTTACAG AGACTAGCAT GGGAAGTGCT GTGCAGTATA GCTGTAAACC TGGACACATT 
CTAGC3W3QCT CTGACTTAAQ QCTTTOTCTA QA6AATAGAA AGTGGAGTGG T6CCTCCCCA 
CGCXGTGAAG CCATTTCATG CAAAAAGCCA AATCCAGTCA TGAATGGATC CATCAAAGGA 
AGCAACTACa CATACCTOAG CACGTTGTAC TAT6AGTGTG ACCCOGQATA TGTGCTGAAT 
GGCACTGA6A GQAGAACATG CCAGGATGAC AAAAACTGGG ATGAGGATGA GCCCATTTGC 
ATTCCTGT6G ACTGCAGTTC ACGCCCAGTC TCAGCCAATG GCCAG6TGA6 AGGAGACGAO 
TACACATTCC AAAAAGAOAT TGAATACACT T6CAATGAAG GGTTCTTGCT T6AGGGAGCC 
AGGAGTCGGG TTTGTCTTGC CAATGGAAGT TGGAGTGGAG CCACTCCCGA CTGTGTGCCT 
GTCAGATGTG CCACCCCGCC ACAACTGGCC AATGGGGTGA OGGAAGGCCT GGACTATGGC 
TTCATGAAGG AAGTAACATT CC3VCTGTCAC GAGGGCTACA TCTTGC3V0GG TGCTCCAAAA 
CTCACCTGTC AQTCAGATGO CAACTGOGAT GCAOAGATTC CTCTCTGTAA ACGAGTCAAC 
TOTGGACCTC CT6AAGATCT TGCCCATGGT TTOCCTAATQ GTTTTTCCTT TATTCATGGG 
GGCCATATAC AGTATCAGTG CTTTCCTGGT TATAAGCTCC ATGGAAATTC ATCAAGAAGG 
TGCCTCTCCA ATGGCTCCTG GAGTGGCAGC TCACCTTCCT GCCTGCCTTG CAGATGTTCC 
AO^CCAGTAA TTGAATATGG AACTGTCAAT GGGACAGATT TTGACTGTGG AAAGGCAGCC 
GOQATTCAGT GCTTCAAAGG CTTCAAGCTC CTAOGACTTT CTGAAATCAC CTGTGAA0OC 
GATGGCCAGT GGAGCTCTGG GTTCCCCCAC TGTGAACACA CTTCTTGTGG TTCTCTTCCA 
AT3ATACCAA ATOOQTTCAT CAGTGA6ACC AGCTCTTGGA AGGAAAATGT GATAACTTAC 
AGCTGCAGGT CTGGATATGT CATACAAGGC AGTTCAQATC T6ATTTGTAC AGAGAAAGGG 
GTATG6A6CC A6CCTTATCC AGTCTGTGAG CCCTTGTCGT OTGGOTCGCC ACGGTCT6TC 
QCCAATGCAG TGGGAACTGG AGAGGCACAC ACCTATQAAA GTGAAGTGAA ACTCAOATOT 
CTGGAAGGTT ATAGQAT6GA TACAGATACA 6ATACATTCA CCTOTGAOAA AQATGGTOGC 
TGGTTCCCTO AGAGAATCTC CTGCAGTCCT AAAAAATGTC CTCTCCCGGA AAACATAACA 
CATATACTT6 TACATGGGGA CGATTTCAGT 6TGAATAGGC AAGTTTCTGT GTCATGTGCA 
6AAGGGTATA CCTTTGAGGG AGTTAACATA TCAGTATGTC AQCTTGATGQ AACCTGGGAG 
CCACCATTCT CCGATOAATC TTGCAOTOCA G TT T CTTGTQ G6AAACCTGA AAOTOChOAA 
CAT6GATTT0 TGCnTGGCAG TAAATACACC TTTGAAAGCA CAATTATTTA TCAOTGTQAG 
CCTGGCTATG AACTAGAGGG GAACAGGGAA GGTGTCTGCC AGGAGAACAG ACAGTGGAGT 
G6AGGGGTGG CAATATGCAA AGAGACCAGG TGTGAAACTC CACTTGAATT TCTCAATGGG 
AAAGCTGACA TTGAAAACAG 6ACGACTGGA GCCAACGTGG TATATTCCTG CAACAGAGGC 
TACAGTCTTG AA6G6CCATC TGAG6CACAC TGCACAGAAA ATGOAAOCTG 6AGCCACCX31 
GTCCCTCTCT GCAAACCAAA TCCATGCGCT GTTCCTTTTG TQATTCCQGA GAATGCTCTG 
CTGTCTGAAA AGGAGTTTTA TGTTGATCAG AATGTGTCCA TCAAATGTAG GGAAGGTTTT 
CT6CTGCAGG GCCACGGCAT CATTACCTGC AACCCCGACG AGAOGTQQAC ACAQACAAGC 
GCCAAATQTG AAAAAATCTC ATGTGGTCCA CCAGCTCAOO TAGAAAATGC AATTGCTCX3A 
GG06TACATT ATCAATATGG AGACAT6ATC ACXH'ACTCAT GTTACAGXGG ATACAT6TTG 
GAOGGTTTCC TGAGGAGTGT TTGTTTA6AA AAT6GAACAT GGACATCACC TCCTATTTGC 
AGAGCTGTCT GTOGATTTCC ATGTCAGAAT GGGGGCATCT GCCAAOGCCX: AAATGCTTGT 
TCCTGTCCAG AGGGCTGGAT GGGGCSGCCTC TGTGAAQAAC CAATCTGCSIT TCTTCCCTGT 
CTGAACiGGAG GTCX3CTGTGT GGCCCCTTAC CAGTGTGACT GCCCGCCTGG CTGGACGGGG 
TCTGGCT3TC ATACAGCTGT TT6CXA0TCT CCCTGCTTAA ATGQTQGAAA ATOTOTAAaA 
CCAAACOGAT 6TCACTGTCT TTCTTCTTGG AC6G6ACATA ACTGTTOCAG OTAA 

Seq ID NO I CI 3 5 DNA Sequence 

Nucleic Acid Acceasiozi #i FGOlESa predicted 

Coding sequence i 1 . . 3 90 



ATQAGGTTCA 
TATGT6TCA0 
GAACCATGGC 
CAGTGCTGTT 
TGCACCTTCT 
TTTGTTOTGA 
AGTAAATGTG 



11 

I 

GTGTCTCAGG 
TCTOTCICCT 
TGTGCC3VaCC 
ACAATGACX3C 
GGCCCTGCTT 
AGCIQAAGGT 
AAAGA66C06 



21 
I 

CAT6AGGACC 
OC TC TTGTOT 

GGCAcccs^aa 

CATOGTGTCC 
TGAGCrCTGC 
TCAGGOTGTO 
GATATGTTAG 



31 

I 

GACTACCCCA 
CCAAGGGAAG 
TaTGQAGACA 
CTGAGCGAGA 
TGTCTTGATT 
AATTCCCAGT 



41 

I 

GGAOTQTQCT 
TCATOQCTCC 
AQATCTACAA 

CXXX3CCAATG 
CCTTTGGOCT 
GCCACTCATC 



51 
I 

GGCTCCTGCT 
C3QCTGGCTCA 
CCCXZTTGGAG 
TGGTCCCCCC 
CACAAAOGAT 
TCCCATCTCC 



7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 
9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 • 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10674 



60 

120 

180 

240 
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360 

390 



Seq ID NO: C136 DNA Sequence 
70 Nucleic Acid Accession BC035671.1 
Coding sequence: 126.. 1745 

1 11 21 31 41 51 

\ I I II I 

/D G6CAGCGACT GGGCCCCGTC CC6GCGC0GC GCTCGTCCGC AGAG6A6GCG GCCCGGCCOQ 60 

GGCAGCTGOG GCTCGGGATC CGTCQAGGGG AGGCCOAGCT TGOCAAGCTG GG6CCCA6C0 120 

GGGTCATGGT GCGOGGCGGC OGCGGOGGOG GOGCACTOGC GOGOOCTGCC GGGCGG6G0C 180 

TCCTGGCTTT GCTGCTCGCG GTCTCOGCCC OGCTCOGGCT GCAGGOGGAG GAGCTGGGTG 240 

ATGQCTGTGG ACACCTA6TG ACTTATCAGG ATAGTGGCAC AATGACATCT AAGAATTATC 300 

oO CCGGGACCTA CCCCAATCAC ACTOTTTGOG AAAAGACAAT TACAGTACCA AAGGGGAAAA 360 

OACTGATTCT GAOGTraaaA OATTTGQATA T08AATCCCA QACCTOTGCT TCTGACTATC 420 

TTCTCrrCAC CAGCTCTTCA GATCAATATG GTGCATACTO TGGAAGTATG ACT6TTCGCA 480 

AAOAACTCn; GTTQAACACA AGTGAAOTAA CGGTCGGCTT TGAQAGTGQA TCCCACATTT 540 

CTGGCCGGGG rm TT G CT Q ACCTATGG6A GCAGOGAOCA TCCAGATTTA ATAACATGTT 600 
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TG6AACGAGC TAGCCATTAT TTC3AAGACAG AATACAGCAA ATTCTGCO CA GCTG6TTQTA 660 

GA6A0GTAGC AGGAGACATT TCTGQGAATA TGGTAGATGO ATATAGAGAT ACCTCTTTAT 720 

TGTOCAAAGC TGCCATCXy^T GCAGGAATAA TTGCTGATGA ACTAGOTGGC CAGATCAQTG 780 

TGCTTCAGCQ CAAAGGGATC AGTOGATATQ AAGGGATTCT GQCCAATGOT GTTCTTTOOA B40 

GGGATGGTTC CCTGTCAGAC AA6CX5ATTTC TGTTTACCTC CAATGOTTGC AGCAGATCCT 900 

TaAGTTTTQA ACCTGAGGGQ CAAATCAQAO CTTCTTGCTC ATGGGAOTGG 6TCAATGAQA 960 

GTGGAGACCA A6TTCACTG0 TCT O CT GG CC AAGCCOQACT TCAGQACCAA 6GCCCATCAT 1020 

GGGCTTCGGG CGACAGTAGC AACAAOCACA AACCAOGAGA GTGGCTGGAG ATCQATTTGG 1080 

GGGAGAAAAA GAAAATAACA GGAATTAOGA CCACAGGATC TACACA0TCX3 AACTTCAACT 1140 

TTTATGTTAA GAGTTTTGTG ATGAACTTCA AAAACAATAA TTCTAAGTGG AAGACCTATA 1200 

AAOOAATTCIT GAATAATQAA GAAAAGGTGT TTCAGGGTAA CTCTAACTTT OGGGACCCAG 1260 

T6CAAAACAA TTTCATCCCT CCCATGGTGG CCAGATATGT 60G6GTTGTC GCCCAGACAT 1320 

GGCACCAGAG GATAGCCTTO AAGGTGGAGC TC A TTGOTTG CCAGATTACA CAAGGTAATG 1380 

ATTCATTGGT GTGGCGCAAG ACAAGTCAAA GCACCAGTOT TTCAACTAAG AAA6AA6AT6 1440 

AGACAATCAC AAGGCCCATC CCCT0G6AAG AAACATCCAC AGGAATAAAC ATTACAAOGG ISOO 

TGGCTATTOC ATTGGTGCTC CTTSTTGTCC T6GT6TTTQC TGOAATGOGQ ATGTTT6C3U3 1560 

CCTTTAGAAA GAAGAAGAAG AAAGGAAGTC CX3TAT6GATC AGCAGAGGCT CAOAAAACAG 1620 

ACTtSTTGGAA GCAGATTAAA TATCCCTTTO CCA6ACATCA OTCAQCTGAO TTTACCATCA 1680 

QCTATGATAA TGAGAAGGAG ATGACACAAA AGTTAGATCT CATCACAAGT GATATGGCAG 1740 

GTTAACTCGG TTQACTGCCA AAATAGCATC CCCAAC6TGC AGCC C T CO GC ATCTATCAGC 1800 

AGGTTGCCCC GQATGGATCT CAGAGATGA6 GATCGQAACA GCATOTTCTT TCOCACOCTA 1B60 

ACAACAACAA AGG6CA0TAA ATTAAAGTAC TCTTTGTAAG 0TACA6TTAC GGATTAATCT 1920 

AGAGATAAAA T A TTTTCTT A AAAiVTATATT TGATTAAACA CCTATOCTST CTCTATAAAA 19 BO 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2010 

Seq ID NOt C137 DNA Sequence 

Naclelc Acid Accession #i Bos sequence 

Coding sequence t 1 . . 1761 

1 11 21 31 41 51 

I 1 I I I i 

AT0GGATTC6 GTG06GG6CA GCGACTGCGC CC0QTCC0G6 0GC0606CTC GTCXXSGAGAO 60 

GAGGOGGCCC GGCCCGGGCA GCTGGGGCTC GGGAT0C6TC GAGGGGAGGC CGAGCTTGCC 120 

AAOCTGGCGC CCAGCGGGGT CATQOTGCCC GQCGCXXX3CX5 GCGGOOGCGC ACTGGCGCX3G 180 

GCTGCCX3GGC GGGGCCTCCT GGCTTTGCTG CTCGCGGTCT CCGCCOOGCT COGGCTGCAG 240 

GCGQAGGASC TGGGT6ATG6 CTGTG6ACAC CTAGTQACTT ATCAGGATAG TGGCACAAT3 300 

ACATCTAAGA ATTATCCCGG GACCTACCCC AATCACACTG TTTGCXSAAAA GACAATTACA 360 

GTACCAAAGO OGAAAAGACT GATTCTGAGG TTGGGAGATT TGGATATOGA ATCCCAGACC 420 

TGTGCTTCTG ACTATCTTCT CTTCACCAGC TCTTCAGATC AATATGGTCC ATACTOTGOA 480 

AGTATGACTO TTCCCAAAGA ACTCTT6TTG AACACAAGTG AAGTAACOGT COGCTTTGAG 540 

AOTGGATCOC ACATTTCTGG CGQGGGTTTT TTQCTQACCT AIGOQAGCAO OQACCATCCA 600 

GATTTAATAA CATGTTTGGA ACX3AGCTA6C CATTATTTGA AOAC!AGAATA CAGCAAATTC 660 

TGCCXAGCTG OTTGTAGAGA CGTAGCAGGA GACATTTCTO GGAATATGGT AQATGGATAT 720 

AGAQATACCT CTTTATTGTG CAAAGCTGCC ATCCATGCAG GAATAATTGC TGATGAACTA 780 

GGT^CCAGA TGAGTQT6CT TCAQCGCAAA GGGATCAGTC 6ATAT6AAGG GATTCTGGCX: 640 

AATGGTGTTC TTTOGAGOGA TGOTTCCCTO TGAGACAAGC GATTTCTQTT TACCTCCAAT 900 

G6TTGCAGCA GATCCTTOAG TTTTGAACCT GAOGGGCAAA TCAGAGCTTC TTCCTCATGO 960 

CAGTOGGTCA ATGAGAOTGG AGACCAAGTT CACTGGTCTC CTGGCCAAGC COGACTTCAG 1020 

GACCAAGGCC CATCATGGGC TTCGGGCGAC AOTAGCAACA AOCACAAACC A03AGAGT6G 1080 

CTGQAGATOO ATTTOGGGGA 6AAAAAGAAA ATAACAGGAA TTA6GACCAC AGGATCTACA 1140 

CAGTOGAACT TCAACTTTTA TGTTAA6AGT TTT6TGATGA ACTTCAAAAA CAATAATTCT 1200 

AAGT6GAAGA CCTATAAAG6 AATTGTGAAT AATGAAGAAA AG6TGTTTCA GGGTAACTCT 1260 

AACTTTCGGG ACCCAGTGCR AAACAATTTC ATCCCTCCCA T03TGGCCAG ATATOTGOQO 1320 

GTTGTCCCCC AGACATGGCA CCAGAGGATA GCCTTGAAQQ TGGAGCTCAT TGGTTGCCAQ 1380 

ATTACACAAG GTAATGATTC ATTGGTGTGG OGCAAGACAA GTCAAAGCAC CAGTOTTTGA 1440 

ACTAAOAAAG AAOATGAGAC AATCACAAGG CCCATGCCCT OQQAAGAAAC ATCCACAGQA 1500 

ATAAACATTA CAAOQOTGOC TATTCKATTG G'l q C f CVn' U TTSTCXTOOT GTTTGCTOQA 1560 

ATGGGGATCT TTGCAGCCTT TAGAAAGAAG AAQAAGAAAO QAAGTCOOTA TGQATCAGCA 1620 

GAGGCTCAGA AAACAGACTO TTGGAAGCAG ATTAAATATC CCTTTGCCAG ACATCAGTCA 1680 

GCTGAGTTTA CCATCA6CTA TGATAATGAQ AAGGA6ATGA CACAAAAGTT AGATCXCATC 1740 

ACAAOXGATA 7GGCAGGTTA A 1761 



Seq ID NO: CI 3 8 DKA Sequence 
IJucleic Acid Accession FGENBSB predicted 
05 coding sequence i 1..2310 

1 11 21 31 41 51 

I i I I I ) 

ATGTTCCAGC GGCAGGAAAG ATTTCTTGAC TTATCTTCAG CTQAAGCAGT GaCAaCTTGG 60 

70 ATATTACATC AACATCCTGA CATTATTAAC AAAGGTGATG GCTGTGGACA CCTAGT6ACT 120 

TATCAGGATA GT6GGACAAT GACATCTAAO AATTATCCOQ GGACCTACCC CAATCACACT 160 

GTTTGOGAAA AGACAATTAC A6TACCAAAG GG6AAAAGAC TGATTCtGAO GTT6GGA0AT 240 

TTGGATATCG AATCCCAGAC CTGTGCTTCT GACTATCTTC TCTTCACO^ CTCTTCAGAT 300 

CAATATGGAA TGCAGAAGGA GGAGGAGACA GAAGTGCTTT GTCTTTCAGT GGCTGQCGCT 360 

75 CAGAGAGTGO ACATTCCTOT QCAGCTQTTG CCCAGCTTCC T6GAAGGGTO GAAGGGTCAT 420 

GCTGATGCAA GAGGTCCATA CT6TGGAAGT ATGACTGTTC CCAAAGAACT CTTGTTQAAC 480 

ACAAGTGAAG TAACOOTCGG CTTT6AGAGT GGATCCCACA TTTCTGQCCG GGGTTTTTTG 540 

CTGACCTATG CGAGCAGC36A CCATCCAGAT TTAATAACAT GTTTGGAAOO AGCTAGCCAT 600 

TATTTGAAGA CAGAATACAG CAAATTCTGC CCAGCTGGTT GTAGAGAOGT AGCAGGAGAC 660 

oU ATTTCTGGGA ATATGGTAGA TGGATATAGA QATACCTCTT TATTGTGCAA AGCT6CCATC 720 

CATGCAGGAA TAATTGCTGA TGAACTAGGT GGCCAGATCA Gm i UC ll CA GCOCAAAGGQ 780 

ATCA6TCGAT ATGAAGGGAT TCTGGCCAAT OG T Wl ' drrr COAGGGATGO TTCGCTQTCA 840 

GACAAGGGAT TTCTGTTTAC CTCXAATGGT TGCAGCAGAT CCrrGAGTTT TGAACCTGAC 900 

GOGCAAATCA GAGCTTCTTC CTCATGGCAO TOGGTCAATG AGAGTGGAGA CCAA6TTCAC 960 
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•irsGrersccixi 

AGCAACAACC 
ACAG6AATTA 
GTGATGAACT 
GAAGAAAA66 
CCTCCCATOQ 
TTGAAGGTGG 
AAGACAAGTC 
ATCCCCTOGG 
CAGATGATCT 
ACAGCGQAAA 
TTCTTAGCAC 
CCCTTTGCCA 
ACACAAAAGT 
6GCACCGGGA 
GAGGCA6GG0 
CA0GAGTACX3 
OGGCAOGTGC 
CAGCCCGGCC 
GCCCAGGACG 
COGOCCAAAG 
CCAAOGCATC 
AACCAGACGG 



GCCAAGCCOG 
ACAAACX31CX3 
GQACCACA6G 
TCAAAAACAA 
TGTTTCAGGG 
TGGCCAGATA 
AGCTCATTGG 
AAAGCACCAQ 
AAGAAACATC 
CACAAAGG6A 
GCATGGTTAG 
ACACACCTGA 
GACATCAGTC 
TAGATCTCAT 
CAGTCACGAQ 
TGAGCACCGA 
CGCTGCCCCT 
TGCGCGCCCA 
ACAAACACTC 
GAGACTATCA 
CTGTCAGCX3C 
CCGGGAOQAO 
CCATGACTGC 



ACTTCAGGAC 
ACAGTGGCTG 
ATCTACACAG 
TAATTCTAAO 
TAACTCTAAC 
TGTQOQOOTT 
TTGCCAGATT 
TOTTTCAACT 
CACAGAT6CC 
GAATCTGGGA 
AGTAGTGTTT 
GGAGGACATT 
AGCTGAGTTT 
CACAAGTGAT 
GAAGGGCTOC 
TGCCGGCGGC 
GGCGCCCCOG 

CACxnrcTCT 

AAGGCCACAC 
CCTCGCCACC 
TQACA6CTAT 
CCTTTTGTGA 



CAAC6CCCAT 
GAGAT06ATT 
TGGAACTTCA 
TGGAAOACCT 
TTTGGGGACC 

qtccccx:aga 
acacaaggta 
aagaaagaao 

ATGCQAGTGC 
CCTGATGA6G 
GCTOTTGTOG 
GATCACTACT 
ACCATCAGCT 
ATGGCAGATT 
ACCTTCCGGC 
CACTATGACT 
GAGCCGGAGT 
GOQCAGAGOO 
GGGGGCTTCT 
AQGQCACAGC 
GAAA50G66C 
TCTGCCCCCA 



CATGGGCTTC 
TGGGGGAGAA 
ACTTTTAT6T 
ATAAAGGAAT 
CAGTGCAAAA 
CATGGCACCA 
ATGATTCATT 
ATGAGACAAT 
AGATT6T0GG 
GCAAAATACC 
TTAATQACCT 
GTTGGAAGCA 
ATGATAATGA 
ACCAGCAGCC 
CCATGGACAC 
GCCOGCAGOO 
AOGCCAOGCC 
OCTACCGCGT 
CGCCCGTAGC 
CTGOGGACAG 
ACCCTQACTC 
GAOACTQCCT 



GGGCX5ACAGT 
AAAGAAAATA 
TAAGAGTTTT 
TGT6AATAAT 
CAATTTCATC 
QAGGATAGCC 
GGTGTGGOGC 
CACAA6GCCC 
AGACCATACX: 
TTTTAAAGGC 
TQGCATGCTG 
GATTAAATAT 
GAAG6AGATG 
CCTCATQATT 
GGATGCCGAO 
GGCOQGCOQC 
CATGGTGGAG 
CCCAGGOCCC 
GGGTGTGGGC 
GGGCTAOGAC 
TCAGAAGCCC 
CACACCCCTC 



Seq ID NO: CX39 DNA Sequence 
Nucleic Acid Accession #t NM_004616.2 
Coding sequence: 180.. 893 



AGT6CCCCA0 
ATTTAAATGT 
GACAAGCCTG 
TGGCAGGTGT 
TATGTGGTAT 
GAATTTTTGO 
CTOTAGGTGC 
GTGQCTGCAT 
CGACAGGTAT 
TCTATGAAAA 
CCATAATTGT 
GGGGAAATAA 
6CCAAAGCTA 
TCTTGGCAAA 
TACTGGGTTT 
GATQCATCAA 
AAATATGTAA 
AGACCACAGA 
AATQATATGA 
AAAAAAAAAA 



11 
1 

QAOCTATQAC 
CCGTGGATAC 
TAACGAATAG 
GAGTGCCTGT 
CTTGATCCTA 
TTCTGAAGAT 
CATCATCA7G 
GCTTCTQTTG 
CCTAGGAGCT 
CACAAAGCTT 
0TTTCAA6AA 
TTTTCAACAC 
TAATGQAAAA 
AAATTTGATT 
GGTGTTTTCT 
CCXATCQTCA 
GTGCTATATA 
TATCTTCTAQ 
ATGTGTATTT 
AAAAAAAAA 



21 
1 

AAGCAAAGGA 
AQAAATCTCT 
TTAAATTCAC 
ATAAAATAT7 
6CATTAGCAA 
GTAGGCTCTA 
ATTCTGG6CT 
TTTTTCATAG 
GTTTTCAAAT 
TTGAGOGCCA 
GAGTTXAAAT 
TATCCTGAAT 
CAAGTTTACA 
ATA6TTATTG 
AT6GTCCTGT 
QTCAAftCCCC 
AGTCAGQAGC 
ACATATTOAA 
TTACTCAAAA 



31 
I 

ACATACTTGC 
6CAGGCAAGT 
GGCATCTGGA 
CTATGTTTAC 
TATGGGTAOG 
GCTCCTAOOT 
TCCTGGGAT6 
GCTTGCTTCT 
CTAAQTCTGA 
CAGGGGAAAG 
OCTGOQOTTT 
TATGTGCCTO 
AAGA6ACCTG 
GAATATCATT 
ATTQCCA6AT 
TTTAAAAT6T 
AGCTGTCTTT 
CACATTTAAQ 
TAAAAGTAAC 



41 

I 

CTGGAGATAG 
TGCTCCAOAG 
TTCCTAATCC 
CTTCAACTTC 
AGTAAGCAAT 

TGcraroGAC 

CTGOGGTGCT 
QATCCTGCTC 
TCGCATTGTG 
TGAAAAACAA 
GOTCAATQGA 
TCTAGATAAG 
TATTTCTTTC 
TGGACTGGCA 
OG GGAACA AA 
TGCTTT06CT 
TTAAAATGTC 
ATTTQAGGGA 
TQTTTAOQTT 




GACTCTCAAG 
ATATTGATTG 
ATAAAA6AAA 
CTGCAGGTGG 
AATGAAACTC 
TTCCAGGAAG 
GCTGCTQATT 
CAGAGACCAT 
ATAAAAGACT 
GTTATTGAGA 
TGAATCTGTG 
TTOTAAATTT 
TCG6CTA6CT 
TATAAOGOAA 
AAAAAAAAAA 



Seq ID MO I C140 DNA Sequence 
Nucleic Acid Accession #t NM_004617.2 
Coding sequences 232 ..840 " 



1 
I 

CTTCAGGTCA 
AGCTGAAGCA 
ATTGAATTGG 
AQQAQQCAlGT 
GGGGGCrOTG 
GCTAACATCC 
CAAGAGATCT 
CTGGTGTTCT 
GGQAAOCQAT 
GQATACTGGT 
AATAGTACAT 
AACAAGTGCC 
CTGGTCGTAQ 
G6GACCCTCT 
ACCTCC6AGA 
CTTCTTCTCT 
AGGCAGTTAT 
TAAAAAATAG 
TTAOAATTTA 
ATGTATAAAG 
T6GAAGTTTT 

TTTTTTGCTG 
GGGTTCTTTT 



11 

I 

GGGAGAAT6T 
ACTCCAAGGA 
AGACAATTAC 
GA6GAGCTTT 
CCAC3ATGGCT 
TGTTATTTTT 
GGTTTTTOQG 
TGGGCCTGAA 
TTGCGATGTT 
TTATCATCTC 
GGGGCTACCC 
GAGAGCCTCT 
GAGGAATCCA 
GTGGGQACTG 
TGAGCTGCTC 
TCTTGGAATT 
TCCTTCTTTC 
TTTGGCCACT 
CCAACAGGTT 
TCACAT6TAC 
ACATAAATCA 
AAGAGGAAAA 
CCTTQCTTQA 
ATGTATGTTA 



21 
I 

ATAAATGTCC 
CACAGTTCAC 
AAG6ACTCTC 
TGATTGCTGA 
GGGGGGGACC 
TCCTGGAGGA 
AOGAATATTA 
GAACAATGAC 
CACCTCCAOG 
AGCCATTTCA 
CTTCCAOQAC 
CAATGTGGTT 
GATGGTTCTC 
CCAGTGTTGT 
AGACTCTACA 
ATTAATTCCT 
CAACX3l6CrT 
TAACAAATTT 
CAAAGCATAC 
TGCCATACTA 
AAGGAAGAAA 
AGAATQATTG 
GTTGCTTGT6 
AATTAAAACC 



31 
I 

ATTGCCATCG 
AGAAATTTGG 
TGGCCAAAAA 
CCI6TGT08T 
CTCATTCCCC 
AAAGTGATAG 
GGAAGCGGTG 

ATATTTGCTG 
ATCAACAAGG 
GGGQATTATC 
CCCTGGAATC 
TGCGOCATCC 
GGCTGCTSTO 
GCATQA08AC 
ATCTGCTTOC 
TGCTCGAOTT 
GATTTATAAA 
TTTTCATQAT 

crrcTrPGTA 

GCACATTTAA 
AIGTATCCTA 
ACT6ATCTTT 
TGAATTCAGA 



41 

I 

AGGTTCTGCT 
TTCTCAGCCC 
CGCTTGAAGA 
ACCACCCCAG 
TTGCTTTTTT 
ATGACAAOGA 
TCTTGATGAT 
GCTGCQGCAA 
TOGTTGQATT 
GTCCTAAATS 
7CAATGATGA 
TGACCCTCTT 
AGGTGGTCAA 
Q0GGA0ATQ8 
TACAATTTCT 
TAGCTGATAA 
AGAATTTTGT 
TCTTTCAAAT 
TTTTTTATTA 
TATAAAGATO 
AATX3AGAAAC 
AGTATTGTTA 
TQAGGCTOTC 
GGTAACGT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2310 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1159 



51 • 
I 

ATTTTTGAGA 
CAAAATACTG 
GGCCCC6TGA 
AATGTGCACT 
TGOCTTCCTO 
CCACCTTTCC 
CTTCCCTGOQ 
CGAGG6CTGT 
CTTGGGAGCT 
CCTCATGGCC 
GGCCTTATGG 
CTCCATCCTG 
TGGCCTCCTG 
AOCOSTTTAA 
TTTCATAAAA 
A6CTTAGAAA 
TATTTTCAAA 
TAGTTCCTTT 
CAAATOTAAA 
TTTATATCrr 
TAAGACCAAT 
TTTGTTQTCT 
ATCATGGCTA 



60 

120 

180 

240 

300 

360 

420 

460 

540 

600 

660 

720 

780 

840 

900 

960 

102O 

1080 

1140 

1200 

1260 

1320 

1380 

1428 



Seq ID HOt C141 DMA Sequence 
Nucleic Acid Accession ft: NM 002381.2 
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Coding sequencer £4.. 1524 

1 11 21 31 41 51 

I I I I 1 I 

AAATCOGAGC CTCGCGTGGO CTCCTGGOCC CCGA06GACA CCACXa^GGCC CAOGGAGCCC 60 

ACCATGGGGC GCCOGGCCCC CGC3CX3GOSC CTOCGOGOAC TGCTCCTGCT GCTCTGGOOQ 120 

CT6CTGCT6C T6CCCTCCGC CGCCCCOGAC CXrOGT GQ CCC GO00GG6CTT COGQAGGCTG 180 

GAQACCCGAG GTCCCGGGGO CAGCCCTGGA CGCt3GCCCCT CTCCTGOGGC TCCOGAOGGC 240 

GOGCCCGCTT COGGGACCAO CGACCCTGGC CGCGCCCGCO GTCCAGGTGT TTGCAAGAGC 300 

10 AGACCCTTGG ACCTGGTGTT TATCATTQAT AGTTCTOGTA GCX5TACGGCC CCTGGAATTC 360 

ACCAAAGTGA AAACTTTTOT CTGCOGOATA ATOOACACTC TGQACATTOG GCCAGC06AC 420 

AGGCGGGTGG CAGTGGTGAA CTATGCTAGC ACIGTGAAGA TOGAGTTOCA ACTCCAOGCC 480 

TACACAGATA A0CA6TCCCT GAAGCA6GCT OTQGGTOQAA TCACACCCTT GTCAACAGGC 540 

ACCATGTCAG GCCTAGCCAT CCAGACAGCA ATGGACGAAG CCTTOICAGT GGAGGCAGGG 600 

GCTOSAGAGC CCtCTTCTAA CATCCCTAAO GTGGCCATCA TTGTTACAGA TGGGAGGCCC 660 

CAGGACCAGG TGAATGAAGT GQCXK3CTGGG QCCCAAaCAT CTG Q TATTGA GCTCTATGCT 720 

G TG GG OGT GG ACGGGGCAGA CATGGCGTCC CTCAAGAT6A TG6CCAGT6A 6CCCCTAGA0 780 

QAGCATOrrr TCTAOOTGOA QACXrrATQOO GTCATTGAOA AACTTTOCTC TAGATTCCAG 840 

GAAACCTTCT GTGOSCTGGA OCCCTGTGTG CTTGGAACAC ACCAGTGCCA GCAOGTCTGC 900 

ATGAGTGAT6 GGGAAGGCAA GCACCACXOT GAQTQTAQCC AAGQATACAC CTTGAATGCC 960 

GACAA6AAAA CGTGTTCAGC TCTTGATAQG TGTGCTCTTA ACACCCA0G6 ATGTGAGCAC 1020 

ATCTGTGTGA ATGACAGAAG TOGCTCTTAT CATTGTGAGT 6CTATGAAGG TTATACCTT6 1080 

AATGAAGACA GGAAAACTTG TTCAGCTCAA GATAAATGTG CTTTGGGTAC CXIATGGGTGT 1140 

CAGCACATTT OTGTGAATQA CAGAACAGGG TCCCATCATT GTGAATGCTA TGAGGGCTAC 1200 

ACTCTGAATG CAQATAAAAA AACATGTTCA GTCGGTGACA AGTGTGCCCT AGGCTCTCAT 1260 

GGTTGCCAGC ACATTT6TGT GAGTGAT6G0 0C06CATGCT AOCACIOTQA TTGCTATCCT 1320 

GGCTACACCT TAAATOAGGA CAAQAAAACA TOTTCAGbCA CTOAGGAAac AGGAAGACTT 1380 

GTTTCCACTG AAGATGCTTQ TGGATGTGAA GCTACACTGG CATTCCAGGA CAAGGTCA6C 1440 

TOQTATCTTC AAAGACTGAA CACTAAACTT QATGACATTT TGGAGAAGTT QAAAATAAAT ISOO 

GAATATGGAC AAATACATCG TTAAATTGCT CCAATTTCTC ACCTGAAAAT GTGGACAGCT 1560 

TGGTGTACTT AATACTCATO CATTCTTTTG CACACCTGTT ATTGCCAATG TTCCTQCTAA 1620 

TAATT1GCCA TTATCTGTAT TAATGCTTGA ATATTACTGG ATAAATTGTA TGAAGATCTT 1680 

CTGCAGAATC AGCATGATTT TTCCAAGGAA ATACATATGC AQATACTTAT TAAGAGCAAA 1740 

CTTTAQTGTC TCTAAGTTAT GACTGTGAAA TGATTGGTAG GAAATAGAAT GAAAAGTTTA 1800 

QTGTTTCTTT ATCTACTAAT TCAGCCATTT AATTTTTAAA TGTTTATATT AGATAACCAT 1860 

ATTCAGAATG 6AAACTTTAG GTCTAGTTTC TTTTGATAGT ATTTATAATA TAAATCAATC 1920 

TTATTACTGA GAGT6CAAAT TGTACAAGGT ATTTACACAT ACAACTTCAT ATAACTGAGA 1980 

TGAATGTAAT TTTGAACTGT TTAACACTTT TTGmTTTG CTTATTTTQT TGGAQTATTA 2040 

TTGAAGATGT GATCAATAGA TTGTAATACA CATATCTAAA AATAGTTAAC ACAGATCAAG 2100 

TGAACATTAC ATTGCCATTT TTAATTCATT CTGGTCTTTO AAAGAAATGT ACTACTAAAO 2160 

AOCACTAOTT OTGAATTTAO GGTGTTAAAC TTTTTACXSUl OTACAAAAAT COCAAATTCA 2220 

CTTTATTATT TTGCTTCAGG ATCCAAGT6A CAAAGTTATA TATTTATAAA ATTGCTATAA 2280 

ATCGACAAAA TCTAATGTTQ TCTTTTTAAT GTTAGTGATC CACXT6CCTC A6CCTCCCAA 2340 

AGTGCTGGGA TTACAGGCTT GAAAGTCTAA CTTTTTTTTA CTTATATATT TGATACATAT 2400 

AATTCTTTTG GCTTTQAAAC TTGCAACTTT GAGAACAAAA CAQTCCTTTA AATTTTGCAC 2460 

TQCTCAATTC TQTTTTTOQT TTGCATTOTC TTTAATATAA TAAAAGTTAT TACCTTTACA 2520 

TATTATCATO TCT A TTTTTQ AT6ACTGATC AATTTIGTCT ATTAAAGATA TTTCTTTAAA 2580 

TTAAAAAAAA AAAAAAAAA 2599 



15 



20 



25 



30 



35 



40 



45 



50 
55 
60 
65 
70 
75 



8eq XD MO; C142 EHA Sequence 

Nucleic Acid Accession «i NM_016€39.1 

coding sequence: 40.. 429 " 

1 11 21 31 41 51 

I I I I I I 

GOQGGGGGOQ CAGACAGCGG COGOOSCAaO ACGTGCACTA TGOCTGGQGO CTGGCTGOOC 60 

OGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGGGT TGCTGOGCTC OGTGGCCGGG 120 

GA6CAAG06C CAGGCACOGC CCCCTGCTCC OQCGGCAGCT CCTGGAGCOC GGACCTGGAC 180 

AAGTGCATGG ACTGCG06TC TT6CAGGGGG GGACCGCACA GCGACTTCTG CCTGGGCTGC 240 

GCTGCAGCAC CTCCIGCCCC CTTCCGGCTQ CTTTGGCCCA TCCTTGGGGG COCTCTGAGC 300 

CTGACCTTCG T6CTG6G6CT GCTTTCTG G C TTTTTGGTCT 0GAGAC6ATG CC6CAGGAGA 360 

GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGOGGAGAGG GCTGCCCABC TGTGGOOCTG 420 

ATCCAOTGAC AATOTGCCCC CTGCCAGCOQ GQGCTCQCCC ACTCATCATT CATTCATCCA 480 

TTCTAGAGCC AGTCTCTOCC TCCCAGAOGC GG06GGAGCC AAOCTCCTCC AACCACAAGG 540 

GGG8T6GGGG GGGGTQAATC AOCTCTGAGG GCTGGOGCCA OGGXTCAGGG GAACCTTOCA 600 

AGGTGTCTGG TTGCCCT6GC TCTGGCTCCA QAACA6AAA0 GGA6CCTCAC GCTGGCTCAC 660 

ACAAAACAGC TGACACTGAC TAAGGAACTO CAGCATTTOC ACAGGGGAGG GGGGTGCOCT 720 

CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGOGGGCA GACTTGACAC TAGGCCCCAC 780 

TCACTCAGAT GTOCTGAAAT TCCACCAOGG GGGTCACCCT GGGGGGTTAG GGACCTATTT 840 

TTAACACTAG GGQCTGGCCC ACTAGQAGGO CTOGOCCTAA GATACAOAOC CCCCCAACTC 900 

CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG GAGGGQAGGQ AGAATTTATT 960 

AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 998 

Seq ID NOt C143 DNA Sequence 
Nucleic Acid Accession #i MN_001819 
coding sequences 113.. 2146 



„^ 1 11 21 31 41 51 

80 1 I I 1 I I 

CCAGGAGGCA OQCTGGTTTT COGGGGOOGC TCCATOGOGC CriWl'OriXi OGCCTOGCTT 60 

CTOOGGTOCA QCOGCCATCr TCCTTi- QOUC AaUXSGGCCO O OgAGO GOGG CCATQCASCC 120 

AAGGCTQCTT CTCAGCCTCC T6GGAQC0GT GG6GCTGG00 OCTGTCAATT CCATGCCAOT 180 

OOATAACAQG AACCACAATG AAOQAAIGOT GACTGQCTOC ATCATTGAGG TCCTCTCAAA 240 

1289 



wo 03/042661 



PCT/US02/36810 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



TGOCTTGTOQ 
GAGTAGAAAA 
AAOATTGTTA 
6GCA0GAGGC 
GGCAGAGGGA 
CTCCGACAGC 
GGATGAOGAG 
TGAAGAGAAA 
GCA6GCTTCA 
GaVTTCTCAG 
AGGGAGCCAG 
TGAGGAAGCTT 
CCACCACGGG 
AAAG6GACAC 
AAAGAGGGAC 
AGAAATAAAG 
GG6CA6AGGA 
GGAGGACAAG 
AGAAAGTGAG 
AGQGGAGCCA 
AGGACACCAC 
GTGGAAAGAG 
GCAGCAGCAG 
TAAACAATAT 
CCCATACTAC 
TGACAATTTT 
CXXAOAATAC 
GTATGAGAAG 
G6TGGCCCAA 
CTATGATTCT 
TGACCAQACA 
TTTGGAACTA 
GTCGGCACTG 
ACCATTTATA 
TTGGAATTGT 
OCATATTCTT 
TCTTTQ8AAA 



AAGTCCAGCG 
GACQTCAAAG 
AGAGACCCAG 
CCA6GGGAGG 
GGCGGGCACA 
CAAOTCTCTC 
GAGGAGGAGG 
CACCTTGAAO 
GCTATAAAAA 
GAGAAGACAC 
GAGAATCACC 
GAG6AA6AT0 
AGGAGCAGGC 
CCCCA6GAGG 
CACCATTCAA 
GGTTATCCAG 
AGTGAAGAAT 
AGAAACTACC 
GAAGAGAGGO 
COTGCCTATT 
CX3TGTCCAAO 
CTGOACAGAA 
GGA6ACCTGC 
AGCTCCCATC 
GACCCTCTCC 
CTCGAGGGTG 
AACTATQACT 
AGAAACCTCG 
CTGGACCSVGC 
QAGGAGCOGG 
GTCCTGACAG 
CAGAAGATAG 
TTAAGAAGCA 
TACCCAAGGO 
CTTTAATTTC 
TTCT GCAA AA 
AATGTTTGTC 



CTCCACCCAT 
ACAAAGAGAC 
CTGATGCCTC 
AOGACATCCA 
GC0GAGA0G6 
AAGAAGTGAA 
GAGAGAACTA 
A0CCAG6A6A 
AAGAGGAGTT 
ATAGCCGA6A 
CCCAGGAGTC 
CCACCTCTGA 
CCGACAGQTC 
AATCTGAG6A 
CCCACTACAO 
GOGTCCAGGC 
ACAGGGCTCC 
CCAGCTTAGA 
QCCTTGAGCC 
TCATGTCTQA 
AAAACCAGAT 
ATTATCTCAA 
A6GACACTAA 
ACACAGCTGA 
AOTGGAAGAG 
AGGAGGAAAA 
GGTGGGAGAA 
CCAGGGTCCC 
TCCTTCACTA 
TQAGCACCCA 
AOQACQAGAA 
CTGAGAAATT 
GCCATCACAT 
CAGAAAGTAQ 
TGTCAGAATG 
TAOACATATT 
TCA8TTGQAA 



CACCCCTGAG 
AACT6AAAAT 
GQAAGCCCAC 
AGGCCCAACA 
AGC9G6ATGAG 
QACAGGCCAT 
TCAAAAAGGG 
OACACAAAAC 
A0T66CCAGA 
GAAGAGTA6C 
TAAAGGCCAA 
GGTGGACAAA 
CTCTCAAGGA 
GTCAAACGTC 
GGCTTCAGAG 
CCCTGAGGAC 
AAGACCTCAG 
GCTTGATAAG 
GGGAAAGGGA 
CACCAGAGAA 
GGACAAGGCA 
CTACGGTGAG 
AGAAAACA6G 
AAAGAG6AAG 
CAGCXyVTTTT 
TGAGCTGACX: 
AAAGCCCTTC 
CAAGCTGGAC 
CAGQAAGAAQ 
CCAGGAGGCA 
AAAAGAACTC 
CAGCCAAAGG 
GATCTGTTTT 
AACTTACTAT 
CTATTGAAAA 
AACATOCTTA 
ATAATAAAAO 



TGCOGCCAAG 
GAAAACACAA 
GAGTCCTCCA 
AAG6CAC3ACA 
OOCCAGTGGA 
TCTQASAAGA 
GAGOCSAGGGG 
GCTTTTCTCA 
T0G6AAACAC 
CAGGAGAGTG 
CCC06AA6CC 
0GAGGCAC6A 
GGQAGTCTTC 
AOCATGGCCA 
GAAGAACCTG 
CTGGAGTGGO 
AGTGA6GAGA 
ATGGC3VCATQ 
CGCCATCACA 
GAGAAAAG6T 
AGGAOGCATC 
GAAGGAOCCC 
GAGGAAOCTA 
AGATTAGGGG 
QAAAQAAGAG 
TTGAAC3GAGA 
TCTGAGGATG 
CTGAAAAGGC 
TCAGCTGAGT 
GAAAATGAAA 
GAAAACTTGG 
GGCTGACTGT 
TCACCACTTC 
TCATTAAATQ 
TGTGAATTGC 
TGACAATGAC 
ATTCACCTGA 



TCCTGAAGAC 
AGTTTQAAOT 
GCAGGGGAGA 
CAGAGAAATG 
GCCTCTATCX! 
GCCAGAGAGA 
AAGATAGCAO 
ATGAAAGAAA 
ATGCTGCCGG 
GAGAGGAGGC 
AGGAAGAATC 
GGCCCAGACA 
CCTCTGAGGA 
OTTTAGGGGA 
AATATGGAGA 
AGCGCTATAG 
GTTGGGATGA 
GATATGOTGA 
OAGGCAGGGG 
TCTTGGGTGA 
CACAAGGTGC 
CAGGGAAGTG 
QGTTTCAAGA 
AACTGTTCAA 
ACAACATGAA 
AGAATTTCTT 
TGAACTGGGG 
AATATGACA6 
TTCCAGACTT 
AGGACAGGGC 
CTQCAATGGA 
CATTGOAGOQ 
ACTGAAAGAC 
TTTGACACAA 
ATGACTT6TA 
T6TGCTACTG 
GACC 



Seg ID NO: C144 SNA Sequence 

Nucleic Acid Accession #i XM_093082.1 

Coding sequence: 93..19BB 



CTTCrrGTGQ 
AATTT6TAAA 
AOCCCAACTG 
GTGGG6CTCT 
AGACAAAACT 
6CAAATTACA 
TTGGAAIGGT 
ACTOTCAGQA 
TTGCTGCTGC 
CAGOAAOACT 
ACCCAGCATC 
ACTTAGAAOT 
CCCTACAAGT 
TGCCTAATTC 
AOGTGACTGT 
GAGAACA6CT 
CTCCTTG66A 
AGCAGQCTGA 
TTAAAQATOQ 
TGCAGCCAGC 
6CCAAAACCA 
GCGTTCAAGG 
GAACACCTTC 
AAGAAAACTT 
AAGGTTATTA 
TCACTTCTTC 
GTGCOQCXIAT 
AAGAAAGAAA 
GAGAAAGCGA 
CCATTCATGA 
AOGAGCXrTGC 
TOGAGCTGGA 
CASAOTCAOA 
AGGCATAG 



11 
I 

TAG6GACCTC 
AAATACCCTC 
TGTATTACAG 
AGCSUSCTOTT 
GGAAGCTTCA 
GAA6CTCAAC 
TGCPUVATTGT 
CAGTTCTTTC 
AAAGGATTGT 
TGAGGCA6TG 
TCAGGGTGGG 
CAAG GCAGAA 
TGTGCTOCTT 
TCCTGGTCAG 
TQGATCTAAT 
TTCCATCCAG 
OOAQOGQAAG 
ACTCCAGATT 
AATTACAGGQ 
AGACAGTGGA 
AG6CATCCTC 
AAGACTAGAA 
CCCTGTGTAC 
CAACCCAACC 
CCAGTGTACT 
ACATCCAGAA 
CATCATCTCT 
TTCTAA6ACC 
AGCAATGCCA 
GACTGGCCCT 
CCCAGA6CCT 
GCTGGAOCCA 
GOCTGGGGTT 



21 



TCCTCAGTAT 
GAAGATTCAa 
ACATTGAGGT 
CTTAGCACCA 
GCTTTGAAAT 
TTGCAOTTTT 
TCTGOTTTCT 
AAATGTGATC 
GGGAGTCCTA 
GAGGAGGTTT 
GGGCATT6T0 
CCAAGCCTGA 
T6CAAAATCT 
GTTAGT6TGG 
OTCACTCrCA 
TGGTCTTTCr 
TGGCCAGATG 
TACTTTTCTC 
TCCAAOSATC 
ATTTACATCT 
AACGTCAGTG 
ACTGSCCACA 
TACTGGdTA 
ACOGGGATTT 
6CCATCAACA 
GTTGGAATCA 
OTTGTGTOCT 
AT060G6AAC 
AQAGAAGACQ 
GATACCATCC 
GCCCCAGGAT 
6AAAOGCA0T 
GTAGTTGAGC 



31 
I 

TTGAAACTAA 
OAATQAAGCT 
GGTACOGGTO 
GTCAGTGQCT 
TGCTCTATGG 
CTTTATCTGT 
OGOGATCATT 
TTTGTAAGCT 
AGTCCTTCCT 
TGGGGTTGGG 
AAAACTATGG 
GAAAAGGTQG 
TCTCCCTCAA 
TGCAAGT6AC 
TCTGCATCTA 
TCCATAAGAA 
TTGAGGCTGT 
AAGGTGGACA 
CAGGTAATGC 
GGQATGTTAA 
T6TTAGTGAA 
CTATTTCGCT 
AACTTGAGGQ 
TGOTCATTQG 
GACTTGGCAA 
TTGTTGGGGC 
TCX3CAAGGAA 
TTGAGCCAAT 
CTACXCAACT 
AAGAACCAOA 
CAGAGCCTAT 
GOGAATTGGA 
CCTTAAOTOA 



41 

1 

CCAGCATCTTG 
TCTQTOTQAA 
CCTTATCTCT 
CACTSAACTG 
AGGCTTAAAA 
AAGG6CTGCA 
GGTGCAATCT 
GCTCTt360C7r 
ATCAGAAGGG 
GGTGCTTGTA 
OTCTTTTAQA 
TAT6GATCTC 
ACTATTTCTC 
CATCCCAOAC 
CACCACCACT 
GGAGATGGAG 
GAA0G6CACT 
AOCTOTAOCC 
ATCTATCACT 
CAACCCCCCA 
AOCTTGTAAG 
TTOCTGTCTC 
AAlGAGACATC 
AAATCTGACA 
TACTTCCTQC 
CTTGATTGGT 
TAAGGCAAAA 
GACAAA6ATA 
AGAAGTAACT 
CTATOAOCCA 
GGCAGTGCCT 
GCCAOAOOCA 
AGATQAAAAO 



51 

1 

ACAGATTTCX3 
GGATTAAAAC 
TCTGCTTCTT 
6AATTTAGTG 
GATCCAAATT 
AAACTTCCAG 
CATTTTGGCT 
TCCACCAGAG 
GTGAACTGGG 
CAGCCCGGTG 
aACTTGGTGO 
GAGAGACCCA 
TTTATTGCAT 
GGTTTCX3TGA 

CGAATTTCTT 
CTTGATGQAC 
ATOGGGCAAT 
AT CTOG CATA 
GACTTTCTCQ 
CCCCXTTGTA 

GTGCCAQTGA 
AATTTTGAAC 
GAAATCQATC 
AGCCTGGTAG 
GCAAAG6CAA 
AACCCAAGGO 
CTACCATCTT 
AAGCCTACTC 
GACCTTGACA 
OAOCCAGAOC 
6GAQTGQTTA 



300 

360 

420 

480 

540 

600 

660 

720 

780 

640 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2454 



60 

130 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

1988 



Seq ID NO: C145 DNA Sequence 

Nucleic Acid Accession 9t FGBNBSH predicted 

Coding sequences 1..1242 



11 



21 



31 



51 

1290 



wo 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



I 

ATCGTGTTCG 
GTGGTGCAAG 
CTCATCTQCA 
TTCTTCCATA 
• QATGTTGAGG 
TCTCAACGTG 
GATCCA6GTA 
ATCTGCGATQ 
AGTGTGTTAO 
CACACTATTT 
CATAAACTTG 
ATTTTGGTCA 
AACAOACITQ 
ATCATTGTTG 
T6CTT0GCAA 
GAACTTQAGC 
QAOGCTACCC 
ATCCAAGAAC 
GGATCAQAGC 
CAGTGGOAAT 
GAGGCCTTAA 



I 

CATTTTGGAA 
TQACCATGCC 
TCTACACCAC 
AGAAGGAGAT 
CT6TGAAGGQ 
GACAAGCTGT 
ATGCATCTAT 
TTAAGAACXX: 
TGAAACCTTC 

c cemixr i ' u 

AGGGAAGAGA 
TTGGAAATCT 
GOUlTAlOTTC 
G6G0CTTGAT 
GGAATAAGGC 
CAATGACAAA 
AACTAGAAGT 
CAGACTATGA 
CTATGGCAGT 
TOGAGCCAOA 
GT6AAGATGA 



GGTCTTTCTG 
AGACGOTTTC 
CACTGTGGCC 
G6A6CC3VATT 
CACTCTTOAT 
AGCCATCGGG 
CACTATCTGG 
OCCAQACTTT 
TAAGCCCCTT 
TCTCTCTGOO 
CATCGTGCCA 
GACAAATTTT 
CTGOQAAATC 
TGGTAGCCTG 
AAAAGCAAAG 
GATAAACCCA 
AACTCTACCA 
GCCAAAGCCT 
GCCTGACCTT 
GCCAGAOCCA 
AAAGGGAGTG 



I 

ATCCTAAOCT 
OTGAACGTGA 
TCCOGAGAAC 
TCTTCTCCTT 
OGACAGCAGG 
CAATTTAAAG 
CATA TGCAGC 
CTOGGOCAAA 
TGTAGOOTTC 
CTTGGAACAC 
GTGAAAGAAA 
GAACAAGGTT 
GATCTCACTT 
GTAGGTGCGG 
OCAAAAOAAA 
AGGGGAGAAA 
TCTTCXIATTC 
ACTCAGGAOC 
GACATCQAGC 
GAaCCAOAOT 
GTTAAGGCAT 



6CCTTGCAGG 
CTOTTGGATC 
AOCTTTOCAT 
GGGAGGA606 
CTGAACTCCA 
ATCOAATTAC 
CA6GAGACAG 
A0CAA66CAT 
AAGGAAGACC 
CTTCCCCTOT 
ACTTCAACCC 
ATTACCAGTG 
CTTCACATCC 
CCATCATCAT 
GAAAITCTAA 
OCGAAQCAAT 
ATGAQACTGG 
CTGCXrCAOA 
TGGAGCTGGA 
CAGAOCCTGO 
AG 



I 

TCAGGTTAGT 
TAATGTCACT 
CCA6TGGTCT 
GAAGTGGCCA 
GATTTACTTT 
AGGGTCCAAC 
TGGAATTTAC 
CXn'CAACQTC 
AGAAACTGGC 
GTACTACTGG 
AACCACCGGG 
TACTGCCATC 
AGAA6TTGGA 
CTCTGTTGTG 
OACCATCGCG 
GCCAAGAGAA 
CCGTOATACC 
GCCTGCCCCA 
GCCAGAA AOG 
GGTTGTAGTT 



Seq ID nOt CI 4 6 DNA Sequence 

Nucleic Acid Accession ftt MM_003020.1 

Coding sequence! 29.. 664 



CGCTCCrOGG 
TGGCCTACTG 
CCCTGACOQG 
GGGCATTGOC 
CCAGAGCATT 
CCCCAACATC 
GGGGTACCCA 
AAACACCCCT 
TCCGGAACAT 
GAAGGQAGGA 
GQATAAT6TT 
OTAAASAQAA 
TGCACGT6TA 
TACAAAGCAG 
AAATTAGAAT 
TAAAAATTAA 
TQGM3TTTAA 
TTTGATTTTO 
GCTGTACTCA 
GCAAGCATTO 



11 
I 

GCTGCCCCTC 
TTTTGGCTQG 
GTCTCAGAAG 
AGGCCCCGAG 
GAAGGTGGAG 
GTGGCAGAGT 
GACCCTCCAA 
GACACTGCAG 
6ACTATCCAG 
GAGAGAOGAA 
GTTGCAAAGA 
GATGCTAGAC 
AATG6AGTGC 
CTGTAT6TA6 
AAGAGCTTTT 
AATGTGAATQ 
AATGOTaXCT 
ATTATCTAGT 
AOAGGAGGAG 
GC 



21 
I 

GGTTGACAAT 
CATCTGGAT6 
CAGATATCCA 
TGGAATATCC 
CTCATOAAGG 
TGACTGGAGA 
ATCGCT6TCC 
AGTTCAGTCG 
GCTTGGGCAA 
A6CGQAGGAG 
AGTCTGTCCC 
GAAAACCCAC 
CTGT6AATGA 
ATAGTGTATT 
TTGTTTCTTG 
TCAACAATAA 
GkAGGTTOTAC 
TCATCCAGCC 
CTGACACATT 



31 
I 

GGTCTCCAGG 
GACTCCAGCA 
GAGGCTGCTT 
AGCTCACCAO 
ACTTCAGCAT 
CAACATTCCT 
T6TTGGAAAA 
A6A6TTGCAG 
GTGGAACAAG 
TGT CAATCC A 
CCATTTTTCA 
ATTACCTGTT 
CAGCATGTTT 
GTCTTCACAC 
GOTTTTTAAA 
AAAGCAAGAC 
TATTTTGGCC 
CTTGGGCATT 
TCACTTGGCT 



41 

I 

ATGGTCTCTA 
TTTGCTTACA 
CATGGTGTTA 
GCCATGAATC 
TTGGGTCCTT 
AAGGACTTTA 
ACA6ATGATG 
TTGCACCAGC 
AAACTCCTTT 
TATCTACAAQ 
GATGAG6ATA 
AOOOCTCAGC 
CTTACATAGA 
CGATGATTCT 
ATGTGAATCT 
TATGAAAGGC 
AAGTCrOTAO 
GTTATACAOC 
6GQTCTTAAT 



SI 
I 

CCATGCTATC 
GCCCCCGGAC 
TQGAGCAATT 
TTGTGGGCCC 
TTGGCAACAT 
GTGAGGATCA 
GATGTCTAGA 
ATCTCTTTGA 
AOGAGAAGAT 
GACAGAGACT 
AGGATCCA6A 
AtOGCXTATO 
TAATTATGQA 
GCTTTTTGCT 
GCAATGATCA 
TCAGATTTCT 
AAAGCT8TCA 
AOTAAAGAAG 
AAACATGAAT 



Seq ID KOt C147 DNA Sequence 

Nucleic Acid Accession #i NM_024021.2 

coding sequence: 144.. 806 " 



1 
I 

AACATTCCTG 
CTTTGGAACA 
ATTCGAGCAC 
GG6CT6GCCC 
AAGGATTGCA 
TGACTGCCCT 
ATGGAAGTAA 
TTATTTCAGG 
GTA0TCTA6G 
CATTTAGCTT 
ATAATTQTCA 
GTCTGCTGGA 
GTACOCCTGG 
CTCCCACACC 
AATCTAT6CT 
TACTGATAGA 
AGAGAGATAA 
TTCACTGGCT 
GCAACAGCAC 
GCAAATAAQG 
OCAAAACACA 
AAAGTGCTTT 
CTCCTGCTTT 
OOkCSAQCTA 
TCAACTTATA 
AATTATATCT 
TTTTATTGCA 



11 
I 

CAAATGGTTT 
ACTTAAATAA 
CTTTTCTGCT 
TGGTGT6CCC 
AGAOAAOTTC 
GATGAGCCTT 
CCCTATTTCC 
ATCCTTGTCA 
AA TGAATAT C 
OGCGTTTTAT 
TGGGACTATG 
ATTCTGCATT 
TGGGG'l'ltSTG 
ACTTAATQAO 
GACTGTGACA 
CTTGTTGATA 
TAAATTCAAA 
CTTTATOGAG 
TTGAAAGTTT 
TTTGGAAGCA 
TACTTCCCTC 
TTCCTTCAGQ 
TCTTTCCCCA 
TCATTTQAQT 
TTTTCTATTT 
TAATCATATA 
GAGCAAAAAT 



21 
I 

CAATATATGC 
GTCAAATATA 
GCCATGACAA 
CAGCTGOOAA 
TTaAAGGQAQ 
AGCATGGGAA 
OTGTATATCO 
ATTGCAGCAG 
ACCAGCTCTO 
TCATTCCATC 
TCCATCTTAA 
GCTGTGTCCC 
TTAATTCTGC 
OmtlAGGCC 
CAAGA6CCTC 
TTATTATTAT 
ATTATGTTCT 
AGTACTAGAA 
TTCATTCATC 
GAAGAGCAAA 
CCATTTATTT 
AAGTGGAGAT 
TAGAAAGTAC 
AAAAGTATAC 
CATCTTCTTC 
T6QAAATGTG 
AAATCAAATT 



31 
I 

ASATGTCTOQ 
CTTGGAGCTT 
CCATGCAAG6 
ACATGGCTOT 
AACCCAAAOT 
TAACAATGAT 
GGTACACAAT 
GAATTA6AAC 
TACTGOCTGC 
ACCCTTACTG 
TGGGTCTGGA 
TCTCTGCCTT 
CATCACATTC 
ACCAAAAOAT 
ACATGAGAAA 
ATGTAATCCA 
CATTTTTTTC 
GTTAAATTAA 
ATAAGAACTT 
AAAAAGATAT 
AACTTTTTTT 
GCATGGCCAT 
CTTGAAGTAG 
ATQGAGTAAA 
CTTTCCCTTC 
CAACATATGG 
AGAAOCAATA 



41 
I 

ATATA6GAAT 
TAAAAATTAA 
AATGGAACAG 
CATACATTCA 
CCTTGGOOTT 
GTGTATG6CA 
TTGGGGGTCA 
TACAAAAGGC 
ATCAGOGATC 
TAACTACTAT 
TGGCATGGTO 
TGGATGTAAA 
TCACATGGCA 
CAACAOACAA 
TTACCAGTAT 
ATTATGAACT 
OCTGGAACTC 
TAAATAATGC 
TATATAAAGG 
TGTTAAAATG 
TTCTCCTACC 
CTCCCCCTCC 
CACAGTCCGT 
AATCATATTA 
TGGCAOCTTC 
TA'fTTQTTAA 
AAAAAAAAAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

lOBO 

1140 

1200 

1242 



51 
I 

GAAATTAOGT 
AAGGAGAGAG 
OCCATGCCAO 
CATCTGTGGA 
GTGGAGATTC 
TCTAATACTT 
OTAATGTTTA 
CTGGTCOGAG 
TTAATCAACA 
GGCAACTCAA 
CTCCTCTTAA 
OTGCTCTGTT 
GAAACAGCAT 
AT6CTCCAGA 
CCAACTTGGA 
GTGTGTGTAT 
AATAACTCAT 
ATTTAATGAG 
CATTACATTG 
AGGCCTCCAT 
TATGGGGACC 
CTTTTTCCTT 
CGTTGCATGT 
AGCATCAOAT 
TA CTGGQCA T 
ATA08TTT6T 
AAAAAAAAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1152 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1619 



1291 



wo 03/042661 PCT/US02/36810 



Seq ID NO: C148 DNA Sequence 
KucXeic Acid Acceesion ft: nm_002091.1 
^ Coding sequences 56.. 502 ~ 

1 11 21 31 41 51 

I 1 I 1 I I 

AGTCTCTGCT CTTCCCAGCC TCTCOSGOGC GCTCCAAGGQ CTTCCOGTOQ GGACCATGOG 60 

OGGCAQTGAG CTCCCGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGCXJC CCCGGGGGCX3 120 

10 AGCGQTCCCG CTGCCTGCGG GCGGAG6GAC CGTGCT G ACC AAGATGTAGC GGC6CGGCAA 180 

CCACTGG G OQ GTGGGGCACT TAATGGG6AA AAAGAGCACA GG6GAGTCTT CTTCTGTTTC 240 

TGAGAGAGGO AGCCTGAAGC AGCAOCTOAG AOAGTACATC AGGTGGQAAO AAGCT6CAAG 300 

GAATTTGCTG GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA 360 

GOCCTTGOQC AATGAQCAOC CTTOGTOGQA TTCAQAGOAT AOCAQCAACT TCAAAGATGT 420 

15 AGGTTCAAAA GGCAAAGTTQ GTAGACTCTC T6CTCX:aGGT TCTCAAGGTO AAGGAAGGAA 480 

CCCCCAGCrS AACCASCAAT GATAATGATO OOCTCT C TCA AAA6AGAAAA ACAAAACC CC 540 

TAAGAGACTG AGTTCTGCAA GCATCAGTTC TAOQGATCAT CAACAAGATT TCCTTGTGCA 600 

AAATATTTGA CTATTCTGTA TCTTTCATCC TT6ACTAAAT TOGTGATTTT CAAGCAGCAT 660 

^ _ CTTCTGGTTT AAACTTGTTT GCTGTQAACA ATTGTCGAAA AQAOTCTTCC AATTAATGCT 720 

20 TTTTTATATC TAGGCTACCT GTTGGTTAQA TTCAAGGCCC CGAGCTGTTA CCATTCACAA 780 

TAAAAGCTTA AACACAT 797 

Seq ID NOt C149 DNA Sequence 
Nucleic Acid Accession NM_0I2261.1 
25 Coding sequences 203.. 1045 

1 11 21 31 41 51 

I I I I 1 I 

GATTTGCTCT GCCAGCAGCT GTOGGTGCCG CGCTCOACAC CGAGTCCTAG CTAGGCGCTC 60 

30 ACAQAATAOG CGCTCCCTCC CTCTCCCTTC TCTGTCCCCC OCCTCTCGCT CACCCXXWCC 120 

CACTCCAGOG GCGACTTTGA GGGATTCCCT CTCTGGCGGC CTCTGCAGCA 6CACAGCCGQ 180 

CCTCATTCGG GGCACTGCGA GTAtGGATCT CCAA66AAGA GGGGTCCCCA GCATGGACAQ 240 

ACTTCGAGTT CTCCTGATGT TGTTCCATAC AATGGCTCAA ATCATGGCRG AACAAGAAGT 300 

GGAAAATCTC TCAGGCXTTTT CCACTAACCC TQAAAAAQAT ATATTTGTGG TGCGGGAAAA 360 

35 TGGGACQAOG TGTCTCAT66 CAGAGTTT6C A6CCAAATTT ATTQTACCTT AT6ATGTGT6 420 

GGCCAGCAAC TACGTA6ATC T6ATCACAQA ACROO OOGAT AT06C3VTTGA 0C0GGG6AGC 480 

TGAGGTGAAG G6CC6CTGTG 6CCACAGCCA 0T06GA0CTG CAAGTGTTCT G6GT6GAT03 540 

CGCATATGCA CTCAAAATGC TCTTTGTAAA GGAAAGCCAC AACATGTCCA AG6GACCTGA 600 

GGCGACTTGG AGGCTQAGCA AAGTX3CAGTT TGTCTACGAC TCCTCGGAGA AAACCCACTT 660 

40 OVAAGACXSCA GTCAGTGCTG GGAAGCACAC AGCCAACTCQ CACXACCTCT CTGCCTTGGT 720 

CACCGOOGCT GGGAAGTCCT ATGAGTGTCA AQCTCAACAA ACCATTTCAC TG6GCTCTAG 760 

TQATCGGCAG AA6A0GGTCA OCATGATCCT GTCTGOQOTC CACATCCAAC CTTTTGACAT 840 

TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TAAATQCCCA GTGGATGAGC GGGAGCAACT 900 

GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTOOTCATCA TGGTAACACT 960 

45 CGCGATTTAC CAOOTGCACG ACAAAATGAC TGCCAAGCAO GTGCAGATCC CTCGGGACAO 1020 

ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGO CSVCCCCCTAT TCCTGCTCOC 10 80 

CCAACT6QAT CAGGTAOAAC AACAAAAOCA CTTTTCCATC TTQT ACAO OA GATACACCAA 1140 

CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGGCTTGC TTGGCTTGTO TCCAT6CTTA 1200 

AACCCACGQA AGGGGGAGAC TCTTTOOQAT TTGTAGQGTO AAATQGCAAT TATTCTCTCC 1260 

50 ATGCTGGG6A GGAGGGGAGG AGGGTCTCAG ACAG CTTTC Q TGCT CATGGT GGCTTGGCTT 1320 

TGACTCTGCA AAQAGCAATA AATGCCACTT GGAGCTGTAT CTGOCCCCAA AGTTTAGGGA 1360 

TTGAAAACAT GCTTCTTTGA GGAGGAAAGC CCTTTAGGTT CA(3UGAATA TGGGGTGCTT 1440 

TGCTCCCTTO QACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 1500 

CC TCATG CTOCC TGCAGCAAGA CCXXTTGAAAG TQATTCATGC TTCTGGCTGG CATTCTGCAT 1560 

55 OTTTAGTQAT TGTCTTGGGA ATGTTTCACT GCTACCOGCA TOCAG06ACT GCA6CACCAG 1620 
AAAACX3ACTA ATGTAACTAT GCAGAQTTGT TTGGACTTCT TGCTQTQCCA GGTOCAAOTC 1680 
GGGGGACCTG AAGAATCAAT CTGTGTQAGT CTGTTTTTCA AAATQAAATA AAACACACTA 1740 
TTCTCTGGC 1749 

60 Seq ID NOt C150 DMA Sequence 

Nucleic Acid Accession #: MM_003226.1 
Coding eequencet 2.. 226 

1 U 21 31 41 51 

65 1 I I I I I 

GATGCTGG6G CTOGTCCTGG CCTT6CT0TC CTCCAGCTCT GCTGAGOAOT AGGTGG6CCT 60 
OTCTGCAAAC CAGTGTGC06 TGCOQGCCAA GGACAGGGTO GACTGOGQCT ACCCCCATGT 120 
CACCCCCAAG GAGTGCAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC IBO 
TTGOTGTTTC AAGCCCCTGA CTAGGAAGAC AQAATGCACC TTCTGAGGCA CCTCCAGCTQ 240 
70 CCCCIGGGAT GCA6GCTGAG CACCCTTGCC C6GCTGTGAT TGCTGCCAGQ CACTGTTCAT 300 
CICAGTTTTT CTGTCCCTIT GCTOCOGGCA A6CTTTCTGC TGAAAGTTCA TATCTOGAOC 360 
CTGATGTCTT AACGAATAAA GGTCCCATGC TGCACCOG 398 



75 



Seq ID NO: CI 51 DNA Sequence 

Nucleic Acid Accession it MH_002993.1 

Coding sequence t 64.. 408 



«^ 1 11 21 31 41 51 

80 I 1 I I I I 

GGCAOQASCC AGTCTCGOCG CCTGCACCCA OCTCAGOAAC CGGCOAACCC TCTCTTOACC 60 

ACTATOAGCC TCCOOTCCAO CGQGQCQGCC O G TGTC C OQG OT C CTTO GG Q CTCCTTGTGC 120 

GOGOrOCTOG COCTGCTGCT CCTG CTOA OQ COOCOOGQOC CCCTOGCCAO OQCTQOTCCT 180 

OTCT C TGCTO TGCTGACAOA GCTQCQTTGC ACTTGTTTAC GGGTTAGQCT GAGA6TAAAC 240 



1292 



wo 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CCCAAAAOGA 
GTGGTA6CCT 
AAOAAAOTCA 
ACCATGCATC 
CAOTAAGAAT 
GAAGAGT6TG 
CIMVTATAGT 
CAATTQACCA 
TGAAGATAAC 
ATTTCQTATG 
ACTCACTCTT 
TCTTTACCCT 
TTATCTGTGC 
CTAATATATT 
CAT6ATTTAC 
ATTCTGGTCA 
TGATT6CTAA 
AAT6ATCT6T 
CATTTAGTCC 
TTTAAAGGTT 
AAATTGCACT 
TGOAGAAACA 



TTGGTAAACT 
CCCTGAAGAA 
TCCAQAAAAT 
ATAAAATTGC 
AAQAAGGAAO 
GG6GAAAGCC 
ATTTCCACTA 
TATTGTGAOC 
TATTQTATTT 
GAAATAATOT 
CTCATAAAAT 
A6QATGCTAT 
AGAATATATT 
CTCTTCCTAT 
TCATTAAACT 
CTAAATATAC 
TTTACATAGA 
GCTCTGCAAA 
TCAAAATATA 
TTGACCATTT 
TTTATTTTTT 
ATAAAAGATT 



GCAOGTGTTC 
CGGGAAGCAA 
TTTQGACAGT 
CCAGTCTTCA 
GGTTGGTTTT 
TAOGCTTCTC 
TTTACTOTTA 
AAAGAATCAC 
CTATCATACA 
TTTAT7AGTG 
AGGAAATATT 
TXAA6TTGTA 
TCCTTATTCA 
GGTTTTAGAT 
TTGATTTTQT 
ACTTTA6ATA 
AATGTATTCT 
GTTTTGAAAA 
TACAGCATTG 
TGTTATGAGG 
CCTGTGTGTC 
TCTAAACCAA 



OCOQCAGGCC 

GGAAACAAGA 
GOGOAGCAGT 
TTTCCATTTT 
CCTQAAGTTT 
TTTTACCTOA 
TGGTTATTAG 
TTCCTTAAAO 
TQCTGTTGAQ 
TTAOTTCTGT 
CTGTATTAGA 
GAATTTCTAA 
GTTTQATGTC 
ATGCTATTTT 
OATGAAGAAG 

ci T O c rrmT 

TATATTTGAA 
CTAAGATTTT 
AATTATACAT 
ATGTTGOTTT 
AAAAAAAAAA 



06CAGT6CTC 
AOCGGGAAGC 
AAAACTQAOT 
TTTCTG6AGA 
CTACATGQAT 
ACA6CTCA6C 
TAAGTTATTO 
TCTTTCAATG 
TCTTACOGAA 
GQAGGTATCC 
TTTCTTGGGO 
ACACTGGGTG 
AAATTTAAGT 
TTCTTAGTAT 
TTCACTATAO 
CCCAAAAACA 
TAAATAAAAG 
CAATTTGAAT 
CAGATATCTA 
GTATCACATT 
TTGGTACTT6 
AAAAAAA 



CAAGQTQGAA 
CCCTTTTCTA 
AACAAAAAAO 
TCCCTGGACC 
TCCCTACTTT 
TAATGAAGTA 
AACCCTTTGG 
AATATTQAAT 
AAGGCTGTGG 
TGTTGTTCTT 
AATATGTTAC 
TGTCATACOO 
TCTGTAAGGG 
GGCATAATOT 
QATGACTATA 
GATAAATTCC 
CAAAATTAAC 
ATAAATTCAT 
TTGTGGATCT 
CACTATATTA 
TATTGTCATT 



8eq ID KOt C152 DNA Sequence 

Nucleic Acid Accession ft: NH_005242.2 

Coding sequence : 1 4 8 .. 13 4 1 



TTTCTCTCGG 
GCGCC06GC0 
GCCGCCATCC 
TCCTCTAAAG 
GQAOTTACAO 
AAACTGACCA 
AQTAACGQCA 
ATTTACATGG 
ATTQCCTATC 
ATTOGCTTTT 
ChQAGQTATT 
ATTGGCATCT 
GTGAAGCAGA 
6AGCAGCTCT 
CTGTTCCCAQ 
TCTOCCATGG 
QTCCTGGCCA 
TTTCTGATTA 
CTCTCTACCC 
AGGGATCATG 
GTATCCCTCA 
ACTGTTAAGA 
TGQAACXTOT 
ACATACCACC 



11 
I 

TGGG6AGG0Q 
TGGQTCCAGT 

TGCTAGCAGC 
GAAGAAGCCT 
TTOAAACAOT 
OQQTCTTCCT 
TGQCCCTGTG 
CCAATCTGGC 
ACATACATGC 
TCTATGGCAA 
GGGTCATCQT 
CCCTGGCAAT 
CCATCTTCAT 
TGGTGGGA6A 
CCTTCCTCAC 
ATGAAAACTC 
TGTACCTGAT 
AOAGCCAGGG 
TTAACAGCTG 
CAAAGAAG6C 
CCTCAAAGAA 
CCTCCTATT6 
TTAATGTTAT 
G 



21 
1 

0GCA6CAGAG 
6QAGCTCTGA 
CAGGAGGATG 
CTCTCTCTCC 
TATTGGTAAO 
CTTTTCTGTQ 
TCCAATTGTC 
GU 'lVrnXTAT 
CTTGGCTGAC 
CAACAACTGG 
CATGTACTGT 
GAACCCCATO 
ATGGCTGCTG 
TCCTGCCCTG 
CATGTTCAAT 
AGCCTCTGCX! 
AGAGAAGAAA 
CTGCTTCACT 
CCAGAGCCAT 
CATCGACCCC 
TCTCCTTTGC 
ACACTCCAGG 
AGTTTTCCAO 
GAOGAOOTCrr 



31 
I 

6CTOCX3ATTC 
GTTTOGAATC 
OGGAGCCCCA 
TGCAGTGGCA 
GTTGATOGCA 
GATGAGTTTT 
TACACAATTG 
TTCC6AACTA 
CTCCTCTCTG 
ATTTATGGGG 
TCCATTCTCT 
GGGCACTCCA 
ATTCTGCTGG 
AACATCACGA 
TACTTCCTCT 
TATGTGCTGA 
AG6AAGAGGG 
CCTAGTAACC 
GTCTATQCCC 
TTTGTCTATT 
OGAAGTGTCC 
AAATCCAGCr 
GTCCTC ftOAT 
CTQTTATTTC 



41 
I 

GGGGCAGGTG 
GGT6G0G60G 
OOGOGGOSTO 
CCATCCAAGG 
CATCCCAOCT 
CTGCATCTGT 

A6AAGAAGCA 
TCATCTGGTT 
AAGCTCTTTO 
TCATOACCTG 
GQAAGAAGGC 
TCACCATCCC 
CCTGTCATGA 
CTCTGGCCAT 
TGATCAGAAT 
CCATCAAACT 
TTCTGCTTGT 
TGTACATTGT 
ACTTTOTTTC 
GCACTGTAAA 
CTTACTCTTC 
GGQAATTGCA 
CTAATCAAAA 



51 
I 

AGAGGCTGAC 
GATTCCCOGC 
6CTGCTGGGG 
AACCAATAGA 
CACTGGAAAA 
OCTCACTOQA 
G6GTTTGCCA 
0CCTGCT6TG 
CCCCTTGAAG 
TAATOTOCTT 
GCTCAaTGTG 
AAACATTGCC 
TTTQTATOTC 
TGTTTTGCCT 
TGGGOTCTTT 
GCTGCGATCT 
CATTGTCACT 
GGTGCATTAT 
AGCCCTCTGC 
ACATGATTTC 
6CAGATGCAA 
AAGTTCAACC 
CAGTAGGATG 
AGGTCTCAOC 



Seq ID VOt C153 DNA Sequence 
Nucleic Acid Accession #e NM_003469.2 
Coding sequence t 92 . . 1945 



GAAAOGQCCC 
CATATAAACA 
AGCAGCCCTG 
TCAGAGAAAC 
TCCCAGTCCT 
TAAGGAAGAA 
AAAAGAAAAT 
GATGAGAATA 
AGAAAATAAG 
TTATGAGACA 
TGAAGAGAAT 
TACTCCTCAA 
ACCAAACAAC 
AGATGATATC 
QAACCCAGTA 
GAATATAGGA 
CCAGGAAGAA 
AATTGCCTAT 
AAATQGGGAA 
GCrOATTGAA 
AACTOGOQAO 
CCTAGATQAC 



11 

I 

GAGAAGCTCG 
AAAAGAGGAA 
TCTCTTATCC 
CAGCTGCTTC 
GAAATGATCA 
AGCAGCCCAG 
GGCQATQAAA 
ATACTOGAAG 
CCCTATGCCT 
CAGCAOTGGC 
TCCAGGGATA 
AGCCTTGCTA 
CAGAAACGTG 
TACAAGGCTA 
GAGGAGAAAA 
AAAAATGAAC 
GATCTTCGGA 
TTGAAAAGGT 
AOGGCCACCA 
ATCTCAAGGA 
AAGCGQAATO 
ATCIGAGAGG 



21 

I 

CCCXX3AGAAC 
ATCTTTCAAA 
CTTTAATTTT 
AGAAAGAACC 
GGGCTTTGGA 
ATTATAATCC 
GCCACTTGCC 
CTTTQAGACA 
TGAATTCAGA 
CAGAAAGAAA 
ACCCCTTTAA 
CATTGGAATC 
AGAGGATGGA 
ATAACATTGC 
TAGAGAGTCA 
AAATCAACGA 
AAGAOAGTAA 
TAGTAAATGC 
GGCTTTTTGA 
ATTTACAGAT 
GATCAGTGGA 
CT6ACTTAGA 



31 
I 

GGGGAOGAAT 
CATGGCTGAA 
CCTCATCTCT 
AGACCTCAGG 
GTACATAGAA 
CTACCAAGGT 
CX3AGAGGGAT 
GGCTGAAAAT 
AAAGAACTTT 
GCTTAAGCAC 
AOGCACAAAT 
T6TCTTCCAA 
TGAGGAGCAA 
CTATGAAGAT 
AACCCAGQAA 
TGAGATGAAA 
AGACCAACTC 
TQGAGGAAGT 
GAAACCrCTT 
ACCCCCAGAA 
ACC6GAG0QG 
CCATCCAGAC 



41 

I 

ATGCTGTGGA 
GCAAAGACCC 
GGGGCTQAAG 
TT6GAAAATG 
AACCTCOGAC 
OTCTCTGTCC 
TCACTGA6T0 
GAGGCTCAGT 
CCAATGGACA 
ATGCAATTCC 
GAAATAGTGG 
GAGCTGGGGA 
AAACTTTATA 
GTGGTCGGGG 
QAGGTGAGAa 
OGCTCAGGGC 
TCAGATGATG 
GGGAGGTTAC 
GATTCTCAGT 
GACTTAATTO 
GAGCTTGACC 
CTOTTCCAAA 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

lOBO 

1140 

1300 

1260 

1320 

1360 

1440 

1500 

1547 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1451 



51 
I 

GCTCCrCTGC 
ACTGGCTTGG 
CAGCTTCATT 
TCCAAAAGTT 
AACAAGCTCA 
CCCTTCAGCA 
AA6AA6ACTG 
CTGCACCAAA 
TGAGTGATGA 
CTCCTATGTA 
AGGAACAATA 
AACTQACAGG 
OGGATGATGA 
GAGAAGACTO 
ACAGCAAAGA 
AGCTTGGCAT 
TCTCCAAAGT 
AGAATGGGCA 
CTATTTATCA 
AQATOCTCAA 
TTCCTGTTGA 
ATAGGATGCT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



CTCCAAGAGT 
GCTCAGT6TT 
GTOGTATTTT 
TQOAAGATCT 
ACAGATGGCA 
GATGCTAGTT 
AGGCTCATCT 
TTTGAATCAA 
TQTGGG6GCC 
AATGAAAGTG 
TAA6AGAGCA 
OVCCCCAAGC 
TAATATCTTT 
TCCTCTQAGC 
ATTTATTATG 
GGCT6TGGCA 
TTTTTAATAT 
AAAAAAATAA 



GGCTACCCTA 
QAGGATATTT 
CCCAATCCAT 
AOATOOAACC 
TAT6AAAACC 
AAATACCCTG 
GAAGATOACC 
GGCAGCTCTC 
GOQAAGAATG 
CTGGAATAGC 
AT66AAAATA 
AAATCCCAAC 
AAATGATGTA 
TGTTATCTTG 
TCCATTATTC 
TTGTTGATQC 
TTATTGAATT 
AGCATTATAA 



AAACACCTGO 
TAAATCTTTT 
ATAACCAGGA 
A6CTTCCCAA 
T6AA0QACAA 
AGATCATTAA 
TGCAGGAAGA 
AGGAGACTGA 
ATOATACCCC 
TCAATCAAGA 
TGTAA6CTGC 
ATTTCTCTTC 
CAGGCAGATG 
TQTATGGATA 
AAGAAAGATA 
TCAGATAT6A 
ATTTTGTTAC 
ATATA 



TOCTGCTGGG 
AGGGATGOAO 
GAAAOTTCTG 
AGCTGCCTGO 
GGATCAAGAA 
TTCAAACCAA 
GGAACAAATT 
CAAGCTG6CC 
AAATAGGCAG 
AAAGGCAGAA 
TTTCATTAAT 
AGTGTGTTQA 
AAACCAGGTC 
TGTGTAAATG 
7CTATGACTG 
TAAAAAAGT6 
TGTCTOTAGC 



ACTGAGGCCC 
AGTCCAGCAA 
CCAAGGCTCC 
ATTCOICATG 
TTAGGTQAOT 
GTG71AGCX3AG 
GAGCAGGCCA 
CCXjGTGAGCA 
TACTGGGAT6 
AAGGGAAGGG 
TACCCTACTT 
CTTCTATCCT 
ACT6GCX3AGT 
TTATGACTCC 
TGTTTAATAG 
TCCTATAATT 
GTTTTGTrGQA 



TACCAGACGO 
ATCAGAAAAC 
CTTATGGTGC 
TTGAAAACA6 
ACTTGGCCA6 
TTCCTGGTCA 
TCAAAGAGCA 
AAAGGTTCCC 
AAGATCTQTT 
AGCATATTQC 
TCATTCCTCC 
GTTAACACTG 
CTGCTTCATT 
TTGATAAAAA 
TATATCTAAT 
CTATTGAAAG 
OTACTGGACC 



1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2365 



Seq ID NO: C154 SUA Sequence 
Nucleic Acid Accesalon #t N}l_030955 
Coding sequence i 327.. 5108 

1 11 21 31 41 51 

i I I t I I 

GAATTCCGGG AGCGGGCGGO CT6C6AG6CC GOGQQGCATO GGGQAG60G0 AGGGOTOGQA 
COGGGTGGCT GCGCCCATTC CACACCCGCC GAAA6GG6AC ACTGTCAGCT QAATCACTGC 
CXTTTTTAGGA GGAGGGAGGG GGAAAAGOTG TCTAGCTAAT TTCTGCTTAA AAAAGCACA3 
GAGATCXXZGG GTCAGCTTTG CAGTCX3CTGC CTTCTCGCXX: CTGACCATGC ACCCCTGCAT 
CTTCCTGCTO GGCACA66CG AGCX3CTTTAT TTCTGQAGCT GAGGGCTAAA ACTTTTTTCA 
CTTTTCTTCT GCTCAACATC TGAATCATOC CAT6TGCCCA OAGGAGCTGG CTTGCAAAOC 
TTTCOGnrOGT GQCTCAGCTC CTTAACTTTG GOQ00CTTT9 CTATGG6AGA CAGCCTCAGC 
CAGGCCCGGT TCGCTTCCCG GACAGGAGGC AAGAGCATTT TATCAAGGGC CTQCCAGAAT 
ACCACGTGGT GGGTCCAGTC CGAGTAGATG CCAGTGGGCA TTTTTrGTCA TATGGCTTGC 
ACTATGCCAT CA06A6CAGC A6QAGGAAGA GAGATTTGGA TGGCTCAGAO GACTGGGTGT 
ACTACAGAAT TTCTCAOGAG GAGAAG6ACC TGTTTTTTAA CTTGAOGGTC AATCAAGGAT 
TTCTTTCCAA TAGCTACATC ATGGAGAAOA GATATGGGAA CCTCTOCGAT GTTAAGATOA 
TGGCTTCCTC TGCCCCCCTC TGCCATCTCA GTG6CACGGT TCTACAGCAO GGCACCAGAQ 

TTGGQACGac agcxx:tcagt gcctgccatg gactgactgg atttttccaa ctaccacatg 

GAGACTTTTT CATTGAACCC GTGAAGAAGC ATCCACTGGT TGAGGGAGGG TACCACCCGC 
AC ATOGTT TA CAGOAGGGAG AAAGTTCCAG AAAOCAAGGA GCCAACCTGT GGATTAAAGG 
ACA6TGTTAA CATCTCCCAG AAGCAAGAGC TATGOGGGGA GAAGrGGGAG AOGCACSIACT 

tgccaagcag aagcctctct cggcgttcca tcagcaagqa gagatgggtg gagacactgg 
tgotggccga cacaaagak3 attgaatacc atqggaotga gaatgtggag tcxttacatcc 
tcaccatcat gaacatggtc actgggttgt tccataaccc aaocattggc aatqcaattc 
acattqttgt ogttgqgctc attctactcq aagaagaaga gcaaggactg aaaatagttc 
accatgcaga aaagacacto tctagcttct 6caagtggc9v gaagagtatc aatcccftaga 

GTGACCTCAA TGCTGTTCAT CAOGAOGTGG CTGTCCTTCT CACCAGAAAG GACATCTGTG 
CTGOTTTCAA TCGCX;CCTQC GAQACCCTGa QCCTQTCTCA CCTTTCAGGA ATGTGTCAGC 
CTCACC9GC3U3 TTGTAACA7C AATGAA6ATT GGGGACTCX!C TCTG6CTTTC ACAATTGCCC 
ATOAGCTSUSQ A CACAO CTTC GGCATCCAGC ATQATGQGAA AQAAAATGAC TQTGAOCCTG 
T6GGCAQACA TCOGTACATC ATGTCCGGCC A6CTCCAGTA 08ATCCCACT CCGCTGACAT 
GGTCCAAGTQ CAGCGAGGAO TACATCACCC GCTTCTTGGA CCGAGGCTGG GGGTTCTGTC 
TTGATGACAT ACCTAAAAAG AAAGGCTTGA AGTCCAAOGT CATTQCCCCC GGAGTGATCT 
ATGATGTTCA CXACCAGT6C CAGCTACAAT ATGGACCCAA TGCTA CCTTC TGCCAGGAAG 
TA6AAAACOT CTGCCAGACA CTOTGGTQCT COQTGAAGGG CTTTTOTCGC TCTAAGCTGG 
AC6CTGCTGC A6ATGGAACT CAATGTGGTG AGAAGAAOTG GTGTATGGCA G6CAAGTGCA 
TCACAGTGGG GAAGAAACCA GAGAGCATTC CXGGAGGCTG GGGCCGCTGG TCACCCTGGT 
CCCACTGTTC CAGGACCTGT GGGGCTGGAG TCCAGAGOGC AGAGAGGCTC TGCAACAACC 
C OQAG CCAAA GTTTGGAGG6 AAATATTGGA CTGQAGAAAG AAAAOGGTAT CX3CTTGTGCA 
AOGTCCACOC CTGTOQCTCA GAOGCACCAA CATTTOGGCA GATGCAGTGC AGTOaATTTO 
ACACTGTTOC CTACSU^GAAT OAACTCTACC ACTGOTTTCC CATTTTTAAC CCAGCACATC 
CTTGTGAGCT CTACTGCCQA CCCATAGATG GCCAGTTTTC TQAGAAAATO CTGGAT6CTG 
TCATTGATGG TACCCCTTGC TTTGAAGGCG GCAACAGCAG AAATGTCTGT ATTAATGGCA 
TATGTAAGAT G6TTGGCTGT GACTATGAQA TOQATTCCAA TGCX3VC0QAG GATOQCTGOO 
GIGTOTGCCT GGQAGATGGC TCTTGCTGCC AGACTGTGAG AAAGAT6TTT AAGCAQAAGG 
AAGGATCT6G TTATGTTGAC ATTGGGCTCA TTCCAAAAGG AGCAA6GGAC ATAAGAGTGA 
TGGAAATT6A GGGAGCTGGA AACTTCCTGG CCATC3M5GAG TGAAQATCCT QAAAAATATT 
ACCTGAATGO AGGGTTTATT ATCCAOTGGA ACGGGAACTA TAAGCTGGCA GGGACTOTCT 
TTCAGTATGA CAGGAAAGGA 6ACCTGGAAA AGCTGATQGC CACAGOTCCC ACCAATGAGT 
CIGTGTQGAT CCAGCTTCTA TTCCAG6TGA CTAACGCTGG CATCAAGTAT GAGTACAC3UI 
TCCA6AAAGA TG6CCTTQAC AATGATGTTG AGCAQAT8TA CTTCTGGCAO TAOGGGCACT 
GGACAGAGTG CAGTGTQACC TGOGGQACAG- GTATCCGCOG CCAAACTGCC CATTGCATAA 
AGAAGGGCCQ CGGGATGOTG AAAGCTACAT TCTGTGACCC AGAAACACAG CCCAATGGGA 
GACAGAAGAA GT6CCAT6AA AAG6CTTGTC CACCCAGGTG GTGGGCAGGG GAGTGGGAAG 
CATGCTOGQC GACATGOGGO CCCCAOQGGG AGAAGAAGCX3 AAC0GT6CTG TQCATCCAGA 
CCATGGTCTC TOAC36AGC3W3 6CTCTCCOGC CCACAGACTG CCAQCACCTG CTOAAOCCCA 
AGACCCTCCT TTCCTGCAAC AOAOACATCC TQTGCCCCTC GGACTGGACA GTGGGCAACT 
GGAGTGAGTG TTCTGTTTCC TGTGGTGGTG GAGTQCGGAT TOGCAGTOTC ACATGTGCCA 
AGAACCATGA TGAAGCrTGC GAT0T6ACAA QQAAACCCAA CA600GAGCT CT6TGTGGGC 
TGCAGCAATG CCCTTCTAGC CX3GAGAGTTC TQAAACCAAA CAAAGGGACT ATTTCCAATO 
GAAAAAACCC ACXAACACTA AAGC0CX3TCC CTCCACCTAC ATCCAGGCCC AGAATGCTGA 
CCACAOCCAC AOGGCCTQAO TCTATGAGCA CAAGCACTCC AGCAATCAGC AGCCCTAGTC 
CTACCACAGC CTCCAAAGAA 6GAGACCT6G GTGGGAAACA GTOGCAAGAT AGCTC3UVCCC 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2680 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 



I 
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AACCTQAGCT 
CTTCOCAATC 
CTACCT06G/V 
GCAACCCTAT 
TGGAQATTCA 
ATCCTGTAAT 
AAATGCCACT 
TCAGCACAGT 
CACCCAGAGT 
GAGACCACOV 
ACAACATGAA 
T6ATTACTGA 
GCTCTGCACA 
CCTACTGGAA 
GACCTGACCC 
GGAOCAAGTO 
ACAGC0GG6A 
CCCCATTGAG 
GCCAGTGCTC 
GCCTCTGTGA 
GTCACT6G6C 
AGAGGATTGT 
TATGTQATCA 
AAAGTGCCGA 
AAGCCATGAA 
AGACACACAT 
AACTCTAAGC 



GAGCTCTG6C 
CTTQAQCATT 
GGGAGGCCTT 
CACTTGGCCT 
CAGTGGCTCA 
ATGOACCAAG 
TGCACCTCCA 
AATGGAAGGA 
TGAQGQGATG 
GCCAGAACCC 
CGAAACAAAA 
GUU C m 'TTG 
CTGGATOGTC 
AAGGGTGGAG 
TGCAAAAAGA 
CTCGAGAAAC 
CCACX33GAAC 
CATGAGCTGT 
CAGGTCCTGT 
TTGGACAAAA 
CACTGGGAAC 
CCAATGTGTG 
CAAACCCAGA 
TTTACTTTGC 
GAAATGTTCT 
CACACACACC 
CCAAA 



TATCTCATTT 
CAGCCAAGTG 
GTAGCTACAA 
GTGACTCCAT 
6GGGAAGAAA 
ATCAOAOTAC 
CTAACACCAO 
CTGCTCCCCA 
GTTACTOAAA 
TCA6GAAAGA 
AGTTCTQAAC 
CTAAAT6CCT 
GGAAACTGGA 
TGCACX3VCCC 
TGCCACXnTCC 
TGCAGTGGGO 
CT6AGGCCAT 
AACCOGGAGC 
GGAGGTGGAG 
AGACCGACAT 
TOGGACCT6T 
CCCTCAGA6G 
CCTCCAQAAT 
ACTTIAGGACA 
GTGCCCACOG 
CAAAGGCAAA 



GCACTGGAA6 
AGGAAAATGT 
CAACAAQTGG 
TTTACAATAC 
GAGAACAGCC 
CTGGAAATQA 
ATCTCAGCAG 
GCCAAAGGCC 
AGCCAGCCAA 
CG6CAAA0CG 
CAGTCCTQAC 
CCAATTACAA 
G06AGTGCTC 
AGATGGATTC 
GTCCCTGTGC 
GCTTGAAOAT 
TTCACIGCCA 
CCTGTQAGOC 
TTCAGGAQAG 
CCACCATGTC 
GTTCCACTTC 
GCAATAAAAC 
TCAAAAAATG 
AACTGTCAGC 
TGAGGGGTGA 
GAAGGCAAOG 



CACTTCOCAG 
TTCCAGTTCA 
TTCTGGCTTG 
CTTGACCAAA 
TGAGGACAAA 
OGCTOCAGTO 
GGAGTCCTGO 
CACTACTTCC 
CACTCTGCTC 
TAACCACCTG 
TQAGOAOaAT 
GCA6CTCACA 
CACCACATGT 
TGACTGTG06 
TGGCrGQAAA 
AGGGGAGATT 
GTTCCTGGCC 
OTGGCAGGTG 
AGGAGTGTTC 
TTGCAATQAG 
CT6TG6AGGT 
TGAAGACCAA 
CAACCAGCAG 
CAGTTTCTGC 
GTGCTGCTTC 
GTTGCTCCAA 



OOCATCCTCA 
GATACTOOTC 
TCATCTTCCC 
GGTCCAGAAA 
GATGAAAGCA 
OAAASTACAO 
T060CA0CCT 
GAAACTGGQA 
CCTCTGGGAG 
AAACTTCCAA 
GCAACAAGTC 
AACG6CCA06 
GGCCT6GGGG 
GCCATCCAGA 
GTGGGAAACT 
CA0T6O6T6G 
GGCATTCCTC 
GAGCCTTGGA 
TGTCCAGGAG 
CACCTOTGCT 
GGCTTTCAGA 
GACCAATGTC 
GCCTGCAAGA 
CAGACACT6A 
TCGTOTCCCC 
AAGTCAAAA6 



3600 
3660 
3720 
3780 
-3840 
3900 
3960 
4020 
4080 
4140 
4300 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
SlOO 
5115 



Seq ID NOx CIS 5 DNA Sequence 

Kuclelc Add Accession #t NM_001062.1 

Coding sequence i 76.. 1360 



GCTCTCATTA 
TACACIGTT6 
TCTTTTATTC 
CTAAAACCTC 
AATQTTOTGT 
ATCCAACauVA 
GCCTTQATTA 
TAOCACCTQA 
CACAATGGCA 
CTGTTCAATG 
AACTATTATT 
ACCrGTOTOA 
AACATCAGTA 
GGTCTCATTG 
GACTATTATA 
ATTTCTCAAG 
GGAAAGACCT 
AACATCTCOG 
OTCAATTACT 
TCTGTCnTCC 
ACAATOQAGG 
AATAAT6ACA 
GGTAGTTACX2 
GCCCAAACTT 
TTATGCCTTC 
TCTCTACATO 



11 

1 

CCTTCTGCCC 
GAGAGATGAG 
CAAGCCAACT 
TGTTGAATAC 
TGTCCCTCAA 
TCAAATACAA 
TACTG6CTTT 
CTGACAAGCT 
CTCCCCTGAC 
GGAACTACTC 
TT66TAGCCA 
AGAAOAOTCT 
TTTATACAAA 
QAAACACATT 
ATGAAAATGA 
GAGCATTCAG 
TCTTQGATAT 
CTGATGAGCC 
CTOTGAGAAT 
TCAQTGTGAT 
AGCGCTCATG 
GAACCTACTG 
TTGTCCGCAA 
TCCTCAGCTG 
TTCTTCATTT 
TTCAATAAAA 



21 
! 

ATCACTTAAT 
ACAGTOICAC 
ATGOGAGATT 
AATGATGCAO 
ACTTaTTQGA 
TGTGAAAAGC 
GGGAOTATQT 
A6AAAATAAA 
TAACTACTAC 
AACOGCGQAA 
6TTCTCAGTA 
AATAAATQGG 
GTCACTGGTA 
TAGCACAGQA 
CTGGAATTGC 
TAATCCAAAC 
TAACAAAOAC 
TATAACTGTG 
CAATQAAACA 
GGAGAAAGCC 
GGGGCOCTAT 
GGAACTTCTG 
TGGAGAAAAC 
CATAAAATCC 
ATCCCAGTAC 
GTTGTTOAAA 



31 

I 

AAATAGCCAQ 
CAGCTGCCCC 
TGTGAGGTAA 
TCAAACTATA 
ATCXAGATCC 
AGATTGTCAG 
CQTAA06CT0 
TTGCAAGOVG 
CAGCTCAGCC 
GTTGTCAACC 
GATACTGGTG 
CAGATChAAO 
GAAAAGATTC 
OAAGCCATGC 
CAACAAACTC 
G CTGCAGCC C 
TCTTCTTGOO 
ACAGCTCCT6 
TATTTCACCA 
CAGAAAATGA 
ATCACCTGTA 
A0TQQAG60G 
TTGGAGGTTC 
ATTTGCAGTG 
GAGCAGGAGA 
GATIAAC 



41 

I 

CCAATTCATC 
TAGTGGGGCT 
GT6AAGAAAA 
ACAGGGGAAC 
AAACCCTGAT 
ATGTAAGCTC 
AOGAAAACTT 
AAATTGAAAA 
TGQA06TTTT 
ACTTCACTCC 
CAATGGCTGT 
CAOATGAAGQ 
TQTCTGAGAA 
AGGCGCTCTT 
TGAATACAGT 
AGGTCTTACC 
TCTCTGCTTC 
ACTCACAATC 
ATGTCACTGT 
ATGATACTAT 
TTCAGQGCCT 
AACCACTGAG 
OCIGQAOCAA 
GAGTTCCATG 
GTTAATAACC 



51 

1 

AACATTCTGG 
CrrACTGTTT 
CTACATC06C 
CAG06CTGTC 
GCAAAAGATG 
GGGAGAGCTT 
AATATATQAT 
TAT6GAAGCA 
GGCCTTGTGT 
TGAAAATAAA 
CCTGGCTCT6 
CAGTTTAAAa 
AAAAGAAAAT 
TOTATCATCA 
GCTCACGGAA 
TGCCCTGATO 
AG6TAACTTC 
ATATATCTGC 
6CTAAATGGT 
ATTTGGTTTC 
ATGTGCCAAC 
GCAA6GAGCT 
ATACTAATAA 
TTTATTGTCC 
TCCCCTTCrC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1537 



Seq ID NOi CI 56 DNA Sequence 
Nucleic Acid Accession Ut KN_00459l 
Coding sequence: 59.. 34 9 ~ 



1 

I 

CACTCCCAAA 
GTGCTGTACC 
CGGOGAATCA 
TCATCCTAAA 
TGCTATCATC 
GGTGAAATAT 
TTTCTGGAAT 
TTCACTTGGA 
CAATTAATGA 
6TTAAACT6T 
TAATTTTCCA 
AAGATTATAT 
CrmOTTTA 
CATTAATAAG 



11 

I 

GAACTGGGTA 
AAGAGTTTGC 
GAAGCAGCAA 
TTTATTQTGG 
TTTCACACAA 
ATTGTGCGTC 
GGAATTGGAC 
CATCATGGAO 
AOTTGATTCA 
ATTTTATGTT 
TAAGCTATTT 
GGACTTTCTT 
TATTGTTTTG 
ACAAATATT 



21 

I 

CTCAACACTG 
TCCTGGCTGC 
GCAACTTTQA 
GCTTCACAOQ 
AGAAAAAGTT 
TCCTCAGTAA 
ATAGCOCAAG 
GGTTTAGTGC 
TATTGCATCA 
ATTTATAGCT 
TGGTTTAOTG 
GCAAGCAACA 
TCTCCTAAAT 



31 
I 

AGCAOATCTG 
TTTOATGTCA 
CTGCTGTCTT 
GCAGCTQQCC 
GTCTGTGTQC 
AAAAGTCAAG 
AACAGAAA6A 
TTATCTAATT 
TAOTTTGCTT 
QTA6GTTTTC 
CAAAGTATAA 
AGCTATTTTT 
T6TTGTAATT 



41 

I 

TTCTTTGAGC 
GTGCTQCTAC 
GGATACACAG 
AATGAAGGCT 
GCAAATCCAA 
AACATGTAAA 
ACCTTGCTGG 
TGTGCCTCAC 
TQTTTAAGCA 
TGTGTTTAGC 
AATTATATTT 
TAAAAAAACT 
GGATTATAAA 



51 
I 

TAAAAACCAT 
TCCACCTCTQ 
ACCGTATTCT 
QTOACATCAA 
AACA6ACTTG 
AACTOTGGCT 
GGTTGGAGGT 
TG6ACTTGTC 
TCACATTAAA 
TATTTAATAC 
GGGGGGGAAT 
ATTTAACATT 
ATAAGAAAAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

799 



Seq ID NO: €157 DNA Sequence 

Nucleic Acid Accession #< NM 013271.1 



1295 



wo 03/042661 



PCTAJS02/36810 



Coding sequence! 27.. 809 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



TCOGGAGCCA 
C0GGGQ60QT 
TCTGG6000Q 
AGACTGGCGC 
T6CAGQAGCT 
GG600GAGGC 
TCTGGGGCGC 
CT0CA6C6CA 
CCCAGCTTGT 
ACGACX3GCCC 
CCGAGCTGTT 
TGGGAGCXXX: 
CTGAGGGOGT 
TGCCTGCACG 
CAGAAGTGCC 
TTACCCC3GGC 
GATCTGAGC 



11 

I 

GGCTCGCTGO 

GCC06TAAAG 
TCCTOGCOGC 
G6CG0GGGCG 
6CAGGAGGCT 
CCCC06CAAC 
GCTOGCTOGC 
CCCCGCGCCC 
CGCGGGCCCX3 
GA66TACTTG 
GCGC06CCTC 
GCTGGGG6CG 
CCGCCTCTTG 
CCCGCCATCC 
CAGCCAGCGC 



21 
I 

GGCAGCATGG 

GAACCC0600 
TTCCGGOGGT 
CTGGCGCATC 
GAOGATCAGC 
TCTGATOOGG 
GCTCTGCTCC 
GTCCCOGCCG 
GATGCTGAGG 
CTGGGAOGGA 
OGCOGT6C00 
CT6CTG0GTG 
CXACCCTGAG 
CX3CCACCAGG 
TCTCACCX3GA 



31 
1 

06GGGTG6CC 
TGCTG6GCCT 
OCCTAAGOGC 
CAGTGCCCCX5 
TGCTGGA6GC 
AGGCXSOGCGT 
CTCTGGGOCT 
6C6CG0QGCT 
CGGCGCTCCG 
AGGCAGGOQA 
TTCTTG06G6 
C06ACXA0GA 
TX3AAAG6CCT 
CACTGCCCGG 

GGATCCCTAC 



41 

I 

GCFGCTCTGG 
GTTTOGGGOO 
AOOGTCTCOO 
AGGTGAGGCG 
OQAAOGTCAG 
CCTGGOGCAG 
GOAOQAOGAC 
TQACCCTQCC 
ACCCOGGCCC 
OCSAOACACCC 
AA6CGGGGAC 
T6TG6GCTCT 
A6AGACCCCG 
ATCCCGTGCft 
GCXyvGCACGT 
CCCCTQGCCC 



51 
I 

GG6CCG0GGG 
CCCCC060GC 
CCCTTG G CTG 
GC66G6G0GG 
GA6CGGGCGC 
CTGCTGCGCG 
GOG0AO6O6C 
GOCCTASCAG 
COGGTCTACG 
GAOSTGGACC 
TC06AGGGG6 
GAGCTG(XCC 
GCGCXZCCAGG 
CCCTGGGACC 
CCAGAGCAAC 
ACAATAAGAT 



,Seq ID NO: C15B DMA Sequence 
Nucleic Acid Accession #t NM_002245.2 
Coding sequence i 1B3 . . 11 93 



GGGCAGGAAG 
GCGGGCQGGA 
060GCTCC6G 
AQATGCTOCA 
CCTGGTGCTT 
TCTTCTCCTC 
AG06A06CTT 

GGAACTQQGA 
ATGGCCACAC 
TTQQCATTCC 
TCACCGGCAG 
CCATGGTCCA 
CCGCTGTCTT 
TTATTTCCCT 
AATTCAGAGA 
T6TTGGTAGT 
TCTATGTGAA 
CCTTCTCCTC 
CTTTTGTGGC 
ATTTGTTGCA 
CATTTTTATC 
AATGTCTTAT 
A6GATGTCTA 
TCTCGACCTT 
TTTTATACTT 
ATAGCAAAAT 
GTTTATQTOT 
CAAACTCACr 
TGTTTATATT 
ACTATAGATA 
ATAAAATAAO 



11 

I 

ACGGOGCTGC 
GCCAGQCCCO 
CCG6TCTGCG 
OTOCCTGOCC 
CGGCTTCCTG 
GGTGGAGCTG 
CTTGGAG6AG 
GGAGGCCAGC 
CTTCACCTCC 
OGTGCCCTTG 
CTTCACCCTC 
GCOGGTCCTC 
TGO0GT6CTC 
CTGAOTCCTG 
QAGCACCATT 
GCTCTATAAG 
TCTGOAAACC 
6AAGQACAAG 
6ATCACAGAC 
CACCCAGTCA 
TTATGCTAGA 
AGAATGOUVA 
TAAAAAACAA 
ATATGTGAGG 
ACATAGQAGO 
TTAACTGGAA 
TTAXATTTAO 
ACTGQTTTGC 
ATTTATAATO 
CTGTACATAT 
TTTTGTTTCT 
GGGAATAATA 



21 

I 

CCGGAGGAGC 
GGOGGGGGCG 
QC6TTGGCCT 
GGCAGCTCGT 
GTGCTGGGCT 
CCCTATGAGG 
OlOGAGTGCC 
AACTAGG6GQ 
GOGCTCTTCT 
TCAGATGGAG 
CTOTTCCTGA 
TACTTCCACA 
CTTGGGTTTG 
GAGGATGACT 
G6CCTGGGGG 
ATTGGGATCA 
TTCT6TGAAC 
QAGOAaOATC 
CAGGCAGCTG 
TCTGCCTGCG 
GCACCAGGGT 
AGCGAAAATT 
GAAAAAAAOA 
AAATGAGATG 
AGAATACTTO 
ACTTTGGGGT 
AA6CAAAAAA 
ATGTACCCAC 
CATAG6TAAC 
GOTTTAGGTC 
TTTGATTTCT 
AACTTGAGAO 



31 

I 

GGGGCGGGCG 
GGG60G60GG 
TGGCTTTG6C 
GGOTGOGCCT 
ACTTGCTCTA 
ACCTGCTGCG 
TGTCTGAGCA 
TGTCGGTGCT 
TGGGCA6CAC 
GTAAGGCCTT 
CGGCTGTGGT 
TCCGCTGGGG 
TCACTGTGTC 
GGAACTTGCT 
ATTATGTGCC 
06TGTTACCT 
TCCATGAGCT 
AGGTSCACAT 
QCATGAAAGA 
TGGATGGCCC 
CAGQGT GCAA 
ATGTCACTTT 
GACATGGAAC 
TCCACCTAAA 
AAGCAGTAXG 
TTGCATTTAG 
AAAAAGCATA 
CCAAAATGAT 
CATTAACTAT 
ACCAOATCCT 
CTTTATACTA 
TGAATAACCA 



41 

I 

GGCG06GGGG 
GGCCAG AAGA 
TTTOGCGGCG 
C^TGQAGOGG 
CCTGGTCTTC 
CCAGGAGCTG 
GCAGCTGGAG 
CAGCAACGCC 
CGT6CTCTCC 
CTGCATCATC 
CCAGOGCATC 
CTTCTCCRAG 
CTGCT TCTTC 
GGAATCCTT7 
T6GGGAAGGC 
GCTACTT6GC 
6AAAAAATTC 
GATAOAGCAT 
QGAOCAGAAO 
TGCAAACCAT 
GGAAGAGGCT 
AAGAAATAGC 
AAAGAAGCTG 
ATTCATATGT 
CTGCTGTGGT 
ATCATTTAGC 
GAGATGTGTT 
TATTTTTQGA 
OTACATATAA 
AQTQTAGTTC 
AAGAATCCAO 
T 



Seq ZD NO I C159 DNA Sequence 
Nucleic Acid Accession ftt NM_005472.1 
Coding sequence i 93.. 404 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

969 



51 
I 

GGAGCGGGGG 
GG0GG06GGC 
GOGGTGGAGA 
CACOGCTCQG 
GGC6CAGTGG 
CGCAAGCTGA 
CAGTTCCTGG 
TOGGGCAACT 
ACCAGAGOTT 
TACTCOGTCA 
ACCGTGCACG 
CAGGTG6TGG 
TTGATGCC6G 
TATTTTTGTT 
TACAATCAAA 
CTTATTGCCA 
AGAAAAATGT 
OACCAACTGT 
CAAAATGAGC 
TGAGCGTAGG 
TAAGTATGTT 
TACTGTTTGC 
TSACCOCAOC 
6ACAAAATTA 
TAQAAGCAGA 
TGATGGCTAA 
TTATAAATAO 
QAATCTAAOT 
AOTATAAATA 
TQAAACTAAa 
A6TTQCTACA 



AAAGG6ACTC 
CCQAOTCTTC 
GGTATGAGAG 

TACCTGGCOQ 
TAACTGTGGG 
ACCCCTATCA 
GGGTGGAAGA 
TGCATCAGOT 



11 
I 

CTTGAAACTG 
CCCCACCTCA 
CCTGCATGCC 
AGGGCCAGGG 
TQATGACAAC 
CAGCCTCATC 
TGTGTATATC 
CCAAGACACC 
CT 



21 
1 

ATTGAGAGCC 
ATCCCTGTTG 
GTGCTGAAGG 
CTGGGGCCAG 
TCCTACATGT 
CTGGGATACA 
AAGAACOGTG 
TGGGGATTGC 



31 
1 

CAGTGGATTT 
CTATGQAGAC 
CTCTAAATGC 
ACAACCA6AC 
ACATTCrCTT 
CCCGCTCCCG 
TGTCTATGAT 
GTCTGGGGCC 



41 

1 

GCCAGCAGTT 
TAGCAATGGA 
CACTCTTCAC 
TGAAGAGAOG 
TGTCATGTTT 
CAAAGTGGAC 
CTAACACGAG 
TCCAGAACTC 



51 
I 

TGAGCTTCTA 
ACGGAGACCT 
AGCAATTTGC 
GGGGCCAGCC 
CTATTTQCTG 
AAGCGTAGTG 
AGGGCTGGGA 
TGCTGTGGAC 



Seq ID NO: C160 DNA Sequence 

Nucleic Acid Accession 9: NM_005245.1 

Coding sequencei 187.. 13959 

1 11 21 31 ' 41 51 

I I I 1 I I 

CTGG6CGGCC GGQG6GGGGQ AGAGGGCGOO 0GAGCG6CTC GT6066CA66 TACCAT6GQG 

1296 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

ISOO 

1560 

1620 

1680 

1740 

1800 

1860 

1901 



60 

120 

180 

240 

300 

360 

420 

480 

492 



wo 03/042661 



PCT/US02/36810 



AGGOQOCSAGC CCQCCGAGGC CC0GGCAG6C CCXTTCCCTQC TOGOGGGOGC 6CTGAGACX30 120 

CGGGTGAGCT CCAGGftGAGC GCXrCTCGCCA CTTCGGGCQV ACTTTBCGAT TGCGQAGAOT IBO 

TAAGCAATG6 GGAGACIVTTT GGCTTTGCTC CTGCTTCTGC TCCTTCTCTT CCAACATTTT 240 

GGAGACAGTG ATGGCAGCCA ACGACTTGAA CAGACTCCTC TGCAGTTTAC ACACCTCGAG 300 

5 TACAACGTCA CCGT6CAGGA GAACTCTGCA GCTAAQACTT ATGTGGG6CA TCCTGTCAAG 360 

ATGGGTQTTT AC^TTACACA TCCAGOGT GG GAAOTAAGST ACAAAATTGT TTCGG6AGAC 430 

AGTGAAAACC TOTTCAAAGC TGAAGAGTAC ATTCTOQQAG ACTTTTGCTT TCTAAQAATA 480 

AGGACCAAAG GAGGAAATAC AOCTATTCTT AATAQAGAAG TGAA6GATCA CTACACATTG 540 

ATAOTGAAAQ CACTTGAAAA AAATACTAAT GTGGAGGCGC GAACAAAGGT CAGGGTOCAG 600 

10 GTGCTGGATA CAAATGACTT 6AGACCX3TTA TTCTCACCCA CCTCATACAG OGTTTCTTTA 660 

CCTGAAAACA CAGCTATAA6 GACCAGTATC GCAAGAGTCA 6G6CCAGGGA T6CAGACATA 720 

GGAACCAACG GGQAATTTTA CTACAOTTTT AAAGAT06AA CAOATATOTT T6CTATTCAC 7 BO 

CCAACCftGTO GTGTGATAGT GTTAACTGGT AGACTTGATT ACXTTAGAGAC CAAGCTCTAT B40 

GAGATGGAAA TCCTCGCTGC GGACCGTGGC ATGAAGTTGT ATGGGAGCAO TGGCATCAGC 900 

IS AGCAT6QCCA AGCTAACXSGT GCACATGGAA CAGGCCAATG AAT6TGCTCC GGTGATAACA 960 

GCAOIGACAT TGTGACCATC AQAACTG6AC AGGGAGGCAG CATATGCAAT TGTGAGAGIG 1020 

GATGACTGCG ATCAGGGTQC CAATQGTGAC ATAGCATCTT TAAGGATGGT GGCAGGTGAC 1080 

CiTCTCCAGC AGTTTAGAAC AGTGAGGTCC TTTCCAGG6A GTAA6GAGTA TAAA6TCAAA 1140 

GCCAT06GTG ACATTGATTG GGACAGTCAT CCTTTCGGCT ACAATCTCAC ACTACAGGCT 1200 

20 AAA6ATAAA0 GAACTCCX3CC CCAGTTCTCT TCTGTTAAAG TCATTCA08T GACTTCTCCA 1260 

CAGTTCAAAO CC6GGCCAGT aVA6TTT8AA AAG6ATGTTT ACAQAOCAQA AATAftO TGAA 1320 

TTTGCTCCTC CCAACACACC TGT6GTCATG GTAAA66CCA TTCCT6CTTA TTCCCATTTQ 1380 

AGGTATGTTT TTAAAAGGAC ACCTGOAAAA GCTAAATTCA GTTTAAATTA CAACACTGGT 1440 

CTCATTTCTA TTTTAGAACC AGTTAAAAGA CAGCAGQCAG CCCATTTTGA ACTTGAAGTA 1500 

25 ACAACAA6TG ACAGAAAAGC 6TCCACCAAG GTCTTGGTGA AAGTCTTAGG T6CAAATAGC 1560 

AATOCCCCTG AATTTACCCA QACA60GTAC AAAGCIGCTT TTGATGAOAA OSTQCCCATT 1620 

GGTACTACTA TGATQAGCCT GASTGCCGTA QACCCTQATO AG6STGAGAA TGGGTACGTG 1680 

ACATACAOTA TOGCAAATTT AAATCATGTG CCGTTTGCQA TTGACCATTT CACTGGTGCC 1740 

GTQAQTACGT CAGAAAACCT QQACTACGAA CTGATGCCTC GGGTTTATAC TCTGAGGATT 1800 

30 CX3TGCATCAG ACTGGGGCTT GCOGTACCGC CGGGAAGTCG AAGTCCTT6C TACAATTACT 1660 

CTCAATAACT TQAATQACAA C:ACACX:tTTG TTTGAGAAAA TAAATTGTGA AOGGACAATT 1920 

CCCA6AGATC TAG6CGTGGG AGAGC3MTA ACCACTGTTT CTGCTATTGA T6CAGATGAA 1980 

CTTCAGTTGG TACAGTATCA GATTGAAOCT GGAAATGAAC TGGATTTGTT TAGTTTAAAC 2040 

CCCAACTCGG GGGTATTGTC ATTAAAGCGA TCGCTAATGG ATGGCTTAGG TGCAAAGGTQ 2100 

35 TCTTTCCACA GTCTGAGAAT CACAGCTACA GATGGAGAAA ATTTT6CCAC ACCATTATAT 2160 

ATCAACATAA CA0T6GCTGC CACSTCACAAO CTGGTAAACT TGCAGTGT6A AGAGACTGGT 2220 

6TT6CCAAAA T6CTGGCAGA GAAGCTCCTG CAGGCAAATA AATTACACAA CCAQ6QA0AG 2280 

GTGGAGGATA TTTTCTTOOA TTCTCACTCT GTCAATGCTC ACATACCGCA GTTTAGAAGC 2340 

ACTCrrCCGA CTGGTATTCA GGTAAAGGAA AACX:AGCCTG TGGGTTCCAG TGTAATTTTC 2400 

40 AT6AACTCCA CT0ACCTTC3A CACTGGCTTC AATGGAAAAC TGGTCTAT6C TGTTT C TGGA 2460 

GQAAATOAGO ATAGTTGCTT CATOATTGAT ATGGAAACAG GAATGCTGAA AATT TTAT CT 2520 

CCTCTT6ACC GTGAAACAAG AGACAAATAC ACCCTGAATA TTACXXSTCTA TGACCTTGGG 2580 

ATACCXXyvOA AGGCTGCGTQ QOGTCTTCTA CATGTC6TGQ TTGTCX5ATGC CAATGATAAT 2640 

CX:ACXX:GAGT TTTTACAGGA GAGCTATTTT GTGGAAGTGA GTGAAGACAA GGAGGTACAT 2700 

45 AGTGAAATCA TCCAGGTTGA AGCCACAGAT AAAQACCTGG GQCCCAACXK3 AGAOGTGAOQ 2760 

TACTCAATTC TTACAGACAC A6ACACATTT TCAATT6ACA GOOTGAGGGO TGTTGTTAAC 2820 

ATCGCACGCC CTCTGGATC6 AGAGCTGCAO CATGAGCACT CCTTAAAGAT TGAGGCCAGG 2880 

GACCAAGCCA GAGAAGAGCC TCAGCTGTTC TCCACTGTOG TTOTGAAAGT ATCACTAGAA 2940 

GAT6TTAATG ACAACCCACC TACATTTATT CCACCTAATT ATCOTGTGAA AGTCXMAGAG 3000 

50 6ATCTTCCAG AAOGAACCOT CATCATGTGG TTAGAAGCXX; ACXSATCXTTGA TTTAGGTCAG 3060 

TCTGGTCAGG TGAGATAGAG CCTTCTGGAC CACGQAGAAG GAAACTTOGA TOTOQATAAA 3120 

CTCAOTQGAO CAGTTAGQAT CGTGCAGCAO TTGGACTTTG AGAAGAAGCA A6TGTATAAT 3180 

CTCACTGTGA GGGCCAAAGA CSW^GGGAAAG CCAGTTTCTC TGTCTTCTAC TTGCTAT6TT 3240 

GAAGTTGAGG TGGTTGATOT QAATGAGAAC CTOCACCCAC CCGTGTTTTC CaQCTTTGTG 3300 

55 6AAAAGGGGA CAGT6AAAQA AGATGCACCT GTTGGTTCAT IG6TAATGAC GGTGTC6GCT 3360 

CATGATGAGG ACQCOOGAAO AGAT6GGQAG ATCOQATACT CCATTAQAGA TGGCTCTQGC 3420 

CrnGG T G TTT TCAAAATAGG TGAAGAGACA GGTGTCATAG AGACGTCAGA TCGACTGGAC 3480 
CGTGAATCGA CCTCCCATTA TTGGCTAACA GTCTTTGCAA CCX5ATCAGGG TGTCGTGCCT 3540 
CTTTCATCGT TCATAGAGAT CTACATAGAG GTTGAGGATQ TCAATGACAA TGCACCACAG 3600 
60 ACATCAGAGC CTGTTTA7TA CCCAGAAATC ATG6AAAATT CTCCTAAAGA TGTATCTQTG 3660 
GTCGAGAT06 AG6CATTTGA TCCAOATTOO A6CTCTAATG ACAAGCTCAT GTACAAAATT 3720 
ACAAGTGGAA ATCCACAAGG ATTCTTTTCA ATACATCCTA AAACAGGTCT CATCAGAACT 3780 
ACGTCAAGGA AOCTAOACOO AGAACAGCAA GATGAACACA TATTAGAGGT TACTGTGACA 3840 
GACAATGGTA GTCCCX:CCAA ATCAACCATT GCAAGAGTCA TTGTQAAAAT CCXTGATGAA 3900 
65 AATGACAACA AACCTCAGTT TCTGCAAAAO TTCTACAAAA TCAOACTCOC TGAGCGGGAA 3960 
AAGCCAGACC GAGAAAOAAA T6CCAGAGGG 6A6C0GCTCT AT06C3GTCAT AGCCACOSAC 4030 
AAGGATGA6G 6CCCCAATGC AGAAATCTCC TACAGCATGO AAGAOGGGAA TGAGCATGGC 4080 
AAATTTTTCA TCGAAC03AA AACTGGAGTG GTTTOSTCCA AQAGGTTTTC AGCA6CTGGA 4140 
QAATATGATA TTCTTTCAAT TAAGGCAGTT GACAATGGTC GCCCTCAAAA GTCATCAACC 4200 
70 AGCAGACXCC ATATTGAATG QATCTCCAAG CCCAAACAGT CCCTGGAOCC CATTTCATTT 4260 
GAA6AATCAT TTTTTACCTT TACTGTGA1G GAAAGTGACC CG8TTGCTCA CAT6ATTGGA 4320 
GTAATATCTG TGGAGCCTCC T66CATACCC CTTTGGTTTG ACATCACTGO TGGCAACTAC 4380 
GACAOTCACT TOGATGTGGA CAAGGGAACT GGAACCATCA TTGTTGCCAA ACCTCTTGAT 4440 
GCAGAACAGA AGTCAAACTA CAACCTCACA GTCGAGGCTA CAGATGGAAC CACCACTATC 4500 
75 CTCACTCAGG TATTCATCAA AGTAATAGAC ACAAATGACC ATOGTCCTCA GTTTTCTACA 4560 
TCAAAGTATO AACTTGTTAT TCCTGAAGAT ACAGGGCXAG AAAOUSAAAT TTTGCAAATC 4620 
AGTGCTGTGG ATCAGGATGA GAAAAACAAA CTAATCTACA CTCTGCAGAG CAGTAGAGAT 4680 
CCACTGAGTC TCAAQAAATT TCGTCTTGAT CCTGCAACCG GCTCTCTCTA TACTTCTOAG 4740 
AAACTGOATC ATGAAGCTGT TTCACCAGCA CACCTCACGG TCATGOTACO AGATCAAGAT 4800 
80 GTGCXriGTAA AACGCAACTT TGCAAGGATT QTGGTCAATG TCAGOGACAC GAAT GACC AC 4860 
6CCCCGT0GT TCACCGCZTC CTCCTACAAA 66Q060GTTT ATGAATCXXSC AGCOGTTGGC 4920 
TCAGTTGTGT TGCAGOTQAC GGCTCTGGAC AAGGACAAAG GGAAAAATOC TGAAGT6CT0 4980 
TACTCGATCO AGTCAGGAAA TATXGGAAAT ATTGGAAAXT CTTTTATGAT TGATCCTGTC 5040 
TTGGGCTCTA TTAAAACTQC CAAAOAATTA GATOOAAGTA ACCAAGOGQA GTAT6ATTTA 5100 
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ATGGTAAAAO CTACAGATAA GGaCAGTCOV CCAATQAGTG AAATAACTTC TGT606TATC 5160 

TTTGTCACAA TTGCTGACAA CGCCTCTCCO AAGTTTACAT CAAAAGAATA TTCTGTTGAA 5220 

CTTAGTGAAA CTGTCA6CAT TGGGAGTTTC GTTGGGAT60 TTACAGCCCA TAGTCAATCA 5280 

TCAGTOGTQT AT6AAAT/VAA AGATGGAAAT ACAGGTGATO CTTTTGATAT TAATCCACAT 5340 

TCTGGAACTA TCATCACTCA GAAAGCCCTG GACTTTGAAA CTTPGCCCAT TTACACATTG 5400 

ATAATACAAG GAACTAACAT GGCTGGTTro TCCACTAATA CAACGGTTCT AGTTCACTTG 5460 

CAGGATGA6A ATGAGAA06C GCGAGnTTT ATGCAGGCAG AATATACAGG ACTCATTAGT 5520 

GAATCAGCCT CAATTAACAQ aJTGGTCCTA ACAQACAGGA ATOTCCCACT GGTGATTOGA 5580 

GCAGCTGATG CTQATAAAGA CTCAAATGCT TTGCTTGTAT ATCACATTGT TGAACCATCT 5640 

GTACACACAT ATTTTGCTAT TGATTCTAGC ACT GG TGCTA rrCATACAGT ACTAAGTCTQ 5700 

QACTATQAAO AAACAAGTAT TTTTCACTTT ACCGTCCAA6 TGCATCACAT GGGAACCCCA 5760 

CGTTTATTTG CT6AGTAT6C AGOGAATGTA ACAGTACAT6 TAATTGACAT TAATGACTGC 5B20 

CCCCCTOTOT TTGCCAAGCC ATTATATGAA OCATCTCTTT TOTTACCAAC ATACAAAGGA SBBO 

6TAAAAGTCA TCACAGTAAA TGCTACAQAT QCTQATTCAA GTGCATTCTC ACAGTTGATT 5940 

TACTCCATCA CC6AAGGCAA CATGGGQGAQ AAOTTTTCTA TGGACTACAA GACT6GTQCT 6000 

CTCACTGTOC AAAACACAAC TCAGTTAAGA AGCG6CTAC36 AGCTAAC06T TAGAGCTTCC 6060 

GATGGCAQAT TT G CCGGCCT TACCTCTOTC AAAATTAAT6 TOAAAOAAAO CAAAGAAAGT 6120 

CACXTTAAAOT TTACCCAGGA TGTCTACTCT GCGGTAGTGA AAOAGAATTC CACCGAGGCC 6180 

OAAACATTAG CTGTCATTAC TGCTATTGGG AGTCCAATCA ATGAGCCTTT 6TTTTATCAC 6240 

ATCCTCAACC CAOATGGCAG ATTTAAAATA AGG08CACTT CAGGGGTTCT GTCAACCACT 6300 

G6CA0QC0CT TGOATCGTOA GO^CAGGAG GGOTTTGATG TGGTTGnAQA AGTGATAGAG 6360 

GAACATAAGC CTTCT6CAGT GGCCCACGTT GTOGT6AAGG TCATTGTAQA A6ACCAAAAT 6420 

GATAATOCGC CJGGTGTTTGT CAACCTTCCC TACTACGCOG TTGTTAAAGT GGACACTGAG 6480 

GTGGGCCATG TCATTCGCTA TGTCACTGCT GTAGACAGAQ ACAGTGOCAG AAACX3GQGAA 6540 

GTQCATTACT ACCTCAAGOA ACATCATGAA CACTTTCAAA TTGGACCCTT GGGTQAAATT 6600 

TCACT6AAAA AOCAATTTGA GCTTGACACC TTAAATAAAO AATATCITOT TACAGTGOTT 6660 

GCAAAAGATG GAGGGAACCC GGCCTTTTCA OOQGAAGTTA TOGTTCCSAT CACTGTCATG 6720 

AATAAAGCCA TGCCTGTGTT TOAAAAACCT TTCTACAGTQ CAGAQATTGC AGAGAGCATC 6780 

CAGOTGCACA GCCCTOTCSGT CCAOGTGCAG GCTAACAGCC OGGAAQQCCT GAAAGTGTTC 6840 

TACAGCATCA CAGACGGAGA CCCTTTCAGC CAGTTCACTA TTAACTTCAA TACTGGAGTT 6900 

ATCAATGTCA TAGCTCCTCT GGACTTTOAQ QCCGACCGGG CATATAAGCT GAGCATAGQC 6960 

GCAACTGACT CCTTGAOSGG 0GCTCAT6CT GAAGTATTTG TG6ACATCAT AGTAGACQAC 7020 

ATCAATGATA ACCCTCCTGT GTTTGCTCAG CAGTCTTATO CGQTOACCCT GTCTGAGGCA 7080 

TCTGTAATTG GAACGTCTGT TGTTCAAGTT AGAGCCACCG ATTCTGATTC AGAACCAAAT 7140 

AGAGGAATCT CATACX»GAT GT7TGGGAAT CACAGCAAGA GTCATGATCA TTTrCATGTA 7200 

6AGA6CA6CA CTGGCCTCAT CTGACTACTC AGAAOCCTGG ATTACQA6CA GTCCOGGCAG 7260 

CACAOGATTT TTGTGAGGGC AGTTGATG6T GGTAT6CCCA GGCTGAGCAG TGATGIGATT 7320 

OTCACJGGTGG ACGTTACOOA CCTCAATGGT AATCCACX31C TCTTTQAACA ACAOATTTAT 7380 

GAAGCCAGAA TTAGCGAGCA OGCCCCTCAT GGGCATTTCG TGACCTGTOT AAAAGCCTAT 7440 

GATGCAGACA GTTCAGACAT AGACAAGTTG CAQTATTCCA TTCTGTCTGG CAATGATCAT 7500 

AAACATTTTG TCATTOACAO TGGAACAGGO ATTATCAOOC TCTCAAACCT 6CAC0GGCAC 7560 

GCCCTGAA6C CATTTTACAG TCTTAACCT6 TCA6TGTCTG ATQGAGTTTT TAQAAGTTCC 7620 

ACCCAGGTTC ATGTAACTGT AATTGGAGGC AATTTGCACA GTCCTOCTTT CCTTCAGAAC 7680 

GAATATGAAG TGGAACTAGC TGAAAAOGCT CCCCTACATA CCCTGGTGAT GGAGGTGAAA 7740 

ACTACX3QATG GGGATTCTGG TATTTATQGT CACGTTACTT AOCaVTATTGT AAATOACTTT 7800 

OCCAAAOACA GATTTTACAT AAATGAQAGA GGACAGATAT TTACTTTGQA AAAACTTGAT 7860 

GGAGAAACXX: C3GGGG6AGAA AGTGATCTCA GTCCGTTTAA TGGCTAAGGA TGCIGGAGGA 7920 

AAAGTTGCTT TCTGCACXX3T GAATGTCATC CTTACAGATG ACAATQACAA TGCACCACRA 7980 

TTTC6AGCAA CCAAATACGA AOTQAATATC GGGTCCAGTG CTGCTAAAGG GACTTCAGTC 8040 

GTAAAGTCTO CAAOTQATGC CQATGAGGGC TCCAATGCCO ACATCAOCTA TGCCATTGAA 8100 

GCAGACTCTQ AAAOIGTAAA AGAG AATTTG GA AATTAAC A AACTQTOOGQ OGTAA TCACT 8X60 

ACAAAGQAGA QCCICATTQG CTTGGAAAAT OAATTCTTCA CTTTCTTTGT TAOAGCTGTO 6220 

GATAATGGQT CTCCATCAAA A6AATCTGTT GTTCTTGTCT ATGTTAAAAT CCTTCCACCQ 8280 

GAAATGCAGC TTCCAAAATT TTCAGAACCT TTCTATACCT TTACAGTGTC AOAGOACXrrG 8340 

CCTGTTGGAA CAGAGATAGA TCTCATCOGA GCAGAACATA GTGGGAC7GT TCTTTACAGC 8400 

CTG6TGAAAG GGAATACTOC AGAAAQCAAT AGGQATGAGT O CTTTOTGAT TGACAGACAO 8460 

AG06GQAGAC T6AA6TTGGA GAAGAOTCTT GATCATQAGA CAACTAA6T0 GTATCAGTTT 8520 

TCCATACTGG CX»GCTGCAC TCAA6ATQAC CATGAGATGG TGGCTTCTGT AGATGTTAGT 8580 

ATCCAAOTGA AAQATGCAAA TGACAACAQC COGGTCTTTO AATCTAOTCX: ATATGAGGCA 8640 

TTCATTGTTG AAAACCTGCC AGGGGGAAGT AGAGTAATTC AGATCAGGGC ATCTGAT6CT 8700 

6ACTCAQGAA CCAACGGCCA AGTTATGTAT AGCCTGGATC AGTCACAAAG TGTGQAAOTC 8760 

ATTGAATCCT TTGCCATTAA CATGGAAACA G6CTGGATTA CAACTTTAAA GGAACTTGAC 8820 

CATGAAAAGA GAQACAATTA CCAGATTAAA GTGGTTGCAT CAGATCATGG TGAAAAGATC 8880 

CAGCTATCCT CCACAGCO^T TQTGGATGTT ACCGTCACXX3 ATGTCAACQA TAGTCCAOCA 8940 

OGATTCAOGG CXX3AGATCTA TAAAGGGACT GTGA6TGAGG ATGACCOCCA AGGTGGGQTG 9000 

ATTGGCATCr TAA0TA0C31C GQATGCTGAT TCTGAAGAGA TCAACAGACA AiSTTACATAT 9060 

TTCATAACAO 6AGGG6ATCC TTTAGGACAG m ' O C C e m AAACTATAGA GAATGAAIGG 9120 
AAGGTATATG TGAAGAAACC TCTAGACAGG GAAAAAAGGO ACAATTACCT TCTTACTATC 9180 
ACGGCAACTG ATGGCACCTT CTCATCAAAA GCX3ATAGTTG AAGTGAAAGT TCTGGATGCA 9240 
AATGACAACA GTCCAGTTTG TGAAAAQACT TTATATTCAO ACACTATTCC TOAAGAOaTC 9300 
CTTCCTGGAA AATTQATCAT GCAOATCTCT GCTACAGAOG CAGACATCGG CTCrAAOGCT 9360 
GAAATTACTT ACACGTTATT GGGTTCA6GT GCAGAAAAAT TCAAACTAAA TCCABACACA 9420 
GGTGAACTGA AAAOGTCAAC CCCCCTTGAT OGTGAGQAGC AAGCTGTTTA TCATCTTCTC 9480 
GTCAGGGCCA CAGATGGAGG AGQAAGATTC TGCCAAGCCA GTATTGTCOT CA0GCTAC3AA 9540 
GATGTGAACO ATAAOGCCCC CXSAATTCTCT OCCQATCCTT ATGCCATCAC OQTGTTTQAA 9600 
AACACAQAGC OQGGAAOOCT GCTGACAAGA GTQCAG6CCA CAGAT6C0GA CGCA06ATTA 9660 
AATOGQAAGA TTTTATACTC ACTGATTQAC TCTGCTQATO GCCAGTTCTC CATTAAOQAA 9720 
TTATCTGGAA TTATTCAGTT AGAAAAACCT TTQGACAGAG AACTCCAGGC AGTATACACX: 9780 
CTCTCTTTQA AAGCTGTGGA TCAAGGCTTG CCAAGGAGGC TQACTGCCAC TGQCACTGTG 9840 
ATTQTATCAO TTCTTGACAT AAATGACAAC CCCCCTGTOT TTGAGTACCG TQAATATGGT 9900 
GCCACOOTOT CTGAOGACAT TCTTOTTGOA ACTGAAGTTC TrCAAOTGIA TGCAGCAAOT 9960 
OGGGATATTG AAGGAAATGC A6AAATCACC TACTCAATAA TAAOTOQAAA TGAACATGGG 10020 
AAATTCM3CA TAGATTCTAA AACAGGGGCC GTATTTATCA TTGAOAATCT GGATTATGAG 10080 
AGCTCTCATG AGTATTACCT AACAGTAGAa GCCACTGAIG OAGGCA06GC TTCACTGA6C 10140 
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GACXSTIGOCA CTGTGAACGT TAATGTAACA GATATCAAOO ATAATACCCC TOTGTTCAGC 10200 

CAAGACftCCT ACAC3GACAGT CATCA5TGAA GATGCXXH^C TTGAGCAGTC TGTCATCACG 10260 

GTTATGGCCO ATGATGCCGA TGGACCTTCC AACAGCCACA TCCACTACTC AATTATAGAT 10320 

GGCAACX^AG GAAGCTCGTT CACAATTGAC CCXXTTCAGGQ GAGAAGTCAA AGTGACCAAA 103 BO 

5 CTTCTCGACC GAOAAACSAT TTCAGGTTAC AOQCTCACGQ TTCAAGCTTC TGATAATQGC 10440 

AGTCC31CCCA GAGTCAACAC GAOSACOGTG AACATOGATG TGTCOGATQT CAATGACAAC lOSOO 

G06CCX:QTCT TCTCCAOGGG AAACTACAGT OTCATTATCC AGGAAAATAA GCCA6TGGGC 10560 

TTCAGCGTGC TGC3M5CTGGT AGTAACAGAT GAGQATTCTT CCX3VTAACGG TCCACCCTTC 10620 

TTCTTTACTA TTQTAACTQG AAATGATQAG AAGGCTTTTG AAGTTAACCC GCAAGGAGTC 10680 

10 CrCCTGACAT CATCTGCX3VT CAAGAGGAAG GAGAAAGATC ATTACTTACr GCAQOTGAAQ 10740 

GT6GCA6ATA ATGGAAAGOC TCftQTTGTCA TCTTTGACAT ACATTQACAT TAGGGTAATT 10600 

GAGGAOAGCA TCTATC06CC T606ATTTTG GCCCTGGAQA TTTTCATCAC CTCTTCTGGA 10860 

GAAGAATACT CAGGTGGC6T CATTGGGAAO ATCCATQCCA CAOACCAGGA CGTGTATGAT 10920 

ACTCTAACCT ACAGTCTCGA CCCTCAOATG GACAACCTGT TCrCTQTTTC CAGCACAGGG 10980 

15 GGCAAGCTGA TAGCACACAA AAAGCTAGAC ATAGGGCAAT ACX:TTGTCAA 1X3TCAQCGTA 11040 

ACAGATGGGA AGTTCAOGAC GGTGQCXXSAC ATCACAGTGC ATATCAGACA AGTCACACAG 11100 

GAGATGTTGA ACCACACCAT CGCGATOCGC TTTGCCAACX: TCACTCOGGA AOAATTOOTT 11160 

GGTGACTACT GGCGCAACTT CCAGOGAGCT TTAOGGAACA TCCTQGGTGT GAGGAGGAAC 11220 

GACATACAGA TTGTTAGTTT GCAGTCCTCT QAACCTCACC CACATCTGGA OGTCTTACTT 11280 

20 TTTGTAGAGA AACCAGGTAG TGCTCAGATC TCAACAAAAC AACTTGTOCA CAAGATTAAC 11340 

TCTTCOGTQA CTOACATTGA GGAAATCATT GGAGTTAGGA TACT6AAT6T ATTOCAGAAA 11400 

CTCTGCX30GG GACT6GACTG CCCCTGGAAG TTqTGOQATG AAAAGOTGTC TGTGGATGAA 11460 

AGTGTGATGT CAACACACAO CACAGCCAGA CTQAOTTTTG TGACTCCCCG CXa^CCACAGG 11520 

GCAGCGQTGT GTCTCTGCAA AGAGGGAAGQ TGCCCACCTG TCCACCATGG CTGTGAAGAT 11580 

25 GATCCGTGCC CTGAGGGATC CGAATGTGTG TCTGATCCCT QGGAGGAGAA ACACACCTGT 11640 

GTCTOTCCCA 6GGGCAGGTT TGGTCA6T0C CCAGGQAGTT CATCTATGAC ACTGACTGGA 11700 

AACAGCTACG T6AAATACGG TCTGAlCGGAA AATGAAAACA AATTAGAQAT GAAACTGACC 11760 

ATGA6GCTCA GAACATATTC CACX3CATGCG GTTGTCATGT ATGCT06AGG AACTGACTAT 11820 

AGCATCTTGG AGATTCATCA TGGAAGGCTQ CAGTACAAGT TTQACTGTGG AAGTGQCCCT 11880 

30 GGAATTOTCT CTGTTCAGAG CATTCAGGTC AATGATGGGC AGTGGCACGC AGTGGCCCTG 11940 

GAAGTGAATG GAAACTATGC TOGCrXGGTT CTAGACCAAG TTCATACIGC ATG8GGCACA 12000 

6CCXX3VGO0A CTCTGAAAAC CCTGAACCTQ GATAACTATG TGTTTTrPQO TGOCCACATC 12060 

CX5TCAGCAGG GAACAAGGCA TGGAAGAAGT CCTCAAGTTG GTAATGGTTT CAGGGOTTGT 12120 

ATGGACTCCA TTTATTTGAA TGGGCAGGAG CTCCCTTTAA ACAGCAAACC CAGAAGCTAT 12180 

35 GCAOVCATOO AAGAGTCGGT GGATGTATCT CCAGGCTGCT TCCTGAGGGC CACGGAAGAC 12240 

T6CGCCAGCA AC0CT1GCCA GAATGGAGGC GTTTOCAATC CX2TCACCT6C TGQAOGTTAT 12300 

TACXGCAAAT GCAGTQCCTT 6TACATAGGG ACCCACTGTG A0ATAA6CGT CAATGCGTGT 12360 

TCCTCCAACC CATGCCTCTA TGGGGGCAOS TGTGTTGTOG ACAAOGGAGG CTTTGTTTGC 12420 

CAGTQTAGAG GATTATATAC TGGTCAGAGG TGTCAOCTTA GTCCATACTG CAAAGATGAA 12480 

40 CCCTGTAAGA ATGGC3GGAAC ATGCTTTGAC AGTTTGGATG GOGCOGTTTO TCAOTOTGAT 12540 

TGGGGTTTTA GGGGAGAAA0 OTQTCAGAGT OATATCXSAOQ AOTQCTCrOG AAACCCTT6C 12600 

CTGCAOGGGG CCCTCTGTGA GAACAOGCAC GGCTCCTATC ACTGCAACTG CASCCAOSAO 12660 

TACAGGGQAC GTCACTGCGA GGATGCTGCX3 CXX»ACCAGT ATGTGTOCAC GCOGTGGAAC 12720 

ATTGQGTTQG OGGAAGGAAT TGGAATOGTT GTGTTTGTTG CAGGGATATT TTTACTGGTG 12780 

45 GTGGTGTTTG TTCTCTGCCG TAAGATGATT AGTCX3QAAAA AGAAGCATCA GGCTGAACCT 12840 

AAA6ACAAGC ACCTGGGACC CGCTAOJOCT TTCTTGCAAA QACOSTATTT TOATTCCAAO 12900 

CTAAATAAQA ACATTTACTC AGACATACCA GOCCAGGTGC CTGTCOGGCC TATTTCCTAC 12960 

ACCCCX3AGTA TTCCAAGTGA CTCAAGAAAC AATCTGOACC GAAATTCCTT CGAAGGATCT 13020 

GCTATCCCAQ AGCATCCCGA ATTCAGCACT TTTAACCCOQ A6TCTGTGCA 0GGGCACCX3A 13080 

50 AAAGCAGTGG CGGTCTGCAG CGTGGCGCCA AACCTGCCTC COCCACXXWC TTCAAACTCC 13140 

CCTTCTGACA GCGACTCCAT CC3VGAAGCCT AOCTGGGACT TTOACTATOA CACAAAAGTO 13200 

GTGGATCrrG ATCCCTGTCT TTCCAAGAAG CCTCTAGAGG AAAAOCCTTC CCAOCCSITAC 13260 

AGTGCCCGGG AAAGCCTGTC TGAAGTGCAG TCCCTGAGCT CCTTCCAGTC OJAATCXSTGC 13320 

GATGACAATG GGTATCACTG GGATACATCA GATTGGATGC CAAGa3TT0C TCTGCOGGAC 13380 

55 ATACAAGAGT TCCCCAACTA TGAGGTGATT GATGAGCAGA CACCCXTTOTA CTC AGCAG AT 13440 

CCAAAOGCCA TCGATAOCSGA CTATTACCCT GGAGGCTACXJ AGATGOAAAG TGATTTTOCT 13500 

CCACCCCCAG AAGACTTCCC CGCAGCTGAT GAGCTACCAC GQTTACOGCC CBAATTCA6C 13560 

AATCAOTTTO AATCCATCCA CCCTCCTAGA GACATGCCTG COGOGGGTAG CrTGGGTTCT 13620 

TCATCAAGAA ACCGGCA6AG GTTCAACTTG AATCAGTATT TGCXX»ATTT TTATCCCCTC 13680 

60 GATATGTCTG AACCTCAAAC AAAAGGCACT GGTQAGAATA GTACTTGTAG A6AACCCCAT 13740 

GCCCCTTACC OGCCAGGGTA TCAAAGACAC TTOQAGGCGC OOSCIOTOGA GAGCATGCCC 13800 

ATGTCTGTQT AOOCCTCCAC OGCCTCCTQC TCrOACGTGT CAOOCTGCTO GGAAGTGGAO 13860 

TCCGAGGTCA TGATGAGTGA CTATGAGAGC GGGGACGACG GCCACTTCGA AaAGGTGAOQ 13920 

ATCCCGCCCC TGOATTCCCA GCAGCACAOG QAAGTCTGAC TCTCAACTCC CCCCAAAGTG 13980 

65 CCTGACTTTA GTGAACCTAG AGGTGATGTG AGTAATCOQC GCTGTTCTTT GCA6CAGTGC 14040 

TTCCAAGCTT TTTTTGGTGA GC06AATGGQ CATGGCTGCO CTGGATOCTG 00C CTCTG6 A 14100 

OQTGCTAGCC ATTTCCAGTG TCCCAACTAC TGTCATOGTO AGGTTTTCAT 06GCTGTGCC 14160 

ATTTCXXAAC GTCTTTTGGG ATTTACATCT GTCTGTGTTA AAATAATCAA AOG AAAAA TC 14220 

AGTCCTGT6T tTGTCAGCATG ATTCATGTAT TTATATAGAT TTGATTATTT TAATTTTOCT 14280 

70 G 'I' CA ' CrX ' i ' T T TTTGTAAATT TTATGTACAG ATTTGATTTT TCATAGTTTT AACTAGATTT 14340 

CCAAGATATT TTGTGCATTT QTTTCAACTG AATTTTOGTO GTGTCAGTGC CATTATCTAG 14400 

CACOCTGATT r i Tr m TTT TACTATAACC AGGGTTTCAT TCTGTCrTTT TGCACTGAAO 14460 

TGTGACATTT TGTTAGTACA TTTCAGTGTA GTCATTCATT TCTAGCTGTA CATAGGATGA 14520 

^ AOaAQAOATC AOATACATGA ACATQTCTTA CATGGGTTGC TGTATTTAGA ATTATAAACA 145 80 

75 TTTTTCATTA TTGGAAAGTG TAACGGGGAC CTTCTGCATA CCTGTTTAGA ACCAA AACCA 14640 

CCATGACACA GTTTTTATAQ TGTCTGTATA TTTGTGATGC AATGGTCTTG TAAAGGTTTT 14700 

TAATGAAAAC TACCATTAGC CAGTCTTTCT TACTGACAAT AAATTATTAA TAAAAT 14756 

Seq ID NO: C161 DNA Sequence 
SO Nucleic Acid Accession 0i NMJ>14220.1 
Coding sequence : 102 . . 710 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GTGAAOACAA 
CACGATGCAT 
TGCTTTACTT 
TC5TGQTTCTT 
TCATT6GGCT 
QATGTGC3QAT 
GTGTCATTGT 
AGTX3GAACTA 
GOBftGTGCAC 
TG6CTCTTG0 
QAGGGVTATG 
CCAGQACAOA 
ATTTGTAAAA 
TAAAGACTGO 
GTTTATT1X3T 
TTACAGACTQ 
AAATAACATT 
AGTTGCTTTT 
CACTGCTTGT 
AACI6CCTTG 
TGTTCCAATC 
OAGATTCAAA 
ATCATGTGTT 
AATAAAGCAG 
AG6AAATACC 
AAAAAAAAAA 



CTTTCrCCAC 
AGABAAG6G0 
OGGACATTCT 
TOCCAATGGO 
TTCTGGCATC 
GGAACAGOAT 
GCTTTCTTCT 
GGCAGCGCTT 
CACCTTTGCC 
TC3AACCCAAG 
TGGAATTGAA 
TGGCTTTTGC 
GCCACAATCT 
CTTTGTATTA 
CATCTTCACA 

T ' nit^riT ' rr 

AOTQACAGTA 
CCAATCACTA 
TATAACACCA 
ATGATGTT7C 
TGTTCTGTGA 
CAAATQAATO 
TTTTTCTAAC 
TAAAAAAAGA 
TT6ATCAGCA 
CTCAAAACTA 
AAAAAAAAAA 



CAGAAGGOCA 
GAGAAAACCr 

CTGGTGGGGC 
GAAACAAAGT 
GTAGGAGGTG 
GACTGCT6TG 
GTATTGGCTG 
GGCTTAGCAG 
A6CACCGAGG 
CACATTGTGG 
TTCATCTTGT 
TGCTCrCACC 
TCCTCTATTT 
GTGTAACATA 
GGATGTCAGT 
TTTTAAGGAA 
CTCAGTATAT 
TTGTATATAT 
AGAAGGAGAA 
CCATTCATAC 
GAAAOUUVTA 
CATCACAACT 
ATATGGAAAG 
AAGQCTACGA 
TCATTGGAAC 
ACTTOTTTAC 
AAA 



ChCTTTCATC 
AGCAGACCAC 
TOGCCCTCCT 
ATGCCTCOQA 
GCCTGCTGAT 
GCTGCTGTGG 
CrCTCATTGO 
AAG6ACCACT 
GCCAGTACCT 
AATGGAATGT 
GTCTTATTCA 
AACAGCAATA 
CATTGTAATT 
CTCCCCACAG 
GTTTAAATTT 
TGAOGAAACA 
CT6A0ATAAA 
GTGCATGTAT 
AATCCXJACAA 
ACCTATAAAT 
TTTACTTAGA 
TAQU^TGCTG 
CCTTTTGTCC 
TQACTGQGCA 
ATGGGGAOSA 
AACAAAATAA 



TAATTTGOQG 
CATGTGCTAT 
QTGCATCGOG 
AAACCACCTC 
GCTCCTGCCA 
CCATGAAAAC 
AATT6CAGGA 
ATOTCTTQAT 
TCTGGATACC 
ATCTCTGTTT 
AOTAATAAAT 
TGACTGCTAA 
TATATATTTC 
TCTACTTTTA 
AGTAAACTTC 
AACCACOCTC 
CTCTATAATG 
TTTTTAAATT 
CCTGOAAAGA 
CTCTAACAAG 
GTGGAA GGAC 
CTCATT6TTG 
TCCAAAGATG 
AGAAOAAAQA 
GTGACGGCAG 
AGTATTCACT 



TATCACTGAG 
GGGAAGTGTG 
GCTAATATTT 
AGCOGCTTOG 
OCATTTGTCT 
TOTGGCAAAC 
TCTGGCTACT 
TCCCTOGGCC 
TOCACATGGT 
TCTATCCTCT 
GGAGTGCTTG 
AAGAAGCAAC 
ACTTOTATTC 
CAAAOOCCTG 
TTTTTTGTTT 
T6GGGGTAGT 
TTTTGGATAA 
AAAGATGTCT 
TTTTTQTTTT 
AGGCCCTTTG 
TGATTGAGAA 
TGACTACTAT 
AGTACTAGC^ 
TGGGAAACTG 
6AG6ACCA0G 
AGGAAAAAAA 



60 
120 

leo 

240 

300 

360 

420 

4B0 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1583 
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GTrcm ty w ACACATCACA CAGAATTGGA GTGCTGTCCT TCTGQAGAGT GGTGSAGAAC 60 

CAA6ATACAG TTCAGAACCA AAGGAATAQA QAAGGQCTTT GATTTCTTTT TGGCTTTA6A 120 

TTGGGGATTT GGQA6GCTTA GCAGGAAAGA TGTCCACTQA AAATGTGGAA GQGAAGOCCA ISO 

GTAACCTTGG GGAGAGAGGA AGAGCCCGGA GCTCCACTTT CCTCAGGGTT GTCCAGCCAA 240 

TGTTTAACCA CAGTATTTTC ACTTCTGCAG TCTCTCCTGC TGCAGAAOGC ATCOGATTCA 300 

TCTTGGGAGA GGAGGATGAC AGCCCAGCTC COCCTCAGCT CTTCACGGAA CTGGA TGAGC 360 

T6CTGGCCGT GGAT6GGCAG GA6ATGGAGT 08AAGGAAAC AGCCAGGTGQ ATCAAGTTT8 420 

AAQAAAAAGT GGAACA6G6T 6GGGAAA0AT G6A6CAAGCC CCATGTG6CC ACATTGTCCC 480 

TTCATAGTTT ATTTGAGCTG AGGACATGTA TGGAGAAAGG ATCCATCATG CTTGATOGGG 540 

AGGCTTCTTC TCTCCCACAG TTGGTGGAGA TGATTGTTGA CCATCAOATT GAGACAGQCC 600 

TATTQAAACC TGAACTTAAG GATAAGGTGA OCTATACTTT QCTCCGGAAG CACCGGCATC 660 

AAACCAAGAA ATCCAACCTT OQGTOCCTGG CTGACATTGG QAAGACAGTC TOCAOTGCAA 720 

GTAGGATGTT TACCAACCCT GATAATGGTA GCCCAGCCAT 6AGCCATAGG AATCTOACTT 780 

CCTCCAGTCT QAATGACATT TCTGATAAAC COOAQAAGGA CCAGCTGAAO AATAAOTTCA 840 

TGAAAAAATT GCCACGTGAT GCAGAAGCTT CCAACGTGCT TGTTGGGGAG QTTGACTTTT 900 

TGGATACTCC TTTCATTGCC TTTGTTAGGC TACAQCAGGC TGTCATGCTO GGTOCCCTGA 960 

CTGAAGTTOC TGTGCCCACA AGGTTCrTGT TCATTCTCTT AGGTCCTAAG GGGAAACCCA 1020 

AGTCCTACCA CGAGATTGGC A6AGCCATTG CCAC0CT6AT GTCTGATGAQ GTGTTCCATQ 1080 

ACATTQCTTA TAAAGCAAAA GACAGGCACG ACCTQATTGC TGGTATTGAT GAGTTCCTAG 1140 

ATGAAGTCAT OQTCCTTCCA CCTGGGGAAT GGGATCCAGC AATTAGGATA GAGCCTCCTA 1200 

AOAGTCTTCC ATCCTCTGAC AAAAGAAAGA ATATOTACTC AGGTGGAGAG AATGTTCAGA 1260 

TGAATG6GGA TACGCCCCAT GATG6A6GTC ACGGAG6A6G AGGACATGGG GATTGTGAAO 1320 

AATTGCAOCG AACTGOACQO TTCTGTGOTG GACTAATTAA AGACATAAAO A6GAAA606C 1380 

CATTTTTTGC CAC3TGATTTT TATGATGCTT TAAATATTCA AGCTCTTTOG GCAATTCTCT 1440 

TCATTTATCT GGCRACTGTA ACTAATGCTA TCACTTTTGG AGGACTGCTT GGGGATGCCA 1500 

CTGACAACAT GCAGGGCGTG TTGGAGAGTT TCCTGGGCAC TGCTGTCTCT GGAGCCATCT 1560 

TTTGCCTTTT TGCTG6TCAA CCACTCACTA TTCTGAGCAG CACCGQACCT GTCCTAGTTT 1620 

TTQAGAGGCT TCTATTTAAT TTCAGCAAGG ACAATAATTT TGACTATTTG GAGTTT06CC 1680 

TTTGGATTGO CCT0XX30TCC GCCTTCCTAT OTCTCATTTT GQTAGCCACT GATGCCAGCT 1740 

TCTTGGTTCA ATACTTCACA OOTTrCAOGG AOGAGGGCTT TTOCTCTCTG ATTAQCTTCA 1800 

TCTTTATCTA TGATGCTTTC AAGAAGATGA TCAAGCTTGC AGATTACTAC CCCATCAACT 1860 

CCAACTTCAA AGTOOaCTAC AACACTCTCT TTTCCTOTAC CTGTGTGCCA CCTGACCCAG 1920 

CTAATATCTC AATATCTAAT GACACCACAC TGGCCCCAGA GTATTTGCCA ACTATGTCTT 1980 

CTACTGACAT GTACCATAAT ACTACCTTTO ACTOGGCATT TTTGTCGAAG AAGGAGTGTT 2040 

CAAAATAGGG AGGAAACCTT GTCGGGAACA ACTCTAATTT TGTTCCTGAT ATCACACTCA 2100 

TGTCTTTTAT CCTCTTCTTG GGAACCTACA CCTCTTCCAT GGCTCTGAAA AAATTCAAAA 2160 

CTAGTCCTTA TTTTCCAACC ACAGCAAGAA AACTGATCAG TGATTTTGCC ATTATCITGT 2220 

CCA7TCTCAT CTTTTGTGTA ATAGAT6CCC TA6TA6GCGT GGACACCOCA AAACTAATTO 2280 

TOCCAAGTGA GTTCAAGCCA ACAAGTCCAA ACOQAGGTTQ GTTOGTTCCA CC6TTTQ6A0 2340 
AAAACCCCTG GTGG G TGTGC CTTGCTGCTG CTATCC0G6C TTTGTT G GIC ACTATACTGA 2400 
TTTTCAT6GA CCAACAAATT ACA6CTGTGA TTGTAAACAG GAAAGAACAT AAACTCAAGA 2460 
AAGGAGCAGG GTATCACTTG GATCTCTTTT OGGTGGCCAT CCTCATGGTT ATATGCTCCC 2520 
TCATGGCTCT TCCGTGOTAT GTAGCTGCTA COGTCATCTC CATTGCTCAC ATCGACA6TT 2580 
TGAAOATCGA GACAOAGACT TCXGCACCTQ GAQAACAACC AAAGTTTCTA GGAGTGAGGG 2640 
AACAAAOAOT CACTGOAACC CTTGTGTTTA TTCTGACTGG TCTGTCAOTC TTTATGGCTC 2700 
CCATCTPGAA GTTTATACCC ATGCCTGTAC TCTATGGTGT GTTCCTQTAT ATQQQAOTAG 2760 
CATCCCTTAA TOGTOTGCAG TTCATGGATC OTCTGAAQCT OCTTCTQATQ CCTCTGAAGC 2820 
ATCAGCCTOA CTTCATCTAC CTGCGTCATG TTCCTCTGCG CAGAQTCCAC CTGTTCACTT 2880 
TGCTGCAGGT GTTGTOTCTG GCCCTGC7TT GGATCCTCAA GTCAACQQTG QCTQCTATCA 2940 
TTTTTCCAGT AATGATCTTO OCACXTGTAG CTQTCAGAAA AGGCATGQAC TACCTCTTCT 3000 

1300 



wo 03/042661 
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CCCAGCATGA CCTCAGCTTC CT6GATGAT6 TCATTCCAGA AAAGGACAAG AAAAAQAAGG 3060 

AGQATGAOAA QAAAAAOAAA AAGAAGAAGO GAAGTCTOQA CAGT6ACAAT GATGATTCTG 3120 

ACTGCCCATA CTCAGAAAAA GTTCCAAGTA TTAAAATTCC AATGQACATC ATGGAACAGC 3180 

AACCTTTCCT AAGCGATAOC AAACCTTCTG ACAOAQAAAG ATCACCAACA TTCCTTGAAC 3240 

5 GCX3VCACATC ATGCTGATAA AATTCCTTTC CTTCAGTCAC T0GC5TATGCC AAGTCCTCCT 3300 

AGAACTCCAG TAAAAGTTQC CTCAAATTAO ACTAQAACTT GAACCTGAAG ACAATGATTA 3360 

TTTCTGGAGG AGCAAG06AA CA6AAACTAC ATTGTAACCT tfmtJ ' i ' Cm ' CTTAAAACTG 3420 

ACATTTGTTG TTAATGTCAT TTOTTTTTGT TTO3CTGTTT GTTTATrTTT TAACTTTTAT 3480 

TTOOTCTCAQ TTTTTGGTCA CAGGCCAAAT AATACAGCGC TCTCTCTGCT TCTCTCTTGC 3540 

10 ATAGATACAA TCAAGACAAT AGT6CACCGT TCCTTAAAAA CAGCATCTGA GGAATCCCCX: 3600 

rmOTf C TT AAACmCAO ATCTGTCCTT TGATAACCAA ATTCTGTCAC TCAAGACACA 3660 

GACACCCACA GACCCTGTCC TTTGCCTCTA TTAAGCAGftG GATGGAAGTA TTAAGGATTT 3730 

TGTAACACCT TTTATGAAAA TGTTGAAGOA ACTTAAAACT TTAGCTTTGO AGCTOTOCTT 3780 

ACTGGCTTGT CTTTGTCTGG TAGAACAAAC CTTGACCTCC AGACAGAGTC CCTTCTCACT 3840 

IS TATAGAGCTC TCCAGGACTG GAAAAAGTGC TGCTATTTTA ACTT6CTCTT QCTTGTAAAT 3900 

CCTAATCTTA OAGTTATCAA AAGAAGAAAA AACTGAAOGT ACTTTACTCC CTATAGAOAA 3960 

ACCATTGCCA TCATT6TAGC AAGT6CT6GA ATOTCCCTTT TTTCCTATOC AACTTTTTTA 4020 

TAACCCTTTA ATGAACTTAT CTGTGGAGTA CATTQAAGAA TATTTTTCTT CCTAGATTTT 4080 

GTTGTTTAAA TTATGGGGCC TAACCTGCCA CTTATTTTTT GTCAATTTTT AAAACTTTTT 4140 

20 TTTAATTAGT GTAAAOAAAA TGAATTTTTT CCT6CAGCAG GAAACATAGT TTTCA6TAGT 4200 

TCTAOCTCTT ATTTGTAGCT GCGAGGCTTT CTGTAAAAAT TGTATXGTAT ATAATGTQAT 4260 

TTTTACACAT ACATACACAC ACAAATACAC AATCTCTAGG GTAA6CCAOA AGGCAAOATC 4320 

AGATTAAAAA CACCATGTIT CTAA6CATCC ATTTTTCCCT TTCTTTAAAA GAAACTTAAC 4380 

TGTTCTATGA AGGAGATTGA GGOAGAAGAG ACAAACTCCT ATOTCATGAO AATAACCGAT 4440 

25 GTTCTGATAA TAGTAGCATC TAGGTACAGA TGCTGGTTGT AT TACC ACXST CAATGTCCTA 4500 

TGCAGTATTG TTAGACATTT TCTCATTTTO AAATATTTOT OTOTTTOIOT ATGTQCTCTQ 4560 

TGCCATGOCT GGTOTATATA TGTGCSU^TGT TAGAAGGCAA AAGAGTGATG GTAGGCAGAG 4620 

GGCAAAQTCA TTGAATCTCT TATGCCAGTT TTCATAAAAC CCAAACCACA TATQAAAAAA 4680 

TCCATTAAGG GTCCAAGAAG TCTGTCCATA TGAAAATGAG GGTAAATATA GrTTATTTCC 4740 

30 CAGGTATCAG TCATTATAAT TQATATAATA GCTCTAACAT 6CAATATAAA ATTCATAGGA 4800 

OTATTAATAG CCCATTTACA CATCTATAAA ATGTAATOGG ATTGCAGAGC TGCAGAGTAC 4860 

AGTGTAACAO TACTCTCATG CAAlTrTTTT CAGGATGCAA AGGCAATTAT TCTTTGTAAO 4920 

CGGGACATTT AGATATATTT GTGTACATAT TATATOTATO TATATTTCAA AOTACCACAC 4980 

TGAAAATTAG ACATTTATTA ACX:AAATTTA ACX3TGGTATT TAAAGGTAAT ATTTTTAATA 5040 

35 TQATACATTA CATATTGTGA ATGTATACTA AAAAAACATT TTAAATGTTA AAATTATAAT SlOO 

TTGAGATTCA TATAAOCACA ACTGTGATAT ATCCTAACTA TAACCAGTTG TTGAGGGGTA 5160 

TACTAQAAGC AGAATGAAAC CACATTTTTT GGTTTGATAA TATGCACTTA TT6ACTO0CA 5220 

CTCATTQTTA TGTTAATTAA GTTATTATTC TOTCTCCTTG TAATTTTGAT TACAAAAATT 5280 

TTATTATCCT GAGTTAQCTG TTACTTTTAC AGTACCTGAT ACTCCTAAAA CTTTTAACTT 5340 

40 ATACAAATTA GTCAATAAT3 ACCCCAATTT TTTCATTAAA ATAATAGTGG TGAATTATAT 5400 

GTTATT6TGT TAAAACCTCA CTTGCC A AAT TCTQQCTTCA CATTTGTATT TAGGGCTATC 5460 

CTTAAAATGA TGAGTCTATA TTATCTAOCT TTCTATTACC CTAATATAAA CTGGTATAAG 5520 

AAGACTTTCC TTrTTTCTTT ATGCATGGAA GCATCAATAA ATTGTTTAAA AACCATGTAT 5580 

AGTAAATTCA GCTTAACOCQ TGATCTTCTT AAGTTAAAGG TACTTTTOTT TTATAAAA6C 5640 

45 TCTAGATAAA ACTTTCTTTT CTQATCATGA ATCAAGTATC TGTGGTTTCA TGCCCCTCTC 5700 

TATACCTTTC AAAGAACTGC TGAAGCAACT TAACHXATCA TTTCAGCCTC TGAOTAOAaG 5760 

TAAAACCTA7 GT6TACTTCT GTTTATGATC CATATTQATA TTTATGACAT GAACACA6AA 5820 

TAGTACCTTA CATTTGCTAA ACA6ACAGTT AATATCAAAT CCTTTCAATA TTCTQGGAAC 5880 

CCAGGGAAGT TTTTAAAAAT GTCATTACTT TCAAAGOAAC AQAAGTAGTT AACCAAACTA 5940 

50 ACAA6CAAAA CCTGAGGTTT ACCTAOTGAC ACCAAATTAT 06GTATTTTA ACTGAATTTA 6000 

CCCATTOACT AAOAATOAAC GQ6ATTTGQT GaTGGTTTTG TTTCXATGCA AACTGOACAC 6060 

AAATTACAAC AGTAAATTTT TTTATAAQTG CTTCTCCCTT CTCCATGATG TGACTTCCX5Q 6120 

AGATAAAGQA TTCAAAAGAT AAAGACAAAO TAOGCTCAGA GTTQTTAACC AGAAAGTCCT 6180 

GGCTGTGGTT GCAGAAACAC TGTTGGAAGA AAAGAGATGA CTAAGTCAAG TGTCTGCCTT 6240 

55 ATCAAAAGAG CAAAAAT6CC TCT6GTTTTG TaTTTGGGAa AAAAATATCT 7GGA0GCACT 6300 

trmTCCX ' l ' G ATAAAAGTCA TCTTCTCTAC TOTQTGAAAT GAATACTTGG AATTCTAATT 6360 

GmTGTGTQ CCA6GGGCAG TA A TGTCCCT GCCT C TTCTC CCAATCAA6G TTOAGGAGTQ 6420 

GGGCTGGGGA GAGGACTTAA CTGACTTAAG AAGTAGGAAA ACAAAAACCT CTCTCCTCAG 6480 

CCTTCCACCT CCAAGAGAGG AGGAAAAACA GTTGTCTGCT GTCTCTAATT CAQTTTGCGT 6540 

60 GTATTTTATG CTCAIGCACC AACGGATACA GAQTAAATCT TTTATCAACT ATATACTGGT 6600 

GTTXAATA6A GAATGATTGT CTTGCX3AGTT V rri Xi Gf V r C C TTTTTTAACT GTGfTTAAAGT 6660 

ACTTQAAATG TATTGACTGC TGACTATATT TTAAAAACAA AATGAAATAA TTTGAGTTGT 6720 

ATTACAGAGG TTGACATTGT TCAGGGATGG GACAAAGCCT TCTTCAATCC TTTTCATACT 6780 

ACTTAATGAT TTTGGTGCAG GAACCTGA6A TTTTCTOATT TATATTTCAT GATATTTCAC 6840 

65 ATTTGCTCTT CACA6CATGA GCAT6AAGCC CAGTGGCACX: AAATGGCTGG GTACAATCAA 6900 

GTOATATTTT GTA6CA0CTC ACTATCrOAA AGGCCAIOAO TTTTCAGATQ ATTTCATTGA 6960 

GCTTCATTQC AOCCTGAAAT TTTAAAAAAG TTGTGTAATA OGCCAACCAO TCAAQTT6TG 7020 

TTTTGGCCAG AGATTTAGAT ATGTCCAATT TCCTGGCTCA TTTCATTQTG CTCTATGGGT 7080 

ACJGTATAAAA AOCAAGAATT CTGTTTCCTA GGCAAACATT GCAACTCAGG GCTAAAGTCA 7140 

70 TGCAGTGAAA CTTTTAGAGC C3U3AAGTAAC TTTGTCCCAO TGCTACAATO TGAAA AGAGT 7200 

QAATA6TTGC CTCTTTTTAO GCATTTTCAT GOCTOGTACA TATTCSTAOQ CATTACTTTT 7260 

CAGAATCAAT ACGCACTTTC AGATATTCTT ATTTTTATTC TCTTAAQTCT TTATTAACTT 7320 

TGGAOAQAQA AATGATGCAT CTTTTTATTT TAAATGAAGT AQATCAACAT GGTGGAACAA 7380 

AATQATAAAQ AACAGAAAAC ATTTCAATAT ATTACTAATA ACTTTTTCCA AT ATAAA TCC 7440 

75 TAAAATTOCT ATAACATAOT ATTTTACAGT TTTATGAAQC TTTCTATTGT GACTTTTATG 7500 

GAATTAAGAO ATQAAGAAOA TOAGATATTT TAOCATTTAT A TTTTTCAAA ATTATATGTA 7560 

TACTTAAAAA TAAAfiTAACT TTATOC 7586 

Seq ID NO I CI 6 3 DNA Sequence 
oO Nucleic Acid Accession fit MM_000958 
Coding sequence i 389.. 1855 " 



11 
I 



21 
I 



31 
I 



41 



51 



1301 



wo 03/042661 



CGGCACAGCC TCACACCTGA ACGCTOTCCT CCCGCPSSKCQ AGACCGGCGG GCACTGCAAA 60 

GCTGGGACTC GTCTTTGAAG GAAAAAAAAT AGCGAGTAAG AAATCCAGCA CCATTCTTCA 120 

CTGACCCATC CCGCTGCACC TCTTGTTTCC CAAGTTTTTG AAAGCTGGCA ACTCTQACCT IBO 

CGGTGTCCAA AAATCGACAO CCACTGAQAC CX^GCTTTGAG AAGCaSAAOA TTTGGCAGTT 240 

TCCAGACTGA GCAOOACAAO GTGAAAGCAG GTTGGAG60Q GGTCCAGGAC ATCTGA6GGC 300 

TGAC OC TQGG GGCTGOTQAG GCTGCCACCG CTGCIGCCXSC TACAGACCCA GCXTT6CACT 360 

CCAAG6CTGC GCAC06CCAG CCACTATCAT GTCCACTCCC GGGOTCAATT CGTCCGCCTC 420 

CTTGAGCCCC GACCGQCTOA ACAQCCCAQT GACCATCCCG GCGGTGATGT TCATCTTCGG 480 

GGTGGTGGGC AACCTGOTGG CCATCX3TGGT GCTGTGCAAG TCGCGCAAGO AQCA6AAGGA 540 

GACGACCrrC TACAOGCTGO TATGTGGGCT GGCTQTCACC QACCT0TT6G GCACTTTGTT 600 

GOTGAGCCCS GTOACCATOS CCAC36TACAT GAAGG60CAA TGGOCTGGGG GCCAGCXX3CT 660 

OTGC3GAGTAC AGCACCTTCA 'II Vm CA'Crr CTTCA8CCIG TCOGGCCTCA GCATCATCTG 720 

CXSCXaVTOAGT GT06AGCX3CT ACCTGGCCAT CAACCATGCX: TATTTCTACA GCCACTACGT 780 

GGAC3UIGCGA TTGGCGGQCC TCACGCTCTT TGCAGTCTAT GOGTCCAAOG TGCTCTTTTQ 840 

GGOGCTGCrC AACATGGGTC TCX3GTAGCTC GOQGCTGCAG TACCCAGACA CCT6GTGCTT 900 

CATOBA CTQQ ACCACCAACQ TGA OGGOGCA 06C0GGCTAC TCCTAC ATGT A06C0GGCTT 960 

CAGCTCCITC CTCATTCTCG OCACOQTCCT CTGCAACGTQ CTTGTGTGOG G06C6CTGCT 1020 

CCGCATGCAC CGCCAQTTCA TGCGCOGCAC CTCGCTGGGC ACOGAGCAGC ACCACGOGGC 1080 

CGCGGCOGCC TOGGTTGCCT CCCGGGGCCA CCCCGCTGCC TCCCCAGCCT TGCCGCQCCT 1140 

CAGGGACTTT 0G60GC03CC GQAGCTTCOG CCGCATC60G G60GC0GAGA TCCAGATX3GT 1200 

CATCTTACTC ATTGCCACCT CCCTGGTGGT GCTCATCTGC TCCATC006C TCX3TGGTGGG 1260 

AGTATTC6TC AACCAGTTAT ATCAGCCAAG TTTOGAaCX^A GAAGTCAGTA AAAATCCAGA 1320 

TTTGCAGGCC ATCCGAATTG CTTCTGTGAA CCCCATCCTA GACCCCTGGA TATATATCCT 1380 

CCTGAQAAAO ACAGTGCTCA GTAAAGCAAT AGA6AAGATC AAATOCCTCT TCTGCCGCAT 1440 

T6G06GGTCC CGCAGGGA6C GCTCCGGACA GCACTGCTCA GACAGTCAAA 6GACATCTTC 1500 

TGOCATGTCA GGCCACTCTC GCTCCTTCAT CTCCOGGQAO CTGAAGGAGA TCAGCAGTAC 1560 

ATCTCAGACC CTCCTGCCAG ACCTCTCACT GCCAGACCTC AGT6AAAATG GOCTTGGAGQ 1620 

CAG6AATTTG CTTCCAGGTG T6CCTGGCAT GGGCCTGGCC CAGGAAGACA CCACCTCACT 1680 

GAGGACTTTG CX3AATATCAG AGACXTTCAGA CTCTTCACAG GGTCAGQACT CAGAGAGTGT 1740 

CTTACTGGTG GATGAGGCTG GTGGGAGOGG CAGGGCT6GG CCTGCCCCTA AGGGQAQCTC 1800 

CCT6CAAGTC AGATTTCCCA GTOAAACSMrT GAACTTATCA 6AAAAATGTA TATAATAG6C 1860 

AAGGAAAGAA ATACA6TACT GTTTCTGGAC CCTTATAAAA TGCIGT6CAA TAOACAGATA 1920 

CATGTCACAT TTAGCTGTGC TCA6AAGGGC TATCATCA 1958 



6eq ID NO I C164 DMA Sequence 

Nucleic Acid Accession #: NM_002659.1 

Coding sequencet 427.. 1434 



CAGTATCCX^ 
TATTTA008T 
CCAGGTTAGG 
ATCACAACTC 
6GAAG6AA6T 
GGG6CTGACT 
TCCTTCCT6A 
CGCGACATGG 
GCCTCTTGGG 
TOCGCCCTOQ 
QA6CXGGA6C 
TATCGGACTG 
AACCA6GGCA 
TCCTGTGGCT 
AGCCCIGAAG 
OGTCCAAAGO 
AATQGTTTCC 
AACGAGGGCC 
TGCAAGGGGA 
G6CCCCATQA 
ATGGTAAOAO 
AGCATGAACC 
GATGTCCAGT 
ATCACCCTGC 
AATCCCCCTC 
GCCCTACAGA 
TCCCAGCTAT 
GAAGGCCGTG 
GTTGTTATTA 
TOG 




GAAAACAGCT 
GOCMITGGGA 
ATTAATATTC 



CACCTTCCAC 
GCTTGAAAAT 
TGGATGCTOC 
GGTAGCX9VCC 
GGCCTCAAT3 
CTCCT6CTGT 
GGCTGCTCCT 
CAGACTGTGQ 
CTGGATCCG6 
OAGCTCAGGC 
ATCTCACAAA 
OAOCTCTTGT 
ATATTATTTA 



31 
I 

TOCTGTTAGC 
TACAGCAGTG 
GCACCCAGGC 
GCCCCTTCAC 
AOGGQAGQAG 
G6GAG6AGTC 
TGCAGGGACC 
CTGCTX3CTGC 
AAGACCAACG 
AOGATOQTOC 
CACTCAOAGA 
ACCQAGGTTG 
TATTCCCGAA 
AGGGGCCX^ 
ACCCACTGGA 
TGTGGGTAGC 
TTCCTGAAAT 
CTGCOGCAGA 
TCT6AAQA0A 
GGCACTCACX3 
TGCCAACATG 
ACTAAAAGTO 
CRGCCTGGCC 
GGAGGCACTC 
GGGAOCCCTT 
CAGTOTG CCG 
GTTGTGTGAA 
TATTATTAAT 
TTTTATACTT 



41 
I 

CAAATAATCA 
GAGAOCQATT 
AATCTGGGGA 
CACCAGCCGG 
GGGGAGGCX3C 
CCTGGGGCCA 
CO0C6CACAG 
TOCTCCACAC 
GGGATTGCCG 
GCTT6TGGGA 
AGAOCAACAQ 

'itiTOTUOurr 

GCOGTTACCT 
ACCAGAGCCr 
TCCAGGAAGG 
TTCCOOOCTG 
GCTGGAACAC 
AT6GC0GCCA 
CTTTCCTCAT 
AACGQAAAAA 
COCACCTOOO 
GCTGTAACCA 
CTQCCCATCT 
TCCTCTGGAC 
TGCCCTTCCC 
ACCTCTCTGG 
6CAGAAGAGA 
ATTGTtGOCO 
ACATAAAGAT 



51 
I 

GGCACATTCA 
GOCOCQQGTC 
CAGAGCTGTG 
CCGCGCCCCG 
CCACX3CATCT 
CAAAACT60C 
GAGCTGGGCT 
CTG06TCCCA 
TGTGGAAQAG 
AGAAGGAGAA 
GACOGTGAGC 
AGACTTGTGC 
OGAATGCATT 
GCAQTOCOGC 
TGAA6AAGG0 
C0GGG6CTGC 
CA0CAAAT6C 
GTGTTACAGC 
TGACTGC06A 
CCAAA6CTAT 
TGA06CCTTC 
CCCAGAOCTG 
CAOCCTCACr 
CTAAACCTGA 
TC6GCTCCCA 
GCCTCAGTTT 
AAAGCXGGAO 
CTGTTQTGTT 
TTTGTACaUS 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1743 



Seq ID NO I C165 DNA Sequence 
Nucleic Acid Accession «: AK027843.I 
Coding sequence I 193.. 1731 



1 11 21 31 41 51 

I I I I I 1 

TTGCTTGAGT CATCTTCTGA A6CTTTAAAA ACAATTGATG AATTGGOCTT CAAGATAGAC 60 

CTAAATAGCA CATCACATOT GAATATTACA ACTOGGAACT TOGCTCTCAO OOTATCATCC 120 

CTGTTACCAO GGACAAATGC AATTTCAAAT TTTAGCATTG GTCTTCCAAG CAATAATQAA 180 

TGGTATTTCC AGATG6ATTT TGA6AGTGQA CAAGTGGATC CACTGGCATC TOTAATTTTG 240 

CCTOCAAACT TACTTGAOAA TTTAA6TCCA GAAOATTCTG TATTAGTTAG AAGAGCACAG 300 

TrTACTTTCT TCAACAAAAC T6GACTTTTC CAGGAT8TA6 GACCCCAAAO AAAAACTTTA 360 

OTGAGTTATQ T0A7G60GT0 CAGTATTGQA AACA7TACTA TOCAGAATCT GAAG6AT0CT 420 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GTTCAAATAA AAATCAAACA TACAAGAACT CACGAAGTGC ATCATCCCAT CTCTGCCTTC 4 BO 

TGGGATCTGA ACAAAAACAA AAGTTTTGGA GGATGGAACA OQTCAGQATQ TOTTGCACAC 540 

AQAOATTCAG ATGCAAGTOA OACAGTCTGC CTOTGTAACC ACTTCACACA CTTTGGAGTT 600 

CTGATGGACC TTCCAAGAAQ TGCCTCACAG TTAGATGCAA GAAACACTAA AGTCCTCACT 660 

TTCATCAQCT ATATTGGGTO TGQAATATCT GCTATTTTTT CA6GA0CAAC TCTCCTQACA 720 

TATGTTGCTT TTGA6AAATT G0QAAG66AT TATCCCTCCA AAATCTTQAT GAACCTGAGC 760 

ACAGCCCTGC TGTTCCIGAA TCTCCTCTTC CTCCTA6ATG GCTGGATCAC CTCCTTCAAT B40 

GTGQATGGAC TTTQCATTGC TGTTOCAGTC CTQTTGCATT TCTTCCTTCT 6GCAACCTTT 900 

ACCTGGATGG GGCTAGAAGC AATTCACATG TACATTGCTC TAGTTAAAGT ATTTAACACT 960 

TACATTG6CC GATACATTCT AAAATTCT6C ATCATTG6CT GGGQTTTGCC T6CCTTAGTG 1020 

GTGTCA6TTG TTCTAGGGA6 CAGAAACAAC AATGAAGTCT ATGGAAAAGA AA6TTATGO0 1080 

AAAGAAAAAG GTGATGAATT CTGTTOGATT CAAGATCCAG TCATATTTTA TGTGACCXGT 1140 

GCTGGOTATT TTQQAQTCAT tfriYrnVW AACATTGCCA TGTTCATTGT GGTAATQGTG 1200 

CAGATCTGTO GQAG6AATGG CAAGAGAAGC AACOGGACCC TQAQAGAAGA AQTGTTAAGG 1260 

AACCroCQCA GTQT6GTTA6 CTTQACCTTT CTGTTGGGCA TQACATGGGG TTTTGCATTC 1320 

TTTGCCTGGG OAOCCTTAAA TATCCCCTTC ATGTACCTCT TCTOCATCTT CAATTCATTA 1380 

CAAOOCTTAT TTATATTCAT CTTCCACTOT OCTATGAAOG AOAATQTTCA QAAACAGTG6 1440 

CGGOQOCATC TCTQCTGTGG TAGATTTCGG TTAGCAGATA ACTCAGATTG GAGTAAGACA ISOO 

GCTACCAATA TCATCAAGAA AAGTTCTQAT AATCTAGGAA AATCTTTGTC TTCAAOCTCC 1S60 

ATTGGTTCCA ACTCAACCTA TCTTACATCC AAATCTAAAT CCAGCTCTAC CACXTTATTTC 1620 

AAAAGGAATA GCCACACAGA TAATGTCTCC TAT6AGCATT CCTTCAACAA AAGTGOATCA 1660 

CTCAGACAOT GCTTOCATG6 ACAAGTCCTT OTCAAAACTG 6CCCATGCT6 ATGGAGATCA 1740 

AACATCAATC ATCCCTGTCC ATCAGQTCAT TGATAAGGTC AAGGGTTATT GCAATGCTCA 1800 

TTCAQACAAC TTCTATAAAA ATATTATCAT OTCAGACACC TTCAQCCACA GCACAAAGTT 1860 

TTAATGTCTT TAAGAAAAAG AAATCAATCT GCAGAAATOT GAAGATTTGC AAGCAGTGTA 1920 

AACTGCAACT AGTGATGTAA ATQTGCTATT ACCTAGOTAA CTGCAT ATAT ATAAOOAATG 1980 

TATTTTOTTA AGAAGGCTTT TGTGAAATTC A6AATTTTTC TTTTTAATAT ATTTCTTCCA 2040 

TGGAAGAGTT GTCATCACTA AAACTTCAGT ACTGAQAGTA ACATQACTCA 0TA6CCACA0 2100 

AAGC7ATGAT TTGTAAAATA TATAATTGAA TCAOAGTAAT CATAATGCAG GGGAGACATT 2160 

CAAATTAGAG ACAAGGGAGA AGCAAT6CTQ AGGAA6ACCC TAS ATAGA GC TCATTTTACT 2220 

CCACCTAATC OTTATATCTG GATATAGGCA TTTTCTGCAT CTTCTTTCTC AACAATAAAC 2260 

TGTCCTTGCT TTGGAGACTT TAAGACATTT CCTAAAGCAC AAATAAAAGC CTCGTATTTC 2340 

CCCATTGAGA GTTTTGTTCC AA6GAATATQ AAOTGAGACA TATGGGTGAG TCATAATAAT 2400 

CAAAATAATT TATQAA6A0C TGGGTCTGCA ATA6CTAGTC TAAAAACTAC TTGTGTGTCA 2460 

GTCCTCTGGT TATAGTATAT AAGAGCCTGA G6A6QTCTGG CAAGATAOAT G6TQTATTAT 2520 

TTATGQATCA GGCTGCTGCA TACAAACCTT GCATACTATT ATGCAGCTTA CCTAACTCTC 2580 

AGACTATTCT GAGTAATGCT TGCTTGCTAA TGAAT6TATA GGAGACCACA TTGTAATTQT 2640 

TCTTAGATQA TGQAGTCCAT GCAQTTTCTT AGAAATCGGT CTGAGTGCAT GCTGTGCTTT 2700 

TTCACATTTG CTCTGGGTTA TCTGGGAAGT ATCAGGTTCT GGQAGGCAAC AGCATTAAQT 2760 

QATAAGAAAA GQAGACATTC TGGCAAAGCC AATCTGCTTA AAGGCAAAGT CCAGAACCTG 2820 

QAACCTAOAa QCCTTTCTCT CTGCACQAAA AACAGOTAGT TTGCAGTCTG A6ATATGQGA 2880 

GAGCTTTTAG GCTACACAGC AACCCAAGGQ ACCTCTCAOC TTTXQCTGAO CTTCAATCAO 2940 

GAAGCTATTT GGCTGGCTCC AGGAGATQAT GAGATAATGA G6TA6TGGGT TTTTTATTAC 3000 

TGTTCCATTT T6CAACATCC TGCAACACCA TCCTGGGAGA CAAQAGCATT ACCCAOCTTO 3060 

GCTTTCACX30 GGGAGGGTTG TATTCAGT 3088 

Seq ID NOi C166 SNA Sequence 

Nucleic Add Accession fft MM_000574.1 

Coding sequence i 66.. 1211 



1 

I 

CCGCTGGGCG 
GCGCCATGAC 
CCCGGCTGCT 
CCOCAGATOT 
CTGTAATAAC 
TGATCTGCCT 
AGGTGCCAAC 
TTCCAGTOGG 
TATCACCAAA 
AAAAGAAATC 
GCATATTATT 
CX3ACTTCTAG 
AGTGCAGAGA 
AACOTQACCA 
TGATTGGAGA 
CACCACCTGA 
CTACCACAGT 
AAACCACCAC 
ATTTTCATGA 
TATCTGGGCA 
TGCTGACTTA 
AGTTTCTTAG 
ATGCTTTCAT 
GTCCT66AAT 
TGAGAGTGAT 
GGGATCACGA 
CC3VCTTATAA 
CCCAATTCAG 
TCTTTGGCTO 
ACAT6CTGGT 
AATCTTCCTT 
CTTAATGTCT 



11 

1 

TAGCTGOGAC 
CGT0G06GGG 
GCTGCTGOTG 
ACCTAATGCC 
GTACAAATOT 
TAAGGGCAGT 
AAOGCTAAAT 
TACTGTTGTG 
ACXAACTTGC 
ATGCCCTAAT 
TGGTGCAACC 
TTTTTGTCTT 
AATTTATT6T 
TT ATOQAT AT 
GCACTCTATT 
ATGCAQAGGA 
AAATGTTCCA 
ACCAAATGCT 
AACAACCCCA 
CACGTGTTTC 
GCCAAAGAAG 
ACTTATCTGC 
TOTCTTTAAG 
CACATTCTTA 
TCCTl'rCVr A 
GGAAAAGAGA 
AGGAAATAAA 
TCTCTTCTAA 
TAAGGCATTT 
OAAOCa OGGQ 
tGTTSeACAA 
tTAAAAOTAT 



21 

I 

TCGGOGGAGT 
CXX3AGCQTGC 
CTGTT6TGCC 
CAGGCAGCTT 
GAAGAAAGCT 
CAATGGTCRQ 
TCTGCATCCC 
GAATATGA6T 
CTTCAGAATT 
006GGAQAAA 
ATCrCCTTCT 
ATTTCAGGCA 
CCAGCACCAC 
AQACAGTCTG 
TATTOTACTG 
AAATCTCTAA 
ACTACAGAAG 
CAAGCAACAC 
AATAAAG6AA 
AOGTTGACAO 
AGTTAAGAAG 
ATATTGGATA 
ATGTGTTAGG 
QCACACCTAC 
AAAOTOTAAO 
A6GAAAGTGA 
AAATGAAAAA 
GCAAAATTGC 
TCATCTTTCC 
T6TT0ATGGT 
ATAOAOTTTO 
GCAGAGATAC 



31 
I 

CC0GG0GG06 
CXX30GGCGCT 
TGCOGGCGGT 
TG6AAGGO0G 
TTGTGAAAAT 
ATATTGAAGA 
TCAAACAGOC 
GCCGTCCAGG 
TAAAATGOTC 
TA06AAATG6 
CATGTAACAC 
GCTCTGTCCA 
CACAAATTGA 
TAAC9GTAT6C 
TGAATAATGA 
CTTOCAAGGT 
TCTCACCAAC 
GQAQTACACC 
GT6GAA0CAC 
GTTTGCTTGG 
AAAATACACA 
AAATAAATGC 
AATGTCAACA 
ACCTCTTGAA 
AAAGCATA GA 
TTTTTTTCCA 
CATTATTTGG 
TAAAG AGAGA 
TTOGGGTTGG 
GATAAGGGAO 
QAAAAAGCCT 
tACAATATTA 



41 

I 

CGTCCTTQTT 
GCCCCTCCTC 
GTGGGGT6AC 
TACAAOITTT 
TrcrCGGGAG 
GTTCTGCAAT 
TTATATCACr 
TTACAGAAGA 
CSlCAaCAOTC 
TCAGATTGAT 
AGGGTACAAA 
GTGGAGTGAC 
CAATGGAATA 
ATGTAATAAA 
TGAAGGAGAQ 
CCCACCAACA 
TTCTCAGAAA 
TOTrrCCAGO 
TTCAGGTACr 
GACGCTAGTA 
CAAGTATACA 
AATTGTGCTC 
QAGCAAGQAG 
AATAGAACAA 
QATTTGTTCO 
CAAGATCTGT 
ATATCAAAAG 
TGAACCACAT 
CAAAATATTT 
GAATATAOAA 
GTGAAAGGTG 
ACATAAOAAA 



51 

1 

CTAACCCGGC 
GGGGAGCTOC 
TGTGGCCTTC 
CCCGAQGATA 
AAGGACTCAG 
OGTAGCTGOQ 
CAGAATTATT 
6AACCTTCTC 
GAATTTTGTA 
GTACCAGGTG 
TTATTTOGCT 
COGTTGCCAG 
ATTCAA6GGG 
GQATTCACCA 
TGQAGtCGCX: 
GTTCAGAAAC 
ACCACCACAA 
ACAACCAAGC 
ACCCXJTCTTC 
ACCATGGGCT 
GACTGTTCCT 
TTC3VTTTAGG 
AAAAAAGGCA 
CTTGCAGAAT 
TATTTAOAAT 
AATGTTATTT 
CAAATAAAAA 
TATAAAGTAA 
TAAAGGTAAA 
TOAAAGACTB 
TCTTCTTTQA 
AOATTATATA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1660 

1920 
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TTATTTCTGA ATCGAGATGT CCATAGTCAA ATTTGTAAAT CTT A TTCTTT TGTAATATTT 1980 

ATTTATATTT ATTTATGAC3V GTGAACATTC TGATTTTACA TGTAAAACAA GAAAAGTTGA 2040 

AOAAGATATO TGAAGAAAAA TCTATTTTTC CTAAATAGAA ATAAATGATC CCATTTTTTG 2100 

GT 2102 

Seq ID MOt CI 6 7 DMA Sequence 

Nucleic Acid Accession 0t Eos sequence 

Coding sequence: l . .2651 



1 11 21 31 41 51 

i I I I I I 

ATGGACACCT CCCGGCTCGG TGTQCTCCTG TCCTTGCCTQ TGCTGCTGCA GCTGGCGACC 60 

GGGGGCAGCT CTCCCAGGTC T G GTGTGTTG CTGAGGGGCT QCCCCACACA CXOTCATT6C 120 

^ . GACO CCSAOS GCAGGATGTT GCTCAGGGTG GACT G CT003 AGCrGGGGCT CT0GGA6CT6 IBO 

ID CCTTGCAACC TCAGOGTCTT CACCTCCTAC CTAGACCTCA GTATOAACAA CATCAGTCAG 240 

CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGOGGGAAAC 300 

GCTCTQACAT ACATTCCCAA GGOAQCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360 

CTGCAGAATA ATCAGCTAAG ACAOQTACCC ACAGAAGCTC TQCAGAATTT GOGAAGCCTT 420 

CAATCCCrOC GTCTG6ATGC TAACCACATC AGCTATGIGC CCCCAAOCTQ TTTCAGTG6C 480 

CTGCATTOCC T6AGGCACCT GTGGCTG6AT GACAATOCQT TAACAQAAAT CCOCGTCCAG 540 

GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600 

ATACCAQACT ATGCCTTTGO AAACCTCTCC AGCTTGGTAQ TTCTACATCT CCATAACAAT 660 

AGAATCCACT CCCTGG6AAA GAAATGCTTT GATGGGCTCC ACAGCCTAQA GACTTTA6AT 720 

TTAAATTACA ATAACCTTQA TQAATTCCGC ACTGCAATTA GQACACTCTC CAACCTTAAA 780 

GAACTACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA 6ATCTQCTTT TCAACATTTA 840 

CCTOAACTAA GAACACTOAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTQATTTA 900 

ACTGQAACTG CAAACCTGGA QAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 960 

CAAACOOTCT OCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1020 

^ GAA6ATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAaA AAATTOACCT AAGACATAAT 1080 

jU GAAATCTACO AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATOGCTGAAT 1140 

TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1200 

ATAAAGCTGG AOCTATOQTC CAACCTCCTO TCQTCTTTTC CTATAACTGa GTTACATGGT 1260 

TTAACTCACT TAAAATTAAC AGOAAATCAT 60CTTACA6A GCTTGATATC ATCTGAAAAC 1320 

TTTC CAOAAC TCAAGGTTAT AQAAATOCCT TATOCTTACC AGTGCTGTGC ATTTGQAGTO 1360 

TGTGAOAATG CCTATAAOAT TTC7AATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1440 

GACCTTCATA AGAAAGATGC TGGAATOTTT CAGGCTCAAQ ATOAAOOTQA CCTTGAAGAT 1500 

TTCCTGCTTG ACTTTGAGGA AGACCTQAAA GCCCTTCATT CAGTGCAOTG TTCACCTTCC 1560 

CCAGGCCCCT TCAAACCCTQ TGAACACCTQ CTTGATGGCT GGCTQATCAG AATTGGAGTG 1620 

TGGAOCATAG CAGTTCTG6C ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAQA 1680 

TCCCCrCTGT ACATTTCCCC CATTAAACTG TTAATTGGGQ TCATCGCAGC AGTGAACATG 1740 

CTCAOGGGAG TCTCCAGTGC CGTGCTGGCT GGTOTQOATO OGTTCACTTT TGGCAOCTTT 1800 

OCACGACATG OTGCCTGGTO GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTCTCC 1860 

A TTTTT GCTT CAGAATCATC TGTTTTCCTG CTTACTCTGa CAGCCCTGGA OOGTGGGTTC 1920 

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 1980 

ATTTTOCTCT GTOCCCTGCT GGCCTTGACC ATGGCCGCAG TTCCCCTGCT GGGTGGCAGC 2040 

AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG GGGAOCCCAG CACCATGGGC 2100 

TACATGQTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATOATGAC CATTGCCTAC 2160 

ACCAAGCTCT ACTGCAATTT GGACAAGGQA OACCTGQAGA ATATTTaOGA CTOCTCTATG 2220 

cn QTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280 

•)U TT6TCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAQTC CTGAAGTAAT TAAGTTTATC 2340 

CTTCT GGTG G TAQTCCCACT TCCTGCATGT CTCAATCCCC TTCICTACAT CTTGTTCAAT 2400 

CCTCACTTTA AGGAGGATCT GGTQAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAA GCTTQATGTC AATTAACTCT GATGATGTCG AAAAACAGTC CTOTGACTCA 2520 

ACTCAAGCCT TOQTAACCTT TACCAGCTCC AGCATCACTT ATGACXTTGCC TCCCAGTTCC 2580 

GTGCCATCAC CAGCTTATCC AGTGACTQAG AGCTGCCATC 'm'CCrCiXrr GGCATTTGTC 2640 

CCATOTCTTA A 2651 



Seq ID NO: CI 6 8 DNA Sequence 

Nucleic Acid Accession «: MM_003667.2 

Coding sequence: 49.. 2772 

1 11 21 31 41 51 

I i I I I i 

TGCTGCTCTC OGCCOGCGTC CGGCTCGTGG CCCCCTACTT CGGGCACCAT GGACACCTCC 60 

OGGCTCGGTG TGCTCCTGTC CTTGCCTGTG CTGCTGCAGC TGGCGAOOGG GGGCAGCTCT 120 

CCCAGGTCTG GTGTGTTGCT GAGGGGCTGC CCCACACACT GTCATTOCGA GCCCOACQGC 180 

AOGATGTTGC TCAGGGTGGA CTGCTCCGAC CTGGGGCTCT COOAOCIGCC TTCCAACCTC 240 

AGOGTCTTCA CCTCCTACCT AQACCTCAGT ATGAACAACA TCAQTCAGCT GCTCCOGAAT 300 

CCCCT6CCCA GTCTCOGCTT CCTGGAGQAG TTACGTCTTG OGGQAAAOGC TCTGACATAC 360 

ATTCCCAAGQ GAGCATTCAC T60CCTTTAC AGTCTTAAAQ TTCTTATGCT GCAGAATAAT 420 

CAGCTAA6AC ACGTACCCAC AGAAGCTCTG CAiQAATTTQC OAAGCCTTCA ATCCCTQCOT 480 

CTGGATGCTA ACCACATCAG CTATGTGCCC CCAAGCTOTT TCAGTG6CCT 6CATTCCCT6 540 

AGGCACCTGT GGCTGGATGA CAATQOGTTA ACAGAAATCC COGTCCAGGC TTTTAGAAOT 600 

TTATOGGCAT TGCAAGCCAT GACCTTGGCC CTGAACAAAA TACACCACAT ACCAQACTAT 660 

GCCTTTGGAA ACCTCTCCAG CTTGGTAQTT CTACATCTCC ATAACAATAG AATCCACTCC 720 

CTGGGAAAGA AATGCTTTGA TGGGCTCCAC AGCCTAGAOA CTTTAOATTT AAATTACAAT 780 

AACCTTOATG AATTCCCCAC TQCAATTAGQ ACACTCTOCA ACCTTAAAOA ACTAGOATTT 840 

CATAGCAACA ATATCAGGTC GATACCTGAG AAA6CATTT0 TAGOCAACCC TTCTCTTATT 900 

ACAATACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 960 

CCTOAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTOATTTA 1020 

ACTGQAACTG CAAACCTGGA QAOTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 1080 

CAAACOGTCT GCAATCAGTT ACCTAATCTC CAAGTQCXAO ATCTOTCTTA CAACCTATTA 1140 

OAAQATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAOA AAATTOACCT AAGACATAAT 1200 

OAAATCr AOS. AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGOCTCOO ATCGCTGAAT 1260 
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TTGGCTTGGA 
ATAAAGCTGG 
TTAACTCACT 
TTTCCAQAAC 
TG TGAG AATG 
QACCTTCATA 
TTCCTOCTTO 
CCAG6CCCCT 
TGOACCATAG 
TCCCCTCTGT 
CTCACX3GGA0 
GOVCGACATG 
ATTTTTGCTT 
TCTGTGAAAT 
ATTTTGCTCr 
AA0TATG0G3 
TACATGGTOG 
ACCAAOCTCT 
GTAAAACACA 
TTGTCCTTCT 
CTTCTGGTGG 
CCTCACTTTA 
AAACACCCAA 
ACTCAAGCCT 
GTGCCATCAC 
OCATGTCTCT 
GAGATTGAGT 



ACAAAATTGC 
ACCTATOGTC 
TAAAATTAAC 
TCAAGGTTAT 
CCTATAAGAT 
AOAAAGATGC 
ACTTTGAOGA 
TCAAACCCTQ 
CAGTTCTGGC 
ACATTTCCCC 
TCTCCAGTGC 
GTGCCTGGT6 
CAGAATCATC 
ATTCTGCAAA 
GTGCCCTXSCT 
OCrOCCCTCT 
CTCTCATCTT 
ACTQCAATTT 
TTQCCCTOTT 
CCTCTTTAAT 
TAGTOOCACT 
AGGAGGATCT 
GCTTQATGTC 
TGGTAACCTT 
CAOCTTATCC 
AATTAATATG 
ATATGAGAGC 



TATTATTCAC 
CAACCTCCTG 
AGGAAATCAT 
AGAAAT6CCT 
TTCTAATCAA 
TCGAATOTTT 
AOACCTGAAA 
TGAACACCTG 
ACTTACTTGT 
CATTAAACTG 
OGT G CTG G CT 
GGAQAATGGG 
TGTTTTCCTG 
ATTTGAAAOG 
GGCCTTGACC 
CTGCCTGGCT 
GCTCAATTCC 
GGACAAGGQA 
GCTCTTCACC 
AAACCTTACA 
TCCTGCATGT 
GGTGAGCCTG 
AATTAACTCT 
TACCAGCTCC 
AGTGACTGAG 
TGAAGGAAAA 
AGTAATTAAT 



CCCAATGCAT 
TCGTCTTTTC 
GCCTTACAGA 
TATGCTTACC 
TGGAATAAAG 
C AGGCTC AAO 
GCOCTTCATT 
CTTGATGGCT 
AATGCTTTGQ 
TTAATTGGGG 
GGTGTGGATG 
GTTGGTTGCC 
CTTACTCTGG 
AAAGCTCCAT 
ATGGC06CA6 
TTOCCTTTTG 
CTTTGCTTOC 
GACCTGGAQA 
AACTGCATCC 
TTTATCAGTC 
CTCAATCCCC 
AGAAAGCAAA 
QATGATGT03 
AGCATCACTT 
AOCTOCCATC 
TGTTTTCAAA 
AAGAAGAGCT 



TTTCCACTTT 
CTATAACTGG 
GCTTGATATC 
AGTGCTGTGC 
GTGACAACA6 
ATGAAOGTGA 
GAGT6CAGTG 
G6CTGATCAG 
TQACTTCAAC 
TCATCGCAGC 
OGTTCAGTTT 
ATGTCATTGO 
CAGCCCTGGA 
TTTCTAGCCT 
TTCCCGTQCT 
OGGAGCCCAO 
TCATGATQAG 
ATATTrGGOA 
TAAACTGCCC 
CTQAAGTAAT 
TTCTCTACAT 
CCTAGGTCTG 
AAAAACAOTC 
ATGACCT6CC 



gottgagaac 

GAOQTGAAAC 



GCCATCCCTA 
GTTACATGGT 
ATCT6AAAAC 
ATTTGGAGTG 
CAGTATGGAC 
CCTTGAAGAT 
TTCACCTTCC 
AATT6GAGTG 
AGTTTTCAGA 
A6TGAACATG 
TOGGAGCTTT 

'rrr rr rGit x 

GCGTGGG7TC 
GAAAGTAATC 
GGGTGGCAGC 
CAOCATGGGC 
CATT60CTAC 
CTGCTCTATG 
TGTGGCTTTC 
TAAOTTTATC 
CTTGTTCAAT 
GACAAGATCA 
CTOTQACTCA 
TCCCAGTTCC 
GGCATTT6TC 
CTGAAAATGT 
TOGQTTTAAA 



8eq ID NO: CI 6 9 DNA Sequence 

Nucleic Acid Accession #: mi^003S06.1 

Coding sequence: 259.. 2379 



1320 
X380 
1440 
1500 
1560 
1620 
16B0 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
25B0 
2640 
2700 
2760 
2820 
2880 



1 11 21 31 41 51 

I I i I I I 

GCAGCTCCAG TCCCX3GACGC AACCCCGGAG CCXnCTCAGO TCXXT6GGGG GAA06GTGGG 60 

TTAGACGOGG AOGGGAAGGG ACAGOGGOCT TC6ACCG0CC CCCGAQTAAT TGACCCAGGA 120 

CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT T6AACATTTT 180 

ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCX: ATGTGGTAAA 240 

ATGAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACA6TCT CTTCACCT6T GAACCAATTA CTGTTCCCAQ AT6TATGAAA 360 

ATGGCCTACA ACATQAOGTT TTTCCCTAAT CTGATGGGTC ATTATGAOCA GAGTATTGCC 420 

GOGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTGACC AAACATT6AA 480 

ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GOTTCCACCT 540 

TGTCGTAAAC TTTGTGAQAA AGTATATTCT QATTGCAAAA AATTAATTQA CACTTTTGGG 600 

ATC06ATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 

GTAACTTTTG ATCCAGACAC AGAATTTCTT GOTCCTCAGA AaAAAACAOA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAG6CAT CTTAAGACTT CTGGGG6ACA AGGATATAAG 780 

TTTCTGGGAA TTQACCAGTG TGCGCCTCCA TGCCXXSU^CA TGTATTTTAA AAGTGATGAG 840 

CTAGAGTTTG CAAAAAGTTT TATTQGAACA OTTTCAATAT TTTOTCTTTG TGCAACTCTG 900 

TTCACATTCC TTACTTTTTT AATTQATGTT AQAAGATTCA GATACCCAGA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATrOTA TCTCTTATGT ACTTGATTOG ATTTTTGCTG 1020 

GGOQATAGCA CAGCXTTGCAA TAAGGCAGAT GA6AAGCTA0 AA CTl t S OT G A CACTGTT G TC 1080 

CTAGGCTCTC AAAATAAGGC TTGCACGGTT TTGTTCATGC TTTTGTATTT TTTCACAATQ 1140 

GCTGGCACTG TGTGGT6GGT GATTCTTACC ATTACTTGGT TCTTAGCTQC AGGAA6AAAA 1200 

TGGA&TTOTG AAOCCATCOA GCAAAAAGGA OTGTGQTTTC ATQCiaTTGC ATGGGQAACA 1260 

CCAGGTTTCC TGACTOTTAT GCTTCTTGCT CTGAACiAAAO TTGAAGGAGA C3UVCATTAGT 1320 

G6AGTTTGCT TT G T TG GCCT TTATGACCTG GATGCTTCTC GCTACTTT G T ACTCTTGCCA 1380 

CTGTOCCTTT GTOTOTTTOT TOGGCTCTCT CTTCTTTTAO CTGGCATTAT TTCCTTAAAT 1440 

CATGTTCGAC AAGTCATACA ACATGATGGC OGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTOGAATTG GAOTCTTGAG CGGCTTGTAT CTTGTGCCAT TAQTQACACT TCTOGGATGT 1560 

TAG6TCTATG AOCAAGTGAA CA6GATTACC T06GAGATAA CrTGGGTCrC TGATCATTGT 1620 

CGfTCAOTACC ATATCCCATQ TCCTTATCAG GCAAAA6CAA AA6CT0GACC AGAATTGOCT 1680 

TTATTTATGA TAAAATACCT QATGACATTA ATTGTTGGCA TCTCTGCTGT CTl'Cl'GGG'X"! 1740 

GQAAOCAAAA AGACATGCAC A0AATG6GCT GGGTTTTTTA AAGGAAATOG CAAQAQA6AT 1800 

CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860 

TCTAAAGTTA AACACAAAAA GAAOCACTAT AAAOCAAGTT CACACAASCT GAAGGTCATT 1920 

TCCAAATCCA T6GGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAOTAGCA 1980 

ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACA6 AAATCCAAAC CTCACCAGAA 2040 

ACATCAATQA GAGAGGTGAA AGCGGAG6QA 6CTA0CACCC CCAGGTTAAO AOAACAGOAC 2100 

TGTGGT6AAC CTGCCTOSCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCX3AC 2160 

GGQAAGGGGC AGGCAGGCAG TGTATGTQAA AOTOO OOGQ A OTQAAGGAAO QATTAOTCCA 2220 

AAGAGTGATA TTACTGACAC TGGCCTGOCA CAGA6CAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAQAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAOTQ 2340 
AGAAAAQAGC AGG6AGGTGG TT6TCATTCA GATACTTGAA GAACATTTTC TCTCQTTACT 2400 
CA6AAGCAAA TTrGTOTTAC ACTGGAAGTG ACCTATGCAC TGTTTT6TAA GAATCACIGT 2460 
TAOSTTCTTC TTTTGCACTT AAASTTGCAT TGCCTACTOT TATACTGGAA AAAATAOAOT 2520 
TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAO 2580 
AAATGTGCAO GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGQAATCTT 2640 
CCTTTTCTAT TTATGAAQAT TCTACTCTTO GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 
TTTACCTTTT TGATATAAAA TCAAOATATT TCTTTGCTGA AOTATTTAAA TCTTATCCTT 2760 
GTATCTTTTT ATACATATTT 6AAAATAA8C TTATATGTAT TTGAACTTTT TTQAAATCCT 2820 
Ant3UU3TAT TTTTATCATG CTATTGTGAT ATTTTA6CAC TTT60TAGCT TTTACACTGA 2860 
ATTTCTAAGA AAATTGTAAA ATAOTCTTCT TTTATACTOT AAAAAAAGAT ATAOCAAAAA 2940 
GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATAGCTTAGC ATCTAAAATO 3000 
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TOTGATTTTT ATAGTCTOGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060 

GGTGCTTACT CAAAGAGTGT CCACTATTQA TTGTATTATO CTQCTCACTO ATCCTTCTGC 3120 

ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 3180 

GGOCAAGtGC AATTGACTTC CCTTTTTTAA T G TTTCATOA CCACOCATTO ATTGTATTAT 3240 

AACCACTTAC AOTTOCTTAT ATTTTTTOTT TTAACTTTTO TTTCTTAACA TTTAGAATAT 3300 

TACATTTTQT ATTATACAGT ACCTTTCTCA GACATTTTQT AO 3342 



Seq ID NO: C170 DNA Sequence 
Nucleic Acid Accession fit NM__OO0SB2 
Coding sequence: 88.990 



1 11 21 31 41 SI 

I t I I I I 

GCA6AGCACA GCATOGTCGG GACCAGACTC GTCTCAGGCC AGTTGGAGCC TTCTCAGCCA 60 

AACGCOGACC AAGOAAAACT CACTACCATG AGAATTGCAG TQATTTGCTT TTGCCTCCTA 120 

GOCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTO GAAGTTCTGA GQAAAAGCAO 180 

CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAQ 240 

CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAOTCCA ACGAAAGCCA TQACCACATG 300 

GATGATATGG AT6ATGAAQA TGATQATQAC CATGTG6ACA GCCAGGACTC CATTGACTCG 360 

AAOGACTCTG ATOATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 

TCTQATGAAT CTGATQAACT GGTCACTQAT TTTCCXZACGG ACCTOCCAQC AACCQAAGTT 480 

TTCACTCCAO TTQTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540 

GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TOATGCTACyv 600 

GACGAOQACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 

CCOGTTBCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCQTGGGAA OGACAGTTAT 720 

GAAAGGAOTC AGCTQQATOA CGAGAGT6CT GAAACCCACA QCCACAAGCA GTCXZAGATTA 760 

TATAAOOGGA AAGCXAATGA TGAGAGCAAT 6AGCATTC0G ATGTGATT6A TAQTCAGOAA 640 

CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 

GTTGTAGACC CCAAAAGTAA GGAAQAAQAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 

TTAGATAOTG CATCTTCTGA GGTCAATTAA AAGGA6AAAA AATACAATTT CTCACTTTGC 1020 

ATTTAOTCAA AAGAAAAAAT GCTTTATA6C AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 

CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140 

ATTAGTTTAG TTTOTGQCTT CATGGAAACT CCCTGTAAAC TAAAAOCTTC AGGGTTATGT 1200 

CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 

TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 1320 

ATAACATTTT ATOTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 

TATCTTTTTO TGQT G T G AAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGQTGGTG TC 1440 
AATTQCTTAT ' nmi T rCCC AOS G TTGTCC A6CAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 1524 



Seq XD KO: C171 SNA Sequence 
Nucleic Acid Accession ftt NM_002821 
Coding sequence 1 150.. 3 3 62 

1 11 21 31 41 51 

i 1 I I i I 

AACTCCGGCC T06GGAC6CC TGGGGGTOGG GCTCCG6CTG 0Q0CTQCT6C TGCGGCGCGC 60 

GGGCTCOGGT GCGTCCGCCT CCTQTGCCCG 0CGC6GA0CA GTCTOOGGOC CSCOOTGCGC 120 

CCTCAGCTCC TTTTCCTQAQ CCCGCCGCGA TGGGAOCTGC GCGGGGATCC CCGGCCAGAC 180 

CCCGCOGGTT GCCTCTGCTC AGCGTCCTGC TGCTGCCOCT GCTGGGOQGT ACCCAGACAG 240 

CCATT6TCTT CATCAAGCAO CCGTCCTCCC A6GATGCACT 6CAGGGGCGC 06GGCGCTGC 300 

TTCGCTGTGA GGTTGAGGCT COQGGCCOGG TAGATGTGZA CTGGCTGCTC GATG G6QC CC 360 

CTGTCCAGGA CACQGAGCGO CX3TTT0SCCC AGGGCAGCAG OCTGAGCTTT GCAGCTGTGG 420 

AC0GGCT6CA GGACTCTGGC ACCTTOCAGT GTGTG6CT0G GQATGATGTC ACTGGAQAAO 480 

AAGCCCGCAQ TGCCAACGCC TCCTTCAACA TCAAATGGAT TQAGQCAGGT CCTGTGGTCC 540 

TGAAGCATCC AGCCTCGGAA GCT6AGATCC AGGCACAGAC CCAGOTCACA CTTCCTTGCC 600 

ACATTGATGG GCACCCTOGO CCCACCTACC AATGGTTCCO AOATGOGACC CCCCTTTCTO 660 

ATGGTCAQAG CAACCACACA GTCAGCAGCA AGGAGOGGAA CC TOAOSC TC OGGC CAGC IG 720 

GTCCTGAOCA TAGTGGGCTG TATTCCTGCT GCGCOCACAG TQCTTTTGGC CAQGCTTGCA 780 

GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TQCCAGGGTG GTGCTGGCAC 840 

CCCAG6AC6T G6TAGTAGC6 A6GTATGAGG AGGCCAT6TT CCATTGCCAO TTCTCAGCCC 900 

AGGCACCCCC QAGCCTGCAO TGOCTCTTTG AGQATOAOAC TCOCATCACT AAC06CAOTC 960 

GCCOCCCACA CCTCCGCAOA GCCACAGTGT TTGCCAAC3Q0 GTCTCTGCTO CTQACCCAOO 1020 

TCCGGCCACO CAATGCAGGG ATCTACOOCT GCATTGGCCA G6GGCAGAGG GGCCCACCCA 1080 

TCATCCTGGA AGCCACACTT CACCTAGCAQ AGATTGAAGA CATGCCGCTA TTTGAGCCAC 1140 

GGGTOTTTAC AGCTGGGAGC GAGGAGGGTG TGACCTGCCT TCCGCCCAAG GOTCTGCCAG 1200 

AOCCCAaOOT GTOOTOGGAG CAOGCGGQAO TCCGGCTOCC CACCCATGGC AGGQTCTACC 1260 

AGAAGG6CCA 06A6CTGGTG TTGGCCAATA TTGCTQAAAG TGAT6CTG6T QTCTACACCT 1320 

OCCACGOGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACT GTGGCCACTG 1380 

TGCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCQGCTACT 1440 

TGGATTGCCT 6ACCCAGGCC ACACCAAAAC CTACAGTTOT CTGGTACAOA AACCAGATGC 1500 

TCATCTCAGA OQACTCAOQG TTOGAGOTCT TCAAGAATGG GACCTTGCGC ATCAACAGGO 1560 

TGGAGGTOTA TGATGGGACA TGGTACGGTT GTATGAGCAO CACCCCAGCC GGCAGCATOO 1620 

AGGOQCAAGC COGTQTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 1680 

AGCAOTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAQAGA 1740 

AGGCCACTAT TAAGTGGQAA CGGGCAGATG 6GAQCAGCCT CCCAGAGTGG GTGACAGACA 1800 

A0GCIG6GAC CCTQCATTrT GCOOGG GT GA CTCQAGAT6A CGCTGGCAAC TACACT TGCA 1860 

TTOCCTCCAA CGG6C0GCAG GQCCAGATTC OTQCCCATGT CCAGCTCACT GTGQCAGTTT 1920 

TTATCACCTT CAAAGTGGAA CCAGAGCOTA C3GACT6TOTA OCAGGGCCAC ACAGCCCTAC 1980 

TGCAGTGCGA GGCCCAGGGQ GACCCCAAGC OOCTCATTCA GTOQAAAGGC AAGGACCGCA 2040 

TCCTGGACCC CACCAAGCTG GGACCCAGGA TGCACATCTT GCAGAATGGC TCOCTGGTGA 2100 

TCCATQAOOT GOCCCCTQAO GACTCAGOCC GCTACACCTG CATT6CAGGC AACA0CT6CA 2160 

ACATCAAGCA CACGGAGOCC CCCCTCTATO TOQTGQACAA GCCTGT6C0G GAGOAOrGGG 2220 

AGGGCCCTGG CAQCCCTCCC CCCTACAAOA TOATCCAGAC CATTGGGTTG TOOGTOOOTG 2280 

CCOCTGTGaC CTACATCATT GCCGT G CTGG OCCTCATOTT CTACTGCAAO AAGCGCTGCA 2340 
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AAGCCAAGGG 
□AGGGCCTTT 
GCTTGGGCTC 
TCCCAGGGTC 
TCCTG6CAAA 
GCCTGCAGAC 
GGAAGCTQAA 
ACTACATGGT 
A0A6CAAGQA 
GCACCCA60T 
TGGCTGOGCG 
TCAOGAAGGA 
GCTCjGATGTC 
CCTTG60TGT 
CAGATGATGA 
GCTGCCCTTC 
G6CCCTCCTT 
GAGGAGGQAG 
CAGCATOKia 
TTGCTGAGGT 
GGCTGACTTG 
CTCTTCCTCT 
TTCTCCCCTT 
AGGCTTGOGA 
AGGGTTAATG 
ACACAGCAAG 
CCCCACCCTT 
CTTTTQACAC 
TOCAGOGTGG 
GCCATCCTTA 
TGTTTTTGTT 



6CTGCAGAAG 
GCAGAA06GG 
OGGCXrCOGOQ 
TAGCCTGCAO 
GGCTCAGGGC 
GAAGGAT6AG 
CCACGCCAAC 
GCTG6AATAT 
TQAAAAATTG 
AGGCCTGOGC 
TAACTGCCTG 
TGTQTACAAC 
CCCOGAGGCC 
6CTGATGTG0 
AOTACTGGCA 
CRAACTCTAT 
CAGTQAGATT 
CCCOCTCAGG 
GGCAAOATCC 
CTGAGCAGGG 
GACCCAAACT 
ATCAGGGACA 
GACOGGGTCC 
TQAOCTGQGT 
AGTCTCTTGC 
TOAQTCCTCC 
CTCTCCTTTC 
TATATAAACC 
GGTGGGTGGG 
CCCCACACTT 
TTTACACTOG 



CAGCCOGAGG 
CAGCCCTCAG 
GCCAGCAACA 
OOCATCAGCA 
TTGGAOQAGG 
CAGCA6CAGC 

GTGQATCTGG 
AAGTCftCAGC 
AT0QA6CACC 
GTCAGTGCCC 
AGTQAGTACT 
ATCCTG6AGG 
GAAOTGTTTA 
6ATTT6CA60 
OGGCTGATGC 
GCCAGCGCCC 
ATGGCCTGGG 
CTOTCCTCCT 
CCTGGOCTTT 
GGGGGACTA6 
6TGTGGGTGC 
AACTCTGCCA 
TTGTGOOOAG 
CCACTGGTGC 
CCACTCTGGG 
CTCATCCTAA 
GCCCTTTTTB 
GATGG6A06T 
TTATT G TTGT 
CTGCTCTCAA 



GOGAGGAGCC 
CAOAQATCCA 
AAGOGCACAO 
OGCTGGGGAA 
QAOT60CAGA 
TGQACTTCOG 
TCCTQ0G6GT 
GAGACCTCAA 
CCCTCA6CAC 
TOTCCAACAA 
AGAGACAAGT 
ACCACTTCOG 
6TGACTTCTC 
CACATOQAQA 
CTGGOAAGGC 
AGCX3CTGCTG 
TGGOAQACAO 
CAGGGGAOQA 
GG6CCCTGAG 
CCTOCTCTTC 
GGCTTTQAGC 
CACAOOTAAC 
CTCATCTGCC 
TTOCTTAATA 
ACTTGGGGGT 
CTTOTGCACA 
GT8CCTGGCA 
TATGCACCAC 
A0GGGTGG6C 

c o Trrm ft f 

TAAATAA6CC 



AGAOATGGAA 
AGAAGAAGTG 
CACAAGTGAT 
GAGTQA6TTT 
GAOOCTGQTA 
GAGGGAGTTG 
GTQCOGGQAQ 
GCA6TT0CX0 
CAAOCAGAAG 
C06CTTTGTG 
GAAGGTGTCT 
CCAGGCCTGG 
TACCAAGTCT 
GATGCCCCAT 
7AQACTT0CT 
GGCCCTCAGC 
CACCGTGGAC 
CATCTCTAGA 
6TG0CCTA0T 
CTCACCCTCA 
TGGGCAGTTT 
CCCAATTTCT 
AACTTTGCCT 
TTCTCAAiGTT 
CTAGACCAGG 
CTGACCCAGA 
GATGAAGGAG 
QG606GCTTT 
CCT60AGAT0 
TTGTTTTGTT 
TTTTTTA 



Seq ID NO: C172 DMA Sequence 

Nucleic Acid Accession #: IIM_002309.2 

Coding sequence i 65. .673 



TGCCTCAACO 
GCCTTGACCA 
AAGATGCACT 
GGGGAGGTGT 
CTTGTGAAOA 
GAGATGTTTG 
GCTGAGCCCC 
AGGATTTCCA 
GTGGCGCTAT 
CATAAGQACT 
GCCCTGGGCC 
GTGCCX3CTGC 
GATGTCTGGG 
GGTGG6CAGG 
CAGCCC6AG0 
CCCAAGGACC 
AGCAAGCCGT 
GGGAA6CTCA 
GCAACAOQCA 
TCCTTTGGGA 
CCCCTGCCAC 
GGCCTTCAAC 
GGGGAGGGCT 
CTGGQCACAC 
ATTATAQAGG 
CCCAGGTCTT 
TTTTCAGGAG 
TATATGTAAT 
A0GA0G6TGG 

TTrrx ' tfrm ' 



51 



2400 
2460 
2520 
2SB0 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3160 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4187 



1 11 21 31 41 

I I i I I I 

ATGAACCTCT GAAAACTGCX; GGCATCTGAG GTTTCCTCCA ACGCCCTCTG AAGTGCAGCC 60 

CATAATGAAG GTCTTGGCGG CAGOAGTTGT QCCCCTGCTG TTGGTTCTGC ACTGOAAACA 120 

TGGGGOGGGG AGCCCCXTTCC CC3VTCACCCC TGTCAACX5CC ACCTGTGCCA TACGCCACCC ISO 

ATGTGACAAC AACCTCATQA ACCAGATCAO GAGCCAACTG GCACAGCTCA AT6GCA0TGC 240 

CAATGCCCrC TTTATTCTCT ATTACAOVOC CCAGGOOQAQ CCGTTCCCCA ACAACCTGQA 300 

CAAGCTATGT GGCCCCAACG TGAOGGACTT CCCJGCCCTTC CACGCCAACG GCACGGAGAA 360 

GGCCAAGCTG GTGGAQCTGT ACCGCATAGT OGTGTACCTT GGCACXTTCCC TGGGCAACAT 420 

CACCCGGGAC CAGAAGATCC TCAACCCCAG TGCCCTCAGC CTCCACAGCA AGCTCAACGC 4 BO 

CACCGCCX5AC ATCCTOOGAQ GCCTCCTTAQ CAAOGTGCTQ T3C0QCCTQT QCAOCAAOTA 540 

CCACGTGGGC CATGTGGAOO TQACCTAOGG CCCTGACACC TOGGGTAAGG ATGTCTTCCA 600 

OAAGAAGAAG CTGGGCTGTC AACTCCTG6G GAAGTATAAG CAQATCATOQ CCGTGTTGGC 660 

CCAGGCXnrC TAGCAGGAGC TCTTGAAGTG TGCTOTGAAC COAGGQATCT CAGQAGTTGQ 720 

OTCCAGATGT GGGGGCCTGT CCAAGGGTGG CTGGGGCCCA GOGCATOSCT AAACCCAAAT 780 

GQGGGCTGCT GGCAQACCCC GAGQQTQCCT GGCCAOTCCA CTCXaVCTCTQ GGCTGGOCTO 840 

TGATGAAGCT QAGCAGAGTG GAAACTTCCA TAGGGAGGGA GCTAGAAGAA GGTGCXICCTT 900 

CCTCTGGGAO ATTOTGQACT GGGGAGOGTG GGCTGGACTT CTGCXTTCTAC TTGTCCCTTT 960 

GGCCXXTTGC TCACTTTGTG CAGTQAACAA ACTACACAAG TCATCTACAA GAGCCCTGAC 1020 

GACAOGGTOA GACAGCAGOG CCCAGGGGAO TGGACCAGCC CCCAGCAAAT TATCACCATC 1080 

T6TG0CTTT0 CTGCCCCTTA 6GTT6QGACT TA06TGG6CC AGAQGGGCTA GGATCCCAAA 1140 

GGACTCCTTG TCCCCTAQAA GTTTGATGAG TGGAAGATAG AGAGGGGCCT CTGGGATGGA 1200 

AGGCTGTCTT CTTTTQAGOA TGATCAGAQA ACTTGGGCAT AGGAACAATC TGGCAGAAOT 1260 

TTCCAGAA6G AGGTCACTTG GCATTCAGGC TCTTGGGGAG GCAQAOAAQC CACCTTCAGG 1320 

CCTGGGAAGG AAGACACTGG QAGQAQOAaA QGCCTQGAAA GCTTTGGTAG GTTCTTOGTT 1380 

CTCTTCOCOS TQATCTTCCC TGCA6CCTGG GATGGCCAGG GTCTGATGGC TGGACCTGCA 1440 

GCA6GGGTTT GTGGAGOTGG 0TAGG6CAGG G6CAGGTTGC TAA6TCAGGT 6CAGAGGTTC 1500 

TGAGGGACCC AGGCTCTTCC TCTGGGTAAA GGTCTGTAAG AAGGGGCTGG QGTAGCTCAG 1560 

A0TAGCA6CT CACATCTGAO GCCCTGOGAG GTCTTGTGAG GTCACACAGA GGTACTTGAO 1620 

.GGGGACTOGA G6CGGTCTCT 6QT0CCCAG0 GCAAGGOAAC AGCAOAACTT A6GGTCAGGG 1660 

TCrCAQGQAA CCCTGAGCTC CAAGOGTQCT GTGGGTCTGA 0CTG6CATGA TTTCTATTTA 1740 

TTATGATATC CTATTTATAT TAACTTATTO OTaCTTTCAG TGGCCAAOTT AATTCCCCTT 1800 

TCCCTGGTCC CTACTCAACA AAATATGATG ATGGCTCCCG ACACAAGOGC CAGGGOCAGO 1860 

GCrrAQCASa GCCrGOrCTG GAAOTCGACA ATGTTACAAG TGGAATAAGC TTAOQGGTGA 1920 

AGCTQU3A8A AGGGTOGGAT CTGAQAGAAT GGGGAGGGCT GAGTOGGAGT GGGGGGCCTT 1980 

GCTCCACCCC CATCOCCTAC TGTQACTTGC TTTAOCGTGT CAGGGTGCAO GCTGCAGGGO 2040 

CTGGGCCAAT TTGTGGAGAG GCOGGGTGCC TTTClt?IXn i ' OCTTCCAOGO GGCTGGTTCA 2100 

CACTGTTCTT GGGOSCCCCA GCATTGTGTT GTGAGGOGCA CTGTTCCTGG CAOATArrGT 2160 

GCCCOCT G GA GCAQTGGGCA AGACAQTCCT TOTGGCCCAC CCTGTCCTTG TTTCTGTGTC 2220 

CCCATGCTGC CTCTGAAATA GCGGOCTGGA ACAACCCTGC CCCTQCACOC AGCATGCTCC 2280 

GACACAGCAO GGAAGCTCCT C C POTGQCCC G6ACACGCAT AGA0GGT6CB GGGOGCCTGO 2340 

CTGGQCCAGA CCCCAGGAAG OTGGGGTAQA CTGGGGGGAT CAGCTGCCCA TTGCTCCCAA 2400 

GAOGAGQAGA GGQAGGCTGC AGAOOCCTGQ QACTCAGACC AGGAAGCTaT GGGCCCTCCT 2460 

GCTCCACOCC CATCCCACTC CCACCCATGT CTGGGCTCCC AGGCAGGQAA CCCGATCTCT 2520 

TCCTTT8TGC T06GGCCA0G OQAaTGOAGA AACGCCCTCC AGTCTGAGA6 CA6GGGAGGG 2580 

AAGGAGGCA8 CAGAGTTGGG 6CAGCT6CTC AGAGCAOTGr TCTOGCITCT TCTCAAACCC 2640 

TGAGGGGGCr GCOGGC CTCC AAOTTCCTCC GACAA6ATGA TGGT ACTA AT TATGOTACTT 2700 

TTCACTCACr TTQCACCTTT CCCTGTCGCT CTCTAAGCAC TTTACCTGGA TGGC60GTGO 2760 
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GCM3TGTGCA 
CCATCTGTCC 
CCCTTGTCCA 
CTGGCTGCAT 
GCTGATTCCA 
GGAGCTCTGT 
GGACTTGGCC 
GGTCCAOCCT 
CGGGGAGCTG 
GGGAGGAAGA 
TTTTTTTAAA 
CTTGTCCCTQ 
CTCTCCAOAQ 
COCTGCAGAT 
CGOCATCCAG 
GAGGTTTCCC 
TGATQAATGT 
CATATTTAAA 
G3U1AAATGAA 



GGCAGGTCCT 
ATCCCAACAG 
GGATGCAGGG 
TCCGCCAOGA 
GCCG6GGGGC 
60CASGCCCA 
CCATGGGACC 
GCACAOCATC 
GGTTTCTCTT 
GAGAOTTTAC 
AAGCACTGCT 
ACrrGGGGCA 
TAQACATAGT 
GQTACAGATO 
CCTTCAGTCC 
TCCTTCCTTT 
ACGCTGTGGG 
ATATTTATTT 
TAAA6AATCT 



GAGGCCTGGQ 
CAAGACGAGG 
ACTGCCTTCT 
TGGGCTTOGA 
COGGCTGACT 
CCTTOCGOOC 
CATCCTCAGT 
ACTGAATCAC 
CCCTTTTTAT 
CAAGAGAGAT 
AGTTTACTTG 
CTTCCACCCT 
GTGTGGGGTT 
TTCCTGCCTT 

ggcxx:ac30to 
cx:actgaaaa 
gatqtttcat 

TTTATACCAA 
ACTCTTOG 



GTTGGGGTGG 
ATGTGGCTGT 
CCTTCCTGCT 
GAAAGACAAA 
OGGOCATO^C 
CTGGCTCTGA 
GCTCCCTCCA 
AGAGCCTTTO 
CTGCTGGTQT 
GTCTC0GG6C 
TCTCTCCTCC 
GACCCAGCCA 
GGAGCTCTGG 
AGAGTCATCT 
CTAGCTOOQT 
GCACATGGCC 
ACTGACAGAT 
ATGAATCACT 



AQGGTOGGGC 
TGAGATGTGG 
TCATCGGGCT 
CTTGTCTG6A 
CTCATCTCCC 
GTOGCTCTCC 
GATCCOGTCC 
GOTGAAAGAG 
GGACCACACC 
CCTTATTTAT 
CCATGGTCCC 
GTCCAGCTCT 
CACCOQGGGA 
CTAGTTGCCC 
GGOCCCACOG 
TT6GGTGACA 
TATTTTTATT 
TTTTTTTTTA 



CCGGAGTTGT 
GCCACACTCA 
TA6CTTGGGG 
AACCAGAGTT 
TGTGGACTTG 
CACCCAGCCT 
GGCAGCTTGG 
CTCTGCCAGG 
TGGGCCTG6C 
TATTTAAACA 
CATCGTCCTC 
GCCTTGCOGG 
GGTAGCATTT 
AOCTCAATCC 
TOOGGCCTTA 
AATTCCTCTT 
TATTCAATQT 
AGAAAAAAAA 



Seq ID N0» C173 DNA Sequence 
Nucleic Acid Accession H t XM_097506 
Coding sequence: 44.. 2788 ' 



1 11 

I I 
TGAAAGGCGG TTGTGGTGCA 
GACACTTGTT TGTCACGTGT 
CACAGAGCTT CAGCCTGOCC 
TCGQAACCAT CTCTCACACA 
CCTGATGCTG CAGAACAATC 
GAGCCTGCAG TC3GCTGCGCC 
TGAGGGGCTG TCCTCCXn'CC 
TGTCAGGGCC CTCAACAAOC 
CAGCCACATC CCCGACTAOS 
TAACAACCGC ATCCAGCATC 
ACTAGACCTG AATTATAACA 
ACTGCAGGAA CTGGGGTTCC 
GGGGAACCCT CTGCTACAOA 
ATOQGCATTC CAGTACCTGC 
CCAGQAGTTT CCPJSATCTCA 
AGGCATCXGG CTGCTCCCAT 
ACTGTCTCAC AATCAAATTG 
AATOGGCCTC CAACACAACC 
CTGCCTGCAA GCCCTGGATC 
CTCCACCCTG CACTCCCTGG 
CCTGGCTGGA CTTGGGGGCT 
GGGCTTCTCC AAG6ACAGTT 
GTGCTOTGOC TATGGGATGT 
AGAOCTTCAC CTTGATGATG 
AGCAGAGAAC CACTATGACC 
GCGACACCCC AOTGTCCAGT 
CTTTGAAAGC TGOGGCATCC 
CAATOQACTG GTGCTOCTGA 
GTTTGTGQTA GGTGOGATTO 
AQCCTCAGTC GATGCCCTQA 
GGGGCTAGGC TGCCGQGCCA 
GCTGCTCACT CTG6C06CAG 
GAAOTCCCGC TCOCTOGGCA 
6CTQ6CCGCC G06CT6CCCC 
GCCCTAOGCG CCACCTGAGG 
GATGAACTCC TTCTGTTTCC 
G CCGOSGGG C GACTTTGAGG 
CATCTTC6CA GAOOGGCTOC 
GGGCCTCTTC CCTGTCACGC 
GCCTGCCTGC CTCAACCCAC 
TOSGCXXSCTT CGGCCCCGCQ 
GCTGGAQAAG AGCTCCTGTG 
CATTCTGGAA GCTTCTGAAG 
AGTGAOCXrrC ATCTCCTGTC 
AGAGCCAGAG GGGAACCACT 
GAGGGCAGAG GGATCTACGC 
CTCTGGCTTQ GCCTTTGCTT 
TCTTCCCTTT CCTCTCTCCC 
AAACTCAGCA QTGTGATCTA 
CTCCTGTGAC CATCACCAAC 
TTCACCTTGA TACTGGGCCT 
CTTITGTCTG CTTAAGGGAA 
AGGGCACAGT GGACAGGGAO 
CTCATGGATA GGATACAAAA 
AAAGACTTCC TATTAAAATA 



21 31 41 51 

I I I I 

AAGGAAAACC CACAGGCCAA GGAAT6GGAA GACCAAGGTT 
CAATAATCAT CTCTGCCXXSG GACCTCAGCA TGAACAACXH* 
TCTTCCACCA OCTQOGCTTC TTGOAGGAGC TOOGTCTCTC 
TCCCA6GACA AGCATTCTCT GGTCTCTACA GCCTGAAAAT 
AGCTGGGAGO AATCCCOOCA GAGGOGCTGT OGGAGCTGCX: 
TAGATGCXJUV CCTCATCTCC CTGGTCCCGG AQAGGAGCTT 
G CCACC TCTG GCTGGACGAC AATQCACTCA OGGAGATCGC 
TCCCTGCCCT GCAOQOCATG AGCCTGGGOC TCAACOGCAT 
CGTTCCAGAA TCTCA0CA6C CTTOTGGTGC TGCATTTGCA 
TGGGGACXCA CAGCTTCQAG GGGCTGCACA ATCTGGAQAC 
AGCTGC3VGGA GTTCCCTGTG GCCATCOQGA CCCTGGGCAG 
ATAACAACAA CATCAAGGCC ATCTCAQAAA AGGCCTTCAT 
OGATACACTT TTATGATAAC GCAATCCAGT TTGTGGGAAG 
CTAAACTCCA CACACTATCT CTGAATGGTG CCATGGACAT 
AAGGCACXJ^C CAaCCTGGAG ATCCTQACCC TGACCCGCGC 
CGGGQATGTG CCAACAGCTQ CCCAC3GCTCC GAGTCCTGGA 
AGGAGCTGCC CAGCCTGCAC AGOTOTCAGA AATTQGAGGA 
GCATCTGGGA AATTGQAGCT GACACCTTCA GCCAGCTGAG 
TTAGCTGGAA OGCCATCCGG TGCATCCACC CCGAGGOCTT 
TCAAGCTGGA CCTGACAGAC AACCAGCTGA CCACACTGCC 
TGATGCATCT GAAGCTCAAA GGGAACCTTG CTCTCTCCCA 
TCCCAAAACr GAGGATCCTG GAGGTGCCTT ATGCCTACX3V 
GTGCCAGCTT CTTCAAGGCC TCTGGGCAGT GGGAGGCTGA 
AGGAGTCTTC AAAAAGGCCC CTGGGCCTCC TTQCCAGACA 
AGGACCTGGA TQAGCTCCAG CTGGAGATGG AGGACTCAAA 
GTAGCCCTAC TCGAGGCXTCC TTCAAQCCCT GTQAGTACCT 
GCCTGGCCOT GTGGGCCATC GTGTTGCTCT CCGTGCTCTG 
CCGTGTTCGC TGGOGGGCCT GTCCCCCTGC CCCOGGTCAA 
CAGGCGCCAA CACCTTQACT GGCATTTCCT GTGGCCTTCT 
CCTTTGGTCA GTTCTCTGA6 TAOGGAGCCC GCTGGGAGAC 
CTGGCTTCCT GGCAGTACTT GQGTOQGAGG CATC3GGTQCT 
TGCAOTGCAG CGTCTCCGTC TCCTGTGTCC GGGCCTATGG 
GCOTTGOAQC AGGGGTCCTA GGCTGCCTGG CACTGGCAGG 
TGGCCTCMT GGGAGAATAC GGGQCXinPCCC CACTCTGCCT 
OTCAGCCAGC AGCCCTGGGC TTCACCGTGG CJCCTGGTGAT 
TGGTCGTGGC CGGTCCXTTAC ATCAAACTGT ACTCTGACCT 
OOGTGTGGGA CTGCXX:CATG GTGAGGCAOG TGGCCTGGCT 
TCTACTGTCC C6TGGCCTTC CTCAGCTTTG CCTCCATGCT 
OOGAGGCCJGT CAAGTCTGTC CTGCTGGTGG TGCTGCCCCT 
TGCTGTACCT GCTCTTCAAC CXXXyVCTTCC GGGATQACCT 
CAGGGGACTC AGGGCCCCTA GCXITATGCTG CGGCCGGGGA 
ATTCTACXXA GGOCCTGGTA GCCTTCTCTG ATGTGGATCT 
CTGGGCGGCC CCCTGG6CTG GAGACCTATG GCTTCCOCTC 
AGCAGCCAGO GGOCXXXIAGG CTG6AGGGCA GCCATTGTGT 
TTGGGAACCC CCAACCCTCC ATGGATGGAO AACTOCTQCT 
CAGCAGGTGG AG6CTTGTCA GGGGGTGGOG GCTTTCAGCC 
CACACGTGTA AATATCCCTC CCCATTCTTC TCTTCCCCTC 
CCTCXMTGAA TGATGGCTGC TTCTAAAACA AATACAACCA 
TAGCAGQATG GCCCAOTCCC TGGCTCCACT QATCACCTCT 
GGGTGCCTCT TGGCCTGGCT TTCCCTTGGC CTTCCTCAQC 
CTTCCTTGTC ATGTCTGAAG CTGTGGACCA GAGACCTGGA 
ATGAGGGAAO TAAAGACAGT GAAGGGOTGG AGGGTTGATC 
ACCTCACAGA QAAAGGCCTG GAAQGTGATT TCCOGTGTGA 
TGTGTTCCAT GTACCATTAA TCTTGACATA TGCCATQCAT 
AGCTTTGGAA GAG 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3S40 
3600 
3660 
3720 
3780 
3840 
3868 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3273 



Seq ID NO: C174 DMA Sequence 
Nucleic Acid Accession fti NM_130849 



1308 



wo 03/042661 



PCT/US02/36810 



Coding sequence I 101.. 2044 

1 11 21 31 41 SI 

D AGTCTGGCCC TQGACAJVCCC CAGCAAAOCC GGGCTGAGCC AOCCCAOAAa CACTGG6CCT 60 

TGGCCACAGC AACACCCACT GAGCAOGCTO GGAGCTGAGT ATGGOSTCCC TGGTCT06CT 120 

GGAGCTGGGG CTGCTTCTGG CTGTGCTGGT GGTGAOGGCG ACGGOGTCCC OQCCTGCTGG 180 

TCTGCTGAGC CTGCTCACCT CTGGGCAGGO CQCTCTGQAT CAAGA06CTC TGGOCSGGCCT 240 

GTTAAATAOO CT660GGACC GTGTGCACTQ CACCAA0G6G CGGT6TGGAA AGTGCCTGTG 300 

lU TOTGGAGGAC GOCCTGGGCC TGGG0GA6CC TGAGGGGTCA GGGCTGCCCC G66GCCCX36T 360 

OCTGGAGGCC AGGTACGTCG CCOGCCTCAG TGCCGCCOCX: GTCCTGTACC TCAGCAACCC 420 

OGAGGGCACC TGTGAGGACA CTOGGGCTGG CCTCTGGGCC TCTCATGCAG ACCACCTCCT 480 

GGCCCTGCTC GAOAGCCCCA A66CCCT6AC CCOGOGCCTO AGCTQGCTQC TOGAQAGGAT 540 

, - GCA660C0GG GCTGCOGGCC AGACOOCCAA GAOQGCCTGC GTAGATATCC CTCAGCIGCT 600 

15 GGAGQAGOCG GTG6GG60GG GGGCTCOQGQ GAOTGCTGQC GGCGTGCTGG CTGOCCTGCT 660 

GGACCATGTC AGGA6CGGGT CTTGCTTCCA CGCCTT6CGQ A6CCCTCAGT ACTTOGTGGA 720 

CTTTGTGTTC CAGCAGCACA GCAGC3GAGGT CCCTATGACG CTGGCOGAGC TGTCAGCCTT 780 

GATGCAGOGC CTG6GG6TGG GCAGGGAG6C CCACAGTGAC CACAGTCATC GGCACAG66G 840 

. AG0CA6CAGC GGGQACCCIG TGCCCCTCAT CAGCTCCA6C AACAOCTCCA 6TGTGTGGGA 900 

ZU CAOGGTATGC CTGAGTGCCA GG6A0GTGAT GGCTGOVTAT GGACTGTOGG AACAGGCTGQ 960 

GGTGACCCCO GAGGCCTGGG CCCAACTGAG CCCTGCCCTG CTCCAACA6C AGCTX3AGTGG 1020 

AGCCTQCAGC TCCCAOTCCA GGCCCCXXX3T CCAGGACCAG CTCAGCCAGT CAGAGAGGTA 1080 

TCTGTACGGC TCCCTGGCCA CGCTGCTCAT CTGCCTCTGC G0GGTCTTT6 6CCTCCT6CT 1140 

^. 6CTGACCTGC ACTGGCTGCA GGGGGQTGQC CCACTACATC CTSCAOACCT TOCTGAiGCCT 1200 

A J 66CAGTGGGT 6CACTCACTG GGGA06CTGT CCIGCATCTG A06CCCAAGG TGCTGGGGCT 1260 

6CATACACAC AGCX3AAGAGG GCCTCAGCCC ACAGCCCACC TGGCXSCCTCC TGGCTATGCT 1320 

GGCCGGGCTC TACGCCTTCT TCXTrCTTTGA GAACCTCTTC AATCTCCTGC TGCCCAGGGA 1380 

CCCGGAGGAC CTGGAGGAOO GGGCCTGCGG CCACAGCAGC CATAGCCACG GGGGCCACAG 1440 

^. CCA06GTGTG TOCCTGCAOC TGOCACCXAG GGAaCTCGGG CAQCCCAAOC CCCCCCACX» 1500 

30 GG6CTCCCX3C GCAGACCTGO TGGC3GGAGGA 6AGC006GA6 CTGCTGAAGC CT6A6CCCAG 1560 

OAGACTGAGC CCAGAGTTGA GGCTACTGCC CTATATGATC ACTCTGGGCXJ AOGCCOTGCA 1620 

CAACTTCGCC GACX3GGCTGG CCGTGGGCGC CGCCTTCGCG TCCTCCTGGA AGACCGGGCT 1680 

OGOC ACCTOG CTGGCC3GTGT TCTGCCACGA OTTGCCACAC GAGCTGGGGG ACTTCX3C0GC 1740 

^ - CTT6CT6CAC GOQGGQCTST CCGTQOSCCA AGCACTQCXQ CTQAACCTGG CCTC0606CT 1600 

35 CA06GCCTTC 6CTGGTCTCT ACGnrOGCACT GGCGGTTGGA GTCS^GGGAGG AGAGC6AG6C 1860 

CTGGATCCTG GCAOTGQCCA CCGGCCTGTT CCTCTACJ8TA GCACTCTGCG ACATGCTCOC 1920 

GGCGATGTTG AAAGTACGGG ACCXX3CX3GCC CTGGCTCCTC TTCCTGCTGC ACAACGTGGG 1980 

CCT6CTG66C GGCTGGACCO TCCTGCTOCT GCT6T0CCT6 TAOGAGGATG ACATGACCTT 2040 

CTGATACCCT GCCCTAGTCC CCCACCTTT6 ACTTAAGATC OCACACCTCA CAAACCTACA 2100 

4U GCCCAGAAAC CAGAA6CCCC TATAGA6QCC OCAST(X!CAA CTCCAGTAAA GACACTCTT6 2160 

TCCTT06AAA AAAAAAAAAA AAAAAAAAAA AA 2192 

Seq ID MO: C175 SNA Sequence 
Nucleic Acid Acceasion tt< im_01897l 
45 Coding sequence i 1..1128 

1 11 21 31 41 51 

I I I I I I 

^ ATGGCGAAOG G6A60QAGCC GGGTGGCAGC GGGGG06GCG AGGOGGCXXX: CCTGGGCCTC 60 

50 AAGCTGGCCA G6CTCAGCCT GCTGCTGTOC GTGAQCCTAG CGGGCAAOGT 6CTGTXXX3CG 120 

CTGCTGATCQ T6CGGGAGCG CAGCCTGCAC CGOGCCCCGT ACTACCTGCT GCTCX3ACCTQ 180 

TOCCTGOCCG AOGGGCTGCG CGCXSCTCGCC TGCCTCCCGG CCGTCATGCT GQCGGCGCGG 240 

CG7GCGG0GG COGCGGCGGG G606CCXSCC0 GGCXSOGCTGG GCTGCAAGCT GCTCGCCTTC 300 

CTOGCOOCGC TCTTCT6CTT CCAOGOCGCC TTCCTGCTGC TGGGGGTGGG 06TCACGGGC 360 

55 TA0CTG6CCA TOGOGCACCA CC6CTTCTAT GCAGAGOGCC TG6CCG6CTG GCCGTGGGCC 420 

GCCATGCTGG TGTGCGCC6C CTGGGCSQCrG GCGCTG6C0Q OGGCCTTCCC 6CCAGT6CTG 480 

GAOGGOGGTG GGQACOACGA GGAOQGGCGG TGCGCCCTGG AGCAG03GCC OGAOSGCGCC 540 

CC0GG0606C T6GGCTTCCT GCTGCTQCTG GCCGTGGTGG TGGGCGCCAC GCAGCTOQTC 600 
TACCTCOGOC TGCTCTTCTT CATCCAOOAC CQCGQCAAGA TOaSGCOOSC 008CCTG0TG 660 

OO CC06G08TCA GCCACGACTG 6ACCTTCCAC GGCC06GGCG CCACC6GCCA GGCGGCGGCC 720 
AACTGOAOOO OGGGCTTCGO CCGCGGGCCC ACGCCGCCCQ CGCTTOTGGG CATCOGGCCC 780 
GCAGGGCOGG GCCGCGGOGC GCGCCGCCTC CTCGTGCTGG AAGAATTCAA GACGGAGAAG 640 
AGGCTGT6CA AGATGTTCTA C6CCGTCACG CTGCTCTTCC TGCTCCTCTG GGGGCCCTAC 900 
OTCGTGGCCA GCTACCTGOG O q TC C TGGTG OQGGCOGGGO OCOTCOOCCA G6CCTACCTG 960 
D5 ACGGOCTCCG TGTGGCTQAC CTTGGCGCAa GOOGGCATCA ACCGCGTGGT GTGCTTCCTC 1020 
TTCAACAG6G AfiCIGAiGGaA CTGCTTCAGG QCCCAOTTCC CCTOCTGCCA GAGCCCOOGG 1080 
ACCAOCCAGO C6ACCCATCC CTGCGAGCTG AAAGGCATTG GTTTAT6A 1128 

Seq ZD NOi C176 DNA Sequence 
70 Nucleic Acid Accession #i NH_00S631 
Coding sequence! 290.. 2653 

1 11 21 31 41 51 

nc \ I I I I t 

/D GGCACGAGQG GGCTGAAGAC AACTTOGATT GOGAGGCTAO GGCTTQGGGA GTOOTGCATC 60 
CCGTTCCGGG CCTCOGCAGC CCAACATGGG CCCOGGGTTC CAAAGTTTGC GAAGTTGGGC 120 
GCOQAGGGGC OGGGGOOOQC GOAGCQTCCQ GGGGGGCCCG GGCCCGGATT CTCTGGGOGC 180 
ACAGGTC60C TGAGC06CCT CCGOGG^GC OQAGGTCGTG CGTGTG6C0G GGGGGCTCCG 240 
f^f^ AGGAGCAOQC GGG6G0000G GGGCTTTTQC TGAGTTGGCG GGGTT66CCA TGGCG6CTGC 300 
oO CCOCCCAGCO CGGGGOCOGG AGCTCCCGCP CCTGGOGCTG CtGCTGCTGC TGCPGCTGGG 360 
GGACCCGGGC OBGGGGGOQG CCTCGAGCGG GAACGCGACC GGGGCT6GGC CT0GGAG06C 420 
GGGCGGGAGC OOQAGGAGGA GCGCGGCGGT GACTGGCCCT C06COGCOGC TGAGCCACTG 480 
OGGCCGGOCT GCCCCCT6C0 AGCOQCTGCG CTACAACGTG TGCCTGGGCT CGGTGCT6CC 540 
CTACGGQGCC ACCTOCACAC T6CTG6CCGQ AGACTOGQAC TCOCAOGAGG AAGCGCAOQG 600 

1309 



wo 03/042661 
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80 



CRAGCTOGTG CTCTCQTCGQ GCCTCOSOAA TGCCCCCCGC TGCTGGGCAO TGATCCAGCC 
CCTGCTGTGT GCCGTATACA TGCCCAAGTG TGAGAATGAC OGGGTGGAGC TGCCCAGCOB 
TACCCT CTGC CA06CCACCC GAQGCGCCT6 TGCCATOOTQ 6AGAGGQAGC GGGGCTGGCC 
TQACTTCCTG 06CTGCACTC CTGACGGCTT CCCTGAAGGC TGCACX^TO AGGTOCAOAA 
CATCAAGTTC AACAGTTCAO GCGA6TG0QA AGTGCCCTTO OTTOGGACAG ACAACCCCAA 
GAGCTGGTAC GAG6A0GTGG AGGGCTGCGG CATCCAGTGC CAGAACCOGC TCTTCACAGA 
GGCTGAGGAC CAGGACATGC ACAGCTACAT OQCGOCCTTC GGQQCCGTCA OGGGCCTCTG 
CACGCTCTTC ACCCTG60CA CATT0GTG6C T6ACTGG0G6 AACTCGAATC GCTACOCTGC 
TGTTATTCTC TTCTACQTGA ATGGGTGCTT CTTTGTGGGC AGCATTGQCT OGCTGGOOCA 
GTTCATGGAT GQTGCCCX3CC QAfiAGATOGT CT6C0GTGCA GATGGCACCA TGAG6CTTG0 
GGAGCCCACC TCCAATGAOA CTCTGTCCTG CGTCATCATC TTTGTCATOQ TGTACTAOGC 
CCTGATGGCT GGTGTGGTTT GQTTTGTGGT CCTCACCTAT GCCTGGCACA CTTCCTTCAA 
AGCXXmSGGC ACCACCTAGC AGCCTCTCTC OG6CAAGACC TCCTACTTCC ACCTGCTCAC 
CT G GT CA CTC CGCTTTOTOC TCACT6T6GC AATCCTT6CT GTGGGGCAGG T6GATGGGGA 
CTCTQTGAOT GQCATTTQTT TTQTGGaCTA CAAGAACTAC 06ATAC06TO 06G6CTT0GT 
GCTGGCCCCA ATCGGCCTGG TGCTCATOGT GGGAGGCTAC TTCCTCATCC GAGGAGTCAT 
6ACTCT0TTC TCCATCAAGA GCAACCACCC OGGGC TGCTG AGTOAQAAGG CTGCCAGCAA 
GATCAAOGAO ACCATGCTGC GCCTGGGCAT TTTTOSCTTC CTGGCCTTTG GCTTTGTGCT 
CATTACCTTC AGCTGCCACT TCTACX3ACTT CTTCAACCAG GCTOAGTGGO AG0GCA6CTT 
CCGGQACTAT 6TGCTAT6TC AG6CCAATGT GACCATOGGG CTGCCCACCA AGCAGCCCAT 
CCCTGACT6T GAGATCAAGA ATOGCCCGAG CCTTCTGGT6 GAGAAGATCA ACCTG7TTGC 
CATGTTTQOA ACTGGCATCO CCATGAGCAC CTGGGTCTGG ACCAAGGCCA CGCTGCTCAT 
CTGGAG60GT ACCTGGT6CA GGTTGACTGG GCAGAGTGAC GATGAGCCAA AGGGGATCAA 
GAAGAGCAA6 ATGATT6CCA AGGCCTTCTC TAAGCX3GCAC QAGCTCCTGC A6AACCCAGG 
CC3W3GAGCTG TCCTTCAGCA TGCACACTGT GTCCCACGAC GGGCCXXTTCG OGGGCTTGQC 
CTTTGACCTC AATOAQCCCT CAGCTGATOT CTCCTCTGCC TGGGCCCAGC AT0TCACC3UV 
GATGGTGGCT CGGAGAGQAQ CCATACTGCC CCAGGATATT TCTGTCACCC CTGTGGCAAC 
TCCA6T6CCC CCAGAGOAAC AAGCCAACCT GTGGCTGGTT QAGGCAQAGA TCTCCCCAGA 
6CTGCA6AA6 CGCCTGGGCC G6AAGAAGAA GAGGAGGAA6 AGGAAGAAGG AOGTGTGCCC 
GCTGGOGCCG CCCCCT6AGC TTCACCCCCC T6CCCCTGCC CCCAGTACCA TTCCTOGACT 
GCCTCAGCTG CCCCGGCAGA AATGCCTGGT GGCTGCAGGT GCCTGGG6AG CTGGGGACTC 
TTGCOSACAG GGAGCXSTGGA CCXnrGGTCTC CAACCCATTC TGOCCAGAGC CCAGTCCCCC 
TCAGGATCCA TTTCTGCCX3V GTGCACCGGC CCCCGTGGCA TGQGCTCATG QCCGCCGACA 
GGGCCTGGGG CCTATTCACT CCCGCACCAA CCTGATGGAC ACAQAACTCA TGGATGCAGA 
CTCGQACTTC TGAGCCTGCA GAGCAGGACC TGOGACAGQA AAGAaAQQAA CCAATACCTT 
CAAGGCTCTT CTTCCTCACC GAGCATGCTT CCCTAGGATC CCGTCTTCCA GAGAACCTGT 
GGGCTGACTG CCCTCOGAAG AGAGTTCTGQ ATGTCTGGCT CAAAGCAGCA GGACTGXGGQ 
AAAGAGCXTTA ACATCTCXAT GGGGAGGCCT CACCCCAGGO ACAGGGCCCT GGAGCTCAGG 
GTCCTT6TTT CTGCCCTGCC AGCTGCAGCC TGGTTGGCAG CATCTGCTCC ATCGGGGCAG 
GGGGTATOCA GAGCTT6TGG TGGGGCAGGA ACXSGTGQAGG CAGAGQTGAC AGTTCCCAGA 
QTGGGCTTTG GTGGCCAGGG AGGCAGCCTA GCCTATQTCT GGCAGATGAG GGCTGGCTGC 
CQTTTTCTGa GCTGATGGGT GCCCTTTCCT OQCAGTCTCA OTCCAAAAGT GTTGACTOTG 
TCATTAGTCC TTTGTCTAAG TAGGGCCAGG GCACXX3TATT CCTCTCCCAG GT6TTT0TGG 
GGCTGGAAGG ACCTGCTCCC ACAGGGGCCA TGTCCTCTCT TAATAGGTGG CACTACCCCA 
AACCCATCTT TTGTTCTCCT ATATCCTCCT TCTCCT6TTC CATTTCAQTT CAOTTTCAGC 
GGTGCCAACC TCTTTG06TT TCCTTTTTGT TGATGAGGAC CCAGAGCTGC TGCACACACT 
CACCTCTAAC ccxx:tcccct CGCTGCTGGG OCCCATCTCC ACAQQAQAGA CTGGTTOGGC 
TCTA6GGCCT CAGTCTGQAG TGGGATAGGA GCAGTGA6TG AGAAA6CCTC TGAAA6ATGC 
ATCATCTCTT CCTCACACCC ATTTAQTGGG GQATQGQTCC TCTAGACTTG AGGGGCTACC 
CTGGGAAGCT GCCGTAGCTT CAGCCAGGCA AGAAAGCTTC CTTCAACCTG CATAGCC3GGT 
GGGTGAGGAO ATTCX:CACCr TGCATAGCCT CCAAACAT6T TCCCAA6G0C CCACTTTCAA 
GAATCAOACA GCAGQAAGCX: ATA0ATGCT6 OCTGGGTTCC AGGTTATQGO GAOAAQAAAT 
ACAGTGAATA AAAGGTTTTT GTATAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
A 

Seg ID KOi CI 7 7 DNA Sequence 
Nucleic Acid AccesBlon #< AK094595 
Coding sequence i 1..2B53 



ATGGGGGCCC 
COGAGGCTGA 
TGAGCGCCA6 
AAGAACAAGC 
TGCAAOGGCG 
ACCCTG6GG0 
CAGGTGGAGG 
G06GGCACCA 
GATCAGQAGC 
COGCCQGAGG 
CCCACCCAOG 
CGCCTGTOGG 
AGCACCACTQ 
TCACCCTGCT 
CCOGCTCCAC 
ACCACCATCT 
ACTGAGTGTG 
OQTGACTGCA 
AATAAGAAAA 
CraTATQOQQ 
6TGGTG3TGT 
GOOCTOACTO 
CTCCTACACC 



11 

1 

GGAGOGGAGC 
GCCAAGCAOG 
CAGAGCC6CT 
CTGTGGAGCT 
AGTGGOTCAG 
CGCGGGGCQG 
AGCTCTTTGG 
CCAAGAGTCG 
CTCTGG6CAA 
GGGT6GCTGT 
ACACCAACTT 
ACACTGCCAA 
CCACOGTCAT 
CCAACG6CTG 
TCAAGGGAGG 
GCCCAGTCQA 
CCCACTGGCG 
GCGGGACGCT 
CTCTAAGCQA 
GGCTGSTGGT 
ACOGCCGCAA 
OTGOTTTCCA 
CCTCTGTGCC 



21 
1 

TCGGGGCGCG 
TAGGAAGCQA 
GCCCTACTTC 
COGCTGCOGC 
CCAGAAOGAC 
CCT60GGGT0 
6CIG6AGGAT 
COGAGCCTAC 
GGAGGTGCCC 
GGCGQAGGTG 
CCTGCTCACC 
CTATACCTGC 
C6TCTA0GT6 
TGGCCGAGGC 
GGCCTTCT8C 
TGGGGCGTQO 
TAGCGGCGAG 
GCTCGACTCT 
CCCCAACAGC 
GGCCATCTTC 
CTGCCGTQAC 
CCCCG TCAAC 
TGCTGACCTG 



31 
1 

CTGCTGCTGG 
TCQGGTGAGQ 
CTGCAGGAGC 
GCCTTCCCCQ 
CAOGTCACAC 
OQOGAGGTGC 
TACTGGTGCC 
GTCOGCATOG 
CTGGACCATG 
GAATGGCTCA 
ATCGACCAOV 
GTGGCCAAGA 
AAOX^GCGGCT 
TGGCAGAA6C 
GAGGGGCAGQ 
ACQGAGTGGA 
TOCATGGCGC 
AAGAACTGCA 
CACCTGCTGG 
GTGOTOGTOQ 
TTOSACACAG 
TTTAA6A0Q0 
ACASCCAGCO 



41 
I 

CACTGCTGCT 
TGCTCCCTGA 
CACAGGACGC 
GCACACAGAT 
AGGAAGGCCT 
AGAT06AGGT 
A6T60G7GGC 
GCTACCTGCO 
A6GTTCTCCT 
AGAATGAGQA 
ACCTCATCAT 
ACATCOTGGC 
GGTCCAGCTG 
GCACCCGGAC 
CATTCCAQAA 
GCAAGTGOTC 
CCCCACCCCA 
CAQATGGGCr 
AQGCCTCAGG 
CAATCCTCAT 
ACATOICTGA 

C0G6CATCTA 




CTACATTGTG 
CTACTTCAAG 
GGAT6AG6CC 
OTOGOGGCAO 
CTGGAGCTCC 
CAAGAACTTC 
GCAGTGC06C 
TGTCATOGAC 
COOCCAGOCC 
CAAA06CGG0 
GGCAGAGTGO 
CTGCACCAAC 
GACOGOCTGC 
AGCCTGCAGC 
GAACGGAGGC 
QTGCAT6CAA 
0GATGCG6CG 
GGOaOTaQGO 
CTCATCTGCT 
CAACCCGCAO 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

ISOO 

1S60 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3160 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3760 

3781 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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GTGTATGOCC 
CCCTTACCCA 
CTGGCAGATG 
OCCOOOGACA 
GGOCT G CCCC 
CTCAGCATCC 
AAGTTCTACG 
GGGACCCACA 
060COOGTCA 
CAGCTCAAGA 
ACCCTGAACA 
CAGCTGGGCA 
CAGCTGGCOG 
TGCCTGOAGO 
GGATACTTGO 

GAGATCCCCT 
CT6GA6AGGC 
6AAG66GAGG 
CTQQACACTC 
TTCAAGATCC 
CGGGGCAATG 
TTT6CCACCA 
GAOGATGGGG 
CrGGTQGCTG 
ACTGGCAGGA 
CTQCTTCCTC 
ACCATGACCA 
CXACCCCX3GC 
A6CCTCTGGA 
CCCTCAGCCC 
COTTATTQAT 
GAA6AGTCAA 
QAAAQAAA6A 
ACAAGGGAAA 
TTGGATAAOT 



TGCAGGACTC 
GCCTTAAGGT 
GGCCTGACCT 
CCCACTTCCr 
GAGACCCAGG 
CCGGCACAGG 
AGATGTATCT 
CAGTATTGAS 
TCCTdOCAT 
CCCAGGOCCA 
CACCCTGCTA 
CCTAOGTGTT 
TCTTOGCCCC 
ACA03CCTOT 
TGGA6GAGGC 
ATOACCTCCC 
TCTATCACAT 
ACAGCTTGGC 
GCCAGATATT 
TCT6CTCTGC 
CACTGTCCAT 
ACTGGCGGAT 
AAGCGAGCCC 
ACCTCAACAG 
TQGCCACGGA 
GGCAGGTGCA 
CCAGTTCACA 
6CCTTAGAAA 
TCTCTCTCTC 
GGCAGTTG6T 
GGCAACTTCT 
TTCTCCTTTC 
GTACAATTCA 
AAGCTTCAGA 
AGAAAAACCX: 
CCTTTTTAAA 



CACCXSACAAA 
CAAGGTCTAC 
GCTGGGGGTC 
GCACCTG08C 
GAGCAGOQTC 
GGTCA6CXTG 
ACTCATCAAC 
CCCCTCGGTG 
6CC0CACTGT 
CCAGG6CCAC 
CTGCCAGCTQ 
CACGGGOQAG 
OGCCCTCTGC 
AGCACtGAAG 
6AAACC6CTA 
CX3VTQCCCAT 
TTGGAGTGGC 
CTOCACAQAO 
CCAGCT6CAT 
CGCT66CAGC 
CGQCCAGAAG 
6TTAGCACAG 
CAOGGOTOTO 
CCTG6G6AGT 
OQGGGACTGC 
GGGAGGCXTTQ 
OCCAOAOTTO 
ATCCAT6TAC 
TTGGCX^TGAa 
TG6GGG0GGG 
GGGTTCCATG 
TCCCTAACCC 
GACAAACT6C 
COGCTAGTAA 
AGTTTCTTAG 
A 



ATCCCCATGA 
AGCTCCAGCA 
TTGC06CCT0 
A60GCCA8CC 
AGOGGCACCT 
CTGGTOpXA 
AAGGCAGAAA 
ACCTGTGGAC 
OCOQAAOTCA 
TGGGAGGAG6 
OAQCCCAGGG 
TCCTATTCCC 
ACCTCCCTGO 
QAGGTGCTGQ 
ATGTTCAAG6 
TQQAQQAGCA 
AOCCAGAAGG 
CTCACCTGCA 
ACCACTCTGO 
ACTOTCACCA 
ATATQCAACA 
AAGCTCTCTA 
ATCCTG6A0C 
6CCTTGGAG0 
TQAGCCrCCT 
GG6CAGCCTC 

TCTGTTGTTA 
ATCTCTGTGC 
CAGGCAGGAO 
GGTTTTAGTT 
(XCTTCTGCT 
TTTCTCCTGT 
GGCTCAAAGA 
GAAA06CAAA 



CCAACTCTCC 
CCAOGGGCTC 
GCACATACCC 
TOG G TT C CCA 
TTGGCTGCCT 
ATGGAGCCAT 
GTACCCTCCC 
CCA CAGGCC T 
GTGCCCGT6A 
TGGTGACOCT 
CCTGTCACAT 
GCTCAGC3KGT 
AGTACAOCCT 
AOCTGGAGOG 
AQUSTTACCA 
AGCTGCTGGC 
CCCTCCACTG 
AGATCTGOOT 
CftGAO ACAC C 
CCCAGCTG6G 
GCCTAGAT6C 
TGGACGGGTA 
TCTGGGAAOC 
AGATGGGCAA 
GGGACAOGGO 
CTGATGGGGA 
TCCTCTTCCC 
GAGGGCCCAG 
AGGAACCAAO 
OCCCTCCCTC 
CCQTTCTC3GT 
TCCACGCCCT 
CCAAAAGCAA 
AOAAQAAAAA 
OGATTTATTA 



TCTGCTQQAC 
TGGGCCAGGC 
TAGCGATTTC 
QCAGCTCTTG 
GGGT6GGAGG 
TCC0CA6GGC 
GCTTTCAGAA 
CCTGCTGTGC 
CTGGATCTTT 
0GATGAG6AG 
CCTGCTGGAC 
C3UVG0GGCTC 
CCGGGTCTAC 
GACTCT(X3GC 
CAACCTGOGC 
CAAATACCAO 
CACTTTCACC 
QOGGCAAOTG 
TGCTG6CTCC 
ACCrX'ATGCC 
CCCCAACTCA 
CCTGAATTAC 
TCTGCAGCAO 
QA0T6AGATG 
6CTG6CAGGG 
TGnTTGGCCT 
CAACCCCCAQ 
AGTTCCTTCT 
AT8GG0CT0A 
CACCCCCCCA 
TTTCTTCCTC 

AAAGGCAAAG 
CACCAAAACC 
TCCAGATTAT 



Seq ID NOt C17B DKA Sequence 
Nucleic Acid Accession #: 1IM_004625 
Coding sequence: 310.. 135 9 " 



1 
1 

GAGG6AGGG0 
GCACCGGTOG 
CCTCCOGCTC 
CGCACCGACA 
TTOCCCCQCC 
ATCGGGACTA 
ATGGTCTACC 
TGTAACAAGA 
GCXa^TCATCG 
06CAATGGCC 
AAAOTGGGGA 
GCCATCACAG 
CAA6GCCAGT 
TAOGGCATCG 
ACTCTCATGA 
CTGGAATGTA 
CTGCCRCAGT 
GTGGAGCCTG 
CTOTOSTACX: 
TGCXSAGOAGQ 
GCTCCCCAGO 
TACGCCCGGG 
ACGTGCAGCG 
CCCGCTGCAA 
AGGATGCTQA 
TCC6TGGGGG 
CTCCACCCTC 
CAGGAACTCT 
ATAATATTAA 



11 

1 

CGGGGGCTGG 
CTCGCGCGCA 
OGTGGCTCCC 
GACGGOCCCG 
GGGGCOGGCC 
TGAACCGGAA 
TCCGQATOGG 
TCCCAGGOCT 
TCATAGGAGA 
GCTGGAACTQ 
GCCGGGA06C 
GTGCCTGTAC 
ACCACOGGGA 
GCTTCX3CCAA 
ACTTGCACAA 
AOTGCCACGO 
TTCGGGAGCT 
TGCGT6CCAG 
GCAA6CCCAT 
ACCOQGTGAC 
CCAGCX3GCTa 
TGTGGC3VGT0 
AOCGCACGGA 
0TCAGATT6C 

AAAAAAAATC 
OCCCAGACAC 
OQACCCCTGO 
TTTATTTAAT 



21 

I 

AGGCAGCAGC 
GCCCGGOGTC 
GTQCTCCTGO 
GGGACGCCTC 
06CGGGATCA 
AGCGOGGOGC 
TGGCTTCTCC 
GGCTCCCAGA 
AGGCTCACAA 
CTCTGCACTG 
TGOGTTCACC 
CCAGGGCAAC 
OGAGGGCTGG 
GGTCTTTGTG 
CAA06AGGCA 
OSTGTCAGGC 
GGGCTA06TG 
CCGCAACAAG 
GGACACQGAC 
OGGCAGTGTG 
TGACCTCATQ 
CRACTGTAAO 
QATGTACAOO 
TGGGAGQACT 
TCTGCTGAGG 
TCTCAGAGCC 
AGCCCAGGTC 
GCCTCATCAC 
TAAAAA6AAT 



31 

1 

GCCCCCX3CAC 
GCCCCAOGCX: 
C0A0GCTCA6 
GGCTCGCGCC 
GCACAGCXXX} 
TGCCTGGGCC 
TCAGTGGTAO 
CAGCGGGOGA 
ATGGGCCTGO 
GQAGAOOOCA 
TACX3CCATCA 
CTGAGCGACT 
AAGTGGGGTG 
GATGCCCGGG 
GGCCXSAAAGA 
TOSTGCACCA 
CTCAAGGACA 
CX3GCCCACCT 
CTGGTGTACA 
GGCACCCAGG 
TGCTGTGGGC 
TTCCACTGGT 
TGCAAOTGAO 
GGACGGTTTC 
AGGGTACTTT 
CTCAACTATT 
CCTC0606GC 
AGCAATATT7 
TCTTCCaCAA 





GTGGCTOCGA 
GCTGCTCTGC 
AGATCAAGCA 
TGCTGGAGGA 
CCAAGAOGTG 
AGTACAA06A 
TCCTGAAGAT 
TCX3AGAAGTC 
GCCX3CGCCTG 
QTGGCTACAA 
GCTGCTATGT 
CCOOQTGTGC 
CAAGCTG06G 
TCCTGGGTTT 
CTGTTCCACA 
TGGAGCX^AAG 
AACAATTtAT 
AAAAAAAAAA 



GOCGGACGGO 
GCTATGTTGA 
COQOOGGCCA 
CAGCCTGGGC 
AAGCATCATC 
CCGGC CCGAC 
GTTTCAGTTC 
QAAGGA6CTC 
CGTGGCCCAC 
CAAAGAGAAO 
OGACATCCGC 
GAATGCCOGG 
GAACATGAAQ 
CTGQACCACA 
GGCCGTTCAC 
C7UVGAAGCCA 
GCCCAACTAC 
CAACAAGACG 
CACCCACCAG 
CAA6TGCAAC 
ACACCACCCT 
GCTCCCTGOC 
CCTGCAGGCA 
CCCAATGCTG 
CCTTCTGCT^ 
TCTOATAAAA 
AAAAAA 



Seq ID NO: CI 7 9 DNA Sequence 0BR3 
Nucleic Acid Accession fts NM_0037a6 
Coding sequence: 71.. 4 654 



11 



21 



31 41 51 

) I I I I I 

CTCOGOCGCC CGCTCT6CCC GCCGCTGGOT CCOACCGCOC TOQCCTTCCT TGCAGCCGCQ 
CCTCGGCCCC ATGGACQCCC TGTGOGGTTC CG6GGA6CTC GGCTCCAA6T TCTGGGACTC 
CAACCTQTCT GTGCACACAG AAAACCOGGA CCTCACTCCC T0CTTCCA6A ACTCCCTGCT 
GGCCTGGGTG CCCTGCATCT ACCTOTGQGT GGCCCTOCCC TGCTACTTGC TCTACCTGCG 
GCACCATTOT OQTGGCTACA TCATCCTCTC CCACCTGTCC AAGCTCAAOA TGSTCCTQGQ 



1440 
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1560 
1620 
1680 
1740 
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2040 
2100 
2160 
2220 
2280 
2340 
2400 
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60 

120 

180 

240 

300 



1311 



wo 03/042661 



TGTCCTGCTG TGOTGCOTCT CCTGGGOGGA CCTTTTTTAC TCCTTCCRTG GCCTGGTCCA 360 

TGGCOGGGCC CCTGCCCCTG rm ' LTrm T CACCCCXTTTG GTGGTGGGGQ TCACCATGCT 420 

GCTGGCCACC CTQCTQATAC AGTATGAOCO GCTGCAGGGC GTAGAGTCTT CGGGG6TCCT 4 BO 

CATTATCTTC TG6TTGCTGT GTGTGGTCIG GGGCATOGTC CCATTCG6CT CCAAGATCCT 540 

TTTAiSCCAAG GCA6AG0GTQ AOATCTCAGA CCCCTTCC6C TTOVCCACCT TCTACATCCA 600 

CTTTGCCCTG GTACTCTCTG CCCTCATCTT GGCCTGCTTC AGGGAGAAAC CTCCATTTTT 660 

CTCOGCAAAO AATOTOOACC CTAACCCCTA CCCTGAGACC AGOGCTGGCT TTCTCTCCCG 720 

CCTGrnTTC TGGTGGrrCA CAAAGATGGC CATCTATGGC TACGGGCATC CCCTGGAGGA 760 

GAAGGAGCTC TGGTCCCTAA AGQAAQAGGA CAGATCXXSUS A TGGTOQTGC AOCAGCTGCT 840 

GGAGGCAT6G AjSOAAGCAGO AAAAGCAQAC GGOVGOACAC AAG6CTTCAG CAGCACCTGG 900 

GAAAAATGCC TCOGGOGAGG A06AGGTGCT GCTGGGTGCC OGGCCCAGGC CCOSQAAGCX: 960 

CTCCTTCCTG AAGGCCCTGC TCGCCACXTTT OGGCICCAOC TTCCTCATCA OTQCXTTQCTT 1020 

CAA6CTTATC CAGGACCT6C TCTCCTTCAT CAATCCACAG CT6CTCAGCA TCCTGATCAG 1080 

GTTTATCTCC AACCCCATGQ CCCCCT O CTG GT6GG6CTTC CTG G T G GCT g GQCTGATGTT 1140 

CCTGTGCTCC ATGAT6CAGT CGCTGATCTT ACAACACTAT TACCACTACA TCTTTGTGAC 1200 

TGGGGTGAAG TTTGGTACTG GGATCATG6G TGTCATCTAC AGGAAGGCTC TGGTTATCAC 1260 

CAACTCAGTC AAACOTGCOT CCACTOTGGQ GGAAATTGTC AACCTCATGT CAGTGGATGC 1320 

CCAGCGCTTC ATGGACCTTG CCCCCTTCCT CAATCTGCTG TGGTCAGCAC CCCTGCAGAT 1380 

CATCCTGGaS ATCTACTTCC TCTGGCAGAA CCTAQGTCCC TCTOTCCTGG CTGGAGTCGC 1440 

TTTCATGQTC TTGCTGATTC CACTCAACX3Q AGCTGTGGCC GTGAAGATGC GCGCCTTCCA 1500 

GGTAAAGCAA ATGAAATTQA AGGACTCGCG CATCAAGCTG ATGAGTGAGA TCCTOAACGO 1560 

CATCAAGGTG CTGAAGCTGT ACGCCTGGQA GCCCAGCTTC CTGAAGCAGG TGGAGGGCAT 1620 

CAGGCAGGGT GAGCTCCAGC TGCTGCGCAC GG06GCCTAC CTCCACAGCA CAACCACCTT 1680 

CACCTGGATG TGCAGCCCCT TCCTGGTGAC CXTTGATCACC CTCTG G GT G T ACGTGTACGT 1740 

GGACCCAAAC AATGTGCTGG ACGCCGAGAA GGCCTTTGTa TCTGTGTCCT TGTTTAATAT 1800 

CTTAAQACTT CCCCTCAACA TGCTQCCCCA GTTAATCAQC AACCTGACTC AGGCCAQTGT 1860 

GTCTCTGAAA OGGATCCAGC AATTCCTGAG CCAAGAGGAA CTTGACCCCC AGAGTGTGGA 1920 

AAGAAAGACC ATCTCCCCAO QCTATGCCAT CACCATACAC AGTGGCACCT TCACCTGGQC 1980 

CCAGGACCTG CCCCCCACTC TGCACAGCCT AGACATCCAG GTCCCGAAAG GGGCACTGGT 2040 

OaCCGTGGTG GGGCCTGTGG GCTGTGGQAA GTCCTCCXTTG OTOTCTGCCC TGCrGGGAOA 2100 

GATGQAGAAG CTAGAAGGCA AAGTGCACAT GAAGGGCTCC GTGGCCTATG TGCCCCAGCA 2160 

GGCATGQATC CAQAACTGCA CTCTTCAGGA AAACGTGCTT TTCGGCAAAG CCCTQAACCC 2220 

CAAGCGCTAC CAGCAGACTC TGGAGGCCTG TGCCTTGCTA GCTGACCTGa AGATGCTGCC 2280 

TGGTGGGGAT CAGACAGAGA TTGGAGAGAA GGGCATTAAC CTGTCTGGGG GCCAGCG6CA 2340 

GCGGOTCAGT CTGGCTCGAG CTGTTTACAG TGATGCXX3AT ATTTTCTTGC TGGAT6ACCC 2400 

ACTGTOCX^OG GTGGACTCTC AT6TGGCCAA 6CACATCTTT QACCACGTCA TCGGGCCAGA 2460 

AGGCGTGCTO GCAG6CAAGA CGOQAQTGCT GGTGAGGCAC GQCATTAGCT TCCTGCCCCA 2520 

GACAOACTTC ATCATTCTGC TAGCTGATGQ ACAGGTGTCT GAGATGGGCC CGTACCCAGC 2580 

CCTGCTGCAG CGCAACGGCT CCTTTGCCAA CTTTCTCTGC AACTATGCCC CCGATGAGGA 2640 
CCAAGGGCAC CTGGAGGACA 6CTGGACCX3C 6TT6GAAGGT 6CAQAGGATA AGGAGGCACT 2700 
GCTGATTGAA OACACACTCA GCAACCACAC OQATCTQACA 6ACAATGATC GA6TCACCTA 2760 
TOTGQTCCAG AAOCAOTTTA TOAGACAOCT QAOTGCCCTG TCCTCAGATa GGGAGGGACA 2820 
GGGTCGGCCT OTACGCOGGA GGCACCTGGG TCCATCAGAG AAGGTGCAGG TGACAGAGGC 2880 
GAAGGCAQAT GGGGCACTGA CCCA6GAGGA GAAAOCAGCC ATTGGCACTG TGGAGCTCAG 2940 
TGTOTTCTGG GATTATGCCA AOQCCGTGGO GCTCTGTACC ACXSCTGGCXa TCTOT CTCCT 3000 
OTATOTGGGT CAAAGT6CQG CT60CATTGG AGGCAATGT6 TG6CTCAGTG GCrOQACAAA 3060 
TOATQCCATG 6CAGAC3U3TA OACAGAACAA CACTTCXJCTG AGGCXGGGOG TCTATGCIQC 3120 
TTTAGQAATT CTGCAAGGGT TCTTGCTGAT GCTGGCAGCX: ATGGCCATGG CAGC3GGGTGG 3180 
CATCC31GGCT OCCGGTGTGT TGCACCA66C ACTGCTGCAC AACAAGATAC GCTCX3CCACA 3240 
GTCCTTCTTT GACACCACAC CATCAGGCCX5 CATCCTQAAC TGCTTCTCCA AGGACATCTA 3300 
TGTOGTTQAT OAGOTTCTGQ CGCCTQTCAT CCTCATQCTO CTCaUVTTCCT TCTTCAAG8C 3360 
CATCTCCACT CTTGTGGTCA TCATGGCCAG CAOGCOGCTC TTCACTGTGG TCATCCTGCSC 3420 
CCTGGCTGTO CTCTACACCT TAGTGCAGOO CTTCTATGCA GCCACATCAC OQCAACTGAA 3480 
GOGGCTGQAA TCAGTCAGCX: GCTCACXriAT CTACTCCCAC TTTTCGGAGA CAGTGACTGG 3540 
TGCCAGTOTC ATCGGGGCCT ACAACCGCAG CGGGGATTTT GAGATCATCA GTGATACTAA 3600 
OGTGGATGCC AACCAOAQAA GCIGCTACCC CXACA7GATC T0C!AACXX3OT GGCT6AGCAT 3660 
OGGAGTGGAG TTOGTGGGGA ACT606TGGT 6CTCTTT6CT GCACTATTTG CCGTCKTOGG 3720 
GAGGAGCAGC CTQAACCOGO GGCTGGTGGG GCTTTCTGTG TCCTACTCCT TGCAGGTGAC 3780 
ATTTGCTCTG AACTGGATGA TAOGAATGAT GTCA6ATTTG GAATCTAACA TC3GTGGCTGT 3840 
OQAGAGGOTC AAGQAGTACT CCAAGACAQA GACASAQGGG CXXTGGGTGO TGOAAGQCAG 3900 
OOGCOCTOCC GAA0GTT6GC OCOCAOGTGO G6A86TGGAG TTCOGGAATT ATTCTGTGOO 3960 
CTACOQQCOQ GGCCTAQACC TGGT6CTGA0 AGACCI6AGT CTGCATOTGC AOSOTOGOQA 4020 
GAAGOTGGGO ATOGTGGGCC GCACTGGGGC TGQCAAGTCT TCCATGACCC TTT6CCTGTT 4080 
CCXSCATCCTG GAG6CGGCAA AGGCTTGAAAT C0GCATT6AT GGCCTCAATG TGGCAGACAT 4140 
CGGCXrrCCAT GACCTGCGCT CTCAGCTOAC CATCATCCCG CAGGACGCCA TCCTGTTCTC 4200 
GOGGACCCTG CGCATGAACC TGGAGC3CCTT OGGCAGCTAC TCAGAGGAGG AGATTTGOTS 4260 
GGCTTTGGAO CTGTCCCACC TGCACAOQTT TGTGAGCTCC CASCC86CAQ GCCTGGACTT 4320 
CCAGTGCTCA GAGGGCGGGG AGAATCTCAO OGTGGGCCAG AG6CAGCT0G TGTGCCTGGC 4380 
CCGAGCCXTTG CTCOGCAAQA GCGGCATCCT GGTTTTAGAC QAGGCCACAG CTGCCATCX3A 4440 
CCTGGAGACT GACAACCTCA TCXIAGGCTAG CATCC6CACC CAGTTTGATA CCTGCACTGT 4500 
CCIGACXIATC QCACAOOGGC TTAACACTAT GATGGACTAC AOCAGGGTCC TGOTCCTOQA 4560 
CAAA6GA6TA QTAGCTGAAT TT6ATTCTCC AGCCAACXrrC ATTGCAGCTA GAQGCATCrr 4620 
CTAOGGGATO GCCAGAGATG CT6GACTTGC CTAAAATATA TTCCTGAGAT TTCCTCCTQG 4680 
CCTTTCCTGO TTTTC3VTCAa GAAGGAAATG ACACCAAATA TGTCCGCAGA ATGGACTTGA 4740 
TA6CAAACAC TQGGGGCACC TTAAGATTTT GCACCTGTAA AGTGCCTTAC AGGGTAACTG 4800 
T6CTGAAT0C TTTAGATGAG GAAATGATCC CCAAGTG6T0 AATGACAOGC CTAAOGTCAC 4860 
AGCTA0TTTG AGCCACSTTAQ ACTA6TCCCC 0GGTCTCCXX3 ATTCCCAACT GAGIGTTATT 4920 
TGCACACTGC ACTGTTTTCA AATAACQATT TTATQAAATG ACCTCTQTCC TCCCTCTGAT 4980 
TTTTCATATT TTCCTAAAGT TTOGTTTCTG TTTTTTAATA AAAAGCTTTT TCCTCCTGQA 5040 
ACAGAAGACA GCTGCTGGGT CAGGCCACCC CTAGGAACTC AGTCCTGTAC TCTGOGOTGC 5100 
TGGCTOAATC CATTAAAAAT aOQAGTACTG ATGAAATAAA ACTACATGOT CAACAGTAAA 5160 
AAAAAAAAAA AAAAAA 5176 

Seq ZD NOt C180 DNA Sequence 



1312 



wo 03/042661 



PCT/US02/36810 



Nucleic Acid Accession fit nm_004626 
Coding sequencei 124.. 1188 ~ 
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TAACCOGCCG 
CAGGOGGAGQ 
A0GATGA66G 
OGOCSTOTGCT 
AACX:AaA06C 
CGCAGCAACC 
TGTOGCGGGG 
TATTTGCTTG 
6C0GCCATGA 
TGOGGCCCOQ 
AACCTCA6CT 
AAAACAGOAT 
GCTCTGC606 
ATC06CACCT 
CXSATACCTGT 
CCCAAGGACC 
CCTGACTTCT 
AACAAQACAT 
CCCTACACAO 
ACCTGC06CA 
COCCCCACQC 
GGAGACACTC 
TQCTTTGCCT 
CftGGACATCA 
GCCCACAG6C 
TTTCAGGGCC 
TGOGGGACCC 
TAGCTTGGGT 
GCTACAGAAA 
TTATATATTT 
AGCTTTTTAA 
CGCGGCCTCO 
CCAG6CA 



11 
I 

ACTTCGGCXsC 
06CGGGC0CA 
ATGGCATCAA 
AACACTGCAA 
TGGAGCTCAT 
CCTTTGCCGA 
ACCT66A6A0 
GOCAOGCCAT 
TOCCAGGTGA 
AOGGGCTCCT 
CCCAAGCCAA 
CCTCTCTGGA 
GCTOGAAGGQ 
CGGCCACCAA 
TGGATATCCG 
GCATGAAOAA 
CCAACGGAAG 
ACOGCQTGGT 
GGTGTGAGCG 
AGGAGCGAGG 
CATGGAGCTC 
TCACTTGGAA 
AAGGAAACCG 
T6GTGTAAGG 
ACAGAGACCA 
CCTGGCCCAC 
TGTAAATGTT 
TATTTATAAA 
TATATATATA 
6CAGCTQTAT 
TCAAOTGAAC 



21 
1 

CCCCGGCTGC 
GGCTCCTOCT 
GGT C TGOGAO 
GTG6CTG6G6 
GCAGCTGGAO 
GCACAOGGTG 
CATGG6CTGG 
A6GGACCCG0 
C3GOCX3GG6CC 
OCCACCCGGG 
CATGGGGGCC 
TAAACTQATG 
AATGAAGTGT 
GCTGCAG6A0 
GGTAGTGCAC 
GCCTGTG/UVG 
TGAGAAGGTG 
CX3ACAGCTGC 
0GAG0G6TGC 
TACOGTGGAG 
ACTCTGCTCA 
TGCTTGTGAA 
GCCACCA6GA 
ACAAGATTAA 
AG0GGG6CTT 
GCCTCOGGGA 
CCTGGGGTCT 
TTCTTTTGTT 
ACATAGCTTT 
AATATATATG 
GAAATAAATG 
T0QGCA6ACC 



31 
I 

AGG0GG06TG 
GGGTGTGACC 
G0GC7GCTCT 
CTGTCXAAGA 
GGTCTGGTGT 
GTGCACX3CGG 
AACTGCTCCT 
GAGTOGGCCT 
TGCA0CTCCX3 
CCOQGGAACC 
AAGTTTTCCG 
CGTCTACACA 
AAGTQCCATG 
CTQCAGGATG 
G6ACCCAT66 
GACTCGGAAC 
GQCrCCCACG 
GACGTTATGT 
CACTOTAAOT 
CGCTATGTCT 
AGGACCCTCA 
TTCCAGATGC 
ACAGAAGGTC 
AAATAACTTG 
GGGATGG6TG 
AGGGGTCT6C 
GAGCCTGCTG 
TTTTGCTTTT 
TTCTTTOGGQ 
TATATATATA 
CTQAOTGAOC 
CTGGGGCTGO 



41 51 
I I 

CAGGACCAGC GGGGGCOGTG 60 

CXX3GGCQ0GC CCGCCX30GO6 130 

T00CCCTG6C GCTCCAGACC 160 

CACCATOGGC CCTG^ACTO 240 

CTGCACAGGT GCAGCTGTGC 300 

GCOGCGAGGT CATGAAGGCC 360 

CCATTGAGCT CGCCCCCAAC 420 

TGGTGTATGC GCTGTCGGCC 480 

GCQAGCTGCC 066CT6CTCC 540 

GCTGGGGAGG ATGTGCGGAC 600 

ATGCTCCTAT GAAGGTGAAA 660 

ACAGTGAAOT GGGGAGACAG 720 

GGGTGTCTGG CTCCTCCTCC 780 

TGGCTGCrOA CCTCAAOACC 840 

GCftCCCGCAA GCACCTGGTG 900 

TCGTCTATCT GC3VGAGCTCA 960 

GGACACAAGA CAGGCAGTGC 1020 

GCTGCGGGCG TG6CTACAAC 1080 

ACCACTGGTQ CTQCTAOQTC 1140 

GCAAGTGAGG CCCTGCCCTC 1200 

OCAACTGQGG CCAGOGGCCT 1260 

O^CATGGG AGGCGGCTTG 1320 

TGGCCACCCT GGAAGGAGGG 1380 

GCAGCCTQAG GCTCTGGAGT 1440 

AGACTGATAC AGACTTCSICX: ISOO 

CC3GCCTTCTT CAGAATGTTC 1560 

GGCCCACCAC ATGGAATCAC 1620 

TCTTCxrrrTO ggatgtggaa leso 

TGGCACTTCT CAATTCCTCT 1740 

ATGATCrCTA TTTTAAAACT 1800 

CCCAGCCOGC 'CCCT6CAGTT 1860 

CAQAOGaASC TCTCCAOTTT 1920 
1927 



Seq ID NO: C181 DMA Sequence 
nucleic Acid Accession #: NM_031B66 
Coding sequence i 6 . . 2090 



ACAGCATGGA 
TOCAGOQCTC 
CCGTGCOGCT 
ACGACACGCA 
AGTGCTCGCC 
ACTACAAGAA 
OGCCQCTCAT 
CCGAGCAAGG 
CCGCQCCCAG 
GCAGCGGCCA 
GTG60GGCGG 
GGCCCCCTGG 
TGA6C6TGTC 
ACTGCGCGCT 
TCTGOATCGQ 
TCCTTATCGA 
GCTACCTCTT 
0GT6CAG0GQ 
GOGOGGGCGC 
TQGGOOOGGT 
TCTTCTTGCr 
TCACATGOTT 
AGTACTTCCA 
TCAGCTOOGT 
ACAACCTGOG 
TCCTQCTGGC 
QCCCCACCAA 
TCTACACCGT 
CGCQCTGGGA 
GCAGGCCCGA 
CCTCGGGCGT 
GCTGCTGCTG 
GGGGTGGOGG 
GOGGCGGCGQ 
CGGOGAGCTC 
GGGGGOGCCC 
TTGATGGGCT 
TCTTAATGGT 
TGAAGCCTCC 



11 
I 

GTGGGGTTAC 
TAG06GCGCT 
GTGTAAGGGC 
AOACGAGGCG 
CGATCTCAAG 
GCCGCTGCOO 
GCGCCAGTAC 
CAACCCTGAC 
CCCGCCGOGC 
OGGCCGCCOO 
GGAOGCGGOG 
CGGCGGCGC6 
CAGCQAQCGC 
GCCCTGCCAC 
CCTGTGGTC6 
CATGGAGCGC 
OGTGTOGGTG 
TGGCGCGCCG 
GGCGGGCGCG 
6GA6CAGCAC 
GGTCTACTTC 
CCTGGOGGCC 
CCTGGCCGOG 
GGACGGOOAC 
CGGCTTC6TG 
CGGCTTOBTG 
GACGCACAAG 
GCCCGGOGCG 
GGCCACGCAC 
CTACGCCGTC 
GTGGGTCTGG 
GGCCAGGAAG 
CGGGCCGGG6 
GGGCTCCCTC 
CGTGTCTTAT 
AGGAGGGGTG 
GAGGTTCCCA 
ATCCATTAGC 
CAGACCCAGC 



21 

1 

CTGTTGGAAG 
GOGGCOGCCT 
ATCCGCTACA 
GQCCTGGAGG 
TTCTTCCTGT 
CCGTGCOGCT 
GGCTTOGCCT 
ACGCTGTGCA 
CGCCTQCCGC 
CCGGGGQCCA 
GOGCCCCCAO 
GCTCCCTGCG 
CACOCGCTCT 
AACCCCTTTT 
GTGCTCTGCT 
TTCAAGTACC 
GGCTACCTAG 
GGOGOOGGGG 
GGCGOGGGGG 
GTGCGCTAOG 
TTOGGCATGG 
GGTATGAAGT 
TGGCTTGTGC 
CCGGTGGOGG 
CTGQCGCCGC 
TCCCTGTTCC 
CTGQAGAAGC 

AACTGCCCGT 
TTCATGCTCA 
TCCGGCAAQA 
GGOGCCGCGO 
GGC6G0GGCG 
TACAGCGACG 
CCAAAGCAGA 
GOGAGGGGGG 
CCCCITCACA 
TOGGACTTAA 
CCCTTTTOCT 



31 
I 

TGACCTOGCT 
CGGCCAAGGA 
ACTACACCTA 
TGCACCAQTT 
GCAGCATGTA 
CGGTGTGOGA 
GGCCCGACCG 
TQGACTACAA 
CGCOGCCGCC 
GGCCCCCGCA 
CTOQOGGCGO 
AGCCCGGGTG 
ACAACCOCGT 
TCAGCCAGGA 
TCGTGTCCAC 
0GGAG06GCC 
TGC6CCTGGT 
GCGCTGGGGG 
GCCCGGGOGG 
AGACCACCGG 
CCA6CTCCAT 
GGGGCAAOGA 
CCAGOGTCAA 
GCATCTGCTA 
TGGTCATCTA 
GCATCOGCTC 
TGATGATCCG 
CCTGOCTCTT 
GCCTGOGGGA 
AGTACTTCAT 
CGCTGGAGTC 
TGGGOGGGGG 
GCGGGGGACC 
TCAGCACTGG 
TGCCATTGTC 
CGAGQAGACC 
GTGTTGATTG 
ATGACTCACT 
OCATTGATGT 



AGCCATCGCC 
GTCCATCGCQ 
GOTGGGCAAC 
CCTCTTCATC 
GGTCATCAAO 
CCTGGGCCTG 
CTAOQAGCAG 
CCTGCAGCCC 
GTGCCTAGTG 
CTGOCGCTCC 
CGCGGQCGCC 
CQG0GG0G6C 
CCTGACGTGO 
CCAGGTCTQA 
CAAGTGCAGC 
CTATTAGCAT 
TAGAACAAAG 
GGGGGGAGCr 



51 
I 

TTGGOGCTGC 
CAAGAGATCA 
CAGTTCftACC 
OTGGAQATCC 
TGCCTAGAQG 
GCCGGCTGCG 
GACCGGCTGC 
CTAACCACOS 
C O GCCTTOOQ 
GGCAGGGGGQ 
GGGAAGGCGC 
GOGCCTATGG 
CAGATOQCTA 
TTCACCGTCT 
GTCTCCACCT 
CTCTOGGCCT 
GAGAAGGTGG 
GCGGOGGOGG 
TAOGAGGAGC 
TGCACOOTGG 
ATCTTGTOGC 
GGCTACTOGC 
QTGCTGGCJGC 
CAGAGCCTGO 
GGCACCATGT 
CAACAGGAOG 
TTCACOGTGC 
CACAACCGCC 
GACCAGOCAC 
GTGGGCATCA 
CTGTGCACCC 
A06GCCGCGG 
GGGGGGCOGG 
0QGTGS66CA 
GOGGAGGGQA 
GAAGGGACAC 
QATAATGAAC 
TACCTGGCAT 
CCTCCOGOCA 
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CGCGTTAATT 
7GQA0CCCTC 
GA6AACCTCT 
0AG6AGGGGT 
AAAT6CCTTA 
TTCTTACATT 
AAAATATGTA 
TACCCCTQTA 
GATTTAACCA 
AGGACGCATG 
TGGAGAAAAT 
TG AACTGA AA 
CTCTTTTCTQ 
AOGCAOTAAT 
AACCCTGAAA 



TCTOTTGGCT 
OCTGGCTGCA 
TTTTCTCCCT 
GACCGCCACC 
AOTAAACAAA 
AGA0GAT6TA 
TATATCCAAA 
AGAACAOATA 
TTGCCCTCTC 
OGGAAAATGO 
TACTTGTTAA 
TCAAAATTGA 
ATCCATGOAT 
TCCCCTTAGG 



GAOGAGGGTG 
CTTOGCTOGG 
CQACTCTTCC 
TGATGGGATT 
CAAGAAATGT 
TTTATATAAT 
QATATAGTGT 
TAAGTATTCT 
CCCOGCCTCT 
ACATTTTCTG 
ACTCTAGTTC 
GTTTTTGCAC 
AACAACTCrC 
CCTTGTTATT 



6ACTCT6CGG 
TTTOCAOTCA 
TAOQTAAACT 
QCACGGTTTG 
CTTAATTATA 
TATTTGTTAA 
GTACATTTTT 
ATTTTGTCAA 
TCTGAOCTGT 
GCTTGTCATT 
TTAAGTTGTT 
CTTCCCCAAA 
ACTTTAGTGG 
TATCCTGCAT 



8eq ZD NO: CI 8 2 DNA Sequence 
Nucleic Acid Accession «s XM_05062S 
Coding sequence 1 222.. 1109 ~ 



I 

CCOGOTOGGA 
CCGGTGTCCC 
GCACCCAGCG 
CGGCTCCGCT 
CGCTGCTGCT 
TCTTTGGCCA 
TGCAGCTGTG 
A6ACCATGAA 
GCCACCGGGA 
TAGACGAGAC 
CGGTCATGTC 
AGGACAACGA 
AAGCTCCAAA 
AAAOGCTTTG 
ACC6AGATAC 
TGTCCGAAAG 
GTGAGQAGAT 
AGCTGGTGAT 
CCOGCAGCAT 
CTCCAGAGCA 
TCCTAGCT6C 
CCAGCATTTC 
GCCCACOCGA 
TT 



11 
I 

GCCCCCCGGA 
GCTTCTCCGC 
AA6AGAGCGG 
CCCTCT6CCC 
GCTCTTCCTC 
GCCCOACTTC 
CCACGGCATC 
GGAGGTGCTG 
CACCAAGAAG 
CATCCAGCCA 
CQCCTTCGGC 
CCTTTGCATC 
OGTATGTGAA 
TAAAAATGAT 
CAAAATCATC 
GGACCTGAAG 
GAACGACATC 
aVCCTOGGTG 
COGCAAGCTG 
CQQCTGACCA 
TCCAGTCTCA 
CTQAGTTATA 
ATCTTGTAQA 



21 

I 

GCTG0QCGC6 
GCCGCAGOCG 
GCCCOGGAGA 
CCTOGGGGTC 
GCCTC6CACT 
TCCTACAAGC 
GAATACCAGA 
GAGCAGGCGG 
TTCCTGTGCT 
TGCCACTOQC 
TTCCCCTGGC 
CCCCTCQCTA 
GGCTGCAAAA 
TTTGCACTQA 
CTGGAGACCA 
AAATCQGTGC 
AAOGOGOOCT 
AAOCGCTGGC 
CAaTGCTAQT 
TTTCTGCTCC 
GCCTGGGCAG 
AGGCCACA60 
AATATTCAAA 



31 

I 

GOCTTGCAGC 
CCGOCTGCCA 
AGCTGQAACT 
GCGC6CCCAC 
QCIGCCTOGQ 
GCAGCAATTG 
ACATQC8GCT 
GCGCTTGGAT 
OGCTCTTOGC 
TCTGCGTQCA 
CCGACATGCT 
GCAGOQACCA 
ATAAAAATGA 
AAATAAAAGT 
AGAGCAAGAC 
TGTGGCTCAA 
ATCIGOTCAT 
AQAA6GG3CA 
CCC6GCATCC 
GGGATCTCAO 
CTTCCCCCTG 
AGTGGATAGC 
CTAATAAAAT 



Seq ID NO: CI 83 DMA Sequence 
Nucleic Acid Accession fti MM 001306.1 
Coding sequencei 199.. 861 ~" 



AATTCGGCAC 
GO60GCGC06 
CGCCAGGCCC 
GGCCTTGCCX3 
CTG6GCTGGC 
ATOGGCA6GA 
GTGGAQAGGA 
GACCTTCAGO 
CTAGTGQOGC 
AAGATCACCA 
GTOTCCIGGT 
CAGAA60606 
CTGGGGGGOG 
AAGGT06TCT 
GAGOGCAAGG 
CAACACGACC 
GCCTCGQAGG 
TC0CCA6CAG 
GCATGQACTG 
ACCACCCOGT 
GCCCTGGAAO 



11 
I 

GAOQGC AGGT 
TGGGT6AGTC 
AOOGGCCCCG 
C3GGCAGCCAT 
TQGGCACCAT 
ACATCATCAC 
CGQGCCAGAT 
CGGCCC60GC 
TGGT6GGCGC 
T0GT60CAGG 
OGGCCAACAC 
AGATGGGOGC 
CGCT6CTCTG 
ACTCOGOGCC 
ACTAOGTCTA 
ACCAOCACCQ 
CCAGCCCACC 
CCACGGCTTT 
TGAAACCTCA 
CGAGCCCCAT 
GG6CACTTGA 



21 
I 

OCAGGGGCAC 
AGTCCGTCOO 
GCCCCTCOTC 
GTCCAT03QC 
CGTGTGCTGC 
GTGGCAGAAC 
GCAGTGCAAG 
CCTCATCGTG 
CCAGTGCACC 
C6TGCTGTTC 
CATTATC0G6 
GGGCCTQTAC 
CTGCTCGTOT 
GCGCTCCACC 
AGGGACAGAC 
CQAGCTGGAG 
CCCAGAAGGC 
GCGGGCCTGGG 
CCCTTCTGGA 
OGGGCOQCTG 
TATTTTTCAA 



31 
I 

GCGGOGAGAG 
TCOOTCOOTC 
TCCCCGCACC 
CTGGAGATCA 
G06TTGCCCA 
ATCTGGQAG6 
GTGTAOGACT 
GTGGCCATCC 
AACTGOQTGC 
CTTCTOGCCG 
OACTTCTACA 
GTGGGCTGGG 
0C0CCA060Q 
GGOCCGGGAG 
GCAGGGAGAC 
CGOQCACCAa 
AG6AAGCCGC 
CAGTCGACTT 
GCACGGGGCC 
CCCCCATGTC 
TAAAAOCCTC 



CGTTTCCAGA 
6ATACACAGA 
CCCACCCCTG 
GGTATTCTTA 
CACCCCACGT 
ATTGTAAAAA 
TTGTAAAAAG 
TAAAATGACT 
CACCTTTAAA 
CTGTACACTO 
AGCCAAGTAA 
GACGQTGTTT 
AT6TAAAT6G 
6GTATCACTA 



41 

I 

G CCTCG CCCG 
6CTTTTCGGQ 
C0GGC06CCT 
GATGCTGCAG 
CTCGQCGOGC 
CAAQCCCATC 
GCCCAACCTG 
CCOGCTQGTC 
CCCCGTCTGC 
GGTGAAGGAC 
T6AGT6CGAC 
CCTCCTGCCA 
TGATGACAAC 
6AAGGAGATA 
CATTTACAAG 
AGACAGCTTG 
GGQACA6AAA 
QAGAOAGTTC 
TGATGGCTCC 
CTCCOQTTCC 
CCTTTTGCAC 
TQTTTTCACC 
CATGAATATT 



41 
I 

C0TATGQA6C 
C6T0GGGGGQ 
OGGAGCCACC 
CGGGCACCGC 
TGTG6CG0QT 
GCCTGTGGAT 
CGCTGCTGGC 
TGCTGGCCGC 
AGGACGACAC 
CCCT6CTCAC 
ACCCOGTGGT 
OGGCCGOGGC 
AGAAGAAOTA 
CCAGCCTGGG 
CCCACCACCA 
GCCATCCA6C 
OG06CTQGAC 
CGGGGCCCAO 
TQGGTGACOQ 
GOGCTGGGCA 
TCOTTTTAOC 



ACCOGAGATT 
TTTCACCTGG 
ACTTACCCTO 
ATGACCAG6C 
AAATAGQGGT 
AAAAAAGTGT 
TTTAGAGGCT 
TTTGATAAAT 
GTGCrrSCTA 
AGCTTAGOCA 
ATATCATTGT 
TTCATGGGAG 
AACTTCTGCA 
AAGGTTTCAA 



51 
I 

OGCTGTCCTC 
GOCCOGAGTC 
C6C0CTTCCC 
GGGCCT6GCT 
GGGCTCTTCC 
CCTGCCAACC 
CTGGG0CAC6 
ATGAA6CAGT 
CTOGATQACC 
CGCTGCGCCC 
CQTTTCCCCC 
GOCACOSASG 
GACATAATGG 
ACCTACATCA 
CTGAAGGGTG 
CA6TGCACCT 
CAGGGT0GG6 
AAGOGCATCT 
GACAGGCCTG 
CCAAGCACAC 
GTTTGCATCC 
TAAAGGAAAA 
TTTATQAA8T 



51 
I 

CGAOCOOTTA 
C06CAGCTCC 
GGGTGGAGOG 
GCTGGCOGTG 
QTCG6CCTTC 
GAACTGOGTG 
ACTGCCACAG 
CTTCQGGCTG 
GGCCAAGGCC 
CCTCGTGCOG 
GCCOGAGGOO 
GCTGCAGCTG 
CACGGCCACC 
CACAGGCTAC 
CCACCACCAC 
GTGCAGGCTT 
TGGGGCAGCT 
GGACCAACCT 
CCAATACTTG 
GGGACC06CA 



Seq ZD KOt C184 DNA Sequence 

Nucleic Acid Accession Us m 012449.1 

Coding sequence* 66..108S ~ 



11 



21 31 41 51 

I I t t I I 

COQAGACTCA CGGTCAAGCT AAGGCQAAQA QTGGGTGGCT GAAGCCATAC TATTTTATAO 
AATTAATGQA AAGCAGAAAA GACATCACAA ACCAAOAAGA ACTTTOQAAA ATGAAGCCTA 
OGACAAATTT AOAAGAAGAC OATTATTTGC ATAAGGACAC GOOAQAGACC AGCATOCTAA 

1314 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2620 
2880 
2940 
3000 
3060 
3120 
3180 
3195 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

640 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1382 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 t 

1060 

1140 

1200 

1250 



60 

120 

180 



wo 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



AAAGACCTGT 
CAOAACTTCA 
CTATTATAGC 
CAACTTOCCA 
CAATGGTTTC 
TCCAACTTCA 
TAACAAQAAA 
GTCTGTCTTA 
AGGTCCAACA 
ATCSTOTCTCr 
CATCTGTGAG 
TTGTTTCCCT 
ATATAAAACA 
TTOTTOTCCT 
AGATTAGACA 
TGTAOAATTA 
TCAAGmXJT 



6CTTTTGCAT 
GCACACACAG 
ATCTCTGACT 
TCAACAATAT 
CATCACTCTC 
TAATGGAACC 
GCAGTTTGOO 
CCX3UVTGAGG 
AAATAAAGAA 
GGQAATT6TG 
TGACTCTTTG 
TCTACTGGGC 
ATTTGTATGG 
O ATATT TAAA 
TGGTTGGOAA 
CTOTTTACAC 
AlTWiTAAT 



TTGCACCAAA 
QAACTCTTTC 
TTTCTTTACA 
TTTTATAAAA 
TTQ6CATTGG 
AAGTATAAGA 
CTTCTCAOTT 
OCSATCCTACA 
GATGCCXGGA 
G6ATTGGCAA 
ACATGGAGAG 
ACAATACACG 
TATACACXrrC 
AGCATACTAT 
GAOSTCACCA 
AGATTTTTGT 
AAAATGATTA 



cagcxx:atgc 
cacagtggca 
ctcttctgag 

TTOCAATCCT 
TTTACCTOCC 
AGTTTCCACA 
TCTTTTTTGC 
6ATACAAGTT 
TTGAGCAT6A 
TACTQGCTCT 
AATTTCACTA 
CATTGATTTT 
CAACTTTTAT 
TCCTGGCAT6 
AAATTAACAA 
TCAATATTGA 
TTCAAGGAAA 



TGATGAATTT 
CTTGOCAATT 
GGAAGTAATT 
GGTCATCAAC 
AOGTGTQATA 
TTGGTTGGAT 
TGTACTGCAT 
OCTAAACTGO 
TGTTTQ6AGA 
GTTGGCT6T0 
TATTCAGAGC 
TGCCTGGAAT 
GATAGCTGTT 
CTTGAG6AA0 
AACTGAGATA 
TATATTTTAT 
AAAAAAAAAA 



GACTQCCCTT 
AAAATAGCTG 
CACCCTTTAG 
AAAGTCTT6C 
GGAGCAATTG 
AAGT6GATGT 
GCAATTTATA 
GCATATCAAC 
ATQGAGATTT 
ACATCTATTC 
AAGCTAGGAA 
AAOTGQATAG 
TTCCTTCCAA 
AAGATACTGA 
TGTTGCCAGT 
C3\CCAACATT 



Seq ID NOt Ci85 DNA Sequence 
Nucleic Acid Acceseion flt NM_00177S.I 
Coding sequence ; 70 . . 972 



11 



CTAAAGCTCT 
TGGAGCCCTA 
CTCTCTAGGA 
GTGCTOGOGQ 
CGCTTTCCGG 
AGACATGTAG 
CCTTGCAACA 
CCTTGCAACA 
6TCXAG06GG 
ACATGGTGTG 
AAGGACT6CA 
GAAGCTGCCT 
AAAAACAGCA 
CTAGAG6CCT 
ACX3VTAAAAO 
ATCTACAGAC 
TCTQAGATCT 
CATCATACAT 
CAATAAG6TC 
CATCAGCATA 
AAT6AAAATT 



TGGCCAACTG 
GAGCCCAACT 



A6ACC6TCCT 
ACTGCCAAAG 
TTACTQAAGA 
AGATTCTTCT 
ACATQTTCAC 
GTGAATTCAA 
GCAACAACCC 
GTQATOTGGT 
CTTTTGGGAG 
GGGTGATACA 
AGCTOQAATC 
CTGACAAGTT 
GA6CCAGTCG 
GACTCAGCAT 
AAT6CCA6AG 
CCTTTATTQT 
GTATGTTAAG 



21 
I 

AGCCTCCTGC 
CGAGTTCSXGC 
CTGTCTTG6C 
GAQGTGGCOC 
G60G0GATGC 
T6TATGGGAT 
AGACTATCAG 
TTGGAGCAGA 
CCTGGAGGAC 
CACTTC CAAA 
TGTTTCi^TA 
CCATGTGATG 
TGTGGAAGTC 
TGGTGGAAGA 
GATTATAAGC 
TCTTCAGTGT 
CTGTGGTTGT 
ACCTGCTGQT 
ACX3GAAGCCT 
GATCTATCAA 
TTACTTCCTT 



31 
I 

CGGCCTCATC 
CCGGTGTCCXj 
GTCAGTATCC 
CAGAOOTGGA 
GTCAAGTACA 
GCTTTCAAGG 
CCACTAATGA 
ATAAAAGATC 
ACGCTGCTAG 
ATAAACTATC 
TTCIGGAAAA 
CTCAATGGAT 
CATAATTTGC 
GAAGATTCCA 
AAAAGGAATA 
GTQAAAAATC 
TTTAGCTCCT 
GCAGAQCTGA 
TTTTCOCCAA 
TAGTGAAOAA 
TAG 



41 
I 

TT06CCCAGC 
G66ACAAACC 
TGGTCXrrOAT 
GOQGTCOQGG 
CTQAAATTCA 
GT6CATTTAT 
AGTTGGGAAC 
TGGCCCATCA 
GCTACCTTGC 
AATCTTGGOC 
CeGTTTCCC33 
CCCGCAGTAA 
AACCAGAGAA 
GAGACTTATG 
TTCAATTTTC 
CTGAGGATTC 
TGACTCCTTG 
AGATTTTGGA 
AGTCT7AAAA 
AAATTATTQT 




CACCACCAAG 
TCCTGAGATG 
TTCAAAACAT 
TCAGACOSTA 
GTTCACACAO 
TGATGACCTC 
AGA CTGQA GA 
GAGGTTTGCA 

aatctttgac 
ggttcagaca 
cx:aggatccc 

CTGCftAGAAT 
ATCTTGCACA 
TGGTTTATGT 
GGGTCCTCCA 
TAACTTATAT 
ATAAGATTAG 



Seq ID NO: C186 DNA Sequence 
Nucleic Acid Accession #« XM_120513.2 
Coding sequence: 1..2208 " 



1 
1 

ATGQTGTCAT 
TAGGCCTTOC 
AGQACCCCAA 
TTTTCAGAAG 
TACTCTGACX; 
GTCATGCAGG 
GCCTGGAGTC 
GCCCCCCOOO 
AOGGCACAGA 
CGGA6GCACC 
COSAGGCGCC 
ATCTACAAGC 
CT06CCAAGC 
CCG6C6CAGC 
GTCAGACTCG 
CCGCCCCTCC 
TCCCOGCCAT 
CATACACATA 
TTTGGATTGA 
ATTCTTGGAT 
GTGGCGGTCC 
TTGGATGCCT 
CTCCTGGACT 
TACAACX3AAA 
AATGTAQACA 
GTTTTCACTG 
TCGTCA6CCA 
CATACATTAG 
TCCAGAGAGC 
GGTGCTTCAA 
CTTQAGGATG 
AOCTCCTTGO 



11 

I 

OCAOGTTCTC 
GAOCTTTTAT 
A6CCCCGTAA 
GATCGGAGAO 
ACCCCGCCCC 
GGGCGCGAOO 
CACATCTGGG 
GGGGGGCTCG 
CACACACCCT 
CTCQGAGCCC 
GGATGOAGGA 
AGCX3CAGCCC 
GCTACGAGCO 
CX3CCGCCCGC 
GGGCTCCTCC 
TTCCTGCCCC 
CCCCTC6TCC 
TGTGGAGGTC 
TGGCTCAGAG 
CTTGGGGTAC 
TGCTGCCAAG 
TTGCCCAGCA 
CCAACCATTC 
QACAGGAGCA 
TAGAGATGCA 
ACXCTCTACA 
6TGAATATGG 
AAAATATTTC 
CCCCAGTCAA 
OACCATTTTC 
GCATCCGCAG 
GAAGCAAOCA 



21 

I 

GGGGCCCCTA 
GTTCCOCATG 
ACACCGCGCG 
CAATTCCCOA 
AA06ACTAGC 
A6CT0QTTTC 
CATTGGCGAG 
GGQATOCGAG 
CCCACAOGCG 
GG06CCCG6C 
AQAGATGCAG 
CTACAGCQTC 
ACCCACCCTQ 
GTCCCCCGCC 
GCGTCC6CCA 
GGGAGTCGCA 
CCX3TCCGTGG 
CCAGTCCACA 
AAGATGGCAG 
TTGGCCATGT 
GTCTGAGG6T 
OTOCAGTCGA 
TCA6TCCATG 
CTGTCACATT 
GTATATGCAA 
AAATTACCTG 
CCATCTGGCC 
ACTTGATTCC 
GAGTGATTTT 
AGGGGCACAG 
CCCCQTCCCT 
GCXGTTGAAC 



31 
I 

GGGGAAACAA 
AGCTCAGAGG 
ACCACCCGGG 
TTGGTAGAAG 

GGAAOCCGGA 
GCAGCCTGTC 
GCTCA3CQAC 
OGCACCAGGG 
GGGQAQGGGA 
COGOCAGAGG 
CTCAAGACGT 
GTGGAOCTGC 
GCCTCCTOGT 
CQCOQTGGAT 
GGCACTCTGC 
CAGQCTGCOO 
CTTCCAGQAT 
CATAGATCTT 
GGACAGGATT 
AATACTGCTC 
GTTCTTAGCC 
ATTTCTTGCG 
GGGAAAGGGG 
AGGAAACAAC 
CTCTCGGGAA 
GA0C3TGQATC 
ACAGCTTCCC 
CCAAATCCTT 
CAAAGCATOQ 
TTQAOTTOTG 
AACTATCCTG 



41 

I 

ATGAAAA06T 
C06CQATGCT 
CCAAGATCTT 
AACTTGCTOT 
TGCAACCCCG 
CCCOGCCOGC 
AATCAGGAGC 
AGGCGGCGGC 
CAGACCCGGC 
CGTGCTCCGA 
AGGGQCCCAO 
TCCCCAGCAA 
CGCACGGCCA 
CGTCTTCGTT 
TTOGGGCGOO 
TCCCACCGCC 
OSCOGCGGGG 
CIGACACCAT 
TAAA6CA6TT 
GGCTGGAGAA 
CTAAGAAGAG 
TCTTAAATTG 
TAAAGCAGGA 
TOCACAGTCA 
AAACTTCTGC 
GCTTTCCAAC 
CTCTGTCAAC 
TGTGTAAATC 
TGCAGCAGGC 
CTTACAGGOT 
AGGCCTTGGA 
TCTACATAAC 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

119S 



51 
1 

GAAAAAGTTC 
CGGGGAAAGC 
CAAGAGGTTC 
AATACACACO 
AQAGTTTGGG 
A0CC6CAQAA 
TCGGGOGGCA 
GGCGGOGGCC 
GGGCAGGOCSG 
GGGACXX»CC 
CX3TCCCCAAA 
GAGACCGGOG 
CTTGAOGACT 
CGC06CTGTC 
GGGAACCATC 
CACGTOGTOG 
A6GGACCTCA 
GGTCTCTGTC 
TGAGTGGGGA 
GQAGGGTCAO 
TOtSAATGATC 
TOGAGGAAAA 
AGGCTCAAGT 
GACCTCAGAC 
CTTTTTGAGG 
TCCAAACCCC 
CrCTCCTGTQ 
TAGGCATCTA 
CTTGQCTGGG 
GAACTCTGAA 
AATGGATTTG 
GAGCAAACA6 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1233 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 



1315 



wo 03/042661 



PCT/US02/36810 



TGGGATGAGG CTGTAAATTC TTCAAAGAAA GATGGGAGAC GGCTCCTTOG ATACCTCATC 1980 

AGATTTGTrr TCACAACCQA TGACCTTAAG TACTCATGCO OCCTTGQGAA AAGQAAAAGO 2040 

TCAGTGCAOT CAGGAGAQAC AG Q TCCOGAA AGAG6CCCTC TG6ATCCAGT TAAAGTAACA 2100 

TGCCTCCOAO GTACTGOVTC CTT0G6CTCA OTGTCAOCftT CTOTGATCTC ATTTCAOOGC 2160 

5 ATTG6CTGT0 GCTCTCCCOG TAOUVGTGTT CAGCCTTCTO TATTTTQA 2208 

Seq ID NO I C187 DNA Sequence 
Nucleic Acid Accession #: AB037745.1 
Coding sequence: 26.. 1744 

1 11 21 31 41 51 

1 I I t I I 

ATGGTGGAAC A06CTGCCCA CAAACATGGA AACGA COGTT CTCAGTGGGA TCAACTTOGA 60 

GTAC3VA6GQC AT6ACAGGCT GGGAGOTGQC TGQTQATCAC ATTTACACAG CTGCTGGAGC 120 

15 CTCAQACRAT GACTTCATGA TTCTCACTCT GGTT0T6CCA GOATTTRGAC CTCCXSCAGTC XBO 

GGTGATGGCA GACACA6AGA ATAAAQAGGT GGGCAGAATC ACATTTGTCT TTQAGACCCT 240 

CTGTTCTGTG AACTQTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 300 

TGT6GAGAC36 TOGAAAGGTT CCAAAGGCAA ACAGTCCTAT ACCTACATCA TTGAGOAOAA 360 

CACTACCAOQ AOCTTCACCT QGGCCTTCCA GAGQAGCACT TTTCATGAOO CAAGCAOGAA 420 

20 GTACACCAAT GA06TT6CX3V AGATCTACTC CATCAAT6TC ACCAATOTTA TGAATGG06T 4 BO 

GGOCTCCTAC TGCCGTCCXTT GTGCCCTAGA AGCCTCTQAT GTGOGCTCCT CCTGCACCTC 540 

TTCTCCTGCT GGTTACTATA TTGACOGAGA TTCAGGAACC TGCCACTCCT GCCCCCCTAA 600 

CACAATTCTG AAAGCCCACC AaCCTTATGG TGTCCAC3GCC TGTGTGCCCT OTGGTCCAGG 660 

GACCAAOAAC AACAAGATCC ACTCTCTGTG CTACAATGAT T6CACCTTC7 CAOGCAACAC 720 

25 TCCAACCAGG ACTTTCAACT ACAACTTCTC CGCTTTGGCA AACACOGTCA CTCTTOCTGO 780 

AGQGCCAAGC TTCACTTCCA AAGGOTTQAA ATACTTCCAT CACTTTACCC TCAGTCTCTG 840 

TGGAAACCAG GGTAGGAAAA TGTCTGTGTG CACCGACAAT GTCACTQACC TCOGQATTCC 900 

T6AGGGT6A0 TCAGGQTTCT CCAAATCTAT CACAQCCTAC GTCTQCCAGG CAGTCATCAT 960 

CCCCCCAQAQ GTQACAGGCT ACAAGGCC366 GGTTTCCTCA CAQCCTGTCA GCCTTGCTGA 1020 

30 TOQACTTATT GGGGTOACAA CAGATATGAC TCTGGATGGA ATCACCTCCC CA6CT6AACT 1080 

TTTCCACCTG GAGTCCTTGG GAATACXX3QA CGTGATCTTC TTTTATAGGT CCAATOATGT 1140 

GACCCAGTCC TGCAGTTCTG GGAGATCAAC CACCATCCGC OTCAOOTGCA GTCCACAGAA 1200 

AACTGTCCCT OOAAGTTTGC TQCTGCCAGG AACGTQCTCA GATGGGACCT GT6ATGGCTG 1260 

CAACTTCCAC TTCCTGTGGG AGAGCGCGGC TGCTTGCCCG CTCTGCTCAG TGGCXX3ACTA 1320 

35 CCATGCTATC GTCAGCAGCT GTGTGGCTGO GATCCAGAAG ACTACTTA06 T6TGG06AGA 1380 

ACCCAAGCTA T6CTCTG6TG GCATTTCTCT GCCTGAGCAG AGAGTCACCA TCTGCAAAAC 1440 

CATAGATTTC TGGCTGAAA6 TGGGCATCTC TGCAGGCACC TQTACTGCCA TCCTGCTCAC 1500 

OGTCTTGACC TGCTACTTTT GGAAAAAGAA TCAAAAACTA QAQTACAAGT ACTCCAAGCT 1560 

GGTGATQAAT GCTACTCTCA AGGACTGTGA CCTGCCAGCA OCTQACAGCT GCX5CCATCAT 1620 

40 QGAA00CX3AG GATGTAGAGG AOGACCTCAT CTTTACCAGC AAQAAGTCAC TCTTTGGGAA 1680 

GATCAAATCA TTTACXTTCCA AGCAGCCAGC TCCTGTCACC ATCTCTCTTT CAGAGGACTC 1740 

CTGATGGATT TGACTCAGTG CCGCTQAAQA CATCCTCAGQ AGQCCCAGAC ATGGACCTGT 1800 

GAGAGGCACT GCCTGCCTCA CCTGCCTCCT CACCTTGCAT AGCACCTTTG CAAGCCTGCG 1860 

GCGATTTGGG TGCCAGCATC CTGCAACACC CACTGCTGGA AATCTCTTCA TTGTGGCCTT 1920 

45 ATGAOATGTT TGAATTTCAQ ATCTTTTTTT ATAQAGTACC CAAACCCTCC TTTCTGCTTG 1980 

CCTCAAACCT GCCAAATATA CCCACACTTT GTTTGTAAAT TATQCCCTTG CTTGTATCTT 2040 

OTTTCCCAAA ATQGCOCATC 06CCAGAGCC ATAOCTTCGT CTGCTCATAA TTCTTATAGC 2100 

TTTGGAATQA AAATATTTCT ATCTTCTTAA GTATA6AAAC TATTTCCTCT GTCCTCTAAC 2160 

^ TTAAGGGCAG AAACAGCTGG GAGTTTTCCT CGCATGCCCT CAGCTCATGA TCTCTTCAGG 2220 

50 AGAGAGGCTG GGTGAGGAG6 GTGTCGGGGT TCCCTGGTGG ATAATCTTCA TAGCAGCCTG 2280 

OATCCATTTC CCCTGGATAA CCAGCTCAAA GGGAGTGAAA ATGGXAGTCT GAGGGCAAGG 2340 

GGAGCAAGGC CTGGGTAAGA AAAGGCTTGA AAAGCATAAA AA6AGGCCQG G06CGGTGGC 2400 

TCAOGCCTGT AATCCCAGCA CTTTGGGA6G CCGAGGCGGG CAQATCATGA GGTOSGGAGA 2460 

TTGAGACCAT CCTGOCTAAC ACGGT6AAGC CCCGTCTCTA CTGGAAATAC AAAAAATTAG 2520 

55 CG666CGTGG TGGGGGGTGC CTGTGGTCCC A6CTACT06G GAGGCTGAGG OGGGAGAATA 2580 

60QTGG60CT GGAAGG0G6A GCTTGGAOTG A6CC6A0ATC GCGCCACTGC ACTCCATCCA 2640 

GCCTGGGTGA CAGAGTGAGA CTCTGOCTCA AAAAAAAAAA AAAAAAAGAA AA6CACAAA0 2700 

AGA6GCAACA AGGAATGTTT TTGTTTTTQA GACAGGCTCT CACTCTOTCA CCTAGGCTGG 2760 

AGTGCAGTGG OQTAATCACT GTTCAGTGCA GCCTCAAGCT CTTGGGCTCA GGCTATCCTC 2820 

60 CCATCTGAGC CTCTCaUlOTA GCTGGGACTA OaAOTOTOCA CCACCAGGCT CACTAATTTT 2880 

TGTGTTTTTT GTAGACAOOG GGTTXCACOG TGTTGCCCAG GCTGGTCTCC AACTCCTGGQ 2940 

CTCAAGTQAT CT G T C CGCCT OGGCCTCCCA AACTGCTGGO ATTACAOGCA TAAGCCACTG 3000 

C3VCTCAGCCT TTTATTTGTT TTTTAAACCA CX3TAGCTCAT TGCCTTCTCT TAAGTAAATG 3060 

ATAGATATTC TCACTGAAGC CAAAGGAATA AGTTCATCAA GAAAATGOCC AAAGCCCTGG 3120 

65 TGQATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA TCACTCTATG ACACTGAAAA 3180 

GAACCAGGTA AGCCCCAAAC CCAGATGTTC CAGCCTTATC CTCXATTGGO TTTACCCACA 3240 
GACATAOCAA ACCCTOTCAa TGAOGAAAAT TCCCCATGCT TGAGTGCCGC 06TCCTAGAA 3300 
GTTTGGGCCA TATTATGGAA CAGGGGTCTC TTATTTGAAA AGAGCACAAG GAGGCCAAGA 3360 
TTTTAATGGQ GCACTTTAGG GGATACAGCC CACAATGGCA TGGGCCTGAQ GTGGC06TGA 3420 
70 TGTGTGCTTC TAAGCTTAAC GCATCTGCTC AGGCACAGAA TAAAOGTCTA GGCTGGCCAA 3480 
AAAAGGAACT GAAT OCCAG Q CCCA TAOOCX AGCACCAOAA TCA AACCAG T CTTCAAGQAA 3540 
GGAAGGCTAG GAGAOTTTAA CAAGATTTTC ACTGGGCCCA GCAT66TGGC TCACACCTGT 3600 
AATCCCAAGG CAGAATGGTG GCTTGAGCTC AGGAGTTCAA GACCAGCCTG GGCAACACAG 3660 
TQAGACCCTG TCTCTAAAAA ATTTAAAAAT AAACAAOQTQ TTCACXIAAGC TGQGATACTT 3720 
75 CTCACTATTA AGCCCCTATC TTTCTCTTTT TTTCATTCTC AATTGCTTTG TGTGATAAAA 3780 
AACTAAAGAG ACTTCTO O TC C3UVmCTGG CSU^TCCCT TCTGAAAGGT GAGTAGAGTG 3840 
GbTGiClTCT ATGCCCATTT TOCCCAATTT TACACAAACT ATTATCAATG AACTTTTAAG 3900 
TACCTAQAAT GGGTAAAACC AGAGC3U16AC rTTAAATTAC LTi ' dlVri"! ' CTTCTACTYX3 3960 
CAGTTCTGCC TCCATCACTA TCAGGCTAGG GTGACCTTCC CTTGOTCAAG CXXCAATTGC 4020 
80 CCATGATTTO TGCCTOTQCC CTTTCTCCAQ TGACCATTTO CTOACCAGAT GGTAQATATA 4080 
GAAAGGG6AT GQCATTTGCA AOTGACTAGT CTGCCACAAA ATGCTCATCT GATTAGCCAC 4140 
TGCTGOCCTG GCAATGGCTT TGTAAGAGTC AATGAGAACT AGAGCCAGGC TGTGOTCCCT 4200 
GGCCATCAAC J^OTGTTGGTG A0GGCAG66A OTCCCmGG TTTAATAAAT CCAGTTTTTC 4260 
TTTOGGTATC CAAATTCTCC CCTCCTTTTG TAGGAGTCAG GCTCTCAGAA OCTOTGTCCA 4320 
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10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



TOTTGOAACT 
TTCTGTAGCC 
CAAAGGCCAC 
TTGACTTCTT 
ACTACCCCTT 
GTTAGQCTAC 
ACAGCTTTAT 
ATGGTTTTCA 
TTCTTTTCCT 
CCATGATCTT 
OCTCCCCAGT 
TAGTA6AGAT 
ATCCACCCAC 
CTACGTGTTC 
TGTCCAAGCT 
TTGAGTCCTG 
ATACAAATBA 
GTGCXrAACAA 
AACAGAAACT 
CTGGTTG6AT 
TATTGCATGT 



tccxxxagtg 
tcagcaatac 
agacagccx:t 

TATTCTGACT 
ACCGTGCTGA 
TGGGGTCACC 
TQAOATATAA 
GCAAACTCAC 
' iiilTl ' Tiri ' 
GGCTCACTGC 
AGCT6GGATT 
6GGGTTTCAC 
CTOGGCCTCC 
AATTTTCTAT 
QTTGTTGGTA 
GAACACSCAO 
A6ATTGTGGT 
AACTTTTTTT 
TTGTAATAGA 
TTCCTTTTTA 
CACTTTTTTT 



TGGATOCAGA 
TTGGGCACCT 
TAGACTATTC 
CCACTGATTT 
CTTCTGCAGO 
TGTTGCTCCC 
TTCACATATT 
AGAGTTQTCC 
GAGACAGAGT 
AGCCTCCCCA 
ACAGGCA TGC 
GGTGTTGGCC 
CAAAGT6CTG 
GAACAAAGGC 
QAGCKSAOTAT 
TAACTOTCAT 
GTATTCAAOC 
TGTCTQACTA 
TTTTTTCAGC 
CCCTGTAATC 
TTTTTGTAAA 



TACGCAGCTC 
GCTGTCTCAC 
CGGAAACAGT 
TAGCCATAAT 
TCTGCOCTGT 
CTAGCAA6TT 
ATACAATTCA 
GCCCACTTGA 
CAGCTTTGTC 
TCCT6GGTTC 
6CCACCA0GC 
AGGCTG6TCT 
GGATTGCACG 
TTTAGTCCTT 
GATAAAATGT 
GCTATAGTCA 
ACFTAGGGTTT 
CATTAAAGAT 
TTTGTQAAAT 
CAA60GTTAA 
A7AAAAACAT 



CTGAGCTCCA 
TGAATAGCTT 
AGGAAAAATT 
ACTTTAAGGA 
6ACCTGTCA0 
AGGCATGTCA 
CCTTTAAAAC 
GAGCAAACAC 
GCCCAGGCTG 
AAGTGATCCT 
CTAGCTAATT 
CAAACTCCTG 
TGTGAGCX»C 
GACCCAGGSC 
TTAAATGTCA 
TCATCTOTAT 
TTGCTTTTGT 
AAGACTGACT 
CGAATTTTTT 
TAGTTTGTTA 
AOCTTAC 



GCXTTAAAOTC 
TCTTTTGTQA 
ACATATGTCT 
GCTTACTTTTT 
GAACTCCTGA 
TATATTTTTA 
ATAOGATTCA 
ATGTTCAA7T 
QAGTQCAGTO 
TCTGCTTCAQ 

GACTCAAGTG 

TAAAOTGOTC 
TTTGGTTACC 
TTGOCTGGGA 
TTTTGTTTTA 
ATATTTATAC 
TTCATCAGGG 
GAAGATGGGT 



4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5X00 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5567 



Seq ID N0> ClSe DNA Sequence 

Nucleic Acid Accession #t NM_014324.l 

Coding sequence: 89.. 1237 



41 



51 



1 11 21 31 

I I.I I i I 

GGOGCCQGQA TTGGGAGGGC TTCTTGCAGG CTGCTGGGCT GGGGCTAAGQ GCTGCTCAGT 
TTCCTTCAGC GGGGCACTGG GAAGCGGCAT GGCACTGCAO GGCATCTOQQ TCGTGGAGCT 
GTOCGQCCT G 6CCC0GGQCC GTNTCTGTGC TATGGTCCTQ GCTGACTTOQ GG0G6GSIGT 
GGTACGGGT6 GACGG6CCCG GCTCCOGCTA OQAOGTGAGC C6CTTGQGCC GGGGCAAGOO 
CTCGCTAGTG CTGGACCTGA AGCAOCCX3CX3 GGAGCCGCCT GCTGCGGCGT CTOTGCAAGC 
GGTCGQATGT GCTGCTGGAG CCCTTCCGCC GOGGTGTCAT GQA6AAACTC CAGCTGGGCC 
CAGAGATTCT GCAGCGGQAA AATCCAAGGC TTATTTATGC CAOGCTGAGT GGATTTGGCX: 
AOTTCAOGAA AGCTTCXX3GC GGTTAGCTGO CCA06ATATC AACTATTTGG CTTTGTCAGG 
TGTTCTCTCA AAAATTGQCA QAAGTGGTGA QAATCOOTAT GOCCOSCTGA ATCTQg TGGC 
TGACTTTGCT GGTGGTaQCC TTATGTQTGC ACTGGGCATT ATAATGGCTC TTTTTGACOO 
CACACGCACT GACAAGGGTC AGGTCATTQA TGCAAATATG GTGGAAGQAA CAGCATATTT 
AAGTTCTTTT CTGTGGAAAA CTCAGAAATC OAGTCTGTGO GAAGCACCTC QAGGACA6AA 
CATQTTGGAT G6TQQAGCAC CTTTCTATAC 6ACTTACAG6 ACAGCAGATG G6GAATTCAT 
GGCTGTTGGA 6CAATAGAAC CCCAGTTCTA 06AGCTGCTG ATCAAA08AC TTGGACTAAA 
GTCTGATGAA CTTCCCAATC AGATGAGCAC GGAT6ATTGQ CCAGAAATQA AGAAGAAGTT 
T6CAGATGTA TTTGCAAAGA AGACX5AAGGC AGAGTGGTOT CAAATCTTTO ACGGCACAQA 
TOCCTOTGTQ ACTCCGGTTC TGACTTTTOA GGAGGTTGTT CATCATGATC ACAACAAGGA 
ACGG66CT06 TTTATCAOCA OTGAGGAGCA QQAC6TGAGC CCCOGCCTTQ CACCTCTGCT 
GTTAAACACC CCAGCCATCC CTTCTT(XAA AGGGGATCCT TTCATAGGAG AACACACTGA 
GGAGATACTT GAAQAATTTG GATTCAGCCX3 AGAAGAGATT TATCAGCTTA ACTCAGATAA 
AATCATPGAA AGTAATAAGO TAAAAGCTAG TCTCTAACTT CCAGGCCCAC GGCTCAAGTG 
AATTTGAATA CTGCATTTAC AGTGTAGAGT AACACATAAC ATTGTAT6CA TGGAAACATG 
OAOGAACAQT ATTACAGTGT GCTAOCACTC TAATCAAOAA A ftQAATT ACA GACTCTQATT 
CTACAOTOAT GATTGAATTC TAAAAATGGT TATCATTA60 GCTTTTQATP TATAA AACTT 
TGGGTACTTA TACTAAATTA TGGTAGTTAT TCTGCCTTCC AGTTTGCTTG ATATATTTGT 
TQATATTAAG ATTCTTGACT TATATTTTOA ATGGOTTCTA GTQAAAAAGG AATGATATAT 
TCTTGAAGAC ATOGATATAC ATTTTATTTAC ACTCTTGATT CTACAATGTA GAAAATGAGG 
AAATGCCACA AATTGTATGG 7GATAAAAGT CAOQTGAAAC AOAGTGATTG GTTGCATOCA 
GGCCTTTTGT CTTGQTGTTC ATGATCTCCC TCTAAGCACA TTCCAAACTT TAGCAACAGT 
TATCACACTT TGTAATTTGC AAAGAAAAGT TTCACCTOTA TTGAATCAGA ATGCCTTCAA 
CTGAAAAAAA CATATCCAAA ATAATGAGGA AATGTGTTGO CTCACTACGT AGAGTCCAGA 
GGGACAGTCA GTTTTAGGGT TGCCTGTATC C AGTAA CTOG GGOCC TGTTT CCOCX3TG6GT 
CrCTOGOCTG TCAOCTTTOC TTTCTCCATG TGTTTGATTT CTCCTCAGGC TGQTA6CAA0 
TTCTGGATCT TATACXXAAC ACACAGCAAC ATOCAGAAAT AAAGATCTCA GGAOOCCCCA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAA 

Seq ID NOt CI 8 9 DNA Sequence 

Nucleic Acid Accession «t XM_091332.1 

Coding sequence x 1..1401 



I 

AT6CAAAGGT 
CCACAAACAO 
CCCCTGTTCT 
TGGGCTGAGG 
GCCTCCATCC 
CC0G6CATGC 
GAAT6GACTG 
GGOGTCCAOG 
GAGCAGCTAC 
TATCTCCCT6 
AAGAGCCTGA 
CAAAAG6CAA 
GGGTCATGGA 
CAGGAAOQCC 



11 
I 

OGACACTGTO 
ACATCAGTAT 
GGATGGAGTT 
CCGACCTCTA 
ACAGCTGGGA 
CAGCXGACQT 
ATGGCTCATC 
CGGACCCAGA 
AGGCCCCAGA 
AGGATTTCCC 
AACCATCCAA 
AGACCC6ACA 
AAGAAAOAAT 
GGCTOOOGTO 



21 
I 

GGCTQCAGCC 
CAOTCCAGCC 
CAAAGGCCAC 
CTOTTCTGAG 
GGAGAATGTC 
CTGGACA6QC 
CTATGACTAC 
AGAAGA6QAC 
QCCCCAOTTA 
AGCTOAGOCC 
GAGTCATCTT 
TAGGAGCCTG 
OAATGCAGAC 
CAGGGAGOGC 



31 

I 

TTCCT6AC0C 
CTSCCAGAOC 
T6CTAT0GAT 
TTCTCTQTGQ 
TTTGTATATG 
CTTCATGATC 
AGCTACTGGG 
TGCGTGCAGA 
CCCTTATCAA 
AA6CTCATGQ 
ATGOAGGCAC 
OGCGGrOQTCT 
TACGGGGQAA 
C36CCTGCGGG 



41 
I 

TCCACTCTQC 
TGCCCCTGCC 
TCTTCCCTCT 
GCA6GAAGTC 
ACCTGGTGAA 
ACAOACAOGA 
ATGGCAGCCA 
TATGGTACAG 
TCTCAGAGGC 
ACCAGTCCTG 
CCACTCCA6T 
GGTGGCCATC 
GAAAGOGATC 
CT6CTT0G00 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1060 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

206B 



51 
I 

ACAG6CCTTT 
TTCCCTGTGC 
CAATAAGACC 
CGCCAAGCTG 
CAGCTGTGTT 
AOGQCAQTTT 
GCCAGATGAT 
GCCTACCAGT 
CACAGATGTC 
GGTQTCCAGO 
GGOCAAGCAC 
AGQTAA(X3CT 
GGCCC0GAO6 
CCA6GGT0QA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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5 

10 

15 

20 

25 

30 

35 

40 
45 
50 
55 
60 
65 
70 
75 
80 



CCCGAGCaCC 
GGAGGGGTGT 
CAGCGGGGTG 
TGGAGGGGGC 
CTTCCGGGGA 
QAA6AAAAGA 
TGTGACCAAO 
GtSGCCTTACC 
CTACAAAGTG 
ATTGCTCGA6 



AGCGCAAQCA 
CCCCAATGCG 
CGGCGCCX3GA 
AGC6GCTCCA 
ATGCACA086 
ATAATCXKaiC 
ATAGCAAGGA 
GGTTAGTTAA 
GAGGGGAAC3T 
AACCTCCATA 



GCXXKVVACAG 
GGGCGCCCAA 
OTGCXSGGGAO 
0CCCCA6A0C 
CCTGGCGGGC 
CCGTTTCTCA 
GCAGTCTTTA 
GAAAAAAAGA 
TTGTCTGGAT 
A 



GAGCXKXAGG 
GCGTGGCAGC 
AACCACCA6G 
GCCXK3GCTCT 
6CCTT0QTGC 
GGTGCTTACT 
AGGCOACAOG 
G6ACCTGTT6 
TTTCCTGTAG 



AGAGAGGCT6 
ACGGGCTGGG 
C6CCGGAATT 
GTCACTTTGC 
AGCCCGCCCT 
TCACCATGTC 
GCAGAGAGGC 

ccTGTCCtrrc 

AACTGAOGGC 



G6AAGAACT0 
A6CGGGGAGC 
G6GGAGCACG 
ATTAASAAAO 
GCAGGT6CA0 
OGATCX36AOQ 
AGAAAAAGAT 
TAGCTTTGAA 
AGGGACCTG6 



900 

960 

1020 

lOBO 

1140 

1300 

1260 

1320 

1380 

1401 



Seq ID HOt C190 DNA Sequence 

Nucleic Acid Accession #i XM_054B69.2 

Coding sequence: 26. .2902 

1 11 21 31 41 51 

I I I 1 I I 

TAGAOGCGGA GCCCAAGGAG GTAAAATGCA CACTT6CTGC CCCCCAQTAA CTTTGGAACA 60 

GQACXrrrCAC AGAAAAATGC ATAGCTGGAT GCTGCAQACT CTA0CX3TTTG CTGTAAOITC 120 

TCTOQTCCTT TCXSTGTGCAa TWiACCATCQA TTATTATGGO GAAATCTGTO ACAATQCATG 180 

TCCTTGTGA6 GAAAAGQACG GCATTTTAAC TGTGAQCTGT QAAAACOGGG GGATCATCAG 240 

TCTCTCTGAA ATTAOCCCTC CCOGTTTCCC AATCTACCAC CTCTTGTTGT COGGAAACCT 300 

TTTGAACCGT CTCTATCCCA ATGAGTTTGT C3ATTACACT GGGGCTTCAA TTTTGCATCT 360 

AGGTAGCAAT GTTATCCAGa ACATTGAQAC CX3GGGCTTTC CATGGGCTAC GGGGTTTOAG 420 

QAGATTOCAT CTAAACAATA ATAAACTGGA ACTTCTGCGA GAT6ATACCT TCXTTTGOCTT 480 

G6AGAACCT6 GAGTAGCTAC AGGTCQATTA CAACTACATC AGCGTCATT6 AACCCAATGC 540 

TTTTGGGAAA CTOCATTTQT TGCAGGTGCT TATCCTCA^T GACAATCrTT TGTCCAGTTT 600 

ACX:CAACAAT CTTTTCCGTT TTOTGCCCTT AACGGACTTQ QACCTCOGGG GGAACOSGCT 660 

GAAACTTCTO CCCTAOGTGG GGCTCTTGCA GCACATGGAT AAAGTTQTG6 AGCTACAGCT 720 

GGAGGAAAAC CCTTGGAATT GTTCTTGTQA GCIGATCTCT CTAAAGGATT GGTTGGACAQ 780 

CATCTCCTAT TCAGCCCTQG TGGGGGATGT AGTTTGTGAG ACCCCCTTCC GCTTACACGG 840 

AAGGGACTTG GAC3SAGGTAT CCAAGCAGGA ACTTTGCCCA AGQAQACTTA TTTCTGACTA 900 

OGAGATGAGG CGGCA6ACX5C CTTTGAGCAC CACGGGGTAT TTACACACCA CCCXX5G0QTC 960 

AGTGAATTCT GTGGCCACTT CTTCCTCTGC TGTTTACAAA CCCCCTTTGA AGCCCCCTAA 1020 

GGGGACTCGC CAACCCAACA AGCCCAGGGT GCX3CCCXACC TCTCGGCAGC CCTCTAAGGA 1080 

CTTOGGCTAC AGCAACTATQ GCCCCAGCAT CX5CCTATCAG ACCAAATCCC CGGTGCCTTT 1140 

GGRGTGTOCC ACOSOQTGCT CTTGCAACCT GCAOATCTCT GATCTGGGCC TCAACGTAAA 1200 

CTOCCAOOAG CGAAAGATCO AGAGCATCGC TGAACTGCAG CCCAAGCCCT ACAATCCCAA 1260 

GAAAATGTAT CTGACAQAGA ACTACATCGC TGTC3GTGCGC AGOACAOACT TCCTQGAGOC 1320 

CAOGGGGCTG GACCTCCTOC ACXrCGGGOAA TAACX:GCATC TCQATGATCC AGGACCX3CGC 1380 

TTTCGQGGMT CTCA0C3U^ TGSAOGOSCCT CTACCTGAA7 GGCAACAGGA TCGAGAGGCT 1440 

6AGCCCGGAG TTATTCTATO OCCTQCAOAO CCTGCRGTAT CTCTTCCTCC AGTACAATCT 1500 

CATCCGCGAG ATTCAGTCTG GAACTTTTGA CCCGGTCCC3V AACCTCCAGC TGCTATTCTT 1560 

GAATAACAAC CTCCTGCAGG CX:ATGCCCTC AGGCGTCTTC TCTGQCTTGA CCCTCCTCAG 1620 

GCTAAACCTG AGGAGTAACC ACTTCACCTC CTTGCCAGTG AGTGGAGTTT TGGACCA6CT 1680 

GAAGTCACTC ATCCAAATCX3 ACCTGCATGA CAATCCTTGO GATTQTACCT GT0ACATT6T 1740 

GGGCATGAAG CTGTGGQTGO A6CA6CTCAA A0TGGGCQTC CTA6TGGA06 AGGTGATCTG 1800 

TAAGGOGCCC AAAAAATTC3G CTGAGACOSA CATGCGCTCC ATTAAGTCX3G AGCTGCTGTG 1860 

CCCTGACTAT TCAQATQTAG TAOTTTCCAC GCCCACACCC TCCTCTATCC AGGTCCCTGC 1920 

QAGOACXAaC GCCGTQACTC CTGCGGTCOG GTTQAATAGC ACOGGGGCCC CCGOGAGCTT 1980 

GGGGGCAOGC GGAGG66GGT 0GTG6GT8CC CTTQTCTQTO TTAATTCTCA GCCTCCTGCT 2040 

GOTTTTCATC ATGTCOGTCT TO Q TG G COGC OSGGCTCTTC GTGCTGGTCA TGAAGCGCAO 2100 

GAAGAAGAAC CAGAGCX3ACC ACACCAGCAC CAAC3VACTCC GACGTGAGCT CCTTTAACAT 2160 

GCA6TACA6C GTGTACGGCX3 GCGGOSGCQO CACX3GG06GC CACCCACACX3 CX5CACGTGCA 2220 

TCACCGCGGG CCCOCGCTGC CCAAGGTGAA 6A0GCCCG06 GGCCAOSTGT ATGAATACAT 2280 

CCXXXACCCA CTGGOCCACA TOTQCAAAAA CCOCATCTAC 06CTCCCGAG AGGGCAACTC 2340 

OOTAGAGGAT TAGAAAGAGC TGCAC9QAGCT CAAGGTC3VCC TACAGCAGCA ACCACCACCT 2400 

GCAGCAGCAO CA6CAGCGGC CGCGGCCACC 6CAGCAGCCA CASCAGCAGC CCCCGCCGCA 2460 

GCTGCAGCTG CA6CXXX3GGG AGGA6GAGA0 6G6GGAAA6C CACCACTTGC GGAGCCCOQC 2520 

CTACA6C6TC AGCACCATOO AGCCCOGGGA GQAGCTGCTG TCX3(XGGTGC AGGAOGCGGA 2580 

CCX3CTTTTAC AGQGGCATTT TAOAACCAGA CAAACACTGC TOCACCACCC CC6C0G6CAA 2640 

TAGCCTCCOG OAATATCGCA AATTCC0GT6 CAGCCCOOCT GCTTACACTT TCTCCCOCAA 2700 

CTATGACCTO AGAOGCCCCC ATCAQTATTT GCACCCGGGG GCAGGGGACA GCAGGCTAOQ 2760 

GGAACCXIGTG CTCTACAGCC CCCCGAGTGC TGTCTTTGTA OAACCCAACC G0AACX3AATA 2820 

TCTGQAGTTA AAAGCAAAAC TAAACX3TTQA GCCGGACTAC CTOGAAGTGC TGGAAAAACA 2880 

GACCAOGTTT AGCCAGTTCT AAAAGCAAAO AAACTCTCTT GGAOCTTTTO CATTT AAAAC 2940 

AAACAAGCAA GCAGACACAC ACAOTGAACA CATTTGATTA ATTGTGTTGT TTCAACGTTT 3000 

AGGGT6AAGT GCCTTGGCAC GGGATTTCTC AGCTTOGGTG GAAGATAGGA AAAGGGTGTO 3060 

CAATTTCCTT TAAAATTTAC ACGTGGGAAA CATTTGTQTA AACTGGGCAC ATCACTTTCT 3120 
CTTCTTGCGT QTOGGOCAGG TGTGGAGAAG GGCTTTAAGG AG6CCAATTT GCTGCGCGGG 3180 
TGA0CT6TGA AAGGTCAC3U5 TCATTTTTGT AGTaaTTGOA AOTGCTAAOA ArOGTGGATG 3240 
ATOGCAGAGC ATAOATTCTA CTCTTCCTCT ' m ' GC lTCCf CCCCCTCCCC CGCXXXTTGCC 3300 
CCACCTCTCT TTCTCCCCTT TTAAGCCATG GGTGGGTCTA ACTGGCTTTT QTGGAGAAAT 3360 
TAGCACACCC CAACTTTAAT AGQAAATTTG TTCTCTTTTT COGCCCCTCT CCTTCTCTCC 3420 
TCCCCTCCCC TCCCTTCTCA TTCCTTTTCr TT G TTTTTAA AGGATGTGTT TGTATOCATT 3480 
CTGGAOWnT GAATTAAAAA AAAAGTArTQ TGATCCTGTA AAGGATCACC ATAGATGTGG 3540 
ACAAATCATT AAAATTACAO AGCTATATGA TCCATAATTG ATTAGTCAAA ATAACTTATT 3600 
GAT6AAATAT ACAAATATTT TATTGTAOCA CCTATTTTTA TATGCACATT TAGCATTCCT 3660 
CTTTCCTTCA CTATTTAQCC TATGATTTTG CAGAGQTGTC ACACTGTATT AGGATCTGCA 3720 
TTTCTAAAAC TGACX?PGQTA TC3W3QAAGGC ATTTTCAATC ATTCAAAATO TGGAQAATTT 3780 
AATGGCTAAA TCTTTAAAAO CCAATGCAAC CCACCCAATT QAATCTGCAT TTTCTTTTAA 3840 
GAAAACAGAG CTGATT8TAT CCCAATGTAT TTTAAAAAAT AGGGCAATTG ATTGOGCCAT 3900 
TCOSAGAGAA TTGTTTGCAA GTTTTGQOTT TTATTAGAAA ATATTTGAAA OTATTTTTAT 3960 
TAATGAACCA AAATGACATG TTCATTTGAC TACTATTGTA GCGGATTTTC GATTGTTTAA 4020 
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CCAAACCCAG TTGCATTTGr ACAGATCCAC GTGTACTGGC AOCTCAGAAG ACCAAATCAT 40 BO 

GGACTGTACA AOTCTCTATA CAATGTCrTT ATCCCTGTGG GCAGCAAGCA ATQATGATAA 4140 

TQACAAACAO gATATCTGTA AGATGGOQCT ACTOTTQTTA CAOTCTCATA TGTATCCCAG 4200 

CACATGTAAT TTTTTAAATA QTTTCTGAAT AAACACTTGA TAACTATGTC <350 

Seq ID NOt C191 DNA Sequence 

Nucleic Acid Accession «: im_000793.2 

Coding sequence! 401.. 1222 

10 1 11 21 31 41 51 

I I I I I I 

GCCTGCAGAO AGAGGCACTT TGCACX:ACAG ACAGATA6CA AQAAGGGAAA GACAGAGAGT 60 

GAQAAAAAAO AGGAGTCAGT CXSCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAO 120 

AAGCACAQAA AGTGTGTGTA AAAOGGAGTA AAGAAAGAAA AAAAAAAAAC TACO CTTAAA 180 

IS GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 240 

CATA£3AGACA ATGAAAG6CT AAA6AAAATT TTAAAATCTC TGCCACA6TC TCATAGOTGC 300 

TTGGAAATGA AAOTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGAGG GTX5AAGGGGA 360 

ACCAGAGCGC ACAAGG6AAC TGACTCAGGA GGCAGAGAAG AT6GGCATCC TCAGOOTAGA 420 

CTTGCTGATC ACACTGCAAA TTCTGCCS^ TTTTTTCTCC AACTGCCTCT TCCTGGCTCT 480 

20 CTATGACTCXS GTCATTCTQC TCAAGCA06T GQTGCTGCTQ TTQAGCXXOT CCAAGTCCAC 540 

TGGOGQAGAO TGGC6QC6CA TGCTGACCTC AGAG6GACTQ CGCTGOQTCT GGAAGAGCTT 600 

CCTOCTCQAT GCCTACAAAC AQOTGAAATT OOGTGAGGAT GCCCCCAATT CCAGTGTGGT €60 

GCATGTCTCC AGTACAGAAG GAGGTQACAA CAGTGGCAAT GGTACCCAGG AQAAOATAGC 720 

TGAGGGAGCC ACATOCCACC TTCTTGACTT TGCCAOCCCT GAGCXKXTCAC TAGTGGTCAA 780 

25 CTTTGGCTCA GCCACTTGAC CTCCTTTCAC GAGCCAGCTG CCAGCCTTCC GCAAACTGGT 840 

GGAAGA6TTC TCCTCAGTGO CTQACTTCCT GCTGGTCTAC ATTQATGAGG CTCATCCATC 900 

AQATGOCTOO OCGATACCGO GGGACTCCTC TTTGTCTTTT GAGGTGAAGA AGCACXyvOAA 960 

CCAGGAAGAT 0GATGTGCA6 CAGCOCAGCA GCTTCTGGAG 06TTTCTCCT TGCXXJCCCCA 1020 

GTGCXXSAGTT GTGGCTGACC GCATGGACAA TAAOGCCAAC ATAQCTTACG GGGTAOCCTT 1080 

30 TGAACGTGTG T6CATTGTQC AGAGACA6AA AATTGCTTAT CTGGGAGGAA AGGOCCCCTT 1140 

CTCCTACAAC CTTCAAOAAO TCGOGCATTO 6CTGQA6AAG AATTTCAGCA AGAGATGAAA 1200 

GAAAACTAGA TTA6CTGGTT AAAGGTATGA TTATAAGAGA GCTTATTOTT TTAAAAAOTT 1260 

ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA ACAGAGCCCT ATTGG CTTAC 1320 

TGAAAGACAG GAGTTTATCT ATOGGAAGAA CATGAATCTC TAACAGCTCC ATACTTCITT 1380 

35 CACTACTCAA ATGGCATTGO GCTGAGTAAG TAACCATATC ACCTCTCTTC TTAGTAAAAA 1440 

GOOCTATGTO AAAAGATCCC AAGATGQAQA GGAAGAAAOG CTAATTCAQC ATGTGTTCAT 1500 

TCTGCATTGA GAAGQAACTG ATACRTCTGA TOCATOCTTT GAGACXSVaAA QAAAAGACTT 1560 

ACCTQAATAA TTACTACATT AG6GARGCTA CTGTCTACGT TAAGATAAAQ GGTATTGCCT 1620 

TGGCTCTATT TGGCATGGAT GGAGCCCAGT TGGAAAATTC CCAAATATTA CAACAAGTCC 1680 

40 TTQAACCCAO GCCATOTGGT TAGAOGTTGG TOTTAAGGTT AGACCTTATG TTAGAGTCAT 1740 

TTCTQATGTT CCAOCTTCTA OCCAtGTAQT GCTCTCAC3TC TTCATACCCC AGAAATTATT 1800 

GQTATATTTO TAGATACCOA GAATGATCCC TCAOTCTGAO AGQTTAGAAT GATCATCTGT 1860 

AATCTGAGGG TTAATTTCTA GGCAGGTGGA GAGAGTGGTA AAAAAGAAAT 6AAATTGACA 1920 

AGCTAGGAAA GAGGAGGCAG AAAGATTTGG AAAATTCACA QAGTTTCACC CTTAAGCTGT 1980 

45 A6A6AGTGGG TCACATTTGT TAGCCACX3GA AACATAGAAA CATACACAAG GCCAGAAAAA 2040 

GAAGAAGGAO CTCAACTAAA AGTGGCATA6 AGAAXACAGA TATAAAAACA ATATATTTGT 2100 

CATAIGCTCC TAGAGAGGAG AAAG6GGT6A TTGAAAGAAA AAAAAATACT TAAATATTTO 2160 

TAATTGTGAG GGGTTT C TTT TG6AAATAAT TACTTTTGAA CCATGTATGT GGTATGTATA 2220 

TTTTCAGTGG OTTAATTATA CCCCATGATA CXTTATTAAAQ GAAAACfCAGT GGGTCTGGTG 2280 

50 QTGCT6GTCT TTTCCTCCCC ATTCCTACAA TTTCTATGTG GCCCAAGTCA TTCCTAATCT 2340 

TGGTCTCTAT AGCAOTGTTC TCTCTGAATO CTQAaCIGAA OAAATTATAC GTACATACAC 2400 

ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCTQCTGOG GGAGGTAGGT 2460 

ACCATGGCXA TTCAGCACAG CCTTGATTTC AQGTGAGCTA TAGTGAAGAA 2520 

TAQOTGCAAA CAAACAAGCT TACTTCCATT GCAAAATAGA AGAAGAGGAA GTTAGAGATA 2580 

55 ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC CCCCGGTATA TCATGGAATT 2640 

TCCATTQACA TTTGAATTTG ^CTTGGATC TTCOCTTGOT CCCATTAGCT GAGGTTTAQT 2700 

AATCTAAAGT CCCTATAQTA TATGATTATA ATGCTATTTT AAAAAATATA TATA1AAAAT 2760 

ATTTTTTTCT TTTTAAAATA GACACTATAG TTTTACCCAT AAOTAATATT TAAAGATTAT 2820 

AOCrCCCAAA AGAATOGACC AACCACTTTC GTATCATAAT TTCTTTTTGG TAAATATGAQ 2880 
60 ACTATTATQA AATCATAGTA TATGATTOTA TTTAAAGGTA CAATCAAAGG ATCTTTTGTC 2940 
CATTCCATTA ATAACTQAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTT6AA 3000 
AATACATTCT TAAACTAQTT GTCTGAAATO AQAAAASAOT QAOAACTAGO TGTGCAAGAA 3060 
CCAAAOGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTOGTG TCACCAGCTG 3120 
GTATATTGTQ TAAATATTAA AGCrCOVTTG GGACTGATTT TTCAT0GC3A CATCAQCTTT 3180 
65 CTAATGTTCT AAATTCTATA AAAACCACCC ACAAAOAAAC AAAGCAAATT TCSVTTATCTA 3240 
ATGAGTTGCr G6AAAATCAT ATTGAGAATA ATTATTTCAG ATTCCXCAGT TGTTAACTTC 3300 
TACATTCAAO GOCTTATCTC TGCOCCCATT GATTTTTAAC CTCAAAATGG TGTGAGATTT 3360 
ACTGTGGAAC CCTAAAGCAG TAAAATAAAA AACCTGGTTG CAGCACATTC ACACTGTTGT 3420 
CCTTAAAATT CCCCTTTTTT CTCTATGTAC GATAAAGTAA CAQTATGTCA GATAAGCOGG 3480 
70 TGGGGGQATG AGATTAGGCT GAGGCAGTGC TAGTCAACTQ G6GGAAAAGG ATGATGGAAA 3540 
AATCACCCAO TTGTGCTATA TTTTTAAA6A AGGAGGTOGT TTATGTOTGC AGACAATTCT 3600 
CCCTGAGOTT AGGCCAATGG AGAAATGAAG GAGAOGAAGO AAACATA6AA AGACATGGQC 3660 
TATCAGGGAG GAAGATGTTC AATAOAACAT GC3VAGAATTT CTGGAAGAAA GGCTGTGGAA 3720 
GGGCCAATGO AGAAAATGAA TGQACAAAGC TCAGQAATCC CTACGCTATG TAGAATGTTC 3780 
75 TT GG TGTTAT CAGGGTTAAQ CCCTGTAATT ATGTAACCTA TTTATOGCAA CATG AATTTT 3840 
TATGATTTCT TGTGATGTAT TCmTATQA AATTAACAAG AACTCATTAT TTTGAGOTAG 3900 
AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATT6CCAA TACTCATOTQ 3960 
CGTTTCATGT GTTTTATAAG OTTTCTTCCT TTGAAGAATT GTAOTTCTTA GTCCCACAGG 4020 
GAAATOTGTA TCTATTTATA TATCATAOTA TAAATCTATG ATATATTTAT AT CATA TATA 4080 
80 AAAGTCTGAG TTCTCTTTCT TAGTCCCTAA TCATGrTTCT CCCATAOGCT GTG TTTA CAT 4140 
60A0CTAT0Q OTTTAOCCTT TTAAGCTTGA TTAaCTTGTC TATTATTGAA ATAGTTTCCA 4200 
AGAAATTTTA 6ATATTATCA TAACATCTGG OTCTACTCAA ACACTTATTO TTTGAAAGAC 4260 
TTATOTCTTO GACCTATCAA AAACTGACTT TATTTATTOC TTAOTGAAAA TACTAGTGGG 4320 
ATCAACAATG ATTTTCTTGA ATGGGCATGA ATGGAOATGC CC6CACAGTA ATGTAQAAAT 4380 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GTTTCATACA 
ACTTTGTATA 
CTCTGGTCCT 
CTTCCCAGT6 
CATCACAGTG 
G6GCTCTGAA 
ATAAAATTCT 
AGOGTGGOAT 
AAQAATTGAC 
GGACATTTGT 
TATGACCATT 
AAAACXriTTA 
TGG AGACTO C 
TAAATG6TAT 
TCACTCTAAA 
TACCAATCTC 
AGAGAGGATC 
GGGTTGGTCT 
GAAGGCTGCT 
ATQCCCTATG 
CAAGGTGAGT 
TGCAAGTCGG 
CAATATTTAC 
AAACCCTCCA 
AACCCACAGA 
CACTGGTCAG 
AGTACTTTAT 
TTAGTTTT6C 
GAAOAGGGAO 
AAGAGTCATC 

GTGAGAATGC 
GCTTTGGGAG 
TTATACAATT 
AAAAAAAAA 



GCTATTAAAA 
GCTAAGTGAC 
GTGTCTTCAC 
ATCTOTTCAQ 
GGAAGAATAO 
ATAAAAATTA 
AGGAGGGCAG 
TACAAGGGTO 
T6ATTTACA0 
TCCACCOGAC 
GAAAAAGGAA 
CTAGCATTTA 
AAOTAAGGCT 
GGCCAAAAOT 
AATAGAATCC 
TGCTTTGCAA 
TAGGATGGGA 
ACCAATCTG6 
GCAAOTCATT 
TQTATAGTAC 
CTOAACCAAT 
GGCTGGCTAC 
TGAQACTTC6 

c nxrixxxxc 

AGG66ATGGG 
CX3TGGTTTTT 
AACCAAAGCA 
CATCACATTG 
AGA6A0ATQT 
GAOCTCAGTT 
TGGCSUVGQAO 
ACX30GT6CAT 
ATTATCAGTA 
TGTTCTTGTA 



TGTAACTGAC 
AQTCACTTAA 
CTCATTTATA 
TTAA6TTCTT 
CCTATTGTCT 
TGAAATATGG 
GTTAGQAQAC 
TTCCTCAGGC 
GACTTCTCTT 
CTCTGACTGA 
AAATGTAGAC 
GAGCTTTTCA 
TTTAATTTTA 
CAGAGTTAAA 
AAACOCACTC 
TGGOCACAAT 
GAGCTAGAAA 
GAAOATTTQA 
GAGTGACTTT 
CAGAAGCAAG 
AGAAAGCAAA 
CCGTCTTAGO 
AAGACGCAGC 
TCCCCTCAAA 
AAATAAAGAA 
ATGTGTATTA 
ATTAAATGAT 
TCACCCAGAC 
GCCAOAOTTO 
AGTGGTTGGA 
AATAT6CGGG 
ATGCTACACA 
GAAA6AGTGT 
TTTTAATAAA 



CTCCTTACAG 
CTTACATQAC 
GCAGGTCTCC 
CTGOOQTTAA 
TTCATTTTGC 
TGAGGTCACA 
AGTTATGTAT 
ATGCCOCTAT 
TATQTCAATC 
TG0TTT66AA 
TCTOACTTCC 
GAACATCCCC 
GGAGGTTTTT 
ATATATATAQ 
TTCATATATO 
CTTGGTCATG 
GTTGCTAACT 
AAACAAACTT 
AG6ATGAGCA 
GTCTCAGACT 
CATGTGCAOA 
CAGCAACAGC 
AQATGTTTAA 
AAGOCAAGAG 
AATTCTCTCA 
GGATTGGGGG 
ATTGGGGTAG 
CTCACCTAGC 
ACCCAOTGIO 
TOTAGTCACA 
ACATGATGCT 
TATOTQCTTC 
TATCATATTQ 
CTTTGAATAA 



GCAGATTAGT 

TTTcrrrrrr 

TTGATTTTT6 
CCAGQAAGTG 
CTGAOTGTAT 
TGTTQGTGCT 
GGCCTTTOGG 
GGG0CCTATX3 
TTAAGAOQAT 
AATAACTTTA 
GTCCCACTGA 
ACTGTCATOT 

ri TTr rrrrr 

TtAQATTCCA 
CTTCCAGAAT 
TCCTQAGGCT 
GGGAAGAACA 
CTCGCAACTG 
AAACATTQGQ 
TAACAGACCC 
TATCCAAACA 
AGAGCTCCAG 
TGAAGTGACT 
GTAAACACAT 
AGACTTCTGC 
ATGTGAAGAA 
GGAATGTTGG 
CCCAAOTAAT 
OGG ATGAT AA 
TTAGTTTGCC 
AAGAGCCCTG 
TCAGTTGCAG 
6TGCTGA6TG 
AA6AATAAAA 



AACTGTTCCT 
CACATTGOGT 
GTAGTATCAA 
CTTATTCTCT 
TTTACTATTT 
GCCTTGCTGC 
GAAAATTCAA 
TGGAAGCAAG 
GGATGAATCT 
ATTAGGATCA 
AGGATTAATG 
GTCTCAOCAG 
TTTTTTCCCC 

6GGGCTTAAG 
CTCTAAGAAA 
AGGCCCIGAG 
AAGGAAQ6CT 
OCACTTCCTA 
AGCTCTOTTC 
AOACTGCTCA 
GGAGCTTATT 
ATTrrGGCTC 
AAATQAAAGA 
AGGCCGATGT 
ATAAGTATCC 
CCAGTTTTGT 
CGGGCGOCCC 
CTACTGACX3A 
TCTCCCCATC 
GGTAAATGT6 
AAAATGAACT 
CTATGTGTGC 
AAAAAAAAAA 



Seq ID NOt CI 92 DNA Sequence 
Nucleic Acid Accession 8: NM_006549.2 
Coding sequence: 824.. 3590 " 



GAOCCTGGGG 
CAACACAGAG 
AAAAAAAAAT 
AGGCCAAGGC 
AACCCCATCT 
CCAGCTACTC 
6ATCCGAGAT 
AAGAAAAAAA 
TGTGAGTGTC 
ACCCAOTTAA 
CrCSGATTCCC 
OGXAOGACGG 
TTATCCAGCT 
ATGGGAGTGC 
TAGCCAGCCC 
CAGCAGCGAA 
OCTGGQCATG 
GGGCTTGGGG 
OGGGTCCCAG 
GGGTGGGCTG 
GCGCTACTCA 
GGAGTCTCAC 
CCTQAAOQAT 
TGACAATACC 
CTTTCCACGT 
CAGGGGCCCC 
CAATGTQGTG 
GTTOGAACIG 
AGACCAGGCC 
GAAGATCATC 
CAAGATOGCT 
CACGGTQG6C 
CTCTGGGAAG 
CCAGTGCCCA 
CCTGGAA'fTT 
6CTGGACAAG 
CAOGAGGCAT 
GACTGAAGAO 
GGTGAAGACC 
GGAACGCTCA 
GTCCCTGTCT 
COCCCXSTGGG 



11 
I 

AGGTOGAGGG 
AGACCCTGTC 
GG6AQTGGGC 
GGGTGGATCA 
CTACGAAAAA 
GGGAGACTGA 
GQOQTCACXG 
AAAAAAAAAA 
CCTAOGGCCT 
CTAGAGGGCr 
TTTTGGCTQT 
TTCAGGGAGC 
GG06GAT0CA 
CCCAGTGTGC 
AGCAGCAACC 
AGCCAGAAGC 
QAGTGCTTCA 
OGGGACOGGC 
GCCCX3GCCCC 
GCAGCGGGTG 
CCCGTGA6CT 
CAOQTCTOCA 
6AAATTGGAA 
TACTATQCAA 
OGCCCTCCAC 
ATTQAGCAG6 
AAGCTGGTGG 
6TCAACCAAG 
OGTTTCTACT 
CACCGTGACA 
GACTTTGGTG 
AGGOCOGCCT 
GCCTTGQATG 
TTCATGGAOQ 
CCAGACCAQC 
AACCCOGAGT 
GGGGGGGAGC 
QA06TCGAGA 
ATGATAOGTA 
CTOTCAGOQC 
6A6CTCAAG6 
GOAGGAOGAA 



21 
I 

TGCAGOGAGC 
TCAAAACAAA 
OGGGOGCGGT 
OGAGGTCAQG 
TACAAAAATT 
GGCAGAGAAC 
CACTCC3U3CX3 
AAAAAGGGAG 
AOAAATACCA 
TTGAACCTTT 
TTQGCAGGGT 
CCTCCAGGTC 
A06CACACTT 
TGOATGAAGC 
GGGCCGCCCC 
CCTGTGA6GC 
TTGTGGTGAC 
CCCTGGAGGC 
ACCTCTCOGG 
GCAGCCTGGA 
CCCCGCAGTC 
TCAOGGGTAT 
AG6GCTCCTA 
TGAAGGTOCT 
CCCGAGGCAC 
TGTACCAOGA 
A6GTCCTG6A 
G6CCC6TGAT 
TCCAGGATCT 
TCAAACCTTC 
TGAOCAATGA 
TCATGGCACC 
TTTGGGCCAT 
AGOOQATCAT 
CCGACATAGC 
OGAGGATCOT 
0CTTG008TC 
ACTCAOTOIA 
AAOGCTCCTT 
CTGQAAACTT 
AA6CAAGGCA 
GTGCTCTTGT 



31 

1 

CGTGATCGTG 
CAAACAAACA 
GACTCACACC 
AATTCAAGAT 
AGCCAAGTAT 
TGCTTGAACC 
TGGG06ACA0 
T06GGGTG6A 
GA6AAGCACA 
TATTAACTTG 
CAQTGA8ACA 
TTOGTTTCTC 
AAGOCTGCAO 
TGGC6CATGC 
CCAGGATGAG 
OCTGCGGGGC 
OCSAGTGTGAG 
C9GATGGCCAA 
TG6CAAGCTG 
CATQAACGGA 
CTGGCCTGGG 
GCAGQACTGT 
TGGTGTOGTC 
GTGCAAAAAG 
COGGCCAGCT 
AATTGCCATC 
TQAOOOCAAT 
GGAAGTGCCC 
GATCAAAGGC 
CAACCTCCTG 
ATTCAA6G6C 
CGAOTOGCTC 
66GTGT6ACA 
GT6TTTACAC 
TGAGGACTTG 
GGTGCOGGAA 
GGAGGATQAG 
ACACATTCCC 
TGGGAACCCA 
GCrCACCAAA 
GCQAAGACAA 
GAGAG6CAGT 



41 

I 

CTACTGCACT 
AACAAACAAA 
TGTAATCCCA 
TAGCCTGGAC 
GGTGGCCGGC 
T6GGAGGCAG 
A60SAGACTC 
GCTCTCATTG 
TOGGAAOQGG 
GAGGTTGACT 
TCCCCTGGOT 
CTCTTCCCOO 
CAAAGTGGCT 
ACCATGTCAT 
CTGGGGGGCA 
CTCTCATCCT 
OOQGGCTGTG 
GAGOTCCCCC 
TCTCTGCAAG 
CGCTGCATCT 
CTGCCCOGGC 
OTGGAGCTQA 
AACTTGGCCT 
AA6CTGATCC 
CCTGOAGGCT 
CTCAASAAGC 
GAOGACCATC 
ACGCTCAAAC 
ATCGAGTACT 
GT0GGAGAA6 
AGTGAOSOGC 
TCTGAGACCC 
CTATACFGCT 
AGTAAGATCA 
AAGGACCTGA 
ATCAAOCTGC 
AACTGCACGC 
AOCTTGQCAA 
TTOGAGGGCA 
AAACCAACCA 
CCTCCAGGGC 
OCCIGOQTGG 



4440 

4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5330 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5620 
5880 
5940 
6000 
6060 
6120 
6160 
6240 
6300 
6360 
6420 
6429 



51 

I 

CCAGCCTGGG 
CAAAACAAAO 
GCACTTTC06 
AACATGGTGA 
GCCTGTAATC 
AGGTTGCAGT 
OGTTTCAaAA 
GCTCX5TT6CA 
CTGGAAATCC 
CTCCT6TCAA 
OQCTOQACCC 
CACAOTGCTO 
C06CTGCCGG 
CATGTGTCTC 
GGGGCAGCAG 
TGAGCATCCA 
CTGTGGAGCT 
TTGACTOCTC 
AGOGGTCCCA 
QCCCOTCCCT 
GGCCX3ACAGT 
ATCAGTATAC 
ACAATGAAAA 
GGCAGGCOGG 
GCATCCAGCC 
TGQACCACCC 
TQTACAT6GT 
CACTCTCTQA 
7ACACTACCA 
ATGGGCACAT 
TCCTCTCCAA 
GCAAOATCrr 
TTGTCTTTGG 
AGAGTCAGGC 
TCACCCGTAT 
ACCCCTGGGT 
TGGTOQAAOT 
OOGTGATCCT 
GCCGGCGGGA 
GGGAATGTGA 
ACCGACCOQC 
AAAGTTGCTG 



60 
130 
180 
240 
300 
360 
420 
480 
540 
600 
660 
*720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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GGCCCCCGCC CCOGGCTCCC CCGCAOOCAT GCATCCACTO OGGCCGGAGG AGGCCATGGA 2580 

GCCOGAGTAG CTGCCTGGAT CGCTOOACCT GGCATGOGOQ COGCGTCGCC TCTOGGGGGC 2640 

TGCTGCACOO COTTTCC^TA GCftCCATGTC CTAOGGAAAC CCAGCACGTG TGTAGAGCCT 2700 

0GAT08TCA7 CTCTCGTTAT iiCUAi ' l ' m Cinmimi ' TTAAAaOGOA CAAAAAAWIA 2760 

5 AAAAAAAG6A CTTCACTCCA TGAOGTGGAC 0QTG6C06CT G6CTGGCTGG ACAGG OGGGT 2B20 

GTGAGGAOTT GCAGACCCAA ACCCAOGTGC ATTTTGGGAA ATTOCTTTTT AAAAGATTTT 2880 

TATGCCAAAA ATCCTTCATT GTGATTTTCA GAACCACOTC AGATATACCA AGTGACTGTG 2940 

TQTGOGGTTT QACAACTGTO QAAAGGCGAG CAGAAAACTC OGGGGGTCTG AGGCCATGGA 3000 

GGTGGTTGCT GCATTTGA6A GGGAGTAGGG GGCTAGATGT GOCTGCTAOT GCAAAC06GA 3060 

10 AACCATG6CA CCTTCCAGAO COQTGGTCTC AAGGAOTCAG A6CAGGGCTQ GCCCTCAGTA 3120 

GCTGCAGGGA GCTTTGATGC AACTTATTTG TAAGAAGGAT TTTTAAATTT TTTATGGGTA 3180 

GAATTGTAGT CAGGAAAACA GAAAGGGCTT GAAATTTAAT AAGTGCTGCT GGAAGGGGAT 3240 

TTTCCAAGCC TQGAAGGOTA TTCAGCAGCT GTGGTGGGGA AACATTTCTC CTGAAAGACT 3300 

GAACGTGTTT CTTCATGACA GCTGCTCAAA GCAGGTTTCT QAOATAGCTO ACOSAGCTCT 3360 

15 GGTAAATCTC TTTQTCAAAT TACGAAAACT TCAGGQTGAA ATCCTATGCT TTCAT6TACA 3420 

TTACATGGCT TAAGATTAAA CAAAAACATT TTTCAAQTCT CTAACTAGAO TGAACTCTAO 3480 

AGCACAGTAQ TTCAQAAACT ATTTAQAQCT TCCAGGATAT ATTTCACAGC TTCAGGCATG 3S40 

TGATCAGTTA GAGCCGATGA AACCTATGCC C6CCTGTATA TATATTAGCA GCTTAGCTAQ 3600 

_ TTCATAACCT GTATATTCTA AAGACTGCTA AGGTTTTGTT TTCATTTTAA ATCCTAGCTG 3660 

20 ATTGTTGTGG TCAATGAAAT ACCCAGTTTC T6AAGGGCCA GGTGGGAAAT 6CTTTCACT0 3720 

GACCAACACA CAAATGATCA TCCTGAGGAT CTGAGCTTCC CTAGACTCCA CACAATAACC 3780 

TTGGGGCACC CTTTTAGAGA AGACTGTTGA AACCCACAGC ACTCGTTGGG GTAT6AGGAA 3840 

ACCAGGGCTT GGCACAGGAA GTTCCCCTTT GTAGCTAAAA GTCCAGAAAG AAAGGOTTCA 3900 

TCTTTTTGAC TTCCAACTQA TATTGGGAAG TTTGGTTGAO GTTCAAGTGT 6ACTCCTTCC 3960 

25 AGAGCCACAG GTAGGGGAGT GTGAAGTTGA GGGGGAGGAA AGCTGGAAGO ACTCTGCCTT 4020 

GGGAGATTCC CAGCTCTGCT TTCCAGOJCT TGGTGQAATC TGGGCTGGGG AAAGACGGCA 4080 

CCGGGAAACT CTGCTTCCCC ATTGTTTCCA TCTQATCAGC TGTGGTGTQA GGACTTCTCA 4140 

GACAAAGGCA AG6CCTG6TG CCCCTGCCCA GCCCATTCAT GQAGCCCT6G GCCTTCTT6G 4200 

CTTCCATAGA TCCTAA6CTC TTGACTGTAQ TTTAGCCAGA CTTGTTITGC TATCTTATAA 4260 

30 GCAGTTCAGA ATTAGGGAAT GCTGGTTTTG AAGAOCAAAG GACAGOTAGT CTAGAGAGGG 4320 

TCQTCTGGCC TOCTTGCTGG GTCTTTQTAA CCCAGCACTT CCTCTTGOCC TCCTGOCTTT 4380 

ATGTTTATGO GAGAGGACTC AATAGCTCCA CCCCTTCTGG CACCAQATGG OaCTTGGTTA 4440 

GTTTGCAATA AGCACCTTGC AGAGGTTAAA GCCAGOGGGT CCCTAGTCTT AGGCCCA6CC 4500 

TGCTTGTGTG GGCTCTCGCC TGGCCTGGTG GCTGGCCCAG GGGGCAGCAO TGCTTAOAGC 4560 

35 TTCTGCAGGG CTTCTCTTGT TTACACAQCT GCATCAQACA ATGCCATTTC TCCCCACCAC 4620 

GQAACCrrCC ATCTAAGATT TCTTCCAGGG AATGCCAGCA ATCAGGCAGC ACCCAGCTGT 4680 

GGGGGCAGTG GGGTGGGGGA GACCCACATT GATGACTTTT TTTTTTTCTT TTAATGAAGA 4740 

AACACCAAAG AAAGCTGTGG AAAGGACCTG CCCCACATX3A AAAGGATAAG CCAAGATGOC 4800 

TGTAAACACA GAGCATTTGA GCTGCCACTC TTGGAGCACA TTGATTTTTC AAAAGCCAGC 4860 

40 TCTGTCAQQA AAQGAGGTGC TGTTATGACA GCTCTTCCAO TGGGCAAAGA G6ACGCCCAT 4920 

AATTTCTTCC ATTGCTAGCT CATCTGTGGG ACCAATTTGG TGTAAGCAAC CTQTGGCCTQ 4980 

CACTTGTGQC CTCGAAGGAA GCACAAACCC TCCATCCACT TCCCATTTCC TCTGCCCTTT 5040 

TCCACCTCCC CCTTCCATCC CACCAGCTGC CAGTGGCTCC CAGAAAGCCT TATTGAGCCC 5100 

CTTGTTQACA CTTGGGGCTG CGQAGGCCTT TCCCTACTGQ TCTQGCCTTT CCTGAGAGGC 5160 

45 AGGTCTTCOG TCCTCAGAGC CTTTCTGQAA CAAGGAGAAT GCCTGTGCAG GTGGACACAC 5220 

AG6CCTGGCC TGTCGCTCTC ACTTGTCTTC CAGOGGGGAG CTTCAOGTTO CCQAQTGGAA 5280 

GAACCATGAC CTCCACTTGC TTCCAAGOTG CTAGQQAAGT TTCAGGGTAC GCTGGTTCCC 5340 

CTCTCCAGCT GGAGGCCGAG TTTCTGGGGA CTGCAGATTT TTCTACrCTG TGATCG ATTC 5400 

AATQCCOGAT GCTTCTGTTT CATTCCCGAC CCTTTCTACT ATGCATTTTC CTTTTATCAG 5460 

50 OTQTATAAAa TTAAATACTG TOTATTTATC ACTAAAAAOT ACATGAACTT AAGAGACAAC 5520 

TAAGOCTTTC CTGTTTTTCC ACAGGTOTTT AAGCTTCTCT OTACAGTrOA AATAAACAGA SSBO 

C3UXAAAATG QTAAAAAAAA AAAAAAAAAA A 5611 

Seq ID NOi 01 93 DKA Sequence 
55 Nucleic Acid Accession ftt MM_018646 
Coding eequencet 217.. 2394 



60 
65 
70 
75 
80 



GCTCTGCCAA 
TCCCAAGOAA 
GQGCCGGCCC 
CGGCCTCAGG 
AAAGGGCTAA 
TGGGCCCAGA 
CTCCTTCTAG 
GATTGCAAGG 
TATGACAACC 
CCCATGACAT 
AACATQAACC 
GGCACTGCCT 
TTTGCTGCCT 
ATCCGGGCCC 
AAAACCTTTG 
CTGCAGCCCC 
GTGGAGGGTA 
AOGTATGGAC 
GAGCAGTCCC 
GAGCAOAOGC 
TTCTGCATGC 
TACOGCC C CC 
CAGCAGAAGC 
GAGCTOOTGA 
AGAATOaGGO 



11 
I 

GTGTAACAAA 
CTCGTCAGGA 
TTGGGGGGGC 
CCCCCAAGGA 
TTCTCTGCCT 
OCOQAOATGA 
CTGCCAAAGA 
TGCACCAGAG 
TGGAGGCCGC 
CTGA6CTCTA 
TGGTGCOAGC 
TCCGCCGTAG 
GTGTGAACAO 
AGGACTCCCT 
CCTGCCAGAT 
TGGACCTCGT 
ACACTGT6AT 
CACTQACCTC 
T6CTGGAACT 
OGGTQAAGOA 
TGGGTGCCAT 
TCAAGCCCAG 
TACTTCAGGA 
CTGTCATTGG 
TCACTC6CTT 



21 
I 

CTCACAOCCC 
AGGCAGGAGA 
TGATGTGGCC 
GCOGGCCCTA 
ATGGAGCAAO 
GCAOAACCTG 
TAATQATGTC 
AGQAGCCATO 
CATGGTGCTG 
TOAGGOTCAO 
CCTGCTTGCC 
TCCCTGCAAC 
TQAG6AGATC 
GG6AAACACA 
GTACAAGCTG 
GCCCAATCAC 
GTTTCAGCAC 
GACTCTCTAT 
TATCATCAOC 
GCTGGTGAGC 
ATATCTGCTG 
QACCAATAAC 
A6CCTACATG 
GGCTATCATC 
CTTTGGACAG 



31 

! 

TCTCCAAACT 
CAGGAGAGGG 
CCAAGQCTGA 
CACCCCATGO 
TTCTGCAGAT 
CTGCAGCAGA 
CAGGCCCTGA 
GGGGAAACAG 
ATGGAGGCTG 
ACTGCACTGC 
CGCAGGGCCA 
CTCATCTACT 
QTGOGGCTGC 
GTGTTACACA 
TTGCTGTCCT 
CAGGGTCTCA 
CTQATGCAGA 
GACCTCACAG 
ACCAAGAAGC 
CTCAAGTGGA 
TACATCATCT 
COCAGQAGCC 
ACCCCTAAGG 
ATCCTQCTGG 
ACCATOCTTG 



41 

I 

GGCTGGGGCT 
GACCTCTACA 
GTCCOGTCAG 
GTTTGTCACT 
GGTTCCAGA6 
AGAGGATCTG 
ACAAQTTGCT 
CGCTACACAT 
CCCOGGAGCX 
ACATOGCTGT 
GTGTCTCTGC 
TTGGGGAGCA 
TCATTGAGCA 
TCCTCATCCr 
ACGACAGACA 
CCCCTTTCAA 
AGOQGAAGCA 
AGATCGACTC 
GGGAGGCTCG 
AG06GTA0GG 
GCTTCACCAT 
CCOGGGACAA 
AOGATATCCG 
TAGAGGTTCC 
GGGGGCCATT 



51 

I 

GCTGGGAGAC 
GGGAGAGGGT 
GGTCTGGCCT 
GCCCAAGGAG 
ACGGOAGTCC 
GGAGTCTCCT 
CAAGTATGAG 
AGCAGCCCTC 
GGTCTTTGAG 
TGTGAACCAG 
CAGAGCCAGA 
CCCTTTGTCC 
TGGAGCTGAC 
CCAGCCCAAC 
TGGGGACCAC 
GCTGGCTGGA 
CACCCAGTGG 
CTCAGGGGAT 
CCAOATCCTG 
0OQGGC6TAC 
GTGCTGCATC 
CACOCTCTTA 
GCTOGTOQGG 
AQACATCTTC 
CCATGTCCTC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1300 

1260 

1320 

13B0 

1440 

1500 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 



ATCATCACCT 
GGGGAGGTGG 
GC COgAG GAT 
QACCTGAltSC 
TATATCATCT 
GCCCTGTTCA 
GTGGACCTGC 
CTCATGCTCft 
OGGQA.TGAGC 
CCTOGCTGCC 
TGGTTCCTGC 
CAGGCCTTCC 
CTGGGCTGTC 
TCC06CAGCA 
QGOATAATCA 
GTTCTCACTT 
AACCAAACAC 
CGCCAAGQAA 
GGCAGAG6AA 
TOCTCATCAC 
TGCTTGGGCG 
ACCCAGOCAQ 
AAG66CTCT0 
TTTTCGGATC 



ATOCCTTCAT 
TACCCATCTC 
TCCAGATGCr 
GATTCTGCTO 
TCCABACAGA 
GCACXTTTOQA 
CCTTCATGTA 
AGCTCCTCAT 
TQTGGAGGGC 
TGTG6CCTC6 
GGGTGGAM^ 
ACACCCGGGG 
CCTTCAGCCC 
GTGOCAATTG 
ACAQGGOTCT 
OGCTTOCTGG 
CCAGAGGTCT 
TGTAOOTTGA 
GCCCAGCCCA 
CCTTCCXSACA 
CTCGGTCTCC 
GTGCA60GAA 
GGGGTGGGAA 
OCIGAAAAAA 



GU-lXjCit iC JTO 
CTTTGCACTC 
AGGCCCCTTC 
GCT6ATGGCT 
GGACCCCGAG 
GCTOTTCCTT 
CAGCATCACC 
TGCCATGATG 
CCAGATT6TG 
CTCGGGGATC 
CAGGCAAGAT 
CTCTQAGGAT 
CCACCTGTCC 
6GAAA06CTT 
6GAGGA0GGG 
AACTTGCTCT 
CATCTCCCAQ 
GAATCACTGC 
AGChCXIGGGC 
GOAGCACTGC 
ACCCCAGGGT 
GTGCAOAGCT 
GtGGGGCTAG 
AAAAAAAAAA 



ACCATGGTGA 
GTGCTGGGCT 
ACCATCATGA 
GTGGTCATCC 
6AGCTAG6CC 
ACCATCATCG 
TATGCTGOCT 
GGCGACACTC 
GCCACCACXXS 
TGOGGAGGOO 
CTCAAOCGGC 
TTGGACAAAO 
CTTCCTAT6C 
0G6CAAGGGA 
GASAGCTGGG 
CATTTTCCTQ 
GCCCCAGGGG 
TCCAGGCCTG 
TGGrCAOGGCB 
ATGTCA6AGC 
CATAA6TGGG 
TGTGGAAAGC 
GTCTTGCCAA 



T6CGCCTCAT 
GGTGCAACGT 
TTCAOAAGAT 
IGGGCTTTGC 
ACTTCTAGGA 
ATGGCCCAGC 
TTGCCATCAT 
ACTGG06AQT 
TQAT6CTGGA 
AOTATGGCCT 
AGOGGATCCA 
ACTCAGTGGA 
CCTCAGTGTC 
CCCTGAGGAG 
AATATCAGAT 
GGTGCATCAA 
AGAAAGAGGA 
CATTACTCCT 
TGAOGAACTC 
ACTTTAAAAA 
GAGAGA6CCC 
GTGTGAGTGA 
CTCCATCTTC 



CAGTGCCAGC 
CATGTACTTC 
GATTTTTGGC 
TTCAGCCTTC 
CTACCCCATG 
CAACTACAAC 
CX3CCACACTG 
GGCC CATCAG 
G06GAAGCTQ 
6GGAGACCGC 
AOGCTACXSCA 
AAAACTAGAG 
TOQAAGTACC 
AOAOCTOGGT 
CTQACTGCGT 
ACAAAACAAA 
GTAGCATGAA 
TCAGCTCTGO 
TCCTGT6G0C 
CAGGGCA6GC 
TTCCCAGGGC 
GGGAGACAGG 
AATAAAGTGG 



Seq ID KOt C194 DKA Sequence 

Nucleic Acid Accession ft: NM_021910.1 

Coding sequence t 2 60.. 601 



1 
I 

GTTCTCCACA 
GTGAGGTTTG 
ATTTCTCCOO 
CCCACTCCAC 
CCCGGGCCAG 

GCACAGCCTC 
CATOGTCATG 
TCCCCTGACC 
GTCOGGTCAC 
ATGAGGACAG 
TGCACAQAAA 
GGCAGGGCCT 
AATQATTOTG 
TTGGGGCACC 
CTGCTTGTTG 
OGCCCTCTGC 
AAGGAATGAA 
CATTCCAOOA 
ATAATAGTAA 
TTTCCAAATA 
AACATACAAT 
CCCAATACCT 
CTCAAAACAC 
ACAATAAACT 



11 

1 

ACTGCCAGCA 
CTTAGGGOGT 
QAACCTCTGC 
GGTGCAGCTO 
CGCTCTQACA 
CTGGACGCCA 
CAGOTTOQCG 
AGTGAGTGQA 
ACTCAGCTCT 
CATCCAGGGO 
ACCAGCTGAA 
CTTOAACTCC 
CATCTCAOCT 
CCTCTGCCCA 
ATGAGAAGGT 
GGAAAAGCCC 
CTCCTCAGCC 
AGAGAGCTCr 
AAACTGGGAC 
6CCCCT6TGA 
CCCAAATATC 
GACXaVCATAQ 
AATAAGGCAT 
CCAAAAATAT 
0GT6ACTTQT 



21 

1 

ATCCTTCCAC 
GGCAGCTTCG 
TCAGCCTGGT 
CGGCTTATCT 
TGCAGAA66T 
ATGACCTAGA 
GGCTCATCTG 
GGAGCTOGGG 
CCCCAACAGG 
AGACTCCACC 
ATTGGGTGGA 
AGGATGGAAT 
CTCGCAAGAQ 
AGCAGCCTGG 
TGGCGTGCCC 
ACAGGCCTOT 
ATGGGACCrC 
AACCAGATOG 
ATAGGATC6T 
CCCTCTCACT 
CCTOCAAOCC 
TGTGA6AACT 
GCGAAATGTT 
CTCCTCCAAT 
CCCCTC 



31 

I 

CAGGCAAAAC 
GATAAAGGCA 
GAACCACACA 
CTCAOCCCAG 
GACCCTGGGC 
AGATAAAAAC 
CGCTGGGGTT 
GGAGCAGGGG 
TGCAAAATGC 
TCTCATCACC 
GQACCGTTCT 
TCTTCCTCCT 
GGTCTCTTTG 
AGACTTCCTA 
TGGAGGCTGA 
TCCCTTQTGG 
ATATGCAATT 
AACACTGQAA 
CCCGCTATGA 
TACCX:OQAaA 
CX3TTAAATAA 
TCCAACAAGC 
CTCATQAACT 
GTCCTGAGAC 



41 

1 

ACATCATCTA 
GGACTGOGCC 
GGOCOSftOTT 

cx3aqatgcx:a 

CTGCTTGTGT 
AOTCCTTTCr 
CTGTGG6GCA 
GQCCQOGGCr 
AAATGCAAGT 
CCCGGCTCAG 
CTQTCCCCAG 
CTGCTGGGAC 
TTCAATTTTT 
TGTGTGCATT 
CACAQAGGCT 
CTTGGGACAT 
TGGGATTTAC 
CATTCCAGTG 
TGGAAGTGTT 
CCTCACTTTA 
TTCCCTATGC 
CTCftAAOTCC 
ACCCCACAAC 
ATGAAOCCAA 



1560 
1620 
16B0 
1740 

leoo 

1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2918 



SI 

I 

AGGAAAAGAA 
T0GCA6CCGG 
TCACCC3OTC 
GCCTTCCTGT 
TCCTGGCAGG 
ACTATOACTQ 
TGGGCATCAT 
GGQGCTCCCC 
TT6GCCAGAA 
CCCAAAGCTG 
6TCCTGTCTC 
TCCTTTQCAT 
TTTAATCTAA 
GGGGTGGGGC 
OGCACTGAGC 
GGCACAGGCC 
TAGTAGCCAA 
GACOCTGGAC 
CAGACAGTTT 
TTACAAGATC 
TACCCTTAAT 
CTT6AQACTC 
A06CCTAAAA 
AAAOAQACCC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1466 



60 
65 
70 
75 
80 



Seq ID NOs C195 DNA Sequence 

Kucleic Acid Accession MM_005971.2 

Coding sequence} 176.. 43 9 



1 
I 

GTTCTCCACA 
OTGAGGTTTG 
ATTTCrCCOQ 
GAAGGTGACC 
CCTAGAAGAT 
CATCTGOGCT 
ATOCAAOTTT 
AGGCTCAGCC 
GTCCCCAGGT 
GCTGGGACTC 
CAATTTTTTT 
TGTGCATTGG 
CAQAGGCTGO 
TGOQACATQG 
GGATTTACTA 
TTCCAGTGQA 
GAAGTGTTCA 
TCACTTTATT 
CCCTATGCTA 
CAAAGTCCCT 
CCCACAACAC 



11 
I 

ACTGCCAGCA 
CTTA6GGGGT 
GAACCTCTGC 
CTGGGCCTGC 
AAAAACA6TC 
GGGGTTCTGT 
GGCCAGAAOT 
CAAAGCTGAT 
CCTGTCTCTG 
CTTTGCATGG 
TAATCTAAAA 
GGTGGGGCTT 
CACTGAGCCT 
CACAGGCCCG 
GTAGCCAAAA 
CCCTGGACCA 
GACAGTTTAT 
ACAAGATCTT 
CCCTTAATAA 
T6A0ACTCCC 
GCGTAAAACT 



21 

1 

ATCCTTCCAC 
GGCAOCTTOG 
TCAGCCTGGT 
TTGTGTT C CT 
CTTTCTACTA 
GOGCCATGGG 
COGGTCACCA 
GAGGACAGAC 
CACAGAAACT 
CAGGGCCTCA 
TGATTGTGCC 
GGGGCACCAT 
GCTTGTTGGG 
CCCTCTGCCT 
GGAATGAAAG 
TTCCAGQAAA 
AATAGTAAGC 
TCCAAATACC 
CATACAATGA 
CAATAGCTAA 
CAAAACACCC 



31 

I 

CAGGCAAAAC 
GATAAAOGCA 
GAACCACACA 
GGCAGQCTTT 
TGACTGQCAC 
CATCATCATC 
TCCAGGGGAG 
CAGCTGAAAT 
TQAACTCCAG 
TCTCACCTCT 
TCTGCCCAAG 
GAGAAGGTTG 
AAAAGCCCAC 
CCTCAGCCAT 
AGAGCTCTAA 
ACTGGGACAT 
CCCTGTGACC 
CAAATATCCC 
CCACATAOTG 
TAASGCATGC 
AAAAATATCT 



41 

I 

ACATCATCTA 
GGACTGOGCC 
GGCCAGOGCT 
CCTGTCCTGG 
AGCCTCCAGG 
GTCATGAOTO 
ACTCCACCTC 
TGGGT6GAGG 
QATGGAATTC 
G6CAAGAGGG 
CAGCCTGGAO 
GCGTGCCCTG 
AGGCCTQTTC 
GGQACCTCAT 
CCAGATOQAA 
AGGATOGrTCC 
CTCTCACTTA 
T6CAAGCO0G 
TGAQAACTTC 
GAAATGTTCT 
CCTCCAATGT 



51 
I 

AGGAAAAGMV 
TGGCAGCCOO 
CTGACATOCA 
ACGCCAATGA 
TTGGCGGGCT 
CAAAATGCAA 
TCATCACCCC 
ACOGTTCTCT 
TTCCTCCTCT 
TCTCTTTGTT 
ACTTCCTATO 
GAGGCTGACA 
CCTTGTGGCT 
ATGCAATTTQ 
CACTGGAACA 
G6CTATQATQ 
CCCCGAGACC 
TTAAATAATT 
CAACAAGCCT 
CATGAACTAC 
CCTGAGACAT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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OAACCCAAAA AOAGACCCAC AATAAACTOG TCACTTOTCC CCTC 



1304 



5 

10 

15 

20 

25 
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65 

70 

75 

80 



Seq ID NO: CI 96 ONA Sequence 

Nucleic Acid Accession fit NM_004961.2 

Coding sequences 55.. 1575 ~ 

1 11 21 31 41 51 

1111 I I 

GCCAGAGCX3T GA6CCGGQAC CTOOG O GCAG GTGGTG6C3GC OGGTCTCOQC G GAAAT OTTQ «0 

TCCAAAOTTC TTCCAOTCCT CCTAOGCATC TTATTQATCC TOCAGT06AG GGTG6AG66A 120 

CCTCAGACTG AATCAAAGAA TGAAGCCTCT TCCCX5TGATQ TTGTCTATGG CCCCCAGCCC 180 

CAGCCTCTGG AAAATCAGCT CCTCTCTGAG GAAACAAAGT CAACTGAGAC TGAQACTGGG 240 

AGCAGAOTTG GC3VAACTGCC AGAAGCCTCT OGCATCCTGA ACACTATCCT GAGTAATTAT 300 

GACCACAAAC TGOGCCCTGG CATTGQAGfiG AAGGCCACTG TOGTCACTOT TGAOATOSOC 360 

GTCAACAGCC TT6GTCCTCT CTCTATCCTA 6ACATGGAAT ACACCATTGA CATCATCTTC 420 

TCCCAGACCT GGTACGACGA ACGCCTCTGT TACAACX3ACA CCTTTGAOTC TCTTOTTCTG 480 

AATGQCAATG TGOTQAQCCA GCTATGGATC CCGGACACCT TTTTTAGQAA TTCTAAGAGG 540 

ACCCACGAGC ATQAGATCAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGAT6GCAA0 600 

GTOTTQTACA CAATTAGGAT GACCATTGAT GCGGGATGCT CACTCCACAT 6CTCAGATTT 660 

CCAAT6GATT CTCACTCTTG CCCTCTATCT TTCTCTA6CT TTTCCTATCC TGAGAATGAG 720 

AT6ATCTACA AGTGQGAAAA TTTCAAGCTT 6AAATCAATG AGAAGAACTC CTGGAAGCTC 780 

TTCCAOTTTQ ATTTTACAOO AOTGAGCAAC AAAACTQAAA TAATCACAAC CCCAGTTGGT 640 

GACTTCATGG TCATGACGAT TTTCTTCAAT GTGAGCAGGC 66TTTGGCTA TGTT6CCTTT 900 

CAAAACTATG TCCCTTCTTG OGTGAGCAGG ATOCTCTCCT GGGTTTCCTr TTGGATCAAG 960 

ACAGAGTCTG CTCCAGCCOG GACCTCTCTA GGGATCACCT CT6TTCTGAC CATQACCACG 1020 

TTGGGCACCT TTTCTCOTAA GAATTTCCXX3 OOTGTCTCCT ATATCACAGC CTTGGATTTC 1080 

TATATCGCCA TCTGCTTCGT CTTCTGCTTC TGOQCTCTQT TGGAGTTTGC TGTQCTCAAC 1140 

TTCCTGATCT ACAACCAGAC AAAAGCCCAT GCITCrCCTA AACTC08CCA TCCTOSTATC 1200 

AATAGCCGTG CCCATGCCOO TACCOQTGCA GQTTCCX6AG CCTGT6CCC0 CCA ACATCAG 1260 

GAAGCTTTTG TGTGCCAGAT TGTCACCACT GAG6GAAQTQ ATQGAGAGGA GCGCCOQTCT 1320 

TGCTCAGCCC AGCAGCCCCC TAQCCCAGGT AGCCCTGAGG GTCCCCGCAQ CCTCTGCTCC 1380 

AAGCTGGCCT GCTGTQAQTG GTGCAAGOGT TTTAAGAAQT ACTTCTGCAT GGTCCCOOAT 1440 

TGTGAGGGCA GTACCT6GCA OCAGGGCCGC CTCTGCATCC ATGTCTACCO CCTGGATAAC 1500 ' 

TACTCGAGAG i tc TTTTCCC AGTGACTTTC TTCTTCTTCA ATGTGCTCTA CTGGCTTGTT 1560 

TGCCTTAACT TGTAGQTACC ACKZTGOTACC CTGTGGQOCA ACCTCTCCAG TTCCCCAGGA 1620 

GGTCCAAGCC CCTTGCCAAG GGAOTTGGGG GAAAGCAGCA GCAGCAGCAG GAGCXSACTAG 1680 

AGTTTTTCCT GCCCCATTCC CCAAACAGAA GCTTQCAGAa QGTTTGTCTT TQCTQCCCCT 1740 

CTCCCCTACC TGGCCCATTC ACTGAGTCTT CTCftGCAQAC OVTTTCAAAT TATTAATAAA 1800 

TGGGCCACCT CCCT Cr T C r r CAAGGAGGAT OGGTGATGCT CAGTGTTCAA AACCACAGCG 1660 

ACTTAGTQAT CAGCTCCCTA AAACCAT6CC TAAOTACAGG 06GATTAGCT ATCTTCCAAC 1920 

AATGCTGACC ACCAGACAAT TACTGCATTT TTCCAGAAGC CCACTATTGC CTTTGTAGTG 1980 

CrrrOGQCCC AQTTCTGQCC TCAGCCTCAA AGTOCACCaA CTAGTTGCTT GCCTATACCT 2040 

GGCACCTCAT TAAGATGCTG GGCAGCAGTA TAACAGQAGG AAGAGATCCC TCTCCTTTGG 2100 

TCAGATTATT ATGTTCTCAG TTCTCTCTCC CTGCTACCCC TTTCTCTGCA GATAOATAQA 2160 

CACTGGCATT ATCCCTTTAG GAAQAGGOGG GGGCAGCAAG AGA6CCTATT TGGGACAGCA 2220 

TTCCTCTCTC TCTGCTGCTG TQACATCTCC CTCTCCTTGC TOGCTOOVTC TTTOQTCTOC 2280 

ACTACCAATT CAATGCCCTT CATCCAATGG GTATCTATTT TTGTGTGTGA TTATAGTAAC 2340 

TACTCCCTGC TTTATATGCC ACCCTCTTCC TTCTCTTTGA CCCCTGTGAC TCrTTCTGTA 2400 

ACTTTCCCAG TQACTTCCCC TAGCCCT6AC CCAGGCACIA GGCCTTGOTG ACTTCCTGOG 2460 

6CCAA6AAAC TAAGGAAACT OGGCTTTGCA ACAGGCATTA CTOGCCATTG ATTGGTGCCC 2520 

ACCCAGGGCA CACTGTOGGA GTTCTATCAC TTOCTTGACC CCTGOACCCA TAAACCA6TC 2580 

CACTGTTATA CCOGGGGCAC TCTAACCATC ACAATCAATC AATCAAATTC CCTTAAATTT 2640 

GTATGGCACr GGAACTTTGG CAAAGCACTT TTGACAAGTT GTGTCTGATT GQAGCTTCAT 2700 

GATAGCCTTQ TGACATCTTT AGGGCAOGAT TCTTATCOCC ATTTTGCAOA T6AAAA0CCT 2760 

GAGTCACA6A TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA GCOCACTGTC 2820 

ACCTTCTAQA CCACATQATA GGGCTAGACA GCTCAfiTTCA CCATGATTCT CTTCTGTCAC 2880 

CTCTGCTGGC ACACCAGTGQ CAAGGCCCAG AATGQOQACC TCTCTTTAGC TCAATTTCTQ 2940 

GGCXnXSAGGT GCTCAGACTG CGCCCAAGAT CAAATCTCTC CTGGCTGTAG TAACCX»GTG 3000 

GAATGAATTT GGACATGCCC CAATGCTTCT ATATGCTAA6 TGAAATCTGT GTCTGTAATT 3060 

TGTTGGGGGG TGQATAGGQT GGGGTCTCCA TCTACTTTTT GTCAC9CATGA TCTGAAAT6G 3120 

GGAAATATOT AAATAAATAT ATCAGCAAAO CAAAAAGAAA AAAAAAAA 3168 

Seq ID NOt C197 DNA Sequence 

Nucleic Acid Accession fix NM_0219a4.1 

Coding sequence: 572.. 1753 " 



1 
I 

GCCAGAGCGT 
TCCAAAOTTC 
CAGAGAAGTG 
GTGTAAAGAA 
CACTGCCTCC 
TCAGACTGAA 
GCCTCTGGAA 
CAGAGTTGGC 
CCACAAACTG 
CAACAGCCTT 
OCAOACCTGG 
TGGGAATarO 
GCAGQAGCAT 
GTTGTACACA 
AATGGATTCT 
GATCTACAAG 



11 

I 

GAQCOGCGAC 
TTCCAOTCCT 
CTCAAATCAT 
A6CCAAATCA 
CAGCAAAGGC 
TCAAAGAATO 
AATCAGCTCC 
AAACTGCCAO 
CGCCCTGGCA 
GGTCCTCTCT 
TACGAOQAAC 
GTGAGCCAGC 
GAGATCACCA 
ATTAGGATGA 
CACTCTTGCC 
TGG6AAAATT 



21 
1 

CTCOQCGCAG 
CCTAGGCATC 
AAGTGTACAG 
AGGACCOGAA 
AGCACTATCC 
AAGCCTCTTC 
TCrCTGAGGA 
AAGCCTCTCO 
TTGGAGAGAA 
CTATCCTAGA 
GCCTCTGTTA 
TATG6ATCCC 
T6CCCAACCA 
CCATTGATGC 
CTCTATCTTT 
TCAAGCTTQA 



31 

I 

GTGGTCQCQC 
TTATT6ATCC 
CTGATGAOTT 
TGTGAGCAGG 
GGACTTCTAA 
COOTQATGTT 
AACAAAGTCA 
CATCCTQAAC 
GCCCACTGTG 
CATGGAATAC 
CAAOQACACC 
GGACACCTTT 
6ATGGTCCGC 
OGGATGCrOl 
CTCTAQCTTT 
AATCAATGAQ 



41 

I 

CGGTCTCCGC 
TCCAGTOGAG 
GTCAAAAAAT 
ACCTCAGAAG 
CACCATOGGG 
GTCTATGOCC 
ACTBA6ACTG 
ACTATOCTGA 
GTCACTGTTG 
ACCATTQACA 
TTTGAGTCTC 
TTTAGGAATT 
ATCTACAAGG 
CTCCACATGC 
TCCTATCCTO 
AAGAACTCCT 



51 
1 

GGAAATGTTG 
AACATGTATA 
GACCACAGCQ 
CCOOCTTTGT 
TOGAGGGACC 
CCCAGCCCCA 
AGACTGGGAG 
GTAATTATGA 
AGATCTCCGT 
TCATCTTCTC 
TTGTTCT6AA 
CTAAGAGGAC 
ATGGCAAGGT 
TCAGATTTCC 
AGAATGAGAT 
GGAAGCTCTT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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CCAOTTGGAT TTTACAGOAO TGAOCAACAA AACTGAAATA ATCACAACCC CA6TTGGTGA 1020 

CTTCATGGTC ATGAOGATTT TCTTCAATGT OAGCAGGCGQ TTTQQCTATQ TTGOCTTTCA 1080 

AAACTATGTC CCTTCTTCCQ TGACCACOAT GCTCTCCTGG Cr r n ' CCm ' T GGATCAAOAC 1140 

AOAOTCTGCr CCAGCXX^GGA CCTCTCTAGO GATCACCTCT GTTCTGAOCA TQAOCAOGrTT 1200 

5 GGGCACCTTT TCT06TAAGA ATTTCOCOOO T G TCTCCTAT ATCACAGCCT TGQATTTCTA 1260 

TATCGCCATC TGCTTOGTCT TCTGCTTCTG OGCTCTGTTQ GAOTTTGCTG TGCTCAACTT 1320 

CCTGATCTAC AACCAGACAA AAGCCCATGC TTCTCCTAAA CTC03CCATC CTCGTATCAA 1380 

TAGCCQTQCC CATOCCOSTA CC06T6CAGQ TTCCOGAGCC TGT6CCCGCC AACATCAGGA 1440 

, ^ AGCrmGTO TGCCAOATTO TCACCACTGA GGGAA0T8AT G6AGAG6A0C GCCOOTCTTQ 1500 

10 CTCAGCCCAG CAGCOCXXITA GC0CAQ6TA0 CCCT6AGQGT CCCOOCAGCC TCTGCTCCAA 1560 

GCTGGCCTGC TGTCAGTGGT GCAAGCGTTT TAAGAAGTAC TTCTGCATGG TCCCCGATTG 1620 

TGAGGGCAGT ACCTGGCAGC AGGCCOGCCT CTGCATCCAT QTCTACOCSCC TOQATAACTA 1680 

CTOOAGAOTT GTTTTCCCAO TOACTTTCTT CTTCTTCAAT GTGCTCTACT GGCTTGTTTG 1740 

OCTTAACTTG TAGGTACCAO CTGGTAOCCT GTQOQGCAAC CTCTCCAOTT CXXXaOGAGO 1800 

IS TOCAAOCCCC TTGCCAAGGG AGTTGOGGGA AAGCAGCAGC A6CAGCAGGA GG6ACTAGA6 1860 

TTTTTCCTGC CCCATTCCCC AAACAGAAOC TTGCAGAGGG TTTGTCnTG CTGCCCCTCT 1920 

CCCCTACCTG GCCCATTCAC TOAGTTTTCT CAQCAGACCA TTTCAAATTA TTAATAAATQ 1980 

GGCCACCTCC CTCTTCTTCA AGGAGCATCC GTGATGCTCA GTGTTCAAAA CCACAGCCAC 2040 

TTAGTGATCA GCTCCCTAAA ACCAT6CCTA AGTACAGGOS OATTAGCTAT CTTOCAACAA 2100 

20 TGCTGACCAC CAGACAATTA CTGCATTTTT CCAGAAGCCC ACTATTGCCT TTGCAOTOCT 2160 

TTCGGCCCAQ TTCTGGCCTC AGCCTCAAAG TGCACCQACT AGTTGCTTGC CTATACXTOG 2220 

CACCTCATTA AGATGCTGGG CAGCAGTATA AC3W3GAGGAA GAGATCCCTC TCCTTTGGTC 2280 

AGATTATTAT GTTCTCAGTT CTCTCTCCCT GCTACCCCTT TCTCTGCAGA TAGATAGACA 2340 

CTGGCATTAT OXTTTAGGA AGAOQGGGGG GCAGCAAOAO AOGCrATTTG GGACAGCATT 2400 

25 CCTCTCTCTC TGCTGCTGTG ACATCTCCCT CTCCTTGCTG GCTCCATCTT TOGTCTGCS^C 2460 

TACCAATTCA ATGCCCTTCA TCCAATGGGT ATCTATTTTT GTGTOTGATT ATAGTAACTA 2520 

CTCCCTGCTT TATATGCCAC CCTCTTCCTT CTCTTTGACC CXrrGTQACTC TTTCrGTAAC 2580 

TTTCCCAQTG ACTTCCCCTA GCCCTGACCX: AGOCACTAGQ CCTTGOTGAC TTCCTGGGOC 2640 

CAAGAAACTA AGGAAACTOQ GCTTTGCAAC AOGCATTACT OQCCATTQAT TGGTGCCCAC 2700 

30 CCAGGGCACA CTGTCGGAGT TCTATOVCTT GCTT6A0CCC TGGACCCATA AAOCAOTCCA 2760 

CTGTTATACC CGGGGCACTC TAACCATCAC AATCAATCAA TCAAATTCCC TTAAATTTGT 2820 

ATGGCACTGG AACTTTGGCA AAGCACTTTT GACAAGTTGT GTCTGATTGQ AGCTTCATGA 2880 

TAGCCITGTG ACATCTTTAO GGCAGGATTC TTATCCCCAT TTTGCAGA70 AAAACCCT6A 2940 

GTCACAGATT TCTGTGGGAC TGTGGATCTC ACTGGAA6CT ATCCAAGAGC OCACTGTCAC 3000 

35 CTTCTAGACC ACATGATAGG GCTAGACAOC TCAGTTCACC ATGATTCTCT TCTOTCACCT 3060 

CTOCTGGCAC ACCAGTGGCA AGGCCCAGAA TGGCGACCTC TCTTTAGCTC AATTTCTGGG 3120 

CCTGAGGTGC TCAGACTGCC CCCAAGATCA AATCTCTCCT GGCTOTAQTA ACCCAOTOGA 3180 

ATQAATTTGO ACATGCCCCA ATGCTTCTAT AT6CTAAGTG AAATCTGTGT CTGTAATTTG 3240 

TTGGGGGGTG GATAGGGTGG GGTCTCCATC TACTTTTTOT CACCATCATC TQAAATGGGO 3300 

40 AAATATGTAA ATAAATATAT CAQCAAAQC 3329 



8eq ID NOt CI 9 8 DNA Sequence 
Nucleic Acid Accession «: NM_021987.1 
Coding sequence i 572.. 1657 

45 

1 11 21 31 41 51 

I I I I I I 

GCCAOAGOQT QAQCOGOQAC CTCCGOGCAG GTGGTCX50GC CGGTCTCOGC GGAAATGTTG 60 

TCCAAAGTTC TTCXaVGTCCT CCTAGGCATC TTATTGATCC TCCAOTCGAO AACATGTATA 120 

50 CAQAGAAOTO CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

6TGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGQ ACCTCAGAAG CCCCCTTTQT 240 

CACTGCCTOC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TOGAGGGACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC COGTQATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGQA AACAAAOTCA ACTQAQACTO AGACTGGGAG 420 

55 GAGAGTTGGC AAACTGCCAG AAGCCTCTOQ CATCCTGAAC ACTATCCTQA GTAATTATGA 480 

CCACAAACTG OQOOCTGGCA TTGGAGAGAA GCCJCACTGTG GTCACTGTTG AGATCTCOGT 540 

CAACAOCCTT QOTOCT C T C T CTATOCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCA6ACCTGG AATTCTAAGA GGACCXSlCm GCAT6AGATC ACCATGOCCA ACCAGATGGT 660 

CCQCATCTAC AAOGATGGCA AGOTOTTOTA CACAATTAOG ATGACCATTG ATG0CX3GAT0 720 

60 CTCACTCCAC ATGCTCAGAT TTCCAATGGA TTCTCACTCT TGCCCTCTAT CTTTCTCTAG 780 

CTTTTOCTAT CXrTGAGAATQ AOATGATCTA CAAGTGGGAA AATTTCAAGC TTGAAATCAA 840 

TGAGAAGAAC TCCTGGAA6C TCTTCCAGTT TGATTTTACA GGAGTGAGCA ACAAAACTGA 900 

AATAATCACA ACCCCAGTTG 6TGACTTCAT GGTCATGAOO ATTTTCTTCA ATGTGAGCAG 960 

GCGGTTTGGC TATGTTGCCT TTCAAAACTA TGTCCCTTCT TCXX3TGACCA CGATGCTCXC 1020 

65 CTGGGTTTCC TTTTGGATCA AGACAGAGTC TGCTCCAGCC OGGACCTCTC TAGGGATCAC 1080 

CTCTGTTCTO ACCATGACCA CGTTGGGCAC CTTTTCTCGT AAGAATTTCC OGOGTGTCTC 1140 

CTATATCACA GCCTTGGATr TCTATATGQC CATCTGCTTC GTCTTCTOCT TCTGOGCTCT 1200 

OTTGGAaTTT GCTGTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC ATGCTTCTCC 1260 

TAAACTCCGC CATCCTCGTA TCAATAGCOO TGCCCATGCC OGTACCOGTG CAOGTTCXX33 1320 

70 AGCCTGTGGC GGOCAACATC AGGAAGCITT T6TG1GCCAG ATTOTCACCA CTGAGGGAAG 1380 

TQATGGAOAQ GAGOGOCOGT CTT6CTCAGC GCAGCAGCCX: CCTAGOOCAQ GTAGCGCTGA 1440 

GGGTCOCOGC AfiOCICTGCT CCAAGCTG6C CT6CTGTGA6 TGCTTGCAAOC OTTTTAAGAA 1500 

GTACTTCTGC ATGGTCCCOG ATTGTGAOGO CAGTAOCTGG CAGCAGGGCC GCCTCTGCAT 1560 

CCATGTCTAC CGCCTGGATA ACTACTOGAQ AGTTGTTTTC CCAGTGACTT TCTTCTTCTT 1620 

75 CAATGTGCTC TACTGGCTTG TTTGCCTTAA CTTOTAGGTA CCAGCTGGTA CCCTGTGGGG 1680 

CAACCTCTCC AGTTCCGCAO GAGGTCCAAG CCCCTT60CA AGGGAGTTGO QGQAAAGCAG 1740 

CAGCAGCAGC AG6AG0GACT AOAGTTTITC CTOCCCCATT CCCCAAACAG AAOCTTGCAG 1800 

AGGGTTTQTC TTTGCTGCCC CTCTCCCCTA CCTGGCCCAT TCACTGAGTT TTCTCAGCAG 1860 

ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAAGQAGC ATCCX3T6ATQ 1920 

OO CTCAGTGTTC AAAACCACAG CCACTTAGTQ ATCAGCTCCC TAAAACCATG CXrTAAGTACA 1980 

GGOGGATTAG CTATCTTCCA ACAATGCTGA OCACCAGACA ATTACTGCAT TTTTCCAGAA 2040 

OCCCACTATT GOCTTTOCAG TQCTTTO GG C OC A tfri' C A' U U CCTCAGCCTC AAAGTGCACC 2100 

GACTAOTTGC TT Q CCT A TAC CTG6CACCTC ATTAA6ATGC TGGGCA6CAO TATAACAGGA 2160 

GGAAGAOATC OaVit XT X T GGTCAOATTA TTAIOTTCTC AOTTCTCTCT CCCT6CTACC 2220 
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CCTTTCTCTG 
AGAGAGCCTA 
GCTGGCTCCA 
TTTTQT G TGT 
6ACCCCT0TG 
AGGCCTTGGT 
ACTCGCCATT 
CCCTG6ACCC 
CAATCAAATT 
TGTGTCTGAT 
CATTTTGCAG 
GCTATCCAAG 
ACCATQATTC 
CTCTCTTTA6 
CCTG6CTGTA 
GTGAAATCTG 
TGTCACCATC 



CAGATAGATA 
TTTGGGACAG 
TCTTTCOTCT 
GATTATAGTA 
ACTCTTTCra 
GACTTCCTGQ 
GATTGGTGCC 
ATAAACCAGT 
CCCTTAAATT 
TGQAC3CTTCA 
ATGAAAACCC 
AGCCCACTGT 
TCTTCTGTCA 
CTCAATTTCT 
GT AACCC AOT 
TGTCTGTAAT 
ATCTOAAATO 



GACACTGGCA 
CATTCCTCTC 
GCACTACCAA 
ACT ACTC CCT 
TAACTTTCCC 
GGCCAA6AAA 
CACCCAGGGC 
CCACTGTTAT 
TGTATG6CAC 
TGATAQCCTT 
TGAGTCACAG 
CACCTTCTAG 
CCTCTGCTGG 
GGGOCIGAGO 
OQAATOAATT 
TT G TTG G GGG 
GQOAAATATO 



TTATCCCTTT 
TCTCTGCTGC 
TTCAATGCCC 
GCTTTATATO 
AGTGACTTCC 
CTAAGGAAAC 
ACACT6TCG0 
ACCOGGGGCA 
TGGAACTTTO 
QTGACATCTT 
ATTTCTGTGG 
ACCACAT6AT 
CACACCAGTO 
TQCTCAGACT 
TG6ACATGCC 
STQQATAGGG 
TAAATAAATA 



AGGAAGAGGG 
TGTGACATCT 
TTCATCCAAT 
OCAOCCTCTT 
CCTAGCCCPG 
TOGGCTTTGC 
AGTTCTATCA 
CTCTAAOCAT 
GCAAAGCACT 
TAGGGCAGGA 
GACTGTGQAT 
AGGGCTAGAC 
GCAAGGCCCA 
QCCCCCAAGA 
OCAATGCTTC 
TOQOQTCT C C 
TATCAGCAAA 



GGGGOCAGCA 
CCCTCTCCTT 
GGGTATCTAT 
CCTTCTCTTT 
ACCAGGCACT 
AACAGGCATT 
CTTGCTTGAC 
CACAATCAAT 
TTTGACAAGT 
TTCTTATCCC 
CTCACTGGAA 
AGCTCAGTTC 
GAATGGOGAC 
TGAAATCTCT 
TATATGCTAA 
ATCTACTTTT 
GC 



2280 
2340 
2400 

2460 
2520 
2560 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
31B0 
3232 



Seq ID NO I C199 DNA Sequence 

Nucleic Acid Accession fft 2IM_021990.1 

Coding sequence t 1309.. 2490 

1 11 21 31 41 51 

I I I I I I 

GCCAOAGGGT GAGCGGOGAC CTC0606CAG GTGGTGGOGC OGGTCTOOOC GGAAATCITTO 60 

TCCAAAQTTC TTCCAGTCCT CXTTAGGC^TC TTATTOATCC TCCAGTOQAO AACATGTATA 120 

CAGAGAAGTQ CTCAAATCAT AAGTGTACAG CTGATOAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCOGAA TOTOAOCAGG ACCTCAGAAG CCOCCTTTGT 240 

CACTOCCTCC CAGCAAAGGC AGCACTATOC GGACTTCTAA CACCATOGGT GAC7TTTCATA 300 

CCTTGGCAGA TGGCCTTTAA CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT 360 

TTTTCTTG6C TGTQQTGCAT GGCTGTGGAO CTCAGGGTGG ACTCCTGTTG GGCAGCCAGT 420 

TCCTGGATGG CTGTCTGTGG GTGGAGGACT CCTGCCTTTC CTGTTTAGAC ACCCACAAAG 480 

GCTGCTCTTT AGCXTTCCTTC CX:TTCATCCC CTTCCCCTGC CCCCAGTGCA ACGAGTATTA 540 

CACAACCAAC AAAAOOGCAA AATATTCGCA CAATrTTCTO OTCCTCTCTG GQAOAGTOQQ 600 

CTCTG6CTTT TCCTCTCAOC CCTGOCCCTC TOCCTGCTCC ICACTCCTGO TTGGT6CTG0 660 

TCAGGCTGAC TAGAGGCCAA GGCX3ACCAAC ACTAGGCAAA OGGGGCCAGC GCTCAGACAT 720 

AAATGCCCTC TTCATTTCAC GTGTAACATT CTTTTAAAAT CTAGGTCTTQ GTTTTGTTGA 780 

TTTTTTCTTA AATAAAAGAQ TOATCATAAA AGAGGGACAG CATAGAAAGT CXX:CAAAGAG 840 

CAGCAAGGTT TTAAAGAAAT TCACAAGCCT AATCTSraC TGTCTTATAA TTTGCTATTA 900 

CCAGTCACAA TTTAACTAGG TTTTGrTGTTG AAAACTTOTT Tl Wrn^SC r TCTGT CCCAA 960 

GAGGCACTAO CTGGGGCCCC TACAGAGTGC AGGGCAQAGC TTCATTTTTC GTTTGAATGT 1020 

TCTAGGGTCG AGGGACCTCA GACTGAATCA AAQAATGAAG CCTCTTCCOG TGATGTTGTC 1080 

TATGGCCCCC A6CXXX»GCC TCTQOAAAAT CAGCTCCTCT CTQAGQAAAC AAAGTCAACT 1140 

GAOACTGAGA CTGG6AGCAQ AQTTGGC3UA CTGCCAGAAO CCTCTOOCAT CCTGAACACT 1200 

ATCCTGAOTA ATTATOACCA CaAACTOOQC CCTGGCATTO GAGAGAAQCC CACTGTGeTC 1260 

ACTGTTGAGA TCTCCGTCAA CAGCCTTGGT CCTCTCTCTA TCCTAGACAT GGAATACACC 1320 

ATTGACATCA TCTTCTCCCA QACCTGGTAC GAOGAACGCC TCTGTTACAA CGACACCTTT 1380 

GAGTCTCTTG TTCTQAATGO CAATGTGGTG AGCX3U3CTAT GGATCCOQQA CACCTTTTTT 1440 

AGGAATTCTA AGAGGACGCA GOAOCATGAG ATCAOCATGC 0CAACCM3AT GGTC06CATC 1500 

TACAAGGATO GCAAGQTOTT OTACACAATT AOGATGACCA TTGATGCCXSG ATG CTCAC TC 1560 

CAC3VTGCTCA GATTTCCAAT GGATTCTCAC TCTTOOCCTC TATCTTTCTC TAGCTTTTCX: 1620 

TATCCTGAQA ATGAGATGAT CTACAAGTGQ GAAAATTTCA AGCTTGAAAT CAATGAGAAQ 1680 

AACroCTGGA AGCTCTTCCA GTTTGATTTT ACAG6AGTGA GCAAC3UUAC TGAAAT AATC 1740 

ACAACOCCAO TTGGTGACTT CATGGTGATO AGQATTTTCT TCAATGXGAG CAG6C6GTTT 1800 

6GCTATGTT6 CCTTTCAAAA CTATGTCCCT TCTTCOGTGA CCACGATGCT CTCXriGGCTT 1860 

TCCTTTTGQA TCAAQACAGA QTCTGCTCCA GCOOGGACCT CTCTAGGGAT CACCTCTGTT 1920 

CTQACCATGA CCACXTrrGGG CACCTTTTCT OGTAAGAATT TCCCGOQTOT CTCCTATATC 1980 

ACAGCCTTQG ATTTCTATAT OGCCATCTGC TTCGTCTTCT GCTTCTGOGC TCTGTTGGAG 2040 

TTTGCTGTGC TCAACTTOCT OKTCTACAAC CAGACAAAAG CCXATGCTTC TCCTA AACTC 2100 

CGCCATCCTC GTATCAATA6 COGTGCCXaT GCCOGTACCC GTGCAOGTTC C06AGCCTGT 2160 

GCCOGCCAAC ATCAOGAAOC TTTTGTOTGC CAOATTOTCA CCACTGAGGG AAGTOATGGA 2220 

GA6GAQCGCC CQTCTTGCTC AGCCCAGCAG CCCCCTAGCC CAGGTAGCCC TQAGGGTCCC 2280 

CGCAGCCTCT OCTOCAAGCT GGCCT6CTGT GAOTGGTGCA AGCX3TTTTAA GAAGTACTTC 2340 

TGCATGGTCC CGGATTGTGA GGGCAGTACC TGGCAGCAGG GOOG OCTCTG CATCGATGIC 2400 

TACaSCCTGG ATAACTACTC GAQAOTTOTT TTOOCStfJIGA CTTTCTTCTT CTTCAATGTG 2460 

CTCTACTGGC TTGTTTGCCT TAACTTGTAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 2520 

TCCAGTTCCC CAGGAGGTCC AAGCCCCTTQ CCAAOGGAQT TGGGGOAAAO CAGCAG CAGC 2580 

AQCAGGAOOO ACTAGAOTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640 

GTCTTTGCTG CCCCTCTCCC CTACCTGQCC CATTCACTGA GTmCTCAQ GA8ACCATTT 2700 

CAAATTATTA ATAAATGGGC CACCTCCCTC TTCITCAAGO AOCATCOGTG ATGCTCAGTQ 2760 

TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGC3GGAT 2820 

TAGCTATCTT CCAACAATQC TGACCACCAG ACAATTACTG CATTTTTCXai GAAOCCCACT 2880 

ATTGCCTTTG CAGTQCTTTC GGCCCAGTTC TG6CCTCAGC CTCAAAGTGC ACOQACTAGT 2940 

TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAOTATAACA GGAO QAAQ AO 3000 

ATCCCTCTCC TTTGOTCAQA TTATTATGTT CTCAOTTCTC TCTCCCTGCT ACCCCTTTCT 3060 

CTGCAGATAG ATAGACACTG GCATTATCCC TTTAGGAAGA GGGGGGGGCA GCAAGAGAGC 3120 

CTATTTGGGA CAGCATTCCT CTCTCTCTGC TOCTOTGACA TCTCCCTCTC CTTGCTGGCT 3180 

CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TAlT m'G'rG 3240 

TGTGATTATA GTAACTACTC CCT6CTTTAT AT6CCA0CCT CTTCCTTCTC TTTOACCCCT 3300 

GTQACrCTTT CTOTAACTTT 0CCA6TGACT TCCCCTAGCC CTXSAOCAGGC ACTAGGCCTT 3360 

GGTGACTTCC TGGGGCCAAO AAACTAAGGA AACTCGQCTT TGCAACAGGC ATTACTOOCC 3420 

ATTGATTGGT QCCCACCCAG GGCACACTGT CXK3AGTTCTA TCACTTGCTT GACCCCTGQA 3480 

CCCATAAACC AQTCCACTGT TATACCOGGG GCACTCTAAC C ATCACA ATC AATCAATCAA 3540 

ATrCOCTTAA ATTTOTATQG CACTGGAACT TTGGCAAAGC ACTTTTOACA AGTTOTGTCT 3600 
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75 



GATTGGAGCT TCATGATAOC CTTGTGACRT CTTTAGGGCA GGATTCTTAT CCOCATTTTO 3660 

CAGATGAAAA CCCTGAQTCA CAGATTTCTG TGGGACTCTO GATCTCACTG GAAGCTATCC 3720 

AAOAGCCCAC TGTCACCTTC TA6ACCACAT C5ATAGGGCTA GACAGCTCAG TTCACCATGA 3780 

TTCTCTTCTQ TC A CCT C T G C TGGCACACCA GTGGCAAGOC CCAOAATGGC QACCTCTCTT 3840 

TAQCTCAATT TCTGG6CCT6 AOOTQCTCAO ACTCCCCCCA AGATCAAATC TCTCCTGGCT 3900 

GTAGTAACCC AGTGGAATGA ATTTGGAC3VT GCCCCAATGC TTCTATATGC TAAC5TGAAAT 3960 

CTQTOTCTGT AATTTGTTGG GGGGTGGATA GQ0T6GG6TC TCCATCTACT TTTTOTCACC 4030 

ATCATCTGAA ATGGOQAAAT ATCTTAAATAA ATATATCAGC AAAGC 4065 

Seq ID NOt C200 DMA Sequence 
Nucleic Acid Accession «: NM_021B19.1 
Coding sequence! 39.. 1619 



1 11 21 31 41 51 

IS 1 I I i I I 

TAGGGCXX3TC GCQTAATTOG GCACGAGGGC 6CTCACGTAT GCCGGCGGTC AGTGGTCXaUS 60 

GTCCCTTATT CTGCCTTCTC CTCCTGCTCC TGGACCCCCA CAGCCCTGAO AOGGGGTGTC 120 

CTCCTCTAOG CAGGTTTGAO TACAAGCTCA GCTTCAAAGG CCXIAAGGCTG GCATTGCCTO 180 

^ GG6CTGGAAT ACCCTTCTG6 AGCCATGATG GAGAOQCCAT CCTGQOOCTG GAGGAAGTGC 240 

20 GGCTQAOSCC ATCCATOAGG AACGOOAGTG GCX3CO0TOT G GAGCAG66CC TCTGTCCX3CT 300 

t TCTCT G CCT G GQAAGTAGAO GT6CAAATGA GGGT6A0GGG ACTGGGGOGC OGGGGAGCCC 360 

ACGGCATOaC CQTGTGOTAC ACCXX3GGQCA GQOGCCATQT AGGCTCTGTC CTTGGGGGGC 420 

TQGCTTCGTG GGACX3GCATC GGGATCTTCT TTOACTCTCC GGCAGAGGAT ACTCAGGACA 480 

^ GTCCTGCCAT CGGTGTGCIG OCCA60GAGO GQCACATOCC CTCTQAOCAG CCT6GGGAT6 540 

25 GAGCTAGCCA AGG G CTGGGC TCCTGTCATT 06GACTTCCG GAAC0G6CCA CACCCCTTCA 600 

GAGCAGGGAT CACCTACTGG GGGGAGAGGC T6CX3CATQTC CTTGAACAGT GGCCTCACTC 660 

CCAGTGATCC AGGTGAGTTC TGTOTGGATG TGGGGCCCCT GCTTTrGGTC CCTGGAGGTT 720 

TCTTT6GGGT CTCAGCAGCC ACCX3GCACCC TGGCAGATGA TCATGATGTC CTOTCCTTCC 780 

TGACCTTCAG CCTGAOTOAG CCCAGOCCAO AG OTTCQOCC TCAGCCCTTC CTGQAGATGC 840 

30 AGCAGCTCOB CCTGGOGA66 GAGCTGGAAO GGCTGTGG6C AAGGCTGGGC TTGG6CACCA 900 

GGGAGQATGT AACTCCAAAA TCAGACTCTG AAGCTCAAGG AGAAGGGGAA AGGCTCTTTG 960 

ACCTGGAGGA GACGCTGGGC AGACACOGCC GGATCCTGCA GGCTCTQCGG GGTCTCTCCA 1020 

A6CAGCTGGC CCA6GCTGA0 AGACAAT6GA AOAAOCAGCT GGGGCCCCCA OGGCAAiQCCA 1080 

- . GGCCTGACX30 AOGCTGGGCC CTOGATQCTT CCT6CCAGAT TCCATCCACC Ca\66GAC3GG 1140 

35 GTGGCCACCT CrCCATGTCA CTCAATAAOG ACTCTGCCAA GOT O G O T Q CC CTGCTCCATG 1200 

GACAQTGQAC TCTQCTCCAO GCCCTGCAAG AQATQAQaOA TGCAGCTGTC CGCATGGCTG 1260 

CAQAAGCCXaV GGTCTCCTAC CTGCCTGTGG GCATTGAGCA TCATTTCTTA GAGCTGGACC 1320 

ACATGCTGGO CCrCCTQCAG GAGGAGCTTC GGGGCCCGGC GAAGGCAGCA GCCAAGGCCX: 1380 

CGOGCOCACC TGGCCAGCCC CCAAGGGOCT OCTGQTGCCT GCAGOCTGGC ATCTTCCTGT 1440 

40 TCTACCTCCT CATTCAGAGT GTAG6CTTCT TOGQGTACXvT GCACTTGAlQO CAGQAOCTGA 1500 

ACAAGAOCCT TCAGGAGTQT CTGTCCACAG GCAGCCTTCC TCTGGGTCCT GCACCACACA 1560 

CCCCCAGGGC CCTGGGGATT CTGAGGAGGC AGCCTCTCCC TGCCAGCATG CCTQCCTGAC 1620 

CCACCTCAGA GCCTGCTTTG CATCACTGGG AAOCAGGCAO TGTCTTGGGT GGGGGCTTGO 1680 

TCAOTATCCT CTOOQTCTGG GTGGOCAGCT CCCAGGGACA CCTGA6CTTT GBGCATGCIC 1740 

45 CCACCTOGTT AAAGGT6ATT TCCCTCTOUl AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1800 

A 1801 

Seq ZD NOi C201 DNA Sequence 
Nucleic Acid AccesBion #t XM_117036.1 
50 Coding sequence i 25.. 495 *" 

1' 11 21 31 41 51 

1 I I I I I 

AOCCAAGAGA Q06GGGACAG ACAGATGGAA A OAOSQ ACAC qGGQ AQCTC T aaaOAOCAOQ 60 
55 OOQCCCCCAC CCCCACTTCC AGCTCT6A0A CATTTGTGCA CAGGOCTGCA GG0G6CTGGG 120 
ATGGCCTG6C CGGGCACGCT CTGGA6ACAC ACATGCCAAG GCAGGGCCKG GGCAGCOGAG 180 
GGCCOSTGGG GGCTGTTCCG ACCACACAGG TGCCCGAGGG AGGCAGGGCA GGCCCCGGTG 240 
GGACCCAGGC CCGAAAGGCA GGGTGTC6CC CACGTTTGCA GGOGGGCTOS GGTGT06GTG 300 
GATGA60GG6 AACCGGGGGG CQQOQ O CT A C GCAATCCAOQ TTA0GCGQC9 OTGGAAGGGG 360 
50 TGTCACOGOC ACAGCGGCOQ CACOOTGO G T GGCTC l tSTTT CCTGGAAAOG TCCAGAACAO 420 
GCAGCCXXXS AGACA6GQA0 GGGCCCAGGG GTOGCCAGGQ GAAGTGGGGA TGGGAATQAG 480 
TGCGGATGGG GCTGAGGTTC TTCCTGAAGG ATGAATGCTC TGAAACTGTG AATAGACACA 540 
GGGAACTGT6 GA06TTCAC39 GGTGAACGST GTGGGGAGTG AATCCATCTC AACAGAGCTG 600 
TTACACAGIO AACAA6AGCA CGANGOGTOG A6GCCNCTT0 TCTGCTCCTO CAGATCGAfiC 660 
G6QCT6CTCA 6A6CCGGGAG ATGGCCACTC AGCTCCITTA ACTG 704 



Seq ZD NOt C202 DNA Sequence 
Nucleic Acid Accession #i XM_167803.2 
Coding sequences 1162.. 1488 



1 11 21 31 41 51 

i 1 I I I I 

AACATCATAC ATAGTAGGTG AATC6TTTTG TAGAQTQAAG AAT6CTAAT6 TAAAGCAAAT 60 

AGTCACOCAC GTTCCTTGTA AATCCAAAT6 TTTCTATATT GTAGCTTTGC TTAAAAT6GG 120 

QT06GCCCCA ACTGCATCCT CCTCnTGGC 66GCTG0Q0A GOGGCGCCCA GCOGGGACGG 160 

GAGGQCAGOQ ACCCCGA6GC CTCGTGGACG TGGGAGAGAG TGTGGTGG6A AGTCTTGAGC 240 

GGAGGAGGGG ATCTGCCCTT CTCCACTCCT CTCTTGGATC OGCCTCGGTT TCCTGTCCCC 300 

CCACCACCCG CCTGCCCCGC GQAAGACOGC CCAGTQA8CC AGCCCCCACC TTCCAGGCGC 360 

fyf. CTTGGCCCTG GGQATCCAAC GAACTTGTAT C6AGTGGQGG GGGCAACOGC TCOCCATTTT 420 

OO TCCCGAGCCC CQCCCACAOA O CTCTTAGCC AATOCTATGC AGAGAGCATC TCCT GGC AGG 480 

GGTCTCTTCC CAACCAGACC CCACCCAGGC ACATTAGCGA CCAGGCTTGO GCTTCCCCAG 540 

OGCCCCACCA CCACCACGTO CAGGTGGAGC TCTGGGATGC TATGTTGG60 CGGCAAGOGG 600 

TGGOCCGAGO GCCOGGTAGG CTAGCAC6G0 AGGTAAGGGT GGTATGGGAT GGGG06GGGG 660 

OGGTCTAGGG CAATAGGAGA GCAGAGAATQ GGGQAACITG AQQGTOGGGG GAGGGCACGG 720 
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GAGCCTTGCC 
CCOCATCTGT 
CACCATCACC 
AOTOTAATTA 
AGCTAGCTGA 
GGCTTCGTCT 
TTTCTTGTGA 
6CACCT6C0C 

ccAGGaoaac 

GAGTGCAAGT 
GCAGCGAAGA 
ACAAAATTTT 
TATTTTTGTG 
05TGAAAAAQ 



ACaVTCCCAG 
AAAATQATGG 
TQACCTCbGG 
COQAATOCTQ 
GGCGAAG6CA 
GCCATCTTTG 
CCATCTCCCC 
TCTGGCACCA 
TGCCCGGAGC 
GTTTTATAAA 
GAAATGAAGA 
TCCACCACAC 
GGAATGGGCC 
T 



GACTTTGGGC 
TAATAATACT 
GTCAAGGCAG 
ACTGCAAG6C 
GGCOOVTCOG 
TGGGTGCCCT 
ATGAAAGTGC 
GAT6CCAG60 
CGCAGQGGTQ 
AAATAAAACA 
AAAACTGAAO 
ATACCCTCTA 
CTGCTTTTTG 



AAGTCACCOG 
TCACCTACCT 
GAGGACTC06 
CCACCTGCCC 
TCTCTTCACT 
AGACTTAQTC 
TGTACAAATT 
AAGGGACAOA 
GGGATTGGTG 
AAAACCCACC 
AAAAAAAAAA 
A6CCAGCAAG 
TGGCAAGGCC 



CACTCCCTGQ 
CATA6GGGAG 
AAGGT6CTAC 
CTCCOCCACA 
CTGTCGCA6G 
CTTATCTTOT 
CCACCCGCCC 
G6AAAAGAGC 
6CCACTGTTT 
ATCACAAAAA 
AACAGC3AAAA 
ATTTCCTCTT 
TGTTCTGATT 



GCCraSCSTTT 
GTTGTGAGGC 
CCGTQAGCAA 
GAGCCTCCAG 
CCCTTTCATG 
CCTGGTTTCC 
CAGGACCCCC 
CACAAACAAG 
GTAT6TTCTT 
AAAAAATTTT 
AAAGAACCAT 
TGCAAAATCA 
AATAAAGGAT 



Seq ID MO 2 C203 DNA Sequence 

Nucleic Acid Accession tft IIM_0247B0.1 

Coding sequence i 31.. 102 3 



1 
I 

ATTAATCTGQ 
CGAGGGCACG 
GGCATTCTTT 
ACCCrCATTG 
AATTCCTTCC 
GGCCTTCAGG 
GTGTGGATTG 
ATCATGTTCT 
TGGOGGOCCT 
GGGGTCTTGT 
CCTTTTCGAQ 
ACAOGGCCTG 
TTCTTTTTCA 
QAGGQAAOGA 
AAAATGTTCC 
AGCTCACTTG 
CAT6ATGGCA 
TGATGACTCT 
A TTTCTTC CA 
CAGGAGOGGA 
TTTAATACCT 
GTTQTAATAT 
ATTATCTTCA 
TCCCASACCT 
CATGTAATAC 
ACTTTGAACr 
CTGAACTTTC 
GCCCTGIGAO 
AGTACOGTAO 
ATACOCCACC 
GAlSACTGAAT 



I 

CCGTQCCATG 
AAGTCTACGT 
GTTACTATTG 
GCCAGGACAT 
TGGGGGAGTT 
AGTTT6ACAT 
GCATCTTCTT 
ACTCCAAAAA 
CACAOATGAT 
GCACCCTGQC 
GTCTGCCTCT 
GCTACCTGTG 
TCCTGACCCT 
AGATTATOAT 
TGATAGAAAA 
TTCTQQAAAO 
GTCTTGACTT 
TTTQQTAAGC 
TGCCACCTGT 
AACTGACTAC 
QGATTGCTGA 
TGAAATGAGC 
AGAAGTGTGT 
66CAAAGGTT 
TCCAGCAGGG 
AAAGACATCC 
TTGCGTACTG 
OTGTTATACA 
CCAATGAGAT 
GTaATQACTT 
AAO 



21 

1 

CATCTACTCX: 
TCTCCTGATC 
GCTCAACACC 
CTACCX3QCTC 
TCTGAGGAGA 
TGCCAGGAAC 
CTGCCCCCTG 
TATCAGCCTG 
QACTTTCTTC 
CATCACCATC 
CTTCATTCAC 
6GTTGTTTGG 
CATTGTGCTA 
AAGOCTGCTC 
ATTGATCAAG 
GAGAQAGOTQ 
6C6ATCTAGA 
AGACACCAAT 
GCCTTTAGGA 
CATGTAATTA 
TTTTTCRAGA 
CTACAAAAAC 
GCAOOAATGA 
TAGAAACTGT 
ATATGAAGCC 
CCCATGTCCC 
ATTACATGAG 
TGACCATCAA 
TTATCGGA6A 
GAAATA7AAT 



31 
I 

ATGTTCAOGC 
COAAAC ATCT 
GTGGCCCTGT 
CTTCTGATGG 
ATCATTGGGA 
GTrCTAOAAC 
CTGCCCTTTA 
ATGATGAATT 
ATCTTCTTGC 
TGGAGATTGA 
TCCATCTACA 
ATCTATCGGA 
ATCATCACCT 
CATGAGCAGA 
CTGCAGGATA 
QAGCAACAAG 
AGATCABTTC 
CAAATAAGGG 
ACT6CCCAGA 
TCAAAGTAAA 
CAAAATACTT 
CTAGQAAGAG 
TTGGTTCTTA 
TGCTAA6AAA 
TCTGAATTGT 
AAAGGTGGAA 
TCTTTGGAGT 
AGTCCTACGT 
060GATTATT 
CSVGOGCTGGC 



41 

I 

TTGTGGAGAG 
TTTTGAAAAT 
CT6QTGAAGA 
ATTTTOTQTT 
T6CAACTGAT 
TGATCTATQC 
TCCAAATGAT 
TCCAGOCTCC 
TCTTTTTCCC 
AGCCTTCAGC 
GCTGGATOQA 
ACCTCATTG6 
ATCTTTACTG 
TCATTAATGA 
TGGAQAAOAA 
GCTTTTTGCA 
AAGAAGGTAA 
GAGGAQAGQA 
AGAAAATCCA 
ATTGGGCATT 
GGGGTTTTCC 
ATAACTAGG6 
GAAATCTCTC 
AGTGGTCCAT 
AGAACCTGCA 
TACAACCAGA 
OGGGGATGGA 
CAAGCTAGCT 
GCTAATTG^ 
AATTTTTGAC 



51 
I 

GTACGAQATO 
ATCAATCATT 
GTGTTGGGAA 
CTCTTTAQTC 
CACAAGTCTT 
ACAAACTCTQ 
TATGCTTTTC 
QAOCAAAGCC 
ATCCTTCACC 
TGACT6TGGC 
CACCCTAAGT 
AA6TGTGCAC 
GCAGATCACA 
GGGCAAAGAT 
AGCAAACCCC 
TTTGGGGGAA 
TCCAAGG6CC 
AAATGOAATO 
AGGCTTTAGC 
CCATGCTATT 
AATAAAGATT 
AATAATGTAT 
CTGCCAGACT 
CCTGAATAAA 
TTTATTTGTG 
GGTCTCATCT 
GGA6GTTCTG 
TTGCAGTGGC 
AATTTTOCCA 
AOTCTCTAOG 



Seq ID NO: C204 Protein Sequence 
Protein Accession #i Eos sequence 



1 
I 

AGOGGGGTTA 
ATCCTGGCAT 
CCCT60GAGT 
CAGTGGAGAA 
TCATCATACA 
TTTTTCAAGA 
TTQATGTGAT 
TGCGGTTTTC 
TQGOCATAGA 
CCCTTATACC 
TQQAAQAGAA 
GTATCCTTCA 
GATCCAAGAA 
AGATCATTGG 
TTTTGAAGTT 
TTACOGTGGC 
GTTATTTTAG 
GGAACA606G 
TQACCACCTG 
TTAATCCCCA 
TCACTCATGA 
ACCTGTCAQA 
TCTCTGCCTA 
CCGTTTATTA 
CGGTGCTQTT 
OCATGTTCAO 



11 
I 

GATOCAGGGA 
TTTAAAAATA 
CCCAATCAAT 
GAAGGCTTGC 
GGTTGA6AAT 
AAAG6T6CT6 
AGACTCTCAO 
AACATCTTTG 
CACTCXTOGT 
CAT6AC3^TC!C 
AAOQAAOCTT 
GCTCAATTGC 
CAGCCTQTCQ 
AGGCAAGTTT 
CAACATTTTC 
CAAAAAGAAC 
GGACACAGTQ 
GGCATOCTAC 
CTTCTTCAGT 
CATTTACTCC 
AAAAGCTOTO 
GCTCGGTa\6 
CATGGTAGOC 
CCTGGCTGAG 
ACTGCCTTTC 
6CTTGTQGA0 



21 

I 

GTTATGTTGT 
ATGTGAATGG 
GCGQGTGTGA 
AGGAGGCAGG 
GTTTCCTCTG 
CTAGACTCAA 
ACAOTCAGCSl 
AATQAGTOGA 
TCTTCACATG 
AOAGACAGAA 
AGQAAAATAG 
TGTATTCAQT 
GAAATTCTGA 
GGAACCAOCG 
TCATTCATCC 
ACGCTCCAGT 
AtYSTACTATG 
AACATGCAGC 
TTGCTGTTCA 
GGAGGGATCA 
AAGCTAAAAC 
6AGAATTCCA 
TGGGTTGTCT 
TACAACTTAG 
GTTOTGTCCT 
AGGTAOQAOA 



31 

I 

GATTTGTGTA 
TGTATCCCTT 
CTGCTGTATG 
AGATGCTGTC 
GGGTCCAAA6 
GCATCAACAT 
AAAGGAATGA 
TGTCTCAGAC 
AAACTGTTCA 
TTAAAGCCAT 
TTGAGAAAGA 
GTCTGAACTC 
ATTCCATCAG 
TCCTCTCCTA 
TGAACTTCAG 
TCACTGQGCT 
6CTTTTACAC 
TGGCXrrACAT 
GCATGGCCAA 
CXAAGCTGAT 
ABAA6AATCT 
AGTTGAOGTT 
CTACAGGAGT 
AGTTCCTGAA 
GCATTAATCT 
T6CCA06GCA 



41 
I 

GOGGTGGCTT 
TGTOATCATC 
AAGTCTCAGG 
OGATGACCAC 
CCACXX3VTCC 
GOTTTTOTCA 
CCAAAAGGGT 
CCTTCATAGC 
AGGAGAGAAG 
CAGGAACCAO 
AAAAAQCAAA 
CATTTCOOGG 
CCTGTGGCAQ 
TTTCAACTTT 
CTTGATCATA 
GGAGTTTTTC 
CAATTCCACC 
CTTCACAATC 
GTATTTCCGG 
CTTTTGCTGG 
TAGCACTQAG 
CAATCAGCTG 
GOCCATAGCC 
GACACACAGT 
GGCCGTGOCA 
CX3AAGTCTAC 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1511 



51 
I 

TGQTGAGCTC 
ATCACTTTTT 
AAGCCCACCC 
GTGAATGAAA 
TCAAATCAGA 
ATATCTGACA 
AACCA6GTGC 
CTAGAAT6CA 
TTAATCGCAT 
CCAAGGACCA 
CAGACCGATC 
GCTTAT066A 
AAGAOGCTGA 
CTGAGATGGC 
ATCCCTCABT 
ACTGGGGTGQ 
ATCCAGCAGG 
OGAGCATQCT 
AACAACTTCA 
GACTTCACTO 
ATAAGGGAGA 
CT GACCO OCT 
TGCTGTGCAG 
AACCCTGGGQ 
TGCATCTACT 
GTTCTCCTQA 



€0 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

640 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1813 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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TOOGAAACAT 
COGTGGCCCT 
TCCTTCTGAT 
QAATCATTGG 
AOQTTCTAGA 

TGATQATGAA 
TCATCTTCTT 
TCTG6AGATT 
ACTCCATCTA 
GGATCTATOG 
TAATCATCAC 
TCCATQAGCA 
AGCTGCAGGA 
T66AGCAACA 
GAAGATCAGT 
ATCAAATAAG 
OAACTGCCCA 
TATCAAAGTA 
QACAAAATAC 
ACCTAGGAAO 
GATTGGTTCT 
GTTGCTAAOA 
CCTCTGAATT 
CCAAAGtSTGQ 
AGTCTTTGGA 
AAAGTCCTAC 
GACGOGATTA 
ATCAGGQCrS 



CTTTTTGAAA 
GTCTGGTGAA 
GQATTTTGTG 
GATGCAACTG 
ACTGATCTAT 
TATCCAAATG 
TTTCCAGCCT 
GCTCTTTTTC 
GAAGCCTTCA 
CAGCTGGATC 
GAACCTCATT 
CTATCTTTAC 
GATCATTAAT 
TAT GGAGAAG 
AGGCTTTTTG 
TCAAGAAGGT 
GGGAGGAGAY 
GAAGAAAATC 
AAATTGG6CA 

r itiuou rrrr 

AGATAACTAQ 
TAGAAATCTC 
AAA6TGGTCC 
6TAGAACCTG 
AATACAACCA 
OTOGGGQATG 
OTCAAGCTAQ 
TT6CTAATTG 
GCAATTTTTO 



ATATCAATCA 
GAGTGTTGGQ 
TTCTCTTTAG 
ATCACAAGTC 
GCACAAACTC 
ATTATXJCTIT 
CCX3AGCAAAG 
CCATCCTTCA 
GCTGACT6T0 
GACACCCTAA 
GGAAGTGTGC 
TGGCAGATCA 
GAGQGCAAAO 
AAAGOUUVCC 
CATTTOGGGG 
AATCCAAGGG 
GAAAATGGAA 
CAAGGCTTTA 
TTCCATGCTA 
GCAATAAAOA 
GGAATAAtGT 
TCCTGCCAGA 
ATCCTGAATA 
CATTTATTTG 
GAGQTCTCAT 
GAGGAGGTTC 
CTTTQCAOTG 
GAAATTTTCC 
ACAOTCTCTA 



TTGGCATTCr 
AAACOC TCAT 
TCAATTCCTT 
TTGGCCTTCA 
TGGTGTQQAT 
TCATCATGTT 
CCTGGCGGGC 
CCGGGGTCTT 
GCCCTTTTOO 
GTACAOGGCC 
ACTTCTTTTT 
CAGAGGGAAG 
ATAAAATGTT 
CCAGCTCACT 
AACATQATGG 
CCTQATGACT 
TGATTTCTTC 
GCCAGGAGCG 
TTTTTAATAC 
TTOTTOTAAT 
ATATTATCTT 
CTTCCCAQAC 
AACATGTAAT 
T6ACTTTGAA 
CTCTGAACTT 
TGCCCCTGTG 
GCAGTACCGT 
CAATACCCCA 
OGQAOACTGA 



TTGTTACTAT 
TGGCCAGGAC 
CCTGGGOGAO 
GGAGTTTGAC 
TGGCATCTTC 
CTACTCCAAA 
CTCACAGATG 
OTGCACCCTG 
AGGTCTGCCT 
TQGCTACCT6 
CATCCTCACC 
GAAGATTATQ 
CCTGATAGAA 
TGTTCTGGAA 
CAGTCTTGAC 
CTTTTGGTAA 
CATGCCACCT 
GAAACXGACT 
CTGGATTGCT 
ATTOAAAWRA 
CAAQAAGTGT 
CTGGCAAAGG 
ACTCCAGCAG 
CTAAA QACAT 
TCTTGOGTAC 
AGGTGTTATA 
AGCCAATX3AG 
C0QTGAT6AC 
ATAAG 



TG6CTCAACA 
ATCTAOOGGC 
TTTCIOA66A 
ATTGCCAGGA 
TTCTGCOCCC 
AATATCAGCC 
ATGACTTTCT 
GCCATCACCA 
CTCTTCATTC 
T6GGTTGTTT 
CTCATTGTGC 
ATAAGGCTGC 
AAATTGATCA 
AGGAGAGAGG 
TTGOQATCTA 
CCAGACACCA 
GTGGCTTTAG 
ACCATGTAAT 
GATTTTTCAA 
RMMWAMAAAA 
GTX3CA6GAAT 
TTTAGAAACT 
GGATAT6AAG 
CCCCCATOTC 
TGATTACAT6 
CATQACCATC 
ATTTATCCX5A 
TT6AAATATA 



Seq ID KOi C205 DKA Sequence 

Nucleic Acid Accession MM_002350.1 

Coding sequence: 397*. 1680 



1 
1 

GTCCTTCGGT 
TGT6ACAAAG 

GCCTOCCCCT 
CCCAGAGGGC 
GCTGAOSACC 
GCAGGAAGCT 
CTG GQGOCC T 
GCACTGGTGC 
TTCGGGGGGT 
ACCTTCTTAC 
ACOQACAACO 
GT0CT6QAGC 
CAGGATTTAG 
GAAGOGCTGC 
CTGCGCAGCG' 
OGCTTCCGCC 
CTGG6CCTCA 
CAOGCTOTTA 
CTGACCATCO 
TGCACTGQAG 
CT6GA6TTTA 
AAAGAGATGA 
ACTCGCAG6A 
AACXjGGTTCC 
GTGGACATCT 
CACC3GGGCCC 
CZGCTTAGCA 
CT6GACCCAC 
CTGCCACTCC 
TGGCTTGGAG 
G6CCTTGGCC 
ATCAAAAACA 
GCGACTGQAO 
GATTCTGGGA 
GGGGCTGGAG 
CAOAQAGCAC 
AATAAATQTT 



11 



CCCATCGTGT 
CGCAAGACAC 
ACCCTG6GGC 
ACAGGGCTCT 
TGCAAGCCAC 
GGCT6AGCCC 
TGA6A06C00 
T6GCAGGAAC 
GCTCX3TGGGC 
TCCTCTGCCT 
GGCTGOGGGA 
TOGTGGTQTO 
GG6G60CX3CT 
TGTCCCTGGC 
GCGTCCTGCT 
ACTGGTTOGT 
C6GTTGGCCT 
ATQGCACTQO 
GCTATGGTGA 
TCATGG6TGT 
ACAAGGCAGA 
AGQAGTCOGG 
A6GAGTCTCA 
GCCAGGTGCO 
CCAAGATGCA 
TGGAGAAACA 
CTQCOCTGGG 
GAGGA6GAAC 
TQACCXAGCC 
TGGGTTGGCT 
CAGCTGAG6C 
CTCTCACTAT 
GCAGGACTCC 
GQCTTCAGTT 
GTGGCGCCCC 
CTGGTGGGGA 
AAABCCAO 




31 



41 

I 



GGTGTGTGTC ACAAA6TACA 
GTCTTTATCC AOQTGGATGG 
CTCCTTATTC TCAAAAGGGG 
CXTTCCGOGCC CTGCXTrGGCC 
AGAGCAAAAG TCCCAGAGCC 
CTGTGCGTGC TGCQAGGTGG 
GGGGCCATGG GOGGGGATCT 
CTGGAGCAGG AQAAGTCTCT 
CTCATGGTGC TGCAT6CAQA 
TTCCTGGTTA AATGCACGAT 
TTTCATGCCA AAGAGQTCCA 
G06CTGAC0G GGOGQCAGGC 
GCX36CGCXCG TG0QGG60CC 
CAGCCGTGGC OGGGATTOCT 
CGTCTCTACC TGOXGOCCCX5 
TACOGCAGCA T0GGC6CTCT 
TACATGAACA OGCACCXnXSG 
ACOGCCrOGG TQCTGTCOGT 
GACACACTTT GGCTGATCCC 
GGCACCATGT GGGGCAAGAT 
CTGCTGCACA GCCCTGCTGG TQGCCGTGGT 
GAAGCA06TG CACAACTTCA TGATGGATAT 
TGCC06AGTG CTACAAQAAO CCTGGATGTT 
TGCT60C06C AGGCATCAGC GCAAGCTGCT 
GCTGAAACAC GGGAAGCTCC GGGAACAAGT 
CATGATOCTO TATGACCTGC AGCAGAATCT 
GATTGACAOG CTGGCGGGGA AGCTGGATGC 
GCOGAGGCAO CTTCCAGAAC CCA6CCASCA 
CAGGCTACTT TCCCCAGTAC TGAGOTGGTO 
CTGAACAAAO CACCTCAAGT GCAAGGACCA 
TGCTGATQGC TGCTGGAGGG GACGCTGGCT 
CCCAGGTGGG AACATGOTGA CCOCCACTCT 
GCTGCIATGG AOGAOCTCCA GCmCAGTT 
TGG6TCCCTG 6GAAAGAGQG TACTAGGG6C 
ACCX3CTGGCC GAGCTOAAGA ACTGGGTATG 
CTGGTGGGAC AACAAAGAGG ACACCATTTT 
GGAAGAAGTG TAACTCACXIA GCCTCTGCTC 




GG6CCAAGG6 
CGCOGTGCTC 
CAATCAAGTC 
OOGCCTGCTG 
GGCOGAOAGG 
CATCACATTC 
GGTCTGCCTO 
GGCCC3GGAAG 
CCA6TATACC 
CTACAAACAT 
GGC06CCATC 
GAACTOCATG 
GAGCAGCTCA 
CCTGACTGAG 
GTCCAAOTAO 
GACATCGTCT 
AAGGGGGCGC 
AAAGTGGGTA 
GCATACCCTC 
ACAAGTGCAG 
COGGATCCAG 
AGOCTGGOGC 
TCCAQAGCTG 
TTATCTTTGT 



Seg ID NO: C206 DNA Sequence 

Nucleic Acid Accession «t 1IM_025257.1 

Coding sequence : i . . 213 9 



1620 
16B0 
1740 

IBOO 
1B60 
1920 
1960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2S80 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3285 



51 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2238 



1 11 21 31 41 

I I I I I 1 

ATGGGGGGAA AGCAQGGGGA CGAGGATGAC GAGGCCTACO GOAAOCCAGT CAAATAGQAC 
CCCTCCTTTC GAOGCCCCAT CAAGAACAGA AOCTGCACAO ATG TCATC TG CTGCGTCCTC 
TGATTCTAGG TTACAT08TQ GTGGQGATTG TGQOCTGOTT GTATGQAQAC 

1328 
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CCCCGGCAAG TCCTCTACCC CAGGAACTCT ACTCCSOGCCT ACTGTGGCAT GGGGGAGAAC 240 

AAAGATAAQC CX3TATCTCCT GTACTTCAAC ATCTTCAGCf GCATCCTGTC CAQCAACATC 300 

ATCTCAGTTG CTGAGAAOGG CCTACAGTQC CXXACACCCC AGO TOTOTGT GTCCTC CTOC 360 

COGGAGGACC CATGGACTOT GGOAAAAAAC QAGTTCTCAC AGACTGTTGG GGAAGTCTTC 430 

TATACAAAAA ACA6GAACTT TTGTCTGCCA GGOGTACCCT GGAATATQAC GGT6ATCACA 480 

AGOCTGCAAC AGGAACTCTQ CCCCAGTTTC CTCCTCCCCT CTGCTCCAGC TCTGGQACX3C 540 

TOCTTTCCAT GGACCAACAT TACTCCACCO 6CGCTCCCAQ GGATCACCAA TGACACCACC 600 

ATACAGCAGG GGATCAGOSG TCTTArTGAC AGCCTCAATQ CCCGAGACAT CAQTG TTAAG 660 

ATCTTTGAAG ATTTTOOCCA OrCCTGGTAT TGGATTCTTG TTGCCCTGGG GOT6GCTCIO 720 

GTCTTOAOCC TACTGTTTAT CTi - G C ri 'CTG OGCCTGGTGG CTGOOCCCCT GOTGCTGGTG 760 

CTGATCCTGG GAGTGCTGGG OgTGCTG G CA TATGOCATCT ACTACTGCTG GGAGGAGTAC 840 

COAOTGCTGC GGGACAAGGG COCCTCCATC TCCCAGCTGG GTTTCACCAC CAAC CTCAQ T 900 

GCCTACCAGA GOGTGCAGGA GACCTGGCTQ GCCGCCCTGA TCSTGTTGGC GGTGC TTGAA 960 

GCCATCCTGC TGCTGGTGCT CATCTTCCTO 0G6CAGGG6A TTG6TATTGC CATG6CX3CTC 1020 

CTGAAGGAGG CCAGCAAGGC TGTGGGACAO ATGATGTCTA CCATGTTCTA CCCAC TGGTC 1080 

ACCTTTGTCC TCCTCCTCAT CTGCATTGCC TACTGGGCCA TQACTGCTCT GTATCCTCTG 1140 

CCCACGCAGC CAGCCACTCT TGGATATGTG CTCTGGGCAT CCAAC ATCA O CTCCCCX3G0C 1200 

TGTGA6AAAG TGCCAATAAA TACATCATGC AACCCCACGG CCCACCTTGT GAA CTCCTO Q 1260 

TGCCCAGGGC TGATOTOOGT CTTCCAGGGC TACTCATCX3V AAGGCCTAAT CCAAGQTTCT 1320 

GTCTTCAATC TGCAAATCTA TGGGGTCCTG GGOCTCTTCT GOACCCTTAA CTGGGTACTG 1380 

GC C CT G GGCC AATQOQTGCT CXsCTQQAGCC TTTGCCTCCT TCTACTGGGC CTTCCACAAG 1440 

CCCCAG6ACA TCCCTACCTT CCCXTTTAATC TCTGCCTTCA TCCGCACACT CCCTTACCAC 1500 

ACTGGGTCAT TGGCATTTGG AOCCCTCATC CTOACCCTTG TGCA6ATAGC C066GTCATC 1560 

TTGGAGTATA TTGACCACAA GCTCAGAGGA GTGCAGAACC CTGTAGCCOO CTGCATCATQ 1620 

TGCTGTTTCA AGTQCTGCCT CTGGTGTCTG OAAAAATTTA TCAAGTTCCT AAACCGCAAT 1680 

GCATACATGA TGAT06CCAT CTACGGGAAQ AATTTCTGTG TCTCAGCCAA AAATGCX3TTC 1740 

AlKSCTACTCA TGOGAAACAT TGTCAGOGTQ GTCGTCCTGG ACAAAGTCAC AGACCTGCTG 1800 

CTGTTCTTTG GGAAGCTQCT OGTCQTCGGA GQCGTGGGGG TCCTGTCCTT CTTTTTTTTC 1860 

TCOGGTOGCA TCCOQGGGCT GGOTAAAGAC TTTAAOAGCC CCCACCTCAA CTATTACTGG 1920 

CTGCCCATCA TGACCTCCAT CCTGQGGGCC TATGTCATOG CCAGCGGCTT CTTCAGOGTT 1980 

TTGQGCATGT GTCnXSGACAC GCTCTTCCTC TGCTTCCTGQ AAGACCTQGA GCGGAACAAC 2040 

GQCTCCCTGG AOOGGCGCTA CTACATGTCC AA0A6CCTTC TAAAGATTCT GGGCAAGAAG 2100 

AAOGAGQOGC CCCCGQACAA CAAGAAGAGG AAGAAGTGAC AGCTCOGGCC CTGATCCAGG 2160 

ACTGCACCCC ACCXXCACCQ TCCAGCCATC CAACCTCyvCT TCGCCTTACA GGTCTC CATT 2220 

rrOTGOTAAA AAAAGOTTTT AGGCCAGGCG CCGTQGCTCA OGCCTGTAAT CCAACACTTT 2280 

GAGAOGCTGA GGCX3GG0GGA TCAGCTQAGT CMSQNSTSCQ AOACCAOCCT GQCCAACATG 2340 

GTGAAAC 2347 

Seq ZD NOt C207 DMA Sequence 

Nucleic Acid Acceaaion ft: MM_016180.1 

Coding sequence t 26.. 1618 



1 11 

1 I 
CAG6AAG0TT CCTCTCCCAG 
CTATAAATCC CTAGCTGATG 
CAGCAOACTC ATCATGCACA 
GGCAG06TAT GTGACCCCAG 
TQTOTGGTTC CTCAG(XCCA 
GQACCACTGC C66TCCAGGT 
GATQCTOQTO GGCATQGCTC 
TAACCCAAGO AGGAA6CT66 
TGATTTTGCr GCOQACTTCA 
CCATCAGGAC AAOGAGAAGG 
CXrTGGGTTAC CTTTTGGGTQ 
TACAGAATTC CAGGTCATGT 
TCATCT6TGC AGTATCTCTG 
OCAAACCCCT CAOOACCCTC 
GAAAGTTAAA AATGGTTAOG 
TCATGCT6AA CAGACTOQCA 
CATOGCTCCT CACTAC06GT 
6TCCAACAT0 CTGTTCTTGA 
TAOTGCACAC AACTCCACAG 
GGGCTTCTGC ATCAACTCOS 
ATCCTACATT GQATTAAAGQ 
G6GATTTATT G60CTCTTCC 
TGTAATGTCC AGCACCCTQT 
GGAAGAAAAG GA6AGGCAGC 
GGGCATGGAC TG06CCACCC 
TGGCCTGGGC TTTCTGGTCA 
TG0G6TGGCA CIGATAGQCT 
CAATAAAOA0 ACAATGACCC 




TOTACCTCAA 
TTT0C3O0CAT 
TTGATGQGCC 
GCCTCCACTA 
CTATAGACTQ 
TCTTCTTCTC 
AAGCCCCACT 
CATTGTCATC 
TAAATCCAGA 
6G6CAATQAC 
AOCrTTGCAT 
GAGATTTCAT 
AGTTTCTCAT 
TGTTTTOCTC 
OTCTTTACTT 
CGAATGTCTA 
ACACTGTGCC 
AGGCCCCAGG 
TCACATQCAT 
ACACAGCCX3G 
GTTGCTTTGT 
TAAAAAAAAA 



31 
I 

TAGCAACAGT 
TGACTCTGTO 
GTTGGGAAGA 
06TAGGTCTG 
CXTCCTGCAG 
GAGACCCTAC 
TGGGGGTACT 
AAGTGTCACC 
CATCftAAGCC 
CCATGCCCTC 
GGCCCATCTO 
T6CATT06TQ 
TACASAGGTT 
AGATGGAATG 
GCTGGCAATG 
ATTAAAGTCA 
CMCCACCSC 
GGOOCAGATT 
CTA06AAAGA 
ACTTTATTCT 
CACGGGATAT 
CTCCACX3CTO 
CTTTAACCTC 
AGGGGACCCA 
GGTGCAGCT6 
GAG06TTGTC 
OGCTCTCTTT 



41 

I 

GGGCAGGCTG 
GAOCOQCCTA 
GAGTTCTGCT 
CCCAGCAQCC 
CCCGTGGTOG 
ATCCTCACCC 
QTT6TAGCA0 
AIGATAGOTG 
TACTTATTTG 
TTCACAGGTT 
GAGCTGGGAA 
CTCACTTTGT 
aCSUUUSGGCA 
TACX3AGTAT6 
CAGGGAGCAA 
CTGCTOAGAG 
ATTOQATQGA 
GTOTACOGGQ 
GGAGTOGAGG 
TACTTTCAGA 
TTGCTGTTTG 
OTCCTffTQCA 
ATTACT6AGT 
GACAACA806 
GCTCAQATCC 
GTGGTGGTGA 
OTTAGATATS 



51 
I 

GCCGCCACAT 
AAAQACCCAC 
ACGCX3GTGGA 
TGTACAGCAT 
GATCGGCCAG 
TGGQAOTCAT 
CTTTGATTGC 
TCGTTCTCTT 
ATGTCTGCTC 
TTGGAGGTGC 
GACrOTTGGG 
GTTTTACTGT 
TTOCCCCACA 
GTTCTATOGA 
AAAACAAAAA 
CACTGGTGAA 
OGSCX^TTOCT 
G6GATCCCTA 
TTGGATGTTG 
AAGTTTTGGT 
GCCTGGQQAC 
6CCIGTTTG6 
ACCAOOSGQA 
TGAGAOGGAA 
TGGT0GGA6G 
TCACAGOGTC 
TGGATTAGGT 



Seq ID NOt C208 Z3NA Sequence 

Nucleic Acid Accession #: NM_003273.1 

Coding sequence i 255 . . 2024 



60 
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900 
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1 11 21 31 41 51 

i 11 I I ! 

OG006CGGGO COGGATCCTC GGCGGGGCOG AGTCCATCTC CTGGGAAATG GGGOGGACAG 
WrnHJCT iq ACXQACTATT GTGAGOGCCC TCT C TCTCCG GOBGAGGGGA 6ACCAT00CC 
GCCACTCAGG CCCCO GGG OC CG6CTGGAAT T0G6AGG6CC CCTQOQTAAT GaOGGAGAGA 
GATGOGACCT GOGGGAAAOG CTAAOOQAAO GAOAGCTGOA GGGGGTGAAC TAAQAOOGOG 
GGCGAGATCT QAGGATGQAA GGCTTTGGGG OTGTOGGAGG CAQAOGQACC CQGGQGTTTG 
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PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



CAGOGAAGGG TGTCTGQAGA GGGAGAGCTG ACGAGGC3GCC 
GGGQATTTAT GGTGTCGACT GGGAGCAGGA GGAGGGTCTT 
GACTAA6ATG GAGGCCTGGG AAGGQAACTO GGAGGCAGOG 
CmAGGAOQG GGTGOGQAGO C6CACTCT00 GAATGO CGAG 
GG006C0GTQ CGGGOOGGGG GGGAGCTGGO GG6CTAGG6G 
CTTCCCGCAG GCX3CCX5CGGC TCTGCTACTG CTGCTGCCOG 
CTGGCC5GCCC GTTCGGGCCC CGCGCGCCTG CTGGGTCCAC 
GAGGTGCTGT GQAGCCCAGG GGOGCTGCTO CTQTQGCT06 
G06CTGTACC TACTGCOSGC GCGCAAG6T0 GGG60CXX3GC 
GAAGCGAATG GGCTCGGCGA G6GAAAGGAC GCCCOGGGCC 
CCGCAOTGGC CGAGGGGCAG QAATTQAAG6 ACAAGAGTOG 
GCTTCCAGGC CCTGGTGCTG ACAGCCCTGT TGGTGG GGCT 
CTCTGGGGGC GCTCC0G6AA ATGCTGCTGC CCTT660GTT 
TCATCTTCAO ar i' C 'l"l' TC ' A 'C TAGATGAAGG CX3CA0GTAGC 
CTGGGGGGAA CTCA06CAAT COGATTTACG ACTTTTTTCT 
OTATCTQTTT CTTOGACTTC AAATATTTCT GTGAACTGOO 
TCCrCATCAA CCTGGCCCTG TTQATGAAGG AGGCAGAGCT 
CCATOTGGCT GGTCAATGGC TTGCAGTTGC TCTA06TGGQ 
AG600STCCT CACCACCATG OATATCACAC ATOACGGGTT 
GGGACATGGC CT GG G T GCXrC TTCACCTACA GCCTGCAiSGC 
CGCAGCCCCT GQGGTTGCCC ATGGCCTCTG TCATCTGCCT 
ACATCTTCCG TGGGGCGAAT TCCCAGAAAA ACACTTTCCG 
GAGTGGGTOG GCTTGAGACC ATCTCTACAO CCACAG6GC0 
GGTOGGGTAT GGTCC6CCAT CCCAACTATC TTGGAOACCT 
CCTTGCCCTG CGGGGTGTCA CACCTGCTGC CCTACTTCTA 
TGCTGGTGC3V CCGTQAGGCC CX3GGATGAGC GGAGTGCCTG 
GCAGGAGTAC TQCCGGCOTG TGCCTTACCG CATCATGCCr 
CACCACCCCA GGT6GGGCAT 6T6CCCACTC ATCCAGCAGC 
OGACACACTT GGGACTCAAG GGCTTGCAGC CCACXXAGOC 
AGAAGAGGTa GTTTAGAGCA AGGAAAAAAA TGAAACCAGT 

Seq ID NO: C209 DNA Sequence 

Nucleic Acid Accession fit NM_015720.1 

Coding sequencet 21..183B 



11 



21 



GGTTCTGGGQ GCTGCAGAAC 
OGAGGGGCCT GGGGG06GGG 
GGGTGCCTGG GGGCOQAGGO 
AGGGTCOOGC AOAQACaTCA 
CG6A06G0QA G6TGATG6CC 
CCACCATGTT CCACCTGCTC 
CCGOGTCCCT GCCCGGGCTG 
CCTGGCT0G6 CCTGCA6GCG 
TGGCGGAOGC TOGQOGGAGG 
TTATGAOAGC OCCCTTGGAC 
CCTGCGCTAT CCTATTAAOG 
GGGGATGTCA GOGGGGCTGC 
TGTCGCCACC CTCACCGCTT 
CCCAGTTTOG GCCCTG6CAC 
GGGAGGAGAG CTCAACCCTC 
ACCCGGCCTC ATCGQCTGGG 
TOSAGGCAGT CCCTCACTGG 
TGATGCCCTC TGGCAOGAGG 
TOOCTT CATO CTGGGGTTTO 
CCAGTTCCTO CT6CACCAGC 
CATCAATGCT ACTGGTTACT 
AAAGAATCCT TCTGACCCCA 
GAAACT6CT0 OTGTCTGGGT 
CATCATGOCT CTQ6GTT0GT 
CCTCCTCTAC TTGACOGOGC 
CAGAAQTAOQ OCCTGGCCTG 
TACATCTACT GAAGOGGCTC 
ACACCCAGGA CCAGGA6CCT 
CTGA6GATGA ACAACCTC3U3 
GACCAAAAAA AAAAAAAAAA 
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31 41 

I 1 I 

CXAGTTCOGO CACGAGGACC ATGGGCCGGC TOCTGCGGGC CGCCOGOCTG COGCOGCTGC 60 

TTTCGCCGCr GCTQCTTCTG CTGQTTGGGQ GAGCX3TTCCT GGGTGCCTGT GTG6CIG6GT 120 

CTGATGAGCC TGGCCCAGAG GGCX:TCACCT CCACCTCCCT GCTAGACCTC CTOCTGCCCA 180 

CTGGCrTGGA GCCACTGGAC TCAOAOGAGC CTAGTOAGAC CATGGGCCTG GGAGCTGGGC 240 

TGGQAOCCCC TGGCTCAGGC TTCCCCAOOG AAQAGAATGA AGAGTCTCX3G ATTCTGCAGC 300 

CACCACAOTA CTTCTGGGAA GAGGAGQAAG AGCTGAATGA CTCAAQTCTG GACCTGGGAC 360 

CX3VCTGCAQA TTATOTTTTT CCTGACTTAA CTGAGAAGGC AGGTTCCATT GAAGACACTA 420 

GCCAGGCTCA AGAGCTGCCA AACCTCCCCT CTCCCTTGCC CAAGATGAAT CTGGTTGAGC 480 

CT O OCTOGCA TATGGCTCCC AQAGAGGAQO AAGAAGAGGA AGAGGAAGAG GAGGAGAGGG 540 

AGAAGGAA6A GGTAGAOAAA CAAGAGGAOG AGGAAGAGGA GQAGCTQCTC CCTGTGAATO 600 

6ATCCCAAGA AGAAOCCAAG CCTCAGGTCC GTGACTTTTC TCTCACCAGC AGCAGCCAGA 660 

CCCCAGGGGC CACCAAAAGC AGGCATGAAO ACTCCGGGQA CCAGGCCTC3V TCAGQTGTGO 720 

AGOTGQAGAG CA6CATGGGG CCCAGCTTGC TGCTGCCTTC AGTCACCCCA ACTACAGTGA 780 

CTCOGOQGGA CCAGGACTCC ACCAGCCAAG AGGCAQAGGC CACAGTGCTG CCAGCTGCAG 840 

GGCTTGGGGT A6AGTTC36AG GCTCCTCAGG AA6CAAG0GA GGAAGCCACT GCAGGAGCAG 900 

CTGGTTTQTC TGGCCAGCAC OAOGAGQTGC C3GQCCTTGCC TTCATTCCCT CAAACCACAG 960 

CTCCCA6TGG GQCXmGCAC CCAGATGAAG ATCCCCTTGQ CTCTAGAACC TCAGCCTCTT 1020 

CCCCACTGGC CCCTGGAGAC ATQGAACTQA CACCTTCCTC TGCTACCTTG GGACAAGAAG 1080 

ATCTCAACCA GCAGCTCCTA GAAGG6CAG0 CA6CTGAAGC TCAATCGAGG ATACCCTGGG 1140 

ATTCTAOGCA GGTGATCTGC AAOGACTGGA GCAATCTGGC TGGQAAAAAC TACATCATTC 1200 

T6AACATGAC AGAGAACATA GACTGTGAGQ TGTTCCGGCA GCACCGGGGG CCACAGCTCC 1260 

TGGCCCTGGT GQAAGAGGTQ CTGCCCCGCC ATGGCAGTGG CCACCATGGO GCCTOGCACA 1320 

TCTCTCTQA3 CAAOCXX»GC GAGAAGGAOC A6CACCTTCT CATGACACTG GTGGGOGAGC 1380 

AGGGGOTGOT GCCCACTCAA QATGTCCTTT OCAXGCTGGG TGACATCCQC AGGAOCCTGG 1440 

AGGAOATTGO CATCCAGAAC TATTCCAOIA OC3U3CAGCTG CCAGGC6CGG GCCAGCCAGG 1500 

TG0GCAGC6A CTAOGGCAOG CTCTTOGTGG TGCTGGTGGT CATTGGGGCX: ATCTGCATCA 1560 

TCATCATTGC GCTTGGCCTQ CTCTACAACT GCTGGCAGOQ COGGCTGCOC AAGCTCAAGC 1620 

ACGTGTOGCA CX3G0QAGGAG CTGCGCTTOG TGGAGAACGG CTGCCAOQAC AACCCCACGC 1680 

TGGAGGTGGC CAOOQACAGC CAGTGGGAQA TGCS^GGAOAA OCACCGCAGC CTGAAOGGCG 1740 

GCQ66GCCCT CM0QGCCXX3 GOQAGCTGGG GGGOGCTCAT GOGGGGCAAG CGGGACCCCG 1800 

AOGACTCGGA CGTGTTCQAO GAGGACAOGC ACCTCTTGAGC GCAGOGAGQC GCAOGCCGAG 1860 

TGGGCCGCCA GGACCAAGOQ AGGTG6ACCC CGAAACGGAC GGCCCGGAGC COGCACCAGC 1920 

CCCGCGCCTA CCGGGCGGCC CCOGOGOCTG GCCCTOGGCG CGGGCTCCTT CCOGCTTCCC 1980 

COQACTTCAC AGOOOGGCTT OGGAGCAACT CCCTCACTGC CX3CCCQAGGG GCAGGCCTCA 2040 

AAGCCOGCCT TGGCCCC6CT TTCOCGCCCC T6AACCCCGG CCCCGGGGGC GGOGGGGGCQ 2100 

CTTCCTGOQC CCCGGGACTC AATTAAAOCC GCCCX3QAQAC CACGCGGQCC CAGCGAAAAA 2160 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2220 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 2269 

Seq ID NO: C210 DMA Sequence 
Nucleic Acid Accession «i NM_001197.3 
Coding 8c»quence t 61 . . 543 

1 11 21 31 41 51 

1 I I I 1 I 

GACftOGAAGC CTCOOGOGT O GCTTACAGAC GCTQCCAGCA TOGCGGCOSC CAQAGGAGAA 60 

ATGTCTGAAG TAAGACCCCT CTCCAGA6AC ATCTTGATGO AOACCCTCCT GTATGAGCAO 120 
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CTCCTGGAAC 
CCTATGQAGQ 
CTGGCCTGCA 
TGCX3AGGTG6 
AG6GATGTTC 
TTCTGGAGAT 
CTGCTGCTGC 
TGAGGCCCCX3 
OOCOGCCIGC 

TTCCAGTTTT 
ACTGTGGCCT 
AGGGOGAGTG 
GAATAGATTC 
AAA 



CCCCGACCAT 
ACTTCQATTC 
TCGGGGAOGA 
CCATGC ACAQ 
TTAGAAGTTT 
CCCCGAACCC 
TGGCGCTGCT 
GCGGCTCAGG 
T6CTGTTATC 
TGCTGGAACA 
C XS m TTTCT 
OTGCCCAGGA 
CTGGTCACAC 
08AGGAGCA0 



GGAGGTTCTT 
TTTGOAATGC 
GAT6GA0GTG 
CCTGGGTCIO 
CATGGAOGGT 

GCTGCCGCTG 
GOGGQGCTGG 
TTTTTAACTO 
CTGCTGAGGT 
AAAAGATGAA 
AGAGCCATTC 
OCCTGTGTGA 
GAGTGCTCAA 



GGCATGACTG 
ATGGAGGGQV 
AGCCTCAGGG 
GCTTTCftTCT 
TTCACCACAC 
GTGTC C TOOG 
CTCAGCGGGG 
CCCGACCCCC 
TTTTCTCATQ 
TTTATACTCA 
TTOCTATGOC 
ACTCCTGOCC 
TATGTQATGC 
TAAAATOTTO 



ACTCTQAAOA 
GTGAGQCATT 
CCCCGOGCCT 
A0QAGCA6AC 
TTAAGGAOAA 
AACAGGTGCT 
GCCTGGACCT 
ATOACCACTO 
AroOCTTTTT 
GGXTTTTTOT 
TCTGCAATTG 
CTGCCCACAC 
CCTGGGCAAA 
GTTTOCAGCA 



GGACCTGGAC 
GGCCCTGOGO 
GGCCCAQCTC 
TGAGGACATC 
CATAAT6AGQ 
GCTGGCGCrO 
GCrOCTCAAO 
CCCTQGAGGT 
ATATTTAAAC 
TTTTTTTTTA 
TCACC6GTTA 
GGCAGGTAGC 
QAATCTACTG 
AAAAAAAAAA 



IBO 
240 
300 
360 
420 
4B0 
540 
60O 
660 
720 
780 
840 
900 
960 
963 



Seq ID NO: C211 DHA Sequence 
Nucleic Acid Accession fit AF272357 
Coding sequences 83.. 1060 



GCTGCTGCCG 
GGCC0QG0G6 
GCQ6CTGCT0 
CGCCGGCCAC 
GGCAAGGTGT 
CCAGCAAGGO 
CAGACTGOAA 
ATCAACTC08 
CT0G6CAGGG 
CAGGCCCCAC 
GGAGCCCGQG 
GGOOQOTGCA 
COGGCTGACT 
COQGATCTCG 
CCAA0G6CAA 
GGCCTCCTOG 
6GCCCCQACC 
GCCCCTGCCG 
CACCT6CTCC 
AACATGTTTT 

tgcx:ago6ag 

GTTCGGTTTA 
GG6TAATTCT 



11 

1 

ACXSCGGAGCC 
CGGCGGTGAA 
CGGCTGCTGC 
CCGGATGTAG 
CCTCCTGGTG 

QATGAGATT6 
CCCCTACCCA 

GGGCAGGGGC 

GQAGGGCAAG 
OCCGCCCTCT 
CAGAAGCKIOG 
CCTQGGGACC 
CAGATGCTGT 
GATGAGGAGA 
GGGQAAATGG 
GCCCCXAGCT 
CCGACCTCGA 
GATGCTQTGT 
ACXrCCCGAAC 
GACXXXXIAAA 
AGAGACSUUU^ 



21 

1 

CGGAGCCOGC 
GGATGGOGAC 
TCTOOGGCCT 

CACATGCXTTQ 
CCAGOATOOQ 
ACTTCCTG6C 
A6GACCX3ACA 
TGGAGCTGGG 
GCrCCCCTGT 
CCGACGGCCT 
CGQTAGCCTC 
ACTAOQCCAC 
AGCGQCTGGC 
OCCTGQAGaS 
AT0A6GA0QG 
JtfaGTO OQCAA 
CACX3GCCTGC 
GGCCCCOGGG 
GCTTTTGQCT 
CTTTGTGCCA 
CTGOAOGGGG 
GCCAATTAAA 



31 

1 

GCCQAGCCCC 
GCCGCTGCCT 
C3GTCCTGGGC 
CGGGAGCCTG 
TGGGCCCTGC 
C06GCCTCCA 
OCAGQAGCTT 
6CQQCTCCCG 
CCTCCCCTCC 
GTCATCCGAC 
CGCCCTT6TG 
CCTCTGCTGG 
TGCX3AAG60C 
ACAGAGOGOG 
GCATAAAGAG 
A6ACTTCAC6 
GCCTCTGTTC 
ACT6CCATQA 
GAGGGGCAGG 
GGGCCTCGGG 
GGACACCTCX: 
CATGGAOAAC 
6TCCATTTCA 



41 
I 

TGGGCTOGCG 
CC6CGCTCCC 
QCOQCCCTQC 
GACTGTGCCC 
CTTCAOCCCT 
GGGGGGGGCC 
GCCCGQAAOG 
GAGCCTGCCA 
ACTCCAGGAA 
CCGOTGCACA 
CTGATCCTGG 
TGCAOGCTGC 
CCT66CTCAC 
GAGATGTACC 
CCACCCAAGG 
GTGTAOBAGT 
GACCACOOOO 
CXTIGGAGGCA 
GCCTGGAOCT 
CTCCAGGCCC 
TGGTCCCCTO 
CX3TAGAGGQC 
6AAAAAAAAA 



51 
I 

GTGCCATGCT 
C6CGGCACCT 
6T6GAGC0GC 
TGAAGAGGOG 
TCCAGGAGGA 
GGCCCCAGCC 
AOTCTGGACA 
CCCTQ6GCTT 
CCCCCAOGCC 
TGTOGCCCCT 

AOCaTQAQAT 
CT6CAGCTCC 
ACTACXaOCA 
AGCTGGACAC 
GCCCGGGCXZT 
CACTOTGOGC 
6ACAGACGCC 
TCCCACTAAA 
TGGGACCCCT 
CACCTCTCCT 
AGGAAOGGOT 
A 
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Seq ID NO; C212 DNA Sequence 

Nucleic Acid AccesBion ttt NM_004445.1 

Coding sequence I 799.. 3819 



1 11 21 31 

I I I I 

CGGAGGGGGC GGGCGGGGCT GCGTTOQCTC CAOCOGOGGC 
GACGCGGGAC CCAGCTTGGC GAOGGCGATT CTCGAOQOGG 
GCCCCACCTC T6GAGCAGCC CCTGCOGCCA GOQTCAGGTC 
TCTCGGCQCC GAACGGACCC GGGCCX»3TGC AACGGGGTCC 
GTGGCACOGT GCGAGCTCCA GGAQCCCCGG GTCCACTGCG 
CTGCAGAOAC TGOQGCCAAC OGGAAGAAAT AAAGGGATTA 
CTTCTGAGAC TCAGACAOGA GGAGA6ATAG AGAAC06CCA 
A6GAGGTGCC AAGCCTGTTT GTCT7CATTG TGACACTGGA 
AAOATCAGGG TGCOGGCATG GTCAGTTCCT GGC3GAAGCCT 
CCCTCTTCTT TGTTGTGTCC TCGAATGGCA GAAAAAGQGG 
GAGAGTAAAT GAAGAGAAAG AACTGGAATA ACCCCTT6CA 
CTTAGCTGTA CACCCTGAGT CTTQCAAAAG CTQCAGCCCC 
CTGGGGOQAT GGTGGACOCC CTQAAC^TGT CCCATGGCTA 
GGGAACAGAG TGGCGGGCAT GGT6TGTAGC CTATGGGTGC 
CTGGCTCTGG AAGAGGTATT QCTGQACACC ACOGGAGAGA 
ACCTACCCAC CAGGGGGGTG GGAOGAOGTG AGTGTTCTGG 
CGGACCTTTG AGGCATGTCA TGTGGCAGGO GCCCCTCCAG 
TTGCAGACAC ACTTT6TGQA 6CGGCGCGGQ GCCCAGAGGG 
TCTGTG06G0 CAT6CTCCAG CCTGGGTGTG AGCGGCGGCA 
CTTTACTACC GTCAGGCTGA GGAOCCOQAC AGCXXH^CA 
AAA06CTGGA CCAAGGTGGA CACAATTGCA GCAGAC6AGA 
TCCTCCTCCT CTTCTTCCTC TGCAGCGTGG GCT0T6GQAC 
GCTGGACTGC AACTGAA06T CAAAGAGCX3G AGCTTTGGGC 
TA06TGGCCT TCCAGGACAC GGGGGCCTGC CTGGC CCTGG 
TACACCTGCC CTGCOGTGCT GOQATCCTTT GCTTCCTTTC 
GCT6GGG6GG CCTCCCTGGT GGCA6CTGTG GGCACCTQTG 
GAGGATGGAG TAOGGGOCCA GOCAGGAGGC AGCCCCCCCA 
GGCAAGTGGA TGGTAGCTGT CGGGG6CTGC OGCTGCCAGC 
GGAGACAAGG CCTGCCAAQC CTGCCCAC6G GOGCTCTATA 
COCTGCTCAC CATGCCCTGC CCGCAGTCAC GCTCCCAACC 
TGCCTGGAGG GCTTCTACGO GGCCAGTTCC GACCCACCAG 
CCAT0G6CTC OCCAGGAGCT TTGGTTTQAG GTGCAAGGCT 



41 

I 

TCTACAOCAG 
GCCCCCAGGA 
CAOCOCGGAA 
CCGGACTGGA 
AGGCCTOGGG 
TAGTCCACCC 
ATCTCTAGAT 
GTCTAGATGC 
CTCTTCTAGG 
TGGCTGTTGG 
GAAAAAAAAA 
ACCCAOQAGC 
CTGAAGGGGC 
TGCTCCTGGT 
CATCTGAGAT 
AOGAOCAGOQ 
GCAC066GCA 
CGCACATTOQ 
CCTGCOGGGA 
GCX3TTTCCTC 
GCTTTCCCTC 
CCCACGGQGC 
CTCTCACCCA 
TCGCTOTCAQ 
CAGAGACGCA 
TGGCTCATGC 
GGCTGCACTG 
CTGQATACCA 
AGTCTTCTOC 
CAGCAGCCCC 
AGOCOCCCTG 
CAGOVCTCAT 



51 
1 

OGGGOGGGGG 
TTCTGCOQQC 
TCCCAGGGAC 
6AAQAC6CGG 
GGGCGCAGAC 
AATTCACAGA 
OW3WGCAA 
T6GGAA6TCC 
TTTCAGACTO 
AGGAAG6GA0 
AAAAGGGAAG 
AGGGTOGTaO 
TGCCCAGnA 
GTCTTCAOTT 
TGGCTGQCTC 
AC6CCTGACT 
GGACAATTGG 
ACTCCACTTC 
OACCTTCACC 
CTGGCACCTC 

TGGGCAGCGG 
AOGCGGCTTC 
GCTCTTCTCC 
GGCCAGTGGG 
AGAGCXrAGAG 
CAAC6GGGA0 
ACCAQCAOGA 
TGGGAATGCT 
OGTTTGCCCC 
CACTGGTCCT 
6CTACACTQG 
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OGCCTGCCTC GGGAGCTGGG GGGTCGAGGQ GACCTGCTCT TCAATGTCGT GTQCAAGGAG 1980 

TGTQAAOGCC OCCAGGAACC TGOCAGOGGT GCrrGGGGGCA CTTGTCACOG CTGCAGGGAT 2040 

GAGGTGCACT T06ACCCTC6 CCAGAGAGGC CTGACTGAGA 0CCX3ACITGTT AGTGGGGGGA 2100 

CTCGGGGCHC AOOTACCCTA OVTCTTAGAG GT6CA06CT0 TTAATGGGGT 6TCTGAGCTC 2160 

AGCCCTGACC CTCCTCAG6C TGCAGCCATC AATGTCAGCA CCAGCCATOA AOTOCCCTCT 2220 

CCTQTCCCTG TGGTGCACCA GGTOAGCOSG QCATCCAAOV GOVTCACGGT GTCCTGGCCXS 2280 

CAGCCX3GACC AGACCAATGG GAACATCCTG GACTATCAGC TCCGCTACTA TGACCAGGCA 2340 

GAAGACGAAT CCCACTCCTT CACCXTOACC AGOGAGACCA ACACTGCCAC CGTGACACAG 2400 

CTOAOCCCTO GCCACATCTA TGOTTTCCAG GTQC3GQGCCC GGACTGCTGC 0QGCXA066C 2460 

CCCTACGOOG QCAAAQTCTA TTTCCAGACA CTTCCTCAAO GGGAGCTGTC TTCCXaGCTT 2520 

CGGGAAAGAC TCTCCTTGQT GATCGGCTCC ATCCTGGGGG CTTTOOCCTT CCTCCTGCTG 2S80 

GCAGCCATCA COQTGCTGGC GGTCGTCTTC CAQOGGAAGC GOOGTGGGAC TGGCTACACG 2640 

GAGCAGCTGC AGCAATACAG CAGCCCAGGA CTCGGGGTGA AGTATTACAT OGACGGCTCC 2700 

ACCTAG8AG0 AOCXXTTOTCA GQCCATC06A QAACTTGOCC OGQAAGTOQA TCCTGCTTAT 2760 

ATCAA6ATTG AGGAGGTCAT T66GACAG6C TCTTTTGGAB AAOT606GCA GG6C0QCCT0 2820 

CAGCCACGGG GA06GAGQGA GCAQACTGTG GCCATCCAGG CCCTGTGGGC OGGGGGCXSCC 2680 

GAAAGCCTGC AGATGACCTT CCTGGGCOGG GCCGCAGTGC TCGOTCAOTT CCABCACCCC 2940 

AACATCCTGC GGCTGGAGGO 06T6GTCACC AAGAGCOQAC CCCTCATOGT 6CTGAGGGAG 3000 

TTCATGGAGC TTQQCCXXCT GGACAGCTTC CTCAG6CAGC GGGAOGOCCA GTTCAGCAGC 3060 

CTGCAGCTGG T6GCCATGCA GOGGGGAGTG GCTGCTGCCA T6CAGTACCT GTOCAGCTTT 3120 

GCCTTCGTCC ATCGCTCGCT QTCTQCCCAC AGOOTGCTGO TGAATAGCCA CTTGGTGTGC 3180 

AAGGTGGCCC GTCTTGQCCA CAGTCCTCAG GGCCCAAGTT GTTTQCTTCO CTGGGCAGCC 3240 

CCAGAGGTCA TTGCACATGQ AAA6CATACA ACATCCAGTQ ATGTCTGGA6 CTTTGG6ATA 3300 

CTCATGTGQG AAOTQATGAG TTATGQAGAA 0Q6GCTTACT GGQACATGAG TGAGCAGGAO 3360 

GTACTAAATO CAATAGAGCA GGAGTTCOGO CTGCCCCOGC CTCCAG6CTG TCCTCCT6GA 3420 

TTACATCTAC TTATGTTGGA CACTTGGCAO AAGGACCGTG CCXX5GCX3GCC TCATTTTGAC 3480 

CAGCTGGTGG CTGCATTTGA CAAOATGATC OGCAAGCCAO ATACTCTGCA GGCTGQCOGG 3540 

GACCCAOGGQ AAAGGCCTTC CCAGGCCCTT CTGACCCCTG TGGCCCTGGA CTTTCCTTGT 3600 

CTOGACTCAC 0CCA6GCCTG GCTTTCA60C ATTGOACIGG AGTGCTACCA 6GACAACTTC 3660 

TCCAAGTTTG GCCTCTGTAC CTTCAGTGAT GTGGCTCAGC TCAGCCTAGA A6ACCTQCCT 3720 

GCCCTGGGCA TCACCCTGGC TGGCCACCAG AAGAAGCTGC TGCA0CAC3VT CCAGCTCXnTT 3780 

CAGCAACACC TGAGGCAGCA GGGCTCAGXa QAGGTCTGAO AATOACGATA CCCGTGACTC 3840 

AGCCCTGGAC ACTG6TC0QA GAAGGGACAT GTGGGACGTG AGCOGGGCTC CAACAGCCTC 3900 

TGTGAGAGAT GCGCGACACC AAACCCAACC CT0C0GATG6 CTGCATTCCC TG6TCCTCCQ 3960 

CCTCTCCACC AGGCCCCTCC TCATTAAAGG GAAAGAAGGG AATTTGCAAA 4010 

Seq ID NO: C213 KJA Sequence 

micleic Acid Accession #s XM_043340.4 

Coding sequence: 195. .1067 

1 11 21 31 41 51 

I I i I I 1 

OGGOGGOGCX: CAATGGGCTG 06C6GAGG6T CACTTCCCGG CAGCGGGAGG CQAGT6G06A 60 

GTGGOGAGTQ GCSAOTQTCA GGGGOaOGGC OGGOQGGGGC G0GGCG6CXX3 GAGGA06CX3T 120 

TGGCAQCX3GG CTOGGACCCA 060GGCX3COO CXSGCCOGCXT GGCCTGCAGC GCTCCCAOCC 180 

CCGGCGQCGG CACGATGCCC TTTGACTTCA GGAGGTTTGA CATCTACAGG AAGGTGCCCA 240 

AOQACCTTAC QCAGCCAACG TACACCGQGG CCATTATCTC CATCTGCTGC TGCCTCTTCA 300 

TCCTCTTCCT CTTCCTCTCX3 GAGCICACGO GATTTATAAC GACAQAAGIT GTQAACXIAQC 360 

TCTATGTCGA TGAOCCAGAC AAGGACAOGG GTGGCAAGAT OGAOGTCftGT CT6AACATCA 420 

6TTTACCCAA TCTGCACTGC GAGTTGGTTG GGCTTGACAT TCAGGATGAG ATGGGCAGGC 460 

ACGAAOTGGG CCACATCGAC AACTCCATGA AQATCCCX3CT OAACAATCGG GCAGGCTGCC 540 

GCTTCGAGGG GCAGTTCAGC ATCAACAAGG TCX:OCGGCAA CTTCCAOGTQ TCCACACACA 600 

6T6CCACAGC CCAQGCACAG AACCCAGACA TGACXiCATGT CATCCACAAO CTCTCCTTTG .660 

G6GACACGCT ACAQGTCCAO AACATOCAOG GAGCTTTCAA TGCTCTGGGQ GGAGCAGACA 720 

GACTCACCTC CAACGGCCTG GCCTCCCAOG ACTAC3VTCCT GAAOATTQTG COCAOGQTTT 780 

ATGAGGACAA QA6TGGCAAG CAGOGGTACT CCTAGCAGTA CACGGTG6CC AAC3VAGGAAT 840 

AOGTOGCCTA CAGCCACACG GGCCGCATCA TCCCTGC3VAT CTGGTTCOGC TAOGACCTCA 900 

GCCCCATCAC GGTCAA6TAC ACAGAGAOAC GGCAGCOGCT GTACAGATTC ATCACCACGA 960 

TCTGTGCCAT CATTGGCGGG ACCTTCAGGO TOSCOGGCAT OCTGQACTCA TGCATCTTCA 1020 

CAGCCTCTGA GGCCTGQAAO AAGATCCAGC 7GGGCAAGAT QCATTGACGC CACAOOCA6C 1080 

CTAATGGCCX3 AGGACCCTGQ GCATCGCCAG OCTTGCCTCC AGTGCCCTGT CTCCTTTQGC 1140 

CCTCAATCTG GTCCCAAATC TGGCTGTGTC CCAAAGOGTG TQTGGGAAOT GGOGGGAAAO 1200 

TAQAOGATOG CTOQATGTTT TGCAGCTACC T Cm ' T CCCC GTGTTTCTTT TTA6ACAAAT 1260 

TACACTGCCT GAAGTTGCAG TTCCCCTTTC 0CTGGG6AGC OCCAAOAACA GAGTCAOGCA 1320 

AGGGGTGGGG AOTCCAGGGG AACATCCCAG AATGCATATC GATCAGCTCT CAGCCAGGCT 1380 
TCGACAATCT CGCAGCCCXX: ACTAGQTGGA CACATTAATG ATTTGGTTTC TCCCCTGGQC 1440 
AGCXSmCCTG CCXXS^AGGC ACXAGACCTG GGCTTTCA6C TTTGGGACCA GGCTGCCCAA 1500 
AGGTACTCCT TTATACACCC GGCACCTTCC ACX3AAA6ATG GTACTTCCCA AGCAAGCCCC 1560 
TATGATTTGT CACTATAGAT G6AAATGTGT GGCATGCACA TGAGTTQAAA TTCTTTTATO 1620 
CATTTTTTTO AAGAAAAAAA AAAAAACAAC TCTGAGGACA TAGGGGATGT CAGITTOCTA 1680 
TGGAAGAGAC ACCTCT6ACC OGTTATTGTT ATAATCAAAA TCTGAft fiGGA AAAAAAZGTT 1740 
TTAOTTCTTT CCCCACT06T TGCKSTTCAAC TAGATTAAAA GGCTGATTTT CAG 1793 

Seq ID NOt C214 UNA Sequence 

Nucleic Acid Accession S: NM__002151.1 

Coding sequences 246.. 1499 

1 ' 11 21 31 41 51 

I I I I I I 

TOGAGCCCGC TTTCCAGGGA CCCTACCTGA GGGCCCACAG GTGAGGCAGC CT6GCCTA0C 60 
AGGCCCCACG CCACOGCCTC TGCCTCCAOG GC0CCO5CTG CTaCGGGGCC ACCATQCTCC 120 
TGCCCAGGCC TQGAGACPGA COOQACCCCG GCACTACCTC GAGGCTCCX3C CCCCACCTGC 180 
TGGACCCCAO GGTOCCACCC TG6CCCAGGA GGTCAGCCAG 6GAATCATTA ACAAGAGGCA 240 
GTGACATGGC GCAGAAOQAG GQTGGOCGGA CIGTGCCATO CTGCTCCA6A CCCAAGGTGG 300 
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CAGCT CTCAC TGCGGG6ACC CTQCTACTTC TGACAGCCAT OGGQGOCWCA TCCTGtSGCCA 360 
TTGTOCCTGT TCTCCTCAGG AGTQACCAGG AGCOGCTGTA CCXACTGCAO OTCaGCTCTQ 420 
CG6A0GCTCG 6CTCATGGTC TTTGACAAGA 06GAA0GGAC GTGG06GCTG CTGPGCrCCT 



480 



5CG0GCTGCAA OGCCAGGGTA GCXX3CSACTCA GCTGCX3AGGA QATGGGCTTC CTCAGGGCAC 540 

TGACCCACTC CGAGCTGGAC GTGOGAAOGG OGGGCGCCAA TGGCACGTOO GQCTTCTTCT 600 

GTGTGOACGA GGGGAG6CT0 CCCCAC31CXX: AGAGGCTGCT GQAGGTCATC TCCOTCTGTO 660 

ATTGCO OCAQ ACGOOGTTTC TTGGCOOCCA TCTOCCAAGA CPGTGGCOSC AGGAAGCIGC 720 

COGTGGACCG CATOGTGGGA GGCGQGQACA CCA6CTTGGQ COGGTGGCCG TGGCAAGTCA 780 

^ ^ 6CCTT0GCTA T0ATGQA6CA CACCTCTSTG GGGGATCCCT GCTCTCCGGG GACTGOaTGC 840 

lU TGACAG COGC CCACTOCTTC COGGAGCX5GA ACCGGGTCCT GTCCCGATGG CGAGTGTTTG 900 

COGG TGCOO T G6CCCAGGCC TCTCOCCACG 6TCTGCAGCT GGGGOTQCAG GCTOTGGTCT 960 

ACCAOBGGGG CTATGTTOCC TTTOGGGAGC CCAACAGOGA GGAGAACAGC AAC6ATATTG 1020 

CCCTGGTCCA CCTCTCCAGT CCCCT60CCC TCACAQAATA CATCCAGCCT GTG T GCCTCC 1080 

- - CAGCTGCOQG CCAOOCCCTG GTGQATGGCA AGATCTGTAC CGTGAOGGGC TGGGGCAACA 1140 

ID CGCA GTACTA TGGCCAACAG GCCOGGGTAC TCCAGQAGGC TOGAGTCCCC ATAATCSWSCa 1200 

ATGATQTCTQ CAATGG06CT GACTTCTATG GAAACCAGAT CAAGCCCAAG A TGTTCT Q TG 1260 

CTGGCTACCC CGAOOOTGGC ATTGATOCCT GCCAGGGCGA CAGCGGTGGT CCCTTTGTGT 1320 

GTQAGGACAG CATGTCT066 AC6CCA08TT 6GCG0CTGTQ TG6CATTGT0 AOTTGGGOCA 1380 

CTGOCTGTOC CCTQGOCCAG AAGCCAGGOG TCTACACCAA AGTCAGTGAC TTO06GGAQT 1440 

Z\) GGATCTTCCA GGCCATAAAG ACTCACTCOO AAGCCAGCGG CATQGTGACC CA6CTCTGAC 1500 

CGGTGaCTTC T0GCTGC6CA GCCTCCAGGG CCCGAGQTGA TCCOGGTGGT GGGATCCACG 1560 

CTGGGCOGAG GATGGGACGT TTTTCTTCTT QOGCCCGGTC CACACGTCCA AGGACACCCT 1620 

CXXrrCXJVGGQ TCCTCTCTTC CACAGTGGCXS GGCXXaVCrCA GCCCCX3AGAC CACXXSVACCT 1680 

_ CACCCTCCTG ACCOCCATOT AAATATTOTT CTGCTGTCTO GGACTCCTGT CTAGGTGCOC 1740 

ZD CTQAT6ATGG GATGCTCTTT AAATAATAAA GATGGTTTTG ATT 1783 
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Seq ID NO I C215 DNA Sequence 
Nucleic Acid Accession #: AB037745.1 
Coding sequence: 26., 1744 



1 11 21 31 41 51 

I I I I I I 

ATGGTGGAAC ACGCTGCCCA CAAAC3VTGGA AAOGACCGTT CTCAGTQGGA TCAACTTOSA 60 

OTACAAGGGC ATGACAGGCT GGGAGGTGGC T60TGATCAC ATTTACACAG CTGCTQQAGC 120 

CTCAOACAAT OACTTCATQA TTCTCACTCT OOTTGTGCCA G6ATTTAGAC CTCC3GCAGTC 180 

GGTGA TGGCA GACACAGAGA ATAAAGAGGT OOCCAOAATC ACATTTGTCT TTGAGACCCT 240 

CTCTTCTGTG AACTGTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 300 

TGTCQAGAOG TGGAAAGGTT CCAAAGGCAA AGAOTCCTAT ACCTACATCA TTGAGGAGAA 360 

CACTACCAGG AGCTTCACCT GGGCCTTCCA GAGQACCACT TTTCATQAGG CAAGCAOQAA 420 

GTACACCAAT GAOOTTGCCA AQATCTACTC CATGAATGTC ACCAATGTTA TGAATGGC3GT 480 

GGCCTCCTAC TGCOGTCCCT GTGCCCTAGA AOCCTCTGAT GTGQGCTCCr CCTGCACCTC 540 

TTGTCCTGCT GGTTACTATA TTGACCGAGA TPCAGGAACC TGCCACTCCT GCCCCCCTAA 600 

CACAATTCTG AAAGCCCACC AGCCTTATOG TQTCCAGGCC TGTGTGCCCT GTGGTCCAGG 660 

GACCAAGAAC A ACAAG ATOC ACTCTCTGTG CTACAAT6AT TGCACCTTCT CACGCAACAG 720 

TCCAACCAGO ACTTTCAACT ACAACTTCTC OGCTTTQGCA AACACOGTCA CTCTTGCTOG 780 

AGGGCXXAGC TTCACTTCCA AAGGGTTGAA ATACTTCCAT CACTTTACCC TCAGTCTCTG 840 

TGGAAACCAO GGTAGGAAAA TGTCTGTGTG CACCX5AC3U^T GTCACTGACC TCOSGATTCC 900 

TGAGGGTGAQ TCAGGGTTCT CCAAATCTAT CACAOCCTAC GTCTGCCAGO CAGTCATCAT 960 

CCCCCCAGAQ GTGACAGGCT ACAAGGCOQG GGTTTCCTCA CAGCCT6TCA GGCTTGCTGA 1020 

TCGACTTATT G6GGTGACAA CAGATATGAC TCTGGATGGA ATCACCTGCC CA6CTGAACT 1080 

TTTCCACCTG GAGTCCTTGQ GAATACCGGA OGTQATCTTC TTTTATAGGT CCAATOATGT 1140 

GACCCAGTCC TGCAGTTCTQ GOAQATCAAC CACCATCOSC GTCAGQTGCA GTCCACAGAA 1200 

AACTGTCCCT GGAA6TTTGC TGCTG0CA6G AAOGTGCTCA GATOOGACCT GTQATGGCTG 1260 

CAACTT0C3VC TTCCTQTGGG AjQAGOaOQaC TGCTTGOCOO CTCTGCTCAO TGGCTGACTA 1320 

CCATGCTATC QTCAGCAGCT GTGTGGCTGG GATCCAGAAO ACTACTTA03 TGTGGC6AGA 1380 

ACCCAAGCTA TGCTCT6GTG GCATTTCTCT GCCTGAGCAQ AGAGTOVCCA TCTGCAAAAC 1440 

CATAGATTTC TGGCTGAAAO TGGQCATCTC TGCAQQCACC TGTACTGCCA TCCTGCTCAC 1500 

CGTCTTGACX: TGCTACTTTT GGAAAAAGAA TCAAAAACTA GAGTACAAOT ACTCCAAGCT 1560 

GGT6ATGAAT GCTACTCTCA AGGACTOTOA CCIGCXAGOV GCTGAGAGCT GCX30CATC3VT 1620 

GGAAGGCGAO GATGTAGAGG AOGACCTCAT CTTTACCAGC AAGAACTCAC TCTTTGGGAA 1680 

GATCAAATCA TTTACCTCCA AGCAQCX316C TCCTGTCACC ATCTCTCTTT CAGAGGACTC 1740 

CTOATOGATT TQACTCaCTG CCGCTGAAGA CATCCTCAGQ AGGCCCAOAC ATOGACCTGT IBOO 

GAGAGGCACT GCCTOCCTCA CXTTGCCTCCT CACCTTGCAT AGCACCTTTO CAAGCCTGOG 1860 

GOBAT TTOGQ TGGCAGCATC CT6CAACACC CACIGCTOaA AATCTCTTCA TTOT GG CCTT 1920 

ATCAGATOTT TGAATTTCAG ATCTTTTTTT ATAGAOTACC CAAACCCTCC T TT C i T ^LTlXJ 1980 

CCTCAAACCT GCCAAATATA CCCACACTTT GTTTOTAAAT TATGCCCTTO CTTGTATCTT 2040 

GTTTCCCAAA ATGGCCCATC CGCCAGAGCC ATAGCTTCGT CTGCTCATAA TTCTTATAGC 2100 

TTTGGAATGA AAATATTTCT ATCTTCTTAA GTATAGAAAC TATTTCCTCT GTCCTCTAAC 2160 

TTAAGGOCAa AAAGA6CTGG GAGTTTTOCT OGCATGCCCT CAGCTCATGA TCTCTTCAGO 2220 

/U AGAOAGGCrO GGTQAGGAGG GTGTOGGGGT TCCCTG0TG6 ATAATCTTCA TAGCAGOCTG 2280 

QATCCATTTC CCCTGGATAA CCAGCTCAAA GGQAGTQAAA ATGOTAOTCr GAGGGCAAGQ 2340 

GQAOCAAGGC CTGGGTAAGA AAAGCCTTGA AAAGCATAAA AAGAGGCOGG GCG0G0TX3GC 2400 

TCACGCCTGT AATCCXMCA CTTTOGGAQG COGAGGOGGG CAGATCATGA GGTCX3GGAGA 2460 

TTGAGAGCAT GCTOGCTAAC AOGGTGAAGC OCOGTCTCTA CTGQAAATAC AAAAAATTA6 2520 

0 0GGG CX3TOQ TGGCGGGTGC CTGTGGTCCC AGCTACTCGG GAGOCTGAGG OGGGAQAATA 2580 

003TGGGCCT GGAA6G0G6A GCTTGCAGTG AGCXX5AGATC GOQCCACTGC ACTCCATOCA 2640 

GGCTGGGTOA CAGAGTGAGA CTCTGCCTCA AAAAAAAAAA AAAAAAAGAA AAGCACAAAG 2700 

AGAGQCAACA AGGAATGTTT TTGTTTTTGA GACAGGCTCT CACTCTOTCA CCTAGGCTGG 2760 

- - AOroCAOTGO COTAATCACT OTTCAGTGCA GOCTCAACCT CTTGGGCTCA G6CTAT0CTC 2820 

^'^^^CAG C CrCTCAAGTA GCTG6QACTA CGAGTGTGCA CCAGCAG6CT CACTAATTTT 2880 

TQTOTTTTTT GTAGACACGG GGTTTCACCG TGTTGCXrCAG GCTGGTCTCC AACTCCTGGG 2940 

CTGAACTGAT CTCTCCGCXTT OOOCCTCCCA AACTGCTGQG ATTACAGGCA TAAGCCACTG 3000 

CACTCA6CCT TTTATTTGTT TTTTAAACCA COTAGCTCAT TGCCTTCTCT TAAOTAAATG 3060 

ATAGATATTC TCACTQAAGC CAAAGOAATA AGTTCATCAA GAAAATGCCX: AAAGCCGTGO 3120 
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Seq ID NO: C216 DNA Sequence 
Nucleic Acid Accession #: KM_004864.1 
4D Coding sequence i 26-952 
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TGGATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA TCACTCTATG ACACT6AAAA 3180 

GAACCAOOTA AGOOCCAAAC CCAQA ltff T C CA6CCTTATC CTCTATTGG6 TTTACCCACA 3240 

OACATAGCAA ACCCTGTCAO TBAGGAAAAT TCCCCATCCT TGAGTGCCCC OQTCCTAOAA 3300 

GTTT GGQCCA TATTATQGAA CAGGGGTCTC TTATtTGAAA AQAGCACAAG GAGGCX»AGA 3360 

TTTTAATGQG GCACTTTAGG GGATACAGCC CACAATGGCA TGGGCCTGAG GTGGCCGTGA 3420 

TGTCTGCTTC TAAGCTTAAC GCATCTGCTC AGQCACAGAA TAAAOGTCTA GGCTGGCCAA 3480 

AAAAGGAACT GAATCCCAGG CCCATACGCC AGCACXIAGAA TCAAACCAGT CTTCAAG6AA 3S40 

OGAAOGCTAO GAOAOTTTAA CAAGATTTTC ACTGQOCCCA GCATGOTQQC TCACACCTGT 3600 

AATCCCAAGQ CAGAATGGTQ GCTTGAGCTC AGGAGTTCAA GACCAGCCTG GGCAACACAG 3660 

TGAGACCCTQ TCTCTAAAAA ATTTAAAAAT AAACAAGGTG TTCACCAAGC TGGQATACTT 3720 

CTCACTATTA AGCCCCTATC TTTCTCTTTT TTTCATTCTC AATTGCTTTG TGTQATAAAA 3780 

AACTAAAOAG ACTTCTGQTC CAATTTCTGG CAACATCCCT TCTGAAAGGT GAGTAGAGTG 3840 

GGTGTCTTCT ATGCCCATTT TCXXXAATTT TACACAAACT ATTATCAATG AACTTTTAAG 3900 

TACCTAGAAT GGGTAAAACC AQAQC3JIGAC TTTAAATTAC C lT ClTCnT CTTCTACTGG 3960 

ID CAOTTCTGCC TCCATCACTA TCAGOCTAQG GTGACCTTCC CTTGGTCAAG CCCCAATTGC 4020 

CCATGATTTG TGCCTGTGCC CTTTCTCXaG TGACCATTTG GTQACCAGAT GGTAGATATA 4080 

GAAAOQQGAT GGCATTTGCA AOTGACTAGT CTGCCACAAA ATGCTCATCT GATTAGCCAC 4140 

TGCTGCCCTG GCAATGGCTT TGTAAGAGTC AATQAGAACT AGA6CCAQGC TQTGGTCCCT 4200 

GGCCATCAAC AGTGTTGGTG AOSGCAGGGA GTCCCTTTGQ TTTAATAAAT CCAOTTTTTC 4260 

TTTGGGTATC CAAATTCTCX: CCTCCTTTTG TAGGAGTCA6 GCTCTCAOAA CCTQT G TCC A 4330 

TGTTGGAACT TCCCCCAGTG TGGATGCAGA TACGCAGCTC CTGAQCTCCA GCCTAAAGTC 4380 

TTCTGTAGCC TCAOCAATAC TTGGGCACCT GCTGTCTCAC TGAATAGCTT TCTTTTGTGA 4440 

CAAAGGCXAC AGACAGCCCT TAQACTATTC CGGAAACAGT AGGAAAAATT ACATATOTCT 4500 

TTQACTTCTT TATTCTGACT CCACTGATTT TAGCCATAAT ACTTTAAGGA GCTACTTTTT 4560 

ZD ACTACCCCTT ACCGTGCTGA CTTCT6CAGG TCTGCCCTGT GACCTGTCAO GAACTCCTQA 4620 

GTTA CGCTA C TGGGGTCACC TGTTGCTCCC CTAGCAAGTT AGGCATGTCA TATATTTTTA 4680 

ACAGCTTTAT TQAGATATAA TTCACATATT ATACAATTCA CCTTTAAAAC ATACGATTCA 4740 

ATGGTTTTCA GCAAACTCAC AGAGTTGTCC GCCCACTTGA GAGCAAACAC AT6TTCAATT 4800 

TTCTTTTCCT TTTTTTTTTT QAQACAGAQT CAOCTTTGTC GCCCAGGCTG GAGTQCAOTO 4860 

CCATGATCTT 6GCTCACTGC AGCCTCCCCA TCCTGGQTTC AAGT6ATCCT TCTGCTTCAO 4920 

CCTCCCCAGT AGCTGGQATT ACAGGCATGC GCCACCAOGC CTAGCTAATT TTTGTGTTTT 4980 

TAGTAQAGAT GGGGTTTCAC CGTGTTGGCC AGGCTGGTCT CAAACTCCTG GACTCAAGTa 5040 

ATCCACCXAC CTCXSGCCTCC CAAAGTGCTG GGATTGCAGG TGTGAGGCAC CGTGCCTGGC 5100 

CTA06TGTTC AATTTTCTAT GAACAAAGGC TTTAGTCCTT GACCCAaOQC TAAAGTGGTC 5160 

T6TCCAAGGT GTTGTTQGTA GAGGGAGTAT QATAAAATGT TTAAATCTCA TTTGGTTACC 5220 

TTGAGTCCTG GAACAOGCAG TAACTGTCAT GCTATAGTCA TCATCTOTAT TTGQCTGGGA 5280 

ATACAAATGA AGATTQTGOT GTATTCAAGC AGTAGGGTTT TTGCTTTTGT TTTTGTTTTA 5340 

GTGCCAACAA AACTTTTTTT TGTCTGACTA CATTAAAGAT AAGACTGACT ATATTTATAC 5400 

AACAGAAACT TTGTAATAGA TTTTTTCAGC TTTQTGAAAT OQAATTTTTT TTCATCAGGG 5460 

CTGGTTGGAT TTCCTTTTTA CCCTGTAATC CAAGOGTTAA TACTTTGTTA GAAGATGGGT 5520 

TATTGCATGT CACTTTTTTT TTTTTGTAAA ATAAAAACAT ACCTTAC 5567 



1 11 21 31 41 51 

I I I I i I 

CGGAACGAGG GCAACCTGCA CAOCCATGCC OGGGCAAGAA CTCAGGAOGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TOGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120 

GGCCX3AGGCG AGCCGCGCAA GTTTCCCGGG ACCCTCAOAQ TTGCACTCCG AAGACTCCAO 180 

ATTCOQAGAG TTGCGGAAAC GCTACXyVGGA CCTGCTAACC AGQCTGCGGQ CCAACX»GAG 240 

CTGGGAAOAT TCGAACACCO ACCT06TCCC G6CCCCTGCA GTCCX3QATAC TCACQCCAGA 300 

AGTQCX3QCTG GGATCOGGCQ GCCACCTGCA CCTGCXaTATC TCTCX3O0CO3 CCCTTCCOQA 360 

GGGGCTCCCC QA06CCT0CC OCCTTCAOC38 GGCTCTGrrC OGGCTOTCCC COACGGOGTC 420 

AAGGTCGTGG QACGTQACAC GACCGCTG06 GCGTCAGCTC AGCCTTOCAA GACCCCAAGC 480 

GCCCGCGCTG CACCTGCGAC TQTCGCCGCC QCXmCGCAQ TCGGACCAAC TGCTGGCAGA 540 

ATCTTCGTCC GCACXSGCCCX: AGCTGGAGTT GCACTTGCX3Q C06CAAGCCQ CCAGGGGGC6 600 

CCQCAGAGCO 00TQGG06CA AOGGGOAOQA CTGTGOGCTC GGGGG0QG6C GTTGCTGC06 660 

TCTGCACAC6 GTCCQ0Q08T 06CTGGAA6A CCTGGGCTGG GCCX3ATTGGG TGCIOTOGCC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGOGTGCCOG AGCCAGTTCC GGGOGGCAAA 780 

CATGCAOGCG CAGATCAAGA CGAGCCTGCA CXJGCXTTGAAG CCCGACAOGO AGCCAGCXSCC 840 

CTGCTGOGTG CX:CXXX:aGCT ACAATCCCAT GGTGCTCATT CAAAAGACOS ACACOGGGGT 900 

GTCGCTCCAQ ACCTATGATG ACTTGTTAGC CAAAQACTGC CACTGCATAT OAGCASTCCT 960 

GGTCCTTCCA CTGT6CA0CT GOSOGGGGGA GGOQAOCTCA GTTGTCCTGC CCTOTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCQATTCC TGCCC3UUICA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TOSGGGGCTO GTCTGATGGA 1140 

ACTOTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 

AAAA 1204 

Seq ID NO: C432 DNA Sequence 

NUcIeic Acid Accession #: NM_052858.1 

Coding sequence t 54.. 1259 " 

1 11 21 31 41 51 

11(111 

GGCAC3GAGGT GTTGCCCTCA GQTCGCTCCC GGGCQC3GGAC AOGGAACCOG GCCATGQAAG 60 

ATCGGTCX3GG GGCTCX3CX3AG CCCCGGGCCC GGCOOAOAGA GOGQGACCO} GGAGGGOGCX: 120 

CCCACCCAC3A CCAAOGOCGC ACCCACGATC GACOGCJOOOA GOGACCXXSGG GACCCGCQCA 180 

GOAAaOSAAG CAGCOACX3GG AAC0G60QAA GG6A0GGG6A CGGGGACCC6 AAGAGAGACC 240 

AGGAGAGGGA G6GGAACXXC GACCGGAACC GGQAC0G6GA GAGGGAGAGA GAGAGGGAAA 300 

QAGACCOOOA COQAGOCCXX: CGCCGGGACA CACACAGGGA CGOGGGCCCT CGCGCAGGTG 360 

AACACX3GAGT TTOGGAAAAA CCGOGCCAAA GOOQQACGCG GGAOGQAOCC CGGGGACTGA 420 

CCTGGGAOQC AGO0GG6CCT CCTG0GCCXX3 G8CCCT0GQA AGCCOOSGAO CC6CCGCAGC 480 
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OGCAGAGGAA 
ATCTGCCCTC 
AAQGACTCCT 
TGCTG6AGGT 
CAGGOGGCTA 
CTTACAGTGG 
ACCAGCTAAA 
TCTGCTGCCT 
TG6TGAC0GA 
TCTACTTCCA 
TGTACCAAAG 
CTGGAATCTT 
A GGGCTA CCG 
GGTTTCTAAA 
TTCOCAAGAA 
GTGTGGTGGG 
CTTCTGGA6T 
TGGC06GCCT 
ATCACTTQAO 
TAAAATACAA 
AGGCTGAGOC 
OGCCACTGCA 



GGGAGACCCC 
GACCCCCAGO 
GGAAT6CCAC 
TCTGCrOAAC 
CACG6GCATC 
CTTTGATGGT 
GCTGCCCATQ 
CTTCGTTGCC 
A6GCTTGTT6 
CTACCTCTCT 
CAAAGGCTAC 
TGCTGCCCTG 
AAAAGTTAGG 
AGGGTCTGAC 
TCCCTTOTTG 
CGGAGCTCCC 
CCTCTGTGAG 
GGGGGGCTCA 
GTCftOGAGCT 
AAAAATTAGC 
AGQAGAATOS 
CTCCAGCCCA 



GGGCGCCGCA 
CCTGGACGAG 
AAAT6CAAAT 
TTGCTGATOC 
ACCAGCTTOG 
GCTGACGGGG 
GTCACTGTGG 
ATGGGTGTCC 
GACATGCTCA 
6CT6CCTAT0 
AGC6GTTTCG 
G0CATT6TGG 
AAGCTAAAAG 
AGAT6GAAGT 
TGGAAGTTTC 
A6TCGCATGG 
TGAGGGACCA 
GACCTGTAAC 
GQAOAOCAGC 
CAGG08TG6T 
CTTQAATCTG 
GGTGACAGAO 



GACCCGAAAG 
AGGAGGTGGA 
ACTTGTGCAC 
TGGCCT6CA6 
G6GGCATTTA 
AGAAOGCCCA 
CAATGGCCTG 
TGCGGOTCCC 
TOGOGGGGGG 
GCTCTCCTGT 
GCTGCAGTTT 
TCTTTGCCCT 
AGAAGCCAGC 
GOTOG TOGAA 
CAATGCTGGA 
AGCQOTOTTC 
ATCAAAATTA 
CCCAGCACTT 
TTQGCCAACA 
GGOGGGOGCC 
GQAOGCGGAG 
OBAGACTCCA 



TGAACCCCCT 
ATATTACCAG 
TGGGAGA6GC 
CTCTG TO TCT 
CTACTATCAO 
GCAACTG6AT 
TAGTGGAGCC 
GTGGCATTGT 
GTACATCCCG 
GTGTAAAGAG 
CCA0GGA6CA 
GGGGGCGGTC 
AGAAATGTTT 
GSTAGTCIGA 
AAAGC3^0GA 
ATGGATGCAA 
TTTTTCAAAA 
TGGOAOGCTG 
ItSQTGAGCCC 
TGTAATGOCA 
ATTQCAGTGA 
TCTCAAAAAA 



TOGGAOAQAT 
TCAGAGGCX3G 
T6CT0CCAAA 
TACAGTTCCA 
TT066AGGGG 
OTCCAGTTCT 
CTCACAGCCC 
CCACT6TTGC 
GCCTTGTACT 
A0GCAG6GGC 
GATATAGQAG 
CTGGCCATAA 
GAATTTTAAG 
GCCACTGCCT 
GCCAGOGTTG 
CAGACCCTGG 
AGCAAAAAAA 
AGGTGGGT6G 
COOTCTCT A C 
GCTACTTGGG 
GGGQAGATCC 
AAAAAAAAAA 



Seq ZD MOs C434 DHA Sequence 

Nucleic Acid Accession fit Eos sequence 

Coding sequence: 261.. 2 861 



51 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 



1 11 21 31 41 

I 1 I 1 I 1 

GAGCTAGCGC TCAAGCA6AG CCCAGGGOGG TGCTATOGGA CAGAGCCTGG CGAGGGCAAG 60 

CX5GCG0GGGO AGCCAGCGGG GCTGAGCGCQ GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACXSCCCOSGG AGCTCGCGGC GCCTGGCG6T CAGCGACCAO 180 

AOGTC06GGG CGGCTGGOCT CCT6GCCGGC GAG6CGTGAC ACTGTCTCX3G CTACA6ACCC 240 

AGAGGGAGCA CACTQCCAGG ATGGGAGCTG CTGGQAGGCA GGACTTCCTC TTCAA060CA 300 

TGCTGACCAT CAGCTGOCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC T6ACCAQA6C CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGAOC 420 

ACCAT6TGCA TATCGGCCAG GGCAAGACAC TQCTGCTCAC CTCTTCTQCC ACX3GTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAA6CTGG TCATTAAAGA CCA08AGQAG CCGATTGTTT 540 

TGGQAACGCQ 6CACATQCI6 ATTGACAA05 GAG6AGAGCT aCATQCTGGG MTQCOCSCT 600 

GCCCTTTGCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATQAA GGTATTCAGC 660 

06GATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGOQCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TQAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCX3 TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA 6TCATCCATT CTGACCGGTT TGACACCTAT A6ATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAOTATTTGA ACGOGOTGCC GGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCX3AAATC TGGAT6ACAT GGCCAGGAAG G06ATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTQ GATTTA6ACA CCCTTGGA6T TTTCTAACTG 1060 

TQAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATOSAGGCr 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAOAGCA TQGCGAAXAT TTCAATOTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAQAOGTGG AGTGGAC3GQA GTGGTTCX3AT CATGATAAAG 1260 

TATCTCAGAC TAAA6GTGGG 6A6AAAATTT CAGACCTCTG GAAAGCTCAC CCAGQAAAAA 1320 

TATQCAATCO TCCCATTQAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAOGATTATA GGTTTGCTTG CTACGACX^GG GGCAGAQCXTT 1440 

GCCGGAGC7A C06TGTACX3G TTCCTCTOTQ GQAAGCCTGT GA6GCCCAAA CTCACAGTCA 1500 

CCATTQAC3VC C3\ATGTGAAC AGCACCATTC TGAACTTQGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAQA TACCCTGQTC ATTQCCAGTA CTGATTACTC CATOTACX3W3 GCAGAAGAGT 1620 

TCCAGGTGCr TCCCIGCAGA TCCT6CGCCC CCAACCAGGT CAAAGTG6CA GGGAAACCAA 1680 

TGTACCTGCA CATC6GGGAG GAGATAOACQ GOGTGGACAT GCGGGOGGAO GTTGGGCTTC 1740 

TGAGCGGQAA CATCATAQTG ATGGGGGAGA TGGAG6ACAA ATGCTACCCC TACAGAAACC 1800 

ACATCT6CAA TTTCTTTGAC TT06ATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACrTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCOGAT TCACTTCCAC CTGGCOGGTG ATCTAOAOQA AAGGGGAGOT TATGACCCAC 1980 

CCACATACAT CAGQGACCTC TCCATCXS^TC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGAC3GTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCAC3GGAAGA TGGGCCX3QAG GAAOQCRACA CTTTTGACCA CTGTCTTGGC CTCCITGTCA 2160 

. AGTCTGGAAC CCTCCTCCCC TOGGACOGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTCCTACXC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTQTO TCCACCTTCT 2280 

GGATGGCXyUl TCCC3UICAAC AACXTTCATCA ACTGTGCOGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CAOGTACCAA CX^GGCCCCTC CX3rPQGQAATQ TACTCCCCAG 2400 

QTTATTCAOA QCACATTCCA CTOGQAAAAT TCTATAACAA COSAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATA6AC AAOGGAGTCA AAACCACOQA G6CCTCTGCC AAGGACAA6C 2520 

GGCXX7TTCCT CTOVATCATC TCTGCCAGAT ACAGCCCTCA CCAGGAOGCC GACCCGCTGA 2580 

AGCCCCX3GGA GGGGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2640 

CCTGGCTGCO GGGOGGGGAT 6TGT6GCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAQG 2700 

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCTCA6GGA TTCA6CCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2820 

CGGGGTCOCC CTTTGCTCAC GT C T C TCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAO 2880 

GGQA6CAATG GQCTTTGCTG CTTATGAGCA CAGAGQAATT CAGTCCCCAQ GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTQA AGTCXyiCAaA AQTGAQCTCC T6CCTTA0GQ CCTCATTTGC 3000 

TCTTCATCCA GGGAACTQAG C3VCAGGGGGC CTCCftGGAGA CCCTAGATGT GCTOGTACTC 3060 

CXTTOGGCCTG GQATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120 

TAACAGATGG GGAAAGTQAG CXIXXXIAAOAT 6GGAAA0AAC CACACAGCTA AGGGAGGGCC 3180 

TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCOCAGA 3240 

OTGOCCAGGC ACTCCTOAOG TA0CTTCT6G AAATGGGGAC AAGTCGCCTC GAAGGAAAGG 3300 
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AAATGACTAG AOTAQAATGA CAGCTAGCAO ATCTCTTCCC TCCTGCTCCC AGOGCACACA 3360 

AACCO G CCCT CXrCCTTGGTQ TTQGCGGTCC CTGTGGCCTT CACTTTOTTC ACTACCTGTC 3420 

MCCCMSCCT GGQTOCACAG TAGCTGCAAC TCCCCATTQG TGCTACCTGG CTCTCCTGTC 34 BO 

TCTGCAGCTC TACAGGTGAG GCCCAGCA6A GGGAGTAGQG CTOGGCATGT TTCTQGTGAG 3540 

5 CCAATTTGGC TGATCTTQGG TGTCTOAACA 6CTATTGGGT CCACCCO«3T CCCTTTCAGC 3600 

TGCT6CTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3660 

TAAGAGGQAG AACTCTATCT GTGGTTTATA ATCTTGCACX5 AGGCACCAGA GTCTCCCTOG 3720 

GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780 

^ ^ A0AGG6CCTG GCTGGCTCCC TCCACCCAAC TGCACCCAIG AGAGTCSQCnC CAAGAGTCCA 3840 

10 TTCCCCAGGT GGQA6CXAAC T8TCAGGGA0 GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3900 

GGGAGGTGAC CATAGGGCTC TGCTTTTAAA 6ATATGQCTG CTTCAAAGGC CAGAGTCACA 3960 

GGAAGGACTT CTTCCAGGGA QATTAGTQGT GATCGAQAGG AGAGTTAAAA TGACCTCATG 4020 

TCCrrCTTGT CCACQGTTTT aTTGAGTTTT CACXCTTCTA ATGCAAGGGT CTCACACTGT 4080 

GAACCACTTA OGATGTQATC AGTTTCAGGT GGCX3tf3GAAT GTTQAATGTC TTTGGCTCAO 4140 

15 TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAOTT OTACATATQT TTCACA6TAC 4200 

AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AA6AGCCAAT ATCTAGGCAT 4260 

TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTO TCCTCCTTGT TATTTCTGTT 4320 

TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAAOG CTCCTCTGAA ATGCTTGTCT 4380 

'ivrn ' eitn '' r gcogaaatao ctggtocttt ttggggaott a qatgi ataq AjGiGTrroTA 4440 

TGTAAACATT TCTTGTAGGC ATCAGCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 4500 

ATGTGCACTT CAAGAAGTCA CT6TCAQAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4560 

GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACC7AQAG OCAAGGAAAT TGGCTCT6GT 4620 

TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG AT6TCAAAAA AAAAAAAAAA 4680 

AAAAAAAAAA AAAAAAAAAA AA 4702 
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8eq XD NOi C217 Protein Sequence 
Protein Accession #t NP 005605.1 



^^1 11 21 31 41 51 

30 I I I I I 1 

MVGKMWPVLW TLCAVRVTVD AISVETPQDV LRASQGKSVT LPCTYHTSTS SREGLIQWDK 60 

UiLTHTERW IWPFSNKNYI KGBLYKNRV8 ISNNAEQSDA SITIDQLTMA DNGTYECSV8 120 

LMSDLEGIITK SRVRLLVLVP PSKPBG6IE6 ETXIGNNZQL TCQSKEGSPT PQYSttKRYNI 180 

XiNQEQPLAQP ASGQPVSIiKN ZSTDTSGVyi CT881IEEGTQ PQIITVAVRS P8MNVALYV6 240 

35 lAVGWAALI ZZGZIZYCXX! CR6KDDNTBD KEDARPNHEA YESPPEQIiRB LSRBREBEDD 300 

YRQEEQRSTG RESPDHLDQ 319 



Seq ZD NOt C21B Protein Sequence 
Protein Accession #: Eos sequence 



1 XI 21 31 41 51 

i I I I I I 

MGSRTPESPL KAVQLRWGPR RRPPLIiPIiLL LLLPPPFRVG GFMU>AEAPA VLSGPPOSFF 60 

GFSVBPyRPG TDGVSVLVGA PKA2ITSQPGV LQGGAVYLCP HGA8PTQCTP lEFDSKQSRL 120 

45 LBSSLS88EG EEPVBYKSLQ WFGATVRAH6 SSZLACAPLY SWRTSKEPLS DPVGTCYXiST 180 

DNPTRILEYA PC31SDFSWAA GQOYCQGGFS AKFTKTGRW LGGPGSYFWQ GQZLSATQEQ 240 

ZAESYYPBYL ZNLVQGQLQT RQASSZYPDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 

GYVTZLNOSD ZRSLYMFSGB QMASYFOYAV AATDVKQDQL DDLLVGAFUt MDRTFDGRPQ 360 

^ EVGRVYVYLQ RPAGZEPTPT LTLTGRDEFG RFGSSLTPL6 DLDQDGYNDV AZ6APFQQBT 420 

50 QQGWFVFPG GP6GL6SKPS QVLQPLWAAS HTPDFFGSAL RGGRDUXJtlG YPDLXVOSFG 480 

VDKAWYRGR PIVSASASLT XFPAMFNPBB RSCSLBGNPV ACZNLSFCUJ ASGKHVADSX 540 

GFTVEIXiliDW QKQKGGVRRA LFIiASRQATL TQTIiLXQNGA REZ>CREMKZY LRNESEFRCK 600 

liSPXHXALNF SliDPOAFVDS KGIAPALEZYQ SKSRZEDKAQ ZIiLDCX3EDNZ CVPDLQI4EVF 660 

GEQNSVYLGD lOTALNLTPHA QNV^XSGAYE AELRVTAPPB ABYSGLVRHP GNFSSLSOJY 720 

55 PAVNQSRI*XtV CDL6NPMKA0 ASLWGQIiRFT VFHUCDTKKT ZQFDFQZLSK NLNNSQSDW 780 

SPRLSVEAQA QVTLNGVSKP EAVLFPV8DW HPRDQPQKEE DLGPAVHHVY ELZNQGPSSZ 840 

SQGVLELSCP QALB6QQLLY VTRVTGLNCT TMHPZNPKGIi ELDPEGSLBH QQKREAP8RS 900 

SASSGPQZLK CPBABCFRZA CELGPLHQQB 6Q8LQLHFRV NAKTFLQREH QPFSLQCBAV 960 

YKALKMPYRZ LPRQLPQKBR QVATAVQWTK AEGSYGVPIM IZZLAZLFGL IiZiXiGLLXYXL 1020 

OO YKLGFFKRSL PYGTAMEKAQ LKPPAT8DA 1049 



Seq XD NOj C2X9 Protein Sequence 
Protein Accession ft: 1IP_002412.1 

1 11 21 31 41 51 

I I I I 1 I 

MHSFPPLLLL LFWGW8HSF PATLBTQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRKS6FV 60 

VBKLKQMQBF FGLKVTGKPD ABTLKVMKQP RCGVPDVAQF VLTEGNPRHE QTHLTYRXEN 120 

YTPDLPRADV SBAIBKAFQIi NSHVTPLTFT KV8BGQADIM ISPVRGDHRD N8PFD0PG6N 180 

LAHAFQPGPG XGGDAHFDED ER14TNNFRBY HIARVAAHBL GHSLGLSHST DZGALMYPSY 240 

TFSGDVQLAQ DDXDGZQAZY GRSQUSVQ^X GPQTPKACDS KLTFDAITTZ RGEVMFFKDR 300 

FYMRTNPFYP EVELNFZSVF WFQLPNGLBA AYEFADRDEV RFFKGNKYWA VQGQKVLHGY 360 

PKDXY8SF6F PRTVKHXDAA LSEENT6KTY FFVAHiOnfRY DEYXRSMDPG YPKMIAKDFP 420 

GX6HKVDAVF KKDOFFYPPH OTRQYKFDPK TKRXLTLQKA NSHFHCRXN 469 

Seq XD NOt C220 Protein Sequence 
Protein Accession it Bos sequence 



_^ 1 11 21 31 41 51 

80 I I i I I I 

MHSFPPLLLL LFWGWSHSF PATLBTQEQD VDLVQKYLEK YYMLKHDGRQ VEKRRNSGPV 60 

VEKLKQMQEP FGLKVTGKPD ABTLKVMKQP ROGVPDVAQF VLTEGNPRWE QTHLTYRXEN 120 

YTPDLPRADV DHAIBKAPQL WSNVTPLTFT KVSEQQADXM XSFVRGDHRD NSPPDGPGGN 180 

LAHAFQPGPG ZGGDAKFDED BRNTNNPRBY NIiHRVAAHAL GHSLGLSHST DIGALKYPSY 240 
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TPSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTPDAITTI RGBVMPPKDR 300 

PYMRTNPFYP EVKLNPISVP WPQLPNQLEA AYEPADRDEV RFPKGNKYMA VQGQNVLHOY 360 

PKDIYSSFGP PRTVKHIDAA LSEEMTGKTY FFVAKKYKRy DSYKRSMDPQ YFKMIAHDPP 420 

^ 6X6HKVDAVP NKDGPFYPFH GTRQYKFDPK TKRII«TIiQKIi NSWFNCEUCN 469 

8eq XD HO: C221 Protein Sequence 
Protein Accession 9: NP_055146.X 

1 11 21 31 41 51 

10 I I I I I I 

MVRRPWSTI SKGGYIiQGNV MGRLPSLGNK BPPGQEKVQL KRKVTI/LROV 8IIIGTXIGA 60 

GIPISPKGVI. QNTGSVGMSL TIWTVCGVLS LPGALSYABL GTTIKKSGGH YTYILEVPGP 120 

LPAPVRVHVE LLIIRPAATA VI8LAPORYI LBPPPIQCEI PELAIKLITA VGITWMVIJI 180 

8MSVSNSARX QIFLTPCKLT AlbXXXVPGV KQLIKOQTQN Fia>AF8GRDS SXTRLPIAFY 240 

IS YGKYAYAaWF YUIFVTEEVB OTSKTIPIAZ CZ8MAZTI6V yVLTNVAYPT TINAEBLLIiS 300 

NAVAVTPSER LLQNPSIAVP IPVALSCFGS MNGOVPAVSR LPYVASREGH LPBH.SMIHV 360 

RKHTPLPAVI VLHPLTMIML PSGDUJSIiLM PLSPARWLPI GLAVAGLIYL RYKCPDMHRP 420 

FKVPLPIPAL PSPTCLFKVA LSLYSDPPST OIQPVITLTO VPAYYLPIIW DKKPRWPRIM 480 

8EKITRTLQX XIiEWFBEDX L SOI 



20 



Seq XD NOt C222 Protein Sequence 
Protein Accession #: HP 003237.1 



1 11 21 31 41 51 

25 1 I 1 I I I 

MGIAWGLGVIi PLMBVCXTTMR IPESGGDNSV PDIPELTGAA RKGSGRRLVK GPDPSSPAPR 60 

lEDANLIPPV PDDKPQDLVD AVRAEKGPLL LASLRQMKKT ROTLLALBRK DH8QQVPSW 120 

SN6KAGTUDL SLTVQGKQHV VSVEEALXAT GQWKSXTLPV QEDRAQLYID CBKMEtlAELO 180 

VPIOSVFTBD LASIARLRIA KQGVMDNFQO VLQNVRFVFO TTPSDXUtNK GCSSSTSVUi 240 

30 TLDNNWNGS 8PAXRTNYXG KKTKDIiQAIC GISCDBLSSM VLBMIGIATI VTTMJDSIRK 300 

VTEBNKEIiAN ELRRPPI*CYH NGVQYRNKEE WTVDSCTECH CQNSVTICKK VSCPIMPCSN 360 

ATVPDGECCP RCWPSDSADD GWSPWSBWTS CSTSCGNGIQ QRGRSCDSLN NRCEGSSVQT 420 

RTCHIQBCDK RPKQDGGWSH HSPWSSCSVT OGDGVITRIR LCNSP8PQMN 6KPCB6EARE 480 

TKACKKDACP lUGGHGPWSP HDXCSVTOGG GVQSSRSRUai NPAPQFOGKD CVGDVTBNQI 540 

35 C3IKQDGPXD6 CLSNPCPAGV KCTSYPDGSW KCGACPPGYS GI9GIQCTDVD BC3CEVPDACP 600 

NHNGEHRCEN TDPGYNCLPC PPRPTGSQPF GQGVEHATAN KQVCKPRNPC TDQTHDCNKN 660 

AKCNYLGHYS DPMYRCECKP GYAGNGIICG EDTDLDGWPN BNLVCVANAT YHCKKDNCPN 720 

LPNSGQEDYD KDGIODACDD DDDNDKIPDD RCNCPPHYKP AQYDYDRDDV GDRCDHCPYN 780 

BNPDQADXDH HGEGDACAAD XOGDQXX^ER DNOQYVYHVD QRDTDHD6VG DQC3}NCPLBR 840 

40 NPDQLDSDSD RXQOTCDNNQ CIOEDGHQNN LDNCPYVPNA NQADHDKDGK GDACDHDDDN 900 

DGIPDDKiniC RLVPNPDQKD 8DGDGRGDAC KDDPDHDSVP DXDDICPBNV DISETDPRRP 960 

QMIPIjDPKGT S<»IDPNWVVR HQQKBLVQTV NCDPGIiAVGY DEPNAVDFSG TPPIMTERDD 1020 

DYAGPVFGYQ SSSRPYWMW KQVTQSYWDT KPTRAQGY6G L6VKWKSTT GPGEHliRNAIi 1080 

HHTGNTPGQV RTLHHDPRHI GWKDFTAYRH RLSKRPKTOP XRWMYBOKK IMADSOPXYD 1140 

45 KTYA66RIX3L PVP8QEKVFP 8DLKYECRDP 1170 

Seq XD NOz C223 Protein Sequence 
Protein Accession #i NP_002183.1 

50 1 11 21 31 41 51 

I I i I I 1 

MPLLWLRGPL LASCWIIVRS SPTPGSEQHS AAPDCPSCAL AALPKDVPNS QPEMVEAVKK 60 

RlLNMUiLKK RPDVTQPVPK AAZ^IJIAXRKL HVGRVGENGY VBXEDDIORR AEMNELMEQT 120 

8BXITFAESG TARXTI£FBX SKEGSDIiSW BRAEVHLPLK VPKANRTRTK VTXRLFQQQK 180 

55 BPQGSLDTGE EAEEVGLKGE R8ELLL8EKV VDARKSTWHV PPVSSSXQRIi IiDQGKSSIiDV 240 

RXACEQGQES GASIiVX«IiGKK KKKEEEGEGK KKOGGEGGAO ADEEKEQSHR PPLMLQARQ8 300 

EDHPHRRRRR GIjECDGKVNX CCKKQFPV8P KDIGWMDWIX AP8GYHANYC EGECPSHIAG 360 

T6GSSLSFB8 TVXMHYRMRO HSPPANLKSC CVPTfOiRPMS MLYYDDGQKX IKKDXQNMIV 420 

EBGQCS 426 

60 

Seq ZD NO: C224 Protein Sequence 
Protein Accession fti NP_000086.1 

1 11 21 31 41 51 

65 t I I 1 I I 

. MVPDTACVLL LTLAALGASG QGQSPIiGSDL GPQMLRELQB TNAALQDVRD WLRQQVRBIT 60 

FLKNTVMECD AO^IQQSVRT GLPSVRPLLH CAPGPCPPGV ACXQTBSGGR OGPCPAGFTG 120 

NQSHCTDVNB CNAHPCPPRV RCINTSPQFR CEACPPGY8G PTHQGVGIAP AKANKQVCID 180 

XNECBXG QH W CVPHSVCXNT RGSFQCGPGQ PGFVSDQASG GQRGAQRFCP DGSP8ECUBH 240 

70 ADCVLERD6S RSCVCRVGWA GNOXLOGRDT DLDGFPDBfO. RCPBPQCRKD NCVTVPNSGQ 300 
BDVDROaXGD ACDPDADGDG Vf'tTEKDNCPL VRNPDQRNTD EDKMGDACDN CRSQKNDDQK 360 
DTDQDGRGDA CDDDIDGDRX RNQADNCPRV PNSDQKDSDG DGXGDACDNC PQKSNPDQAD 420 
VDHDPVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQBD8D HDGQGDAOaD DDDNDGVPDS 480 
RDMCRLVPNP GQEDADRDGV (asVOQDDPDA DKWDKXDVC PEM AEVT ItTD PRAFQTWLD 540 
75 PEGDAQXDPN WWLNQGREX VQTMNSDPGL AVOYTAFMGV DFB6TPHVNT VTDDDYA6PX 600 
POYQDSSSPy WMWKQMEQT YWQANPPRAV ABPGIQLKAV K8STGPGEQL RliAIiHKr(H)T 660 
ESQVRLLWKD PRNVGWKDKK SYRWPLQHRP QVGYIRVRPY SGPBLVAD8N WUWTIlRGa 720 
RLGVFCFSQE HXXNAMIiRyR CNDTXPEDYB THQLRQA 757 

80 8eq XD NO: C225 Protein Sequence 
Protein Accession #i NP_612464 

1 11 21 31 41 51 

I I 1 I I I 
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KRPQGPAA8P QRLRGLLLLL UJQh?KPSSH SEIPKOKQXA QLRQRBWDL YNGMCUIGPA 60 

GVPGRDGSPG AMGIPGTPGI PQRDGFKOBK GBCLRBSFSB SWTSNyKQCS NSSUiyOIDL 120 

GKIABCTPTR MRSNSALRVL FSGSLRLKCR HACOQRWYFT PNGABCSGPL PXEAIXYLDQ 180 

GSPEMHSTZN XHRT8SVBGL CBGIQAOLVD VAZWVGTCSD YPKGDASTGW NSVSSIIXBB 340 

I.PK 243 

Seq ID NO: C226 Protein Sequence 
Protein Accession ft: NP_003216.1 

1 11 21 31 41 51 

I I I 1 i I 

MATMENKVIC ALVLVSMLAL GTLAEAQTBT CTVAFRERQN OGFPGVTPSQ CANXOCCFDD 60 
TVR6VPWCFY PNTIDVPPEB ECEF 84 

Seq ID MO: C227 Protein Sequence 
Protein Accession §: NP 056234.1 



1 11 21 31 41 51 

i I I ) I I 

MPKRAHW6AL SWLXItLKGR FRVAZiACFHP CACYVPSEVH CTFRSLASVF AGZARHVERI 
NL6FK8IQAL SETSFAGLTK hEUMlKGtJB IPSIPOGALR DLSSLQVFKP SYKKLRVITG 
QTLQOLSKUI RLHIDHNKIB FIHPQAFKGL TSLRLLHLEG NLLHQLHPST FSTPTFLDYF 
RLSTIKHIiYL AENMVRTLPA SKLRNMPLLE NLYLQGNPWT CDCEMRWFLB WDAKSRGILK 
CKKDKAYBQQ QIiCAMCFSPK XXtYKRBIHXL KDMTCLKPSI B8PLRQNR8R 8IEEBQEQEB 
DGGSQLZLBK FQLPQHSISL NHTDERGNMV m»VCDIKKPM DVYKIHUIQT OPPDZDINAT 
VALDFBCPMT RENYEKLHKL ZAYYSBVPVK LHRELMLSKD PRVSYQYRQD ADBBALYYT3 
VRAQILAEPE WVMQP8IDZQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ AR6RSWVMIE 
PSGAVQRDQT VIiEGGPOQLS CNVKASESPS ZFWVLPDGSZ LKAPMDDPDS KFSILSSGWL 
RZKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA BKDTVTIGKN PGESVTLPCN 
ALAIPEAHLS HZLFKRRXZN DLAHTSBVYM LFNOniSZPK VQVSDSQYYR CVAVNQQGAD 
HFTVGZTVTK K6SGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDBEin' SRRLLHPKDQ 
EVPLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 
ERWADZIiAKV RGKNLPXOTB VPPLZXTTSP PSLSLEVTPP PPAVSPP8AS FVQTVT6AEB 
SSADVPLLGB BBHVLGTZSS ASMGZJSKNHII GVZLVBPEVT STPLBBWDD LSBKTBBZT8 
TE6DLK0TAA PTbZSBPYEP SPTZJITLDTV YEKPTHEETA TBGWSAADV6 SSPEPT88BY 
BPPLDAVSIA ESEPMQYPDP DLETKSQPDB DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 
STIGEPGVPG QSHLQGLTDN IHLVKSSI*ST QDTLLIKKGM KEMSQTLQGG NMLEGDPTHS 
RSSBSEGQES XSZTLPDSTL GZMS8MSPVK KPABTTVGTL ZiDKDTTTVTT TPRQKVAPSS 
THSTHFSRRR PKGRRRLRPH KFRHRHKQTP PTTFAPSBTF STQPTQAPDZ KZSSQVBSSl* 
VPTAWVDNTV NTPKQLQIEK NAEPTSKXTTP RRXHOKRPNK RRYTPSTVS8 RASG8KPSP8 
PKEJKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 
SDGKEZKDDV ATNVDKHKSD ZLVTGBSITN AZPTSRSLVS TMGEPKBESS PVGFPGTPTW 
NPSRTAQPGR LQTDZPVTTS GEITLTDPPLXi KELEDVDFT8 EPLS8LTVST PFHQE6AGSS 
TTLS8ZRVSV ASSQABTTTL DQDHLBTTVA ILLSETRPQN KTPTAARMKE PA888PSTZL 
MSLGQTTTTK PALPSFRZSQ ASRDSKENVF LNYVGNPETE ATPVNNEGTQ HM8GFNBLST 
P8SDRDAPNL STKLELBKQV FGSRSLPRGP DSQRQDGRVH ASHQLTRVPA KPZLPTATVR 
LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK QFTTPRLS5T TIPLPLHMSK 
PSZPSKFTDR RTDQFNGYSK VPGNNNIPBA RNFVGKPPSP RZPHYSNGRL PFFTNKTLSF 
PQIX3VTRRPQ ZPTSPAPVHR ERXVZPGSYN RIBSHSTFHL DF6PFAPPLL HTPQTTGSP8 
TNLQNZPMVS 8TQSSZ8FXT SSVQSSGSFH OSSSKPFAGG PPA8XFH6LG BKPQZLTKSP 
QTVSVTAETD TVFPCEATGK PKPFVTVmCV STQALMTPNT RIQRFEVLKN GTLVIRKVQV 
QDRQQYMCTA SNIiBGLDRMV VLLSVTVQQP QUiASHYQDV TVYIiGDTIAM ECLAKGTPAP 
QISWIFPDRR VWCrrVSPVES RITLHENRTL SZKEASFSDR GVYKCVASNA AGADSLAZRL 
HVAALPFVXH QBKLENISLP PGLSXBIBCT AXAAPLP8VR NVLGDGTQXR PSQPLHGNLF 
VFPNGTLYXR NIAFKDSGRY ECVAANLVGS ARRTVQUIVO RAAAKARITG TSPRRTDVRY 
GGTLKLDCSA SGDPWPRILW RLPSKRMZDA LFSFDSRIKV FAMGTLWKS VTDXDAGDYI* 
CVARNKVGDD YWLKVDWM KPAKZEHKEE NDHKVFYGGD LKVDCVATGL PNPBISWSLP 
DGSLVNSFMQ SDDSGGRTKR YWFNNGTLY FNBVGMREBQ DYTCFABNQV GKDEMRVRVK 
WTAPATZRN KTYLAVQVPY GDfWTVACBA KGBFNPKVTH LSPTNKVIPT SSBICYQZYQO 
GTU.ZQ1UVQR SDSGNYTCLV RNSA6BDRKT VHXHVNVQPP KXN^IPNPXT TVRBXAA30S 
RiOiZDCKAEG IPTPRVLHAF PBGWLPAFY YGNRZTVHGN GSLDZRSLRK SDSVQLVCMA 
RNB6GEARLI VQLTVLBPME KPZFHDPZSE KITAMAGHTI SLNCSAAOTP TFSLVWVLPN 
GTDLQSGQQL QRPYHKADOM UIZSGLSSVD AGAYRCVARN AAGHTBRLVS LKVGLKPBAN 
XQyHNLVSZZ KGBTLKLPCT PF6AOQGRPS HTIiPNGKHLB OPQTZiGRVStj LDNGTLTVRB 
ASVFDRGTyV CRMBTEY6PS VTSZPVZVZA YPPRZTSBPT PVZYTRP6N7 VKLNCMAKOZ 
PKADXTMELP DK8HLKA6VQ ARLYOIRPLH PQ68LTI^ TQRDAGPYKC MAKNXI^SDS 
KTTYXHVF 



8eq ID NOt C228 Protein Sequence 
Protein Accession fti Eos sequence 



MP6TKLTRT0 
8RQYTVRYRE 
TPESAPTTAP 
FQNTFFHTPR 
8L0QQEGSCT 
MCYEDP7VSS 
PQGRNAKDUi 
8DTQDQKRTL 
SASPAHHAST 
P8SVLRDRSS 
S8PLSXGGXD 



11 
I 

APADYRVIUC 
KGELARWDYK 
EMLNVWPVNG 
liSNHLBQSPS 
QKDPSOAYI* 
LT6NSXJC8VA 
LDLKNKIIiAN 
RPPSRHGHSV 
QGTSHRPSIiP 
VHPGAKPASP 
GEDAPAtMSN 



21 
I 

TSQBDBLDVP 
QIANRRVLZE 
KPTWAASWD 
PILETLLXiPW 
IDIQTKQVNK 
ASKADVQQNT 
GGAFRKPQLR 
VAPGRTAVRA 
ASLNDNDLVD 
ARR7PBSGAA 
AP8RSZMSSS 



31 
I 

DDZ8VRVHSS 
NLZFDTVYEP 
ALPETEGKVK 
WMVCSIXSNAI 
DPQLEGSVFG 
BDNGKPBKPE 
AKKABBIiDXjQ 
RMPALPRREG 
8DEDBRAVGS 



VSSKLSSRTQ 



41 

I 

QSVI<VSHVDP 
AVRZ8QGBRD 
VCLL0TGLFS 
FSKSGFOTGE 
PCFLFYFLTF 
PSSP8PRAPA 
8TBITGEBBL 
VDKP0F9LAT 
LHPKGAPAQP 
SRLSPFHGGS 
VSEGAEASD6 



51 
1 

VLEKQKKWA 
QKWSTSVPQR 
VSSFQPSAKS 
AWDLTPKPSL 
MLDIGGPSPX 
SSQHP8VPAS 
GSREDSPNSP 
QPRPGAPPSA 
RPALSPSRQ8 
SRLLPTQPHI* 
BSHGDGDRBD 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
960 

1020 
lOBO 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2828 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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GGRQAEATAQ TLRARPASGH PHI*LRHKPPA ANGRSPSRPS IGRGPRLQPS SSPQSTVPSR 720 

AHPRVPSHSD SHPKLSSGIH QDEEDBKPLP ATWNDHVPS SSRQPISRCT BDLRRSPQRG 780 

ASLHRKBPIP BNPKSTGADT RPQGKYSSIA SKAQDVQQST DADTBGHSPK AQPGSTDRHA 840 

SPABPPAARS QQHPSVPRRN TFGRAPBQQP PPFVATSQHH P6PQ5RDAGR 8PSQPRLSLT 900 

5 QA6RPRPTSQ GRSHSSSDPY TASSRGMLPT ALQUQDEDAQ GSYDDDSTBV BAV3VRAPAB 960 

AARAKBAAAS LPKHQQVBSP T6AGAG60HR SQRGHAA8PA RPSRPGGPQS RARVPSRAAP 1020 

GKSEPPSKRP LSSKSQQSVS AEDEBEEDAG PFKGGKEDLL 8SSVPKHPSS STPRGGKDAO 1080 

GSLAKEBRBP AXALAPRGOS LAPVKRPLPP PPGSSPRASH VPSRPPPRSA ATVSPVAGTH 1140 

. ^ PWPRYTTRAP PGKFSTTPML SLRORMNSAR FRNPL8RQPA RPSYROGYKO RPMVBOKVIiP 1200 

10 GSNGKPNGQR IIKGPQGTKH WDUIRGLVL NAB6RYLQDS HGNPLRZKXiO GD6RTZVDLB 1260 

GTPWSPOGL PLFGQGRHGT PLANAQDKPI IiSLGGKPLVG LEVIKKTTHP PTTTMQPTTT 1320 

TTPLPTTTTP RPTTATTMQP TTTTTPLPTT TPRPTTATTR RTTTRRPTTT VRTTTRTTTT 1380 

TTPKPTTPIP TCPPGTLERH DDDGNLXMSS NGIPBCYABE DBFSGLETDT AVPTEBAYVI 1440 

YDBDYBFETS RPPTTTGPST TATTPRVIPB B8AI88PPBB EFDLAGRKRF VAPyVTYLNK 1500 

IS DP8APC8LTD ALOHFQVDSL DEZIPMDLKK SDLPPQHAPR NZTWAVEOC RSPVZVDHDK 1560 

ATPGDLVTQY LVYSASYEDP IRZiKPSTQAS SVTHIjPIBNXi KPtrTRYYFKV QAQNPHGYGP 1620 

ISPSVSFVTB SDNPLLWRP PGGEIiSGSaS LSMMZPATRT AMDGNM 1666 

^ Seq ZD NOt C229 Protein Seqaence 
20 Protein Accession ftt NP_00300S.l 

1 11 21 31 41 51 

111)11 

- MPLSZLVAXiC UfLKLALGVR GAPCBAVRZP KCRHMPNNIT RMPMHXiKHST QENAZZAZEQ 60 

25 YEBLVOVNCS AVLRFFFGAM YAPICTLBPL BDPZKPCKSV CQRARODCBP LMKMYKKSHP 120 

ESLACDELPV YDRQVCISPE AIVTDLPEDV KWIDITPDMM VQERPLDVDC KRL9PDRCKC 180 

KKVKPTLATY LSKNYSYVIH AKIKAVQRSG CNBVTTWDV KEIFKSSSPI PRTQVPLITN 240 

SSGQCPHXLP HQDVLIMCYE WRSRMMIiLEN CLVEKWRDQL SKRSIQWEER LQEQRRTVQD 300 

KKKTAGRT6R SNPPKPR6KP PAPKPASPKK NIKTR8AQKR TNPKBV 346 



30 



60 



Seq ID NO I C230 Protein Sequence 
Protein Accession #: NP 00S931.1 



35 1 11 21 31 41 51 

1 i I I I I 

MAPAAWUtSA AARALI*PPMIi LLLLQPPPLL ARALPPDVHH LHAERRGPQP VIHAALPSSPA 60 

PAFATQEAPR PA8SLRPPRC GVPDPSDGLS ARNRQKRFVI* SGGRWBKTDIa TYRXLRFPWQ 120 

LVQEQVRQTM AEALKVWSDV TPLTPTEVHB GRADZNXDPA RYHKCa^DLPF DQPGGZIiAHA 180 

40 FPPKTHRBGD VHFDYDETWT ZGDDQGTDLL QVAAEBFGHV U3LQHTTAAK ALMSAFYTFR 240 

YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN BIAPIiEPDAP PDACEASPDA 300 

VSTIRGELFP PKAGPVWRLR GGQLQPGYPA LA8RHWQGLP SPVDAAFEDA QGHIWFPQGA 360 

QYWVYDGEKP VLGPAPLTBli GLVRFPVHAA LVHOPEKNKZ YPPRGROYVfR PBPSTRRVDS 420 

PVPRRATDWR GVPSBIDAAP qOADGYAYFL RGRLYWCPDP VKVKALBGPP RLVGPDFFGC 480 

45 AEPANTFL 468 

Seq ZD NO: C231 Protein Sequence 
Protein Accession #t NP_076927 

50 1 11 21 31 41 51 

I 1 I I I I 

MGENDPPAVB APFSFRSLFG LDDLKZSPVA PDADAVAAQI LSLLPLKPPP IIVIGIIALI 60 

LALAIGIiGIR FDCSGKYRCR S3FKC1ELZA RCDGVSDCKD GEDEYRCVRV GGQNAVI4VF 120 

TAA8WKTMCS DDWKGHYANV ACAQLGFP8Y VSSDNLRVSS LEGQFRSBFV SXDSiLIiFDDK 180 

55 VTALRHSVYV REGCASGBW TLQCTAOGHR RQYSSRXVGG NMSLLSQHPW QASLQFQGYH 240 

bCGGSVITPL WIITAAHCVY DLYLPKSWTI QVOLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 

LGNDIALMKL AGPLTFNEMI QPVCLPNSEB NFPDGKVCWT SGWGATEDGQ DASFVLNHAA 360 

VPLZ8NRZCH HRDVYGGXIS PSKLCA6YLT GGVD80QGD8 GGPLVCQERR LWKLVGATSF 420 

GXGCAEVinCP GVYTRVTSPL DWZHEQMBRD LRT . 453 



Seq ZD NO* C232 Protein Sequence 
Protein Accession «> NP 003211 



^^1 11 21 31 41 51 

65 ! I 1 I II 

MLWKLTDNIK YEDCEDRHDQ TSNGTARLPQ LGTVGQSPYT SAPPLSHTPN ADFQPPYFPP 60 

PYQPIYPQ8Q DPYSHVNDPY SLNPLHAQPQ PQHPGWPGQR QSQESGLLHT HRGIjPHQLSG 120 

U)PRRDYRRH EDLLHGPHAL SSGLGDLSZK SLPHAIBEVP HVEDPGINIP DQTVXKKGPV 180 

SLSKSNSNAV SAIPINKDNIi PGOWNPNEV FCSVPGRLSL LSSTSKnCVT VAEVQRR1*8P 240 

70 PBCIiNASUja GVLRRAKSKN GGRSLREKU) KZGIiNIiPAGR RKAANVTUiT 8LVEGBAVHL 300 

ARDFGYVCET EFPAKAVAEF UmQHSDFNB QfVTRKNMLIiA TXQZCKEFTD XjIAQDRSPIA 360 

N8RPNPILEP GIQSCLTHFN LZSHGFQSPA VCAAVTALQN YLTBAIiRAMD KKYLSNNPNS 420 

BTDNNAKSSD KEEKHRK 437 

75 Seq ZD NO: C233 Protein Sequence 
Protein Accession ftt NP_002979.1 

1 11 21 31 41 51 

on I I I I > ' 

OU bOCGLAAALLV LVCTKALCSC AQVOTNKEIiC CLVYTSWQIP QKFZVDY8ET SPQCPKPGVZ 60 

LLTKRGRQIC ADPNKKWVQK YZSDLKLNA 89 



Seq ID NOt C234 Protein Sequence 
Protein Accession ft: NP 004054.1 
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1 11 21 31 41 51 

I I I 1 I I 

HJLQAHLHSL CLUfLYIATO YGQBGKPGOP LKFMTFSZYB OOBPSQXXFQ FKANPPAVTP 60 

ELTGETDNZF VIEREGLLYY NIUVU3RETRS THNU>VAAIiD AHGIXVBQPV PITIEVKDXN 120 

DNRPTFLQSR YBQSVRQNSR PGKPPLYVNA TDLDDPATPN GQLYYQIVIQ LPMINNVMYP 180 

QINNKTGAIS liTRBGSQBUl PAKMPSYNLV ISVKDMGGQS ENSPSDTTSV DIIVTEHIWK 240 

APKPVEMVEN STDPHPIKXT QVRWNDPGAQ YSIjVDKEKLP RFPF8IDQE6 DIYVTQPIiDR 300 

EEKDAYVPYA VA!a>ByGXPL SYPLBIHVKV KDXNDMPFTC PSPVTVFBVQ EHERLGNSIG 360 

TLTAHDRDEE NTANSFLNYR IVEQTPKLPM DGLPLIQTYA GMLQIAKQSL KKQDTPQYNL 420 

TIBVSDKDPK Tl/CFVQINVX DINDQXPXFE KSDYGMLTIA EDTNIGSTIL TIQATDADEP 480 

PTGSSKILYH IIKGDSBGRL GVDTDPHTNT GYVIIKKPLD PBTAAVSNIV PKAENPBPLV 540 

FGVKYNASSP AKFTLIVTDV NEAPQPSQHV FQAKVSEDVA IGTKVGNVTA KDPBGLDXSY 600 

SLRa]>TRGWL KIDHVTGEIP SVAPLDREAG SPYRVQWAT EVGOSSI«SSV SBFHLXIMDV 660 

NDHPPRLAKD YTGLFFCHPL SAPQSLIFEA TDDDQHLFRG raPTFSLGSG 8LQNDWBV8K 720 

INOTBARItST RaTEFGERBY WltXRXNDGG RPPLEGXVSIi PVTPCSCVEG SCFRPAGHQT 780 

GXP7V6MAV0 ILLTTLLVZG IILAWFIRX KKDXGISNVE SAQASEVKPL RS 832 

Seq ID KO: C235 Protein Sequence 
Protein Accession fit NP_004434.1 

1 11 21 31 41 51 

i 1 I 1 I 1 

MARARPPPPP SPPP6liI.PIiL PPLLLLPIiLL LPAGCRALBB TLMDTKWVTS ELAWTSHPBS 
GHEEVSGYDB AMNPIRTYQV aJVRESSQNM WLRTGPIWRR DVQRVYVEUC PTVRDCNSIP 
NXFGSCKETF NLFYYBADSD VASASSPFWM BNPYVKVDTI APDESF8RLD AQRVNTKVRS 
FGPLSXASFY LAFQDQGACM SLISVRAFYK KCASTTAGPA LFPETLTQAE PTSLVIAPGT 
CXPMAVBVSV PLKLYQJGDG EWMVPVQACT CATGHEPAAK BSQCRPCPPG SYKAKQGBQP 
CLPCPPNSRT TSPAASICTC HNNFYRADSD SADSACTTVP SPPRGVISNV NETSIiXLEHS 
BPRDliGGROD IiLYNVXCKKC HGAGGASAC6 RCDDNVEFVP RQLGLTERRV KISHLLAHTR 
YTFSVQAVMG VSGICSPXiPFR YAAVMXTTNQ AAPSEVPTLR LHSSSGSSLT LSWAPPERFN 
GVIIDYGMKY FBKSEGXAST VTSQVNSVQh DGIiRFDARYV VQVRARTVAG YGQYSRPAEF 
OTTSERGSGA QQLQEQI«PLI VGSATAGLVF WAVWIAIV CLRKQRHGSD SBYTEKLQQY 
lAPGMKVYXD PFTYEDPNBA VREPAKBIDV SCVKIEBVIG AGKFGEVCRG RLKQPGRREV 
FVAIKTLKVG YTBRQRRDFL SBASIMGQPD HPNIIRLEQV VTKSRPVMIL TEPMENCALD 
SFLRUTOGQF TVXQLVGKLR OIAAQMXyLS EMNYVHRDLA ASKXLVNSNL VCKVSDFGL8 
RFliEDDPSDP TYTSSLOGKl PIRWTAPEAI AYRKPTSASD VWSYGIVMWE VMSYGBRPYW 
DMSKQDVINA VEQDYRLPPP MDCPTAIiHQL MliDCWVRDRN LRPKFSQXVN TU3KLXRNAA 
SLKVIASAQS GMSQPLLDRT VPDYTTFTTV GDWLDAIKMG RYKB8PV8AG FASFDLVAQM 
TAEDLI*RIGV TLAGHQKKXL SSIQDMRLQM NQTLPVQV 

Seq ID HO I C236 Protein Sequence 
Protein Accession fit NP_001795.l 

1 11 21 31 41 51 

I I 1 I I I 

IffVGYVtDKD SPVYPGPARP ASLGLGPANY GFPAPPPAPP QYPDFSSYSH VEPAPAPPTA 
WGAPFPAPKD DWAAAYGPGP AAPAASPASL AFGPPPDP8P VPAPPGPGPG LLAQPLGGPG 
TPSSPQAQRP TPYBWMRRSV AAGGGGGSGK TRTKDKYRW YTDHQRLSLE KBPHYSRYIT 
IRRKSELAAN LGLTBRQVKI WFQNRRAXBR KVNKXKQQQQ QPPQPPMAHD ITATPAGPSL 
GGLCPSNTSL LATSSPMFVK EEFLP 

Seq ID NOt C237 Protein Sequence 
Protein AcceSBion «t NP_068813.1 

1 11 21 31 41 51 

I I I 1 I I 

KGSDRARK06 6GPKDFGAGL KYMSRHEKVN GLEBGVEFLP VNNVKKVEKH GPGRWWLAA 
VLIGLLLVLL GIGFLVWHLQ YKDVRVQKVP NGYMRITNEN FVDAYENSNS TEFVSLASKV 
KDAIiKLLYSG VPFliGPYHKB SAVTAFSEGS VXAYYWSEFS IPQHLVEEAE RVMAESRWM 
IiPPRAR8I»XS FWTSWAFP TDSKTVQRTQ DNSCSFGliHA fiOVBLMRFTT PGFPD5PYPA 
HAROQHALRO DADSVLSLTF RSFDXASCDB RGSDIiVTVYN TL8FMEFBAL VQLOGTYPPS 
YNLTPHSSQN VLLITLITNT BRRHPGPEAT FPQLPRMSSC GGRLRKAQGT PNSPYYPGHY 
PPNIDCTWNX BVPNNQHVKV RFRFFYLLEP 6VPAQTCPKD YVEINGEKYC GKRSQPWTS 
NSNKXTVRFB SDQSYTDTGF LAEYLSYDSS DPCPGQFTCR TGRCIRXELR CDGUADCTDH 
. 8DEU7CSCDA GHQFTCIQIKF CKPLFWVCDS VNDOGDNSDE QOCSCPAQTF RCSNGKOiSK 
SQQCNOKDDC GDGSDEASCP KVNWTCTKH TYRCUIGLCL SKGNPEOXac BDCSDGSDEK 
DCDGGLR8FT RQARWGGTD ADEGEWPWQV SXiKAIiGgGHI 0GA8L X8PN W LVSAAHCYID 
DRGPRYSDPT QWTAFLGLHD QSQRSAPOVQ ERRUCRIISH PFPNDPTFDY DIALLBLEKP 
ABY8SMVRPX CLPDA8BVFP AGKAXWVTGW GHTQYGGTGA LILQKGBXRV XNQTTCBtTLL 
PQQITPRMMC VGPLSGOVDS GQGDSOGPXiS SVEADORZFQ AGW8WGDGC AQSNXFGVYT 
RLPLFRDHIK EMTGV 

Seq ID NO: C238 Protein Sequence 
Protein Accession #> Bos sequence 

1 11 21 31 41 51 

I I I 1 I I 

MPPFLLLBAV CVPLP8RVPP SLPLQEVHV8 KBTIGKISAA 8KMMHCSAAV DIMFLLDGSN 60 

SVGKGSFERS KHFAXTVCDG LDXSPERVRV QAFQFS3TPH LEPPIiDSFST QQEVRARIiCR 120 

MVF K GG R T U r BLALKYliLBR GI»F6GRNASV PQXLXXVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRPPRH BBLHALASEP RGQBVXilABQ VBDA7NC3LF8 TLSSSAXCSS ATPDCRVBAH 240 

PCBHRTLEMV RBFAONAPCW ROSBRTLAVL AAHCPFYSHX RVFXtTHPATC YR TTCP GPCD 300 

SQPOQMGGTC VPG6LD6YQC LCFLAFG6EA NCMaXLSLBC RVDLLFLLDS SAOTTLDGFL 360 

1340 
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RAKVFVRRPV HAVLSEDSRA RVGVATYSRB LLVAVFVGBY QDVPDLVHSL DGIPFRGGETT 420 

LTQSALRQAA ERGFQSATRT GQDRPRRVW LLTESHSEDB VAGPARHARA RBLLLLGVGS 4 BO 

BAVRABLEBI TGSPKHVMVY SDPQOI.FKQI PBLQGXLCSR QRFGCRTQAL DLVFMU7T8A 540 

SVGPEHPAQM QSFVRSCAIiQ FBVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAZSQ 600 

APYLGGVGSA GTALLKIYDK VMTVQR6ARP GVPKAVWLT GGROAEDAAV PAQKLRKNGI 660 

SVIiWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWUXSEAK RFVNLCKPSP 720 

CMNSOSCVLQ NQSYRC3CCRD OMBGPHCEMR PLRRP 755 

Seq ID NOt C339 Protein Sequence 
Protein Accession #t Bos sequexiee 

1 11 31 31 41 51 

I I I I 1 1 

NPPFIiLLBAV CVFLFSRVPP SLPLQEVBV8' KETXGKXSAA SKKMHCSAAV DZMFLLD6SH €0 

SVGKGSPBRS KHFAZTVOXS LDISPERVRV GAFQFS8TPK LEPPU)SFST QQEVKARIXR 120 

MVFK6GRTET BLALKYIiLHR GLPGGRIIASV PQILIIVTDG KSQGDVALP8 KQLKER6VTV 180 

PAVGVRPPRW BELHALASBP RGQHVLItAEQ VEZJAIKGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCH RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPOQNaOTC VPGOLDOYQC XiCPIAFGQBA NCAUCLSLBC RVDLLFUDS SAGTTLDGFL 360 

RAKVFVKRFV RAVLSEDSRA RVGVATySRB U.VAVPVGEy QDVFDLVW8L DGZPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRFRRVW LLTESHSEDB VAGPARHARA RELLLLGVGS 480 

BAVRAELEBZ TGSPKRVMVY SDPQDLFNQI PELQGKLCSR QRFGCRTQAL DLVFKLDTSA 540 

SVGPENFAQM QSFVR8CALQ FBVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVOSA GTALUUYDK VMTVQRGARP GVPKAVWLT GGROAEDAAV PAQKLRNNGX 660 

SVLWGVGFV LSEGLRRXiAG PROSLIHVAA YADLRYHQDV LIEWLCX3BAX QPVNLCKPSP 720 

CMNBQSCVLQ NOSYRCKCRD GHBGPHCENR BHS8CSVCV8 QGWZLBTPLR HMAFVQEGSS 7 BO 

RTPPSNYRBG LGTEMVPTFW MVCSUPGP 807 

Seq ZD NOt C240 Protein Sequence 
Protein Accession ffi XP_097386.1 

1 11 21 31 41 51 

I I I I I r 

MPKSEPLGCL 6PA8RAP68A AATGANLPAA 8G0PGPL6PP CTCPPR8LGR 6RAG8RAGS8 60 

PSGCVCVSGX LRWSV6DPA SRRWVDLDSN SEDLSLLLTP HIVOTGGVGG GVIARGHVPAQ 120 

BKBVABGSGH AGRffllGRRLQ RVYGARSHZL GRKPCLQRLL FASGGPVQPQ PCPSPATACR 160 

WGPKFGVAFW GAAQHPPLGR LGG6RAVPSA TRTLDGF 217 

Seq ID NOt C241 Protein Sequence 
Protein Accession #: CAC03433 

1 11 21 31 41 51 

I I I I I 1 

MLSSTDFTFA SWELWRVDH PNEEQQKDVT LRVSGDZiRVO GVHLKLVEQI KISQDNSDFA 60 

LWWEQKHCHL LKTHNTLDKY GVQADAKLLP TPQHKHLRLR LFNLKMVRLR V8FSAWFKA 120 

VSDICKILNI RRSEELSLLK FSGDYFKKKK KXDXHNKEPZ lEDILNLBSS PTAS6SSVSP 180 

GLYSKTMTPI YDPINQTPAS 8TMTWFSDSP LTEQNCSILA FSQPPQSPEA LADMYQPRSL 240 

VDRAKLNAGW LDSSRSLMBQ GZQEDEQLLL RFKYYSFFDL MPKYDAVRIM QLYEQARHAZ 300 

LLBEZDCTEB EMLIFAAZiQY HZSKLSL8AB TQDFAOBSSV DBIBAALSNL BVTLBGGKAD 360 

SLLEDZTDIP KLADmiKLFR PKKLLPKAFR QYWFIFKDTS lAYFKNKELE QGEPLEKLNL 420 

RGCBWPDVN VAGRKFGIXL LIPVADGMNE MYLRCDHENQ YAQWMAACKL ASKGKTMADS 480 

SYQPBVLNIL SFLRMKNRNS ASQVASSLEN MDMNFBCFV8 PRCAKXRKSK QLAARZLEAH 540 

QNVAQHPLVB AXLRFZQAWQ SLPBFOLTYr LVRFKGSKKD DILOVSYNRL IKIDAATGIP 600 

VTTWRFTNZK QHNVNWBTRQ VyiEFDQKVF TAFTCLSADC KXVHBYIGGY XFLSTRSRDQ 660 

NETLDEDLFH KLTG6QD 677 



Seq ID N0« C242 DNA Sequence 
Nucleic Acid Accession #i NM_005170 
coding sequencei 337.. 918 

1 11 21 31 41 51 

I I I I I I 

GGGCX3TGAGA AAOGCJGAOGG OOGOGQGGOS 6AGGAGGGTT ATCTATACAT TTAAAAACCA 60 

OCaSCCTGGO GCGOGGCTGC GGAOACCTOG GA0AGTCXX3G CCGCAOGGGC GGGACAGQAG 120 

CGTCCCAOSC TCCCTOGCQC GTACJOGCXTTG CCACCACTAG GCCTCCTATC CCGGG6CTCC 180 

AGAOQACCTA GGAOGGGTGC CCXGGGQAGT TGCCTGG06Q OGGOOTGCXIA GAAGCCXXXT 240 

TQGQGCGCCA CAQTTTTCXX: C6T0QCCT0C G GTTCCTCX'G CCTGCACCTT CCT6CGGCX3C 300 

GCGGGGACCT GQAGOGGGGO GGTGGATGCA OGGGGGATGG AOGGGGGCAC ACTGCCGAOG 360 

TCOGOGCCCC CTGOQOCCGC OGTCCCTGTC GGCTGOGCTG CXXXtOOSGAG ACCOGCGTCX: 420 

CCGGAACTGT TG06CTGCAG C06G06GCGG CGACCGGCCA C0GCA6AGAC CGGAGGOGGC 480 

6CAGC6GCGG TAGGGGGG06 CAATGAGOGC GAGOQCAACC GC3GTGAA6CT GGTGAACTTG 540 

GGCTTCCAGG 06CT6CX3GCA 6CA0GTGCCG CAC3GGCGGCG CCA6CAAGAA GCRGAGCAAG 600 

GTGGABA06C TGG6CTCAGC OGTGGAOTAC ATCG6CQGQC TGCAGGGCXIT GCT66C06AG 660 

GAC6A0GC00 TGOGCAAOGC 6CTGGCX3GQA GGGCTOAGGC OGCAGGCOGT 60GGCCX3TCT 720 

GCGCCCOGCG GQCOGCCAGG GACCACCCOG GTOGCCGCCT CGCX:CTCCCG CGCTTCTTCX5 780 

TCCCCOGGCC GCGGQGGCAG CTCX5GAGCCC GQCTCCCCGC QTTCOGCCTA CTCGTCGGAC 840 

GACAGOGGCT GOGAAGGCGC GCTQAGTCCT GCGGAGGGCX3 AGCTACTCX3A CTTCTCCA6C 900 

TGGTTAOGGG 6CTACTGAGC GCCCTGGACC TA 932 

Seq ID NOt C243 Protein Sequence 
Protein Accession fti NP_060233.1 



1 11 21 31 41 51 
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MSGGHQU}liA 
LNIiTLEXJVFA 
SLASKASMAG 
KAHVRIELKE 
SQLATRRYQA 
PWLHQHRTCP 
PSRSAVARPP 
IQSTSQHPAA 
NCTDZSLQOV 
ETQVSSHVHY 
SNSAAPSGRL 
8BPTPGSRFQ 
NSQEVHIiCLT 
QAV 



I 

ALHPtfLLMAT 
GVAfilTPAEQ 
ERQASAVLPD 
PPAHPDYDVW 
8CRQAR0BWP 
LGVFHITBQD 
RPGPFLPSQB 
CFVPLRRARP 



LQAGF6RTGL 
KLKQSHPIiYL 
ITEDRAAAEQ 
IIjMTWGTIF 
DSGSSCSSAP 
9PSQSL6PSR 
PGMQPRKHRF 



HRHRHHHYKK 
SNPQCPRALP 
DATVHPACQI 
PRQPLBFHPP 



XfSSDFDPIiVY 
RFQWH6RKPO 
BPAPGPVDAS 
FPHYTPSVAY 



I 

VLAAAVBSBR 
C34ASDDDNLB 
LQQPIiGI'TWP 
VIILASVLRI 
VCAICLEEPS 
SYQBPGRRLH 
PRAAHPRAPG 
CTERSGYXiAD 
CSPK6DPQBV 
PBTGVPQSRP 
SICPSTSSIiP 
PW8PEAHPLI 
TAE6RPCPYP 



1 

SAEQKAVIRV 
PGFISrVKLE 
WLIUGNDAE 
RCRPRH8RPD 
BGQELRVISC 
LIRQHPGHAH 
EQQRLAGAQH 
GPASDSSSOP 
DMQPSVT8RP 
PIPRTQPQPG 
NLQKSSLSAR 
GGPPGLDKRL 
BGQVIiSAQPO 



I 

XPLKMDPTGK 
SPRRAPRPCL 

KI>KFVYKNQ 
PliQQRTAHAI 
liHEFHRNCVD 
YHLPAAYLLG 
PYAQGWO^SH 
CHGSSSDSW 
RSLDSWPTG 
PPSPDQQVTG 
HPQRKRRGGP 
LPBTPGPCYS 
SBBBIfBBLCB 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

783 



Seq ID KOi C244 DNA Sequence 
Ifucleic Acid Accession #t KM_004289 
Coding sequence t 4 93.. 1695 



11 



21 31 41 51 

I 1 I 1 I I 

GCCGCC6CCT OGTCCACC3G6 AGGA6CCX3GC GCCAG0GTG6 ACX3GOGGCA6 CCAGQCTQTG 
CAGGGGGGOQ GaSGGQAOCC COGAGOQGCT CGGAGTGGOC OCrTGGACGC CGGGGAAOAG 
GAGAAG6CAC CCGOGGAACC GACG6CTCA0 GTGCOSGAOG CTG60GGATG TGCOAGCXSAG 
GAGAATGGGG TACTAA6AGA AAAGCAOQAA GCTQTQGATC ATA6TTCCCA GCATGAGGAA 
AATGAAGAAA GGOTGTCAGC CCAGAAGGAO AACTCACTTC AGCAGAATGA TGATGATQAA 
AACAAAATAG CAGAGAAACC TGACTGGGAG GCAGAAAAGA CCACTGAATC TA6AAATQAG 
AOACATCTQA ATGGGACAGA TACTTCTTTC TCTCTGGAAG ACTTATTCCA GTTGCTTTCA 
TCACAGCCTG AAAATTCACT 6GAGGGCATC TCATTGGGAG ATATTCCTCT TCCAOGCAOT 
ATCAOTQATG GCATQAATTC TTCAGCACAT TATCATGTAA ACTTCAGCCA GGCTATAAGT 
CAGGATGTGA ATCTTCATGA GGCCATCTTG CTTTGTCCCA ACAATACATT TAQAAQAQAT 
CCAACAGCAA GGACTTCACA OTCACAAGAA CCATTTCTGC AGTTAAATTC TCATACCACC 
AATCCTOAOC AAAOCCTTCC TQGAACTAAT TTGACAGGAT TTCTTTCACC GGTT GACA AT 
CATATGAGGA ATCTAACAAQ CCAAGACCTA CTQTATOACC TTGACATAAA T ATATT TGAT 
GAGATAAACT TAATGTCATT GGCCACAGAA GACAACTTTG ATCCAATCGA TGTTTCTCAG 
CTTTTTGATG AACCAQATTC TGATTCTGGC CTTTCTTTAG ATTCAAOTCA CAATAATACC 
TCTGTCATCA AGTCTAATTC CTCTCACTCT GTGTGTGATG AAGGTGCTAT AGGTTATTGC 
ACT6ACCATG AATCTA8TTC CCATCATGAC TTAGAAGGT6 CTOTAQGTGO CTACTAOCCA 
GAAGOCAGTA AGCTTTOTCA CTTOaATCAA AOTOATTCTO ATTTCCATGO AGATCTTACA 
TTTCAACAOO TATTTCATAA CCACACTTAC CACTTACAGC CAACTGCACC AGAATCTACT 
TCTGAACCTT TTCCGTGGCC TGGGAAGTCA CAQAAQATAA GGAQTAOATA CCTTGAAGAC 
ACAGATAGAA ACTTGAGCOQ TGATQAACAG OGTGCTAAAG CTTTGCATAT CCCTTTTTCT 
STAGATOAAA TTGTGG6CAT GOCTGTTGAT TCTTTCAATA GCATOTTAAO TAOATATTAT 
CTGACAGACC TACAAOTCTC ACTTATCOOT GACATCAQAC OAAOAGGOAA AAATAAAGTT 
GCTG06CA6A ACTGTCGTAA ACGC3U\ATTQ GACATAATTT TGAATTTAGA AGATGATGTA 
TGTAACTTGC AAGCAAAGAA GGAAACTCTT AAGAGAGAGC AAGCACAATG TAACAAAGCT 
ATTAACATAA TOAAACAOAA ACTGCATOAC CTTTATCATG ATATTTTTAG TAGATTAAGA 
GATGACCAAG GTAGGCCAGT CAATCCCAAC CACTATGCTC TCCAGTOTAC OCATGATGGA 
AGTATCTTOA TAOTACOCAA AGAACTGQTO GCCTCAGGCX: ACAAAAAGGA AAOCCAAAAG 
GGAAAGAGAA AGTGAGAAGA AACTOAAOAT GGACTCTATT ATGTGAAOTA G TAAT GTTCA 
GAAACTGATT ATTTGGATCA GAAACCATTQ AAACTGCTTC AAGAATTGTA TCTTTAAGTA 
CTGCTACTTG AATAACTCAG TTAAOGCTGT TTTGAAGCTT ACATGGACAA ATGTTTAOQA 
CTTCAAGATC ACACTTGTGG GCAATCTQGQ GOAOCCACSUV CTTTTCATGA AGTGC ATTg T 
ATACAAAATT CATAGTTATG TCCAAAGAAT AGGTTAACAT GAAAACCCAG TAAGAC TTTC 
CATCTTGGCA GCCATCCTTT TTAAGAQTAA GTTGGTTACT TCAAAAAGAG CAAACACTGG 
QQATCAAATT ATTTTAAOAG QTATTTCAGT TTTAAAT6CA AAATAGCCTT ATTTTCATTT 
AGTTTGTTAO CACTATAGTG A6CTTTTCAA ACACTATTTT AATCTTTATA TTTAACTTAT 
AAATTTTGCT TTCT 



Seq ID NOi C245 Protein Sequence 
Protein Accession MP 004433 



MALRRLGAAL 
VC39VFBS8QM 
DSATKTFPNW 
YOGCMSLIAV 
CKGDGEWLVP 
TNCVCRNGYY 
ZCKSCX38GRO 
SPQFASVNIT 
YNATAIKSPT 
LPLIIGSSAA 
FTYEDPNEAV 
TBXQRRDFL8 
VIQLVGMLRG 
YT8ALGGKIP 
GQDYRLPPPM 
INLPLLDRTZ 
LAGRQXXZLN 



11 

I 

LIiLPLLAAVE 
NWLRTKFIRR 
MEMPWVKVDT 
RVFYRKCPRI 
IGRCMCKAGF 
RADLDPLDMF 
ACTRCGDNVQ 
TNQAAPSAVS 
NTVTVQGLKA 
GLVPLIAVW 
REFAKBXDIS 
EASIMGQFDH 
lAAGMKYLAD 
IRWTAPEAIQ 
DCPSALHQLM 
PDYTSPNTVD 
SIQVMRAQMN 



21 
1 

BTIMDSTTAT 
RGAHRIHVEM 
lAADESPSQV 
IQNGAIFQET 
EAVEMGTVCm 
CTTIPSAPOA 
YAPRQLGLTE 
IMHQVSRTVD 
GAIYVPQVRA 
lAIVOJRRRO 
CVKIEQVIGA 
FNVIHLEGW 
KNYVHRDLAA 
YRKPTSASDV 
LDCWQKDRNB 
EHLEAIKM6Q 
QXQSVEV 



31 
1 

AELGWMVHPP 
KFSVRDCSSl 
DXX3GRVMKIN 
L8GAESTSLV 
GCPSGTFKAN 
VISSVNET8L 
PRIYISDIiLA 
SITLSWSQPD 
RTVAGYGRY8 
FERADSBYTD 
GEFGEVC8GH 
TKSTPVMIIT 
RNILVNSKLV 
WSYGIVIWBV 
RPKPGQZVNT 
YKESFAZIAaF 



41 

I 

8GWEEVSGYD 
PSVPGSCKET 
TEVRSFGFVS 
AARGSaANA 
QGDEACIHCP 
MLEWTPPRDS 
HTQYTFEIQA 
QPNGVIWYB 
GKMYFCYIKTB 
KLQHYT8GHM 
LKLPGKREIP 
BFMENGSliDS 
CKVSDFGLSR 
MSYGERPYKD 
LDXMXRNPN8 
T8FDW8QMH 



51 
1 

ENMNTIRTYQ 
FNLYYYBADF 
R8GFYLAFQD 
BEVDVPIXLY 



GGREDLVYNI 
VNGVTDQSPF 
LQYYEKEliSE 
AEYQTSZQEK 
TPGMKIYIDP 
VAIKTLKSGY 
PLRQNDGQFT 
FliEDDTSDPT 
KmOOVINAI 
UCAMAPLSSG 
HSDILRVGLT 
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Seq ZD KOi C246 Protein Sequence 
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Protein Accession #i NP_11414B.1 

1 11 21 31 41 51 

C I I I 1 I i 

3 MDARRVPQXD LRVKKHLKKF RYVKLISKBT 8SSSDDSCDS FASDNPANTR LQSVRE8CRT 60 

RSQCRHSQPL RVAMKPPARS TRGATNKKAE SRQPSEH8VT DSNSDSEDBS GMNFLEKRAL 120 

NIKQNKAMLA KLHSBLESFP GSFRGRHPLP GSDSQSHRPR RRTPPGVASR RNPERRARPL IBO 

TRSRSRlIiGS LDALPMBBEE EBDKYMLVRX RKTVDGYMHB DDLPRSRRSR 8SVTLPHIIR 240 

PVBBXTEOOV GSRLQQPSXR RYXTVHWAU. VINAVRRLLX PXQTABTQTA GAFEASSVAP 300 

10 AFETVMVKR8 QOiCWIRTGI ARLVEESATA VSAGSEMDGV RL6SLCI 347 

Seq ID NO: C247 Protein Sequence 
Protein Accession #: NP_036577.l 

15 1 11 21 31 41 SI 

I I 1 I I I 

MEKPSPAAAL GKALCALLIiA TLGAAGQPLG GESICSARAP AKYSITPTGK WSQTAFPKQY 60 

PLFRPPAQWS SLLGAAHSSD YSMWRKNQYV SNGLRDFAER GBAWAIMKBI EAAGEALQSV 120 

BAVPSAPAVP 8GTGQT8AEL EVQRRHSIiVS FWRZVPSPD HFVGVDSIiDL GDODRIfREQA 180 

20 ALDLYPYDAG TDSQFTFSSP NPATIPQDTV TBIT8SSPSH PANSPYYPRL KAXtPPlARVT 240 

LVRLRQSPRA FIPPAPVLPS RDNEZVDSAS VPBTPLDCBV 8LWSSWGL06 GHG6RU3TXS 300 

RTRYVRVQPA NNGSPCPELB EBAECVFSNC V 331 

Seq ID NOf C248 Protein Sequence 
25 Protein Accession #i NP_063947.l 

1 11 21 31 41 51 

I I I I I I 

MLQDPDSDQP LNSLDVKPIA KPRIPMETPR KVGIPIIIAli LSLASIllW VLIXVILDKY 60 

30 YFLOGQPIiHP IPRKQXiCDGE LDCPLGEDEB HCVKSFPEGP AVAVRL5KDR STLQVLDSAT 120 

GNWPSACPDM FTEALAETAC RQMGYSSKPT PRAVEIGPDQ DLDWEITEN SQELRMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQKVC GGSILDPHWV 240 

LTAAKCFRKH TDVFNWKVRA GSDXLGSFPS LAVAKIIIZE FNPMYPKDMD ZALMXLQFPL 300 

TPSGTVRPIC LPPFDBBLTP ATPLWIIGH6 FTKQHGGKMS DILXiQASVQV IDSTRCNADD 360 

35 AYQGBVTEKM MCAGIPEGGV DTC3QGDSG6P LMYQSDQWHV VQIVSWOYGC GGPSTPOVYT 420 

KVSAYLNWIY NVWKABL 43*' 



40 



65 



Seq ID NO: C249 Protein Sequence 
Protein Accession ttt NP 003036.1 



1 11 21 31 41 SI 

I I I I I i 

MGCKVLUJIG QQMLRRKWD CSRBETRLSR ClOTPDIiVAL QVGSTLGAGV YVLAGAVARE 60 

NAGPAIVISF LIAALASVLA GI/nfGEPGAR VPKTGSAYLY SYVTVGBUIA FZTGWNLILS 120 

45 YIIGTSSVAR AWSATFDELI GRPIGBFSRT HMTLNAPGVI* AENPDIFAVI IIIilLTGLIiT 180 

LGVKESAMVN KIFTCINVLV LGPIMVSGPV KQSVKNWQLT EEDFGNTSGR LCLNNDTKBG 240 

KPQVGQFMPP GFSGVLSGAA TCFYAFVGPD CIATTGEEVK NPQKAIPVGI VASLLICPIA 300 

YFGVSAAIiTIi MMPYFCLDNN SPLPDAFKHV GWBGAKYAVA VGSLCALSAS LLGSMFPMPR 360 

VIYAMAEDGL LFKPLANVND RTKTPIIATL ASGAVAAVMA FLPDLKDLVD LMSIOTLLAY 420 

50 SLVAACVLVL RYQPBQPNLV YQMASTSDEL DPADQNELAS TNDSQLOPLP EAEMFSUCTI 480 

LSPKNMEPSK ISGLIVNIST SLIAVIillTP CIVTVIiGREA LTKGALWAVP LLAGSALUA 540 

WTGVIWRQP ESKTKLSFKV PFLPVLPILS IFVNVYLMMQ IiDQGINVRFA VHMLXOFIIY 600 

FGYGLWHSEE ASLDADQART POGNLDQCK 629 

55 Seq ID NOt C250 Protein Sequence 
Protein Accession #: 1IP_0D2 767.1 

I 11 21 31 41 51 

/:n I ' i ' ' ' 

OU MRAPHLHIiSA ASGARALAKL LPLLMAQLWA ABAALLPQITO TRLDPEAYGA PCARGSQPWQ 60 

VSLFNGIiSFH CAGVLVDQSW VLTAAHCGNK PLWARVGDDH UOiLQGEQLR RTTRSWHPK 120 

YBQGSGPILP RRTDEHDliML LKZiARPWPG PRVRALQLPY RCAQPGDQCQ VAGWGTTAAR 180 

RVKYNKGLTC SSXTILSPKE CEVFYPGWT NNMICAGLDR GQDPOQSDSO GFLVCDETLQ 240 

GZLSHGVYPC GSAQBPAVYT QXaOMSHXN KVaRSN 276 



Seq XS NO I C251 Protein Sequence 
Protein Accession Si XP 09508B.3 



„^ 1 11 21 31 41 SI 

70 1 I I I I I 

MTRAATABFG RVSPASPARS TA8LPRAFLQ SLRTULDIU) DWQRGCVHLR BIQSLWVEAR 60 

ELPSGVLEQL 6QRR6PQPGA AVRSRR66AV FRGARAVPER CAGTETRRGR RCS6LQRU3G 120 

GFRGCPADPC ARGEHRRHTI TS6VD0GLUC QMKELBQEXE VLLQGLEMMA QGRDHYQQQli 180 

QQVQERQCRL GQSRASADPG AVOSPRPLGR LliPKVQEVAR WLGBLLAKAC AGRALPTSSS 240 

75 GPPCSALTST 8SPGWQQQII LMUCEQNRLL TQBVTEKSER ITQLEQKSAL IKQLPEARAL 300 

SQQDGGbSPA GPHIBPLTRF RLPVLTHAOA UiSPHSPQLL LPLSADSGOP LHEL PDTWFP 360 

AVLLWVPSPG XRTAHARLBF RQRPABGAHQ LGCGAEAAPB TCGTLFHFES HKTTCEPDSL 420 

GGPCPQBGDR SHSHLGAAFD VAPAVAKVTP KREDAAGSRH GDICPLCPKG LLTPRDIAXE 480 

FSliAEWOdiD HAQQNLYRDV MLENYRNI*PS LOfTVSKPDL lACLEQNKEP QNIKRNEMAA 540 

80 KHPVTCSHFN QDIiQPEQSXK DSLQKVIFRT YGKOQEQILQ LKKOCKRVDE CEVHKGGYND 600 

UIQCLSNTQN KZFQTHKCVK VF8RPSNSIIR KNARYTGKKH LKCKKYGX8F CMF8HLNQHQ 660 

XIHTKEK8YK CEB0GK8FNH 888GTTHKRI LTGEKPYRCB BCXSCAFRWPS NLTRKKRIHT 720 

GBKPYACBEC GQAPRRSSTL IKBXRZHTGB RPYKCBBCX3K AP8VSSALIY HKRIBTGEKP 780 

YTCBBCGKAP NCSSTLKIHK XIHTGBKPYT CBBCGRTFHC SSTVKAHXRI HTOEKPYKCB 840 
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10 



45 



75 



ECDKAFKWHS SLAKHKIIHT GBKPYKCSDS KAIiAKSSEVQ KVYSGDOEHG IRVHKKKBTQ 900 

GWLVRNKNBN RTGLFQIRAA VRPNRDPSKQ QQBGSLTDPI QRKEEPDIiQH HYDHQNAIjBD 960 

QRNTGVGGUi TPRDWIBFS LBBWQCLDHA QQNLYRDVML ENYHMLVSLG lAVSXPDLXT 1020 

CLBQNKEPWI ZKBNEMVTKH FDIiPPBLGZK DSLQKVZPRR YGKSGHDHLQ VKTCKSMGBC 1080 

EVQXGGC3tEV NQCLSTTQNK XFQTHKCVKV FGKFGNSNRB KTRHTOKKHP KGXKYGKSFC 1140 

MVSQLHQHQI IHTRaiSYQC KEOGKPFNCS STLSKHKRIH TGEKPYRCBE OGKAPTWSST 1200 

LTKHRRIHTG EKPYTGEECQ QAFSRSSTLA NHKRIHTGBK PYTCKEOGKA PSLSSSLTYH 1260 

KRIHTGEXPY TCEE06KAFN CSSTLXKHKZ IBTGBKPYKC KEC6KAPAPS STLNTHKRIH 1320 

TGEEPYKCBB CDKAPKNSSS LANHXSKHTQ BRPYKCB 1357 

Seq ID NOx C252 Protein Sequence 
Protein Accession fts MP 114433.1 



^ ^ 1 11 21 31 41 51 

IS I I t 1 I I 

MASRSMRLLL LLSCIAXTGV LGDIIMRPSC APGWFYHKSN CYOyFIUQURN tfSDABLBOQS 60 

YGNGAHLASI LSLKEASTIA BYISGYQRSQ PIWIGXiHDPQ KRQQHQWZDO AHYLYRSNSG 120 

KSKGGNKHCA EMSSNKNPLT WSSNBC37KRQ HFLCXYRP 158 

20 Seq ID NO: C253 Protein Sequence 
Protein Accession #: XP_051860.2 

1 11 21 31 41 51 

1 I I I I ' 

25 MDGVNLSTBV VYKKGQDYRF ACYDRGHACR SYRVRFLCOK PVRPKI»TVTI DTNVMSTIIH 60 

LEDNVQSWKP GDTIiVIASTD YSMYQAEEFO VLPCRSCAPN QVKVAGKPMY LHIGBBIDGV 120 

DMRAEVGLLS RNIIVMGEME DKCYPYRNHI CWFFDFDTPG GHIKFALGFK AAHLEGTELK 180 

HMGQQLVGQY PIHPHLAGDV DBROQYDPPT YIRDLSIBHT PSRCVTVHGS NGLLIKDWG 240 

YNSLGHCFPT EDGFEERlTrF DHCLGLLVKS GTliXiPSDRDS XHCKNIT6DS YPOYXPKPRQ 300 

30 DCNAVSTFWM ANPKNNLINC AAAGSKETQF HFIFHHVPT6 PSVGMYSPGY SEHIPLGKPY 360 

NNRAHSNYRA GMIIDNGVKT TEASAKDKRP FLSIISARYS PHQDADPLKP REPAIIRHPI 420 

AYKNQDHGAW LRGGDVWLDS CRPADKGIGL TIiASGOTFPY DDGSKQEIKN SIiFVGBSQNV 480 

GTEMMDNRIW GPGQLDH8GR TLPIGQKFPI RGIQXiYDGPI KIQNCTFRKF VALEGRHTSA 540 

lAFRLNNAWQ SCPHNNVTGI APmVPITSR VFFGEPGPWF NQLDMDQDKT SVFHDVD68V 600 

35 SEYPGSYLTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY XQAYKTSNIiR MKIIKNDPPS 660 

HPLYLB6ALT RSTHYQQYQP WTLQKGYTI HWDQTAPAEI* AIWLXNFUKG DWIRVGLCYP 720 

RGTTPSILSD VHNRLLKQTS KTGVFVRTLQ MDiCVEQSYPQ RSHYYMDBDS GLLFLKLKAQ 780 

NBRBKFAFCS MKOCERXKIK ALIPKMAGVS DCTATAYPKF TERAWDVPM PKKLFGSQLK 840 

TKDHPLEVKM ESSKQHFFHIi WNDFAYIEVD GKKYPSSEDG XQVWXDGNQ GRWSRTSFR 900 

40 NSILQGXPHQ LFNYVATIPD KSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLXSQMAFV 960 

OFXOSFRPZH VTLDTBDHXA KIFQVVPIPV VKXKKL 996 



Seq ID NO: C254 Protein Sequence 
Protein Accession «i NP 055188.1 



1 11 21 31 41 51 

I I I I I 1 

MTALSSENCS POYQLRQTNQ PXjDVMYLLPIj IILGKILLNX LTW»4RRKNT OQNFMBYFCX 60 

SIAPVDLLLIi VNXSIILYFR DFVLLSXRFT KYHICLFTQI ISFTYGPLHY PVPLTACXDY 120 

50 CLNFSKTTKL 8FK0QKLFYF FTVILIWXSV LAYVLGDPAI YQSIiKAQNAY SRHCPPYVSX 180 

QSYHL8FFMV MILFVAFITC HSEVTTLVQA IRITSYMNBT ILYFPP8SHS 8YTVRSKKXF 240 

LSKLIVCFLS TWLPFVLLQV IIVLLKVOIP AYIEMKIPWL YFVM8FLIAT VYWPNOIKLN 300 

LKDIGLPLDP FVKWKCCFIP LTIPNLEQIE KPISXMXC 338 

55 Seq ID NOt C255 Protein Sequence 
Protein Accession #i Eos sequence 

1 11 21 31 41 51 

1 I I ' I 1 

OU MALVLGSLLL LGLCGHSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAQ PIGILFELVH 60 

IFLYWQPRD FPEDTLRKFL QKAYESKZDY DKIVYYEAGI ILCCVLGLLF IILMPLVGYF 120 

FCMCRCX3IKC GOEMHQRQKE NGPPLRKCPA ISLI/VICIXI SIGXFYOFVA NHQVRTRIKR 180 

SRKIADSNFK DLRTLLNBTP BQIKYILAQY NTTKDKAFTD UISINSVU3G GILDRLRPNI 240 

IPVLDBIKSM ATAIKBTKKA LBNMNSTLKS LHQQSTQLSS SLTSVKT8LR SSLNDPLCLV 300 

65 HPSSETQJSI RIiSLSQUTSN PELRQLPPVD ABU3HVNMVli RTDLDGLVQQ GYQSIJTDXPD 360 

RVQRQTTTW AOXKRVIiHSI GSDIDNVTQR LPIQDIL8AF SVYVNNTESY IHRNLPTIiBE 420 

YDSYWHLGGL VXCSLLTLIV XFYYIGLLOO VOOYDRRATP TTRQCVSNTG GVFLMVGVGL 480 

SFLFCWILMI IWLTFVFGA NVEKLICBPY TSKELFRVLD TPYLLNEDWB YYLSGKLFNK 540 

SKMKLTFEQV YSDCKKNRGT YGTIiHLQNSP NISBHLNINB HTGSISSBLB SLKVNLHIFL 600 

70 U3AAGRKNLQ DFAAOaXDRM NYDSYLAQTG KSPAOVNXiLS FAYDIjBAKAN SLPPGiniRNS 660 

LXRDAQTXKT IHQQRVLPIB QSL8TLYQSV KILQRTGHGIi LERVTRILAS LDFAQNFITK 720 

NT8SVIIEET KKYGRTXIGY FEOYLOIflEF 8ISBRVASCK FVATAU3TAV DVPIiCSYIXD 780 

PUILPHPGIG KATVFLLPAL IFAVKLAKYY RSMDSEDVYD DVETIPHKNN ENGNNGYHKD 840 

KVYGIHNPVM T8PSQB &56 



Seq ID NO: C2S6 Protein Sequence 
Protein Accession #i NP_149038.1 



1 11 21 31 41 51 

80 t I I ) I I 

MKAIIHLTLL ALLSVNTATH QGNSADAVTT TBTATSGPTV AAADTTBTOF PETASTTANT 60 

PSFPTATSPA PPXI8THSSS TIPTPAPPII 8THSSSTXPI PTAADSBSTT NVNSIATSDI 120 

ITASSFMDOI* ITKVPSBTQS NNBNSPTTED KQSSGPP1GT AIOiETSTliNS TGPS HPOQDD 180 

PGADNSLCVK LHNTSFCLCL BOYVYNSSTC XiQSlCVFPGKI SVTVSETFDP BBKRSMAYQD 240 
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LHSBITSLFK DVFGT8VYGQ TVILTVSTSL SPR8EMRADD KFVNVTIVTI LABTTSDNEK 300 

TVTEKI21XAI RSSSSNFLNY DLTLRaYYO CaiQTADDCUl GIJVCDCKSDL QRPHPQSPFC 360 

VASSUCCPDA QlAQHKQCLt KKS6QAPECA CVPGYQEDMI GKCQKCAFGY 86I1DCIDKFQ 420 

. LILTXVOTIA OIVIbSMZIA LXVTARSMNX TUIIBBBNIiZ DEDFQNLKLR STGPrNLGAB 460 

J GSVFPKVRIT ASRDSQMQHP YSRHSSMPRP DY 512 



10 
15 



25 



35 
40 
45 
50 



60 



75 
80 



&eq ZD MO: C257 Protein Sequence 
Protein Accession #1 NP_001423.1 

1 11 21 31 41 51 

I I I I I I 

»<TAGRRNEHL CAORVPALLL CLOPHLLQAV LSTTVIPSCZ PGESSDKCTA LVQTBDNPRV 60 

AQVSITKCSS DMNGYCUIGQ CIYLVDMSQN YCRCBYGYTG VRCEHPPLTV KQPLSKBYVA 120 

LTVILZZLFlf JTW6STYYF CRWYRNRKSX BPKKBYBRVT SGDPEhPQV 169 

Seq ID KOi C25a Protein Sequence 
Protein Accession ft: AAC63902.1 



«^ 1 11 21 31 41 51 

20 I I I I I I 

MDRSKENCIS GPVKATAPVQ GPKRVLVTQQ IPOQNPLPVM SGQAQRVLCP SMSSQRVPLQ 60 

AQKLVSSHKP V^IQKQKQIO ATSVPBPVSR FLNNTQKSKQ PLPSAPENNP BEELASKQKN 120 

BESXiCRQHAL EDFBI6RPL6 KGKFGNVYLA REKQSKFILA LKVLFKAQLE KAOVEHQLRR 180 

EVEIQSHLRH PNILRLYOYP HDATRVYLIL BYAPLGTVYR BLQKLSKPDE QRTATYITEL 240 

ANALSYCHSK RVIHRDIKPB NLUiGSAGEL KIADP6WSVH APSSRRTTLC OTLDYLPPEM 300 

IBGRMHDEKV DLWSUSVLCY EFLVGKPPFE ANTYQETYXR ISRVEPTFPD FVTBQARDLZ 360 

SRIJ.XHNPSQ RPMLREVLEH PNITAN8SKP SNOQNKBSAS KQS 403 



o/v Protein Sequence 

3U Protein Accession ft 1 MP 037504 . 1 



1 11 31 31 41 51 

I I I i I I 

MSRTAYTVGA LLLLI.GTLLP AAEGKXKGSQ GAIPPPOXAQ HNDSEQTQSP QQPOSRNRQR 60 

GOGROTAMPQ EBVIiESSQBA LHVTERKYLK RDMCKTQPLK QTIHBEQCNS RTIINRFCYG 120 

QCNSFYIPRH IRKEBGSFQS CSPCKPKKPT TMMVTIiMCPB LQPPTKKKRV TRVXQCKCIS 180 

ZDLD 164 

Seq ZD MOi C260 Protein Sequence 
Protein Accession 81 Eos sequence 



1 11 21 31 41 51 

I I I I I I 

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKBLXVNKKK HLGPVBSyQL IiLQVTYRDSK 60 

EKRDLRHFLK LLKPPLLWSH GLIRZIRAKA TTDCNSUfGV LQCTCEDSYT WFPPSOiDPQ 120 

NCYLHTAGAL PSCECHUJNL SQSVNFCERT KIWGTFKINB RPTODLLNSS SAIYSKYANO 180 

Z6IQLKKAYB RIQGFESVQV TQPRNGSIVA 6YEWGSSSA SELLSAIBHV AERAKTALHK 240 

LPPLEDGSFR VPGKAQOIDI VFGFGSKDDE YTLPCSSOYR GNITAXCBSS GWQVIRSTCV 300 

LSUjBBIiNKN FSMIVOIATB AAVSSPVQNL SVIZRQNPST TVGKLASWS ZZtSNISSLSZ) 360 

A8HFRVSNST MEDVISIADN ZtNSASVTNW TVIiLREEKYA SSRZiLETLBir ZSTLVPPTAZi 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RQRVLI6SDQ FQRSLPETII 460 

SMASLTLGMI LPVSKNONAQ VNGPVISTVI C»^YSINBVFL FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH LVNBTQDIVT CQCTHLTSFS ILMSPFVPST IFPWKWITY VGLQZSIGSL 600 

ILOaZBALP HKQIfCKSQTS HTRRICMVNI ALSLLIADVW FZVGATVDTT VNPSGVCTAA 660 

VFFTHFFYLS LPFWMLMIiGZ LLAYRIILVF HHMAQHLMMA VGFCLOYOCP LIZSVITZAV 720 

TQPSNTYKRK DVCWLNWSNO SKPLLAFWP ALAIVAVNFV WLLVLTKLW RPTVGERIiSR 780 

DDKATIIRVG KSLLILTPUi GLTWGPGIGT IVDSQNLAWH VIFALLNAFQ GFFILCPGIL 840 

LDSKLRQLLF NKLSALSSHK QTBKQHSSDL 8AKPKF8KPF NPLQNXGHYA PSBTGDSSCN 900 

IMLTQFVSNE 910 



Seq ID NOt C261 Protein Sequence 
Protein Accession ftt NP 000575.1 



^ " 21 31 41 51 

65 I I i I I I 

MTSKLAVALL AAFLISAALC EGAVLPRSAK ELROQCIKTY SKPFHPKFZK EI1RVIES6PH 60 
CAMTBZZVKL SDGRELCLDP KBIHVQRWE KFLKRAEKS 99 

„ Seq ID KOt C262 Protein Sequence 
/U Protein Accession fti MP 005594.1 



1 11 21 31 41 51 

I I 1 I I I 

M5TERDSBTT FDEDSQPNDB WPYSDDBTB OELDDQGSAV EPEQNRVMRB ASOfRSPFRX 60 

ECTWQVKAND RKYHEQPHFM NTKFLCZKBS KYANNAIKTY KYNAFTFZPM HLFBQPKRAA 120 

NLYFLALLIL QAVPQI6TLA WYTTLVPMiV VliGVTAIKDIi VDDVARHKMD KBINNRTCEV 180 

IKDORFKVAK WKEIQVGDVI RLKKNDPVPA DILLLSSSBP NSLCYVBTAB UX5BTNLKFK 240 

MSLBITDQYL QREDTLA7FD GFIBCBBPNN RLDKFTGTLP WRNT8FPLDA DKIUiRGCVZ 300 

RNTDFCHGLV IFAGADTKZM RHSGKTRFKR TKIDYLMNYM VyTZFVVI.ZIi LSAGLAZGHA 360 

YWEAQVGNSS HYLYDGBDDT PSYRGFZiZFH GYIIVLNIMV PISLYVSVEV ZRLGQSRFIH 420 

WDUJMYYAEK DTPAKARTTT IiNBQUSQIHY IFSDKTOTLT QNIMTPKKCC INGQIYGDHR 480 

DASQHNHNKI EQVDFSWNTY ADGKLAPYDH YLIEQIQSGK EPEVRQFPPL LAVCHTVMVD 540 

RTD6QUIYQA ASFDBGALVN AARMFGFAFL ARTCOTITZ8 ELGTERTYNV LAIIiDFNSDR 600 

KRM8ZIVRTP EGNIKLYCXG ADTVZYERLH RMNPTIGQBTQ DALDIFANBT LRTLCXiCYKE 660 
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IBEKBFTEmi KKFMAASVAS TNRDBMjDKV YEBIEKOLIL LGATAIEDKL QDGVPETISK 720 

lAKADIKZHV IiTGDKKETAE NIGFACBXiLT BDTTZarGED INSLLBARKE NQRNRGGVYA 780 

KFAPPVQESP PPFGGNRALZ ITGSWLNBZL LBKRTKRNKI LKLKPPRTEE ERRMRTQSXR S40 

RLBAKXEQRQ KNFVDIACBC 8AVICCRVTP RQKAMWDLV KRYKKMTUV IGDGANDVNM 900 

IKTAHIGVOI S0QB6MQAVM SSDYSFAQFR YLQRXjLLVHO RWSYIRMCKF LRYFPYKKFA 960 

FTLVHFWYSP FNGYSAQTAY EDWFITLYNV LYTSLPVLLM GLLDQDVSDK LSLRPPGLYl 1020 

VGQROLIiFNY KRFFVSLUIG VLTSMILPPI PLQAYIiQTVG QDOEAPSDYQ 8FAVTIASAL 1060 

VXTVNFQIGL OTSYWTFVNA FSIFQSIALY FQIMFDFUSA GIHVIfFPSAF QFTGTASNAL 1140 

RQPYIWLTII LTVAVCLIiFV VAIRFLSMTI HPSE8DKIQK HRKRUCAEBQ HQRRQQVFRR 1200 

GVSTRRSAYA FSHQRGVADL ISSGR8IRKX RSPLDAIVAD GTABYRRTGa) 8 1251 

Seq ID NOt C263 Proteia Sequence 
Protein Accession XM_044533 

1 11 21 31 41 51 

1111)1 

MLRTAMGUIS WLAAPWGALP PRFPLLXiLLL T.T..T.T.TiQPPPP THALSPRISL PliGSEBRPPL 60 

RFBABHI8NY TAIiLLSROGR TLYVGAREAL FALSSNL8FL PGGBYQBLLW GADAEXKQQC 120 

SFKGKDPQRD CQNYIKILLP LSGSHIiFTGG TAAFSPKCTY HIMEEIFTLAR DEKGNVLLBD 180 

GK6RCPFDPN FKSTALWDG BLYTGTVSSF QGNDPAI8R8 QSLRPTRTBS SUmLQDPAP 240 

VASAYIPBSL GSLQGDDDKI YFFF6BTGQE FEFPENTIV8 RIARICKGDE GGERVLQQRU 300 

TSFLKAQLIiC SRPDDGPPFN VLQDVPTLSP SPQDWRDTLF YGVPTSQWHR GTTBGSAVCV 360 

FTMKDVQRVF SGLYKBVNRE TQQVrYTVTHP VPTPRPGACI TNSARERKIN SSX^LPORVL 420 

NFZiKDHFLND GQVRSRMLLL QFQARYQRVA VHRVPGLHHT YDVLPLGTOD ORLHKAVSVO 480 

PRVHIZEBLQ ZF8SGQFVQM XiLLDTHROIiL YAASBSGWQ VPMANCSLYR SOGDCLLARD 540 

PYCAHSGSSC KHVSLYQPQL A7RPWIQDZE GASAKDLCSA 8SWSPSFVP TGERPCBQVQ 600 

FQPNTVNmiA CFLIiSHLATR LWLRNGAFVN ASASCHVLPT 6DLLLVGTQQ LGBFQCK8LE 660 

B6FQQLVA8Y CPEWEDGVA DQTDEGGSVP V1IST8RVSA PAGGKA8WGA DRSYWKEFLV 720 

MCTLFVIiAVL LFVLFIiLYRH RNSMKVFLKQ GECA8VHPKT CFWLPPETR PZ^GLGPPST 780 

PLDHRGYQSL SDSPP6SRVF TE8EKRPLSZ QD8FVB\r8PV CPRPRVRLGS BZRDSW 837 

Seq ZD NOt C264 Protein Sequence 
Protein Accession #: NP_00a95O.l 

1 11 21 31 41 51 

I I 1 i I I 

MASQNRDPAA T8VAAARKGA EPSG6AARGP VGKRLQQBLM TLMMSGDKGI SAFPBSDNLF 60 

KWVGTIHGAA GTVYEDLRYK I1SL6FP8GYP YNAPTVKFLT PCYBPNVDTQ GHXCXiDXLKB 120 

KWSALYDVRT 3IiL8ZQSZ*La BPNXDSPLNT RAAELWKNFT AFKRYLQETY SRQVT8QBP 179 

Seq ZD NO: C265 Protein Sequence 
'Protein Accession #t HP 055399.1 



1 11 21 31 41 51 

45 1 I I I I 1 

MGRGNGFLFG LLGAVWLLSS GKGEEQPPET AAQRCFCQVS GYLDDCTCDV BTXDRFNNYR 60 

LFPRLQKLLE SDYFRYYKVN LKRPCPFWKD ISQCXSRRDCA VKPOQSDBVP DGIKSASYKY 120 

SSBANNLZEB CEQABRLQAV DE8LSEETQK AVLQHTKHDD SSDNFCEADD XQSPEABYVD ISO 

UiUIPERYTG YKGPDAWKXW NVZYEQICFK PQfTZXRPUTP XASGQQTSBB NTFYSHLEOL 240 

CVEXRAPYRL ZSGLHASINV HLSARYLLQB TWI.EKKWGHN XTBFQQRFDO ILTEOBOPRR 300 

LKNLYPLYLI BLRALSKVLP PFEnUPDFQLF TGKKZQDBBN KMLLLBILHE ZKSPPLHFDE 360 

NSFFAGDKKB AHKLKEDFRL HFRNISRIMD CVGCFKCRLH GKLQTQCHXTr ALKILFSEKL 420 

lANMFBSGPS YEFBLTRQEZ VSLFNAFGRZ 8TSVKELBNF RNLLQNZH 468 



Seq ZD NO I C266 Protein Sequence 
Protein Accession It NP_002879.l 

1 11 21 31 41 51 

I I I 1 I I 

MQPRRQRLPA PWSGPRGPRP TAPZtLALLLL ItAPVAAFAGS G6PDDPGQPQ DAOVPRJRLLQ 60 

QKARAALHFF NFRSGSPSAL RVLAEVQBGR AWINPKEGCK VHWFSTERY NPESULQEGE 120 

GRLGRCSARV FFRNQKPRPT INVTCTRLIE KKKRQQEDYL LYKQMKQLKN PliBZVSZFON 180 

HGHZDPSLRL ZWDX1AFLGS8 YVMWEMTTQV SHYYliAQLTS VRQWVRKT 228 

Seq ZD NOt C267 Protein Sequence 
Protein Accession «: NP 005400.1 



1 11 21 31 41 51 

nn ^ I 1 I I I 

/U MSVKGMAIAL AVZLGATWQ GPPMPKRGRC LCIGPGVKAV XVADZEKASI NYPSNNCDKZ 60 
BVIITLKBNK GQRCLNPKSK QARLIIKKVE RKNF 94 



Seq ZD NOt C268 Protein Sequence 
Protein Accession ft: F6BNBSB predicted 



1 11 21 31 41 51 

I I i I I I 

MLRQVLRRGL QSFCHRLGLC V8RUPVFFLT VPAVLTZTFG LSALNRPQPB GDLERI.VAPS 60 

RSLARIERSL ASSZiFPLOQS K8QLYSDi;aT PGRYGRVZUi 8PIGDNZLLQ AEGZLQTHRA 120 

oU VLEMKVNHKG YNYTFSHLCV LRNQDKKCVL DDZZSVLSDL RQAAVSNXTT ARVQVRYFNT 180 

KLKVCSFCKL LPIKBAAZiHF LP 202 



8eq ID NOt C269 Protein Sequence 
Protein Accession #t NP 002429.1 
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1 II 21 31 41 51 

I I I 1 1 I 

MRLPLLLVFA SVIPGAVLLL DTROFLIYNB 0UKRCVDAV9 PSAVQTAAOI QDAESQKFRH 60 

V8BSQXMSVA FKtiCLGVPSK TDWVAITLYA CDSXSBFQKW ECXNDTIiLOZ KGEDLPFNYG 120 

NRQEKHIMLY KGSGLHSRHK lYGTTDNLCS RGYEAHYTLL GNAKQATCAF PFKFENKWYA 160 

DCTSAGRSDG WLWOGTTTDY DTDKLPGYCP LKFEGSBS1A« NKDFLTSVSY QINSRSALTW 240 

BQA&KSCQQQ NAELLSITEI HDQTYLTGLT SSLTSGLWIG UlSLSFNSGH QNSDRSFFRY 300 

UmLPGSFSA EPGKSCVSU9 PGIQIAKWENL ECVQKLOYIC KKCafTTUISF VIP8BSDVPT 360 

RCP8QWWFYA GHCYKIHRDB KKZQRDAIiTT CRKEGGDI.T8 IHTXBSLDPI ISQL6YBPND 420 

BLWIGUIDIK IQMYFEH8DG TPVTFTKWLR GEPSHEKNRQ EDCWMKOKD GYUADRGCEH 480 

PU3YICKMKS RSQGPEIVEV EKGCRKGWKK HHPYCYMIGH TLSTFAEANQ TCNNENAYLT 540 

TXEDRYEQAP LTSFVGLRPB KYFVrr6I*SDI QTKGTFQHTZ EBEVRFTRHN SDMPGRKPGC 600 

VAMRTGZAGG LNDVLKOSEK AKFVCKHMAE GVTBPPKPTT TFBPKCPBDH GASSRT8LCP 660 

KLYAKGKHEK KTWFBSRDFC RAIiGGDIASI NNKBEQQTIW RLZTASG8YH KLFWLGLTYO 720 

SPSEGFTWSD GSPVSYENWA YGEPUMYQNV EYCGBLKGDP TMSWNDINCB HLNNWICQIQ 780 

KGQTPKPEPT PAPQDNPPVT EDGWVIYKDY QYYPSKEKET MDNARAFCKR NFGDLVSIQS 840 

ESEKXPLHKY VNRNDAQSAY PZGLLXSLDX KFAHMDGSKV DYVSWAT6EP NFANEDEKCV 900 

THYSNSGFWN DXNC6YPNAP ZOQBBNSSZN ATTVMPTMPS VFSGCKBGWN FYSNKCFiaP 960 

GFMEEERKNW QBARRACZGF GGNLV8ZQNB KBQAPLTYHM KDSTFSAHTQ LNDVHSBHTF 1020 

LKTDGRGVHY TNWGKGYPGG RRSSLSYEDA DCWZIGGAS MEAOKWKDDT CDSKROYZCQ 1080 

TRSDPSLTNP PATIQTDGFV KYGKS5YSLM RQKFQNHEAE TYCKLKNSLI ASII«DPYSNA 1140 

FAWLQMETSK ERVWZALNSN LTDNQYTHTD KWRVRY7NWA ADEPKLK8AC VYIiDLDGYWK 1200 

TAHC3«E8FYF LCaCRSDBZPA TEPPQLPGRC PESDBTAHIP EH6RCYYZBS SYTRNHGQAS 1260 

LECLRMGS8L VSIBSAABSS FLSYRVBPIJC SKTNFWZGLF RNVEGTWUTZ HNSPVSFVHW 1320 

NTGDPSGERN DCVALHASSQ FWSKZRCSSY KGYZCKRPKX ZDAKPTHEIiL TTKADTRKMD 1380 

PSKPSSNVAG WIIVIUiZL TOAGLAAYFF YKRRRVRLPQ EOAPENTLYP NSQSSPGTSD 1440 

MKDLVGMXEQ NEKSVZ 1456 

Seq ZD KO: C270 Protein Sequence 
Protein Accession Eos sequence 

1 11 21 31 41 51 

I I i I I I 

MVLLHWCLLW LLFPLSSRTQ KLPTRDEEZiF QMQIRDKAPF HDSSVZPDGA BZSSYliFRDT 60 

PKRYFFWEB DKTPLSVTVT PCDAPLEWKL SLQELPEDRS GEGS6DLEPL EQQKQQIINE 120 

EGTELF8YKG NDVEYFZSSS 8PSGLYQLDL LSTEKDTHFK VYATTTPESD QPYPELPYDP 180 

RVDVT8L0RT TVTLAWKPSP TASLLKQPXQ YCWZNKEHN FKSLCAVBAR LSADOAFMMA 240 

PKPGLDF8PF DFARFGFPSD N8GKER8FQA KPSPKLGRHV YSRPKVDIQK ZCXOnCNZFT 300 

VSDLKPDTQY YFDVFWNIN SNMSTAYVGT FARTKBEAKQ KTVBLKDOKZ TDVFVKRKQA 360 
KFLRFAPVSS HQKVTPPIHS CLDAVQIQVR RDGKLLLSQK VB6ZQQFQLR GKPKAKYLVR 420 

LKGNKKQASM LKILATTRPT KQSFPSLPED TRZKAFDiCLR TCSSATVAWL OTQERNKFCZ 480 
yKKEVDDNYN EDQXKREQNQ CLGPDZRKKS EKVLCIOrFHS QNLQKAVTTE TZK6LQPGKS 540 
yUJ>VYVXGR GGHSVKYQSK WKTRXFC 568 

Seq ZD NO: C271 Protein Sequence 
Protein Acceasion #i AAH34229.1 

1 11 21 31 41 51 

I I 1 I I I 

HEKVQLBFE17 QEMEKKLQBP RSTRMKEKED RESSBYYNICS GKVGKLVNQS YHMSQ2fKGNV 60 
VKFSAGKVKZi XLLKEQIQBP VXPTVNYKMA NSSBCEKPKX NGKVOGQCEN KAALLVCLEC 120 
GEDYCSGCFA NVHQKGALKL HRTTLIiQARS QILFNVLDVA HQPIKDVNPD EPKEENNSTK 180 
ETSKZQHKPK SVLLQRSSSB VEZTTMKRAQ RTKPRKSLLC BGSFDEEASA QSFQEVLSQW 240 
RTGNHDONKK QNLHAAVKDS LEECEVQTMIi KZHRBPLNIB liKEDXLSYMB KLWUCKHRRT 300 
PQEQLFKCYQ IRSHXHMKPL VNaSVIiKKKT MKIVKVRRPK YMTQLFYOQ 349 

Seq XD NOi C272 Protein Sequence 
Protein Accession «t 1IF_078963.1 

1 11 21 31 41 51 

i I I I I i 

MBKLWLKKBR RTPQEQLFRM LSDTFPHPBE TTGDAQCSQZf EHDEDSDGEE TKVQHTALLL 60 
PVBTLNIERP BPSLKIVBIiD DTYEEEPEEA BNZVPYKViCL ADADSQRSCA FHD0QKN8FP 120 
YENGIHQBHV FDK6KRDFLN LCLRNSSTYY lOSNSKGBTSN TDFDNZVDPD VYSSDZBKZE 180 
ESTSFERULR EKNZGLBSMQ KSDDSCVSIiB SXDTLLGRDL ERAPXEBKLS QDXKESLELS 240 
NLYRRPSFBB SRTTX8SLLL QEZACRSKPZ TKQYQGLERF PZFDTNERLH LLPSHRLEC3I 300 
NS8TRITLAB DREWZPDHSL SEYADNAZVL GVLQGAQSPS 8SRKQQKMGQ KSQRFSTANF 360 
PLSNSVKESS 8CLS88HPR8 RSAAAQSS8R AA8EI8EZEY XDZTDQNEI.S IiDDTTDQHTL 420 
DNLEKELQVL RSLADT8BKL YSLTSEEFPD FSSQSLHISQ XSTDFLKTSH VRGP06VEEL 480 
8C8GRDTKIQ SLLSLSBSST DBBEEDFLNK QHVITLPWSK ST 522 

Seq XD NO 3 CS73 Protein Sequence 
Protein Accession «t NP_005399.1 

1 11 21 31 41 51 

i I I I I I 

MKVSAVLLCL LUflAAFNPQ GZiAQPDALNV P8TCCFTPS8 KKXSLQRLKS YVXTTSRCPQ 60 

KAVZFRTXZiG KBICADPKER HVQNYMKHLG RKARTUCT 98 

Seq XD NOs C274 Protein Sequence 
Protein Accession 0i BAC05158.1 

1 11 21 31 41 51 

1347 



wo 03/042661 



PCTAJS02/36810 



I i I I I I 

MFLLTG6VSL KSAEKNPDPT WLQDKSWEEI CRASBFPAFR GLRQHFCEHI YEWRBIYDSR 60 

BFHNAKPPAP MDKNLNELQK IIILRCLRPD KXTPAXTVYV TDKLGKKFVE PPFFDLTKSY 120 

LDSNCTXPLZ FVLSPGADPM ASUiKPAMDK 8MSGNKFQAX SLGQ6Q6PXA AKMXKAMEB 160 

D GTWVCLQNCH lAVSKMPMLE RICEDFT8ET CNS5PRLHLT SYPSSKFPVT ILQKGVRMTN 240 

BPPTGLRLNL LQSYLTDPVS DPEFPKGCR6 RELLPIKEYD TZPFEAISYL TGEOTYGGRV 300 

TDDWDRRLLL TMLADFYNLY IVENPHYKPS PSGNYPAPPK GTYEDYIEFI KKLPPTQHPB 360 

IFGLHEMVDX SRDLQQTKTL FESLLLTQGG SKQTGASGST DQIUiBITKD ILNKLPSDFD 420 

^ lEMALRKYFV RYBBSMNTVL VQQ4ERFHNL ZITIRUTLXtD LEKAIX6WV MDSALEALSS 480 

10 8LLVGKVPBI HAXR8YPSLK PL6SYITDFL ARLNFLQDHY N8GKPCVFHL 8GPFFTQAPL 540 

TGAMQNYARK YTTPIDLL6Y BPSVIPSOTS DTSPEDGVYI KQLyifiQARN DRESGIiLAEQ 600 

YPKLLFDLTIP IIHIKPTQKS RIZKSDAYVC PLYKTSBRXO TLSTTGHSTN PVIANLLKID 660 

QPTRHHIKRO VAZiLOQLDD 679 

15 Seq ID NOt C275 Protein Sequence 
Protein Accession fit AAA60212.1 

1 11 21 31 41 51 

1 I I I 1 I 

MABSKXiXK^WI. LLLLPTLCOP GTAAWTTSSZi ACAQGPEFHC QSLEQALQCR ALGHCLQBVH 60 

GHVGABDLCQ ECEDIVHILN KMAKEAIFQD TMRKFLEQEC NVLPLKLLMP QOfQVLDDYF 120 

PLVIDYFQNQ TDSNGICMHL GLCKSRQPEP EQEPGMSOPL PKFLSDPLPD PLI<DKLVLFV 180 

LPGALQARPG PHTQDLSEQQ PPIPLPYCWL CRALIKRIQA MZPKGALAVA VAQVCRWPL 240 

VAOGIGQCLA ERYSVILLDT LL6RMLPQLV CRLVIiRCSHD DSJU3PRSFTQ EWLPROSECH 300 

25 LCaiSVTTQAG NSS6QAXPQA MLQACVGSWIi DREKCXQTVE QHTPQXiLTZiV PR6HDAHTTC 360 

QALGVCGTMS SPLQCIHSPD L 381 



30 
35 



Seq ID KOt C276 Protein Sequence 
Protein Accession #« NP_631911.1 

1 11 21 31 41 51 

1 1 I I I I 

HLGCGIPAL6 LLLLLQGSAD ^IQGFFYP WSCEGDIHDR ESCGGQAAID SPNLCLRLRC 60 

CYRNGVCYHQ RPDENVRRKH KWALVWTCSO LLLLSCSICL FWWAXRRDVL HMPGFLAGPC 120 

OMSKSVSLLS KHRGTKKTPS TGSVPVAIiSK BSRDVEGGTE GB6TEE0BET EGEBBED 177 

Seq ID NO! C277 Protein Sec[uence 
Protein Accession Ut NP_473364.1 

40 1 11 21 31 41 51 

I I I I I I 

MKLVTIFLLV TISLCSYSAT AFLINKVPLP VDKIiAPLPLD NILPFMDPLK LLUCTLGISV 60 

EHLVBGLRKC VNELGPEASE AVKKLLEALS HLV 93 

45 Seq ID NOt C278 Protein Sequence 

Protein Accession #t FGENE8H predicted 

1 11 21 31 41 51 

MPLSYAYKMA STZiAGRBTSS WMSRGAYQHR NTRAAGRPEE CTDRNWHAOR TROlWLGQIiE 60 

BRCSDVFGVS PFWWVRGLAG SGAKLQTFTP AQEGAPTVQR QAEALLKCRQ SGRPGRGGAE 120 

SBRARDASML SPLSAAMRNY PTSSTIPPRR' SYSPTEIAHK SYSCSLPDMK ISMAESGPSL 180 

DSLDILBDOE SGSPFLVTHL YFLGWTTGM BQLDFETGPH IFDLQIYVKD EVGVTDLQVL 240 

TVQVTDVNEP PQFQONIiAED HIiRADQPBFII AHSHTYVRW ATAIARHRLR SSXGSPFIiQT 300 

FCWVGMQYF LISPPKSFRM SANGTLFSTT ELDPEAGHRS FHblVEVRDS GGLKASTELQ 360 

VNIVNLNDBV PRFTSPTRVY TVLEELSPGT IVANITAEDP DDBGFPSHLL Y9ITTVSKYF 420 

MINQLTGTIQ VAQRIDRDAG BLRQNPTISL EVLVKDRPYG GQENRIQITF IVBDVNDNPA 480 

TCQKFTFRSS LHPAIiCSKTL TVfMDTVIiDCF HAADKDIPVT GRFTKEROLI QLTVPHGWGS 540 

LTIMAEGKEB QVTSYMDGSR QRDRACV6KL LLIKPSDX.MR LSHYHENNSG KTCFHDSI8S 600 

YQVPPTTCRN SRIQATNNED TSSVTVTVNI LBBNDBKPIC TPNSYFLALP VDLKVGTNIQ 660 

NFKLTCTDtiD SSPRSFRY8Z GPGNVMNHPT F8PNAGSNVT RLLLTSRFDY AGGFDKIWDY 720 

KLLVYVTDDN LMSDRKKAEA LVBTGTVTLS ZKVZPHPTTZ ZTTTPRPRVT YQVLRKNVYS 780 

PSANYVPFVI TLGSZLLXiGL LVYLWLLAR AIHRHCPCKT GKNKEPIiTKK GBTKTAERDV 840 

WETIQMHTI FD6EAZDPEP BQASLELYAL LPSCCDPSPV TLRKVQVOGB SEBTGQCSGH 900 

ITLPGKIPVD DPRKQETGLQ ODFEVWTLCP AVKWVGSFQ AERCZRLALS LKKYSSD 957 

Seq ZD NOt C279 Protein Sequence 
Protein Accession #i XP_1 66571.1 

1 11 21 31 41 51 

I I i I I 1 

MZNQLTQTZQ VAQRIDRDAG ELRQNPTZSL EVLVKDRPYG GQENRZQZTF ZVEDVNDNPA 60 

TOQKFTFSIM VPERTAKGTL LLDLNKFCFD DDSEAPNNRF NFTMPSGVGS GSRFLQDPAG 120 

SGKIVLIQDL DYEMPSNLAA GNKYTVIZQV QDVAPPYYKN NVYVYILT9P ENEFPLIFDR 180 

PSYVFDV6ER RPAQGHLSGP EBKRLI.SZCH VRAVCHHFGIf HZASGSPRVP GRPZGQSHFQ 240 

TLPLQDWEEQ GTSDKERRNE DCRERRRGOI YPDEHYL 277 

Seq ID NO: C2B0 Protein Sequence 
Protein Accession NP_005257.2 

1 11 21 31 41 51 

I I I I i I 

MGDWSFL6NF LBEVHXKSTV VGKVWIiTVXiF IFRMLVLGTA ABSSWGDEQA DFRCDTIQPG 60 
CQNVCXDQAP PXSRIRYWVL QIXFVSTPSL VYMGHANRTV RMQBRRXLRE AERAKEVROS 120 



55 
60 
65 
70 
75 
80 



1348 



wo 03/042661 



GSYBYPVABK AELSCWEEGN GRIAWJQTLL NTYVCSILIR TTMBVGPIVG QYPIYGIPLT 180 

TLHVCRR8PC PHPVNCYVSR PTEKNVFXVF MLAVAALSLL LSIASLYHLG WKKIRQRFVK 240 

PRQKMAKOQL S6PSVGIVQS CTPPPDFNQC LENGPG6XFP NPFSNNMASQ QinDNLVTEQ 300 

VRGQEQTP6B 6FIQVRYGQK PEVPNGV8PQ HRLraGYHSD KRRLSXASSX ARSDDLSV 358 

6eq ID NOt C281 Protein Sequence 
Protein Accession fti i7P_0SS274.2 

1 11 21 31 41 51 

I 1 I I i 1 

MYLSXCCCFL LWAPALTLKK liNYSVPBBQG AGTVIGNIGR DARLQPGLPP AERGGGGRSK 60 

SGSYRVLEN8 APHLLOVDAD SGIiLYTKQRI DRESLCRHNA KCQLSLEVFA NDKEICMIKV 120 

EIQOXNDNAP SPSSDQZaiD ISENAAPQTR FPLTSAHDPD AGENGLRTYL LTRDDHGIiFG 180 

LOVXSRGDGT KPPELVIQKA liDRBOQHHRT LVLTALDGGB PPRSATVQXN VKVXOSNDNS 240 

PVFBAPSYLV ELPENAPLGT WIDLNATDA DBSPNGBVLY SPSSYVPDRV RELPSZDPKT 300 

GLIRVKGNLD YEENOMLEID VQARDLGPNP IPAHCKVTVK LXDRNTOJAPS IGPVSVRQQA 360 

LSEAAPPGTV lALVRVTDRD SGKNGQLQC31 VLGGGGTGGG GGLGQPGGSV PFKLEENYDN 420 

FYTWTDRPL DRBTQDEYNV TZVARDGGSP PLNSTKSFAI KILDEHDHPP RPTKGLYVLQ 480 

VHEMNIPGEY LGSVLAQDFD LGG|KGTVSYS IZ*PSHXGDVS lYTYVSVNPT NQAZYALRSF 540 

NFBQTKAFBF KVXJUCDSGAP AHLBafATVR VTVLDVNDNA PVXVLPTLQN DTABLOVPRM 600 

AGLGYLVSTV RALOSDFGBS GRLTYEIVDG NDDHLFBXDP SSGHIRTLHP FWEDVTPWE 660 

LWKVTDHGK PTLSAVAKLX XRSVSQSLPE GVPRVNGEQH HWDMSLPLIV TliSTXSXXLL 720 

AAMITIAVKC KRENKBIRTY KCRXAEYSHP QLGGGKGKKK KXHKNDIMLV QSEVEERKAM 780 

NVMNWSSPS lATSPMYFDY QfTRLPXiSSPR SBVHYI.KPAS NNLTVPQ6KA GG»T8PTGQ0 640 

TNASBTPATR MSXXQTONFP AEPHYMGSRQ QFVQSISVAP RIATQKEPA 889 

Seq XD NOt C282 Protein Sequence 
Protein Accession #s NP_005592.l 

1 11 21 31 41 51 

I 1 I I I I 

MBLCRSLALL GGSLGLMPCl* lALSTDFWFE AVGPTHSAH8 GLWPTGHGDI ISGYXHVTQT 60 

FSXMAVLWAL VSVSFLVLSC FPSLFPPGHG PLVSTTAAPA AAISHWAMA VYTSBRWDQP 120 

PHPQIQfTFFS WSFYIiGWVSA ILLLCTOALS LaAHGOGPRP GYEIXi 165 

Seq ID NOt C283 Protein Sequence 
Protein Accession ft: NP_006424.2 

1 11 21 31 41 51 

I ) I 1 t I 

MATWALLLIiA AMIiI/SNPGLV FSRLSPEYYD IiARAKIiRDBB KSCPCLAQEG PQGB)IiLTKTQ 60 

ELGRDYRTOj TIVQKLKKMV DKPTQRSVSN AATRVCRTGR SRNRDVCRNF MRRYQSRVTQ 120 

GLVAGETAQQ XCEDLRIjCIP STGPL 145 

Seq XD NO: C2B4 Protein Sequence 
Protein Accession #: NP_005594.1 

1 11 21 31 41 51 

I I I I I I 

MKVSAAAXiAV XLIATALCAP ASASPYSSDT TPCCFAYZAR PIiPRAKIKEY FYTSGKCSNP 60 

AWPVTRKMR QVCAHPEKKW VREYXNSLEM 6 91 

Seq ID NO: C285 Protein Sequence 
Protein Accession ttt NP_071437.l 

1 11 21 31 41 51 

I I I I I I 

MAPGRAVAGL LLLAAAGLGO VAE6PGLAFS EDVLSVFGAN LSIiSAAQLQH LLEQKGAASR 60 

VOVPBPGQLU FNQCLTAEEI FSLHGFSNAT QXTSSKFSVI CPAVLQQI^ HPCEDRPKHK 120 

TRPSH8EVWQ YGPLSVTXXN LASIiLGLXLT PLXKKSYFPK XLTPFVOLAX OTLFSNAXFQ 180 

LXPEAFOFDP KVDSYVEKAV AVFGGFYXiLF FFERMLKMLL KTYGQNGUTH FGNDNFGPQE 240 

KTHQPKALPA XK6VTCYANP AVTEANGHXH FDNVSWSLQ DGKKEPSSCT CLKGPKLSEX 300 

GTXAWMXTLC DAIjHNPIDGIi AIGASCTLSL LQGLSTSIAX LCEEFPKELG DFVXZiLNAGM 360 

STRQALLFNF LSACSCYVQL AFGILVGNNF APNIXFALAQ GMFLYISLAD MFPEKNDMIiR 420 

EKVTGRKTDF TFFMIQNAGM LTGFTAIIiLI TLYAOBIBLE 460 

Seq XD NOt C286 Protein Sequence 
Protein Accession #i NP_004175.1 

1 11 21 31 41 51 

I I ) I i I • 

MPNSBPASLIi ELFNSXATQG ELVRSLKAGN ASKDEXDSAV KMLVSLKMSY KAAAGEDYKA 60 

DCPPta^PAPT SNHGPDATEA BEDFVDPWTV QT8SAKGIDY DKLXVRFGSS KIDKELXKRX 120 

ERATGQRPHH PLRRGIFFSB RDMNQVLDAY ENKKPPYLYT GRGPSSEAMH VGHLXPFXPT 180 

KNLQDVFNVP ItVlQMTIIDEK YLMKDLTLDQ AYQDAVENAK DXIA06PDIN KTFIFSDLDY 240 

MGMSSQPYKN WKIQKHVTF NQVKQIPOFT DSDCXGKISF PAIQAAPSPS NSPPQIPRDR 300 

TDXQCLIPCA XDQDPYFRMT RDVAPRXGYP KPALLHSTPP PALQGAQTKM SASDPN88XP 360 

LTDTAKQIKT KVNKHAFSGO RDTXEEHRQF GGNCDVDVSP MYiTFFLEDD DKLEQIRKDY 420 

TSQAMLTGBL KXALIBVUQP LIAEBQARRR EVTDBIVKEF MTPRXL8FDP Q 471 



Seq XD NO: C287 Protein Sequence 
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Protein AcceBsion 0i NP 004929.1 
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MTVPRQEMVD 
lEREVSILKB 
KQILNGVYYL 
PEPVAPEIVN 
PSNT8ALAKD 
ARKKHICQSVR 
QHLI/3SLSNY 
RHGHVDTLKF 
TPLHCAAWKG 
CDKDGHIALH 
NCNU>ZSMKY 
QLLARLRKDT 
RPRLSSTNSS 
PTHHFHCSAD 
FSLBBPYBZQ 
GEFGYDKOTS 
PPMTHLCBKI 
EINIMQSBTV 
ELLQZLDAMD 
FP03IFHKVQ 
GLETEKIKCX: 
AQTLKETSLT 
PDPLGKDWCL 
LRELGRRDAA 



11 
I 

DYYISTGEBLG 
IQHPNVITLH 
HSLQIAHFDL 
YBPLGLEADM 
FXRRLbVXDP 
LISLCQRLSR 
DVNQPNKHGT 
LSENKCPLDV 
YYSVAKAbCE 
lAVRRGQKBV 
GRTPLHLAAN 
HRGLPIQQLR 
RFPPSPLASK 
DQSTKAIDIQ 
LNPVIFHLSP 
liLKBXHNRFO 
ISTLPSWRKL 
QDVLLLDPRW 
ICARDLSS6T 
VNLCRHIHQQ 
LLLDSVCSTZ 
NTMGGYKBSP 
LAMNIiGLPDL 
DliIiLKASSVF 



21 
I 

SGQFAWKRC 
EVYENKTDVI 
KPEMIMLLDR 
WSIGVITYII^ 
KKRMTIQDSXi 
SPLSRSMMSV 
PPLLIAAGCX3 
KDKSGEMALH 
AG07VNIKNR 
IKTXiIiSQGCF 
NGIU)WRyL 
PTQNLQPRIK 
PTVSVSINNL 
NAYLNGVGDF 
XiKSLVFVBBP 
NDLHISKKLP 
NGPNQLMSLQ 
LCTNVLGKIiL 
MVDVPALIKT 
STEGDADIRL 
ENVMATTIiPO 
SSIMCFGCHD 
VAKYNTNNGA 
KINLDGHGQE 



31 
I 

RBKST6LQYA 
LZLELVAGQH 
NVPKPRIKII 
LSGASPFLGD 
QRPWZKPRDT 
ARSDDTLDEB 
NIQILQLLIK 
VAARYGHADV 
EGETPLLTAS 
VDYQDRHfflJT 
CLM6ASVBAL 
LKLP6HSGSG 
YPGCQIVSVR 
SVWBFSGNPV 
lAFGGXUCNP 
VLDAGA8GSK 
QFVYDVQDQL 
SVETPRALHH 
DNUmSWADB 
HVNGCKLANR 
LLTVKHYLSP 
VYSQASLGMD 
PKDFLPSPLR 
AYASSC3TSGT 



41 
I 

AKFZKRRRTK 
LFDFLiAEKES 
DFGLAHKIDF 
TKQETLANVS 
QOAIiSRKASA 
DSFVMKAIIH 
RGSRIDVQDK 
AQVTCAASAQ 
ARGYHDZVEC 
PLBVACKDGN 
TTDGKTABDL 
KTTLVBSLKC 
SRSMMFEPGL 
YFCCYDYFAA 
LQWLVATBA 
DMKVIiRNKLQ 
NPLASEEDIiR 
YRGRYTVEDI 
SDEVMVYQGV 
GAELLVLLVN 
QQLRBHKEPV 
IHASDLNLLT 
ALLREWTTYP 
SYNSISSWS 



51 
I 

SSRRGVSRED 
LTBEEATEFL 
GNBFKNIFGT 
AVKYEFEDEY 
VMMBKFKKFA 
AXM0DNVP6L 
GGSNAVYWAA 
IPISRTKEEE 
LABIGADLMA 
MPZWAIiCBA 
ARSEQHBHVA 
OLLRSFFRRR 
TKGMLEVFVA 
NDPTSIHVW 
DIMNVPRPAG 
EXRSQZVSVC 
RIAQQLHST6 
QRLVPDSDVE 
RIVPVEULTP 
H6QGZEVQVR 
KZYQPRDFFR 
RRKLSRI.U>P 
ESTVOTIiMSK 
R 



Seq ZD HQs C288 Protein Sequence 
Protein Accession #t NP 002072.1 



MELRARGWWL 
ICPQGYTCCT 
TLQATFPGAP 
QLIiLPDDYLD 
PDBPECSRAV 
TDKFWGTSGV 
ROKLAPRERP 
RGRYLPEVMQ 
ASDDGSGS6S 



11 
I 

ZiCAAAALVAC 
SEMEENIJVNR 
GELYTQNARA 
CliGRQABALR 
MKLVYCAHCL 
BSVZGSVBTH 
PSGTXiBKXiVS 
DGLANQXNNP 



21 
I 

ARGDPASKSR 
SHABLETALR 
PRDLYSBLRL 
PFGBAPRELR 
6VPGARPCPD 
LAEAZKALQD 
BAKAQLRDVQ 
EVBVDZTXPD 



31 
I 

SCX3EVRQ1YG 
DSSRVLQAML 
YYRQANLHLE 
LRATRAFVAA 
YCRNVLKGCL 
NRSTLTAKVZ 
DFHZ8LPGTL 
HTZRQQZNQXi 



41 

I 

AXGFSLSDVP 
ATQLRSFDDH 
ETIiAEFWARL 
RSFVQGL6VA 
ANQADLDAEH 
QGOGNPKVNP 
CSBKKALSTA 
KZMTNRIjRSA 



51 
I 

QAEZSGEHLR 
FQHLLNDSER 
LERLFKQIiHP 
SDWRKVAQV 
RNLLDSMVLZ 



YNGNDVDFQD 



Seq ID NO: CSe9 Protein Sequence 
Protein Accession #i AAH30205.1 



1 11 

I I 
MZZLZYLFLL LWEDTQGWGF 
BGGHLATYKQ LEAARKIGFH 
ERWDAYCYNP BAKECOGVFT 
DLBDDPGCZiA DYVBIYDSYD 
QZKYVANDPV 8K8SQ6KNT8 



31 31 41 51 

I I I I 

KDGIFBNSXW XiERAAlSVYHR BARSGKYKLT YAEAKAVCEF 
VCAAGHMAKO RVGYPZVKPG PNOQFGKTGI IDYGZRLNRS 
DPXQIFKSPG FPKEYEDKQI CYWHIRLKYG QRIHL8FLDF 
DVHGFVGRYC GDBIiPDDXlS T(a«VHTUCFIi SDASVTAGGF 
TTSTGNKNFL A6RFSHL 



Seq ID NO I C290 Protein Sequence 
Protein Accession #1 HP_001973.1 



MRAMDALQVL 
GHIiEZVLTGH 
LtlYNlNSSHA 
R8CPPCSEVC 
TDCFACRHFN 
CVRACPPDKM 
LDFLITGLNG 
RSLYNRGF8L 
RLDZKHNRPR 
REFAHEAECF 
YKYPDVQNEC 
GTFLYNRGRR 
GTVHKOVWIP 
S8LQLVTQYL 
LLKSP8QVQV 
VWELMTFGAE 
LANEFTRMAR 
NliATTTIjGSA 
PVSLHPMPRG 
TPVTPLSPPG 
KRRRRH8PPH 
EYKNRQRDGG 
8AFDNPDYHH 



11 
I 

OLIiFSIiARGS 
NADLSFLQWI 
XiRQUUiTQLT 
XGRCH6PGSE 
DSGACVPRCP 
EVDKNGLKMC 
DPWHKIPALD 
LIMKNLNVT8 
RDCVAEGKVC 
SCHPEOQPKG 
RPCHEEICTQG 
IQKiOlAMRRY 
EGESIKIPVC 
PliGSUiDHVR 
ADFGVADLLP 
PYAGLRIAEV 
DPPRYLVIKR 
ZjSLPVGTLNR 
CLASSSSEGH 
LEEEDVNGYV 
PPRFSSLEEL 
OPGGDYAAMG 
8RLFPKANAQ 



21 

I 

EV<nJSQAVCP 
REVTGYVLVA 

BZIIS6GVYIE 

DOQTLTRTIC 
QPZiVYNKLTF 
BPOSGLCPKA 
PEKIiMVFRTV 
I/SFRSLKEIS 
DPLC8SGGCH 
GTATOraSGS 
CKGPELQDCL 
LERGESXEPL 
IKVIEDKSGR 
QHRGALGPQL 
FDDKQLLYSE 
PDI.LEKGERL 
ESGPGZAPGP 
PRGSQ8LL8P 
VTGSBAEI/QE 
MPDTHLKGTP 
GYEYHDV6SD 
ACPASBQGYE 
RT 



31 
I 

GTINGLSVTG 
MNEFSTLPLP 
RMDKXfCHMDT 
APQGNGHCFG 
QIiEPNPHTKY 
CEGTGSGSRF 
RBITOYLWIQ 
AQRIYISANR 
GPGPGQCLSC 
DTCAQCAHFR 
GQTLVLIGKT 
DPSEKANKVL 
QSFQAVTDHM 
LLNWGVQIAK 
AKTPIKWMAL 
AQPQICriDV 
BPHGLTNKKL 
SSGYMPMNQG 
KVSKCR8RSR 
88RU6TLS8V 
LSA8L0S1QS 
EMRAFQGPO! 



41 
I 

DAENQYQTLY 
NLRWRGTQV 
ZDWRDXVRDR 

ramwoiDB 

QYGGVCVASC 
QTVDSSNIDG 
SI9PPHMHNFS 
QLCYHH8UIW 
RNYSRGGVCV 
DQPHCVSSCP 
HLTMALTVIA 
ARIFKETELR 
LAIG8LDHAH 
GMYYLEBHGM 
ESIHPGKYTH 
YMVMVKCWMI 
EEVELEPELD 
NLGG8GQESA 
SRSPRPRGD8 
GLSSVLGTBB 
CPZdiPVPZMP 
QAPBVHYARL 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1431 



60 

120 

180 

240 

300 

360 

420 

480 

490 



60 

120 

180 

240 

277 



51 
I 

KLYBRCEJWM 
YDGKFAIPVM 
DAEXWKDNG 
CAGGCSGPQD 
PHNFWDQTS 
FVNCTKILGN 
VFSNLTTIGG 
TKVLRGPTEE 
THCNFLNGEP 
HOVLGAKGPI 
GLWIFMMLG 
KLKVLGSGVF 
IVRLLGLCPG 
VHRNLAARNV 
QSDVWSYGVT 
DENIRPTFKE 
LDLDLBAEED 
V8GS8BRCPR 
AYHSQRHSIiL 
BDBDBEYEYM 
TAGTTPDBDY 
KTLR8LEATD 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1342 
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Seq ID HOt C291 Protein Sequence 
Protein Accession fit KP_001207.l 

^1 11 21 31 41 SI 

5 I I t i I I 

MAPLCPSPWL PLLIPAPAP6 LTVQLLLSLL LLMFVHPQRL PRKQEDSPLG 6GSS6EDDPL 60 

GBEDLPSEED SFRBEDPPGB EDLPGBEDLP GEEDLPEVKP KSEEBGSLXL EDX#PTVBAP6 120 

DPQBPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAORF QSPVDIRPQL AAFCPALRPL 180 

BLL6F0LPPL PBLRUUINGB SVQLTLPPGL EMAIiGPGREy RALQUILKHQ AAGRP6SEKT 240 

10 VE6HRFPABX HWHLSTAFA RVDEAU3RPG GLAVIiAAFLB EGPEEKSAYB QLIiSRliEEIA 300 

BE6SETQVPG LDISALLPSD PSRYFOYBOS LTTPPCAQGV XWTVFNQTVN LSAKQIATLS 360 

DTLWGPGDSR LQUfFRATQP LNGRVIBASP PAGVDSSPRA AEFVQUfSCb AAGDILALVP 420 

GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 459 

15 Seq ID NO I C292 Protein Sequence 
Protein Accession «: NP_00419B.l 

1 11 21 31 41 51 

ZU KGGAWDEGP TGVKAPDGGW GWAVLF6CFV ITGPSYAFPK AVSVPPKBLI QEFQIOYSDT 60 

AWISSILLAM LYQTGPLCSV CVMRFGCRPV MLVGGLFASIi GMVAASPCRS IIQVYLTT6V 120 

ITGLGLALNF QPSLIMLNRY FSKRRPMANG LAAAGSFVFL CALSPLGQLL QDRYGWRGGF 180 

LILGGU«LNC CVCAAIiMRPIi WTAQPGSGP PRPSRRLIiDL SVFRDRG7VL YAVAA8VMVL 240 

OLFVPPVFW 8YAKDL6VPD TKAAFLLTIL GPIDIFARPA AOFVAGLaXV RPYSVYLFSF 300 

25 SMFFHGLADL AaSTAGDYGO LWFCIFFGI SYGHVQAZiQF EVLHAIVOIH KFSSAIGLVL 360 

IMBAVAVLVG PPSGQCLLDA TRVYMYVFIIt AGABVLTSSL ILLU3HFFCI RKKPKBPQPB 420 

VAAAEEBXLH XPPADSQVDL REVSHFZiKAB PBXMGBWHT PET8V 465 

Seq ID NOt C293 Protein Sequence 
30 Protein Accession fit NP_000349.l 

1 11 21 31 41 51 

I I I I I I 

KALFVRLLAL ALALALGPAA TLAGPAKSPY QLVIiQIISRIiR GRQBGPNVCA VQXVIGllIRK 60 

35 YFTNCKQWYO RKIC6KSTVI SYECCF6YBK VPGBKGCPAA IiPLSNLYETli GWGSTTTQL 120 

YTDRTEKLRP EMBGPGSPTI FAPSNEAWAS LPAEVLDSLV SNVNIELLNA LRYHMVGRRV 160 

LTDELKHGMT LTSMYQNSNI QIHHYPNGIV TVNCARLLKA DHHATNGWH LIDKVI8TIT 240 

NNIQQXIEXE DTPBTI«RAAV AASGLNTMLB GN6QYTLIAP TNEAFBKIPS BTLNRILQDP 300 

BAZiRDUiNNH IbKSAMCABA IVAQLSVETL BGTTliEVGCS GDNLTXHQKA XISMXDIIAT 360 

40 NGVIHYIDEL LIPDSAKTLF ELAAESDVST AIDLFRQA6L GNHLSGSERL TLLAPLNSVF 420 

KD6TPPIDAH TRNLLRNHII KDQLASKYLY .HGQTLBTLGG KKLRVFVYRN SIiCIENSCIA 480 

AHDKRGRY6T LFTMDRVLTP PMGTVMDVLK GDKRFSMLVA AIQSAGIiTET LNREGVYTVF 540 

APTNEAFRAL PPRERSRUjG DAXSLANILK YHZGDBILVS GGIQALVRZiK 8LQGDKLEVS 600 

LKNNW8VNK £PVAEPDXMA IMOWEVITO VIiQPPANRPQ ERGDELADSA LBIFKQA8AF 660 

45 SRASQRSVRL APVYQKLLBR MKH 683 

Seq ID NOs C294 Protein Sequence 
Protein Accession fii NP 006527.1 



50 1 11 21 31 41 51 

I I i / I I I 

MTQRSIAGPI CKLRFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINFQ VPHNQHLISN 60 

IKEMITEASF YhPtlhTKBRV FFRNIKILIP ATWKAMNNSK IKQBSYEKAN VIVTDWYGAR 120 

QDDPYTIiQYR GOGKEGKYIH FTPNFLIiNDH LTAGYGSSGR VFVHEHABLR WGVFDEYNND 180 

55 XPFYIKGQNQ IKVTRCSSDI TGIFVCEK0P CPQEUCXXSX LFKEGCTFIY N9TQNATASI 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSASJDVIT DSADFHHSFP MNGTELPPPP 300 

TF8LVQAGDK WCLVLDVSS KMAEADRLU3 LQQAAEPYLM QIVEIHTFVO lASFDSKQEI 360 

RAQXiHQINSN DDRKLLVSYL PTTVSAKTDX SICSGLKKSF EWEKLNGKA YGSVMILVTS 420 

GDDKLLGHCL PTVLSSGSTI H8IAL688AA PNLEBLSRLT GGIiKFFVPDI SKSNSHIDAF 480 

60 SRISSGTGDI FQQHXQLEST GENVKPBBQL KNTVTVDNTV GmSTMFLVTH QASGPFEIXL 540 

FDPDORKYYT NNFITNLTFR TAStWIPGTA KPGHVfrYTLN NTHHSLQALK VTVTSRASNS 600 

AVPPATVEAF VERDSLHFPH PVMXYANVKQ QFYPILNATV TATVEPBTGD PVTLRUiDDG 660 

AGADVXKNDG lYSRYFFSFA ANGRYSLKVH VNHSPSISTP AH8IP6SHAM YVPGYTANGM 720 

IQMNAPRKSV GRNEBERKWO FSRV8S6GSP SVLGVPAOPH PDVFPPCKII DXiBAVXVBEB 760 

65 LTIiSWTAPGB DFDQGQATSY EIRMSKSIiQN IQDDFMNAIL VNT8KR17PQQ AGXREIFTF8 840 

PQX8TNOPEH QFN0BTBE8H RIYVAIRAMD RKSLQSAVSN XAQAPLFXPP NSDFVPARDY 900 

LIIiKOVLTAM GLZOZICLII WTHRTLSRX KRADXKEaiQT KUi 943 

Seq ID NO I C295 Protein Sequence 
70 Protein Accession fit Eos sequence 

1 11 21 31 41 51 

I I I I I I 

»1KFLLILLIiQ ATASGALPLN SSTSLEKNNV LF6ERYLEKF YGLBINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LOLKVTGQLD TSTLEMMHAP RCX^VPDVHHF REMPGGPVHR KRYITYRUIN 120 

YTPDMNREDV DYAIRKAFQV MSNVTPLKFS KXimSMADIL WFiVRGAHGD FHAFDGKGGX 180 

LAHAFGPGSO IGGDAHFDED EFWTTHSG6T NLPLTAVHEI GHSLGLGH&S DPKAVMFPTY 240 

KYVDHTTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSPDA VTTVGNKXFF 300 

FKDRFFWLKV SERPKTSVmi XSSLWPTLPS GIBAAYEIEA RNQVFLFRDD KYmiXSNLRP 360 

EPNYPXSIH8 FGFPMFVKKI DAAVFNFRFY RTYFFVDKQY WRYDERRQMM DPGYPKLXTK 420 

NFQGXGPRID AVFYSKtlKYY YFFQGSNQFB YDFLLQRXTK TLKSNSWFGC . 470 

Seq ID NO I C296 Protein Sequence 
Protein Accession fit Eos sequence 



75 
80 
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1 XI 21 31 Al 51 

I I 1 1 I I 

MKFUiZLIiLQ ATASGALPLN SSTSLEKHNV LFQERYIiBKP YGLEINKLPV TXMKYSGNLM 60 

KBKIQEMQHP LGUCVTGQLD TSTLEMMHAP RCGVPDVHRF RENFGG?VHR XHYITYRZini 120 

YTPDMNREDV DVAIRKAFQV WSNVTPLKFS KINTGMAOIL WPARQAHOD FHAFDGKGGI 180 

lAHAFGPGSG IGGDAHPDED BPWTTHSGGT NLFLTAVHAI GHSLGIiGHSS DPKAVMPPTY 240 

KyVDINTPRL SADDIR0IQ8 LYGDPKENQR LPNFDNSEPA LCDPNIiSFDA VTTVGHKIFF 300 

FKDRFFHLKV SBRPKTSVNIi JSSLHPTLPS GIBAAYBISA RHQVPLFKUD KYHLISHLRP 360 

EPNYPKSIHS FQFPNPVKKI DAAVPNPRPY RTYFFVDHQY HRYDERRQNH DPGYPKLITX 420 

NFQGIGPKID AVFYSKNXYY YFFQGSNQPB YDFLLQRITK TLKSNSWFGC 470 

6eq ZD NO: C297 Protein Sequence 
Protein Accession «i NP.oOBSes.l 

1 11 21 31 41 51 

I I I I I I 

MAKDNSTVRC FQ6LLIFGNV II6C06IALT ABCIFFVSDQ BSLYFIiLBAT DNDDIYQAAH 60 

IGIFVOICIiF OjSVLGIVQI MKSSRKILLA YFILMPIVYA FEVA6CXTAA TQRDFFTPNL 120 

FLKQMIiERYQ NMSPPNNDOQ WKNNOVTKTW DRZiHLQDNCC GVNGPSDKQX YTSAFRTENN 180 

DADYPHPRQC CVMNNLKBPL NLBACKLOVP GFYHNQGCyB IiZSGPMMRHA VIGVAWFGFAZ 240 

LCWTFWVXiLQ TMFyWSRXBY 260 

Seq ZD NO: C298 Protein Sequence 
Protein Accession #i NP_001784.2 

1 11 21 31 41 51 

I I I I I I 

MGLPRGPLA8 LLLLQVCWLQ CAASEPCRAV FREAEVTZ£A GOAEQBFGOA L6KVEf4GCPG 60 

QEPALFSTDN DDFTVRNGST VQBRRSIiKER NPLKZFPSKR ZLRRHKRDHV VAPZSVPEHG 120 

KGPFPQRLNQ LKSNKDRDTK IFYSZT6PGA DSPPEGVPAV EKBTGWLLLN KPLDREEZAK 180 

YELFGHAVSE NGASVEDPMN ISZZVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAZYTYN QWAYSIHSQ BPKDPHDLMP TZHRSTGTZS VISSQLDRBK VPBYTLTZQA 300 

TDMDGDGSTT TAVAWEZLD ANDNAPHFDP QKySARVPEN AVGHBV^T VTDIjDAPKSP 360 

AWRATYXjZMQ GDDGDHFTZT THPESNQQZZi TTRKBLOPEA RNQHTLYVBV TNEAPFVIiKXj 420 

PTSTATZWH VEDVNEAPVP VPPSKWBVQ EGIPTGEPVC WTAEDPDKB NQKISYRILR 480 

DPAGWLAMDP D5GQVTAVGT LDREDEQFVR NNXYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRQVLNZTDK DLSPHTSPPQ AQLTDDSDZY WTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVZRATVOX: HGHVETCFGP HKGQFILPVL 660 

GAVLALLFLIi LVI.LLLVRK1C RKIKEPLLLP EDDTRDNVFY YGEEG6GEED QDYDZTQLBR 720 

GLEARPEWZi RNDVAPTZIP TPMYRPRFAN FDBZGNFIIB HLKAANTDPT APPYDTLLVF 780 

DYBQSGSDAA SLSSLTSSAS DQDQDyDYUN EWQSRFXKLA DKYGGGEDD 829 

Seq ZD NO I C299 Protein Sequence 
Protein Accession #: NP_005620.1 

1 11 21 31 41 51 

i I 1 1 I I 

MAKKSAENGZ YSVSGDEKKO PLZAPGPDGA PAKQDQPVQL GTPGGRIiAVP PRETWTRQMD 60 

PZMSCVGPAV GLGNVWRPPY LCYKNGGGVP LZPYVLIALV GGlPZPFIiEI SLGQPMKA6S 120 

ZNVWNZCPLP KGLGYASKVZ VFYOJTYYZM VLAWQPYYLV KSFTTTLPWA TCX3HTWNTPD 180 

CVBZFRHEDC ANASIANLTC DQIADRRSPV ZEFWBNKVLR IjSGGLEVPGA LNWEVTLCLli 240 

ACWVLVYFCV WKGVKSTGKZ VYFTATFPYV VLWLLVRGV LLPGALDOZZ YYLKPDWSKL 300 

QSPgVWZDAQ TQIFFSYAZO LQALTALGSY NRFNNNCYKD AlZIiALZNSG TSFFAGFWF 360 

SZLGFMAAEQ GVHZ8KVAES 6P6IAFZAYP RAVTLHPVAP LWAALFFFML LLLGLDSQFV 420 

GVEGFZTGLL DLLPASYYFR FQREZSVAIiC CALCFVZDLS MVTDGGMYVP QLFDYYSASG 480 

TTLZiHQAFWB CWVANVYGA DRFMDDZACM Z6YRPCPWMK WCHSFFTPLV CMGZFZFNW 540 

YYBPXiVYNNT YVYFWHGBAM GWAFAZ*8SML CVPLHIiU3CL IiRAKOTKAER WQKLTQPZWQ 600 

LHBZiEyRAQD ADVROLTTLT FVSESSKVW VESVM 635 

Seq ZD NO: C300 Protein Sequence 
Protein Accession ft: NP_006507.1 

1 11 21 31 41 51 

11)111 
MEPSSKKLTQ RLMLAVGGAV LGSLQFGYNT GVZNAPQKVZ EEPYNQTWVH RYGESZLPTT 60 
IiTTLWSLSVA ZFSVGGMZGS FSVGLFVNRF GRRNSMLMMN LLAFVSAVLM OFSKLGKSFE 120 
MLZLGRPZZQ VYCX3LTTGFV PMYVGBVSPT AFRGALGTLH QLGZWGZLZ AQVFGLDSZM 180 
GNKDLWPLLL SZZFIPALLQ CIVLPFCPES PRFLLINRNE ENRAKSVLKK LRGTADVTHD 240 
LQEMKEBSRQ HMRBKKVTZL ELFRSPAYRQ PXLZAWLQIi SQQLSGZNAV FYYST8ZFBK 300 
AOVQQPVYAT X68QZVMTAF TWSLFWER AGRRTIALZG lAGMAGCAZL MTZALALLEQ 360 
LFHKSYLSZV AXFQFVAFFB VGFGPZPWFZ VAELFSQGPR PAAZAVAOFS NIVTSNFZVGM 420 
CPQVVBQXiCO PYVFZZFTVL LVLFPZPTYF KVPBTKGRTF DBZA8GFRQG GASQSDKTPB 480 
ELFHPLQADS QV 492 

Seq ZD NO: C301 Protein Sequence 
Protein Accession #t XP_035292.2 

1 11 21 31 41 51 

I I I I I I 

MA6AGFKRRA ZAAPAABEKB EAREKKLAAK SADGSAPAGB GBGVTLQRNZ TLUIGVAZZV 60 
GTZZGSGZFV TFTGVLKBAO SPGLALWWA ACGVFBIVGA LCYAEUTTTZ SKSGGOYAYM 120 
LBVYGSUPAF LKLNZELLZZ RPSSQYZVAL VFATYLLKPL FPTCFVFESA AKLVACLCVL 180 
LliTAVNCYSV KAATRVQDAF AAAKLXALAL ZZXiXAFVQZO KQDVSNZAPN PSFEGTKLDV 240 
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GNIVLALYSQ LPAYGGWNYL NFVTBEMINP YWJLPLAIII SI.PIVTLVYV LTOLAYFTTL 
STEQMLSSEA VAVDFGMVHL GVMSWIIPVP VGLSCFGSVN GSLPTSSRLP PVGSRBGHLP 
SILSMIHPQL LTPVPSLVPT CWTUiYAFS KDIFSVINFF SPFNWLCVAL AIIGMIWLRH 
RKPBLBRPZK VMLALPVFPI lACLPLZAVS FWXTFVEGGZ GFTIXLSGLP VyPFGVNNKN 
KPKHLLQGIP STTVLOQKLM QWPQBT 

Seq ID NO: C302 Protein Sequence 
Protein Accession #: NP_0052S9.1 

I 11 21 31 41 51 

I I I I I I 

KNWSIFBGLL SGVNKYSTAF GRIHLSLVFI FRVLVYLVTA ERVWSODHKD FDOnRQPGC 
SNVCFDBFFP VSRVRLHALQ LILVTCFSLL WMHVAYRBV QEKRHRBAHG EHSGRLYLNP 
GKKRGGLNHT YVCSliVPXAS VDIAFLYVFH 8PYPKYILPP WKCBADPCP NIVDCFISKP 
6EKNXPTLFH VATAAICZLL NLVBLIYLVS KRCHEOAAR KAQAMCTGHK PKGnrT8SCKQ 
DDLLSGDLIP LOSDSHPPLL FDRPRDHVKK TIL 

Seq ID NO: C303 Protein Sequence 
Protein Accession «< NP 005121.1 



11 



51 



21 31 41 

i 1 1 I I i 

MKICSIiTLLS FLZiLAAQVLL VBOKKKVKNO LHSKWSEQK DTLGNTQIKQ KSRPGNXGKF 
VTKDQANCRN AATEQEBGZS LKVBCTQLDH EF8CVFAGNP TSCLKLKDER VYWXQVARNb 
RSQXDICRYS KTAVKTRVCR KDPFESSIiKL VSSTLFGNTK FRXEKTEMSP REHIXOXETT 
PSSIiAVTQTM ATKAPBCVBD PDMANQRKTA LBFOGBTHSS IiCTFFLSZVQ DTSC 



Seq ZD MOt C304 Protein Sequence 
Protein Acceeaion #i AAH22542 



11 



21 



31 
I 



41 



51 



KCSEZZLROB VLKDGFHRDL LZKVKPGBSI EDUSTCRLLI RQDIPAOLYV DPYEZASLRE 
RNITEAVMVS ENFDIEAPNY LSXESEVLIY ARRDSQCXDC FQAFIiPVHCR YHRPHSEDGB 
ASIWMNPDL LMFCDQAGSR RMIRFRFDSF DKTIBFPILK CWAHSEVAAP GALBNBDIGQ 
HNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTLLZTILCS RKKKK 

Seq ID NOx C305 Protein Sequence 
Protein Accession Us NP 004985.1 



MSLWQPLVLV 



ZTYWZQNYSE 
FDGKDGLLAH 
YSACTTDGRS 
ACTTDGRSDG 



PMYRFTEGPF 
PTAGPTGPPS 
RFSE3RGSRP 
IiDKLQLGADV 
THDVFQYRSK 



11 
I 

LIiVliGCCPAA 
UilfliQKQIiSIi 
DLPRAVZDDA 
AFPPGPGZQO 
DOLFWCSTTA 
YRHCATTANY 
LHCATTSNFD 
LHKDDVNGZR 
AGPTGPPTAG 
QGPFLIADKW 
AQVTGALRSG 
AYFOQDRFYW 



21 

1 

FRQRQSTLVL 
PBTGBliDSAT 
FARAFALWSA 
DAHFDDDELW 
NYDTDDRFGF 
DRDKLFGPCP 
SDXKSrGFCPD 
HLYGPRFEPB 
PSTATTVPLS 
PALPRKLDSfV 
RGKMUiPSGR 
RVSSRSBUIQ 



31 
1 

FPGDLRnn/T 
LKAMRTPROG 
VTPLTPTRVY 
SLGKGWVPT 
CPSERLYTRD 
TRAD8TVMG6 
QOYSIiFLVAA 
PRPPTTTTPQ 
PVDDAOIVNZ 
FEEPLSKKLP 
RLWRFDVXAQ 
VDQVGYVTYD 



41 

! 

DRQIiAEEYIiY 
VPDLQRFQflF 
8RDADZVZQP 

RFGNADGAAC 
QIADGKPCQF 
NSAGBLCVFP 

PBlPPTVCPT 
FDAZAEZGNQ 
FFSQRQVHVY 
MVDPRSASEV 
ZLQCPED 



51 
I 

RYGYTRVAEM 



Seq ID NOt C306 Protein Sequence 
Protein Accession #i MP 000204 



MAGPRP8PNA 
CSTTQABLLAA 
EVFBPLE8PV 
PQTDMRPEKL 
QTAVCTRDIG 
TQDYPSVPTL 
EAFKRIRSUL 
THVOQLPEDQ 
C8BGHSGQTC 
PQCPRTSGFI* 
CHCHQQ8LYT 
LKRAEBVWR 
LLPLLALLLL 
SGNLKGRDW 
AQIiRQBVEBN 
LTEKQVEQRA 
LVEAIDVPAO 
K8QVSYRTQD 
VQLSNPKFGA 
RKIHFNWItPP 
AQGEGPYSSIi 
NDDXmPIGPM 
KRPMSIPIIP 
BBLOLSRVTW 



11 
I 

RLLLAALISV 
G0QRE8IVVM 
DLYILMDPSN 
KEPHPNSDPP 
HRPDSTHLLV 
VRLLAKRNIX 
DIRALDSPRG 
KGNIHLKPSP 
NC8TGSLSDI 
CNDRGRCSMG 
DTICEINYSA 
CSFRDEDDDC 
LCWKYCACCK 
RWKVTOHMQR 
LNEVYRQISG 
PHDLKVAPGY 
TATLGRRIiVN 
GTAQGNRDYI 
HLGQPHSTTI 
SGKPKGYRVK 
V8CRTHQEVP 
KKVLVDNPKN. 
DIPZVDAQSG 
RLPPELZPRL 



21 

I 

SLSGTLANRC 
BSSFQITEET 
SMS0DU3NLK 
FSPiCNVISLT 
FSTBSAFHYE 
PIPAVTNYSY 
LRTEVTSKMP 
SDGLKMDAOI 
QPOiRBGEDK 
QCVCBPGHTO 
IHPOIiCEDLR 
TYSYTMEGDQ 
ACLALLPCCN 
PGFATHAA8I 
VHKLQQTKFR 
YTLTADQDAR 
ITIIKEQARD 
PVEGELItPQP 
IZRDPDEIiDR 
YHIQGDSBSE 
SEPGRIiAFNV 
RMLLIENLRE 
BDYDSFLMY8 



31 
I 

KKAPVKSCTE 
QIDTTIiRRSQ 
KMGQNIARVL 
EDVDEFRNKIi 
ADGANVLAQI 
SYYEKLHTYF 
QKTRTGSFHI 
ICDVCTCELQ 
PCS6RGE0QC 
PSCDCPIiSNA 
SCVQCQAWGT 
APGPNSTVLV 
RGHMVGPKED 
NPTELVPYGL 
QQPNA6KKQD 
GMVEFQEGVE 
WSPBQPEFS 
GBAWKBLQVK 
SPTSQMLSSQ 
AHLIiDSKVPS 
VSSTVTQL8W 
8QPYRYTVKA 
SDVLR8P8GS 
BAPTAPRTTA 



41 

I 

CVRVDKDCAY 
MSPQGIiRVRL 
SQLTSDYTIG 
QOERI8GI7LD 



FVSSL6VLQE 
RRGEVGIYQV 
KEVRSARCSF 
GHCVCYGEGR 
TGZDSNGGIC 
GEKKGRTCEE 
HKKKDCPPGS 
HYHIiRatLMA 
SLRLARLCTE 
HTIVDTVIHA 
LVDVRVPLFZ 
VSRGDQVARZ 
LLELQEVDSIi 
PPPBGDLGAP 
VBLTNLYPYC 
AEPAETNGBI 
RNQAGWGPER 
QRPSV6D0TG 
ARAQRAAAVP 



GVABHGDGYP 
HFPPIFEGRS 
PPXFQGQSYS 
FTFLGKEYST 
HSSVPEAUIY 
6PPTVHP8ER 
LYLFKDGKYW 
TQASVLGPRR 
DRMFPGVPLD 



51 

I 

CTDEMFRDRR 
RF6EERHFEL 
FGKFVDKVSV 
APEGGFDAZL 
DTTGTYTQYR 
DSSNIVBUiE 
QLRALEHVDG 
NGDFVOGQCV 
YBGOFCBYDN 



CNPKVKMVDE 
FWHLIPLIiLL 
SDHLDTPMIiR 
NLLRPDTREC 
PR8AKPALLK 
RPEDDDBKQL 
PVZRRVLDGO 
LRGRQVRRPH 
QNPNAKAAGS 
DYEMKVCAYG 
TAYBVCYGLV 
EAimLATQP 
06HKPBPLL6 
R8ATPGPFGE 



300 
360 
420 
460 
507 



60 

120 

160 

240 

273 



60 
120 
160 
234 



60 
120 
160 
225 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

707 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

loeo 

1140 
1200 
1260 
1320 
1380 
1440 
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HLVHGKMDFA 
SHSTTLPRDY 
SVBYQLLNGG 
QVHPQSPIiCP 
AQGGGPATAF 
PFPQLGSRAO 
VSRTLTTSGT 



FPGSTKStiHR 
STLTSVSSHD 
BLHRtKIPNP 
LPQSAPTLST 
RVDGDSPBSR 
IiFQHPLQSEY 
LSTHMDQQFF 



NTTTSAAAYO 
SRLTAGVFDT 
AQTSVWBDIi 
PSAPGPLVFT 
LTVPGLSENV 
SSITTTRTSA 
QT 



THLSPHVPKR 
PTRLVFSAUS 
LPWHSYVPRV 
AZiSPDSLQLS 
PYKFKVOART 
TEPFLVDGLT 



Seq ZD NO: C307 Protein Sequence 
Protein Accession NP_076404.1 



1 
I 

MGFNLTLAKL 
GLAVWXFFHI 
PYANMYTSIV 
NGQPTBDKIH 
XSQSSRKRRH 
FLSACNVCLD 



11 
1 

PNMEIiKQQES 
RNKTSFIFYI* 
FLGLXSIDRY 
DCSKLXSPLO 
NQSIRWVAV 
PIIYPFMCRS 



21 
I 

HNS6NRSDGP 
KKIWADMH 
LKVVKPPt3)S 
VKHHTAVTYV 
FFTCFLPYHIi 
F8RRI.FXKSN 



31 
I 

GKNTTLHNEP 
TLTFPFRIVH 
RMYSITFTKV 
NSOiFVAVLV 
CRIPFTFSHL 
XRTItSBSIRS 



VliSTSSTLTR 
PT8LRVSNQB 
RAQSQGOWGR 
WERPRRPNGD 
TEOFGPGREG 
ZiGAQHIiEAQQ 



41 

I 

DTIVLPVLYL 
DAGFGPWYPK 
LSVCVWVIMA 
ILIGCYIAIS 
DRLLDBSAQK 
I4SVRRSBVR 



DYNSLTRSEB 
PRCBRPLQGy 
BREOVXTIES 
IVGYLVTCEM 
IITIBSQDGO 
SLTRHVTQBP 



51 
I 

IIFVASILIiN 
FILCRYTSVL 
VLSLPMIILT 
RYIHKSSRQP 
ILYYCKBITL 
lYYDYTDV 



Seq ID KOi C308 Protein Sequence 
Protein Accession #i NP 065840.1 



1 
I 

MVWCL6LAVL 
GFLLPIFIQF 
DFANGSWVHL 
QGQVQVQNGT 
SQDVSLACHA 
CVPSKGIiIjHP 
EGSLIIALGN 
LIPCSAQCaJP 
TSTNVYTLGT 
WVSLAVPVOA 
PTEIPPPLSP 
TETELLVPQL 
A6VVG6VCFL 
SPOSVAKUCL 
PTVAAPQBR8 
LPGPGPLLQY 
PRESLPGAW 
PSTAPSAGG6 
EliliETUILQL 
DATRARLPAY 



11 

I 

SLVISQGADG 
GLYSPRIDPD 
TVNSPPQFQE 
LRIRRVERGS 
EAYPANLTYS 
PSASAYLTVL 
EDALGBYSCT 
PPWSWTKVG 
SPHWTNVSV 
AHLLVPGLQP 
PRGLVAVRTP 
IKDVIiYBFRL 
GVAVLVSILA 
QGSPVPSIiRQ 
QREQAEPRTF 
LSLPPFREMN 
GAGATAEPPY 
YLSPAPGDTS 
ASSRLRPEAB 
RQPVPHPBQA 



21 

I 

RGKPEWSW 
YVGRVRLQKQ 
TPPAVLSVQE 
SGVYTCQASS 
HFQDKINVFH 
CMPOVIRCPV 
PYNSL0TA6P 
RGLQGQAQVD 
VALPXGANVS 
HTQYQFSVXiA 
R6VLLHWDPP 
VAFAGSFVSD 
GCLIiNRRRAA 
SUiHQDPAGT 
AQRLARSFDC 
VDGDWPPLEB 
TALADWTLRB 
SWASGPERWP 
TELGVCTPBB 
TLL 



31 
I 

GRAEESWLG 
ASLQIEGLRV 
LEPVTLRCVA 
TEGSATHATQ 
XSRLQPRVQI 
RANPPLLFVS 
SPVTRVLLKA 
SNSSLXLRPL 
NBPGFDGGYL 



ELVPKRU7GY 
PSMTANVSTS 



PSPHTOPPSS 



PSPAAPPDYM 
RLIjPGLLPAA 
RREHWTVSK 
GCXjLNTABVT 



41 

1 

CDLLPPAGRP 
EDQQWYECRV 
RGSPLPHVTW 
IiLVIiGPPVIV 
LVDGSLRLLA 
WTKDGKALQL 
PPAFIERPKE 
TKEAH6HWEC 
QRF8VHYTPL 
BXVLSAPEGL 
VIiEORQGSQG 
QLEVYPSRTQ 
PPI.XFSPTQK 
RGPIiPLBPXC 
PLCIEDI8PV 
DTRRCPTSSP 
PRGSLTSQSS 
RRNTSVDENY 
GPEARGAAliR 



51 
I 

PliHVIEWLRP 
PFLDQHIPED 
KLRQKDLGQQ 
VPPKNSTVIIA 
TQPDDAOCYT 
DKFPGWSQGT 
EYPQBVGREL 
8ASNAVARVA 



FTTPAAPGIiP 
WEVLDPAV7VG 
LPGLIiPQPVL 
SAAPSAL6S6 
RGPDGRFVMG 
APPPAAPPSP 
IjRSPETPPVS 
GRGSASFLRP 
EWDSBFPGDM 
EBFZiAFRRRR 



Seq ID NO: C309 Protein Sequence 
Protein Accession #: 



41 



51 



1 11 21 31 

I I I I 

MLTKPLQGPP APPGTPTPPP GGKDREAFEA BYRLGPLLGK GGFGTVFAGH RLTDRLQVAI 
KVXPR21RVL6 WSPLSDSVTC PLEVAIiLWKV GAGGGHPGVX RLLDHFETQB GFMLVLERPL 
PAQDLFDYIT EKOPLGEGPS RCFFGOWAA XQRCH&RGW RRDIKDENXL IDLRRGCAKL 
XDFGSGALLH DEFYIDFDGT RVYSPPEWXS RHQYHAIiPAT VHSLGILLYD MVC6DXPFER 
DQEXLBAEUI FPAHVSPDCC ALIRRCXiAPK PS8RF8LEBI LLDPWMQTPA EDVTPQPLQR 
RPCPFGLVLA TLSIAMPQLA PKGQKSHFm MSQG 



Seq ID NO: C310 Protein Sequence 
Protein Accession #i IIP_002501.1 



1 
I 

MEOiYYFLGF 
VWKRQDMRHK 
RNEAQL8ADP 
GQYFQKLGRC 
TMFQKNDRNS 
HTYVX^OTPS 
RYGHFQATXT 
CEITQNTVCS 
PLRMANSALI 
FPGNQBKDPL 



11 
I 

LLLAARLPLD 
NSHXQGRVQA 
YVYNWTAWSB 
SVRVSVNTAN 
SDETFLKDLP 
LMLTVXAAAP 
XVEGXLBVin 
PVDVDEMCLL 
SVGCLAIPVT 
LKNQBFXGVS 



21 
I 

AAKRFHDVLQ 
VLTSDSPALV 
DSDGENGTGQ 
VTLGPQLKBV 
XMFDVIilHDP 
GPCPPPPPPP 
ZC2MTDVXMPV 
TVRRTFNGSO 
VXSIiLVYKKH 



31 
I 

NBRPSAYMRB 
68NITFAVML 
SHHNVFPDGK 
TWRRHGRAY 
SHFI^STXK 
SPSKPTPSIiG 
PHPE8SLIDF 
TYCVNLTLGD 



41 

1 

BNQLNGHSSD 
ZFPRCQREDA 
PFPHHPGKRR 
VPXAQVKDVY 
YKW8FGDNT6 
PAGDNPIiBLS 
WTOQGSIPT 
DTSLALTSTL 
QNWRSKGLS 



51 

1 

BNDHNEKLYP 
NGNIVYEKNC 
WNFIYVFHTL 
WTDQIPVFV 
LFVSTNHTVN 
RiroENGQIN 
BVCTIISDPT 
Z8VPDRDPA8 
VFLilRAKAVF 



Seq XD MO: C311 Protein Sequence 
Protein Accession #: Eos seq 



1 11 

I I 
MRXLKRFLAC XQLLCVCRU) 
QSPZKXDBDL TQVNVNLKKL 
FKASKITPHW GKCNMSSDOS 
XLFEVGTBEN LDFKAIIDGV 
7DTVDHZVPR DTVSISBSQIi 



21 31 41 51 

till 

hahoyyrqqr klvebxgwsy tqalhqxnhg kkyptcnspr 
kfqgwdktsl entfihntgk tvbxnltody rvsg6vs0w 
ehsiiegqkfp lekqiycfda drf8sfbeav kgkgklrals 
esvsrfgkqa aldpfiuinl lpkstdkysfx ykgslt8ppc 
avfcbvltnq qsoyvmuidy lqmnfrbqqy kfsrqvfssy 

1354 



1500 
1560 
1620 
1680 
1740 

laoo 

1822 



60 

120 

ISO 

240 

300 

358 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1163 



60 

120 

180 

240 

300 

334 



60 

120 

180 

240 

300 

360 

420 

480 

540 

560 



60 

120 

180 

240 

300 
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TCKEEIHBAV CSSEPHIVQA DPBNYTSLLV THERPRWYD TMIEKPAVLY QQLDQEDQTK 360 

HBPLTDOYQD LOAILNNLLP NMSYVIiQIVA ICTNOLYGKY SDQLIVDMPT DNPBLDLPPE 420 

LIOTEBIIKB EEEGKDIBBG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYN EAKTW 480 

RSPTRGSEP8 OXODVPNTSL NSTSQPVTKL ATBRDZSLTS QTVTBLPFilT VBGTSASUID S40 

5 GSKTVLRSPH Mllt,SGTAESL NTVSITBVEB BSLLTSFRLD TGABDSSGSS PAT8AIPPIS 600 

BNISQGYIFS SENPETITYD VLIPBSARNA SBOSTSSGSB ESUCDPSMEG NVWPPSSTDI 660 

TAQPDVOSGR ESPLQTMYTE IRVDBSBKTT KSP8AGPVMS QOPSVTDLEM PHYSTFAYPP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNBASNSS HESRI6LAEG LESEKKAVIP 780 

LVrVSALTPI CLWIiVGILI YWRKCFQTAH PYLBDSTSPR VISTPPTPIP PISDDVGAIP 840 

10 IKHPPKHVAD liHASSGPTBE PBEVQSCTVD LGITADSSMB PDNKHKMRYI MIVAYDHSRV 900 

KLAQIiAEKDG KLTOYINANY VDGYNRPKAY lAAQGPLKST AEDFWRMIWB HNVBVIVMIT 960 

NLVEKGRRKC DQYWPADGSB BYGNPI.VTQK SVQVLAYYTV HNPTLRNTKI KKGSQKGRPS 1020 

GRWTQYHYT QWPDKGVPBY SLPVLTPVRK AAYAKRHAVG PVWHCSAGV GRTGTYIVliD 1080 

SMLQQIQHBO TVNIPGFLKH XRSQRNYLVQ TEEQYVPIHD TLVEAIX.SKE TEVXiDSBXBA 1140 

15 YVNALLIPGP AGKTKLBKQP QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVBRSRV6I 1200 

SSLSGEGTDY INASYIMQYY QSNBPIITQH PLLHTIKDFH RMIWDHNAQL WMIPDOQNM 1260 

AEDEFVYWPN KDEPINCESP KVTLMABEHK CliSNEBKLII QDPILEATQD DYVLEVRHFQ 1320 

CPXWPNPDSP ISKTPELISV IKESAANRDG PMZVBDEHGG VTAGTFCALT TIjMHQLBKEN 1380 

SVDVYQVAKM INLMRPGVFA MBQYQPLYK VXLSLV8TRQ EBIPSTSIiDS NGAAIiPDGHI 1440 

20 AESLESLV 1*48 

Seq ZD NOx C312 Protein Sequence 
Protein Accession XP_031379 

25 1 11 21 31 41 51 

1 r I I i i 

MRILKRFLAC IQLLCVC3UiD WANGYYRQQR KLVEEIGHSY TGALKQKNVfG KKYPTCNSPK 60 

QSPINXDBDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVBINLTNDY RVBGGVSEMV 120 

PRASKITPHW GKCNMSSDGS EHSLBOQKFP LEMQIYCPDA DRF88FBBAV KQXGKLRALS 180 

30 ILFBVGTBEN UDPKAIIDGV BSVSRFGKQA ALDPFILIiHL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDHIVPK DTVSISESQI« AVPCBVLTMQ QSGYVMLMDY MJNNFREQQY KFSRQVF8SY 300 

TOKEBIHEAV CSSEPENVQA DPEMYTSIiLV TWERPRWYD TMIEKPAVLY QQUXJEDQTK 360 

HEPLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNOLYGiOf SDQLIVDMPT DNPELDLPPE 420 

LIGTEEIIKB BEBGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKXNEARni 480 

35 RSPTRGSBPS GKGDVPNTSL NSTSQPVTIOi ATEKDISLTS QTVTELPPHT VEOTSASLND 540 

GSKTVXjRSPH KNL8GTAE8L NTVSITBYEE ESLLT8PKLD TGAEDSSGSS PATSAIPPIS 600 

EWISQGYIFS SENPETITYD VLIPESABNA SEDSTSSGSE ESLKDPSMEG NVWPPSSTDI 660 

TAQPDVGSGR ESPLQTNYTE IRVDESEKTT KSPSAGPVMS QGPSVTDLEM PHYSTFAYPP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ PSYSSBVPPL VTPUiXiDNQZ 780 

40 LNTTPAASSS DSALHATPVP PSVDVSPESI LSSYDGAPLL PPSSASFSSE LFRHLHTVSQ 840 

ILPQVT8ATB SDKVPLBASIj FVAGGDLLLB PSLAQYSDVL STTKAASETL EFGSESGVLY 900 

KTLMFSQVEP P8S0AMMBAR SSGPEPSYAL 6DNB68QHIP TVSYSSAIFV HDSVGVTYQG 960 

SLPSGPSHIP IPKSSLITPT ASLDQPTHAL SGDGBWSQAS SDSEFLLPDT DGLTALNISS 1020 

PVSVAEFTVT TSVFGDDNKA LSKSBIIYGN BTELQIPSFW EMVYPSESTV MPNMYDNVNK 1080 

45 LNASLQBTSV SISSTKGMPP GSUVHTTTKV PDHEISQVPE NNPSVQPTHT VSQASGDTSL 1140 

KPVI.SANSEP ASSDPASSEM LSPSTQIOiPy ET9A8FSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPIIiVE TFKVDKISST MLSLIVSNSA 8SENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASBKYB PVLLKSESSH QWPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

EPLNTLINKL IHSDBILTST KSSVTGKVPA GIPTVASDTF VSTDHSVPIQ NGHVAITAVS 1380 

50 PHRDGSVTST KLLFPSKATS ELSH8AKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGL8IH 1440 

KCMSC8SYRB SQBKVKNDSD THBNSIMDQN HPISYSLSEH 8EEDNRVT8V SSDSQTOIDR 1500 

SPOKSPSASIO LSQKHNDGKB BlDlOfTOSAL LPLSPBSKAH 'AVLTSDEBSG SGQGTSDSLN 1560 

ENETSTDPSP ADXNEKDADG ILAAGDSBIT PGPPQSPTSS VTSENSEVFH VSEAEASNSS 1620 

HBSRIQLAEO IiESEKKAVIP LVtVSALTPI CLWLVOILI YWRKCPQTAH PYLEDSTSPR 1680 

55 VISTPPTPIP PISDDVGAIP IKHPPKHVAD LHASSGPTBB FBTLKBFYQE VQSCTVDLOI 1740 

TADSSNRPDH KHKNRYINIV AYDHSRVRIA QIAEXDQKLT DYINAMYVDG YNRPKAYIAA 1800 

QGPIjKSTAED FWRMIwanW BVIVMITNLV EKGRRKCDQY WPADGSEBYG NFLVTQKEVQ 1860 

VLAYYTVRNF TLRNTKIKKG SQKQRPSGRV VTQYHYTQWP DMGVPBYSLP VLTPVRKAAY 1920 

AKRHAVGPW VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IPGPLKHIRS QRNYIiVQTEB 1980 

60 QYVPIHDTLV BAILSKETBV LDSHIHAYVN ALLIPGPAGK TKLBKQFQLL SQSNIQQSDY 2040 

8AALKQCNRB KNRTSSZZPV ERSRVGZ8SL 80BGTDYXNA SYZHQYYQSN EPIITQHPLL 2100 

HTZKDFWRMZ MDHKAQLWM IPIX3K»IMABD BFVYWPKKDB PINCESFKVT LMASEHKCLS 2160 

NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPZSK TFELZ8VIKE EAANRDGPMI 2220 

VHDEHGGVTA OTFCALTTLM HQLEKENSVD VYQVAKMINL MRP6VFADZB QYQFLYKVIL 2280 

65 8LVSTRQEEN P8T9LDSKGA ALPDGNIAE8 LE8LV 2315 

Seq ID NOs C313 Protein Sequence 
Protein Accession ftt HP_002842 

70 1 11 21 31 41 SI 

jlRZLKRFLAC ZQUjCVCRLD ttAMQYYHQQR KLVEBZGW8Y TGAZ^KNWG KKYPTCa^SPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKOIMSSDGS EHSLEGQKFP LiMQIYCPDA DRFSSFEEAV KGKGKLRALS 180 

75 ILPBVGTEEN LDFKAIIDGV BSVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVPCEVLTHQ QSGYVMLMDY LQNNFRBQQY KFSRQVPSSY 300 

TOKEBIHEAV CSSEPENVQA DPBNYT8LLV THERPRWYD TMIEKPAVLY QQLDGEDQTK 360 

HEFLTDGYQO LGAILNNLLP NMSYVLQIVA ICTOGLYOKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKB BEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

80 RSPTRGSBPS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VBGTSASLND 540 

0SKTVLR8PH MNLS0TAE8I. NTV8XTBYEB E8LLT8PKIJ) TOAmSSQSS PATSAIPFIS 600 

BNISQGYIFS SOTPBTZTYD VLIPBSARRA SOJSTSSaSB BSLKDPSMEG NVWPPSSTDI 660 

TAQPDVOSGR ESFLQTNYTB IRVDESBKTT KSPSAGPVMS QGPSVTDLEM PHYSTFAYPP 720 

TBVTPBAFTP SSRQQDLVST VNWYSQTTQ PVYNAEA8NS SHBSRIOLAB GLBSEKKAVI 780 



1355 



wo 03/042661 



PCT/US02/36810 



PLVIVSALTP ICLWLVGIL lYWRKCFOTA HPYLEDSTSP RVISTPPTPI FPISODVGAI 840 

PIKHFPKHVA DLHASSGFTE BFETLKEPYQ EVQSCTVDLG ITADSSNKPD NKHKNRYINI 900 

VAYDHSRVKL AQLAEKDOKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 

VEVXVMITML VEKGRRKCDQ YKPADGSBBY GNFLVTQKSV QVLAYTmN FTUUITKIKR 1020 

5 GSQK6RPSGR WTQYHYTQW PDM6VPBYSL FVLTFVRKAA YAKRHAVGPV WHCSAGVGR 1080 

TQTYIVLDSM LQQIQHEGTV NIPGFUOIIR SQRHYXiVQTE EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVQISS LSQEQTDYIN ASYIKOYYQS NBFIITQHPL LHTIKDFWRM IimRNAQLW 1260 

MXPD6QNMAE DEFVYWPKKD EPINCESFKV TLMAEEHKCL SNEBKLXIQD FZLBAIX^Y 1320 

10 VLEVRHFQCP KWPNPOSPZS KTFELXSVIK BBAANRDQPM IVHDEKGOVT AGTFCALTTL 1380 

MHQLBXEHSV DVYQVAXMZM LMRPGVFADI BQYQPliyKVI LSLVSTRQEB NPSTSLOSNO 1440 

AALPDGNIAB SLESLV 1456 

Seq ID NO: C314 Protein Sequence 
15 Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I i I ' I I I 

KRIUCRFLAC IQLLCVCRLD WAN6YYRQQR KLVEEI6WSY TOALNQKNHQ KKYPTCNSPK 60 

20 QSPINIDEDL TQVNVNLKKL KFQ(3WDKTSL ENTFXHNTGK TVEINLTNDY RV86GV&EMV 120 

MVPKASKITF HW6KCNMSSD GSEHSLEGQK FPLEMQIYCF DADRF8SFES AVKGKGKLRA 160 

LSILFEVGTB ENLDFKAIID GVESVSRFGK QAALDPFZLL m/LPNSTDKY YIYNGSLTSP 240 

PCTDTDWrVP KDTVSISESQ LAVFCBVLTM QQSGYVMLMD YLQNNFREQQ YKFSRQVPSS 300 

YTGKEEIHEA VCSSBPENVQ ADPENYTSLL VTWERPRWY DTMIEKFAVL YQQLDGEDQT 360 

25 KHBFLTDGYQ DLGAILNNLL PNMSYVLQIV AICTNQLY6K YSDQUVDMP TDNPELDLPP 420 

ELIGTEEIIK EEEEGKDIEE 6AXVNPGRDS ATKQXRKKEP QXSTTTHYNR XOTKYNEAKT 480 

NRSPTROSEF SQKGDVPNTS LNSTSQPVTK LATEKDISLT SQTVTELPPH TVEGTSASLN 540 

DGSKTVLRSP HMNLSGTAES LNTVSXTEYE BESLLTSFKL DTGAEDSSGS SPATSAXPFX 600 

SENISQOYXP SSENPETXTY 0VLXPBSARN ASEDSTSSGS BESLKDPSMB GNVWPPSSTD 660 

30 XTAQPDVGSG RESFLQTNYT BXRVDBSEKT TKSPSAGPVM SQQPSVTDLE MPHYSTFAYF 720 

PTEVTPHAFT PSSRQQDLVS TVNWYSQTT QPVYNEASNS SHESRIGLAB GLESEKKAVX 780 

PLVXVSAIiTF XCIfWLVQIL lYWRKCFQTA HPYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG XTADSSlOiPD NKHKNRYINX 900 

VAYDHSRVKL AQLABKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DPWRMIMEHN 960 

35 VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 

6SQKGRPSGR WTQYHYTQW PDMC3VPSYSL FVLTFVRKAA YAKRBAVOFV WHCSAGVGR 1080 

TGTYIVLDSM LQQIQRB6TV NIPGFLKHXR SQRMYLVQTB EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL L8QSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGBGTDYIN ASYIMOYYQS NBFIITQHPL LHTIKDFWRM IWDHNAQLW 1260 

40 MIPDGQNMAB DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEBKLXIQD FILBATQDDY 1320 

VLEVRHFQCP KNPNFOSPZS KTFELXSVIK EBAANRDGFM ZVHDBHGGVT AOTFCALTTL 1380 

MKQLEXBNSV DVYQVAXMXN LMRPGVFADI EQYQFLYKVZ LSLVSTRQEE NPSTSU3SNG 1440 

AALPDGNIAB SLESLV 14S6 

45 Seq ID NO: C315 Protein Sequence 
Protein Accession fts Eos sequence 

X 11 21 31 41 51 

!>0 MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEX0W8Y TGALNQKNHO KKYPTCNSPK 60 
QSPZNIDEDL TQVNVNLKKL KF06HDKT8L BNTFZBNTOK TVEXNLTMDY RVSGGVSEKV 120 
FKASKITFHW GKCMMSSD6S EHSLEGQKFP LQtQIYCFDA DRP8SFEBAV XOXGKLRAIiS 180 
ILFEVQTEBN LDFKAIIDOV BSVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLT8PPC 240 
TDTVDWIVFK DTVSISESQL AVFCBVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 
55 TOKEBXHBAV CS8BPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGBDQTK 360 
HEFLTD6YQD L6AILNNLLP NMSYVLQXVA ICTNGLYQKY SDQLXVDNPT DNPELDLFPB 420 
LIGTEEIIKB EEEGKDZEE6 AIVNPGRDSA TMQXRKKEPQ ZSTTTHYNRI GTKYMBAKTN 480 
RSPTRGSBFS GKQDVPNTSL NSTSQPVTKL ATEKDXSLTS QTVTELPPHT VEOTSASLND 540 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 
60 ENISQGYIFS SENPBTITYD VLXPESARNA 8EDST8SGSB ESLKDPSKEG NVWFPSSTDI 660 
TAQPDV6S6R BSFLQTNYTE IRVDBSBICrr KSFSA6PVMS QGFSVTDIiBM PBYSTFAYPP 720 
TEVTPHAPTP SSRQQDLVST VNWYSQTTQ FVYNBASN8S HESRIGLABG LBSEKKAVZP 780 
LVrVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 
^ IKHFPKHVAD LHAS8GFTEB FETLKEFYQE VQSCTVDLQI TADSSNHPDN KHKNRYINIV 900 

65 AYDHSRVKLA QLABKDGKLT DYINANYVD6 YNRPKAYXAA QGPLKSTAED FWRMXHEHNV 960 
EVZVMIINLV SXORRKCDQY WPADGSBEYG NFLVTQKSVQ VLAYYTyRNF TLRNTKIKKO 1020 
SQKGRP8GRV VTQYHYTQWP DM6VPBY8LP VLTPVRKAAY AKRHAVGPW VHC8AGVGRT 1080 
GTYIVLDSML QQXQHEGTVN XFGFLKHXRS QRNYLVQTBB QYVFIHDTLV EAILSKETEV 1140 
LDSHIHAYVN ALLXPGPAGK TKLEKQFQGL TLSPRLECRG TXSAHCNLPL PGLTDPPTSA 1200 
70 SRVAGTILLS QSNXQQSDY8 AALKQCNREK NRTSSXIFVE RSRVGISSLS GEGTDYINAS 1260 
YIMSYYQSNB FXXTQHPIiLH TIKDFHRMZW DHHAQLWMX roOQNMABDB FVYWPHKDEP 1320 
INCBSFKVTL MABEHKCLSN EEKLZIQDFI LBATQDOYVL EVRHFQCPKH PNPOSPXSKT 1380 
FELX8VXXEE AANRDOPMXV HDEUGOVTAG TFCALTTLMR QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYXVIL8 LVGTRQEEttP 8T8LDSNGAA LFDGNZAB8L B8LV 1494 



75 



Seq ID NO: 0316 Protein Sequence 
Protein Accession St Bos sequence 



1 11 31 31 41 51 

80 I 1 I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEBIGWSY TGALHQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGHDRTSL ENTFXUNTGK TVBXNLINDY RV8GGVSQ4V 120 

FKASKITFHW GKCNM8SD6S EHSLEGQKFP LEMQIYCFDA 0RPS8FBBAV KOKOKLRALS 180 

ILF8VGTBEN LDFKAZIDOV ESVSRFGtQQA ALDPFILLNL LPNSTDKYYI YN6SLT8PPC 240 

1356 
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TDTVDWIVFK 
TGKBBIHBAV 
BEFLTDGYQD 
ESRI6IAEGL 
ISTPPTPIFP 
AOSSNHPDNK 
GPLK8TAEDF 
LAYVTVRNPT 
KRHAVGPWV 
YVFIHDTLVE 
AALKQCNREK 
TIKDFWRMIW 
BBiailQDFI 
HDEHGGVTAQ 
liVSTRQEENP 



DTVSISESQL 
CSSBPEKVQA 
LQAILMNLLP 
BSEKKAVZPL 
ZSDDVGAIPI 
HKNRYn9IVA 
WRMIWEHNVE 
XiRNTKIKKOS 
HC8AGVGRTG 
AILSKETEVL 
NRTSSIIPVE 
DHNAQLWMI 
LEATQDDYVL 
TPCaUiTTLMH 
8TSLDSNQAA 



AVPCEVLTMQ 
DPBMYTSLLV 
NMSYVZiQZVA 
VIV8ALTFIC 
KHFPKHVADL 
YDHSRVKLAQ 
VrVMITNliVB 
QKGRPS6RW 
TVIVLDSMLQ 
DSHIHAYVNA 
RSRVQISSLS 
PDGQNMAEDE 
EVRHFQCPKH 
QLEKENSVDV 
LPDGHIAESIi 



QSGYVMLMDY 
TWBRPRWYD 
ZCTKGXiYGKY 
LWLVGZLZY 
HASSGFTBEF 
LAEKDGKLTD 
KGRRKOXJYW 
TQYHYTQWPD 
QZQHEGTVHZ 
LLIPOPAGKT 
GEGTDYIHAS 
FVYWPNKDEP 
PNPDSPZSKT 
YQVAKMZNLM 
BSL 



LQNNFREQQY 
TMZEKFAVLY 
SDQLTVDMPT 
HRRCFQTAHF 
BTLKBFYQEV 
YZNANYVDQY 
PADGSESYGM 
MGVPEYSLPV 
FGFLKHIRSQ 
KLBKQFQLLS 
YZMGYYQSNB 
ZNCBSPKVTL 
FELZ8VZXEB 
RP6VFADZEQ 



RP8RQVFSSY 
QQLDGSDQTK 
DNPEASNSSH 
YLBDSTSPBV 
QSCTVDL6ZT 
NRPKAYIAAQ 
PLVTQKSVQV 
LTFVRKAAYA 
RNYLVQTBBQ 
QSNZQQSDYS 
FZZTQHPLLK 
MAEBOCCLSN 
AANRSGFMZV 
YQFLYKVILS 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1113 



Seq ZD NOt C317 Protein Sequence 
Protein Accession Eos sequence 



1 
I 

MRZLKRFLAC 
QSPZKZDEDI* 
FKASKZTFHW 
ILFEVGTBEN 
TDTVDWZVFK 
TGKEEZHBAV 
HBFLTDGYQD 
LZGTESIZKE 
RSPTR6SBFS 
GSKTVIiRSPH 
ENZSQGYIFS 
TAQPDVGS6R 
TEVTPHAFZP 
LVZVSALTPZ 
ZXHPPKKVAD 
AHDHSRVKZA 
BVZVMZTNLV 
SQKQRPSGRV 
GTYZVLOSMI) 
LDSHIBAYVN 
SRVTOITZLLS 
YZKQYYQSNE 
ZNCBSFKVTL 
FELZSVZKEB 
RPGVFADIBQ 



11 

1 

ZQLLCVCRIiD 
TQVMVNLKKL 
GKCNMSSDGS 
LDPKAZZDOV 
DTVSZ8ESQL 
CSSBPENVQA 
LGAZLNNLLP 



GKGDVPNTSIi 
KNLSGTAESL 
SENPETITYD 
BSFLQTNYTE 
SSRQQOLVST 
CLWLVGIbZ 
LBASSGFTBE 
QIiAEKDGKZiT 
BK6RRKCDQY 
VTQYHYTQWP 
QQZQHB6TVN 
ALLZPGi;ACnC 
Q8N1QQ8DYS 
FZZTQHPZiLH 
MAEEHKCLSN 
AANRDGPMZV 
YQFIiYKVZIiS 



21 
I 

WANOYYRQQR 
KFOGWDKTSL 
BHSLEGQKFP 
ESVSRFGKQA 
AVFCBVLTMQ 
DPENYTSIjLV 
NMSYVLQZVA 
AZVNPGRDSA 
NSTSQPVTKL 
NTVSZTBYBB 
VLZPESARMA 
ZRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FETZiKEFYQE 
DYZNANYVDG 
WPADGSBBYO 
DMGVPBYSLP 
ZFGFLXHZRS 
TKIiBXQFQOL 
AAliKQOVREK 
TZKDFHRMIW 
EEKLZZQDFZ 
HDEHGGVTAQ 
LVSTRQEENP 



31 
I 

KLVEEIGWSY 
EMTPZHNTGK 
liEMQlYCFDA 
ALDPPZIiLNL 
QSGYVMLMDY 
TWERPRWYD 
ZCTNGLYGKY 
TNQZRKKEPQ 
ATEKDISLTS 
ESIiLTSFKLD 
SED8T8SGSE 
KSFSA6PVMS 
FVYNEASNSS 
FYIjEDSTSPR 
VQSCTVDLQZ 
YNRFKAYXAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEB 
TZiSPRIiBCRG 
NRTSSIIPVE 
DHNAQLWMZ 
LEATQDDYVL 
TFCALTTLMH 
ST8LDSKGAA 



41 

1 

TOAUIQKNWG 
TVBINLTODY 
DRFSSFEBAV 
LPNSTDKYYZ 
LQKNPREQQY 
TWZEKFAVLY 
SDQLZVDMPT 
ZSTTTHYNRZ 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QQPSVTDLEM 
BESRIGLAE6 
VISTPPTPZF 
TADSSNHPDH 
Q6FLKSTAED 
VLAYYTVRNP 
AKRHAVGPW 
QYVFZHDTLV 
TZSAHCNLPL 
RSRVGZSSLS 
PDGQNMAEDE 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNZAESL 



51 
I 

KKYPTCNSPK 
RV86GVSEMV 
RBKOKLRALS 
YN0SLT8PPC 
KFSRQVF8SY 
QQLDGEDQTK 
DNFELDLFPB 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPPZ8 
NVWFPSSTDl 
PHYSTFAYFP 
LESEKKAVZP 
PISDDVGAIP 
KHKNRYINIV 
FWRMZWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAZLSKETBV 
PGLTDPPT8A 
GEGTDYZNAS 
FVYWPNKDEP 
PNPDSPZSKT 
YQVAKMZNLH 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1493 



Seq ID NO I C318 Protein Sequence 
Protein Accession #: Bob sequence 



1 
I 

MRZLKRFLAC 
QSPZNZDEDL 
FKASKZTFHW 
ZLFEVGTEBN 
TDTVDWIVFK 
TGKBBZHEAV 
HEFLTDGYQD 
LIGTBEIIKB 



11 
I 

ZQLLCVCRLD 
TQVNVNLKKL 



21 
I 

WANGYYRQQR 
KPQ6WDKT8L 



GSKTVLRSFH 
ENZSQGYIFS 
TAQPDVOSGR 
TEVTPHAFTP 
LVIVSALTFZ 
IKBFPKBVAD 
AYDHSRVKLA 
EVZVMZTNLV 
SQKGRP6GRV 
GTYIVLDSML 
liDSHZHAYVN 
ERSRVOISSL 
ZPDGQNMAED 
DGRKFLCSDN 



LDFKAIIDGV 
DTVSISESQL 
CSSBPENVQA 
LGAILNNLLP 
EBEGKDZEEG 
GKGDVPNT6L 
MNLSGTAESL 
6ENPETZTYD 
ESPLQTNYTE 
8SRQQDLVST 
CLVVLV GILI 
LHASSGFTEB 
QLAEKD6KLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQZQHEGTVN 
ALLZPGPAOK 
SGEGTDYINA 
EPVYWPNKDE 
PYAPTRKRKF 



AVFCBVIiTNQ 
DPENYTSLLV 
NMSYVLQZVA 
AZVNPGRDSA 
NSTSQPVTKL 
NTVSITEYBB 
VLZPE8ARNA 
ZRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FBTLKBFYQB 
DYZNANYVDG 
WPADGSEBYG 
DMGVPBYSLP 
IFGFZiKHZRS 
TKLEKQFQLL 
SYZMGYYQSN 
PINCESFKVT 
RGCLPGSQDD 



31 
I 

KLVEEZGWSY 
ENTFZHNTGK 
LEKQZYCF32A 
ALDPFXLUiL 
QSGYVMLMDY 
TWERPRWYD 
ZCTNGLYGKY 
TNQZRKKEPQ 
ATEKDISLTS 
ESLLTSFKIiD 



41 

I 

TGALNQKNWG 
TVBINLTNDY 
DRFSSFEBAV 
LnrSTDKYYZ 
LQNNPREQQY 
TMZEKFAVLY 
SDQLZVDMPT 
ZSTTTHYNRZ 
QTVTBLPPHT 



KSFSAGPVMS 
PVYNEASNSS 
FYIiEDSTSFR 
VQSCTVDLQX 
YNRPKAYZAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRMYLVQTEB 
SQSNZQQSDT 
BFZXTQHPLL 
U4AEEHKCLS 
QSDBARSLC 



ESLKDPSMEQ 
QGPSVTDLEM 
KESRIGIiAEG 
VISTP9TPIF 
TAD88NHFDN 
QGPLK8TAED 
VLAYYTVRNP 
AKRHAVGPW 
QYVFIBDTLV 
SAALXQCaiRB 
BTIKDFWRMI 
NEBKLIIQDF 



51 

I' 

KKYPTCNSPK 
RVSGGVSBMV 
KGKGKLRALS 
YNGSLT8PPC 
KFSRQVF8SY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VBGTSASLND 
PATSAZPFI8 
NVWFPSSTDl 
PHYSTFAYFP 
LESBKKAVIP 
PZSDDVOAIP 
XHRNRYXNIV 
FWRMZWEHNV 
TLRNTKZKK6 
VHCSAGVGRT 
BAIIiSKBTBV 
KNRT68IIEV 
WDHMAQLWM 
ILEATQAWRS 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1359 



Seq ZD HOt C319 Protein Sequence 
Protein Accession fft XP 002914.4 



1 11 21 31 41 51 

I t I I I I 

MKDIDIGKEY ZIPSPGYRSV RERTST8GTH RDREDSKFRR TRPLBOQDAL ETAARABOLS 60 

LDASMHSQLR ILDBBHPXGK YHUGLSALXP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGBLSM EDVWSLSKHB 8SDVNCRRLB RLHQBELNBV GPDAASLRRV VWZFCRTRZ.Z 180 
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LSIVCU1ITQ LAQFSGPAFM VKHXjLBYTQA TBSNLQYSLL LVIiQLLLTEI VRSWSIALTW 240 

A3UIYRTGVRL RGAILTMAFK KILKLKNIKB KSLOELINIC SNDGQRMPEA AAVGSUAGG 300 

PWAILGMIY NVIILGPTGF LGSAVFII*FY PAMMFASRLT AYFRRKCVAA TDERVQKMNB 360 

VLTYIKPIKM YAWVKAFSQS VQKIRBBEBIR XLBKAQYPQS ITVGVAPIW VIASWTPSV 420 

HMTIGPDLTA AQAPTWTVP MSMTPALKVT PFSVKSLSBA SVAVDRFKSL FLMEEVHMIK 4 BO 

UKPASFHIRI BMKKATLAWD SSHSSIQKSP KLTPKMKXDK RASR6KRBKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPB BEBGKHIHLG HLRLQRTLHS IDIiBIQBGKL VGICGSVGSG 600 

KTSLISAXLG QHTLLEGSZA ZSGTFAyVAQ QAWimATLR DNILFGKEYD EERYNSVUIS 660 

CCLRFDLAII* PSSDLTEIGE R6ANLS6GQR QRISLARALY SDR8I YILDD FliSAUaAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVSC DEVIFMKB6C XTER6TBBBL MNLNGDYATI 780 

PNNLLIiGETP PVEIKSKKET 8QSQKKSQDK GPKTOSVKKB KAVKP2EGQL VQLEEKGQGS 84 0 

VPWSVYGVYl QAAGGPLAFL VIMAL.FMLNV GSTAFSTWWL SYWIKQGS6N TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILICA ZRGWFVKGT LRASSRIiHDE LFRRILRSPM 960 

KFFDTTPTGR ZLNRFSKDKD EVDVRLPFQA EMFI(^1VILV FFCVGHXA6V FPWFIiVA.VQP 1020 

LVILFSVLHI V6RVLIRELK RLDNXTQSPF LSBZTSSIQG LATZHAYNKG QEFLHRYQEL 1080 

IiDDNQAPPPXi FTCAMRMIAV RLDLZSXALI TTTGUIIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SBTEARFTSV BRINHYIKTL SLBAPARIKN KAPSPDWPQE GEVTPENAEM 1200 

RYRENLPLVL KKVSFTZKPK EKXQZVGRTQ SGKSSLGMAL FRLVELSGGC ZKXDGVRZSD 1260 

IGLADLRSKL SZZPQBFVLP SGTVRSIiU>P FNQYTEDQZH DALBRTHNRS CZAQIiFLKLE 1320 

SEVMENGDNF SVGERQLLCZ ARALLRHCKZ LZLDBATAAM DTBTDLLZQE TIRBAFADCT 1380 

MLTIAHRLHT VliGSDRXMVL AQGQWEFDT PSVXjLSMDBS RFYAMFAAAB NRVAVKO 1437 

Seq ZD NOs C320 Protein Sequence 
Protein Accession ftt HP_0OS679.1 

I 11 21 31 41 51 

111)11 

MKDIDIGKEY IZFSPGYRSV RERT8TSGTH RDREDSKFRR TRPLBOQDAL ETAARAGOLS 60 

LDASMHSQLR ILDEBHPKGK YHHGliSAUCP ZRTT8KHQBP VDNAGLF8CM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRMiE RWQEEUJEV GPDAASLRRV VWIFCRTRLZ 180 

LSIVCLMZTQ lAGPSQPAFM VKHLLEYTQA TESNLQYSLL LVL6LLLTEI VRSWSLALTW 240 

ALNYRTGVRI* RGAZLTMAPK KILKLKNIKE KSLGBLINIC SNDGQRMPEA AAVGSIiLAGQ 300 

PWAILGMZY NVZIliGPTGF LGSAVFXLFY PAMMFASRLT AYPRRKCVAA TDERVQKMME 360 

VLTYZKFIKM YAWVKAFSQS VQKXREEERR XLEKAOYPQQ ITVGVAPIW VIASWTFSV 420 

HMTLQFDLTA AQAFTWTVP NSMTFALKVT PFSVKSIiSBA SVAVDRFKSIi PIMBEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTERQA 540 

VLAEQKGHLL LDSDERPSPB EEEGKHZHLG HLRLQRTIiHS ZDLEZQBGKL VGZC3GSVGSG 600 

iCTSLISAILQ QMTLLEGSIA ISGTPAYVAQ QAWILNATLR DNILPGKEYD EERYNSVUTS 660 

CdiRPDLAIL PSSDIiTElGE ROANLSQGQR QRZSLARALY SDRS IYILP D PLSAIOAEVG 720 

NHZFNSAIRK HLKSKTVLFV THQLQYLVDC DBVXFMKEGC ZTERGTHEEL MNLNGDYATI 780 

FNNLU/3ETP PVEZHSKKBT SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYOVYZ QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDB LFRRILRSPM 960 

KFFDTTPTCR ILNRPSKDMD EVDVRLPFQA EMPIQNVILV FPCVGMZAGV FPWPLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDMITQSPP LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

UJDNQAPFFL FTGAMRWLAV RLDLISZALI TTT6LMIVIM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SBTEARFTSV BRINHYIKTL SLBAPARIKN KAPSPDWPQE QBVTFENAEM 1200 

RYRENLPLVL KKV8PTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CZAQLPIiKLB 1320 

SEVMEN6DNF 8VGERQLLCI ARALLRHCKI LILDEATAAM DTBTDLLZQE TZREAFADCT 1380 

MLTIAHRLHT VL8SDRXMVL AQQQWEFDT P8VLLSNDS8 RFYAMFAAAB MKVAVICO 1437 

Seq ID NOt C321 Protein Sequence 
Protein Accession #i 1IP_005553.1 

1 11 21 31 41 51 

1111)1 

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RBLHRQTQJO FRCLNCNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPOICNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120 

HMLTDAGCTQ DQRLLDSKCD C3JPAQIAGPC DAGRCVCKPA VTGERCDRCK SGYYNLDGGN 180 

PEGCXQCPCY GH8ASCR88A EY8VHKXTST FHQDVDGWKA VQRNOSPAKL QWSQRHQDVF 240 

SSAQRLDFVY FVAFAXPL6N QQV8YGQSLS FDYRVDRGGR HP8AHDVILE GAGLRITAPL 300 

MPLGKTLPOG LTKTYTFRLN EWPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGBYSTGYI 360 

DNVTLISARP VSGAPAPWVB QCICPVGYKG QFOQDCASQY KRDSARLQPF GTCIPCMGQO 420 

OGACDFDTQD CYS6DENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC 8VMPETEBW 480 

CNNCPPGVTQ ARCBLG3UXIY FGDPFGBHGP VRPGQPOQCN NNVDPSASGN CDRLTGRCLK 540 

CZHNTAGZYC DQCKAGYFCX) PLAPNPADKC RACNCNPMGS BPVGCRSDGT CVCKPGPGGP 600 

NCBHGAFSCP ACYNQVKIQM DQPMQQLQRM EALZSKAQGQ DOWPDTELE GRMQQAEQAL 660 

QDILRDAQIS EGASRSLGLQ UUCVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

RLITQMQLSL AESEASLGNT HIPASDKYVG PNGFKSXiAQB ATRLAESHVE SASNMEQLTR 780 

BTEDYSKQAL SLVRKALHEG VGSGSGSPDO AWQ6LVEKL BRTKSLAQQL TREATQABIB 840 

ADRSYQHSLR LLDSVSRLQG V8DQ8F0VEB AKRIKQKAD8 L8TLVTRHMD EFKRTQKNI^ 900 

NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYBVES ILKNLRBPDL 960 

QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQABRALGSA AADAQRAKNG AGEALEISSB 1020 

lEQBIOSLNL BANVTADGAL AMEKGIASLK SEMREVEGBL BRKELBFDTN MnAVQMVZTB 1080 

AQKVDTRAKN AGVTZQDTLN TLDGLLHLMD QPLSVDBBGL VLLBQKLSRA KTQZNSQLRP 1140 

MMSBLEBRAR QQRGHLHLLE TSZDGZLADV KNLQIIRDNL PFGCYNTQAL EQQ 1193 

Seq ID NOt C322 Protein Sequence 
Protein Accession #: NP_066924.1 

1 11 21 31 41 51 

1 I 1 I I I 

MAHAOLQLLQ FILAFLGWIG AIVSTALPQW RIYSYAGONI VTAQAMYBGL HMSCVSQSTO 60 
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QIQCKVFDSL LNLSSTLQAT RALMWQILL GVIAIFVATV GMKCMKCSjED OEVQKMRMAV 120 
IGGAIFUiAG LAZLVATAWY GKRIVQEFYD PMTFVNARYB FGQAXiFTGHA AASLCLXiOaA 180 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 211 



8eq ID NOt C323 Protein Sequence 
Protein Accession fit AAM77876 



1 11 21 31 41 51 

MSSWZRWHGP AMARLKGFOH LWGFWRAAP ACPT8CKC8A 8RIWCSDPSP GXVAPPRLEP 60 

NSVDPENITB IPIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKPVAHKA PLKNSNLQHI 120 

NPTRNKLTSL SRKHFRHLDI* SELILVGNPP TCSCDIMHIK TLQBAKSSPD TQDLYCLNES 180 

SKNIPLANIfQ IFKC6LPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYU 0VGNLV8KHN 240 

NETSHTQ6SL RITNI8SDD8 GXQZSCVAEH LVGBDQDSVN IiTVBFAPTIT PLBSPTSDKH 300 

WCIPPTVKGN PKPALQWFYN GAZLNESKyX CTKIHV7MHT BYKGCLQLDII PTBKNNGDYT 360 

LXARNBY6KD EKQZ8AHFMG HPGZDDGANP NypOVZYBDY GTAANDZGDT TNRSNEZPST 420 

DVTSKTGREH I*SVYAVWIA SWOFCLLVM LFIiUOARHS KFCMKGFVLF HKIPLDG 477 



Seg ID NO: C324 Protein Sequence 
Protein Accession it !IP_006171.1 

I 11 21 31 41 51 

I I I I I I 

MSSHZRHHGP AMARUH6FCW LWOFHRAAF ACPT8CKCSA SRZWC8DP8P GZVAFFRIiEP 60 

NSVDPENZTB IPIAMQKRLE ZINEDDVEAY VGIjRHLTZVD SGLKPVAHKA FZjKNSNLQHZ 120 

NPTRMKLTSL SRKHFRHLDL SELXIiVGNPP TCSCDIMWIK TLQBAKSSPD TQDLYCIJIES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AODPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS 6KQISCVAEN LVGBDQDSVN LTVHFAPTXT PLESPTSDHH 300 

HCIPFrVKKa? PKPALQWFYN GAZLNESKYX CTKXHV1NHT EYHGCLQLDN PTHMNNGDYT 360 

LXAXNEYGXD EKQI8AHPMG WP6IDDGANP NYPDVIYEDY GTAANDXGOT TNRSNBXPST 420 

DVTDKTGREH LSVYAWVIA SWGPCLLVM LPLLKLARHS KPGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV lENPQYPGIT NSQLKPDTFV QHIKRHNIVL S40 

KRELGEGAFG KVFLAECYNL CPEQOKXLVA VKTLKDASDN ARKDFHREAB LLTNLQHBBI 600 

VXFYGVCVEG DPLXNVFEYM KHGDUnCFLR AHQPOAVLMA BQNPPTELTQ SC2NLRXAQQX 660 

AAGMVYLASQ HFVHRDLATR NCLVQSNIiLV KKTOFGMSRD VYSTDYYRVG GHIMLPIRNM 720 

PPESXMYRKF TTESOVNSLG WLNBIFTYG KQPWYQLSNN BVIECITQGR VLQRPRTCPQ 780 

BVYELMLGCW QREPHMRKHX KGXBTLLQNL AKASPVYLDI LQ 822 



Seq ID NOs C325 Protein Sequence 
Protein Accession ftt Eos sequence 



1 11 21 31 41 51 

I I I I I i 

MSSWXRWHGP AMARLHGFCH LWGFHRAAF ACPT8CKCSA 8RXWCSDPSP OXVAFPRLBP 60 

NSVDPEMITE IPIANQKRLE IINEDDVEAY VQLRNLTXVD SGLKFVAHXA PLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPP TCSCDIHWIK TLQBAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN liAAPNLTVEE GKSITLSCSV ACaPVPNKYM DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGBDQDSVN LTVHFAPTXT PLESPTSDHH 300 

WCIPFTtnCGN PKPALQHFYN GAXUIESKYI CTKXHVTNBT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD BKQXSABPMG HPGXDD6ANP NYPDVXYBDY GTAANDXGDT TNRSNBXPST 420 

DVTDKTGREH LSVYAWVIA SWGPCLLVM LPLLKLARHS KPGMKQFVLP HKXPLDQ 477 

Seq ID NO: C326 Protein Sequence 
Protein Accession (ft NP_570843.1 

1 11 21 31 41 51 

I I I I I 1 

MPLKHYLLLL VGOQAWGAOL AYHGCPSECT CSRASQVBCT GARIVAVPTP LFWNAM8LQI 60 

LNTHITEUIB SPFLNISALX ALRXEKNBLS RXTPGAFRNL GSLRYLSLAN NKUJVLPXGL 120 

FQGLDSLESL LLSSNQLLQX QPAHPSQCSN LKBLQLHOm LBYXPDGAPD HLVGLTKLNL 180 

GKNSLTHXSP RVFQHLGNLQ VLRLYENRLT DIPM6TFDGL VNLQELALQQ NQIGLLSPGL 240 

FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLQIFO PMFNLRELWL 300 

YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFMGL TELRELSLHT NALQDLDGNV 360 

FRMLANLQNX SLQNNRLRQL PGNXPAKVNQ LMAIQLQNNQ LENLPLGXFD HIAKLCBIiRL 420 

YDNPWRCDSD XLPLRNHLLL NQPRLGTDTV FVCFSPANVR GQSLXXXNVN VAVPSVRVPB 480 

VPSYPETPWY PDTPSYPDTT SVSSTTBLTS PVBDYTDLTT XQVTDDRSVW GMTQAQSGLA 540 

ZAAXVI6IVA LAC8LAACVG CCCCKKRSQA VLMQMKAPNE C 581 



Seq XD NO: C327 Protein Sequence 
Protein Accession #t NP_002649.1 

1 11 21 31 41 51 

i I I i I I 

MRALLARLLL CVLWSDSKG SNELHQVPSN CDCLNQOTCV SNKYPSNIHW CNCPKKPGOQ 60 

HCEIDKSKTC YBGNGHPYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPiaiRRR FNCYVQVGLK PLVQBCMVHD CADGKKPSSP PBELKPQOGQ KTLRPRPKII 180 

G6EFTTIENQ PWFAAXYRRH RGGSVTYVCQ QSLISPCWVI SATHCFIDYP KKBDYIVYLQ 240 

RSRLNSNTQG EMKPBVBNLI LHKDYSADTL AHHNDIALLK XRSKEGRCAQ PSRTIQTXCL 300 

PSMYNDPQFG TSCBITGPGK ENSTDYLYPB QLKMTWKLX SHREOQQPHY YOSEVTTKML 360 

CAADPQWKTD SOQGDSGQPL VCSLQGRMTL TGIVSWORGC ALKDKPGVYT BVSHFLPWXR 420 

8HTKBBNGLA L 431 



Seq XD NO: C326 Protein Sequence 
Protein Accession «i XP_0872S4.1 

1359 



wo 03/042661 



1 
I 

MQFRECSING 
NETELIKEHD 
AAARIGIVPI 
AESSILPKCI 
REEKIiAAVFQ 
SLSCOHFHRT 
BKLFMBVCRN 
GIMGKEGRQA 
QFYCLPSQQT 
SI KTFLYWTI 
MALETHFNTW 
ILMWTCLFL 
IiERVIGRCSP 



11 
I 

HKYQEINGRL 
LFFKAVSLCH 
QISSETNEVK 
GGEIEKTRIH 
FIEXDLILIX3 
MNILBLINQK 
C8AVLCCRMA 
ARNSOYAIAR 
LYDSVYLTLY 
LGFSHAFZFF 
IKBLVTHOSI 
DIIKKVFDRH 
THISRSWSAS 



21 

1 

VPE6FTFDSS 
TVQISNVQTD 
TLGKLERYKL 
VDBFALKGIiR 
ATAVEDRLQD 
SDSBCABQUl 
PLQKAKVIRL 
FKFLSKUiFV 
NICFTSLPIL 
F6SYLLIGKD 
IFYFVPSLFY 
LHFTSTEKAQ 
DPPYTNDRSI 



31 
I 

EGNLSYLSSXi 
CTGDGFWQSN 
LHILEFDSDR 
TLCIAYRKFT 
KVRETIBALR 
QLARRITBDH 
IKISPEXPZT 
HGHFYYIRIA 
lYSLLEQHVD 
TSLLGNGQMP 
GGIIiWPFLGS 
LTETMAOIKC 
LTLSTMDSST 



41 
I 



lAPSQLBYYA 
RRMSVIVQAP 
SKBYEBIDKR 
MAGIKVWVL7 
VIQKGLWDG 
lAVGDQANDV 
TLVQYFFYKN 
PHVLQNKPTL 
GNVJTFGTLVP 
QNMYFVFIQL 
LDSKCCPPBG 
C 



51 
I 

T8SSFRTSPE 
SSFDEKALVE 
SGEKLLFAKG 
ZPEARTALQQ 
GDRKETAV5V 
TSLSUOiREK 
8MZQBARVGI 
VCFITPQFLY 
YRDISKNRLL 
TVMVITVTVK 
LSSGSAWFAI 
EAACA8VGRM 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

761 



Seq ID NOt C329 Protein Sequence 
Protein Accession XP_0B746l.l 

1 11 21 31 41 SI 

I i i 1 ) I 

MliPIiLAALLA AACPLPFVRG GAADAPGLIiG VPSNASVNAS SAASPSPRGC WPRRPPGPPS 
ARARRRRRRR RRLQflSVQR QMLSSIiLVRW GRPRGFQCSL LLPSTNABGR AFFAAAFBRV 
GPPUiXBHIiO XiAAGGAQQDL RIiCVGGGHVR GRRTGRURPA AAPSAAAATA GAFTALPAYP 
AAEPPOPLWL QGEPUIPCCL DFSliESLQGE FGNRLNRICPX BSTLVACFMT LVIWW8VAA 
liXWPVPZIAO FLPNOMEQRR TTASTTAATP AAVPA6TTAA AAAAAAAAAA AVTSGVATK 

Sag ID NO: C330 Protein Sequence 
Protein Accession #t XP 051522.2 



60 

120 

180 

240 

299 



1 11 21 

I I 1 

MDLHLPDYSE PGKFSDISWP OISSDCIWD 
ANSWVWVNI QAKTTGYDTH CYILNIiAlAD 
HLIFSINLFG SIFFLTCMSV DRYLSITYPT 
YYLKTVTSAS NNBTYCRSFY PEHSIKEWLI 
ASSDQEXHSS RKIIFSYWV FLVCWI*PYHV 
QCLSLVHOCV KFVLYSPINR MYRYBLMKAF 
TK 



31 
I * 

TVHCFNMPNK 
LWWI.TIPVW 
NTP8SRKKMV 
GMEXiVSWLG 
AVLLDIFSXL 
IPKYSAXTQL 



41 
I 

SVLLYTLSFI 
WSLVQHNQW 
RRWCILVWL 
FAVPFSIIAV 
HYIPPTCRI.E 
TKLIDASRVS 



51 

1 

YXFIFVIGMI 
PMGELTCKVT 
LAFCVSLPDT 
FYFLIjARAIS 
HALFTALHVT 
STEYSALBQS 



60 

120 

180 

240 

300 

360 

362 



Seq ID NOi C331 Protein Sequence 
Protein Accession #: NP_000341.1 



1 

MGFVRQIQLIi 
MPSAGMLPWL 
HLGRIHTBLH 
LINSQVRPBQ 
EDTIiYANVDF 
RPUtQNQOTE 
SYDHRTT8FC 
LEHVRKLVKA 
FItYKGPRESQ 



11 



PRYIWOGFAY 

LAWIYSVSMT 
HGRILHYSDP 
AWQDRKTABL 
QMMLLDAACY 
TEETEDPBHP 
FLGHNQAGKT 
AEHMLFYAQL 
KWILDEPTS 
, SGTPI^LKNC 

SF6ISDTPLE 
SPGAPAAHPE 
ATFVFIiALML 
GNRCLKEGWL 
ECPEQAGOLP 
SIGGKLPWP 
NKGWHALVSF 
WAICVIPSM 
LWGZFIGFQ 
FIGINSSAZT 
FGBBKSANPF 
VABERQRIXT 
TFKMLTGDTT 
GVPABEIBKV 
KDPQARRMLW 
FGOGYIVTMK 
WjIiSHKUSItTf 



LWKNWTIiRKR 
QOIFCNVNNP 
ILSQFMDTLR 
FAUGVPDIiAL 
FKLFRVLPTL 
TFTKLMQILS 
KALIQSLESN 
WEEVOPQIWY 
ADDMANFDWR 
WFPDMYPHT 
liQDMVBQGIT 
VKSIVIiEKBL 
FILFLFLLAF 
KKAVSIiLSFV 
GLliANYLDQV 
EGIHDSPFBR 
TTLSILTGLIi 
KGKSQEBAQIi 
GVDPYSRR8I 
FGTGLYLTLV 
WLHHVPEAK 
EIFLKVTEDS 
GQPPPEPECP 
SIVXLPFGEY 
PEYPGGK8TP 
PFQRTQRSTE 
ITGBALVGPL 
lillVAHNAILR 
SFVPASFVLY 
KXAYTSPEHL 
FILBLFDMNR 
HWDLIGXNLF 
GGMXTDILRL 
VTSGDATVAG 
ANWSIKSLGIi 
NVZVSIIRKG 
IKSPKDDLLP 
IBBYSVTQTT 



21 
I 

QKIRFWELV 
CPQSPTPGES 
THPERIAGRQ 
KDIACSEALL 
U>SRSQGINl4 
DLLOGYFEGG 
PLTKIAHRAA 
FFDNSTQMNM 
DIFNITDRTL 
SSLPPHVKYK 
R8QVQAEAPV 
RLKETLKNQ6 
STATIMIiCPL 
AFQPGTBYIiV 
FFGDYGTPLP 
EHPGWVP6VC 
PPTSGTVLVG 
EMEAMXjEDTG 
WDLLLKYRSG 
RKMKNIQSQR 
LVECIGQELI 
DSQPLFAGGA 
GPQLNTGTQL 
PALTLHPWXY 
NKTPSVSFNX 
XLQDLTDRNZ 
SDLGRIMNV8 
ASLPKDRSPB 
LIQERVNKSK 
PALVAUiZjLY 
TLLRFMAVIiR 
AMWEGWYP 
HELTKIYI/ST 
KSILTNISEV 
TVYADCLA6T 
RAWLT8H8M 
DUIPVBOFFQ 
LDQfVFVNFAK 



31 
I 

WPLSLFLVLI 
PGIVSNYKNS 
IRIRDIIiKDB 
ERFIIF8QRR 
RSV«GGIIiSDM 
GSRVLSFNHY 
KPLLMGKILY 
IRDTLGNPTV 
RItVNQYLECL 
ZRMDIDWEK 
GIYLQQMFYP 
VSNAVIWCTW 
LSTFFSKASL 
RFEEQGXiGLQ 
WYFIiLQSSYH 
VKNLVKIFEP 
GRDIETSLDA 
LEHKRNEBAQ 
RTIIMPTHHM 



41 



FUjPNKNPKH 
QQKRENVNPR 
VLQHVQALLV 
GQQYTFFSMD 
TQLFQKQKNT 
SDPLVKTYPA 
GGPITRBASK 
BYGITVISQP 
HIiQFISGVSP 
OWAV IPMMY P 
KLLIVFFHPC 
liLTLLVQRHP 
SSPAVDRLCV 
HQNMGYCPQF 



EECEALCTRIi 
GNPP6SVQRB 
QQTBSiasiiPL 



WXJOIANPLYS 
IIiARVYRDFQ 
ETliTLPLIKN 
GAKTVRYALC 
SPRIQEFIHR 
EDNNYKAFLG 
TPDSPAARRI 
KDFLNRQIiGE 
VLDKFESYND 
TNKIKDRYWD 
CFVDDSFMII 
FLDSFSIMSM 
AAACSGVIYF 
WSNIGNSPTB 
LS6EGCSTRE 
GGRPAVDRLN 
VRQ8LGMCPQ 
DLSGGNQRiOi 
DEADHQGDRI 
SKOFSTTCPA 
RAYA8LFREL 
HPdiGPREKA 
KRFQHTIR8H 
EP6SBQFTVL 
QVNPSPSCRC 
LIRSSUCSKF 
BIPDFXiKHLE 
LNLTREQLSE 
TTYWVTNPLW 
ASFLFDVPST 
LGRGLXDLAL 
PLSQWIAEPT 
GVRPGBCPGL 
DAIOEUiTGR 
TAIAI«IGCPP 
AIMVKOAFRC 
RHYNMLQFQV 
HPRAAOASRO 



51 

I 

HHECHFPNKA 
BLLMNAPESQ 
XGLSD8WYL 
SLSQGTIjQWI 
PSMQDLIiWVT 
IDSTRKDPIY 
LXNANSTFEE 
EGITAEAILK 
ETQLTQRALS 
8GPRADFVED 
LNRCFPIFMV 
SIFLLTIFIM 
TLYIiPHILCF 
GDBFSFLLSM 
ERALEKTEPL 
ITFYBNQITA 
HNILFHHLTV 
SVAIAFVGDA 
AIIAQGRLYC 
HVDDLTPBQV 
EETIiADLGLS 
GQTPQDSNVC 
KDFIiAQIVLP 
ADVLUTKPGF 
STREKLTMLP 
WVNBQRYGGI 
TEDNXKVWFM 
ITVLTTSVDA 
DIMNYSVSAG 
AYVALSCANIj 
SQAVTDVYAR 
KSPIVDEDDD 
LGVNGAGKT7 
EHLYLYARLR 
LVliLDBPTTG 
MOTIQHLKSK 
SSSSLARIFQ 
AQD 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1600 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2273 
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Seq ID NO: C332 Protein Sequence 
Protein Accession ft: NP_006662.2 

I 11 21 31 41 51 

I i I i I i 

MVPHAILARO RDVCRRNGLL ILSVLSVIVG CLLGFPLRTR RI*SPQBISYP QFPGELIJ4RM 60 

UOWILPLW SSUteGLASL DAKTSSRLGV LTVAYYLVfTT FMAVIVGIPM VSIIHPGSAA 120 

QKBTTEQSQK PIMSSADALL DLIRNMPPAN liVEATPKQYR TICTTPWKSP RVAPGEAPPR 180 

RILXYGVQBB NOSHVOMPAL DLTPPPBWY KSBPGTSlXm HVL3IVFFSA TMOIMLGRMQ 240 

DSGAPIiVSPC QCLNESVHKI VAVAVWYFPP QIVFLIAGKI LEMDDPRAVG KKI/3FYSVTV 300 

VCXHjVXjHQLF IliPWiYPPIT KKNPIVPIRG ILQALLIALA TSSSSATLPI TFKCLLENNH 360 

IDRRXARPVL PVGATINMDG TALYEAVAAI FIAQVNNYBL DFGQIITISI TATAASIGAA 420 

GIPQAGbVTN VIVLTSVGLP TODITbllAV DWAIJ)RFRTM mVLGDALAA 6IMAHICRKD 480 

FARDT6TEKL LFCETKFV8L QEIVAAQQHG CVKSVABASB LTLGFTCPHH VFVOVBRDEB 540 

LPAASUJHCT IQISELBTNV 560 



Seq ZD NO: C333 Protein Sequence 
Protein Accession ft: NP_00568D.l 

1 11 21 31 41 51 

i I 1 I I I 

MVTVGNYCBA EGPVGPAWMQ DGLSPCFFFT LVPSTRMALG TXjALVLALPC RRRERPAGAD 60 

SliSWGAGPRI SPYVLQIiUA TLQAALPLAG LAGRVGTARG APLPSYLIiLA SVLESLAOAC 120 

GLMLLWERS QARQRIiAMGI WIKPHHSPGI* I*IiLWTVAPAA EHIALVSHMS PQWWWARADIi 180 

GQQVQPSLWV LRYWSGGLP VI/3LWAPaiiR PQSVTLQVHB EDQDVERSQV RSAAQQSTWR 240 

DFGRKUUiLS GYLWPRGSPA LQLWLICLa LMGLERALNV LVPIFYRNIV NLLTEKAPMN 300 

SLAWTVTSYV FLKFLQGGGT GSTGPVSNLR TPLWIRVQQF TSRRVELLIF SHLHELSLRW 360 

HLGRRTGBVL RIADRGTSSV TGLLSYLVPM VIPTLADZII OIXYFSMPFN AWPOLIVPLC 420 

MSLYLTLTIV VTEWRTKFRR AMNTQEIIATR ARAVDSLLKP BTVKraJAES YEVERYRBAI 480 

IKYQQLEWKS SASLVLLNQT QNIiVIGIiGLL AGSLLCAYFV TEQKLQVGDY VLFGTYIIQL 540 

YMPIiNWFGTY YRMIQTNFID MENMFDLLKE ETEVKDLPGA GPLRPQKGRI EFENVHPSYA 600 

DGRBTLQDVS FTVMPGQTIiA LVGPSGAGKS TILRLLFRFY DI8SGCIRID GQDISQVTQA 660 

SLRSHIGWP QDTVIiPNDTI ADNIRYGRVT AGNDEVBAAA QAAGIHDAIM APPBGYRTQV 720 

GERGLKLSGG EKQRVAIART ILKAPGIZUi DBAT8ALDTS NERAIQASLA RVCANRTTIV 780 

VAHRI^TWH ADQILVIKDG CIVERGRHEA LLSRGGVYAD MNQIiQQGQEB TSEDTKFQTM 840 



Seq ID NOt C334 Protein Sequence 
Protein Accession ft: NP_000667.1 

1 11 21 31 41 51 

1 ) I I i I 

MLLETQDALY VALELVIAAL SVAGNVLVCA AVGTANTLQT PTNYFLVSIA AADVAVGLFA 60 

IPFAITISW3 PCTDPYOCLP LACFVLVLTQ SSIFSLLAVA VDRYLAICVP LRYKSLVTGT 120 

RARGVZAVLW VIAPGIGLTP FLGWNSKDSA TNNCTEPWDG TTNBSCCLVK CLFENWPMS 180 

YMVYPHPFGC VLPPLLXMLV lYIKIFLVAC RQLQRTELMD HSRTTLQRBI HAAKSLAMIV 240 

GIPALCWLPV HAWCVTLFQ PAQGKNKPKW AMNMAILLSH ANSWNPIVY AYRNRDFRYT 300 

FHKIISRYUi OQADVKSGNG QAGVQPAU3V GL 332 



Seq ID NO I C335 Protein Sequence 
Protein Accession ft: NP_443l64 

1 11 21 31 41 51 

I I i I I I 

MGU3ARGAHA ALLXiQTLQVIi AW3AAHBSA AMABTLQHVP SDHTNETSNS TVKPPTSVAS 60 

DSSNTTVTTM KPTAASMTTT PGMVSTNMTS TTLKSTPKTT SVSQNTSQIS TSTMTVTHNS 120 

8VT8AAS8VT ITTTMH8EAK XGSKFDTOSF VQQZVLTLOV LSXXiYZGCKM YY6RRGIRYR 180 

TXOBHDAZZ 

Seq ID NOt C336 Protein Sequence 
Protein Accession #t 1IP^004186.1 

1 11 21 31 41 51 

I I I I I t 

HftQHGAMGAF RALOQXiALLC ALSIiGQRPTG GPGCGPGRLL I/STGTDARCC RVHTTRCCRD 60 

ypGBECCSEH DCMCVQPEPH CXSDPCCTTCR HHPCPPGQGV QSQGKP8FGF QCXDCASGTF 120 

SGGHEGHCKP NTDCXQFGFL TVFPQNKTBH AVCVPGSFPA EPIiGWLTWL LAVAACVLLL 180 

TSAQLQLHXW QLRSQCMVfPR ETQLLLBVPP 8TBDARSGQF PEBESRGBR8A BEKGRI43DtM 240 

V 241 



Seq ID HO: C337 Protein Sequence 
Protein Accession #: BAC03767.1 



1 11 21 31 41 51 

I I I I I i 

KGOXSRVSGL lARNLQFTLT YHSVFFSFGL CIAPliGPTIiL DLROQTHSSL PQISWVFF8Q 60 

QLCLLLGSAL GOVPKRTIAQ SLWALPTSSL AISLVFAVIP FCRDVKVLAS VMALAOLAMG 120 

CIDTVANMQL VRMYQKDSAV FLQVLHFFVG FGALLSPblA DPPLSSANCL PANSTANTTS 180 

RGHLFHVSRV LGQHHVDAKP WSNQTPPGLT PKDGAGTRVS YAFWIMALID LPVPMAVLML 240 

LSXBRLLTCC PQRRPLLLSA DELALETQPP EKEDASSLPP KFQSKLGHB3 LPSCOQRKNL 300 

RGAPYSFFAX HITt3AIiVI.FH TDGLTGAYSA PVYSYAVBKP LSVGHKVAGY LPSLPWGPXT 360 

ZiGRLLSXPXS SRMKPATMVP IHWGVWTP LVLLIF8YNV VFLFVGTASI. OLFLSSTFPS 420 

MLAYTEDSLQ YKGCATTVLV TGAGVOEMVL QMLVGSXFQA QGSYSFLVCO VIPGCLAPTF 480 
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YILLLPPHRM HPGLPSVPTQ DRSIGMEKSE CYQR 514 

Seq ID NO: C338 Protein Sequence 
^ Protein Accession ftt NP_002194.1 

1 11 21 31 41 51 

1 1 1 I I i 

K6PBRTGAAP LPbLLVLALS QGILNCCLAY NVGLFBAKIP SGPSSEQFOY AVQQFINPKG 60 

^ NHLLVGSPNS GFPEMRMGDV YKCFVDLSTA TCBIQUILQTS TSIPNVTEMK TNMSLGIiXLT 120 

!0 RMMGTG6Pr.T CQPLMAQQCXS MQYYTTGVCS DISPDFQLSA SFSPATQPCP S1.IDWWC3) 180 

BSNSIYPWDA VKMFIiBKFVQ GLDIGPTKTQ VGIiIQYANNP RWFNLNTyK TKEEMIVATS 240 

QTSQYGGDLT NTPQAIQYAR KYAYSAASGG RRSATKVMW VTDGESHDGS MLKAVIDQCN 300 

HDKILRF6IA VLGYUIRNAI« OTKNLIKBIK AIASIPTBRY FfMVSDBAAX. LEKAGTLGEQ 360 

ZF8IEGTVQG 6DHFQMEMSQ VGFSADYSSQ NDILMIiGAVQ AFGH8GTZVQ KT8HGHLXPP 420 

IS KQAFDQXLQD RMBSSYLGYS VAAISTGEST HFVAGAPRAH YTGQIVLYSV NENGNITVZQ 480 

AHRGDQIGSY PGSVLCSVDV DKDTITDVLL VGAPMYMSDL KKEEXSRVYLP TIKKGIliGQH 540 

QFLEOPBGIE NTRFQSAIAA LSDINMDGFM DVIVGSPIiEH QNSGAVYIYN GHQQTIRTKY 600 

SQKILGSDGA FRSHLQYFGR SLDGYGDLNG DSITDVSIGA FGQWQLHSQ 8IADVAIBAS 660 

- FTPBKZTltVN KNAQIZLKLC FSAKFRPTKQ NNQVAZVYNI TLDADGFSSR VTSRGLFXEN 720 

20 MERCLQKNMV VNQAQSCPER IIYZQEP8DV WSLDIAVDZ SliQTPOTSPA LEAYSETAKV 780 

PSIPPHKDCG EDGLCISDLV LDVRQIPAAQ BQPFIVSNQN KRLTPSVTLK NKRESAYNTG 840 

IWDFSENLP FASFSLPVDG TBVTCQVAAS QKSVACDVGY PALKREQQVT FTZNFDFNLQ 900 

NLQKQASLSF QALSBSQEEN KADNIiVMLKI PLLYDABIHL TRSTNZNPYB ZSSDGNVP8I 960 

VHSFBDVGPK FIPSLKVTTO SVPVSMATVl IBIPQYTKEK NPLMYLTGVQ TPKA GPXS QT 1020 

25 ADZNFLKZ6Q TSSSVSFKSE HPRHTKEZ^ RTASCSNVTC fUiKDVKNKGE YPVNVTTRZW 1080 

NGTFASSTFQ TVQLTAAAEZ NTYNPEZYVZ BDMTVTZPIM XMKPDBKAEV PTGVXXGSXZ 1140 

AGZLLUiALV AZLNKLGFFK RKYEXMTIQIP DBXDBTTELS 8 1161 

_ _ Seq ID NOt C339 Protein Sequence 
30 Protein Accession ftt NP_I13648.1 

1 11 21 31 41 51 

i I I I I I 

MYRPRARAAP EQRVR6CAVP STVLUjLAYL AYLALGTGVF WTLEGRAAQD SSRSFQRDKH 60 

ELXiQNFTCLO RPALDSIiZRO WQAYKNGA8 LLSNTTSMGR WBLVGSFFPS VSTXTTXGYG 120 

35 NLSPNTMAAR LFCXFFALVG IPLMLWINR LOHLMQQGVN HWASRLGGTW QDPDKARWLA 180 

GSGALLSGIjL LFLLLPPLLF SHMBGWSYTE GFYFAFZTLS TVGF(a>YVXG HNPSQRYPLM 240 

YKNMVSLMIL FGMAWLAIiII KLII*SQIjETP GRVCSCCHHS 8KS>FKSQSH RQGPDRBPES 300 

HSPQQGCYPE GPMQIZQHLE PSAHAAGCGK DS 332 

40 Seq ID KOi C340 Protein Sequence 
Protein Accession #: NP_o 04145.1 

1 11 21 31 41 51 

^^1 I I 1 I I 

45 MEWDNGTGQA LGLPPTTCVY RENFKQLUjP PVYSAVLAAG LPLMICVITQ ICTSRRALTR 60 

TAVYTLNLAL ADLLYACSLP LLIYNYAQGD HWPFGDPACR LVRPLFYANL HGSXIiPLTCI 120 

SFQRYLGICH PLAPWHKRGG RRAAWLVCVA VWLAVTTQd* PTAIFAATGI QRNRTVCYDL 180 

SPPALATHYM PYGMALTVIG FLLPPAALLA CYCLLACRLC RQDGPABPVA QBRRGKAARM 240 

AWVAAAPAI SPLPFHITKT AYIAVRSTPG VPCTVUSAPA AAYKGTRPFA SAMSVU3PIL 300 

50 FYFTQKKFRR RPHELLQKLT AKWQRQGR 328 

Seq ID NO: C341 Protein Sequence 
Protein Accession #t NP_009128.1 

55 1 11 21 31 41 51 

1 I I I I 1 

MQRPGPRLWL VLQVMQSCAA ISSMDMBRPG DGKOQPIEIP MCKDIGYNMT RMPNLKGHEN 60 

QREAAIQLHE FAPLVEYGCH GHLRFFLCSL YAPMCTBQVS TPIPACRVMC EQARLKCSPI 120 

MEQFNFKWPD SLDCRXIiPNK NDPNYLCMBA PNNGSDBPTR 68GLFFPLFR FQRPHSAQBB 180 

60 4 PLKDGGPGRG GCDNPGKPHH VEKSASCAPL CTPGVDVYHS REDKRFAWW LAIWAVLCPF 240 

SSAFTVLTFL IDPARFRYPB RPIIFI*SMCY CSnfSVQYLIR LPAGAESIAC DRDSGQLYVI 300 

QBGLESTGCT LVFLVLYYPO MASSLWWWL TLTWFLAAGK KWGHEAIEAN SSYFHIiAAWA 360 

IPAVKTIIiIL VMRRVAGDEL TGVCYVGSMD VNALTGFNOil PIiACYLVIGT 6PILSGFVAL 420 

FHIRRVMKTG GBNTDKLBKL MVRIGLFSVL YTVPATCVIA CYFYERLNHD YWKILAAQHK 480 

65 CKKNNQTKTL DCLMAASXPA VEZFMVKXFM LLWGITSGM WINTSKTIiQS WQQVCSRRLK 540 

KK8RRKPASV XTSGaiYKKA QBPQKIBHGK YEXPAQ8PTC V 581 



70 



Seq ID NO I C342 Protein Sequence 
Protein Accession #i NP_00S752.1 



I 11 21 31 41 51 

111)11 

MEVSRRKAPP RPPRPAAPLP LLAYUiAIAA PGRGADBPVW RSEQAIGAIA ASQBDGVFVA 60 

SGSCLDQLDY SLEHSLSRLY RDQAGNCTEP VSIiAPPARPR PGSSFSKIiLL PYREGAAGLG 120 

75 GLLLTGWTFD RGACEVRPLG NLSRNSLRNG TBWSCHPQG STAGWYRAG RNNRWYLAVA 180 

ATYVLPEPST ASRCNPAASO BDTAZALKDT BGRSZiATQEL GRLKLCBQAG SLHFVDAPLW 240 

MG8XYFPYYP YWYTSQAATG WPSMARIAQS TBVLFQGQAS LDCGHGHPDG RRIiLLSSSLV 300 

EALDVWAQVF SAAAGEGQBR RSPTTTAI^CL FRMSBIQARA KRVSWDFKTA ESHCKEGDQP 360 

BRVQPIASST LIHSDLTSVY GTWMNRTVL FLGTGDGQLL KVILQEHLTS NCPEVIYEIK 420 

80 EBTPVFYKLV PDPVKNIYIY LTAGKBVRRI RVANOIKHKS CSECLTATDP HCGWCHSUJR 480 

CTPQGDCVHS WUOmjDie SGAKKCPKIQ IIRSSKEKTT VTMVGSFSPR HSKCMVKNVD 540 

8SRBLCQNKS QPNRTCTC8I PTRATYKDVS WNVMFSFGS WNLSDRPNFT NCSSLKECPA 600 

CVETGCAWCK 8ARRCXUPFT ACOPSDYERN QBQCPVAVEK TSGGGRPKEN KGNR1NQALQ 660 

VFYXKSZBPQ KVSTLOKSNV IVTGANPTRA SNITMZIiKST STCDKDVZQV 8HVLND1IIMK 720 
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FSLPSSRKEM KDVCIQFDGO NCSSVGSL8Y 
VIDMLIISHE LKGNINVSEY CVATYCGFLA 
REDPRPTGYR VBSBVDTELB VKIQKENDNP 
DLTTILCKIK GZXTASTIAN SSKKVRVKI/3 
5 AAVGVTHHKS KELSRKQSQQ LELLBSBLRR 
FALRTFFPE9 GOFTHIPTED MHMRDANDKN 
VKDRCLPASP LTIALQTKLV YLTSILBVLT 
MSVCLSQPLR BTVGBPPYLL VTTIiMQKINK 
LNWFBKIPB NESADVOtNI SVNVUIGDTI 

10 QMGTHQXBLL DZDSSSVILE DGITKLNTIG 
HLILFDSBAP QDVQGKRHRG KRKPKVKEMY 
PFAIKYFPDP LDAQAENKKI TDPDWHIWC 
CLSVIAQAFM OAFSLTBQQL GKEAPTNKLL 
EEPLTQB8KK KENEFNEEVA LTBiyKYZVR 

IS BKKKCKHM 



lALPHCSLIP PATTWISGGQ NITMMORHFD 780 

PSLKSSKVRT NVTVKLRVQD TYLDOOIW 840 

NISKKDIBIT LFHQEMGQUI CSPSIITRMQ 900 

NIiBLYVEQBS VPSTWYPLIV LPVLLVIVIF 960 

BIRIXjPABIiQ MDKLDWDSF GlVPFtiDYKH 1020 

BSLTALDAI*! CNKSFLVTVI HTLEKQKNFS 1080 

RDLMBQC8NM QPKLMLRRTB SWEKLLTKW 1140 

GPVDVITCKA LVTLNEDWLL WQVPEFSTVA 1200 

GQARBKIFQA PLSKNGSPYG LQLNEIGLBL 1260 

HYBISNGSTI KVFKKIANFT SDVBYSDDHC 1320 

LTKLLSTKVA IHSVLBKLFR SIWSLPHSRA 1380 

TNSLPLRFWV NIWCNPQFVF DHOCTPHIDG 1440 

YAKDIPTYKE EVKSYYKAIR DLPPLSSSEM ISOO 

YFDBIIiinaiB RBRGLBEAQK QLLHVRVLFD 1560 

1568 



Seq ID KO: C343 Protein Sequence 
Protein Acceosion tt: NP_002176.1 

20 1 11 21 31 41 SI 

MTILGTTPGM VFSLLQWSG BSQYAQNGDL BDAELDDYSP SCYSQt^BVNG 8QH3LTCAPB 60 

DPDVNTTNLE PEIOGALVBV KCLNFRKLQB lYFIBTKKPL LIGKBlHCVK VGBKSLTCKK 120 

IDLTTXVKFE AFFDX»8VZYR EGAMDFWTF NTSHLQKKyV KVLMHDVAYR QBKDENKHTH 180 

25 VHLSSTKLTL LQRKLQPAAM YBZRVRSZPD HYFKGFWSEW SPSYYPRTPE INMSSGEMDP 240 

ILLTISILSP FSVALLVILA CVIiWKKRIKP IVWPSLPDHK KTLEHLCKKP RKNLNVSFNP 300 

ESPLDOQIHR VDDIQARDBV EGPIiQDTFPQ QLEESEKQRI* GGDVQSPKCP SEDVWTPBS 360 

FGRD8SLTCL AGNVSACDAP ILSSSRSLDC RESGKNQPHV YQDLLLSIiGT TMSTLPPPFS 420 

bOSGILTLNP VAQGQPILTS LQSNQEEAYV TMSBFYQHQ 459 

30 

Seq ID KOt C344 Protein Sequence 
Protein Accession »! MP_002713.1 



1 11 21 31 41 51 

35 i I I I 1 I 

KAAARIiOiSL LLLSTCVAU* LQPLLGAQGA PLBPVYPGDN ATPEQKAQYA ADLRRYINMI. 60 
TRPRYGKRHK EDTIAPSEWG SPHAAVPREL SPLDL 95 



. _ Seq ID NO* C345 Protein Sequence 
40 Protein Accession «i 1IP_115934.1 

1 11 21 31 41 51 

irrWRHHVRIiL ftvsxalqii NLGMSYQRBK HNGGREBVTK VATQKHRQSP UTNTSSHFGB 60 

45 VTGSAE6WGP EEPLPYSRAF GEQASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR 120 

SBCXSAIiEHGA WTLRACHLCR CIFGALHCLP IiQfrPDRC3>PK DPIASHAHGP SAQGAPSLLL 180 

LXiPCALUmii LRPDAPAHPR SIiVPSVLQRB RRPOGRPOLO BRL 223 



Seq ID NO: C346 Protein Sequence 
50 Protein Accession #s NP_006524.1 

1 ■ 11 21 31 41 51 

I I 1 I.I 1 

MARSLVCLGV IILIiSAFSGP GVRGGPMPKL ADRiCLCADQE C8HPISMAVA LODYMATOCR 60 

55 FLTIHRGQW YVFSKIiKSRG RLFWGGSVQG DYYGDLAARL GYFPSSIVRB DQTLXPGXVD 120 

VKIDKMDFYC Q ^^1 



Seq ID NO I C347 Protein Sequence 
60 Protein Accession #: Eos sequence 

1 11 21 31 41 51 

JrrOVTBKSTE HPBKTTSTTE KTTRTPEKPT LYSBKTICTK GKNTPVPBKP TESUSrCTUT 60 

65 TETIKAPVKS TENPEKTAAV TKTIKPSVKV T6DKSLTTTS SHLNKTBVTH QVPTGSPTLI 120 

TSRTKLSSIT SEATGMKSHP YtNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVItliLV 180 

FLGLIFLVSY MNRTRRTLTQ NTQYNDAEDE GGPNSYFVYL MBQQNL®4QQ IPSPR 235 



70 Seq ID NOi C348 Protein Sequence 
Protein Accession #i MP_543146.l 

1 11 21 31 41 51. 

75 irWTEKSTB HPEKTTSTTB KTTRTPEKPT LYSBKTICTK GKNTPVPBKP TEMMNTTLT 60 

TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTBVTH QVPTGSPTLI 120 

TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 180 

PLGLIPLVSY h!MRTRRTLTQ NTQYNDABDB GGPNSYPVYL MBQQNI^SMGQ IPSPR 235 

80 Seq ID NOt C349 Protein Sequence 

Protein Accession #: F6EHBSH predicted 



1 11 21 31 41 51 

1 1 I t 1 i 
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MMPRLAFCCW GLALVSGHAT FQQMSPSRNF SFRIiPPBTAP GAPGSIPAPP APGDEAAGSR 60 

VERIiGQAPRR RVRLLRELSE R1»ELVFLVDD SSSVGEVNPR SELMFVRKLL SDPPWPTAT 120 

RVAIVTFSSK NYWPRVDYI STRRARQHKC ALIiDQEIPAX 8YRGG0TYTK GAFQQAAQIL 180 

UIARENSTKV VFLITDOYSN GGDPRPIAAS LRDSGVEIFT FGIWQGKIRE UIDMASTPKE 340 

S QICYLLHSFB BFEALARRAL HBDLPSGSPI QDDMVHCSYL CDE6KDCCDR KGSCK09THT 300 

GHFECICBKG YYGRGLQYEC TACPSGTYKP EX3SPGGISSC IPCPDSaiTS PPGST8PEDC 360 

VCRBGYRASO QTCEL.VHCPA LKPPENGYPI QNTCNMHPNA ACGVRCHPGF DLVGSSIILC 420 

LPMGLHSGSB SYCRVRTCPH LRQPKHGHIS CSTRENLYKT TCLVACDBOY RLEGSDXLTC 480 

, ^ QGNSQNDGPB PRCVBRHC8T FQMPKDVXZS PKNGQXQPAK FCTICWSCR QGFXIiSGVKB 540 

10 MLRCTTSGKH NVGVQAAVCK DVBAPQIKCP KDZBAKTLBQ QDSANVTHQI PTAKDNSGEK 600 

VSVHVHPAFT PPYLFPIGDV AIVYTATDLS GNQASCIFHI KVIDABPPVI DHCRSPPPVQ 660 

VSEKVHAASW DEPQFSDNSG AELVITRSHT QGDliFPQGET IVQYTATDPS GNNRTCDIHI 720 

VIKGSPCBIP PTPVKGDFIC TPDNTGVMCT LTCLBGYDFT BGSTDKYYCA YEDGVWKPTY 780 

TTBWPDCAXX RPANKOFKSF EMFYKAARO) DTDLMKXFSB AFETTLGKMV PSFCSDAEDI 840 

IS DCRLBENLTR KyCLBYNYDY ENGFAXGPGO HGAANRLDYS YSDFIOTVQB TATSIQKAKS 900 

SRIKRSAPLS DYKXKLXFNI TASVPLPDER NDTIjEHENQQ RLLQTLBTXT HKLKRTLNKD 960 

PMYSFQLASE ILIADSNSLE TKKASPFCRP GSVLRGRMCV NCPLGTYYNL EHFTCBSCRI 1020 

GSYQDBEGQXi ECKLCPSGMY TEYXHSRNXS DCKAQCKQGT YSYSQLBTCB SCPLGTYQPK 1080 

FGSR8CLSCP ENTSTVKRGA VNXSAC6VPC PB8KPSR86L MPCHPCPRDY yQPNAGKAPC 1140 

20 IACPFY6TTP FAGSRSXTEC 5TSVIMXTXF OGFGHLELLN CPSBVFHECF FNPCHNSGTC 1200 

QQIiGRGYVCL CPLOYTGLKC BTDIDECSPL PCLNMOVCXD LVGEPXCBCP SGYTGQRCEE 1260 

NINEC8S8PC UfKGICVDGV AOYRCTCVKG FVGLHCETBV KBCQSNPCUf NAVCBDQVGG 1320 

FliCKCPmL GTROGKZtVDB CLSQPCXNGA TCKDOANSFR CLCAAGFTQS HCBIiHINBCQ 1380 

SNPCRNQATC VDBLNSYSCK 0QP6F8GXRC ETEQSTGFNIi DFBVSGXYQY VMLDGMLPSb 1440 

25 HALTCTFWMK 8SD0MNY6TP ISYAVDNGSD MTLLLTDYNG WVLYVNGREK XTMCPSVNDG 1500 

RWUHXAXTWT SANGIWKVYZ DGKZiSDGQAG LSVGLPIPGG GALVLGQEQD KKGBGF8PAB 1560 

SPVGSISQLN LWDYVI*SPQQ VKSLATSCPB ELSKGNVLAW PDFLSGIVGK VKIDSKSXFC 1620 

8DCPRLGQSV PHIJITASEDL XPGSKVinbFC OPOFQLVGNP VQYCLNQGQW TQPLPHCBRX 1680 

SC38VPPPLEN GPHSAODFYA GSTVTYQCaiN GYYUiGD^ FCliaiGSWNa VSPSCLDVDE 1740 

30 CAVGSDCSEH ASCUIVDGSY XCSCVPFYTG DGKNCABPIK CKAPGITPEIIG HSSGBXYTVO 1800 

AGVTPSCQBG YQLMGVTKIT CLESGEWNHL IPYCKAVSCQ KPAIPENGCI EBLAPTFGSK 1860 

VTYRQIKGYT LAGDKESSCL ANSSWSHSPP VCEPVKCSSP ENmNGKYIX* SGLTYLSTAS 1920 

YSCDTQYSLQ QPSIIECTAS GIWDRAPPAC ULVFOGBPPA XKDAVXTGNN FTFRNTVTYT 1980 

CaCEOYTIAGL DTXBCLADGK WSR80QQCLA VSC33BPPXVD HASPETAHRL FaOXAFYYCS 2040 

35 DGYSLADNSQ LliCNAQGKWV PPEGQDMPRC XAHFCEKPPS VSYSXLBSVS KAKPAAa8W 2100 

SFKCMBGFVL NT8AKIBCMR GGQWNPSPMS IQCIPVRCGE PPSIMNGYAS GSNYSFQAMV 2160 

AYSCNK6FY1 KGBKKSTCEA TGQWSSPIPT CHPVSCGEPP KVBNGFLEHT TGRIFESBVR 2220 

YQOrPGYKSV G8PVFVCQAN RHWHSB8PI1M CVPUSOGKPP PXQNGFMXGB NFEVGSKVQF 2280 

FCNEGYBLV6 DS8HTCQKSG KHNKKSNPKC MPAKCPEPPZi LENQbVLKBIi TrBVGWTFS 2340 

40 CKEGHVLQGP SVLKCLPSQQ WNDBFPVCKX VLCTPPPLXS F6VPXPSSAL HF6STVKY8C 2400 

VGGFFLRQKS TTLCQPDGTW SSPLPECVPV ECPQPEEIPN GIXDVQGLAY LSTALYTCKP 2460 

GPBLVONTTT LCQENGHWLG GKPTCKAIEC LKPKBILNGK PSYTDLHYGQ TVTYSCURQF 2520 

RLBGPSALTC LETGDWDVDA PSCNAXHCDS PQPXEKGFVE GADYSYGAXX XYSCFPGFQV 2580 
ACSAMQTCEE SGHSSSXPTC MPIDCXSIiPPH lOFSDCTKLK DDQGYFBQBD DMMBVPYVTP 2640 
45 HPPYHLQAVA KTWOmCBSP AlHSSNFLYG TMVSYTaiPQ YEXJ/3HPVLX CQEDQTWNGS 2700 
APSCI8IECD LPTAPEKGPL RFTET8MGSA VQYSCKPGHI LAGSDLRLCL ENRKWSQASP 2760 
RCEAISCKKP NPVMNOSIKG SNYTYLSTLY YBCDPGYVLN GTBR^ITCQDD KNWDEDEPXC 2820 
IPVDC88PPV 8ANGQVRGDB YTFQKEIEYT QIEGPLLEGA RSRVGLANGS WSOATPDCVF 2880 
VRCATPPQIiA NGVTEGLDYG FMKEVTFHCH BOyXUiOAPK liTOQSDGNW} ABZPLCKFVN 2940 
50 CX3PPEDLABG FPNGPSPXHG GHIQYQCFPG YKLHGHSSRR CL8NGSH8GS SPSCLPCRCS 3000 
TPVXEYGTVH GTDFDCGKAA RXQCFKGPKL LGLSEXTCBA D6QWSS0FPK CEHTSC6SLP 3060 
MIPNAFISBT 8SWKENVXTY SCRSGYVIQG SSDLXCTEKG VWSQPYPVCB PLSCGSPPSV 3120 
ANAVAT6BAH TYB8BVKLRC LEGYTMDTDT DTFTOQKDGR WFPERISCSP KKCPLPENXT 3180 
HXLVHODDPS VNRQVSVSCA EGYTPBGVNX SVOQLOGTWB PPFSDESCSP V8C3GKPBSPE 3240 
55 HGFWGBKYT FE8TXXYQCE PGYELBGNRB RVOQENRQHS GGVAXCKETR CBTPXjBFLNO 3300 
KADIENRTTG PNWYBO^RG YSLEGPSEAH CTKMGTWSHP VPLCKPNPCP VPFVXPENAL 3360 
L8BKBPYVDQ NVSIKCREGF LLQGKOIITC NPDBTWTQTS AKCBKISCGP PAHVENAIAR 3420 
GVHYQYGDMX TYSCYSGYML BGFLRSVCLB N6TWTSPPXC RAVCRFPOQN 60X0QRFNAC 3480 
SCPEGWMGRIi CEEPICILPC LNGGRCVAPY QC!DCPPaNTG 8RCHTAV0QS PCLMGGXCVR 3540 
60 PNRCHCLSSW TGHNCSR 3557 

8eq ID NO: C350 Protein Sequence 
Protein Acceseion #: FOENESH predicted 

65 1 11 21 31 41 51 

I i I I 1 I 

MRFSVSGMRT DYPR8VLAPA YVSVCLLLLC PREVIAPAG8 BPWLCQPAFR CX3DKIYKPLB 60 
QCCYNDAIVS LSBTRQCOPP CTFHPCFELC CLDSFGLTOD FWKIiKVQGV N8QC»88PX8 120 
SXCBRGRZC 129 



70 



Seq XD NOi C351 Protein Sequence 
Protein Accession #s AAH35671.1 



1 11 21 31 41 51 

75 I I I 1 I 1 

MVFGARGGGA LARAAGRGLL ALLLAV8APL RLQAEELGDG CXJHLVTYQDS GmTSKNYPG 60 

TYPNHTVCBK TITVPKGKRIi IIjRLGDIiDIE SQTCASDYLli PTSSSDQYQP YCGSMTVPKE 120 

LLLNTSBVTV RFBSGSHISG RGPLLTYASS DHPDLITCLB RASBYLKTEY SKFCPA6CRD 180 

VAGDISGNMV DGYRDTSLLC KAAIHAGIIA DELOGQISVIi QRKaiSRYBO IIiAKGVLSRD 240 

SO QSLSDKRFLP TSN6CSRSLS FBPDGQIRAS SSWQSVNBSG DQVHW8PGQA RLQDQGP8WA 300 

86D88m7HKP RBWLBIDLGE KKKITQIRTT GSTQSNFNFY VKSFVMNFKN NNSKWKTYKO 360 

IVNNEEKVFQ GNSNPRDPVQ NNPIPPIVAR YVRWPQTWH QRIALKVELl GCQITQGNDS 420 

LVHRRTSQST SV8TKKBDET XTRPXPSBBT STOINITTVA XPLVLLWLV PAGMOXFAAF 480 

RKXKKKQSFY GSABAQKTDC WKQXKYPFAR HQSABFTZ8Y DNEKEMTQKL DLITSDHAQ 539 
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Seq ID KOi C3S2 Protein Sequence 
Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

I I I i I I 

KGFGAGQRLR PVPAPRSSAE EAARPGQLRL GIRRGEABLA KLAPSGVKVP GARGGGAIiAR 60 

AAGRGLLALIi LAVSAPUILQ AEBLCTGOGH LVTYQOSGTM TSKMYPOTYP NHTVCEICriT 120 

VPKGXRLILR LGDLDIESQT CASOYU^FTS SSDQYGPYOG SMTVPKELLL HTSEVTVRPB 180 

S6SHISGRGF LLTYASSDHP DLITCLERAS BYLKTBYSKF CPAGCRDVAG DISGNMVDGY 240 

BDTSLLCKAA IHAGIIADEL GGQISVIJQRK 6ISRYEGILA NGVLSRDGSL 8DKRFLFTSN 300 

GCSRSLSPEP DGQIRASSSW QSVNSSGDQV HWSPGQARLQ DQGPSWASGD SSNNKKFREW 360 

LBIDLGBKKK ITGIRTTGST QSNFNFYVKS FVMNFKNNN8 KWKTYKGIVII NEBKVPQ®IS 420 

NFROPVQNNF IPPIVARYVR WPQTWHQRI ALKVELIGOQ ITQ6NDSLVM RKTSQSTSVS 480 

TKXEDBTITR PZPSBETSTG ZNITTVAIPL VUiWLVFAQ MGIPAAFRKX KKKGSPYGSA S40 

BAQKTDCWKQ ZKYPFARHQS ABPTXSYDNB XEMTQKLDLI T8DMA0 586 

Seq ID NOt C353 Protein Sequence 
Protein Accession 0: FGBNBSH predicted 

1 11 21 31 41 51 

11)111 

MFORQBRFU) LSSAEAVAAW IXJJQHPDIIN KODGCXSHLVT YQDSGTMTSK NYPQTYPNHT 60 

VCEKTITVPK GKRLILRLGD LDIESQTCAS DYtLPTSSSD QYGMQKBBET EVI.CLSVAQA 120 

QRVDXPVQLL PSFLB6HKGH ADARGPYGGS MTVPKBLUiN TSBVTVRFBS GSKISGRGPL 180 

LTYASSDHPD LITCLERASH YLKTBYSKFC PAGCRDVAGD IS6NMVDGYR DTSLIiCKAAI 240 

HAGIIADEL6 GQI6VLQRKG ISRYGGIIiAN GVLSRDGSLS DKRPLFTSKQ CSRSLSFEPD 300 

GOIRASSSWQ SVNBSGDQVH WSPGQARLQD QGPSWASGDS SMNHKPRBWL EIDLGEKKKI 360 

TGIRTTOSTQ SNPNFYVKSF VMNFKNNNSK WKTYKGIVNN BBKVPQGNSN FRDPVQNNFI 420 

PPZVARYVRV VPQTNHQRIA UCVELZGGQI TQSNDSLVMR KTSQST8VST KXBDBTITRP 460 

IPSEETSTDA MPVQIVGDBT QMISQRENLO PDBSKIPFRO TAESMVRWP AVWNDLGML 540 

PLAHTPEEDI DHYCWKQIKY PFARHQSABP TISYDMEKEM TQKLDLITSD MADYQQPLMI 600 

GTGTVTRKQS TPRPMDTDAE EAGVSTDAGG HYDCPQRAGR HEYALPLAPP EPEYATPIVE 660 

RRVLRAHTPS AQSGYRVPGP QP6KKHSLS8 GGFSPVAGV6 AQDGDYQRPH SAQPADRGYD 720 

RPKAVSAZiAT BSGHFDSQKP PTHPGTSDSY SAPRDCLTPL NQTAMTALL 769 

6eq ID NOt C354 Protein Sequence 
Protein Accession #t 1IP_004607.1 

1 11 21 31 41 51 

I I I I I I 

MAGVSACIKY SMFTFNFLFW LCGILILALA IWVRVSNDSQ AIPGSEDVGS SSYVAVDILI 60 

AVGAHMIIiG FLGCCGAIKE SRCMI^LIiFPI GLLLILLLQV ATGILQAVFK SKSDRIVNET 120 

LYBNTKLLSA TGESBKQFQE AIIVFQBEPK CCOLVNGAAD MGNNFQHYPB LCACLDKQRP 180 

CQSYNGKOVY KETCISFIXD FLAKNLIZVX GXSFGLAVXE XLGLVF8KVL YGQIGNK 237 

Seq XD EIO: 0355 Protein Sequence 
Protein Accession #: NP_004608.l 

1 11 21 31 41 51 

I I I I I ) 

MCTGGCARCL QGTLIPLAPF QPLANILLFF PGGKVIDDND HLSQBIWPFG GILGSGVLMI 60 

FPALVPLOLK NNDCCGCCX3N BGCGKRFAMP TSTIFAWGF LGAGYSFIIS AISINKGPKC 120 

U4ANSTWGYP FHDGDYU4DB ALWNKCREPL NWPNNLTLF SILLWGGIQ MVLCAIOWM 180 

GLLGTLOSDC QOCX3C08GDG FV 202 

Seq XD NO I C3S6 Protein Sequence 
Protein Accession #i NP_002372.1 

1 11 21 31 41 51 

I i 1 t I I 

MPRPAPARRL PGLLLLLWPL LLLPSAAPDP V7VRPGPRRLE TRGPGGSPCa RPSPAAPDGA 60 

PASGTSBPGR ARGAGVCKSR PLDLVPXIDS SRSVRPLEPT KVKTFVSRII DTLDIGPADT 120 

RVAWNYAST VKXBPQLQAY TDKQSLKQAV GRXTPLSTGT MSGLAXQTAM DBAFTVEAGA 160 

. REP88NXPKV AIIVTDGRFQ DQVNEVAARA QASGXELYAV GVDRADMASL KMMA8BPLEE 240 

HVPYVBTYGV XBKLSSRFQE TFCRLDPCVL GTHQOQHVCX SDGBGKHHCB CSQO^TLNAD 300 

KKTCSAUJRC ALNTHQCEHI CVNDRSQSYH CECYBQYTLN BDR3CTCSAQD KCALGTHGCQ 360 

HICVNDRTGS HHCBCYEGYT LNADKKTCSV BDKCALGSHG CQHICVSDGA ASYHCDCYPQ 420 

YTLNEDKKTC SATBEARRLV STEDAOGCBA TLAFQDKVSS YLQRLNTKU) DILEKLKXNB 480 

YGQXBR 486 

Seq ID NOt C357 Protein Sequence 
Protein Accession #: NP_057723.1 

1 11 21 31 41 51 

MAROSLRRLL RUVLOLWLA LLRSVAOEQA FOTAPCSROS SHSADLDKOI DCASCRARPH 60 
SDPCL6CAAA PPAPFRLX^ ILGGALSLTP VLGUiSOFLV HRRCRRRBXF TTPIBBTG6B 120 
GCPAVALIQ 129 

Seq XD NO I C3S8 Protein Sequence 
Protein Accession «t HP 001810.1 
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1 H 31 31 41 51 

I 1 I I I I 

MQPTUiLSLL GAVGUUVVNS MFVDNRNIQiE GMVTRCIIEV LSNALSKSSA PPITPEOIQV 60 

LRTSRKDVKD KETTENCMTK FBVRLLRDPA DASEAHBSSS RGEAGAPGEB DIQ6PTKADT 120 

EKHABGGGKS RERADBPQWS LYPSDSQV86 EVKTRRSBKS QRBDGEEEE6 ENYQKGERGB 180 

DSSEEKHLEE PGGTQNAFLN ERKQASAIKK BELVARSBTB AAGHSQEKTH SREKSSQESG 240 

EEAGSQENHP QBSKGQPR8Q EESEB6EEDA TSEVDKRRTR PRHHHGRSRP DRSSQGGSLP 300 

SBBKGHPQEB SBBSNVSMAS ZiGEKRDHHST HYRASEEEFB YGEBIKGYPG VQAPEDlfWB 360 

RYRGRGSEEy RAPRFQSEES NOBEDXRHYP SLSLDKMAHO YGBBSBEBRG LEPGKGRHHR 420 

GRGGBPRAYF MSOTRBEKRP ZX3BGKHRVQB NCH1DKARRHP QGAHKEZjDRN YIMYGEBGAP 480 

GKWQQQGDLQ DTKENREBAR FQDKQYSSHH TAEKRKRUGE LFNPYYDPLQ WKSSHFERRD S40 

NMNDNFLEGE EENELTLNEK NPFPEYNYDW WEKKPFSEDV NWQYBKRNLA RVPKLDLKRQ 600 

YDRVAQU)Ql« LHYRKKSAEF PDFYDSBEPV STHQBAENEK DRADQTVLTB DEKKELEaHiA 660 

AMDLBLQKIA BKFSQRO 677 

Seq ZD NO: C3S9 Protein Sequence 
Protein Accession ftt XP_093082.1 

1 11 21 31 41 51 

I I I I I I 

MKLLCEGLKQ PNCVLQTLRW YRCLISSASC GALAAYLST8 QWLTELEPSE TKLEASALICL 60 

LYGGLKDPNC KLQKLNLQPS LSVTAAKLPV GMVGMCSQFS OSLVQSHFGY CQDSSPKCDL 120 

CKLIiWPSTRV AAAKDG6SPK SFLSBGLHWA GRLBAVBEVL GLGVltVQPGD PASQGGGHCB 180 

NYGSFRDLVD LEVKAEPSLR KGGMDLQRPT LQWLLCKIF SUGLFLFIAIi PNSPGQVSW 240 

QVTIPDGPVN VTVGSNVTLI CIYTTTVASR EQLSIQWSFF HXKEMEPISS PWBBGKHPDV 300 

EAVXGTLDGQ QAELQIYPSQ GGQAVAIOQF KDRITGSKDP GNASITISHM QPADSGIYIC 360 

0VNMPPDPLG QKQGILNVSV LVKPSKPLCS VQGRPETGHT X8LSCLSALQ TPSPVYYWHK 420 

LEGRDIVPVK ENPNPTTGIL VIGMLINFBQ GYYQCTAINR LGSSSCEIDL TSSHPEVGII 480 

VGALIGSLVG AAIIISWCP ARNKAKAKAK BRMSKTZAEIi tePHTKINPRO ESEAMPREDA 540 

TQLEVTXiPSS IHBTGPDTIQ BPDYBPKFTQ EPAPBPAPGS EPHAVPDU)! BIiBLEPBTQS 600 

ELBPEPEPEP ESEPGVWEP LSEDEKGWK A 631 

Seq ZD NO: C360 Protein Sequence 
Protein Accession St FGENESR predicted 

1 11 21 31 41 51 

I I i I I 1 

MVPAFWKVPIi ILSCLAGQVS WQVTIPDGF VNVTVGSNVT LICIYTTTVA SREQLSIQHS 60 

FFHKKEMBPI SSPWEBOKWP OVEAVKOTLD GQQAELQIYP SQGGQAVAIG QFKDRITGSN 120 

DPGMASITIS HMQPADSGIY ICDVNNPPDP LGQNQGILNV SVLVKPSKPL CSVQGRPBTG 180 

HTISZiSCLSA XjGTPSFVYYW HKLEGRDIVP VKaJPNPTTG ILVIGNLTNP EQGYYQCTAI 240 

NRLGNSSCBI DLTSSHPBVG IZVGALXGSL VGAAIIISW CFARNKAKAK AKERNSKTIA 300 

BLEPMTKINP RQB8EAMPRB DATQLEVTLP SSIH6TGPDT IQBPDYSPKP TQBPAPEPAP 360 

GSEPMAVPDL DIBLBLBPBT QSBLBPBPBP BPBSBPGVW EPLSEDEKGV VKA 413 

Seq ZD NOi C361 Protein Sequence 
Protein Accession #t NP_003011.1 

1 11 21 31 41 51 

I I I I I I 

MVSRHVSTHL SGLIiFHLASG NTPAFAYSPR TPDRVSEADZ QRLLR6VMEQ LGIARPRVEY 60 

PAHQAMNLVG PQSIEGGARB GLQHIiaPFGN IPNIVAELTG xmPKDFSED QGYPDPPNPC 120 

FVGKTDDGCL ENTFDTAEFS RBFQLHQHIiF DPEBDYPOLQ KNNXXLLYBK KKGGERRKRR 180 

SVNPYLQGQR LDNWAKKSV PHFSDEDXDP E 211 

Seq ZD NO: C362 Protein Sequence 
Protein Accession #i HP_076926.2 

1 11 21 31 41 51 

I I 1 I I I 

HmfQGNEQA MPGA6PGVPQ LGNMAVIRSH UHRGLQBKFIi XaBPKVLGW QILTALMSLS 60 

MGZTMMCMAS NTYGSNPISV YIOYTZHQSV HFIXSGSItSZ AAOZRTTKBL VRGSXXWNZT 120 

SSVLAASGZIi INTFSLAFYS FHHPYOVYYG N8MNCHGTMS ILMGLDGMVL LLSVXiEFCIA 180 

VSL8AFQCRV LCCTFGGWL ZLPSHSBHAE TASPTPUIEV 220 

Seq ID KOi C363 Protein Sequence 
Protein Accession #t NP_0020B2.1 

1 11 21 31 41 51 

I I I I I I 

MR6SELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR 6NHNAVGHLM GKKSTGBSSS 60 

VSBRGSLKQQ LREYZRHEBA ARNLLGLXBA RENRNHQPPQ PKALI3IQQPS NDSBDSSNFK 120 

DVGSKGKVGR LSAFGSQRE6 RNPQLNQQ 148 

Seq ID NO: C364 Protein Sequence 
Protein Accession #i NP_036393.1 

1 11 21 31 41 51 

1 I I 1 1 1 

MDLQGRGVPS IDRLRVLLML FHTMAQIMAE QEVBNLSGLS TNPEKDIPW RENOTTCLMA 60 

SPAAKPIVPY DVNASNYVDL ITEQADIALT ROAEVKORCO HSQSELQVFW VDRAYALRML 130 

FVKESBNMSK OPBATNRI*SK VQFVYDSSBK THFXDAVSAO KHTANSHHLS ALV TPAG KSY 180 

BOQAQQTISIi ASSDPQKTVT MZLSAVHIQP FDZISDFVF8 BEHKCPVDBR BQXiBBTLPLX 240 

LGbZLGXiVIM VTIAIYHVHH lOITANQVQZP RDRSQYKHM6 280 
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Seq ID NO: C365 Protein sequence 
Protein Accession ft: NP_0032l7.l 

1 11 21 31 41 51 

I I i I I I 

MLGLVUVLLS SSSAEEYVGL SANQCAVPAK DRVDCX3YPKV TPBEQINRaC CFDSRXPGVP 
WCFKPLTRKT ECTP 

Seq ID NO: C366 Protein Sequence 
Protein Accession ft: NP_002984.1 

1 IX 21 31 41 51 

111)11 
MSLPSSRAAR VPGPSGSLCA lAMULLULTP PGPIASAGPV SAVLTBLRCT CLRVTLRVNP 
KTI6KLQVFP AGPQCSKVEV VASLKK6KQV CLDPBAPFLK KVIQKXU)80 NXKH 

Seq ID KO: C367 Protein Sequence 
Protein Accession «i NP 005233.2 



KRSPSAAWLL 
VDfiFSASVLT 
DLLSVIWPPL 
MGHSRKKANI 
NYFLSIAIGV 
TPSNLIiLWH 
CRSVRTVKQM 



11 
I 

GAAIIiLAASL 
GKLTTVFLPI 
KIAYHIKANN 
AIGISLAIWL 
FLFPAPLTAS 
YFLIKSQGQS 
QV8LTSKKHS 



21 
1 

SCS6TIQG7M 
VYTIVFWOL 
WIYGEALCaiV 
LIIiVTIPLY 
AYVLMIRMLR 
HVYALYIVAL 
RKS8SY8SSS 



31 
I 

RSSKGRSLI6 
PSKGMAUrVP 
LIOFFYGNMY 
WKQTIPIPA 
SSAMDEHSEK 
CLSTLMSCID 
TTVKTSY 



Seq ID NO: C366 Protein Sequence 
Protein Accession fit NP 003460.1 



1 
I 

MABAKTHWLG 
YIENLRQQAH 
AENBPQSAPK 
RTNEIVEEQY 
YEDWGGBDW 
DQLSDDVSKV 
PPEDItlEMLK 
RAOTBALPDO 
AAWIPHVEKR 
BQIEQAIKEH 
KAEKGREHIA 



11 
I 

AALSLIPLIF 
KEESSPDYNP 
ENKPYALHSB 
TPQSIiATICS 
NPVEBXIESQ 
lAYLRRLVNA 
TGBKPNQSVE 
LSVEDILNLL 
QMAYENLNDR 
LNQOSSQETD 
KRAMENM 



21 
I 

LISGAEAASF 
YQGVSVPLQQ 
KNFPMDMSDD 
VFQELGKLTO 
TQEEVRDSKE 
AGSGRXiQNGQ 
PERELDLFVD 
GMESAANQKT 
DQELGEYIiAR 
KLAPVSKRFP 



31 
I 

QRNQLLQKEP 



41 

1 

KVDOTSHVTG 
LFRTKKXHPA 
CSILFHTCLS 
LNITTCHDVL 
KRKRAIKLIV 
PFVYYPVSHD 



41 
I 

DLRIiENVQKP 



51 
I 

KGVTVBTVPS 
VXYMANZiALA 
VORYWVIVNP 
PEQLLVGDMF 
TVIAMYLICF 
FRDKAKNALL 



51 
I 

PSPEMIRALE 
MRIILEALRQ 



YETQQHPERK 



NIGKNBQXND 
MGERATRIiFE 
LDDISEADLD 
SYPPNFYNQE 
MLVKYPBIIN 
V6PPKNDDTP 



LKKHQFPPMY 
BBQKLYTDDB 
EMXR8GQLGI 
KPLDSQSIYQ 
HPDLFQNRML 
KVLPRLPYGA 
SNOVXRVFGQ 
NRQYWDEDLL 



DDIYKANHIA 



LIEISRNIiQI 
SKSGYPKTPG 
GRSRSNQliPK 



MKVLEYLNQE 



60 
74 



60 
114 



60 
120 

leo 

240 
300 
360 
397 



60 

120 

180 

240 

300 

360 

420 

460 

540 

600 

617 



Seq ID NO: C369 Protein Sequence 
Protein Accession #s NP 112217.1 



1 
1 

MPCAQRSWIiA 
DASGHFLSYO 



KHPLVBGGYK 
SISKERWVET 
LEEBEQGLKI 
IiGLSHLSGMC 
RQLQYDPTPL 
QYGPNATFCQ 
IFG6W6RWSP 
PTFRQMQCSE 
GCTOSNVCIN 
XilPXGARDIR 
EKIMATGPTN 
TGIRRQTAHC 
GBKKRTVLCI 
GGVRIRSVTC 
VPPPTSRPRK 
ISTGSTSQPI 
PPYNTLTKQP 
PDLSRESKWP 
KTANRNHLKIi 
HSBCSTTCGL 
GGPKIREIQC 
6VQERGVFCP 
BQNKTEDQDQ 
TVRABCCP8C 



11 
I 

NLSWAQLLN 
LHYPITSSRR 
MMASSAPLOI 
PHIVYRRQKV 
LWADTKMIB 
VHHABKTLSS 
QPKRSGNINE 
THSKCSEEYI 
EVENVOOTLW 
H8HCSRT0QA 
FDTVPYKNEL 
GICKMVGCDY 
VMEIEGAGNF 
ESVniQLLFQ 
IKKGRGMVKA 
QTMVSDEQAL 
AXNHDEPCDV 
LTTPTGPBSM 
LT8QSLSXQP 
EKBIH8GS6E 
PFSTVMEGX«I< 
PNMMNQTKSS 
GAYWKRVECT 
VDSRDHRNXiR 
GGLCDVTTKRP 
CLCDKKPRPP 
PQTRITHTQR 



FGALCYGRQP 



LSGTVLQQGT 
PETKBPTCGL 
YHGSENVESY 
FCKWQKSINP 
DSGLFXAFTX 
TRFLDRGVTOF 
CSVKGPCRSK 
GVQSAERLOf 
YHHFPIFNPA 
EIDSNATEDR 
liAIRSEDPEK 
VTNPGIKYBY 
TFCDPBTQPN 
PPTDOQHLLK 
TRKPNSRALC 
STSTFAI8SP 
SEENVSSSDT 
EREQPEDKDB 
PSQRPTTSET 
EPVLTEBDAT 
TQMDSDCAAI 
PPHOQFLAGI 
TSTY4SQ7BHL 
EFKKCNQQAC 
QRRQRLLQKS 



31 

I 

QP6PVRFPDR 
VYYRZSHBBK 
RVGTAALSAC 
KDSVNISQKQ 
ILTIMNMVTG 
KSDLNPVHHD 
AHBLGHSFQI 
CLDDIPKKKG 
LDAAADGTQC 
NPBPKPGGKY 
HPCEIiYCRPI 
06VCLCDGSS 
YYUK3GFIIQ 
TIQKDGLDND 
GRQKKCHEKA 



41 

1 

RQEHPIKGLP 
DLFFNLTVNQ 
HGXiTGFFQLP 
ELWREKWERH 
LFHNPSIGNA 
VAVIiLTRKDI 



GLQQCPSSRR 
SPTTASKEGD 
GPT8EGGLVA 
SNPVIWTKIR 
GTPRVEGMVT 
SltlTEGFLLN 
QRPDPAKRCH 
PPPLSMSCNP 
CCHWATGNWD 
KKSADLLCTK 
KEL 



LKSKVIAPGV 
GEKKWCMA6K 
CTGERKRYRL 
DGOFSEKMLD 
CQTVRKNFKQ 
WITCNYKLAQT 
VEQMYFWQYG 
CPPRWWAGBW 
ILCPSDWTVG 
VLXFNXGTIS 
liGGKQWQDSS 

rrrsGSGLSs 

VPGbZDAPVES 
ERPANTXjIiPIi 
ASNYXQLTNG 
UIPCAGNXCVG 
BPCEAHQVEP 
LC8TSCGGGF 
DKLSASFCQT 



51 
I 

EYHWGPVRV 
GFLSNSYXME 
HQDFPIEPVX 
NLP8R8LSRR 
IHIVWRLIL 
CAOFNRPCST 
FVORHPYIMS 
XYDVBHQOQL 
CITVGKKPES 
CNVHPCRSEA 
AVIDGTPCFB 
KEGSGYVDIG 
VFQYDRKGDL 
HHTECSV7CG 
BACSAtCGPK 
NHSECSVSOQ 
NGKNPPTLKP 
TQPBLSSRYL 
SRNPITWPVT 
TEMPIAPPLT 



HOSAHWIVGN 
NWSKCSRNCS 
WSQCSRSCGG 
QKRIVQCVPS 
LXAMXKCSVP 



60 

120 

IBO 

240 

300 

360 

420 

480 

540 

600 

660 

720 

760 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1593 



Seq ID NO I C370 Protein Sequence 
Protein Accession Ut NP 001053.1 



21 



31 



51 
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MRQSHQLPLV 
LKLVCIQIQT 
KLENXFQABX 
SO reVD TOAM 
TFSTGEAKQA 
DINKDSSCV8 
VMEKAQKMND 
RNGENLEVRH 



1 

GLLLFSPIPS 
LKQKMIQQIR 
EKMEARNGTP 
AVLALTCVKX 
LFVSSDYYNB 
ASGNFNISAD 
TIFGPTMESR 
SKY 



I 

QLCBICEVSE 
YNVKSRIiSDV 
LTNYYQLSLD 
SLINGOIKAO 
NDWNCQQTLN 
EPITVTPPDS 
SWGPyZTCIQ 



I 

BNYIRLKPLL 
SSGELALIIL 
VLALCLFNGN 
BOSLKNISiy 
TVLTEISQGA 
QSYISVNYSV 
GLCANNNDRT 



NTMIQSNYNR 
ALGVCRNAEE 
YSTAEWNHP 
TKSLVEKILS 
PS2IFNAAA0V 
RINETYFTNV 
YWELLSQGEP 



GTSAVNWI£ 60 

NLIYDYHLTD 120 

TPENKNYYFO 180 

BKKENGLIGN 240 

LPALKQKTPL 300 

TVLNGSVPLS 360 

LSQGAGSYW 420 
433 



Seq ZD NOi C371 Protein Sequence 
Protein Accession #i NP_0045B2.1 

1 11 21 31 41 51 

I I I I t 1 

KCCTRSLLIA ALMSVLLLHL CX3ESBAASNF DCCLGYTDRI UHPKFIVGFT RQLANE6CDI 
NAIIFHTKKK LSVCANPKQT WVKYIVRXiS KKVKNM 

Seq ID MOi C372 Protein Sequence 
Protein Accession St NP_037403.1 

1 11 21 31 41 51 

I I I I I 1 

MAG8FLLH6P RAGGVGLLVL UiLGLFRPPP ALCARPVXBP ROLSAASPPL AETQAPRRPR 
RSVPRGEAAO AVQEIiARAZiA HLLBABRQER ARAEAQBAED QQARVLAQLL RVKGAPRNSD 
PALGLDDDPD APAAQLARAL LRARU)PAAL AAQLVPAFVP AAALRPRPPV YDDGPAOPDA 
EEAGOBTPDV DPELLRYLLG RZZtAOSADSB GVAAPRRUIR AADBDVGSBL PPE6VLGALL 
RVKRLETPAP QVFARRLLPP 

Seq ZD NO: C373 Protein Sequence 
Protein Accession Ut NP_002236.1 



11 



21 



31 41 51 

I t I I I 

MLQSLAGSSC VRLVERKRSA WCPGFLVLGY LLYLVFOAW PSSVELPYED XiLRQBUUa.K 
RRFLEEHECL SEQQLEQFLG RVLBASNYGV SVLSKASGNW NVIDFTSALFP ASTVIiSTTGY 
GHTVPLSDGG KAPCIIYSVZ GIPPTLLPLT AWQRITVHV TRRPVLYPHl RWGPSKQWA 
XVHAVLLGFV TVSCFFFZPA AVFSVLEDDW NFLBSFYFCF ZSIiSTIGLGD YVFGBGYNQK 
FRELYXiaZT CYbLLGLIAK LWLBTFCEL BELKXFRIMP YVKRDKDEDQ VHIZEKDQLS 
P6SZTDQAAG MKaX)KQNEP FVATQSSACV DGPAMH 

Seq ID KO: C374 Protein Sequence 
Protein AccesBlon S: NP 005463.1 



1 11 21 31 41 51 

t I I 1 I I 

METTNGTETW YESLHAVLKA LNATLHSNtiL CRPGPGLGPD MQTEBRRASL PGRDDNSYMY 
ZLFVKFLFAV TVGSLZLGYT RSRKVDKRSD PYHVYIKNRV SMZ 

Seq ID MOi C375 Protein Sequence 
Protein Accession #t NP 005236.1 



MGRHLALLLL 
VYITHPAWEV 
KALEKNTNVE 
NGEFYYSFKD 
AKIiTVHZBQA 
QQFRTVRSFP 
KAGPVKFEKD 
8ILEPVKRQQ 
TIMSLSAVDP 
SDHGLPYRRE 
LVQYQIEAGN 
ITVAASHKLV 
PTGIQVKENQ 
DRBTTDKYTb 
ZIQVEATDKD 
ARBBPQLFST 
QVRYSLLDHG 
BWDVNEHLH 
VFKIGBETGV 
EPVYYPEZME 
RXLDREQQDE 
DHERKARREP 
DILSIKAVDN 
8VEPPQIPLW 
QVPIKVZDTN 
SLKXFRLDPA 
WFTASSYKGR 
SZKTAKEUJR 
BTVSZGSFV6 
QGTNMAGIiST 



11 
1 

LLliLFQHFGD 
RYKIVSGDSE 
ARTKVRVQVL 
RTDMFAIHPT 
NECAPVZTAV 
GSKEYKVKAI 
VYRAEISBPA 
AAHPBLBVTT 
DEGBNOYVTY 
VBVLATZTLN 
SLDLFSLNFN 
NliQCBETGVA 
PVGSSVZFMN 
NITVYDLGIP 
LGPNQHVTYS 
WVKVSLEDV 
EGNPDVDKLS 
PPVPSSFVBK 
lETSDRLDRE 
KSPKDVSWQ 
HILBVTVTDN 
LYRVIATDKD 
GRPQKSSTTR 
FDITGGNYDS 
DHRPQPST8K 
TOSLYTSBKL 
VYESAAVGSV 
SNQAEYDLMV 
MVTAHSQSSV 
NTTVLVHLQD 



21 
I 

SDOSQRLEQT 

NLFKAEEYIL 

DTNDIiRPLPS 

SGVXVLTGRL 

TLSPSELDRD 

6DXDWDSHPF 

PPNTPWMVK 

SDRKASTKVX* 

SIANUJHVPP 

NIjNDNTPLFB 

S6VLSLKRSI> 

KMLAEKIiLQA 

STDLDTGFNG 

QKAAWRIiLHV 

IliTDTDTFSZ 

NDNPPTPIPP 

GAVRIVQQLD 

GTVKEDAPVG 

STSHYWLTVF 

ZEAFDPDSSS 

GSPPKSTIAR 

EGPNABISYS 

LBIEWI6RPK 

KFDVDKGTGT 

YEWZPBDTA 

DHBAVSFAHL 

VXiQVTALDKD 

KATDKGSPFM 

VYBZKDOnG 

ENDNAPVFMQ 



31 

1 

PLQFTHX»EYN 
GDPCFIiRIRT 
PTSYSVSLPE 
DYLBTKLYEM 
PAYAIVTVDD 
GYNLTIjQAKD 
AIPAYSHLRY 
VKVLGANSNF 
AIDHPTGAVS 
KZNCBQTIPR 
MDGLGAKVSF 
NKLHHQGEVE 
KLVYAV6GGN 
VWDANDNPP 
DSVTGWNIA 
NYRVRVREDL 
PBKKC2VYNI*T 
SLVMTVSAHD 
ATDQGWPLS 
NDKLMYKZTS 
VrVKILDEND 
lEDGNEHGKP 
QSLEPISFEB 
ZIVAKFLDAE 
PETEILQISA 
TVMVRDQDVP 
KOKNAEVLYS 
SE1T8VRIFV 
DAFDINPHSG 
AEYTGLISBS 



41 

I 

VTVQENSAAK 
K6GNTAILNR 
NTAIRTSIAR 
EILAADRGMK 
CDQGANGDIA 
ROTPPQFSSV 
VFKRTPGKAK 
PEFTQTAYKA 
TSENLDYELM 
DLSVOEQITT 
HSLRITATDG 
DIFFDSHSVN 
EDSCFMIDMB 
EFLQBSYFVB 
RPLDRELQKB 
PEGTVI^4WI.B 
VRAKDK6KFV 
EDAGRDGBIR 
SFIEIYIBVE 
GNFQGFFSIH 
NKPQFLQKFY 
PIBPKTGWS 
SPPTPTVMES 
QKSNYNLTVE 
VDQDBKNKLZ 
VKRKFARZW 
IB86NI6NI0 
TZADNASPXP 
TZITQKAIi)F 
ASINSWLTD 



60 
96 



60 
120 

180 
240 
260 



60 

120 

180 

240 

300 

336 



60 
103 



51 
I 

TYVGHPVKMG 
EVKDHYTLIV 
VSATDADIGT 
LYGSSGZSSM 
SLSIVAQDliL 
KVIHVTSPQP 
FSLNYNTGLZ 
AFDENVPIGT 
PRVYTLRZRA 
VSAXDADELQ 
ENFATPLYZN 
AHIPQFR8TL 
TGKLKILSPZi 
VSEDKEVK8B 
H8LKZEARDQ 
AHDPDLGQSG 
SLSSTCYVEV 
Y8IRDGSGVG 
DVNDNAPQTS 
PICTGLITTT8 
KZRIiPEREKP 
SKRFSAAGBY 
DPVAHMIGVI 
ATDGTTTILT 
YTLQSSRDPL 
NVSDTNDHAP 
NSFMIDPVLQ 
TSKEYSVELS 
BTLPIYTLII 
RNVPLVZRAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 



1368 



wo 03/042661 



PCT/US02/36810 
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DADKDSNALL VYHIVEPSVH TYPAIDSSTO AIHTVLSLDY BET8IFHFTV QVHDP«3TPRL 
PABYAANVTV HVIDINDCPP VPAKPLYKAS UiPTYKGVK VITVNATDAD SSAFSQLIYS 
ZTEGNIGEKF SKDYKTGALT VQNTTQLRSR YELTVRASDG RFA6LTSVRI WVKBS KESH L 
KPTQDVYSAV VXENSTBABT LAVITAI68P INEPLFYUIL N9DRRFKZSR TSOVLSTTGT 
PPDREQQBAF DWVEVXBEH KPSAVAHWV KVZVEDQNSN AFVFVXILpyy AWKVDTBVG 
HVIRYVTAVD RDSGRNQEVH YYLKBHHEHP QIGPLGBISL KKQPBLDTLM KEYLVTWAK 
DGGNPAPSAB VIVPITVMNK AMPVFEKPFY SABIAESIQV HSPWHVQAN SPBGUCVPYS 
ITDGDPFSQF TINFIITGVXN VXAPLDFBAH PAYKLSIRAT DSLtQARAEV PVDXIVDDIN 
DNPPVFAQQS YAVTLSEASV JGTSWQVRA TDSDSEmRO I8YQHFQNHS K8HDHFHVDS 
8T0LI8LLRT LDYEQ8RQHT ZFVRAVD6GM PTLSSDVZVT VDVTDZJKaiP PLPEQQZYEA 
RISEHAPHGH FVTCVKAYDA DSSDIDKLQY SILSGNDHKH PVIDSATGII TLSNLHRHAL 
KPFYSUILSV SDGVFRSSTQ VHVTVIGGNL HSPAFLQNEY EVBIiAENAPL HTLVMBVKTT 
DGDSGIYGHV TYHZVNDFAK DRFYZNERGQ ZFTLBXLDRE TPAEKVZSVR U4ARDAG6KV 
AFCTVNVILT SDNDNAPQPR ATKYEVNZGS 8AAKGTSVVK SASDADEGSN ADZTYAZEAD 
6BSVKENLEZ NKLS6VZTTR BSLZGLENBF PTFFVRAVX3N GSPSKESWL VYVKZLPPEM 
QLPKPSBPFY TFTVSEDVPV GTEZDLZRAE HSGTVIiYSLV KGNTPESNRD ESFVZDRQSO 
RLKLEKSLDH ETTKWYQPSl LARCTQDDHE MVASVDVSXQ VKDANDNSPV PESSPYBAPI 
VENIiPGGSRV IQIRASDADS GTNGQVMYSL DQSQSVEVZB 8FA1NMETGH ITTLKBLDHB 
KROtlYQlXW ASDHGEKZQL 8STAZVDVTV TDVNDSPFRF TABXYKGTVS EDDPQGGVIA 
IZtSTTDADSB BXNRQVTYFX T66DPLGQFA VETZQNEHKV YVKXPU>REK RDNYLXjTZTA 
TDOTFSSKAZ VBVKVLDAND NSPVCBKTLY SDTXPEDVIiP GKLZKQZSAT DADIRSNAEZ 
TYTLLGSGAE KFiOlIPDTGE LKTSTPLDRE BQAVYHLLVR ATDGGGRPC3Q ASIWTLEDV 
NDNAPEFSAD PYAZTVFEMT BPOTLLTRVQ ATDADAGLNR RXLYSLZDSA DGQFSZHELS 
GZZQLBKPLD RBLQAVYTIiS LKAVDQGLPR RLTATGTVIV SVLDXNDHPF VFBYREYGAT 
VSEDILVGTB VLQVYAASRD ZEANAEXTYS ZXS6HEBGKF SXCSKTOAVF IXEtniOYESS 
HEYYLTVEAT DGGTPSLSDV ATVNVNVTDX NDNTPVPSQD TYTTVISEDA Vl<EQSVITVM 
ADDADGPSNS HIHYSIXDON QQSSPTIDPV RGBVKVTKLL DRETISGYTIi TVQASDNGSP 
PRVNTTTVNX DVSDVNDNAP VFSRGMYSVX XQENKPVOPB VLQLWTDED SSHNGPPFFF 
TIVTGNDEKA FEVNPQOVLL TSSAIKRKBK DHYLLQVKVA DMGKPQLSSL TYXDIRVIBE 
SIYPPAILPL EIFIT8SQEE Y8GGVIGKXH ATOQDVYDTL TYSLDPQMDN liFSVSSTGGK 
LIAHKKLDIG QYLIjNVSVTD GKFTTVADIT VHIRQVTQO^ LNHTIAIRFA MLTPBEFVGD 
YWRNFQRALR NILOVRRNDI QIVSLQSSEP HPHIiDVLLPV BKPGSAQIST KQLLHKINSS 
VTDIBBIIGV RILMVFQKLC AGLDCPWKFC DBKVSVDBSV MSTHSTARLS FVTPRHHRAA 
VCLCKEGRCP PVHHGCEDDP CPBGSECVSD PNEEKBTCVC P88RFGQCPG 88SMTLTGNS 
YVKYRIiTEKE NKLEMKLTMR LRTYSTHAW MYARGTDYSI LBIHHGRLQY KFDOGSGPGI 
VSVQSIQVND GQWHAVALEV NGNYARLVU) QVHTASOTAP GTLKTLNLKI YVPPGGHIRQ 
QGTRHORSPQ VGNGFRGCMD SXYIWGQELP LN8KPRSYAH lEBSVDVSPQ CPLTATEDCA 
SNPCQNGGVC NPSPAGGYYC KCSALYIGTH CEXSVNPCSS NPCLYOGTCV VDNG6FVCQC 
RGLYTGQROQ LSPYCKDEPC KMGOTCFDSL ZX3AVCQCDSG FRGER0Q8DX DECSGNPCZfl 
OALCENTHGS YHCKCSHBYR GHHCEDAAPM QYVSTPWNIO LAEGXOZWF VAOZFLZiWV 
FVLCRKMZER XXKKQAEPXD KHLGPATAPL QRPYFDSKLN KNXYSDIPPQ VPVRPISYTP 
SXP8DSRiniL DRHSPB08AI PEHPEPSTFN PESVHGHRKA VAVCSVAPKL PPPPPSNSP8 
DSDSIQKPSW DPDYDTKWD LDPCLSKKPL EEKPSQPYSA RESLSEVQSL SSPQSESCDD 
NGYHWDTSDW MPSVPLPDIQ EFPNYBVIDB QTPLYSADPN AXDTDYYPGQ YDXESDFPPP 
PEDFPAADEL PPLPPEFSNQ FESXHPPRDM PAAGSLGSSS RliRQRFNliNQ YLP HFYP LDM 
SBPQT K ST G E HSTCREPHAP YPPGYQRHPB APAVESMFMS VYASTASC6D V8ACCEVE8B 
VMHSDYESGD D6HFEEVTXP PZJ)SQQHTBV 

Seq XO NOt C376 Protein Sequence 
Protein Acceesion #t NP_0S5035.1 

X 11 21 31 41 51 

1)1111 
MCYQKCARCI GHSLVGLALIi CIAANILLYP PNGETKYASE NHLSRFVWPF SGIVGQQLLM 
ZiLPAFVFXQL EQDDCOGCOS HENCOKRCAM LSSVIAALXG XAGSGYCVXV AAIiGLAEGPL 
CLDSLGQWNY TFASTBGQYL X1OTSTW8ECT EPXHXVEHNV SLFSILLAI/3 GXEFXIiCLXQ 
VXMGVLG6IC GFCCSHQQQY DC 



Seq ID NO I C377 Protein Sequence 
Protein Accession Ui NP_0037S0.1 



1 

I 

MSTENVEGKP 
PPQLFTEltDE 
. MEKGSXMIiDR 
ADXGKTVSSA 
SNVLVGBVDP 
ATLMSDEVFH 
NMYSGGENVQ 
IillXQAI«8AXL 
XLSSTGPVLV 
BEGFSSLXSF 
LAPEYLPTMS 
TSSMALKKFK 
MR6WFVPPF0 
HVAXLNVXCS 
ILTGLSVFMA 
VPLRRVHIiFT 
VXPEKDKXKK 
DRBRSPTFLB 



11 
I 

8HLGBRGRAR 
XiLAVDGQBNB 
EASSZiPQLVB 
SRMFTNPDNO 
LDTPPIAFVR 
DIAYKAKDRH 
MNGDTPHDGO 
FZYLATVTNA 
PERIiLFNFSK 
IPIYDAPKKM 
STDMYHNTTF 
TSPYPPTTAR 
BNFHWVCIiAA 
LMALPHYVAA 
PILKPIPMPV 
FLQVLCXJUiL 



21 

1 

SSTFLRWQP 
WKBTARWXKF 
MXVDHQXETG 
SPAKIBRULT 
LQQAVML6AL 
DLIAGXDBFL 
HGGGQHQDCB 

DNNFDYZjBFR 
XKLADYYPIN 
DWAFLSKKEC 
KLXSDFAXXL 
AXPALLVTZL 
TVZSXAHZDS 
LYGVFLYMGV 
WILKSTVAAI 



31 
I 

MFNKSXFTSA 



UiKPBLKDKV 
8SSLNDXSDK 
TEVPVPTRFL 
DBVIVLPPGE 
BLQRTGRFCG 
TDNMQGVLBS 
LMXOLHSAFL 
SNFKVGYNTIi 
SKYGGNIiVGN 
8XLXFCVIDA 
XF MDQQXTA V 
LKMETBTSAP 
ASLNGVQFMD 
IFPVMIIALV 
DCPYSEKVPS 



41 

1 

VSPAAERXRP 
WSKPHVATLS 
TYTLIiRKHRH 
PEKDQLKNKF 
FXIiLGPKGKA 
HDPAXRIEPP 
6LXXDXKRKA 
FLOTAVSQAX 
CLIIiVAODAS 
PSCTCVPPDP 
NCNFVPDITL 
LVGVDTPKLI 
IVKRKBKXLK 
GBQPKFI/SVR 
RLXLLIMPLR 
AVRKGMDYLP 
ZKIPMDXMEQ 



1860 

1920 

19S0 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2S80 

2640 

2700 

2760 

2820 

2680 

2940 

3000 

3060 

3120 

3160 

3240 

3300 

3360 

3420 

3460 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4590 



60 
120 
180 
202 



51 
I 

IliGEEDDSPA 
LHSLFELRTC 
QTKKSNXiRSL 

^aaa.PRDABA 

KSYHBIGRAX 
KSLP88DKRK 
PFFASDFYDA 
FCLFAGQPXaT 
PLVQYFTItFT 
ANI8ISNDTT 
MSPILFLOTY 
VP8BFKPTSP 
KGAGYHLDLF 
EQRVTGTLVF 
HQPDFXYLRH 
6QKDL8PLDD 
QPFLSDSKPS 



RHTSC 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

640 

900 

960 

1020 

103S 



8eq ZD NOt C376 Protein Sequence 
Protein Accession St NP_000949.1 



1369 



wo 03/042661 



1 11 21 31 41 51 

I 1 I I I I 

HSTPGVKSSA SLSPORLKSP VTIPAVMFXP GWONLVAXV VXiCKSRKEQK ETTFYTLVOG 60 

lAVTDLUSTL LVSPVTIATY MKGQWPOGQP LCEYSTFILb FPSLSGLSII CAMSVBRYIA 120 

IMHAYPYSHY VDKRLACSLTL FAVYASNVLP CALPNK0U5S SRUJYPDTWC PIDWTTNVTA 180 

HAAYSYMYAG PSSFIiILATV LCNVLVCOAL UIMHRQFMRR TSLGTBQHHA AAAASVASRQ 240 

HPAASPALPR LSDPRRRRSF RRIAGAEIQM VILLIATSLV VLICSIPLW RVPVNQWQP 300 

SLERBVSKNP DLQAIRIASV NPILDPWIYI LLRKTVLSKA IBRIKCLPCR XGGSRRERSO 360 

QKCSDSQRTS SAMSGHSRSF ISRELKBISS TSQTLLPDLS I.FDI1SEMGLQ GRNIiLPGVPG 420 

MGLAQEDTTS LRTLRISETS DSSQGQDSBS VLLVDEAOGS GRAGPAPKGS SLQVTPPSET 480 

UILSBKCI 488 

Seq ID NO: C379 Protein Sequence 
Protein Accession #t NP_002650.1 

1 11 21 31 41 51 

i I 1 I I I 

KGHPPUiPLL LLLHTCVFAS HQLRCMQCKT NGDCRVBECA LQQDLCRTTI VRLWBBGEEL 60 

BW/EKSCTHS EKTNRTLSYR TGliKITSLTE WCGLDLCNQ GNSGRAVTYS RSRYLECISC 120 

GSSDHSCERO RHQSLQGR8P EBQCLDWTH WIQBGEEGRP tCDDRHLROOO YLPGCPGSKO 180 

PBNIIDTFHFL KCCNTTXCNB OPILELENLP QNGRQCYSCK GN8THGCS8E ETPLIDCRGP 240 

KNQCLVATOT KEPKHQSYKV RGCATASMCQ HAHLGDAFSM NHIDVSCCTK SGOiHPDLDV 300 

QYR8GAAPQP GPAHLSLTIT LLMTARLWGG TLLITT 335 

Seq ID NOt C3ao Protein Sequence 
Protein Accession j)i BAB55406.1 

1 11 21 31 41 51 

I I I I I I 

MDFESGQVDP LASVILPFNIj LENLSPEDSV LVRRAQFTPF NICIGLFQDVG PQRRTLVSYV 60 

MACSIGNITX QNLKDPVQIK IKHTRTQEVH HPXCAFWDUl KNKSFGGWNT SGCVAHRDSD 120 

ASETVCLCNH FTHFGVLMDL PRSASQLDAR NTKVIiTFISY lOCGISAIPS AATLLTYVAP 180 

BKLRRDYPSK IIiMNLSTAUi FLNLLFLLDQ WIT8FNVDGL CIAVAVLLHF FLIATFTWMG 240 

LEAIHMYIAL VKVFNTYIRR YILKFCXIGW GLPALWSW LASRMNNBVY GKBSYGKEKG 300 

DEFCWXQDFV XFYVTCAOYF GVMFFLNXAM PXWHVQIOO RNGKRSNRTL REEVLRNLRS 360 

WSLTFLLGM TWGPAFPAWQ PLMIPPKyiiF SXFNSLQGLF XFXFHCAMKB NVQKQWRRHI* 420 

OCGRFRIiADN SDHSKTATHX XKXSSQNLQK SLSS8SI0SN STYIiTSKSKS SSTTYFKRNS 4 BO 

HTDNVSYEHS FMKSGSLRQC FHGQVI«VKTO PC 512 

Seq ID NOi C381 Protein sequence 
Protein Accession ft: NP_000565.1 

1 11 21 31 41 51 

I I 1 1 I I 

HTVARPSVPA ALPIiLGEIiPR LLLLVXiLCLP AVWGDOSLPP DVPNAQPAI£ GRTSFPEDTV 60 

ITYKCEBSFV KIPGEKDSVX CLKGSQHSDZ BEFCNRSCEV PTRIjNSASLK QFYXTQNYFP 120 

VGTWBYECR PGYRREPSLS PKLTCLQNLK WSTAVEPCKK KSCPMPGBIR MGQIDVPGGI 180 

LFQATISFSC NTGYKLFGST SSFCLISGSS VQWSDPLPEC REIYCPAPPQ IDNGIIQGER 240 

DHYGYRQSVT YAQJKGFTMI QJBHSIYCTVN NDEGEWSGPP PECRGKSLTS KVP PTVQ KPT 300 

TVIilVPTTEVS PTSQKTTTKr TTPNAQATR8 TPVSRTTKHF KETTPNKGSG TTSGTTRLIiS 360 

GHTCPTLTGL LGTLVTKGLL T 381 

Seq ID NO I C382 Protein Sequence 
Protein Accession A: Eos sequence 

1 11 21 31 41 51 

I i I 1 ) { 

KDTSRLGVLL SLPVLLQIiAT GGSSPRSGVIt LRGCPTHOIC EPDORMLIiRV DCSDLGtiSBL 60 

PSNLSVFTSY LDLSMtlNISQ LLPNPLPSLR FLEELRLAGH ALTYXPKOAP TGLYSLKVLM 120 

LQNNQLRHVP TEALQNZiRSL QSLRU3ANBI SYVPPSCFSG LHSLRHIiflliD 0NALT8XPVQ 160 

AFRSLSALQA MTLALNRXHR IFDYAFGNLS SLWLHIiHMN RIHSL6KKCF DGXiKSIiBTLD 240 

LNYNNIiDBFP TAIRTLSNLK ELHFYDNPIQ FVGRSAFQHL PELRTItTLNG ASQITBFFDIi 300 

TCTTANLESLT LTGAQI8SLP QTVCNQLFNL QVLDLSYNLL EDI1P8PSVCQ KLQKIDLRHN 360 

BIYBXKVDTF QQLLSIiRSLN lAHNKXAXXH PNAP8TLPSL XKLDLSSNLL SSFPXTGUIQ 420 

. IiTHZiKIiTGNH ALQSIiXSSEH FPBbKVZSfF YAYQCCAFGV CENAYKXSNQ HNKGDNSSMD 480 

DLHKXDAGMF QAQDBRDtiBD FLLDFBEDLK ALK8VQCSPS PGPFKPCBIUi LDGHLXRIGV 540 

WTIAVIALTC NALVTSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA GVDAPTFGSF 600 

ARHGAWHEKG VGCHVIOFLS IPA8ESSVFL LTZ<AAI<ERGF SVKY8AKFBT KAPFSSLKVI 660 

XLLCALLALT KAAVPLLGGS KYGASPLCLP LPPQEPSTMG YMVALZLLKS LCFLMMTIAY 720 

TKLYCNLDKO DliEMXHDCSM VKBXALUiFT NCXUICPVAF LSFSSLXNLT FI8PBVXKFX 780 

LLVWPLPAC LNPLLYXLFN PHFKBDLVSL RKQTYVHTItS KHPSLHSXNS DDVBKQSC9S 840 

TQALVTFTSS SITYDLPPSS VPSPAYPVTB SCHLSSVAPV PCL 883 

Seq ZD NO: C383 Protein Sequence 
Protein Accession fi: NP_003658.1 

1" 11 21 31 41 51 

i I I I I 1 

MDTSRIiGVLL SLPVLLQLAT GGSSPRSGVL IiROCPTHCHC EPDGRMLLRV DCSDU3LSEL 60 

PSNLSVFTSY LDLSKNNISQ LI«PNPXiPSI«R FftBRfiRTiAGM ALTYXPKOAP TGLYSLKVXit 120 

LQMNQLRHVP TEALQNLR6L QSLRLDANHX SYVPPSCP80 LHSLRULNLD DNALTGIPVQ 180 

AFRSLSALQA MTLAZMaBH IPDYAFGNLS SLWLHUINN RXHSLOKXCF DOUISLBTLD 240 

LNYNNIABFP TAIRTLSNLK BLGFHSNNXR 8IPSKAFVGH PSLZTXHFYD NPIQFVORSA 300 

1370 
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FQHLPELRTL TU9GASQITE FPDLTGTANli BSLTLTGAQI SSLPQTVCNQ LPNliQVLDLS 
YNLLEDIjPSP SVCQKIiQKID LRHNBIYBIK VDTPQQLLSL RSUHiAWNKI AIIHPNAFST 
LPSLIKU3LS SMLLSSFPIT GLHGIiTHLKL TGKBALQSLI SSQfFFELKV lEMPYAYQCC 
AFGVCaiAyK ISHQWmCQOH SSKDDLHKKD AGMFQJVQDBR DLEDFLLOFB BDLKALHSVQ 
CSPSPGFFKP CBHLLOGWLI RIGVHTZAVL ALTCNALVTS TVFRSPliYIS FIKUilGVIA 
AVNMLTGVSS AVLAGVDAFT FGSFARHOAW KBNGVGCHVI 6FLSIFASES SVFU^TLAAL 
ERGFSVKYSA KFETKAPFSS LKVIILLCAL LALTMAAVPL LGGSKYOASP LCLFLPPGEP 
STMGYMVALI LLNSLCFLMM TIAYTKLYCN IjDKGDLENIW DCSMVKHIAL LLFTNCILNC 
FVAFLSPSSL INLTFISPEV IKFILLWVP LPACLNPLLY ZLFNFHFKED LVSLRKQTYV 
WTRSKHPSLM SIMSDDVEKQ 8CDSTQALVT FTSSSITYDL PPSSVPSPAY FVTB8CBLSS 
VAFVPCL 

Seq ID KOt C384 Protein Sequence 
Protein Acceseion #t NP_003497.1 

1 II 21 31 41 51 

) I I 1 I I 

MEMFTFLLTC IFLPLI-RGHS LFTCBPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 
FLPLANLBCS PNIETFLCKA FVPTCIBQX)! WPPCRKIiCB KVYSDCKEOiX DTFOIRWPEB 
LBCDRLQYCD BTVPVTFDPH TBFLGPQKRT EQVQRDZGFW CPRHLKTG06 QGYKFLGIDQ 
CAPPCPNMYP KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRPRYP BRPIIYYSVC 
YSIVSLMYPI GFLLGDSTAC NKADEKLBLG DTWLQSQNK ACTVliFMUiY FFTMAGTVWW 
VILTITWFLA AGRKWSCKAI BQKAVWFHAV AMOTPGFLTV MLLALMKVEG DNISGVCFVQ 
LYDLDASRYP VLLFLCLCVF VdiSLLLAai ISUSHVRQVZ QBDGRNQBKL KKFNIRXGVF 
SGLYLVPLVT LLGCWYBQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELAIiPMIKY 
LMTliIVGISA VFWVGSKKTC TEWAGPPKRM RKRDPXSESR RVMJESCBPP LKHNSKVKHK 
KKHYKPSSHK LKVISKSMGT STOATANHGT SAVAITSHDY I/^QBTLTEIQ TSPETSMREV 
KAD6ASTPRI* REQDCQBPAS PAASI8RLS0 EQVD6KGQAG SVSESARSBG RISPKSDITD 
TGIiAQSNNIQ VFSSSEPSSIi KSSTSLLVHP VSGVRXEQGG GCHSDT 

Seq ID MOi C385 Protein Sequence 
protein Accession #t HP_000S73 

1 11 21 31 41 51 

11)111 

MRIAVICPCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 

PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH USDGSDELVT 120 

DPPTDLPATB VPTPWPTVD TYDGRGDSW Y6LRSKSKKF RRPDIQYPDA TDEDITSHME 180 

SEELNGAYXA IPVAQDLNAP SDWDSRQKDS YBT8QLDDQS AETH8HKQ8R LYKXUCAMDES 240 

NEHSDVIDSQ BLSKV8REFH SHEFHSHEDH I^WDPKSKEB DXHLXFRISH ELDSA8SEVN 300 

Seq ID NO I C386 Protein Sequence 
Protein Accession #: IIP_002612 

1 11 21 31 41 51 

I I 1 I I I 

MGAAR6SPAR PRRLPLLSVL IiLPLLGQTQT AIVPIKQPS8 QDALQGRRAL LRCEVEAPGP 60 

VHVYWLLDGA PVQDTERRPA QGSSL8FAAV DRLQDSGTPQ CVARDDVTGE EARSANASFN 120 

IKWIEAGPW LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS IBO 

KERNLTLRPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DBSPARWLA PQDVWARYB 240 

EAMPHGQPSA QPPPSLQWLP BDBTPITNR8 RPPHUIRATV PANGSZJiLTQ VRPRNAGIYR 300 

CIGQGQRGPF ZILBATLHLA BIEDMPZ*FEP RVFTA6SBBR VTCLPPKGLP EPSWmEHAG 360 

VRLPTHGBVY QKGKELVLAN lABSDAGVYT CHAANLAGQR RQDVNITVAT VPSWLKKFQD 420 

SQLBEGKPGY LDCLTQATPK PTWWYRNQM IiISEDSRFBV FKNGTLRINS VBVYDOTWYR 480 

CMSSTPAGSI EAQARVQVLE KLKPTPPPQP QQ01EFDKEA TVPC8ATGRE KPTIKWERAD 540 

GS8LPBHVTD NAGTLHFARV TRDDAGNYTC lASNGPQGQI RAHVQLTVAV FITPKVEPER 600 

TTVYQCSirAL LQCEAQGDPK FLIQWKGXDR ILDPTXLGPR MHZPQNOSIiV IHDVAPEDSG 660 

RYTCIAQTSC NIKHTEAPLY WDKPVPEES EGPGSPPPYK MIQTI6L8VG AAVAYIIAVL 720 

GLMFYCKKRC KAKRIiQKQPE GEBFEMEdiN GGPLQNGQPS AEIQEEVALT SliGSGPAATV 780 

KRHSTSDKMH PPRSSLQPIT TLGKSEFGEV FLAKAQGbEB GVABTItVLVK 8LQTKDEQQQ 840 

U>FRREZiEMP GKUIHAHWR liLQXiCRBAEP HYKVLEYVDL GDLXQFXiRIS KSKDEKUCSQ 900 

PIiSTRQKVAL CTQVALGMEH bSNNRPVHXD lAARNCSiVSA QROVRVSALG LSKDVYNSEY 960 

YBPROAHVPIi RHHSPBAILB 6DPSTKSDVH AF6VLKWBVF THGBMPBGGQ ADDBVLADLQ 1020 

AGKARLPQPB GCP8KLYRLM QRCHALSPRD RPSF8BXASA LGDSTVDSXP 1070 

. Seq ZD HO: C387 Protein Sequence 
Protein Accession #i NPj002300.1 

1 11 21 31 41 

I I 1 I I 

MKVLAA0VVP LLLVXj&WKHG AGSPLPITFV KATCAIRHPC HMNIMNQIRS 
ALPZLYYTAQ GBPPPMMLDK LOGFNVTDFP PraANOTEKA KLVBLYRIW 
ROQKILNPSA L8LHSXUIAT ADILROLLSN VLCRLCSKYB VGKVDVTYGP 
KKLGOQLIiGK YKQIIAVLAQ AF 

Seq ID KO: C388 Protein Sequence 
Protein Accession 4i XP_097S08 

I 11 21 31 41 51 

MBRPRLTLVC HVSIIZSARD LSKHHLTELQ PGLFSaLRFL EBLRLSGNHL SHIPGQAF80 60 

LYSLKILMLQ HNQLGOIPAB ALHELPSLQS LRUSANLISL VPER8FB6L8 SLRHLNIiDDN 120 

ALTBIPVRAL NNLPALQAMT LALNRISHIP DYAFQNLTSIi WLHLHMNRI QBU3THSFGG 180 

UmhBnj>Ut YNKLQBFPVA ZRTLGRLQBL GPHNNNIKAI PBKAPMGNPL LQTIHFYDNP 240 

1371 
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XQFVGRSAFQ YLPKLKTLSL NGAMDIQSFP DLKOTTSLEI LTLTRAGIRIi I<FSGMOQQLP 300 

RLRVLBbSBN QIBBLPSLHR CQKIiEEZGLQ HNRIWEIGAD TFSQLSSLQA LDLSHNAIRS 360 

ZHPBAFSTIiH SLVKXiDLTDN QLTTIiPLAGli GGLMRLKLKG NUUiSQAPSK DSFPKIiRIXjE 420 

VFYAYQCCPY QMCASFFKAS GQNEAEDLUL DDEBSSRRPL GXtLARQABlH YDQDLDELQt. 480 

5 EKEDSKFKPS VQCSFTPGPF KPCEYIiPBSH GIRLAVHAIV LLSVLORSLV LLTVFAlGGFV 540 

PLPPVKFWG AIAGANTLTG ISCGLLASVD ALTF6QFSBY GARWBTGLGC RATGFLAVLG 600 

SBASVLLLTL AAVOCSVSVS CVRAYGKSPS LGSVRAGVLG CLAXiAGLAAA LPLASVGSYG 660 

ASPLCLPYAP PEGQPAALGP TVALVMMHSF CFLWAGAYI KLYCDLPRGD FBAVWDCAMV 720 

^ . RRVAWIiXFAD GXiLYCPVAFL SFASNLGLFP VTPEAVKSVli LWLPLPACL KPUiYLLFNP 780 

10 HFSDDUIBLR PRAGDS6PLA YAAAGELEK8 SCDSTQALVA FSDVDLIIiEA SEAGRPPOLB 840 

TYGPFSVTLX SOQQFGAPRL EGSHCVEPEG NHF6NPQPSN DGBUiLRABO STPAGGGLSG 900 

GGOFQPSGLA FASHV 9X5 

^ . Seq XD NOs C389 Protein Sequence 
15 Protein Accession «i MP_S70901 

1 11 21 31 41 SI 

1 t I I I I 

HASLVSLBLO LI«LAVLVVTA TASPPAOLLS UtTSOOGALD QBALGGLLNT LADRVHCTN6 60 

20 P06KCLSVBD ALOLGBPBGS GLPPGPVLBA RYVARLSAAA VLYLSNPE6T CEDTRAGLWA 120 

SKADKLLALL ESPKALTPGL SWIiLQRKQAR AAGQTPKTAC VDIPQLLBBA VGAGAPGSAG 180 

GVLAALLDHV RS6SCPKALP SPQYFVDFVF QQHSSBVPMT LAELSALKQR LGVGREAHSD 240 

HSHRHRGA8S RDPVPLI8SS NSSSVWDTVC liSARDVMAAY GIiSSQAGVTP BAWAQLSPAL 300 

^- LQQQLSQACT 8QSRPFVQDQ LSQSBRYIjYG SLATUtXCLC AVFGLI«LLTC TGCRGVAHYI 360 

25 LQTFLSLAVG ALTGDAVUm TPKVLGUITB SEEGLSPQPT WtLLAMLAOL YAFFLFENLF 420 

NLLLPRDPED JjEDGPOGHSS HSHGOHSHOV SLQLAPSELR QPKPPHEX5SR ADLVAEESPE 480 

LLNPEPRRI^ PBLRLLPYMI TLQDAVHNFA DGIiAVGAAFA SSWKTGLATS IiAVFCHBLPH 540 

EL6DFAALLH AGLSVRQALL LKLASALTAF AGLYVALAVG VSEESEAnili AVATGIiFLYV 600 

ALCDMLPAMIi KVRDPRPNLL FUiHX3V6Z<LG 6WTVLLLI«Sl» YEDDXTF 648 



30 



Seq XD NO: C390 Protein Sequence 
Protein AccesB ion «t IIP_061644 



^^1 11 21 31 41 51 

35 I 1 1 I I ! 

MANASEPGGS GGGEAAAIiGL KLATLSLIiLC VSIiAOmjPA LLIVRERSLH RAPYYLLLDL 60 

CliADGLRAIiA CLPAVMLAAR RAAAAAGAPP GALGC3CHjAF LAAXiFCPHAA PLLLGVGVTR 120 

YLAXAHHRFY AERLAGWPCA AMLVCAAWAL ALAAAFPPVL DGGGDDEDAP CAIiEQRPDGA 180 

PGALGFLLLL AVWGATRLV YUlliLFFXBD RRKNRPARLV PAVSHDHTFK OPGATOQAAA 240 

40 NWTAOFGRGP TPPALVGXRP AGP^GARRL LVLEEFKTER RLCRMFYAVT LLPLLLWGPY 300 

WASYLSVLV RPGAVPQAYIi TA8VWLTFAQ AGXNPWCFIi FNRELRDCFR AQFPCCQSPR 360 

TTQATHPCDIi KOXGL 375 

Seq XD NOt C391 Protein Sequence 
45 Protein Accession fti NP_005622 

1 11 21 31 41 51 

I 1 I I I I 

KAAARPARGP BLPLLGLLLL LLLGDPGRGA ASSGHATOPQ PRSAGGSARR SAAVTGPPPP 60 

50 LSHOGRAAPC EPLRYNVCLG SVLPYGATST LLAGDSDSQE EAHGKLVLW8 GLRNAPRCHA 120 

VIQPIiliCAVY MPKCENDRVE LPSRTLCQAT RGPCAIVERE RGWPDPIiRCT PDRFPEGCIN 180 

EVQNIKFN8S GQCEVPLVRT ONPKSWYEDV EGOGIQOQNP LFTEAEHQDM HSYIAAFGAV 240 

TGLCTLFTLA TFVADWRNSN RYPAVXLFYV NACFFVGSIG WLAQFMDGAR RBIVCRADGT 300 

MRLGEPTSNB TLSCVIIFVI VYYALMAGW WFWLTYAWH TSPKALGTTY QPIiSGKTSYF 360 

55 HLLTWSLPFV LTVAILAVAQ VDGDSVSGXC FVGYXNYRYR AGFVLAPXGL VLIVGGYFLX 420 

RGVMTLFSIK SNHPGLLSEK AASKINBTML RLGIFGFLAP GPVLITFSCH FYDPFNQAEW 480 

BRSPRDYVLC QANVTIGLPT KQPIPDCBIK NRPSLLVEKI NLFAMFGTGI AMSTWVWTKA 540 

TLLIWRRTWC RLTGQSDDEP KRIKKSKMIA KAFSKRHBLL QNPGQELSPS MHTVSKDGPV 600 

AGIAFDLNEP SADVSSAWAQ HVTKMVARRG AILPQDXSVT PVATPVPPEE QAMIMLVEAE 660 

60 ISPELQKRLG RKKKRRKRKK EVCPLAPPPB LRPPAPAPST XPRIiPQLPRQ KCLVAAQAWO 720 

AGDSCaWAW TLVSNPFCPB PSPPQDPFLP SAPAPVAHAK QRRjQOLGPXB SRTNIMDTBL 780 

MDAOSDF 788 

Seq XD NO: C392 Protein Sequence 
65 Protein Accession #i BAC04382 

1 11 21 31 41 51 

I 1 I I I 1 

MGARSGARGA LLLALLLCMD PRLSQAGRKR SGBVLPDSFP SAPAEPLPYF LQBPQDAYIV 60 

70 KNKPVELRCR AFPATQXYFK CaiGENVSQND BVTQBGLDBA TLGARGGLRV REVQXBVSRQ 120 

QVEBLFGLED YNCQCVAWSS AGTTKSRRAY VRXAYLRKNF DQBPLGKEVP LDHEVLLQCR 180 

PPB6VPVAEV EWLKNEDVXD PTQDTNFLLT XDHNLXIRQA RLSDTANYTC VAKNXVAKRR 240 
STTATVIVYV NGGWSSWAEW SPCSNROGRG WQKRTRTCTN PAPLMGQAFC EGQAFQKTAC 300 
TTICPVDGAW TBWSKWSACS TECAHWRSRE CMAPPPQNGG RDCSGTLLDS KNCTDGIiCMQ 360 

75 NKKTLSDPNS HLLEASGDAA LYAQLWAIF VWAIU4AV0 VWYRRNCRD FDTDITDSSA 420 
ALTGQFHPVN FKTARPSNPQ LLHPSVPPDL TASAGIYRGP VYALQDSTDK IPMTNSPLLD 480 
PLPSLKVBVY SSSTTGSGPG LADGADLLGV LPPGTYPSDP ARDTHPIiHLR SASLGSQQLL 540 
QbPRDPGSSV SOTPGCLOGR LSIPGTOVSL LVWJGAIPQG KFYEMYLWN KABSTLPL8E 600 
GTQTVLSPSV TCGPTOLLLC RPVILTMPHC AEVSARDWIP QLKTQAHQGH WBEWTLDEE 660 

OU TUfTPCYCQIf EPRACHILIiD QLGTYVFTGE SYSRSAVKRL QLAVFAPALC TSLEYSLRVY 720 
CLEDTPVALK BVX«ELERTIX3 GYIiVEBPKPL MFKDSYHNLR LSLRDZiPHAH WRSKLLAKYQ 780 
BIPFYHIWSQ SQKALHCTFT IiBRHSLASTE LTCKICVRQV EGEGQXPQLH TTLABTPAGS 840 
LDTIiCSAPGS TVTTQLGPYA FKIPLSIRQK ICNSIiDAPNS RGNDWRMLAQ XLSMDRYUVY 900 
FATKASPT6V ILDLHEALQQ DD6DLNSIAS ALEBMQKSBH liVAVATDGDC 950 
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Seq ID NO: C393 Protein Sequence 
Protein Accession ffi NP_004616 

1 11 21 31 41 SI 

I i 1 I I I 

MNRKARRCLG HLFLSLOWY LRIGGPSSW ALGASIICNK IPGLAPRQRA ICQSRPDAII 60 

VIGEGSQMGL DECQPQFRMG RMNCSALGER TVPGKBLKVO SREAAPTYAI lAAGVAHAIT 130 

AACTQGNLSD G6CDXEKQGQ YHRDBGHKNO GCSADIRYGI GFAXVFVDAR BIKQNARTLM 180 

NLHNNBAORK ILEENMKLEC XCHOVSOSCT TKTCHTTXiPQ FRBLGYVLKD KniEAVKVBP 240 

VRASRNKRPT FXiKIKKPIiSy RKPKDTDLVY XBRSPMYCBB DFVTOSVGTQ GRACNXTAPQ 300 

ASGOSLMCOO RGYNTHQYAR VHQQfCKFHW CCYVROITCS ERTEHYTCK 349 

Seq ID NOt C394 Protein Sequence 
Protein Accession Vt NP_003777 

1 11 21 31 41 51 

I i I I I I 

MDALCX3SGEL GSKFWDSmiS VHTEHFDLTP CFQKSLLAHV PCIYUfVAIiP ClfUiYIiRHKC 60 

ROYIILSHLS KLKKVLGVLL WCVSWADLPY SPHGIiVHGRA PAPVPPVTPI* WOVTMLLAT 120 

LLIQYBRIiQG VQSSGVLIIF WFLCWCAXV PFRSKILLAK AEGEISDPFR FTTFYIHPAL 180 

VLSALILACF RBKPPPPSAK NVDPNPYPET SAGFIiSRLFP WWPTKMAIYG YRHPLBEKDli 240 

WSLKEEDRSQ MWQQLLBAH RKQEXQTARH KASAAPGKNA SQBDEVLLGA RPRPRKPSFL 300 

KALIiATFGSS FLISACPXLI QDUiSFINPQ LLSIIiIRFXS HPMAP8WW0F LVAGLMFLCS 360 

MMQSLILQHY YHYIFVTGVK FRTGXMOVIY RKALVITNSV KRASTVGBIV KIMSVDAQRP 420 

MDLAPFLNIili WSAPLQIILA IYFI.WQNI1GP SVLAGVAFMV IiIiIPLNGAVA VKKRAFQVKQ 480 

MKLKDSRIKL MSBILNGIKV LKLYAWBPSP LKQVEGIRQG ELQLIiRTAAY LHTTTTFTWM S40 

CSPFLVTLIT LWVYVYVDPN NVLDAEKAFV SVSLFNIIiRL PLNMLPQLIS KLTQASVSLK 600 

RIQQFLSQBE LDPQSVBRXT ISPGYAITIH SOTFTNAQDL PPTLKSLDZQ VPKGAIiVAW 660 

GPVGCGKSSL VSALLOEMEK LEGKVHMK6S VAYVPQQAWI QNCTLQENVIj FGKALNPKRY 720 

QQTLEACALL ADLEMLPGGD QTEIGBKGIN LSGGQRQRVS LARAVYSDAD IPLLDDPLSA 780 

VDSHVAKHIP DHVIGPEGVI* AOKTRVLVTH QISFliPQTDP IIVLADGQVS EMGPYPALLQ 840 

RNOSFANFLC NYAPDEDQ6H LEDSWTALBG AEDXEALLIE DTLSHHTDLT DNDPVTYWQ 900 

KQFHRQL8AL SSDGEGQGRP VPRRHLOPSE KVQVTEAKAD GAIiTQBEKAA IQTVELSVPW 960 

DYAKAVGLCT TUVICIiLYVa QSAAAIGAKV WLSAWTKDAM ADSRQNNTSL RU3VYAALGI 1020 

LQGFLVMLAA MAMAA6GIQA AHVUIQALLH NKIRSPQSFF DTTPSGRILN CFSKDIYWD 1080 

EVLAPVIIiMIi LNSPFNAIST LWIHASTPI* PTWILPLAV LYTIjVQRFYA ATSRQLKRliE 1140 

SVSRSPIYSH F8BTVTGASV IRAYNR8RDF BIISDTKVDA MQR8CYPYII 81IRWI*SIGVE 1200 

FVGNCWLFA AIiFAVXGRSS LNPGLVGLSV 8YSLQVTFAL NHMZRMMSDL ESNI VAVg tV 1360 

KEYSKTETEA FWWEGSRPP EGWPPRGBVE FRNYSVRYRP GLDLVIJIDLS LBVHGGEKVG 1320 

IVGRTGAGKS SMTLCLFRIL EAAKGEIRID GLNVADIGIjH DLRSQLTIIP QDPILFSGTL 1380 

RMNLDPFGSY SEEDIWWALB LSHLHTFVSS QPAGLDFQCS EGGENLSVGQ RQLVCLARAL 1440 

LRKSRILVliD BATAAIDLBT DNLIQATIRT QFDTCTVLTI AHRLNTIMDY TRVIiVLDKGV 1500 

VABFDSPANL IAARGIFYGM ARDAOLA 1S27 

Seq ZD KOi C395 Protein Sequence 
Protein Accession ftt 1IF_004617 

1 II 21 31 41 SI 

I I I 1 I I 

MRARPQVCEA LLFALALQTG VCYGIKWLAL SKTPSALALN QTQHCKQLEG LVSAQVQLCR 60 

SNLBLMHTW HAAREVMKAC RRAFADMRWN CSSIBIAPNY LLDLERGTRE SAFVYALSAA 120 

AISHAIARAC TSGDLPGCSC GPVPQEPPGP GNRHGGCADN LSYGLU46AK FSDAPMKVKK 180 

TGSQAKKLMR LHNSEV6RQA LRASLBMKCK GHGV8G8CSI RTCHKGLQSIi QDVAADLXTR 240 

YLSATKWHR PMGTRKKIiVP KPLDIRFViO} SELVYIKISSP DFOfKNBRVG SH6TQDRQCN 300 

KTSNGSDSCD LMCCGRGYNP YTDRWBRCH CKYHWCCYVT CRRCBRTVER YVCK 3S4 

Seq ID NO: C396 Protein Sequence 
Protein Accession i: NP_114072 

1 11 21 31 41 51 

1)111) 

MEW6YLLEVT SliLAALALLQ R8SGAAAASA XELAOQBITV PLCKGIQYNY TYMPNQFKHD 60 

TQDEAGLSVH QFWPLVEIQC SPDLKFFLCS MYTPZdiBDY KKPIiPPCRSV CBRAKAGCAP 120 

Ii4RQYGFAHP DRMRCDRLPE QGNPDTLCMD YNRTDLTTAA FSPFRRLPPP PPGEQPPSGS 180 

. GHGRPPGARP PHRGGGRGGG GCTAAAPPAR GGGGGGKARP PGGGAAPCEP 6CQCRAPMVS 240 

VSSERHPLYN RVKTGQIANC ALPCHNPFFS QDERAFTVFW IQLWSVLCFV STFATVSTFL 300 

IDMERFKYPE RPIIFLSACY LFVSV6YLVR LVAGHEKVAC S6GAPGAGGA 6GAGOAAAGA 360 

6AA0AGAGGP GGRGBYBELG AVBQHVRYBT TGPALCTWP LLVYFFGKAS SIHHVII^T 420 

WFLAAGMKWG NEAIAGYSQY FHLAAWLVPS VKSIAVIiALS SVDGDPVAGl CYVQIQSIiDM 480 

LRGFVIiAPLV lYIiFIQTMFL LAGFVSLFRl RSVIKQQDGP TKTHKLEKLM IRLGbFTVLY 540 

TVPAAWVAC LPYEQHNRPR WEATHNCPCL RDLQPDQARR PDYAVFMLKY FMCLWGITS 600 

QVWVWSGKTL ESWRSLCTRC CWASKGAAVG GGAGATAAGO GGGPGGGGGG OPGGGGGPGO 660 

GGGSLYSDVS TGLTHRSGTA SSVSYPKQMP LSQV 694 

Seq ID KG: C397 Protein Sequence 
Protein Accession #t XP_OS0625 

1 IX 21 31 41 51 

I ) i I I I 

MLQGPQSLLL LFLASHCCLG SARGLFLPGQ PDPSYKRSNC KPIPANI<QLC HGIBYQNMRL 60 

PNLLGHHTMK EVLEQAQAWI PLVMKQCHPD TKKFLCSLPA PVCLDDLDET IQPCHSLCVQ 120 
VKDRCAPVMS AP6FPWPDML EODRPPQnND XiCIPIiASSDH LLPATBEAPK VCEAGKNKND 160 
DDNDIKBTIiC iOtDFALKIKV KBZTYZNRDT KZILBTRSKT IYKLNGV8ER DLKKSVLHUC 240 
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PSLQCTCEBM NDINAPYLVM GQKQGGEIiVI TSVKRWQKGQ RBFXRISRSI RKLQC 295 

Seq ID KOs C398 Protein Sequence 
Protein Acceselon iiP.001297.1 

1 11 21 31 41 51 

I I 1 I I I 

KSKGLBITGT ALAVLGWLGT ZVCCALPMHR VSAFIGSHII TSQNZNEGLN MNCWQSTGQ 60 

MQCKVyDSLL ALPQDLQAAR ALIWAILUV AFGLLVALVG AQCTNCVQDD TAKAKITIVA 120 

GVIiPIiLAALL TLVPVSW8AN TIIRDFYNFV VPEAQKRQ4G ASLYVGHAAA ALQLLGGAUi 180 
CC8CPPREKK YTATKWYSA PRSTOPQASIi GTOYDRKDYV 220 

Seq ID NO I C399 Protein Sequence 
Protein Accession «i NP_036581.1 

1 11 21 31 41 SI 

I I I I I I 

MBSRKDITMQ EBLWKMKPRR NLSSDDYLRK DTGETSMLKR PVLLHLHQTA HADSFDCPSE 60 

LQHTQELFPQ WHLPIKIAAI lASLTFLYTL LREVIHPIAT SHQQYFYICIP ILVINKVLPM 120 

VSITLLALVY LPGVIAAIVQ liHNGTKYlCKF PHWLDKWMLT RKQFGUiSFF FAVLHAIYSL 180 

SYPMRRSYRY KLLNUAYQQV QQNKEDAWIE HDVWRMEIYV SIAIVOLAIL ALLAVTSIP8 240 

VSDSLTWRBP HYIQSKLGIV SLLLGTIHAL IFAWNKWIDI KQFVHYTPPT FMIAVFLPIV 300 

VLIFKSILFL PCLRKKILKI RHGWEDVTKI NKTEICSQL 339 



25 Seq ID NO: C400 Protein Sequence 
Protein Accession «: NP_001766.1 



1 11 21 31 41 51 

i I I I I I 

MANCEFSPVS GDKPCCRIiSR RAQLCLGVSI LVLILVWLA VWPRWRQTW SQPQTTKRFP 60 

BTVLARCVKY TEIHPEMRHV DCQSVWDAFK QAFISKHPCN ITEEDYQPLM KLGTQTVPCN 120 

KILLWSRIKD IiAHQFTQVQR DMFTLEDTliL GYLADDLTWC GBENTSKINY QSCPDWRKDC 180 

SNNPVSVFWK TVSRRFAEAA a>WHVMI2«3 SRSKIPDKNS TFGSVBVHNL QPBKVQTLEA 240 

WVIHGGREDS RDLCQDPTIK ELESII8XRN IQFSCKNIYR FOKFLQCVKN PEDSSCT8BI 300 

Seq ZD NOt C401 Protein Sequence 
Protein Accession fti XP_120S13.2 



^- 1 11 21 31 41 51 

40 I I I III 

MVSCTPSGPL RETNENVKKP YALRAFMFRM SSBAAMLGBS RTPKPRKHRA TTRAKIFKRF 60 

FSEGSBSNSR LVEELAVIHT YSDDPAPTTS PSSVQPREFO VMQQAPRARF GSRTPPAAAE 120 

ASSPHLGIGE AACQSGARAA APRA0ARRCX2 PQRQAAAAAA TAQTHTLPHA RTRADPAGRR 180 

RRHPRSPAPO 6BGTCSEGPA PRRRMEESfQ PAEEGP8VPK lYXQRSPYSV LKTFPSKRPA 240 

LAKRYERPTL VEZiPKGHLRT PAQPPPASPA ASSSSSFAAV VRLGAPPRPP RRGFRAR6TZ 300 

PPUjFAPGVA GTLIjPPPTSS 8PP8PRPRPW HAAAPRGGTS HTHMHR8QST tiPGSDTMVSV 360 

FGLMAQRRWQ HRSLKQFEWG II/3SWGTHPC GQDWLBKBGQ VAVUjPRSEG NTAPKKSRMI 420 

LDAPAQQCSR VLSLLNCGGK LLDSNHSQSM XSCVXQEGSS YNERQEHCHI GKGVUSQTSD 480 

NVDIEMQYMQ RKQQTSAFliR VFTDSLQNYZi IfSOSFPTPNP 8SA8BYGHLA DVDPLSTSPV 540 

HTLENISIiDS TASLCKSRHL SREPFVK8DF FNPLQQAXA6 GASRPFSGAQ QSZAYRVNSB 600 

LEDGIRSFVP LSCEALEMDL TSLGSKQLUI NYPVYITSKQ NDEAVNSSXK DGRRLLRYLI 660 

RFVFTIDEliK YSCGLGKRKR SVQSaETQPB RRPLDPVKVT CEiRQTASFRS VSPSVZSFHR 720 

IGCGSPRTSV QPSVF 735 



Seq ID NOt C402 Protein Sequence 
Protein Accession di BAA92562.1 

1 11 21 31 41 51 

) I I I I I 

MBTTVLSGIN FEYRGMTGHE VAGDHXYTAA GASDNDFMIL TLWPGFRPP QSVHADTENK 60 

EVARITFVPE TLCSVNCELY FMVGVNSRTN TFVBTWKGSK GKQSYTYIIB ENTTTSFTWA 120 

PQRTTFHBAS RKYTNDVAKI YSINVTNVMN GVASYCRPCA LBA8DVGSSC TSCPAGYYID 180 

RDSGTCHSCP PNTILKAHQP YGVQACVPOG PGTKNNKIHS LCYNDCTF8R NTPTRTFNYN 240 

P8ALANTVTL AG0P8PT8KG LKYFHHFTLS LOCaiQGRKKS VCTDNVTDLR IPB0B86F8K 300 

SITAYVCQAV IIPPBVTGYK AOVSSQPVSIi ADHLZ6VTTD NTLDGITSPA EIiFHLBSLGI 360 

PDVIFFYRSN DVTQSCSSGR STTIRVRCSP QKTVPGSLLL PGTCSDGTO) GOIFHFLHBS 420 

AAACPLCSVA DYHAIVSSCV AGIQKTTYVW RBPKLCSGGI SLPBQRVTIC KTIDFWLKVO 480 

XSAOTCTAIIi LTVLTCYFNK KNQKLEYKYS KLVMNATLKD CDLPAADSCA IMEGBDVEDD 540 

LIFTSXK8LF GKXKSFTSKQ PAPVTI6L8B DS 572 

Seq ID MO I C403 Protein Sequence 
Protein Accession «: NP 055139.1 



1 11 21 31 41 51 

75 ! I I III 

MALQGISWB LSGIAPGRXC AMVLADFQAR WRVDRPGSR YDVSRLGRGK RSLVLDLKQP 60 

R5PRAAASVQ AVGCAAGALP PROIGBTPAG PRDSAAGKSX AYLCQAEWIH PVQESFCRLA 120 

GHDINYX4ALS GVLSKXGRSG ENPYAPLNLV ADFAGGGIiMC ALGIZMALFD RTRTDKGQVI 180 

DANMVBGTAY LSSPLHKTQK S8LWEAPRGQ NKLOGGAPFY TTYRTAD6BF MAVOAIBPQF 240 

OU yEXil.IKGL6L RSDBLPNQMS TDDHPEMXKK FADVFAKRTK ABWOQIFDGT DACVTFVLTF 300 

EBWRHDHNK BR0SFIT8BB QDVSPRLAPL LU3TPAIPSS XODPPIGBBT EBXIjEEFGFS 360 

RBBIYQU9SD KIIBSNKVKA SI. 382 

Seq ZD NOt C404 Protein Sequence 
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Protein Accession «t XP_091332.1 

1 11 21 31 41 SI 

1 i 1 I I I . 

MQRHTLHAAA PLTLHSAQAP PQTDISISPA LPBLPLPSLC PLFWMEFKGH CYRPPPLNKT 60 

WAEADI.YCSB PSVGRKSAKL ASIHSWBENV FVYDIiVNSCV POXPADWTO LHDHRQEGQP 120 

EWTDGSSYDY SYWDGSQPDO 6VKADPBE33D GVQIWYRPTS EQLQAPEPQL PI.8XSEATDV 180 

YLPEDFPAEP KLMDQSWVSR KSIiKPSKSHL MBPPTPVAKR QKAKTRHRSL RGVHWPS6KA 240 

OSWKBRMNAD YORRKRSAPR QEQRLRCRER RLRAASGQQR PEGQRXQRQQ ERQERC3WEEL 300 

G6VSPMRGAQ AWQHGIiGAGS QRGAAPEOGE NHQAPELGST WRGQRLQPQT AALCHPALRK 360 

LPGNAHGIiAA AFVQPALQVQ EEKNNRTRFS GAYFTMSDPT CDQDSKEQSL RRHGRBAEXD 420 

GPYRIiVKKKR GPVACPS3FE LQSGGEVCLD FPVBLRAOTW lAREPP 466 



^ ^ Seq ZD NO I C405 Protein Sequence 
1j Protein Accession V: XP 054B69.2 



20 
25 
30 
35 



flj 1 11 21 31 ■ 41 SI 

-I I I I I I 

MHTCCPPVTL BQDLHRKMHS WMLQTLAPAV TSLVIjSCAST IDYYQEZCDN ACPCEBKDOX 60 

IiTVSCEimGI ISLSBISPPR FPIYKLLLSG KhimhYPUE FVKyTCASXL HLGSNVXQDX 120 

ETGAFK6LRG LRRUUiNNNX LELIiRDDTFL OLEKLEYLQV DYNYXSVIBP NAFGKLHLLQ 180 

VIiXUJDNLLS SLPNNLFRFV PLTHLDLRGN RLKLLPYVGI* LQHMDKWBL QLEENPWNCS 240 

CBLISLKDWL DSISYSALVG DWCBTPFRIi HGKDIiDSVSK QEZjCPRRLIS DYBMRPQTPL 300 

STTGYLHTTP ASVNSVATSS 8AVYKPPLKP PKGTRQFNKP RVRPTSRQPS KDLGYSMYGP 360 

SIAYQTKSPV PLECPTACSC NLQXSDLGLIT VNOQERKIES XAELQPKPyN PKKMYLTENY 420 

lAWRRTDFL BATGLDIiIiHL (anmiSMIQD RAFGCLTNLR RLYLNCSIRIE RLSPELFYGL 480 

QSLQYLFLQY NLIREXQSGT FDPVPHLQLL FLNNNLLQAM PSGVFSGLTL LRLNLRSllHF 540 

TSLPVSGVIiD QLKSLXQIDL HDtlPWDCTCD IVGMKLWVEQ LKVGVLVDEV XCKAPKKFAE 600 

TDMR8XKSEL LCPDYSDVW 8TPTP8SXQV PARTSAVTPA VRIilSTGAPA SLGAGGGASS 660 

VPLSVItlLSL LLVFXMSVFV AAGLFVLVMK RRKXNQSDHT SINNSDVSSF NMOYSVYGGO 720 

GGTGGHPHAH VHHRGPALPK VKTPAGHVYE YXPHPLGBMC KNPIYRSRE6 KSVEDYKDLH 780 

EliKVTYSSNH HLQQQQQPPP PPQQPQQQPP PQLQIjQPGEE ERRESHHLRS PAYSVSTIEP 840 

REDLLSPVQD ADRFYRGILE FDKHCSTTPA GNSLPBYPKF PCSPAAYTFS PNYDLRRPHQ 900 

YLHPGAGDSR LREPVLYSPP SAVFVEFNRN EYLELKAKLN VEPDYLEVLE KQTTFSQF 958 

8eq ZD NOx C406 Protein Sequence 
Protein Accession fti KP 000784.2 



j,^ 1 11 21 31 41 51 

40 I 1 I I I I 

MGILSVDIiLX TXiQXLPVFFS NCLFLALYDS VXLLKHWLL LSRSK8TRGE WRRMLTSEGL 60 

RCVWKSFLLD AYKQVKLGED APNSSWHVS STEGGDNSGN GTQEKIAEGA TCHLLDFASP 120 

ERPLWNFQS ATUPPFTSQL PAFRKLVBEP SSVADFLLVY XDEAHPSDGW AXPGDSSIiSP 180 

EVKKHQNQBD RCAAAQQLLB RFSI.PPQCRV VADRMDNNAM ZAYGVAPBRV CZVORQKZAY 240 

LGGKOPFSYN LQEVRHWLEK NFSKRUKXTR IMS 273 



45 
50 
55 
60 
65 
70 



Seq ID NO: C407 Protein Sequence 
Protein Accession #: NP_006540.2 

1 11 21 31 41 51 

I I I i I I 

MSSCVSSQPS SNRAAPQDBL GGRGSSSSES QKPCEALRGL SSIiSlHLGME SFIWTBCEP 60 

GCAVDLGLAR DRPLBADGQE VPLDSSGSQA RPHLSGRKLS LQERSQGGLA A66SLDMNGR 120 

CICPSIiPYSP V8SPQSSPRI* PRRPTVESHU VSXT6MQDCV QLHQYTLRDB ZOKOSYGWK 160 

LAYNENDNTY YAMXyLSKXK LXRQAGFPRR PPPRGTRPAP G6CXQPR6PI EQVYQEIAXL <240 

XKLDHPNWK LVEVLDDPNE DHLYMVFELV NQGPVKBVPT LKPL8EDQAR FYFQDLXKGX 300 

BYLEYQKXXB RDIKPSNIiLV GEDGHIKIAD FGVSNEFK6S DALLSNTVGT PAFMAPESLS 360 

ETRKIFSGKA LDVWAKGVTL YCFVFGQCPF MDERXMCLfiS KXK8QAI1EFP DQPDXAEDLK 420 

DliITRMLDXN PBSRXWPBI KLBPWVTRHG ABPLPSEDEN CTLVBVTBBB VEHSVKHIP8 480 

LATVXLVKTK XRKR6F6IIPF E6SRRBERSL SAPOIIiLTKK PTRECB8LSB LKEARQRRQP 540 

FGKRFAFR6G GGSAXiVRGSP CVESCWAPAP GSPARMRPLR PEBAMEPB 588 

Seq ID NO: C408 Protein Sequence 
Protein Accession #1 Np 061116.2 



1 11 21 31 41 51 

I I I I I I 

KGLSLPKEKQ LILCLHSKFC RWFQRRBSWA QSRDEQNLLQ QKRXVrESPXiL LAAIOTNDVQA 60 

LNKLLKYEDC KVHQRGAMGE TALHIAALYD NLEAAMVLME AAPELVFEPH TSELYEGQTA 120 

XfiZAWNQHM HLVRALLARR ASV6ARATGT AFRR8PC37LI YFGEHPI.8FA ACVKSEBXVR 180 

LLIEHGADXR AQDSLGNTVL HILILQPNKT FAOQMYNLLL SYDRHGDHLQ PLDLVPNHQG 240 

LTPFKLAGVE GNTVMPQHLM QKRKHTQWTY GPLTSTLYDL TEIDSSGDBQ SLLBLIITTK 300 

KREARQIIiDQ TPVKELVSLK WKRYGRPYFC MLGAIYLLYI ICFTMCCIYR PLKPRTWNRT 360 

SPRDNTLLQQ KI.LQBAYMTP KDDXRLVOBIi VTVXGAIXIIi LVBVPDXFRH GVTRFF6QTI 420 

75 USGBVBVhZX TYAFMVLVTM VMRLXSASQE WPMSFAIiVL GWCaJVMYFAR GFQMIiGPFTI 480 

MIQKMIFGDL MRFCWLMAW ILQFASAFYI IFQTEDPEEL GHPYDYPMAL PSTPELPLTI 540 

IDGPANYNVD LPFMYSITYA AFAIIATLWI LNLLIAMMGD THWRVAHEHD ELWRAQIVAT 600 

TVMLERKLPR CIiWPRSGICXS REYGIjODRNF LRVBDRQDUI RQRIQRYAQA FHTRGSEDLO 660 

KDSVBKLBLG CPPSPHLSLP MPSVSR8T8R SSANHBRLRQ GTLRRDLRGZ ZMRGLEDGES 720 

OU WEYQX 725 

Seq ID NO; C409 Protein Sequence 
Protein Accession ft: NP 068710.1 
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1 11 21 31 41 51 

1 I I I I I 

MQKVTLGUiV PLAQPPVLDA NDLEDXNSPP VYDHHSLOVa GLlCMSVhCK HQXXIVM8BN 60 
^ RSSGE0A6RG WGSPPLTTQL SPTGAKCKCK P6QKSGHHPG BTPPLITF6S AQ8 113 

Seq ID NO: C410 Protein Sequence 
Protein Accession 8s NP_005962.1 

^^1 11 21 31 41 51 

10 I I I i 1 1 

MQKVTIiGLLV FLAGPPVLDA NDLBDIOISPF YYDHKSLQVO GLICAOVLCA KGIZIVMSAK 60 
CKCRFOQKSO KHPGBTPPLI TP6SAQS 87 



15 



70 



Seq ID NO: C411 Protein Sequence 
Protein Accession «: KP 004952.1 



1 11 21 31 41 SI 

I I I I I I 

MLSKVIjPVLL GIUilliQSRV BGPQTBSKNE ASSRDWYGP QPQPLENQU, SEETKSTBTE 60 

20 TGSRVGKLPE ASRILNTILS NYDHKLRPGI GEKPTWTVE lAVNSIiGPLS ILDMBYTIDI 120 

IPSQTWYDER LCYNDTPESL VLNGNWSQL WIPDTFFRNS KRTHEHEITM PNQMVRIYKD IBO 

GKVLYTIRMT IDAGCSLHML RFPMDSHSCP LSFSSFSYPE NQflYKWGNF KLEINEKNSH 240 

KLPQFDPTGV SMKTEIITTP VGDFMVMTIP FNVSRRFOYV AFQNYVPSSV TTMLSWVSFW 300 

_ - IKTESAPART SLGITSVLTM TTLGTPSRKN PPRVSYITAIi DFYIAICPVP CFCALIiEFAV 360 

25 UiPLIYNOTK AHASPKLRHP RINSRAHART RARSRACARQ HQBAFVCQIV TTEGSDGEER 420 

PSC8AQQPPS FGSPE6PRSL CSRLACCEWC KRFKICyPOtV PDCB6STWQQ GRLCIHVYRI* 480 

OMYSRWFFV TPFFPHVLYW LVOiNL 506 

ID NO* C412 Protein Sequence 
30 Protein Accession It NP_068eX9.1 

1 11 21 31 41 51 

I I I I I I 

MEYTIDIIFS QTWYDERLCy NDTFBSLVLN GNWSQIjWIP DTPPRNSKRT HKHEITMPNQ 60 

35 HVRIYKDGKV LYTIRMTIOA GCSLHMLRFP MDSHSCPLSP SSPSYPEMEM lYKWBNPKLE 120 

INEKNSWKLP QLDFTGVSNK TEIITTPVGD FMVMTXFPNV SRRPGYVAFQ NYVPSSVTTM 180 

LSWVSPWIKT ESAPARTSLQ ITSVLTMTTL GTPSRKNFPR VSYITALDPY lAICFVFCFC 240 

ALLEFAVLNF LIYNQTKAHA SPKLRHPRIH 8RAHARTRAR SRACARQHQB AFVOQIVTTE 300 

GSDGBBRPSC SAQQPPSPGS PEGFRSLCSK LACCEHCXRF KKYPCNVPDC EGSTWQQARL 360 

40 CIHVYRIiDNY SRWFFVTFF FFHVLYHLVC LNL 393 

Seq ID NO: C413 Protein Sequence 
Protein Accession #i NP_068B22.1 

45 1 11 21 31 41 51 

I I I I I ) 

MEYTIDIIFS QTWNSKRTHE HEITMPNQMV RIYKDGKVLY TIRMTIDAGC SLHMLRFPMD 60 

SHSCPLSPSS FSYPENEMIY KWENFKLBIN BKNSHKLFQP DPTGV8NKTE IITTPVGDFM 120 

VKTIPPNVSR RFOYVAFONY VPSSVTTMLS WVSFWIKTBS APARTSMJIT SVLTMTTLOT 160 

50 F8RKNFPRV8 YITAIiDFyXA ICFVFCPCAL LEFAVUNFXiX YNQTKAHASP KLRHPRINSR 240 

AKARTRARSR ACARQBQBAF VOQIVTTEGS DGEERPSCSA QQPPSPGSPB GNPRSLCSKXA 300 

CCBNCKRFKK YFCMVPDCEG STWQQQRIiCI HVYRLDNYSR WFPVTFFFF NVLYWLVCLN 360 

I, 361 

55 Seq ID NO: C414 Protein Sequence 
Protein Accession it NP_068830.1 

1 11 21 31 41 51 

I I I ' ' ' 

OU MBYTIDXIFS QTWYDERLCY NDTFBSLVUI GNWSQLWIP DTFFRNSXRT HBHBITMFNQ 60 

MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSP SSPSYPENEM lYKWENFKLE 120 

INBKNSWKLF QFDFTGVSNK TEIITTPVGD FMVMTIFPNV SRRPGYVAFQ NYVPSSVTTM 180 

LSWVSPWIKT ESAPARTSLG ITSVLTMTTL GTPSRKNFPR VSYITALDPY lAICPVPCPC 240 

ALLEFAVLNF LIYNOTXAHA 6PKLRHPRDI 8RAHARTRAR SRACARQHQB AFVCQIVTTE 300 

65 6SDGEERPSC SAQQPPSPGS PBQPRSLCSK IiACCEWCKRF XKYFCMVFDC EGSTMQQGRL 360 

CIHVYRLDNY SRWFPVTPP PPNVLYWLVC LKL 393 



Seq ID NO: C415 Protein Sequence 
Protein Accession it NP 068591.1 



1 11 21 31 41 51 

I I I I i I 

KPAVSGPGPL FCLLLLLLDP HSPBTGCPPL RRFEYKLSFK GPRLALPGAG IPFWSHKGDA 60 

IL6LEEVSLT PSMRNRSQAV HSRASVPFSA NEVEVQMRVT 6LQRRGAHGM AVWYTRGRGH 120 

75 VGSVLGOLAS WDOIGXFFDS PABDTQDSPA XRVLASDOHI PS8QPQDGAS QGLGSCHNDF 180 

RNRPHPFRAR ITYWGQRLRM SLNSQLTPSD PGEPCVDVGP LLLVPGQPPG V8AATGTLAD 240 

DHDVLSFLTF 8LSEPSPEVP PQPFLEMQQL RLARQLEGLW ARLGLGTRED VTPKSDSEAQ 300 

6EGBRLFDLB BTX/3RBRRIL QALRGLSKQL AQABRQWKKQ LGPPGQARPD GGWALDASCQ 360 

IPSTPGRGGH LSNSUQCDSA XVGALLHGQW TLLQALQQ4R DAAVRMAAEA QVSYLFVGIE 420 

80 HHFLBLDHXL 0LLQEELR6P AKAAAXAPRP PGQPPRA88C LQPOIFLFYL LIQTVGFF6Y 480 

VHFRQEUIKS LQECLSTGSL PLGPAPHTPR ALGZLRRQPL PASMPA 526 



Seq ID NO: C416 Protein Sequence 
Protein Accession ft: XP 117036.1 
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1 11 31 31 41 SI 

I I I I I I 

MERRTRGALG SRRPPPPLPA LRHLCTGLQA A(9WHF0TLH SBTOOraAXA AEOPMGLFRP 60 

URCFSEAGQA PVQPSPETQG VABVCSRAKV SVDERSPGGO AYAHHVTPRH KGCBSRSaRT 120 

VRGSVSHKRP EQAAPETGRO PAVASGSQDO NBCOHO ISt 

Seq ID NOi C417 Protein Sequence 
Protein Accession 8i XP_167a03.3 

1 11 31 31 41 51 

I I I I I I 

NFGKGQRRTA TNKFOGLFGA P6VGI6GBCL YVLECKCFIK MKTKTHHHKK KNFAAKRNEE 60 
KLKKKKKQER KNHTKFFHHT YPLSQQDFLF AKSYFOSKGP CPLWQGLF 108 

Seq ID NO: C418 Protein Sequence 
Protein Accession S: NP 079056.1 



^_ 1 11 21 31 41 51 

20 I I I I I 1 

MFRLVBRYEM FRHEVYVLLI RNIFLKISIZ aiI«CYYWLtlT VALSOBECWB TLIGQDIYRIi 60 

LLMDFVPSLV NSFL6BFLRR ZZ6MQLZTSL OLQEFDIARN VLBLIYAQTL VHIGIFFCPIi 120 

LPPIQMIMLP IMFYSKMXSL MMMFQPPSKA WRASQMMTFP IPliLPPPSPT OVLCTLAITI 160 

WRIiKPSADCG PPRGbPLPIH SIYSWIDTLS TRPOYLWWW lYRNLIGSVH FPFILTLIVL 240 

2 J IITYLYWQIT EGRKIMIRU/ HSQIXNEGIQ} KMFLZBKLIK UQDMEKKANP SSLVIiERREV 300 

EQQQPLHLGB HD6SLDLRSR RSVQ&GNPRA 330 



Seq ID NOt C419 Protein Sequence 
Protein Accession ti: Eos sequence 

1 11 21 31 41 51 

I 1 I I ) i 

KLSDDUVNEZ IIQVEI1VS86 VQSHPSSNQI PQEKVLLDSS UMVLSISDI DVXDSQTVSK 60 

RNDQRGNQVL RFSTSLNESM SQTLHSLECM GIDTPGSSHE TVQGQKLIAS LIPMTSRDRI 120 

KAIRNQPRTM EEKRNLRKIV DKEKSKQTHR ILQLNCCIQC UISISRAYRR SKNSLSEILN 180 

SISLWQKTLK IIGGKFGTSV LSYFMFLRWL LKPNIPSFII* NFSPIIIPQP TVAKKNTLQF 240 

TGLBPFTGVG YPRSTVimG FYTNSTIWO KSGASYNMQb AYIFTZGACXi TTCFPSLLPS 300 

MAKYFRNNFI NPHXYSGGIT KLZFCNDFTV THSKAVKLKQ KNLSTEZREH LSELRQENSK 360 

LTFKQLLTRF SAYMVAWW8 T6VAZACCAA VYYIiAEYNLB PLKTHSNPGA VLLLPFWSC 420 

ZNLAVPCIYS MFRLVBRYEM PRHEVYVLLZ RNZFLKI9ZX GIIiCYYWLNT VALSGBBCWE 460 

TLZGQOZYRL IjLKDFVFSLV MSFIiGEFLRR ZZGMQLITSL GLQEPDZARH VLELZYAQTL 540 

VHIGIFFCPL LPPZQMZHLP IMFYSKNZSL MMNFQPPSKA HRA8QMHTPF ZFZiLFFPSFT 600 

GVLCTLAITX HRLKPSADG8 PFRQLPLFZH SZYSHZDTLS TRPGYLWWH lYRMLIGSVK 660 

FFFZLTLIVL ZZTYLYHQIT EGRKZMZRZiL HBQZZNBQKD KMFbZBKIiZK LQDMEKKANP 730 
SSLVLBRRBV BQQGFI^BLGB HDOSXiDIiRSR RSVQBGNPRA 760 

Seq ZD NO: C420 Protein Sequence 
Protein Accession #: NP 002241.1 



1 11 21 31 41 51 

I 1 I I I I 

MGQDLVLGLQ ALRRRKRLLE QBKStiAGWAL VLAOTOIGLM VLHAEMLWFO GCSWALYLFL 60 

VKCTXSISTF LLIiCLIVAFH AKBVQLPMTD NGLRDWRVAL TGRQAAQIVIi ELWCGLHPA 120 

PVR6FPCVQD LGAPLTSPQP WPGPLGQGBA UjSUVMLLRL YLVPRAVXiLR SGVLLNASYR 180 

55 SZGALNQVRP RHWFVAKLYM NTHPGRLLLG LTLGLWLTTA WVLSVABRQA VNATGHLSDT 240 

LWLIPZTFLT IGYGDWPGT MWGKZVCLCT OVMCVCXTTAL LVAWARKLE PNKAEKHVHN 300 

PMMDIQYTKE MKESAARVLQ EAWMFYKHTR RKESHAARRH QRKLWIAINA FRQVRLKHRK 360 

LREQVNSMVD ISKMHMILYU LQQNLSSSKR ALBXQZDTLA GKLDALTELL STAL6PRQLP 420 

EPSQQSK 427 



Seq ID NO* C421 Protein Sequence 
Protein Accession tft HP 079533.1 



1 11 21 31 41 51 

65 1 I I I I 1 

. MGGKQRDEDD EAYGKPVKYD PSFROPZKNR SCTDVZCCVL FLLPILOYIV VQIVAWLYGD 60 

PRQVLYPRNS TGAYCGMGEN KDKPYLLYPN ZFSCILSSNI ISVAENQLQC PTPQVCVSSC 120 

PEDPWrVGKN EFSQTVGBVF YTKNRNFCLP GVPHNKTVZT SLQQEbCPSF LLPSAPALGR 180 

CPPtmiZTPP AIiFGZTNDTT ZQQGZSGLID SLNARDISVK IFEDFAQSWY WILVALGVAL 240 

70 VLSLLFIIiLL SLVAOPLVLV LIL6VLGVIA YOZYYGWBEY RVLRDKSASI SQLGFTTNLS 300 

AYQSVQBTm. AALIVLAVLB AILLLVLZPL RQRZRXAZAL LKEA8KAVGQ MMSTMFYFLV 360 

TFVLLIilCIA YWAMTALYPL PTQPATLQYV LWASNISSPG CEKVPINTSC NPTAHLVNSS 420 

CPGLMCVFQ6 Y8SKGLZQRS VPNLQZYGVL GLFWTLNWVIi AI/3QCVXiAC3A PASFYWAFHK 480 

PQDIPTFPLI SAFZRTIiRYH TQSLAFOALZ LTLVQZARVI LEYZDHKLRG VQNPVARCXM 540 

OCPKCCbHOi BKFXKFIiNRN AYZMIAIYGX KPCV8AKNAF MLLMRNIVRV WLDXVTDLL 600 

LPFOKLLiWG GVGVLSPFFF S6RIPGLGXD FKSPBLNYYW LPIMT8XLGA YVXASGPP8V 660 

PGNCVDTLFL CFZiBDLERNN GSLDRPYYMS K5LLKILGKK NEAPPDNKKR KK 712 



Seq ZD NO I C422 Protein Sequence 
oU Protein Accession fti MP_057264.1 

1 11 21 31 41 51 

1 I I I I I 

MGSNSGQA6R HZYKSLADDG PPDSVBPPKR PTSRLZMHSM AKFGRBFOrA VEAAYVTPVL 60 
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LSVGLPSSIiY SIVWFLSPIL GFLLQPW6S ASDHCRSRWG RRRPYILTLG VMKLVGMALY 
UNGATWAAL lAMPRRKLVH AISVTMIGW LFDFAADPXO GPIKAYLFDV CSHQDKBKOL 
BYHALPTGFG GALGYLLGAI DWAHLELGRL IX3TEFQVMPP PSALVLTLCF TVRIiCSISEA 
PLTEVAKOIP PQQTPQDPPL SSDGMYEYGS IBKVKNGYW PEZtAMQGAKH XNKABQTRRA 
MTLKSLLRAL VNMPPHYRYL CISHLIGWTA PLSNMLPPTD PMGQIVYRQD PYSAHNSTBP 
LIYERGVEVG CWGPCINSVF SSLYSYPQKV LVSYIGLKGL YPTGYLLPGL GTGPIGI*PPN 
VYSTLVIiCSL P6VMSSTLYT VPFNLXTEYH REBEKBRQQA PGGDPONSVR QKGMDCATLT 
CMVQLAQILV GGGLGPLVNT AGTVWWIT ASAVALIGCC PVALPVRYVD 

Seq ID NO: C423 Protein Sequence 
Protein Accession {I: NP_003264.1 

1 11 21 31 41 51 

t I I I I I 

MBGPGGVGGR GTRGPAAKGV WRGRAEEGPV LGAAER6PMV STGSRRRVFB GPGGGGLRHT 
PGKGTGRQRG AWGPRABDGV RRRTLGMPRQ SRRQVRAPCG PAGSWGARGO RRRDGPSRRR 
RGSATAAARH RVPPAPGGPP OPRAPAGSTR VPARAGGAVE PTGAAAVARL ARPAGGALPT 
AGAQGAGPAR GRSGEGSEWA RRGKGRPGPY QSPLGPAVAE GQEliKDKSRL RYPINGFQAL 
VLTALLVOLO KSAGLPLQAL PEMXiXiPIiAFV ATLTAFIP8L FLYMKAQVAP VSALAPGGNS 
QIPIYDFPLG RRXiNPRXCFF OFXYFCBLRP GLIGHVLINIi ALLMXEAEIiR GSPSLAMWLV 
NGFQLLYVQD ALWHEEAVLT TMDITHDOFG FMLAFODMAW VPPTYSLQAQ PLLHHPQPLG 
LPMASVICLI NATGYYIFRG ANSQKNTFRK NPSDPRVAGb ETISTATGRK LLVSGWWGMV 
RHFNYLGDLI MALAHSLP06 VSHLLPYPYL LYFTALLVHR EARDERSACR STAHPGRSTA 
OVdiTASCPT STEAAPPPQV GKVPTRPPAH PGPGASTHLG LKQLKPTQP 

Seq ID NO: C424 Protein Sequence 
Protein Accession #t NP_0S6S35.1 

1 11 21 31 41 51 

i 1 I I I I 

MGRLLRAARL PPLLSPLLLL LVG6AFLGAC VAGSDBPGPE GLTSTSLliDL LLPTOLBPLD 
SBEPSETMGIi GAGLGAPGSG FFSBENEESR ILQPPQYFWB EEEELNDSSL DLGPTADYVF 
POLTEXAGSI BDTSQAQELP NLPSPIiPKMN LVEPPWHMPP RBEEEEECEE EBREKEEVEK 
QEBBEBEELL FVNGSQEEAX PQVRDFSLTS 8SQTP6ATKS RHEDSGDQAS SGVSVBSSHG 
PSIiLliPSVTP TTVTPGDQDS TSQEAEATVL PAAGL6VBPB APQBASEEAT AGAA6LSGQH 
EEVPALPSFP QTTAPSGAEH PDEDPLGSRT 8ASSPLAPGD HBLTPSSATL GQEDIiNQQLL 
BQQAAEAQSR IPWDSTQVIC KDWSNIiAGKN YIILNMTENI DCBVPRQHRG PQLLALVEEV 
IiPRHOSGHHG ANHISLSKPS EKEQHIiUCrL VGBQGWPTQ DVLSMLGDZR RSLEBIQIQN 
YSTT8S0QAR ASQVRSDYGT LFWLWXGA ICIIIZALGIi LYNCWQRRIiP KLKHVSHGEE 
LRFVEHOCHD NPTLOVASDS QSEMQEKKPS LNGGGALMGP 0SWQAI14GGK RDPBD8DVPB 
EDTHL 

Seq ID MO I C425 Protein Sequence 
Protein Accession #t NP_001188.1 

1 U 21 31 41 51 

I I 1 I I I 

MSBVRPLSRO ILMETLLYGQ LLEPPTMEVL GMTDSEEDLD PMEDFDSLBC MEGSDAIiAIiR 
LACI6DEMDV SLRAPRIiAQI* SEVAMHSI/3L AFIYDQTEDI RDVIiRSFMDG FTTUOBMIMR 
FNRSPNPOSN VSCBQVLLAL IiLLLAIiLLPL LSGGUUVZiUC 

Seq ID NOt C426 Protein Sequence 
Protein Accession Hit AAP76225.1 

1 11 21 31 41 51 

I 1 I I I I 

MATPLPFPSP RHLRLLRLLL S0LVXX3AALR GAAA6HPDVA ACPGSLDCAL KRRARCFPGA 
HAOGPOiQPF QBDQQGLCVP RMRRPPGGGR PQPRLEDBZD FLAQELARKE SGQSTPPLPK 
DRQRLPEPAT LQFSARGQGli ELGLPSTPGT PTPTPSTSbS 6PV68DFVKM 8FUBPRGGQ6 
DGtiALVLXLA FCVAGAAALS VASLCffCRLQ REIRLTQKAD YATAXAPaSP AAPRISPGDQ 
RIAQSAEHYH YQHQRQQHLC LERHKEPPKB LDTASSDEQf BD6DFTVYEC PGLAPTGEME 
VRNPIfDHAA LSAPLPAPSS PPALP 

Seq ID NO; C427 Protein Sequence 
Protein Accession 9t NF_004436.1 

1 11 21 31 41 51 

I I t I I I 

MVCSLWVLLL VSSVLALBEV LLDTT6BT8B IGKLTYPPGO WDEVSVliDDQ RRLTRTFBAC 60 

HVAGAPPGT6 QDNWLQTKFV ERRGAQRAHI RIiHFSVRACS SLGVSGGTCR BTFTLYYRQA 120 

BBPDSPDSVS SWHUCRWTKV DTIAADB8FP SSSS88S88S 8AAWAVGPKG A6QRAGLQLN 160 

VKBR8FGPLT QRGFYVAPQD TQACLALVAV RLPSYTCPAV LRSFASFPET QASGAGQASL 240 

VAAVOTCVAH AEPEED6VGG QAGGSPFRLH CNGE6XNMVA VGGCRGQPGY QPARGDKAGQ 300 

ACPRGLYK6S AGNAPCSPCP ARSHAPNPAA PVCPCIfGFY RASSDPPBAP CIGPPSAPQB 360 

LHPEVQGSAL MLHWRLPREL GGRGDXiLFNV VOCECBGRQB PASGGGGTCH RCRDEVHFDP 420 

RQRGLTBSRV LVGGLRAHVP YILBVQAVNG VSBLSPDPPQ AAAINVSTSH EVPSAVPVAm 480 

QVSRASNSIT VSWPQPDQTN GNILDYQLRY YDQAEDESHS FTLTSBTNTA TVTQLSPGHI 540 

YGFQVRARTA AGHGPYGGKV YFQTLPQGEL SSQLPERLSL VIGSIIX3ALA FLLLAAITVL 600 

AWFQRKRRG TGYTEQLQQY 8SPGLGVKYY IDPSTYEDPC QAIRELAREV DPAYIKIEEV 660 

lOTGSFGBVR QGRIiQPRGRR EQTVAIQALW AGQAESI.QMT FI/3RAAVLGQ FQHPNILRLE 720 

GWTKSRPLM VLTEFMELGP U3SFLRQREG QF8SLQLVAM QRGVAAAMQY LSSFAFVHRS 760 

LSAHSVLVNS HLVCKVARLO KSPQGPSCLL RHAAPEVIAH GiOiTTSSDVW SFGILMWEVM 640 

SYGBRPYHDM 8EQBVLNAIE Q6FRLPPPPG CPPGLHLtML DTWQKDRARR PHFDQLVAAF 900 

DXMXRKFDTIi QAG6DPGERP GQALLTPVAb DFPCSiDSPQA WLSAIOLECY QDNF8KFGLC 960 
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TPSDVAQLSL EDLPAU3ITL AGHQKKLLHB IQZiIiQQKLRQ QGSVEV 1006 

Seq ID NO: C428 Protein Sequence 
^ Protein Accession 8: XP_043340,2 

1 11 21 31 41 SI 

I I I I I 1 

KPFDPRRFDI YRKVPKDLTQ PTYTGAIISI CCCLPILFLF LSBLTGFXTT BWNBLYVDD 60 

PDXDS6GKID VSLNISLPNL HCGLVGLDIQ DEMGRHEVGH IDNSMKIPLN NGA6CRFEGQ 120 

10 FSINKVPGNP HVSTHSATAQ PQNPDMTHVI KKLSFGDTLQ VQinHGAFNA LGGADRLTSN 180 

PLASKDYILX IVPTVYEDKS GKQRYSYQYT VANXEYVAYS KT6RIIPAIW FRYDLSPITV 240 

KYTBRRQPLY RFZTTXCAXZ GOTFTVAOIL DSCIFTASEA HKKIQLGXMK 290 

Seq ID NO I C429 Protein Sequence 
15 Protein Accession lit NP_002142.1 

1 11 21 31 41 51 

I i 1 I I I 

HAQKEGGRTV PCXrSRPXVAA LTAGTLLUiT AZOAASNAIV AVLLRSDOEP ItYPVQfVSSAD 60 

20 ARIiHVFDKTE GTHRL1.C8SR SNAXTVAGLSC BEHGFLRALT HSBLDVRTAG ANQTSaPFCV 120 

DBGRLPHTQR IiLEVISVOX: PRGRPLAAIC QDCX3RRKLPV DRIVQGRDTS LGRWPWQVSL 180 

RYDGAHLCGG SLLSGDWVLT AAHCFPBRNR VLSRWRVFAO AVAQASPHGL QLGVQAWYH 240 

GGYLPFRDPN SEENSMDIAIi VHLSSPLPLT BYIQFVCLPA AGQALVDGKI CTVTGWGNTQ 300 

YYGQQA6VLQ EARVPIZSND VCaiGADFYGN QIKPKMFCAG YPE66IDACQ 6DSGGPFVCE 360 

25 DSISRTPRHR LCGIVSHGTG CAXiAQKPGVY TXVSDFREWI FQAIXTHSBA SGMVTQL 417 

Seq ID NO: C430 Protein Sequence 
Protein Accession $t BAA92562.1 

30 1 11 21 31 41 51 

) i I I I 1 

METTVLSGIN FEYKGMTGWE VAGDHIYTAA GASDNDFMIL TLWPGPRPP QSVMADTENK 60 

BVARITPVFE TLCSVNCBLY PMVGVNSRTN TPVETWKGSK GKQSYTYXIB BNTTTSFTWA 120 

^ FQRTTFHBA8 RKYTNDVAKX YSXNVTHVMN GVASYCRPGA LEASDVGSSC TSCPAOYYXD 160 

35 RDSGTCKSCP PNTXLKAHQP YGVQACVFCG PGTKNNKXHS LCYNSCTPSR NTPTRTFNYN 240 

FSALANTVTL AGGPSPTSKG LKYFHHFTIjS LOGNQGRKMS VCTDNVTDLR IPEGESGFSK 300 

SITAYVCX)AV IIPPEVTGYK AGVSSQPVSL ADRLIGVTTD MTLDGITSPA ELFHLESLGI 360 

PDVIFFYRSN DVTQSCSSGR STTXRVRCSP QKTVPGSLLL PGTCSDGTCD GOIFHFLWES 420 

. _ AAACPIiCSVA DYHAXVSSCV AGXQKTTYVW RfiPRLCSGGX SLPBQRVTIC KTXDPWLKVO 480 

40 ISAOTCTAXIi LTVLTCYFHK KMOKLEYKYS KXiVMNATLKD CDIiPAADSCA XMEGEDVBDD 540 

LIPTSKKSLF GKIKSFTSKQ PAPVTISLSB DS 572 



45 



70 



Seq XD NO: C431 Protein Sequence 
Protein Accession ffi NP_004BS5.1 



1 11 21 31 41 51 

I I I I 1 I 

MPGQELRTVN GSQMUjVt^IiV LSWX^HGGAL SliAEASRASF PGP8ELHSED SRFRSLRXRY 60 

EDLLTRLRAN QSHBDSNTDL VPAPAVRILT PSVRLQSGGH UHXiRISRAAL PEGLPBASRL 120 

50 BRALFRLSPT ASRSHDVTRP LRRQLSLARP QAPALHZjRXiS PPFSQSDQLL ASSSSARFQL 180 

ELBLRPQAAR GRRRARARN8 TOCPLOPGRC CRIiHTVRASL BDLGWADHVL SPREVQVTMC 240 

IGACPSQFRA AMHHAQXICXS UQUiKFDTBP APCCVPASYN PMVLIQKIDT GV6I4}TYDDIi 300 

lAKDCHCI 308 

55 Seq ID MOt C433 Protein Sequence 
Protein Accession «i NP_443 090.1 

1 11 21 31 41 51 

60 MEDPSGAREP RARPRERDP6 RRPHPDQGRT HDRPRDRPGD PRRXRSSDGN RRRD6DR0PK 60 

RDQERDCaniD R29RDRBRERE REROPDRGFR R0IKRDAGFR AGEHGVHEKP RQSRTRDGAR 120 

GLTWDAAAPP GPAPWEAPEP PQPQRKGDPG RRRPESBPPS ERYLPSTPRP GREBVEYYQS 180 

EABGLLECKK CKYLCTGRAC CQMLBVLLNL LXLACSSVSY SSTGGYTOXT SLGOXYYYQF 240 

GGAYSGFDGA DGEKAQQZW QFYQIiKLFMV TVAHAC86AL TALCCLFVAM 6VLRVPHHCP 300 

65 LLIiVTEGLLD MLIAGGYXFA LYFYFHYIiSA AYGSFVCECBR QALYQSKGYS GFGCSFBOAD 360 

IGAGXFAALG IWFAIiGAVIi AXKOYKKVRK LKEKPASMFB F 401 



Seq XD NO: C43S Protein Sequence 
Protein Accession #t Eos sequence 



1 11 21 31 41 51 

t I I I I ! 

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHXGQ 60 

GKTLLLTSSA TVYSIHISEG GKLVIKDHDE PIVLRTRHIL XDNGGBIiHAG SALCPFQmiF 120 

75 TIILYGRADE OIQPDPYYGIi KYIGVGKGGA LELHGQKKL8 WTFLNKTLHP GGMAEGOYPP 180 

BRSMGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKBSERLV QYLNAVPDGR ILSVAVNDEG 240 

6RNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKOIPSS SVEDHIEYHG HRGSAAARVP 300 

KLFQTEHGBY FNVSI,SSEWV QDVEWTEWFD HDKVSQTKGG BKISDLWKAH PGKXCNRPXD 360 

IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYRVR FLOGKPVRPK LTVTIDTOVN 420 

80 STILNLEDNV QSWKPGDTLV lASTDYSMYQ AEBFQVLPC31 SCAPNQVKVA GKPMYLHIGB 480 

BIDGVDMRAB VGLLSRNIIV MGEMEDKCYP YRNHICNFFD PDTFGGHIKF ALGFKAAHLE 540 

GTBLKHMGQQ LVGQYPXHFH LAGDVDERGG YDPPTYIRDL 8IHHTFSRCV TVHGSNQLLI 600 

XDWGYNSLG HCFFTEDGPE ERNTFDKCLO LLVKSGTLLP SOROSKMCKM ITEDSYPGYX 660 

FXF8QDCNAV STFWMANPNN NLXNCAAAQ8 BBTGFHFXFH KVPTGP8VGM YSPOYSEHXP 720 
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LGKPYNNRAH SNYRAGMIID NGVKTTBASA KDKRPFLSII SARYSPHQDA DPLKPRBPAI 780 
IRHPIAYIQIQ DHCZAHLROGD VWLDSCHFRO BAQEGFLLTG MKAGQILIXX3 DEAA8GHAQG 840 
FSPPCRCLXiK LVTTGSPPAH VSLAKS 666 



1380 



wo 03/042661 



PCTAJS02/36810 



It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
5 sequences of accession numbers, and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication, accession number, or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 1 . A method for determining the presence or absence of a pathological cell in a 

2 patient, said method comprising detecting a nucleic acid comprising a sequence at least 80% 

3 identical to a sequence as described in Tables 2A-80 in a biological sample from said patient, 

4 thereby determining the presence or absence of said pathological cell. 

1 2. The method of Claim 1 , wherein: 

2 a) said pathology is described in Table 1, including a cancer; and/or 

3 b) said biological sample comprises isolated nucleic acids. 

1 3. The method of Claim 1, wherein said biological sample is tissue from an organ 

2 which is affected by said pathology of Table 1 , including a cancer. 

1 4. The method of Claim 2, wherein said nucleic acids are mRNA 

1 5. The method of Claim 2: 

2 a) further comprising a step of amplifying nucleic acids before said step of detecting 

3 said nucleic acid; or 

4 b) where said detecting is of a protein encoded by said nucleic acid. 

1 6. The method of Claim 1 , wherein said nucleic acid comprises a sequence as 

2 described in Tables 2A-80. 

1 7. The method of Claim 2, wherein: 

2 a) said detecting step is carried out by: 

3 i) using a labeled nucleic acid probe; 

4 ii) utilizing a biochip comprising a sequence at least 80% identical to a sequrace 

5 as described in Tables 2A-80; or 

6 iii) detecting a polypeptide encoded by said nucleic acid; or 

7 b) said patient is: 

8 i) undergoing a therapeutic regimen to treat said pathology of Table 1 ; or 

9 ii) is suspected of having said pathology or cancer. 

1 8. An isolated nucleic acid molecule comprising a sequence as described in 

2 Tables 2A-80. 
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1 9. The nucleic acid molecule of Claim 8, which is labeled. 

1 1 0. An expression vector comprising the nucleic acid of Claim 8, 

1 11 . A host cell comprising the expression vector of Claim 1 0. 

1 12. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 comprising a sequence as described in Tables 2A-80. 

1 13. An antibody that specifically binds a polypeptide of Claim 12. 

1 14. The antibody of Claim 13: 

2 a) conjugated to an effector component; 

3 b) conjugated to a detectable label, including a fluorescent label, a radioisotope, or a 

4 cytotoxic chemical; 

5 c) which is an antibody fragment; or 

6 d) which is a humanized antibody. 

1 15. A method for specifically targeting a compound to a pathological cell in a 

2 patient, said method comprising administering to said patient an antibody of Claim 13, 

3 thereby providing said targetting. 

1 16. A method for detennining the presence or absence of a pathological cell in a 

2 patient, said method comprising contactmg a biological sample with an antibody of Claim 13. 

1 17. The method ofClaim 16, wherein: 

2 a) said antibody is conjugated to: 

3 i) an effector component; or 

4 ii) a fluorescent label; or 

5 b) said biological sample is a blood, serum, urine, or stool sample. 

1 1 8. A method for identifying a compound that modulates a pafliology-associated 

2 polypeptide, said method comprising the steps of: 
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3 a) contacting said compound with a pathology-associated polypeptide, said 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 

5 at least 80% identical to a sequence as described in Tables 2A-80; and 

6 b) determining the functional efifect of said compound upon said polypeptide. 

1 1 9. A drug screening assay comprising the steps of: 

2 a) administering a test compound to a manunal having a pathology of Table 1 or a 

3 cell isolated therefrom; and 

4 b) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as described in 

6 Tables 2A-'80 in a treated cell or manunal with the level of gene expression of said 

7 polynucleotide in a control cell or manomal, wherein a test compound that 

8 modulates said level of expression of the polynucleotide is a candidate for the 

9 treatment of said pathology. 
10 
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